| Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
|---|---|---|---|---|---|---|---|
| NZ_AP021884 | Sulfuriferula plumbiphila strain Gro7 | 4 | csa3,cas3,cas5,cas6e,cas2,DEDDh,DinG,WYL,cas8c,cas7,cas4,cas1 | 0 | 4 | 7 | 0 |

| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
|---|---|---|---|---|---|---|
| NZ_AP021884_1 | 1148298-1148383 | Unclear |
I-E
Consensus repeat of NZ_AP021884_1
|
1 spacer
spacers of NZ_AP021884_1
>1.1|1148323|36|NZ_AP021884|CRISPRCasFinder ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG |
cas2,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_1
The CRISPR arrays of NZ_AP021884_1 >merge|NZ_AP021884|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGAACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAGGTGTTCCCCGCACCCGCGGGATGAG >NZ_AP021884|1|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGA ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG GTGTTCCCCGCACCCGCGGGATGAG
>NZ_AP021884.1|WP_147070477.1|1147906_1148203_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MLVIVLENVPPRLRGRLAIWLLEIRAGVYVGNYSDKVRDHIWHQVEVGIGEGNAVMAWRTSSEAGFDFVTLGKNRRIPVELDGAKLVSFLPQTDTDAL >NZ_AP021884.1|WP_147070479.1|1146692_1146992_+|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MRYQVRFASAAADDLQRLFDFLAEQDLAAAERARAVISQAIEVLQIFPFSCRKASPENPFLRELVISFGSYGYVALFEVEDAESVTVLAVRHQREDDYH >NZ_AP021884.1|WP_147070480.1|1146414_1146696_+|prevent-host-death-protein MKNATLPPLRVESELRAAAESVLQEGETLSGFVLEAVRLNIARREAQREFITRGLVAREEAKLSGHYVSSDEMLKRLDASLAKARAKQAVGNR >NZ_AP021884.1|WP_147070482.1|1145722_1146346_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MFLSRVEIPWDAARNPYNLHRQLWHLFPGEDRESRSSDDETRQGFLFRIEENATGRPARLLVQSRRAPTRANGLLLVGTREITPCPSAGQRLAFVLTANPVKTIVDAQRDAKPGKQSEKCRVPFIKEEEQRQWLLRKLGEAGEVEAVSVLPHAPVYFHKGSRAGKLVTATFEGVLRVRDPDRLAALLANGIGPAKAFGCGLLLVRRI >NZ_AP021884.1|WP_147070484.1|1145195_1145732_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MFSREWPLLAESDIKPKTGKLGWTNSPAFSSCTRSWTARRAPITRTDLKARLECSSASLTSAHRRGAYLFDAAFTVAVGSKPGASVTLTQLAAALRQPLYTPSLGRRSCPLARPLLEGELEAEDALAALAKTAPVDGLVYSETQQSDQPLRLRDVPLHGHKRQFGTRLVYLHKDPTCS >NZ_AP021884.1|WP_147070486.1|1142295_1145139_-|aconitate-hydratase-AcnA 
MSTAHNLFNTLSEFTLGNGTPGRFYSLSALEAVGIGKISRLPVSIRIVLEAVLRNCDGRKITEQHIRELANWQPNGPRTEEIPFVVARILLQDFTGVPLLADLAAMRSAAAQAGKNPKVIEPLVPVDMVVDHSVQVDVFNQPDALQKNMELEFIRNRERYQFLKWGMQAFDTFKVVPPGIGIVHQVNLEYLARGVMEKDGVHYPDTLVGTDSHTTMINGLGIVAWGVGGIEAEAGMLGQPVYFLTPDVVGVHLKGQIREGVTATDVVLTVTEMLRKAKVVGKFVEFFGAGAAALSLPDRATIANMAPEYGATMGFFPVDEASCAYYAATGRSAEQVDTIRNYFMAQGLFGIPQAGDCDYSQELEIDLGSVVPSVAGPRRPQDRIELGHVKQAFAGLFAKPVAEGGYGKAAATLAQRVALAPAPAGTDIAGGGVQNSDTLPAGGTDPAVVIEREMVDNRPTPDHLASNAVYTAAQSGTLGHGDVVIAAITSCTNTSNPGVMLAAGLLAKKALEKGLTVPAHVKTSLGPGSRVVTEYLKAAGLLDALGEMGFKLVGYGCTTCIGNSGPLPAAIESAITGNDLIAASVLSGNRNFEARVHQNVKANFLMSPPLVVAYAIAGSMNTDLASEPLGTGRDGAPVYLKDIWPSLDEVAAVMATATNPDTYRKLYADFSADNPLWAAVPAPAGAVYDWDGASTYIRQPPFFDGAAGDSGVIRGARALAVFGDSVTTDHISPAGSIKPASPAGKFLLEHGVDRADFNSYGARRGNHEVMMRGTFANVRIRNLMLPGSEGGVTRHQPDGAEMAIYDAAMQYQAAGTPLMIFAGEEYGTGSSRDWAAKGTRLLGVKAVVAKSFERIHRANLVGMGVLPCQFRDGMGADSLKLDGSETFDLLGLEHGITPQQDITLVIHRADGSADAVAVKLRIDTPIEVDYYQSGGILPFVLAQLLAD >NZ_AP021884.1|WP_147070488.1|1141914_1142286_-|DUF202-domain-containing-protein MSDLNDPRVFFAAERTLLAWNRTCLTLMAFGFVVERFGLFLHMLAPQTPQHLERGISFWVGLGFILLGSLMAVLAVIQYRRVLRTLKPVEIPEGYWVNMAALSTLLLAVLGIVLSAYLTMGLK >NZ_AP021884.1|WP_147070490.1|1139808_1141854_-|methionine--tRNA-ligase MTRKILVTSALPYANGAIHLGHLVEYIQTDIWVRFQKMHGHECYYVCADDTHGTPIMLRAEKEGITPEQLIARVHGEHLRDFTGFHVGFDSYHSTNSGENRELSGTVYLKLREAGLIEQKTIEQYYDPVREMFLPDRFIKGQCPKCGAQDQYGDGCEVCGATYTPTDLINPVSAISGSTPVRRESEHYFFRLGACEAFLREWTRSGALQQEAANKLDEWFAAGLQNWDISRDAPYFGFEIPDAPGKYFYVWLDAPIGYMASFKKLAAEKNLDFDAWWQNDSGAELYHFIGKDILYFHALFWPAMLKNAGYRTPSGVFAHGFLTVNGAKMSKSRGTFITAESYLASGMDPEWLRYYYAAKINGSMEDLDLNLADFIARVNSDLVGKYVNIASRTAGFIARRFDGKLAARLPTSELLAEVQHAATLIGECYETREYGKALREIMRLTDLANQYVNDNKPWELAKQEGSEALLHEVCSVSVNLFRLLTLYLKPVLPRLATEVETFLNIAALAWVDAGTLLTSHSINAYSHLMTRVEQKQVDALVAANQQSLAASADAHSPARHAEAQNHVIAPIADTITADDFARIDLRVAKIVNAEHVEGADKLIRLTLDIGEGKTRNVFAGIKSAYDPEKLIGRMTVMVANLAPRKMKFGVSEGMVLAASGETAGLYILSPDDGAVPGMRVK >NZ_AP021884.1|WP_147070492.1|1138856_1139744_+|LysR-family-transcriptional-regulator 
MDIEQARTFLHVVAIGNFLGAAEKLHVTQSTVSARIQNLERKLGAKLFSRGKQGAALTAAGQRFVRHAQTLVRTADIAKQDVGLPDGYSGGLTVSGRIALWEGFLSRWVAWMRQAAPAISLRLEIGFEQDIMHGLVQNTLDIGLMYTPEARPGLGLERLFDETLVLVTTDRMRPWPDPGYVHVDWGTEFFHQFSLNFPDHPPPALSANVGWLGIQQLLTSGGSAYFPLRMVRTLLAKKRLHRVPGTPHFSVTAHMVYPLSRNDDFLQQALAGLRLLGREERRGQISMDTDNSPNP >NZ_AP021884.1|WP_147070494.1|1137963_1138608_-|maleylacetoacetate-isomerase MQLFSFFSSSTAFRVRIALALKGADYEYQAVNLRAGEQHQQAFLDRNPSGNVPALVDGDFNLGQSLAILDYLDSRYPEPRLIPADTIQRARVLELVNVIACDIHPVNNLRVQLYLKNILGVTEAQKNAWYRHWVAQGLDVVERLLARQEDTPYCFGTHPTLADCCLAPQVWSAARAGCDIAAYPRIDRIYRHCMAQPAFIQAAPEQQADAPQGG >NZ_AP021884.1|WP_147070475.1|1148555_1149455_+|enoyl-CoA-hydratase/isomerase-family-protein MNAVVEFQQFSNASLEQVRIRFDEEYGVMWSFMRPEPRPCFTRTTLQDLLQHHTYLESMKGRVVSNGNFQQTNYLILASDLQGVFNLGGDLAAFGEAIRAQTRKELLSYAKLCIDNVWTFYNLQAPITTISMVQGQAMGGGFEAALSAHVMIAEKSALMGLPEVLFNLFPGMGALSFLSRKIGMRAAEAMVRSGRVYTATELHEMGVVDVLAEDGQGEKTLYDWIRKNHRSLNSFQAIQRARQRVNPLTVEELYEITEIWVDAALRLSERDLRIMERLVRAQNRKVTEPEPVVAEQASA >NZ_AP021884.1|WP_147070472.1|1149459_1151595_-|response-regulator MLDKMKIWFTDRVAKCDRVELEQSLIRLGIGLAILVYLLYRYLTHTTLSHNDIVAFSILSVFLFLTLVLIGSILYSSKPSVVRRLAGAWVDQGGTTLFMAFTGEVGVMVVGVYLWVIFGNGFRFGRKYLIHAQVLSIVGFAITTQVNPYWDEHEAISYSVMLMLLALPIYVSALIRRMNEARQKAEEANAAKTRFVANMSHEIRTPLSGIIGISTLLKATPLNSEQQDLLGTLNSSSRLLVSLLNNVLDFAKIEDGKLAIEHTDFSVNSLLEETVKIFRSQAEAKSIRLDTHIAAAAGTLRGDPHRLQQVLANLVGNAVKFTERGSVTLSLSILGENEHHRNMRFEVADTGVGIPTSAQGKIFESFTQADISTTRRFGGSGLGLTITRHLVEAMGGRLSFESAEGLGSRFWFDLPLEKAVQAQPGSAEIVPLPATRDAGLENTLRILVCEDDATNQKILLRLLELAGHHVSLSANGEELLDQLEQSSFDLVIADLNMAGLSGTDALKLYRFTRADDTRTRFILFTADATLSARQAAKEAGFDAFLSKPVDASTLFGTIANLLGMPSASAEHWLNTVMGGSRSSPPASAETRAVLDAATLRELEILGAGDALFVQRLLRNYLRDSGELLDRIEHAVQQKQYGALRDHCHALKGNSLSIGARGVFGRAETIDRAGPGELRFRGSAMVGLLRTDYAAARAAIEDYLSRRQTAAR >NZ_AP021884.1|WP_147070471.1|1151614_1152046_-|response-regulator MSVRDIRSAPTYRQTVLIIDDQPMVLAIHTAVLKSLSMDLRIVSMTDPKAALEWLRQKPADLIVTDYRMHQMDGIHFVNAVRDSSIEPMRPIIVVTALKDEKIHQQLLAAGVSACLIKPARAAQLSKIARTLLEQSRRQYTTQ 
>NZ_AP021884.1|WP_147070469.1|1152139_1153207_-|response-regulator MTNFNLPDTSAVLILDDQATSRTILAQVVRSIGSGIRVQEETTPSAALAWAAAHPADLVLADYLMPDMNGVEFIGRLRQLPGYQHVPVVMVTIKQDMETRYAALDAGMTDFLTKPVDMRECLSRCRNLLTLRQQQLALEDKSRVLEDMVGQATEEIRCREKDTLMRLARAGEYRDTDTARHLLRMSRYSRVLADAIGLPEDEAELIELAAPLHDIGKIGIPDSILRKNGPLSDEELAIMRQHPKIGHDILEDSPSKYLRLGGEIALAHHERYDGSGYPFGTTGQDIPLSARIVAIADVFDALTSVRPYKSAWSIKSAMQYLLKESGRHFDPALVKAMLTLEASVEKIQEEHAEPG >NZ_AP021884.1|WP_147070467.1|1153469_1154669_+|malate-dehydrogenase MPTLKQQALDYHQFPKPGKLSVESSKPCATQHELSLAYSPGVAEPVRAIGADPELAYRYTNKGNLVAVITDGTAILGLGNLGPLAAKPVMEGKGVLFKRFANIDVFDIEVNAPSVQAFIDTVVNIAPTFGGINLEDIAAPHCFEIEKALSERLDIPVFHDDQHGTAVIICAGLINALHVQGKKLADARIVCLGAGAAGNASLRLLLAMGADKSRLLVVDKVGVLHTGMIDLPPHHAFFAADTDARTLADAMQGADAFIGVSAANLVTPAMIKSMADKPVVFALANPDPEIAPHDVHAARDDAIIATGRSDYPNQVNNILGFPFIFRGALDARAKRITQKMLIAAVHALMDLAREPVPADVLAIYNLTELAFGRDYILPKPFDARLIERIPPAVMKAAKE >NZ_AP021884.1|WP_147070465.1|1154672_1155050_+|succinate-dehydrogenase,-cytochrome-b556-subunit MRHPSRPVYLNIFKIHLPLPGWMSILQRMSGAVLFLVTPLLLYLLQTSFDADGYARLREWLHIPVVKALSTLLLWGYLLHLLGGLRFLLLDIHVGTALATARKLSAATLLASALLTLVIAGIGLW >NZ_AP021884.1|WP_147070463.1|1155043_1155373_+|succinate-dehydrogenase,-hydrophobic-membrane-anchor-protein MVGGALSAWLVQRVSALLLAAYALFFPVWVALHWPLDFAVWRGLFAPLPMRIVTLLFVVALALHAWVGMRDIFMDYVQPLGLRLALHVGALLWLATCVVWAGAVLWSLP >NZ_AP021884.1|WP_147070461.1|1155369_1157133_+|succinate-dehydrogenase-flavoprotein-subunit 
MMPVKRKFDAVIVGGGGAGLRAALQLSGSGLQVAVVSKVFPTRSHTVSAQGGITAALGNVTPDNWHWHMYDTVKGSDYLGDQDAIEFMCRHAAEAVIELEHMGLPFSRLDNGRIYQRAFGGQSMNYGGEQATRTCAAADRTGHALLHTLHQQNLKAHTHFFDEYFALDLLRDADGYVLGVTALCIETGAPLVIEARATLLATGGAGRIFRYSTNAHINTGDGLGMVLRAGLALQDMEFWQFHPTGLPGSGSLITEGVRGEGGYLVNNQGERFMERYAPHAKDLAGRDVVARALALEIHAGRGCGPHGDTIHLKLDHLGAALIKDKLPGIRELALRFAGVDPIDAPIPVVPTAHYMMGGIPTDLHGQVVMPARFGPEEPVPGLYAVGECACVSVHGANRLGGNSLLDLVVFGRAAGNHIIETLRDNPFPRLLPESAAEAALARLARWNKTGAGESVAELRLALQTLMQKHCGVFRTETLMGEGIAALDILQARLDNARLADHSQVFNTARIEALELENLFAVARATLVSAHARTESRGAHAREDYPERDDGHWLKHTLYTRENDQIDTKPVRLKPLTVEPFLPKERIY >NZ_AP021884.1|WP_147070460.1|1157132_1157831_+|succinate-dehydrogenase-iron-sulfur-subunit MRFSIYRYDPEHDTKPHMQAYDVDIEPAGNMLLDALLRIKDTLDSTLTLRRSCREGVCGSDGMNINGSNGLACITPLADLRQPVEVRPLPGLPVIRDLVVDMTPFNQQYRSVEPWLNNADPAPEIERLQSPEQRAQLDGLVECIQCGCCSSACPSFWWNPDKFVGPAGLLAAYRFIADSRDQGANQRLDNLQDPYRLFRCHGIMNCVSVCPKGLNPTAAIGKIKTLLVKRGA >NZ_AP021884.1|WP_161984192.1|1157869_1158085_+|succinate-dehydrogenase-assembly-factor-2 MLELDILLLDFLEQQYPVLPSSQQIAFGALLELGDSELWDMIQTGQSAAQPEQAKIIEWLRTGKQKNESTD |

| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
|---|---|---|---|---|---|---|
| NZ_AP021884_2 | 1831685-1831793 | Orphan |
NA
Consensus repeat of NZ_AP021884_2
|
1 spacer
spacers of NZ_AP021884_2
>2.1|1831723|33|NZ_AP021884|CRISPRCasFinder TTGCGTTGGATACCTCATCCTCATCATTGCGCT |
CRISPR arrays and Neighbor proteins around NZ_AP021884_2
The CRISPR arrays of NZ_AP021884_2 >merge|NZ_AP021884|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGACTTGCGTTGGATACCTCATCCTCATCATTGCGCTGGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC >NZ_AP021884|2|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC TTGCGTTGGATACCTCATCCTCATCATTGCGCT GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC
>NZ_AP021884.1|WP_147074724.1|1830036_1830279_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLALPARIVEMRKQDIGIVDLGGVRKEVSLALVDDLQVDDYVIVHVGYALSKLDPEEAERTLRIFAEMESMPGNVGVGA >NZ_AP021884.1|WP_147074723.1|1828900_1830040_-|hydrogenase-formation-protein-HypD MKYVDEFRDGELANGLASTIARAADTGRNYSFMEFCGGHTHAISRYGVTDLLPANIQMIHGPGCPVCVLPIGRIDLAIGLALDQGVILCTYGDTLRVPASDGLSLMKAKARGGDVRMIYSTADVLAIARDNPDRDVVFLAIGFETTTPPTALLIEQAKNEGIGNLSVLCNHVLTPSAITHILESPEVREYGTLPLDGFIGPSHVSTVIGTQPYEHFAREYRKPVVISGFEPLDVMQGILMLVRQVNEGRAEVENEFFRAVTRGGNRKAQTLVAKIFELRRTFEWRGLGEVPYSALQIRSEYAAFDAEQRYGLRYAPVADNKACECGAILRGVKKPTDCKIFGTVCTPETPMGSCMVSSEGACAAHYTYGRFKDVEIVAA >NZ_AP021884.1|WP_147074722.1|1827848_1828904_-|hydrogenase-expression/formation-protein-HypE MSTVKPGYTRPLDVRNGRIDLSHGGGGRAMAQLIEELFAAAFDNEYLAQGNDGAVLAMPSAGGRLVMATDAHVVSPLFFPGGDIGCLSVHGTVNDVAVMGARPLWLAASFVLEEGFPLSDLKRIVESMANAAKSAGVSVVTGDTKVVERGKGDGVFITTTGVGVLPKGLDLSGNKATPGDVILLSGTIGDHGMAIMSKRENLAFDAPIESDTAALHGLVADMLASGSGIRVLRDPTRGGLATTLNEIAKQSGVGMQLDESSIPVRPVVDAACEFLGLDPLYIANEGKLVAICAPEDAGGLLAVMRAHPLGRESAIIGTVHADPHHFVQMKTRFGGRRNVDWLSGEQLPRIC >NZ_AP021884.1|WP_147074721.1|1826161_1827847_-|hydrogenase-maturation-protein MRILFLTHSFNSLTQRLYVALTELGHEVSVEFDIADSVTEEAVALYRPDIILAPFLKRAIPASVWRHHTCLVVHPGIVGDRGPSALDWAVQNAETEWGVTVLQANAVMDGGDIWANEIFPMRLAKKSSLYRNEVTEAATRAVTTAIERYAQRDFVPCPLEKWSNVAGQERPVMWQEDRRINWLRDDTQTILRRIHAADGFPGVRDSLFDHACFLFDAHAAPDYSGAPGTILGWQGTSLVRATVDGAIRIGHVRRPESAHPFKLPALVAFAAEQASIPVLCEADGESIRYEEQDGVGYLYFDFYNGAMSTAQCRELLAAYRQACSRPTRVIVLMGGDDFWSNGIHLNLIEASEHPAEESWENIQAMDDLAEAIITATSHITVSALANNAGAGGAFLALAADHVWARPSVLLNLHYKNMGNLYGSEFWTYTLPRRVGLEKTSRIVENRLPMSARQAARLGIVDACFGTDAAMFRREVKQRATAISRSPDYDVLRKTKTEARDRDESEKPLLRYRESELSEMHRNFFGFDPSYHYARRYFVHKTLPAWTPRHLCKHRGMVQGNH >NZ_AP021884.1|WP_147074720.1|1824193_1825636_-|sigma-54-interacting-transcriptional-regulator 
MSLPTVLIVDDEIRSLEALRRTLEEDFTVFTASNVDAALEILRQEFIQIIVCDQRMPVQSGVTFLKHVRADWPDVVRIMLSGYTDTEDIIAGINEAGIFQYLLKPWQPEQLMLVLRSAADVYRLQLENQRLSLELRDSPALLAERVANKRQHVREKFSLDRVARAPDSPLNATCEMIDRIAPYDISVLITGESGTGKELLAHALHYRSGRAAQAFVTQNCGALPDALLEAELFGYKRGAFTGAYSDRVGLFQQADGGTIFLDEIGETTPSFQVKLLRVLQEGEIRPLGSPRSVQVNVRVIAATNRDLEEEVRAGRLRQDLYYRIANLTMHLPPLRERPMDIPLIAEGLLQRAMRQLNRKVRGFTPETLDCFKAYRWPGNVRELQNEILRILALTDSEWLEARLLSPKVLRAAMEESEEQQLDLLAGLDGSLKDRMEQLEARLIRETLIRHRWNKTHAAQELGLSRVGLRSKLVRYGMDKT >NZ_AP021884.1|WP_147074751.1|1823172_1824168_-|HupU-protein MNLIWLQSGGCGGCTMSLLSADVRDLFGMLKDAGINIVWHPGLSEQTGSEAIEVLEACASGDLPLDILCVEGSLLRGPNGTGRFHVLSGTGKPMIEWVRQLAEKAQYTIAVGTCATYGGVTAGGCNPTDACGMQYDGASRGGLLGVDYLSQSGLPVINIAGCPTHPGWVLETLLALAMDSFTQADLDELGRPRFYADHLVHHGCARNEYYEFKASAEKPSDQGCMMENMGCKGTQAHADCNIRPWNGGGSCTDGGYACIGCTEPGFEEPGHPFTQTPKIAGIPIGLPTDMPKAWFVALATLSKSATPKRVKENATSDHLVIVPGIRKTGVK >NZ_AP021884.1|WP_147074719.1|1821718_1823176_-|nickel-dependent-hydrogenase-large-subunit MSRLVVGPFNRVEGDLEVTLDISGGRVDRAYVDSTLYRGFEQILRGKDPMDALVFVPRICGICSVTQSVAAANALRNAMGISIPRNGQLATNLILANENLTDHFTHFYLFFMPDFARDGYSGRPWHGMAEQRFKAVTGSAAGDALPARAAFLHMMGVLAGKWPHTLTLQPGGSSRAVSSTEKIRLYAMLREFRAYLEKIMFGDKLENIVKLDSMRALEAWRDARPPDASDFRLFLEVARDLELHRIGRATDIFLSYGSYEMAGEYLFSPGVWDASKGTLSAIDPSDIVEDLSHSRMTGERDARHPYQGETQPAPDKPDAYTWCKAPRWRGQVLECGALARQVVTGHPLIRDMVEKTGGNVTTRVVARLLEISRVIPAMESWIKSLSPGEPFCVQGRMPDNAKGVGMVEAARGALGHWLVVKEGKIANYQIVAPTTWNFSPRDRDGIPGALEQALVGVPVGEHERVPLAVQHVVRSFDPCMVCTVH >NZ_AP021884.1|WP_147074718.1|1821238_1821706_-|nickel-responsive-transcriptional-regulator-NikR MERFTISLDEDLAQEFDRLILARGYSNRSEAVRDMLRAELEKSRQVRYEGTHCIAALSYVYNHHERELAERLTALQHDHHDLTVSTLHAHLDHDNCIECVVLRGKTAEVRDFAGKLIAERGVRHGNLSVITVSQEQHKHRHGLFARSHIHYKPHN >NZ_AP021884.1|WP_147074717.1|1819887_1821237_-|PAS-domain-containing-protein 
MFSKTGLLSLPDMPIEGVGEQFWMEVIRKMDEVYSDLLKYQTALEEQNNKLEESQQFIFGVLAAMSDILVVCDQTGTIEDVNQSLIELTGKTSAEWRGHPLVELFADDISRKQAELKFNGLQGQAIHDCEMQIRMANGSSMPVSVNCTARFNKKGKSVGMVITGRPVGELRRAYHALQEAHEALKRTQQQLVHSEKMASLGRLVAGVAHELNNPISFVLGNVHVLERYAGRLKEYLDAVHAGRSGIELAELREKLKIDRILGDIRPLIEGTIEGAERTRDIVDGLKRFSAIDREEECEFNLVEIIQRAVHWVTNITSESFQVEMDLPHFIPVLGSAAQIQQVIMNLVQNAVDATAEVKSPRLRIQAKIEKDKAVVEFRDNGSGILPENFPKIFDPFFTTKPVGKGTGLGLAISYGIVERHNGALFAANDAHDGGTIFVLNLPLYQSAKN >NZ_AP021884.1|WP_147074716.1|1818851_1819808_+|HTH-type-transcriptional-regulator-CysB MKIQQLRYLHEVARQGLNVSLAAEKLHTSQPGVSKQIQLLEEELGVDILVRHGKRVTGITEPGQKILAITERILREAENLKRVGADFTNETHGSLSIATTHTQARYALPSVIKTFSERYPGVQLRLHQGNPAQIVEMVLSGEADIAIATEAIALHDELVTLPCYQWNRCVIVQPDHPLLGEPTLTLERIADYSIITYDFAFAGRSQINKAFMERNLSPNVVLTAIDADVIKTYVGIGLGIGIMASMAFDPGRDQNLRAIDASHLFEPSTTRIGIRQGTYLRGYTFEFIQMFAPHLNHEAVNMAISAACRSAHQEAPKI >NZ_AP021884.1|WP_147074726.1|1832692_1833727_-|hydrogenase-nickel-incorporation-protein-HypB MCTTCGCSAGETRIEGQAMDGHSHVHADGTVHDHRHEAPAADGKMQYHAHHDENAHGHRHADGTWHSHDHGHEGEHVHEHGEDVIDYGQGPAHAHAPGLTQSQMVRIEQDILGKNNAYAGRNRNYFDEHGIFALNLVSSPGSGKTTLLVRTIETLKSRIQVAVVEGDQQTSQDAERIRSTGVRALQINTGKGCHLDANMVGHALERLHPEDDSVLMIENVGNLVCPAAYDLGEAHKVVILSVTEGEDKPLKYPDMFRAASLMLLNKTDLLPYVPFNVQLAIEYAKQVNPGLHIIQTSSTNGDGYEAWLGWIETGLARQRKKRAQTVAVLQKRIQELEAHLAARG >NZ_AP021884.1|WP_147074727.1|1833787_1834129_-|hydrogenase-maturation-nickel-metallochaperone-HypA MHEMSLAEGVLQILEDTATHHGFQQIKRVRLEIGELACVEVESLRFCLDVVVRGSVAENTMLDIVQTPGGGWCMNCSDTVPISALFSACPRCGSYQVQPTHGTEMRVLELEGV >NZ_AP021884.1|WP_147074728.1|1834121_1835225_-|nickel-dependent-hydrogenase-large-subunit MSLAGKLTFSVGWDGYRVTSVEVRSSRPQAACLLEGKTVEEAMRLVPLLFGICGKAQTVAARSAAQAAQNLCGDKQLMLRQRRLVALEAAQEHLWRLLVDWPNRLGLPAKQGLMMEWVKRISISRGDDDVLALGEAMLTMIEQDVLDESLDCWAATLERAERTPMRGLAGASLEMLRGLEPLHSGHPVFGHFLPRQAACLWGNELQPYLDGHFAVRPLWRNAPAEAGALALHHQIPLLAELLRTGHAASARYLARLVDWVSCVRLLRGEASSTELRLDACKLGKNAGLACVDTARGLLLHYIEVALGQIVRYVIVAPTEWNFHPAGPFVQTLRSLRADDAASLYQRINILILAFDPCVEYEVNLHHA 
>NZ_AP021884.1|WP_161984236.1|1835224_1835764_-|[NiFe]-hydrogenase-assembly-chaperone-HybE MKLMNPRPLENPSRMIESVFDGIARHRMAGLPILNPSLHVEAVGFRLWEGLWLGILITPWTINLMLLPADNPDYAALGLGETRRWRFPSGQYDFMGGEEPGLGSYQACSLFSPVFEFASQEDAVATARAALEQLLLEDLEAAVKREKAQWDQARFSDAPLAEQALSRRGFLRGAFLRDP >NZ_AP021884.1|WP_147074730.1|1835770_1835986_-|rubredoxin MDTFEGSYLGHDDRIDESVRLECGICWLVYDPEVGDPYWHIPPGTPFSRLPEHWTCPNCDAPRHKFMVLKD >NZ_AP021884.1|WP_161984237.1|1836001_1836784_-|hydrogenase-expression/formation-protein MNMPKGMAVFNPPSVPDDVAPELRDQAANLIRQLLAQMRAYRFGATSYPKIDLLKYDPRVVPLINDILGQGEVSIIAHQPTALRAQETVFASVWRVCYPGADGVLERDYLEVCPIPAVVAEIALAPTLKQISPPPPPAGAMNSPALLHEILDVVSTYQAGNPAHIINLTLLPLTPDDLAYLVQALGPGSVSILSRGYGNCRITSSGLANVWWVQYFNSSDQLILNTIEVVEVPEVALAAEEDFSDSIERVEEWLGTMLAA >NZ_AP021884.1|WP_147074732.1|1836869_1837361_-|hydrogenase MSEGVLDYALVAQEKVATEQNALGMLITRLCEQHQFVLVDEGNLEALTQASGDMVLLLTEDVVRSPETWDVAIVLPEILKLFGGRLKAAIADTENSKKLQARFGTTRFPAMVFLRDGEYVDVIQRMLDWDEFVAEVTGVLEKPIGRAPTIGIPVRNEVASSCH >NZ_AP021884.1|WP_147074733.1|1837357_1837666_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLGIPMQVIEAEESYAVCRGRDGNLARIDTMLVGSVQSGQWLMTFLGGAREILNEQQAEQVNSALNALAAVSRGASDVDVHFADLVGREPQLPDFLRKGGQ >NZ_AP021884.1|WP_147074734.1|1837670_1838294_-|HyaD/HybD-family-hydrogenase-maturation-endopeptidase MVEGSQFDTLILGIGNVLWADEGFGVRCVEAMNATYAFPDNVRVMDGGTQGLYLLPYVEAARRLVIFDAVDYGLEPGTLKLVENAEVPKFMGAKKMSLHQTGFQEVLACADLVDHLPEEMVLIGVQPEELEDYGGSLRPRIKARIPEVLEIAVERLVGWGIPVVARGTGETMRTESGILDIQRYEMERPTEEQACRLGDIRFLATGV >NZ_AP021884.1|WP_147074735.1|1838453_1838801_-|HigA-family-addiction-module-antidote-protein MVKTFLPSGLGFGAGALDPRFFSFSLRAPIAPGRFLESRFLHPLGLSQDRLARELGISRRRVNELIRGKRAITPDTAIRLGLFFGTGPVLWLTLQQAWDIHQEWRNFRRRSKAHG |

| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
|---|---|---|---|---|---|---|
| NZ_AP021884_3 | 2644962-2646084 | TypeI |
I-C
Consensus repeat of NZ_AP021884_3
|
15 spacers
spacers of NZ_AP021884_3
>3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG >3.2|2645072|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG >3.3|2645146|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG >3.4|2645220|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG >3.5|2645293|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG >3.6|2645366|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG >3.7|2645438|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT >3.8|2645510|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG >3.9|2645581|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG >3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC >3.11|2645725|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC >3.12|2645798|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA >3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA >3.14|2645942|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG >3.15|2646013|35|NZ_AP021884|CRISPRCasFinder,CRT GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_3
The CRISPR arrays of NZ_AP021884_3 >merge|NZ_AP021884|3|2644962-2646084|PILER-CR,CRISPRCasFinder,CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATACCGGTAGCGTCGGCAATACCCTGACCGCAGCGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACGACGCAGGTTATAGAGCGTTGCACGGCAAAATTGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCTAACCTGCCTTACACAGCCAGCCGCTACGATGAGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCGGTCATGGGTGCCAGTTACACTATCCCGATGGACGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACAATTTCTTGCTGGATAAAATCAAGCCGCTTAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAGCATGATCCAAATGGATGCCTTGCGGTAGCTTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCCATGGGTAGCACCGATTAGCACCTTGCCAAAGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646012|PILER-CR GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC 
ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC >NZ_AP021884|3|3|2644962-2646084|CRISPRCasFinder GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646084|CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG 
GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC
>NZ_AP021884.1|WP_147072286.1|2644492_2644783_+|CRISPR-associated-endonuclease-Cas2 MLIIVTYDVSTETAAGRKRLRRVAKACEKMGQRVQKSVFECTVNEMQFEQLERTLLAEIDETQDNLRFYRITEPVEVRVKQHGCFRSVDFEGPLIA >NZ_AP021884.1|WP_147072284.1|2643449_2644484_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MHTIQNTLYVMTPHAYAHLENATLRIDVEREKKLQVPLHHLGGVVCFGNVMVSPALMHRLADEGKSLVLLDDSGRFKARLEGPVSGNILLRQAHHSKASEPAFALGVARAVVAGKLKNSRTNLQRGAREAADPDEAATLTRSADNLAASLRAAAVANTMDELRGVEGEAARGYFAALNLIVKPLARPSFALNGRSRRPPLDRFNALLSFLYAMLMNDCRSAVEAAGLDAQLGFLHAVRPGRAALALDLQEEFRSILADRLALTLINRGQINAADFDEREGGAVMLGDKGRRTVVTAWQERKQEEITHPLTENKIPIGLLPFIQARFIARTIRGEMEGYLPYQAK >NZ_AP021884.1|WP_147072282.1|2643084_2643402_-|ribbon-helix-helix-protein,-CopG-family MATLTLRLPDNLDRQLTALAAQTHQNRSELARTALEKFLRELEQEQLLAEMVEAARFLATNPEARAESIAIAEEFLPLDNEALDIAEGRKPGDPWPEELGEKWWK >NZ_AP021884.1|WP_147072279.1|2642722_2643097_-|type-II-toxin-antitoxin-system-PemK/MazF-family-toxin MVEIMRRGEIWLARLNPNTGAEAGKVRPVLILLNDALLATGMSPVLCIPLTSKLYKNLAGLRIAIAPRGLLLKPCYAMPEQARALDRNRFGEGSLATLTNAEMAQVEKLFIAACGMAQYLIPQH >NZ_AP021884.1|WP_147072277.1|2642112_2642739_+|CRISPR-associated-protein-Cas4 MANSADEIVALSALQHWIYCPRQCGLIHLEQAFEDNVHTARGQAVHHLVDTPGYEIKSGVRVERALPVWCDRLNLIGKADLVEFHPDDSVYPVEFKHGAKRQKLHDDIQLAAQAICLEEMLNRPVPKGAIFHATSHRRREVSITPELKQLVEETANAIRAMLASGKLPPPVNDARCRECSLKEICQPEALAERGRLERLREELFSAAG >NZ_AP021884.1|WP_147072275.1|2641869_2642112_+|type-II-toxin-antitoxin-system-HicA-family-toxin MRVPRDLSGADLVKRLERMGYCVTRQTGSHMRLTSTVRGEHHITIPNHDPLRLGTLASILASVAAHHGLTRDELIQRLFD >NZ_AP021884.1|WP_124705901.1|2641666_2641873_+|2-oxoisovalerate-dehydrogenase MSEIHFIVEEAPEGGYVARAVGVDIVTEADDLPSLHAQVRDAVHCHFDEGKLPGLIRLHITREEVLTA >NZ_AP021884.1|WP_147072272.1|2640483_2641584_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 
MSIHNRYDFVLLFDVKDGNPNGDPDAGNLPRLDTETGQGLITDVSIKRKIRNFVGITKCKEDGTYETGFDIYIKEKAVLGRAHFAAFEKLGISLGQDATELIPDDLAEQFEALTLPEGMEIDTDEEGRSILNLSGATLDKKEAQKWLKDINPAKPLKNFISKVLKNVTARKPKQEESEKGRVQMCQDFYDIRTFGAVLSLKTAPNCGQVRGPVQITFARSIDPIVTLEHSITRCAVATEAEAEKQGGDNRTMGRKFTVPYGLYRTHGFVSAHLAGQTKFDESDLELLWEALKNMFEHDHSAARGEMATRGLYVFKHESHLGNEAAHKLFDRIKVNKTKDVPRGFEDYEVSVDETEMPSGVALLQKC >NZ_AP021884.1|WP_147072270.1|2638681_2640466_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MILQSLHEYYGRKRDSLPGDGIERKELPFLFVLKPDGAFLHIEDTRQGEGKRKRGNAFLVPQGVKKSVNVAANLLWGNVEYVIGQPDSKKLEEQRKKGKEKHYRERLGDMCSAFRTEIEQLPSEVKSTPEVAAVLAFLSSGNFTHVLADPLWPQVSATGANVSFKLTGAESPVCSASGILASVGQSTEDKGETRICLITGNSDVVERLHPPIKGVWGAQTSGANIVSFNLSAFNSFAREQGSNAPVGKRAAFAYTTALNHLLVSKQRIQIGDASTVFWAAEDNKMESLLSQFFDEPPQDNPDQGTNAVKELLEATLAGTPAIYDDGTRFYVLGLAPNAARIAVRFWHVATVGDLAGHIRQHFEDLEIVRPQYVERPFLSLKALLLAVSPLGDLDKLPPKLAGDFMKAILDGTSYPQTLLQAALRRIHAEQAKKDEKTGKHRDHVPYARAALIKAWLNRQTRNANPDQERKITMSLDESNINSGYRLGRLFAVLEKVQAEANPGLNTTIRDSYFGSASSTPSAVFPTLMRRNQHHMTKLRKEKPGLYVTRDKLIQTICNDGIDGQLGFRPILSLADQGRFVIGYYQQRQDLFTKS >NZ_AP021884.1|WP_147072268.1|2637986_2638685_+|type-I-C-CRISPR-associated-protein-Cas5 MPKTLCLKVWGDFACFTRPEMKVERVSYDVITPSAARAVFEAILWKPEIRWTVTKIEVLKPIKWISVRRNEIGKVASADNGQGDRGLYIEEHRQQRAGLFLRDVAYRLHAQFEVVDGSKHVHHYPELRGRFPAEPEESQPEHPAKYLSMFQRRAKKGQCFWQPYLGCREFSAHFELVDDAAAASLAEPPISDSPSLGWMLHDIDFADAMRPGFFRAEMKSGIIDLEDVEVRR >NZ_AP021884.1|WP_147072288.1|2647299_2647656_-|DUF2934-domain-containing-protein MAESKAKSKASGKPVSAVAETKPKAKTAQPAAGKAAAQSAVAAKPKVAKPKVAAPGANEPAAKRSVKLSNPAVSAEQRYRMIAEAAYYIAERRNFAPGDAAADWAQAEVQIVALLNKK >NZ_AP021884.1|WP_147072290.1|2647849_2649325_-|metalloprotease-TldD 
MTTSTLTGVLIPNPEMLFQTAHETLLVPNQLEASQLDGVFGRLMDHHVDYADLYFQYTRSEGWSLEEGQVKSGSFNIEQGVGVRAVSGEKTAFAYSDDISQPALLAAAEATRAIARSGAVRKPHAVARGGGHALYQPLDPLTTLKDAEKVALLEKLERYARAIDSRVTQVMASLASEYDVVLIARSDGHQAADVRPLVRLSLQVITEQDGRREQGSAGGGGRFGYDYFSDAMLKKYAEQAVHQALTNLAARPAPAGSMTVVLGAGWPGILLHEAIGHGLEGDFNRKGSSAFAGRVGERVAATGVTVVDDGTLMNRRGSLNVDDEGNLTQCTTLIENGVLKGYMQDTLNARLMGVPITGNARRESFAHIPMPRMTNTYMLNGDKDPEEIIASVKHGLYAVNFGGGQVDITSGKFVFSAAEAYMIEDGKITYPVKGATLIGNGPDVLTRVSMIGNDMALDPGVGTCGKEGQSVPVGVGQPTLRIDGLTVGGTA >NZ_AP021884.1|WP_147072292.1|2649382_2650324_-|carbon-nitrogen-hydrolase-family-protein MEKIVSDKTSKSPSSFAKPKSRTTVAKPARAPAPGVIRMAAIQMASGPNVSANLAEAERLVALAVAGGAKLVVLPEFFAIMGNKDTDKVAAREEEGKGPIQKFLASAAKKHKIWLVGGSVPLACDNPKKVRNSCLVYDDKGKLVARYDKIHLFGLDLGVEHYQEEKTIEPGDQIVVLDSPFGRIGLSVCYDLRFPELYRAMPNVDIILVPSAFTATTGKAHFETLVRARAIENLAYVIAPAQGGYHLSGRETHGDTMIVDPWGVVLDRLPRGSGVVMAGINPAYQASLRKSLPALKHRTLDCSHIQIKDKAIK >NZ_AP021884.1|WP_147072295.1|2650366_2654170_-|TIGR02099-family-protein MIAFSRRWIRRSVDYVVLPLALVVVVLVLLLRLWILPDIDRWRDDIAASISHSAGQRVTLGEINANWQGLHPHLRIRDIRVFGADGRPVLFLADVRATLSWTSLLHGELRLAVLTMDDVALTIRRDMQGIHVAGILLNQSDSSGGFGDWLLAQRHIQVNHATLAWNDERRGAPYLVARDVNLTLQNRGHRHRFRLTAIPPEQLAQPLDIRGDFSGRSLDDLASWHGQVYARVDRTDLGQWRQWLTLPYAISQGYGGLRMWLDVASRQVIAATVDASLRQVSVRFAADLPVLRLADVSGRGLWKRLGPAQSFAVKQLSLRTANFVYVAPFDLTLRLDPANAIQPGSGRIDTNSVQLDRLAALAPYLPLDAVQRRRLADLQPRGQLEKFTLAWSGNADQPLDYQIKGRFTRLGWQAQGNLPGAAGLSGNIDATRSNGTLALTSSGVMLALPRVLFEPDVALTTLTARMNWRATQAGYLIKLTEASFANPDLAGSAFGEYQLQAGRRGVIDLTGRLSRANVASAYHYLPLVVKDPTYQWVRSALLAGQGGAASIRLQGDLSRFPFRKAGDGVFEISTPISNGVLQYAAGWPRIEGIQAQLKFTGTRMEISSDAATIYGAALRRVSAVIPDLVDPDEILEVKGEAAGPLAELVRFANTSPLAAKLDNVTDNLRTTGNSRLGLDLKLPLRRAHHATLVGDIRFLGNTLIPAHGLPTLENVQGRLSFTDTGISAQSISARLLGGAATLSAVTQPGGVTRLLVDGRMTAAGLRPYLGTALAGHLSGMADWHARVDLHQMQAQADFESNLVGMASDLPPPFAKAAADSQPLRVKKSLRGADESLLAIHYGQVASALLLQKQKDGEPVIERGTLRFGGEAVLPEESGLWITGSLLLSDLDLWRNELTAAGNGAIGLPPLAGVNLSFRTLDLFGRRFQDININARNQAGTWRANVAGRGVNGDVTWQAADSRAGQPQDRLGAHFKTLAIPAALPVQGVKSSPSGSLPALDISVDNLQLGNRPLGRLSVSATPLDSGLNFESI
RLTQPDSTLTMQGIWNPDRIPQTRAKIHLEVNDVGRFLARFDHPGLVKRGQATLDGEGEWNGTPADIAIPSLSGTFALKASSGQFAKVDPGIGKLLGVLSLQALPRRIGLDFRDVFSDGFAFDEISGTMRLSRGVVYSDDFRMQGPSAKVRMSGMVDINAETQQLRVAVSPKLSESVALAGTLIGGPFVGLGALAVQKLLKDPFGQAATFEYSVTGAWTDPVVKRVARIAGGGEP >NZ_AP021884.1|WP_147072299.1|2654382_2656353_-|acetate--CoA-ligase MANIESVLQETRVFPPSAAFQAQANVSGMASHQALTARAAADYEGFWADMARAGISWKKDFSKILDESNAPFYKWFYDGELNVSYNCLDRHLPEKADKTALIFEADDGAVRRVTYQALYNQVCAFANGLKSRGVQKGDRVIIYMPMGVEAVVAMQACARIGAIHSVVFGGFSAKSLHERIRDAGARLVVTADGSIRGGKMLPLKSAVDAAIALGDCECVEAVVVYRRSGDDTAWNAARDIWWHDLVNGMAQTCEPEWVNAEHPLFILYTSGSTGHPKGVQHSSGGYLLGAILSMQWVFDARPDTDVFWCTADVGWITGHSYVVYGPLALGMTEVIFEGVPTYPDAGRFWKMIQDHQVTTFYTAPTAIRSLIKLGSDLPRQYDLSSLRLLGTVGEPINPEAWMWYYEAVGQSRCPIADTWWQTETGSHMIAPLPGAVATKPGSCTLPLPGIMADVVDEHGGSVPLGQGGYLVIKRPFPSLLRSLWGDPERFRKTYFPAELGGKTYLAGDSAHRDADGYYWIMGRIDDVLNVSGHRLGTMEIESALAANPRVAEAAVVGKPHDIKGEAVVAFVVLKGARASGDEAKKIVAELRDWVGKEIGPIAKPDEIRFGDNLPKTRSGKIMRRLLRAIARGEEITQDVSTLENPAILEQLKEAVR >NZ_AP021884.1|WP_147072301.1|2656415_2659091_+|bifunctional-[glutamate--ammonia-ligase]-adenylyl-L-tyrosine-phosphorylase/[glutamate--ammonia-ligase]-adenylyltransferase 
MPAHHLIERAASHSRYLARLLAADAQFVDSLASGLAQPFGADAMQAQLQAAAPGDEAMLKTALRKLRQAVMARLIVRDLGGLADLSEVMGTCTDLAETTLRCALAHHSTWLAQKHGMPKNPDGSDMQLVVVGMGKLGGRELNVSSDIDLIYLYPEQGETTGAKPVSHHEFFVLLGKKLGLAISDLTADGFVFRVDMRLRPWGDAGPLAMSYAALEDYLVAHGREWERYAWIKGRALTGTRLAELDQIIRPFVFRKYLDFNAFAAMRELHVQIRREVIRRDRADNIKLGPGGIREIEFTAQVFQLIRGGQVAVLQTRSLLAVLPLLAARGLLPENAVAELQAAYVFLRNLEHRLQYLDDAQTQMLPTQPDDRTRIATSMGFTDYPAFLAALNAHRTQVSRHFDQVFAAPQADSGSHPLAGLWQGALEHADALATLAGLGYTAPAEVCNRLRQIRTSIRYTTLPASNRARFDTLMPALIEVAASCNPPDATLARILDLLETVARRDSYLALLVEYPATLQRVARLCAASPWAAQYLARNPMLLDELLDTRQLYATPDWPALGDELQALMHTHCGDTERQMDAMRQFRQRVTFHLLAQDLAGVLALETLSDHLSDLAALILSATLPLAWAGVRNRHRDTPRFAVIGYGKLGGREMGYASDLDLVFLYEDPAPAAAEHYARLAQRINTWLGSTTAAGVLYETDLRLRPDGTSGLLVSSVEAFSQYQHSHAWTWEHQALTRARYVAGDAAVGAAFERIRCDILTQPRDPARLREDVLAMRQKMHAGHPNHSDLFDLKHDAGGIVDVEFMVQYLVLAHAARHRELTRNSGNIALLRLAAELELIPASDAEAVRSAYRELRRLQHALRLHGIQTARIEPMQVAGHAAAVRRLWRTLFG >NZ_AP021884.1|WP_147072457.1|2659154_2660069_+|branched-chain-amino-acid-transaminase MADRDGFIWYDGKMVPWRDATTHVLTHTLHYGMGVFEGVRAYNTDQGTAIFRLQEHTDRLFRSAHILGMKMPFDKAAISAAQLAAVRDNQLESAYIRPMAFYGAEAMGISAKTLSTHVIVAAWTWGAYMGAEALERGIRVKTSSFARHHVNIAMCKAKANGNYMNSILAHQEAAQDGYQEALLLDVDGFVAEGSGENVFIVRNGKLITPDLTSALEGITRDTIVQLAGEIGLQVVEKRITRDEMYSADEAFFTGTAAEVTPIRELDNRTIGTGARGPITAQLQKMYFDCVTGKDPKHAGWLSYI >NZ_AP021884.1|WP_147072303.1|2660079_2660286_+|zinc-finger-domain-containing-protein MAQVQHENTQRIIEVTADDLPLHCPTPGMIAWDSHPRVFLPVEVKGEALCPYCGTMYILKGGAVAHGH >NZ_AP021884.1|WP_147072305.1|2660694_2661141_+|6-carboxytetrahydropterin-synthase-QueD MLITRRLEFDAGHRIPNHASQCKHLHGHRYAIEITLSGDIITAEGQSEQGMVMDFSDVKRIAREQLVDAWDHAFLAYRGDKPVCDFLATLTDHKTIILELVPTVENLAHIAFDILDPAYRDTYGNQLRLKQVRIYETPNNWADCRQPE >NZ_AP021884.1|WP_147072307.1|2661191_2661659_+|YbhB/YbcL-family-Raf-kinase-inhibitor-like-protein MGMTMTSTAFAHHGAIPEHYTCDATDTSPPLAWAGVPVGAKSLVLIVDDPDAPDPAAPQRTWVHWLLYNLPPTSSGLAEGVTALPAGTLEGINDWKRTGYGGPCPPIGRHRYFHKLYALDVVLPNLDRPSKAALEKAMQGHILAQTELIGLYQRH |
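The neighbor-protein entries above use pipe-delimited FASTA headers. A minimal sketch for splitting such a header into fields follows. The field layout (contig accession, protein accession, `start_end_strand`, product name) is inferred from the headers in this record and is an assumption, not documented behavior of the database. `NeighborProtein` and `parse_neighbor_header` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    contig: str
    protein: str
    start: int
    end: int
    strand: str
    product: str


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Split a header such as '>contig|protein|start_end_strand|product'."""
    # Split on the first three pipes only, so the product name stays whole.
    contig, protein, location, product = header.lstrip(">").split("|", 3)
    start, end, strand = location.split("_")
    return NeighborProtein(contig, protein, int(start), int(end), strand, product)


# Header copied from the neighbor-protein list above.
example = ">NZ_AP021884.1|WP_147072292.1|2649382_2650324_-|carbon-nitrogen-hydrolase-family-protein"
rec = parse_neighbor_header(example)
print(rec.protein, rec.start, rec.end, rec.strand, rec.product)
```

The product name is kept exactly as written, because hyphens that stand in for spaces cannot be told apart from genuine hyphens (for example in `acetate--CoA-ligase`). Headers in the prophage tables order their fields differently and would need a separate parser.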
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
|---|---|---|---|---|---|---|
| NZ_AP021884_4 | 2907296-2907401 | Orphan |
NA
Consensus repeat of NZ_AP021884_4
|
1 spacer
spacers of NZ_AP021884_4
>4.1|2907330|38|NZ_AP021884|CRISPRCasFinder CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC |
CRISPR arrays and Neighbor proteins around NZ_AP021884_4
The CRISPR arrays of NZ_AP021884_4 >merge|NZ_AP021884|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGGCAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGCGATTCCAACGACTGGGTGCGGCTGGGCAATATGG >NZ_AP021884|4|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGG CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC GATTCCAACGACTGGGTGCGGCTGGGCAATATGG
>NZ_AP021884.1|WP_147070033.1|2903228_2905838_+|alanine--tRNA-ligase MKSSEIRQRFLDFFARHGHTPVASSPLVPGNDPTLLFTNAGMVQFKDVFLGRETRPYARAVSSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKRNAIQFAWEFLTQELGIAKDKLWITVYHTDDEAHTIWTAEMGVPDERVIRIGDKPGGGSDNFWQMGDTGPCGPCTEIFYDHGAEVAGGPPGSADEDGDRYIEIWNLVFMQFNRDEAGNLQPLPRPSVDTGMGLERISAVMQHVHSNYEIDLFQALIHAAARVTGSADLTDNSLKVIADHIRACAFLITDGIIPGNEGRGYVLRRIIRRAIRHGYQLGQKQPFFHLLVADLAMAMGAAYPELVAAQARVTAVLKQEEERFAETLEHGMDILEQALQSGANVLDGATAFKLYDTYGFPLDLTADVGRERGFTVDMAGFEAAMEAQRKRARAASKFTMQAGMRFDGPPTEFRGYDTLSLDSRILALYQDGSPVHSIAAGEAAVIVLDRTPFYAESGGQVGDSGELHGSGSVFVVDDTQKIQPDVFGHTGLLQSGSLKLGDTVSAQVDADARSRAACNHSATHLLHAALRQVLGTHVTQKGSLVDAARTRFDFAHSEAVSAAQLQQIEDLVNREIRRNVIVEARLMNYDAAIAHGAMALFGEKYGDQVRVIGMGEFSTELCGGTHVSRSGDIGLFKIISESGVAAGIRRIEAVTGPAALAMIQAQQRQILEAAALLKAPPQELQQKIAQIVDNVKNLEKELDRLKSRLAAAQGDDLVSQATAVGNAKVLAAMLEGADVKTLRETVDKLKDRLKSCAVVLGSCSDGRVTLVAGVSADLTSKVKAGELANFVASQVGGKGGGRPDMAQAGGTEPAQLPAALQSVAGWVAQRLE >NZ_AP021884.1|WP_147070267.1|2901373_2903095_+|thiosulfohydrolase-SoxB MNRREFLQILAVAAASGMAIDNKQALAGNAPSGFYDLPKFGNVSLLHMTDCHAQLLPIYFREPDVNIGVGAAIGQPPHLVGEYLLKYYGIRPGTREAYAFTYLDFAAAARTYGKVGGFAYLKTLVDKVRASRPGSLLLDGGDTWQGSATSLWTNGQDMVDAAKLLGVNVMTGHWEFTYGAERVKHVVDNDFKGHIDFVAQNIKTNDFGDPVFKPYVIKTMNGVQVAIIGQAFPYTPIANPRYMVPDWSFGINDDNMQKVVNEARAKGAQVVVVLSHNGMDVDLKMATRVTGIDAIFGGHTHDGVPQPTQVKNAKGTTLVTNAGSNGKFLGVMDFDVKNGKIAAWKYRLLPVFSNLLEPDAKMAKLIEDVRAPYASKLNEKLAVTEELLYRRGNFNGTFDQLILDALMAVKGADAAFSPGFRWGTTLLSGDVITMDHLMDQTAITYPSTTLTEMTGATIKSIMEDVCDNLFNADPYYQQGGDMVRVGGIQYAVAPNNKIGNRISNMTLKGKPVMASKKYKVAGWAPVGEGVSGTPIWDVVAEYLRDIKVVKPRKLNEPKIVGIGKNPGIAPGIA >NZ_AP021884.1|WP_147070035.1|2900312_2901191_+|sulfur-oxidation-c-type-cytochrome-SoxA MKTTFREPQAQSPKAHGKKILLALAGAGLLLGALNASATPEQDRQSLLKFYSSKYPDIKVANYIYGALAFDPDAMEQYNSIMDFPPFGSVIEHGKKMWETPFKNGKKYADCFPNGGKNVAGNYPYFDDKAGKVVTFEMAINACRTANGEEAFKYNDMQTMGTLTAYARTLSDGMPMNIKVQGAAATAAYEAGKSQFYSRRGQLNFSCASCHVANAGNHLRSELLSPAVGQATHWPVFRGGEQLVTLQERYVGCNKQVRAVPFAPGSEEYNNLEYFHSYISNGLPLKASVFRK 
>NZ_AP021884.1|WP_147070037.1|2899922_2900237_+|thiosulfate-oxidation-carrier-complex-protein-SoxZ MAEPMKMRASVSGDVADIKVLMNHPMETGLRKDAKTGQLIPAHFINEVHATVNGKPVLDAQWGGGVSKNPYLGFKVKGAKAGDKVEVSWKDNKGESNKVDGVVA >NZ_AP021884.1|WP_147070039.1|2899397_2899865_+|thiosulfate-oxidation-carrier-protein-SoxY MNALRRNILKSAGATGIVAMAAAAGLLKSGNVLAAWNSSAFAAKTVPEAIKDLGLSTPADSKAISIKAPDIAENGAVVPVEVTSSIAGTTGIAIFAEKNATPLITDFKLSNGAEGFISTRIKMGQTAMVRAVVTAGGKTYTAAKEVKVTIGGCGG >NZ_AP021884.1|WP_147070041.1|2898997_2899366_+|sulfur-oxidation-c-type-cytochrome-SoxX MRAGASLILTASMVGMFVMANYAVAADTPKQEETGKSIAFDKTKGNCLACHAMPTVPDAVAAGTIGPPLIAMSARYPDKAKLRAQIWDATVANPQSVMIPFGKHKVLTEQEIDKVTDFVYGL >NZ_AP021884.1|WP_147070043.1|2897364_2898786_+|M48-family-metalloprotease MKSASLFLALCLASQQLAASELPDLGDVSQGAFSPRDEARVGNEIMRDIYAEPAYYDDPELTDYLNNLGYRLVAASPENRLAFQFFVLRDHTLNAFALPGGFIGVHTGLIEATQSESELAGVLGHEIAHVTQHHLARMIESRNQGILPSLAALAVAILAARSNPQAASAAIATVQATSIQKQLNFSRANEREADRIGMQIMRGAGFDPRAMATFFERLQKNSRLYENNAPAYLLTHPLTSERIADMQNRAASMPVKQVADSLEFQLLRAKLLAGEGRPEEAVRRFTEAIRDTRYNSLAAERYGLVVALLRTRQFDRAEQELDRLNQSGASSPMIAMLGARLRQEAGDLNTALARYQAGRARFPGYRPLLYADANALLQAGKADAALALVTDHLALYPDDYRLYQLQSRAYAMQGKDFLRHHAQAEAYVRQGNLDAAIEQLKLGLKSRDGDFYQMSIAEARLKELVALNQPAKP >NZ_AP021884.1|WP_147070045.1|2896848_2897325_-|cyclic-pyranopterin-monophosphate-synthase-MoaC MNQLTHFDDRGRAQMVDVADKSDTRRVAVAAGRIVMQPATLKMILDGSARKGDVLGVARIAAIAASKRTADLIPLCHPLALTRVAVEFLAEEADSAIECRVTAETVGKTGVEMEALTALSVGLLTIYDMCKAVDRGMRMEGLRLLEKQGGKSGHWRAP >NZ_AP021884.1|WP_147070047.1|2894604_2896824_+|EAL-domain-containing-protein 
MTQIDTRLISTATWLAGALAGLIALAFPLVYFSLSYEHQAASMETEAEFEAARIARLINANPELWPFEQSRLQELLQDQTETELPESRRIVDVNGRLIAQSQGKSARPYLLRTADLRNSGSVAGRVEIIRSLRPLLLKTAMASLLGLLLGSLAMVIFRAYPLRILKRALNTLANEKERAEVTLHSIGDAVITTNASGHIEYLNPVAEQLTGWTNEAARGLPSWRVFNIINESTGAPLDSPAEKAIKENRIVPLANHAGLVKRNGKIIPIENSAAPICDSQGQIIGAVLVFHDVSHARAMATKLSHQASHDPLTGLINRHTFESRLQQALDNVRRENSHHTLCYMDLDQFKIVNDTCGHRAGDELLRQLAGELRTKVRNSDCLARLGGDEFGLLLEGCTVQQAEHVAATLLQTVKEFRFHWQEHTCAVGVSIGLVGINAGCGDLAKIMGAADSSCYAAKDRGRNCIYVYQPDDKEVAQRRGEMQWVARITRAIDEGRLRLYYQTIQPLAGTQGAHYEILLRMLDEEGRIVPPGTFIPAAERYGLMPAIDRWVIENTFATLGRLYRGDAKKRLHTCAINLSGTSWADESLAGFICGMTGRHGVPARSICFEITETAAISNLGKTIALIRDLKEAGFRFSLDDFGSGVSSFGYLKQLPVDYLKIDGGFVRNIIHDKIDHAMVAAINQIGHIMGIKTIAEFVENEEILERITAMGVDYAQGYAIARPQPLDHINLASAPVLQQ >NZ_AP021884.1|WP_147070049.1|2893538_2894183_-|methyltransferase-domain-containing-protein MQAAEYDAWYQTPRGRWVGETESDLLRRMLGPQSGESLLDVGCGTGFFTRRFARGSRSAVTGLDPNRDWLAFAERHAVSTENYVNGSALALPFDAGGFDLVMAVTALCFISDQRLALTEMLRVARRRIALGLLNRHSLLYWQKGRGGGQGAYRGAHWHTAAQVRELFAGQPVRNLRMAFSIFVPGGGWIAQQLESVLPTSLPLGSFFVVTADVV >NZ_AP021884.1|WP_147070029.1|2907798_2908467_-|alpha/beta-fold-hydrolase MTAFAPLEFVAGSQAVQASVIWLHGLGADGHDFAPVVQALDLPGVRFILPHAPTRPVTINGGHVMPAWYDIRSTGLDADEDAAGLAQSSRVVEDLVAHELARGVASARIIVAGFSQGGALALYAGLAPGRVLGGIMVLSAYLPLMAGFNEWCAAGTHTIPVFMAHGVQDRVVPLQLAERSRQKLVACGFDVEWQIYPMAHSVCEEEIDAIRGWLIRVLQLHV >NZ_AP021884.1|WP_147070027.1|2908463_2909963_-|hypothetical-protein MRRIPLVGKLLPAAGEALAVPVRDDPASYSAHEICESIEQLIETLLSARKKNLDWQRHSIDSLHRQDNFSAPFMTRLTQHYLALPPFVSSVSGRFLAAISGYWEEMSAIHLQCVTYLLGHPESRLGALLPLLIQRALYHHAMQMKWRWLRYQLIPSCFWARLHRLYAVAEKHEFARVPLPLPGMERADSCCETLYLRPQMLHSLRPDTLLPCEIEQVDEWIVRWSKSVLLEPMLLSGKHRYGVNLKGASPPRPLAMLNEPGSYRYWGPGLMLAALHAEHDEADAGAHAGWRQALWRRVVNDWSGIPPLRHHPRQMIGKQTELFLGFNEIHTRIDHHPSRRAHDLPYWRRCRVRDESAEGLGLALNTSDGVPVAINSLIGINSGRHFLVGVVCRIRRHESGWTEIGIRRLAANAVPVKLESVNVNLAGQVVDALYLSMAGAFGQRRCVLIPARISWQDGQWQLLCKGRRHLIRLRAPLKATEDYVLADFDGLAQSEAIAS >NZ_AP021884.1|WP_147070024.1|2909981_2910410_-|hypothetical-protein 
MKDFIALLVEQSRLQSVAINPALDDALTHLDHALAGLCAAVQVEYRGPYVGVETPLAHQMVVRRHEWKIHQPAWSMKICVAAPAANCRAEWPVQGVGRLRKALVVKALPAFFAGFAEAIKQAGKQDSSAGLRVLELSRRFNL >NZ_AP021884.1|WP_147070022.1|2910437_2912018_+|sigma-54-interacting-transcriptional-regulator MRAQIRANWRKYYHTTRRAVRARSATRTAGNLAQSGQNANNAKEGTNVSLQGISAKPSLLIVDDDPLITDTLNFVLSRDFEVFVADSRSQVKSLLTQLDTPPQLALVDLGLPPLPHKPDEGFHLISELLGYSPGIKILVLSGQNDETNARHARALGAIDFVGKPCEPAQIKSLLFNALLIQDVERSAETEAPAAENLIVGTSFNLDRLRQQITQYANAPFPVLIEGESGSGKELVAASLHKLSGRTKKPYLALNCAAISPTLVEPTLFGYCKGAFTGATSNRAGYFEDACDGTLFLDEIGELPLELQAKLLRVLENGEFQRVGETQSRFSNARVVTATNRDLRQEIKAGRFRADLYHRLSVFGIAVPPLRELGEDKVRLLEHFREFYAREARVKPFALDNRARQMWEDYHFPGNVRELRNIVIRLTTKCAGQNVTAEQLETELDTDTAFPSEIPLPNDGKALYDTARRHLQTLANFSLDQTMKQWEKSYVEAALNLTHGNLSQAAKILGINRTTLYSRMQTYTNEA >NZ_AP021884.1|WP_147070020.1|2912147_2913497_+|AAA-family-ATPase MYHEFFGLKEAPFRITPDTGFFFSGGERGAILQGLAYAIRQGEGIIKVTGEVGSGKTMLCWMLEQHLPDHIETVYLANPNVKPEDVLPSILAELELVRPADASRAGHLRTLNDYLLARHDAGKQVVMFVEEAQGMTLDTLEEIRLLSNLETEREKLLQIVLFGQPELDAKLADPRIRQLRERITTAITLAPLTPDAIRAYLAFRLTTAGYRGPDLFDRRAVRSIARASRGLTRRVNILADKSLLAAYTDNTRTIQPRHIRIALRDSAFNDDANKPQRWLLPVIAMGVMVAVLASFYWRSKPAAAPSRQTQTRPAAGLPGRASAAAPDPVAPLSADPFQQRLAATRTWLMQQPADTRTIQLSLLNSPSEFAAYLRGEGGGLAPDQLRIFRTQAQGHPSWTVIYGSYPTRQTANRALLALPEAVRKRHPYLRTVGGIRNETRQIQQVGEQS >NZ_AP021884.1|WP_147070265.1|2913576_2915352_+|secretin-N-terminal-domain-containing-protein MWLPMLAVPLLAGCVPAAMIQPSQGHIQQSSQPATRLADIPPLVKTIPYLPSPRAETQVPTYTIVVDNVPVKDLLFSLARDTKKNIDIGTGITGNVTLNAVNEPLPAILERIARQASIRYRMEGDTLSIMPDTPYLKTYKVNYVNLSRNTSSSIGVAAQIASTGSGAVGAAASGSAQGGNSSSTTVDSQSNNNFWEVLTENVRAILTSTRASTQRAEDKSARLDAERNARADRLEQAQAVARAGAAAPTLYREAFGNTSSSLLQDSKNEVIVNPVAGTVSVLGNERQQQVVQQYLDGVSQSSQRQVLIEATIVEVSLKDQYRAGIDWSRLANGSKGIFFNTMPAATTNLANSLLPFFNIGYRDRNLTATLNLLESFGNLRVLSSPKLMALNNQTALLKVVDNLVYFTVQAQQGTLSSTGTPLQPTTFTTTAKTVPVGLVMSLTPQISESGMVTLDVRPTISRKIGDVSDPNPGLPVSTPNKIPVIQVREMESVLQVGSGQTVILGGLMQDDSDRARDGIPVLSRPQGFGAIFGQHEHNVQKTELVIFLRPTVITNPSLDSDELKFYKRYLPRANAAPEQWHNGADAAGDPQ 
>NZ_AP021884.1|WP_147070018.1|2915348_2916524_+|tetratricopeptide-repeat-protein MSLLLKALKQAGDKSAAGARNPSATLADSLSLEPISGSAPDGTAYTSWDGAAPFKRSTARAAWYTPWLSGQRWLVPAVAVVAALFMLIYGVFVYWQTRTPAALVVTPTPHSAAPAAAPPAAAPAQLAAVPSQESGPPLPEINSAVPDAPAALPPPPVQADPTPQWGSGELIREAPPPRRARTQPGRRETRSALPFSMQTATTHINPQLEAAYQAYQAGHTREARNLYLQIPDGERNVDVQLGLAAIALRDNDTPAAARHYQRVLELDPRNSTANSALIGMMGDADPNASETRLKSLIASQPSSQLYFALGNLYAGQNRWPDAEQAYFEAYQKNAANADYAYNLAVSLEHISQSRAALNYYQKARDLMQPGNVQFDPLRLEARIDQLKARQE >NZ_AP021884.1|WP_147070016.1|2916529_2918233_+|Flp-pilus-assembly-complex-ATPase-component-TadA MEARKTLRLGEMLVQQGLITLDQLRIALKEQQHTNLPLGRLLVKLGFITEAVIRDQLAHTIGQTSLDLANVVADPEALKLISEDFARRHHLLPIAFDAQRQVLVVAITDMFNVVALDHLRALLGAGVEVDTVLSGEAQLLEAIDNFYGFELSVDGILREIETGEVDYQSLAMDTEEYTQPVVRLVGSLLVDAVKRGASDIHFEPEHAFLRIRYRIDGVLEQVRSLHKSYWPGIAVRLKVISGMNIAENRAPQDGRLSLTLHGRPIDFRVSSQPTIHGENIVLRVLDREKSIIPLANMDLPTDTHTALQRMMARPEGILIITGPTGSGKTTTLYSLLTHLNNETVNIMTLEDPVEYPVTLMRQSSVNETLKLDFANGIRSIMRQDPDIILVGEIRDRDTAEMAFRAAMTGHQVFTTLHTNSALGAFPRLLDIGIVPDIMAGNIIGVVAQRLVRVLCPHCRAAYTPDADEQKLLDWQATDRRPVYRAVGCPACNGKGYRGRMALMELLRMDSELDDLVARRATHREILNAALMRGYRSLAVDGISRVLEGKTSLAEVSRVVDLTQRILS >NZ_AP021884.1|WP_147070015.1|2918246_2919452_+|type-II-secretion-system-F-family-protein MPYFSYRAVDQIGRTNRGSLSAANEVDLELRLRRMGLDLITLRQMDSRASGFARGAASRRDLITFCFHLEQISRAGIPILDGVRDLRDSMDNPRFRDILTALLEDMEGGRLMSQALAAHPAVFDTVIVNLVRAGEQTGLMREVFENLGASLKRQDELAAQTRRLLIYPTLVLSMVGIIILLLLLFLVPQIADLIKNMGIALPIQTRVLLWLSETLRTWWPLFLILPVAIGSALVVTLRASERARFVADDVKLRLPVIGPILQKIALARFSNFFALMYRSGITILDALRAGEDIAANRVIADAIRRAGGRIGNGEGLTESFQSLSVFPPLVIRMLRVGETTGALDTALENVSYFYTREVSESIEKSLKILEPALTVVLGLVMAVIVGSVLLPMYDVIGTLKP >NZ_AP021884.1|WP_147070013.1|2919448_2920984_+|hypothetical-protein 
MMFAPQLLVYVCAWSITVACRRAGKIRLVGQFNADEGGRRAFAAVLQAFKNSPVSVMVDGVDEDYRLETLPHVLGNARREMLERRLRQISRNALFSAAWPQGREASGRRDDRYLFISLSNHDAVRPWLDLLHQHGVHLAELTVLPAISHVLLQRIQPTEPHVLLVSEHCGGLRLSYFEHGNLRFSRLTAPESLAEGHAPDLASEINKTDLYLNSQRLMPRDAQLAVYVLDPENAYAGLCREISAENKNLICQAVGSVALAKLVGVDEPLLHRTADVAYLAVLGRSRAAVNLAPAAYTRGYVQLMLRHKLYTGAFAVLATALAISGYLFSRQHDLEQQRLVTQDRIQQQASLYRAVQLALPRAPTSPQNLKRVVETARALYAAPQPMSDFARVSQALETVPDIAVLRLRWLDHDAADTTATHSAVSDNPGAAVRALYFDGEVSPFQGDYKTALASIEHFAATLRNDPGVAEVRVLALPINTDPTATLDESQHTGNSAPRARFRLKLLMRPAR |
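The coordinates reported for NZ_AP021884_4 can be cross-checked against its repeat and spacer sequences. The sketch below walks the repeat, spacer, repeat segments from the array start and derives each segment's position. It assumes the reported coordinates are 1-based and inclusive; `locate_segments` is a hypothetical helper introduced for illustration.

```python
# Values copied from the NZ_AP021884_4 row above.
ARRAY_START, ARRAY_END = 2907296, 2907401  # assumed 1-based, inclusive
SEGMENTS = [
    ("repeat", "GATTCCAACGACTGGGTGCGGCTGGGCAGTATGG"),
    ("spacer", "CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC"),
    ("repeat", "GATTCCAACGACTGGGTGCGGCTGGGCAATATGG"),
]


def locate_segments(array_start, segments):
    """Return (kind, start, end, length) per segment, 1-based inclusive."""
    located = []
    position = array_start
    for kind, sequence in segments:
        end = position + len(sequence) - 1
        located.append((kind, position, end, len(sequence)))
        position = end + 1
    return located


located = locate_segments(ARRAY_START, SEGMENTS)
for kind, start, end, length in located:
    print(f"{kind}\t{start}-{end}\t{length} bp")

# The last segment should end exactly at the reported array end.
assert located[-1][2] == ARRAY_END
```

Under that assumption the spacer begins at 2907330 and is 38 bp long, which agrees with the spacer header `>4.1|2907330|38|NZ_AP021884|CRISPRCasFinder`.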
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
|---|---|---|---|---|---|---|---|
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
|---|---|---|---|---|---|---|---|---|
| NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_007766 | Rhizobium etli CFN 42 plasmid p42f, complete sequence | 395837-395869 | 6 | 0.818 |
| NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NZ_CP020911 | Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence | 526121-526153 | 6 | 0.818 |
| NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_021911 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence | 601591-601623 | 6 | 0.818 |
| NZ_AP021884_3 | 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645871-2645904 | 34 | MN692973 | Marine virus AFVG_117M33, complete genome | 35754-35787 | 9 | 0.735 |
| NZ_AP021884_3 | 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2644999-2645034 | 36 | NC_008043 | Ruegeria sp. TM1040 megaplasmid, complete sequence | 694259-694294 | 10 | 0.722 |
| NZ_AP021884_3 | 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645653-2645687 | 35 | NZ_CP007794 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence | 1356140-1356174 | 10 | 0.714 |
1. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NC_007766 (Rhizobium etli CFN 42 plasmid p42f, complete sequence) position: , mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
2. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NZ_CP020911 (Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence) position: , mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
3. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NC_021911 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence) position: , mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
4. spacer 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to MN692973 (Marine virus AFVG_117M33, complete genome) position: , mismatch: 9, identity: 0.735
aacaatttcttgctggataaaatcaagccgctta CRISPR spacer tacaatttcgttctggataaaatcaagtgttgca Protospacer ******** * ***************. . .*
5. spacer 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to NC_008043 (Ruegeria sp. TM1040 megaplasmid, complete sequence) position: , mismatch: 10, identity: 0.722
atggtgcgatcctgttgttgctggttgtgctgcggg CRISPR spacer tcgctccgatcctgttgtggctggtggtgctgatca Protospacer .* * ************ ****** ****** .
6. spacer 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007794 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence) position: , mismatch: 10, identity: 0.714
accggttcatcgccgtgccgcctgtccatcgccgc CRISPR spacer gtccagccgtcgccgtgccccctggccatcgccgg Protospacer ..* . .*.********** **** *********
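The Identity values in the spacer hit table above are consistent with (spacer length − mismatches) / spacer length, rounded to three decimals. The sketch below checks that relationship against the reported rows. The formula is inferred from the numbers in this record rather than taken from the tool's documentation, and `identity` is a hypothetical helper name.

```python
def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions that match, rounded to three decimals."""
    return round((spacer_length - mismatches) / spacer_length, 3)


# (spacer, length, mismatches, identity reported in the table)
hits = [
    ("2.1", 33, 6, 0.818),
    ("3.13", 34, 9, 0.735),
    ("3.1", 36, 10, 0.722),
    ("3.10", 35, 10, 0.714),
]

for spacer, length, mismatches, reported in hits:
    print(spacer, identity(length, mismatches), reported)
```

For spacer 2.1, for example, 27 of 33 positions match, giving 0.818.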
| Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
|---|---|---|---|---|---|---|
| DBSCAN-SWA_1 |
622875 : 631682
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|DBSCAN-SWA TTCAGGAATTGCCGCTGTTGTCGATGAAGCTCTTGAGACGGTCAGAGCGCGATGGGTGGCGCAGTTTGCGCAGCGCTTTGGCTTCGATCTGGCGGATACGCTCACGGGTTACGTCGAACTGTTTGCCGACTTCCTCCAGGGTGTGGTCGGTATTCATCTCGATGCCGAAACGCATGCGCAGCACTTTGGCTTCGCGTTGCGTCAGGCCATCGAGAATATCCTTGGTCACTTCCTGCAGGCTGCCGTAAACGGCGGCGTCTATCGGCGCCAGCGTGGCGGTGTCTTCTATGAAATCGCCCAGATGGGAATCCTCGTCGTCGCCGATAGGGGTTTCCATGGAAATAGGCTCTTTGGAGATTTTGAGTATCTTGCGGATTTTCTCCTCGGTCATTTCCATTTTTTCGGCCAGCAATTCCGGGTCGGGTTCCTTGCCGGTTTCCTGAAGAATCTGACGCGAGATACGGTTCATCTTGTTGATGGTCTCGATCATGTGCACCGGAATACGAATGGTGCGTGCCTGATCCGCGATGGAGCGGGTGATGGCCTGACGGATCCACCAGGTGGCGTAGGTCGAGAACTTGTAGCCGCGCCGGTATTCGAATTTGTCCACGGCTTTCATCAGGCCGATGTTGCCTTCCTGGATCAGGTCGAGGAATTGCAGGCCACGGTTGGTGTATTTTTTTGCAATGGAAATCACCAGGCGCAGGTTGGCCTCGATCATTTCACGTTTGGCGCGGCGCGCGCGCGCTTCGCCAGTGGACATCTGGCGATTGATTTCCTTCAAGTCCTTGATCGGGATGCCCACTTTTTCCTGCAGGGCAATCAGGCGTTGCTGACGTTCGACGATAGTGTGCTGGTAGCGTGTCAAATCCTCTGAGTAGGCCTTTTTCGAATTGATTTCCTTGGTAACCCAGTCCAGATTGCTTTCGTTGCCTGGAAACACCTTGATGAAGTGGGCGCGGGGCATACCTGATTTGTTCACCGCGAATTCCATGATGTCACGCTCGTGGCTGCGGACTTCTTCCACCAGATTGCGCAAGCCTTCGCATAGCGCCTCGACTTGCTTGGCAGAAAAGCGGATATTCATTAGCTCAGCGGAGAGCTCTTCCTGCAGTTGCAGATACTGCGGGCTGCCGAAGCCGTTTTTCTTCAACGTGGCCTGTATCCGCTTGAATACCTTGCGGATGACTTCAAAGTGCGCCATGGCGTCGATCTTGAGCTGAGCGAGATTGGCTGCAGCCAGCGCGCTACCGTCATCTTCCTCGGCATCTTCGTCGAGCTCTTCTTCGAGTTCGCTGATGTCAACCTCGGGCTCGCTGGCGATCGCTTCGAGTTCTTCTGCTACGACACCATCCACGAAATCGTCAATGCGAATTTCCTCACGCTCAACCTTGTCCACCAGTGTCAGGATTTCCTGAATCGTGGTGGGACAGGCGGAGATGGCCTGAATCATGTGTTTCAAGCCGTCTTCAATGCGTTTGGCAATTTCGATTTCGCCTTCACGCGTGAGCAGTTCCACCGAGCCCATTTCGCGCATGTACATGCGCACCGGGTCAGTGGTGCGGCCGAACTCGGAATCCACGGTAGATAACGCAGCCTCGGCCTCGGCCACCACGTCCTCATCGGCGACTGCCGGGGCGGCGTCGGACATCAACAGGGTCTCGGCATCAGGGGCTTCGTCATAGACCTGGATGCCCATGTCATTGATCATGCTGATGACGCCTTCGATCTGTTCGGCATCCAGCATGTCATCTGGCAAATGGTCGTTGATCTCGGCGTAGGTCAGGTAGCCCCGCTCCTTGCCGAGCACAATCAGATTCTTCAGGCGTGTGCGTCGGGCTTCGACATCTACCGTTTTCACCTCGCCTTTTTCGTGATCGTTTGCCATATGTCTTGGTCCAGTAAAATTTGGTGCATCAAAAAAA
CTGACAATTATACCTTTGTTCGAGCCTGATCGCTCATTTTTCCACCTGCTGTCTGGCATTAACCAATTGTCGCAACGCTGCTTTTTCGGTTTCACTCAGTGCGCTGACGGGCTTGTCGGTTACACGCGCACCCCGCTGTTTGTCGAGTTGCATGCGTGCCTGATAACAGGCGTCAGCAAATTCGGCGCTGATGTCGAGATCGGCTGCCCAGCCCATTATTTCCGCGCTGGCGCGCTGCAGGATGGACGCCAAAGCGTTATCGCGAAAATGGTCTATCACGCTCGCAGTGCCTAAATTGGGATGAACGCGCAGCAATTCAACCACAGCGCGTAGCGCGTCTGCATCCGGGTCAGTTGGGGCAATCAAACTGGCGTCAAGTTCCCGTGCCAGGCTGGGCATAAACAGAATCGCACGCAGCAGCCAGTGCCAAATGGAAGCGGGTGCCTGGCGTGGGGCACGGACAGGTGCACGCGCCGGGTTGAATTGCCGGGATTTGATCTGCCACAAGCTGTCGAGTTCGGACAGCTGCAGATTTGCCAGTTCGGCGCAACGTTTGCGCAGCAGGAGTGCCAGGGCGGGTGCGCGTATCTGGCTTAACAGGGGGTGCGCAGCCTGCAAAAAGGCACTGCGTCCTTCGCTGCTGGCCAGGTCATGCTGGGCTGCCAGCTCCTTGAACAGATAGGCAGATAGCGGTACCACCTCGCCGCCCAGCAGCGCTTCGAAAGCGCCTTTGCCAAATGCGCGAATATAGCTGTCCGGGTCGTGTTCCGGCGCCAGAAACAAAAACCCGACACGACTGCCATCGCTCAGTATGGCCAGGCTGTTTTCCAATGCGCGCCAGGCGGCGTGCCGTCCGGCGGCGTCGCCATCAAAGCAGAACACGAGCTCATCAGTGTGGCGCAGCAGTTTTTGCACGTGCGCCGCGGTGGTGGCGGTGCCCAGGGTGGCGACGGCGTACTCCACCCCATGCTGCGCCAGTGCCACCACGTCCATATAGCCTTCCACCACAATCACGCGTCCCGCATCGCGGATTGCGCGGCGTGCCTGGAACAGGCCATACAGCTCGTTACCTTTCTGGAATAGCGGCGTTTCCGGTGAATTCAGGTACTTGGGTTCAGCTGCGTCGAGCACGCGGCCGCCATAGCCGATGACATCCCCGCGTTGCCCGACAATCGGGAACATGATGCGGTCGCGAAAACGGTCATAACGCTGCCCGGCGTCGTTAACGATGACCAGCCCGGCTTCGGCCAGCGCCGGATCGGCATACTGGTCGAACACCGCTGCGAGATTCTGCCAGCCGGCTGGGGCATAGCCAAGGCCAAAGCGGGCGGCGATTTCGCCCGTCAAGCCGCGTTTCTTGAGGTAGTCAATTGCGTGCGGTGTTTGTTTGAGTTGCTGGCGATAGAACTGTGCGGCGCGCTGCATGATTTCGACCAGGCTGGCGGCCTGCCTGGCGCGTTCCGGGTTGGCGGCAGGCCCCTCCGGCACGCTTATGCCCATTTGGCCGGCCAGTTCCCTGATGGCGTCCACATAGCCCAGCCCGGCGTATTCCATCAAAAAACCAATGGCGCTGCCGTGGGCGCCACAGCCAAAGCAATGATAGAACTGCTTGGTGGGCGAGACGGTGAAAGAAGGGGATTTTTCGTTATGAAACGGGCAACACGCCTGATAGTTGGCGCCGGCTTTTTTCAGCGGCACACGGCGGTCTATCACCTCCACGATATCCACGCGGTTCAGCAGCGTCTGGATAAAATCCTGCGGGATCATCTTGTTCCCTCGGGGTGGTGACGTAACAACGCCGCTACGCGGCGGACAATCAGCCCGCCAGCTTGGCTTTGATGTGGATGGAAACTTGCGCCATGTCTGCGCGCCCGGCGAGTTGTGTTTTTAGAAGCGCCATCACCTTGCCCATATCCTTGATGCCGGCAGCGCCGGTATTCGTAATGGCCTGGATGATCAGGCTATCTATTTCTTCAGCGGATGCGGCTTGAGGCATGTAA
GCTTGCAACACGCCGCTTTCGAATTTTTCGATGTCCGCCAGCTCCTGGCGTCCGGCAGCCTCAAACTGGGTAATCGAATCGCGGCGCTGCTTGAGCATTTTGTCGATCACGGCGATGATTTGTGCATCATCCAGTTCGATACGCTCATCCACCTCGCGTTGTTTGATCGCGGCGAGCAATAACCGAATTGCCCCCAGGCGTGCCGCATCCTTGGCGCGCATGGCGGTTTTCATGTCTTCGGTGATGCGTGCTTTGAGACTCATAAGCTTATTACCGGCTGGAACAGCGCCCGGTTTGATGATTAATACATCTTGGGTGGCAGGGTCTGGCTGCGGATGCGTTTGAAGTGACGCTTGACTGCGGCGGCCAGCTTGCGCTTGCGCTCGGCAGTGGGCTTTTCGTAAAACTCGCGTGCGCGCAGTTCGGTCAACAGACCGGTTTTCTCAACAGTGCGCTTGAAACGACGCATGGCAACTTCAAAAGGCTCGTTTTCCTTGACGCGAATGTTCGGCATGAAATCTTGTCTCCAGGGACGGAAAAAACCTCAATTATAACTGAAAAAGCGTTTTTTTCAAGGCTATGCATTGCCCTGTCCGTGTGGCGCGATTAAACTCCTGCCCATGCTTATTCTGGGAATCGAATCTTCCTGCGACGAAACCGGTATCGCCCTATACGACACCGGGCGCGGACTACTGGCGCACGCACTTCATTCCCAGGTTGCCATGCACGCCGAATATGGCGGTGTGGTGCCCGAGCTCGCCTCACGCGACCATATCCGGCGTGCGCTACCGCTGACCCGCCAGGTACTGGCACAGGCAGGGTGCACGTTGGCCGACATTGACGCGATTGCCTATACCGAAGGTCCCGGTCTGGCTGGCGCGCTGCTGGTAGGTGCCGGCATCGCCCATGCGCTGGGCGTGGCGCTGGGGGTGCCGGTGCTGGGGGTACATCACCTCGAGGGGCATTTGCTCTCGGCGCTGATTTCCGATACGCCGCCGCAATTTCCGTTTGTGGCGCTGCTGGTATCGGGCGGGCATACGCAATTGATGCAGGTCGACAGCGTGGGGCGTTACACCACGCTGGGCGATACCCTGGATGACGCTGCGGGCGAGGCGTTCGACAAGACCGCACAACTGCTTGGTTTGGGCTATCCGGGCGGGGCGGCGTTATCGACGCTGGCGCAGACCGGCGACCCGCAGCGCTTCAAGCTGCCGCGTCCGATGTTACATTCGGGCGACCTCAATTTCAGTTTCAGCGGTTTGAAAACCGCGGTGCTCACACTCACGCAAAAACATCCCGGTCCCGCTGACCGCGCCGACATCGCTGCTGCGTTTCAGCTTGCCATGGCCGAGGTGCTGACGGCCAAATCGCTGGCGGCGCTCAAACAAACCCGATCCAGGCGGCTGGTGGTAGCCGGTGGCGTGGGTGCCAACCGGCAGTTGCGCGAGGCCTTGAACGCAGGCGTTAGCAAACTGGGCGGTGCGGTATTTTTCCCGCGCCTGGAGTTTTGTACCGATAACGGCGCGATGATTGCCTTTGCCGGCGCGATGCGCCTGGTGCATGGCGGGCGTGCCGCAGGGGTGTTTACGGTACGGCCACGCTGGGACTTGCAGGAAATCCCGGCACCCCATAATCATCCGGGCACCGTCATGGCTTAAGATGCGGTGCCATGCCGTGCACCCAGCCAAGCAACAGCTTGCCGCCCGGCAGTTGCGCCAGCACCTCCGGGAACAAAACCAGGCCGAACAACAGCGCCAGCACGCAGATCAATAACAGGGTGGAAAACACGCTCTGCGGGCGTAACACCCCCCAGTAGCGCATGGCGAGGAACATGACATCCTTGCGATGCTCTGACATGCGCCGCCAGCGCAATGCCAGCCGCACCACCAAATACACCGCATAGCCGCCGGTGAGCGCGGCGAACACAATGCGGAACACGCCAAACAAATCAATGCTGCCTGCCACAAAGTGCTGGTACAGCCACGACACAAACCCCACCA
GGATGGAAGCGCTTACCGCAATGAGCACATTGAAAAACAGTCGCCGCCGCTCGCGCGACAAATCCTGCTGGCGCTGGATGAAAAAGCTGTCCACTTCCAGCTTAAAGTATTCCACCTTGTCCTGTACCAGCGCAGAAATTTCGCGTTTGGCCACCGCCATGCGTTCATCCAGAACAGCGCCCAGACGATCAGCAGCATAGTCCACCAGCTCGCGCACGTCGTCCTTGGTGAACTGGCGCTGGTTGTGCAATTCGGCAGAAATCTTGTCGAGCTTGGCGTCGATTTCCTGGCTGGCGCCCACGACCACGTCGCGTAATTCCGCACCGACGCTGGCTGCGCCTTCCTTGACCACGCTGCCCAGTTTGTCGCCCGCCAACTCGATGCTGTCTTTCGAGACCTGGGCCAGGCTCTCGCGCGCGTAGTTGATTTCTTTTTCAAACCAGGCCATGTCTGTCCCGGTTTATTCGGAAGAGGTGTCCGCCGGTTTGGCGAAGCGCCCTTCTTCACCCGCCAGCAGGCGGCGGATATTGGTGCGGTGACGCCAAAAAATCAGCGCACTGATGATGCATAACGCGACCACAGGCAGGGCGCCACCGATTAAATAGGTACCGAGTACTGGCGCCAGTGTGGCCGCAGTGAGTGCGGCCAGTGACGAGATGCGGGTGAGGACAAACACGACAAGCCAGCTGGCCAGCGTTGCCAAACCCAGCCACGGCGAAATCGCGAGCAGTATACCCAGCGCGGTGGCCACGCCCTTGCCCCCCTTGAAGCCGAAAAACAGCGGATATAAATGTCCGAGGAACACGGCGACCGCAGCGCCGTAAGTTGCGGCAATTTCCACGCCGTATTGGCTGCCAAAATAACGCGCCAGATACACCGCCAGCCAGCCTTTGGCCATGTCGCCGACGAGGGTCAGCAGCGCCGCTGATTTGCGCCCGGTGCGCAGCATGTTGGTCGCACCCGGATTGCCCGAGCCATGCTTGCGCGGGTCAGGCAGGCCGAATAGGCGGCTGACGATAACCGCGAAGGACAGCGAGCCGATCAGGTAAGCCGATACGATGAAAAATGAAATGAACATCAGGGATTCCGCTTAAGATATAGGATTTTCGCGCTTTGACAACTACGCATGGATATTCTGTTTCTCAAGGATTTCAGAGTCGAGCTCATTATCGGTATTTACGAGTGGGAACGCAAAGTACCCCAGCCGGTATTGCTCGACCTGGAAATCGGCCTGCCCAATAGTCGTGCCGGTGAAACCGACAATGTGGCAGATACCATTGACTATGGCCAGGTTGCCGCGCGTATCAGGGCGGCCTGTGCCGCACTGCGCCCAGCCCTGGTAGAGGCGCTGGCAGAGCATGTTGCACAATTGATACGCAATGAATTTGGCGCGCCCTGGGTCAGGGTCACCGTGACCAAGCTCGCCATCGTGCGCGGCGTCAAGGCGCTGGGCATCACCATCGAGCGCGGTCAGCGCGGATGCGTAATGAGTCATATGCAGCCAGCGCGCTGAATTAGCCGCGCGGATGATGCCTGGCGTGCAGCTGTTTCAGGCGCTCGCGTGCGACGTGCGTGTAAATTTGCGTGGTCGAAATATCCGCATGGCCGAGGAGCATCTGTACCACGCGCAAATCCGCGCCGTGGTTGAGCAGATGCGTGGCGAAGGCATGACGCAAAACGTGGGGCGAGGGCAGGCGCGCCAGCCCGGCCTGCTGCGCGCGGCGCTTGATGAGATACCAGAATGCCTGGCGCGTCATGGCCGTGCCGCGCCGGGTGACAAACAGCGCATCGCTGATCGTGCCTGCCAGTATCTGCGGGCGTGCGCCAGTCAGATAGCGCGCCAGCCAGAGCAGTGCCTCTTCACCCAGCGGTACCATGCGCTCCTTGCCGCCTTTGCCCATCACGCTTAGCACGCCCATGTCCAGACTCACATTTGCCACTCTCAGTGTCACCAGTTCGGAGACGCGCAGGCCGCTGGCGTAGAGGATTTCCAGCATGGC
CTTGTCGCGTAGCCCGAGTGGCTGTGATGTGTCGGGTGCGTTCAACAGCATATCCACGTCGGCCTCCGACAAGCTCTTGGGTAATGAGCGCGGCAGCTTGGGGGTATCGATTTTCAGTGTCGGGTCCAGCACGATACGGCCATCGCGCAGCGCCAGCCGATAGAAGCGTTTCAGTGCGGAGAGCAGGCGCGCAGTGCTGCGCGGGCTGGTTTTACGCGAAAAACGGTATTGCAGATAGGCCTCGATATCCGCCTGGCCAGCGTCCAGCAACAGCGTGCTGCGCAGTGCCTCCAGCCAGGCCGAGAACTGCGTCAGGTCGCGCCGGTAGCTTTGCAGCGTGTTGGGGGAGAGTCCATCCTCCAGCCACAGCAGGTCGCAAAAGCTGTCAAGCGCCTGCTGCGATACCGGATTCATGGTTGAGTAGCCAGTCCTTGTAAGCCAGCGGCGCCCCGCTTGCGGCGTGCATGAAGCCGCCGCGCCCGTTTGCTGCCACCACCCGGTGGCAGGGGATAATGATGGGCAGCGGATTGGCGCCACAGGCCTGACCCACCGCGCGCGGACTCGAATCCAGCCAACGCGCGAGCTGCCCGTAAGTGGTGGTATGGCCGGGCGGGATCGCAGTCAGCGCGCGCCATACCCTGACTTGATGCGCTGTGCCAAGGATTGCCAGCGGCAGGTCAAAGCTGCTGCCGGGGTTATCAAAATAGTGATTCAAGGCAGCGGCGACGCGGCGTGACAGCGGTGAGTCGGGGGCTTGCAAAGGGTAATCCGCAGGCAAAAAATCGATGCCGTGAAGTTTTTCATTGGCCACACTCATGCCGACACAGCCGAATGGCGTGGGCAGCACGGCCTGATAAGGTGATTGAATCGTGCGGCTTTTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|628446_629247_-|WP_147070758.1|DBSCAN-SWA MAWFEKEINYARESLAQVSKDSIELAGDKLGSVVKEGAASVGAELRDVVVGASQEIDAKLDKISAELHNQRQFTKDDVRELVDYAADRLGAVLDERMAVAKREISALVQDKVEYFKLEVDSFFIQRQQDLSRERRRLFFNVLIAVSASILVGFVSWLYQHFVAGSIDLFGVFRIVFAALTGGYAVYLVVRLALRWRRMSEHRKDVMFLAMRYWGVLRPQSVFSTLLLICVLALLFGLVLFPEVLAQLPGGKLLLGWVHGMAPHLKP >NZ_AP021884|622875:631682|630313_631213_-|WP_147070764.1|DBSCAN-SWA MNPVSQQALDSFCDLLWLEDGLSPNTLQSYRRDLTQFSAWLEALRSTLLLDAGQADIEAYLQYRFSRKTSPRSTARLLSALKRFYRLALRDGRIVLDPTLKIDTPKLPRSLPKSLSEADVDMLLNAPDTSQPLGLRDKAMLEILYASGLRVSELVTLRVANVSLDMGVLSVMGKGGKERMVPLGEEALLWLARYLTGARPQILAGTISDALFVTRRGTAMTRQAFWYLIKRRAQQAGLARLPSPHVLRHAFATHLLNHGADLRVVQMLLGHADISTTQIYTHVARERLKQLHARHHPRG >NZ_AP021884|622875:631682|629925_630312_+|WP_147070762.1|DBSCAN-SWA MDILFLKDFRVELIIGIYEWERKVPQPVLLDLEIGLPNSRAGETDNVADTIDYGQVAARIRAACAALRPALVEALAEHVAQLIRNEFGAPWVRVTVTKLAIVRGVKALGITIERGQRGCVMSHMQPAR >NZ_AP021884|622875:631682|631184_631682_-|WP_147070766.1|DBSCAN-SWA MKSRTIQSPYQAVLPTPFGCVGMSVANEKLHGIDFLPADYPLQAPDSPLSRRVAAALNHYFDNPGSSFDLPLAILGTAHQVRVWRALTAIPPGHTTTYGQLARWLDSSPRAVGQACGANPLPIIIPCHRVVAANGRGGFMHAASGAPLAYKDWLLNHESGIAAGA >NZ_AP021884|622875:631682|622875_624762_-|WP_147070750.1|DBSCAN-SWA MANDHEKGEVKTVDVEARRTRLKNLIVLGKERGYLTYAEINDHLPDDMLDAEQIEGVISMINDMGIQVYDEAPDAETLLMSDAAPAVADEDVVAEAEAALSTVDSEFGRTTDPVRMYMREMGSVELLTREGEIEIAKRIEDGLKHMIQAISACPTTIQEILTLVDKVEREEIRIDDFVDGVVAEELEAIASEPEVDISELEEELDEDAEEDDGSALAAANLAQLKIDAMAHFEVIRKVFKRIQATLKKNGFGSPQYLQLQEELSAELMNIRFSAKQVEALCEGLRNLVEEVRSHERDIMEFAVNKSGMPRAHFIKVFPGNESNLDWVTKEINSKKAYSEDLTRYQHTIVERQQRLIALQEKVGIPIKDLKEINRQMSTGEARARRAKREMIEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNRISRQILQETGKEPDPELLAEKMEMTEEKIRKILKISKEPISMETPIGDDEDSHLGDFIEDTATLAPIDAAVYGSLQEVTKDILDGLTQREAKVLRMRFGIEMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSDRLKSFIDNSGNS >NZ_AP021884|622875:631682|627419_628457_+|WP_147070756.1|tRNA|DBSCAN-SWA 
MLILGIESSCDETGIALYDTGRGLLAHALHSQVAMHAEYGGVVPELASRDHIRRALPLTRQVLAQAGCTLADIDAIAYTEGPGLAGALLVGAGIAHALGVALGVPVLGVHHLEGHLLSALISDTPPQFPFVALLVSGGHTQLMQVDSVGRYTTLGDTLDDAAGEAFDKTAQLLGLGYPGGAALSTLAQTGDPQRFKLPRPMLHSGDLNFSFSGLKTAVLTLTQKHPGPADRADIAAAFQLAMAEVLTAKSLAALKQTRSRRLVVAGGVGANRQLREALNAGVSKLGGAVFFPRLEFCTDNGAMIAFAGAMRLVHGGRAAGVFTVRPRWDLQEIPAPHNHPGTVMA >NZ_AP021884|622875:631682|627100_627313_-|WP_124706067.1|DBSCAN-SWA MPNIRVKENEPFEVAMRRFKRTVEKTGLLTELRAREFYEKPTAERKRKLAAAVKRHFKRIRSQTLPPKMY >NZ_AP021884|622875:631682|624832_626566_-|WP_147070752.1|DBSCAN-SWA MIPQDFIQTLLNRVDIVEVIDRRVPLKKAGANYQACCPFHNEKSPSFTVSPTKQFYHCFGCGAHGSAIGFLMEYAGLGYVDAIRELAGQMGISVPEGPAANPERARQAASLVEIMQRAAQFYRQQLKQTPHAIDYLKKRGLTGEIAARFGLGYAPAGWQNLAAVFDQYADPALAEAGLVIVNDAGQRYDRFRDRIMFPIVGQRGDVIGYGGRVLDAAEPKYLNSPETPLFQKGNELYGLFQARRAIRDAGRVIVVEGYMDVVALAQHGVEYAVATLGTATTAAHVQKLLRHTDELVFCFDGDAAGRHAAWRALENSLAILSDGSRVGFLFLAPEHDPDSYIRAFGKGAFEALLGGEVVPLSAYLFKELAAQHDLASSEGRSAFLQAAHPLLSQIRAPALALLLRKRCAELANLQLSELDSLWQIKSRQFNPARAPVRAPRQAPASIWHWLLRAILFMPSLARELDASLIAPTDPDADALRAVVELLRVHPNLGTASVIDHFRDNALASILQRASAEIMGWAADLDISAEFADACYQARMQLDKQRGARVTDKPVSALSETEKAALRQLVNARQQVEK >NZ_AP021884|622875:631682|626615_627062_-|WP_147070754.1|DBSCAN-SWA MSLKARITEDMKTAMRAKDAARLGAIRLLLAAIKQREVDERIELDDAQIIAVIDKMLKQRRDSITQFEAAGRQELADIEKFESGVLQAYMPQAASAEEIDSLIIQAITNTGAAGIKDMGKVMALLKTQLAGRADMAQVSIHIKAKLAG >NZ_AP021884|622875:631682|629259_629880_-|WP_147070760.1|DBSCAN-SWA MMFISFFIVSAYLIGSLSFAVIVSRLFGLPDPRKHGSGNPGATNMLRTGRKSAALLTLVGDMAKGWLAVYLARYFGSQYGVEIAATYGAAVAVFLGHLYPLFFGFKGGKGVATALGILLAISPWLGLATLASWLVVFVLTRISSLAALTAATLAPVLGTYLIGGALPVVALCIISALIFWRHRTNIRRLLAGEEGRFAKPADTSSE |
10 | Vibrio_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
| DBSCAN-SWA_2 |
751882 : 760457
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|DBSCAN-SWA ATTACGCGCCGACTTCTGCCGTAACGGCCATCGGTGCGACACCAATCGCCGCGCGCACTTTATTTTCAATCTCGTGGGCGACCTCGGGGTGCTCGCGCAGGTATTCGCGCGCGTTGTCTTTGCCCTGGCCGATTTTTTCGCCGTTATAGGCATACCAGGCGCCGGATTTTTCCACCAGTTTGTGCTCCACGCCGAGCTCGATGATTTCCCCCTCGCGCGAGATGCCTTCGCCGTAAAGGATATCGAATTCAGCCTGCTTGAATGGCGGCGCGACCTTGTTCTTGACGACCTTGACGCGAGTCTCGGAGCCGATCACTTCGTCGCCTTTCTTGATTGCGCCGGTGCGGCGGATGTCGAGGCGCACCGAGGCGTAGAATTTGAGTGCATTGCCGCCGGTGGTGGTCTCCGGGTTGCCGAACATGACGCCGATTTTCATGCGGATCTGGTTGATGAAGATCACCAGGGTATTGGTGCGCTTGATGTTGCCGGTGAGCTTGCGCAACGCCTGGCTCATCAGGCGGGCTTGCAGGCCCATGTGCGAGTCGCCCATTTCGCCTTCGATTTCGGCCTTGGGAGTCAACGCCGCCACCGAGTCTATCACCACCACGTCCACCGAGCCGGAGCGCACCAGCATGTCGGCAATTTCCAGCGCCTGCTCACCGGTGTCGGGCTGCGAGATGAGCAGGTCGGAGACATTCACCCCGAGTTTTTGTGCGTATTGCGGGTCGAGCGCGTGCTCGGCATCAATGAACGCTGCGGTGCCGCCCAGTTTCTGCATTTCGGCGATGACCTGCAGGGTGAGCGTGGTTTTGCCGGAGGATTCCGGACCGAAGATTTCAACCACGCGGCCGCGCGGCAAACCGCCGACGCCCAGCGCAATATCTAAGCCCAGGGAGCCGGTGGAGACCACCTGAATATCGCGCACGACTTCGCCATCGCCGAGGCGCATGATGGAACCTTTGCCGAAGCTTTTTTCGATTTGTGCCAGGGCGGCGGCGAGGGCTTTGCTTCTGTTTTCGTCCATGTGTTTTTCCTCAAATTAGTGCGGGATTATGGCATAAACCGTTAGAACCCGTTAGCCACGGGCGAATGGTTTTTGTCTAGTCCAGCAACTCAATCACCCCGCGCAACGCGGCAGCCACGGCGCGCGCGCGAATTTCATCGCGGTCGCCGCATAACCTGCAGGTGGTGGCGAGGCGGGTACCGTCCTGCATCGCCCAGGCTATGCACACGGTGCCGACGGGTTTTTGCGGGGTGGCGCCGCCGGGCCCGGCAATGCCGGAGATGGCCAGGGCGATCTGCGCGCGGCTGTGGGCGAGTGCGCCCTGCGCCATTTCCAGTACGGTGGGTTCGGATACCGCGCCCGATGCTTGCAGCGTGGCGTTTTTCACCCCCAGCATGTCGTGTTTGGCGGCGTTGCTGTAGGTGATGAAGCCGCGCTCATACCAGGCCGAGCTGCCCGGCACGGCGGTGATCAGCATGCCCGCCCAGCCGCCGGTGCAGGACTCGGCGCTGGCGAGCATGATGCCGCGCCGGCTGAGGGCCTGGCCGGTTTGTTCGGCCAGTTGGTAGAGTGCTGCGTCGGTCGGTCTCATGGCAGAATTTTTTGCGCGAGGAACAGTGCCAGCAAAGTGTAGCCAGCCGCCAGCAAGTCGTCGAGCATGACGCCGAAGCCATTTTTCAGGCGCGCATCGAATTGGCGGATGGGAAACGGCTTCCAGATATCGAACAGCCGGAACAGGCCAAAGGCAGCGGCGACCCACAGCGGCGTTTGCGGGGTGGCAGCGAGCACGATCCAGAACGCGGCAATCTCATCCCAGACGATGCCGCCGTGATCCGCCACGCCCAAATCGCGTCCGGTTTTGCCGCAAATCCAGATGCCGGCGACAACCGCGATGCCGATGATGAGGTACAGCTGGGTCGGC
GTGGCGAACCAGGCCAGCAGGTAGTACAGCGGCAGCGCGGCCAGGGTGCCAAACGTGCCCGGCGCCCTGGGTGCGAGTCCGCTGCCCAGACCAAAGGCGAGAAAATAGGCCGGGTGGCGGGTGATGAAACGCCAGTCAGGCGGAAAAGTGGTCATAGCCGGTGTGGCGCATATCCAGAACTTGTCCTTGTGCATCGCGCACTATCAGGCCTGGCTCGGCGCGAATGCTGCCGATGGCAGTGAGCCGCACGCCGAGGCGGGCGGCGATTTCGCCGAGTGCCTTACGGTGTGCGACGGGTGCAGTGAAACACAGCTCGTAGTCGTCACCGCCGCTCAGTACGCAGGCATCAAATTCCGGATGTGCGGCATAGTCATGGACGATTTCACCCAATGGCAAATGCGTATATTCAACGATGGCACCTACACCGGAGCGCGCCAGAATATGCCCCAAGTCAGCCAGCAGGCCATCCGACACATCGATTGCGCTGCGCGCCAGCCCGCGCAAGGCCAGGCCCAGTTCGACACGCGGCGTGGGGGTATATAGGCGCGCTGCCAGGGTGATCAGATCGGCGTCAGTCAGATTGACCCGGCCGTGCAGGGCGGCAAGTGCCAGTGCTGCGTCACCCAGCGTGCCGGATACCCAGATTTCATCGCCCGCCTGAGCGCCGTCGCGACGCAGGGCCTGGTTGGGCGGCACCTCGCCCAGGATAGTGAGGGTGAGGCTCAATGCGCCGCGCGTGGTGTCGCCACCCACCAGGCTCACGCCAAATTGATCGGCACAGCGATACAGTCCCGTTGCGAACGCCGCCAGCCAATCATCGTCTACTTCCGGCAGTGTCAGCGCCAGCGTCGCCCAGCGCGGCGCGGCACCCATCGCGGCGAGATCGGAGAGATTGACCGCGAGGCTTTTCCAGCCGAGCTTTTCCGGGTCGGCATCGGCGAAAAAATGCACATCGGCGACCAGGGTGTCGGTGGAAACGGCGAGCTGCATCCCGGTTGCGGGTTGCAGCAGGGCGCAATCGTCGCCCACTCCCAGCACCGCGCCGGGTGTGGCGCGGGAGAAATGACGCTGAATCAGGCCGAATTCGGAAGTCATGGATGTTGCATTGCCATCCTGGCGGTTAGCTTGCGGTTTTCTGGGTCATCAACCCTGGCGGCGCGGCGCGTTCACTTCCACTGTGCGCACCTCGGCAGCCAGCTTGTCCATCACGCCGTTGACGTATTTGTGGCCATCGGTACCGCCGAATATCTTGGCGAGTTCGACCGCTTCGTTGATGACGACGCGATACGGCACTTCCAGATGATGCAGCAGCTCCTGCGTGCCCAGCAGCAGAATGGCATGCTCCACCGGGCTCAGTTCCGCAGGCTTGCGATCCAGATAGGCAGCGAGCCGCAGGTCCAGCGCCGGTGCTTCATCGACCACGCCATTCAGTAGTGCCAGGAACATTTTTTCATCAATGTTGCGATACACGGGATCGTCGCGCAGCTGTTTGACGATATCGGCAGTCGGCTGATGGTTGAGCAGCCACTGATACACGCCCTGCACTGCGAATTCGCGGGCCTTGCGACGATTGCCGCTCATAGCTGTTTGAGCAGGTTGGCCATTTCGATCGCGCATTCGCCCGCTTCGGCGCCTTTTACCGACATGCGCGAGGTGGCCTGGTGATCGGTATCGGTGGTCAATACGCCATTGGCAATAGGCACGCCGGTATCGAGCTGGATACGCGCGATACCGTTGGCCATTTCGTTGGCAACCACCTCGAAATGATAGGTATCGCCGCGCACCACGGCGCCCAGCGCCACCAGCGCGTCGAATTTTCCGCTCATGGCCATTTTGCGCAGCGCCAGCGGAATTTCCAGCGCACCCGGCACGGTGGCGAGGAGCAGGTTGCCGGTTTTCACGCCGCGTTTGCCAAGTGCGGTGGTGCAAGCCGCCAGCAAACCCTCGCAAATGTCCATGTTGAAGCGGCTCATGACGATGCCGATACGCAATGCGCTGCC
GTCGAGACTGGATTCAAGTTCGGGAATATCGTCGTAGCTTGCCATGATTTATTTCTCCTGATGAGACTGGATTGCATGTATTGCCTGTGCAAGGCGGGGGCGGGTTCAGGTGTTCTCGTCGTAGCCTGTCACTTCCAGATCGAATCCCGCCAGCGACGGCATTTTGCGCTGAGTAGCCAGCAGGCGCATTTTGCCCACGCCGACGTCTTTCAAAATCTGCGCGCCAATGCCATGGTTGCGCGCGTCCCATTTTTGCGGAAGCCTGACACCGGCTTCAGGCATAGCGCGGGTGAGCAGTTCTGCTGCGCTCTCCGGACGGTGCAGCAGGACGACAACGCCCTTGCCCACCGCAGCGATTTTTGCCAGGGCCTGGTTGACACTGTAGGCATGGGTACGGCTGCCGACTTCGAGCATGTCCATCACCGACACTGGCTCGTGTACCCGCACCAGCGTTTCGCTGGCGGCGCTGATTTCGCCCTTGACCAGGGCCAGATGAGCTGCCCCGGAGATTTTTTCACGGTAGGCGATGAGCTGGAATTCGCCGTAAACCGTCTCGATACAGCGGCTGCCTGCACGCTCCACCAGGCTTTCATTGTGGCTGCGGTAGTGGATCAGGTCGACGATTGCGCCAATTTTCAGGCCGTGAATTTTGGCGTATTCCAGCAAATCCGGCAGACGCGCCATGGTACCGTCATCCTTGAGGATTTCGCAAATCACTGCGGCAGGTTCCAGGCCGGCCAGCCCGGCCAGATCGCAGCCTGCCTCGGTGTGGCCGGCACGAATCAGCACGCCGCCGGGTTGGGCGCGCAGCGGAAAAATGTGGCCCGGCTGGATGATGTCGGCGGCCTTGGCGTGTTTGGCGACGGCGGCCTGAATGGTAAGCGCCCGGTCGGCGGCGGAAATGCCGGTGGTAACCCCTGTGGCGGCTTCGATGGAGACAGTGAAAGCAGTGCTATAGGGCGTCTGGTTATCGGCCACCATCTGGCGCAAGCCCAGTTGCTTGCAGCGCTCGTCGGTCAGCGTCAGGCAAATCAGGCCGCGTCCGTGCTTGGCCATGAAGTTGATCGCTTCGGGGGTGGCGAATTCGGCCGCCATCACCAGATCGCCCTCGTTTTCGCGGTCTTCCTCGTCCACCAGTACGACCATTTTTCCGGCTTTCAGGTCGGCGATGATGTCTTCGATAGGGCTCAGGCTCATGTTTGATCTCGATAATTAAGGATGCGTTCGGCGTAGCGCGCCATCATGTCCACTTCCAGATTGACCCGGCTGGCAGGTTGGAGCGTATGCAGATTGGTATGTTCCAGCGTATGCGGAATAAGGTTGATGCTGAAACGGTCGCCGTCTACGCGATTAACGGTGAGGCTCACACCGTTGACGGTGATGGAACCTTTGCTGACGACAAAGCGGGCCAGATCGCCGGGAGCTCGGATGACCAGTTCGAAGCAATCCCCGGCCGGCGCGAAATGCAGCACCTCACCCACCCCGTCCACATGGCCGGAAACCAGGTGTCCGCCGAGACGGTCGGACAGGCGCAGCGCCTTTTCCAGATTGACCAGGCCGTGTTCAGGAAAGCCCGTCGTACAGCGGAAAGTCTCTGCCGATACGTCTACCGAGAAGCTCGCCGCACCCAGCGCCACGACGGTCAGGCACACACCGTTGCAGGCGATGGAGTCACCTGGCGCCACGTCGCTCAAATCCAGATGGGCGGCGTCGATCACCAGGCGTGCATCCGCGTTTCTGGGTTCCACTGCCGCCACCTTGCCTACTGCCTGAATAATGCCTGTAAACATGTTAGTTCTTTTCAAATTGTGCAGTGATGCGTATATCGGCACCGACCTGACGTATGTCGCGCAGCACCAGTTTGCGGCGCTCTTGCATCTGCGCCGGTTCGGCCAGGGAAAACAGGCCACGCGCCGTGTCACCCAGCAATACCGGGGCGACATACATCACCCATTCATCCACAAATCCAGCCGCGATCAGCGCGCCGTTGAGTGTGGCAC
CGGCTTCGGTCATCACTTCATTTATCCCGCGCTGCGCCAGGAGTGACAATAGCGCCCCCAGATCAACCTGCCCGGCATCACCCGGCAGACAGCGGATTTCTGCCCCGGCGGCTTCCAGTCCGGCGCTGCGCCGGGCGTCCGGTTCGGCACAGGCAATCAGGGTCGGGGCACCGCCCAGTATATTCGCCGAGGGCGGGGTTTGCAGGCGCGAATCGACGATGACCTTGAGCGGCTGGCGCGTGGTTTCCACTGCGCGCACATTCAGTTCCGGATTATCCGCCAGCACCGTACCGATACCGGTCAGAATGGCGCATGAACGCGCACGCAGCCGGTGCACGTCGCGGCGCGCAGGCTCTCCGGTGATCCATTTGCTGGCACCGCCGGATAAGGCCGTTTTGCCATCCAGCGAACTGGCGGTCTTGATGCGCAGCCACGGATGCCCCATGACCATGCGCTTGATAAAGCCCGCGTTGAGTTCACGCGCCTGCGCTTCGAGCAGGCCGCATTCAGTCGCAATCCCGGCCTTTTGCAGCAGGGCCAGACCATTGCCCGCCACCTGGGGGTTGGGATCCTGCATGGCGGCCACGACGCGTGCCACGCCTGCGGCTATCAACGCCTCGGCGCAGGGCGGGGTGCGCCCATGGTGGCTGCACGGCTCAAGCGTGACGTACACCGTGGCACCGCGCGCTGCGTCGCCGGCCTCGCGCAGCGCGTGGATTTCGGCATGCGGCTGTCCGGCCTGCTGGTGCCAGCCGCTGCCGACCATCGCTCCATTCCTGACAATCACACAGCCCACGCGCGGATTCGGGCTGGTAGTGTACAGGCCGTGCTCCGCAAGCTGCAGCGCGCGCGCCATATGGATGTAATCTGTCTGCGAAAACACAGGTTTATTTGTCGAAGTCCTTGAGCACGTCGCGGAAGTCGCCCACATCCTGGAAGCTCTTGTACACTGAGGCAAAGCGGATGTAAGCGATCTTGTCCAGGCGCTTCAACTCGTTCATCACCATCTCGCCAATCTGGCGCGCGGGCAATTCACGTTCGCCCAGCGACAGCACTTGTTTGACGATGCGCCCAATCGCCGCATCCACATATTCGGTCGGTACCGGGCGCTTGTGCAGCGCGCGGCGAAAGCCCTCGTGCAGTTTTTCCTGGCTGAATTCCTGGCGCACGCCGTTGCTTTTAATCACCTGCGGCAGGCGCAGTTCTATGGTTTCGTAGGTGGTAAAGCGCTTGTCGCAAGACGTGCAGCGGCGCCGGCGACGAATCGAGTCGCCGGCTTCGGAAAGGCGCGAATCAACGACCTGGGAGTCGAACGCGCTGCAGAACGGACATTTCATGAAGACGGTTTGTAGCGGGTAAGGTAATAGTCAGCAGCAGGCAGCCTGGTTGGGCGGCACGCTACCCGTTCATCCGTACACCGGATATTTCTTGCACAAGGCTTGCGCGGCAGTGGCCGCGCGCCCAATGACCGCCTCGTCATTGGGCGCGTCGAGCACATCGGCAATCAGGTGCGCCAGTTGCTCGGCTTCCAGTTCCTTGAAACCGCGTGTGGTCATTGCCGGCGTGCCGATGCGGATGCCGGAGGTGACGAAGGGTTTTTGCGGATCGTTGGGGATGGCGTTTTTGTTGACCGTGATGTGCGCCCGTCCGAGCGCGGCTTCTGCCTCCTTGCCGGTAATGCTTTTGGCCTGCAAGTCCACCAGAAACAGATGCGAATCGGTGCGGCCGGAGACAATGCGCAGGCCGCGTTCCTGCAGCACCTTCGCCATCACGCGGGCGTTATCGATCACCTGCTCCTGGTACAACTTGAAGTCCTTGCCCATTGCCTCCTGAAACGCCACTGCCTTGGCGGCGATGACGTGCATCAGCGGACCGCCCTGCAAGCCCGGGAAGATGGCGGAATTGATCGCCTTTTCGTGTTCGGCTTTCATCAGGATAATGCCGCCTCTGGGACCGCGCAGCGTCTTGTGCGTGGTGGAGGTTACCACATCCGCATGCGG
TACCGGATTGGGATACACCCCCGCCGCGATCAGTCCGGCGTAGTGCGCCATATCCACCATGAAAATCGCGCCGACTTCCCTGGCTATTTTGGCAAAGCGCTCGAAGTCGATGTGCAGCGAATACGCCGAGGCGCCAGCGATGATGAGTCTGGGCTTATGCTCACGGGCAAGCGCTTCCATACGCGGGTAATCGATTTCTTCTTTTTTATTCAGGCCGTAGGCCACGGCGTTGAACCACTTGCCCGACATGTTGAGCGCCATGCCGTGGGTGAGGTGTCCGCCTTCAGCCAGGCTCATGCCCATGATGGTATCGCCCGGCTTGAGGAAGGCCAGGAATACCGCCTGGTTGGCCTGCGAGCCAGAATGCGGCTGCACGTTGGCGGCTTCCGCACCGAATAATTTCCTGATACGGTCAATTGCCAGTTGTTCGGCGATATCCACATATTCGCAGCCGCCGTAGTAGCGCTTGCCGGGATAGCCTTCCGCGTATTTGTTGGTCAGCACCGAACCCTGGGCTTCCATTACCGCCGGACTGGCATAATTTTCCGAAGCGATCAGCTCGATGTGATCTTCCTGGCGGCCGCGTTCTGCCTCCATGGCTTTCCAGAGAGCGGGATCGGTTTGGGCGAGAGTGTGCTGAGGGTTAAACAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|751882_752905_-|WP_147074510.1|DBSCAN-SWA MDENRSKALAAALAQIEKSFGKGSIMRLGDGEVVRDIQVVSTGSLGLDIALGVGGLPRGRVVEIFGPESSGKTTLTLQVIAEMQKLGGTAAFIDAEHALDPQYAQKLGVNVSDLLISQPDTGEQALEIADMLVRSGSVDVVVIDSVAALTPKAEIEGEMGDSHMGLQARLMSQALRKLTGNIKRTNTLVIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKKGDEVIGSETRVKVVKNKVAPPFKQAEFDILYGEGISREGEIIELGVEHKLVEKSGAWYAYNGEKIGQGKDNAREYLREHPEVAHEIENKVRAAIGVAPMAVTAEVGA >NZ_AP021884|751882:760457|756999_757596_-|WP_147074503.1|DBSCAN-SWA MFTGIIQAVGKVAAVEPRNADARLVIDAAHLDLSDVAPGDSIACNGVCLTVVALGAASFSVDVSAETFRCTTGFPEHGLVNLEKALRLSDRLGGHLVSGHVDGVGEVLHFAPAGDCFELVIRAPGDLARFVVSKGSITVNGVSLTVNRVDGDRFSINLIPHTLEHTNLHTLQPASRVNLEVDMMARYAERILNYRDQT >NZ_AP021884|751882:760457|754949_755387_-|WP_147074506.1|DBSCAN-SWA MSGNRRKAREFAVQGVYQWLLNHQPTADIVKQLRDDPVYRNIDEKMFLALLNGVVDEAPALDLRLAAYLDRKPAELSPVEHAILLLGTQELLHHLEVPYRVVINEAVELAKIFGGTDGHKYVNGVMDKLAAEVRTVEVNAPRRQG >NZ_AP021884|751882:760457|752981_753476_-|WP_147074509.1|DBSCAN-SWA MRPTDAALYQLAEQTGQALSRRGIMLASAESCTGGWAGMLITAVPGSSAWYERGFITYSNAAKHDMLGVKNATLQASGAVSEPTVLEMAQGALAHSRAQIALAISGIAGPGGATPQKPVGTVCIAWAMQDGTRLATTCRLCGDRDEIRARAVAAALRGVIELLD >NZ_AP021884|751882:760457|753941_754901_-|WP_147074507.1|DBSCAN-SWA MTSEFGLIQRHFSRATPGAVLGVGDDCALLQPATGMQLAVSTDTLVADVHFFADADPEKLGWKSLAVNLSDLAAMGAAPRWATLALTLPEVDDDWLAAFATGLYRCADQFGVSLVGGDTTRGALSLTLTILGEVPPNQALRRDGAQAGDEIWVSGTLGDAALALAALHGRVNLTDADLITLAARLYTPTPRVELGLALRGLARSAIDVSDGLLADLGHILARSGVGAIVEYTHLPLGEIVHDYAAHPEFDACVLSGGDDYELCFTAPVAHRKALGEIAARLGVRLTAIGSIRAEPGLIVRDAQGQVLDMRHTGYDHFSA >NZ_AP021884|751882:760457|758693_759143_-|WP_147074501.1|DBSCAN-SWA MKCPFCSAFDSQVVDSRLSEAGDSIRRRRRCTSCDKRFTTYETIELRLPQVIKSNGVRQEFSQEKLHEGFRRALHKRPVPTEYVDAAIGRIVKQVLSLGERELPARQIGEMVMNELKRLDKIAYIRFASVYKSFQDVGDFRDVLKDFDK >NZ_AP021884|751882:760457|755911_757003_-|WP_147074504.1|DBSCAN-SWA 
MSLSPIEDIIADLKAGKMVVLVDEEDRENEGDLVMAAEFATPEAINFMAKHGRGLICLTLTDERCKQLGLRQMVADNQTPYSTAFTVSIEAATGVTTGISAADRALTIQAAVAKHAKAADIIQPGHIFPLRAQPGGVLIRAGHTEAGCDLAGLAGLEPAAVICEILKDDGTMARLPDLLEYAKIHGLKIGAIVDLIHYRSHNESLVERAGSRCIETVYGEFQLIAYREKISGAAHLALVKGEISAASETLVRVHEPVSVMDMLEVGSRTHAYSVNQALAKIAAVGKGVVVLLHRPESAAELLTRAMPEAGVRLPQKWDARNHGIGAQILKDVGVGKMRLLATQRKMPSLAGFDLEVTGYDENT >NZ_AP021884|751882:760457|755383_755851_-|WP_147074505.1|DBSCAN-SWA MASYDDIPELESSLDGSALRIGIVMSRFNMDICEGLLAACTTALGKRGVKTGNLLLATVPGALEIPLALRKMAMSGKFDALVALGAVVRGDTYHFEVVANEMANGIARIQLDTGVPIANGVLTTDTDHQATSRMSVKGAEAGECAIEMANLLKQL >NZ_AP021884|751882:760457|753472_753961_-|WP_147074508.1|DBSCAN-SWA MTTFPPDWRFITRHPAYFLAFGLGSGLAPRAPGTFGTLAALPLYYLLAWFATPTQLYLIIGIAVVAGIWICGKTGRDLGVADHGGIVWDEIAAFWIVLAATPQTPLWVAAAFGLFRLFDIWKPFPIRQFDARLKNGFGVMLDDLLAAGYTLLALFLAQKILP >NZ_AP021884|751882:760457|759212_760457_-|WP_147074500.1|DBSCAN-SWA MFNPQHTLAQTDPALWKAMEAERGRQEDHIELIASENYASPAVMEAQGSVLTNKYAEGYPGKRYYGGCEYVDIAEQLAIDRIRKLFGAEAANVQPHSGSQANQAVFLAFLKPGDTIMGMSLAEGGHLTHGMALNMSGKWFNAVAYGLNKKEEIDYPRMEALAREHKPRLIIAGASAYSLHIDFERFAKIAREVGAIFMVDMAHYAGLIAAGVYPNPVPHADVVTSTTHKTLRGPRGGIILMKAEHEKAINSAIFPGLQGGPLMHVIAAKAVAFQEAMGKDFKLYQEQVIDNARVMAKVLQERGLRIVSGRTDSHLFLVDLQAKSITGKEAEAALGRAHITVNKNAIPNDPQKPFVTSGIRIGTPAMTTRGFKELEAEQLAHLIADVLDAPNDEAVIGRAATAAQALCKKYPVYG >NZ_AP021884|751882:760457|757597_758737_-|WP_147074502.1|DBSCAN-SWA MWATSATCSRTSTNKPVFSQTDYIHMARALQLAEHGLYTTSPNPRVGCVIVRNGAMVGSGWHQQAGQPHAEIHALREAGDAARGATVYVTLEPCSHHGRTPPCAEALIAAGVARVVAAMQDPNPQVAGNGLALLQKAGIATECGLLEAQARELNAGFIKRMVMGHPWLRIKTASSLDGKTALSGGASKWITGEPARRDVHRLRARSCAILTGIGTVLADNPELNVRAVETTRQPLKVIVDSRLQTPPSANILGGAPTLIACAEPDARRSAGLEAAGAEIRCLPGDAGQVDLGALLSLLAQRGINEVMTEAGATLNGALIAAGFVDEWVMYVAPVLLGDTARGLFSLAEPAQMQERRKLVLRDIRQVGADIRITAQFEKN |
11 | Staphylococcus_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_3 |
882982 : 892137
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|DBSCAN-SWA CCTAGGCGGGCATCAGCACGGTCAGCCCCCCCATGTAGGGACGCAACACTTCGGGAAGCGCCACCGATCCATCCGCCTGCTGATGATTTTCCAGAATCGCGACCAGGGTGCGCCCTACCGCCAGCCCGGAGCCGTTCACGCTGTGCAGCAGTTCCGGCTTGCCTTTTTCACCTCTGAAGCGCGCTTGCATGCGGCGTGTCTGAAACGCCTCGAAATTGCTGCACGAGGAAATTTCGCGATAAGTATTTTGCGCCGGCAGCCACACTTCCAGATCGTAGGTCTTGGCGGCAGAAAAACCCATGTCGCCGCCGCACAGCGCCATTTTCCGGTAGGGCAGCCCGAGTGCTTGCAGGATGGCCTCGGCATGGCCGGTGAGTGCTTCCAGCGCGGTGTAGGATTGCTCCGGTTCGACCAGTTGCACCAGCTCCACCTTGTCGAACTGATGCTGGCGGATCATGCCGCGGGTGTCGCGGCCGTAGGAACCGGCCTCGGAGCGGAAGCAGGGGGTGTGGGCGACGAATTTCAGCGGCAGTTGCTCGCGCGCCACGATCGCGTCGCGCACCATGTTGGTCAGCGGCACTTCGGCGGTGGGGATGAGGTAGAGTTTTTCCGCATCCGCACGCGGCACGTGAAACAGATCTTCCTCGAACTTGGGCAACTGCCCGGTACCGCGCATGGAGTCGGCGTTGACCAGATACGGCACATACACTTCGGTGTAGCCATGCACAGCCGTGTGGGTGTCCAGCATGAACTGCGCCAGCGCCCGGTGCAGCCGCGCCAGCCCGCCGCGCAGCAGTGAAAAGCGCGCGCCGGCCAGTTTGCTGGCGGTCTCGAAATCCAGCCCCAGCGCGGTACCTACGTCCACGTGATCTTTCACCGCAAAATCAAACACGCGCGGTGTGCCGACACGTGCGATCTCTACGTTGTCGGCGTCGGATTTACCCGTCGGCACCGATGCATGCGGCAGGTTGGGAATGGTCATCAACAGCGCATTGAGCCGGGCTTGCAGGGCTTCCAGCGCGGATTCCGCGGCTTTCAGTTCGGCGCCGAGATTCGCCACTTCCGCCATGATGGTGGAGACATCCTCGCCCTTGGCCTTGGCCATGCCTATCTGTCTGGAGCTGGCGTTGCGCCTGGCCTGCAGCTCCTGGGTGCGGGTTTGCAGTTGTTTGCGCTCGGCTTCCAGGCGCTGGAATTCGGCAGTGTCCAGGGTGTAGCCGCGCATGGCAAGGCGTTGCGCCACGTCGTCGAGGTCGTTGCGGAGGTGTTGAATGTCTAACATTATTTTTGCCCTGTTTTCTTGTTGGCTTGTTCATCCAGCTTGCGCAAATACGCCAGCCGTTCGGCGATCTTGCCTTCCAGCCCGCGCGGGGTTGGTGCGTAAAAGTGCGCGTTGACACCTTCCGGTAAATAGTCCTCCCCGGCGGCGTAGGCGTCCGGTTCGTCGTGCGCGTAGCGGTAGGCGTGGCCGTAGCCCAGTTGTTTCATCAGTTTGGTGGGGGCGTTGCGCAGGTGCACCGGCACCTCGCGCGATTTGTCCGCCGCCACAAAGCTGCGGGCGTTATTGTACGCCACGTACACGGCGTTGCTCTTGGGCGCGCAGGCGAGATAAATCACCGCCTGCGCCAGCGCCAGTTCGCCTTCCGGACTGCCCAGGCGCTGGTAGGTTTCCACCGCGTCCAGGGTCAGGCGCAGCGCGCGTGGGTCGGCCAGACCGATGTCCTCGCTGGCCATGCGGATCAGGCGGCGGCCGACGTACAGCGGATCGGCACCGCCGTCGAGCATGCGCACCATCCAGTACAGCGCGGCGTCGGGATGGGAACCGCGCACCGATTTGTGCAGCGCGGATATCTGGTCGTAGAAGTTGTCGCCGCCCTTGTCAAAACGGCGTGCGCCGCGCGCCAGCGTGGTCTGGATGAAATCCT
CGTCAATCTCATGACGGGCGGCATCGAGTGCGGCATTGGCGGCTTGTTCCAGCAGGTTCAACAGGCGCCGCGCGTCGCCGTCGGCGTAGCCGGTGAGCTGGGCGCGCGCCGCCCCGGTGATGGCAATGTCCGGATAGGTGCTGATCCGCGCGCGCTCCAGCAGGGCGGCGAGGTCGGTTTCCACGATGGGCTTTAACACATACACCTGGGCGCGCGAGAGCAGCGCGCTATTGACCTCGAACGAGGGATTCTCGGTGGTGGCGCCGATGAAAGTAATCAACCCGGCTTCAACGAACGGCAGGAAAGCGTCCTGCTGGGATTTGTTGAAGCGGTGCACTTCGTCCACAAACAGCAGGGTGCGGCGCCCCTGGCCTTGCATCATTTCGGCGCGCGCCACCGCCTCGCGGATTTCCTTGACCCCCGAGAGCACGGCGGACAGCGCGATGAACTCCATGTCAAAACCGTGGCTCATCAGCCGTGCCAGTGTGGTCTTGCCTACGCCCGGCGGCCCCCACAGGATCATCGAGTGTGGCTTGCCGGATTCGAATGCGACGCGCAGCGGCTTGCCGGGGCCGAGCAAATGCGTCTGTCCGATCACTTCGTCCAGATTGCGCGGCCGCAGCCGTTCGGCCAGCGGCGCGCTGTCCAGCGGATGATCGAACAGGTCGGCGTGGCTCACTTGCCGTCGGAGATGACGTCCACGCCCTGGGGCGGGGTGAAATGGAAATCGCTGGCGGGAAGGGCCGGGTTACGTTCCAGCCCGGCGAATTTCAGCACCGTGGTCTGGCCGAAGTTGTCCTTGATCTCCATCGCCACCAGCGTGTTTTTGCTGAATCCCATGCGCACGTTCTCAAACGCGCTTTCCTTGTCGCGCGGGCGCGCGTCCAGCCATTCCAGACCGTCGCGGCTGCCGGCGTCCGTGATCGTGTAAAACCTGCCGATGTCCTTGCTGCCCGCCAGCAAGGCCGCCGGGCTGCTGCCCAGCGCCTGGCCCAGTTTCTTGATGGTGACCTGTTGCAGATCGGCGTCGTACAGCCAGATTCTTTTGCCGTCGCCGACGATTATCTGTTCATAGGGCTTCTCATACACCCAGCGGAACTTGCCGGGACGCGCGAAAGCCATGGTGCCGGACGACTGCTGGCGCGCGTGTCCGTTTTTGTCCAGCACGGTCTGGGTGAACGTGGCGCGCGCGGTCTGGGTATCCGCGACGAACGCCTTGAGCGCGTCGATGCTGGATGCTGCGGCGCTGGCAGAGAAGATCAAGAGTGCGGTAAAGAGACTGAGTTTTTTCATTGGGACTTTAAGTCGAATGGACAGGATTTTTAATGCTTAACAAGAGAGTTGGTCTGGTTTTAGTGCAGAAAATTCAACACCCCCCAAAATTCATATTTATTCTGAATGATCCGGTCCTTATCCTGTTAATCCTGTCTAATCACATTGTTCATTCCCGGTTAGGTGCAATCACTTCGCGGTTGCCGTTGCTCTGCATCGGCGTCACCAGACCCGCCTGTTCCATTGCCTCGATCAGCCGCGCGGCACGGTTGTAGCCGATGCGCAGGTGACGCTGCACGGCGGAGATGGAGGGGCGCCGGGTTTTCAGCACGATGGCGACGGCTTCGTCATAGAGCGGGTCGCTTTCGGCGTCACTGCCGCCGACCGCGCTTTCGCCGCTGCCTTCATTGTCTTCCGGGGTGTCGAGGATGCCGTCAATGTAATCGGGCTCGCCCAGCTGTTTCAGGTATTCCACGACTTTATGCACTTCTTCGTCGGCCACAAAAGCGCCGTGCACGCGCTGCGGATAGCCGGTGCCGGGCGGCAGGTAGAGCATGTCGCCCTGGCCGAGCAGGGCTTCTGCGCCCATCTGGTCGAGGATGGTGCGCGAGTCGATTTTGCTCGATACCTGGAACGCGACGCGGGTGGGGATGTTGGCCTTGATCAGGCCGGTAATCACGTCCACCGACGGGCGCTGCGTGGCCAGGATCAGATGCACCCCGGCGGCG
CGGGCCTTCTGCGCCAGCCGCGCAATGAGCTGTTCGACGGCCTTGCCTACCACCATCATCATGTCAGCCAGCTCGTCGATCACCACCACAATCATGGGCTGCTCTTCCAGCGGCTCGGGATTATCCGGGGTCAGGCTGAACGGGTGGGTGAGCGGGGTGGCGGCCCTTTTCGCGTCGCGCACTTTCTGGTTGTAACCAGCCAGGTTGCGCACGCCGAGTGCGGACATCAGCTTGTAGCGGCGCTCCATCTCGGCCACGCACCAGTTGAGCGCGGCGGCGGCCTGACGCATGTCGGTGACTACCGGGGCGAGCAGATGCGGAATGCCCTCGTATACCGACAGCTCGAGCATTTTCGGGTCGACCAGAATCAGTCGCACACGGCTGGCGTCCGCTTTGTACAACAGGGAAAGGATCATCGCGTTGATGGCAACTGATTTGCCCGAGCCGGTGGTGCCCGCCACCAGTACGTGCGGCATTTTGGCCAGATCGGCCACCACCGGGTTGCCGGCTATGTCCTTGCCCATCGCCATTGCCAGCGGGCTGGCCATGGCGTGGTACACCTTGGCGGAAAGGATTTCCGATAACCGCACGATCTCGCGCTTGGGGTTGGGGATTTCCAGCGCCATGGTGGTCTTGCCGGGGATGGTTTCCACCACGCGGATGCTCACCACCGACAGCGCGCGCGCCAGGTCTTTGGCCAGGTTGACGATCTGGCTGCCCTTGACCCCGGAGGCCGGCTCGATCTCGTAGCGGGTGATGACCGGGCCGGGCAGAGCGGCGACCACGCGCGCCTGCACGTTGAATTCGGCCAGCTTGCGCTCGATCAGGCGCGAGGTGAATCCCAGGGTTTCCGCCGACAGGCTCTCCACATGGGCAGTCGCAGCATCCAGCAAATGCAGTGGCGGCAGCGGGGAATCCGGCATTTCGGCAAACAGCGGCACCTGCTTTTCCACCTGCACGCGTTCCGATTTGATAATGGTGGGCGGCGGGGTTTCGATGACCACCGGAGCGCGGTCCATATTGCGGCGCTTTTCTTCCAGTACAATCTCGTCGCGCCGCAGCGTGGCGGCTTGACCCATGCGCCGTTCGCGCCGCGCGGACCAGGTCTCGATTGCCCATAACCAGCTGCGTTCGAGGTATTCCCCGGTGGATTCCACTAGCGCCAGCCAGGAGATGCCGGTAAACAGGCTCAAGCCCGCGGCAATCAGCACCAGCAAGGCCAGCGTGCCGCCGGTAAAGCCCAGCAAATGGGTGACTGCTTCTCCCACAACCGCTCCCAGAATGCCTCCCGGGTGCTCCGGTAGTGCTATGTGCAAGCTATACAGACGCTGCGATTCCAGCCCGGCGCTGGACAGCAGCAGCAACAAAAACCCGGCACTGCGGATATATAACGGGCGGCGGTCGGGCGCCTCGCTAATCTCCAGTTTGCGATATCCCCACCAGGTGGCATACAGCGCCAGCATCACCAGCCACCAGGTGGAGAGCCCGAACAGTGCAAGCAGAATGTCGGCCAGATACGCCCCCAGCGTGCCGCCCATATTGTGCACACTGCCTTGTCCGCTGTGCGACCAACCCGGATCAGACTGGGTGTACGTGAACAGGATGACCGCCAGGAACGCCGCCAGCGCCACGCTCAGCAGCCAGCCGACCTCGCGCAGCAGGCCGGTGAGCCTGGGCGGGAGCGGTTTGCGGACGGATTTGGGCGTATTGCGCCGGGGAATGGACATGGTCAGCAATTTAATCGGAAGGCAAAATTATAACTTGAACCTTGTCCAGTCCGTGACCATCTTCCATACTGTTCCAGTTTTCCAGTTTTCAATCCCGCTACGGAGTACTACATGACCACCCCTCAACATCACCGTCTCATCATCCTCGGCTCCGGCCCCGCAGGCTATTCCGCCGCCGTTTACGCTGCGCGCGCCAACCTGAATCCGGTCGTCATTACCGGCATGGCGCAAGGCGGTCAGCTGATGACCACCACCGATGTGGACAACTGGCCT
GCCGACGCCGACGGCGTGCAGGGGCCGGAGTTGATGGCGCGTTTCGAGAAGCACGCGCGCCGCTTCAACACCGAAATTATTTTCGACCACATCCACACTGCCAAACTGACCGACAAGCCCATCGCGCTGGTCGGCGACCAGGGTAGCTACACCTGCGATGCGCTGATTATCGCCACCGGCGCGTCGGCCATGTATCTGGGGCTGGAATCCGAGCAGGCGTTCATGGGCAAGGGCGTATCCGGCTGCGCCACCTGCGATGGATTTTTCTACCGCAATCAGGACGTGGCGGTGATCGGCGGTGGCAACACTGCCGTGGAAGAAGCGCTGTATCTGTCCAACATTGCGCGTCATGTCACCGTGGTGCATCGCCGCGACAAGTTCAAGTCGGAAAAAATTCTCGCCGATCATCTGATGGAGAAGGTCAAGGAAGGCAAGATCAGCGTGGAGTGGAACAGCGAACTGGACGAGGTGCTGGGCGACAAGACGGGTGTGACCGGCATGCGCATCAAGTCCACCGTGGATGGCAGCACCAGGGATATCGCCCTGACCGGCGTGTTCATCGCCATCGGCCACAAGCCCAACACCGATATTTTCACTGGCCAGATCGCGATGGAAGGCGGCTATATCGTCACCCAGGGGGGCAACAAGGGCAATGCCACCGCCACCAGTGTTCCCGGCGTGTTTGCCGCGGGTGACGTGCAGGATCACATCTACCGCCAGGCGGTGACCAGCGCCGGTACCGGCTGCATGGCCGCGCTGGACGCCGACCGCTATCTGGAAAGCCTCGGCAAGTAATCTCCCCATGGCCGGGCGTGTGCCTCCAGCGGACGAAGCCGCACTGTTCAAAGCGGCGGTGCAGGACGCCCAGCCGCTACCCGACCACGGCAAGGTGGAACCGCCCTTGCCGCGCGTTTCCCCTATCCCGCGCCAGCGTATTCGCGATGAGCGTCAGGTCTTGGCCGACAGCCTGTCTGACCACATCGTGTGGGAGGATACCATGGAAACCGGCGAGGAGCTGGTGTTCCTGCGCACTGGCTTGCGCCGCGACACGCTCAAAAAACTGCGGCGCGGGCACTGGGTGCTGCAGGCCGAACTGGATTTGCATGGCCTGGTGAGCGTGGAAGCGCGCCAGGCGCTGAGCGCGTTTATCGCCGGCTGCGGCAAGCGCGGCCTGCGTTGCGTGCGCATCATCCACGGCAAAGGGCTGCGTTCCAAAAACCGCGAGCCGGTGTTGCGCACCAAGGTGAAAAACTGGCTGATGCAAAAAGACGAAGTGCTGGCGTTTTGCCAGGCGCGTGCGGTGGACGGCGGCAGCGGCGCGGTGGTAGTGTTACTCAAGTCTTCATGAAAACTTTTTGGAGGAATGCCATGACCGCAATTACCGAATTCGAACTTCCTTCCACCGGCAACCGAACCTTCAAACTCACCGACATGCGCGGCAAGAAGCTGGTGGTGTACTTCTATCCCAAGGACGACACGCCGGGCTGTACCGTGGAGGGCTCCGACTTCCGCGATTTGTATGCCGGGTTTCAGGCGCACAATTGCGAGATCGTGGGTATTTCGCGCGACGATATGAAATCCCACGAGAAATTCAAGACCAAGCTCAGCCTGCCGTTCGAGCTGTTGTCCGACGCAGACGAAAAAGTGTGCGAACTGTTCGGCGTGATAAAGCTGAAGAACATGTACGGCAAGGAAGTCCGTGGCATTGATCGCAGCACTTTTGTGTTCGACAGCGACGGCAAGCTGGTCAAGGAATGGCGCGGCGTGAAATCCGCCGGCCACGCGCAGGAAGTGCTGGATACCATTAAAACGCTCCAGGAGAAAATCTGAATGCCCCGCAAAGCCGCTGCGCCCACCAAGCTGTTTGTTCTTGATACCAACGTGCTGATGCACGACCCCACCAGCTTGTTCCGTTTTGAGGAGCACGACATATTTCTGCCGATGGGCACGCTGGAGGAACTCGATCACAACAAGAAAGGTATGACCGAAGTGGCGCGCAA
TGCGCGTCAGGCCAGCCGCTTCCTGGACGAAATCGTATCGGGTTGCGAAGATGCAATCGAAGCCGGGATTCCGCTCAGCAGCCATAGCCGCAAGGCAGCGACCGGACGGCTGTTTCTGCAGACCAGAATGGCGCGCATTGAAACCCCGCTCAGCCTGCCCAACAGCAAGGTGGACAACCAGATTCTCGGCGTGATTCTGAGCCTGCGCGAAGAGCAGCCCAGGCGCCCGATAATCCTGGTGTCCAAAGACATCAACATGCGCATCAAGGCGCGTGCACTGGGCTTACCCGCCGAGGACTACTTCAACGACAAGGTACTCGAAGACACCGACCTGCTCTATGCCGGGGTGCGTGAATTGCCGGAGGATTTCTGGGACAGGCACGGCAAGGGTATCGAGTCCTGGCAGGCTGATGGCCACACCTGGTATCGCGTCAAAGGCCCGCTGGTGACCCGCCTGCTGGTCAACGAATTCGTCTATCAGGAGAGCGCCAGCCCGCTCTACGCCATCGTCAAAACCATCAAGGGCAATGTCGCCGAGCTGCAGACCATCAAGGACTACAGCCACCAGAAAAACAATGTGTGGGGCATCACCGCGCGCAACCGCGAACAGAATTTCGCGCTCAATGTGCTGATGGATCCGGAGGTGGATTTTGTCACTTTATTAGGCCAGGCCGGTACCGGCAAAACCCTGCTCACCCTCGCGGCGGGGCTGATGCAGACGCTGGAGCACAAGCGTTATTCGGAAATCATCATGACCCGCGTGACCGTGCCGGTAGGCGAAGACATCGGTTTCCTGCCCGGCACCGAGGAAGAAAAAATGACGCCGTGGATGGGCGCGCTGGAAGACAATCTCGACGTGCTCAACAAGACCGACGACAGCGCCGGCGACTGGGGACGCGCCGCCACGCAGGACCTGATCCGCAGTCGCATCAAGGTCAAATCGCTCAACTTCATGCGCGGGCGTACCTTCCTCAACAAATACCTGATCATCGACGAGGCGCAGAACCTCACCCCCAAACAGATGAAAACCCTCATCACCCGCGCCGGTCCCGGCACCAAGGTGGTGTGCCTGGGCAACATCTCGCAGATTGATACGCCTTACCTCACCGAGGGCAGCTCCGGCCTGACCTACGTGGTGGACCGCTTCAAGGGCTGGCCCCACGGCGGCCATATCACCCTGGCGCGGGGCGAGCGTTCGCGCCTGGCCGACTGGGCGGCGGAAATGCTATGA
Protein sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|884262_885555_-|WP_147071312.1|DBSCAN-SWA MDSAPLAERLRPRNLDEVIGQTHLLGPGKPLRVAFESGKPHSMILWGPPGVGKTTLARLMSHGFDMEFIALSAVLSGVKEIREAVARAEMMQGQGRRTLLFVDEVHRFNKSQQDAFLPFVEAGLITFIGATTENPSFEVNSALLSRAQVYVLKPIVETDLAALLERARISTYPDIAITGAARAQLTGYADGDARRLLNLLEQAANAALDAARHEIDEDFIQTTLARGARRFDKGGDNFYDQISALHKSVRGSHPDAALYWMVRMLDGGADPLYVGRRLIRMASEDIGLADPRALRLTLDAVETYQRLGSPEGELALAQAVIYLACAPKSNAVYVAYNNARSFVAADKSREVPVHLRNAPTKLMKQLGYGHAYRYAHDEPDAYAAGEDYLPEGVNAHFYAPTPRGLEGKIAERLAYLRKLDEQANKKTGQK >NZ_AP021884|882982:892137|888744_889701_+|WP_147071121.1|DBSCAN-SWA MTTPQHHRLIILGSGPAGYSAAVYAARANLNPVVITGMAQGGQLMTTTDVDNWPADADGVQGPELMARFEKHARRFNTEIIFDHIHTAKLTDKPIALVGDQGSYTCDALIIATGASAMYLGLESEQAFMGKGVSGCATCDGFFYRNQDVAVIGGGNTAVEEALYLSNIARHVTVVHRRDKFKSEKILADHLMEKVKEGKISVEWNSELDEVLGDKTGVTGMRIKSTVDGSTRDIALTGVFIAIGHKPNTDIFTGQIAMEGGYIVTQGGNKGNATATSVPGVFAAGDVQDHIYRQAVTSAGTGCMAALDADRYLESLGK >NZ_AP021884|882982:892137|889708_890254_+|WP_147071119.1|DBSCAN-SWA MAGRVPPADEAALFKAAVQDAQPLPDHGKVEPPLPRVSPIPRQRIRDERQVLADSLSDHIVWEDTMETGEELVFLRTGLRRDTLKKLRRGHWVLQAELDLHGLVSVEARQALSAFIAGCGKRGLRCVRIIHGKGLRSKNREPVLRTKVKNWLMQKDEVLAFCQARAVDGGSGAVVVLLKSS >NZ_AP021884|882982:892137|882982_884263_-|WP_147071127.1|tRNA|DBSCAN-SWA MLDIQHLRNDLDDVAQRLAMRGYTLDTAEFQRLEAERKQLQTRTQELQARRNASSRQIGMAKAKGEDVSTIMAEVANLGAELKAAESALEALQARLNALLMTIPNLPHASVPTGKSDADNVEIARVGTPRVFDFAVKDHVDVGTALGLDFETASKLAGARFSLLRGGLARLHRALAQFMLDTHTAVHGYTEVYVPYLVNADSMRGTGQLPKFEEDLFHVPRADAEKLYLIPTAEVPLTNMVRDAIVAREQLPLKFVAHTPCFRSEAGSYGRDTRGMIRQHQFDKVELVQLVEPEQSYTALEALTGHAEAILQALGLPYRKMALCGGDMGFSAAKTYDLEVWLPAQNTYREISSCSNFEAFQTRRMQARFRGEKGKPELLHSVNGSGLAVGRTLVAILENHQQADGSVALPEVLRPYMGGLTVLMPA >NZ_AP021884|882982:892137|886347_888633_-|WP_147071123.1|DBSCAN-SWA 
MSIPRRNTPKSVRKPLPPRLTGLLREVGWLLSVALAAFLAVILFTYTQSDPGWSHSGQGSVHNMGGTLGAYLADILLALFGLSTWWLVMLALYATWWGYRKLEISEAPDRRPLYIRSAGFLLLLLSSAGLESQRLYSLHIALPEHPGGILGAVVGEAVTHLLGFTGGTLALLVLIAAGLSLFTGISWLALVESTGEYLERSWLWAIETWSARRERRMGQAATLRRDEIVLEEKRRNMDRAPVVIETPPPTIIKSERVQVEKQVPLFAEMPDSPLPPLHLLDAATAHVESLSAETLGFTSRLIERKLAEFNVQARVVAALPGPVITRYEIEPASGVKGSQIVNLAKDLARALSVVSIRVVETIPGKTTMALEIPNPKREIVRLSEILSAKVYHAMASPLAMAMGKDIAGNPVVADLAKMPHVLVAGTTGSGKSVAINAMILSLLYKADASRVRLILVDPKMLELSVYEGIPHLLAPVVTDMRQAAAALNWCVAEMERRYKLMSALGVRNLAGYNQKVRDAKRAATPLTHPFSLTPDNPEPLEEQPMIVVVIDELADMMMVVGKAVEQLIARLAQKARAAGVHLILATQRPSVDVITGLIKANIPTRVAFQVSSKIDSRTILDQMGAEALLGQGDMLYLPPGTGYPQRVHGAFVADEEVHKVVEYLKQLGEPDYIDGILDTPEDNEGSGESAVGGSDAESDPLYDEAVAIVLKTRRPSISAVQRHLRIGYNRAARLIEAMEQAGLVTPMQSNGNREVIAPNRE >NZ_AP021884|882982:892137|890274_890736_+|WP_147071117.1|DBSCAN-SWA MTAITEFELPSTGNRTFKLTDMRGKKLVVYFYPKDDTPGCTVEGSDFRDLYAGFQAHNCEIVGISRDDMKSHEKFKTKLSLPFELLSDADEKVCELFGVIKLKNMYGKEVRGIDRSTFVFDSDGKLVKEWRGVKSAGHAQEVLDTIKTLQEKI >NZ_AP021884|882982:892137|890736_892137_+|WP_147071115.1|DBSCAN-SWA MPRKAAAPTKLFVLDTNVLMHDPTSLFRFEEHDIFLPMGTLEELDHNKKGMTEVARNARQASRFLDEIVSGCEDAIEAGIPLSSHSRKAATGRLFLQTRMARIETPLSLPNSKVDNQILGVILSLREEQPRRPIILVSKDINMRIKARALGLPAEDYFNDKVLEDTDLLYAGVRELPEDFWDRHGKGIESWQADGHTWYRVKGPLVTRLLVNEFVYQESASPLYAIVKTIKGNVAELQTIKDYSHQKNNVWGITARNREQNFALNVLMDPEVDFVTLLGQAGTGKTLLTLAAGLMQTLEHKRYSEIIMTRVTVPVGEDIGFLPGTEEEKMTPWMGALEDNLDVLNKTDDSAGDWGRAATQDLIRSRIKVKSLNFMRGRTFLNKYLIIDEAQNLTPKQMKTLITRAGPGTKVVCLGNISQIDTPYLTEGSSGLTYVVDRFKGWPHGGHITLARGERSRLADWAAEML >NZ_AP021884|882982:892137|885581_886199_-|WP_147071125.1|DBSCAN-SWA MKKLSLFTALLIFSASAAASSIDALKAFVADTQTARATFTQTVLDKNGHARQQSSGTMAFARPGKFRWVYEKPYEQIIVGDGKRIWLYDADLQQVTIKKLGQALGSSPAALLAGSKDIGRFYTITDAGSRDGLEWLDARPRDKESAFENVRMGFSKNTLVAMEIKDNFGQTTVLKFAGLERNPALPASDFHFTPPQGVDVISDGK |
8 | uncultured_Mediterranean_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_4 |
1605656 : 1613255
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|DBSCAN-SWA CATGCAAAATCTCGACACTATTGTTGCCGCAGCGCTAGCCGAATTCGCCGCAGTCAACCAGGCCGTTGAACTGGAGCAGGCAAAAGCCCGCTATCTCGGCAAAGCCGGTTTGCTCACCGGGCAATTGAAACAACTGGGCAAGCTTCCCGCCGCAGAACGCCCGGCAGCGGGCAACGTGATCAATCAGGCCAAGGAACGGATTCAGCAGGCGCTGGAAGCGCGCCGCGCAGCCTTGTCCCGGGCTGAGCTGGATAACAGGCTGGCGGCGGAAACCCTGGATGTGACGCTGCCCGGACGCGGCCTGGGCACAGGCGGCCTGCACCCGGTGACGCGCACGCTGGCACGCATCCAGGCGCTGTTCGCCTCGATCGGTTTCGAGGTGGCGGAAGGCCCGGAGATTGAAACCGATTTCTACAATTTCACCGCACTGAATATTCCGGAAAACCACCCGGCGCGCGCCATGCACGACACTTTCTACGTGGATGACAAACACCTGCTGCGCACCCACACGTCGCCGGTGCAGATACATTATTTGCAGAACAATCAGCCGCCGCTCAAGATCATCGCGCCAGGCCGGGTATATCGCTGCGATTCCGACGTGACCCACACACCCATGTTTCATCAAGTCGAGGGATTGTGGGTGGACGAAGAGGTGAGTTTCGCGGCATTGAAAGGCGTGCTGGCGGATTTCATGCAGCGTTTTTTTGAACGCGATGACCTGAAGGTGCGCTTCCGCCCATCGTTTTTCCCGTTCACCGAACCGTCGGCGGAAATGGATATCGCTTGCGTGATGTGCGGTGGCGGCGGTTGCCGCGTATGCAGCCATACCGGCTGGCTGGAAGTGCTGGGCTGCGGCATGGTGCATCCCAATGTGCTGGGACATGTGCATGTGGATAGCGAAAAATACCTCGGTTTCGCGTTTGGCATGGGGGTGGAACGGCTGGCCATGCTGCGCTACGGTGTGGATGACCTGCGCCTGTTTTTCGCTAATGATTTGCGTTTCCTGAAACAGTTCAACTGAACCATCAAGATGAAATTCTCCGAGCTCTGGTTGCGTACCCTTGTTAATCCTGCGCTGGACAGCGCGGCGCTGTCCCATCTTCTTACCATGGCCGGACTGGAGGTCGAAGCGCTGGACCCGGTCGCGGCGGATTTTTCCGGCGTGGTGGTGGGGCAGGTGCTGTCCGTAGCGCCGCATCCGGATGCCGATCGCCTGCGCGTGTGCCTGGTAGATGCCGGCACTGGCAGCCCGTTGCAGATCGTATGTGGCGCACCCAATGTAAGTGAAGGCGCGCGCGTGCCTTGCGCCCTGGCAGGCGCCCGCTTGCCGGGCTTTGAAATCAAGAAAGCCAAGTTGCGGGGTGTGGAATCGCAGGGTATGTTGTGCTCCGCGCGCGAGCTGGGACTGGCAGAACAAGCCGATGGCCTGCTGTTGTTGCCGAACGACGCACCGGTGGGTAGCAATATCCGCGATTATCTGCATCTGGATGACAGGCTTTATACGCTCAAACTTACCCCCAATCGCAGCGATTGCCTGAGCGTGGCCGGCGTGGCGCGTGAAGTGGCCGCGCTTACCGGCAGTCCATTGAACTTGCCCCGGATTGAACCCGCAGCGGTCACCGGCAGGCTCACCCGCATGGTGCAGGTGACTGCAGGACAAGCCTGCCCGCGCTATTGCGGGCGCGTCATCAGCCAGCTCAATCGCGCGGCTCAAACACCGGGCTGGATGATTGAACGCCTTTCCCGCAGTGGCCTGCGCAGTATCAGTCCGGTAGTGGACATTACCAACTATGTATTGCTGGAGTTGGGACAACCCTTGCATGCCTTTGATCTGGACAAGCTTGCTGGCGATATCCAGGTGCGCATGGCCACGCCGGGTGAAACGCTGACGCTGCTGAATGATCAGCGTGCGACGCTGGAAGC
GGACATGCTGGTGATCGCCGATGACAACGGCGCGCAGGCGCTCGCTGGCATCATGGGGGGGGCGGCCACCGCAGTGGATGAAAATACCTCGGAAATTTTTCTTGAGGCAGCGTATTTCAGCCCCGGCGCGATTGCCGGACGGGCGCGCCGGCTGGGCTTGTCCACCGATTCATCGCACCGTTTTGAGCGCGGGGTGGACTACGCAGCCACGCGCGATGCGCTGGAACGCGCCACGGCATTGATACTTGAAATTTGCGCTGGCGCGGCAAGTGCAATCACCGAAATAACAGGCGATCTGCCACAACGTGCGCCTGTCATGCTGCGCACCGCGCGTGCCAGCAAAGTGTTGGGCGTGGCGCTGAGTGACGCGCAGGTGGAAGTGTTGCTGGGCCGCCTGTGCTTTGACTTTCAGCGCGATGGCGCGGCCTATCAGGTGACGCCGCCCAGCTACCGCTTTGACCTGAATATCGAGGAAGACCTGATCGAAGAACTGGCGCGGCTCCATGGTTATGACAACATTGTTGCGCAGGCCCCGGTCGCCCGCCTGACCATGTTGCCGCAGCCGGAGCAACAGCGTGGGGTGGATGCGTTGCGCACCCTGCTCACCGCGCGTGATTATCAGGAAGTCATTACCTACAGCTTTGTAGATGCCGCATGGGAAGCGGATTTCGCACCCGGCGCTCAGCCCGTCGTGCTGAAAAATCCCATCGCCAGTCAGATGGGCGTGATGCGCTCCACCTTGTTGGGCGGCCTGATGGATGTGCTGCGCAACAATCTGAACCGGCGCCAGGAGCGTGTGCGTATTTTTGAGAGCGGACGCTGTTATCTGCCGGCGGCCGAGGGCTTCGATCAGCCGCAACGCCTGGCTGGACTGGCTTACGGCAGCGCTATGCCGGAGCAGTGGGGGAGTGCGGCGCGCAACGTGGACTTTTTTGACGTCAAAGCCGATCTCGAAGCGCTGTGCTGGCCACAGCCTGCACGCTTTGAAAAATCCGCTCATCCTGCGCTGCATCCAGGCCAGTGCGCTGAAATGTGGTTGAATGGTGTCCATGCCGGCTGGCTGGGTACATTACACCCACGGCTGACGCAGCAATATGATTTGGCGACAGCGCCGGTTGTGTTTGAACTCGCCCTGCCGGCATTGTTAACGCGGAAGCTGCCCAGGCATGGCGAGATTTCGCGTTTCCAGAGCGTGCGCCGTGATCTGGCCGTGATAGTCGATGAATCGGCGCCGGTACAGGCTTTGATTGATGCGATGTACGCAGCACGCATAGAGGGTGTTGCCGAGATTACATTGTTTGACGTGTATCGCGGCAAAGGCATTGATTCTGATAAAAAAAGTCTTGCATTCCGGGTGCTGTTGCAAGATACTCAAAAGACCTTTACCGACACTGAAGTGGATACCGCCATGGCGTACTTCACCGATCTGTTAAAACAACAATTCAACGCGCAATTACGTTCCTGAGGTAGTCATGACCCTGACCAAGGCAGAACTGGCAGACATGCTGTTCGAAAAAGTTGGCCTGAATAAACGCGAAGCCAAAGACATGGTGGAGTCGTTTTTCGAAGAAATACGCATTGCACTGGAAGCGGGCGATACCGTGAAGCTTTCCGGCTTTGGCAATTTTCAGCTGCGTGACAAACCGCAGCGTCCTGGCCGCAATCCCAAAACCGGCGAAGAAATGCCAATCACGGCACGCCGCGTGGTGACCTTTCACGCCAGCCAGAAACTCAAATCGCAGGTAGAAGACGCGCATGGCGGAACATCAGCCAACTAGTCAACTGCCGCCGATTCCTGCCAAGCGCTACTTCACTATCGGCGAGGTCAGCGAACTGTGCGGGGTGAAGCCGCACGTACTGCGCTACTGGGAACAGGAATTCGGCCAGCTCAAACCAGTCAAGCGACGTGGTAACCGTCGTTACTATCAGCATCATGAAGTGCTGCTGATTCGCCGCATCCGGGAACTGCTTTATGAGCAGGGATTCACGATCAATGGCGC
ACGCCATCGTCTGGATGTGCTGGCCACATCCGACGCCGCCGAGGCCGCACCCACGGTGACTGAATCGGTAACGGATTATGCAGCACTGCGTCGCGAAATGATGGAAATTGTCGAGTTGCTGCGCCTGTGATTTTTTAGCTCCAGTCTTTGTCCAGGCGCTACAGACTCTGCTATAATCGCGGCCTTCGGGGCGTAGCGCAGCCTGGTAGCGTACTTGCATGGGGTGCAAGTGGTCGGAGGTTCAAATCCTCTCGCCCCGACCAGAACAATGCCCAGGTGTCATGCCTTTCTAGAAAGCAATTCACGGGAATTTCAGAGTAGCCGTCCACTATGCGCATTTTGCTGAGCAACGACGACGGTTACTTTGCACCCGGTCTCGCCATCCTGGCGGATACACTCTCACACATCGCAGATATCACGGTGGTTGCCCCCGAGCGCGACCGCAGCGGCGCCAGCAATTCCCTCACACTCGATCGTCCGCTGATGCTGCGCCAGGCGCACTCCGGGTTTTATTACGTCAATGGTACGCCCACGGACTGCGTTCACCTCGCGGTTACGGGTATGCTCGATCACCTGCCGGACATGGTTATCTCCGGTATCAATCACGGTGCCAACATGGGCGACGACACTATTTATTCGGGCACCATAGCGGCAGCCACCGAAGGGTTTTTACTGGGGGTGCCGTCGCTGGCGATATCCCTGGCGAGCCATGCAGCGGGCAACTATGCCACAGCCGCGCGTGTTGCCAGCGAGCTCGCGCAACGTGTCATGGCACGGCCTTTTGCGGCACCGCTACTGCTCAACGTGAACGTGCCGGATATTCCCTATCAGGACTTGCAAGGCACCGCAATCACCCGCCTCGGACGCCGCCACAAAGCCGAACCGGTGGTCAAATCCACCAATCCGCGCGGTCAGACGGTGTATTGGGTGGGGGCTGCGGGTGCGGCGCAGGATGCAGGCGAAGGCACGGATTTCCATGCGGTGGCGAATGGGCGTGTGTCGGTGACGCCGTTGCAGATGGATCTCACCCAGTTCAGTCAACTGGCGCCGCTGCGGGCGTGGTTGCAGGCATGAGCGTGACGCGTCACAGCGGTATCGGCATGACTTCCGAGCGTACCCGCGCGCGCATGGTCGAGCGCCTGCGTGCGCAGGGGATCAAGGACAACAACGTACTCACCGCGATGGGCATGGTGCCCCGGCATATTTTTGTGGATGAGGCACTGTCCATCCGGGCTTATGAGGACAGCGCGCTGCCGATAGGTTTCGGCCAGACCATTTCCAGCCCCTATAGCGTGGCGCGCATGATCGAGGTGCTGCGTGGCGGCGCCGACCTGCAGTGCGTGCTGGAAGTCGGCACAGGTTGCGGCTACCAGGCCGCTGTACTGGCCAAGCTGGCACGCGAAGTGTACTCGGTTGAGCGCATTGCCACGCTGCTCGGGCGCGCGCGTCGTACCATACGCGAACTACGCATCGGCAATATCAAACTTAAACATGGCGATGGTAGCATTGGGTTAAAGGATGTGGCACCTTTCGATGGCATCATCCTTGCCGCCGCCATACCCACTCCCCCCCAGGCGTTGCTGGAACAACTGGCGCAGGGCGGCCGCATGGTATTGCCGCGAGGTATTGGTGAAACGCAGCAAATGGTGCTGATCGAGCGCACCGCAGAAGGTTTTCAGGAGACGGTGCTGGAAATGGTACATTTTGTTCCACTGTTGCCCGGAGTGCGCTGACGTGATGGTATTCGACAAGATTGCATGCCCGGTTTATCTGGTTGCTGTGATCCTTGCCCTGGGAGGATGCGCCACGCAGAATTCTGCGCCGGTTGTGGATGGCACCCAGCCTGGCACAAGCAATATTGTCAAGCCGGCAATCAAGTCCGCCACTACCCGCGCCGGCGCAGCAAAGCTGCATGACTGGCGACCGGACAGCCATACCGTGCAAAAAGGCGACACGCTCTACAGCATCGCGCTTGAATATGGCCTGGACTACCGTGATCT
GGCGAGCTGGAATGCACTATCTGACAATAACCTGATCCGTGTCGGGCAGGTGTTGAAACTGAGTGCGCCGCAGCCAGGCAGCGGCATTGCGCAAGTCACGACATCTGAATCAGCAGTTCAGACCATCCCCCTCAAGATCGAACCGTTACCGCAGGCCCAGATAGCGACCGGCGCGGTGTTGATAACCCAGCCCAAGGCGGTCAAATTGCCCTACTCTGCCGCCGCATTGGCGCAGCTTGAACAAGGCGGGACGCCGCAGCCGGCCGCGCGGCCTCCCGCAACACCGGAGGCAGCGTCCGGGGTGGCGCCCGAGCCCGCATCCTCTGCAGAACAATCCCGACCGGCCGCGACTGCCAAGGAAACCGATGATACGGGTATTGATTGGATCTGGCCTACGCAAGGCCGGGTCATTGCCGGATTTGACGAAGCCAAAAACAGCAAGGGGCTGGATATAGCAGGCAAAGCCGGACAAGCCATATTCGCAGCCGCGCCGGGCAAGGTGGTGTATAGCGGCGCTGGTTTGCGCGGCTATGGCAAGCTGGTTATTATCAAGCACAACGCCATTTATTTGAGTGCCTACGCACACAATCAGCGGGTGCTGGTGAAAGAAGGTCAGACGGTTGCGCGCGGGCAGAAAATCGCCGAAATGGGTGACAGTGATGCCGATCAGGTCGCGCTGCATTTCGAAATCAGGAAAATGGGCCAACCGGTGGACCCGATGAAATATCTTCCCGGAGCACAAAAATAGCGATGCATGACGACAACGAGCAGGAAGACTACGCCGGTATCCCGGACGACAAATCCCAGACCGAAGTGCCCGAGGTCGAACTGGGATTTACCGAGGATGCGCATACCGATGTGACGCAGATGTACCTCAACGAAATTGGCCACAACGCATTGCTCAGCCCCACTGAGGAACGCCGCCTGGCCGAACTCACCCGTGCGGGCGATTTTGACGCCCGGCAAAAAATGATCGAACACAATCTGCGGCTGGTGGTGAATATCGCCAAGCATTACGCCAATCGCGGGCTGGCACTGCTGGACCTGATCGAGGAGGGCAACCTGGGACTGATTCATGCGCTGGAAAAGTTCGAGCCCGAACGCGGATTTCGTTTTTCCACTTATGCCACATGGTGGATACGCCAGAATATCGAGCGCGCCATCATGAACCAGTCGCGCACCATCCGCCTGCCGGTGCACGTCATCAAGGAACTCAACGTCATTTTGCGCGCCCGCCGTCATCTGGAAAATCACGGCGCCTCCGACCCGAGTGACGAGGACATCGCCCATCTGGTCGGGCTGCCGGTAGAAGATGTACGACGCATGTTGCGCCTGAATGACCGGGTGGCATCGCTCGACGCACCGCTCGATATTGATCCCAGTCTGTCCATCGGGGAGGCCATTGCAGATGGCAACAGCGCGTTGCCTGAAGACATGCTTGAGCACGCCGAGACTGAAGCCTTTGTGCGCCTGTGGCTGAGTGACCTCAACGACAAGCAGCGCTGGGTAATTGAGCGGCGTTTCGGGCTGGGCGGGCAGGATGTGCACACCCTGGAACAGCTGGCCGAAAGCCTCGACGTCACCCGTGAACGCGTGCGCCAGATCCAGATGGAAGCCCTGCACCATTTGCGGCGCATGCTGAAACGCACCGGCGTCAACAAGGACGCTCTGTTGTGA
Protein sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|1605656_1606676_+|WP_147073445.1|tRNA|DBSCAN-SWA MQNLDTIVAAALAEFAAVNQAVELEQAKARYLGKAGLLTGQLKQLGKLPAAERPAAGNVINQAKERIQQALEARRAALSRAELDNRLAAETLDVTLPGRGLGTGGLHPVTRTLARIQALFASIGFEVAEGPEIETDFYNFTALNIPENHPARAMHDTFYVDDKHLLRTHTSPVQIHYLQNNQPPLKIIAPGRVYRCDSDVTHTPMFHQVEGLWVDEEVSFAALKGVLADFMQRFFERDDLKVRFRPSFFPFTEPSAEMDIACVMCGGGGCRVCSHTGWLEVLGCGMVHPNVLGHVHVDSEKYLGFAFGMGVERLAMLRYGVDDLRLFFANDLRFLKQFN >NZ_AP021884|1605656:1613255|1610648_1611311_+|WP_147073435.1|DBSCAN-SWA MSVTRHSGIGMTSERTRARMVERLRAQGIKDNNVLTAMGMVPRHIFVDEALSIRAYEDSALPIGFGQTISSPYSVARMIEVLRGGADLQCVLEVGTGCGYQAAVLAKLAREVYSVERIATLLGRARRTIRELRIGNIKLKHGDGSIGLKDVAPFDGIILAAAIPTPPQALLEQLAQGGRMVLPRGIGETQQMVLIERTAEGFQETVLEMVHFVPLLPGVR >NZ_AP021884|1605656:1613255|1609050_1609356_+|WP_147073441.1|DBSCAN-SWA MTLTKAELADMLFEKVGLNKREAKDMVESFFEEIRIALEAGDTVKLSGFGNFQLRDKPQRPGRNPKTGEEMPITARRVVTFHASQKLKSQVEDAHGGTSAN >NZ_AP021884|1605656:1613255|1611315_1612326_+|WP_147073482.1|DBSCAN-SWA MVFDKIACPVYLVAVILALGGCATQNSAPVVDGTQPGTSNIVKPAIKSATTRAGAAKLHDWRPDSHTVQKGDTLYSIALEYGLDYRDLASWNALSDNNLIRVGQVLKLSAPQPGSGIAQVTTSESAVQTIPLKIEPLPQAQIATGAVLITQPKAVKLPYSAAALAQLEQGGTPQPAARPPATPEAASGVAPEPASSAEQSRPAATAKETDDTGIDWIWPTQGRVIAGFDEAKNSKGLDIAGKAGQAIFAAAPGKVVYSGAGLRGYGKLVIIKHNAIYLSAYAHNQRVLVKEGQTVARGQKIAEMGDSDADQVALHFEIRKMGQPVDPMKYLPGAQK >NZ_AP021884|1605656:1613255|1612328_1613255_+|WP_147073433.1|DBSCAN-SWA MHDDNEQEDYAGIPDDKSQTEVPEVELGFTEDAHTDVTQMYLNEIGHNALLSPTEERRLAELTRAGDFDARQKMIEHNLRLVVNIAKHYANRGLALLDLIEEGNLGLIHALEKFEPERGFRFSTYATWWIRQNIERAIMNQSRTIRLPVHVIKELNVILRARRHLENHGASDPSDEDIAHLVGLPVEDVRRMLRLNDRVASLDAPLDIDPSLSIGEAIADGNSALPEDMLEHAETEAFVRLWLSDLNDKQRWVIERRFGLGGQDVHTLEQLAESLDVTRERVRQIQMEALHHLRRMLKRTGVNKDALL >NZ_AP021884|1605656:1613255|1606685_1609043_+|WP_147073443.1|tRNA|DBSCAN-SWA 
MKFSELWLRTLVNPALDSAALSHLLTMAGLEVEALDPVAADFSGVVVGQVLSVAPHPDADRLRVCLVDAGTGSPLQIVCGAPNVSEGARVPCALAGARLPGFEIKKAKLRGVESQGMLCSARELGLAEQADGLLLLPNDAPVGSNIRDYLHLDDRLYTLKLTPNRSDCLSVAGVAREVAALTGSPLNLPRIEPAAVTGRLTRMVQVTAGQACPRYCGRVISQLNRAAQTPGWMIERLSRSGLRSISPVVDITNYVLLELGQPLHAFDLDKLAGDIQVRMATPGETLTLLNDQRATLEADMLVIADDNGAQALAGIMGGAATAVDENTSEIFLEAAYFSPGAIAGRARRLGLSTDSSHRFERGVDYAATRDALERATALILEICAGAASAITEITGDLPQRAPVMLRTARASKVLGVALSDAQVEVLLGRLCFDFQRDGAAYQVTPPSYRFDLNIEEDLIEELARLHGYDNIVAQAPVARLTMLPQPEQQRGVDALRTLLTARDYQEVITYSFVDAAWEADFAPGAQPVVLKNPIASQMGVMRSTLLGGLMDVLRNNLNRRQERVRIFESGRCYLPAAEGFDQPQRLAGLAYGSAMPEQWGSAARNVDFFDVKADLEALCWPQPARFEKSAHPALHPGQCAEMWLNGVHAGWLGTLHPRLTQQYDLATAPVVFELALPALLTRKLPRHGEISRFQSVRRDLAVIVDESAPVQALIDAMYAARIEGVAEITLFDVYRGKGIDSDKKSLAFRVLLQDTQKTFTDTEVDTAMAYFTDLLKQQFNAQLRS >NZ_AP021884|1605656:1613255|1609908_1610652_+|WP_147073437.1|DBSCAN-SWA MRILLSNDDGYFAPGLAILADTLSHIADITVVAPERDRSGASNSLTLDRPLMLRQAHSGFYYVNGTPTDCVHLAVTGMLDHLPDMVISGINHGANMGDDTIYSGTIAAATEGFLLGVPSLAISLASHAAGNYATAARVASELAQRVMARPFAAPLLLNVNVPDIPYQDLQGTAITRLGRRHKAEPVVKSTNPRGQTVYWVGAAGAAQDAGEGTDFHAVANGRVSVTPLQMDLTQFSQLAPLRAWLQA >NZ_AP021884|1605656:1613255|1609333_1609708_+|WP_147073439.1|DBSCAN-SWA MAEHQPTSQLPPIPAKRYFTIGEVSELCGVKPHVLRYWEQEFGQLKPVKRRGNRRYYQHHEVLLIRRIRELLYEQGFTINGARHRLDVLATSDAAEAAPTVTESVTDYAALRREMMEIVELLRL |
8 | uncultured_Mediterranean_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
| DBSCAN-SWA_5 |
1977054 : 2023954
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|DBSCAN-SWA AATGCAAACCCAAGTTCCATCAATCGAATCCGGTCGGAATCCCCGCCGGATGAATCCCGGCGGTGCAACCTGCATCGCCCTCGACGAAAACGAGCTCGCCATCCGCTGGGGGCTCTCCGTCAAGACGCTGCGCCGCTGGCGTCAAGAGCAGCTCGGCCCCATCTACTGCAAGCTCGGTCGCCGGGTCACCTACCTCCTGCACGAAATCGAAGCCTTCGAGCGCCGCGTCTCGCGCTACTCGAGCTTCACTCGTGCGTACCAGTGAGGAGGACGGCCATGAGCGATCTGACCATCTTCCCCGTCGACATCGCTGAGATGTCTGTGAGCCAACTGGCCGCGCTGCCGCCCGAGCAGAAGTGCGAGGTCGACAAGAACCTTGATGCTGCCATCGACTGGCTAAAGAAGGCTCGCACCAAGTTCGATGCGGCGCTGGAACAGTGCTACGGCGAGCAGGCCCGTGTCGCACTGCGTGAATCAGGCCGTGACTTTGGTACCGCCCACATCAGCGACGGCCCGCTGCACATCAAGTTCGAGCTGCCCAAAAAGGTCAGCTGGAACCAGAAACAGTTGGGCGAAATCGCCGAGCGCATCGTGGCCTCAGGCGAGAAGGTCGAGGGCTACCTCGACGTCAAGCTCTCAGTGTCCGAGTCCCGGTACATCAACTGGCCGCCTGCATTGCAGCAGCAATTCGCGGCCGCCCGCACGGTCGATTCCGGCAAGCCGTCCTTCACCCTGAGCACCGATGGGGGTGAGGCATGAAGCGGCTACCCATCGTGTCCGCCGTCGAGCGGATGGCCGAGCGCAAGGGCGTGAAGCTGCTGATGCTGGGCAAGTCCGGCATCGGCAAGACGTCCCGGCTCAAAGACCTCGACCCCGCCACCACACTGTTCCTTGACATCGAGGCAGGCGACTTGGCGGTCGCCGACTGGCCGGGCGACACCATCCGCCCGGCGTCCTGGCCCGAGAGCCGCGACTTCTTCGTGTTCCTTGCGGGCCCGGACAAGTCGCTGCCGCCGGAGAGCGCGTTCTCGCAGGCGCACTACGACCACGTCATCGAGAAGTTTGGCGATGCGACGCAGCTCGGTCGCTACCAGACCTTCTTCCTTGACTCGATCACGCAACTGTCTCGCCAGTGCTTTGCGTGGTGCAAGACGCAGCCCGGGGCGGTCAGTGATCGTTCCGGCAAGCCCGATCTGCGCGCGGCCTACGGGCTGCTCGGCCAGGAAATGATCGGCGCGTTGACCCACCTGCAGCACGCCCGTGGCAAGAACGTTGTGTTCGTGGCGATCCTCGATGAGCGACTGGATGACTTCAATCGCAAGGTGTTCGTCCCGCAGATCGAAGGCAGCAAGACCAGCCTGGAGCTGCCCGGCATCGTCGATGAGGTCGTGACGCTGGCCGAGATCAAGGCCGAGGACGGCAGTTCCTACCGCGCCTTCATCACGCACACCGTCAATCCCTACGGCTTCCCGGCCAAAGACCGCAGCGGTCGTCTCGACCTGCTGGAGCCGCCGCATCTCGGCGCGCTGATCGCCAAGTGCGCGGGCGCTGTGCCCGCGCTAGCCAGCGCCGCCAACCCCGCACACATCGAATCTCAGGAGTAATCGCAATGACCGCATGGAATGACTTCAACGACGCCGACTCTCAGCAATCCGGCTTCGATCTGATCCCCAAGGGCACCGTCGTGCCGGTGCGAATGACCATCAAGCCGGGTGGCTATGACGACCCCGAGCAAGGCTGGGGTGGCGGCTACGCCACCGAATCGTTCGAGACCGGTTCCATCTATCTGGCCGCTGAGTTTGTGGTCACCGCTGGCGATCATGCCAAGCGCAAGATGTGGAGCAACGTCGGCCTGCTCTCCAAGAAAGGCCCGACCTGGGGCCAGATGGGGCGCAGCTTCATCCGGGCCGCGCTCAACAGCGC
CCGCAACGTCCACCCGCAGGACAACAGCCCACAGGCCGCCGCCGCGCGCCGCATCAATGGCTTCGCCGAACTGGACGGTCTGGAGTTCTTGGCGCGCGTCGACATCGAGAAGGACGCGAAGGGTCAAGACCGCAACGTGGTCAAGCTGGCAGTCGAGCCCGACCACCCCGACTACGCCAAGTTGAAAGGTGTGCCGCCGAAGGGCAGTCCGGGCGGTGGCAACTCCGGCGCTCCGGCGCAGGCGGCCCCGGCCTATTCCGCGCCCACCCCGCAACGCGCACCAGTGACGGGCAAACCGTCCTGGGCTCAGTGAGGAGACGGCTATGAATGCATCCGTCCTCACTGCCAGTCACTACGGCGTCGTGCGCTTCGGCGATCTGCAATGCGAGGCCGTCGTCCTCAAGGGCGGCGAGCGTGGCTACGTTCGTCGCCAACTGGCCAAGCTGCTGGGTTTCCACGAGACGCACAAGGGTGGCCGATTTGCCCGGTTTCTTGCCGACTTCGCTCCTAAGTCCTTGTCGGCATTGGAGAAAACTCGTGAGCCGATTCTGTTGCCGTCAGGTCGGCAGGCGCAGTTCTTCCCGGCCGGGATCATTGCCGACGTCGCGTCGGCGGTGGTCAGCGCGGCCATCAACGGCACGCTGCACAAGGCCCGCCAGGGCATCGTGCCCAATTGCATGAAGATCATGCGCGCGCTGGCCACCACCGGCGAGGTCGCGCTGATCGACGAGGCGACGGGCTACCAGTACCACCGCGCGCCTGACGCGCTGCAGGAACTGATCTCCAAGCTGCTGCGCCAGTCGTGCTCTTCGTGGGAGCGCCGCTTCCACCCGGACTACTACCGCGCCCTCTACCGGCTGTTCGGCTGGAAGTACCAGGGCCACGACCAGAACCCGCCCCACGTTGTCGGTCAGATCACGCAGCGCTGGGTCTACGGCCCGGTGCTGCCCGTCACGCTGATCGACGAGATTCGCGCCCGCAAGGGCATCTCGCAGAAGCACCACCAGTGGCTGTCCGATCAGGGCCTCGCCCGTCTGGAAACGCAGATTCACGCGGTCACCGCCATTGCGCGCAGCTCGACCTGCTACCGCGACTTCGACCGCCGCTGTGAAGCGGCCTTCGCTGGCGGCGCGCTGCAGCTGGCGCTGCTGGCCGAAGACTTTGAGGAGGGGGCGTGAAATGCTGGGTCTGCAAACGACAGGCCCGGGGATTCGGTCACACCGACAACCGACACGGTATCGGCGATCCCCGGCGCTACCCCATCGACTGGGTGTTCTGCTCGCAGCGCTGCCAATCCGCGTTCCACGCTATGTACGGCAACTGGTCGCGCGCCAAGGATGGTCGCAGCGACATCAAGGGGGTCGCCATGATCGATCCCTCTGATATCGAGCTGGCCGCGATGCGCAAGTGCCTCAAGTCCTTCGGCGAGGCGGCAAGCGAGATCGGCTTCACCAAACCACTGGGCAACTACTCCGAAGCCGAGGCGCTGCAGGTGATCGACGCCATCGTCACTTGCTACACCGAGGCGATGGTTGAGCACCACGAGGCGAGCAAGTACCCGCCCGTACGCGGCATGACGCCAACGCCCGACCCCATGACACCGAGTGCAGCCAATCCGTTCGCGGATCTGGACGACGACCTGCCTTGGGAAGAACCGAAGGGGAAGAAGCCATGATGGACTTCAACTCCACTTCGAGCATCTCGGGCCAGATCACTGCGCTGGTCGACGCCGGGATGCAGCGGGCGCGAGCCCAGCAGTCCGAGCGCCAGTACCTTGGTGCCTCGCGGTTGGGCGCTGCCTGCGAGCGTGCGCTGCAGTTTGAGTACGCCAAGGCTCCCGTCGATCACGGCCGGGACACCCCGGGCCGGATGCTGCGCATCTTCGAGCGCGGCCACGTCATGGAGGACTGCATGGTCGCGTGGCTGCGCGACGCCGGTTTCGAATTGCGTACCCGCAGGGCCGATGGCGAGCAGTTTGGCTTCTCCGTGGCTGATGGCCG
TCTGCAGGGCCACATCGACGGCGTCATCGTCGATGGCCCGGAGGGCTTTGCCTACCCGGCGCTCTGGGAAAACAAGTGCCTCGGCATGAAGTCCTGGCGCGAGCTGGAGAAGAACCGGCTCGCCGTGGCCAAGCCCGTCTACGCCGCGCAAGTGGCGATCTACCAAGCCTATCTCGAACTGCACGAGCACCCGGCGATCTTCACGGCGCTCAACGCCGACACGATGGAGATCTACACCGAGGCCGTGCCCTTTGACGCAGCCCTGGCCCAGCGAATGTCGGATCGGGCGGTGAAGGTCATCACGGCGACTGAAAGCGCAGATCTCCTGCCGCGTGCCTTCAATGACCCGACCCACTTCGAGTGCCGGATGTGCGCGTGGCAAGACCGCTGCTGGAGAACACAAGCATGACCGACAACAACACCCCGACCACCGGCATCGAGCCGATGATCGATGCCAAGCAGGCGGCCGCCGCGTTGCGCCTGCCGTACTACTGGTTCGCCGACCACGCGATGCGCACCAAGTACCGGATTCCGCACTACCTGATGGGCGGTCTGGTGCGCTACCGGCTGTCCGAACTCTCTGCGTGGGCCACGCGTACCACCGCCGTTCAGGGCCGTGATTCCCAAGATGCGGACGCACCTGTCGAGGGAGCCGAATGATCGACTTCAACGACACCACCCAACCTGCGGAGCACAACAGGGAATCTGAACGAGACGAGATTCGCGCCGACTTGCTTGCGCGTCTGGAGTCGGTGCTGACCACGATGTTTCCGGCTGGCAAGAAGCGCCGTGGCAAGTTCCTGATCGGCGACATCCTCGGCAGTCCAGGTGACAGCCTCGAGGTGGTGCTGGAAGGTGAGAAGGCCGGTCTGTGGACGGATCGTGCCACCGGCGATGGCGGCGACATCTTCGCCCTGATCGCGGCCTATCTCGGTGCGAACGTCCACACCGATTTCCCTCGCGTGCTGGATGAAGCTGCCGATCTGCTCGGGCGGTCGCGGTCGGTGCCAGTGCGCAAGGCGAAGAAGGAAGCGCCTGTAGACGACCTCGGCCCGGCCACGGCGAAGTGGGACTACTTCGATGCCGGTGGCAAGCTGATCGCCGTCGTCTACCGCTATGACCCACCGGGAGGCAAGAAGGAATTCCGACCGTGGGACGCGAAGCGCCGCAAGATGGCCCCGCCTGAGCCGCGCCCGCTGTTCAACCAGCCGGGCATCGGTGCGGCCAGCCACGTCGTCCTGGTCGAGGGCGAGAAGTGCGCGCAGGCCTTGATCGCCAGCGGCGTGGTGGCCACCACCGCCATGCACGGTGCCAATGCCCCGGTCGACAAGACCGACTGGTCGCCACTGGCTGGCAAGACGGTGCTGATCTGGCCCGACCGCGATGCGCCAGGGTGGGACTACGCCGACCGCGCGTCGCAGGCGATCTTGCAGGCAGGCGCGACCTCGGTCGCCATCCTCATGCCACCCGACGACAAGCCGGAGGGGTGGGACGCTGCAGATGCCATTCCCGAAGGTTTCGATGTCGGTGGCTTTCTGGCCGTCGGCGAGCGGATGCCGGTGATGCGCTCGGTGGAGGAAGCGCCTTCGCCAGACTTGCTGACGGGCATTGATTGGACGACCGAGGATGGCCTGTCCAGCGCTTTCACCCGCCGCTATGGCGAAGACTGGCGCTACTGTGCCCTGTGGGGCAAGTGGCTGGTCTGGACGGGTGTGCGCTGGAATCCCGATCAGGTGCTCTACGTGTCGCATCTTTCCAGGGGCATCTGCCGTAACGCCTCGCTGAAAGCGGACACGCCGAGGCTCAAGGGCAAGCTGGCCAGTTCGGCCACGATTTCGTCGGTTGAAAAGATCGCGCGCTCTGACCCGAAGCACGCATCCACCGCCGAGGAATGGGACGCCGATGTCTGGGCGTTGAACACCCCCGGTGGCGTGGTCGATCTGCGCACCGGCCGGATGCGCCCGCACCGGCGGGACGACCGAATGACCAAGGTGACCACG
GCTACCCCGCAGGGCAATCCGGACAGTGCCTGCCCAACGTGGCGAGGGTTCCTGACAGACGTCACCGGCGGCGATGCCGATCTGATGGCCTACCTGCAACTGATGGTTGGCTACTGCCTGACGGGCGTCACCAGCGAGCACGCGCTGTTCTTCCTGTACGGCACGGGCGCGAACGGCAAGTCGGTGTTCGTCAACGTGCTAACCACCATCCTGGGCGACTACGCGGCCAACGCCCCGATGGACACTTTCATGGAGGCGCGCAATGACCGACACCCCACCGATCTCGCCGGGTTGCGCGGTGCACGATTCGTGTCATCCATCGAAACGGAGCAAGGGCGGCGCTGGAACGAGTCCAAGGTCAAGGCCATCACCGGTGGCGACAAGGTGTCCGCGCGCTTCATGCGCCAGGACTTCTTCGAGTACCTGCCGCAGTTCAAGTTGGTGATCGCGGGCAATCACAAGCCGTCGATCCGCAACGTCGACGAGGCGATGAAGCGTCGACTGCACCTGATCCCGTTCACGGTGACGATCCCGCCCGAGCGCCGCGACGGCAGGCTGACCGAGAAGCTGCTCAAGGAACGCGATGGGATTTTGGCGTGGGCCGTCGAGGGCTGCAGCCGCTGGCAAAGCCAGGGCTTGAAGCCGCCCGCCAGCGTGGTGTCGGCGACCGAGGAGTATTTCGAGGCCGAGGACGCGCTCGGGCAGTGGATCGAAGAACGCTGTCTGCTGGCCAAGTCGCACCGCGAAGGTGTCTCCGAACTGTTCGCCGATTGGCGTGAATGGGCCGAGCGCGCTGGCGAGTACGTGGGCTCGGTCAAACGCTTCTCGGAGCTGATGGCGACTCGCAAGTTCGACAAGTGTCGGCTGACCGGAGGGGCTCGCGCCATCGCGGGCATCGCCCTCAGGCCCAAGCCGTACAGCCACGCCTACCCCTACCGCGATGACTGATCAATCCGGTCGAGTGACGGATTTGACGGGTTTCCTGATTGACGCGCTACACGTGCGCGCACGTAAAGGGCGTTGTCCTGACAAACCGTCGCATCCGTCACTCGCCCACCCAACACGGAGTAAAGACGATGAAAACGACGATCCTCGCCCTCGATCTGGGCACACACACCGGGTGGGCTCTGCAGCACCTGGACGGCACCATCACCAGCGGCACGGAGCACTTCAAGCCGCAGCGATTTGAAGGCGGCGGGATGCGTTTCCTTCGATTCAAGCGCTGGCTCAACGAACTGCTGTCGGTCAGCAATCACATCAACGCGGTGTTCTTCGAGGAAGTTCGGAGGCACGCTGGCGTTGACGCAGCGCACGCCTACGGCGGATTCATGGGGCACCTGACCGCGTGGTGTGAACATCACAACATCCCCTACCAGGGCGTTCCGGTCGGCACGATCAAGAAGCACGCGACCGGCAAGGGCAATGCGAGCAAGGACGAAATGATCACGTCCGTCCGCGAGCGTGGTCACACCCCAGTCGACGACAACGAAGCCGACGCGCTGGCTCTGCTGCACTGGGCAGTCGAGACGCAGGAGGTGTGACGTGAAGGTTTCGACACCCCAATACCGCTGCCCCCTTGGTCGGCTGCAACCCCAGACCACCGATCTGGACGCCATCAAGGAACGTGGCTGGCGTGACCAGCACATCCTGGTGGTCAACGCGTCCGACGACCGTCTGGACTTCATCGAGCGCGAGATCGTGCGACGCATTGGTGAACGCCTGTACGGGCTGGGAGGGACGCGTCATGGCTGAGTGGACAACCGACGACGTGGCAGCACGCTTCGAGGAGGCCGCCACCACCGGACGACGCTTGCCCCCTGTACGTGTGCAGGGCTACTTCAACTGCTGGCCTGCCTTCGTCCGCAAGGAGTGGGAAGCCTTTGCTGCTGACGAGAAGGTGTATCGCCCCTTCCCACCAAGCCCCGAGGCCATCGACCGGATGCTGGAGACGATGCGCTGGGTGCAGTGGCTCGAGGTCGAGCAGCGACACCTCGTGTGGA
TGCGGGCCAAGCGCTACGGCTGGAGGGACATCACCATTCGATTTGCCTGCGACCGCACCACGGCGTGGCGGCGTTGGCAGAGGGCAATGGAGATCGTGGCCACGAACCTCAACAGCGAAGGCGTGCGGTTGCCTTCCAAAAACGTGGGCAATTTAGGGTAATGCTTGCCGCGCTTGTCCCTGCTTTGCCTTGATTGTCCGTTTCGAGGCCCGGCAGCCCTGCAACAAAACAGCCCGGTCGGGGGTAGTATTTCGGCTATCTTCTGGACAGCGGTGACGGTTGAGGCGATGGGCCCAGGCAAAAGGGGTCCTTCCTTCCCGAATCGCAATGCGGGGGGCGCGAGCGCGGCATTCGCCTAGCGTCCGACTGCAAACCAAGGTTTGCAGGGTTTGCAGTTTGCACCCGCACCAGTCCGCACCCATCACGAGCCCGCCCACGGTTTTCCGTCGGCGGGTTTTCTTTTTGAGGAAACGATTCTGAACACGCTTAACGTTGAGTACCGCAAGGTCGAGGCGCTGATCCCCTACGCCCGCAATCCACGCACTCACACCGACGAGCAGGTGGCCAAGATCGCCGCCAGCATCGTCGAGTACGGCTGGACGAATCCGGTGCTGGTGGACGGCGACAACGGGATCATTGCGGGCCACGGTCGTTTGGCCGCCGCGCGCAAGCTCGGGCTGGATCAGGTACCGGTCATCGAACTGGCGCACCTCTCACCCACCCAGAAGCGTGCCTACGTCATCTCCGATAACCGGCTGGCGCTCGACGCCGGTTGGAACGAGGAGATGCTGGCGCTGGAAATGGCCGAGCTGTCCGAGGCCGGGTACGACCTTGCACTGACCGGTTTCGAGGATGCTGAGATCGAGGCCTTGCTCGCTGACGAAGTCGCCTCCGATGCCGCCGACCAAGAGCCCGATGCCGACGAGCCGGACGATGGCGACGATGTGCCGGATAGCCCAGTGGTGCCGGTGTCCCGCACCGGCGATTTCTGGGCCATCGGTACCCACCGTCTGATCTGTGGCGACGCCACCGACCCGACCGTGGTCGCCACTCTAATGCAGGGTGATGCGGCCCGGCTGTGCTTTACATCACCGCCTTACGGCAACCAGCGCGACTACACCTCCGGCGGCATCACCGATTGGGATGGCCTGATGCGCGGTGTGTTCGCCAAGGTGCCAATGGACGACGACGGGCAGGTGCTGGTCAACCTCGGGCTGATCCACCGCGACAACGAAGTCATCCCGTATTGGGATGCGTGGCTGGGCTGGATGCGCACGCAGGGTTGGCGGCGCTTTGCTTGGTACGTCTGGGATCAGGGGCCGGGGATGCCCGGAGACTGGGCTGGTCGTTTTGCGCCGAGTTTCGAGTTCGTCTTTCACTTCAACCGCTCTAGTCGCAAGCCCAACAAGATCGTGCCCTGCAAGCACGCGGGCCAGGAATCGCACTTGCGCGCCGACGGGTCGTCCACGGCCATGCGCGGCAAGGACGGCGAAGTCGGTGGCTGGACGCACAAGGGCCAGCCGACGCAGGACACCCGGATTCCCGACTCGGTGATCCGCGTGATGCGGCACAAGGGCAAGATTGGTCAGGACATCGACCACCCGGCTGTGTTCCCGGTGGCGTTGCCCGAGTTCGTGATCGAGGCCTATACGGACGCAGGCGACATCGTGTTCGAACCTTTTGGCGGCAGCGGTACCACGATGCTGGCCGCGCAGCGCAAGGGTCGTGTGTGCCGCTGCGTGGAGATCGCGCCGGAGTACGTGGACGTCGCCATCAAGCGCTTCCAGCAGAACCACCCCGGCGTGCCCGTCACGCTGCTGGCCACAGGCCAGTCCTTCGACGATGTGGTCAATGAACGTCAGGCCACCACGGAGGTAGAGCAATGACCGCCTCCTGGTTTGCCGACAAGATCGAAAAGTGGCCGACTGCCAAGCTGCTGCCCTATGCCCGCAACGCGCGTACTCACTCGGACGATCAGGTGGCGCAGATCGCCGCGTCGA
TTGCCGAGTTCGGATTCACCAATCCGATCCTGGCGGGCAGCGATGGCGTGATCGTCGCCGGTCACGGACGGCTTGCTGCTGCGCAGAAGCTTGGGCTGGCGGTGGTGCCGGTGGTGGTGCTCGATCATCTGAGCCCGACACAGCGCCGGGCCCTGGTGATCGCAGACAACCGCATCGCCGAGAACGCGGGCTGGGACGATGCGATGCTGCGCATCGAGATCGCATCACTGCAGGACGACGACTTCGACGTGTCGCTGACCGGCTTCGATGCAGATGCGCTGGCCGAATTGATGGCGGGCGACGAGCCGGATGGCGAAGGCGAAACCGATGACGATGCCGTGCCCGAGTTGTCGGAGACGCCGATCTCTCGTCCGGGTGATGTCTGGTCGCTTGGCGGCCACCGGCTGCTGTGCGGGGACTCCACCGTGACTGAGAGCTACGACAGGCTTCTCGATGGCGAGCAGGTCGACATGGTGTTCACCGACCCGCCGTACAACGTGAATTACGCCAACAGCGCCAAGGACAAGATGCGTGGCAAGGACCGCGCGATCCTGAACGACAACCTCGGCGACGGCTTCTACGACTTCCTGTTGGCGGCGCTGACGCCGACCATCGCGCATTGCCGGGGCGGGATCTACGTGGCGATGTCGTCCAGCGAACTGGATGTACTGCAGGCCGCATTCCGCGCCGCCGGTGGCAAGTGGTCGACGTTCATCATCTGGGCCAAGAACACCTTCACGCTGGGCCGTGCCGATTACCAGCGCCAGTACGAGCCGATCCTGTACGGATGGCCAGAGGGCGCGCAGCGTCACTGGTGCGGCGACCGCGACCAGGGCGACGTCTGGAACATCAAGAAGCCGCAGAAGAACGACCTGCATCCGACGATGAAGCCGGTGGAGTTGGTCGAGCGCGCGATCCGCAATTCGAGCCGACCGGGCAACGTGGTGCTCGACCCGTTCGGGGGCTCCGGCACGACGCTGATTGCCGCCGAAAAGTCAGGACGGCTGGCACGGCTGATCGAACTCGACCCTAAGTACGCGGACGTGATCGTGCGCCGCTGGCAGGAATGGACTGGCAAGCAAGCCACCCGTGAGTCGGATGGCGCGCTGTTCGATGATCAGGCGGCGATCGACTCTTCCGCGATCTCGCAATGAATCACGAACCCCGTCAGGTAAGGCAGGCCGCGCGGGATGCCGTACTGCTTGCTGGTCTGGCGGCCAATCGTCCAGCCCATCCACTGTTGGGTGGCGGCGTTGATCGCGTCCGCCAGGGTCTGGCCCCGGTACAGCCCGTTTTGCAAATCGTCCGCAAAGTGGCGGCCGTGGCGACTGTCGAGGAAGACGCGGACTGATTCGAGGGGCTGACCGGTAGCGTCGGAGATGGCGGTCATCGCCAGGGGCCACGCGGTGCTGGCGTGTTCGTTCATCGTGCCCCAAAAGCCCCAGGCATCGTTCTGGGTGGCGGGCATTTGCTGGTTGGTGTTCATCTCTGGCTCCTTGGGGTTGATCGTTGCGACACCCGTAGTAACGCGCTGTTCGATTGAGAAGCCAAGCTGTTCTTGGCCTCTTTCTCAATCAATTTCGATTACCCGAGACGGGCCACGTACCGGGCGTAGTCGCCGCCCTCTGGATTCACGTAAAGGTAGGGGCGACCCGGTGCGGTGACCTCGACGCAAAGATAGCCGTCGCCGGTGCCGCCACCTTTGCCGCGCAGCCAGTCGCGCGATACCAACAGGCTGCGGGCAAAGGCATCGAACTCGTCGACGGTCAGTTCCTTGGTCTCGGTGACATAGACCTTGGTCTGACCCTGGCCGCCAACTTCGTCCAAGTCGGCAGGCTTACGGGCAAACGGCAATCGGACGCTCAACTCCTCGACCTGGAAGGTGGTGTCGCCAAACTGCAGGGTGCGCGGGGTGCGTTCGATGGTGATGGTCATGGTGCTCATGAATGTTCTCCTGGGTGTTGGCGTTGCGATCAGGCTTCTGCGGCGATCCGGTAGAC
CCGCTCGCTGCCCTGGGCCTTGTCCGAGACGATGGTCAGCCCGAGCTTCTTCTTGAAGGCACCGGCAAAGGTGCCGCGCACCGTGTGCGCCTGCCAGCCGGTGGTCTCGCAGATCTGCTGCACCGTTGCCCCTTCGGGGCGCTGCAGCATCTGGATCACCGTGGCCTGCTTGCTGTTCTCGCGGGTGCGAGGTTTGGCGGCCGCCTTTTCTTGCGCCCACGCGGCCTCGGCTGCCGTCACGGCTGCGTCGAGTTCAGGGTCTGCGGCCACTGGCGCAGGCGTTGGCCGGGCGCGCCCCATCGCGTCGTAGCCCTCGGCGGCGACGAACCAGTGGGTGCCGTCGGAGGTGATCAGCGCGCGGTTGAACAGGCCGTCGAGCACCTTCTTGCGTGCGCCGCCTTTGATGTTGTCGGGGAACCAGTCGATCTTGCCGTCGGTGTGTTCGAGGGCGTAAGCCAGGATCGCGTGCTGGGCCGGGGTCAGTTGGGTGGTGGTCATTTGCTTCTCCTTGTGCAAGGGGTTGATGGGGTGACGTGATGAACGCGCTGTTCGGGAGTGAAGCCAAGCGTTTTCTGCTTGGCTTCGAAGGTTCTTGATCAGCTGTTGGCCTTGTCCGACTTCGTCGCCTTGCGGCCTTGTTCGACGCCTGCGTTGAACGCGGCCTCCAGGGCGTCGCGCAGGCACCAGACCGCCACGTCGTGGAAGTCGAGGCTGTCTGACTTGCGGGTTTCCAGGGTTTCGATGCCCAGCTTGTTTTGTGCGATCTGGGTCAGGAGTTGTTCGAACTTGCTCATTGCTGCTTCCTTTGATGGTGTTGATGACGTCCGTATGAACGCGCTGTTCCAGAGAGAAGCCAAGCTGATTTCGAGTGAAGGTCGAAAAAATGATTGAAGGGGTAACCGGTTCTCAAAATGGGCATTTCGATTCGCGCTTACGCCCGTCACCGTGGTGTGACCGACACCGCTGTTCACAAGGCAATTCGCGCAGGTCGGATCACGCCGGAGGCTGACGGCACCATTGATGCCGACCGTGCTGATCGCGAGTGGGCTCGCAACTCCGATGTGCCGAAGACCGGTACGCGGGCCAAGGCCGCAAAGGTCGCCGTGCCGGAAGGCGGTACGGGTGTTGGCGGTGATGGGCCCGCCGCATTACCCGCTGGCGGCGCGTCCTTACTTCAGGCGCGCACGGTCAACGAGGTCGTCAAGGCGCAGACGAACAAGGTGCGTCTGGCCCGACTGAAGGGCGAGTTGGTGGATCGGCCGCAGGCCATCGCCCACGTCTTCAAGTTGGCGCGCTCCGAGCGTGATGCGTGGCTGAACTGGCCCGCGCGCATCTCTGCGCAGATGGCGGCCAAGCTCAATATCGATCCGCACACGATGCACGTCGCCCTGGAGGCGGCGATACGTGAGCACCTGCAGGAACTGGGCGAACTCCGGCCCCGGGTGGACTGATGCTGAATGTTGAATACGAAGGCGCTGCCGAAATCGAGCGCGCGTGGCGTGAAGGGCTGACACCTGATCCTCTGCTCTCGGTCTCTGAATGGTCGGATCGCCACAGGATGCTATCGAGCAAGGCGTCCGCTGAGCCTGGGCGCTGGCGCACCAGCCGCACGCCGTACCTGAAGGCCATCATGGACTGCCTGTCGCCGACCTCGCCGGTCGAGCGCGTGGTGTTCATGAAAGCCGCACAGCTCGGTGCGACTGAAATGGGCTCGAACTGGATTGGCTATGTGATTCACCACGCACCGGGGCCGATGATGGCGGTGTGGCCAACGGTGGATATGGCTAAGCGCAATTCCAAGCAGCGGATCGATCCGTTGATCGAGGAGTCGGCGGCACTGAGCGAATTGATCTCCCCAGCACGGTCACGCGACTCGGGCAACACCATTCTGGCCAAGGAGTTCCGGGGCGGCGTGCTGGTGATGACCGGGGCGAACAGCGCGGTGGGCTTGCGCTCGATGCCGGTGCGCTACCTGTTCCTCGATGAGGTTGACGGGTATC
CGCTGGACGTCGAGGGTGAAGGTGATGCGATCTCGCTGGCCGAGGCGCGCACGCGAACCTTTGCCCGGCGCAAGATCTTCATCGTGTCGACGCCGACGATCTCGGGGGCGAGCGCCATCGAACGCGAGTACGAGGCCAGTGACCAACGTCGCTACTTCTTGCCTTGTCCGCACTGCTCGCATCGCCAATGGCTGCGCTTCGAGCAGTTGCGATGGGAAAAGGGGCAACCGGACACGGCGTCCTACATCTGCGAGTCCTGCGATAAGTCGATTGCCGAGCACCACAAGACCTGGATGCTGGAGCACGGTGAGTGGCGCGCGATGATCAGCGACGGCACGGGCAAGACAGCGGGGTTTCACCTGTCGTCGCTTTACAGCCCGGTTGGCTGGCGCGGTTGGCGCGACATTGCTGCCGCGTGGGAAAGCTCTGTGAACAAGGAATCGGGGTCGGCGGCCGCCATCAAGACCTTCAAAAACACCGAACTGGGTGAAACCTGGGTTGAGGAAGGCGAAGCGCCAGATTGGCAACGGCTGGTCGAACGCCGCGAGGACTACCGGGTTGGCACGGTGCCGCCGGGTGGGTTGCTCCTGGTGGGCGCTGCCGACGTGCAGAAGGATCGCATCGAGGCGTCCATCTGGGCCTTCGGGCGCGGCAAGGAGTCCTGGTTGGTCGAACACCGCGTGCTGATGGGCGACACCGCCCGAGACGCCGTGTGGAAGCGACTCGCCGAGTTGCTCGCCGAAAACTGGACGCACGCCTCGGGCGCGGCGATGCCGCTGGCCCGTTTCGCTCTGGACACCGGCTTTGCGACGCAGGAGGCCTACGCCTTCGTGCGGGCCTGCCGTGACCCGCGCGTGATGCCGGTCAAGGGGGTGCCGCGCGGCGCGGCCCTGATCGGCACGCCGACGGCCATCGATGTTTCGCAGGGCGGCAAGAAGCTGCGCCGGGGCATCAAGGTGTTCACGGTGGCGGTTGGCATCGCCAAGCTGGAGTTCTACAACAACCTGCGCAAGGGCGCGGACGTCAGCGAGGACGGCGTGACCACCGTCTACCCGACGGGGTTCGTTCACTTGCCAAAGATTGACGCGGAGTTCATTCAGCAGCTCTGCGCCGAACAGTTGATTACCCGTCGCGACCGCAACGGCTTCCCGGTGCGCGAATGGCAAAAGATGCGCGAGCGCAATGAAGCGCTCGATTGCTACGTGTACGCCCGCGCGGCCGCATCGGCGGCGGGCCTGGATCGCTTCGAGGAACGCCACTGGCGCGAACTGGAACGCCAACTCGGGATGGAACGGCCACCGGATGAGCCACCCCCGATTCAAGCATTCGACCCAAACGAGGCCACCCAACGCGGTGGCCTCTCTGTTTCTGCAAACCCACCACGGCGGCGCGTCATCAAGAGCCGCTGGTTGTCCTGATTTTCAGAGGAGTTTTCATGAGTCTTGCCACCCGTATCGAGAGCCTGGTCATCCGGGTTGCCCAGGAGTTCAACGACGTCCGCGCGACGGCAGGCAGTCTGGCCAGCCTGTCCACCAACGACAAGTCGAGTCTGGTCGCCGCCATCAACGAGCTCAAGGCAGCGGTTCTGTCCGCGATGGCCATCGATGACAACCAGATCGCCACCACCAGCACCTACTCGTCGAACAAGATCGTGTCGCTGCTGGACGCGCTCAAGACCGACATCCTGGGCGGAGCCGATGCTGCCTACGACACCCTGGTGGAAATCCAGCAGGCGCTGCAGAGCGGTACCAGCGGCCTGGACGCGATTCTGGCTGCGGTCAATCTCCGTGTCCGCTTCGATGCGGCGCAGACCCTGACCGTGGCCGAGCAACTGCAAGCACGTACCAACATTGGTGCGGTCGCTGTCAGTGATGTCGGCAACACCGACACCGATTTCGTCGTGATCTTTGACGGCGCGCTGGCCTGATGAGCCTCGCTTCCAGCATCGCCGCTTTGGCGGCGCGCATCGGCTTCGAGGTCAAAACCAAGATCGACGCCAC
GCATCCCGGCATTGCCCGGGTGTGGGTCAGCTTCGGCTACGTGGGCGGTCAGGTCGTGATCGCCAGCGCGCACAACGTCGCCAGCGTGGTGCGCACGGCGGCGGGCCGGTACCGCGTGCATTTCGCTGTGGCGATGCCGGATGCGAATTACTGCTGGACGGCGCTCGCGCGCAGCAGCACCAACACCGGTCAGCAGCGCTTGGCCCTGGTACGTGCCAGCTCCGACCTGAAGACCGCGCAGTACGTCGACGTCTCGTGTGCGACGGCCGCGTCGTCGTTTGACGACTCCTCTGAAATCAACCTCGTGGTGTACCGCTGATGGCCTACACAGAAGCCCAACTCCAGGCATTGGAGACCGCGCTCGCCAAGGGCGAACACCGCGTCAGCTTCGGCGACAAGACCGTCGAGTACCGCTCGGTCGATGAACTGAAAGCTGCGATCCGCGAAGTCAAGCGCGGCATCCTGGAGCAGGCAGCCGCCACCGGACTATGGCCGGGTGCGCCGCGCCAGATCCGGGTCACGACCTCGAAGGGGTTCTGATGGCCTGGTATTCGAAGATCCGAAGCCTGTTCGGCCAGCAACCCGTCCACGAAGCGGCTGGCCGTGGTCGCCGCTCGTTGGCTTGGATGCCCGGCAACCCGGGCGCGGTCGCCGCGATGCTGGCGACCAACACCGAACTGCGCATCAAGAGCCGCGACCTCGTGCGCCGCAACGCGTGGGCGCAAGCCGGTATCGAGGCCTTCGTGTCCAACGCGGTCGGCACTGGCATCAAGCCGCAGAGTCTTGCTGCAGACGAGCGCTTCAAGACCGACGTGCAGGCGCTGTGGCGTGACTGGACAGAAGAAGCCGACGCCGCAGGACAGACCGATTTCTACGGCCTGCAGGCATTGGCCTGTCGCGCGATGCTCGAAGGCGGTGAATGCCTGATCCGGCTGCGCCCGCGCCGCCCGGAGGACGGACTGGTCGTTCCTCTGCAGCTTCAGTTGCTGGAGCCCGAGCATCTGCCGATCAGCCTCAACCTCGATCTGCCTTCGGGCAACGTGGTGCGCTCTGGCATCGAATTCGACAGCCTCGGGCGGCGCGTCGCTTACCACCTGTACCGCTCGCACCCCGAAGACGGTCGGCTGGCTCCGATGTCGGGCCAGGGCGGGATGGACACGGTGCGCATCGATGCGAAGGAAATCATCCACCTGTTCCGCGTCCTGCGTCCCGGCCAGATCCGGGGCGAGCCGTGGTTGTCGCGGGCCCTGGTCAAGCTCAACGAACTTGACCAGTACGACGACGCAGAACTGGTGCGCAAGAAGACCGCCGCGATGTTCGCCGGGTTCGTGACACGGCAGAACCCGGAGGACAACCTGATGGGTGAAGGTGCGGCCGATGGCGATGGCATTGCGCTCGCCGGGCTGGAACCGGGCACTTTGCAGATTCTGGAGCCCGGCGAGGACATCAAGTTCTCCGACCCGGCCGACGTCGGTGGCTCGTATGGCGAGTTCCTGCGCACGCAGTTCCGCGCGGTCGCCGCTGCCATCGGTGTCACCTACGAGCAGTTGACCGGCGACCTCACAGGCGTGAACTACTCGTCCATCCGCGCCGGGATGCTGGAGTTTCGGCGTCGCTGCGAGATGGTGCAGCACGGGGTGCTTGTGCATCAGATGTGCCGTCCGGTTTGGGCCGCGTGGATGAAGCAGGCAGTGCTCGCCGGTGCCATCGATGCTCCCGGCTTCGCGCGTGGCGGCCCAGCCCGTCGCCGCCGGTACCTGCAGGTGAAGTGGATTCCACAGGGCTGGCAGTGGGTCGATCCTGAGAAGGAGTTCAAGGCCATGCTGCTGGCCATCAGGGCGGGACTGATGAGCCGCTCGGAAGCCATTTCCGCCTTTGGCTACGACGCCGAGGACGTTGACCGCGAGATCGCCGCCGACAACCAGCGCGCCGACGACCTGGGGTTGATCTTCGACTCCGACCCGCGCCGCACCTCCAAGGACGGCGGAAGCGCCGAGCCG
AACAAGAACGCTGCCGACACCACGCAAACCGGCAGCTCATCGTCTGCCTGAAGGATTTCCATGACCCTGTTGCCCCATTTGGCGGCGCGCCTCTACGGTGTGCCGCTGGCGATCCATCGCCCAAAACTTGACGTGATCCTGGCCGTGCTCGGCCCCCGGATCGGCTTGGCTGATTTGGCTGCACCCTCGGGCTTCACGCCGCCCGCACGTCCCGCATCCACCCAGACGACGAAGGTCGCGGTCATCCCCATCCACGGCACGCTGGTGCGCCGCACAGTGGGCCTGGAAGCCGAATCCGGCTTGACCAGCTACGCAGGGCTGACCGCGCAGTTGGACGCCGCGCTGGCCAGCCCGGATGTCGCTGCCATCCTGCTCGATGTCGACTCACCGGGTGGCGAGTCGGGCGGCGTGTTCGATCTGGCCGACCGCATCCGTGCGGCTGCTAAGACGAAGCCGGTCTGGGCTGTAGCCAATGACATGGCGTTCTCGGCAGCTTACGCCCTGGCGTCTGCGGCCAGCAAGGTGTTCGTGTCGCGCACCGGCGGCGTCGGCTCGATTGGCGTCATTGCGATGCACGTCGACCAGTCCGAGAAGGATGCGCAGGACGGCGTTCGGTACACGGCGGTCTTTGCGGGCGACCGCAAGAACGATCTGAACCCACACGAGCCGATTTCCAGCGAAGCCCACGCCTTTCTCAAGGGTGAGGTGAATCGCGTCTACGGCCTGTTCGTCGAGACGGTGGCCCGCAACCGTGGCATCGAGGCATCTGCCGTGCGCGACACCGAGGCGGGGCTGTTCTTCGGGCAGGCCGCCGTGGCTATCGGGTTGGCCGATGCCATCGGCACCTTCGACGACGCCCTTGCGCAGCTTTGCGAATCCGTTTCCCCACTCCCGAAGTTGGCGGCAAGCCACTCCGGTCTTTTTAGCAACCCCCAGATGGAGTCATCAATGAATGATCGAACCGACCCCGCTGCTCCTGATCGGCTTGCTGCTGATCCTGCTGGCAGTCCTTCTCAACCGGCGGCCGCCACCGCCATGACCGTGGCTGACGCGATTGAGGTCGCCCAGACCTGCACCCTGGCCGGGCGCACCGACCTGATCGCGGGCTTCCTCGAAGCGAAGGCACCACCCGCCAAGGTACGCAGCCAGTTGCTGGCCACCCAGGCCGAAGCCAGTCCCGAAATCGTCAGCCGCATCGACCCGCAGTCGGCCATGTCGGCGAGTAGCACTGGCCATCCTGCCTCTTCCCACAACCCTCTGATCCAGGCCGTCAAAAGTCGCCTGGGCACAAAGTAACCCAAAAAGGAGCATCCCGTGCCCGCAATGCAAGAACCAATCAACCTCGGCGACCTCCTGAAGTACGAGGCGCCCAATCTCTATTCGCGCGACCGCGTGACCGTGGCAGCTGGCCAGACCTTGCCGCTGGGTACGGTGCTCGGGCAGATCACGGCGACGGGCAAGGTCAAGCAGATCGACCCGTCGGCCACCGATGGCAGCCAGTACTCCGCTGGTGTGCTGATGCAGGACGCCGATGCTGCTCTCGCCGACCGCAACGACGGGCTGATGGTGGCGCGTCACGCCATCGTGTCAGACCACGCACTGCATTGGCCCACCGGCATCACGACTGCGGAGCAGCAAGCAGCGATCCAACAACTCAAAGCACTGGGCGTCCTGGTGCGTATCGGCGCCTAACGCCAAGGAGACTCAATATGCAAAACCCATTCATCAGTCCGGCATTTTCGATGGCATCAATGACTGCAGCCATCAACTTGATCCCCAACCGCTACGGACGCCTGGAGGAGTTGAATCTGTTTCCGCCCAAGCCGGTTCGAACGCGCCAGGTGATTGTTGAAGAACGCGCCGGTGTCCTGAACCTCCTTCCGACCCAGCCGCCAGGCTCTCCGGGAACAGTGAATGTGCGTGGCAAGCGAACCGTCCGGTCCTTCGTCGTTCCGCACATTCCGCACGACGACGTTGTGTTGCCCGAAGAGGTTCAAGGT
CTACGTGCTTTTGGCAGCGAAACCGAAATGGAGTCGATTGCCGGAGTGCTGGCCCAACACTTAGAGACGATGCGCAACAAGCACGCCATCACCCTAGAGCACTTGCGTATGGGGGCGTTGAAAGGCGAGATTCTCGACGCCGACGGCAGCCGTATCTACAACCTGTTTGACGAGTTTGGCATCGATCAACAGAGTGTGGACTTCGAAATCAGCAGCCCGACTACTGGCACTGATGTCAAGGGCAAGTGCACTGATGTGTTGGGCATCATCGAAGAAGCCCTTCTCGGCGAGTTCATGACGGGAGTCCACTGCTTGTGTTCTCCAGAGTTTTTCAAGGCATTGACCGGCCACAAGGATGTCAAGACTGCCTTCACGAACTGGCAGCAAGGCGCCGTCCTTATCAATGATGTTCGCCGTGGCTTCACTTTTGGCGGCATCACTTTCGAGGAGTACCGAGGTAAGGCGACTGATGTCAACAAGACGGTTCGTCGCTTCATCGCTGCTGGCGAAGCACATGCGTTCCCTCTTGGCACTATCGACACCTTCGGAACTTACTTTGCACCGGCCGACTTCAACGAGACTGTCAACACGATGGGCCAGCCGCTTTATGCGAAGCAGGAGCCGCGCAAATTCGACAGGGGCACAGATCTGCACACGCAGGCCAACCCGCTACCGATGTGCCATCGTCCCGGGGTTCTGGTCAGGCTCGTCATGGGTGGTGGCGTATGAGTTTGGTCGCCCAGATCTATGAGTCGGCCGCGAACGCTGGGCTGCTGAAGGAATGCCTTTGGTATCCGTCGAACGGTGCGCCATCGCAACTACATCAGATCGGCTTTGCCGCGCCCGATGAATCACTGCTCGATGGCCTGGCCCTGAGCACCGACTACGAGATGACCTACCCGGTCACGGCATTCGGGGGTCTTGCAGTCCGCGAGGTTGTCGAAATCGGTGGCACGTCCTTCCAGGTGCGAGACATCCGATCGTTAAGCGACGGCTCCGAGATCCGCGCCAAGCTCACCCGGCTGTAAACCCATGGCAGATAACTCGATCCGCGAGCGGATTCTGCTGGCGGTGATGGCGGCTGCCCGTCCGGCGGTCGAAGGTCTCGGGGCCACTTTGCACCGGTCGCCCACGGTGGCCATCAGCCGCGAACTTTGCCCGGCGCTCGCGGTGTTTCCCGAGTCGGAGTCCATCACTGAGCGCGCCAACGACCGCGTCACACGCGAACTGACCGTTCGCGTTGTGGCTCTGGCTCGGGCCGTTCCACCCGCGTCCCCCGAAACCGAGGCCGACCGTCTGCTCACCGCTGCCCACGCTGCCTTGTTCGGGGACGGCACGTTCGGTGGGTTGGCGCTGGGCATCCGTGAACAAGAGAGCGAGTGGGAGGTCGAGGACGCCGACGCGGTGGCCGTGGCCCTCCCGGCGCGCTATCGGCTGACGTACCGGACGCTGGCCAATGACCTTTCAACTCTTGGATGACACCTATGACCCAACTTGTCCTGACGCGCCCGCACACCCACGCGGGCAAGACCTATGGCGTCGGTGACCGGATCGAGATCGACGCGACATCAGCCGACTGGCTGATCGCGCACGACATCGCCACGCCGGAGCCGACCGCCCCAACTGCTGAACCCGTCCCCGAACCCAAACCCCTCCAACGCAAGGAACCCAAGCAATGAGCACCTATGCCAGTTTTCAAGGCCGCGTCTTCCTCGGCAAGCGCGACACCGACGGCCTTCCCATCGAAGTGCGCTCGCCCGGCAACGTCGCAGAGCTGAAGCTCTCCCTCAAGACCGACGTCCTGGAGCATTACGAGAGCCAGACCGGCCAGCGCTCGCTGGATCACCGGATGGTCAAGCAGAAGTCCGCCACCGTGAACCTCACCATCGAGGAATTCACCAAGGAGAATCTCGCGCTGGCCCTGTACGGCAACCACGTCGTCGGCACGCCGGGCACGGTCACCGCCGAGCCAGTGGGCGGTGCCACGCCGATTGCGGGCG
ACCGCTACTTCCTTGCCCACCCGAAGGTATCGTCCTTGGTCGTGACGGATTCGGCTGGCACGCCCGCGACCCTGGCCTTGGGCACGAACTACACGGCTGATCCCGACTTCGGTGCCCTCCAGTTTCTGGATACCACCGGCTTCACTGCGCCGTTCAAGGCCAGTTACGCCTACGGTGTGGCCACCGAGATCGGCATCTTCACGCAGGCGCTGCCGGAACGCTTCCTGCGGCTCGAAGGCATCAACACGGCCCAGGGCAATGCCAAGGTGCTGGTCGAGCTCTACCGCGTGGCATTCGATCCGCTGAAGGAAATCTCCTTCATCTCGGACGAGTACAACAAATTCGAGCTGGAGGGATCGCTGCTGGCCGACACCACCAAGCCCTTCGACGCGGTGCTGGGCCAGTTCGGCCGCATCGTGCAACTGTGATGGGTGCCGCCATGAGTGATCTGGACACCCTGATTCCGCAGGCGGTCGAACTGGTGATCGACGGTGAGCCGCTGGCCATCAAACCGCTGAAGGTCGGGCAGATGCCCGGTTTTCTGCGAGCGATGTCGCCGGTGATGCAGCAGCTCACTGCCTCCAACATCGACTGGCTGGCGTTGTTCGGCGAGCGCGGCGACGACCTGCTGTCGGCCATCGCCATTGCCGTCGGCAAGCCTCGGGCGTGGGTCGATGAGCTGGCTGCCGACGAGGCCATCCTGCTGGCGGCCAAGGTGATCGAGGTGAACGCCGATTTTTTTACCCAGACGGTGATTCCGAAGCTCGACGGGCTGTTCGGCCAAGTGAAGCTGCCGCCCATCGTGAAAGCGGCGGCTGGTTCGATGCCGTCCAGCACCTGATCGAGCACGGTCACCGCTTGCCCGACATCCTCGACTACACGTTGGCGCAGGTGCGCGGCTTCGTCGTAGCGACGGCGCGCACCGATGCGGCCCGCGATGCACGGCTGCTGTCCGTGATTGCCATCGGCACGCGCAGCGATGCCCGCCAGCTCGACCAAACCCTCGACCGACTTACTGACAAGGCCACCGACCGTGCCTGATGACCATGCGCATTTCCGTCCAGATCGATAGCGCCGCAGCCCAGGCGCAATTGCGCCGCTGGGGCGGCGAATTCCGCGACAAGGTCAAGAAGGCGGTGTCGCGGGCGATTGCCAGCGAGGCGGTCGAACTCAAGCAGGACGTGCGCAGCCACGTCGCCAGCCAGATGGCCGTGGTCAAGAAGTCCTTCCTCAAGGGCTTCACCGCCAAGGTGCTGGACAAAGACCTGAACCGACTGCCCGCGCTGTACGTGGGTTCGCGCATTCCGTGGTCGGCGATGCACGAGACCGGCGGCCAGATTGCCGGGCGGATGCTGATTCCACTGAACGGTCGGGTGGGCCGCAAGCGCTTCAAGGCGCAGGTGGCCGAGCTGATGCGCGGCGGCAATGCCTATTTCATCAAGAACGCGAAGGGAAACATCGTCCTGATGGCCGAGAACATCAAAGAGCACGACCGGCCACTGGCGGGCTTCAAGCGCCGCTACCGCAAGGCAGAGGGCATCAAGCGCCTCAAGCGCGGCGCGGACATCCCGATTGCCGTCCTAGTGCCCAAGGTCGTACTCAAGAAGCGCCTCGATGTCGAGCGGCTGGTCGCGAGTCGCATCCCGCGTCTGGCGGCGGCCGTCGAGAATCAGATCAGTACGGTGGATTGATTCATGGCCAAGCGAATTTCCATCCTCGTCGCGCTCGAAGGGGCCGACGAGGGGCTCAAACGCGCCATCACGTCGGCCGAGCGCAGTCTCGGTGAGCTGTCGACCACCGCCAAGACCGCCGGAGCCAAGGCTGCCGCCGGAATGGCCGAGGTCAAGGCCGGGATGTCGGCCTTCGGCGATCAGGTGGCGACGGCCAAGACGCAATTGCTGGCCTTCCTATCGATCAGCTGGGCGGCAGGAAAGGTGCAAGAGATCGTCCAGATCGCCGACGCATGGAACATGATGTCGGCGCGCTTGAAGTTGGCGACG
GCGGGACAGCGTGAATTCACGACCGCGCAAGCGGCCCTGTTCGACATCGCCCAGCGCATCGGTGTGCCGATTCAGGAAACGGCCACGCTGTACGGCAAGCTCCAGCAGGCAATTCGGATGCTGGGTGGCGAGCAGAAGGACGCGCTCACGATCGCCGAGAGCATTTCGCAGGCACTGCGCCTGTCGGGCGCTTCGGCCACCGAGGCGCAGTCCTCTTTGCTGCAATTCGGGCAGGCGCTCGCCTCTGGTGTGCTGCGAGGCGAGGAATTCAACTCCGTCGTCGAAAACAGCCCCCGTCTGGCGCAGGCACTGGCCGATGGCCTGAATGTGCCCATCGGGCGACTGCGCAAGCTGGCCGAAGAAGGCCGCCTGACCGCTGACGTGGTGGTCAACGCGCTGATGAGCCAGAAGGACAAGCTGGCCAGCGAGTACGCCCAACTGCCGCAGACGGTGAGCCAGGCCTTCGAGCGCCTGCGCAATGCCTTCGGGCAGTGGATCAACCGGGTCGATGAATCGACGGGTTTGACCAAGAAGCTGGCCGAGGCTCTGACCATTCTCGCCAACAACCTCGACACGGTCATGCAGTGGTTGAAGCGCATCGCCGAAGTCGGTCTGGCGGTGCTGATCTACCGCCTGATCCCGGCGCTCATCACCGCGTGGCAGACCGCCGGTGCGGCGGCCGTCACGGCCGCCAGTGCCACCGCTGCGGCGTGGACGACGGCCAACCTGTCGGTGTCGGCCGCCGTGGCCAGCGTCGGCTTGCTCAAGACGGCATTCGCCGTGCTGGGTGCCTTCCTGGTCGGCTGGGAGATCGGCACGTGGCTGTCGGAGAAGTTCGAGATCGTCCGCAAGGCGGGCATCTTCATGGTCGAGATGCTGGTCAAGGCGGTCGAGCAGTTGCGCTACCGCTGGGAGGCATTCGCCGCCATCTTCACCTCGGACACGATTGCCGAGGCGACCCAGCGCCACGAGGCCCGTCTCGCGGAGATGAACCAGATCTTCGCGCAGATGTACGCCGACGCGACCAAGGGGGCGGATGCTGCCAAGGGCGCGATGAATACCGCCGCGACGGCTGCGGAGGAAATCGCCAAGCGGCTCGAAGCCGTGCGTCAGGGCACGCAGGAGGCAGTCGGGCGCGGTATCGAGGCTGTCCACAGCGCCCTGGAGAAGCTGAAATCCCGCCTCGGTGAGGTTGAGCAGGCTGTCGGCAAGGCCAATCAGACAGTCAACGACGCCACCGCCAAAATGGCCGAGGCCTATAAGGGCCTGACGTCCATCGTTGAGGCCAACCTGCTGCGCCAGATCGAAGCGGTCAAGGCGCGCTATCAGCAGGAACAGTCGGCGCTGGAGACATCCAAGCAGTCCGAAGCGGCGCTGATCACCAAGTCGACACAGTTGCTGACGGAAGCCCTCACGCAGCAGACCACGTTGCGGCGGCAGTCCACGACAGACACGCTGAAGCTCATTGACGATGAGTCCAAGGCGCGGATCGAGTCGGCCCGCCGCCAGGGTCAGACGGAAGAAGAGCGCCGCGCCAACGTCCAGCGGGTCGAAAACGACATCCTGGCCACCAAGCGCCAGACGATGACGCAGGCGCTGGCCGAGTACCGGCAGCACATCGATGCGCTCAACGCAGAGGCCAACCGGCATCTGACTGAGATCAAGCGCATCGAGGAGGAGAAGCGCCAGCTCTCGATGACGACCGAGGAACGTGTCCGCGACATCCGTCGGCAGGGCATGACCGATTTCGAGGCGACGGAAGATCGCAAGCGCCAGATCGCCGAGTACCAGGGGAAGGCACGTGAGGCGCTGGCCAACGGCGAGTTCGAGCAGGCTCGGCAACTCGCCCAGAAAGCGATGGACTTGGCCGCACAGGTGGCCAGCTCGCAAACCAGTGAAGCCAAGCGCGGCGAAGATGCCCGCAAGCAGTCCGAGCAGGCGGTTTCGCAGGTCACCCAGCTCGAATCGCAGTCACGCGATGCCTATCGCAAGCAGGAATA
CGCGCAAGCCGAAGCCCTGATGCGCCAAGCGGACGCATTGCGCGCCGAACTGGCCCAGAAGACCAAGGATGCCGACGCACAGATCGCACAGGGCAAGGATGGCGTCAATCAAGCCATCCAGCGCATCCGCGAGTCCGAGGAGATTCTCAACAAGACCCTGGATGCCGAAGCCAAGGCGCACCAGACCGCCGCGCAGTCGGCATTGACCGCGCGCGACCAGATCCAGCAGACCCTCACCCAGACCGAAACCCAGATCGACCAAATCACAGCCAAGCTAAAAGACGGTCTGAAGGTCACGCTGGATGCCGACACGACCCGCTTCGACAAAGCCATCGCTGATCTCGACAAGGCCCTGGCAGAAAAAGAGTACCTGCTCAAGATTCAGGCCGACTTGCAGGAGGCCGAGAAGAAGCTGCAGCAGTACGAACAACTGCTGAAAGAGGGCAAGACACTCCCGGTCGATGCCGACGTGTCCAAGGCCAAGGAGGCGCTGGACAAACTCAAGACCTACGCCGACCAGAACTCGCAGTTCGAACTGAAGGTGGCGACCGAGAAGGCGCAGGCCGCGATCACCAAAGTCGAAGGGATGATCAAGGCGCTGGACCGCATCCAGACCGAGTCCCGGCATCAGGTCAGCACCAATGCCGACGCAGCCCGCTCGGAAATCATGAGTCTCAACTGGGCCAACACCTCGAGCACGCACACGATCTATGTGCGCAAGGTAGAGGCAAACGCGACTGGCGGTTTGGTGGGCGGTGGCGTGCGCCGCTACGCCGATGGCGGCGCTGTGGCCCCGGCCTTTCCTCGGATGAGTGGTGGCTCGGTTCCGGGCTCGGGCCACCACGACACCGTGCCACGCACCCTGGATGCCGGTGCCTTCGTGATTCGCAAGGCGGCGGTGCAGAAGTACGGCGGCGGCGCGCTCTCGCGTCTGGCCAATGGCGTGGCACGGTTTGCCACTGGCGGCGCGGTGATGCTGGGTGGCGGCAAGCGCCCATCCGGCAACGATGCTGATGGCACGCCCAGCACACCAAAGAAGAACCGGGAGGCAGTCGAGGCGATGAAGATGATCGACCTCGGCCTGCAGGGGATGAACGAGTACACCAATTGGCTCCAGTGGAACTACGGTGCCTCGGTCAGTCTGGATATGCGTAGCAAGACGATGGATAGCTACGGCAAGCAGGCCCAACAGGATCGGCGCGCGCTGGAGGACTTCATCAGCCGCAAGACGCTCACCGGCAACGAGCGCCAGAACCTGGAGCGCATCAAGCAGACGTGGCGGCAGGCAATGGCCCAGCCGCTGCTTTGGGGCAAAGACCTAGAGCGCGAGCTGATCGACTATATGGAGCAGAACCAGGGCGAGTTCTACCGTCGCGGTGGCATGGCCAAGTCCGACACCGTCCCGGCGATGCTCACGCCGGGCGAGTTCGTCGTGAACAAGGATGCCGTTTCCCGCTACGGCGCTGGCTTCTTCGAAGCGATCAACAACCTGTCTGCCCCGGCACAAGCTCTGGCCGGTCGCGCGCTTGCGGGCGTTCAGGGCTTCGCCACCGGCGGTCTGGTGCAGCCAAGTGGCTCGCGGTTGGCCCGACCGGTGTTGGCGGCCGATGCCGGGCCCAGCCGCACGGTACGCGTGGAACTGTCCTCGGGGCAGCAGAAGGTCAATGCCACCGTCGACGCACGAGACGAGTCTCGTCTGCTGCAACTTCTGGACGCTGCCCGCGCCCGCACTGCCTGAAGGATTCCCGATGCAACTGACGAACCTCGATGCCGGGGTGGCTTTGCCATTGCCTGACGATTTGCTGTGGAGTGATGAGCACGCGTGGTCGCCCGCCGTGGCGACCACGTCTTACCTCATCACCGGAGCCTTGCTTATCCAGTCTGCCACCCGGCAAGCCGGTCGCCCCATCACGCTGGTGGGCGCACCCGATATGGCCTGGGTGACGCGGGCCACGGTCGAGCAACTGCAGGCCTGGGCCGCGCTTCCAGTGGGCAGCGCC
ACAGGTCGCTTCGGCTTGACCTTCTCCGATGGCCGCTCGTTCACCGTGGCATTCCGCCACGCAGAAACGGCCATCGAAGCCGAGCCCGTGCTGGGCATCCCGGCCCGTGCCGCTACCGACTTCTATCGCCTGACCCTTCGATTCCTGGAGATTTGAAATGCCGATCCAATCCGGCGACGTGAAACTGCTGAAGTCCGCCGTGATGGCGGATGTGCCCGAGGGCGGTGGCGCGCCCACGGGCAACACCATTGCCGATGGCGTCTCGAACGCCATCTTTCCTGACATCTCCGAGCTGGATCGCGCCGGGGGTCGGGTCAACCTGCGCAAGTCCTTCGTGTCGGTGCAGACCGACGACACCGACACCTACTTCGGTGCCAACGTGATCGTGGCAGAGCCGCCGCAGGATGCGCGCGTCAGCGTCACGCTGTTCAGCACCGAGAAGACCTTCGACACCCGCGAGCAGGCGCAAGTCCGCATCGAGGCCTACCTCAACAAGGGCCCGGAGTGGGCTGGCTACCTGTTCGAGAACCACATCGCCGGTCAGCGGGTGATTCAGCTTTTCCAGCGCACCACCGACACCGTTCCCAATGTCGGCCAGACCTTGGTCTTGATCGAGAACGAGGGCCTGGGCACCCAGAAGGAGCAGTACATCCGGGCCACCTCGGTGTCCGTCGTCGAGCGCACGTTCACTTACGACGGCGACAAGGACTACAAGGCCAGCATCGTCACGGTCGACATCAGCGACGCACTGCGCTACGACTTCACCGGCTCGCCTGCAAGCCGCACGTTCACCCGGGCCGCGAACAGCACCAAGACGCGCGACACGGTCGTGGCGGACGCCGGAACCTACGTCGGCGTAGTACCGCTGACGCAGGCCGCCGCCGTCGGCGACTTCACGATCAAGGGCACCTCGATCTACACGCAGCTGGTGCCAAGCGCGCAAACCGAGACGCCCATTTCCTTCGTTCCTCCCTACGCGGCCGCCGGACTGCCGGTGCCGGGGGCCGTCGCGGTGAGCTACACAGCCAGCCACGCGTGGACGACCAGCATCAAATTCAATCTCCCGGGCGGTTGCTTGCCGGGGTCACTGACCATCGGCACGGACGGCATCACGATATTTGACGACGCGGGCCTGCTCAAGACCGCCAGCGGGACGGTCGGAACCATCGACTACGCCAACGGCATCCTGACCCTGAACTCGGGGACGATGTCGAACGCGAAGGCCATCACCTACACGCCCGCCGCGCAGATTCTGCGTGCTCCGCAAAGCTCGGAGATCCCGGTCACGCCCGAGTCGCGCAGCCAGTCCTACGTGGGCACGGTCAACCCGGTGCCGCAGCCCGGAACGCTGTCCATCAGCTACATGGCCCAAGGGCGCTGGTATGTGCTTTCCGACAGTGGCAACGGCTCGCTCAAGGGCCTGGACGCCAGCTACGGCGCGGGCACCTTCAACAGGAATACCGGAGCCTTCGTGGTCACGCTGGGTGCGTTGCCCGACGTGGGCAGTTCGCTCGTGCTGACCTGGAACGTGCCGACGCAGGAGACGCAGCAGCCATCCACCACCCTGAAGGCCACCCAGAGCCTGGCATTGAACCCACCTGCAGGGACGGCGGTGCAACCCGGGTCGCTCACCGTGTCCTGGGAGTACGGCGGCACCAAGACCTCAACGGCGGCCACGTCGGGCGTGCTGTCGGGTGCCGCCACGGGCAGTCTGAGCGTGGCGCAGAACCGCGTGGACTTCGCGCCCAATGTGCTGCCAGCGGTGGGCACGCAACTCACCGTGAGCTACGTCGCGGGCCCGAAGCAGGAGGACTCGTTTGCTCACCCCTCCCGCAATGGTGCGGGGACGCTGCCAGTCACCGCGACCTTGGGGGCCATCGAACCGGGCTCGCTCGAAGTCGAGTGGAACACGTTCACCGACGAGGCGGTTCTCGGTGCGTACACCTTCGCTCAATTGCAGGAGATGGGTATCGCCGTCTCGATCTGGCGCGACCCCACCC
AGATCGCCCGAGATGACGGGAACGGCGGTGTAGTGCTGAACGGGATCTCGATTGGCACCGTCAACTACGCAACCGGTCAGGTGACCTTCAATCCGGATGTCTCGATCCGTATCCCACGCCCGGTCTACACGGCAGTCGCCATCAACGGCACCGGTCGGTGGCGATTGAACTACGGCGGCATCGCCTACGTCGATGCGCCATCGCTGTACCCCAACGACGAATCCGGCTACGTCAAGCTGCGCTACAACAGCGCGGGCTCGACCAGCAACCAGACCGAGACGTTCCAGTTCCTACCGGCCTTCAAGCTGGTACCGGGGGTGAATGCCCAGGTGGTGACAGGCACGGTGCTTCTCTCCATCAGTGGCGCGCAGCCTTGGGGCGACAACGGCCAGGGCACCCTGCGCGAGTTCACCACCAGTGGCTGGGTCACGCGCGGCACGATTAACTACCTATCCGGGGACGTGGCGCTGACGTCCTGGACGGCGGGCACGAACAACGCGATCACACGAGCCAGTTGCGTGACCACGGTCGGCGAGAACATCTCCAGCGAGTTCGTGTTCCGAACTGGCGCGGCACCGCTTCGTCCTGGGTCGCTGTCGATCCAGTACGCCCGCGCGGTTGGTGGCACGCAAAACGTGACGGCCGGGATTGACGGCAAGATCGAGGCAACCGGCATCAGCGGCAGCGTCGACTACGAGACCGGTCTGGTGCGCGTTCGCTTCGGAACGATGGTCACGGCGGCCGGGAACGAGAGCCAGCCTTGGTACGCCGCCGACCGGGTGGGCACGGACGGCAAGATCTTCCGACCCGAGCCGGTGGCCGCATCCAGCGTGCGTTACAGCGCGGTCGCCTACAGCTATCTGCCGCTGGATGCTGATTTGCTTGGCATCGATCCGGTGCGCCTGCCCAGTGATGGGCGGGTGCCGATCTTCCGCCCCGGCGGCTTCGCCGTGGTGGGCCACACCGGCAAGATCACCTCCTCGGTCAGCAACGGCCAGACCATCAACTGCGCTCGGGTGCGCCTGTCGCGCGTGCGCGTCGTCGGCCACGACGGAGCGGTGATCCACACCGGGTACTCCACCGATCTGGAAGCGGGCACCGTCACCTTCATCAACGTGTCGGGCTACAGCCAGCCCGTGACCATCGAGCACCGGATCGAGGACATGGCCGTGGTGCGGGATGTGCAGATCAGCGGCGAGATCAGTTTCACGCGCGCCCTGACGCACGAATATCCGCTGGGGAGTCACGTCTCCAGCGCCCTGGTGGCCGGTGACCTGTTTGCCCGCGTGAATCTGGTGTTCGACCAGTCAACGTGGAACGGCGCGTGGTCAGATGCCTTGTCAGGCAGTTCCGCAACAGCAACGTTCAACAACACGCAGTACCCGATCCGCGTGACGAACCGGGGGGCACTGACCGAGCGTTGGATCGTGCGCCTGACCAACAGCACCTCGTTCGAAGTCATCGGCGAGAACGTCGGCGTGATCGCCACGGGCAACACCAGTGCGGATTGCGCGCCCAACAACCCGGCGACCGGCGTGCCGTACTTCCATCTGCCCGCACTTGGCTGGGGCAATGGCTGGGCCACCGGCAACGTGCTGCGCTTCAACACCATCGGCGCGCAGTTCCCGGTCTGGGTGGTGCGCACCGTTCAGCAGGGGCCGGAGTCCGTGCCCGACGACAACTTCACGTTGCTGATTCGCGGCGACGTGGACACCCCCTGATTTCGTAGACAGGAACCCATGAAATGATCGACCTGACCGTCAAATACTTCAACAGCGGCATGACCGGCGCGCCACAGATCTCCAACAACTGGGGCGATCTGGTGACGATGCTCGATGCCTGCCTCGTCAATGGCTTCGCGCTGAAAGCCATCGACACCTTGACCTTCGCCGATGGCATCGCCACAGCCACCATTTCCACCGGCCACGCCTATCGGCCTTTTCAGGTGGTCGAGATCGCTGGAGCCGAGCAGCCTGAGTACAACGGTTCATTCCGTGTGC
TGTCGACGACCACGACCGCCTTCACCTATGCGGTGACCGGAGCGCCGGTGTCGCCCGCGACGACGACCACTAACCTGAGCGCCAAAGTGGCTCCACTTGGGTGGGAGAAGCCGTTCGCGGGGACGAGCAAGGCCGCCTATCGCAGCAAAAACCCACAGTCGCCGCAGAACATCCTGCTGATCGACAACAGCCTCAAGACGCCCAACTACACGACGGGGTGGGCGAAGTGGGCCAACGTCGGGATCGTGGAAGACCTGTCCGACATCGACACCATCGTTGGCGCACAGGCTCCGTATGACCCGAACAACCCGACGCAGAACTGGAAACAGGTCACCGCTAGCCAGTGGGGTTGGTACAAATGGTTCCACGCACGTGGCCCCCAGTACGAAAGCAACGGCGACAGCGGCGGCGGAGGTCGCAACTGGGTGTTGATCGGTGACGACCGTCTGTTCTTCCTGTTCTGCACCAATGCAGCGGGCTACGGCTGGTATGGCCGCAACAGCTATTGCTTCGGCGATCTGATCAGCTTCAAACCCGGTGACAACTACGCGACGGTGCTGGCCGCCGACGACAACTACTCGGGGATGAGCAACTACTGGAGCTATCCAGGGCAGTTCAGCGGCTACGGGCTGGTTTCGTCCCTGGACTTCACCGGCAAGGTGCTGCTGCGCAATCACACCCAACTCGGCAATCCCGTCCGGTTCGGACTCACGTCGCTGAACACCAACAACGGCCAGCAGATCTGCGGTCGGGGCCCGATGCCGTTCCCGAATGGAGCCGACTACAGTTTGTGGCTGTTGCCCACCTACGTGCGGCAGGAGGACGGCCATATGCGCGGCATCCTGCCCGGGATGCTGTGGATGCCCCAGGACCGGCCCTATAGCGATCAGACCATCGTGGACAACGTGGTGGGTCAGGCGGGTAAGCGCTTCTTGCTGGTCAGGACGCAGTACAGCTCGGAAACCGAAGGCGCACAGATCGCGTTCGACATCACTGGCCCGTGGAGGTAAGCCATGAGCTACCCGCTGAGCGAGTCTTTCGCCACGGCTCCTGCGACCGGCTACACCGCCGTCCTCGGCGGAATGGCCGCGACACACAACAACGTGCAGCAGTCCATCGATATCTCGGCCCCCAATAGCCAGTCCATCCTGCGCTTCAACGAAACCGCCCACGGTGACTTCTGGTTCGAGGCGGATGTTGAGTTTCTGACCGACCCGAGCGCCCGCAAGCACATCGGCCTGTGGATGACCACCGGCAACGGTTCCGAGGGCTACCGGTTCGCACATATTGACGGTGCCTGGAGCGTGACACGCTGGAACAGCGGCTTTGGCGACGGCGCGGCAGTGACGGGCGGTGTCAACGATGGAGCGAAGCCGGTCGCGGGCGTTATAGACGTGGCCCCGACCTTCAACGTCGGCCAGCGGATGCCCCTGCGCTGTGAGGTCATCGTCGGAGCCTTTGACGCCAACGGCGTTCCGTGGGCGCGCCTGATCCAGTTCAAGGCCGGTGGGGTGCTGATGTTCCAGGTCGGGGATGCTGCCTACAGGGGCAAGCTGATCCCGGGCGTGTTCCTGTATGGGGCCACGGCGCGCGTCCACGCGATTGCGGGTGACACGCCGTCAGGTCTGCCCGCGTTTCCGGCAACCGTGGGCGTGAACGCCGCCGATGACCTGCTGCCGCTCGCGGGTGGGTCGACTTCGGTGCCCCCTGATCCGGCCGCCAACATCGCCGTCAACGCCGACTGCGACCTGATGCGCTTGAACAGCCCCAACTCTGAGCTGTGGAACCGGGGTGGTGGCTACGACTGGCACTTTCACGCGATTCCGAATGGCCGCAAGAACATCCACTTCAGCGGCCACGGCTTCATCGCCGGAACCGTCAAGGAGAAGGGCCAGCCCGACCAGCCCCTGGTGCGGCGGGTGCAACTGGTCAGCGAGAACACCCGCGTCCTGGTGGCCGAGACCTGGAGCGACACCACGGGCGCGTACCGGTTCGAGCTT
ATCGACCCGGCCCAGAGATACACCGTGGTCAGCTACGACCACAAGCAGATGTACCGCGCCGTGATCGCGGACAACCTTCATCCGGAGATGATGCCGTGACCGTTGCCATCACTGTCGAACACAACGAGGCGCGACTGGCGGGCACCCTGGCATTCCTGGATGCCGGTAGCAATCCGGCGCGTCTGCGCATCTACGGCGGGACACGACCCGCCAACCCGGCCACGACGCCGACCAGCGCGATGCTGGTCGAGATCAGGCTGACCAAACCCGCAGGCACGATTGCAGGTGGACTCTTGACGCTGACGCAGCAAGAAGACGGGCTGATCACGGCGACCGGCATCGCCACCTGGGCGCGGCTGGTCAACGGCAACGAAGTCACGGCCCTGGATCTGGACTGCAGCGGTACCGACGGCAGCGGTGACGTGAAGCTGGCCAGCACCAACCTCTATCTGGGCGGCGATGCCCGGATGGTGTCTGCGATCCTGGGGTAGTCCGTGCCTGCCGTTCTCAACGAGGTGACCCTGGTCGCCGCGTTGCCTGCGCCCACCGCCAGCGTGGCGGTCGGGCCTCCGCTGGTCGATCTGCTGTTTGACCAACCGGCTGCCACCGACGCCAACTTGGTGTTCGGGGCCAACTACATCGCGCCGCGCGACGACGTCGTGGTGCTGGCCAGCCTGCCGTTGCCGGTCGTGGCGATCAAGTTCATCCCGCCAGCGCGGGCCGCACTGCTGGCCGAGCTACCTGCATTGACGGTGACCACGCTGTTGCTGCGCCCGAGCGTCCCCTTGGACGTGACCGGTGCAAGTCTTCCTGGTGTCGTGTTCTCCGGCGAGGTCAGGTACTACTCGCGCACGCAGCGACCGACAGTCGGCCAGACCGCACACGCTTGGCAGGTGGCAGCGCAGACGGAAGATGGTTCGACACAGGGCCAGCAGGACGCTGCCGCTACACCCGCAGGCTGGGACACGTTCTGGCGACGCACCTTGGGTGTTCCTCAAGGCATCGAGCACAGGTTGCCGCCGGTGCTGGCGGCAGCGCCCGAGCAACGAGGCGCTCGCCACCAGGATGCGACCCGGCTGCAGGATTCGACGTGGTTTGCGCACCAGGACGCCACGCGTTTTGCGGCGACCCGACAAGGTCTGTTCCAGAACGCAGGCCCGTTGCGGGACACCACGCGATTTCGGCATCAGGACGGCGACCGCACCAAACGCGCGGGGCGGGTGAGCTTTTGGCAAATCGCGCGTCTGCTCACCGAGCGCCAGGGGAGTGATTTTCAGATTGCCAGCCCGTCACTCAAGGGCTGGAGTGTCCGGTATCAGGACGCCGTGCCGCCACCGCTGGGGATCAGCGTCTGGGTGGTTCCACAACCGCCAGCGCCGATACCTTGCTACACGCCGAGCGCGCATCTGCTGTTCGCCGCTTTGGCCCCAGCGGACAGCCACTTGCTGTTCGTCTGTGAAAACCACATCAACCCACCGCCTCCCGATGGGGAGCCGGTGGTCGTTCCTGTTCGGAGGGTCTATTTCGTGATCAACAACGTGACCCTGTACCGCGTGTCCGATGGCGCGCCGGTGCCGGTGTTCAACCTTTCGCTGTCGCTCGATGCATCGTCCTGGGCGTGGGGCTTCGATGCGGTGCTCCCTGCGAAAGCCGAGGCGCTGGTCGCGGGCAGCGCTTCCGGGCCCGTCGAACTCGTGGCCAGCGTCAACGGCACCCCGTTTCGCGTGCTGGCCGAGAGCATCAGCCGCGAGCGCATCTTTGGTGACGCCAGCATCCGCATCTCCGGACGGGGGCGCAACGCCGTTCTGGCCGCGCCCTACGCGCCGGTGATGACGTTCTCGAATACCGAAGGCCGCACTGCTCGGCAGTTGATGGACGATGTGCTCACGGTCAATGGCATCCCGCTGGGCTGGGCGGTCGATTGGGGCCTGACGGACTGGAACGTCCCCGCCGGTGCGTTCGCGCAGCAGGGGTCGTGGATCGACGCACTGACCGCCATT
GCCGGTGCTGCAGGTGGCTACTTGATTCCGCATCCCTCGGCCCAGAGCATCCGCGTGCGTCACCGCTACCCGGTCGCGCCTTGGGAATGGAGCACGGTCACGCCCGACTTCGTGTTGCCCGTCGATGCTGTCGCCCGCGAGTCGCTGCGCTGGTTGGAAAAGCCTGCGTACAACCGCGTGTTCGTTTCCGGGCAGGACGTAGGCGTGCTCGGGCAGGTGACCCGGGCCGGGACTGCCGGAGAAGTGCTGGCACCGATGGTCGTCGACCCGCTGATCACCGAGGCGGCCGCCGCGCGGCAGCGTGGCGTAGCCGTGCTCGCCGACACGGGTCACCAGCTCGAGGTCAGCCTGCGCCTGCCGGTGCTCGCCGAGACCGGGATCATCGAGCCCGGTGCGTTCGTGGAGTACCAGGACGGCAGCGTCACGCGATTGGGCATTGTCCGCGCGACCCAAGTGGAAGCCGGGTTGCCCGAGGTCTGGCAGACGCTGGGAGTGCAGGCCTATGCGTAACCTCTACGAGCAGTTTCGCCAACTGATCCCCGACCCGCCGCTGCAGGCGGGCACAGTGAGCGACGTCGGCTCTGGCGTGGTCACGGTCGCATTGCCCGGTGGCGGCCGAATCAAGGCGAGGGGCTCTGCGGCCCTTGGCCAGAAGGTGTTCGTGCGCGACGACGCCATCGAAGGCATTGCGCCCAGCCTGACGCTGGAAATCATCGAGATCTGAAACCCAACTGATTCAACCCTGAGACCCGCCCTGATGCTCACGCATCGGGCGGGTTTCGTATTTCTGGAGAAAGCAAATGACCGAACCTGAACAACAACCGGCGCTCGTCGAGAACATGCTCCTGCTGCGCAAGGAGGATTTCGACGATCTGCTCGACCGTGCCGCCGAACGTGGTGCTGAACGCTGCCTCGCCCATCTTGGACTGGAGAACGGCCACGCTGCCCGCGACATCCGGGAGCTGCGCGATCTGCTGGAAGCTTGGCGCGACGCCCGCCGCACGGCTTGGCAGACGACCATCAAGGTGGCCACCACAGGCATCCTGGCCGCGTTGCTGGTCGGTGCCGCCATCAAGCTCAAGCTGATGGGAGGCCCCCAATGATCGAAACCCTCCTCGGTGGCCTCCTGGGCGGGGTCTTCCGTCTTGCGCCCGAAATCCTCAAGTGGCTGGACCGCAAGGGCGAACGCGGCCACGAGCTGGCCATGCAGGACAAGGCGCTGGAGTTCGAGAAAATTCGCGGCGCGCAGCGGATGGCCGAGATCGGTGCGAGCGCCGAAGCCGCCTGGAACGTCGGTGCCGTCGATGCGCTGCGTGAGGCCGTCCGCACCCAAGGTGAGAAGACCGGTGTGCGCTGGGCCGATGCGTTGTCTATCAGCGTGCGACCGGTAATCACCTACTGGTTCATGGCGCTGTACTGTGCGGCCAAGACGGCTGCTTTCGCGGCCGCCGTCACCGCTGGCTCTGGCTGGGGCACGGCCATCCTGCACGCATGGACGGAAGCCGATCAGGCGCTGTGGGCCGGGGTTCTGAACTTCTGGTTCCTCGGGCGCGTGTTCGACCGGGTGCGCTCGTGACCGGGGTGCCGAAAACGGCCATCGAGCTGGCCAAGCGCTTTGAGGGGTTCCACCGGGTGCCGAGGATCGATCCGGGCCGCGCGCATCCGTACATCTGCCCAGCGGGCTACTGGACGATTGGCTACGGCCATCTGTGCGAGTCGACGCACCCGCCGATCACGGAGTCCGAGGCCGAGGTCTATCTTACGCACGACCTGCAAACGGCGCTCGCCGCAACGCTGCGCTACTGCCCGGTGCTCGCAACCGAACCCGAAGGGCGACTTTCGGCCATTGTGGATTTCACCTTCAACCTTGGCGCGGGGCGGCTGCAGACATCGACGCTTCGGCGACGGATCAACCAGCGGGATTGGGCGGTATCAGCTCAGGAGCTCTGCCGATGGATCTATGGTGGCGGAAAGGTCTTGCCAGGTTTGGTGGCAAGGCG
AAAGGTCGAGGCGGCGCTGATGTCTGGCCTTCACTACTGAAGCGCTGGCTCACACAAGCGCAGCAATTGCGATCGAATCTCGCCGGGCGATGCTGTCAGATCCACCGTCGCAAATCGGATGCCATGCCCCTGGATGACAACCGTCTCATCAACGGCGTCGCCGACGGAAGGGTGAAGCAACAGCCCACTGGCGTCATCCGCAAGCGGATCGCCGCGCCCGACCTGGGATCGCAGGTAGGCGTAGATCTGGTACACGTACCCGCTGCGCAGCGTCTCCTCTCGATACCATCCGCTCGTCACGATCGACGTGAACTTGGTGTCGATGACGATTCGCCGGCCAGAAGGCGCATGGTCGAGTACCACGTCGGTTCGCATCGTCGGCAAGATCTTGTCGATTCCCGATGTCTTCTGTTCGATTTGCCAGCCCAGCGTACCGCCACAGCGAACCCGCCATCCCTGCGGTTGCAGTACCACGTCATAGAAGCCGCCCACGGCCTTCTCGAACAGTCGCCGGACCCAGGTGACCTCGCGTTCCGGCAGAGTCAACACGTTCGTCCCTGCCACCTCAGTTGGCAAGGCAAGATCAAAAGCCAGCTTCGCCGCAGCCACCATGAATCTGTCATCAGCGTCGTTGCGGCCAAAGCGATCTGTGCTCATCTGCGCGCGAGTGGGTACATCGCCGGAAACACCCATGGCCTTCATGCCGCTGGCAAGTGAGCGGCAGCGATGAGACACGTCCTTCCTTTGCACAATTCGGGCGATGGTCTCTAGTGCCGCGCGCACAAAGCGATTGCGTGGCGTGTCGACGGTCAGCTCATCAAATCGACAGGCCACTAAGCCACGATCCAGCAATCGATGCCGCTCCGTATTCAGGACATCAATTCGCCCGCGCACGCGATTTAGGACAGCATCCCGAGATCGGTAGCCCAGGTTGAGGCGGCGGCGCTGCCTGACTTCGACTGCGTGGGCCAGAATCTCGGCGACCAGATCGGGGAGATCGTCAGGGTTGTCCTCCAGGCCAACCTTGCCGATGCCGCGAGTGCGGAACAATTCCGACGCGTACAGCATCAGCAGCCACAGATTGCGCACCGGAATACGTCCGATGTAGCCATTTGCGTCGACCGACGCGCTCTCGACCTGCTCCGCGACCACACCCATTACCAGCCCTGAAGCAGCCGCGCACACGCCTTCTGCGCTTCGTCAGGGGCGTCAAACCAATACTCATCGAGCAGTGGCCCGATCTCAGTCTCGACCACCTGCTGGAACCACTTTTTCGTGTCCCCGGCCTCCAGCCTATGGGCGGGCGTCACGTAGCTATGGCCGATCCGGAACTGCTTTCCAAGGCGAGCGTCAGCCGCTATCTGGTCATTTAGCTCTGCAATCCGGTGCTCAATATCCGCGACCAGAGCAGGATCGACAGCACACTCCTTGACAGCCCAATCCCGCCATGCGGTGCCAAGCCTTGGCTCGAGCCCCACGAAGGCGAAACGCCGACGCAACGCGAGATCGACCAGGGCCAACGACCTGTCGGCGATATTCATCGTGCCGACGACGTAGAGGTTTTCCGGAATGTGGACGGGGCGACGTTTGCCATCCGCGTCCGGGTAGCACAGTTCCAGTGCCTCATTGGGCGTGCGCTTTCCAGCTTCAAGCAGCGTCAGGAGTTCGCCGAAGATCTGCGCCGGGTTGCCACGGTTGATCTCCTCGATCACCACGACGAACTTCGACGATGGGTCCTTCGACGCTGCCTTGATCGCTTCCATGAATACGCCGTCGGCTAGCGACAATTTCCCCTCGCCGGTCGGCCGCCATCCTCTGACGAAATCCTCGTAGGACAGGTTGGGGTGAAACTGCACCGCACGGACTTTGCTTTCGTCCTTTTGACCCATCAGCGCAAACGCCAGGCGCTTCGCAAGCCACGTCTTGCCGGTGCCTGGTGGCCCCTGAAGGATCAGGTTCTTCTTGGTGCGAAGGCGGTCCAGAAGCCGATCGATCTCGTT
GCGTTCAAGGAAGCATCCGTCCTTGAGAATGTCGTCGACCGAGTACGGCACGATCGGCACGGCAACATGAACGTCCTCTGGTGCAGTTGCCTCGGTAGCATCGTCACCATCATCGACGTCACCAGCGTCGTCTTCGCCGACCGGAGACTTCTCATCGGTGGGGTCCTTGTACAGCCATGCTTCCAGCGAAAGCTCAGGGTAGGAATGGACCGGGTAAGCGGTCTCCTGGAAGCGCGGCTCCAATACATCCATCACGGCAAGGTAGTCTGCCGAATTGCAGCGCCTCTTCGGGCCGTGCATGCCAATCGGCACACCGAGCTTCTTGCTGACATAGAGCTGAGAGTTGTGATCAAGGCTCAGGAACGCCCAAGGCCTGATCCAATACAGGCCGAACGTCAGATTCCATGCAACGCCTCGCCGACCGTTCGCGCTGTCGAAAGCCTTGGCGAACTCCTCGCGGGCAAGGTCATCGTCGGTATCTGCGTACGCGATACCCGCTGCGAAGACCCCCCAAAGCGCGTCAATGTGGTCGGTGGCGCGATTGATCTCGAATGGAAAGTACCAGGACTTCAAGTTGTTCAGCAGCGGGATGCCTTCGAACGTCTCTGGAACCGGCTCGTCGACGCCCAGGAATTTTGCCAACTCGGTCGCGATGATCTTGCGGTTGGAGTCCTTGATGCCCCGATTGAACAGACCCATCGTCGTGAACGGGCAGATGTCCTTTACGAAGCCCGTAGTGCCGTCTGCATACTTATCCTCTGCCAGATGGCCGAGTCCGTCGACGCGAACGGAGATCTCCCGGATTCCCTCCACAAGGGCTGCCCTGTTGGCGCGATAGGTGAGCAGCTTGTCCGCGATTGCTTCATAGAACTTTGTCCAGCCGAAGCGATGTTTGTCTGCTGCCACCGTCCCGAATCGCTCCCGCCAGTAGGGAGCGTTTCGGAAACGCTCAACGTCCTGCGGCTTGTTGTGGAAGGCGAAGCTGATCAAACCATCGGTCATCCATTCACCAGGCAAAACGCGCCACACCGTTCCCCGGTGGGTATAGAAATACCACTCACGCACCGGTTCGATTTTTGTCCAGTCAACCTTAACTCGCTGACCATCATTCAGGTTCTCAGTGATCGTACCAATTGCCTTGATCGCCATCACCGAAACCGCTTGTCCACGGCTGTCAAAGGGTAGCCCATGCTTGCGCGTGTAGGACGACTTGATGGCGATCTGATCTCCTGGTCGCATCGATCGCACCACGTCGAGATGCTTGTCCTCGTAGCCGTTCTCCCAAATCCCCTCGGACAGAAAGCGTGGCAACTGATCGTCCGTGCCCCCGTAGCTTGCGCCAACAAACCAGCTCGCCTGTGCGCTCGAGTTCTCTGTTTTTATGTTCATGGATGTCCTCAGATAGTGCTGCGCAAAGCGACGAACTGGTTACTCGGCCCCTCCGACCGTAGGTCGAGGTATGGGGAGTTCGAGACGACGTAGGCCAAAGGTCTCGTTAAACGGAATGGAAACAGGACAGGCAGTGATTGCAGGCTATGAACTGAAACAGGCTTGCCGACGTAGCGCACAGCCGCCTCGATCAGCCATACCGTGAGTTCGTCGCTGTCGATAGGCGTCGCTGCAAGACGGATGACGCGCTTGCCTTTCTCGACACGCTCTACCGCGCCCCAGCTTGCTTGTGTCTGGATGACCATATTGGTCATGCGTCGCGTGCCTTCGCGTTCGCCGTAGATCTCGCTCATACGGCGATGCACTTCGGCAGCTGCGCAGTCGTCTTGGATGGCCGACAAGCGACCCACCAACTCAGACACTTTCCCGAAAAACGGATAGCAGGCTATGGCCACGCCCCAGCACAGTACCGGAATTGGTGTGTCGGGCTGGCTTTTGTAAAGAGCCGCCGCCCGGTCGGCGAAGTCCACCAGCTCGGCGCGTGGCTCCAACCACAGCCGGTTCAAGACGGTTCGTGTTTTTTTCTTGGCTTCCACGCCAAGTTCGGCTGC
GTCGAGCAGTGCATTCAGATCATCTAGCCCAGCTGTACCCGCACGAACCCGCAGTGCCGCAGCCGCCCAATCAAGCTGAATGAACCGATCGAACCCAATCTGAGGGGCTGATGTATTCATTCGTTTCCAATCACATATTTAACTTTCACAAACGGCACGATCAGCTCTTCTACCGAGATGCCGCCGTGGACAACCACTTGCTCGCCATCGGGAACAAAAGCCGTTCGGCCACCCGCGAATAAGGGCATATAGCCCGCCGGTAGTCCAGCGATGTCCAAGTGCACTGAGTTGGCATTTGCCGCTGCGGATTTAGCCAACAACGATTCACTGCGATACACGCGCACCCGTTCGCCACGGGCTTCTGGAACATCCCCTTCAGATGGACGTCCCACGCCAACAGCTTCAACGTTGCCGTGATCCGCTGTCAGGTAAATATGAAAGCCTCTGTCGAGCAGCATCGCAAACAAGCGATCGACGAAGCCTGTTTTCAACCAGTTCGCGATCCACAAGGCCACATCTTGTTTGGAGCGTTCCTTGTGCAGTCGGTCATCCACCTCGTCGACCACCAACCCGACCACCTTGGGCCGACGATCGTCCAGCGCTGCTTGCAAGGCATCCAGCTGCTCAATCTGCCGCAACGAGCGCTGGTAGTAGATTTCGCCCGGCTTGACGCCTTGCTCCTGCCAGTAGGCCTTCCACAGGTACTCTTCTTTGTTGGTATGGCCGATCGACTCTTCGAATTCCCGCGGTTTACGGCCAGAGAACAGGGCCTGCCGCGACACCGAGGTCACGGTGGGGAGCCAAGCGAAGGAAGTACCTTCATCAAACGCAAAGCGCTTCGTCGCCTCGACTAACCGCTCCCGAATCTGTACCCACTGGTCCAAGGCTAAGCCATCAAAAACCAGGAGTGCGATCTTGTCGGCCCCGAGCGCTTTCCGGCGAGAGCTCAGATAGTCGGGAATCCGATGCACCATCACCGGCCCGTTGTGGAACGAGAGCGTGCTCAGATCCGCATAGTGCTTGGCAGCCACCCACGCATGTAGCTGCGCATCCGACTGTTCTTGCAGTGTTTTCACGAGAGCCTGCACCTCTGCCAAACCGTCTGTGCCATAGGCATTGCCCAGATCGTGCACGCGGGCGAGAGTTTCGCCGTACTGCTTGGCAAACTCGCTCCAGGCCTTGTGCGTAGCCGTCGCGCTTGGAATCGCGTCAATCAGCTTGGCAATTCCCTTTTTCACAAAAGCGTTGCGCGCTTGCGGGTCTTGCACAATCCCGGCCTTGATCCAGCCTGGCGCGTCAGCTGGCACAACCTCAACGACCAGCGGGTGCAAGGAGCCATTCAAGAACATCGAATCAACGATGGATTGCACATCCGAGTGCGCAAACGGAATATCGACTTTGGCCACGTAATCGGGGGGTGGTGGCTCGCCAATGCGTGAGCCTTCCATGCCCAAGTTCGCCAGGTAGCGATGCCAAGCCTCCTGCACCACGCGCAGCAACGCACTCTTCGACGACAGCCACGCGGCCACCGGAAGGCCTGCAAGCAGTCCCTTGCTCTGGATGATGCTGGCTGCGTGCTCGGCAAAGACCAGGGGCAGCGCACGGTTAGCAAAGTGCATCCGCAGCACATCACGCCAGAAATCACTTTCCGTGCGAATGCTGGAGCGCGGTGCCAGGTGGTAGACCCGCTCCAGTATGAAATCTTTGGACTCGTTCTCGCCGCGAATACCCTGCAGCTCGGTGTCGTGCGCCTCCAGCAAGGCGGCAAAGTGCTCGGGTTCAAGCTGCTTGACCACGTTGTAAGCCAAACGCGGAAACAACTGTGCCAAGCCCAGACTTACGACACGACCATAGTGCCCAAGGTCCCAAGGCAGCTCATTGGGATCTGCACCGCGCCAATGCACAACCACAGCGGGCGTGGGGCCGGGCTCGCCTCGGTCCCAGGCGGCGCGATAGCGCTCCTCAAACTCTGTACGGAACACAAAGGGGTCTTCGTACAA
CAGCACCTCGAAGCCACGGCTGCGCAGTTCGGCCAGCAAACGCTCGTCCAGCAAGACGTCGTCCGGATCGCAGGCCACCCACAGCCGGTCGAGATCAGCTGTGAAGCGACTGAGAATTCGTTCAATCCACTGGCTCATGTGCTTTGTGTCTCCCTCACGTTCTCGCCAGGCTGAACCTCGGCGGAGACGCGTACCATCATTACCGCGTTCAGATCCGGCACACTGGCCGCAGCCTCGGCCAAGGCGGCCAGTCTGGCGTCGTGTTCTTGTTGCAGACGCTTGCGACGATGCTCGCGGACAGCAGGCAGCCCGATGCGGCCGATAGCCTGTTGCCTTGCTTCAAAGGCGTAGTCAGCACGCTCCCGCTCCTCTTGCAGTCGGGTCCGATGCGCCTCCAGCATCTCGGTAAAGATCCGCTCGCCTTGGGCTTTTGCCGCCGTGAGTGCTGCGTCAAACCATTTCACAGCCTCTTCGGTGCCGCTGACCCCGTGTACGACCACTGTTTCGGTCAGCAGCAAATCCCACACGCGTTTGGCCGTCGGCACAAAAGCGCGGCCCTCTTCGTTGGTAAAAACGGGCAGGTAACGCTTGCGGTTCAAGCCTTCCGCCGCAAGGCTGATCTCCCAGAGGGACCACACTCCGCTCACAGAATCAGGCAGGCCCGTCACCCGTATCACTGGCAAGGGTTGGCCTGCCACAAAGCGGGGCAACTCGCTGATCACTGCGCGCGCGCGCGGGTCTTCCAGCGTCACCCACTCAATGTCATGGTTCTCGTCAGCGGTACGCGCGTCAAAGCAAACCTGCGCTGCTTCGCTGCCATCCGCCCAAGTCACACGCCACGCCTTGCCCACCTTGGTTGCTGCACCGCCGCGTGCGGCCAAGCCAGAGGTGATGGCTCGCTCCAGCCAGAACTGTGCCGGGTGGTCGCGCCATTTGCGGGCATCGTCCGCTTCCAGCTCGTGCGCGTCCGAGAGCAATTCGCTGCTCTTTTTTGACTCGGCCAAGGTCTCGCGCAACTGCGAAACCACCGCATCGCATTCCTGCTCTATCGAAGCCGGGTTCTGCAAACCATGCACGAACAGCTCTTCGAACAAAGGCTCTGCTTCCGCCGAGTCCATCACGTCGGACGCCTTGTCCACGCCGAACTGTTGGGCGATCACTTCCAGCTTTTCTTCCAGCACCTGGCGAACCCGGTGCTCCACGGTGTCTTCCAGCACAAAGTTGATGGCGCGCACCACGTGCCGCTGGCCAATACGGTCGACGCGGCCGATGCGCTGTTCAATGCGCATCGGGTTCCAAGGCATGTCGAAATTGACGATGACGTGGCAGAACTGCAGGTTCAAACCTTCGCCGCCAGCGTCTGTCGAGATCAGTACGCGCACGTCTTGAGAGAACGCCTTTTGCGCCTTGCTGCGTGCATCCAGATCCATGCCGCCGTTAAGGGTGGCCACCGAAAAGCCACGGCTTTCCAGGTAATTGGCCAGCATGGCTTGGGTCGGCACAAACTCGGTGAAGAGCAGCACCTTCAGGGCAGGGTCGTTTTCTTCCTGCTGCAGTTTGTAGATCAGCTCCAGCAAGGCTTCTGCCTTGGCATCGGTGCCCGAGGCTTCGGTCTCGCGGGCCAGAGCGAGCAGTATTTCCACTTCGGACTTTTCCAGCTCCCAGCCGGTGGCCTGCATGGCCAAATCCACCTGCGACTGGCCGTCCAGGTCAGCCCAGTCTTCCTCGCTGGTGTTCTCAAACAACGAGGGTTGTGGCTGGGGCTCCTCCAGCAATGCCAGCCGCTTTTCCAGCGTCGTGCGAATAGCGGCAGTGCTGGACGTCACTAAACGCTGCATCAGAATCATCAAAAACCCGATATGACGCTGCTTGGCGGCCATTGCCTGGTTGTAGCCATGGCGCACGTAGTCGGTCACGGCCTCATACAAGCGCCGCTGCGCGTTGTGGCGCGCCTGCCAGGCCACGGCCTGCAGCCGGGTGACTCGCGGCTTGAAGAGTGG
CTGACCATCGGCGTTGATCGACAGCCGCTTTTCTGTACGAATCACAAACGGCCGCACGCGATCGCGGTTCACGCTGCTTTCGTCAGGGAAGGCGTCGCGGTCCAGCAACTGCATCAAGCGCAGGAACTGGTCGGTCTTTCCCTGGTGCGGCGTGGCTGACAGCAACAGGAGGTAGGGCGATGCTTCTGCCAATGCTGCACCCAGCTTGTAGCGCGCCACCTGTTCGGTGCTGCCGCCCATACGGTGGGCTTCATCGATGATGACCAAATCCCAAGAGGCCGAGATCAGGTCTTCAAAGCGTTCGCGGTTGTAGTTGTTGAGCTGCTCCAGACTCCAGCCGCGCCGACTCTCCATCGGTTTGACCGAATCCAGTGAGCAGATCACCTGGTCATGCATACGCCACAGGTTGTCTTCATCCCCCTGGTTGCCACTGCGCCATTGGCGAAATGCAGCCAACTCAGAGGGCTCGATGAACTGCAGATGCTCACCGAAATGCAAACGCATTTCCGCCTGCCACTGGCGCACCAACCCCTTAGGCGCGACCACCAGCACTCTTTTCACCCGGCCGCGTAGCTTCAATTCCCGCAACACCAGCCCGGCTTCGATGGTCTTGCCCAAGCCCACCTCATCTGCCAGCAGGTAACGAATACGGTCGCGGCTGATGGCGCGATTCAGTGCGTACAACTGATGCGGCAGTGGCACCACGCTGGACTGGATGGGCGCGAGCAACAAGTTGTCTTCCAGCGCATCCAGCAGCTTGGCCGCCGCCGTGGTGTGCAGGATTTCCTCCACCGTAGGGCGAACGCTGTCCAACGGCGCAAGATCTGAGGCACGTGCCCGCACCACTGCGTCTTTGGCTGGCAGCCAGACGCGGTAAGCACTCTCACCCCATACCTCTTGCCGATCGATGACGCGACAAGACGCAGCCTGTCGCGTCAGCCAGCACCAATCGCCAACGTTGAAGCCGCCGCCCGCCAC
Protein sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|2021110_2023954_-|WP_147073232.1|DBSCAN-SWA MAGGGFNVGDWCWLTRQAASCRVIDRQEVWGESAYRVWLPAKDAVVRARASDLAPLDSVRPTVEEILHTTAAAKLLDALEDNLLLAPIQSSVVPLPHQLYALNRAISRDRIRYLLADEVGLGKTIEAGLVLRELKLRGRVKRVLVVAPKGLVRQWQAEMRLHFGEHLQFIEPSELAAFRQWRSGNQGDEDNLWRMHDQVICSLDSVKPMESRRGWSLEQLNNYNRERFEDLISASWDLVIIDEAHRMGGSTEQVARYKLGAALAEASPYLLLLSATPHQGKTDQFLRLMQLLDRDAFPDESSVNRDRVRPFVIRTEKRLSINADGQPLFKPRVTRLQAVAWQARHNAQRRLYEAVTDYVRHGYNQAMAAKQRHIGFLMILMQRLVTSSTAAIRTTLEKRLALLEEPQPQPSLFENTSEEDWADLDGQSQVDLAMQATGWELEKSEVEILLALARETEASGTDAKAEALLELIYKLQQEENDPALKVLLFTEFVPTQAMLANYLESRGFSVATLNGGMDLDARSKAQKAFSQDVRVLISTDAGGEGLNLQFCHVIVNFDMPWNPMRIEQRIGRVDRIGQRHVVRAINFVLEDTVEHRVRQVLEEKLEVIAQQFGVDKASDVMDSAEAEPLFEELFVHGLQNPASIEQECDAVVSQLRETLAESKKSSELLSDAHELEADDARKWRDHPAQFWLERAITSGLAARGGAATKVGKAWRVTWADGSEAAQVCFDARTADENHDIEWVTLEDPRARAVISELPRFVAGQPLPVIRVTGLPDSVSGVWSLWEISLAAEGLNRKRYLPVFTNEEGRAFVPTAKRVWDLLLTETVVVHGVSGTEEAVKWFDAALTAAKAQGERIFTEMLEAHRTRLQEERERADYAFEARQQAIGRIGLPAVREHRRKRLQQEHDARLAALAEAAASVPDLNAVMMVRVSAEVQPGENVRETQST >NZ_AP021884|1977054:2023954|1988093_1988462_-|WP_147073293.1|DBSCAN-SWA MNTNQQMPATQNDAWGFWGTMNEHASTAWPLAMTAISDATGQPLESVRVFLDSRHGRHFADDLQNGLYRGQTLADAINAATQQWMGWTIGRQTSKQYGIPRGLPYLTGFVIHCEIAEESIAA >NZ_AP021884|1977054:2023954|1989572_1989770_-|WP_147073287.1|DBSCAN-SWA MSKFEQLLTQIAQNKLGIETLETRKSDSLDFHDVAVWCLRDALEAAFNAGVEQGRKATKSDKANS >NZ_AP021884|1977054:2023954|2015009_2016134_-|WP_147073241.1|DBSCAN-SWA MGVVAEQVESASVDANGYIGRIPVRNLWLLMLYASELFRTRGIGKVGLEDNPDDLPDLVAEILAHAVEVRQRRRLNLGYRSRDAVLNRVRGRIDVLNTERHRLLDRGLVACRFDELTVDTPRNRFVRAALETIARIVQRKDVSHRCRSLASGMKAMGVSGDVPTRAQMSTDRFGRNDADDRFMVAAAKLAFDLALPTEVAGTNVLTLPEREVTWVRRLFEKAVGGFYDVVLQPQGWRVRCGGTLGWQIEQKTSGIDKILPTMRTDVVLDHAPSGRRIVIDTKFTSIVTSGWYREETLRSGYVYQIYAYLRSQVGRGDPLADDASGLLLHPSVGDAVDETVVIQGHGIRFATVDLTASPGEIRSQLLRLCEPALQ >NZ_AP021884|1977054:2023954|1997701_1998004_+|WP_147073273.1|DBSCAN-SWA 
MSLVAQIYESAANAGLLKECLWYPSNGAPSQLHQIGFAAPDESLLDGLALSTDYEMTYPVTAFGGLAVREVVEIGGTSFQVRDIRSLSDGSEIRAKLTRL >NZ_AP021884|1977054:2023954|1984518_1984728_+|WP_147073299.1|DBSCAN-SWA MKVSTPQYRCPLGRLQPQTTDLDAIKERGWRDQHILVVNASDDRLDFIEREIVRRIGERLYGLGGTRHG >NZ_AP021884|1977054:2023954|1999813_2000017_+|WP_147073264.1|DBSCAN-SWA MIEHGHRLPDILDYTLAQVRGFVVATARTDAARDARLLSVIAIGTRSDARQLDQTLDRLTDKATDRA >NZ_AP021884|1977054:2023954|1998460_1998655_+|WP_147073270.1|DBSCAN-SWA MTQLVLTRPHTHAGKTYGVGDRIEIDATSADWLIAHDIATPEPTAPTAEPVPEPKPLQRKEPKQ >NZ_AP021884|1977054:2023954|2013478_2013700_+|WP_147073248.1|DBSCAN-SWA MRNLYEQFRQLIPDPPLQAGTVSDVGSGVVTVALPGGGRIKARGSAALGQKVFVRDDAIEGIAPSLTLEIIEI >NZ_AP021884|1977054:2023954|2014075_2014552_+|WP_147073244.1|DBSCAN-SWA MIETLLGGLLGGVFRLAPEILKWLDRKGERGHELAMQDKALEFEKIRGAQRMAEIGASAEAAWNVGAVDALREAVRTQGEKTGVRWADALSISVRPVITYWFMALYCAAKTAAFAAAVTAGSGWGTAILHAWTEADQALWAGVLNFWFLGRVFDRVRS >NZ_AP021884|1977054:2023954|1981381_1981636_+|WP_147073305.1|DBSCAN-SWA MTDNNTPTTGIEPMIDAKQAAAALRLPYYWFADHAMRTKYRIPHYLMGGLVRYRLSELSAWATRTTAVQGRDSQDADAPVEGAE >NZ_AP021884|1977054:2023954|2004724_2005132_+|WP_147073260.1|DBSCAN-SWA MQLTNLDAGVALPLPDDLLWSDEHAWSPAVATTSYLITGALLIQSATRQAGRPITLVGAPDMAWVTRATVEQLQAWAALPVGSATGRFGLTFSDGRSFTVAFRHAETAIEAEPVLGIPARAATDFYRLTLRFLEI >NZ_AP021884|1977054:2023954|1977329_1977812_+|WP_024973177.1|DBSCAN-SWA MSDLTIFPVDIAEMSVSQLAALPPEQKCEVDKNLDAAIDWLKKARTKFDAALEQCYGEQARVALRESGRDFGTAHISDGPLHIKFELPKKVSWNQKQLGEIAERIVASGEKVEGYLDVKLSVSESRYINWPPALQQQFAAARTVDSGKPSFTLSTDGGEA >NZ_AP021884|1977054:2023954|2005133_2008697_+|WP_147073258.1|DBSCAN-SWA 
MPIQSGDVKLLKSAVMADVPEGGGAPTGNTIADGVSNAIFPDISELDRAGGRVNLRKSFVSVQTDDTDTYFGANVIVAEPPQDARVSVTLFSTEKTFDTREQAQVRIEAYLNKGPEWAGYLFENHIAGQRVIQLFQRTTDTVPNVGQTLVLIENEGLGTQKEQYIRATSVSVVERTFTYDGDKDYKASIVTVDISDALRYDFTGSPASRTFTRAANSTKTRDTVVADAGTYVGVVPLTQAAAVGDFTIKGTSIYTQLVPSAQTETPISFVPPYAAAGLPVPGAVAVSYTASHAWTTSIKFNLPGGCLPGSLTIGTDGITIFDDAGLLKTASGTVGTIDYANGILTLNSGTMSNAKAITYTPAAQILRAPQSSEIPVTPESRSQSYVGTVNPVPQPGTLSISYMAQGRWYVLSDSGNGSLKGLDASYGAGTFNRNTGAFVVTLGALPDVGSSLVLTWNVPTQETQQPSTTLKATQSLALNPPAGTAVQPGSLTVSWEYGGTKTSTAATSGVLSGAATGSLSVAQNRVDFAPNVLPAVGTQLTVSYVAGPKQEDSFAHPSRNGAGTLPVTATLGAIEPGSLEVEWNTFTDEAVLGAYTFAQLQEMGIAVSIWRDPTQIARDDGNGGVVLNGISIGTVNYATGQVTFNPDVSIRIPRPVYTAVAINGTGRWRLNYGGIAYVDAPSLYPNDESGYVKLRYNSAGSTSNQTETFQFLPAFKLVPGVNAQVVTGTVLLSISGAQPWGDNGQGTLREFTTSGWVTRGTINYLSGDVALTSWTAGTNNAITRASCVTTVGENISSEFVFRTGAAPLRPGSLSIQYARAVGGTQNVTAGIDGKIEATGISGSVDYETGLVRVRFGTMVTAAGNESQPWYAADRVGTDGKIFRPEPVAASSVRYSAVAYSYLPLDADLLGIDPVRLPSDGRVPIFRPGGFAVVGHTGKITSSVSNGQTINCARVRLSRVRVVGHDGAVIHTGYSTDLEAGTVTFINVSGYSQPVTIEHRIEDMAVVRDVQISGEISFTRALTHEYPLGSHVSSALVAGDLFARVNLVFDQSTWNGAWSDALSGSSATATFNNTQYPIRVTNRGALTERWIVRLTNSTSFEVIGENVGVIATGNTSADCAPNNPATGVPYFHLPALGWGNGWATGNVLRFNTIGAQFPVWVVRTVQQGPESVPDDNFTLLIRGDVDTP >NZ_AP021884|1977054:2023954|2014548_2015016_+|WP_170227448.1|DBSCAN-SWA MTGVPKTAIELAKRFEGFHRVPRIDPGRAHPYICPAGYWTIGYGHLCESTHPPITESEAEVYLTHDLQTALAATLRYCPVLATEPEGRLSAIVDFTFNLGAGRLQTSTLRRRINQRDWAVSAQELCRWIYGGGKVLPGLVARRKVEAALMSGLHY >NZ_AP021884|1977054:2023954|2000670_2004714_+|WP_147073261.1|DBSCAN-SWA 
MAKRISILVALEGADEGLKRAITSAERSLGELSTTAKTAGAKAAAGMAEVKAGMSAFGDQVATAKTQLLAFLSISWAAGKVQEIVQIADAWNMMSARLKLATAGQREFTTAQAALFDIAQRIGVPIQETATLYGKLQQAIRMLGGEQKDALTIAESISQALRLSGASATEAQSSLLQFGQALASGVLRGEEFNSVVENSPRLAQALADGLNVPIGRLRKLAEEGRLTADVVVNALMSQKDKLASEYAQLPQTVSQAFERLRNAFGQWINRVDESTGLTKKLAEALTILANNLDTVMQWLKRIAEVGLAVLIYRLIPALITAWQTAGAAAVTAASATAAAWTTANLSVSAAVASVGLLKTAFAVLGAFLVGWEIGTWLSEKFEIVRKAGIFMVEMLVKAVEQLRYRWEAFAAIFTSDTIAEATQRHEARLAEMNQIFAQMYADATKGADAAKGAMNTAATAAEEIAKRLEAVRQGTQEAVGRGIEAVHSALEKLKSRLGEVEQAVGKANQTVNDATAKMAEAYKGLTSIVEANLLRQIEAVKARYQQEQSALETSKQSEAALITKSTQLLTEALTQQTTLRRQSTTDTLKLIDDESKARIESARRQGQTEEERRANVQRVENDILATKRQTMTQALAEYRQHIDALNAEANRHLTEIKRIEEEKRQLSMTTEERVRDIRRQGMTDFEATEDRKRQIAEYQGKAREALANGEFEQARQLAQKAMDLAAQVASSQTSEAKRGEDARKQSEQAVSQVTQLESQSRDAYRKQEYAQAEALMRQADALRAELAQKTKDADAQIAQGKDGVNQAIQRIRESEEILNKTLDAEAKAHQTAAQSALTARDQIQQTLTQTETQIDQITAKLKDGLKVTLDADTTRFDKAIADLDKALAEKEYLLKIQADLQEAEKKLQQYEQLLKEGKTLPVDADVSKAKEALDKLKTYADQNSQFELKVATEKAQAAITKVEGMIKALDRIQTESRHQVSTNADAARSEIMSLNWANTSSTHTIYVRKVEANATGGLVGGGVRRYADGGAVAPAFPRMSGGSVPGSGHHDTVPRTLDAGAFVIRKAAVQKYGGGALSRLANGVARFATGGAVMLGGGKRPSGNDADGTPSTPKKNREAVEAMKMIDLGLQGMNEYTNWLQWNYGASVSLDMRSKTMDSYGKQAQQDRRALEDFISRKTLTGNERQNLERIKQTWRQAMAQPLLWGKDLERELIDYMEQNQGEFYRRGGMAKSDTVPAMLTPGEFVVNKDAVSRYGAGFFEAINNLSAPAQALAGRALAGVQGFATGGLVQPSGSRLARPVLAADAGPSRTVRVELSSGQQKVNATVDARDESRLLQLLDAARARTA >NZ_AP021884|1977054:2023954|1998651_1999404_+|WP_147073268.1|DBSCAN-SWA MSTYASFQGRVFLGKRDTDGLPIEVRSPGNVAELKLSLKTDVLEHYESQTGQRSLDHRMVKQKSATVNLTIEEFTKENLALALYGNHVVGTPGTVTAEPVGGATPIAGDRYFLAHPKVSSLVVTDSAGTPATLALGTNYTADPDFGALQFLDTTGFTAPFKASYAYGVATEIGIFTQALPERFLRLEGINTAQGNAKVLVELYRVAFDPLKEISFISDEYNKFELEGSLLADTTKPFDAVLGQFGRIVQL >NZ_AP021884|1977054:2023954|1980150_1980651_+|WP_147073309.1|DBSCAN-SWA MKCWVCKRQARGFGHTDNRHGIGDPRRYPIDWVFCSQRCQSAFHAMYGNWSRAKDGRSDIKGVAMIDPSDIELAAMRKCLKSFGEAASEIGFTKPLGNYSEAEALQVIDAIVTCYTEAMVEHHEASKYPPVRGMTPTPDPMTPSAANPFADLDDDLPWEEPKGKKP >NZ_AP021884|1977054:2023954|1980647_1981385_+|WP_147073307.1|DBSCAN-SWA 
MMDFNSTSSISGQITALVDAGMQRARAQQSERQYLGASRLGAACERALQFEYAKAPVDHGRDTPGRMLRIFERGHVMEDCMVAWLRDAGFELRTRRADGEQFGFSVADGRLQGHIDGVIVDGPEGFAYPALWENKCLGMKSWRELEKNRLAVAKPVYAAQVAIYQAYLELHEHPAIFTALNADTMEIYTEAVPFDAALAQRMSDRAVKVITATESADLLPRAFNDPTHFECRMCAWQDRCWRTQA >NZ_AP021884|1977054:2023954|1984052_1984517_+|WP_147073301.1|DBSCAN-SWA MKTTILALDLGTHTGWALQHLDGTITSGTEHFKPQRFEGGGMRFLRFKRWLNELLSVSNHINAVFFEEVRRHAGVDAAHAYGGFMGHLTAWCEHHNIPYQGVPVGTIKKHATGKGNASKDEMITSVRERGHTPVDDNEADALALLHWAVETQEV >NZ_AP021884|1977054:2023954|1993294_1993516_+|WP_146463160.1|DBSCAN-SWA MAYTEAQLQALETALAKGEHRVSFGDKTVEYRSVDELKAAIREVKRGILEQAAATGLWPGAPRQIRVTTSKGF >NZ_AP021884|1977054:2023954|1981632_1983924_+|WP_147073303.1|DBSCAN-SWA MIDFNDTTQPAEHNRESERDEIRADLLARLESVLTTMFPAGKKRRGKFLIGDILGSPGDSLEVVLEGEKAGLWTDRATGDGGDIFALIAAYLGANVHTDFPRVLDEAADLLGRSRSVPVRKAKKEAPVDDLGPATAKWDYFDAGGKLIAVVYRYDPPGGKKEFRPWDAKRRKMAPPEPRPLFNQPGIGAASHVVLVEGEKCAQALIASGVVATTAMHGANAPVDKTDWSPLAGKTVLIWPDRDAPGWDYADRASQAILQAGATSVAILMPPDDKPEGWDAADAIPEGFDVGGFLAVGERMPVMRSVEEAPSPDLLTGIDWTTEDGLSSAFTRRYGEDWRYCALWGKWLVWTGVRWNPDQVLYVSHLSRGICRNASLKADTPRLKGKLASSATISSVEKIARSDPKHASTAEEWDADVWALNTPGGVVDLRTGRMRPHRRDDRMTKVTTATPQGNPDSACPTWRGFLTDVTGGDADLMAYLQLMVGYCLTGVTSEHALFFLYGTGANGKSVFVNVLTTILGDYAANAPMDTFMEARNDRHPTDLAGLRGARFVSSIETEQGRRWNESKVKAITGGDKVSARFMRQDFFEYLPQFKLVIAGNHKPSIRNVDEAMKRRLHLIPFTVTIPPERRDGRLTEKLLKERDGILAWAVEGCSRWQSQGLKPPASVVSATEEYFEAEDALGQWIEERCLLAKSHREGVSELFADWREWAERAGEYVGSVKRFSELMATRKFDKCRLTGGARAIAGIALRPKPYSHAYPYRDD >NZ_AP021884|1977054:2023954|1995036_1996272_+|WP_147073277.1|DBSCAN-SWA MTLLPHLAARLYGVPLAIHRPKLDVILAVLGPRIGLADLAAPSGFTPPARPASTQTTKVAVIPIHGTLVRRTVGLEAESGLTSYAGLTAQLDAALASPDVAAILLDVDSPGGESGGVFDLADRIRAAAKTKPVWAVANDMAFSAAYALASAASKVFVSRTGGVGSIGVIAMHVDQSEKDAQDGVRYTAVFAGDRKNDLNPHEPISSEAHAFLKGEVNRVYGLFVETVARNRGIEASAVRDTEAGLFFGQAAVAIGLADAIGTFDDALAQLCESVSPLPKLAASHSGLFSNPQMESSMNDRTDPAAPDRLAADPAGSPSQPAAATAMTVADAIEVAQTCTLAGRTDLIAGFLEAKAPPAKVRSQLLATQAEASPEIVSRIDPQSAMSASSTGHPASSHNPLIQAVKSRLGTK 
>NZ_AP021884|1977054:2023954|1978662_1979289_+|WP_024973179.1|DBSCAN-SWA MTAWNDFNDADSQQSGFDLIPKGTVVPVRMTIKPGGYDDPEQGWGGGYATESFETGSIYLAAEFVVTAGDHAKRKMWSNVGLLSKKGPTWGQMGRSFIRAALNSARNVHPQDNSPQAAAARRINGFAELDGLEFLARVDIEKDAKGQDRNVVKLAVEPDHPDYAKLKGVPPKGSPGGGNSGAPAQAAPAYSAPTPQRAPVTGKPSWAQ >NZ_AP021884|1977054:2023954|1999415_1999817_+|WP_147073266.1|DBSCAN-SWA MSDLDTLIPQAVELVIDGEPLAIKPLKVGQMPGFLRAMSPVMQQLTASNIDWLALFGERGDDLLSAIAIAVGKPRAWVDELAADEAILLAAKVIEVNADFFTQTVIPKLDGLFGQVKLPPIVKAAAGSMPSST >NZ_AP021884|1977054:2023954|1996685_1997705_+|WP_058719286.1|capsid|DBSCAN-SWA MQNPFISPAFSMASMTAAINLIPNRYGRLEELNLFPPKPVRTRQVIVEERAGVLNLLPTQPPGSPGTVNVRGKRTVRSFVVPHIPHDDVVLPEEVQGLRAFGSETEMESIAGVLAQHLETMRNKHAITLEHLRMGALKGEILDADGSRIYNLFDEFGIDQQSVDFEISSPTTGTDVKGKCTDVLGIIEEALLGEFMTGVHCLCSPEFFKALTGHKDVKTAFTNWQQGAVLINDVRRGFTFGGITFEEYRGKATDVNKTVRRFIAAGEAHAFPLGTIDTFGTYFAPADFNETVNTMGQPLYAKQEPRKFDRGTDLHTQANPLPMCHRPGVLVRLVMGGGV >NZ_AP021884|1977054:2023954|1992411_1992903_+|WP_147073283.1|DBSCAN-SWA MSLATRIESLVIRVAQEFNDVRATAGSLASLSTNDKSSLVAAINELKAAVLSAMAIDDNQIATTSTYSSNKIVSLLDALKTDILGGADAAYDTLVEIQQALQSGTSGLDAILAAVNLRVRFDAAQTLTVAEQLQARTNIGAVAVSDVGNTDTDFVVIFDGALA >NZ_AP021884|1977054:2023954|2009989_2011075_+|WP_147073254.1|DBSCAN-SWA MSYPLSESFATAPATGYTAVLGGMAATHNNVQQSIDISAPNSQSILRFNETAHGDFWFEADVEFLTDPSARKHIGLWMTTGNGSEGYRFAHIDGAWSVTRWNSGFGDGAAVTGGVNDGAKPVAGVIDVAPTFNVGQRMPLRCEVIVGAFDANGVPWARLIQFKAGGVLMFQVGDAAYRGKLIPGVFLYGATARVHAIAGDTPSGLPAFPATVGVNAADDLLPLAGGSTSVPPDPAANIAVNADCDLMRLNSPNSELWNRGGGYDWHFHAIPNGRKNIHFSGHGFIAGTVKEKGQPDQPLVRRVQLVSENTRVLVAETWSDTTGAYRFELIDPAQRYTVVSYDHKQMYRAVIADNLHPEMMP >NZ_AP021884|1977054:2023954|1979299_1980154_+|WP_147073311.1|DBSCAN-SWA MNASVLTASHYGVVRFGDLQCEAVVLKGGERGYVRRQLAKLLGFHETHKGGRFARFLADFAPKSLSALEKTREPILLPSGRQAQFFPAGIIADVASAVVSAAINGTLHKARQGIVPNCMKIMRALATTGEVALIDEATGYQYHRAPDALQELISKLLRQSCSSWERRFHPDYYRALYRLFGWKYQGHDQNPPHVVGQITQRWVYGPVLPVTLIDEIRARKGISQKHHQWLSDQGLARLETQIHAVTAIARSSTCYRDFDRRCEAAFAGGALQLALLAEDFEEGA 
>NZ_AP021884|1977054:2023954|1985452_1986862_+|WP_147073341.1|DBSCAN-SWA MNTLNVEYRKVEALIPYARNPRTHTDEQVAKIAASIVEYGWTNPVLVDGDNGIIAGHGRLAAARKLGLDQVPVIELAHLSPTQKRAYVISDNRLALDAGWNEEMLALEMAELSEAGYDLALTGFEDAEIEALLADEVASDAADQEPDADEPDDGDDVPDSPVVPVSRTGDFWAIGTHRLICGDATDPTVVATLMQGDAARLCFTSPPYGNQRDYTSGGITDWDGLMRGVFAKVPMDDDGQVLVNLGLIHRDNEVIPYWDAWLGWMRTQGWRRFAWYVWDQGPGMPGDWAGRFAPSFEFVFHFNRSSRKPNKIVPCKHAGQESHLRADGSSTAMRGKDGEVGGWTHKGQPTQDTRIPDSVIRVMRHKGKIGQDIDHPAVFPVALPEFVIEAYTDAGDIVFEPFGGSGTTMLAAQRKGRVCRCVEIAPEYVDVAIKRFQQNHPGVPVTLLATGQSFDDVVNERQATTEVEQ >NZ_AP021884|1977054:2023954|1986858_1988130_+|WP_147073295.1|DBSCAN-SWA MTASWFADKIEKWPTAKLLPYARNARTHSDDQVAQIAASIAEFGFTNPILAGSDGVIVAGHGRLAAAQKLGLAVVPVVVLDHLSPTQRRALVIADNRIAENAGWDDAMLRIEIASLQDDDFDVSLTGFDADALAELMAGDEPDGEGETDDDAVPELSETPISRPGDVWSLGGHRLLCGDSTVTESYDRLLDGEQVDMVFTDPPYNVNYANSAKDKMRGKDRAILNDNLGDGFYDFLLAALTPTIAHCRGGIYVAMSSSELDVLQAAFRAAGGKWSTFIIWAKNTFTLGRADYQRQYEPILYGWPEGAQRHWCGDRDQGDVWNIKKPQKNDLHPTMKPVELVERAIRNSSRPGNVVLDPFGGSGTTLIAAEKSGRLARLIELDPKYADVIVRRWQEWTGKQATRESDGALFDDQAAIDSSAISQ >NZ_AP021884|1977054:2023954|2008720_2009986_+|WP_147073256.1|DBSCAN-SWA MIDLTVKYFNSGMTGAPQISNNWGDLVTMLDACLVNGFALKAIDTLTFADGIATATISTGHAYRPFQVVEIAGAEQPEYNGSFRVLSTTTTAFTYAVTGAPVSPATTTTNLSAKVAPLGWEKPFAGTSKAAYRSKNPQSPQNILLIDNSLKTPNYTTGWAKWANVGIVEDLSDIDTIVGAQAPYDPNNPTQNWKQVTASQWGWYKWFHARGPQYESNGDSGGGGRNWVLIGDDRLFFLFCTNAAGYGWYGRNSYCFGDLISFKPGDNYATVLAADDNYSGMSNYWSYPGQFSGYGLVSSLDFTGKVLLRNHTQLGNPVRFGLTSLNTNNGQQICGRGPMPFPNGADYSLWLLPTYVRQEDGHMRGILPGMLWMPQDRPYSDQTIVDNVVGQAGKRFLLVRTQYSSETEGAQIAFDITGPWR >NZ_AP021884|1977054:2023954|2016133_2018365_-|WP_147073239.1|DBSCAN-SWA 
MNIKTENSSAQASWFVGASYGGTDDQLPRFLSEGIWENGYEDKHLDVVRSMRPGDQIAIKSSYTRKHGLPFDSRGQAVSVMAIKAIGTITENLNDGQRVKVDWTKIEPVREWYFYTHRGTVWRVLPGEWMTDGLISFAFHNKPQDVERFRNAPYWRERFGTVAADKHRFGWTKFYEAIADKLLTYRANRAALVEGIREISVRVDGLGHLAEDKYADGTTGFVKDICPFTTMGLFNRGIKDSNRKIIATELAKFLGVDEPVPETFEGIPLLNNLKSWYFPFEINRATDHIDALWGVFAAGIAYADTDDDLAREEFAKAFDSANGRRGVAWNLTFGLYWIRPWAFLSLDHNSQLYVSKKLGVPIGMHGPKRRCNSADYLAVMDVLEPRFQETAYPVHSYPELSLEAWLYKDPTDEKSPVGEDDAGDVDDGDDATEATAPEDVHVAVPIVPYSVDDILKDGCFLERNEIDRLLDRLRTKKNLILQGPPGTGKTWLAKRLAFALMGQKDESKVRAVQFHPNLSYEDFVRGWRPTGEGKLSLADGVFMEAIKAASKDPSSKFVVVIEEINRGNPAQIFGELLTLLEAGKRTPNEALELCYPDADGKRRPVHIPENLYVVGTMNIADRSLALVDLALRRRFAFVGLEPRLGTAWRDWAVKECAVDPALVADIEHRIAELNDQIAADARLGKQFRIGHSYVTPAHRLEAGDTKKWFQQVVETEIGPLLDEYWFDAPDEAQKACARLLQGW >NZ_AP021884|1977054:2023954|2011470_2013486_+|WP_147073250.1|DBSCAN-SWA MPAVLNEVTLVAALPAPTASVAVGPPLVDLLFDQPAATDANLVFGANYIAPRDDVVVLASLPLPVVAIKFIPPARAALLAELPALTVTTLLLRPSVPLDVTGASLPGVVFSGEVRYYSRTQRPTVGQTAHAWQVAAQTEDGSTQGQQDAAATPAGWDTFWRRTLGVPQGIEHRLPPVLAAAPEQRGARHQDATRLQDSTWFAHQDATRFAATRQGLFQNAGPLRDTTRFRHQDGDRTKRAGRVSFWQIARLLTERQGSDFQIASPSLKGWSVRYQDAVPPPLGISVWVVPQPPAPIPCYTPSAHLLFAALAPADSHLLFVCENHINPPPPDGEPVVVPVRRVYFVINNVTLYRVSDGAPVPVFNLSLSLDASSWAWGFDAVLPAKAEALVAGSASGPVELVASVNGTPFRVLAESISRERIFGDASIRISGRGRNAVLAAPYAPVMTFSNTEGRTARQLMDDVLTVNGIPLGWAVDWGLTDWNVPAGAFAQQGSWIDALTAIAGAAGGYLIPHPSAQSIRVRHRYPVAPWEWSTVTPDFVLPVDAVARESLRWLEKPAYNRVFVSGQDVGVLGQVTRAGTAGEVLAPMVVDPLITEAAAARQRGVAVLADTGHQLEVSLRLPVLAETGIIEPGAFVEYQDGSVTRLGIVRATQVEAGLPEVWQTLGVQAYA >NZ_AP021884|1977054:2023954|2019092_2021114_-|WP_147073235.1|DBSCAN-SWA 
MSQWIERILSRFTADLDRLWVACDPDDVLLDERLLAELRSRGFEVLLYEDPFVFRTEFEERYRAAWDRGEPGPTPAVVVHWRGADPNELPWDLGHYGRVVSLGLAQLFPRLAYNVVKQLEPEHFAALLEAHDTELQGIRGENESKDFILERVYHLAPRSSIRTESDFWRDVLRMHFANRALPLVFAEHAASIIQSKGLLAGLPVAAWLSSKSALLRVVQEAWHRYLANLGMEGSRIGEPPPPDYVAKVDIPFAHSDVQSIVDSMFLNGSLHPLVVEVVPADAPGWIKAGIVQDPQARNAFVKKGIAKLIDAIPSATATHKAWSEFAKQYGETLARVHDLGNAYGTDGLAEVQALVKTLQEQSDAQLHAWVAAKHYADLSTLSFHNGPVMVHRIPDYLSSRRKALGADKIALLVFDGLALDQWVQIRERLVEATKRFAFDEGTSFAWLPTVTSVSRQALFSGRKPREFEESIGHTNKEEYLWKAYWQEQGVKPGEIYYQRSLRQIEQLDALQAALDDRRPKVVGLVVDEVDDRLHKERSKQDVALWIANWLKTGFVDRLFAMLLDRGFHIYLTADHGNVEAVGVGRPSEGDVPEARGERVRVYRSESLLAKSAAANANSVHLDIAGLPAGYMPLFAGGRTAFVPDGEQVVVHGGISVEELIVPFVKVKYVIGNE >NZ_AP021884|1977054:2023954|1977808_1978657_+|WP_024973178.1|DBSCAN-SWA MKRLPIVSAVERMAERKGVKLLMLGKSGIGKTSRLKDLDPATTLFLDIEAGDLAVADWPGDTIRPASWPESRDFFVFLAGPDKSLPPESAFSQAHYDHVIEKFGDATQLGRYQTFFLDSITQLSRQCFAWCKTQPGAVSDRSGKPDLRAAYGLLGQEMIGALTHLQHARGKNVVFVAILDERLDDFNRKVFVPQIEGSKTSLELPGIVDEVVTLAEIKAEDGSSYRAFITHTVNPYGFPAKDRSGRLDLLEPPHLGALIAKCAGAVPALASAANPAHIESQE >NZ_AP021884|1977054:2023954|2011071_2011467_+|WP_147073252.1|DBSCAN-SWA MTVAITVEHNEARLAGTLAFLDAGSNPARLRIYGGTRPANPATTPTSAMLVEIRLTKPAGTIAGGLLTLTQQEDGLITATGIATWARLVNGNEVTALDLDCSGTDGSGDVKLASTNLYLGGDARMVSAILG >NZ_AP021884|1977054:2023954|1992902_1993295_+|WP_147073281.1|DBSCAN-SWA MSLASSIAALAARIGFEVKTKIDATHPGIARVWVSFGYVGGQVVIASAHNVASVVRTAAGRYRVHFAVAMPDANYCWTALARSSTNTGQQRLALVRASSDLKTAQYVDVSCATAASSFDDSSEINLVVYR >NZ_AP021884|1977054:2023954|1989887_1990427_+|WP_147073285.1|DBSCAN-SWA MGISIRAYARHRGVTDTAVHKAIRAGRITPEADGTIDADRADREWARNSDVPKTGTRAKAAKVAVPEGGTGVGGDGPAALPAGGASLLQARTVNEVVKAQTNKVRLARLKGELVDRPQAIAHVFKLARSERDAWLNWPARISAQMAAKLNIDPHTMHVALEAAIREHLQELGELRPRVD >NZ_AP021884|1977054:2023954|2013776_2014079_+|WP_147073246.1|DBSCAN-SWA MTEPEQQPALVENMLLLRKEDFDDLLDRAAERGAERCLAHLGLENGHAARDIRELRDLLEAWRDARRTAWQTTIKVATTGILAALLVGAAIKLKLMGGPQ >NZ_AP021884|1977054:2023954|1977054_1977318_+|WP_024973176.1|DBSCAN-SWA 
MQTQVPSIESGRNPRRMNPGGATCIALDENELAIRWGLSVKTLRRWRQEQLGPIYCKLGRRVTYLLHEIEAFERRVSRYSSFTRAYQ >NZ_AP021884|1977054:2023954|1993515_1995027_+|WP_147073279.1|portal|DBSCAN-SWA MAWYSKIRSLFGQQPVHEAAGRGRRSLAWMPGNPGAVAAMLATNTELRIKSRDLVRRNAWAQAGIEAFVSNAVGTGIKPQSLAADERFKTDVQALWRDWTEEADAAGQTDFYGLQALACRAMLEGGECLIRLRPRRPEDGLVVPLQLQLLEPEHLPISLNLDLPSGNVVRSGIEFDSLGRRVAYHLYRSHPEDGRLAPMSGQGGMDTVRIDAKEIIHLFRVLRPGQIRGEPWLSRALVKLNELDQYDDAELVRKKTAAMFAGFVTRQNPEDNLMGEGAADGDGIALAGLEPGTLQILEPGEDIKFSDPADVGGSYGEFLRTQFRAVAAAIGVTYEQLTGDLTGVNYSSIRAGMLEFRRRCEMVQHGVLVHQMCRPVWAAWMKQAVLAGAIDAPGFARGGPARRRRYLQVKWIPQGWQWVDPEKEFKAMLLAIRAGLMSRSEAISAFGYDAEDVDREIAADNQRADDLGLIFDSDPRRTSKDGGSAEPNKNAADTTQTGSSSSA >NZ_AP021884|1977054:2023954|1984720_1985137_+|WP_147073297.1|DBSCAN-SWA MAEWTTDDVAARFEEAATTGRRLPPVRVQGYFNCWPAFVRKEWEAFAADEKVYRPFPPSPEAIDRMLETMRWVQWLEVEQRHLVWMRAKRYGWRDITIRFACDRTTAWRRWQRAMEIVATNLNSEGVRLPSKNVGNLG >NZ_AP021884|1977054:2023954|1998008_1998455_+|WP_147073272.1|DBSCAN-SWA MADNSIRERILLAVMAAARPAVEGLGATLHRSPTVAISRELCPALAVFPESESITERANDRVTRELTVRVVALARAVPPASPETEADRLLTAAHAALFGDGTFGGLALGIREQESEWEVEDADAVAVALPARYRLTYRTLANDLSTLG >NZ_AP021884|1977054:2023954|1990429_1992394_+|WP_147073339.1|terminase|DBSCAN-SWA MNVEYEGAAEIERAWREGLTPDPLLSVSEWSDRHRMLSSKASAEPGRWRTSRTPYLKAIMDCLSPTSPVERVVFMKAAQLGATEMGSNWIGYVIHHAPGPMMAVWPTVDMAKRNSKQRIDPLIEESAALSELISPARSRDSGNTILAKEFRGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLAEARTRTFARRKIFIVSTPTISGASAIEREYEASDQRRYFLPCPHCSHRQWLRFEQLRWEKGQPDTASYICESCDKSIAEHHKTWMLEHGEWRAMISDGTGKTAGFHLSSLYSPVGWRGWRDIAAAWESSVNKESGSAAAIKTFKNTELGETWVEEGEAPDWQRLVERREDYRVGTVPPGGLLLVGAADVQKDRIEASIWAFGRGKESWLVEHRVLMGDTARDAVWKRLAELLAENWTHASGAAMPLARFALDTGFATQEAYAFVRACRDPRVMPVKGVPRGAALIGTPTAIDVSQGGKKLRRGIKVFTVAVGIAKLEFYNNLRKGADVSEDGVTTVYPTGFVHLPKIDAEFIQQLCAEQLITRRDRNGFPVREWQKMRERNEALDCYVYARAAASAAGLDRFEERHWRELERQLGMERPPDEPPPIQAFDPNEATQRGGLSVSANPPRRRVIKSRWLS >NZ_AP021884|1977054:2023954|2000022_2000667_+|WP_147073263.1|DBSCAN-SWA 
MRISVQIDSAAAQAQLRRWGGEFRDKVKKAVSRAIASEAVELKQDVRSHVASQMAVVKKSFLKGFTAKVLDKDLNRLPALYVGSRIPWSAMHETGGQIAGRMLIPLNGRVGRKRFKAQVAELMRGGNAYFIKNAKGNIVLMAENIKEHDRPLAGFKRRYRKAEGIKRLKRGADIPIAVLVPKVVLKKRLDVERLVASRIPRLAAAVENQISTVD >NZ_AP021884|1977054:2023954|1988560_1988920_-|WP_147073291.1|DBSCAN-SWA MSTMTITIERTPRTLQFGDTTFQVEELSVRLPFARKPADLDEVGGQGQTKVYVTETKELTVDEFDAFARSLLVSRDWLRGKGGGTGDGYLCVEVTAPGRPYLYVNPEGGDYARYVARLG >NZ_AP021884|1977054:2023954|1996290_1996668_+|WP_147073275.1|head|DBSCAN-SWA MPAMQEPINLGDLLKYEAPNLYSRDRVTVAAGQTLPLGTVLGQITATGKVKQIDPSATDGSQYSAGVLMQDADAALADRNDGLMVARHAIVSDHALHWPTGITTAEQQAAIQQLKALGVLVRIGA >NZ_AP021884|1977054:2023954|2018373_2019096_-|WP_147073237.1|DBSCAN-SWA MNTSAPQIGFDRFIQLDWAAAALRVRAGTAGLDDLNALLDAAELGVEAKKKTRTVLNRLWLEPRAELVDFADRAAALYKSQPDTPIPVLCWGVAIACYPFFGKVSELVGRLSAIQDDCAAAEVHRRMSEIYGEREGTRRMTNMVIQTQASWGAVERVEKGKRVIRLAATPIDSDELTVWLIEAAVRYVGKPVSVHSLQSLPVLFPFRLTRPLAYVVSNSPYLDLRSEGPSNQFVALRSTI >NZ_AP021884|1977054:2023954|1988949_1989474_-|WP_147073289.1|DBSCAN-SWA MTTTQLTPAQHAILAYALEHTDGKIDWFPDNIKGGARKKVLDGLFNRALITSDGTHWFVAAEGYDAMGRARPTPAPVAADPELDAAVTAAEAAWAQEKAAAKPRTRENSKQATVIQMLQRPEGATVQQICETTGWQAHTVRGTFAGAFKKKLGLTIVSDKAQGSERVYRIAAEA |
50 | Acidithiobacillus_phage(45.45%) | head,portal,terminase,capsid | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
| DBSCAN-SWA_6 |
2044160 : 2053121
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|DBSCAN-SWA ATCAGTGTTTCAGCCCCAGGGTGTGGCTGAGGAATGGCACGCCGCCATACCGTCCCATCAGGTAGCCGCGTTCCAGTGCCTCGTTCTTGCGCGGGCTGAATTCGGATAAGCTGACTAACGCCGAGGTCTGATCACGGTCTTTTTCTACGAACCACACTTGATCGCGTCGGAACAAATCTGGTGCGTCCAGCAGCGACGTGTCGTGCGTCGTGAATATCAGTTGTGCGCCGCCGGTGTTGATCTCTGGGCGGTGGAACAGTCGCACGAGTTCGCGCACCAGCAAGGTGTGCAAGCTGGTGTCGAGCTCGTCGATGACCAGCGTTAGCCCTTTGCGGAGGATGTCGAGCACTGGTCCTGCAAGAAACAGCAAATTGCGCGTGCCGTTGGATTCGTCCATCAATTCAAACACGGCTTTGCCCTGTTCGGTGACGTGATGGAAGCGCAGCTTGTGTTCTTCCATTTCTTCCGAGCGCACTTCAGTTTTACCGGCCACCAAGTCAAAGTGAACGGCCTGCCCAGGGACTTTGCGTGTTTCCACATCAATGTCGGCGATGCTGATGTCGGCAGCAGAGAGGAAGTTGCAAATCTCCTTGCGGCCGTCATCCTGCTTGAGCATCTGAATGGACACCTGTGGACTGAGTTGGGCTTGCTCGTTGAAGATCACCAGGCGATTCACAAACCAGTCGAACACCGGTCGCAAGGCTTCGCTGTTGAGTTGTACGGCCATCGACAGGAAGAGGGCGTTCGGTCGGGTGGCACCTTCCCACAGGTTCTTGGGTCCTTTCAGGCCGGGACCGAAGTCGTAGACATCCTTGCCGGTCTCAGTGTCAAAGCGGCGTGTGAACCAACGCTGGGGCTTGAACGCCTTGTAAACCAGCAGGTGCTCACTGACGATCCGCTGAGCTGTCATGGAAAAGCCATACTGGTAGCGCACACCATCGAGCAGGAACGTGACTTCGAACTCGCTGGGTTGACTGGCGGAATCGACATCGAGTCGGAAGGGCTGAACTGCGAAGGTCTGGCCCGGCTGGATCGCCGTCGCGGACTCGGTCACCACGCCGCGCATGTACTGCAGGGCTTTAATGAGATTGGACTTGCCGCTCGCGTTGGCACCATAGACCACAGCACTGCGCACCAGGGTGGGTGCGGCACTGATGCCGGTGGCTTGGGTGTGGGTGTCCTGCAGGGTCTTGTCCTTGGACGCAACAAGACTAAGCACCTGCTCGTCGCGCAGACTGCGGAAGTTCTTGACGCGGAACTCGACCAGCATTGCATTCACCTCATTTTTATAAAATAGAGTCTTATTATGACTCAAAAGTGCAAAAAACAAATATATTTTTGAATTCTGGGGTGGTTTGAGTGATCTGGGTTGGCATCGCAGGCCTGCCGGGTGTCAAGTCCCCGTTTGAACTTCGCTGGAGCCTTGACGTGGCACCCAGCTGCGGTTCGGCCAAATCGTCGAATCCGCGCCCTAAAGACCCGTGCGTTACTATCTGGTTCTGAAAGCGGACGCGGGTTCGGTTCCGTGTTCCACCACCAATAATGAAACCCCAACCGTTCTCGGTTGGGGTTTTTTCTTGCCTGATCGCGCCGGTTTCCGCGTGTTGTTGGGGCTTCCTGCGGAAGCCTGCGGACTGCACCGGTCAGCCTTCCAGCCCGTTCCGGGCCACATTCCACTCTCTCCTGGCCATTCCTCGCTCCGACCTCGCTCCCTGGAACTGGCCCGAAGTCCGCAAAGGCCGCAATTCAGACCTATACAGATCAAAGAGTTACGCGCGGACTAATCAAGTGGTTGGATTGCGGCATTGGCTACCGGAGGGGACACACGCTTGCCCCAGCGAGCTATTGCACTTGCAGTTTGTCAGAAACTCGCATATATTTTCGTACATGAACACGACGACCAAGACTGCCGAGCTGATTCGC
GAGCGCATCGAGGCGATGCCGATCGGGGAGCCTTTCACCCCGACAGCATTCCTGGAGTGCGGCACGCGTGCGTCCGTCGATCAGACCCTCTCCCGCCTCGTCAAGGCAGGGTTGATCGAGCGCGTGACGCGCGGTGTCTTTGTGCGTCCCGAGGTCAGCCGTTTCGTCGGCAAGGTTAGCCCCTCGCCGCTGAAGGTGGCCGAGACCGTCGCCAAGACCACGGGTGCCGTCGTCCAGGTTCACGGTGCCGAAGCGGCGCGTCGGCTCGAACTAACCACGCAGGTTCCGACCCAGTCGGTATTCGTGACATCCGGCCCGTCGAAGCGCATCCGCGTGGGGAAGATGGAGATCCGTCTGCAGCACGTCTGTCAGCGCAAGTTGGCCCTGGCAGGTCGACCCGCCGGGCTCGCGCTTGCGGCGATGTGGTATCTCGGCAAGAAGGAAGTGACGCCGGCCCTCGTCGAGAAGATTCGGCGCAAGCTGGGATCGAGCGAGTTCGAGGTGCTGAAGTCAGCCACCAGCTCGATGCCTGCGTGGATGAGCGACGCCATCTTCCGAAACGAGCGGATGGCCGCTCATGCCTGAGTCCTTCCTGCACCTGAAGCCTCAAGAGCAGTCCCAGATCTATCGGGCACTGGCTCCGCAGCTTGCCCGCACGCCCGTCGTACTGGAAAAAGATGTCTGGGTCTGCTGGGTGCTGCAGACCCTGTTCACCATGCCCGACCGACTGCCGATGGCCTTCAAGGGCGGCACATCACTCTCCAAGGTGTTCGGCGCCATTGCGCGCTTCTCCGAGGACGTGGACATCACGCTCGACTACCGTGGCTTAGACGGCTCCTTCGACCCGTTTGCCGAAGGCGTCTCACGCAATCGGCTGAAGAAATTCAGCGAGGATCTCAAGTCCTTCGTGCGCGGCCATGCCCACGGTGTCGTGGCGCCGCACTTTCAGAAGATGCTGGCGGACGAGTTCGATGCCGATGCATTCCAGCTTGAAGTCAGCGATGACGGCGAGCAGATGCGGGTGCACTACCCGAGCGTGCTGGAGGCACCAGGAGACTATGTGGGCAACAGTGTCCTGATCGAGTTCGGTGGCCGTAACATCACCGAGCCGAATGAGGAGCGTGAGGTGCGACCCGACATCGCGGAACATGTCGCTGAACTCGATTTCCCTCGCTCGACGGTCAGTGTGCTGTCTCCGACACGTACCTTCTGGGAAAAGGCGACGCTGATACACGTCGAGTGTCAGCGCGACGAGTTCCGCACAGGCGCCGAACGTCTGTCACGCCACTGGTACGACCTGGCCATGCTGGCCGATCTTGCCCATGGGCAAGCCGCTGTGGCCGATCGCGCTCTGCTCGCGGATGTTGTCAAGCACAAGAAGGTCTTCTACAACGCGAGCTACGCCAACTACGACGCATGCCTGTCCGGGCAGCTCAGACTAATTCCGGAAGATGCTGCACTGGCCGCGCTGCGCGATGACTTCCAGCGCATGATCGGTGCCGGCATGTTCATCGGCGAGCCTCCCGCCTTCGATGCCATCGTCGATCGCCTGCGCGCGCTGGAAACAACAATCAATCAGTGACCTCCCGCTGGCGTTGAGTCGGCAATCCTCGCCCGCTGCTCATCCCACAAGGCTGGCGGATCAACCGCCAGATCAAACAGCGTGATGTGGTTCGGCAGTGCGTCGTCCAGGATGGCGGCCACGATGTCGGGGGCTAGCGTGGTCAGGTTGACCATACGGCTGACGTAGCTGTTGTCGATGCCTTCCCGTGTGGCGATCTCCTTCAAGGACTTCGCTTCTCCTGATTCCAGCATCGCCAACCAGCGGTGGCCCCTGGCCAACGCCAGCTGGATGGAGGTCGGCGCCATGTCCCACGGTCTGACCGGCGCGGTTTCTCCGTTCGGCAAGGTGACCAGCTTGCGGCCGCTACGGCGCTTGATCTGGATCGGTACAGACAGGGTCAGCCTGCCGTCGCTGGTCTGCAGGATGTCCGGCTCGC
CGGTTTTCTGGATGCGGATGTCGCTCATGCCAGGGCCTCCTCAGTTTGCTCGACCGGCTCGGGACGCAGTTCCAGCACCAGGCGTTCGATGCCGTTGGTGCGCAGGCGCACTTCGAGGTCGTTGGGTGACACGATGACTTTCTCGACCAGCAATTTCACGATCCGGGTCTGCTCCGCCGGGAATAGCTGATCCCAAATCGCATCAAGCCGGGTCATGGCCACGGTGATCTTGGCCTCGTCCAGCGTCGGGTCGAGCTTGATCGCCTGTGGCAGCATGTTGCCGAGCAGATTCGGGGCATGCAAAATCGCGCGTAGTTGATCGAGTACCGCCGACTCCAGTTCTGCGGCGGGCAGTCGCGGCAGCCCCGAGGCACCCGCGTGTTCCTTGGCGTCGCGCTGGGGCACGTAGTAACGGTAGCGCCGGCCATTCTTCTTGGTGGTGTGCCACGGCGACAGTGCGCGGCCATCGTTGCCGAACACGATGCCCTTGAGCAGATAGGGAACCTTGGCCCGCGTCTTGTTGCCCCGCACCCGGCCATTCGTCTCCAGGATCGCGTGGACGCTGTCCCACAGTTCGCGGCTGACGATCGGCGGGTGTTCGGCCTGGTACCACTGGTCCTTGTGCCGCAACTCGCCAAGGTAGGTCCGGTTGCTCAGGAGCTTGTAAATGTGGCCCTTGTCGATCGGCCTGCCATCGCGGGTCTTGCCGTCTTGTGTGGTCCACGCCTTCGACGTCACGCCATCCAGTTTCAGCTCCTTGACCAGTGCGGTGCTGGAACCGAGTTCAACGAAGCGCTGGAAGATGTGCCGGATCAGCTTGGACTCACGCTCGTTGGGCACCAACCGCCGGTTCTCGACGTCGTAGCCCAGCGGCGGCACGCCACCCATCCACATACCCTTGCGCTTGCTGGCCGCGATCTTGTCTCTGATGCGCTCACCGGTGACCTCGCGCTCAAACTGCGCAAAGGACAGCAGGATGTTCAACATCAACCTGCCCATCGAGGTCGTCGTGTTGAACTGCTGGGTGACCGACACGAACGACACGCCATAGCGCTCGAACACTTCGACCATCTTGGAGAAGTCCGCCAGGCTGCGCGTCAGGCGGTCGATCTTGTAGATGACGACCACGTCGATCTTGCCGGCTTCGATGTCCGCCATCATTCGCTGGAGCGCCGGGCGTTCCATGTTGCCGCCGGAAAAAGCTGGATCGTCGTAATCGTCGGCGACCGGTATCCAGCCTTCGGCGCGCTGGCTGGCGATGTAGGCATGGCCGGCGTCGCGCTGGGCATCGATGGAGTTGTATTCCTGGTCCAGCCCTTCATCGGTGGATTTGCGCGTGTAGACCGCACAGCGCATGCGGCGCTTCAAGACTTCGCTCATCGTCCACCTCTCTTCTTGGTGGACGGCTTGGCCTTGGCGTTGGACGGCGGCTTGAGCCCGAAGAACAGCGGCCCCGACCAGCGCATGCCGGTGATTTCGCGGGCGATCATCGATAGGCTCGGGTACATGCGTCCCTGGAAGTCATACTGGCCGTCGGCGGTTGCGATCACGCGGTATTCGACGCCTTTGTATTCCCGGACCAGCACCGTGCCTGCCGCCGGACGGTAATCGCGGTCACGCTTTTTCACCTTGCCTGTTTCCACCAGAGATGCGATGCGACGCTGGTTGCGATCCAGCAGGTTGGCGTCGGCCTTGCGGAATTCCAGCTCCTGCAGCCGGTAGGCAATCCGGCGTTCGAGGAACTGGCGGTTGTGGGTGGGAGTGTCGCCACCGACCAGCTTCTGCCAGAGGGCCCGGATCTCTGCCATCGGCATCTCGGGCAGCCTGGCGATCTGCGCCGCCACCGATGGCGGCGTGGAAAATGATGGTGTTTGCGTGCTCATTTCGACTCCGTAGTTGTCTTGTTGACGGGGTCTGTATGAACGCGCTGGTTGCCAGAGAAGCCAAGCTCAAACTCGCTCGCTTCTGCCCTGGTTGCGGACTGTTCTGTGCCGGTGATA
CGCAAGCGTGCCAGGCCGTTGGCCAGCAACGACGCGATCTCGTGACGACGCTGTTTAGCACTGGGCGTCTATCCTGACGTGACGCTATCTGTTGCGAGAAACAAGCGATATGAGGCACGCAAGCTGCTTTCCAATGATGTAGACCCGGCTATGCTCAAACAGGTGACTAAGCGCGCATCGCGCGTGTCTGCTGAAAACAGTTTTGAAGCAATAGCGAGAGAATGGTATGCAAAATTCTCGGGCGAATGGGTGCCTAGCCATGGCGAAAAAATCATCCGCAGATTAGAACGCGACCTGTTTCCCTGGATCGGTAAACGCCCTATTGCCGAGATCACCGCACCTGAACTGTTAGCCGTCTTACGCCGCATTGAAAACCGAGGCGCGCTAGATACGGCGCACCGTGCGCATCAAAACTGCGGGCAAGTGTTCCGCTATGCAATCGCCACTGGGCGCGCTGAACGCGATCCTAGCCCCGACTTGCGCGGCGCATTGCCGCCAGCTGGGTATTCTGGATCATCGTGACCGCTGATTCCGGGCTATCGTGACCGGTCATTCCGGCGCATCGTGACCGGCGATTCCGGTCTATCGTGACCGATTTTGCAGGGTTTCCGGAATCAGTGGTCACGATAGCGGAATCATCGGTCACGATAGCGGAATGGTGTCGTACCGCATGGAAATGGTGTTACGCATAGAGCAACCGAACGAGTACGCTTCCAGCCTTTTGTCTGGAGACAGCGTGCCCGTATCAAGGATCACCATGCGTAAAATTAAAGACGTATTGCGTTTGAAACTGGACGCCAGGCTGTCGCACCAGCAGATCGCCGCTGCGCTGGGCATATCGAAGGGAGTCGTCACCAAGTATGTCGGTCTGGCCGCCGCCGCAGGCCTGGATTGGGCTGCCGTGCAAGACATTGACGAAACCACGTTGGGGCGGCGCCTGCTGGTTACCCCCGAGCGACCGCGCGATCATGTTCAGCCGGACTACGGCCGTTTGCATCAAGAGCTGCGGCGCAAAGGCATGACATTGATGTTGCTCTGGGAAGAGTACCGAGCCGACCACGCCGACCGGCAGACCTATGCTTACTCGCAGTTCTGCGACAACTACCGGCGCTTCGCCAGGCAACTCAAGCGCTCCATGCGCCAGGTTCACCGTGCCGGCGAGAAGCTGTTCATTGATTTCGCCGGCCCCACCATCGCGCTGACCGACGGCAGTCGCGCGCACATCTTCGTCGCGGCACTGGGCGCTTCCAGCTATACCTTTGCCTGCGCCACGCCGCGCGAGACCATGACCGACTGGCTGAAATCGACAGCGCGCGCGTTAAGCTTCATCGGCGGCATGCCCCAGATGATCGTGCCCGACAACCCGAAGGCGCTGATTGCGGACGCCAACCGTTACGAGCCGCGCAGCAACGATACCGTGCTCGATTTCGCGCGCCACTATGGGACGTCGGTGTTGCCAGCACGACCCTACCACCCGCAGGACAAAGCCAAAGCAGAATCGGCGGTACAGATCGTCGAACGCTGGATCATGGCGCGCCTGCGCCACCAGCAATTTGCCAGCGTAGATGATGTCAATCAGGCCATCGCACCGCTGCTTGCCAGGCTCAACGAGAAGCCATTCCAGAAGCTGCCCGGCAGTCGCGCCAGTGCATTTGCCGAAATCGGCGCACCCGCCTTGGCTCCGTTGCCGCTGCAAGCTTATGAGATGGCACACTTCAAGACGGTCAAGGTTCACATCGACTATCACGTAGAAGTCGAACGACACCGCTACAGCGTGCCGCATTCATTGGTCGGACAAGTACTTGAAGCACGGATCACAGTGGCAGTGGTCGAGATCCTGCATCGCGGTAACCGCGTGGCCAGCCATGCCCGCAGCAGTCTGGCCGGTGGCTTTACCACCACCGCCGCGCACATGCCGGCGGCGCATCGCGCCCAGATGGAATGGTCGCCACAACGGCTGATCCACTGGGGCCAAAGCATTGGCCCTGCCGCCGCCGAAG
TGGTGACACGGCTACTGAACAAGTACAAGCATCCCGAACATGGCTACCGCGCCTGCCTTGGGCTGCTGTCGCTGGTCAAGCGTTATGGCAAACCCAGACTGGAGGCGGCCTGTACGCTGGCTTTGCAGATCGGCGTCTGCCAGTACCGCCATGTGCGCGACATCCTGAAGAATAACCGCGACGCAGCCGCGCCGCTCAGCACTGAAGAATGGGTCAGCCCCAACCATGTCCACGTGCGCGGTCCTGGCTACTACCAATAAGGAAAGACAACATGATGATGCATACCACGCTGACGCAATTGCGCAGCCTGAAACTGGATGGCCTGGCGACGGGGCTGGAAGAACAACTGGCACAGCCCGGTATGGCTGCACTCAGCTTCGAAGAACGCGTAGCACTGTTGGTGGACCGGGAAGTCCATGCCCGTAATGACCGCAAACTGGCGCGCCTGCTCAAGAACGCTCGCCTGAAATACGGGCAGGCGGCCATCGAGGATATCGACAGCCGCGCAGGACGCGGTATCGACCGGCGCGAGGTGATGAGCCTGGCTTTGGGCGACTGGGTCAACGCCGGCCACAGCATCCTGATTACAGGACCGACCGGCGCCGGTAAATCCTGGCTGGCCTGCGCATTGGCACAATACGTCTGCCGCCGTGGTTACTCAGCCATCTATCAGCGCGTACCCCGCATGCAGGAAGAACTGCGCATCCGGCACGGCAGCGGCACCTTCGGCAAATGGCTGCTGCAACTGGCCAAGACCGACGTATTGGTTCTCGATGACTGGGGCATGGGCGCTATCGACAGCATGACCCGTTCCGACTTGCTGGAGATCATCGACGACCGTGCCGCCAACAAGGCCACCATCATCACCAGTCAGTTGCCGGTGGAGCACTGGCACGCCTGGATAGGCGATGCCACCATCGCCGACGCCATCCTCGACCGCATCATGCAGCGCAACCACCGCTTCACGCTGACCGGCGAGTCGCTGCGAACAGAACAATCAAAAACAAGCAAAAAGGAGGAAAAAACCACCCCATCGTGA
Protein sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|2049458_2049965_-|WP_147074830.1|DBSCAN-SWA MSTQTPSFSTPPSVAAQIARLPEMPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRNQRRIASLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQGRMYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSTKKRGGR >NZ_AP021884|2044160:2053121|2044160_2045429_-|WP_147073207.1|DBSCAN-SWA MLVEFRVKNFRSLRDEQVLSLVASKDKTLQDTHTQATGISAAPTLVRSAVVYGANASGKSNLIKALQYMRGVVTESATAIQPGQTFAVQPFRLDVDSASQPSEFEVTFLLDGVRYQYGFSMTAQRIVSEHLLVYKAFKPQRWFTRRFDTETGKDVYDFGPGLKGPKNLWEGATRPNALFLSMAVQLNSEALRPVFDWFVNRLVIFNEQAQLSPQVSIQMLKQDDGRKEICNFLSAADISIADIDVETRKVPGQAVHFDLVAGKTEVRSEEMEEHKLRFHHVTEQGKAVFELMDESNGTRNLLFLAGPVLDILRKGLTLVIDELDTSLHTLLVRELVRLFHRPEINTGGAQLIFTTHDTSLLDAPDLFRRDQVWFVEKDRDQTSALVSLSEFSPRKNEALERGYLMGRYGGVPFLSHTLGLKH >NZ_AP021884|2044160:2053121|2047657_2048110_-|WP_147074828.1|DBSCAN-SWA MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVRPWDMAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVNLTTLAPDIVAAILDDALPNHITLFDLAVDPPALWDEQRARIADSTPAGGH >NZ_AP021884|2044160:2053121|2046659_2047664_+|WP_147074827.1|DBSCAN-SWA MPESFLHLKPQEQSQIYRALAPQLARTPVVLEKDVWVCWVLQTLFTMPDRLPMAFKGGTSLSKVFGAIARFSEDVDITLDYRGLDGSFDPFAEGVSRNRLKKFSEDLKSFVRGHAHGVVAPHFQKMLADEFDADAFQLEVSDDGEQMRVHYPSVLEAPGDYVGNSVLIEFGGRNITEPNEEREVRPDIAEHVAELDFPRSTVSVLSPTRTFWEKATLIHVECQRDEFRTGAERLSRHWYDLAMLADLAHGQAAVADRALLADVVKHKKVFYNASYANYDACLSGQLRLIPEDAALAALRDDFQRMIGAGMFIGEPPAFDAIVDRLRALETTINQ >NZ_AP021884|2044160:2053121|2046046_2046667_+|WP_147074826.1|DBSCAN-SWA MNTTTKTAELIRERIEAMPIGEPFTPTAFLECGTRASVDQTLSRLVKAGLIERVTRGVFVRPEVSRFVGKVSPSPLKVAETVAKTTGAVVQVHGAEAARRLELTTQVPTQSVFVTSGPSKRIRVGKMEIRLQHVCQRKLALAGRPAGLALAAMWYLGKKEVTPALVEKIRRKLGSSEFEVLKSATSSMPAWMSDAIFRNERMAAHA >NZ_AP021884|2044160:2053121|2050160_2050604_+|WP_147074831.1|DBSCAN-SWA MTLSVARNKRYEARKLLSNDVDPAMLKQVTKRASRVSAENSFEAIAREWYAKFSGEWVPSHGEKIIRRLERDLFPWIGKRPIAEITAPELLAVLRRIENRGALDTAHRAHQNCGQVFRYAIATGRAERDPSPDLRGALPPAGYSGSS >NZ_AP021884|2044160:2053121|2048106_2049462_-|WP_147074829.1|DBSCAN-SWA 
MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWIPVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLADFSKMVEVFERYGVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGERIRDKIAASKRKGMWMGGVPPLGYDVENRRLVPNERESKLIRHIFQRFVELGSSTALVKELKLDGVTSKAWTTQDGKTRDGRPIDKGHIYKLLSNRTYLGELRHKDQWYQAEHPPIVSRELWDSVHAILETNGRVRGNKTRAKVPYLLKGIVFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELESAVLDQLRAILHAPNLLGNMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLFPAEQTRIVKLLVEKVIVSPNDLEVRLRTNGIERLVLELRPEPVEQTEEALA >NZ_AP021884|2044160:2053121|2052353_2053121_+|WP_147074833.1|DBSCAN-SWA MMMHTTLTQLRSLKLDGLATGLEEQLAQPGMAALSFEERVALLVDREVHARNDRKLARLLKNARLKYGQAAIEDIDSRAGRGIDRREVMSLALGDWVNAGHSILITGPTGAGKSWLACALAQYVCRRGYSAIYQRVPRMQEELRIRHGSGTFGKWLLQLAKTDVLVLDDWGMGAIDSMTRSDLLEIIDDRAANKATIITSQLPVEHWHAWIGDATIADAILDRIMQRNHRFTLTGESLRTEQSKTSKKEEKTTPS >NZ_AP021884|2044160:2053121|2050818_2052342_+|WP_147074832.1|transposase|DBSCAN-SWA MPVSRITMRKIKDVLRLKLDARLSHQQIAAALGISKGVVTKYVGLAAAAGLDWAAVQDIDETTLGRRLLVTPERPRDHVQPDYGRLHQELRRKGMTLMLLWEEYRADHADRQTYAYSQFCDNYRRFARQLKRSMRQVHRAGEKLFIDFAGPTIALTDGSRAHIFVAALGASSYTFACATPRETMTDWLKSTARALSFIGGMPQMIVPDNPKALIADANRYEPRSNDTVLDFARHYGTSVLPARPYHPQDKAKAESAVQIVERWIMARLRHQQFASVDDVNQAIAPLLARLNEKPFQKLPGSRASAFAEIGAPALAPLPLQAYEMAHFKTVKVHIDYHVEVERHRYSVPHSLVGQVLEARITVAVVEILHRGNRVASHARSSLAGGFTTTAAHMPAAHRAQMEWSPQRLIHWGQSIGPAAAEVVTRLLNKYKHPEHGYRACLGLLSLVKRYGKPRLEAACTLALQIGVCQYRHVRDILKNNRDAAAPLSTEEWVSPNHVHVRGPGYYQ |
9 | Acidithiobacillus_phage(66.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
| DBSCAN-SWA_7 |
2294456 : 2305315
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|DBSCAN-SWA TCTAACTGCGCTGGTAAATGTCCTCAAAGCGCACAATATCATCCTCCCCCAGATACGCGCCCGACTGCACCTCGATCATGTGCAATGCAATCTTGCCCGCATTTTCCAGCCGGTGCGTACTGCCCAATGGAATGTAGGTGGACTGATTCTCGGTGAGCAGCAGCACTTCGTCATTACGCGTAACGCGTGCCGTGCCGCTCACCACTATCCAGTGCTCGGCACGGTGGTGATGCATTTGCAACGACAGTTTCTCGCCCGGTTTGACCATGATGCGCTTGACCTGAAAACGCTCCCCCGCATCAATACCTTCGTACCAACCCCACGGGCGAAACACACGGGTATGGTTGAGATGCTCGGTACGACGGTGTTGTTTCAGGTGCTCGACCACTTTTTTGACGTCCTGCACGCGGTCTTTGTGCGCCACCATCACGGCATCGCTGGTTTCCACGATCACCAGGTCGGATACGCCAATTACCGCGACCATGCGGCTCTCGGCACGGATCAGATTGTTGCTGGCACCGTCGTTGTAGATGTCACCGCGCATGACATTGCCAGCACGATCCTTGGCACCGATTTCCCACAGTGCCGACCACGAGCCGATATCGCTCCAGCCAATGTCGGCGGGCACCACCACGGCGTTACGGGTGCGTTCCATGACGGCGTAATCGATGGATTCGGAGGGGCAAGCCGTAAATGCCTGGGTATCCAGACGGACAAAGTCCAGATCACGCTGGCTGCTGTCCAGTGCCGCCTGGCTGGCGGCGAGAATATCCGGGCGGTAGCCGCGCAATTCATTGACAAACGCCGCAGCCTTGAACAGAAACATGCCGCTGTTCCAGAAATAATCGCCCGATTGCAGATAGCCCTGGGCGATTTCACGGCTGGGTTTTTCCACAAAACGCCCTACTGCAAACACACCGTCCAAATGGCTGTCCGCCGGGCCGCGCTGGATATAGCCATAGCCGGTTTCAGGCGCCTGTGGCACGATGCCGAAGGTCACCAGCTGACTAGCCTGGGCGGCATCCACGGCCTGGGCCACGGCGCGCTCAAATGCGTCCACATCTGCTATCAAGTGATCGGCCGGCAACAGCAGCATCAGGGCCTCGGCATCCCGCGCCATGAGTGCCAGCGCTGCGACTGCCGCTGCCGGTGCGGTATTGCGCCCGATGGGCTCGAGAAAAATAGTCTCGGGCGTGACCTCAATCGCGCGCATCTGCTCGGCCACCATGAAGCGGTGCTCATGGTTGCATACCAGCGTGGGCGGCGTGATGTCGGCGATACCGGAAAGGCGCAGCACGGTTTCCTGCAGCATGGTGCGCTCCGACACCAGCGGCAACAACTGCTTGGGCAACGCGGCGCGGGACAAGGGCCACAGGCGGGTACCGGACCCCCCGGAGAGAATGACGGGATGAATGCGCATGTTGGTTGTTTCCTCAATCAATTCAGTGAATTCATCAGCCTGCCCAAGCTGGGGAAAGGCATCCATGCGATACAGTCCGGCGCGCTGCATCGAGAGCAAGACATCTACGCTACGCGAACCAATGAAAACCGGCTGGCCAGTCAGCCAAACACATATCATGTCTCCTTTTCGGGCTTGTCCAAACACAGGCCCAACGCCGCATCCCAATCCGGCAATAGCAAACCAAAGCTGCGCAGCAGCTTGTCCCCGGCCAGCACGGAATTAGCCGGGCGCCTGGCCGGGGTGGGGTAATCAGTGCTTGGGATCGGGATCAGCTCAGGACGTTTGGCGCCCGCCTGGGTGGTTTGGTCGAGGATGGCCTGGGCGAATTGATACCAGCTCGCCCGGCCCCGGCTGCTGAGGTGGTAAGTGCCGTGCAGGGCTTCTTTACCCTGGCCAAACGGCTGTTGCGCCAGTATCTGCGCCGTGGCCTCGGCGATCATGCGTGACCAGGTGGGTGCGCC
GTATTGATCGGCAACAATCCGCAACTGCTCGCGCTCCTTGAACAGGCGCTGCATGGTGAGCAGGAAATTGCCAGCGCGCAGGCCATACACCCAGCTGGTGCGCAGAATGAGGTGGGGAATGGCGGCAGCACGGATGGCGTTCTCACCTTCGAGTTTGGTCTGCCCATAGACGCCTAGCGGGTGGGTGGTGTCGTCCTCAGTGTAGGCACCGGGCTTGTTGCCATCGAACACATAATCGGTAGAGTAGTGAATCATCGCCGCGCCGAGTTTTGCCACCTCTTCGGCCATGATGGCGGGAGCAATGGCATTCACCGCACGTGCCAGTTCCGGCTCGGATTCCGCCTTGTCCACGGCGGTATGGGCGGCCGGGTTGACGATCAGGTTCGGGCGCAAGCTACGTATGAGGCTGCGGATGGCACCCGGGTCGGTCAGGTCAAGCTGGCTGCGGGTGGGCGCGCTCACCTTGCCCAAAGTCGCCAATGTGCGGCGCAACTCCCAGCCGACCTGGCCATTGACACCGGTGAGCAGGATATTCACGCAAACACCTCGCACTGCGCCAGCGGCAGGCCGGCGGCGTCCTTGGCGGCAAGTGCGGGTTCGCCGTGCAGTGGCCAGGTGATGCCCAGCGCCGGATCATTCCACAGCAGGCTGCGCTCGAACTGCGGCGCCCAGTAGTCGGTGGTTTTGTAGAGAAACTCTGCGCTGTCGGAGATCACCAGGAAACCATGGGCGAAGCCTTTGGGTATCCAGGCCATGCGTTTGTTTTCGGCGGACAGTTCCATCCCCACCCATTTGCCAAACGTGGGTGAGGATTTGCGCAAGTCCACCGCCACGTCATACACGCTACCACTAATGACGCGTACCAGCTTTCCCTGAGTATTTTGAATCTGGTAATGCAAGCCACGTAATACGCCCTTGGCTGAGCGCGAGTGATTGTCCTGCACGAAGTCGTCGGGGATACCGGCCCCGGTCATGGCGCGACGGTTGTAGCTCTCGTAGAAAAAGCCGCGCGCGTCGCCGAATACTTTGGGTTCGAGCACAAGCACGTCGGGGATTTCAGTGGGGATGATGTTCATGGTTAAAACAACCGTTCGTTGAGCATGGCCAATAAGTATTGGCCATAGGCATTTTTCTGCAAAGGCGCTGCCAGCCGTCCGACCTGGGCGGCATCAATATAGCCCTGGCGGTAGGCGATTTCTTCCGGACAGGAAATTTTGAGACCCTGGCGCTTTTCAATGGTCTGGATGAATAGCGATGCCTCCAGCAGGGATTCATGGGTGCCGGTGTCCAGCCAGGCATGGCCGCGCCCCATGACTTCCACCTGCAATTGCCCCATTTCGAGGTAATGCCGGTTAATGTCGGTGATTTCCAGTTCGCCGCGCGGAGATGGCTTGAGCTTGCGGGCGATATCGATGACCTGGTTATCGTAAAAATACAGGCCAGTGACGGCGTAGCGCGATTTGGGCTGGGCGGGTTTCTCTTCCAGGCTGATGGCGTTGCCTTGAGCGTCGAATTCAACCACGCCATAGCGTTCCGGGTCATGCACCGGGTAGGCAAACACCGATGCGCCAAACTGGCGGGCAGCGGCGGCGCGCAGGCCGCCGGAAAATTCATGACCGTAAAAGATGTTGTCGCCCAGTATGAGAGCGCTCGACGCATCGCCAATGAAATCCGCGCCAATGACAAAGGCTTGCGCCAGACCGTCGGGCGAGGGCTGAACAGCATAGCTGAGGCGAATCCCCCACTGGCTGCCATCACCCAGCAACTGCTCGAAACGCGGGGTATCCTGCGGGGTGGAAATGATGAGGATGTCCCGGATTCCTGCCAGCATCAGCGTGGTAAGCGGGTAGTAGATCATGGGCTTGTCGTACACCGGCAGCAATTGCTTGGAGACTGCCTGGGTCACGGGATATAGCCGGGTACCCGAGCCACCGGCCAGAATAATGCCCCTGCGCGCGCTCATGCCTGGCTCCCATACTGTTTTTCTACCCAGTGGCGGT
ATTCACCGGTGGCGATGTTGGCTACCCAGTCCGGGTTGGCCAGGTACCAGGCAACGGTTTTGCGAATCCCGGTCTCAAACGTCTCTTGTGGGCGCCAGCCCAGTTCGCGCTCGATTTTGTGTGCGTCAATGGCGTAGCGACGATCGTGGCCAGCGCGGTCCTTGACATGAGTGATGAGTTTTTCGTGAGGGGTGACGGGTGAACCCGGGTGCAGCGCGTCAAGCATGGCGCAGATGGTCCTGACCACATCGATATTGGTTTTTTCGTTGCAACCGCCAATGTTGTATACCTCGCCTGCCTTGCCCGCAGCCAGCACGGCGCGGATGGCGCTGCAATGGTCGCCGACATAGAGCCAGTCACGTACGTTAAGTCCGTCGCCGTAGATCGGCAGCGGCTTGCCGGCCACGGCGTTCATCATTACCAGCGGGATGAGCTTTTCCGGGAACTGGTAAGGGCCGTAGTTGTTGGAGCAGTTGGTAGTGAGTACCGGCAGGCCATAGGTGTGGTGATAGGCGCGCACCAGGTGGTCGGAGGCAGCCTTGGAGGCTGAGTACGGGCTGTTGGGTGCGTAGGCAGTGGTCTCGGTAAACGCGGCATCATCCGGGCCGAGGGAACCGTACACCTCATCGGTGGAAACGTGCAGGAAGCGGAATCCGGCCTTCTCTATCCCTTCCATTGCATTCCAGTAAGCGCGGATTTCTTCCAGAAAATGAAAAGTGCCGACGACATTGGTCTGGATAAAGTCTTCGGGGCCGTGAATGGAACGATCGACGTGGCTTTCGGCAGCGAAGTTGATGACGGCGCGCGGGCGATGTTCAGCCAAAAGACCTGCGACCAGGGCGCGGTCGCCAATATCGCCTTGTACGAAGAGATGGCGGGCGTCACCCTCAATACTGGCGAGGTTGTGCAGATTGCCTGCGTAGGTGAGTTTGTCGAGGTTGATGACCGGCTCGTCACTGTCCGCCAGCCAGTCCAATACGAAATTGGCACCGATAAAACCGGCGCCGCCTGTGATTAGAATCATGGTAATTCCCGGTTAGCCTTTATTTTATTTCCGGTAACCCGGCGCTGCTGTTCTCGGACTTGCTCTGTGCCATTTTTCCCAGGCTTGCAAACTCGCCCACGTATTCAATTTTTGCTGCATCCCGTAGTTTTTTGATGTCAGCGGTAGCGGCATCGCGCTGTCGGGCAGCAGCCAGATAACGCTCTATTAACGTATGTGCCTTGTCCAGGGTAATCGGTGCAGATTGTATGGAGGCTATCTGCAATACCGTTATCCCGTTTGCAGACGGTAACGCCAGTAACTGTCCATTCTGCATGGTCTGCATGCGCGGCACAATATTCATCGGCAATTGTTCCGCCGCTTTGACCTCGCTGCCGGTGCGGAACGGTATATTCTGGCTGCGCAGCCAGTCCACAAATTCGCCCAGATCATGGCTGGTTTCAAGACGTGAGTTGAGTACAGCGATCCTGCCCCTGGGGGCAGCGATGGCCAGTTCCTGCAGATTATAGATACGACGCTGACTGAACAACTCGGGATGCCGGTTGTAGTAGGCGCTGATGTCGGCTTCGCCAGGTTTGGCGACAGCCTGCCCTGCACGCTGCAGATAGGCCTGGGACAGCACCTGATTGCGCGCGGCCTCCAGCAATTGCTGCACAGCAGGATCCTGATCCAGTTTTTGCGCAGTGGCTTTTTGTACCAGAAGTTGCTGATCCACCAGTGCCTGCACTACCCGATTTTTAGCTTGCGGCGTCAAATCCTGCGCTGCCAGATTCAAGCGCGACAAAGCCAGGTCCAGTTGTGCGCTAGTGATAGCGGTGCCGTTCACGCTGGCGATGGCTGAAGACGGCGTTTCCTGCTTGCTGCAGCCGGCGAGCACTGTTCCCAGCATCAGCGCCAGCGCCATTTTATGCATAATTGGATGCGATTTCATTCGCGTCATCTCCCTGGATTGGTATGTTTTGTGGCGTGATTCACACCCGGCGTCCATGATGCGCAA
TTATCGGTTGTCTCACAAGCGCCAGCATATAGACCGATCGAATTACAAAATGGTAATGATCATAAAAAATGGCTGGCTCCCTCCGCGGAAGCCAGCCACTACGCTACCTGATGCAGGCAATATCTAGACAGCCACGTTTTCCTCGTCGAATGCCAGTTTGATTGTGCCGGATTCGTCCACGTCCACGACCACATGCCCACCATTGGCCAGACGGCCAAACAGCAATTCATCTGCCAGTGCCCGCCGTATCTCGTCCTGGATCAGCCTGGCCATGGGGCGCGCGCCCATCAACGGGTCAAAACCGCGTTTGCCGAGATGCGCCTTGAGCGCGTCGGTAAACGTCGCCTCGACCTTTTTCTCGTGCAACTGGTCTTCCAGCTGCATCAGGAATTTATCCACCACGCGCAGGATGACCTCCTGGGACAAGGGTGCAAACGAGATCATCGCATCCAGCCGGTTACGGAACTCCGGCGTAAAGGCACGCTTGATGTCTGCCATTTCGTCGCCGGTTTGTTTTTCCTGGGTAAAGCCAATCCCCGACTTGTTGAGCGACTCCGCACCCGCGTTGGTGGTCATCACAATCACCACGTTACGGAAATCCGCCTTGCGCCCATTGTTGTCGGTAAGCGTGCCGTGATCCATCACCTGCAACAGTACGTTGAATACGTCCGGATGCGCTTTTTCGATTTCATCCAGCAACAGCACCGCGTAGGGATGTTTGGTAATCGCCTCGGTCAACAGGCCGCCCTGGTCAAACCCGACATAGCCCGGTGGGGCGCCTATCAACCGCGACACGGCATGGCGTTCCATGTATTCGGACATATCGAAGCGAATCAGCTCAATGCCCATGATATACGCGAGCTGCCGCGCCACTTCGGTCTTGCCCACCCCGGTGGGGCCAGAAAACAGGAAAGAGCCGATAGGCTTCTGCGGATTACCCAAGCCACTGCGTGCCATCTTGATCGCGGCTGCCAGCGCATTGATCGCCTTATCCTGGCCAAACACCACGGTCTTGAGGTCGCGATCGAGGTTTTTCAGCGCATCGCGATCATCACTATTCACATTCTTGGACGGAATCCGCGCAATCTTGGCGATGATCTCTTCGATCTCGCGCTTGCTGATGACTTTCTTCTGCCGTGATTTGGGCAGGATACGCTGCGCCGCGCCGGCTTCGTCGATCACGTCGATTGCCTTGTCCGGCAGGTGGCGGTCATTGATATAGCGTGCCGACAGCTCGGCTGCCGTGGTGAGCGCCGACGCGGTGTATTTGATGCCGTGATGCGCTTCAAAACGCGATTTCAAGCCACGCAGAATCTCGACTGTCTCCTCGATCGAAGGCTCATTCACATCAATCTTCTGAAACCGCCGCGATAACGCATGGTCTTTTTCGAAAATGCCACGATACTCGTTGTAAGTGGTTGCACCGATGCATTTGAGTTGCCCGGATGAAAGCGCCGGTTTAAGCAGGTTGGAAGCATCCAGGGTGCCGCCCGAAGCCGCACCTGCACCAATCAGCGTGTGAATTTCGTCTATGAACAAAATTGCCTGCGGGTTTTCGTATAGCTGCTTCAACACGGCCTTGAGGCGCTGCTCAAAATCACCACGGTATTTGGTGCCTGCCAACAGGGCTCCCATGTCCAGCGAATACACCGTGCTGTCCGATAGAATATCCGGTACCACACCCTCAACGATACGCCGCGCCAGACCTTCAGCGATGGCCGTTTTGCCGACTCCGGCTTCACCCACCAGCAGCGGGTTGTTCTTGCGCCGCCGGCACAATGTCTGGATGACACGCTCCAGTTCCAGCGCACGGCCAATAAGCGGGTCTATTTTTCCCGCCAGCGCCTGCACATTCAGGTTCTGCGTATAGGTTTCCAGCGCCGTTGCGGGTGCAGCTTCCTCACCCGCCTCAGGGGTCGCCTCCGGGCGCGCAGTGCTGCCCTGCGGGACTTTGCTCACCCCATGGGAAATGAAATTCACCACGTCCAGACGCGATACAC
CCTGCTGATTCAGGAAATACACCGCGTGGGAATCCTTCTCGCCAAAAATGGCCACCAGCACGTTCGCGCCGGTCACTTCTTTTTTGCCTGACGACTGCACATGCAAAATGGCGCGTTGAATCACGCGCTGAAATCCGAGCGTAGGCTGGGTATCGACTTCCTCGCTTCCTGCAACGGTAGGGGTGTGTTCGGTGATGAAATCAGCCAGTCCACGACGCAGTTCGTCGGTATTGGTGCCGCACGCGCGCAACACCTCGGCGGCGGACGGATTATCCAGCATCGCCAGCAACAGGTGCTCGACCGTAATAAACTCATGGCGCTTTTGTCTCGCCTCCATGAACGCCATATGTAAACTAACTTCCAATTCCTGCGCAATCATCTAATTTTCCTCCATCACGCATTGCAGCGGATGCTGGTGCTGGCGGGCGAACCCGACCACTTGCTCTACCTTGGTTGCCGCCACATCGCGGGGGAATACGCCACACACTCCCATGCCGTCCCTATGTACTTTGAGCATGATTTGCGTAGCCTGTTCACGGCTCTTGTAAAAAAAGTTCTGAATCACAAGAACCACAAAATCCATGGGCGTGTAGTCGTCATTCAACAACATTACCTTGTACAAAGGCGGCGGCTTGAGTTTTGTTTCGCTTGCTTCCAGAACGGTGTCATCGCGGTGCTTGGTTGCCATGGCGCTGGATAGTTTCCGGATAGTCAGGAACCATTTTGACGACTGGCGCGAAATTTTCAAGTGCTATCCAGTAAAAAAAAATTTGCCTGGCACTTGCCAAATCAACTAGGCAGGCGTAAAAAGCAATCTGGAGTTTGGCGTCAAGGTTCCTGCAAGTCTGGTAATGCCGTTGTGCGATCTTGATTTTCAAACAGCCCCTGGCCGTTTTGGCCTGTTTTTATCAAGGAAGTAGCAATGGCAACTGGCACTGTAAAGTGGTTCAACGATTCTAAAGGCTTTGGGTTTATTACCCCGGACGACGGTAGTGAAGATCTTTTCGCTCACTTCTCCGCCATCAACATGGGTGGTTTCAAAACCCTGAAGGAAGGTCAAAAAGTCCAATTCGAGGTCTCCCAGGGCCCGAAAGGCAAACAGGCTTCGAACATTCAGCCTGCATAAATCGGCTCACCGATTACTTGAAAACGCGGAACCTGGTTCCGCGTTTTTTTCGCCTCGGGTTTTGAGTTCTGGCTTTTCAAAAGCGCTGTGCTACATTGAAATATCTGTAACACATCCATTTAAGGAGAGCAGAATATGCACATTCAACACCAGCCTGATGGTTCCCTGGTCCTGGACATGAGCCAGAAACAGGCGCGAGAACTCGCAAAAACCGTCATCCAGCACGCCGAAGATGCGCATACCGCACTGCTGGATTTTGCCTACCTGCTGAACGAAGCGCATTACGATGCGGAGAACCAGTTCCGGCAACCACCTCATGCCTGGGAACCGGGTGCGCATCAGCCTGGTACAGAATAGGGGGCTACCATGAACATTTCTGCACTCGACAAACAGACTGCCCAGATCAGTGTGTTGCCGACCGAGGCCGCGCATTTGCTGGAGGGCCTCGAAGCCATGCGCGACGAACTCGGTGAAATCGCCGACGAGTTAATCAGTCTGCTGCGCGGCAGTGGCATTGAACCACCACCCAAACCCGATCATGTTCGCACTGAATACGCCGGGCCTGAGTAAACTTACATGCGCGCGATCATCGCCTCGCCAAATGCTGAACAGGACACCTGCGTCGCACCGTCCATAAGGCGAGCAAAATCGTAGGTTACCGTTTTAGCAGCAATTGCACGCTGCATACTCGCGGTGATGATATCAGCGGCTTCCAGCCAGCCAAGGTGGCGCAGCATCATCTCCGCGGAAAGAATAATCGAGCCCGGGTTGACGTAATCCTGGCCCGCATATTTTGGTGCAGTACCATGCGTGGCCTCAAACATGGCGACTGAATCGGACAGATTGGCGCCCGGCGCAATACCAATGCCGCCCA
CCTCAGCCGCGAGCGCGTCCGAGATGTAATCGCCGTTCAGATTAAGCGTCGCAATCACGTCGTACTCATCCGGGCGCAACAGTATCTGTTGCAAGAATGCATCGGCAATCACATCCTTGATGACAATGCCGTTGGGCAGCCTGCACCACGGGCCGCCATCCATCTCCACCGCGCCAAATTCACGCCTGGCCAGTTCATAGCCCCATTTTTTGAAGCCTCCCTCGGTGAACTTCATGATATTGCCCTTGTGTACCAAGGTAACGGACTCGCGGCCATTGTCGATGGCATACTGAATCGCCTTGCGGATCAGGCGCTCGCTGCCCTGCACGGAAACCGGTTTGATACCAATGGCGGAAGTTTCCGGGAAGCGAATTTTCTTCACCCCCATTTCGCCTTGCAGGAAGGCGATGATCTTTTTCACCTCATCCGAGCCAGCTTGCCACTCCACCCCGGCGTAAATATCCTCGGTATTTTCGCGGAAGATCACCATATCCACTTTTTCCGGCGCTTTCACCGGACTGGGCACGCCATCGAAGTAACGCACCGGGCGCAGGCAGACATACAAATCCAGCAACTGGCGCAACGCCACATTCAGGGAGCGCATGCCACCGGAAGTCGGCGTGGTCAACGGCCCCTTGATGGAGACGACGTATTCGCGCACGGCGGCTACCGTTTCATCGGGTAACCAGTTGTCGCCACCATAGACCTTGACGGCCTTTTCGCCCGCATACACTTCCATCCAGGCGATACTGCGCCTGCCACCATATGCCTTGGCCACGGCCGCATCCACCACGCGACGCATCACCGGGGTGATATCCACACCGGTACCATCACCTTCGATGAAGGGAATAACCGGCTGATCGGGGACATTGAGCGAAGCGTCAGTGTTGATCGTGATTTTTTCGCCGTGAGTCGGCAGCTGTATATGCTGGTACAT
Protein sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|2300501_2302757_-|WP_147074450.1|protease|DBSCAN-SWA MIAQELEVSLHMAFMEARQKRHEFITVEHLLLAMLDNPSAAEVLRACGTNTDELRRGLADFITEHTPTVAGSEEVDTQPTLGFQRVIQRAILHVQSSGKKEVTGANVLVAIFGEKDSHAVYFLNQQGVSRLDVVNFISHGVSKVPQGSTARPEATPEAGEEAAPATALETYTQNLNVQALAGKIDPLIGRALELERVIQTLCRRRKNNPLLVGEAGVGKTAIAEGLARRIVEGVVPDILSDSTVYSLDMGALLAGTKYRGDFEQRLKAVLKQLYENPQAILFIDEIHTLIGAGAASGGTLDASNLLKPALSSGQLKCIGATTYNEYRGIFEKDHALSRRFQKIDVNEPSIEETVEILRGLKSRFEAHHGIKYTASALTTAAELSARYINDRHLPDKAIDVIDEAGAAQRILPKSRQKKVISKREIEEIIAKIARIPSKNVNSDDRDALKNLDRDLKTVVFGQDKAINALAAAIKMARSGLGNPQKPIGSFLFSGPTGVGKTEVARQLAYIMGIELIRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTEAITKHPYAVLLLDEIEKAHPDVFNVLLQVMDHGTLTDNNGRKADFRNVVIVMTTNAGAESLNKSGIGFTQEKQTGDEMADIKRAFTPEFRNRLDAMISFAPLSQEVILRVVDKFLMQLEDQLHEKKVEATFTDALKAHLGKRGFDPLMGARPMARLIQDEIRRALADELLFGRLANGGHVVVDVDESGTIKLAFDEENVAV >NZ_AP021884|2294456:2305315|2294456_2295875_-|WP_147074479.1|DBSCAN-SWA MRIHPVILSGGSGTRLWPLSRAALPKQLLPLVSERTMLQETVLRLSGIADITPPTLVCNHEHRFMVAEQMRAIEVTPETIFLEPIGRNTAPAAAVAALALMARDAEALMLLLPADHLIADVDAFERAVAQAVDAAQASQLVTFGIVPQAPETGYGYIQRGPADSHLDGVFAVGRFVEKPSREIAQGYLQSGDYFWNSGMFLFKAAAFVNELRGYRPDILAASQAALDSSQRDLDFVRLDTQAFTACPSESIDYAVMERTRNAVVVPADIGWSDIGSWSALWEIGAKDRAGNVMRGDIYNDGASNNLIRAESRMVAVIGVSDLVIVETSDAVMVAHKDRVQDVKKVVEHLKQHRRTEHLNHTRVFRPWGWYEGIDAGERFQVKRIMVKPGEKLSLQMHHHRAEHWIVVSGTARVTRNDEVLLLTENQSTYIPLGSTHRLENAGKIALHMIEVQSGAYLGEDDIVRFEDIYQRS >NZ_AP021884|2294456:2305315|2298337_2299402_-|WP_147074452.1|DBSCAN-SWA MILITGGAGFIGANFVLDWLADSDEPVINLDKLTYAGNLHNLASIEGDARHLFVQGDIGDRALVAGLLAEHRPRAVINFAAESHVDRSIHGPEDFIQTNVVGTFHFLEEIRAYWNAMEGIEKAGFRFLHVSTDEVYGSLGPDDAAFTETTAYAPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYQFPEKLIPLVMMNAVAGKPLPIYGDGLNVRDWLYVGDHCSAIRAVLAAGKAGEVYNIGGCNEKTNIDVVRTICAMLDALHPGSPVTPHEKLITHVKDRAGHDRRYAIDAHKIERELGWRPQETFETGIRKTVAWYLANPDWVANIATGEYRHWVEKQYGSQA >NZ_AP021884|2294456:2305315|2297456_2298341_-|WP_147074453.1|DBSCAN-SWA 
MSARRGIILAGGSGTRLYPVTQAVSKQLLPVYDKPMIYYPLTTLMLAGIRDILIISTPQDTPRFEQLLGDGSQWGIRLSYAVQPSPDGLAQAFVIGADFIGDASSALILGDNIFYGHEFSGGLRAAAARQFGASVFAYPVHDPERYGVVEFDAQGNAISLEEKPAQPKSRYAVTGLYFYDNQVIDIARKLKPSPRGELEITDINRHYLEMGQLQVEVMGRGHAWLDTGTHESLLEASLFIQTIEKRQGLKISCPEEIAYRQGYIDAAQVGRLAAPLQKNAYGQYLLAMLNERLF >NZ_AP021884|2294456:2305315|2302757_2303066_-|WP_124705779.1|protease|DBSCAN-SWA MATKHRDDTVLEASETKLKPPPLYKVMLLNDDYTPMDFVVLVIQNFFYKSREQATQIMLKVHRDGMGVCGVFPRDVAATKVEQVVGFARQHQHPLQCVMEEN >NZ_AP021884|2294456:2305315|2299421_2300312_-|WP_161984264.1|DBSCAN-SWA MKSHPIMHKMALALMLGTVLAGCSKQETPSSAIASVNGTAITSAQLDLALSRLNLAAQDLTPQAKNRVVQALVDQQLLVQKATAQKLDQDPAVQQLLEAARNQVLSQAYLQRAGQAVAKPGEADISAYYNRHPELFSQRRIYNLQELAIAAPRGRIAVLNSRLETSHDLGEFVDWLRSQNIPFRTGSEVKAAEQLPMNIVPRMQTMQNGQLLALPSANGITVLQIASIQSAPITLDKAHTLIERYLAAARQRDAATADIKKLRDAAKIEYVGEFASLGKMAQSKSENSSAGLPEIK >NZ_AP021884|2294456:2305315|2304076_2305315_-|WP_147074447.1|DBSCAN-SWA MYQHIQLPTHGEKITINTDASLNVPDQPVIPFIEGDGTGVDITPVMRRVVDAAVAKAYGGRRSIAWMEVYAGEKAVKVYGGDNWLPDETVAAVREYVVSIKGPLTTPTSGGMRSLNVALRQLLDLYVCLRPVRYFDGVPSPVKAPEKVDMVIFRENTEDIYAGVEWQAGSDEVKKIIAFLQGEMGVKKIRFPETSAIGIKPVSVQGSERLIRKAIQYAIDNGRESVTLVHKGNIMKFTEGGFKKWGYELARREFGAVEMDGGPWCRLPNGIVIKDVIADAFLQQILLRPDEYDVIATLNLNGDYISDALAAEVGGIGIAPGANLSDSVAMFEATHGTAPKYAGQDYVNPGSIILSAEMMLRHLGWLEAADIITASMQRAIAAKTVTYDFARLMDGATQVSCSAFGEAMIARM >NZ_AP021884|2294456:2305315|2296911_2297454_-|WP_147074454.1|DBSCAN-SWA MNIIPTEIPDVLVLEPKVFGDARGFFYESYNRRAMTGAGIPDDFVQDNHSRSAKGVLRGLHYQIQNTQGKLVRVISGSVYDVAVDLRKSSPTFGKWVGMELSAENKRMAWIPKGFAHGFLVISDSAEFLYKTTDYWAPQFERSLLWNDPALGITWPLHGEPALAAKDAAGLPLAQCEVFA >NZ_AP021884|2294456:2305315|2296030_2296915_-|WP_147074455.1|DBSCAN-SWA MNILLTGVNGQVGWELRRTLATLGKVSAPTRSQLDLTDPGAIRSLIRSLRPNLIVNPAAHTAVDKAESEPELARAVNAIAPAIMAEEVAKLGAAMIHYSTDYVFDGNKPGAYTEDDTTHPLGVYGQTKLEGENAIRAAAIPHLILRTSWVYGLRAGNFLLTMQRLFKEREQLRIVADQYGAPTWSRMIAEATAQILAQQPFGQGKEALHGTYHLSSRGRASWYQFAQAILDQTTQAGAKRPELIPIPSTDYPTPARRPANSVLAGDKLLRSFGLLLPDWDAALGLCLDKPEKET 
>NZ_AP021884|2294456:2305315|2303870_2304074_+|WP_147074448.1|DBSCAN-SWA MNISALDKQTAQISVLPTEAAHLLEGLEAMRDELGEIADELISLLRGSGIEPPPKPDHVRTEYAGPE >NZ_AP021884|2294456:2305315|2303639_2303861_+|WP_147074449.1|DBSCAN-SWA MHIQHQPDGSLVLDMSQKQARELAKTVIQHAEDAHTALLDFAYLLNEAHYDAENQFRQPPHAWEPGAHQPGTE >NZ_AP021884|2294456:2305315|2303300_2303504_+|WP_124705778.1|DBSCAN-SWA MATGTVKWFNDSKGFGFITPDDGSEDLFAHFSAINMGGFKTLKEGQKVQFEVSQGPKGKQASNIQPA |
12 | Escherichia_phage(33.33%) | protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
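The protein records in the prophage regions above use pipe-delimited headers, and some of them carry an extra field (for example `transposase`, `protease`, `portal`) that appears to correspond to the phage-related keywords named in the legend. The sketch below shows one way such headers might be parsed and flagged. The function names `parse_header` and `is_phage_related` are hypothetical, and the field layout (contig, region, `start_end_strand`, accession, optional keyword, tool name) is an assumption inferred from the records shown here, not a documented format.

```python
# Keyword list copied from the legend of the prophage table.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split one protein header into named fields.

    Assumed layout (inferred from the records above, not a documented spec):
    >contig|region|start_end_strand|accession[|keyword]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, accession = fields[:4]
    start, end, strand = location.split("_")
    # Records with a keyword hit have six fields; the others have five.
    keyword = fields[4] if len(fields) == 6 else None
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "accession": accession,
        "keyword": keyword,
        "tool": fields[-1],
    }


def is_phage_related(record):
    """Return True when the record's keyword field is one of the legend keywords."""
    return record["keyword"] in PHAGE_KEYWORDS


if __name__ == "__main__":
    example = (
        ">NZ_AP021884|2044160:2053121|2050818_2052342_+|"
        "WP_147074832.1|transposase|DBSCAN-SWA"
    )
    record = parse_header(example)
    print(record["accession"], record["keyword"], is_phage_related(record))
```

Because the accession itself contains an underscore (`WP_147074832.1`), only the location field is split on `_`; the remaining fields are separated on `|` alone.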
| Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
|---|---|---|---|---|---|---|---|