Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP014620 | Salmonella enterica subsp. enterica serovar Anatum str. USDA-ARS-USMARC-1676 isolate SAN082 chromosome, complete genome | 2 crisprs | PD-DExK,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,DEDDh,DinG | 0 | 8 | 9 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP014620_1 | 972653-973840 | TypeI-E |
I-E
Consensus repeat of NZ_CP014620_1
|
19 spacers
spacers of NZ_CP014620_1
>1.1|972682|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT AACTGAAACCAGGCCAGGTGATATTTATCAAA >1.2|972743|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT CCGAGTGTGAGCAGGCTATTTATGATGAGCGC >1.3|972804|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT GTCATCGTTATACACGTGACGGTTTTAATAGT >1.4|972865|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT AAAATGAACAGCCACACATCCGCCAATAAAAA >1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT GCTGTCGGTCGCAGTGTGGATATTGCGATCAA >1.6|972987|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT GGGCTGAACGGCGATCTGATTACGTGGAGTAA >1.7|973048|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT TACGCCAGCTATAAGGGGTACACGAACAGCTT >1.8|973109|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT GCCCGAGAAAAGTTGCTTCTCTTTGCTGCTGC >1.9|973170|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT CATACCCTGTAGTTTCAATTTCCGCAGGTGGG >1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT AGCGCGGAATGATTTTTAACGCTGAGATGGTG >1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT TACCGCGACACCGTCAACGACAGCAACCACTT >1.12|973353|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT CAGGTCACTAAAATTTGTAGGGTTATCCACAG >1.13|973414|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT GCGCAATTGCAGTTTGACGCGGTGCTGTCATT >1.14|973475|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT TGTCTTAACTCCATTGCTGAGTCGATTGTGAA >1.15|973536|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT CACACAGAACGCCAGTTATAATCATCGGTGCT >1.16|973597|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT TCGTTTGTGGCGTCAGTAATACTATTATCGGT >1.17|973658|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT TTTTTAAATCCGGACAGACCCTGTAACGGATC >1.18|973719|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT ATCCGACTGTATGCCCAGCAGAACGAGGGCGC >1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT CACGAGTGGCAAATTGATTTCGACGAAAAACC |
cas3,cas8e,cse2gr11,cas7 |
CRISPR arrays and Neighbor proteins around NZ_CP014620_1
The CRISPR arrays of NZ_CP014620_1 >merge|NZ_CP014620|1|972653-973840|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGAACTGAAACCAGGCCAGGTGATATTTATCAAAGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAGTGTGAGCAGGCTATTTATGATGAGCGCGTGTTCCCCGCGCCAGCGGGGATAAACCGGTCATCGTTATACACGTGACGGTTTTAATAGTGTGTTCCCCGCGCCAGCGGGGATAAACCGAAAATGAACAGCCACACATCCGCCAATAAAAAGTGTTCCCCGCGCCAGCGGGGATAAACCGGCTGTCGGTCGCAGTGTGGATATTGCGATCAAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGGCTGAACGGCGATCTGATTACGTGGAGTAAGTGTTCCCCGCGCCAGCGGGGATAAACCGTACGCCAGCTATAAGGGGTACACGAACAGCTTGTGTTCCCCGCGCCAGCGGGGATAAACCGGCCCGAGAAAAGTTGCTTCTCTTTGCTGCTGCGTGTTCCCCGCGCCAGCGGGGATAAACCGCATACCCTGTAGTTTCAATTTCCGCAGGTGGGGTGTTCCCCGCGCCAGCGGGGATAAACCGAGCGCGGAATGATTTTTAACGCTGAGATGGTGGTGTTCCCCGCGCCAGCGGGGATAAACCGTACCGCGACACCGTCAACGACAGCAACCACTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCAGGTCACTAAAATTTGTAGGGTTATCCACAGGTGTTCCCCGCGCCAGCGGGGATAAACCGGCGCAATTGCAGTTTGACGCGGTGCTGTCATTGTGTTCCCCGCGCCAGCGGGGATAAACCGTGTCTTAACTCCATTGCTGAGTCGATTGTGAAGTGTTCCCCGCGCCAGCGGGGATAAACCGCACACAGAACGCCAGTTATAATCATCGGTGCTGTGTTCCCCGCGCCAGCGGGGATAAACCGTCGTTTGTGGCGTCAGTAATACTATTATCGGTGTGTTCCCCGCGCTAGCGGGGATAAACCGTTTTTAAATCCGGACAGACCCTGTAACGGATCGTGTTCCCCGCGCCAGCGGGGATAAACCGATCCGACTGTATGCCCAGCAGAACGAGGGCGCGTGTTCCCCGCGCTAGCGGGGATAAACCGCACGAGTGGCAAATTGATTTCGACGAAAAACCGTGTTCCCCGCGCCAACAAGGATAGCCGT >NZ_CP014620|1|1|972653-973779|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGAAACCAGGCCAGGTGATATTTATCAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAGTGTGAGCAGGCTATTTATGATGAGCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GTCATCGTTATACACGTGACGGTTTTAATAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAAATGAACAGCCACACATCCGCCAATAAAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCTGTCGGTCGCAGTGTGGATATTGCGATCAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCTGAACGGCGATCTGATTACGTGGAGTAA GTGTTCCCCGCGCCAGCGGGGATAAACCG TACGCCAGCTATAAGGGGTACACGAACAGCTT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCCCGAGAAAAGTTGCTTCTCTTTGCTGCTGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CATACCCTGTAGTTTCAATTTCCGCAGGTGGG GTGTTCCCCGCGCCAGCGGGGATAAACCG AGCGCGGAATGATTTTTAACGCTGAGATGGTG GTGTTCCCCGCGCCAGCGGGGATAAACCG 
TACCGCGACACCGTCAACGACAGCAACCACTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGTCACTAAAATTTGTAGGGTTATCCACAG GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGCAATTGCAGTTTGACGCGGTGCTGTCATT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGTCTTAACTCCATTGCTGAGTCGATTGTGAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CACACAGAACGCCAGTTATAATCATCGGTGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG TCGTTTGTGGCGTCAGTAATACTATTATCGGT GTGTTCCCCGCGCTAGCGGGGATAAACCG TTTTTAAATCCGGACAGACCCTGTAACGGATC GTGTTCCCCGCGCCAGCGGGGATAAACCG ATCCGACTGTATGCCCAGCAGAACGAGGGCGC GTGTTCCCCGCGCTAGCGGGGATAAACCG >NZ_CP014620|1|1|972653-973840|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGAAACCAGGCCAGGTGATATTTATCAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAGTGTGAGCAGGCTATTTATGATGAGCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GTCATCGTTATACACGTGACGGTTTTAATAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAAATGAACAGCCACACATCCGCCAATAAAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCTGTCGGTCGCAGTGTGGATATTGCGATCAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCTGAACGGCGATCTGATTACGTGGAGTAA GTGTTCCCCGCGCCAGCGGGGATAAACCG TACGCCAGCTATAAGGGGTACACGAACAGCTT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCCCGAGAAAAGTTGCTTCTCTTTGCTGCTGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CATACCCTGTAGTTTCAATTTCCGCAGGTGGG GTGTTCCCCGCGCCAGCGGGGATAAACCG AGCGCGGAATGATTTTTAACGCTGAGATGGTG GTGTTCCCCGCGCCAGCGGGGATAAACCG TACCGCGACACCGTCAACGACAGCAACCACTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGTCACTAAAATTTGTAGGGTTATCCACAG GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGCAATTGCAGTTTGACGCGGTGCTGTCATT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGTCTTAACTCCATTGCTGAGTCGATTGTGAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CACACAGAACGCCAGTTATAATCATCGGTGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG TCGTTTGTGGCGTCAGTAATACTATTATCGGT GTGTTCCCCGCGCTAGCGGGGATAAACCG TTTTTAAATCCGGACAGACCCTGTAACGGATC GTGTTCCCCGCGCCAGCGGGGATAAACCG ATCCGACTGTATGCCCAGCAGAACGAGGGCGC GTGTTCCCCGCGCTAGCGGGGATAAACCG CACGAGTGGCAAATTGATTTCGACGAAAAACC GTGTTCCCCGCGCCAACAAGGATAGCCGT >NZ_CP014620|1|1|972653-973840|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGAAACCAGGCCAGGTGATATTTATCAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAGTGTGAGCAGGCTATTTATGATGAGCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG 
GTCATCGTTATACACGTGACGGTTTTAATAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAAATGAACAGCCACACATCCGCCAATAAAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCTGTCGGTCGCAGTGTGGATATTGCGATCAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCTGAACGGCGATCTGATTACGTGGAGTAA GTGTTCCCCGCGCCAGCGGGGATAAACCG TACGCCAGCTATAAGGGGTACACGAACAGCTT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCCCGAGAAAAGTTGCTTCTCTTTGCTGCTGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CATACCCTGTAGTTTCAATTTCCGCAGGTGGG GTGTTCCCCGCGCCAGCGGGGATAAACCG AGCGCGGAATGATTTTTAACGCTGAGATGGTG GTGTTCCCCGCGCCAGCGGGGATAAACCG TACCGCGACACCGTCAACGACAGCAACCACTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGTCACTAAAATTTGTAGGGTTATCCACAG GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGCAATTGCAGTTTGACGCGGTGCTGTCATT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGTCTTAACTCCATTGCTGAGTCGATTGTGAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CACACAGAACGCCAGTTATAATCATCGGTGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG TCGTTTGTGGCGTCAGTAATACTATTATCGGT GTGTTCCCCGCGCTAGCGGGGATAAACCG TTTTTAAATCCGGACAGACCCTGTAACGGATC GTGTTCCCCGCGCCAGCGGGGATAAACCG ATCCGACTGTATGCCCAGCAGAACGAGGGCGC GTGTTCCCCGCGCTAGCGGGGATAAACCG CACGAGTGGCAAATTGATTTCGACGAAAAACC GTGTTCCCCGCGCCAACAAGGATAGCCGT
>NZ_CP014620.1|WP_001199961.1|971684_972356_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP014620.1|WP_000036734.1|970250_971549_+|phosphopyruvate-hydratase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP014620.1|WP_000210863.1|968530_970168_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >NZ_CP014620.1|WP_000210454.1|967502_968303_+|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >NZ_CP014620.1|WP_000859612.1|966441_966738_+|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MKTVKLTPKASEDLENIWHYGWQHFGEIQADRYINHLSEIFSIMSANNIGTPRPELGEYIYALPFERHIIYFIQSVTEVIVIRILSQNQDAGKHVNWL 
>NZ_CP014620.1|WP_000480218.1|966119_966476_+|type-II-toxin-antitoxin-system-ParD-family-antitoxin MFLTYISFPVYSYVRFILTVEAVMARTMTVDLGDELREFIESLIESGDYRTQSEVIRESLRLLREKQAESRLQALRELLAEGLNSGEPQAWEKDAFLRKVKTGMIKPDENGKINAKGQ >NZ_CP014620.1|WP_000226842.1|963801_966036_+|GTP-diphosphokinase MVAVRSAHINKAGEFDPKKWIASLGISSQQSCERLAETWAYCLQQTQGHPDADLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSIVTLIHGVRDMAAIRQLNATHNDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKEAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPAEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKNEGVQAEVYGRPKHIYSIWRKMQKKHLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAASGGVRSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQVEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNIQAGRQILDDELAHLGISLKEAEKHLLPRYNFNELEELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKTYAPQNRRKDDGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVEAVWGESYSAGYSLVVRVQANDRSGLLRDITTILANEKVNVLGVASRSDIKQQIATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGG >NZ_CP014620.1|WP_023243200.1|962454_963750_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRVTTRQIITVKVNDLDSFGQGVARHNGKALFIPGLLPEESAEVIITEDKKQFARARVSRRLNDSPERETPRCPHFGVCGGCQQQHVSIALQQRSKSAALARLMKHEVNDIIAGAPWGYRRRARLSLNCPPDKPLQMGFRKAGSSDIVNVEQCPVLAPQLAALLPRIRACLASLHGTRHLGHVELVQAGSGTLMILRHTAPLSAADKEKLECFSHSEGLSLFLAPFSEILETVSGEAPWYDSHGLRLAFSPRDFIQVNEAVNQQMVARALEWLDVRAEDRVLDLFCGMGNFTLPLATRAASVVGVEGVPALVEKGRENAIRNGLHNVTFFHENLEEDVTKQPWAKNGFDKVLLDPARAGATGVMRHIIKLKPIRIVYVSCNPATLARDSEALVNAGYEVTRLAMLDMFPHTGHLESMVLFERM >NZ_CP014620.1|WP_000186400.1|959640_962397_-|two-component-sensor-histidine-kinase-BarA 
MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSSEYGMNLQNRESIGQLISVLHRRHSDIVRAISVYDDHNRLFVTSNFHLDPSQMQLPAGAPFPRRLSVDRHGDIMILRTPIISESYSPDESAIADAKNTKNMLGYVALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGINSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLKTELNPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRNTLDEVVTLLAHSSHDKGLELTLNIKNDVPDNVIGDPLRLQQVITNLVGNAIKFTESGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHGGTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNVIIDGPSTACLAGKRLAYVEPNATAAQCTLDLLSDTPVEVVYSPTFSALPLAHYDIMILSVPVTFREPLTMQHERLAKAASMTDFLLLALPCHAQINAEKLKQGGAAACLLKPLTSTRLLPALTEYCQLNHHPEPLLMDTSKITMTVMAVDDNPANLKLIGALLEDKVQHVELCDSGHQAVDRAKQMQFDLILMDIQMPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLSAGMNDYLAKPIEEEKLHNLLLRYKPGANVAARLMAPEPAEFIFNPNATLDWQLALRQAAGKPDLARDMLQMLIDFLPEVRNKIEEQLVGENPNGLVDLVHKLHGSCGYSGVPRMKNLCQLIEQQLRSGVHEEELEPEFLELLDEMDNVAREAKKILG >NZ_CP014620.1|WP_000706479.1|958454_959597_+|glycerate-kinase MKIVIAPDSYKESLSALEVATAIEQGFREIWPDADYLKLPLADGGEGTVEAMVEATAGRIVHVEVTGPLGHRVNAFYGLSGDARSAFIEMAAASGLEQVPPAQRDPLKTTSWGTGELIRHALDAGVEHIIIGIGGSATNDGGAGMVQALGARLRDAQGNDIAQGGIGLETLASIDISGLDKRLSACHIEVACDVTNPLTGKEGASAVFGPQKGATPEMIERLDTALTRYAHLIARDLHVDVLDLAGGGAAGGMGAALYAFCGAQLRRGIEIVTDALHLEACLADADLVITGEGRIDSQTIHGKVPIGVANIAKRYNKPVIGIAGSLTADVSVVHEHGLDAVFSVIYTICTLEDALKNASENVRMTARNVAATLKAGQQLR >NZ_CP014620.1|WP_001208002.1|973937_974735_+|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKARPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRGPLPIGENFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >NZ_CP014620.1|WP_000108313.1|974824_975187_-|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE 
>NZ_CP014620.1|WP_023244652.1|975621_977421_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGAVAVTPVPERKMSGITLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLIATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVMVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAAEGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLNRIDLAWSRDQKEKIYVQDKLREQGAELWRWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >NZ_CP014620.1|WP_001290660.1|977420_979133_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAAQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDKWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVKPQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGSRIPRMYKENIAEPDILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >NZ_CP014620.1|WP_023243195.1|979208_979943_+|phosphoadenosine-phosphosulfate-reductase MSQLDLNALNELPKVDRVMALAETNAQLEKLSAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDEITDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP014620.1|WP_023244651.1|980155_981109_-|SPI-1-type-III-secretion-system-effector-SopD 
MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDLLTADEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHSYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP014620.1|WP_023243194.1|981552_984216_+|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYVLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPDSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMCNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDGVVTPYTSGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP014620.1|WP_023243193.1|984227_985784_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVNLADENVMDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGGCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKATQTATRLLSLLRGALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIQDLENGHKPDERLNKWQRELWLFTRRYFDDRVFTNPYESSDLKRIMTARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP014620.1|WP_000117946.1|985780_986341_+|type-I-E-CRISPR-associated-protein-Cse2/CasB 
MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDGDTDNEQD >NZ_CP014620.1|WP_023243192.1|986354_987413_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICINKDLLVKNLNDNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNAAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP014620_2 | 990154-990304 | TypeI-E |
I-E
Consensus repeat of NZ_CP014620_2
|
2 spacers
spacers of NZ_CP014620_2
>2.1|990183|32|NZ_CP014620|CRISPRCasFinder GAGCGGCTAAACGATGAATTAACCAGGGAGCG >2.2|990244|32|NZ_CP014620|CRISPRCasFinder TCGCACAACGCCTGGATATCCGCCCATCGGCC |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP014620_2
The CRISPR arrays of NZ_CP014620_2 >merge|NZ_CP014620|2|990154-990304|CRISPRCasFinder GTGTTCCCCGCGCTAGCGGGGATAAACCGGAGCGGCTAAACGATGAATTAACCAGGGAGCGGTGTTCCCCGCGCCAGCGGGGATAAACCGTCGCACAACGCCTGGATATCCGCCCATCGGCCGTGTTCCCCGCGTCAGCGGGGATAAACAC >NZ_CP014620|2|2|990154-990304|CRISPRCasFinder GTGTTCCCCGCGCTAGCGGGGATAAACCG GAGCGGCTAAACGATGAATTAACCAGGGAGCG GTGTTCCCCGCGCCAGCGGGGATAAACCG TCGCACAACGCCTGGATATCCGCCCATCGGCC GTGTTCCCCGCGTCAGCGGGGATAAACAC
>NZ_CP014620.1|WP_001518648.1|989763_990057_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >NZ_CP014620.1|WP_023244650.1|988798_989764_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >NZ_CP014620.1|WP_000281483.1|988151_988802_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQEQPAASTIFDVQTRPFAPMLSAGQTLRFNLRANPTICKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >NZ_CP014620.1|WP_000085115.1|987423_988170_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGNAADVLREAYRWYQDQFNALKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >NZ_CP014620.1|WP_023243192.1|986354_987413_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICINKDLLVKNLNDNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNAAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP014620.1|WP_000117946.1|985780_986341_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDGDTDNEQD 
>NZ_CP014620.1|WP_023243193.1|984227_985784_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVNLADENVMDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGGCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKATQTATRLLSLLRGALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIQDLENGHKPDERLNKWQRELWLFTRRYFDDRVFTNPYESSDLKRIMTARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP014620.1|WP_023243194.1|981552_984216_+|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYVLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPDSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMCNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDGVVTPYTSGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP014620.1|WP_023244651.1|980155_981109_-|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDLLTADEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHSYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT 
>NZ_CP014620.1|WP_023243195.1|979208_979943_+|phosphoadenosine-phosphosulfate-reductase MSQLDLNALNELPKVDRVMALAETNAQLEKLSAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDEITDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP014620.1|WP_000490481.1|990318_991365_-|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >NZ_CP014620.1|WP_000372384.1|991615_992524_+|sulfate-adenylyltransferase-subunit-CysD MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >NZ_CP014620.1|WP_001092255.1|992533_993973_+|sulfate-adenylyltransferase-subunit-CysN MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYREETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPMEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >NZ_CP014620.1|WP_001173664.1|993959_994565_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARSGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS 
>NZ_CP014620.1|WP_001118109.1|994582_994939_+|DUF3561-family-protein MPGMVKVTGFNMRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >NZ_CP014620.1|WP_000517480.1|995129_995441_+|cell-division-protein-FtsB MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >NZ_CP014620.1|WP_023244649.1|995459_996170_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLAISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLYDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIQQEKA >NZ_CP014620.1|WP_001219245.1|996169_996649_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDEVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >NZ_CP014620.1|WP_000134246.1|996645_997695_+|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >NZ_CP014620.1|WP_001221538.1|997675_998437_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | CP053320 | Salmonella enterica subsp. arizonae serovar 41:z4,z23:- strain 2016K-0011 plasmid unnamed, complete sequence | 27359-27390 | 0 | 1.0 |
NZ_CP014620_2 | 2.2|990244|32|NZ_CP014620|CRISPRCasFinder | 990244-990275 | 32 | NZ_CP044185 | Salmonella enterica subsp. enterica strain AR-0403 plasmid pAR-0403 | 35345-35376 | 0 | 1.0 |
NZ_CP014620_2 | 2.2|990244|32|NZ_CP014620|CRISPRCasFinder | 990244-990275 | 32 | NZ_CP029990 | Salmonella enterica subsp. diarizonae serovar 48:i:z strain SA20121591 plasmid pSA20121591.1, complete sequence | 95164-95195 | 1 | 0.969 |
NZ_CP014620_2 | 2.2|990244|32|NZ_CP014620|CRISPRCasFinder | 990244-990275 | 32 | NZ_CP054718 | Salmonella enterica strain 85-0120 plasmid unnamed2, complete sequence | 9403-9434 | 1 | 0.969 |
NZ_CP014620_2 | 2.2|990244|32|NZ_CP014620|CRISPRCasFinder | 990244-990275 | 32 | NZ_CP054718 | Salmonella enterica strain 85-0120 plasmid unnamed2, complete sequence | 39737-39768 | 1 | 0.969 |
NZ_CP014620_2 | 2.2|990244|32|NZ_CP014620|CRISPRCasFinder | 990244-990275 | 32 | NZ_CP054718 | Salmonella enterica strain 85-0120 plasmid unnamed2, complete sequence | 113376-113407 | 1 | 0.969 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | CP054337 | Escherichia coli strain SCU-120 plasmid pSCU-120-2, complete sequence | 47898-47929 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | CP042641 | Escherichia coli strain NCYU-24-74 plasmid pNCYU-24-74-3, complete sequence | 30480-30511 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP034821 | Salmonella sp. SSDFZ54 plasmid pTB502, complete sequence | 8985-9016 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP030188 | Salmonella enterica strain SA20094620 plasmid pSA20094620.3, complete sequence | 71040-71071 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP023732 | Escherichia coli strain FORC 064 plasmid pFORC64.1, complete sequence | 48005-48036 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP017632 | Escherichia coli strain SLK172 plasmid pSLK172-1, complete sequence | 116267-116298 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP039862 | Salmonella enterica subsp. enterica serovar 1,4,[5],12:i:- strain PNCS014880 plasmid p16-6773.2, complete sequence | 27391-27422 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP033632 | Escherichia coli isolate ECCNB12-2 plasmid pTB-nb1, complete sequence | 71244-71275 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP041922 | Escherichia coli strain Ec40743 plasmid unnamed3, complete sequence | 64019-64050 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | CP027320 | Escherichia coli strain 2014C-3084 plasmid unnamed1 | 33729-33760 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | MN510445 | Escherichia coli strain JIE250 plasmid pJIE250_3, complete sequence | 64871-64902 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | MN510447 | Escherichia coli strain TZ20_1P plasmid pTZ20_1P, complete sequence | 54320-54351 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NC_050152 | Enterobacteria phage P7, complete genome | 86888-86919 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NC_042128 | Escherichia phage RCS47, complete genome | 91937-91968 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NC_031129 | Salmonella phage SJ46, complete genome | 84791-84822 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | MK448230 | Klebsiella phage ST16-OXA48phi5.2, complete genome | 5645-5676 | 3 | 0.906 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP042632 | Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-5, complete sequence | 40633-40664 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP042620 | Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-5, complete sequence | 54231-54262 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_AP018804 | Escherichia coli strain E2863 plasmid pE2863-2, complete sequence | 19433-19464 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP021720 | Escherichia coli strain AR_0128 plasmid tig00000793, complete sequence | 110385-110416 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP021537 | Escherichia coli strain AR_0119 plasmid unitig_3, complete sequence | 23452-23483 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP027309 | Escherichia coli strain 2015C-3108 plasmid unnamed2, complete sequence | 32859-32890 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | CP050999 | Escherichia coli O39:NM str. F8704-2 plasmid pF8704-2_2 | 40868-40899 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP047663 | Escherichia coli strain LD93-1 plasmid pLD93-1-90kb, complete sequence | 62453-62484 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP029365 | Escherichia coli strain WCHEC035148 plasmid p1_035148, complete sequence | 65901-65932 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | NZ_CP020051 | Escherichia coli strain AR_0118 plasmid unitig_3, complete sequence | 61123-61154 | 4 | 0.875 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | KY271396 | Klebsiella phage 2 LV-2017, complete genome | 41444-41475 | 4 | 0.875 |
NZ_CP014620_1 | 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 972926-972957 | 32 | NZ_CP034838 | Rahnella aquatilis strain KM12 plasmid pKM12v1, complete sequence | 103026-103057 | 6 | 0.812 |
NZ_CP014620_1 | 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 972926-972957 | 32 | NZ_CP034839 | Rahnella aquatilis strain KM25 plasmid pKM12v2, complete sequence | 103026-103057 | 6 | 0.812 |
NZ_CP014620_1 | 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973292-973323 | 32 | NZ_CP018865 | Arthrobacter crystallopoietes strain DSM 20117 plasmid pLDW-10, complete sequence | 205587-205618 | 6 | 0.812 |
NZ_CP014620_1 | 1.14|973475|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973475-973506 | 32 | NC_015727 | Cupriavidus necator N-1 plasmid pBB1, complete sequence | 1236642-1236673 | 6 | 0.812 |
NZ_CP014620_1 | 1.14|973475|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973475-973506 | 32 | NC_015727 | Cupriavidus necator N-1 plasmid pBB1, complete sequence | 1370300-1370331 | 6 | 0.812 |
NZ_CP014620_1 | 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 972926-972957 | 32 | NZ_CP053542 | Vibrio europaeus strain NPI-1 plasmid pVEu, complete sequence | 229056-229087 | 7 | 0.781 |
NZ_CP014620_1 | 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 972926-972957 | 32 | NZ_CP009356 | Vibrio tubiashii ATCC 19109 plasmid p251, complete sequence | 46871-46902 | 7 | 0.781 |
NZ_CP014620_1 | 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973231-973262 | 32 | MK575466 | Vibrio phage Rostov 7, complete genome | 14119-14150 | 7 | 0.781 |
NZ_CP014620_2 | 2.2|990244|32|NZ_CP014620|CRISPRCasFinder | 990244-990275 | 32 | NZ_CP040720 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed1, complete sequence | 270963-270994 | 7 | 0.781 |
NZ_CP014620_1 | 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973292-973323 | 32 | NZ_CP015738 | Shinella sp. HZN7 plasmid pShin-02, complete sequence | 176307-176338 | 8 | 0.75 |
NZ_CP014620_1 | 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973292-973323 | 32 | NZ_CP010410 | Xanthomonas sacchari strain R1 plasmid unnamed, complete sequence | 332281-332312 | 8 | 0.75 |
NZ_CP014620_1 | 1.17|973658|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973658-973689 | 32 | NZ_CP025224 | Enterococcus sp. CR-Ec1 plasmid pCREc1, complete sequence | 62016-62047 | 8 | 0.75 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NC_014310 | Ralstonia solanacearum PSI07 plasmid mpPSI07, complete sequence | 1655546-1655577 | 8 | 0.75 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022760 | Ralstonia solanacearum strain T98 plasmid unnamed, complete sequence | 1565948-1565979 | 8 | 0.75 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022789 | Ralstonia solanacearum strain SL3175 plasmid unnamed, complete sequence | 1565949-1565980 | 8 | 0.75 |
NZ_CP014620_1 | 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 972926-972957 | 32 | NZ_CP013634 | Rhizobium sp. N324 plasmid pRspN324d, complete sequence | 374514-374545 | 9 | 0.719 |
NZ_CP014620_1 | 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973292-973323 | 32 | AP018399 | Xanthomonas phage XacN1 DNA, complete genome | 89668-89699 | 9 | 0.719 |
NZ_CP014620_1 | 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973292-973323 | 32 | NC_030917 | Gordonia phage OneUp, complete genome | 3597-3628 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022775 | Ralstonia solanacearum strain T12 plasmid unnamed, complete sequence | 1578348-1578379 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022762 | Ralstonia solanacearum strain T95 plasmid unnamed, complete sequence | 1496565-1496596 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP023017 | Ralstonia solanacearum strain SL3022 plasmid unnamed, complete sequence | 1568471-1568502 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP014703 | Ralstonia solanacearum strain KACC 10722 plasmid, complete sequence | 1495587-1495618 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022771 | Ralstonia solanacearum strain T51 plasmid unnamed, complete sequence | 1496557-1496588 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022777 | Ralstonia solanacearum strain T11 plasmid unnamed, complete sequence | 1495910-1495941 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022799 | Ralstonia solanacearum strain SL2064 plasmid unnamed, complete sequence | 1496547-1496578 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022764 | Ralstonia solanacearum strain T82 plasmid unnamed, complete sequence | 1578515-1578546 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022797 | Ralstonia solanacearum strain SL2312 plasmid unnamed, complete sequence | 1578498-1578529 | 9 | 0.719 |
NZ_CP014620_1 | 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT | 973780-973811 | 32 | NZ_CP022758 | Ralstonia solanacearum strain T101 plasmid unnamed, complete sequence | 1578478-1578509 | 9 | 0.719 |
NZ_CP014620_1 | 1.8|973109|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973109-973140 | 32 | NZ_CP020413 | Leptospira interrogans serovar Copenhageni strain FDAARGOS_203 plasmid unnamed1, complete sequence | 252721-252752 | 10 | 0.688 |
NZ_CP014620_1 | 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT | 973292-973323 | 32 | NZ_CP018784 | Curtobacterium pusillum strain AA3 plasmid pCPAA3, complete sequence | 161882-161913 | 10 | 0.688 |
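
The spacer identifiers in the table above pack several values into one pipe-separated string, and the listed spacer position is consistent with `start + length - 1` (for example, 973231 + 32 - 1 = 973262). The following is a minimal Python sketch of how such an identifier could be unpacked. The field names (`array_index`, `spacer_index`, `tools`, and so on) and the helper `parse_spacer_id` are illustrative assumptions, not names defined by the source database.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SpacerId:
    """Fields of a spacer identifier; the names are assumed for illustration."""
    array_index: int         # "1" in "1.10": CRISPR array number on the contig
    spacer_index: int        # "10" in "1.10": spacer number within the array
    start: int               # 1-based start coordinate on the contig
    length: int              # spacer length in nucleotides
    contig: str              # contig accession, e.g. NZ_CP014620
    tools: Tuple[str, ...]   # programs listed as reporting the spacer

    @property
    def end(self) -> int:
        # Inclusive end coordinate, matching the "start-end" column in the table.
        return self.start + self.length - 1


def parse_spacer_id(text: str) -> SpacerId:
    """Split an identifier such as '1.10|973231|32|NZ_CP014620|PILER-CR,CRT'."""
    label, start, length, contig, tools = text.strip().split("|")
    array_index, spacer_index = label.split(".")
    return SpacerId(
        array_index=int(array_index),
        spacer_index=int(spacer_index),
        start=int(start),
        length=int(length),
        contig=contig,
        tools=tuple(tools.split(",")),
    )


sid = parse_spacer_id("1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT")
print(f"{sid.start}-{sid.end}")  # 973231-973262
```

Because the identifier itself contains `|`, a full table row is easier to handle by splitting on the padded delimiter `" | "` first and parsing the identifier cell separately.
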
1. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to CP053320 (Salmonella enterica subsp. arizonae serovar 41:z4,z23:- strain 2016K-0011 plasmid unnamed, complete sequence) position: , mismatch: 0, identity: 1.0
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggaatgatttttaacgctgagatggtg Protospacer ********************************
2. spacer 2.2|990244|32|NZ_CP014620|CRISPRCasFinder matches to NZ_CP044185 (Salmonella enterica subsp. enterica strain AR-0403 plasmid pAR-0403) position: , mismatch: 0, identity: 1.0
tcgcacaacgcctggatatccgcccatcggcc CRISPR spacer tcgcacaacgcctggatatccgcccatcggcc Protospacer ********************************
3. spacer 2.2|990244|32|NZ_CP014620|CRISPRCasFinder matches to NZ_CP029990 (Salmonella enterica subsp. diarizonae serovar 48:i:z strain SA20121591 plasmid pSA20121591.1, complete sequence) position: , mismatch: 1, identity: 0.969
tcgcacaacgcctggatatccgcccatcggcc CRISPR spacer tcgcacaacgcctggatattcgcccatcggcc Protospacer *******************.************
4. spacer 2.2|990244|32|NZ_CP014620|CRISPRCasFinder matches to NZ_CP054718 (Salmonella enterica strain 85-0120 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.969
tcgcacaacgcctggatatccgcccatcggcc CRISPR spacer tcgcacaacgcctggatattcgcccatcggcc Protospacer *******************.************
5. spacer 2.2|990244|32|NZ_CP014620|CRISPRCasFinder matches to NZ_CP054718 (Salmonella enterica strain 85-0120 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.969
tcgcacaacgcctggatatccgcccatcggcc CRISPR spacer tcgcacaacgcctggatattcgcccatcggcc Protospacer *******************.************
6. spacer 2.2|990244|32|NZ_CP014620|CRISPRCasFinder matches to NZ_CP054718 (Salmonella enterica strain 85-0120 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.969
tcgcacaacgcctggatatccgcccatcggcc CRISPR spacer tcgcacaacgcctggatattcgcccatcggcc Protospacer *******************.************
7. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to CP054337 (Escherichia coli strain SCU-120 plasmid pSCU-120-2, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
8. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to CP042641 (Escherichia coli strain NCYU-24-74 plasmid pNCYU-24-74-3, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
9. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034821 (Salmonella sp. SSDFZ54 plasmid pTB502, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
10. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030188 (Salmonella enterica strain SA20094620 plasmid pSA20094620.3, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
11. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023732 (Escherichia coli strain FORC 064 plasmid pFORC64.1, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
12. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017632 (Escherichia coli strain SLK172 plasmid pSLK172-1, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
13. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP039862 (Salmonella enterica subsp. enterica serovar 1,4,[5],12:i:- strain PNCS014880 plasmid p16-6773.2, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
14. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033632 (Escherichia coli isolate ECCNB12-2 plasmid pTB-nb1, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
15. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP041922 (Escherichia coli strain Ec40743 plasmid unnamed3, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
16. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to CP027320 (Escherichia coli strain 2014C-3084 plasmid unnamed1) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
17. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to MN510445 (Escherichia coli strain JIE250 plasmid pJIE250_3, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
18. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to MN510447 (Escherichia coli strain TZ20_1P plasmid pTZ20_1P, complete sequence) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
19. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NC_050152 (Enterobacteria phage P7, complete genome) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
20. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NC_042128 (Escherichia phage RCS47, complete genome) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
21. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NC_031129 (Salmonella phage SJ46, complete genome) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer agcgcggcatgatttttaacgatgagatggtc Protospacer ******* ************* *********
22. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to MK448230 (Klebsiella phage ST16-OXA48phi5.2, complete genome) position: , mismatch: 3, identity: 0.906
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcggaatgatttttaacgccgatatggtg Protospacer *.********************.** ******
23. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP042632 (Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-5, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
24. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP042620 (Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-5, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
25. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP018804 (Escherichia coli strain E2863 plasmid pE2863-2, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
26. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021720 (Escherichia coli strain AR_0128 plasmid tig00000793, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
27. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021537 (Escherichia coli strain AR_0119 plasmid unitig_3, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
28. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027309 (Escherichia coli strain 2015C-3108 plasmid unnamed2, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
29. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to CP050999 (Escherichia coli O39:NM str. F8704-2 plasmid pF8704-2_2) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
30. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP047663 (Escherichia coli strain LD93-1 plasmid pLD93-1-90kb, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
31. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029365 (Escherichia coli strain WCHEC035148 plasmid p1_035148, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
32. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020051 (Escherichia coli strain AR_0118 plasmid unitig_3, complete sequence) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacgcgggatgatttttaacgatgagatggtc Protospacer *.*****.************* *********
33. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to KY271396 (Klebsiella phage 2 LV-2017, complete genome) position: , mismatch: 4, identity: 0.875
agcgcggaatgatttttaacgctgagatggtg CRISPR spacer aacacggaatgatttttaacggggagatggtg Protospacer *.*.***************** *********
34. spacer 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034838 (Rahnella aquatilis strain KM12 plasmid pKM12v1, complete sequence) position: , mismatch: 6, identity: 0.812
gctgtcggtcgcagtgtggatattgcgatcaa CRISPR spacer gcaacgggtcgcagcgtggatatcgcgatcaa Protospacer ** .. ********.********.********
35. spacer 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034839 (Rahnella aquatilis strain KM25 plasmid pKM12v2, complete sequence) position: , mismatch: 6, identity: 0.812
gctgtcggtcgcagtgtggatattgcgatcaa CRISPR spacer gcaacgggtcgcagcgtggatatcgcgatcaa Protospacer ** .. ********.********.********
36. spacer 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP018865 (Arthrobacter crystallopoietes strain DSM 20117 plasmid pLDW-10, complete sequence) position: , mismatch: 6, identity: 0.812
taccgcgacaccgtcaacgacagcaaccactt CRISPR spacer aaccgcgtcaccggcaacgacagcaaccggct Protospacer ****** ***** **************. .*
37. spacer 1.14|973475|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NC_015727 (Cupriavidus necator N-1 plasmid pBB1, complete sequence) position: , mismatch: 6, identity: 0.812
tgtcttaactccattgctgagtcga-ttgtgaa CRISPR spacer tgctttaactccatttttgagtcgatttgtgc- Protospacer **..*********** .******** *****
38. spacer 1.14|973475|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NC_015727 (Cupriavidus necator N-1 plasmid pBB1, complete sequence) position: , mismatch: 6, identity: 0.812
tgtcttaactccattgctgagtcga-ttgtgaa CRISPR spacer tgctttaactccatttttgagtcgatttgtgc- Protospacer **..*********** .******** *****
39. spacer 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053542 (Vibrio europaeus strain NPI-1 plasmid pVEu, complete sequence) position: , mismatch: 7, identity: 0.781
gctgtcggtcgcagtgtggatattgcgatcaa CRISPR spacer ggtgtcggtagcagtgtggattttggtaccat Protospacer * ******* *********** *** *.**
40. spacer 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009356 (Vibrio tubiashii ATCC 19109 plasmid p251, complete sequence) position: , mismatch: 7, identity: 0.781
gctgtcggtcgcagtgtggatattgcgatcaa CRISPR spacer ggtgtcggtagcagtgtggattttggtaccat Protospacer * ******* *********** *** *.**
41. spacer 1.10|973231|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to MK575466 (Vibrio phage Rostov 7, complete genome) position: , mismatch: 7, identity: 0.781
-agcgcggaatgatttttaacgctgagatggtg CRISPR spacer tagttc-caatgatttttaacactgacatggtt Protospacer **. * *************.**** *****
42. spacer 2.2|990244|32|NZ_CP014620|CRISPRCasFinder matches to NZ_CP040720 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.781
tcgcacaacgcctggatatccgcccatcggcc CRISPR spacer tcgagaaacgcctggatctccgcccaccgccg Protospacer *** . *********** ********.** *
43. spacer 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015738 (Shinella sp. HZN7 plasmid pShin-02, complete sequence) position: , mismatch: 8, identity: 0.75
taccgcgacaccgtcaacgacagcaaccactt CRISPR spacer cgccgcgacaccgtcaacgacatcatcctgcc Protospacer ..******************** ** ** ..
44. spacer 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010410 (Xanthomonas sacchari strain R1 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
taccgcgacaccgtcaacgacagcaaccactt-- CRISPR spacer gacagcgacaccgtcatcgacagca--cggtgaa Protospacer ** ************ ******** *. *
45. spacer 1.17|973658|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP025224 (Enterococcus sp. CR-Ec1 plasmid pCREc1, complete sequence) position: , mismatch: 8, identity: 0.75
tttttaaatccggacagaccctgtaacggatc CRISPR spacer tttttaaatccggaaagacactgtcaaaaaag Protospacer ************** **** **** * ..*
46. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NC_014310 (Ralstonia solanacearum PSI07 plasmid mpPSI07, complete sequence) position: , mismatch: 8, identity: 0.75
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer ccagagcggcaaatggatttcgacgacctacg Protospacer * ***.******* *********** **
47. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022760 (Ralstonia solanacearum strain T98 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer ccagagcggcaaatggatttcgacgacctacg Protospacer * ***.******* *********** **
48. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022789 (Ralstonia solanacearum strain SL3175 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer ccagagcggcaaatggatttcgacgacctacg Protospacer * ***.******* *********** **
49. spacer 1.5|972926|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013634 (Rhizobium sp. N324 plasmid pRspN324d, complete sequence) position: , mismatch: 9, identity: 0.719
gctgtcggtcgcagtgtggatattgcgatcaa CRISPR spacer gctgtcggtcgcggtgtggacatggtcactct Protospacer ************.*******.** *. *..
50. spacer 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to AP018399 (Xanthomonas phage XacN1 DNA, complete genome) position: , mismatch: 9, identity: 0.719
taccgcgacaccgtcaacgacagcaaccactt CRISPR spacer acttgcgacaccgacaccgacagcaacccgta Protospacer ..********* ** *********** *
51. spacer 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NC_030917 (Gordonia phage OneUp, complete genome) position: , mismatch: 9, identity: 0.719
taccgcgacaccgtcaacgacagcaaccactt CRISPR spacer gtgagctacaccgtcaacgacatcaacgagta Protospacer ** *************** **** * *
52. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022775 (Ralstonia solanacearum strain T12 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
53. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022762 (Ralstonia solanacearum strain T95 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
54. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP023017 (Ralstonia solanacearum strain SL3022 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
55. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP014703 (Ralstonia solanacearum strain KACC 10722 plasmid, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
56. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022771 (Ralstonia solanacearum strain T51 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
57. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022777 (Ralstonia solanacearum strain T11 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
58. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022799 (Ralstonia solanacearum strain SL2064 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
59. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022764 (Ralstonia solanacearum strain T82 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
60. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022797 (Ralstonia solanacearum strain SL2312 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
61. spacer 1.19|973780|32|NZ_CP014620|CRISPRCasFinder,CRT matches to NZ_CP022758 (Ralstonia solanacearum strain T101 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cacgagtggcaaattgatttcgacgaaaaacc CRISPR spacer tcagagcggcaaatggatttcgacgacctacg Protospacer . ***.******* *********** **
62. spacer 1.8|973109|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020413 (Leptospira interrogans serovar Copenhageni strain FDAARGOS_203 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
gcccgagaaaagttgcttctctttgctgctgc CRISPR spacer gcccgagaaaattttcttctctttagaatcct Protospacer *********** ** *********. ... .
63. spacer 1.11|973292|32|NZ_CP014620|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP018784 (Curtobacterium pusillum strain AA3 plasmid pCPAA3, complete sequence) position: , mismatch: 10, identity: 0.688
taccgcgacaccgtcaacgacagcaaccactt CRISPR spacer agccgcgccaccggcaacgacagcacgaagac Protospacer .***** ***** *********** * .
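
The mismatch and identity values reported above are consistent with a simple per-position comparison of the 32-nt spacer and protospacer, with identity equal to `1 - mismatches / length` rounded to three decimals (3 mismatches give 29/32 ≈ 0.906). The sketch below illustrates that arithmetic in Python. It is an assumption about how the values could be derived, not the database's documented method, and the function names are hypothetical. It covers ungapped pairs of equal length only; alignments that contain `-` gap characters are not handled.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count positions where two equal-length, ungapped sequences differ."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str) -> float:
    """Fraction of matching positions, rounded to three decimals."""
    mismatches = count_mismatches(spacer, protospacer)
    return round(1 - mismatches / len(spacer), 3)


# Spacer 1.10 against the protospacer listed for CP054337 (entry 7).
spacer_1_10 = "agcgcggaatgatttttaacgctgagatggtg"
proto_cp054337 = "agcgcggcatgatttttaacgatgagatggtc"
print(count_mismatches(spacer_1_10, proto_cp054337))  # 3
print(identity(spacer_1_10, proto_cp054337))          # 0.906

# Spacer 2.2 against the protospacer listed for NZ_CP029990 (entry 3).
spacer_2_2 = "tcgcacaacgcctggatatccgcccatcggcc"
proto_cp029990 = "tcgcacaacgcctggatattcgcccatcggcc"
print(count_mismatches(spacer_2_2, proto_cp029990))   # 1
print(identity(spacer_2_2, proto_cp029990))           # 0.969
```
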
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1156229 : 1162033
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP014620|1156229:1162033|DBSCAN-SWA GTTATTTCGCTATGGGGTCATCGCATTTTGGTAGCCAGTCGGCGTTGCTTTCCTCTCTGAGTGTGAGATTGGTCTGTATGCCCTGATTTTTACGGCGCTTTTCATAACTCAGCCCGTACTCTTTCAGCATGGCTGGCAGTCCTTTACCGAACATGGTAAGGCTGAGTGTATTCCTGTAGCCGTGGGCTTCCATGTACGCCAGATAGGCATGATACAAATACAGACGCGGCTGACGCGGGATGATGTTAGCATTGCCAATATACATACCGTCAGGATCCGGCAGTGCCTCCAGATAGCCACAAAAATCAAATGCCGGGTCGGCGTCGCGCTTGATGCTGAGCGCCTCGTCGGAATTCTGCTGTGACTGGAGCAGGGTGCGGGCGGTCATCGGGTCGCTGAACTTCTGCATTAGCTGGCGCACAATCACGGCCAGCTCGCGGGCGATTTTGTTCTTGAGCTGCGGGTCGCGTTCCTCCGGGGCAATCTGTTCCGGGAAATGCAGGATCACCCGGCGCCGTGAGACGCCGCCGCTGCGGTCAGTAAAGCGCATCGGGTTATTGTTCACGGCCAGAATCACCGCCGGAATATGGGTGGAGTATGCATCCTTGTATTTCGGGTCTACCGAGACCGCATCCCCGCCGGTGATGGCCTTGAGTCCTGCCCCGTCACCGCTCCATTTTTCCTGGTCAGGCAGACGAATCAGCGAGAAGCCAATCAGCGCAGCACGTTCACGTGGTGATTCCAGCGTTTCGATGGTAGCCGACGTGGCGTTATCTTCCCCGGCAAGCATGGTCGCAATTTCGGCCAGAATACTTTTTCCGCTCCCGCCGGGGCCGGTGACTTCGAGAAAGAGCTGCCAGTCGTAACGGTTCGCCAGAACCATAAACAGCGCGGCCAGAATCACATCGCGTTTTTCCGGTTTGCCACCGGCAGCGCGGTCAAGCCAGCGCCAGAAATGAGGGGCGTGGGTTTCCAGCGTTTCGCCCTCCACCGGCGGGGTGAAATCAACATCACATAGCGTGCGCAGCCAGTGTGATTTATGGTGCGGGCTGAATGTGCCGGTGGCGGTATCGAGTACGCCGTTGCGAAAGCCAATCAGACGGCGCGCAGGGGCGTCCTGCTGCGGAATAATCAGTTTCAGGGTCTCCACCACTGAGGCAATTTTCCCCGACGAGAACGGGGCGCGCAGACGCTGAAACAACCCGGCCACGTCGCGGGCAAAATCCGACGGGGGAATGATTTTCCATATTCCGGCCTCATATCGGGACAGGAGCTGGCCGTTCGCATCCACGGCCAGCGCTTCGCCGTAATGTTCATGCACCCGCATTGCCTTTTCACTGGTGCTCATGGCGGTAAATTCCGCTTCGCTCATGGTAGTGAAAGGGCTGTCAGCCGGAGGCCGGATGGCGTCATAAATCGCTTTCCGCGTGGCCTCCTCGCCTTTCTGCATAAACGCATCATTCCAGTCACCGAACACCGGCGGCAGGGCGACAATGCCCTCGCAGGCGTCTGCGGCCGCAGCGGCTTTACTCTGGCCGTTGCCGTTAAGGTCACGGTCGGCGGCGAGGACAATCTGACAGGCCGGGTGTTTCTGACGGGCAAGGCTCGCCAGAGAAAGAAGGTTCACGGACGACAGTGCCACCATGACGGTTTCCCCGGTCAGGTGATGCACGGTGAGCGCGGTCGCATAGCCCTCCGCAATCCACAGGCGTTTTCCTGCCTGTTTTTTCCCTTCGATGACATGACATGCCCCTTTAACCTGACCGCCCTTCAGGGTGCGTTTGAGACCCTCAGAATTGATGAGCTGAAGGTTTACCAGCGCGCCGGTATTGTCATACAGCGGGACAACCACATCCCCGGCGCGGAACGTCACGCCGCCGGTTTTATGCACAGCCGTGAGCGTCAGACATTCCAGCGCTGGGA
AACCCTTGCGGGTGAGGTAGGCGTTGCCGGTGGCCGGTCGGGTTTTATCCATAAGCCTGACGGCCAGCGCGGCCGCCGCTTTGCGGTCAGCCTCCGTTTCTGCTTCTGCGGCCGCAATCACTTCCGGGGCAACCGGCGGCAGATTGCCGGTCACGGCGTTCACCTTCCCGGCGGCCTCAGAGGCTGATACACCGAACACCTTCTCGACCAGTTTCAGTCCGTCACCCGCGCCGCACTGGTTACAGAACCACGTGCCGCGCCCCTCTTTATCGTCAAAGCGAAAGCGGTCAGAGCCACCGCACACCGGGCAGGACTGATGGCGGTTTTTAATCACCTTCACACCCAGCGCAGGGAGAATGTGCGGCCAGTGGCCGCACGCCTGTTTTACCGTTTCTGTTACGTTCATTTTCATGGTTATTTTCTCCCTCAGTGCAGTACCGGTGCGGTGATATGACGGGCGCAGAGTTCATCCATTACGGCCAGCCCGAGAAAGGACAGCGACGGCGCGGCCTTGAGTGGTCCGGCTTCCATTAAATCCTCCAGCAGTGCACAGGCAATCTGGCGGCCTTTTTCCTCGCCGTGCTGGCGCAGGTAGAATCCCTCCAGCTCGGCGGCAATGGCGCTTTCCAGTGTGTCGAGGGTGAGTTGCGGGTAGCGGTGCTGACGTTCGCACAGGGTCAGCCAGGCACAGGCCACGGCGCGACGATACAGCGCGGCGCGTAATACGGGCGGTAATGGCTTTTTCATACGTTGCCCTCCCCGGTCAGCCACTGCTGATTGCAGCGTTCGACCACACCGTCGAGCTGGGCGGTCATGAGGTAAATCACGGAGGTGAGCTGTAACTGCTGCTCAGGGTCACGACGAACGGTGGCGCAGTCCTGCACCTGCATCAGATCGCCGACGAGCTGGCCGACATTGCGCATATGCTCCAGACATTCGAGGTCACGGGCGGTAATGGTGGTGTGTCTCATGCTCGCACCTCCGCAACCGGCAGACGGCCAGCGAATGAGAGGACGTAATCGCGAATGAGGGAAAGGCGTGCGGTGTGCTCATCACCGGCAACGGTGCGAAGCATACAGATACGGGGTTTACGGTCTGCGCGACGGACGGCGGCAAACACAAAGACAAACTGCGGGTGTGACGGGGTGAGGGTCGTAGCCATAGGGGCAACCTCCTTGAAGTAGCGGTAAATGCCACCACCGGAGTTCCTACGCTCATGGGTGGTGACCCGAACGGGGGTAGGAATACCGGCCTTCAAGGAAACCGGCCAGCCCGAAGGCTGCCCCGCCCGGACCACCATTATCTGACAGGGGCTAAGGTATAAGCACCACAGCCCGAAAAATGGGGGTGCCTGAGCAACGACATAAAAAAAGACGCATGGCGCGTCTGGTGTCGCCTTGAAGTAACTCGGGTTCCTACGCCCGGCTGCCGATTTTGCGACAGCGGGAAAACTATACATGGAAACGATGAAAAGAAGCAAGCCAGAAAAAGGGGCTGTTTGCTGGACGGTCATCATCATGCGTCATAACCCCGGTTGCGTTCGGCGATGCGATCCGCCATCCATGCGGTGATTTCAGACTGCGCCCACGCCACGTTTTTACCACCGAGGGAGATTTGTTTCGGGAAGGCTTCCCGGCTGATGAGGTCGTAAATGGTCGAGCGGGACAGGCCGCATAAATGCATCACTTCGGGCAGACGGATAAAGCGCTCGTGAACGGTATCAGAAACCTGCATCAACGGCGCGGCAGGGGCGGAAGACGGGGAAGAAAAAGCGGTGTGCATCGGGCTACCTCACAAAGTCCATACAGTGCCGGTCGTGTCCGTCCGGCTTCGGGTAGCTCCTTATTATGTCTATATTTTTCCTCAGGTCATGTGAGATTTTCGTGGAAACAAACATTGACTTTTCGCTATGGCAAACAAAGGCAAACGCTGGCAAACAGATGCAAATCACTGCATTACAATGCAGCAATTTCTATTTCCTTTAGTTATATATTTTCGAT
TTTTAATCAAAATAAAGTCTAAATGGTATCGGCAGATAAAAACAGAAGGGTGAACAGTAGTGAACAGTCGGTGAACAGTTACACCCTCAACTGTTCACCCTTTATCTGACTGTATTACTTATCTTTTTCTTTTCAGTGAACAGTAGTGAATAGTTATAAGTAAAAAAACAAACAGTGAGTAAGGTTTTCCTGAGACCTTTCTCTGGCCAGCCGGGTTTTAAGGTCTGTTTGTGCCATTTTTGCCACAACGGCAATGAATCGTGTTGTTGTGTCTGGCGCGGCAGAATCTCCTCAGATTGAAACGAAGAGGAGACCCGACATGACTCAGACCGCTGTTATTCCCGACTACCTTAAACCTGCAATGGAACGCCTTGAGACTGCCCGCTCGGCGCATCTCGCCAATGCCAGCCGTATGGATGAAACCACGACGGTCATCAGCCAGGTGCAAACGCAAAAAAATGAACTGGAGCAGGAAAACGGCAATGATTCCGGCGCATGGCGCGCCGCCTTTCGTGCCGGTGGTGCTGTCATTACCGACGAGCTGAAACAACGCCATCTGGCGCACGTGGCACGGCGGGAACTGGCGCAGGAATGTGGCAGCATGAACGAGGTACTGTCTTTTGAGCTGGACAGGCTCAAAGGAGCCTGTGACCGCACGGCCAGAGCATACCGTCAGGCACATCACGGCGTCCTCAGTCAGTATGCAGAGCATGAACTTGATGCAGCCCTGCGTGAAAGCTGCGGTGCCCTCATCAGAGCAATGAAACTCAACATACTGGTTCTGAATAATCCGCTTGCTAATACGACCGGGCATCAGGGATATACCGAACCGGAAAAAGTTGTAATGCAGCAGGTGAAAGCGTGGCTTGAACAGGCCGTGAAGGACTGCAATATCCGTCTGACCGATGAACTGGTGCTGTTTAAAACAGGGCTGTCGGCTTCCACACTGCCGCATATGGAGCATGATGTTGCGACCACGCCCGGCCAGCGAAAAGTCTGGCAGGAAAAAATGCGTGAACGTGAAGCCAACCTTAAAGCACGGGGGTTACTGTCATGATGCGCTGTCCTTTCTGCCGCACAGCGGCACACGTTCGCACCAGCCGCTATATGTCTGAGAGCGTCAAAGAGAGTTACCTGCAGTGCCAGAATGTGCACTGCTCGGCGACATTCAAAACGCATGAGTCCATCTTTGAAGTGATACGTTCGCCGGTCGTCGATGAGAAACCCGCGCCGGTGCCGACAGCCCCCGTGGCACCCCGTCGGGTAAAAGGCTGCTACAGCTCGCCGTTCCGCCATTAATCAGGAGAGACAACCCGTGACCACTCTGACCTTACAGCAGGCCTGTGACGCCTGTCAGACGAACAAAACCGCGTGGCTTAACCGTAAAACCGAACTGGCCGCCGCAATGCAGGAATATCAGGAATTATTGCTGGATGACAATGTATCAGGCTCCCGCAGATTACAGATGCTGCGTGACCTGATTGACGTAAAAAAATGGGAAGTTAATCAGGCCGCCGGTCGCTACATCTTCTCGCATGAGGAGGTGCAGCGCATCAGCATCCGTAACCGGCTGCATGATTTTATGCAGCAGAACGGCGCAGAGCTGGCCGCCGCACTGGCACCGGAGCTGATGGGGATTAAAAACCAGCCCGCGATGATAAAAAATCGCGCGCTTGACCGTTCAGTCTCTTACCTGAGAGAAGCTCTTTCCGTCTGGCTGACCGCTGGAGATGAAATTAATTATTCTGCACAGGATAAAGATATTTTAACGGCCATCGGATACAGGCCTGACGCGCCTTCGCGGGATGATAATCGTGAAAAATTCACCCCTGCACAGAACATGATTTACACCCGTCGACGCGCCGGACTGGCCGCGCAGTAG
Protein sequences of DBSCAN-SWA_1 >NZ_CP014620|1156229:1162033|1159666_1159933_-|WP_001604627.1|DBSCAN-SWA MHTAFSSPSSAPAAPLMQVSDTVHERFIRLPEVMHLCGLSRSTIYDLISREAFPKQISLGGKNVAWAQSEITAWMADRIAERNRGYDA >NZ_CP014620|1156229:1162033|1160470_1161208_+|WP_039500098.1|DBSCAN-SWA MTQTAVIPDYLKPAMERLETARSAHLANASRMDETTTVISQVQTQKNELEQENGNDSGAWRAAFRAGGAVITDELKQRHLAHVARRELAQECGSMNEVLSFELDRLKGACDRTARAYRQAHHGVLSQYAEHELDAALRESCGALIRAMKLNILVLNNPLANTTGHQGYTEPEKVVMQQVKAWLEQAVKDCNIRLTDELVLFKTGLSASTLPHMEHDVATTPGQRKVWQEKMREREANLKARGLLS >NZ_CP014620|1156229:1162033|1158577_1158898_-|WP_000743150.1|DBSCAN-SWA MKKPLPPVLRAALYRRAVACAWLTLCERQHRYPQLTLDTLESAIAAELEGFYLRQHGEEKGRQIACALLEDLMEAGPLKAAPSLSFLGLAVMDELCARHITAPVLH >NZ_CP014620|1156229:1162033|1161204_1161450_+|WP_000984211.1|DBSCAN-SWA MMRCPFCRTAAHVRTSRYMSESVKESYLQCQNVHCSATFKTHESIFEVIRSPVVDEKPAPVPTAPVAPRRVKGCYSSPFRH >NZ_CP014620|1156229:1162033|1158894_1159122_-|WP_001604623.1|DBSCAN-SWA MRHTTITARDLECLEHMRNVGQLVGDLMQVQDCATVRRDPEQQLQLTSVIYLMTAQLDGVVERCNQQWLTGEGNV >NZ_CP014620|1156229:1162033|1161466_1162033_+|WP_000210078.1|DBSCAN-SWA MTTLTLQQACDACQTNKTAWLNRKTELAAAMQEYQELLLDDNVSGSRRLQMLRDLIDVKKWEVNQAAGRYIFSHEEVQRISIRNRLHDFMQQNGAELAAALAPELMGIKNQPAMIKNRALDRSVSYLREALSVWLTAGDEINYSAQDKDILTAIGYRPDAPSRDDNREKFTPAQNMIYTRRRAGLAAQ >NZ_CP014620|1156229:1162033|1156229_1158563_-|WP_021000674.1|DBSCAN-SWA 
MKMNVTETVKQACGHWPHILPALGVKVIKNRHQSCPVCGGSDRFRFDDKEGRGTWFCNQCGAGDGLKLVEKVFGVSASEAAGKVNAVTGNLPPVAPEVIAAAEAETEADRKAAAALAVRLMDKTRPATGNAYLTRKGFPALECLTLTAVHKTGGVTFRAGDVVVPLYDNTGALVNLQLINSEGLKRTLKGGQVKGACHVIEGKKQAGKRLWIAEGYATALTVHHLTGETVMVALSSVNLLSLASLARQKHPACQIVLAADRDLNGNGQSKAAAAADACEGIVALPPVFGDWNDAFMQKGEEATRKAIYDAIRPPADSPFTTMSEAEFTAMSTSEKAMRVHEHYGEALAVDANGQLLSRYEAGIWKIIPPSDFARDVAGLFQRLRAPFSSGKIASVVETLKLIIPQQDAPARRLIGFRNGVLDTATGTFSPHHKSHWLRTLCDVDFTPPVEGETLETHAPHFWRWLDRAAGGKPEKRDVILAALFMVLANRYDWQLFLEVTGPGGSGKSILAEIATMLAGEDNATSATIETLESPRERAALIGFSLIRLPDQEKWSGDGAGLKAITGGDAVSVDPKYKDAYSTHIPAVILAVNNNPMRFTDRSGGVSRRRVILHFPEQIAPEERDPQLKNKIARELAVIVRQLMQKFSDPMTARTLLQSQQNSDEALSIKRDADPAFDFCGYLEALPDPDGMYIGNANIIPRQPRLYLYHAYLAYMEAHGYRNTLSLTMFGKGLPAMLKEYGLSYEKRRKNQGIQTNLTLREESNADWLPKCDDPIAK >NZ_CP014620|1156229:1162033|1159118_1159670_-|WP_023243884.1|DBSCAN-SWA MMMTVQQTAPFSGLLLFIVSMYSFPAVAKSAAGRRNPSYFKATPDAPCVFFYVVAQAPPFFGLWCLYLSPCQIMVVRAGQPSGWPVSLKAGIPTPVRVTTHERRNSGGGIYRYFKEVAPMATTLTPSHPQFVFVFAAVRRADRKPRICMLRTVAGDEHTARLSLIRDYVLSFAGRLPVAEVRA |
8 | Enterobacteria_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
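The keyword rule in the legend above can be sketched in a few lines of Python. The page does not say how the match is performed, so the case-insensitive substring test below is an assumption, and `PHAGE_KEYWORDS` and `matched_keywords` are hypothetical names used only for illustration.

```python
from typing import List

# Keyword list copied from the legend; the matching rule itself is assumed.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> List[str]:
    """Return the legend keywords found in a protein annotation string.

    Assumption: a case-insensitive substring test, so 'baseplate' also
    matches 'plate' and 'endolysin' also matches 'lysin'.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    # The annotation strings below are illustrative, not taken from the page.
    for annotation in ("phage tail fiber protein", "hypothetical protein", "tRNA"):
        print(annotation, "->", matched_keywords(annotation))
```

A substring test is deliberately permissive. A stricter variant could match whole words only, at the cost of missing compound terms such as 'baseplate'.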
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1684908 : 1694079
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP014620|1684908:1694079|DBSCAN-SWA GATGATTGAATTTAACCATGTCAGTAAAACCTTCGGCGATCAACAGGCTGTTAGCGACCTCAATTTGCACTTTAGCGAAGGCAGCTTTTCGGTGTTAATTGGCACCTCCGGTTCGGGAAAATCGACCACTCTGAAGATGATTAACCGGCTGGTAGAGCATGATAGCGGAACGATCCGTTTTGCCGGGGAAGAGATCCGCAGCCTGCCGGTGCTTGAACTACGCCGTCGCATGGGCTATGCCATTCAGTCTATCGGTCTTTTTCCCCACTGGACGGTGGCGCAAAATATCGCCACCGTACCGCAACTACAAAAGTGGTCGCGTGCGCGGATTAACGATCGTATTGACGAACTGATGGCATTATTGGGTCTGGAAAGCGCGCTGCGCGATCGCTATCCGCATCAGCTTTCCGGCGGGCAACAGCAGCGGGTCGGCGTTGCGCGGGCGCTGGCTGCCGATCCGCAGGTATTGCTGATGGACGAGCCTTTCGGCGCGCTTGATCCGGTAACGCGCGGCGCATTGCAGCAGGAGATGACCCGCATTCATCAGCTGCTGGGGCGCACCATCGTACTGGTGACGCACGACATCGACGAGGCGCTACGCCTCGCCGACCATCTGGTGCTGATGGACGGGGGCCACGTTATCCAACAGGGATCGCCGCTTTCTATGCTGACCTCGCCGGAAAATGATTTCGTGCAGGCGTTTTTTGGCCGCAGCGAGCTGGGCGTAAGGCTGCTTTCGTTACGTAGCGTAGGCGATTATGTACGCCGGCATGAACAGCTCAGCGGCGATGCGCTGGTGGAAGAGATGACGCTACGCGATGCGCTATCGATGTTTGTCGCCCGTCGGTGCGACGTCCTGCCGGTGGCGAATCAGCAGGGCGAGCCCTGCGGTACGCTCCATTTCCGCGATCTGCTTTCGGAGACGTCCCCCCGTGAAACGACTGTGTGATCCGCTTCTCTGGCTTATTGTTCTGTTCTTGCTTCTGCTGTTTGGATTGCCTTATAGCCAGCCGTTCTTCGCCGCGCTGTTTCCCGATTTACCGCGCCCGGTCTACCAACAGGAGAGTTTTGCCGCCCTCGCGCTCGCCCATTTCTGGTTGGTGGGCATCTCAAGTTTGTTTGCCGTCGTGGTGGGCGTCGGCGCAGGGATTGCGGTCACGCGAGAAAGTGGGAAAGAGTTTCGTCCCCTGGTGGAGACTATCGCCGCCGTCGGGCAGACCTTTCCCCCGGTGGCGGTACTGGCGATCGCGGTACCCGTCATGGGTTTTGGTCAGCAACCAGCCATTATCGCCTTGATCCTGTATGGAGTGTTGCCCATCCTGCAGGCGACCCTGGCCGGGCTGGGCGCGGTGCCTGCCAGCGTGATGAGCGTTGCCAGCGGTATGGGAATGAGCCGTCGCCAACAGTTGTATCAGGTTGAGCTGCCGCTGGCCGCGCCGGTGATTCTGGCGGGCATCCGAACCTCGGTGATTATCAATATTGGTACGGCGACCATCGCTTCAACGGTGGGGGCCAGTACGTTAGGCACGCCGATCATTATCGGGCTTAGCGGCTTTAATACGGCCTATGTTATCCAGGGGGCGCTGCTGGTGGCGCTGGCGGCGATCATTATCGATCGCCTGTTTGAAAGGCTGACGCGCGCGCTTACCCGGCACGCAAAATAAAACTGTAACCTGCCAGCATCACGCCGCCGATACCGCCAATAGCCATCAGCAGGAAAAGGGCGATCACCCCGATTTTCGCTACGCGCATTATGTACTCCTTATGTTAATAAAAGGAGTATACATTAAAGCGAATTTGTTAGCTGCTGTTTAAACGCCAAGGGGATGAATGTCGCGTCCCTGGGCGCGCCATGCCAGGAGTTGCTGCTGCTGCGCCAGCGTCTGGTTTTCTCCGCACCATACCAGTAACGTCTTGCCGTCA
AACAGTTCCGGGCGGAACTGGCTAAGCGAATGCGCCAACACGTCGATTCGCCATCCCTGTTGGCTGGCGACCCAACCTTCCAGCCACAGGCGGGTGGTATCATGGATATTCCAGCCGATCACCAACGCATCTTTTCCCTGTTTCTTACGCGCAGACGCCAGGCAGAGCGCAATATAGTTGATCAGGATACCGTCAAGAATGCCGAGCAGCGCCTGAAGGGCGGGTTGTTGGCACTGTAATCGTCGCCGCAGCGGGACGAACAGGTTAGTGGTCAATGTTTGGGCTGGATAATCCTGACCGCGTTCTTTGACCCATAACCGTAAACTGTGCAGATTACTGCTTTGCAGATAGTGCAGCAGGATCTCCTGCTGTTCGCGCCAGCCGTTAGGTTGTTCGCTACTGTCGCTACTGAGCAGCACTTTGACTTTGCTGACCTGGACGCCGTTATCTATCCAGCGCTTGATTTCGCGGATTCTGTCGATATCGGCATCGTTAAACAGACGATGACCGCCATCCGTTCGCTGTGGTTTTAAAAGTCCATAACGTCTCTGCCACGCGCGCAACGTGACAGGATTGATATCACAAAGCAAAGCCACTTCACCAATTGTGTAAAGCGCCATCGTTTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCTCTTTTAGGAACCAGGAAGTTTTGCCTGTTTTTTATGCATTAAAACGCGAAGTAGCGGGTTGCGGCGCGGCGTTTAAGTGATCGTATTCACGAATTCATATTTTTATGCAACAGTTCAAAGAAAGTTAATCGTACTCAATGTATGTTACGCGCTTTTAATTGAAGTGTGGTTTGCGGGTATGTACGAGTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTGTTTCTGGTCATTGCGTGGCTAATGAGTAAAACGCGCCTGTTCATCCCGCTTATGCAGGTCACGGTTCGTCTGCCGCACAAGCTTCTGTGTTACGTCACGTTTTCTATCTTCTGCATTATGGGCACTTATTTTGGGCTACATATCGAAGATTCGATTGCCAATACCCGCGCGATTGGCGCGGTGATGGGCGGCCTACTCGGCGGGCCAGTCGTCGGCGGGCTGGTCGGCCTGACCGGTGGGTTACATCGGTATTCTATGGGCGGCATGACGGCGCTGAGCTGTATGATTTCCACCATCGTCGAAGGGCTGCTGGGCGGGTTGGTACACAGCGTTCTCATACGTCGCGGACGCCCGGACAAAGTGTTTAGCCCGCTGACGGCGGGAGCAATTACGTGTGTTGCCGAACTGGTGCAGATGTTGATCATTTTACTGATAGCCAGGCCGTTTGACGATGCCCTGCATCTGGTCAGTAATATTGCCGCGCCGATGATGGTGACGAATACCGTTGGCGCCGCGCTGTTTATGCGTATTTTGCTCGATAAGCGCGCCATGTTCGAAAAATATACTTCGGCATTTTCTGCTACCGCGCTGAAGGTCGCTGCGTCAACGGAGGGGATTCTGCGTCAGGGATTTAACGAAGTGAACAGTATGAAGGTGGCGCAGGTGTTATATCAGGAGCTGGATATTGGCGCCGTCGCCATCACCGATCGCGAAAAACTGCTGGCTTTTACTGGTATTGGCGACGATCACCATCTACCGGGCAAACCCATTTCATCAGGTTATACGCTGAAAGCAATTGAAACCGGAGAGGTGGTTTATGCCGATGGCAACGAAGTGCCGTATCGCTGTTCGCTACACCCGCAGTGTAAACTCGGTTCGACGCTGGTGATCCCGCTGCGTGGCGAAAATCAGCGAGTCATGGGCACCATTAAATTGTACGAAGCGAAAAACCGACTGTTCAGCTCAATTAACCGCACCCTGGGAGAGGGTATTGCGCAGCTTTTATCCGCGCAGATCCTGGCCGGGCAGTATGAACGGCAGAAGGCGTTGCTGACGCAGTCAGAGATCAAGCTGTTGCACGCGCAGGTGAACCCGCATTTTCTGTTTAAC
GCGCTCAATACCATTAAAGCGGTGATTCGCCGCGACAGCGAACAGGCCAGCCAACTGGTGCAGTACTTGTCGACCTTTTTTCGCAAAAATTTAAAACGCCCGTCGGAAATCGTCACGCTGGCGGATGAAATTGAACACGTAAACGCTTATCTGCAAATTGAAAAAGCGCGTTTTCAGTCGCGTCTGCAGGTACAGCTTGATGTTCCATCGACGCTTTCACGTCAGAAATTGCCTGCGTTTACATTACAGCCGATTGTTGAGAACGCCATTAAACATGGCACGTCGCAACTGCTTGATACCGGCAACGTCGCTATTCGCGCCCGGCGCGAAGGGCAGCATTTGATGTTAGATATTGAGGATAATGCGGGACTGTATCAGTCTTCCGCCGGCAGTAGCGGGCTGGGGATGAGTCTGGTTGATAAACGTCTGCGCGAACACTTTGGCGATGATTATGGTATTAGCGTGGCCTGCGAGCCGGACTGTTTTACCCGAATTACATTACGACTTCCACTGGAGGAGGACGCATGATTAAAGTGCTGATTGTGGATGATGAGCCGTTAGCGCGGGAAAATCTGCGGATTTTGCTCCAGGGGCAGGATGACATTGAGATTGTGGGAGAGTGCGCGAACGCGGTAGAAGCGATTGGCGCGGTACATAAGTTGCGACCTGATGTGCTGTTTCTGGATATTCAGATGCCGCGTATCAGTGGACTGGAGATGGTAGGAATGCTTGATCCGGAACACCGCCCGTATATCGTTTTTTTAACCGCGTTTGACGAATACGCCATCAAAGCCTTTGAAGAACACGCTTTTGATTATCTGCTCAAGCCGATAGAGGAGAAACGGCTGGAAAAAACGTTACATCGTCTGCGTCAGGAGCGCAGTAAACAGGATGTTTCGTTGTTGCCGGAAAACCAGCAGGCGCTTAAATTCATTCCCTGTACCGGACACAGCCGGATCTATTTGTTGCAAATGGATGATGTCGCCTTTGTCAGTAGCCGTATGAGCGGCGTTTATGTGACCAGCAGTGAAGGGAAAGAGGGGTTTACCGAGCTGACGCTGCGCACGCTGGAAAGCCGGACGCCGCTACTGCGTTGTCATCGTCAGTTTCTGGTGAATATGGCCCATTTGCAGGAAATTCGGCTGGAGGATAATGGGCAGGCAGAGCTGATTTTACGCAACGGCCTGACGGTGCCGGTAAGCCGTCGCTATCTGAAAAGTTTAAAAGAGGCGATTGGCCTGTAAAAGACTGTTAGAATATCGTTTTGCCATAGAAACGACCGAAGGCCTCATGCTGAGTAACGATATTCTGCGTAGCGTGCGCTACATTTTAAAAGCTAATAATACCGATCTGGCGCGTATCCTGGCGCTGGGTAACGTTGATGCTACGCCGGAGCAGATAGCAATCTGGTTGCGCAAAGAAGAGGAAGAGGGGTTTCAGCGTTGCCCGGATATCGTGTTGTCCTCATTTCTCAATGGCCTCATTTATGAAAAACGCGGCAAAGATGAGGCGGCGCCTGCATTGACGGCGGAACGTCGTATCAACAACAATATTGTGCTGAAAAAGCTGCGTATTGCCTTTTCGCTAAAAACAGATGATATCCTGGCGATACTTACCGGTCAGTTGTTTCGTGTCTCAATGCCAGAGATCACCGCGATGATGCGCGCGCCGGACCATAAGAACTTCCGCGAATGCGGCGATCAGTTTATGCGTTATTTTCTGCGCGGTCTGGCGGCCCGTGAACACGCGGCGAAGTAATTCTGCGGTATTGTTCCCGGCAGCGTCCTGTCTGACCGGGAAAACGCATTATTATACTAATTGATTCTATGATACCCGCTCTCTTCCAACAGTTTCTGCGAGCGAATCATTGACAGATAGTACGCGGAACAGTTGTCAATTGATGATCCTGGCAATTTACAGAGGTCGCTTATTTTTGCCTGGGTAAAATCAATATCCACATATTCCGTAGCATAGCTATCATAATAGTCGATTCGTTCAGT
CAAACCCGGCATACCCTGATAAGCTTCGCCGACTTGACTCAGCATTTTTTGTGCTTCTTCTTTATTATTGGCTTTCAGGGTCTTATAAAGTAGTTTATGTTCAGATATTTGACGTAAAACGATATCCCCTTTGTAGTAATAGGTTAACTTGATTTCAATACCGTTGAGATTACCTACATAGCGTTGTGTTTCCTCTGACTCTTTGCTAGCGGCTATCTTCTTGATAAACGCTGTCATGTTATTTTGCTTTCCCTGGAGAGTATCGTTTTTCTGATCGCAGCCAGTTATGCTAACCGATAGAGAGAGCGCGAATAGTGGCAGTGCCATAAGACGTAGAACCTGCATAACAATTCCTTGTCGTTAAGTATTGGTGTGGCCAGGAATTCAGGGATTATAGGCTTTGGCGAGGGGACTTACAGCGAGGCTGTCTTTTTTCGGAATTCATAAAGAAAAGACGCTGCCGAAGCAGCGCCCTGAGCGACTTTACCAGTCGATGCAATACATTATGCCTGCCAGTTATTTCGCTTCTTTAAAACCAGCAGCTTCCAGCAGCGTCTGGGTTTGTTTCATGCTGATACCTTTGCTGGTATCGCCGGACACCATCGTCCCTGAGATTTGCTGTAACGCTTTAAAGTCCACTTTTTCCATATCCACAGAGACGTTTTCCTGGGCATAGGTATCTTCATAGGTTAATTTTTCTTCCACTCCGGCGATATTTTTATATTTCGCGCTCAGCGGATCGAGAATTTTGGCGGCATCTTCTTTCGTTTTAGCGCCTACAGTGGCATAGCTGATTTTACTTTCAGACGTCTGCTTAATGATTTTGTCACCTTTATAGGTGTAAGTAATTGAAATTTCTGTCCCCGCCAGGTTTGCGTTAAAGGTCTTTGATTCTTCTTTATCGCCACAGCCAGCAAGAGAGAACACCAGTACGGAAGCCAAAGCCGCGGACAATAACTTGCCAGAAATTTTCATCTAAAACTCCATTTTATATAATGATTGGGTTTTTAAAATAATTTCAATGAATTAATTTAACCCAGTAATAGCAATGTATCAGGGAGAGATAGAATATGACTTTTAGCCGTTATTTAGCAGTCCGGATATGGAGTCTTAACGCTATTGCTTATTAAGGAAAAAGTTAAAACACGCGGATGGGGTGATATGCCAGTCAGGATTAAGCGGTTAAAAAAGCCGGAGCATGCTCCGGCTTGTTGCTTATTTCACCTGTTGGCCAGGCTTCGCGCCGTCATCAGGGCTTAACAGGAAGATATCTTTCCCGCCAGGTCCTGCGGCCATCACCATTCCCTCGGAGACGCCAAAGCGCATTTTGCGCGGCGCGAGGTTGGCGACCATTACCGTCTGGCGGCCAATCAGCGCCTGCGGGTCCGGGTAGGCGGAACGAATGCCGGAGAAGACGTTACGCTTCTCGCCGCCCAGATCCAGCGTCAGACGCAGCAATTTGTCAGAACCATCTACGAACTCAGCATTTTCAATCAATGCTACGCGCAGGTCAATTTTGGCGAAATCGTCAAAGGTGATGGTTTCCTGAATCGGGAAGTCGGCTAACGGGCCGGTAACCGGCGCGGCTGCGGCTTTCACCTCTTCTTTAGACGATTCAACCAGCGCTTCAACTTGCTTCATGTCGATGCGATTGTAGAGCGCCTTAAAGGTGTTGACCTTGTGACCGAGCAGCGGCTGTTCGATGGCATCCCAGTTCAACTCACTGTTCAGGAAGGCTTCAACGCGTTCAGAAAGCGTCGGCAGTACCGGTTTCAGATACGTCATCAGCACGCGGAACAGGTTGATGCCCATTGAGCAAATGGCCTGCAGGTCCGCGTCGCGGCCTTCCTGTTTAGCCACCACCCACGGCGCTTGCTCGTCAACATAACGGTTAGCGATGTCGGCCAGCGCCATAATCTCACGGATAGCTTTACCGAATTCACGGCTTTCCCATGCTTCGCCAATCACCGCAGCGGCGTCAGTAAAGGTTTTATACAATT
GCGGATCGGCCAGTTCAGCCGCCAGCACGCCGTCGAAACGCTTATTGATAAAACCGGCGTTACGCGAGGCCAGGTTGACTACTTTATTGACGATATCGGCATTGACGCGCTGGACAAAGTCTTCCAGGTTCAGGTCGATGTCATCAATGCGTGAAGAAAGCTTCGCGGTGTAGTAGTAGCGCAGGCTGTCGGCGTCAAAGTGTTTCAGCCAGGTGCTGGCCTTAATAAAGGTGCCGCGAGACTTAGACATCTTCGCGCCGTTCACCGTCACGTAACCGTGAACGAACAGGTTGGTCGGCTTACGGAAGTGGCTGCCTTCCAGCATGGCAGGCCAGAACAGGCTGTGGAAATAGACGATGTCTTTGCCGATAAAGTGGTACAGCTCGGCATCGGAGTCTTTTTTCCAGTACTCATCAAAACTGGTCGTGTCACCGCGCTTATCGCACAGATTTTTGAAGGAGCCCATATAGCCAATCGGCGCGTCCAGCCAGACGTAGAAATATTTGCCCGGCGCGTTCGGGATTTCGAAACCAAAATACGGCGCGTCGCGGGAAATGTCCCACTGTTGCAGGCCGGATTCAAACCACTCCTGCATTTTGTTCGCCACCTGCTCCTGCAGCGCGCCGCTGCGGGTCCACGCCTGCAACATTTCGCTGAATGACGGCAGGTCAAAGAAAAAGTGCTCGGAGTCACGCATTACCGGCGTCGCGCCGGACACCACGGATTTCGGTTCGATAAGTTCGGTCGGGCTGTAGGTCGCGCCGCACACTTCACAGTTATCGCCGTACTGGTCTGCGGATTTACATTTCGGGCAGGTGCCTTTCACAAATCGGTCCGGCAGGAACATGCCTTTTTCCGGATCGTAGAGTTGAGAGATGGTGCGGTTCTTAATAAAACCGTTCTCTTTCAGGCGCGTATAAATCAGCTCGGACAGCTCGCGATTCTCGTCGCTGTGCGTTGAGTGGTAGTTGTCGTAGCTAATATTAAAACCGGCGAAATCGGTCTGGTGCTCCTGGCTCATTTCACCGATCATTTGCTCCGGCGTAATACCAAGCTGCTGCGCTTTCAGCATGATCGGCGTGCCATGAGCGTCATCGGCACAGATGAAGTTAACCTCATGGCCGCGCATTCGCTGGTAACGGACCCAGACATCAGCCTGGATGTGCTCCAGCATATGGCCGAGGTGGATAGAGCCGTTGGCGTACGGCAGCGCGCACGTTACCAGAATTTTCTTCGCGACTTGAGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP014620|1684908:1694079|1686551_1686659_-|WP_001261696.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >NZ_CP014620|1684908:1694079|1686718_1687450_-|WP_001240418.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERGQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >NZ_CP014620|1684908:1694079|1692045_1694079_-|WP_000195332.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADIANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLGHKVNTFKALYNRIDMKQVEALVESSKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVDGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >NZ_CP014620|1684908:1694079|1690644_1691175_-|WP_001197951.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >NZ_CP014620|1684908:1694079|1684908_1685856_+|WP_000569166.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGHVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >NZ_CP014620|1684908:1694079|1687672_1689358_+|WP_023243038.1|DBSCAN-SWA 
MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCVAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQSSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA >NZ_CP014620|1684908:1694079|1690120_1690588_+|WP_000950413.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMPEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >NZ_CP014620|1684908:1694079|1691346_1691805_-|WP_000703137.1|DBSCAN-SWA MKISGKLLSAALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >NZ_CP014620|1684908:1694079|1689354_1690074_+|WP_000598637.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP014620|1684908:1694079|1685839_1686571_+|WP_000824854.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
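The protein headers in this record appear to follow a fixed pipe-delimited layout: contig accession, prophage region (`start:end`), gene location (`start_end_strand`), protein accession, an optional keyword such as `tRNA`, and the predicting tool. That reading is inferred from the records shown here rather than from any format specification, so the sketch below is an assumption, and `ProteinHeader` and `parse_protein_header` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # e.g. "tRNA"; None when the field is absent
    tool: str


def parse_protein_header(header: str) -> ProteinHeader:
    """Split one '>'-prefixed header into its fields (layout is assumed)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected field count in header: {header!r}")
    contig, region, gene, protein_id = fields[:4]
    keyword = fields[4] if len(fields) == 6 else None
    tool = fields[-1]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         protein_id, keyword, tool)


if __name__ == "__main__":
    h = parse_protein_header(
        ">NZ_CP014620|1684908:1694079|1692045_1694079_-|WP_000195332.1|tRNA|DBSCAN-SWA"
    )
    print(h.protein_id, h.keyword, h.strand, h.gene_end - h.gene_start)
```

Only the gene field is split on underscores, so the underscore inside the contig accession `NZ_CP014620` is left intact.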
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1764303 : 1770600
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP014620|1764303:1770600|DBSCAN-SWA TATGCCCGTGACTAAATTCTCCCGACGTACCCTCCTGACGGCAGGTTCTGCGCTTGCTGTTCTTCCTTTTCTGCGCGCCTTGCCGGTACAGGCGCGTGAACCTCGCGAGACCGTCGATATTAAGGATTATCCGGCGGATGACGGTATCGCCTCGTTCAAACAGGCCTTCGCCGACGGACAGACCGTGGTCGTACCGCCAGGATGGGTGTGTGAAAATATCAATGCGGCGATAACGATTCCGGCGGGAAAAACGCTGCGGGTACAGGGCGCGGTGCGTGGGAATGGCCGGGGACGGTTTATTTTGCAGGACGGGTGTCAGGTGGTGGGGGAGCAGGGCGGCAGTCTGCACAATGTGACGCTGGATGTTCGCGGGTCGGACTGTGTGATTAAAGGCGTGGCGATGAGCGGCTTTGGCCCCGTCGCGCAAATTTTCATCGGTGGTAAGGAACCGCAGGTGATGCGTAATCTCATTATCGATGACATCACCGTTACCCACGCCAACTACGCCATTCTCCGCCAGGGATTTCATAACCAAATGGACGGCGCGCGGATTACGCATAGCCGCTTTAGCGATTTGCAGGGGGACGCCATTGAGTGGAATGTCGCGATTCACGACCGCGACATCCTGATTTCCGATCATGTCATCGAACGCATTGATTGTACCAATGGCAAAATCAACTGGGGGATCGGCATCGGGCTGGCGGGTAGCACCTATGACAACAGTTATCCTGAAGACCAGGCAGTAAAAAACTTTGTGGTGGCCAATATTACCGGATCTGATTGCCGACAGCTTGTGCACGTAGAAAATGGCAAACATTTCGTCATTCGCAATGTCAAAGCCAAAAACATCACGCCCGATTTCAGTAAAAATGCGGGTATTGATAACGCAACGATCGCAATTTATGGCTGTGATAATTTCGTCATTGATAATATTGACATGACGAATAGTGCCGGGATGCTTATCGGCTATGGCGTTGTTAAAGGAAAATACCTGTCAATTCCGCAAAACTTTAAATTAAACGCTATTCGGTTGGATAATCGCCAGGTTGCTTATAAATTACGCGGTATTCAAATTTCCTCCGGCAACGCTCCTTCGTTTGTCGCTATCACCAACGTACGGATGACGCGTGCTACGCTGGAACTGCATAATCAACCGCAGCACCTCTTTCTGCGCAATATCAACGTGATGCAAACTTCAGCGATTGGCCCGGCGTTAAAAATGCATTTCGATTTGCGTAAAGATGTCCGTGGTCAATTTATGGCCCGCCAGGACACGCTGCTTTCCCTCGCTAATGTTCATGCCATCAATGAAAACGGGCAGAGTTCCGTGGATATCGACAGGATTAATCACCAAACCGTGAATGTCGAAGCAGTGAATTTTTCGCTGCCGAAGCGGGGAGGGTAAGTACCGCTATTTTTACGAAAATTCCTGGGAAAAAGTTGTTCATACTTAATGTTATGGTGCCGACTAAGACGTAATGTAGAGCGTGCCATCATTATCCCTGGCAGCAGTGTAATTCATGCTGGCGAAAACAAGCTAAAGAGCTATAATTCAGCAACCATTTTACAGGTGGAAGAAACAATGATGAATTTGAAAGCAGTTATACCGGTAGCGGGTTTGGGTATGCATATGTTGCCTGCCACCAAGGCAATCCCAAAAGAGATGCTACCGATCGTCGACAAGCCAATGATTCAGTACATTGTCGATGAGATTGTGGCTGCAGGGATCAAAGAAATCGTACTGGTGACTCACGCGTCTAAAAACGCCGTTGAGAACCACTTCGACACCTCTTATGAACTTGAATCACTTCTTGAACAGCGCGTTAAGCGTCAGCTTTTGGCGGAAGTGCAATCTATCTGCCCACCGGGCGTGACGATTATGAACGTTCGCCAGGCGCAGCCGTTAGGACTGGGGCATTCTAT
TCTGTGCGCGCGTCCGGTCGTGGGCGATAACCCTTTCATTGTGGTACTCCCGGATATTATTATTGATGATGCTACCGCCGATCCGCTGCGCTATAACCTTGCGGCGATGGTGGCGCGTTTCAATGAAACGGGTCGCAGCCAGGTGCTGGCGAAGCGCATGAAAGGTGATTTATCGGAGTATTCCGTTATCCAGACGAAAGAACCTCTGGATAATGAAGGCAAAGTCAGCCGGATTGTGGAGTTTATCGAAAAACCGGATCAGCCGCAGACGCTGGATTCCGATTTGATGGCGGTAGGCCGTTATGTGCTTTCAGCCGACATCTGGGCGGAACTGGAAAGAACCGAACCGGGCGCCTGGGGCCGCATCCAGCTCACCGATGCCATTGCTGAGCTGGCGAAAAAACAGTCGGTTGACGCGATGCTAATGACGGGTGACAGCTATGACTGCGGTAAAAAAATGGGCTACATGCAGGCATTTGTGAAGTACGGGCTGCGCAACCTGAAAGAAGGAGCGAAGTTCCGTAAGAGCATAGAGCAGCTTCTGCATGAATAAGTATTAACAACCGTGATAAATGGTTGGTGATAAACATAATAATGGCAGTGAACATTCGAAGCGGCAAGTTGGCTGAAACGAGTGTTGACTGCCGTTTTAGTTTTGTATAAAGGGCTTAAGTAACAAGGGGTTATCTGGAGCATTTTAATGTTGATTTTATAAGATTAATCCTTGTTTCCGGATGCAATTAATAAGACAATTAGCGTTTAAGTTTTAGTGAGCTTTGCCCTGCTGGGCGAGGTTTGTAACAAGTCGATATGTACGCAGTGCACTGGTAGCTGATGAGCCAGGGGCGGTAGCGTGTGTAACGACTTGAGCAATTAATTTTTATTGGCAAATTAAATACCACATTAAATACGCCTTATGGAATAGAAAAGTGAAGATACTTATTACTGGCGGGGCAGGTTTTATTGGATCAGCTGTTGTCCGCCATATTATTAAGAATACACAGGACACTGTAGTTAATATTGATAAATTAACCTACGCCGGTAATCTTGAATCCCTTTCTGATATTTCTGAAAGTAATCGCTACAATTTTGAACACGCGGATATTTGTGATTCCGCTGAAATAACGCGTATTTTTGAGCAGTACCAGCCGGACGCGGTGATGCATTTGGCTGCGGAAAGTCATGTGGACCGTTCGATTACCGGGCCGGCAGCATTTATTGAAACCAATATCGTTGGCACCTATGTACTTCTTGAAGTTGCGCGTAAATACTGGTCTGCGCTTGGCGAAGATAAAAAAAATAATTTTCGTTTTCATCATATTTCCACTGATGAAGTTTACGGCGATTTACCGCATCCTGATGAAGTTGAAAACAGCGTTACGCTGCCGTTATTTACTGAAACGACGGCATATGCGCCAAGTAGCCCCTATTCTGCGTCAAAAGCATCCAGCGATCATTTAGTCCGTGCCTGGCGGCGTACCTATGGTCTACCAACGATCGTTACCAATTGTTCTAATAACTATGGCCCTTATCACTTCCCTGAAAAACTGATTCCGCTGGTCATTTTGAACGCGCTGGAAGGAAAGCCTTTGCCAATTTATGGCAAAGGGGATCAGATTCGCGATTGGCTATATGTAGAAGATCACGCTCGCGCGCTTCATATGGTAGTGACTGAAGGCAAGGCGGGGGAGACTTATAACATTGGTGGCCACAATGAGAAGAAAAATCTCGATGTGGTATTTACCATCTGTGATCTGCTGGACGAGATTGTACCCAAAGCGACTTCTTATCGTGAACAAATCACTTATGTCGCGGATCGTCCGGGCCATGATCGTCGTTATGCCATTGATGCAGGTAAAATTAGCCGCGAATTAGGCTGGAAACCGCTGGAGACCTTTGAAAGCGGTATTCGTAAAACAGTGGAATGGTACCTTGCAAATACTCAATGGGTAAACAATGTTAAAAGTGGGGCGTATCAGAGTTGGATAGAACAGAACTAT
GAAGGACGCCAGTAATGAATATCTTACTTTTTGGTAAGACAGGGCAAGTAGGCTGGGAGTTGCAACGTTCTCTGGCACCAGTAGGGAATCTGATTGCCCTGGATGTCCATTCAAAAGAGTTTTGCGGTGATTTTAGTAATCCGAAAGGCGTTGCCGAAACCGTTCGTAAGCTTCGTCCCGATGTGATTGTTAACGCAGCAGCACATACTGCAGTAGATAAAGCAGAGTCTGAACCAGAACTGGCGCAGTTACTTAACGCCACCAGTGTGGAAGCCATCGCTAAAGCAGCCAACGAAACTGGCGCATGGGTAGTGCATTATTCAACCGATTATGTATTTCCTGGTACCGGCGATATCCCATGGCAGGAAACGGACGCTACGTCGCCGCTGAATGTCTATGGCAAGACCAAACTGGCGGGAGAAAAGGCCCTGCAGGATAACTGCCCTAAGCATCTTATCTTCCGCACCAGTTGGGTTTATGCAGGTAAGGGCAATAATTTCGCAAAGACAATGCTTCGTCTGGCGAAAGAGCGTCAGACACTTTCAGTCATCAACGATCAGTACGGTGCGCCGACCGGTGCGGAATTACTGGCTGACTGCACGGCGCATGCGATCCGTGTGGCGTTAAATAAACCAGAAGTCGCAGGTCTTTACCATCTGGTTGCCGGGGGAACCACAACCTGGCATGACTACGCGGCCTTAGTCTTTGACGAGGCGCGCAAGGCAGGGATAACGCTTGCGCTGACTGAGCTTAATGCTGTGCCGACCAGCGCCTACCCGACGCCGGCGAGCAGACCTGGAAATTCGCGTCTCAATACTGAAAAGTTTCAGCGTAATTTTGACCTTATTCTGCCGCAATGGGAATTAGGAGTTAAGCGTATGCTGACTGAAATGTTTACGACGACAACCATCTGATAAATTTAAATGCCCATCAGGGCATTTTCTATGAATGAGAAATGGAAATGAAAACGCGTAAGGGCATTATTTTAGCGGGGGGCTCCGGCACCCGTCTTTATCCGGTGACCATGGCGGTAAGTAAGCAATTGCTACCAATTTATGATAAACCGATGATTTACTATCCCCTTTCCACACTTATGCTGGCAGGTATTCGGGATATCCTGATCATCAGTACGCCACAGGACACGCCGCGTTTTCAACAACTGCTGGGAGACGGCAGCCAGTGGGGGCTGAATCTTCAATATAAAGTACAGCCAAGCCCGGATGGCTTAGCACAGGCGTTTATTATTGGTGAAGAGTTCATTGGTAATGATGATTGTGCATTAGTACTGGGTGACAATATCTTCTATGGTCATGATTTACCAAAGTTAATGGAAGCTGCCGTTAATAAAGAAAGTGGTGCTACCGTCTTTGCTTATCATGTAAACGATCCAGAGCGCTACGGTGTGGTTGAGTTTGACCAAAGTGGCACAGCCGTTAGTCTGGAGGAAAAACCGTTACAACCGAAGAGTAATTACGCGGTAACGGGGCTGTATTTTTACGATAACAGCGTGGTGGAGATGGCGAAAAATCTTAAGCCTTCCGCTCGCGGTGAGTTAGAAATCACGGATATTAACCGTATCTATATGGAGCAGGGAAGATTGTCTGTCGCTATGATGGGGCGCGGTTATGCTTGGCTGGATACGGGAACGCATCAGAGTTTGATAGAGGCCAGTAATTTTATTGCAACCATCGAAGAACGCCAGGGTCTGAAAGTGTCTTGCCCGGAAGAGATCGCTTATAGAAAAGGGTTTATTGATGCAGAGCAGATTAAAAATCTGGCTAAACCGTTGTCGAAGAATGCTTATGGGCAGTATCTCCTGAACATGATTAAAGGTTATTAATAAAATGAACGTTATTAAAACAGAAATTCCTGATGTATTAATTTTTGAACCTAAAGTGTTTAGTGATGAACGTGGGTTCTTTATGGAAAGTTTTAATCAGAAAGTATTTGAGGAAGCGGTTGGTCGAAAGATTGAATTTGTTCAGGATAATCATTCAAAA
TCAACTAAAGGTGTGTTACGTGGTTTACATTATCAAGTTGAACCTTATGCTCAAGGGAAGCTTGTACGCTGTATAGCGGGAGAAGTTTTTGATGTTGCTGTAGATATTCGCAACGATTCCGAAACGTTTGGTAAATGGGTTGGTGTCAATATTTCTTCTGAAAACAAAAGGCAGTTGTGGATACCTGAAGGTTTTGCTCATGGGTTCTTAGTATTAAGTGAAGAAGCTGAATTTGTTTATAAGACATCAAACTATTACTCTGGCGAACATGAAAGAGGTATTATTTGGAATGATCCTGATATTAATATTACATGGGGAATAGATAGTCCAATTCTTTCATTAAAAGATAAGATTCATAAAGGTTTAGTAAAGTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP014620|1764303:1770600|1768239_1769139_+|WP_001023662.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALNKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >NZ_CP014620|1764303:1770600|1770069_1770600_+|WP_001100808.1|DBSCAN-SWA MNVIKTEIPDVLIFEPKVFSDERGFFMESFNQKVFEEAVGRKIEFVQDNHSKSTKGVLRGLHYQVEPYAQGKLVRCIAGEVFDVAVDIRNDSETFGKWVGVNISSENKRQLWIPEGFAHGFLVLSEEAEFVYKTSNYYSGEHERGIIWNDPDINITWGIDSPILSLKDKIHKGLVK >NZ_CP014620|1764303:1770600|1767154_1768240_+|WP_000697846.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >NZ_CP014620|1764303:1770600|1769186_1770065_+|WP_023243995.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAYRKGFIDAEQIKNLAKPLSKNAYGQYLLNMIKGY >NZ_CP014620|1764303:1770600|1765884_1766778_+|WP_000981469.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE >NZ_CP014620|1764303:1770600|1764303_1765707_+|WP_023244537.1|DBSCAN-SWA 
MPVTKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVVPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVAMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNAPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG |
6 | Enterobacteria_phage(50.0%) | NA | NA |
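In the three regions shown, the first value in each summary row (8, 10 and 6) equals the number of protein records listed for that region. One way to check this on the flattened text is to split a "Protein sequences of ..." cell back into (header, sequence) pairs. The sketch below assumes that records start at `>`, that the header contains no whitespace, and that any line breaks or a trailing `|` cell delimiter inside the sequence are layout residue. `split_flat_fasta` is a hypothetical helper name.

```python
from typing import List, Tuple


def split_flat_fasta(text: str) -> List[Tuple[str, str]]:
    """Recover (header, sequence) pairs from a flattened FASTA cell.

    Text before the first '>' (the cell label) is ignored. Whitespace
    inside a sequence is removed, and a stray '|' token left over from
    the table layout is dropped.
    """
    records = []
    for chunk in text.split(">")[1:]:
        parts = chunk.split()
        if not parts:
            continue
        header = parts[0]
        sequence = "".join(p for p in parts[1:] if p != "|")
        records.append((header, sequence))
    return records


if __name__ == "__main__":
    # Short placeholder sequences, not real data from the page.
    cell = ("Protein sequences of EXAMPLE "
            ">ctg|1:20|1_10_+|P1|TOOL MKT "
            ">ctg|1:20|11_20_-|P2|TOOL MA\nLV |")
    for header, sequence in split_flat_fasta(cell):
        print(header, len(sequence))
```

Comparing `len(split_flat_fasta(cell))` with the leading number in the summary row gives a quick consistency check for each region.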
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1876754 : 1888139
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP014620|1876754:1888139|DBSCAN-SWA CTCAGGCCGCTTTATTTTGCCGCGCATAAATATAGGGCGGGATAAAACCCTCCCGGCACGTATCCAGATAATCCGACCACCACTGCATCATGGCTTTACGGGCGTCGAGGTGCTCGGCGCGGTGAATATACGCCGCCCGCACACTGTTGCGCTCCTGATGGCTCATTTGCCGCTCAACCGCATCCCGCGACCAGAGTTCAGACTCGACTAACGCGCTACAAGCTATTGCCCGAAAGCCGTGACCGCAAACATCCGCCTGCGTGTCGTATCCCATCAGACGTAGCGCTTTGTTGATGGTATTTTCACACATCGGCTTATACGGGTTGTGATCGCCGGGGAATACCAGTTCCAGATGCCCGGAAATCTCCCTGATCCGTTTCAAGATGTCGATGGTCTGACGCGAAAGCGGCACAATATGCGGCGTGCGCATTTTTGCGCCACGCCCGGAATAGCGTACTTTGTCTATTTCCTCGCGGGTGGCGGGAAGCGTCCAGATTTTGTTTTTGAAATCAATCTCACCCCAGCGCGCAAAACGCAGTTCACTGGAGCGAATAAACAGGTGAAGGGTCAGTTCAACCGCCAGTCGCGTGATCTCCCGGCCTTTTGTGTAGCCGTCGATACGCGCCAGCAGTTCCGGCAGACGTTCCGGCGGTAACGCGGGGTAGTGCTTCCTGACAGGTGCCGCAGTCACACCTTCCAGATATTGCGCCGGGTTACTTTCCGTCAGCCCCTGCTGCACGGCGTAGCGCATGATGTTGTTCAGGTGCTGCCGGGTACGGGACGCGACTTCCAGCAGCCCCTTTTCTTCAATCCCTTTCAGTAAAGCGGTGAAGTGCGGCGTTTTGAGGTCAGTGACCCGCATATTCCCAATCACCGGGAAGATGTGGTTACTCATACTGGCAAGAATGCGACTGGCATGATGTTCAGACCATTTCTTATTGGCCTTATGCCAGCCGAGCGCCACGGCCTTAAAACATTTCTCCGGCGAACTGGCGGCTTTCTCTTCCATGCGCTGTTGCACCGGATTGATATTCTGCGCCAGTTGCTTACGAAATACGTCACGCTGCTGACGAGCATCGGCAAGAGAAACCAGCGGATAAGCGCCTAAACCGATGCGCGATTCTTTACCGTTGAAGCGATATTTGAGATACCAGATGCGTGAACCGCCAGGATTAACCAGCAGATAGAGACCATGCGAATCTGACACTTTGAAAGGTTTAGCCGAGGGTTTTAAGTTACGGATTCTAGAATCGTTTAAAGACATTTGGGGGTCACTCCACAATCGAACCAAACTGACCCCAGATCTGACCACCAAATTTCTCGGATGCGGAGAGAAAACCAGATACGCATCGGGAAGGTTTTTTATGCTAACTTACTGAATCTAAAACGTATTTTGACGTATAAGGAAGCATAAAAACAAGAAATTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATCGTGTGAACGGGGCGCATAGTAACGATGTGCGATCGGCTTGTCAAAGGGGGAAATAAGGTTGCGCGTTTGTTTGCTGACAAAAACAACAAAGCGTTGAAGTTTTGATCTAACTCCTACTTTGCTCCGGCATGGCGCAACTTTGTCTGTAATTGCACAAGTCAAATGCTGTGACCTTACCGCAATGGCTATGTACCAGCGTCTGATGAAACGTGAAAAACTGGCAGGCACTTGGCAAATAATTCTGAGACATAACGCCGTAGAGATTAAGGGCAGGGAGTAGAATGAACTTTAGACGTGAAATATTTTGTGAAAATGGTTGATACAGGCAGTCTGACGCCGGTAGCGGAAATGGCAGATAAATTTCTGGTGCAGGCGAAAAGATTTCCGTCAATATCATAGGCAGAATTATGGTGCATCAG
CTTTTGGCGACGACACGGAACGAGCGGGTTTTATCGCGCTTTTCCTGAAGGATTTTTTCATCAGCCTGTTTTTTGCGTTCGGGTCAATCCCCTGATCCAGCAGCCTTTTGACCTCGTCACGGCGTTGTCTGGCATCAGCAAGAGAAACCGCAGGGTAAACCCCAATAGAAAACACCTTCTGTTTGCCATTGAAGCGATAGCCTGTCTGCCAGTATTTTGAATCGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGCTGGTCTGGCATTTTTTACTTTAGTATCAGTCAGTGACATGACGGTTCCCCCCGCGTGCTGGTAAAACGCAAATCGAACCAGCTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGGGTGAACCTGAAGAAGGGCATAACCAGTTGATATAAATGCAGAAAGCAGACGCCAGTGAACGTCTGCTTCCCTAAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATTGTTTCAACGAGGCGCATAATACTGGGCCGGCTATAACGTGTCAACAGTAAAATTAACGCGCTAATTCAATTGGTTAATTAACCCACAAAGTGAGGAATTAATGTTCGGATTCCTCGCCGTAGTCGCCTTCGGCAGCGCTTACCCGTAAGCGCTGAGAAGGATCCTGGCGATAGAACTGGCAAAAACGCTGCCATAGTGCCGGGAAACGTGGAGCAAACAGTTCTGGCGCGCTGAAAAAATACTCTGACAACACGGCAAAACATTCTGCAGGGTCGGTGGCGGCATAGGCATCTATACTGGCAGCGCTTTCGCCAACAAGATCGATTTCATCCTGAATATTATTCATTGCCGCGTGGAGATCGTGTTCCCAGCCAGCCACATCGCGCAACGGGATGAAAGGGATGCCGCTGGCGCGATCGCCATTACGCATATCCAGTTTGTGCGCGACTTCATGAATAATGAGGTTGAAACCCGAAGCATCGAACGAGTCCTGGATATCCAGCCAGTTCAGAATAATGGGCCCTTGTTGCCAGCTTTGCCCCGACTGTACGACACGCTGGCTGTGCACCAGACCTATGTCATCTTCCCATTCATCATCTACCACAAAGGGCGCGGGATAAATGAGCACTTCATGAAAACCATCAAGCCACTCAATACCGAGCTCCAGGATCGGTAAGCAAAAAATTAACGCAATACGTGCACTTTTTAACGAGTCGAGCTCAAATCCCTGTAGCGCTACCAGTCTTTTCTGCTGCAAAAAACGTTCGGCTAGCGCAATAAGCCGAGCCTGTTCTTGCGCGGTGAGGTTTACCAGAAGAGGTATAGCCAGCGCATCATCCCACGGCCAGTCTTCGTTCTGGGTTATTTCTTGTGCTTTCCAGGGCCACTTAATCATCGTTTTGCTCGTAAACTCGTCACTTGAACAAAATTACCCGAATAGGGTCTGTTAAAATGCCAAATTACCTGGCATCATTGCAATATACGGAGAGATGCCGGAGCGGCTGAACGGACCGGTCTCGAAAACCGGAGTAGGGGCAACTCTACCGGGGGTTCAAATCCCCCTCTCTCCGCCACAATTCAAACACTTAGCTCATCTTCTTTCAGCGATCAGTCTCACACTTAGAATACACTTAGAATATTCTGTTAGAATATTACGTGAAAAACGTATCGCCATCTTATGCTTTTTCTGCCAGAAGAGGGGGCCAGGGATGGTGTTATTTTTACTTTTCGATCATAAGTCAACTCCTGGTTTTCTGCCCCGGGCTTTCGCCGTACCCCATCTGGATTAATATTGATGTCACTCGCGGCGACTTAACGCCGTCGTCCTCGGTCATCGTCGCACCCGGCCTGTTGCCCGACATGTTCCCCTGTAGACATGGGCGCCGGGCAGAAGATTAATTGCTGTGAGGGAAACGCGCTGGCGCGTGGCGATAACTTACGCTGCGGGAC
TGTCGACGCTGTACAGAAAATCTGGCCTCCAGACTGGCTTAAATATGCGCACATGACAATACAACCGGAAAATTTACAAAACGCATAATTTGAACTGAGAGAGAAACTTACAAACGAAGCGACGAAGATTTAAACAGCCGTAGCGACTCCGGTATCTTGCGCGCATGTTTAAATAATACTACTGTATATAAAAACAGTATTAGAGGTATGAATTATGGAATTTTTCAGACCTATAGAGTTGCGCGAAATTATTCCTCTCCCATTTTTCAGTTACTTAGTGCCGTGTGGATTCCCCAGCCCCGCAGCGGACTACATTGAGCAGCGTATCGATCTTAATGAGTTGCTCGTTTCTCATCCCAGCTCAACATATTTTGTCAAAGCCTCGGGGGATTCAATGATTGAAGCAGGCATCAGCGACGGTGACCTGCTGGTGGTGGATAGCTCACGGAACGCTGACCACGGTGACATTGTAATTGCGGCAATTGAAGGAGAGTTCACCGTAAAACGGTTGCAGTTGCGCCCGACAGTGCAGTTAATCCCCATGAACGGCGCCTATCGACCTATACCTGTCGGCAGTGAAGACACGCTCGACATATTCGGGGTGGTGACCTTTATCATTAAAGCGGTCAGTTGATTATGTTCGCGCTCTGCGATGTTAATAGCTTTTACGCCTCCTGCGAAACGGTCTTTCGTCCTGATTTATGTGGCCGACCGGTGGTGGTGTTATCAAACAATGATGGCTGCGTTATCGCGTGTAGCGCCGAGGCGAAACAGCTCGGTATCGCACCAGGTGAGCCATACTTCAAACAGAAAGAACGCTTCCGGCGATCCGGTGTTGTTTGCTTCAGCAGTAATTACGAGCTTTACGCTGATATGTCGAACCGGGTAATGACCACACTCGAGGAGATGGTGCCGCGGGTAGAAATTTACAGCATTGATGAGGCTTTTTGTGATCTGACGGGGGTACGAAACTGCCGGGATCTGACAGATTTCGGGCGCGAGATAAGAGCGACGGTCCTGAAGCGCACGCACCTGACTGTCGGTGTAGGCATTGCCCAGACGAAAACCCTTGCCAAGCTGGCTAACCATGCTGCGAAAAAGTGGCAGCGCCAGACCGACGGGGTGGTTGACCTGTCGAACATCGATCGCCAGCGTCGGCTGCTGGCCCTGATACCCGTAGAGGATGTCTGGGGTGTCGGCAGACGCATCAGTAAGAAGCTAAATTCCCTGGGCATCAAGACTGCTCTCGATCTCTCTGAACAAAGTACCTGGATCATCAGGAAACACTTCAATGTCGTGCTGGAGCGTACCGTGAGAGAGCTTCGCGGAGAGCCATGTCTGGAGCTTGAAGAGTTTGCGCCGGCAAAGCAGGAAATCGTTTGTAGCCGCTCTTTCGGCGAGCGGGTCACAGACTATGAGGAAATGCGCCAGGCTGTTTACAGCTACGCTGCGCGCGCGGCAGAAAAACTCCGCGGCGAGCACCAGTACTGCCGTTTCATTTCAACATTCGTCAAAACATCACCCTTTGCCCTGAACGAGCCCTACTACGGTAACAGCGCCGCGGTGACGCTTCTCACCCCCACGCAGGATTCACGTGACATTATCAATGCGGCTGTGAAATGTCTGGATAAAATCTGGCGCGACGGCCATCGCTACCAGAAAGCGGGGGTGATGCTGGGTGACTTCTTCAGCCAGGGCGTAGCGCAACTCAACCTTTTCGACGATAACGCGCCGCGCGCCGGTAGTGCGAAGTTGATGGAAGTACTGGACCATCTTAACGCAAAAGACGGGAAGGGGACGCTGTACTTCGCCGGGCAGGGGATGTCGCAACAGTGGGCTATGAAGCGAGAAATGCTTTCGCCTCGGTACACCACAAGATACTCTGATCTACTGCGTGTTAAGTAACTTGTGCGATCAATGCCTGAGATGGTTGCCAAATCATCCCCGTTCTCTAACCGGTTTTGGTCGCACAAGATCACAGGAACCTCTCACGATGAGGC
GCATGTATCCTGGTTTACGACATCAGAAAATGTGGCGCGTTTATTGCCAGGTAGGCGTTGTGAGACGTCACTTATTTACGCCTGGTTTCAGCCGTAGCGCCGGGCATGGATAAAAAGAGTATGGCAATCAGCGTGATAATGCTAAAAAACAATTAATATTTTTTTAACAAAACTAAAGCTTGCTATGTTCAGTTAACCATGCGTTAATGGTTGTGCGGTTTGATACAAACTTATCTGAAGTAGTGATTGTAATATTTCTCATCATTTGTTCCTCTTGAGATCTCCTTTAGGTTTTTTTCTCTCTGATAATTTTCTTCAGGCCATTTCGCCCAAGGGCTCATTCGAAAGGTAACAATATTATGACGACGAAAATCACTGGTTTAGTAAAATGGTTTAACCCTGAAAAGGGCTTTGGTTTCATTACGCCTAAAGATGGCAGCAAAGATGTGTTTGTGCATTTTTCAGCCATTCAAAGTAATGAATTCCGCACTCTGAATGAAAATCAGGAAGTGGAGTTTTCAGTAGAGCAGGGACCAAAAGGTCCATCAGCGGTTAACGTTGTGGCGCTTTAAGGCAACTGATATTACTAATAAAATTCACTTCCGGTGTCCATGTTGCCATGGTTCACAATACAGAACATCGACATTCGATGTTACTGAGCAAAACCCGTTTGGCGCGAAATGTATTTTTTGTAAGTCAACCATGATCACTTTTGATAATGTTGGATTATACATTCGCTCAGGACAGGTTCCGCTAGATTTTAGAAAATAATTCATATTAGCTCCGTACAGGAGCTTTTTTATGCCCGGATGATTATCATCTATAGACGCTGACATCCATCATCTATAGTGGCATTTACCTTTCCCCAAAGGTGTTATTTCTCTTGCAGACAGCGCCTGAAAAAAGCGACGTTGTCCTCATCTATCCGATGAAACCAGGTCACCTGTCTTTGCGCCGTTATCCGTTATTTAATTCTTGCCCTATAATAACAAGCCCGCGCTAAGCACGGGCTTGACTAACATAAAGCGTCTTAGAACTGGTAGACCAGACCAACACCTACGATATCATCGGTTGCAATACCGTTGCTTGCGTAGAAGTCATCATCTTCGTCCAGCAGGTTGATTTTATAATCAACGTAGGTGGACATATTTTTGTTGAAGTAGTAAGTCATACCGACGTCAACATATTTAACCAGATCTTTGTCGGTGTAATGCCAGTTGCCACGATGGACTTCCTGACCGCCTAAATCTTTGCCCTTAGACTGCAAGTAGGCGATGGATGGACGCAGACCGAAATCGAACTGGTACTGCGCAACCACTTCAAAGTTCTGGGTTTTGTTAGCAATACCGCCATTACCTTCGCCATTGCCGCCGCCATAATAGGTCATGTTGCGGGTTTCAGCGTACATCGCCGCCAGGTAAACATTGTAGGCGTCATATTTGGCACCTACGGTCCAGGCTTCGGCAGTTTCACCGCCGGCATAGTTGTTGCGCTCATTCATGCCGTCGCCATAACCACGAGCAACCTGATTGTCCGTACGGTCAGAGGAAGAGTAGGCCGCACCCAGGCTTAACCCGAAGTCAAAGTCGTAGGAGGCAGACATACCGAAACCGTCGCCGTTTTCACGGGCCAGTTTGCGAGAACCACTATTTGCATCGCTGCCGTTGGCTGTTCCTTCGCCTGCGCCAGGATCTTCGTTATTACCCTGGTACTGCAACGCGAAGTTCAGGCCTTCCACCAGACCGAAGAAGTCGGTATTACGGTAGGTGGCAACGCCGTTGGTTCTGCCCAGCATATATACATCGGTCTGGGTATAAGTATCACCGCCGAATTCCGGCAGCGCATCGGTCCAGGCTTCAATGTCGTAGATAACACCATAGTTACGGCCATAATCGAAAGAGCCGTACTCGCCGAATTTCAGACCGGCAAAGCCCAGACGAGTCCAGGAGTTTGCACCTTCGCCTTCGGTGGTGTTCACCTTAATGTTGTATTCCCACTGA
CCATAGCCGGTCAGCATATCGTTGATCTGCGTTTCGCCTTTAAAGCCAATACGGGCGTAGGACTGGTCGCCATCGTCGCCTGCATTGTCAGAGAAGTAACGCAGACCATCAACTTTGCCGTACAGGTCGAGTTTGTTGCCATTTTTATTATAAATTTCAGCCGCATTTGCTGCGCCTGCCACTAATAACGCCGGGACAAGCAGTGCCAGAACTTTTCTGTTCATTATGTATTCCCTTATGATAATAATTTATATGAATATGTAGCCACTTCAACAAAACTACAAATTGATACTATTCTATGAAGTTCATGGAATTTAAAAAATAACATGTAACAAAGGTATTTAAAATATTTCAATTTGTTTCTGTTTGGTTTTTTATAATACAGGCCATATAAGTAATTAAGAATATATATTCTATAATTATCATTTTTTATCAATGGTTTATGTGTTTTGATTTGATGGCTGTTGGTGTGAATATAATTTGTTTTTTATATGTATTTGATGCTTTGATTATGAAAAGGCATAAAAAAACCGGCATAAATGCCGGTAATGTGGGTAATGATAAAATAAATGATAAAAGGCTAATAACCGAATCATTCTGACATTTAATGCTGATAAAATAAAACGCTATCGCGGTGCGTAAAAATAAGTTGTTCTGGTTATAGGTTATTCTGCATCAGAGTGCGATTCAATCACCATTCCCTTATTTAACAGCAGGTTCAGCGCCAGACGTAACATTAACTGAGAATAAACTATTTGTGGAATGAATAGCCAGAACAGGTGGATGAGGTATTCCATGGTCGAAATGTGATTAATATCACATTATAATGTAATTAATGTCATATGACTTACATCACAAAAGCGGAGTAAAGTTTGAGTACCAGGGAGGACAACGCCACCCGTAGCATGGGCGGTAAATTAGCGTTATGGGTTTTTTATACCTTTTGCGGCTACTTCATCTGGGCGATGGCGCGTTGCGTGTGGCTGATGTCCGCCATACAAACCGAGCCGGTTCTCGGCCCAATCAGCACTCCTGGCAGCGCAACGGAAAAATGGCTTAACGCGCTTTCGCTGGGCGTCGTCTGGCTTATTCTGGGGAGTATTGCCTGGTACACCCGGCCTCGCAAAAACAGGGGGTATCCCGCCGACACTCAGCCAGAAACGCGCAAGCACGCAAGGATGTAAAGTGGTGGCGGATATGTCAGGATAGCGAACCCTGTGCTCAGGCATGTAATAAAAATGGTCTGCTATAAAGAGAGGGCGTATGGACTCAGGATACTGGCAATCGCAGTTTGAAGACTGGCTACGTCACCACCACCAGGAACAGGATGCCGCCCATGACATCTTCCATTTTCGTCGTGTTTGGGCAACCGCGCAAACGCTCGGGGAAAACGTTCCTGTCGACTGGCTGGTGGTGCTATCGGCATGTTATTTCCATGACATCGTCAGCCTGGCGAAAAATCATCCGCAGCGGCATCGTTCTTCCATTCTGGCGGCGGCAGAAACCCGGCGTATTTTTCTGCGGGATTTTCCTGACTTTCCGGCAGAAAAACTGGCGGGCATTTGTCATGCTATCGAAGCGCATAGTTTCAGCGCAAAAATTGCGCCCACCACGCCAGAGGCAAAAATCGTGCAGGATGCAGACAGGCTGGAGGCGTTGGGCGCCATTGGTCTGGCGCGGGTCTTCGCGGTCTCCGGCGCGCTGGGCGTCGCGCTGTTTGATGCCGACGATCCCTTTGCCGACAGACGGCCTCTTAACGATAAGCAATTCGCGCTCGACCATTTTCAAACCAAACTGCTGAAACTGCCGCTGACGATGCAGACCGAACGGGGCAAGTACCTGGCGCAGCGTAATGCGGATTTTCTGGTGTCGTACATGGCGAAACTGAGCGCTGAGCTGAAAGGCGACTATGAAACACGGGATGAGGCGGTCATCCAGATGTTTGCTACGCATCAGTAACCCCTGTCGCTGAAACGTAAACCGCGTCATTTTATGTTAGT
CTGTCGGCAATTATTTTTGGCCGGTTAAATGTATGCAGGAAAATATTTCAGTAACACACGCCCGGAACCTCATCGCCGACGACGCCGGAAGCGAGATCCAGGCGATGCTGAGTCAATTGCTGGAAATCTATGATGTTAAAACGCTGGTGGCGCACCTTAACGGCCTGGGCGAACAGCACTGGAGCCCGGCCATCTTCAGGCGCGTAATGATGAACGCGGCATGGCATCGTTTGAGCGACAATGAACTCAGCTGTCTTAAAACAGAGTTGCCGACGCCGCCAGCGCATCATCCACATTACGCCTTTCGTTTTATCGATCTCTTCGCGGGCATCGGCGGCATTCGCCGCGGATTTGAAGCGATAGGCGGACAGTGCGTGTTTACCAGCGAATGGAATAAGCACGCGGTACGGACATATAAAGCGAACTATTTTTGCGATCCGCTGCAACATCGCTTTAATGAAGATATCCGCGATATCACATTGAGCCACCGGGAAGGGGTCAGCGATGATGAGGCGGCGGAACACATTCGCCAGCATATTCCGCAACATGATGTCCTGCTGGCGGGCTTTCCCTGTCAGCCATTTTCTCTGGCGGGCGTTTCCAAGAAAAATGCGCTGGGCCGCGCCCACGGCTTTGCCTGCGAGACTCAGGGGACATTATTTTTTGATGTCGTAAGAATTATCGACGCCCGCCGCCCCGCGCTGTTTGTGCTGGAAAACGTGAAAAACCTTAAAAGTCACGACCAGGGCAACACCTTCCGCATTATTATGCAAACGCTCGATGAACTGGGATATGACGTGGCGGATGCCGCTGACAATGGCCCGGACGATCCGAAAATTATCGACGGGCAGCACTTTCTTCCTCAGCATCGGGAACGTATTGTGTTGGTGGGATTCCGTCGCGATTTAAACCTGAAAACCGATTTTACGTTACGCAATATCGCCCGTTGTTATCCACCGCGCCGTCCGACGCTGGCAGAACTGCTGGAGCCCGTCGTCGAAGCCAAATATATCCTGACGCCGGTGCTGTGGAAATATTTATATCGCTACGCGAAAAAGCACCAGGCGCGGGGAAACGGTTTTGGCTATGGCATGGTTTATCCTGACAATCCGGAAAGTGTGGCGCGCACGTTATCTGCTCGCTACTACAAAGATGGTGCCGAAATTCTGATCGATCGTGGTTGGGATATGGCGAAAGGCGAAGTGAATTTCGACGATGCTGGCAACCAACAACATCGTCCCCGCCGACTCACGCCGAGAGAGTGCGCGCGTTTAATGGGATTTGAGGCGCCGCAAACGTACCAGTTCAGGATACCTGTCTCGGATACGCAGGCCTATCGCCAGTTTGGCAACTCCGTGGTGGTGCCGGTATTTGCTGCGGTAGCAAAGCTGCTGGAACCCAAAATTCACCAGGCGGTGACGCTGCGTCAGAGAGAGACGGTAGATGGCGGACGTTCACGATAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP014620|1876754:1888139|1885548_1885860_+|WP_000107435.1|DBSCAN-SWA MSTREDNATRSMGGKLALWVFYTFCGYFIWAMARCVWLMSAIQTEPVLGPISTPGSATEKWLNALSLGVVWLILGSIAWYTRPRKNRGYPADTQPETRKHARM >NZ_CP014620|1876754:1888139|1880890_1881310_+|WP_023243860.1|DBSCAN-SWA MEFFRPIELREIIPLPFFSYLVPCGFPSPAADYIEQRIDLNELLVSHPSSTYFVKASGDSMIEAGISDGDLLVVDSSRNADHGDIVIAAIEGEFTVKRLQLRPTVQLIPMNGAYRPIPVGSEDTLDIFGVVTFIIKAVS >NZ_CP014620|1876754:1888139|1881312_1882581_+|WP_000457663.1|DBSCAN-SWA MFALCDVNSFYASCETVFRPDLCGRPVVVLSNNDGCVIACSAEAKQLGIAPGEPYFKQKERFRRSGVVCFSSNYELYADMSNRVMTTLEEMVPRVEIYSIDEAFCDLTGVRNCRDLTDFGREIRATVLKRTHLTVGVGIAQTKTLAKLANHAAKKWQRQTDGVVDLSNIDRQRRLLALIPVEDVWGVGRRISKKLNSLGIKTALDLSEQSTWIIRKHFNVVLERTVRELRGEPCLELEEFAPAKQEIVCSRSFGERVTDYEEMRQAVYSYAARAAEKLRGEHQYCRFISTFVKTSPFALNEPYYGNSAAVTLLTPTQDSRDIINAAVKCLDKIWRDGHRYQKAGVMLGDFFSQGVAQLNLFDDNAPRAGSAKLMEVLDHLNAKDGKGTLYFAGQGMSQQWAMKREMLSPRYTTRYSDLLRVK >NZ_CP014620|1876754:1888139|1883706_1884900_-|WP_001080661.1|DBSCAN-SWA MNRKVLALLVPALLVAGAANAAEIYNKNGNKLDLYGKVDGLRYFSDNAGDDGDQSYARIGFKGETQINDMLTGYGQWEYNIKVNTTEGEGANSWTRLGFAGLKFGEYGSFDYGRNYGVIYDIEAWTDALPEFGGDTYTQTDVYMLGRTNGVATYRNTDFFGLVEGLNFALQYQGNNEDPGAGEGTANGSDANSGSRKLARENGDGFGMSASYDFDFGLSLGAAYSSSDRTDNQVARGYGDGMNERNNYAGGETAEAWTVGAKYDAYNVYLAAMYAETRNMTYYGGGNGEGNGGIANKTQNFEVVAQYQFDFGLRPSIAYLQSKGKDLGGQEVHRGNWHYTDKDLVKYVDVGMTYYFNKNMSTYVDYKINLLDEDDDFYASNGIATDDIVGVGLVYQF >NZ_CP014620|1876754:1888139|1880602_1880764_+|WP_000500830.1|DBSCAN-SWA MGAGQKINCCEGNALARGDNLRCGTVDAVQKIWPPDWLKYAHMTIQPENLQNA >NZ_CP014620|1876754:1888139|1879324_1880122_-|WP_000598920.1|DBSCAN-SWA MIKWPWKAQEITQNEDWPWDDALAIPLLVNLTAQEQARLIALAERFLQQKRLVALQGFELDSLKSARIALIFCLPILELGIEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHSQRVVQSGQSWQQGPIILNWLDIQDSFDASGFNLIIHEVAHKLDMRNGDRASGIPFIPLRDVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCQFYRQDPSQRLRVSAAEGDYGEESEH >NZ_CP014620|1876754:1888139|1883258_1883447_+|WP_024131163.1|DBSCAN-SWA MTNKIHFRCPCCHGSQYRTSTFDVTEQNPFGAKCIFCKSTMITFDNVGLYIRSGQVPLDFRK 
>NZ_CP014620|1876754:1888139|1883035_1883248_+|WP_000208509.1|DBSCAN-SWA MTTKITGLVKWFNPEKGFGFITPKDGSKDVFVHFSAIQSNEFRTLNENQEVEFSVEQGPKGPSAVNVVAL >NZ_CP014620|1876754:1888139|1885939_1886635_+|WP_023243859.1|DBSCAN-SWA MDSGYWQSQFEDWLRHHHQEQDAAHDIFHFRRVWATAQTLGENVPVDWLVVLSACYFHDIVSLAKNHPQRHRSSILAAAETRRIFLRDFPDFPAEKLAGICHAIEAHSFSAKIAPTTPEAKIVQDADRLEALGAIGLARVFAVSGALGVALFDADDPFADRRPLNDKQFALDHFQTKLLKLPLTMQTERGKYLAQRNADFLVSYMAKLSAELKGDYETRDEAVIQMFATHQ >NZ_CP014620|1876754:1888139|1876754_1878017_-|WP_023244267.1|integrase|DBSCAN-SWA MSLNDSRIRNLKPSAKPFKVSDSHGLYLLVNPGGSRIWYLKYRFNGKESRIGLGAYPLVSLADARQQRDVFRKQLAQNINPVQQRMEEKAASSPEKCFKAVALGWHKANKKWSEHHASRILASMSNHIFPVIGNMRVTDLKTPHFTALLKGIEEKGLLEVASRTRQHLNNIMRYAVQQGLTESNPAQYLEGVTAAPVRKHYPALPPERLPELLARIDGYTKGREITRLAVELTLHLFIRSSELRFARWGEIDFKNKIWTLPATREEIDKVRYSGRGAKMRTPHIVPLSRQTIDILKRIREISGHLELVFPGDHNPYKPMCENTINKALRLMGYDTQADVCGHGFRAIACSALVESELWSRDAVERQMSHQERNSVRAAYIHRAEHLDARKAMMQWWSDYLDTCREGFIPPYIYARQNKAA >NZ_CP014620|1876754:1888139|1886708_1888139_+|WP_023243858.1|DBSCAN-SWA MQENISVTHARNLIADDAGSEIQAMLSQLLEIYDVKTLVAHLNGLGEQHWSPAIFRRVMMNAAWHRLSDNELSCLKTELPTPPAHHPHYAFRFIDLFAGIGGIRRGFEAIGGQCVFTSEWNKHAVRTYKANYFCDPLQHRFNEDIRDITLSHREGVSDDEAAEHIRQHIPQHDVLLAGFPCQPFSLAGVSKKNALGRAHGFACETQGTLFFDVVRIIDARRPALFVLENVKNLKSHDQGNTFRIIMQTLDELGYDVADAADNGPDDPKIIDGQHFLPQHRERIVLVGFRRDLNLKTDFTLRNIARCYPPRRPTLAELLEPVVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPDNPESVARTLSARYYKDGAEILIDRGWDMAKGEVNFDDAGNQQHRPRRLTPRECARLMGFEAPQTYQFRIPVSDTQAYRQFGNSVVVPVFAAVAKLLEPKIHQAVTLRQRETVDGGRSR >NZ_CP014620|1876754:1888139|1878662_1878953_-|WP_023243861.1|DBSCAN-SWA MPDQRKRPSSSLTGLASIDSKYWQTGYRFNGKQKVFSIGVYPAVSLADARQRRDEVKRLLDQGIDPNAKNRLMKKSFRKSAIKPARSVSSPKADAP |
12 | Stenotrophomonas_phage(25.0%) | integrase | attL 1862642:1862657|attR 1878179:1878194 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1992459 : 2000222
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP014620|1992459:2000222|DBSCAN-SWA TTTATTTAACTTTGGGTTCGTAAGGGAGTCGGGATAGTGATACGGAAGCCAGACGATGGGCAATCAGCCGCTCACGAAACCAGGCGCGTAGATGCTCTGGCTGCTCACGTTCGACCGCCTCCGCGACAACTGGCATATTGTAGCGCTCTTTAAATGCCACCCCGGCTGCCGCCAGGTCCACATTGACTTTATCCATTTCCGCTTGTTCAAGCTGAGCCAGATTCGTTTTCATACGCGTCTCCTTTTTTGTTGCCGCCAGAAGGTACTAAGAGTCCGCGGTAATGGCAAGATGGAAATGATGATCCTCGACTCATTGATTGTTGAAGGTTATTCTTTGTAACGTAGGCATAACAAAAAGGAATAGTGAATATGGCTTCTTCTGCACCATCGCGACGTTTAGCTTTACTGCTGTTGGCATCGACATTTGCGACGCCAGCGGCCTGGGCACATGCGCACCTGACGCATCAGTATCCAGCGGCGAATGCTGCCGTTACGGCCTCGCCACAGGCGCTGACCCTGAACTTTTCTGAAGGGATTGAGCCAGGGTTCAGCGGCGCAACCATTACTGGCCCTCAGCAAGAGCTCATCAAAACGCGCCCGGCAAAGCGAAATGAACAGGATAAAACGCAGTTGATTATCCCGCTTGAGCAGCCGTTAAAATCTGGCGCTTACACGGTAGACTGGCACGTTGTGTCGGTGGATGGACATAAAACAAAAGGGAAATACACCTTCAGCGTGAAATAAATGATGCTGACATTCGTCTGGATAACTCTCCGATTTATTCATTTTGCTAGTGTGATGCTGGTCTACGGCTGCGCGCTTTACGGCGCCTGGCTGGCACCCGCATCAATTCGTCGTTTAATGACGCGTCGATTTTTACATCTGCAACGACATGCCGCCGCCTGGAGCGTTATCAGCGCGGCTTTTATGCTGGCGATTCAGGGCGGACTGATGGGCGGCGGCTGGCCCGATGTTTTTTCCGTCTCGGTGTGGGGCGCGGTACTGCAAACCCGCTTTGGTGCGGTCTGGATATGGCAAATTATCCTCGCGCTGGTCACGCTGGCGGTGGTAGTCATTGCGCCGGTAAAAATGCAACGACGGCTTCTTATTCTAACCGTTGCTCAGTTTATCCTGCTGGCAGGCGTTGGACATGCGACGATGCGCGACGGTGTAGCGGGAACATTACAGCAGATTAACCATGCTCTGCATTTACTCTGTGCCGCTGCCTGGTTTGGTGGGTTGTTGCCAGTGGTTTATTGTATGCGCATGGCTCAGGGACGCTGGCGTCAACATGCTATTAGCGCCATGATGCGTTTTTCTCGTTATGGTCACTTTTTTGTGGCGGGCGTATTGCTCACAGGCATTGGCAACACGCTATTTATCACGGGACTTACCGCTATCTGGCAGACCACCTATGGACAGTTGCTTTTGTTAAAATGTGCGCTGGTGGTGCTTATGGTAGCAATTGCGCTGACGAATCGGTATGTTCTCGTACCACGTATGCGACAGGAAAATCCCCGGACTGACCTATGGTTTGTCAGGATGACGCAAATTGAATGGGGAGTTGGAGGCATAGTTCTGGCGATCGTCAGCCTGTTTGCAACCCTCGAACCTTTTTGATGGACTGGCATAACGAATGAAAAAAATACTCCTTCCGGCGCTTCTGCTGGCCACTTCGGGCGTAGCGTTGGCGGCGCCGCAGGTGATTACCGTAAGTCGTTTTGAAGTAGGAAAAGACAAGTGGGCGTTTAATCGGGAAGAGGTCATGTTGACCTGTCGGCCTGGCCAGGCGCTCTATGTGATCAACCCCAGTACGCTGGTGCAGTATCCCTTGAATGCCATTGCCGAACAGCAAGTAGCGGAGGGTAAAACGCGCGCTCAGCCTATTGCCGTCATTCAAATCGATAACCCGGCGAAGCCCGG
TGAGAAAATGAGTCTGGCGCCGTTTATCGAACGTGCGCAAAAGCTTTGTGATCCATCCAATAGCTGACTGATTTTTAATAAAAAACCGTAAACCTTCACGAAAAGGCTTACGGTTTTTTTATCTCTGATAACAGACAAAACGCCAGGTTTTTTCAATCACCTTCGTCGCAAACTGGAAAACCTGGCGTCGTCATCTATTCTTAAAGGGCAAGGCGATTTAGCCTGCATTAATGCCAACTTTTAGCGCACGGCTCTCTCCCAAGAGCCATTTCCCTGGACCGAATACAGGAATCGTATTCGGTCTCTTTTTATTTTGATTATAAATAAGCTACTTACAATTAACGATCCGAAATTTTCCGAATTTCGGTATTCCGGTCTTTTTGGTTATATCACAATCAAATTAAATTTAACATTTATTTCACAACAAAAATTGGAGTATTAGAGCATCATATAAGCTTTATCATCACGCTCATCGAGATAGAGTTTCGTGGTGTTCGCTGATGTGTGGCCCAGGAGTTTTTGGGCGAACACCTCGCCGTGCTCGTTTTTGTACAGCCGCCCGGCCAGACTTCGGATCTCGTGAAATGTCGGTGGATTATTGCTGAAGTTAACGCCGGAGGCTTTTCTTGCTTTTACAAATGTCTTTGTCAGCCCATCCGGGTGAATATTCCCGGTCGGGCTATTTTTCCTGATTCCTGCACTGATCATGAAATCAGTTCTGCTTACCAGCCGGCAGCGATCGATAACCGTTCCCAGACGTAACCCTGGTGCCTCAAGGGTCAGGGATAGGGGAATGGCTATTTTCATTCCGGTTTTAATCTGAGTTACGTATAAGCGGTTGTCAACAACATCACTAAATTTCATATTTACGATATCCTCCCTACGTTGACCAGTAACCAGCGCGAGATCCATCGCAAGAGGAAACCACACAGGCAGATGTTCTGCCGCCGTCCTCGTGGCGTTATATGTTTCCAGTTGCAGGCGTTCCCTGGCAACCTTAATCTCTGGTATCCGGGTTGCTTCCACCGGGTTTTTCACAATATGCCCTTCGACAATAGCCTCTCTGAACATGTCAGATAGAACTGATCTCATTGCTCCCGCCATAGTCTTTTTTCCCTCGGTTATCCACGACTCAAGAAATTTGGCAATGTGCCGGGTTGTTGCTTCTGCCAGTATTATTTCTCCCATTTTTTCGCGTACGGTCGCTAATTGATTACTGCGAATCTTGTAGGTATTAACCGACAGATTCCGGCGCTGTAATAAAACCTCATAGCGATCAATCCATGCGGACACAGTGAATGAGTCCGTTCCTTTTAGTTTTTCAATAAGCGCCACTGGCGTGTGGTTTTGCGCTATGAAGTTGTTTGCCTCTATGGCCTGTGTGATTGCGTCCCTGCGGGCGATCTGACCGAGCGGAAATTCCTTGTCAGTTACCGGGTTACGCCAGAAAAAAGATTTACTGGCCTTACGGTAGGTGAGGTACCTCGGAAGGTTAGCATCGTACTTTTTTCGACTCACTGATCAACTTCTCCAGCAATGCACTCGGTTAACGCCATCACACTCTGGTACATAAATTCGATTTTCCGGCCTTCTTCTGCCGTAAGCACGGTTATTCCTGTCCGGGCGCACTCTTCCAGAAAGGTTTTCTCTTCTTCTTTTCCTGCACTGGTACGGCGGTTAAACTCCGGTGCGATGATGAAGCGTTTACTGAATTCCTCTGGTTCCAGTACCCGGCAGTGAAAAGCCGTTCCTGTATCAAGAGACTTTGTTTTCTCCGTGTCCACGGGGGCATTTTTGCGCCAAAGATAAATTGCTGGTGTATCTACGATATCATCAAGCTGTGATTTACTGACCCCCTGGCCAGCGTGATACGCCTCGTTAGGGATGTCATAGTAAATGCCTGGCTGTATATCATCAGGGACAGTGAAATTTCCGTTTTCTACGGGATCTGCCGCTTCGCCAGCTTCATCACCGCCAGTACCTGATCCACCGTCCGTTGT
AATTTCCTGCCCTGTATCGCCAGCCGTTTCCTGCTGGTTGCTCTCTTTCGGCGTTTCTCCATCTCTTTCTGTTCTGGCTTCCGTTTTTTCGGTCTGGTTTGAGGGGGGCGGGAATAGCGCTGATACATCGAAAGTCCCGTCCGCGTTTCTGGTGACAGCCTCCGGCTCTGCTGCTGGTTGTTTTTCCTCCGGCACCACATCTTCTTTTTCACCCTGATTTGAGGCGCTGTAATTGTTATGAACCCACCTCGGATCGTTCGGGTCGCTGATGTCTTCGACATATTCACCGCGCGCGGCTGCCAGTTGTTTACCAACATCAACCGGGTTTTTGGGTGGAATGTTTTTACGTGCTTCGTGCAGTTCTGCCCGTATTTTCTGGTAGCCTGCTTCTGTCTGGCTTACAGGTGGTTCATTCTCCAGCGGCTGTGGGTCCGGATGATGTTCAGTTGTGTCCTGTTCCACTGCTTCAGGCGTTGCTGGTTCATCTGCCAGTTCGCCTGTCGGTTGCTGTTTTTCTTCATCACACTGAAATCTCCCTGCCTCAATATCCCGCAGACATTTGCCCGCCTGACTAAGCCTTGCTGCATTTTCTTCATGGGTTGTTGGGGTGTTATCAGGCACATATTCGTACCAGTTCGGATCGCGAACGCCATGAACGGCAAGAAAGCTTTCGCACCACGTTCGGCGAAGATCAGGATTACCGGGCTGGAGAATTACACCAGATGTGGCGTTGCACTGAAACTGGATCTAGTGGCGAATCCAGGACAGCTTGAGCTAGAACGTCATGCCGCCCGATCCGCAGCGTGGCTTTTTGTGACTAAAGGGTGTCTGAAATATTCCGGCGACCTGGTACGTGTTACGCAGATCATCAACGGAGGGTAGAACGGCATCGGTGATCGGCGGGAGCGCTTTGAGAAAGCAAAATCGGTGCTGGTATGAATCTGTTATCTGCTCTTCTGAAAAGATACTGGTTGCAGCTGGTGTTTATTTTGCTGATGGCTGGTGCGTTTATCGCCGGTAATGTCTGGAGTGACAGGGGCTGGCAAAAAAAATGGGCAGATCGCGACAGCGCTGAATCCTCTCAGGAAGTCAACGCCCAGACCGCCGCCCGTATTATTGAACAGGGCCGCGTTATTGCCCGTGATGAGGCTGTGAAAGATGCACAAGCGCAAGCCGCTAAATCTGCTGCCACTGCTGCTGGCCTGTCTGCCACTGTTAGCCAGCTGCGTACCGAAGCAAAAAAACTTGCCACCCGCCTGGACGCCGCAAAGCACACCGCAAATCTTGCCGCTGCCGTCAGAAGCAAAACAACCAACGCCGACGCCAGAATGCTTGCCAACATGCTCGGAGATATTGCAGAAGAAGCTAAACATTATGCTGGAATCGCTGACGAGCGCTACCGGGCAGGAATGACGTGTGAACGAGTATATGATTCGGTGAGAGAGTCAAATAATTACAGGAGGCATTGAAACTCCCCCTGTAATATTGCTGTAAAAAAGTGACTACATATCATCAGATGGAACCAGATGAATAAGAACAGGTTTTTCACCAGATGAAACTGATAAGTACTCACTCAGTTTTGATATGGCTGAAATCTGTCTGAATAACCTGTCGGGGTGCTGGAATAACAACTTTCCGGAAATTCTTCTGCAATGGATTTTACTTTTAGTGACCATTCGCCTCCTTATCTGTAGAGGTGGGTAACGAATTTAAAAAGCATTCTGCTTACTTAGGGGGAACATCCTGATGACTGCCTGCAATATTGCAAATTCCATTTTCATTGTATGAACCACCTGAATCAAGGCACTCATCTTCCATCAGGAATTTCTGCGACCACATACCTGCATAAAAAACGATAATAATGGCTACGATAATAGTGATGATATTTTTCATTTATGTTCTCTGTGTGTTGTTATTGAAAATGATAATCAATATCGCAAAATGAAATAAATAATCATTAAGTGGTAGTTGTTGATAATTGTTCGCATTTTAAAAAGGTA
CTCCCGGCGGGGCGGCCTGCCACGGGGCGGCAGCGGCGCGGGATTTGGCGCATTTTTGATTTTTCATGCATCATCATCATGTTGTAACTCTCTGTTTTAATGTAATTTATTTTTAAAAGATGATGGTTTGTATGTTTTTTGTTCATTATATTTTGTTTTTCCGGGGGAGGGCGCGCTAAGAAACAGCCCCAGAGGTAAAAATGGACGGCGAACTGAAGAACCTCAAATGCAATATCTGTCAGCTTGCCGCTATTACAGGGTTACATCGACAGACGGTTGTCAGTCGCCTCTCGGGCGTTCCCCTGGCACCGGGAAGCAATGAAAAAAACAAGCTGTATCTCCTGACGGATGTGATCCGCGTACTGATGGAAACGCCCGTTTCCCAGGCTGCTGAACATCAGGACCCGAATAAAATGACTCCAAAAGAGCGTAAGAACTGGTTTGACTCCGAAAAGGGGCGTTTCTGGCTGGAAAAAGAGATGAAGCAGGTCGTCCCGTTGCCGGAAGTCCGTCAACAAATGGCGGCGATAGTCAAGGCCATTACGCAGGTACTTGAAGTCTGGTCGGATAAACTGGAAAGGGATAAGGGATGGTCTGCGGATCAGCTAAACGAGGCCCAGGATGTGGTGGATGAGGCCAGAATACTGTTAGTTAAGGCAATACAGGAGACCGCAGACGATGACGGGGAATAAATATGGCTCCGCAGCGGCAGTACGCCGGGAGGTTGCTGAATATCTCAGGCCTCCACGCAGAATGCCGGTAGCGGAAGGAATAAAACAATTTATGTTTGTTCCCCGCGGTGCCAATACGGCGGTTCCTTGGGATGACACGTTAGCGTCTCAGTCCTTCCCGAAATGACAACAAAGTCCACGATAAAATGTTTCGTGATGGTTCATTCCTGCAAATTGGCTGGCCGTCCATAACCGTTTTTTCTTCGTCGGATTACAAGCGGGTGGCGCTGACCGACTATGACCGTTTCCCTGAAGATATCGATGGCGAGGGAGATGGTTTTTCCCTGGCATCCAAACGTACCACCACCTTTATGTCTGCGGGGATGACACCGGCAGAGAGTTCGCCTGGTCGGGAAATCACCGATGTGAAATGGCGGCGTTCTTCGCCGCACGAGGCCCCACCCACGACAGGCATTCTTTCTCTTTATAACCGGGGCGATCGCCGTCGGTGGTACTGGCCCTGTCCACACTGCGGCGACTGGTTCCAGTCCGCGATGGAAAACATGGTGGGGTATGGGTGAGGCACAGACCAAAGCCCCGCTGGACAGTCCGGCACTGACCGGTACGCCAACGGCACCAATGCCGGAAACCACAGCTGCAGGTATTGAAATTGCCACGGCAGCGTTTGTGGCTGCGAAAGTGGCGCAGTTGGTTGGTTCTGCGCCGGAAGCGCTGGACACCCTGCAGGAACTGGCTGACGCGTTGGGAAACGATCCGAACTTTGCCATCACGGTACTGAATAAACTGGCGGGCAAGCAGCCGCTGGACGAAACCCTGACGGCGCTGTCAGGAAAAAGCGCTGATGGTTTTATCGAATACGTTGGTTTACGGGAAACGATAAATCACGCCGCCGATGCGTTACATAAATCACAGAACGGTGGCGATATTCCGGAAAAGCCGCTGTTTGTACAAAATATCGGAGCGCTCCCTGCATCAGGTACGGCTGTTGCAGCGAACAGACTGGCATCACGCGGCGGGCTTCCGGCACTGACTGGTACGACAAGAGGCAGTGATAGCGGCCTGATAATGGGCGAGGTTTACAATAACGGTTACCCAACGCAATACGGGAATATTTTGCGTCTGACCGGAACCGGTGATGGAGAGTATTAA
Protein sequences of DBSCAN-SWA_5 >NZ_CP014620|1992459:2000222|1998338_1998527_-|WP_001521334.1|DBSCAN-SWA MNKKHTNHHLLKINYIKTESYNMMMMHEKSKMRQIPRRCRPVAGRPAGSTFLKCEQLSTTTT >NZ_CP014620|1992459:2000222|1998581_1999073_+|WP_000348541.1|DBSCAN-SWA MDGELKNLKCNICQLAAITGLHRQTVVSRLSGVPLAPGSNEKNKLYLLTDVIRVLMETPVSQAAEHQDPNKMTPKERKNWFDSEKGRFWLEKEMKQVVPLPEVRQQMAAIVKAITQVLEVWSDKLERDKGWSADQLNEAQDVVDEARILLVKAIQETADDDGE >NZ_CP014620|1992459:2000222|1994819_1995899_-|WP_020438172.1|integrase|DBSCAN-SWA MSRKKYDANLPRYLTYRKASKSFFWRNPVTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRNLSVNTYKIRSNQLATVREKMGEIILAEATTRHIAKFLESWITEGKKTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRTAAEHLPVWFPLAMDLALVTGQRREDIVNMKFSDVVDNRLYVTQIKTGMKIAIPLSLTLEAPGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSANTTKLYLDERDDKAYMML >NZ_CP014620|1992459:2000222|1993202_1994078_+|WP_072101102.1|DBSCAN-SWA MMLTFVWITLRFIHFASVMLVYGCALYGAWLAPASIRRLMTRRFLHLQRHAAAWSVISAAFMLAIQGGLMGGGWPDVFSVSVWGAVLQTRFGAVWIWQIILALVTLAVVVIAPVKMQRRLLILTVAQFILLAGVGHATMRDGVAGTLQQINHALHLLCAAAWFGGLLPVVYCMRMAQGRWRQHAISAMMRFSRYGHFFVAGVLLTGIGNTLFITGLTAIWQTTYGQLLLLKCALVVLMVAIALTNRYVLVPRMRQENPRTDLWFVRMTQIEWGVGGIVLAIVSLFATLEPF >NZ_CP014620|1992459:2000222|1997316_1997850_+|WP_001050883.1|DBSCAN-SWA MNLLSALLKRYWLQLVFILLMAGAFIAGNVWSDRGWQKKWADRDSAESSQEVNAQTAARIIEQGRVIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTEAKKLATRLDAAKHTANLAAAVRSKTTNADARMLANMLGDIAEEAKHYAGIADERYRAGMTCERVYDSVRESNNYRRH >NZ_CP014620|1992459:2000222|1992827_1993202_+|WP_000168393.1|DBSCAN-SWA MASSAPSRRLALLLLASTFATPAAWAHAHLTHQYPAANAAVTASPQALTLNFSEGIEPGFSGATITGPQQELIKTRPAKRNEQDKTQLIIPLEQPLKSGAYTVDWHVVSVDGHKTKGKYTFSVK >NZ_CP014620|1992459:2000222|1994094_1994448_+|WP_000722368.1|DBSCAN-SWA MKKILLPALLLATSGVALAAPQVITVSRFEVGKDKWAFNREEVMLTCRPGQALYVINPSTLVQYPLNAIAEQQVAEGKTRAQPIAVIQIDNPAKPGEKMSLAPFIERAQKLCDPSNS >NZ_CP014620|1992459:2000222|1992459_1992690_-|WP_000856224.1|DBSCAN-SWA 
MKTNLAQLEQAEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRAWFRERLIAHRLASVSLSRLPYEPKVK >NZ_CP014620|1992459:2000222|1998106_1998274_-|WP_000789530.1|DBSCAN-SWA MKNIITIIVAIIIVFYAGMWSQKFLMEDECLDSGGSYNENGICNIAGSHQDVPPK >NZ_CP014620|1992459:2000222|1999625_2000222_+|WP_023244117.1|DBSCAN-SWA MGEAQTKAPLDSPALTGTPTAPMPETTAAGIEIATAAFVAAKVAQLVGSAPEALDTLQELADALGNDPNFAITVLNKLAGKQPLDETLTALSGKSADGFIEYVGLRETINHAADALHKSQNGGDIPEKPLFVQNIGALPASGTAVAANRLASRGGLPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEY >NZ_CP014620|1992459:2000222|1997032_1997263_+|WP_001013467.1|DBSCAN-SWA MNGKKAFAPRSAKIRITGLENYTRCGVALKLDLVANPGQLELERHAARSAAWLFVTKGCLKYSGDLVRVTQIINGG >NZ_CP014620|1992459:2000222|1995895_1997002_-|WP_023244250.1|DBSCAN-SWA MPDNTPTTHEENAARLSQAGKCLRDIEAGRFQCDEEKQQPTGELADEPATPEAVEQDTTEHHPDPQPLENEPPVSQTEAGYQKIRAELHEARKNIPPKNPVDVGKQLAAARGEYVEDISDPNDPRWVHNNYSASNQGEKEDVVPEEKQPAAEPEAVTRNADGTFDVSALFPPPSNQTEKTEARTERDGETPKESNQQETAGDTGQEITTDGGSGTGGDEAGEAADPVENGNFTVPDDIQPGIYYDIPNEAYHAGQGVSKSQLDDIVDTPAIYLWRKNAPVDTEKTKSLDTGTAFHCRVLEPEEFSKRFIIAPEFNRRTSAGKEEEKTFLEECARTGITVLTAEEGRKIEFMYQSVMALTECIAGEVDQ |
12 | Enterobacteria_phage(28.57%) | integrase | attL 1994669:1994691|attR 2006792:2006814 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2915459 : 2924541
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP014620|2915459:2924541|DBSCAN-SWA TTTATGCTGCCTTATCGACAGGTTCATATCCGCAATAATCGGTCCAGTCATCCATCAGCGCAACGCGTTTAGGCCACAGGGTTCCACGCTGGTAGGCTGCTTCTGCTTTATCCGCCAACTGATGGGCCAGCGCATGTTCAATCACTTCCCGCTGGTAGGATGTGGTCTCCCCGGCCCATTCCCTAAACGTGGAGCGAAAACCATGCTGGGTTAAATCGCTGCGGCCCATGCGTTTTAATACCGCTGTTAATGACATATCCGACAGTTGCCCGCCGCGCGGAGCAGGGAAAACAAGGTTATTGTCCTTAAACCGGGGCAGCATTTTGAGTAATGCCACCGCCGCATCAGACAACGGAACACGATGTTCTTTACTGGCCTTCATCCTTTCTGCCGGAATAGTCCATGTTTTTGCCGCCAGATCGATTTCATCCCACACCGCACCACGAATTTCACCAGAACGGGCCACCGTCAGTATGGAAAATTCCAGTGCTCTGGCCGAGACACCTGTCCGGGTACGGAGTTCAGCCATAAACGCGCCAAGTTCACCATACGGCAATGCAGCATGATGTTTTTTGTTCTGCACCTTACTCGGCATTGGCAGCAACGGCTTCAGCATCCCCTTCCACGCCGCGGGGTTATCACCTTCAAGGTATTCTTTCGCTTTAGCGTAATCCAGCACAGTTTCAATACGACCACGCACACGGCTGGCGGTTTCGTTTTTGGTTAACCAGATAGGTTCCAGTATTGCCAGCAGATCGGCCTTGGTAATCTCACCCACTCTTTTCTGACCAATAACCGGATAAGCATAGGTCTCCAGTGAAGAGCGCCACTGTGCGACATGCTTTTTGCTCTTCAGTTCGCGCCCCTTAATTTCCAGCACGGCTTCCGCACATGCGTGGAACGTCTTCTGTTTACGGGCTGTTTTTTCCTGATGTGCTTTTTGGGCATGTTTTTCTTTGAGCGGATCAATGCCATTACGGATCTGCCTGCGCAGTTCACGCGCCTTATCACGCGCTTCTGCCAGTGAAACTTCCTGATAAGGGCCAAGACCCATATTCAGTCGCCGGGGAACAGTCTTACCTGCCCTGTTGATTCGGGTTCCCATCGCAACGCAAAGGACCCATGCACGTGAGCGCCCGGCGATCCGCAGATACAATCCGTCAACACCACCCACTGCATAGCGACCTTCCGCTTTAAGTCTGGAAACGGCTAAGGCGGACAGTTCTCTGGCTTTCTTTGGCATAATCCCCACTCTTCTGTAATGCATCCTGTACTGCATAAAATGACGGATTTACAAAAAGATAGCAATACACCAGCGAACAAAGAATAATGACAACACATTGTATATAAAGAAATAATTATGATTAACGCGAATGATAATGAACGTAAATATGGCAGCCCACCTCACCGCCATATTCAAGTTGAGAGCCCGTACAGAAGTACGGGCTTTTTGCTTATATGTATCCCCCAAACCAAGGGGGGATGAGAACACCCGACCGGGGTTCGACAACTGGCGCAGCCAGTTGGACAGACCGGGCGCGCAGCGAACGGGCTGCCCCGAAGGGGCGAGCAAAGCGAGTCAATCCCCCCCTCACCGCCATATTCAAGAAAGAGCTCGTACGAAAGTACGGGCTTTTGTATCTAAACCTATTGTTTTTACTGTGATTTTCCTGATGAAGGTTCGTTTTTTGACCTTTGGTTGCTACACTAAACTTACTATAGATTGTGTAAAACAGCCCCACTCCCCTACCTGAAACGCTTTAGAAAAGTTTGCATAAGTATCTCTCGGTAGTAAAAAAGCACCGAGTTCCTCTGTCTGATGCTGCCGTTGCTTTATTAGAAGGCTTACCGCGTTTGAAAAATAACAATCATGTATTCCCTGCCCCTCGCGCTGAAACACTTTCTGATATGTCGTTATTGGCT
GTATTGAAGCGAATGGAATATACCAACTTAACGCAGCATGGCTTCCGTTCTACTTTCCATGAGTGGGCTGGTGAAACAACGGACTATCAACGTGAGGTTATTGAACATGCGTTGGCGCGCCAGTTGGTAGATAAGGCTGAAGCAGCGTATCAGCGTGGGACGTTATGGCCTAAACGGGTGGCGTTGATGGATGATTGGACGGGGTATAGCACTGCCAACAGCTAAGCTACCTGTACGAAAGCATTATCGTTGATAACAACGTAGAAAGTGTGATGCTAATAGCATTCGCTTTCGAAAATGTGATAAGCAATAATTTCATAATGAACTATTTCTTATACAATTATTATCATGGTTTGCAAATTACATAAACCACTCAAGGAGAGGTTATGCCCGGACTGATAGGCTACTGGAAGCAACTTCCAACCAAAGATGAATATATTAAAAAACACAATATGAGTAAAATATCCTGCTACAGTTGTGGTCACGAGAAATTCAGCGATGTTGGTTTGATACAGGTATGGGATAATCACAGAAGAATTCTTTGTGCTAAGTGTAAGACTACTCTTTTCAGAGAAGAGGATTAGTTTTTTTGGCATTGGTAACAGCGGCTTCAGCATCCCTTTTCACGCAGCGGATCGGGCTTTTTTTTCGCATTTGACCCGTCGATTACCGGATGATGACGCAATTTACAAGCGCCTTGTCCGCCTACCGCGAGCACAACGCCATCAGGCTAACTATTAGCCGGCGTAAAAAAACCGGGCGCTAAGGCCCGGTTTGTACGGCAGTGAAACGAAGATTAATGCGCGGCTTCCGGCTTGTGCTTTTGCGCACTCTGGAAGCCATACGTCAACGCATTTTTCTCTTTATCCAGCGCGACGGTGACCTGTCCGCCATCAACCAGCGATCCAAACAGCAACTCATTGGCCAGCGGTTTTTTCAGGTTATCCTGAATCACACGTGCCATTGGTCGTGCGCCCATCGCCCGGTCATAGCCCTTTTCCGCCAGCCAGTCGCGCGCTTCCTGACTGACTTCCAGAGAGACGCCTTTCTGATCCAACTGAGCCTGCAACTCGACGATAAACTTATCGACAACCTGATGAATCACCTCGCCAGACAGATGATCGAACCAAATAATGTTGTCGAGACGGTTACGGAACTCCGGCGTAAACACTTTCTTGATCTCGCCCATCGCATCGGTACTGTTGTCCTGATGAATAAGACCAATAGATTTACGTTCGGTTTCTCGCACGCCGGCGTTGGTGGTCATCACCAGCACCACGTTGCGGAAATCCGCCTTACGGCCATTGTTATCGGTCAGCGTACCGTTATCCATCACCTGCAGCAGCAGGTTAAAGACATCCGGGTGCGCTTTTTCGATCTCATCCAGCAACAGCACCGCATGAGGATGCTTAATCACCGCATCCGTCAGCAGCCCGCCCTGGTCGAAACCGACGTATCCCGGAGGCGCGCCGATCAAACGGCTCACCGTATGACGCTCCATATATTCGGACATATCGAAGCGCAACAGCTCAATACCCAGCGCTTTTGAAAGCTGTACCGTAACTTCAGTTTTCCCTACGCCAGTTGGCCCGGCGAACAAGAATGAGCCGACAGGTTTATGCTCATGGCCCAGACCGGCACGACTCATCTTAATAGCTTCGGTCAGCGCCTCAATCGCGTTATCCTGGCCGAAGACCAGCATTTTCAGACGATCGCCCAGGTTCTTCAGCGTATCGCGATCGCTCTGCGAGACGCTCTTTTCAGGAATTCGCGCAATTCGCGCCACTACGGACTCAATATCCGCCACGTTGACCGTTTTCTTACGTTTGCTCACCGGCATCAGACGCGCCCGAGCGCCCGCTTCGTCAATCACGTCAATGGCTTTATCCGGCAGATGGCGGTCATTGATATATTTTACCGCCAACTCGACCGCCGCACGCACCGCTTTCGCGGTATAACGCACGTCGTGGTGCGCTTCGTACTTAGGTTTCAAGCCGTTG
ATAATTTGCACCGTCTCTTCCACCGAAGGCTCGGTAATATCAATTTTCTGGAAACGGCGCGCTAATGCACGGTCTTTCTCAAAAATATTGCTGAATTCCTGATAGGTCGTTGAGCCGATCACCCGGATCTTGCCGCTGGAAAGCAGCGGTTTAATCAGATTTGCCGCATCCACCTGTCCGCCCGACGCCGCGCCAGCGCCGATAATGGTATGGATTTCATCGATAAACAGGATGCTGTTGGTATCCTGCTCAAGCTGTTTCAGCAACGCCTTAAACCGTTTTTCAAAATCGCCACGGTATTTGGTGCCCGCCAGCAGCGAACCGATATCCAGAGAGTAAATGGTGCAATCGGCCATCACTTCCGGCACATCGCCCTGCACGATACGCCAGGCCAGCCCTTCGGCAATCGCCGTTTTGCCAACACCGGATTCCCCTACCAGCAACGGGTTATTTTTACGGCGACGACACAAGACCTGGATCGCGCGTTCCAGTTCTTTTTCACGACCAATCAGCGGATCGATGCCGCCCACGCGAGCAAGTTGGTTAAGATTCGTCGTGAAGTTTTCCATACGTTCCTCCCCGCCAGCTTGTTCGTCGCCAGTTGGCTGATTGCCGAGATCGGAAGATTGGCTCGGTTCGTCTTTTCGCGTCCCGTGAGAAATAAAGTTCACGATATCCAGACGGCTCACTTCATGCTTACGCAGCAGATAAGCCGCCTGTGATTCCTGTTCGCTAAAGATAGCCACCAGCACATTCGCGCCAGTCACTTCACTACGCCCGGAAGACTGAACGTGGAAGACGGCACGTTGCAGGACACGCTGGAAACTTAACGTCGGCTGCGTATCACGCTCTTCTTCACTGGCAGGCAGTACGGGTGTGGTTTGTTCAATGAAGGCTTCGAGTTCCTGACGGAGCGCCACCAGATCCACGGAGCATGCTTCCAGCGCTTCGCGAGCCGATGGGTTGCTGAGCAGCGCCAGCAACAGATGCTCGACGGTCATAAACTCATGACGGTGCTCGCGCGCTCTGGCGAAAGCCATGTTTAAACTGAGTTCCAGTTCTTGATTGAGCATAGGCACCTCCCCCAATTTTTATACCTGCATTCAGGCTTTTTCCAGCGTACACAGCAACGGATGCTCGTTCTCCCTTGCATACTTGTTCACCATCGCCACTTTGGTTTCCGCCACCTCGGCGGTGAACACGCCGCAGATGGCTTTGCCTTGATAGTGAACTGCAAGCATCAATTGCGTTGCACGTTCTACATCATAAGAAAAGAATTTTTGTAACACGTCAATAACAAACTCCATCGGAGTGTAGTCATCATTGACTAATATCACTTTATACATAGATGGCGGTTTTAGCGCGTCGCGCACGCTATCTTCCACCAACTGGTCAAAATCCAGCCAATCGTTCGTCTTACCCATTGTCAGTCGTCATTATCGGTTACGGTTGTCGGCAGAAAAATCTGCCGCTGACCAGAGTCTATGCACACAATCAATCTACCTCAATTGATAGATAACTAACATCTATCAGTACCATCCGCGACATCTGTCACATTCCCGGCAATAGCGTTAACTGCTTCAAATTTTTGATTCATTTTTACCCGATCCCCCCTGCCTGATGCTTGACGCCTCGCCTGATTTCTCTAAATTGTAATGTCGAGAGTTGGTGAGGTTTTGAACAGCCCCCACTCCGTCACCGGTTCATTCCATCTTACTTATATAAGATTTACGAAGGATGTCGAAGCATGGAAACGGGTACTGTAAAGTGGTTCAACAATGCCAAAGGGTTTGGTTTCATCTGCCCTGAAGGCGGCGGCGAGGATATTTTCGCCCATTATTCCACCATTCAAATGGATGGTTACAGAACGCTTAAAGCCGGACAGTCTGTCCGGTTTGATGTCCACCAGGGGCCAAAAGGCAATCACGCCAGCGTCATCGTGCCCATCGAAGCAGAGGCCGTTGCATAGCTCCTCTGTCTCATTGTGTACATCCAGGAGG
CAAAATGCCAGCCCGATCGGCTGGCATTTTTATTTAACGCCAGTGCCTGGTGGCAACACTGTTGCATCTTATCAGGCCGACAAATGACGTCAGCAAGATTACTCCCTTGCCAGCGCATCCACCGGGTCCAGTCGCGCCGCGTTTCTCGCCGGTAGCCAGCCAAACAGTATCCCGGTAAATGTCGAACATAAAAACGCGCTCGCCAGCGCAGTCAGTGAAAAACCGATCTCCCAGCCGGGCAGGAAAAGCTGTAGCATAAATGCGATGAACATCGACAAGCTAATCCCCAGCGCTCCACCAACCAGGCAAACCAGCACCGCTTCAATAAGAAACTGCTGTAGCACATCGCTGGCGCGCGCGCCTACCGCCATACGGATGCCGATTTCACGCGTTCGCTCGGTGACGGAAACCAGCATAATATTCATAACACCGATGCCGCCGACAACCAGCGAAATGACGGCCACCAGCGTCAGAAATAACTGAAGAGTATAGGTGGTTTTTTCAGCCGTTTTCAGGACGCTGTCCATATTCCAGGTGAAGAAGTCTTTTTTACCGTGGCGTAAGGTGAGCAGGCGGGTAAGCTGCTGTTCAGCCTGATCGCTATCAACGCCATCTTTCACACGAACGGTGATCGAGTTAAGCCATGACTGACCCATTATGCGATCTGACATCGTGCTATAGGGCAACCAAATTTGCAACAGATTGCTATTGCCGTACATGGACGGTTTCTCTTCCGCCACGCCAATAACAATAACCGGCATATTACCCACCAGCACCACTTCCCCTACGACATTCGCTTTATTTGGAAATAGCTGGCGTCGCGTGTTGGCATCCAGCACCACCACCTGCGCACGATCCTGTTGCTGTACAGCATTGAAGGTGTTCCCCTCCCTAAAGGACATGCCGTAAACGTTAAAATAATCGCCACTGACGCCATTAGCATTTACGGCAATATCAATATTGCCATAGCGAAGACGTAAGCTCTTTGAAACACTGGGCGTCGCAGAGTTAACCCACGGCTGTTTCTGAATAGCCACCAGATCGTCATATTTCAGCGCCTGTCGATACTGCGGGTTGTCGTCGCCGAAATCTTTGCCTGGATGAATATCAATCGTGTTAGTGCCCATAGCGCGGATATCCGCCAGTACCATCTGTTTTGCGGCGTCGCCGACCACCACAATCGACACCACCGACGCAATACCGATAATAATTCCCAGCATGGTCAGTAAAGTACGCATTTTGTTAGCGGCCATCGCTAACCACGCCATTGACAGCGCTTCGCGAAAGCTGCTGGCAAATTGCCGCCAGCCGGGAGCCGTATTAACTACGGCAGCGTCAACGCCCTGTTCGCGTTTCTTTTCCTGCGCGGGCGGATTATGGACAATCTTGCCATCGTGAATTTCAATAATCCGCTCCGCCTGGGCGGCAATCAGCGGATCGTGCGTCACAATGATCACCGTATGTCCGCGATCGCGCAGTTGGCGCAAAATCGCCATCACCTCTTCGCCGGAATGGCTATCCAGCGCGCCGGTCGGCTCATCCGCCAGAATCACCTGTCCGCCGTTCATCAGCGCGCGGGCAATACTGACACGCTGCTGCTGTCCGCCAGAAAGCTGTGAAGGCGGGTAATCGACGCGATCGCTTAATCCCAGCCGCAGAAGTAACTCTCTGGCGCGCGCCTGGCGTTTTTTGCGTTCAATGCCGGCGTAGACGGCGGGGATTTCAACATTTTGCGCTGCCGTTAAATGCGACAACAGATGGTAGCGCTGAAAGATAAAGCCAAAATGCTCACGCCGCAGCTGCGCCAGCGCGTCCGGGTCCAGCGTCGAGACGTCCCGCCCCGCCACCCGATAAGTGCCGCTGGTCGGTTTATCCAGGCACCCGAGGATATTCATCAGCGTTGATTTTCCAGAACCGGAAACGCCGACGATCGCCACCATCTCCCCGGCGTGGATTTGCAGGGAGATATCTTTCAACACCGCCACCTGCTCTTCTCCGGA
GGGGTAGCTGCGACTCACATTGCGCAGTTCAAGCAATGCCGTCATGGCGTCGCTCCTGGCCTGCTCTCACCGATGATCACCTCATCGCCCGCTTCCAGACCTTTAACCACTTCTACGTCTGTATCGTTACGCTCGCCAATGACCACTTCGCGCTCACGTTTTTCACCGTTACGCAACAGCGCCACTTTATAACGATTGCCGCCCACCGGTTCGCCAAGCGCGGCGAGAGGAATAATCAGCACATTTTTGACATCCATGAGTTGAATATAAACCTGTGCGGTCATATCAAGACGCAAGATTCTTTTGGGATTCGGCACTTCAAACCGGGCGTAATAAAAAATAGCGTCGTTGATCTTTTCCGGCGTCGGCAGAATATCTTTTAAAACGCCTTCATAGCGCGTTTGCGGATCGCCTGCAATGGTGAACCATGCTTTCTGCCCCGCCCGAAGATGGATCACGTCCGCTTCCGAGACCTGCGCTTTTACCAGCATAGTGCTCATATCCGCCAGCGTCAGAATATTGGGCGCCTGCTGAGCTGCAATCACCGTTTGTCCTTGCAGGGTAGTGATTTGCGTCACTTCCCCCGCCATGGGAGCAACAATACGGGTATATTCCAGGTTGGTTTTCGCGGTGTCCAACGAGGCCCGATTACGTTTGATCTGGGCATCTATGGTGCCAATACGCGCCTGTTTAACCGCCATCTCCGTCGCCGCGGTATCCAGATCCTGTTGCGATACCGCCTGAGTCTTAGCTAACTGCTGCTGGCGCGCCAGCGTAACCCGCGCCAGCTTTAACTCAGCCGCTGCCTGCTGACGCTCCGCGTTCAGCTCCATCAGGGTGGCCTCGACCTCTTTTATCTGGTTCTCCGCCTGATCTGGGTCAATCACGCCGAGTAGCTGATCTTTTTTAACGTTATCGCCAATGGAGACCAGCAGCGTTTTCAACTGGCCGCTCACCTGCGCGCCGACATCCACTTTACGCAACGCGTCCAGTTTTCCAGTCGCCAGTACACTCTGTTCAAGATCGCCTGGCCGCACGATTAATGTCTGGTAAGTTGGCAGCGGCGCATTTAGCATTCGCCAGCCAGCCATCCCCCCCACTAAAAGAATTAAAATAATGACCAGATAACGCTTTTTAAATTTCTTTCCCTTAGCACGCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP014620|2915459:2924541|2915459_2916701_-|WP_024155556.1|integrase|DBSCAN-SWA MPKKARELSALAVSRLKAEGRYAVGGVDGLYLRIAGRSRAWVLCVAMGTRINRAGKTVPRRLNMGLGPYQEVSLAEARDKARELRRQIRNGIDPLKEKHAQKAHQEKTARKQKTFHACAEAVLEIKGRELKSKKHVAQWRSSLETYAYPVIGQKRVGEITKADLLAILEPIWLTKNETASRVRGRIETVLDYAKAKEYLEGDNPAAWKGMLKPLLPMPSKVQNKKHHAALPYGELGAFMAELRTRTGVSARALEFSILTVARSGEIRGAVWDEIDLAAKTWTIPAERMKASKEHRVPLSDAAVALLKMLPRFKDNNLVFPAPRGGQLSDMSLTAVLKRMGRSDLTQHGFRSTFREWAGETTSYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTDYCGYEPVDKAA >NZ_CP014620|2915459:2924541|2921479_2923426_-|WP_000125893.1|DBSCAN-SWA MTALLELRNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAQEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQPWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNAVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQIWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP014620|2915459:2924541|2923422_2924541_-|WP_023202044.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMLNAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP >NZ_CP014620|2915459:2924541|2918177_2920454_-|WP_000934064.1|protease|DBSCAN-SWA 
MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYGFQSAQKHKPEAAH >NZ_CP014620|2915459:2924541|2917228_2917606_+|WP_023243338.1|integrase|DBSCAN-SWA MHKYLSVVKKHRVPLSDAAVALLEGLPRLKNNNHVFPAPRAETLSDMSLLAVLKRMEYTNLTQHGFRSTFHEWAGETTDYQREVIEHALARQLVDKAEAAYQRGTLWPKRVALMDDWTGYSTANS >NZ_CP014620|2915459:2924541|2917767_2917965_+|WP_001117984.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >NZ_CP014620|2915459:2924541|2920484_2920805_-|WP_000520789.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >NZ_CP014620|2915459:2924541|2921128_2921350_+|WP_000447499.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA |
8 | Ralstonia_phage(16.67%) | integrase,protease | attL 2913852:2913864|attR 2933038:2933050 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
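
The keyword rule in the legend above can be sketched as a simple annotation filter. This is a minimal illustration, not the tool's actual implementation: the keyword list is copied from the legend, while the case-insensitive substring matching and the function names `matched_keywords` and `is_phage_related` are assumptions made for this example.

```python
# Keywords copied from the legend; the matching rule itself is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation):
    """Return the legend keywords found in an annotation string.

    Matching is a naive case-insensitive substring test, so 'lysin'
    would also match 'lysine'; a real pipeline may be stricter.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_phage_related(annotation):
    """True if the annotation contains at least one legend keyword."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Annotation strings as they appear in the summary rows of this report.
    for label in ("integrase,protease", "capsid,integrase", "hypothetical protein"):
        print(label, "->", matched_keywords(label))
```

Substring matching keeps the sketch short; word-boundary matching would avoid false hits such as 'lysine' at the cost of a slightly longer rule.
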
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
4007613 : 4016561
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP014620|4007613:4016561|DBSCAN-SWA GTTATGTTGCTGCGGGGTCGTCGCACTTCGGCAGCCAGTCGCCGTAGCTTTCCTCTTTCAGCGACAGATTGGTTTGTATCCCCTGTTTGGTGTGCCGCTTCTCATAATTCAGGCCGTACTCTTTCAGCATCATGGGCAGCCCCAGCCCGAACATTTTCAGGCTGAGCACGTTCCTGTACCCGTTAGCCTCCATATAGGCCAGATACGCGTGATAGAGATATTTACGATAATTACGCGGGATGATGCTGGCATTCCCCATAAACATCCCGTTGGTCTGCGGGAGCATTTCCAGATAACCGCAAAAATCAAACGTCGGATCAGCATCGCGCTTAATGCTGAGCGCCTCGTCGGAGTTCTGCTGCGACTGGAGCAGTGCGCGGGCGGTCATAGGGGCGCTGAATTTCTGCATAAGCTGGCGCACAATCACGGCCAGCTCGCGCGCAATTTTATCCCTGAGCTGCGGGTCGCGTTCCTCCGGGGCAATCTGTTCCGGGAAGTGAATAATCACCCGGCGACGTGACACACCGCCGCTGCGGTCGGTGAAGCGCATGGGATTATTGTTCACGGCCAGAATCACCGCCGGAATATGCGTGGAATAGGGGTTCTGGTATTTCGGGTCAACCGAGACCGCATCGCCGCCGGTGATGGCCTTAAGCCCTGCGCCGTCACCGCTCCATTTTTCCTGGTCAGGCAGACGAATAAGCGAGAAGCCAATCAGGGAGGCACGCTTGCGCGGGTCTTCCAGCGTGTCGATATCGGCTGACGTGGCGTTATCCTCTCCGGCGAGCAGGGTCGCGATTTCAGCCAGAATACTTTTTCCACTCCCGCCGGGACCGGTCACTTCGAGAAAGAGCTGCCAGTCGTAACGGTTCGCCAGCACCATAAACAGCGCAGCCAGTATCACATCGCGTTTTTGTGGATTTTTGCCAGCCGCACGGTCGAGCCAGCGCCAGAAGTTCGGCGCGTGAGTCTCCAGCGTTTCCCCGTCCACCGGCGGGGTGAAATCCACGTCGCACAACGTGCGCAGCCAGTGTGATTTACTGTGCGGGCTGAACAGGCCGCTTTGGGTATCGAGTACCCCGTTGCGAAAGCCAATCAGACGACGCGCCGGAGTATCCTGCTGCGGAATAATCAGTTTCAGGGTCTCCACCACCGAGGCAATTTTCCCCGATGAGAACGGGGCGCGCAGACGCTGGAATAAGTCAGCCACATTCCGTGAAAAAGTGGCGGCAGGGATATTTTTCCAGATGCCGTTTTCATAGCGGGACAGGAGCTGGCCGTTCGCATCCACCGCCAGCGCTTCGCCGTAATGCTCATGCACCCGCAAGGCCTTGTCGCTGGCGCTCATGGCGGTAAATTCTGCCTCGCTCATGGTATCAAACGGACTTTGCGCCGGTGGCCGGATTGCGTCATATATCGCTTTCCGCGTGGCCTCCCCGCCGTGCTGCTTAAACGTATCATTCCAGTCACCGAACACCGGAGGCAGGGCAACAACGCCCTCACAGGCGTCTGCGGCCGCAGCGGCTTTGTTCTGGCCGTCGCCGTTAAGGTCACGGTCGGCGGCGAGCACAATCTGACAGGCCGGGTGTTTCTGACGGGCAAGACTCGCCAGAGAAAGGAGGTTCACGGACGAGAGCGCCACCATGACGGTTTCGCCGGTCAGGTGATGCACGGTGAGCGCTGTCGCATAGCCCTCCGCAATCCACAGGCGTTTTCCGGCCTGTTTTTTCCCTTCGATGATATGACATGCCCCTTTGACCTGACCGCCTTTCAGGGTGCGCTTGAGACCGTCAGCATTGATAAGCTGAAGGTTAACCAGTGTGCCGGTATCCTCATACAGCGGGACAACCACATCACCGGCGCGGAACGTCACGCCGCCGGTTTTATGCATGACGGTGAGCGTCAGACATTCCCGGTCGGGGA
AACCCTTGCGGGTGAGGTAGGCATTGCCGGTGGCCGGTCGGGTTTTCTCCATGAGCCTGACGGCCAGCGCGGCCGCCGCTTTGCGGTCGGCCACAGTTTCAGCCTCTGCGGCCGCAATCACTTCCGGGGCAACCGGTAACAGATTGCCGGTCACGGCGTTCACCTTCCCGGCGGCCTCTGAGGGAGTCACGCCAAACACTTTTTCTACCAGCTTAAGTCCGTCACCCGCGCCGCACTGGTTGCAGAACCATGTCCCGCGCCCCTGTTTATCGTCAAAGCGAAAGCGGTCGGAGCCGCCGCATACCGGGCAGGACTGATGGCGGTTTTTAATCACATTCACACCCAGCGCAGGGAGAATGCGCGGCCAGTGGCCGCACGCCTGTTTTACCGTTTCTGTTACGTTCATTTTCATGGTTATTTTCTCCCTCAGCGCAGTACCGGTGCGGTGATATGACGGGCGCAGAGTTCATCCATTACGGCCAGCCCGAGAAAGGACAGCGACGGCGCGGCCTTGAGTGGTCCGGCTTCCATTAAATCTTCCAGCAGTGCACAGGCAATCTGACGGCCTTTTTCCTCGCCGTGCTGGCGCAGGTAGAAGCCCTCCAGCTCGGCGGCAATGGCGCTTTCCAGCGCGTCGAGGGTGAGGTGCGGGTAGCGGTGCTGGCGTTCGCACAGGGTCAGCCATGCACAGGCCACGGCGCGACGATAGAGCGCGGCGCGTAATACGGGTGGTAATGGCTTTTTCATACGTTACCCTCCCCGGTCAGCCACTGCTGATTGCAGCGTTCGACCACACCGTCGAGCTGGGCGGTCATGAGGTAAATCACGGAGGTGAGCTGTAAGTGCTGCGCCGGGTCACGACGAACGGTGGCGCAGTCCTGCACCTGCATCAGGTCGCCGACGAGCTGGCCGACGTTGCGCATATGCTCCAGACATTCGAGGTCACGGGCGGTAATAGTGGTGTGTCTCATGCGCGCACCTCCGCAATCGGCAGACGGCCAGCAAACGAGAGGACGTAATCGCGAACAAGAGAAAGGCGTGCGGTGTGTTCATCACCGGCAACGGTGCGGAGCATACAGATACGGGGTTTACGGTCTGCGCGAGGAACGGCGGCAAACACAAAGACGAATTGCGGGTGTGACGGGGTGAGGGTCGTAGCCATAGGGGCAACCTCCATTGAGTAGCGGTTATCGCCACCACCGGAGCTGCAAATCTCATGGGTGGTGGCCCGGACAGGGTTTGCAGTACCGGCCTCAATGGATACCGGCCAGCCCGAAGGCTGCCCCGCCCGAACCACCATTGTCTGAAAGGAGCCACGGTGTAAACACCACAGCCCGAAAAATGGGTGTGTCTGAGCTACGACGTAAAAAAAGACGCATGGCGCGTCTGGTGTCGCCATTGAGTTACACGGGCTGCAAATCCCGACTGCCGATTTTGCGACAGCGGGAAAACTATACCTGGAAACGGCGAAAAGAAGCAAGCCAGAAAAAGGGGCTGTTTGCTGAGCGGCCATCATCATGCGTCATAGCCCCGGTTGCGTTCGGCAATGCGATCCGCCATCCATGCAGTGATTTCAGACTGCGCCCACGCCACGTTTTTTCCGCCGAGGGAGATTTGTTTCGGGAAGGCTTCCCGGCTGATGAGGTCGTAAATGGTCGAGCGGGACAGGCCGCACAGATGCATCACTTCGGGCAGACGGATAAAGCGTTCGTGAACGGTATCAGAAACCGGCATCAGCGGGGCGGCAGGGGCAGAAGACGGGGAAGAAAAAGCGGTGTGCATCGGGCTACCTCACAAAGTCCATACAGTGCCGGTCGTGTCCGTCCGGCTTCGGGTAGCTCTCTATTTTGTGAATATTTTCCCTCAGGGCAACAAGTCATTTTGTACTGCTCCACCACACAACAGAGCGTTTTTTATACAGTGGCAAACGTTGGCCGTTTTTTGGCAAACGTTGGCAAACCGGTGGCCCATTGCTGATTACTTTTGTTTATATATTTATT
ATTTTTAATCACTAAAAAGTCTAAGTGGCTGACTGGCTGAAAAAACTGAAGGGTGAACAGTGGTGAACAGACGGTGAACAGTCAGACCTTCAACTGTTCACCATTTAACTTACTGTATTACTTATCTTTTTATTTAAGGTGAACAGTGGTGAATAGTTATAAGTAAAAAAACAAACGGTGAGTAAGGTTTTCCTGCGACCTTTCTCTGGCCAGCCGGTTTTTAAGGTCTGTTTGTGCCAGCACTCTGACAACGGCAATGAATCGTGTTGTTGTGCAGGAGGCGTCAGAATCATTTCAGGTTGAACACACGGAGAGCCTGAACATGAAACCCGAACTCATTATCAAAGCCATGCAGACCGTTATCAGTAAACAGGATGAAGGCGCGGAACAACGTATTGCCGGTGCGCTGGCCGCACTTAACGAAGCAAAAGACGCACACACGGCCAGCATGGGTAAACTCAGCGACATTGAGGCTTCCATTCAGCGTTGTGAGCAGGAACGACAGACCGCGCTCAGTGAAAGTGCACAGGCCGAACAGGACTGGCGCAGTCGCTTTCGAACTCTGCGCGGCAACCTTACTCCTGAACTGAAAGCTGAACACAGTAAACGTATCGCCAGCCGCGAACTGGCTGATGAGTTCACCGGTCTGATTACCGAGCTGAAGAAAGACAAAGGCCTCGCCATGCTCGATGCATGCTCCTCCGGTACTGCTTATATCAGCGCCCATGAAAAAGCGTTCACCACTTACGCCAACAGCGAGTGGAAGAAGGCGCTGGCCAGTATCAGCCCCGCACTGTTACGTGCCTTTCTGTTGCGTATACGGTCGCTGGAAATGAGCGGAGAAACCTCGCCGCGTGCGACCGTGACCCGTGAGCTGGGTGATGCCCTGAATATGCAGTCAGCCCTGTATCATTTTGATATGGAGCAGGAGCCGGTCCTGTCCGTAACGGGTATGAATCGCCCGGTCATAACCGGGGTTGATATGGCGCTGTTAAGAAGCCCGGCCAGACGGATGAAGCTTGCCGCTGAACTGGCCGAAAAATCCCACGAACAGGCAGAGGGCTGAATCATGTTTCACTGTCCGTTCTGCAAAAAGACCGCGCACGTCCGTACCAGCCGTTATCTGTCGGAAAACGTCAAACAGCGTTATCACCAGTGTACCAATATCGAATGTTCGGCCACTTTCCGCACCATCGAGTCGGTTGACGGTGTGATACGTGCCGCACCGGAAAAAACCGACCCCGCACCGGTGACGCCACCGCCGCCGCGTAAAGTACAGGGCTGCTACAGCTCGCCGTTCCGGCATTAATCAGGAGAGAGACACGTGACCACTGTGACATTGCAGCAGGCCTTTGAGGCCTGTCAGACGAACAAAAACACCTGGCTGAAACGTAAAGCCGAACTGGCCGACCTTGAACTTGAATACCGTGAACAGCTCCTTGCCGGTGACGAACAAATCCCGTGCAGAATGCAGGATTTGCGCGACAATATCGACGTGAAAAAGTGGGAGATTAATCAGGCCGCCGGTCGCTATATCCGCTCACATGAGGAGGTACAGCACATCAGCATCCGCAACCGGCTCCATGACTTTATGCAGCAGCACGGCGCGGAGCTGGCCGCCACGCTGGCGCCTGAGCTGATGGGATATCACGAACAAATTCCCGCAGTAAAACAGAGCGCCATGCAGCACTCGGTTGATTATCTGCGTGAAGCCCTGTCGGTGTGGCTGGCCGCAGGTGAAAAAATTAATTATTCCGCGCAGGACAGCGACATTTTAACGGCCATCGGATTCAGGCCTGATGCGGCTTCGCGGGATGATAATCGCCAGAAATTCACCCCGGCACAGAACCTGATTTACACCCGCCGACGTGCAGAACTGGCTGCACGGTAGCACTCAAAAAAATCCCCGAAAATTCCGCTATTTTTCCTGAAAAAAGCCATGCATCCATAAGGTGCATGGTTTTGCATGCAAATCCCCGTATTTTTTATCCCACGCAACA
CCAGTACCGGCGCGGTCTGTGCCGGTTCATGCAACTGCATGAAAACTGCCCTATAAAGCGGGCAGGCGTGGCGGGGAGAGCATTGCGCGCTAATAGCAATGATGCACATTTATTTTCGAGCCCTGAGTACATCGTAGATGTATGCAGATAAAACGATTATGGATCTGTGGCAAATGCATAGAGTAGGTTTGATGCTCTTACGGGGCATGCAGGTGAACAATACCAATACATAATATATTGAACCTACCCCAAAAAATGAGTAGGCATACTTGTTAAAAGTATATTTCTGATCAATGATATAAAAATCAACACTCTAGGCGAAAGAGAACATGAAGTTAAATTATTTCACTTACAGAATAACCGACAATAGAAATCAACAAGTTTATTTTGATAACATTTCAGATATTATCAAAAATTTTTGCCTGCATAGAAAAAAATCTCTTTTTGAGAAAAGTAAAGGATTAAAAAGACTATACCTAGCCATGCCTACCTCATTTGATGGCATATACTATTTGACCACGCCAGCAATTACTACAGCATTTAAAGCTGTAGATAGAGCAACTGGTGTAGTAAATGATTTGGCGTCTGTGCTGGGTAAAGATAGTCTAGAGAAAGTTACTTACTTTTTTATTGACCCTAAACATTCAATAATCGGAGTAACCGAGGGTAAAGGTAATGCTGACATTGATGATTTGCAATTCTTTATTAACGAAATAATTAATCAAGATTTTCAATCGCATATTTACACCTTCGAGCTTTGCACATTAAAAATAGAAATTAAATCCACATCGGCTACGAAATTCAAACTCATCACTGAAGCGCGAGTGAAGTTAAATAATGATTCCGTTGGAGATGTAATAAATGGATTGTTTGGCAAGGAACCATCAGATAATATGGAAGTGCAAATTATCGTAAAAAGAAAAGATAGAAAAGAGAATGTAAAAGATTATATACAACCATTACTCTCAAGTTTATCCTCATCCAATGACAAAGAATATGCAGAAATTTATTTCAGAGCTAAGGCAGATGAGTTTCAATCAAATGTGAAAGAGTTCATTCTAGACCAGAATCAGAATATTTTTGATATAATAAACCCTCATCTCAAGGCAAAGATTGAAGAGCAAATACTTGAAAAAAGATACAAAAACCAAGTTGTGATAAGTGAATTGAGTACATATGCACAAAAATTTACTGGACGCATACATTCAGGTATTATTGATCCAGTGTGGAATGATCTAAAAACAGAAAGTTACCACAAAGCAAAAAGTTGAGGATGACGATGCTTACTAACCAAGCAATAGTAATAATTAATTTAGCCACGTGGGGTGTAAGCATTCTTATTGCTGTTGTTTTTTCTCTCATTGCGGTGTTTTGTGAAAACCAATACATAGAGATAAAACCTGAAGGTATAATTGGCATCGCTACATTATTAGGGACTTTCAGTTTCACAATGACTGGATTCATTGCTGCAATTGGCGCTTATATCATATCGGTGTCTGATAAGACTTCTTTTCTAAGGTGGCGACAGCAAGGATATATAAATATCTTCTACCATATATATGGGCAGAGCATTGTTTTTTTATTGGTAACATTTTTATTATGCATGGTGGCTATCATAATGCCATTTAATGTTGCATTAACAGTTTTGAAATGTGGTTTATACATTCTCATTCTTAATATTGTTCACATCATATTAATAACTGTAATTACACTCGGTCAAATGCAGAAAAAATAAGTTAGCTTTTGACCGGTATTTCTCACTTTTTAACTTTAAGCGTGTTTCCAAACTCATATGGCGTGACTTGAGAATGCTTACTTATATCTAAATAATCCGCCCACCATTGAACCATAAGACGCCTCTCATCTAGATGTTCAGAAGTATGAATATAAGCTGCACGTACATTATTACGCTCTGAATGGCTCAACTGTCGTTCTATAGCATCTTCACTCCATAATCCGGACTCACCCAATGCACCACGGGCCATCGTTCTAAACCCGTG
CCCGCAAACTTCGGTTTTCGTGTCATAGCCCATCGCACGCAATGCGCTATTTACCGTGTTTTCGCTCATAACCTTAGTTGCGTCATGATCACCCGGAAAAAGCAGCTCTTTATCACCACTAATCTGCTTTAACTGGTTTAGCAAAATCATCGCCTGCCGACTAAGCGGAACGATATGCTCCTCTTTCATCTTCATGCCACGGTACGAGTAACGCACACCTTTAATTTCTTCTCGTTTTGCAGGTACTCGCCAGAGAGATTTATCGAAGTCGAACTCATCCCAACGCGCGAAACGTAACTCACTGGAACGCACAAAAGTTAGTAAGGAAAGCTCAACCGCGATCCGCGTCATTACACGGCCACGATATGCAGCAAGACGAGCAAGAAACTCAGGGAACCGGCTGGAGGGCAAAGCGGGGTAATGTCGCGCTTTGGTTGTCGATAGCGCACCGGCCATATCACTGGCTGGATTTGAGTCGATGTAATCGTTCTGTACTGCATAACGCATAATAGCCGTGACTCGCTGCTGAAGGCGCTGTGCAACGTCATGCTTACCACTGGCATCAACTTTTTTAATCGGGGCTAACAGGTGGCTAGTTTTGAGCTGACGGATGTCAGACGAACCAATATGAGGAAAGATATAAAGCTCAAGATAACGTAGAACGCGTGATCGGTGATCTTCACTCCAGCGCTTATTACTAGCATGCCATTCACGAGCGATAGTTTCGAAAGAATATGCCCCCGAATTCTCGGCCTGAGCTTCTTTCTGTTCGGCTTTTGGATCAATGCCCTGCGCTAACAGCTTTTTAGCTTCATCGCGCTTTGCTCTTGCCTGAGCAAGCGTCACAGTAGGCCAAACACCAAAAGCAAGGCGATCCTCTTTTTTGTCTGAGGGGCGTCTGTATTTCATGCGCCAGTATTTAGAACCCTTGGCCGAAACCTCGAGATACAAACCGCCGCCATCGGCCATTTTATAGGTTTTGTCTTTTGGCTTTGCGGTCTCGACCTGTCTGGCGTTGAGCTTCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP014620|4007613:4016561|4012859_4013426_+|WP_000214429.1|DBSCAN-SWA MTTVTLQQAFEACQTNKNTWLKRKAELADLELEYREQLLAGDEQIPCRMQDLRDNIDVKKWEINQAAGRYIRSHEEVQHISIRNRLHDFMQQHGAELAATLAPELMGYHEQIPAVKQSAMQHSVDYLREALSVWLAAGEKINYSAQDSDILTAIGFRPDAASRDDNRQKFTPAQNLIYTRRRAELAAR >NZ_CP014620|4007613:4016561|4013864_4014806_+|WP_000775190.1|DBSCAN-SWA MKLNYFTYRITDNRNQQVYFDNISDIIKNFCLHRKKSLFEKSKGLKRLYLAMPTSFDGIYYLTTPAITTAFKAVDRATGVVNDLASVLGKDSLEKVTYFFIDPKHSIIGVTEGKGNADIDDLQFFINEIINQDFQSHIYTFELCTLKIEIKSTSATKFKLITEARVKLNNDSVGDVINGLFGKEPSDNMEVQIIVKRKDRKENVKDYIQPLLSSLSSSNDKEYAEIYFRAKADEFQSNVKEFILDQNQNIFDIINPHLKAKIEEQILEKRYKNQVVISELSTYAQKFTGRIHSGIIDPVWNDLKTESYHKAKS >NZ_CP014620|4007613:4016561|4010502_4011054_-|WP_000979749.1|DBSCAN-SWA MMMAAQQTAPFSGLLLFAVSRYSFPAVAKSAVGICSPCNSMATPDAPCVFFYVVAQTHPFFGLWCLHRGSFQTMVVRAGQPSGWPVSIEAGTANPVRATTHEICSSGGGDNRYSMEVAPMATTLTPSHPQFVFVFAAVPRADRKPRICMLRTVAGDEHTARLSLVRDYVLSFAGRLPIAEVRA >NZ_CP014620|4007613:4016561|4009961_4010282_-|WP_000743145.1|DBSCAN-SWA MKKPLPPVLRAALYRRAVACAWLTLCERQHRYPHLTLDALESAIAAELEGFYLRQHGEEKGRQIACALLEDLMEAGPLKAAPSLSFLGLAVMDELCARHITAPVLR >NZ_CP014620|4007613:4016561|4007613_4009947_-|WP_000783715.1|DBSCAN-SWA MKMNVTETVKQACGHWPRILPALGVNVIKNRHQSCPVCGGSDRFRFDDKQGRGTWFCNQCGAGDGLKLVEKVFGVTPSEAAGKVNAVTGNLLPVAPEVIAAAEAETVADRKAAAALAVRLMEKTRPATGNAYLTRKGFPDRECLTLTVMHKTGGVTFRAGDVVVPLYEDTGTLVNLQLINADGLKRTLKGGQVKGACHIIEGKKQAGKRLWIAEGYATALTVHHLTGETVMVALSSVNLLSLASLARQKHPACQIVLAADRDLNGDGQNKAAAAADACEGVVALPPVFGDWNDTFKQHGGEATRKAIYDAIRPPAQSPFDTMSEAEFTAMSASDKALRVHEHYGEALAVDANGQLLSRYENGIWKNIPAATFSRNVADLFQRLRAPFSSGKIASVVETLKLIIPQQDTPARRLIGFRNGVLDTQSGLFSPHSKSHWLRTLCDVDFTPPVDGETLETHAPNFWRWLDRAAGKNPQKRDVILAALFMVLANRYDWQLFLEVTGPGGSGKSILAEIATLLAGEDNATSADIDTLEDPRKRASLIGFSLIRLPDQEKWSGDGAGLKAITGGDAVSVDPKYQNPYSTHIPAVILAVNNNPMRFTDRSGGVSRRRVIIHFPEQIAPEERDPQLRDKIARELAVIVRQLMQKFSAPMTARALLQSQQNSDEALSIKRDADPTFDFCGYLEMLPQTNGMFMGNASIIPRNYRKYLYHAYLAYMEANGYRNVLSLKMFGLGLPMMLKEYGLNYEKRHTKQGIQTNLSLKEESYGDWLPKCDDPAAT 
>NZ_CP014620|4007613:4016561|4014814_4015270_+|WP_000957221.1|DBSCAN-SWA MLTNQAIVIINLATWGVSILIAVVFSLIAVFCENQYIEIKPEGIIGIATLLGTFSFTMTGFIAAIGAYIISVSDKTSFLRWRQQGYINIFYHIYGQSIVFLLVTFLLCMVAIIMPFNVALTVLKCGLYILILNIVHIILITVITLGQMQKK >NZ_CP014620|4007613:4016561|4011421_4011550_+|WP_162491381.1|DBSCAN-SWA MLHHTTERFLYSGKRWPFFGKRWQTGGPLLITFVYIFIIFNH >NZ_CP014620|4007613:4016561|4010278_4010506_-|WP_001216597.1|DBSCAN-SWA MRHTTITARDLECLEHMRNVGQLVGDLMQVQDCATVRRDPAQHLQLTSVIYLMTAQLDGVVERCNQQWLTGEGNV >NZ_CP014620|4007613:4016561|4011050_4011317_-|WP_000556587.1|DBSCAN-SWA MHTAFSSPSSAPAAPLMPVSDTVHERFIRLPEVMHLCGLSRSTIYDLISREAFPKQISLGGKNVAWAQSEITAWMADRIAERNRGYDA >NZ_CP014620|4007613:4016561|4015292_4016561_-|WP_000772664.1|integrase|DBSCAN-SWA MKLNARQVETAKPKDKTYKMADGGGLYLEVSAKGSKYWRMKYRRPSDKKEDRLAFGVWPTVTLAQARAKRDEAKKLLAQGIDPKAEQKEAQAENSGAYSFETIAREWHASNKRWSEDHRSRVLRYLELYIFPHIGSSDIRQLKTSHLLAPIKKVDASGKHDVAQRLQQRVTAIMRYAVQNDYIDSNPASDMAGALSTTKARHYPALPSSRFPEFLARLAAYRGRVMTRIAVELSLLTFVRSSELRFARWDEFDFDKSLWRVPAKREEIKGVRYSYRGMKMKEEHIVPLSRQAMILLNQLKQISGDKELLFPGDHDATKVMSENTVNSALRAMGYDTKTEVCGHGFRTMARGALGESGLWSEDAIERQLSHSERNNVRAAYIHTSEHLDERRLMVQWWADYLDISKHSQVTPYEFGNTLKVKK >NZ_CP014620|4007613:4016561|4012604_4012844_+|WP_000468231.1|DBSCAN-SWA MFHCPFCKKTAHVRTSRYLSENVKQRYHQCTNIECSATFRTIESVDGVIRAAPEKTDPAPVTPPPPRKVQGCYSSPFRH >NZ_CP014620|4007613:4016561|4011791_4012601_+|WP_075207146.1|capsid|DBSCAN-SWA MNRVVVQEASESFQVEHTESLNMKPELIIKAMQTVISKQDEGAEQRIAGALAALNEAKDAHTASMGKLSDIEASIQRCEQERQTALSESAQAEQDWRSRFRTLRGNLTPELKAEHSKRIASRELADEFTGLITELKKDKGLAMLDACSSGTAYISAHEKAFTTYANSEWKKALASISPALLRAFLLRIRSLEMSGETSPRATVTRELGDALNMQSALYHFDMEQEPVLSVTGMNRPVITGVDMALLRSPARRMKLAAELAEKSHEQAEG |
12 | Enterobacteria_phage(83.33%) | capsid,integrase | attL 4004837:4004853|attR 4016731:4016747 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
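
The protein records listed for each region carry pipe-delimited headers such as `>NZ_CP014620|4007613:4016561|4015292_4016561_-|WP_000772664.1|integrase|DBSCAN-SWA`. The sketch below splits such a header into named fields. The field meanings (contig, region range, gene start/end/strand, protein accession, optional keyword, source tool) are inferred from the records in this report rather than from a documented specification, and `ProteinHeader` and `parse_protein_header` are hypothetical names.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. 'integrase'; None when the field is absent
    source: str


def parse_protein_header(header):
    """Parse one header line; the layout is assumed from this report's records."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    region_start, region_end = (int(x) for x in fields[1].split(":"))
    gene_start, gene_end, strand = fields[2].split("_")
    keyword = fields[4] if len(fields) == 6 else None
    return ProteinHeader(
        contig=fields[0],
        region_start=region_start,
        region_end=region_end,
        gene_start=int(gene_start),
        gene_end=int(gene_end),
        strand=strand,
        accession=fields[3],
        keyword=keyword,
        source=fields[-1],
    )


if __name__ == "__main__":
    record = parse_protein_header(
        ">NZ_CP014620|4007613:4016561|4015292_4016561_-"
        "|WP_000772664.1|integrase|DBSCAN-SWA"
    )
    print(record.accession, record.keyword, record.strand)
```

Treating the keyword as optional lets the same function handle records with and without an annotation between the accession and the source tool.
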
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
4266227 : 4309790
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP014620|4266227:4309790|DBSCAN-SWA GTTACTCATCATCGTATTGCGGTCCCGCATAGTTATCGAAGCGCGACCACTGACCATTGAACGTCAGACGAACCGTACCGATAGGACCGTTACGTTGCTTACCAATAATAATTTCAGCAATGCCTTTTAAGTCGCTGTTCTCGTGATAAACCTCATCACGGTAGATAAACATAATTAAGTCGGCATCCTGTTCAATAGAGCCGGATTCACGCAGGTCGGAGTTCACCGGACGTTTATCCGCGCGTTGTTCCAGGGAGCGGTTAAGCTGCGATAGCGCCACGACCGGCACCTGGAGTTCCTTTGCCAGCGCTTTCAACGAGCGGGAAATTTCGGCGATTTCCAGAGTACGGTTATCAGAAAGCGACGGCACGCGCATCAATTGCAGGTAGTCGATCATAATCAGACTTAACCCGCCATGTTCGCGGAAAATACGCCGCGCGCGCGAACGGACTTCTGTCGGCGTAAGACCTGAGGAATCGTCAATATACATATTGCGTTTCTCCAGCAGAATGCCCATCGTGCCGGAGATTCGCGCCCAGTCTTCATCATCGAGTTGACCGGTACGAATACGTGTCTGATCGACGCGGGACAGCGAGGCCAGCATACGCATCATGATCTGTTCGCCGGGCATCTCCAGACTAAAGATCAGTACCGGTTTATCCTGCAACATCGCCGCATTTTCGCAGAGGTTCATCGCAAAAGTGGTTTTACCCATGGAGGGACGCGCCGCGACGATAATCAAATCCGAACGCTGTAACCCTGCCGTCTTTTTATTGAGATCCTGATAGCCGGTATCCACGCCTGTAACGCCATCGTGCGGTTGCTGGAACAACTGCTCAATACGCGCCACGGTAGCGTCGAGAATCTGGTCGATGCTTTTCGGACCTTCGTCTTTGTTGGCCCGGTTTTCCGCGATCTGGAAGACGCGCGACTCCGCCAGATCCAGCAGTTCGTCGCTATTGCGCCCCTGTGGATCGTAACCGGCATCCGCAATTTCATGCGCCACCGCGATCATATCGCGGACCACGGCGCGTTCGCGCACAATGTCCGCATAAGCACTGATGTTCGCCGCGCTTGGCGTATTTTTAGACAACTCCGCCAGATAGGCGAAGCCGCCGACGCTGTCCAGTTGGCCCTGCCGCTCCAGCGATTCCGCGAGCGTAATCAGGTCGATAGGACTGCCGCTTTCCTGCAAGCGCCCCATCTCCGTAAAGATATGGCGATGCGGGCGGGTATAGAAATCTTCCGCCACCACGCGCTCGGCCACATCGTCCCAGCGCTCGTTATCCAGCATTAAACCGCCCAACACCGACTGTTCCGCTTCAATCGAGTGCGGCGGCACTTTTATCCCGGCAACCTGCGGATCGCGGTCGCGGGCATCAGTCTGTGGTTTGTTGAAGGGTTTATTTCCTGCCATAGTGAATGGAGTTACCGAGATAGTGATTGGGTCGAAAGATTACCACATTTCTTTTGGAGGAAGCATGGCAACGCGTATTGAATTTCACAAGCATGGTGGTCCGGAAGTGCTTCAGACCGTGGAGTTTACGCCAACGGAACCGGCGGAACACGAAATCCAGGTTGAGAACAAAGCCATTGGTATCAACTTCATCGACACCTATATCCGTAGCGGACTCTATCCGCCCCCGTCGTTGCCTGCGGGCCTGGGAACCGAAGCTGCGGGTGTGGTCAGTAAAGTCGGCAACGGCGTGGAGCACATTCGCGTGGGCGATCGCGTCGTCTACGCGCAGTCAACGCTCGGCGCTTACAGTTCCGTCCATAACGTCCCCGCAGATAAAGCCGCGATTTTACCTGACGCCATTTCCTTCGAACAGGCGGCAGCCTCTTTTCTCAAGGGGTTGACCGTTTTTTACCTGTTGCGCAAAACCTATGAAGTGAAACCCGACGAACCCTTCCTGTTTCATG
CCGCTGCGGGCGGCGTCGGTCTGATCGCCTGCCAATGGGCAAAAGCGCTGGGCGCGAAGCTTATCGGTACCGTCGGTAGCGCGCAAAAAGCGCAGCGGGCGCTGGACGCCGGTGCCTGGCAGGTAATTAATTACCGTGAGGAGAGCATTGTCGAACGGGTAAAAGAGATCACCGGCGGCAAAAAAGTCCGCGTGGTCTATGACTCCGTGGGGAAAGATACCTGGGAAGCCTCACTGGACTGCCTGCAACGTCGGGGACTGATGGTCAGTTTCGGCAATGCGTCCGGCCCCGTCACTGGCGTGAATTTAGGTATTCTGAATCAGAAAGGTTCCCTGTATGCCACGCGACCTTCACTACAGGGGTATATTACGACGCGTGAAGAACTGACCGAAGCCAGCAATGAATTGTTCTCATTGATCGCCAGCGGCGTGATTAAAGTTGATGTGGCTGAAAATCAACGCTATGCGTTAAAAGATGCCCGTCGCGCGCATGAGGTACTGGAAAGCCGGGCCACACAGGGCTCAAGCCTGCTGATTCCGTAATAGCTCTGCAAAGAAATTGGGCTTCCACCCGGGAAGCCCTTTCTTTTTTTGTTCGGCTGTATGTAGGGTACAGCGCGATGAATTCGTTACCTGCGCAATCATGACAGATTTAATAATCGATTCCTATTTGCTTGTGAGGGCAAAGTTCCAGGTTGTGACGAACCGCTCAATACCTTAGTAAAACCGACGGTTATTGCGCTGATACTGTGGGATTTTTGGCGTTTTTACTGCTTTGATCACCCACACCACAGCCACCGCCAGCAGTAGCCACGGTAACAGCTTGATCATCAGGGCGAACATTCCACCCAGGAACATGACGGCAGTCGCTACAACCAGCGCGGCCAGAATGCCCAGCAAGGAGACGCCCGTCACCATTAACATCAGAAAAAAGCCAAGCACAAAAAGTAGTTCCAGCATAGTCGCTCCCCATAAAGATGGCATTGCCCGGCGGCATGGCGCTTACCGGGTTTGGTCAGGTAAGCTATTACAAAAATCATGCCAATATTTATGTTTTTGATATATAAAGAAAACGCCCTGCAAGACTGCACAGAGCGTGGTGAGATTGACTAATTTTTGGCGAACTTTTAACGCTTGTCTGCTACCAGTTTTAGCGCCTGCTCCAGTACAGCAACATCCGCGCCAGCTTTATGGGCGTTTTCGCTCAGATAGCGACGCCACTGCCGCGCGCCGGGGATGCCCTGGAACAACCCCAGCATATGGCGAGTGATATGCCCCAGATACGCTCCCTGGCTCAATTCACGCTCAATATAGGGATACATCGCGCGAACCACCGTAACCGGGTCGGCATCGGTGGTATCGGCGCCGAAAATCTCCCGATCTACCGCGGCCAGTATACCCGGATTCTGATAAGCTTCGCGGCCAACCATGACGCCATCCATATGGCGCAGGTGTTCCTTCGCCTCTTCCAACGATTTGATGCCGCCGTTAATGGACATGGTCAGGTGCGGAAAATCCCGCTTTAGCTGATAGACGCGCGGGTAATCCAGCGGCGGGATCTCACGATTTTCTTTCGGGCTTAAGCCAGAAAGCCAGGCTTTGCGCGCATGGATAATAAACATCTCGCATTCGCCCCGACCAGAAACCGTATCGATGAAATCACACAGAAACGCATAACTGTCCTGATCGTCAATACCAATGCGGGTTTTTACCGTCACCGGAATCGAGACGACATCACGCATGGCTTTAACACAATCGGCGACCAGTTGCGCATTGCCCATCAAACAGGCGCCAAACATACCATTTTGCACACGATCGGAGGGGCACCCCACGTTGAGGTTAATTTCATCGTAGCCACGCGCTTCCGCCAGCTTTGCACAATGCGCAAGCTGAGCCGGATCGCTTCCACCAAGCTGTAGAGCGACCGGATGCTCTTCTTCGCTGTAAGCCAGATAGTCACCCTTACCGTGAATAATTGCGCCCGTGGTCACCATTTCGGTG
TAGAGCAGCGTCTGGCGAGACAGCAAACGCAGGAAATAGCGGCAATGTCTGTCCGTCCAGTCGAGCATAGGAGCAATGCTAAACCGAGAATTCCAGTAAACACCAGTTTTTTCAGGCATCACGCTGGTTTGATTAATTTTTTTTGTTTCATGATTATCGTGCATTTTTGAACATTTCAGGCTATTTTTCTCGCGTTAGGTTCCCGCACAGGTTCCCACGTTTTATGGGAACCCGAAATAACGAGGTCGTGTAATGGCGTACTATAACATAGAGAAACGACTAAAATCCGATGGCACACCACGCTATCGCTGTAATGTGATTATCAAAGAAAAAGGTGTTATCACTTACAGGGAAAGCAAAACATTCCCTAAACATGCTCATGCCAAAACATGGGGCACACAGAAAGTGATGGAATTAGATCTATATGGCATTCCATCATCAAATGCAGTTGACGGACTTACAGTCCGTGACTTACTACACAAATATTTAAATGACCCAAATGCCGGAGGTAAAGCAGGCCGTACTAAAAGATATGTGCTGGAACTGCTTATGGATAGTGACATCTCCGCGATCAAACTATCTGAACTGACAGAAAATGACGTAATTGAACATTGCAGGCTAAGAAACAACGCTGGTGCAGGTCCAGCTACAGTTAGCCACGATGTTAGTTATCTTGGCAGTGTTCTGGATGCTGCCAAACCTGTATATGGAATTAATTACACATCAAACCCAGCAAAAGCCGCTCGTCCATATCTACTTAAACTTGGTTTAATTGGTAAATCAAATCGTCGTAATCGTAGACCGGCATCTGATGAACTGGACATGCTCATTGAAGGTCTTCAACAACGATCTACACATAAATGCTCAAAAATTCCGTTCGTTGATATCCTCAAATTTTCTGTGTGGTCATGTATGCGAATCGGTGAAGTATGCCGATTACGATGGGAGGATCTCGATCAGGAACAAAAATCCATACTCGTAAGAGACAGGAAAGATCCACGTAAAAAGGAAGGCAACCATATGAAAGTAGCCTTGCTTGGGGAAGCCTGGGATATCGTCCAACGACAACCCAAAAAATCAGAATTCATTTTTCCATATAACAGCACTTCTGTTACTGCGGGATTCCAGAGGGTAAGAAGCAAATTAGGTATTAAAGATCTGCGATACCATGATTTGCGTAGAGAAGGGGCAAGTCGCTTATTTGAGGCTGGTTTTAGTATTGAGGAAGTCGCCCAGGTTACAGGGCATCGTTCATTAAACGTGCTATGGCAGGTATATACCGAACTGTATCCGAAATCTTTACATAATCGTTTTGAAGAGCTCCAAAGGAGCAGAAATAAGACCTCTTGACACTGTTTATCCATACAGTTAAAAATAATGCTGTATACAAACACAGTATAGAGGGACTTTTATGCGTATTGAAATCTGCATAGCCAAAGAAAAAATGACTAAAATGCCAACCGGTGCTGTGGATGCGTTAAAGGAAGAATTAACCCGACGCATCAGTAAACGTTATGACGATGTAGAGGTGATCGTAAAAGCCACCAGCAACGATGGCCTTTCTGTTACACGCACCGCAGATAAGGATTCTGCAAAAACTTTTGTTCAGGAGACTCTGAAAGATACCTGGGAATCTGCTGACGAGTGGTTTGTTCACTAATTAACACGTAAAATCGGTAACGGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTTGCCAGCCAGCCGCAGCACGTTCTTGCATACGACGTGGCTGCGGCTTCCAACATTAGACAAATAACTCTTTAAATTGCTTTTAAATTATTTCGTTTGAATGCCAGTAACAGGAAATCGTTTATATAGGGTTGATAGCCCAACGTTATAGATACGTGCAACATAACGCCGTGATTTCCCTGCCGCTATGAGCGCTCCCATCTGTTGCCACTGCTCGTCGCTAAACTTCGGTCTACGCCCACCAATCCGGCCTTTGGATCTGGCAATAGCCAAACCAGCTAAAG
TTCGCTCGCTATTCAAATCAGATTCATACTGCGCAGCAGAAAGAATATTACGGAAATTATAGCGACCACTTGCTGTTTTCAGGTCTACGCCATCTGTAATACTCCGAAAATTAACACCTTTTTCGTGCAGATTTTGAAACATCAATAGCGCATGCAGCACATTTCTCCCTATCCGATCTAACTTCCAGACAATCAACTCATCTCCACTTTTCATCACCGTAATTAATTCCTTTAACACAGGGCGATTAGCTGTTCTGCCACTGGCATATTCTTCATAAATTCGCTCACAGCCAGCTGACTCAAGTGCAAGACGTTGCAACTCTGTATCCTGATGATTTGTTGATACACGAACATACCCGTAAATCATGAGTGCTTCTCCTGTTGTAAAAACAGGAGAAGAGGCGAAATATCACCTGATTCAGAAAAATATTTGAAAGGTTGGTTTAGGAGAAACGATAAATCTGGCAAGAAATGCCGTTCCGGCGACACGCCGGATTAACAGTAAACCACTGACCGGTGATATCACCCTGTGGGCGTCAGATGTGGGGGCATTACCAATTGCCGGAGGACGACTGAATGGTGCGTTAGGCATTGGTGCTGATAATGCGCTGGGTGGTAATTCGATTGTGCTCGGTGATAACGACACAGGAATTAAGCAAAACGGAGATGGTGTGCTTGATATTTACGCGAACTCCGCACATGTACTCCGCTTTATCAGTATCCTCGTGGAGAGCATGGTTTCCCTGAAAGTAAACGGAAACGCTGTAGCCACAGGCGAAGTACAGGCAGGAAATGGCTCATCGCGCATGACTAATAACGGCGACATCTTTGGTTCTGTCTGGGGGAATAGCTGGCTGAGTCTGTGGATTAATAATAATTTTGTCGCAGATGTTCAGTTAGGGGCTGGCACATCTGTGACTACCTGGAACAATGCGGGGTCATGGCCTAACACTCCCGGATATGTAGTTACTTCCGTCTGGAAAGATAATCAAGGCGAAAATATTGATGGTATTAATTATGCGCCTTTGCAAAAACGAGTCGGGAATCAGTGGTATACCGTACAAGGGGGAACGACATAATGAAAAAATATCAGGATATTAAAAATTTCAGACTTATTGACGCGCCCGTAAACAGAGGGAAAACGCAGTCCGAAATAAACATAGGTGCATATTTTCTGGAGTCAGAAGACGGGCAGGACTGGTATGAGTGTCAGTCATTATTTTCTGATGATACTGCAAAAATTATGTACGATCCTGAAGGGGTTATCTGGAGTGTTGTTAATCAGCCAGTCCCGCAACGTGGAAACACATACGCCGTATCAATGTTGTGGCCGGTTAATATGTCTGTTGCGGAAATAGACGCTGCTGACTGCCCTGATGATTGTCGTGGTGATGGCTCATGGTTGTACAGGGATGGTCAGGTTTTACCCGTTCCGGTGGATTATCAGGCTAAGGCGGAAACCACCCGACAGAAATTACTTAACGATGCAAATAATGTCATTAAGGACTGGCGTACAGAATTAACGCTGGGGATTATTAGTGATGAAAACAAAGTCACTCTAATAAATTGGATGGGATACATTAATAAGTTGAAAGATATTGATTTTTCACAAGTTAATGATGAGGCCACCTTTGAAAAAATAAAGTGGCCTGAATTACCTAAATAATGTCTTACTGACTGGCTGGCTTCTCCGGCCAGTCAGGGTCAGATGAATCAACCCGACTGACCAGAACGCTGTAGAGTTCCCATGCTTCCAGCCGCTTCCGCTCCTCATCGGTTGCGATACCCATCTTTACTGCGCGCGACAGTGGCTTAATAACGACTTCGGCTTCTTCGAGTAACTTCTGTTTTTTCGCCTCTGCCTGCTGGCGTAGCTCCTCCGGCGAATAAACACGTTTACTCACCTGCTCACCATTAAACATCCAGCGTCCTGATACATCCGCCCGGCGATTAGCTGTGATATCAGGTAACTCAACAACGCTGCACCCATCAGGATTTAT
TGCCGAAACATCTTTGTTAATATCCACAATAACATTATTTTTATCGTAGGCAATTTTTAACGAGTCGGCAGAAAATTTCTTCTGTTCCTCATACCAGTTTTTACCATCTTCATCAAACAGCCACACCACACCAAATTTTTTAGTGAGTTGATACTGGTCAGGCGTTTTTGGATTACCGGCTACGATATTTTTCAGATGCATCATAATTAAATACTCACCACGTTATACCACTGGTTGCCAATTAGTTTTTGTATTGGGCGTCTGTGCGCTCCGTCAACCAGTTCATCACTATTGCCATTAGTGATGCCGGTTATTACATAACCAGAAGTATCACTGAACCCCGGACCGTTCCATACCTGCCCATATTGCAGGCTACCCAGCCTGATATCCTGCACGTAACGGCTGTCAAAGTTGGAATAGCTATTCGGTTCCATCTGACCATTTACTCTGAACGAAATACTGCCATCTGTATTTCTCTGGCTGTAGAATTGCCATCCCTGATCGTCGTCCAGTTCAATCACTGTGGGTCTGTCTGCACCACCCCATAAATTAAACGTGGCTGTCATTGTCGAGTTATTATTACTCGTCAGTGAAAACTGTTTTCCGTCACCTACTCGTATGCCACCATTAGTGAGAACATTTACTGACATGTGCAGCCCGGAATTGTCGATATAACCGATCCTGGCATTATTGGCGTAAATACCCAGAACGCCGTCGCCATCCTGTTTAAACCCTGTATCGTTATCACCGAGCACAATCGAATTACCACCCAGCGCATTATCAGCACCAATGCCTAACGCACCATTCAGTCGTCCTCCGGCAATTGGTAATGCCCCCACATCTGACGCCCACAGGGTGATATCACCGGTTAGTGGTTTACTGTTAATCCGGCGTGTCGCCGGAACGGCATTTCTTGCCAGATTTATCGTTTCTCCTAAACCAAGGTTTTCGAGAGCCGTTTTCACCGTGCCGTCCGATTTGATATCGCCAAACGGATTCTTGCGGCTTAACAGCAGCGCACGAAGCGCGGTAAGCAGCTGGTCGTGCCGCCCCTTCTCCAGGCTGGCACCGGATGCCTCCACCACGCTGCAAAGCTCCTCCTGCAACATGTCAAAGTAGTCATCATCCAGATCGGTGGCAGGCGTTCCGGTCTGGGGGTTACCACGGGTAAAACCGTTCTTACCCGCGCCGAACTTATCCTTCTGCGCGGTTTTCGTGTCTATACGATGCATGGATTACTCCGGATATTTAAAAATTACGTAGGTATGCGAAGGGCAGAGTTTGTTAAGCACGCACTCGACAACGGTGTCGCCCCAGATACGCAGTGCGGAATCACAGGGATCGCCACATGTCATCCAGGTGGTGTTGGTGGCGGCTGGCATGTTGACCTGCCAGTAATACCGCCATTCCGGCGCATTCACCGCGTCAGTACAGGCCGATGAGCAGGTGAACGTACTTTTATCGTATCGCGTGATAGTGGCGTCTGGTCTGCCCAGGGCAGCAAGCTGTGCAAGGTAAAAATCCTCATTGATGCCGCCCGCCAGATTAACCTTCGCATCCAGCCGTTGCTGACGCTGGCGAAGGGTCTGTGTCCCTGCGGGAATACATTCATCCGGCAGACCGCACAGACGCTCCCAGCGGTTTATCAGTTCAGTGGTGGTGCGCGGATCCAACTCCCGCATCAGGGCATCCGCACGCTGATGAACACGGGTTAATGACGGTGCCGCACCTGCAATCGCCGGATCGCTGGCTGACCACGCCGGACCGGGCGGCAGCAGTGCCGACAACAGACGGATGTAATCATCGTTTGTCACGTCCATGAAATCGTCCCCAGAACCGCCAGTTCATTTTTTGCAATGGAGATATTGTCTGCCGGTGCAAGCAACTGATGGCTGTATTCCCCGTTCGCACCGGAAATCGCCTCACTGATACACGATACCTTCAGTTCTCCCTGCGGATAACCATCACGCAGCAGGAACGAACGCAACTCCGCGGTGATG
ACAGCCCGTATTTCCGGTGTGTCCGGCGTCACACGGATATGAAAATCCACCGTATGTGCCACCGGCCTGAACACATACAAATCAGAGCCTGCCACCGGGGCCAGTGGCTCGATATGTTGTCTTGCCGCCGTTTCCGTTGATTCTTCCGGAATGGGATTAATCAGGTCACTGCTGGCAATCATCACACCGACAGTTCCCGTTCCCATCCAGTGACGGTATGTCCATGCGCGGGTAATGCCGGGCACTTCTTTAGCCCAGACGACATAGTCCCCGTCAGCCCCGCCCTGAGGCGTCCAGTAATACCGCTCAATGACGCGGGCGCGCCACGTTTCCAGCTCTTCAGTATCAAATCCGCCTGTCAGGGTGTCAGCCACACCGGAAGACGGCAGACCATTCACCGGCGTGACCAGGATTAATGCCGTACCGTCGTCAGCGTTACCGACCGCACCTGCACTTGAGCAGGCGATCGGCACGCGCAGGACACCACCAGAGCTGGTTGCATCGGCAGTTGCCGTGTACTGAACCAGGTCATCGCGCTGAATAACACTCCCGGCAGTCACCTTCAGGCCATCGCTGACACCTTCCCAGCGCATATACCCGCTGGCAGCCGTGGCCTCCTTGCGCGGACACCGTTTCATCGCAGCATGTCGCGCCAGCCAGGACTCATCGCACAGGTCAGGCAGCATGTTCATTGCCAGATAATCGATGTAACCGTAAACCGTATGCAGCGCCGCCGCATACACCTTTGCCCGCACGTCTTCATCCATGCGCCGGAGCGTGTCGCTGACGTCCAGCCTGGCGAATAAATCGTTACGGAGCATACTGATATTTTCTGCCAGCGTCGGGCGCTGAAATTCACTGTCCGCCATGCGTTATCGCACTCCACAGATCATCAAAAGAAATCATTACCGGTCCGTCACGACGCCAGAGAGTGATACTGTTACCCAGTTCATTAATCCCGGTGCGGCGGATATCCAGATCAATACGGGACACCACGCCGTCATCAATCATCCATTGCAGGCATTCGCGGATATACCCCCTTACCGTCTGCACCAGCTGATTGGTCAGTTTGCTGCGCTGAAGCAGCCACAGTCGGGAGCCGTAACGGTCATTCTGTACCGCAGGCCAGGTATCCCCCCACCATCCCATCGGGACGTCGGCATTATCATCAGGCTCCGCCCGCCGCCAGGTGAACAGGGAAATCACCACGGCGCGGGTCAGCGGATCCAGCGGTGCGCTGGCGCAGGTGCGTTTACCGTTCACCGTCAGCCACAGTTCCATCATGCCTCCATCGCTTTATCCGGTTTGTCGGTGTTACTGCCCTGACCGTTCTCTCTGTGACGATGGCCGTTATAGGCAAGCCGCATCGCTGACATGGTGGTGCCGCCGGAATCGCACAGGTCTTTCACCTGTCCGGTCACTTCCAGGTCCATTTCAAAACGCGCTTTAGGTGAATTGCGAAACGTGATCGTTTTACCTGCACCGTCCACCACGAGCCCCTCCCGGGTCAGCGTCACGGACTGCCCCTGATCGTCATAGACAGCCACCTCACCCGTCTGCAGCCCTTTCAGGCGGTAGCGCCGGTCCGACACCGTAACAACCACCGCATGAGAACGGTCGCCATCCGGAAACAACACCACCGCTTCCGCACCGCTGTTTGCCCTTGAGGTAAAACCGTAGGGTTCAAGATGTTCAACCCCGGCTTTGGGTTCACCGGCAATCAGGGACACATCCACGGTCTGACATTTCGTGGCGGCACTGATGCTTTTCACCACGGCCCGCCCAATCAGGCCGAGGAGTTGTCGCTGCATGGCTTCAATCGTCCTCATCAGAACGGGTCCTCCTGTACTCTGGCTTTTTTCTTTTTCCGCGCGCCGGGGGCTTCGGGTTCAGGCAGATAAGCATCAGGTGGGCCGACACGGATTTCCGTCAGGGTGCCGTTCTGGTCCTGAGTAAACGTGACTTCCGAAACAAGCAGTTCGGTATTGTCGAAACCACAG
ACCGGATCAAAGACAATCACCCGCTGGTTGGGCTGCCACAGCGTACCGTTACCCTGTCGCCAGCCCTGCACCACATAGGTGGTTTCATCCGTCCGCGCCGCCCGTTGTCGGGCTTCAAAGTCCGCACGGGCAATACAGCCTGCCCCCGTAGCCTGCCCTGTCTGCCTGATATACATCGGACGGTAACGGGCAATAAATGCGTCCTCTGTGCGGGCCCGCAGCGCGGTGGTGGTGGCCTCACCGAAATCATCGTCGTTTCCGGCACGCTGCCCCGCCACCTGGTAAACAGAAAACCGCTCCCGGATACTCTTCTCCGTATCGCAGGAAAGGATGTTTTCCCCGAGTACCAGCGCAGTATGTGCCCGCGTTGAGCCAATACCGCCAATCACCAGCCTGCCGTGCGGGTCGTCGTAAGCCAGTGCCTGCTGCTGACCGAGTATTTTGTTGATTACCTCAATCACCGTTTCACCGTGATCAGGCTGGACATCAGGAATAACACCCGACGGCGCACCGTTGTTCACCACCTCAATGCCGAAAGGCGCAGCAAGCGCCTGCGCAATCTGTACCAGCGATCGTCCATTAAACTGTGTCGGTTCGGCTGCACAGTCAATCAGGTCAGCGGTCAGACTGCGTCCGGCAATACCGGTGCTGACCGAACGGGCATCGTAACGAACGGGCGTCGCCTCCACCCAGCCGGTGATCACCAGCTCATCACCAATCAGCACCTCCACTTTTGAACCGTTTTTAATGCGCGGCTGAAGCGTGGTGATACCCTCATCTCCCGGCCACTGGCGGGTGATCTCCACACTGAAATCCCGCGCCAGCCGTTCAATACCGGCACCGATGCGCACCGATGTCCAGCCATTCCACTCCCGGCCATTTACCCGTAGCGTGACATTGTCGTTCATTGCACTGGCACCTTCAGAGGGATCACCGGCACAAAGCCGGGATGCGTAATGGCATTACGCCGGATAATGTCCGCGTCACGCGCCGCGTTATCAAACCAGGTCGCCGCCAGCACCAGCGCGGGTAAAACCTCATCCGGTGTGCGCTGAATGATCCGTGCAGACTGTTCAAGGCGCGTGTTGATATCCGCATTCAGATCTGCTTTCACCCGGCGCAGCGCCAGAAACAGCGCATCGCTGGTTGTACGGGACAACTCCTTATCAATTGCCGTATTCAGTGTGTCGCGAATGTCAGTCAGTTCTTCCCACGTCGGCAGGTCAACCGTGTTTTTCACCGCCGGTGCATTGTTCAGTGCCGGATGCGTGACGGAAGGCCAGCCAGTGCTCTGCGCGGGTGTTGTTGCCTGCCCCACTGCGGAATTCTGCATCACCGCGGAAGTTGTTGGCGCAGGCAATCGGGTGACGGCATACGCCGCTTCGCTGATTGCGGTCGTACGAAGGGTGCTGGCAACCACGTTACGCTGCTGCGTCGCCGTGGCGGTGGTTTTACTGTCCGTTTTCCAGACGCCGCGCGGTTGCAGATCGCTGCCGAGGCTGACACCGGAAAGCGTTTTGATCATGGTGACCAGGTCGCTGGCGTTACCATAAAGGCGTTTCCCGGTACGCCACATTTTCTGCACCTGCTCAACGAAATTTTTGCCTGACGATGGCGGCGGCAGAAGTACCGAGATATCCCCCTGCAACAGCCTGGCGGCATCCGATACGGCAGAATCCACCACTTTCATCGCATCAGAAACATACCCCAGCATTATGCTGGCATTACCGATAACGTCGTTCTGCACGAAATCCGCCACACCATCGATACTGAAACCGCTGAAGCTGTCACTGATGCAGTCATCCAGTGCAGAACAGGATGACATCAGCGTCTGCGCCGTCGCCGCACCTGATGTGGGGTAAGAGAGTTCTCCTGCTTCGACAAACTTCAGGTCAAAGCGGACAATACGCCCTTCACTTTTCGATGTGCTGACCCGAACTTCCCCGTCAACACAGACTTTCAGCTCACCATATGTCGGGTGGACAAGCGTGCCGGGACCG
GGTTTATTCAGCGCGTCAATCAGGCGATCGCGCTGGTCAAAGCAGTCATCTCCCACCACATAAGCTGTGATGGACGGGCGGAAAGTGACTTTTCCCAGATCTTCGGTATAGGGCTTGTCGCGGTTCGGATATTCGTGTGTTTCCACACGGCGACCGGTTCCCGCACTTTCTTCTTCAACCTTAAACGGTACGCCGCGAAATGACGCATCCTGAAGCCTGTCTTTCCACGTCATATAAACTCCGGATACAAAAAACCCGCCAAATCTGCTTTGTCAGTTATTTACATCGCAGAAGATGTGGCGGGAACCTAATATTTTTAATTACTATCTGAGTTGAACATCAATGGAATAAATATCACCACTCTTTATAAATTTAGAATCTGTCCTTTCATCAAAAGATTCAAATGACTGTACCTTTAAAAACTTTTTCATTTTATTTTCAAAAATACTTTCATTAACACCAGTTAAATACTTGAACGCTCTACCAGCAAGGACCTCATTACTTAAATCCATTGTGTTTTTATTGTCTTTGAAAAACCAAACAATAACCTTTTGTGGGCATGATGGATTATAAACAGATATATAAAACTGCGGCTCATATTTTTCATCAGCGTCATCACTAAGCATTTCTTCAGAAGATAATTCTCTTCTGAATTCATATTGCCGCTTAGTTATTCCTTCGTCCTTTATTATCTCTTGCTTAACTGGTGCAATACCTATAGAAGAGATTAATTCTGACTCATTAAAGCTGAACTTACACTCTTCCGCAGCCAAGTTAAAAGATAAAAGTGCAGATATAAAAAAAACAAAGATACGCATAATCATCCCTTCAATCATTTGTAAGGAATGATTATATTAACTACTTAAAGCTGAAAACCCAAATTATGCCAGACAAAAACACATTAATCATTTTGTACACTACCTGAACCGCGTATAGCCAACATCATGGCTGACATCAAAACCGCTGGATCGCGTTTCCATAACCCGCATACCCGGAGGCGAATTCACAAAAGATACCTTGATCTCACCATCAACTTTTGGCACAGAAGCTTTGTTAATCATGAAGGGATTCGAGCCTGTGGCATCGGAGGCGTTGTTTGACTGAGCCGGATCCACCGCCGGATAAGGTGTGTATCCCCGCGCCGGTATTCCCGTCCCATAAGCATCATAAGCACCCGCGCCCCACTGCGCAGAGTTAATGGCATCGACCGTGTCACCGAAACTGTCGGTAAACCACTCAATAATTGGCTTCAGCTTGTCCCACATATCCTGAAACCACTTAACAACCGGTCCCCAGTTATTGATCACCATCCCCAGCGGCGACCAGGCAAAAACCTTCTTCAGAAGTTCCCAACCTGCCTCAAAATAAGGACCAATGGTTTCCCAGAGCTTCTTGAAATAAGGTCCGACAACATCCCAGTTAGTGATAATTAATCCCGCAGCCAGAGCAATCGCCGTCGCAATCATGCCAATCGGCGTCATCGACATAATCCTGCTGACAATACTGATGGCACTGCCCACGCCCATCAATCCCAGTTTCAGAATCGCAAGACCGGCAGCAAGCCCGACGACGCCGCGAATAACCCGGGGATTTTCATCCGCAAACTTCGTGAATTTCTCCCCCAACTCCCCCAGCCATTGTGTGATATTTTTAGCGTCACCAGAAAATGCGCCGCCAATAGCCGCAAGGCCGTTAGTTGCGGTCCCTGTCATTGCCTCCCACAGGTTGGACAGCGTACCAAGCTGTGCCTGAACACGTTTATTCAGGCTGGCCTGTTTATTCATCTTCTGCTGGATCTGATCGTAGCCATCCTTTCCTTTATCGATTAGTGCATTGACCACCTGAATGGTTTCGGCATCATCACCAAATATTGCCTTAAGTACATCTGTTCGCTTAACGTCGGTCAGTTTTCGCAGCTTTGCCAGTTGCCTGAACATGTTATCAAGACCGCCAAAACTTCCTTTGCCGTCAGTAAAATCGAGCTGTACCCCGAGTTTCTG
GCGGGCCATAACTTTATTAACGTCCCTGATTTTCTTAACGCTTAATCCGGACTGGATAACTTTTCGCAGGGCATTACCTGCCGACTCCCCGTTCATCCCCATCTGATCCATCATGACGCTGATGGGGGCAAGGCTCTGTGCAGCCTGAAGACCATCCTTGTTCACCATCTTCAGAACAGAACTGGTTTTAGTGAAGAAGGACAACATGTTGGTATCGTCAACGCCCAGATAAAACGCCTTCTGGATAGTGTCGAACAGCCCCATCATGTCTTCTGACGCCGTTCCGGTAGCATCCTGCATCTTTGCAGCAAACTCAGCAGCCGCTTCCGGTGTTTTTTTCAGTTGTACCGCAAGATAAGCTGTCGCTTTACCCACACCACCCAGAATGTTTTCTGCCGGGATCCCCTGACGCACCAGCATCTGCATCATGTTCTGGAAATCAGCCGTTGTACCAGGTAGCTGGTTACCCAGGCCAATAGCCAGTTTATTGATGTCCTGAAAGCTCTTTCCAACCTCGCCGTTCGCATCCATCATGGCGACTTTCAGCCCGGTGGCGGCGTTTTCCTGATCGGCATAAGATTTCAGGGAAAGCGTCAGACCCGCTGCCAATCCGCCCCCAAGCGCCAGCCCACCCTGTGACGCTTCTTCCGCCTGGCGTTTAAATCCCCGGATTTTCTTTTGCATTTTCGACAGCGCGGGAGAAAGCCTGTCGACACCGGTGATCAACGCCTTAAGCTCAAATTCAGCCATGTGTGCGTTTCTCCTGCTCTATCCTGTTTGCCTGACTGACCAGCAAGGGAATTTCACTGATCGGCATACTCAGCAATTCGAAGGGATTAATGCGCCAGTAGCTGGCGCAGTCAAAGAAGCGATCAGTGAGGTATTCAGCCGTCAGGCCTGGAGGAAAAAACCAGCCACAAGCCACGCCGCTGCATTCAGGTCTGCCGGAGACATCTGGTCGACAGAGCTTTGCGGCACTTTCGCCAGCCGCACAATGTATTTCGACACCACATGCGCCAGAAGTCTGACGGACTCATCCTGATTCATCTGGTAGGGATACCCCAGCTCGCGGACATCCTTCCCGGTGGGTTCATCAAACTCCAGTACGGAGAGTGTCTCACCATGAGCGATAATCGGTTTCTTTAACTCAAGCTCTTTCATTACTGGTAATCCCCTTCTTCACCGTGGAACTCAAGATCAACCGTGCCTTCTTCGGCATTATGGTTCGCTTCTCCGTGCAGCCAGGCGGACGACAATACATAGACCTGACCGTTCGCCAGCTCGGCAGTGATGGTCATCTCATCAGACGAGGTGATTTTGCTCACCGGAAAATTCTTCGGCACCTTGAAGGTCCCTTTGACATAAGGCGCACGGTGAGTTTCCTTGCGGTCCACTGAACCGTCCAGGCCGATGATGTCATCATTGACCGTCCTGTTCATGGGCACCTCAATGCCGCCGGTCAGCGATAGCTGCTGACCGTCAATTTTGAAATAACAGGTTCCCCCGATACGGGCCATTATGCAGACTCCTCTGAATACTGAAGACGGAACTGGTTAACCACGGCAAAAACACGCAACTGGTTAACATAGTCAGGCGGGAACAGCGTGTTCAGGCGGTTCGGATCGCTGGCATCACGCTCCACAACCAGGTACTGCTTAAACAGTTCGTAGTTTTCCACGATCCCCGCACGCTCAAGCTGACGGTAGGTTGCCAGCAGTTCCCCTTTGATTACCGCCGGGGTGACAATCGCCTGACCGGGACCAAAGCGGGTACCGTCGCTGGCAAGCTTGTGACGCCCGTACTTACTGGTAATGACGGATTTCAGTTTGCGCAGTACATACGCACTGGTATGCAGCGTCTCGCTGTCGAGGTAGCTGTTATCCGCAACCCCGTAAGCATTTTTCCTGTACGTGGTGACATCACGCTGAATGCGCAGCACCCCGCTTTCGACATACGCCGTTGCCACGCCATGAGACAGCAGGGTCTGCTGCTCGGTCATC
GTGAACCGTTTCCCCTTCGGCGCAGGCAGCATACCCACCAGCTCACCGGTCTGCGTGGGACGTGCCGGATCGTTGCGGATAAACACCGCTGCGCGGGCGGTACGGCTTGCCGCCAGCTCGTCGGCAGGCGTCTGGGTGTCTTTTTCGTACCCCGCCAGGGTAATGTGCTGCTGGTTAAACTGGTCACCTGCGGTCACCAGTTCTGACAGCGTGCCGGTCTTTGCCGTATACACATGACCATACAGCTGACGCGCATAGCTCCAGCGACCGCTGGTATCGTTCATCTCGGTCACCAGCGTGTTAACGGAGGCCGTGTCGTTGAACGGCAGGCCAATATAATCAAACGGCTCATCCGCCATTGCAGCCACCGCGCCGGTGAGAACCGGAGCACCCGTTCCGGCGGTACCCGTCGCCACGGCAATCTGTACGCCCGCTGGCAGCACTTCGCCCCCACCAAAGCCGTAGTAATTGAGGCTGACAGGAATTTCATTCCCGCAAAGCCCCTTATGACGCGCGGTCAGTGTGACCACGCCTGCCGAAGATGAAGCCGTAAACGGCAGGGTCGGAACGGCATTGATGGCATCCTGGATACTGCTGGCAATCATCGTGACGTTATCGCCGTTAGTCACCGGTGCCTGCACGCGGGTACGTCCCACATACACATTCACCGTGCCGGTTTCGGTTGCCGCCCCGGTCACCGTCAGCGTAACCGTTGCCGCCGCGCCTGTGGATTCAGGAACGGCAATCACATACAGCTCGCCAAACGGGTCAGTCTGGCGATAAGCCTCGACCATACGCGCCAGCTGACTTCCCGCACCACAAATCTGGCGTGCATAGTCTGCCGACGGCATCAGTACCAGACTGTTGGCAACAATCTCTGCACCGTTATTGGCATGACCAATCAGCAGCGATGCTCCGCTGTCCTGTGCAGTATTCGCCGCCTGGTTATCCATTTCCGCATAAAACAACGGAACCAGCGTATTCGACGGAATGGTGTTAAAGCTTATCGTCATCGGTATTCACCTTTTTATTCACGCGCCGGATATCACCAGCTGCTTCACGGCGCAGCCAGTAGTTGTTCTCGTCAACATTTCGCCCTTCGGCGGGCAAAAGGTCGCCGCGGGCAGGATCAGGAACTGACCGCCCTTTAACAGGTTTGACAAACATGAGGATCCTCAGGAAGGAAGAGTTATTTCGGTGTGATGTTCGATATCGCCGTCAGGCCCGTTACCGGGCTCGAGATAATCAACATCAATCGCCAGCGTTTGCAGTTCATCCAGACTGTTCAGATCATCCTGCTGGCGGGTATCGTCTTCAGTCAGCTCGCTGATGACCGAAAAATCGAACTGATAAATCAGCTCATGACGATTCAGATCCAGCAGCGTGCCGCCGTCATAGGTAATCGGGTTACCGCACGCCTCCGGGTTCCAGCCCAGCAGAGCCTTAAAGAGCATCTGCCGGACATCGTCCACCACATCATACGAGGCAAACTGACCGCGCTCATCACGCCCGTTACTCAGTATGACAACCACGGAGAAACCCTCTTTCAGCTCCTGCCAGTAGTCGGTCTGGCTTTTGTTTTCTCCCGGAGAATCATCACCCGGTACCACATATGCCGCCGGGAGCTTCAGCTTTCCGACCTCCGGCAGATTTTTGAACTGGGCCGCGCCTGCAACCCGGTTTTCAAAATACGGACAGCGGGCACGCAGTGCAGCAATAACAGGCGTCAGTTTCATCTGTGTCGTCGCTCCGGCTTCAGTGATTTACGCAATTCCCGCGCCAGAAAATAGCGTGTCCAGCTGCGGTTCTTTTCAAGCGTTTCCACCATGAAGTTATTACGTGGAGCAAGTCGCCAGCCGCTGCCACCGGATGCACCACGATGATGGCTGCGACGACGCTTTTCCCCTCGCCTCACGCCATAGAACAAAAAAGCCGGATAAAAATCACCGGTGATACGGCGGTTTCCCTCTCCATTACGCTGGTTAGGGGCTATACGTGCC
ATAAAACCAGGGCGATGTTTACTGGCTCTGGGTACCATGTAACCAATCGAACGAGCCAGGCGTCCGGTCTGATAACCGGGGTTTTCACCCGGTGCCGACCGCGCACGGCGCATCACCAGCCGACGGGCATCACGCATATGACGCTGACCAATCGTGACAAACGCCCGCCGGACACGGGCGCGGTTAAAGCGCATCTCCGCGGGCTGCTGAAAATCAACGTGCAAAAAGGAAGTCGTCATTGTTGCCTCCGTGACTCTGCCTACATTCGCCCAGCTCCGTACACTCCAGCAGCAGAAAGCGCCGCGCCCCGTTCAGATCGCGCTGACGTTTCACCCGGTACACACTGTCACCGCAGACCACCTCATAATCAGCGGTGATCCCCCGGCGGTAACGAATGGTGATGTAATGGGTGATGGCGTCCCCGGTCTGCGCGGTTTCCTGCCAGGTGGTGGCACTGGTCTGGATAACCTTCGCCCATGTCCGGAACGTAACCGGGTATTGAGGCTCCACGCCAAAGTTATCCGCGGGCATATCCACCCGCAGGCGGATCAGGACGCGTTTATTCAGTTCACCGGGGTCCGGCAGAATGTAGGTTGCGCTGGTCTGCGCCTGACGAATTTTCATTGCGGAAAGTACCTGTACGGGCCGACAAGCCAGCCAAAACTCTGCGGCATGTCGAGTTTCTCCACTTCCGTAACCGACGAGCGGTTTTCGTAAAAATGGCTGATAAGCATCAGCATCCCCAGACGAATATCATCCGGCAGGTGCAGCCCGTCCGGATCGCTGTCCGGAATGGTTTCATCCGGTGCATAGAGCTTCCGGTTCAGATACGTTTCCGTCCGCTTTTGTGCCGCACATGCCAGCAGTTGCAGATGGCGGTCATCAGTATCGAAATCCTCATCCAGCCGGAGTTGGGCTTTAATCTCTTCCATTGTCAGAAGCATACTCAGCCCTCTTTACTGGTCGTGGCTTTTTCTCTTTTGCCGCTTTACTGCTTTTTGCACTGATTCCGCGCTCTGCTAACCCGGCCTGAAGTGCAATCTCCTGCACCCGGGCAGGAAGCGCCCCGTCGTCATACTCACCGGCCTGAATGACCTCAACACGCATACCGTCCGGTGACCATTTCAGATCTTGTTTCAGGATCATGATTCTTCACCCGTCAGAACAGGGGGCGCGGTTCCGCGCCCCTGAGTGATTACGCCGCTGCAATCTTCAGCAGTTTGATGGCCTGCGAATCGACCAGCATCCCGCCGGTGCGCTTGGTGGTATAAAAACCGACAAACGGTTTATTGGTGTACGGGTCACGCAGAATGCGGGTGCCGATACGGTCAACGATGGTGTAACCCCGTTTGAAGTTACCAAATGCAATGGCTTTCGCATCAGCGGCGATATCCGGCATCTGTTCGTTTTCAGCGATACCGTAACCCGCCAGAGAGGACGGCTGCCCCAGTTCCAGCCCCGGACGCCACAGATAGTTACCCTCGGTGTCTTTCAGCAGACGGATGGCAAACAGGCTGTTGTTGTTCATCATGAACTTCGCGCCAGTGCGGTGTGCCTTACGCAGCGTGTAAATCAGTTTGATAATGGCGTCTGCGGTCACCGCGGTCGCTTCGCCGGATACAATATGCTGAAGTTTGCCGAACGCCCGGACCTTGTCGGTTTCATCAGTGGATTCATACGCCAGGAACCCTTTCGGCTTCTTGGTGCCATCGCCTGAGGTAAAGGCAATTTCTTCCTGTTCGGCAAATTCGGTTGCCAGCTCGCTGTTGATCCAGGCCTCCACGTTGAAGAAGGCATCGTCCAGCATTTTCTGGGTAGCCTGCGGGTTGCCGTAGATTTCCCCCATGAGAGGTTCAATCAGCTCCAGTCTGGAGGTGGCAGTCTGGGATCGCGTATCCGTTTCCCCCACCCATCCGGAAGCCGTACCGCCCAGATTCACCAGTTTTTTGTAGTCGGAACCGCCAACGGTGATCACCGTGGCTTCCTGACGCATCACCACT
TCATCTTTCAGCAGGTTAAGAATGTTGCGATCCAGTTCTTCCGGCACGGCGTAGCCACCGTCTTCATCGGTACCCACCTGCAATGCCTTACGCTCCAGATCGCGCAGACCGTCTTCACGGCCTTTACGTAGAAAGCCCACAAACGCCTCTTTATGCTCGGTGGCCAGTTTATTTTGCGCTCCACCTGCCGGACGTTTCAGCTCAAGCAGCTCTTTTTCAAGGTCGCTTTTGAGATTTTCCAGCTCGCTGAGTTTCCCGTTCAGGGTTTCCACCTGCCCGGCAAGCTTGCCTTTTTCCTGCTCAATCGCATCCACGCGCTTGTCGTTCTTTGCTTTGAAGTCGTCAAACTTCTGCTGCAGCTCCTGCGCGACCTGTTCGACATCTTTAATATCAACCGCCATCGTATTTCTCCTGATTAGAAGTTCAGATTTTTCAGTGCATTCAGTGCATTCAGTGCAGAGCCCACATCCTCAGCGTCGCGCAGGGACAGTGCGCCATAGCCCCCGGCCATGAATGCTTTGGCCTGGGTACGGGAGAGTCCGACATCACGCAGGACTCTTTCGATTTTTTTCTGTTCGGGGATTTCCCCGCGGGCCAGTGCGTTCTTGACGTCGCTGATCCGCGCCTCGTCGTTAGACGGGAACGTCACCAGGCTGACTTCCCAGAGGTCGATTTCTTTCAGCAGAAAGGCTTCTTTGCTCCGGTCGTATTCCCAGTCTTTCAGGACGTACCCAATAGAAAGGCCGGTTAACGAACCGGCCTTCATGTGTGCATGTGCGCGTTTTGCGAGGGGATCATCATCAATAAGCAACCGTCCCCTGACGTAAAGCCCGACATCGTCTTCCTTCATTTCGGTGTAAACACCGATGGGTTCATCCATGCGGTGCTGCCAGAGCAGCGCAGGTAACGCTTTTCTGTCACTCCACGCCCGCAGGGAAGCAGCAAATGCCCCGGACATCACCACATCATCGTGGCTGTCCTTTACACCAAAGACGGAGCCATACCCTTCAAACTCACCGGAGTCACTGACAGATTTCAGACTCAGCGGTACATCAAGACGTTGTTTCGTCTGCATTGGCGTTATCCTTCTGCTTACCGGCTTTACTGCCATCGGAGGGTTTCGTGGTCATGTTCATCGGTGTGAGATAGACATCACCACCGGGACGCGGATTCATATCTTCCAGGTCGCGGCAGTCATTGGGAGAGTAAATTCCCCAGTTGATCCCGGTGGCGTAGGCTTCAAAACGGGACTTCATATCCCCGCGCAGTAACGCCCCGGCGTTAAATTTGGCGTAATAAACGCCCTGCTTACTTTTTCGTACCAGTCCGGTGTTGATCCGCTGTTCGATGCGGGTCAGATACGGCACCAGTGAATAGTTGATAAATCCCAGCCCCAGCTCTTCGATATTATTGAAGGTGGCGCGATCGGTGTTCTGCACCATGTGCAACGGCACCCGGAACAGACGACAGATTTCTTCAAGCTGAAACTTGCGGGTTTCCAGGAACTGGCTGTCCTCGGCGTTCAGCGCCATCGACTTCCAGTCCAGCCCCATCTCAAGGATCATCGGGCGGTGAGCATTGCCAAGCCCGGTGTGACGCTCCTCAAAATCTTTCTTCAGGCGCTCATAAGCCTGATCCGACAGCGTCTGCTCTGTACGCAAAACACCCGATGTCACCGCGCCATTGCTGAACAGTCTGGCCCCGTGCTCTTCGGTCGCTGCCGCCAGCGATATTGCCTCGCGGGCATAGGCGATGGGATTCAGCCCCACCAGTCCGTCCAGCGTCAGCGTACGCACATGCCAGATATCCTCCTGGCTCAGTACATCCGTGGAGCCATCCGGGAATGTGACCTGATAGACCGGTTCCCAGCTACTGTTAAGCTTCGGTACCACACTGCCGGGATCGACGGGCAGCAGTTCAGCCACTTCGCCAAATGCTTTCACTTTGTAGGCGTAAAAGTTTCCCCTCAGGCACAGACAGGTGACCACCAGCTCCCAGAA
CTCCTGCGGCGTCATATAGCCATTGGGATGCGTGGAGATCAGTTTATGCAGACGTTCGCCGGTGGCTCTCTGCTTCAGGCTGCCGTTCAGGTGATACAGATTGCAGGGCAACATCCCGACCGACTCTGCCAGCACTCTGACGCAGGAAAAAACCGCCGTCAGTCGCATGGCCCGCTGACTGCTGATCTGCTTTCCGGTATAGGTGTCATACGACAGCCCGATGGCATCCGCCAGCTCTGCTGGCGTGGTCACCGGTGCGTCACTTTTTCGTTGAAATAATCCCGAAAAGAACACTATTTACCTCCGCCGACAGACGACTGTGTACGGTCGAGATATCGCGCCACCAGCCACGACCAGAACAGGCACAGCGCCCCGGCAACAACAAAACCCGCCGGGGGATAAATCAGCCAGGCACCATACGCCAGCAAAAGCGCACCCAGCACGCCCACCAGAGGCGCGAGAATCAGCATGATCATAATTACCTCAGTTAAAGCGAGCGGATCCCATAGGACTCAATGTGGTCAGACAGCGTGTCTTCTTTCTCGTACAGCATGGCTCTGCCAACCGCCATAATCAGCGCAACTGCACCATCGATTTTGTTTTCCGCCTGCTCTTTGACGGGCTTCACTAAATCATCGTTACCTGGCATGTTTTTGCCGACCACATTGCCGATACACCAGGTCATGATGGGATTGCCGTCATGATGAAAGCGTCCCGATTCAATCGCTGCCTCCAGCTCTTTCATCGGGTCGGACATATTGGCGAAGTTCTGGACGATAGTAACGGGATTCAGGTCTTCATCAGCAAGGTCATGTGACAGCCCGGTCGCTCCAAAAGGGTCGATGGGTGACTCACTGACCGGGCTGATTTTGTTCGCCGCTTTGGCCTCTTCGAGGATGTAGCGATAATCCACCTCTGCACCATCGGTAACGGTCAGAACGCCCATTTCCACCCATTTCTGAAAGCGTTCGGCTGTCCGGCGATCTTCATTTTTCTCGACGCTGTACACCGTGTCATACGGTACCCAGAAACGCGGGGCCACACTGTAGTAATGCGTTTTACCGTCAATCTCGCGGGTATAAAGTCGCGCCATGCTGTTCATATCCAGCTTACGCGCCAGGTCAAAGGCCAGAATGCACGGCTGCCCCTCGAACTGCTCAAGAGTCAGTGATTTATCCTCGCAGCTCTGCCAGCTCACCAGGTTGAAATACGCCGAACGCGCCGACACCCAGATATTGAGGTGTTTTGTTTTAAAGACGTTTGCCAGACGGGCATTATTTTTCGCACGCTGCTGCTGACTTAACAAAAATTCGCGATAAACCGACACGCCAATATTTGGATTGGCTTTTTCCAGCACCTGCGGGTCGGTCCAGTCGTCACCTTCATCAACGGTATAGATGATCCCGAACAGTTCATCGTTAGGCACCGAGCCGTTGAGCATCTCGATGACTTCCCGCCGTTTGTCGTAGCACGGCCCCTCAATGTTGTACCCGGCGGTGGTGATGGCCCACATCAGTGGCTGACGTCGCGCCCCCATCCCGGTAAGCATTGTGGTATAAAGCGCATCGGTGGCATGCTCGTGATATTCATCCACCACGGCACAGTGGGGTGATGAACCATCACCTGGGTTGCCGATCAGCGGTTCAAACCGCGCGCCATCCTCCGGACGGTTCATGTTTGAGGCGTTAACCTCAATCCCGAACGCTTCTGTCAGCATGGGTGTGCGTTTACACATCAGTCGCGCCGGGCGAAAGACTTCCCACGCCTGTTTCTCTGTCGTGGCACCGGAATACACTTCCGCGCCAAACTCGTTATCACAGGCAAAACAATACAGGGCAACACCGGCAGAGATTGCTGATTTGCCGTTCTTACGGGGGATTTCGGTGTACACCTCCCGGAAGCGGCGCAACCGGGTGCCTTTATTGACCCAGCCAAACGCACAGCAGATCACAAATAGCTGCCACGGCTCCAGCGTGATGGGCATCCGTTTGAATGCCCAC
TCCCCCTTGGTGTGCGGCAACAGCTGAATAAATTTCGCGGCCCGTTCAGCCAGGTCCTTGTCGAAGCGGTAACGAAACGACTTACTTTTTTCCGCCATCAGGTCATCAAGATGGCGCTGGCAGGCCTGAATCACAAACTGGCAGGCAACAATCTTTCCGCGCACGACATCCCGGGCATACTGATTGGCTGCATTTACGTTGGGGTAAGATTTCCGGCTCATGATTCGATGATTTTCAGATTGTCAGAAACGGGTTAGTGGCTTTCTTCTTCCCCGCCAGGCCAATCAGACGCTGGCGGCTGCTGGGGTCGAGTCCGAGCATTGCCCCCGTGCTGCTCATCTCGGACTCCTGTTCTTTTTTGGCGGTCAGCTCCGGATTTTTGACCATGCCGCCCATTGCACCGGTGATGGTGTTGCCCTGTCTGGCAATATTTTTCACGGCACGTCGCCAGAATTCATAGGCCACACACCACCGCTCAAGTACCGCCAGGTCAGTCACGCACAGCAGGCCCTGACCGCAGAGTTCTTTGGTTGTCAGTTGCCACATGATCGTGGCGAGAGGGAGATCTTCTTCAGCGAACCACTCCGGTGGCTCAACACCTTTGATGGGCGTAAAAACAGGTTCATCTTTATTCAGGGCTCGCTTGCCGGGGTTTCCGGCCAGCGCCTTGCGCGCCGTTGGCTTGGGGCGACGCCCGGAACGCCCCGCCGTTCCAGCCATATGCGGCACTCCTGGTTAAATTTCATTTTTCGCGGGTATAAAAAAACGATGGGGCGGGCAGTCCGGAAGACGTCAGGTCACAGGGATTTGACCCGCCCCTCCCCTCAGACAGTTGAGAATTATTATCACTTTAACCGTTCACGGGCCGTCTTCGCCTTATGGCAGGGCCAGCACAGACTCTGCAGATTACTGTCGGCATCAGTGCCGCCATGCGCTTTAGGAACGATGTGGTCAACGGTTTTCGCCTCACGCACCACACCAGCATGCAGACATGACTGACATAAACCTTTGTCACGCTTCAGGACGCGCTCGCGGATACCGTCCCACTTCGAACCATAACCGCGCTGATGACGGGACTGGCCCGGCTTGTATTGCTTCCAGCCTTCGCTTTTGTGGCTTTCGCAATAGCCTGACGGGTCTGTGGTTGTAGAGCGGCAGCCGCGAACACGGCAGGCTTTTGGGATTCGTGGGGGCATATGCACTCCAATGAAGAAGCCACCGACATAGCCTCCTCCATTCATAGTGAAACTATTTTCATCTACCCAGTAATGAATTCTTTGAAGAGTCGAGATCAATACAACTCACTAATGGGAGAGGTTTGTCCAACACGTTGGACAAGCCTCCCGTTTGATTTACTTGACACTATAGAAGGACAGAATGCCTTCCTCACTCGAATAACATCAATTAAGGAGGTTCAACATGTTTCATTCCACAAGTCATCAGTCTGTAATTATGGTAGCATCAGTTTGTGCCACATACCTTTTCCGCTTCACTTTGAGTCTGATTCATTTCTACCTGACCGGCTCGCCTCTATCTTTCTAATCCCCGCTTTGTCAATATTGCATTGACCCAACGCTGACAACAGACTCACATTCAACTCCAGACTGGCACCATACGTCAGCGGATTGGGTATAAACGGTACAGAAGTATCAGAAGTCAGGCTGGCTGGCAGTGGTGCCACCGGAGCGCTCACGTAAACCGTTCGCGAATTTCCGCAACCGGTCAGCAGCGGCAGCAGGCACAGGACGTGAAGCACAATCATCATCCGCAACAGCCACTTTGATATCTTCCTGGGTTCTCTGTGACTCCAGTGTGATCTGCTGTTTTGCATGCTGGTTAGCCTCCAGAACTGTATTGACGATTTGCAGTGATTGCAGGACGTTATTGGTAATGACAGTTGCCGATTTGGCATTTTGTACAGCCTCATCAGCACGTTTCTTTTCGTGCTGATATTTGCTGTAGTAGTGGTTGGCAGACCAGATGAAAGAACCGATGACAGT
AAAGAAGAATGCAGCGATAACCAACTTATAGCTCAACTTCATTTACCACCCCACCAGCCTCTTTAAACCGGGCAATCAGATCACCGATTTTATGTTCATACTGACCGTAACCAGCACCCGGCAATGAAGCCCAGATATTGCTGCAACGGTCGATTGCCTGACGGATATCACCGCGATCAATCATCGGCAAAGCGCCACGCTCTTTAATCTGTTGCAATGCCACAGCGTCCTGACTTTTGGGAGAGAAGTCTTTCAGGCCAAGCTGATTACGGTAGGCATCCCACCAACGGGAAAGAAGCTGGTAACGTCCGGCGGCTGTTGATTTGAGTTTGGGGTTTAGCGTGACAAGTTTGCGAGGGTGATCGGAGTAATCAGTAAACAGTTCACCACCGACAATAACATCATAACCGTGGTTACGTGTCGGTTGTCGCCCGTTATCCGTTCCTTCTGACCATGCAACCATATCCAGGAAAGCTTTACGCTGGGAATTTAGTACCTGCATAAATTACTCCTTCGAGCTACCAAACTTGTTACCGATTACTCTCATTGCAGCCCCACGAATAGCATCGACACCGATCAGCCCCACTCCACCACCAATGGCAACAGAAAGTGATTTAGGCCATCCGACATACTCAAGAGCGGATGCAAAGGTCAGCGTCAGAGCACCACAGAGTAAAATCTCGAGCGTTTTTCGCTTCCAGCCACCACCGCCACCAAAATAGGCAATACGTAAACCAGCCATAACGATCGACATAATTACTGCGCCCAGCGGTGTGTCTCCACGCCACCAGCTCTGTAACAATTCAAGTAAGTCAGACCAGGAATGAGGATCGTTATGCATTTTTATAATTCCCACCTCCGGTTATCGGAAGTGCAACGAGTGAAGGGAAAGAAGCTGGTTATAGCGCTGAGTCGCAAAAGTTGCGTAGTGCACAAAAAAGGCCGCCTACAGGCAGCCTCTTTTTATAATTCATTGAGTTAACAACATTTAAATGCTGGTGGTATAGAAGGTTTTTCACCAGAACGACAAGCCGGACACCATGACTGAACGATATGCCTTCCGTCTCCCATATCCCTATAACCAAAGTCTATTGTTTTATAAAAAACGTGCACTCCGGCATCAGCAGAACATTTTGGGCAAGATTTATATTCAACGCCATCTTGTTCAACTTCTCGTGAATAACTTAATGACTGCTCACAAACAGAACAACGCTCCACCATATATGCTCTCCTGTTTTTGATAGAGATTTATGGGTAGCAATTCCATTCAAAAGAAACATTGAAGGGTGTCACTTTTTCAAAATGAGCGTAGCTGGCTGCCAGTTTTTTGTACAACACACTTTAAGGAAGGAGAGCCTTAAAAACACAATTGACATCAATAAAAAACCGCTCGGTGGCGGTTTCTTGAAGATTATCAACGGTAGACACACAAAACCCATCGTTAGGAGAATCCTAACCAGATTTTTTGAAAAATGCAAGAATCATGTCGCTATCTTCGGCGAAAATCATTTATCTCGTCACTTTTCTTAATTGCGCCTCAGCATATGCTTCTTCCTGCCAGCACTTTGTCACCAGTTTATCAATGACATCTGCATATCCTTTGTACCACTCATAATCCGTCAGGTCTGGTACCAGCTTCTGGACATGATGCCGCGCCAGTGTGGTTGGTAAACGGCTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGTGCCATGAAGCCGGGTTCTTTTTTCATCCAGGACAATACCTTTACCCTTACACCCTCTGCACGCTGTGCTGACTTCTCCCTTACCATGGCAATGCTGACATAGTTCCTTCACCCACTCTTCCTTGACAACAGATTCCCCGCTTCTGGAGTGTTTCACCACTTCGCGCAATACATTATGAAATCCAGTACCAGCACAATGCTCACAGCGAGCCTTACTTGCCGCAGACCTGGAATAATCAGCAAAGGCAAAATTCACAAGGTAAGGGATGATCTGTAACC
GGGTTTCTTCACTCAATTTATTCAATGTCGGGTTATCCAGTGCCATCGCGTAATTGAGCAGACCTTCAATCGCAAACTGAGGATCCTGAACACCAACTTTTGCCAGGAATAAGGCAAAACCCAGTGGTGCTTTCGACTGCACCATCCCCTGCGCAGCCATCACATCTGTAATTGTTAAACCACCCGAGCCTGTCGCCGGTGCGTCATCACTCAATTTTGGAGATTTTGGGGAGTAATATTTTGGTAAGGCTTCAAGGTTCATGCTCGTTCTCCACTTACGCCAGTACGCCTATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAGAGGTCATGCGCTTTTGTACCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCAGGCTTTCCACAACATGCACACGGCTGCGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTTGGACGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCTTTATCCTGGATGATGCTGGTGGCAGGAACCGAATGTACAAGGTCACTTTCCCGGGTGGCCGCCTGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCGCTTTCCGGTAAGGCATCCGCCAGATCATTACGAACCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCGCCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACCGCCTCATTGCGGAACACAGCCCGAACGGCAGGATCATCCGCCAGTGGTTGTGATGCCGCCGGAACGGCACCACTGGCAAAAGATGAATAACGTTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCGGGTCTGAACAATACGATCCCCATACGCGGGGCAATCTCAGGGGTCAGTAGCGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCACGCGGATTTGCCGGACTGGTGATGTTCTTGCCGAACATGCAGCCTTTCGCCGTCAGCGACCAGAATTTTTTGATGTTATTAATCGCGGTACGGCTGTATCGTTCGCGCTGTTCGACGATCCCCAGTTTCACCATCTGGTGATATGCCTGATTAGCCGTAAGGCGGATACCATACTGTTTCAGCAGTGCACTCAGTGATAGTGTCGGGCGACTTGAGCCATCGTGTGCATCAGCAGGGGCATCAATGGCATAGCGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCACCAAGCACAGATGAGTTAGACAGATTTAACTCCCGACGCATAAAGTCCAGCAGAATCACGCCAGCCTGCATCTTGTCAGCAGCCTGCCCGGATAATTTTTCCGGTGCGCTGGTTACCATATCGAAAGTACGGATCACCTTCAGATGAAATGACGGGCTGATCCACATTGCATAGGCATACACCAGTTCTTTGCAGACATACGTCCCCTGGTTATTTCCGCCACGAATAACGTTAACTGGCTCTATATTGACCGAGTTGCAAATCTGCAACTCGCTTATTAAACGTTCAGTTTGCTCATTGCGGAGCCAGAATGCAGGCTTATGCTTATCCAGAGAACCAGCAGCCCTGTGCAGAT
CGTTCAGGCTGTAACGCCCAAAAGCATCACGACGAACTTCAATACCATCAATGACCATCAGATTATTCATACTTCGTTTCTCCTCTCAATCAGGCGGCTGCACCCGCCGTTTTCTCGTACTTACTGATAGTGATCTCGACCTTCCCTTCCGGGATAACCGGTCCCCACTCCACCAGCATTCTTTTCACCTGACTGTCGTCTTCCCACACACCCGCGTGGGTCAGGGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCGGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTTGGCAGACGACGTAACTGCTCAACTATTGCTGCACACGCCGTGCTCTGGAATTTGCGCCCCGCCGCGCTTATCAGGCTCTTACCTGCAAACGCTCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACGCTGGGCGGAAAAGGCAGGATCAGCTTCATACTTTCAGGCCCCTCTCATGTAACCAGTGGGCTGCACGCAACCTGGCGTTTTCCTCACCGGCAAGCAGTGCGCGGATAATCCCGGCCGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTTATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACCGCTGCATTCACCGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGGAAGCCACGTTCTTTCTGATAAGAAATCAGCATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTGACCAGTTTTTCTGCCTGCTGGCGAACCTGCGCCAGAAAGGCCTCACCACATGCCTCAAGTTCGTCGCGCCCGATGTAGCTGATTGCCGGTCCCTTCCAGGTCTTGTCGAAAACAGCAATAGCACCAGCGAAGAAAGCTCCTGTCGGCACCTGCTTCTCATCCTTCGGGATAAACCAGGCAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGGTCCGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCGCCCTTATCACGCATGGCACTGGCATGTTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGGGGATTACCAAAGGCAGCACCTTTAAGCTCCGCAAGACGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCGGCACATTTGGCGTTATCACCGTCAGTGAACAGATCCAGAACAAACGGACCAAACAAGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCTACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAAAAAGCATCCGCAATGTCTCCGGAAGTACAGCCCGGATGGGCTTCAATGAATTTCTGAACGTCATTTAACAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCTTCGCGTTTTTCTGTGTACGTGCTGACGGACGGCGATAAGCGCAGGGAAAGCTCATCCCATTTTTCCCGCAGCTTCGACGGGCTGAGCACGTTACGGCACCAGAACGGATCGCGGCTGACGCGGCTGTACATCTCGCAGATTTGTTTGTGAGTACGACCATCCTGCACACACATCAGGCGAATTTCGTTTGCCCAGGCTGTCCAGTTCGGTTCTTTGGGACGAACCACCTCGCCGTCACATTCGGCGGCCTGCTCGTACAGGGCGATGATTTTTTTCCAGAGCCACTGTGCGCAGGTCAAATCATCCTGCGTTCCCCACTGGCGCTTTTTAGGGCTGAATACAACCGCATCAGGATGGCGAGTTAAAAACTCCTGTTCAGCCGTC
TGCGTGTCCGGTTGCGAAGCGTCCGGACGAGAAGTTTTTTTATCTGACGGATCATGTTTTGATTTTACTGACGGATCCCCGCCAGATTCTGACGGGTGAAAACCCGCTTTTTTGCCAGATTTCGACGCATCAAATTTTGACGGGTCAGATTTTGATGCGTCAGATTTTGACGGGTCAGAATCTGACAGTTGAGAAAATGCCGCTGCCTGAAGCTTCGCAACGTTAAGCTGATAAACATTCGACGCATTGCGGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTTCTGACGGTACTCATTCCCGCGCCAATCTGGCGGGCAATGGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACGCTGGATAATTTCATGCCTGATGCAGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGTTCGAGCGGGCTGAAGCACTCATGCTCATAGCCTTCGCGGAGGTAGATAACTCGTTGTGTTTCCGGCTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTGTCGCAAAGGACTGTCTCCTTCTGCCGGTTGAAATCACCCACAGCCCACTCAGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTAACTGCATAGCAATACTCCACCTTCGCTTTTCCACCCGGTACAGGAAGCGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAGTCACGACACGCCACGGCGCCCGGAGCTGCACACTCGCGGGCGTCATTACTTTCTGAAATGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGCAGCTTCGAAATTGAGAGATACAGCTCGTCGTTAATTGCTGTCTTCTCATGCGGTTCCACCACACCGTCTTCGATTGCCGAACGAATCTGCTTTGAGTAACTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACGAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGTAGTGAGTGCCAGCTGCGCGCTGTAAAACCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCAGTCAGCAGCTTCAGCGTAACCACCCGGCAACGCCGCGATAGTTTTTCTGACAGCTTTCACGTACCACTTAGGCTGTTTTTCTACTTTCCAGTGATGCTTACCCACGGCTATCTCCTTAAAACTGTGGTTACTTTTCATCTGATGAATCTTTAATCTTTTGAAAAATATCTGGACGTAATTTTTCTTTTGATATGCCAGTGGTCTTTTCAATGAATATCGAGAGCTTTGCAGGGGGACGCTTTTCTCTGTTCAACCAGTTCCAGACATGTTGTTGCTTTACTAAATGACCGCTGCTGGCTGTGAGCTTCCGAGCCAATTCTGATTGACCACCAGCCAGAGCGATTGCCTCCGATAAGGCTAATTGCTCAGGTGTCATAGCTTTCTCCTTTTTAGGTAGTTAAGTTGTTACGAGTTGCAAGAATACAACATTAACAACTTTTATCACAACTTTTAGGTGTTGGAAAGCTAAAACATAAAGTTGTAACCTCATCAAAAAAGAGAGGGATATGTTGTGAAAACACTGGCAGAACGATTAAAGATAGGTAGAGAGAAAGCTGGCATGAGCCAAGCTCAACTAGCTGAAAAAATTGGACTTTCACAACAATCTGTAGCCAAAATAGAGAATGGCGAAACTCTACAACCGCGCAAAATTAAAGAAATTGCAAAAGTTTTAGGTGTATCACAAAAGTGGTTACAACTTGGTATTGAAGACAACGCATCC
ATACCTGATCTTGTTGTAAAAGAAGCAGAAAGCACCGCATTAGACCCCGATATTTTCGTAAACATTCCTGTTTTAGATGTCGAGTTATCGGCAGGTAACGGATGTCTGGCTGAAATAGTTGAATCAGCTATTGACTGGTTTCCATTAAGAAGAGCAGATTTGAGAAAATCTGGCGTATGTGCATCTAATGCCAAGATCGTAAAAATATGGGGGAACAGTTTATTACCGGTTCTCAATAATGGAGATCTTGTTGCCGTTGATATTTCTCAAACCGTTCCTATTCGTGATGGCGATCTTTATGCCGTACGAGATGGTGTATTGCTAAGGGTTAAAATACTTATCAACTTACCTGACGGTGGCTTGATTCTTAGAAGCTTCAACAAAGATGAGTACCCAGATGAAATACTCACCTTTGAAGATAGACGAGCCAGAATTCATGTTATAGGTAGGGTATTCTGGTCATCGCGAACTTGGTAATGCATCGAAAAGCATTTCTTCAGAAATAATTTTAAGTTTTGCACCATTATCATCCCTATAAGATATAGCCTTTTCGATCTTCCTTCCGTGACTTGAGAATTTCCAATCACGGGAGGAAAGCGTCCCAATTACTAAAAAATCCAACTTTTGAGTAATTCCACTACTGATGTTCCCACCAGCATTTTTAATCAAATTTTCAACTACGGCTCTCTTTCCTGCAACAAAAGTGCCTGTAAGACAATAGGTTTTACCCTCTAACTCTATCGAAGCCCCTACATCAATAGGCAGCCTGGTCGCCAAACCATCCACCACTCCACTTTCCAAGTCACATCCTGTGAAGTCTACTAATGCCTTATGTAGAGTTAAACTCTCATCTTCAGTAATAACACCATCTTTAAGAATTTCCTTTACAAGTGCATAAAGTTTTTTTCCTGGGTAGTTGTTCTTCAAAGCTCCATTTTGCTCAAGCCACCAATTAAGATATCTTATTTCTTCTTGAGTTAAGTTCCGATCAGCAATTAATCCTTTACATAGTCCATTAAGTAAATGGACATCTACATCCTTGGAGTAAAAATCAATTTCAGGGATATCAAGAATTTCCCTCTGTATTTGGAGAAGGCTATTTTTAAGGTCATCACGTTCTTCTGATGTGATTATTCCATCTGCAAGAATATCCGACACCCGTGCTGATAGACTTTTTATAACTCCATTATTGATAATCTGCTTTGCTTCAAGTAACCATGTATCTAAGTAAAGAACCTCCTCTTCACGGACAACTCCATCTGCAATAATTCCATCAATGATGCTAATCAAGTTAGCAAATAACTTGTCCCGGTTCTGTGTGTAATTAAAAGCGTAAAGCGCGTCTTCCATACAACCTCCTTTTTTTGATAATCCTTGCACTCCTTGGCTACTCGTTCAAACCACATAAAGTTGTTGACAACATTCAAAACCACAACTAAATTACAACTTAAAGGTGTTAAAACAACGAACAGGCAGGACGCCCACGAAGTAGCCCGCCTGGTACGTACGAAGACCGGGATGATTCGTTAGCGGATGATTTCAGTGGAGAGAATAGATGAATGAGCAGAATTTGAAGCATGTGATCGCATTGTTGCTGGAAGACGCTAAACGTTTGCAGCAGATAGAGCCAAATGCAGGCACTGAGGCCCGTATTTTGTTAGCAAAACAGGCATTAAAGACTTGCGGGGCGCAAGACCCTGATCGAACCAAGTTCATGAATTTCATGGCTAACACGATCACCCCCCTGCCATGCAATGGAGAGAGGGTGAGCCGTGTTTATCACGACACAATGGTTAAGGCATTAAGAATCGAGCTTGATGGGCTTAGGCGTAAGATCGTGATGAACAAAATCGTTGCCAACTAAGGAAGCAGACGGAAGTAAGCATGCGCTTTGTTCAAATTTGCAGACAAATATATTTGCGTCAACACCAGCACTGTTAGCAATGGAAAAAGTTTGATCAAGGATTTGTTGGCAGTTCATTGTGCTTTTGAGGATATAT
CCCTCTGGAATCAGTCTGCAGCAGCTATCGTCAGACTCTTTGATTGTTTTCTGGTACAGAAAGTTAAGCATTAATTCTTCAAATTTTTTGGTCTGTTCGGCTGTTGCTTCAAACAGACGAACGTGAACATAAAACTGGTTCATTAGGTTTCCTTGCTGGCTGTGTGAGAACTCCAGCATACCACCGAGCCTGAAGTGGTGAAAAGACAGGCAATAGTTTCATTGCTGTGTGTAGTCTTGGAGGTACCAGCTTGTACCCTTGCTTCCGGCTGGTACCGTCCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGAGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACTAACCGCCCCTACGGGGGCATTCATGGAAATGTAATTGACTCAATAATCGCCGGACGGTGAGGGCTTTCTTTTACCCGAATTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACAACATCGATATTAACGATATCGCGGTTTCCCTTTCAAATATCTGTCGCTTTGCCGGTCATCTTTCACACTTCTACAGCGTCGCCCAACATGCGGTGCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATCCCCGCGCCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTGCCCCCGGTTATGAGTACGCCCGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGTGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGCATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGGCACCGGGCCATGCCTACGGGATGTTTATGGAACGCTTCAACGAGTTATCGGAATTACGCAAATGTGCATAACTCATGTAGTTAGTTTTTCTGGCGGGAGAACATCTGCATATCTTGTTCACCTGATGGAAGAACAAAGAAAGGCTGGCAATAACGTCTGCTACATCTTTATGGATACCGGTTGCGAACATCCGCTGACATACCGCTTTATTCGGGAGGTTGTGAAGTTCTGGGGCATATCGCTAACTGTGTTGCAGGTCGATATAAATCCAGAGCTTGGGCAGCCAAATGGTTATACGGAATGGGAACCAAAGGATATTCAGACACGAATGCCGGTGCTTAAACCGTTTATGGACATGGTAAAAAAATATGGCACGCCATACATCGGCGGCGCGTTCTGCACTGACAGATTAAAACTCACCCCCTTCACAAAATACTGCGATGACCATTTCGGACGAGGGAATTACATCACATGGCTGGGTATTCGTGCAGACGAACCTCGTAGGCTGAAACCGAAATCGGGCGTCCGGTATCTTGCCGAGCTATCTGATTTTGATAAGTCGGATGTTATCCGGTGGTGGCATAAACAACCTTTTGATTTGCAAACCCCGGAGCACCTCGGGAACTGTGTTTTCTGCATCAAAAAGTCCACGCAAAAGCTGGGGCTTGCATGTAAAGACGAACCTGGTCTGATGCGAGTTTTTAATGAGCTGGTTACAGGTAAACACGTCCGGGATGGTCACCGAAAGACAAATAAAGACGTTATGTACCGTGGTCATCTGAGCCTTGACGGGATTGCCAGAATGTATGCCGACAGCGACTACAGAAATTTGTATCAGGCGATGGTGCAAGCCAGGCAATTCGATACCGGCTCGTGTTCAGAGTCATGTGAAATCTGGGGTGATCAATTGGAGTTGAAATTCGAAGAGGTGGTGGCATGACAACCAAAATTAACTATCAGGCACTGCGTGAGGCGGCAGAAGCAATAAAAATAGTAGCCACACCACAAAAATTGCTGGCATTTCGTATGAAAGTCA
CACCGCAGGTTGTGCTGGCGCTGCTGGATGAGCTGGAAGCAGCAGAGAAGCGAAACGCTGAATTACAAAGCGAGAATGCATACATCCGCAACCGGTACAAAGAACTGGACCTATTAATCGGGAAAAACATTCTGGTCATGCAGGCTGCCATTATCGAATGGCAGGCAACTGGCGACGCTAAGAGCGGACTGGCATGGATTTATAACACACTGTTTGGCCCTGGCGAATTACCGGACGAATCTGAGAAAGATGCTCAGGCCTACTTTAATCGCAAATATGCACCGATTGACGAAAAGCTTATGGCGCTTCACAAGTGGTTTTGGGAACAAAGTGAAGCCGAGCGCGCCGCTGGCATTCGCATCAAAGGAGAGTGATATGGCAACTTTGCAGGAATTAATCGACCTGACGCCAGAACAGGAAAAAGCGTGGAATCGCCTTGTAAAGGCTGTAAAGGATTTCAGGGCAGCCGGAGGAAAGTTTTATAGCGTCCTGGACACGCTGAGCGCATACAACGGCGAGCACGTTGCCAGCATTGATAACGATAAGGGCTACCACACTGCAAGCGTCTATATGCCTAGCATTGATGCGCCAGGGCTAACCAGTTGGGCTGATGATTGGCACGGCATCACGCTGAAAGATGGGGTTGAAGTGGATGAGGACTAACACATGACTACTTTTACCGACAAAGAACTGATTAAAGAAATCAAAGAGCGCATAGGCAGCCTGGACGTTCGAGACAATATTGAGCGCCGGGCTTATGAAATAGCGTTAGCCTCGCTGGAAGCAGAACCGGTGGCCTGGCTGCATTCAGACAACGGCTTAGGTATTCCGGCAATAACCCGGAGTAAAAACATTGCTGACAGTTGGTTATCGAATGGCTGGTATGTTCAGCCGCTATATATAGCCCAGCCAGTACCGGTAATTCCTGATGAGGTGTTGTCCGCAATCCGGGAGGTTGCCAGGATTCGCGCCGATTTCGATGATTTTGACGGTGACAGGCGAGGTATCGGTGATTGTCTGGATGAGGCCGAGCAAGAGCTTATCGTTACCATTAACAAATATGCCAGTCAGTTGGCAGTAGAGCCGGTAGTGCCTGCTGTTCTGGAACGTTTGCGAACCATTGTAGCGGACCCACGCGCATTACCTCGCAGAAAAGAATGGGTTAGTGGGCAGCAGTACAGTTACGTACTTCTCGAAAACGTAGAAGCTATGGTTGATGAAGCCTGCCACGCTGCCATGCTTCAGGGTAGCCAACCTGTAAGCCAAACTTACAACTTGCCAGAATTAATCGAAGGCATGGAAGTTTCCATTGATGTAAGCACTTGTGATGCTGATTTAGGTAATCGCTATTTCGGCACCGTCACCGAGGCGTTAGAACTTGATACAGCCAAGAATGGTTACATCCTCCTGGTTCAGGACGCAGAGCCAAACTTCGATGTAAATGGCAACTCTCCGGGAACTCCGGATAGTTGGATAAGCTGTAGTGATCGAATGCCTGAAAAGGGCCAGAACGTGCTTATTTCGGTGAATTTCGATAGCTCTCTGGTTGAACCGCTAATATGCTCCGCACGCTATACCGGAAGCACCTTTCGGCGCGGAGATGCAACGATTAAGCCGGGTAATGGTATTGAGCAAGCAACTCACTGGATGCCGCTAACCGCCGCAGGAGGTGAAGTGATGAACAACTTAATGATCGACCTTGAGACGATGGGGAAAAATAAGGATGCACCGATCGTTTCCATTGGCGCGGTGTTCTTCACTCCAGAAACCGGAGACATCGGACAAGAATTCTATACGGTTGTTAGCCTGGAAAGTGCTATGGGGCAAGGAGCTACACCTGACGGCGATACCATCCTGTGGTGGTTGAAACAAAGCCCTGAAGCACGAGCTGCAATCTGTATTGATGATACTTTGTCGATCAGCGATGCTCTCTCAGAACTAAATCATTTCATTAACCGGCACGCAGCCAATACGAAATATTTAAAAGTCTGGGGTAACGG
GGCCACCTTCGACAACGTAATTTTACGTGGAGCTTATGAGCGAGCAGGACAAATCTGCCCGTGGGCATACTGGAATGACCACGATGTACGCACGATCGTTACGCTTGGGCGTTCCATCGGATTCGACCCCAAAATGGACATCCCTTTCGATGGCGAACGGCACAACGCCCTAGCCGATGCCCGTCATCAGGCAAAATATGTTTCCGCTATCTGGCAGAAATTAATTCCTGCCACCAGCACAGAATTATGATTTTCCCGGGTGCAGCCGGTTTTGATGGAGAAAATTATGAATACCTTGTTTTTACTGATGGCTGAATTCAATACCCCTAACATTGAACTCTCAGCAGTTAGCCAAAAGTACTTTGGCATGAGTCCAGCCACGGCAGAAGCAAAAGCAAACGCTTGTAAGTTGCCCGTTCCAACATATCGCATCGGCACATCACAAAAAGCAAAACGTTGCATCAATATTCAGGATCTTGCGGAATACATAGACAAAAGGCGAGAAGAAGGGCGTGCTGAGTGGGAAAGGGTCAGAACCCATAAACAAAGGCTCATTTAAATAGAATATGAATAAACCCATCCAAAGATGGGTTTATTCATAATGTTGAAGAGCAGCGAGTATCAGTTTTTTATGCCGTTTAACCATAGTTTTAGATATCTCAACTGCACATCGAACTTGTCTCATACAATGATCTGTAATGCGACCTTTTAGCACGTCACCATATTGGATGGCTATTTTCTCTTTAATAATTTCAAGATCGAAAGCAGCATGAGCCTCAATGCAATTAACAAATGAGTCCCATTTAAGAAAATCATGGTCCTCTCTATTAATTTCGACCTGACAAGCCATAAGATCATTGTTTCTCTTAATAAATTCATTTATATCTGAATTTATTAGAAGAACTAAAAGAGGTTCACAGCAAACAACCACCATATATTTTACTTTAGGTGGGGTCGTAAAATCACAATGAAGATACAATACATCACCGGGTGATATACCTCTTTCACGACTAAAATTAGCCTTAAAATCAGGAGGGAAACAATCACCCAGCATAAGTATCAATATCCGTTCTTCAGATAATCCAGTATTAATTTGCTATTTTTTAGCTGTGCAACTATCGATTCCAAAGACATCTCACCATTGTGATCTGCCTGTTCCCATGCCGCGTCATGGCTCATGGTTCTGATAGCTTCAAAGGACATGTTTCCAAGCATCGCGATAGACTTATCAATACACTCTAAATCTGAGTCACTAAAAAAGTCTTCATCAGCTTCACGGCTCGGCACAATCGTCATACCTGATACAGAAAATGCTTTTCGCACAGAATCGACATCACAACCATTAGGAATGTAACGTCCATCTCCACGAGCAATTTTTATAATATCGTATGTGTTGCTTGCTACAGGCCCATCCTTCATAGCGTTATAGTGATCGCCCGTTATGAGGCGTCCAAAACTTTCAAGGTGAAACCTGTCAGCATAATAAAGAATTTTTCCGACATGATAGATATCTGGGATCGGTGCTTTAGAGGCGACGTACAGAATGGCCTCTAAAGCCTTTTCTGAATCAAACCTTACATTTAGCATCAATACACCCTTCATCCAAACAACATCGTCAAGCTCTGGCAAATGCAACCGCAT
Protein sequences of DBSCAN-SWA_8
>NZ_CP014620|4266227:4309790|4268865_4269108_-|WP_000891414.1|DBSCAN-SWA
MLELLFVLGFFLMLMVTGVSLLGILAALVVATAVMFLGGMFALMIKLLPWLLLAVAVVWVIKAVKTPKIPQYQRNNRRFY
>NZ_CP014620|4266227:4309790|4305174_4306053_+|WP_023200800.1|DBSCAN-SWA
MCITHVVSFSGGRTSAYLVHLMEEQRKAGNNVCYIFMDTGCEHPLTYRFIREVVKFWGISLTVLQVDINPELGQPNGYTEWEPKDIQTRMPVLKPFMDMVKKYGTPYIGGAFCTDRLKLTPFTKYCDDHFGRGNYITWLGIRADEPRRLKPKSGVRYLAELSDFDKSDVIRWWHKQPFDLQTPEHLGNCVFCIKKSTQKLGLACKDEPGLMRVFNELVTGKHVRDGHRKTNKDVMYRGHLSLDGIARMYADSDYRNLYQAMVQARQFDTGSCSESCEIWGDQLELKFEEVVA
>NZ_CP014620|4266227:4309790|4295127_4295370_-|WP_001306866.1|DBSCAN-SWA
MVERCSVCEQSLSYSREVEQDGVEYKSCPKCSADAGVHVFYKTIDFGYRDMGDGRHIVQSWCPACRSGEKPSIPPAFKCC
>NZ_CP014620|4266227:4309790|4293798_4294191_-|WP_023259402.1|DBSCAN-SWA
MKLSYKLVIAAFFFTVIGSFIWSANHYYSKYQHEKKRADEAVQNAKSATVITNNVLQSLQIVNTVLEANQHAKQQITLESQRTQEDIKVAVADDDCASRPVPAAAADRLRKFANGLRERSGGTTASQPDF
>NZ_CP014620|4266227:4309790|4267707_4268691_+|WP_000235555.1|DBSCAN-SWA
MATRIEFHKHGGPEVLQTVEFTPTEPAEHEIQVENKAIGINFIDTYIRSGLYPPPSLPAGLGTEAAGVVSKVGNGVEHIRVGDRVVYAQSTLGAYSSVHNVPADKAAILPDAISFEQAAASFLKGLTVFYLLRKTYEVKPDEPFLFHAAAGGVGLIACQWAKALGAKLIGTVGSAQKAQRALDAGAWQVINYREESIVERVKEITGGKKVRVVYDSVGKDTWEASLDCLQRRGLMVSFGNASGPVTGVNLGILNQKGSLYATRPSLQGYITTREELTEASNELFSLIASGVIKVDVAENQRYALKDARRAHEVLESRATQGSSLLIP
>NZ_CP014620|4266227:4309790|4294654_4294990_-|WP_023200326.1|holin|DBSCAN-SWA
MHNDPHSWSDLLELLQSWWRGDTPLGAVIMSIVMAGLRIAYFGGGGGWKRKTLEILLCGALTLTFASALEYVGWPKSLSVAIGGGVGLIGVDAIRGAAMRVIGNKFGSSKE
>NZ_CP014620|4266227:4309790|4298624_4298951_-|WP_000210148.1|DBSCAN-SWA
MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAAGIIRALLAGEENARLRAAHWLHERGLKV
>NZ_CP014620|4266227:4309790|4300626_4300806_-|WP_001250269.1|DBSCAN-SWA
MRQVNRWFKDHYGVPVRVIRWEPETQRVIYLREGYEHECFSPLEQFRRKFREIEVGHEH
>NZ_CP014620|4266227:4309790|4286769_4287093_-|WP_000927719.1|head,tail|DBSCAN-SWA
MLLTMEEIKAQLRLDEDFDTDDRHLQLLACAAQKRTETYLNRKLYAPDETIPDSDPDGLHLPDDIRLGMLMLISHFYENRSSVTEVEKLDMPQSFGWLVGPYRYFPQ
>NZ_CP014620|4266227:4309790|4292994_4293345_-|WP_023200328.1|DBSCAN-SWA
MPPRIPKACRVRGCRSTTTDPSGYCESHKSEGWKQYKPGQSRHQRGYGSKWDGIRERVLKRDKGLCQSCLHAGVVREAKTVDHIVPKAHGGTDADSNLQSLCWPCHKAKTARERLK
>NZ_CP014620|4266227:4309790|4277429_4277978_-|WP_022630975.1|plate|DBSCAN-SWA
MRTIEAMQRQLLGLIGRAVVKSISAATKCQTVDVSLIAGEPKAGVEHLEPYGFTSRANSGAEAVVLFPDGDRSHAVVVTVSDRRYRLKGLQTGEVAVYDDQGQSVTLTREGLVVDGAGKTITFRNSPKARFEMDLEVTGQVKDLCDSGGTTMSAMRLAYNGHRHRENGQGSNTDKPDKAMEA
>NZ_CP014620|4266227:4309790|4306524_4306812_+|WP_000212745.1|DBSCAN-SWA
MATLQELIDLTPEQEKAWNRLVKAVKDFRAAGGKFYSVLDTLSAYNGEHVASIDNDKGYHTASVYMPSIDAPGLTSWADDWHGITLKDGVEVDED
>NZ_CP014620|4266227:4309790|4303950_4304292_-|WP_001307125.1|DBSCAN-SWA
MNQFYVHVRLFEATAEQTKKFEELMLNFLYQKTIKESDDSCCRLIPEGYILKSTMNCQQILDQTFSIANSAGVDANIFVCKFEQSACLLPSASLVGNDFVHHDLTPKPIKLDS
>NZ_CP014620|4266227:4309790|4294174_4294651_-|WP_023200327.1|DBSCAN-SWA
MQVLNSQRKAFLDMVAWSEGTDNGRQPTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKSTAAGRYQLLSRWWDAYRNQLGLKDFSPKSQDAVALQQIKERGALPMIDRGDIRQAIDRCSNIWASLPGAGYGQYEHKIGDLIARFKEAGGVVNEVEL
>NZ_CP014620|4266227:4309790|4285881_4286388_-|WP_022630978.1|DBSCAN-SWA
MTTSFLHVDFQQPAEMRFNRARVRRAFVTIGQRHMRDARRLVMRRARSAPGENPGYQTGRLARSIGYMVPRASKHRPGFMARIAPNQRNGEGNRRITGDFYPAFLFYGVRRGEKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTRYFLARELRKSLKPERRHR
>NZ_CP014620|4266227:4309790|4283040_4283310_-|WP_000661047.1|tail|DBSCAN-SWA
MKELELKKPIIAHGETLSVLEFDEPTGKDVRELGYPYQMNQDESVRLLAHVVSKYIVRLAKVPQSSVDQMSPADLNAAAWLVAGFFLQA
>NZ_CP014620|4266227:4309790|4285145_4285316_-|WP_000497751.1|DBSCAN-SWA
MFVKPVKGRSVPDPARGDLLPAEGRNVDENNYWLRREAAGDIRRVNKKVNTDDDKL
>NZ_CP014620|4266227:4309790|4281066_4282899_-|WP_022630976.1|tail|DBSCAN-SWA
MAEFELKALITGVDRLSPALSKMQKKIRGFKRQAEEASQGGLALGGGLAAGLTLSLKSYADQENAATGLKVAMMDANGEVGKSFQDINKLAIGLGNQLPGTTADFQNMMQMLVRQGIPAENILGGVGKATAYLAVQLKKTPEAAAEFAAKMQDATGTASEDMMGLFDTIQKAFYLGVDDTNMLSFFTKTSSVLKMVNKDGLQAAQSLAPISVMMDQMGMNGESAGNALRKVIQSGLSVKKIRDVNKVMARQKLGVQLDFTDGKGSFGGLDNMFRQLAKLRKLTDVKRTDVLKAIFGDDAETIQVVNALIDKGKDGYDQIQQKMNKQASLNKRVQAQLGTLSNLWEAMTGTATNGLAAIGGAFSGDAKNITQWLGELGEKFTKFADENPRVIRGVVGLAAGLAILKLGLMGVGSAISIVSRIMSMTPIGMIATAIALAAGLIITNWDVVGPYFKKLWETIGPYFEAGWELLKKVFAWSPLGMVINNWGPVVKWFQDMWDKLKPIIEWFTDSFGDTVDAINSAQWGAGAYDAYGTGIPARGYTPYPAVDPAQSNNASDATGSNPFMINKASVPKVDGEIKVSFVNSPPGMRVMETRSSGFDVSHDVGYTRFR >NZ_CP014620|4266227:4309790|4303704_4304013_+|WP_047624181.1|DBSCAN-SWA MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQDPDRTKFMNFMANTITPLPCNGERVSRVYHDTMVKALRIELDGLRRKIVMNKIVAN >NZ_CP014620|4266227:4309790|4307826_4308399_+|WP_024146004.1|DBSCAN-SWA MNNLMIDLETMGKNKDAPIVSIGAVFFTPETGDIGQEFYTVVSLESAMGQGATPDGDTILWWLKQSPEARAAICIDDTLSISDALSELNHFINRHAANTKYLKVWGNGATFDNVILRGAYERAGQICPWAYWNDHDVRTIVTLGRSIGFDPKMDIPFDGERHNALADARHQAKYVSAIWQKLIPATSTEL >NZ_CP014620|4266227:4309790|4301939_4302626_+|WP_000853319.1|DBSCAN-SWA MKTLAERLKIGREKAGMSQAQLAEKIGLSQQSVAKIENGETLQPRKIKEIAKVLGVSQKWLQLGIEDNASIPDLVVKEAESTALDPDIFVNIPVLDVELSAGNGCLAEIVESAIDWFPLRRADLRKSGVCASNAKIVKIWGNSLLPVLNNGDLVAVDISQTVPIRDGDLYAVRDGVLLRVKILINLPDGGLILRSFNKDEYPDEILTFEDRRARIHVIGRVFWSSRTW >NZ_CP014620|4266227:4309790|4297421_4298219_-|WP_001061375.1|DBSCAN-SWA MNNLMVIDGIEVRRDAFGRYSLNDLHRAAGSLDKHKPAFWLRNEQTERLISELQICNSVNIEPVNVIRGGNNQGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAHDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >NZ_CP014620|4266227:4309790|4283309_4283666_-|WP_000090998.1|DBSCAN-SWA MARIGGTCYFKIDGQQLSLTGGIEVPMNRTVNDDIIGLDGSVDRKETHRAPYVKGTFKVPKNFPVSKITSSDEMTITAELANGQVYVLSSAWLHGEANHNAEEGTVDLEFHGEEGDYQ >NZ_CP014620|4266227:4309790|4286362_4286773_-|WP_000702388.1|head|DBSCAN-SWA 
MKIRQAQTSATYILPDPGELNKRVLIRLRVDMPADNFGVEPQYPVTFRTWAKVIQTSATTWQETAQTGDAITHYITIRYRRGITADYEVVCGDSVYRVKRQRDLNGARRFLLLECTELGECRQSHGGNNDDFLFAR >NZ_CP014620|4266227:4309790|4299695_4300637_-|WP_000104967.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDPSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKTSRPDASQPDTQTAEQEFLTRHPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTYTEKREDPYFKASYDNVDYSQIPAGFRG >NZ_CP014620|4266227:4309790|4287344_4288550_-|WP_000257507.1|capsid|DBSCAN-SWA MAVDIKDVEQVAQELQQKFDDFKAKNDKRVDAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPAGGAQNKLATEHKEAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRNILNLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLELIEPLMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTSGDGTKKPKGFLAYESTDETDKVRAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA >NZ_CP014620|4266227:4309790|4302608_4303499_-|WP_000389078.1|DBSCAN-SWA MEDALYAFNYTQNRDKLFANLISIIDGIIADGVVREEEVLYLDTWLLEAKQIINNGVIKSLSARVSDILADGIITSEERDDLKNSLLQIQREILDIPEIDFYSKDVDVHLLNGLCKGLIADRNLTQEEIRYLNWWLEQNGALKNNYPGKKLYALVKEILKDGVITEDESLTLHKALVDFTGCDLESGVVDGLATRLPIDVGASIELEGKTYCLTGTFVAGKRAVVENLIKNAGGNISSGITQKLDFLVIGTLSSRDWKFSSHGRKIEKAISYRDDNGAKLKIISEEMLFDALPSSR >NZ_CP014620|4266227:4309790|4277977_4279057_-|WP_000999499.1|plate|DBSCAN-SWA MNDNVTLRVNGREWNGWTSVRIGAGIERLARDFSVEITRQWPGDEGITTLQPRIKNGSKVEVLIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDCAAEPTQFNGRSLVQIAQALAAPFGIEVVNNGAPSGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGIGSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRAGNDDDFGEATTTALRARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETTYVVQGWRQGNGTLWQPNQRVIVFDPVCGFDNTELLVSEVTFTQDQNGTLTEIRVGPPDAYLPEPEAPGARKKKKARVQEDPF >NZ_CP014620|4266227:4309790|4273819_4274353_-|WP_022631136.1|tail|DBSCAN-SWA 
MMHLKNIVAGNPKTPDQYQLTKKFGVVWLFDEDGKNWYEEQKKFSADSLKIAYDKNNVIVDINKDVSAINPDGCSVVELPDITANRRADVSGRWMFNGEQVSKRVYSPEELRQQAEAKKQKLLEEAEVVIKPLSRAVKMGIATDEERKRLEAWELYSVLVSRVDSSDPDWPEKPASQ >NZ_CP014620|4266227:4309790|4308435_4308708_+|WP_001093914.1|DBSCAN-SWA MNTLFLLMAEFNTPNIELSAVSQKYFGMSPATAEAKANACKLPVPTYRIGTSQKAKRCINIQDLAEYIDKRREEGRAEWERVRTHKQRLI >NZ_CP014620|4266227:4309790|4292383_4292869_-|WP_000929174.1|terminase|DBSCAN-SWA MAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAEEDLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIARQGNTITGAMGGMVKNPELTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAGKKKATNPFLTI >NZ_CP014620|4266227:4309790|4308741_4309203_-|WP_000900143.1|DBSCAN-SWA MLGDCFPPDFKANFSRERGISPGDVLYLHCDFTTPPKVKYMVVVCCEPLLVLLINSDINEFIKRNNDLMACQVEINREDHDFLKWDSFVNCIEAHAAFDLEIIKEKIAIQYGDVLKGRITDHCMRQVRCAVEISKTMVKRHKKLILAALQHYE >NZ_CP014620|4266227:4309790|4266227_4267643_-|WP_000918353.1|DBSCAN-SWA MAGNKPFNKPQTDARDRDPQVAGIKVPPHSIEAEQSVLGGLMLDNERWDDVAERVVAEDFYTRPHRHIFTEMGRLQESGSPIDLITLAESLERQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRERAVVRDMIAVAHEIADAGYDPQGRNSDELLDLAESRVFQIAENRANKDEGPKSIDQILDATVARIEQLFQQPHDGVTGVDTGYQDLNKKTAGLQRSDLIIVAARPSMGKTTFAMNLCENAAMLQDKPVLIFSLEMPGEQIMMRMLASLSRVDQTRIRTGQLDDEDWARISGTMGILLEKRNMYIDDSSGLTPTEVRSRARRIFREHGGLSLIMIDYLQLMRVPSLSDNRTLEIAEISRSLKALAKELQVPVVALSQLNRSLEQRADKRPVNSDLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVRLTFNGQWSRFDNYAGPQYDDE >NZ_CP014620|4266227:4309790|4279053_4280382_-|WP_000219913.1|DBSCAN-SWA MTWKDRLQDASFRGVPFKVEEESAGTGRRVETHEYPNRDKPYTEDLGKVTFRPSITAYVVGDDCFDQRDRLIDALNKPGPGTLVHPTYGELKVCVDGEVRVSTSKSEGRIVRFDLKFVEAGELSYPTSGAATAQTLMSSCSALDDCISDSFSGFSIDGVADFVQNDVIGNASIMLGYVSDAMKVVDSAVSDAARLLQGDISVLLPPPSSGKNFVEQVQKMWRTGKRLYGNASDLVTMIKTLSGVSLGSDLQPRGVWKTDSKTTATATQQRNVVASTLRTTAISEAAYAVTRLPAPTTSAVMQNSAVGQATTPAQSTGWPSVTHPALNNAPAVKNTVDLPTWEELTDIRDTLNTAIDKELSRTTSDALFLALRRVKADLNADINTRLEQSARIIQRTPDEVLPALVLAATWFDNAARDADIIRRNAITHPGFVPVIPLKVPVQ >NZ_CP014620|4266227:4309790|4296424_4297414_-|WP_012513026.1|DBSCAN-SWA 
MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLVRNDLADALPESAARKALRLPKPVVQAATRESDLVHSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP014620|4266227:4309790|4271560_4271809_+|WP_001217553.1|DBSCAN-SWA MRIEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGLSVTRTADKDSAKTFVQETLKDTWESADEWFVH >NZ_CP014620|4266227:4309790|4270401_4271499_+|WP_000332264.1|integrase|DBSCAN-SWA MAYYNIEKRLKSDGTPRYRCNVIIKEKGVITYRESKTFPKHAHAKTWGTQKVMELDLYGIPSSNAVDGLTVRDLLHKYLNDPNAGGKAGRTKRYVLELLMDSDISAIKLSELTENDVIEHCRLRNNAGAGPATVSHDVSYLGSVLDAAKPVYGINYTSNPAKAARPYLLKLGLIGKSNRRNRRPASDELDMLIEGLQQRSTHKCSKIPFVDILKFSVWSCMRIGEVCRLRWEDLDQEQKSILVRDRKDPRKKEGNHMKVALLGEAWDIVQRQPKKSEFIFPYNSTSVTAGFQRVRSKLGIKDLRYHDLRREGASRLFEAGFSIEEVAQVTGHRSLNVLWQVYTELYPKSLHNRFEELQRSRNKTS >NZ_CP014620|4266227:4309790|4287067_4287295_-|WP_021577001.1|DBSCAN-SWA MILKQDLKWSPDGMRVEVIQAGEYDDGALPARVQEIALQAGLAERGISAKSSKAAKEKKPRPVKRAEYASDNGRD >NZ_CP014620|4266227:4309790|4288564_4289224_-|WP_022630979.1|head,protease|DBSCAN-SWA MQTKQRLDVPLSLKSVSDSGEFEGYGSVFGVKDSHDDVVMSGAFAASLRAWSDRKALPALLWQHRMDEPIGVYTEMKEDDVGLYVRGRLLIDDDPLAKRAHAHMKAGSLTGLSIGYVLKDWEYDRSKEAFLLKEIDLWEVSLVTFPSNDEARISDVKNALARGEIPEQKKIERVLRDVGLSRTQAKAFMAGGYGALSLRDAEDVGSALNALNALKNLNF >NZ_CP014620|4266227:4309790|4298947_4299601_-|WP_000066917.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA >NZ_CP014620|4266227:4309790|4301555_4301804_-|WP_000187185.1|DBSCAN-SWA MTPEQLALSEAIALAGGQSELARKLTASSGHLVKQQHVWNWLNREKRPPAKLSIFIEKTTGISKEKLRPDIFQKIKDSSDEK >NZ_CP014620|4266227:4309790|4290636_4292370_-|WP_000088161.1|terminase|DBSCAN-SWA 
MSRKSYPNVNAANQYARDVVRGKIVACQFVIQACQRHLDDLMAEKSKSFRYRFDKDLAERAAKFIQLLPHTKGEWAFKRMPITLEPWQLFVICCAFGWVNKGTRLRRFREVYTEIPRKNGKSAISAGVALYCFACDNEFGAEVYSGATTEKQAWEVFRPARLMCKRTPMLTEAFGIEVNASNMNRPEDGARFEPLIGNPGDGSSPHCAVVDEYHEHATDALYTTMLTGMGARRQPLMWAITTAGYNIEGPCYDKRREVIEMLNGSVPNDELFGIIYTVDEGDDWTDPQVLEKANPNIGVSVYREFLLSQQQRAKNNARLANVFKTKHLNIWVSARSAYFNLVSWQSCEDKSLTLEQFEGQPCILAFDLARKLDMNSMARLYTREIDGKTHYYSVAPRFWVPYDTVYSVEKNEDRRTAERFQKWVEMGVLTVTDGAEVDYRYILEEAKAANKISPVSESPIDPFGATGLSHDLADEDLNPVTIVQNFANMSDPMKELEAAIESGRFHHDGNPIMTWCIGNVVGKNMPGNDDLVKPVKEQAENKIDGAVALIMAVGRAMLYEKEDTLSDHIESYGIRSL >NZ_CP014620|4266227:4309790|4280472_4280985_-|WP_001439754.1|DBSCAN-SWA MIEGMIMRIFVFFISALLSFNLAAEECKFSFNESELISSIGIAPVKQEIIKDEGITKRQYEFRRELSSEEMLSDDADEKYEPQFYISVYNPSCPQKVIVWFFKDNKNTMDLSNEVLAGRAFKYLTGVNESIFENKMKKFLKVQSFESFDERTDSKFIKSGDIYSIDVQLR >NZ_CP014620|4266227:4309790|4300981_4301533_-|WP_058652097.1|DBSCAN-SWA MGKHHWKVEKQPKWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMVLQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCISESNDARECAAPGAVACRDCGETNA >NZ_CP014620|4266227:4309790|4277004_4277430_-|WP_000424732.1|DBSCAN-SWA MELWLTVNGKRTCASAPLDPLTRAVVISLFTWRRAEPDDNADVPMGWWGDTWPAVQNDRYGSRLWLLQRSKLTNQLVQTVRGYIRECLQWMIDDGVVSRIDLDIRRTGINELGNSITLWRRDGPVMISFDDLWSAITHGGQ >NZ_CP014620|4266227:4309790|4290442_4290625_-|WP_000605606.1|DBSCAN-SWA MIMLILAPLVGVLGALLLAYGAWLIYPPAGFVVAGALCLFWSWLVARYLDRTQSSVGGGK >NZ_CP014620|4266227:4309790|4304647_4305184_+|WP_000008249.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDNIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNELSELRKCA >NZ_CP014620|4266227:4309790|4275384_4275969_-|WP_000383548.1|DBSCAN-SWA MDVTNDDYIRLLSALLPPGPAWSASDPAIAGAAPSLTRVHQRADALMRELDPRTTTELINRWERLCGLPDECIPAGTQTLRQRQQRLDAKVNLAGGINEDFYLAQLAALGRPDATITRYDKSTFTCSSACTDAVNAPEWRYYWQVNMPAATNTTWMTCGDPCDSALRIWGDTVVECVLNKLCPSHTYVIFKYPE 
>NZ_CP014620|4266227:4309790|4274355_4275381_-|WP_063269512.1|DBSCAN-SWA MHRIDTKTAQKDKFGAGKNGFTRGNPQTGTPATDLDDDYFDMLQEELCSVVEASGASLEKGRHDQLLTALRALLLSRKNPFGDIKSDGTVKTALENLGLGETINLARNAVPATRRINSKPLTGDITLWASDVGALPIAGGRLNGALGIGADNALGGNSIVLGDNDTGFKQDGDGVLGIYANNARIGYIDNSGLHMSVNVLTNGGIRVGDGKQFSLTSNNNSTMTATFNLWGGADRPTVIELDDDQGWQFYSQRNTDGSISFRVNGQMEPNSYSNFDSRYVQDIRLGSLQYGQVWNGPGFSDTSGYVITGITNGNSDELVDGAHRRPIQKLIGNQWYNVVSI >NZ_CP014620|4266227:4309790|4272739_4273228_+|WP_024144069.1|DBSCAN-SWA MGADNALGGNSIVLGDNDTGIKQNGDGVLDIYANSAHVLRFISILVESMVSLKVNGNAVATGEVQAGNGSSRMTNNGDIFGSVWGNSWLSLWINNNFVADVQLGAGTSVTTWNNAGSWPNTPGYVVTSVWKDNQGENIDGINYAPLQKRVGNQWYTVQGGTT >NZ_CP014620|4266227:4309790|4275959_4277018_-|WP_063269513.1|plate|DBSCAN-SWA MADSEFQRPTLAENISMLRNDLFARLDVSDTLRRMDEDVRAKVYAAALHTVYGYIDYLAMNMLPDLCDESWLARHAAMKRCPRKEATAASGYMRWEGVSDGLKVTAGSVIQRDDLVQYTATADATSSGGVLRVPIACSSAGAVGNADDGTALILVTPVNGLPSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEVPGITRAWTYRHWMGTGTVGVMIASSDLINPIPEESTETAARQHIEPLAPVAGSDLYVFRPVAHTVDFHIRVTPDTPEIRAVITAELRSFLLRDGYPQGELKVSCISEAISGANGEYSHQLLAPADNISIAKNELAVLGTISWT >NZ_CP014620|4266227:4309790|4295658_4296411_-|WP_023200325.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGSGGLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQIIPYLVNFAFADYSRSAASKARCEHCAGTGFHNVLREVVKHSRSGESVVKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRLHGTPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYEWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >NZ_CP014620|4266227:4309790|4285324_4285885_-|WP_000779279.1|DBSCAN-SWA MKLTPVIAALRARCPYFENRVAGAAQFKNLPEVGKLKLPAAYVVPGDDSPGENKSQTDYWQELKEGFSVVVILSNGRDERGQFASYDVVDDVRQMLFKALLGWNPEACGNPITYDGGTLLDLNRHELIYQFDFSVISELTEDDTRQQDDLNSLDELQTLAIDVDYLEPGNGPDGDIEHHTEITLPS >NZ_CP014620|4266227:4309790|4269275_4270313_-|WP_052934728.1|tRNA|DBSCAN-SWA 
MHDNHETKKINQTSVMPEKTGVYWNSRFSIAPMLDWTDRHCRYFLRLLSRQTLLYTEMVTTGAIIHGKGDYLAYSEEEHPVALQLGGSDPAQLAHCAKLAEARGYDEINLNVGCPSDRVQNGMFGACLMGNAQLVADCVKAMRDVVSIPVTVKTRIGIDDQDSYAFLCDFIDTVSGRGECEMFIIHARKAWLSGLSPKENREIPPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKEHLRHMDGVMVGREAYQNPGILAAVDREIFGADTTDADPVTVVRAMYPYIERELSQGAYLGHITRHMLGLFQGIPGARQWRRYLSENAHKAGADVAVLEQALKLVADKR >NZ_CP014620|4266227:4309790|4289201_4290443_-|WP_001514795.1|portal|DBSCAN-SWA MFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPGSVVPKLNSSWEPVYQVTFPDGSTDVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRKSKQGVYYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTKPSDGSKAGKQKDNANADETTS >NZ_CP014620|4266227:4309790|4298238_4298628_-|WP_000767130.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSTACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA >NZ_CP014620|4266227:4309790|4273197_4273815_+|WP_010835343.1|tail|DBSCAN-SWA MVYRTRGNDIMKKYQDIKNFRLIDAPVNRGKTQSEINIGAYFLESEDGQDWYECQSLFSDDTAKIMYDPEGVIWSVVNQPVPQRGNTYAVSMLWPVNMSVAEIDAADCPDDCRGDGSWLYRDGQVLPVPVDYQAKAETTRQKLLNDANNVIKDWRTELTLGIISDENKVTLINWMGYINKLKDIDFSQVNDEATFEKIKWPELPK >NZ_CP014620|4266227:4309790|4271952_4272516_-|WP_000639149.1|DBSCAN-SWA MIYGYVRVSTNHQDTELQRLALESAGCERIYEEYASGRTANRPVLKELITVMKSGDELIVWKLDRIGRNVLHALLMFQNLHEKGVNFRSITDGVDLKTASGRYNFRNILSAAQYESDLNSERTLAGLAIARSKGRIGGRRPKFSDEQWQQMGALIAAGKSRRYVARIYNVGLSTLYKRFPVTGIQTK >NZ_CP014620|4266227:4309790|4283665_4285162_-|WP_022630977.1|tail|DBSCAN-SWA 
MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFGELYVIAVPESTGAAATVTLTVTGAATETGTVNVYVGRTRVQAPVTNGDNVTMIASSIQDAINAVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVTAGDQFNQQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASDPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA >NZ_CP014620|4266227:4309790|4309208_4309790_-|WP_001535325.1|DBSCAN-SWA MRLHLPELDDVVWMKGVLMLNVRFDSEKALEAILYVASKAPIPDIYHVGKILYYADRFHLESFGRLITGDHYNAMKDGPVASNTYDIIKIARGDGRYIPNGCDVDSVRKAFSVSGMTIVPSREADEDFFSDSDLECIDKSIAMLGNMSFEAIRTMSHDAAWEQADHNGEMSLESIVAQLKNSKLILDYLKNGY >NZ_CP014620|4266227:4309790|4306049_4306523_+|WP_023200799.1|DBSCAN-SWA MTTKINYQALREAAEAIKIVATPQKLLAFRMKVTPQVVLALLDELEAAEKRNAELQSENAYIRNRYKELDLLIGKNILVMQAAIIEWQATGDAKSGLAWIYNTLFGPGELPDESEKDAQAYFNRKYAPIDEKLMALHKWFWEQSEAERAAGIRIKGE |
61 | Shigella_phage(45.28%) | integrase,protease,plate,head,tail,portal,holin,terminase,capsid,tRNA | attL 4268361:4268376|attR 4271767:4271782 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
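The protein records in these prophage regions carry their keyword annotation in the FASTA header itself (for example `|head,tail|` before the trailing `DBSCAN-SWA` field). The following is a minimal sketch of how such a header could be split into its parts; the field layout is inferred from the records shown here rather than from any documented format, and the names `parse_header` and `LISTED_KEYWORDS` are hypothetical.

```python
# Keywords named in the page's note on colored proteins.
LISTED_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


def parse_header(header):
    """Split one protein header into its fields.

    Assumed layout (inferred from the records on this page):
    >contig|region_start:region_end|gene_start_gene_end_strand|protein_id|[keywords|]tool
    """
    fields = header.strip().lstrip(">").split("|")
    contig, region, gene, protein_id = fields[:4]
    tool = fields[-1]
    # Anything between the protein ID and the trailing tool name is
    # treated as a comma-separated keyword list; it may be absent.
    keywords = [k for part in fields[4:-1] for k in part.split(",") if k]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "gene": (int(gene_start), int(gene_end)),
        "strand": strand,
        "protein_id": protein_id,
        "keywords": keywords,
        # Subset of keywords that appear in the page's own keyword list.
        "listed_keywords": [k for k in keywords if k in LISTED_KEYWORDS],
        "tool": tool,
    }


if __name__ == "__main__":
    example = (">NZ_CP014620|4266227:4309790|4286769_4287093_-|"
               "WP_000927719.1|head,tail|DBSCAN-SWA")
    print(parse_header(example))
```

Headers without a keyword field, such as `>NZ_CP014620|4266227:4309790|4268865_4269108_-|WP_000891414.1|DBSCAN-SWA`, simply yield an empty keyword list under this assumed layout.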
DBSCAN-SWA_9 |
4335620 : 4353775
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP014620|4335620:4353775|DBSCAN-SWA CTTACAGCATACGCCGCTGGTGCTGTTTTTCCGTGTGGACCCGCTTAACGATTTTATAGATCCACTGCAGTGTTAAATTGTACTTTTTGGCCAGCTCCGCATAATTACGCCCATCGCACTCGCTGTAAATCTGATAATCCCGTTCCGAGGCTCGACCAGAGATACCTTTGGGGAAATAAATGCTTTGCCCGCCCCAGTTGCGCATCATTCTGTCAGCAATTGCCTGACCGGCATTTTCGGCTGACGCGCTATCAATATTCATGCTTTCAATAAGTACCTGCGACGCATGAAACGCCAGATCGTTAATAATCTCCGGAAGCCGAACCATTTCTTTTGATTTCTGAATCATATTTTATCTCCATATCCAGCACCATGCTCCCCGTGACGGCACCAAAAAGGGAAGCGTAAAAAACGGTGCAATAACTACCTATCAATACATTCAAAATATATATAAATATTTACTATCAACGCATTTAATTATAAATTAAAAAAATATATATATCGTTTTCAATTTATAATATGTATTTATTTTATAGGTGAAGTAAGCCTTCACTGATGTCATCATTTTACACAATACTGTATGCATATACAGTTAATTTTTTGCATTTTTTTCTGCGTTCGATCAAAAAGATTGCTTTCCTCACAATCGCGTGAATTTTTAAACTGACTTTAAAAATCAATAAGATAATTTTAATCAAAGATTATTATGACGCAGCAATACCGCCATCGTAGCCTGCCACAGACACATACTGACTCTCCGCAGGCAACCTCTACCCCCTGCCCCGAAAGCTTTATTTATTCGTTTATTTGTTGGCATTTGACGCCATGCGCTAAACATTTTTTAAAGTTATTTATTAAAGCCCTTTTCGTTCTTCCGCTATGGGGAAAAGCGATACTTCTCTCTCAATTATCAGGAGATAGAGTATGCGAAAAATTATTGTTCCCCGACTTTCCGGCTGGCTGATGGCCTCTGTCGTACTGTTTGCGCTTATCGGCTGGACATCATCAGCGCAAATTCCGGTCGTTATCTATAAACTCAGCCTGGTTTCGTTATCAGCGGTGCTGGGTTACTGGCTCGACCGCAGTCTTTTCCCCTGGGCGCGTCCCGACTCCTTTTGCCCCTGGGAAGAATCGCTGTGCTGCGCCGCGGCGATGATTCGTCGCGCGATCATCGTTGCGGCAATTTGCCTTGCCGTCGCGCTGGGGCTGTAACGATGCGTTATCAATACGTATGCCTGGTCTGCGCTATGACTTTCTTGTCTGCCGACGCTGCCGAACCGCCGCGCGCTTCTCTGCAATGGCGAAACGAAGTGATTCGTACCGCGCGCGAAATCTGGGGGCTTAACGCCCCCGTTGCGGATTTTGCCGGGCAACTACATCAGGAATCCGGCTGGGCGCCTGACGCGCTTTCTCCGGCTGGCGCGCAGGGTATGGCGCAATTTATGCCCGCAACGGCAAAATGGGTAAGCCAGTTGTACCCAGCGCTTCGCGAGAACAAGCCGTTTAATCCCGCCTGGGCGATACGCGCGCTGGTACAGTATGACCGCCAGTTGTGGAAAAGCGTGTCAGCAAAAAATAGCTGCCAGCGAATGGCTTTCACTCTGAGCGCCTATAACGGCGGGCAAGGCTGGGTTAACAGAGATAAAAAGCTGGCCGCCGCAAAGGGGTTGGATGCATCCATCTGGTTTGAACATGTAGAACGCGTTAACGCCGGGCGCAGCGCCGCAAACTGGCGCGAGAATCGTCACTATCCCAAAGCGATTTTATACCAACATGCTCCCCGTTATTTGCAATGGGGGCAGGCTAGCTGCATTCATTAATCAGAGGGAGTAATGAAACTCAGTATCGATTTTTGGGAAGTCATCTCCCTGTTGCTTTCTTTTGTTGGATTAATGTTTGCTGCCGGTAAATTGCTGCTGGCGC
AAATTGAAAAACGGCTGAATGAACGTTTTGAAGCACTGGAAGCTGCCCGGCGCGAATCAGAAGCGGGCTGGTCCAGGCTGGAGAGAGAATTTCTGGAATTTCGCGCCGATTTACCGCTGCATTATGTCCGCCGCGAGGATTATCTTCGCGGCCAGGCCGTTCTGGAAGCAAAACTGGATGCGCTATATAGCAAAATAGAACTGATTCAACGAGGTAACCATTAAACAAATCCCCGTCGTTTATACGCCGGGGATTTTTTATTTGTTTTAACTTTTTTCTCTAAAGGAGATTAAAAGACGCCTGATGCCTGATACATCAAAATACGTCTGAACGATAACGAAGAGAGAATATTATCTCTCCCGGTAAATCAAGGAAAACCGTATATGGAAACATTATCTGTTATACATACCGTGGCGAATAGACTACGTGAATTAAACCCTGATATGGATATACATATTTCATCAACCGATGCGAAAGTATATATCCCAACAGGACAGCAGGTAACGGTATTAATTCACTACTGCGGTTCGGTTTTTGCCGAACCAGAAAATACGGATGCCACGGTACAAAAACAACTAATCCGGATTTCCGCCACCGTTATTGTTCCGCAAATAAGTGACGCGATAAACGCGCTGGATCGTCTACGTCGCTCGTTGGGGGGCATTGAACTTCCCGACTGTGATCGTCCGCTCTGGCTGGAAAGCGAAAAATATATCGGCGACGCCGCAAACTTCTGCCGTTACGCCCTGGATATGACCGCCAGCACCCTGTTTATCGCGGAACAGGAAAGCAAGGACTCCCCCCTGCTGACAATCGTTAATTATGAGGAAATTCAATGAAATATATCTACAGTGGCCCGGCAAGCGGCGTCACGCTCGCCGACGGTCAGGAAGTCTTACTGTGGCCCAATAGCGAAATCTCGCTGCCGGAAGATAACGAGTGGGTAATCACCATGATTGCCCGCCGTCACCTGACGCCAGTGGTTACGCAAGAAGTAGAAACTAATGAAGAGGAAATTGTCCATGGCAGCTAATTACCTGCACGGTGTAGAGACCATTGAGATCGAAACCGGCCCACGTCCGGTTAAGGCGGTTAAATCTGCGGTTATTGGTCTGATCGGCACCGCGCCATGCGGCCCGGTTAACCAGCCGACGCTGTGCCTTTCTGAAAGCGACGCGGCGCAGTTTGGCCCAGGTCTGGCAAATTTCACCATCCCGCAGGCGCTGAAGGCGATCTACGATCACGGCGCAGGGACGGTCGTGGTGATTAACGTGCTGAATCCGGCGGTACACAAAAGTACCATTCCCAGTGAAACCGTGAAGGTTGATGACAATGGTCAGATTCAACTCAAGCACGGGGCCGTGCAAACGATGAGCATTGGCCGCAGCACGAACGCCGGAAACGCTTATATCAAAGGCACCGATTACACCATTGATATGCTGACCGGTAAAATCACCTGCATGGGGACCAACCTGAAACCCGGTGTTCAGGCCTACGTGAATTATACTTACGCGGACCCCACTAAAGTGACTGCTGCCGATATCGTTGGCGATGTAAACACCGCGGGCGATCGTACCGGTATGAAGCTGTTGCAGGACACCTGGAACCAGTTTGGTTTTTACGCAAAGATCCTGATTGCGCCGGTCTTTTGTACGCAAAACTCGGTCGCCGTTAAGCTTATCGCTCAGGCAGAAGCGCTGGGAGCCATTACCTACATTGATGCGCCCATCGGCACGACTTTCCAGCAAGTTCTGGCAGGGCGCGGCCCGCAGGGGGCGATTAACTTCAATACCAGTTCCGATCGCGCGCGTCTGTGCTATCCGCACGTTAAAGTTTACGACAGTGCAACAAATGCAGAGGTTCTGGAACCACTCTCCTCTCGCGCCGCTGGCCTGCGTGCCAAAGTGGATCTGGAAAAAGGCTTCTGGTGGAGCAACTCAAACCAGGAAATTCAGGGCATTACCGGCGTAGAGCGCTCGCTGTCAGCGATGATCGACGAT
CCGCAAAGCGAGGTGAATCAACTGAATGAAAACGGCATCACCACCATCTTCAACAGCTATGGCTCCGGTTTGCGCCTGTGGGGCAACCGTACCGCCGCCTGGCCGACGGTTACTCATATGCGTAACTTTGAGAACGTGCGCCGTACCGGCGATGTAATCAACGAATCCATTCGCTATTTCAGCCAGCAGTATATGGATATGCCGATTAATCAGGCGCTGATCGACGCGCTAACCGAATCGGTGAACACCTGGGGCCGCAAGCTGATTGCCGACGGCGCGCTGTTGGGTTTTGAATGCTGGTACGACCCGGCGCGTAACGAACAGACTGAACTGGCAGCCGGGCATCTGTTGCTGAGCTACAAATTCACTCCGCCGCCGCCGCTGGAACGTCTGACGTTTGAAACCGAAATTACCTCTGAATATTTAGTTTCTCTGGAGAGCAATCGCTAATGGCTGGAAAAATTCAAATTAACCGTATTACCAACGCCAATATTTATCTTGATGGTAATAATCTTTTAGGTCGCGCGAGTGAAATTAAACTGCCTGATATCAGCATGATTATGCAGGAGCATAAAGCGCTGGGGATGGTCGGTAAAATTGAACTGCCTGCCGGTTTTGACAAACTGGAAGGTGAAATTAAATGGAACTCGTTTTACCACGACGTCATGCGTAAAACGGCAAACCCGTGGCAGGCGGTGGCATTGCAGTGCCGCTCCAGTATCGATTGTTATAACTCGCAGGGTAAAGCGGATCAGTTAGCGCTGGTGACGCATATGACCGTAATGTTTAAAAAGAACCCGCTGGGAACGTTTAAACAGAATGAAAACCCGGAATTCAGCAGCGCCTTCGGCTGCACTTATATTAAACAGGTGGTTGACGGTGAAACGCTTCTTGAACTGGATTATCTGGCGAATATTTTCCGCGTAAATGGCACGGATCAATTAAATGCCTACCGCAATAATATTGGCGGTTAATTACTTCGGGGCTACGGCCCCGAATTTAACACGACGAATAAGGACTATACCATGAACGAAAAATATACCCTGCAGTTCCCGTTTACCTCCGCCGCCGGGGAACGTATCGACGTTCTGCAATTACGTCGCCTGAAGGTAAAAGATATGCGCGCCGCGCGACGCGCCAGCGATAAACCGGAAGAGTGGGATGAGCCGCTGATGGCGGCTATGACCGGGCTGGTAACCGAAGATCTGGCGGAAATGGATCTTCTGGACTATCAGGCATTGCAGAAACGATTTCAGGCCATGCTTAGCATGGCTACAGAACCCACAGCAACTGTGGCAGGCAATGGCGCTGCTGGCGAGGTGGTTTCGCTTTCCGCCCAGTGAAATTGACGCGCTGTCGGTTGACGATTTTACCTGCTGGCTGGATGAAGCCAGCGCGCAAATTAAACACGAATACGACTCGCAGGCTTAATGCCTGTGGGTTTCCAGACCCAAGCCCGGTTCACTCCTTCCCTGTCCTTTTTCCGGCAAGCAGCCTGTTACCGGGCAACTTATACGAGACACTATTTTGGCCAACGACATTATTACTCAGCTTCAGGCGCGTAATGAGACGTTGACGCAGGCAATAGCCCGTTACGGCTCACTCAACGCCAGCACGCTGCACACGCTCAGCTTTGAGCAAACAAAAATCACCCGGCTTACGCAACAGCTCGCTAACTCTGCCCTTCGCCGGGAAGAGAACGATAAACAGCGCGCCGGGTTACTGGAAAAAACACAAACCTTCGCCGGGCAGTTCGGCAAGCTCCTGAACGTTGAGACTCCCGACTGGAAGCTGCCTTACGAATTTCAGGGCAACATGGTCGATATGGCGGCGAAAGGCGGCATGGATAACACCGCGCGGGACGCCCTGAGCCTGAATATCCGCGACTGGAGCCTTGATTTCAATCAGGATCAAAAAGATCTGCAAAGCGCCGCCGCCACGATGATCGAAGGCGGCGTCAGCGCATTGCAGGATCTTAGCCGCTACATGCCCGATATCGC
CAAAGCCGCAACCGCCTCCCGTGACAGCGCGCAAAGCTGGGCGCAGGCGGCTCTGGCCACTCGCGACAAACTGAACATCGCCCCTGACGACTTCCGTTTTGCGCAAAATATGCTGTACAGCGTGGCAAAAAGCGGCGGCGGCTCCGTTGCAGAACAAACCCAGTGGATTAACGCCTTTGCCAGAAAAACCGGCACTCAGGGGAAAGAAGGCATTGCGGAACTGACCGCAACGATGCAAATCGCCATGAAAAATGCCCCTGACGCAGGCGCGGCGGCAGCGAATTTTGACCATTTCCTGAAATCTACCTTCTCAAAAGAGACGGACAGTTGGTTTGCCCGCCAGGGCGTGGATCTTCAGGGATCGCTGCTGGAACATCAGCAAAACGGGATCGGCGTGACGGAAGCGATGGCCCACATCGTGCAGATGCAACTGGAGAAAATGAACCCGCAGATCCTCGACACCTTCAGGCAAACCATGAAGATTGAGGATCTTTCCGCGCGCGGCGACGCGCTACAGGCCATGACGGAGAAATTTAACCTCGGCGCGATGTTCGGCGATGCGCAAACGCGGGATTTTCTTGCCCCGATGCTGGCGAATATGGACGAATATCGCCAGCTAAAAGCCTCCGCAATGCAGGCGGCGGGGCAAAATTTTATTGATGATGACTTCGCCGCGAAAATGACATCGCCCAAAGAACAGACCAAAGCGTTACAACTTTCACTTAACGATCTGTGGCTGACCGTCGGCCTGGAACTGATGCCCGCCATTGGCGAACTGGCGCAAAGCATCACGCCGCTGGTGCGGCAGTTCAGCGCCTGGCTGCGGGAAAATCCGGCGCTGGTGCAAGGGGTCGCCAAAGTCGTTGGCGTTATCTGGCTGTTCAACGGGGCGCTGAATATTCTCAGGCTGGGAGCAAACCTCATTGCGTCACCGTTTATTCGCCTGATCGATATCTTCCTGAAGGTCAAAGCCGGTCTGGCGCTGGGCGGCGGCAGTCGCGCGCTGTCGGTTCTGAAATCGTTTGGCAACGGTGCGAAAAGCCTGACGGTGCTGCTGGGAAACGGCCTGATAAAAGGGCTACGGCTGGTCGGCCAGGCGTTTATCTGGCTGGGTCGGGCGCTGCTGATGAACCCTGTCGGCCTGACTATCACCGCTATCGCAGGCGCCGCCTATTTACTTTATCGCTACTGGGAACCGATTTCCGGTTTCTTTGCCGGAGTCTGGGAGCGTATCAAAACCGCCTTTGACGGAGGCATTGCCGGCGTCACGCGTTTAATTCTCGACTGGTCGCCGCTGGGGCTGTTTTACCGCGCCTTCGCCAGCGTACTGGACTGGTTTGGCATTGAACTCCCCGCCAGCTTTAGCGAATTTGGCGGCAATATTCTGGATAGCTTGATCAACGGCATTCTGAATGCGCTTCCTTTCCTGAACGGGGCGATTGAGAAGATAAAAGCGCTGATCCCCGACTGGGCGAAAAGCGCGCTGGGCATCAGCGCTGAAATGCCGTCTGTCGCCGCCGCCGTCCCCGGTATTGCCGGAACAATGGTCGCGCAACAGACCAGCGCGCCGCTGGCATCGGGAGCGAAAGCGGTGACAACCTCGGCCAAAACGATGGCCTCGCCGCAGCCTATGAAGACGAACAGCGCCGCCACGCCGCCGACGCCAGCCGCGCTTCCCGGCAAATCCGGCGGGAAACCTTATACGCTGCCCTCCCGCGCGCAAAGCAACGTGCAGGTACACTTTTCCCCGCAGGTTACCATGCAGGGAAGCGGCGCGAATGTCGCCAAAGATATCAACAACGTGCTGTCGCTGAGCAAACGCGAGCTGGAGAGAATGATTAACGATGTCATGGCGCAACAACGGCGCCGGGAGTACGCATAATGTATGCCGTATTAGGCGAAATAGAATTTGACGTCGTCGCTTACTGGGACGAATTTGAAAGCACGATGGGCGTGGATTATACCAGCCATGCCCGTATTGAAGGGAAACCGGGCGTGCA
ATTTATCGGCGATAAGCTGGACAAAATCACCCTGAAATTCAACTTTCATAGTCAGTATTGCCAGCCGACCACCGAGCTGAACCGTCTGCGGGAAGCGATGACCGCGCACCAGGCGATGGCGCTGGTGTTCGGCAACGGCGATTATCGCGGCTGGTTCGTGATTACCGATCTGACCGCTACCCACCAGCACACCGATCCTTACGGTAACGTCATTGCCCAGGGCGGCACTCTGTCGCTACAGGAGTACACCGGCGATCCGAAGAGCCCGTTACTGCCTCCGGCCATCACCACCCAGGAACCGAACATTGACGAGATGTTGGATGAGCTTCCCGACGTTAGCGATTCCTGGTTCGATGAACTGCTGAGCGTCGTTGAAGAGGGTATGCGTGAAGCCAAAGAGATGATGGATGAGGTGGCCGACGCCATTGATGACATCAAAAAAACGATCGCCCAGGCGAAAGAACTGGTGAAGGAAGCCAAAGCGCTGAAAGAAAAATGCGGCGATATCGTCGATTCGCTGAAAAAAACCATTAGCGCGATAGACGCGCTGTTCCAGCAGCCGCTGGATTTGCAAACGCTGGCCGGGCTGCCGAAAGCGCTGGCGGCGAAAATGCAGGAACTGATCGACAGCCTGCCGGGGATCCGCGAATGCGCGGGCGATGCCGGCACGCTTATCGAACACGCCGAATCGCTGTTTGACGCTATCACCAGCAGCGTCGCGGAAGCGACTTACGACAGCGCCGCGACGCTGGTCAATCAGGCGCGCGGCACGCTGCAAACAAGCGCCCCTGACGTGAGCCAGCTTGCCGCCGCCGATATTACGAGGAGTCTGTAATGCGCTACCTTGAACATGTCACCACCGACGGCGAACGCTGGGATAATCTCGCCTGGCGCTATTACGGCGATGCGCTGGCCTACGAACGCATCATCGCGGCCAATCCGCACGTCGCTATTATGCCGGTTTTGCCGTCAGGCGTGCGGCTGATCATCCCGGTTATCAGCGTCACGCAAACGACCCCGGAGCTACCGCCATGGCTGAGATAACGGTATCCGGCGGGGTGTTCGCCACCCTGACGCCCATTTTTACCCTTTGGTACGGACATAAAGAGATCACTTACGACATCGCGCCTTATGTCACCAGCATCAGTTACAGCGACAGCATTAAAAACGAGTCGGATGTTATTGCCATTGCGCTGGAAGATAGCGCCGGGCGCTGGGTAAACGAATGGTATCCGGGAAAAGGCGACACGCTGGCGCTGCGCCTGGGCTACCAGGGCGAAGATCTGCTCGATTGCGGAATCTATGTCATTGATAAAATTGATATCAGCGCGCCGCCTTCGACGGTCAATATCGACGGTATCGCCACCTCGGTCAGCAAAGCGCTACGCACCAAAAACAGCCAGGGCTTTGAGGAGACGACGCTTTACGCCATCGCCAGTCGCATCGCGCAAAAACACGGTTTAACGCTGGTGGGCAAGATTGCGCCGCTGACGATTGATCGGGTCACGCAATATGCCGAAACCGATGTGGCGTTTCTCAAACGGCTGGCGAGTGAATATGGCTATACCGTGAAAGTGACGGCGACGGAGCTGATCTTTTCGCATCTGCCGACGCTGCGCTGTCTGGCGCCGGTGAAGACGCTCAGGCGGACGGATGTTTCGCACTACACGTTCAAAGATACCATCAACCGGATCTACAAAAACGCCACCGTGCAGCATCAAAATAGCAAGCAAAAAGAACTGGTTATTTATACCCATGATAGCCAGGAAAAGACCTCGGCGCGCGGTGCGGCGACCAGCGCCGATACCCTGAAGATCAACAGTCGCGCTCCGGATACCGGCGCGGCGCAGGCTAAAGCCAATGCCGCGCTGGACAGCCACAACGAATACCAGCAGACCGGCACGCTCAGCTTGATGGGCTGCCCGCAGTTGACGGCGGGCAACAAGATAGAACTGAGCGATTTTGGCGTACTTTCCGGGCAGTGGCTGATTGATAAATCC
ATGCACAAACTCACGCGCAGCGGCGGCTACACTACCGAAATCGACATTTCACGCGGACCGGCAACCAGCCAGTAAGGAGGCAATATGAAAGGCGTTACCCGCCAGACGGGCATTATCAGCGATATTGATGAGGCGGTCGTGCGCGTCAGAGTCACTCTACCGGAGTGCGATAACCTGCGCAGTAACTGGCTTGCGGTGCTGCAACGCAACACGCAGGACAACAAAGATTACTGGTTGCCGGATATTGGCGAACAGGTGGAGGTTTTGCTCGACGACAACGGCGAAGACGGCGTGGTGCTGGGCGCGGTCTACTCCAGCGTAGATACCGCGCCGCTGGCCTCGCGCGACAAGCGCTACGTGCAGTTTTCCGACGGCGCGGCCTTTGAATATGACCGTGCGTTACACCAGCTCACTGTCAACGGCGGCATAGAAAAAATCGTCATTGAAGTGAAGGAACGTACGCAGCTTACTTCACCGCAAGTAGAGGTCAGGGCGCAGCACGTCACGGTGATATCAGAAACCGTAGACGTGGCGGCCACCTCCGTGGGCGTCAAGGCGGTAGATGTCAACGTGGAAGCGCCCCATACGGGCATTAAAGCGCTGAATGTCACCGTCGATGCGCCGCTCAGCACCTTTACCGGCGACGTTACCGTGATGAAAAAACTCACCTGGCTTGGCGGTATGGCAGGCAGCGGCGGCGTCGGAAACAGCGCGGTTATCACGGGCAACGTAAATGTCCTCGGCAACGTTAACGCCAGCGGCACGCTGATGGACAACGGCGGCAACTCTAACCACCACTCTCACTAACCTGCAAATTGCTGCTGGATGGTGGCTTCGCCTTATCCAGCCTGCAAAAGGTGCATAAACACCGGCCCGGTAAGCGCAGAAGCGCCACCGGGCAAATTGCTGGAGGATATTTATTTTCAGCGCAACGTGATTAGTCCCTTTTCGCGCGCTATTTCCGATGAAAATGTAATCACTTTGCGCTGCAAATATTGCATATATGTATATTGGAAAACTAGCGTTATTTTTTACTTTAACTTCGCCCTGTTTACATAAAATCTGCTGTTCAGGAATGATCCTCTCAGTTTTGTCTGGTAGACTTCGCTGAATTACAACTTCTTGATTGCTATAATGATAAAATTATTTATAAAGTACGTTTCGATAGGCGTACTTAATACTGCTTTGCATTGGGCTATCTTTGCCCTTTGCGTCTATGGATTTCAAACAAGTCAGGCTTTGGCAAACGTAGCGGGCTTTGCTGTCGCTGTCAGTTTTAGTTTTTTCGCTAACGCCCGGTTTACCTTTGGAGCCAGCGTATCAACCGGACGTTATTTGCTGTACGTCGGTTTTATGGGCGTGCTGAGCGCCGTCGTGGGATGGACAGGCGACAAGTGTGCTATGCCTCCCATTTTTACGCTCATTGTATTTTCCGCAATTAGCCTTATTTGCGGATTTTTATATTCCAGATTCATTGTTTTCAGGAATGAGAAATGAAAATTTCATTAGTGGTTCCCGTCTTTAATGAAGAGGACGCGATCCCTATTTTCTATAAAACGGTCAGAGAATACAGTTCACTTAAACCTTATAACGTTGAGATTATCTTCGTTAATGATGGGAGTCACGATGCGACTGAATCAATCATCAGCGCATTAGCTGTTGCCGATCCTCTTGTTGTTCCGATCTCATTTACCCGCAATTTTGGTAAAGAACCTGCACTTTTTGCCGGATTAGATCACGCGACCGGAGATGTGGTGATCCCTATCGATGTTGATTTACAAGATCCCATCGAAGTAATCCCACATTTGATCAATAAATGGCAGGCTGGTGCAGAAATGGTGCTGGCTAAGCGTATCGATCGTTCAACGGATGGCCACCTGAAGCGTAAAAGCGCTGAGTGGTTCTACAGGCTGCATAACAAAATCAGTACGCCAAAGATTGAAGAGAATGTCGGTGATTTTCGATTGATGTCGCGCGAGATTGTAGAAAATATCAAGCTA
TTACCAGAACGTAACCTTTTCATGAAAGGTATACTTTCATGGGTTGGAGGTCAAACAGATGTGGTCGAATATGCCCGTGCTGAACGTGTCGCAGGTAACTCAAAATTTAATGGCTGGAAACTCTGGAACCTGGCGCTGGAGGGGATTACAAGTTTTTCTACTTTCCCTTTGCGTATCTGGACGTATATAGGAGTGAGCGTTTCTGCCCTCTCCCTGATATATGCCATGTGGATGATCATTGATAAATTGATGTGGGGAAACCCTGTTCCTGGTTATCCTTCGCTTATGACCGCGATTCTCTTCTTAGGCGGCATCCAGCTTATCGGCATAGGCATCATGGGTGAATATATCGGACGCGTTTACACGGAGGTGAAGCAAAGACCCCGCTATATCGTGAAAAACAAAAAAACAATGATGGAATAATGATTACTATGCTCAAGATATTACCGAAAACGGCGATGATACTACTGGCTTTTTTGGCCATTTTTCTTATTGAATGGTATACCCCCATTCACTCTGATGATTACCGCTATTACCTTTTAGGAATTTCGCCGGAATCACATTTTCATCATTATATGACCTGGAGTGGCAGGATTATAGCTGATTACACCAGCGCACTCATCCTGTATACACGTTCTCAACTCGTGTATTCCATCAGCGCTGCCGTTTCGACACTGGTATTTTGTTATTTCATTGTGAAGACACCCTCAGGTACATTACGCTGGAATAAATCCGACTACTTATTATTCCCACTAATATTCTTCACTTACTGGATTTCGAACCCGAATTTGGGTCAAACCACTTTCTGGATCGTTGGTGCTGCGAATTATTTGTGGACGAATCTGTTCGTTGTTGTATGGCTGTTCTTCTTTTACACCATAACAATAAAAAACAGTAAAGCGATCAGCCCGTGGGTTGCATTACTAAGCTTTATGGCAGGCTGTTCCAATGAAAGCGTCTCACCTTTCGTCTCGCTTATTTCTGTTCTGGCCATTGCATACGAGTTATGGCAAAACAAATCTGTTTCGCGCAATAAGATAGTTTATAGTCTCTGTGCAATCGCAGGTTCATGCGTATTGATACTTTCTCCGGGCAATTTCATCCGCGCCAGCGGCAAAGAATTCTGGTATGGAAGGCCGATTTTTGAACGTATTTTCATTCACTTAACAGAACGCGTTCATAACCATCTGGCGCTGATCTGGATAGCTTATGTTGTTTTGTTATTGCTGGTCTTACTGGTCATATTCAATAAGCAGATTCGCGCCAAAATTGATAAAACGTCCCTTATCTGCGCTGCGTTAGTCGTATGTATAGGTATTAGCACTTCCTTAATCATGTTCGCGTCGCCGTCCTACCCCGATCGGGTTATGAACGGTACGTTTATGTTTTTCCTTTTAGCTATCTCCTTCATCGCTTACGCCCTGTTGAAAAGTGGCGTTAAGGCTGGAGTCGTCGGCGTAACTGCCGTGACTGTCCTCTGTGGTATCGTATTCCTTTGGTCCTATTCATTGATGCTTAACGGTTATAAAAAAACGGCCGGACAGGAAATCGTAAGACAAGAAATCATTACTAAAGAAATAGCGGCAGGTAAACAGAAGTTTATCATCCCTGACTATTATTTCGTCAAGTTGCAAAATAGCGGTGGTCATTTTGGTTTATTCCATGATCCTGCTGTTTACGGCGAGTATTATCATGTACAAGCTATTTTCAAAAAGAAAGTCAATTTTGATTATTCTGTAATCGCTAATGGAGCGAAGCACAGCCTTTCCAATGAAACGACGGCTTATAGCAACACCCGCGGGGATTTCGCTATTATCAGCCGGGAGCAGCTAACGGGTTCGATCACACTCTCGGTTAATGGACGGCAGAAAACGATTCCAGTTGAAAAAATGAAGCACGCAGAAATCAATGATGAATTCTGGTACTACGCTTCTGTAGGCAAAGGTGAAATTACAGCAATTTCATTTTAACTTTACGTAAAACGCGATCTTCGCCATTTA
ACAAAATGTGCATCAACACAGGCCCGGTAAGCGCAGAAGCGCCGCCGGGCAAAACACATTCTGACCCCGCCATCAATTATTTCCTTAAAGCGCTTTAATATCTCTCCCCCCGCCGGAAGGCGAAAATAGCCTCATGAACACGAAAACACGACCCTCGACCCTGCACTGGCAACCTGCCTTGCAACGTCCTGAAGAATACGTCTGCGGGCTGGATGATATTCATCAGGCAATACACATCATTCTGCGCACGCCGCGCGGCAGCGATCCCCACAGGCCGCTTTTTGGCAGCAATCTGTGGCGCTATATCGATTACCCGATCGAGCGGGCCATTCCGCACGTTGTTCGGGAGTCGGTGGAAGCGATTCGCATGTGGGAACCCCGCTGCCGGTTGCTGAAGGTGACGCCGACGATTGACGGCGAACACCTGACGTTACGCGTGCAATGGCGCGCCGCAGACGGCGTAATCAACTCAACGGAGGTGTTATGGCGATAGCCGAACCCGACTTTATTGACCGCGATCCCGCGCAAATCACCAGCGAGATGATTGCGCAATATGAAGAAGCCAGCGGTAAAAAACTCTATCCGGCGCAGGCTGAGCGGCTGCTCATTGACCTGTTTGCTTATCGTGAAAACCTTGTCCGCATCGCCATCCAGGAGGCAGCGAAGCAAAACCTGGTCGCGTATTCCCGTGCACCGATGCTGGATTATTTAGGCGAGCTGGTTGGCGTTCACCGTCTGCCCGCTCAGGCGGCAAAAACCACGCTGCAGTTTTCTGTTACTCAAGCGGCTAAAAGTAACCTGGTGATTCCACAGGGTACCCGCGCCAGCGCGTCGGATAGCGTGATGTTCGCCACCGACGAAGATGTTCTGTTGCCTGCGGGCAGCCTGAGCGTTGCGGTAACTGCAACCTGTGTAGTGACCGGTGAACCCGGCAACAACTGGCAGCCTGCGCAAATCAGCGCGCTGGTAGACCGCGTGGGCAATTACGATATCAGCGTCACCAATCTGACGGCCTCAAGTGGCGGCTGCGGCGAAGAGAACGACGACGCACTACGTAAACGCATCCAGCTAGCGCCGGAAAGTTTCAGCAACGCGGGCAGCTATGGCGCCTATCGCTTCCATACGCTCTCGGTCAGCCAGTCGATTATCGACGTGGCGGTACTGGGGCCGGATGAAGGGCTGGCGGAAGGCTGCGTGGAACTCTATCCGCTGACCCTGAACGGTCTGCCGGGGCCGGAGCTTCTTGCCCAGATCGAACGGGAGGTGAGCAAAGAGAAAAAGCGCCCGCTAACCGATAAGGTGAGCGCTAAATGTTCTCCGCGCATGGCTTATCAGATCAGCGCCCGGCTGACGCTGTTTACCACCGCCGATCAGGAGACGACGCTTGCCGCCGCGCGTGAAGCGATTAATACATGGACGCGCTCGCGCCAGACCCGGCTGGGCCAGGACATTGTGCCAAACCAGATAATTAAAGTGCTACAAGTGGATGGCGTTTACGACGTCGCGCTGGATATGCCCGCGAAAAAGGTATTGCAGGCGCACGAATGGGCGGAATGTACGGCTATTGACGTGACGATTGCCGGAGTCAGCGATGGATAAACTGCTTCTGCCGCCGCCGCTGGCCAGCGACGAACGTTTCTCAATTCTGGCGAACATTGCCGCCGAACGTTTCGCGCAAATCGACCTGACGGCGTTGCTGGTCTATCTGGTGGATATCGTTGATGCCTCGGCATTGCCCTCGTTGGCCGAACAGTTTCATGTACAGGGGCTCGAAGGCTGGCTATTTGCTGCCAATGAACAGGAGAAACGAGAGTTAATTAAGCAGGCGATTGAACTGCATAAATATAAAGGAACCCCCTGGGCCGTTCGCCGCGCACTGGAAATATTATCCTTACCCGGCACGATCTCCGAATGGTTTGAGTATGGAGGTAAGGCTTATTTCTTCAAGGTTGAAATTAAGCTAATCAACCAGGGCATGGATGAAAATCTGTTTAATAATC
TGGTCGATCTTATTCATGAATATAAGAACGTGCGTTCAAAACTGGAAGCGTTAATTGTCTGGATAATTAACCAAAGCGCTATTCCTGTTATTGGCAGCGCGCTTTACGGTGGAGAAATAACGACCGTCTTACCCTTCCAGGTTCTGGAAGTTCAACAAACTAAACCGATCTATTTCGGTACAGGGCAATGGAGCCTTGAAATTACATCTATTTACCCGGAGTAATTATGGATAATGAGTTTTATACCCTCCTGACCGACAGGGGAATGGCGAAAATCGCCAGCGCCCTTGCGGATAAAAAACAGCTACATCTGCAAAAGATGGCGGTTGGCGACGGCGGCGGACAATATTATGAACCGACCGCCAGCCAGACCAATTTACGCCACGAAGTCTGGCGCGGCGAGATGAATACGCTGACCGTTGCGCCGAATAATCCTAACTGGCTGATTGCCGAGTTGGTGCTGCCGGAAGAGGTTGGCGGCTGGTACGTGCGTGAAGTGGGCGTGTTCGACAACGAGGGCGAGCTAATCGCCATCGGCAAATTCCCGGAATCCTACAAACCGCTGCTGCCGGGCGGCTGCGGCAAGCAGGTCTGTATCCGCCTGATTATGGAAGTCTCCAACACCACGGCGGTGACGCTGACGGTCGATCCGAGCATTGTGCTGGCGACGCGCGACTATGTGGATGTCCGGCTGGACGAGCATGAACATTCGACAAATCACCCGGATGCGACATTAACGCAGAAAGGCTTTACACGGCTCAGTAACGCCACTGACAGCGATGACGAGACCAAAGCGGCTACGCCAAAGGCGGTCAAGGCGGCGATGGCGGAAGCGCGTAATCACACGCATACCTGGAACCAGATTACCGGCGTTCCGGACGGTACGCTGACGCAAAAGGGGATTGTTAAGCTTAACAGTGCGACGGACAGCACCAGCACAACGGAAGCGGCAACGCCGAGCGCGGTAAAGGCGGCGATGGATAAGGCGAATGCGGCAGCTCCGGCGAACCATACTCACGTCTGGAACCAGGTTACCGGCGTCCCGGACGGCACGCTGACGCAAAAAGGGATCGTGAAACTTAACAGCGCGACGGACAGCACCAGTACGACGGAGGCGGCGACGCCGAGCGCGGTAAAGGCGGCGTATGACAAGGCGAGCGCAGCGGCCCCGGCCGGCCATACTCACTCCTGGGGGCAGATCACCGGCACCCCGGACGGTACGCTGACGCAAAAAGGGATCGTGAAGCTTAATAGCGCCACCGACAGCACCAGTACGACGGAGGCGGCGACGCCGAGCGCGGTGAAAGCGGCGTATGACCTGGCGAATGGGAAGGCGGCGGGGAGTCACAAACATGCGTGGGGGGATATTACCGACGTGCCGGATGGGACTACGGCGCAGAAAGGGATCGTAAAGCTCAACAGTGCAACGAACAGCACCAGTACGACGGAGGCAGCGACGCCGAGCGCGGTAAAGGCGGCGTATGATTTGGCAAAAAGCAAAACCTCTGCAACGAATATATATACCAGGACACAATCTGATGCACGATACGTGCAAAATGTTATGTTAGGTGCAGAGGTACAAGCACCAACAATGGCACCTGCTGGATGTGTAATAACATTTGTTGATGGTGGTGATAAAATGGAATGTGTGAGATATAAACCACTTCAGATTAACATCAACGGTTTTTGGCGAACTATTTCAGGATAAGGAAAAAAAATGCAATTAAGAAATTTCACACGTTATTACCCAGAACATATGCCGTTTGGAGAAAATATACAATACTTTATTGATGAAAACGGCTTAGATTTTTATAATTCAATAGATACTTTTAAACTAAAATACAAGCTATGTATTCACCCTGACACAAAAGTTATTCACTCTGTGAGTGAAGATATTTCAACGTTATATCCAGCAGGCTTTGATATTGTTGAATCCGACAGTTTACCATATGATGATATCATTTCTGGAAAATATCAATTTGTAGATAACAAAATAATAC
CCAGGACATATAATGAAGTAGAACTTACTCAAATCACCAATGCAGAAAAATCAAAAAAACTGAAACTAGCAAATGAAAAAATAAGACCATTACAAGATGCTGTAGACCTTGGAATAGCCACTGACGAAGAGATACAAAAATTGGGTGCATGGAAAAGGTATCGAGTTGAAATCAATAGGATTGATACCAGTAACTTACTCGACATTAGCTGGCCTTTACCTCCAGATGTATAA
Protein sequences of DBSCAN-SWA_9 >NZ_CP014620|4335620:4353775|4338564_4339992_+|WP_023242909.1|tail|DBSCAN-SWA MAANYLHGVETIEIETGPRPVKAVKSAVIGLIGTAPCGPVNQPTLCLSESDAAQFGPGLANFTIPQALKAIYDHGAGTVVVINVLNPAVHKSTIPSETVKVDDNGQIQLKHGAVQTMSIGRSTNAGNAYIKGTDYTIDMLTGKITCMGTNLKPGVQAYVNYTYADPTKVTAADIVGDVNTAGDRTGMKLLQDTWNQFGFYAKILIAPVFCTQNSVAVKLIAQAEALGAITYIDAPIGTTFQQVLAGRGPQGAINFNTSSDRARLCYPHVKVYDSATNAEVLEPLSSRAAGLRAKVDLEKGFWWSNSNQEIQGITGVERSLSAMIDDPQSEVNQLNENGITTIFNSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESIRYFSQQYMDMPINQALIDALTESVNTWGRKLIADGALLGFECWYDPARNEQTELAAGHLLLSYKFTPPPPLERLTFETEITSEYLVSLESNR >NZ_CP014620|4335620:4353775|4335620_4335968_-|WP_000615248.1|DBSCAN-SWA MIQKSKEMVRLPEIINDLAFHASQVLIESMNIDSASAENAGQAIADRMMRNWGGQSIYFPKGISGRASERDYQIYSECDGRNYAELAKKYNLTLQWIYKIVKRVHTEKQHQRRML >NZ_CP014620|4335620:4353775|4353259_4353775_+|WP_023244443.1|tail|DBSCAN-SWA MQLRNFTRYYPEHMPFGENIQYFIDENGLDFYNSIDTFKLKYKLCIHPDTKVIHSVSEDISTLYPAGFDIVESDSLPYDDIISGKYQFVDNKIIPRTYNEVELTQITNAEKSKKLKLANEKIRPLQDAVDLGIATDEEIQKLGAWKRYRVEINRIDTSNLLDISWPLPPDV >NZ_CP014620|4335620:4353775|4339991_4340516_+|WP_000907495.1|tail|DBSCAN-SWA MAGKIQINRITNANIYLDGNNLLGRASEIKLPDISMIMQEHKALGMVGKIELPAGFDKLEGEIKWNSFYHDVMRKTANPWQAVALQCRSSIDCYNSQGKADQLALVTHMTVMFKKNPLGTFKQNENPEFSSAFGCTYIKQVVDGETLLELDYLANIFRVNGTDQLNAYRNNIGG >NZ_CP014620|4335620:4353775|4340844_4340973_+|WP_001185654.1|tail|DBSCAN-SWA MALLARWFRFPPSEIDALSVDDFTCWLDEASAQIKHEYDSQA >NZ_CP014620|4335620:4353775|4351133_4351766_+|WP_001749149.1|tail|DBSCAN-SWA MDKLLLPPPLASDERFSILANIAAERFAQIDLTALLVYLVDIVDASALPSLAEQFHVQGLEGWLFAANEQEKRELIKQAIELHKYKGTPWAVRRALEILSLPGTISEWFEYGGKAYFFKVEIKLINQGMDENLFNNLVDLIHEYKNVRSKLEALIVWIINQSAIPVIGSALYGGEITTVLPFQVLEVQQTKPIYFGTGQWSLEITSIYPE >NZ_CP014620|4335620:4353775|4336833_4337439_+|WP_001270438.1|DBSCAN-SWA MRYQYVCLVCAMTFLSADAAEPPRASLQWRNEVIRTAREIWGLNAPVADFAGQLHQESGWAPDALSPAGAQGMAQFMPATAKWVSQLYPALRENKPFNPAWAIRALVQYDRQLWKSVSAKNSCQRMAFTLSAYNGGQGWVNRDKKLAAAKGLDASIWFEHVERVNAGRSAANWRENRHYPKAILYQHAPRYLQWGQASCIH 
>NZ_CP014620|4335620:4353775|4351768_4353250_+|WP_000368203.1|tail|DBSCAN-SWA MDNEFYTLLTDRGMAKIASALADKKQLHLQKMAVGDGGGQYYEPTASQTNLRHEVWRGEMNTLTVAPNNPNWLIAELVLPEEVGGWYVREVGVFDNEGELIAIGKFPESYKPLLPGGCGKQVCIRLIMEVSNTTAVTLTVDPSIVLATRDYVDVRLDEHEHSTNHPDATLTQKGFTRLSNATDSDDETKAATPKAVKAAMAEARNHTHTWNQITGVPDGTLTQKGIVKLNSATDSTSTTEAATPSAVKAAMDKANAAAPANHTHVWNQVTGVPDGTLTQKGIVKLNSATDSTSTTEAATPSAVKAAYDKASAAAPAGHTHSWGQITGTPDGTLTQKGIVKLNSATDSTSTTEAATPSAVKAAYDLANGKAAGSHKHAWGDITDVPDGTTAQKGIVKLNSATNSTSTTEAATPSAVKAAYDLAKSKTSATNIYTRTQSDARYVQNVMLGAEVQAPTMAPAGCVITFVDGGDKMECVRYKPLQININGFWRTISG >NZ_CP014620|4335620:4353775|4341069_4343424_+|WP_023242910.1|tail|DBSCAN-SWA MANDIITQLQARNETLTQAIARYGSLNASTLHTLSFEQTKITRLTQQLANSALRREENDKQRAGLLEKTQTFAGQFGKLLNVETPDWKLPYEFQGNMVDMAAKGGMDNTARDALSLNIRDWSLDFNQDQKDLQSAAATMIEGGVSALQDLSRYMPDIAKAATASRDSAQSWAQAALATRDKLNIAPDDFRFAQNMLYSVAKSGGGSVAEQTQWINAFARKTGTQGKEGIAELTATMQIAMKNAPDAGAAAANFDHFLKSTFSKETDSWFARQGVDLQGSLLEHQQNGIGVTEAMAHIVQMQLEKMNPQILDTFRQTMKIEDLSARGDALQAMTEKFNLGAMFGDAQTRDFLAPMLANMDEYRQLKASAMQAAGQNFIDDDFAAKMTSPKEQTKALQLSLNDLWLTVGLELMPAIGELAQSITPLVRQFSAWLRENPALVQGVAKVVGVIWLFNGALNILRLGANLIASPFIRLIDIFLKVKAGLALGGGSRALSVLKSFGNGAKSLTVLLGNGLIKGLRLVGQAFIWLGRALLMNPVGLTITAIAGAAYLLYRYWEPISGFFAGVWERIKTAFDGGIAGVTRLILDWSPLGLFYRAFASVLDWFGIELPASFSEFGGNILDSLINGILNALPFLNGAIEKIKALIPDWAKSALGISAEMPSVAAAVPGIAGTMVAQQTSAPLASGAKAVTTSAKTMASPQPMKTNSAATPPTPAALPGKSGGKPYTLPSRAQSNVQVHFSPQVTMQGSGANVAKDINNVLSLSKRELERMINDVMAQQRRREYA >NZ_CP014620|4335620:4353775|4343423_4344377_+|WP_023242911.1|DBSCAN-SWA MYAVLGEIEFDVVAYWDEFESTMGVDYTSHARIEGKPGVQFIGDKLDKITLKFNFHSQYCQPTTELNRLREAMTAHQAMALVFGNGDYRGWFVITDLTATHQHTDPYGNVIAQGGTLSLQEYTGDPKSPLLPPAITTQEPNIDEMLDELPDVSDSWFDELLSVVEEGMREAKEMMDEVADAIDDIKKTIAQAKELVKEAKALKEKCGDIVDSLKKTISAIDALFQQPLDLQTLAGLPKALAAKMQELIDSLPGIRECAGDAGTLIEHAESLFDAITSSVAEATYDSAATLVNQARGTLQTSAPDVSQLAAADITRSL >NZ_CP014620|4335620:4353775|4340567_4340885_+|WP_001003640.1|tail|DBSCAN-SWA 
MNEKYTLQFPFTSAAGERIDVLQLRRLKVKDMRAARRASDKPEEWDEPLMAAMTGLVTEDLAEMDLLDYQALQKRFQAMLSMATEPTATVAGNGAAGEVVSLSAQ >NZ_CP014620|4335620:4353775|4349675_4350035_+|WP_001093501.1|plate|DBSCAN-SWA MNTKTRPSTLHWQPALQRPEEYVCGLDDIHQAIHIILRTPRGSDPHRPLFGSNLWRYIDYPIERAIPHVVRESVEAIRMWEPRCRLLKVTPTIDGEHLTLRVQWRAADGVINSTEVLWR >NZ_CP014620|4335620:4353775|4337925_4338381_+|WP_000449433.1|DBSCAN-SWA METLSVIHTVANRLRELNPDMDIHISSTDAKVYIPTGQQVTVLIHYCGSVFAEPENTDATVQKQLIRISATVIVPQISDAINALDRLRRSLGGIELPDCDRPLWLESEKYIGDAANFCRYALDMTASTLFIAEQESKDSPLLTIVNYEEIQ >NZ_CP014620|4335620:4353775|4350025_4351141_+|WP_001749150.1|DBSCAN-SWA MAIAEPDFIDRDPAQITSEMIAQYEEASGKKLYPAQAERLLIDLFAYRENLVRIAIQEAAKQNLVAYSRAPMLDYLGELVGVHRLPAQAAKTTLQFSVTQAAKSNLVIPQGTRASASDSVMFATDEDVLLPAGSLSVAVTATCVVTGEPGNNWQPAQISALVDRVGNYDISVTNLTASSGGCGEENDDALRKRIQLAPESFSNAGSYGAYRFHTLSVSQSIIDVAVLGPDEGLAEGCVELYPLTLNGLPGPELLAQIEREVSKEKKRPLTDKVSAKCSPRMAYQISARLTLFTTADQETTLAAAREAINTWTRSRQTRLGQDIVPNQIIKVLQVDGVYDVALDMPAKKVLQAHEWAECTAIDVTIAGVSDG >NZ_CP014620|4335620:4353775|4347035_4347965_+|WP_000703633.1|DBSCAN-SWA MKISLVVPVFNEEDAIPIFYKTVREYSSLKPYNVEIIFVNDGSHDATESIISALAVADPLVVPISFTRNFGKEPALFAGLDHATGDVVIPIDVDLQDPIEVIPHLINKWQAGAEMVLAKRIDRSTDGHLKRKSAEWFYRLHNKISTPKIEENVGDFRLMSREIVENIKLLPERNLFMKGILSWVGGQTDVVEYARAERVAGNSKFNGWKLWNLALEGITSFSTFPLRIWTYIGVSVSALSLIYAMWMIIDKLMWGNPVPGYPSLMTAILFLGGIQLIGIGIMGEYIGRVYTEVKQRPRYIVKNKKTMME >NZ_CP014620|4335620:4353775|4346676_4347039_+|WP_000593182.1|DBSCAN-SWA MIKLFIKYVSIGVLNTALHWAIFALCVYGFQTSQALANVAGFAVAVSFSFFANARFTFGASVSTGRYLLYVGFMGVLSAVVGWTGDKCAMPPIFTLIVFSAISLICGFLYSRFIVFRNEK >NZ_CP014620|4335620:4353775|4336543_4336831_+|WP_001226440.1|DBSCAN-SWA MRKIIVPRLSGWLMASVVLFALIGWTSSAQIPVVIYKLSLVSLSAVLGYWLDRSLFPWARPDSFCPWEESLCCAAAMIRRAIIVAAICLAVALGL >NZ_CP014620|4335620:4353775|4338377_4338575_+|WP_023242908.1|DBSCAN-SWA MKYIYSGPASGVTLADGQEVLLWPNSEISLPEDNEWVITMIARRHLTPVVTQEVETNEEEIVHGS >NZ_CP014620|4335620:4353775|4345626_4346349_+|WP_000679393.1|plate|DBSCAN-SWA 
MKGVTRQTGIISDIDEAVVRVRVTLPECDNLRSNWLAVLQRNTQDNKDYWLPDIGEQVEVLLDDNGEDGVVLGAVYSSVDTAPLASRDKRYVQFSDGAAFEYDRALHQLTVNGGIEKIVIEVKERTQLTSPQVEVRAQHVTVISETVDVAATSVGVKAVDVNVEAPHTGIKALNVTVDAPLSTFTGDVTVMKKLTWLGGMAGSGGVGNSAVITGNVNVLGNVNASGTLMDNGGNSNHHSH >NZ_CP014620|4335620:4353775|4344573_4345617_+|WP_023244444.1|DBSCAN-SWA MAEITVSGGVFATLTPIFTLWYGHKEITYDIAPYVTSISYSDSIKNESDVIAIALEDSAGRWVNEWYPGKGDTLALRLGYQGEDLLDCGIYVIDKIDISAPPSTVNIDGIATSVSKALRTKNSQGFEETTLYAIASRIAQKHGLTLVGKIAPLTIDRVTQYAETDVAFLKRLASEYGYTVKVTATELIFSHLPTLRCLAPVKTLRRTDVSHYTFKDTINRIYKNATVQHQNSKQKELVIYTHDSQEKTSARGAATSADTLKINSRAPDTGAAQAKANAALDSHNEYQQTGTLSLMGCPQLTAGNKIELSDFGVLSGQWLIDKSMHKLTRSGGYTTEIDISRGPATSQ >NZ_CP014620|4335620:4353775|4337451_4337766_+|WP_000777266.1|DBSCAN-SWA MKLSIDFWEVISLLLSFVGLMFAAGKLLLAQIEKRLNERFEALEAARRESEAGWSRLEREFLEFRADLPLHYVRREDYLRGQAVLEAKLDALYSKIELIQRGNH >NZ_CP014620|4335620:4353775|4347964_4349512_+|WP_000632053.1|DBSCAN-SWA MITMLKILPKTAMILLAFLAIFLIEWYTPIHSDDYRYYLLGISPESHFHHYMTWSGRIIADYTSALILYTRSQLVYSISAAVSTLVFCYFIVKTPSGTLRWNKSDYLLFPLIFFTYWISNPNLGQTTFWIVGAANYLWTNLFVVVWLFFFYTITIKNSKAISPWVALLSFMAGCSNESVSPFVSLISVLAIAYELWQNKSVSRNKIVYSLCAIAGSCVLILSPGNFIRASGKEFWYGRPIFERIFIHLTERVHNHLALIWIAYVVLLLLVLLVIFNKQIRAKIDKTSLICAALVVCIGISTSLIMFASPSYPDRVMNGTFMFFLLAISFIAYALLKSGVKAGVVGVTAVTVLCGIVFLWSYSLMLNGYKKTAGQEIVRQEIITKEIAAGKQKFIIPDYYFVKLQNSGGHFGLFHDPAVYGEYYHVQAIFKKKVNFDYSVIANGAKHSLSNETTAYSNTRGDFAIISREQLTGSITLSVNGRQKTIPVEKMKHAEINDEFWYYASVGKGEITAISF >NZ_CP014620|4335620:4353775|4344376_4344586_+|WP_001269716.1|DBSCAN-SWA MRYLEHVTTDGERWDNLAWRYYGDALAYERIIAANPHVAIMPVLPSGVRLIIPVISVTQTTPELPPWLR |
23 | Burkholderia_phage(45.0%) | tail,plate | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
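The keyword-based highlighting described above can be approximated with a short script. This is a minimal sketch, not the database's own implementation: the field layout of the FASTA headers (contig, prophage region, gene coordinates and strand, protein accession, optional keyword label, prediction method) is inferred from the records shown on this page, and the case-insensitive substring matching is an assumption. The function names `parse_header` and `matched_keywords` are hypothetical.

```python
# Keyword list copied from the page text; the matching rule is an assumption.
PHAGE_KEYWORDS = ("capsid", "head", "integrase", "plate", "tail", "fiber",
                  "coat", "transposase", "portal", "terminase", "protease",
                  "lysin", "tRNA")


def parse_header(header):
    """Split a header such as
    '>NZ_CP014620|4335620:4353775|4338564_4339992_+|WP_023242909.1|tail|DBSCAN-SWA'
    into named fields. Field meanings are inferred from the page, not documented."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, accession = fields[:4]
    method = fields[-1]
    # The keyword label is optional: headers have either 5 or 6 fields.
    label = fields[4] if len(fields) == 6 else ""
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "accession": accession,
        "label": label,
        "method": method,
    }


def matched_keywords(label):
    """Return the phage-related keywords found in a label
    (case-insensitive substring match; an assumption)."""
    lowered = label.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    rec = parse_header(
        ">NZ_CP014620|4335620:4353775|4338564_4339992_+|WP_023242909.1|tail|DBSCAN-SWA"
    )
    print(rec["accession"], matched_keywords(rec["label"]))
```

A protein would be treated as "colored" when `matched_keywords` returns a non-empty list. Headers without a label field, such as the one for WP_000615248.1, yield an empty list.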
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|