Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP041972 | Salmonella enterica subsp. enterica serovar Enteritidis strain NCCP 16206 plasmid unnamed, complete sequence | 0 crisprs | cas14j | 0 | 0 | 0 | 0 |
NZ_CP041973 | Salmonella enterica subsp. enterica serovar Enteritidis strain NCCP 16206 chromosome, complete genome | 2 crisprs | DEDDh,cas3,DinG,WYL,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK | 0 | 13 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041973_1 | 2894977-2895493 | TypeI-E |
I-E
Consensus repeat of NZ_CP041973_1
|
8 spacers
spacers of NZ_CP041973_1
>1.1|2895006|32|NZ_CP041973|CRISPRCasFinder,CRT TATTTATAAGCGTGTCATCTATGCAACCCAAC >1.2|2895067|32|NZ_CP041973|CRISPRCasFinder,CRT ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA >1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT GGCCGCTGGTCAAATTCCCAATCTGAGCAATC >1.4|2895189|32|NZ_CP041973|CRISPRCasFinder,CRT ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC >1.5|2895250|32|NZ_CP041973|CRISPRCasFinder,CRT GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC >1.6|2895311|32|NZ_CP041973|CRISPRCasFinder,CRT ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG >1.7|2895372|32|NZ_CP041973|CRISPRCasFinder,CRT GAGAATGCTCATGCGCGTGAGCGCCATATATT >1.8|2895433|32|NZ_CP041973|CRISPRCasFinder,CRT AGGCGGACCGAAAAACCGTTTTCAGCCAACGT >1.9|2895006|34|NZ_CP041973|PILER-CR TATTTATAAGCGTGTCATCTATGCAACCCAACCG >1.10|2895067|34|NZ_CP041973|PILER-CR ACCTGCCCGACCCAATAAGGGGGCCCTCGTGACG >1.11|2895128|34|NZ_CP041973|PILER-CR GGCCGCTGGTCAAATTCCCAATCTGAGCAATCCG >1.12|2895189|34|NZ_CP041973|PILER-CR ATAGCCCCGGCAGCGATAGCTAAACCAGTTCCCG >1.13|2895250|34|NZ_CP041973|PILER-CR GCCTCAAAATCTCTCGGTGAGATGTAAGCGTCCG >1.14|2895311|34|NZ_CP041973|PILER-CR ACCAGTGGTCAGCGGCGGATGAATTTGCCCTGCG >1.15|2895372|34|NZ_CP041973|PILER-CR GAGAATGCTCATGCGCGTGAGCGCCATATATTCG >1.16|2895433|34|NZ_CP041973|PILER-CR AGGCGGACCGAAAAACCGTTTTCAGCCAACGTCG |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP041973_1
The CRISPR arrays of NZ_CP041973_1 >merge|NZ_CP041973|1|2894977-2895493|CRISPRCasFinder,CRT,PILER-CR GTGTTTATCCCCGCTGACGCGGGGAACACTATTTATAAGCGTGTCATCTATGCAACCCAACCGGTTTATCCCCGCTGGCGCGGGGAACACACCTGCCCGACCCAATAAGGGGGCCCTCGTGACGGTTTATCCCCGCTGGCGCGGGGAACACGGCCGCTGGTCAAATTCCCAATCTGAGCAATCCGGTTTATCCCCGCTGGCGCGGGGAACACATAGCCCCGGCAGCGATAGCTAAACCAGTTCCCGGTTTATCCCCGCTGGCGCGGGGAACACGCCTCAAAATCTCTCGGTGAGATGTAAGCGTCCGGTTTATCCCCGCTGGCGCGGGGAACACACCAGTGGTCAGCGGCGGATGAATTTGCCCTGCGGTTTATCCCCGCTGGCGCGGGGAACACGAGAATGCTCATGCGCGTGAGCGCCATATATTCGGTTTATCCCCGCTGGCGCGGGGAACACAGGCGGACCGAAAAACCGTTTTCAGCCAACGTCGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP041973|1|1|2894977-2895493|CRISPRCasFinder GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP041973|1|1|2894977-2895493|CRT GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP041973|1|1|2894979-2895493|PILER-CR GTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAACCG GTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGACG GTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATCCG GTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCCCG GTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTCCG GTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTGCG GTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATTCG GTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGTCG GTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP041973.1|WP_000490481.1|2893915_2894962_+|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >NZ_CP041973.1|WP_000372384.1|2892756_2893665_-|sulfate-adenylyltransferase-subunit-CysD MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >NZ_CP041973.1|WP_001092251.1|2891307_2892747_-|sulfate-adenylyltransferase-subunit-CysN MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYCEETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >NZ_CP041973.1|WP_001173663.1|2890715_2891321_-|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >NZ_CP041973.1|WP_001118109.1|2890341_2890698_-|DUF3561-family-protein MPGMVKVTGFNMRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >NZ_CP041973.1|WP_000517480.1|2889839_2890151_-|cell-division-protein-FtsB MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >NZ_CP041973.1|WP_000741653.1|2889110_2889821_-|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLTISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >NZ_CP041973.1|WP_001219253.1|2888631_2889111_-|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRISYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >NZ_CP041973.1|WP_000134246.1|2887585_2888635_-|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >NZ_CP041973.1|WP_001221538.1|2886843_2887605_-|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW >NZ_CP041973.1|WP_001518648.1|2895589_2895883_-|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >NZ_CP041973.1|WP_000144830.1|2895882_2896803_-|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >NZ_CP041973.1|WP_000281483.1|2896799_2897450_-|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQEQPAASTIFDVQTRPFAPMLSAGQTLRFNLRANPTICKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >NZ_CP041973.1|WP_000085115.1|2897431_2898178_-|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGNAADVLREAYRWYQDQFNALKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >NZ_CP041973.1|WP_000206417.1|2898188_2899247_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP041973.1|WP_000117945.1|2899260_2899815_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP041973.1|WP_000368579.1|2899811_2901368_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP041973.1|WP_159413259.1|2901379_2904043_-|CRISPR-associated-helicase-Cas3' MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATASAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP041973.1|WP_001145541.1|2904486_2905440_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP041973.1|WP_000039870.1|2905527_2906262_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041973_2 | 2911645-2912284 | TypeI-E |
I-E
Consensus repeat of NZ_CP041973_2
|
10 spacers
spacers of NZ_CP041973_2
>2.1|2911674|32|NZ_CP041973|CRISPRCasFinder,CRT GGCTACACGCAAAAATTCCAGTCGTTGGCGCA >2.2|2911735|32|NZ_CP041973|CRISPRCasFinder,CRT CCGATTAAGATCCGCAGTCTGCATCAGTAACT >2.3|2911796|32|NZ_CP041973|CRISPRCasFinder,CRT CGATTCTACGGCAACAGGCCAGGCTGCGACCG >2.4|2911857|32|NZ_CP041973|CRISPRCasFinder,CRT ATCAAACATGGAAACCCCTTTAATGAGAGCAA >2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG >2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT >2.7|2912041|32|NZ_CP041973|CRISPRCasFinder,CRT TCATGCGCTATAAAAATCAGACTGTCACATGC >2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT TGATTATTGACGACAACAGCACAGACCGGCAG >2.9|2912163|32|NZ_CP041973|CRISPRCasFinder,CRT AATAATCGGCAATTTGTCCTGGACAGGCACGG >2.10|2912224|32|NZ_CP041973|CRISPRCasFinder,CRT GAATCTGGAGGCCAACAGCGCGGCGAAATCCT >2.11|2911735|34|NZ_CP041973|PILER-CR CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG >2.12|2911796|34|NZ_CP041973|PILER-CR CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG >2.13|2911857|34|NZ_CP041973|PILER-CR ATCAAACATGGAAACCCCTTTAATGAGAGCAACG >2.14|2911918|35|NZ_CP041973|PILER-CR TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCG >2.15|2911980|34|NZ_CP041973|PILER-CR GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGG >2.16|2912041|34|NZ_CP041973|PILER-CR TCATGCGCTATAAAAATCAGACTGTCACATGCCG >2.17|2912102|34|NZ_CP041973|PILER-CR TGATTATTGACGACAACAGCACAGACCGGCAGCA >2.18|2912163|34|NZ_CP041973|PILER-CR AATAATCGGCAATTTGTCCTGGACAGGCACGGCA >2.19|2912224|34|NZ_CP041973|PILER-CR GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA |
cas3,cas8e,cse2gr11,cas7 |
CRISPR arrays and Neighbor proteins around NZ_CP041973_2
The CRISPR arrays of NZ_CP041973_2 >merge|NZ_CP041973|2|2911645-2912284|CRISPRCasFinder,CRT,PILER-CR ACGGCTATCCTTGTTGGCGCGGGGAACACGGCTACACGCAAAAATTCCAGTCGTTGGCGCACGGTTTATCCCCGCTGGCGCGGGGAACACCCGATTAAGATCCGCAGTCTGCATCAGTAACTCGGTTTATCCCCGCTGGCGAGGGGAACACCGATTCTACGGCAACAGGCCAGGCTGCGACCGCGGTTTATCCCCGCTGGCGCGGGGAACACATCAAACATGGAAACCCCTTTAATGAGAGCAACGGTTTATCCCCGCTGGCGCGGGGAACACTCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCGGTTTATCCCCGCTGGCGCGGGGAACACGCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGGGTTTATCCCCGCTGGCGCGGGGAACACTCATGCGCTATAAAAATCAGACTGTCACATGCCGGTTTATCCCCGCTGGCGCGGGGAACACTGATTATTGACGACAACAGCACAGACCGGCAGCAGTTTATCCCCGCTGGCGCGGGGAACACAATAATCGGCAATTTGTCCTGGACAGGCACGGCAGTTTATCCCCGCTGGCGCGGGGAACACGAATCTGGAGGCCAACAGCGCGGCGAAATCCTCAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP041973|2|2|2911645-2912284|CRISPRCasFinder ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT GGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAG CAGTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGG CAGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP041973|2|2|2911645-2912284|CRT ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT GGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAG CAGTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGG CAGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP041973|2|2|2911708-2912284|PILER-CR GTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG GTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG GTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAACG GTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCG GTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGG GTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGCCG GTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAGCA GTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGGCA GTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA GTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP041973.1|WP_001207998.1|2910748_2911546_-|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKARPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGKNFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >NZ_CP041973.1|WP_000108313.1|2910298_2910661_+|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >NZ_CP041973.1|WP_000210932.1|2908075_2909875_-|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGVVAVTPVPERKMPGVTLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLIATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAADGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLSRIDLAWSRDQKEKIYVQDKLREQGAELWCWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >NZ_CP041973.1|WP_001290670.1|2906363_2908076_-|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVAITDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDNNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYQENITEPDILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >NZ_CP041973.1|WP_000039870.1|2905527_2906262_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP041973.1|WP_001145541.1|2904486_2905440_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP041973.1|WP_159413259.1|2901379_2904043_-|CRISPR-associated-helicase-Cas3' MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATASAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP041973.1|WP_000368579.1|2899811_2901368_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP041973.1|WP_000117945.1|2899260_2899815_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP041973.1|WP_000206417.1|2898188_2899247_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP041973.1|WP_001199961.1|2912580_2913252_-|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP041973.1|WP_000036734.1|2913387_2914686_-|phosphopyruvate-hydratase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP041973.1|WP_000210863.1|2914768_2916406_-|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >NZ_CP041973.1|WP_000210451.1|2916633_2917434_-|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >NZ_CP041973.1|WP_000842512.1|2918041_2918629_+|fimbrial-protein MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >NZ_CP041973.1|WP_000981797.1|2918708_2921408_+|fimbrial-biogenesis-outer-membrane-usher-protein MMNNTWKSVLCPIACGVGMLLSLSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPSTFTPELLANALLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >NZ_CP041973.1|WP_001044459.1|2921420_2922194_+|fimbria/pilus-periplasmic-chaperone MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >NZ_CP041973.1|WP_000178270.1|2922213_2922720_+|fimbrial-protein MTMLTRWKMLVLLCGGFVTGTEAAGTKTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAGNKQLVPVGITDWLNFTLSGSNGPELEAVPVKEPTTQLAGGDFNASATLVVDYQ >NZ_CP041973.1|WP_000832393.1|2922734_2923205_+|fimbrial-protein MKRVLILTLLITQFACADNLTFHGKLINPPACTINNGEMLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTSVPELGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >NZ_CP041973.1|WP_001079646.1|2923201_2923738_+|fimbrial-protein MNRIFQTAGHLIGGVMLWAVCNTLPAATPNVHYSGKLVAGACNLVVDNDTMATVDFHTIGSDNFDASGQTTPVPFTLSLQDCKTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVETAAQQPVSINATVGTAFVLKEGITTINLQARLQKYAGEEVMPGEFSGSATVSFEYQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP041973_2 | 2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT | 2911918-2911950 | 33 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63435 | 4 | 0.879 |
NZ_CP041973_2 | 2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT | 2911918-2911950 | 33 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91355-91387 | 4 | 0.879 |
NZ_CP041973_2 | 2.14|2911918|35|NZ_CP041973|PILER-CR | 2911918-2911952 | 35 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63437 | 5 | 0.857 |
NZ_CP041973_2 | 2.14|2911918|35|NZ_CP041973|PILER-CR | 2911918-2911952 | 35 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91353-91387 | 5 | 0.857 |
NZ_CP041973_2 | 2.15|2911980|34|NZ_CP041973|PILER-CR | 2911980-2912013 | 34 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18967-19000 | 5 | 0.853 |
NZ_CP041973_2 | 2.15|2911980|34|NZ_CP041973|PILER-CR | 2911980-2912013 | 34 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25136-25169 | 5 | 0.853 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18969-19000 | 6 | 0.812 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25138-25169 | 6 | 0.812 |
NZ_CP041973_2 | 2.15|2911980|34|NZ_CP041973|PILER-CR | 2911980-2912013 | 34 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31716-31749 | 6 | 0.824 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31718-31749 | 7 | 0.781 |
NZ_CP041973_2 | 2.3|2911796|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911796-2911827 | 32 | MN694003 | Marine virus AFVG_250M677, complete genome | 17629-17660 | 8 | 0.75 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP053022 | Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence | 329022-329053 | 8 | 0.75 |
NZ_CP041973_2 | 2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT | 2912102-2912133 | 32 | MG592432 | Vibrio phage 1.050.O._10N.286.48.A6, partial genome | 21687-21718 | 8 | 0.75 |
NZ_CP041973_2 | 2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT | 2912102-2912133 | 32 | MG592431 | Vibrio phage 1.049.O._10N.286.54.B5, partial genome | 21426-21457 | 8 | 0.75 |
NZ_CP041973_2 | 2.12|2911796|34|NZ_CP041973|PILER-CR | 2911796-2911829 | 34 | MN694003 | Marine virus AFVG_250M677, complete genome | 17627-17660 | 8 | 0.765 |
NZ_CP041973_1 | 1.1|2895006|32|NZ_CP041973|CRISPRCasFinder,CRT | 2895006-2895037 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP041973_1 | 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT | 2895128-2895159 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP041973_1 | 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT | 2895128-2895159 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP041973_1 | 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT | 2895128-2895159 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP041973_1 | 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT | 2895128-2895159 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP041973_1 | 1.7|2895372|32|NZ_CP041973|CRISPRCasFinder,CRT | 2895372-2895403 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP041973_1 | 1.15|2895372|34|NZ_CP041973|PILER-CR | 2895372-2895405 | 34 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41416-41449 | 9 | 0.735 |
NZ_CP041973_2 | 2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT | 2911918-2911950 | 33 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143783 | 9 | 0.727 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP048340 | Escherichia coli strain 142 plasmid p142_C, complete sequence | 2410-2441 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_LR130559 | Escherichia coli strain MS14385 isolate MS14385 plasmid 5 | 41882-41913 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP020518 | Escherichia coli strain 222 plasmid unnamed2, complete sequence | 13450-13481 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP020497 | Escherichia coli strain 103 plasmid unnamed2, complete sequence | 37140-37171 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP040921 | Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence | 32060-32091 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | CP053252 | Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence | 19381-19412 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP042622 | Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence | 2614-2645 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_LT985302 | Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI | 11943-11974 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP028194 | Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence | 15383-15414 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP024865 | Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence | 22646-22677 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | AP019710 | Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome | 4361-4392 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP024829 | Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence | 2221-2252 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP009861 | Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence | 2868-2899 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | CP025877 | Escherichia coli strain 503458 plasmid p503458_49, complete sequence | 18343-18374 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP023368 | Escherichia coli strain 1428 plasmid p48, complete sequence | 4914-4945 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP032259 | Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence | 23402-23433 | 9 | 0.719 |
NZ_CP041973_2 | 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911980-2912011 | 32 | NZ_CP037450 | Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence | 15851-15882 | 9 | 0.719 |
NZ_CP041973_2 | 2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT | 2912102-2912133 | 32 | NC_047790 | Pseudoalteromonas phage C5a, complete genome | 34441-34472 | 9 | 0.719 |
NZ_CP041973_2 | 2.10|2912224|32|NZ_CP041973|CRISPRCasFinder,CRT | 2912224-2912255 | 32 | CP006879 | Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence | 405613-405644 | 9 | 0.719 |
NZ_CP041973_2 | 2.4|2911857|32|NZ_CP041973|CRISPRCasFinder,CRT | 2911857-2911888 | 32 | NZ_LR134399 | Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence | 103231-103262 | 10 | 0.688 |
NZ_CP041973_2 | 2.10|2912224|32|NZ_CP041973|CRISPRCasFinder,CRT | 2912224-2912255 | 32 | NZ_CP049244 | Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence | 699963-699994 | 10 | 0.688 |
NZ_CP041973_2 | 2.14|2911918|35|NZ_CP041973|PILER-CR | 2911918-2911952 | 35 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143785 | 11 | 0.686 |
1. spacer 2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 4, identity: 0.879
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatg Protospacer ************.***************. **
2. spacer 2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 4, identity: 0.879
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatg Protospacer ************.***************. **
3. spacer 2.14|2911918|35|NZ_CP041973|PILER-CR matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 5, identity: 0.857
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatgcc Protospacer ************.***************. ***
4. spacer 2.14|2911918|35|NZ_CP041973|PILER-CR matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 5, identity: 0.857
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatgcc Protospacer ************.***************. ***
5. spacer 2.15|2911980|34|NZ_CP041973|PILER-CR matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 5, identity: 0.853
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgcctttcccggagttccggccccttctcaaa---- Protospacer * ************************* ***
6. spacer 2.15|2911980|34|NZ_CP041973|PILER-CR matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.853
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgcctttcccggagttccggccccttctcaaa---- Protospacer * ************************* ***
7. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 6, identity: 0.812
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgcctttcccggagttccggccccttctca Protospacer * ************************* .
8. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgcctttcccggagttccggccccttctca Protospacer * ************************* .
9. spacer 2.15|2911980|34|NZ_CP041973|PILER-CR matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 6, identity: 0.824
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgccttttccggagttccggccccttctcaaa---- Protospacer * *******.***************** ***
10. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.781
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgccttttccggagttccggccccttctca Protospacer * *******.***************** .
11. spacer 2.3|2911796|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.75
cgattctacggcaacaggccaggctgcgaccg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcg Protospacer * .********** ***********.**
12. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP053022 (Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence) position: , mismatch: 8, identity: 0.75
gctgcctttcccggagttccggcccctaaatt--- CRISPR spacer tatgcctttcccggctttccggccc---aactgac Protospacer ************ ********* **.*
13. spacer 2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MG592432 (Vibrio phage 1.050.O._10N.286.48.A6, partial genome) position: , mismatch: 8, identity: 0.75
tgattattgacgacaacagcacagaccggcag CRISPR spacer ttataattgactacaacagcacagagcagatt Protospacer * ** ****** ************* *.*
14. spacer 2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MG592431 (Vibrio phage 1.049.O._10N.286.54.B5, partial genome) position: , mismatch: 8, identity: 0.75
tgattattgacgacaacagcacagaccggcag CRISPR spacer ttataattgactacaacagcacagagcagatt Protospacer * ** ****** ************* *.*
15. spacer 2.12|2911796|34|NZ_CP041973|PILER-CR matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.765
cgattctacggcaacaggccaggctgcgaccgcg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcgcg Protospacer * .********** ***********.****
16. spacer 1.1|2895006|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
tatttataagcgtgtcatctatgcaacccaac CRISPR spacer aatttataatcatgtcatctatgccataattc Protospacer ******** *.************ *. *
17. spacer 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
18. spacer 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
19. spacer 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
20. spacer 1.3|2895128|32|NZ_CP041973|CRISPRCasFinder,CRT matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
21. spacer 1.7|2895372|32|NZ_CP041973|CRISPRCasFinder,CRT matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
gagaatgctcatgcgcgtgagcgccatatatt CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgt Protospacer ..**** ********** ******** *
22. spacer 1.15|2895372|34|NZ_CP041973|PILER-CR matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.735
gagaatgctcatgcgcgtgagcgcca-tatattcg CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgttc- Protospacer ..**** ********** ******* *...***
23. spacer 2.5|2911918|33|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer ctttgcccgtgcggcggaagaccttggtgtttc Protospacer .. * **.*********** **********
24. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP048340 (Escherichia coli strain 142 plasmid p142_C, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
25. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_LR130559 (Escherichia coli strain MS14385 isolate MS14385 plasmid 5) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
26. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP020518 (Escherichia coli strain 222 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
27. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP020497 (Escherichia coli strain 103 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
28. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP040921 (Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
29. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to CP053252 (Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
30. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP042622 (Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
31. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_LT985302 (Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
32. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP028194 (Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
33. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP024865 (Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
34. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to AP019710 (Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
35. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP024829 (Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
36. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP009861 (Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
37. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to CP025877 (Escherichia coli strain 503458 plasmid p503458_49, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
38. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP023368 (Escherichia coli strain 1428 plasmid p48, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
39. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP032259 (Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
40. spacer 2.6|2911980|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP037450 (Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
41. spacer 2.8|2912102|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NC_047790 (Pseudoalteromonas phage C5a, complete genome) position: , mismatch: 9, identity: 0.719
tgattattgacgacaacagcacagaccggcag CRISPR spacer agcttattgacgaaaacggcacagacaccaaa Protospacer * ********** ***.******** *.
42. spacer 2.10|2912224|32|NZ_CP041973|CRISPRCasFinder,CRT matches to CP006879 (Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence) position: , mismatch: 9, identity: 0.719
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gaatctggagggcgacagcgcggtcgaccctg Protospacer *********** *.*********. .* .*.
43. spacer 2.4|2911857|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_LR134399 (Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence) position: , mismatch: 10, identity: 0.688
atcaaacatggaaacccctttaatgagagcaa CRISPR spacer ctaaaacatggaaaccactgtaatgacgaatc Protospacer * ************* ** ****** ..
44. spacer 2.10|2912224|32|NZ_CP041973|CRISPRCasFinder,CRT matches to NZ_CP049244 (Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gtggtcataggccatcagcgcggcgatatccc Protospacer * . ... ****** *********** ****.
45. spacer 2.14|2911918|35|NZ_CP041973|PILER-CR matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.686
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer ctttgcccgtgcggcggaagaccttggtgtttctc Protospacer .. * **.*********** ********** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
476943 : 482996
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP041973|476943:482996|DBSCAN-SWA CATATACTTTTTGTGGGACTGCTTCAACCTTTGGGCAGATATCGGAAATGAAAAAGATAGACCAGGCAACTATTCGCTATCAGAATATCCGGTACACCAACTACCAACTACCAACTACCAACTACCAACAAATCATTTAGTCGATGGTCTCGTTGCCATTGGTTCATAGAGTGTTGGTCTTGGTATGGATGGCTGGGGAAGTTATGTATCGAACATTCTTATGCAAGATTGCGCAGGGTCTGGTGATCTATGGTACACATATGGGAAGGCATTCACATATATTTCTGTAATCGATACTAAAACTTTAACACTAACTAATTGTTTGTAGAAAGTGGTTGCATTATTAATGGTTTGAGACTCATTGACATAAAACAAACACCATCTGGTAATCTGTCAGACCCCGCATCCTTAATAGTTAACTATAAAGATTGTATGGTAGTTGAGATGTCGTTAGTTCTAATATCGTGTCAGATAAATTTATAAAAGATTACTCATGTTTTGTGTCACATGCAAAAATAATTCGGCTTCGTTAAGGTCTTTAGGGGAAATACCTAATGGATAATTAGTTAGATTAACGTTAACAACACTTTGAACGTGTAATGAATATGGGGGTAAAATATAAGTATTGGGAGATTGTAATTAAAAATTATGTAATTGTCTGATTATTATATATTCACTCCAGCAAAGGAGAAAGGCAATTATGGACGAAAAGAAACTCACAGCTCTTGCGGCTGAACTGGCTAAAGGTCTTAAAACTGAAGCCGACCTCAGTCAGTTTTCCCATATGCTGACGACATTAACCGTCGAAACGGCGCTCAATGCTGAGCTGGCTGACCATCTCGGGCATGAGAAAAATGCTCCTAAAACGAGCTCAAACTCCCACAATGGCTACTCATTAAAAACGGTGCTGTATGATGGCGGCGAGATAGTGCTGAACATGCTGCGTGACCGTGAAAATACCTTTGAACTGCAGCTGATTAAGGAGCATCAGACGCGTATTACGAGATGGACAGTCAGATTTATCCCTTTATGCCAAAGGCATGACATGACTACCCGCGAAATCATCGCCACCTTCAAAGAGATGTACAACGCTGATGTGTCGCCCATGCTGATACTTAAGATCACCGCCGCAGTAAAAGAGTTGGTCAAAGGGGCGTCAAAAAATGGAGTATGCCGATCCAGGACTGGTCGCTGGCAATGAGTCGCTTTATTATCGAGTTCGGTGACCGCCTGAGCGATCACCGTTAATATAGTGGCAGTTACACAGTATGACTGGCAGGTTCTATATTGGATTTATATGTAATAGTAACAGCCTATCTATTTGATTATTTTATTTCCGATTTTATTAAAAAGGAAATCAGAACCAGGCTTTGTTAAATGTCCCCAATCAACGGCAGTGATAAAATCAGGACCATTACCAACTCTTGTAAGACATCCACTTTCGTTACATAATGCTTTGTATGCTGATATATATTCAATTCCCATTTTTGGAACATTGTTACTAAAGTAAGAGTCCCACTCGCTTATTTCACTATTTAATCCATATGTCATATACAATGGTGGGGTTTTTTTAAACTCACTCAGGTAGTTAGATATTATTTTAACTAAATTTGCATTCCATTCCGGGACTGGTCCAATGAAAATAATCCTTGAGTCAGGGGATGCCTCTTTAATTTTTTTTATGGTTAATGATAACGTATCAATTGCTAACTTTTTATCATGTACTCCATTTGTTCCTCGAACTGACCATGTCAGCAGAACCACCTCAGGCTGAACACGTTTAATTTCATTAATTCTATTATTGTTTAGAGTGATGACACTTCTCTGTAAATCATCTTTACCGTCAACAAATAGAGGAGGAGCGTTACCATCTGTCATTTGGCTTATTATATAATCAGAACCTTTATTATCTATATAATGAGAAAGTCCATTGAAAAGAGCCGCCGCATAAGAATCACCAATGATAAATATATTATGCTTGCCATTTTTTATACATCCATTGGATATGGCAGCAGTAAGTTGTACTGAGTGGCATATCCCTCCACGGAGTAGTTCTCCATATTTATAATAATTGTACACGTCAGTAACAGAAGCATATTCACCTGCTGATTTATTGATTTCCCTGTCTTTAACTCCATTTATATGAAAAATAAATGCGCCAATTAAACCTGTCCCAAATACTGATAATGCTAATAATATTGCTGTGATATATTTATTTCTGGCATTTCTCAGTGGTTTTTCAATTAAATAATAAGTTAATATCGCCAAAAAGAACGATGATAATAAAAGAAGAATTAATTCATGGTAGTCTGGTGAACCAGCAAATATTGAACGATAGAATGAATAAATAGGCCAATGCCACAAATAAAGAGGATAGCTAATAAGACCAAAGAAAACAACAGGCCTAACACTAAGCAATTTCGACACAACTAAATCATTACCATTAGATGCTATTATAAGAGAGGCGCCAAGTACTGGGATTATTGCTATATATCCAGGAAATGACATCTTTTCATCTATCATGGTTATTGATAATGCGATTAGTATAATTCCTAACAGGGACATTAATTTTGATAACGAAGTGTTTATTCCTATAAATCTCAATGTGGATATAATCGCTCCAGCCATTAACTCCCAAAATCTTGATGCGGGAGAGTAGTAATTAGCTCCGCCATCAGATGCCATTGTAAAAATGCTAATCGCATAGCTAATTATAAATATAGTTGCGCATGATAATACTATGTTTCTGTTATGGTTTTTGCTTCTAAAGCATAGCAATATAACTACTGGCCATATTATATAAAATTGCTCTTCAATTCCCAGTGACCACAAATGTAGTAAAGGTTTAAGGTATGACTTTGAATCAAAATAGCCAGACTCACTCCAAAGAGTAAAGTTTGATATAAAGAATGAGCCGCTAAAAACATGCTTACCAAGTAATTTGTAATCATCCTGGAATAAATAAACCCAGCCAACAATAAGACATGATACAAGAACTATGGATAATGCTGGAAATATTCTAAGCACTCTCCTTTTATAGAAATCAAGGTATGAAAATGATTTGTTTGATGCAGATTTTAATATTATTGATGTTATAAGGTATCCAGATATCACAAAGAATATATCTACTCCAACAAACCCACCCGGCAATAATGATGGGAAATAATGGAATATTACCACAGATAAAACCGCTATTGCGCGTAATCCATCTATATCAGGTCTGTATTTTAAGTGTTCCAACTTAAATTACCTCAATTTTAAAAAAAGATTAATAAAATGGTTGTGCATCTTGCATCATTCCCGAAGTTTCGTGTAAACGAAAACGGAATGACGAGTGGATCAGATACGGCTATTATTTTAATTATTGACTCTGTCACATCTTTACTTCCGTCATTAATAAAAATAATCTCAACTTTATATTTTTCAAGTTCATTAAACTCATGTACCGTTCTATAGAAAATCGGTATCGTGTCTTCTTCGTTAAAAACTAGGACGACAAGAGAGATTTTCATTTTTATCCCTGAAGATAAAGAATCTGGAATAGATAAAGCCGCATACCAGGCTAATTGCCGAGAAAGTGATGAGGGTAACCAATGGTGGCAAAGAACATTGGTCAGCCATCCATCCAACAACAGCGCTCAGTGCTCCCATGAATCCCATATACATCATGTAGCGAATTGCAGTAGTGCTGGCATTAAAGGTGAAGCGCGCATTAGCATAGAAGCTGAACGATACAGCGATAACAAAATCGGAAAAGTTCGTCAGCGCCTGATGCGTATGCATCCCATACATACAGAAAGCAAATACTCTCCAATGAATGAGCATGTTAAGAACACCGATCGATGTGTACTTAGCGAATAACTTCAACATTATGAAAATTATCAGATTCAGAAAGGTCTGGAGTGTAGCACTACAAATTGGTTTGATCGATATAAGCGATCAATAATTGTATTTTTAATAGTTTTAAACTATTGAGTTTTAATATATTGATCGATGTTATCGATCAATTGGTATTGCTGATTGCCAAGCGTCTTGGAATAAAAACGGGACATGTAAAGCTTTGCATCGTCTTACAAGGCTTTGCATTTTTTTTCAGGGAGAGGTACTTGAAAGGGTGGAAGTGCTGGGGGGAGGGGGAGCGTTAAAAATTCTGTATAATTTTAGTAACATAAAATAAAAAAGAATGGCACATGTCCCATCCCTTCGATTTCGACAAAGCACTTAAAGCCCTTCAGGTCCGGCCAGGCATTAACGGGCAAAGATGGCATCTTAACGCCATTAATCAAGTATTTAACCGAGTCTACCCTGTCTGCTGAACTTGATTCCCATCTGGCTCAGGATGTTGAGGCAAACCGTAAAAATGGTTCCGGCAAAAAGCCATTAAAGCCCCAACAGGCAGTTTTGAACTGGCAACTCCGCGCAATCGTAACGGCACTTTTGAGCCATAACTGGTGAAGAAGCTTCAGACCCCCTGTACGACGAGATCGAGCGCAATATCATTCGACTGTTTGCGCTGGAGATGAGTTATCAGGACATCGGCCGGGAGAGTGAAGATCTTTATGCCTTCAGCGTTTCAACCGCCACCGTCAGTGCAGTACCGATAAAGTTATCCCTGAACTAAAACAGTGGCAACAGCGCCCGCTGGAGAAGGTTTATCCCGTCGTCTGGCTGGACGCTATTCATTATAAAAACCGTGAGGATGGCCGTTATCAGAGCAAGGCGGTTTATACCGTTCTGGCACTGAATCTGGAAGGCAAAAAAGAAGTTCTGGGCCTATATCTGTCGGAAAGTGAAGGTGCTAACTTTTGGTTAAGTTCTAACAGCGAGAGGGTACTTTAAAGGGATGCTTTTCGTTATGTTTATAGGCACTATTCGCTGGAAATCATAAGACATCAAAAACGCTGCAACGCCTTGTGTGGTGTGGGGTTGCTGAGATTTGTGAGAGGTGGGTAAAAGAGGTCATGGTGTCCCCTGCAGGAATCGAACCTGCAACTAGCCCTTAGGAGGGGCTCGTTATATCCATTTAACTAAGGGGACAACGCGGCGCCAGTATAGCGTTTTTTATTCGCCGGAGTAAGTGTAGCGCCGCCTGACTGGTTAAACCGTCGCCACTCAGCGCTGTTTTTCCGCTTTTTTCCGCTCCCGTTCCAGGCGCTCGCCGCGTAGCCTCGCTTCTTCCTTACGCTTATTGCTCATATCGTTGCGGATCTGCGCGTGGCTCATCAATGCGAAAATAAAAGTGCCGCCGCAGATATTTCCGGCAAGTGTGGGAAGGGCGAAGGGCCAGAGAAAGTCGCTCCAGGGCAGCGTGCCGTTGAAAACCAAATACAAAATTTCAACGGAACCGACGACAATATGGGTGGTATCGCCCAGCGCGATAAGCCAGGTCATCAAAATAATGACCACAATCTTTGCCCCGCCTGCTGCAGGAAACATCCATACCATTGTGGCGATGATCCAGCCAGAGATAATCGCGTTGGCAAACATCTCCGTTGGGCTATTTTTCATGACCTCCATACCAATTTTGACAAAGGCGTCGCGGGTCTCTTCATCAAATATAGGCATATATTCAAATGCCCACGCCGCAACCCCGGTGCCAATAAGGTTGCCCAATAAGACTACGCCCCACAAGCGCATCAGCAGGCCAACGTTACTCAGAGTGGGATTTTGCATTACCGGCAACACGGCGGTAACGGTATTTTCAGTAAATAATTGCTGGCGGGCCATGATGACAATGATAAAACCAAAGGTATAGCCGAGATTTTCCAGTAAAAAGCCGCCGGGAACGCCTTCAAGCTGCACGTGGAAAATCCCTTTCGCCAGGAGTGATGCCCCCATAGAAAGTCCTGCGGCAATGGCTGACCAGAGCAAAGCCATCGCATCGCGTTCCATCTCTTTTTCACCATCCTGGCGAATATGTTCATGAATCGCCATGGCGCGGGAAGGAAGACGATCTTCATCCACTTCAATCTCTTTACCGCTTTGTTTTTCTTCGCTTTCAACTTCCAGGTCACTGCTTTGCCGGTTAATTTTATCGTCGTTAAGGCTATCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP041973|476943:482996|482054_482996_-|WP_000377779.1|DBSCAN-SWA MDSLNDDKINRQSSDLEVESEEKQSGKEIEVDEDRLPSRAMAIHEHIRQDGEKEMERDAMALLWSAIAAGLSMGASLLAKGIFHVQLEGVPGGFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQNPTLSNVGLLMRLWGVVLLGNLIGTGVAAWAFEYMPIFDEETRDAFVKIGMEVMKNSPTEMFANAIISGWIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLPWSDFLWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAEKQR >NZ_CP041973|476943:482996|480422_480812_-|WP_001576268.1|DBSCAN-SWA MLKLFAKYTSIGVLNMLIHWRVFAFCMYGMHTHQALTNFSDFVIAVSFSFYANARFTFNASTTAIRYMMYMGFMGALSAVVGWMADQCSLPPLVTLITFSAISLVCGFIYSRFFIFRDKNENLSCRPSF >NZ_CP041973|476943:482996|480199_480454_-|WP_000703599.1|DBSCAN-SWA MKISLVVLVFNEEDTIPIFYRTVHEFNELEKYKVEIIFINDGSKDVTESIIKIIAVSDPLVIPFSFTRNFGNDARCTTILLIFF >NZ_CP041973|476943:482996|476943_477111_+|WP_105789229.1|DBSCAN-SWA MYFLWDCFNLWADIGNEKDRPGNYSLSEYPVHQLPTTNYQLPTNHLVDGLVAIGS >NZ_CP041973|476943:482996|478259_480182_-|WP_000400616.1|DBSCAN-SWA MEHLKYRPDIDGLRAIAVLSVVIFHYFPSLLPGGFVGVDIFFVISGYLITSIILKSASNKSFSYLDFYKRRVLRIFPALSIVLVSCLIVGWVYLFQDDYKLLGKHVFSGSFFISNFTLWSESGYFDSKSYLKPLLHLWSLGIEEQFYIIWPVVILLCFRSKNHNRNIVLSCATIFIISYAISIFTMASDGGANYYSPASRFWELMAGAIISTLRFIGINTSLSKLMSLLGIILIALSITMIDEKMSFPGYIAIIPVLGASLIIASNGNDLVVSKLLSVRPVVFFGLISYPLYLWHWPIYSFYRSIFAGSPDYHELILLLLSSFFLAILTYYLIEKPLRNARNKYITAILLALSVFGTGLIGAFIFHINGVKDREINKSAGEYASVTDVYNYYKYGELLRGGICHSVQLTAAISNGCIKNGKHNIFIIGDSYAAALFNGLSHYIDNKGSDYIISQMTDGNAPPLFVDGKDDLQRSVITLNNNRINEIKRVQPEVVLLTWSVRGTNGVHDKKLAIDTLSLTIKKIKEASPDSRIIFIGPVPEWNANLVKIISNYLSEFKKTPPLYMTYGLNSEISEWDSYFSNNVPKMGIEYISAYKALCNESGCLTRVGNGPDFITAVDWGHLTKPGSDFLFNKIGNKIIK >NZ_CP041973|476943:482996|477126_477270_+|WP_105789228.1|DBSCAN-SWA MDGWGSYVSNILMQDCAGSGDLWYTYGKAFTYISVIDTKTLTLTNCL |
6 | Salmonella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
719420 : 728591
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP041973|719420:728591|DBSCAN-SWA GATGATTGAATTTAACCATGTCAGTAAAACCTTCGGCGATCAACAGGCTGTTAGCGACCTCAATTTGCACTTTAGCGAAGGCAGCTTTTCGGTGTTAATTGGCACCTCCGGTTCGGGAAAATCGACCACTCTGAAGATGATTAACCGGCTGGTAGAGCATGATAGCGGAACGATCCGTTTTGCCGGGGAAGAGATCCGCAGCCTGCCGGTGCTTGAACTGCGCCGTCGCATGGGCTATGCCATTCAGTCTATCGGTCTTTTTCCCCACTGGACGGTGGCGCAAAATATCGCCACCGTACCGCAACTACAAAAGTGGTCGCGTGCGCGGATTAACGATCGTATTGATGAACTGATGGCATTATTGGGTCTGGAAAGCGCGCTGCGCGATCGTTATCCGCATCAGCTTTCCGGCGGGCAACAGCAGCGGGTCGGCGTTGCGCGGGCGCTGGCTGCCGATCCGCAGGTATTGCTGATGGACGAGCCTTTCGGCGCGCTTGATCCGGTAACGCGCGGCGCATTGCAGCAGGAGATGACCCGCATTCATCAGCTACTGGGGCGCACCATCGTACTGGTGACGCACGACATCGACGAGGCGCTACGCCTCGCCGACCATCTGGTGCTGATGGACGGGGGCAATGTTATCCAACAGGGATCGCCACTTTCTATGCTGACCTCGCCGGAAAATGATTTCGTGCAGGCGTTTTTTGGCCGCAGCGAGCTGGGCGTAAGGCTGCTTTCGTTACGTAGCGTAGGCGATTATGTACGTCGGCATGAACAGCTCAGCGGCGACGCGCTGGTGGAAGAGATGACGCTACGCGATGCGCTATCGATGTTTGTCGCCCGTCGGTGCGACGTCCTGCCGGTGGCGAATCAGCAGGGCGAGCCCTGCGGTACGCTCCATTTCCGCGATCTACTTTCGGAGACGTCCCCCCGTGAAACGACTGTGTGATCCGCTTCTCTGGCTTATTGTTCTGTTCTTGCTTCTGCTGTTTGGATTGCCTTATAGCCAGCCGTTCTTCGCCGCGCTGTTTCCCGATTTACCGCGCCCGGTCTACCAACAGGAGAGTTTTGCCGCCCTCGCGCTCGCCCATTTCTGGTTGGTGGGCATCTCAAGTCTGTTTGCCGTCGTGGTGGGCGTCGGCGCAGGGATTGCGGTCACGCGAGAAAGTGGGAAGGAGTTTCGTCCCCTGGTGGAGACTATCGCCGCCGTCGGGCAGACCTTTCCCCCGGTCGCGGTACTGGCGATCGCGGTACCCGTCATGGGTTTTGGTCAGCAACCAGCCATTATCGCCTTGATCCTGTATGGAGTGTTGCCCATCCTGCAGGCGACCCTGGCCGGGCTGGGCGCGGTGCCTGCCAGCGTGATGAGCGTTGCCAGCGGTATGGGAATGAGCCGTCGCCAACAGTTGTATCAGGTTGAACTGCCGCTGGCCGCGCCGGTGATTCTGGCGGGCATCCGAACCTCGGTGATTATCAATATTGGTACGGCGACCATCGCTTCAACAGTGGGGGCCAGTACGTTAGGCACGCCGATCATTATCGGGCTTAGCGGCTTTAATACGGCCTATGTTATCCAGGGGGCGCTGCTGGTGGCGCTGGCGGCGATCATTATCGATCGCCTGTTTGAAAGGCTGACGCGCGCGCTTACCCGGCACGCAAAATAAAACTGTAACCTGCCAGCATCACGCCGCCGATACCGCCAATAGCCATCAGCAGGAAAAGGGCGATCACCCCGATTTTCGCTACGCGCATTATGTACTCCTTATGTTAATAAAAGGAGTATACATTAAAGCGAATTTGTTAGCTGCTGTTTAAACGCCAAGGGGATGAATGTCGCGTCCCTGGGCGCGCCATGCCAGGAGTTGCTGCTGCTGCGCCAGCGTCTGGTTTTCTCCGCACCATACCAGTAACGTCTTGCCGTCAAACAGTTCCGGGCGGAACTGGCTAAGCGAATGCGCCAGCACGTCGATTCGCCATCCCTGTTGGCTGGCGACCCAACCTTCCAGCCACAGGCGGGTGGTATCATGGATATTCCAGCCGATCACCAACGCATCTTTTCCCTGTTTCTTACGCGCAGACGCCAGGCAGAGCGCAATATAGTTGATCAGGATACCGTCAAGAATGCCGAGCAGCGCCTGAAGGGCAGGTTGTTGGCACTGTAATCGTCGCCGCAGCGGGACGAACAGGTTAGTGGTCAATGTTTGGGCTGGATAATCCTGACAGCGTTCTTTGACCCATAACCGTAAACTGTGCAGATTACTGCTTTGCAGGTAGTGCAGCAGGATCTCCTGCTGTTCGCGCCAGCCGTTAGGTTGTTCGCTACTGTCGCTACTGAGCAGCACTTTGACTTTGCTGACCTGGACGCCGTTATTTATCCAGCGCTTGATTTCGCGGATTCTGTCGATATCGGCATCGTTAAACAGACGATGACCGCCATCCGTTCGCTGTGGTTTTAAAAGTCCATAACGTCTCTGCCACGCGCGCAACGTGACAGGATTGATATCACAAAGCAAAGCCACTTCACCAATTGTGTAAAGCGCCATCGTTTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCTCTTTTACGAACCAGGAAGTTTTGCCTGTTTTTTATGCATTAAAACGCGAAGTAGCGGGTTGCGGCGGGGTGTTTAAGTGATCGTATTCACGAATTCATATTTTTATGCAACAGTTCAAAGAAAGTTAATCGTACTCAATGTATGTTACGCGCTTTTAATTGAAGTGTGGTTTGCGGGTATGTACGAGTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTGTTTCTGGTCATTGCGTGGCTAATGAGTAAAACGCGTCTGTTCATCCCGCTTATGCAGGTCACGGTTCGTCTGCCGCACAAGCTTCTGTGTTACGTCACGTTTTCTATCTTCTGCATCATGGGCACTTATTTTGGGCTGCATATCGAAGATTCGATTGCCAATACCCGCGCAATTGGCGCGGTGATGGGCGGCCTACTCGGCGGGCCGGTCGTCGGCGGGCTGGTCGGTCTGACCGGTGGGTTACATCGGTATTCTATGGGCGGCATGACGGCGCTGAGCTGTATGATTTCCACCATCGTCGAAGGGCTGCTGGGCGGGTTGGTACACAGCGTTCTCATACGTCGCGGACGCCCGGACAAAGTGTTTAGCCCGCTGACGGCGGGAGCAATTACGTGTATTGCCGAACTGGTGCAGATGCTGATCATTTTACTGATAGCCAGGCCGTTTGACGATGCCTTGCATCTGGTCAGTAATATTGCCGCGCCGATGATGGTGACGAATACCGTTGGCGCCGCGCTGTTCATGCGTATTTTGCTCGATAAGCGCGCCATGTTCGAAAAATATACTTCGGCATTTTCTGCTACCGCGCTGAAGGTCGCCGCGTCAACGGAGGGGATTCTGCGTCAGGGATTTAACGAAGTGAACAGTATGAAGGTGGCGCAGGTGTTATATCAGGAGCTGGATATTGGCGCCGTCGCCATCACCGATCGCGAAAAACTGCTGGCTTTTACTGGTATTGGCGACGATCACCATCTACCGGGCAAACCCATTTCATCAGGTTATACGCTGAAAGCAATTGAAACCGGAGAGGTGGTTTATGCCGATGGCAACGAAGTGCCGTATCGCTGTTCGCTACACCCGCAGTGTAAACTCGGCTCGACGCTGGTGATCCCGCTGCGTGGCGAAAATCAGCGAGTCATGGGCACCATTAAATTGTACGAAGCGAAAAACCGGCTGTTTAGCTCAATTAACCGCACCCTGGGAGAGGGTATTGCGCAGCTTTTATCCGCGCAGATCCTGGCCGGGCAGTATGAACGGCAGAAGGCGTTGCTGACGCAGTCAGAGATCAAGCTGTTGCACGCGCAGGTGAACCCGCATTTTCTGTTTAACGCGCTCAATACCATTAAAGCGGTGATTCGCCGCGACAGCGAACAGGCCAGCCAACTGGTGCAGTACTTGTCGACCTTTTTTCGCAAAAATTTAAAACGCCCGTCGGAAATCGTCACGCTGGCGGATGAAATTGAACACGTAAACGCTTATCTGCAAATTGAAAAAGCGCGTTTTCAGTCGCGTCTGCAGGTACAGCTTGATGTTCCATCGACGCTTTCACGTCAGAAATTGCCTGCGTTTACATTACAGCCGATTGTTGAGAACGCCATTAAACATGGCACGTCGCAACTGCTTGATACCGGCAACGTCGCTATTCGCGCCCGGCGCGAAGGGCAGCATTTGATGTTAGATATTGAGGATAATGCGGGACTGTATCAGCCTTCCGCCGGCAGTAGCGGGCTGGGGATGAGTCTGGTTGATAAACGTCTGCGCGAACACTTTGGCGATGATTATGGTATTAGCGTGGCCTGCGAGCCGGACTGTTTTACCCGAATTACATTACGACTTCCACTGGAGGAGGACGCATGATTAAAGTGCTGATTGTGGATGATGAGCCGTTAGCGCGGGAAAATCTGCGGATTTTGCTCCAGGGGCAGGATGACATTGAGATTGTGGGAGAGTGCGCGAACGCGGTAGAAGCGATTGGCGCGGTACATAAGTTGCGACCTGATGTGCTGTTTCTGGATATTCAGATGCCGCGTATCAGTGGACTGGAGATGGTAGGAATGCTTGATCCGGAACACCGCCCGTATATCGTTTTTTTAACCGCGTTTGACGAATACGCCATCAAAGCCTTTGAAGAACACGCTTTTGATTATCTGCTCAAGCCGATAGAGGAGAAACGGCTGGAAAAAACGTTACATCGTCTGCGTCAGGAGCGCAGTAAACAGGATGTTTCGTTGTTGCCGGAAAACCAGCAGGCGCTTAAATTCATTCCCTGTACCGGACACAGCCGGATCTATTTGTTGCAAATGGATGATGTCGCCTTTGTCAGTAGCCGTATGAGCGGCGTTTATGTGACCAGCAGTGAAGGGAAAGAGGGGTTTACCGAGCTGACGCTGCGCACGCTGGAAAGCCGGACGCCGCTACTGCGTTGTCATCGTCAGTTTCTGGTGAATATGGCCCATTTGCAGGAAATTCGGCTGGAGGATAATGGGCAGGCAGAGCTGATTTTACGCAACGGCCTGACGGTGCCGGTAAGCCGTCGCTATCTGAAAAGTTTAAAAGAGGCGATTGGCCTGTAAAAGACTGTTAGAATATCGTTTTGCCATAGAAACGACCGAAGGCCTCATGCTGAGTAACGATATTCTGCGTAGCGTGCGCTACATTTTAAAAGCTAATAATACCGATCTGGCGCGTATCCTGGCGCTGGGTAACGTTGATGCTACGCCGGAGCAGATAGCAATCTGGTTGCGCAAAGAAGAGGAAGAGGGGTTTCAGCGTTGCCCGGATATCGTGTTGTCCTCATTTCTCAATGGCCTCATTTATGAAAAACGCGGCAAAGATGAGGCGGCGCCTGCATTGACGGCGGAACGTCGTATCAACAACAATATTGTGCTGAAAAAGCTGCGTATTGCCTTTTCGCTAAAAACAGATGATATCCTGGCGATACTTACCGGTCAGTTGTTTCGTGTCTCAATGTCAGAGATCACCGCGATGATGCGCGCGCCGGACCATAAGAACTTCCGCGAATGCGGCGATCAGTTTATGCGTTATTTTCTGCGCGGTCTGGCGGCCCGTGAACACGCGGCGAAGTAATTCTGCGGTATTGTTCCCGGCAGCGTCCTGTCTGACCGGGAAAACGCATTATTATACTAATTGATTCTATGATACCCGCTCTCTTCCAACAGTTTCTGCGAGCGAATCATTGACAGATAGTACGCGGAACAGTTGTCAATTGATGATCCTGGCAATTTACAGAGGTCGCTTATTTTTGCCTGGGTAAAATCAATATCCACATATTCCGTAGCATAGCTATCATAATAGTCGATTCGTTCAGTCAAACCCGGCATACCCTGATAAGCTTCGCCGACTTGACTCAGCATTTTTTGTGCTTCTTCTTTATTATTGGCTTTCAGGGTCTTATAAAGTAGTTTATGTTCAGATATTTGACGTAAAACGATATCCCCTTTGTAGTAATAGGTTAACTTGATTTCAATACCGTTGAGATTACCTACATAGCGTTGTGTTTCCTCTGACTCTTTGCTAGCGGCTATCTTCTTGATAAACGCTGTCATGTTATTTTGCTTTCCCTGGAGAGTATCGTTTTTCTGATCGCAGCCAGTTATGCTAACCGATAGAGAGAGCGCGAATAGTGGCAGTGCCATAAGACGTAGAACCTGCATAACAATTCCTTGTCGTTAAGTATTGGTGTGGCCAGGAATTCAGGGATTATAGGCTTTGGCGAGGGGACTTACAGCGAGGCTGTCTTTTTTCGGAATTCATAAAGAAAAGACGCTGCCGAAGCAGCGCCCTGAGCGACTTTACCAGTCGATGCAATACATTATGCCTGCCAGTTATTTCGCTTCTTTAAAACCAGCAGCTTCCAGCAGCGTCTGGGTTTGTTTCATGCTGATACCTTTGCTGGTATCGCCGGACACCATCGTCCCTGAGATTTGCTGTAACGCTTTAAAGTCCACTTTTTCCATATCCACAGAGACGTTTTCCTGGGCATAGGTATCTTCATAGGTTAATTTTTCTTCCACTCCGGCGATATTTTTATATTTCGCGCTCAGCGGATCGAGAATTTTGGCGGCATCTTCTTTCGTTTTAGCGCCTACAGTGGCATAGCTGATTTTACTTTCAGACGTCTGCTTAATGATTTTGTCACCTTTATAGGTGTAAGTAATTGAAATTTCTGTCCCCGCCAGGTTTGCGTTAAAGGTCTTTGATTCTTCTTTATCGCCACAGCCAGCAAGAGAGAACACCAGTACGGAAGCCAAAGCCGTGGACAATAACTTGCCAGAAATTTTCATCTAAAACTCCATTTTATATAATAATTGGGCTTTTAAAATAATTTCAATGAATTAATTTAACCCAGTAATAGCAATGTATCAGGGAGAGATAGAATATGACTTTTAGCCGTTATTTAGCAGTCCGGATATGGAGTCTTAGCGCTATTGCTTATTAAGGAAAAAGTTAAAACGTGCGGAGGAGGCGATATGCCAGTCAGGATTAAGCGGTTAAAAAAGCCGGAGCATGCTCCGGCTTGTTGCTTATTTCACCTGTTGGCCAGGCTTCGCGCCGTCATCAGGGCTTAACAGGAAGATATCTTTCCCGCCAGGGCCTGCGGCCATCACCATTCCCTCGGAGACGCCAAAGCGCATTTTGCGCGGCGCGAGGTTGGCGACCATTACCGTCTGGCGGCCGATCAGCGCCTGCGGGTCCGGGTAGGCGGAACGAATGCCGGAGAAGACGTTACGCTTCTCGCCGCCCAGATCCAGCGTCAGACGCAGCAATTTGTCGGAGCCTTCCACGAACTCAGCGTTTTCAATCAATGCTACGCGCAGGTCAATTTTGGCGAAATCGTCAAAGGTGATGGTTTCCTGAATCGGGAAGTCGGCTAACGGGCCGGTAACCGGTGCGGCTGCGGCTTTCACCTCTTCTTTAGACGCTTCAACCAGCGCTTCAACTTGCTTCATGTCGATGCGATTGTAGAGCGCCTTAAAGGTGTTGACCTTGTGACTGAGCAGCGGCTGTTCGATGGCATCCCAGTTCAGTTCGCTGTTCAGGAAGGCTTCAACGCGTTCAGAAAGCGTCGGCAGTACCGGTTTCAGATACGTCATCAGCACGCGGAACAGGTTGATGCCCATCGAGCAAATGGCCTGCAGGTCAGCGTCGCGGCCTTCCTGTTTAGCCACCACCCACGGCGCTTGCTCGTCAACATAACGGTTAGCGACGTCGGCCAGCGCCATAATCTCACGGATAGCTTTGCCGAATTCACGGCTTTCCCATGCTTCGCCAATCACCGCAGCGGCGTCAGTAAAGGTTTTGTACAATTGCGGATCGGCCAGTTCAGCCGCCAGCACGCCGTCGAAACGCTTATTGATAAAACCGGCGTTACGGGATGCCAGGTTGACTACTTTATTGACGATATCGGCATTGACGCGCTGGACAAAGTCTTCCAGGTTCAGGTCGATGTCATCAATGCGTGAAGAAAGCTTCGCGGTGTAGTAGTAGCGCAGGCTGTCGGCGTCAAAGTGTTTCAGCCAGGTGCTGGCCTTAATAAAGGTGCCGCGAGACTTAGACATCTTCGCGCCGTTCACCGTCACGTAACCGTGAACGAACAGGTTGGTCGGCTTACGGAAGTGGCTGCCTTCCAGCATGGCAGGCCAGAACAGGCTGTGGAAATAGACGATGTCTTTGCCGATAAAGTGATACAGCTCGGCGTCGGAGTCTTTTTTCCAGTACTCATCAAAACTGGTCGTGTCACCGCGCTTATCGCACAGATTTTTGAAGGAGCCCATATAGCCAATCGGCGCGTCCAGCCAGACGTAGAAATATTTGCCCGGCGCGTTCGGGATTTCGAAACCAAAATACGGCGCGTCGCGGGAAATGTCCCACTGTTGCAGGCCGGATTCAAACCACTCCTGCATTTTGTTCGCCACCTGCTCCTGCAGCGCGCCGCTGCGGGTCCACGCCTGCAGCATTTCGCTGAATGACGGCAGATCAAAGAAAAAGTGCTCGGAGTCACGCATTACCGGCGTCGCGCCGGACACCACGGATTTCGGCTCGATAAGTTCGGTCGGGCTGTAAGTTGCGCCGCAGACTTCACAGTTATCGCCGTACTGGTCCGCGGATTTACATTTCGGGCAGGTGCCTTTCACAAATCGGTCCGGCAGGAACATGCCTTTTTCCGGATCGTAGAGTTGAGAGATAGTGCGGTTCTTAATAAAACCGTTCTCTTTCAGGCGCGTATAAATCAGCTCGGACAGCTCGCGATTCTCGTCGCTGTGCGTTGAGTGGTAGTTGTCGTAGCTAATATTAAAACCGGCGAAATCGGTCTGGTGCTCCTGGCTCATTTCACCGATCATTTGCTCCGGCGTAATACCAAGCTGCTGCGCTTTCAGCATGATCGGCGTGCCATGAGCGTCATCGGCACAGATGAAGTTAACCTCATGGCCGCGCATTCGCTGGTAACGGACCCAGACATCAGCCTGGATGTGCTCCAGCATATGGCCGAGGTGGATAGAGCCGTTGGCGTACGGCAGCGCGCACGTTACCAGAATTTTCTTCGCGACTTGAGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP041973|719420:728591|726557_728591_-|WP_000195340.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADVANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLSHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVEGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >NZ_CP041973|719420:728591|721230_721962_-|WP_001240420.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWINNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERCQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >NZ_CP041973|719420:728591|723866_724586_+|WP_000598637.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP041973|719420:728591|721063_721171_-|WP_001261696.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >NZ_CP041973|719420:728591|725156_725687_-|WP_001197951.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >NZ_CP041973|719420:728591|719420_720368_+|WP_000569168.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGNVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >NZ_CP041973|719420:728591|722184_723870_+|WP_000272845.1|DBSCAN-SWA MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCIAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA >NZ_CP041973|719420:728591|724632_725100_+|WP_000950414.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMSEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >NZ_CP041973|719420:728591|725858_726317_-|WP_000703145.1|DBSCAN-SWA MKISGKLLSTALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >NZ_CP041973|719420:728591|720351_721083_+|WP_000824854.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
795789 : 806296
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP041973|795789:806296|DBSCAN-SWA TATGCCCGTGAATAAGTTCTCCCGACGTACCCTCCTGACGGCAGGTTCCGCGCTTGCTGTTCTTCCTTTTCTGCGCGCCTTGCCGGTACAGGCGCGTGAACCTCGCGAGACCGTCGATATTAAGGATTATCCGGCGGATGACGGTATCGCCTCGTTCAAACAGGCCTTCGCCGACGGACAGACCGTGGTCTTACCGCCAGGATGGGTGTGTGAAAATATCAATGCGGCGATAACGATTCCGGCGGGAAAAACGCTGCGGGTACAGGGCGCGGTGCGTGGGAATGGCCGGGGACGGTTTATTTTGCAGGACGGGTGTCAGGTGGTGGGGGAGCAGGGCGGCAGTCTGCACAATGTGACGCTGGATGTTCGCGGGTCGGACTGTGTGATTAAAGGCGTGACGATGAGCGGCTTTGGCCCCGTCGCGCAAATTTTCATCGGCGGTAAGGAACCGCAGGTGATGCGTAATCTCATTATCGATGACATCACCGTTACCCACGCCAACTACGCCATTCTCCGCCAGGGATTTCATAACCAAATGGACGGCGCGCGGATTACGCATAGCCGCTTTAGCGATTTGCAGGGGGACGCCATTGAGTGGAATGTCGCGATTCACGACCGCGACATCCTGATTTCCGATCATGTCATCGAACGCATTGATTGTACCAATGGCAAAATCAACTGGGGGATCGGCATCGGGCTGGCGGGTAGCACCTATGACAACAGTTATCCTGAAGATCAGGCAGTAAAAAACTTTGTGGTGGCCAATATTACCGGATCTGATTGCCGACAGCTGGTGCACGTAGAAAATGGCAAACATTTCGTCATTCGCAATGTCAAAGCCAAAAACATCACGCCCGATTTCAGTAAAAATGCGGGTATTGATAACGCAACGATCGCCATTTATGGCTGTGATAATTTCGTCATTGATAATATTGATATGACGAATAGTGCTGGGATGCTCATCGGCTATGGCGTCGTTAAAGGAAAATACCTGTCAATTCCGCAAAACTTTAAATTAAACGCTATTCGGTTGGATAATCGCCAGGTTGCTTATAAATTACGCGGCATTCAAATTTCCTCCGGCAACATCCCCTCTTTTGTCGCCATCACCAATGTACGGATGACGCGTGCTACGCTGGAACTGCATAATCAACCGCAGCACCTCTTTCTGCGTAATATCAACGTGATGCAAACTTCAGCGATTGGCCCGGCGTTAAAAATGCATTTCGATTTGCGTAAAGATGTCCGTGGTCAATTTATGGCCCGCCAGGACACGCTGCTTTCCCTCGCTAATGTTCATGCCATCAATGAAAACGGGCAGAGTTCCGTGGATATCGACAGGATTAATCACCAAACCGTGAATGTCGAAGCAGTGAATTTTTCGCTGCCGAAGCGGGGAGGGTAAGTACCGCTATTTTTACGAAAATTCCTGGGAAAAAGTTGTTCATACTTAATGTTATGGTGCCGACTAAGACGTAATGTAAAGCGTGCCATCATTATCCCTGGCAGCAGAGTAATTCATGCTGGCGAAAACAAGCTAAAGAGCTATAATTCAGCAACCATTTTACAGGTGGAAGAAACAATGATGAATTTGAAAGCAGTTATACCGGTAGCGGGTTTGGGTATGCATATGTTGCCTGCCACCAAGGCAATCCCAAAAGAGATGCTACCGATCGTCGACAAGCCAATGATTCAGTACATTGTCGATGAGATTGTGGCTGCAGGGATCAAAGAAATCGTACTGGTGACTCACGCGTCTAAAAACGCCGTTGAGAACCACTTCGACACCTCTTATGAACTTGAATCACTTCTTGAGCAGCGCGTTAAGCGCCAGCTTTTGGCGGAAGTGCAATCTATCTGCCCACCGGGCGTGACGATTATGAACGTTCGCCAGGCGCAGCCGTTAGGGCTGGGGCACTCTATTCTGTGCGCGCGTCCGGTCGTGGGCGATAACCCTTTCATTGTGGTACTCCCGGATATTATTATCGATGATGCTACCGCCGATCCGCTGCGCTATAACCTTGCGGCGATGGTGGCGCGTTTCAATGAAACGGGTCGCAGCCAGGTGCTGGCGAAGCGCATGAAAGGTGATTTATCGGAGTATTCCGTTATCCAGACGAAAGAACCTCTGGATAATGAAGGCAAAGTCAGCCGGATTGTGGAGTTTATCGAAAAACCGGATCAGCCGCAGACGCTGGATTCCGATTTGATGGCGGTAGGCCGTTATGTGCTTTCAGCCGACATCTGGGCGGAACTGGAAAGAACCGAACCGGGCGCCTGGGGCCGTATCCAGCTCACCGATGCCATTGCAGAACTGGCGAAAAAACAGTCGGTTGACGCGATGCTAATGACGGGTGACAGCTATGACTGCGGTAAAAAAATGGGCTACATGCAGGCATTTGTGAAGTACGGGCTGCGCAACCTGAAAGAAGGAGCGAAGTTCCGTAAGAGCATAGAGCAGCTTTTGCATGAATAAGTATTAACAACCGTGATAAATGGTTGGTGATAAACATAATAACGGCAGTGAACATTCGAAGCGGCAAGTTGGCTGAAGCGAGTGTTGACTGCCGTTTTAGTTTTGTATAAAGGGCTTAAGTAACAAGGGGTTATCTGGAGCATTTTAATGCTGATTTTATAAGATTAATCCTTGTTTCCGGATGCAATTAATAAGACAATTAGCGTTTAAGTTTTAGTGAGCTTTGCCCTGCTGGGCGAGGTTTGTAACAAGTCGATATGTACGCAGTGCACTGGTAGCTGATGAGCCAGGGGCGGTAGCGTGTGTAACGACTTGAGCAATTAATTTTTATTGGCAAATTAAATACCACATTAAATACGCCTTATGGAATAGAAAAGTGAAGATACTTATTACTGGCGGGGCAGGTTTTATTGGATCAGCTGTTGTCCGCCATATTATTAAGAATACACAGGACACTGTAGTTAATATTGATAAATTAACCTACGCCGGTAATCTTGAATCCCTTTCTGATATTTCTGAGAGTAATCGCTACAATTTTGAACACGCGGATATTTGTGATTCCGCTGAAATAACGCGTATTTTTGAGCAGTACCAGCCGGACGCGGTGATGCATTTGGCTGCGGAAAGTCATGTGGACCGTTCGATTACCGGGCCAGCAGCATTTATTGAAACCAATATCGTCGGCACCTATGTACTTCTTGAAGTTGCGCGTAAATACTGGTCTGCGCTTGGCGAAGATAAAAAAAATAATTTTCGTTTTCATCATATTTCCACTGATGAAGTTTACGGCGATTTACCGCATCCTGATGAAGTTGAAAACAGCGTTACGCTGCCGTTATTTACTGAAACGACGGCATATGCGCCAAGTAGCCCCTATTCTGCGTCAAAAGCATCCAGCGATCATTTAGTCCGTGCCTGGCGGCGTACCTATGGTCTACCAACGATCGTTACCAATTGTTCTAATAACTATGGCCCTTATCACTTCCCTGAAAAACTGATTCCGTTGGTCATTTTGAACGCACTGGAAGGAAAGCCTTTGCCAATTTATGGCAAAGGGGATCAGATTCGCGATTGGCTATATGTAGAAGATCATGCTCGCGCGCTTCATATGGTAGTGACTGAAGGCAAGGCGGGGGAGACTTATAACATTGGTGGCCACAATGAGAAGAAAAATCTCGATGTGGTATTTACCATCTGTGATCTGCTGGACGAGATTGTACCCAAAGCGACTTCTTATCGTGAACAAATCACTTATGTCGCGGATCGTCCGGGCCATGATCGTCGTTATGCCATTGATGCAGGTAAAATTAGCCGCGAATTAGGCTGGAAACCGCTGGAGACCTTTGAAAGCGGTATTCGTAAAACAGTGGAATGGTACCTTGCAAATACTCAATGGGTAAACAATGTTAAAAGTGGGGCGTATCAGAGTTGGATAGAACAGAACTATGAAGGACGCCAGTAATGAATATCTTACTTTTTGGTAAGACAGGGCAAGTAGGCTGGGAGTTGCAACGTTCTCTGGCACCAGTAGGGAATCTGATTGCCCTGGATGTCCATTCAAAAGAGTTTTGCGGTGATTTTAGTAATCCGAAAGGCGTTGCCGAAACCGTTCGTAAGCTTCGTCCCGATGTGATTGTTAACGCAGCAGCACATACTGCAGTAGATAAAGCAGAGTCTGAACCAGAACTGGCGCAGTTACTTAACGCCACCAGTGTGGAAGCCATCGCTAAAGCAGCCAACGAAACTGGCGCATGGGTAGTGCATTATTCAACCGATTATGTATTTCCTGGTACCGGCGATATCCCATGGCAGGAAACGGACGCTACGTCGCCGCTGAATGTCTATGGCAAGACCAAACTGGCGGGAGAAAAGGCCCTGCAGGATAACTGCCCTAAGCATCTTATCTTCCGCACCAGTTGGGTTTATGCAGGTAAGGGCAATAATTTCGCAAAGACAATGCTTCGTCTGGCGAAAGAGCGTCAGACACTTTCAGTCATCAACGATCAGTACGGTGCGCCAACCGGTGCAGAATTACTGGCTGACTGCACGGCTCATGCGATCCGTGTGGCGTTAAAGAAACCAGAAGTCGCAGGTCTTTACCATCTGGTTGCCGGGGGAACCACAACCTGGCATGACTACGCGGCCTTAGTCTTTGACGAGGCGCGCAAAGCAGGGATAACGCTTGCGCTGACTGAGCTTAATGCTGTGCCGACCAGCGCCTACCCGACGCCGGCGAGCAGACCAGGCAATTCGCGTCTCAATACTGAAAAGTTTCAGCGTAATTTTGACCTTATTCTGCCGCAATGGGAATTAGGAGTTAAGCGCATGCTGACTGAAATGTTTACGACGACAACCATCTGATAAATTTAAATGCCCATCAGGGCATTTTCTATGAATGAGAAATGGAAATGAAAACGCGTAAGGGCATTATTTTAGCGGGGGGCTCCGGCACCCGTCTTTATCCGGTGACCATGGCGGTAAGTAAGCAATTGCTACCAATTTATGATAAACCGATGATTTACTATCCCCTTTCCACGCTTATGCTGGCAGGCATTCGGGATATCCTGATCATCAGTACGCCACAGGACACGCCGCGTTTTCAACAACTGCTGGGAGACGGCAGCCAGTGGGGGCTGAATCTTCAATATAAAGTACAGCCAAGCCCGGATGGCTTAGCACAGGCGTTTATTATTGGTGAAGAGTTCATTGGTAATGATGATTGTGCATTAGTACTGGGTGACAATATCTTCTATGGTCATGATTTACCAAAGTTAATGGAAGCTGCCGTTAATAAAGAAAGTGGTGCTACCGTCTTTGCTTATCATGTAAACGATCCGGAGCGCTACGGTGTGGTTGAGTTTGACCAAAGTGGCACAGCCGTTAGTCTGGAGGAAAAACCGTTACAACCGAAGAGTAATTACGCGGTAACGGGGCTGTATTTTTATGATAATAGCGTGGTGGAGATGGCGAAAAATCTTAAGCCTTCCGCTCGCGGTGAGTTAGAAATCACGGATATTAACCGTATCTATATGGATCAGGGAAGATTGTCTGTCGCCATGATGGGGCGCGGTTATGCCTGGCTGGATACAGGGACGCATCAGAGTTTGATAGAGGCCAGTAATTTTATTGCAACCATCGAAGAACGCCAGGGGCTAAAAGTGTCCTGCCCGGAAGAGATCGCATTTCGTAAAAATTTTATAAATGCACAACAGGTTATAGAACTGGCCGGGCCATTATCAAAAAATGATTATGGCAAATATTTGCTGAAGATGGTGAAAGGTTTATAAGTGATGATTGTGATTAAAACAGCAATACCAGATGTCTTGATCTTAGAGCCTAAAGTTTTTGGCGATGAGAGGGGATTCTTTTTTGAAAGTTATAACCAGCAGACCTTTGAAGAGTTGATTGGACGTAAAGTTACATTTGTTCAAGATAATCATTCAAAATCCAAAAAGAACGTACTCAGAGGGCTACATTTTCAGAGAGGAGAAAATGCACAGGGGAAGTTAGTTCGTTGTGCTGTCGGTGAGGTTTTTGATGTTGCGGTCGATATCCGAAAAGAATCGCCTACTTTTGGTCAATGGGTTGGCGTAAATCTATCTGCTGAGAATAAGCGACAGCTTTGGATTCCAGAAGGTTTTGCTCATGGTTTTGTTACTCTTAGTGAGTATGCAGAGTTTCTGTACAAAGCAACTAATTATTACTCACCTTCATCGGAAGGTAGCATTTTATGGAATGATGAGACAATAGGTATTGAATGGCCTTTTTCTCAGCTGCCTGAGCTTTCAGCAAAAGATGCTGCAGCACCTTTACTGCATCAAGCCTTGTTAACAGAGTAAGCATCGTGTCTCATATTATTAAGATTTTTCCATCAAATATTGAATTTTCCGGTAGAGAGGATGAATCAATCCTCGATGCTGCGCTATCGGCTGGCATCCATCTTGAACATAGCTGCAAAGCGGGTGATTGTGGTATCTGTGAGTCCGATTTGTTGGCGGGAGAAGTTGTTGACTCCAAAGGTAATATTTTTGGACAGGGTGATAAAATACTAACCTGCTGCTGTAAACCTAAAACCGCCCTTGAGCTAAATGCGCATTTTTTTCCTGAACTAGCTGGACAGACAAAAAAAATTGTCCCATGCAAGGTAAATAGTGCTGTACTGGTTTCAGGCGATGTTATGACTTTGAAGTTACGCACACCACCAACAGCAAAAATTGGCTTCCTTCCAGGGCAGTATATCAATTTACATTATAAAGGTGTAACTCGCAGTTATTCTATCGCTAATAGTGATGAGTCGAATGGTATTGAGTTGCATGTAAGGAATGTTCCCAATGGTCAGATGAGCTCTCTCATTTTTGGGGAGTTACAAGAAAATACTCTTATGCGCATTGAAGGACCTTGCGGAACATTTTTTATTCGTGAAAGTGACAGACCTATAATCTTCCTTGCAGGCGGTACTGGATTCGCTCCAGTTAAATCAATGGTTGAGCATCTCATTCAGGGAAAATGTCGTCGTGAGATCTACATCTACTGGGGAATGCAAGATAGTAAAGATTTTTACTCTGCATTACCGCAGCAGTGGAGTGAACAGCACGACAACGTTCATTATATCCCTGTTGTTTCTGGTGATGACGCCGAATGGGGGGGAAGAAAGGGATTTGTCCATCATGCTGTGATGGATGATTTTGATTCTCTAGAGTTCTTCGATATATATGCATGTGGTTCACCTGTGATGATCGATGCCAGTAAAAAGGACTTTATGATGAAAAATCTCTCTGTAGAACATTTCTATTCTGATGCATTTACCGCATCTAAATAATATTGAGGATAATTTATGAAAGCGGTCATCCTGGCTGGTGGACTTGGTACCAGACTAAGTGAAGAAACAATTGTAAAACCAAAACCGATGGTAGAAATTGGTGGCAAGCCTATTCTTTGGCACATTATGAAAATGTATTCTGTGCATGGTATCAAGGATTTTATTATCTGCTGTGGTTATAAAGGATATGTGATTAAAGAATATTTTGCGAACTACTTCCTTCACATGTCAGATGTAACATTCCATATGGCTGAAAATCGTATGGAAGTTCACCATAAACGTGTTGAACCATGGAATGTCACATTGGTTGATACGGGTGATTCTTCAATGACTGGTGGTCGTCTGAAACGTGTTGCTGAATACGTAAAAGATGACGAGGCTTTCCTGTTTACTTATGGTGATGGCGTTGCCGACCTTGATATCAAAGCGACTATCGATTTCCATAAGGCTCACGGTAAGAAAGCGACTTTAACAGCTACTTTTCCACCAGGACGTTTTGGCGCATTAGATATCCAAGCTGGTCAGGTCCGGTCATTCCAGGAAAAACCGAAAGGCGATGGGGCAATGATCAATGGTGGTTTCTTTGTGTTGAATCCATCGGTTATCGATCTCATCGATAACGATGCAACAACCTGGGAACAAGAGCCATTAATGACATTGGCACAACAGGGGGAGTTAATGGCTTTTGAACACCCAGGTTTCTGGCAGCCGATGGATACCCTACGTGATAAAGTTTACCTTGAAGGGCTGTGGGAAAAAGGTAAAGCTCCGTGGAAAACCTGGGAGTAAGTAGATGATTGATAAAAATTTTTGGCAAGGTAAACGTGTATTCGTTACCGGCCATACTGGCTTTAAAGGAAGCTGGCTTTCGCTATGGCTGACTGAAATGGGTGCAATTGTAAAAGGCTATGCACTTGATGCGCCAACTGTTCCAAGTTTATTTGAGATAGTGCGTCTTAGTGATCTTATGGAATCTCATATTGGCGACATTCGTGATTTTGAAAAGCTGCGCAATTCTATTGCAGAATTTAAGCCAGAAATTGTTTTCCATATGGCAGCCCAGCCTTTAGTGCGCCTATCTTATGAACAGCCAATCGAAACATACTCAACAAATGTTATGGGTACTGTCCATTTGCTTGAAGCAGTTAAGCAAGTAGGTAACATAAAGGCAGTCGTAAATATCACCAGTGATAAGTGCTACGACAATCGTGAGTGGGTGTGGGGCTATCGTGAGAACGAACCCATGGGAGGGTACGATCCATACTCTAATAGTAAAGGTTGTGCAGAATTAGTCGCGTCTGCATTCCGGAACTCATTCTTCAATCCTGCAAATTATGAGCAACATGGCGTTGGTTTGGCGTCTGTGAGGGCTGGTAATGTCATAGGCGGAGGCGATTGGGCTAAAGACCGTTTAATTCCCGATATTCTGCGCTCATTTGAAAATAACCAGCAGGTTATTATTCGAAACCCATATTCTATCCGTCCCTGGCAGCATGTACTGGAGCCTCTTTCTGGTTACATTGTGGTGGCGCAACGCTTATATACAGAAGGTGCTAAGTTTTCTGAAGGATGGAATTTCGGCCCGCGTGATGAAGATGCGAAGACGGTCGAATTTATTGTTGACAAGATGGTCACGCTTTGGGGTGATGATGCAAGCTGGTTACTGGATGGTGAGAATCATCCTCATGAGGCACATTACCTGAAACTGGATTGCTCTAAAGCAAATATGCAATTAGGATGGCATCCGCGTTGGGGATTGACTGAAACACTTGGTCGCATCGTAAAATGGCATAAAGCATGGATTCGCGGCGAAGATATGTTGATTTGTTCAAAGCGTGAAATCAGCGACTATATGTCTGCAACTACTCGTTAAGAAAATAAGTTTAAGGAATCAAAGTAATGACAGCAAATAACCTGCGTGAGCAAATCTCTCAGCTTGTCGCTCAGTATGCGAATGAGGCATTGAGCCCGAAACCTTTTGTTGCAGGTACAAGCGTTGTGCCTCCTTCCGGGAAGGTTATTGGTGCCAAAGAGTTACAATTGATGGTTGAGGCGTCTCTTGATGGATGGCTAACTACTGGTCGTTTCAATGATGCCTTTGAGAAAAAACTTGGGGAATTTATTGGGGTTCCTCATGTTTTAACGACTACATCTGGCTCTTCGGCAAACTTGCTGGCACTGACTGCGCTGACTTCCCCAAAATTAGGCGAGCGTGCTCTCAAACCTGGTGATGAGGTTATTACTGTCGCTGCTGGCTTCCCGACTACAGTTAACCCGGCGATCCAGAATGGTTTAATACCGGTATTCGTGGATGTTGATATCCCGACATATAATATCGATGCCTCTCTCATTGAAGCTGCAGTTACTGAGAAATCAAAAGCGATAATGATCGCTCATACACTCGGTAATGCATTTAACCTGAGTGAAGTTCGTCGGATTGCCGATAAATATAACTTATGGTTGATTGAAGACTGCTGTGATGCCCTTGGGACGACTTATGAAGGCCAGATGGTAGGTACCTTTGGTGACATCGGAACCGTTAGTTTTTATCCGGCTCACCATATCACAATGGGTGAAGGCGGTGCTGTATTCACCAAGTCAGGTGAACTGAAGAAAATTATTGAGTCGTTCCGTGACTGGGGCCGGGATTGTTATTGTGCGCCAGGATGCGATAACACCTGCGGTAAACGTTTTGGTCAGCAATTGGGATCACTTCCTCAAGGCTATGATCACAAATATACTTATTCCCACCTCGGATATAATCTCAAAATCACGGACATGCAGGCAGCATGTGGTCTGGCTCAGTTGGAGCGCGTAGAAGAGTTTGTAGAGCAGCGTAAAGCTAACTTTTCCTATCTGAAACAGGGCTTGCAATCTTGCACTGAATTCCTCGAATTACCAGAAGCAACAGAGAAATCAGACCCATCCTGGTTTGGCTTCCCTATCACCCTGAAAGAAACTAGCGGTGTTAACCGTGTCGAACTGGTGAAATTCCTTGATGAAGCAAAAATCGGTACACGTTTACTGTTTGCTGGAAATCTGATTCGCCAACCGTATTTTGCTAATGTGAAATATCGTGTAGTGGGTGAGTTGACAAATACCGACCGTATAATGAATCAAACGTTCTGGATTGGTATTTATCCTGGCTTGACTACAGAGCATTTAGATTATGTAGTTAGCAAATTTGAAGAGTTTTTTGGTTTAAATTTCTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP041973|795789:806296|797370_798264_+|WP_000981469.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE >NZ_CP041973|795789:806296|803098_803872_+|WP_000648783.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIQAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >NZ_CP041973|795789:806296|801551_802103_+|WP_000973709.1|DBSCAN-SWA MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDETIGIEWPFSQLPELSAKDAAAPLLHQALLTE >NZ_CP041973|795789:806296|799725_800625_+|WP_001023658.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALKKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >NZ_CP041973|795789:806296|795789_797193_+|WP_001144948.1|DBSCAN-SWA MPVNKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVLPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVTMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNIPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG >NZ_CP041973|795789:806296|804982_806296_+|WP_000126349.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF >NZ_CP041973|795789:806296|802108_803083_+|WP_000018223.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELAGQTKKIVPCKVNSAVLVSGDVMTLKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQDSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGGRKGFVHHAVMDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASK >NZ_CP041973|795789:806296|798640_799726_+|WP_000697846.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >NZ_CP041973|795789:806296|800672_801551_+|WP_000857535.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMDQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL >NZ_CP041973|795789:806296|803876_804956_+|WP_000565913.1|DBSCAN-SWA MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLSDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR |
10 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
919218 : 966714
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP041973|919218:966714|DBSCAN-SWA ATTATTGTTTCTGAGCAAACTCAAACGGCCTGACCATCTCATACCGATTCTCATCAAGAAAATCAGCCCACCACTGGAGCATCAGGCGGCGTTGGTCAAGATGCTTGGCCTTATGGGTATAGGCTGCGCGAACGCTGTTACTTTCCTTATGGCTCATTTGAAGCTCTACAGCATCTTCTGACCATAGACCAGACTCAATTAAGGCACTACACGCCAGTGTGCGGAAACCATGACCGCAGATGTCCTGTGTGGTGTCATAGCCCATCTTACGTAGCGCCTTGTTAATAGTGTTTTCACTCATGGGTTTGAATGAATCATAACAGCCAGTGAAAATTAATTCTGCTTCGTTACCTTCTTCATAGGTAAGCTGGCGGATCTCTTTCAGTATCTTGAGAGCCTGCCTGCAAAGGGGAACGAAGTGCTGACGTTTCATTTTAGCCCCACGAGTCGAGTGCTTGACGTTTTCAATCGCTTCCCGCTGTTCGGGGATCACCCATAACTTACTTTTGAAGTCGATTTCCGACCACCGGGCGAAACGAAGTTCGCTGGAACGAATGAAGATCAGCAGATTGAGTTTAATCGCTAGCGTAGTCAGTCCACGACCTTTGTAGGCATCAATACGTTCAAGCAGTAGCGGGATCTCTTCCAGCTCCAGTGCAGGGCGGTGTTCCGTCTCTGGTTTCTGGACAGCGCCTTCCATATCATAGGCCGGATTATGACGCATAAGCTTTTGCTGGACGGCATGACGTAGGATCGCGGTGATGTATTGCTTAATCCGCATGGCAATTTCAAGGTAGCCGAGTGTTTCCGCTTTTTTGACCGGGACAAGCAGATCACCCGTATCCAGTTCTGAAACGTTTCTGTCGCCTATATCCGGGAAGACATAGGTTTCAAGGCGCTTCCAGACAGTATCGGCGTAATCTTCTGACCATTTTGTTTTGGTGGCAAACCAGCTTTTGGCGACGACACGGAACGAGCGGGTTTTATCCCGCTTCTCCTGAAGGACTTTTTCATCAGCCTGTTTTTTAGCGTTCGGGTCAATCCCCTGAGTCAGCAGCCTTTTGGCCTCGTCCCGGCGTTGTCTGGCATCGGCAAGTGAAACCGCAGGGTAAACCCCAATGGAAAACACCTTCTGTTTGCCATCAAAGCGATAGCCTAACTGCCAGTATTTTGAACCGTTAGGATGCACCAGCAGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGATGGTCTGGTATTTTTTACTTTGGTATCAGTAAGTGACATGACGGTTCCCTCCGCGTGCTGGTAAAACACAAATCGAACCAGCTTTACCAGCATTTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGAGTGAACCTGAAGAAGGGAATAACCAGTTGATATAAATGCAGAAAGCAGACGTCAGTGAACGTCTGCTTTCCTTAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATCGTGTGAACGGGGCGCATAGTAACGATGTGCGATCGGCTTGTCAAAGGGGGAAATAAGGTTGCGCGTTTGTTTGCTGACAAAAACAACAAAGCGTTGAAGTTTTGATCTAATTTCTACTTTGCCCCGGCATGGCGCAACTTTGTCTGTAATTGCACAAGTCAAATGCTGTGACCTTACCGCAATGGCTATGTGCCGGCGTCTGATGAAACGTGAAAAACTGGCAGGCACTTGGCAAATAATTCTGAGACATAACGCCGTAGAGATTAAGGGCAGGGGGCAGAATGAACTTTAGACGTGAAATATTTTGTAAAAATGGTTGATACAGGCAGTCTGACGCCGGTAGCGGAAATGGCAGATAAATTTCTGGTGCAGGCGAAAAGATTTCCGTCAATATCATAGGCAGAATTATGGCGCATCAGCTTTTGGCGACGACACGGGACGAGCGGGTTTTATCGCGCTTTTCCTGAAGGATTTTTTCATCAGCCTGTTTTTTGCGTTCGGGTCAATCCCCTGAGCCAGCAGCCTTTTGACCTCGTCACGGCGTTGTCTGGCATCAGTAAGAGAAACCGCAAGGTAAACCCCAATAGAAAACACCTTCTGTTTGCCATTGAAGCGATAGCCTGTCTGCCAGTATTTTGAATCGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGCTGGTCTGGCATTTTTTACTTTAGTATCAGTAAGTGACATGACGGTTCCCCCCGCGTGCTGGTAAAACGCAAATCGAACCAGCTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGGGTGAACCTGAAGAAGGGCATAACCAGTTGATATAAATGCAGAAAGCAGACGTCACTGAACGTCTGCTTCCCTAAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATTGTTTCAACGAGGCGCATAATACTGGGCCGGCTATAACGTGTCAACAGTAAAATTAACGCGCCAATTCAATTGGTTAATTAACCCACAAAGTGAGGAATTAATGTTCGGATTCCTCGCCGTAGTCGCCTTCGGCAGCGCTTACCCGTAAGCGCTGAGAAGGATCCTGGCGATAGAACTGGCAAAAACGCTGCCATAGTGCCGGGAAACGTGGAGCAAACAGTTCTGGCGCGCTGAAAAAATACTCTGACAACACGGCAAAACATTCTGCAGGGTCGGTGGCGGCATAGGCATCTATACTGGCAGCGCTTTCGCCAACAAGATCGATTTCATCCTGAATATTATTCATTGCCGCGTGGAGATCGTGTTCCCAGCCAGCCACATCGCGTAACGGGATGAAAGGGATGCCGCTGGCGCGATCGCCATTACGCATATCCAGTTTGTGCGCGACTTCATGAATAATGAGGTTGAAACCCGAAGCATCGAACGAGTCCTGGATATCCAGCCAGTTCAGAATGATGGGCCCTTGTTGCCAGCTTTGCCCCGACTGTACGACACGCTGGCTATGCACCAGACCTATGTCATCTTCCCATTCATCATCTACCACAAAGGGCGCGGGATAAATGAGCACTTCATGAAAACCATCAAGCCACTCAATACCGAGCTCCAGGATCGGTAAGCAAAAAATTAACGCAATACGTGCACTTTTTAACGAGTCGAGCTCAAATCCCTGTAGCGCTACCAGTCTTTTCTGCTGCAAAAAACGTTCGGCTAGCGCAATAAGCCGAGCCTGTTCTTGCGCGGTGAGGTTTACCAGAAGAGGTATAGCCAGCGCATCATCCCACGGCCAGTCTTCGTTCTGGGTTATTTCTTGTGCTTTCCAGGGCCACTTAATCATCGTTTTGCTCGTAAACTCGTCACTTGAACAAAATTACCCGAATAGGGTCTGTTAAAATGCCAAATTACCTGGCATCATTGCAATATACGGAGAGATGCCGGAGCGGCTGAACGGACCGGTCTCGAAAACCGTTGCGGGGGTAACTCCGCCGAGGGTTCGAATCCCTCTCTCTCCGCCACTATTCAAGCACTTACGTGATTTTCTTATAGTGATGAAAATCACGTTGAGAAAAAATGAGAAAATTCGGTGAGAAAAAAACGCCAGAATTTTAACTGGCGCACATCGAAAAGCTCAACGCTTCCTGTCCAGGGTTGGGCTCATTTTCACCTTGCGATCGTAAACAAGAACCTGGGATTCGGTTTTGTGGCCACTGTACTTCTGCTTGTCTTTTGCCGTTCCCTCATAGTCTGAAATCCCCTTTGCCTTTAGATCGTGGAAAGTGCAGTCAAGAGGACGTCCCAGATCATCCCCCGCAGCCTTTCGCGCCTTTCTCCACGCCTCGTTAAATCCTTTATAAGAATAACGCTCGCCATACATAGTCCTGATAACAGGGCCTTCCTCTCCCCATTCACGACATATTTCAACGGCATCACGTAAGCGATCTGTCCAGGATTTAATTTGTTTAACTCCGGTTTTTCCTTGCTGAATAAAAATTCCTTTCTCCAGTATTTGATTCCAGTTCATTTTCAATACATCAGAAACTCTGGCAGCACATAAATAAGCTATTTCCATTGCAGCCCTGACGGCTGGCGTTGCGTTATTATATATCGCTCTGTACTCTTCATCGGTAATATATCGATCGCGTTGAGGCTTAGGAAACTTATCCACACCAACGCAAGGATTACCAGGAACATAACCACGTTGATAACTCCAACGAAATACGCGCGACATAGAGCTGTGTTCATGATTCGCCTGGACACGGCTTTTTTGCCCACGGGCATCCATATAACGCCGGATATGTTCTGGCTTTATTGCTTTAGCTTCGGCATCACCAAATACGGCAAGTATATATTTCTCATGTGCCAGATAATCTTTCTGCGTTCTTGGGGCCAGATCAGCATAATCGGCACTGGCAAGAAATTTTCGCCATAATTGTGTGAATGTAATTCTGTTTTTTCTACCCTCAACTTTTTTTTCGTAAGCCACCCAGACCTCAGCTTTAGTTGCATCAGCTGGAGCTATATTTTCTGTTGATCCTCCCGGTTTCCAGTAGTAACCAGAAGGGCGAAAGAATACACCCTTTGGCATCCACTCATTACCGGGCGCACGTTTGCGACCCATGTTAACTCTCTATAGCGTCAAAATTCATGCCTGGTACAGGCTGATACCCTGCTGGTGGAAGAAGGCGCGAAACTGGGTGATTTATATGATACCAGGTTGTTCTGATTGAACCGTCCCGGCGTTCTATAAAATAAATACCGTTCAGCGTTAATACTTCTTTCTGGAGTGACTTCTGGCTTGCTCCTGTAGCATCTTCCAGTTCCTCCTCAGTCAGGAAGCGATCGCTCATGAGTTGCTTCTCCATCTAACCGGCTGCACCCGGTTTAAATCAGATATTGTTGCTGGTGGGCGGGATCAGTTTCTGCCAGATAGCTGAAACGTATTTCGCCTGATGCCTGGCATCAGCCAGCGCGTTGTGCATATCGCCTTCGAACGGCATGTCACGTTTGGGATCGAAGCCGATTGCACGACCCAAAGTAACCATCGTGCGGACGTCATGATCGTTCCGGTAATTCCACAGGCAGGGGAGGCTGGCACGTTCGAAAGCGCCACGCAGGATAACGTTATCGAAATTGGCACCGTTACCCCATACCTTCAGGTATTTCAAATCGTCGGCGTGACAGGTGATAAAGTCATTGAACTCAATAAGCGCGGTCGTAACAGATACTGCATCAGCGCAAATAGCCGCTCGCGCTTCCGGGCTTTGTCTTAACCACCATAGAATGGTATCGCCATCAGGAACGGCACCTTGTTCCATTGCGCTTTCAAGGCTAACGGCGGTATAGAACTCAGGTCCAATTTCACCACTTTGCGGATCGAAGAACACAGCACCGATGGAGACAACAGGCGCGTTAGGTTTTTTACCCATCGTTTCAAGGTCGACCATTAAGTTGTTCATTCGTTATTTATCTCCGGTTTTGGTGCTGCTGGCACTGGCATCCAGTGTGTAACAAAGATGTGCTCAATACAGTTCATCTGATTGCCGCCTCGCATGTCAAAAAATAGTCCAGAATGCTCATCAAAATATGAAACATAACGGTATCCCAACTTGTTATGAACAATTACTTCTTGCTCGTCTTCTGGCATCCGCTCACTACAGCTTATCCAACCATCCGGAATTACCGGAGAGTTGCCAGCATTTGGATGCAGAGCGCGAATGCCTTCGGCCAATTCTTCCAGGGTGGATGAATATCCGCGTTGGGCATCATTGCCGAACTCGAACGCTCCGGTATCAGGATCAGACCATCCATGCTCGCTGTCGTATGCCTCACGCTGCTGATCTATCCATTTGGCGGCGGCTTCAACGCCATCGCGATAGAAAGTGACTACCGGCGCTGACTGTTCTTTGCCAACCAGACGCGTTATTTCTGATTCCAGAAGTGAACCTAAAACTGTACTCGTGCAGTGCTCAGCCCATTCGTTGTTTTCCAACAGGCTGATGATATTTAGCACATCATTGTAAACGCCAGTTTCCTGTACTGGCTGGGCGGTGTAAAAATACGGTCTGATAGTCCACTTTTTATTCCAGAAATCTCGCGTTTTCTCGGCTTCCTCAAGCGTTGCAACACTACAGCCAACCTTTCCGCACTCTTTGATTACGTGATATCCGGCTGGCTCAGCTTCCAGCGATGCCAGCGCACGCTTCAGCACAATAAGAACTTTGGCGTCGTCATCGCTCAGGCCAAACGGAATATCGTCGCGAGTGTTTTCAAATTCAGCGATAGTTTGCTGTAGCCATTCTTTGGTAATAGTGGTCATGGTGTACTCCAGTTATCTTCGATAGCCACACTAAGTCGGTGCAGCCAGTCAGCTAATTTCAGCATTGCTTCGCGTTCGCTAAGCCCTTCCGGAAAATCTTCAAGTTCAACGAAAGGCTTAAACCGTCCGAAGGCGTCGTTCTTAACTGTCAGCTTCTGTTCAAGCGTGGTCTTCTTAACCTTGCTGTGATGCCGTAGAAGGTAAACAGACTGAGATTTTTCGGTTTCCGGATCGTATTCGTAGGCAGTTAGAATCATCTGGCTGCCGCCGCGATTTAATCCTCTCCACATAGTCACTTCCCCTTGCCGATGCCAGCGGCGTTTAATTTCTGCCGTAATTCTACTAATTTGGTTTGTACCGCTCTTTGCCTGTGCCATTGCAGGACGAGCATTTCGGACTACCGTTGTGGTCGTAATAACCACTGCCGTTACACGCCGTGCAGGGACGCAGCTTCCAGCCAAAAACGAAACGTTGGTAATATTCAGTTCGACGAACTTTGCGTTCGTGGAAGTTACACATCTCACCCCCTCTCAATACGCAGCGCCTGTTTATCGATGTTGCTCATTGGGCTGTCTCCTGTGGATAACAAATATCGTCGAAATATTTTTCTGCTACGCGCATGTTGAAGTGATCGAGATTCATCTCCTTCACCTGGAGTTTTGCCCCAACAATGCCTGTGCATTGATTGACGTAATCCCGGTTTTCTGGGGATTCCGTTACCCACTCCATAAGGTTTTCGGTGACACTTTTTAAGCAACGTAAAGCGCAGTCCAAATCAGTAAAATGCTGAGCATCAGTTATGCAGGAGACGACATAATACGTGGTGACTTTTGGCCCATCAGCGCGTCGTTTAAGCTCTCTTTCGATAGCGTTTTTCAGATCAACCAGTTCATGGTCATTGAGTTTGTCGATGTTGCTCATTGGGCAGCCTCCATTAGTTGCATTACGTGTCGCTTGTGCTCTTCGCTTTGCGGAACACCTGTAAAGTTCACCGCCATGAAATAAGCCAGGCGATCCTTGTACGTCACCTCATCCAGCACAACTGCTGGGAGTGGACGACGCCCAAATGCCAATTGCTCCGCACGAGTCATATCTCGCCAGTAAAGCGGACCATCAGCCAAGATGATTGGAATCTCATTGGTGATGAGTTTTTTCAGGGTTGTGAGACGCTGCTTGCCGTCAACAACTTCTATGTAAGGAAGTTCACGCGAACACCAGTCAGGTGCCTTTGCCAGCGCCACTGAGCCGATAGGAAAACCAGAAATAACTGCGTTTAAGAATGCCTGCTGCTCTTCATGCCCCCAGACATACCCGCGCTGATAATTGGCATCAAAATCAAGTTCACCACCAATGATCCAGCGAATGTACATATCAACCGGGTACTCACCGGTGCGCGCGTCGAATACCTGAGCATTGCGAATTCGGTTGCTCATTGAGCTGCTCCTTCAGCTTTCTTTTCATCAACGCTCCACGCCGTAGCCAGCGCACTTGTTACCTGAAGAAACGAGTGTTTAACCCTCACGGTGAAAGTTTCTCCGGTGGCTGAAATAGTCTCGATGACTGTTAACTCGCCGCCGCTTTTGTAGTCTGGATAGAACTGTGTTACCAGATTGCTATCAACAATCACCGATCCGTCCAGGGTGTACATTTCCAGTTTCATGCTGCACCGCCTTCAACGCGCTCCCACAAAAGTCTTGATCTGATTGCCTTCACTACAGACTCTTTATCTTTTATGCAGCACATTGGCGTAGCTCCATCAGTTCATTAAAGCGGGCCATAAACAGGCCGAAAGCCTGACCGGGGCGAAGTGGGTAGATTTCGAATAAATCTGTCGGGGGGATACCTTCCAGTATTACCCAGGGAATACTGTCATCAATATCCAGATCACGGCGTTCAGTTGCCAGCATGGTCAGATCTGCATACTTCACTACGCTGGCTTCTTCCAGTGGCAAGCCAAACTTAAAGCGGATCAGTTGATCGGTACGTTTCTCAATCTCGCGATAATCAGGCAGTAACGCTTTTAATGGGGCAGGGATATCCTGGCAATACGCTTCGGCTGCGTCGTGCATCAGGGCTTCAAAGGCAAACTCCGGTGATACAAGCTGGCTGCACAGTACGGAATGCTGCGCCACGCTATAAAATTCAGGAAGATGTCCGGAGAAGCGGCAAATATTGGAAAGCGCCACGGCGATATCTTCAATATCAATGTCGTCAATAGTTGCGCTGAGATAATCAAATTGTTTACCTGAAAGTGTTTGAATAAAACTCATCGTTGGTTCTCCTTATAATTTATTTCGCGCTGCACCGCGTGAATTTTGAGTACAGCAACCCAACCCACGATGTGGGGTTAATTGCCGCTATGAGTTATCGCTTGGCTTCGCCGCCGAGGGCAGCCGTTAAATCAGAAATAAGATTACTGAGTTCGCCTGTCATCAGCGTAATGTCAGCATCAAATCGCTGCACGACATCTTCGCTGTCGATATCGTCGTTCTGGCTGATTAACTGATCTGCAAATTTTATGCGTTTCAATATACCGTCACAAGAAAGGGTGAAACTGATACGCTGCTGCCATTCCATTGAAATCTGGGTAACTACCTTCCCAGCTTCGATATGGGTAAGAATTTCATCGCAGGCAAGATCCTGCTTTTTAAACCGGCCTGTGCCACCATCTTCGAGAATAGCTTTAAGGACCGCTTCATCGCCGATGGAGAACCCGGAAGGAGCCGCTTCGCTACGAACCCACTCAGTTAGCGTGAGCTCGATAGGGTTTTCCATCGTCAGCGGCACGACTGGCAAGGAACCTAGGGTTTTACGAAGCAGGGCGAGAGAATCTTCTGCGCGCTTGATGCTGGATGTATCAACAACGATAAACCCGGCTGCAGTGTTTATCCAAATACGAACCAGACTGTTTTTAGTAAACGCCCTGGGTAACAGGGAATGAAGAACCTCATCACGAATAGAGTCTTTCTCAGTTTTCTTAAGACGGCGGCCTTGCTCTCGCTCAAGCGTGGAAACCTTCTTATTAATCTCATCGGCGATCGTTTGTTTAGGTATGATTTTTTCTTCACGACGAATAACCAAAAGTAACTGGTTATTGACTGCATGATATAGCACATCTGAATACTGGACTAATGGTGAAAACCATCCGCTTTTTGCCATATCCTGGCTTCCGCATGGTGAGAAGCGAAACAGCTCAAGTTTCTTATCAAGAGAGTCTATGTCGATGTTAAAGTCGCGGCTGAAGCGATATATCAGCATATTTTTAAAAAATGGGTTATGCATTTTGTTTCCTTAACGCCTCTGCACTGGCGTTTTACGTTGGTTTCTCCACAAAACAGAAAAGAGCACCTGCTGTAACAGCTTTCCGGGTGGATTGGGTAATGAGCCCGTCGCGCGGAGATGCTCTTTTCTGTTGTGTAAAAAGGTCGGCGTCACGGCAGAACACTGTCGCCTTCCTCCTGTTGTTGGAAGAGCCGGACGCCGACAAGACTTCACACAGCAATAACGTTGTGGTGGGGCTGTCACTCAGGCGCATGGTCAACCTGACAACCCGGTGTCCTACTGGGTACAAATGGAGAAAAACCCGCCATACTTACCGCCGCGCCATTTCGCGGATTACCACAACGAAGAGAGCACTGCCGGTGTCCGAATTGAACGGACCTTTTCTCTGCCCAACCCTCCTGACTAAACAGGACTGTCTGGAATCGAACCAGCACTTATGCCTTGCTCGTCAATGCTCTCATCGTTGTGTGCCTGTCTTTTCACCACATCAGGCTCGGTGGACCTTGCTATTCCCCAACAGTAAGGATTCGGGTAATCTTTTTAATTCCCCAACAACATAAGGGCTTAACATGTCTCAGAAGGATGATATTCCTGTCTTTCCCGTAACCGGCTGGCAGGCTGGACCGCTTCCTGGTTACGACGCTCTGGTAGTGAAATTCCAGTTTCTCTCATCACCGATGCAACCAATTGAGTCTGCTCAGGAAACGCAATTTTTAGTACTTACTCCTGAGATGGCTGAGAGCCTGGCTTCAGACTTGCAAAGGCATATTCAGGATTTGCGAAATTCCGACGTTCACAGCCCACAAGAAGGCAAGCACTAATAAGGAACACCTGAACTACTTCATTTCCCTTAAAGCGCCGTTGGGTGATGGCGCTTTTCTTTGCATTAACCAGCATCATTCCCCCTTCGTGACGTTCAGTTTTACTGGCTTTATCACGGCTGCGTAGTTGATAAGAATGTTTACGCATGAAACAACGCACTCGGAACAGATAGCTGGCTCGTCCTTACTTCCTTTAGCGATGAGCTTTTTTGCATCCAGCTCACTGACCCCGCAGAAGGAGCATGTGTAGTAGTTATTCATCTGAACTCCTGTGTAATGCATCATTGCGAATCATCCGGTCATTCGTATGCCACCGGCGGCTACTTCGTGGGCGTCCTGCCTGTTCGCTGCTCTATGAGTGCAAATTACATTTAAATTGCACATTGCGCAAGTATAAAATTGCGATATATGCAATTTTGAGTCAAAAAAAAAGCCACTATAATGGTGGCCTTGTCGACGCTTTCTATTAATTGTGTCGTTTGAGTGACTGCGTCTGGCTTATCAGAACCTTGCCAAAAACACCGAACCTGCACTCGTTGTCTTTGGTAATACTCCATTCCCTGTAGTTAGTGTTATCAGATATCACCAGTAATTTATCGGGGATCATCTGTAGCCTTTTTACGTATATTTTATCATCAAAGCCAAAGACATAGATGCCATCACCATCGAACTGGTTGATGCTTATATCGACAAAAATAAGATCTCCGGGTTCAATTGTTGGTGCCATGCTGTCACCACGCACGTTAATCACTTTAAGCTCAGCGGCAGGACGCCCGCCGAACATAGCTAGTGCTTTGTCCTTGTTATATTCGATAGCATGGATTACATCGATAACATCACCGCCCTGAATGAGTCCATTACCGGCGCTTGCACTGACATCCAGTATCTCGATACGGAACAAATCCTTCACGTTAGCTGAATCCTTCCTCATATCACTGTGTTTACATACAGTATTACCTTTTGGGTCTGAGGTAAAGAGTTCTGCTATATCAACACTTAAGCAGTCAGCCAGCCTAGAAAGTGTTTGTTCGGTAAATTGCTTTTGCTTGCCAGTCTCCAGACGAGAGATGTTTGCGGCATCCACGCCGATCGCTTCTGCTAGCTCAGCAATTTTCATGTTCTTCGCGCGGCGAAGTTGTCTGACACGGTTTCCTATATTCATGCGTTCATTACATTAATTTTTTGCGCATTGTGCAAATCAACTTGCGCAAGTTTGCTGTATGAAATAACATGCGACATACGCAAAAGAAGGAGGTTTTATGCAATCACCATTGAGAAAATTGCGGAAATCGCATGGTTATACGTTACAGCACGTCGCTAAAGGGGTTCAGGTTGATCCTGCAACATTAAGCCGGGTTGAAAGATGCGAGCAGGCTCCTTCAACAGAGCTTGCTGAGCGCCTGGCTCAATTTTACGCCGGAGAAATTAGCGAGATGCAAATTTTGTATCCAAACAGATATCAGCTTAGTGATTCGGCGATTTGACCGCCACCACAGCAGAAGGAGTAGATCCGTGGGACATGAACCTGAATGGAAAGTTGAAAAGCAGCCCCGCTGGCTGGTGGCTGCGATTAAAAAGACGATTTCCAGTCTGCATGGCGGTTATGAAGAAGCTGCGGAATGGCTGGATGTCACCAAAGATGCTCTGTTTAACCGCCTGCGTACTGGTGGTGATCAGATCTTCCCGATTGGGTGGGCGCTGGTACTGCAACGTGCCGGAGGAACCTATCACCTGGCACATTCAGTAGCCAGGGCATCAGGTGGCGTTTTTGTTCCGCTGGCAGATATGGAAGAAGTGGATAACGCAGATATTAATCATCGCCTGCTGGAAGCGATTGAGCAGATCACCAGTTATTCCCAGCAAATCAGGGTGGCTATCGAAGATGGCGTTATTGAGCCACATGAAAAAGCCGTGATTGATGAGGAGTTGTATCAGGCGATCGCAAAGCTGCAACAGCATTCGACACTGGTATACAGAGTTTTTTGCGTGCCAGAAAAGGGTGACGCCCGCGAGTGTGCAGCTCCGGGCGCCGTGGCGTCAAATTTTATGGAGAAAACCAACGCATGAACAGTTTAACGGTAAATAACCGTTTGTCGCAACAACCGGGGATGTATGAGTACCGGCCGTTGCGTCATGAATGCAGATTATCAAATAGCCTGGTCGTGCGTAACCACAGGGAACACAGCCTGACCGTGGGGGATGAATCGTGCAGGAACTTAACCGCTGGTTTCGGGATGGAAGGGGACTTTATGTCCATGTCATTCGCTGGGAACCAGAAACTGAGCGCGTTATCTATCTGCGCAAGGGCTATCCGCATGAGTGTTTTAGCCCTTTGTGGAAATTCAGGCGTGATTTTGTTGAGTGTGAAGCGCCAGGAACACATTGATTCTGCAATTCCGGGACGTTACACTGTTCAGGCACCTCATAAAGCGGGTGCCGGGCGTGGAAACCCGGAATTCAATATAGAGCACAACCGCGCTCATGCGGTTTTTTCTTGTCATGAGCATTGTTACGCCCAAATTATGGTGGGGCGTGCAGGGCCAGTTTCGGCTGGGCCGGGTTCTATGTTGACCGGTATTTCCACCCCTGTACGTCTCACCACCTATAAGGTCGTGGAAAGCCTTGGTGGTGAGTTCATTGAATTCAACATAGAGGCTGCCACTATGGCTACTGTCCCAACCCTCGCTCAACCTGAAATTAGAATTATTAACGGCCAAGCCGTTACTTCCTCCCTGGCTGTTGCCGACTACTTCATCAAGCGTCACGCTGATGTTATCCGTAAAATAGAATCTCTCGAATGTTCCACTCTATTTCGTAAACGCAATTTTGCGTTTACATCGATTTCAATAAATCAGCCCAACGGCGGTACTCGCAAACTCCCATGCTATCAAATCACACGCGATGGTTTTGCGTTTTTGGCAATGGGTTTCACTGGTAAACGTGCTGCTCAGTTTAAAGAGGCATACATCGATGCCTTTAACCAGATGGAGAAACAGCTTTCAACTCCATCGGTGCTGAGCGATGCAGCACATAATGCCAGCGTTCTTTATTCCTACATTTCATCCATTCATCAGGTTTGGTTACAGCAGCTTTATCCCATGCTGGAAAAAGCGGAATCTCCGCTGGCTGTAAGCCTGCACGATCGCATCAATGACGCTGCGGCGCTTGCGAGCCTTATCAATATGACACTGAACCGTTCAGAGGTAAGGGGGCGCAAATGATCCGGAATATTTTTAAACGGTTCACCAGCCAACGTTTTCATTGCCCTCGTCCAGGACAGTGGTACAGCACACCAGAAGGGTACGTTCTGCGTATTAGCCTGGTCGATCGCGAATGTCAGAAGGTTGTCTGTGAGCCTCTTGGGCGTAATTACCGCGTCAACATGCCGCTTATTGCCTTTCGTTCCGGCAAAAACATGAAGCATCTCGGAGGTGCTGCATGAGTTCCCTTATTCAATTACTCGATCGCCCCATCGCCTACAACCCTGCTTTTGCAAAACTGAAAGCCGGGAAGGTAAAAGCTGGCCCGGTTGCGGCAGTATTCCTGTCCCAGCTTGTTTACTGGCATAACCGGATGGATGGCGGCTGGATGTACAAAACACAGGCTGATATTGCCAGTGAAACGGCGCTAACCCGCGACGAACAGGAAACAGCACGTAAACGTCTGGTAGCACTTGGTGTACTGGAAGAAGCCCGTCGCGGTGTACCTGCCACCATGCACTACCGCATCAACACCGCACGGCTTGAAGCGCTGTTGCTGGAAACGGCGAAGCCAGTGAAAAAGGGCGCTCAGGAGAAAACCAGATTGCGGGACTTCCAGAATGTGGAAACCCCGCAATCTGGATTGGTGCAACCCCGCAAACCAGATTGCGGTGATGCCGCAAACAAGAATGTGGAAACCCCGCAAACAAGTACGGGGCAACCCAACGAACAAGCATGTGGCGATCCCACAATCTTTCCTACAGGAGATTACACAGAGACTACTCAGGAGATTACACAGGAGAGTAAAACCCCTTTTTGTCCGGTTGCTGAGCAACCCGACCCCGAAGTGACGCTCACCGATCAGGCGATTGAGGTTTTAACCCACCTGAACCAGGTAAGTGGCTCCCGGTATCAGAAGTCAAAAACCTCCCTGGAAAACATCCGTGCCCGACTGCGTGAGGGGTACAGCGTTGCTGATCTGCAACTGGTTATCGACCTGAAGCATGAGCACTGGCACGAGAACGACGAGCAGTACCAGTACATGCGCCCGGAAACGCTGTTCGGTCCGAAGAAATTCGAGAGCTATCTGCAAAGCGCTACCCGCTGGGATCAGAAGGGGCGGCCTAAACGCGCTGACTGGGGTGCGAAGAAGCGCGATGTGATGGCTTTTGGTCCGGTTGATACAACGATTCCGGAGGGATTCAGAGGATGAGTCTGTTAGCAAAAGTGCAGGCGTTTATCGAGCTTAATCCGGGGCTGACATCAAATGAGATTGCCGATGCTTTTCCTGAATACGCACGCTTTGATGTGCAGCGTTCAGCGAGCAAGTTGTATCGGTGTAAGCGTGTTAACCGCCGCCTGGATGGAGATGTATTTCGCTATTACGCGGGTAAAGACGAGGCAGTGATTTTGACGTTACGACAGAAAAGGTCAGGTCATACAGGTTCGGGTGATCCGATGGTGATTGCAAAGCTGGTAAGCCGCGCTGAAGAACTGGAATCCAGAGGGTTATTTAATCGTGCATCGATAGTGTGGCTGGAGGCATTTAGCGAAAGCCAGTTTATCTACGAACGCGAGGAATTTTTACGCCGCCGTCAGAAGTGTCTGAACCGCATCAAAAAGAGAATCAGACCCGTAGAGCAGGTTTATCTGGCAGGGCGATTTGTGGGGAACGTGGAATGACCAGTGAATCCGTTTGTATTGAAAGCAGTGATGTAACGATATCTGTTGATGAATCCGCTTCGCGCACCTGGCGTCGCCCGTTCCTGAAATGGGCAGGCGGTAAATATTCCATGTTACCCGATCTTTACCAGGTCATTCCGGCAGGTATGCGCCTGATTGAACCGTTTGTCGGCGGTGGTTCGGTGTTTCTCAACTCAGACAAACACGCCTGCTTCCTGCTGGCCGATGTGAATACCGACCTTATCAATCTGTATCAGATGCTGGCTGTTGTACCTGGTGCGGTGATAAGACATGCTAGGGTAATGTTTGACCGTCTCAATGACGCTGAAAGCTATATGGCGCTACGGGAAGAGTTCAATGCTCAGGTGATGGACGCTCCGGAACGCGCCGCCGCTTTCCTTTTCCTTAATCGTCACTGCTTCAATGGCCTGATCCGGTACAACCGCAACAACCAGTTTAACGTTGGCTGGGGCAAATACCCGTCGCCTTATTTCCCGGAAGAAGAAATCAGGGCATTTACCGAAATGGCGCACAACTGCGTATTCATGGCGGCAGGATTTCGCCGGACGCTGGCACTTGCGGGAGAGGGTGACGTTGTGTACTGCGATCCACCCTACGAACCGATGCCCGGCAAGGATGGTTTTACTCACTACGCCGCTGGTGGCTTTACCTGGGATGATCATATCGCGCTGGCGGAATGTTGTGTTGCTGCTCATCAGCGAGGTGCCAGAGTCGTGATCGGCAATTCCACATCTCCGCGTGTTATCGACCTGTACTCGCAGCACGGCTTTGAAATCCGCTATATCAGCGCCCGCCGCTCAATATCAAGTAAGGGCAGTACCCGCGAGAAAGCGAAAGATCTCGTGGCGATTCTGTAGGGGGCGGCATGAAACTGACATTGCCATTTCCACCCAGCGTTAACACCTACTGGAGGGCTCCGAATAAGGGACCGCTTAAAGGTCGTCACATGGTCAGCGCCAGCGGCCGGAAGTATCAGAGCGAGGCGTGCGCGGCAGTGATTGAGCAGTTACGCCGTCTGCCAAAACCTTCAACAGCCCCGGCAGCGGTGGAAATCACCCTGTATCCGCCAGACAAGCGGATCAGGGATCTGGACAACTACAACAAGGCGCTGTTTGACGCCCTGACCCACGCGGGTGTGTGGGAAGACGACAGCCAGGTGAAAAGAATGCTGGTGGAGTGGGGACCAGTTTTCCCGAAGGGGAAGGTAGAAATCACGATCACGAAATTTGAAACAGGGGCGGGTGCAGCTGCCTGAACATGGAGAAAGAAGCATGAATAATTTAATGGTCATTGATGGTATCGAAGTTCGCCGCGACGTTCATGGGCGCTATTGTCTTAACGATTTGCACCGGGCTGCGGGTGGAGAGCAGAAATACCGTCCGAAGTACTGGCTTGATAATAAGCAAACCCGTGAGCTGATTGAGCAACTTTTCACCGAGGGCGGAATTCCACCCTCGGAACAAAATCAATCTGTTAGCTTTTTTCAGGGCGGTAGTGATACCCGAAGTTTGGCACGTGCTCCAGTAAATACTGTTCGCGGTGGTGCTGAACAAGGTACATACGTATGCAAAGAACTGGTATTTGCTTATGCAATGTGGATCAGTCCGTCTTTCCATCTCAAGGTGATCCGCACGTTCGATCGGATTACCAGTGCGCCACAAATATCTTCTGGTATGGCTGCCGATAAGATGCAGGCGGGGGTGATTCTGCTGGGTTTTATGCGCAAAGAGTTAAACCTGTCCAATTCATCGGTACTGGGCGCGTGCCAGAAACTCCAGGAGGCAGTGGGACTACCTAACCTGGCGCCACAATATGCCATTGATGCTCCGGCTGGCGCGCCGGATGGTTCAAGCCGCCCGACGCTTGCACTGAGCGCGCTGTTAAAACAGCATGGTATCCGGATGACGGCTAATCAGGCGTATCAGCAGTTAGCAAAGCTGGGTGTTGTTGAACATCGTGAGCGTTACAGTCGCTCCGCGATTAACGGCATTAAAAAATTTTGGTCGCTGACGGCGAAAGGCTGCATGTTCGGCAAAAACATCACCAGCCCGGCAAACCCTCGCGAGACGCAGCCGCATTTCTTCGAATCCAAATTCCCTGAGCTGCTGAAGCTGCTCGATACCGTTCATTGAGGTGATCGTGAGAGCGTTACTGACCCCTGAAATTGCTCCTCGTATGGGCGTTGTATTGTTCAGGCCGGGATCGGAACTGATGCCCCTGTTTATGCAGGGGCGTGTTCTGCTTGAACCAGAGCCGGAACAATATTCATCTTTCGCCTGCGGCGCGGTCCCGGCGGTATCACAGCCGCTGGCGGATGATCCTGCTGTTCGTGATGTGTTCCGTAATGAGTCGGTTATCTATCGTGCTGGTGGTCTCGATAGTCTGGAAAGCTGGCTACTCCGGGGGAATGGCTGTCAGTGGCCGCATTCAGTCTGGCACAGCGAACAGATGACAACCATGCGCCACGCACCGGGGGCAATCCGACTGTGCTGGCACTGCGATAACCTGCTGCGCGAACAGTTTACGGAACGGCTGGAATCAATAGCTGTGGAGAACACGACAAAATGGGTTTTATCGGTTGTTTGTCGTGATCTGGGTTTTGACGATATGCACGCAGTCACGCTCCCGGAACTGTGCTGGTGGATGGTACGCAATGACCTGGCAGAAGTCTTACCGGAGAGCGCTGCGAGAAAAGCATTAAGGATGCCGAAGGCAATTGTCCAGTCAGCTACCCGTGAAAGTGAAATTGTCCCCTCGGTGCCGGCCACCAGCATTGTACAGGATAAGGCGAAAAAGGTACTGGCACTCAGGGTTGATCCGGAATCGCCGGAAAGCTTCATGTTACGTCCGAAACGCCGTCGATGGATCAATGAAAGATATACCCGCTGGGTTAAATCCCAGCCGTGCGCGTGCTGCGGGAAGCAGGCGGATGATCCGCACCACCTGACAGGCCACGGTCAGGGAGGGATGGGAACAAAGGCGCATGACCTCTTTGTGCTGCCGTTGTGCAGAACGCATCACAATGAGTTACATGCGGACACCGTGGCATTCGAAGAGAAATACGGCTCTCAACTGGAGTTGATATTTCGTTTTATCGATCGCGCGCTGGCGATCGGCGTGCTGGCGTAAATGGAGAACACGCATGAACCTTGAAGCCTTACCAAAATATTACTCACCAAAATCTCCAAAATTGAGCGATGACGCTCCGGCGACAACCTCCGAATCTTTGACGATTACGGATGTAATGGCGGCGCAGGGGATGGTGCAATCGAAAGCACCACTGGGGTTTGCTTTATTCCTGGCAAAAGTTGGTATTCAGAATCCTGACTTCGCGATTGAAGGGCTGATTCATTACGCGGTGGCACTGGATAACCCGACACTGAATAAATTGAGTGAAGAAACTCGGTTACAGATTGTTCCTTACCTCGTGAATTTTGCATTTGCTGATTATTCCAGATCTGCTGCAAGCAAGGCTCGCTGTGAGCATTGTGCTGGTACGGGATTTCATCATGTATTACGTGAAGTGGTGAAACACTCCAGAAATGGTGAACCCGTCATCAAAGAGGAGTGGGAGAAGGAACTATGTCAGCATTGTCATGGTAAGGGAGAAGTCAGCACGGTGTGCAGAGGGTGTAAGGGTAAAGGTATTGTCTTGGATGAAAAAAGAACTCGGTTTCATGGCGCGCCTGTTTATAAGATTTGTGGGCGTTGCAATGGAAACCGGTTTAGTCGTTTACCAACCACACTGGCGCGGCACCATGTCCAGAAACTGGTACCGGATCTGACGGATTATCAGTGGTACAAAGGATATGCAGACGTCATTGATAAACTGGTTACAAAATGCTGGCAGGAAGAAGCATATGCAGAAGCGCAATTAAGAAAGGTGACAAGATGAAAGATTTTCAACGAAGATAGCGACATGATGCTTGCATATTTCAAAAAATATGGATAAGATTCTCCCAACGATGGGCTTTGTATGTCTATCGTTGATAAGTCTCAAGAACCCGCCTCCGAGTGGGTTTTTTATTTGTGATCACTTTATTTTTTGTCTTGCTAAGTTATTGTATGGACAAGAACTAAAATTAAGTGGTGACATTGTGCTCTCTAATAACGAACGTTGGGTTTCCTTTTTTGACTTTGCTTTTACGCCTACACACGCAGCGGCGCCGAGTATTCCCATTGAAGACATACTCAAGAAATTGAAGGTACTGGTGAGCTCAGGGAGTGCTGTAAAGTTATACAATCATAGGTCTAGAGCGCTTAGGATTTCGGAGATGAAATATTCTATTGGGGATAGCCAGGCGACTCTACTTATCCAGCTTTGTGATAAAAATGGTTCTGACCCTGTTTTTGGTGAGTTAACAACAGGCAACCTTAGAGTAGAACCTAAGCTTGCCGGTGAAGGTATCGCAGTTTCTTGTCACATTGTAATATCCACAGATGTTGTCAAAAACACTGCCGATCACCACAAAACTCTCGTTGAATCTGTCCCCGGTATCAGTAAGTCAGTTCTTGAGCCATTTTTAAATGCTATGCTCAGAGAAGCCTTCGCTGGATGTGAGTTTAAAAATCCTGCAACTAAAGGTATGTGCCAGCACCGCCCAAAGCTGGAAATCTATTCTCATGGTTCACAAACGCTGATGGATGCATTAAAAGGTGCAAAGATTCATAACGTTAAACTTGTGAGTACAAGAAGGAAAGGTGGATTGGACCAAACGGCGTACACTGAGCTCTCAGAAAGGTCCGTAAAGTATAAAATCATTAGACAGCCGCCATTGAAAGATAAAGAAAGGTTGTTAGAGATTTTAAGAAAGAAAGGGCAGCAGTCTGGATATACCAAGGTTTCAATTAGTTACTCAAAAGATGGCAAGCAAGCCAGTTTGGATCTTGACCGTAACGAAGATGCTGCCACAAAACTGTTCACTAAAAGTGAGAGGGTAATATTAGGTAACCTCATCAACCAATGTGAAAGCACAGTACATCTGCAGCTTGAAACAAAAATGATAGGGTTGCTCTAACGGGAGTTTCATATGAAACTTTTTTCACCGCTGAGTTATCTCCGCATCAAGCATGAGGAAAAGGACTGGTATGATTACAAAATACCAGCTGCAGTGTCTCTAATCGTCACTATTGTTTATTATTTTCACGCTAGCAAAATTTCTTTAATCGAGACTAACGGACTCCTGCTTCAGGTTAATGGGTTACTTCAAGTCTTGATTGGTTTTTATATCGCAGCACTGGCTGCGGTTTCTACTTTTTCTAGCTCTTCGATCGACGAAGTAATGGCGGGCGTACCTCCGACTCTAGTAGAGAAATTCCGAGGGCAGAAGCTTACTGTAGAACTGACGCGCAGGCGCTTTGTTTGTTACCTTTTTGGTTATCTAGCTCTTGTGAGCTTTATGTTATTTTGCTTAGGGATGATTTCTATTCTGATTGGGAAGCCTTTCCATTTGTGGCTGCTCACATTCTGTTCTCCTGATGCAATCTTGTGGCTTAAAACGGTATTTGTTGGCGTTTATATATTCATCTTAATGAATATCATAACAACAACTTTGCTGGGACTTTACTTCCTTGCAGTTCGGTTCCACCAATCATCGCTGTAAAAAATCTAAATACTTTTAGGCTGCCTTCGGGCGGCCTTTTTTATTTCCCCTCATAACTGAGAGGACCCACATAACCAGAGGGGGATGAATGTCCGAACCTGTATCCAGTGCGACAGTGTTGGCTGGTGGATTAATGGGGGCCAGTGTATTCGGTCTGGCAACCGGAACTGATTATGGTGTGGTATTCGGCGCTTTTGCCGGCGCGGTGTTTTATGTCGCCACGGCAACCAACATCGGACGCATCAGGCTGGTCGCTTATTTTATTACATCATTTATTGTGGGAGTGCTTGGTGCCGGGCTGATAGGTACTAAGCTTGCGGCAATAACGCATTATGAAAAACCACTGGATGCACTTGGCGCAGTGATTATTTCTGCAATGTGTATAAAGTTTCTCACTTTTCTCAACAGTCAGGATCTGAACACCCTGTTCAGTATTCTCTCTCGTATCAGGGGAGGGGGATCAGATGGTAGCAAATGACCCTTCTGCAGCTCTGAATGCCGTAATTTGTGGGGTGATAGTCATCGTTCTGATGTTTTACCGACGCGGTGATGCGACACACCGCCCCCTGATTTCGTTACTGGCCTATGTCATGGTGCTGGTATATGCCAGCGTCCCTTTCCGGTTTGTTTTTGGTTTATATGAATCATCCCACTGGCTGGTGGTGATGGTGAATATCCTTATCTGCGCCGCTGTGCTGTGGGCTCGCGGTAATGTGGCGCGTCTGGTCGATGCACTGAGGCACTGATGAATCAACAACAATTTCAGCAGGCGGCTGGTATTAGCGCCGGGCTTTCTGCACGCTGGTTTCCGCACATTGATGCGGCAATGAAAGAGTTTGGTATTACAGCAGTTAATGATCAGGCCATGTTTATTGCACAAACGGGACATGAATCAGCAGGATTTACTGTTCTGAAGGAAAGCTTCAATTATTCGGTGGAGGCGCTGAAAAAGACGTTTGGTAAACGCCTGACGCCGTATCAGTGTGAAATGCTGGGGCGTATTGATGGTCGCCAGGTTGCCCACCAGCCGCAAATAGCCAATCTGGTTTACGGTGGCCGCATGGGTAACAAAGACGCCGGAGATGGCTGGAAGTATCGCGGGCGTGGTCTGCTTCAAATCACCGGCCGCGAGAACTACGTCAAATGCGGAGCTGCGCTGAAGCTTGATCTGATCAGCACACCAGAGTTGCTGGCACAGGAGAAGCATGCAGCCCGTTCTGCTGCATGGTTTTTCACATTACGTGGTTGCCTGATGTATTCAGGTGATGTTGTCCGTGTAACGCAGATCATCAACGGTGGCCAGAATGGACTGGCTGACAGAAATAGTCGTTATAACAAAGCGCGGGCGGCGTTGCTGGTATGACAGCGGTCTTTGCTTTCGTTAAGGCGCGGTGGAAAACAATCATTGTTTTGCTGATGTTGGCTGGTGCATTTCTTGCCGGGATCATCTGGAGTGATCGGGGCTGGCAAAAGAAGTGGGCTGACCGCAATAGCATGGAATCTTCACAGGAAGCGAACGCGCAGACTGCCGCACGCTGGATTGAACAAGGGCGCATAATTGCCCGTGATGAGGCTGTAAAAGATGCACAAGCACAAGCCGCTAAATCTGCTGCCACTGCTGCTGGCCTGTCTGCCACTGTTAGCCAGCTGCGTACCGAAGCAACAAAGCTTGCCGCCCGCCTGGACGCCGCAAAGCACACCTCAGATCTTGCCGCTGCCGTCAGAAGCAAAACAGCCGGAGCCGACGCCGCAGTGCTCGCCGACATGCTCGGACGCCTTGCAGAAGAAGCTCGATATTATGCTGAGCGATCTGACGAAAGCTACCGCGCAGGAATGACGTGTGAGCGTATTTACAACTCGGTGAGAGAGTCAACCAACAATCCCATAGCCCCGCACTAGCGGTGCTTTTTACCGGAGTTTATATGCCACCAAGAACACCTAAATCCTGTCGTGTTCGCGGCTGTCGCAGTACAACAACAGATCCATCCGGATATTGTGAAAGTCACAGAAGCGAGGGCTGGAAACAATACAAGCCAGGACAATCCCGTCATCAACGCGGTTATGGTTCGAAGTGGGACGTTATCCGTGAGCGCATACTGAAGCGTGATAAAGGTTTATGCCAGTTATGTCTGCGTGCCGGTGTGGTGCGTGAGGCGAAAACTGTTGACCACATTATCCCTAAAGCGCATGGCGGAACTGATGCCGACAGTAATCTGCAGAGTCTGTGCTGGCCCTGCCATAAGGCGAAGACGGCTCGTGAACGGCTGAAATAAAAACCAGTTTCCACAGCCAGAGGGGAGGGGCGGGGTAAATCCCTGTGGCCTGACGTCTTCCGGACTGCCCGCCTCATCAAATTTTTACGCGCCAAAAATAAGAAACTTTTTTCCGGAAGGTTCAACCTATTGAACTGGAGGTTTTGATGGGTGCTGTTGTGAGATCTTCCGGTGGTGGCCGTAAGCGCAATTTGCCTTCGGGCCAGAAAAGCAAGCTGACCAGGATCGCACCGCCGGAAGAGTTAATGAGTGATATCGCGATCCGCATCTGGAAAACGCAGAGCAAAATTTTAATTGAGCGGGGCGTTTTTGATCTTGAAGACGCGCCGCTACTCCTGGCGTACTGCAATGCGTTTCACTTGATGATTGAGGCCGAAAAAGTCATCGCGGAAGAAGGCCTGACCGTATCAAGTGAAATGGGTGGTGAGAAAAAACACCCTGCAGTCAATGTCCGTAATGACTCCGTTTCGCAGCTCGCCCGTCTGGGTTCACTTCTCGGGTTAGACCCGCTCAGCCGCATAAGAATGACCAGCGGAAAAAATGATCCGGACGATGAAGGGAATGAATTTGATGAGTTTGACTGATGGCTACATATCCGAACGTCAATGCGGCGAACCAGTATGCGCGGGACGTCGTGAACGGGAAGATACTGGCCTGCCGGTTAACCATGCTTGCCTGTCAGCGACATCTTGACGACCTGGAACGTGCCAAAGATCCGCATTGGCCTTACCGCTTCGATAAAAATAAAGCAGAACGTTTCCTTCGCTTTTCCCAGAAAATGCCGCACACCTCCGGAGAGTGGGCTCGCCGGAAGTTGCGGATAGAATTTGAACCCTGGCAAAAATTTGCGCTGGGCGTGCCGTTTGGCTGGGTGCGCAAGGATACCGGTTTTCGCCGCTTCACTGAGATTTACATCGAGGTACCGCGTAAAAATGGGAAATCGGCGATTGCGGCCGCCGTCGGTAACTATATGTTCTGTGCAGATGGCGAGTACGCAGCGGAAGTTTACTGTGGTGCCACAACGGAAAAACAAGCCTGGAAAGTTTTTGCGCCTGCACTGGCGATGGTGAAAAAGCTGCCGGCGTTGCGTCAGAAGTTCTGTATCAAACCCTGGGCAAAGAAAATGACTCGCCCGGATGGTTCCCTGTTCGCGCCAATTATCGGTGACCCTGGAGATGGCGACTCACCATCATGTGCGATCATCGATGAGTACCACGAGCATGATACTGACGCGCTATACACCACAATGACTACCGGGATGGGGGCGAGGGAGCAGCCCATCACGCTGATCATCACCACGGCAGGCTTTGATATTGCCTCGCCTTGCTATGAAAAACGTACTCAGGTGGTCGAGATACTGGAGCGCATCCGGGAGGGTGGTGAAAACGAGGCAATTTTCGGGATCATCTATACCCTGGATGATGACGATGACTGGACACAGCCGGAAGCTCTGATCAAAGCCAACCCGAATTACAACATTTCGGTGAAAGAGGGATTCCTCAAGGCTAAACAGTTGCTGGCGATGTCCACGCCAGGCCAGACCAATAAAATACTCACCAAGCATTTCAACAAATGGGTGAGTTCTAAAGCAGCTTACTACAACCTGCAGAAGTGGATGACCGCAGCAGACAAAACGCTCAGACTGTCCGATTTTGCAGGTGAGGAGTGTTATCCCGGCATCGACCTGGCATCAAAACTTGACCTTAATGCAGTGGTGCCGGTATTCCGCCGTGAAATAGACGGCCTGAGTCATTATTACTGCGTTTCGCCTATGTTCTGGGTACCGGAAGACACCGTCTACGCCACGGACCCGGCGTTGAAAACTATTGCAGACCGTTACCAGTCTTTTGTTAATCAGGGCGTGCTGGTTCCGTCAGACGGTGCAGAAGTGGATTACCGCCTTATCCTGGAAGCGATCCTGAAATTACGGGAAACGGTGAAGATAGCCGCGAGTCCGATTGACCCCTACGGTGCAACAGGCTTATCTCATATGCTGCAGGATGAAGGGCTTGAACCTGTCACCATTACCCAGAACTACACGAACATGAGCGACCCGATGCGTGAGATTGAGGCTGCGATCGCTGCTGGCCGATTCCATCATGACGGTAATCCCTTGATGACCTGGTGTATTTCGAACGTGGTTGGCAAGTACCTGCCTGGTAGCGACGATGTTGTTCGCCCGGTGAAAGAAGGCGCAGGCAACAAAATTGATGGTGCAGTTGGCCTGATGATGGGGGTTGGCCGCGCAATGCTGAACGAGCCGAAAGACTTTCTTTCTAACCTCGATCCTGATGAGGAACTGTTATTCCTGTGAAATCACTAATTATCGATGTGGCCGGGGTGGCAGGCTTCGGCGCGCTGGTGGGAGGTATTTACCTCAAATTTGGCGCGGCGGTTGCTCTTATGGCTGGTGGTAGTGGCCTGCTGCTGTGGGCACTGCTGGCGGCCAGGAGAATAAAAACATGCTGATTGATGCCATTTTCAGAAGCAACTCGCTGGAAAACCCAGCTGTTCCGGTCACCGTTGAAGCGGTCGAAAACGACGGGATCTTTAATGGTGATGTGATTGTTAACCCCCGGACGGCAATGAAACTGGCGGCGGTGTATGCATGTATCTACGTTATTTCATCCAACGTTGCGCAGATGCCCCTGCACGTCATGCGGCGAACCGGGAAGAAGGTTGAAACTGCCCGCGACCATCCTGCCTTTTACCTGGTTCATGACGAACCCAATTCCTGGCAGACCAGCTATAAATGGCGCGAGCTCAAACAACGTCACATTCTGGGCTGGGGTAACGGATATACCAGAGTTCTCCGTCACCGCCGAACCGGTGAAGTCACTGGCCTTGAAGCCTGTATGCCGTGGGAAACAACGCTGCTGAACACCGGCGGGCGCTATACCTACGGCGTGTATAACGAAGAAGGTTCCTTTGCCATTAATCCTGATGACATGATCCACGTCAGGGCGTTGGGTAACGATCAGAAAATGGGGCTCAGTCCGGTTCTTCAGCACGCCGAAACCATCGGTATGGGTATGAGCGGGCAGAAATACACGGAAAGTTTTTTCAGCGGTAACGCCAGACCAGCGGGCATAGTTTCAGTAAAAGGAGAATTGAATGACGGCTCCTGGAAAAGGCTGAAAGAGATGTGGCAAAAAGCCACGGCGATGCTGCGCAGCCAGGAAAACAGGACAATGTTGCTCCCGGCTGAACTGGATTATAAAGCGCTGACGGTTTCCCCAGTCGATGCCCAGCTCATCGACATGATGAAGCTCAACCGTTCCATGATTGCCGGGATTTTCAACGTGCCGGCACACATGATCAACGACCTCGAAAAAGCCACCTTCTCCAATATTTCCGAACAGGCGATTCAGTTTGTTCGCTACACAATGATGCCGTGGGTGACGAACTGGGAGCAGGAGCTTAACCGTCGGTTGTTCACCCGCGCCGAACGGGAAGCCGGGTATTACGTGCGCTTTAACCTGGCGGGTTTATTGCGCGGTACTGCCAAAGAGCGCGCGGAGTTCTATCACTTCGCTATCACCGATGGCTGGATGAGCCGCAACGAAGCACGCGCGTTTGAGGATATGAATCCGAAAGACGGCCTTGATGAAATGCTGGTCAGCGTTAACGCCTCCCGGCCAGCCAAATCCACAACCCAGGAGAACACTCAAGATGAGTGAACGTGAAATTCGCTGTTACAGCGGCGAGGTGCGCGCAGAAACGCACGACAGCGAGCCCAGCCGGATCATCGGGTATGGTTCGGTCTTTGACAGCCGTTCTGAACTGATTTTCGGTTCGTTTCGCGAAATCATCCGGCCCGGTGCGTTTGATGAAGTGCTGAATGACGATGTACGGGCGTTATTCAACCATGACCCCAATTTTATCCTGGGTCGCAGAAGTGCGGGCACGCTGGCACTGACGGTTGATGAGCGGGGTCTGCGTTATGACATCACCGCGCCAGAAACTCAGACAATCCGTGATCTGGTGCTGGCACCAATGCAGCGCGGGGATATCAACCAGTCCTCTTTTGCATTTCGCGTCGCCCGCGACGGAGAGGAATGGTACCAGGACGAGGATGGTGTGGTGATTCGTGAGATTACCCGTTTTTCCCGTCTGCTGGATGTCAGCCCTGTGACATATCCGGCGTATCAGGAGGCAGATTCCGCCGTCCGCTCTATGAAAGCCTGGCAGGAGGCGCGCGATAGTAGCGCACTGCAGAAAGCCATTAACCAACGAATGGCGCGTGAGCGCGTCCTGACCCTTCTTAACGCGTAAGGAAAAACCATGAAATTGCATGAACTGAAACAAAAACGTAACACCATCGCGACCGACATGCGCGCGCTGAACGAAAAAATCGGCGATAACCCATGGACGGATGAGCAGCGTACCGAATGGAACAAGGCAAAATCTGAACTGGAAGCACTCGACGAGCGCATCGCCCGCGAAGAAGAGCTGCGCCGCCAGGACCAGACCTACGTTGATGAAAACGAGGAAGAGCAGCGCAATAATCAGGATCCTGATAAAGACCCGCAGCAGGACGAAAAACGCGGCCAGATTTTTGATAAATGGATGCGTCACGGCGCCAGCGAACTGAGTTCCGAAGAGCGCAAAGCCTTACGCGAACTGCGTGCGCAGGGTGTGGCGCCGGATGAAAAGGGCGGCTATACCGTGCCTGATACCTTCCTGGCGAAAGTGGTCGAACAGATGAAATCCTACGGTGGTATTGCCAGCGTGGCGCAGATCCTCGCTACATCCGATGGGCGCACTATGGAATGGGCCACTGCTGATGGTACCGCTGAAGTGGGTGTGCTGCTGGGTGAAAACGAAGAAGCGGGTGAAGAAGATACCGAATTCGGTATGGATAGTCTGGGCGCGCTGAAAATGACATCCAAAATTATCCGCGTATCCAACGAGCTGCTACAGGACAGTGCGATCGACATGGAAGCCTATCTCGCCCGCCGTATTGCGGAGCGCATTGGCCGCGGTGAAGCGCGTTACCTTATTCAGGGGACCGGCACCGGTACGCCAAAACAGCCTAAGGGTCTGAAAGCATCCGTAACCGGCACTACGCAGACGGCCGCTGCCGGAGCTGTTAAATGGCAAGAGATTCTGGCGCTGAAACACAGTATTGATCCGGCGTACCGCCGCGGGCCGAAGTTCCGCCTGGCGTTCAATGACAATACGCTGAAACTCATCAGCGAGATGGAAGACGGTCAGGGCCGTCCACTCTGGCTGCCTGATATCGTCGGCGTGGCGCCAGCATCAGTGCTGAATGTTCCGTACGTTATTGATCAGGAGATCGATGATATTGGCGCGGGCAAAAAATTCATGTTCTGTGGCGACTTCGACCGCTTCATTATCCGCCGTGTTCGCTACATGATCCTGAAGCGCCTGGTGGAGCGTTACGCGGAATTCGACCAGACCGGCTTCCTGGCGTTCCATCGCTTTGACTGTATTCTCGAAGATACCTCTGCGATTAAAGCGCTGGTGGGCAAAGGCTCGGCAAGCAGCTAATCCCTCTCACCTCTGAACAAACCATGCCGCGTTAAGCGGTTTTTTTGTGCCCGCCACCCGGCGGGCGCAGGAGGATCCTATGTTGCTTTCTCCTGAGGAGATCAAGTTGCAGCTCAGGCTGGATGAGGATTACGCCGATGAAGATAAATTTCTTGAGCTGTTGGGGCGGGCGGTTCAGGCCAGGACAGAAAATTTTCTGAACCGGAGACTTTATACGGCGGAGGCGGGGGTGCCAGCCGACGATCCGGAGGGGCTTATTCTCTCGGATGACATCAGGATGGGGATGCTGCTTCTGGTGACGCACTTCTACGAGAATCGTTCTACCGTCACCGAAGTGGAGAAAGTCGAACTGCCGATGAGCTTTAACTGGCTCGTCGGTCCATACAGGTACATCCCGCTATGAAACTCAGGCAGGCGCAGGCCAGCGCCACATACCTTTTGCCCGACCCAGGCGAACTTGACCAGCGCATTGTTATCCGGCGGCGTGTCGATGTTCCGGCTGATGACTTTGGCGTAACGCCGACGTACCCGGAGCAGATCCGGGCGTGGGCCAAAAAAGCGCAACCCGGCGCGGCAGCTTATCAGGGGGCTGTGCAGATAGAAAACAGGGTGACGCACTATTTCACCATCCGTTTTCGCCGCGGTATCACCGCCGATCATGAAGTGCTCCACGACGATATTTCTTATCGGGTTAAACGGGTCCGTGATCTGAACAGTAAACGCCGCTTTCTGTTGCTCGAGTGCGAAGAGCTGGGTACCGATAACGGGAGTGACTATGCCGCAGAAAGCATATTTACACGTTGATTTTGAACAGCCGGAAACGCTTGTTTTTAACCGGGCGCGTATGCGCCGGGCGTTTGTCAGTATCGGGCAGGTACATATGCGCGATGCCCGCCGCCTGGTCATGAAGCGGGGGCGTTCCGGACCCGGCGATAATCCTTCATACAGAACGGGAAAACTGGCACGCTCCATCGGGTATTACGTTCCGCGGGCATCCAGTCGCCGTCCTGGATTGATGGTGAAAATTGCCCCTAATCAGAAGAACGGGGAAGGGAACCGCCCGATCTCAGGCGCATTTTACCCTGCCTTTCTGTTCTACGGTGTTCGCCGTGGGGCGAAGCGTAAGAAAGGCCATCATCGAGGCGCATCAGGCGGCAGCGGCTGGCGTGTGGCACCACGTAACAACTACATGACTGAGGTTCTGGATAAACGCCGCAGCTGGACACGTTATGTGCTCTCCCGCGAATTGCGAAAATCACTCCGTCCTCAGCGAAGGAAGAAAAAATGAAATTAACCCCGATTATTGCGGCACTTCGCAGCCGTTGCCCTCGGTTTGAAAACCGTGTGGGTGGCGCAGCGCAGTTTAAAGCGATACCGGAGGCCGGAAAGCTCAGGCTACCAGCCGCGTATGTTGTGCCAGCCGAAGACGTCACGGGTGAGCAGAAATCGCAGACCGACTACTGGCAGGATTTGACGGAGGGTTTTTCCGTCATCGTGGTACTCAGCAACGAACGGGATGAAAAAGGGCAGTGGGCTTCTTACGACGCAGTTCACGACGTCAGGCAGGAAATCTGGAAGGCGCTGCTGGGGTGGGAACCGGACCCGCAGGCGCATGAAATTCAGTATGCGGGTGGGATGCTTCTCGATCTGAACCGCCACGAACTGTATTACCAGTTCGACTTCACGGTGAAGTATGAAATTACCGAAACAGACACCCGCCAGCAGGATGATCTGGACGGCCTGCCCGACCTTAAAACGCTCAGTATTGATGTTGATTTTATCGAACCCGGTACCGGGCCAGATGGCGACATCGAGCACCACACCGAAATTACATTTCAGGAATAAACCATGTTTGTGAAACCCGCAAAAGGGCGATCGGTTCCCGATCCGGCCCGTGGCGACCTTTTACCTGAAGGAGGTCGAAATGTTGATGAGAATAACTACTGGCTGCGCCGCGAGGCCGCTGGTGATGTCCGGCGCACGAATAAAAAGGTGAAAACAAATGGCGATTAGTTTTAATTCCATCCCGTCAGATACGCGGGTTCCGCTGTTTTATGCCGAGATGGATAACTCGGCGGCAAATACCGCCCGGGACAGCGGGGCATCACTGCTGATTGGTCACGCCAGCAATGATGCGTCAATTGCCGTCAACAGTCTTGTTCTGGTGTCATCGGTTGATTATGCCCGTCAGATTTGCGGTGCAGGAAGCCAGCTGGCCCGTATGGTCGGGGCGTACCGTAAGACCGATCCATTTGGCGAACTGTATGTCATTGCCGTACCTGAATCCACAGGCGCGGCAGCAACCGTCGCTTTGACGGTAACTGGCGAAGCGACGGAAACCGGAACGGTGAATGTCTATACCGGCCGAACCCGCGTTCAGGCTCCCGTGACCAGCGGTGATGACGCTGCGGCGGTGGCTGTGAGCATTAAGGATGCGGTCAATGCAAACCCTGATCTTCCCTTTACGGCAACATCAGAAGCGGGGGTGGTGACACTGACTGCGCGCCACAAGGGGTTATATGGAAATGAAATTCCGGTCACTCTCAATTATTACGGCTTTGGCGGTGGGGAGGTGTTACCGGCAGGTGTGAATATTACGGTTGCCAGCGGCGTGAAGGGGGCTGGTGCGCCAGCTCTTAACGACGCGGTGGCAGCGATGGGAGATGAGCCGTTCGATTATATCGGCCTTCCGTTTAACGACACGGCATCGGTGAACACGATGGCAACTGAAATGAATGATTCCAGCGGTCGCTGGAGTTATATCCGGCAGTTGTATGGTCACGTTTATACGGCGAAGACGGGGACGCTGTCGGAGCTTGTGGCCGCGGGTGACCAGTTTAACCTGCAGCACATCACCCTGGCGGGCTATGAGAAAGACACCCAGACGCCTGCTGATGAACTGGCTGCAAGCCGTACTGCCCGTGCTGCGGTTTTTATCCGTAACGATCCGGCGCGCCCGACCCAGACCGGGGAACTGGTGGACATGCTGCCGGCACCGAAAGGCAAACGCTTCACGACGACTGAACAGCAGACGTTACTTTCCCACGGTGTGGCAACGGCGTATGTGGAAAGCGGCGTGCTGCGTATTCAGCGGGATATCACGACGTACAGGAAAAATGCGTATGGTGTGGCGGATAACAGCTACCTTGACAGCGAGACGCTGCATACCAGTGCTTATGTGTTGCGCCGTCTGAAATCTGTTATTACCAGTAAATACGGGCGCCATAAACTTGCTAATGATGGTACGCGTTTCGGGCCTGGTCAGGCCATTGTCACGCCTGCCGTTATCCGTGGTGAGCTGGGATCAACATATCGCCAGCTGGAGCGGGAAGGCATCGTGGAAAACTTCGATCTGTTCCAGCAACATCTGATAGTTGAGCGTAACGCGAACGATTCGAACCGCCTTGATGTGCTGTTTCCGCCTGATTATGTCAATCAGTTACGTGTGTTTGCGGTGCTTAACCAGTTCCGTCTGCAGTACAGCGAGGAGGCTGCATAATGGGAAAAATTGCGGGAACAACATATTTCAAAATCGACGGACAGCAACTGTCGGTAACCGGAGGGATTGAAGTCCCCATGAACACCAAAGTTCGTGACGACGTGATTGGCCTGGATGGTTCCGTTGACTACAAGGAAACCAGCCGGGCACCGTATACGAAGGTGACCGCCAAAGTGCCGAAAAACTTCCCGGTCGATAAAATTACGTCTTCTGATGTCATGACCATCACATCAGAGCTGGCAAATGGTCAGGTGTATGTTCTCTCAAACGCCTGGCTGCACGGCGAAGCCAACCATAACCCGGAAGAGGGCACCGTGGATCTTGAGTTCCACGGTGAGGAGGGATTTTACCAGTGATAAAAGAACTTGTGCTCAAAAAGCCGATTATGGCGCATAACGAAAAGCTTCATGTGCTGGAGCTGCGCGAACCGTCCTACGATGAAATCGAAGCCATTGGTTTTCCGTTCACCGTTTCCGGTGATGGCGGCGTCCGGCTGGACAGTTCGGTTGCTCTGAAATATATCCCTGTGCTGGCAGGTATTCCACGCTCCTCGGCAGCGCAACTGGCAAAACTGGATATTTTCAAAGCCTGTATGTTGATCCTCAATTTTTTTACCCGGTCGGAGACGGAGGAGGACTCAGAAAGCGGGTCTACAACACCGCATACTTCTGGCGAATAAATCCCCTGGAGCTCCGGCGGGCGGCGATATCCGATTTTCTGGAGCTGGAGTCGGAGGCTGTCCGTATCAATGAGGAAATGAAGCATGGCTGACAGTTTCCAGTTAAAGGCCATTATCACTGCCGTTGACCAGTTATCGGGTCCGCTGAAAGGGATGCAGCGGGAACTGAAGGGATTTCAGAAAGAAATGGCCGGGCTGGCGATCGGTGCTGCCGCTGCCGGGACCGCTGTTCTTGGGGCGCTGGCGCTGCCCGTGAATGCTGCGATCGGCTTTGAGTCAAAAATGGCTGACATCCGGAAGGTGGTTGACGGCCTGGATGATAAAAAAGCATTCGCGCAGATGAGTGACGATATCCTGACGCTGTCCACACAGTTACCGATGGCGGCGGAGGGAATTGCAGAGATCGTGGCGGCGGGCGGGCAGGCAGGCATTGCCCGCGGCGATTTGATGCAGTTTGCGAACGACGCAGTGAAAATGGGTGTGGCGTTTGATACCACTGCCGAAGAGTCCGGTCAGATGATGGCGCAGTGGCGGACAGCGTTCAGACTGACGCAGGAAGACGTGGTTGTCCTGGCCGATAAAATCAACTATCTGGGGAATACCGGCCCGGCAAATGCGAAGAAAATTTCTGATATCGTGACGCGGATTGGTCCGCTGGGCGGTGTTGCCGGAGTGGCATCCGGCGAAATTGCCGCGATGGGCGCCACCATTGCCGGGATGGGGGTTGAATCAGAAATTGCCTCCACCGGCATCAAAAACTTCATGCTGTCGTTAACCGCAGGTAATTCGGCAACCAAAGCCCAGAAACAGGCTATGGCTTTCCTGAAGCTGAATCCCCGGAAACTCGCTGAGGATATGCAAAAGGATTCGCGCGGGGCCATGCTGAAGGTGCTGGACTCGCTCGCGAAAGTGCCAAAAGCTAAACAGGCCGCCGTCATGAATGCGCTGTTTGGCAAGGAGTCACTTAGCGCGATTGCCCCGCTGCTGACCAACCTGGATTTGTTACGCACCAATTTTGATCGTGTGGCTGATGCCCAGGAATATGGCGGCTCGATGCAGAAGGAATACGCATCCCGCGCGTCCACAACAGAAAACCAGCTGGTTCTGCTGAAAAACAGCGTCAATGCGATTTCGGTAACGCTGGGCGATACCTTCCTGCCCGCCATTAACGAAGCTGCAGAAGCGGTCATGCCTTACCTGGAGCAGCTCCGGACATTCGTTCGCGCGAATCCTGAACTGGTTCAGTCTGCGGCGAAGTTCGGCGCGGCGCTGCTGGCTGTTGGCGTATCCATTGGCAGCCTGTCCCGGGCTGTCAAAATCCTGAACAGTGTCATTAACCTCTCTCCGGCGAAAGTCGCCATTGCGGCGCTGGTGGCCGGCGCTATGCTGATCATTGAGAACTGGGACGATGTTGCTCCGGTGATTAAGGCGGTATGGCAGGAGGTCGATAACGTTGCGCAGGAGATGGGCGGATGGGAGACGGTGATTGAAGGGGTTGGTCTGGTTATGGCTGGTTCTTTTACCGTCAGGACCATTGGTGCCCTGCAGCAGTCCGTCCTGCTGGCCGGACGGCTTTCCGGTCTGCTGGGTAAAATTGGCCGGATGGGGGCCATGACGCTGACAATTGGCGTGGCGGTGTCACTCTTTAAAGAGCTTAAGGATCTGGAGCAGGGGGCAAAGGATGCGGGTATGGATGCTGGCGCATTCGCTGTACAGAAGCTGCAAACGAAGGAGCGTGAACGCGGGTATAACGGTTTTATTCCCAGACTCAAAGAGCTTCTTGGTATGGACACCCCGATTCCGCAGGGGCGTTATCAACCTTATGTGCCACTGACCCGGCGTTCTGGCGTACTCGGGCGAGCTGTCCCGCCATCAACGCAGCGCAGCGAACTCAAAGTGACATTTGAGAATGCACCACAAGGTATGCGTGTGACTGATATACCGAAATCCGGTAATCCATTGATGAACATCAGCCATGATGTGGGTTACTCACCCTTTCGTACATCACGATAAACCTGCTCCGGCAGGTTTTCTTATGGGGTAAATATGGCTTTTTTCTCCTCAACTGGCTGGCGCGGGCGCCTGCGTGATGCATCATTTCGTGGAGTGCCTTTCTCCGTTGAAGATGATGAAAGCACGTTTGGACGCCGCGTACAGGTACATGAATATCCGAACAGGGATAAGCCCTGGACGGAGGATTTAGGTCGCGCCACGCGCCGCCTGACGATAAATGCTTATCTTGTCGGTGATGATTACGCAGACAGGCGGGATCGTCTTATTGGTGCCATTGAAACCGCAGGCCCTGGTACGCTGGTCCATCCGCAGTATGGCGAAATGCAGGGCAGCATTGACGGACAGGTCAGGATCACTCACAGCAGTACAGAAGGGCGCATGTGTCGTGTCTCCTTTCAGTTTGTGGAAAGTGGTGAACTTTCTTTTCCTGTGGCGGGAATGGCAACGGCGAAGCGCCTGGAAACATCAGGCGGGCTTTTCGACGATGCGATTGACAGTATGTTTTCCACATTCTCGTTGTCAGGTATTTCTGATTTTATCCAGAACGATGTCATTGCCGATGCTGCCTCCATGCTGGGCGATGTTGCCGATGCTTTCAGGATGGTTGACTCCGGCGTGTCTGCCGCAATGCGGCTGTTACAGGGGGATTTGTCTGTCATTCTGATGCCACCGGGCGCCGCAAGTGATTTCGTTAACGCACTGCAAAAAGCCTGGCGCTCAGGTGACAGGCTCAGAGGCAGTACATCGGATCTGGTCACGATGATAAAAACGATGTCAGGTATCACCCTTGATCCCGGTCTTTCCCCCCGTGGCACCTGGCCCACTGACTCCGGATCTGCTGCGAAACAGAAAATGCAACGCAATATGATCGCAGCCGCCATCAGGACAACAGCCATCAGCACAGCCGTCCACGCCGTGACAACACTGAAGCAGCCGCGTGATGTACCTGATGTCCGGGGCGTAAATCAGCCTGCAGGAACAGGCCGTGACTCAGACATTATCACTGTCATGCACCCGGCGCTGGATGGTGTACAGACAGTCAGTAATGGCAGCTTTCCACCGAATTATGAAGATCTGAAAGCTATCCGGACCGCGCTCAATGCTGCGATTGACCAGGAGCAGTTGCGTATCCGGGATGATGTGCTTTTCCAGCAAATTTCCGTTATGCGGACGGATCTCAATCGCGATATTTCTGCACGACTGGCACAGGTTGAACGTACTGCATTGCGAACGCCTGATGATGTTCTGCCTGCACTGGTACTGGCTGCAGCCTGGTATGACGACGCCGGGCGGGAATCTGATATCCTCACTCGTAATCCCGTTCCCCATCCGGGATTTATCCCGGTTGAGCCGCTGAGGGTTCCGGTACGATGAATAATACGGTTTTTTTACGCGTCAACGGGCGTGACTGGGGAGGATGGACGTCAGTACGGATAAGTGCGGGCATTGACCGTATTGCCCGGGACTTTAATGTCTCGATCACCCGGCAGTGGCCTGGTGGAGAAGACGTACCGCCAGTAAAAAATGGTGACGCTGTAGAGGTACTCATTGGCGATGATTTAGTTATTACCGGCTGGGTTGAGGCGTTACCGCTACGTTATGATGCGCAGACCATTATGACGGGCATTGTCGGGCGCAGCAAAACGGCAGATCTTATCGACTGTTCTGCATCGCCTGCACAGCATAACGGGAAAAATTTATTCCTGATCGCCAGCGCACTTGCCCGGCCATTCGGTGTGGACGTTGTTGATGCAGGCGCGCCGGCAGCCGCCGTTATTGAGGCTCAGCCGGAACATGGTGAAACGGTTGTGGACTGTCTGAACAGGCTGCTTGGACAGGCTCAGGCGCTGGCATATGACGACGAACGGGGACGGCTGGTTCTCGGCAGGCCGGGCAGTATGAAAGCAGCCACGGCACTGGTACTTGGCGAAAATATTCTTTCCTGTGATACCGAGCGTAGTGTTCGTGAGCGTTTCTCCAGTTATCTGGTTACGGGGCAGCGTCCTGGTACGGATGACGATTTCGGCGAGGCAACCATTGCTGCTATCCGGCAGAGTACTGGTGATGCAGGCGTCACGCGGTATCGTCCCCACACCATTCAGCAGTCAGGAACTGCCACAACTGACAGCTGCAAATCCCGCTGTGAATTTGAAGCCCGTCAGCGTGCGGCGAAAACGCTGGAAACCACCTATACCGTACAGGGATGGAGACAGGGGAATGGCGAATTGTGGAAACCGAATCAGGCCGTGGTGGTGTATGACCCGCTGAACGGTTTTGACAATGAAACGCTGGTGATCGCCGAAGTGACGTACAGCCAGGACAATAACGGCACCCTGACCGAAATCCGGGTGGGGCCTGCGGATGCCTATCTTCCTGAACCATTCAGGCCGAAAGCGAAGAAAAAAGTCAGTGAGGAGGCAGATTTCTGATGGCTAACCATCCTCTTCAGAACATGATAACGCGCGCAGTCATTACCGCGATTGATACCGTCAGAAAATGCCAGACTGCCGGACTGAAACTTATTGCCGGTGAAAAAAAAGAGAATGTGGAGCATCTTGAACCTTACGGTTTCACCTCTGCAGCACAGAATGGCGCAGAAGCGGTGGTATTGTTTCCCGGCGGTGGCCGTTCGCACGGAGTGGCTGTGGTTGTGGCTGACCGCCGCTTCAGACTGAAAGGGCTGGCGCGCGGGGAAGTCGCGCTATATGACGATCAGGGGCAGTCGGTCACATTAACCCGAGCCGGAATAGTGGTAAATGGCGGCGGAAAGCCAGTTATTTTCACGAATGCCACTAAAGCACGTTTTGAAATGCCGATCGAATCCACTGGCGATATCAGGGACAACTGTGACAGCAGTGGAAAAACGATGGCTGAAATGCGCACGACCTATAACGGTCATACCCATAAAGAAAATGGCGATGGCGGCGGTATAACCGATAAGCCTGGCCAACCCATGAGCTGACACCATGATCCTTTATGTTAATGGAATCCGTAAGGATGCCACGGCTTCGCTCGACTTTCTGACGCGGGCAGTGGTGATTTCTCTTTTTACCTGGCGCCGGGCGGAGCGGGATGACAGGACCCCACAGCCATACGGCTGGTGGGGGGACACCTGGCCTGCTGTTCAGAATGACCGCATCGGTTCCCGCCTCTACCTGCTGAAACGCCGCAAACTCACCAATAAAACGCCGCAGGATGCCCGCGAATACATGCAGCAGGCGCTGGCGTGGATGACAGACGATGGCGTGGCGGCACGTATTGATGTGACATCTGAACGCACAGGAACAGATACCCTGGCAGCTGGCGTGACGATATATCAGCGGGACGGGGTAATTCACAATATTACATTCGATGATATATGGAGCAAACTTAATGGCTGACAGTCAATTTGCACGTCCTGAACTTCCTCAGTTGATTGCTACCATTCGCAGCGATTTACTGACCCGTTTTCAGCAGGATGTTGTGTTACGTCGCATGGATGCCGAGGTTTACAGCCGGGTACAGGCTGCTGCCGTACATACGCTGTATGGTTATATCGATTATCTGGCCCGGAATATGCTGCCTGATATGTGTGATGAGGACTGGCTTTACCGTCACGCGAGGATTAAGCGTTGTCCCAGGAAAAATGCCGTATCTGCGAAGGGATTTGCACGCTGGGATGGTATTGCCGGAACGCCGGAGATCCCCGCGGGTACACAGATTCAGCGGGATGATCAGGTTACATTCACGACCCTGCAGACGGTGAAAGCTTCCGGCGGCCTGTTACGTGTGCCGGTTATTGGTGATGTGGCGGGAACTGCCGGTAATACTGACGATGGTACGGCGTTACGCCTTGGCACGCCGATTACTGGTATTCCTTCTACAGGTTACGCTGACACTCTGACCGGGGGGGCTGATACAGAGGAGCCTGAAACGTGGCGCGCGCGCGTCATGGAACGCTATTACTGGATACCACAGGGGGGCGCTGATCCTGATTACGTCATCTGGGCAAAGGAAATCGCGGGAATAACCCGTGCGTGGACATTCCGCCATTATAAGGGGACCGGCACCGTTGGTGTGATGGTGGCTACCAGTAACCCGGTGAATCCGGCTCCTGGCGACGATCTCGTTAAGGCTGTACGTGACCATATTTTGCCGCTGGCACCTGTTGCTGGCGGCGGACTCTTTGTTTTCGCTGCCACTGAAAAAAGCATTCCGGTAACAGTCGCACTGGCCAAAGATACCCCGGAAATTCGTACTGCCATTATTGCGGAGCTAAATGCGCTGATGCTGCGTGATGGCGCGCCGTCCGGAAAAATTTATGTTTCGCGAATCAGCGAGGCGATAAGTCTGGCGACCGGGGAAGTGGCACATCAGCTGCGTGTGCCGGCGGCAGATGTGGTGCTGGGAAAAACTGAACTTCCTGTCCTGGGGAATATAACCTGGGCCACCTATACCGGGGAGAACGGATAACTATGGCATTACAGGACGAATATACGCAGTTACTTTATCACCTTCTGCCGGAAGGGCCTGCCTGGGACGGAGAAAACCCACTGATTGAAGGGCTGGCGCCGTCGCTGAACCGGGTACATCAGAGAGCGGATGAACTGATGGCTGAAATTGATCCGGCCAGAACCACAGAACTGATAGACCGTTATGAACAGCTGTATGGCCTGCCTGATTCCTGTGCACCGGAAGGCGTTCAGACATTACAGCAGCGCCAGCAACGGCTGGATGCAAAGGCAAATGTTGCTGGCGGTATAAACGAGAGGTTTTATCGGGAACAGCTTGATGCGTTGGGGTATACCGCTGCCACCATTGAGCAGTTTCAGAATCTCGACAGCACACCCGATCCTGAATGGGGGGAATTCTGGCGTTACTACTGGCGTGTGAATATTCCGGCTGATGCGAACATCAGCTGGCAGACCTGTACAAGCACCTGCGACTCTGCGATCAGAACGTGGGGCGATACTGTTGCTGAATGTGTGATTGATAAGCTTTGTCCGTCACATACGGTTGTTGTTTTTGCTTATCCGGAAGGAAAAGAGAATGCACAGAATTGATACGCCCACCGCGCAAAAAGATAAATTTGGTCAGGGAAAAAACGGATTTACGAATGGTGATCCCGCCACGGGCCGCCGCGCAACGGATCTCAACAGTGATATGTGGGATGCAGTCCAGGAAGAGGTCTGTACTGTTATTGAAGCCGCCGGCATACCACTCAGTAAAGGCGAACATACGCAGCTTCACGCCGCCATTGGCAGGCTGATCGATGAACAGGTTAAAACCCGTCTTGAAAAAAATCAGAATGGCGCGGACATCCCGAATAAGCCGCTGTTTCTCCAGAACGTCGGTTTAGGAGAAACGATAAATCTCGCTGCAGGGGCCCTGCAAAAATCGCAGAACGGCGGCGATATTCCTGACAAAAAACAATTTGCGAGAACCATCGGTGCGGTAACGTCAACCACCATTACACTTGGCGAATCAGGCTGGTTCAAAATCGCCACGGTTGTAATGCCGCAGGCTACATCAACTGCGGTGATTAAACTGTACGGTGGGGCGGGGTTTAACGCTGGTTCACCTGAACAGGCGGCAATCAGCGAACTGGTATTGCGTGCCGGTAATGGTTCACCTGTTGGAATAACCGCCACATTATGGAGGCGTTCACCTTCTGCTGCTAACGAGGTCGCATGGGTTAATACATCAGGCGACACCTACGATATTTATATTAATATCGGCCAGTATGCGTACTGGTTAATTGCGCAATATGATTACACCGGTAATGCAAATGTCACGCTGCACAGTACGCCTGAATATTCATCAGTTCAGCCGGGAAACTCAACCAGCGGTCAGACATATACACTGTTTAATAGTCTGATGAAACCCACAGCCGGTGACGTTGAGGCACTGTCAGTTAATGGAGGGAGGCTAAACGGTCCGTTAGGCATTGGTACTGATAATGCGCTGGGTGGTAATTCGATTGTATTCGGAGATAACGATACAGGGTTTAAGTGGCACAGTGACGGCGTTCTGGGGATTTATGCCAATAATGCTCTGGTTGGTTATATCGACAATTCCGGGCTGCACATGTCAGTAGATGTTCTCACTAATGGTGCCGTACGCGCAGGCAACGCAAAAAAACTGTCACTGACGAGCAATAATAATTCGACAATGACAGCCACGTTTAATTTATGGGGCGACGCAAACAGGCCAACAGTTATTGAACTGGACGACGATCAGGGATGGCATCTGTACAGCCAGCGAAATCCTGACGGTTCGATTGTCTTTACGGTCAATGGAGATATCACCGCTAACACGCTTCGTGCAAGCGGGGCTATCTATCAGAATAACGGCGACATCTTTGGTTCGCTATGGGGAAATGGCTGGTTAAGTACCTGGATTAATAATAATCTCGTCTTAGATGTTCAGTTAGGGGCTGGCACATCAGTGACTACCTGGAACAATGCAGGTTCCTGGCCTAACACTCCCGGATATGTAGTTACCTCCGTCTGGAAAGATTATCAGGGCGAAAATATTGATGGTATTAATTATGCGCCTTTGCAAAAACGAGTCGGGAGTCAGTGGTATACCGTACAAGGGGGAACGGTATAATGAAAAAATATCAGAATATCAAAAATTTCAGACTTATTGACGCGCCCGTAAACAGGGATAAAACTCAGGCTGAAATAAATATAGGTGCATATTTTCTGGAGTCGGACGATGGACAGGACTGGTATGAGTGTCAGTCATTATTTTCTGATGATACTGCAAAAATAATGTACGACCATGAGGGGGTTATCTGGGGTGTTGTTAATAAGCCAGTCCCGCAACGAGGAAACACATATTCTGTATCAATGTTGTGGCCGGTTAATATGTCTGTTGCGGAAATAGACGCTGCTGACTGCCCTGATGATTGCCGTGGTGATGGTACGTGGTTATATCAGGACGGTAAAGTCGTTCAACGGGGTTATTCTCCGGAAGAGCTGCGTAAAAAGGCGGAGGCTGAAAAAGTTCGCCGCCTTGCTGAGGCTGAATCAGCCATCGCACCACTGGCACGGGCAGTAAAACTAAAAATTGCCACAGATGAAGAGATTAAACGGCTGGAAGCATGGGAACTTTATAGCGTAATGGTAAACAGGGTGGATACATCTGCGCCTGACTGGCCGGATATACCACGCTAAATATTCAGGCGGGTTTATTACCCGCCTTTTCTTTTTCCTGTCGTTGTGCCATCAACCTGACAGCCGGTACAAATAGCCCCCTCTTGTGTACTGACCTGAAAATATACTCACCCCTTAACCACGGAGTTAACCGGATGAGTGATTTTCACCACGGCACGCAGGTCATCCAATTTAATGGAGGTACGCGCGTCACATCCACGATATCGATCGAAATCGTCAGTATGGTCTTTACGTTTAGCTTTGCCGAAACATTGCTCGCGTTTATATCATACGTTTGCAAATAATTATCTCCAGGAGCTGATAGTCAAACTGCTGGTATCCATAAAAATATGTTCATGCTTAAATTCTATAAAATTACAAAACTCTTCATAATTGTGCCGTATAGAACAATTAAAATAATGCCTTAGCCCACCGCAGACACCATCAATGTATGGATCGCCATAAGGTTTACTGTGCATAATTTCAAGTCCTTTTTTCAATGCCGGATGCTCGCTGCGGTTAACGGCGATAATCCCATTTTCAAGACTCATGCTATTACCTTTACGACTTACATGAACAGCAATACCATCAGGTAAATACAAAGTGCCGAGTTTACCTGTAAGTAACATATCAGCATCAAGATATATGCAGCCACCTCCAGGTTGCAGGTGATGGCAGCCATGTTTGCCCGCCTCCAGAAAAGCATTACTTCCTTTTAATAAAAAAAGATTTCTGTAAAAATCAAACCGTACATGTCCCAAGCGTTTATCATGGGACGAAACAAGGGACTCCTCTGGATTGTTCTTTAAAACTTCATTTAAACTCTTTTTTATCTCACCAAGCAGATATTCATCTCTGACATTTGCTGGTTGAGCTTCAATTTTAGCGATGTTTTCTAAATAAATATCTGATAGTTTCTTGTCATACATGCTATAATCCAGGTCGGAATTATAGATAACCTTTATATTTTCATATTGTTTTTCCAGCTTTGCTAATGCTTTCTTTTGTCCAGCACTAAAATTCCCATCAACTAAAACCCCGATAGTTCTCTCTTTTTCTATTATAGCGGCGTTGATAATATTATTGAGATAGGGGTTTTGTTGAGTATTAATAATTGGGATCTGGTTTTCCCCAAACCTGCTTGGGTTTCGTTCAAACCATTGAAAAAGTAGGGGGGTGTGCTGATCTAATGGCAATAAAGGATATTCAACTCCGGCAAAGTTTGCACTACCTGATGAAGGCAGAGTAATAGCTGGAGTTGCAGTATGAGAATAGTTCTGGCATGAAAGAAAACCTCTGACTCGAGAAAACATTTTTCAACCCTTACGCTATTAATATAACATACCATGATTTGAGATTGATAAGATATCGGGTTTTAACCTTAGTTTAATAAATTGGAAGATTTTTGTATGAAGATTTGCTACCGTATTTTCGTGCCATAGCTATCTGAAACGATAGTTTTTACTTGGTTAGGGGGGGCTTAAAATTACATTTTTGAATAAGTATTTTATACTTCTTATACGATATGTTTTTATTGTATTGCAGGGGGACAGGGGAGATTGGCGGGCGAAGACCAAAGTTATCACCTGAGCAATGGGCGCAGGCCGGGCGTCTGATTGGGGCAGGAATACCGCGACAGCAGGTAGCGATTATTTATGATGTGGGGCTGTCGATACACGGGCTGTACTGGATTCTGTTATCGGCTAACCGGAAGGGAGTTAAAAAACGGGATTACCGCGCTAACTAGAAGCATGAGGGAAGGTTGTGGCATCCGGTGAACCTGTAGGAATCCCATAACTGATTAAGGACGGAAACCACCAGATGCCACGAAGATGTTATGGCGCATGTTGTTGTGGGAAGTCAATAAAGGAGATGGTTTTGTTTAAAAAATAGTCTATCATGGTGAGAAAATGATTTTGATAAGTAGGCTAACTTTCTGAAAATACTGCGTACAAAAATGCTACTTTTTTCTCATGTTATTAGATAAGTTAATGTTAAATAAGAATATTGGGACGGTCTCGAAAACCGGAGTAGGGGCAACTCTACCGGGGGTTCAAATCCCCCTCTCTCCGCCAATCATTCAACAAAATCAATCACTGACAAAGCGTTTTTGATTTCGTCATACATAAATCCCTGCATTAAAATTTCTTTCACTCGCTCGATTTTTCTTACTACTGATGCTGTTTTTTACCATTTTGCTGCGCGTCGGGCATCCACTTTTATCTTCTTTAGCTCACAAGCTCACACTCTTTTGCAACGCCCATCTCACATCTCTGGTGACAAAACAGTTTAACTGCCGTTGCTTCACACGGATCCTGTGAGGGTATCTTAAAGGAAATGGATTGCTGGAGGATGGAGGAAAAGAGGCGAAATCTTTTTCAGTGGTGATACCGCCTCCCCTGGACATGAGCTTGCCAGAACCTACGCGGTAAATCCTACGACACGAAGCCACGCGCAATTTTATGCGAAAACGCTGGCAATAGCCTTGCTGACTGATGCAAGCTATCTAAAATCGCTTGAAGATGACATCATGCAAAACAATCGCTTAAAAGACTCACCTGTTGATAAGCATGTTGAGGACGAAAAGAAGAAGAAAAATGCAGCCAGGTAAAAAGGGTTACGTTCGCCGTAAGCGAACGTGATCGCATCCAGCCTTAACGAACCTTTTTCCGTTCATCTCCCTTCATGTTTTCAAAAAACTCTTTTATGAATGCCCCTAAAACGTCATACGCGAATTCGCTATTATTGTTCCCGTCGCGACAACATAGGGATTACGTAAAAAAACAGAGGGCAGACATTGAAAATATTGGAAATAACGTAAAACGATTTGTCCGTGTTTTTATCTGTCATTTAAACTTACAGAAATGGCTGTACTGCTATCAGTATAAAGATAATTTCAGTTGTCCAGATAAATCATTTTTAAAACTATTTGTATTAACAGGGTTTATTCATTTATTGAATGGATGCGTGCTCGTTTGGTCTTGTCATCTTGACGCTTTGAGATCACATATTCTATTCTATCATAGACAGCAGGGAAAATTAGCTGCTGGGATCGATTTTTGATTAATAAAATATTGGATGGAACCACGATGCAATTACCAGAACAGGATGAGTTTTCTGATTTTTTTGCTGCCAATGATGATGAACAAGCCTCTTTAAGGCGTAAGTTTTTTTTGGAGAAACATAAAGAACCGTGTCTGTCTGAGTCTGCATTAGAGGACTACCAGGCGCTGTTTATGAGTATCTACGGAATTAATATTGACTGGAAAGAGGGGACTTTTAGCCTGCTTGAGGCACTTTCAGATAATCAGGGAGGGAAGCCTGTCACGGTCAAATTCGATTATGACAGTGAGATTGAAACAGCAACGATAAATTTGGTTGATACGCAGTATGTGTTTCATCACTACCCAATGGGAAGTGATGGTTTTGATACAGAACTGGTGCGCATTGAGCATATATTGGCTAATAGTGGATATAGTTTGCGGGTATATCAGAACAGCACTTTTAGTGATACATTATCGTTCTTACTCATTCCGTCAGATGAGTGGAAACGTGTTGAACAGCATTATAGCCCAGAGCATATTTCTGAATACTTCGTTCCGTATGGAAAACAACTTGTTATTCCTGAGGTTACTGCTCCAGTCGTAAATTATGTGCCATCCGTTAAACAAGAAGCATCAAATGTCCCAGCGTTGTTTAATGCTCGGGGTATCCGTATTTGTTTTTTAAGTATAATGCTAATCGCATTTGCAATTTATATTTTGTGGAATATCCTGACAAAAATAGAGCCTTTATCATCAGGCCAACCTGCTGGCTGTGAAAATCTACAAAATTTATACTCAAAATTACGTCCAGAAGTAGCCGAGCCATTAAAAGAAAAAATGCGTAAGAGTTTGGGCTGTAAATGATAAATTTTCAGTTTAAGAAAATTACACTAATGGATAGGTAATAGAGATATATTTAATTTGATATTGTGTTATCGAAATAGTAATAACAGATAATATACATATTCTTGCGGTGGTGGATGAAGCGATTTTTATCTTATTTGTACAGGAAAAGAAAAAGAGTTTTTTTAGCTACCATGACTCTTATTAATTTTGTTGCCGCAAATCAAAAATACTACGCTTTTAATTTGTTGTCAGTGTTTATCATGGTGTTCTGGTCAGTTCAGTACCTCCTGCAGCATCTCCTTGTCCAGACTCAGGTCAGCCACCAGCTTCTTCAGCCGCTGATTCTCATCCTCCAGTTGCCGCAGACGCCGCAGTTCCGTCACGCCCGGCCCGGCAAATTTTTTCTTCCAGTTAATGGGATGGACTACTTCCTCCCGCTGCGGTTAACTGTACGAAATGTGCTCACCAGGAGAATCACCATGAATATCGTATTTCTGGGTATTGATCTGGCTAAAAATGTTTTTCAGCTCTGCGGGTTAAACCAGGCCGGCAAACCGGTTTATACGAAACGCACTGGCCGAAAAGAATTGCTCCAGACGCTGGCAAATATTCCTGCATGTCTGATTGGGATCGAAGCGTCCACCGGGGCATTTTACTGGCAGCGTGAGTTTGAGAAACTGGGGCACAAAGTAAAGGTCATCAGTCCTCAGTATGTAAGACCCTTTGTCCGCGGGCAAAAAAATGATGGTAATGATGCACAGGCCATCGCAGTGGCTCTGATGCAACCGACAATGCAGTTCGTGCCGCCAAAAAGCCCCGAACAGCAGGATATCCAGGCTTTACACCGGGCAAGGCAGCGTATTGTCAATCACCGCACTGCTACAGTCTGTCAAATAAGGGGGCTGTTACTTGACCGGGGGATCCCCATTGGCAGTGCTGTCTCCAGAGCTCGCCGTGCTATTCCTCTTATCCTTGAAGATGCAGAAAACGGTCTAAGTTCCCGTATGCGCAGAACAATTGCCGAACTCTATGATCTCTTTAACGATCTCGGGCGTCGGATCCATTTTTTTGATAAGGAAATTGAAACAGTATTCAGGCAATCAGAAGCCTGTCCGCGTATCGCCAAAGTTAAAGGCATTGGTCCTAAAACGGCCACGGCCGTTGTTGCTGCTATTGGCAAAGGAACTGAATTTAAGAATGGTCGCCACTTTGCTGCATGGCTGGGTCTGGTTCCACGCCAGCATTCGAGTGGCGACAGGCAGGTGCTGATGAATATGACGAAAAAAGGCGACAAGCATCTGCGGACACTTTTTATTCATGGTGCCCGCGCTGTCGTCAGGGTTGCCACGAATAACAATGATGGTCATATGAATCAGTGGGTTAACCAGTTAAAGGAACGGCGCGGATTTAATAAAACGACCGTGGCGGTCGCTAACAAAAACGCGAGAATAATCTGGTCGATGCTGAGAAATGATACCGGGTATCAGGTAGTGTGTAATTAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP041973|919218:966714|948767_949280_+|WP_001135695.1|DBSCAN-SWA MPQKAYLHVDFEQPETLVFNRARMRRAFVSIGQVHMRDARRLVMKRGRSGPGDNPSYRTGKLARSIGYYVPRASSRRPGLMVKIAPNQKNGEGNRPISGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVAPRNNYMTEVLDKRRSWTRYVLSRELRKSLRPQRRKKK >NZ_CP041973|919218:966714|927024_927255_-|WP_000764235.1|DBSCAN-SWA MKLEMYTLDGSVIVDSNLVTQFYPDYKSGGELTVIETISATGETFTVRVKHSFLQVTSALATAWSVDEKKAEGAAQ >NZ_CP041973|919218:966714|924731_925595_-|WP_000208076.1|DBSCAN-SWA MTTITKEWLQQTIAEFENTRDDIPFGLSDDDAKVLIVLKRALASLEAEPAGYHVIKECGKVGCSVATLEEAEKTRDFWNKKWTIRPYFYTAQPVQETGVYNDVLNIISLLENNEWAEHCTSTVLGSLLESEITRLVGKEQSAPVVTFYRDGVEAAAKWIDQQREAYDSEHGWSDPDTGAFEFGNDAQRGYSSTLEELAEGIRALHPNAGNSPVIPDGWISCSERMPEDEQEVIVHNKLGYRYVSYFDEHSGLFFDMRGGNQMNCIEHIFVTHWMPVPAAPKPEINNE >NZ_CP041973|919218:966714|964411_965230_+|WP_001176778.1|DBSCAN-SWA MQLPEQDEFSDFFAANDDEQASLRRKFFLEKHKEPCLSESALEDYQALFMSIYGINIDWKEGTFSLLEALSDNQGGKPVTVKFDYDSEIETATINLVDTQYVFHHYPMGSDGFDTELVRIEHILANSGYSLRVYQNSTFSDTLSFLLIPSDEWKRVEQHYSPEHISEYFVPYGKQLVIPEVTAPVVNYVPSVKQEASNVPALFNARGIRICFLSIMLIAFAIYILWNILTKIEPLSSGQPAGCENLQNLYSKLRPEVAEPLKEKMRKSLGCK >NZ_CP041973|919218:966714|959211_960774_+|WP_000554738.1|DBSCAN-SWA MHRIDTPTAQKDKFGQGKNGFTNGDPATGRRATDLNSDMWDAVQEEVCTVIEAAGIPLSKGEHTQLHAAIGRLIDEQVKTRLEKNQNGADIPNKPLFLQNVGLGETINLAAGALQKSQNGGDIPDKKQFARTIGAVTSTTITLGESGWFKIATVVMPQATSTAVIKLYGGAGFNAGSPEQAAISELVLRAGNGSPVGITATLWRRSPSAANEVAWVNTSGDTYDIYINIGQYAYWLIAQYDYTGNANVTLHSTPEYSSVQPGNSTSGQTYTLFNSLMKPTAGDVEALSVNGGRLNGPLGIGTDNALGGNSIVFGDNDTGFKWHSDGVLGIYANNALVGYIDNSGLHMSVDVLTNGAVRAGNAKKLSLTSNNNSTMTATFNLWGDANRPTVIELDDDQGWHLYSQRNPDGSIVFTVNGDITANTLRASGAIYQNNGDIFGSLWGNGWLSTWINNNLVLDVQLGAGTSVTTWNNAGSWPNTPGYVVTSVWKDYQGENIDGINYAPLQKRVGSQWYTVQGGTV >NZ_CP041973|919218:966714|925591_925885_-|WP_000267991.1|DBSCAN-SWA MWRGLNRGGSQMILTAYEYDPETEKSQSVYLLRHHSKVKKTTLEQKLTVKNDAFGRFKPFVELEDFPEGLSEREAMLKLADWLHRLSVAIEDNWSTP >NZ_CP041973|919218:966714|935873_936734_+|WP_001061459.1|DBSCAN-SWA MNNLMVIDGIEVRRDVHGRYCLNDLHRAAGGEQKYRPKYWLDNKQTRELIEQLFTEGGIPPSEQNQSVSFFQGGSDTRSLARAPVNTVRGGAEQGTYVCKELVFAYAMWISPSFHLKVIRTFDRITSAPQISSGMAADKMQAGVILLGFMRKELNLSNSSVLGACQKLQEAVGLPNLAPQYAIDAPAGAPDGSSRPTLALSALLKQHGIRMTANQAYQQLAKLGVVEHRERYSRSAINGIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESKFPELLKLLDTVH >NZ_CP041973|919218:966714|929773_929959_-|WP_001067433.1|DBSCAN-SWA MNNYYTCSFCGVSELDAKKLIAKGSKDEPAICSECVVSCVNILINYAAVIKPVKLNVTKGE >NZ_CP041973|919218:966714|927325_927865_-|WP_000008351.1|DBSCAN-SWA MSFIQTLSGKQFDYLSATIDDIDIEDIAVALSNICRFSGHLPEFYSVAQHSVLCSQLVSPEFAFEALMHDAAEAYCQDIPAPLKALLPDYREIEKRTDQLIRFKFGLPLEEASVVKYADLTMLATERRDLDIDDSIPWVILEGIPPTDLFEIYPLRPGQAFGLFMARFNELMELRQCAA >NZ_CP041973|919218:966714|922907_923897_-|WP_000532847.1|integrase|DBSCAN-SWA MGRKRAPGNEWMPKGVFFRPSGYYWKPGGSTENIAPADATKAEVWVAYEKKVEGRKNRITFTQLWRKFLASADYADLAPRTQKDYLAHEKYILAVFGDAEAKAIKPEHIRRYMDARGQKSRVQANHEHSSMSRVFRWSYQRGYVPGNPCVGVDKFPKPQRDRYITDEEYRAIYNNATPAVRAAMEIAYLCAARVSDVLKMNWNQILEKGIFIQQGKTGVKQIKSWTDRLRDAVEICREWGEEGPVIRTMYGERYSYKGFNEAWRKARKAAGDDLGRPLDCTFHDLKAKGISDYEGTAKDKQKYSGHKTESQVLVYDRKVKMSPTLDRKR >NZ_CP041973|919218:966714|935467_935857_+|WP_000779149.1|DBSCAN-SWA MKLTLPFPPSVNTYWRAPNKGPLKGRHMVSASGRKYQSEACAAVIEQLRRLPKPSTAPAAVEITLYPPDKRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVFPKGKVEITITKFETGAGAAA >NZ_CP041973|919218:966714|957149_957563_+|WP_000605050.1|DBSCAN-SWA MILYVNGIRKDATASLDFLTRAVVISLFTWRRAERDDRTPQPYGWWGDTWPAVQNDRIGSRLYLLKRRKLTNKTPQDAREYMQQALAWMTDDGVAARIDVTSERTGTDTLAAGVTIYQRDGVIHNITFDDIWSKLNG >NZ_CP041973|919218:966714|937744_938497_+|WP_001047141.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATTSESLTITDVMAAQGMVQSKAPLGFALFLAKVGIQNPDFAIEGLIHYAVALDNPTLNKLSEETRLQIVPYLVNFAFADYSRSAASKARCEHCAGTGFHHVLREVVKHSRNGEPVIKEEWEKELCQHCHGKGEVSTVCRGCKGKGIVLDEKRTRFHGAPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >NZ_CP041973|919218:966714|934577_935459_+|WP_000200166.1|DBSCAN-SWA MTSESVCIESSDVTISVDESASRTWRRPFLKWAGGKYSMLPDLYQVIPAGMRLIEPFVGGGSVFLNSDKHACFLLADVNTDLINLYQMLAVVPGAVIRHARVMFDRLNDAESYMALREEFNAQVMDAPERAAAFLFLNRHCFNGLIRYNRNNQFNVGWGKYPSPYFPEEEIRAFTEMAHNCVFMAAGFRRTLALAGEGDVVYCDPPYEPMPGKDGFTHYAAGGFTWDDHIALAECCVAAHQRGARVVIGNSTSPRVIDLYSQHGFEIRYISARRSISSKGSTREKAKDLVAIL >NZ_CP041973|919218:966714|951490_951847_+|WP_000515952.1|tail|DBSCAN-SWA MGKIAGTTYFKIDGQQLSVTGGIEVPMNTKVRDDVIGLDGSVDYKETSRAPYTKVTAKVPKNFPVDKITSSDVMTITSELANGQVYVLSNAWLHGEANHNPEEGTVDLEFHGEEGFYQ >NZ_CP041973|919218:966714|961627_962635_-|WP_000492926.1|DBSCAN-SWA MFSRVRGFLSCQNYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR >NZ_CP041973|919218:966714|940669_940951_+|WP_000226304.1|holin|DBSCAN-SWA MVANDPSAALNAVICGVIVIVLMFYRRGDATHRPLISLLAYVMVLVYASVPFRFVFGLYESSHWLVVMVNILICAAVLWARGNVARLVDALRH >NZ_CP041973|919218:966714|929446_929698_+|WP_000078504.1|DBSCAN-SWA MSQKDDIPVFPVTGWQAGPLPGYDALVVKFQFLSSPMQPIESAQETQFLVLTPEMAESLASDLQRHIQDLRNSDVHSPQEGKH >NZ_CP041973|919218:966714|949840_950005_+|WP_000497739.1|DBSCAN-SWA MFVKPAKGRSVPDPARGDLLPEGGRNVDENNYWLRREAAGDVRRTNKKVKTNGD >NZ_CP041973|919218:966714|943061_944792_+|WP_000257219.1|terminase|DBSCAN-SWA MATYPNVNAANQYARDVVNGKILACRLTMLACQRHLDDLERAKDPHWPYRFDKNKAERFLRFSQKMPHTSGEWARRKLRIEFEPWQKFALGVPFGWVRKDTGFRRFTEIYIEVPRKNGKSAIAAAVGNYMFCADGEYAAEVYCGATTEKQAWKVFAPALAMVKKLPALRQKFCIKPWAKKMTRPDGSLFAPIIGDPGDGDSPSCAIIDEYHEHDTDALYTTMTTGMGAREQPITLIITTAGFDIASPCYEKRTQVVEILERIREGGENEAIFGIIYTLDDDDDWTQPEALIKANPNYNISVKEGFLKAKQLLAMSTPGQTNKILTKHFNKWVSSKAAYYNLQKWMTAADKTLRLSDFAGEECYPGIDLASKLDLNAVVPVFRREIDGLSHYYCVSPMFWVPEDTVYATDPALKTIADRYQSFVNQGVLVPSDGAEVDYRLILEAILKLRETVKIAASPIDPYGATGLSHMLQDEGLEPVTITQNYTNMSDPMREIEAAIAAGRFHHDGNPLMTWCISNVVGKYLPGSDDVVRPVKEGAGNKIDGAVGLMMGVGRAMLNEPKDFLSNLDPDEELLFL >NZ_CP041973|919218:966714|949276_949837_+|WP_000779215.1|DBSCAN-SWA MKLTPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLRLPAAYVVPAEDVTGEQKSQTDYWQDLTEGFSVIVVLSNERDEKGQWASYDAVHDVRQEIWKALLGWEPDPQAHEIQYAGGMLLDLNRHELYYQFDFTVKYEITETDTRQQDDLDGLPDLKTLSIDVDFIEPGTGPDGDIEHHTEITFQE >NZ_CP041973|919218:966714|952254_954183_+|WP_000785387.1|tail|DBSCAN-SWA MADSFQLKAIITAVDQLSGPLKGMQRELKGFQKEMAGLAIGAAAAGTAVLGALALPVNAAIGFESKMADIRKVVDGLDDKKAFAQMSDDILTLSTQLPMAAEGIAEIVAAGGQAGIARGDLMQFANDAVKMGVAFDTTAEESGQMMAQWRTAFRLTQEDVVVLADKINYLGNTGPANAKKISDIVTRIGPLGGVAGVASGEIAAMGATIAGMGVESEIASTGIKNFMLSLTAGNSATKAQKQAMAFLKLNPRKLAEDMQKDSRGAMLKVLDSLAKVPKAKQAAVMNALFGKESLSAIAPLLTNLDLLRTNFDRVADAQEYGGSMQKEYASRASTTENQLVLLKNSVNAISVTLGDTFLPAINEAAEAVMPYLEQLRTFVRANPELVQSAAKFGAALLAVGVSIGSLSRAVKILNSVINLSPAKVAIAALVAGAMLIIENWDDVAPVIKAVWQEVDNVAQEMGGWETVIEGVGLVMAGSFTVRTIGALQQSVLLAGRLSGLLGKIGRMGAMTLTIGVAVSLFKELKDLEQGAKDAGMDAGAFAVQKLQTKERERGYNGFIPRLKELLGMDTPIPQGRYQPYVPLTRRSGVLGRAVPPSTQRSELKVTFENAPQGMRVTDIPKSGNPLMNISHDVGYSPFRTSR >NZ_CP041973|919218:966714|958637_959225_+|WP_001207832.1|DBSCAN-SWA MALQDEYTQLLYHLLPEGPAWDGENPLIEGLAPSLNRVHQRADELMAEIDPARTTELIDRYEQLYGLPDSCAPEGVQTLQQRQQRLDAKANVAGGINERFYREQLDALGYTAATIEQFQNLDSTPDPEWGEFWRYYWRVNIPADANISWQTCTSTCDSAIRTWGDTVAECVIDKLCPSHTVVVFAYPEGKENAQN >NZ_CP041973|919218:966714|956611_957145_+|WP_001273650.1|plate|DBSCAN-SWA MANHPLQNMITRAVITAIDTVRKCQTAGLKLIAGEKKENVEHLEPYGFTSAAQNGAEAVVLFPGGGRSHGVAVVVADRRFRLKGLARGEVALYDDQGQSVTLTRAGIVVNGGGKPVIFTNATKARFEMPIESTGDIRDNCDSSGKTMAEMRTTYNGHTHKENGDGGGITDKPGQPMS >NZ_CP041973|919218:966714|945063_946158_+|WP_077905357.1|portal|DBSCAN-SWA MKLAAVYACIYVISSNVAQMPLHVMRRTGKKVETARDHPAFYLVHDEPNSWQTSYKWRELKQRHILGWGNGYTRVLRHRRTGEVTGLEACMPWETTLLNTGGRYTYGVYNEEGSFAINPDDMIHVRALGNDQKMGLSPVLQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKGELNDGSWKRLKEMWQKATAMLRSQENRTMLLPAELDYKALTVSPVDAQLIDMMKLNRSMIAGIFNVPAHMINDLEKATFSNISEQAIQFVRYTMMPWVTNWEQELNRRLFTRAEREAGYYVRFNLAGLLRGTAKERAEFYHFAITDGWMSRNEARAFEDMNPKDGLDEMLVSVNASRPAKSTTQENTQDE >NZ_CP041973|919218:966714|936741_937731_+|WP_012543375.1|DBSCAN-SWA MRALLTPEIAPRMGVVLFRPGSELMPLFMQGRVLLEPEPEQYSSFACGAVPAVSQPLADDPAVRDVFRNESVIYRAGGLDSLESWLLRGNGCQWPHSVWHSEQMTTMRHAPGAIRLCWHCDNLLREQFTERLESIAVENTTKWVLSVVCRDLGFDDMHAVTLPELCWWMVRNDLAEVLPESAARKALRMPKAIVQSATRESEIVPSVPATSIVQDKAKKVLALRVDPESPESFMLRPKRRRWINERYTRWVKSQPCACCGKQADDPHHLTGHGQGGMGTKAHDLFVLPLCRTHHNELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP041973|919218:966714|946150_946753_+|WP_000003793.1|head,protease|DBSCAN-SWA MSEREIRCYSGEVRAETHDSEPSRIIGYGSVFDSRSELIFGSFREIIRPGAFDEVLNDDVRALFNHDPNFILGRRSAGTLALTVDERGLRYDITAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDVSPVTYPAYQEADSAVRSMKAWQEARDSSALQKAINQRMARERVLTLLNA >NZ_CP041973|919218:966714|932915_933140_+|WP_000620702.1|DBSCAN-SWA MIRNIFKRFTSQRFHCPRPGQWYSTPEGYVLRISLVDRECQKVVCEPLGRNYRVNMPLIAFRSGKNMKHLGGAA >NZ_CP041973|919218:966714|948071_948395_+|WP_000927251.1|head,tail|DBSCAN-SWA MLLSPEEIKLQLRLDEDYADEDKFLELLGRAVQARTENFLNRRLYTAEAGVPADDPEGLILSDDIRMGMLLLVTHFYENRSTVTEVEKVELPMSFNWLVGPYRYIPL >NZ_CP041973|919218:966714|949994_951491_+|WP_001007993.1|tail|DBSCAN-SWA MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNTMATEMNDSSGRWSYIRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIRGELGSTYRQLEREGIVENFDLFQQHLIVERNANDSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA >NZ_CP041973|919218:966714|921818_922616_-|WP_000598920.1|DBSCAN-SWA MIKWPWKAQEITQNEDWPWDDALAIPLLVNLTAQEQARLIALAERFLQQKRLVALQGFELDSLKSARIALIFCLPILELGIEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHSQRVVQSGQSWQQGPIILNWLDIQDSFDASGFNLIIHEVAHKLDMRNGDRASGIPFIPLRDVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCQFYRQDPSQRLRVSAAEGDYGEESEH >NZ_CP041973|919218:966714|940293_940683_+|WP_001294874.1|DBSCAN-SWA MSEPVSSATVLAGGLMGASVFGLATGTDYGVVFGAFAGAVFYVATATNIGRIRLVAYFITSFIVGVLGAGLIGTKLAAITHYEKPLDALGAVIISAMCIKFLTFLNSQDLNTLFSILSRIRGGGSDGSK >NZ_CP041973|919218:966714|946762_947992_+|WP_000766103.1|capsid|DBSCAN-SWA MKLHELKQKRNTIATDMRALNEKIGDNPWTDEQRTEWNKAKSELEALDERIAREEELRRQDQTYVDENEEEQRNNQDPDKDPQQDEKRGQIFDKWMRHGASELSSEERKALRELRAQGVAPDEKGGYTVPDTFLAKVVEQMKSYGGIASVAQILATSDGRTMEWATADGTAEVGVLLGENEEAGEEDTEFGMDSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGTGTPKQPKGLKASVTGTTQTAAAGAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEFDQTGFLAFHRFDCILEDTSAIKALVGKGSASS >NZ_CP041973|919218:966714|927959_928877_-|WP_000551790.1|DBSCAN-SWA MHNPFFKNMLIYRFSRDFNIDIDSLDKKLELFRFSPCGSQDMAKSGWFSPLVQYSDVLYHAVNNQLLLVIRREEKIIPKQTIADEINKKVSTLEREQGRRLKKTEKDSIRDEVLHSLLPRAFTKNSLVRIWINTAAGFIVVDTSSIKRAEDSLALLRKTLGSLPVVPLTMENPIELTLTEWVRSEAAPSGFSIGDEAVLKAILEDGGTGRFKKQDLACDEILTHIEAGKVVTQISMEWQQRISFTLSCDGILKRIKFADQLISQNDDIDSEDVVQRFDADITLMTGELSNLISDLTAALGGEAKR >NZ_CP041973|919218:966714|930957_931182_+|WP_001191666.1|DBSCAN-SWA MQSPLRKLRKSHGYTLQHVAKGVQVDPATLSRVERCEQAPSTELAERLAQFYAGEISEMQILYPNRYQLSDSAI >NZ_CP041973|919218:966714|930164_930860_-|WP_001020644.1|DBSCAN-SWA MNIGNRVRQLRRAKNMKIAELAEAIGVDAANISRLETGKQKQFTEQTLSRLADCLSVDIAELFTSDPKGNTVCKHSDMRKDSANVKDLFRIEILDVSASAGNGLIQGGDVIDVIHAIEYNKDKALAMFGGRPAAELKVINVRGDSMAPTIEPGDLIFVDISINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNTNYREWSITKDNECRFGVFGKVLISQTQSLKRHN >NZ_CP041973|919218:966714|942624_943062_+|WP_000501481.1|terminase|DBSCAN-SWA MGAVVRSSGGGRKRNLPSGQKSKLTRIAPPEELMSDIAIRIWKTQSKILIERGVFDLEDAPLLLAYCNAFHLMIEAEKVIAEEGLTVSSEMGGEKKHPAVNVRNDSVSQLARLGSLLGLDPLSRIRMTSGKNDPDDEGNEFDEFD >NZ_CP041973|919218:966714|938546_939620_+|WP_000357930.1|DBSCAN-SWA MDKILPTMGFVCLSLISLKNPPPSGFFICDHFIFCLAKLLYGQELKLSGDIVLSNNERWVSFFDFAFTPTHAAAPSIPIEDILKKLKVLVSSGSAVKLYNHRSRALRISEMKYSIGDSQATLLIQLCDKNGSDPVFGELTTGNLRVEPKLAGEGIAVSCHIVISTDVVKNTADHHKTLVESVPGISKSVLEPFLNAMLREAFAGCEFKNPATKGMCQHRPKLEIYSHGSQTLMDALKGAKIHNVKLVSTRRKGGLDQTAYTELSERSVKYKIIRQPPLKDKERLLEILRKKGQQSGYTKVSISYSKDGKQASLDLDRNEDAATKLFTKSERVILGNLINQCESTVHLQLETKMIGLL >NZ_CP041973|919218:966714|960773_961343_+|WP_000760554.1|tail|DBSCAN-SWA MKKYQNIKNFRLIDAPVNRDKTQAEINIGAYFLESDDGQDWYECQSLFSDDTAKIMYDHEGVIWGVVNKPVPQRGNTYSVSMLWPVNMSVAEIDAADCPDDCRGDGTWLYQDGKVVQRGYSPEELRKKAEAEKVRRLAEAESAIAPLARAVKLKIATDEEIKRLEAWELYSVMVNRVDTSAPDWPDIPR >NZ_CP041973|919218:966714|931210_931765_+|WP_000509728.1|DBSCAN-SWA MGHEPEWKVEKQPRWLVAAIKKTISSLHGGYEEAAEWLDVTKDALFNRLRTGGDQIFPIGWALVLQRAGGTYHLAHSVARASGGVFVPLADMEEVDNADINHRLLEAIEQITSYSQQIRVAIEDGVIEPHEKAVIDEELYQAIAKLQQHSTLVYRVFCVPEKGDARECAAPGAVASNFMEKTNA >NZ_CP041973|919218:966714|934107_934581_+|WP_000054227.1|DBSCAN-SWA MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRLDGDVFRYYAGKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLEAFSESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVGNVE >NZ_CP041973|919218:966714|955553_956612_+|WP_001066630.1|plate|DBSCAN-SWA MNNTVFLRVNGRDWGGWTSVRISAGIDRIARDFNVSITRQWPGGEDVPPVKNGDAVEVLIGDDLVITGWVEALPLRYDAQTIMTGIVGRSKTADLIDCSASPAQHNGKNLFLIASALARPFGVDVVDAGAPAAAVIEAQPEHGETVVDCLNRLLGQAQALAYDDERGRLVLGRPGSMKAATALVLGENILSCDTERSVRERFSSYLVTGQRPGTDDDFGEATIAAIRQSTGDAGVTRYRPHTIQQSGTATTDSCKSRCEFEARQRAAKTLETTYTVQGWRQGNGELWKPNQAVVVYDPLNGFDNETLVIAEVTYSQDNNGTLTEIRVGPADAYLPEPFRPKAKKKVSEEADF >NZ_CP041973|919218:966714|954216_955557_+|WP_000863817.1|DBSCAN-SWA MAFFSSTGWRGRLRDASFRGVPFSVEDDESTFGRRVQVHEYPNRDKPWTEDLGRATRRLTINAYLVGDDYADRRDRLIGAIETAGPGTLVHPQYGEMQGSIDGQVRITHSSTEGRMCRVSFQFVESGELSFPVAGMATAKRLETSGGLFDDAIDSMFSTFSLSGISDFIQNDVIADAASMLGDVADAFRMVDSGVSAAMRLLQGDLSVILMPPGAASDFVNALQKAWRSGDRLRGSTSDLVTMIKTMSGITLDPGLSPRGTWPTDSGSAAKQKMQRNMIAAAIRTTAISTAVHAVTTLKQPRDVPDVRGVNQPAGTGRDSDIITVMHPALDGVQTVSNGSFPPNYEDLKAIRTALNAAIDQEQLRIRDDVLFQQISVMRTDLNRDISARLAQVERTALRTPDDVLPALVLAAAWYDDAGRESDILTRNPVPHPGFIPVEPLRVPVR >NZ_CP041973|919218:966714|923898_924141_-|WP_000414876.1|DBSCAN-SWA MEKQLMSDRFLTEEELEDATGASQKSLQKEVLTLNGIYFIERRDGSIRTTWYHINHPVSRLLPPAGYQPVPGMNFDAIES >NZ_CP041973|919218:966714|921156_921447_-|WP_001675175.1|DBSCAN-SWA MPDQRKRPSSSLTGLASIDSKYWQTGYRFNGKQKVFSIGVYLAVSLTDARQRRDEVKRLLAQGIDPNAKNRLMKKSFRKSAIKPARPVSSPKADAP >NZ_CP041973|919218:966714|957555_958635_+|WP_001699732.1|plate|DBSCAN-SWA MADSQFARPELPQLIATIRSDLLTRFQQDVVLRRMDAEVYSRVQAAAVHTLYGYIDYLARNMLPDMCDEDWLYRHARIKRCPRKNAVSAKGFARWDGIAGTPEIPAGTQIQRDDQVTFTTLQTVKASGGLLRVPVIGDVAGTAGNTDDGTALRLGTPITGIPSTGYADTLTGGADTEEPETWRARVMERYYWIPQGGADPDYVIWAKEIAGITRAWTFRHYKGTGTVGVMVATSNPVNPAPGDDLVKAVRDHILPLAPVAGGGLFVFAATEKSIPVTVALAKDTPEIRTAIIAELNALMLRDGAPSGKIYVSRISEAISLATGEVAHQLRVPAADVVLGKTELPVLGNITWATYTGENG >NZ_CP041973|919218:966714|965691_966714_+|WP_001028172.1|transposase|DBSCAN-SWA MNIVFLGIDLAKNVFQLCGLNQAGKPVYTKRTGRKELLQTLANIPACLIGIEASTGAFYWQREFEKLGHKVKVISPQYVRPFVRGQKNDGNDAQAIAVALMQPTMQFVPPKSPEQQDIQALHRARQRIVNHRTATVCQIRGLLLDRGIPIGSAVSRARRAIPLILEDAENGLSSRMRRTIAELYDLFNDLGRRIHFFDKEIETVFRQSEACPRIAKVKGIGPKTATAVVAAIGKGTEFKNGRHFAAWLGLVPRQHSSGDRQVLMNMTKKGDKHLRTLFIHGARAVVRVATNNNDGHMNQWVNQLKERRGFNKTTVAVANKNARIIWSMLRNDTGYQVVCN >NZ_CP041973|919218:966714|926512_927028_-|WP_000071068.1|DBSCAN-SWA MSNRIRNAQVFDARTGEYPVDMYIRWIIGGELDFDANYQRGYVWGHEEQQAFLNAVISGFPIGSVALAKAPDWCSRELPYIEVVDGKQRLTTLKKLITNEIPIILADGPLYWRDMTRAEQLAFGRRPLPAVVLDEVTYKDRLAYFMAVNFTGVPQSEEHKRHVMQLMEAAQ >NZ_CP041973|919218:966714|940950_941568_+|WP_001075993.1|DBSCAN-SWA MNQQQFQQAAGISAGLSARWFPHIDAAMKEFGITAVNDQAMFIAQTGHESAGFTVLKESFNYSVEALKKTFGKRLTPYQCEMLGRIDGRQVAHQPQIANLVYGGRMGNKDAGDGWKYRGRGLLQITGRENYVKCGAALKLDLISTPELLAQEKHAARSAAWFFTLRGCLMYSGDVVRVTQIINGGQNGLADRNSRYNKARAALLV >NZ_CP041973|919218:966714|951843_952170_+|WP_000588852.1|tail|DBSCAN-SWA MIKELVLKKPIMAHNEKLHVLELREPSYDEIEAIGFPFTVSGDGGVRLDSSVALKYIPVLAGIPRSSAAQLAKLDIFKACMLILNFFTRSETEEDSESGSTTPHTSGE >NZ_CP041973|919218:966714|919218_920493_-|WP_001680077.1|integrase|DBSCAN-SWA MSLTDTKVKNTRPSEKAVKLTDGFGLYLLVHPNGSKYWQLGYRFDGKQKVFSIGVYPAVSLADARQRRDEAKRLLTQGIDPNAKKQADEKVLQEKRDKTRSFRVVAKSWFATKTKWSEDYADTVWKRLETYVFPDIGDRNVSELDTGDLLVPVKKAETLGYLEIAMRIKQYITAILRHAVQQKLMRHNPAYDMEGAVQKPETEHRPALELEEIPLLLERIDAYKGRGLTTLAIKLNLLIFIRSSELRFARWSEIDFKSKLWVIPEQREAIENVKHSTRGAKMKRQHFVPLCRQALKILKEIRQLTYEEGNEAELIFTGCYDSFKPMSENTINKALRKMGYDTTQDICGHGFRTLACSALIESGLWSEDAVELQMSHKESNSVRAAYTHKAKHLDQRRLMLQWWADFLDENRYEMVRPFEFAQKQ >NZ_CP041973|919218:966714|942127_942478_+|WP_001135228.1|DBSCAN-SWA MPPRTPKSCRVRGCRSTTTDPSGYCESHRSEGWKQYKPGQSRHQRGYGSKWDVIRERILKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADSNLQSLCWPCHKAKTARERLK >NZ_CP041973|919218:966714|926156_926516_-|WP_000065085.1|DBSCAN-SWA MSNIDKLNDHELVDLKNAIERELKRRADGPKVTTYYVVSCITDAQHFTDLDCALRCLKSVTENLMEWVTESPENRDYVNQCTGIVGAKLQVKEMNLDHFNMRVAEKYFDDICYPQETAQ >NZ_CP041973|919218:966714|962847_963069_+|WP_001526483.1|DBSCAN-SWA MFLLYCRGTGEIGGRRPKLSPEQWAQAGRLIGAGIPRQQVAIIYDVGLSIHGLYWILLSANRKGVKKRDYRAN >NZ_CP041973|919218:966714|924165_924735_-|WP_001061370.1|DBSCAN-SWA MNNLMVDLETMGKKPNAPVVSIGAVFFDPQSGEIGPEFYTAVSLESAMEQGAVPDGDTILWWLRQSPEARAAICADAVSVTTALIEFNDFITCHADDLKYLKVWGNGANFDNVILRGAFERASLPCLWNYRNDHDVRTMVTLGRAIGFDPKRDMPFEGDMHNALADARHQAKYVSAIWQKLIPPTSNNI >NZ_CP041973|919218:966714|948391_948796_+|WP_000776844.1|head|DBSCAN-SWA MKLRQAQASATYLLPDPGELDQRIVIRRRVDVPADDFGVTPTYPEQIRAWAKKAQPGAAAYQGAVQIENRVTHYFTIRFRRGITADHEVLHDDISYRVKRVRDLNSKRRFLLLECEELGTDNGSDYAAESIFTR >NZ_CP041973|919218:966714|941564_942104_+|WP_000127618.1|DBSCAN-SWA MTAVFAFVKARWKTIIVLLMLAGAFLAGIIWSDRGWQKKWADRNSMESSQEANAQTAARWIEQGRIIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTEATKLAARLDAAKHTSDLAAAVRSKTAGADAAVLADMLGRLAEEARYYAERSDESYRAGMTCERIYNSVRESTNNPIAPH >NZ_CP041973|919218:966714|933136_934111_+|WP_000096529.1|DBSCAN-SWA MSSLIQLLDRPIAYNPAFAKLKAGKVKAGPVAAVFLSQLVYWHNRMDGGWMYKTQADIASETALTRDEQETARKRLVALGVLEEARRGVPATMHYRINTARLEALLLETAKPVKKGAQEKTRLRDFQNVETPQSGLVQPRKPDCGDAANKNVETPQTSTGQPNEQACGDPTIFPTGDYTETTQEITQESKTPFCPVAEQPDPEVTLTDQAIEVLTHLNQVSGSRYQKSKTSLENIRARLREGYSVADLQLVIDLKHEHWHENDEQYQYMRPETLFGPKKFESYLQSATRWDQKGRPKRADWGAKKRDVMAFGPVDTTIPEGFRG >NZ_CP041973|919218:966714|939632_940205_+|WP_000765639.1|DBSCAN-SWA MKLFSPLSYLRIKHEEKDWYDYKIPAAVSLIVTIVYYFHASKISLIETNGLLLQVNGLLQVLIGFYIAALAAVSTFSSSSIDEVMAGVPPTLVEKFRGQKLTVELTRRRFVCYLFGYLALVSFMLFCLGMISILIGKPFHLWLLTFCSPDAILWLKTVFVGVYIFILMNIITTTLLGLYFLAVRFHQSSL >NZ_CP041973|919218:966714|931761_932919_+|WP_001087406.1|DBSCAN-SWA MNSLTVNNRLSQQPGMYEYRPLRHECRLSNSLVVRNHREHSLTVGDESCRNLTAGFGMEGDFMSMSFAGNQKLSALSICARAIRMSVLALCGNSGVILLSVKRQEHIDSAIPGRYTVQAPHKAGAGRGNPEFNIEHNRAHAVFSCHEHCYAQIMVGRAGPVSAGPGSMLTGISTPVRLTTYKVVESLGGEFIEFNIEAATMATVPTLAQPEIRIINGQAVTSSLAVADYFIKRHADVIRKIESLECSTLFRKRNFAFTSISINQPNGGTRKLPCYQITRDGFAFLAMGFTGKRAAQFKEAYIDAFNQMEKQLSTPSVLSDAAHNASVLYSYISSIHQVWLQQLYPMLEKAESPLAVSLHDRINDAAALASLINMTLNRSEVRGRK |
60 | Salmonella_phage(77.78%) | plate,holin,integrase,tail,transposase,portal,protease,terminase,capsid,head | attL 922119:922133|attR 970493:970507 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1499077 : 1515019
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP041973|1499077:1515019|DBSCAN-SWA TATGAACAGAACGATTCTTGTACCCATCGATATTTCAGATTCAGAATTAACTCAACGCGTAATTTCGCATGTTGAAGCTGAAGCAAAGATTGACGACGCTAAAGTGCACTTTTTGACTGTAATCCCGTCTTTGCCCTATTACGCTTCACTGGGACTGGCTTATTCAGCAGAGCTTCCCGCAATGGACGATTTGAAAGCCGAAGCCAAATCTCAACTGGAAGCGATTATCAAGAAATTCAACCTTCCTGCGGACCGCGTGCAGGCTCACGTTGCAGAAGGCTCTCCTAAAGATAAGATTCTGGAAATGGCAAAAAAATTACCGGCCGATATGGTGATTATCGCCTCGCATCGCCCGGATATTACTACCTATCTGTTGGGTTCCAACGCCGCAGCCGTTGTGCGTCATGCGGAATGCTCCGTACTGGTGGTACGCTAAATACCCGAGCCCGCATAAGAGAGTGCGGGCTCAAATCAATCATCATCGCCTAGTGACTGACACTTTTTGAAAACAGATTGATTACGATTACGCCGCAGATGATAAGCAACATGCCAATAATAGCCGGCATATCCAGTTTTTGGCCGAGAAATAACCATCCTATCAATCCAATAAGAACAATCCCTACCCCAGACCAAATAGCATAAATGATACCCGCAGGGATGGTTCGCATTGGGATGGTAAGACACCAGAACGCAATACAATATCCGATGATAGTAACGAGGCTCGGTACCAGACGCGTAAAACTATCTGATAATTTTAATGAGATCGTGGCGATAACTTCTACCACGATGGCGATAAATAGAAAGATTACAGCTTCTTTAGTCATACATCTCTTCGTTCCAATATTTTTCGGGGCGTGATGATATCTTAAGTAATACGGATGACAGAATAAAGACGCTTTTGAACCATTGCGTATCTTGCCATAAAAACGTGCTGGCATTCGTCGCCGTAATCCGTACCATACGCGACATGCTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCATTTATCGCCAGCCAAATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAAGTAGCGATGTTGAATGCGCGGTTATCGTAATTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGCAAGAAATTCAAAAGAATACAAAGAAAGAGCAATACAACCTCAACAAGTAAGGGCAAAAATCACAACCATCTGATGCATAATAGTTTTATTTATTTTTCAATGCGTTAAGCATTATCAGCAACAACAATAAGCTACGATAATCCACTCTTTTGTTGCCCCATTTTTGCCCCTTTTACAGCATTTTGCCCCATTTTTGCCACCGAAAAAATTCCAAAACTTCTCAACCTCAGCACGTTCTTGAATATGGCATATGACAATTTCCTTTACGATTTATATGCTGTTATAAAAATCCCCAGTTGTCAGCAACAACTGGGGATTTACTTTTCAGGCCCAAAAGACGTTCACTACGACTCTGCCCGGCAGCTTCGAATATCTGGCGCGCCTTATCCAGGCTGGCGCACCCCACCAGTAAAAAAGGCACCAGTATCGCTACCAGTGCCCATTTCGCCGCCATTCGCGGCATTCTGTGTGTCCAGTGTTTTCGGTTCATATCAATACGCGCTCTTTCATCCAGCCATAGACAAACGACTCGTCAGCCTCCCGCTTTTCCGCCAGCTCCAGATAGCGCTCCCCCTGCGTGCAGTTCAGCGCCACCAGCAGTACACGCTCGCCATCCTTACCGCGCTTTTCCAGATAAACACGTAACGCGTTAAGGGTTCGCGGCCCGATGCGCCCGTCCGTATCCATATCCGGATACAGCCTCCCGCCCTGGTTGAACACGTTCAGCCAGCGCTGCAACATTTTCGCTGCCACCGACGGCCCCATGTTCACGCCCGTGTCACACAGTTCGGCAGCAACATCCGGCGAGGCCTTCGCCACCCGGTCAAAGCGGGGACCGTACCAGTAGTCGGTCTCCAGAATTTCCAGCGCCTGTCCACGGGTTAAATTGCGCATATCACCACGGTATCCGTGGGCGCGGGCAACTTTTTCCGTAATACCCCATTTTGTCGGCCCGCCTTTATCGTCCGGATGGTTGACGTAGCCGCCTTCCTTACCCAGAATTTCATCAAAAATTTCGTCCTTCGGTTTCATCGCAGCCTCAGCAGTGAAAGTATTTTCGACACATTACCCCGCGCCCACAGCACCAGCCCGCAGAACAGCGCGTTCATCAGCACCACAGGCCAGGATGAAGACGGGTAATGTCCGGCCAGAAAACGGAACGGAACCAGCGCATAGCCCAGCATCAGCAGGTACGACAGCCAGGCTATTCCCGGTTTGTATCTGGCGCCGTTCCGGCGGTAGAGAAAAAGGGCCAGCACTGTCACCAGGCACAGGCCCGCATTCAGTGTGCCGGTCAGATTACTTTCCATGACCACTCCCTCCTCCCCGCATCCGTGAAAAAACCCCGGTCAGCGATGAAATGTCCTGGTCATGAATGAAGGTCAGAAATTTCACTGACAGCACCGACATCACCACTGCACACAGCGCATCCAGCGGTTTACCGTTGTAGTCAGCAAGGCGGGACAGGAACGAGGCCAGTACAGGCACAGCACATCAACCGCTTCTGCTGTTCCTGCCACGTCAGCCAGCTCACGCAGTGCCTGAGCATTCGCGCTACGCGGTGACGCCTCATAGCGGGCGCGGATATTGGCACTCATCTGGGCAATCTCGCCGGAGAACGGCGCAAAGGCAGGAAGTGCCGCGATGGTTTTCTCCAGCACTGCAATTTCTTTCGCGTCACAGGTGCCATCGGCGTATGCAATGGAATATGCGCCCCAGACGGTCGCCTCCACCGCATCCCGGTTCTCCATCTTCTTCACTTCAACAATAGCCTTGCGGGTTTTCTTTCTGAAAATACCTAACATCGTGACTTTTCCTTTTAGTGGGTGAGCCTGCGCCCGGGGGTGACCAGCCCACAGAGAAAGTCACACTGACCATCCCGTAAGCTCACCCCTGAAAGGCTCCGTGGTTAGTTATGAATGTGCGCCGGGCGTGGCGCGGGGAATGAAATAAGCCTTACCGGAAAATAAGGTTGTTTCCGGGGTTCCGGTCTTATTTTGTTATGAGTAAAATAATGGCGACCTCCCGAAAGGGATAGCCTGAAATTTTTCAACTATCTATCACGGACATTGTCCCGCGGCTTAAATCCGACAGCCGCGCTTCTTTTTTAATTCATTATTTCCCGCCCCGGATACTTCCCGCACCCTGGTCTGAAAACGTTAACTTTATTATATGGCGGGCAGAACTTTTTCCCGGAATAAAAAAAACCGCCTCACGGGAGGCGGTGCTCAATATCTGAAGCATGTTTTATAGTTATCGACTGGAAAAACAGTAGAGGTGTCGGGTGCCTCCCGAAAACACATATTAGCCCGGTATGTGTCTGTGACCATCGGCAGAACATTTTTCCCGGTCACCCCCGCACTGGGGAACACCTCAATAAATAATGCAGTGCCGGAATTATCTCCGGAGGGCCGTTGGTCAGTCACCATGACCCGCAGACGAAAATAAACGCACCTCATATCCTGTTTATGCCCCACTTTATGTGGGTTTCACTGCATTCAGATGGCGTACCACAAATCCCGTATGTGTTACAAGAGCATCAGACAGTGGCGCAAGCAATGACCGTTTTTTTGCAGTACGCCATCTGAATGCAGTAAAAAAATTGCGGTGCCTGATTCACTATCAGGAGGTTAAATTAAAGGAACATTAATGTAACCACCCCATATCCATATGCCTGCACCACAAAGAGTAAGAACGATACCTATATCAATGGAATCTTAATTTCACCTTAATGCCAAATCGAGTGATGATTCTTTAATCAGACGACTCAGGAGGTATAAAGAATCAACTCTCTTATTCGACGGAGTGGAGAAATCTGCCAGTCACCTCCGCCAAACTGTCGACAATAATAAATCATAAAAATTTCATTTCAACAAGCAGTCGCGTCAAGAAATGTAAATTTATCATTAGATGTTTATTTATTCCAAATGTTTTGCATTAATTCAAAAAAGACAAAATACAATCAGGGCATATTCTGATGACTCCATCTTATTTCTGCAATTTGCGGGAATAAAAAAACCACCTCATGAGAGAGAGGTGGTAATCATAACATGGAGTTATTATTATAGTTTTATTGATGGCGATAAAAAACCTGAGGCGCCGGGGACTCCCGAAAAATATTGCTGTGTCAGGTGTTGTATTGACAAGAATTTTCCGTATGTTCTGCCTCTGTGATATGAAGATTTTTAGTAATAAATGCAGTGCCGGGATGCCCCGGAGAGTCTTTAGTCAGCCACTATGACCCGCTACAAATTGATCAAACGCAGACAATGCTGTTTAGCTCTGCTCAGAGCAGATTTCACTGCATTCAGATAGCGCACTGAGAACCCCGCATTACCAACACACTGACATTCTCATTTATCCGATTCACAGTACGCTATCTGAATGCAATAAAAAATTGTGGCACTGAGTCCATCGGGGCTTCTCCGCCCCCACAAACCGGATGCTGTATCAGGATATACATCATGCACTCATCATAAACGCACTATCCACACTTTACGTGCCAGCACCACCAACGGAGTAATTACGATAGTTGTATCGAAAAAAACGCCTCTTCTTCTCTGGTTGAGAGCCGGTTCCTTCACGTGAATTTTCCAGGATGCACAAAAAAACCGGCTCTCTGATCCTCGGGGGCAGAAGAAGGTTATCCATCATCCCCTTCGGACCGATGAAAATAATACGAGTCAGAACAGGCGTTTCAACAATTCATTGTGTCAATAAACGTAAAGTTATTATTACATTTTTATTTTATCAAAACATTTCACATTAATTTAATGTACATAAAACATATTTAAAACGCACCATGTTTAACAACCCGGATTACACTGGCGAATATTTTTCTGTATTGCAGAAACGACAAAACCCGCACGATGGCGGGTTTCAAAATGCGTTCATGTCTGTCATTAGCCTCGCGATACAGCTTTGCGAAGCGTACTGGAATTGAAGCAGTTTGTGGCTCATTTTGCAAATGATTTTTTAAGCATAATCGAACGCTTTTCTCATAGGTGAATACAAAATGAACTCAGCGATACTCAACCACTGCTCAACACGGCGTCGGCATGTAATCAACGCCCACTCTGGGTACTGTTCGTTTAGTAACTCGGCCATCCTTCTCTTACTCATCCCCCGCCCCACGTAACGCTGGTTCAGGATACCAAGCAGCCCCGGATGATCCGTCAGTACCTCGCCTACAACCCGATCGATAATCAGCGCTTCCGTATCCGTACAGTGTGCCAGCCAGCTTTTTTGCTTTCCGTTGATCATCTCTCTCAGGAACACTTCCAGTTCAGGTTTCTCCAGCCCCGCTTTTTTCATACGGCGCAAAGCATCGTTAATGGCCGTCTTCGTCAGTTTTTTGGACGCCAGCAACTGGTTAAACATATTCCCTGTTTTGCCGCCACCAATGTATGACCAGCGCCCCCACATACGCAATTTCCCCTGAATCCACACACTTTCCAGCGTGCTGAGACGAAGGTGTTCTCCGCTTTTTCCGGTGTTTGTTGGGTAAATCATAAAATGCCTTTCTCTCTCCAGATTTCCTGCGTGCGGAAAACGCCCTCTGCGTGCATCAGGCGAATTTCTGTTTTCGTGAAGTCGTCGATTTTTACCCGCCCATCGACAATATCGTGGCACGAGCTACAGGCAATTGCTGCCTGCATATCGTTTGGTTTTGTCGCCGTTCCGCACGTACCCGCCAGCCGGTAATGTGCCAGTACGGACGTTTCAGGATTATGGTTGCAATGGCCGGGAATTCTTACCGTACACATCAGGCCCCGCGCCGCTTTACGTAAATCCGCCATTACGCAAACTCCAGCAGTTGCGCTGCGACATTTTCGACCTCTTCCGGAGAGGAGAATTTACGAAACAGAATCCAGTTCCACAGGACGTTCAGTACGGCCTTATAAACCTGCTGAAACTCGGTTTCGTCCATATTCGCAAAAGCGATGGATTTCGCCCGGCGCCCACGGCTGCCATCCGGATAATAATGCTCAGTATAAAACCCGGCCTGAATGGTTACCCACTCGCGGAAAGCGTCGAAGGACTTGAGCAGCGCAACATCGAGCGTTCGGTTGACAGCAACCTTATGCAGGTACTGTTCCGCCGCGTCACTGAGTGCGGGACTGTGGCCCTGCGCTGCTGACTCACACAGGAAATCGACAAAGCCGGTAATCAGCTCCTGTTCCTGAGGCAGGATCGCCCCGCCGATCGGAGTCCAGTAATCGAAACCAAGCTGAAGGAGCTTGAAAAAACGTTTGTGGAACGCGTAGTTACGTACTCGCTTAAAATCGGCGTGTATCCACGCACCGATTTTTACTGAGCGCAGGAAGTCCTCACTCTCCGGCGTTGCCGGGAGCAGAAGCCCTGATGAGGTTTGTTTAACAAGTTGTAGATGCGCCATCGTTCTCTCCGGTGGCGCTGTAGGTTGCTGATTGTTCAGGTCAGCCGTAACATATTAAAACATTAATAACTGACAGTGAAACCCAGTCTTATCAGATAATCAATAAACGCTTCAACAGACAGAATCAGATGGTCGTCAGGAATTAGCGTACAGAATGAGATTTCACCATTTTTTACACGTACTGCATAAAGCCCGTCTTCATCTAACTCTAATAAATCCTTGAGTTTTTTCACGTTACCTCCAGACAACTAAGGAAAAATGAAAAGGTGCGATTTCAACGCGATTTCTGTTGAGGCGGGAAATATAAACACTGCGACTATTTATTTCATTATATAAATTTGCTTATTTTATGTTCACCAGCAAGGACATTTTTCACTTGTTGCGCAACCAATCTGAAAGTTGATCATTTTTATGAATTTTTATTTTACGGGTAACAAAAAACCCGCCGAAGCGGGTTAAGTGTGGGTGCGTTGAGGATGCCTGACACGTCAGAGGTGGCGGGGATTTCTCCCCGCCAGGTCTCTTACTCCTCAGGTTCGTAAGCTGTGAAGACAGCGACCTCCGTCTGGCCGGTTCGGATTCGTACCTCGCAGAGGTCTTTCCTCGTTACCAGTGCCGTCACTATGACGGTTAAACAGATGACGATCAGGGCGATTAACATCGCCTTTTGCTGCTTCATAGCCTGCTTCTCCTTGCCTTTCGGCACGTAAGAGGCTAACCTACATGTGCAAAGCATGAAATTGGCCTCAGATTAATGTTAAGCGTCTTGCCGGACGCGTAATGTTAACTGGGGCTTTTCTCTATCTGCCTTTTGGTGTTCATGCCTGAGGCAGATAGCCTCAAGCACCCGCAACAATTCTACTTAACTCTCCTTTTCCCGCAAACCGTTTTTATCCCCAACGCAAATTTTACCAATACCCCTTAATACATCTCCCTTGCCCTGACGATACATCCCTCTTTACACAGACCAAAATTTATGTATTATCGCTTGAAAACAATCATTTAAGAGCTATCGGTGGGTGAATTCGCCCTGCGGTAGCTTTTCCTTTATGCATTGCATACTTTATGTTCTAGTATATTCCTGTATACTCAAAAGGATTTTTCATGCACAGCGTTAATTTCTATTCATTCCGCGTATTGACCCATAAAGGCAGTCGAGCCAGCAAAAAACTTAATGACTTAGGTTTAAGTAATAAAAAAACGGCATATGAACTTTTTGTTGATTATTTTACTCTTTATAAAAACACCCCCATCGAGTTCGGCGTTTCAAAAACTAAAATATCTCTGGAACAACATACTAAACTTCACTTTGATAACACAAAAAAAATCATATATGGTTATATAAAAGTTGGAAAATATGGAGAAAGCAGTGAAATAAAAGATGTAAAACTCAAAAAAGTCCATTACAGAACAACTGCTTATGATGTAACACTCAAAGAGCGTTATATTTTAATATATCTACCAGATAATCTTGAAGAAGGAATTATTGCATTCCATTCATGCGATAATATTTCTGCTCGAGGTGTTCTTTCTGATTCTATCACTGAATATCTAAAAAAACAATTTCAACTAGAAGCAAGAATCAATCCATTACATCATAAGAAAATCCCTCAATACATTCTCAATTCTGAATTGAAACAAATTAAAGCTCAAGGATATAAAGCACCAGAAGATATTGCTGATTCCTTTGGTAAAAACAAAACAAACATCAAGACAGACTTAATAATAAAAGCAAACGATGGCATATTCGGAAGTTTCAGGGATTTAAGAAACAAGAATATAGGAAACATCATTGAGATTATTGAAGATAAATGTGATGCAATAAAAGTAAGCTTACAGCTCGGCAGTCGGACTGTCGTTTTCAATTATGATACCATACTAAAAAAAGGAATTTCAGCAGAGTTAGATGATAATGATCTAAAAATCAACCCATTAACAGGTATACCTGATCTAACAGCACTTCATGACACGATAAAAAACCTTTCCAATGATATATTGGAAGAACTGCACTGTGGAAATAAAGGGGTGATTATATGAATAAAATAAATGTGCTGGGTGTAATAATAAAACACTACAAAACAATGTCAGATCAGCGTGGAACAATGTTGATGAGCGACATTACCGTACATTTTATAGTTCCTCTATCTCTTTCTTTCGTTCTGTGCTGGACATACGGAATAATGAAACCGGCAATTGCTTCCGTCTTCGTTAACTTCGGGGCTATTACAACAGCACTATTAATGAGTGCAGTAATAATGATTTATGAACAAAAACAAAAAACCATTACTAAGATATCAGATATAATTGAAGGAAACAAATCAAGAGACAAATTGATATCATTAAACACTAACAAAACCATATATGAGCAGTTATGCCACAACGTCGCTTATGCAATATTAACTTCAATAGTATTGGTTATATTTTCAGTAATAATATATTTCCTGCCTGACAATGCAGTGGATTTAATGAAATGGTATTTTCGCGCACCCGCATATATCGTTAGCTTTTTAGCCTATACATCCTTTTTTATCACTGTCATAACTTTCTTAATGGTAATAAAAAGATTTAGCACGATTTTAGACAATTAAGCAATGGAGCTACCGCCCTTTCGGGCGGTCTCCTGGTGTTCTGAGCGTGCAGGAATCCATCCGGTTAAGGATTAAAGTTTATTTACATCACTAAATTTAATTATTCATATTTGGATTATGCTTTCTCTTTCACTTCACGCAGTTCCGATTGTTAATTTGGCTCACAACAGCACCTCCTGAAAGTTTCCCCGATAAAACGCCAGTACCCGCTGCATGTACTCGCTTTTACGACACTCACGACAAATTACGTTGTGGTGCCTGTCGTAACGACGTACCTCACCGTCAGGCAGCTTCCAAATCAGGTCAGCGTCCACTTTCGCCTGTTTCTTCCAGGCACGATAAGCCTCTTCTGACGGGAAAATCCCCCCTCTTCTGCCCGCCTGGTACACATCCCCACAACGTTCTGCCTTGTCCAGGTAGTGGCGGGTCGAATAAATGGTTAACCCCGTCATCCTCCTCAGCTCCGTAAGTGTTATCAGGCCGGAGTTCATGAGGTGTAACCTCGAAATTCGTAGCCTCTGCGATACGCAATACCTTTTCGGGGCTTAACTGACTACGGCCAGTAGTAACAAGGCTAATCATCGATTGCGAACAACCAGCCAGCGTGGCCAAACAAGACTGTCGTACACGATTTTTTTCAAATATTCATCTAACGTCATAAAGGTCACCTTAGTAATTCTCGCTAAACATTAACCATACTAATTTAAATGATCAATACTTATATCAGTTTGAGTTTATGAGTCACATTCATAAGATGAGCGCATGAGAAAAAAACGTGAAGAAATAGCTCCACCAGAAGCTACCCAGCGCTTACGCGCCATCTGGGACGCCAAAAAGCGAGACCTCAAACTTACTCAGGAGATCGCCGCTGATCTTATGGGCTTTGAGACACAATCTACCGTCAGTCACTATTTGAACGGTAAGGCACCTCTCAACACTGATGCGGCCTTAAAATTTTCTGTTCTATTGAGAGTTAAACCCGAAGAGTTACGACCGGATTTGGCCGATCTAATGAACTACGTTCGTTCTTCTGGTACTTACGATGACAACTTCGAAGGTGGTGGCTGGCGAATGGTGAGCAGACAACAGGCTGATTTACTAAACCTTTTTGATATACTCCCTGAATCTGAGAAAGAAAAACTCATTGACCGGCTTAAAGGTCAGAATGAGCTATATAAAGAGGCTTTCCAGAACATGCTCGCTGCACAAAAGCGCCTAAAAAATCAGTGACGAACCAACCACATCAAACCGCCTCTCCCGGCGGTTTTCTTTTGGCTAAAGCTCCTCCCTCCCCCTCTTGCATCAGATAAAACACATTTATTTTCATTAGGATAGATCATATTTATGATATATATGTATATTTATCTTGATCAACTATATGAATATCGCTAATATAATCTCAGAAACAGCACGGCGCTGTAGGTTTTAGTTCCGCCACCCGGCGTTAAGGGGAGAGGGAAAGATGGGAAGGAATGAAGTGATTCAGTATTTGATGGATAGTTGCAACGTCAGCTTTAGCGCAGCTCTCCAAGCATTGCGCGACAATGGATGGGATATGTTTTTGGCTCAATGCGAGCTACAGGAACAGTATTATCCGGGGTGATAATGGACAAGCTGCAAAAAATCCACCTCGGCAATAACGAATCCCTGGTGTGTGGCGTGTTCCCCAACCAGGATGGAACGTTCACCGCCATGACGTACACCAGAAGCAGGACGTTTAAAACTGAAACAGGCGCACGTCGCTGGTTAGAAAGAAACTCAGGTGAGTGATATGGATTTCGACACAATCATGGAAAAGGCTTACGAAGAATACTTTGAAAGCCTTGACGAAGGAGAAGAAGCACTCAGTTTCAGTGAGTTTCTGCTGGCGCTTTCAGCTAACGGCTAATATCGAACCGTTTTGTGCAGGGATTCCAGGTGGAAAGCATTAACGACTCATACTCGGGTTCGATTTTGAACATCACAGAAACACCGTCGATACACGAACCAGCCGGACGAACGATACCTGCCTTAACCAGAGCATCTGCATCAGGGTTATCCCAGGACGCCCGGAAAGTGGGAGAATGAGTTTTAAGAAAGGGTTCAAGTAAAAACAGCTGTTCAGTACTCAATGAATTAAGCGTTTTTAACATGCGTTTTGCCCTGCCCCTACGCTGGCACAACGGCAGCGCCGACACGAAAAAATAACCAGTGACTTTGAGGATCAGCACTATAAGGTAAGCCACAGCAAAACTAAAAATCTGAACGGCATATGGTAAATCGCTACGCGCCTCAATAAGCTCCGTAATATCTCTAGGAATGAAGATGATAATTAAGAAAAATAAAGCCACTGTGAGCATAAATTGTCCGACGGATTTACCAATTAGGAATTTTACGATAACCAGAGCGGTTTCTGACATATCAATAACTCTCAACTGTAAGGGTATTGAAATGTTAACACAGGTTCTCGCTGTAGGGGTATAGCCGAGACCACCGAAGCCCGGAGGTGGTTAAATAAAGCCGGGCACAACACGAAGGCGCATTTCTGATGTTTTCTGAGTCGGTCTTGTCTGTAAATCCAAATAGTGGAAGTGCGCCTCCGGTTGTAGTTGCCACTGCGACAATAATGCTGTGTGTAGTACTTGGCGGCATCAGTTTTTCTTAGTCCTTTCTGATGTCCGCCCTTTTTAAAGTGAATTTTGTGATGCGGTGAATGCGGCTAAGCGCACGCGGCACAGTTAAAACTCCCTGAATCAGTATGGGTGGTTTAAGTCGGCATTAATTGTTAACTGGTTAATGTCACCTGGAGGCACCAGGCACCGCACCACAAAATTTATTTACCAGAAATGGAGGGGCTATGATTGCTCATCACTTCGGAACGGATGAAATACCGCGTCAGTGTATTACGCCGGGAGATTATGTTATCCATGATGGTCGTACTTATATCGCCTCAGCGAATAACATTAAAAAACGCCGTTTATATATCCGTGATTTAACAACGCAAAGATGTATTACCGATTGCATGGTAAAAGTCTGGCTGAACAGAAATGGTCTGCCTGCCAAAGCTGAATCATGGTAACAGAACAGTAATCGTTTAAACCATCCTGTTTTTAAATATGCCTGCAATGGCAGGAATTCACTCAACCTGAAAAAAGGAATCTATATGAAAAATGTACCTGAATCAGTAATTGCAGAGCTCCGCCAGCTTTCAGGAAAAATTCGTACGTTATGTATCGAAAACAATATGCCCTGTGTTGTTTCTTATGCCCGGGACTGTGACGATGAAAGTGTTTCCAGAACTCTTGTTGCATATACAGACAGTGAAACAGGCGCATATGACAGGTCAATAACAGCCGCAATAATGCTGTTAAAAATGAACGAAGCTCCTCCCGAAAATTTTATTTCATTGTTGAAACTGATGGAGTGTAAAGAGCTCGTCACGAACGCATTCTGCTCAATGAAAAATGAAAGTCTTCATTAAGTATGATTTGTAATAAGGGTAATGATAATGAGCGACAATAAAACAGAATACTCATATTATATTAAGGTTAAAAATGAAAGCGCCCGGAAACGTCTCGGCTTCCCTTTTGCTTTCTGGTGGAAAACTGAAAGCAGCGAAGCCGCTGCCACAGCACGCCTTGCCGTATCAATGCTTGACGCCGGATTCGAACCGACAGATTTTGCAAAACCGGTTCGCGTTAATTCCCCCGCTGTTAACGAACTTCCGCCGGAGGGAAGTTTTGATACCACCTTCTGTCAGAAATATGAGCTGGGCGGCGAAGATGGCAAAACATTTATGCTCATCCCCGGCACGCCCGCTACTGACGCCCACGACGAAAAAACGGAGGAATGCGCCGACGACGCTGGCACCGAAGAAAGCGGGACAGACACCAGTGACAACGACGAATGTCAGGACTGCGAAGTTTCCGTCGCCACCCTGCCGTTCCCCCAGCGCGTGTTGCACATTTTTACTTACGCTGCCACAGACAAAAAATATTTGCATCACGCCACCCGCGCTCAACGCAGGCATATTACCGTTCTCGAAATGGAACAGGAAAACAGCTATATCCAGAACCTGTTAATGGTATTGCGGAAGTCTGAACAGGTTCATGCCCAGGATGAGTAAAGGAAAACAGAAACCTTATTACTCGGTAAGCAGTTTAGGGGCAAGGTGGAATGCGGCAGTAAAACGTGCTGGTATTCGCCGCCGTAATCCGTACCATACGCGACATACTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCATTTATCGCCAGCCAAATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAAGTAGCGATGTTGAATGCACGGTTATCGTAGTTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGCAAGAAATTCAAAAGAATACAAAGAAAGAACAATACAACCTCAACAAGTTGCAAAAGCGCCTGCGCCGTAACGTTGGCGAAGCGATTGCCGATTTTAATATGATTGAAGAAGGCGATCGCATTATGGTTTGCCTTTCTGGCGGCAAAGATAGCTATACGATGCTGGAAATTTTACGTAATTTGCAGCAAAGCGCCCCGATCAATTTTTCACTGGTCGCCGTCAACCTCGATCAAAAGCAGCCAGGTTTTCCGGAACATATCCTGCCAGCCTACCTTGAGCAGCTGGGCGTAGAATATAAAATCGTCGAAGAAAACACCTACGGCATTGTGAAAGAAAAGATTCCGGAAGGAAAAACCACCTGCTCGCTGTGCTCGCGTTTGCGTCGGGGTATCCTGTATCGTACGGCGACTGAACTGGGCGCGACCAAAATCGCCCTGGGCCACCATCGCGACGATATTCTGCAAACCCTGTTTCTGAATATGTTCTATGGCGGAAAAATGAAAGGGATGCCGCCGAAACTGATGAGCGATGACGGCAAACATATCGTGATCCGCCCGCTGGCTTACTGCCGCGAGAAAGATATTGTCCGTTTTGCTGAGGCCAAAGCCTTCCCTATCATTCCTTGTAATCTGTGCGGTTCGCAACCAAACCTGCAACGCCAGGTGATTGCCGACATGCTACGCGACTGGGATAAGCGCTATCCTGGACGGATCGAGACGATGTTTAGCGCCATGCAGAATGTCGTGCCGTCTCACCTTTGTGACACTAACCTGTTCGATTTCAAAGGAATCACTCACGGTTCCGAGGTCGTCGACGGCGGCGATTTAGCGTTCGATCGTGAAGAGATTCCCTTGCAGCCCGCTGGCTGGCAGCCGGAAGAAGATGACACCGCCTTAGAGGCGTTGCGGCTTGATGTTATCGAAGTGAAATAATCTGCAGGCGTCTCAGCACTCCGCTGAGACGCCATGCTATCGATCATTTTAATAGCCGTACCCGGCATGACTTGCCTTTGATCTTCCCGTTTTGCAACTGCTTCCAGGCTTTTTGCGCTACTGCTTGACGTACGGCGACGTAAACGTGCATTGGATGCACGTTAATTTTGCCAATATCCGCCCCGTCTAATCCAATATCGCCGGTCAGCGCGCCCAAAATATCTCCCGGACGCATTTTCGCTTTTTTGCCGCCGTCAATGCATAGGGTAGCCATCTCTGCGGCCAGAGGGAGTGACGGCTGCCGGGCGGGCGCATTCAGCCAGTTCAGCTTGAGTTGCAGCATTTCTGAAAGAATATTCGCCCGCTGCGCCTCTTCCGGCGCGCAGAAACTGATCGCCAGGCCGCTGCTTCCCGCGCGCGCCGTACGGCCAATACGATGGACATGCACCTCCGGGTCCCAGGCCAGTTCATAGTTAACCACCAGTTCGAGCGATTTAATGTCTAATCCTCGCGCGGCAACGTCGGTGGCAACCAGAATGCGCGCGCTACCGTTTGCAAAACGCACCAACGTCTGGTCGCGGTCGCGTTGTTCCAGATCGCCGTGGAGCGCCAACGCGCTTTGTCCTACCGCATTAAGCGCATCACAAACGGCCTGACAATCTTTTTTGGTATTGCAAAATACCACGCAGGACGCTGGCTGATGCTGGCTAAGCAACGTTTGTAGCAGCGAAATTTTTTCATGCGCAGACGTTTCGAAGAACTGTTGTTCGATAGCCGGTAGCGCATCTACCGTATCGATTTCAATACGTATTGGCTGCTGCTGTACACGACCGCTAATCGCCGCGATGGCCTCAGGCCAGGTTGCTGAAAACAATAACGTCTGGCGCGTCGCAGGCGCAAAGCGGATCACCTCATCAATGGCGTCACTGAATCCCATGTCCAGCATTCGGTCTGCTTCATCCATTACCAGAATATGCAGCGCATCCAGCGATACGGTTTCTTTTTGTAAATGATCCAGCAGGCGCCCCGGCGTCGCGACAATGATATGCGGAGCGTGCTGAAGCGAGTCGCGCTGTGCGCCAAAGGGTTGCCCGCCACACAAGGTCAGAATTTTGGTATTTGGCAGAAAACGGGCCAGGCGACGTAACTCTCCGGCAACCTGATCCGCCAGCTCCCGCGTCGGGCACAGCACTAATGCCTGTGTCTGGAACAGAGTGACGTCAATTCGATGCAAGAGCCCAAGACCAAACGCCGCCGTTTTGCCGCTACCGGTCCTGGCCTGCACACGCACATCATTACCCGCCAGAATGACGGGTAATGCTGCGGCCTGAACAGGCGTCATCTCAAGATAGCCCAGCTCAGTAAGGTTATTGAGCTGGGCGGCGGGCAAAACATTCAGGGTTGAAAAAGCGGTCAC
Protein sequences of DBSCAN-SWA_5 >NZ_CP041973|1499077:1515019|1511081_1511303_+|WP_000560208.1|DBSCAN-SWA MIAHHFGTDEIPRQCITPGDYVIHDGRTYIASANNIKKRRLYIRDLTTQRCITDCMVKVWLNRNGLPAKAESW >NZ_CP041973|1499077:1515019|1510122_1510644_-|WP_000004762.1|DBSCAN-SWA MSETALVIVKFLIGKSVGQFMLTVALFFLIIIFIPRDITELIEARSDLPYAVQIFSFAVAYLIVLILKVTGYFFVSALPLCQRRGRAKRMLKTLNSLSTEQLFLLEPFLKTHSPTFRASWDNPDADALVKAGIVRPAGSCIDGVSVMFKIEPEYESLMLSTWNPCTKRFDISR >NZ_CP041973|1499077:1515019|1501287_1501569_-|WP_000445513.1|holin|DBSCAN-SWA MESNLTGTLNAGLCLVTVLALFLYRRNGARYKPGIAWLSYLLMLGYALVPFRFLAGHYPSSSWPVVLMNALFCGLVLWARGNVSKILSLLRLR >NZ_CP041973|1499077:1515019|1509007_1509475_+|WP_001227859.1|DBSCAN-SWA MRKKREEIAPPEATQRLRAIWDAKKRDLKLTQEIAADLMGFETQSTVSHYLNGKAPLNTDAALKFSVLLRVKPEELRPDLADLMNYVRSSGTYDDNFEGGGWRMVSRQQADLLNLFDILPESEKEKLIDRLKGQNELYKEAFQNMLAAQKRLKNQ >NZ_CP041973|1499077:1515019|1506180_1506393_-|WP_000882662.1|DBSCAN-SWA MLCTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP041973|1499077:1515019|1501668_1502064_-|WP_000900605.1|DBSCAN-SWA MLGIFRKKTRKAIVEVKKMENRDAVEATVWGAYSIAYADGTCDAKEIAVLEKTIAALPAFAPFSGEIAQMSANIRARYEASPRSANAQALRELADVAGTAEAVDVLCLYWPRSCPALLTTTVNRWMRCVQW >NZ_CP041973|1499077:1515019|1505057_1505657_-|WP_000940751.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESEDFLRSVKIGAWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPIGGAILPQEQELITGFVDFLCESAAQGHSPALSDAAEQYLHKVAVNRTLDVALLKSFDAFREWVTIQAGFYTEHYYPDGSRGRRAKSIAFANMDETEFQQVYKAVLNVLWNWILFRKFSSPEEVENVAAQLLEFA >NZ_CP041973|1499077:1515019|1507690_1508245_+|WP_001033796.1|DBSCAN-SWA MNKINVLGVIIKHYKTMSDQRGTMLMSDITVHFIVPLSLSFVLCWTYGIMKPAIASVFVNFGAITTALLMSAVIMIYEQKQKTITKISDIIEGNKSRDKLISLNTNKTIYEQLCHNVAYAILTSIVLVIFSVIIYFLPDNAVDLMKWYFRAPAYIVSFLAYTSFFITVITFLMVIKRFSTILDN >NZ_CP041973|1499077:1515019|1501558_1501747_-|WP_001688615.1|DBSCAN-SWA MPVLASFLSRLADYNGKPLDALCAVVMSVLSVKFLTFIHDQDISSLTGVFSRMRGGGSGHGK >NZ_CP041973|1499077:1515019|1506761_1507694_+|WP_000556390.1|DBSCAN-SWA MHSVNFYSFRVLTHKGSRASKKLNDLGLSNKKTAYELFVDYFTLYKNTPIEFGVSKTKISLEQHTKLHFDNTKKIIYGYIKVGKYGESSEIKDVKLKKVHYRTTAYDVTLKERYILIYLPDNLEEGIIAFHSCDNISARGVLSDSITEYLKKQFQLEARINPLHHKKIPQYILNSELKQIKAQGYKAPEDIADSFGKNKTNIKTDLIIKANDGIFGSFRDLRNKNIGNIIEIIEDKCDAIKVSLQLGSRTVVFNYDTILKKGISAELDDNDLKINPLTGIPDLTALHDTIKNLSNDILEELHCGNKGVII >NZ_CP041973|1499077:1515019|1509859_1510015_+|WP_085981757.1|DBSCAN-SWA MQKIHLGNNESLVCGVFPNQDGTFTAMTYTRSRTFKTETGARRWLERNSGE >NZ_CP041973|1499077:1515019|1504767_1505058_-|WP_000774470.1|DBSCAN-SWA MADLRKAARGLMCTVRIPGHCNHNPETSVLAHYRLAGTCGTATKPNDMQAAIACSSCHDIVDGRVKIDDFTKTEIRLMHAEGVFRTQEIWREKGIL >NZ_CP041973|1499077:1515019|1499561_1499900_-|WP_000159240.1|DBSCAN-SWA MTKEAVIFLFIAIVVEVIATISLKLSDSFTRLVPSLVTIIGYCIAFWCLTIPMRTIPAGIIYAIWSGVGIVLIGLIGWLFLGQKLDMPAIIGMLLIICGVIVINLFSKSVSH >NZ_CP041973|1499077:1515019|1499077_1499512_+|WP_001082296.1|DBSCAN-SWA MNRTILVPIDISDSELTQRVISHVEAEAKIDDAKVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEAIIKKFNLPADRVQAHVAEGSPKDKILEMAKKLPADMVIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR >NZ_CP041973|1499077:1515019|1512666_1513602_+|WP_001156217.1|tRNA|DBSCAN-SWA MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIVRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTALEALRLDVIEVK >NZ_CP041973|1499077:1515019|1511732_1512350_+|WP_001676915.1|DBSCAN-SWA MSDNKTEYSYYIKVKNESARKRLGFPFAFWWKTESSEAAATARLAVSMLDAGFEPTDFAKPVRVNSPAVNELPPEGSFDTTFCQKYELGGEDGKTFMLIPGTPATDAHDEKTEECADDAGTEESGTDTSDNDECQDCEVSVATLPFPQRVLHIFTYAATDKKYLHHATRAQRRHITVLEMEQENSYIQNLLMVLRKSEQVHAQDE >NZ_CP041973|1499077:1515019|1508406_1508736_-|WP_001676916.1|DBSCAN-SWA MNSGLITLTELRRMTGLTIYSTRHYLDKAERCGDVYQAGRRGGIFPSEEAYRAWKKQAKVDADLIWKLPDGEVRRYDRHHNVICRECRKSEYMQRVLAFYRGNFQEVLL >NZ_CP041973|1499077:1515019|1511387_1511705_+|WP_000800272.1|DBSCAN-SWA MKNVPESVIAELRQLSGKIRTLCIENNMPCVVSYARDCDDESVSRTLVAYTDSETGAYDRSITAAIMLLKMNEAPPENFISLLKLMECKELVTNAFCSMKNESLH >NZ_CP041973|1499077:1515019|1513645_1515019_-|WP_000123686.1|DBSCAN-SWA MTAFSTLNVLPAAQLNNLTELGYLEMTPVQAAALPVILAGNDVRVQARTGSGKTAAFGLGLLHRIDVTLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGAQRDSLQHAPHIIVATPGRLLDHLQKETVSLDALHILVMDEADRMLDMGFSDAIDEVIRFAPATRQTLLFSATWPEAIAAISGRVQQQPIRIEIDTVDALPAIEQQFFETSAHEKISLLQTLLSQHQPASCVVFCNTKKDCQAVCDALNAVGQSALALHGDLEQRDRDQTLVRFANGSARILVATDVAARGLDIKSLELVVNYELAWDPEVHVHRIGRTARAGSSGLAISFCAPEEAQRANILSEMLQLKLNWLNAPARQPSLPLAAEMATLCIDGGKKAKMRPGDILGALTGDIGLDGADIGKINVHPMHVYVAVRQAVAQKAWKQLQNGKIKGKSCRVRLLK >NZ_CP041973|1499077:1515019|1500745_1501291_-|WP_000802786.1|DBSCAN-SWA MKPKDEIFDEILGKEGGYVNHPDDKGGPTKWGITEKVARAHGYRGDMRNLTRGQALEILETDYWYGPRFDRVAKASPDVAAELCDTGVNMGPSVAAKMLQRWLNVFNQGGRLYPDMDTDGRIGPRTLNALRVYLEKRGKDGERVLLVALNCTQGERYLELAEKREADESFVYGWMKERVLI >NZ_CP041973|1499077:1515019|1504234_1504771_-|WP_000640113.1|DBSCAN-SWA MIYPTNTGKSGEHLRLSTLESVWIQGKLRMWGRWSYIGGGKTGNMFNQLLASKKLTKTAINDALRRMKKAGLEKPELEVFLREMINGKQKSWLAHCTDTEALIIDRVVGEVLTDHPGLLGILNQRYVGRGMSKRRMAELLNEQYPEWALITCRRRVEQWLSIAEFILYSPMRKAFDYA |
21 | Escherichia_phage(62.5%) | holin,tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
1739444 : 1755559
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP041973|1739444:1755559|DBSCAN-SWA GTCAGTTTGTGAAGTTGCTCCCCGACCGGGAAACCATCACCAGCGGCCAGACGGAAGCAGACGTGGTGTACTGCCCACGGACCCTCAGAGAGACGCTGATATCCACGACAGGTGAAGTGGTGTAGACCGAAAAGACGACGGTCTGATACATGGCCGGAATCCCTGCGGTATACGGCATAACCTCCGCCGTTTTCACCTGGCCGTTAATATTTATCGTGACGGTGATGGCACCGGTGCCACCGTTACGCTCACAGTTAGCCATCACCGTGATGGTTTTCCCTATCTGATAGGTGGCGCTGTCGGTATACCGTGTTGAGGTGCTGCGTTCGTCGTTCGTCGCCCTGATGCTCACGCCCTGCATGACTTTTGAGCCGCAGATATCACCGACAAACTCTCTTGCTTCTATCACGCCAGAAAACTTACCGGAGGTGGCATTGATTTCTCCCGTAAACGAGCCAGATACAGCGTTGATATGGCCGCTGATATCCGCATTTTTCGCAGTCAGCTTTCCATCCGGCGTCAGGGAAAATGCCGGAGGATTCCCGCCACTGGTAATGGTCGGCGCGCTCAGGTATTTCAGGAACGCCTCATTCATGATTATCTGGTCGCCCTGCATGACGAATCCGGGCGTCTCGTTTCCGTTTGCCGGGTTAATATAAGCAATGCGATCCGCCGCCACCAGGAACTGGCTTATCTTCCCGTCAGGCGTGTCTTCCATGCTCAGTCCAAGTCCGGCCACATAATATTTGCCGTCTTTGGTCTGCTCTATTTTGACGCCCCACGTGGCGCTCCATTTATCGTTAGCGTCCTGCCACTCCTTCGAAAATTGTTGCAGTTTGCTGGCGTTATCCTCCGTCAGGTCAATTTTTTTTCAGCAACTCCGTACCAAGATACGTCTCCGTAATCAGTCCCTTAAAAAAATCCAGATACCCTTTCGCGTCATCCCCCGGACGTCCGGATGCTTCCGCAAATACTGATTTTCCAGCCAGATTTACACTGCGCACGTAAAACCAGGCATCATGCAGTGGTTTCAGTCCATCCTTTATCCAGAATGACCCGACGCCCAGATACTGTGCTTTTGACTGAATATCGGCGGCAGTCGCCAGTTGCGTGGCGGAGTACCAGAACTCATACTGCACACTGGCATCGTAGACAGTCTGGTGCGGCGTCACCGTTATCTGAAAATAACCCGGCGTCATCTCAATCGTGGATGGCGCTTCCGGTGCCTGAATACTGAATGCCACGGACGCCGGTTCACCCTGCTGCCCGTAACCGTTTATTGCCCTGACTGTCAGCGTGTAGTCACCCAGTGGCAGTTCGTGGAAGGCGTACTCCGTTTCGCTGGTCGTCGCCGTTGTCACCAGACGAACCGGATCGCCCTCGTTCCCACTGCCTGTGGTCAGCCTCACCACAAAACGCACATCTTTTACCACCCGCGGCGTGCCCCACTTCGCTTTGGCCTGATACAGGGTACTGTCGTTATCCGTGCTGACTGTCAGATGCTGCACAGCGGGCGGAATAATGCTGTTGGTGGTCCCCGGTAACGGGTCAAAGTGCGCCCCGTTGTCCACGATGGACTCTTTTTCCGGAACGTGCTGCAAGGCAGTGATGGCGTATGTGCCGTCGTCATTCTCCTTAATACGCACGCAACGGAAAAGGCGGCGCTTCAGGGAGGGCAGTTTCAGCCCCCAGATACTGTATGGCTGCACGGTTTCCGGCAGGACTTTCGTTACCACCCGATCCGGTGCGGGCTGCGACTGAATCTCCGTACTGAACGGCTTACCGTCAGGCCCGACAATATTCAGCGTGGTGGCGCCGCTTTCCGGTAGTGTTATTTCCCGGTCAAGCGTCAGCGTGCGGGTGGAAATATCCAGGTCAGTGATACGCCCACCGACCGACGCCCCGGCGTAATCGTTGTCGCAGACCTCAATAATATCGCCCGGTGTATGACGCAGACCTTCCGCACCGACAGAAAAATCCACGGTCTGCGTTTCCAGCAGCTCCGTCATCATCACCCACAACCCCGTCCGGTGCGCCTGTCCACGTGAGGTACAGCCGAACGCGTCCATTTTCAGCAGATTGCGTCCATAACGGGCCTGTGAGGCATGGTCTTCCACCAGCTCCGTGGAGGTTTGCCAGCCATTCAGCGGATCGGTGTATCTCACTTCTATCGCGTTATGGCGGTCTTTCAGGGCACTGAAGCTGTATTTAAAGCGCCCGCCCACCACGTTACCGTTGGTGTAGGTCCATGCTTTATCGGAGGGGCGGTCCTGGATGAAGGTCATTTTGCGGCCATTCCATACCGGCATACAACGCATCACCGAGCAGAAATCCGCCAGAACGTCATACGCCTTACGCTGGGTGGTAATATACGCATTAAGCGTCATGCGGGGTTCCGTGCCGCCAAATCCGTCCGGCACCGGTTGATCGCAGTACTGCGCGATGGCGTACAGCGCCCATTTATCCACATCCGCCCCCCCGATACGCCTGCCCAGCCCGTAACGGGGGTGGGTCAGTTTATCCATCGTGCACCACGCCGGGTTATTCGTGTACGCCGGTTTAAACGCCCCGTCCCACAGGCCGGTATATGTGCGGGTATCCGGGTCATAGTTTGAGGGGACCTGAAAAATACGTCCGCGCAGGTGGTAGTTACGCGTGACCTGCTGGCTGCCGAACTGTTCCGCATCCACCAGCAGACCGGCAACCGCTGTGCCAGGATAACCCTGCCGGATATCGATGATTTCCGTATACGACGACCACAGCGTTTTGTTCTGAAGCCTGTCGGTGGTGCTGTCCGGTGTCACCCTGACCATGCGGACACTGAACGGGCGCGGCGGTAAATTATCAGCCACTACCGATGCCAGATATTGTGTTGTGATCTTGCCGTTAATAGTGATATCAAATTCTGTGTTCCAGATCCCGCTACGCTGAAACTGTATCAGCAGATTCACGGAGGACGGGTTACGGTCCCCCTTGTCCATGGTCTCCTGCAGCATCTGTACACCAAAGGTGAAGCGTAGCCGGTCGACATTCTCTGAGACAACAGTACGGGTAACGGGATTATCGTGTTTCACTTCCACACCCAGCACCGTTTCCGCGCCGGAAGCCTCAAAACCTTCCAGCGGTGCCTGTGGTGTCTCCCCCACCTGATATACCACGGTCACGCCGTGAATATTACTGTTACCGTCCGCGTCCACCACCGGCGTGTTATTAATCAGCACGCTCTGCAGACCGTTCCGCTTTGTATCTGGAGGGTACGATCCGCACTGCTGAGACAGGGCTTACCGTCAGGATGGCTGTGTACCAGCGCCACGATGTCGCCGCGGTTCCGGGCATTCAGGTAATCCTCCGGGGATATACGAAAATACATCGTGGGTTCAGCAGACAGATTTTCACACGGAAAATACCGCTCTCCCTGTGCTGTTCTGACCACATAACCGCACGATTCCGCAGGCGCACACTGTCGGGCATGTGCCAGAATGTCATCGTTAATCATGGGAACCTGTTAAGACAGTTTGTTGATGGAAGCGAAAAATCCGGCATTCACCAGATTGTTACGCATTTCACAGCCTTTCATGCAGTGGCTGCATTTATCCTTTTTCGGGTCTGAGGTGGGCTTATCGAACTCATCGGCCACGGGCGGGCCGTCGTATCCGCAGTTTTCATCCCGGTAATCCCACGGACAGGAGTCCGCCAGCATGGTACGCCCCGGCACCACAGAACCGTCGGTTTCTGCCGGTGATGCCAGAATAATGGTAGCAGTTGATGAATCCAGTTCTGACAACTGCTCCACGTTATAGCGCGCTACCGCCTCCTGCTCCGGGTCAGCGCCCGGATTGCCGTTACTGAAATTCACCGCATCAAGAAACTTGCTGTAAACCTGATGCCTTACCACTGACGCGCCGACGAGACTTTGCAAATCCTCCGCCATCCCCGTGACCAGACCAAAGAGATTGGCAACAACGAGGTTCGGGCGGGGAGATGCGCCTTTCCCGTTCATCTCAAAATCCTGTACCTGTATCGGGTACGGTTCGTACTGCCTCCCCTGCCAGGTTAACGGCTCGCCTTTTTCGTTCGGTTCGTTACAGAAGAAAAAGCGCTCACCGCCAATCGCGGTTAAATCAAATTCCCACAAATCCACCTTCGCGGACTGCTCCGCTTTGGTGGTCTCGCTCAAGGTTTCCTGTGGTATATCCTGCATATATGAGAGATCCTTTATTATTTATCTTGCAAAAATATACCTGCTTTTATTAATGGTATTTACGATACAACCAAAAAACGAGGTAACTAATGAAATACACAATATTGTCGCTGGTAGCTGGTGCGCTCATCAGTTGTTCAGCAATGGCAGAGAATACCCTGACTGTAAAGATGAACGATGCCCTGTCCAGCGGAACAGGAGAAAACATAGGTGAAATCACAGTTTCAGAGACACCTTACGGTCTGCTTTTCACTCCTCACCTAAATGGTCTTACGCCAGGAATTCACGGCTTCCATGTCCACACAAACCCAAGTTGTATGCCGGGAATGAAAGACGGTAAAGAGGTTCCGGCGCTCATGGCCGGAGGACATCTTGACCCCGAAAAAACCGGGAAACATCTTGGCCCATATAATGACAAAGGGCATTTGGGGGATCTGCCTGGACTGGTTGTCAATGCAGATGGTACAGCCACGTATCCGTTACTGGCACCACGCCTTAAATCACTGTCAGAACTGAAAGGTCACTCATTGATGATCCATAAAGGCGGTGACAATTACTCCGATAAACCTGCTCCACTGGGTGGTGGCGGTGCACGTTTTGCCTGTGGTGTCATTGAGAAATAACAGCAACATAGCCATATCGTCATAATTTCGTTTTACCCATAAAAAAGCCCTCTCACTGGAGGGCATTAAATCTGTATCGATGTTAAAGGTCAGAAGCTGTAACCTACGCCAAGCACCTAGGTTCCAGCTTTGACGTCACTGTCAGCATCAGTGGAAAAACTTGTATGCTCATAAGACGCATTAACGGCAATATTTTCAACCGGGTTAAGCTGAATACCTGCCCCATAAGCAAAGGCGGTTTTATTGTCAGAATTTCCCCAGTTATCCTTAATATGTCCGTTTGCTGCACCAATCATCACGTAAGCATTCAGATAGTCGTTAAAACGGTATGAAGGACCAACAAGAAGGGAGGTATAATCAGCATCACCTACCTTAAACGCTCAGCACCTCGCTGGCCGGGATCAGAGTTCTCATCTCCTTTTCCAGCTCAATACGCGTCATTTCCGACTGAAACCATGCCCGTCGATCTGAGGGTTTCATCTTATTGGGGTCATTCTCCCCGGTAACAGCCGGCATCGTCATCATGGCGGTCAGAATATCCACCAGCCGGTAGATTTTCAGGTTACTACCGTTCCCACCTGAGGTTTTTACGCCCTTCAGCCTGCTGGCGATGGTCTGTCGGTGTGCACCAGTGATGGCCGAAAGCTGTGTGATGTTTAATTCGAGGCTTTTAATTTCCTGATCCACAGTCGTGCTCTTTTCCTGTATACGGTGAAAATGGCGTTCAGTGTCGAACAAAAAACGTACCACCTCGACACTGAAAACAGTAAATGTATTGATTTTTAAGGTTATTTTTCAGTGCTGACAGAGACTAAAAAATCAAAAATCAGCCGATTCCCGCGAGCCCGAAGCCACCCGTGGCGCCCCCTGCCCGGGAGTACCTTTTTAATACAGTCACCATTGGTTACTAGTTTTTCCTGCATTACCGTGGCATTGGGTGCGAATGTACGCCTGCGCCCCTTCCAGTTGCTTTTGCATCGTCTTCACTCGCTCTTTGAGGGCGAAATAATCCCGTTGAGCGGAGTCTGCCAGTCTGGGGCTGGCTGCATTATCCATGCGGGCGGTGGAGGTGGATTTACCTGTCGGCACTGCGGGGCATGTTGCGTTGACGTACAGGCGACGGCGGCCAGCGGCAACATCATCGCGCAAAGCATCATTCTCAGCTTTCGCATCGGCTAATTCCTTCGTGTATTTTTCATCGAGGGCGGCAACGTCACGCTGGCGCTTAGTCATGTCGGTAATTGTCGCGTTCGCCAGCGTCAGCCTATGAGTAACGGTATCGCGCTGCTCTTTGTAGGTGATGGCGTTATTTCGGTAGTGATTTGCCAGCCGACCGGCAACAATTAGCGAGACAAGCAACAGGCCAACAAACATCGTTTTCCAGTTGAACATCATGACAGGAACAGAGCACGCTCCGCCTCACGCCGACGGGTAAGCCCGTTCAGTACTTTGCCACCAGCCTTATTCCAGCGCAGGAACTCATCAGCGGCGCCAGCGTAATCACCAGCGTTTAGCTTCCGCAGCAGAGTTGATGAGGATAATGTCCGGGCGCCGAGGTTGTACGCGAACGACACCAGCGCATCAAACTGGCCTTGCGTCAACTTGACCTTAACCAGTCTGGACACATCATTTTCATAACCGACTAAACCAGTGTTAAGCAAGCGCTCGGCAGTAGCCTCGTCAATCATCATTCCGGGCTTAACTGGCTTACCGTCAACAGAGTGGGTCCAGCCATAACCAATCGTCCAGGGATCTCCCCCCGTTCCCGGGTCCGGATAAGCTGTCAGGCTACAACCTTCAAACTCTTTGATTAGGGTAATGCCTTTTTCACTGATTCTCATCATTAACCCCTGCACGTTTTTTGAGTGCGCTAATTGCGATTTCGCGCAGCTTGTCCACACCGACAAAGCCAATAATTCCGCCAACGAAAGGCGAAATGGAAACCGGCAGGCCTACCACATCAAGCGCACTGGTGACACATAAGGAAAGAGCGCCACACAGGACGCCCTCAAGCCATTTATTTTTACGGGTGGCGCCGTCGTATATCAGTCGGCCGTAGGCAATGAGTCCGGCCATTAACGCCCCAAGTATCTGGGGCCACGCATTTTTGAGTCCGGTCAAAACCGCAGCCCAGAATTCAGGAGTCTTGTCATTCATTTTCATAAGCCTCACCTCCGATGATTTCGGATGGTAACTAGAGTGAGTGAAATGGTTGGGTTGCAGGGTTTAATATCTTGTAAAACAGGATTGCCTGTGGTTGCAGAATCTGAAAGTAAAATCACGCAGAGTACAATTTTAATGGAGGTGAGGCACAAATACTGCAAATTTAGCTTTTAGTTTAATTGATTGCGTGCTGAGTGAATTCTGTTTGACAAAAACATGCTATTTATAGAATGTTAATTCCATGTAATAAAAAGGATGTGTAACTCATCATGCCAACGGGAATTAAACCAATATTTATCAATAATATGATGTCAACATATGGATTATCCCATCCTCATGACAGCAAGGTATTTCCAGACCTTCCAGAACACCAAGATAATCCTTCGCAATTACGCCTCCAACATGATGGTCTTGCTACCGATGATAAAGCCAGGCTGGAACCAATGTGTCTTGCTGAATACCTTATCTCTGGACCAGGAGGAATGGATCCTGATATCGAAATTGATGATGATACCTATGATGAATGCCGTGAGGTGCTATCACGCATACTTGAAGATGCATACACTCAAAGCGGGACATTCCGCAGACTGATGAATTATGCCTACGATCAGGAATTGCGTGATGTAGAACAACGCTGGTTGCTGGGAGCCGGAGAAAACTTTGGTACTACCGTAACTGATGAGGACCTGGAGAGTTCAGAAGGCAGAAAAGTGATTGCCCTCAACCTGGATGATACAGACGATGATTCAATACCAGAGTACTATGAAAGTAATGATGGCCCACAACAATTTGATACAACACGCTCATTTATTCACGAAGTTGTACACGCGTTGACTCACCTTCAGGACAAAGAAGACAGTAATCCAAGAGGCCCGGTAGTCGAGTATACCAATATCATTTTAAAAGAGATGGGTCACACATCACCACCAAGAATCGCCTACGAATTTAGTAATTGACACTCATCAAAAAATGCAAAATCCCACGATGCTACAACACAGTAACCAGTTCAGGTCTGAGCTAATACAGGTCAGCAGTCCATAGACACTGGCTCCTGTCAGGATGCCACCTGCTAACCCAGTACCAGAAATCGATTCGGACATTCATCCCCCTCTGGTTGTGTGGGGCCTCTCAGTTATGAGGGGAAATAATAAATATCCTCCGGCATAGCCGGAGGATATTTATTCATAAAGAACACAATTAAGAATAATACCGATTTAATTAAAATAACTTGATCTCACAGTTGAAGAATGAATAATAGCGAGCCCTGCCAAGGCAGGGCATAGAAATAACCAACGAGAAGAAATAGGTAGGAACTAATGAAAAACACCGCTCTGGGTAAGTTCATTTTTATCGTCGGCACCGCGTTACTGCTCGGTGGCTGTAGTGGCATGGTCATGCCTCCCTATGCCACCCACGGTACATCGGTCGGAATCATTGCGCCAGCGGGAGGCTATAGCGAGTGGCACACGGATAGCCGCAACCACACCACAGGAGACAGTCACAGCCAGTCACAGGGAAACTGCACCCAAAGTGAAGATAGCCAGCTCAGCGAAAATAGTCTCACACGGACACACCAAAGCAACTGTAACACCCGTAGTCAAACCCACAGCAGTAGCACCAGCAAAACCCGCTCCAGCAGCGTCGGTTTCAGCGTCGGGGGGCCTGTTGGTGCTAGCATAGGGTTGATCAAGCAGATGGAGTCGATGAACCGTGCGCCAGCCAACGATATGAGTAGTAATGAGATGTTCAAGAATTTCGGTTTCTAGCACATAACGCCACCTGGTACCGTTGTGGTGTCTGGCCCGGCGGCTATCTGTAACGACTCACAATCGAAAAAAGTCAGACTCGCAATCAGCGCAAAAATAGATTGTGAGTCCATTGAATGGGGATCGTTGTGCATTTTCATAAGCCTCGCCCCCGATAGCTTGGATGGCGCTGTATTTGTAAGGGTGAGAGGCCCTCGGGCGGGGTTTTAACAACGAAGCGTGTAGATGATGATTTCCGAGGGCTGAATAAAAAAACCGGCGAAAAGCCAGGAAGATGTAAATAAGGCCATTTCGACTCTGTGGACGAAGATACCCTAACATTAGTTTGATGTGTGGCAATTCTTACCGAGGGTGTTGAGCAATCCTCTCTAAACTATTTCTGCGAGGCTATATAAAGTTCATGAGTATCAGCTAACTGCACAAATTTGTACTTAATAGCCCCCAATAAAGTACCTAATCTTGTATGACCCTCCATAAGGTGTAAGCCGCTCTCTCCAGGAATAATAAGCGAACGCTCAATAAACATCGGTGGTTCAGCCCATGTACCGAATTTAAGCCAATGGTTTGCAACCTCTTCACGGGCATCAATGCAAAACTTGCTGCCGCAGGCATTAAAGTCTTCTGAAATCTCGAGCATGTAATCAGGATATGTGGCATTTCTGCCAAACTTTGTAAACTCTGCTGTTTTCAATCTGACCAAATCCCACTTCAGTGATTTAAGATTTAGATGCCCATACAAGGTTTGAAATTCAGAATTATTAGATAACCCACAATAAATTTGCTTAAAAATTTGTTCTGGAGCTTCGATCCCATATTGCTCACGAAGGATGGCAATTCCTTCTTCTTCCTTATACAACGGGTCGGGACCAAAAACTTGAAATAAATCACGATAAAACATCATCAATTCCTATTGCTGATTAATACAAAAATCCCGTTCCTCAGCAGGCTTGCATATTTTAGGCATGATATCAAATTTACATGAAATATATGTATTTCAGTTCGGTTTTGCAAGACTTATATCCAAATTTGTCGCCTTTTGTTGTGAACGCGATCGTGTTACGGAGATAAGCGCATCACTATCAAGCCGTTTAAAGCTGTTACGCATTACTAACCAATGAGACATGTACGTCTCTGTCCAGGTAGATTTTGCCACGCCCATCAGTTCCGCCAACGTTTGGTACTCATACGTCTCACGTCTGGCCAGTTCCGCCTTCACATCCTGCGCAGCAAGCCAGATGAGCTGGCGCAACCGTGCTAAAGTCTTCTTCGCAACTCGCTTCCCTTCCAACTGCTGGCTGAATTTTTCCCAAGCCCATCGAGTTACAGTGACCTGATATTCCCAGCAGGTGTTTTCACTGTAATTCCACAATAGCCAGGCCTTATTATGTTCTTCGAGTGACAAAAGCGCACGGCGCCATGAGGAAGTGGAGTATTCTACGGGCTGCACAAGCGGAATTGCGCTTCCTTTCGCCAGTGATTGCTTACCAGCAATCGGTGGATTATTCAGCGTTATCATTTTTCCGGTCACTTCATCCCTGATACGCTGTTTTTTTCGGGGATAGTTTTTCGTATAGAGCTGGGCGTTCTCCAGCCATGCCTGCAACTGACCTTTCGTGGCGCCACTCAGATCTGCAGTCGCCACGATGAGCTGCTGTCGCACATATTCCAGATATTGTGTATTCATACGGTACCGCCCGTTATCTTCACGTAGTTTTTCAAAATCCGGTAATCGATCAGGATGGAACCCGGAAATGGTATAAGCACAACTGCTACCAGCGAGCACGGAGATGATCGGCAAAGTAGGATTCGAATGTCATGCCGCCTCCAGCTTTTTTAGCGCACGCAGATCCGCCAGTGCAGCGATCCTGATTTCTTTCAGCTCCTCAACCGTCCAACGTTGCGGAGTATTATTGCTCTCAAGTTCCAGCACCTCCGCCTCACCGTAACGCTCAACCTTCACGCCACGGCGCCGGTATCCCGCCACCAGCTCATCCGCCTCTTCGGTGGTACACACCGGATGCTGAAACCATGTCATTTTCATGCGAACTCCAGCAGATGCGCGGCCACATTTTCAACTTCTTCCAGAGAGGAAAATTTACGAAACAGAATCCAGTTCCACAGGACGTTCAGCACAGCTTTATAGACCTGTTGAAACTCGGTTTCGTCCATACTGGCGAATGCTATGGATTTCGCCCGGCGCCCGCGACTGCCATCCGGATAAAAATGCTCGGTATAAAACCCGGCCTGAACGGTTACCCATTCCCGGAAGGCATCGAAAGACTTAAGAAGGGCGACGTCCCCGGTTCGCAGGGTAGCTACGTTATGGAGGTACTGTTCCGCCGCCTCGTTAAGGGCCGGGGTATATTCCTGGCCTGCGGAGTCGCAAAGAAAATTAACGAACCCGGAGATAAGTTTCTGTTCCCGCGATGTGACCGTGCCGCCCGTTGGCATCCAGTAGTCGAAACCAAGCTGAAGGAGTTTAAAAAATCGTTTATGAAAGGCGTAGTTGCGGACACGTTTAAAATCGGCGTGTATCCACTCACCGATTTTTACTGAGCGCAGGAAATCCCCACTCTCCGGCGTCGCCGGGAGCAGAAGCCCTGATGAGGTTTGCTTGACCAGTTGTAAATGCGCCATCGTTCTCTCCGTTGGCGCAGTAGATTGGGAGTTCAGCCCGCAGACGAGTATAACAAAGGATGATTATTCATGATAACCGGCCCTGATAGTCAGCTCATTAATCAGGGTATCGCTCCCCATGATGTCATTTTGCAACAACGGCAGAAACCGGACATAGCGGCCATCCCGATACATCAATGACCTGTTGCAGTCAGGAAAAAAATCCATTTCAGCAATTACTGTCATGTCATCACGGCGAATAACAGCATATTTACAAGTGAATGTTTTATTTAAATTTTTCACGGTGTCTCCATAGATAACGAACTTGAGCATTTTTAAAGCATCTTCATTCTCAACATGAATATATAGGAGACTATTAATTATCATCATCAATAAATATATCTATTTTTTGACCATATGCAATGACATTTTCTCTGTGTTCTATTTATAATCTTATAACTGGTTATTTTTTGACATGCTCATTTCCCGGACATTAAAAAAACCGCCGGCGCAGGTATTAAGTGCGGGTACATTGAGGTTGTCTGACACATCACAGGTGATGGAGATTCATCCCCCAAGGTCTCTTACTTAGCAATGAAGACAACTACCTCCTCTCTGTCTGGCCGGTTCGATCGCAGTCTCTCCTCGTTACTGGTGCAGTCACTGTGACAGTGATGCAGATGATAATCAGGACGATTAACATCGCTGCGGTTGACTTATCCGGCAAAATTATGCTGCCATGATGCCAGTTAACCATACTGGCATCATGGCCAACCGGCATCGAAAAGCATGTTGGCCAGACTCGCAGGCCATTGAATCACGCAACAACCAGTTACTGTCATCTGATGAAAAAGGCTGTGCATAACAGAATCAGAACTGACTGGTATCAGGGCCATGTTCTTCAGCAGCAAATACATAAGATGAAGCAAGATATAAAGAATGAAGGGAAAATAGAGTATAAAAAACGTACAGAATTGTCTGAAGTAACTTCCCTGCAGCATTGACGCCGCAGGGAATCTATTTATGGTGTAACTATATTGAACCAGAACTCAAACTTGTCCATATAGCCCAGCATCTCATCCAGTTTCGCAGCATTACCGGTAACGTTGACTTCTCCTTTATCTTGAGCCTGCTTCAGAGTTTCTTCCTTCAGGATAATTTTATTCAGCGTGTCACGGTTCAGAGTAATCGTGGCATCAGCATCTTTCGCTTCAGCATTAGCCGTGTGGTTCAGCACGCCATTTTCCAGCTCAAGCTTGTACTTTCCGCCGTCGCTGCCAAGGTCAATATTAAATACCGCCCGGGCATTACCCGCTTTTTCACCGTTGATATGTACAGCCAGAAAGTCGAAGAACATTTCAGGGGTCATCGCCCGAACGGTATCCGGACTTGCTGTATTTGGCGTCGGACCTTTAACCACACCGTTACGCAGCTCCTGCGCACCGGTCAGGTAGAAGTTACGCCATGGACCAGATTCAGCCTGATACCCCAATTGCTCCAGCGCATCGGCTTCAAGGTTACGTGCATTCTGGTTATTTGGATCGGCAAACACGACCTTACTCACCACCTGAGCAACCCAACGGTAGTTCCCCTGGTCAAAGTCTGCTTTAGCTTTCTGAAGAATCGCATCGGCACCGCCCATGTATTCAACAAATTTCTTGGCCGCTTCTTCGGGTGGCAGCTCATCAAGGGTTGCCGGATTGCCATCGAACCAACCGAGATACAGCACATACGTTGCTTTTACGTCATGGCTGATGGAGCCGTAATAGCCGCGGTTGGCCCAGGTTTTTGCCAGGCTATCCGGTAGTTTAAAGTTGGCCGCTATTTCGTCGCGAGTCAGACCTTCATTGGCCATGCGCAGAGTCTGGTCATTGATATAACGATACAGGTCTCGCTGGCTTTTCAGCAGACCAACAACATTCTCGTTACCCCAGGTCGGCCAGTGGTGCTGGGCCATAATAATTTCAGCTTTGTCACCCCAACGCACTATAGCTTCGTTGATATATTTCGACCACGGCAACGGCTCACGAATTTTTGCGCCACGTAGCGAGTAAGTGTTATGCAGGGTGTGAGTGACGTCCTCTGCGGCTTCGATGAGTTTCTTCTCTTCGATGAACCACAGCATTTCCGAAGGGGCTTCCGAACCAGGGGCCAGCATAAAGTCGTAAGTCAGGCCATCAATCACTTCTTTCTGGCCGTCTTTATCGATGATATTAGTGGGCGCAATCAGTGTCACCGTCCCCGCAGAGGTGGTCGTCCCCAGTCCGGCGCCAACCTGGCCGGAGGCATCTGGTTTCAGGAGGTTGCCATACATATAGCTGGCACGGCGGCTCATCACGTTGCCGGCCATAATATTCTCGGCTACTGCTGCCTCCATAAAGCCAGCAGGCGCATACACTTTCACCTTGCCGGATTTCACGTCCGCTTCATCGACAACGCCACGCACACCGCCATAGTGGTCAACATGGCTATGAGTATAAATGATGGCGACAACAGGCTTATTGCCACGGTTTTTGAAATACAAATCCATACCGGCTTTGGCTGTTTCCGCAGAAACCAGCGGATCGACAACCGTAATCCCCTCTTTACCTTCGATAATCGTCATGTTGGATAAATCAAGGTTACGAATCTGGTAGACGCCGTCTGTGACTTCAAACAAGCCACTGATATTGATTAGCTGGGACTGACGCCACAGACTAGGGTTAACAGTGTCAGGAGATTTTTCCCCTTCTTTTATGAAAGCGTACTGCTGTGGATTCCAGATGACATTCCCTTGCTCTCCCTTAATCACCTCTTCAGGTAAACCAGCGATAAAGCCTTTATGGGCATTCGTGAAATCGGTGTTATCAGAGAAAGGAAGTTGGTTATAAAGCGCATCGTTAGCTTGCTTGGTTGAAGCAGTGGCACCTTTTGGGGCTTCCTGTGCAAATAAAGGTGTCAGCGCAGTGGAAGAGAGTAGCCCCGCCAGCGCAAAACTTTTAACGATCAACTTAAGTCTCATTTGTACCCCTCATGTAAAAATATTCTGTATCACTCAGTCTGGTAGATTAATTATCTGTTAATTCAAACAATTAAAGTTATTGCTGACCATTTTCTCTCTTTTAAATATAACCAAAACGTTACATTTCGCTATTTATGGATACAAATAAATCGTGTTTTACGTCAGCCAGTTCCATCCTCTTTTAGTAAGTGGGGTAAGCTCGCTTCCCGTTTCCGGGACCCTGCCTGACTGAAGAGCAGGCTGACAGGATACGCGCCGCGCATTGGGCAGGATTTACAGGACGGCGCGCCGGAGATGTAAAAGTTTTACCGGGCGGCGGTCAGCAGTCCTTTAAATTTTACCCTGATCATTGATGTTCAACCCTGACCGACCGCCACACCGTATAGTTGGCGGCGGTCATGAAGTAAAGAGACATGACTATGAGCTTTGTGAGACTTGAAACCTGGGGTGAATTAAATTATCCCGATGATCCACCACCTCTCACAACACTAAGACGATGGGCGCGAAACGGAAATATTTACCCGACTCCAGTATTACATGGCAGGACGTATCGGGTTGATCCGGACGCGTTTTATATCAAGCCGAATAAAGTGGGACTTGTGCTTGAACAGCACCACCCAAACGGGCGCACCGGAAAACCGAGTGCATTGCTGGAGAAGTTGATCAGTGAGTCGAAAAAAGTACGATGCTAACCTTCCGAGGAACCTCACCTACCGTAAGGCCAGTAAATCTTTTTTCTGGCGTAACCCGCTAACTGACAAGGAATTTCCGCTCGGTCAGATCGCCCGCAGGGACGCTATCACACAGGCCATAGAGGCAAACAACTTCATAGCGCAAAACCACACACCAGTGGCGCTTATTGAAAAGCTAAAAGGAACTGACTCATTCACTGTGTCCGCATGGATTGATCGCTATGAGGTTTTATTACAGCGCCGGAGTCTGTCGGTTAATACCTACAAGATTCGCGGTAATCAATTAGCGACCGTACGCGAAAAAATGGGGGAAATAATACTGGCAGAAGTAACAACCAGGCACATTGCCAAGTTTCTTGAGTCGTGGATAACCGAGGGAAAAAACACTATGGCGGGAGCAATGAGATCAGTTCTATCTGACATGTTCAGAGAGGCTATTGTCGAAGGGCATATTGTGAAAAACCCGGTGGAAGCAACCCGGATACCAGAGATTAAGGTGGCCAGGGAACGCCTGCAACTGGAAACGTATAACGCCACACGAGCGGCAGCAGAGCATATGCCTGCATGGTTCCCTCTCGCGATGGATTTAGCGCTCGTTACTGGTCAACGTAGGGAGGATATCGTAAATATGAAATTTAGTGATGTTTTTGACAACCGCTTATACGTAACTCAGATTAAAACCGGAATGAAAATAGCCATTCCCCTCTCCCTGACACTTCGGGCGACGGGGTTACGTCTGGGAACGGTAATCGATCGCTGCCGACTGGTAAGCCGCACTGATTTCATGATCAGTGCCGGAATCAGGAAAAATAGCCCAACCGGGAATATTCATCCGGATGGATTGACAAAGACATTTGTAAAAGCAAGAAAAGCCTCCGGTGTTAACTTCAGCAATAATCCACCGACATTTCACGAGATCCGAAGTCTGGCCGGGCGGCTGTACAAAAACGAGCACGGCGAGGTGTTCGCCCAAAAACTCCTGGGCCACACATCAGCGAACACCACGAAACTCTATCTCGATGAGCGTGATGATAAAGCTTATATGATGCTCTAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP041973|1739444:1755559|1748569_1749094_-|WP_001574213.1|DBSCAN-SWA MFYRDLFQVFGPDPLYKEEEGIAILREQYGIEAPEQIFKQIYCGLSNNSEFQTLYGHLNLKSLKWDLVRLKTAEFTKFGRNATYPDYMLEISEDFNACGSKFCIDAREEVANHWLKFGTWAEPPMFIERSLIIPGESGLHLMEGHTRLGTLLGAIKYKFVQLADTHELYIASQK >NZ_CP041973|1739444:1755559|1754479_1755559_+|WP_000087636.1|integrase|DBSCAN-SWA MSRKKYDANLPRNLTYRKASKSFFWRNPLTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRGNQLATVREKMGEIILAEVTTRHIAKFLESWITEGKNTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRAAAEHMPAWFPLAMDLALVTGQRREDIVNMKFSDVFDNRLYVTQIKTGMKIAIPLSLTLRATGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSANTTKLYLDERDDKAYMML >NZ_CP041973|1739444:1755559|1750896_1751202_-|WP_000972675.1|DBSCAN-SWA MMIINSLLYIHVENEDALKMLKFVIYGDTVKNLNKTFTCKYAVIRRDDMTVIAEMDFFPDCNRSLMYRDGRYVRFLPLLQNDIMGSDTLINELTIRAGYHE >NZ_CP041973|1739444:1755559|1754226_1754505_+|WP_001575998.1|DBSCAN-SWA MTMSFVRLETWGELNYPDDPPPLTTLRRWARNGNIYPTPVLHGRTYRVDPDAFYIKPNKVGLVLEQHHPNGRTGKPSALLEKLISESKKVRC >NZ_CP041973|1739444:1755559|1747746_1748196_+|WP_000798708.1|DBSCAN-SWA MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWHTDSRNHTTGDSHSQSQGNCTQSEDSQLSENSLTRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF >NZ_CP041973|1739444:1755559|1742948_1743644_-|WP_001152416.1|tail|DBSCAN-SWA MQDIPQETLSETTKAEQSAKVDLWEFDLTAIGGERFFFCNEPNEKGEPLTWQGRQYEPYPIQVQDFEMNGKGASPRPNLVVANLFGLVTGMAEDLQSLVGASVVRHQVYSKFLDAVNFSNGNPGADPEQEAVARYNVEQLSELDSSTATIILASPAETDGSVVPGRTMLADSCPWDYRDENCGYDGPPVADEFDKPTSDPKKDKCSHCMKGCEMRNNLVNAGFFASINKLS >NZ_CP041973|1739444:1755559|1746699_1747386_+|WP_001574215.1|DBSCAN-SWA MPTGIKPIFINNMMSTYGLSHPHDSKVFPDLPEHQDNPSQLRLQHDGLATDDKARLEPMCLAEYLISGPGGMDPDIEIDDDTYDECREVLSRILEDAYTQSGTFRRLMNYAYDQELRDVEQRWLLGAGENFGTTVTDEDLESSEGRKVIALNLDDTDDDSIPEYYESNDGPQQFDTTRSFIHEVVHALTHLQDKEDSNPRGPVVEYTNIILKEMGHTSPPRIAYEFSN >NZ_CP041973|1739444:1755559|1746094_1746424_-|WP_001574216.1|holin|DBSCAN-SWA MNDKTPEFWAAVLTGLKNAWPQILGALMAGLIAYGRLIYDGATRKNKWLEGVLCGALSLCVTSALDVVGLPVSISPFVGGIIGFVGVDKLREIAISALKKRAGVNDENQ >NZ_CP041973|1739444:1755559|1750009_1750237_-|WP_000784710.1|DBSCAN-SWA MKMTWFQHPVCTTEEADELVAGYRRRGVKVERYGEAEVLELESNNTPQRWTVEELKEIRIAALADLRALKKLEAA >NZ_CP041973|1739444:1755559|1751833_1753813_-|WP_001237395.1|DBSCAN-SWA MRLKLIVKSFALAGLLSSTALTPLFAQEAPKGATASTKQANDALYNQLPFSDNTDFTNAHKGFIAGLPEEVIKGEQGNVIWNPQQYAFIKEGEKSPDTVNPSLWRQSQLINISGLFEVTDGVYQIRNLDLSNMTIIEGKEGITVVDPLVSAETAKAGMDLYFKNRGNKPVVAIIYTHSHVDHYGGVRGVVDEADVKSGKVKVYAPAGFMEAAVAENIMAGNVMSRRASYMYGNLLKPDASGQVGAGLGTTTSAGTVTLIAPTNIIDKDGQKEVIDGLTYDFMLAPGSEAPSEMLWFIEEKKLIEAAEDVTHTLHNTYSLRGAKIREPLPWSKYINEAIVRWGDKAEIIMAQHHWPTWGNENVVGLLKSQRDLYRYINDQTLRMANEGLTRDEIAANFKLPDSLAKTWANRGYYGSISHDVKATYVLYLGWFDGNPATLDELPPEEAAKKFVEYMGGADAILQKAKADFDQGNYRWVAQVVSKVVFADPNNQNARNLEADALEQLGYQAESGPWRNFYLTGAQELRNGVVKGPTPNTASPDTVRAMTPEMFFDFLAVHINGEKAGNARAVFNIDLGSDGGKYKLELENGVLNHTANAEAKDADATITLNRDTLNKIILKEETLKQAQDKGEVNVTGNAAKLDEMLGYMDKFEFWFNIVTP >NZ_CP041973|1739444:1755559|1745161_1745641_-|WP_001541990.1|lysis|DBSCAN-SWA MFVGLLLVSLIVAGRLANHYRNNAITYKEQRDTVTHRLTLANATITDMTKRQRDVAALDEKYTKELADAKAENDALRDDVAAGRRRLYVNATCPAVPTGKSTSTARMDNAASPRLADSAQRDYFALKERVKTMQKQLEGAQAYIRTQCHGNAGKTSNQW >NZ_CP041973|1739444:1755559|1749190_1749880_-|WP_001097218.1|DBSCAN-SWA MNTQYLEYVRQQLIVATADLSGATKGQLQAWLENAQLYTKNYPRKKQRIRDEVTGKMITLNNPPIAGKQSLAKGSAIPLVQPVEYSTSSWRRALLSLEEHNKAWLLWNYSENTCWEYQVTVTRWAWEKFSQQLEGKRVAKKTLARLRQLIWLAAQDVKAELARRETYEYQTLAELMGVAKSTWTETYMSHWLVMRNSFKRLDSDALISVTRSRSQQKATNLDISLAKPN >NZ_CP041973|1739444:1755559|1750233_1750833_-|WP_000940753.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESGDFLRSVKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWMPTGGTVTSREQKLISGFVNFLCDSAGQEYTPALNEAAEQYLHNVATLRTGDVALLKSFDAFREWVTVQAGFYTEHFYPDGSRGRRAKSIAFASMDETEFQQVYKAVLNVLWNWILFRKFSSLEEVENVAAHLLEFA >NZ_CP041973|1739444:1755559|1739444_1740308_-|WP_072100756.1|DBSCAN-SWA MDLTEDNASKLQQFSKEWQDANDKWSATWGVKIEQTKDGKYYVAGLGLSMEDTPDGKISQFLVAADRIAYINPANGNETPGFVMQGDQIIMNEAFLKYLSAPTITSGGNPPAFSLTPDGKLTAKNADISGHINAVSGSFTGEINATSGKFSGVIEAREFVGDICGSKVMQGVSIRATNDERSTSTRYTDSATYQIGKTITVMANCERNGGTGAITVTININGQVKTAEVMPYTAGIPAMYQTVVFSVYTTSPVVDISVSLRVRGQYTTSASVWPLVMVSRSGSNFTN >NZ_CP041973|1739444:1755559|1743733_1744267_+|WP_000877926.1|DBSCAN-SWA MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGMKDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKPAPLGGGGARFACGVIEK >NZ_CP041973|1739444:1755559|1745658_1746111_-|WP_000984586.1|DBSCAN-SWA MMRISEKGITLIKEFEGCSLTAYPDPGTGGDPWTIGYGWTHSVDGKPVKPGMMIDEATAERLLNTGLVGYENDVSRLVKVKLTQGQFDALVSFAYNLGARTLSSSTLLRKLNAGDYAGAADEFLRWNKAGGKVLNGLTRRREAERALFLS |
16 | Salmonella_phage(30.77%) | tail,holin,lysis,integrase | attL 1736074:1736103|attR 1755695:1755724 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
1927944 : 1968640
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP041973|1927944:1968640|DBSCAN-SWA TATGCTTCCGGTCACCTACAGATTAATACCTCAAAGCGGAGTATCCACATATGGATTAAATACCGCAGATACACCTGTTTTCCCCGATATTCCCGAACATGCACCAAACCCCTCACGGCTACGCCTTGCTCATGACAGCCTTGCCATAAACAGTGAATTCCGTCTGGAGCCAGAGTGTGTGGTGGAATACCTTATCTCAGGCGCGGGTGGAATAGACCCTGATACAGAAATTGATGACGACACTTATGACGAATGCTACGATGAACTATCCTCCGTACTTCAAAATGCGTATACCCAAAGCGAAACATTCCGCAGACTGATGAGTTACGCATATGAAAAAGAACTACATGATGTGGAGCAGCGCTGGCTACTGGGGGCAGGCGAAGCCTTTGAAACTACCGTGGCTCAGGAACACTTCAAACTTTCAGAAGGCAGGAAAGTTATTTGTCTCAATCTGGACGATTCTGATGATTCATATACCGAACATTATGAAAGTAACGAAGGAAGACAACTTTTTGACACAAAACGTTCATTTATTCATGAAGTTGTACATGCACTGACCCATCTTCAGGATAAAGAAGAAAATCATCCAAGAGGCCCTGTTGTCGAATATACCAACATTATTCTGAAAGAGATGGGGCATCCTTCACCTCCCAGAATGGTCTACATCTTCAATAAATAGACACATCAGGAAACGAAAAGAAACTAAAAACCCACGTAGTCCGTTTTTTCGGGAAATGTTCTAGCAGTATTTTCTAACTATATTCTAAGCACCCAAAAAACAAAGGGGTTACCTTTCGGTAACCCCTTGTTTAATCTGGCGGAAGCGCAGAGATTCGAACTCTGGAACCCTTTCGGGTCGCCGGTTTTCAAGACCGGTGCCTTCAACCGCTCGGCCACACTTCCGGAATGAGGCGCACTATAAACATCCCGGTGCGTCATGTAAAGACCGAATGTGTTCGTTTGCGTGAAAAACAGGCAAATTTTCGTTAATTGCCTGAAATAGCGGCACATTGACCATTTCTTCAACACAAAAACGGGCTTAATACGTGTTGATCGTCCCCACTGTTTTACCGCTGTGATTATCAATTTTTTATCGGTAACGCGTGGCTGGCGCAGCCGCCTGTTTTATGGCGTAGCGCCAGGCATGAGCAGACCGTTTTAACGACGGTAATCACCTGAAATCTCTAAAGGAAACTAAACGTAAACGAATGCGTAAAGATGGGTAAAACCACTGGGTGACAGTGACTGTTTTTATTATTCTCCTTTATTAATTACATCTGTCATAAGAGAGTGACTAATGGATCGTATCATTACATCATCGCGTGATCGTAGCTCGCTACTGAGTACGCACAAAGTACTGCGCAACACCTATTTTCTGTTGAGCCTGACGCTGGCTTTCTCCGCAATTACCGCGACAGCCAGTACGGTACTGATGCTGCCCTCCCCCGGCCTGATTCTGACGCTGGTCGGTATGTATGGGCTGATGTTCCTGACCTATAAAACCGCCAATAAGCCGGTCGGTATCCTGTCTGCTTTTGCGTTTACCGGCTTCCTCGGGTATATCCTGGGCCCGATTCTTAACGCCTATCTGTCAGCAGGCATGGGCGATGTCATTGGCCTGGCGCTGGGTGGTACCGCGTTAGTATTTTTCTGCTGCTCCGCTTACGTTCTGACCACCCGTAAGGATATGTCTTTCCTGGGCGGTATGTTAATGGCCGGGATCGTCGTCGTGCTGATTGGTATGGTGGCGAATATTTTCCTGCAACTGCCTGCATTGCACTTGGCGATTAGCGCGGTGTTTATCCTGATTTCCTCGGGCGCTATCCTGTATGAAACCAGTAACATCATTCACGGCGGTGAAACCAACTATATTCGTGCGACCGTCAGTCTGTACGTTTCGCTGTACAACATCTTTGTCAGCCTGCTCAGCATTCTGGGCTTCGCCAGCCGCGACTAATCGACATTCCCCTCTTCCAGCCCTGCTTATGCAGGGCTTTATTTTTTGCTACACTGCCGTCCGTGTTGATTACGCAGTGAAACGTTATGTTGATCTTTGAAGGTAAAGAAATCAGTACTGATAGCGAAGGCTATCTGAAAGAGACGACGCAGTGGAGTGAAACGCTGGCAGTCGCTATCGCCGCCAATGAAGGCATTGAGCTCTCTGCGGAACACTGGGAAGTCGTGCGCTTCGTGCGCGAATTTTACCTGGAGTTTAATACCTCTCCCGCCATCCGAATGCTGGTAAAAGCGATGGCGAATAAGTTCGGCGAAGAAAAAGGCAACAGCCGCTATCTGTATCGTCTGTTCCCGAAAGGCCCGGCAAAGCAAGCGACCAAAATCGCCGGTCTGCCCAAGCCGGTAAAATGTATCTAATACCGAATACTGAAGCCTGTTAACGTCTCGCGAGGGCTGTGCGGTTCAGTGAGGATTTTGTCCACACGAGCAGAGCGCGGCCCGCCCTCTTTCAGCCACTTGATGAGCTTTTCTACCTGCGCCGCGTCGCCACAGGCCACCACTTCTACGCTTCCGTCATCCATATTCTTCGCATAGCCGGTTAACCCCAGCCGCTGCGCCTCATGCTGCGTGGTATAGCGAAACCCGACGCCCTGAACGCGACCATAAACCCAGGCGATAATGCAGACGTTCGACATGATGGTTCTCCTTTCATAATCAACGGTGAAGACGCATACAGGACATACTTAACGTACCTGACTAGGCAGACACCCATAATGTTGGCGGAATCTGGCCGTAAATCGCGACCCCGACAGGTAGCCGCACCGTAGCGCTATTTCGCCGATCGGCAGCGTCGTCGATTGCAGTTGAGAAAGCGCGCAGGACATGCGTACCTCTTCAATAATTTGCCGATAACTCTGCGATTCGCGCTGCAAACGCCGACGCAGCGTCGATGTTCCCATAGCAAGACGGCGGGCTATCTCCTGGGCTGTCCATAGCTTAGCAGGTGAGAGCATAATCAGTTGCCGCACCTGCTCCGTGAGGGTGTAACGCCGTTCAATAAGCAGCGGTCCCGCCGCGCCATCATGCAAAAGCGCTAACAGTAACCCCATCGCCTGATGTTCCTGTAGCCCGACAGGTAAGCCCTGGCGCACAGCATCCAACACGTTCTCCCACATAAATGTCAGGCTACGGCTCATCGGGGTACACAGCGATGTCAGGTTTGCCGGTGGATAATCCTGAACGTACATCGTTTTAAAGCGAGCAATAATCTCCGGTGAAAGTAACAACAGGTCGGAGCGAAAACCGTTCTGCGCAGGCTGATTAATAATCTCCAGCGGCGTGTTCGCCGGAATAATAATCAACTCGCCCGGCCCGGCGACAAGGCGGCTATCATCCTGAATGATGACTTTACTGCCCTGCGTAATATGGCAGATAGCGGCGGAAAAAAGCGTAACACGATGCAAACGGTGCAGATGACTGGAACGCACCTCTGCCGTCGTAAGATCCTGATGTCGAATATGCATCTCCTGCACCGTGCCGTCCTCTGTTTTATCTTCCGTATGTGGCCGTGAATTTTGCTTTCGCAATCACATGCGCGTTAAGCATAAACCCTACCAACGCGCCACTCGACTCACTGTCGAGCGGCAGTGTTGCCGTATTCAGCGCCCACACAGTAAACTGATAGCGATGAGGCTTATCGCCCGGCGGCGGACAGGCGCCGCCAAAACCAGCATAACCGAAATCATTCCGTCCCTGCACCACTCCAGCGGGTAACGTTTTTTTATCAGCCCCCGTCGGCAGGTCGTGAATCTGCGCAGGAATATTCACCATCGTCCAGTGCCACCAGCCGCTTCCGGTGGGCGCATCAGGATCGAAGACGGTAATAGCGTAGCTTTTGGTTCCGGCAGGCGGATTGCGCCAGCTCAGCTGCGGCGAAATATTTTCTCCGCTACAACCAAATCCTTTAAAGACGTGCTGTTGCGTTAAACGAAAATCCGCCGGAATATCCGCGCTACTGAGGCTAAACGCCGACGCGGCAGATACGCCGCCACTCAATGCCATCATTGCGGCTGCGGTCAAAGAGAAGACTGTTTTCATTGTATTTCCTTCTTCATGTAAAGAAATAACACTATAAAAAGCGTCCGCTGAAGAAACTGTGCCTTTACGATTCTATTTTGTGCAGAAACGCTCACCACCATACGCTGGTGATTGCCCCTCTCATCTGTGCTACCCTACGCGCCATTTCGTTTTCACGGTCCCAGGCTGGAACTATCATCCGGCCCGCGTCACTGTTTGTGAGAGGTTACTCCGCATGACTGAATCTACATTTCCGCAATATCCCCGTTTAGTCCTCAGCAAAGGGCGAGAAAAATCTTTACTCCGCCGCCACCCATGGGTCTTTTCCGGCGCCGTATCCCGTCTGGAAGGCAAAGCCAACCTCGGTGAAACTATCGATATCGTCGACCATCAGGGAAAGTGGTTAGCACGCGGCGCCTGGTCACCAGCCTCCCAGATCCGCGCGCGCGTCTGGACATTTGATAAAGCAGAATCCATTGATATTGCGTTTTTCACCCGCCGCCTGCGCCAGGCGCAGCAGTGGCGCGACTGGCTGGCGAAAAAAGACGGCCTGGATAGCTATCGTCTGATCGCCGGTGAGTCCGATGGCCTGCCTGGCGTCACTATCGATCGTTTTGGTCATTTTTTGGTGCTGCAACTGCTCAGCGCCGGGGCCGAATATCAACGCGCCGCATTAATTAGTGCGCTGCAAACATGCGATCCGGATTGCGCTATTTACGATCGCAGCGACGTCGCCGTGCGGAAAAAAGAAGGGATGGCGCTGACGCAAGGTCCGGTCACTGGCGAACTGCCGCCTGCGCTTTTGCCAATTGAAGAACACGGTATGAAATTGCTGGTCGATATCCAGGGCGGCCACAAAACCGGTTATTATCTTGATCAGCGCGACAGTCGTCTGGCGACGCGCCGCTACGTGGAAAATCAGCGGGTACTGAACTGCTTCTCTTATACCGGCGGTTTTGCCGTGTCGGCGTTAATGGGCGGTTGTCGCCAGGTTGTCAGCGTGGATACCTCACAGGATGCGCTGGATATCGCCAGGCAAAACGTTGAACTGAACCAACTGGACTTGAGCAAAGCCGAATTCGTGCGCGACGACGTGTTTAAGTTGCTGCGCGCTTACCGTGAACACGGCGAAAAATTCGACGTCATCATCATGGACCCGCCCAAATTCGTTGAAAATAAAAGCCAGTTAATGGGCGCCTGCCGGGGCTATAAAGACATTAACATGTTAGCGATTCAACTGCTCAATCCGGGCGGCATACTGCTGACATTCTCTTGCTCCGGACTGATGACCAGCGATTTATTTCAGAAAATCATTGCCGATGCCGCAATAGATGCCGGTCGTGATGTACAATTTATAGAGCAGTTCCGTCAGGCCGCCGATCACCCGGTGATCGCCACCTACCCGGAAGGGCTGTATCTGAAAGGGTTTGCCTGTCGCGTCATGTAACTTGAAAAGTGGAATAGTATCCTCATATAAAGGGTATCTATTTCCCGGGAGGTGACTATGATAGCCAGCAAATTCGGTATCGGCCAACAGGTCCGCCATTCCCTGTTAGGTTACCTCGGAGTGGTCGTCGATATCGACCCGGAATATTCGCTTGATGAGCCGTCGCCTGATGAACTGGCGGTTAACGACGAACTTCGCGCCGCTCCGTGGTACCACGTGGTAATGGAAGATGATGATGGTCAGCCAGTGCATACTTATCTGGCCGAGGCCCAGTTGCGAAGCGAAATGCGGGACGAGCATCCAGAACAGCCATCGATGGATGAACTGGCGCGTACCATTCGCAAGCAGCTTCAGGCGCCGCGACTACGTAACTGATTGTATGTAAAAGGCCGGAGAGCGATATCCGGCCATTTAAACTTTTATTTCGCCAGCCCCAGGCGAGGGATCTCAATCGCCGGGCAGCGATCCATCACCACGGTCATGCCTGCATCCCGCGCCAGCACCGCCGCCTGTTCGTTGATCACGCCAAGCTGTAGCCATAGCGTTTTTGCGCCGATGGCGATAGCTTCCTGCGCGACGCCCCATGCAGCTTCAGAGTTGCGAAAAACATCCACCATATCGACTTTTTCGGGAATGTCCGCCAGCGTGTCATACCCCTGTTGCCCCAGCAATGTCTTACCCGCGACTTTTGGCGCAACCGGAATCACATGGTAGCCCTGTTCAAGCAGGTATTTCATTACCCGATAACTTGGACGATCGGGTTTATCGCTCGCACCTACCAGCGCGATAGTGCGAGTGGACGTCAAAACATCAGCAATATCGGTCTCTTTCATCATCTTTCTCCTGGCTGTTTTGCAAAGTGTACGACAAACCTGACATAGCAACCATTCATCCCATACCGATGTATGAGAGCAAAACGCCAAAATATGTCTATATGTTAGTAAACATGATCTAAGAAAAATTTCGATACATTATTAGCCAGACGGCCTTTGGACAGGAGAACCTTATGAAAACCGGCGCGCTAGCCACCTTCCTTGCTCTCTGTTTGCCGGTGACTGTTTTTGCCACAACGCTCCGTTTGTCTAATGAGGTTGATCTGCTGGTGCTGGACGGCAAAAAGGTGTCCAGCTCTTTATTACGCGGCGCAGAGAGCATTGAACTGGAAAACGGGCCGCATCAACTCGTTTTCCGGGTGGAAAAAACCATCCGCCTGCCCGGTAATGAAGAGCGGCTTTATATTTCTCCGCCGCTGGTGATAAGTTTCGATACCCAACTGATAAGTCAGGTCAATTTCCAGCTCCCGCGGCTGGAAAACGAGCGCGAAGCCTCGCATTTTAACGCGGCGCCGCGTCTGGCGCTTTTGGACGGCGACGCCATGCCCATTCCGGTAAAACTCGATATTCTGGCCATTACTTCTACGGCTAAAGTGGTCGATTATGAAATAGAGACCGAGCGTTATAATAAATCGGCGAAACGCGCCTCGCTTCCTCAGTTCGCCACGATGATGGCAGACGACAGCACATTACTTTCCGACGTCTCAGAGCTTGATACCGTACCGCCGCAATCACAAACTCTGACAGAACAGCGGCTGAAATATTGGTTCCGACTGGCGACCCGCAGACACGCCATCATTTTCTGCAATGGGCGGAAAAACAGCCGCCCTCTTGATATGAGTTGTCCCAGCGCGCAATTTTTTTCTCTTTCTCTTGCAGCATCAAGCGCTTCCAGTAATCTGTAGACAGGTTAACTACGGAATCGAATTATGGAACTGACGACTCGCACCTTGCCGACGCGCAAACATATCGCGTTGGTTGCTCACGACCACTGCAAACAGATGTTAATGAACTGGGTGGAACGCCATCAGCCGTTGCTGGAAAAACACGTTCTTTATGCAACCGGCACCACGGGGAATCTGATCCAGCGCGCAACCGGTATGGACGTTAATGCGATGCTGAGCGGCCCGATGGGCGGCGACCAGCAGGTTGGCGCACTCATTTCAGAAGGGAAAATCGACGTGTTGATTTTCTTCTGGGACCCGCTTAACGCGGTACCGCACGACCCCGATGTCAAAGCGCTACTGCGTCTGGCCACCGTATGGAATATCCCTGTCGCCACTAACGTCTCAACGGCGGACTTTATCATCCAGTCCCCGCATTTTAATGACGCTGTAGATATTCTTATTCCGGATTATGCGCGTTATCTGGCCGAGCGCCTGAAATAACCGCTACGCGGGCGGGATGCTGTACCGCCCGCGCTTTAGGGCTTCCTCGCCACCGGCACATCCAGCTGTTTAAGCGCCTCCACAAAGCGCGAGGGATTATCTTTATTAAAAAGCAGCCAGACCCGCGCACGCGCGCGCGTCAACGCCACATATAACAAACGCCGCTCTTCAGCGTCCGGAAAATCCTCAACCTGAGGAAGAAGCGCACTTTCCATAATGGATTCTCGCGCCGGGGCGGGGAAACCGTCGTTACCCTCCTGCAATCCGACAAGAATCACGTAATCGGCCTGTTGGCCTTTGCTGGCATGGATAGTCATGAAGTCTATCTGTAACTTCGGCCAGCGAGTCGCCGCTTTTTGTAGACTCGCGGGTTTCAGATGGTGATAGCGCGCCAGCACCAGGATACGTTCATCTTCCTTCGCATAGCCGGATAATTTATCCAGTAACGCGTCCAACTGGCTTTCATCCAATAACGTCACCGCTTTTTTATCGCCTGGGGTCAGACTATTTAACGGTTTTTTAAGCTGGTGCGGATTCTGCTGTACAAAGCGATTGGCAATATCGCCAATCCGACTGTTAAAGCGGTACGTGGTGTCCAGGTGGCAATGCTCGCCCTCGCCGAACGTCTGATGAAACGCCGTCGTTAAGGAGAGCTGCGCCCCGCTAAAACGGTAAATCGCCTGCCAGTCATCGCCAACGGCAAACAGCGTAGTCTGGCTATTCTGCTTGCGCAGCGCCTCTAACAGCGCCGCCCGTTGCGGGGAAATATCCTGAAATTCATCGACCAGAATATGCTTCCACGGGCTGATAAAACGCCCTTTTTCGAGGATCACCATTGCCTGATGGATCAGCCCGGAGAAATCAACGGCATTTTCCGCTTTCAACGCGCTTTTCCAGGCCTTTAGCAGCGGCGCCATCAGCTTAATGCGTTTGCCAAACAGCTCGCGGCACTCCTCTGGCGCGCCAGCGATCATTTCTGCCTGCGCGCCGCCGTGCATACGCATCAAACTGACCCAACGATCCAGGCGAGGGGCCAGACGCCGCTGCAATGTCTCATCGTCCCAGAAATTACCTTCCGGCACTACCCACTGCATCTCCTCTTCCAGCCACTGACGCCAGCCTTTGGCCTGCGCTTTTTTCTCGCTACACTGCTGACGCCAGGTGCGCAGAAATAGCTGATGCCGCGCGGTGGCGTCACTTTCCAGCTTACTGACAACCGGCGCTTTTTTACTGCCTTGCTGAATAATATACAGCGCCAGCGAATGGAACGTACGGGCAGTAATCTCTTCCGTATGTAAGCGCTCGCGGATACGCTCATCCATCTCTTCTGCCGCCTTGCGACCAAACGCCAGTAAAAGAATTTGCCCCGCATCGGCTTGTCCGCGCGCCAACAGCCAACCCGCGCGCGCCACCAGCACCGAGGTCTTACCGCTTCCCGCGCCCGCCAGTACCAGTAACGATGATTCGCCGTTAACCACCGCCCTGGCCTGGGAAGGATTAAGCGGAGAGGATTCTATCTGCGTAAAAAAGTCGGCATGCGCTTCCAGCATGGCATCAGCATACGCCTGATTATGCTGCTGCCGACTTCCCTCGCTATCCTGTAACCATGCCAGACACTTACGCCATATCTCGCGACAGTGAGCAAACTCCTCCAGACGTGAGACCGGCAGCGGCAAGGCGGCAAACGTCTGGCGAATTTCGTGTTCCAGCCCCCGAACGCGTTCACGGGTAAGCCATTGGTTTCCGCCCGTTCGCTCACTAATACGCGCCCATTGCTCCTGTAATGCCTGCGCGGCGACATCACTCATCTCCTGGCTCCAGCGACGCCAGTGCGCGTCCAGATAGCGATGGAATTGCTGGGTTTCCGACCACTCGGTACCGTGCAAACGCACGACTTTATCCTCCGGCAGGACAAACTCCAGCTCGCCCCATACCAGCCCACGCTTACAGTGAATCGCCAATAACTGATTGAATGGAATAAGATATTCATGCCTGTCGCCAGACACTTTCACCCCAGCGTTGAGGATCTCGGCACGATCATACGGGTGTTGCGCCAGACGTTTTCCAAGAGAAGTTGCTTTCAGTTCCATGACTCAGCGAATCCAGACGTAAAGATGTGCTTATCAGTGTAACCGCCAGAGAAAACGTGCTCCAGTCTAAAAAAACGTTACAATTGCCCGCAGTGATAGAAAACCGGAATGAACTGAGGGTTTATGCGTACCGTTCTGAATATTTTAAACTTTGTACTGGGCGGCTTTGCCACTACGCTGGCCTGGCTGCTGGCGACGCTTGTCAGTATTGTGCTTATTTTTACCCTGCCGTTGACCCGCTCCTGCTGGGAGATAACCAAACTGTCCCTGTTCCCTTACGGTAATGAAGCCATTCACGTTGACGAACTTAATCCGGCGGCGAAAAGCGTATTAATGAATACTGGCGGTACCTTGCTGAATATTTTCTGGTTACTTTTTTTCGGCTGGTGGCTATGCCTGATGCACATTGCCTCCGGTATCGCTCAGTGTGTCACTATCATCGGGATACCTGTCGGTATTGCGAACTTTAAAATTGCCGCGATTGCGCTTTGGCCTGTCGGTCGCCGCGTCGTCTCTGTAGAAACCGCTCGCGCCGCGCGAGAAGCTAACGCGCGCCGCCGTTTTGAATGATCGGGACAAATAGCCTTTATGTTAAGTCCGTTGATTCGCCGTTATACCTGGAACAGTACCTGGCTGTATTACATCCGCATTTTTATCGCTCTGTGCGGCACCACCGCCCTGCCCTGGTGGCTGGGCGACGTCAAACTGACCATCCCGCTCACGCTCGGTATGGTTGCCGCGGCGCTAACCGATCTCGACGATCGCCTTGCCGGACGCTTGCGTAATTTAATCATTACCTTAATTTGCTTTTTTATCGCGTCGGCTTCTGTGGAGCTGCTCTTTCCCTGGCCGTGGCTATTTGCGTTGGGCTTAACGTTATCCACCAGCGGGTTTATTCTGTTGGGAGGACTGGGGCAACGCTATGCCACCATCGCGTTCGGCGCGTTACTCATTGCCATCTATACGATGCTGGGTACCTCTTTATACGATCACTGGTATCAGCAACCACTGCTCCTGCTGGCAGGCGCAGTATGGTATAACCTACTGACGCTAACCGGGCATCTGCTATTTCCGATCCGTCCGTTGCAGGATAATCTGGCACGCAGTTACGAACAGTTAGCGCACTATCTGGAACTGAAATCACGTCTGTTTGATCCTGATATTGAAGATGAAAGCCAGGCGCCGCTCTATGATTTAGCGTTAGCGAACGGGCAGTTAATGGCGACGCTGAACCAAACGAAAGTGTCGTTATTGAGTCGCCTGCGCGGCGATCGCGGTCAACGCGGTACGCGCCGCACCCTCCATTACTATTTTGTGGCGCAGGATATTCATGAACGCGCCAGTTCTTCGCATATTCAATACCAGACACTGCGCGATTATTTTCGCCATAGCGACGTCATGTTCCGCTTTCAGCGTCTGATGTCGATGCAGGCGCAGGCCTGTACGCAGCTGGCGCGCTGTATCTTACTGCGTACGCCGTACCAGCATGATCCGCGTTTTGAACGCGTCTTTACCCACATTGACGCCGCGCTTGAACGTATGCGCGCCAGCGGCGCTTCTTTAGAGCTGCTGAATACGCTTGGATTCTTATTAACCAACCTACGCGCCATTGATGCGCAACTGGCGACGATCGAGTCGGAGCAGGCCCAGGCAATGCCGCGCAATGAGTCAGAAAACCAGTTGGCTGATGATAGCCTGCACGGGTTTAGCGACATCTGGCTGCGTCTGAGCCGTAATTTTACCCCGGAGTCCGCTCTCTTTCGCCATGCGGTACGCATGTCGCTGGTATTGTGCATCGGTTATGCTCTCATCCAAATTACCGGGATGCGCCACGGGTACTGGATATTGCTCACCAGCCTGTTTGTTTGCCAGCCTAACTATAACGCGACCCGCCATCGCCTTGCGCTCAGGATTATCGGCACGTTGGTAGGCGTTGCTATCGGCCTGCCGATTTTATGGTTTGTTCCTTCGCTTGAAGGACAGTTAGTTCTGCTGGTGATTACCGGCGTGCTTTTCTTCGCATTCCGTAATGTGCAGTATGCCCATGCAACGATGTTCATCACCCTGCTGGTATTACTCTGCTTTAACCTCCTGGGCGAAGGCTTCGAGGTAGCGTTACCGCGCGTCGTCGACACGTTAATTGGCTGCGCTATCGCCTGGGCTGCGGTCAGCTTTATTTGGCCGGACTGGCGCTTTCGCAACCTTCCCCGGGTACTCCAGCGCGCCACCGATGCTAATTGCCGCTACCTTGATGCGATCCTTGAGCAATATCACCAGGGACGGGATAACCGCCTGGCCTATCGCATTGCCAGACGCGACGCGCACAACCGCGATGCAGAACTGGCATCCGTGGTTTCTAATATGTCGAGCGAACCCGACGTCACGGCTGAAACCCGGGAGGCGGCGTTCAGGCTGCTTTGCCTCAATCACACTTTTACCAGCTATATCTCCGCCCTCGGCGCGCACCGCGAAAAGCTCAGTAATCCGGACGTGTTGGGGCTTCTGGATGACGCGGTCTGCTACGTTGATGATGCGCTTCATCATCAACCTGAAGACGAACAGCGCGTACACCAGGCGCTGGAGGGTTTAAAGCAGCGAGTTCAGTCACTGGAAACACGTCCGGACAGCAAAGAACCTCTGGTTGTTCAACAAATTGGTTTGCTCATTGCCTTACTGCCCGAGATTGGACGCTTGCAACGGCAAATTTCACCGCCGACTTCTACATTAATTACCCAGCCGTAAGCGAATGAGCCCAGTCGGCAAGCTCCTGGCGACGACTCGCCGGGAGCGCCGCTTCATGTACGCCGACAATTGCGCCTTCCAGGGCATACAAAACTTTCACCGTCAGCAAGGGATTGCTTTGTCGCAACCTTAGCCAGCACATTTTCGCGCCCAGTATGCGTAACATATTTTCGTCCTTTATCCCTGACTCATTGAGCAATGTTTCCAGATGGAAGGTCATATTGGGAAGATCTTTGAGTCTGTGCTGTAAAATACGGCTGTGTTTTTCCTTCATTGCCGCGTCAAGAGAATACTTCGATAAACGTACCAGCTGCTGCTGATCGCGCCACAGGCTTTCATCCACCCGGTAATAGTTGAGCATAACGGGACGTCCACATTTCATAAACATGAGCCAGGCGGGGGGATGCTTCACACAGTAGGGTACGCTTTCTTCACAGGCGCGGAGATAAAGCTCGCCATTAGCCACCATCGCAAAGACGGTATCCTCCACGGTCAGACTATAGCTACCGAATAAAGATCGATACTGGATCGTCCCCAAAGAGGCCAAATATTCTTGCGATTTATAGATCCTGTCATAAGAGAGTGCTCTCATAAAATTCCTTTTAAATCATAAAGTAAAAGAATGATTTGCAGTAACGGATCCGTTAATGACGAAAATAGGCAACTTATACTCCGCGAGCAAGATGATTTTTATTTTTGACGCCACTAAGAATAAAAATTGCGAGACAGTTTCCGAAAATAGAGTTGATCTTTCATCGCCACAGGGGTACTGTATGAATATACAGTAACTCACAGGGCTGGATTGATTATGTACACTTCAGGTTATGCAAATCGTTCTTCGTCATTTCCTACCACTACCCACAACGCTGCGCGCACCGCTACGGAAAATGCCGCGGCAGGACTGGTCAGTGAAGTTGTCTACCACGAAGACCAGCCCATGATGGCGCAACTCCTGCTTTTGCCTTTACTCCGTCAGTTAGGCCAACAATCACGCTGGCAGCTCTGGCTCACGCCGCAGCAAAAGCTCAGCCGTGAATGGGTACAGTCTTCAGGTTTGCCATTAACGAAAGTGATGCAAATTAGCCAGCTTGCGCCTCGTCATACGCTGGAGTCGATGATCCGCGCTTTGCGTACAGGAAATTACAGCGTGGTAATTGGTTGGATGACTGAAGAACTGACAGAAGAAGAACATGCCAGCCTGGTTGAAGCAGCGAAGGTAGGTAATGCGGTAGGGTTTATCATGCGCCCTGTACGTGCGCACGCTTTATCCAGGAGACAGCATTCCGGGCTAAAAATTCACTCTAATTTGTATCATTAAGTAAAATTAGGATTTATCCTGGACTTTTTTTTACGCGAACGTATCTCCTTTGAGTGCTAACGTTTTTTTTGCGAGAACGCTTGTCAGAAGCGGTTTCCGCAATTTTTGCTGTACGATTTATCATCTGAAACTGTTAAATGATGTGTATATCCGTCATGTTTTTTTCACATGTCTGACGGAGTTCACACTTGTAAGTTTCCAACTACGTTGTAGACTTTACATCGCCAGGGGTGCTCAGCATAAGCCGTAGATATCGGTAGAGTAACTATTGAGCAGATCCCCCGGTGAAGGATTTAACCGTGTTATCTCGTTGGAGATATTCATGGCGTATTTTGGATGATAACGAGGCGCAAAAAATGAAAAAGACAGCTATCGCGATTGCAGTGGCACTGGCTGGTTTCGCTACCGTAGCGCAGGCCGCTCCGAAAGATAACACCTGGTACGCTGGTGCTAAACTGGGCTGGTCTCAGTACCATGACACCGGCTTCATTCACAATGATGGCCCGACTCATGAAAACCAACTGGGCGCAGGTGCTTTTGGTGGTTACCAGGTTAACCCGTATGTTGGCTTTGAAATGGGCTACGACTGGTTAGGCCGTATGCCGTACAAAGGCGACAACATCAATGGCGCTTATAAAGCTCAGGGCGTTCAGTTGACCGCTAAACTGGGTTATCCAATCACTGACGATCTGGACGTTTATACCCGTCTGGGTGGTATGGTATGGCGTGCAGACACCAAGTCTAACGTCCCTGGCGGCCCGTCTACTAAAGACCACGACACCGGCGTTTCCCCGGTATTCGCGGGCGGTATCGAGTATGCCATCACCCCTGAAATCGCAACCCGTCTGGAATACCAGTGGACTAACAACATCGGTGATGCCAACACCATCGGCACCCGTCCGGACAACGGCCTGCTGAGCGTAGGTGTTTCCTACCGTTTCGGCCAGCAAGAAGCTGCTCCGGTAGTAGCTCCGGCACCGGCTCCGGCTCCGGAAGTACAGACCAAGCACTTCACTCTGAAGTCTGACGTACTGTTCAACTTCAACAAATCTACCCTGAAGCCGGAAGGCCAGCAGGCTCTGGATCAGCTGTACAGCCAGCTGAGCAACCTGGATCCGAAAGACGGTTCCGTTGTCGTTCTGGGCTTCACTGACCGTATCGGTTCTGACGCTTACAACCAGGGTCTGTCCGAGAAACGTGCTCAGTCTGTTGTTGATTACCTGATCTCCAAAGGTATTCCGTCTGACAAAATCTCCGCACGTGGTATGGGCGAATCTAACCCGGTTACCGGCAACACCTGTGACAACGTGAAACCTCGCGCTGCCCTGATCGATTGCCTGGCTCCGGATCGTCGCGTAGAGATCGAAGTTAAAGGCGTTAAAGACGTGGTAACTCAGCCGCAGGCTTAAGTTTCCGTCTGATAAAAAACCCCGCGTCGCGGGGTTTTTTGCTCTGGTCTGGGTGACAACGCCTTTCAGCGTTACTTCTTGCCTAATAACGCCTGTAAATCCTGCTTTAACGTGGTCATTTGCGTGGCATATTTCTCTTTATGCTCCGCGTCTTCTATCAGTTGCACTATCGTTTCGGATAATGTTTTACCGCGTCGCTGCGCAAGGCCCGCCAGACGCTGCCAGACCATAAACTCTAAATCGATAGATTTTTTGCGCGTATGCTGATGTTCCGCATTGAAGTGTCGTTTGCGTCTTGCCCGAATGGTTTGTTTCATCCGATTAAGCAAGGCAGGATTCATATGCCTGTCTATCCAGACATTTACCCGCACCGGTTCATTTTCGAGGGCCAACAACAAATTGACGGCTTCCTGGGCAGCACTGGCTTCCACGTAGCGGGTGATCAACTCCCCTTCGCGGTGCTTTTTCACCAGATATTTCCATTTCCAACCGCTTTCAAGATTTTCAAGTTGTTGATATTTCATTGCGATCCCAACGTTACCGTGTAACTGTTATCAGAATATCAGTTTTTTAGCCATCTGAAGAAAAGAATCTCGAACCTGTGAATGGTTACACGTCTTCATATTACTTTGCATTCGCTTTCCGCGTGTGCAGCGTGACGGGTTAGGCTATAATCCCCCCTTTTACAACAGACTAAAAAACCTCAACTTTGACCATTACGAAACTTGCATGGCGTGATCTGGTTCCGGATAGCGAAAGCTATCAGGAGATATTTGCACAGCCACACGCGACTGACGAAAACGACACCTTACTCAGTGATACTCAGCCACGACTGCAATTTGCGCTTGAGCAACTTATACAGCCGTGGGCATCATCCTCTTTTATGCTGACTAAAGCGCCTGAAGAGCAAGAGTATCTCACTTTACTTTCAGATGCCGTCCGCGCTCTGCAAACCGATGCCGGACAATTAACCGGCGGACATTATGACGTTTCCGGGCATACTGTTCATTACCGCGCCGCGCAGAATGCGCAAGACAACTTTGCCACCGTCACACAAGTCGTCAGCGCGGACTGGGTCGAAGCCGAACAGCTCTTTGGTTGCCTGCGGCAGTATAACGGCGACATTATCCTGCAGCCGGGACTGGTTCATCAGGCGAACGGCGGCGTGCTGATTATTTCCTTACGAACCCTTCTGGCGCAGCCGTTACTGTGGATGCGTCTGAAAGCCATCGTTAGCCGCGAGCGTTTTGACTGGGTGGCCTTTGACGAGTCGCGTCCATTACCGGTCTCCGTGCCATCAATGCCGCTCAAACTGAAGGTGATTCTGGTTGGCGAACGTGAATCACTGGCTGATTTTCAGGAGATGGAACCGGAGCTCGCGGAACAGGCTATCTACAGTGAATTTGAAGACAATTTACAGATAGCGGACGCAGAAGCTATGACCCTGTGGTGTCAATGGGTGACGCGTATCGCTTTACGCGATAATTTGCCCCCTCCGGCACCGGACGCCTGGCCCGTCCTGATACGCGAGGCTGTGCGCTATACCGGCGAACAGGATACGCTGCCTCTTTGCCCACTGTGGATAGCCCGCCAGTTTAAGGAGGCGTCGCCTTTATGCGAAGGCGATACCTGCGGCGCAGAAGCGCTCAGCCTGATGCTTGCCCGACGCGAATGGCGAGAAGGCTTTCTGGCGGAGCGGATGCAGGATGAGATTCTGCAAGAGCAGATCCTGATTGAAACCGAAGGCGAACGCGTTGGACAAATCAATGCGCTTTCCGTCATTGAGTTTCCCGGGCATCCGCGCGCCTTTGGCGAACCGTCGCGAATTAGCTGTGTTGTGCATATCGGCGATGGCGAATTTAACGATATTGAGCGCAAGGCCGAACTTGGCGGGAATATCCACGCTAAGGGAATGATGATTATGCAGGCCTTCCTGATGTCGGAGTTGCAGCTGGAGCAACAAATTCCCTTCTCTGCCTCGTTAACCTTTGAGCAGTCCTACAGCGAAGTGGATGGCGATAGCGCCTCAATGGCGGAATTATGTGCGCTCATCAGCGCGCTGGCCAATGTGCCGGTGAATCAAAACATTGCGATTACCGGCTCGGTCGATCAGTTTGGTCGCGCGCAACCGGTGGGTGGGCTAAACGAAAAAATTGAAGGTTTCTTCGCCATCTGCGAGCAGCGGGAATTAAACGGTAAACAGGGCGTGATTATCCCTGCAGCCAATGTCCGCCATCTCAGTCTTAAATCTGAACTGCTGCAAGCGGTTAAAGAAGAGAAGTTCACTATCTGGGCGGTAGACGACGTGACCGACGCCTTACCGCTACTGTTAAATCTGGTGTGGGATGGCGAAGGTCAAACGACGTTGATGCAGACTATCCAGGAGCGTATCGCGCAGGCGACGCAACAGGAAGGCCGTCATCGTTTCCCGTGGCCATTACGTTGGCTGAACGCTTTTATTCCGAACTGATCGGACTTGTTCAGCGTACACGTGTTAGCTATCCTGCGTGCTTCAATAAAATAAGGTTTACATAAAACATGGTAGATAAACGCGAATCCTATACAAAAGAAGACCTTCTTGCCTCTGGTCGTGGTGAACTGTTTGGCGCTAAAGGGCCGCAACTCCCTGCGCCGAACATGCTGATGATGGACCGCGTCGTTAAGATGACCGAAACGGGCGGCAATTTCGACAAAGGCTATGTCGAAGCCGAGCTGGATATCAATCCGGATCTATGGTTCTTCGGATGCCACTTTATCGGCGATCCGGTGATGCCCGGTTGTCTGGGTCTGGATGCTATGTGGCAATTGGTGGGATTCTACCTGGGCTGGCTGGGCGGCGAAGGCAAAGGCCGCGCTCTGGGCGTGGGCGAAGTGAAATTTACCGGCCAAGTTCTGCCGACAGCCAGGAAAGTCACCTATCGTATTCATTTCAAACGTATCGTAAACCGTCGCCTGATCATGGGCCTGGCGGACGGTGAGGTTCTGGTGGATGGTCGTCTGATCTATACCGCACACGATTTGAAAGTCGGTTTGTTCCAGGATACTTCCGCGTTCTAAGTGTTGAATTGATATTCCGCGCTATCGGTATCATGCTTGCAAAGCCCATAAAGGCGAAACCTCCGCACTGCGGAGGTTTCTTTTTCTAAAGAGACAGAATCAGGCCATTACCGCCCTGTCCTCCATGGCTTGTCGCCAACCTCCCAACCAGTATGACCGTTGATTCAGCGTCTGATAGGGACACATTTCTTTTGATCGTCCGGCGATGCCGGCCTGATATCCACGTTGATGTGCCCGTTCCAGGCGATCTCGTTTTTGTCTCTTCATGCCTCGTTTCCCTCATATTTGTATCTGGTGGAAAAGAAAACAGTGATTACAAAATGTGCAATCACACTACACGAATACCTCTAAATGGATCGCCCGTCAATGCGCAAAATTCACACCAATGTCATATTTGTGAGCTATACGGTAAATCATTTGTACAAAAAATGTGAGCCATAACAACTATTTTTCCGGCGTCAAAAATAAAAAAACCGCCTCAGGCGCACGACCTGAAGCGGTTAAATTTACATTACTTTTATCTCAGGGAAGGCGCTTTATCTCATCTGCGATAGCGGCTGCTTCCTGACTCCATGCGCTTGCCAGTACTTTGACCATCTCATCATAGCCATCTTTCTGCTGACTGGCTTCGATATGGAAAGGCCGCTTAATCAGCTGTCCATTGTGGTTTAATAACCACTCCCCGCTGACGATAACCTTACCATCGTAGCGTCCATGGAATCCGGTTACGGTAACGTTTAACGTATCCTGGGTGGTTCCCAACGGCTGGGAAGCGACGACCCAGCCAGGAAGCCGGGCGCTAAGATTCGCCACCAGCGTGTTACGTAATTGCTGGTCCAGCGGGCTGGCCCACAGATTGTTATTGGCAATCACATACTGAACATCACTGGTTTGATACACCACGCCGTTACCTGCCAGATAGTCAGGTACGGAAACCTGCTCTACCCATAAGAGACGGTTGCCCTGGCTTGCGGTGCTTTGCACACCGCTTTGCGCTATCGGGAGCTGATAATAGCTTTTATTCTCTCCGCCGGAGCTACATGACGCCAGCCAGAACGCCATTATCACGACTAGCCATTTTTTCATTGTTTCGCCCTCTTAGGCTCAGGATCTTTTTTATCCTTCGCTTCAAATACCAGCGCGTTGCTCTTCTCGTTCAGAGTTTTCAACACCGGTTGTAACTCACGAAGCACCTGATCGAGACGCTGCATATCCGCCACCATTTTGTTATACGCCGCCGATCCAGGCTGGAAGCCCTGCATACTGCGGTTAAGTTCGCGTAACGTCGTTTGCATATCCGCCGGAAGCTGCTGCATCGACTGACTGGAGGTAATCTTGTTCATATTATCCAGCGTGGTTTGCAGCCGACGCATAGTACGCTGGCTTTCAGACAGCGTATTGGTCGCTTGTTCAATCATCGGATTCAGCGGCAGGTTGTTGATCTTATCCAACGTTTCCACCAGTCGCTGTTGAATTTGCGCCAGGCCGCTGCTGACGGTGGGAATAATTTCATAACCATCAAATTCGCGTAGCCCGGTAATCGGCGGCTCCTTAGGATAAAAGTCCAGATCGACATACAGCGCCCCGGTCACCAGGTTACCGGTTTTAAGCGAAGCGCGTAAACCGCGCTTAAGCAGTTCCGTCAAATGCGCGCCAACATCCGCATTTTCTCCCAATTGCGCTTTCAGACGCTCCGGTTCAATGCGCACCAGCACAGGAATACGGTAATCGTCGTTAAATACCTGGCGCATTTTAGACGCAAAGAAAGGCACTTTGCTTACCGTCCCCAGACGAATACCGCGGAACTCCAGCGGCGCGCCGGGTTGTAATCCGCGCACCGAATCTTTAAAGAACATCAGGTAATCGATATGATCGGTATACAGCGAATCCTGAATACTTTTTTGATCGTCGTAGAGATTAAACGCCGTTTTTTCGGCGACGGGTTGCCCCTGCTCAAGTCCTTCCGGCACATCAAAACTCACGCCGCCGCCAAACAACGTCGTCAACGATCCCATTTCCACGCGCATTCCCGCCGATGTCAGATCCACAGCGATACCGCTATCTTTCCAGAATCGGACATTACTGGTGACCAACCGATCGTTTGGCGCCTTAATGAACAACTGATAACTCATTGTCCGTTTTTGCGGATCGAAAGAGCTGGTTTCAACGGACCCTACCCGATAGCCCCGGAACAGAACGGGATCGCCAGGACTGAGCTGACCGGCCTTTTTGCTGTCCAGAATCACACGAATACCTTTGGCATCGGGCGGCGCCAACGGCGGTGAGTCAAGAAGCTGATAGCTTTCCGGCTGACTGCCCTTACTTCCTGGTTGTAGTTCAATATACGCACCTGATAGCAGCGTCCCAAGCCCGCTGATGCCTTCACGCCCCACCTGCGGTTTTACCACCCAAAATACCGAGTCTTTATGCAGCAACTTTTCCATACCGGAATGGAGCCGCGCTTTGATTTGTACGTGGGTCAGATCGTCAGTCAGCGTCGCGCTTTCAACCACGCCAACATCCACGCTGCGACTTTTGATCGTCGTTTTTCCGCCTTCAATGCCTTCCGCATTGGTGGTGATTAAGGTAACTTCCGGTCCCTGGTGGCTGTAGTGATAAAACAGAATCCAGGCTCCGATAAGCGCAGTAACAATCGGGAATATCCAGACAGGCGACCAATTTTTCACCTTTTGTACTTTGGCTTCCCCTTTTTTAGGTTCCATGCTATCAGGACTCCTCATGGCCTGGTTCGTACTCACGATCCCACGACAGACGCGGATCAAAGGTCATCGCAGAAAACATTGTCATTATGACGACTAAAGCAAACATCAACGCACCCATCGCAGGATAAATATTCATTAACCCTCCCATACGCACCAGCGCAGAGAGTACGGCAATGACAAAGACATCAATCATTGACCAGCGCCCCACAAACTCCACTACTTCATAAATAAAATGCATCCGTTCACTGTCGCGCTTACCGTGGCCTTTCGCATCCCAACAAAGCCAGGCAATGGCGATCATTTTTAGCGTCGGCACCATAATACTGGCGAGAAAGATAACCGCCGCCACCGGATAAGACCCCTCGCTCCACAGCAAAATCACGCCAGCCAGAATGGTCGACGGCATTTTCGAGCCCAGTAAGTCGGTAATCATAATCGGCAAAATATTGGCGGGCAGATAAAGCATGATCGAGGTAAATAATAACGCCAGCGTCCACTGCAAACTGTTTTTACGCCGGACATAGCCTTTCGTATGGCAACGCGGACAGACGAGACTCTCCGCAGGCAGGATCGCCGTACAGCAGGCGCAAGAACGCAAAGACTGCCGGATACCGGTAATCCCCGGCGTTAACGGCTGCGCCAGTGCAGGCTGCGGGGCGATATCATCCCACAACCAGCGGCGATCGACACACTGGAACGCCCGCAGTTGAACGAGACAAAATAAGCACCAGGGAATAAAGCTGCTGCCGATGCCGATATCGCCGTAGGCCATCAGCTTAACAAAACTGACCAACACCCCGGCGAGAAATATTTCCGCCATTCCCCACGATTTGAGGAGGAAAAAGATCCTTGCCAGCGTTTTTTTTACCGACAACGGCAGACTGGCGCGGTTGACAAGTAATAGAATAGTCACCAGGCAAAATGCCGGCACCAGTTGGACAAATAATAAAAAGAACGTGCCGAGGCTGGCGTAATCTTCAGAAAACATGACGCCGGGGATTTCCAGAAGCGTGACTTCGCTGGTGACGCCCGCGACATTCATATTCACGAAGGGAAAAAGGTTAGAAAGCAGCAACATAAATAGTGCCGCTAACGCACAGGCGGTAGGACGCTGCCGGGGCGCGTCCCACTCGGTCGTTAACGTTGCGCCGCATCGTGGACATGCTGCTTTTTGCCCATGCGAGAGGCGGGGTAAAGCCACCAGCATGTCACATTGCGGGCACAATATGTGCTTCGCAGCATGGTGATGTTCACACATAGGCGCTCCTTTCGTTATGCCCCGTTTTTCAGGCCTTCAAGATACTCCCAACGTTCGAAGGCTTGCTCAAGTTCTTGCTCCGCCTGACTTAAATCGGCCAGAACTTTTTGCGTTTGCTCATGGGGTTGGCTAAAAAAGGCCGCATCCGCAACCTGCGCCTGTAGCGCTTCCAGCTTCGCTTCCAGGTCTTCAAGCTGACCGGGTAACTGCTCCAGCTCGCGCTGCAGTTTATAGCTTAGTTTGCTACTGCCACGTTTTACAATTTCTGCTTTAGGGGCAATAACTTCCTCATTTTTTTTCGCCATCGGCTGTTTCGTCGCCAGATGCTGCTCTTGCTGCGCACGCGCATCATGGTAGCCGCCGATATAACGTCCGATTTTGCCGCCGCCCTCGAAAATCCAGCATTCCGTCACGGTATTATCGACAAATTGCCGATCGTGGCTGACCAGTAGTACCGTGCCCTGATAGCCATCAATTAATTCTTCTAATAGTTCCAGCGTTTCGACGTCAAGATCGTTCGTTGGTTCATCGAGAATTAAAAGATTGCTCGGCTTGAGGAACAGTCGCGCCAGCAGCAGACGGTTACGTTCGCCGCCGGAAAGCGCGCGGACGGGCGTCATCGCCCGTTTGGGGTGAAACAGGAAGTCCTGCAAATAGCCCAGTACATGGCGCGGCTTACCGTTTACCATCACCTCCTGCTTGCCTTCCGCCAGGTTATCCATCACGGTTTTTTCCGGGTCCAGTTCGGCGCGATGCTGATCGAAGTAGGCGACTTCCAGCTTCGTTCCTACGTGGATGCGCCCGCTGTCAGCCTGAAGCTGTCCCAGCATCAGTTTCAGTAGCGTGGTTTTACCGCAGCCGTTCGGGCCAATTAACGCAATCTTGTCACCGCGCTGTACCTGAGCGGAGAAATCTTTTACCAGTTGTTTTCCTTCTACCTGGTAATCGACGTTTTCCATCTCAAAAACGATTTTACCGGAGCGCGTCGCCTCTTCGACCTGCATCTTCGCCGTGCCCATCACTTCCCGGCGCTCGCTGCGTTCACGACGCATCGCTTTTAACGCCCGCACGCGCCCTTCATTACGGGTGCGGCGCGCCTTGATCCCCTGGCGAATCCAGACTTCTTCCTGCGCCAGTTTGCGATCAAATTCCGCGTTTTGTAACTCTTCTACGCGCAGCGCTTCTTCTTTCTCCAGCAGGTATTGATCGTAATTTCCCGGATAGGTGACCAGTTTGCCACGATCGAGATCGACAATGCGGGTCGCCATATTGCGAATAAACGAACGATCGTGAGAGATAAAAATAATCGTTCCGTTAAAGGTTTTCAGAAACCCTTCCAGCCAGTCGATAGTTTCGATATCCAGATGGTTAGTCGGTTCATCCAGTAACAATACGCGCGGATTGCTGACCAGCGCCCGACCCAGCGCGGCTTTACGCAGCCAGCCGCCGGAGAGCGACGACAGCGCGGCGTTAGGATCGAGTCCAAGCTGCGCCAGCACTTCATTAATACGGTTTTCCAGCTGCCACAGATTATGATGATCGAGCTGTTCCTGTACGCGCGCCATTTCATTGAGGTTTTTCTCGCTGGGATCGGTCATCACCAGACGGGAAATCTCATGATAGCGCTTGAGGTATTCCGCTTGTTCTTCGATACCTTCGGCGACAAAGTCATAGACGCTGCCCGCAATATTACGGGGCGGGTCCTGTTGCAGACGCGCGACGATCAGATCCTGCTCATAAATAATACGCCCATCGTCCAGACCCTGTTCGCGGTTGAGGATCTTCATCAAGGTGGATTTCCCGGCGCCGTTACGCCCCACCAGACAGACGCGTTCGTTATCTTCGATATGCAACTCTGCGTTATCGAGAAGCGGCGCGTCGCTGAACGACAGCCATGCGCCATGCATACTGATTAATGACATTTACTTTTCCTTTCAGGCGGCGCGGATCAGCCAGCAGTTATGGATCTGACGGTTACGGGCAAAATCCGGGGAAAGCGTTTTTTGCGTAATTTCTTGTGCGGTAAGCCCCAGCTCAGCCAGCCCTTCCAGATCCATACGGAATCCGCGCTTATTATTTGAGAACATGATGGTGCCGCCTTTACGCAGCAGACGTTTTAAATCTTTCATTAACGCGACATGATCGCGCTGAACATCAAACGACTCTTCCATACGTTTTGAGTTAGAGAACGTCGGCGGATCGATAAAGATCAAATCGAACTGTTCATTCGCCTCGCGCAGCCAGCCCAGGCAGTCGGCCTGAATCAGGCGATGCGCGCGGCCGCTCAGTCCGTTCAGACGCAAATTACGTTCGGCCCACTCCAGATAGGTGCGGGACATATCCACCGTTGTGGTGCTGCGCGCGCCGCCCAGACCCGCATGTACGCTGGCGCTGCCGGTATAAGAAAAGAGATTCAGGAAATCTTTGCCTTTGCTCATTTCTCCCAGCATCCTGCGGGCAATACGGTGATCGAGGAACAGACCGGTGTCGAGATAATCCGTCAGATTTACCCATAAGCGCGCATTATATTCGCTGACTTCAAGGAACTCGCCCTTCTCGCTCATTTTCTGATACTGGTTTTTTCCTTTTTGCCGTTCACGGGTTTTTAACACCAGTTTATTCGGCGGAATACCGAGCACTGACAAGGTTGCCGCAATAATATCGAACAGGCGCTGCCGCGCTTTTTGCGCATCCACCGTTTTCGGCGGCGCATATTCCTGAATCACCGCCCAGTCGCCGTAACGGTCTACCGCCACGTTATATTCCGGCAGGTCGGCATCATACAAGCGATAGCATTCAATCCCTTCCTGGCGCGCCCATTTCTCCAGCTTTTTAAGATTTTTACGCAGGCGGTTAGCGTAATCTTCCGCCACCGTCGCCGGTTTACTGTCCGCCGTGGTTTCCGCAATATGATAGTTTTTCTGCACGCAGTCCAGCGGGCCATTCTTGGCTTTAAACTGTTTGTCGGCACGTAATTGCAGGCTGCCCAGCAGATCGGGCGAAGCGCTGAACAGCGACAGGTTCCAGCCGCCAAACTGATTTTTCATGGTACGGCCCAGCAGACTGTGCAACGCAATCAGCGCCGGTTCGCTGTCCAGACGTTCGCCGTAAGGCGGGTTACTGATCACCGTACCATACGGGCCTTTCGGCAATGGATTACTCAGTTGCGCCACATCTTTCACTTCAAAGGTGATAAGCTCCCCGATACCGGCGCGACGGGCGTTGCTGCGCGCCCGCTCAATGACGCGCGCATCGCTGTCGGAACCGTAGAAATGAGAGGAATACTCCGCCAGCCCCTTACGCGCCCGGGTCTGCGCTTCGGCTTTCACTTCCTGCCAGATAGTTTCGTCATGCTGCGCCCAGCCGCTAAATCCCCAGTGACCACGGTGCAGTCCCGGCGCGCGATCGGTCGCCCACATCGCGGCCTCAATCAACAGTGTCCCCGAACCACACATAGGGTCGAGCAGCGGCGTACCTGGTTGCCAGCCGGAACGCATAACAATCGCTGCCGCCAGCGTCTCTTTAATTGGCGCCAGCCCGGTGCGATCGCGATAACCGCGCAGGTGCAGGCCATCACCACTGAGATCCAGCGCAATGCTGGCAGTTTCTTTATTCAGCCAGACGTTAATACGGAGGTCCGGCGATTCGCGGTCCACATTTGGACGCGGAAGATTTTTCCGCGTAAACGCATCGACAATCGCGTCTTTAACTTTCATCGCGCCATACTGACTATTACGGATGGTGTCGTTCAGGCCGCTGAAATGCACCGCAAACGTCGCGCCAGGATTAAAAATCTCTGTCCAGTTTATCGCCTGAACGCCGAGGTAAAGATCGAGATCGCTGTAGACCTTGCACTCACCCATCGGCAGGATAATACGCGAGGCCAGGCGGCTCCACATCAGGCTCTGGTAAATAAGCCGCGTGTCGCCCTGAAAATGGACCCCACCCTGAACAACCTGACACCCTACGGCGCCCAGTTTTTCCAGTTCAGTTTTTAACAGCTCTTCCAGCCCGCGGGCCGTACTGGCAAACAGAGAATTCATATTGTCACTTTTACGCTAAGAAAATTGTTGCGCATTATAGCTAATCTCACGCCCATGTCATAAAGTTGAAGGCTTATTTTCATTTGAGGGACTGTACGGTGGCGACGTTATCACGGCTCTTTATTCATCCGGTCAAATCCATGCGCGGCATTGGCCTGACTCATGCGCTGGCAGATATCAGCGGCCTGGCTTTTGATCGCATCTTCATGGTGACCGAGCCTGACGGCACATTTATTACCGCCCGCCAGTTTCCACAATGGTACGTTTTACCCCTTCTCCCTTACACGACGGCCTCCATTTGACCGCGCCAGACGGCAGTAGCGCGCTGGTTCGCTTTACGGATTTCACCCCGCAGGATGCGCCGACCGAGGTCTGGGGAAACCATTTTACCGCTCGCGTCGCCCCGACGGCGATTAATCAATGGTTGAGCGGCTTTTTCTCCCGCGATGTCCAGTTGCGCTGGGTTGGGCCGCAGTTGACGCGCCGGGTCAAACGACATAACGCGGTGCCGCTGGGATTTGCCGATGGCTACCCGTATTTATTGACCAACGAAGCCTCGCTGCGCGATCTGCAACAGCGTTGTCCGGCAGGCGTACAAATGGAACAATTTCGCCCAAACTTAGTGGTTTCCGGCGTAGCGGCCTGGGAGGAAGATAGCTGGAAAGTGCTTCGCATTGGCGATGTGATTTTTGACGTCGTGAAGCCCTGTAGCCGCTGTATTTTTACAACCGTCAGCCCTGAAAAGGGGCAAAAACATCCTTCCGGAGAACCGCTGGCGACACTGCAAGCTTTTCGTACCGCCCAGGACAATGGCGATGTGGATTTCGGTCAGAATCTGATTGCCCGCAATAGCGGCGTCGTTCGCGTCGGCGATGAAGTGGAGATTCTGGCGACAGCTCCGGCAAAAGCTTATGGCGCCACAACGGTCGACGACAGCGTTACGCCAGATAAACACCCGGACGCGAGTGTAACCATCGACTGGCAGGGGCAAACCTTCTGCGGGAATAATCAACAGGTACTACTGGAACAGTTGGAGAATCAGGGGATTCGTATTCCGTATTCTTGCCGGGCTGGTATCTGTGGTTGCTGCCGGATACGTTTGCTGGAAGGCGAAGTAAGTCCGCTGAAAAAATCGGCTATGGGTGACGATGGTACGATTCTGAGCTGTAGCTGCGTGCCTAAGACCGCGTTACGACTGGAGAATTAAACCGCTTGTTCAAGGCTGAAACTGTGACAGTGAACCTGCGGTTTCAGCCTGTCGTTCATAATTTTAATCGCATCGCCAAGCTGCATGGTTCGGCCTGCTATCACCACGCCCGGCTGGGCCAGCAGACACAACGCGGCATTTTCTCCCGGCTCAACGACCAGCAAATTAACCTGTTCCGCCGTATCGCTTAACCAGACGCTTGCGGCATCGCCCGTTCCTGGGGTCCACATTTCGCCATGCGCTACAAAATGCCAGCTTTTTGGCATCTGCGGTTTGAGATAGCGAATCGCCACCAGCGCATTCAACACCAGCTCCGCGCGTTGCTCTTTGGTTAATTCGAAATCCCGGCATTTTTCTTCAAAGGAAAAATAGAGCGCGGCATCATCCACACAAAAACCGGTCGGGCAAAACGCGTCCGGCGTAAGCATTTTACGAGAGAAGCGCGAGCGAAAAAGTATACCATTGGCGAGATCGAGCATCATACGATCGTGCTCTTCATCATAATACCAGCGCCAGTTATCGTCAGGTTTAATTCGCATTACCTCTCTCCCGCTTTTAAGCAACTCGTAACAGCATTGTCCTTTTGCCGCTTCGTTTATTACCCCAATAATAAAAGCCTAAAATGTCTAAATAAGCAACAGTGGAGAAATATAAAACAACCAGCGCCGGAAATAAAGCCCTGGTTGTCTGATAAAGGCTAAAGTTTAGATGTGCGTTACGATTTCTTTAATCAATGGCGGGCCTTTAAAAATAAAGCCGGAATAAATTTGTACCAGCGTAGCTCCTGCCGCTATCTTCTCGCGCGCGGCGATAACTGAGTCAATGCCGCCGACGCCGATAATAGGCAATTGTCCCTTTAACTCCTGGGATAAACGGCGAATAATTTCTGTGCTTTTTAATTGTAATGGCCGGCCACTTAATCCCCCCGTTTGCTGGCAATTTTTCATTCCTTGTACCAGAGAACGATCGAGGGTGGTATTTGTCGCAATCACCCCATCAATATTATGACGAAGCAGGCTATCGGCAACCTGGATCAATTCTTCTTCACAAAGATCCGGCGCGATCTTTACTGCCACCGGCACATATTTATGGTGGATCGCCTGAAGATCGTTTTGCTTATTTTTAATGGCAGTTAACAGATCGTCCAGCGCATCGCCATACTGGAGCGTACGTAGCCCTGGCGTATTCGGCGAAGAAATATTAATGGCGATATAACCCGCATAAGCATAGACTTTTTCCATACAAATCAGGTAGTCATCTTTGCCATTTTCGACAGGCGTATCTTTATTTTTACCGATGTTAATTCCCAGAATACCATCAAAATGGGCTTTTTTAACATTCTCGACCAGGTTATCGACGCCCAGATTATTAAAGCCCATCCGATTGATCAGACCTTCAGCATCCACCAGACGAAAAAGACGCGGCTTATCGTTACCCGGCTGTGGGCGCGGCGTCACGGTGCCGATTTCCAGGGAGCCAAACCCCATCGCGCCTAACGCGTCGATGCACTCCCCGTCTTTATCCAGACCGGCAGCCAGCCCCAGTGGATTTTTAAAGGTAAGTCCCATGCAGGTAACCGGCTTTGTCGGTACTTTCTGGCGCACCAGCGCTTCCAGCGGCGTACCTGTAATGCGGCGTAATTGTTGAAATGTAAATTCATGAGCGCGCTCTGGATCGAGCTGGAAAAGGGCTTTACGAACGAAGGGATAGTACATGAACTCTCCTGGATTCCCGGTGTGCAAACCGGGGGCGTATTATGGGCGATAACAAGGCAAAAGGGAATTGACCTACGGCAATAAATAGCAATCGTTTTCCTTCATTCTCACTTCACGTTTCGCCAGCGAGCGTCATCGTCCGGCAAAAAAATGCCCGGCAGCGCTAACGCTTACCGGGCATGTCTTACCTCTTTATAACGGATACTGTCAGGCTAACGCTTTAGTTATCTTCTCGTACAGATCGCCGGAAAGATTCTCCAGTCCTTTTAACTGCTCCAGCGCCGCACGCATTTTCTCCTGACGCTTTTCATCGTAACGTTTCAGACGAATCAGCGGTTCAATGAGGCGAGATGCTACCTGCGGGTTACGGCTATTCAGATCGGTCAGCATCTCGACCAGGAACTGGTATCCGCTACCGTCTTGCGCATGGAACGCCGCCGGGTTGCTGCCAGCAAACGCGCCAATTAATGAACGGACGCGGTTCGGGTTGCTCATACTGAAAGAACGGTGTTTGAGCAGGCCGCGTACGGTTTCCAGTACATTTTCCGCCGGGCTTGTGGATTGCAGGATAAACCATTTATCCATCACCAGGCCGTCCTGATGCCACTTATCGTCATACTCCTGCATCAGCGTATCGCGGCACGGCAACTGCGCCGCCACCGCAGCAGACAGGGCCGCCAGCGCATCGGTCATATTATTGGCGTCGCGATACTGTTTGCTGACCAGCGTATTAGCCAGCTCCGTCTCGCCGAACGCCAGGAAGCGCAGGCAAGCATTGCGCAGCGTGCGCTTACCGATATCGCCGTGATCAACACGATACTCATCCAGATGATTGGCGTTATAGATAGCCAGGAACTCATCCGCCAGTTCTGCCGCCAGCGTACGCGTTAGCGCTTCACGAACTTGCGCAATGGCGATCGGGTCAATGACCTCAAACAGCTCCGCAATTTCATTGGCCGAAGGCAGCGTTAAAATTTCTGCGGCCAACGCCGGATCGATTTTCTCATCCAACAGTACTGCACGGAACGCATCAGCGACATGCACCGGAAGCGATAGCGGTTGCCCCTGCTGATGACGCGCCACATTCAGTTTAATGTATGTGGCCAGCAGGCTTTGCGCCGCATCCCAACGGGAGAAATCATTGCGCGCATGGCGCATCAGGAACGTCAACTGCTGATCGCTCCATTTATATTCCAGTTTCACCGGCGCTGAAAACTCGCACAGCAAGGCCGGAACAGGCTGGAAGTAAACATTATCGAAGGTAAATGTCTGCTCCGCCTGCGTGACGTTCAGCACGGCGTTGACCGGGTGACCGCCTTTTTGCAACGGAATGACGTTGCCTTCGTTATCGTACAGTTCGATGGCGAATGGAATATGCAGCGGCTGCTTCTCCGCCTGATCCGCCGTCGCCGGAGTGCGCTGGCTGATGGTCAACGTGTACTGCTCGGTTTCCGGATTATAATCATCTTTTACCGTTACAATCGGCGTGCCGGACTGACTGTACCAGCGGCGGAAATGGGACAAATCGACATTAGAAGCATCTTCCATCGCCTGTACGAAGTCATCACACGTCGCGGCGCTGCCGTCATGGCGCTCAAAATAAAGCTGCATCCCCTTCTGGAAATTTTCCTCACCCAGCAACGTGTGGATCATGCGAATGACTTCCGCGCCCTTTTCATAAACGGTGAGGGTGTAGAAGTTATTCATTTCGATTACTTTATCCGGGCGGATAGGATGCGCCATCGGGCTGGCGTCTTCCGCGAATTGTAAACCGCGCATGGTACGCACGTTACTGATGCGGTTCACCGCGCGTGACCCCAAATCAGAGCTAAACTCCTGATCGCGGAACACGGTTAGCCCCTCTTTAAGGCTCAACTGGAACCAGTCGCGGCAGGTGACGCGGTTGCCGGTCCAGTTGTGGAAATACTCATGGCCTATCACGCGCTCAATATCGAGATAATCTTTATCCGTCGCGGTATCGGTTCGCGCCAGCACGTATTTGGAGTTAAAGATATTGAGACCTTTATTCTCCATCGCGCCCATATTAAAGAAATCCACCGCGACAATCATATAGATGTCGAGGTCATATTCGAGCCCAAAACGCGCTTCATCCCATTTCATGGAATTTTTCAGCGAGGTCATTGCCCACGGCGCGCGATCCAGATTGCCACGGTCAACGTACAGTTCTAATGCGACGTCACGCCCGGAGCGGGTGGTAAAGGTATCGCGCAGCACGTCAAAATCACCGGCCACCAGCGCAAACAGATAACACGGTTTCGGGAACGGATCTTGCCACTGAACCCAGTGACGGCCATTCTCCAGCTCGCCCTGTGCAACACGGTTGCCATTGGAGAGCAGGAACGGATATTTGCTTTTATCGGCAATAATTTTGGTGGTAAATCGCGCCAGTACGTCCGGGCGGTCAAGATACCAGGTAATATGGCGGAAGCCCTCCGCTTCACACTGGGTACAGAGCGCATCGCCGGACTGGTACAATCCTTCCAGCGCCGTATTCGCCGCCGGACTTATCTCGTTGACAATGCGTAACGTAAAACGCTCTGGCAGGTCGCTGATGATAAGCGCGCCCTCTTCTTCCTTATATGCTGTCCACGGCGCATCGTTGACGTGGATAGATACCAGCGTTAAATCTTCCCCATCAAGGCGAAGAGGCGCATCAGGCGCGCTATGACGAACAGCCTGGCTTATTGCGGTGACCACGGTTTTTTCGGCATCGAGGTCAAAGGTCAAGTCAATATCAGTAATCTGGTAATCCGGCGCGCGATAGTCATGGCGGTATTTGGCTTGTGGCTGTTGTGTCATAAAAAACCTTTCGCATCTTTGTGTAGAGTGTCGACTCCAGTCTATTCCTGTTGCGCAAATCGCGCTACGCAGAATGTTCATCTTTTCAGGCACAAACGGCCTATTTGCTACATTTTTATAATATGTACTCATAGTTTTTAAAATCGATAAAGATCGTCCAGGAGCGCTTTAAACAAGGGATGAGCGAGATCGAGATGAAGCTTGATTAACGAACTTTAACGAACTTCACAGACCACTTTTGCCCCATCCATGCCCCACACATAATTTTAGTCAACGCCAGACCCCATCAGCCCTGCAGGGAAAATCGATAACAAACACGCCCGCAGTATGTTCCCCAAAATTTAAAAACAAGTGGTTCATGCCTTGACAGACAAAACCGCCAAAGCAAAAATACTGTATATACAAACAGTTATTAGCGAAACACGTATGTTCGTAGAACTGGTTTATGACAAGCGTAATGTTGAAGGACTCGAAGGGGCCAGCGAGATCATTCTGGCCGAACTGACGAAGCAGGTGCACCAGATTTTCCCTGATGCCGAAGTGAGGGTGAAGCCGATGCAGGCAAACTGCTTGAATAGTGATACCAACAAAAGCGATCGCGAAAATTTGAACAGATAGCTTATTAAAAATAGATTTATCTTAAACCACGTCATTTACATTTAGCCACCTCCCCAAAATCCGGATTCAGCTTAAGAAAAATGCGACAATACAATAAAAACATATCATATAAGCCCCCTCAACAAATGTAATTTTAAGGCCAACAAACACCTCTAACTTATTCACTTTCAATTAATTTCATAAATAATAATTAACAACAAAAGAATTGTATTAATATCCACACTGTAGTATATAATTACATTAACAAAATTACTATTCGGCGAGTATATTATGTTAAGACACATTCAAAATAGTTTAGGCAGCGTTTACAGAAGTAATACAGCAACTCCTCAGGGTCAGATTATTCACCATCGTAACTTTCAAAGCCAGTTTGATACCACAGGCAACACCCTCTACAATAATTGCTGGGTTTGCTCATTAAATGTTATCAAATCCAGAGATGGCAATAATTATAGTGCATTAGAGGACATCACTTCTGATAATCAAGCGTTTAATAATATATTAGAGGGTATTGATATAATAGAATGTGAGAATTTATTAAAAGAAATGAATGTGCAAAAAATACCTGAATCCTCTCTTTTTACAAACATTAAAGAAGCTTTACAGGCAGAAGTTTTCAATAGTACTGTAGAAGATGACTTTGAGAGTTTTATTTCTTACGAATTACAAAACCATGGACCACTGATGTTGATCAGGCCTTCACTTGGCTCGGAATGTCTACATGCAGAGTGCATTGTAGGCTATGATAGTGAAGTGAAAAAAGTATTAATTTATGATTCAATGAATACCTCACCTGAATGGCAATCAAATATTGATGTCTATGACAAGCTTACCTTAGCATTCAATGATAAATATAAAAATGAAGATTGCAGTATTTGTGGTCTTTACTATGACGGTGTTTATGAGCCAAAACCTTTACACTCCTCCTCCTGGAAAGACTGGTGTACCATTTTATGATAGTTAACCTTTACCAAGATAATTATTCAGGCTACCGCCAACATGGGGGGTCGGGGGTCGGAGGTTCAAATCCTCTCGTGCCGACCAAAATTCCCCTTAAAAACCAGCCTGTCAGGGCTGTTTTTTTTATGGCTCAATTTCCTACGGGGAAGCTATGGGGTGAAACTGGGGAATAAAGCCGTCGAGATTCGACGCAATTTGCGATTGATTCATCAGTTTGCTCACTGCTCAGTTTCGGAATTCATCAATACACAAATTTTCATTTCGAATTACTGTATAAATTCCCTGTAAATCATTACCGGAGCGCGCCACATTTTCCCCCTGCCCTATACTTTCAGTCTGACGACTGGAGGTTTCATATGTGTGGACGCTTTGCACAAGCACAGACCCGTGAAGAATATCTGGCATATCTGGCCGACGAAGCCGATCGTAATATTGCTTATGACCCTCAGCCTATAGGCCGGTATAACGTGGCGCCCGGGACTAAAGTCCTGCTATTGAGCGAACGCGACGAGCAATTACATCTCGACCCGGTGATTTGGGGTTACGCTCCCGGATGGTGGGATAAAGCTCCACTTATTAACGCCCGTGTCGCGACAGCGGCCTCCAGCAGAATGTTTAAGCCACTATGGCAGCATGGCCGGGCTATCTGTTTTGCCGATAGATGGTTCGAGTGGAAGAAGGAAGGCGACAAAAAACAGCCGTATTTCATTCACAGAAAGGACGGGAAGCCGATATTCATGGCTGCCATTGGCAGTACGCCGTTTGAGCGCGGTGATGAAGCAGAGGGATTCCTGATTGTTACCTCCGCAGCCGATAAAGGTCTGGTAGACATTCACGATCGTCGCCCGCTGGCACTGACACCGGAAACTGCTCGGGTATGGATGCGCCAGTTCCTGGAACCACATTCTAAGTCAATAACATACCGCGTCATACCTGCGCTCACACGTCCCATGATGCGAAAAGATACCAATCCATGCCAATAGTTAAAAACGGATGACTGTCCCGAATCCGTCCCACCTGCCCTATCCCACAAACCGGCGTTCAGATGCTCATTAAGAAAACCACCTCACCCTCATAACTCAGTAAGCGTCCCGTTTAGGACGTAGCGTAAGGATTATTTTACGGTTTCGAGGTTCCAGGGCAGCAGTTCGTGCACCCGGTTCGATGACCAGTCGCTGATTTTCCACAGCACGTCGCGTAACCATGCCTCGGACTCTACGCCGTTTAGTTTGCACGTACCCAGCAGGCTGTAGATGATCGCCGCTGCCTCGCCGCTCCTGTCTGAGCCGAAGAACAGATAGTTACGTCGGCCCAGCGCCACGCACCGTAAGGCGTTTTCACAGATGTTGTTGTCGATCTCCACCCGACCGTCGCTGCAGAGGACACGCTCAACGCATCACACTGCTTCAGCATGTAACCGAACTCCTTCGCCATCTCCGCATGCACCGACAACGTTTTCAACTGCGCCTGTATCCAGTCGTACAGCGACTGGCTTTTCTCTTTCCTGACCGACAGCCGTGTTTGCGCCGGGCTGTGGACTCTGTCATCTGTGATAGTGTCCATCAAAATTTAAGTGGACACTATCATCGCCGGATTGACAGGGTTCTGACAGACGTCCTCCACGGTGCGCTTACATTTTACCTATTAAGGAATATTTTTGCTTTTTAAAGGTATTAAACCATCTCGGTGATGTAACAAAAACTTTCCCTGCCATAGATTCTGATTCTAATTCTCGTGGTAATGCATCATAGGCATTAGCTGCTATACTTGAATTACTAAAATCAGTATAATAAATAAGTTTCCTTCTTGTTGCCATTCTATATTTACACACCCATTCATCTGCTGAAAGAATTAATGGGCCATTCAGATTACTCATACCTCTATTTTCAAACTGATGGGCTGAAACATCAAACACATAGTCTTTCCCTTCTTTATTTCCAACCACTGCAAAATGATTTGTTGGTATTTCCTCTGTTGGTTTATCCCAGATAAATATACCTCGATAACGAATATTATCGAACCCTTTTTCATTCATAAAATTGCTTACAGGAGTCATTAATGACTCACACTGCCCTACCGGATTCATTATTTTATTATTTATAATTGGATTCTGTTTCAATTCCTCCAGATAGGCCGCAGCATCAATATCACTGGTTAGGTTGTAAGTTATATCTGTGCGTTCCACTCCCGGTTCTGTTGCCATGGTTAAGTGATGTGTTTCACTGTACCCCTGGCAATTTACGGTATAGTTCCCGACGTCATCCAGGGTGACTGATAATATCTCCTGACTGTCTTCATCCAGAATACAGAAGTAGTTTTCCCCGTGCAGGCCGGAATGAATGTTTTCCTCCCATCCGTCATACGCGAGCGTCCTGAGCAGTTCAAATCTGCTGACCACATCCTCCCGCGTCGTTCCGGCCGGCGGGTGACAAATCGTCCAGATGCACTCCAGCGCTTCAGTCTGGTGCGTTGAGCAAAAAAATTCCTTCATTTTTTCCCAGGAACTCATTTCAGGGGGGGTATCAGACCAGGCAATACGATAAATGCGGCGGTTACTGATGATGGCGGGAAGACATCCGCTTCCAATATGAAAGGGCATAACAAAAAACCTTTATAAATTTACATATAGTATCTGTCCGACAGACATCATCTCTTCCTGGTCTTAATTTCACAATAAGGTTATCGGCGGATTCATGGTCGTCCTGCCATGGCGGGCTTCAGAAGGTGCAGAAGAAAAATCCGTTATGATGACCGGATGGCGGGACTGTCATTTTACAGCTAAAGTGTCGATTTTTTCAGGGTCGCTTTCCACGATGACCAGATCATCCGGCATCAGCGCCCGGGCGTTCAATTTTGGCGGGCAGAAGTCACCCGGCAGAAATTACTTAACGATGCAGATAATGCCATTAAGGACTGGCGCACAGAATTAACGTTGGGAATTATCAGTGATGAAAATAAAGCAGCTTTGATTCTGCCGATGAATTATATCAATGTTCTTAAATCGCTGGACTTAACAGGTGTTTCAGATGAGGCCACCTTCACAGCAATCAGGTGGCCTGCATTACCACAGTAACGCCTACTGGCTGGCTGGTCTTTCCGGCCAGTCAGGGGCGGCTGTATCCATCCGGTTTACCAGCACCCTATATTTTTTCCATTCGTCGAGCCGCGCTTTCTCATCATCTGTTGCGATTCCAAGATCAACTGCATCCTGCAATGGCGCGATTTTTTCAGATGCCATTTGCAAAAGACGGCTTTTGGTTTCTTCCGCCTGACGAAGCTGCGCTGCTTTTTCAGCCGCTTCGTCTTTTACCCAGACCTTAGCCTTACCATCCCATTTCTGGTATTCACCACCTGGTGAAACTGATGTGACATTTTCGGGCAACGGACCAGGAGCGGAGATATAAACCTGATTGCCGGTTGTTGTGTCGTAAACCATCTCGCCGCGGTGATCCTCATGCAAACTCCATGTCTGGGTTTCAGCGTCAAATATAGCAATATGACTGGCGGGAATATCAGGAGGGGCGATATCAGTACAGTTTGCCGGTAGTCCTGTGTGCGGCGGAATATACGCATCACCTGCACCAATAAATTCGTTAGTATCTGAACGCAGATTGAAAATTTTAATTGTCTGCGCCTGTTCGCTCATTTTAAAAGTCATTATGCCAGCCTCACTATGTAGTTAAATGCAATGTTTTTAACCGTGGTTTCCGCATTACCGTCTGCGTCCACAATAACGACGTGTCCGTGTGGACCTATATACATGGTGTGCTCATGTCCTCCGATATAAACTGTATGTGCATGGTCGCCAGCGGCCTGTGTCCATGCACCACCTCCTGGCTGAAATGAGGTGTGATTGGAATCTCCCCAGTATGAATTGATATAACCGCCGAACTGGTGAGTATGATTGCCCGTGGTATTGGTCGATTTCGTGCCGTAATCAAAGGATGAGGTAGATTTTGTCCCTAAGTCAGTAACCTGCGCCCGCGCGGTGTGCGAGTGCGATTTATTGCCGTCCATTTCTTGCGACAATACGGCACGTCCACTGATGGGCTTACCCTTTATTGTCCAGCCTCTCATGTCAGGGATAACGCCGGACGGATACGCTATAGCCAGTAACGGGTAAGCAGATTTATCGAAGGACTGCCCCTGCATCAGAGCGTAACCTGCCGGAGTAGCATCAGATGGCCATGCAATCGCCGCCCCTACTGGATGCGAATCCGGAGGTGGGTTTAGTGTGGTGTAAAGCATTGCCCATTCGGACCACTCAGCATCGGCGGTATCTCGATGGCTGCGAATATATGCGGGCGCTGGCGCACCATTTGTCCCGCTCCAGCCAATGAGGATTTCCCCATCACCGGTTCCGGTCAGACGTAAAATATTTCCGTATTGCGTCGGATAGCCATTGTTGTAAACCTCGCCCATTATCAGGCCACTATCGCTGCCTCTTGTCGTACCAGTCAGTGCCGGAAGCGCGCCGCGTGATGCCAGTCTGTTCGCTGCAACAGCCGTACCTGATGCAGGGAGCGCTCCGATATTTTGTACAAACAGCGGCTTATTCGGGATATCCGCACCACACTGGCTTTTAGCCATGTAGTTTTCATCACTTTCGGTTTTGCTGTAGACCTCAAGACTGGAGCGACCTTTGGCCTTATCCGGTACGTCGGACAGGTTCTGGTCCTTCTGCAAATACCGTGATCCCAAATAAATCTCCAGGGGATTAAGCACAGAAAAATAGGTTTTTGTATTATCCAGAACGCATAAGACAGGAATATCTTTAATAATATCATTGGCCGATAACTCTGCTTTATTCCCCTTGTATAGTGGGAATATGCCAAGCACACGTCCTCCCATCGTCAGTTGCAGAGTGCTGGCTCCGGTATTGTTTAGCGCCGGAATAACCACAAGTGGAGTGCGCAATGTCCAGTCAACTCCACCATTGACGAAATAAGTTGCTGGTAACTCCAGCGTCAGATTATTTTCTGTACCTCCGGCCACACCAGCGACATAATGCCCACTCTGGAGCTCTTCAATTTGTACAAACTGATTTTCAGATCCTCGCGTCGCAAAATTCGCTATAACGTCATTCAGTGACCATCCCTTCGCTGTTGTACCTTCCTGACCGCGAATAACCGTCAGCATGTCATTATTAACTGCTGTCAGATGGCATACCTCAAAAACTGTTTCTTTTGCGTCTGTCAGTGTAATTTTGGCGTAAGTTTTAAGAGGGTTTGAGCTGTTTGCATAATCGCTGGTCAGCAAATTAGCAAACATCGCTCCCACACCAGGCATCACCTGAATGGTCGTCTGGCTGGCGGTAATATCAGCCGCCAGTGAGGAGACGACATTATTTCCGAATCCAATAATCATTGCTCAACCACCGTTACCGAATAGGTATAAATAAAAGGGAGTTTCACCAGCGACTGGTCAATTGCATCTTTAAGAAAGTGTCCGACACCATCGCCATAGTCAGGAATGGAGACAAAAAAAATGCCCTTATCGGGCATTACACTAATATCAAAAGTGGACTGTACAGGTGGGTCTATTCCGTTAGCTCCATGTATAAAGCGTGCAAGCCGTCGTTTGAACCAGTTGATACAGAAGTGCGAACCATCGCCTTTATAAAAATTCCATGTCAGTATCCGTTTAAAATAGTCGTCCGGAACATATGACGCTGAGCCGGGAACATAATTTCTCAGTTTTGCATACGCGACATTATTGTACTCAATAGTGTTATACGCCCCACGAGCAATGGCATCCTCGGAGATTTGAAGCAAGGGGCGTGATTCCCCATAAATACCCGCCGCAATCCAGTCCAGCAACTCACCGGTAATCGCCGGGGAGGTCCAGCAAGGTAAATTCAGGTTGTTAAAGTAATCAAGATACCCCTGTGCCAGTTTGTTATAAGCATCAAAAAAGGCAACTATATCCGGATCGTCATTATATTGCGTATAGGGGTAGGCCGGAATAATGCTTTCAAGAAGAGCTGCCATATTGCTTAACCTGAATTTGTGAAGATGAAGTGGAAAAATAGGCGTAAGTATCACCATAAACCAGGCTGGAGTCGGTTGCAGGTGGGACAATTTTTCCGTTTATTCCAACCTGAATATCAATCATTGATACAAGGTTTGAAGATACAAGCCCCTTAACCTGATTAAGAAAAATATCCCGAATCAGGAAAATGTTTATTGGTTCACCCGTTGCAATTCCGTTAATGTAATCAGCAATGCTTTGCTGCACTGCTTTTTCAATCCCGGTTGGATCGATATAGCTGGTTGAGGCTGTATTCCAGGTGATTAAAAGCGTAACGTTTTGTGATGATGGCACTACAAACGGCACGTGATACGTATCCGGATACACAATGATCGGTATCGTTTTTTTATCCACCGCAGCGCCTGATGGATTCACTACATCATTCGTCAGTACGGAGATATCTGGCACGGCTTTATAGATAGCGTAAGCCACTTCATAAGGATCGCCGCCACCAGCAATCGCTACCCATGCCCCCAGCGATGCCTGTCGGTATGAGATCAGATTCTCCTGTACACCATAAACATTTTTCAGTTCAATCCGGTAACAGTCAGGCGTTCCCTGTACACCGTACATACAGGCATAAGGGGAAAATCATGGAAAACACAAATATTGTTACCACTGAGCAGCAGGCACCAAACACCATTTCTGCCAGTAACGCAATTTTTAACGTTCAGGCACTGGGTCAGTTAACAGCTTTCGCTAACCTGATGGCAGACTCACAGGTGACGGTACCGGCACACCTTGCAGGGAAACCAGCCGACTGTATGGCTATCGTCATGCAGGCTATGCAATGGGGCATGAACCCTTATGCATGCTGGTCATTATCAGCTGGTGCTAACCCAAACTTTATAGCAACGCAGATGGGGCATACCGATGCACAGATGGTTTACAAGGTGTATGGAAAGTGGATGTCAGAGAAGAGCGCAGAACAGGTTTCTCTGCTCAACCAGGCACTTTCCCGCTATGCCCCATCACTGCCCCAAAGCATGGTAGCAGCGCAGTAG
Protein sequences of DBSCAN-SWA_7 >NZ_CP041973|1927944:1968640|1931450_1931999_-|WP_000859416.1|DBSCAN-SWA MKTVFSLTAAAMMALSGGVSAASAFSLSSADIPADFRLTQQHVFKGFGCSGENISPQLSWRNPPAGTKSYAITVFDPDAPTGSGWWHWTMVNIPAQIHDLPTGADKKTLPAGVVQGRNDFGYAGFGGACPPPGDKPHRYQFTVWALNTATLPLDSESSGALVGFMLNAHVIAKAKFTATYGR >NZ_CP041973|1927944:1968640|1946784_1947348_-|WP_000759136.1|DBSCAN-SWA MKKWLVVIMAFWLASCSSGGENKSYYQLPIAQSGVQSTASQGNRLLWVEQVSVPDYLAGNGVVYQTSDVQYVIANNNLWASPLDQQLRNTLVANLSARLPGWVVASQPLGTTQDTLNVTVTGFHGRYDGKVIVSGEWLLNHNGQLIKRPFHIEASQQKDGYDEMVKVLASAWSQEAAAIADEIKRLP >NZ_CP041973|1927944:1968640|1940464_1941070_-|WP_001202375.1|DBSCAN-SWA MRALSYDRIYKSQEYLASLGTIQYRSLFGSYSLTVEDTVFAMVANGELYLRACEESVPYCVKHPPAWLMFMKCGRPVMLNYYRVDESLWRDQQQLVRLSKYSLDAAMKEKHSRILQHRLKDLPNMTFHLETLLNESGIKDENMLRILGAKMCWLRLRQSNPLLTVKVLYALEGAIVGVHEAALPASRRQELADWAHSLTAG >NZ_CP041973|1927944:1968640|1947344_1948985_-|WP_000433414.1|DBSCAN-SWA MEPKKGEAKVQKVKNWSPVWIFPIVTALIGAWILFYHYSHQGPEVTLITTNAEGIEGGKTTIKSRSVDVGVVESATLTDDLTHVQIKARLHSGMEKLLHKDSVFWVVKPQVGREGISGLGTLLSGAYIELQPGSKGSQPESYQLLDSPPLAPPDAKGIRVILDSKKAGQLSPGDPVLFRGYRVGSVETSSFDPQKRTMSYQLFIKAPNDRLVTSNVRFWKDSGIAVDLTSAGMRVEMGSLTTLFGGGVSFDVPEGLEQGQPVAEKTAFNLYDDQKSIQDSLYTDHIDYLMFFKDSVRGLQPGAPLEFRGIRLGTVSKVPFFASKMRQVFNDDYRIPVLVRIEPERLKAQLGENADVGAHLTELLKRGLRASLKTGNLVTGALYVDLDFYPKEPPITGLREFDGYEIIPTVSSGLAQIQQRLVETLDKINNLPLNPMIEQATNTLSESQRTMRRLQTTLDNMNKITSSQSMQQLPADMQTTLRELNRSMQGFQPGSAAYNKMVADMQRLDQVLRELQPVLKTLNEKSNALVFEAKDKKDPEPKRAKQ >NZ_CP041973|1927944:1968640|1930315_1930597_-|WP_000072884.1|DBSCAN-SWA MSNVCIIAWVYGRVQGVGFRYTTQHEAQRLGLTGYAKNMDDGSVEVVACGDAAQVEKLIKWLKEGGPRSARVDKILTEPHSPRETLTGFSIRY >NZ_CP041973|1927944:1968640|1933482_1933800_+|WP_000561983.1|DBSCAN-SWA MIASKFGIGQQVRHSLLGYLGVVVDIDPEYSLDEPSPDELAVNDELRAAPWYHVVMEDDDGQPVHTYLAEAQLRSEMRDEHPEQPSMDELARTIRKQLQAPRLRN >NZ_CP041973|1927944:1968640|1934431_1935163_+|WP_151256218.1|DBSCAN-SWA MKTGALATFLALCLPVTVFATTLRLSNEVDLLVLDGKKVSSSLLRGAESIELENGPHQLVFRVEKTIRLPGNEERLYISPPLVISFDTQLISQVNFQLPRLENEREASHFNAAPRLALLDGDAMPIPVKLDILAITSTAKVVDYEIETERYNKSAKRASLPQFATMMADDSTLLSDVSELDTVPPQSQTLTEQRLKYWFRLATRRHAIIFCNGRKNSRPLDMSCPSAQFFSLSLAASSASSNL >NZ_CP041973|1927944:1968640|1960916_1961603_+|WP_001525490.1|DBSCAN-SWA MLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNCWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL >NZ_CP041973|1927944:1968640|1960454_1960646_+|WP_000497441.1|DBSCAN-SWA MFVELVYDKRNVEGLEGASEIILAELTKQVHQIFPDAEVRVKPMQANCLNSDTNKSDRENLNR >NZ_CP041973|1927944:1968640|1930645_1931425_-|WP_000548080.1|DBSCAN-SWA MHIRHQDLTTAEVRSSHLHRLHRVTLFSAAICHITQGSKVIIQDDSRLVAGPGELIIIPANTPLEIINQPAQNGFRSDLLLLSPEIIARFKTMYVQDYPPANLTSLCTPMSRSLTFMWENVLDAVRQGLPVGLQEHQAMGLLLALLHDGAAGPLLIERRYTLTEQVRQLIMLSPAKLWTAQEIARRLAMGTSTLRRRLQRESQSYRQIIEEVRMSCALSQLQSTTLPIGEIALRCGYLSGSRFTARFRQHYGCLPSQVR >NZ_CP041973|1927944:1968640|1955489_1956032_-|WP_001574119.1|DBSCAN-SWA MRIKPDDNWRWYYDEEHDRMMLDLANGILFRSRFSRKMLTPDAFCPTGFCVDDAALYFSFEEKCRDFELTKEQRAELVLNALVAIRYLKPQMPKSWHFVAHGEMWTPGTGDAASVWLSDTAEQVNLLVVEPGENAALCLLAQPGVVIAGRTMQLGDAIKIMNDRLKPQVHCHSFSLEQAV >NZ_CP041973|1927944:1968640|1965263_1966973_-|WP_000583382.1|tail|DBSCAN-SWA MIIGFGNNVVSSLAADITASQTTIQVMPGVGAMFANLLTSDYANSSNPLKTYAKITLTDAKETVFEVCHLTAVNNDMLTVIRGQEGTTAKGWSLNDVIANFATRGSENQFVQIEELQSGHYVAGVAGGTENNLTLELPATYFVNGGVDWTLRTPLVVIPALNNTGASTLQLTMGGRVLGIFPLYKGNKAELSANDIIKDIPVLCVLDNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCGADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPAYIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHTARAQVTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGYINSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA >NZ_CP041973|1927944:1968640|1938324_1940478_+|WP_000950876.1|DBSCAN-SWA MLSPLIRRYTWNSTWLYYIRIFIALCGTTALPWWLGDVKLTIPLTLGMVAAALTDLDDRLAGRLRNLIITLICFFIASASVELLFPWPWLFALGLTLSTSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYDHWYQQPLLLLAGAVWYNLLTLTGHLLFPIRPLQDNLARSYEQLAHYLELKSRLFDPDIEDESQAPLYDLALANGQLMATLNQTKVSLLSRLRGDRGQRGTRRTLHYYFVAQDIHERASSSHIQYQTLRDYFRHSDVMFRFQRLMSMQAQACTQLARCILLRTPYQHDPRFERVFTHIDAALERMRASGASLELLNTLGFLLTNLRAIDAQLATIESEQAQAMPRNESENQLADDSLHGFSDIWLRLSRNFTPESALFRHAVRMSLVLCIGYALIQITGMRHGYWILLTSLFVCQPNYNATRHRLALRIIGTLVGVAIGLPILWFVPSLEGQLVLLVITGVLFFAFRNVQYAHATMFITLLVLLCFNLLGEGFEVALPRVVDTLIGCAIAWAAVSFIWPDWRFRNLPRVLQRATDANCRYLDAILEQYHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPDVTAETREAAFRLLCLNHTFTSYISALGAHREKLSNPDVLGLLDDAVCYVDDALHHQPEDEQRVHQALEGLKQRVQSLETRPDSKEPLVVQQIGLLIALLPEIGRLQRQISPPTSTLITQP >NZ_CP041973|1927944:1968640|1948989_1950243_-|WP_000333139.1|DBSCAN-SWA MCEHHHAAKHILCPQCDMLVALPRLSHGQKAACPRCGATLTTEWDAPRQRPTACALAALFMLLLSNLFPFVNMNVAGVTSEVTLLEIPGVMFSEDYASLGTFFLLFVQLVPAFCLVTILLLVNRASLPLSVKKTLARIFFLLKSWGMAEIFLAGVLVSFVKLMAYGDIGIGSSFIPWCLFCLVQLRAFQCVDRRWLWDDIAPQPALAQPLTPGITGIRQSLRSCACCTAILPAESLVCPRCHTKGYVRRKNSLQWTLALLFTSIMLYLPANILPIMITDLLGSKMPSTILAGVILLWSEGSYPVAAVIFLASIMVPTLKMIAIAWLCWDAKGHGKRDSERMHFIYEVVEFVGRWSMIDVFVIAVLSALVRMGGLMNIYPAMGALMFALVVIMTMFSAMTFDPRLSWDREYEPGHEES >NZ_CP041973|1927944:1968640|1950257_1952165_-|WP_000053044.1|DBSCAN-SWA MSLISMHGAWLSFSDAPLLDNAELHIEDNERVCLVGRNGAGKSTLMKILNREQGLDDGRIIYEQDLIVARLQQDPPRNIAGSVYDFVAEGIEEQAEYLKRYHEISRLVMTDPSEKNLNEMARVQEQLDHHNLWQLENRINEVLAQLGLDPNAALSSLSGGWLRKAALGRALVSNPRVLLLDEPTNHLDIETIDWLEGFLKTFNGTIIFISHDRSFIRNMATRIVDLDRGKLVTYPGNYDQYLLEKEEALRVEELQNAEFDRKLAQEEVWIRQGIKARRTRNEGRVRALKAMRRERSERREVMGTAKMQVEEATRSGKIVFEMENVDYQVEGKQLVKDFSAQVQRGDKIALIGPNGCGKTTLLKLMLGQLQADSGRIHVGTKLEVAYFDQHRAELDPEKTVMDNLAEGKQEVMVNGKPRHVLGYLQDFLFHPKRAMTPVRALSGGERNRLLLARLFLKPSNLLILDEPTNDLDVETLELLEELIDGYQGTVLLVSHDRQFVDNTVTECWIFEGGGKIGRYIGGYHDARAQQEQHLATKQPMAKKNEEVIAPKAEIVKRGSSKLSYKLQRELEQLPGQLEDLEAKLEALQAQVADAAFFSQPHEQTQKVLADLSQAEQELEQAFERWEYLEGLKNGA >NZ_CP041973|1927944:1968640|1964682_1965264_-|WP_000143167.1|tail|DBSCAN-SWA MTFKMSEQAQTIKIFNLRSDTNEFIGAGDAYIPPHTGLPANCTDIAPPDIPASHIAIFDAETQTWSLHEDHRGEMVYDTTTGNQVYISAPGPLPENVTSVSPGGEYQKWDGKAKVWVKDEAAEKAAQLRQAEETKSRLLQMASEKIAPLQDAVDLGIATDDEKARLDEWKKYRVLVNRMDTAAPDWPERPASQ >NZ_CP041973|1927944:1968640|1927944_1928625_+|WP_000938186.1|protease|DBSCAN-SWA MLPVTYRLIPQSGVSTYGLNTADTPVFPDIPEHAPNPSRLRLAHDSLAINSEFRLEPECVVEYLISGAGGIDPDTEIDDDTYDECYDELSSVLQNAYTQSETFRRLMSYAYEKELHDVEQRWLLGAGEAFETTVAQEHFKLSEGRKVICLNLDDSDDSYTEHYESNEGRQLFDTKRSFIHEVVHALTHLQDKEENHPRGPVVEYTNIILKEMGHPSPPRMVYIFNK >NZ_CP041973|1927944:1968640|1967579_1968209_-|WP_000274547.1|DBSCAN-SWA MYGVQGTPDCYRIELKNVYGVQENLISYRQASLGAWVAIAGGGDPYEVAYAIYKAVPDISVLTNDVVNPSGAAVDKKTIPIIVYPDTYHVPFVVPSSQNVTLLITWNTASTSYIDPTGIEKAVQQSIADYINGIATGEPINIFLIRDIFLNQVKGLVSSNLVSMIDIQVGINGKIVPPATDSSLVYGDTYAYFSTSSSQIQVKQYGSSS >NZ_CP041973|1927944:1968640|1941286_1941796_+|WP_000288733.1|DBSCAN-SWA MYTSGYANRSSSFPTTTHNAARTATENAAAGLVSEVVYHEDQPMMAQLLLLPLLRQLGQQSRWQLWLTPQQKLSREWVQSSGLPLTKVMQISQLAPRHTLESMIRALRTGNYSVVIGWMTEELTEEEHASLVEAAKVGNAVGFIMRPVRAHALSRRQHSGLKIHSNLYH >NZ_CP041973|1927944:1968640|1957415_1960028_-|WP_000193790.1|DBSCAN-SWA MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHSAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGALIISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGRDVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLDEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRVRSLIGAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDEKRQEKMRAALEQLKGLENLSGDLYEKITKALA >NZ_CP041973|1927944:1968640|1952177_1954286_-|WP_001086485.1|DBSCAN-SWA MNSLFASTARGLEELLKTELEKLGAVGCQVVQGGVHFQGDTRLIYQSLMWSRLASRIILPMGECKVYSDLDLYLGVQAINWTEIFNPGATFAVHFSGLNDTIRNSQYGAMKVKDAIVDAFTRKNLPRPNVDRESPDLRINVWLNKETASIALDLSGDGLHLRGYRDRTGLAPIKETLAAAIVMRSGWQPGTPLLDPMCGSGTLLIEAAMWATDRAPGLHRGHWGFSGWAQHDETIWQEVKAEAQTRARKGLAEYSSHFYGSDSDARVIERARSNARRAGIGELITFEVKDVAQLSNPLPKGPYGTVISNPPYGERLDSEPALIALHSLLGRTMKNQFGGWNLSLFSASPDLLGSLQLRADKQFKAKNGPLDCVQKNYHIAETTADSKPATVAEDYANRLRKNLKKLEKWARQEGIECYRLYDADLPEYNVAVDRYGDWAVIQEYAPPKTVDAQKARQRLFDIIAATLSVLGIPPNKLVLKTRERQKGKNQYQKMSEKGEFLEVSEYNARLWVNLTDYLDTGLFLDHRIARRMLGEMSKGKDFLNLFSYTGSASVHAGLGGARSTTTVDMSRTYLEWAERNLRLNGLSGRAHRLIQADCLGWLREANEQFDLIFIDPPTFSNSKRMEESFDVQRDHVALMKDLKRLLRKGGTIMFSNNKRGFRMDLEGLAELGLTAQEITQKTLSPDFARNRQIHNCWLIRAA >NZ_CP041973|1927944:1968640|1933844_1934261_-|WP_000975204.1|DBSCAN-SWA MMKETDIADVLTSTRTIALVGASDKPDRPSYRVMKYLLEQGYHVIPVAPKVAGKTLLGQQGYDTLADIPEKVDMVDVFRNSEAAWGVAQEAIAIGAKTLWLQLGVINEQAAVLARDAGMTVVMDRCPAIEIPRLGLAK >NZ_CP041973|1927944:1968640|1964430_1964679_+|WP_072100753.1|tail|DBSCAN-SWA MRHQRPGVQFWRAEVTRQKLLNDADNAIKDWRTELTLGIISDENKAALILPMNYINVLKSLDLTGVSDEATFTAIRWPALPQ >NZ_CP041973|1927944:1968640|1942152_1943205_+|WP_001674965.1|DBSCAN-SWA MKKTAIAIAVALAGFATVAQAAPKDNTWYAGAKLGWSQYHDTGFIHNDGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGDNINGAYKAQGVQLTAKLGYPITDDLDVYTRLGGMVWRADTKSNVPGGPSTKDHDTGVSPVFAGGIEYAITPEIATRLEYQWTNNIGDANTIGTRPDNGLLSVGVSYRFGQQEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKSTLKPEGQQALDQLYSQLSNLDPKDGSVVVLGFTDRIGSDAYNQGLSEKRAQSVVDYLISKGIPSDKISARGMGESNPVTGNTCDNVKPRAALIDCLAPDRRVEIEVKGVKDVVTQPQA >NZ_CP041973|1927944:1968640|1943914_1945675_+|WP_000156448.1|protease|DBSCAN-SWA MTITKLAWRDLVPDSESYQEIFAQPHATDENDTLLSDTQPRLQFALEQLIQPWASSSFMLTKAPEEQEYLTLLSDAVRALQTDAGQLTGGHYDVSGHTVHYRAAQNAQDNFATVTQVVSADWVEAEQLFGCLRQYNGDIILQPGLVHQANGGVLIISLRTLLAQPLLWMRLKAIVSRERFDWVAFDESRPLPVSVPSMPLKLKVILVGERESLADFQEMEPELAEQAIYSEFEDNLQIADAEAMTLWCQWVTRIALRDNLPPPAPDAWPVLIREAVRYTGEQDTLPLCPLWIARQFKEASPLCEGDTCGAEALSLMLARREWREGFLAERMQDEILQEQILIETEGERVGQINALSVIEFPGHPRAFGEPSRISCVVHIGDGEFNDIERKAELGGNIHAKGMMIMQAFLMSELQLEQQIPFSASLTFEQSYSEVDGDSASMAELCALISALANVPVNQNIAITGSVDQFGRAQPVGGLNEKIEGFFAICEQRELNGKQGVIIPAANVRHLSLKSELLQAVKEEKFTIWAVDDVTDALPLLLNLVWDGEGQTTLMQTIQERIAQATQQEGRHRFPWPLRWLNAFIPN >NZ_CP041973|1927944:1968640|1937859_1938306_+|WP_001261222.1|DBSCAN-SWA MRTVLNILNFVLGGFATTLAWLLATLVSIVLIFTLPLTRSCWEITKLSLFPYGNEAIHVDELNPAAKSVLMNTGGTLLNIFWLLFFGWWLCLMHIASGIAQCVTIIGIPVGIANFKIAAIALWPVGRRVVSVETARAAREANARRRFE >NZ_CP041973|1927944:1968640|1943276_1943729_-|WP_000877172.1|DBSCAN-SWA MKYQQLENLESGWKWKYLVKKHREGELITRYVEASAAQEAVNLLLALENEPVRVNVWIDRHMNPALLNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFMVWQRLAGLAQRRGKTLSETIVQLIEDAEHKEKYATQMTTLKQDLQALLGKK >NZ_CP041973|1927944:1968640|1968229_1968640_+|WP_001676370.1|DBSCAN-SWA MENTNIVTTEQQAPNTISASNAIFNVQALGQLTAFANLMADSQVTVPAHLAGKPADCMAIVMQAMQWGMNPYACWSLSAGANPNFIATQMGHTDAQMVYKVYGKWMSEKSAEQVSLLNQALSRYAPSLPQSMVAAQ >NZ_CP041973|1927944:1968640|1946361_1946529_-|WP_001537784.1|DBSCAN-SWA MKRQKRDRLERAHQRGYQAGIAGRSKEMCPYQTLNQRSYWLGGWRQAMEDRAVMA >NZ_CP041973|1927944:1968640|1932213_1933425_+|WP_000140478.1|DBSCAN-SWA MTESTFPQYPRLVLSKGREKSLLRRHPWVFSGAVSRLEGKANLGETIDIVDHQGKWLARGAWSPASQIRARVWTFDKAESIDIAFFTRRLRQAQQWRDWLAKKDGLDSYRLIAGESDGLPGVTIDRFGHFLVLQLLSAGAEYQRAALISALQTCDPDCAIYDRSDVAVRKKEGMALTQGPVTGELPPALLPIEEHGMKLLVDIQGGHKTGYYLDQRDSRLATRRYVENQRVLNCFSYTGGFAVSALMGGCRQVVSVDTSQDALDIARQNVELNQLDLSKAEFVRDDVFKLLRAYREHGEKFDVIIMDPPKFVENKSQLMGACRGYKDINMLAIQLLNPGGILLTFSCSGLMTSDLFQKIIADAAIDAGRDVQFIEQFRQAADHPVIATYPEGLYLKGFACRVM >NZ_CP041973|1927944:1968640|1963236_1964205_-|WP_001674638.1|DBSCAN-SWA MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEELKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENRGMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIGKM >NZ_CP041973|1927944:1968640|1966969_1967596_-|WP_000729406.1|DBSCAN-SWA MAALLESIIPAYPYTQYNDDPDIVAFFDAYNKLAQGYLDYFNNLNLPCWTSPAITGELLDWIAAGIYGESRPLLQISEDAIARGAYNTIEYNNVAYAKLRNYVPGSASYVPDDYFKRILTWNFYKGDGSHFCINWFKRRLARFIHGANGIDPPVQSTFDISVMPDKGIFFVSIPDYGDGVGHFLKDAIDQSLVKLPFIYTYSVTVVEQ >NZ_CP041973|1927944:1968640|1935187_1935646_+|WP_000424187.1|DBSCAN-SWA MELTTRTLPTRKHIALVAHDHCKQMLMNWVERHQPLLEKHVLYATGTTGNLIQRATGMDVNAMLSGPMGGDQQVGALISEGKIDVLIFFWDPLNAVPHDPDVKALLRLATVWNIPVATNVSTADFIIQSPHFNDAVDILIPDYARYLAERLK >NZ_CP041973|1927944:1968640|1956197_1957208_-|WP_000291723.1|DBSCAN-SWA MYYPFVRKALFQLDPERAHEFTFQQLRRITGTPLEALVRQKVPTKPVTCMGLTFKNPLGLAAGLDKDGECIDALGAMGFGSLEIGTVTPRPQPGNDKPRLFRLVDAEGLINRMGFNNLGVDNLVENVKKAHFDGILGINIGKNKDTPVENGKDDYLICMEKVYAYAGYIAINISSPNTPGLRTLQYGDALDDLLTAIKNKQNDLQAIHHKYVPVAVKIAPDLCEEELIQVADSLLRHNIDGVIATNTTLDRSLVQGMKNCQQTGGLSGRPLQLKSTEIIRRLSQELKGQLPIIGVGGIDSVIAAREKIAAGATLVQIYSGFIFKGPPLIKEIVTHI >NZ_CP041973|1927944:1968640|1935681_1937736_-|WP_000420505.1|DBSCAN-SWA MELKATSLGKRLAQHPYDRAEILNAGVKVSGDRHEYLIPFNQLLAIHCKRGLVWGELEFVLPEDKVVRLHGTEWSETQQFHRYLDAHWRRWSQEMSDVAAQALQEQWARISERTGGNQWLTRERVRGLEHEIRQTFAALPLPVSRLEEFAHCREIWRKCLAWLQDSEGSRQQHNQAYADAMLEAHADFFTQIESSPLNPSQARAVVNGESSLLVLAGAGSGKTSVLVARAGWLLARGQADAGQILLLAFGRKAAEEMDERIRERLHTEEITARTFHSLALYIIQQGSKKAPVVSKLESDATARHQLFLRTWRQQCSEKKAQAKGWRQWLEEEMQWVVPEGNFWDDETLQRRLAPRLDRWVSLMRMHGGAQAEMIAGAPEECRELFGKRIKLMAPLLKAWKSALKAENAVDFSGLIHQAMVILEKGRFISPWKHILVDEFQDISPQRAALLEALRKQNSQTTLFAVGDDWQAIYRFSGAQLSLTTAFHQTFGEGEHCHLDTTYRFNSRIGDIANRFVQQNPHQLKKPLNSLTPGDKKAVTLLDESQLDALLDKLSGYAKEDERILVLARYHHLKPASLQKAATRWPKLQIDFMTIHASKGQQADYVILVGLQEGNDGFPAPARESIMESALLPQVEDFPDAEERRLLYVALTRARARVWLLFNKDNPSRFVEALKQLDVPVARKP >NZ_CP041973|1927944:1968640|1961962_1962589_+|WP_000334547.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRNIAYDPQPIGRYNVAPGTKVLLLSERDEQLHLDPVIWGYAPGWWDKAPLINARVATAASSRMFKPLWQHGRAICFADRWFEWKKEGDKKQPYFIHRKDGKPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLALTPETARVWMRQFLEPHSKSITYRVIPALTRPMMRKDTNPCQ >NZ_CP041973|1927944:1968640|1945743_1946262_+|WP_000227928.1|DBSCAN-SWA MVDKRESYTKEDLLASGRGELFGAKGPQLPAPNMLMMDRVVKMTETGGNFDKGYVEAELDINPDLWFFGCHFIGDPVMPGCLGLDAMWQLVGFYLGWLGGEGKGRALGVGEVKFTGQVLPTARKVTYRIHFKRIVNRRLIMGLADGEVLVDGRLIYTAHDLKVGLFQDTSAF >NZ_CP041973|1927944:1968640|1929243_1929903_+|WP_000374046.1|protease|DBSCAN-SWA MDRIITSSRDRSSLLSTHKVLRNTYFLLSLTLAFSAITATASTVLMLPSPGLILTLVGMYGLMFLTYKTANKPVGILSAFAFTGFLGYILGPILNAYLSAGMGDVIGLALGGTALVFFCCSAYVLTTRKDMSFLGGMLMAGIVVVLIGMVANIFLQLPALHLAISAVFILISSGAILYETSNIIHGGETNYIRATVSLYVSLYNIFVSLLSILGFASRD >NZ_CP041973|1927944:1968640|1929989_1930319_+|WP_000904449.1|DBSCAN-SWA MLIFEGKEISTDSEGYLKETTQWSETLAVAIAANEGIELSAEHWEVVRFVREFYLEFNTSPAIRMLVKAMANKFGEEKGNSRYLYRLFPKGPAKQATKIAGLPKPVKCI |
39 | Salmonella_phage(28.57%) | tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2039917 : 2047230
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP041973|2039917:2047230|DBSCAN-SWA TTTGCATAAGTATCTCTCGGTAGTAAAAAAGCACCGAGTTCCTCTGTCTGATGCTGCTGTTGATTTGTTAAAAGATTTACCACGATTAAAAGATAACAATCATGTATTCCCTGCCCCTCGCGCTGAAACACTTTCTGATATGTCGTTATTGGCTGTATTGAAACGAATGGGATATATCGACTTAACGCAACATGGCTTCCGTTCTACTTTCCGTGAGTGGGCTGGTGAAGCAACGGATTATCAACGTGAGGTTATTGAACATGCGTTGGCGCACCAGTTGGCAGATAAGGCTGAAGCAGCGTATCAGCGTGGGACGTTATGGCCTAAACGGGTGGCGTTGATGGATGATTGGACGGGGTATAGCACTGCCAACAGCTAAGCTACCTGTACGAAAGCATTATCGTTGATAACAACGTAGAAAGTGTGATGCTAATAGCATTCGCTTTCGAAAATGTGATAAGTAATAATTTCATACTGAACTATTTCTTATATAATTATTATCATAATTTGCAAATTACATAACCCACTCAAGGAGAGGTTATGCCCGGACTGATAGGCTACTGGAAGCAACTTCCCACCAAAGATGAATATATTAAAAAACACAATATGAGCAAAATATCCTGCTACAGTTGTGGTCACGAGAAATTCAGCGATGTTGGTTTGATACAGGTATGGGATAATCACAGAAGAATTCTTTGTGCTAAGTGTAAGACTACTCTTTTCAGAGAAGAGGATTAGTTTTTTTGGCATTGGTAACAGCGGCTTCAGCATCCCTTTTCACGCAGCGGGTCGGGCTTTTTTTTCGCATTTGACCCGTCGATTACCGGATGATGACGCAATTTACAAGCGCCTTGTCCGCCTACCGCGAGCACAACGCCATCAGGCTAACTATTAGCCGGCGTAAAAAAACCGGGCGCTAAGGCCCGGTTTGTACGGCAGTGAAACGAAGATTAATGCGCGGCTTCCGGCTTGTGCTTTTGCGCACTCTGGAAGCCATACGTCAACGCATTTTTCTCTTTATCCAGCGCGACGGTGACCTGTCCGCCATCAACCAGCGATCCAAACAGCAACTCATTGGCCAGCGGTTTTTTCAGGTTATCCTGAATCACACGCGCCATCGGTCGTGCGCCCATCGCCCGGTCATAGCCCTTTTCCGCCAGCCAGTCGCGCGCTTCCTGACTGACTTCCAGAGAGACGCCTTTCTGATCCAACTGAGCCTGCAACTCGACGATAAACTTATCGACAACCTGATGAATCACCTCGCCAGACAGATGATCGAACCAAATAATGTTGTCGAGACGGTTACGGAACTCCGGCGTAAACACTTTCTTGATCTCGCCCATCGCATCGGTACTGTTGTCCTGATGAATAAGACCAATAGATTTACGTTCGGTTTCTCGCACGCCGGCGTTGGTGGTCATCACCAGCACCACGTTGCGGAAATCCGCCTTACGGCCATTGTTATCGGTCAGCGTACCGTTATCCATCACCTGCAGCAGCAGGTTAAAGACATCCGGGTGCGCTTTTTCGATCTCATCCAGCAACAGCACCGCATGAGGATGCTTAATCACCGCATCCGTCAGCAGCCCGCCCTGGTCGAAACCGACGTATCCCGGAGGCGCGCCGATCAAACGGCTCACCGTATGACGCTCCATATATTCGGACATATCGAAGCGCAACAGCTCAATACCCAGCGCTTTTGAAAGCTGTACCGTAACTTCAGTTTTCCCTACGCCAGTTGGCCCGGCGAACAAGAATGAGCCGACAGGTTTATGCTCATGGCCCAGACCGGCACGACTCATCTTAATAGCTTCGGTCAGCGCCTCAATCGCGTTATCCTGGCCGAAGACCAGCATTTTCAGACGATCGCCCAGGTTCTTCAGCGTATCGCGATCGCTCTGCGAGACGCTCTTTTCAGGAATTCGCGCAATTCGCGCCACTACGGACTCAATATCCGCCACGTTGACCGTTTTCTTACGTTTGCTCACCGGCATCAGACGCGCCCGAGCGCCCGCTTCGTCAATCACGTCAATGGCTTTATCCGGCAGATGGCGGTCATTGATATATTTTACCGCCAACTCGACCGCCGCACGCACCGCTTTCGCGGTATAACGCACGTCGTGGTGCGCTTCGTACTTAGGTTTCAAGCCGTTGATAATTTGCACCGTCTCTTCCACCGAAGGCTCGGTAATATCAATTTTCTGGAAACGGCGCGCTAATGCACGGTCTTTCTCAAAAATATTGCTGAATTCCTGATAGGTCGTTGAGCCGATCACCCGGATCTTGCCGCTGGAAAGCAGCGGTTTAATCAGATTTGCCGCATCCACCTGTCCGCCCGACGCCGCGCCAGCGCCGATAATGGTATGGATTTCATCGATAAACAGGATGCTGTTGGTATCCTGCTCAAGCTGTTTCAGCAACGCCTTAAACCGTTTTTCAAAATCGCCGCGGTATTTGGTGCCCGCCAGCAGCGAACCGATATCCAGAGAGTAAATGGTGCAATCGGCCATCACTTCCGGCACATCGCCCTGCACGATACGCCAGGCCAGCCCTTCGGCAATCGCCGTTTTGCCGACGCCGGATTCCCCTACCAGCAACGGGTTATTTTTACGGCGACGACACAAGACCTGGATCGCGCGTTCAAGTTCTTTTTCACGACCAATCAGCGGATCGATGCCGCCCACGCGAGCAAGTTGGTTAAGATTCGTCGTGAAGTTTTCCATACGTTCCTCCCCGCCAGCTTGTTCGTCGCCAGTTGGCTGATTGCCGAGATCGGAAGATTGGCTCGGTTCGTCTTTTCGCGTCCCGTGAGAAATAAAGTTCACGATATCCAGACGGCTCACTTCATGCTTGCGCAGCAGATAAGCCGCCTGTGATTCCTGTTCGCTAAAGATAGCCACCAGCACATTCGCGCCAGTCACTTCACTACGCCCGGAAGACTGAACATGGAAGACGGCACGCTGCAGGACACGCTGGAAACTTAACGTCGGCTGCGTATCACGCTCTTCTTCACTGGCAGGCAGTACGGGTGTGGTTTGTTCAATGAAGGCTTCGAGTTCCTGACGGAGCGCCACCAGATCCACGGAGCATGCTTCCAGCGCTTCGCGAGCCGATGGGTTGCTGAGCAGCGCCAGCAACAGATGCTCGACGGTCATAAACTCATGACGGTGCTCGCGCGCTCTGGCGAAAGCCATGTTTAAACTGAGTTCCAGTTCTTGATTGAGCATAGGCACCTCCCCCAATTTTTATACCTGCATTCAGGCTTTTTCCAGCGTACACAGCAACGGATGCTCGTTCTCCCTTGCATACTTGTTCACCATCGCCACTTTGGTTTCCGCCACCTCGGCGGTGAACACGCCGCAGATGGCTTTGCCTTGATAGTGAACTGCAAGCATCAATTGCGTTGCACGTTCTACATCATAAGAAAAGAATTTTTGTAACACGTCAATAACAAACTCCATCGGAGTGTAATCATCATTGACTAATATCACTTTATACATAGATGGCGGTTTTAGCGCGTCGCGCACGCTATCTTCCACCAACTGGTCAAAATCCAGCCAATCGTTCGTCTTACCCATTGTCAGTCGTCATTATCGGTTACGGTTGTCGGCAGGAAAATCTGCCGCTGACCAGAGTCTATGCACACAATCAATCTACCTCAATTGATAGATAACTAACATCTATCAGTACCATCCGCGACATCTGTCACATTCCCGGCAATAGCGTTAACTGCTTCAAATTTTTGATTCATTTTTACCCGATCCCCCCTGCCTGATGCTTGACGCCTCGCCTGATTTCTCTAAATTGTAATGTCGAGAGTTGGTGAGGTTTTGAACAGCCCCCACTCCGTCACCGGTTCATTCCATCTTACTTATATAAGATTTACGAAGGATGTCGAAGCATGGAAACGGGTACTGTAAAGTGGTTCAACAATGCCAAAGGGTTTGGTTTCATCTGCCCTGAAGGCGGCGGCGAGGATATTTTCGCCCATTATTCCACCATTCAAATGGATGGTTACAGAACGCTTAAAGCCGGACAGTCTGTCCGGTTTGATGTCCACCAGGGGCCAAAAGGCAATCACGCCAGCGTCATCGTGCCCATCGAAGCAGAGGCCGTTGCATAGCTCCTCTGTCTCATTGTGTACATCCAGGAGGCAAAATGCCAGCCCGATCGGCTGGCATTTTTATTTAACGCCAGTGCCTGATAGCGACACTGTTGCATCTTATCAGGCCGACAAATGACGTCAGCGAGATTACTCCCTTGCCAGCGCATCCACCGGGTCCAGTCGCGCCGCGTTTCTCGCCGGTAGCCAGCCAAACAGTATCCCGGTAAACGTCGAACATAAAAACGCGCTCGCCAGCGCAGTCAGTGAAAAACCGATCTCCCAGCCGGGCAGGAAAAGCTGTAGCATAAATGCGATGAACATCGACAAGCTAATCCCCAGCGCTCCCCCAACCAGGCAAACCAGCACCGCTTCAATAAGAAACTGCTGTAGCACATCGCTGGCGCGCGCGCCTACCGCCATACGGATGCCGATTTCACGCGTTCGCTCGGTGACGGAAACCAGCATAATATTCATAACGCCAATGCCGCCGACAACCAGCGAAATGACGGCCACCAGCGTCAGAAATAACTGAAGAGTATAGGTGGTTTTTTCAGCCGTTTTCAGAACGCTGTCCATATTCCAGGTGAAGAAGTCTTTTTTACCGTGGCGTAAGGTGAGCAGGCGGGTAAGCTGCTGTTCAGCCTGATCGCTATCAACGCCATCTTTCACACGAACGGTGATCGAGTTAAGCCATGACTGACCCATTATGCGATCTGACATCGTGCTATAGGGCAACCAAACTTGCAACAGATTGCTATTGCCGTACATGGACGGTTTCTCTTCCGCCACGCCAATAACAATAACCGGCATATTACCCACCAGCACCACTTCCCCTACGACATTCGCTTTATTTGGAAATAGCTGGCGTCGCGTGTTGGCATCCAGCACCACCACCTGCGCGCGATCCTGTTGCTGTACAGAATTGAAGGTGTTCCCCTCCCTAAAGGACATGCCGTAAACGTTAAAATAATCGCCACTGACGCCATTAGCATTTACGGCAATATCAATATTGCCATAGCGAAGACGTAAGCTCTTTGAAACGCTGGGCGTCGCAGAGTTAACCCACGGCTGTTTCTGAATAGCGACCAGATCGTCATATTTCAGCGCCTGTCGATACTGCGGATTGTCGTCGCCAAAATCTTTGCCTGGATGAATATCAATCGTGTTAGTGCCCATAGCGCGGATATCCGCCAGTACCATCTGTTTTGCGGCGTCGCCGACCACCACAATCGACACCACCGACGCAATACCGATAATAATTCCCAGCATGGTCAGTAAAGTACGCATTTTGTTAGCGGCCATCGCTAACCACGCCATTGACAGCGCTTCGCGAAAGCTGCTGGCAAATTGCCGCCAGCCGGGAGCCGTATTAACTACGGCAGCGTCAACGCCCTGTTCGCGTTTCTTTTCCTCCGCGGGCGGATTATGGACAATCTTGCCATCGTGAATTTCAATAATCCGCTCCGCCTGGGCGGCAATCAGCGGATCGTGCGTCACAATGATCACCGTATGTCCGCGATCGCGCAGTTGGCGCAAAATCGCCATCACCTCTTCGCCGGAATGGCTATCCAGCGCGCCGGTCGGCTCATCTGCCAGAATCACCTGTCCACCGTTCATCAGAGCGCGGGCAATACTGACACGCTGCTGCTGTCCGCCAGAAAGCTGTGAAGGTGGGTAATCGACGCGATCGCTTAATCCCAGCCGCAAAAGCAACTCTCTGGCGCGCGCCTGGCGTTTTTTGCGTTCAATGCCGGCGTAGACGGCGGGGATTTCAACATTTTGCGCTGCCGTTAAATGCGACAACAGATGGTAGCGCTGAAAGATAAAGCCAAAATGCTCACGCCGCAGCTGCGCCAGCGCGTCCGGGTCCAGCGTCGAGACGTCCCGCCCCGCCACCCGATAAGTGCCGCTGGTCGGTTTATCCAGGCACCCGAGGATATTCATCAGCGTTGATTTTCCAGAACCGGAAACGCCGACGATCGCCACCATCTCCCCGGCGTGGATTTGCAGGGAGATATCTTTCAACACCGCCACCTGCTCTTCTCCGGAGGGGTAGCTACGACTCACATTGCACAGTTCAAGCAATGCCGTCATGGCGTCGCTCCTGGCCTGCTTTCGCCGATGATCACCTCATCGCCCGCTTCCAGACCTTTAACCACTTCCACGTCTGTATCGTTACGCTCGCCAATGACCACTTCGCGCTCACGTTTTTCGCCGTTACGCAACAGCGCCACTTTATAACGATTGCCGCCCACCGGTTCGCCAAGCGCGGCGAGAGGAATAATCAGCACATTTTTGACATCCATGAGTTGAATATAAACCTGTGCGGTCATATCAAGACGCAAGATTCTTTTGGGATTCGGCACTTCAAACCGGGCGTAATAAAAAATAGCGTCGTTGATCTTTTCCGGCGTCGGCAGAATATCTTTTAAAACGCCTTCATAGCGCGTTTGCGGATCGCCTGCAATGGTGAACCATGCTTTCTGCCCCGCCCGAAGATGGATCACGTCCGCTTCCGAGACCTGCGCTTTTACCAGCATGGTGCTCATATCCGCCAGCGTCAGAATATTGGGCGCCTGCTGAGCTGCAATCACCGTTTGTCCTTGCAGGGTAGTGATTTGCGTCACTTCCCCCGCCATGGGGGCGACAATACGGGTATATTCCAGGTTGGTTTTCGCGGTGTCCAACGAGGCCCGATTACGTTTGATCTGGGCATCTATGGTGCCAATACGCGCCTGTTTAACCGCCATCTCCGTCGCCGCGGTATCCAGATCCTGTTGCGATACCGCCTGAGTCTTAGCTAACTGCTGCTGGCGCGCCAGCGTAACCCGCGCCAGCTTTAACTCAGCGGCTGCCTGCTGACGCTCCGCGTTCAGCTCCATCAAGGTGGCCTCGACCTCTTTTATCTGGTTCTCCGCCTGATCTGGGTCAATCACGCCGAGTAGCTGATCTTTTTTAACGTTATCGCCAATGGAGACCAGCAGCGTTTTCAACTGGCCGCTCACCTGCGCGCCGACATCCACTTTACGCAACGCGTCCAGTTTTCCAGTCGCCAGTACACTCTGTTCAAGATCGCCTGGCCGCACGATTAATGTCTGATAAGTTGGCAGCGGGGCATTTATCATTCGCCAGCCAGCCATCCCCCCCACTAAAAGAATTAAAATAATGACCAGATAACGCTTTTTAAATTTCTTTCCCTTAGCACGCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP041973|2039917:2047230|2046111_2047230_-|WP_001201751.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMINAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP >NZ_CP041973|2039917:2047230|2044168_2046115_-|WP_000125875.1|DBSCAN-SWA MTALLELCNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAEEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQPWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNSVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQVWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP041973|2039917:2047230|2043173_2043494_-|WP_000520789.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >NZ_CP041973|2039917:2047230|2039917_2040295_+|WP_001539594.1|integrase|DBSCAN-SWA MHKYLSVVKKHRVPLSDAAVDLLKDLPRLKDNNHVFPAPRAETLSDMSLLAVLKRMGYIDLTQHGFRSTFREWAGEATDYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTGYSTANS >NZ_CP041973|2039917:2047230|2040456_2040654_+|WP_001117984.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >NZ_CP041973|2039917:2047230|2043817_2044039_+|WP_000447499.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA >NZ_CP041973|2039917:2047230|2040866_2043143_-|WP_000934064.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYGFQSAQKHKPEAAH |
7 | Ralstonia_phage(16.67%) | integrase,protease | attL 2034714:2034728|attR 2045966:2045980 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|