Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP036358 | Agrobacterium sp. 33MFTa1.1 chromosome circular, complete sequence | 0 crisprs | WYL,csa3,cas3,DEDDh | 0 | 0 | 5 | 0 |
CP036359 | Agrobacterium sp. 33MFTa1.1 chromosome linear, complete sequence | 0 crisprs | csa3,DEDDh | 0 | 0 | 1 | 0 |
CP036360 | Agrobacterium sp. 33MFTa1.1 plasmid p_JBx_073812, complete sequence | 0 crisprs | csa3 | 0 | 0 | 44 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
603253 : 610628
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP036358|603253:610628|DBSCAN-SWA CATGCGATTTCCCTTTTCCCTGCCGCGCAAGCGCCCGGCGGATGGCAACGCCATGCCCGAAAACCGGAAAATGGCTGGCGGCTTCATGGCCGTGGCCATGCAGGGCGGGCAGGCCTTCTGGTCCGGCCGGTCCTATGCCGCGCTTGCCCGTGAGGGTTTCATGAAGAATCCCGTAGCCCACCGGGCTGCCCGCATGGTGGCGGAAGCGGCCGCGTCGGTGAACTGGCTGCTTTACGACGGCGATGACGAGATAGGCGATCATCCGCTGCTGGCGCTTCTGGCGAGGCCGGGCGCGCACATGGGCGGGCCGGATTTCTTCGAGGCGCTTTATGGCCACCTCATGCTGGCCGGAAACGCCTATGTCGAGCCGCTTGTGATCGGCGGGCGGCTGCGTGAACTGCATCTTCTCCGGCCTGACCGGCTCAGCATTGTCGAAGGGCCGGATGGCTGGCCGGCGGCTTATGATTACCGCGCCGAAGGCCGGGCCACGCGGCGCATCGCCGCCGAGCGGGACGGGCTGGGGCTGCTGCATCTCAAACTGTTCCATCCGCTGGATGACCGGGTGGGGTTCGCGCCGCTCGCCTCCGCCGGCGCGGCGCTTGATCTCCACAACGCGGCCAGCCAGTGGAACAAGCGCCTGCTCGATAATTCCGCCCGGCCTTCCGGCGCGCTGGTCTATCAGCCGCGTGAAGGCGGCAATCTTTCCGCCGAACAATATGAACGGCTGAAGCGCGAGCTGGAGGAGGGCTATCAGGGCGCCATGAATGCCGGCCGGCCGCTGCTTCTGGAGGGCGGGCTGGACTGGAAGGCGATGGGACTATCGCCGCGCGACATGGACTTTCTGGAAGCCCGCAACGGCGCGGCCCGCGATATCGCGCTCTCACTCGGAGTACCGCCAATGCTGATCGGCATTCCCGGTGACAATACCTATGCGAATTACCAGGAAGCGAACCGCGCCTTCTATCGCCTCACCGTGCTGCCGCTCGTCAACCGCACGGCGGCGAGGCTTTGCGGCTGGCTTGCGCCGATCTTCGGCGTGGGCCTGCGACTGGAAGCCGATCTCGATCGGATTGCCGGGCTCGCGGGCGAGCGGGACGCGCTCTGGACACGCATCGGCGCAGCCTCGTTCCTGAGCGACGAGGAAAAACGCGAGGCCGTCGGTTATTGAAAGGACGGCGTTTCTTTCGCTTTGACGGTTCAAGCATTGTCTGAAGAAGCGGATTCTCTTTGTGAGCTCATCCGTCCAAGCGATTCCGAAGATTCGTAAAATCACCCGGTGCAGGCGCGCAGGAAGCGGCTGCGTGCGACGCGGCGCGTCCGCCCGATGACAACATCCTAGAACGGAATGATTACCCATGTCTGAATTCGCCAATGAGGCCGGCATCTGGGCCGCCCGCATCACCGGCGCCGTGGCGGGCGCGGGTGTATCGCTCGTTTATCTCCTGCCGAAAAGCAAACGCGAGGCCGCGAGCCGTTTTATCACCGGCGTCTCCTGCGGCATGATCTTCGGCGGGCCTATCGGCCTGTGGATCGTGCAGCAGCTTGATATTGCCGGTGCGCTCTCCGGTCGCGAAATCATGGTGGCGGGTTCCGCCGCCGCCAGCATGGGGGCCTGGTGGGGGTTGGGCGTGCTGGTGCGCATCGCCGACCGTTACAGCGCCCGCCCGCGCGCCTGATATTCCCCTCACATCGCAGGAGTTACCCATGCACGCCTATCGCGGGCCGCGTCCCGCCACGCGTAAATTCGCCAATCTGGAACTGCGCGGCATTGCCGGCGACGGCACTTTTTCCGGTTACGCCAGCGTCTTTGGCGAGATCGATCTCGGCCGCGATGTGATCGAGCGCGGCGCCTTCCGCCGCTCCATCGAGGAGCGCGGTGCGTCCGGTATCCGCATGCTCTACCAGCACGATCCGGCCCAGCCTATCGGCGTCTGGCGCACCATCCGCGAGGATGAGCGTGGCCTTTATGTCGAGGGTATCCTCACGCCCGGCGTCGCCCGGTCGAGGGAAGTCCATTCGCTGATGAAGACCGGCGCTCTGGACGGGCTGTCCATCGGTTTCCGCACGGTTCGTTCGAGTAGGGAAGGGCGTTCCGGCAAGGCGGCCCGCGCCGGCGTCCGGCGCATTCTGGAGGCCGATCTCTGGGAGATCTCCGTCGTGACTTTTCCGATGCTTCCCTCGGCTCGCGTCTCCGATGTCAAGCATGCCCGCTTCTTCCGCGACCGCGACACCGAACTGGTGCGCAGCATGCGCCGTGCCGCCCGTTCGCTTTTCGATACCGCCTTCAAACGCTGATTTCCGGACAAACCGCAACAAGGAAAACCACATGACAGAGCAGACAACGACACCGGCCCCGATGACGGTTGCGCCGCAGTTGAAGGCCGTCCCCGATACGATGACGGCCGCCTTCGACGAGTTCATGGAGGCCTTCGAAGCCTTCCGTGAGACCAACGATCAGCGGCTTGCCGATATCGAACGCAAGATGGGGGCTGATGTCGTGACCCGCGACAAGCTCGACCGCATCGACAGGGCGCTCGACGACAACCGCAGGCTCATGGACGATCTGGCTTTGAAGAAGGCGCGGCCCGCGCTCGGCCGGAAGGACACGCTGTCGCAGGATGCCGAAGAGCACAAGACCGCTTTCGAGGCCTATATCCGCCGGGGCGAGGAAGCCGCGCTGCGCGATCTCGAGGCAAAGGCCTTTGCCGGCTCGGCCGGGGCGGACGGCGGCTTTCTTCTTCCGACCGAGACGGATGGCGAGATCGGCCGGCGCATGACGGCGATTTCGCCTGTCCGGGCGCTCGCCACGGTGCGACAGGTTTCCGCTGCCGTGCTGAAAAAACCCTTCGCCGCCGGCGGGCTCGCCACCGGCTGGGTTTCGGAAACGGCGGCGCGCCCGGAAACGGCGACGCCGAAACTGTCGGAGCTTTCTTTCCCGACCATGGAGCTTTATGCCATGCCTGCCGCCACCCAGAGCCTGCTGGACGACGCGGCGGTCGATATCGAGGCGTGGATCGCATCCGAGGTGGACATCGCATTCGCCGAACAGGAGGCGGCGGCTTTCGTTGGCGGCGACGGCATCAACAAGCCCAGGGGGTTCCTCGCCTATACGACCGTTGCCAATAACGACTGGAGCTGGGGCAATATCGGTTATGTCGCGACCGGCGTATCGGCGGGCTTTTCCTCCGCCGGTCCCATGGACGTGCTGCTCGACGCCATCTATGCGTTGAAAGCCGGCCATCGCCAGAACGGCACCTTCCTGATGAACCGCAAGACGCAGGGTGCGCTTCGACGCTTCAAGGACACGACCGGCGCCTATCTTTGGCATCCGCCGGCTGCCGTCGGTCAGCCCGCCTCGCTGATGGGCTTTCCGGTGACGGAAGCCGAGGACATGCCCAATGTGGCGGCCAACAGCTTCGCCATCGCCTTCGGGGATTTCCGCGCCGGTTACCTCGTTGTTGACCGAACCGGCGTCCGCATCCTGCGCGATCCCTATTCGGCCAAACCCTATGTGCTTTTCTACACCACCAAACGCGTGGGCGGCGGCGTGCAGAATTTCGAGGCGATCAAGCTGGTGAAGTTCGGGGTGAATTGAACCTCGACGGTGCCGCGCGTTTCTTCTCCCCGCCGATGCCCGACAGGGCAGATGAGGGGGCAAGCTCTCGGGTTACCCCGACCCTTGCCCCCTCAACCCGCTGCCGCGACCTTCTCCCGGCGGGAAGAAGGCGAAAGACGCAGCCACTCGCCCTAATCTCCGGAGACCCCATGACCTATGCCCTCATCCATCCGCCGCAGGCGGAGCCGTTGACCCTTGCCGACGTCAAGGCGCATCTGCGTCTCGACAGCGGTGACGAGGACACCTTGCTTGCCGCGTTGATACGCGCGGCCCGCGAGCATCTGGAACGCACCACCGGGCTGTGCCTCCTGCGCCAGACCTGGCGGCTTTATCTTGATCGGTGGCCCGAAACGGGCGTGATTCTGATTGGCAAGACGCCGGTGCAAGCCATCGAAACGATTCTGGTTTTCGACGGGCAGGGGCGCGTGGCAGACATCACCGCCACCGAAAAGCTGCTCGACGGCGCGGCGCGGCCGGCACGGCTGTGGCTGCGCGAACCGCCGGCCCCGGAACGGGAGCTGAATGGCATCGAGATCGATTTCACCGCCGGTCATGGCGAGGCCGCGACGGATGTGCCCGATACGCTGAAACGCGCCATGCTGATGCATGTGGCGCAGATGTTCGCCTTTCGCGGCGCCGTCGCGCCGGAAAACCAGCCGGCCGCCGTTCCGGCTGGTTATGAGCGGCTGGTGTCTCCTTTCTGCCGTCTGGGGCTTTAGGCCATGAACCTCGTCTTTCTTGATCCTGGCAGGCTGACGGCGCGACTGGAACTGGAAGTGCGGTCGGAAACGCCTGATGGACAGGGCGGCGCCGCCGAAAGCTGGAAGGCCGTGCGCGCGTTGTGGGGCGCCATCGAGCCGGTTTCCGAAGCCTCGCACGAGCGGGCTTCGGCCGAGGGGGCGACGATTACCCATCGTGTCTGGCTCGCCTGGCGCAACGACATCGACATCGGCACCGGGATGCGCTTTCGCAAGGGGTCGCGCATCCTCAACATCCGAACGGCGATGGATCCCGACGAAACCCGCCGCTTCATCGTCTGCCGTTGCGAAGAGGAAAACCGATGAGCGCCGCGAACCCGCTTCTTCAGGCGATCGTCGCAAAGCTCGCTGGCGATGCCGAATTGGCAGACCTCAACCCCGGCGGCATCGTCGACCGGCTTCTGACGCGTGGCCTTTTGCCATGTATTGTCTTCGACGAGGTCGAGACCCGCGATTATTCGACGGCGACGGAAAGCGCGGAGGAGCATTTTCTGACGCTGCAAATCTGGGGTGATGCCAATCGCCGCAGATCCACGGGTGAGATCGCCGCAAGGGTAAAAGCGCTGCTTGATGACGCCGCGCTGCCGCTCGTCGGCTTTTCGCTCGTCAACATGCACATGCTGTCCAGCCGCTCCCGCCGTGAGGCGAAGTCACGGAACTTCGTCCTTGAAATGCGGTTCAGGGCGGTAACGGAATGAAGGGCCCTACGCTTTCTGGCCGGCCACGGTTTTCCACAGCAGGATCAGCAGCAGCAGCGAAAGGGTGATCAGGATGGAAGCGATGCCAAGCATCGCCGCCACACCGCCGCGATCGAGCGCCATGCTGAAAATGACAGGTGCAAAGGCGATGGCGAGATTTTGCGGAAGCGAAATCCGTGCGGCCTGAAGGCCGTATTGCTGCGGAGAAAACACTGCAAGCGGCAGAACGGCGCGGCTGACCGTCAGCACGCCCGCGCCAAAGCCGAACAGGGCGATGAAGCCGGCAAAGGCTGGTATGGCCGGAGAAAAGACGAGCAGGACCAGAAAGGATGTCAGCAGAAGACAGAGGCCGATCAGGCAGGTGATGAAGGGATTGCCGCGTTTTCCCAGCAGGAAATCTAGCCCGCGTGCGGTAATCGCCAGCACGCTGCGCACCGATGCCAGCTGGATTGCAAGCGCTTGCGATGCGCCTGAATGCACGAGAAGCAAAGGCAGAAGCGGCGAGAAGCCAAAGGCGGTGAAGGCGCTGAGCGTGGTCATTGCCGCCAGAAGCAGGAAGGTGCGCCGCGTATCCACCGGCGAGGCCGGGATTTCGCCAGGATGCGGGCGCTGGCCTGGATTGCCGCCCTTTCTCTTCGGAAGAACGAAGAGGTAAAGCGGCAGAAGCAGGAACAGCTGCAGACAGCCGTAGAGGATGAGGGTGCCGCGCCAGCCGAGATGCTGCTCCGCAAAGGCTGTAACGGGCAGGAACACGGCCGCCGAAAGGCCGGTAAAGACCATCAGCAGCGTCAGGGATCGACCACTTTCCGCCCCGACGCGCTCCACCACGGCGGTGTGTGCCGCCGTTGTCAGCCCGCAGGCGCCGGCAAAGCCCATCACCGCCCAGCCCAGCAGATAGCTTGTCACGCCGCCTGCAAAAGCAAGCAGCGCAAAGCCGCAGGCAAAAAGCAGGGTGCCGCTGACCAGCACCGGCGCTGCGCCGTGGCGCACAAGCATCCGCCCGAGCAGCGGCCCGCAAAGCGCACTGATGGTCATCATCACGGTCAGGCCCGCAAAAACGACTTCGTTGGCGATGGCGAGTTCGCTGCCCATTCGTGGACCGAGAATGGCCAGCATGTCGAAACCGGAACCCCAGCTGACGACCTGCCCAAGCGCCAGCACGCCGATAAGGCGTGTACGGAACGTCGGGGAAGCAGCATCGGACATGGCAAGATCGCACAATGCGAGGGCAAGGAACATTTTTGGTCGCAGGCCCGCAACACCCTGCAACCGCAATCGAAACGGAAGGAAACAGGATGGTGGCGCAGAAGGGCAAGGATTTTCTGCTGAAATTCAACAATGCCGGAACATATGTCACTGTTGCCGGGCTCAGAACGCGGCGGCTGGCCTTTAACGCCCAGGCGGTGGACGTTACCGATGGGGAAAGCGTCGGGCGCTGGCGCGAGTTGCTGGCGGGTGCCGGCGTGCAAAGGGCCGCGCTGACGGCGTCAGGCATATTCAAGGATGCGGCGAGCGATGCGCTCGTGCGGGGCGCGTTTTTCGCGGGCACCATTCCCGGCTGGCAGATCGTCATACCCGATTTCGGCACGATTATCGGGCCGTTCCAGATCGTGGCGCTGGAGTATTCCGGCCGTCACGATGGCGAAGTGCAATTTGAGATCGCGCTGGAATCGGCCGGTCTTTTGACCTTTGGAGCGCTGTGATGCCGCAAGGGTTGCGTTATGGGCGGGCGAACCGCCATCGCGGCGAGATCGAAGCGCTGTTCGACGGCGAGAGGCGCATATTGTGCCTGACGCTCGGCGCGCTTGCCGAGCTCGAAACCGCCTTCGAAGTCGATGACCTGACAGCGCTTGCCGAACGTTTTGCGAGCGGACGCATGAAGGCCACCGACATGATCCGGGTGATCGGCGCGGGGTTGCGCGGCGCCGGCAATGTTTTTTCGGATGAGGATGTTGCCAGCGCCACGGTGGAGGGCGGCATTGCCGGCCATGCCGCCATCGTCGCCGAACTTCTGACCGCCACCTTCGGCGGTTTGAAAGGGGAGACGCCGCGGGACCCCTGAATGCCGCAGCAGGCGATGCGACGCCGCGCCCCTTTCCCTGGGATGCGGTTCTCCATGCGGGTTTCTGCCTGCTGCGGCTTTCTTCCGAAGTTTTCTGGCGGCTGACGCCCAGAGAGTTTTTCGCCATGACGGGCGGCGTGCGCTCCGGTTCACACGGCCCGGATCGACAGGAGATGGAGGCGATGATGCGGCGTTTTCCCGACAGGGACGCATCCGATCATCGCTTCGTGAAAATCTTTTAA
Protein sequences of DBSCAN-SWA_1 >CP036358|603253:610628|610382_610628_+|QBJ14434.1|tail|DBSCAN-SWA MNAAAGDATPRPFPWDAVLHAGFCLLRLSSEVFWRLTPREFFAMTGGVRSGSHGPDRQEMEAMMRRFPDRDASDHRFVKIF >CP036358|603253:610628|607017_607587_+|QBJ12478.1|DBSCAN-SWA MTYALIHPPQAEPLTLADVKAHLRLDSGDEDTLLAALIRAAREHLERTTGLCLLRQTWRLYLDRWPETGVILIGKTPVQAIETILVFDGQGRVADITATEKLLDGAARPARLWLREPPAPERELNGIEIDFTAGHGEAATDVPDTLKRAMLMHVAQMFAFRGAVAPENQPAAVPAGYERLVSPFCRLGL >CP036358|603253:610628|608330_609440_-|QBJ14433.1|DBSCAN-SWA MLAILGPRMGSELAIANEVVFAGLTVMMTISALCGPLLGRMLVRHGAAPVLVSGTLLFACGFALLAFAGGVTSYLLGWAVMGFAGACGLTTAAHTAVVERVGAESGRSLTLLMVFTGLSAAVFLPVTAFAEQHLGWRGTLILYGCLQLFLLLPLYLFVLPKRKGGNPGQRPHPGEIPASPVDTRRTFLLLAAMTTLSAFTAFGFSPLLPLLLVHSGASQALAIQLASVRSVLAITARGLDFLLGKRGNPFITCLIGLCLLLTSFLVLLVFSPAIPAFAGFIALFGFGAGVLTVSRAVLPLAVFSPQQYGLQAARISLPQNLAIAFAPVIFSMALDRGGVAAMLGIASILITLSLLLLILLWKTVAGQKA >CP036358|603253:610628|604956_605547_+|QBJ12476.1|head,protease|DBSCAN-SWA MHAYRGPRPATRKFANLELRGIAGDGTFSGYASVFGEIDLGRDVIERGAFRRSIEERGASGIRMLYQHDPAQPIGVWRTIREDERGLYVEGILTPGVARSREVHSLMKTGALDGLSIGFRTVRSSREGRSGKAARAGVRRILEADLWEISVVTFPMLPSARVSDVKHARFFRDRDTELVRSMRRAARSLFDTAFKR >CP036358|603253:610628|605578_606847_+|QBJ12477.1|capsid|DBSCAN-SWA MTEQTTTPAPMTVAPQLKAVPDTMTAAFDEFMEAFEAFRETNDQRLADIERKMGADVVTRDKLDRIDRALDDNRRLMDDLALKKARPALGRKDTLSQDAEEHKTAFEAYIRRGEEAALRDLEAKAFAGSAGADGGFLLPTETDGEIGRRMTAISPVRALATVRQVSAAVLKKPFAAGGLATGWVSETAARPETATPKLSELSFPTMELYAMPAATQSLLDDAAVDIEAWIASEVDIAFAEQEAAAFVGGDGINKPRGFLAYTTVANNDWSWGNIGYVATGVSAGFSSAGPMDVLLDAIYALKAGHRQNGTFLMNRKTQGALRRFKDTTGAYLWHPPAAVGQPASLMGFPVTEAEDMPNVAANSFAIAFGDFRAGYLVVDRTGVRILRDPYSAKPYVLFYTTKRVGGGVQNFEAIKLVKFGVN >CP036358|603253:610628|603253_604420_+|QBJ12474.1|portal|DBSCAN-SWA MRFPFSLPRKRPADGNAMPENRKMAGGFMAVAMQGGQAFWSGRSYAALAREGFMKNPVAHRAARMVAEAAASVNWLLYDGDDEIGDHPLLALLARPGAHMGGPDFFEALYGHLMLAGNAYVEPLVIGGRLRELHLLRPDRLSIVEGPDGWPAAYDYRAEGRATRRIAAERDGLGLLHLKLFHPLDDRVGFAPLASAGAALDLHNAASQWNKRLLDNSARPSGALVYQPREGGNLSAEQYERLKRELEEGYQGAMNAGRPLLLEGGLDWKAMGLSPRDMDFLEARNGAARDIALSLGVPPMLIGIPGDNTYANYQEANRAFYRLTVLPLVNRTAARLCGWLAPIFGVGLRLEADLDRIAGLAGERDALWTRIGAASFLSDEEKREAVGY >CP036358|603253:610628|607590_607932_+|QBJ12479.1|head,tail|DBSCAN-SWA MNLVFLDPGRLTARLELEVRSETPDGQGGAAESWKAVRALWGAIEPVSEASHERASAEGATITHRVWLAWRNDIDIGTGMRFRKGSRILNIRTAMDPDETRRFIVCRCEEENR >CP036358|603253:610628|609619_610027_+|QBJ12481.1|tail|DBSCAN-SWA MVAQKGKDFLLKFNNAGTYVTVAGLRTRRLAFNAQAVDVTDGESVGRWRELLAGAGVQRAALTASGIFKDAASDALVRGAFFAGTIPGWQIVIPDFGTIIGPFQIVALEYSGRHDGEVQFEIALESAGLLTFGAL >CP036358|603253:610628|604607_604928_+|QBJ12475.1|DBSCAN-SWA MSEFANEAGIWAARITGAVAGAGVSLVYLLPKSKREAASRFITGVSCGMIFGGPIGLWIVQQLDIAGALSGREIMVAGSAAASMGAWWGLGVLVRIADRYSARPRA >CP036358|603253:610628|607928_608324_+|QBJ12480.1|DBSCAN-SWA MSAANPLLQAIVAKLAGDAELADLNPGGIVDRLLTRGLLPCIVFDEVETRDYSTATESAEEHFLTLQIWGDANRRRSTGEIAARVKALLDDAALPLVGFSLVNMHMLSSRSRREAKSRNFVLEMRFRAVTE >CP036358|603253:610628|610026_610386_+|QBJ12482.1|DBSCAN-SWA MPQGLRYGRANRHRGEIEALFDGERRILCLTLGALAELETAFEVDDLTALAERFASGRMKATDMIRVIGAGLRGAGNVFSDEDVASATVEGGIAGHAAIVAELLTATFGGLKGETPRDP |
11 | Geobacillus_phage(33.33%) | portal,protease,capsid,head,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1049176 : 1103571
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP036358|1049176:1103571|DBSCAN-SWA CATGAAAGATCATCATTTGGCCCTTCCGGAGTTGTCAGAACCGTTCAGCATCGGCTCAGTTACGATCCGTAATCGGGCTGTGCTGGCGCCCATGTCGGGCGTGACGGATCTGCCTTTCCGACAGCTCGCCTGGCGTTATGGAGCAGGCCTCGTGGTGACGGAAATGGTGGCAAGCCGGGAGCTTGTTGCCAATCGCGGGGAATCCTGGGCGCGGCTGAAAAATGTAGGCATGGTTCCGCACATGGTGCAGCTTGCGGGGCGCGAGGCGCATTTCATGGCCGAGGCGGCGAAGATCGCTGCTGGCAATGGCGCCGGTATCATCGATATCAATATGGGCTGTCCTGCCAAGAAGGTGACGGGCGGTTATTCCGGCTCGGCGCTGATGCGCGATCCCGATCACGCGCTTTCGCTGATCGAGGCGACTGTCAATGCGGTCGATGTGCCGGTGACCTTGAAGATGCGGCTCGGCTGGGATGAAAACAGCATTAACGCACCTGATATAGCTCGGCGCGCCGAAGCAGCAGGCGTTCGCCTTATTACCATTCACGGTCGCACCCGCATGCAGTTTTACGAGGGGCGGGCGGATTGGGATGCCATCCGCGCCGTGCGTGAGGTGATTTCCGTGCCGCTGATCGCCAATGGTGACATCGAGACGGCGGAAGATGCTCGCGAAATCCTGCGGCGGTCGGGTGCGGACGCCGTCATGGTCGGTCGCGGCGCGCAGGGGCAGCCGTGGTTGCCGGCGGTGCTCGCGGGGCATGCAGCGCCGCATCGAGAGGATATCCCCGCCATCGCCGTCGAACATTATGAGATGATGCTGGAATTTTACGGAAGGGAAGCCGGCCTCCGGCATGCCCGCAAACATCTCGGCTGGTATCTCGACCGTTTCGCGCCAGGTATCGCCACGACCGACAAGGCAAAAATCATGACATCGCGCGAGACGGGGGAGGTGGCAGACCTGCTGCGTTCCGCCCTTTGCGAAAATGCGGGCGAAGATGCCGCAAGGAAGGCCGCATGAGCGCTGATAAACACAGCCCTGCAGATGCGAACCCGCTCGCCATGGCGGTTCTGAACGCGGTGCAGAATCCCGTCATTCTGGTGGATGCCGAAGGTTTTATCGCCTTCGCCAACTGGGAAGCCGAAGCCTTTTTCGGCGCCAGCGCCTCCCACCTCGCGCGCTATAGGGTTTCCACTTTCATCCCCTTCGGCAGCCCGCTGCTTGCCCTGATCGATCAGGTGCGGGAGCGCCGCGCGCCGGTCAATGAATATCGCGTTGATCTGAGCTCGCCGCGTCTCGGCCAGGACAAGCTGGTCGATCTTTATGTCGCGCCCGTCGTCAGCGAGCCTGGATCGGTCGTCGTCGTCTTTCAGGAACGGTCTATGGCCGACAAGATCGACCGGCAGCTGACCCATCGCGCTGCTGCCCGTTCAGTCACCGGTCTTGCGTCCATGCTCGCGCATGAGATCAAGAACCCTCTGTCGGGCATTCGCGGCGCGGCCCAGCTTCTGGAGACCTCGGTTGCGGATGAGGACCGGGCGCTGACCCGGCTGATCTGCGACGAGACGGATCGCATCGTTTCGCTGGTCGACCGCATGGAGGTGTTTTCCGACGAGCGGCCGGTGGATCGCGTGCCGGTCAACATCCATTCGGTTCTCGATCATGTAAAGGCGATCGCCAAGGCCGGTTTCGCCCGTAACATCAAGATATCGGAAAATTACGACCCGTCGCTGCCGCCGGTCTACGCAAACCGCGACCAGCTGGTGCAGGTCTTCCTCAACCTGGTGAAGAATGCGGCCGAAGCGGTGGCCAATCAGTCGGATGGCGAGATCGTGCTGACGACCGCCTATCGTCCCGGGATCCGACTATCGGTTGCCGGCAGCCGGGAGCGCATCTCGCTTCCGCTGGAATTCTGCGTGCATGACAATGGCCCGGGCGTACCCGCCGATCTCTTGCCGCATCTTTTCGATCCGTTCATCACCACCAAGACGAATGGTTCGGGCCTCGGTCTAGCGCTTGTCGCCAAGCTGATCGGCGCCCATGGCGGCATTGTCGAATGCGACAGCCAGAACCACCGCACGACTTTCCGCGTATTGATGCCCGTCTCGCCGGAAGTGGCGCTCGACGACAGCACTTTGCCGAACACGACAGGGAATGACCAATGACAGCTACGATCCTCGTCGCCGATGATGATGCCGCAATCCGCACGGTGCTCAACCAGGCGCTGAGCCGCGCCGGTTATGACGTGCGCATCACCTCCAATGCTGCAACGCTCTGGCGCTGGGTTTCGGCAGGTGAGGGCGATCTTGTCGTCACGGACGTCGTCATGCCCGATGAGAACGCCTTCGACCTTCTGCCGCGCATCAAGAAAGCCCGTCCGGACCTTCCCGTTCTCGTCATGAGCGCTCAGAACACCTTCATGACCGCCATCAAGGCCTCGGAAAAGGGCGCCTACGATTACCTGCCCAAGCCCTTCGATCTGACAGAACTGATCGCCATCATCGGCCGCGCACTCTCCGAGCCGAAGCGCAAGCCGGCCAAGCTCGATGACGACATGCAGGACGGTATGCCGCTTGTGGGTCGTTCGGCGGCGATGCAGGAAATCTACCGTGTTCTTGCCCGCCTGATGCAGACCGACCTGACGCTGATGATCACGGGTGAATCCGGTACGGGTAAGGAACTGGTGGCGCGCGCGCTGCATGATTATGGCAAGCGCCGCAACGGTCCCTTCGTTGCCATCAACATGGCGGCCATCCCGCGCGATCTCATCGAATCCGAACTCTTCGGCCATGAAAAGGGTGCCTTTACCGGCGCTCAGAACCGCTCGACCGGCCGTTTCGAACAAGCTGAGGGCGGCACGCTCTTCCTCGATGAAATCGGTGACATGCCGATGGACGCCCAGACGCGCCTGCTGCGCGTATTGCAGCAGGGTGAATATACGACGGTCGGCGGACGCACGCCGATCCGTACAGATGTCCGTATCGTCGCGGCAACCAACAAGGATCTGAAGCAGTCGATCAATCAGGGCCTGTTCCGCGAGGACCTCTATTACCGCCTGAACGTGGTGCCGTTGCGCCTGCCACCCCTGCGTGACCGCGCCGAGGATATTCCCGATCTGGTGCGCCATTTCATCCAGACGGGCGAGAAGGAAGGCCTGGAAGGCAAGCGTTTCGAGACCGAAGCTCTTGAAGTCATGAAGGCCTATGCCTGGCCGGGCAACGTGCGCGAGCTGGAAAACCTGATCCGCCGACTGATGGCGCTTTATCCGCAGGAAGTCATCACCCGCGAGATCATCGAGCAGGAACTGCAATCGGATGTTCCCGACAGCCCGCTCGACAAGATGGCGGTCCGCACCGGTTCACTCACCATCTCGCAGGCTGTCGAGGAAAACATGCGGGACTATTTCGCCAGCTTCGGCGATGGCCTTCCGCCGCCCGGCCTTTATGACCGCGTGCTGCGCGAACTGGAGTATCCGCTGATTCTGGCTGCGTTGACTGCCACGCGCGGCAACCAGATCAAGGCTGCCGATCTTCTCGGTCTGAACCGAAATACCCTGCGCAAGAAAATCCGCGAGCTGGGTGTTTCCGTCTATCGCAGCTCCCGCCCCAGCTGAGAATGTTGACAAGGCCGCATTTGTCGTTGCATTTTCGCCACATTGTGTTGCTTGGAAAACACTAGGAAGGACAGAGATTCGCGGGACACGCGCGAACCTCGCCTGACTTCATGACTGTGGCGCTGGCTGGCCAGCGATTGGGAGTAGTGCATGCGGAAACCTGCCGCCGAAAACGCCGTTGCCGACGAAGCAGCGGGAATGGTGGTCGCCGATCGCAGGATGTCTTTCGCTCTGCCGGGGCTGGTGCTCGCCGGCGTGGCCCTCGTGTGCGCCATCATCACGCTTTTCGTGCTGCTGGGCGTCACGCCAATCGCGCCGACATCGAATGTGGTCATCGCTTCCGTCGTCATCAACTCGATTTTCGTGATCGGCCTGATCTTCCTGATCGGCCGCGAAATCAACCGGCTGCTGAAAGCGCGCAAAAAGGGCAGGGCGGCGGCGCGTCTGCACGTGCGCATTGTCGTCCTTTTCTCCATCGTGGCGATCACGCCCGCAGTGCTCGTCGCCATCTTCGCCAGCCTGACGCTGAATGTCGGTCTCGATCGCTGGTTTTCGCTGCGCACCCAGTCGATCGTCGATTCATCAAGTAATATCGCCCAGGCATACATGATGGAGAATGCCGGTTACCTCCAGGGGCAGACCCTGTCGATGGCGACCGACCTCGACCGCAACCGCGCTCTCTTTTATCTCGACCGCACCGGATTCGTTGATCTTATGACCCGCCAGGCGAAGGGGCGCGGCCTTCTTGGTGCGTTTCTCGTGCAGGAGGATGGCGACGCCGTCGCCCAGGCCGACATCAAGACGGAAAAGCCGCTTCCTGCGATACCGCATGACGCGCTCGAAAAGGCGGCCGCCGGCCAGCCGACGCTGATCCCGCCCGGTATTACCAACCTCGTAGGCGCCATCATCAAGCTCGAAGGCATCAGCGGCACATTCCTTTATACGGTGCGTGCGGTCGATCCCAAGGTGATGGGCGCGATGCGGCTGATGGAAGAGAACCGCGCCGAATACAAGGCCATGGAGGCAGGCCGCGCCCCGCTGCAGATCGCTTTCGCAATTCTCTATCTCGGTTTTGCGTTGATCGTGCTGCTCGCGGCAATCTGGACGGCCATTGCGGTGGCCGACCGTATCGTGCGGCCGATCCGCCTTCTCATCAGCGCCGCAGACAGTGTCGCCACCGGCAACATGCATGTTCTCGTTCCCGTTCGCGCCGTCGATGGCGACGTCGGGCGCCTGTCCCGCACCTTCAACAAGATGGTGTCGGAACTGCGCAGCCAGCAGGAGCAGATCATCGAGGCCAAGGACGACATCGATGATCGCCGCCGTTTCATCGAGGCCGTGCTGTCAGGCGTGACCGCCGCGGTTATCGGCGTTGACGAAAACCGCAGGATAACCATCGTCAATCCGTCGGGCGAAGAGTTCCTCGCACAAACGTCGGAACAGCTCATTGGCGCACAACTCTCCGAGATCGCGCCTGAGATCGAACAGGTCGTCAACGAGGCCAACAGCTGGTCGCGCGGCAACTTCCGCAAGCAGATCAACATTATGCGGCGTGGCAAGGAACGAACGCTGAATGTACAGGTAACGCGCGAAGACGCCCGCGACAGCCGTGACAGCTACGTCATCACGCTCGACGACATCACCGATCTCGTCATCGCGCAACGCTCGACGGCGTGGTCGGATGTCGCGCGGCGTATCGCGCACGAAATCAAGAACCCGCTGACGCCGATCCAGCTTTCCGCCGAGCGCATCAAGCGCCGTTTCGGCAAGCAGATCGACGAAAGCGACCGCGCCGTCTTCGACCAGTGTACGGACACCATCGTCCGGCAGGTCGGCGATATCGGCCGCATGGTGGATGAATTCTCCGCTTTTGCGCGTATGCCGAAGCCAACGAAGGAAAAGTCGGACCTGAGGGCGATCCTGAAGGACGCAGCGTTTCTCCGGGAAATCAGCGCCGCTGACACCAAGTTCACGACGGAGCTGGGTGACACCCCGCTGGAGGGCATGTTCGATGCCCGCATGCTGGGGCAGGCCTTTGGCAACCTCATCAAGAACGCGACGGAAGCCATCGAAGCGGTGGAAGGCGAAAAGCGGCCGGGCAAGATCCTCGTTCGCGCGTCCTTCGACGAGGCCAACTCGCGTTTTGTGGCCGACATCATCGACAATGGCCGCGGTCTGCCGGTGGAAAATCGCCACCGCATTCTCGAACCCTATATGACGATGCGGGATAAGGGCACGGGCCTTGGCCTCGCCATCGTCAAGAAGATCATCGAAGAGCATGGCGGGTATCTGGAACTCCATGATGCCCCGCCGGAGTTCGATCATGGCCACGGCGCCATGATCAGAGTGCTGCTGCCCTATATCGAGGCGGTCGGCGGCGAAAATAACAAGGAAGCAGCATATGGCGTCTGATATTCTGGTCGTGGATGACGAGGCGGACATTCGCGAAATCGTGGCGGGCATTCTCTCCGATGAGGGGCATGAAACGCGTATGGCGTTCGACAGCGACAGCGCGCTGGCTGCCATTTCGGAACGCGTGCCGCGCCTGATCTTTCTCGACATCTGGATGCAGGGTTCGAAGCTCGACGGTCTGGCGCTGCTCGACGAGATCAAGAGCCGCCATCCCGAAATTCCGGTCGTGATGATCTCGGGTCACGGCAATATCGAAACCGCCGTCAACGCCATCAAGCGCGGCGCCTTCGACTTCATCGAAAAGCCGTTCAAGGCGGACCGTCTGATTCTCATTGCCGAACGCGCGCTGGAAAACTCCAAGCTGAAGCGCGAGGTTCAGGAGCTCAAGAAGCGCACGGGGGATGCGGTTGAACTTGTCGGTGCGTCACTTGCGGTATCGCAGCTGCGCCAGACCATCGACCGGGTCGCGCCCACCAACAGCCGAATCATGATTCTCGGCCCGTCAGGTTCCGGCAAGGAACTGGTGGCGCGGATGATCCACAAAAAGTCGTCTCGGGCGACAGGTCCCTTTGTTGCCCTTAACGCCGCGACGATCACGCCGGATCGCATGGAGATTGCACTCTTCGGCACCGAAGGCCTGCCTGGACAACCCCGGAAGGTGGGCGCACTGGAAGAAGCCCATCGTGGCGTTCTTTATCTCGACGAAGTGGGCGAGATGCCCCGCGAGACCCAGAACAAGATCCTCCGCGTGCTTGTCGACCAGCAATTCGAGCGTGTGGGCGGCGGCAAGCGCGTGAAGGTGGATGTGCGCATCATTTCGTCGACGGCCCATCACCTCGAAAGCCTGATCGCCGAGGGCCAGTTCCGTGAGGACCTCTACCACCGTCTCGCCGTCGTGCCGGTGAAGGTACCGGCGCTGTCCGAGCGACGTGAGGATATTCCCTTCCTCGTCGACATGTTCATGCGGCAGATTTCCGAACAGGCTGGCATCCGTCCGCGCAAGATCGGCGACGACGCAATGGCCGTTCTTCAGACCCATGACTGGCCGGGCAACATTCGACAGCTTCGCAACAATATCGAGCGGTTGATGATACTGGCGCGTCCCGAAGGCGGTGAAGCCCCGATCTCTGCTGACATGCTGCCGTCCGATATCGGCGACATGCTGCCGAAGATATCGGCTCAGGGCGATCAGCACATCATGACGCTGCCGTTGCGCGAAGCTCGCGAAATGTTCGAGCGAGACTATCTGGTGGCGCAGATCAACCGATTCGGGGGCAATATTTCGCGAACGGCGGAATTTGTCGGCATGGAACGCTCTGCACTGCATCGCAAACTGAAATCTCTCGGCGTATAAGGGGCCAGGGCTAGGTCGCCGTAACCGATTATCCGTCAGAGAGACAGCATGAGAGTCATCATTTGCGGCGCAGGGCAGGTGGGTTACGGCATCGCCGAACAATTGTCGCGCGAGGACAACGAAGTATCCGTCATCGATACCGCCGCGTCCTTGATAACCGCCATCACAGAGACGCTTGATGTCCGCGGTTACGTCGGCCATGGCGCGCACCCAGACATGCTGGCCAAGGCCGGAGCGGACCAGGCGGACATGATCATCGCCGTGACGCTGCATGACGAAATCAACATCGTCGCCTGCGAGGTTGCGCATGCCTTGTTCAGCGTGCCGACCAAGATTGCCCGCATCCGCGACCAGAGCTATCTTAAGCCTGAATATGCCGATCTCTTCAGCCGGGAAAACATGTCCATCGACGTGACGATCTCGCCGGAAGTTGAAGTCGGCAAGATGGTGCTGCGGCGCATCGCCTTTCCGGGCGCGACCGACGTGGTGCGTTTCGCGGACGACACGGTCTACATGCTGGCCATCGAATGTATGGAGGATTGCCCGGTCATCAATACGCCGTTGCAGCAACTCTCCAGCCTCTTTCCAGACCTCATCGCGACGGTTGTGGGTGTCTACCGGGATGGTTTCCTCAAGGTGGCGCATTCTTCCGAACAATTGCGGGTCGGCGACCTTGCCTACGTCATCTGCCAGCGGCAACACGCCCGCCGCACGCTCAGCCTGTTCGGTCACGAAGAGCAGGAGGCGCAGCGCATCGTTATTGCGGGCGCGGGCAATATCGGCCACTTCGTTGCAAGCAAGATCGAGGAGTTGCAGCCAAAGACGCGGGTGAAGATAATCGAAGCGGATCGCGACAGGGCAGTTGCCGCATCGGAACAGCTCAGCCACACCATCGTCATGCATGGTTCAGCGCTGGACCAGAAAATTCTCATGCAGGCCGACATTCAGGACGCGGACCTGATCGTCACCCTGACCAATAACGACCAGACCAATATCCTGGCTGCGGTCATGGCCAAGCAACTCGGCTGCAAGTCGAACCTCGCGCTTTTGAACAGTTCCTCCTTTCACGAGGTTGCCGACTCGCTCGGACTTGACGCCTATATCAATCCGAGGGCGGTCACCATTTCCCGCGTTCTCCAGCATGTGCGCAAGGGTCGTATTCGTTCCGTCTATGCCGTGCAGCGCGGTTCGGCAGAGGTGATCGAGGCGGAGGCGCTGGAAACGTCACCGCTCGTCGGTCAGTCCTTTCGTGACGTTGAAATGCCCGAAGGCGTCCGTATCGGCGCGATATATCGCGATGGCGTGGTCATCCGGCCGGACGGGAGCACCAAGATCAAGGCGAAAGACCGTGTCGTACTGTTCGCCTCCGCCGACGCGGTGCGGGACGTGGAACAGCTTTTCCGCGTTTCGATACAATATTTCTGACTGTTTCGACGGCAGAAAACGGGTCAATACTCGTACCCATTCGGCTGTTTTCGCCGTTGCCGAACAAGACACGATTTGATACCAGAGTTTGAGGTCGGCCAAACGGGGGACGAGATTTGGCCGGCTTATTATTGATTGATTATTTTCCGGTATCCCTGCCGGCAAAAAAGAAAGAAGCGGCGCCATGGCGGAACGTTCTCAAAACCTTCAAGATCTTTTCCTCAACACCGTACGCAAGAAAAAGATTTCACTGACGATCTTCCTGATCAACGGTGTGAAGCTCACGGGCGTGATAACGTCCTTCGATAATTTCTGCGTTCTGCTGCGCCGGGACGGTCATTCGCAGCTGGTTTACAAGCATGCGATCTCGACCATCATGCCCGGACAGCCGCTGGAACTCGAAATCGAAGAAGGTCCGGACCGCTCGCAGAATCTTCAGGATCTGTTCCTCAGCACGGTTCGCAAGAAGAACATTGCGCTGACGATCTTCCTGATCAACGGTGTCAAGCTCACGGGCGTGGTCACGGCTTTCGACAATTTCTGCGAGCTTCTGCGTCGCGACGATCACGCACAGCTTGTCTACAAGCACGCTATTTCGACCATCATGCCGGGTCAGCCGATGCAGATGTTCGAAAACGAAGAAGGCGCTGCACGCGAGGCGTCCTGACCTTCTCGTCTTAATGGAACCAGCCGCGCCGGGAACTTATTTGCCGCTGGGCGGTTGCTTCCTATTGACGTCCGCGCCGGCGCGGACGAATTCCAACCGTCTTTCCCGATGAGGACTGCCATTTCGACCCGCGACACCTCGAACGAATCGATCATTCCCGAGCAGGAAAAACACCGCGACGACATGCGCGCCGTGGTTCTTGTGCCGGTCCTGAAGCAGCGCGACACCCGCGACGCGGCTGCATTACCTGCCGCGGCTGGCCGCTCGGTCGAGGCGAAGCTTGAGGAAGCGAAGGGGCTAGCGCTCGCCATCGATCTCGAGGTGACCCAGGGGCTCATCGTCGCGGTCAACCAGCCACGCCCGGCGACATTGTTCGGCACCGGCAAGATCGAGGAAATCGGTCATCTTCTGGATGAAACCAATTCCGGCCTGGTCATTGTCGACCATCCGCTGACGCCGGTGCAGCAGCGCAATCTCGAAAAGCATTGGAATGCCAAGGTCATCGACCGAACGGGCCTGATCCTGGAAATCTTCGGGCGCCGAGCCTCCACCAAGGAGGGTACGCTGCAGGTCGATCTTGCGCATCTGAACTATCAGAAGGGTCGCCTCGTTCGAAGCTGGACACACCTTGAACGCCAGCGCGGTGGCGCGGGCTTCATGGGCGGCCCTGGTGAAACGCAGATCGAGGCTGACCGGCGCCTGCTTCAGGAACGCATCGTCAAGCTTGAGCGGGAACTGGAGCAGGTGGTGCGAACCCGCCAGCTTCATCGCGCCAAGCGCCGCAAGGTGCCGCATCCGATCGTGGCGCTGGTCGGGTATACCAATGCCGGCAAGTCGACGCTGTTCAACCGCATTACCGGTGCGGGCGTTCTGGCCGAGGACATGCTGTTCGCCACACTCGATCCGACATTGCGGCGCATGAAACTGCCGCACGGCCGCACGGTGATCCTGTCTGATACGGTCGGCTTCATTTCGGATCTGCCGACACATCTGGTCGCGGCTTTCCGTGCGACGCTGGAAGAAGTGCTGGAAGCCGATCTCATCCTGCATGTCCGCGATATGTCCGATCCGGACAATGCCGCGCAGTCGGCGGATGTGCTGCGTATTCTCGGCGATCTCGGCATTGACGAAAAGGAAGCCGAGCACCGTATCATCGAGGTCTGGAACAAGGTTGACCGCCTCGATCCGGAAGCCCACGACGCCATCATGCAAAGAGCCGAAGGCAGCGCCAATATTCGTGCTGTCTCTGCGATCACCGGCGAGGGCGTCGATGCCCTGATGGACGAGATTTCGAAACGCCTCTCCGGTGTCCTGACGGAGACGACCGTGGTCCTGTCCGTTGAACAATTGCCGCTGATTTCCTGGGTCTACAGCAATTCCATCGTCGACGGGCGGGAAGATCACGAGGATGGTTCGGTTGCCCTCGATGTTCGCCTGTCCGAAGCGCAGGCCGCAGAGCTGGAACGCAAGCTCGGGAAGACGACCACACGCGAGCGGGAGGACTGGGAACGCTGAGCGCTCCCGGCCGCCCCTTATTCCAAGTCATTATTCCGCGCAAAGCCGAGAGTTGACGTTTGCTGAAACGTCTTACGCCGCAGACCAGACCGCCAGTTCATATCCATCGGGATCGGTAAAATGGAACCGGCGTCCGCCGGGAAAGGCGAAGATTGCGACGGTTATCTCGCCACCCGCCGCCGCCACCTGCCGCTGGGCATCGTCGATATCAGCCGCATAGAGAATAACCAGCGGTCCACCCTTCGCGGAAACCGGCGCCGTGGTGGTAAAACCGCCGGTCAGGCGGCCGTCGGAAAATTCGCAATATTGCGGACCGTAATCCTTGAAAGACCAGCCGAAAGCGCCGCCGTAAAAGGCTTTGCTGCGCTCAATATCGCTCACGTTGAATTCGACATAGTCGATTCGCCGGTCGTTGCCATTTTCCGTCATCATTTACTCCATCACGTCTCTTCGAGCAGCGCCTTCAAGGCTTCGAGATCGGCTGTGACAGCCCTCGCATCCCCGTTGAAATCCTCGTCCGTCATCCCCTCAAGACGAAGCAGCGTGAAACTGACCTCGGCGCCACCGCCGTTCGGGGTCACCCTGAGCGCGTTATAGACCTTGAGGCCATCGGGCAAGGTGACGACGTGATCCACGACCCCGAATTCGTTTGCTGGCGCAAAATTCACCCTGACCTCGCCGAGTGGTCCGCCTTCGGCGATCCAGTCTTGGCCATCCGGCCTGAGGCCACCTGCCAGTCCGGCAGCCCAGCGAGCCATGTTGTTCGGATCGGAAACGAAGTCATAGACGTGACTCCACGGCTTCTCGATAGACAGATGAATGATTCTCGACTGCATGACAGACATTTCATCCTCCTCGTGCGGCACGCTCCTCTTGCTGCGCCACTTTGTCGTCACTTGATGCAAGTCAAGGATGTTTCACCGCAGAGATGGGAGAAAGCAAGGAGATAAAAACCGGACTCCTATCATGAACATCTATCGCGCCGGACAGGAACGCCCGCAGCAGCCCGTTGAAAGGCTGGACGCGGAGGCCGTCAATTCCGCAAAAACGCGCAAACCGCTTTACGAAGCACGCAGGAAAATTTTTCCCAAGCGCGCGGAAGGTCGGTTCCGGCGGTTCAAATGGCTTGTCATGCTCGTCACGCTCGGCATTTATTATCTCACGCCGTTCCTGCGTTGGGACCGTGGCCCCTATGCGCCCGATCAGGCAGTATTGATCGATATCGCCAATCGACGCTTTTATTTCTTTTTCATCGAAATCTGGCCGCAGGAATTTTTCTTCGTCGCGGGATTGCTGGTCATGGCGGGGTTGGGGCTGTTTCTCGTTACCTCCGCCGTGGGACGTGCATGGTGCGGTTACACCTGTCCCCAGACAGTGTGGGTGGACCTGTTTCTCGTCGTCGAACGGGCGATCGAGGGTGACCGAAATGCACGCATGAAGCTGGACGCCGCGCCCCTTACAGTCGGCAAATTCCGCAAGCGAGTGTTGAAGCACGCGATCTGGCTGGTGATTGGCGCCCTGACCGGCGGCGCCTGGATTTTCTACTTCGCCGATGCGCCGACGCTGGCGCGGGAATTCGTCACCGGCCAGGCGCCCATGATTGCCTATTCGACCGTCGCCATCCTCACCGCCACGACCTATGTCTTTGGCGGGTTGATGCGGGAACAGGTCTGCACCTACATGTGCCCTTGGCCACGCATTCAGGCGGCGATGCTGGATGAGAATTCGCTTGTCGTCACCTATAACGACTGGCGCGGCGAGCCACGCTCACGGCACGCCAAAAAGGCCGCCGCAGCGGGGGAGAGCGTCGGCGATTGCGTCGATTGCAACGCTTGCGTCGCGGTCTGTCCCATGGGCATCGACATCAGGGACGGGCAGCAGCTGGAATGCATCACCTGTGCGCTCTGCATCGATGCCTGCGACGGGGTGATGGACAAGATCGGCAAGCCGCGTGGCCTGATCGCCTATGCCACGCTTGCCGAGTATCAATCCAACATGGCGCTCGCGACCGGCAACGGTCAGCACGCAATCCGCCCAGCCAACGTGCGTGAGGAAGACGGCAAGTTCAGCAAGCGCGTGCGCCACTTCAACTGGCGCATCATCTTCCGTCCGCGCACGCTGCTTTATACTGCCATCTGGGCGGCTGTCGGCATCGGCATGCTGTCTGCGCTGGTGACGCGCGAGCGGCTGGCGCTCAATGTTCTGCACGACCGCAATCCGCAATATGTGCTGGAATCGAACGGATCCATCCGTAACGGCTACACCGTGCGCATCCTCAACATGGTGCCGCAGCCGCGCACAATGAGCCTGACCATCAATGGCCTTCCTGATGCCGTGATGAAGATAAACGGTATGCCCGATGCGGCCGCGCGTGCCTTTGAGGTGACGGTGGAGCCGGATGAAGCGACGACGCTGAAGGTCTTCGTCACCCGGCCGGGCGGGCGCGTCGCTCGGGCCGCGGAAAATTTCGAGTTTATCGTCAGCGATACGGGCGGCCATGAGACGGCCCGCTATGACGCTGTTTTCAACGCTCCGGGAGCGACAAAATGACCGTGAACAATCGTCACACGTCCGGTTTCACTTTTACCGGCTGGCATATGCTGGGCGTCATGCTGCTGTTTTTCGGCACCGTCATCACCGTCAACATGGTGATGGCCTGGAACGCGGTCAACAGCTGGAGCGGGCTTGTCGTGCCGAATACCTATGTGGCCAGCCAGCAGTTCAACGCGAAGGCGGAAGCAGCCAAGGCGAGGGCGGCGACCGGCATCAAGGGTAAACTTGCGGTGGACGAAAGGACGGTCCGCTACGAGGTCTTTCATCCCGATACCGGCCCGGTCGATACCGATCAGGTGATCGCGCATTTTCGCCGGCCCGTCGGGGAGCGCCAGAATTTCGACATGGAGCTGACCCCGGTTGCCAAAGGGGTCTTTACCGGCCCGCATGACATGCTGCCGGGTCAGTGGATCGTAGAAATTACAGCCGTCAAGAACGGCCGGATCATCGTGCACGAGGGTACGCGGATCGCCGTTGTCAGGGGGCGCCAATGAGCTGTTGTGCACCGGGGACGGAAGGATCGCTGCAGCTTGCCGATCCGGTCAATCCACCGTCGTCAGAGGAGCTGATGCTGGCAAGCCGTGATCTCGGGCAGGGTCTCCGCCAGACCGATCTCAGCGTGCCGGATGTTTATTGCGGCGCCTGCATTACCACCGTGGAGAGCGCGCTCGGCCGCTTGCCGCAGGTGGAGCGGGCGCGGTTGAACCTTTCGTCCAAGCGTGTTGCGATTGTCTGGAGACAAGAGGTCGAGGGTGTCAGGACTGATCCCGCCGACCTTGCCCGCGCCATTCTGGCGACGGGATACCGCACGCATCTTTTCGCCAGTGGTCAGGACGCTTCCGACGCGTTGCGTTCGCAGCTGATCAGGGCGGTTGCGCTATGCGGCTTTGCCTCGGCCAATATCATGCTTTTGTCCGTTTCCGTCTGGTCCGGGGCGGATGCGGCGACGCGCGACATGTTTCACTGGATTTCGGCGATGATCGCCGCCCCGGCGCTGATCTACGGTGGGCGTTTCTTTTATCAGTCGGCCTGGAGCGCGCTGAAACACGGCCGCACCAACATGGACGTGCCGATCGCACTGGCCATCACGCTGTCCTATGCCGTTTCGCTGTGGGAGACGATCCACCACGGTGAACATGCCTGGTTCGATGCGACGGTGTCGCTGCTGTTCTTCCTGCTGATCGGCCGCACGCTCGATCACATCATGCGCGACAAGGCTCGTTCGGCGATTGCCGGGCTCGCCCGGCTCTCGCCACGGGGCGCGACCGTCCTCGGTGAGAACGGAACACGCGAATACCGCCCGCTTGCTGATATCGAGCCGGGAATGTCGATTGCGATAGCGGCAGGAGACCGTGTGGCGGTGGACGCGGTGGTCGAGAGTGGCAGCAGCGATCTCGACATGTCGATCGTCAATGGTGAAAGCGCGCCACGGCGGGTTGCGGCGGGTGACAGCCTGCAGGCGGGTACGCTGAACCTTACCGGTTCGCTGGTGGCCCGGGTGACCGCCTCGGCCAGGGATTCATTCCTGTCGGAGGTCATCAGTCTGATGGAGGCGGCCGAAGGTGGGCGAGCGCGTTACCGCCGTATTGCCGACCGGGCCGCAAGCTATTATTCGCCCGTCGTCCATCTTCTGGCGCTTGTGTCGTTTCTCGGATGGGGCTTCTTCGGTGGCGACTGGAAACAGGCGATGCTGATCGCCATCGCGGTTCTCATCATCACCTGCCCCTGCGCGCTCGGCCTTGCCGTTCCCGTCGTGCAGGTGGTGGCCGCGGGAAGGCTTTTCCGGCACGGGATCATGGTCAAGGAAGGGTCGGCCATGGAGCGACTGGCGGAGATCGATACCGTACTGTTCGACAAGACCGGCACATTGACCATCGGCAGGCCGCGACTGGTCGAGACGGGCGAGGTGAAGCCCGCCACCATGGCAATTGCGGCAGGACTTGCTGCCCATTCCCGCCATCCCCTCTCAAAAGCTCTGCATGCCGCCTACAGCGGCCCTTTGCCGGCGTATGAGACGGTGCATGAAATCCCGGGTTCGGGCGTCGAGGCGAAGACTGATGTGGGAACCTATCGGCTCGGCAACCGCCGTTTCGCGTGCCCGGATGATGAGGGTGCCGACAACGGCAATGCGCGGTCGGAGGTGGTGTTGTCACTCGATGGCCGCCTTCTGGTGAGCTTCGGTTTCGATGACAATCCGCGCGCAGGTGCGGCTGCGGCATTGCGCAGCCTGTCGGTGCGGGGTTTGGCTCAGGAGATCGTTTCGGGTGACCGGGCAGCAGCTGTCAGCGCCATGGCCGACCGGCTGGGGATCGCCAACTGGAGCGCGGACCTGTCGCCGAAGGACAAGGCGGCGCGTTGCGCCTCGCTTGCCGGGGAAGGGCACAAAGTCCTGATGGTGGGAGACGGTATCAACGATGCGCCGGCGCTCGCCGCCGCACATGTTTCTGTGGCGCCCGCCACCGCGGCCGATATCGGCCGGCAGGCCGCCGATTTTGTTTTCATGCAGGAGGACCTCGATGCCGTGCCTTTCGCGATCGAGACATCGCAGCAGGCCGGAAAGCTCATCCGGCAGAACTTCGCGCTTGCCATCGGCTATAATATCATTGCTGTGCCGATCGCGATTGCTGGTTATGCGACGCCGATGATCGCGGCCATCGCCATGTCGACATCGTCGCTGATCGTCGTCGCCAACGCATTGCGCCTCGCCGGATCCGCCGGTCGTCGGAATGAACCAGAGACCGGGTTTGCGGGCGAGGGGGCGCAACTGGCATGAACATGCTGATCTATCTCATTCCCGTTGCGCTGCTATTAGGCGCACTCGGACTTTTTGCCTTCCTCTGGTCGGTGCGTTCGGGCCAGTATGAGGACATGGATGGGGCCGCCTGGCGGGCGCTGGACGATGGTGATAACCGGCCGCGATCCTCGATTTCGAATGGGCCATGACCTGCCGCAAATATGCAAAGCTGATTTGCGTCATCAAAATTTAATCGTTTCAGTCACGCCCAGCCTTGCCGTTTGGTGATTACAGTGCCACAACACGACCACCAAAAAGCAAGATGGAGGCACATCCGTGGAACAGGCAGAAATCGGTTTGATCGGTCTCGGCGTCATGGGCTCTAACCTGGCGCTCAACATCGCCGAAAAGGGCAACAAGATCGCCGTATTCAACCGAACCCCTGAAGCGACGAGAAAATTCTACGCTGAAGCCGGCGAGTTGCAGGGACAATTGATCCCTTGCGAAACCATCGAGGAATTCGTTGCCGCCATTCGTCCTCCGCGCCCGATCATCATCATGATCAAGGCTGGCGATCCGGTCGACCAGCAGATGGAAATCCTGAAGCCGCATCTTTCGAACGGCGACATCATGATCGACGCGGGCAATGCGAATTTCCGCGACACGATCCGCCGTTTCGACAATCTGAAGGACAGCGGCCTCACCTTTATCGGCATGGGCGTTTCGGGTGGCGAAGAGGGCGCGCGTCATGGACCGTCCATCATGGTTGGCGGCACCGAGGACAGCTGGAAGCGCGTCGAGAAAGTGCTCACCTCCATCTCGGCCAAATATAATGACGATCCGTGCGTGGCATGGCTCGGCAATGATGGTGCTGGCCACTTCGTCAAGACCATCCATAACGGTATCGAATATGCCGACATGCAGATGATCGCGGAAATCTACGGCATCCTGCGTGACGGTCTGAAGATGAGCGCTGCCGAAATCGCCGATGTGTTCGGTGAATGGAACAAGGGCCGTCTTAATTCCTACCTGATCGAAATCACCGAAAAGGTTCTGCGCGCCGCCGATCCGATCACCGGCAAACCGATGGTCGATCTGATCCTCGACAAGGCCGGCCAGAAGGGCACCGGCAAATGGTCCGTCATCGAGGCGCAGAACATGGGCGTCGCCGCAACCGCCATCGAAGCGGCCGTGGCTGCCCGTATTCTCTCCTCGCAGAAGGATGAGCGCGAAGCTGCCGAGAAGATCTTCGGCCTTCCGGCCCTTGCAGCCGCACCGGCCGACAGGAAGGCCTTCATCGCCGATCTGGAAAGCGCGCTTCTGGCCGCCAAGGTCGGTGCCTATGCGCAGGGCTTCGCCATCATGTCGGCCGCTTCGAAGGAGTTCAACTGGAACCTGCCGATGCCGACGATTGCCCGCATCTGGCGCGCCGGCTGCATCATCCGCTCGGAGTTCCTCGATGAGATCACCTCGGCCTTTACCAAGGATCCGCATGTGGCCAACCTCATCGTGACGCCGGCTTTCTCCGCCGTCGTCAAGGAAACCGATGCGCCGCTGCGCCGCGTCGTTTCTTACGCTGCCCTGTCGGGCCTGCCGGTTTCGGCGCTGGCTTCCGCCCTTGGCTATTTCGACGCCTATCGTCGCGGCCGCGGCAGCGCCAATCTCATCCAGGCGCAGCGCGACTTCTTCGGCGCGCATGGTTTCGAACGCACCGACGGCGTGGACAAGCCGCATGGCCCGTGGGGCAGCGGCGCCGATATTTTCTGATCGGCCATTGGATAAAGATAAAGGCCCGCGCAAGCGGGCCTTTTTCATTTCACCGGTGCTTGGAATTAGCGAGCGGCGGCAAGGCCGACGATCAGGCCGCTGACGGCATTGATGAGCAGGAACTGGTTGTCAACGCGCACCCAGCGCTGGTCGCGGCGCGGTTCGGCCAGGCGGTAACGGCGATAATCGCGACGATCGACGAAATGACGACGCTCTGCTGGCGAGAGGCGCTGACCACGATCCCAGCGGTGCTTCTTGACGATGACCTTTTTCGTGACATGACGCTCGACGGTGACGCCGCGGTGGCGATCATCGTGACGGCCCTGCGCCTGAGCCATCGGGGCCGCGAGAACGGTTGCAGCCAGAAGAATGGTGACGAACTTTTTCATGGAAGTTTCCTCTCGTGTTTCAATGAGAAGAACTTACGAATAAACGAATGAACGGAAACTGAATTTAAAATTACAATTATGTAATGGTTGTTAAGGCTTAGAGGCGGAAGTTAATCTTTTGTTTTGAAGTTTCTTGACGCGGTCGGACCGCCCGGAATAACCGGGCGGTCTCTCCTCGGTCAGACGCGTTCGGCAGATTTTGCCCAGAGGTTGATATCCGCGTCCTTGGCATAGCGGTCGATTTCAGCGAGTTCTTCGGTCGTGAAATCCGGTTTGTCCAGCGCTTTCACACAGTCTTCAACCTGCTCGACGCGGCTGGCGCCGATCAGCGCCGAGGTGATGCGACCGCCACGCAGCACCCAGGCGATGGCCATCTGCGCCAGTGTCTGGCCGCGGCGTTCGGCGATGCCGTTCAGCGCACGGATATTCTCGACATTACGTTCGTTGAGGAAAGCCGGGTTCAGCGACTTGCTCTGCGATGCGCGGCTGCCATCGGGCACGCCGCCGAGATATTTCGTCGTCAGCATGCCCTGCGCCAGCGGCGAGAAGACGATGGAGCCAATGCCTAGTTCCTCCAGCGTATCAACGAGACCATCTTCTTCAATCCAGCGGTTGATCATCGAATAGCTCGGCTGGTGGATGATGCAGGGCGTGCCGAGGTCCTTCAGGATGGCAGCTGCCTCGCGGGTGCGCTTCGAATTGTAGGAAGAGATGCCGACATAAAGCGCCTTGCCCGAACGCACGATCTGATCGAGCGCACCACAGGTTTCCTCAAGCGGCGTGTTGGGATCGAAACGGTGAGAATAGAAGATATCGACGTAATCCAGCCCCATGCGTTTCAGGCTCTGGTCGCAGGAGGCGATAAGATATTTGCGGCTGCCCCATTCGCCGTAAGGGCCTGGCCACATGTTGTAACCCGCCTTGGAGGAGATGATCATCTCGTCACGATAACCCCGGAAATCCGTCCTCAGGATTTCGCCGAAGGCGGTCTCGGCGCTGCCGGGCGGCGGGCCATAATTGTTGGCGAGGTCGAAATGGGTAATGCCGAGATCGAAGGCACGGCGGCAGATGGCCTGTTTCGTCTGATGCGGCGTATCGTTGCCGAAATTATGCCAGAGTCCCAGCGAGATCGCCGGCAGTTTCAGTCCGGTCTTTCCGCAATAATTGTATTTCATGGACGTGTAACGGTTTTCGGCCGGTTGCCAAACCATGGGTCTTCCTCCACTTGCTGATGATCGGCTGCCGAAGAACCCGGCAACCATTTGAACTGATGAGGGCAGGCCGCGACGCCTGTCTTCCCGAACAGGATGGTTATGCCTTTCTCCGCGTCGTTTCAATGCGGGGACTGGCAGCCATGAAGATCGAAGACACGGAAGTGCGGCTGTCCGCCGCCCGCTTTCGGGGTGAAGATGATACGCCGCGTTTCGCCGGCGGCGAGATCGAAAAGGTTGTCGGAATAACGCCCGGGCTGATCTGCCTCCAGCATGACGAAGAGCGCAAGCCCGCTTGCGGTGACATCTATTTCGAACTGCCCGTTTTTCAGTGAGCTGGTGGTGAATGTCAGACCGGCGGGCAGAAGTTCCAGCGCTTTATAGGTGTCGCGCACATGGTGGCCTTCCCCGGTCATGCCGTTGGAGGCGATGAAGCTCCAGGCAAGGATGGTGCCGTCAGGCAGGCTGCTCATGTCGATGTCGGTCAGCGTAGCGGCCTTGTCGGTCGTGCAGGTGCCGTTGGCGGATTTCAACGGCACTCGGTTGCCGTCCATAGTCAGGGCAACGATATTCATGTCGATCTCGACGTCTTCCGCCGTGTCGTTCACCATGGAGAAGTTTATTCGGCGTCCATCCTCCGCAGGGATAGCCGAAACCGTTACTGGCTGGAAGAAGCGGCGGGCGGCATAATGCAGCGCCTTCCAGCCGCCGCCATAATCGAGGCTTGACCACGAGGCGACCGGCCAGGTGTCGTTCAGCTGCCAGTATAGCGTTCCCATGCAATGCGGCTTCAGCGAGCGCCAGTAATCGACGGCCGTGCGGATAGCCAATGCCTGCTGGACCTGGCTCAGATAAACGAAATTCTCGAAGTCCTTTGGGAAGCGGAAGTAGCGGAACATCGTACCGGCGATCCGCTCGTTGCCGCCGGCATTTTTCTGGTGCAGTTCGATGACGGGGGAGGCGATGTTCATGTCCTTGTCTTCCGCATAGGTGCGGATGACCGGCATGGAGGTGTAGGACTGGAAACCGAATTCCGAGCAGAAACGCGGCTTCACCGAACGGTAATTGTCGAATGACTTGTTCTCGTGCCACACGGACCAGTAATGCATGTCGCCGGAACCATCCGCGTGCCAGGCATCGCCATAATCGAGATAACCGGAGGCGGGGCTTGATGGCCACCACAGGGCTTCGGGAGCGGCCTTTTTCAGGGCCTTTTCAATGGTGCGGTTCAGTCGGTCATAAGCCACAAGGTATCGGTCGCGATTGTTGCGGGACTCATCGAACCAGGTCAGTGCACCCACCAGCTCGTTATCGCCGCACCAGAGCGCGATGGAAGGATGCGAGGAGAGGCGTTTCACCTGATAATCAACCTCGTGTTCGACATTATCGAGAAAGTCCTCGCTGCAGGGATAAAGATTGCAGGCGAACATGAAGTCCTGCCACACCAGCAGGCCAAGATGGTCGCAGAGATCGTAGAACCAGTCTTCCTCATAAAATCCGCCGCCCCAGACCCGGATCATGTTCATGTTAGCCTCGACCGCGGAGCAGAGGAGGTCTTCGGTCTTTTCCCGGCTGGTGGGCGAATAGAGCGCATCTGCGGGAATCCAGTTGGCACCGCGGCAGAAGATTTCCCGGCCGTTGATGCGGAACGCGAAGCGGCTGCCAGCCTCATCCTTATCGGTCAGAAGCTCGATGGTCCTGAAGCCGATCTGCCGGGTCACAGTCTCATCGGGCAGCTCAACCATCAGCGTGTAGAGCGTCTGCTCGCCACTGCCGGCCGGCCACCAGAGCTCCGGATTTTCCACGAAGAAGACGTGACGCACCACGGTTTCGCCTGCGCCGACGCCGCAATCGAGCCTCAGCCTTTCATCGTTGAGAGACAGATAAACCGGCAGGCTTGCCGGTCCCTCGGCAAAGAGGGTGACGGCGACATGCAGCTCGACGCCGCCGTCGATATGGCGCTGTGAGGTAACGACATGCTCGATCCGGGCCGTATCGAGCCGTTTCAGCAGGATTTTGCCGTAAAGCCCGAGTGGCGCGATCGCGATGTTCCAGTCCCAGCCGAAATGGCATTGCGGCTTGCGCAGCATATTGCCATTGGCGATGGGCGAATTCCCCGGGTGATAGGGTATATAAAACGGCTGGCGCGCCTGCCGCTCCGCCCCGGCGGTTATATTGGAGTGGAAATGGATGCGAATGGTGTTTTCACCGGGGCGCACTGCACGGGAAATGTCAGGGCGGTAGCGACGGAAGCAATTGTCCGCGCTGAGCACCGGAACGTCGTTCACGAAGACAATCGCGACGGTGTCGAGATAGTCGATGTCGAGATACCAGCTTGCGTCCGCCTCATCGAGGATGAACGTTCGCTCAAGAACCCAGTCCTGTTGGGCCACCCATTGCACGGCCTTTTCATTGGCTCCGTGGTAGGGTTCTGGAATGATGCCCGCATCTTTGAGTGCCGTATGGATATCCCCAGGGACGGTAAGCTCGGTCGTATGATCTCTGTCCACTGATGTCAGACGCCAGAGCCCTGCCAGATCGATGACGGTTTCGGGCGTGGAGGAAGAAATCATGACGGAACCCTGCATTTATGAATTCAAATAAGAAAGCCGGCATTGTGATGCAGGCTTGTGTCAGAATCTGGGGAGGATGGACGCCGCAAGTGGCTAACCGCCCGCTTGCTCGGTCTCATGATACCCTCCCTTCGGCAACGTTACCGTAAAACCTCCATTCCGTCATGTCTTTCAGGTACAACATATGTCAGGAAAAATTTCTGGAACGGAGCATCAATCGCGGCCGGATGGATCGGTCTAAAGGTTATTTTATTTTTCGATATTATATCAATAAGTTATTATAATTTTTCAATGTTTATGATTAAATCATATATTGGCTACGCAGCCATCACGGGCGGGCGAGAGAAAAGCGCAGTAAGAACCGGACATTTTTTCGAGCTACATCGTTATAGCCCAAACAGGGGCTCCCGCGCGGAAAGCGGATCCCGGTTGAGGACCTTTGCCTTTGCAACGACGTAAAAGCTATCTATGATCACCGTTCGGTCAAGCATGGCGAAATGACATGAACGATACTGGTAAATCCGGCGGGAGCGAAGCGCCCGTGACGACGGGCGAGCGCCCGACCCTGAAGACCATCGCTTACATGACGGGCCTCGGCATCACCACCGTTTCGCGCGCTCTCAAGGATGCGCCTGATATCGGCGCGGAGACCAAGGAGCGCGTGCGGCTGATTGCCCGCCAGATCGGCTATCAGCCAAACCGCGCCGGCGTGCGCCTTCGCACCGGCAAGACCAATGTGATTGCCCTCGTTCTCAGCGTCGATGAAGAGCTGATGGGTTTTACCAGCCAGATGGTGTTCGGCATCACCGAGGTTCTGTCCTCGACCCAATATCATCTGGTCGTAACGCCCCATATCCACGCCAAGGATTCCATGGTGCCGATCCGCTACATTCTCGAAACGGGATCGGCGGATGGCGTCATCATTTCGAAGATCGAACCTAACGATCCGCGAGTCCGTTTCATGACGGAGCGCAAGATGCCCTTCGTCACCCATGGACGTTCGGACATGGGCATCGACCATGCCTTTCATGATTTTGACAATGAAGCCTACGCCTACGAAGCGGTCGAGCGATTGGCTCAATGCGGACGCAGGAAGATCGCCGTCATCGTGCCGCCGTCACGTTTTTCGTTTCACGACCACGCTCGCAAGGGGTTTAATCGGGCAATCAGGGATTTCGGCCTTACCGAATTTCCGTTGGACACCGTTACAATCGAAACGCCGCTGGAAAAAATCCGCGATTTCGGTCAGAGGCTCATGCAGTCCGATGATCGTCCCGATGGCATCGTTTCGATCAGTGGCAGCAGCACCATAGCGCTGGTGGCCGGGTTCGAGGCCGCGGGCGTGAAAATCGGAAAGGACATCGACATCGTCTCGAAACAGTCCGCCGAATTCCTGAACTGGATCAAGCCGCAGATCCATACCGTCAATGAGGATATCAAACTGGCGGGCAGGGAGCTTGCGAAAGCCCTTTTGGCCCGCATCAACGGCGCGCCCGCGGAAACTCTGCAGAGTATCAGCCGCCCGGTCTGGTCCTCCATGGCGCCGAAGCCTTAAGGCACACGCAAGACCGAGCCGGCATGGCGCATTGGCCATGCCGGGCTTGATGAGTGCCAATCAGGACAAACAATTCTTCAGCGACCCGGCGATACCGCGCATCAGCTTGAAGTAAAGCTCCGGGCCTGGCTCCAGCGTTGCCGCTTCCGGATCAAGCGTAGCCGACTTCGCTGCCGTACCTTCGGTGATGACCGAGACCAGCTTAGGCTCGAATTGCGGTTCGGCAAAAACGCAGGTCGCGCCGAGCTGGCGAACCTTTTCTTGCATCTGTTTGACGCGATCCGCGCCCGGCAGGGTTTCGGGGCTGACGGTGATCGATCCCGCCGTCTTCACGCCATAGCGATGTTCGAAATATTGGTAGGCGTCGTGGAAGACAATGAACGGCTTGTCCTTCACCGGCTGAACCGTTTTGGCCAGTTCGGCGTCAAGTGTATCGAGATCATCGATCAGTTTCTTTGTGTTGGACTGATAGATCGCGGCATTGCCGGGATCGGCCGCGATCAGCGCGGTTTCGATTGCCTGGGCCATGGCCTTGGCATTGGCGGGGTCAAGCCAGAGATGTGTGTCATAGGTGCCATGCTCATGGTCGTCGTGGCTCTCACCGTCGTGACCGGCGTGACCGCCTCCGTCATGCTCGTGACCATGCTCGTGCGCGTCCGGCTTTTCGGCATGACCGTCATTCGCCTCATGGCCTTCTTCACCATGATCATGCGCTTCAAAGGGCCCCCCTTCACGGAAGGGCAGCTTTTCCAGCCCCTTGGCATCTTCCAGTTCCACAACAGTCGCCTTGGCGGCCAGCGCTTCCAGCGGCTTCTCCAGAAATGCTTCCAGTCCGGGACCCACCCAGAAAACCACATCCGCCGTTTCGAGCTTGCGTGCATTCGACGGACGGAGATTATAGGTATGCGGCGAGGCCGCGCCATCGACGATGAGCTGTGGTTCCCCGACACCTTGCATGATGGCCGCAACGAGGGAGTGGACGGGCTTGATCGACACCACGACCTCGGGTGCAGCGGTGGCGCCCGAAGCGGCGACCGCGATTGCCATGGAGGCCGCGAGCGGGATCAGGATCGATTTCATGAAATGCACTCCACTTGAAGAACGTTAGTTATGTTATTACATTTATTGCGTAACTCTATAACGTATGCGATAGCGGTAAGCAACATCGCCAAACGGACAATCCCATGCTTCATTCCGCCCATCCGGGTAACGAGATACTGGTCTCCCTTGCCAATGCCGGCGTCCAGCGCAACGGCCGCTGGCTGGTGCGCGGCGTGGAGTTTTCCGTCAGCAAGGGCGAGATCGTCACACTGATCGGGCCGAACGGATCAGGCAAGTCCACGAGCGCGAAAATGGCGATTGGTGTCGTCAAACCAACCGAGGGGGTGGTTACCCGCAAGGCTGGTCTGAAAGTGGGTTATGTGCCGCAGAAGCTTTCGGTCGACTGGACAATGCCGCTTTCGGTGCGCCGACTGATGACACTGACAGGACCGCTCGCTGCCCGCGAAATAGACGCCGCTCTGAATGCGACCGGTATCGCGCATCTTGCCAATGCCGAGGTGCAGCATCTGTCCGGTGGCGAGTTCCAACGCGCGCTTCTTGCCCGCGCCATTGCCCGCAAGCCCGACCTTCTGGTTCTTGACGAACCCGTGCAGGGGGTGGACTTCTCCGGCGAGATCGCGCTTTATGATCTGATAAAAAATATCAGGAATTCAAATAATTGCGGAATTCTGTTAATCTCGCATGATCTGCATGTAGTGATGGCGGAAACCGATACGGTGATATGCCTCAACGGCCATGTCTGCTGCCGCGGCACACCGCAGGCGGTGAGCCAGAGTCCGGAATATATGCGCCTGTTTGGCGGCGCGGCGGCGAAGGGGCTGGCCGTCTACAGTCACCATCATGATCACACCCATTTGCCGGACGGCCGGGTGCAACATGCGGATGGCACGGTGACGGACCATTGCCACCCCGAGGATGGTCACCATCACGGGCATGATCTTCATCATGACCACGGGCATGACCATCATGACCACGGCCACCGCCACGATGATCATGGAGAATGCGGCTGCGGCCATGAACGTGACGACGACGCCCACCTTAACCAGCGGCAGGGAGAACGCCATGTTTGACGATTTCTTCGTCCGCGCCATGGTCGCCGGCATTGGGGTTGCATTGACAGCCGGTCCGCTTGGCTGCTTTGTCGTCTGGCGGCGCATGGCCTATTTTGGCGATACCATGGCCCATTCGGCGCTTCTGGGTGTTGCCCTGTCTCTGCTGCTGCAGCTGAACCTCATTGTCAGCGTGTTTCTGGTCGCGTCGGCCGTGTCGCTCCTTTTGATTTTCCTGCAACGGCGGCAGGCGCTGTCCGCCGATGCGCTGCTGGGCATCCTGTCCCACTCCGCATTGGCGATCGGCCTTGTCATCGTTGCCTTCATGAGCTGGGTCCGCATCGATCTCGTCTCGTTCCTGTTCGGCGACATTCTCGCCGTCACCCGCAGCGATATTGCACTGATCTGGGGCGGCGGACTGGTTGTCATCGTCTCCATGGTGTTCCTGTGGCGATCGCTGCTTGCCTCCACCGTCAATACGGAGCTGGCGGAAGCCGAGGGGCTGAAACCGGAACGGGCGAAATTGATCTTCACGTTGCTGATGGCGCTGGTGATCGCCATCGCGATGAAGGTGGTCGGTATCATGCTCATCACATCGCTGCTCATCATACCGGCTGCGACCGCCAGACGTTTCTCCGCTACACCGGAGGTAATGGCGGTGGTGGCTTCGCTGATCGGTGCGGTTGCCGTTGTTGGCGGCCTCTTCGGATCGCTCACCTATGATACACCGTCCGGTCCATCCATCGTGGTTGCCGCGGTGATCCTCTTCGTTATAAGCCTGTTGCCGGCGCCGGGTTTGTCCCGCTCCGCGGATGAAGGAGGCAAGTCATGAATGCGCAGACCCAGCAGAACCTCACCAAGAACCAGTCGCTGGTCATGAACGCCCTTTCAAACGCGCATCAGCCGCTCAGCGCCTATATGATCCTCGACAAGTTGCGCGATGATGGTTTCCGCGCGCCGCTGCAGGTCTATCGGGCGCTGGAAAAACTGGTGGAATATGGCCTTGTGCACCGGCTGGAGAGCCTTAACGCCTTCGTCGCCTGCACCCATACCCAGGCGGAGTGCTGCTCCAGTCACCATGGCACCGTCGCCTTCGCCATCTGCGAGTCCTGCGGGCAGGTCACGGAATTCCACGATCACGAGATCGATCACCGGCTGGAGCGCTGGGTGAAGGACAGCAAGTTCAAAGCGGAAAAGACCACCATCGAAATCCGCGGTCTTTGCGCCGCCTGCTCGGCCTGACGGACGATGGATGGAATGGGTATTGTTGCCCGCTCCCGATCACCTTTCGATGACCGCCAGATCCTGCGAGAAACGCTTCCAGTTTTCGACGTATTTTTTCGCTGAGCGGCGAATCCCCGCCACGGCGTCGTCATCAAGCGTGCGGATAGCGCGCGCGGGCGAGCCGACGATCAGGGAATTATCCGGAAATTCCTTCCCCTCCGTCACCAGAGCATTGGCGCCGACGAGGCAGTTTCGGCCGATCTTTGCGCCGTTCAGGATCGTCGCGCCCATGCCGATCAACGAATTATCGCCGATGGTACAGCCATGCACGATAGCGTGGTGGCCAATGGTGCACATTTCGCCGATCGTTGCGGCAAAGCCGGGATCGCTGTGAACCATCACGCCTTCCTGAATATTGGTGCCGCGCCCGACGATGATCGGCTCATTGTCTCCACGCAGCGTTGCACCGAACCAAATGCCGACGTCTTCGCCGAGCGTGACTGAGCCAATGACATTGGCGTCGGGAGCGATCCAGTAGCGATCCGCTGCAGGCGTTTGCGGCACACGGTCGGCAAGGCGGTAAAGCGGCATGTCGATGTCCTCCCTTATGTTCGTGCGTCAGACGACCGTCACCGACAGTGTGCCGACACCCTCGACGCCGCATTCTATGCGGTCACCGCGAACGATTGGACCAACGCCAGCCGGCGTGCCGGTCATGATGACGTCGCCAGCGGCAAGGGTGAACAGCTTGGAAAGCTCGGCAATCACTTCCGGCACCTTCCAGATCATCTGGGCGAGATCGCCCGTCTGCTTGCGCTCGCCATTGACGTCGAGCCAGATCGCGCCGGTTGCGGGATGCCCGATCCTGTCCGCTGGGACGACGGGGGACACCGGCGCGGAATATTCGAAGGCCTTCGCGCCCTCCCAGGAACGTCCCATCTTCTTCAGCCCGTCCTGCAGGTCGCGGCGCGTCATGTCGATACCGACGGCGTAACCCCAGACATGGTTCAGCGCTTCCGACGCCGGAATATCCGCTCGGCCGCTTTTCAGCACCACGACGCACTCAACTTCGTAATGCACATTCGATGAGAGCGACGGATAGGGAAAATCCTGTCCTGCGGGCAGCAGGTTGTCCGGGTTCTTCTGAAAATAGAAGGGCGGCTCGCGCGAGGGGTCGTGGCCCATCTCAATGGCGTGATCGGCATAATTGCGGCCGACACAATAGACCCGCCGTACAGGGAACAGATCGCTCGTGCCTTCGACGGGCAGGAGAACGGGCTTCGGGGCAGGGATGACGGTAGCGGCCATGTGGGTTTCCTGTCGTCTTTTAGGGTGCAATTCTTGTCCCGCGATTCCGGCGCGATTTCCAGTTCAAAATTGCCTCGCCTGCAAAACTGGCTTTTTACGGTCCTTTGCTGTAGAAGCCGGCGCAAAGAGGTTAATCCATGTACAGACAAGCGCTTGCAAGCGGCGAAAAAATCTTCGCCGTGGCCCCCATGATCGACTGGACCGATACGAGGTGCAGGTTTCTGCACCGGCAATTGTCGAAGCGTGCGCTGCTTTACACCGAAATGATCGTTGCCGACGCCATCATCCATGGCCAGCGTGACAGGCTGCTCGGTTATCACCCTCAGGAACACCCCGTCGCGTTGCAGCTCGGCGGGTCTGATCCGGCCAAGCTTGCCGAAGCGGTGCGGATCGCCGGGGACTACGGTTATGATGAAATCAATCTGAATGTTGGCTGTCCCTCGGATCGGGTGCAATCCGGCACCTTCGGCGCTTGCCTGATGCGCGAGCCTGATGTGGTGGCGCAATGTGTGGCCGCAATGAAGGCTGCCGCCAGTGGGCCGGTTACCGTGAAATGCCGGATCGGGGTGGATGAGCAGGAGCCGGAAGCGGTTCTGCCTGATTTTCTGAAGAGGGTTGTTGCGGCGGGTGCGGACGCGGTCTGGGTTCACGCGCGCAAAGCCTGGCTGCAAGGGCTTTCGCCGAAGGAAAACCGCGAAGTGCCGCCGCTCGATTATGACCTCGTCTATCGCATGAAGCGGGAAAATCCTGACGTCTTTATCGGCATCAACGGCGGCATCGCCGATCTCGATCAGGCGAGCGGACATCTCCAGCATATGGATGGCGTGATGCTCGGCCGGGCTGCCTATCACAACACCTCCATCCTTGCCGACGTCGACCATCGCATCTATGGCGAGGAGGCCCGCCACCCGGACTGGATGGCGTTGCGTGACGCGATGATGGCCTATGCCGCCGATTATATCGCCGCGGGCGGACGTCTCAACCACGTCACCCGCCATATGGTCGGACTGTTTCAGGGCATGCCCGGCGCGCGCCGTTTCCGGCAAATCCTCTCCAGCGATGCCACGAGGCCCGGGGCAGGGCCGGAGGTCATCGAGGCCGCTTTCGCCAGCATCGATGTCAGCACCACGAAAGACATGGCGGGCTGAAAGCGCTCAGACCGCCTTTTGCAGTTCCTGCGCACCATTAGGCGCATTGACCTTGCCGCCGGTAATGAAGAAAGCGAAAACATCGCGCGGGGTCTTCTCAACCTTCCGATCCGGCGCGATCAGCGGCAGATCGTCCGGCACACCGCCACTGGCATAGGTCTTTACCCGCTCAGCCCAGTGCGCGATCTCTTTTTTCGGGTAACAGGTCTCGATGTCGTCCTCGCCCTTCTGCAGGCGGCAATAAACGAAATCGGCCGTTACGTCAGGCAGCATCGGATAATCATGATGGTCAGCGCAGACGACGGCGACCTTGTGCCTGGCAAGAAGTTCGATGAATTCGGGAACCTGGAATGTCGGGTTGCGCACCTCCACGACATGGCGAAGAGTGAGCCCGTCCTGCTTTTCGGGCAGCAGCGCCAGAAATGCACCGAAATCCTCCGCATCGAATTTCTTCGTGGGCGCAAACTGCCAGAGGATGGGACCGAGATGGGACCCGAGTTCCGTCAGCCCCTGCGTCAGGAACTTCGTCATCGACTCACCTGCCTCGGCCAGAACCCTGCGGTTGGTGACAAAGCGGCTTGCCTTTAGGGAAAACACGAAGTCCTCGGGCACTTCGGATGCCCATTTCGCAAAGGTCTCCGGTTTCTGGCTGCTGTAATAGGTGCCGTTCACCTCGATTGCCGTGAGCTGCCGGCTGGCGTGTTCCAGCTGGCGTTTTTTCGGCAGTTTTTCGGGATAAAAAGTGCCTTCCCAGGGCTCGAAGGTCCAGCCGCCGATACCGGTGCGGATCGTGCCTGATGTCGACATATCTAGTTCCTCCTCCGGCCTTCGGCCTTTCGAACCTTTGCGACGAAAGCTGCGGCTTACTCCGCAGCCTCCCGTTTTCCGGCCGGCCGGCGCTCCAGAAGCTCCTTCAGGAAGTGGCCGGTATAGGAACGCTCCACCTTGACGATGTCTTCCGGCGTGCCCGTGGCCACCACTTCGCCGCCGCCCGTGCCGCCTTCAGGCCCGATATCGATGATCCAGTCCGCCGTCTTGATGACTTCGAGATTATGCTCGATCACCACCACGGAATTGCCCTGTTCCACCAGCGCATGCAGCATTTCGAGCAGCTTGTTGACGTCGTGGAAATGCAGGCCCGTCGTCGGCTCGTCGAGAATATAGAGCGTGCGGCCCGTCGAGCGCTTCGACAGTTCCTTGGCGAGCTTGACGCGCTGCGCCTCGCCGCCTGAAAGCGTATTGGCCTGCTGTCCGACCTTGATGTAACCGAGGCCGACATCGAAGAGCGATTGCAGCTTGTCGCGCACGGCGGGAACGGCGGCGAAAAATTCGACGCCCTCCTCAACCGTCATATCCAGCACGTCGGCGATGGATTTGCCCTTGAAGGTGACGTCCAGCGTCTCGCGGTTATAACGCTTGCCGTGGCAGACGTCGCAGGTGACGTAGACGTCAGGCAGGAAGTGCATCTCGATCTTGATCACCCCGTCGCCCTGGCAGGCCTCGCAGCGGCCACCCTTGACGTTGAAGGAGAAACGGCCGGGCGCGTAACCACGAGCCTTGGCTTCCGGCAGGCCGGCGAACCAGTCACGGATCGGCGTAAAGGCGCCGGTATAGGTGGCCGGGTTGGAGCGCGGCGTGCGTCCGATCGGCGACTGGTCGATATCGATCACCTTGTCGATGAATTCGAAACCATCGATGCGGTCGTGTTCGGCCGGGATTTCACGCGCGCCCATGACCCTGCGCGCCGCCGATTTATAGAGCGTCTCGATCAGGAAGGTGGATTTGCCGCCACCCGAAACCCCGGTTACGGCCGTGAAAACACCGAGCGGCACAGCGGCCGTGACATTCTTCAGGTTGTTGCCGCGTGCGCCGAAAACCTTGATCTCGCGGCCCTTCTTGGGTTTGCGGCGCTCAGCCGGCACGGCAACGCCGAGTTCACCGGAGAGATATTTGCCGGTCAGCGATTTCGGATTGGCCATCACTTCCTGCGGGGATCCTTCGGCAATCACCTGGCCGCCGTGAATACCGGCCGCCGGGCCGATATCGACCACATAATCCGCCGTCAGAATGGCGTCCTCATCATGTTCGACCACGATGACGGTGTTGCCGATATCGCGCAGATGTTTGAGCGTATCCAGCAGGCGCGCATTGTCGCGCTGGTGCAGGCCGATCGAGGGCTCATCGAGAACGTAGAGAACGCCGGTGAGGCCAGAGCCGATCTGCGAGGCCAGACGGATACGCTGGCTCTCCCCGCCCGACAGCGTGCCCGAATTGCGCGACAGGCTGAGATAATCCAGCCCTACATCGTTGAGGAATCGCAGGCGTTCGCGGATTTCCTTGAGAATACGGACGGCAATTTCATTCTGCTTCGCGTTCAGGCTTTCCGGCAGCACCTCAAACCAGTCGCGCGCCGTCCGGATCGACATTTCGGTGACTTCACCGATATGCAGCTTGTTGATCTTGACCGCCAGCGCCTCCGGCTTCAGGCGGTAACCGGCGCAGGCCGGGCAGGGGGCTGCCGACATATAGCGCTCGATTTCCTCGCGCGCCCAGGCGCTGTCTGTCTCTTTCCAGCGCCGCTCCAGATTGGGCACGATGCCTTCGAAGTTCTTGACTGTCTTGTAGGAGCGGGCGCCGTCCTGGTAATTGAACTCGATCTTGTCGTCAGTGCCCTGAAGAATGGCGTGCTGCGCCTCCTTCGAAAGATCGGACCACTTGCTCGAAAGCTTGAATCCGAAGGCTTTGCCGAGCGCTTCCAGCGTCTGGTTGTAATAGGGCGACGATGATTTTGCCCAGGGAGCAATGGCCCCGTCACGCAGGGTACGGGCCGGCTCCGGCACGATCAGGTTCTCATCCACCTTCTGCTGCGAGCCAAGGCCATCGCAGCTGGGGCAGGCGCCGAAGGGGTTGTTGAAGGAGAAGAGACGCGGTTCGATTTCCGGAATGGTGAAGCCGGAAACGGGGCAGGCGAATTTTTCCGAAAACAGCACCCGCTCATGGGTCTCGTTGAGAGACTTGTTGGCGGAACCACCGGCGGCTGTCTCTTCGGGCGGCAGGGGCCTGTCGGCAAATTCGGCAATTGCCAGACCATCGGCAAGCTTGAGGCAGGTCTCGAGACTGTCGGCAAGACGCGCGGCCATATCCGGGCGCACGACGGCGCGGTCAACCACCACGTCAATGTCGTGTTTGTACTTCTTGTCGAGAGCAGGAACGTCGGCGATCTCGTAGAACTGGCCGTCCACCTTGACGCGCTGGAAGCCCTTCTTCATCAGCTCGGCAAGTTCTTTCTTGTACTCGCCTTTGCGGCCGCGCACGATCGGCGCGAGGATGTAAAGACGCGTGCCTTCCTCGAAAGCGAGGATGCGGTCGACCATCTGGCTGACCGTCTGGCTTTCAATCGGCAGGCCCGTGGCCGGCGAATAGGGGACACCGACACGCGCGAAGAGAAGGCGCATATAGTCGTAGATCTCGGTGACCGTGCCGACCGTGGAGCGCGGGTTGCGCGAGGTGGTCTTCTGTTCGATGGAGATTGCCGGCGACAGGCCCTCGATCTGGTCGACATCCGGCTTCTGCATCATTTCGAGGAACTGGCGCGCATAGGCCGAGAGGCTTTCGACATAACGGCGCTGGCCTTCGGCGTAGATCGTATCGAAGGCGAGCGACGACTTGCCTGAGCCGGAAAGCCCGGTCATGACGATCAGCTTGTTGCGTGGCAGATCGAGATCGATACCCTTGAGATTGTGTTCACGGGCACCACGGATGGAAATCGTCTTCAGTTCACTCATGACAAGGCTCGGACAAAAATAACAGGCTGGATGGCGGCGCGTCCTTATGTAGTGACGCAGCCGGACGTGTCGAGGTCTATTCCCTATCGGTCGCAGCTTTTCACGATGCTTGACTCGAAATCTGCAATTCAATAGAACAAAAAAGGAACGAATTGCAAGCGCGAATGTCTGCCCCCACAGCTTAAGATTATTCTTTGTGGATAGTTCCATGCTTGCGGTGCCCTTTGCGGGCGGCGGCAGCTATTGTGTGCCGAATTTTTTGAAATGCAATTGCCGGTCATGATGGCCGGCGGACAGGATGACGAGATGGCTGGTAGCGTAAACAAGGTAATTCTGATCGGAAACGTGGGCGCTGATCCCGAAATCCGCCGGACGCAGGATGGCCGACCCATCGCCAACCTGCGCATCGCCACCTCGGAGACCTGGCGGGACCGCAACAGCGGCGAGCGCAAGGAGAAGACCGAGTGGCACACGGTCGTCGTCTTTAACGAAGGCCTGTGCAAGGTCGTCGAGCAATATGTGAAGAAGGGCGCCAAGCTCTATATCGAAGGCCAGTTGCAGACCCGCAAATGGCAGGACCAGACCGGTAACGACCGTTACTCCACCGAAATCGTGCTGCAGGGCTTCAACTCCACGCTGACCATGCTCGATGGTCGCGGTGAGGGCGGCGGCCGTAGCGGCGGCGGCGACTTCGGCGGCGGTAATGATTATGGCAGTGGCGGCGGCTCCAGCTATGGCGGCGGTTACGACCAGCAGTCCTCCTCGCGCGGCGGTTCTTCCCGTGGTGGCAACCAGCCTTCGGGCGGATTCTCCAACGACATGGATGACGATATCCCGTTCTGATCGGACCGGGGACGAAGAAACGTGTGACAGGAGGTGAGCTTGACTGACGAAACCCTGCAGCCAAAACAGCCCGCCACCTGGCGCATCATCCTCGCTTTTTTTCTCGACTTCTGGACGGCGTTTTTCGCCGCCGGATTTCTTGTCGCGACGGTAGCGGGAGGGCGAACGCCCGAAGGTTTTGCCCTCAACGGTGCGCCCGCCTTTATCGCCTTCGTGCTGATCATTGCCTATTTTGTGGTGCTCGGGCGGTTCTTCGGTGGCACACTGTGGCAGCGCCTTTTAAAGGCGAGACGCTGACATCGGGCCTTGCCGTTTTTTTAAGAGAAGAACACTTCGACGCCCGGGGCAATTTATGCCGCGACTTTGTCGGCCAGCATCGCCACCATGACATCGGATTGCGACACGATGCCGACCATCTTACCCTTGGCGTCGATGACAGGCAGATAATGCAATCCTTCCTCCGCAAAGTGGATGATTGCTTCCTCGATCGGTGTTTCCGGCCGCACGGTTTTAACGGGCGATGTCATGATGTCCTTCACCGTATCGTTGGGGGCACTTGCGCCTGACAGGATGAGGCGAAGCCGCTGCAGGAAACCGATTGAGGGACGACCGTTTCTCCAGCTCGCCTTTTCCAGAAAGTCTGTCTGGGTGACGATGCCGACGATTTCGGCGCAGTCATTGGTGACAGGCAACGCCTTGAAATGATGGCTTTGCATCAGGGCATGCGCGTGGCGCAGGCTGTCATCAGGGGCGACGCCGACCACATCCCGGGACATGACACTTTCGCAATCGAGATGCAGCGCGCGACGCCGGTAGGAGCGCAACTCTGTCTTCCGCAGGATGGTTTCCAGCGCATCGCGGTCGATATCGATGAACTCGTCATATTCCTTCAGCACGTCGTCGAGATCGGTGGAGGAAAAGCCGATTTTCTGGATGGGTGTCGGATCGGCTGTCCCGTGCGCGGCCTTGCCAAGCTTCAGACCGTGTGGATAGGCGCGGCCGGTGGCATTGTTGTAAGCGACGGCCAGGGCAAGAAGGATCAGCGAATTTCCGGCGACCGGCCACAGCAGGAAACCATAGCCGAGGCCATGTATGGCAGGGCCGCCCAGAACGGCAGTCAGCGCCACCGCACCGCTCGGCGGATGCAGGCAGCGCAGCGCCATCATGGCGGCGATGGCCAGGCTGATGGCGAGCGCGCTTGCGAGGAACGGATCCGCAACCAGCAGCGCCACCGTCACCCCGACAAAAGCCGAGACGAAATTCCCGCACAGGATCGACCAGGGCTGCGCCAGCGGGCTGGAGGGTACGGCGAACAACAGCACGGCCGAAGCCCCCATCGGCGCGATCATGGCGGGCAGCGTGGGATCGAAACGAAGAGCGAAACTGGCGAGAATGCCAGTCAGCAGTATGCCTACGAAGGCGCCTGTGGCGGAGCGGAGCCTTTCCCTGTTGCTGACGGGTGCGGCATCGGGAATGATGCGGCGCAGCGTGGAACGCATGGAGATAACTTTCGAATCCGAAATGACACGCGTCTTTTAGCCGAAGCGCCAGCTGAATTTAAGGGAGGATTTTCCGGCATTTCAGCGGTGGGCAGGAATTTATTGCGTTGGTGTCGCGCAGGAACGGCTCAGAAAGCGAATGCGGATTTGATCCCGTCGACCACGAATTGCACCGACAGTGCAGCCAGCAGCACGCCGAGAAGCCGGGTAAGGATGGCGCGGCCGGTATTGCCGAGAAAGCGGTCCATCCGTTCCGAGACGATCAAGGCCGCGTAAACGAGAGCCATCGCGAAAGCGAGGATAAGGATGAGCACCACCATCTCGACCGTGGTCTTCATTGAGCCGGCAAGCAGAACGGTGGCGGAAATAGCACCTGGCCCGGCAATCAACGGCAGGGCCAGCGGGAAGACCGCGAGATTGTGGAGATGGTCCTTGGTAATGGCGATTTCCGAGGTCTTTTCCTTGCGCTCCTGGCGCTTTTCGAAAATCATTTCAAAGGCGATCCAGAACAGCAGAAGGCCGCCGGCGATGCGGAAGGCGCCCAGCGAAATACCGAGCAGACTGAGGATTGCGAGGCCGAAGAGGGCAAAGACGGCAAGGATACCGAAAGCGATGATGGAGCCGCGCAACGCCACCTGGCTGCGCTGGTCACGGGTCATTCCGGCAGTCAGCGCCAGAAAGACGGGCGCAAGCCCAGGCGGATCGAGTGTGACCAGAAGTGTGGTAAGCGCATTGATGAGCGTTTCGCTGCTGGCCATTCTTTCCCCGTTGGTTTTGTTTTTAGGCCGGAGCCTACGCAAGTCGCGGGCACTTTGAAAGAGAGCGCCACGCATTTGGGCGGCTAAATCGTGAAAAGCGGCGTTCGTCTTGCAAATGACGCAAAAGCCTGTTCAAAACTCGCCGGGAAATTGGCTTAACGTGGCTGTTTCCGCTATAAAAATCCATCCGATTCTTAACAGAGAATGTGATCCGTTTTGACTGACCAGAGCCCCCCCGGCGGCGGAAAGCTTCCGCCAGGCATTGAACCGATTTCCATCATGGAGGAAATGCAGCGGTCGTATCTCGATTACGCCATGAGCGTTATCGTGTCCCGCGCGCTTCCCGATGTGCGAGACGGCCTGAAGCCGGTTCATCGCCGTATTCTGTACGGCATGTCCGAACTCGGCATCGACTGGAACAAGAAATACGTCAAATGCGCCCGCGTAACCGGGGACGTGATGGGTAAATTCCATCCGCACGGCAACTCGGCAATCTATGACGCGCTGGCGCGTATGGCGCAGGACTGGTCGCTCCGCCTGCCGCTGATCGACGGCCAGGGCAACTTCGGCTCCATCGACGGTGATCCGCCGGCCGCTGAACGTTATACCGAATGCCGTCTGGATAAGGCCGCGCATTCGCTGCTTGACGATCTCGACAAGGAAACGGTCGATTTCCGTGACAACTATGACGGCACCTTGCAGGAGCCGGTCGTCATTCCGGCCAAGTTCCCGAACCTGCTGGTCAACGGGGCAGGCGGCATCGCGGTCGGCATGGCCACGAACATCCCGCCGCATAACCTGTCAGAGGTTATCGATGGCTGTATCGCCCTGATCGACAATCCGGCGATCGAGCTGCCGGAACTGATGCAGATCATTCCGGGCCCTGATTTCCCGACCGGTGCGCTGATCATGGGCCGTTCCGGCATCCGCTCGGCTTACGAAACGGGCCGCGGATCGGTCATCATGCGTGGCCGCGCGACAATCGAGCCGATGCGTGGCGACCGCGAGCAGATCATCATCACCGAAGTTCCCTATCAGGTGAACAAGGCGTCGATGATCGAAAAAATGGCTGAGTTGGTAAAGGAAAAGCGCATCGAGGGCATTTCCGACCTGCGCGACGAATCCGACCGCCAGGGTTATCGCGTCGTCATCGAATTGAAGCGCGACGCCAATGCAGACGTCATCCTGAACCAGCTTTACCGCTACACACCGCTGCAGACCTCGTTTGGCTGCAACATGGTGGCGCTGAACGGCGGCAAGCCCGAGCAGATGACGCTGCTCGACATGTTGCGTGCATTCGTGTCCTTCCGCGAAGATGTCGTCAGCCGTCGCACGAAGTATCTGCTGCGCAAGGCGCGCGAGCGGGCGCATGTGTTGGTCGGTCTGGCGATTTCCGTCGCAAATATCGATGAGGTCATCCGCGTCATCCGGCATGCTCCCGATCCGGCTTCGGCCCGAGAAGAACTGATGACGCGTCGCTGGCCTGCACAGGATGTGGAAAGCCTGATCCGCCTTATCGATGATCCGCGCCACCGCATCAACGAAGACGGGACTTACAACCTTTCCGAGGAACAGGCGCGCGCCATTCTCGAATTGCGCCTCGCACGTCTTACAGCGCTCGGCCGCGATGAAATTGGCGACGAGCTGAACAAGATCGGCGCTGAGATCAGCGAATATCTGGAAATCCTGTCATCGCGCCTGCGCATCATGCAGATCGTCAAGGACGAGCTTTCCGCCGTCCGCGACGAATTCGGCACGCCGCGTCGCTCCGAGATCGTCGAGGGCGGTCCCGATATGGACGATGAAGACCTCATCGCCCGCGAAGACATGGTCGTGACCGTTTCGCATCTCGGTTACATCAAGCGCGTGCCGCTGACGACGTATCGTGCACAGCGTCGCGGCGGCAAGGGTCGCTCCGGCATGGCAACGCGCGATGAGGATTTCGTCAACCGTCTGTTCGTCGCCAACACCCACACGCCGGTCCTGTTCTTCTCCTCGCGTGGCATCGTCTACAAGGAAAAGGTCTGGCGTCTGCCGATCGGCACGCCGCAGTCCAAGGGCAAGGCCCTTATCAACATGCTGCCGCTGGAACCCGGCGAGCGCATCACCACCATCATGCCGTTGCCCGAGGACGAAACGACCTGGGAAACGCTTGACGTGATGTTCTCGACGACGCGCGGCACGGTTCGCCGCAACAAGCTCGGCGATTTCGTACAGGTCAACCGCAACGGCAAGATCGCCATGAAGCTGGACGAGGAGGGCGATGAAATCCTCTCCGTCGAAACCTGTACGGACCGCGACGATGTGCTGCTGACGACCGCGCTCGGCCAGTGCATCCGCTTCCCCGTGGACGATGTACGCGTCTTTGCCGGCCGCAATTCCGTCGGCGTGCGCGGCATCAACCTGGCCGAAGGCGACCGCATCATCTCCATGACCATTGTGGGCCATGTGGAAGCGGAGCCGTGGGAGCGTGCGGCTTACCTCAAGCGCTCGGCTGCCGAACGGCGTGCCGCAGGCGTGGATGAGGACGATATCGCGCTCGTCGGTGAGGAAGTAACGGAAGAGGGCGAACTCAGTGAGGAGCGCTACCAGGAACTGAAGGCGCGCGAAGAATTCGTGCTGACGGTTTCCGTGAAGGGCTTCGGCAAGCGTTCATCGTCTTACGATTTCCGCACTTCAGGCCGCGGCGGCAAGGGCATCCGCGCCACCGATACCGCGAAGACGTCGGAGATCGGTGAACTTGTTGCGGCCTTCCCGGTGGAAGAAGGCGACCAGATCATGCTGGTTTCCGATGGCGGACAGCTCATCCGTGTGCCTGTGAACGGCATCCGCATCGCCAGCCGCGCGACCAAGGGTGTGACAATCTTCTCGACCGCGAAGGATGAGAAGGTGGTTTCCGTGGAGCGCATCAACGAGCCTGAAGGCGACGACGAGGCTGAAAATGGCAATGGCGAGGAGGCGGATGACAATCTGCCGACCACGCCGGAGGCACCGGAAAGCGAGGCGTAAGCCGCCTCGTTTTTCACATCAGGATCAACAAAAACCCCGCGGTGCCTGTGCGCCGCGGGGTTTTTCTTTGGGTTTCCGTCTGGTGCGGTCTGTTAGCGAATACGTATATGAACCGGTGAGCGCCCCGACCGACGGGCGACGTGCAGGTGAATGCTGGCTTCCGGCTGGTTGGCGCGCCAGGCACGGGCGCCGTGCGAGAGCAGCAACGCCACGAGATCACGCGTTTTTGCCTTGGAACGGTTGAGACCGAGATCGGCGATACGCATCGCCGCCTCATGAATGGGATGCAGAACCGACAGGCTGCCATGCCCGTTGCCAGACGCAATGTCCGAATAATTCTCGTTGACTGCCATCTTGGAAAGCCTCGCTACTCATTACACAATTCGAGGAGACAGGCGATGTTCGAAACAGAGAGCACCGCCTGCGGGAACGTCTTCACTGAAGGGATGCCCAGCAGCTGACGCCTTTCTTTTTCAATGCATTGCATGCATTCACCGCGGCGTTCTGGCCATCGAAGCCGCCGAAACGGGCGCGGTATATCTGGGAGCCATCCTTGCTGTAGGCCACTGTGAAGGGCTTGGCTGAACGCAGCGCCGCACCACCTTTTTCCTGGGCGCTCTGCAGCAGGCCGCGCGCGGAATTTTCGTCCGGCGATGCGCCAATCTGGATGACCCAGCCGCCCTGCGGCGCGTCAGACCTTGCCGCAACGGAAGGTACGGAGGCTGCGGAAGCCGAGGTGGAAGCCGTGACGATATTATCGACCTTGGTGGAATTGGTTATCAGCTTGCGATCGGGCGCGGAGGGCGCTTGACGGATGGGGTCTTTACCCTTCTGCCAGGTAGCCGCTTCCATTGCCTTGACGGCTGCATTGGAAGAGGAGCCAGCAAAGGCGGTAACCGGCGCTTCCTCGTAACGCGTCTGCGGAACGGGTCCGGCATTGGGCAGGCCGGCATCCGTGGCGGCGACGGCGACAGAGGGAACGGCGGCGGCCACAGCGACCGGCTCTTCGACCGGGGCGGAACGCGTCTGTGCAATCAGGTTGCTGTTGCCGCCACGCGACGCCTGCGGCAGATAGGTTGCAACCAGTTTGCGCATGGTGGCGTCACGGGCGGCGCTGGAGCGACCGCCGAGAACGACGGCGACAATGGAGCGCCCGTCGAGCTGCGCGGAGGTGGCAAGATTGCTGCCGGCTGCACGCGTGTAACCCGTCTTGATACCGTCGACGCCCTTCACGGTTCCGACGAGGCGATTGTGATTGCCGATTACCTGCTTGCCGAAATTAAAAGTGCGGGTGGAGAAGTAACCGTAATATTGCGGAAAATGCTGGCGAAGCGCGATGCCGAGGCGGGCTTGGTCGCGCGCCGTGGTCATCTGGGCTGTATTGGGCAACCCGTTGGCATTGCGGTAGGTCGTGCGCGTCATGCCCAGCGCACGGGCCTTTGCGGTCATCATTCTGGCGAAACGGTCTTCTGAGCCGCCGAGATATTCACCGAGCGCCGTCGCCATGTCGTTGGCCGACCGGGTAACGAGAGCAAGGATCGCCTGCTCGACCGTAATGCTGCCGCCAGCGCGCACACCGAGTTTGGACGGCGGCTCCTTGGCTGCATTCGCCGAAACCGGCACCTTGGAATCAAGAGAGATACGTCCGGCACTGAGGGCTTCGAACGTCAGATAAAGCGTCATCATCTTTGTGAGCGAAGCCGGGTAACGCAGCCCGTCGGGATCCTCGCCGTAAAGCACCTTGCCGGTCTTCGCATCGATGACGATGCCCGCATATTTCGGATCGGCCTTTGCGGTCTGGACGGATGTTGCGGTTATGAACGCCGCCGCAAGTGTCACCGCCACAAACCTTACAAAAGACCCTGTACCCGCTTTGCGCGAATGTGCCCAAGATAGACTTTTCAACACTGCTGAACTCTTCAATCGCATATTTATGTTCCGCTCGGACACGCCCCGCCCGAATGCTCCGGGTGGAAACTTTATCGGGATGGCGTTACCAAGTGGTTTATGAAAGGTATCCGCCTTCCATAAAATGCGACTTTTCACACGCGATCTGTCTCGTGCCGTGTCACGGGTTTTCGTCTTTTACGAGGCGTTTCTCGACATAGCGTGTCACGGCCGCGTCTAGCGCCTGCCATTGCGGCAGCAGGTTGGCCGAGAATCGGTAGAACAGGCTCAGGTCCTGGCCGACGTGGATATCTCTCTGACAATCGCTACCCGTTGTTTCCTGAGGCGTCTGCGGCAGCAGGCAGCGCACCACATAGACGCTTTCGCCGTCGCGTCGGCCGGTCAATATGACTTCCTTGCCATAACCGGAATCAGCGCGCAGCCGATGGAGCAGCAGACCGTGCTTCAGGGGCGTCGGCTCACCTTCTATAAGTCTGGTATAAATCGGCTCGAAACGTCCTGACATGTCCTGCGACATGGTGCTTTGCGACAGCTGAACGAAAATCAGCCCGACTGCATTTTTCGGATCGTCGAAAAGGGCACGATTCGAATCTCGATACCCCGCAAGGTCGGGCCAGGTGAGGTAGAGGTCGGCGCGCTCATGTGCACCACTGTCACGATCAGACGGATTCCGAAGGGTGTTTTTTGCGAGTTTCAGCCGGTCGTTGCCGATGGTGATCTCCACTTCGTCGGTCGACGTGGTGTGTCCCGCCTGAATAATTCTGTCGCCATACCATTTCATGCCGAAATTGAGCGCAGTGAGAAGGGCGGCAAGGGCTCCGCATATGACGATGACGCGTCTTATGAGAGAAGAGGAAAGGAAGCCTGTGTCGCTATGCATGAGCTTTCAAACTCGTTCTGTCCGCCGGGTTTCGGCGCGGAAATAACAGGCAAAGCTTTAACCCGCGGCGATCGCCATTGAAATGATGTTTTGCCGCTGCTACCTGATAAGCGCAAGCAAAGGACGGCATATGGCAGCGCGTGACAGATATTCCCGGAAACTCGAATGTTCCAAATGTGGCCATCAGGGTTTTGCCGAGGCGTCCGAGTCGGATGACAAAACCCGCCGGGACGTCGATTTCCGCATTGACGAAATGCCTCGGGGATTTCGCGCAGAACGCCCGTCGGGCGACCCAACCGACTTCATGATCCGCTGCGCCCAATGCGGCAACATATTCCGCTTCCTGCAGAAAACCGCCTATGCGCCGGGAGGCGAGCCGCGCGTCAAGGCCTGAACGGAAGGTGCCGGCATGACTTTACCGCAACAGCCCAACGGAGCGCCGCAAAGGCGCAAACCAATCTGGCCCTGGATCGTTCTGCTGGTGCTCACGCTCTGCGCGATCTATTCTTATAACCGCGCCAACGAAATCATCGCCGAGTTGTCCGATACCTTACCGCCGGCGCTCCTGAACCTCTTCGAGGACCTTCTCCGGGGGGAAGGCCGGCACGGCCATGGCGGGCCCAGCATCGGCGTTTGAGTGAGACGCCCGCCGTAAAGGGTCGCCCATCCTCCGGCCGCACATCATGTCATGACCATAAAAGAAAAAAGCCCCGCAAAGAGGCGGGGCAGGTTGATCGCTCGGGGAAAGTTGGGCAATGCCCGGGAGCGATGAGTGGGATATAAGAGACAATGATGCGGACGCGATAGGCCACTTCCGTCCCAAACAGTACGCCAATGCACAAATTTCACTTAATCCGCAGATTTTTGAACGATGAATTTGACGGATCTGTAAGACAACTTGACAGGCTTTCGCTTTATTCTTTGCTTGCCTCGAATTGCACATATTCTCCTTTAACGTAACGTGACATGAGCAGACGCATGGATAAATCCGGAATCCTCTTTGTGGCGCTTTTGTTGGTGCTTGTTGCAATCGTTTTGAGCCTCACGCTGTTCGATACAGCGCAGCAGGTCCCTTCGATACCTTCAAGTGACGGCTCCCTGCTGCCGCCCGCAGCGACCCCGGGGCAATGATCCGCCAACTGTGAGAGAAGGACGACGATGCTCGTCGGAGCGGGATTGACGCCCTTTTTCAAACCGCTATAAAGCCTCACCACCATGGATGGGTGTCCGAGTGGTTTAAGGAACCGGTCTTGAAAACCGGCGTGCGTGAGAGCGTACCGTGGGTTCGAATCCCACCCCATCTGCCATTTCTCAACAAAATCAATGGCTTGCGACCGTTTTTGTCCCGTTGCCATACCTTGTGACTGAGCGAGGTCGATCAAAAAGGAATCTCGCCCGCTTCGATATCGATATTCAAGCGCTCGACGAAGCTTCCCTCGGAACTCACGATCTCGACTGTTAAGACCCCTTTTAGCTGATATTTCTCTTGCGGCTCTTCGGTTCCTTGGCCCCATTTCGTCCAAACTGTCTTTTTCGCCTCAACCGCGAGCTGAGCCCACGCTCCGGGTGAGAGTCTTGGCACAGGCGGTTTGATGAAGTGCCTGAGTTGGTCGTTGCTGATTGTTGATTCGCTGAAAGGACATAGCTCTCCATTTTGGCTAAAAGACCATCCCTTGCCAGTGTGGAGATAGATAACCTCAATGTCGGGCGTCCGGCGCTGAGTGTGATTATGAATGTCAATTTTGAAGATGGCCTCAGCTGTCGCAGAGTATTTGGTTTTTGAAAGATCAGCGAACAAGCGTGCAAGAGACACGTGAAATGTGGTAGAGCGACGCTTCTCGAGTTCGGATTTTTGCCATGCGATCTCGTTCAGGAGCTGTTCTTTGAGTTTCTGTATTTTTCCTTCGTAGACAAGATGTCTGTGATGTTTCAGATCAAATGGAATGTCTTCTGTACGTTGGGTAAGCAGTGTACACGGCTTTCCCTTAGCGTGCGCGTAACCAATTTCATAAAAAACGTTAGGGTTTTTTCCGGTAAGATCGGCGATGATGAAGTCCGCTGCATCGATCTGACGATAAATCCGTTCGAGCATCGACTCGCTGTAAATTTGTTCATCGACGCGCTCTGCGACCACCTCAGACTCGATTGCCGCAGCCTGAATACCGAGTTTGTAGATGTCGTCAAATTCGGAGGAAAAAGGCATCAAAACAAAAGCAAAAGGCTTGATGGTCATACCGGCTCCTGTTTGGCCTCATGGGAGGGCACTAAAGTGCATAACATAGAAAGTTGAGACTTTACTTTGTTCTCGAAAAACGCGAAACCCTGCAGCTCTCTGGAGCCACAGGGTTTTCGTTTAAGCTGCTGCCTTATATAGCGCCTTCAGTTCGTCCAAGTCTTCAGCGAGCATGCCGCTATAGGCCGTTGCCGCCGCTTCCGCCAAAGCTTTGGCGAAGGGCGAGTTTCGCACCCATTGGAGCGTTGACTGCCCTTCTTTGGGTTTGACGTGCGCCACAGCGAACCCGTGTTCGTGGTCGTGGGTCAAGACTACATCGCGGATCGTCATCTGCATATCAGGCAAGAAGAATTGAACCTTTGCCAACTTGCGGAAACCCGTGTCACGGTCATAGCCCTTACTGATAGGCTCAATGCTGATAGTCTTTACCTTCACGGCTTTCCCTCCTCTGCTCGAAATGTAATGAAATCCCCTTTCTCCGAAGGCGTCATTAGCTTGATGTTGTATCTGCGGTTATCGCGGGTATCGCGCATGAACCATGTCGTGTTGACGCCCTTTGACTTCGGCTGCATTCGAATGGTGACTTCCACCGTCCTAGTGCCGCTGACCACGCCGTTAAGGTAAATCTCGTCACCATTCAGGACGCGGAACCGGCCAGCCGTGGTGAATTGCAAAGCTGGCGGGCCAGCGCCGGGAATGACGGTCCCGAAGCCGTCATTCGAGTCAGTTGGTTTCCAGAAGCCGAAGGAACGGCGAAGATCGCCGCCCCCGGTCACTGCTGCACCTCCGATGCCTGGGCATTTTGTGACACGCCTAGACGGGCCATGTTCAACGGCTCCATGTACGTATCGCCGCCCGGAACCTTGCTCATGTTCTCGAAGCCGCGAATTTCATCAACCGACAGGAATCCAGCCTCACGGCCAATCTTGTAAGCAGCGTAGCGGGTAGCCAAATCGCCACGAAGCAAGCCGGACAGGTCATGCTCGATGTAGTGCGTTTTTCGAGCCTCCGGCGAAAGCAGCGTCGTGTTGTAGACGCTTTCGATACGCTTGGCCCACGGCGCAAGGACTCGAGTCACAAGGGCGCGACTTTCCTCGCCAATGTTGCTGTAGGTCGCATCGTCGGTAATGCCCACGGCAGACGGCGGAACGCCATAGACGCGGCAGATATCCAGATTTGACAGCTTGCGGCTTTCGAGGAATTCGGAGTCTCGGCTGTTAAACTGGAAAGTTTCGAACTTAGCGCCGCCATCCAGCACCATGACTTCGTTCGCCTTGAGTTGCCCGACAAATCGATCCTTGAATTTCTTGATTGCCTCGTCTTTACCAGCGCCGCCGAGCTTTTCAGGAAACACCAGAGCGCCAGCCGGTCGGAATGCATTCTCAGCCGCTGTGCCTGCCGTGTCCTGTTGAGCCAAGGCAAGGCCGAATGTGGCGCTTGCGATCTGGATAGGCGACAGACCTAGAACGCCGTCTTTCGTCCGGTAGCGCACGTGCAAGATTTCGTCTTGCGTGTAGGTTTCCGTTCCACCGTTCGCCAAAGCGACCTTGTAGCGAAGGCGTCCGGTTGACAGTCGTTCCACCGTGACCGAAGGCGACGGAATTGGATGCAGCGCGGTAATCTGGCTGCGGCCATTCCGTTCAATCCGGGCATAAGCATTGCCGTACATCAGCGCCGAAACATTCATCCATTCCCGGCCCTCAAATGCGGTCAGGAGCGGAGAGAACGTATCTTTCAGCACTGGGTAGAGCGCATGATCAGAAGCCGCTTCCCGGCCACCTTCATCCGTTCTACGGTAGACCTTCAGCGGCACGGCTGCCAGTTGTTCCGCAATTAGCTGGATACACCGATGGGCAACGGCATGGCCGCTGGCCTTCTCAATGTCGGCACGGGCCTGCCAACGCGCACCAAGGAATTTGCCTAGGAACGGATCGCTTGAAGAGACAGCGCGCGTTTCCTTTGTCTTAAACGGCCACATATGCACCTCCTTCCAATTCCAAGATTCGAATGCGGCGCTCCGCGTCTGTCATTGGCTTGCGCGACCGGACAGCAAGAGACGTGCCAGAATATGCCGGGAACGCCTGCACAACGCTTATTTCGCGGAGGTCAATCGCTTTCAGGGTCCGTCGTTCGCCGTGCCATTCGTCGCCGCCTTCAGGCACATTGAAGCCGAAGGACATACCGCCGATATCATGGCGGGCAGCAAGCGAAGCGATATCGCGGCCCAACTGTGTGTCTGGCAAATCTAGTTCGAAACGGAGTCCTTTCTGATCCTCTTCGAGGATCAGGCTGCCCGACGCGCTACGGCCCAACACCTTGCCGGGGTCATGATCGACAAGCGCCAGAATGTCCGGGTTCGAACGAAGCGAAGCACCAAACGCACCAGCCTGAATCACTTCGCTGAAATCGCCAATGCGGGTTTCAAGGTCGAATGTCGCGACGTAGCCCGTCAGTTTCTTTCCTACGGCCTTAACGTCGGTTGCCGCACGCTTTTCAAAGTTGCTCAAAACGATACCTCCCTGAATGGCTGCACAAGGGCGTCAACGCCGAATGCGATTGCTTGTGGTGGTTTCTCCGCTGCGGCCTCGCGAAAGGCAAACCAATGCCCGACTAGCAAAAGCGCCGCATGTTTGACGGGGGGTGAGGCCAGCTTGTCGGTAGGTACGCCGATTTCTGAAATGTAGCCTTCCGCCGCGTTGATGAGTGTCGAGACGTAGGCATCATCCTCCGGGTAATCTACCCGGAGGTGAGCCTTTGCCTCTGCGAGAGAGACAGACGCCATATTAGGCGACGGCCTTCCATGCGAACGCTTCGGGATGGCGGATAGCCACGTCAGCATCAAGGAACGCATGCAGGCGCAGACCGCCCTTAGAAGCGTCGGTGTAAGGGTTGGCGAGAATGTCCACGCCGCTCCAGTAACCAATCATGAGGTTGGCCCATGCGCCGAAGATCAGCGGGTTTTCGCCGCCAATGGTCGGAACCTGATTCGTGGAAACAACGCGCTCATTGTGGAACGTGTCAGCCGTGGAAATCGGGCGCTGCTGGCCATCCTTCAGCTTGCGGGCAACGCCCATCAGCGCCGGGTTGGTCAAGAAACCCGTGGTGCCGGTCACGTCGTCAATCTGGAGTGCTGCGATAAGGTCCGCTGCAATGTCCGTCAGGTCCGTCGCAGTGGTCGCGCTTTCTGTGATTTGCGTCAGGATGCCAACGGGCTGCTTTGCGGCGGCGGTGCCGTTGATCGCAGCGGAGTCGAGAGCCTGCGCAAGAACGAAAGCCAAGTCCTGACGAAGCACGTTTTCGAGCGCAACGCCGTTCTGAAGCAGGAGACGGCGGGAAAGGTACATTTCGCCGGAAACCGTCTTCGGAGACAGCGAAACCTTGTCGAAAGTCGCATCGCTCGCCGTGGTGGCTTCATCTTCGTTCACCCAGTAGGCCTGCGGGCCAGAGGTGAGGCGCGGCAGGTCGAGATTGCCGGTCAGGCCGGAAATGACCGTTGCGCCGAGAGACTGCACAGCCAGAACCGGGCGAAGGCGATCAATGAGGCCGCCGAGGTTCGTTGCGACAGTATTGCCAGCAGTGCCGGTCGTGAGCATCGCGCGGTTTTCGTCGCCGAAGATGAGAGACGTAGGGACCATCACGCCGCGCACTTCGCGACCCTTGGACAGTTCGTCATGGACTTCGCGTTCAACGCCGGTCAGGCTGTCGCCATTGCCCTCGCGGATGGCCTTGGAAACGGAATAGGAACGCAGTTCGCGGGCCATGGCGTCACCCTGCGGTGAGGCTTCGTGGCGCTCGAATTCTGCTATGGTTGCTGCATTCTTGATCTGGCTATCAAGGGCGCGAATTTCGCCTTCGATTGCTGTAAACTTCGCGTTGTCCGGGATCTCGCCGAGGGCCTTAAGTTCGTTGAGCTTTGTGGAACGGGTTTCGCGAAGATGATGAATGTTCAAATAAGTCTCCTTTTCTAGTAAGCGCGCAAAGAGCCATGCCGATGCGGAATTGCTCGGTTGGGTAGGCGCAACAATGTCATGGTTGTTTTACTGCCTTCGGCAGTGCTGCCCGTCCCTCAGTCGGGACAGGCTCGCCGGTTCAAGTCTTTTGCTTGCGTCCGGTGATTGTGGTTATACTCCGTTTTGCTTTCGCATCAACAGTAAATCGTAAAAAGCAAGGAAGTATATTTTCGCGCTGTTTGCTTGACAAAACAGCGGGCACGGATTTTTTATTAGCTGGTTTCCCACCAACGGTTCATAGTTGCTGAGCGCGCTACAGCTGCACTCTCTAACTCTGCGTTTCCCTTCCGTAGCGCTTGAACCTTCCGGTTAGCATGTGGCGGACCTCCCATGGAAAGTCATATAGATCACTCGCTGTCGGATCGTTCTCACGTCGTTCGAACAAGACTAGAAAGTCGTCATCATCCCCAAGATGGATCGGCCAAGCTTTCTCGTATCCCGGCAATTTTGTGTACGGGTTCGCCATTTTGCCTCCTTTTAAGCATCCTCCATAAGACACGCTGGAAGCGAATTATCTTCCTCATGTCGCGCGGCAGTTTGCAATGCCATCGCGAGCGCCACAAGCCCGTCAATGCGCCCGGAGGCTTTCGACTTATCAAGCTTTCGAGCGCCTGATGGGTCTTTCGTGACGACAGCGTTCGCCGCGCACATCCGAAGCAACGGGTTTCCGGCGTGGTTGACCTTCTGTTGCGCGACCGCCACCTCCAACATGTCAACGGCTGGAGACATGTCTTTGTAGCCTTGCCCGAACGGCGTTAGCGGCAACTCAATCGAAGCTTTTGCCAGTTCGCGCCGTAAGTCCTCAATGCGCCAGCGGTCGAATGCAATCTCCTTGATATCGAACCGGCCCGCCTCGTCAGCGATGTATTCCGCAACGGCCTGCGGATCGATGACCTTTCCCGGCAACAGCGTTAGCCTCGCATCCGCCTGCCGCGCCCAAACGTTGTACGGCACGCGGTCATTCTCCGATTTGCCGTCAATATCGAACTGAGGCAGGAAGAACCGAGGCAGGACAGTGAAGCGGCCATCGTCTTCAGGGAAAACCAGCACGAACGCCGTCAAGTCTCGCGCCGCTGACAGGTCAAGAGCGCCATAGCATTCCCGGCCTTCAAGCGCGGCTTCGTCAATAGGCTCTAGATCGCAGTCGTTCCACTCGCGCGCCGCGATGAAGCGTACCGTGCCGTCAATTCTCTGGTTGAGAATCTTGTTGCGGAAGTCCGCTTCCTTTGACGGGATTCGCTGCGCCTGTGCCGCCATGCGTTCAACCTGTTCCAAGGCAAGGAAGTCGCCTAGAGCCGGGTTTGCTTTGATCCACGTATCATAGGACCATGCATCCTCATCCGGGTCAGTGGTGAACAATGCCAGATGGAAGCTTTCGTCTTCGACTTCGCCGGTCTTCACCTTCAGGCCGTAATCAATCATTTCGGAGAAGAAATGCGTGTCGTCTTTCGCCTGTGTGGAAATGACCACAACAAGCGGTTCATCACGCGCGCCAAGGGCCGAATCCATCGCGTCGAACAGATCGCGCTTCACCCAATAACCGGCTTCGTCAGCGAGGAAGAACGATGGCGACAACCCCAACTTTGAATCGGCGTCTGCGCTCACAGCCTTAAGAACGGAGCCTTCACCGGGATAACCGGCCTCGACTTCAATCTCTTTGCTGAATTTGATGATGTTGACACGCTCTGAAAGTTCGACATGGGCCTCCAGCATGGCTTTGCACTCGGCCCACGCTTTACCAGCCTGAATCTTATCCATCGCGCCGAAATACAATTCGCCGCGCTGTTCGGCTTCAGGGCCTACCAGATGGCAAAGGGCAAGCGCCGCGCTCAGTCCTGTCTTTCCGTTCTTGCGGCCCATAGATAGGACGGCGGTTCGTACAGGGCGACGGCCATACTCGTCAGTGGCGTAAATCGGCGCAAGGAATTCTTCGATTTGCCATTCACGCAACTGCATGTTCTGGCCCGCAAGTTTGCCCTGCGTAATTTTCATGTCGTTGACGAATGCGACAACGGCTTCAAGCCGGGTGAGGCCTTCGACCTCCCAAGGAAGCACCTTGCGGTGCTTCGGCTGCCCGCCACTCATGATATCCCCGGTCTTGCCGGATGCTTTCGGCTTTGCGCCGGGTCCACGACGGCCCATAATTGCCTCCTTTCGTTGTTCATTTTTTCATGACGATTTCCTTTTCGTCGTTCAATATTTGAAACTTAGTCTTTGCGAAGGTACCCCACCGGTCCTTAGACGTAGGACTCCCGTCTTTCGATGCCCCCCTACCTTTGGAACGGGGCAGGGAAGCCAGTCAGGTGTCGTTCCAGCCTTCCGGGTCAATCGGGTTGCCATCCACGTCAAAGCCAGCCCATGCCCGCCTGAAGCCGCTGCTCCTGCCTGTCTTAGCGGTGCGTCGGTCCTTCGCGTTGGTCTTGCGGTTATGACAGGATGCGCACATGCTGGTCAGGCCGGACAGGGGAGGGAACGCGTCACCTCCCGCATTGATAGCCTTGTCATGGTCTACGACTTCCGCAATCTCAATGACGCCACGCGCACGGCACGGCTCGCAGACTGGGCATTCCATCAGCTTGGCTATGCGCAGCCGTTGCCACTGAGCCGTGTTATACGGCCATCTGCTCATGAGGGGCGGGCAGCCGAGGCCAAGGGCCGAGCCTTAACAACCTGCGGCTTGACCGGGATAGGCTGTTTCCGCACCGGATCGATACGGCTCAGACGTGCAAACACCTGCCGGTTTTGGTGCATCAGTTTCATTGCACTGCCACCCATTCACCGTGTCCAACGGCTTTAGCCAGACCATGGCGCTTCAGGCGCAGCAGCACCTCGTTGACCTCGTGACGCCCGTAACCCGTCAGCCGTGACAAGATCATAGGCGCTGCCGGTATTGACCGCTTCCGTAGGAGGGCAAGAACCTTTTCGTCTGACTCCGGCAATCCACTCCATGCGACCGGCTTGCCTACCAGCGCGTTGATGACCTGCATATTTTCGGTGTGTGTCCTTTCGTTCATTCGTTTCTCCTTTCGTCTATATCATAGGTACAAACGGTAGTTACGGTAGTTTGCACAAACCGAAATACCCTTACTACCGATACTATTTAATGTGATGCTAGAAGGACTTAGCTAGACTGCCCTCATATTAGCAATTGCGAAACTACCGTAACTACCGTTCCTACCGAATGGTATGGTAGTTTCATTGTCATGAGCTTTTCGGCTCTAGGCCGCGAAGAACCCATTTGCCGCGCCCGATCCTCGTAACCTTCCGGCTGGCACTCAACCGTTTCAGGCTCTTCGAAACGTCAAGCGGCTTTTCGTGGATGATTGCCGCCAGTTCGCTAGGGCCAATCGGATTGCCATTCATGAATCCAAGCTGCTGCAATATCTTGCCGGTCACGTCGCCGTAGCCTTGGCGGTCGCTTTCGTCGCTTTCGGTGTCGATCTCCCAAACACAAGCATCGGCGTCGAACCGCACCCTGAAGTCAAATTCCTGAAAGTCGCGCCCACGGCCATAAAGCCCCAAATCGCCGTCTTCATTTGGGACAAGCAAAATCGTCCCGTCAGCCGCACCCGAAATGCCCCCCGTGCCACTCACACGGTCGAAAGGATCGACCGCGCTGGCGCTACCCTTGTTCGTATGGTGAACGATGACGATGCACACCCGATACTTGTTCGCCAGCTTCGTCAAAGGTCGAACGTCGCCATAATCACGCTCATACGGATCAACCTTGCCCTTGCGCGGTTCTCGAAACATTTTGAGAACGTCCACAATTACCAAGGATGCATCGGGGTGAGCTATCAGCCATTCCTCCAACTCCTTCAGGCCGCCCTTTTCCGCCGTGGGTATCTGGATTTGGAAATCCAGCGCATCGGTGAATGATTCCGCGCCTTCGATTTCCTGCTTTCCCAAGCGGTCCTGAAGGCGAATGAATCCGTCTTCCAGCGCCAAATACAGAACGTCACCTTGTTCGGTTCGATGCCCCATGAACGGAATTCCGGCTTCCACGCAACGGGCCAATTGTAGAGAGAGCCAAGATTTCCCTTGCTTCGGAGGACCGACAAGAAGAAGGCAACCGGCAGGGAACAAGCCCTCCACAATCTGGCGCGGCCTCTCAAATCGCCGCCGCATCAACTGCTTTGCTGTCAGTCTGACTTCTTCGGACTCATCTTCCGGTTCCGGCTTTGGCTTTGATTTCCGCGCTGGCGCGCTAAAGGCTGCCCGCCGTCTACCAATCTTCTCGAAATTGAACTTCCCGCCCCAACCGGGGGACTCGTCTTCTGCATGCGGTTCCATCTTTAATGCGTGGTGCGCGATGCAATGCGTTCATCAAGCCAAGCCGTCACTTCGGAACGAACGTAAGCGATGCGGCGCGCCGATAGTTGGACCGGCTTCGGGAAATGGCCGGTCGCGCTTAAAATACCAAGCTGCATCGCGGAAAATGTGGTTTCGGCGGCAGCCTCCTTTGGCGACATGAGGCGCGGTGGAGATTTTTCGGACATGGATGCTCCTTTGCTGTTTCGTAAGACATTCCTTTCGTCAGCGATGACAGGGCGGGCAGCTGCAAGCCCGTGAGGGCGCGCAGAAATCCAGCCGGTCAAGCCGTGACAAGGTTCAAATTGTTAGGATTTGTTAGGGTCGGCGCATAATGCAGGCAACGTTCTCGACGTGCAGGGCTTCCGCGCTTCAGGTAACAATCAGTCTCCTACACGCAAGATTATTGCGCGTCAACTCCCTTTTTGCACATTTTACGAAAAGGGGCCTCATCGCGCTATTAATCAAGATAAGCACTCCAATCGTCCATCAACTTGCGCCGTTTCGCCAGCGCATCGGACCGACGATAAGCGGCCTCCGTCTTGTCTTTGAGCGTGTGAGCAAGCGCTGTTTCAATGACCTCGCGGGGGTGATGCGTTTCGTCACCGGCCCAGTCTCGGAACGAGGATCGCAAGCCGTGCAATGTCTCTGTGCCGCCCGTGGCTGCCCGGAGGGCTTTCACCATGGCCGTGTCTGATATTGCCTTGCCTTCGCTCTCACCTTCGAAAACAAGCACGCCTGTGGCCATTTCCTGCCGCTCTTTGAGGATTTCCAAGGCACGGGCCGATAGCGGGACTCTGTGCTCCTTGCCTGCCTTCATCCGTTCCGCTGGAATGATCCAAAGCGCGTTCTCAAGATCGATTTCAGACCAAACCGCGCCCCGCGCTTCGCCAGAGCGGGCCGCAGTCAGGCAAGCGAATTCCGCCGCAACAGAGGACACACCTTTAGCCGCCCGCAGCTTCTTGATCACAGCGGGCAACACTTTGTAGTCGATTGCCTCATGATGGCCGCGATATAGCTTTTGGCGGGCAGGGAGCAATTCCTTAAGGCCCCCGCGCCAGTCGGCAGGATTGTCGCCGGTATATAGCCCGCGAGCCTTGGCGTGGTCGATCACTGCTGCAATGCGCATGCGGGTCCGGTCGGCGGTTTCGGGCTTCTCTGTCCAGATGGGTTTGAGCGTCTCAACCACGTCATCACGTGTAATGTCGGCCACGGCCTTCTTGTGCAACGGTGCGGCGTACTTATCGAGCGTCATGCGCCATGCAGCTTTGTGGCCCTTGCTCTTAGACTCGGTCATCTTCTTTTGGATCACGTCTTCCATGATCGCAGCGAAGGTCGGTCGCACCGCCAGTTCCTCACCACGCGCCAGACGCTGCCGGATGCCTTCGGCTTTCTCTCTGGCAAGATCAAGGGAAACGGGCGCAGTTCCTTGGCCGTAGCCGCCAAGTCCTATCTCGGTACGCTTGCCGTCTCGCTTGTAGATGAAGAACCATTGCTTCGAGCCGCCCGCCCGTACACGTAAATAAAGCCCGTCACCGTCGCTATAGATGCCAGCGGTGGAAAGCTTCTTGATCTTCGTTTCCGTGAGCTTGTTGCGTGCCAT
Protein sequences of DBSCAN-SWA_2 >CP036358|1049176:1103571|1056600_1057977_+|QBJ12865.1|DBSCAN-SWA MRVIICGAGQVGYGIAEQLSREDNEVSVIDTAASLITAITETLDVRGYVGHGAHPDMLAKAGADQADMIIAVTLHDEINIVACEVAHALFSVPTKIARIRDQSYLKPEYADLFSRENMSIDVTISPEVEVGKMVLRRIAFPGATDVVRFADDTVYMLAIECMEDCPVINTPLQQLSSLFPDLIATVVGVYRDGFLKVAHSSEQLRVGDLAYVICQRQHARRTLSLFGHEEQEAQRIVIAGAGNIGHFVASKIEELQPKTRVKIIEADRDRAVAASEQLSHTIVMHGSALDQKILMQADIQDADLIVTLTNNDQTNILAAVMAKQLGCKSNLALLNSSSFHEVADSLGLDAYINPRAVTISRVLQHVRKGRIRSVYAVQRGSAEVIEAEALETSPLVGQSFRDVEMPEGVRIGAIYRDGVVIRPDGSTKIKAKDRVVLFASADAVRDVEQLFRVSIQYF >CP036358|1049176:1103571|1060231_1060588_-|QBJ12867.1|DBSCAN-SWA MTENGNDRRIDYVEFNVSDIERSKAFYGGAFGWSFKDYGPQYCEFSDGRLTGGFTTTAPVSAKGGPLVILYAADIDDAQRQVAAAGGEITVAIFAFPGGRRFHFTDPDGYELAVWSAA >CP036358|1049176:1103571|1060599_1061004_-|QBJ12868.1|DBSCAN-SWA MSVMQSRIIHLSIEKPWSHVYDFVSDPNNMARWAAGLAGGLRPDGQDWIAEGGPLGEVRVNFAPANEFGVVDHVVTLPDGLKVYNALRVTPNGGGAEVSFTLLRLEGMTDEDFNGDARAVTADLEALKALLEET >CP036358|1049176:1103571|1095611_1096154_-|QBJ12901.1|head,protease|DBSCAN-SWA MSNFEKRAATDVKAVGKKLTGYVATFDLETRIGDFSEVIQAGAFGASLRSNPDILALVDHDPGKVLGRSASGSLILEEDQKGLRFELDLPDTQLGRDIASLAARHDIGGMSFGFNVPEGGDEWHGERRTLKAIDLREISVVQAFPAYSGTSLAVRSRKPMTDAERRIRILELEGGAYVAV >CP036358|1049176:1103571|1074125_1075073_+|QBJ12878.1|DBSCAN-SWA MLHSAHPGNEILVSLANAGVQRNGRWLVRGVEFSVSKGEIVTLIGPNGSGKSTSAKMAIGVVKPTEGVVTRKAGLKVGYVPQKLSVDWTMPLSVRRLMTLTGPLAAREIDAALNATGIAHLANAEVQHLSGGEFQRALLARAIARKPDLLVLDEPVQGVDFSGEIALYDLIKNIRNSNNCGILLISHDLHVVMAETDTVICLNGHVCCRGTPQAVSQSPEYMRLFGGAAAKGLAVYSHHHDHTHLPDGRVQHADGTVTDHCHPEDGHHHGHDLHHDHGHDHHDHGHRHDDHGECGCGHERDDDAHLNQRQGERHV >CP036358|1049176:1103571|1061125_1062709_+|QBJ12869.1|DBSCAN-SWA MNIYRAGQERPQQPVERLDAEAVNSAKTRKPLYEARRKIFPKRAEGRFRRFKWLVMLVTLGIYYLTPFLRWDRGPYAPDQAVLIDIANRRFYFFFIEIWPQEFFFVAGLLVMAGLGLFLVTSAVGRAWCGYTCPQTVWVDLFLVVERAIEGDRNARMKLDAAPLTVGKFRKRVLKHAIWLVIGALTGGAWIFYFADAPTLAREFVTGQAPMIAYSTVAILTATTYVFGGLMREQVCTYMCPWPRIQAAMLDENSLVVTYNDWRGEPRSRHAKKAAAAGESVGDCVDCNACVAVCPMGIDIRDGQQLECITCALCIDACDGVMDKIGKPRGLIAYATLAEYQSNMALATGNGQHAIRPANVREEDGKFSKRVRHFNWRIIFRPRTLLYTAIWAAVGIGMLSALVTRERLALNVLHDRNPQYVLESNGSIRNGYTVRILNMVPQPRTMSLTINGLPDAVMKINGMPDAAARAFEVTVEPDEATTLKVFVTRPGGRVARAAENFEFIVSDTGGHETARYDAVFNAPGATK >CP036358|1049176:1103571|1075880_1076294_+|QBJ12880.1|DBSCAN-SWA MNAQTQQNLTKNQSLVMNALSNAHQPLSAYMILDKLRDDGFRAPLQVYRALEKLVEYGLVHRLESLNAFVACTHTQAECCSSHHGTVAFAICESCGQVTEFHDHEIDHRLERWVKDSKFKAEKTTIEIRGLCAACSA >CP036358|1049176:1103571|1052936_1055198_+|QBJ12863.1|DBSCAN-SWA MRKPAAENAVADEAAGMVVADRRMSFALPGLVLAGVALVCAIITLFVLLGVTPIAPTSNVVIASVVINSIFVIGLIFLIGREINRLLKARKKGRAAARLHVRIVVLFSIVAITPAVLVAIFASLTLNVGLDRWFSLRTQSIVDSSSNIAQAYMMENAGYLQGQTLSMATDLDRNRALFYLDRTGFVDLMTRQAKGRGLLGAFLVQEDGDAVAQADIKTEKPLPAIPHDALEKAAAGQPTLIPPGITNLVGAIIKLEGISGTFLYTVRAVDPKVMGAMRLMEENRAEYKAMEAGRAPLQIAFAILYLGFALIVLLAAIWTAIAVADRIVRPIRLLISAADSVATGNMHVLVPVRAVDGDVGRLSRTFNKMVSELRSQQEQIIEAKDDIDDRRRFIEAVLSGVTAAVIGVDENRRITIVNPSGEEFLAQTSEQLIGAQLSEIAPEIEQVVNEANSWSRGNFRKQINIMRRGKERTLNVQVTREDARDSRDSYVITLDDITDLVIAQRSTAWSDVARRIAHEIKNPLTPIQLSAERIKRRFGKQIDESDRAVFDQCTDTIVRQVGDIGRMVDEFSAFARMPKPTKEKSDLRAILKDAAFLREISAADTKFTTELGDTPLEGMFDARMLGQAFGNLIKNATEAIEAVEGEKRPGKILVRASFDEANSRFVADIIDNGRGLPVENRHRILEPYMTMRDKGTGLGLAIVKKIIEEHGGYLELHDAPPEFDHGHGAMIRVLLPYIEAVGGENNKEAAYGV >CP036358|1049176:1103571|1093767_1094082_-|QBJ12898.1|DBSCAN-SWA MKVKTISIEPISKGYDRDTGFRKLAKVQFFLPDMQMTIRDVVLTHDHEHGFAVAHVKPKEGQSTLQWVRNSPFAKALAEAAATAYSGMLAEDLDELKALYKAAA >CP036358|1049176:1103571|1078738_1079539_-|QBJ12884.1|DBSCAN-SWA MSTSGTIRTGIGGWTFEPWEGTFYPEKLPKKRQLEHASRQLTAIEVNGTYYSSQKPETFAKWASEVPEDFVFSLKASRFVTNRRVLAEAGESMTKFLTQGLTELGSHLGPILWQFAPTKKFDAEDFGAFLALLPEKQDGLTLRHVVEVRNPTFQVPEFIELLARHKVAVVCADHHDYPMLPDVTADFVYCRLQKGEDDIETCYPKKEIAHWAERVKTYASGGVPDDLPLIAPDRKVEKTPRDVFAFFITGGKVNAPNGAQELQKAV >CP036358|1049176:1103571|1076894_1077584_-|QBJ12882.1|DBSCAN-SWA MAATVIPAPKPVLLPVEGTSDLFPVRRVYCVGRNYADHAIEMGHDPSREPPFYFQKNPDNLLPAGQDFPYPSLSSNVHYEVECVVVLKSGRADIPASEALNHVWGYAVGIDMTRRDLQDGLKKMGRSWEGAKAFEYSAPVSPVVPADRIGHPATGAIWLDVNGERKQTGDLAQMIWKVPEVIAELSKLFTLAAGDVIMTGTPAGVGPIVRGDRIECGVEGVGTLSVTVV >CP036358|1049176:1103571|1084987_1085617_-|QBJ12889.1|DBSCAN-SWA MASSETLINALTTLLVTLDPPGLAPVFLALTAGMTRDQRSQVALRGSIIAFGILAVFALFGLAILSLLGISLGAFRIAGGLLLFWIAFEMIFEKRQERKEKTSEIAITKDHLHNLAVFPLALPLIAGPGAISATVLLAGSMKTTVEMVVLILILAFAMALVYAALIVSERMDRFLGNTGRAILTRLLGVLLAALSVQFVVDGIKSAFAF >CP036358|1049176:1103571|1096150_1096498_-|QBJ12902.1|head,tail|DBSCAN-SWA MRSLMLTWLSAIPKRSHGRPSPNMASVSLAEAKAHLRVDYPEDDAYVSTLINAAEGYISEIGVPTDKLASPPVKHAALLLVGHWFAFREAAAEKPPQAIAFGVDALVQPFREVSF >CP036358|1049176:1103571|1050189_1051338_+|QBJ12861.1|DBSCAN-SWA MSADKHSPADANPLAMAVLNAVQNPVILVDAEGFIAFANWEAEAFFGASASHLARYRVSTFIPFGSPLLALIDQVRERRAPVNEYRVDLSSPRLGQDKLVDLYVAPVVSEPGSVVVVFQERSMADKIDRQLTHRAAARSVTGLASMLAHEIKNPLSGIRGAAQLLETSVADEDRALTRLICDETDRIVSLVDRMEVFSDERPVDRVPVNIHSVLDHVKAIAKAGFARNIKISENYDPSLPPVYANRDQLVQVFLNLVKNAAEAVANQSDGEIVLTTAYRPGIRLSVAGSRERISLPLEFCVHDNGPGVPADLLPHLFDPFITTKTNGSGLGLALVAKLIGAHGGIVECDSQNHRTTFRVLMPVSPEVALDDSTLPNTTGNDQ >CP036358|1049176:1103571|1088718_1088979_-|QBJ12891.1|DBSCAN-SWA MAVNENYSDIASGNGHGSLSVLHPIHEAAMRIADLGLNRSKAKTRDLVALLLSHGARAWRANQPEASIHLHVARRSGRSPVHIRIR >CP036358|1049176:1103571|1085833_1088626_+|QBJ12890.1|DBSCAN-SWA MTDQSPPGGGKLPPGIEPISIMEEMQRSYLDYAMSVIVSRALPDVRDGLKPVHRRILYGMSELGIDWNKKYVKCARVTGDVMGKFHPHGNSAIYDALARMAQDWSLRLPLIDGQGNFGSIDGDPPAAERYTECRLDKAAHSLLDDLDKETVDFRDNYDGTLQEPVVIPAKFPNLLVNGAGGIAVGMATNIPPHNLSEVIDGCIALIDNPAIELPELMQIIPGPDFPTGALIMGRSGIRSAYETGRGSVIMRGRATIEPMRGDREQIIITEVPYQVNKASMIEKMAELVKEKRIEGISDLRDESDRQGYRVVIELKRDANADVILNQLYRYTPLQTSFGCNMVALNGGKPEQMTLLDMLRAFVSFREDVVSRRTKYLLRKARERAHVLVGLAISVANIDEVIRVIRHAPDPASAREELMTRRWPAQDVESLIRLIDDPRHRINEDGTYNLSEEQARAILELRLARLTALGRDEIGDELNKIGAEISEYLEILSSRLRIMQIVKDELSAVRDEFGTPRRSEIVEGGPDMDDEDLIAREDMVVTVSHLGYIKRVPLTTYRAQRRGGKGRSGMATRDEDFVNRLFVANTHTPVLFFSSRGIVYKEKVWRLPIGTPQSKGKALINMLPLEPGERITTIMPLPEDETTWETLDVMFSTTRGTVRRNKLGDFVQVNRNGKIAMKLDEEGDEILSVETCTDRDDVLLTTALGQCIRFPVDDVRVFAGRNSVGVRGINLAEGDRIISMTIVGHVEAEPWERAAYLKRSAAERRAAGVDEDDIALVGEEVTEEGELSEERYQELKAREEFVLTVSVKGFGKRSSSYDFRTSGRGGKGIRATDTAKTSEIGELVAAFPVEEGDQIMLVSDGGQLIRVPVNGIRIASRATKGVTIFSTAKDEKVVSVERINEPEGDDEAENGNGEEADDNLPTTPEAPESEA >CP036358|1049176:1103571|1065481_1065655_+|QBJ12872.1|DBSCAN-SWA MNMLIYLIPVALLLGALGLFAFLWSVRSGQYEDMDGAAWRALDDGDNRPRSSISNGP >CP036358|1049176:1103571|1094419_1095625_-|QBJ12900.1|portal|DBSCAN-SWA MWPFKTKETRAVSSSDPFLGKFLGARWQARADIEKASGHAVAHRCIQLIAEQLAAVPLKVYRRTDEGGREAASDHALYPVLKDTFSPLLTAFEGREWMNVSALMYGNAYARIERNGRSQITALHPIPSPSVTVERLSTGRLRYKVALANGGTETYTQDEILHVRYRTKDGVLGLSPIQIASATFGLALAQQDTAGTAAENAFRPAGALVFPEKLGGAGKDEAIKKFKDRFVGQLKANEVMVLDGGAKFETFQFNSRDSEFLESRKLSNLDICRVYGVPPSAVGITDDATYSNIGEESRALVTRVLAPWAKRIESVYNTTLLSPEARKTHYIEHDLSGLLRGDLATRYAAYKIGREAGFLSVDEIRGFENMSKVPGGDTYMEPLNMARLGVSQNAQASEVQQ >CP036358|1049176:1103571|1091544_1091808_+|QBJ12894.1|DBSCAN-SWA MAARDRYSRKLECSKCGHQGFAEASESDDKTRRDVDFRIDEMPRGFRAERPSGDPTDFMIRCAQCGNIFRFLQKTAYAPGGEPRVKA >CP036358|1049176:1103571|1065782_1067213_+|QBJ12873.1|DBSCAN-SWA MEQAEIGLIGLGVMGSNLALNIAEKGNKIAVFNRTPEATRKFYAEAGELQGQLIPCETIEEFVAAIRPPRPIIIMIKAGDPVDQQMEILKPHLSNGDIMIDAGNANFRDTIRRFDNLKDSGLTFIGMGVSGGEEGARHGPSIMVGGTEDSWKRVEKVLTSISAKYNDDPCVAWLGNDGAGHFVKTIHNGIEYADMQMIAEIYGILRDGLKMSAAEIADVFGEWNKGRLNSYLIEITEKVLRAADPITGKPMVDLILDKAGQKGTGKWSVIEAQNMGVAATAIEAAVAARILSSQKDEREAAEKIFGLPALAAAPADRKAFIADLESALLAAKVGAYAQGFAIMSAASKEFNWNLPMPTIARIWRAGCIIRSEFLDEITSAFTKDPHVANLIVTPAFSAVVKETDAPLRRVVSYAALSGLPVSALASALGYFDAYRRGRGSANLIQAQRDFFGAHGFERTDGVDKPHGPWGSGADIF >CP036358|1049176:1103571|1089061_1090555_-|QBJ12892.1|DBSCAN-SWA MRLKSSAVLKSLSWAHSRKAGTGSFVRFVAVTLAAAFITATSVQTAKADPKYAGIVIDAKTGKVLYGEDPDGLRYPASLTKMMTLYLTFEALSAGRISLDSKVPVSANAAKEPPSKLGVRAGGSITVEQAILALVTRSANDMATALGEYLGGSEDRFARMMTAKARALGMTRTTYRNANGLPNTAQMTTARDQARLGIALRQHFPQYYGYFSTRTFNFGKQVIGNHNRLVGTVKGVDGIKTGYTRAAGSNLATSAQLDGRSIVAVVLGGRSSAARDATMRKLVATYLPQASRGGNSNLIAQTRSAPVEEPVAVAAAVPSVAVAATDAGLPNAGPVPQTRYEEAPVTAFAGSSSNAAVKAMEAATWQKGKDPIRQAPSAPDRKLITNSTKVDNIVTASTSASAASVPSVAARSDAPQGGWVIQIGASPDENSARGLLQSAQEKGGAALRSAKPFTVAYSKDGSQIYRARFGGFDGQNAAVNACNALKKKGVSCWASLQ >CP036358|1049176:1103571|1082823_1083360_+|QBJ12886.1|DBSCAN-SWA MAGSVNKVILIGNVGADPEIRRTQDGRPIANLRIATSETWRDRNSGERKEKTEWHTVVVFNEGLCKVVEQYVKKGAKLYIEGQLQTRKWQDQTGNDRYSTEIVLQGFNSTLTMLDGRGEGGGRSGGGDFGGGNDYGSGGGSSYGGGYDQQSSSRGGSSRGGNQPSGGFSNDMDDDIPF >CP036358|1049176:1103571|1083399_1083657_+|QBJ12887.1|DBSCAN-SWA MTDETLQPKQPATWRIILAFFLDFWTAFFAAGFLVATVAGGRTPEGFALNGAPAFIAFVLIIAYFVVLGRFFGGTLWQRLLKARR >CP036358|1049176:1103571|1076333_1076867_-|QBJ12881.1|DBSCAN-SWA MPLYRLADRVPQTPAADRYWIAPDANVIGSVTLGEDVGIWFGATLRGDNEPIIVGRGTNIQEGVMVHSDPGFAATIGEMCTIGHHAIVHGCTIGDNSLIGMGATILNGAKIGRNCLVGANALVTEGKEFPDNSLIVGSPARAIRTLDDDAVAGIRRSAKKYVENWKRFSQDLAVIER >CP036358|1049176:1103571|1100413_1100671_-|QBJ12906.1|DBSCAN-SWA MNERTHTENMQVINALVGKPVAWSGLPESDEKVLALLRKRSIPAAPMILSRLTGYGRHEVNEVLLRLKRHGLAKAVGHGEWVAVQ >CP036358|1049176:1103571|1098134_1099799_-|QBJ12904.1|terminase|DBSCAN-SWA MGRRGPGAKPKASGKTGDIMSGGQPKHRKVLPWEVEGLTRLEAVVAFVNDMKITQGKLAGQNMQLREWQIEEFLAPIYATDEYGRRPVRTAVLSMGRKNGKTGLSAALALCHLVGPEAEQRGELYFGAMDKIQAGKAWAECKAMLEAHVELSERVNIIKFSKEIEVEAGYPGEGSVLKAVSADADSKLGLSPSFFLADEAGYWVKRDLFDAMDSALGARDEPLVVVISTQAKDDTHFFSEMIDYGLKVKTGEVEDESFHLALFTTDPDEDAWSYDTWIKANPALGDFLALEQVERMAAQAQRIPSKEADFRNKILNQRIDGTVRFIAAREWNDCDLEPIDEAALEGRECYGALDLSAARDLTAFVLVFPEDDGRFTVLPRFFLPQFDIDGKSENDRVPYNVWARQADARLTLLPGKVIDPQAVAEYIADEAGRFDIKEIAFDRWRIEDLRRELAKASIELPLTPFGQGYKDMSPAVDMLEVAVAQQKVNHAGNPLLRMCAANAVVTKDPSGARKLDKSKASGRIDGLVALAMALQTAARHEEDNSLPACLMEDA >CP036358|1049176:1103571|1062705_1063206_+|QBJ12870.1|DBSCAN-SWA MTVNNRHTSGFTFTGWHMLGVMLLFFGTVITVNMVMAWNAVNSWSGLVVPNTYVASQQFNAKAEAAKARAATGIKGKLAVDERTVRYEVFHPDTGPVDTDQVIAHFRRPVGERQNFDMELTPVAKGVFTGPHDMLPGQWIVEITAVKNGRIIVHEGTRIAVVRGRQ >CP036358|1049176:1103571|1063202_1065485_+|QBJ12871.1|DBSCAN-SWA MSCCAPGTEGSLQLADPVNPPSSEELMLASRDLGQGLRQTDLSVPDVYCGACITTVESALGRLPQVERARLNLSSKRVAIVWRQEVEGVRTDPADLARAILATGYRTHLFASGQDASDALRSQLIRAVALCGFASANIMLLSVSVWSGADAATRDMFHWISAMIAAPALIYGGRFFYQSAWSALKHGRTNMDVPIALAITLSYAVSLWETIHHGEHAWFDATVSLLFFLLIGRTLDHIMRDKARSAIAGLARLSPRGATVLGENGTREYRPLADIEPGMSIAIAAGDRVAVDAVVESGSSDLDMSIVNGESAPRRVAAGDSLQAGTLNLTGSLVARVTASARDSFLSEVISLMEAAEGGRARYRRIADRAASYYSPVVHLLALVSFLGWGFFGGDWKQAMLIAIAVLIITCPCALGLAVPVVQVVAAGRLFRHGIMVKEGSAMERLAEIDTVLFDKTGTLTIGRPRLVETGEVKPATMAIAAGLAAHSRHPLSKALHAAYSGPLPAYETVHEIPGSGVEAKTDVGTYRLGNRRFACPDDEGADNGNARSEVVLSLDGRLLVSFGFDDNPRAGAAAALRSLSVRGLAQEIVSGDRAAAVSAMADRLGIANWSADLSPKDKAARCASLAGEGHKVLMVGDGINDAPALAAAHVSVAPATAADIGRQAADFVFMQEDLDAVPFAIETSQQAGKLIRQNFALAIGYNIIAVPIAIAGYATPMIAAIAMSTSSLIVVANALRLAGSAGRRNEPETGFAGEGAQLA >CP036358|1049176:1103571|1083710_1084859_-|QBJ12888.1|DBSCAN-SWA MRSTLRRIIPDAAPVSNRERLRSATGAFVGILLTGILASFALRFDPTLPAMIAPMGASAVLLFAVPSSPLAQPWSILCGNFVSAFVGVTVALLVADPFLASALAISLAIAAMMALRCLHPPSGAVALTAVLGGPAIHGLGYGFLLWPVAGNSLILLALAVAYNNATGRAYPHGLKLGKAAHGTADPTPIQKIGFSSTDLDDVLKEYDEFIDIDRDALETILRKTELRSYRRRALHLDCESVMSRDVVGVAPDDSLRHAHALMQSHHFKALPVTNDCAEIVGIVTQTDFLEKASWRNGRPSIGFLQRLRLILSGASAPNDTVKDIMTSPVKTVRPETPIEEAIIHFAEEGLHYLPVIDAKGKMVGIVSQSDVMVAMLADKVAA >CP036358|1049176:1103571|1051334_1052786_+|QBJ12862.1|DBSCAN-SWA MTATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWVSAGEGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIAIIGRALSEPKRKPAKLDDDMQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQNRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRTDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFIQTGEKEGLEGKRFETEALEVMKAYAWPGNVRELENLIRRLMALYPQEVITREIIEQELQSDVPDSPLDKMAVRTGSLTISQAVEENMRDYFASFGDGLPPPGLYDRVLRELEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRPS >CP036358|1049176:1103571|1049176_1050193_+|QBJ12860.1|tRNA|DBSCAN-SWA MKDHHLALPELSEPFSIGSVTIRNRAVLAPMSGVTDLPFRQLAWRYGAGLVVTEMVASRELVANRGESWARLKNVGMVPHMVQLAGREAHFMAEAAKIAAGNGAGIIDINMGCPAKKVTGGYSGSALMRDPDHALSLIEATVNAVDVPVTLKMRLGWDENSINAPDIARRAEAAGVRLITIHGRTRMQFYEGRADWDAIRAVREVISVPLIANGDIETAEDAREILRRSGADAVMVGRGAQGQPWLPAVLAGHAAPHREDIPAIAVEHYEMMLEFYGREAGLRHARKHLGWYLDRFAPGIATTDKAKIMTSRETGEVADLLRSALCENAGEDAARKAA >CP036358|1049176:1103571|1058827_1060159_+|QBJ14477.1|DBSCAN-SWA MRAVVLVPVLKQRDTRDAAALPAAAGRSVEAKLEEAKGLALAIDLEVTQGLIVAVNQPRPATLFGTGKIEEIGHLLDETNSGLVIVDHPLTPVQQRNLEKHWNAKVIDRTGLILEIFGRRASTKEGTLQVDLAHLNYQKGRLVRSWTHLERQRGGAGFMGGPGETQIEADRRLLQERIVKLERELEQVVRTRQLHRAKRRKVPHPIVALVGYTNAGKSTLFNRITGAGVLAEDMLFATLDPTLRRMKLPHGRTVILSDTVGFISDLPTHLVAAFRATLEEVLEADLILHVRDMSDPDNAAQSADVLRILGDLGIDEKEAEHRIIEVWNKVDRLDPEAHDAIMQRAEGSANIRAVSAITGEGVDALMDEISKRLSGVLTETTVVLSVEQLPLISWVYSNSIVDGREDHEDGSVALDVRLSEAQAAELERKLGKTTTREREDWER >CP036358|1049176:1103571|1071888_1072941_+|QBJ12876.1|DBSCAN-SWA MNDTGKSGGSEAPVTTGERPTLKTIAYMTGLGITTVSRALKDAPDIGAETKERVRLIARQIGYQPNRAGVRLRTGKTNVIALVLSVDEELMGFTSQMVFGITEVLSSTQYHLVVTPHIHAKDSMVPIRYILETGSADGVIISKIEPNDPRVRFMTERKMPFVTHGRSDMGIDHAFHDFDNEAYAYEAVERLAQCGRRKIAVIVPPSRFSFHDHARKGFNRAIRDFGLTEFPLDTVTIETPLEKIRDFGQRLMQSDDRPDGIVSISGSSTIALVAGFEAAGVKIGKDIDIVSKQSAEFLNWIKPQIHTVNEDIKLAGRELAKALLARINGAPAETLQSISRPVWSSMAPKP >CP036358|1049176:1103571|1092792_1093647_-|QBJ12897.1|DBSCAN-SWA MTIKPFAFVLMPFSSEFDDIYKLGIQAAAIESEVVAERVDEQIYSESMLERIYRQIDAADFIIADLTGKNPNVFYEIGYAHAKGKPCTLLTQRTEDIPFDLKHHRHLVYEGKIQKLKEQLLNEIAWQKSELEKRRSTTFHVSLARLFADLSKTKYSATAEAIFKIDIHNHTQRRTPDIEVIYLHTGKGWSFSQNGELCPFSESTISNDQLRHFIKPPVPRLSPGAWAQLAVEAKKTVWTKWGQGTEEPQEKYQLKGVLTVEIVSSEGSFVERLNIDIEAGEIPF >CP036358|1049176:1103571|1099956_1100286_-|QBJ12905.1|DBSCAN-SWA MSRWPYNTAQWQRLRIAKLMECPVCEPCRARGVIEIAEVVDHDKAINAGGDAFPPLSGLTSMCASCHNRKTNAKDRRTAKTGRSSGFRRAWAGFDVDGNPIDPEGWNDT >CP036358|1049176:1103571|1096430_1097672_-|QBJ12903.1|capsid|DBSCAN-SWA MVAPTQPSNSASAWLFARLLEKETYLNIHHLRETRSTKLNELKALGEIPDNAKFTAIEGEIRALDSQIKNAATIAEFERHEASPQGDAMARELRSYSVSKAIREGNGDSLTGVEREVHDELSKGREVRGVMVPTSLIFGDENRAMLTTGTAGNTVATNLGGLIDRLRPVLAVQSLGATVISGLTGNLDLPRLTSGPQAYWVNEDEATTASDATFDKVSLSPKTVSGEMYLSRRLLLQNGVALENVLRQDLAFVLAQALDSAAINGTAAAKQPVGILTQITESATTATDLTDIAADLIAALQIDDVTGTTGFLTNPALMGVARKLKDGQQRPISTADTFHNERVVSTNQVPTIGGENPLIFGAWANLMIGYWSGVDILANPYTDASKGGLRLHAFLDADVAIRHPEAFAWKAVA >CP036358|1049176:1103571|1102428_1103571_-|QBJ12909.1|integrase|DBSCAN-SWA MARNKLTETKIKKLSTAGIYSDGDGLYLRVRAGGSKQWFFIYKRDGKRTEIGLGGYGQGTAPVSLDLAREKAEGIRQRLARGEELAVRPTFAAIMEDVIQKKMTESKSKGHKAAWRMTLDKYAAPLHKKAVADITRDDVVETLKPIWTEKPETADRTRMRIAAVIDHAKARGLYTGDNPADWRGGLKELLPARQKLYRGHHEAIDYKVLPAVIKKLRAAKGVSSVAAEFACLTAARSGEARGAVWSEIDLENALWIIPAERMKAGKEHRVPLSARALEILKERQEMATGVLVFEGESEGKAISDTAMVKALRAATGGTETLHGLRSSFRDWAGDETHHPREVIETALAHTLKDKTEAAYRRSDALAKRRKLMDDWSAYLD >CP036358|1049176:1103571|1100858_1101950_-|QBJ12907.1|DBSCAN-SWA MEPHAEDESPGWGGKFNFEKIGRRRAAFSAPARKSKPKPEPEDESEEVRLTAKQLMRRRFERPRQIVEGLFPAGCLLLVGPPKQGKSWLSLQLARCVEAGIPFMGHRTEQGDVLYLALEDGFIRLQDRLGKQEIEGAESFTDALDFQIQIPTAEKGGLKELEEWLIAHPDASLVIVDVLKMFREPRKGKVDPYERDYGDVRPLTKLANKYRVCIVIVHHTNKGSASAVDPFDRVSGTGGISGAADGTILLVPNEDGDLGLYGRGRDFQEFDFRVRFDADACVWEIDTESDESDRQGYGDVTGKILQQLGFMNGNPIGPSELAAIIHEKPLDVSKSLKRLSASRKVTRIGRGKWVLRGLEPKSS >CP036358|1049176:1103571|1101952_1102156_-|QBJ12908.1|DBSCAN-SWA MSEKSPPRLMSPKEAAAETTFSAMQLGILSATGHFPKPVQLSARRIAYVRSEVTAWLDERIASRTTH >CP036358|1049176:1103571|1067781_1068813_-|QBJ12875.1|DBSCAN-SWA MVWQPAENRYTSMKYNYCGKTGLKLPAISLGLWHNFGNDTPHQTKQAICRRAFDLGITHFDLANNYGPPPGSAETAFGEILRTDFRGYRDEMIISSKAGYNMWPGPYGEWGSRKYLIASCDQSLKRMGLDYVDIFYSHRFDPNTPLEETCGALDQIVRSGKALYVGISSYNSKRTREAAAILKDLGTPCIIHQPSYSMINRWIEEDGLVDTLEELGIGSIVFSPLAQGMLTTKYLGGVPDGSRASQSKSLNPAFLNERNVENIRALNGIAERRGQTLAQMAIAWVLRGGRITSALIGASRVEQVEDCVKALDKPDFTTEELAEIDRYAKDADINLWAKSAERV >CP036358|1049176:1103571|1058161_1058644_+|QBJ12866.1|DBSCAN-SWA MAERSQNLQDLFLNTVRKKKISLTIFLINGVKLTGVITSFDNFCVLLRRDGHSQLVYKHAISTIMPGQPLELEIEEGPDRSQNLQDLFLSTVRKKNIALTIFLINGVKLTGVVTAFDNFCELLRRDDHAQLVYKHAISTIMPGQPMQMFENEEGAAREAS >CP036358|1049176:1103571|1079595_1082517_-|QBJ12885.1|DBSCAN-SWA MSELKTISIRGAREHNLKGIDLDLPRNKLIVMTGLSGSGKSSLAFDTIYAEGQRRYVESLSAYARQFLEMMQKPDVDQIEGLSPAISIEQKTTSRNPRSTVGTVTEIYDYMRLLFARVGVPYSPATGLPIESQTVSQMVDRILAFEEGTRLYILAPIVRGRKGEYKKELAELMKKGFQRVKVDGQFYEIADVPALDKKYKHDIDVVVDRAVVRPDMAARLADSLETCLKLADGLAIAEFADRPLPPEETAAGGSANKSLNETHERVLFSEKFACPVSGFTIPEIEPRLFSFNNPFGACPSCDGLGSQQKVDENLIVPEPARTLRDGAIAPWAKSSSPYYNQTLEALGKAFGFKLSSKWSDLSKEAQHAILQGTDDKIEFNYQDGARSYKTVKNFEGIVPNLERRWKETDSAWAREEIERYMSAAPCPACAGYRLKPEALAVKINKLHIGEVTEMSIRTARDWFEVLPESLNAKQNEIAVRILKEIRERLRFLNDVGLDYLSLSRNSGTLSGGESQRIRLASQIGSGLTGVLYVLDEPSIGLHQRDNARLLDTLKHLRDIGNTVIVVEHDEDAILTADYVVDIGPAAGIHGGQVIAEGSPQEVMANPKSLTGKYLSGELGVAVPAERRKPKKGREIKVFGARGNNLKNVTAAVPLGVFTAVTGVSGGGKSTFLIETLYKSAARRVMGAREIPAEHDRIDGFEFIDKVIDIDQSPIGRTPRSNPATYTGAFTPIRDWFAGLPEAKARGYAPGRFSFNVKGGRCEACQGDGVIKIEMHFLPDVYVTCDVCHGKRYNRETLDVTFKGKSIADVLDMTVEEGVEFFAAVPAVRDKLQSLFDVGLGYIKVGQQANTLSGGEAQRVKLAKELSKRSTGRTLYILDEPTTGLHFHDVNKLLEMLHALVEQGNSVVVIEHNLEVIKTADWIIDIGPEGGTGGGEVVATGTPEDIVKVERSYTGHFLKELLERRPAGKREAAE >CP036358|1049176:1103571|1091823_1092051_+|QBJ12895.1|DBSCAN-SWA MTLPQQPNGAPQRRKPIWPWIVLLVLTLCAIYSYNRANEIIAELSDTLPPALLNLFEDLLRGEGRHGHGGPSIGV >CP036358|1049176:1103571|1073001_1074021_-|QBJ12877.1|DBSCAN-SWA MKSILIPLAASMAIAVAASGATAAPEVVVSIKPVHSLVAAIMQGVGEPQLIVDGAASPHTYNLRPSNARKLETADVVFWVGPGLEAFLEKPLEALAAKATVVELEDAKGLEKLPFREGGPFEAHDHGEEGHEANDGHAEKPDAHEHGHEHDGGGHAGHDGESHDDHEHGTYDTHLWLDPANAKAMAQAIETALIAADPGNAAIYQSNTKKLIDDLDTLDAELAKTVQPVKDKPFIVFHDAYQYFEHRYGVKTAGSITVSPETLPGADRVKQMQEKVRQLGATCVFAEPQFEPKLVSVITEGTAAKSATLDPEAATLEPGPELYFKLMRGIAGSLKNCLS >CP036358|1049176:1103571|1055187_1056552_+|QBJ12864.1|DBSCAN-SWA MASDILVVDDEADIREIVAGILSDEGHETRMAFDSDSALAAISERVPRLIFLDIWMQGSKLDGLALLDEIKSRHPEIPVVMISGHGNIETAVNAIKRGAFDFIEKPFKADRLILIAERALENSKLKREVQELKKRTGDAVELVGASLAVSQLRQTIDRVAPTNSRIMILGPSGSGKELVARMIHKKSSRATGPFVALNAATITPDRMEIALFGTEGLPGQPRKVGALEEAHRGVLYLDEVGEMPRETQNKILRVLVDQQFERVGGGKRVKVDVRIISSTAHHLESLIAEGQFREDLYHRLAVVPVKVPALSERREDIPFLVDMFMRQISEQAGIRPRKIGDDAMAVLQTHDWPGNIRQLRNNIERLMILARPEGGEAPISADMLPSDIGDMLPKISAQGDQHIMTLPLREAREMFERDYLVAQINRFGGNISRTAEFVGMERSALHRKLKSLGV >CP036358|1049176:1103571|1075065_1075884_+|QBJ12879.1|DBSCAN-SWA MFDDFFVRAMVAGIGVALTAGPLGCFVVWRRMAYFGDTMAHSALLGVALSLLLQLNLIVSVFLVASAVSLLLIFLQRRQALSADALLGILSHSALAIGLVIVAFMSWVRIDLVSFLFGDILAVTRSDIALIWGGGLVVIVSMVFLWRSLLASTVNTELAEAEGLKPERAKLIFTLLMALVIAIAMKVVGIMLITSLLIIPAATARRFSATPEVMAVVASLIGAVAVVGGLFGSLTYDTPSGPSIVVAAVILFVISLLPAPGLSRSADEGGKS >CP036358|1049176:1103571|1077721_1078732_+|QBJ12883.1|tRNA|DBSCAN-SWA MYRQALASGEKIFAVAPMIDWTDTRCRFLHRQLSKRALLYTEMIVADAIIHGQRDRLLGYHPQEHPVALQLGGSDPAKLAEAVRIAGDYGYDEINLNVGCPSDRVQSGTFGACLMREPDVVAQCVAAMKAAASGPVTVKCRIGVDEQEPEAVLPDFLKRVVAAGADAVWVHARKAWLQGLSPKENREVPPLDYDLVYRMKRENPDVFIGINGGIADLDQASGHLQHMDGVMLGRAAYHNTSILADVDHRIYGEEARHPDWMALRDAMMAYAADYIAAGGRLNHVTRHMVGLFQGMPGARRFRQILSSDATRPGAGPEVIEAAFASIDVSTTKDMAG >CP036358|1049176:1103571|1092380_1092545_+|QBJ12896.1|DBSCAN-SWA MSRRMDKSGILFVALLLVLVAIVLSLTLFDTAQQVPSIPSSDGSLLPPAATPGQ >CP036358|1049176:1103571|1094078_1094438_-|QBJ12899.1|DBSCAN-SWA MGGAAVTGGGDLRRSFGFWKPTDSNDGFGTVIPGAGPPALQFTTAGRFRVLNGDEIYLNGVVSGTRTVEVTIRMQPKSKGVNTTWFMRDTRDNRRYNIKLMTPSEKGDFITFRAEEGKP >CP036358|1049176:1103571|1067278_1067602_-|QBJ12874.1|DBSCAN-SWA MKKFVTILLAATVLAAPMAQAQGRHDDRHRGVTVERHVTKKVIVKKHRWDRGQRLSPAERRHFVDRRDYRRYRLAEPRRDQRWVRVDNQFLLINAVSGLIVGLAAAR >CP036358|1049176:1103571|1090694_1091414_-|QBJ12893.1|DBSCAN-SWA MHSDTGFLSSSLIRRVIVICGALAALLTALNFGMKWYGDRIIQAGHTTSTDEVEITIGNDRLKLAKNTLRNPSDRDSGAHERADLYLTWPDLAGYRDSNRALFDDPKNAVGLIFVQLSQSTMSQDMSGRFEPIYTRLIEGEPTPLKHGLLLHRLRADSGYGKEVILTGRRDGESVYVVRCLLPQTPQETTGSDCQRDIHVGQDLSLFYRFSANLLPQWQALDAAVTRYVEKRLVKDENP |
51 | uncultured_Mediterranean_phage(14.29%) | portal,protease,capsid,head,terminase,tRNA,integrase,tail | attL 1066668:1066686|attR 1109716:1109734 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1270740 : 1278997
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP036358|1270740:1278997|DBSCAN-SWA GATGACACCTCCTTTCCTGCTGCACCACCTGCTGGGCGCGCGCGCGGCAAGCGACGGGCAGGCGATCGTTTATAAAGACACATCCCTAAGCTACCGGCAATTTGCCGAAGCGGCGGAACGATGCGCGGCTGCCTTGCAGCAGGCTGGGGCGCAACGCGGCGACCGGGTCGTCATTTTTCTGCCGCGAGGCACGGAAGAATGCTGGGCGATCTTCGGCGTCAGCATGGCCGGATGCGTCTTCGTGCCGGTCAATGCGCTGCTGAAGGCGCAGCAGATACGCCATATCATCGTGGATTGCGGCGCAGAACTTGTCATCAGCAATGCGACCATGCGCGACGAGCTGAGCGCGGCGCTGGAAGGTCTGGCCGGTGTCCGCGTGCTTCTGGCGAACGATATCGCCGAAGGCGGCAAGACGTCGGTCAAAAGCCCGGCGGCAATCGGCGAAGACCTCGCGGCGATCCTCTATACATCCGGTTCTACCGGCTCACCCAAGGGCGTGATGCTGTCGCATCGCAATCTCCTGGCCGGCGCGCGCATCGTGCGCACCTATCTGGAGATTACCGGCAAGGATCGCATTCTTTCCCTTCTGCCCTTCAGCTTTGACTATGGCCTCAACCAGCTGCTGACGGCAGTGGAACAGGGCGCGACAACAATCATCTCCACCTTTAGGCTCGGTGACGATATCGTCCGCGACCTGCGCGACCACGCCGTGACCGGGCTTGCCGGGGTGCCGACAGTCTGGGCGATCCTGACGAGGGCGGCGCCGTCGCTTGCCAGGACGCCGCTTCCGCATCTGCGTTACATCACCAATTCTGGCGGGCGCGTGCCGCAGGAAACTGTGAAGGCGCTGCGCGAAAAGCTGCCGGACACGAAAATCTACCTGATGTACGGCCTGACCGAAGCCTTCCGCTCCACCTTCCTGCCGCCGGATGAAATCGATCGCCGCCCGACCTCCATCGGCAAGGCCATTCCCGAATGTGAGATATTCATCGTCACCGCCGAGGGACAGCGGGCAAAGCCCGGCGAACCCGGCATTCTCGTGCATCGAGGCCCGACCGTTTCGCTGGGCTACTGGAACCGGCCGGAAGACACGGCGAAAGTCCTGCGCCCGCATCCCTTCATTCCGGCGGCGCTGGGCGGAGAAACCGTATGTTACTCCGGCGACCTGGCGGTGGAAGACGAAGACGGGTTCTTCAGCTTCGTTGCCCGTAACGATGCGATGATCAAGTCTTCGGGTTATCGCATAAGCCCCACCGAAGTCGAAGAAAGCCTGATGTCGACAGGCTTGTTCCAGCAGGTCGCCGTCATCGGCCTTCCGGACCCCTTTGCGGGGGAAAAGGTGCATGCCGTGGCGACTGCCGCCAATGAAAATATCGATGTTTCCGCGGCACTGAAGAAGGCCGCTGAAATGCTCGCCCCCTTCATGATCCCGCGCGCCATCGAGCTGGTCGACCGGTTGCCGGTCACCGCCAATGGCAAGGTGGACTACCGCGCGCTTGTGCGCGAACGGACGGACAATGGCGCCCACGGATAAACCCCAGAGCCACGGTCCCGCCCTTGCGGCCGCGTCATTCGATATCAGCGAAAACGATCTTGTGATCGGCGGGCTTGCCGTGCGCGATATCGTCGCCAAAGCCGGAACCCCATGCTTTCTCTATGATGCTGGCGCGATGCGCCGCGCCTACCGCGATCTCCAGGCGACGCTCTCCGGTTTTGCCGATATCTATTATTCGGTGAAGGCCAATCCCCTGCCCGCCATCATCGCGCTCTTCCGGCAGGAAGGCGCTGGCGCGGAAATCGCTTCCGCCGGCGAATATCGCGCCGCTATAAAGGCTGGTGTCGCGCCGGAAAACATCATCTTTGCCGGGCCCGGCAAAAGCAGAGCCGAATTGCACGAAGTGATCGCGGGCGGAATCGGCGAAATCCACGTGGAAAGCGCCGAGGAAATTGCCCGCATCGAGGCAATCGGCAAACCCGTCAAGGCCTCTGTCCGCATCAATCCGGTGCCGGATGCGCAAGCCGGCGCCATGCGCATGGGCGGCAAGGCCACCGCTTTTGGTTTCGATGAGGAAGAACTGGAAAATATCCTGGCGCTTTTCCGGCATGCGGGCGTTATCGAGCTCGTCGGCATCCATATCTATGGCGGCACTCAGATCCTCGATGCGGACATGCTTGTCTCGCAATGGCGACATGCCATCTCAATTGCCGCCCGCATGGCGGAAATGCTCGGCAAGCCGCTTGAAACCATCGATCTCGGCGGCGGCCTCGGCATCCCCTACTTCGCCGGCGAAGCACCGCTCGATCTCGCGAAGGTAAAAGCCGCCGTGCCCGATCTGAAGGCACTGCTGAAAGCGCATCCGCTGATCGCCGACGCCCATGTCATCGTCGAGCCCGGCCGCTTCCTCGCCGGTCCGGGCGGCATCTATACGGCCGAAGTCAATTCCGTGAAGACCTCGCGCGGAACCACCTTCGTTGTGACGGATGGCGGCATGCACCATCACCTGGCCGCTTCGGGCAATCTCGGTCAGATCGTCAAGCGCAACTACCCCATCGTTGCGCCCGCCATGATGCAGGCCGACCACGCGGAGACTGCAACCATCGTCGGCCCGCTCTGCACGCCGCTCGATACGCTGGCTCGCAACGCGGCATTGCCGAGGCTCCAGGCCGGTGACCTCATCGCCATCCTGCAGTCAGGCGCCTATGGCGCAAGCGCCAGCCCCGGGGATTTTCTAAGCCACGCGGTGGCGAAGGAAGTGCTGGTGGAAAATGGGGCGTTTGAGATGATCGGGCGATAATCACCCTCGCAGTAGCCTGTCCCCCACAGCGAATTTCGATTTCAGCAGGCACTCGTCATATTCCGCCTCAGCCACGGAATCGAAGACGATGCCGCCGCCGACATTAAAGACGGCGTAACCGTCATCGAACAGTGTCATGGTACGGATCGCCACCGAAAAGCGCATTTCACCACCAGGAGACATGAGGCCGATGGCGCCGCAATAGGCGTCGCGGGGCGCCGCTTCCAGTTCGCGCAGGATTTTCATCGCCCACATTTTTGGCGCACCGGTAACCGAGCCGCAGGGAAACAAGGCGGCGAAAATGTTTTCCACCGTCATGCCAGGCAAAAGTTTCGCCCTGATATGGCTGACCATCTGGTGCACGGTCGGATAGGTCTCGATATCGAAAAGCCGTGGCACATCGAGGCTGCCGACCTCGGTGATGCGGGAAATATCGTTGCGCAAGAGATCAACGATCATCCGGTTTTCGGCGAGCGTCTTTTCATCTGCAAGCATCGCCGCGATGATCGCCCTGTCCTCCTCCGCATCCGCTCCGCGCGGCGTCGTTCCCTTCATGGGATGGGTTTCGATGAACCCCTGCCCGTCAACCGAGAAAAACAGTTCCGGCGAGCGTGACAGGATGACCGGGCCGCCGAGATCGACCAGCGCGCCATATTTCACCGGCTGGCGTTCGATCAACGACCAGAATGCCGTCAGCGGATCGCCGCTCCAGCGGGCATGGACGGGCATGGTCAGATTACCCTGATAACAGTCGCCCCGGCGTATATGGTCGTGCAACAGCTCGAAACGCCGCCGATACTCGGGAAGTGTCCATGCCGGAACCGGATCGGCAAGAAACGCATCCGCGCCCGGAACTTCGTCAGGCCGCGCAAAGCGGCCGTCATCCGGCTGCGGGCCGGAAAAGACCCCGAAATTCAGGAACGGCACGTTGCGCGGTTCTTCCGCAAAAGGCGCAAGCTTCGGCTCGAACAGGAAACCGGCCTCATAAGACATATAGCCAGCTAAATACTTTCCTACACGTCGCAGTTCTTCCATGCGCTTCAGCGCTGCAAAAAACGCTCCTCGCTCATCCGCGACGATGATTTCCTCCGGCTCGGTGAAGGCTGTCACCGTGCCGGTCGTGTCATCCCGGAAAAGAACGTAAGGCGCATGTGCCAAGGTAAAATCCGTAGAAGGTAAATCTGGAGACTTCAGGGCGTCGGAAAGAAACCTCTCCATCGTCATCCCCGGTTCCCGGATCAAGTCCGAGGATGTCCCGAGGATCTACCACCCGTTGACTTTAGCGACCTCGGCAGATCCTCGGCACGAGACCGAGGATGACGTCGAGTGTGGAGACAGGTCACGAGGCATTAAACCGGGTCTATATCGCCCCGCGCCCACATTTCGATGGTTTCCGCATAAAAATCGGCGAAACGGCCTTCCTCGATCGACTTGCGGATACCCTGCATCAGCTCCTGATAATAGGCAAGATTGTGCCAGGAAAGGAGCATGCCGCCCAGCGCCTCGTTGGCGCGTACGAGGTGATGCAGATAGGCGCGGGAGTAATCGCGCGAGGCCGGGCAGTTGGATTGTTCGTCGAGCGGGCGCATATCCTCGGCATGGCGGGCATTGCGGATATTGACCTTGCCGCGGCGCGTAAAGGCCAGACCATGACGGCCGGAACGCGTCGGCATCACGCAGTCGAACATGTCGATGCCGCGCGCCACCGATTTCAGGATATCGTCAGGCGTGCCGACGCCCATCAGGTAACGCGGCTTTTCAGTGGGCAGCACCGGCAGGGTAATATCGAGCATGCCCAGCATCACATCCTGCGGCTCACCGACGGCAAGGCCGCCGACCGCATAACCCTTGAGATCGAGCTGCTTCAGCCCTTCGGCGGAGCGAATGCGCAGGTCCGGCTGGTCGCCGCCCTGCACGATGCCGAACATGGCCTTGCCGGGCTGGTCGCCAAAGGCGACGCGGCAGCGCTCGGCCCAGCGCAGCGACATTTCCATGGCGCGCTCGATTTCCTTGCGCTCGGCCGGCAGCGCGATGCACTCATCGAGCTGCATCTGGATGTCCGAATCGAGCAGTCCCTGAATTTCGATCGAGCGTTCGGGTGACATGTGATGCAGCGAACCGTCCACATGGCTCTTGAAAGTCACGCCCTTCTCATCCAGCTTGCGCAGACCGGACAGCGACATGACCTGAAAACCGCCGCTGTCGGTGAGGATCGGGTGTGGCCAGCGGATCAGCTCATGCAGGCCGCCAAGGCGGGCGACACGCTCGGGGCCGGGCCGGAGCATCAGGTGATAGGTATTGCCGAGAATGATGTCCGCCCCCAGCTCGCGCACCTGATCAAGATACATGGCCTTGACGGTGCCGACAGTGCCGACGGGCATGAAGGCGGGCGTGCGGATGACGCCACGCGGCATGGCGACTTCGCCGAGGCGCGCGCCGCCGCTCGTGGCTTTCAGGGTGAAGGTGAATTTGTCGTGCATCAGTTTTTCCGGAACAACAGGCTGGAATCGCCATAGGAATAGAAACGGTATCCGGTTTCGATGGCGTGCTTATATGCGTCACGCATCGTTTCGAGACCGCAGAAAGCCGAAACCAGCATGAACAGCGTCGATTTCGGCAGGTGGAAATTGGTCATCAGTATGTCGACAGCCCGGAAACGATAACCGGGCGTGATGAAGATGCCGGTCGCATCGGACCACGGGTGAATGGTGCCATCCTCCGCAGCCGCGCTTTCGATCAGACGCAGCGACGTCGTGCCCACGCAGACAATGCGCCCGCCCCGCGCCTTGACCGCATTCAGCCTGTCGGCTGTTTCCTGCGAGACGTGGCCGATTTCGAAATGCATCTTGTGATCGTCGGTATCATCGGCCTTGACCGGCAGGAAGGTGCCCGCCCCAACGTGAAGCGTCACGAAATGCCGCTCGATACCGGCCTTGTCGAGCGCGGCAAACAGGTCGGGCGTGAAATGCAGCCCGGCGGTAGGCGCGGCAACGGCGCCCTTCTCGCGGGCGTAGATGGTCTGGTAGTCGGTCTGGTCCTGCGCATCTTCCGGCCGTTTGGCCGCGATATAGGGCGGCAGGGGAATATGGCCGACGGAGGCAATGGCCTCGTCCAGAACGGGACCGGAGACGTCGAACAGCAGCGTGATCTCGCCCTCCTCGCCCTTCTGCTCGACGGTGGCCTCCAGATGGGCAAGGCCGCAGGCATTATCGCGCTCATAACCGAAACGGATGCGGTCGCCCTGCTTGATGCGCTTGCCGGGACGCGCGAAAGCCTTCCAGCGGGACTGATCGGCGCGCATGTGCAGCGTGGCCGAAACGGCCGTCTCCGGCGCGCCTTCGCGCAGCCTGACACCTTCAAGCTGGGCGGGAATGACGCGGGTGTCGTTGAAAACGAGCGCATCGCCGGCCCGCAGGAAGGACGGCAGGTCGAAAACGCGATGGTCTTCCATGCGATTTTCATTCGGATCGACCACCAGCAGGCGCGCGCTATCACGCGGGTTCGCCGGGCGAAGGGCGATATTCTCCTCGGGCAGGTCGAAATCGAACAGGTCTACACGCATTGCAAGGGCTCTCGAAAATATCAAAACCCGCCCTCAATGGGGCGGGCGGTGTGTAGTGACAAGGCTTCGTCTAACGAGACGTCATCCTCGGGCTTGTCCCGAGGATCTAACCACGTTGCCGTTGGGGATCGTTAGATCCTCGGGACATCCTCGGGCTTGATCCGGGGACCGAGGATGACGGCTGAGAGGTTGGCACCCTTGTCATCAATGTGAAACGCCCCGCAGTTTCCTGCGGAGCGGGTAAACACTTTATAAGCTATTCTCAAGCCGCCGAAGCGAGCTTCATCGAAACGATCGAATCGGGGTCCTTCACCGGCTCGCCGCGCTTGACCTTGTCGATGGCTTCCATGCCCTCGATGACCTGGCCCCAGACGGTGTACTGCTTGTTGAGCCACGGGGAATCCGTGAAGCAGATGAAGAACTGCGAGTTGGCGGAGTTCGGGTTCTGCGAACGGGCCATGGAGCAGGTGCCGCGCACATGCGGGATGGCGGAGAATTCTGCCTTCAGGTCCGGCTTTTCGGAGCCGCCCATGCCGGCGCGGGCCGGGTTGAAGCTTTCGGAACCCTTCTTGCCGAATTTCACGTCGCCGGTCTGCGCCATGAAGTCTTCGATGACGCGGTGGAAAACGACGCCGTCATAAGCACCTTCCGATACCAGTTCCTTGATGCGGGCAACGTGGCCCGGAGCAACTTCCGGAAGCAGCTGGATCACGACCTTGCCGGTCGTCGTTTCCATGATGATGGTGTTTTCCGGATCCTTGATCTCGGCCATGATTATTCTCCTCTGTTCGGGCTCTATTGCCCTGTCTTACTTCTTGCCGACAGTGACCTTGATCATCCGGTCGGGGTTGGAAACTTCGCCGTTGCTGCCCTGCCCGCGCTTGATCTTGTCCACGGCTTCCATGCCGGACACGACCTTGCCGACCACGGTGTACTGGCCGTTCAGGAACGGACCGTCGGCAAACATGATGAAGAACTGCGAATTGGCGGAATTCGGATCCTGCGAACGGGCCATGCCGACCACGCCGCGCGTGAACGGAACCTTGGAGAATTCGGCCGGAATATCCGGCAGGTCGGAGCCGCCAGTACCTGCGCGCTGGGCGCTGAAGCCCTTCTCCATGTTGCCATACTGCACGTCGCCGGTCTGCGCCATGAAGCCGTCGATGACGCGGTGGAAGGCGACGTTGTCGTAAGCGCCCTTCTTGGCCAGCGCCTCGATCTGCGCCACGTGCTTCGGCGCGACGTCGGGCATAAGCTCGATGACAACGGGGCCGTCCTTAAGCTGGACGGTGAGAAGTTCGGCGGCAGACGCAAAGGTGCTGGCGGCAAGCGCGCCCGCAAACATGGCGCCGGCAAAGGCAAATCGAACGAGTTTCATCGGATTGGCTCCAAATGTAAGGCTGAAGTCAGCGCTTCAGCTTGGCGTTAAGGGCTTCAAGCACGGCTTTCGGCACGAAGGCTTCGACATTGCCGCCCATGGCGGCGATCTGCCGAACCAATGTGGCTGTAATGGGTCGCGACGAGGTGCCGGCGGGCAGGAACACGGTCTGGATATCAGGCGCCATCTGGCGGTTCATGCCCGCCATCTGCATTTCATAATCGAGATCGGTGCCGTCACGCAGGCCGCGCAGCAGAAGTCGCGCGCCATGCTGACGGGCCGCATCGACAACCAGATTATCGAAGGAAACGACGTCCATGCGCGCCGCTTCGCCCGGCAGCTGCTCCGCAAGCGCCTGCTTTATCAGCCCTGCCCGTTCCTCGAAACTGAACATCGGCGCTTTTCCGGGATGAATGCCAACCGCGACAATGACCTTTGACGCCACGTTCAGCGCTTGGAGAAGCACATCCAGATGTCCGTTGGTCATCGGATCGAAGGATCCTGGATAAAAGGCAATCGTCAT
Protein sequences of DBSCAN-SWA_3 >CP036358|1270740:1278997|1276013_1277096_-|QBJ13061.1|tRNA|DBSCAN-SWA MRVDLFDFDLPEENIALRPANPRDSARLLVVDPNENRMEDHRVFDLPSFLRAGDALVFNDTRVIPAQLEGVRLREGAPETAVSATLHMRADQSRWKAFARPGKRIKQGDRIRFGYERDNACGLAHLEATVEQKGEEGEITLLFDVSGPVLDEAIASVGHIPLPPYIAAKRPEDAQDQTDYQTIYAREKGAVAAPTAGLHFTPDLFAALDKAGIERHFVTLHVGAGTFLPVKADDTDDHKMHFEIGHVSQETADRLNAVKARGGRIVCVGTTSLRLIESAAAEDGTIHPWSDATGIFITPGYRFRAVDILMTNFHLPKSTLFMLVSAFCGLETMRDAYKHAIETGYRFYSYGDSSLLFRKN >CP036358|1270740:1278997|1272352_1273534_+|QBJ14492.1|DBSCAN-SWA MRDIVAKAGTPCFLYDAGAMRRAYRDLQATLSGFADIYYSVKANPLPAIIALFRQEGAGAEIASAGEYRAAIKAGVAPENIIFAGPGKSRAELHEVIAGGIGEIHVESAEEIARIEAIGKPVKASVRINPVPDAQAGAMRMGGKATAFGFDEEELENILALFRHAGVIELVGIHIYGGTQILDADMLVSQWRHAISIAARMAEMLGKPLETIDLGGGLGIPYFAGEAPLDLAKVKAAVPDLKALLKAHPLIADAHVIVEPGRFLAGPGGIYTAEVNSVKTSRGTTFVVTDGGMHHHLAASGNLGQIVKRNYPIVAPAMMQADHAETATIVGPLCTPLDTLARNAALPRLQAGDLIAILQSGAYGASASPGDFLSHAVAKEVLVENGAFEMIGR >CP036358|1270740:1278997|1274883_1276014_-|QBJ13060.1|tRNA|DBSCAN-SWA MHDKFTFTLKATSGGARLGEVAMPRGVIRTPAFMPVGTVGTVKAMYLDQVRELGADIILGNTYHLMLRPGPERVARLGGLHELIRWPHPILTDSGGFQVMSLSGLRKLDEKGVTFKSHVDGSLHHMSPERSIEIQGLLDSDIQMQLDECIALPAERKEIERAMEMSLRWAERCRVAFGDQPGKAMFGIVQGGDQPDLRIRSAEGLKQLDLKGYAVGGLAVGEPQDVMLGMLDITLPVLPTEKPRYLMGVGTPDDILKSVARGIDMFDCVMPTRSGRHGLAFTRRGKVNIRNARHAEDMRPLDEQSNCPASRDYSRAYLHHLVRANEALGGMLLSWHNLAYYQELMQGIRKSIEEGRFADFYAETIEMWARGDIDPV >CP036358|1270740:1278997|1278502_1278997_-|QBJ13064.1|DBSCAN-SWA MTIAFYPGSFDPMTNGHLDVLLQALNVASKVIVAVGIHPGKAPMFSFEERAGLIKQALAEQLPGEAARMDVVSFDNLVVDAARQHGARLLLRGLRDGTDLDYEMQMAGMNRQMAPDIQTVFLPAGTSSRPITATLVRQIAAMGGNVEAFVPKAVLEALNAKLKR >CP036358|1270740:1278997|1277358_1277868_-|QBJ13062.1|DBSCAN-SWA MAEIKDPENTIIMETTTGKVVIQLLPEVAPGHVARIKELVSEGAYDGVVFHRVIEDFMAQTGDVKFGKKGSESFNPARAGMGGSEKPDLKAEFSAIPHVRGTCSMARSQNPNSANSQFFICFTDSPWLNKQYTVWGQVIEGMEAIDKVKRGEPVKDPDSIVSMKLASAA >CP036358|1270740:1278997|1270740_1272273_+|QBJ13058.1|DBSCAN-SWA MTPPFLLHHLLGARAASDGQAIVYKDTSLSYRQFAEAAERCAAALQQAGAQRGDRVVIFLPRGTEECWAIFGVSMAGCVFVPVNALLKAQQIRHIIVDCGAELVISNATMRDELSAALEGLAGVRVLLANDIAEGGKTSVKSPAAIGEDLAAILYTSGSTGSPKGVMLSHRNLLAGARIVRTYLEITGKDRILSLLPFSFDYGLNQLLTAVEQGATTIISTFRLGDDIVRDLRDHAVTGLAGVPTVWAILTRAAPSLARTPLPHLRYITNSGGRVPQETVKALREKLPDTKIYLMYGLTEAFRSTFLPPDEIDRRPTSIGKAIPECEIFIVTAEGQRAKPGEPGILVHRGPTVSLGYWNRPEDTAKVLRPHPFIPAALGGETVCYSGDLAVEDEDGFFSFVARNDAMIKSSGYRISPTEVEESLMSTGLFQQVAVIGLPDPFAGEKVHAVATAANENIDVSAALKKAAEMLAPFMIPRAIELVDRLPVTANGKVDYRALVRERTDNGAHG >CP036358|1270740:1278997|1273534_1274692_-|QBJ13059.1|DBSCAN-SWA MAHAPYVLFRDDTTGTVTAFTEPEEIIVADERGAFFAALKRMEELRRVGKYLAGYMSYEAGFLFEPKLAPFAEEPRNVPFLNFGVFSGPQPDDGRFARPDEVPGADAFLADPVPAWTLPEYRRRFELLHDHIRRGDCYQGNLTMPVHARWSGDPLTAFWSLIERQPVKYGALVDLGGPVILSRSPELFFSVDGQGFIETHPMKGTTPRGADAEEDRAIIAAMLADEKTLAENRMIVDLLRNDISRITEVGSLDVPRLFDIETYPTVHQMVSHIRAKLLPGMTVENIFAALFPCGSVTGAPKMWAMKILRELEAAPRDAYCGAIGLMSPGGEMRFSVAIRTMTLFDDGYAVFNVGGGIVFDSVAEAEYDECLLKSKFAVGDRLLRG >CP036358|1270740:1278997|1277904_1278474_-|QBJ13063.1|DBSCAN-SWA MKLVRFAFAGAMFAGALAASTFASAAELLTVQLKDGPVVIELMPDVAPKHVAQIEALAKKGAYDNVAFHRVIDGFMAQTGDVQYGNMEKGFSAQRAGTGGSDLPDIPAEFSKVPFTRGVVGMARSQDPNSANSQFFIMFADGPFLNGQYTVVGKVVSGMEAVDKIKRGQGSNGEVSNPDRMIKVTVGKK |
8 | uncultured_Mediterranean_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1295312 : 1303715
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP036358|1295312:1303715|DBSCAN-SWA CCTATTCAAGGAAAGTGGATGGGTTCACCGGCGTCGCATCCTTGCGAACTTCGAAGTGAACCTGCGGACGTTTGGCGCTGCCGGTCATGCCCGAGGTGGCGATGGTCTGGCCGCGCTGAACCTTCTGGCCACGCTGCACGTCGAGATTGGCAGCGTTGCCGTATACCGTGACCTTGCCGTCGTCATGGCGGACAAGAACCGTGTTGCCGAGCTGCTTCAGGCCATTGCCGGCATAGATGACGACACCGTTTTCAGCGGCCTTGATCGGCGTGCCCTCGGGAACCGAAATGTTGATGCCGTCGTTGCGGCTGCCTTCGACATTGTCACCGAAATTATTGATGACCGCGCCGCGCACTGGCCAGCGATATTTGCCGATGCCGGTGGATTCCGGTGCGACGGACGCCATGTCGGCCTTTTTCTCGATATCGCTGACACTGGCGGTGGCCGGAGTTGCGGGCGCAGCCGGAGCGGCGGCGGCGGCCGGCGCCTTATAGGGCTCTGGCTTGACGGATGCGGTTTCGACGGGCTTTGCGGCTTCCTTGGCGGGAACGGAAGCGGTCTTGATGGCATCCGTCGAAGCACCCGGCATGTTGAGCGTCTGGCCGACGCGGATGCTTTCGTTGCCGAGACCGTTTGCCGCCTTGAGAGCAGCCACCGACACGCCGTTTGCACGGGCGATCTTGGCGAGGCTGTCGCCCGGCTGAACCTTGTAGCCACCGGAGGGCGGCAGCGGCTTGCCGCCCGGAGGCGTCAGTTTGCCAGCTTCAGCAGAAACCTTGTCACGGGCGGCGGCCTGGGACGGCAGCACGGCCACGTTTCCATCCGGCGCACGCAGCGGTGTCGGCTGGTCGCCATTGCGGTTGAGCGCAATATCGCCGGCGGCATCCTTGGCGGCGTTGCGGACCTGGCCGAACTTCGGAATGAGGATCGACTGGCCGGCCTGCGCGGCGGAAGCGGTCTTCAGGCCATTGACCCGCAGCAGTTCCTTTTCCGGAACGCCGTAGCGGTTGGAGAGGGTCGCGATGCTTTCGCCGGGACGCAGGGTCACGGAGGCGGCGCCATCGGTGTGCCAGCCGTTTTTGTCGGAACGAACGGTCGCGGTGGTCAGGTTGTCAGCGGCGGGTGCGGCAGCCTGCCGTGCGGGCGCTGCCAGAGACGGCGTGGCCTTGGGCTGGGCGGACGGGAAAGGCTGGGCCATGGCTTCGTTGCGGGTCGACGGATCGCGGCTCGCCACGGCTGTCGGCGCAGACAGCTCGGAACGCTGCACCGGCGAGGCCGAGGCACGGGCCGTGGACGGCTGAACGCCACCGTAACCACCCGATTGCTGCGGATAGGAATTGCCGGCCATGTTCTGGTTGGCATAACCTCCCTGTGGCGCGGCAGAGGCGTAACCGCCATTCATGTCGCCCTGCGGAACCGGGGCCTGCCCGTAAGCACCGCCGCCCTGACGGGCAGGAATTGACGCCGTGGTCATCTGATCCGGTCCACTGGAAAAGAGGCCGCTGAACCGCGTTGCATCCGAGCTACAGCCCGTTGCGACGCTTGCCAGCAATGCCGCTGCGAACATTTTTGCGACTGATTTTCCGATTTTAGACGAATGTCTCATACGCATGACTCGATCTACACTGACGCCAACCCGCCGCATCGAGATCGATCCGGCACAAATTTCAATTCTTGCAAGGCCCATTTCCCGAGGTGGGGAAAACCATGCGTCGATGTTTATGATTAAAGCGCGTTAGTGTTACCACGCCGTTAAAACGTTAAGCCGTTTGGCGCTTTTTTGACACTTCGATAACCATGACGAAGACAGGATGCGCCATGATGAAACAATAAAATCTCTTAAAATCAGATGCTTACGAAAATTCCGCACATTAGGAGACCAAAATACAATGTCTCCAAAATTTTCATTTTGTCACAAATGACGCGCAATATGGGTGCTGAGCGGCAGGTAGGGCGCCTCGAACAGTCCTTCCTTCTCGAAGCGGCTGCCGGTCTTGGAAAAGCGCGTCATCACGCAGCGATCACCTTCAACGATGATCGGCGCGATCATCATACCGCCGGAAACGATCTGCTCGGCAAAGAAACGGGGCATGGTGGTGAAGGCCGCCGTGGAAACGATTCGGTCAAAAGTGCCCTCACCCACCAGGCCGTTGCTGCCATCCGCCTGGCGGATGACGACGTTGCGGATTGCAAGATCATCGAGACAGTTCTGCGCCTGCTGCACCAGCGTCTTGTACCGCTCCAGCGAAAAGACGCGCTCCACCCGGCGGGCAATGATCGCCGTCATGAAACCGCTGCCGGTACCGATTTCCAGCACGCGCTGGCCGGGTTTCAGCTGCAGGCGGGCGAGGATTTTCACCGCCATGTCGGCCCCTTCCATGAAGGAACCGCAGTCTATCGGGATGGTGCGGCTGGAATAGGCGTCGGCGGCAAATTGCGGCGGTACGAATTTGGATCGTGGCGTCTGCTCAACCGCCGTCAGCAGGTCGAGATCGGAAATCCCCTCGCCCCGCAAGCGCAGAACGAGAGCTGCAAAACCTTCCTTTTCGACCATGGCAGATTTCAATCGGCGACTCCGAATCCAAGCGCTTGCGCAACCCGATCCTTCACCGTGTAATCCGTCAGGTCAAGCTTCAGCGGCGTGACGGAAATCTTGCCATGCTTTAAGGCGTGGATATCGGTGCCTTCGCGGAAGGTGCCCATGCGTTCGCCGAAACGCAGCCAGTAATAGGGAAAACCGCGGCCATCCTGGCGTTCTTCCACCGTCAGGCCGAAATCGAGCTTGCCCTGCCCCGTGACGGACACGCCCTGCACATCCTTTGGCGCGCAATTGGGAAAATTGAGATTGAGGAAGGTGCCGTCAGGCAGATCGACATCGATGAGCTTGCGAATGAGATCGGGCGCATAGGTCTCGGCCACCTCCCACGGCACGACGCGGCCTTCGGCATGGCTGAAGGCCTGGCTGAGCGCAAAAGACCGCACGCCCTGCAGCGTGCCTTCAATGGCCCCGGCAATCGTACCGGAATAGGTCACATCATCGGCCATGTTGGCGCCGGCATTGACACCTGACAGGACGAGATCTGGCTTTTCCGGCAGAACCTCGCGAATGCCCATGATAACGCAATCAGTCGGCGTGCCGCGCAGAGCGAAATGCTTGTCGGAAACCTTGCGAAGACGCAACGGCTCGGACAGCGTCAGGGAGTGAGCAAGCCCGCTCTGGTCTGTCTCCGGCGCAACGATCCAGACGTCGTCGGAGAGCGTGCGAGCGATACGCTCCAGCACGGCCAGGCCCTCGGCGTGAATACCGTCGTCATTCGTCAGCAAAATCCGCATGTTTTCCTCCCGCTCTTCATTTTGCTCAGGCAGAACACTTCCCGAACCGTCATCCCGGCCTTGAGCCGGGATCCAGTCGACGCGCGTCTGCGCCACGGGAATAGTCCTTTTCAGCCCAACGACTTGGGCTGGATGGATGCCGGATCAAGTCCGGCATGTCGGCATCTTTGAAATCAAGCCGCCTTTTCGATCCGCGTCAGACCGCCCATATAGGGCAGCAGCACATCAGGAATTGTGACAGAGCCATCGTCGTTCAGGTAATTTTCGAGAACGGCGATCAGGCAGCGGCCGACAGCCGTGCCGGAACCGTTCAGCGTATGCACGAACTTCGTCGCCTTGTCGTCCTTGCCGCGATAGCGCGCATTCATGCGGCGCGCCTGGAAATCGCCGCAGACCGAGCAGGACGAGATTTCGCGATAGGTATTCTGACCGGGCAGCCACACTTCCAGATCGTAGGTCTTGCGCGCGCCAAAGCCCATGTCACCGGTGCAGAGCGTCATGGTGCGGAAATGCAGGCCCAGCCGCTTCAGCACTTCCTCGGCGCAGGCCGTCATGCGCTCATGCTCGGCGACAGCGCTTTCCGCGTCGGTGATGGAGACGAGCTCGCACTTCCAGAACTGGTGCTGGCGCAGCATGCCGCGCGTGTCACGGCCTGCCGAACCCGCTTCCGAGCGGAAGGACGGTGTGAGTGCGGTGAAACGCAGGGGCAGCTTTTCCTGCTCGAGGATTTCGCCGGCCACGAGGTTGGTCAGGGTCACCTCCGCCGTCGGGATCAACCAGCGGCCATCGGTGGTCTTGAACAGGTCCTCGGAGAATTTCGGCAATTGCCCCGTGCCGAACATCGCCTCGTCGCGCACCATCAGCGGCGAGGAAACTTCGGTATAGCCATGTTCCGAGGTGTGCAGGTCGATCATGAACTGGCCGAGCGCCCGCTCCAGCCGCGCGAGCTGGCTGGTGAGAACGGTGAAACGCGAACCCGAAAGTTTGGCGGCCCGCTCGAAATCCATATAACCAAGCGCTTCGCCGATTTCGAAATGTTCCCTGGCCTCGTGGTTCCAGCCGGGCTTCTGGCCGACGACGCGGGTCACCACATTGTCATGCTCGTCCTTGCCATCGGGCACATCGTCGAAGGGCATGTTCGGCAGGCGCGACAGGGCGTCGTTCAACTCGGCGGTGACCTGGCGGTCCTCCTCCTCGGCACGCGGCATCTTGTCCTTGAGGTCGGCGACCTCGGCTTTGAGTTTTTCGGCAAGCTCCATGTTCTTCTGCGCCATGGCGGCGCCGATTTCCTTGGAGGCGGCATTGCGGCGGGACTGCATGTCCTGCAGGGACTGGATGACGGAACGGCGCTTTTCATCGAGCGCGATCAGACCTGCGGCCGCAGGCTCCGCGCCACGGCGTGCGAGCGCCGCGTCAAAAGCTTCGGGGTTTTCACGTATCCATTTAATGTCGTGCATCGTCGTTCCAGACCGTTGTTGCATCACCGATGTTTTACAGCAAAACGCCGGGCAGAAGCCCGGCGCGAGGAGATGTCTTGAGCGGCGAGACCTTCGAAAACTCAGGTCTCCTCCAGCTCCGTGCTTGCGGATTCCGCAGCGCGCTTCCTCTCCACGAGTCGAGCCATGTAGATGGAAATCTCGTAGAGAATGATCGCAGGCAGTGCAAGACCGATCTGGGACATCGGGTCCGGCGGGGTCAGCACGGCCGCGACCACAAAGGCCATGACAATCGCGAACTTGCGCTTCTCACGCAGCCAGTCGCTGGTCAGAAGGCCCACGCGCGCCAGAAGCGTCGTGACCACGGGCAGCTGGAACACCAGACCGAAGGACAGCACCAGCGTCATGATGAGGCTCAGATATTCCGACACCTTCGGCATCAGCGAAATCGCGACCTCGCCATCTTCCGGCAATTGCTGCATGGCGAGGAAGAACCACATGACCATGGGCGTGAAGAAGAAATAGACGAGCGCCGCTCCGATGAGGAACAGGATCGGCGATGCGACGAGGAAGGGCAGGAAAGCGGCGCGCTCGTTCTTGTAGAGGCCGGGTGCCACGAATTTATAAAGCTGCGAGGCGATCACCGGAAAGGAGATGACCATCGCGCCGAACATGGCGACCTTGATCTGCGTGAAGAAGAATTCCTGCGGCGCGGTATAGATCAGCGACGACTTCGTCACATCAAGGCCGGCCCAGAGAACCGCCCACTTATAGGGAATGACAAGCAGGTTGAAGAGATGCTTGGCAACGGCAAAACAGGCGATGAAGGCAACGAAGAACGCACCCAGCGACCAAATCAGCCGTGTGCGCAGTTCCATCAGATGTTCGATAAGCGGCTGCGGCTTGTCCTCGATGTCCCCGCTCATGCTTCATCCTTTTTGGGCTTTGCAGCCTTTGTTCTCGCCGGCTTCACCGGCTTGGCATCGGCGGCCACGACAGGCTTTTCAGCCACGACTTTCCTGGCGGCCGCTTTTCTCGCGACGGCTTTTACGGTCGGCTTTGCTGCTGAAGTCTCCGTGGAATTGACAGCAACGGGTTCGGACGCCGCGACCGCTTTGGCGCGGCTGGCGCGTTTCGGCTTGGCGGCAACCGCTTCCGCCTCGACGGTCGCAATCGATTTAGCCCGGGCACGCTTCGGCTTTTCCGCCGGTACGACCGGTGCCGCTACGGGGGCGCTGGCAGTCACGGGCGGGGTATCCGGCAGCTTCATCTCAGGCTCGGGAATGCTGACGAGCGGTGCAACCGGCTCACTGGTCGCCGGTGCTGCGGTCGACGTAGCAGGCGAGGATGACAGGCCCTCAGGCGGCGTGGTCGCCTTCTGAAGATCGGACTTGATCTCGTTGCCGAGCTGGCGAAGCGGGTTCATCGCATCACGCAGCGAGTTGGTCGGGTTGAGATTGCGGACATCCGATATGGTTTGGCGCACATCGTCCATGTCGGCCTCTTTGAGGGCCTCATCGAACTGGGTACGGAAATCCCCCGCCATCTTGCGAAGGCCAGCCATGGTCTTGCCAAAGGCGCGGATCATGGGCGGCAAGTCTTTCGGCCCGACAACCACGATCAGTACGACCGCAATCACCAGCAGCTCGCTCCAGCCGATATCAAACATCAATATGCTCCCGAGACCCTAACGCGCGTCAGTCTCAAACCAAACGGCAGGGCCGCTTACTTGATCTCGTCAGCCTTGTGGTCGACGGTCTTCGTGTTGGCATCGGCCGGCGGCGGCGTCTGGTCTTCGTCAGCCATGCCCTTCTTGAAGCTCTTGATGCCCTTGGCGACATCGCCCATCAATTCCGGGATCTTGCCGCGTCCGAAAAGGACGAGAACAATCACCAGCACGATGAGCCAATGCCACACACTAAAAGAACCCATAACTGAAACTCCTGAATTTCGCTTTCAGACGATGTAAGACGTTTGAAAGGCTTTTTCAAACAACAAATTGCCGCTTCAACAGGGTGTGAGCATAAAGTTTTTGCGTCGTGTCGCCCGCGCGATAACGATCTCAATCGTCGCCGTCGCCACCACCAGGCGCAAGCAGGCCCAGCTCTTCCAGATCAAGTTGCGTGATCGGGTCTTCATCCTCACGCAATTCGTCGCTCATCATCGGCAGGGGCACGCCGAAATTGGAAGGTATGCGGCCGGAAAGCAGCCCTGCTCCCTTCAGCTCCTCGAGCCCCGGCAGGTCGCGCAACTCCTCGAGGCCGAAATGGTCGAGAAACTCAACCGTGGTACCGATCGTCACCGGCCTGCCGGGTGTGCGCCTGCGGCCACGGAACCGGACCCAGCCCGCCTCCATCAAGACGTCAAGCGTGCCGCGTGAGGTCTGAACACCGCGGATTTCCTCGATTTCAGCGCGTGTCACCGGCTGATGATATGCGATAATCGCCAGGACTTCCAGCGCCGCGCGAGAGAGCTTCTTAGGTTCCTTTTCCTCGGCGCGGATGACGAAGGAAAGATCGCCGGCAGTGCGGAAGGCCCATTGCCCGCCCACCTGCACGAGATTGACGCCCCGGCCCGCATAGGCGGCTTTCAGATGCTGAAGAACGGCGTCGACATCCATGCCGCGCGGCAGGCGCTCCGCAATGAAACCCGGAGAGACCGGCTCGGCCGAGGCGAAAACCAAAGCCTCGGCGATCCGCTCGGCCTCTTTCAACTGGCGCTCGGAAAATACCGTGGGTTCTAGTCCCACCCCATCCGTCGCGATCGTCGCCTCGGCCGTCTCATCATTATCAGTCATTGTCAGTCCTCTCGGCGACCGCCGCACGATCATCCCTCGTGCCACGGCGCATATAGATGGGCTGGAAAGCGCCCTCCTGCCGGATTTGCAGCGTACCCTCGCGCACAAGTTCCAGTGACGCGGCAAAGGCGCTGGCGATCGCCGTCACCCGCATCGCGGGATCGGGGACATATTGCAGCAGATATTGATCGAGCGCGGTCCATTCTCCGACATCGCCGAGGAGACCGGTTAAAAGCTCCCGCGCCTCGACCAGCGACCAGACCTGCCGTTTTTCAATCGTGACCTGGGTGATCGCCTGTCTCTGCCGCAGATTGGCGTAGGCGCTGAGCAGGTCATAAAGGCTCGCCTCGTAGGCAGAGCGGTTGATGTGCGGAATATGCTCCGGCGCACCACGGGCAAACACATCGCGGCCGAGCTGGGCGCGGTTGACGAGGCGTTCCGCCGCCTCGCGCATCGCTTCCAGCCGCTTCAGCCGGAAGGCAAGGGTTGCGGCCATTTCCTCACCCGAGGGGCCGTCATCCTTGGATTGCTGCGGAATGAGAAGCTTGGACTTGAGGAAGGCGAGCCACGCCGCCATGACCAGATAATCGGCCGCCAACTCGATGCGCACGCGCCGCGCGCTTTCAACGAATTGCAGATATTGCTCGGCAAGCGCCAGCACTGAAATGCGCGACAGGTCCACCTTCTGCGTGCGGGCAAGATGCAGCAGGAGATCAAGCGGGCCTTCGAAACCCGCGACATCGATGACCAGTCCGGCCTCGCCCGTCAGCCGCTCCGGCGTCACATCCTGCCAGAGCTTGTCCATCGGTGTCGAATTGCGAGATTTGTCTGCGGCCAT
Protein sequences of DBSCAN-SWA_4 >CP036358|1295312:1303715|1297873_1298644_-|QBJ14494.1|DBSCAN-SWA MRILLTNDDGIHAEGLAVLERIARTLSDDVWIVAPETDQSGLAHSLTLSEPLRLRKVSDKHFALRGTPTDCVIMGIREVLPEKPDLVLSGVNAGANMADDVTYSGTIAGAIEGTLQGVRSFALSQAFSHAEGRVVPWEVAETYAPDLIRKLIDVDLPDGTFLNLNFPNCAPKDVQGVSVTGQGKLDFGLTVEERQDGRGFPYYWLRFGERMGTFREGTDIHALKHGKISVTPLKLDLTDYTVKDRVAQALGFGVAD >CP036358|1295312:1303715|1302142_1302877_-|QBJ13087.1|DBSCAN-SWA MTDNDETAEATIATDGVGLEPTVFSERQLKEAERIAEALVFASAEPVSPGFIAERLPRGMDVDAVLQHLKAAYAGRGVNLVQVGGQWAFRTAGDLSFVIRAEEKEPKKLSRAALEVLAIIAYHQPVTRAEIEEIRGVQTSRGTLDVLMEAGWVRFRGRRRTPGRPVTIGTTVEFLDHFGLEELRDLPGLEELKGAGLLSGRIPSNFGVPLPMMSDELREDEDPITQLDLEELGLLAPGGGDGDD >CP036358|1295312:1303715|1302869_1303715_-|QBJ13088.1|DBSCAN-SWA MAADKSRNSTPMDKLWQDVTPERLTGEAGLVIDVAGFEGPLDLLLHLARTQKVDLSRISVLALAEQYLQFVESARRVRIELAADYLVMAAWLAFLKSKLLIPQQSKDDGPSGEEMAATLAFRLKRLEAMREAAERLVNRAQLGRDVFARGAPEHIPHINRSAYEASLYDLLSAYANLRQRQAITQVTIEKRQVWSLVEARELLTGLLGDVGEWTALDQYLLQYVPDPAMRVTAIASAFAASLELVREGTLQIRQEGAFQPIYMRRGTRDDRAAVAERTDND >CP036358|1295312:1303715|1301805_1302012_-|QBJ13086.1|DBSCAN-SWA MGSFSVWHWLIVLVIVLVLFGRGKIPELMGDVAKGIKSFKKGMADEDQTPPPADANTKTVDHKADEIK >CP036358|1295312:1303715|1295312_1296923_-|QBJ13082.1|DBSCAN-SWA MRMRHSSKIGKSVAKMFAAALLASVATGCSSDATRFSGLFSSGPDQMTTASIPARQGGGAYGQAPVPQGDMNGGYASAAPQGGYANQNMAGNSYPQQSGGYGGVQPSTARASASPVQRSELSAPTAVASRDPSTRNEAMAQPFPSAQPKATPSLAAPARQAAAPAADNLTTATVRSDKNGWHTDGAASVTLRPGESIATLSNRYGVPEKELLRVNGLKTASAAQAGQSILIPKFGQVRNAAKDAAGDIALNRNGDQPTPLRAPDGNVAVLPSQAAARDKVSAEAGKLTPPGGKPLPPSGGYKVQPGDSLAKIARANGVSVAALKAANGLGNESIRVGQTLNMPGASTDAIKTASVPAKEAAKPVETASVKPEPYKAPAAAAAPAAPATPATASVSDIEKKADMASVAPESTGIGKYRWPVRGAVINNFGDNVEGSRNDGINISVPEGTPIKAAENGVVIYAGNGLKQLGNTVLVRHDDGKVTVYGNAANLDVQRGQKVQRGQTIATSGMTGSAKRPQVHFEVRKDATPVNPSTFLE >CP036358|1295312:1303715|1298817_1300101_-|QBJ13083.1|tRNA|DBSCAN-SWA MHDIKWIRENPEAFDAALARRGAEPAAAGLIALDEKRRSVIQSLQDMQSRRNAASKEIGAAMAQKNMELAEKLKAEVADLKDKMPRAEEEDRQVTAELNDALSRLPNMPFDDVPDGKDEHDNVVTRVVGQKPGWNHEAREHFEIGEALGYMDFERAAKLSGSRFTVLTSQLARLERALGQFMIDLHTSEHGYTEVSSPLMVRDEAMFGTGQLPKFSEDLFKTTDGRWLIPTAEVTLTNLVAGEILEQEKLPLRFTALTPSFRSEAGSAGRDTRGMLRQHQFWKCELVSITDAESAVAEHERMTACAEEVLKRLGLHFRTMTLCTGDMGFGARKTYDLEVWLPGQNTYREISSCSVCGDFQARRMNARYRGKDDKATKFVHTLNGSGTAVGRCLIAVLENYLNDDGSVTIPDVLLPYMGGLTRIEKAA >CP036358|1295312:1303715|1297223_1297865_-|QBJ14493.1|DBSCAN-SWA MVEKEGFAALVLRLRGEGISDLDLLTAVEQTPRSKFVPPQFAADAYSSRTIPIDCGSFMEGADMAVKILARLQLKPGQRVLEIGTGSGFMTAIIARRVERVFSLERYKTLVQQAQNCLDDLAIRNVVIRQADGSNGLVGEGTFDRIVSTAAFTTMPRFFAEQIVSGGMMIAPIIVEGDRCVMTRFSKTGSRFEKEGLFEAPYLPLSTHIARHL >CP036358|1295312:1303715|1301002_1301749_-|QBJ13085.1|DBSCAN-SWA MFDIGWSELLVIAVVLIVVVGPKDLPPMIRAFGKTMAGLRKMAGDFRTQFDEALKEADMDDVRQTISDVRNLNPTNSLRDAMNPLRQLGNEIKSDLQKATTPPEGLSSSPATSTAAPATSEPVAPLVSIPEPEMKLPDTPPVTASAPVAAPVVPAEKPKRARAKSIATVEAEAVAAKPKRASRAKAVAASEPVAVNSTETSAAKPTVKAVARKAAARKVVAEKPVVAADAKPVKPARTKAAKPKKDEA >CP036358|1295312:1303715|1300202_1301006_-|QBJ13084.1|DBSCAN-SWA MSGDIEDKPQPLIEHLMELRTRLIWSLGAFFVAFIACFAVAKHLFNLLVIPYKWAVLWAGLDVTKSSLIYTAPQEFFFTQIKVAMFGAMVISFPVIASQLYKFVAPGLYKNERAAFLPFLVASPILFLIGAALVYFFFTPMVMWFFLAMQQLPEDGEVAISLMPKVSEYLSLIMTLVLSFGLVFQLPVVTTLLARVGLLTSDWLREKRKFAIVMAFVVAAVLTPPDPMSQIGLALPAIILYEISIYMARLVERKRAAESASTELEET |
9 | uncultured_Mediterranean_phage(75.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1426159 : 1431693
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP036358|1426159:1431693|DBSCAN-SWA GTTACGATTTCTCGTGCGCAATCTGGTTACACTTGTCCCAGTAGCGCTCACTCATCTGGACGGCCAGGCGCGCGGTATCAGCGTAGCCGAGATTGGGCATGGGCGGCGAGTGACGGAAAGGCTTGGGTGTCCATCCAACCCATTGCCACTTACCCTTTATGGGGCCATGGAATTCCTTGCGGACGCGACCGATCGGCATCCCTCCGTCGTAGCCCATCCAGTCGTAAACGGTTGGCGGGTCGGCTTTGTCGATCTGGGTGCGGATCCACATGTATTTGGGCTGATACAGGTCAGGCATGGCATATTATAGCGTGCGTGCGCCCCACGGCCAACGATCGTCAGCAGGCATGGAAAAGCCCACCGGATCGCTCTGGTGGGCTTTTTGGGCGGCAATCGGCTTATATCTAAATCGTAGCGTCTTCTGCTGCGTAATAGGCGGGCCTGCGATCGAGCTCCGCTTGGATGAACATCTCTGCAAACAGCGGGTACTTCTGCCGCATGCGGTTCTGCAGGTTTCGTATGCGCGCTTTGGATTTTGCTTCGCGGCTCCAGCGTCGGATTGGCTGTTGCGTCACCAGTTCCCAACCGATGCTGTACCCCCCGCCAGGAACCCACAGCGCCATGATTTCAGGCGGGCAACGCTCGCCTCGAGGGACCAGCATGGCTTTTAGTACACGAGGGCCAGGCGGAGGAACATCTGGACGCCGCCATGCCAAAGAGTAGCGAACCAGTTCAACTGTCACCTGTCGGCCTCCATACGTTCGTCATAGCCCGCTAGATCGACCAGGGTAACAGTCTTTGCGACGCGGTTGATCTTGACGGTTTTATCGGCCTTCGACAGCTCGCGCGGCACGATTGTCTGTGGTGCCTGCTTGAGCGCCAGCGTCCAGCCGCCGTGGTTGAATTCACCATCGGCGAACACCCGGTTGAGATTGTCATCGTTGTGCGGGATCGCCAGCGCATAGCCCCTGCAGGGCTTGCCCTTCTCGCGCAGGCCGAGGCCGATCAGCGCCAGGCGGGCGCGGGCATCCTCGATGAGCATATCCGTGCGCTGGTCGTTTTCGAGCGCTTCGATGATGCCGCCGACGGTCTGCCGCTCGCCGGCCTTGTAGGCGTCCACCTTTGCGCCCATGATCTTTTCAATGACTTCCTGCCATTTCGGCACCTGCTCGGCCCTGTCGGCCGACGTCGCCGCCTCTATGGTTTCCACAAGCATATCGAGATCCAGACGAACACCATCTGCCGTCGGATCCTGATGCATGCCGGCGTCAACAAGCCCCTGCTCACCAACGAGGAGCTCCGCGCAGGCCAGCACGGTGCCATAGGTGTCGATCGCGCGTGGATCCAGCGCCAGGCGCTGATCAGAAAGGATCTGCCGCCATTTTGGCAGAATGTGCCAGTAGAAGTCATGAAAGCCGTCCATGATCTGCCTGAGGATCATGCGGCCCCATTCCTCCTTTATCAGCGGCTGCGTTGCGTTGGCGTCCTTATCCAGCGCATTGAGGTTCAGAATGATCATGCGGGTACGGTCCTGGACACCAAGGTGTGGATGCAGGATGGCGGAAAAAATGAAGCTCGAGCGCAACTCGAACTCGGTACCGTCGCCATTCGCACCGCCGCGATAGCCCTTGGCACCGGAGTATGACTGGCGGGCAAGCTCGATGATGGCCTGCTCTTTCTGACCCTGCGCCTTTCGCTCGAACTCATCGACTGCAACCGGGCGGCTGTCCTGTCGGATATTCTGATAGATGCCAGCGGCCGTCGTGTTGGCCGTAGAATAGAGCGCGGGTCCGAAAAGCGCCCGCAGGATCCCATGCAGTGTCGATTTACCTGTGCCGGCGCCACCCATGGTGAACATGATCGGGCGCACATCGAGCGCGCCGGAAAGAAACGCCGAGCCGATCCAGCCCAGAAAAAAAATCGGATCGATGTATGGTCGTTCCCATTTCCAGCTTTTGAGATCGTGCAGGATGATATGCGCCGGGCTATCGTTGACGCCGATCGGCGTCTGCCACGGGTGAACGGTGTCGTTGTCCTGGGCGTAGAAGAACCCGTCGTAATCGCCGGGCTTGGCCGCCTGCAGCTGCCAGTCGGTTGCGCGATTGTTCTTGTCGGTTTTGATGTCGACAGAAAACAGGTGCTTGCCGCTGTGCCAGATAAATTTGTCCTGCGCCTTCCATCCGCCACGGCCGCGCACATTGTTCTGCGGATCGAAAAGGCCCTTGCGGCCAGCCTCGCCGATGATCGCCGTCCAGGCCTGGTCGCGTTGCACGCGCTCGACCTTCGGCGGAACATACCCTTCACTCCCGGGCTCGCCTTTGGCCTTTGACCAGGCTGGCCATGCCCAGAACACATAATTCACGAATGGCGAGAAGATCCGAAGCAAAGTGGGCAGATCGTGGCGGGTGATCTCCTCGAGCTCGCCGATCGCGTTTATGACATAGATCGTCTCGCCCTTTTTACCGAGCACGGTTATCGGGCAGTTCGGGGGCATGTTGTGATGCGGTGCGCCTTCCCACTGTCCGGCCTTGATGCCGTCGCGCAGCAGGTTGGCATCCGGATCGACCAACTCTTTCTTTTCGTCCAGCACCTGCTGCGCGTCCAGGAACTGCGCGCGAATGCCTTGGATACCACCCTGAATTTTCGGTTTTCTTGCCATAACTGCCCGCATGAATGAGAGGAAGCCGCCGCGCCTGGTCGCCAGCGCGGCGGGTATAGATCAGAACGTGTTTTCGTCGACGCCGGCAAAGGATCTGCACTGCTCGGCCTGATGCAGCCTTTCACGCAGCAGATAGCCCTCGAGCTCCCAGATCTTGTCACGGGCCTTCTGGCGAGCGATATCGCGACCGATCTCGGCATTGAAATTCTCAGGGCTGGCGCAGGCGCTCTCACCGGTCACCGTGAAACCGTTCTGCAGGACGAGAACGCAGACCGTCAGGCAATCAATAGCTGGATGGCTGGGCTGGTCTGGCGATCGTGCGAGGAAATATTCTCCCGCAATAACGCTATCGATATGGTCTGGCGTAAGGCGCGGTGCGTTCAGACCTTTGTCCTGCAGCTGGCGCTCGATCGCGGCTTCATCTTTCATGGCATTTCTCCTCGAGGTTCGGCCCGGCACCATTGCCGGGCCCATTGGTTGGCGGCTCAGTGCGTAGCCATCGGCCCGTCAACGAGCTCCATGTTCGTTTCGTCGATCGGCACAGGTCTTGGCTTTGGCTTCTCGGCCGCCAGATCCTCGAGGCGCTTCTTTTCAGCCGTGGCGTATTCGTCGAGGTCGATCAGGATCGACGCGAATGCTTTCAGGCCGAGTTCTTCCGCCTTTGTCAGCTCTGCGGACTGGCGGTGCTTTGTGATGACCAGGTGCTGCGCCAGGACATCCGGCGTTGCGCGCGGGCCGATGCGCCGCACCAGTTCGACGAGCTCGCTGACAGCCTCGAAGGATATCGACGGTACCTCGGCCGCCGATCCGCCGTTGATAACCACGCCGGCCGTCTCGACAGAACCATGGACGCCACGGAAGACGTCATAGGCGAACGAGTGTGAGATCTCGCGCGGATCGATATCCTTGGAGAAATTAGGTGCGGTGAGCCACCACGGTTGGCTATTCGGTCTTGATGCCGTCTCGGAACCGACGGGATCAGAATACAGGAATGTATCGACCGGACCCGCTTTAGGCTCCCCGGTATCGAGATCGACTACAGGCGACCCTGCCAAGCATTCCGCGTGGCAGGTTCCCTCGTCGATATCTGTGGCGCAGACATCATCAGCTTTGAAGGGCACGTCGCAGATCGGGCAGCAATGAACCTCCGGATCCGACGCCGGCGTGCCTGCCTTTTCTCGGGCATGCTCGAGGGCGTTGTACAGGTCATGAATAATCTCGGCCGGATCATCCATGGGGGCCCAGTCCTTAAACGGACCCTCAAGCTTTCGAACGTCGTCGATCGCTTCCATGACGTTCGAGCCGATCCTTGCCATGTCGATGAGATTTTCCGCTGTATCGCAGCCGGCGAGCTCGAGCGCCGCTTTTAGATCTGGCGAAATTTCAGAGCCAGCCGATCCTGCTTTATCTTGGCTTCCGTCGCTGTCATCGCCGGGGCCGGCACCAGGAGGCAGGCCGCCATCCCCGTTACCATGGCCGTCATCGCCAAGCTTCCCACCGTTTCCGGGTTTATCGGTTGCGGTTGAGCCAGCGGCAGTCCCAATACTTCCATCGTCCGAAGGTCCACCGGCATGATCGCCATCTGCGCCATCGCCATCCGCATCACTTCCGCCTGCAGCTTTAACGCCGCCCGCGCTCGCGGATCCTTCGCCGCCTCCGGAGAGAGCCGCAGCGTCGCCCAGTCCGACACCTTCCGGCTGCTGAACGGCCTCGGTTGTTGTGGTGCTCGCTGCGTGTGCATCTGTCAGCCCCTTGCCGTCTGCAGCGCCGTCGACGCCATCAGATCCATCCACGAGCGACGCGGTGTCCACGGCCGTGCTGTCAGCTGATGGAGGTGTGCCTGCAGTATCGATCTCAGGCGCATTTCCGCTTGCTGCCGGCGACGATTGATTTCGTGTGGTGCTTCCATTGGTCTGTCCTTTACGTTTCGACATGCTCTTTCTCCTTGTGGTTACTCCATGCCCAATGCGGCCATGTAGGTCTGCAAAATCGTCTCCTCCTCGATGCGCTGGTTGGCATCTTTGGCGCGAAGACGGACGATGGTTTTGATTGCTTTCGTGTCGTAGCCCCGGCCTTTGGCTTCGCCGTAGACATCGGCCTTGTCAGCATTGATCGCCTTGCCTTCTTCCTCGAGGCGCTCGATGCGTTCGATGAATTGGCGCAGCTCGGCGGCCGCGATCGTCTCAGTGCTTTCATTTTTGGGATAGCTCACAGCCGCACCCCTTCCTTGAAGCGGATACGGAGGCGATAGACGACAGAGCTATACTGGGGAGCAGGTCCGAATTGTGCGCAGATCGCAGCGCCAGACGAGGTGTGGTAGGTGCCTACGTATGTCTTTCCCGAGTGTCGCCAGACATTCGCCCAGACGACACGCAGGCCACGTTGATAACTGATGCGCAACTGCTCGATGCTATCGACGTCGATCATGCTGCAGCCCTCCCCGAGCTCGCAGCGCGCACAATCCATTCCATCCACCACCAGCCCTCTTTCGTCGGGCCAACGATGCCCGATTGATCGATTGAAGTGTCATGAGTGATGGACGCGCCTTGGCAGATAACAACGTGGTTGACGCCGTTCGTGCTCTGGCCAGTCAGCATCCAAGGAAGACCGCCGGAAGATTGCTCCCCGATTTCGAGGATCCGCTCGAGCTCGACGGCCCCGTCGAACGCAACGCCGATCATGAAGATGCCGTTCGCGGTCAAGAATTCCCGCATGCGATCGTTCGCCAACCCATCATCTGGACCGTTGCAGACATGTGGAACGTCGACTGGGTCGACGCAAAGCAGGCACGCGACGACGGTGCGGAAGCAATCGCCGTAGACGCCGTTGTCCGGATCATGCCGGAACAGTTGTTGGTGACGCTGAAAAAGCATCAT
Protein sequences of DBSCAN-SWA_5 >CP036358|1426159:1431693|1430768_1431029_-|QBJ13201.1|DBSCAN-SWA MSYPKNESTETIAAAELRQFIERIERLEEEGKAINADKADVYGEAKGRGYDTKAIKTIVRLRAKDANQRIEEETILQTYMAALGME >CP036358|1426159:1431693|1428877_1429246_-|QBJ13199.1|DBSCAN-SWA MKDEAAIERQLQDKGLNAPRLTPDHIDSVIAGEYFLARSPDQPSHPAIDCLTVCVLVLQNGFTVTGESACASPENFNAEIGRDIARQKARDKIWELEGYLLRERLHQAEQCRSFAGVDENTF >CP036358|1426159:1431693|1426159_1426456_-|QBJ13196.1|DBSCAN-SWA MPDLYQPKYMWIRTQIDKADPPTVYDWMGYDGGMPIGRVRKEFHGPIKGKWQWVGWTPKPFRHSPPMPNLGYADTARLAVQMSERYWDKCNQIAHEKS >CP036358|1426159:1431693|1431025_1431244_-|QBJ13202.1|DBSCAN-SWA MIDVDSIEQLRISYQRGLRVVWANVWRHSGKTYVGTYHTSSGAAICAQFGPAPQYSSVVYRLRIRFKEGVRL >CP036358|1426159:1431693|1429302_1430751_-|QBJ13200.1|DBSCAN-SWA MSKRKGQTNGSTTRNQSSPAASGNAPEIDTAGTPPSADSTAVDTASLVDGSDGVDGAADGKGLTDAHAASTTTTEAVQQPEGVGLGDAAALSGGGEGSASAGGVKAAGGSDADGDGADGDHAGGPSDDGSIGTAAGSTATDKPGNGGKLGDDGHGNGDGGLPPGAGPGDDSDGSQDKAGSAGSEISPDLKAALELAGCDTAENLIDMARIGSNVMEAIDDVRKLEGPFKDWAPMDDPAEIIHDLYNALEHAREKAGTPASDPEVHCCPICDVPFKADDVCATDIDEGTCHAECLAGSPVVDLDTGEPKAGPVDTFLYSDPVGSETASRPNSQPWWLTAPNFSKDIDPREISHSFAYDVFRGVHGSVETAGVVINGGSAAEVPSISFEAVSELVELVRRIGPRATPDVLAQHLVITKHRQSAELTKAEELGLKAFASILIDLDEYATAEKKRLEDLAAEKPKPRPVPIDETNMELVDGPMATH >CP036358|1426159:1431693|1431240_1431693_-|QBJ13203.1|DBSCAN-SWA MMLFQRHQQLFRHDPDNGVYGDCFRTVVACLLCVDPVDVPHVCNGPDDGLANDRMREFLTANGIFMIGVAFDGAVELERILEIGEQSSGGLPWMLTGQSTNGVNHVVICQGASITHDTSIDQSGIVGPTKEGWWWMEWIVRAASSGRAAA >CP036358|1426159:1431693|1426562_1426901_-|QBJ13197.1|DBSCAN-SWA MTVELVRYSLAWRRPDVPPPGPRVLKAMLVPRGERCPPEIMALWVPGGGYSIGWELVTQQPIRRWSREAKSKARIRNLQNRMRQKYPLFAEMFIQAELDRRPAYYAAEDATI >CP036358|1426159:1431693|1426897_1428817_-|QBJ13198.1|DBSCAN-SWA MARKPKIQGGIQGIRAQFLDAQQVLDEKKELVDPDANLLRDGIKAGQWEGAPHHNMPPNCPITVLGKKGETIYVINAIGELEEITRHDLPTLLRIFSPFVNYVFWAWPAWSKAKGEPGSEGYVPPKVERVQRDQAWTAIIGEAGRKGLFDPQNNVRGRGGWKAQDKFIWHSGKHLFSVDIKTDKNNRATDWQLQAAKPGDYDGFFYAQDNDTVHPWQTPIGVNDSPAHIILHDLKSWKWERPYIDPIFFLGWIGSAFLSGALDVRPIMFTMGGAGTGKSTLHGILRALFGPALYSTANTTAAGIYQNIRQDSRPVAVDEFERKAQGQKEQAIIELARQSYSGAKGYRGGANGDGTEFELRSSFIFSAILHPHLGVQDRTRMIILNLNALDKDANATQPLIKEEWGRMILRQIMDGFHDFYWHILPKWRQILSDQRLALDPRAIDTYGTVLACAELLVGEQGLVDAGMHQDPTADGVRLDLDMLVETIEAATSADRAEQVPKWQEVIEKIMGAKVDAYKAGERQTVGGIIEALENDQRTDMLIEDARARLALIGLGLREKGKPCRGYALAIPHNDDNLNRVFADGEFNHGGWTLALKQAPQTIVPRELSKADKTVKINRVAKTVTLVDLAGYDERMEADR |
8 | Agrobacterium_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1800516 : 1847820
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP036359|1800516:1847820|DBSCAN-SWA CATGAGCAATACGACTGAAGAGCTTCCGGACGACCTTGCCAGTGCCCTTGCACTGCTGGCCCAGGAACGCGCCCGGCGTGTTGCAGCCGAGGCAGAAGCAGCGACCGCTAAGGCGGAAGCCGCCAGCGCAAAGGCACTCGTATCGCATTCCGAAGCGCTGATCGCACGGCTGAAGCTGGAGATCGACAAGGTTCGTCGCGAACTCTACGGCAGCCGGTCCGAGCGCAAGGCGCGGCTCCTGGAGCAGATGGAACTGCAGCTCGAGGAGTTGGAAGCAGACGCGGGTGAAGATGAATTGGTCGCGGAAATCGCTGCCAAAGCTTCAACCGTCAAAGCCTTCGAGCGCAGGCGTCCGTCGCGTAAGCCCTTTCCTGAACACCTGCCGCGCGTGCGCGTCGTTATCGCGGCTCCCGCCAATTGCGCCTGCTGCGGATCGGCCAAGTTGTCGAAGCTTGGCGAGGACATCACCGAGACCCTGGAGGTCATCCCGCGTCAGTGGAAGGTTATCCAGACGGTGCGGGAGAAGTTTACCTGCCGCGCGTGTGAGAAGATCACGCAGCCACCGGCACCCTTCCATGTGACGCCCCGCGGTTTTGCAGGACCGAACCTGCTAGCGATCATCCTGTTTGAGAAGTTTGCCCAGCACCAGCCGCTCAACCGCCAGAGCGAGCGCTATGCCCGCGAAGGTGTCGACCTCAGCTTATCGACGCTGGCCGACCAGGTCGGAGCATGCGCCGCGGCGTTGAAGCCCATTCACTCCCTTATTGAAGCGCATGTCCTTACCGCCGAGCGACTGCACGGCGACGACACGACCGTGCCAATCCTGGCGAAGGGGAAGACCGATACGGGCCGCATCTGGACCTATGTCCGGGACGATCGGCCGCTTGGAGGGCTCTCACCACCGGCCGCCCTTTATTATGCCTCGCGAGATCGACGGCAGGAGCATCCGGAGCGCCACCTGAAGACCTTCACCGGCATTCTGCAGGCCGACGCTTATGGCGGCTACAATCCACTGTTCAAGGTCGATCGCGATCCCGAACCGCTGCGCCAAGCACTCTGTTGGGCACACTCGCGGCGCAAGTTCTTCGTGCTCGCCGACATTGCCGCAAACGCCAAGCGTGGAAGGAACGCCGTGCCGATCTCGCCCATGGCGCTGGAAGCCGTCAAACGGATCGACGGCCTGTTCGATATCGAGCGGGAGATCAACGGGCTTGACGCCGATCAACGCCTGGCACGTCGCCGCAAGGAAAGCCTGCCGCTCGTCGATGATCTGCAGGCCTGGCTTCAAACCGAGCGGGCAAAGCTGTCACGCAGCTCTCCGGTCGCAGAGGCAATCGACTACATGCTCAAGCGCTGGGATGGCTTCACGTCATTCCTCGCCGACGGCCGGATTTGCCTGACGAACAACGCCGCCGAGCGAGCGCTTAGAGGCTTTGCTCTCGGCAGAAAATCGTGGCTCTTCGCCGGATCAGACCGCGGTGCGGATCGTGCCGCCTTCATGGCCACATTGATCATGACGGCAAAGCTCAGCGACATCGATCCGCAGGCGTGGCTGGCCGACGTTCTTGCCCGCATCGCAGACACGCCGATGACCAGACTGGAGCAACTGCTTCCGTGGAATTGGACACCGACCCAGAAACCCATGGCATTGGCATCATGATCTCGAATGAACGCATCGCACGGCTTCACATCAAGCTCGATCATATCAAGCCAGTCATCTGGCGCAGGGTCGAGGTCCCGATCACCACAAGCCTGAAGGGGCTTCACGACGTCATCCAGGCCGTCATGCTCTTCGAAGATTATCATCTCTTCGAGTTCAATGCGGGTGGCAGGCGATATGCGGTTCCCGATCCCGAATGGGACCTTGGCCGCGAGACCTATGCCGCACGCAATGTTCGGATCGGCGCTCTTGTCGAGCGTGGCATCGAGACCTTGGACTACACCTATGACTTTGGTGACGACTGGCGACATTCAATCACTGTCGAGGCTGTCACGGACGCCGATCCGGCAGTCGAATATCCGCGCTTCGTCGAAGGTGACCGGCGCGCCCCTCCGGAAGATGTCGGCGGCTTGTCCGGCTTCGAGGAATTCCTCGATGCAATGACGAAGCCTCGCCACAAACAATACCGTCAAGTCGTGGACTGGTATGGCGGACGCTTTGAGCCGGAAGATATCAGCGTTGCTACGATCAATGAAAGGTTGGCAAAACTCGCACGCCGCCGAACGCTCGGAAAAGCCGGTTTCGCCAAAAGCCAGAACAAGCATCACTGACCGCAAACCGATCCAGCCGCGGCCTTCGCCGGATGCTTACTCTTTGCCCGCGCGTCGTCAAGGCTCATGAACCAGTGAGCGTTCAGGCACTCGGCGCGGAACTTGCCATTAAAACTCTCGATGTAGCTGTTGTCGGTCGGTTTGCCTGGCCGCGAGAAGTCAAGCACGACACCTTTGTGATAGGCCCAGAGGTCAAGGTCTCGGGAGATGAATTCGCTGCCATTATCCACACGTATGGTCGCCGGATATCCGATCTGTCGGCAGACCCGCTCCAATGTCTGAACGACATCCTCGCCTTTGTAGCTAAAGCGAGCATCCACCACTGGAGAGAAGCGAGAGAAGGTATCAACAACCGTCGGAACCCTGATCTTGCGACCAGTGGCCAGTTGGTCATGAACGAAATCCATCGCCCAGACATGGTTGGAATGGCTAGGCTCCGTGCGGTCAGCCCGCAGCTTCGCCTTGACCCGGCGCTTGGGAACCTTGTTGCGGAGCTGGAGGTCCATCTCCTTGTAAAGACGATAGATTCGCTTCGGGTTCACAGACCAGCCTTCACGCTTGATAAGGATATGCACACGCCTGTAGCCGTAGCGTATCCGCGTCTGGCAGATGTCCTTGATCTTCAGCTTTAACTCGGCCTGCTCGCCACGCCTGGACTTGTAGACATAGAGCGACCGGTCGACTTTCAACACCGAGCATGCACGCCGGATCGAAACCTTCCAGTCCGCCTTGATCGTGTCGACAAGCTTGCGCTTGCGGGCAGGCCTCAAAGCTTTTTTGAAAGCACATCCTGCAGCATGGCCTTGTCCAGCGACAGGTCGGCAACAATCCGCTTCAGCTTGGCATTCTCCTCCTCAAGCTGTCGTAGCCGCTTCATCTCCGAGGGCATCAAGCCCGCGTATTTTGCGCCAATCGTAAAACGTCGCGTCCGAAATTCCCGCCTTGCGGCAGACCTCGCCGATGGGCGTTCCATCCTCTGCCTGCTTCAAAACAAACGCGATCTGCGCTTCCGAAAACTTCGAGGCCTTCATCGAATTCTCCTCTTCTTCCCACAAGGAATCATAAGTGGAAAATTCCAGTTCTAAATGGCCTAATTTATTGGGGGCACGTCAAGCAAACGAAATGACTTCGTATCTTGTCCCGCTTTTGCATTTCGGTATAGGTCAAGTCCGACATGTAGCACCGACTAGCGAGATGTTTGCTCGGCCATGAAGAGGATTGATTATGCGTATTGTAATGGTTGGATCAGGTTACGTAGGCTTGGTTTCAGGCGCATGTTTAGCGGATTTTGGCCATAATGTGATTTGCGTCGATAAAGACGAAGAAAAAATCCGAGCTCTAAAAAGAGGCGCCATACCAATCTTTGAACCCGGCTTGGACGTGTTGGTAGACACTTGCGTGAAAGCTGGTCGACTGTTCTTCACGACAGATCTAGCGGCAGTCGTTGCCGAAGCCGATGTAGTATTCATCGCTGTCGGGACACCATCACGCCGTGGTGACGGTCATGCGGATCTTGGTTACGTTTATGAAGCTTCTCGAGAGATCGGCGCCGCCATAACGGGGTTCACCGTCATTGTCACCAAGTCTACGGTGCCCGTGGGTACTGGCGACGAAGTTGAGCGTATCATTCGGGAAACTAATCCGAACGCCGATTTCGCAGTTGTGTCTAATCCTGAGTTTCTGCGTGAGGGCGCTGCAATCGACGACTTTAAGAGGCCTGACCGCATTGTTGTGGGGCTCTCGGATGAACGTGCACGTCCTGTCATGACTGAGGTTTATCGTCCGCTTTATCTTAATCAACTACCTTTGCTCTTCACCTCGCGACGCACCTCGGAGCTGATAAAATACGCCGGGAACGGCTTCCTTGCGATGAAGATCAGTTTTATCAATGAGATAGCTGACCTTTGTGAGCGCGTAGGTGCCGACGTCCGAGACGTTTCTCGCGGCATCGGGCTTGATGGGCGTATCGGCGCGAAATTCTTGCACGCTGGTCCCGGCTATGGTGGCTCTTGCTTTCCGAAGGATACCTTGGCTCTCGCCAAGACGGCGCAGGATTATGACGCCCCGCTCCGATTGATCGAGACGACGATTGCCGTTAATGACAATCGCAAGCGTGCCATGGGCCGTAAGGTTATTGCGGCCGCTGGTGGTGATGTTCGCGGTAAAAAGATAGCCGTGCTGGGTCTCACCTTCAAGCCGAACACCGATGACATGCGTGAAAGCCCGGCGATTGCGGTTATACAGACCCTACAGGATGCAGGCGCGATCATTTTTGCTTATGATCCAGAGGGCATGGAAAACGCTAGGCAGATTATGGATATCGAGTTGTCTAGCGGCCCCTATGAAGCCGCGCAAGATGCGGATATTGTGGTCCTTATTACCGAGTGGAACCAGTTTCGCGCCCTCGATCTTACTCGCTTGAGGGCAGTGATGAAGTGTCCGATTTTCGTTGACCTCCGGAATGTTTATCGTGCGCGAGAAGTCGAAGCTTATGGATTCACTTATACTGGAATCGGCTTAGCGTAACCTGGACACTATTTGACTTGACAGCCTGAACAGAGGGGTGTGATTGTCTAGAGCGTTTCCGGGCCTGAATAAATCTTTGAGGCTTTGCAAATCAGCCGCTGCCTGATTCCCATGTCCGTTTTCGATGATTGCGACATAGATTTTTGTCTCGCCTGACCTGCCACGGGTAAATTCCTCCAGATCGGATTAGAGTCCGACCTGATTAAGGACGGACGTATGAAGAAGCAGAGATTACGGAAGAGCAGATTATTGCGGTGCTGAAGGAGCATGAGGCTGGCGCGAAGGTGGCTGAGCTCTGCCGCAAGCACGGTATTTCGGAAGCGAGCTTTTATAACTGGAAAGCCAAATACGGCGGTATGGAGGTCTCCGAGGCGAAGCGTCTGAAGTTGCTCGAGGAGGAGAGTGCTAAGCAGCGCTCCGCGAGCGTCTGGCAAAAAATGGTTGGGCATGCCGGCAAGCGCGAAGCCTTCGCGCATCTGAAGATGGGTCTTTCGGAACAGCGGGCCTGCCAGGTCATTTCCGCCGGCCGCAAGATGATCCGTTATCGGTCCAGTCGCCCGCCGGAGGTTGAACTGCGGACGAGGCTGCGCGAGCTCGCCAACGAGCGACGGCGTTTCGGCTATCGCCGGCTGTTCGTCCTGCTCAAGTGGGACGGAGAGCCGCCAGGCGTCAATCGCATCTACCGGCTCTACCGCGAGGAAGGGCTGTCGGTCCGCAACCGGAAAGCCAGACTGCGTGCCGTCGGCACCCGTGCGCCGATCGTGGTGGAAGCGAAGGCCAATGCCCGCTGGTCGCAGGATTTCGTCCACGACCAGTTCGCCTGCGGAAGGAGGTTTCGCGTGCTCAACATCGTCGATGACGTGACGCGCGAATGCCTAGCGGCAATCCCCGACACAACGATCTCCGAGCGTCGGGGCGCCCGCGAAGCCGCCGAGATCGGCGAACTGCGAGTGAAGATCGGCGACGACATCAGGATCGCTGAGGCTGTCACCACCGCGGGCTAGACAGCGTTCTACCGCCTCTGCCGCTGTCGTGCGCAGGACGATGTAATGCAGCGGCCGGGCAAGTGCCGTAAAGGCCGGAAGCCAGTCCGGTCGCACGACGCCATCGAGGATCACGAAGTAGCCCGCCTTCGCGTAGCGACCGGCAACGTCGGCGGCAATCTGCATGATCATCTGGTTCTGCTGATGGGATTGCGGCAGCCAGGGATCGATGCGGCCATGCTTGATATATCCCCATAGGTCATCGCTGTGGAAATGCACCTTTGGCACACCAGGCAAGTGCGCGAGCGCGTCGGCGATTGCGGATTTGCCAGAGCCGGGATGGCCGGAGAGAAGGAGTATGTGGCCCGATAGGTTGTTATGAATGGTCATGGCACAATCGTTCATGCTGCCGGATCGATCGTCAACCGCCTGCTGCCGATCTCGCTGTTCGCGTGATAAAGAGCTCGAAAGTCTGATATTGCGCAATTTTGGGAAAGGATCAGGTGCGATTTCGATCTTTAGCGACGATGCTCGCTCGGTACCGTAACTCGCTGTTATAGGTGTAGGCGATTTACGACAACGGAATGTAATCGTCTTTGTTGACGAGAACACCAAACGTGACGTCGTTGTCGACGAACTTCACGCCGGCTTGTTCTAAAACAAAGCGTATATTCTCGATCGTTGATTCCTTGAGGTCCTCACCTTTTTCTAGGCGGACTAGGGTGTTTGTCGATATTCCGGCATGTTCCGCAAGTTCGACAATTCCCCATCCTAAAGCGGCACGCGCCATCCTCACTTGAACAGAAAACATGAAAAATCGCCATAATTTAGCGAAAAATATTGCTTCGCTATCAATTCCAGTTACTATCAGATGTCTGCGTTAAGCAGGCGGCCGAGTGAGGCGTCCTACCGCCTAACTCGACCTGACCACAATCTAGCTAGCAGGAGCTTGGATCATGGCTGATTCTGAGAATAGCAGAACTTTACCTAAAATTTCGTGCGCAAATGCATTCCCCAACGAGACGTTTGTCGACAACCTGCCAAGCGTGATCAACCGGCGCAATCTTTTGCCGCTGGCGGCACGGATCCTGCCGATGCTGCTCAACGATCTTCCCTCGCGTACCATTGCCGGACCGGTGCATGCCAAGGAGCTGTGGCCGGACTGGTACGACATGTATCGGCAGCGCCTTGCCGCCGAACGCGAATGCCAGGAACTCGAGGCGCGACTGCTGGAAGAGACCGGTGGTCGTCCTTTCGTCGTGATTACCGTCGACGATGGCGGGACGTCGGTCGGTACCGTATCCTCTTTCGAGGAGATCAGGGAACTGGCGCCGCGGATTGGCGCCGATGCGGCAGAGAGCGCACGTCTGGAGTTGCTGAGGCTGAGACGGAGATGGAATGCGGCCGACCGTCGTATCGGCTACAGCGCAAGCCTTGCGAAGGCGCAGGATCTTGCCCGGTTCGAGGGCATTGCCGGGCGCGTCCTGATTTCGCTGCAGCCCTATTACATCCACGATATCGCCGCCAAGCTTCATTGCATGCTGGTGATGTACGATCCGGAGCTTCGCAAGGAAGAGACCCCCTGGCCGGAACTGCGCCGCATGCTGCGTGAACTGATCCAGCCTTACTGGTCTGTGATTGAGCCGCAGTCGCGTATCCGCCTGCTGCGGCCGAAGACGCGCGAAACACCGTTTCAGGAGGAAAGGGGCAGGATCGCCGTGTAAGCGGCGTGAAGGGCTCGGTAGGGCGCGGCAACGGTCTTTGGTGAACAGGATCGTTGCCGTGCCTTCGGCGCCCTGCGACGGCGAGACAGAAGCACGTCCTGCCTATGCGGCAAGGCCCGTTACCAGATAAGCCGACAGCAGAAGCAGGGTGATTATTGTGAAGAATATCAGTCCTGTCACCAGTTCATCGAACCGGCCGTCGCTTTCATTGCGGTCCCAGTCGTAAGCCAAAGCCGACCTCCATCGTTGCTGCGAGAGAAGGGTGGCCCGGTATGGTTAATCCAATCCTAACGGCGGTAGACTGAATGTTGTCGGGTATGATTTGGTGGCTGGGCCTTGGTGACTGATTTCATCGCGAGACTGATTGAAACAGAGGCACAGAAAAACCGTCTTACGCCGGTATTACGGGAGTTTGAGGTGGCAACAACGGTAACGGCCAAAGGGCGGGTCACGATTCCAAAAGGCGTACGCGAGCTTTTAGGGATTTCCCCAGGAAGCTCGGTCGATTTTGTTCGGGCCCCCAATGGAAGGATAGTTCTCGTCAGGGCAGACAAGAAACAGCCACTCACGCGTTTCGCCAAGCTGCGCGGACATGCTGGTGAAGGTCTGGGTACCGACGCCATCATGGCGCTGACCCGTGGTGACGAGTGACGCTTGTCGATACAAATGTCCTGCTTGATCTTGTGACGGACGACCCGGTTTGGGCCGACTGGTCGATTGAGCAGCTCGAACTGGCGAGCGTTTCGGGTCCGCTGTTCATCAATGACGTTGTCTACGCGGAACTATCTGTTCGGTATGAGCGGATAGAAGAACTTGATGTTTTTATTGATCAGGCGGGGCTGAAGTTTAGCCCTTTCCCTCGCGCGGCGTTATTTCTGGCGGGAAAGGCCTTTACCAGATATCGCCGCGGTCGCGGAGTTTCGGGAGCGGGTACGACGCCGCCCGGTTTGACCTTGACACGAGCTCCTTGGAGGATAGACGGGCTGTGCTGTTATTTATGGGAGTTAGACATGAAAACGCTGGCAGCCGTTTCGGCTTTCGTGGGCAAGACTTTTGCCGTCTGGGTCATCCTGTTTGCCGTCCTCGGTTTTTTCTTTCCGGATACGTTTAAGCAGATCGCGCCGTGGATCGTCACGCTACTGTCAATCATCATGTTCGGCATGGGTCTCACAATTTCGGTCGATGACTTCAGGGAGGTCGTGAAAAGGCCGTTCGATGTGACGATCGGCGTGCTCGGTCAGTTTCTCATCATGCCGCTTCTGGCGGTGCTGCTCACCCGCATCATTCCCATGCCGCCGGAAGTCGCGGCGGGCGTCATTCTGGTCGGTTGCTGCCCGGGCGGCACGTCTTCCAATGTCATGACTTATCTCTCTAAGGGCGATGTCGCGCTGTCGGTCGCCTGCACCTCGGTCACCACGCTTGCCGCGCCGCTGGTGACGCCATTCCTCGTCTGGCTGTTCGCAAGCCAGTTCCTGCCCGTCGACGGTTGGGCGATGTTCCTCAGCATCGTCAAGGTCGTTCTGGTTCCGCTGGCTCTGGGTGCCGCCCTGCAAAAGCTGCTGCCCGGCCTTGTCAAAACCGCCGTACCGGCGTTGCCGCTCGTCAGCGTCATTGGCATCGTCCTCATCGTCGCGGCGGTGGTGGGAGGATCGAAGGCGTCCATCGCGCAGTCCGGCCTGATGATTTTTGCCGTCGTGGTTCTGCACAATGGCCTTGGCTATCTGCTCGGTTACCTCGCGGCAAAGGCAACGGGCCTGTCGCTTGCCAAGCGCAAGGCCATTGCGATAGAGGTCGGCATGCAGAATTCGGGCCTTGGCGCGGCACTTGCAACGGCCTATTTTTCGCCTCTCGCCGCGGTTCCAAGCGCCATTTTCAGCGTCTGGCACAATATTTCCGGCGCAATCCTGGCGAACTGGTTTTCGGGCCGGGTGGACGCAGGCTCCGCCAACAAAACCGCCTGAAGCTCCGCATTTGTCAGGCCGCCTGTAGTTCACGCGGCCTGAACTTGGCTTAGAACGGCATAGTCTGGCGTGTTACGGCCGGTGGATTAAGCCGGGCAAAGCCCTCTTGGCGCTGATACGGGAAATGTGGATAGGGAGCAGTCGTCTCGCTGGCCTCGTCTAGCGTCGCTACCTGTGCGGACGTCAGTTCCCATCCCAAGGCATCAAGGTTCTGGCGAAGCTGTTCCTCATTGCGCGCGCCGATGATGACAGTCGAAACAGTCGGGCGTCTGAGTAGCCAGTTGAGGGCGACCTGAGGGACGGTCCGGCCTGTCTCCGTCGCAACGGCGTCGATTGCATCGATCACGCGGTAGAGCTCTTCATCATTGACCGGCGGGCCGAAACCGGCGGTTTCGTGCAGGCGGCTGCCTTCAGGCAGCGGCGTGCCGCGGCGGATTTTTCCGGTAAGCCGCCCCCAGCCAAGCGGGCTCCAGACCATGGCGCCAAGGCCCTGATCCGCCCCGAGCGGCATCAGATCCCACTCGTAGTCGCGGCCAACAAGCGAGTAATAGACCTGGTTGACGACATAACGCGGCCAGCCATATCGATCGGCGACCGCGAGCGACTTCATGATCTGCCAGCCGGAAAAATTCGAGACGCCGGTATATCGCACCTTGCCCGACCGGACGAGCGTATCGAGCGTCGACAGCACTTCCTCAACCGGCGTGAAGGCATCGTAGGCATGGAGCTGGAGTATGTCGATATAGTCGGTCTTTAGGCGGCGCAGGGCGTCATCCACCGCGCGCAGCAGCCGGAAGCGCGACGACCCGGCCTCGTTCGGCCCGTCACCCATCGGCAGGCTGGTCTTCGTAGAGATCAGAACTCTGTCGCGCCTTCCCTTGACGGCTTCGCCGAGCACTTCCTCCGATGCTCCATTGGAATAGACATCGGCGGTATCGAACAGGTTGACGCCTGCCTCCATGCAAACATCGATCAGGCTTCGGGCTTCTGCGGCATCTGAGTTGCCCCAGTTGCTGAAGAGCGGGCCGGAGCCGCCGAAGGTTCCGGTGCCGAAACTCAAGACCGGCACTTTCAGACCGGATTTGCCAAGCTGGCGATATTCCATGAAACTTTCCTTTGTTGTTTCGAAATGGTCGTGGGCGGCGCGTCGCGGCGTGGCCTTCTGCCCGTCCGTTCGTGATCTGACTGATGAAGACTGCGGTGTGTTACAATCGCTGTCGTAAGATAGACGCCTGCTCTTCAATGAAATAGAATGCCGAAACTATTATCACTTGTGAGATGAAGTCATCATGGCGCGATTGGAGATCAACCGGGCAGGGGAGATGGAGATATTCGTACGGGTCATAGAGCTTGGTGGTTTTTCCCGCGCGGCGGCTGCGGCTAGCATGACGCCTTCCGCCGTGAGCAAGCTGATCGCACGGCTGGAAAGCCGCCTTGGTGCGCGCCTCCTCAACCGTTCCACCCGTCAGCTTCAGATCACCCCCGAGGGCTGCGCCTTTTATGAGCGGGCCACCCGTATCCTCGCCGATCTTGAAGAGGCCGAACGCGCTGCCGGGGAAGGTGAACGGCCTCTCGGCCGCGTGCGGATCAACACCAGCGCATCCTATGCAGCGCATATTCTTGCACCCATCCTGCCACGGTTTCTGGCGCTCCATCCAGGCATCACGCTTGATATCGTCCAGACGGATCTGGTGGTTGACCTGCTTGCGGAACGCACGGATGTGGCCGTGCGCGCGGGACCACTGAAAAGCTCAACGCTTGTTGCCCGCAAGCTCGGTGAGACGAGAATGATCATCGCGGCCTCCAGGGATTATCTGGAGCGTCATGGCATTCCACAGACAATAGCCGATCTCGAAGATCACAACCGCCTTGGATTCGGCTACGCCCGATCAATCGACGGCTGGCCGTTGCGAGACGCCGGAAAGACCGTCGTTATTCCGGCGACGGGACGCGTGCAGGTGAGCGATGGCGAAGGCCTTCGCCGTCTTGCCCTTGCCGGTGTCGGTCTGGTTCGCCTTGCGGCTTTTACCGTGCGGGAGGATATCGCCGCCGGCCGTCTGATCCCGGTCCTCGATCATCTCGATACGGGTGAGACCGAGATGTTTCATGCCGTTTATGTCGGCCAACGGGGACCGCTTCCGGCACGTATTCGGGCCCTTCTCGATTTTCTCGGCGAGTATGGGCGGGTGAAGTGACGTTCCGCGCGCCACAAAAGGCGTGAGTGGGATTAAGGCCTCAGGCTGCCGGAAAACGGTAGCAGCCTGAGGAGCGAAGGCTATCGTTTCGGACCTGAGAGTGTGAAAGCACCAGCCGATTCCAGCCTTACGCGCGTGTGGCCAGCAAGTGCGCTTGGTATCGCACCGCAAAGGCTACCCGTCGTCACGACATTGCCAGCCTTCAGCATGTCGCCGGTGTTGAATGCCGAATTGGCGAAGGCGACAAGCGGCGCAAGCGGATCACCATTCGGATGCTTGACCTTGGCATCGAACAGCGACACCGTTCCCTCTGTAACGGTGAGCTGCAGCAGGTTCTCGCCGTTCCCGGTAAAGACCTCCACGATGCCTTCGTCCAGTTCCGGACCGATGACGAAGCCGTGGTTGGCGAGGCGATCCGCCAGAAAGAGCGGAAACGGCACTTGGTTTTTCTCTACCAGCCTGTAGCGGAGAAGTTCGGCGCCAAGATGGACCATGTCGATAAATCCCGTCAGATCGGCCCTTGTCATCCGCCCAGCGGTTGAAGCAGGGATGTCGGCGGAAAGCGTGAAGCAAATCTCGACTTCGACACCTTCGGTGCCATCTTCAGGAAAAGAAGCGAGATTGGCGTCGTTGACCCTAACATAATCGAAAATCGGTGCCCCAACGGCTTCGCCATCCGGCCGGATAGCCAATTTCCAGCCGACGACCGGCGTTGCCCAGTTTGCAGCAAAAGCCCGCTGAATCTCCATGGCCTCTGAAAAGGTGGCGGGAACAAGGCTCTCCGCCTCAAGCTCTGCCAGAGGAATTCTCGCGCCCTCTCGGCTTGCCTCTGCAAAATTCTCCGCCAGTTTTCCGATTGCCGTCACTCCGCTCATGTCTCCTGATGATCTCATGCACGAAATGCTGCCACGATTTGGCGCGGTCTGCCAAGGCCATCAAGGCTGAGTACAGTTCCGACTTTGCGGAGGACGATTCTTCAGCATCGCTGCGGAATCGTCACGACTTTTTCAACGACATGAGGCAGCCGGCCTATCATCGTTTCGTAGGGCTAAGGGGGCAGAACAACGATGGATCGCGCCGCTTTAACGCGGGAAGCACGCTAACCACAAATCGCGCCGTAGCAAATGCGTCCCGATCATTTTTTTTTAAGAGAAGTAAAGTCGAGCGGTATTGCCGCTCGATTTCCTTCCTCGATTACGCGACGTCCGCTTCGAACTGGATGCTGTGGAGGCGCTTGTAGAGACCGTCATTGGCCAGCAATTCACGCCGGCTGCCCTGTTCGACTACTTCACCGTTGTCCATCACGACAATCTTGTCGGCGCGATTGATAGTAGAAAGACGGTGAGCGATGACGATAGAGGTTTTGTTAAACGTCAAACGCTCCAAGGCGTCGCGGACAAGAGCCTCCGACTCTGAATCCAGGGCGCTCGTCGCTTCATCGAGAATCAGAATATCCGCGTCCCGCAGCATTGCACGAGCAATAGCCACACGCTGTCTCTGGCCACCGGACAGGCCGGAGCCGTTCTCTCCGAGCTTGGTGTCGTACCCATCCCGCATCTTGATGATGAAATCATGCGCATTTGCGGCCTTGGCGGCAGCGATGATCTCTTCCTCCGTCGCATTGTCACGACCGACCGAGATGTTGTGCCTGATGGTTCCAGAGAACAGGAAGGTTTCCTGGCCCACGTAAGAAACATGTTCGCGCAGGCTGGAAAAGGATACGTCGCGCAGATCCATACCATTGATTTCAACGGAGCCGTTCTGCGGGTCGTAGAGGCGCATCATGAGGTTGATGATGGTAGATTTGCCGCTGCCAGAAGGGCCTACCAGCGCAGTCATTTTACCTCCTTCGAAGAGGACGCTGACATCCCTGAGAACCGGCTGACCTGAAACGTACTCGAAGCTGACATTCTTGAGCTCGACATCGCCACTTGCCTTGGGGAGCGGCATGGCATCCTTTTTTTCTGCGAGTGTCAGCGGCGCGTCCAGCACTTCGAACATCATGCGCACTCCGATCATCCCGCCCTCGATCTGCACGCGCATGCGAGCCAGACGTTTTGCCGGTTCGTAAGCGAGAAGTAGTGCGGTGATGAATGCCATTAGATCACCTGGCGAGCCGCCTTTCTCAAGGACTGTGTATCCAGAGACGGCGATAACCACCGCGATCGCAAGGCCTGACAAAGTTTCCATGATGGGACTTGTCGCTGCTTCAAGCTTCGCAATGCCGTTGGCGCGCTGCTCGACATCTGAGACCGCCTTATTCATGCGTTGGCGCATAAGCTTCTCCAAGGAGAACGCCTTGATGACGCGCACGCCGATGCTCGTTTCCTGAACCACGTTGACGATTTCACCGAGGGAAGCAAGTTCTGCTTCCATGATCTTGCGCACGCGGCGCAGAACCAGGCGCACGCTAAAAATGGCGAGTGGTCCAATAATCATCGAAATAGCGCTCAACAGAAAATTTTGAGCGAACATTACAATGATTAACCCGAGAAGCGAAAAGAGGTCGCGTACGAACGACGTTACAATTGTGTCTATGACGCTACGAGCTGATTGAGCATTATAGGTAACACGGACCAGCAATTCAGAAGACGGCAAATTCTGAAAAAAGGAGACCCCTTGTTTCAAAAGTCGGTCGTAAATTCTGCGCTGCTGCTCGGCCACAATGCTGTTGCCAGCTTTGCTCAGGTAGTAGCTCTGCACAAAGGTCGCGAGTCCCTTGACTATAAATATAACCGCGACGGCAAATGCAATTTCGAGAACAACGTCGAAACCCTGCTTCAAATAAACACTGTTGACGATATCGCGCATAATCCATGCACTAAGAGATGTCATGCCGGCAACTATCAGCATAGCCACGATTGCCGCGGAGTACCAACCAACGTGTTTTTTGAAATTTTCTTTAAAAAGTCGCGTCAGCAACTTCTTGTCGTTTGGCGACAAAAACGATCCGATCACAGAAACATCCTTATTTGACGTCGACGCTCGCGCAGATTTTCCATCGCACATTCAGGCGCTTGTTGTTTGTGAGGAATGGCTCTCCAGTTTTCAATAAATGCCATGAGACGCCCGATAGTTTTTTTGTCACGCTCTTCGAGTGGAATGCGCAGAAAAGCTCAGCGGAATTAGGAAACCTACATGGCTTCATCCAGACCACGCCAATAAGCCTAGGGGCCTAACGTGTCAACATTGCGCTTTGGTATCCTCGAAACACGCACTACCGGCCCGCCAAGGGTAGCGTCGTGAATAGAAGCTTTCGGGCTGTGCCTGAATTAAGTGGGTGTTCCTTGTACACCGATTGATGTTATGAGGAGCTACTTTTCTGTAACGATTACTGACGGCTCATTAAGTGGTTTATTTTGTTTTCAATATTAATGAGGCGCGTACGTATTTCCGGCGCCAGAGCCCCCATTCGATCGCCATCTTTCTCCGTTGCGGCGCCCAGCTTAATTGCCTCCAATGCAGAAGCGAGAGGCCGAAGCTCCTCCATAGTGAACGTGTGGCCGCTACTAATATCCCAAGACGACTGGTAGCTGACCGCCCTATCATTCAAGGCGTCGCGAACCAAAGTCTTGTATCGATCCATTGTCAGTCTTTTATGCTTCTTTAGAATCGCACTGCGAGATGCGATATTGGCGATCATATCCGCGATTTTGGCCAAAGCTTCAGACTCTGGCGTTCCGACTGACAATACCTCTGCTAGGAAGCCCTTCGTCGTGGATCCGCGAGAGTCATGCGAGATCGCTATTCCTGGTACACCCAGAGATGCAGCGACACCGACGCCGTGCACACGCGGGCCAACCACGAGATCGAACTGAGAATATATGTGGTAATAATCTTTCGACTCGAACGAATATTCGATTTCCAGATCAGGAAAATCGCGCCTAGCCAGCGGAATTTCGTCGATATAGTGACATACAATCGAAAACTGATATTGACCTGCGTAGTCTGATAACAAGCGCCGGTACAGTGAGTTCGTGTAATTAAAGGCGTCGGAAGAGAATCCGTTCCAGATAACTGAATCGTCCGCCGTGGCATGGTAAATCAACGCAATTTTCTTTACATTCTGGATGTTGAGCTCTTGGTTGACCTTGGCGCTGAGCAAGGCCGGACAAGGGATTTCCTCGGCGGAAAAACCTTGTTTCAACACGGCCTTGAAGGTATTGTGGTCTCTCACCGTCAGTAGCTTTGATTTCGAGATTACCTCACGATAGCTTTCGTGGTAAATATCGCAGTTGCCGCCGACACCGATTATCATAACTGGCAAGTTGTAGCGTAATATCAAGGAATAGAGATCGGTCATTCTCGCGCTACACCATTCCGGCGTGCCTGACATAATGACCCAGTCGACAAACTCACAGTTCGTGTCCGGTTTCACAGAGTTGTCAAAAAATCCGAGTTTGAGGTTTGCTTCCAAGTCAACAAAGTCCACCTGAGAGTTGAAATCGCTGGGCAGCTTCGATGTTCTAAAGAGCTGGCGGTCCTGATGCGAGGGGCGGATGTCTGGATTTCTATTGTAAATGACAGGACAGTGGTCAACCTCTAATTCTTTCAGCACATTTCTGACGCCTTGCAAGATGAATTCATCGCCAGGGTTCCACTGCCGGGTTGTCGAAAAAATGATGTTTAGCTTTTTCATTGGAATAAACTCCCTCATCAAATCCAATGGAGCAAAACGGCAAGATCGCCGTTTTAAGCGTCAAGCAAACGTTGCGCGCCGGCGTCGGATCGGTTCAACGGGCTGCGTCGGTGCTACCGCAAAGCGCTGCTTCAGTACGGATTTCCCAATTATCTCTCACATTCAGCGTTCTTCGATTTTCAAGCCAGGGAGGGCAACTACGAGAAATAACCCATAACTTTTGTCGCGGAGCAGTTGCGTCGCTGGCTCCTTTACTGATTGACTATATTCTGGGACTTAGCCTGCTGAGGGCGATAATCAATTGGAACGTGCCACTTCGACAGTTTGGTAGCTGCGCGATTGACAAGCCTACCGGCCGAAATCTATATCGGCCAAATGTCGCGTGGGCTATCGCCTGGCACGTTTTGGCATTGGCGAGGTTGCACTATTCCATTCGCTGATTCAGGACAATTTTGTTCGCAGCCCCGATGACGGTCGTCGGGCCACTTCAAGGCAGGCAGGTCAGTGAATGCCCGATGTTTTTTTTGATCTATCCGAACTATATTTGAATTCCGGCGTGAAGTTCAAATATTATGGTATCGCTCGCACGGTGATGGAAGTGGCATATGAGCTGACGAAGCTTGACGCATCAGTTCGATACGTCATTTATTCGCCATTGCACCGACGGTTTTTCGAAGTTTTTCCTCGCGCGGGTGATGCGTCGCCGACTGGCGTACTTGATCCTAATATTCCAACTTCGGCTACGCCGCTGCGTCTTCGCCAGAATATCTATGATGCAAACGCCGTAAAAAAGACGTTTTACAAGCTGCTGCATAAGATCGTTCAATATCGTAATCGTAAGCGCTGGGCCACGGTTCCTGCGGGGGCCGCCAAAGATGTCGATCTACATGAGCAAATTCTGGTTGCGCTAGGGCGTCCGAAGATTATGTCTGACTATCTGATGGCTCTCGCAGAAAGCGGCACTAAGCCTATTTTTATTCCACTGCTTCACGACATGATACCTCTGCATGATTTCAGTCACCGAAACCAGTTTTCATTCCCTAGAAACTTTATCCACGACAATCAGGTAGCGATCAGTGCGGCATCGATGATCTTGACTAACTCCGAGTTCACGGCCGCTGAAGTGAAGCATTTCTCCAAGGTTGGGACGCTGCCCCCCGTACCGAATGTTATCGCTGTTCCTCTCTGCCATGAATTGCGGCCTACCAATGAGCCCGTCGAGAAACGTGGTCCGGAAAAGCCGTATCTCCTATGCGTCGGCATATACAACGGACGGAAGAATCTGGAGTGCGTCGTCGAGGCTATGCTCTTGCTGCATCGGCGAGGCGTTTCTGTTCCTGAGCTCGTGTTGGCAGGTGCGCGCCGGAAGAGGGTGGAGAAATTTCTAAAGAAAGAGAAATTTGCCCCTTTGTTCACGAAATTTCACTTTGTCCTCAACCCGAATCAGGCGGAGTTGCGGTCGCTTTACGAAAAAGCCTATGCTCTTCTTCTGCCCAGCCGGATGGAAGGTTGGGGGCTTCCCGTCAGTGAGGCATTATGGTGCGGAACTCCTGCCTTAGCCGCAGATGTCCCTGCTTTAAGAGAAGCGGGGGGCGATTGGGCTCGCTACTTCGACCCCGAGAGTCCGGAGGAGCTGGCGACCCAACTAGAGAATCTGATTGAAAATCCAGCGGAATATGCAGCATTGAAAAAAAATATCGAACTATCCAAACCACAAATGCGCACGTGGAAGGACGTTGCAGCTGACCTGCTTGAGGCAACAAAAAAAAGCATGCTAATTAAGTAGAAATTGGACTAATTTAATCGTCTTCTTCCGTATTATTTTTGGTTGGAGTCGTTTCGGGATGTTGCATCTGGTTTTCGGGTCGGCTATCGCTCATTTCCGGAGAGTATATTTGGGATTTATTCTCGGGATATTGAGTGGGCAGAGATGTATTCGCTGGAGATAAGTTGGCGAAATATTCTCGAAACCTGAATGTGGGGTCGGATGTTGGGCGCAATGTTGGTGAATAGGAGATAGATCGATTCGCCTGAATCGTTGTGTGGGGTGTGGGGCTCTGTCGATTTAGAGCGATTTTTTGGCATTGTCGTTTATGTGTTCTCTGGATCATATGAATGAATGGACAGGAAGAGTAACTGGTGATGTTGGACTATTTGAGGTTTTCTGAGGTTAAAGTTCTCTGCATCGGTGACGTCATGCTGGATTGCTTTATTTCTGGTGATGTGGGTCGCATTTCGCCAGAAGGTCCAGTCCCCGTCATGTGTGCGCGGTCAGAAGAGTATTTCGCAGGCGGCGCCGCGAATGTTGCTCGCAATATATCATCTCTCGGCGCAAGTTGTACCCTGCTGGGGGCGATTGGCAGAGACAGCAACGGAAGCAAGCTGCTTGAGCTCTTGAATGCCGTTCATGATGTGTCAGCGGAGTTTGTCACGCTGGATAAACGTCCGACAACCATCAAGACACGCTTTTTAGGCCATGGCCAGCAAATGTTGCGCGTCGATTTTGAAGACGCCTCTCCAATTTCGTCAGAAGAAGAAGATCGAGTTATCTCAACCGCAACATCACTGATCAGCTCGCACAGCGTTGTCATTCTCTCAGACTATGCCAAAGGGCTCTTAAGCGAGAGGGTTGTCAAAGCTGTTATTGCGGCCTGCCGGTCTGAGGGAAAGCCAATTATTATCGACCCCAAGACTTCCGATTTCTCTCGGTATGCTGGGGCCGCGATTGTAACGCCCAATCAAAAGGAAACGCAGGCTGTGACGGGTCTGTGGCCAGATACCGACCAGGAAGCGGAATTGGCCGGCGAACTGATTCTGTCTCGCTTTGACGTCGAGGCGGTCCTCATTACGCGTGCTGAGAAAGGAATGACGCTTATTGAGCGGGATCATCAGCCTTTCCACATCCGCGCAGACGCGAAAGAGGTTTTCGATGTTGTCGGCGCGGGGGATACGGTGATTGCAGCACTCGCAGTCGGCCTTGGTTCACATATTTCGCGCGCTGAAGCGGCGCGCGTCGCGAATGCTGCCGCTGGTATTGTAGTGGGGAAAAAAGGAACGGCGACCGTCTCTGTTGCGGAGCTTCGAGATGCATTGTCTCATGATACCTCATCCGGTGAGATAAGTCCGTCGGATAAACTTGTTGCCTGGAACGATATTCAGGCGCTAACAGAAGATTGGCGAAGCGATAATCTGAAAGTGGGCTTCACCAATGGCTGCTTCGATATCCTGCATGTTGGACATATCAGGCTTTTGCAATATGCACGGGATAATTGCGACCGGCTGGTCGTAGGTGTGAATTCAGATAGCTCGGTAAAGCGCCTCAAGGGACCGGCGCGGCCGATAAATACCGAATTTGATAGGGCGGAATTGTTGGGCGCCCTTGCTTTTGTCGACGCAGTGGTGGTGTTTGAAGAAGACACGCCGCTTGAACTCATCCAAATGATACGTCCTGACGTGTTGGTGAAAGGTGCGGACTACACGGTTGATAAAATTGTCGGTGCCGACGTCGTCATGGCGGCTGGCGGCAAGGTTGTGACGTTCGAGATAGTACCGGGGAAAAGCACGACGGCGACAATAGCGCGAAGCCAGGAAAGGGCAGTTTGATGTATATTGTAACTGGGGCGGCTGGATTTATTGGGTCAAACATTGTCGCTGATTTGGAAGCGGCGGGTCTGGGACCAATCGCCGTGGTTGACTGGTTCGGGCGCGGTGACAAGTGGCGCAACCTAGCGAAGAGGAACATCGCGGCTTATGTTTCTCCTGAGGGATTGCCGGAGTATCTTCGCGGCCTAAAGATGGAAGTTAAGGCTGTGATCCACATGGGCGCGATCTCGGCCACCACTGAAGCTGACGTAGACAGACTGATCGATCTAAACATTAACTATTCAGTTATGTTGTGGGATTGGTGCGCTGAGCATGGTGTCCCATTCATTTACGCATCTTCCGCTGCGACCTACGGCGGAATCGAGGATGGATTTGATGATAATGAGTCATCCGCCGCTCGAGCGGGCCTGGCACCGCTAAATGCGTATGGCTGGAGCAAAAAAACGACAGATGATATTTTTGTCGAACGTGTCGCCCGAGGTGAGAATGCTCCCCCCCAATGGGTTGGTCTGAAATTCTTTAATGTGTATGGCCCCAACGAGTACCACAAGGATGACATGCGCAGCGTAGCCTGCAAGCTCTTTGACGTTGTGCAGACTACGAATGAGGTCTCCCTATTCAAATCCTATCGACCCGACATCGCCCATGGCGAACAACGTCGCGACTTTGTCTATGTTAAAGACTGCACGTCTTTCATTCTTTGGCTTTTGCACAATAGAAACCGGAGTGGAATATTCAACTGCGGATCCGGCAAGGCGAGGTCGTTCGCAGATATCGTCCACGTCATGGGGGATATTTTAGGCAAGAAACTGAATATAAACTTCATTGAAATGCCCGAGGCCATTCGTGGAAAATATCAGTATTTCACGGAGGCAAATATGTCGAAAGTGGCATCGATAGGCTATAACGGCCCGCAGTCGTCGCTGGAAGAAGGGCTAAATGACTACATTAATAGTTATCTACGGCATGATGACAGGTATCGGTAAACTCAATACGTAGTGTGCGAGGAACCTTTAAGTCGAAGTGGATATTTATCTTTGGTGGAAGCCCGTTCGACCTCAAAAATTTCGGTGGCACGATTATCCATCTTCCAATTAATTTAGCCGGACAGTAAGTAGGGAAACCGTCAATGGTGCAGCCGGGTTTTGATTCCGCCCAGAGTGAGTATCTGGACCTTAAGAAATGGCTTTTCGAAGCGGCACTTCCTTTATGGTCTTCCGTCGGGCGAGACGATGTTAATGGCGGCTTTTTTGAGAAAATCGACAAAAACGGAGTTGCTGTGGAAGCTGCTCGCAGGACGAGAGTCGTCTGCCGCCAAATCTACAGTTTTTCTGCTGCGAAGAACATGGGCTGGTCAGGCGACGCTGAAGGGCTGGTTCAGCACGGGTGGAGTTTTCTTCGACGCCACTGTTTTAATGCAAACGGAACGCTGATTGCGACTGTGGACGCGGCTACCGGTTCAAAGAATACTTCGTTTGATCTTTATGACCATGCTTTTGCATTATTTGGCTTGAGTTATGTAGCTGAGACTCTCAGAAGCGCAGAGAATGCCGCAGATGACGCTCTTGATTGTCTCGAGGCGATGATCTCCGGGTGGAAACATCCGATCGCGGGTTTCGAAGAGGCGATTCCCCCCATCGTGCCACTTCGGTCAAATCCGCATATGCATCTTTTTGAGGCTTTTCTTACCTGGTTGGAAAATCCGCTCGTTAAGAATCCCGATCGCTGGCTCTCTTGTCTCAATGAGATCGGAGACTTGTGTCTGTCTGCATTTATATCCAATGAAAATGGATCGCTTAGTGAGTATTTTAATCACGACTGGTCAGTGATGCGTGACAACTCTCTGGCGCCTGTCGAGCCCGGGCACCAATTCGAGTGGGCTTGGCTGCTCACAAGGTGGGGTAAAATGGCTGGGCGGAAGGACGCCCTGATCGCGGCTCGCAGACTGGTGGCGATTGGCGAGAAAGGTGTCGACGAATCGCTTTCATTGGCTCGAAATGGCCTTGATTTCAGCTTAAATTCGTTAGATGGAGCTTTTCGTTTATGGCCGCAAACGGAGCGGATCAAGGCTTGGCTTATGATGGCCGAAACGGCGATCACGCCCGAAGACAGAGAGAATGCTTATGCGAAGGTAGCGGACGCAGCCTCTGGTCTGAAACGTTTTTTCAGCGACGTTCTGCCTGGCCTCTGGATTGACCGTTTTGACGAAAACGGAAATGTTGTAGATGAACACGCACCAGCGTCTTCTCTATATCATATAGTCTGCGCCATTGAGGAAATGCATAGACTGCTGGAGCCTCATACGCAGAGCGTGCCCGGGCTTTTTCTGGACCGTGACGGCGTTATCATCGAAGATACCGGATATCCGAGTAAAGTGGAGGGTGTCCGGCTGATTCCCGGAGCCGCGGAATTTATAAGCTCTTTTAGAGAGCGAGGATACCGCGTATTTGTCGTAACAAATCAGTCTGGCATAGGACGCGGTTATTACGATGATCTCGACTATATCATTTTGAAGGCGCATATTGGGAAGTTGCTCCGGAAGGAGGGAGCTTATATCGATGACGAGCGCCTTTGTCCTTTTCACGAAAGCGCCACCGTCGAAAAGTATCGGGGAAATCACTATTGGAGAAAGCCGTCTCCAGGGATGATTGAGGATATTGTCGTCCGTTGGAACATAGACCGAAAGCGAAGCGTGCTTATTGGCGACAAACCATCTGATATTGAGGCGGCAAACGCAGCTAAAATAGACGGTCGTCTTTTTAGCGGCAACAACTTGATGCATTTCGCCGAAGAAGAAAACATTTTATGATGGGAAATGTTTTGAAGTCTTCGCACGGGAAAACCAAGGCATGAAAGAAGGCGTTTCCTATGAAGCTTTCGAGGCCTATCTGTCGCAGCAGCGTTGTCCCGATTATAGCGGCCCAGCGACATATAAGGTGCTTTTCTGTGCGCTCACCGATGAGTTGAAAGAGTCGCGTGTCGCCGCGCTGTTTGGTTCGTGGTCAGAAAGCGAAATTGATTGGATAACGGCTCAAGGCTGGACAGCGGACGCCGGCGAGAATCCTGATATTTTGGCAGAACGCCCAGATGACGGCAAAAGTTCGACCGCCAATCTGGTGATTTGCCGACTGATGGATGCGCCACTTGATTCAAAGATTGCGTTTCAGATTGTTTCTAAAGCACGTGGTCGCCTTTCTGCCGGGGGCGTCCTTATTTTGGAAATAAAGCGGTCGAACGGTGATGTTGCTGCGTCGCACGTGCGGAATTTTCCCTTTTCCGAAATTGTCCCCGAATTCATCGTGGCTGAAGCCGGACGGTATTTAGGTTTTTCTTCAATCGAGACACGTTACTTTTCAACCGGCCTAGTTGGAGAGTCCGACATGAGCTTTGTAGCTGCGGTTATTTTGCAGAAGTCATATGCTGACGGAAACAATCCCATCTCGGTTGTCGGGCGGGCAATCAATTCAATGAACGGATATACTGATTTAAGAGCTGCAACCCACTCTCAAGAGACCGAAGTGCTTCACCGCGAAGAATTACTGTCTGCAATACGCGACTGGCGCAGTCAGTCGAAGATGAGCGCGGATGTAATTGCGGAACTATCGAGCGCTGCATTAGAAGCAGGAAGTGTTTCCCATCGCCTCTCAAAAGCAGAGCGCCGTATACGACGATTGACTTGGCAGACCGCGATATTCCTGCCTCTGACTTACCCGCTATCTTTGGTGATATCGGGAGCCGCCGGATTGGTGGCGCGGATCAAGGATAAAAGACGCAAAGGTCGCTTTGAACCAAAAAAAAAACTCCCATCTGACCAACGAGAAATCTATGACCGGCAGATAGTAACATCGTCGCCGCTTGCTGGGTATTTTGCCGGCAATTCGGAACCTAAAATACTTATCGTGAAACTGGATCATATAGGCGATTTCATTTTGTCTTTGCCTGCCATTAGGGTCCTAAAGGACGCGTGGCCGCACGGGCATTTTACGATTGTCTGTTCACCGACCAACTCTGGTTTGGCCAAAGCGTGTGGTTACTTCGATGAAGTGAGAGAGTATAATTTTTTTGCGCAGCTTTCGCAGGACGTGAAAAAAGCTGACATGTCTAAATTCTCTAGGATCAGGGATGTTGTTGGTGAGGTTTACGACATCGCGATCGATCTTAGGCATGACCAAGATACGCGACCAACATTGGCTTTTGTTGATGCAAAGATTAAGGCGGGGTATCAGACGCACGGGAAGCACTTTGTTCCACTCGATGTCTCGCTTCCTCCTATTCCCGAACGTTCCGGCCTTCACAAAAGCCCTCATAATATCCGCAGGCTAATGTTGCTTTGCAGCCACGTTGTAAACTCAGTGAAGCCGTTGATGTTCGATGCGGGAAGTGCTTTGGTGTTGGCTGGGCGTGAGGCGCCTCTGCCGAACGGAGAAAAATATGCAATTTTAGCACCTGGTGGTGGCACGCTGGCCAAGAAGTGGCCAGCTGAAAGATTCGCCGAACTGGCTGTTCGAATAGCCCAGCATCATGATCTAAAAATTGTCGTCTTGGGCGGCGCCCAGGAAAAAGAATATGGTGAAGCGGTATCGAGAGCGCTTCCCGCAGGACAGGTATTTGACCTGACAGGAAGTCTTCCTCTTGTCGACATGGCGAAAGTCTCAGCGGGTGCAACGATATTTGTTGGGAGCGATACGGGTGCGACGCAGCTAGCGGCTCTTTTAGGCACCCCTACCGTTGCAGTGTTTAGCGGCGTCGCAGATGTGAACCTGTGGCAACCAGTCGGCATGAGAGTGGAAGTCGTGAGGCGACCAATTCCTTGTTCCCCGTGTTATATCGCGAAATTAGAGAATTGCGTGCAAGGTCATGCGTGTATGAACGACATTCAGGTGGATCACGTCTACAAGTCGGTAGAGAGCTTGATCTCTACAACTGAGTGAAGCCACCTCAGGTTGATTCAGCCGTAAATATCTTGAAAAAGTAGAGAGTGTGTGGTGGCAGTTTTCAAAGCAATCGAAAAAAAGCTGAGACATGCGTATAAACGACGCAAACTCGCATCGTCCGGTATCGTTGAAAATGACGGCGGAGCCATACTGGTAACAGGTGGCTTAGGCGACCTCATTGTGATCGCGCGGTTCATGCGCGATCTCCTGGGGCATTTGGGCGGTGGATCCTTTTCGATTTTTTACAGTTCGCCAATGGTGGCGGATATGGTTTTCAAAAGTGTGCCGGGATTTTCCGGTGCGTATCCGTCGCACTTTTTCAAATATGCGAAAGATGTCTATCTTTACGGGCTGAAGATTAATCAATTTATATATGTCGAGAAATGGCCGGGTCATCGTGCGATGACACTTGCTGATACAAAAATATATGAGCTTGTGAGAATACTTGAGGCTTATGAGAAGAAGATGGAGCCGCTTCATGTTGTGCGTATGCATCATCCGTTTCTTGATGGCACTTTAGGCAGATATGCGGCGATCAAAGGCCGCTCGCGGAACGACTTTCTCCATTACGCAGCCGATATTCCATATGGGGGCGATACATTATCTCTCAGTGGCGATGACACGATTCTAGCCCGACATGATCTCGTCCCCGGTAGATACATAACGATTCACAATGGATATGATGAGGCGATGCAAGGCTTGCCCGGCCAGCGGGCCACCAAGGCCTATCCGGGATGGGGCGATGTGGTTTCCTATTTGCCGGAAAAGTTTGGCGAGGACTTGAAGGTTGTTCAATTAGGTACAGAGAAGACCAGTGCGCCTATTGCCGGTGTAGATGTCAACTTGATTGGAAAGACTTCGTTAAAGCAGGCGCTGGGACTTCTCGCGAATACCGCATTTCATCTGGATAATGAAGGAGGATTTGTACATGCTGCGGCTGCGTTTGGGAAGAAAAGCTGCGTCGTCTTCGGGCCGACATCGGTAGATTATTTTGGTTATCGATCAAACTTGAATTTCTCTCCTCTCCAATGCGGTGATTGTTGGTGGAGCGAAAAGACTTGGATGATGGCGTGTCCAAGAGGTGACACTTTGCCGGCTTGCATGTCGGCCTATGATCCTCGGGTCCTAGCTGAGGAGATTTATCGCTGGTGGACGGCCATGTAGATAACAATCTTGGTAGTAGGCGCGGGGTCTACCCTACTGCCAGGCTGAATCCCAGGCCGCTCAGCTAAGCCACGGGCATAATTTTGATTTCCTGATTTCATCGACAAAAATAAGGTTACCGGACGTAGCGAAATGGACGTAGTATCCTGCTTTTGTCATAAGACGAAAAATTTCGTCTTTTTCACCACCAGCCCCGGTGATCTGAACTGACGTGCTCCTGACGTGCCCCCGTTAATTTAGGCCATTTAGAACTGGAATTTTCCACTTATGATCCCTGGTGGGAAGAAGAGGAGAATTCGATGAAGGCCTCGAAGTTTTCGGAAGCGCAGATCGCATTCGTATTGAAGCAGGCGGATCATGCAACGCCGATAGGCGGGGTCTGCCGCAAGGCAGGGGTTTCGGACGCGACGTTTTACATCTATGGACTGTTCCCCCGTTGCAAGCTTGATGTTGAAGCGGTTTGACACGGGTCGATTGCAAACATCTATACGGCGTCATCATTTTCGAAGCGCCCAAATGGGGTATCCGCGCACATACGCCTCAACAAATGGGAGACCTTGTCGGCCATTATAAAAGACAGGTTCCGTATGTGCCGGTCCGACCGTTCATTCCATTTCAACCTTCAGCCTGCAATTCGCATGCCTGCTCCTAAGGATTGGCTCGGTATTCGGTAAGCCCACGGTGAAGGCCGTCTTGCGACAGAGGGCTTCTGACTGCCAGTCCTAGTGGTAACACCAGGAGTAGTCATATGACGAAGCATCCAATTGAGGTGATCACGTCTGTGGAGCGCCGTCGGCGTTGGTCACGCGAAGATAAAGAGCGGCTCGTCGCTGCGTGCTTTGAGCCAGACGCGGTCATCTCCGAGATTGCCCGCGCGGCCGGCATCCATGTCAGCCAGTTGTTCCGTTGGCGCAAAGAGCTTTGCCGGATCGAAGAGCCAAGGGCTGATACGGCGACTTTGGTGCCGGTGATCGTGTCCGAGGCCGCTTCGACAGTCTCTCCCGTTCAACCGGAATCGCCCACCACATCCCATCCTCGTCGGAAGCGTAGCGATGTGACAATCGAGCTTGGACGGGGTCGGCGCGTGCGCGTGGATAGCGACATCGATACGGATGCCCTTGGCCGGATTCTCGATTGCGTATTGGGTCTGCGATGATCCCGGTTCCTGTTGGCGTGAAGGTCTGGCTGGCCACGGGCTATACGGACATGCGCAAGGGCTTCCCCGGTCTGTCGTTGATGGTGCAGGAGGCGCTGAAGCGTGACCCGATGTGTGGACACCTGTTCGTATTCCGCGGCCGCGGCGGTGGCCTGATCAAGGTGATCTGGCATGACGGCCAGGGCGCTTGCCTGTTCACGAAGAAGCTGGAGCGTGGCCGCTTCATCTGGCCATCAGCGGCCGATGGCACGGTGGTGATCACACCTGCGCAGCTCGGTTATCTGCTGGAAGGTATCGACTGGCGGATGCCGCAAAAGACCTGGCGGCCGACGTCGGCCGGATAAGCAAAAACACTGGAATGACAGGCTCGAATATGATTCCATCCTGCCATGAGCAATACGACTGAAGAGCTTCCGGACGACCTTGCCAGTGCCCTTGCACTGCTGGCCCAGGAACGCGCCCGGCGTGTTGCAGCCGAGGCAGAAGCAGCGACCGCTAAGGCGGAAGCCGCCAGCGCAAAGGCACTCGTATCGCATTCCGAAGCGCTGATCGCACGGCTGAAGCTGGAGATCGACAAGGTTCGTCGCGAACTCTACGGCAGCCGGTCCGAGCGCAAGGCGCGGCTCCTGGAGCAGATGGAACTGCAGCTCGAGGAGTTGGAAGCAGACGCGGGTGAAGATGAATTGGTCGCGGAAATCGCTGCCAAAGCTTCAACCGTCAAAGCCTTCGAGCGCAGGCGTCCGTCGCGTAAGCCCTTTCCTGAACACCTGCCGCGCGTGCGCGTCGTTATCGCGGCTCCCGCCAATTGCGCCTGCTGCGGATCGGCCAAGTTGTCGAAGCTTGGCGAGGACATCACCGAGACCCTGGAGGTCATCCCGCGTCAGTGGAAGGTTATCCAGACGGTGCGGGAGAAGTTTACCTGCCGCGCGTGTGAGAAGATCACGCAGCCACCGGCACCCTTCCATGTGACGCCCCGCGGTTTTGCAGGACCGAACCTGCTAGCGATCATCCTGTTTGAGAAGTTTGCCCAGCACCAGCCGCTCAACCGCCAGAGCGAGCGCTATGCCCGCGAAGGTGTCGACCTCAGCTTATCGACGCTGGCCGACCAGGTCGGAGCATGCGCCGCGGCGTTGAAGCCCATTCACTCCCTTATTGAAGCGCATGTCCTTACCGCCGAGCGACTGCACGGCGACGACACGACCGTGCCAATCCTGGCGAAGGGGAAGACCGATACGGGCCGCATCTGGACCTATGTCCGGGACGATCGGCCGCTTGGAGGGCTCTCACCACCGGCCGCCCTTTATTATGCCTCGCGAGATCGACGGCAGGAGCATCCGGAGCGCCACCTGAAGACCTTCACCGGCATTCTGCAGGCCGACGCTTATGGCGGCTACAATCCACTGTTCAAGGTCGATCGCGATCCCGAACCGCTGCGCCAAGCACTCTGTTGGGCACACTCGCGGCGCAAGTTCTTCGTGCTCGCCGACATTGCCGCAAACGCCAAGCGTGGAAGGAACGCCGTGCCGATCTCGCCCATGGCGCTGGAAGCCGTCAAACGGATCGACGGCCTGTTCGATATTGAGCGCGAGATCAACGGGCTTGACGCCGATCAACGCCTGGAGCGTCGCCGGAAAGACAGTCTGCCACTCGTCGCTGATCTACAGGTCTGGCTTCAAACCGAGCGGGCAAAGCTCTCACGCAGTTCTCCGGTGGCGGAGGCGATCGACTACATGCTCAAGCGCTGGGATGGCTTCACGTCATTCCTGCAGGACGGCCGGATTTGCCTGACGAATAATGCGGCAGAGCGAGCGCTCAGAGGCTTTGCGCTCGGCAGAAAATCCTGGCTCTTCGCCGGATCAGACCGCGGTGCAGATCGTGCGGCCTTCATGGCCACACTGATTATGACGGCAAAGCTCAACGACATCGATCCGCAGGTGTGGCTGGCCGACGTTCTTGCCCGCATCGCTGACACGCCGATTACCAGGCTGGAGCAGTTGCTTCCGTGGAATTGGACGCCGCCGACCGTCAACGCTCAAGCGGCCTGACCTGCGGCCTTCACCGGAGGCTTACGGTATTCGCCACCGTGCTGAAGCAAGGCCCAGATCACCCTTGCATTCTTATTGGCGACTGCAACCGCTGCGCGGTTGTAACCACGCCTTTCCCTAAGCTCGTTGACCCATTCGCTGGCAGGGTCAGATTTTCGTTGCGCCGTTCGCACCACTGACCTTCCGCCGTGGACCAGAAGTGTTCGCAGATGCTGACTACCTCGCTTGCTGATGCCAAGCAGTACGCGTCGATCGCCGCTGGAATGCTGCCGCGGCACCAAGCCCATCCAGGCGGCCAGATGTCGGCCATTCTTAAAGTCTGCTCCATCACCGACAGCCGCGACAACTGCCGTCGCAGTCTTCGGGCCGACGCCACGTATCTTGGCGATACGCTGGCAAACATCGCTCTCGCGGAAGACGGCTTCGATCTGCTTGTCGAAGTATCTGATCTTTTCGTTGAGCTGCAAAAGGAAGGCATGGAGTGTGCCGATGATATCGCGCGTCAGCTCGGTGATGTCGATCCTTGGATTGTTGATGATCTCTGGTATCGTGCGCTGGGCGATCGAAGCCGACTTTGCGATCGCTACACCACGCTCGAGAAGCAAGCCTCGCATCTGCGAGATTAGGGCGACACGATGATTGACCAATCTCTGTCGAGCACGGTGGAGGCCCTGGATGTCCTGCTGGATGATCGATTTCTTCGGCACGAAGCGCATGTGTGGCTGTTGAACGGCAATGCATATCGCCTGGGCATCGTTGCCGTCATTTTTCTGGCCCTTCAAGAACGGCTTCACATACTGTGGCGCGATAATCTTCACAGTGTGCCCCTGAGCCTCAAAATGCCGCTGCCAGTGAAAGGCACCCGTGCAAGCTTCGATGCCGATCAGGCAAGGTGGCATAGCCTCGGTCACCTGCAGCAGTTGATCGCGCATCACGCGCCGTCTCAGGGCAACCTGACCATCGGCGCCCACGCCATGAAGCTGGAACAGCTGTTGGTTTCCGCGCAGAAGTGAGCCGGTTTTTCCACCGAGAAGTGAGCCACCTCTAAGTATGGTTTTGTAATCAGGCTTTGGTCAATGGATTGTAAGCGCCGTTTTGAGAGCACGTCGGTTTACGCTTGGCGGAATGGAATGACGTTGTTTCCGCGCATGAACGCAAGGGCACCCTCTGTCTCAATGGCTTTCACCGCTACGTCCCGTGCCTCTGCGTCGGATCGGAATTGCTCGTCCCAGACGTGACTTGTGTTGTTGTGGTCGACCACCTCAAGCGTCCATGTCGTATCGGTTTCCAGTCGGAAGATGTCGATGGAGAACGGATAGCCATCAACGACGATCCTCTTACTTTTGCCAGAAGTCACCAGATTGGGTTCGTCGTCTTCGATCATTTCGCAAGTCCAGCTTCTGTATCACGACGACGAGTATGCCGGTGGAGCTGAGCCCGTCAAGTGGCTGGCTTGATGGTCCTTTTGAACACCCACGGCAACAATTCATCGATCCGGCTCTGCCTGTGACCATTCACGATGGCGGCGAGTGTGCTGGTCAAGTAGGCTAACGGATCGACCGAATTGAGCTTACAGGTCTCGATGAGCGATGCGATGGCCGCCCAGTTCTCGGCTCCTGCGTCATGGCCTGCGAAGAGTGCGTTCTTGCGATTTAAGGCTATCGGCCGGATGGTTCGCTCGACGGTGTTGTTATCGGGCATGTTGCCATCGGCGTCCCACGAACGCTCGGCGGATTCTATCCTGTCATGTTAGAATTGTCGTGTCGGGACAGGGGCGTCGGAGTGCGAAGAAGCGGGCAGGCCGAATTGACTTTGAGCTGCGAGCTCGCGTTGTTATCTGGCCGCCCCTGCGACTATCCCGTCTGCGCTGCGAGCGCGTATCCGTAGATGTCGTACAGCCCGCGCACTCCCCCATCAACAACATGGAGGATTTGGAATGCGGTCGATCGGAATGGATGTGCATCGAAGTTTTGCGCAGGTCGCGGTTCTTGACCGGGGCAAGGTCACCGAAGAATTCCGGGTGGAACTGGATTACGACGCTGTGGTGGCCTTTGGACAGAAGCTTCGAAAGGATGACGAAGTCATACTGGAAGCGACCGGGAACACCTCGGCCATCGTAAGATTGCTGACGCCGTTCGTGGCCAAGGTTGTGATCGCGAACCCGCTTCAGGTGAAGGCCATCGCCCACGCGCGCGTGAAGACCGACAAAGTCGACGCGCGGATTCTGGCGCAACTTCATGCCGCCGGCTTCTTGCCAGAGGTCTGGGCGGCGGATGATGATACCCTGAACCTGCGTCGTCTCGTGTCGGAGCGCGCGGCGCTCGTGCGATCAGTGCGACGAGTGAAGAGCCGGGTGCAGGCCGTTCTCCACGCCAATCTCGTGCCGAAATACTCGGGACATCTGTTCGGAAAAGACGGCAGAGACTGGCTCTCGACGGCCCCGTTGCCCAAGGCAGAGCGTGACCTCCTTGCCCGCCATCTCGATGAACTGGACTGGCTGAAGGGCAAGCTTGAAAAGCTGGATGGTTCCATGGCTCGAATTGCGCTTGACGATAGTCGCACGCGCAAGCTGATGACGATCGCCGGAATAAATTCTGTGGTCGCCACGGCGGTGATAGCTGCCATTGGCGACATCTCTCGGTTCCCGACGCCCGATCGGCTTGCGAGCTACTTCGGCTTGACCCCGCGGGTGCGCCAGTCCGGGGATCGCGGCGCAATCCATGCCGCATCTCGAAACAGGGAAACGCGATTGCCAGAACGATGCTGATTGAAGCTGCCTGGTCGGCCGCGTCGGTTCCGGGGCCGCTGCGTGCCTTCTTTCTCCGGATCAAGGACCGCAAAGGCCTCAACGTCGCTGCGGTCGCCACGGCGCGCAAGATCGCAAATCTGATCTGGCAGTTGCTGACCAAGGAGGCTCCCTACAGATGGGCTCGGCCCGCCTTTGTCGCCATGAAGATGCGCAAGCTTGAACTTCGTGCTGGCGCTCCCAGAGCACATGGGCCCGCTGGTCCCGCCCGCGATTACTGGATCAAGGAAATCCGACATCGAGAGATGGAGCTCGTTGCACAGGCGGAGGCAGCCTACGCGGCAATGGTCGAAGCCTGGAGAGATCGGCCAGCTAAACCACAAAAAAAATTGAGAAAGGGGCTGTTCAACAACATCCGTGTCGATCTCGATGCGACCGTCGGTCAGGAAGAGTTGGAGACCATCCCAGTACTTGGTGATGTAGGCCAAGGCTTCGCCGAGCGGGGACTTTGCCGCGATACGGACGCGGTTATGCGTGAGCCAAACCTGCATGTCGGCGACAAGCGGTGTAGACCGTTCCTGCCGCCCAGCAAGACGAGCCTCCGGATCAAGGCCGCGTAATTCGGCTTCGATCCGGTACAGATCGCCAATCCGTCTGACGCCGTCCTCGGCAATGGGTGCTGTTCCGTTACGGGTGATCTCCACCAGCTTGCGACGAGCATGTGCCCAACAATAGGCAAGCCGAATGTCCGGGCCAACGCGCTCTGGTGCGATCAGCCTGTTGTATCCGGCATAGCCGTCGACCTGCAGGATGCCGGAGAAGCCCTGCAATATCCGTTCGGCATGAATGCCTCCGCGACCGGTCGCGTAGGTGAAGGCAACACCTGGCGGATCACCGCCGCCCCATGGGCGATCATCCCGTGCCAACGCCCAGAAGTATCCGGTCTTGGTCTTACGCGAGCCAGGATCGAGAACCGGGGCACGGGTCTCGTCCATGAACAGCTTGGTCGAGCGCATCAGGTCGGCAATCAGTGCATTGAAGACTGGACGCAGCTCATAGGCTGCTCGCCCGACCCAGTCGGCAAGTGTGGACCGGTCGAGATCGACGCCCTGTCGGCTCATGATCTGAGCCTGCCGATACAGCGGAAGATGATCGGCATATTTAGAGACCAGAACATGGGCGACGGTCGCCTCCGTCGGCAGCCCTGCCTGGATCAACCGTGCCGGAGCCGGGGCCTGAACGACACCATCAGTGCAGGAACGGCATGCATATTTGGGACGACGTGTGACGATGACGCGGAACTGCGCAGGGACCACATCCAGACGCTCGGAGACATCCTCGCCGATGCGATGCAGGCAACCGCCGCAGGCGCAGATCAGGCTTTTTGGCTCGATCACCTCTTCAACACGGGGAAGATGTTTTGGAAGGGAGCCGCGATTGATCGCCCGTGGCTTGGTGGCTCTGTTGCCAGCAGGAGCGTCCGCCTCGTCCTGGGCATGGATCGCGGCCATCGCCGTCTCCAGATCCTTCAGCGCTAGGTCGAATTGCTCGGGATCGGTCTTCTCGGATTTGGGTCCGAAGGCTGCCTGCTTGAAGGCCGCGACGAGCTTCTCAAGCCGTTCGATCCGCTCATCCTTGCGGGCGATATGCTCGTCCTTACTTGCTATCACGACGTCCTTGGCCGTCTTGCGTGCCTGGGCTGCGATCAGCATCGCCTTCAGCGCAGCAACATCATCGGGAAGATCAGCGGTGTCCAGCATGACTGGAACTTACCAAATTCTGTACCGATTTGCCCTCGGAATCTATTGTGCTGAGTCATCGTGTCGCAGCTATTCGATGGTCTCCGGCGTTCTCGTCCTGACGGCATGAACCCGCCGCCAATCAAGGCCCGCAAACAGGGCTTCGAACTGGGCGTGGGTCAGCGTCATCAGCCCATCCTTGATGCCCGGCCAGGTAAAGCTGTGCTCTTCCAGCCGTTTGTAGGCCATGACGATACCGGAGCCATCCCAGTAGATCAGCTTCAGCCGATCCGCCTTGCGGGACCGGAACACGAAGACCGCTCCTGTGAACGGGTCCTTGTGCAGTTCGTTCTTCACCAGTGCCGCCAATCCGTCATGCCCCTTGCGGAAGTCTACGGGTTTGGTCGCCACCATGATCCGCACGCGGTTCGAGGGGAAGATCATGTCGCTGCCGCCAGGGCCCGAGCGATGGCAGCGATCCTGGCAGCAGACGCGCCTTCCTCAAGACGGATCGCCACTGGACCGACGATAATCTCGGCGCGCGTGGCGGTCTTGACACTCAGCGGCTCCGGCGGGGGTGTCTCAACGACCATAGCCGCGAATTCGACGGCGTCTTCCGGCTCAGGCAAAACCAGCTTGCCATCTCGTGCCAGCGTTCGCCAGGCCGACAGGTGATTGGCCTTCAGCCCATGGCGGGCCGCGACCTCATTCACCGTCACACCCGGCCGCAAACTTTCCGACACGATCCGTGCCTTGACCTCATTCGGCCATTGGCGGTGAGCATCACGTCGGCTCCGCCGCGTCGTGAGAACCTCCAACGTAGTCTCCATGGAGAAACTCCTTATCGCTCGTCCATGAAAACTCGATCACAGATCAGGCGACAGCGGGCAACGTGGGGGTGGGACAGCGGTTACAATGGATTGTCCTTTTCTCCTCTCTTCTGTAGCCGATGTTCTGCCCCCACATTGGCAGCCGCCGCTTGATCTGTGATCGAGGTTTCCATGGATGAAGGACGGGAGTTTCTCCATGGAGACTACATTGGAGGTAAGCGGTTAGCCGAGGGCGTGGGGCTTGAAGTTCCAGGGCATAAGCGCCTCGATTTCAGAGCAGGCCAGCCTGGAGCAATGCGGGTCAATGTCTGTGAGAGCCAGTCGAGCGGATCGACGTTGTTCATCTTGACTGTCTGCAAGAGGGTGGCCAGAGTCGCCCACGTGCGTCCACCGCCTTCGCTGCCGGCGAATAGACTGTTCTTTCCCGTAATTGTTTGGGGCCTGATAGCGCGCTCGACGATATTGGAGTCAATTTCGATGCGTCCGTCCGTCAGGAAGCGCTCCAGTGCTTCGCGCCGGGTGAGCGCGTAACGGATCGCTTCGGCGGTCTTGGACTTGCCAGAGACCTTGCCCAGCTCTTTTTCCCAAAGATGAAAGAGGCGGGTGACGATGGTTGCTGATTTTTCCTGACGCAGCACGGAACGGCTGTCGGCATTGCGACCGCGGACCTCATCCTCGATGCGCCAGAGCTCGGTCATCGCGATGATCGTGTCCGTTGCAGCCTTCGAGACGCCGCTAATGTGAAGGTCGTAAAACTTGCGGCGTAGATGCGCCCAGCAGCCTGCGAGCTGGACCGTTTCATTGCTGCCGTCTTTGGCACGCGTCTTGGCGAGACTGGTATAGGCCGAGTAGCCGTCGACTTGCAGGATGCCGCTGAATCCGGCGAGATGACGCGCCACGCAATTAGCGCCCCTGCTGTCCTCAAAACGATAGGCCACCATCGGCGGACTGGTTCCGCCATAGGGTCTATCATCGCGAGCGTAAGCCCAAAGCCAGGCCTTCGTCGTTTTCCCGGACCCAGGGGCAAGAGTGGGTAAGGTGGTTTCGTCAGCGAAGATCCTTTCGCCCTCCTTGACGCGCTCAAGGATGTAATCGGCGCAGATCTGCAGTTCGAAGCCCAGATGTCCCACCCACTGGGCCATCAACGATCGGCTGATCTCGACACCGTCGCGTAGATAGATTGCCTCCTGCCGGTAAAGCGGAAGGCCGTCGGCGTATTTGGAGACGGCGATATAGGCGAGCAGCCGCTCCGTCGGCAGGCCGCTTTCGATGATGTGCGCCGGTGCCAAAGCCTGGAGCACGCCATCGTGGCCGCGGAAGGTGTATTTGGGGCGGCGCGTCACGACGACCCGGAACTTCGGCGGCACGACATCCAGCCGTTCGGAGCGATCCTCACCGATCAGGACCTTTTCCAAGCCCACGTATTCAGCAGGGATCTCTGGCTCGATCACTTCCTCGATGCGTTCGAGATGAGCGGCAAAACCCTTGCGTGGACGTGCTGCCCGTTTCGGCTTGTCCTTGGCCGCATGATCAAGCTCGCTCTGGATTGCCGAAAGGCCAGTCTCGACTTCTTCGAAGGCAAAGGACACCTGTTCGTCGTTGACGCCAAGCCGCAGTCGCTCGGAACGGGTGCCATGTTGGGTGCGCTGCAGAACCTTCAGGATCGATGTGAGGTTGGCAATCCGCTCGTTGGCGCTCTTCTCCACCGCCTCCAGCCGGGCGATCTCAGCTTCAGCGGCCTTCAGTCGGGCTTCTTTTGCAGCCTGCTCGCGCGCCATCGCGAGGACCATCGCTTTCAGCGCGTCAACGTCGTCCGGCAGGTCTTGAGGGGGCAAATCCATGGCAATGAGTAGAGCACAAAAACAGCCGTTTTCCCAACCATTACAGCGGCATGATTCATCTTGTCGCAGGCGCTGTTAGCCCGTCAACAAGGGTCGCCTGACCTTCGTCGGGCGGATCTTTTTCCAATCCATTCCGGCCAGCAACGCCATCAGCTGAGAATGGTCCAGACGTATCCGTGCCGCCGAGATACTCGGCCAGCAGAAGCCCTGATCTTCCAGGGTTTTCGAATAGAGGCAGACGCCACTGCCATCCCACCAGACAATGCGAATACGATCGGCCCGTTTCGACCGGAAGGCGTAAAGCGCGCCATTGAATGGGTCGAGTCCCCCATCCCGCACCAGCGCCATCAAGGACGCAGCCCCCTTGCGGAAGTCGACCGGCTGGCACGACACATAGACCACCACACCCGAGGCGATCATGCCTTGCGTACCGCCTGCAGAACCTTCACCAGGTGATCAGGATCAAAATTACCGCCGACGCGCACCACCATATCGGCAATGGCAATATCGATCGAGCCGCTGCTTATCGTTTCAACGCGCGCGAACTTCTGCTTGCCCGTTCCCTCAGTCAGGGGTGCAACAACGCCCGATGACAGTGCGGTGCGTCGCCATCCATAAAGCTGTGACGGATCTAAGCCTTCCGACCGGGCGACCGCCGAGACGTTCGCCCCAGGCTGCAGCGCCATTGCCAGGATTCGCGCCTTTTCGTCGTCCGACCACTCACGCGGCTTGCGTCGTCTTCGCACCGGCTCCGCAGTCAAAACCTCAAAGGTTCGAGGGTGATTCACACTGTCGCTCATAGGATTCTCCGCATGATTCACGCAGAAAATCGCCGATCAGCAGTCCGAAAGATACGTGGGGTGGCCTACGCGCTTACAGTGCACCACCTGAGGCACTGAATGGAAGGTATCCCAACTCGTCAAGGATCAGCAGATCAAGGCGAACCAGCGTCTCAGCGATCTGGCCTGCCTTTCCTTTGGCCTTCTCCTGCTCGAGCGCATTGACCAATTCGATGGTCGAGAAGAAGCGGACCTTTCGGCGGTGATGTTCGATAGCCTGGACGCCGAGAGCGGTCGCGACGTGTGTTTTGCCTGTGCCCGGCCCGCCGACGAGGACAATGTTCTGCGCTCCGTCCATGAACTCGCATCGATGCAGTTGGCGCACCGTCGCCTCGTTGATCTCGCTGGCGGCGAAGTCGTAGCCGGAGATGTCCTTGTAGGCAGGAAAGCGCGCTGCTTTCATATGATAGGCGATGGAGCGGACCTCGCGCTCGGCCACCTCGGCTTTTAGCAACTGGGACAGGATCGGCACGGCCGCATCGAAGGCTGGAGCCCCTTGCTCGGTCAGGTCCGTGACGGCTTGGGCCATGCCGTACATCTTCAGGCTACGCAGCATGATGACGACGGCGGCGCTGGCAGGATCATGACGCATGACGACCTCCCACAATCCGGACTCGCAGGCCAACATAGCGCTCTACGTTTGCCTTGGGTTCACGCAGCAAGGTCAGCGCCTGTGGCGTGTCGATGTCAGGACCGTCAGTCGTCTTACCATCGATCAGACGATGCAGCAGGTTCAGCACATGCGTCTTGGTCGCGACGCCTTGATCCAGAGCCAGCTCCACAGCCCTGATGACAACCTGTTCGTCGTGATGAAGCACCAGAGCGAGAATGTCGGCCATCTCCCGATCACCACCGGGGCGGCGAAGCATCTGGTCTTGCAATTGTCGAAAGCCCAGCGGCAATTCCAGGAAGGGTGCGCCGTTGCGCAAAGCACCGGGCTTGTGCTGGATGACGGCAAGGTAATGCCGCCAGTCATAAATCGTCCGTGGTGGCTTGTCGTGGCTGCGCTCGATCACCCGCGGGTGTTCGCAGAGAATATTGCCCTCGGCTGCAACGACCAGCCGCTCAGGATAAATCCGCAGGCTGACGGGCCGGTTCGCAAATGATGCAGGCACACTGTAACGATTACGCTCGAAGGTGATCAGGCATGTCGGTGAAACGCGCTTGCTTTGCTCGACGAAGCCGTCTGCGGGTTTCGGCAATTCGCGGACAGGGATTCCAGTAAATCCCGGACAGCGATGGAGTGCACCCTCAGGGTGAGACAGGGAAAATCGCGGACAGTTTCCCCACACCGAGAGGAAACCGTGCATGGCCCTACGACAAAGCAACATTGATGAAGACCAAAAGCTCGCGCTTCTTAAGCGCATGCAAGCCGGCGAGAATGTCAGCAAGCTGGCGAACGAGGCCGGCGTTGGCCGCCAACGGCTCTATGAGTGGCGAGATCATCTCAGGCTCTACGGCGATCTGAGTTCACGACGGCGAGGTCGACCCCCGAGAGTGGTGTCAATTGTTGACGGTGCTTCGCCATTGCAGACCGCTATGATCGCGCCTTCCCCTCAAGAACAATCGCTGACCAAGGCTAGACGCAGGGTTAGGGAATTGGAGCAGAAAATCGGCCAGCAGCAATTGGACATCGATTTTTTTCGCGAAGCCTTGCGGCACTTCGAGGAAGCTCGTCGTCGGAGCAGCGCTCCTGGCGGAACGGCGTCTTCCAAGTCATCGAAAAAATGATGGCTGATATGCCGCAGGGCAAACTTAACATCGATAGAATGTGCTGGCTCGCCGGCGTCAGCCGCGCGAGTTATTATCGCCATTGGCTCGATTCCGCCCCTCGTAGGGCCGAGACCGGATTGCTATAGCGCGGCGATCAGGATGAGACGCGGGCGACAATCTTGATGAGATTCTTAGGTCGCGGTCGGCGTCAGATTATCATGCTGATTGTCGCTGACAAGTTCTTCGTCATTTTCTAATTGCCGCTCCGCGACAGACAGCGTTGAGGTCCTGATTGTCGCAAACGAAGCGGGTCGGCCTCGCTGGCGCTTTTCCTCAAGTGCTGCTCTGCGGCGATAGCTTTCGACATTCATCTCGAAGATCGTGGCATGGTGCACGAGCCGGTCGACTGCGGCGAGTGTCATGGCAGGATCGGGAAAGACCCTGTTCCACTCTCCGAACGGCTGATTGGCGGTGATCAGGATCGACCTGTGCTCGTATCGGGCCGAGATCAGTTCGAAGAGCACACTTGTTTCCGCGTGGTCCTTGGTGACGTAGGCGAGGTCATCGAGGATGAGCAGGTCGTATTTGTTGAGTTTGTCGATGGCTGATTCAAGCTGGAGTTCTCGACGGGCAACCTGAAGCTTCTGCACAAGATCGGTGGTTCGGGTGAACAGGACCCGCCATCCGTTCTCAATCAGCGCGAGACCGATGGCAGCAGCAAGATGGCTCTTTCCTCCGCCGGGCGGACCGAACATGAGGATGTTCGCTCCCTTGGCGAGCCAACTGTCGCCGGCGGTAATTGCCATGACCTGGGCCTTCGAGATCATAGGTACAGCGTCAAAGGCGAAGCTGTCCAAGGTCTTTCCAGGCGGCAGGTGCGCTTCGGCGAGATGCCGTTCAATCCTGCGATTTGCACGTTCTGCCAGCTCGTGCTCGGCAATAGCCGACAGGAATCGAGCTGCTAGCCATCCTTCCCGGTCCGCCTGCTCGGCAAATTGCGGCCAAAGCGTTTTAATTGTCGGCAACCGAAGTTCGTTGAGCATGATACCGAGCCGTGCTTCATCGATGGCGTGTGCGTTCTTCATGCGGCTTCTCCCGTCCCGATCAGGGCCTCATAACCGTTAAGCGATGCGAGTTGCACATGAACGGTCGGCAGCTTGGCCGGGTCCGGACCGAAGAGCGCTCGCATGGCAGCCAGGTTGGGTAACTCACCAGCATCGAGTGTCCTGGCCAATTCTTCGGCAAGTTCGCGTTCGCAGCCGCGGTCATGCGCCAGTGCCAGAAGCTCAATGGTGATCTTGCAAGCCTGCTTATCGGGAAGATGTTCGGTCAGGGCCTCGAAGGCCTTGCGATATTCTTCTCGCGGGAAGAGCTTGTCGCGATAAACGAGGCCACGGAGCGCCATCGGCTTTTTACGCAGGGAATGGATGACGTGGTGGTAGTTGACGACCTGGTCGTGTCGCCCGTCAGCGTGGCCACGTCCCCGGCGCAACGTCATCAGATGCGTACCGCCAACGAAGACATCCAGCCGATCGTCGAACAGGCGAACTCGTAACCTGTGGCCGATCAGGCGGGACGGCACGGTGTAGAAGACCTTGCGCAGGGTGAAGCCGCCCGTCCGGGACACAGTGACAACAATCTCCTCGAAGTCCGTGGTCCGGCGCTCAGGCAGCGACTGCAGATGGGACCGCTCTGCATCGATACGCCTGCCATGGGCAGCGTTGCGACGGCTGACGATCTCGTCGACAAAAGCGCGGTAAGAACCGAGATCGTCGAATTCCTTGGTACCCCGCATCAGCAGGGCATCGTGAACAGCGTTTTTGAGATGACCATGCGAGCTTTCGATCGAGCCGCTCTCATGCGCGACACCTTTGTTGTTGCGCGTCGGCGTCATCCGATAATGGGAGCAAAGCTGATCGTAACGGTGTGTGAGATCGACCCTGGCGTCGGCATCGAGGTTGCGGAATGCCGCCGACAGGCTGTCGCTGCGATGATAAAGCGGCGTACCACCCACGGACCACAGGGCATTTTGAAGTCCTTCGGCCAGCGCGACGAAGCTTTCCCCGCCGAGAATGACATGGGCGTGTTCGAAGCCCGACCAGACGAGCCGGAAGTGATAGAACAGGTGATCAAGCGGCTGGCCGGCGATCGTTACACCAAGGCCGCTGGCGTCGGTAAAAGCCGAAAGGCCCAGTCGGCCTGGCTCGTGCGTCTGGCGGAAGATGACCTCCTGCGCTTCGCCATTGACAGCACGCCACGACCGGATGCGTCGTTCAAGCGTACGGCGGATACCTTCAGAGAGTTCTGGATGTCGGCGCAGCATCTCGTTGTAGACGGCGACGGCACGGATGCCAGGGGCGGCCTTCAGCAGCGGGACGACTTCAGTCTCGAAAATCGGCTCGAGGGGATCAGGGCGGCGACGTCCACGCGGCGCTTTGCTCTGAGATGGTAGGTGTACCTTCTTGTCGAGGCGGTATGCCGTGGCTCGGCTGATCGACGCCTTTGCCGCGGCGATCTCTACGGAATGCGTTTGTCGGTACTTCATAAAGAGTCTCATCTGATGATCGGTTAAATGGCGACCCGGCACAAACTGGTTCTCCAATCCAGAAAACCGCAAATGTACCGGACCGACCGCGATCTTGAGACGCCGAAAATCTGCGCCACGCCGGGGTATGACTACGATCGGGCTACGCCCTCACTTCGTCACACCCCGGCGCAAGTCTCATCCTGATTGACGCTGAATCTCACCTTGATTGCCGCTATGCAGATTGCGCGATCTCATTCAAAAGCTGGTTCTGGGCAATGCTCATTATGGATATCGAAGGATCGCTGCGTTGCTGCGGCGGGACGGATGGCAAGTCAATCACAAATGCGTTTTGCGCATTATGCGCGAGGATAATCTGCTGTGTTTGCGCTCTCGTCCATTCGTTCCGAGAACCACCGATTCGAAACATGGATGGAGGGTTGTGCCGAACCTTGCCAAGGGAATGATCCTGAATGGCGTCGACCAGCTCTGGGTTGCCGACATCACGTTCCTGCATCTGGCTGAGGAATTTGCTTTCCTTGCTGTTGTGCTAGACGCATTCAGCCGGAAAGTGGTCGGGTGGTCACTGGACACGCATCTGAGGGCAAGCCTGGCCATCGAGGCGCTCGAGATGGCAATTACTGATCGTCAACCCGAGCTGGGCAGCCTGGTTCACCATTCCGACCGGGGAGTCCAATACGCTTGTGGGGCATACTCGGAGCTTTTGCGTCGTCACGGCATCCAGCCGAGCATGAGCCGCGTCGGAAACCCCTATGATAACGCAAAAGCGGAGAGCTTCATGAAGACCCTTAAGCAGAGGAGGTGCAAGGTCTCGCCTATAGGGACGTGGATGATGCACGCAAGCGGATCGGCGTCTTCATCGACACAGTTTACAACACACAGCGCCTACATTCGGCGCTCGGCTATCTTACTCCCGAGGAATACGAACTGAAGCATTCTTGTCGCACCAACATTGGAAATGCGGCATAGAAGAGGGTGGACTGTCCGCGATTTTCCTTGTCTCACCCTGAGGGTGCACTCCAAGACTGTCCGCGATTTAGTGAAACGACTGTCCGGGATTTACCGGAATCGATGTCCCGTTAATCCGAAATCCGCAGCCGTCAAACATGGTGGGTAATGCCATTAAGGCAGGTTGCTCACCACCCCAAACATCCGCGATCGTGCCTGATAAGGTACCATGCGCGGTGTCCCGCCACAGGTCCTGGCAACGCTGCTCCAGCCAGACGTTCAAAGACGGCAAATCAGGAAAGTCCGGCATCTGTTGCCACAGCCGCGGACGAGAATCCTGGACATTCTTCTCCACTTGGCCCTTCTCCCAGCCCGCGGCGGGATTGCAGAACTCAGGTGCAAAGACGTAGTGGTTCGTCATCGCAAGGAACCGGATGTTGACCTGTCGCTCCCTGCCGCGGCCAACGCGATCGACCGCTGTCTTCATGTTATCGTAGATGCCGCGAGCAGGGACGCCGCCGAACACACGGAAGCCGTGCCAATGGGCGTCAAAAAGCATCTCGTGCGTCTGCAGCAGGTAGGCCCTGACCAGAAACGCGCGACTGTGCGATAGCTTGATATGCGCTACCTGCAGCTTCGTCCGCTCGCCGCCGATGACGGCATAGTCTTCACTCCAATCGAACTGGAATGCTTCGCCTGGGCGGAAAGACAGCGGAACGAATATGCCTCGGCCCGTCGTCTGCTGCTCACGTTGTCGATCCGCCCGCCAATCACGGGCGAAGGCGGCAACCCGGCCATAGGAACCGGTAAAGCCAAGCGCCACCAGATCGGCATGAACCTGCTTCAGTGTTCGCCGCTGCTTCCGCGACTTCCCGTCCTCTGTCTTCAGCCAGGCCGCCAGCTTGTCGGCAAAAGGATCAAGCTTGCTCGGTCGTTCTGGTACAGTGAACGTTGGTTCGATCGTACCTGCGTTCAAATACTTCGCGATCGTGTTCCGCGACAGCCCGGTGCGCCGGCTGATCTCGCGGATCGACTGCTTCTCTCGCAGCGCCATCCGACGGATGATGTTTAAAAGTCCCATGTGGATCACTCCGTTGCCCCCGTCGCTCTCCGCGTTGGGGGGAAGGTTCACATGGCTCAATTCTCAATGGAAACTATCCGCCTAACCGGCTCACTTCTGCGCGGAAACCAACAGTTATCGCTTACATTGCTGTTGACACAGCGGGATATATGGCTCAGTCGAAGACGCTAACGCAAGACACTGTCCGGGAGCATTATGGGAAGTGAATTGAAACGCGAGAAGCGTGGCGAATTGGAGGCCATCCGGGGTCTCGCAGCCTGCTCTGTAGTGGGCGGTCATTTTTTTGGTTGCTTCGCTCAGCCGGATAAGTTGCCCGACATCAGCCGCAGTCTTTATTCCGCACTTACAAACGGCTCGGCGGCTGTTGTAGTGTTTTTCGTCCTTTCCGGCGTTGTACTCCCTCTCTCCTTCTTTCGATCCGGAGGCAAGACAAGCACGATCACGGTGGCTGCGTTAAACCGCTTCCCTCGGCTGATGCTCTTAGTGTTTTTGACCGTGATGGGTTCCTACCTAATCACTGTTTTCGGAGTGAACTATTCCAAGGCTGCGGCACAAATATCCGGCTCAACCTGGCTCGCTACCTACGGCTTCCCGCATCCAACAGAACACTTTGAACCGAGTTTCGGTACCGCTCTCTGGCAAGGGCTCTTCGGAACGCTCTTGGAAAACAGGAGCGAGTTCAACGTGTCTCTTTGGACAATGCATCACGAGCTGTACGGCAGCTTCGTCACTTTTGGTTTGGCTATGGTGTTGTTTAAAGCACCGATGAAAATTATCTGTATCATGACCGCGCTGGCTTTTGTTGTTCTTCAGTTTACCGCTTGGCGCTTGACCCCTTTTGTTGTCGGAACCGCACTTTCAGCGTTCTTCTATCAGCGCCCACAGTTTTCGCTTAAACTCCCCTTGTCCCTTGGCCTAATTGTTTTTGGCTTCCTCTTTTACGGGTTCGTTCCGGCTAACAGTGGTTACTGGCCCACGACATTCATTCCTTGGGGCACGGACGATCAAAAAGGGTGGTTGGTTCACACCCTTGCAGGCGTGAGTTTTATCCTCGGCGTGGCGGGCAACGATCGCGTCAGGAGAGTGATGAATGCGGATTGGTTGGTCGCCTTGGGACGGTACTCGTTTGCCATGTATGCCGTCCACATGTTGATTATGAGTTCTTTGGTGAGCGTAGTGCTGATCCACGTTGCACCACTAGGCAAGCTCCCAGCTCTTATTATCATTACGATCGTGTTCGTATTGTCTCTGGCCGCAGCATCTTATGTGCTAACGAAAATCGACGAGTGGTGGACGAAGTGGACGCGGTCAGCGATCAAGTCCCTTACGGCTACCCAACCGACAGCGCCGGCGCAGTGTCCGAGCGTGCCTTCTTTTTGAGGCGCTTGAGGGCGCGCGAATGCGTCCCCAGCACTCGTTTGTTAGCCAATGGACCAGCGTATTATAATTTTTGCCGAAGTTTTGGAACCGGCTAAGATTTCGATTTCGTATAAAAAAAATCCTAAATCGCGCCGTAGGGTACAGTTATTGCACTCTTTGTAAGCGCCGTCTCCTTCCCATCGGGTCCGCTGCTTTCTGAGCATGATTGGTCATGCGAAAAGACTTCCAGGCGTATCTGGAAGTTGAATTATGGATGATGGATTTGTTGTCGTTATGAGGTCGTTGAACCGCGGCGGGGTAACCGGCGGTGGCCGGATGACGTGAAGGCTCGTATTGTCGCGGAAAGCCTCGAGCCTGGTGCGCGGGTAGTGGATGTCGCTCGCGGTCACGGTGTTGTAGCAAATCAACTTTCCGGTTGGCGACGTCAGGTGGGAGCACCATTCTGGCCCTGCCGTTTGCGACGGCAACGACGCCGTCGTGGCACAATGGTTCCGAGCAGTAGTCGTGCCTCTGGCAATTGCTCCGGGGCCGCCTGAGCCTGGCAATCCTTAGTCGCTATGAATAGCCCCGAGATTTGTAGACGCCTTCTTTCCTAATTTTGAGGCAAGAAGAGCATCATGGGCAAATCGAATTTCAGCGAAGAGTTTAAGCGTGACGCGGTGCGGCAGATCACGGAGCGTGGCTATCCGGTGGCTGAGGTGTCGCAGCGTCTCGGCGTCAGCCAGCATTCGCTTATGAATGGAAGAAGAAGTTTGCTGGGTCGCACTCCAAGGGCAACGACGAGGCCGAGGAGATCAGGCGGCTGAAGAAGGAACTGGCTCGCGTCACTGAGGAACGCGATATCCTAAAAAACGATCACGGGCTTTGGTCCGCCTGTGCGCTGTAGAAACAGGTCGTGCTAAGCTTCCGGCCGGAGGGGAGTTGCCGACTGCTTATTGGAGGCTTTCGACCCCGTTAAAAAACGTGGCACATGGGCTGTTGCGAACTTGAGAGTGGTGACGCGGCTTCGACGGCGATCCCGGTTAGGAAAATGACGAACTTGCACAGGCCAGCCCCTCCTGAACCGAAGGAGGAGAAGCCATGCCCAAGACAAAATCATCCGCGCCGCAGAACCTCACCCGTATGAACCCCGGCGCAGCTGCTATCGACATCGGGTCCACGATGCACATGGCGGCCGTCAATCCGAACAGCGACGATATGCCAATCCGAGCCTTTGGCACCTTCACCCATGACCTGCATGATCTCGCCGCCTGGTTCAGATCCTGTGGCGTCACAAGCGTGGCGATGGAATCGACCGGCGTCTACTGGATACCGGCATTCGAGATCCTGGAGCAGCACGGCTTTGAGGTGATCCTGGTCAATGCACGCTACGCGAAGAACGTCCCGGGGCGGAAGACCGACGTCAATGATGCGGCATGGCTGCGCCAGCTGCATTCTTATGGCCTGCTGCGCGGAAGCTTTCGGCCGAATGCCCAGATCGCAACGCTGCGTGCCTATCTGCGTCAGCGAGAACGGCTGGTCGAGTATGCGGCCGCCCATATCCAGCATATGCAGAAGGCCTTGATGGAGATGAACCTGCAGCTCCATCATGTCGTCTCCGACATCACCGGCGTAACAGGCATGAAGATCATACGGGCGATTGTCGCCGGCGAGCGCGATCCCGACGTCCTGGCATCATTGCGCGATGTTCGCTGCCACTCCTCTGTCGACACGATCAGAGCATCGCTGATCGGTAACGATCGTGATGAGCATGTTTTTGCGCTTTCCCAATCGCTGCAGCTTTACGACTTCTATCAGCTAAAGATGATTGAATGCGACCGCAAGCTGGAGGCATCGATTGCTTCCATGACCGCCGAGCCTGAAGCCCCGCTTGCGCCGCTGCCAAAAATCCGGACGAAGACCAAGCAGACGAATGCGCCGTCCTTCGATGTCCGAGCCGCACTATACGGCCTTACGGGAGCAGATTTGACCCAGATTCACGGCCTCGGCTCTTCGCTCGCACTCAAGCTCATCGGCGAGTGCGGAACGGATTTGCGGGCATGGCCAACCGCCAAGCACTTTACATCATGGCTCTGCCTCGCCCCGGGCAACAAGATCTCGGGAGGCAAAGTTCTATCATCGCGAACACGTCGGTCGTCCAGTCGAGCGGCGGCACTGTTGCGATTGGCCGCAACCACGATCGGGCGAAGTGACACTGCCCTCGGTGCGTTTTACCGCCGGTTGGCCGGGCGTATCGGAAAGCAGAAAGCGGTGACAGCGACGGCACGCAAGATTGCAGTCCTGTTCTATAATGCACTGCGGTTTCGAGTGATATACCGCGATCCAGGCGCTGCGGCCTATGACGAGCGACACCGAGGCCGCGTCATTGCCAACCTCCAGCGCCGCGCCCGGTCTCTGGGCTATCAACTCGAACCCATGATTGCTGCAGAATGTGTTTCTTAG
Protein sequences of DBSCAN-SWA_1 >CP036359|1800516:1847820|1840505_1842011_-|QBJ16086.1|transposase|DBSCAN-SWA MPGRHLTDHQMRLFMKYRQTHSVEIAAAKASISRATAYRLDKKVHLPSQSKAPRGRRRPDPLEPIFETEVVPLLKAAPGIRAVAVYNEMLRRHPELSEGIRRTLERRIRSWRAVNGEAQEVIFRQTHEPGRLGLSAFTDASGLGVTIAGQPLDHLFYHFRLVWSGFEHAHVILGGESFVALAEGLQNALWSVGGTPLYHRSDSLSAAFRNLDADARVDLTHRYDQLCSHYRMTPTRNNKGVAHESGSIESSHGHLKNAVHDALLMRGTKEFDDLGSYRAFVDEIVSRRNAAHGRRIDAERSHLQSLPERRTTDFEEIVVTVSRTGGFTLRKVFYTVPSRLIGHRLRVRLFDDRLDVFVGGTHLMTLRRGRGHADGRHDQVVNYHHVIHSLRKKPMALRGLVYRDKLFPREEYRKAFEALTEHLPDKQACKITIELLALAHDRGCERELAEELARTLDAGELPNLAAMRALFGPDPAKLPTVHVQLASLNGYEALIGTGEAA >CP036359|1800516:1847820|1823266_1825348_+|QBJ16075.1|DBSCAN-SWA MKEGVSYEAFEAYLSQQRCPDYSGPATYKVLFCALTDELKESRVAALFGSWSESEIDWITAQGWTADAGENPDILAERPDDGKSSTANLVICRLMDAPLDSKIAFQIVSKARGRLSAGGVLILEIKRSNGDVAASHVRNFPFSEIVPEFIVAEAGRYLGFSSIETRYFSTGLVGESDMSFVAAVILQKSYADGNNPISVVGRAINSMNGYTDLRAATHSQETEVLHREELLSAIRDWRSQSKMSADVIAELSSAALEAGSVSHRLSKAERRIRRLTWQTAIFLPLTYPLSLVISGAAGLVARIKDKRRKGRFEPKKKLPSDQREIYDRQIVTSSPLAGYFAGNSEPKILIVKLDHIGDFILSLPAIRVLKDAWPHGHFTIVCSPTNSGLAKACGYFDEVREYNFFAQLSQDVKKADMSKFSRIRDVVGEVYDIAIDLRHDQDTRPTLAFVDAKIKAGYQTHGKHFVPLDVSLPPIPERSGLHKSPHNIRRLMLLCSHVVNSVKPLMFDAGSALVLAGREAPLPNGEKYAILAPGGGTLAKKWPAERFAELAVRIAQHHDLKIVVLGGAQEKEYGEAVSRALPAGQVFDLTGSLPLVDMAKVSAGATIFVGSDTGATQLAALLGTPTVAVFSGVADVNLWQPVGMRVEVVRRPIPCSPCYIAKLENCVQGHACMNDIQVDHVYKSVESLISTTE >CP036359|1800516:1847820|1800516_1802175_+|QBJ16062.1|transposase|DBSCAN-SWA MSNTTEELPDDLASALALLAQERARRVAAEAEAATAKAEAASAKALVSHSEALIARLKLEIDKVRRELYGSRSERKARLLEQMELQLEELEADAGEDELVAEIAAKASTVKAFERRRPSRKPFPEHLPRVRVVIAAPANCACCGSAKLSKLGEDITETLEVIPRQWKVIQTVREKFTCRACEKITQPPAPFHVTPRGFAGPNLLAIILFEKFAQHQPLNRQSERYAREGVDLSLSTLADQVGACAAALKPIHSLIEAHVLTAERLHGDDTTVPILAKGKTDTGRIWTYVRDDRPLGGLSPPAALYYASRDRRQEHPERHLKTFTGILQADAYGGYNPLFKVDRDPEPLRQALCWAHSRRKFFVLADIAANAKRGRNAVPISPMALEAVKRIDGLFDIEREINGLDADQRLARRRKESLPLVDDLQAWLQTERAKLSRSSPVAEAIDYMLKRWDGFTSFLADGRICLTNNAAERALRGFALGRKSWLFAGSDRGADRAAFMATLIMTAKLSDIDPQAWLADVLARIADTPMTRLEQLLPWNWTPTQKPMALAS >CP036359|1800516:1847820|1820416_1821403_+|QBJ16073.1|DBSCAN-SWA MYIVTGAAGFIGSNIVADLEAAGLGPIAVVDWFGRGDKWRNLAKRNIAAYVSPEGLPEYLRGLKMEVKAVIHMGAISATTEADVDRLIDLNINYSVMLWDWCAEHGVPFIYASSAATYGGIEDGFDDNESSAARAGLAPLNAYGWSKKTTDDIFVERVARGENAPPQWVGLKFFNVYGPNEYHKDDMRSVACKLFDVVQTTNEVSLFKSYRPDIAHGEQRRDFVYVKDCTSFILWLLHNRNRSGIFNCGSGKARSFADIVHVMGDILGKKLNINFIEMPEAIRGKYQYFTEANMSKVASIGYNGPQSSLEEGLNDYINSYLRHDDRYR >CP036359|1800516:1847820|1844193_1845378_+|QBJ16088.1|DBSCAN-SWA MGSELKREKRGELEAIRGLAACSVVGGHFFGCFAQPDKLPDISRSLYSALTNGSAAVVVFFVLSGVVLPLSFFRSGGKTSTITVAALNRFPRLMLLVFLTVMGSYLITVFGVNYSKAAAQISGSTWLATYGFPHPTEHFEPSFGTALWQGLFGTLLENRSEFNVSLWTMHHELYGSFVTFGLAMVLFKAPMKIICIMTALAFVVLQFTAWRLTPFVVGTALSAFFYQRPQFSLKLPLSLGLIVFGFLFYGFVPANSGYWPTTFIPWGTDDQKGWLVHTLAGVSFILGVAGNDRVRRVMNADWLVALGRYSFAMYAVHMLIMSSLVSVVLIHVAPLGKLPALIIITIVFVLSLAAASYVLTKIDEWWTKWTRSAIKSLTATQPTAPAQCPSVPSF >CP036359|1800516:1847820|1829703_1830723_-|QBJ16080.1|transposase|DBSCAN-SWA MRGNQQLFQLHGVGADGQVALRRRVMRDQLLQVTEAMPPCLIGIEACTGAFHWQRHFEAQGHTVKIIAPQYVKPFLKGQKNDGNDAQAICIAVQQPHMRFVPKKSIIQQDIQGLHRARQRLVNHRVALISQMRGLLLERGVAIAKSASIAQRTIPEIINNPRIDITELTRDIIGTLHAFLLQLNEKIRYFDKQIEAVFRESDVCQRIAKIRGVGPKTATAVVAAVGDGADFKNGRHLAAWMGLVPRQHSSGDRRVLLGISKRGSQHLRTLLVHGGRSVVRTAQRKSDPASEWVNELRERRGYNRAAVAVANKNARVIWALLQHGGEYRKPPVKAAGQAA >CP036359|1800516:1847820|1808436_1808670_+|QBJ16386.1|DBSCAN-SWA MATTVTAKGRVTIPKGVRELLGISPGSSVDFVRAPNGRIVLVRADKKQPLTRFAKLRGHAGEGLGTDAIMALTRGDE >CP036359|1800516:1847820|1827263_1827671_+|QBJ16077.1|DBSCAN-SWA MTKHPIEVITSVERRRRWSREDKERLVAACFEPDAVISEIARAAGIHVSQLFRWRKELCRIEEPRADTATLVPVIVSEAASTVSPVQPESPTTSHPRRKRSDVTIELGRGRRVRVDSDIDTDALGRILDCVLGLR >CP036359|1800516:1847820|1807251_1808019_+|QBJ16066.1|DBSCAN-SWA MADSENSRTLPKISCANAFPNETFVDNLPSVINRRNLLPLAARILPMLLNDLPSRTIAGPVHAKELWPDWYDMYRQRLAAERECQELEARLLEETGGRPFVVITVDDGGTSVGTVSSFEEIRELAPRIGADAAESARLELLRLRRRWNAADRRIGYSASLAKAQDLARFEGIAGRVLISLQPYYIHDIAAKLHCMLVMYDPELRKEETPWPELRRMLRELIQPYWSVIEPQSRIRLLRPKTRETPFQEERGRIAV >CP036359|1800516:1847820|1812255_1813041_-|QBJ16388.1|DBSCAN-SWA MTAIGKLAENFAEASREGARIPLAELEAESLVPATFSEAMEIQRAFAANWATPVVGWKLAIRPDGEAVGAPIFDYVRVNDANLASFPEDGTEGVEVEICFTLSADIPASTAGRMTRADLTGFIDMVHLGAELLRYRLVEKNQVPFPLFLADRLANHGFVIGPELDEGIVEVFTGNGENLLQLTVTEGTVSLFDAKVKHPNGDPLAPLVAFANSAFNTGDMLKAGNVVTTGSLCGAIPSALAGHTRVRLESAGAFTLSGPKR >CP036359|1800516:1847820|1810029_1811085_-|QBJ16067.1|DBSCAN-SWA MEYRQLGKSGLKVPVLSFGTGTFGGSGPLFSNWGNSDAAEARSLIDVCMEAGVNLFDTADVYSNGASEEVLGEAVKGRRDRVLISTKTSLPMGDGPNEAGSSRFRLLRAVDDALRRLKTDYIDILQLHAYDAFTPVEEVLSTLDTLVRSGKVRYTGVSNFSGWQIMKSLAVADRYGWPRYVVNQVYYSLVGRDYEWDLMPLGADQGLGAMVWSPLGWGRLTGKIRRGTPLPEGSRLHETAGFGPPVNDEELYRVIDAIDAVATETGRTVPQVALNWLLRRPTVSTVIIGARNEEQLRQNLDALGWELTSAQVATLDEASETTAPYPHFPYQRQEGFARLNPPAVTRQTMPF >CP036359|1800516:1847820|1839615_1840509_-|QBJ16085.1|DBSCAN-SWA MKNAHAIDEARLGIMLNELRLPTIKTLWPQFAEQADREGWLAARFLSAIAEHELAERANRRIERHLAEAHLPPGKTLDSFAFDAVPMISKAQVMAITAGDSWLAKGANILMFGPPGGGKSHLAAAIGLALIENGWRVLFTRTTDLVQKLQVARRELQLESAIDKLNKYDLLILDDLAYVTKDHAETSVLFELISARYEHRSILITANQPFGEWNRVFPDPAMTLAAVDRLVHHATIFEMNVESYRRRAALEEKRQRGRPASFATIRTSTLSVAERQLENDEELVSDNQHDNLTPTAT >CP036359|1800516:1847820|1842177_1842900_+|QBJ16087.1|transposase|DBSCAN-SWA MPLCRLRDLIQKLVLGNAHYGYRRIAALLRRDGWQVNHKCVLRIMREDNLLCLRSRPFVPRTTDSKHGWRVVPNLAKGMILNGVDQLWVADITFLHLAEEFAFLAVVLDAFSRKVVGWSLDTHLRASLAIEALEMAITDRQPELGSLVHHSDRGVQYACGAYSELLRRHGIQPSMSRVGNPYDNAKAESFMKTLKQRRCKVSPIGTWMMHASGSASSSTQFTTHSAYIRRSAILLPRNTN >CP036359|1800516:1847820|1837288_1837672_-|QBJ16083.1|DBSCAN-SWA MSDSVNHPRTFEVLTAEPVRRRRKPREWSDDEKARILAMALQPGANVSAVARSEGLDPSQLYGWRRTALSSGVVAPLTEGTGKQKFARVETISSGSIDIAIADMVVRVGGNFDPDHLVKVLQAVRKA >CP036359|1800516:1847820|1802171_1802786_+|QBJ16063.1|DBSCAN-SWA MISNERIARLHIKLDHIKPVIWRRVEVPITTSLKGLHDVIQAVMLFEDYHLFEFNAGGRRYAVPDPEWDLGRETYAARNVRIGALVERGIETLDYTYDFGDDWRHSITVEAVTDADPAVEYPRFVEGDRRAPPEDVGGLSGFEEFLDAMTKPRHKQYRQVVDWYGGRFEPEDISVATINERLAKLARRRTLGKAGFAKSQNKHH >CP036359|1800516:1847820|1828060_1829716_+|QBJ16079.1|transposase|DBSCAN-SWA MSNTTEELPDDLASALALLAQERARRVAAEAEAATAKAEAASAKALVSHSEALIARLKLEIDKVRRELYGSRSERKARLLEQMELQLEELEADAGEDELVAEIAAKASTVKAFERRRPSRKPFPEHLPRVRVVIAAPANCACCGSAKLSKLGEDITETLEVIPRQWKVIQTVREKFTCRACEKITQPPAPFHVTPRGFAGPNLLAIILFEKFAQHQPLNRQSERYAREGVDLSLSTLADQVGACAAALKPIHSLIEAHVLTAERLHGDDTTVPILAKGKTDTGRIWTYVRDDRPLGGLSPPAALYYASRDRRQEHPERHLKTFTGILQADAYGGYNPLFKVDRDPEPLRQALCWAHSRRKFFVLADIAANAKRGRNAVPISPMALEAVKRIDGLFDIEREINGLDADQRLERRRKDSLPLVADLQVWLQTERAKLSRSSPVAEAIDYMLKRWDGFTSFLQDGRICLTNNAAERALRGFALGRKSWLFAGSDRGADRAAFMATLIMTAKLNDIDPQVWLADVLARIADTPITRLEQLLPWNWTPPTVNAQAA >CP036359|1800516:1847820|1839019_1839442_+|QBJ16084.1|DBSCAN-SWA MALRQSNIDEDQKLALLKRMQAGENVSKLANEAGVGRQRLYEWRDHLRLYGDLSSRRRGRPPRVVSIVDGASPLQTAMIAPSPQEQSLTKARRRVRELEQKIGQQQLDIDFFREALRHFEEARRRSSAPGGTASSKSSKK >CP036359|1800516:1847820|1809029_1809980_+|QBJ16387.1|DBSCAN-SWA MKTLAAVSAFVGKTFAVWVILFAVLGFFFPDTFKQIAPWIVTLLSIIMFGMGLTISVDDFREVVKRPFDVTIGVLGQFLIMPLLAVLLTRIIPMPPEVAAGVILVGCCPGGTSSNVMTYLSKGDVALSVACTSVTTLAAPLVTPFLVWLFASQFLPVDGWAMFLSIVKVVLVPLALGAALQKLLPGLVKTAVPALPLVSVIGIVLIVAAVVGGSKASIAQSGLMIFAVVVLHNGLGYLLGYLAAKATGLSLAKRKAIAIEVGMQNSGLGAALATAYFSPLAAVPSAIFSVWHNISGAILANWFSGRVDAGSANKTA >CP036359|1800516:1847820|1811269_1812175_+|QBJ16068.1|DBSCAN-SWA MARLEINRAGEMEIFVRVIELGGFSRAAAAASMTPSAVSKLIARLESRLGARLLNRSTRQLQITPEGCAFYERATRILADLEEAERAAGEGERPLGRVRINTSASYAAHILAPILPRFLALHPGITLDIVQTDLVVDLLAERTDVAVRAGPLKSSTLVARKLGETRMIIAASRDYLERHGIPQTIADLEDHNRLGFGYARSIDGWPLRDAGKTVVIPATGRVQVSDGEGLRRLALAGVGLVRLAAFTVREDIAAGRLIPVLDHLDTGETEMFHAVYVGQRGPLPARIRALLDFLGEYGRVK >CP036359|1800516:1847820|1818956_1820417_+|QBJ16072.1|DBSCAN-SWA MLDYLRFSEVKVLCIGDVMLDCFISGDVGRISPEGPVPVMCARSEEYFAGGAANVARNISSLGASCTLLGAIGRDSNGSKLLELLNAVHDVSAEFVTLDKRPTTIKTRFLGHGQQMLRVDFEDASPISSEEEDRVISTATSLISSHSVVILSDYAKGLLSERVVKAVIAACRSEGKPIIIDPKTSDFSRYAGAAIVTPNQKETQAVTGLWPDTDQEAELAGELILSRFDVEAVLITRAEKGMTLIERDHQPFHIRADAKEVFDVVGAGDTVIAALAVGLGSHISRAEAARVANAAAGIVVGKKGTATVSVAELRDALSHDTSSGEISPSDKLVAWNDIQALTEDWRSDNLKVGFTNGCFDILHVGHIRLLQYARDNCDRLVVGVNSDSSVKRLKGPARPINTEFDRAELLGALAFVDAVVVFEEDTPLELIQMIRPDVLVKGADYTVDKIVGADVVMAAGGKVVTFEIVPGKSTTATIARSQERAV >CP036359|1800516:1847820|1836947_1837292_-|QBJ16082.1|DBSCAN-SWA MIASGVVVYVSCQPVDFRKGAASLMALVRDGGLDPFNGALYAFRSKRADRIRIVWWDGSGVCLYSKTLEDQGFCWPSISAARIRLDHSQLMALLAGMDWKKIRPTKVRRPLLTG >CP036359|1800516:1847820|1830827_1831100_-|QBJ16081.1|DBSCAN-SWA MIEDDEPNLVTSGKSKRIVVDGYPFSIDIFRLETDTTWTLEVVDHNNTSHVWDEQFRSDAEARDVAVKAIETEGALAFMRGNNVIPFRQA >CP036359|1800516:1847820|1806187_1806685_-|QBJ16065.1|DBSCAN-SWA MTIHNNLSGHILLLSGHPGSGKSAIADALAHLPGVPKVHFHSDDLWGYIKHGRIDPWLPQSHQQNQMIMQIAADVAGRYAKAGYFVILDGVVRPDWLPAFTALARPLHYIVLRTTAAEAVERCLARGGDSLSDPDVVADLHSQFADLGGFAGAPTLGDRCVGDCR >CP036359|1800516:1847820|1827667_1828015_+|QBJ16078.1|DBSCAN-SWA MIPVPVGVKVWLATGYTDMRKGFPGLSLMVQEALKRDPMCGHLFVFRGRGGGLIKVIWHDGQGACLFTKKLERGRFIWPSAADGTVVITPAQLGYLLEGIDWRMPQKTWRPTSAG >CP036359|1800516:1847820|1846458_1847820_+|QBJ16089.1|transposase|DBSCAN-SWA MPKTKSSAPQNLTRMNPGAAAIDIGSTMHMAAVNPNSDDMPIRAFGTFTHDLHDLAAWFRSCGVTSVAMESTGVYWIPAFEILEQHGFEVILVNARYAKNVPGRKTDVNDAAWLRQLHSYGLLRGSFRPNAQIATLRAYLRQRERLVEYAAAHIQHMQKALMEMNLQLHHVVSDITGVTGMKIIRAIVAGERDPDVLASLRDVRCHSSVDTIRASLIGNDRDEHVFALSQSLQLYDFYQLKMIECDRKLEASIASMTAEPEAPLAPLPKIRTKTKQTNAPSFDVRAALYGLTGADLTQIHGLGSSLALKLIGECGTDLRAWPTAKHFTSWLCLAPGNKISGGKVLSSRTRRSSSRAAALLRLAATTIGRSDTALGAFYRRLAGRIGKQKAVTATARKIAVLFYNALRFRVIYRDPGAAAYDERHRGRVIANLQRRARSLGYQLEPMIAAECVS >CP036359|1800516:1847820|1806866_1807085_-|QBJ16385.1|DBSCAN-SWA MARAALGWGIVELAEHAGISTNTLVRLEKGEDLKESTIENIRFVLEQAGVKFVDNDVTFGVLVNKDDYIPLS >CP036359|1800516:1847820|1825402_1826515_+|QBJ16076.1|DBSCAN-SWA MAVFKAIEKKLRHAYKRRKLASSGIVENDGGAILVTGGLGDLIVIARFMRDLLGHLGGGSFSIFYSSPMVADMVFKSVPGFSGAYPSHFFKYAKDVYLYGLKINQFIYVEKWPGHRAMTLADTKIYELVRILEAYEKKMEPLHVVRMHHPFLDGTLGRYAAIKGRSRNDFLHYAADIPYGGDTLSLSGDDTILARHDLVPGRYITIHNGYDEAMQGLPGQRATKAYPGWGDVVSYLPEKFGEDLKVVQLGTEKTSAPIAGVDVNLIGKTSLKQALGLLANTAFHLDNEGGFVHAAAAFGKKSCVVFGPTSVDYFGYRSNLNFSPLQCGDCWWSEKTWMMACPRGDTLPACMSAYDPRVLAEEIYRWWTAM >CP036359|1800516:1847820|1815506_1816832_-|QBJ16070.1|DBSCAN-SWA MREFIPMKKLNIIFSTTRQWNPGDEFILQGVRNVLKELEVDHCPVIYNRNPDIRPSHQDRQLFRTSKLPSDFNSQVDFVDLEANLKLGFFDNSVKPDTNCEFVDWVIMSGTPEWCSARMTDLYSLILRYNLPVMIIGVGGNCDIYHESYREVISKSKLLTVRDHNTFKAVLKQGFSAEEIPCPALLSAKVNQELNIQNVKKIALIYHATADDSVIWNGFSSDAFNYTNSLYRRLLSDYAGQYQFSIVCHYIDEIPLARRDFPDLEIEYSFESKDYYHIYSQFDLVVGPRVHGVGVAASLGVPGIAISHDSRGSTTKGFLAEVLSVGTPESEALAKIADMIANIASRSAILKKHKRLTMDRYKTLVRDALNDRAVSYQSSWDISSGHTFTMEELRPLASALEAIKLGAATEKDGDRMGALAPEIRTRLINIENKINHLMSRQ >CP036359|1800516:1847820|1813369_1815133_-|QBJ16069.1|DBSCAN-SWA MIGSFLSPNDKKLLTRLFKENFKKHVGWYSAAIVAMLIVAGMTSLSAWIMRDIVNSVYLKQGFDVVLEIAFAVAVIFIVKGLATFVQSYYLSKAGNSIVAEQQRRIYDRLLKQGVSFFQNLPSSELLVRVTYNAQSARSVIDTIVTSFVRDLFSLLGLIIVMFAQNFLLSAISMIIGPLAIFSVRLVLRRVRKIMEAELASLGEIVNVVQETSIGVRVIKAFSLEKLMRQRMNKAVSDVEQRANGIAKLEAATSPIMETLSGLAIAVVIAVSGYTVLEKGGSPGDLMAFITALLLAYEPAKRLARMRVQIEGGMIGVRMMFEVLDAPLTLAEKKDAMPLPKASGDVELKNVSFEYVSGQPVLRDVSVLFEGGKMTALVGPSGSGKSTIINLMMRLYDPQNGSVEINGMDLRDVSFSSLREHVSYVGQETFLFSGTIRHNISVGRDNATEEEIIAAAKAANAHDFIIKMRDGYDTKLGENGSGLSGGQRQRVAIARAMLRDADILILDEATSALDSESEALVRDALERLTFNKTSIVIAHRLSTINRADKIVVMDNGEVVEQGSRRELLANDGLYKRLHSIQFEADVA >CP036359|1800516:1847820|1804010_1805312_+|QBJ16064.1|DBSCAN-SWA MRIVMVGSGYVGLVSGACLADFGHNVICVDKDEEKIRALKRGAIPIFEPGLDVLVDTCVKAGRLFFTTDLAAVVAEADVVFIAVGTPSRRGDGHADLGYVYEASREIGAAITGFTVIVTKSTVPVGTGDEVERIIRETNPNADFAVVSNPEFLREGAAIDDFKRPDRIVVGLSDERARPVMTEVYRPLYLNQLPLLFTSRRTSELIKYAGNGFLAMKISFINEIADLCERVGADVRDVSRGIGLDGRIGAKFLHAGPGYGGSCFPKDTLALAKTAQDYDAPLRLIETTIAVNDNRKRAMGRKVIAAAGGDVRGKKIAVLGLTFKPNTDDMRESPAIAVIQTLQDAGAIIFAYDPEGMENARQIMDIELSSGPYEAAQDADIVVLITEWNQFRALDLTRLRAVMKCPIFVDLRNVYRAREVEAYGFTYTGIGLA >CP036359|1800516:1847820|1821546_1823226_+|QBJ16074.1|DBSCAN-SWA MVQPGFDSAQSEYLDLKKWLFEAALPLWSSVGRDDVNGGFFEKIDKNGVAVEAARRTRVVCRQIYSFSAAKNMGWSGDAEGLVQHGWSFLRRHCFNANGTLIATVDAATGSKNTSFDLYDHAFALFGLSYVAETLRSAENAADDALDCLEAMISGWKHPIAGFEEAIPPIVPLRSNPHMHLFEAFLTWLENPLVKNPDRWLSCLNEIGDLCLSAFISNENGSLSEYFNHDWSVMRDNSLAPVEPGHQFEWAWLLTRWGKMAGRKDALIAARRLVAIGEKGVDESLSLARNGLDFSLNSLDGAFRLWPQTERIKAWLMMAETAITPEDRENAYAKVADAASGLKRFFSDVLPGLWIDRFDENGNVVDEHAPASSLYHIVCAIEEMHRLLEPHTQSVPGLFLDRDGVIIEDTGYPSKVEGVRLIPGAAEFISSFRERGYRVFVVTNQSGIGRGYYDDLDYIILKAHIGKLLRKEGAYIDDERLCPFHESATVEKYRGNHYWRKPSPGMIEDIVVRWNIDRKRSVLIGDKPSDIEAANAAKIDGRLFSGNNLMHFAEEENIL >CP036359|1800516:1847820|1817322_1818600_+|QBJ16071.1|DBSCAN-SWA MPDVFFDLSELYLNSGVKFKYYGIARTVMEVAYELTKLDASVRYVIYSPLHRRFFEVFPRAGDASPTGVLDPNIPTSATPLRLRQNIYDANAVKKTFYKLLHKIVQYRNRKRWATVPAGAAKDVDLHEQILVALGRPKIMSDYLMALAESGTKPIFIPLLHDMIPLHDFSHRNQFSFPRNFIHDNQVAISAASMILTNSEFTAAEVKHFSKVGTLPPVPNVIAVPLCHELRPTNEPVEKRGPEKPYLLCVGIYNGRKNLECVVEAMLLLHRRGVSVPELVLAGARRKRVEKFLKKEKFAPLFTKFHFVLNPNQAELRSLYEKAYALLLPSRMEGWGLPVSEALWCGTPALAADVPALREAGGDWARYFDPESPEELATQLENLIENPAEYAALKKNIELSKPQMRTWKDVAADLLEATKKSMLIK |
32 | Stx2-converting_phage(25.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 2246
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP036360|0:2246|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >CP036360|0:2246|1220_2246_+|QBJ16405.1|DBSCAN-SWA MTSKSRKSIVANFGLLSAELENRLSTENPTPAPQPVPTARVGAGVIGAAHRAIDDIKSERDRLKALVEAGGGTIRELDPLSIDPSPFPDRLPDDDAADFEAFRNSIRSEGQKVPIQVRKSPSSPDRYQVIYGHRRLQAARDLGIPVKAIEVEISDIELAIAQGIENADRQDLTWIERALFARRMDDAGVKPRDIKAALSIDDPELARMRSVYRVVPTEIIEAIGRAGKVGRPRWADFAKNYSERPDLHDALRNVLSGSAEKRLGSDQKFLAAFNALKPEAPQKDTGKSIAGPAGEQLGRLVRTANEVRISAHTASAKEFLSFIESELPTLAERFSREKSKN |
1 | Ochrobactrum_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
7006 : 10151
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP036360|7006:10151|DBSCAN-SWA CTCATATGGATTTCAGGCTCACGCGCTGCTCCAGCTTTTCGGCTTCTTCTTTTCGCTCGCTGTAGCGATCGGTGAGGTAGGCAGAGACGTCACGGGTCAGGATCGTGAACTTCACCAGTTCCTCGCAGACGTCGACGACGCGATCATAATAGGAAGACGGCTTCATGCGGCCGTCGGTGTCGAACTCTTGGAAAGCCTTGGCCACCGACGACTGGTTCGGGATGGTGATCATCCGCATCCAGCGACCGAGGACCCGCAGCTGGTTGACCGCATTGAACGACTGCGAACCACCGGAAACCTGCATAACGGCCAGGGTCTTGCCCTGCGTCGGCCGCACCGAACCAACCGAAAGCGGGAGCCAGTCGATCTGCGCCTTCATGATACCGGTCATGGCGCCGTGGCGTTCCGGACTAATCCAGACTTGGCCTTCCGACCAGGCAGAAAGCTCACGCAGCTCCTGCACCTGTGGATGGCTCACCGGCGCCGCATCGGGAAGCGGCAGGCCCGCTGGATTGAAGATCCGAACCTCGCAGCCAAAATGTTCGAGCAGGCGCGCCGCCTCATTCGCCAGGAGCCGGCTATAAGATACGGCGCGCAGGGAACCATAGAGGATCAGAATGCGCGGCTTGTGCGATGATAATGGAGGACGCAGGGCGCCCAAATCTGGCTGACAAAGATGCGAGAGGGATGCGGCAGGAAGATCAGACAATGCGTTTGCCCTCCGCATCGAGGACCTGCTCACCATCTTCTTTGGTGAAGGCGCCCTTGTGGGTGTCGGGCAGAATGTCCAGCACCACTTCGGATGGTCGTGCCAAGCGCGTGCCAAGCGGCGTCACGACCAGAGGCCGGTTGATTAGAATCGGGTTTTTAAGCATCGCGTTGAGCAGCTGATCGTCGCTCAGCTCCGGATTGTCGAGGTTCAACTCCGCATAGGGCGTGCTCTTTTTACGGATGGCGCCGCGCACCGTCAGGCCGGCATCAGCGATCATCTGCACAAGCTGTGCGCGCGTCGGCGGGGTTTTCAGGTATTCGATCACCGTGGGCTCAATGCCGGCGTTGCGGATCATCTCAAGGGTGTTGCGCGACGTTCCGCAGGCGGGGTTGTGATATATTGTGACGTCCATGGTCGTCAGGCCCTTTCGATGGTGGAAGTGGAGGAGCGGACGGCGGCGCCGCGCTCGTACCAGCCCTTGCTGCGGTTGACGATCCAGACCACTGACAGCATGACCGGCACTTCGATCAGCACGCCTACGACGGTCGCCAGGGCTGCTCCGGAATCGAAACCGAACAGGCTGATGGCCGCTGCCACCGCCAGTTCAAAGAAGTTGGATGCGCCAATGAGAGCGGAAGGGCCTGCAACGCAGTGCTGCTCCCCACTCGCGCGGTTGAGAAGGTAGGCCAGGCCGGAGTTGAAATAGACCTGGATGAGGATCGGCACGGCCAACAGGGCGATCACGCCCGGCTGCGCGAAGATCTGCTCGCCTTGAAAGCCGAACAATAATATCAGGGTCGCCAGTAGAGCGACCAGGGATACCGGCTGCAACCTGGCAGCGAGACGGTTTACGTCGGTCAGCCGACGCAGGATTTGCGCCGAGATTACCGGCAGGACGATATAGAGCACGACCGACAGGATAAGAGTATTCCACGGGACCGTAATGGCCGAAAGGCCGAGTAGCAGCCCGACGATCGGTGCGAAGGCCAACACCATGATCACATCGTTTAACGCGACCTGGCTCAGCGTGAAATGCGGCTCGCCTTTGGTCAGGTTCGACCAGACGAACACCATGGCGGTGCAGGGCGCGGCCGCAAGGATAATCAGGCCAGCGATATAGGAATCGATCTGGCCTGCCGGCAGGTAAGGCCGGAACAGATAGCCGATAAACAGCCAGCCAAGCAGCGCCATCGAAAACGGTTTTAATGCCCAGTTGATGAACAGGGTAACGCCGATCCCTCGCCAGTGGCGGCCGACCTGCCCGAGGGACGCGAAATCGATCTTGAGCAGCATTGGGATGATCATCAGCCAGATCAGCACTGCCACTGGCAGGTTGACCTTCGCGACTTCGGCCGCGCCGATCGTCTGGAAGGCCCCGGGCGTAAGGTGGCCGAGGGTCACCCCGACGACGATGCAGGCGATGACCCACACCGTCAGATAGCGTTCAAACGTGGACATGATTTTACCCTTGGATAGTCTTGGCAAACAGGCCGGCGGAGGCCGGGAAAAGGCTTGCCGCCTGGCCTGTTTGCAAGATTGCGGCCGGCGCGCTTGTGCGATCGACCTTCACGAAGCCCAAAATAATCAAAGAATGCAGTGGCCGTTTCTGTCAGCAGGTAAAGGGTGGGGGCGCCATCGCGCGCTGCCTGGTCAAGCATCTGGCGTGAAATGGCTTCGCCATAACCGCGTTTGCGTTGGTCGGGCAGCACGACCACAGAGCGCAACAGCGCATAGTCTCCATAGGGCTCCAGCCCAGCGAAGCCGATAATCTGGGCCCGGTCCGCAAAGCGAAAGAAGGTGCGCCCGCTTTGCTCCAGATCGTCGATCGGAAGTCCCGCCGCCTGCAGTGCTGCCTGGAGGTCCTGATCGCGCCCGCTGGCCGACTGCTGATCCAAAAAGTCGCTCACGCGACGTCGCCCTTGCGGCTGGTGCTGCCTTCCATCGTGCCGATCGTGCGGAGCTGAGTTTCCAGCGCCAGGTTGTCGATCGACGCGAGGGGCAGATTCACAAACGCGGTGATGCGGTTCTTAAGGTAGCGGGCGGCTTGGGAGAAGGCCTGGCCCTTCTCAACCAGAGAACCTTCGACGGCTGCCGGATCTTCAACGCCCCAGTGGGCAGTCATCGGGTGGCCGATCCAGACCGGACAGGCTTCGCCAGCGGCATTGTCGCAGACCGTGAAGATGAAATCCATTTGCGGCGCGCCAGGCTCCGCGAAGACATCCCAGCTTTTTGAGCTGAACCCCGTCGCCTCGTATCCGAGGCTGCGGAGCGTATCCAAAGCAAGCGGATGGACCTCGCCCTTGGGCTGGCTGCCAGCGGAGAAAGCGCGGAATCGGCCCTTGCCTTCCCCATTCAAGATGGACTCGGCGAGGATCGAACGGGCCGAATTGCCGGTGCAGAGAAACAGGACGTTATAAACGCGATCAGCGGTCAT
Protein sequences of DBSCAN-SWA_2 >CP036360|7006:10151|7006_7714_-|QBJ16411.1|DBSCAN-SWA MSDLPAASLSHLCQPDLGALRPPLSSHKPRILILYGSLRAVSYSRLLANEAARLLEHFGCEVRIFNPAGLPLPDAAPVSHPQVQELRELSAWSEGQVWISPERHGAMTGIMKAQIDWLPLSVGSVRPTQGKTLAVMQVSGGSQSFNAVNQLRVLGRWMRMITIPNQSSVAKAFQEFDTDGRMKPSSYYDRVVDVCEELVKFTILTRDVSAYLTDRYSERKEEAEKLEQRVSLKSI >CP036360|7006:10151|9620_10151_-|QBJ16414.1|DBSCAN-SWA MTADRVYNVLFLCTGNSARSILAESILNGEGKGRFRAFSAGSQPKGEVHPLALDTLRSLGYEATGFSSKSWDVFAEPGAPQMDFIFTVCDNAAGEACPVWIGHPMTAHWGVEDPAAVEGSLVEKGQAFSQAARYLKNRITAFVNLPLASIDNLALETQLRTIGTMEGSTSRKGDVA >CP036360|7006:10151|7706_8129_-|QBJ16412.1|DBSCAN-SWA MDVTIYHNPACGTSRNTLEMIRNAGIEPTVIEYLKTPPTRAQLVQMIADAGLTVRGAIRKKSTPYAELNLDNPELSDDQLLNAMLKNPILINRPLVVTPLGTRLARPSEVVLDILPDTHKGAFTKEDGEQVLDAEGKRIV >CP036360|7006:10151|8134_9175_-|QBJ16413.1|DBSCAN-SWA MSTFERYLTVWVIACIVVGVTLGHLTPGAFQTIGAAEVAKVNLPVAVLIWLMIIPMLLKIDFASLGQVGRHWRGIGVTLFINWALKPFSMALLGWLFIGYLFRPYLPAGQIDSYIAGLIILAAAPCTAMVFVWSNLTKGEPHFTLSQVALNDVIMVLAFAPIVGLLLGLSAITVPWNTLILSVVLYIVLPVISAQILRRLTDVNRLAARLQPVSLVALLATLILLFGFQGEQIFAQPGVIALLAVPILIQVYFNSGLAYLLNRASGEQHCVAGPSALIGASNFFELAVAAAISLFGFDSGAALATVVGVLIEVPVMLSVVWIVNRSKGWYERGAAVRSSTSTIERA |
4 | uncultured_Caudovirales_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
22991 : 24071
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP036360|22991:24071|DBSCAN-SWA ACTAGGCGGATTGGCCGTCATCACTTTCGACCAGAACCAGATCGTACCTCCCGATCGGTGGCAATCGATCGAGCATCTCTCCGCTGCCAAGGTCAAGGCGGGCCCGTGCTAGCGCGATGTCGATCGAGAAAAACCGGAACAGAATAGGTGCGCCGCCGCCGGACGGCATGGTCTCGAAGTGAGTCCAGGTCTCAGGAAGATCCTCTTCAAGCGGCAGATGGAAGTAGGTGCGGTGGAGATCATGATGCACCCCGTCTCGTTCGAAGGTGCGATGGCTGTCGCCTAGAAGCCGGAAACCTGTTCGATTCAACAATCCGGTTTCCTCGCCGAATTCACGACGGGCGGCATCGGTCACTTGTTCCCCCGGCTCGACCGTGCCTCCCGGTACTTGTATGGGAACTTCGGGAAAATCAGGCTCATCGAACACCAAGAGGCCGCGAGAGGTAGTGCCGTAAATCAAGGCCTTCTCCATCAAAGCGTTCCCTCTAGCCGATCTGTGTTCATCGGCCCAAATTCGAGTTCTGATGACATCCGCAACAGCCGGGCCATCTCCCGGGTGGGAGCGGAGCACCGCGACCGGGACATCCGGCCCATGCCGGTCGATGCATTGAATGAAAATCGGCACGTACTTTCTCTCGAAGTTCCAGATATAGGAGAGAAATTCCCGGTCCGGCAGCCGTTCAGGGCAACCCGGCGCCATGGCCGGACGCACCGAACCGTAGAAGCGGCCAACGCGTCTTGCCAGTCCCAATAAGGCAACGCGCCGGGGAACACGTACCCACAGAACCAGATCAGTTCGCGTTAGTCTCAGATCGAAAGATGATGCGCCACTGCCGTCCATGACCCATCGGCTCCGTTGCGCCAAATCGGCAACTAGAGCTCTTTGTTCCTCACGGGCACGCTCTTTCCACCCGGGTAGCCACCTGACATCGCGATCGAGCGACTGAAACTCTAACCCGAACGTAGTCGAAATGCTTCGGGAAAGGGTCGTCTTCCCGCCCCCAGAGCAGCCGATTACGAGAACTCGATCGGCTCTTGCCAGATGCGACGCCGCTTCGGTTAAAGTCACGTATCGAGGCAT
Protein sequences of DBSCAN-SWA_3 >CP036360|22991:24071|22991_24071_-|QBJ16422.1|DBSCAN-SWA MPRYVTLTEAASHLARADRVLVIGCSGGGKTTLSRSISTTFGLEFQSLDRDVRWLPGWKERAREEQRALVADLAQRSRWVMDGSGASSFDLRLTRTDLVLWVRVPRRVALLGLARRVGRFYGSVRPAMAPGCPERLPDREFLSYIWNFERKYVPIFIQCIDRHGPDVPVAVLRSHPGDGPAVADVIRTRIWADEHRSARGNALMEKALIYGTTSRGLLVFDEPDFPEVPIQVPGGTVEPGEQVTDAARREFGEETGLLNRTGFRLLGDSHRTFERDGVHHDLHRTYFHLPLEEDLPETWTHFETMPSGGGAPILFRFFSIDIALARARLDLGSGEMLDRLPPIGRYDLVLVESDDGQSA |
1 | Leuconostoc_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
32977 : 34303
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP036360|32977:34303|DBSCAN-SWA AATGCTGGACGAGCTTCAGAGGTTGGTGGACAATCCTGTGGAGTCCCTCGAGGTCGAGCTTAAATCGTCGCTGGATCTGAAGGACGACAGGCAACGGGCAAGTCTGGCACGCCATATCGCCGCGCTTGCCAATCATGGAGGTGGAGTTGTTGTTTTTGGTTTCAACGACGATTTTAGCCCATGCGACCCTTCAACAGTTTTACCGCTCGACAGAGACACAGTTTCATCAATCGTCAAAAAATATCTCGAGCCAGCGTTTCAGTGTGATGTTCGTTTAGTGAGGTCGGGAAGGGGACAGGACCACAGTGTCATTGTCGTGCCTCCTCATGGGGCGGCGCCGATATGCGCCAAGGCAGGCGGTCCCGAGGATCGCGGCAAGGCTATCGGCATTGTCAAAGGTTCTTATTACATCCGTAAGGTGGGCCCTTCGAGCGAGGCGATTTCATCCGCGACAGAGTGGGCGCCCCTGATCCGTCGGTGTACGATTCACGATCGGCAGGCATTGTTCGCGGCCCTGCACGGTGCGCTGGCACCTACCCAGCCGCCGCAGGAAGACAAAGCCACTCTATGGCACGAAGCCACGCAACAGGCGGCTCTCAACGCCGTCTCAGGACTTGGCGAAAAAAGTCCCTACAGGACTGGCTGGAAGCAGTTCACTTACGTCATAACGCCTGAAGGCCACCTCCCTGAGAGCTCGTTCCTGCGCCTTCTCGCTGAAGTCAACAACGAAGTGCATGATCGGGTTAACACTGGGTGGAGCATGTTTTTTCCATTCGGGGCTCCTGGTGGCGAATACTGGCAAACAGACGCGGCGTCAGGCGAAGGTGAAGGCGACTTTATCGAATGTTCCCTCGGTAAAGACCCAAAAATGTTAGGTAGAGACTTTTGGAGGATCAGTCCGTCCGGCAAGGCGAGCCTTCTTCGGGAATACTGGGAAGACTCAAGTTTTCACCAGGAGAAAGGCATTCCTTCCGGCTCGTTGTTCGACGTGGATTTGTTTGCTCAGAATGTTGCTGAGATCGTTCGTCATGCCCAAGCTCTGGGCGTACGCATTGAGCACGCCGAAAGTGTTTCATTCATCTGCGAAGTCACAGGCCTCACGGGTCGTCGGGGGGGCACGTACAGAGGATATGTGAGGTCATTTGGCGCGGAGGCCCAGGCTGACAGAAGAAAAGTCAGCGCCAGCTTCCCATTGGCTGCACTGGTAGACGATTGGCCGGAGGTCGTAGAGCGTCTCGCTACCCCGGTAGCCAGGATCTTCCAGGCAGAACAACTCACCACCGCTAAGGCAATCCGCGCCAGTGCAGAGAGATGGCTTCGCTTCTGA
Protein sequences of DBSCAN-SWA_4 >CP036360|32977:34303|32977_34303_+|QBJ16429.1|DBSCAN-SWA MLDELQRLVDNPVESLEVELKSSLDLKDDRQRASLARHIAALANHGGGVVVFGFNDDFSPCDPSTVLPLDRDTVSSIVKKYLEPAFQCDVRLVRSGRGQDHSVIVVPPHGAAPICAKAGGPEDRGKAIGIVKGSYYIRKVGPSSEAISSATEWAPLIRRCTIHDRQALFAALHGALAPTQPPQEDKATLWHEATQQAALNAVSGLGEKSPYRTGWKQFTYVITPEGHLPESSFLRLLAEVNNEVHDRVNTGWSMFFPFGAPGGEYWQTDAASGEGEGDFIECSLGKDPKMLGRDFWRISPSGKASLLREYWEDSSFHQEKGIPSGSLFDVDLFAQNVAEIVRHAQALGVRIEHAESVSFICEVTGLTGRRGGTYRGYVRSFGAEAQADRRKVSASFPLAALVDDWPEVVERLATPVARIFQAEQLTTAKAIRASAERWLRF |
1 | Burkholderia_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
37650 : 38433
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP036360|37650:38433|DBSCAN-SWA GTTGAACGCTCATACGCAAACAGCTTCCAACGTCGTTGAAGCCATCGACGTTCACAAATCGTTCGGTCCTCTGGAGGTTCTCAAGGGTATCAACCTCTCCGTACGAAAGGGCGAGGTAGTCTGCCTCCTCGGACCTTCGGGAAGTGGCAAGTCGACCTTCCTGCGCTGCATAAACCACTTGGAGCGCATGACGCGGGGGCGAATTCTCGTAGACGGTCAACTGATCGGCTATCACGAGCGCGGTGGAGCCCTGCATGAGATGAGCGGCAGGGAGCTAGCAGTTCAGCGCCGGGACATCGGCATGGTCTTCCAACGCTTCAACCTGTTCGCCCACCAAAATGTCCTCACCAACATTTCCCAGGCGCCGATCCTCAACCGGGGTCAGTCAAAGGACACCGCCGTCGCGCGCGCGAAGGAGTTGCTCAAGATGGTGGATCTGGAGGGGCGGGAGCAGGCCTACCCTAACCAGTTGTCCGGCGGGCAGCAGCAGCGCGTGGCGATTGCCCGTGCTCTGGCTATGGATCCCAAGCTGATGCTGTTTGATGAGCCAACCTCAGCGCTCGATCCCGAGCTGGTCGGGGAAGTGCTGGGAGTCATGCGTAAACTTGCAGCTGATGGGATGACTATGATCGTGGTTACACACGAGATGGCTTTCGCCCGCGATGTAGCCGACCGCGTGGTCTTCATGGACCAGGGTGTGGTTGTCGAGGAGGGGCCGGCGCGGGATGTCATCGACAACCCGCAGCAGGAAAGAACACGAACGTTTCTCAAGCGAATGCACTGA
Protein sequences of DBSCAN-SWA_5 >CP036360|37650:38433|37650_38433_+|QBJ16432.1|DBSCAN-SWA MNAHTQTASNVVEAIDVHKSFGPLEVLKGINLSVRKGEVVCLLGPSGSGKSTFLRCINHLERMTRGRILVDGQLIGYHERGGALHEMSGRELAVQRRDIGMVFQRFNLFAHQNVLTNISQAPILNRGQSKDTAVARAKELLKMVDLEGREQAYPNQLSGGQQQRVAIARALAMDPKLMLFDEPTSALDPELVGEVLGVMRKLAADGMTMIVVTHEMAFARDVADRVVFMDQGVVVEEGPARDVIDNPQQERTRTFLKRMH |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
53318 : 54089
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP036360|53318:54089|DBSCAN-SWA GATGATCAACGAGCCAACACCGCGATCCGTCGGGAGTGCGCTGCCGAGCGTTGAGCTGCACGGCGTCAACAAGTGGTACGGCCATTTCCAGGTGTTGACCGACATCGACTTTACAGTCGCCAAAGGCGAGAAGATCGTGGTTTGCGGTCCGTCTGGATCGGGTAAGTCCACCATGATCCGCTGTATCAACCGTCTTGAGGTGCATCAGAAGGGCGAGATTCGAGTGAAAGGCACCCTGCTCACGGAAAATCGCCGGGCGATCGATCGCGTCCGTCAAGATGTCGGCATGGTGTTCCAGAACTTCAATCTTTACCCGCACCTAACCATTCTTGAAAACTGCGTTCTGGCGCCCCGCTGGGTGAAGAACGTGCCGCGCCCGGAAGCTGAAGCGCGCGCCATGCACTATCTCGAACGGGTGCGCATTCCCGATCAGGCGCACAAATATCCGGTCCAACTGTCCGGCGGCCAGCAGCAGCGTGCAGCGATTGCCCGCTCGCTCTGCATGAACCCGGAAATCATGCTGTTCGACGAGCCGACCTCGGCACTCGATCCCGAGCTGGTGCGGGAGGTGCTCGACACCATGGTGGAACTGGCAAGCGACGGCATGACGATGATCTGCGTCACGCACGAAATGGGCTTTGCCCGGCAGGTGGCCGATCGCGTGGTGTTCATGGATCGTGGCAGCATTGTCGAGGCCGCCGCCCCGGCTGAATTTTTCGGTGCCCCGAGGCAGGAGCGCACCCGCAATTTCCTGAGCCAGATTCTTCACTGA
Protein sequences of DBSCAN-SWA_6 >CP036360|53318:54089|53318_54089_+|QBJ16445.1|DBSCAN-SWA MINEPTPRSVGSALPSVELHGVNKWYGHFQVLTDIDFTVAKGEKIVVCGPSGSGKSTMIRCINRLEVHQKGEIRVKGTLLTENRRAIDRVRQDVGMVFQNFNLYPHLTILENCVLAPRWVKNVPRPEAEARAMHYLERVRIPDQAHKYPVQLSGGQQQRAAIARSLCMNPEIMLFDEPTSALDPELVREVLDTMVELASDGMTMICVTHEMGFARQVADRVVFMDRGSIVEAAAPAEFFGAPRQERTRNFLSQILH |
1 | Bacillus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
66499 : 67522
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP036360|66499:67522|DBSCAN-SWA GATGACGGATGCCGCGGAAGTTCTGACCGAAGCGACAGCGCTCGCATTCCGCGGCGTCCTCACAACCGGCGTCTACTGCAGGGCGACCTGCATCTCCCGTCCTCCCCGGCCAGAGAACATGCGCTGGTTCGGGTGCGTCTCCGACGCACAAAGAGCAGGCTTTCGCCCGTGCCTGCGATGCCGTCCCAATGATGAGGAATTCGCAAGGAGGAATGCCGATCTGGTCGCCGAGGCGTGCAAGCTGATTGATGCCGGAGACAGCTCGCCGACCGTCGCGGCGCTCGCGCACGCCCTGGGGATCAGTGAAGGGCACTTCCACCGAACGTTCCGTGCACACACGGGAATGACCCCGCGAGCATACATGCTCGAGAGGAGGGCTCAACTCGTCCGGGAAGGACTAAAACCGGGCAGGACCATTACATCGACGTTCTATGACGCAGGATATGGTTCCAGCGGAAGGTTCTACGCGGTATGCAGCCGCGCGCTCGGAATGGCTCCGGCAGACTACAGGGCCGGCGGGCGCAGGGAAACGCTGCACTTTGCAGTCGGCCAGACATCGCTTGGTTCGATCCTGGTTGCATCGAGCGCCAAGGGCGTTGCGGCAATTCTCCTTGGAGATGATCCTGCTGCCCTCCTGACTGATCTTCAGGATCGATTTCCGAACGCGAACTTTGTCGGAGGGGACGAGCGATACGAGGATACCGTCGCCAAGGTTGTTGGAATGGTCGAGCGCCCGGAGGTCGGCCTCGACTTACCGCTCGACCTGCGTGGGACTACCTTCCAGCGTCGCGTCTGGCTGGCGTTGCGCAATATTTCTCCGGGCGAGATCATCAGCTATGAAGATCTCGCAGCTAGGGTTGGATATCCGAAGGCCGGTAAGGCTGTGGCTAGCGCCTGCGCGAGCAATCCACTCGCCATCGCTATTCCCTGTCACCGTGTTGTGAGAAAAGATGGAGCGTCATTCCGCTATACCTGGGGAATCGAGCGGAAGGCTGAGCTACTGCGAAGAGAGGGACGACATTAA
Protein sequences of DBSCAN-SWA_7 >CP036360|66499:67522|66499_67522_+|QBJ16460.1|DBSCAN-SWA MTDAAEVLTEATALAFRGVLTTGVYCRATCISRPPRPENMRWFGCVSDAQRAGFRPCLRCRPNDEEFARRNADLVAEACKLIDAGDSSPTVAALAHALGISEGHFHRTFRAHTGMTPRAYMLERRAQLVREGLKPGRTITSTFYDAGYGSSGRFYAVCSRALGMAPADYRAGGRRETLHFAVGQTSLGSILVASSAKGVAAILLGDDPAALLTDLQDRFPNANFVGGDERYEDTVAKVVGMVERPEVGLDLPLDLRGTTFQRRVWLALRNISPGEIISYEDLAARVGYPKAGKAVASACASNPLAIAIPCHRVVRKDGASFRYTWGIERKAELLRREGRH |
1 | Acanthamoeba_polyphaga_mimivirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
71498 : 78668
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP036360|71498:78668|DBSCAN-SWA ATCACTTGTTCAGTGCATCCTTGAGCGCCTTGGCAGGCGCAAAGGTCAGCTTCTTTGATGCCGCAACCGTCATCTTCTCTCCGGTCGCAGGATTGCGGCCCTCGCGCTCAGGCGATGCCTTCACCTTGAATTTTCCGAAGCCCGGGATCGAGGTTTCAGCATCGGAGCCTGCGGCATCGGCGATTGCCTGGAAGACGGCCTCGACAATGCCCTTGGCCTGAACCTTCGTCAGGCCATTGTCGGCTGCAATCTTGTCGGCAATTTCATTGGTGGTGGTCATGGAATTCCTCGCATCTTGTTGTCAAAGATAAAATCCGTGTCAGCTTATGACCAGCTTGTCATCGGCGACGATGGAGACGAGGATCAGATCCGTGGGATGTCCACAGGTCGTTTTGGCCGATAACCCCGTTTGCGCTTCTCGCGAAGCACGTCGAGGAACAGGTTGATGGCCTGGCTCTCGTTGTCAAACAGATGCACCATCTGCTGTCCGCGCGTTCCAATCCGCCCCCAGGCGCGCACCAACGACACCTCGCCGAACAATGTCGGCTGAACAGCAAGCGTGTAGAACCGCGCCATGTTCTTTTCCGGTGCGATACGCTCGACATAGAGGTGGTAAGGCTGTGTGATCATAAGAGCAGAATCGCGCGTTCGGATTCTGCCGTCCAATGACAGTTTTGAATCGGACGGGCATCATCGATTCACGATAGTGATGCGCCGGCGCCCGGATCATTCACACCAGCTCCGGCGCGATCCTTCCCACTTCCGAGCCAAGCCTTCGGCGACGAGAATATCGCCGACCGATCGTCGCTCCCGATAGACGGTCCTCAGGTTGCGACCGAAACGATCTTCGTCCCTAGCGGTCGTCTTAAACGAGAGCGGACCCGAATTGAGAGTGTCGAACAGCCGCTGTTTCGCCGCAAGACCTCGCTCGCGTTCGCGCTCACATCTCGGCGGCCTGAGCTCCGGCGTGTCGATGTCCGCTATCCGGATCTTCTCGCCTCGAAACCAGAACGTATCACCATCCGCCACGCACGTGATCCGTTGCCCGCTTCTGCAGACAGAGAAGGTTTCGCTCGCGCTACCTGGCGACCTTGAGGCAACAGATACGAGGATGCCAATGGCTAAAGGGGACAAGCTGGCAAATGCCGTACAAAGTCTCAATGTCATGTCTCCACAGTGTCAGTAGTCAGAGCGGCCTTATAGCTCTTCACCCCGTCCGTGATCGGCTTTTGAAAGCGGTTTTCTCCGACGGACGCACTCATATCCTTGCGTCGGAACCATATGGCGCGTCCGCATCGAAGCGGCGGATTGCCCGATGTGAACACGATCTGTTCGTCGGCGCGCATGCGCAGAATCTCGTGTGGAAGGATCAGTGGCCGGCGGCTGAGCTGCTTCGTCCGCGATCTCGACGATCCCTTCATTCCGGATGAGAGACTGGTCTGGTCGATCTCGACCGTGGTGTCGCCACATCGCTTCGAGATGTAGTCGGCAGTGTCGGGATCGTTGATTGCCGAGAACGAAATCCAGGACGCGGATTCGAACCATTTGCTCGTCGCATCACGCCCACCGTAAGCCTCGCGCATCTGACCCAACGACTGGAAGATCATCGTCAGCGTGATGCCGTATTTGCGGCCTGCGTCTCGAGCGGTTTCAAGGATGCGCAGATAGCCGAGACGCGCCACCTCATCGAGCAGGAAAAGGGTGCGGCCCTTCACATCACCGTTGCGGTTGTAGATTGCGTTCAGCAGCGAACCGATAACAACGCGCGCCATCCCAGGATGCGCTTCCAGCACTTTCAGATCGAGCGCGATGAAAATGTCGGTCTCGCCGTTGGCGAGATCGTCCGTCGAGAAACTGTCGCCGGAGACGAGAGCAGCATAGTTGGGGTAGGACAGCCAGTGGGTTTCCTTGACCGCATTGGCGTAGACACCGGAAAATGTCTCCGGCGTCATGTTGACGAAGACGGCGACGTTTTCCTTCACAAAGTCGGACTCCGACCCCTCGTAGATTTTCGTCAGGCGAGCTCGAAGCTGCGGCTCGGGCTCGGACAGATTGGCCCGCACACGACGCAAAGTCTGCTCCTTTTCATCGGTGTGCCCGGACAAACAAACGTCCGCGATCAGCGCCGTCAGAAGCTGCATGGCGGAGGCCCTGAAGAAGTCGTCACGGGCGGACGCGGTGCGCGCATTGTCGGTCATGATCCATGTGGCGACGGCGACGATATCCTCCTCCTTGGTATTGCCGTGACGGCCAATCCAGTCCAGCGCATTGAACCCGGCACCGCCTGCCGTCGGATCGAGCACGATCACCTTGCGGCCAGCCTTACGCCGATGGTCGGAAACCATCGGTGCGACCTCGCTCGACGGATCCAGCACAACAAGTCCGCCACCCCATTTAAGCGCGGTCGGGATCGTCACTGATGTCGTCTTGAAACCACCGGAACCGGCGAAGACGATGCCGTGCGACGAGCCGAAGGCGCCGTCGAAGCAGAGCAGCGGCGATCTGCCGCCGGCACCCCAGCTTTTTTGATCATCAGCACTGAACGGCATGGCGGCAACGCTGTCGCGATCAACGCGATAGCGCTCACCGATAACGATGCCGCCGGGCTCGGGAAACAGCTTTGCCGCCTCCTGGATCTTCATCCAGTCGGCTTCGCCATGCACGGCGCGTTTGCCGCCAATCCGGCGAGGGCCGCCGCTTGCAAAGGCCGCATTGCCCTTGATGGCGACGCGAAGCGCGAAGACGCCACCCAGGAAGGCGAGACCCGCGCCGACCACGGTCGCAGGATCGGCATAGGCGAGAACCGATTGTCCGGCCGGTACGTTCGCGGCGATGCCGGTCAGACGGATCGTTTCCCGCACGATCGCGATGATGGTGGTCACGCCGCTTCCGGCAAGCACGCTCACACCGGCGGTCTTGATATTTGTCGAACCGTTGGCAGCGAATAGCGCGACGACGCCGATCGCGGCGGCCGCGAGATAAGGCAGGGCGAGACCGGCCCTTCCGAGCAGCAGCTTCGCTTGCGCCGATGTTCCGAGCGCCGCCAGCCGATGTTCCATGCCTGACGACAGTATCATGGCGGCGAGCATGATTGCTGCCGGCAGGATTGCCAGAAGCAGCCTATTCGGCGTCATTGCCAAAGGCCTCCGCACCGATTGCCGCCAGTCGAGCCTTCTCCGTCTCGTCGCCCTTGATCCGCTTGCTCGCGTCGATCAGAAGGCCGAGCAACAGAGAACGCTTCTCGTACCGCAGTCCCGCCTTGACGATCAGGCCGCCGAGTTCGATCTTTTCGCGCGTGTCCCTTTTGCGCGCCTCTGTCGTCGTTGATTTCGCCATTCGCTCAAGCCTCAACAGTCCCGCCCGCATCCGCGCCAGACGGCTGTGCCGTGGCCGACCCGCTTGAGGCTCGGCCGTCCCCTGTGTCCTTCTTCCCGGTCGCTCGACCTTTGCCTCCGCGAAACCGCTTGGCGATTTCCTCGAAGGCAGCCTGAAGCTCTGCCTCGTCGATCTCAATCTCCCCAAGACCCGCCTTGAGTGCGATGCGCCCGATGCGTTCAGCTTCACGGGTCTCGGCCTGCTTGAGCTGATCCTGCAATCTGGCGATTTCTTCCCTGATCTTCGATGACGGCTTCTTCATTCCGGTTGCTCCTCTGAACTCTGGGCGTTGCATTTGCGACGCCCAGACTCTCCCGGATGTCCCTAAATTGAAATCTGCAGATCTGCAGATCGCCAAGGGCGATGCTTTTGTGAATGATCCCGCCGTTCCGAAGGAATGGCTCCAAGGGCGCAATTATACGTCGCGATGCGACGCGTTGCTTTTGGCCGTCTCTCCCGACGAACCTGGTTCTAACCGGGAGCGCTTTTCGCCGTGGCCATCGCCCATTTCTCAGCCAGCATCGTCAGCCGCGGCTCGGGCCGTAGCGTCGTGCTGTCTGCGGCCTACCGGCACTGCGCGAAGGTGGAGTACGAGCGCGAGGCCCGCACCATTGACTACACTCGCAAGCAGGGCTTGCTGCACGAGGAGTTTGTATTACCCGCCGGCGCCCCGAAATGGGCTCGCTCCCTGATCGCTGATCGTTCTGTCTCCGGAGCGTCGGAAGCCTTCTGGAACAACGTCGAAGCTTTCGAGAAGCGCGCCGATGCTCAGCTCGCCCGTGACCTGACCATCGCCTTGCCCCTGGAACTGTCCATGGAGCAGAACATCGCCCTGGTCCGGGATTTTGTTGACAAACACATCCTCGCCAAAGGCATGGTGGCCGACTGGGTTTATCACGACAATCCCGGCAATCCGCATATCCACCTGATGACCACGTTGCGGCCGCTGACGGAGGATGGCTTCGGCTCGAAGAAGGTCGCAATCATTAGCGACGACGGCCAGCCGGTCCGTACGAAGTCCGGAAAGATCCTCTACGAACTCTGGGCCGGTTCAACCGATGACTTCAATATGCTGCGAGACGGCTGGTTCGAACGGCTGAACCATCATCTGGCGCTCGGCGGCATCGATCTGAGGGTTGATGGTCGTTCCTACGAAAAGCAGGGTATCGATCTCGAGCCCACCATTCATCTCGGCGTGCGGGCAAAGGCGATCGAGCGCAAGGCCGAGCGACAGGGTGTCCGCCCAGAGTTGGAGCGGATCGAGCTGAACGAGCGGCGACGCACGGAAAACGCACGTCGCATCTTAAAGAATCCTGCCATCGTGCTCGACCTGATCACGCGGGAAAAGAGCGTCTTCAACGAGCGGGACGTGGCCAAGGTTCTGTATCGCTACATCGACGATCCGGCTGTCTTCCAGCAGTTGATGATCAGGATCATCCTGAACCCGGACGTTCTGCGGCTGGAGCGGGACACCATTGATTTCGCCACGGGCGAGAAACTGCCGGCGCGCTACTCCACGCGAGCGATGATCCGGCTGGAAGCGACGATGGCGCGGCAGGCCATGTGGCTGTCGGCTAAGAAGGCGCATGCCGTCTCCGAGGCTGTACTGGATGTGACGTTCCGGCGTCATGAGCGGCTATCCGACGAGCAGAAAGCAGCGATTGAGCGCATTGCAGGACCTGCCCGCATCGCTGCCGTCGTCGGCCGCGCCGGGGCCGGCAAGACCACGATGATGAAGGCGGCTAGTGAAGCCTGGGAACTGGCCGGATATCGCGTCGTTGGCGGCGCGCTTGCAGGCAAGGCAGCTGAGGGCCTGGAGAAGGAAGCCGGGATCCAGAGCCGCACGCTTGCGTCATGGGAGCTGCGCTGGGAACGCGGCCGTGACGTCCTGGACGACAAGACGACCTTCGTCATGGACGAGGCCGGCATGGTTGCCTCGAAACAAATGGCTGGGTTCGTCGATTCCGTCGTAAGGACGGGTGCGAAGATCGTGCTGGTTGGCGATCCCGAACAGCTCCAGCCGATCGAGGCGGGGGCAGCTTTCCGCGTCATTGTCGATCGCATTGGCTACGCCGAACTCGAAACCATCTATCGCCAGCGCGACGACTGGATGCGGAAGGCATCGCTCGATCTTGCGCGCGGCAATGTCGAAAAGGCTCTCGTTGCCTATGAAGGTCAGGGCAGGGTGCTCGGCTCTCGTCTGAAGTTCGAAGCCGTCGAGAGGCTGATCGCCGATTGGAACCGCGACTACGATCAGACGAAGACAACATTGATCCTCGCCCATCTGCGTCGCGACGTGCGCATGCTCAACGTCATGGCCCGCGAGAAGCTGGTCGAACGCGGTATTGTCGGTGGAGGTCATGTCTTCAAAACGGCCGATGGCGAGCGCCGTTTCGATGTCGGCGACCAGATCGTCTTTTTGAAGAACGAGGGGTCGCTCGGCGTTAAGAACGGCATGATCGGTCATGTCGTCGAGGCCCAACCGAACCGCATCGTGGCCTTGGTCGGCGAGAGGGATCATCGGCGCCAGGTGATCGTTGAACAGCGGTTCTACAACAATCTCGATCATGGATACGCGACGACGATCCACAAGTCTCAAGGAGCGACAGTCGACAGGGTCAAGGTGCTCGCCTCGCTGTCTATGGATCGCCATCTGACCTATGTGGCGATGACCCGACATCGCGAGGATCTGCAGCTCTACTACGGGACCAGGTCCTTCTCCTTCAATGGTGGCCTGGCCAAGGTTCTATCCCGAAGACAAGCCAAGGAGACGACGCTCGAGTACGAGCGCGGCCAGCTCTATCGCGAGGCGCTGCGTTTCGCCGAAACGCGCGGCCTGCACATTGTCCAGGTCGCTCGCACGCTGGTCCGCGACCGGCTTGACTGGACTCTGCGCCAGAAGGCCAAACTCGTCGATCTCGGCCAGCGCCTTAGCGCCTTCGCGGCGCGCCTCGGCTTCACCGAAAGCCCTGACACTCAAACGATGAAGGAGGCCGCGCCCATGGTTGCAGGCATCAAGACATTTTCCGGCACTGTTGCCGATACGGTCGGTGACAAGCTTGGCGAGGATCCCGGCCTGAAACGGCAATGGGAGGAAGTCTCGGCGCGTTTCCGTTATGTCTTCGCCGATCCTGAAACAGCCTTCCGGGCTATGAATTTCGATGTCGTGATCGCCGACAAGGACGCTGCCCGTCAAACGCTCGAAACGCTGGAGACGAACGCCGCATCGATCGGTCCGCTCAAGGGCAAGTCTGGTATTCTTGCCAGCAAGGCGGAGCGTGAGGCGCGTAGCGTCGCAGAGGTGAATGTTCCGGCCCTGAAGCGCGACCTCGACAAGTATCTGCGTATGCGCGAGGCGGCGGCGCAACATATAGCGGCTGATGAACAGGCTTTGCGCCACCAGGTCTCGATCGATATTCCGGCGCTGTCGCCGGCGGCGCGCGTGGTGCTGGAGCGGGTGCGTGATGCGATCGATCGCAACGATCTGCCAGCCGCGATGGCCTATGCGCTGAGCAATCGCGAGACCAAGCTCGAAATCGACGGCTTCAACCGGGCGGTGACCGAGCGGTTCGGGGAACGGACGCTGCTGAGCAACGCGGCGCAAGAGCCTTCGGGAGCGCTGTACGAAAAGCTTTCCGAGGGTATGAAGCCGGAGCAAAAGGAACAGTTGAAACAAGCCTGGCCAGTCATGCGGACAGCGCAACAGCTCGCCGCCCACGAGCGCACCGTCCAGTCGCTAAGACAAGCGGAGGAACATCGTCTTATGCAGCGCCAGACGCCGGTGCTGAAGCAATGA
Protein sequences of DBSCAN-SWA_8 >CP036360|71498:78668|75365_78668_+|QBJ16473.1|DBSCAN-SWA MAIAHFSASIVSRGSGRSVVLSAAYRHCAKVEYEREARTIDYTRKQGLLHEEFVLPAGAPKWARSLIADRSVSGASEAFWNNVEAFEKRADAQLARDLTIALPLELSMEQNIALVRDFVDKHILAKGMVADWVYHDNPGNPHIHLMTTLRPLTEDGFGSKKVAIISDDGQPVRTKSGKILYELWAGSTDDFNMLRDGWFERLNHHLALGGIDLRVDGRSYEKQGIDLEPTIHLGVRAKAIERKAERQGVRPELERIELNERRRTENARRILKNPAIVLDLITREKSVFNERDVAKVLYRYIDDPAVFQQLMIRIILNPDVLRLERDTIDFATGEKLPARYSTRAMIRLEATMARQAMWLSAKKAHAVSEAVLDVTFRRHERLSDEQKAAIERIAGPARIAAVVGRAGAGKTTMMKAASEAWELAGYRVVGGALAGKAAEGLEKEAGIQSRTLASWELRWERGRDVLDDKTTFVMDEAGMVASKQMAGFVDSVVRTGAKIVLVGDPEQLQPIEAGAAFRVIVDRIGYAELETIYRQRDDWMRKASLDLARGNVEKALVAYEGQGRVLGSRLKFEAVERLIADWNRDYDQTKTTLILAHLRRDVRMLNVMAREKLVERGIVGGGHVFKTADGERRFDVGDQIVFLKNEGSLGVKNGMIGHVVEAQPNRIVALVGERDHRRQVIVEQRFYNNLDHGYATTIHKSQGATVDRVKVLASLSMDRHLTYVAMTRHREDLQLYYGTRSFSFNGGLAKVLSRRQAKETTLEYERGQLYREALRFAETRGLHIVQVARTLVRDRLDWTLRQKAKLVDLGQRLSAFAARLGFTESPDTQTMKEAAPMVAGIKTFSGTVADTVGDKLGEDPGLKRQWEEVSARFRYVFADPETAFRAMNFDVVIADKDAARQTLETLETNAASIGPLKGKSGILASKAEREARSVAEVNVPALKRDLDKYLRMREAAAQHIAADEQALRHQVSIDIPALSPAARVVLERVRDAIDRNDLPAAMAYALSNRETKLEIDGFNRAVTERFGERTLLSNAAQEPSGALYEKLSEGMKPEQKEQLKQAWPVMRTAQQLAAHERTVQSLRQAEEHRLMQRQTPVLKQ >CP036360|71498:78668|71498_71777_-|QBJ16467.1|DBSCAN-SWA MTTTNEIADKIAADNGLTKVQAKGIVEAVFQAIADAAGSDAETSIPGFGKFKVKASPEREGRNPATGEKMTVAASKKLTFAPAKALKDALNK >CP036360|71498:78668|74617_74863_-|QBJ16471.1|DBSCAN-SWA MRAGLLRLERMAKSTTTEARKRDTREKIELGGLIVKAGLRYEKRSLLLGLLIDASKRIKGDETEKARLAAIGAEAFGNDAE >CP036360|71498:78668|71860_72127_-|QBJ16468.1|DBSCAN-SWA MITQPYHLYVERIAPEKNMARFYTLAVQPTLFGEVSLVRAWGRIGTRGQQMVHLFDNESQAINLFLDVLREKRKRGYRPKRPVDIPRI >CP036360|71498:78668|72223_72664_-|QBJ16469.1|DBSCAN-SWA MTLRLCTAFASLSPLAIGILVSVASRSPGSASETFSVCRSGQRITCVADGDTFWFRGEKIRIADIDTPELRPPRCERERERGLAAKQRLFDTLNSGPLSFKTTARDEDRFGRNLRTVYRERRSVGDILVAEGLARKWEGSRRSWCE >CP036360|71498:78668|74837_75134_-|QBJ16472.1|DBSCAN-SWA MKKPSSKIREEIARLQDQLKQAETREAERIGRIALKAGLGEIEIDEAELQAAFEEIAKRFRGGKGRATGKKDTGDGRASSGSATAQPSGADAGGTVEA >CP036360|71498:78668|72660_74631_-|QBJ16470.1|DBSCAN-SWA MTPNRLLLAILPAAIMLAAMILSSGMEHRLAALGTSAQAKLLLGRAGLALPYLAAAAIGVVALFAANGSTNIKTAGVSVLAGSGVTTIIAIVRETIRLTGIAANVPAGQSVLAYADPATVVGAGLAFLGGVFALRVAIKGNAAFASGGPRRIGGKRAVHGEADWMKIQEAAKLFPEPGGIVIGERYRVDRDSVAAMPFSADDQKSWGAGGRSPLLCFDGAFGSSHGIVFAGSGGFKTTSVTIPTALKWGGGLVVLDPSSEVAPMVSDHRRKAGRKVIVLDPTAGGAGFNALDWIGRHGNTKEEDIVAVATWIMTDNARTASARDDFFRASAMQLLTALIADVCLSGHTDEKEQTLRRVRANLSEPEPQLRARLTKIYEGSESDFVKENVAVFVNMTPETFSGVYANAVKETHWLSYPNYAALVSGDSFSTDDLANGETDIFIALDLKVLEAHPGMARVVIGSLLNAIYNRNGDVKGRTLFLLDEVARLGYLRILETARDAGRKYGITLTMIFQSLGQMREAYGGRDATSKWFESASWISFSAINDPDTADYISKRCGDTTVEIDQTSLSSGMKGSSRSRTKQLSRRPLILPHEILRMRADEQIVFTSGNPPLRCGRAIWFRRKDMSASVGENRFQKPITDGVKSYKAALTTDTVET |
7 | Burkholderia_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
89708 : 98286
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP036360|89708:98286|DBSCAN-SWA TATGACTAGCAGCTTCGTTACCACCAAGGACGGCGTCGAAATCTTCTATAAGGACTGGGGTCCGAAGGACGCGCAGCCGATTGTCTTCCATCATGGTTGGCCGCTGTCGTCGGATGACTGGGACGCGCAGATGCTGTTTTTCCTGTCCAAGGGCTACCGTGTTGTCGCGCATGACCGTCGTGGTCATGGCCGATCCACCCAGGTCGCAGACGGTCATGACATGGACCACTACGCCTCCGATGCTTTTGCTGTCATCGAAGCGCTAGATCTCAAGAATGCCGTCCATATCGGCCATTCCACCGGTGGCGGGGAAGTCGCCCGCTATGTCGCCAAGCATGGCCAGCCGGCAGGTCGCGTAGCTAAAGCCGTTCTCGTTTCCGCCGTCCCGCCTCTGATGGTTAAGACGGACACAAATCCGGAGGGCTTGCCGCTGGACGTGTTCGACGGCTTCCGTTCCGCTCTTGCTGCTAACCGCGCGCAGTTCTTCCGCGATGTTCCTTCCGGCCCGTTCTATGGCTTCAACCGTGACGGTGCCACCGTCCATGAAGGTGTGATCCAGAACTGGTGGCGTCAGGGCATGATGGGTGGCGCAAAGGCCCATTACGATGGTATCAAGGCCTTCTCGGAAACGGACCAGACGGAAGACCTGAAGGCGATTACCGTGCAGACGCTGGTTCTGCATGGTGAAGACGACCAGATCGTTCCGATCGCCGATGCCGCGTTGAAGTCGATCAAGCTGCTGAAAACCGGCAAGCTCAAGACTTACCCAGGGTTCTCGCATGGCATGCTGACAATTAATGCTGAAACCTTGAACGAAGATATGCTGGCGTTCATCAGGAGCTAACCCGCATTACGCTCCAGTAGAGTTTCACGAAGCGCAGAGGATTGTCAGTAGCTCGGCCACTGTGCTCTGATCAACTCACATATCTCAGCCCGCGCCTTTTTTAAAACAAAACGGCGCGGCTTCTTATCCGCTACGGCGTGTCGGACGACGGCCAGCTCTTGGGCAAGAGTGACCGCGACACCGAACCCATCACATTGAGCTGACCCGATTAAGCATCGGGTCGACGCAAGCTCGCACCCTTTATCCACGATGGCAGCAGACCTCTCGGCTCTTGCGACCACTTCTTAGCAGAAAGCAAGGATTGCCTGCGCTCCGAGGGTTTCGGAGCTCCGTAAAAGATAGCGCTCGGGCGGTAAGGGTCTCGGGCATTCAATATTGTCTCATCATGCCTCAATGACGATGTCGCGATCGCAATCGCGAACGCTCCTTTTGGCTGAGTTTTTGGAGGTCGCTATTGCCAGCTTCAACTCTTGGGTTGCACGAACCAGGTTCAGCGAAGATCGACGTCGATAGTGCAGCTACTGCGACGGGGGAGGACAACGTGTCCGCGTTCAGGATGCAGATCAGATGCGGAGCCCAAGTGAAGGGCTACAGTAAGCAGAACCTGATTATGCTGAACAGACATAATGCGGTGCCTGTTGACTGCCGCGTCGGTCGCGAAACGCTTAATCACATTGCACATTGTGGGAATATAACTGTCATTCCCGAAGGAGTTGAATTCAACGGAAGCTTCGGGGGCCCCGCTCACCTGCTTGTCGTCTCTATCCCAACTCAGCGCTTGGCTGTTTCAGCGGCTTCGAGCGGTCTTTCCTTTAGATCGTTCGGGCCGCAACTCAGCGGATTCGATGCAGGGATTGGCATTGCCGTTGAAGATTTCTCGACTTGGCTGAATTCGGGCCTCACAGAGGGTGACGGCGAACTGATCGCGGATGACCTCGTAGAACTTGTCGCGAGCTCATACTGTGGTTCTATAGAGAAGCCAGTACCACGTGGGCAGATGCCCAACGAGGTGATGGAAGAAATCAACAGGTACGTGAACGAAAACATCGGCAGCACTTTTTTGATTAACGATCTGGCAGACTTGGCGAAGCAAAATCGCTTCAACTTCCCGCGACTGTTCCGGCGCACCGTTGGGATCAGTCCGCATCAGTATGTCCTACGAATGAGGCTAAGACACGCTCGGGCGCTCATCCTTGCGGAATGCTCACTCGCAGAGGCGGCTATTGCTGCGGGATTTGTTGATCAAAGCCACATGTCGAGCTGGATGCGCCGGATCTACGGAATAACGCCCGGTATGTTATCCAAGGTTCCGCGTAAGTTATGACGATTGACGCATCTTCGCGCAGAGTAAAGCCTCACGGGCTTAGCTTATGCAGGCGCAAGAACTACGGCTTGGGCACTTCGATAACATCTTGCTCAAAAATCAGAGGCAGTGCATCCGACGGCCGATGGCTAGACAACTTTCTCATAATGAAACTCGGATGTAAGCGGCAAGGAGACCCCATCTGGCGGGTCTGATGCTGAGTAAGGTATTTCGGGCATACGAGTTCTTTGAATGTGAGGATCTATGTCTAGACCTGCGGCTAGCCTTGGGTTGATGCGAATGATCGAGGCGGTGCCCGATCGCCTCGACGGGTCGCCGCGACAGTTGCGGCGGTTCTGGTCCGACGAATTCAAGGCCACAGCGGTTGAGCAGGCCTGTCAGCCCGGTGTGAACATGTCGGCAGTGGCGCGGCAGATCGGTATCTTGCCGTCTCAACTCTACCGCTGGCGTAGAGAGCTGTTGGGCAGTGATGAGTCTGGACTAGCAGCACCAGTTGATCCGCAGAGCGTCGCACCTGACCCCGACCCCGTCGTCGAAATCGACATTGACGGTATCGTCGTGCGGGTCGGCCGGCATGTTGAAGAAGCGCATCTGCAGCGTGTCATCCGAGCGGTTCGCTCAGCATGATCCCCGCAGGTGTAAAGGTCTTCCTGGCCAGCCATCCCATCGACTTCCGCAAAGGGCCGGACAGCCTGCTGTCGCTGGTGCGGGATGCTGGCAATGATCCGTTCAACGGCTCGCTTTATGTCTTCCGAGCCAAACGGGCCGACCGGATCAAGATCGTCTGGTGGGATGGATCCGGGGTTTGCCTTTACGCCAAGCGGCTGGAGAAGGCGCAGTTTTGCTGGCCGCGGGTCGGCCACAACCGAGTGCAGCTCAACCATGCCCAGCTCATGGCGCTCGTTGACGGCATGGACTGGAAACGGGTCCGCTCGGTGGCGGTGAGGCCGCCAGAGATTGTTGGGTAAATGCCTGCGGCGAAGTGAATCAACCCCCTGAAAGGGCAGGAAAACCGGAGCAAAATATGCTCAGGTCAGCCTTATGACGCCGCCCGATCTGCAGCTCCCGGATGATGTAGAGACCCTGAAAGCTATGGTCCTTGCCATGGCCGAGAAGGCAGCGCGCACCGATGCTCTTGAGAGCGAGGTCGCAGACCTGAAAGCCAGAAACGCCGATGCTGATGAACGCATCGAACGACTGACCCAGATCCTGAAAGCCTTTGATCGTGCCCGCTTTGGCCGGCGATCAGAAAAGCTCGGCTCTGCAGGCATCGATGATGAGCAGGAGGCCTTTGTCTTCGAGGAAATCGAGACCGGCATCGCTGCGATCCGAGCCCAGGTCAACAAGGGCGCAACCCATCCCGACAGCAAACGTCCGCCGCGGCCGCGCAAAGGCTTCGCACCGCATCTGGAACGGGTCGAAGTGGTGATCGAGCCGGATGAACTGCCCGAACACGTCGGCAAACAGAAGGTGCTGATCGGAGAAGACGTCTCGGAGCGACTTGACGTCGTGCCAGCGAAGTTCCGCGTCATCGTCACACGCCGACCGAAGTATGCCTTCAAAAATGAAGACGGCGTCATTCAGGCCGCCGCACCCGCGCACATTATCGAGGCGGGCATTCCGACGGAAGGGCTTCTTGCCCAAATCGCCGTCTCGAAGTATGCCGATGGCCTTCCGCTCTATCGCCAGGAGGCAATCTATGCCCGCGACAAGGTCGAGCTTGACCGGAAGCTGATGGCTCAATGGATGGGTAAGCTCGGCTTTGAACTTGATATCCTCGCCGACTACATTCTCGCCGAGATCAAGAAGGCTGAGCGTATCTTCGCTGACGAGACGACGTTGCCCACACTCGCGCCCGGATCCGGATCAGCCAAGACGGCCTGGCTTTGGGCTTATGCCCGCGATGACAGACCGTTCGGTGGCAGTAGCCCGCCGATGGTCGCCTATCGCTTCGAAGATAGCCGTGCTGGCGATCGGGTCGCCCGGCATCTGAGTGGCTATCGCGGTATTCTGCAGGTTGACGGACATGGTGCCTATAACAAGCTTGCCAGATCTGACGGAGGCAATGACGGCGTGATGCTGGCCTACTGCTGGTCCCATAGAAGGCGCAAGTTCTACGAACTCCACGCCTCAGACAGCTCCAGGATCGCCACCGAGACGGTGGAACTGATGGCGAAGCTCTGGCAGGTGGAAGAAACGGCTCGCGGGCAAAGCCCTGACGCCCGTGTCGCCGCGCGCCAGGCGACATCTGCGGCGGTTGTCACGGAGCTCTTCGCCCTCTGGCAGAAGACCCTATCGCGGATCTCTGGCAAGTCGAAGCTGGCGGAGGCGATCCGCTATGCCACCTCGCGTTGCTCCATCTTCGAGCGCTTTCTAACCGACGGCCGCATCGAGCTCGATAACAACATAGTCGAGCGTGCAATCCGGCCCCAGACAATTACCAGAAAGAATAGCCTCTTCGCCGGCAGCGATGGCGGTGGAGGGACTTGGGCAACAATCGCCACGCTCCTTCAAACAGCGAAAATGAACAATGTCGATCCGCAGGCCTGGCTCACCCAGACACTCGAGCGCATAGCCAATGATTGGCCCAGCAGCGATCTCGATGCACTCATGCCGTGGAACTACACGCGCTGAACGGTCTCAGCTTGCCGCTTACACTCGGATCGTGAAGCTTTCCAGCAAGAAAGACTATCATTGACGCGCTCGAGCGGCCAGCGGCATCTCGCTCTTGGATTAGGCCGCAAATGCGTTCCATTTCGAAAGCCGGGGAGGCAGCTCTTGCGACAGTCGGGATGCCTCGATTGCGTCAAATGAATTATCATCCGCAATCTGCGCTAAGTCTGGTTGCCGATGACTACTGCCCAACCGGTCGCTGCATTGCTGCACGCCTTCTACTGTTCCCGGTTATCTCGATCTCGCTATCGAACACTTCGCTTTCTTGGAAGATGCATATCTAGAGCGGATATGTCGGAGCTTCGGCCTTTACGGTAACCCATTGCCATTGAGTAAACTCCTCCCAGTTTGCCGGACCGCCGATGCTGGTGCCGTTGCCTGAAGCGCCGATGCCGCCGAATGGGTTGATCACCTCGTCGTTCACGGTTGAGTCATTCACGTGTAGGATGCCAACGTTAAGCTTCTCAGCCAATGCCATCGCTCTTCCGACCGATCGCGAGATTATTGCTGCTGAGAGTCCATAGTTCGTATCGTTGGCAAGATTGACTACGTCGTCGTCGGTTACAAACGTGGTCACGACAGCGACGGGTGCAAAAATCTCCTTCTTGAACGCAGGATTATCCGGATCAACACTTCTCAAGACTGTAGGTTCGAAAAACAGCCCGCTGGAGCCTCCGCCGATTGCGACCTGAGCTCCCTCGGCCACAGCTTCGTTGACGACCGCGAAGGCGTGAGCAACTTGTGCTTCGCTGATCAGGGGCCCAAGAGCAACTTGATCAGTCCTTGGATCACCTACAGTCAGTCTCGACGCATGTTCACAAAGCTTTCTTGTGAACTCATCAGCAATAGCGGCTTGCACCAATATACGACCCGCTGACATGCAGATTTGCCCCTGATGTAAGAAGGTACTCCATACGGCGTTATTCACCGCGAGTTCAACATCCGCGTCGTCAAGGACGATGAAAGCGTTCTTACCACCCAACTCAAGTGATACCTTTTTGAGGTTGCGGCCTGCACGCTCTCCCACCTTGCGGCCAGCTGATGTCGAGCCGGTGAACTGGATCATCCCCACTTCAGGCGCGTCGCATAATGCTTCGCCGCTCGCTGCATCCCCGGGGAGTACGCTTAGAATCCCAGGTGGCAGGCCCGCCTCTTCGAAAATGCGGGCCAGTAGAAGACCGCCGCTGATCGAGGTTCGAAGGTCTGGTTTTAGCAGGACTGAATTGCCAACAGCCAATGCAGGAGCAAGTGCTCTCATAGCGAGATAAAGCGGAAAATTGAATGGCGATATCACGCCGACCAAGCCAATCGGGCGACGCCGAGCGATATTCAGTCGACCAGGCTGAGTTGGAAGAACCTGTCCCAAAGCCTGAGATGGCATGGCCGATGCCTCATGTAGAGCTTTGAGTGAGAGCGCCACTTCAACCTCGGCCTTCGGCCGCGTGGACGCGGATTCTTTCACCAGCCACCCGACAATTTCATTTTTGTTCGATTCAAGAACATTAGCTGCAGATCTAAATACTTCGGCGCGCTTCTCATAATGCGCGCTAGCCCAATCCGCCTGTGCAGCTTTCGCAGCCTTCGAGGCGTGATCGATATCGCTCGGTGATGCGAGGCCGATAAAGCCCAATTTTTCTCCATTTGCTGGCTCAAAGATTTGTTTGCGTTGCTCGGCCTTCTTCCACGTACCATCAAAGAAAGATCCTTCCAGATGTACCGTATCGAGAAGCGTTTTAGTCGCTGTAAAAGTCATCGTGTCCTCATAAGGTCAGTAATAAAAAAACACAGGCGATGTCCTCAGTCTCCGCTGCTCCGAGGCTGGTTCCCACCTCCAATAAGAGCGTGTCCGTCTCCTGTTAGCTTGGGCCACAAAAAGACTCTTGGATATTTGTGCACTTTTGGAACCGACGGGTTCGCAGGCGATCGGCCCACACGCGCCTCGCTCTTAAGCTACCAGTGGTCCTGGATGCCGACTGCGCGGACAATAAAATGGAACGCACGGTCAACGCACTCTACCATTGCCGTCTGTAATCTTTGGCATGATAGGAGGAAATAATGAGCTGGGACTATATCGTACTAGGAAGCGGGTCTGCCGGGTCTGTTGTCGCAAGCAGGCTCTCGGAGGACGGACGAACCCGTGTACTACTTATTGAGGCAGGTCCACGAGACAATTCGCCGTTTATTCGAATCCCTGCTGGGGAAGTGAAGGCGATTGCCAATCCGCGCTTCAACTGGCAGTACATGGCTGAACCAGACCCATCTCTCAACGACAGAGCGGTTATATGGCCCGCAGGTAAGGTTCTGGGCGGCTCAAGCTCTATAAATGGTATGGTTTACGTACGGGGCCAACGCGAGGATTTCGACGACTGGGCAAGACTGCTCGGAAATACAGGCGATTGGAGCTATGAGGATGTCCTTGACTACTTCAAGAAGATGGAAACGAATCCGTTTGGTGCGGGTGAATTCCATGGTGCCGATGGTCCGCTGAAGGTATCCAACGTCGCATCGCCTCATCCGCTTCGGGACGTATTCATCCGCGGCACTCAGGAACTCGGCATACCATTTAACGCCGATATAAACGGCGCGACGCAGGAAGGCGTTGGGCCTAACCAGGGTACGATCGATTTCGGACGTCGAAACAGCTCCGCTCGTGCCTATCTGCGTAAAGCACGGTCACGGAGCAATTTAAAAATACTCACTGGAGCTGTCGCTGATCGCATTGTGTTCGACGGCAAAAAAGCCGTCGGTGTGCGCTATTTTGTCGGAGCGCAAGCATTCCAAGAACGAGCAAATGCCGAGATCATCGTCTGCTGCGGAGCCCTTGGGTCGCCCGGCGTCCTTATGCGTTCCGGCATCGGAAACAAGCATCATCTGGAAGCAAACGGTATCGAGGTGATCAACGACCTACCGGGCGTTGGGCAGAACCTGCAGGAACACCCGCAAGTATGGGTAAGTGGATACGTAAACGTCAGCACCTACAACATGGAAACCTCCCCGAGTCACGTCGTACGCCACGGCCTCAATTGGCTTATGCGTGGGAAGGGTCCGGCAGCCAGTCCGATCTCCCACGCCGTGGCCTTCATCCGCACGCGTCCCGAAACGGAAAGCAGGCCAGATGTACAGCTTCATTTCGTACCGGTTGGATATGAGGTTACGGAAACGGGGATAACGCTGCTTGACCGCCCCGCCGTCACGATCGCGGCCTGCGTGCTTCGTCCGAAAAGTCGCTCGGAAATCCTCTTGCGTTCAGATAGCCCGTTTGATCCTCCTCGCTTGCAATCGCGCATGCTCTCGGATCCGGATGACATTGCCCGCCTTTCAGATGCATTTCGCATTTCGCAAAACATCCTGGAGAGCAACGCTTTCAAGCCCTACTACGAGGGAGCGTTCAAGCCGGCCCGCAAGCTTCAAACCGATGAGGAAATACTCGACTTCTTCAAGCAGTCAGCTGAAGGCAGCTACCATCCTGCCGGCACCTGCAAGATGGGAATTGGCGAGGATGCCGTTGTCAGCAAAGAACTCAAGGTGATCGGTGTCGAAGGCCTGCGCGTTGCTGACGCATCAATCATGCCTGTGATCACAAGCGGCAACACTAATGCTCCAAGCATCATGATCGGAGAGAAGGCCGCGGCGATGATCCTTCTTGAACGGAGCTCAGCACGCCGGACAACTGGCCGGCACTCTGACCGCTTTGCTGACCAATCGCAACATAAATCCACAACCACCACGACGCAGTAA
Protein sequences of DBSCAN-SWA_9 >CP036360|89708:98286|92118_92502_+|QBJ16489.1|DBSCAN-SWA MSRPAASLGLMRMIEAVPDRLDGSPRQLRRFWSDEFKATAVEQACQPGVNMSAVARQIGILPSQLYRWRRELLGSDESGLAAPVDPQSVAPDPDPVVEIDIDGIVVRVGRHVEEAHLQRVIRAVRSA >CP036360|89708:98286|92916_94509_+|QBJ16491.1|transposase|DBSCAN-SWA MTPPDLQLPDDVETLKAMVLAMAEKAARTDALESEVADLKARNADADERIERLTQILKAFDRARFGRRSEKLGSAGIDDEQEAFVFEEIETGIAAIRAQVNKGATHPDSKRPPRPRKGFAPHLERVEVVIEPDELPEHVGKQKVLIGEDVSERLDVVPAKFRVIVTRRPKYAFKNEDGVIQAAAPAHIIEAGIPTEGLLAQIAVSKYADGLPLYRQEAIYARDKVELDRKLMAQWMGKLGFELDILADYILAEIKKAERIFADETTLPTLAPGSGSAKTAWLWAYARDDRPFGGSSPPMVAYRFEDSRAGDRVARHLSGYRGILQVDGHGAYNKLARSDGGNDGVMLAYCWSHRRRKFYELHASDSSRIATETVELMAKLWQVEETARGQSPDARVAARQATSAAVVTELFALWQKTLSRISGKSKLAEAIRYATSRCSIFERFLTDGRIELDNNIVERAIRPQTITRKNSLFAGSDGGGGTWATIATLLQTAKMNNVDPQAWLTQTLERIANDWPSSDLDALMPWNYTR >CP036360|89708:98286|96603_98286_+|QBJ16493.1|holin|DBSCAN-SWA MSWDYIVLGSGSAGSVVASRLSEDGRTRVLLIEAGPRDNSPFIRIPAGEVKAIANPRFNWQYMAEPDPSLNDRAVIWPAGKVLGGSSSINGMVYVRGQREDFDDWARLLGNTGDWSYEDVLDYFKKMETNPFGAGEFHGADGPLKVSNVASPHPLRDVFIRGTQELGIPFNADINGATQEGVGPNQGTIDFGRRNSSARAYLRKARSRSNLKILTGAVADRIVFDGKKAVGVRYFVGAQAFQERANAEIIVCCGALGSPGVLMRSGIGNKHHLEANGIEVINDLPGVGQNLQEHPQVWVSGYVNVSTYNMETSPSHVVRHGLNWLMRGKGPAASPISHAVAFIRTRPETESRPDVQLHFVPVGYEVTETGITLLDRPAVTIAACVLRPKSRSEILLRSDSPFDPPRLQSRMLSDPDDIARLSDAFRISQNILESNAFKPYYEGAFKPARKLQTDEEILDFFKQSAEGSYHPAGTCKMGIGEDAVVSKELKVIGVEGLRVADASIMPVITSGNTNAPSIMIGEKAAAMILLERSSARRTTGRHSDRFADQSQHKSTTTTTQ >CP036360|89708:98286|92498_92843_+|QBJ16490.1|DBSCAN-SWA MIPAGVKVFLASHPIDFRKGPDSLLSLVRDAGNDPFNGSLYVFRAKRADRIKIVWWDGSGVCLYAKRLEKAQFCWPRVGHNRVQLNHAQLMALVDGMDWKRVRSVAVRPPEIVG >CP036360|89708:98286|91005_91875_+|QBJ16488.1|DBSCAN-SWA MPASTLGLHEPGSAKIDVDSAATATGEDNVSAFRMQIRCGAQVKGYSKQNLIMLNRHNAVPVDCRVGRETLNHIAHCGNITVIPEGVEFNGSFGGPAHLLVVSIPTQRLAVSAASSGLSFRSFGPQLSGFDAGIGIAVEDFSTWLNSGLTEGDGELIADDLVELVASSYCGSIEKPVPRGQMPNEVMEEINRYVNENIGSTFLINDLADLAKQNRFNFPRLFRRTVGISPHQYVLRMRLRHARALILAECSLAEAAIAAGFVDQSHMSSWMRRIYGITPGMLSKVPRKL >CP036360|89708:98286|89708_90551_+|QBJ16487.1|DBSCAN-SWA MTSSFVTTKDGVEIFYKDWGPKDAQPIVFHHGWPLSSDDWDAQMLFFLSKGYRVVAHDRRGHGRSTQVADGHDMDHYASDAFAVIEALDLKNAVHIGHSTGGGEVARYVAKHGQPAGRVAKAVLVSAVPPLMVKTDTNPEGLPLDVFDGFRSALAANRAQFFRDVPSGPFYGFNRDGATVHEGVIQNWWRQGMMGGAKAHYDGIKAFSETDQTEDLKAITVQTLVLHGEDDQIVPIADAALKSIKLLKTGKLKTYPGFSHGMLTINAETLNEDMLAFIRS >CP036360|89708:98286|94828_96301_-|QBJ16492.1|DBSCAN-SWA MTFTATKTLLDTVHLEGSFFDGTWKKAEQRKQIFEPANGEKLGFIGLASPSDIDHASKAAKAAQADWASAHYEKRAEVFRSAANVLESNKNEIVGWLVKESASTRPKAEVEVALSLKALHEASAMPSQALGQVLPTQPGRLNIARRRPIGLVGVISPFNFPLYLAMRALAPALAVGNSVLLKPDLRTSISGGLLLARIFEEAGLPPGILSVLPGDAASGEALCDAPEVGMIQFTGSTSAGRKVGERAGRNLKKVSLELGGKNAFIVLDDADVELAVNNAVWSTFLHQGQICMSAGRILVQAAIADEFTRKLCEHASRLTVGDPRTDQVALGPLISEAQVAHAFAVVNEAVAEGAQVAIGGGSSGLFFEPTVLRSVDPDNPAFKKEIFAPVAVVTTFVTDDDVVNLANDTNYGLSAAIISRSVGRAMALAEKLNVGILHVNDSTVNDEVINPFGGIGASGNGTSIGGPANWEEFTQWQWVTVKAEAPTYPL |
7 | Mycobacterium_phage(33.33%) | holin,transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
101824 : 103717
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >CP036360|101824:103717|DBSCAN-SWA AATGAGCGATATCGTACTGTCTGTACAGGGCCTGACGGTAGATCTCGTCGGCCGCCGTGAAAATCGAACTATTATATCTGACGTATCGTTCGAACTTAAAGAACGGCAAGTTCTTGGGATCATTGGCGAAAGCGGCTCTGGAAAGACAGTGCTGTCGCGCGCCGTTGCAAACTGGATCGAGCCGCCGCTGTATATCCGACGAGGAGAGGTCGTCTTCCGCAACAGGGATATTCTGAAACTTTCCGCCAAGGAGATGCAGAAAATTCGTGGCAGGGAAGTCGGTTACGTTGGTGCCAATCCCGGAAGCGCTCTTGATCCAACCGTTTCCATCGGCGCGCAAATCGTCGAAAAACTTATGTCCGTGATACCGGGCATCGGAAAGAGCGATGCCGAAGAGCGGGTCGTGAGGCTGCTGGAAGCCGTCCGAATGCCTTCAGCGCGACAACGCTTCCACGATTATCCCTTCCAGTTCAGCGGCGGCATGATGCAACGGGTGCTCATCGTTGATGCGCTCATTTCTAATCCGGCGTTCCTGATCGCCGACAATATTACCCAACCGCTGGACGTCACGGTAAGCGCTCAAATTCTGCGGATCCTTAAGGACTTGCAGACCGAGTTCAAGACCGCGATCCTGTTTGTCTCATCTTCGCTCGGGGTGATCCAAGATATCGCTGACGATGTCCTCGTCCTCGCCGGTGGAAAGGTTGTCGAAAAGCGCGATATACGCTCAATGCTTCGGGCGCCTGAGGCGGACTATACGAGAAAGCTTATTGCGAAGGTGCCTCAGATATGGACGGGCGAAACGCTGCCCCCTCTCCAGCCGGCAGCCGATAGCGACACGATCCTGGATGTCAGGAACATATCGAAGTCCTATCCAGTAAAAGATCGCCAAAGTCTATTTCAGACAAAGTCGGTACATGCGGTCAGAGAAGTTTCATTCGCGGTAAAACGAGGTGAAAATTTCGGAATCGTCGGCGAATCCGGATGCGGCAAATCCACTCTGTCGCGGCTTTTAAGTCGTCTTGAGTCTCCGAACGCAGGCCAGATACTCTTCAAGGGTGAGGACATTGCTCATATGTCGTCACGCGCGCTGCTGCACCTGCGGCGAGGCTTTCAGCTTCTGCTGCAGGATCCCTATAATGCCATACCAGGACACTTTCCTGTTGGTCGGACGGTTATGGAACCACTCATGATCCACGGCGGGCTTTCACGAAAACAGATCGAAGCCCGCGCGCGGGAAGCAATTCGGGAAGTAGGTCTTCCCCCATCTGTCTTCGACAATCTTCCGATCGGGCTTAGCGCTGGTCAACGGCAACGTGTGAACATCGCACGCGCGCTGGTGCTTGATCCTGAATTGATGATCCTTGATGAAACCCTATCGGCCCTCGATCAGGTCGAGCAAGGAAAACTGATCGATCTCTTTGAGACGCTCCAGCTAAAGCGGGGGATCTCATATATCTACATCTCCCACGACCTGGCAATGGTCAGAAGAGTATGCTCGCGTGTTGCGGTTATGTACCTCGGACGTGTCGTGGAGCTCGCGGAAAATGAGACGCTGTTCTTCGATTCGGGCCATCCCTATTCTCGCGCGCTTCTGAGCGCTGCTCCCGTCATCGAGGAGCGGCGTTACGAACCCGAAACCTATCTTCTCGATGGTGAACCGCCGGATCCGATCGATATAGCGCCGGGGTGTACATTCCGCACACGCTGCCCCTTTGCTTTCGAACGATGTAAGACCGACGATCCACGGCTCTATCGCAGGGACGCAAACAACCTCAGCGCCTGTCATCTGATCGACGATACAGATCCGAGAGTGCCGCAAAATGCCATGTTCAAGCGCGTAATCTCAACGGATACGAAGCTGGACCGTCTGACGAACTCCGCAAAGACTTGA
Protein sequences of DBSCAN-SWA_10 >CP036360|101824:103717|101824_103717_+|QBJ16497.1|DBSCAN-SWA MSDIVLSVQGLTVDLVGRRENRTIISDVSFELKERQVLGIIGESGSGKTVLSRAVANWIEPPLYIRRGEVVFRNRDILKLSAKEMQKIRGREVGYVGANPGSALDPTVSIGAQIVEKLMSVIPGIGKSDAEERVVRLLEAVRMPSARQRFHDYPFQFSGGMMQRVLIVDALISNPAFLIADNITQPLDVTVSAQILRILKDLQTEFKTAILFVSSSLGVIQDIADDVLVLAGGKVVEKRDIRSMLRAPEADYTRKLIAKVPQIWTGETLPPLQPAADSDTILDVRNISKSYPVKDRQSLFQTKSVHAVREVSFAVKRGENFGIVGESGCGKSTLSRLLSRLESPNAGQILFKGEDIAHMSSRALLHLRRGFQLLLQDPYNAIPGHFPVGRTVMEPLMIHGGLSRKQIEARAREAIREVGLPPSVFDNLPIGLSAGQRQRVNIARALVLDPELMILDETLSALDQVEQGKLIDLFETLQLKRGISYIYISHDLAMVRRVCSRVAVMYLGRVVELAENETLFFDSGHPYSRALLSAAPVIEERRYEPETYLLDGEPPDPIDIAPGCTFRTRCPFAFERCKTDDPRLYRRDANNLSACHLIDDTDPRVPQNAMFKRVISTDTKLDRLTNSAKT |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_11 |
112762 : 113221
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >CP036360|112762:113221|DBSCAN-SWA TATGCTTATCGTCAATGGTCAAGCACATCATCTCATCCTCAGCATGACGACGACGTTGGCAGACGCGCTGCGTGAGCAGGTTGGCCTGACCGGAACGAAGGTCGGGTGCAACCATGGCCAATGCGGCGCTTGTACAGTCCACGTGGAGGGACGGCGCGTACTTGCGTGTCTCACCCTTGCAGCAAACTGCCAGGGCAAGGAAATTAATACGATCGAGGGCCTTTCTGGTGATAACGGAAGTCTTCATCCCATGCAGCAAGCGTTCATAGATCAAGACGCATTCCAATGCGGGTACTGCACTCCAGGCCAAATCATGTCAGCTGTGGCTCTTGTCGAAGAAGGGAGGTTGGATTCCGTTGACCAGATACGAGAATACATGAGCGGCAATCTGTGTCGGTGTGGAGCATATCCCAAAATCCTAGCTGCGGTGCTCCAGGGTGCCAGCGCGATGAAAGGATAA
Protein sequences of DBSCAN-SWA_11 >CP036360|112762:113221|112762_113221_+|QBJ16505.1|DBSCAN-SWA MLIVNGQAHHLILSMTTTLADALREQVGLTGTKVGCNHGQCGACTVHVEGRRVLACLTLAANCQGKEINTIEGLSGDNGSLHPMQQAFIDQDAFQCGYCTPGQIMSAVALVEEGRLDSVDQIREYMSGNLCRCGAYPKILAAVLQGASAMKG |
1 | Acinetobacter_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_12 |
118389 : 120644
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >CP036360|118389:120644|DBSCAN-SWA ATCAGAGTTGTGCCCAATCAAGAACGACCTTACCGCTGTTCCCGCTGGCCATGGTGTTGAACCCTGTCTTGTAGTCACTCACCGGCAGCCGGTGGGTGATGACCTTGCGGATATCAAGCCCACTTTCCAGCATCGCCAGCATCTTGTGCCAGGTCTCAAACATTTCGCGACCGTAGATACCCTTGAGGGTCAGCATCTTGAAGACAACCTTTGTCCAGTCGACTAAAGCCGGCCTTGCAGGAATTCCAAGCATGGCGATTTTTCCGCCCATTACGAGCTGCTCTACCATCTGATCGAAAGCGGCAGGTGCTCCGCTCATTTCAAGACCGACATCGAAACCTTCCTTCATGCCAAGGCGAGCCTTGACCGAATTCAGATCCTCGCTGCTGACGTCGACGGGAACGATATCTGCAACTTCGGCGGCCAGAGCGAGCCGCTGAGGGTTGATATCGGTGATAACGACATGGCGCGCACCCACATGTCGGGCAACCGCAGCGGCCATGATACCGATCGGGCCGGCGCCGGTCACGAGAACGTCTTCACCAACCAAGTCGAACGAGAGGGCTGTATGAACTGCATTGCCGAGGGGATCGAGAATCGCGCCCAGATCGTCATCGATCGAATCTGGCAGCGGCACGACGTTGAAGGCGGGCAGGCGAAGATATTCTGCAAATGCGCCCGGCACGTTGACGCCCACGCCTTTGGTTTCGGGGTCGAGATGGAAACGCCCGCCCCGCGCTGCGCGGCTGTTCATGCCGATGACATGGCCTTCGCCCGATACACGCTGTCCGATGGCAAGATGACGGACATTCGCGCCGACTTCCGCGATCTCACCCGCATATTCGTGGCCGACCACCATGGGAACTGGGACTGTGCGCTGTGCCCAGTCGTCCCAGGAGTATATGTGGATGTCAGTTCCGCATATCCCGGTCTTGTTGATTTTGATCAGAACATCGTCCGGACCGATTTGCGGAATGGGACGTTCGTCCATCCAGAGTCCAGGCTTCGCTTCCGCCTTTATGAGCGCTTTCATCAGAGCACCTCTTTGGAGCATCAAACCACACCGAGCGCCTTGCCGGCGTCGGTAAAGGCTGCAATCGCGAAATCGATGTTACCATCATCGAGGGCGGCAGACATCTGGGTTCTGATTCGCGCCTTGCCTTTGGGAACGACCGGATAGAAAAAACCGGCGACGTAAACGCCCCGCTCGTTGAGCGCGCGTGCCATGTCCTGCGCCAGCCGCGCGTCATGCAGCATGACCGGAATGATCGGGGTCTGCCCCGGCAGCAATTCGAAACCGGCATTTTGCAGCCCGGAACGAAAACGCTCGGTATGCCGAGCCAGAACGCGGCGACGGTCGTCGGCTTTCTCGGCAATCTCGATGGCCTTGATCGACCCCGCGGCCACTGCCGGCGCGAGACTGTTGGAGAAGAGATAGGGCCGCGCACGCTGACGCAGGAGATCGATGATCGGCTTCGAAGCCGCCACGAAACCGCCCATAGCGCCGCCAAGGCTCTTGCCCAGCGTGCCGGTGACGATATCGACGCGCTGGTTGACACCTGTCAGTGCCGGCGTCCCGCGTCCGGCATCGCCGATATGCCCAGTAGCATGGCAGTCATCGACCATGACAAGAGCGCCGTACCGCTCCGCGAGATCACAGATGGCGGACAGGTTTGCGACATAACCGTCCATGGAGAACACGCCATCGGTGGCGATCAACTTGAACCGCGCGCCATCGCGTTCCGCTTCTTTCAATCGATCTTCCAGCTCATCCATTTCACCATTGGCGAACCGATAGCGTTTTGCCTTGCAAAGCCGCACGCCATCGATAATCGATGCGTGATTGAGGCTATCGGAAATGATGGCGTCGTCGGCCCCCAGCAAAGGCTCAAAAACACCGCCATTTGCGTCGAAGCAGGCAGCGAACAGGATTGCATCGTCCTTCCCCAGATATCGGGCTATCGTATGTTCAAGCTCCCGGTGGAGTGTCTGCGTGCCGCAGATAAAACGGACGGACGCCATTCCGAATCCGAACTCGTCTAGCCCACGCTTGGCAGCGGAAATGATCTCGTGATTGTTGGCAAGTCCGAGATAGTTATTTGCGCAGAGGTTGACGACCCGGCGGTTCTGCTTGCTGTCACAGATCAATATTTCGCCGTCTTGCTGGCTGGCGATCACCCGCTCGCTCTTGTAGAGACCGTCGGTCTCGATCTCTGTTAGAACGGTCCTAAGATGCTCGTCGAAACTGTTTTTCAT
Protein sequences of DBSCAN-SWA_12 >CP036360|118389:120644|118389_119421_-|QBJ16510.1|DBSCAN-SWA MKALIKAEAKPGLWMDERPIPQIGPDDVLIKINKTGICGTDIHIYSWDDWAQRTVPVPMVVGHEYAGEIAEVGANVRHLAIGQRVSGEGHVIGMNSRAARGGRFHLDPETKGVGVNVPGAFAEYLRLPAFNVVPLPDSIDDDLGAILDPLGNAVHTALSFDLVGEDVLVTGAGPIGIMAAAVARHVGARHVVITDINPQRLALAAEVADIVPVDVSSEDLNSVKARLGMKEGFDVGLEMSGAPAAFDQMVEQLVMGGKIAMLGIPARPALVDWTKVVFKMLTLKGIYGREMFETWHKMLAMLESGLDIRKVITHRLPVSDYKTGFNTMASGNSGKVVLDWAQL >CP036360|118389:120644|119441_120644_-|QBJ16511.1|DBSCAN-SWA MKNSFDEHLRTVLTEIETDGLYKSERVIASQQDGEILICDSKQNRRVVNLCANNYLGLANNHEIISAAKRGLDEFGFGMASVRFICGTQTLHRELEHTIARYLGKDDAILFAACFDANGGVFEPLLGADDAIISDSLNHASIIDGVRLCKAKRYRFANGEMDELEDRLKEAERDGARFKLIATDGVFSMDGYVANLSAICDLAERYGALVMVDDCHATGHIGDAGRGTPALTGVNQRVDIVTGTLGKSLGGAMGGFVAASKPIIDLLRQRARPYLFSNSLAPAVAAGSIKAIEIAEKADDRRRVLARHTERFRSGLQNAGFELLPGQTPIIPVMLHDARLAQDMARALNERGVYVAGFFYPVVPKGKARIRTQMSAALDDGNIDFAIAAFTDAGKALGVV |
2 | Vibrio_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_13 |
124109 : 124601
Sequences of DBSCAN-SWA_13
Nucleotide sequences of DBSCAN-SWA_13 >CP036360|124109:124601|DBSCAN-SWA GCTACTCGCGTCCATCGTCACCACCTGGAAGCTTAACGGCGTCAGTCCGCAATCCCTTATCAGTCAAACCCTCTAAGCCACACTCAACGGTCATCCGCAGTCCCGCATCCGGCTTCCAATCGGCTGGCGCCAGCTCAAAATTCGAAAAATTGAATCCCGGAGCCACCGTGCAGCCAGCAAGAGTAAACTCTCCAAGCGGAGTGGCGGACTGCCACATTCCTCTGGGAATTATGACCTGCGGTCGCTGCAGGCGGTGCAAGTCCGGCCCTAGTATCACGTGGGAGATTGCCCCACCTTCCTGCCACAGGCTTATCTCAAGCGGAGACCCCATGTAAAAGTGCCACACCTCAGAAGCGTTTGTCAGGCGGTGCCAATGCGACCGCTGACCTTCTTCCAGCAGGAAATAGATGGCCGTTGAGTGGACCCTCTGCTCAACGTCATCGTTGTCGCGAAAGGTCTCGGCATACCAGCCACCTTCGGGGTGAGGCTGCAT
Protein sequences of DBSCAN-SWA_13 >CP036360|124109:124601|124109_124601_-|QBJ16845.1|DBSCAN-SWA MQPHPEGGWYAETFRDNDDVEQRVHSTAIYFLLEEGQRSHWHRLTNASEVWHFYMGSPLEISLWQEGGAISHVILGPDLHRLQRPQVIIPRGMWQSATPLGEFTLAGCTVAPGFNFSNFELAPADWKPDAGLRMTVECGLEGLTDKGLRTDAVKLPGGDDGRE |
1 | Pandoravirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_14 |
129030 : 130011
Sequences of DBSCAN-SWA_14
Nucleotide sequences of DBSCAN-SWA_14 >CP036360|129030:130011|DBSCAN-SWA GTCAGGCCCTGATAAAGGCGAGCAGGTCCGGATTGATGACGTCGGCGTGGGTGGTCAGCATGCCGTGCGAGTAGCCCTTGTAAACCTTCAGCGTACCATTCTTCAGCAGCTTCACGGACAGCTCGGCGGAGTTGGCGATCGGAACGATCTGATCGTCATCACCATGCACGACGAGGGTCGGAACAGTGATCTTCTTGAGATCTTCGGTCTGGTCGGTTTCGGAGAACGCCTTGATACCCTCATAATGGGCTTTGGCGCTGCCCATCATGCCCTGGCGCCACCAATTCTGAATCACGCCCTGATAAACGGTCGCGCCCTCCCGGTTGAAGCCGTAGAACGGGCCGGTGGGCAAGTCGAGGAAGAACTGAGCACGGTTTTCGGCGACGCCTTTGCGGATACCGTCGAACGCTTCCATCGGCAGTCCGCCCGGATTGGACGCGGTCTTCAGCATCAGCGGGGGAATGGCGCTGACGAGTACGGCCTTGGCGACACGACCCTGTGGCTCACCGAACTTGGCGACATAGCGAGCGACTTCCCCGCCCCCGGTCGAGTGGCCGATATGAACGGCGTTCCTGAGATTCAGATGCTCGGCGACAGCGGAGGCGTCGGCGGCGTAATGATCCATATCGTGGCCCTCGCTGACCTGCGAGGAGCGGCCATGACCGCGGCGATCGTGGGCCACGACACGATAGCCATGCTGGACGAAGAACAGCATCTGAGCGTCCCAATCGTCCGAAGACAGCGGCCAGCCATGATGGAAAACGATCGGCTGCGCGTCCTTCGGACCCCAATCCTTGTAAAAGATCTGTACGCCATCCTTGGTTTCGACGAAGGCGGTGGTCATTGGCGGCATTCCTTTCTTGTGTGCGGGAGTGGAGGCGGAAAAGGTAAGGGATGGCATGGCCATTGCAGCGGAGAGTGTTGCACCGGCCAGCAGTGTTTCCCTGCGGGAAAACATCATATCGAAGGGGTTGTCAGACAT
Protein sequences of DBSCAN-SWA_14 >CP036360|129030:130011|129030_130011_-|QBJ16520.1|DBSCAN-SWA MSDNPFDMMFSRRETLLAGATLSAAMAMPSLTFSASTPAHKKGMPPMTTAFVETKDGVQIFYKDWGPKDAQPIVFHHGWPLSSDDWDAQMLFFVQHGYRVVAHDRRGHGRSSQVSEGHDMDHYAADASAVAEHLNLRNAVHIGHSTGGGEVARYVAKFGEPQGRVAKAVLVSAIPPLMLKTASNPGGLPMEAFDGIRKGVAENRAQFFLDLPTGPFYGFNREGATVYQGVIQNWWRQGMMGSAKAHYEGIKAFSETDQTEDLKKITVPTLVVHGDDDQIVPIANSAELSVKLLKNGTLKVYKGYSHGMLTTHADVINPDLLAFIRA |
1 | Mycobacterium_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_15 |
134159 : 137135
Sequences of DBSCAN-SWA_15
Nucleotide sequences of DBSCAN-SWA_15 >CP036360|134159:137135|DBSCAN-SWA GATGGCAAAGCGCAAGCTTCTTAAAGATCAAGATCGCCGGAAGCTTGTCGACATACCAGTCGACGAGGATAACCTAATCCGACACTATTCGTTGTCATTGGCGGATCGCCTGGAGATTGAACTGCGGAGACGCAATCACAATCGGCTCGGCTTTGCCATCCAGCTATGTCTAATGCGATACCCGGGTCGAGTGCTTGGAGCAGAAGAAACTCCGCCTCGCGCCATGCTGAAGTATGTTGCTGATCAGATCGGCGCCGCTCCAGACGAATTTGCCCTCTATGCACGCCGCGAAGAAACCCGTCGGGATCATATGGCGCGGCTAATGGTGTATCTGGATACGAGAAGCGCGACGCTGCAAGATCGCCGCGCCGCGCTATTGGCGGCAATTCAAGCGGCGACCATGTCCGACGACGGTGCTGCGATAGCCAGTTCGACTGTCGCCGCGTTTCGCGAACGCGGCGCTCTTCTGCCGGCGATCGACACGATCGAACGCATCGGCCTTGCCGGCCGTGCCATAGCCCGGCGGCGAGCAGAAAGAACCCTGATTGAAGACATCCCACTCGATAGGCTTCAATCACTGGATAGGCTGTTAGAGGTTGACCCGTCGATCGGCCAGACCCGCTTTCATTGGCTGCGTTCGGCACCAGAAGCGCCGGCCGCGTCGAACCTCGTCGGGCTGACCGAGCGGATAGCCTTTCTGCGGCGGCTGGAGATCGATCCGGAACTGCAGGCGCGCGTGTCATCCGGACGGTGGGACCAGATGATCCGGGAGGGCAACGCAACGCCGGCGTGGCTTGCCAACGACTTCAACGCCAGCCGTCGCCACGCTCTGATCGTGGCCCAAGTCATCAAGCTTTGCCAGAAGCTCACGGACGATGCGGTGACGATGTTCATCAAGCTGATGGGTAAATTGTTCTCGCAAGCCAACAATCGAAAGAAGCAGCGACAGATGGACTGCAGAGCGAATACCGCCAAAGCGCTGCGCATGTTTCTAGATACGATCACCGCACTGCAGTCCGCCAACGACTATGGTCGGAACGCATTGGATGTTCTCGACCAGAAAGTCGGTTGGGACCGGCTGCTTCGGATGAAGCCTGAGCTTGAGTCGATGGTCGACGACAACGAGGCACCGCCGTTGACCGTGGCAGCCGAACAATACGCGACCGTCCACAAATACATTGGTGCATTTCTTCAGGCCTTCACGTTTCGCTCGGCGCATCGCCACGACCCGCTTCTGGCGGCAATTGCGCTGCTGAAACGGCTCTATGCCGAGAAGCGGCGGACACTCCCTGATCGCGTGCCGCTCACCCATCTCAGCCAAACTGATCGACGGCTTATCTTCGAACAAGGGAAACCTGATCGCCGTCTCTACGAGATTGCCACGCTAGCGGCTTTGCGAGACCGGCTTAGATCTGCGGACATTTGGGTCGACGGCAGCCGATCTTTCCGGCCGATTGACGAGGATCTGATGCCGCGGTCGACATTCATCACGATGAAGGAAGAAGATCGTCTCGGTTTGGGGGTCCAGGGCGACGGCGCGCAGTGGCTTGCTGAAGCGTGCCAGATGCTCGAGTTCAACCTGCAGCGTCTTGCGCACAGAGCACGATCCGGAAAGCTCGAAGGAGTTCGTCTTGAGGCCGGAACGTTGATCGTCACACCAACCGTCGGCGAAGTCCCCGTGGCAGCAGAGGAACTGAATGCCGAGATCAGCGATATGTATCCGTTGGTCGAGGTTCCCGACTTGTTAAGGGAAGTGCATGAATGGACCGGCTTTGCGGATCACTTCACGCATGTTCGCACCGGCGACATCCCGAAAAATGTCTCTGCCATGCTCGCTGGTGTACTGGCCGATGCGACGAACCTCGGCCCAAAGCGAATGGCGGGCGCATCCAAGGGGATCAGCGCTCACCAGATTGGGTGGATGCGCACATTCCATGCCCGATCGGAGACCTACCGCGCGGCGCAAGCGTGCATCACCGATGTCCACACCCAGCATCCGCATTCCCGCCTTTGGGGCAATGGCACAACGTCATCGTCAGATGGCCAATTCTTTCGTGCGAGCGACCGAGCCGCAAAGCGCGGCGACATCAATTTGCACTACGGCAGCGAACCCGGGACGATATTCTATAGCCATCTATCAGATCAGTACGGCTACTTCGGCATTTTGCCCATCAGTCCGACCGAAAGCGAGGCGGCCTATGTGCTCGATGGGCTCTTCGACCAGGACACGGTGCTCGACATTCAGGAACACTTTACCGACACGGGCGGTGCCAGTGATCATATCTTCGGGCTGTTCGCCTTGATCGGAAAGCGGTTCTCACCGCGACTGCGCAATCTCAAAGACCGGAAGTTCCACACGTTCGAGAAGAGCGATGCATATCCAACCCTGTCGAACCATATCGGGGCGCCGATCAACACCGCCCCGATTCTCGACCATTGGGACGATCTGCTCCATCTCGCGGCGTCGATCACGACGCGGTCCGTGGTACCGTCTACGATCCTGAAGAAGCTGTCCGCATCTCCGAAGCAAAGCCATCTCGCGAGGGCGCTCCGCGAACTCGGTCGTATCGAAAGGTCGCTCTTCATGATCGAATGGTACTCAAGCCCGGCATTGCGGCGGAGAAGCCAAGCCGGTCTCAACAAGGGAGAGGCCGCCCATAAGCTGAAACGCGCTGTGTTCTTCCACGAGCGAGGCGAGATCCGTGACCGGTCGTTCGAAAGCCAGGCATTCCGCGCCTCGGGGCTCAATCTCGTCGTCAGTGCGATCGTGCACTGGAATACGGTTTATCTTGATCGCGCGGTCACACAGCTGAAACGAGCGGGGCGAGACATTCCTGATACTCTATTGAAACACATCTCGCCGCTCAGTTGGGAACATATCAACCTTACCGGCATCTACACATGGGATGCCGAACATCAGATGCCGAACGGATTTCGATCGCTCCGCCTTCCGGCTAGGCTAAGGAACGCCGCGTAA
Protein sequences of DBSCAN-SWA_15 >CP036360|134159:137135|134159_137135_+|QBJ16525.1|transposase|DBSCAN-SWA MAKRKLLKDQDRRKLVDIPVDEDNLIRHYSLSLADRLEIELRRRNHNRLGFAIQLCLMRYPGRVLGAEETPPRAMLKYVADQIGAAPDEFALYARREETRRDHMARLMVYLDTRSATLQDRRAALLAAIQAATMSDDGAAIASSTVAAFRERGALLPAIDTIERIGLAGRAIARRRAERTLIEDIPLDRLQSLDRLLEVDPSIGQTRFHWLRSAPEAPAASNLVGLTERIAFLRRLEIDPELQARVSSGRWDQMIREGNATPAWLANDFNASRRHALIVAQVIKLCQKLTDDAVTMFIKLMGKLFSQANNRKKQRQMDCRANTAKALRMFLDTITALQSANDYGRNALDVLDQKVGWDRLLRMKPELESMVDDNEAPPLTVAAEQYATVHKYIGAFLQAFTFRSAHRHDPLLAAIALLKRLYAEKRRTLPDRVPLTHLSQTDRRLIFEQGKPDRRLYEIATLAALRDRLRSADIWVDGSRSFRPIDEDLMPRSTFITMKEEDRLGLGVQGDGAQWLAEACQMLEFNLQRLAHRARSGKLEGVRLEAGTLIVTPTVGEVPVAAEELNAEISDMYPLVEVPDLLREVHEWTGFADHFTHVRTGDIPKNVSAMLAGVLADATNLGPKRMAGASKGISAHQIGWMRTFHARSETYRAAQACITDVHTQHPHSRLWGNGTTSSSDGQFFRASDRAAKRGDINLHYGSEPGTIFYSHLSDQYGYFGILPISPTESEAAYVLDGLFDQDTVLDIQEHFTDTGGASDHIFGLFALIGKRFSPRLRNLKDRKFHTFEKSDAYPTLSNHIGAPINTAPILDHWDDLLHLAASITTRSVVPSTILKKLSASPKQSHLARALRELGRIERSLFMIEWYSSPALRRRSQAGLNKGEAAHKLKRAVFFHERGEIRDRSFESQAFRASGLNLVVSAIVHWNTVYLDRAVTQLKRAGRDIPDTLLKHISPLSWEHINLTGIYTWDAEHQMPNGFRSLRLPARLRNAA |
1 | Salmonella_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_16 |
158644 : 161777
Sequences of DBSCAN-SWA_16
Nucleotide sequences of DBSCAN-SWA_16 >CP036360|158644:161777|DBSCAN-SWA CTTATTTCTGGATGAATGCCAGCATGTCCGCGTTCAGGACATCAGAGTTGGTAGCAAGCATACCGTGTGGGTAACCGGGATAGACCTTCAGGGTACTATTCTTGAGCAACTTCACGGCAAGTTCTGCAGACGCCTTGACGGGGACGATCTGGTCGTCGTCGCCATGCAGGACGAGCGTCGGCACTGAAATCTTCTTCAGGTCCTCAGTCTGGTCGGTTTCCGAGAATACCTTGATTCCCTCATAGTGGGCCTTGGCGCTCCCAATCATGCCCTGCCGCCACCAATTCTGGATCAGGCCCTGCGAGATCTTGGCGCCTGGTCGGTTAAAGCCGAAGAAGGGCCCTGACGGAAGATCGAGATAAAACTGGGCCCGATTGTCGGCCAATGCCTTTCGTAGGCCATCAAAGGCTTCAATAGGTGTACCATCGGGGTTAGATGGTGTTCTCAGCATCAGCGGTGGAATTGCACTAACGAGAATAGCCTTCGCCACACGGCCTTGCGGCTCACCGAAGTCGGCGACATAGCGAGCTGTCTCGCCGCCGCCCGTCGAGTGGCCGATGTGGACAGCATTCTTGATGTCTAGATGCTCGTAGACTGCCGAGGCGTCGGCGGCATAGTGGTCCATGTCATGACCATCGCTTACCTGCTGTGAACGACCGTGCCCACGTCTATCATGAGCGATCACCCGATAGCCCTTCGAGAGGAAGAACAGCATCTGGTTGTCCCAGTCGTCGGACGACAGCGGCCAGCCATGATGGAACACGATCGGTTGGGCGTCTTTCGGACCCCAATCTTTGTAAAAGAGGTTTACCCCGTCTTTGGTGGTGATGAATGCTTCTGTCATTGCCGTTTCTCCTTCTGGGGTGACTGGTACGTTGCCTGTTGCAGCGAAGGCTGGGATTCCAGAGGCCAGCAGCAGTGCTGAGCCTCCGAGCAAGATGTCTCGCCGCGAGACGTCAAATGGGTGATCTTTACTCATGTTAGTCCTCATGCTGATGGCGTGGCGTTCACTTGCTTCGAAGATCAGAAAATAATCGAAGCGTGCTCGGTTTTAGACGCCCTTGTAGACGAGGCGTGTGAAGAAGGCGGGGCTGATGATCGCCTTGCCCCATTGCTCGTCAAAAGCCTCGGTCGGGTTGGCTGCGACCGCCTCCTCGACAGAGCGGCCGGCTGATTTCAGCCTTGCGACGTTGTCCCGGATGGTAGCTAGCATCTCGTGATAATGCTTCAGCTCGTCCAGCGACCCAAGCGGCTTGCCGTGGCCTGGAATGATGACGGTATCCGCATCCACTGCAGCGATGACCTTTGCTGAAGCTGCGATCGTTCCGTCTATGCTCCCGCCAGTCGAATAATCGATGAATGGGTAAAGGCCGTTCCAGAAGATATCCCCGACGTGAATGACATTTGCTTCCTCAAAATAAACGGAGATGTCGCTGTCCGTGTGGGCGGGCTGATAGGCTGTGATGGCAATCTTTGTGCCTTCGAGCTCCAGGACCTTGTCACTCGTGATCAGGGTCGCTGGGATTGCGCCGGAGGGAGAAGGCGGAAAATCGAAGTTCCAATCATCAACGCGCTGGGCCTCAAGCAGGTGACGGCGGGTGTTCTCATGGGCAATGATTGCGGCGCCTACCGAGTGCAGCCACTCGTTTCCATCTGTATGATCGAAGTGCCAATGCGTGTTGATGAGATGCGTAACGGGCTGGTCGCCCAGCTTGGAAAGGGCCTGCTCGATCCGAGCACGCGTTGCGGTGATACCGGCGTCTACGAGAACCTTACGTTGTTTGCCGGTCAAGACGGCAATATTGCCGCCGGAGCCTTCCAGCACGGCAATATTTCCACGGGTCGTATATGTGGTGATGTCGGCATGGGCTGCCGCTTCCCTGAGCGAATCCACGATCGTTCCGGATTTCGCCAGCGAATTGGATGTCAGAAAAGGCGCAACGGCGAAGGTCGCGAGCGCGCCCGTGAGGAGAGATCGGCGATTAACGGGTTGTGAATTGGCTACGGGGTTCACGGCGAGGTCCTCCTTGGTTCAAACGATGTCCAAACCATACCTCCCGGAGCGCATTCGATAATTGGCAAAGCGCGTAAGTCTTTATTCCGCTTTGCGTAACAATTTCCTGCTCGGCCACGCGTCGGTAATCAGTGCTTCGGTGGCAGGAGGAGACTGGCCGCTACAATCAACGAGGCCCTTCTCAAGAGCCTCTACGCCATTGGCCAGGGGTCACACCCGTCGACTTTGTGAAGACGCGGGTGAAATGGCTTTGATCGGAAAATCCGCAGGAGTAAGCTATCTCGGCGAGCGAAAGCGCCGTTTCTTTTAAAAGACGCTTGGCTGCTTCGACCCTGTAGGCTCGTAGCCACCTAAAGGGCGGGTTTCCGGTTGTCGTTTTAAACGCGTTGCCAAAGACAGAGACAGGCATGTCCAGACTTGCTGCTATATCAGCTAGAGAAGGCTCGACAGTCAGATCTGCTGTCAGCAAATCCTTTGCGAGACGCTCCTGTCTCATACTCAGGCGACGGAACTTCACGTCAGACCGTGGAGCAGCGGAGTAGCGCTCTACAAGGTGACTATGGAGCGCTAGAATCAGATGGTCAAACAACAGACTTTCGACGCCTTCATTATTTGCAAGGGCCGACAGAAGAGCTTGGCCAAGAAGTTCCACAAATGGGTCGACGTGTGAATTCTTTTCGAGAAGGTATGAAAATTCCGGGACGCGGTCATCGTTAGATAGCTCACGAAGGGCCATCTCGGGTAAATGAATCTGTATAGAGGAAAATGGGTTAGTGAAGAGGCCTTGCGTCTCTTCTTCGAGGTGGACCAGGTTCATCGCCCCCGCAGGATAGTCATCCGAATTGACGAAACGACCCCATTTCCATTGCTCGTGGCCGGTCATCGGTGTCAGCCACAACGAAAGAAGAAGCGCTTTTTCACTTGGCAGCGGTGTGGTTAGTTCGCGCATCGCGTGAACCACCTCAAGATGTGTGAGTGTGATGGCGGACGATTTTATCGTGACGGTGTGGCGTGTCGGTGCAGCAACTAAGCCAAAGACGTTTCCGATAACCAGCCCGATCGGCGATCTGGAAGAAGCAATGAGCTCGCGTATCTCCTCGGGGCTATCTTGCTGATCCAT
Protein sequences of DBSCAN-SWA_16 >CP036360|158644:161777|158644_159487_-|QBJ16545.1|DBSCAN-SWA MTEAFITTKDGVNLFYKDWGPKDAQPIVFHHGWPLSSDDWDNQMLFFLSKGYRVIAHDRRGHGRSQQVSDGHDMDHYAADASAVYEHLDIKNAVHIGHSTGGGETARYVADFGEPQGRVAKAILVSAIPPLMLRTPSNPDGTPIEAFDGLRKALADNRAQFYLDLPSGPFFGFNRPGAKISQGLIQNWWRQGMIGSAKAHYEGIKVFSETDQTEDLKKISVPTLVLHGDDDQIVPVKASAELAVKLLKNSTLKVYPGYPHGMLATNSDVLNADMLAFIQK >CP036360|158644:161777|159694_160657_-|QBJ16546.1|DBSCAN-SWA MNPVANSQPVNRRSLLTGALATFAVAPFLTSNSLAKSGTIVDSLREAAAHADITTYTTRGNIAVLEGSGGNIAVLTGKQRKVLVDAGITATRARIEQALSKLGDQPVTHLINTHWHFDHTDGNEWLHSVGAAIIAHENTRRHLLEAQRVDDWNFDFPPSPSGAIPATLITSDKVLELEGTKIAITAYQPAHTDSDISVYFEEANVIHVGDIFWNGLYPFIDYSTGGSIDGTIAASAKVIAAVDADTVIIPGHGKPLGSLDELKHYHEMLATIRDNVARLKSAGRSVEEAVAANPTEAFDEQWGKAIISPAFFTRLVYKGV >CP036360|158644:161777|160838_161777_-|QBJ16547.1|DBSCAN-SWA MDQQDSPEEIRELIASSRSPIGLVIGNVFGLVAAPTRHTVTIKSSAITLTHLEVVHAMRELTTPLPSEKALLLSLWLTPMTGHEQWKWGRFVNSDDYPAGAMNLVHLEEETQGLFTNPFSSIQIHLPEMALRELSNDDRVPEFSYLLEKNSHVDPFVELLGQALLSALANNEGVESLLFDHLILALHSHLVERYSAAPRSDVKFRRLSMRQERLAKDLLTADLTVEPSLADIAASLDMPVSVFGNAFKTTTGNPPFRWLRAYRVEAAKRLLKETALSLAEIAYSCGFSDQSHFTRVFTKSTGVTPGQWRRGS |
3 | Mycobacterium_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_17 |
184119 : 188043
Sequences of DBSCAN-SWA_17
Nucleotide sequences of DBSCAN-SWA_17 >CP036360|184119:188043|DBSCAN-SWA CTTAGGGAAAAAGACGTTGCCGCCGGTCTCGCGCATCGAATAAATGTACCGCGCCCTGACCAAGAGAAATCCTTTGGACAGAGCCGACCTCGAACCATGGCCGCTCGCGCAGAACCGCCGAGACCAGTTGGCTGCCGAGCTTCAGTTCGAGAAAGGTTTCCGAACCGGTCGGCTCGATATTCTCGACGATAGCTTCGAGTGACCCTTCCGGGCCGAGGGCGATTTCTTCAGGGCGCTTGCCGATTAGCAGCTTCTGGCCTTCTCTCGCCCCTCCCGGTGCGGTGAGACCGGAGGCCTGTCCGTCCTCCAGCACCAGCTGTCCGGCGGATACGGTTGCCGGGAACAGGTTCATCGCAGGCGAGCCGATGAACTGGGCGACGAAGGCATTGGCCGGCCGGTCGTAAAGCTCGAGCGGCGTGCCGATCTGCTCGATGCGGCCGGCCTGCATGACGACGATACGGTCGGCCATGGTCATCGCCTCGACCTGGTCATGCGTCAAATAGACCATCGTCGTTCCCAAACGCCGGTACAGGCTCTTGATCTCGCCGCGCATCTGGACGCGCAGCTTGGCATCGAGATTGGAGAGCGGTTCGTCGAAGAGGAAGACCTGCGGATTGCGGACGATCGAGCGGCCCATGGCGACACGCTGGCGCTGGCCGCCGGAAAGCGCCTTCGGCTTGCGTTGCAGCAGGCTTTCTAGGCCGAGGATCTCGGCGGCCTCGCGGACCTTCTTCTGCTTCTCGGCCTTGCTGGCGCCGGCGAGCGTCAGCGAGAAGGCCATGTTTTCCTCGACGGTCATATGCGGGTAGAGCGCATAGTTCTGGAAGACCATCGAGATGTCGCGGCTCTTTGGCGGCAGGGTGTTGACCACCTTTCCGCCGATCGAGATCGTGCCACCGGAAATGTCCTCCAGCCCTGCGATCATCCGCAGCAGGGTGGACTTGCCGCAGCCCGACGGCCCGACGAGGACGACGAATTCGCCGTCGCCGATCGCCGCGTCGATGCCGTGGATGACCTCGGCGCTGCCGAAACGCTTGGCGACCTTGTCGATCGTGACGCTTGCCATGAAGTCCTCCTTATGCCGCGCCGATATGGATGCGCGTGGCCGACAGGGCGCCCTCGTGGATGATCAGGCCGGCGGAACCGCCGGCAAAGGTCTGGTCATTGGCCTCGACCACGGTATCGCCGACCTTCACCGAAAGCTTGCCGCCCTTGGCGGAGAACTTCATCGTGAAGCGTGTCTCGAACTCGACGCGAAGCGGCGCGGAGGCGAGCACGGTCTGCTTTTCGTCCTTGCGGCAGACGATCTGCAACTGCCCCTGACGCGTCACCTGGGCCGCATAATAGCGGCGCTGCCCTCCGGCGCGGATCACCAGGCCACCGTGATCGCCGAGATGCAGCATCATGTCGGCTTCGATCGCGTAATCCGTCCAGTCGCGCGTGCCCTGCAGGATCATGCCTTCGCCGCGGTTCTGGCTGATGCGGAAATCCTGCGGGAAGTTCTTCGAGAAGAAGTGGACGTTGCTGACCCAGGCGCGGCGCCAGAAATCTCCGCCGTCTTCGGGACGGCGGAAGGTGGCATCGGGGCAACCGTCCCAGGTGACGGTATCGACGAGCACGCGGCCGTTGGCACGGCCCTGCGGGGCACTCAGCACGACGCCGATTTCGGCGATCGGCTGGTGGCCGGTCGAGGGAATCGTCCATTCGAGCGCTGCGGAACCGCCGGGGCCGAGAAGCTTGCCGGAGGGCATATCGATATCGACGAGATTGTCGCTGCCGTCATAGTGGCGGATGCGGAGCTTGGCTTCGACCGTGCCGCTGTTTCCGGCTTCTGCCGAAAGCGTGGCGCGCAGCGTCTGACCGGCGTGGATGCGCGGGCTGGCGACCAGATCGTAGGTACGCATCTTGGTGACTTCGTGCGGCGTGAAGACCGGCGTGGTTGCCGCAGCCTCGCGGCCTGGCGCCAGCGCCAGATAGTCGATCGCCAGAGCCGAACCAGTGCCGGTGACCTTGAGGCCGTCGACCGCCGTCTGGCCGGCCTGCGGGCGGAAGCCCTGGACGGAGCCCGGCAGCGCGAAATGGAATTTCGCACCGTTCTTCGGCGCCGGCTTTGCCGGCTCGCCCGCAAGCTTACGGCCGAGATTGGCAAGCCACAGAGAGATATCGGCGGCATCGGAGATCGAGCGGCCGCCGTCGGCGGTCGACATGTAGAGACGGTCGGCAACGGGGCCGCGCCAGTCCGGCCCGGCATCAAGGCCGTCGAGACCGAGCATGATCCCGTGCAGGCAGCCGAGGTTGCCGGCGTTGCAATCGGTATCCCAGCCGGAGGTGTTGACGATCATCTGGCCGCGCTGGAAATCGTGCGGGGCATGCAGCAGCGCCATGATCATCAGTGCGTGGTTCGGTACGACATGGCAGTTGCCCGGATATTTGTCGTAGCCGTAATTGTCCTCGATCTTTTGACGCGTCGCTTCCCAGTCCTCGTTGCCGGAGGTCCAGTTGCGGATATCGCCGACCAGCTTGGCGATCAGGCTGTCCTTCGGGATATGGGCAAGGCCGGTGTCGAGCAGGTGGTTCACATCCTTGCTGACAAAGGCTTCCGCCTCCATCGCCGCCCAGAGCACGGCGGCGTTGACAGCTTCGCCGTCATGGCTGACGGAACCGGCAGCGCGCGCCAGCTTGGTGGCCTGTGCCGGGTTGCCGGGCGAAACGAGAGCCCAGCCGTCGATGAAGATCTGCGCGCCGATCTGCTCGGCGACGGTCGCGCCATTGACCTCGATCGACCCGGATGCCGGAGCCGGGATGCCGCGTTGCAGGTTGAGCCAGGCGGTGTGTTCGGTCGAATTGCCGGCGCCACTCCACCAGAGGATCGAGCGCTCTTCGATGATGTAGTTCAGCCAGGCCTTGCCGATGTCTTCGGCCGAGAGATCGGGCGAAATGCCGTAATCGTCGAGGGCACGCGGGAAGGTGAAGGTGCCGGCAACGTCGTCGTCGGTGACGACGAGCGGATCGCCCAGGCGGTCGTGGACGTAATAGGTGATCGGGCCGAGCTTCTCCAGGATATCCTTGTAGCGCCAGTTCTCGAACGGGCGGCCGAGATAGACGCCGATCAGCTTGCCGAGAACGCCGGCATAGACGCGGTTGGTATAATCTGCGGGAGTGGCGTCGCCTTAACCTGACGTAGGACGCAATCATTTCTTCGTCTTGAATTTCAGGCTTGGATCGGCAACGCGATCTCGCGCCAGATGGCTCTGGCGAGGGCGCGGAGTTGGCGATGATCAGCGGCGTTCAGGTGTTTTCGATGAAGTTGGAAAAGATTGGCAATTTGACCGTGGGTTGAGACGAAACGTTGGCATTGTCGTGCCGACTTGAACCGCATCATGCGTCGCTCTCGTCGTCGAACGGGAAGGTGAGAATTCTCTGCGCGGTTGTTGAGCCCTTTGTGCGAACGGTGCTTGATCCCCGGCATGATCTCTCGCTTTGCGGCGGCATAAGAACGCAGCTTGTCTGTCACCATCACGCGTGGCGCAACACCTTGGCTCTTCAACAACTTGCGCATCAGCCGAAGAGCAGCTCCCTTGTTTCTGCAGCTTTGCAGCAGAGCATCGAGAACGTAGCCGTTGCCATCGACGGCGCGCCACAGCCAGTATTTCTTGCCCTTGATCGATACGACCATCTCATCGAGATGCCATTTGTCTGCGAAGTTGCCGATCGACCGCCGCTTGAGTTGATGGGCGAATTTCAGGCCAAACTTGGTCGCCCATTCTGCGACGGTCTGAAACGAGACGCCAATGCCGCGCTCGGCAAGCAGGTCTTCGACATCGCGCAGGCTCAGCGGGAACCGATAATAGAGCCATACCGCATGCGCGATGATCTCGGCTGGAAAGCGGTGGCGTTTGTATTGGGCGGACGCATCTTCAGTCAT
Protein sequences of DBSCAN-SWA_17 >CP036360|184119:188043|184119_185184_-|QBJ16567.1|DBSCAN-SWA MASVTIDKVAKRFGSAEVIHGIDAAIGDGEFVVLVGPSGCGKSTLLRMIAGLEDISGGTISIGGKVVNTLPPKSRDISMVFQNYALYPHMTVEENMAFSLTLAGASKAEKQKKVREAAEILGLESLLQRKPKALSGGQRQRVAMGRSIVRNPQVFLFDEPLSNLDAKLRVQMRGEIKSLYRRLGTTMVYLTHDQVEAMTMADRIVVMQAGRIEQIGTPLELYDRPANAFVAQFIGSPAMNLFPATVSAGQLVLEDGQASGLTAPGGAREGQKLLIGKRPEEIALGPEGSLEAIVENIEPTGSETFLELKLGSQLVSAVLRERPWFEVGSVQRISLGQGAVHLFDARDRRQRLFP >CP036360|184119:188043|185194_187231_-|QBJ16568.1|DBSCAN-SWA MIGVYLGRPFENWRYKDILEKLGPITYYVHDRLGDPLVVTDDDVAGTFTFPRALDDYGISPDLSAEDIGKAWLNYIIEERSILWWSGAGNSTEHTAWLNLQRGIPAPASGSIEVNGATVAEQIGAQIFIDGWALVSPGNPAQATKLARAAGSVSHDGEAVNAAVLWAAMEAEAFVSKDVNHLLDTGLAHIPKDSLIAKLVGDIRNWTSGNEDWEATRQKIEDNYGYDKYPGNCHVVPNHALMIMALLHAPHDFQRGQMIVNTSGWDTDCNAGNLGCLHGIMLGLDGLDAGPDWRGPVADRLYMSTADGGRSISDAADISLWLANLGRKLAGEPAKPAPKNGAKFHFALPGSVQGFRPQAGQTAVDGLKVTGTGSALAIDYLALAPGREAAATTPVFTPHEVTKMRTYDLVASPRIHAGQTLRATLSAEAGNSGTVEAKLRIRHYDGSDNLVDIDMPSGKLLGPGGSAALEWTIPSTGHQPIAEIGVVLSAPQGRANGRVLVDTVTWDGCPDATFRRPEDGGDFWRRAWVSNVHFFSKNFPQDFRISQNRGEGMILQGTRDWTDYAIEADMMLHLGDHGGLVIRAGGQRRYYAAQVTRQGQLQIVCRKDEKQTVLASAPLRVEFETRFTMKFSAKGGKLSVKVGDTVVEANDQTFAGGSAGLIIHEGALSATRIHIGAA >CP036360|184119:188043|187329_188043_-|QBJ16569.1|transposase|DBSCAN-SWA MTEDASAQYKRHRFPAEIIAHAVWLYYRFPLSLRDVEDLLAERGIGVSFQTVAEWATKFGLKFAHQLKRRSIGNFADKWHLDEMVVSIKGKKYWLWRAVDGNGYVLDALLQSCRNKGAALRLMRKLLKSQGVAPRVMVTDKLRSYAAAKREIMPGIKHRSHKGLNNRAENSHLPVRRRERRMMRFKSARQCQRFVSTHGQIANLFQLHRKHLNAADHRQLRALARAIWREIALPIQA |
3 | Bacillus_virus(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_18 |
193278 : 195213
Sequences of DBSCAN-SWA_18
Nucleotide sequences of DBSCAN-SWA_18 >CP036360|193278:195213|DBSCAN-SWA CATGAAACACATTTCTATCGTCGGAAAATTTCTTATTATTATGGCATGCTTCGGCTTGTTCTCGCTTGGGGTCGCCCTTTACTCCGGGATCCAGATTGCCAGAGTTGACGAAGACTATGCTGGCCTGATGGACGGCGAATCAACCGCCGCCTTGTATTTGGCTCGCTCCAATCGCAACCTCCAGGCAGCACGAGCGTCCATCGGGGAACTGTTGATGTCGCGCTCGGCAGACCTGAACGAGCGGGCGGAAAAGGGTATCAAGGATGCTGAAGCCGGTTTTGTGAAATACATGGACACCGTTGCTGGCGCTGTCCCGCAGCACGCCGAGATTGCTCTTCTCAAGGCCGAAGGCCTCAAGGTCATGAAAGAGGTGTGTGGCCCGACCATTGTGGCGGCGCGAAACGCGACGACGGAGCAAGACATTGCGGCGTCACAACAATTGTTTCTGAGCCAGTGTCAGCCGGCTTTCAGCGCGCTCACTCCGAAGTTTACAGGTGCGACGAATGAAATCGTTGATAGTGCTGCGAAGACGGGTGAGGAGCTGTCCGCGCACGCCAGCAGCACCGCTACCAAGACCCTCGCCGCTGTCATCGTCGGCTTGATCATCGTTCTCGCGGTCGGCTTCCTCGCTATCCGTTCCTGGTTGGTACGTCCGATCCAGCGGATGTCGGTCACCATGAACGTGCTCGCCAATGGGGATTTGAACGCCGGCGTCGATGGAACGGATCGCCGCGATGAAGTTGGCGGCATGGCGAGAGCCGTCCAGGTATTCAAGGAAAACGGCCTGCGGGCAAGGGAGCTTGAACAGGAAGCGGCCGCGACGCGCTCCGCTAGCGAGGCGGAGCAATTGCGGGTCGCGGAACTTGAGCGTGAGCGCGCCCAGCAGATGGCACAGGCGACGTCAGGCCTTGCGGAGGGTCTGAAGCACCTGTCGAGCGGGAACCTGACTTTCCGACTGAACGAACCCTTTGCCACAGATTTCGAGGCATTGAGGTCTGACTTCAACGCAGCTGTTAGTCAACTGGCAGAGAGCTTGCGATCTGTCGCGAACGCCACAGGATCAATCGATAGTGGAGCACGTGAAATCAGCCAGAGTGCCGAAGACCTCTCAAAACGGACCGAGCAGCAGGCCGCGTCGCTTGAGGAAACGGCAGCCGCTCTCGACCAGATTACCACCAACGTTGCAAACTCCTCCAAACGCACCGAAGAAGCACGTCATGTGGCAATCGAGGCCAACAAGTCAGCGCGCCGCTCGGGCGAAGTAGTTTCGAATGCTGTAATCGCCATGCAGCGCATCGAACAGTCGTCGAACCAGATCTCCAGCATTATCGGTGTCATCGATGAGATTGCCTTCCAGACCAACCTGTTGGCGCTGAATGCCGGTGTCGAGGCTGCGCGGGCCGGGGAAGCGGGCAAGGGCTTTGCGGTGGTCGCTCAGGAGGTGCGCGAGCTTGCTCAGCGGTCGGCTCAAGCAGCGAAGGAGATCAAGTATCTCATCCGAAACTCCGTGGATGAAGTCAGCACGGGTGTAAAGCTGGTTCAGGAAACGGGCCAAGCGCTCAAGGTGATCGAGGAGCAGGTTGTTTCGATCAACACGCAGCTCGACGCCATATCAACGTCGGCCAAGGAGCAGTCGATTGGTCTTGCCGAGGTCAACACGGCCGTGAACCAGATGGATCAGGTGACGCAGCAAAACGCGGCCATGGTGGAAGAATCGACGGCTGCCAGCGCGTCTCTTGCCAGCGAGGTGCAGCGCCTGCGTGAGATCATATCTGAATTTCGCGTCGGCGCGGAAGGAAGAGCTGAAACCGGCAAGCCGACGGCGGTGAGAGTCGAACACAAACCTGTCATTTCTCCTGCCCGCCGCATGCTTGCGAAAGTAGCAGGAGCTATCGGAGGCCGCGCAGCCGCGGCTGAAAGCTGGGAGGAATTCTGA
Protein sequences of DBSCAN-SWA_18 >CP036360|193278:195213|193278_195213_+|QBJ16849.1|DBSCAN-SWA MKHISIVGKFLIIMACFGLFSLGVALYSGIQIARVDEDYAGLMDGESTAALYLARSNRNLQAARASIGELLMSRSADLNERAEKGIKDAEAGFVKYMDTVAGAVPQHAEIALLKAEGLKVMKEVCGPTIVAARNATTEQDIAASQQLFLSQCQPAFSALTPKFTGATNEIVDSAAKTGEELSAHASSTATKTLAAVIVGLIIVLAVGFLAIRSWLVRPIQRMSVTMNVLANGDLNAGVDGTDRRDEVGGMARAVQVFKENGLRARELEQEAAATRSASEAEQLRVAELERERAQQMAQATSGLAEGLKHLSSGNLTFRLNEPFATDFEALRSDFNAAVSQLAESLRSVANATGSIDSGAREISQSAEDLSKRTEQQAASLEETAAALDQITTNVANSSKRTEEARHVAIEANKSARRSGEVVSNAVIAMQRIEQSSNQISSIIGVIDEIAFQTNLLALNAGVEAARAGEAGKGFAVVAQEVRELAQRSAQAAKEIKYLIRNSVDEVSTGVKLVQETGQALKVIEEQVVSINTQLDAISTSAKEQSIGLAEVNTAVNQMDQVTQQNAAMVEESTAASASLASEVQRLREIISEFRVGAEGRAETGKPTAVRVEHKPVISPARRMLAKVAGAIGGRAAAAESWEEF |
1 | uncultured_Caudovirales_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_19 |
198489 : 199503
Sequences of DBSCAN-SWA_19
Nucleotide sequences of DBSCAN-SWA_19 >CP036360|198489:199503|DBSCAN-SWA TTCATCGCTCGAAGGGAGTCAACCATTTCGAATGTCTAGGGGCAGGCCCCGATGAGCGTCGCTCGATAAAGGTCGAGGCGATAAGAATGCGTTTTGCGACGACCGGCTGCCCTCTTGTTTCAATGCGTTCCAACAATAGCCGCATCGCTGTAGCGCCCATCTCCCCCCCTTGTACTCTGATGGTCGTCAGTGGCGGAGATATTTGGGTCGCGGCGGAAAAATCACCGAAGCCGATAACGGACATATCGTCTGGTATGCGGTAGCCGCGTGCTAGAAGTTCGGAAACAGCGGTGAGCGCTAGCCCATCGTGAGCGCAGAACAGGGCTGTTGGTGCGATTCCGGCAGTTTGAAGAGTTTGCAGCGCAGGGATAAAGCCTCCATCTTCGTCAAAGCGCAGGACATGCAGGCGCGCATCGGCATGTTGCTCTAAACTTTCGCGCAGACCGTGATATCGTTCCATACGACCGCGATAGCCTTCCTTTCCTTGCACAAAGGCGATGTCGCGATGGCCGAGCCCAAACAAATAATCGCCGACCGCTATCCCTGCTTCGTGGTCCGTGCCGCCGACATGATCGGCCTGTTCAAGTGGAAGCACCCACCCCAGTCGAACCGAAGGCAAGCCCGACGCTTTGACAATGTCCAGCGTCTTTTTTTCGTGCGGCCCAACAAGAATGATACCTGCGCTATCTTTCACCATCGCCTCGACCCTGTCGGCGTCATGCGTCCACTGCAACCGTACAGCTTGACCTAGTCGATGCGCCTCCTTCTGCATGCCGTCCTGCAGCATCGTCCGCAACTCGTAGCTCACGCTATCGACCTGATCGTGAAAAATCAGAGTGATTTCGCGTTGCTCGTCGGCGGGCTTCGGGCGCTGGTAGCCTAGCATCATGGCAGCGTCCTCGACGCGTTTGCGCGTCGCCTCACTGACTCCTGCTTTTCCTGAAAGTGAACATGAGACCGCAAATTTTGACAGGCCGGTGTGATTGGCGATGTCCTGCAATGTCACTTTGCGCAT
Protein sequences of DBSCAN-SWA_19 >CP036360|198489:199503|198489_199503_-|QBJ16580.1|DBSCAN-SWA MRKVTLQDIANHTGLSKFAVSCSLSGKAGVSEATRKRVEDAAMMLGYQRPKPADEQREITLIFHDQVDSVSYELRTMLQDGMQKEAHRLGQAVRLQWTHDADRVEAMVKDSAGIILVGPHEKKTLDIVKASGLPSVRLGWVLPLEQADHVGGTDHEAGIAVGDYLFGLGHRDIAFVQGKEGYRGRMERYHGLRESLEQHADARLHVLRFDEDGGFIPALQTLQTAGIAPTALFCAHDGLALTAVSELLARGYRIPDDMSVIGFGDFSAATQISPPLTTIRVQGGEMGATAMRLLLERIETRGQPVVAKRILIASTFIERRSSGPAPRHSKWLTPFER |
1 | Enterobacteria_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_20 |
206058 : 209423
Sequences of DBSCAN-SWA_20
Nucleotide sequences of DBSCAN-SWA_20 >CP036360|206058:209423|DBSCAN-SWA GATGACGAAGCACGAAACCTTGCTGGCGGTCAAAGACCTGTCAATCGATTTCCATCTCAGAACCCATGTGCTCCATGCCGTTCGCAATGTCAGTTTTGACCTTGAACGCGGAAAGACGCTTGCGCTGGTGGGAGAAAGCGGTTCAGGCAAATCAGTGACCGCCCGTGCCCTGATGCGGATCATCGACAAGCCCGGACAGATGACAGGCGGCCAGATCGTTCTTGATGGGCCCAACGGACCAGTCGATATCGCGAAATTTGCCGCGAGCAGTCGCGAGGTCCTGGCCATTCGCGGTGGACGGATCGGCCTGATTTTTCAGGAACCGATGAGTTCGCTGTCGCCGGTCCATACCATCGGCTCACAGATCATCGAAGCCGTGCGTCTACATCGCCGCGTGTCGAAGAGACAGGCTCGGGAGCGATGTATCGAACTGCTGCGCCAGGTGGAGATACCGCAGCCGGAAGTGATGGCCGACAGATACACCTTCGAGTTTTCCGGCGGAATGCGCCAGCGAGCGATGATTGCCATGGCACTGGCCTGCGACCCGGAAGTTCTGATCGCCGATGAGCCGACGACGGCGCTGGATGTGACGACGCAGGCCGAAATCCTCGATCTCATCAAGCGGCTGCAGGTGGAGCGCGGCATGGCGATGCTGCTCATCACCCACGACATGGGCATCGTTGCCGAAGTAGCGGACGAGGTCGCCGTCATGCGTTTTGGCAAGATCGTCGAAAAAGGGCCGGTGGACGAGATTTTCCACGCCAGTCAGCATCCCTACACACGGCAACTTCTGGAAGCGACGGTCAAGCTCGAGAGCGGCGCAGCCACGCGCACGCTGCCGGCGTCGCTGACGGCCAACGTGGCTCCGATCCTGTCAGTGCGCAATCTTTCCAAAGTCTACGGCGCGCCGTCCGGACTGTTTTCGTATGGCGGTGGGCGCGGGCTGGTGGCGGTGGACGATGCTAGTCTTGACCTATTTGCCGGAGAAAATCTCGGCATTGTCGGCGAAAGCGGATCAGGCAAAACCACACTAGGACGAATGATACTGCGGATCGTTGAGCCGAGTTCCGGCAGAATTGCGTATCGCGGATGCCCGGATACGGCTCCGATGGACGTCACCACGCTAAACAAAGTGAATTTGCGCCGCTATCACCAGGATGTGCGTCTTATCTTTCAGGACCCATTTGCGTCGCTCAACCCCCGTATGACGGTGAAGCAGATCATCGGAGATCCTCTGGTCATCGCCGGTGGCATGTCCGGCAAAGCGGTGGAGACCCGCGTTGGCGAACTTTTGCAGAAGGTCGGGCTCGATCCCTTGGCGATGGAAAGATACCCACATGCCTTTTCCGGTGGCCAACGCCAGCGGATCGGGATTGCAAGGGCGCTCGCCGTCAACCCGAAGGTCATCGTCGCCGACGAGGCGACCTCGGCGCTCGACGTGTCGATCCGAAGCCAGATCCTCGACCTGTTGCTCGATATTCAAAAGCAGCTCAATCTCAGCTTCATCTTCATTTCGCACGATATCTCGGTCGTGCGCTATTTCTGCGATCGCGTTGCAGTCATGCACAAAGGCAAGATTGTCGAGATCGGAGATGCGGAAACAATCTGCACAACGCCCTCGCACCCCTATACCAAGCGACTGATCTCTTCTGTTCCCAACCCCGACCCCCGCAACAAGCGCATGCTGCACCGCCTGCGCGCAGACCAAGTTTGAGGTCATTGCCGATGCCGAAATTCGCTGCCAATCTGTCCATGCTTTATACCGAACATCCGTTCATGGAACGGTTCGCCGCTGCCGCTGCCGATGGTTTTGCGGCAGTCGAATATGTCAGCCCCTACGAGGAGGCGGCAAAGACGATTGCAGCGGAGCTCAGGAGGTGCAATCTCACACAGGCTCTGTTCAACCTGCCGCCCGGCAACTGGGCCGCAGGCGAGCGCGGCATCGCCAGCCTGCCGGACAGGGTCTCCGAATTTGAGACGTCCGTTGAAACGGCGATCCGTTATGCAAAAATTCTGGGATGCCGAAAGATCAATTGTCTGGCCGGTATTCAACCGCCCGGTGTCGATCCGCAAGTTCTGGAAGACACGCTTGTCGGCAATCTCGGTCACGCTGCGCAACGTCTGGCGGATTCCGGAATTGCCCTTGTCTTCGAGCCCATCAACACTCGCGACATACCCGGATATTTCCTCACCAACACCGATCAGGCCGAGCGGATCATGGACCGGGTCGGGCATTCCAACCTGCTGATCCAATATGATTTCTACCACATGCAAATCATGCAGGGCGATCTCGTCGCGACCTTCGAGCGCCTGCAGGACAAAATCGGCCACGTTCAGATCGCTGATAATCCCGGACGTCATGAGCCGGGGACGGGCGAGATCAACCACGATTTCATCTTCAGGCGTCTGGATGAACTTGGTTACGACGGATGGGTCGGCTGCGAATATCGACCGGCGTCGACCACCAGCGCAGGTCTTGACTGGTTGAAAGCATATCAACGGGAGGATTGAGCCATGAAAATAGGATTTATCGGACTGGGGGTCATGGGCCGCCCGATGGCGCAGCATCTGATCGATGCCGGACATGAGCTTTATCTTCACAGGGTCAAACCCGTGTCGAACTCCCTCCTGAAAAGTGGTGCGAGGGCATGCGGTTCAGGTGCAGAAGTGGCGCGCAGCGCCGAGATCGTGATTTTGATGCTGCCGGACACGCCCGATGTAGAAGCGGTTTTGTTCGGTGACGATGGCGTCGCGCACGGCCTCGATGCGGGAAAGCTCGTCATCGACATGAGTTCGATTTCACCGATTGCGACGAAGGATTTTGCCGCCCGTATCGAGGCGCTGGGATGCGATTATCTCGACGCGCCGGTGTCGGGCGGTGAAGTTGGCGCCAAAGCGGCGTTGCTGACCATCATGGTCGGCGGAAAACCGGATATTTTCGAGCGCGCAAGACCCCTGTTTGAAAAGATGGGCAAGAACATCACCCTGATCGGTGACGTTGGAAACGGTCAGGTTGCAAAGGTCGCCAATCAGATTGTTGTCGCCCTGAATATTCAGGCCGTTTCGGAAGCCCTTCGCTTTTCGAAGAAAGCCGGAGCGGACCCCTCGATCGTGCGCAGGGCCTTGATGGGTGGCTTTGCCGCGTCCCGCGTGCTGGAGGTTCACGGCGAAAGGATGATCAACGAAACATTCGAGCCCGGCTTTCGTATACGTTTGCATCACAAGGACATATCGCTCGCGCTTGAGTCGGCGCGTCTTCTCGAGCTTATGCTGCCCAACACAGCGATGGTTCATCAGTTGATGAATGGCGCGCTGGAAAAAGGCCTGGGCGACAAAGACCACTCGGCCTTGATCAAAGCCTTGTAG
Protein sequences of DBSCAN-SWA_20 >CP036360|206058:209423|206058_207771_+|QBJ16585.1|DBSCAN-SWA MTKHETLLAVKDLSIDFHLRTHVLHAVRNVSFDLERGKTLALVGESGSGKSVTARALMRIIDKPGQMTGGQIVLDGPNGPVDIAKFAASSREVLAIRGGRIGLIFQEPMSSLSPVHTIGSQIIEAVRLHRRVSKRQARERCIELLRQVEIPQPEVMADRYTFEFSGGMRQRAMIAMALACDPEVLIADEPTTALDVTTQAEILDLIKRLQVERGMAMLLITHDMGIVAEVADEVAVMRFGKIVEKGPVDEIFHASQHPYTRQLLEATVKLESGAATRTLPASLTANVAPILSVRNLSKVYGAPSGLFSYGGGRGLVAVDDASLDLFAGENLGIVGESGSGKTTLGRMILRIVEPSSGRIAYRGCPDTAPMDVTTLNKVNLRRYHQDVRLIFQDPFASLNPRMTVKQIIGDPLVIAGGMSGKAVETRVGELLQKVGLDPLAMERYPHAFSGGQRQRIGIARALAVNPKVIVADEATSALDVSIRSQILDLLLDIQKQLNLSFIFISHDISVVRYFCDRVAVMHKGKIVEIGDAETICTTPSHPYTKRLISSVPNPDPRNKRMLHRLRADQV >CP036360|206058:209423|207782_208568_+|QBJ16586.1|DBSCAN-SWA MPKFAANLSMLYTEHPFMERFAAAAADGFAAVEYVSPYEEAAKTIAAELRRCNLTQALFNLPPGNWAAGERGIASLPDRVSEFETSVETAIRYAKILGCRKINCLAGIQPPGVDPQVLEDTLVGNLGHAAQRLADSGIALVFEPINTRDIPGYFLTNTDQAERIMDRVGHSNLLIQYDFYHMQIMQGDLVATFERLQDKIGHVQIADNPGRHEPGTGEINHDFIFRRLDELGYDGWVGCEYRPASTTSAGLDWLKAYQRED >CP036360|206058:209423|208571_209423_+|QBJ16587.1|DBSCAN-SWA MKIGFIGLGVMGRPMAQHLIDAGHELYLHRVKPVSNSLLKSGARACGSGAEVARSAEIVILMLPDTPDVEAVLFGDDGVAHGLDAGKLVIDMSSISPIATKDFAARIEALGCDYLDAPVSGGEVGAKAALLTIMVGGKPDIFERARPLFEKMGKNITLIGDVGNGQVAKVANQIVVALNIQAVSEALRFSKKAGADPSIVRRALMGGFAASRVLEVHGERMINETFEPGFRIRLHHKDISLALESARLLELMLPNTAMVHQLMNGALEKGLGDKDHSALIKAL |
3 | Planktothrix_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_21 |
219078 : 223153
Sequences of DBSCAN-SWA_21
Nucleotide sequences of DBSCAN-SWA_21 >CP036360|219078:223153|DBSCAN-SWA TATGTCAGGACGTTTCATCCGAGCCGCAACGGGAATATTTGCTGCGGTGATCTTGTTTACGGGGGCATCGGCCGCTGGGGCTGCCGAAAAGCAATATACCGTCACATCCTCCGATGGCGTTATAATCGCCGTCGAAGAAACGGGAAATCCCCAAGGTCAGCCAATTGTATTCGTTCACGGTCTGCTTGGCAGCCGCATCAATTGGGACAGCCAGACGGCCGATCCAGACCTGCAGAAGTTTAGGCTGATCACCTTTGATCTGAGAGGACATGGACTGTCGGCAAAACCGGATGACGCTGACGCCTACAAGGATGGCGACCGATGGGCAGACGATCTCGATGCAGTCCTTCGGGGAAGCGGAGCGACGAACCCGGTGCTCGTGGGCTGGTCTCTTGGGGGTGTGGTGCTGTCGAACTATCTTGCCAGCCATGGCGACGCCGGGATTGGTGGCCTCCTTTACGTCGACGGTGTCATCGAACTGAAACCCGATCTCATCACGCCGCATCCGGAGGTCTATGCAGGACTGGCCTCAGAGGACCTGCGGACGCATCTCGACACGGTGCGGACCTTCCTGGCCCTCTGCTTTGCAACGCCGCCAGAAAGCGCAACGTTTGAACGGCTGCTGTCCAATGCTGCGATGGCATCATGGCTGATGACGCGAACCGTCCCCTCGATGACAGTCCAGGCAAAAGAAGGACTGGAGAAGGCAAAAAAGCCGGTTCTGTTGATCTATGGGGGTAGGGATAATCTGGTCCGGCCGCAGCCGAGTATAGAACGCGCGAAGTCGTTCAACGGCGCGATCAAGTCGGAGATTTACGACAACTCGGGTCATGCGCCATTTCTTGAAGAAGCGTCGCGGTTCAATAAGGATTTGGCCAAGTTTGCGGCTTCTGTGGCGGATGGCAAGTAAGTCGGAAACTGGCTCCGCCGGCTCCCATTCTGCTGCGTCTTTCATGCTTCCATTCTAAAATCGAGACTGAATGGTCGTCTGATCTGGAGAAGCGTGCGTGTGGCGCCGTGGTCTTGATAGGTCCGCCCCGTCGCGTTGGGCCGAACACTGGTGCATCACGACGACGGAACTGGAACCGGCTAAACGAGTTATGATGGTTCAGGGAGATGGGGAAGCTGTCAGGAGCGAACGCGTGAACAGATGCAATATAACCCACCGGCGGATGACCATTGGAACGCACTCCCTGTTCCTCCGGGAAGCGGGGCCGATGGACGCGCCCGTTCTGCTCCTGCCCCACGGATACCCGTGTTCGTCCTATCAGTACCGCAGGCTGATGCCGGCGCTCGCCGATCAGTGGCGCACGGTAGCTGTCGATTGGCCGGGTTTTGGCTACAGCGACACGCCTGATCCCGCCGAGTTCGGATATGACTTCGACGCTTACGCCGAGGTGCTCAACAACGTCGCTGAGGCGCTCCGACTGGAGCGCTACGCGCTCTGGCTCCATGACTATGGATCGCAGATTGGTCTGCGGCATGCCATAGCCCACCCGGAGCGGATCGCCGCGCTGATCATCCAGAACGGTGACATCTACGAAGACGTGCTCGGCCCTAAATACGGGACGATCAAAGCGTGGTGGGCCGACAAGTCGCCCGAGAAGCACCGCCCTCTTGAGGAGGCCGTAAGTGAAGACGGATTTCGCGAGGAGTTCGTCGGTGAAGTCTCTGAGGAAGTGGCAAGCCTCGTCCCTCCCGACCTTTGGAAGCTGCATTGGCCGCTTATGGATACGCCGACACGCAAAATGGTGGCAGTCCGCCTGATGGAAAAGCTGGAGGAAAACCTCGACTGGTTCCCTCGCTATCAAGGCTATCTGCGTGAGCACCGCCCTCCGACCCTGGTGGTGTGGGGCCCGCAGGACGGCTATATGCCTGAGGCGTCGGCGCAAGCTTACCGGTGCGATCTCCCGGACGCGGAGTTGCACATTCTCAGCGAGGCCGGGCATTGGCTGCTGGAAACCCATCTCGAACAGGCGTTGCCGCTCGTCCGCGATTTCCTGGCCCGGACGTTCCGATGATCGCACCTTTGAACGTAGCCCGCAACGAGAGGAGACATGCATGGTTACGAAGCCCACAATTGCACGCATCTGGCGCGGCCGTACGCGCCGAGAGGTCGCGGACAGCTACGAGCCCTATTTGCGGGCCGAGCGCATCGAGCTGCATCGCATTCTGGTGAATGATCCGAGCCTTCGTTGAGCCGCGAGCAAGCCCTGCCTAGGGTGATGGAGCTCCGCAAGACAATCGGGATCTCCCGGCCGCAGCAAGGCCGGTCAGTCTCCAGATGGTCCCACCTCTTTGCATCGGGAACGGCGCGTGTTCCGGCGCAATCGCGCTGACCAGGAAAAGAAAGCGATGATGTCACTGCCCCACCGCGTCTCCGGCAATCTCGCTCTTGATCTCGCCAACACCATCAGTTGGCGCAACACGAGCCGGGAAGTCGACCATTTGGCAAGCTTCGATGACGTTGTGGCATGGTCGAAGCAAGTCGGCCTGGTCGGCGGCGACTTCGTCGTATCGCCGCAGGAGCAAGAGATACTGCTCCAGCAGGTGCTCGCGCTTCGCAAGGCAATCGGTGCCGCGGGATCGGCGATAGCTAACGATCTCGATCCGTCTCGGCTGGATCTGGACGTCATACGCGACATTGCTGCGCTGTCTCTGCGCCAGGCTTCGCTTTCCGGAACGCCGTGCACATGGCATTTTGAGGGCATCTATCGCGTGACCGGCACCATCGCGTGGTCCGCGCTCGACCTGCTTCGAGGCGACGAACTCTCCCGGCTGAAGCAATGCCCACCCGACGATTGTCAGTGGCTGTTTATCGACCGGACAAAGAACGCTTCAAGGCGCTGGTGCGACATGTCCACCTGCGGCAATCGCGCCAAAAAAATGGCTCACAGAGCCCGGCGTTGAATCAAATGGCGGCAACGAGCAGACAGCGGGATTCTGGACCCGTGATCAAAGCAGGATGAAATCCAATGATCACTTAGGTGTCGGGATACCGCATTGATTGCGCGCCCCAGGCGGACGTCGACGTGGATGGAGCATGTGGCTGAACTCATTCCTGAGGTCTGAGAAAACCATCAAGGCCACATCCCCTGCCGAGCGAGGAACAAGTGGCCCTCCATCTCTTCTGCGGCGTCCTACGCTCGGGAACTGACAGCGAAACCAGACGGAGCTTCAAATCGGAACGACTGCAAACTTTCATTTACCACAAGGCGTATTGTCGAATTCCACGCCGCGCATATGTCGTCTTTGTCGTAATGTATCCGGCCTTAACAACAACCGATGAGGTGACATGATGTCAATCAACTCGGGTCATGCTCGATATTTCCCGCGACCCATACGAACTACAGCTAACCCGTTGGAACGGGCTATCCTGCAGCTAGAGAAGATGATCGATGCGCAAAGAACCAGTGCGCTACCGGCACCGTTTGCTCTTTATAAAGCCAAACGCATTCTCGAACGATGCCACCAGGGGAGCCCGCTGCCGAAATTTGCGCGTGACTGCATGCCCAACGTGGAATAAGCGATCGGACATCGGTAGCACGGGCCCGAAACAATGAACCTTATCGGTGGTCAAAGTCTGCCCAACTGGTCATGACGACCTTCGCCGGCTCGTGCTACATCCAATATGCCGGCAGCGTCGGCGAGCGTTCCTTCTGCTACAGCAGCCGAAGCGTGCTCTGATCACAAGCTCCTGCGTAAAACATGTCCAGTGCGGCTTGGAAGGGTGAATCCCGTCCAGCAACAGCTCCAAACAACGTCATGCTGGATCGAAATTTGAGATCATCCGGGGATCCGAGAATGTCATGCGCGCTCAAATCGGTGTGCTGCAGCATGGCGGTCGTACACTCGACGAGGCGTGGTCCAAGCATAGGATGCGTCAAATACGCCTCCGCTTCCGCACGACCCGAGATCGCGTAAAAGCGCGCAGTTTCCGAACGGCCAAGGCCACGCAGTTGCGGAAAGATAAACCACATCCAGTGTGAGCGCTTCAGTCCCGCCTGCAATTCGCTGAGGGCCACCTCGTAAATGTTTTGCTGCGCCGTCACAAAGCGCTCGAGATTGAATTTCAT
Protein sequences of DBSCAN-SWA_21 >CP036360|219078:223153|222739_223153_-|QBJ16596.1|DBSCAN-SWA MKFNLERFVTAQQNIYEVALSELQAGLKRSHWMWFIFPQLRGLGRSETARFYAISGRAEAEAYLTHPMLGPRLVECTTAMLQHTDLSAHDILGSPDDLKFRSSMTLFGAVAGRDSPFQAALDMFYAGACDQSTLRLL >CP036360|219078:223153|219078_219987_+|QBJ16594.1|DBSCAN-SWA MSGRFIRAATGIFAAVILFTGASAAGAAEKQYTVTSSDGVIIAVEETGNPQGQPIVFVHGLLGSRINWDSQTADPDLQKFRLITFDLRGHGLSAKPDDADAYKDGDRWADDLDAVLRGSGATNPVLVGWSLGGVVLSNYLASHGDAGIGGLLYVDGVIELKPDLITPHPEVYAGLASEDLRTHLDTVRTFLALCFATPPESATFERLLSNAAMASWLMTRTVPSMTVQAKEGLEKAKKPVLLIYGGRDNLVRPQPSIERAKSFNGAIKSEIYDNSGHAPFLEEASRFNKDLAKFAASVADGK >CP036360|219078:223153|221393_221987_+|QBJ16595.1|DBSCAN-SWA MFRRNRADQEKKAMMSLPHRVSGNLALDLANTISWRNTSREVDHLASFDDVVAWSKQVGLVGGDFVVSPQEQEILLQQVLALRKAIGAAGSAIANDLDPSRLDLDVIRDIAALSLRQASLSGTPCTWHFEGIYRVTGTIAWSALDLLRGDELSRLKQCPPDDCQWLFIDRTKNASRRWCDMSTCGNRAKKMAHRARR >CP036360|219078:223153|220180_221098_+|QBJ16851.1|DBSCAN-SWA MVQGDGEAVRSERVNRCNITHRRMTIGTHSLFLREAGPMDAPVLLLPHGYPCSSYQYRRLMPALADQWRTVAVDWPGFGYSDTPDPAEFGYDFDAYAEVLNNVAEALRLERYALWLHDYGSQIGLRHAIAHPERIAALIIQNGDIYEDVLGPKYGTIKAWWADKSPEKHRPLEEAVSEDGFREEFVGEVSEEVASLVPPDLWKLHWPLMDTPTRKMVAVRLMEKLEENLDWFPRYQGYLREHRPPTLVVWGPQDGYMPEASAQAYRCDLPDAELHILSEAGHWLLETHLEQALPLVRDFLARTFR |
4 | Brazilian_cedratvirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_22 |
227694 : 228474
Sequences of DBSCAN-SWA_22
Nucleotide sequences of DBSCAN-SWA_22 >CP036360|227694:228474|DBSCAN-SWA TATGAACTGCGCCATATCAGCTGATCTGGAGGATAATATGCTACATGCTGAACACAGTCTGTCTGGAAAGTCGCATGCACCGTTTTGCGTCAGTTTAAAGCCCGACATGCCCCCTATTCAAGAACCCTTGGCAACAGTCGTTCACGACCTCGGTAATCTTATCCAGGTTGCGACGTCTGCCATAAATGTCCTGTCACGCAATCCCTTGGTCTCTGCCGACGTCCGGGCTTTAGATGTCATTCGCAGCGCCCGAATGTCGCTGGACTGTGCTGGCGGACTCGTTCGCCAATCCCTCCGGGCCAAGAGTAATGGCTATACGCCCGTTGAAACCGTCAATGTTGAATCTTGTCTCCTCGATGTGAAAGACTCTCTGGCCATCTGGGAGCCGGAAACACACGTCGAAGTTCAGGCAGAGCCGAACTTGCCTCGCCTGGTTTGTAACGCCATCGGATTCAAAGCAGTCCTTCTGAATCTGATGGTCAACGCCCGCGAAGCGATGCCGGACGGTGGCACGATTTTGCTGGCTGCGCGGGTACTGGTACCTGACGGCGCACCGGCTGTTGAAATCGTGGTCGCCGATGACGGTCACGGCATGACGCGCGAAACTCTCGAAAACGCGTTCATACCCTTGTTCACGACAAGATCGTCAGGCGTTGGTGGTTACGGACTTTCCGCCGCAAAAAACTTCGTTCAGGCGGCCGGCGGCCACATAAAGGTGGAAAGTGAGCCCACCGTCGGCACGCGGGTGATCATTGTCCTCCCGGCCTACGGCATCTCCTGA
Protein sequences of DBSCAN-SWA_22 >CP036360|227694:228474|227694_228474_+|QBJ16601.1|DBSCAN-SWA MNCAISADLEDNMLHAEHSLSGKSHAPFCVSLKPDMPPIQEPLATVVHDLGNLIQVATSAINVLSRNPLVSADVRALDVIRSARMSLDCAGGLVRQSLRAKSNGYTPVETVNVESCLLDVKDSLAIWEPETHVEVQAEPNLPRLVCNAIGFKAVLLNLMVNAREAMPDGGTILLAARVLVPDGAPAVEIVVADDGHGMTRETLENAFIPLFTTRSSGVGGYGLSAAKNFVQAAGGHIKVESEPTVGTRVIIVLPAYGIS |
1 | Bacillus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_23 |
239298 : 242113
Sequences of DBSCAN-SWA_23
Nucleotide sequences of DBSCAN-SWA_23 >CP036360|239298:242113|DBSCAN-SWA CATGCAGCATCTGAGTGGCAAAGTCGTTGTAATAACCGGCGGTAATAGCGGGATCGGCAGGGCGACCGCTCAACTCTTCGCGGAGGAAGGCGCCGAGGTGATTATCACGGGACGGCGAGCGGACGTGGTAGAACAGGCAGTTGATGAAATCGGCCACGGTGCGGTGGGCTTCGTCGCTGACAGCGCTGATCTCGACCATCACAGGAGACTTGCCGACTTCGTCGCCGGAAAATTTGGCCGGTTGGATGTCTACATGGCCAACGCCGGTATCATCAATCTGACGACGTCAGCTGACGTGACGCCGGAGGACTATGACCGGCATTTTGCCATCAACACGCGAGGGGTGTTTTTCGGAGTACAGGCTATGTCGCCATTGATACGGGATGGTGGAAGCATCATCGTGACCAGCTCCCTTGCCGCGACCAAGGTGCTGCCCGATCACACGGTCTATGCCGGCTCCAAGGCGGCTGTCGCAGCATTTGCCAGGAACTGGGCAATCGAGCTGAAACCGAGGAGGATCCGGGTCAATATCCTGAGCCCTGGGCCCGTGGAGACAGCAATCCTGGAAAAGTTGGGCGTTCCCGAGACCCTGCTCCCAGCGTTCGAGGAGCAGATGGCGTCTCTTATTCCTGCCGGTCGCATGGGGCGACCGGAAGAACTTGCGCGTGCGGCCCTTTTTCTTGCCTCGGACGCGGGGAGCTTTGTCAACGGCATCGAACTTCACGTTGACGGAGGCATGACGCTCGTATGAGCCAGACTGGCTCAAAGAACTGCATTTCGCCATTCGCTCGGCGTTAGTCCGGTGTGACGTTTAAATGCAACGCTGAATGCGGTCAGCGTCGCATAGCCCAGTTGTCCCGCAACGTCTGTTATCAGCACGCGCGGGTCTCGAAGCATTCCCATCGCTTGCTCCAGCCGCCTCTCCCGCAGCCACTCATGCGGGCTCACCCCCGTACTTTTCTTGAATGCCCGGCAAAAATGGAACCGCGACAGGTTCGCTTCGGCGGCGAGCGCCCCAAGTGAGAAGTCGAGTTCGTCGTCCGATGACAGCCGTTCCAGGACTGTTCGCAACACGTGCGGAGCAAGACCCCCCGAGGCTCTCCCCGGATAGTCGGCTGCATGATCGGGACCCTTTAGAAGATGAACCGCGATCAGTGACGTGATCTGCTGGCGATAAAGCGCCTCCAGGAAGGAGGGATTGTCCGGCGATCCAAGCGATCTTTCCAGGAGCGCCGCCAGGAACGCGTCGGGTCCGGCCGTGATCTCGATCAGCTGGCGAGTAGATCCCGGACATTCCTGCGCGATGTGATCGAGCAGTTGGGGTGCCATATAGAGCTGGACGATATCCATCGGACCATGGATGTCCCAACGGGAACTGGAGCCAGCGGGTATGACGGTGATGGTGCCTCTACGCCCCATGGCGGAGCGGTGGTCTCTCCCGGAGACGCGTTCGAGGCGTTGCATCGCGCCGAGATAGGTCATTACGACATGGTCGGACATCGGCTTTACGACATCGTGAAACGGTTGATGTTTCCAGCGGGCTAAACGGGTCTCGAAAGGGTCATGAATATTAAGGGATTCAATCGGCGGGGTTCGCAGCACTCGCTCCATTTCCTTGTCCGCTGGAACTCCCGACACAGATTGGCCGGCGAAACTGCTCGCCGCCTCACCAGTCTCTGATTGACGGAAATTTGTCCTTAGGCGAACCACGTGCGCGTCCTTCTATCGGCTCCATGTCGAGAACGAGAAAGAACGATACGGGACAGCGCGCCGATGAGGTATCATCCAACAGCCTAGGTTCATCTCGGCGATCGTGGAGCGAGGCTATCCGGATACGAAGAGAGCTTCCCCCATTGCACGAGGTGTCCCATGTCCAATGGTGTCAGAATTGGCGTTTGCTATATGAACAGCGATATCCCTGTGAGATGACTGTCTGGCGCGGGGAGGCGGTGTCAGATCGCGCACCGCCTCCCGACAGTTCAGCTTTTGACAAAAGCGAGGAGGTCCGTGTTAATCAGTTGCGAGTGCGTTGTGCACATGCCGTGGGGGAGACCTTCGTAAACCCTCAGGGTAGAGCGCTTCAGCAATTTCGCGGACAGCGGAGCGGAGTTGGCAAGAGGAACGATCTGGTCGTCGCCTCCATGCATAACAATTGTCGGGACAGTGATTGTTTTCAGATCCTGCGTGAAGTCGGTTTCCGAAAATGCCTTGATGCCGAGGTAGTGCGCATTCGCGGCGCCCATCATCCCCTGCCGCCACCAGTTTTGGATGATAGCATCAACGGGTTTAACGCCGTCTCTGTTGAATCCATAAAACGGGCCACTCGACAGATCAAAGTAGAACTGGGACCGATTGGCGGCAAGCTGGACACGAAGGTCGTCAAAAGCCTCTATCGGAAGGCCTCCGGGGTTATTGTTCGTCTTGACCATTATCGGCGGTACAGCACCGATGAGAGCCAACTTCGCCACCCGACCCTGAGGCTGCCCATGTCTGGCGACGTAATGGGCGGCTTCGCCGCCGCCAGTGGAGTGACCGATGTGGATCGCGTTGCGAAGATCGAGATGCATTGCGAGCGCGGCGACGTCCGCAGCGTAATGGTCCATGTCGTGGCCGGCGCCGGTTTGGCTCGATCGTCCGTGGCCACGCCGGTCATGTGCAATCACACGATATCCCTTGCCGAGGAAGAACAGCATCTGCGCATCCCAATCGTCAGAGGAAAGCGGCCACCCGTGGTGGAAGACCAGCGGCTGACCGTCCCTGGGACCCCAGTCCTTATAAAAAATCTCGACGCCATCGGCTGTGGTCACATAATTCAT
Protein sequences of DBSCAN-SWA_23 >CP036360|239298:242113|241276_242113_-|QBJ16854.1|DBSCAN-SWA MNYVTTADGVEIFYKDWGPRDGQPLVFHHGWPLSSDDWDAQMLFFLGKGYRVIAHDRRGHGRSSQTGAGHDMDHYAADVAALAMHLDLRNAIHIGHSTGGGEAAHYVARHGQPQGRVAKLALIGAVPPIMVKTNNNPGGLPIEAFDDLRVQLAANRSQFYFDLSSGPFYGFNRDGVKPVDAIIQNWWRQGMMGAANAHYLGIKAFSETDFTQDLKTITVPTIVMHGGDDQIVPLANSAPLSAKLLKRSTLRVYEGLPHGMCTTHSQLINTDLLAFVKS >CP036360|239298:242113|240059_240908_-|QBJ16611.1|DBSCAN-SWA MERVLRTPPIESLNIHDPFETRLARWKHQPFHDVVKPMSDHVVMTYLGAMQRLERVSGRDHRSAMGRRGTITVIPAGSSSRWDIHGPMDIVQLYMAPQLLDHIAQECPGSTRQLIEITAGPDAFLAALLERSLGSPDNPSFLEALYRQQITSLIAVHLLKGPDHAADYPGRASGGLAPHVLRTVLERLSSDDELDFSLGALAAEANLSRFHFCRAFKKSTGVSPHEWLRERRLEQAMGMLRDPRVLITDVAGQLGYATLTAFSVAFKRHTGLTPSEWRNAVL >CP036360|239298:242113|239298_240048_+|QBJ16610.1|DBSCAN-SWA MQHLSGKVVVITGGNSGIGRATAQLFAEEGAEVIITGRRADVVEQAVDEIGHGAVGFVADSADLDHHRRLADFVAGKFGRLDVYMANAGIINLTTSADVTPEDYDRHFAINTRGVFFGVQAMSPLIRDGGSIIVTSSLAATKVLPDHTVYAGSKAAVAAFARNWAIELKPRRIRVNILSPGPVETAILEKLGVPETLLPAFEEQMASLIPAGRMGRPEELARAALFLASDAGSFVNGIELHVDGGMTLV |
3 | Trichoplusia_ni_ascovirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_24 |
247485 : 250694
Sequences of DBSCAN-SWA_24
Nucleotide sequences of DBSCAN-SWA_24 >CP036360|247485:250694|DBSCAN-SWA TATGACCGCCGCAGGCTCACGTGAACACCAGATGTTTCCTGCACTCGATCCGCAACAGATTGCGACGGCAAGACGTTTCGCTGATGACGAACCGCGGTACTTTCTGCCCGGCGAGATGATTTTCAATGTCGGCGAGAGGCACGCACCCGCCTGGCTGGTGCTCGAAGGATCGATTGAGGTTTTGCGCCAGGACGGCCTGTCATCCAGCGTTCCCGTGACCCGACATACGACGGGTCAATTTTCCGCAGAGGTCAGCCAGCTGTCGGGCAGGCCATCCCTGGCGAGTGGGCGTGCCGGTGCGGAGGGATGCATGGCCGTTTCATTCGACGCCCCGCATCTTAGAGCTCTGATCATCGGCACCGCCGACATCGGAGAGATCGTGATGCGCGCGTTTATACTTCGGCGCGTCGCCCTGGTAGATCAGGGCGGTGCCGGTTCTGTACTGGTAGGTCAACCGGGCAGTGCTGACCTTATCCGCCTTCAAGGCTTTCTGGCGCGAAGCGGATATCCGTATGTCGCGCTTGACGCTGACGCCGACGGACAAGGCCGCGATCTTGTCCATCGACTGGGTATCTTGAGAGAAGAACTGCCACTGATGGTGTGCCCCGGCGGAGCCATTCTGAAGAATCCGACAGACGGCGAGGCCGCAGTTTGTCTGGGCGTCGCACAGGAGATCGTGTCCGGTGCCGTCTATGACGTTGCGATCATCGGCGCCGGCCCGGCCGGCCTGGCGGCCGCCGTCTACGCGGCCTCCGAAGGTTTGAGTGTCCTGGCGATTGACGAACGATCCGCAGGCGGCCAGGCGGGAGCGTCCGCACGGATCGAAAACTACCTTGGCTTCCCTGCGGGGATTTCCGGGCAAGATCTTGCGGCGCGTGCCTTCAACCAGGCGATAAAATTCGGGGTCGAGGTTGCTTTGCCCATCCCCGTTATTGACCTGCGGGTTGTTCAATCATCATGGGGAACCGGTTCGATATTGCGGCTGGGTCTCGATGGCGAACGGAGCGTCGATGCGCGAACCGTGGTGATCGCGTGCGGGGCGCGTTATCGCCGCCCGCAGTTGGCAAATCTCGCCGAATTCGAAGGGTCCGGTGTCTCCTATTGGGCGTCTCCTATCGAGGCAAAACTATGCGAAGGTGAAGAGGTCGTACTCGTCGGAGGGGGGAACTCGGCGGGGCAGGCGATTGTCTTCCTTGCTCCCAAGGTCCGGCAGCTCCATGTTGTTGTCCGAAGACCCCTCAAAGATACGATGTCCAGCTATCTTATTGACCGTATCGGGGCGCTGTCCAATGTCGAAATCCATGTCGACAGCGAGATTGTTGGCCTTGAGGGAGACCGCTCGACTGGTCTGAGCGGGGCCGTGATGCGTCACCGTCAATCCAGTGGTGAGCGAACCTTCAGGGTACGGCATCTTTTCCTGTTCATTGGCGCTGATCCGAATACGGCATGGTTACGCTCCGATGTGGACGTCGACGACAAAGGATTCATCCTGACGGGGAGGGCTTCTTCTGGACGCACATCCGTTCTTCCGCTCGAGACGAGTCTTCAAAACGTGTTTGCGATTGGTGACGTCAGGGCAGGGTCCACCAAGCGTGTCGCCGCCGCCGTGGGCGAAGGTGCCGCTGTAGTCTCTCAGATTCATGAGTGCCTTGGGCGGCAGCCCGGTTGAATTAACCCGGATCAATCCGTCCGGCACACTGAAGACTGATGACGGCAGACGGTTGCTTCGTTTCCTCCCTTGTCGTTGATCCGTCGGCCGCCGTGACAACACGCTGCATCGTGCGGTGTCCTCTCCGGTCATATTCAAGAGGTTCTCAATTCTGCGCGCGGACAAAACCTTCCTTTCGCTTCGCCCGATATGTCCGTGGACCAAGGCAAACTCCAGTCATTTCCGGTCGTGGTGGTGCAATCGGCCACTTTCTATGGAGTCTCCAAGCTTGATAAGGATATATCGTGCTCCCGTAAAACGGCGACGAAGAGACTTCGCATCTGCCTGTTCCCGGGGCCATCGGCAGAGGTAAGCGCATTACCCAAATTCCGCAGCCCCGATATCATATTCCTCAATGCGAATGAGACTGCGCATTCAGGTTTGCGAGCATTGTTGCAAATTGTGAAATAAACTCGACACTATTCAATCAATTGTAGATGTGCGGCTCTCACGGCAATATCGCTGCTGTCAGACGAACACAGCGTCGCCTGAAGTTTCAAAACGAAATCGCAGGACGCGAGATGACCACGAATATCACATCTATCACACGCCGGAACGTGTTGCTGACCGCAGGTACGGCCGTTGCCGTTACAGCGATGTCTCCCGTTTTCAGCTTCGCGGCCAATAGTGATTTCGCCACTACCGCCACCGAAGGAACCAAGAGCATGAGCACTGTAACAACCAGGGACGGCACTGAGATTTTCTACAAGGACTGGGGCCCGAAGGATGCCCAGCCGATCGTTTTCCACCATGGCTGGCCGCTCTCGTCCGATGACTGGGATGCGCAGATGCTGTTCTTCGTCTCCAAGGGGTTTCGTGTTGTCGCCCATGACCGCCGTGGTCACGGCCGTTCCGCCCAGGTTGCGGATGGTCACGACATGGACCACTACGCCGCCGACGCCTTTGCCGTCGTTGAAGCGCTCGATCTGAAGAATGCCGTCCATATCGGCCATTCCACCGGCGGCGGCGAAGTTGCCCGTTATGTCGCCAAGCATGGCGAACCCGCCGGCCGTGTCGCCAAGGCGGTTCTCGTTTCCGCCGTGCCGCCGCTGATGCTGAAGACCGAAGCCAATCCGGAAGGTCTCCCGATGGAGGTTTTTGACGGCTTCCGCTCCGCGCTTGCCGCAAACCGTGCGCAGTTCTTCCGTGATGTCCCGGCCGGCCCGTTCTACGGTTTCAACCGCGACGGTGCGACCGTGCATGAAGGCGTGATCCAGAACTGGTGGCGTCAGGGCATGATGGGTGATGCAAAGGCCCATTACGACGGCATCAAGGCCTTCTCGGAAACCGACCAGACCGAGGACCTGAAGAATATCAGCGTTCCGACGCTTGTTCTGCACGGTGAAGACGACCAGATCGTTCCGATCGCCGACTCCGCCCTGAAATCGGTGAAGCTGCTGAAAAACGGCACACTGAAGACCTATCCCGGCTTCTCGCATGGCATGCTCACCGTCAATGCCGATGTGCTGAACGCCGATCTCCTGGCCTTCATCCGGTCCTAA
Protein sequences of DBSCAN-SWA_24 >CP036360|247485:250694|247485_249153_+|QBJ16616.1|DBSCAN-SWA MTAAGSREHQMFPALDPQQIATARRFADDEPRYFLPGEMIFNVGERHAPAWLVLEGSIEVLRQDGLSSSVPVTRHTTGQFSAEVSQLSGRPSLASGRAGAEGCMAVSFDAPHLRALIIGTADIGEIVMRAFILRRVALVDQGGAGSVLVGQPGSADLIRLQGFLARSGYPYVALDADADGQGRDLVHRLGILREELPLMVCPGGAILKNPTDGEAAVCLGVAQEIVSGAVYDVAIIGAGPAGLAAAVYAASEGLSVLAIDERSAGGQAGASARIENYLGFPAGISGQDLAARAFNQAIKFGVEVALPIPVIDLRVVQSSWGTGSILRLGLDGERSVDARTVVIACGARYRRPQLANLAEFEGSGVSYWASPIEAKLCEGEEVVLVGGGNSAGQAIVFLAPKVRQLHVVVRRPLKDTMSSYLIDRIGALSNVEIHVDSEIVGLEGDRSTGLSGAVMRHRQSSGERTFRVRHLFLFIGADPNTAWLRSDVDVDDKGFILTGRASSGRTSVLPLETSLQNVFAIGDVRAGSTKRVAAAVGEGAAVVSQIHECLGRQPG >CP036360|247485:250694|249857_250694_+|QBJ16617.1|DBSCAN-SWA MSTVTTRDGTEIFYKDWGPKDAQPIVFHHGWPLSSDDWDAQMLFFVSKGFRVVAHDRRGHGRSAQVADGHDMDHYAADAFAVVEALDLKNAVHIGHSTGGGEVARYVAKHGEPAGRVAKAVLVSAVPPLMLKTEANPEGLPMEVFDGFRSALAANRAQFFRDVPAGPFYGFNRDGATVHEGVIQNWWRQGMMGDAKAHYDGIKAFSETDQTEDLKNISVPTLVLHGEDDQIVPIADSALKSVKLLKNGTLKTYPGFSHGMLTVNADVLNADLLAFIRS |
2 | Orpheovirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_25 |
268944 : 269757
Sequences of DBSCAN-SWA_25
Nucleotide sequences of DBSCAN-SWA_25 >CP036360|268944:269757|DBSCAN-SWA CCTATCGAATGATCGCATCCGCGACTTCATGCAGCGCGGCCTTCACGTCAGGCTCGTCGGTCAGCGGCTCGATCATCGCCGCGCGCACGGCCTGACGGGCGGGAAGGCCGGTATCGATGAGGCTGGCGCAATAAACGAGCAGGCGGGTGGAGACGCCTTCTTCCAGATCGTGACCTTTCAGGCCGCGCAACCGGTGGGCGAGATCGACGAGCGGCTCGACGTCGCGCGCATCGAGCCCGCTTTCATGCGAAACGACGGCAATCTCCTGCTCCTTCGGCAAAAAATCGAATTCAATGGCGACGAAGCGCTGGCGGGTGCTGGGTTTCAGGCTTTTCAGCAGGTTCTGGTAGCCGGGATTGTAGGATACGACGAGCATGAAACCCGAGGGCGCTTCCAGCACCTCGCCGGTGCGCTCCAGCGGCAGGATACGACGGTCGTCGGTGAGCGGATGCAGCACAACGGCCACATCCTTGCGCGCTTCCACAATCTCGTCGAGGTAACATATGCCGCCCTGACGCACGGATCGTGTCAGCGGGCCGTCCATCCACACGGTTTCACCGCCTTTCAGCAGGTAACGACCGGTCAGATCGGCGGCGGCAAGATCGTCGTGACATGAAACGGTCGAAAGCGGCAGGCCGAGTTTCGCCGCCATATGGCTGACGAAACGGGTCTTGCCGCAACCTGTCGGGCCTTTCAGCAGCAGCGGCAATTGCCGCACCCAGGCGCTTTCGAACAGCGTGCATTCATTGCCAAGCGGTGTATAGAACGGCGTGTCCGGCAAAGGCTGCGGCGGGGGGCGGAAAATCGTATTCAT
Protein sequences of DBSCAN-SWA_25 >CP036360|268944:269757|268944_269757_-|QBJ16635.1|DBSCAN-SWA MNTIFRPPPQPLPDTPFYTPLGNECTLFESAWVRQLPLLLKGPTGCGKTRFVSHMAAKLGLPLSTVSCHDDLAAADLTGRYLLKGGETVWMDGPLTRSVRQGGICYLDEIVEARKDVAVVLHPLTDDRRILPLERTGEVLEAPSGFMLVVSYNPGYQNLLKSLKPSTRQRFVAIEFDFLPKEQEIAVVSHESGLDARDVEPLVDLAHRLRGLKGHDLEEGVSTRLLVYCASLIDTGLPARQAVRAAMIEPLTDEPDVKAALHEVADAIIR |
1 | Halovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_26 |
276791 : 277769
Sequences of DBSCAN-SWA_26
Nucleotide sequences of DBSCAN-SWA_26 >CP036360|276791:277769|DBSCAN-SWA CATGGAACTGATCTGTCCTGCAGGAACGCCTGCCGCCTTCCGCGAGGCCGTGGATGCCGGGGCGGATGCGGTCTATTGCGGTTTTAGCGATGAAACCAATGCCCGCAATTTCCCCGGCCTTAATTTTTCCCGCGAGGAACTTGCCGAGGCCATCGTTTACGCAAAAAAGCGCGGTGTACAGACCTTCGTGGCGCTCAACACCTTCATGCGTGCTGGCAATGAGGATATCTGGTATCGCGGCGCGGCCGATGCCGTGAAGGCGGGGGCGGACGCACTTATCCTTGCCGATTTCGGCCTGATGGCGCATGTGGCGGAACATCATCCGCAGCAGCGTATCCACGTCTCCGTGCAGGCCTCCGCCTCCAATGCGGATGCGGTGAATTTCCTCGTCGATGCCTTCGGCGCGAAACGCGTGGTGCTACCGCGCACCCTGACTATCCCCGACATCGCCCGGTTGGCGCGGCAAATCCGCTGCGAGATCGAAATTTTCGTGTTCGGCGGTCTCTGCGTCATGGCGGAGGGGCGCTGCTCGCTGTCGTCTTACGCCACAGGCAAGTCACCCAACATGAACGGCGTCTGTTCGCCGGCAAGCCACGTCCGCTACCGGCAGGACGGGCAGGCGCTGGTGTCGGAACTTGGCGATTACACCATCAACCGTTTTCCGGCCGGCGAGGCGGCGGGTTACCCCACGCTGTGCAAGGGCCGTTTCGAGATCGCCGATGACAGGTCCTATGCCTTTGAGGATCCGGTGTCGCTTGATGTGATGGACCAGATCGATGCCTTGCGCGAGGCGGGCGTCAGCGCACTGAAGATCGAGGGCCGCCAGCGCGGCAAGGCCTATGTGGCGGAAGTGGTTTCCACCCTGCACCGGGCGCTGGCCGCTAGCGCCGAAGAACGGGGACGGCTGCTCTCGCGCCTGCGGCTCCTAAGCGAAGGCCAGCGCACCACCGTCGGCGCTTATGAGAAACGCTGGAGATGA
Protein sequences of DBSCAN-SWA_26 >CP036360|276791:277769|276791_277769_+|QBJ16645.1|DBSCAN-SWA MELICPAGTPAAFREAVDAGADAVYCGFSDETNARNFPGLNFSREELAEAIVYAKKRGVQTFVALNTFMRAGNEDIWYRGAADAVKAGADALILADFGLMAHVAEHHPQQRIHVSVQASASNADAVNFLVDAFGAKRVVLPRTLTIPDIARLARQIRCEIEIFVFGGLCVMAEGRCSLSSYATGKSPNMNGVCSPASHVRYRQDGQALVSELGDYTINRFPAGEAAGYPTLCKGRFEIADDRSYAFEDPVSLDVMDQIDALREAGVSALKIEGRQRGKAYVAEVVSTLHRALAASAEERGRLLSRLRLLSEGQRTTVGAYEKRWR |
1 | Phage_TP(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_27 |
281038 : 286609
Sequences of DBSCAN-SWA_27
Nucleotide sequences of DBSCAN-SWA_27 >CP036360|281038:286609|DBSCAN-SWA GATGAGCGACCGTTCCATCCTGCGTTTCGAACATGTCGGCCACGCCTTTCTCGGCCGCAGCCTGTTCGAGAATTTCGATCTCGGCATAGCGCCGGGGGAAACCGTGGCGCTGCTCGGCCCCTCCGGCAGCGGCAAGACGACGATTTTACAGATCGCGGCTGGTATCATCGATCCTGTTCGCGGCCGCGTGCACCGCCATTATCGCCGGCAGGGTCTGGTGTTTCAGGAGCCGAGGCTGCTGCCGTGGATGACGCTTATCGACAATATTGCCTACGGCCTTGCGGCGGCCGGCTTGCCGAGACGGGAGCGGCGCGAAAGGGCGGGCCTTTTTGCGCTGGAGGTCGGTCTGGAGGTGGCTGATTTCGGCAAATATCCGGTGGAACTCTCCGGCGGCATGCGCCAGCGCGCCGGCGTGGCGCGGGCGCTCGCCGTGGAGCCGGACATGCTGTTTCTGGATGAGCCCTTCAGCGCCGTCGATGTCGGCCTGCGCCGTCATTTGCAGGAGCTTCTGGTGGGCGCCGCCCGTCGGCGCGGTTTTTCCACCCTTCTCGTCACCCATGATCTTCATGAGGCGTTGCTGGTTGCCGACAGGCTGATCGTGCTCTCCGGCGTCGATGGCCGAGTCATTGCCGCGCATAAGCCCGCGGGTTCGCCGGGCCGGCGCATGGCGCGTACGGTTTTCGACGAAGCGGAACGCCTGTCCGAGAGCCCGGCCTTCGCCGAATTGTTTTCAGCAAGGGAGCGGGTGCGATGAGGCAGCCCCCCATCCTTTCCGAAGCCCTGCGGCTGTTTTTCCCTCTGGCCGCACTGCATGGTGCGGGCTGGCCCCTCCTGTGGATCGTCATCGGTGGTTATGCCCTGCCTTTCGCCGATGCGGTGCCTGCATCGCAATGGCACGCCCATGAAATGATCTACGGCACCTATGGCATGGCGCTTGCCGGTTTCCTTGGCTCGGCGGTGCCGGAATGGACGGATACGAAGCCGGCGCAGGGACGAACGCTGCTTCATCTCGCCGGTCTCTGGTTGCCGGGCCGCCTTATCGGCTTTCTCGGGATGGAGGCAGGGAGCCTGTTTGCAGGTTTTTTCGATCTCGCCTTTCTGCTCGCGCTCTCTGTTCTCATCGCCAGGGCGATGCTGGCGCGGCGCACGATGAAACACCTGGCGTTCCTTATCTGGCTTCTTCTGTTTACGGCGGCTGAAGCCGGTGTGCGTTATGCCTGGTGGAGCGGCGATCTCGAACTTGCATCCCGCATGCTGGAGGCGGCGCTGTGCATTTTCACCGTGCTGTTTTCGCTTTCCGCCGCACGTATCAATGTGGTCGTCATCAATCTGGCGCTCGATCCCGGCGGCGAGACCACGCCTTACCGGCCGCATCCCGGCCGCCAGCATATGGCAGGCGCCATGGTGACGCTTTACATGGCGGCGAAACTTTTCTTTCCGCAGAGCGATGTCTGCGCCTGGCTTTCGCTTGCGGCCGGCGCCGCCTTTTTCGACCGGCTGGCCGAATGGTTCATCGGGCGTGCCGTCTTTAAGACCGAGGTGCTGTTGCTCGGCCTCGGCAACGCCTTTGCGGGCGTGGGTTTTCTGGCGCTCGGCGCGACACGGCTCGGTTTTTCCGTTACGCCCGCCGCAGGGCTGCATCTGCTCTCGGTCGGGGCGCTTGGATGCGCCATCATGGCCGTCTTCATCATTGCCGGGCTGCGGCATACGGGCCGCAATCTGACGCATCTTCCATGGCAGGCGCATGTGGCCGCCCTTTTGATGGCGATGGCCGGGCTGGTCCGCATCCTCCCGGAATTCGACTTTGCCGCAACATTGTCTCCCTATCACCATGGCCTCAGCGCCGTTTTGTGGGCGGTGAGCTTCGGGGTCTGGTTGCAGAGTTTCCTGCCCTTCATGCGCGCGCCTGGCATGGACGATGCAGGAGCCTGCGGATGAGAGCGAGACACTGTTTGCCCGTCTAATGTTTATTTTTTGAGCAGAAGCACAATGCGAAATCAGTGATTTAGGCGAGCCGTCTTGATGAATGTTAATTTTGATAAATAATCAATAAAATCAAGTGGTTGAATATTTCTGTCTTTTGGGGGATGATGACTCGAAACGAATAGGAAACGCGAATATGCCTAAAATAGTCGCGCCTCAACACGCAGATGAAAAGCCAGGTCGGACGAGGGAACTTGTGACCTTCGCCGTTCTGGCCTTCGGAATCTGGCCCATTCTGGCGGTCGGATTTGTTGGAGCCTATGGCTTTATCGTCTGGATGTTCCAGATCATTTACGGCCCGCCGGGGCCGCCCGGACATTGAGGGCGAGATGATGATGGAAGTCTCTCTCAGCCGGCGCGATTTCCTGCGCGGCGGGCAAAAAAATAGACCACGCATCTGCCCCCCCGGGGTCGCGTTGAGCGACCTCGCCGCGTGCAGCGGTTGCGCAAAATGCGTCGAGGCCTGTCCTACCGGCATCATCGCCATGGCGGACGGTTTGCCTTGCGTGGATTTCTCCGCCGGGGAATGCACCTTTTGCGGCAAATGTGCTGAGGCCTGTCCCGAGCCGGTCTTTGCTGCCCCCACGGCACAGCGCTTCGGTCACGTCACGGCGATCGGTGAGGGGTGTCTCGCCTTCGGCAATATCGATTGTCAGGCCTGCCGCGATGCCTGTCCGACCGAGGCCATCCGGTTTCGGCCCCGGCGCGGCGGCCCCTTCGTTCCGGAACTGCTGGAAGATGCCTGCACCGGCTGCGGGGCCTGCGTGTCCGTCTGCCCGGCTGGCGTGATCGAGATCAAAGATAGAGCCACGGAGATGCAATATGCCTGAAAATACAGGCCGGTATCATGTTTCAAGCGCCGTGGTGGCGGTGATGCCGCAGATGCGGGACGCCGTGCTGGCAACGCTTTCGACGCTCGACAATGTCGAGGTTCATGGCGAGGGCAACGGCAAGATCGTCATCGTCATAGACGGCACGAGCACCGGCATGCTGGGCGATACGCTCACTTATATTTCGACGCTCGACGGTGTGATTGCCGCCAACATGGTTTTCGAACACGTCGACACAGAGGAGACAAGCGGCGATGAGCAGCGAACTGACGCGGCGTGATCTATTGAAGGCCCATGCCGCCGGCATTGCGGCGGCAACGGCGGGCATTGCGCTGCCGGCCGCCGCCCAGCCGGTGCCAGGCGGGGTTTCCGCATTGCAGATCAAGTGGTCCAAGGCGCCCTGTCGTTTCTGCGGCACGGGCTGCGGCGTCATGGTCGGCGTCAAGGAAGGCAAGGTCGTCGCCACCCATGGCGACATGCAGGCGGAGGTCAATCGCGGCCTCAACTGCATCAAGGGCTATTTCCTGTCCAAGATCATGTATGGCAAGGACCGGCTGCAAACTCCGCTTCTGCGCAAGAGAAACGGCGTCTACGCCAAGGATGGCGAGTTCGAGCCTGTGAGCTGGGACGAGGCCTTCGACGTCATGGCCACGCAGTGCAAGCGGGTGTTGAAGGAGAAGGGACCGACGGCCGTCGGCATGTTCGGCTCCGGCCAATGGACGATCTTCGAAGGTTACGCCGCAACCAAGCTGATGCGTGCCGGCTTCCGCTCCAACAATCTCGATCCTAATGCCCGTCACTGCATGGCGTCTGCTGCCTATGCCTTCATGCGCACCTTCGGCATGGACGAGCCGATGGGCTGTTACGATGATTTCGAACATGCCGATGCCTTCGTGCTCTGGGGTTCGAACATGGCGGAGATGCATCCCATCCTGTGGACGCGCATTGCCGACCGGCGTCTGGGCTTCGACCATGTGAAGGTGGCAGTGCTTTCGACCTTCACTCATCGCAGCATGGACCTTGCCGATATTCCGATGGTCTTCAAGCCGGGCACGGATCTCGTCATCCTCAATTACATCGCCAACCACATCATCAAGACCGGACGCGTCAACGAAGACTTCGTGAGGAACCACACGAAATTCGTGCGCGGTGTCACCGATATCGGTTATGGCCTGCGGCCCGACAATCCGGTTGAGGTGAATGCCGCCAATTCCGCCGATCCAACCAAGACCGAAGCGATCGATTTCGAGACCTTCAAGGAATTCGTCTCCGAATACACGCTGGAAAAGACCGCAGCCATGACCGGCGTTGAAGCCGGTTTTCTGGAGGAGCTGGCCGAGCTTTATGCCGACCCGAAACGCAAGGTCATGTCGCTGTGGACCATGGGTTTCAACCAGCATGTTCGCGGCGTCTGGGCCAACCAGATGGTCTATAACATCCATCTTTTGACGGGTAAGATTTCCGAGCCGGGTAATAGCCCGTTCTCGCTCACCGGCCAGCCCTCGGCCTGCGGCACGGCGCGTGAGGTGGGAACCTTCGCCCACCGCCTGCCTGCCGACATGACGGTGACCAACCCCGAGCACCGCAAACATGCCGAAGAAATCTGGCGCATTCCCCACGGCATCATCCCGGAAAAGCCGGGTTATCACGCCGTCCAGCAGGACCGCATGCTGCATGACGGCAAGCTGAATTTCTACTGGGTGCAGGTCAATAACAACGTACAGGCAGGTCCCAACACCAAGAACGAGACCTATCAGGGATATCGCAACCCGGAAAACTTCATCGTTGTTTCCGATGCCTATCCGACCATCACGGCTATGAGCGCCGACCTCATCCTGCCCGCCGCCATGTGGGTGGAAAAGGAGGGGGCCTATGGCAATGCCGAGCGGCGCACCCATGTCTGGCACCAGCTTGTCGAGGCCTCGGGTGAGGCGCGTTCCGATCTCTGGCAGCTGGTGGAATTCTCCAAGCGCTTCACCACGGATGAGGTGTGGCCGGCGGAGATACTGGACGCCAATCCCGCCTATCGCGGAAAGACGCTGTACGAGGTGCTCTTCAAGGACAGCGATGTCGGCAAGTTCCCGCTGAGCGAGATCAATGCTGAATACGAAAACCAGGAAGCAAAACACTTCGGATTCTATCTCCAGAAGGGTCTTTTTGAGGAATATGCCGCCTTCGGGCGGGGCCACGGCCATGATCTGGCGCCCTATGATGCCTATCACGAGGTGCGCGGCATGCGCTGGCCGGTGGTGGAGGGCAAGGAAACGCTGTGGCGTTACCGGGAGGGTTACGACCCTTATGTAAAGCCGGGCGAGGGCGTGAAATTCTACGGCAACAAGGACGGCAAGGCGGTCATCATTGCCGTGCCTTACGAACCGCCGGCGGAATCCCCGGATGCGGAATTCGATACCTGGCTGGTGACGGGCCGCGTGCTGGAGCACTGGCATTCCGGTTCCATGACCATGCGTGTGCCGGAACTCTACAAGGCGTTCCCGGGCGCCCGCTGTTTCATGAATGCCGACGATGCGCGAAAGCGCGGCCTCAATCAGGGCGCGGAAATCCGCATCGTGTCGCGCCGCGGCGAAATACGTTCCCGGGTGGAGACGCGCGGCCGCAACCGCATGCCGCCAGGCGTCATCTTCGTTCCCTGGTTCGATGCCAGCCAGCTCATCAACAAGGTCACGCTCGACGCAACCGATCCCATCTCCAAGCAGACGGATTTCAAGAAATGCGCAGTCAAGATAGAGCCAGTCGCATGA
Protein sequences of DBSCAN-SWA_27 >CP036360|281038:286609|284104_286609_+|QBJ16655.1|DBSCAN-SWA MSSELTRRDLLKAHAAGIAAATAGIALPAAAQPVPGGVSALQIKWSKAPCRFCGTGCGVMVGVKEGKVVATHGDMQAEVNRGLNCIKGYFLSKIMYGKDRLQTPLLRKRNGVYAKDGEFEPVSWDEAFDVMATQCKRVLKEKGPTAVGMFGSGQWTIFEGYAATKLMRAGFRSNNLDPNARHCMASAAYAFMRTFGMDEPMGCYDDFEHADAFVLWGSNMAEMHPILWTRIADRRLGFDHVKVAVLSTFTHRSMDLADIPMVFKPGTDLVILNYIANHIIKTGRVNEDFVRNHTKFVRGVTDIGYGLRPDNPVEVNAANSADPTKTEAIDFETFKEFVSEYTLEKTAAMTGVEAGFLEELAELYADPKRKVMSLWTMGFNQHVRGVWANQMVYNIHLLTGKISEPGNSPFSLTGQPSACGTAREVGTFAHRLPADMTVTNPEHRKHAEEIWRIPHGIIPEKPGYHAVQQDRMLHDGKLNFYWVQVNNNVQAGPNTKNETYQGYRNPENFIVVSDAYPTITAMSADLILPAAMWVEKEGAYGNAERRTHVWHQLVEASGEARSDLWQLVEFSKRFTTDEVWPAEILDANPAYRGKTLYEVLFKDSDVGKFPLSEINAEYENQEAKHFGFYLQKGLFEEYAAFGRGHGHDLAPYDAYHEVRGMRWPVVEGKETLWRYREGYDPYVKPGEGVKFYGNKDGKAVIIAVPYEPPAESPDAEFDTWLVTGRVLEHWHSGSMTMRVPELYKAFPGARCFMNADDARKRGLNQGAEIRIVSRRGEIRSRVETRGRNRMPPGVIFVPWFDASQLINKVTLDATDPISKQTDFKKCAVKIEPVA >CP036360|281038:286609|281038_281791_+|QBJ16650.1|DBSCAN-SWA MSDRSILRFEHVGHAFLGRSLFENFDLGIAPGETVALLGPSGSGKTTILQIAAGIIDPVRGRVHRHYRRQGLVFQEPRLLPWMTLIDNIAYGLAAAGLPRRERRERAGLFALEVGLEVADFGKYPVELSGGMRQRAGVARALAVEPDMLFLDEPFSAVDVGLRRHLQELLVGAARRRGFSTLLVTHDLHEALLVADRLIVLSGVDGRVIAAHKPAGSPGRRMARTVFDEAERLSESPAFAELFSARERVR >CP036360|281038:286609|283156_283342_+|QBJ16652.1|DBSCAN-SWA MPKIVAPQHADEKPGRTRELVTFAVLAFGIWPILAVGFVGAYGFIVWMFQIIYGPPGPPGH >CP036360|281038:286609|283349_283850_+|QBJ16653.1|DBSCAN-SWA MMMEVSLSRRDFLRGGQKNRPRICPPGVALSDLAACSGCAKCVEACPTGIIAMADGLPCVDFSAGECTFCGKCAEACPEPVFAAPTAQRFGHVTAIGEGCLAFGNIDCQACRDACPTEAIRFRPRRGGPFVPELLEDACTGCGACVSVCPAGVIEIKDRATEMQYA >CP036360|281038:286609|283842_284130_+|QBJ16654.1|DBSCAN-SWA MPENTGRYHVSSAVVAVMPQMRDAVLATLSTLDNVEVHGEGNGKIVIVIDGTSTGMLGDTLTYISTLDGVIAANMVFEHVDTEETSGDEQRTDAA >CP036360|281038:286609|281787_282975_+|QBJ16651.1|DBSCAN-SWA MRQPPILSEALRLFFPLAALHGAGWPLLWIVIGGYALPFADAVPASQWHAHEMIYGTYGMALAGFLGSAVPEWTDTKPAQGRTLLHLAGLWLPGRLIGFLGMEAGSLFAGFFDLAFLLALSVLIARAMLARRTMKHLAFLIWLLLFTAAEAGVRYAWWSGDLELASRMLEAALCIFTVLFSLSAARINVVVINLALDPGGETTPYRPHPGRQHMAGAMVTLYMAAKLFFPQSDVCAWLSLAAGAAFFDRLAEWFIGRAVFKTEVLLLGLGNAFAGVGFLALGATRLGFSVTPAAGLHLLSVGALGCAIMAVFIIAGLRHTGRNLTHLPWQAHVAALLMAMAGLVRILPEFDFAATLSPYHHGLSAVLWAVSFGVWLQSFLPFMRAPGMDDAGACG |
6 | Bacillus_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_28 |
290541 : 291318
Sequences of DBSCAN-SWA_28
Nucleotide sequences of DBSCAN-SWA_28 >CP036360|290541:291318|DBSCAN-SWA ATTATAAAAGGCGTCCTTGCTCCTCCACCGGTTCTCCACTTTGCGAAACGATCGTTGATCCGTATGGTTCGCGTGACGAAATGATCAGCGCGCCGTTCGGGAGCGGGCGGGCGAGCTCCTTTGCCTCATTCCACGGAGCCCGCATCCAGACTTCGGTTTCTTCCTTCGTCAGCAGCAGGACCGGCATGGCCTTTTCATGGATCGGCTTCACGAGGTCATTCGGATCGGTCGTCAGGAAACCGTATAGGTCGTCGGTTGTGAGCCCATCCCTGACCTTGCGGACGCTCTTCCATTGCGGCACGTGGACGCCGGCAAAAAACATCAGCGATTTCGCATCATCACGGGCAAACCAGGCATTAGGCACATTGCCACCCTCCTGTTTGCTCATCGGATCCGGTTCGGCAAAGCTTGTGACCGGGACAAGGCACCTGTGCTCGACGCCGAACCATCGCGTCCAGTGAGGGAGATTGAGTTTGCGCACATTCGTCACGCCGCGGTCCGGTTCTATGCGGATAAGTTCATCCATATCGACGGCCTGACCTTTGGCCTTCAGTTTTCCCGCTCTCGCTTCCGCAGCTTTCTTCTGCACGAAAATCGGCGAGGGCAGGCCCCATCGTGCATGAACCAGCTGCTTCTTGCCGTCCGCCGTGTTTCGGACGATCGGCCCCATCTGGTCGGGGTTCATCTGATAGGCCGGCATCAGGTTGATCAGGCTTTCGGCGTCCTGGGCCCACTTCGAGACCCAGTCCTTGTCCTCCATCCGATAAAGGTTGCACAT
Protein sequences of DBSCAN-SWA_28 >CP036360|290541:291318|290541_291318_-|QBJ16661.1|DBSCAN-SWA MCNLYRMEDKDWVSKWAQDAESLINLMPAYQMNPDQMGPIVRNTADGKKQLVHARWGLPSPIFVQKKAAEARAGKLKAKGQAVDMDELIRIEPDRGVTNVRKLNLPHWTRWFGVEHRCLVPVTSFAEPDPMSKQEGGNVPNAWFARDDAKSLMFFAGVHVPQWKSVRKVRDGLTTDDLYGFLTTDPNDLVKPIHEKAMPVLLLTKEETEVWMRAPWNEAKELARPLPNGALIISSREPYGSTIVSQSGEPVEEQGRLL |
1 | Sinorhizobium_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_29 |
296471 : 299729
Sequences of DBSCAN-SWA_29
Nucleotide sequences of DBSCAN-SWA_29 >CP036360|296471:299729|DBSCAN-SWA CATGAGCTACGCCGAGCTCCAGGTCACGACCCACTTTTCCTTTCTGCGCGGCGCCTCCTCTGCACAGGAGTTGTTTGAGACCGCGAAACAGCTCGGTATCGACGCAATCGGCGTCGTCGATCGCAATTCGCTTGCCGGAATTGTTCGGGCGCTTGAGGCGTCGCGCGCCACCGGGTTGCGCCTGGTCGTTGGCTGTCGTCTCGATCTGGCGGATGGCATGTCTGTGCTGGTCTATCCGATGGATCGCGCTGCCTATTCGCGCCTGACGCGCCTCATCACGCTGGGAAAATCGCGTGGCGGCAAGAACAACTGTATCCTGCATTGGGACGATATCGTCGCTTACGCCGAGGGCATGATCGGCATTCTGGTACCTGATCTGCCGGATGCCACCTGCGCCTCGCAGCTTCGCAAGATGGCTGAGCTGTTTGGTGATCGCGCTTACGTCTCTCTCTGTTTGCGTCGGCGACCGAACGACCAGTTGCGGCTGCATGAGATTTCCAACATGGCGACGCGGTTCAAGGTCAGAACCGTCGTCACCAATGATGTGCTGTTTCATGAGCCGGGCCGGCGGCAATTGCAGGACATCGTCACCTGTATCAGGCACAACACCACGATCGACGATGTCGGCTTTGAGCGCGAACGCCACGCCGACCGCTACCTCAAGCCACCGGAGGAAATGGCACGCCTGTTTCCGCGCTACGCGCAAGCACTCGCGCGAACCCTGGAAATCGTGCGTCGCTGCAAATTCTCGCTCGAGGAACTGACCTACCAGTACCCGGAGGAAGCCATCGTGCCGGGCAAGGATGCTCAAGGATCGCTCGAGCATTATGTCTGGGAATGTGCGCCCGATCGTTATCCGGAAGGCCTGCCACCGGATGTTTTAAAGACCGTTCGGCACGAGCTCGATCTCATCCGCACCATGAAATACGCGCCTTACTTCCTGACGGTGTTTTCGATCGTCCGTTTTGCCCGAAGTCAGGGCATTCTCTGCCAGGGCAGGGGATCTGCGGCCAACAGCGCTGTCTGCTATATCCTTGGCATCACCTCGATCGATCCCTCGACCAATGATCTCCTCTTCGAGCGCTTCGTCAGTCAGGAACGCGACGAGCCGCCGGATATCGATGTCGACTTCGAGCATGAACGGCGCGAGGAGGTCATCCAGTGGATCTACAAGACCTATACCAAGGATAAGGCGGCACTCTGCGCAACCGTCACCCGCTACCGGGCCAAGGGCGCAATCCGCGATGTCGGTAAGGCGCTTGGCCTGCCTGAAGATGTCATCAAGGCGCTGTCATCGGGCATGTGGTCCTGGTCGGAAGAAGTTCCCGACCGAAACATCAGGGAACTCAATCTCAACCCCAATGACCGGCGTCTGGCGCTCACCCTGAAACTCGCGCAGCAGTTGATGGGCGCGCCTCGGCATCTGGGTCAGCACCCCGGCGGCTTTGTCCTCACCCATGACCGGCTAGACGATCTCGTGCCGATCGAACCGGCGACGATGAAAGACCGGCAGATCATCGAATGGGACAAGGACGACGTTGAGGCACTGAAGTTCATGAAGGTGGATGTCTTGGCGCTCGGGATGCTCACCTGCATGGCCAAGGCTTTTGATCTCATCCGCGAGCACAAAGGGCAGGATCTTGATCTCTCGAAAATCAGCCAAGAGGATGCGGCGACCTATGCAATGATCCGCAAGGCTGACACGCTCGGCACCTTCCAGATCGAAAGCCGTGCGCAGATGGCGATGCTGCCGCGGTTAAAACCGCGCACGTTCTACGATCTCGTCGTGCAGGTGGCCATCGTGCGGCCGGGCCCAATCCAGGGTGACATGGTGCATCCCTATCTCCGTCGTCGCGAAGGCAAGGAGCCCGTCGAGTATCCGACACCCGAGTTGGAGGCGGTGCTCGGCAAGACGCTGGGCGTGCCGCTGTTTCAAGAGTCAGCGATGCGCGTCGCCATGGTCTGTGCCGGCTTTACCGGTGGTGAGGCCGACCAGCTGCGCAAATCCATGGCGACCTTCAAGTTCACCGGCGGCGTCTCGCGCTTCAAGGACAAGCTGGTGTCAGGCATGGTCAAGAACGGATATTCGTCCGAGTTCGCCGAAAAGACCTTTTCGCAGTTGGAAGGCTTTGGCTCATACGGCTTTCCGGAAAGTCACGCGGCCTCCTTTGCGTTGATTGCCTACGCTTCGAACTACATCAAGTGCCATTACCCCGACGTCTTTTGTGCAGCCCTTCTCAACAGCCAGCCCATGGGATTTTATGCTCCGGCCCAGATCGTCGGAGACGCGATCAAGCATGGCGTCGAGGTGCGGCCGGTCTGCGTCAACCGCTCGCGATGGGACTGCACGCTGGAAAGGATTGGAGGCGGCGATCGTCATGCTGTGCGCCTCGGGTTTCGGCAGGTGAAAGGGCTGGCTGTCGCGGACGCCGCACGGATCGTTGCAGCGCGCATGAACAACCCGTTCGCCTCAGTCGATGACATGTGGCGCCGGTCCAGCGTGCCGACCGAAGCGCTTGTTCAACTGGCAGAGGCCGACGCCTTTCTGCCATCGCTGAAACTCGAACGACGCGATGCGCTCTGGGCGATCAAGGCGCTGCGCGACGAGCCCTTGCCGTTGTTTGCAGCCGCAGCCGAACGCGAGGCGACGGCGATTGCCGAGCAGCAGGAGCCAGAGGTGGCGCTTCGGCAAATGACGGACGGCCATAACGTCATCGAGGACTACAGCCACATCGGGCTCACTTTGCGCCAGCATCCTGTCGCCTTTCTGCGCAAGGACCTGTCGGCACGCAATATCATTTCCTGTGCCGAGGCGATGAATGCCCGTGATGGGCGGTGGGTTTATACCGCTGGACTCGTGCTGGTGCGGCAGAAGCCGGGATCGGCCAAGGGTGTGATGTTCATCACCATCGAGGACGAGACCGGACCGGCCAACGTCGTTGTCTGGCCCACCCTGTTCGAGAAACGCAGGCGCATCGTGCTGGGATCTTCGATGATGGCGATCAACGGACGGATTCAGCGGGAAGGGGAGGTCGTCCATCTCGTCGCCCAGCAACTCTTCGATCTCTCAGGCGATCTGGTTGGCCTTGCCGATCGAGATACGGAGTTCAGGCTGCCGGCTGGTCGCGGCGATGAGTTTGCCCACGGAGGCAGCGGGCCGGATTCACGCGATCGGCCGAAGCCGGTCGTCCCAAGGGATATGTTCGTGCCTGATCTCCATATCGAAACGCTCAAGGTGAAGAGCCGGAATTTTCAATGA
Protein sequences of DBSCAN-SWA_29 >CP036360|296471:299729|296471_299729_+|QBJ16665.1|DBSCAN-SWA MSYAELQVTTHFSFLRGASSAQELFETAKQLGIDAIGVVDRNSLAGIVRALEASRATGLRLVVGCRLDLADGMSVLVYPMDRAAYSRLTRLITLGKSRGGKNNCILHWDDIVAYAEGMIGILVPDLPDATCASQLRKMAELFGDRAYVSLCLRRRPNDQLRLHEISNMATRFKVRTVVTNDVLFHEPGRRQLQDIVTCIRHNTTIDDVGFERERHADRYLKPPEEMARLFPRYAQALARTLEIVRRCKFSLEELTYQYPEEAIVPGKDAQGSLEHYVWECAPDRYPEGLPPDVLKTVRHELDLIRTMKYAPYFLTVFSIVRFARSQGILCQGRGSAANSAVCYILGITSIDPSTNDLLFERFVSQERDEPPDIDVDFEHERREEVIQWIYKTYTKDKAALCATVTRYRAKGAIRDVGKALGLPEDVIKALSSGMWSWSEEVPDRNIRELNLNPNDRRLALTLKLAQQLMGAPRHLGQHPGGFVLTHDRLDDLVPIEPATMKDRQIIEWDKDDVEALKFMKVDVLALGMLTCMAKAFDLIREHKGQDLDLSKISQEDAATYAMIRKADTLGTFQIESRAQMAMLPRLKPRTFYDLVVQVAIVRPGPIQGDMVHPYLRRREGKEPVEYPTPELEAVLGKTLGVPLFQESAMRVAMVCAGFTGGEADQLRKSMATFKFTGGVSRFKDKLVSGMVKNGYSSEFAEKTFSQLEGFGSYGFPESHAASFALIAYASNYIKCHYPDVFCAALLNSQPMGFYAPAQIVGDAIKHGVEVRPVCVNRSRWDCTLERIGGGDRHAVRLGFRQVKGLAVADAARIVAARMNNPFASVDDMWRRSSVPTEALVQLAEADAFLPSLKLERRDALWAIKALRDEPLPLFAAAAEREATAIAEQQEPEVALRQMTDGHNVIEDYSHIGLTLRQHPVAFLRKDLSARNIISCAEAMNARDGRWVYTAGLVLVRQKPGSAKGVMFITIEDETGPANVVVWPTLFEKRRRIVLGSSMMAINGRIQREGEVVHLVAQQLFDLSGDLVGLADRDTEFRLPAGRGDEFAHGGSGPDSRDRPKPVVPRDMFVPDLHIETLKVKSRNFQ |
1 | Streptomyces_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_30 |
309077 : 311234
Sequences of DBSCAN-SWA_30
Nucleotide sequences of DBSCAN-SWA_30 >CP036360|309077:311234|DBSCAN-SWA GATGACTAACCTCAATGGAAAGATTGCACTCGTAACTGGCGCCTCGAGCGGCATCGGCGCTGCCACCGCCATCAAGCTCGCGAAGGCCGGAGTCAAGGTCGGCATTGCCGCCCGCCGCACCGAAAAGCTCGAGGATATCAAGCAGCAGATCGGAGCCAACGGTGGTCAAGCTCTGGTTCTGCAGATGGACGTGGTCGATCCGGCCTCGGTCGAAGCAGGCGTCAAGACGCTGATCGATACCCACGGTGCAATCGACATCCTCGTCAACAATGCAGGCCTGATGCCGCTCTCCGATATCGATCAGTTCAAGGTCAATGAGTGGCACCGGATGGTGGATGTGAATGTGAAGGGCCTGTTCAACACGACGGCCGCCGTCCTGCCTCAGATGATCAAGCAGCGGTCCGGGCACGTCTTCAACATGTCCTCGATTGCGGGTCGCAAGGTGTTCAAGGGATTGTCTGTCTACTGCGCCACCAAGCATGCAGTAGCCGCCTTTTCGGATGGTCTGCGCATGGAGGTGGGGCCGAAGCACAATATCCGGGTTACCTGCATTCAGCCGGGTGCTGTCGCAACCGAGCTCTACGATCACATCACCGATCCCGGCTACCGCCAGCAGATGGATGACCTCGCCGGCCAGATGACCTTCCTCAATGGTGAGGACATCGGCGACACCATCGTCTTTGCTGCGCAGGCCCCGGCGCATGTCGATGTTGCCGAGCTATTCGTCCTGCCCGTGGAACAGGGTTGGTGATGCACCGCCCTTTTGGCCCCTGCCATTGCGCAGGGGTCCCGAAGGAAATCCAGATGTCCGAAGGAGATCGAGATGAAGGAACTACCCGCATCGCAGGTTTATCGCCTGCTTGAACCTGGCCCGATCGTGATGGTATCGACGCTCGACAACGGCAGGCCCAATGTCATGACCATGGGTTTCCACATGATGATTCAGCACGATCCGCCGCTCATCGGATGTGTGATCGGCCCGTGGGATCACAGCTATCAGGCGCTTCGTAACACCGGTGAATGCGTGATCGCCGTGCCCGGACTGGATCTGGCCGAAACCGTCGTCGATATCGGAAACTGTTCCGGCGCGGATGTCGAGAAGTTCGAGAGATTTGGCCTCGAGACCAAGCCCGCGGAGCAGGTCTCAGCTCCGCTTCTCGAAGATTGCCTGGCCAACATCGAATGCGTGGTAATCGATGACAGGCTGCTCGATCCCTACAACCTCTTCATCCTTGAGGCGAAGAGAATCTGGCTCAACGAAAGCCGGACGGAACGGCGAACCCTGCACCATCGCGGGGATGGGACCTTCGCCGTCGATAACGGCACGCTCGACCTGAACCACCGTATGGTCAAGTGGCGTCACCTGCCGTGAGCACCGTGCGCCAGGCCGTTGCCTCATTTGAAACCACCATGGAAGGAATATCCGATGACGAACATCGCCAACAAGATCGTCCTTATCACCGGAGCGAGCAGCGGGATAGGCGAAGCGACCGCGCGCACTCTCGCCACTTCAGGCGCTGCCGTTGTGCTGGGGGCAAGACGAACGGATCGTCTCGAAAAGCTTGCCGAGGACATTACCGCTGCCGGGGGTAGGGCAATCTACAGAAGCCTTGATGTGACTTCTCGTGAGAGCGTCCGGTCGTTCGCGGATGCGGCAGTGCAGGAGTTCGGCCGGATCGACGTGATCATTAATAATGCCGGCATCATGCCGCTGTCACCCATGGCATCTCTGAAGGTGGACGAGTGGGACCGGATGATCGACGTCAACATTAAGGGCGTCCTGCATGGCATTGCAGCGGTTCTGCCATTGATGAACAGGCAGGGATCCGGTCAGATCATCAATATCTCGTCGATCGGCGGCTTTGCCGTCTCGCCGACGGCCGCCGTTTACTGCGCCACCAAATATGCAGTTCGCGCGATCTCGGACGGGCTGCGCCAGGAGAACGACAAGCTCCGCGTCACCTGCATCTATCCAGGTGTGGTGGAATCTGAACTGGCCAACACGATCACCGATCCGGTGGCGGCGCAGGCCATGGAGAGTTATCGCCAGATCGCTTTGAAGCCCGAGGCGATAGCAGCGGCCATCATGCATGTCATAGACCAGCCTGACGAGGTGGACACGAGCGACATCGTCGTTCGACCGACGGCCAGTGCCTGA
Protein sequences of DBSCAN-SWA_30 >CP036360|309077:311234|310502_311234_+|QBJ16676.1|DBSCAN-SWA MTNIANKIVLITGASSGIGEATARTLATSGAAVVLGARRTDRLEKLAEDITAAGGRAIYRSLDVTSRESVRSFADAAVQEFGRIDVIINNAGIMPLSPMASLKVDEWDRMIDVNIKGVLHGIAAVLPLMNRQGSGQIINISSIGGFAVSPTAAVYCATKYAVRAISDGLRQENDKLRVTCIYPGVVESELANTITDPVAAQAMESYRQIALKPEAIAAAIMHVIDQPDEVDTSDIVVRPTASA >CP036360|309077:311234|309899_310448_+|QBJ16675.1|DBSCAN-SWA MKELPASQVYRLLEPGPIVMVSTLDNGRPNVMTMGFHMMIQHDPPLIGCVIGPWDHSYQALRNTGECVIAVPGLDLAETVVDIGNCSGADVEKFERFGLETKPAEQVSAPLLEDCLANIECVVIDDRLLDPYNLFILEAKRIWLNESRTERRTLHHRGDGTFAVDNGTLDLNHRMVKWRHLP >CP036360|309077:311234|309077_309827_+|QBJ16674.1|DBSCAN-SWA MTNLNGKIALVTGASSGIGAATAIKLAKAGVKVGIAARRTEKLEDIKQQIGANGGQALVLQMDVVDPASVEAGVKTLIDTHGAIDILVNNAGLMPLSDIDQFKVNEWHRMVDVNVKGLFNTTAAVLPQMIKQRSGHVFNMSSIAGRKVFKGLSVYCATKHAVAAFSDGLRMEVGPKHNIRVTCIQPGAVATELYDHITDPGYRQQMDDLAGQMTFLNGEDIGDTIVFAAQAPAHVDVAELFVLPVEQGW |
3 | Bacillus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_31 |
315966 : 324108
Sequences of DBSCAN-SWA_31
Nucleotide sequences of DBSCAN-SWA_31 >CP036360|315966:324108|DBSCAN-SWA AATGAGCAATTCCTTACGCAGTGCAGCCGTCCCTTCACGGATAATTCAGGTCCCACAGTCCATCAGCGTCGAAGCACAAGCGGCACTGTCGCGCCTGGTCGACAAGGACGGCAGCCCTATCAATGCGCGGTTCGAAATGCCGTCCCCAGAGGATTTTTCAGGCTGGATGATGATGAAAGCCGCAGTAGATGCGCATTACGCCGCCGCGGCCAAAGATCTTGCCGGGAGTCTGCAGTCTACCGTCACCACAATCGTAGTCGAGCAAGCAACGATCCATGTCGCGACGCCGTACGGAGCATTTCATGAGCGTGGCGCACTCATCGACCTGCATGGCGGCGCATTGGTGTTCGGAGGCGGTGAGGCCTGTCTTGTCAGTGCGCGACGTCAAGCTCACCAGCATGCCGTGCGATGCTACGGCGTCGATTACCGGATGCCGCCTGAGCATCCCTATCCGGCCGCTCTTGATGACTGCTTAGCCACGTACCGTCATGTCTTGGCAGGTCACTCCCCCGACAAGATAATCATCCTGGGGAGATCGGCGGGCGGCAATCTCGCGACCGCCATGCTGCTGCGGGCGAGAGATGAAGGTATGCCAATGCCTGGCAGATTAGTCTTGCTCTCGCCACAGGTTGACCTCACCGAATCCGGTGACAGCTTCCAGACCAACCAGATGATCGATCTCGTTCTGCCCCGCCCGCTAAGACCAAACAACCTGCTCTACGCCGGTGGTGCCGATCTTTCCAATCCCTATCTATCGCCGCTCTTCGGCGATTTGGCGGGCTTTCCGCCGACATTCCTGCAGACCGGCACGCGTGACCTGTTCCTATCGAACACGGTGAGGATGCATCGAGCCTTGCGAAAAGCTGGCGTGGAAACCGAATTGCACGTCTTTGAAGCCATGCCCCATGGTGGCTTCATGGGTGGGACACCGGAAGAGCAGGAACTCGAAGCGGAGATCCACCGGTTCGTCATGGCAAACTGGAACTGAGGCCGGAAACCCACAAAGTTTTGGCGCTGCCCCTCTGTTGTCCTGCTACATCCCTGGCCACATTCCGCCATCAAGTCGGAGGTTTGCTCCTGTGATATAGCCAGCACGGGGGCTCGAGAGAAATGTAATCGCATCCGCAATCTCCTCGAGGGTTCCTACCCGTCCAAGGGGCACCTGAGCAAACAACGGCAGGATTTCTCTCTCTACGTCATTCCAAGGGGCGTCGATGGCCATGCCCCTCTCAATGGCTTTCTTACGGAACGCGGTATCCAAACTGATGCTATGCACTGTTCCAGGCGAAACGGTGTTTGCGGTAATGCCTTGTGCAGCAACGTCTTTTGCGAGCGACGCCGTCATCGCGATCATGGCTGCTTTGGCGGCAGAGTAGTCCGGTCGACTTGCAGGCGGCATCAGGGCCGCCAGGCTTGAGATGTTTATGATCCGACCCCACCGCGACGCTTTCATTGCGGGCAGAACGAGCGAGACGATCCTCACAGATGCGAGCACGTTTCTGTCATAGACAGCGGCCCACGTTTCAGAGTTTGTCGAGGTCCAATCCTCCGCCGGGGCAGATCCGCCAGCATTGTTGACCAGGATGTCGATCGACCCTGCTGCGGCCTTGGAATCCCTAACAAGACGATCGACCTGATCCGACACTGTCAGGTCACCGACTACCGCGAATGCCCGACCTCCAGAGGATATGATGTCATGCGCGACTTCCTCCGTTTTCATTCTGTCGCGACCGTGGACAAGAACGGTTGCCCCTTCCTTTGCGAGGCCTCTGGCCACACCCTCGCCAATTCCCTTGCTGCTTCCCGTGACCAGCGCAACCTTGTTCTGGAGTTGTAAGTCCATGATATTTGGTCCGTGTTATCGTCTCCGGCAGATATGAGGTTCGTCGATGAAACAGGCAATTACGCACTTTAAAGTGCCTGTCGAGGACAGGGCGACATGCCTCGGAGCCGACGGTTCTGTCTCTCATGTCAGCAGGGTGCTTCGGATGATCACAGGACGTTGGAAACTGCCGATCCTTTTCCGATTGTTCGCCGAACCATCATTGCGGGCATCGCAGTTCATGAGAGACATACCTGGCATATCCCAGAAGATGCTAACGCAACATCTCAGGGAACTGGAAAATGACGGCCTCATAAGCCGACACGACTTTCAAGAGCAGCCTCCTCGAGTCGAATACTCGCTGAGTTCAGCAGGCCACGGGCTTATGCCGATTTTGATGGCGGTCAGGGAATTCTCTCGGGATTATCCTGTTGATCGGCGTCGATAACTAACTTGGGTGCCGACGCTTGGCGCGATCCCGAAAATATCGCTTGCATTATGGACCAAATGGTTCCATATAGAACTTATGGTCAAGCGCACTCAAATATCCGCGTCCGTTGGTCGCCCTAGAGAATTTGAACTCGATGAAGCAGTTCGAAAAGCAATGCACGTATTTTGGGATCGTGGATATCACGAAGCATCCCTTCCCGACTTGCTTGAGGGTATGAAACTTTCCAGAGGCAGCTTTTATAAGGCTTTCGTCGATAAGAGAGGCGTCTATCTGCGCGCCCTCGACGCTTACATTGAGGACGCGGTTCGCTCGGTCGGTGAAGCGCTTCATTCCAATCCGTCGCCCAAAGCCGCGATTTTAGAAGCTTTTTCTCAACAGGTGGATCAAGCATCGGGACAGGACGGTTTGCGTGGATGCTTTGTCGTCTTTGCAGCTGTCGAGATGCTCCCAAAGGATAAAGAGGTCGCACCACGTATTTCCCGGCTGTTCAGGCGGCTGCAAGATCTTTATGCGGCGGCGATAATAAGAGCGCAGGCCTTGGGCGAAATCGATCCCGAGCTGGATGAACGAACGCTTGCGAGGTTCCTCGTGTGTCAGATTCAGGGCATGCGGGTCCTTGCCAAAGCAGGAGCGGATCGCGCTGAGACGAGGGCCATGGTCGAACTGGCGCTGAAGGCGCTTGGCTAACTTCAATTTGGAACCAATAGGTTCCCAATTTTTATCATTTCAAACCGTCGATCGATTTTTTGGTCGATGTGGAGACACTTATGAGAGACACGAGCAATCCCGATGACAAACTCGATCCACGTCGTTGGATTGCACTGATCATCCTTTTGACCGGCGCCTTTTTGCCACCGCTTGACTTCTTCATCGTCAATGTGGCGCTGCCCTCGATCCGGGCAGATCTCAGGGCTTCTGCGTCAACGATGCAGCTGATTATCTCCGGTTACGCCACCACCTATGCTGTGATGCTTATCACTGGCGGCAGGCTTGGAGATCTCTATGGTCGACGAAATGTCTTTCTGGCTGGAATGGTCGGGTTTGCAGCCGCATCTGCCTTATGCGGGTTTGCCTGGTCTCCAGCCGCGTTGGTCGCCGGCCGTATCCTTCAGGGGTTTGCAGCGGCAATTATGGCACCGCAGGCCTTGGCTTCGATCAATGCCATTTTCCCAGACCAAGAAAAATCAAGAGCCCTCAGCTTTTATGCCCTGACATTCGGGATGGCATCGATGGTTGGTCTGTTTCTTGGTGGTGCGTTGATTGCGCTTGATGTTCTTGGTCTTGGATGGAGAGCCATCTTCCTCATCAACCTGCCGGTTATTGCAGTCGCTGCGCCGTCGGCCTTCATCATGTTGCGAGAAACCCGATCTGCGCACCCAAGCAAACTGGATCTTGGCGGAGCACTGTTGATCGCGATTGCGCTATTTGCACTGATTGCGCCGCTGATCGAGGGCCGGGAGCACGGGTGGCCGATCTGGCTTATCGTGATGCTTGCCACGTGTCTTCCGTTTTTCGTCTTGTTTTGGCGGCATGAGAAACATCTGGAATCGACGGGGAAAGATCCGATTCTTGCACCGAGCCTGCTGCAAAACCGTGGGCTACTGCGCGGGCTCCTGGCTGCCTTGTTCTTCTACGCCCTTGCGGCCTTCTGGCTGATATTCTCGGTCTATGAGCAGGGTGGCCTTGGTCGGACGCCTTTCGAGGCCGGGCTGGCGATCCTTCCTGCGGCTGTCGGCTTCGTCCTTGGTCCTTTTGCAAGCGAGCGCATCCTCAGCGTCTTCGGGAGATTTTCCGCTGGCGCCGGCATGGTGCTGCAGGCTGCAGGATTGTTTGGAACGGCGGCCTTGATTTCTACTGGCCTTCCGCAATTCCTTTTTGCTGCGCTCTTTCTTATCGGTGCGGGGCAGGGGATCGCCCTTCCAAACCTGGTGAAGAGCATCGTGCAAAGGGTAGACCGGACGCAGTCAGGATTGGCTTCTGGTCTGGTCAATTCGATGTTTCAGATTGGCGGTGCGTTGGCAGCCGCAATCGTTGGCGGCTTGTTCTTCTCGATCTTGGGGCCGGCCACAGACGTTCAATCCATTGGCCAAGCATACAGCGTTGCAGCGGTTGCAATTGCCATTTGTCTTCTCATCGCGGGATGGCTTTCCGTCAGCCTGACATCGACCCAACCGTTGCGTCGATAAGCGGCGCCATCGCAGCGCATCATCAGCCGATCGCGCGGAAACGGGGTAAAACTCCCCAGCAAAACAGAAACATTCAGAGGTAATCTCCATGAACAACATCTCCACACTACCTCTTGCCGGCAAGTGCGCGCTGGTCACCGGTGCGGCACGCGGTATCGGTGCGGCAATTGCGCTGAAACTGGCCGAGGACGGTGCCGACGTTGCGATCACCTATGAGAAGTCGGTCGAGAAAGCGGAAGCCCTCGTTGCTGAGATCCGCGCCATGGGTCGCAAAGCGATCGCCATCCATGCCGATGCCGCGCGTACAGAAGCGGCAAGGTCCACTGTCGAGCAGACCGTCGCCCGCCTCGGCAGCCTTGATATTCTGGTCAATAATGCCGGCGTCCTTTTCGCAAGTGATTTTTCCACGCAGCCACTTGAGGAGATTGATCTGCAGCTGAACGTCAATGTTCGCGGTGTCTTTCTGATCACCCAGGCAGCGCTGAAATATATTCCCAACGGAGGTCGCATCATCAGTACCGGCAGCAACGCGGGTCTGGCTGTGCCGTTCGCGGGGATCGCGGTCTATGCCGCGACCAAGTCTGCGCTGGAAAGCTTCACGCGCGGTCTCGCACGTGAGCTGGGCTCTCGGGAAATTACTGTCAATCTGGTTCGACCAGGTCCGATTGATACTGACATGAACCCTGCCGATGGCGCGCTCGCAGCCGCTATCCTGCCAAACCTCTCGATTGCTCGATACGGTAAAGCCATTGAAGTCGCAGAGGCCGTCGCCTTTCTCGCAGGTCCCGGTGCAGCCTACATCACCGGCTCAGGCATTCTCGTCGATGGCGGCATCAGTGCCTGAGCATTGATACGAGGCGGTGCCTCGCCGCTCTGTCCTGTAGGTGATAATGCGGAATTTAAGCGCGAATGGGCTGCGGAGTTCCCTCCGCTGCACTGTGCGCGTAAATTTGGCGAAGCGGGCGGTGGCCATCATCCAGGAATTTCTCCACTCCGTTGGTGGTTTTCCTGCGTTGCGATCGTAGCAAAATAGCAAATATGGGTAGCCTGGTTTCTGCTCAAGCGCCCGTTGAGCGGTAACAGACCTAACGTATGAACATGGAGAAGAATTTGAAGACCATCGGCATACTTGGCGGAATGTCCGCGACGTCAACCCAGATTTATTACCGAGAACTCTGCAGGCTTACCCGCGAGAGCCTGGGAGGACTCCATTCTCCCGAGCTTCTCATACGGTCTGTCGACTTCGACGAAATCGAAAAGTTGCAGGCATCCTCCGATTGGGACGCGGCGGGCCGAATCCTCAATGAACATGCTATCGCGCTGCAGCGCGGAGGTGCCGACCTCCTCATTCTGGCGACGAATACCATGCACAAGGTGGCGGACAAGATCGTGGGCAGGGTGTCGATCCCGTTCGTGCACATCGCGGACGCGACTGCAACCGCTATCCTCGACCGCGGTTTCCGGAGACCGGGTTTGATGGCAACGGCCTTCACCATGGAGCAGACCTTCTACACCGACCGCCTCATCGCTCAGGGGCTGTCGCCAACGATCCCTGAAGCTGATGATCGAACAGAAACCCATCGTATTATCTATGAGGAGCTGTGCCGCGACATCGTTCGCGAGGAGAGCCGCTTGACCTATGAACGGATTGCGCAGCGTCTGGTCGACAAAGGCTGCGACTGTCTTATCCTCGGCTGTACCGAGGTGGGTATGCTGTTAAACCAGGATAACGTAGGCGTTCCCGTCTTCGACACGACCCTCATTCATTGCAAAGCCGCCCTTCAAACCGCTTTACAATGACATGTTGTGTTCTGACGGTCGATCGATCGTCATAACGGTGAGGGGCAATGGTCAAACCTGTTCGCCCAGGTGATGCCTGACCCACTTCGCTTCTTCCGATGCGGTACGCCCGAAATGCCGTCTGAAGTCGCGGCTGAATTGCGCCGGGCTGACATAGCCCACTGACGCCGCCACCTCTGCAATTGTCGCCGTCTGGCGGGCGATCATCAGTCTCGCTTCATGGAGCCGCATCGCCTTCACGTATTGCATCGGACTTGAGCCTGTCAGCTCCTTGAAATGGACATGGTATGAAGGAATGCTCATGCCCACCGCTCTGGCAAGGTCCGCGACGGTGATTTCAGAGCCGTAGTTTTTGCGCAGCCATGCCAGGCTCTCGATCAGTTTTCCCGACGTCCCTTTCCGCTGCAGGGCGGCGATCACTGCGCCGCCTTGCGGCCCTGACATGACCCGGTAATGCAACTCACGCAGGATGGAGCCGCCGAGGACGGCGACCTCAACCGGACTGCCGAGCACCGTCAGCAAGCGCAGCAGGACGTCTTCGATAGTGCTGTCCATCCTACTCGAGAAGAGCCCTTTTGGCTTGACGCTTGCAGGACCCGCGAGCTTTTCAAGCTGCGTGGCGATCTCGGCCGCCATCTGCATGTCGAACTCTAAATAGACCGCGAGCAGTGGCCGCCGTGGACTGGCTGTAGATGTCATCCTAAAGGGAACGGGCACCGAAACGGCAAGATAGTGGTGCTCGTCATAGAGGTAGATTTCCCCGTCCAGAATTCCGTGTTTGCTCCCCTGCAATACGAACACGGCGCCCGGCTTATAAAGAACCGGAATGTCGTACAGCACTGCTTCCGTGCGTAGGATATGGACACAACTGAGCCCGGTCTGGTTGTAGCCAAGACGAGGAGCGAGTTGTCCGGCCAAGGCGGCAGACTGGTCATGTGTAGAGGGCGGCATTTTATAGTTCATAGGAATTGGCAAAAATTTAAGAGGATTCGCAATCGAACCCGCGTTGTCTCCAGATTATCTGTCACATCAACGGGTAATCAAGGAGTTTCAGAAATGTCGAGCAAAACGTTTTTCATAACAGGGGCGAAATCGGGTTTTGGGCTGGCGATCGCCACTGCTGCAATCGAAGCCGGTCATACCGTCATCGGGACCATCCGCTCCGAAGCCTCGCGCGAAGCGCTGGCAAAGACCCTCCCGGAACTGCGCCCGGTCCTCTGCGACGTCACCGATTTTGATCGTATTCCGGTCGCGGTGCAGCGAGCGGAGGAAGAGCACGGCCCAGTCGATGTGCTGATCAACAATGCCGGATATGGGCACGAGGGTGTGCTTGAGGAATCGCCGATCGAGGAGATGCGCCGCCAGTTCGACGTGAATGTGTTTGGCGCCGTTGCAGTCGCCAAGGCGTTCCTCCCGAGGTTCCGTGAGCGCCGAAGCGGCTTTATCGTCAACGTCACGTCGATGGGAGGCATGATCACCATGCCCGGCATCGCCTATTACTGCGGCAGCAAGTTCGCGTTGCAGGGTATTTCAGAGGTCATGCGGTCGGAAATGGCGCCGTTTGGTGTGCACGTAACAACCCTTTGCCCCGGCTCGTTCCGGACGGACTGGGCAGGCCGTTCTATGGTCCGCACAGAGCGTTCCATTGCTGACTATGATACCCTGTTCGATCCGATCCGCGAGGCGCGTCAGGCAGTGAGCGGCAAGCAGCTCGGAAATCCGAAAAAGCTCGCCGACGCGGTGCTGACCCTCATCGAATCTGAAAATCCCCCGCCGCAACTTCTCCTTGGCAGCGATGCGCTTAGACATGTAACGGCGCGGATCGAACGCCTGACCCAGGAAATCGAAGCTTGGGAGAGCGTGACTGTTTCCACAGACGGCTAG
Protein sequences of DBSCAN-SWA_31 >CP036360|315966:324108|317854_318235_+|QBJ16685.1|DBSCAN-SWA MKQAITHFKVPVEDRATCLGADGSVSHVSRVLRMITGRWKLPILFRLFAEPSLRASQFMRDIPGISQKMLTQHLRELENDGLISRHDFQEQPPRVEYSLSSAGHGLMPILMAVREFSRDYPVDRRR >CP036360|315966:324108|318391_318925_+|QBJ16858.1|DBSCAN-SWA MHVFWDRGYHEASLPDLLEGMKLSRGSFYKAFVDKRGVYLRALDAYIEDAVRSVGEALHSNPSPKAAILEAFSQQVDQASGQDGLRGCFVVFAAVEMLPKDKEVAPRISRLFRRLQDLYAAAIIRAQALGEIDPELDERTLARFLVCQIQGMRVLAKAGADRAETRAMVELALKALG >CP036360|315966:324108|322275_323187_-|QBJ16689.1|DBSCAN-SWA MNYKMPPSTHDQSAALAGQLAPRLGYNQTGLSCVHILRTEAVLYDIPVLYKPGAVFVLQGSKHGILDGEIYLYDEHHYLAVSVPVPFRMTSTASPRRPLLAVYLEFDMQMAAEIATQLEKLAGPASVKPKGLFSSRMDSTIEDVLLRLLTVLGSPVEVAVLGGSILRELHYRVMSGPQGGAVIAALQRKGTSGKLIESLAWLRKNYGSEITVADLARAVGMSIPSYHVHFKELTGSSPMQYVKAMRLHEARLMIARQTATIAEVAASVGYVSPAQFSRDFRRHFGRTASEEAKWVRHHLGEQV >CP036360|315966:324108|319005_320424_+|QBJ16686.1|DBSCAN-SWA MRDTSNPDDKLDPRRWIALIILLTGAFLPPLDFFIVNVALPSIRADLRASASTMQLIISGYATTYAVMLITGGRLGDLYGRRNVFLAGMVGFAAASALCGFAWSPAALVAGRILQGFAAAIMAPQALASINAIFPDQEKSRALSFYALTFGMASMVGLFLGGALIALDVLGLGWRAIFLINLPVIAVAAPSAFIMLRETRSAHPSKLDLGGALLIAIALFALIAPLIEGREHGWPIWLIVMLATCLPFFVLFWRHEKHLESTGKDPILAPSLLQNRGLLRGLLAALFFYALAAFWLIFSVYEQGGLGRTPFEAGLAILPAAVGFVLGPFASERILSVFGRFSAGAGMVLQAAGLFGTAALISTGLPQFLFAALFLIGAGQGIALPNLVKSIVQRVDRTQSGLASGLVNSMFQIGGALAAAIVGGLFFSILGPATDVQSIGQAYSVAAVAIAICLLIAGWLSVSLTSTQPLRR >CP036360|315966:324108|320512_321268_+|QBJ16687.1|DBSCAN-SWA MNNISTLPLAGKCALVTGAARGIGAAIALKLAEDGADVAITYEKSVEKAEALVAEIRAMGRKAIAIHADAARTEAARSTVEQTVARLGSLDILVNNAGVLFASDFSTQPLEEIDLQLNVNVRGVFLITQAALKYIPNGGRIISTGSNAGLAVPFAGIAVYAATKSALESFTRGLARELGSREITVNLVRPGPIDTDMNPADGALAAAILPNLSIARYGKAIEVAEAVAFLAGPGAAYITGSGILVDGGISA >CP036360|315966:324108|321534_322224_+|QBJ16688.1|DBSCAN-SWA MKTIGILGGMSATSTQIYYRELCRLTRESLGGLHSPELLIRSVDFDEIEKLQASSDWDAAGRILNEHAIALQRGGADLLILATNTMHKVADKIVGRVSIPFVHIADATATAILDRGFRRPGLMATAFTMEQTFYTDRLIAQGLSPTIPEADDRTETHRIIYEELCRDIVREESRLTYERIAQRLVDKGCDCLILGCTEVGMLLNQDNVGVPVFDTTLIHCKAALQTALQ >CP036360|315966:324108|316998_317808_-|QBJ16684.1|DBSCAN-SWA MDLQLQNKVALVTGSSKGIGEGVARGLAKEGATVLVHGRDRMKTEEVAHDIISSGGRAFAVVGDLTVSDQVDRLVRDSKAAAGSIDILVNNAGGSAPAEDWTSTNSETWAAVYDRNVLASVRIVSLVLPAMKASRWGRIINISSLAALMPPASRPDYSAAKAAMIAMTASLAKDVAAQGITANTVSPGTVHSISLDTAFRKKAIERGMAIDAPWNDVEREILPLFAQVPLGRVGTLEEIADAITFLSSPRAGYITGANLRLDGGMWPGM >CP036360|315966:324108|315966_316953_+|QBJ16683.1|DBSCAN-SWA MSNSLRSAAVPSRIIQVPQSISVEAQAALSRLVDKDGSPINARFEMPSPEDFSGWMMMKAAVDAHYAAAAKDLAGSLQSTVTTIVVEQATIHVATPYGAFHERGALIDLHGGALVFGGGEACLVSARRQAHQHAVRCYGVDYRMPPEHPYPAALDDCLATYRHVLAGHSPDKIIILGRSAGGNLATAMLLRARDEGMPMPGRLVLLSPQVDLTESGDSFQTNQMIDLVLPRPLRPNNLLYAGGADLSNPYLSPLFGDLAGFPPTFLQTGTRDLFLSNTVRMHRALRKAGVETELHVFEAMPHGGFMGGTPEEQELEAEIHRFVMANWN >CP036360|315966:324108|323280_324108_+|QBJ16690.1|DBSCAN-SWA MSSKTFFITGAKSGFGLAIATAAIEAGHTVIGTIRSEASREALAKTLPELRPVLCDVTDFDRIPVAVQRAEEEHGPVDVLINNAGYGHEGVLEESPIEEMRRQFDVNVFGAVAVAKAFLPRFRERRSGFIVNVTSMGGMITMPGIAYYCGSKFALQGISEVMRSEMAPFGVHVTTLCPGSFRTDWAGRSMVRTERSIADYDTLFDPIREARQAVSGKQLGNPKKLADAVLTLIESENPPPQLLLGSDALRHVTARIERLTQEIEAWESVTVSTDG |
9 | Trichoplusia_ni_ascovirus(40.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_32 |
330294 : 331143
Sequences of DBSCAN-SWA_32
Nucleotide sequences of DBSCAN-SWA_32 >CP036360|330294:331143|DBSCAN-SWA CATGATCCCAACATTGAAAGGGTTTACCCAGCAATTGGTGCCGGTGAACGGCATCAAGATCAATGCCGTCACGGGCGGCTCCGGCCCCCCGATCCTTCTGCTTCACGGTTGGCCGGAAACATGGTGGGAATGGCACCATGTGATGCCGCTGCTGGCGGAGCAGTTCAGCGTGGCGGCGATTGATCTACGCGGCGCAGGCTTTTCCGAATGCCCGCTTGACGGCTACGATAAGGCGACGATGGCGCGCGACGCGCACGAGGTCATGATTACCCTTGGGCATCAACGCTATGCCGTATGCGGCCATGACATCGGCGGAATGGTAGCGCTTCCGCAGGCCGCTATCTATCGAGATGCGGTTACTCACCTTGCCGTCCTCGATGTTCCGCTTCCTGGCTGGAGCCAGTGGGAGGCGACGACGGCGAAGATCTGGCACTTCGCGTTCCATGCTAATCGGGATCTGCCGGAACGCCTGATTTATGGCCGCGAACATGACTATGTTTCGACCTTCATGGCGGAGCGGTTTTACGATCACAGCATCTTTAATCCTGAAGATATCGACATTTATGCGAAAGCAATGGCACTTCCTGGTCGCACGCGCGGTGGCATGGAATGGTACCGCACACTGACCGCCGACCATGCGGCCGCGCTTGAATACAAAAAACAACCGCTCCGGATTCCGGTTTTAGGTCTCGGCGGCGAGCAGCGTTTCGGTGCTCATATGGTCGCGATGCTCAGGGAGTTCGCCACAGATGTAACTGGCGGCTCGATTGCCCGCTGCAGTCACTATGTGGCGGACGAGCGCCCGGATGAAGTGGCCGCTGCTCTCATCGATTTCCTCAAGGCCAAGTGA
Protein sequences of DBSCAN-SWA_32 >CP036360|330294:331143|330294_331143_+|QBJ16699.1|DBSCAN-SWA MIPTLKGFTQQLVPVNGIKINAVTGGSGPPILLLHGWPETWWEWHHVMPLLAEQFSVAAIDLRGAGFSECPLDGYDKATMARDAHEVMITLGHQRYAVCGHDIGGMVALPQAAIYRDAVTHLAVLDVPLPGWSQWEATTAKIWHFAFHANRDLPERLIYGREHDYVSTFMAERFYDHSIFNPEDIDIYAKAMALPGRTRGGMEWYRTLTADHAAALEYKKQPLRIPVLGLGGEQRFGAHMVAMLREFATDVTGGSIARCSHYVADERPDEVAAALIDFLKAK |
1 | Tupanvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_33 |
334459 : 336471
Sequences of DBSCAN-SWA_33
Nucleotide sequences of DBSCAN-SWA_33 >CP036360|334459:336471|DBSCAN-SWA TATGCGGACGAAGAAAGGATCATTTGTGAATCGCAAAATTATCGTGCCGCTGGCAATGAAGCCCATTGTGGAGCGGGCAGGCTATGCCCCTGCAGTACGCTTCGAAAACCTGGTTTTCTGCGCGGGGCAGGTTGGACGTGATACCAACATGAATGTTATCAACGACCCAGAGAAGCAATTCGAGGCGTGTTGGGACAACTTAGCTACCGTTTTGTCAGCGGCAGGTTGCAGCTTCGAAGATGTTGTCGAAATGACTACGTATCATGTCGGATTGCAGCAGCACATACATACCTTTCGCAAGGTAAAGGACCGTATTTTCCCCCGCGGAACATGCGCGTGGACTTGCATTGGCGTCTCTGAACTTGCCCATCCCGGTCTTCTTGTGGAAATCAAAGTTGTGGCAGCTATACCTTCGCACCCGTTGGAATAGAGCGGCAGTTGATCCCCGCCGAAGTCGAAAAGTGCTCAAATTCGACCGGACAGCTGGTGTAGAAGACATAACAAAACGACGCAGCTGGTGGACGACCAGCCGCGGTTTGTATCGCGAATGCCGCGGCTGCGACAGGTTTGTATTCTCTCCATAATATTCCTTGGTCGGCTGGAATTAGCGGTTCCATAAGCTCCGGTACGCCGCACCACCGATCCGCATACGCTTAGTAGTCGAAGTCTCAGTCACTTAGAGGTCTGATGAGCAGATCGCGAGTTGGTATCCGTTCAAGAGCGAAGACGCTCGCAAGGCATCTTGCTCGAGCTAACTGTGGTTCATTCGCAACGACGGCTTTGCGCCTTCGAAGCAGTCGTTGTTCGTTATCCACGGATAAAGCTGCTTTGCCCCTTCATTCCGGACGTAAGCATAGCAATTACACGGTCCCTAAAGCAGTCTTTTCGCTTATCAGGCCTTCAACGATGTTAAGGGATGGGAAATTGACGGGCAGGTTTCGGACGACGGCGGCGGGAAAGCTGCATCCCACCGAGCCAGCGAGCTTTGGGCGGCAGTGCACTTTACATCGCCACTCCTTCATTCCTGCCGCTCCTCAGGAGTGAGCATCGCGTGTATCGATGGAGGGTCAAATTAGCCCTCGTTCTTTGATATTAACGGTTTTACAGAACGGCAAATATCTGTGCGGGTTGGCTTCAAAGCTATCCCGCCCCCAAATGACCTTAATTGAAGGATTGCTAAACAGACGAGCGGGCCTCATCATCATGACCTCGGCAGCAATTACACCGGGCGCTGCCGCCGGGGGCTTTAATGCCGCGCCCGCAATCTGAAAGGAGGACACCATGTCTTCCAGCGTCCACGCCGTCGATTTCGATCCCGGCAGCAATCGAATTCTCAACCAGATGCTTCATAAGGCCGGATATGTAACGGCGAGGATGAAACCGCTGTCAGGAGCGCTTTCAAGTCAATCGCGCACTGTGTTGCTCCGGTTCATGTCCGGAGTTATACGCCAGCGAGACGGTGGTTTTCCGTCGGGACCGGAGTCCATTTCTGCTTTCGGACGACCGTGATTCCATAGGTGAACCTAGTGTCGGGTTACGCTCCCAGCTCCGTTTCCGCAGTCTAGGCTGTGAAACGGCCTCGGAAATTACAACTAAATGCTGAGAATATTGGCGAGAAAAAAACTAAAAGTTGACAATCGCATATGAGCGACCTATATGTTGATTGTCGATATTTGCTTGCTTAGCTTTGGTTATCGCGCGTCTGGCACCGTGCCCTGAACGAGCCTTTTCTCTCAAAAATCGCTATCGCATCTGCAGCGCCGCAGGCGTTGTGGTCCTCTTATTGCTATGAAAGGGTTCATCATGACCACTGGCACAGTTAAATGGTTCAATTCCACGAAGGGCTTTGGCTTCATCCAGCCTGACAATGGCGGCACCGACGCGTTCGTGCATATCTCGGCCGTCGAGCGCGCCGGAATGCGCGAACTCATTGAAGGTCAGAAGATCGGCTACGACCTTGAGCGCGATAACAAGTCGGGCAAGATGTCGGCTTGCAACCTCCAGGCCGCTTAA
Protein sequences of DBSCAN-SWA_33 >CP036360|334459:336471|335739_335967_+|QBJ16704.1|DBSCAN-SWA MSSSVHAVDFDPGSNRILNQMLHKAGYVTARMKPLSGALSSQSRTVLLRFMSGVIRQRDGGFPSGPESISAFGRP >CP036360|334459:336471|334459_334888_+|QBJ16703.1|DBSCAN-SWA MRTKKGSFVNRKIIVPLAMKPIVERAGYAPAVRFENLVFCAGQVGRDTNMNVINDPEKQFEACWDNLATVLSAAGCSFEDVVEMTTYHVGLQQHIHTFRKVKDRIFPRGTCAWTCIGVSELAHPGLLVEIKVVAAIPSHPLE >CP036360|334459:336471|336261_336471_+|QBJ16705.1|DBSCAN-SWA MTTGTVKWFNSTKGFGFIQPDNGGTDAFVHISAVERAGMRELIEGQKIGYDLERDNKSGKMSACNLQAA |
3 | Pandoravirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_34 |
344619 : 345381
Sequences of DBSCAN-SWA_34
Nucleotide sequences of DBSCAN-SWA_34 >CP036360|344619:345381|DBSCAN-SWA TATGAATATCTCTTTCGAAAACAAGGTAGCCCTGGTCACTGGTGCAGCCTCCGGCATGGGCCTTGCCGCAGCAAAAGCCTTCGCCGAGGCCGGAGCGGCGGTTGCTCTCGCCGACGTCAATGAAGAGGCAGTGCGTGCTGCGGCTGAAGCTCTGACCTCTTACGGTTACCGGGCGATCGCCATCCAGTGCGACGTCGCCGTCATGGAGCAGGTCGCGGCCATGGTGGATCAGACGGTCGCAGAGTTCGGGCGTCTAGACGCGGCTTTCAACAATGCCGGTGTACAGAGTCCCGTCGCCGAGACTGCTGACGCGGACCCCAAGGACTACGATTTCGTCATGGGGGTCAACCTGAGGGGTGTCTGGAATTGCATGAAGTATGAACTTCTGCAGATGCGCAAGCAGGGTTCCGGCGCGATCGTCAATAACTCCTCCCTCGGTGGTTTGGTCGGGATCGCTGAACGCGGCATCTACCATGCCTCGAAGCACGGCGTTGTCGGACTGACCAAGAGTGCCGGCCTCGAATACGCGCCTAAGGGAATCCGGATCAATGCGATCTGCCCAGGCATTATCGAAACCCCGATGGTGACCGGAATGCTGGAGACACAACCGGACGCTATGTATGCTCTGATGCAGGTTGTCCCCATGAGCAGGCTCGGCAAAGCTGAAGAAATCGCCGACGCGGTCCTGTGGCTGTGCAGCGACGCGTCCAGCTATGTCGTCGGACACGCTCTTCCCGTCGATGGCGGTTATACCGTCCAGTAG
Protein sequences of DBSCAN-SWA_34 >CP036360|344619:345381|344619_345381_+|QBJ16715.1|DBSCAN-SWA MNISFENKVALVTGAASGMGLAAAKAFAEAGAAVALADVNEEAVRAAAEALTSYGYRAIAIQCDVAVMEQVAAMVDQTVAEFGRLDAAFNNAGVQSPVAETADADPKDYDFVMGVNLRGVWNCMKYELLQMRKQGSGAIVNNSSLGGLVGIAERGIYHASKHGVVGLTKSAGLEYAPKGIRINAICPGIIETPMVTGMLETQPDAMYALMQVVPMSRLGKAEEIADAVLWLCSDASSYVVGHALPVDGGYTVQ |
1 | Trichoplusia_ni_ascovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_35 |
351079 : 355189
Sequences of DBSCAN-SWA_35
Nucleotide sequences of DBSCAN-SWA_35 >CP036360|351079:355189|DBSCAN-SWA CATGAGCAGCATCTCCTTGCGCGACATCCGAAAAGCCTACGTCAACGGACCTCAAGTTCTTCACGGGGTGTCGCTCGACATCGAGCCGGGAGAGTTCGTGGTCATTGTCGGTCCGTCCGGTTGCGGAAAGTCCACACTGCTGCGCCTGATCGCAGGTCTGGATAAATGTGAGGAAGGTACGATCGAGATTGGCGGAAAGCGCGCCAACGATCTTCCCCCACAAGATCGCGACATCGCGATGATTTTCCAGAACTATGCGCTCTATCCGCACATGACGGTCCGCGACAACATCGCGTTCGGCCTGGAGCTGCGCGGGATGAGCAAAACAGAGCGTAATGAGAGAGCAGAAAGGGTCGCCAGAACCCTTCAGCTGCATGCTTATCTGGATCGTAAGCCCGCCGCGTTGTCGGGAGGTCAACGCCAGCGTGTCGCCATGGGCCGAGCGATGGCGCGTAATGCCGCAATCTTCCTGATGGATGAGCCGCTTTCCAATCTCGACAATTCCCTTCGTATCTCAATGCGCACGGAGATCAAGGAGCTTCATCGGCAATTGGGCGCAACGATCATTTACGTCACCCATGATCAGACCGAGGCGCTCTCGCTTGCCGATCGTATCGCCGTCATGAGGGACGGGCATCTATTGCAATTCGATCGCCCCGAGGTGATCTACGACCGTCCCTCGAACCGTTTCGTAGCCAGTTTTCTCGGAAGTCCCCCGATGAACTTCCTCGCATCCAACAGCCTTCCAGGCTGGACGGGAGCAGGGAAAGTGACTGTGGGACTGCGGCCTGACTTGCTCACCGTCCACCATGAAAAGCCCAACCAACCAGCTCTTCCTGGACGCCTCCTGCTCTCCGAAATGACAGGATCAGACATGCTCCTCCATTGTGAGACTCCAGCGGGCCGCCTGACCGTTTCCGCCCCCCGCAAGACGGTGACAAGGGAGACTGAGCAGCTTTGGATCGGCTTCGATCTTGACCGTGCTCTGTTCTTTGATCCTCAAAACGGCGACCGCATCGATTTGCCTGCCAGTGGCCACACGGAAGGACGAGGGGCGGCGAGTTGATGAACAAACAAGATGCGCCATTGGCACTCAGCGATCTTATCAATCGCTATTCGACGAAGCTGACAGAGGCCGACACGCGCCTTCTTGACGTTCTGTTACAAGATCCGATCCGTGCGGCCATGGAAAACGGAAAGGATGTTTCTTCTCGCGCGGGCGTTCATCCAGCATCGGCCGTGAGGCTGGCCCGTCGCCTGGGGTTCAAAGGCTATCCCGAGTTCAGGAGTTTCCTGCAGTCCAGTTTAACGGAGGGGGAGGGAGACTTCGAAAGCCCTGCAGCGCGCATGGCTGCGCGGCTGGTGCGGGCCGAAGACAACGGTCTGCTCGCGTCGGTCCTGGACAGCGAGATAACGGCTCTTCAACAGGTCCGCCACGCCGTATCCGACGCAGATATTCGCGCGTTTTCCGCAATCGTGCGCGACAGTCGCCGGATTTTTGTCTTCGGATGCGGACATTTTTCGGCGCTCGCATCGCTTGTTGCGCTTCGCCTCAATCGCTCCGGCTACGAAGCGATCGATCTGGCGAGCCGGATGCACCAGCTTGCGGAAGTGCTGGCAGCACTGACCGCGGAAGATGTCGTTTGGTTTCTCGCCTTCCGTCGGACACCTCCGCTTATCCAGGAAGTACGCGAGATTGCAAAGCGGCGAGGGGCAAAAACGCTGGCTGTGACGGACGTCGGGGGCACGAGGATCGATCCTGCTCCCTGTCATCAAATACTGGTGTCACGAGGTAACCCAGGTGAATCGCAGTCGCTTGTTGTGCCTATGACGATTGCCAATGCGATTATCCTCGATCTGGCGTCCATCGACAACGGGCGGTCGTTGCAGTCGCTGAGCGAGTTCAGATCGTTCCGCGCATCGTCACGTCTGACTGCCGGGTGAGAGTGCACCACGGCAGAGGAGATCATGCGATGGGGGTTCCCTAGAATTCGTCCCAGGTGTTATTCGCAAGTGCAGCACTGCCGTAACCAGCAGAGACCACTGCGGCGTTCCGCGACCGCCTTGTCAGCGATCCGGAGCCAGATGCTGCTGCCATGGCAGAGGCCATGTGATGAAGTCCCGCCGTCTCATGCGACGCAGCCAAGCCACCTCGCCCCAACTGGAACTGACTGATCAACTCGCGCAACCGGACAGCTTCTGCGGCAAGAGTTGCGCCAGCAGCGTTGGCCTCCTCCACCATGGCTGCGTTTTGCTGGGTGACCTGGTCCATCTGATTGACAGCGGTATTGACTTCGGACAGGCCGACCGATTGTTCGCGGGAGGACGTGGCAATGGAGTCCATGTGCTGGTTGATCGTGACAATGTAAGCCTCGATCGTTTTGAGAACCTCGCCGGTTTCGCTGACCAGCCTGACGCCGTTGTCGACCTCATGCGTGGACTTGCTGATCAGGCTTTTGATTTCTTTCGCTGCATTGGCAGAACGCTGGGCAAGTTCCCGAACTTCCTGGGCGACCACTGCAAAGCCTTTTCCCGCTTCGCCCGCGCGCGCGGCTTCAACGCCTGCGTTGAGCGCGAGCAGATTGGTCTGAAAGGCGATATCATCAATCACGCCAATGATGTTGGAAATCTGGCCTGAAGATTGTTCTATCCGCTCCATTGCTTCAACAGCCTTGGCCACCACGGTCCCCGACTGGCGGGCACTGTCGTTTGCCTGAACTGCGGTATTTCGTGCTTCTTCTGCGCGCTTATAGGAATTGGTGACATTCACCGTAATCTGGTCAAGGGCGGCCGCAGTCTCTTCGAGAGAAGCCGCTTGTTGTTCGGTCCGCCTCGACAAGTCGTCGGCACTCTGGCTGATCTCACGCGAGCCGCTGTCAATAGACGCTGCGGCTCGCGAAACCGATCCGAGTGTCCCTTGCAACTGTTCGACAGCTGCATTGAAGTCTGTTCGCAAGCTTTCGAAATCCGGTGCGAAGCTGTGCTCAAGAGTGACCGTCAAATCGCCTGAGGCGAGACGCTTAAGCCCTGCAGCAAGCCCATTTGTGGCCTCTGCCATCGCCTCTGCACGGACACGTTCCTGCTCGGCAATTTGGCGCCGTTGCTCCTCGGTCATTTGCCGCTGTTCATCCGCCTGATTTTCGAGTTCCCTTGTCTTCAAGGCGTTGTCCTTGAAGACCCGCACGGCGCGTGCCATCGCACCGGTCTCGTCAGAGCGATCCTCACCGTCTATATCAGTTGAAAGATCACCGGATGCGAGCCTGGTCATCGTGCCCGTCAGGCGGCCGATTGCATTGCCAAGAGATTTAGCGAACAGAAAAAAGGCAGTAAGAGCTAATAGTGAGGTCAGGAGAGTGACAGCGATCATGGTCCAAAGCGATGAACGGGCCTCCGCCAAAAGCGTAGTCGCATCAGTACCGATCTCGAAGACGCCTATTTTATCGCCAGCGAAGGACGTGAACGGAACCGCTTTGATCAGCATATCGCGGTTATCAAGCACAGTTTGTTCAAACACCGACGCGCCATCGAATGCTGACCGAAGTATATCGTCCGACAGGAAGGGCTTGCCTCCATCTGTCGAGGACTGATTAACAAGTTTTCCGTCCTGGACGACATTCACGGAGATCTCAGCGCCAATGCGCTCTGCGATAGGTGTAAAATAGGCGTTGGATAGTTCCGTGCCGATGTCTACCACCCCAACGACCTTTCCGTCGGCCAAGACAGGCGCGACGGCATAAACGCCAATTCCGGTTCGGCCCGGTTCTATTCCTGCCGCCACTTTTCCCGTTGTGACCGCCGCGCCGACTGTCTTGCGGCGAGCCAGATAGTTGTCGCCGAATTTCTCAGGCGCGTGCACGCGGGCGATTACATTTCCGTTTGCATCCGCTACGGTAAAATTCTGCAACCCGCCCTGTTGTGCCACGGCCTTGATGTTAGACGAGAATTTATCGAGAAGTTTCTGGCGATCATTATTCTGGATGTAGCCGGCAATATCCGGTTCTCCGGCGAGCGTCAGCGCTAGAGCCGATGCGGAGCGCTGTATGGCTGCCATATCTGTCTGAATAACATGGAGGTCGCTTTCTGCCTTACTGTGAAGAGCTTGCGAATTCAT
Protein sequences of DBSCAN-SWA_35 >CP036360|351079:355189|352143_353022_+|QBJ16722.1|DBSCAN-SWA MNKQDAPLALSDLINRYSTKLTEADTRLLDVLLQDPIRAAMENGKDVSSRAGVHPASAVRLARRLGFKGYPEFRSFLQSSLTEGEGDFESPAARMAARLVRAEDNGLLASVLDSEITALQQVRHAVSDADIRAFSAIVRDSRRIFVFGCGHFSALASLVALRLNRSGYEAIDLASRMHQLAEVLAALTAEDVVWFLAFRRTPPLIQEVREIAKRRGAKTLAVTDVGGTRIDPAPCHQILVSRGNPGESQSLVVPMTIANAIILDLASIDNGRSLQSLSEFRSFRASSRLTAG >CP036360|351079:355189|353062_355189_-|QBJ16860.1|DBSCAN-SWA MNSQALHSKAESDLHVIQTDMAAIQRSASALALTLAGEPDIAGYIQNNDRQKLLDKFSSNIKAVAQQGGLQNFTVADANGNVIARVHAPEKFGDNYLARRKTVGAAVTTGKVAAGIEPGRTGIGVYAVAPVLADGKVVGVVDIGTELSNAYFTPIAERIGAEISVNVVQDGKLVNQSSTDGGKPFLSDDILRSAFDGASVFEQTVLDNRDMLIKAVPFTSFAGDKIGVFEIGTDATTLLAEARSSLWTMIAVTLLTSLLALTAFFLFAKSLGNAIGRLTGTMTRLASGDLSTDIDGEDRSDETGAMARAVRVFKDNALKTRELENQADEQRQMTEEQRRQIAEQERVRAEAMAEATNGLAAGLKRLASGDLTVTLEHSFAPDFESLRTDFNAAVEQLQGTLGSVSRAAASIDSGSREISQSADDLSRRTEQQAASLEETAAALDQITVNVTNSYKRAEEARNTAVQANDSARQSGTVVAKAVEAMERIEQSSGQISNIIGVIDDIAFQTNLLALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAKEIKSLISKSTHEVDNGVRLVSETGEVLKTIEAYIVTINQHMDSIATSSREQSVGLSEVNTAVNQMDQVTQQNAAMVEEANAAGATLAAEAVRLRELISQFQLGRGGLAASHETAGLHHMASAMAAASGSGSLTRRSRNAAVVSAGYGSAALANNTWDEF >CP036360|351079:355189|351079_352144_+|QBJ16721.1|DBSCAN-SWA MSSISLRDIRKAYVNGPQVLHGVSLDIEPGEFVVIVGPSGCGKSTLLRLIAGLDKCEEGTIEIGGKRANDLPPQDRDIAMIFQNYALYPHMTVRDNIAFGLELRGMSKTERNERAERVARTLQLHAYLDRKPAALSGGQRQRVAMGRAMARNAAIFLMDEPLSNLDNSLRISMRTEIKELHRQLGATIIYVTHDQTEALSLADRIAVMRDGHLLQFDRPEVIYDRPSNRFVASFLGSPPMNFLASNSLPGWTGAGKVTVGLRPDLLTVHHEKPNQPALPGRLLLSEMTGSDMLLHCETPAGRLTVSAPRKTVTRETEQLWIGFDLDRALFFDPQNGDRIDLPASGHTEGRGAAS |
3 | Bacillus_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_36 |
361038 : 368446
Sequences of DBSCAN-SWA_36
Nucleotide sequences of DBSCAN-SWA_36 >CP036360|361038:368446|DBSCAN-SWA TATGGCACAAGCCACCCAGAAAATCACCCTCAGCGCAGCCCGGGATATTCCCTTCAACAAGCTGGTGCTCAGTCAGCAAAATGTCCGCAAGATCAAGGCGGGCATCTCGATCGAGGACCTTGCCGAAGACATCGCCCATCGCGGGCTGCTCACCAGCCTCAACGTCCGCCCCGAACTCGACGGCGATGGCAACGAAACCGGCATCTACCGGATCCCGGCTGGCGGGCGCCGGTACCGCGCCCTCGAGCGTCTTGTGGCGCAGAAGCGTCTGGCCAAAACTGCCGGTGTCCCTTGCATCGTCAGCAAAGGCGAGACCCTTGAAGTCGAGGACTCCCTCGCCGAAAACGTCCAGCGCGTCAGCCTTCATCCGCTCGATCAATTCCGCGCCTTCCAGACCTTGCGTGAGCAGGGGCTCGGTGAGGAAGAAATCGCGGCGCGCTTCTTCGTTTCGGTTGCCACCGTCAAGCAGCGCCTGCGGCTGGCCTCCGTCTCGTCGCGGCTTCTCGATCTTTATGCCGAGGACGAGATGACTCTCGAACAGATCATGGCCTTCTCGATCACCAACGACCATGTTCGCCAGGAGCAGGTCTGGGATACGGTCTCCCGCTCCCATAGCCGTGAACCTTATTATATCCGCCGACTACTGACGGAGACCGCGATCCGCGCCAGCGACCGCCGCGCAGTTTATGTCGGCATCGAAGCTTACGAGGCCGCGGGCGGTGTGACCATGCGTGATCTGTTCGACCAGGATCAGGGCGGCTGGCTGCAGGATCCGGCGCTGCTCGAGCAGCTCGTGATGGAAAAGCTGAAGGCCGACGCCGAGGCGATCCGCGTGAGCGAAGGCTGGAAATGGGTTGAAGCCGCCTTCGACTTTCCCTACGGCCACACGTCCGGCCTTCGCCGCTTCTACGGAGAGCAGGCCGAGATGACCGAGGCGGACTTGGCCCACTATGACGCCACCCGTGCCGAATACGATAAGCTCGACGCAGAATATTCGGAAGCAGATGAGTACTCGGAAGCGACCGAACAGAAGCTGGAGGAGCTCGGCGCGGAGCTCGACCGGCTCAACGACCGCCCATATGTGTTCGATCCGGAGGAGGTCGCCCGCGGCGGTCTCTTCGTCTCGCTCGGCGTCGGTGGCGAGCTCAACATCGAGCGAGGCTTCGTGCGGCCCGAGGACGAACCGAAGGATGCAGCCGATCCCTCGGCTGATGTCGGCGATAGTGGCGATTATGCCAGCAGTGTTCCGGCAACCGGGGCCGCCGGTGGGGAGGAGACGCAACCGGACGACGAAGACCAAACGGTTAAACCGCTTCCTGATCGCCTGGTTCTCGATCTGACAGCGGCGCGCACGGTGGCCTTGCGCAATGCGCTGGCGAACGATCCTGTCATCGCCTTCATTGCGGTCCTGCATGCCTTCGTCCTGAAGACCTTCTATGTCTACGGCTCGGATTCATGCCTGGAGGTGACGCTGCAGAGCGCTCGCTTCTCCCAGACGCCCGGCCTCGGCGATACGGTCTGGGCGAAAGAGATCGAGCAGCGGCACGAAGGCTGGGGCCAGGATCTTCCCAAGGATCCGAATGATCTCTGGAATTTTCTGATCAGGCTCGACGAGGTCAGCCGGCAGGCATTGTTTGCCCATTGCGCTTCATTGTCGGTCAATGCGGTCATCGAACCCTGGAACAAGCGGACCCGGGCGATCGCCCATGCCGAGCAGCTGGCCAACTCGATCGGCTTCGATTTGGTGGAAGCCGGCTGGACACCGACCGTCGACAACTATCTCGGCCGCGTCACGAAGGCGCGGATCCTCCAGGCGGTCCGAGAAGCCAAAGGTGACCAGGCGGCGGAGTTGATCGGCCATCTCAAGAAGACCGACATGGCCCGCGAGGCCGAACGACTGATGACAGGTTGCGGCTGGTTACCGGAACCGCTTCGCATGACGAGTGATGACGGCATCAGCGAAAGCGATGCCCTCAGAGAGACCGTCGGTTCCGATCATGCTGAGACTGGGGAAGCTCAGGACCTGCCGGCTTTCCTGCTCGACCAGCCGGAAACATCGGGCGATGACACCGGCGATACCGAGATTGGGATCGACGGCGAGCATTTCGCGGCGGCCGAATAGCGTCACAAGCGTCCCATCAACCTGACCCGATCGCTTTTGCGGTCGGGTCTTCTTTCAAACAGCCCGGCCACCGCGCCGGGGTTTCCTGTTTCAGGAGGCACCTATGCCTGACTTCACAATCGAAACCACCTATCACCTGTCGGTCTTCCGGCATCGCACCTACGCGGCTGACACGCCGGAGGCCGCCTGCCGCGCCGCCATCGACGACGACAGCTGGGACATCGCCGCGATGGACTTCGACTCCTCCGGTCCAGTTCATATCACCGGCATCTGGGACAGCAAACTTGCTGCCTATGGGGGTCCTCCGCTCCAGATCCCGCAGCAATTCGACGAACCTGTCCAGCGCCGGGCCCGTCATTTCGAGATCCTGCTCGGACTGTTGAAGATGTTGTTGGATGACATCAGTTCGGCCCGGCCGCCATTGCCCGATTGGCTCGCCCGATCGGCCTGGGCGATTGCCCGAGGAGAAGCCATTCTCGCCGGAGAACCCGATCCCGAAGAGCCGGTCGACCTACCAAAGCCTTCCCACGTCCTGGTCAGGCTGCAAGAGGCTGGTGTACGCGACGCAATCGCCGCCGTGCTCGAAATAGATCCGAGTTTCAAAGCACTATCGCCCGAGGCGGTGACCGACGACGAGGTCCACGCCGCATGCCGTTCCATCGCCGCCACGATGGATTTTTCCGACGCGGTCGGAAGCGCCGAATTTCAGGCAGCGCTCTCGGCGATCCGTTCGGCGCATCGCCGGCTTACGTCCGATTAATTCATCTTCTTCCCCAGAATCCCTCAACCTCGCCCGGCCACCGCGCCGGGTTTCATCTTATGGAGACAACCCAAATGAATTCCCTCGCATCCACTAACCAGCCAGCTTCCTCCGGTTTCAAAGTCGATATCTCCCGCGGCGAACGGATTGGCCGCGTTTCGTCCGAATGGTTTTCGCGCCCTGATGACGAACGCTTCCTGTCGCTGAACGATCTCTACGACACGGTCCGGTCCCGCGCTGACCGCGCCCATGCCCGAACGATCGAAAGCGCCGCGATCCGCGTCGAGGCTACGCGCGACAATGCGGAGCGGCTTGAACTTCTCGTTCCAGGTCAGCGCCAGCCGATCGCACCGACCCACTGGAGCTACGGTCAGCTCTGCAGCCTGGTCGGTGCACCGTCGAGCTACATGCGGCAACTCCCCGCGCCTCTTGCCGCGATCAACCTGCAGCACGGCCTGCTCAATCACCGTGCCGAGCTGGTAAAAACGCTCGAGATGGACGATGGCCGGCTCGAACTGCGCGCGGTGACGGGACCCGAATACGGCCGCATCTGGGATCACGAACTGGTCTCGGCGGTGATGAAGATCGCCGGCAATGGGACCGGCGACACGATGTGGAAAGTGCCGGGCGTCCTGGACTGGGCGACGATGAGCCACAATCCCTTCGTCGACATCACCAAAGATACGACGACGCTTTATGCCAGCGACCGCGATGTCTTCCTGTTCCTCGTCGACGACACCCATCCGATCGAAGCCGGGCGATTGCCCAACGGTGAGCCCGACCTCTATTTTCGCGGCTTCTATGCCTGGAACTCGGAGGTAGGCTCGAAGACACTCGGCATCGCCTCCTTTTATCTGCGGGCGGTCTGCGCCAACAGGAACCTCTGGGGCACTGAGAATTTTGAGGAGATCACTATCCGCCATTCGAAGTTCGCCGCCCAGCGTTTTGCGCATGAAGCAGCACCCGCGCTGACCCGCTTTGCCAATTCGTCGCCCGCTCCGTTCATCGCCGGCATCAAGGCGGCACGCGAAAGGATCGTCGCGCGCAAGGATGACGACCGTGAGAGCATCCTGCGTCGGCGCGGCTTCTCGAAAGGCGAGACCGGCAAGGTGATTGAGATGGTCTTGTCGGAAGAGGGGAGGCCGCCGGAATCGATCTTCGATTTCGTACAGGGGATCACCGCACTGGCGCGCACCAAGACCAATCAGGACACGCGTCTCGAGCTCGAAGGAAAGGCCAAGAAGCTGCTGGAGAGCGCCTCCTGACACGTTCAAACGTACCGGGCTCAGAACCCGGCCGCGTCGGCAAATCCCGTCATCGTCATCACAGAGCCCGGTCAAGCGCCGGGCCTCATTTTGTCTGGTGCACCCAATGGCTATCCCCGATCATGCCCGCACAAACTTCGACACGCTGCTGCGCGCCGCGTCCGATGGTAATCTTGCTCTCATGGAATGCCTCGATGCCACTACACGCGAGCCGCGTTACGTGCTCTGCGGCGTCGGCCGCAGTAACGGCGAATTTTTCTTCACGCCGTTTGGTCATCTGGCCGACGGCAATCCTTACGACGCCTACCTGCCGCCGGATCGAGAGAATCCCGCTGGCTTCATCGTAAACCCGCCGTGCTAGGCGCGATCGCCGGCGGTTCAACCTGCCGATCCCCGACCCTTCCACGGCCGCCCAGATGTGGCCTCCGACCTAGTCGGCAGGACACCGGACCTTGCAGTTCTTTCCTCACTCCTGGAGCGTCCCCCTTATGAATATCATTTCGCATCCTCAAAACGTTTCCACCTCTCCGCGTCCTATGGCCAGAACCAGCGCCGGGGCACTTATTCATGGTGCCGAACAGATCAGGTCGCTGGAGCAAGGTAAAGGCATCGCCACCGCCGACCTGAGACAGGTCATGACGGAGGTTTTCGGCGGCAGCGACGCCGAAGGCCGCTGGCTCCGGAAGGATGCCTATAAACCAACAGAGGTCGCCCAGGTCTTGTTTCTATCCCGCTCAATCACCTCCCGGGTCCAGTCGCCTCAATCGGCGCGTGCGATGCTGATGAAGGCGGCCCGGCTCTCGCCGACCCACACGCGCCGATCCGAGGAGATGATCAGTTGGAAGCTCCGTTCCTTTGTGCCTGTCGCAGGTGAGGGCACGGCGATCCTGTCGAAGCTGAATGATCGTCATTACCTTGTCGATGTCGCACACCGGTCATGATGGAGGTCTGCCATGCACTCCGCGTCGGATCTGGCGGGCCGTCTCGCGCGGGACGCGGAGGCGGTCTGCCGGCACTATCTCTCCGCTGGCCGTCGTGCCGGCAACTACTGGCTCGTTGGCGATGTTAGCAACAGAAAGGGCCGGTCGCTCTATGTGCACCTCGTCGGCCCGCGCGCCGGCCGCTGGACCGACGCGGCGACAGGCCAGTTCGGTGATCTGCTCGATCTGATCCGGGAGACTTGCGGTCTCGTCGACTTCCGGGACGTTGCAGACGAGGCGCGTCATTTCCTCAGCCTGCCTCGACCAGAGCCGGTGTCCTCTCGCGGGACCGATGCTTATGATTTCGCCCCGGTGGAACGGCGCACGGGCGTGCAGGCACAGCGGCTGTTCCGGATGACGCAGCCCCTGGCGGGCACGCTTGCCGACACCTATCTGCGCGGGCGCGGCATCTTACGGGCATCGACGCATGCGGCGCTGCGCTTCCATCCGTCCTGCTACTATCGTGATCTCGTGAGTGGTCGCACGACCAGCTATCCGGCCCTGATTGCCGCCGTGACCGACTCTGCCGGCGCGATCACCGGTGTGCATCGCAGCTGGCTCGATCCTGACGGCGCCGGCAAGGCGAAGGTCGACGATCCGCGGCGCGCACTTGGCGGGCTCCTCGGGAATGGCGTCCGTTTCTGCTTTCCGGTCAATGCACCTGTCCCGGTCATGGCTGCAGGCGAGGGCATCGAAACAATGCTGTCGCTCGCACACGTGATGCCCGGCATGCCGATAGTGGCGGCGCTCACGGCCAACCACCTTGCCGCCTTCCGTTTCCCGCCCGGATGCCGGCGCATCTATATCGCCGCAGACGCGGATGCCGCCGGCCGGCATGGGATCGAGGGCATGAGCCGCCGGGCGCAGGAGTGCGGGATCCTGCCACTGGTGCTGTCGCCGGAGCTCGGCGACTTCAATGAAGATCTTCGCTGGCTGGGCCCGGACCGGTTGACGGCAAACCTCAGGGCACAACTCGTCCCGGAAGACGCGATGGCTTTTCTTCCAGCCTGACGTCGGGGACGAGGTTGGGGGAGGGCCCGGCCGCGGCACTGGCGAGGCGCAATCCATTGGCCTTATTCGGGATGGGCGCGCGCCCGCGGGCCTGCCGAGAGGCGACCCTTACCACGCGGGCCTCCGGGCCTTCAGCGGGTCGGTCGGGTTTCTTCCCCTGCCAGACCGCAGGCGCGGTCCGTCATTCCTCGCGGATCAAGAAACCCTCCCTCCGCCGCCCGGCGCTTCGCTGGGCCGCGGCACTTCGCTTGCGGTTCCGGCCTCGGCCCGCGTGATCGGGTTCGCCGTCAGGCCGCGAGAGGCGCGGCCCCAAACCGATGGAGACCACCATGGACCTGATCCTTCACCCCGAAGACACCTTCGAGCCCCACCATACATCCTCCCCGACCGACCGCTTCATCTACGAGATGCAGGTCTTCGGCTATCGCCCCTTCCAGGACGAACCCGATCCGCGGCCATTGCCGGAGGAGCCCCAAGTCCGCTCATCGATCACCACGCTCTTCGACGCCCTCGCAGAAATGCTCGGCGACACCCGCCTGGAGCCCGATCTCGAAGACCTTTTCTGGTCGATCCCCAATCTCTTCCATCGCGCCGGCGAACGGATCCAGCGCGAACTCGATCGTAACGAAGAGGCGCAACGCACCGGCCAGCGCGAGCAGGACGGCTCGGAGGTGAAGAGCGTCGAACTCGAACGCCTCATCGCCGAAGGCATCACGCTCCTGGAACGCCGCAACAGCTTCGAGTTCATGCGCGACTTCAGTGCCGACCTGTACGAGGCGCAGACCGGATCGTCCTGGCGGCCGCGCAGCGGCTCCAAGGTCAACCACGCCAACATGACGGCGGCGATGATCGACAGCCGGGACTTCCTCTCCGCCCGACGCCGCGCCGAAACCGAGGTGCTGATCCCTGCCGGAACCAAGATCGCATTCGCCGGCGGTATGGACTACAACGATCATGAGCGCATCTGGGCCAAGCTCGATCAGGCGCATGCTAAGCATCCCGACATGGTGCTCTTGCACGGCGGCTCGCCGAAAGGCGCTGAACGCATCGCCGCCTGCTGGGCGGAGGCGCGCCAAGTGACGCAGATTACCTTCAAGCCCAACTGGACCAAACATGCCAAGGCTGCGCCCTTCCGGCGCAATGACGAGATGCTTTCAGTCATGCCCGCCGGATTGATCGTCTTTCCCGGAAACGGCATCACCGGCAATCTCGCCGACAAGGCACGCCAGCTCGGCATCCCGGTCTGGCAGGCTTCAGGAGACGGCGCCTGA
Protein sequences of DBSCAN-SWA_36 >CP036360|361038:368446|367504_368446_+|QBJ16735.1|DBSCAN-SWA MDLILHPEDTFEPHHTSSPTDRFIYEMQVFGYRPFQDEPDPRPLPEEPQVRSSITTLFDALAEMLGDTRLEPDLEDLFWSIPNLFHRAGERIQRELDRNEEAQRTGQREQDGSEVKSVELERLIAEGITLLERRNSFEFMRDFSADLYEAQTGSSWRPRSGSKVNHANMTAAMIDSRDFLSARRRAETEVLIPAGTKIAFAGGMDYNDHERIWAKLDQAHAKHPDMVLLHGGSPKGAERIAACWAEARQVTQITFKPNWTKHAKAAPFRRNDEMLSVMPAGLIVFPGNGITGNLADKARQLGIPVWQASGDGA >CP036360|361038:368446|361038_363159_+|QBJ16729.1|DBSCAN-SWA MAQATQKITLSAARDIPFNKLVLSQQNVRKIKAGISIEDLAEDIAHRGLLTSLNVRPELDGDGNETGIYRIPAGGRRYRALERLVAQKRLAKTAGVPCIVSKGETLEVEDSLAENVQRVSLHPLDQFRAFQTLREQGLGEEEIAARFFVSVATVKQRLRLASVSSRLLDLYAEDEMTLEQIMAFSITNDHVRQEQVWDTVSRSHSREPYYIRRLLTETAIRASDRRAVYVGIEAYEAAGGVTMRDLFDQDQGGWLQDPALLEQLVMEKLKADAEAIRVSEGWKWVEAAFDFPYGHTSGLRRFYGEQAEMTEADLAHYDATRAEYDKLDAEYSEADEYSEATEQKLEELGAELDRLNDRPYVFDPEEVARGGLFVSLGVGGELNIERGFVRPEDEPKDAADPSADVGDSGDYASSVPATGAAGGEETQPDDEDQTVKPLPDRLVLDLTAARTVALRNALANDPVIAFIAVLHAFVLKTFYVYGSDSCLEVTLQSARFSQTPGLGDTVWAKEIEQRHEGWGQDLPKDPNDLWNFLIRLDEVSRQALFAHCASLSVNAVIEPWNKRTRAIAHAEQLANSIGFDLVEAGWTPTVDNYLGRVTKARILQAVREAKGDQAAELIGHLKKTDMAREAERLMTGCGWLPEPLRMTSDDGISESDALRETVGSDHAETGEAQDLPAFLLDQPETSGDDTGDTEIGIDGEHFAAAE >CP036360|361038:368446|363262_363919_+|QBJ16730.1|DBSCAN-SWA MPDFTIETTYHLSVFRHRTYAADTPEAACRAAIDDDSWDIAAMDFDSSGPVHITGIWDSKLAAYGGPPLQIPQQFDEPVQRRARHFEILLGLLKMLLDDISSARPPLPDWLARSAWAIARGEAILAGEPDPEEPVDLPKPSHVLVRLQEAGVRDAIAAVLEIDPSFKALSPEAVTDDEVHAACRSIAATMDFSDAVGSAEFQAALSAIRSAHRRLTSD >CP036360|361038:368446|365290_365545_+|QBJ16732.1|DBSCAN-SWA MAIPDHARTNFDTLLRAASDGNLALMECLDATTREPRYVLCGVGRSNGEFFFTPFGHLADGNPYDAYLPPDRENPAGFIVNPPC >CP036360|361038:368446|365672_366125_+|QBJ16733.1|DBSCAN-SWA MNIISHPQNVSTSPRPMARTSAGALIHGAEQIRSLEQGKGIATADLRQVMTEVFGGSDAEGRWLRKDAYKPTEVAQVLFLSRSITSRVQSPQSARAMLMKAARLSPTHTRRSEEMISWKLRSFVPVAGEGTAILSKLNDRHYLVDVAHRS >CP036360|361038:368446|366137_367175_+|QBJ16734.1|DBSCAN-SWA MHSASDLAGRLARDAEAVCRHYLSAGRRAGNYWLVGDVSNRKGRSLYVHLVGPRAGRWTDAATGQFGDLLDLIRETCGLVDFRDVADEARHFLSLPRPEPVSSRGTDAYDFAPVERRTGVQAQRLFRMTQPLAGTLADTYLRGRGILRASTHAALRFHPSCYYRDLVSGRTTSYPALIAAVTDSAGAITGVHRSWLDPDGAGKAKVDDPRRALGGLLGNGVRFCFPVNAPVPVMAAGEGIETMLSLAHVMPGMPIVAALTANHLAAFRFPPGCRRIYIAADADAAGRHGIEGMSRRAQECGILPLVLSPELGDFNEDLRWLGPDRLTANLRAQLVPEDAMAFLPA >CP036360|361038:368446|363993_365184_+|QBJ16731.1|DBSCAN-SWA MNSLASTNQPASSGFKVDISRGERIGRVSSEWFSRPDDERFLSLNDLYDTVRSRADRAHARTIESAAIRVEATRDNAERLELLVPGQRQPIAPTHWSYGQLCSLVGAPSSYMRQLPAPLAAINLQHGLLNHRAELVKTLEMDDGRLELRAVTGPEYGRIWDHELVSAVMKIAGNGTGDTMWKVPGVLDWATMSHNPFVDITKDTTTLYASDRDVFLFLVDDTHPIEAGRLPNGEPDLYFRGFYAWNSEVGSKTLGIASFYLRAVCANRNLWGTENFEEITIRHSKFAAQRFAHEAAPALTRFANSSPAPFIAGIKAARERIVARKDDDRESILRRRGFSKGETGKVIEMVLSEEGRPPESIFDFVQGITALARTKTNQDTRLELEGKAKKLLESAS |
7 | Emiliania_huxleyi_virus(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_37 |
372178 : 372457
Sequences of DBSCAN-SWA_37
Nucleotide sequences of DBSCAN-SWA_37 >CP036360|372178:372457|DBSCAN-SWA ATTACTTGTTCAGCGCATCCTTGAGAGCCTTGGCAGGCGTAAAAGCCAGCTTTTTCGCGGCGGCAACCTTGATCGTCGCGCCAGTCGCCGGGTTACGCGCCTCGCGCTCCGGAGTGTCCTTCACCTTGAATTTGCCGAAGCCGGGCAGCGATGTCTCGTTGCCGGCCACGGCCGCCTCGGTGATCGAAGCGATCACCGCCTCTACGATAGCTTTTCCCTGAACCTTGGTGAGGCCATGCTCGCTGGCAATCTTGTCGGCAATTTCATTGGTCGTGGTCAT
Protein sequences of DBSCAN-SWA_37 >CP036360|372178:372457|372178_372457_-|QBJ16743.1|DBSCAN-SWA MTTTNEIADKIASEHGLTKVQGKAIVEAVIASITEAAVAGNETSLPGFGKFKVKDTPEREARNPATGATIKVAAAKKLAFTPAKALKDALNK |
1 | Burkholderia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_38 |
379939 : 381442
Sequences of DBSCAN-SWA_38
Nucleotide sequences of DBSCAN-SWA_38 >CP036360|379939:381442|DBSCAN-SWA TGTGAAAAATATTCCTATGCAAACTCTTTCAAACTACCGTCAGGAGTGGTTCTCCAACATCCGTGGCGACGTACTTTCGGGTATTGTCGTCGCTCTCGCGCTTATCCCGGAAGCGATAGGCTTCTCGGTGATTGCGGGTGTCGATCCCAAGGTCGGCCTCTACGCTTCCTTCGCAATCGCCTGCGTGACGGCCTTTGTGGGCGGCCGGCCGGGCATGATCTCTGCCGCCACGGCGGCGACCGCTGTCGTCATGATCTCGCTGGTCAAGGACCACGGTCTTCAGTATCTCTTCGCCGCCACCATCCTGATGGGCATCATCCAGATCGCTGCGGGATGGCTGAGGCTGGGCCGCGTGATGCGCTTCGTTTCACGTTCCGTCATCACAGGCTTCGTCAATGCGCTTGCGATCCTGATCTTCATGGCCCAGTTACCCGAACTCGTGGGGGTTCCGACGCTCACCTACATGATGATTGCAGGCGGTCTTGCCATCATCTACCTCTTTCCATACGTGACCAAGGCGATCCCATCGCCGCTTGTGGCCATCGTGGTTCTCACGGCCATGGCCTGGTGGTTCGGCATGGACCTGCGCACGGTCGGCGACCTTGGCGAACTTCCCTCTTCGATCCCATTCCTGATGCTTCCTCAGGTTCCGTTGACCTGGGAGACGCTGCAGATCATCTTCTCATACTCGGTGACACTTGCTGCTGTCGGCCTGCTGGAATCGCTGCTTACCGCGCAGATAGTCGATGATATGACGGACACGCCCAGCAACAAGAGCCAAGAATGCGTTGGACAGGGCGCGGGGAATATCGCCTCGGCACTGATCGGCGGCATGGGCGGATGCGCCATGATCGGACAATCGGTCATCAACGTAACATCTGGTGGTCGAGGACGTCTTTCAACCTTCGTGGCTGGTTCTTTCCTGCTGTTTCTGATCGTCGTCCTCAACGATCTCGTCCGGATCATCCCCATGGCCGCGCTTGTTGCGGTTATGATTATGGTTTCAATCGGCACTTTCTCCTGGAGGTCGATCGTCGATCTGAGACGTCACCCGCTTCCGTCGTCCTTCGTCATGCTGGCGACCGTCGTCACCGTCGTTGCAACACATGATCTGGCGAAAGGCGTGATCGTTGGCGTCCTGCTGTCCGGCATCTTTTTCGCTGGCAAGGTAGCCCGTCTGTTCAAGGTCACGCGGCTGGAGAATCCCGAACAGAACAGTGTCACCTATGCAGTAGTTGGACAGGTCTTCTTTGCGTCGGCAGAAGCCTTTATCCAAGCTTTCGACTTCACCGATAAGGGCAAACGGATCATCATCGATCTCACAAGGGCCCATCTCTGGGACATCACCGCAATTGGCGCGTTGGACAAGGTCGTATTGAAGCTCCGTCAGGCGGGCAACGAGGTCGAGGTTCTCGGATTCAACGAAGCGAGTGCTGATATGGTTGATCGCTTCGCGCTGCACGACAAGGATGAGCGGCGCGCGGCAAGCGCCGCACTACACTGA
Protein sequences of DBSCAN-SWA_38 >CP036360|379939:381442|379939_381442_+|QBJ16748.1|DBSCAN-SWA MKNIPMQTLSNYRQEWFSNIRGDVLSGIVVALALIPEAIGFSVIAGVDPKVGLYASFAIACVTAFVGGRPGMISAATAATAVVMISLVKDHGLQYLFAATILMGIIQIAAGWLRLGRVMRFVSRSVITGFVNALAILIFMAQLPELVGVPTLTYMMIAGGLAIIYLFPYVTKAIPSPLVAIVVLTAMAWWFGMDLRTVGDLGELPSSIPFLMLPQVPLTWETLQIIFSYSVTLAAVGLLESLLTAQIVDDMTDTPSNKSQECVGQGAGNIASALIGGMGGCAMIGQSVINVTSGGRGRLSTFVAGSFLLFLIVVLNDLVRIIPMAALVAVMIMVSIGTFSWRSIVDLRRHPLPSSFVMLATVVTVVATHDLAKGVIVGVLLSGIFFAGKVARLFKVTRLENPEQNSVTYAVVGQVFFASAEAFIQAFDFTDKGKRIIIDLTRAHLWDITAIGALDKVVLKLRQAGNEVEVLGFNEASADMVDRFALHDKDERRAASAALH |
1 | uncultured_Caudovirales_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_39 |
385851 : 393544
Sequences of DBSCAN-SWA_39
Nucleotide sequences of DBSCAN-SWA_39 >CP036360|385851:393544|DBSCAN-SWA TATGTCGTTCCGACCGCTTCATGACCGCATTCTCGTCCGCCGGGTCGAATCCGAAGAAAAGACCAAAGGCGGTATTATCATCCCCGACACTGCCAAGGAAAAACCGCAGGAGGGCGAGGTCATCGCCGTTGGGCCCGGTGCGCGCAACGATGCCGGACAGATCCAGGTGCTCGACGTCAAGGTGGGCGACCGCATCCTGTTCGGCAAATGGTCAGGCACCGAGATCAAGATCAATGGCGAAGACCTGCTGATCATGAAGGAAAGCGATGTCATGGGCATCATCGGCGCCCAGGCTGAGCAGAAGAAAGCCGCCTGAGCGTCTTACCCTACACCCATCAGTCAGATAGCTGCCTATTGAGGAGTTAAAAAAATGGCTGCCAAAGAAGTCAAGTTCCACACCGATGCCCGTGAACGCATGCTGCGAGGGGTCGATGTGCTCGCTAATGCCGTGAAAGTCACCCTTGGCCCCAAGGGTCGTAACGTTGTTATCGACAAGTCTTTCGGCGCACCGCGCATCACCAAGGACGGCGTGTCGGTCGCGAAGGAAGTCGAGCTTGAAGACAAGTTCGAGAATATGGGCGCGCAGATGCTGCGCGAAGTCGCTTCGAAGACAAACGATCTGGCCGGCGACGGCACCACGACCGCAACTGTCCTCGCCCAGGCGATCGTCAAGGAAGGTGCCAAGGCGGTTGCCTCCGGTATGAACCCGATGGACCTGAAGCGCGGCATCGATCTTGCGGTCGATGCTGTCGTCAAGGAGCTGAAGACCAACGCTCGTAAGATAACCAGCAATTCCGAAATCGCCCAGGTTGGAACAATCTCTGCGAACGGGGATACGGAGATCGGCCGTTATCTTGCCGAGGCAATGGAGAAGGTCGGCAACGAAGGCGTCATCACCGTGGAGGAAGCAAAGACCGCAGAAACCGAACTTGAAGTCGTCGAAGGCATGCAGTTCGACCGCGGCTATCTGTCGCCCTACTTTGTCACCAATCAGGACAAGATGAGGGTCGAACTTGAGGATCCCTATATCCTTATCCACGAAAAGAAGCTCTCGAACCTGCAGGCCATGCTGCCGGTCCTCGAAGCCGTGGTCCAGTCGGGCAAGCCTCTCCTCATCATCGCTGAAGACGTCGAAGGCGAAGCCCTTGCAACGCTCGTCGTCAACAAGCTGCGTGGCGGCCTCAAGATCGCTGCCGTCAAGGCGCCTGGCTTCGGCGACCGCCGCAAGGCCATGCTGGAAGACATCGCCATCCTGACCGGTGGTACCGTCATCTCCGAAGACCTTGGCATCAAGCTTGAAAACGTCACGCTCAACATGCTCGGACGCGCCAAAAAGGTGGCTATCGAGAAGGAGAACACCACCATCATCGACGGCGTAGGTTCCAAGTCCGAGATCGACGGGCGTGTTGCGCAGATCCGCGCTCAAATCGAGGATACCACTTCCGACTATGACCGCGAAAAGCTGCAGGAGCGTCTCGCCAAGCTCGCCGGCGGCGTTGCCGTCATCCGGGTTGGCGGCTCGACGGAAGTCGAAGTAAAGGAAAAGAAGGACCGCGTCGACGATGCGCTGCACGCCACTCGTGCGGCGGTAGAGGAAGGCATCCTGCCTGGCGGTGGCGTCGCGCTGCTGCGCGCTGTCAAGGCGCTCGACAATCTCGGTACGGCCAACCAGGATCAGAGGGTCGGCGTTGATATCGTCCGCCGTGCAATCGAGGCACCCGTCCGTCAGATCGCCGAAAACGCCGGCGCGGAAGGCTCCGTTATCGTCGGCAAGCTGCGCGAGAAGACGGACTTCTCCTTTGGCTGGAACGCACAGACAGGCGAATACGGTGATCTCTACGCGCAGGGCGTTATTGACCCGGCCAAGGTCGTTCGTACTGCGCTTCAGGATGCCGCCTCTGTAGCAGGCCTTCTGGTGACGACCGAAGCGATGATAGCCGAAAAGCCGAAGAAGGATGCCGCGCCTGCACCCGCCGGAGCGGGAATGGACTTTTGATGGAAGGGGGCGCCCAGAGGGCGCCCCCTGTTTGGTGCTCCGTGAGTAACGATGCGGAGCCTGACCAGCCAAATACCAGCGCGAGTCCTCTTCTGCAAATTCGCGGGCGTCTGGATGAACGCAGTGAAGGCGGCCGGTGGGCTGTTCCTCCCGGAATAATAAAGGGCCACGGGCGTCAGGAGCGGCGTCCACTCCTCCAATATGTGCGTCAGTCGTCCAGCCCCAACCCTTCGCGGCAAAGGTTTCTGCGAAAGCGAAATCGACCACACGAAACTTTCACCAGGAAATAAATCGGTTCCGCCGAGGTTTTTCCATTCTGCTGTCAGTCAGGGTGGAATGCTGCGCGGTTGACATGGCGACCTCGTCGCAGTTCGACTGCCCGAATTTCGGAACGGATGCGACATCGACTGAGGAGGGGCCTGTTTAATGTCTCCTGGCTTTGCATAGTCGAGTTAACCGCACTGCGAGTTACTAGTGGCCGAATTTTTAGTCATCCCACCACCGCCGCCGAAAACGCGGATTTGATCAGTGTTTTCGTATATTCAGCCGACGGCCGGTTGAAGATGGCGTTTGTCTCTCCCTCCTCGACGATCTTTCCGTTTCTCATGACGATGACGTAATCTGAAATGGCCTTGATGACGGACAGGTCGTGGCTGATGAAGATGTAGGAAAGGCCATGGGCCTTCTGTAGATCACGCAGGAGGTCGATAACCTGGCCCTGAACGGACCGGTCCAGCGCCGACGTGGGTTCGTCAAGAATGATGACCTCTGGTTTGAGAATAATGGCGCGGGCAATGGCGATACGTTGGCGTTGCCCCCCGGAAAATTCGTGGGGGTAGCGGTTGCGGGCGGCGGGATCGAGGCCCACTTCCTTCAGCGCCTCGATTGCGCGCCTGTCCCGTTCCGCACGGTTGAGGTTCTGGTCGTGAACGAAGAGCCCCTCGGTGATGATCTCGCCTACCGTCTGTCGCGGCGAAAGCGAACCATAGGGGTCCTGGAACACCAGTTGGAGATGCCGCCGTAGCGGCCGCATGGCCTTTCGATCAAGCCCGGTGATTTCCTGTTTTCCGAAGAAAATCCGGCCATCGGCCTGGATCAGGCGCAGCAGAGCGCGACCGAGCGTGGATTTGCCTGACCCGGATTCGCCAACAATGCCGATCGTCTGTCCCTGATGAAGATTGACGTTCACGCCATCAACGGCCCGGAACTTGCTCGATTTCGACAGGAAGCCGCCCGGAATTGTGTAATCGATCGTTACGTTCTGGCCGGAAAGCACGATCGGAGCCTCTCGGCTGACTGCAGGCTTGTTCCCTCGGGGCTCGGCATCGAGAAGCATCCGGGTATAATCCGCCTTCGGGCGCTCGAAGATATCCTCCGTTGTCCCAGCCTCCACGATTTCCCCCCGGCGCATGACCGCGACGCGATCGGCAAAGTGGCGGACGACGCCGAGATCATGCGTGATCAGCACGATGGCCATCCCGAAGCGTTGCTGCAGCGACTTCAAAAGATCAAGGATCTGGGCCTGGATCGTAACGTCGAGGGCGGTGGTCGGCTCATCCGCAATCAGGATGTCCGGCTCGTTCGCGAGCGCCATGGCGATCATGACGCGCTGGCGCTGACCACCGGAAAGCTCGTGAGGATAGCTGTCTATACGCCGCTCGGGCTCCGGAATTCCGACGAGCCTCAGCAGTTCGAGAACGCGTGCGCGCGCCTCCTTTTTACTGCCACCGCGGTGGTGAATGATCGGCTCAGCAATCTGCGCACCGATCCGGTAGAGAGGATCGAGCGAGGTCATCGGCTCCTGGAAGATCATGGTGACTTTCGCGCCACGGATTTTGTTCAGCTCGCCGACAGGCAGTCCAAGAAGTTCCCTTCCGCGATATTTCGCCGAGCCGCTGATGGCGCCATTGGATGCAAGCAATCCCATGATGCCCATCATCGTCTGGCTTTTGCCCGAGCCGGACTCCCCGACGACTGCGAGCGTTTCACCCTGTTTAACGTCAAGGTCGATGCCCTTGACGGCATGGACAGTGCCATCCGGCGTGGTGAAATCGACCTTGAGGTCGCGAACGGAAAGAATGGTGTCAGTTGTCGTCTTCATGTCAACGATCCTTGGGATCGAGTGCATCACGCAACCCATCGCCGACGAAATTCAGCGAAAACAGGGTCAGCACGAAAAAGATCGCCGGGAATATCAGAAGCCAGGGGGCGGACTGGATATTGTTGGCACCTTCGGAAATCAGCGCGCCCCAGCTTGTCAGGGGCGCCTGAACGCCGAGACCGAGGAAGGACAGGAAGCTTTCCAGAAGAATGACTTTGGGCACGACGACCGTCACGAACACGACAACGGGTCCGATCGTATTGGGGATGATATGACGGCGGATAATCTGCCAGTCGCTTAGTCCCAGAGCCTGCGCCGCACTGACAAACTCCCGTCGTTTCAAGGCGAGTGTCTGGCCACGCACGATGCGCGCCATGTCCAGCCATTCCACTGCGCCGATGACGAGGAAGATCAGGATGAAGCTGCGGCCAAAAAAGACGACCAGAACCACAACCAGAAAGACGAAGGGCAGTGAATAGAGGATTTCGACGAAGCGCATCATCACATTGTCCACGCGCCCGCCAATATAGCCCGCTGTTGCACCGTAGACCACGCCGATTCCCAGGGAAACGAGGCTCGCCAGGACACCGACCGCAATCGAAATCTGACCGCCAAGCATGACGCGCGCAAGCATGTCGCGGCCGTTGGAATCTGTACCGAAGAAAAAGTATTCGCGGTTGACGTCGCCTTCCAGCTTCAGCGTGCGTCCGTCGTCTTCGGTCGCAATCACCCTGGTATTTCTGAATTCATTGGCACGGTCGAAGTAGCGTGTTGCGCGCGGATCGATTGCGCTGCTGGAGGCGATGGTCGCCGTGAAAGTCTGGCCTTCAACGGCAAATTCCTTCAGCTCGACGCGGGCCCGGCTCGCAACACCCTCCATCACGTCCTGAAGGTTATTCACATCCGGGCGTGGCTCGAGACTGGGTGCCACCGAGACATAGGAGGAAAACACCTGATCATAGGTATGTGCCAGGAAATGAGGCCCGACAAACGAGAACAGAGTAATAAGGAGCAACATGACGGAACCCGCCATGGCTGCCTTGTTGCGTCTGAAACGCAGGACTGCCAGCTGGAACAGGCTGCGGCTTTTGATTTGTTGCGAATGAGAGTTAGTGCCGGGGCTATCAGTCATGTCTGACCCTCGGATCAAGAAGGCCGTAGAGAATATCGACGACTAGATTGAACAGGATGACGAAAATGGCGACGAGAACAACGGTGCCCATTACCAGCGTATAGTCACGATTGATGGCGCCGAGAACGAAGTAGCGGCCGACACCGGGAATGGTGAAGATCGTTTCGATGACTGCCGAGCCAGTCAAAAGCGCGGCGGCGCAGGGAGCGAGATAGGAAACGACCGGCAGCATTGCCGCGCGCATGGCGTGGAAAACGACGACGCTGCGTGCGGGAAGGCCATAGGCCTTGGCCGTGCGGATGTGGTCCATTCGCAGTGCCTCGATCATGGCACCGCGCGTCAGCCGGGCAATGACCGCAAGCTGCGGCAGCGCAAGTGCGATCATCGGAAGAATGAGGTAGCGTAGCGACCCGTCACCCCAGCTGCCTGCCGGAAGAAGCCCGAGCAGCACGGCAAATACAAGCGTGAGAACAGGGGCCACGACGAAGTTCGGGACGGTGACGCCCACTGTCGAGATTGACATGATCGAGAAATCGAAAGCGCTGTTTTGCCTGAGTGCTGCGAGCGTGCCGGCGAGCACGCCGCCCACCAGAGCCAGCAACAGGGCATAACAACCCAGTTCGAGCGAATAGGGCAGGCCCTTGCCGATCAGCTGCGCGACGGTGTTGTCCTTATAGATGTAGCTTGGGCCAAAATCGCCGGTGACGGCATTGCCGAGATAGATGAGATACTGGCGCCATAGTGGCTCATCCAGATGATAGGTTCTCATCAGATTTTCCATGGTCTGCGGGGGGAGGGGACGTTCGAGATTGAACGGACCACCGGGCGCAAAACGCATCAGGAAAAACGAGATCGTGACGACGATAAACAACGTCGGCACGGCGCTCGCCAAACGGCGCAGAAACCGGCATCGACATGGCTGTTCCGCAGAGCTCAACCCTTGCGACCCAGGATACGGCATCCGATGCGGCAAGGCGCAATTTTGCCGAAACCTTCGGACGAGTGGCCGACCGCACGATCCAGCGCAATATGGATGTACAGCCGACGCTGCAAATCCGGCCCGGCTATAAATTCAATGTGCTAGTAGATCAGGACATTGTCTTTTCAGGGAAATATCGTTGAGAAAGCAGCCTAGCCCTAGTCGCCGTTTCCTTCCGTGGCCAGCGCGGCATAGCTAAATCACGAAGCATGCCAACTGAACCGCACCGGGTTTGCCGGAGGCCATTTCGTTTAAGTTATGCTTGCCACGGCCGGTGCGTCCAGCATGGCGTAATATCGTTCTTCGGCTTCGGCTGGCGGTATGTTGCCGATGGGCTCCAGAAGGCGACGGTTGTTGAACCAATCTACCCATTCCAGCGTGGCGAACTCCACGGCTTCGAAGTTTCGCCACGGTCCTCGCCGATGGATGACCTCGGCCTTGTAAAGACCATTGACCGTTTCAGCGAGAGCGTTGTCATAGGAACCTCCAACACTCCCGACCGACGGCTCGATGCCCGCTTCCGCCAGCCGTTCGGAATAGCGAATGGACACATATTGCGATCCGCGGTCGGAATGGTGAACCAGTCCTCCACGGTGGACGGGCCGCCGATCATGAAGCGCCTGGTCGAGGGCATCGAGAACGAAACCCGCATGGGCAGTCCGGCTTGCTCGCCAACCGACGATGCGGCGAGCGAAGGCATCGATCACGAACGCGACGTAAACAAAGCCCTGCCAAGTGGCCACATACGTGAAATCAGAAAGCCAGAGCATGTTGGGCGAAGGCGCGAAAAAGTGCCGGTTCACCCGGTCGAGCGGACAAGGCGCGGCCTTGTCCGACACGGTGGTTTTGACGGGTTTGCCACGGATGATGCCCTGTAGTCCCATTGCCCTCATGAGCAGAGCAACAGTGGAGCGGGCGATGTCGTAGCCTTCTCGCTGCAACTGCCGCCAGACTTTACGCACGCCATAGACCTGGAAGTTCTCGTTGAACACCCGGCGTATTTCGATCTTCAAACCGATATCGCTGCGGGCCCGGACCGACAGGCGATCCACATCCAGGCGCTTGGCAACGTTCTCGTAGTAGGTTGATGGGGCAATCGGCAAAAGCCTGCAGATCGGCTCGACCCCGAACACGCCACGGTGTTCGTCAATGAACGAGATCATCGTTTGAAGGGGCGGTCGAGCTCCGCCATCGCGAAATGAGCAGACGCCTTGCGCGAAATCTCGTTGGCCTGACGAAGCTCGCGGTTCTCGCGCTCAAGAGCCTTCATCTTCTCGGCCGCGTCGCTCGGCAAGCCTGCTCGTTTGCCGCTGTCGACCTCGGTCTTCTTCACCCATTCATGCAGCGCGGCTGGCGAGCAACCGATCTTGCCCGCAATAGATGAAACGGCAGCCCACCGCGATGGGTGCTCAGCCTCGTGATCCAGCACCATACGGATGGCGCGTTGGCGGACTTCGGGTGAAAACTTGTTCGTTGTCTTGCTCAT
Protein sequences of DBSCAN-SWA_39 >CP036360|385851:393544|388336_389926_-|QBJ16862.1|DBSCAN-SWA MLSVRDLKVDFTTPDGTVHAVKGIDLDVKQGETLAVVGESGSGKSQTMMGIMGLLASNGAISGSAKYRGRELLGLPVGELNKIRGAKVTMIFQEPMTSLDPLYRIGAQIAEPIIHHRGGSKKEARARVLELLRLVGIPEPERRIDSYPHELSGGQRQRVMIAMALANEPDILIADEPTTALDVTIQAQILDLLKSLQQRFGMAIVLITHDLGVVRHFADRVAVMRRGEIVEAGTTEDIFERPKADYTRMLLDAEPRGNKPAVSREAPIVLSGQNVTIDYTIPGGFLSKSSKFRAVDGVNVNLHQGQTIGIVGESGSGKSTLGRALLRLIQADGRIFFGKQEITGLDRKAMRPLRRHLQLVFQDPYGSLSPRQTVGEIITEGLFVHDQNLNRAERDRRAIEALKEVGLDPAARNRYPHEFSGGQRQRIAIARAIILKPEVIILDEPTSALDRSVQGQVIDLLRDLQKAHGLSYIFISHDLSVIKAISDYVIVMRNGKIVEEGETNAIFNRPSAEYTKTLIKSAFSAAVVG >CP036360|385851:393544|391071_392040_-|QBJ16756.1|DBSCAN-SWA MPYPGSQGLSSAEQPCRCRFLRRLASAVPTLFIVVTISFFLMRFAPGGPFNLERPLPPQTMENLMRTYHLDEPLWRQYLIYLGNAVTGDFGPSYIYKDNTVAQLIGKGLPYSLELGCYALLLALVGGVLAGTLAALRQNSAFDFSIMSISTVGVTVPNFVVAPVLTLVFAVLLGLLPAGSWGDGSLRYLILPMIALALPQLAVIARLTRGAMIEALRMDHIRTAKAYGLPARSVVVFHAMRAAMLPVVSYLAPCAAALLTGSAVIETIFTIPGVGRYFVLGAINRDYTLVMGTVVLVAIFVILFNLVVDILYGLLDPRVRHD >CP036360|385851:393544|392309_393544_-|QBJ16757.1|transposase|DBSCAN-SWA MSKTTNKFSPEVRQRAIRMVLDHEAEHPSRWAAVSSIAGKIGCSPAALHEWVKKTEVDSGKRAGLPSDAAEKMKALERENRELRQANEISRKASAHFANGGARPPLQTMISFIDEHRGVFGVEPICRLLPIAPSTYYENVAKRLDVDRLSVRARSDIGLKIEIRRVFNENFQVYGVRKVWRQLQREGYDIARSTVALLMRAMGLQGIIRGKPVKTTVSDKAAPCPLDRVNRHFFAPSPNMLWLSDFTYVATWQGFVYVAFVIDAFARRIVGWRASRTAHAGFVLDALDQALHDRRPVHRGGLVHHSDRGSQYVSIRYSERLAEAGIEPSVGSVGGSYDNALAETVNGLYKAEVIHRRGPWRNFEAVEFATLEWVDWFNNRRLLEPIGNIPPAEAEERYYAMLDAPAVASIT >CP036360|385851:393544|386220_387846_+|QBJ16754.1|DBSCAN-SWA MAAKEVKFHTDARERMLRGVDVLANAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEVELEDKFENMGAQMLREVASKTNDLAGDGTTTATVLAQAIVKEGAKAVASGMNPMDLKRGIDLAVDAVVKELKTNARKITSNSEIAQVGTISANGDTEIGRYLAEAMEKVGNEGVITVEEAKTAETELEVVEGMQFDRGYLSPYFVTNQDKMRVELEDPYILIHEKKLSNLQAMLPVLEAVVQSGKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLEDIAILTGGTVISEDLGIKLENVTLNMLGRAKKVAIEKENTTIIDGVGSKSEIDGRVAQIRAQIEDTTSDYDREKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRVDDALHATRAAVEEGILPGGGVALLRAVKALDNLGTANQDQRVGVDIVRRAIEAPVRQIAENAGAEGSVIVGKLREKTDFSFGWNAQTGEYGDLYAQGVIDPAKVVRTALQDAASVAGLLVTTEAMIAEKPKKDAAPAPAGAGMDF >CP036360|385851:393544|385851_386166_+|QBJ16753.1|DBSCAN-SWA MSFRPLHDRILVRRVESEEKTKGGIIIPDTAKEKPQEGEVIAVGPGARNDAGQIQVLDVKVGDRILFGKWSGTEIKINGEDLLIMKESDVMGIIGAQAEQKKAA >CP036360|385851:393544|389948_391079_-|QBJ16755.1|DBSCAN-SWA MTDSPGTNSHSQQIKSRSLFQLAVLRFRRNKAAMAGSVMLLLITLFSFVGPHFLAHTYDQVFSSYVSVAPSLEPRPDVNNLQDVMEGVASRARVELKEFAVEGQTFTATIASSSAIDPRATRYFDRANEFRNTRVIATEDDGRTLKLEGDVNREYFFFGTDSNGRDMLARVMLGGQISIAVGVLASLVSLGIGVVYGATAGYIGGRVDNVMMRFVEILYSLPFVFLVVVLVVFFGRSFILIFLVIGAVEWLDMARIVRGQTLALKRREFVSAAQALGLSDWQIIRRHIIPNTIGPVVVFVTVVVPKVILLESFLSFLGLGVQAPLTSWGALISEGANNIQSAPWLLIFPAIFFVLTLFSLNFVGDGLRDALDPKDR |
6 | uncultured_virus(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_40 |
399012 : 399764
Sequences of DBSCAN-SWA_40
Nucleotide sequences of DBSCAN-SWA_40 >CP036360|399012:399764|DBSCAN-SWA TTCATCTGGAGTTTATGACGGCTGCTGCGAGATGGATGATGGCGTTGAAGCTCTGGTCTGTTTTGTCGGCGCGCATGGCGATGCGCTTGAATTCCTTGAGCTTGCAGAAGAAGTTCTCGATGAGATGACGCCATTTGTACATCTCGGCGTCGAACGGCAGTGGCTTGGCGCGGCGCGGATGCTGGGAGATGACAACCTTGGCGCCGCGTTCGTCGAGGTCGGCGATGATGGCGTTGCTGTCGAAGGCCTTGTCGGCGATCAGCGCGCCGAAAGCGACGCCGTCAATGAGCGGCGGCACGCCGACGGTATCGAAGCGATGGCCGGGCAGCAGGACGAAGCGGGCAAGGTTGCCGAGTGCGTCGATCAGCGCGAGGATTTTGGTGGTCATGCCGCCCTTGGAACGGCCTATGGCCTGGCTCTGAGTCCCCCTTTTGCGCCCTGGCCGTGGCGGTGGACCTTGACGATGGTGGCATCGACCATGGCGTATTCCATGTCGGGCTCGTCCGAGCAGGCCTCAAAAAGCCGCACGAAAACATCGGCTTTGACCCAATCACGGTATCGCTTGAAGACGGTGTTCCAGTTGCCGAAGAAGGCGGGAAGGTCACGCCACGGACTGCCGGTACGCACCACCCACAGCACCGCTTCGATGAACCGTCGGTTATCGCTGCCGCTTCGCCCGGGGTCCGTCGGCTTGCGCAGGCAGTGCGGCTCCATCTTCGCCCATTGGGCGTCAGTCAATACGAATCGTTCCAT
Protein sequences of DBSCAN-SWA_40 >CP036360|399012:399764|399012_399764_-|QBJ16761.1|transposase|DBSCAN-SWA MERFVLTDAQWAKMEPHCLRKPTDPGRSGSDNRRFIEAVLWVVRTGSPWRDLPAFFGNWNTVFKRYRDWVKADVFVRLFEACSDEPDMEYAMVDATIVKVHRHGQGAKGGPQSQAIGRSKGGMTTKILALIDALGNLARFVLLPGHRFDTVGVPPLIDGVAFGALIADKAFDSNAIIADLDERGAKVVISQHPRRAKPLPFDAEMYKWRHLIENFFCKLKEFKRIAMRADKTDQSFNAIIHLAAAVINSR |
1 | Paenibacillus_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_41 |
418207 : 422249
Sequences of DBSCAN-SWA_41
Nucleotide sequences of DBSCAN-SWA_41 >CP036360|418207:422249|DBSCAN-SWA CATGAAATCGTTCAACCTATCCGACTGGGCGTTAGAGCACAGATCGCTTGTCTGGTATTTCATAATCGTCTTCGTGATCGCGGGCGCGTTCTCCTACATCAAACTCGGGAGAGAGGAGGATCCCTCGTTCACGATCAAGACTATGATCATACAAGCGGAATGGCCGGGTGCGTCTGCTGACGAAATGGCCAGGCAAGTCACCGACAGAATCGAGAAGAAGCTCGAGGAACTCCCCGCACTCGACTTCACTCGCAGCATGACGGTATCCGGCAGGACGACCATTTTCCTGGATATTCTCCCGTCGACAAAAGCGAGCGAGGTCGAGAAGAACTGGCTCCTCGTCAGGAACATGATCAACGACATTCAGGCGACCCTTCCAGCCGGAGTGCGGGGTCCGTTCTTCAACGACAGGTTCGGTGATGTTTTCGGCAACATATACGCTTTCACGTCAGATGGCGTGACTCAGCGCGAGATGCGTGATCTCGTCGAGAACGCTCGCGCCGAGGTTCTCACGGTGCCGAACGTTGGCAAGGTGGAAATCCTCGGAGCTCAGGACGAAGTGGTGTACCTCGAGTTCTCCACCCGTAAGCTGTCGGCGCTAGGCGTAGATCGTCAGACCGTGATCGAGACGCTTCAGGCGCAGAACGCGCTGACGCACTCGGGCGTCGTCGAGTCCGGGGCCGAAAGGATCGCCCTCAGGGTCAGCGGATCGTTCGAATCCGAAAAGAGCCTCGGTGCGATCAATTTACGGATCGGAGACCGCTTTTTTCCGTTGAGCGAGATCGCCACGATCCGACGAGCCTATGCCGACCCTCCTGCCACACTGTTCCGTTACAACGGCGAGCCCGCGCTCGGATTGGCAATCGGCATGCGTACGGGAGCCAACCTGCTGGAGTTTGGCGAGGCGCTCAAGCGAAAGGTTTCGGAGATAGAGGCGAACATGCCCGTGGGCGCTGACGTCCATCTGGTTTCCGACCAGCCCGCCGTCGTCGAAGAAGCCGTATCCGGTTTCACCAGGGCCCTGTTCGAGGCGGTGGTGATCGTTCTTTTTATCAGCTTCATAAGCCTCGGCGTGCGCGCGGGTCTCGTGGTGGCCGTGTCTATCCCGCTCGTATTGGCGATAACGTTCGTTTTCATGGAGTACACGGGCATCTCGATGCAGCGTATTTCACTCGGCGCGCTGATCATCGCCCTCGGACTTCTTGTTGACGACGCCATGATCGCCGTCGAAATGATGGTCGCCAGGCTTGAGGCGGGCGACAATATCAGGAAGGCGGCGACCCACGTCTACACGACAACGGCGTTCCCCATGCTGACCGGGACGCTGGTGACGGTTGCGGGCTTCATCCCCATCGGCCTCAACAGCAGCGCCGCCGGGGAGTTCACGTTTACGCTGTTCGTCGTGATCGCGGTATCGCTCGTCGTGTCGTGGATCGTCGCTGTCCTTTTTACTCCACTTCTCGGGATGACCCTGCTTCCCAAGACGATGGCAAGCCACAAGGCGCGAAAGGGAATAGCCGCACGAGTTTTCTCGCGGGTACTTTCCGCAGCCGTTCGGTGGAGATGGGTGACGATCGCCCTTACCGTCGGTGCTTTTGGGCTGTCCATCGCCGGGATGAGCCTCGTCCAGCAGCAGTTCTTTCCAAACTCCGATCGCCATGAACTGATCGTGGACTGGAATCTGCCGCACAACACCTCGTTCGCGGAGATCAACCGGCAGATGCAGACGTTCGAGAAGGACACCCTATCGGGCAACGAAGACGTCGCTCACTGGTCGACCTACGTCGGCACGGGAGCTCCCCGCTTCATCCTGGCGTTCGACGTCCAGACGCCGGACACCTCGTTCGGGCAGACGGTCATCGTCTCCAAGTCCCTCGAGGCGCGCGACAGGTTGCGCCCCAAGTTGCAGCAGTATCTCAAGGACATGTTTCCCGGCACCGATGCCTACGTGAAACTCCTCGATATCGGGCCGCCGATCGGGAAACCGATCCACTACCGGGTCAGCGGGCCCGATATCGACGAGTTGAGATCACTTGGTCAGGAGTTGCTCGGCCTGGTCAGCAGGCATCCGCTGCTCCACAACACGATCCTCGACTGGAACGAGCCCGAGCGCGTCGTCAAGATCGACGTTCTCCAGGACAAGGCCCGCCAGCTCGGCGTCACGTCCCAGGAAATCGCGACTACCCTCGATACGGTTGTCGAAGGCGCGCCTGTCACGCAGATCAGGGACGACATCTACCTGATCAATGTTGTCGGCCGGGCAAACGCCACGGAACGAGGTTCACTTGAGACGCTAAGCAACGTGCTCGTCCCTGCGTCCAACGGCAAAGCGGTTCCCCTTTCGGCAGTTGCCACGCTGCGGTATGAACTCGAACCTCCGAAAATATGGCGGCGGGACCGTACTCCGACCATCACGGTCAAGGCTGCGGTTTCCGGTCCGACCCAGCCGGACACCATCGTCAAACAGCTTGGCCCCGAGATCCAGAAGTTTTCGGAGACCCTGCCATCCGGGTATTCCCTCGCGGTCGGCGGGACGGTCGAAGAAAGCGCGAAGTCGCAGGGTCCGATCGCTGCGGTCGTCCCGCTGATGCTCTTCATCATGGCGACGGTTCTTATGATCCAGCTTCAAAGCTTCAATCGGCTCTTTCTGGTTTTTGCCGTCGCGCCGCTCGCCCTCATTGGCGTTGTGGCCGCGCTGGTCCTCAGCCATGCCCCGTTGGGGTTCGTGGCCATCCTCGGAATCCTCGCCCTGGCGGGTATCCTGATCCGAAATTCGGTGATCCTGGTCGTTCAGATCGAAGAGCTGAAAGCCGAAGGCACGCCAGCCTGGCAAGCAGTGATTGAGGCGACCGAACATCGCATGCGGCCGATCATGTTGACGGCCGCGGCGGCGACACTGGCGCTCATACCGATCTCGCGTGAAGTCTTCTGGGGACCCATGGCCTACGCGATGATGGGCGGCATTGTCGTCGGTACGGTCCTGACGCTGATTTTCCTGCCGGCGCTGTATGTCACCTGGTTCCGGATCAAGCCGGAAGCGGTCGAGACAGGCGGACCCGAGCAGATCATCGACCAGGGAAACCCGGTGGACGAGGCGGCGTGACCGCGCCGGCGCGTCGGCGGAAGTACGCCTCCGAATAGAAAAGACACCCCGCTAGCAGCGTTCGCCGTCGGGGTGTTTTGCTTTAGATCGGTGTAATCGGTCTTCAGCTTCGTTCGGACAGTCTAGCGATTACACGTTCCAGGCCAGCAGCACCTCCGAGAGCCCAAGCCTCGATAGCGGAGACATCCAGACCTAGCTTTTTCGCCGCCGCTGCAAACTCTTCGACGTCGGTGCGGTTGAGCTTCGCGATATGGGCCGCGATCTGCGGCGTCACCTTAGTGCCGAGAGCGTCAAGGATGAAGCTTCCGCCGCTGCTGTGGGAAAAAGCCTTCTGCAGGCTTGCCGGGGACACGCCGAACGTGTCGGCGATCGAAAAAACCTCGACGACGTTCCGCAGGTTCGAGACGGTGAGAGCGTTGTTGCAGAGCTTGCTGGCTTGCCCGGTTCCCGCTCCACCCATGTGAACGACGTGCGTGCTATGGGTGGAAAGGATTTCGCCGCACTTTTCCAGAGTGGCGGCATCCGTCCCCACGAAGCAAGTGAGCGTCCTTGCGACCGCTCCGGGCCGTCCTCCGCTGACCGGGGCATCAACGAAGCGGATTTCCTTTGCTTTGCAGGTTTCCTCGAAGGCCCTCGCTTCACCTGGGTCCCCGGTGGCATGGTTCATAAGGATCGCGCCGGAGGGCAGGGCGTCAAGCACTCCCCCTTGCAGCAATTCAAGCAGGTCGTGGTCGGACCGTAGGCAGACGCAGAGCACGTCCACCGCCGTCGCCAGTTCGACCGCCGATTCATGCCGCGTGAATAAAGTGCCCTCAAGGCTCTCGTAGGAGAAATCTCGTCTGGCCCAGACGTGAAGGTTGAATTTTTCAGCGATCGCGACCGCCATCGGCGCGCCCTGGTCGCCGATGCCGATGAAGCCGACGTTCAGCGGGCCGCTCAT
Protein sequences of DBSCAN-SWA_41 >CP036360|418207:422249|418207_421309_+|QBJ16779.1|DBSCAN-SWA MKSFNLSDWALEHRSLVWYFIIVFVIAGAFSYIKLGREEDPSFTIKTMIIQAEWPGASADEMARQVTDRIEKKLEELPALDFTRSMTVSGRTTIFLDILPSTKASEVEKNWLLVRNMINDIQATLPAGVRGPFFNDRFGDVFGNIYAFTSDGVTQREMRDLVENARAEVLTVPNVGKVEILGAQDEVVYLEFSTRKLSALGVDRQTVIETLQAQNALTHSGVVESGAERIALRVSGSFESEKSLGAINLRIGDRFFPLSEIATIRRAYADPPATLFRYNGEPALGLAIGMRTGANLLEFGEALKRKVSEIEANMPVGADVHLVSDQPAVVEEAVSGFTRALFEAVVIVLFISFISLGVRAGLVVAVSIPLVLAITFVFMEYTGISMQRISLGALIIALGLLVDDAMIAVEMMVARLEAGDNIRKAATHVYTTTAFPMLTGTLVTVAGFIPIGLNSSAAGEFTFTLFVVIAVSLVVSWIVAVLFTPLLGMTLLPKTMASHKARKGIAARVFSRVLSAAVRWRWVTIALTVGAFGLSIAGMSLVQQQFFPNSDRHELIVDWNLPHNTSFAEINRQMQTFEKDTLSGNEDVAHWSTYVGTGAPRFILAFDVQTPDTSFGQTVIVSKSLEARDRLRPKLQQYLKDMFPGTDAYVKLLDIGPPIGKPIHYRVSGPDIDELRSLGQELLGLVSRHPLLHNTILDWNEPERVVKIDVLQDKARQLGVTSQEIATTLDTVVEGAPVTQIRDDIYLINVVGRANATERGSLETLSNVLVPASNGKAVPLSAVATLRYELEPPKIWRRDRTPTITVKAAVSGPTQPDTIVKQLGPEIQKFSETLPSGYSLAVGGTVEESAKSQGPIAAVVPLMLFIMATVLMIQLQSFNRLFLVFAVAPLALIGVVAALVLSHAPLGFVAILGILALAGILIRNSVILVVQIEELKAEGTPAWQAVIEATEHRMRPIMLTAAAATLALIPISREVFWGPMAYAMMGGIVVGTVLTLIFLPALYVTWFRIKPEAVETGGPEQIIDQGNPVDEAA >CP036360|418207:422249|421412_422249_-|QBJ16780.1|DBSCAN-SWA MSGPLNVGFIGIGDQGAPMAVAIAEKFNLHVWARRDFSYESLEGTLFTRHESAVELATAVDVLCVCLRSDHDLLELLQGGVLDALPSGAILMNHATGDPGEARAFEETCKAKEIRFVDAPVSGGRPGAVARTLTCFVGTDAATLEKCGEILSTHSTHVVHMGGAGTGQASKLCNNALTVSNLRNVVEVFSIADTFGVSPASLQKAFSHSSGGSFILDALGTKVTPQIAAHIAKLNRTDVEEFAAAAKKLGLDVSAIEAWALGGAAGLERVIARLSERS |
2 | Leptospira_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_42 |
435083 : 437362
Sequences of DBSCAN-SWA_42
Nucleotide sequences of DBSCAN-SWA_42 >CP036360|435083:437362|DBSCAN-SWA CATGGGCTTTGTGAAAACGAAAGACGGCGCGGAAATCTTCTACAAGGACTGGGGCCACGGTCAGCCGATCGTGTTCCATCACGGATGGCCGTTGAGCGGCGACGACTGGGATGCCCAGATGTTGTTTTTCCTGGAAAAGGGCTTCCGCGTCGTGGCCCATGACAGGCGTGGCCACGGCCGCTCGAGCCAGGTCGCCGAGGGTCACGACATGGACCACTACGCGTCGGACGCCGCAGCCGTTGCCGAACATCTCGATCTCCGCAACGCGGTTCACATCGGCCACTCGACGGGCGGTGGTGAAGCCGCGAGGTACGTCGCCAGGCACGGCAAGGGCCGTGTCGCCAAGCTCGTTCTCGTCGGCGCGGTTCCGCCGATCATGGTCAAGACCGCTGCGAACCCCGGAGGTCTGCCAATCGAGGTGTTCGACGATTTCCGCAAGAACCTGGTCGCCAACCGCGCCCAGTTCTTCCTCGATATTCCTTCCGGCCCTTTCTATGGCTTCAATCGGCCGGACGCCAAGGTTTCCCAGGGCGTCATCCAGAATTGGTGGCGACAGGCCATGGTTGGCGGCGCGAAGGCGCACTACGACAGCATCAAGGCGTTCTCCGAGACGGATTTCACGGAAGACCTCAAGTCGATCACGGTTCCCACGCTCATCCTGCACGGCTCGGACGACCAGATCGTGCCGATCGATGACTCCGCCAAGCTCGCCATCAAGCTGCTGAAAAACGGGACTCTGAAAATCTACGACGGCTACCCGCATGGCATGTGCACGACCCACGCCGACGTCATCAACCCGGACCTTCTGGCGTTCGTCCAGGCCGCCGGCTGATCCAAGAGGGGCAGGCTGCAAAGCCTGCCCCGCAAATCATGCGCCGCTGCGCGATCCGCAGGCACAATCAAATACAGAGTTTATCCGTTCGGCACTTTGAAGATTAGGGTCAGGACCCATTGATTTGAATTGACGGCCATAATTCAGACGGGCGTATAGGAGCCTGCTGGTGAGTCATCTGTTTTGGCTGACGGACGAGCAAATGGCTCGTCTTCAGCCGTACTTCCTCAAGAGCCATGGTCGCCAGCGCGTTGATGATCGACGCGTTCTGAGCGGCATCATTTTCGTCAACCGCAACGGCCTCAGGTGGTGCGATGCGCCAAAGGAATATGGCCCCTCCAAGACGCTTTATAACCGTTGGAAACGTTGGGGAGACAAGGGCATCTTTCTCCAGATGATGGAAGGCTTGGCTGTGCCTGAGGCTGCAGAGCGCACGGCCATCATGATCGACGCGACCTATCTCAAGGCCCACCGCACGGCTTCCAGCCTGCGGGTAAAAAGGGGGGCTCGGGCCGCCTGATTGGACGCACGATAGGCGGCATGAACACCAAGCTTCATGCCGTAACGGATGCGAATGGTCGCCCGATCACTTTCTTCATAACGGCCGGTCAGGTCAGCGATTACACCGGTGCTGCCGCTTTGCTTGATGAACTTCCCAAGGCCAGATGGCTACTGGCCGACCGTGGCTATGATGCCGACTGGTATCGTGAAGCTTTACAAGCGATGGGGATCACTCCCTGCATTCCGGGTCGGAAATCCCGCAATAAAACCATCAAATACGATAAACGCCGCTACAAACGGCGCAACCGGATCGAGATAATGTTCGGGCGTCTCAAAGACTGGCGGCGTGTCGCTACGCGCTACGACAGATGCCCAATGGCCTTCCTCTCCGCCATCGCTCGCGCGGCAACCGTTATCTTCTGGCTCTGATCAACGAGTTCTGAGCGTAACTATGTCCGCGTTAAACAGTCGCGAAATGTCAAATGGTCTCAAAGAGAATGCGGCCGTTAGAACGTTGGAAAACTTGTTCAATCAGCAATTTTACCCTCGCGACCCAGCAGTGCGCTCTTTCGCTCCAGGCCCCAGGTATAGCCAAAGAACGTTCCGTCGTGCCTAACAACCCGATGACAGGGAATGGCAACCGCCAGAGGATTGCTGGCGCAGGCACTGGCGACCGCACCGATGGCACGAGGGTGGCCCACGCGCCGCGCTATCTCGTCATAGGAGATCGTCTCGCCGGCTAGCACCGATTGCAGAGCATGCCAAACCCGTCGTTGAAAGACTGTCCCTCTGATATCCAGGGGCAGATTCGTCGTCAGATTCGGCGCCTCGATCAAACCGACGACCTGGGCGATGGTACGCTCGTAGTCTGCATTGCCGCCGACTAGAATCGCCTTCGGGAAGCGGTCCTGGAGGTCGCGAAGCAGTGCCTCGGCATCGTCGCCCAT
Protein sequences of DBSCAN-SWA_42 >CP036360|435083:437362|435083_435914_+|QBJ16865.1|DBSCAN-SWA MGFVKTKDGAEIFYKDWGHGQPIVFHHGWPLSGDDWDAQMLFFLEKGFRVVAHDRRGHGRSSQVAEGHDMDHYASDAAAVAEHLDLRNAVHIGHSTGGGEAARYVARHGKGRVAKLVLVGAVPPIMVKTAANPGGLPIEVFDDFRKNLVANRAQFFLDIPSGPFYGFNRPDAKVSQGVIQNWWRQAMVGGAKAHYDSIKAFSETDFTEDLKSITVPTLILHGSDDQIVPIDDSAKLAIKLLKNGTLKIYDGYPHGMCTTHADVINPDLLAFVQAAG >CP036360|435083:437362|436083_436843_+|QBJ16791.1|transposase|DBSCAN-SWA MSHLFWLTDEQMARLQPYFLKSHGRQRVDDRRVLSGIIFVNRNGLRWCDAPKEYGPSKTLYNRWKRWGDKGIFLQMMEGLAVPEAAERTAIMIDATYLKAHRTASSLRGKKGGSGRLIGRTIGGMNTKLHAVTDANGRPITFFITAGQVSDYTGAAALLDELPKARWLLADRGYDADWYREALQAMGITPCIPGRKSRNKTIKYDKRRYKRRNRIEIMFGRLKDWRRVATRYDRCPMAFLSAIARAATVIFWL >CP036360|435083:437362|436942_437362_-|QBJ16792.1|DBSCAN-SWA MGDDAEALLRDLQDRFPKAILVGGNADYERTIAQVVGLIEAPNLTTNLPLDIRGTVFQRRVWHALQSVLAGETISYDEIARRVGHPRAIGAVASACASNPLAVAIPCHRVVRHDGTFFGYTWGLERKSALLGREGKIAD |
3 | Mycobacterium_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_43 |
442637 : 453120
Sequences of DBSCAN-SWA_43
Nucleotide sequences of DBSCAN-SWA_43 >CP036360|442637:453120|DBSCAN-SWA CTCAGCATAGTCCGGCGTCCCTGCCGGATGGATCACAACGGTCGTCCCGCGAGGAGGTCGACATGTCTGCGAGTTCGAGCAGCTTTCCATCATGAATGCGATAGACCCTGTCCGCGGCGATCCTGGCAGCGGCGAGGTCGTGCGTCGCGATAACTATCGCGGTGCCATAGATGCGGCGGATGCGGCCGAGCAGGTTCAGCATCGATGCCGCGAGCGACGTGTCCAGCGCGCTGATGGGTTCGTCACACAGGAGGAGCCGCGGGGAAACCGCAAGCGCTCTGGCGATCGCGGCACGCTGGCATTGACCGACAGAAAGCTCTGAAGGCAAGGCATCGGCAAGCGAGCGCTCGAGATCGACGAAATCCATCAATTCCGCCAGGCGCTTGTCGCGAGGCACCTTTGCCACCTTCGCGGAACGCAACCCTTCGACGATCTGTTCACCGATAGGGACCCAAGGCGTCAGGAAAGATTTAGGGTCCTGGAAGACCAGTTGAGGGCGGCGGCCGTCCAACCGAACCGTTCCGCGGTCCTGTTCAAGGAGCCCAGCAGCGATCTTGAGGAGGGTGGATTTACCGATGCCGCTCTCGCCTAGAAGCGCCACACACTCTCCGGCCTTCACGGTGAGGTCGATCTCGGCCAGGACATCGTAAGAGCTAACCCGAAAGAGGCTTCTCGTACGAAAGGCCTTCGAGACCCGGCGCATGTCCAACACGACTTCCTCTCGATTCGAGGAGAACGGGGCCCAAGGAACGCTTTGCTGCACGATGTTTTCGAAATGCGAATTGTCGATAGCTCGATCGCCTTGGGCGAGGGTTGGCAATTGCGCGGATTTGTCGACGGCTCCATCATACCGGATCGCGAGGAGCCCGACGCTGTAGGGGTGGCGGGGGCGGGCGAAGATCGTCGAAGTCGGCCCGACCTCGACGATCCGCCCATCTTCGAGGATTGCCGTTTGATCCGCTATCTCCGCTACTGCGAGATCGTGCGTACTGAGCAATACGGCCGCTCCGTGTGCTGCCAGATTCCGCAGAAGCGCGGTGACCCTTCGTTTGTTATCAAGGTCGAGACCTGTCGTGGGTTCGTCAGCCACGATCAGCGGCACGGGTACGGCCGTCGTCATAGCGATCAGAACACGCTGTCTCTCTCCCCCGGAAAGCTGATGCGGAAGGCTGCCCATGACGCGTTCGGTGTCAGCGATCCCCGCCAACCGCAGCCGTTCGACCGCTGTGAGTTCCCCGCACACTTCCGCCATCTGCGCTCGAACGCTCATTGTCGGGTTCAGCGCGTCGAAGGGGTCCTGGGGGATGATCCGCACCAGGCTCTTCCGGGCAGCTCTAAGGTTGGGCTCGCTCGTACCGACCAATTCCTCACCGACGACACGGATAGAGCCACTAATCCGAGGCTGGCTGTCGCGCGGCAAAAGACCATGGATGACGTGAGCAAGGGTCGACTTTCCAGAACCGCTTCGACCCACAAGCGCGGTTATCTGCCCGGAATGAACCTCTAGCTCCAGCCCTTTGAGAACGTCGTTCCTCTGTGCTCCACGCAACATCGAAACGTGCAAGCCACGGATTTCGAGCGTGGCAAACGGCGGAGTACGGCTTCCGACCAGCTCTCCTACCCCGACGCTCATCGGCGGACGCTCTCGTAATCTATGTTTCCCGGGGGATAGACGGGTCTCAAGCCGAAATCGCGGATGTCGTTGGTATGGACCACGACATCCTGCATTTCGACAAGCGGTATGGCGATTCCGGCGTCGAAGATCATCTGTCCGGCTTGTTCGTAGAGTGCGTTCCGTCGAGCGTCGTTCCCCAGACTGCCCGCCTCCTCGATGAGGCGGTCGGCTTGGCCGATCGAACGTCCGAAGAAGTTGACCGGAGCGTCCTTCGTGAAAAACGCTTTTGCCTGGTTCACCGGGTGAGCCGCATCCGGACCTGCGATCGTAAGGAGTAAGTCCGGGGGCGACGTCGCTTGCTTCAAGTTGAACGCGGCTCCCGGCGGCAACGCGTAGGACGTCGCCTTGACGCCGATGGATGCCAGCTGCGCGATCAGGAGGCCCGAGATTTTCTGATAGCCAGGTGTGGCGCTGTGCTGGCCGATGACGAGACTGACAGGACCGCGGCGTGCGACTATCGCCTTGGCGCGCTCCATGTCGACAGGGAAGACGAGCGGTCTGCCGGGATCCAGCACCACGCGGGGATAGACCGACCTGGCCACGGTCGCATACTGGCCGAACGCTTCATCCGCCCAGAGAGCCGGATTGATGGCAGTCAACACCGCCTGCCTGATTTCCGGATCATCGAGGGGCGAGCCGGATTTCGCGAACAGGGTGTAGAGGCTTACGCTCGGGGCCGCCGAGACCGCGAGATCGGATGGAATGCCGCCGAGTTGCGCGATAGGATACCCCGTCGGCACGGCGTCGATGTCTCCAGCTCTCAACTGCAGTATCTGCTGGCCAATGTCCGGGACGACCGGAATCTGCACTTCCGGGAAAAACGCTTTCTGTCCCCAGTACTCGTCATTCCTCTTGAGAACATAGCGGTCACCGCGACTGAATTCCGCCAATGCATAGGGACCTGTTCCATCGGCGTGTTCACCGAGCCATCCGGTGGCTTTGTCGCCTTTATCGTGTTCGGCTAGCGCTCGCGGGCTGATCACCTTTGGACCCCAAGGACTCGCGAGGCTGTCGATGAAAGAGGGTTGCGCGCGCCTGAGGACGATCTTGAGGGTCAATCTGTCCACTGGTTCGAGTCTTTCGACGTTGGACAGGAAGTACGACAGAGCCAGCTTGCCGTCCTTCCTTCGGTCGAAGGAAGCAACGACGTCGGACGCACTCATCTCGGTGCCGTCGTGAAACTTTACGCCGTCAACCAACCTGAAGACATATTCGCGACCATCGTCGGAGACTTCCCAGGAATGTGCCAGTCGGCCGACGACCTTTACGGTGCCCGGCTCGTATTGAACCAGACCCTCGTAGACGTTGTTGATCGCGCTCATGGCACCGATTTCAAATCCGTTGTCCGGATCGAACGTGGCGATGTCCGTTAGAAGCGGAACCCGGAGAACCTCCTGCGCGACCACGGGTTGCGCAATGAGGGCCGCCGCGAAAACACACGCGATCGCACGACTGACAGCAAAGGCAAAACCGCCCGGCAACATGTTGGCATCCTGTTATGAACTCAACCCGACTCCGGCGGACTTTCCGCACAGCGAAATCCTAACATCCGGAATTGCGCCGTAACCAGTTCCGACAACCAAGGGAGCTTGTACAAATCGCGCCAAAAATCACGTCGCCAGTTGCCGGTGTCATTGCCCGATACCGCGGGGAGCCGAGTGTTGAGCTCGTCGGTCCGTCCAGATCTCGGATCCGGGCGGATGGGGCAGGGTTCCAAAACGATGCTTCTCTTGTTAACATTGCTGTATGATAGGATCGCGTGGGCGCGCGCCGCTTTTGAGGGGATGACCATGGAGGGGGCTGCAACGCGGCCGTTGGAGCTGCTGCCGATCGTGACGCTGGTTGACGACGATGCCCGCGTGCTGGAGGCGATGGAAAATCTCCTCTGCTCAGTCGGTATCGACACCGTATCCTTTGCGTCCGCACGCGAGGTGATCGAGGCCACCTTGCCGGATCGGCCGGGATGCTTCGTGCTGGACGTCCGTATGCCGGGATTGAGCGGGCTAGACCTACAGGATCATCTCCTGAAACAAGGCAATCACACCCCGGTCATATTTCTCACGGCCCACGGCGACATCGCCATGAGCGTCGAGGCCATGAAGGCCGGAGCCAAGGACTTTCTTACGAAGCCCGTGCGCGATCAAACGTTTCTTGATGCCGTCTCTAGGGCGATCAGCGCCGACCTCGAACGGCGCGAGGCTCAAACGAACTCGCAAAAGCATATCGCGCTCTACCAAGGTCTCACCGCCCGGGAACGTCAGGTGCTACGCTTCGTGGTCGAGGGCAGCCTGAACAAGCAGATCGGTTTCGAGCTGGGCATTTCGGAGGTCACGGTAAAGCTGCACCGCAGCAACATGATGAAGAAGATGCAGGTGACGACATTCAACCAGCTCTTCACCGCCTGGCAGGCGCTGCCCTCCCATATACGAGAGAACGTCGAATAGCGGGGCCGATCATGCCTCTTACCTGTCCTACGAGATAGGTTTCGACCACCGAACCATCGGTATAGGTTGAACTCGGTCATGAACGACCCGCTACCGTAGCCCGGCGCGTCCTGTCATCATCCTTGGAAGGCGTGATGGATTTCCAGACATCACAGCAGCGCCGTTGGTTTGCTTTTGCGGCGAGTGCGGTCGGCGCTGGATTTGCCGTCCTTTTGAGCGAGCCTGTCGGCCTCCTGATTACCGTCGCTTTCGCAACTATTTGCTTTGGTTGGAGGTGGGGTATCGCTACGGCCATCGTCATAAGCGCAGCAGCTGCCGCGCTTATTTGGGTAGAAGGCGATCTTGCAAACTTGAGCATCAGGGGCTGGACTGCCTACGCGATAAGCGCATTCGGGATCTGGGCAGTTATCACCTCCTATCGGACGATATCCTTCTACGATCAGGTCTATAAGACTGTCCGCCCGACTCTGGAAGATATACCCGGACTTGGATGGTCGGCCTATCCCGATGGCCGCATGAGGTTCGTGAACCCCGCCGCCACGGAGTTCGTCGGGATCACTCCGGAGGAAATGAAGGAGAGGATGGAAGGAACCGACACCGCGTGGTGGACACCCTTCGTTCATCCCGATGACCGGGAAAGAAGCTTGGCGCTCTGGCGCAACAGTCTCAAGACGGGTGAACCCATTGTCGACGAGCAACGCGTACGCCGCTTCGATGGGACCTACAGATGGTTCCGGGATTCCGCGATCGCCTCTCGCGACGAAAACGGAAAGATCACCGCGTGGTACGGTTCGACGGTCGACATTACCGATCAGAAAAACGCCGAGGCGGCGTTGAAGGCCAGCGAGCAGCAGCTCCGCGAGCTCATCAACACTGTGCCCGCACTCATCTGGCGCGCGGGTCCTGACGGAAAGACGAACTATGTCAATGACCAATTGATCCGATGGTTTGGACTATCGGCTGGAGAAAGAGATGTCGGCGGACTAGAGAGCCTGCTCATGGATGCGGTTCACCCCGAAGAACGTCAGGAGATAAGGGCGACTGTCAGTCGGCTCTTTGCCACACGAAAGTCATTCTGCCTCAAGTACCGTCACAGGCGTTCGGACGGTTCCTATAGATGGACGAATGCTACCGTTCAGCCGCTTCTCGATATCGAGGGGGCGATCGTCCAGTGGTACGGCGTTTTTCTCGACATCAACGACGAGATCGAAGCGCTGAACGCGCTGCGCAGAAGCGAGACGGAGACGCGCCTCATAGTGGATACGGTGCCATCGTTGATATGGCTGATGTCGACCGAAGGTTTCGTCTTCCATTTCAATGACCGAATGGTCGAATGGACCGGGATCGAACCGGGGCCGAGCCCCAGCAATCCGTCTGCACCTCGGCCAACGTATTCGGAGCTGATACACCCGGATGACAGCAGGCGCATTGCCGAAGAATTCAAGAATGCCTTAGAAACTGGCACAGCATTGCATACCAAGGGAAGGTTGCTCAGGAAAGACGGGCAGTTCCGATGGCTTGACTCGCGCGTCGAACCGCTCAGGGACGACACTGGCAACATCATCCGGTGGTATGGTGTTTCCATCGATATCGAGGAGGAGGTTAGAGCACAGGCCGCTTTGCGCGAAAGCGAACGTTACCTTCAGCATATGATAGATACCGTGCCCGTGGGGATTGTCCTTTCCGACAACAGCGGAACGCCTGTCTATGTCAACAAAAGGTTGGTCGACAACAACGGCTTGTACGTCTCTCGCAAAAGCGAAGGCTCTCGGCTCGACATCAGTCCGGCGGTGGAAGACCTCGTCCATCCCGACGACAGGGAAACAGTCGAGAACCGCTGGGCGCTTGCGCGCCGTAAGGGAAAGCCCTACGCGATGCGTTACAGACAACGCAGAGCGGACGGCGTCTATCGCTGGATAGAGGACAGGAGCGAGCCTTTCCGGGATGACGGTGGCAAAATCCTGCAATGGTACGGCGTCAACTTGGACATCGACGACGAGGTGAAAGCGCAGGAAGCCCTCCGGATCGCCGACGAGCGTTTGGCCAAAGCCGCGAGAACCGTGAGCCTATCGGAGCTTTCGATTTCGATCGCACATGATCTGAACCAGCCACTGCAGGGGGTTGTCTCGAATGTAACCGCCTTCAAGAACTGGCTGAGGGCCACGCCGCCAAACCTCGAGCGGGCGACACGTACCGCTGAGTGGATCGTGCGGGACGTGGAGGCGGCGGCAGAAGTAGTCAGCCGTATCCGAACCATTTTCTCGCAGACAGAGCATAAGCGCGAGGCGGTTTCGCTGAAAGCAGTGATCGAGGACGTGAAGAGTTCGCTGGCCGACAAGTTGATCGCGGGGAAAATCAAGGTCCACGTCGATCTTGACGGTGACCTACCCGACACTCTGGCAGACCGCGTTCAAATCGAGCAGGTGGTTCTGAATCTACTCAAGAACAGTGTCGAGGCCTTTGATGAAACCCGGTCACGCAGTCGGTCAATCGAGGTACGTGCGAGAAAGGTGAACAGTGAGGTTGTGGAGGTCTCCGTCGGTGACAACGGTCCCGGTCTGACCGACCCGGAAAAGGTGTTCGAAGCTTTCTACACGACCAAAAGCGATGCTCTCGGGATCGGTCTTGCCATCTGCCGATCTATCGTTGAAGCCCATGGCGGTCAGCTCAGTGCAGGCAACCGTGGAAGGGGTGGGGCCTTGGTGGCTTTCACGCTTCCGACCAATGGAAGGGCGGGCATCGGCGCGGCAGGCAGTGACGTCGTCGATATCGCGACTTTGGAAAGGGGCGAATTGCAATAGCAGTTAGGTGGGATTCGGCTGGACTGCCGGGTCCTACAACCTAGTTCAGTGACTTGACTGCCGGACACGGTCCACCAGCTTGAGCGAACGGTCAGACTGCACGACGTATGTTTCGGAATAGCGAGTGCCATCAGAATAAAAGTCATGATTGAACTGGCTACCGACGGGGGCCTTTTTGAGGCGTGTGGCATGCTGCCTCTCGTACGTTATGCTACCCGGAATGGGATCGATGGCCATAGCGGGAACTGCGAGGAGTGCCGAGAGCGCGAGTACAGCCGAATACGATAGCTTTTTTATGTTATTCGCCGTGAGGGGGGTCGTTTCCCTTGTTGCGGGACGAAAGTGAAGGTGATCGCTGGCGTTTTGCAGTTGTACGAAAGGGTACCCTTGCTCGCAGGCGGTTTTACGTGTGCTTCCCTTCTTAGTGGAGTGCTCCAAGACCTCCGGGAGCGGGTGTTCTAATCGCTGCAACAAAAAGCACGTCTCTTGCAAGCCGAAAATGTTGTATGTGCCCTGCGTCCATACAACCGTAAAGAATTGGTCAATGACAGACGATTGGGTAGATTGGTTTCCAACCGTCTTCTTCCCGCTGAAGGTTCTCGTGCTGGGCGTTGGCATGTTTTTCGCGCAGGTGGACGAGCAGCCAATTCCCCCAGGGCGCGAAAGTGGAGGAAATAGGATGTGTGCTGCGGCCTGAGCGAAAAGCGGTATCTGAAAGTTGCGGCTATTCCTACCAAAAACCAAACGGAGGTATATATAACGGCGCTCCTGCAATGCGTGGTATTGCGACAACCGGAGAAGTTTGAATCTCAACCGTCTCGTTTTCATGCTGATCAAAAAGTCCCGATTTGCCGTCCCGTGTTGCAAGAACCGATTATTTGACGCGTTTTTCTGAAATCCCGCTGCGGAAAGTGACAAGTTTCAAAAATGGCTGGCTCGCAAACCGCACACATCGCAATCATCGAAGACGACGTTCACCTACGGAATTCGATCAAGGACTTTCTCGATACCGCCGGATACACGAGCGAACTTTTCGGCTCGGCGGACTTTTTCCTGGACAGTGACCGCTACCGATCGTTCCGCTGCGTTCTCGCCGACGTAAGGATGCCGGGGATTTCGGGCATCGAGATGCTCAGATTGCTGAAGTCTCGGGAAGATTGCCCTCCGATCCTGATCATGACTTCCTACTCGGACAATCAAATGCAGGCGACGGCCCTGAAATACGGCGCGTCAGGCTTCCTAGCCAAGCCGATCGATACTTCATTGCTTCTCCAATTGATCCATGACGCCATCGGTAGCGACGCTTCGCGGGGCTGATAATCCACAGTGCTCCCGGTCAGTGCCGCGACCCCGGTACCCGAACGTATAGCTGTCGATCCACGCCGACTTCTCCAGAATGCGATCAACGTCATTTCGATGAGGAGAATTTTGATGGCAGAGTCTTCAGTCATAGCTTCGCCCGAAGTCCTAAGGCTGCCACAGCCCGATCGGGAAGCGAAGCCCTCTCCGGCATCGGAGACGGTCAACCTTCCTGTAGAGCCCAGTTCAGGTACTCGTGGATTGATGCGACGCCTCATGCTCGGCGGGTCGATCCTCCTTGCCGCAGCCTATGGCGCATACGAGGGGTTCCAGTACTACACCCTTGGCCGTTTCATTATCACGACCGACGACGCTTATCTCAAGGCAGACTTCACGGCGGTCGCGCCGAAGGTCTCCGGATATATCAGCGAGGTCCTCGTGACCGACAACGAGCACGTCAAGGCCGGGCAGGTCGTCGCGCGGATCGACGATCGCGACTTCCAGTCCGCGGTGTCGCAGGCTCGCGGTGATCTCGCAGCCGCCGAGGCGGCCATCGCAAACGTCGATGCGCAGATCGTGCTCCAGAATGCGCTGATCGATCAGGCGAAAGCGGCGCTTGCCTCTTCGGAAGCCAATCTCGCCTTTGCGTCATCCGATGCCAAAAGGTCCGAGTATCTCTATACAAACGGGTCCGGGACGCTCTCGCGAGCCGAGCAAACACAATCGGTGAGCGAACAGGCCAAAGCGGCGGTGGACAATAGCAAGGCGGCCGTCGTGGCAGCGGAAAACAAGGTTCCAGTGCTTGAGACCCAACGCAGCCAAGCCATCGCCCAGAGGGACCGCGCAAAGGCCGGAGTAAACCAAGCCGAGCTCAACCTGTCCTACACCAACATCGTATCGGCGATTGACGGAACGGTCGGCGCGCGTACCATACGTCTTGGGCAGCTGGTAAGCGCCGGCACACAGCTCATGGCGGTCGTGCCTCTCAACGCCGTCTACGTCGTGGGCAATTTCAAGGAGACACAGCTGACCGACGTCGTGCCGGGGCAGCGGGTGGCGGTCAAGGTGGACAGTTTTCCGGATGCGGCCATTGACGGGCACGTCGACAGCGTTTCACCGGGAAGCGGACTCGAGTTTTCGCTGTTGCCCGCCGACAACGCCACGGGCAACTTCACGAAAATCGTCCAGCGAATCCCCGTGAAAATCGTCATCGATAGCAAGGAGTATATCGGCCGTCTGCGATCGGGTATGTCGGTGGTTCCGAGTATCGACACGAGAGGTGGTCCTTCGACGCCATGAGGAGGGGTCGACAATCCTCAGTCTTGCCATCGTCGTTGAACATAGTGGCGACGCGAGGTATTTTTCCCGAAAACCGGGACGACCACACCCACCGCTAATTTCGGCGACGAAAAGCATAGCCGGGGTGATTTGCTTGCCGCCACCAATTCCGCCAAATCGTGATGGGGAGCGATCGCGCGAGTGGGCAGCCGTCTCCCACTCATCTTCGCCAACGGTCGTTCTTCTGCCAGCGATTAAAACCGAGTTGATCTGCCGATGCCAGTCGCGACTGCGCTCGCCCGTCGGGTGGACTTCGCGCCACATGATCTCGTGAAGCATGATTCCAAACGCGAACACGCTGGTCCATTCACCAAGCTCGGTCTTGTGCCATTGTTCGGGTGCCATGTACGGGCGCGAACCAGTGAACACGCCGATGTCTGCGGCAAGATTGACGCTACCAAAGTCCGCGACCAAAGGGCGAAACCACAAATCGGTCGGTGGCAGCGAAAAGTTCGTCCTCAGGTCTCGAACGAAGATGTTTTGCGGCTTGAGGTCTTGGTGAACCAAGCCGCGACGGTGACCGTGCGCGAGACTCGCCGAAAGCTGGATCAACAACGACAATCGCCCGAGATCACCCCAACGAGGATCCTCGATGAAATCGGACAGATCGCCCTCCCACCGCCGGAAAAAGGCGATGGGCGTGCCAAGGATAAAGATAACGTCAAAGGGCCAATGTACGTTAGGGTGATAATGCGCCGCCGTCTGCAGCTTGATCTCACGCAGGAAGCGCTCCGCACGTTCAGCTGGTGAGAAGACTTCGCGCCGTCGCGGGAATTTAGCGCAGGTCAGTCGTGGAAAGACGCTATCGCCGTTATCGAGAAAGACGATTTCACCCGAGAAGCCCGCGCTACTGTCTAACGCCTTTTGCGCTGCCCCAAAAGGCAAGAAGCTGATCGCGGAGTGGGATCGGAAGCTCGTAGAACGCAT
Protein sequences of DBSCAN-SWA_43 >CP036360|442637:453120|446889_449556_+|QBJ16801.1|DBSCAN-SWA MDFQTSQQRRWFAFAASAVGAGFAVLLSEPVGLLITVAFATICFGWRWGIATAIVISAAAAALIWVEGDLANLSIRGWTAYAISAFGIWAVITSYRTISFYDQVYKTVRPTLEDIPGLGWSAYPDGRMRFVNPAATEFVGITPEEMKERMEGTDTAWWTPFVHPDDRERSLALWRNSLKTGEPIVDEQRVRRFDGTYRWFRDSAIASRDENGKITAWYGSTVDITDQKNAEAALKASEQQLRELINTVPALIWRAGPDGKTNYVNDQLIRWFGLSAGERDVGGLESLLMDAVHPEERQEIRATVSRLFATRKSFCLKYRHRRSDGSYRWTNATVQPLLDIEGAIVQWYGVFLDINDEIEALNALRRSETETRLIVDTVPSLIWLMSTEGFVFHFNDRMVEWTGIEPGPSPSNPSAPRPTYSELIHPDDSRRIAEEFKNALETGTALHTKGRLLRKDGQFRWLDSRVEPLRDDTGNIIRWYGVSIDIEEEVRAQAALRESERYLQHMIDTVPVGIVLSDNSGTPVYVNKRLVDNNGLYVSRKSEGSRLDISPAVEDLVHPDDRETVENRWALARRKGKPYAMRYRQRRADGVYRWIEDRSEPFRDDGGKILQWYGVNLDIDDEVKAQEALRIADERLAKAARTVSLSELSISIAHDLNQPLQGVVSNVTAFKNWLRATPPNLERATRTAEWIVRDVEAAAEVVSRIRTIFSQTEHKREAVSLKAVIEDVKSSLADKLIAGKIKVHVDLDGDLPDTLADRVQIEQVVLNLLKNSVEAFDETRSRSRSIEVRARKVNSEVVEVSVGDNGPGLTDPEKVFEAFYTTKSDALGIGLAICRSIVEAHGGQLSAGNRGRGGALVAFTLPTNGRAGIGAAGSDVVDIATLERGELQ >CP036360|442637:453120|450483_450873_+|QBJ16802.1|DBSCAN-SWA MAGSQTAHIAIIEDDVHLRNSIKDFLDTAGYTSELFGSADFFLDSDRYRSFRCVLADVRMPGISGIEMLRLLKSREDCPPILIMTSYSDNQMQATALKYGASGFLAKPIDTSLLLQLIHDAIGSDASRG >CP036360|442637:453120|442637_444269_-|QBJ16799.1|DBSCAN-SWA MSVGVGELVGSRTPPFATLEIRGLHVSMLRGAQRNDVLKGLELEVHSGQITALVGRSGSGKSTLAHVIHGLLPRDSQPRISGSIRVVGEELVGTSEPNLRAARKSLVRIIPQDPFDALNPTMSVRAQMAEVCGELTAVERLRLAGIADTERVMGSLPHQLSGGERQRVLIAMTTAVPVPLIVADEPTTGLDLDNKRRVTALLRNLAAHGAAVLLSTHDLAVAEIADQTAILEDGRIVEVGPTSTIFARPRHPYSVGLLAIRYDGAVDKSAQLPTLAQGDRAIDNSHFENIVQQSVPWAPFSSNREEVVLDMRRVSKAFRTRSLFRVSSYDVLAEIDLTVKAGECVALLGESGIGKSTLLKIAAGLLEQDRGTVRLDGRRPQLVFQDPKSFLTPWVPIGEQIVEGLRSAKVAKVPRDKRLAELMDFVDLERSLADALPSELSVGQCQRAAIARALAVSPRLLLCDEPISALDTSLAASMLNLLGRIRRIYGTAIVIATHDLAAARIAADRVYRIHDGKLLELADMSTSSRDDRCDPSGRDAGLC >CP036360|442637:453120|446101_446755_+|QBJ16867.1|DBSCAN-SWA MEGAATRPLELLPIVTLVDDDARVLEAMENLLCSVGIDTVSFASAREVIEATLPDRPGCFVLDVRMPGLSGLDLQDHLLKQGNHTPVIFLTAHGDIAMSVEAMKAGAKDFLTKPVRDQTFLDAVSRAISADLERREAQTNSQKHIALYQGLTARERQVLRFVVEGSLNKQIGFELGISEVTVKLHRSNMMKKMQVTTFNQLFTAWQALPSHIRENVE >CP036360|442637:453120|450972_452154_+|QBJ16803.1|DBSCAN-SWA MRRILMAESSVIASPEVLRLPQPDREAKPSPASETVNLPVEPSSGTRGLMRRLMLGGSILLAAAYGAYEGFQYYTLGRFIITTDDAYLKADFTAVAPKVSGYISEVLVTDNEHVKAGQVVARIDDRDFQSAVSQARGDLAAAEAAIANVDAQIVLQNALIDQAKAALASSEANLAFASSDAKRSEYLYTNGSGTLSRAEQTQSVSEQAKAAVDNSKAAVVAAENKVPVLETQRSQAIAQRDRAKAGVNQAELNLSYTNIVSAIDGTVGARTIRLGQLVSAGTQLMAVVPLNAVYVVGNFKETQLTDVVPGQRVAVKVDSFPDAAIDGHVDSVSPGSGLEFSLLPADNATGNFTKIVQRIPVKIVIDSKEYIGRLRSGMSVVPSIDTRGGPSTP >CP036360|442637:453120|444265_445795_-|QBJ16800.1|DBSCAN-SWA MLPGGFAFAVSRAIACVFAAALIAQPVVAQEVLRVPLLTDIATFDPDNGFEIGAMSAINNVYEGLVQYEPGTVKVVGRLAHSWEVSDDGREYVFRLVDGVKFHDGTEMSASDVVASFDRRKDGKLALSYFLSNVERLEPVDRLTLKIVLRRAQPSFIDSLASPWGPKVISPRALAEHDKGDKATGWLGEHADGTGPYALAEFSRGDRYVLKRNDEYWGQKAFFPEVQIPVVPDIGQQILQLRAGDIDAVPTGYPIAQLGGIPSDLAVSAAPSVSLYTLFAKSGSPLDDPEIRQAVLTAINPALWADEAFGQYATVARSVYPRVVLDPGRPLVFPVDMERAKAIVARRGPVSLVIGQHSATPGYQKISGLLIAQLASIGVKATSYALPPGAAFNLKQATSPPDLLLTIAGPDAAHPVNQAKAFFTKDAPVNFFGRSIGQADRLIEEAGSLGNDARRNALYEQAGQMIFDAGIAIPLVEMQDVVVHTNDIRDFGLRPVYPPGNIDYESVRR >CP036360|442637:453120|449601_449853_-|QBJ16868.1|DBSCAN-SWA MKKLSYSAVLALSALLAVPAMAIDPIPGSITYERQHATRLKKAPVGSQFNHDFYSDGTRYSETYVVQSDRSLKLVDRVRQSSH >CP036360|442637:453120|451968_453120_-|QBJ16804.1|DBSCAN-SWA MRSTSFRSHSAISFLPFGAAQKALDSSAGFSGEIVFLDNGDSVFPRLTCAKFPRRREVFSPAERAERFLREIKLQTAAHYHPNVHWPFDVIFILGTPIAFFRRWEGDLSDFIEDPRWGDLGRLSLLIQLSASLAHGHRRGLVHQDLKPQNIFVRDLRTNFSLPPTDLWFRPLVADFGSVNLAADIGVFTGSRPYMAPEQWHKTELGEWTSVFAFGIMLHEIMWREVHPTGERSRDWHRQINSVLIAGRRTTVGEDEWETAAHSRDRSPSRFGGIGGGKQITPAMLFVAEISGGCGRPGFREKYLASPLCSTTMARLRIVDPSSWRRRTTSRVDTRNHRHTRSQTADILLAIDDDFHGDSLDDFREVARGVVGGQQRKLESASR |
8 | Bacillus_virus(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_44 |
481153 : 489030
Sequences of DBSCAN-SWA_44
Nucleotide sequences of DBSCAN-SWA_44 >CP036360|481153:489030|DBSCAN-SWA CATGGCCAACGACAAGCTCTCGACCTACAGGCAGAAACGCGATTTTCAGAAGACGCAGGAGCCAAGCGGAAAAACGAAGCTGAAAGCTTCAAACCGCAGGCGCTTTGTCATCCAGAAACACGACGCCACGCGGCTCCATTATGACCTGCGCCTCGAACTCGATGGCGTCTTCAAGTCTTGGGCTGTGACCAAGGGACCGTCTCTCGATCCCCAGGACAAGCGCCTGGCCGTGGAGGTGGAAGACCACCCGCTGGACTATGGTGACTTCGAAGGCACGATCCCCAAAGGCCAATATGGCGGCGGCACCGTCATGCTCTGGGATCGCGGCTACTGGGAGCCGGAAGGCAACCGGACACCCGAGCAGGCGCTCGCCAAGGGCGACTTCAAATTTACACTGGAGGGAGAGCGGCTGCATGGCAGCTTCGTGCTGGTGCGGATGCGCAATGACCGTGACGGTGGCAAACGGACGAACTGGCTGCTGATCAAGCATCGCGACGACTTCTCGGTTGAAGAAAACGGCGCCGCCATCCTTGACGAAAACGACACGTCGGTTGCCTCGGGCAGAACCATGGACGCCATCGCCGCCGGTAAAGGAAAGAAACCAAAACCGTTTATGGTGCAGAGCGGCGATGTGCAGGCCGACGCCGTCTGGGACAGCAATCACGGACTGGCGGCAGATAAACGTGCGGCGGATACGAAGAAAAGGCGCCCTGCATCGCCGAAAGCGGCGAAGTCCGCGATGCCCGACTTCATACCTCCGCAGCTTTGCGAAACGCTAGAACGTCCACCCTCGGCCGATGGCTGGATCCACGAGATTAAATTCGACGGATACCGTATTCAGGCGCGCATTGAAAACGGCGAAGTCACGCTGAAGACGCGCAAGGGGCTGGACTGGACCGCCAAGTATCCGGCGATTGCGACATCGGCCGCAAGCTTGCCTGATGCCATCATCGACGGTGAGATCTGTGCGCTCGATGAAAACGGTGCCCCGGACTTCGCCGCACTTCAGGCGGCGCTGTCAGAAGGTAAAACGGATGCGCTCGTCTATTTTGCTTTCGACCTGCTGTTCGAGGGAAGTGAGGATCTAAGGCAACTACCGCTTACGGAACGCAAGAAAAGGCTCGAAGCGCTTCTGAGCGAGGCGGGGGAAGATCCGCGCCTGCGGTTCGTCGAACATTTCGAGACCGGTGGCGATGCAGTATTGAAATCGGCGTGCAAGCTCTCGCTGGAGGGCATTGTCTCGAAGCAGGCGGATGCCCCCTATCAGTCGGGGCGTACCGATACCTGGGCGAAATCCAAATGCCGCGCCGGACATGAGGTTGTCATTGGCGCTTACGCCAAAACCAACGGCAAGTTTCGTTCACTTCTTGTCGGCGTCTTCAGGGACAATCACTTCGTCTATGTCGGGCGGGTCGGGACCGGCTATGGTGCCAAAACGGTGGATACAATCCTGCCGAAGCTCCGGGAACTGGAAACCTCAAAATCGCCCTTTACCGGAATAGGCGCTCCGAAAAAGGAACCCAACATCGTCTGGGTCAAGCCCGAACTGGTGGCCGAGATCCAGTTCGCTGGCTGGACCGCGGACGGACTGGTCCGTCAGGCAGCCTTCAAGGGTTTGCGCGAAGACAAGCCGGCGAAGGAGGTCGAGGCCGAAACGCCGGCATCGCCCGGGAAAACTGAAACGCCAACGCCGGCCCGGCCAAAGCCGTCGCGACCGGCACGCGGCAAGAACACCAAGGCAGAAGTGATGGGAGTGATGATCTCCAGCCCTGACAAGCCGCTCTGGCCCGATGCAAATGACGGTGAGCCGGTCACCAAGGAGGATCTCGCGCACTATCATCAGGCCGTTGGTCCATGGCTGATCGACCATATCAAGGGACGGCCATGCTCTATCATCCGGACGCCTGACGGCATCGGTGGTGAGCAGTTCTTCCAGAGACACGCGATGCCGGGAACGTCGAACCTGGTCGAACTGGTCAAGGTGTTCGGCGACAAGAAACCCTATCTGCAGATTGATCGCGTTGAGGGCCTTGCCGCCATCGCCCAGATCGGCGGCGTCGAACTGCATCCCTGGAATTGCGAGCCCAACGAACCGGAGGTGCCCGGTCGTCTTGTCTTCGACCTTGATCCGGGTCCGGACGTACCCTTCTCCACCGTAGTCGAAGCCGCGCGGGAAATGCGCGACCGGCTGGAGGAGCTTGGCCTCGTCAGCTTCTGCAAGACCACCGGCGGCAAGGGCCTTCACGTCGTCACCCCGCTTGCGGTACCGAAGGGAAACAAGCTGAGCTGGCCGGAGGCGAAAGGCTTCGCACACGATGTGTGCCTGCAGATGGCACGTGACAACCCTGACCTTTACCTGATCAAGATGGCGAAGAACCAGCGCAACGGCCGGATCTTCCTCGACTACCTGCGCAATGACCGAATGGCGACTGCTGTTGCGCCGCTGTCACCGCGCGCCCGGCCAGGCGCACCAGTATCGATGCCGCTGACCTGGAAGCAGGTAAAAACGGATCTCGATCCGAAACGCTTCACCATTCGCACCGTACCGGCGCTTTTGTCGAAGACAACCGCCTGGCAGGATTACTGCGACGGGCAGCGTTCGCTGGAGCAGGCCATCAAGCGGCTGACGAAGGCGATGAAACAGGCCGCCTGAGAGCGGCCTGGATCAAGACCCGTAGCGCCGGTCAAGGTCGCTCAGCGCAGCCTGAACCTCATCGATTGAGACAGGCTCCGGCGTTCCAGGCATATGGCGGTATGCCGGCCTTGAGTCGATCGTGTAGAGGTCCGAAGCCCAGGCGGCCAATATGGTCCGTTTTTCGTCGACCTTGAGCGTGGTTGCGTCCACCACATCGATCGGCCGCGCAATTTCCATCCCCTTGAGCAGGATTGTCGTGCCCATCAGGGCGATATCGCTCGCGGCAAGCTGCGGATTCTCAAGCATGTCGGCTTCCTCAGCGAACGTGCGGATCCTTGTCGTGCGAGTGACAGGACGCGGAGATGACATCAATGTCCGGGGCCAGCAGATCGTGTTTGGCGGCAAAGGCTGCAAACAAACCCCGGGCGGTGTCCGCTTCGATCTCACCGCGCAATGCCGCGCTGCAGGCCTTCAGCGCCATTGAATGCGCCGTATCGCGCATCGATGCCGGCCATTCGGCGAGATGCCGGTAAGCGTCGATGACGCTGCGAACCTCGGTCGGAAATCCAAGGCCGATAAGGATGGTTACGGGCTCTTTGAACATATCGGGTTTCATCGCGATCTCCTTCCTACCCCCGATGTTGACGGGCGCCGCTGTAACGGCGCCCGTTTTCAAAAATTGATCAGGCCACCTTTTGCGCTTCGATCTGCGCAGGGGCAACCTTATGCTGAAGGGCCGGGCTGGTCTGGATATCGATCTTGCGCGGTTTCAAGGCCTCAGGAATCTCACGAACGAGATCGATCGACAGAAGACCGTTGCGGAGATCGGCGCCATTCACCCTGACATGGTCGGCAAGTTCGAAACGGTGTTCGAAGGGCCGGCCGGCTATACCGCGATGAAGGTATTCATCGGCGGAAGCGTCCTGCTTCTTGCCGGTAACGGTGAGAAGGTTGGATTGAAAGGTGATGTCTAGCTCGTCCTCGGCAAACCCTGCGACCGCGACCGAAATCCGGTAGCTGTCATCGCCGGTCTTGACGATGTCATAGGGCGGCCAGTCGCTCATCGAGCGGGCGCGCTGGGCATTTTCAAGAAGATTGAAGATCCGGTCGAAGCCGACGGTCGAGCGGAACAGGGGTGCGTAGTCATAAGATGTTGCCATAGCCATATCCTCCTTGAAGCAACATGGGTACAGAAGCGCCGAAACGCGGTGCTTCCAGCAAGTCCAGCCCTGTCTCAGCAACTGGCGACAATCGATTTGGTTTTCTGGAATTTGCGATTCAAGGGCGCCGTCAGAAAAAAATGACATCCAGGCGCCGGCCAGGACCAACCCATTTGAGTTCATGGACCGCGGCATTGTCCTGCACCTCGCGCAGGCCTTTATAGGACGCATGACGCAGGTTGCCATCGTCGGTCCAGCCACGGAACTCTATTTCGGCGATGAGCGTCGGCTGCGCGAAGACGTAATTCCTGCCCTTGAGCGGAACGACCGGTGTCTTCGTCTTCAGCCTATCCAGTGTCTTCTTCAGATACGCCGCATCTTTCTCCTTGAACCCGGTTCCAACGGCGCCAACATAGACCCAGTCATGTCCATGTCTCGCCGCCAGCAGCAAACTGCCAATCCCGCCGCGCGCCGCCGCCGACTGCTCGTAGCCGACAATCATGAAGCTCTCGCTTTGAACACACTTGATCTTCAGCCAGTCTCCGGTCCGGCCCGAGCGATAAGGGCTGTCACGATGTTTTGCGATGATGCCTTCCATGCCGAGAGAGCACGCGCTCGCCAATAATTCAGCGCCGTCCGCTTCTATTTCTTCGGACAGCTGGATTGCGCCGGTCGCGTCTTCAAGCAGGTCCCCCAGCAAGTGCCGTCGGACCGACAGCTCGGTACGAGTGAGATCGTGGCCATCGAAATACAAAAGGTCGAAGGCGAAGAGTACGGCTTCTGTCGACGCCCGTTTGCCGCCTCGCCCTCCGAGCGAGCGTTGCAGCGCGCCGAAATCGGAGCGACCCTCGCCGTTCAGCACCACCGCTTCGCCGTCAAGGATGGCTGTTGCAACTCCAAGGTCCCTGGCCGCTGCGGCGATAGCGGGAAACCGGTGCGTCCAGTCGTGGCCGCCCCGGGTAATGATCCGCACACCCTTCGGCTCGATATGGACCGCGAGCCGATAGCCGTCCCACTTCATTTCATAGACCCACTCGGGGCCGGACGGCGGGGCCGTCTTCAGGAGCGCCAGACATGGCTCGACGCGATCTGGCATCGGGTCGAACGGCAGATTGGGCTGAGCCGGATCGCGCCTGCGGCGCGGCTGCGATTTTGCCGTCAGGCTGTCATTCTGCAGGAGCGGCTTGGCGGGACGTTTCATTTGCCCGCCCGGTTCTCGGCCGCAACCGACTTCTTCAGCGCGTCCATGATATTGACGACATTGCTGGCTGCCGACTTTGCCGGTGTGGATGTCCTGGGTTTTGCAGACTTCTTGAGCGCTTTCTTCTTGGCGGCAATGATGTCAAGCAACCGATCCTGAACCGGATCGATGACCATCTTGGGATCCCAGTGCTGCGTTTGCTTTTTGATCAGTTGCTGCACGAGACGCATCATCTCGCTATCGGCCGTTTCATCTTCTTCGATCCCGCCAAAGTAAGTATTCTCGTCACGGACTTCATCCCCATACCGCAGCGTCCAGAGAACGATACCCTTGCCGCGTGGCTCAAGCATGACGGCGCGCTCGCGGCGGGTAATCACTAGACGTGAAATCCCCACCATGTTCTGCGCTTCCATCGCATCCCTGATGACCGAGAATGCTTCCTGGCCGACGGCATCGTTGGGTGAAAGATAATAGGGCGTGTCGAGCCATATCCACTCGATGCCGTCACGGGGCGTAAACGTCGAGATGTCGATGGTCTTCGTGCTGTCGAGCGCAACGTTTTCCAGTTCTTCGTCTTCGAGGATGATGTATTCGTTCTCGCCACGCTCATAGCCCTTGACCTCATCCTCCTCCCTCACCTCTTTTCCGGTAACGCTATCGACGTAGTGGCTGACGACCCGGTTTTCGGTATCGCGGTTGAGGGTGTGGAAGCGGACCTTCTCACTTTCGGACGTTGCCGGCATCATCTGCACGGGACAGGTGACAAGCGAGAGTTTCAGATAACCTTTCCAATAGGGACGGACAGCCATGATTTCCTCCGTCAGCCGGCGCGGCGCGTCTGTGACGAGACCGCATTTGATTTGGCAGCAGCACCCGATTTTGAAGCCTTTTGACGGCCAGCACTCTTGTTGGCGTTAGAAGCGGCACGTTTCTGGCTGGATTTGGTGCCCATGCCAGCACTCTCACGCAGCGCCTGCAAAAGGTCGTTTTGCCGGGGGACGGCTGCGGCCTTCTTCTTCGGCAGTGCCCTGCCCTCGATCTTGGCCTTCACCAGCTCGGCAACGGCGGCCTCATAACGGTCGTTGAACGTGCTGGCGTCGAAGGTCCCCTTCTTTGTGCCGATAATGTGCTCGGCAAGCTCAAGCATCTCGCCTTCAATCTTCAGGTCCGGCAACTCCTCGAAGGCCTCTGCCGACGAGCGGACTTCATAGTCAAAGTTGAGCGTCGTGGCGATCAGGCCCTTCCCGTGTGGCCTGATCAGCACCGTACGCAGGCGCCGGAAGAGCACGGTGCGGGCGATCGCAGCAACTTTGGCTCTTTTCATGCCATCGCGCAAAAGAACGAAAGCTTCCGTCCCCATCCTGTCGGGCGCCAGATAATACGGCTTGTCGAAATAGACGCTGTCGATTTCACTGCAGGGGATGAAGGCGTCGATCTTCAGCGTCTTGTCGCTGTGGGGGACGGCGGCAGCCACCTCGTCCGGCTCAAGAACGATGTACTGGCCGTTCTCGATCTCATAGCCCTTGACTTGGGCGTCCCGCTCCACCGGATCGCCGGTCTCGGTGTCGATGAATTCCCTTTTCACCCGGTTGCCGGTTCGGCGATTGAGGGTGTTGAACGCGATCCGTTCCGATGATGATGCCGCAGTGTAGAGAGCCACGGGACACGCGACCTCTCCGAATTTGATAAAGCCCTTCCAGTTCGCTCGCGGGGCAACCATGACGCAAAACTCCTGCCACAACCAATGCGATTCAAGCGAATCAGCATGCATTTGTTCCGAGTCAAAACGAATCATTTTTCAAGGATTTAGGACCATGATCACAACTGGCGTTGCGATTCTGCGCTAGACGCTTGATTCCGGTGCATCTTTTTGCGGCGACGAACGTTTCCTCTTCACGAAACGCAAAGAGGAGGCCTGTCATGAGCAAGCGTGAACTGATCGATACCGGAACCGACAAGCGTTATGTGCGCCGCGATAAGGACGGCAAGTTCAAGGAAAGCGTCGACGTCTCGCGATCCTCTCTGCCGACGCGCGCCACGATGCGAAGCATGATGCGAAACCGGGCCAGGGTGACCGGGGTGATCGCAAACACTGACGAACCCCGGTGAAGCTCGCCACGTATAACGTCAACGGTATCAACGGCCGGCTCGAAGTCCTGCTTCGTTGGCTCGATCAGGCAAAACCCGATGTCGTCTGCCTGCAGGAGTTGAAGGCACCGGACGAGAAGTTTCCGCGCCGCCAAATCGAGTGCGCAGGCTACGGCGCCATCTGGCATGGTCAGAAATCATGGAATGGTGTCGCAATTCTCGCCCGGGGTCAGGAACCTCTTGAGACGCGCCGGGGGCTTCCCGGAGACCCTGACGACAGCCACAGCCGGTATATCGAGGCAGCCATCGACGGCATGATAATCGGCTGCCTTTACCTGCCGAACGGTAACCCGGCCCCCGGCCCGAAATTCGACTACAAGCTGCGCTGGTTTGAGCGGCTGGTTTCATATGCAGGTCAACTGCTTGAGCTGGACGTGCCATGCGCCCTTGTCGGAGACTTCAACGTTATGCCGACCGATCTCGATGTCTATAAGCCTGAGCGCTGGCGAGACGATGCCCTGTTTCGCCCCGAGGTCCGCGCGGCCTATGCCTACCTTATCGCCATGGGCTGGACGGACGCCATCAGACGGCTGCATCCGAATGAGAGAATATATACCTTCTGGAAGTATTTCCGGAACGCGTTTGCACGCGATGCCGGTCTGCGCATTGATCACTTCCTGCTGACCCCGACCGTTCAGAAGCGATTGCAGGCGTGTGGGGTCGACAGGTTTGCGCGTGAATGGGAGCGCACCAGTGATCACGCCCCCGCCTGGATTGAGCTGGACGACCGGTAG
Protein sequences of DBSCAN-SWA_44 >CP036360|481153:489030|481153_483805_+|QBJ16826.1|DBSCAN-SWA MANDKLSTYRQKRDFQKTQEPSGKTKLKASNRRRFVIQKHDATRLHYDLRLELDGVFKSWAVTKGPSLDPQDKRLAVEVEDHPLDYGDFEGTIPKGQYGGGTVMLWDRGYWEPEGNRTPEQALAKGDFKFTLEGERLHGSFVLVRMRNDRDGGKRTNWLLIKHRDDFSVEENGAAILDENDTSVASGRTMDAIAAGKGKKPKPFMVQSGDVQADAVWDSNHGLAADKRAADTKKRRPASPKAAKSAMPDFIPPQLCETLERPPSADGWIHEIKFDGYRIQARIENGEVTLKTRKGLDWTAKYPAIATSAASLPDAIIDGEICALDENGAPDFAALQAALSEGKTDALVYFAFDLLFEGSEDLRQLPLTERKKRLEALLSEAGEDPRLRFVEHFETGGDAVLKSACKLSLEGIVSKQADAPYQSGRTDTWAKSKCRAGHEVVIGAYAKTNGKFRSLLVGVFRDNHFVYVGRVGTGYGAKTVDTILPKLRELETSKSPFTGIGAPKKEPNIVWVKPELVAEIQFAGWTADGLVRQAAFKGLREDKPAKEVEAETPASPGKTETPTPARPKPSRPARGKNTKAEVMGVMISSPDKPLWPDANDGEPVTKEDLAHYHQAVGPWLIDHIKGRPCSIIRTPDGIGGEQFFQRHAMPGTSNLVELVKVFGDKKPYLQIDRVEGLAAIAQIGGVELHPWNCEPNEPEVPGRLVFDLDPGPDVPFSTVVEAAREMRDRLEELGLVSFCKTTGGKGLHVVTPLAVPKGNKLSWPEAKGFAHDVCLQMARDNPDLYLIKMAKNQRNGRIFLDYLRNDRMATAVAPLSPRARPGAPVSMPLTWKQVKTDLDPKRFTIRTVPALLSKTTAWQDYCDGQRSLEQAIKRLTKAMKQAA >CP036360|481153:489030|488253_489030_+|QBJ16832.1|DBSCAN-SWA MKLATYNVNGINGRLEVLLRWLDQAKPDVVCLQELKAPDEKFPRRQIECAGYGAIWHGQKSWNGVAILARGQEPLETRRGLPGDPDDSHSRYIEAAIDGMIIGCLYLPNGNPAPGPKFDYKLRWFERLVSYAGQLLELDVPCALVGDFNVMPTDLDVYKPERWRDDALFRPEVRAAYAYLIAMGWTDAIRRLHPNERIYTFWKYFRNAFARDAGLRIDHFLLTPTVQKRLQACGVDRFAREWERTSDHAPAWIELDDR >CP036360|481153:489030|484103_484403_-|QBJ16828.1|DBSCAN-SWA MKPDMFKEPVTILIGLGFPTEVRSVIDAYRHLAEWPASMRDTAHSMALKACSAALRGEIEADTARGLFAAFAAKHDLLAPDIDVISASCHSHDKDPHVR >CP036360|481153:489030|483817_484093_-|QBJ16827.1|DBSCAN-SWA MLENPQLAASDIALMGTTILLKGMEIARPIDVVDATTLKVDEKRTILAAWASDLYTIDSRPAYRHMPGTPEPVSIDEVQAALSDLDRRYGS >CP036360|481153:489030|486144_486957_-|QBJ16831.1|DBSCAN-SWA MAVRPYWKGYLKLSLVTCPVQMMPATSESEKVRFHTLNRDTENRVVSHYVDSVTGKEVREEDEVKGYERGENEYIILEDEELENVALDSTKTIDISTFTPRDGIEWIWLDTPYYLSPNDAVGQEAFSVIRDAMEAQNMVGISRLVITRRERAVMLEPRGKGIVLWTLRYGDEVRDENTYFGGIEEDETADSEMMRLVQQLIKKQTQHWDPKMVIDPVQDRLLDIIAAKKKALKKSAKPRTSTPAKSAASNVVNIMDALKKSVAAENRAGK >CP036360|481153:489030|485077_486148_-|QBJ16830.1|DBSCAN-SWA MKRPAKPLLQNDSLTAKSQPRRRRDPAQPNLPFDPMPDRVEPCLALLKTAPPSGPEWVYEMKWDGYRLAVHIEPKGVRIITRGGHDWTHRFPAIAAAARDLGVATAILDGEAVVLNGEGRSDFGALQRSLGGRGGKRASTEAVLFAFDLLYFDGHDLTRTELSVRRHLLGDLLEDATGAIQLSEEIEADGAELLASACSLGMEGIIAKHRDSPYRSGRTGDWLKIKCVQSESFMIVGYEQSAAARGGIGSLLLAARHGHDWVYVGAVGTGFKEKDAAYLKKTLDRLKTKTPVVPLKGRNYVFAQPTLIAEIEFRGWTDDGNLRHASYKGLREVQDNAAVHELKWVGPGRRLDVIFF >CP036360|481153:489030|484470_484947_-|QBJ16829.1|DBSCAN-SWA MATSYDYAPLFRSTVGFDRIFNLLENAQRARSMSDWPPYDIVKTGDDSYRISVAVAGFAEDELDITFQSNLLTVTGKKQDASADEYLHRGIAGRPFEHRFELADHVRVNGADLRNGLLSIDLVREIPEALKPRKIDIQTSPALQHKVAPAQIEAQKVA >CP036360|481153:489030|486968_487868_-|QBJ16873.1|DBSCAN-SWA MVAPRANWKGFIKFGEVACPVALYTAASSSERIAFNTLNRRTGNRVKREFIDTETGDPVERDAQVKGYEIENGQYIVLEPDEVAAAVPHSDKTLKIDAFIPCSEIDSVYFDKPYYLAPDRMGTEAFVLLRDGMKRAKVAAIARTVLFRRLRTVLIRPHGKGLIATTLNFDYEVRSSAEAFEELPDLKIEGEMLELAEHIIGTKKGTFDASTFNDRYEAAVAELVKAKIEGRALPKKKAAAVPRQNDLLQALRESAGMGTKSSQKRAASNANKSAGRQKASKSGAAAKSNAVSSQTRRAG |
8 | Sinorhizobium_phage(20.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|