| Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
|---|---|---|---|---|---|---|---|
| NZ_AP021844 | Azospira sp. I09 | 0 crisprs | DEDDh,DinG,WYL,RT,csa3 | 0 | 0 | 3 | 0 |
| NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 4 crisprs | PD-DExK,DinG,csf5gr6,csf1gr8,csf2gr7,csf3gr5 | 0 | 7 | 2 | 0 |
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
|---|
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
|---|
| Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DBSCAN-SWA_1 |
1397060 : 1410067
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021844|1397060:1410067|DBSCAN-SWA ACTAAGCCGTGCTCTTACGCGCGGCGACTTCTTCCAGAAGCCGCTCGATCAGGCTTGCCACGGGCATCTGCCCCAGGTCCTGACCACCGCGGGTACGCACGGCCACCAAGCCGGCTTCCTTTTCCTTGTCGCCGATGACGAGCTGATAAGGCAACCGGTTCAAGCTATGTTCGCGTATTTTATAGGTAATTTTTTCATTGCGCAAATCCGCTTCGGCACGCAAACCTGCCTGACGCAGGGTTTTCACCACTTCGGCGGAAAAATCGGCCTGTTTTTCCGAAATATTCAGCACCACGGCCTGTACCGGCGCCAGCCACAGGGGCAAGGCACCCGCATAGTTTTCCACCAGGATGCCGATGAAACGCTCCAGGGAACCGAGGATGGCCCGATGCAGCATCACCGGCACATGGCGGGCGTTGTCCTCGCCCACATACTCGGCGCCCAGGCGACCCGGCATGGAGAAATCTACCTGCATGGTGCCGCACTGCCAGGAACGCCCGATGGCATCCTTGATGTGGAACTCGATCTTGGGGCCATAGAAGGCACCCTCGCCGGGCAATTCATCCCATTCCAGGCCGGAAGCCTTGAGGCCGGCCCGCAGGGCATTCTCCGCCTTGTCCCAGATGTCGTCGGAACCGACACGGCTTTCGGGGCGCAGGGCCAGCTTCACCGCCACCTGATCGAAACCGAAGTCGGCATAGACCTTCTTCACCAGAGCGTTGAAAGCCGTCACTTCCGCTTCGATCTGGTCTTCGGTACAGAAGATGTGACCGTCGTCCTGGACGAAGCCGCGCACGCGCATCAGGCCGTGCAGGGCGCCGGAGGCCTCGTTACGATGACAGGAGCCGAACTCGCCATAGCGCAGGGGCAGGTCGCGGTAGGAGCGCAGATCGGAATTGAACACCTGCACGTGCCCCGGGCAATTCATCGGCTTGATGGCGTAATCCCGCTTCTCCGACTCCGTGGTGAACATGTTGTTCTTGTAGTGCTCCCAGTGACCGGACTTCTCCCACATGCTGCGGTCGAGAATCTGGGGGCAGCGGATTTCCTGGTAGCCGTTGTCGCGATAGACCTGGCGCATGTACTGCTCGATTTCCTGCCAGATGGCCCAGCCCTTGGGGTGCCAGAAAACCATGCCCGGCGCCTCGTCCTGCATATGGAACAGATCCAGATGCTTGCCGATGCGGCGGTGATCCCGCTTCTCGGCCTCTTCCAGCATGTGCAGGTAGGCTTCCTGGTCTTCCTTCTTGGCCCAGGCGGTGCCGTAGATGCGCTGCAGCATCTCGTTCTTGGAATCGCCGCGCCAGTAGGCACCGGCCACCTTCATCAGCTTGAAAACCTTGAGCTTGCCGGTGGAGGGGACGTGGGGGCCGCGACAAAGGTCCACAAATTCGCCTTCCCGGTACAGGGAAACATCCTGGTCGGCCGGGATGGCAGCGATCAGCTCGGCCTTGTACTTCTCACCCTGCTCCAGGAAAAACTTGACCGCATCGTCCCTGGCCCAGACTTCGCGGCTTACCGGAATGTCGCGCTTGGCCAGCTCGGCCATTTTCTTCTCGATGGCCAGCAGGTCTTCAGGGGTAAAGGGGCGCTTGTAGGCGAAGTCGTAGTAGAAGCCGTTGTCGATGACCGGACCGATGGTCACCTGGGCTTCGGGAAACAGCTCCTTCACCGCATAGGCCAGCAGGTGGGCCGTGGAGTGGCGGATGATCTCCAGGCCGTCCGCATCCTTGTCGGTGACGATGGCCAGCTGGGTATCACGCTCCATCAGGTGGGACGTATCCACCAGCTTGCCGTCCACCTTGCCGGCCAGGGCGGCACGGGCGAGGCCGGCACCGATGGAGGCAGCAACCTCGGCAACGGTCACCGGAGCATCGAAGGAGCGGATGGAACCGTCGGGCAGCGTGATATTGGGCATGATTTCTCCAGCCACATTAAATTCTGGACAAAAAAAAAGTGCGGACGGATCCGCACTTTTTTTCGACATAGGAACGAAGGCGAACCGCTTAGGCTTTCTGAACCAGGATAGTGCAAGTTCGGAAATTCACGGTACAGCTCCGTTCAAGGTGTTGGTAGGCGCGATTGGATTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGCGCCTGCAAAGAAGGCCGAATTATAACGATCCCAGAGAATCGCGCAAGGGGCTGAGGAGACTTTTTTCGCTTCGCCCCACAAAAAAAATCAGCACAACCGTCCGGGCGAACATTTGGCCAGGGCCGGCGCCACTGCGGTGACGGGGCGCGACTCCTGGATCCAGGCGCGGATGCCCTGGCGCACGTTGTAGATCTTGCTGTAGCCGGCCTGCTGTTCGAGGAAGTCGCTGACGGCCCGGGTGCGGTTGCCGCTGCGGCAGATGAGGATCACCGGCTGCTCCGGACCTGCCACCGTCTTCAGCTGCTCCAGCCAGGCCGCCGGATTGGCGCGGCCGTTGGCATCGAAGAAGGTGAGCAGGCGGCTGCCGGGAATGACGCCGGTTTCCCGCCATTCGGGCTCGGTACGGATATCCACCAGGACCACGCCACTGGCCACCAGCCGGGCCACTTCGGCGCTGTCCACATTCACCACCTCGGCCCTGGCCGCGAACGCCGCCAGCAGAGCCAGGAGGAAGAGGAAGGTGTGCTTCACGCCGCTACCTCCGGCCAGGCGGCCAGGAATTCCTGCCAGTGGGGCTTGTCCAGCTTGGCCAGTTCGGCCTTGATGAAGGCCAGTTCGGCCAGATGCTCCTCCCGGCTGACCTCGCCCCGCATCAGGCGGAAGCGGGTGGACACCAGGTAGGTATTGACCACATCGGTCTCGCAGTAATCGCGGATCTCGTCGGCCTTGCCCTCCTGCCAGGCCTGCCACACCTTGCCGCCGTCCATGCCCAGCTTGCCGGGGAAGCCCATGAGCTTGGCCAGGTCGTCCAGGGGAGCGCTGGCCCGGGGCTGATACATGGCCAGCAGGTCCATCAGGTCCAGGTGGCGGGTGTGGTAGCGGCTGATGTAGTTGTTCCACTTGAAGTCCCGGGAATCCGCATAGTCGCCGTCGCCCAGGTCCCAGTAGCGGGGAGCCACGACGCCGTGGATCAGGCCCCGGTAGTGCAGCACCGGCAGGTCGAAGCCGCCGCCGTTCCAGGAAACAATCTGGGGCGTGAATTTCTCGATGCCGTCGAAGAAGCGCTGGATGATCTCCCCTTCGCCAATCTCGGGAGCCGCCAGGGACCACACCTTGAAGGCATCCCGGGCCCGCAGGGCGCAGGAGATGGTCACCACCCGCTGCAGGTGCAGGGGCAGGAAGTCGCTGCCGTTCTGGGCCCGGCGCTGCTGGAAGGCCAGTTCGGCCACCTCGTCGTCGGAGAGGTCGGCGGGAAGGTCGTGCAGGCGGCGCAGCCCGGGTACATCCGGGATGGTTTCAATATCGAAAACGAGGACGGGAACCATGGGACTCAGGCGGGGAAAACGCCGGTGGACAGATAACGGTCGCCCCGGTCGCAAACGATGGTGACGATGGTGGCGTTCTCCAGCTCCCGGGCCAGGCGCAGGGCCACGGCCAGGGCGCCGCCGGAGGAGATGCCGGCGAACAGGCCCTCTTCCCGGGCCAGGCGCCGGGTCATGTCCTCGGCCTCGGCCTGGGACACGTATTCGAGGCGGTCCACCCGGCTGCGCTCGTAGATTTTCGGCAGATAGGCTTCGGGCCATTTGCGGATGCCGGGAATCTGGCTGCCCTCTTCCGGCTGGCAGCCGACGATCTGGATGCGCGGATTCTTTTTCTTGAGAAACTGGGAAGTACCGATGATGGTGCCGGTGGTGCCCATGCTGGAAACGAAATGGGTCACCTGGCCCTTGGTGTCGCGCCAGATTTCCGGCCCGGTGCCCTCGAAGTGGGCCAGGGGGTTATCCGGATTGGCGAACTGGTCGAGGATGATGCCCTTGCCTTCGTCGCGCATTTTCTCGGCCACGTCCCGGGCCAGTTCCATGCCGCCATCCCGGGGCGTCAGGATCAGTTCGGCGCCGTAGGCACGCATGGTCTGGCGCCGCTCCAGGCTCTGGTTTTCCGGCATGACCAGGATCATGCGGTAACCGCGCATGGCGGCGGCCATGGCCAGGGCGATGCCGGTGTTGCCGGAGGTGGCTTCGATCAGGGTATCGCCGGGCCGGATTTCGCCCCGCTGCTCGGCGTGGGAAATCATGGAGAGGGCCGGGCGATCCTTCACCGAACCGGCCGGATTGTTGCCTTCCAGCTTGGCCAGGATGACGTTGCCGCGCTGGGCGACAACATCGCCGGGCAGGCGCTTCAACTGCACCAGGGGCGTGTTGCCGACGAAATCTTCCAGAGTCTTGTACATGGCTCAGCGTCCGTTGAGGAATTGCACGTAGTCGGCGACGCCTTCGGCCACGGTGGCGAACTCGTCACCATAGCCGGCGCTGCGCAGCTTGCTGAGGTCGGCCTGGGTAAAGCTCTGGTACTTGCCTTTGAGGGCTTCGGGGAAGGCCACGTACTCCACCAGCCCCTGCTGCACCATCGCCTCCAGGGACAGAGCCGGCTTGCCCTCGGCGGCCCGGCAGCTGTTCACCGTGGCCACGGCCACGTCGTTGAAGCTCTGGGCCCGGCCGGTGCCCAGGTTGAAGATGCCGGATTTTTCCGGATGGTCGAGGAAGTACAGATTGACCTTGGCCACGTCCTTCACATAGACGAAGTCGCGCTGCTGCTCGCCGTTGGCGTAGCCGTCGCAGCCCTCGAACAGCTTGACCTTGCCTTCGGCCCGATACTGGTTGAAGTGGTGGAAGGCGACCGAGGCCATGCGCCCCTTGTGGCTCTCGCGGGGGCCGTAGACGTTGAAGTAGCGGAAGCCCACCACCTGGGAACGGACTTCAGGCAAGCGCTGACGGACGATCTGGTCGAAGAGGAACTTGGAGTAGCCGTACACGTTGAGAGGCGCCTCGTACTGCCGCTCTTCCTTGAAAACGCTGCTGCCGCCATAGGTGGCGGCGGAAGAGGCGTAGAGCAGCTGCACGTCCTGCTCCAAGCACCAGTCCAGCAGGGCCAGGGAGTAGCGGTAGTTGTTCTCCATCATGTAGCGGCCGTCGGTCTCCATGGTGTCGGAGCAGGCGCCTTCGTGGAAGATGGCCTCCACGTCGCCGTCGAAGTGGCCGCAGAGCAGGCGCTCGAGAAACTCGCCCTTGTCCAGGTAATCGGCGATCTCGCAGTCCACCAGATTCTTGAACTTGTCCGCCTTGGTCAGGTTATCCACGGCGATGATGCGGGTGATGCCCCGCTCGTTGAGGGCCTTGACCAGGTTGGCGCCGACGAAGCCGGCGGCACCGGTGACGATGTAGTACATGTGAATTCCTTAATTGGCGTCGGCCGCCAGGGCGGCCGACAGTTCCTCGCGACTGACGGTAGCGGTACCGAGCTTGCCCACCACCACGCCACCGGCCAGGTTGGCCAGGTGGATGGCGTCGCCCCAGGGGGCGCCCAAGGCCATCATGGCGGCCAGAGTGGCAATGACCGTGTCGCCGGCACCGGAGACGTCGAACACTTCCTGGGCCCGGGCCGGCTGGTGCAGCGCCTCGCCGTCGCGATAGAGGCTCATGCCCTCTTCGCTACGAGTCACCAGCAGGGCATCCAGTTCCAGTTCGCTGCGCAATTGCTGGGCCTTGGCCGCCAGCTGGGCCTCGTCGCTCCAGCGCCCCACCACCTGGCGCAGCTCGGAGCGGTTGGGGGTGATGACGGTGGCGCCGCGGTACTTGGAATAATCCTCGCCCTTGGGATCCACCAGCACCTTTTTCCCGGCCGCCCGGGCCAGGCGGATCATGTCGCCGATGTGGGCCAGGCCGCCCTTGCCGTAGTCGGAAAGGATCACCACATCCACCCCGGCCAGGCGCTGCTCGAACTCGGCCAGCTTGGCCTGCAGCACCTCGTGGGACGGGGTGGTCTCGAAATCGATGCGCAGCAGCTGCTGCTGGCGGCCGATGACCCGCAGCTTGACCGTGGTGTCGATGGCTCCGTCGGGCAGCAGGGAGGCGGCAATGCCGCCCTCTTCCATCTGCCGCTGCAGGATGCGGCCGGCCTCGTCGTTGCCGACCACGGACAGCAGGCCGACCCGCGCCCCCAGGGAAGCGCAGTTGCGGGCCACGTTGGCAGCACCGCCGGGACGCTCTTCGGAGCGTTCCACCTTGACTACCGGCACCGGTGCTTCGGGGGAAATGCGGGAAACATCGCCGAACCAGTAGCGGTCCAGCATGACATCGCCGACCACAAGAATGCGGGCGGCGGAAAAATCGGGAAGTTGGTGCATGGTGAAGATACTCAGTCGAAAAGATCGGCCTGGGCGAAGGGCGTGCCCTGCATATCCCGGGCCGAAAGACGGGGCTCCAGCCCCTCCAAAGGCCAGGCCACGGCCAAGGCCGGATCGTTCCAAGCGATGCAGCGCTCATGTTCCGGCGCGTAATAGTCGGTGGTTTTATAAAGAAAATCGGCGGTCTCGCTGAGCACCAGGAAACCGTGGGCAAAACCGGGAGGTACCCACATTTGGCGCTGATTATCCGCGGAAAGTACGGCACCAACCCAGCGGCCGAAATAGGGGGATTGGCAACGCAAGTCAACCGCCACGTCGAAAACGGCGCCTTGAGCCACTCGAACCAGCTTCCCTTGAGGCTGGCGAATCTGATAATGCAGGCCCCGCAGCACGCCTCGTGCGGAACGGGAGTGGTTGTCCTGGACGAAATCCACATCGGCCCCAGTCAGCTCGGTAAAGCGACGCCGGTTGTAACTTTCCATAAAAAAGCCGCGAGCATCGCCGAAAACCAGGGGTTCCAGCATGATCACATCGGCAATGGCGCTGGGAATCGCCTTCACGCTGCATGCTCCTTGAGCAGGGCCAGCAGGTATTGGCCGTAGCCGTTCTTGGCCAGGACCCGGGCCTGGGCTTCCAGAGTGGCGTCGTCGATCCAGCGTTGGCGCCAGGCCACCTCTTCGGGACAGGCCACCTTGAGACCCTGGCGCTTCTCGATGGTCTCGATGAACTGCCCCGCCTCCAATAGAGATTCGTGGGTACCGGTATCGAGCCAGGCATAGCCCCGGCCCATGATTTCCACATTGAGCTTGCCCGCTTCCAGGTAATGCCGGTTCACGTCGGTGATTTCCAGTTCGCCCCGGGGCGAGGGCTTGATGCCCTTGGCCACGGCCACGATATCGGTATCGTAGAAGTAAAGGCCGGTGACGGCATAGTGGGACTTGGGCTGCAGCGGTTTTTCCTCGATGGAGAGGGCCCGCTGCTGGGCATCGAACTCGACCACACCGTAGCGCTCCGGATCATTCACCCGGTAAGCGAAGACCGAGGCACCGCTGTCCCGATCGTTGGCCCGCTGCACCAGGGTGGCCAGGTCGTGGCCGTGGAAGATGTTGTCCCCCAGCACCAGGGCCGCCGGGGCACCGTCAAGGAAGGCCTCGCCGATGAGGAAGGCCTGGGCCAGGCCGTCCGGCGAGGGCTGCACCGCGTACTGCAGATTGATGCCCCACTGGGAGCCGTTTCCCAGCAGCTGCTCGAAACGGGGCGTGTCCTGGGGCGTGGAGATGATGAGGATGTCCCGCAGACCGGCCAGCATCAAGGTGGTCAGGGGGTAGTAGATCATCGGCTTGTCGTAGATCGGCAGCAGCTGCTTGGACACCGCCAGGGTGGCCGGATAGAGCCGGGTGCCGGAACCGCCGGCGAGAATGATGCCTTTACGGGGTTTAGTAGCCATTCTGGGCCTTCAGGGCGAGAAGCTGCATCATGCGGGAAAGATAGGGCTGCCAGTCGGGCATGGTCAGACCGAAACGGTCCTCCAGCTTGCGGCAGTCGAGACGGGAATTGAGGGGGCGCGGCGCCGGCAGCGGATATTCGCTGCTGGGAATGGGTGCGATGGCCTCGGGCCCCAGCTTGAGGGCGAAGCCGGGCGTCTGTTCGGCGGTGGCGACGATGGCCCGGGCAAAACCGTTCCAGCTCACCGGATTGGCAGCCACCAGGTGATACAGCTCGCAGCCCTGCTGGGCGCGCCCGCCGTCGAGCTGGGCCAGGACCATGCCGGTGACGGTGGCGATCATGGCCGCCGGCGTCGGGCTGCCGACCTGGTCGGCCACCACCTTGAGGCTGTCCCGCTCGCTCGCCAGGCGCAGGATGGACTTGACGAAATTCTTGCCCCGGGCGCCGAAGACCCAGCTGGTGCGGAAGATGAGGCCGCGGCCGCCCACGGCCAGCATGGCTTCCTCCCCTTCCCGCTTGGTCCGGCCATAGACGCCGAGGGGCGCCGTGGCATCGGACTCCACATAGGGCGCCGCCTTGCTGCCGTCGAAGACGTAGTCGGTGGAGTAATGCACCAGCAGGGCATCCAGGGCCTTGGCCTCCTCGGCCAGCAGGCCCACGGCCTCGGCATTGATGCGCCGGGCCAGTTCAGGCTCCATTTCCGCCTGATCCACCGCCGTATAGGCGGCGGCATTGACGATCAGGCGGGGGCGCTGTTCCCGCACCACGGCCCGCAGCCGGTCCAGGTCGGCCAGGTCGCATGTACGCCGGTCCAGGGCCAGCACCGGCCCCAGGGGCGCCAGGTCCCGCTGCAGTTGCCAGCCCAGCTGGCCCTGGCTGCCCAGGAGCAGGATGGGAGCCGACACCTCAGGCTTCCCCATACTGGCGGCCCACCCACTCCCGGTAGGCGCCGGAGGTGACGTTGTGCACCCACTGGGGATTGTCCAGGTACCAGCGTACGGTCTTGCGGATGCCGGTTTCGAAGGTCTCCGCCGGCTTCCAGCCCAGCTCCCGTTCCAGCTTGCTGGCGTCGATGGCATAGCGCCGGTCGTGGCCGGGCCGGTCGGCAACGAAAGTGATCTGGCTGGCGTAGGAGGCGCCGTCGGCCCGGGGCGACAGTTCGTCGAGCATGGTGCACAGGGTATGCACCACCTCCAGGTTGGGCTTTTCGTTCCAGCCGCCCACGTTATAGGTCTCACCCAGGCGGCCGGCTTCCAGGACGCGGCGGATGGCACTGCAATGGTCCTTCACATAGAGCCAGTCGCGGATCTGCTGGCCGTCGCCGTAGATGGGCAGGGGCTTGCCGGCCAGGGCGTTGTGGATGATGAGGGGAATGAGCTTTTCCGGGAAATGGTAGGGCCCGTAATTGTTGGAGCAGTTGGTGGTCAGCACCGGCAGGCCGTAGGTATGGTGGTAGGCCCGCACCAGATGGTCGGAGGCCGCCTTGCTGGCCGAATAGGGACTGTTGGGCTCGTAGCGGTGCTGCTCGGTGAAGGCCGGGGCCTCCTTTTCCAGGGAACCGTAAACCTCGTCCGTGGACACGTGGAGAAAGCGGAAGGCCGCCTTGTCGTCGGCAGGCAGGCCGTTCCAGTAGGCCCGCACGGCCTCCAGCAGGCGGAAGGTGCCGACGATGTTGGTCTGAATGAAGTCCTCCGGCCCGTGGATGGAACGATCCACATGGCTCTCGGCGGCGAAGTTCACCACCGCCCGCACCCGGTTCTGCTGCAGCAGTTCCAGAATCAGGTCGTAGTCGGCGATGTCGCCGCGCACGAAACGGTGCCGCGGATCGCCGGCCAGTCCCTGGAGGTTCTCCAGATTGCCGGCGTAGGTCAGCTTGTCCAGGTTGATGACAGGCTCACCCCCCGCCGCCAGCCAGTCGATGACGAAATTGCTGCCGATGAAGCCTGCACCGCCGGTCACCAGGATCATGGCGGACTCCTTATTGACGACCGATGGCCTGGTAGTCGATGCCGAACTGGCACACCTGCTTGGGTTCGTACAGGTTGCGGCCGTCGAAGATCACCGGCTGCTTCAGCTTGGCCTTGATGGCCTCGAAATCGGGACTGCGGAATTCCTTCCACTCGGTGACGATGAGCAGGGCATCCGCCCCATCCAAGGCGGCCATGGGGCTCTCCGCATAGCTCAGACGCGGCTCATCGCCGAAAATGCGCCGGGCTTCATGCATAGCCACCGGATCGTAGGCCACCACGGTGGCGCCAGCGGCGAAGAGATCGGCCAGCAGATAACGGCTGGGGGCCTCGCGCATATCGTCCGTATTGGGCTTGAATGCCAGGCCCCAGACGGCGAACTTGCGGCCGCTGAGGTCGTTGCCGAAACGCTTCACGGTCTTGGCGGTGAGCACGTGCTTCTGGGCATCGTTGGCGTCTTCCACCGCATTGAGGACCTTCATCTCCATGCCGGCGTCGAGGCGGGCGGTGCGCTGCAGGGCCTGCACATCCTTGGGGAAGCAGGAACCGCCGTAGCCGCAGCCTGGATAGAGGAAGTGGTAGCCGATGCGCGGGTCGGAACCGATGCCCTGGCGCACCTGCTCGATGTCGGCACCCAGCTTCTCGGCCAGGTTGGCCAGTTCGTTCATGAAGCTGATGCGGGTGGCCAGCATGGCGTTGGCGGCGTACTTGGTCAGTTCGGCGGAACGCACATCCATGACGATGAGGCGCTCGTGGTTGCGCTGGAAGGGCGCATAGAGGGCGCGCATCAACTCGATGGCGCGCTCGTCCTCGGCGCCGACGACGATGCGGTCCGGCCGCATGAAATCCTCCACGGCGGCGCCTTCCTTGAGGAATTCCGGATTGGAGACGACGCTGTAGGCGATATCCGCTCCCCGGGCCTTGAGCTCGTCGGCGATGGCGGCGCGCACCTTGTCGCCGGTGCCCACGGGCACGGTGGACTTGTCCACCACCACCTTGTAGTCGCCCATGTGGCGGCCGATGTTGCGGGCGGCGGCGAGCACGTACTGCAGATCGGCGGAACCGTCCTCGTCCGGCGGCGTGCCGACGGCAATGAACTGGATGGTGCCGTGGGCCACGGCCTGTTCCACATCGGTGGTGAAACGCAGGCGGCCGGCGGCCACGTTGCGCTTCACCATGTCCAGCAGGCCCGGTTCGAAAATGGGAATGCCGCCTTCGTTGAGAATCCGGATCTTTTCCGGATCCACATCCAGGCACAGCACATCGTTGCCCACCTCGGCCAGGCAGGTACCGCTCACCAGGCCCACATAGCCCGTACCGACAACTGTAACTTTCAAATTATTCTCCCTGAAGGGTCATTGATCCGCCCAAACCGACGCATTCATGGGCTCAGGGGATCAGATCGAATTCTTCCGTGCGCCGGGGCGGATAGGTTTCCCAGCCGCCGCAGGCAGGACAGCGCCAGTAGAAGTGGCGCGCCTTGAAGCCGCAATTATCGCACCGATAGCGGGCCAGCCGGCGGGTGTGGTTGTGCACCAGATTGCGCACCAGCTCCAGGTCCGCCCGCTTTTCCGGCGGCACGCCGAGCAGCTGGGCTTCCAGCAGGCGGTCCAGGCCCAGCAGGGTCGGATTGCGCCGCAGTTCGTCCCGCACCAGGCGATAGGCCGCCTCCGGCCCCTCCGCATCCATCACCAGCTGGAATACGGTTTCCAGCAGATCGAGGGAGGGGTAGCTGGCCAGATAGCCGCGCAGCAGTTGCAGCCCTTCGTCCCGCTGCCCCTGGGCCAGGTAGGCATCCTGGAGCTTGCGGGCGACGATGGCCAGGTAGGCCGGATTCTGGCTCTCGATGCGCTTCCAGGCCTCGATGGCCGCTGCCAGATCACCCGCCTGCTGCAACAAGTCGCCCTGCAGGACGCTGGCGCGCACGCAGTTGCGGTGCAGGGAAAGGGCCGAGTCCAGGTACTGGCGGGCGGAATCGGGCCGGGAATTGATCATCTCCCCGGCTGCCAGCTCGCAGTAATAGTTGGCGATTTCCTTCTGGGTGGCGTAGTCAGGCATTTCCTTGGCGATGGCGATGGCCTTCTGCCAGTCCTTCTCCTGCTGGTAGATTTCCAGCAGGTTGCGCTTGGCCTCCTCGTCCCGGGAGGTGCCGCGCAAACGGGAAAACACCTCTTCGGCCCGGTCCAGCAGACCGGCCTTGAGGAAGTCCTGGCCAAGCTCGGAAAGGGCCTGCAGCTTCAGCTCCGGAGACAGGTCCACCCGCTCGATGAGGTTCTGGTGCATGCGGATGGCCCGCTCGGTCTCGCCGCGGCGGCGGAACAGGTTGCCCAGGGCGAAGTGCAGTTCCACCGTCTGGGGATCCACCTTCACCACTTCGATGAAGGCTTCGATGGCCTTGTCCGGCTGCTCGTTGAGCAGGAAATTCAAGCCCTGGAAGTAGGACCGGGGCAGGGCCCGGGATTCCCGCACCAGATGCTTGATGTCGATGCGGGCGGCGGCCCAGCCAAGGACGAAAAACAGGGGAAAGAGCAGTAACTGCCAGTACTCGAATTCGATCATTGGGCCACGGCCTCGCCGGCAGGAGGCGGTTGCACGGCAGCGTCGCTGGCGACCGGCTTGAGCACGGCCTGGCGCTCCCGCTCCCGCGCCAGTTCACGGCGGGTGCGCGACAGTTCGCGGCGCAACTGGAACAGGGTGCCGAGCAGGGACAGGGCACCTAAGGCCGTCCCGGCGGCGAAGAAACCGAGCAGGATGATCACCAGGGGCGCCTGCCACTGGGTATCGAAGAAGAAGCGCAGACTCACCGGATCGCTGTTCATTGCCGCAAAACCCAGCAAGAAGAAGAAGATGATGAGCCGGATGATCAAGATCAGGGCGCGCATAGCCTTATCCGCAAGCAATAAAAAAGGCGGCAACCCTAGGGTTGCCGCCCGCAATCTACCACAGGACAGGGGGTAAGGTCAGCGTCAGGCGGAGAGAGAGGCCAACAGCCCCAGGTCCACCCGTTCGCGCAGTTCCTTGCCTGCCTTGAAATGGGGTACGTATTTTTCCGGAACGCTGACTTTATCCCCGGACTTGGGGTTACGCCCCATTCGCGGCGGCCGGTAGTTCAGCGCGAAGCTGCCGAACCCCCGAATCTCGATACGGTCACCGTGGGCCAGAGCCTCGGTCATGGCATCGAGAATTTCCTTGACCGCGAAATCAGCGTCTTTCGCCACCAGTTGCGGAAACCGCATGGCCAGGCGGGCGATCAGCTCGGATTTGGTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021844|1397060:1410067|1401435_1402428_-|WP_152089519.1|DBSCAN-SWA MYYIVTGAAGFVGANLVKALNERGITRIIAVDNLTKADKFKNLVDCEIADYLDKGEFLERLLCGHFDGDVEAIFHEGACSDTMETDGRYMMENNYRYSLALLDWCLEQDVQLLYASSAATYGGSSVFKEERQYEAPLNVYGYSKFLFDQIVRQRLPEVRSQVVGFRYFNVYGPRESHKGRMASVAFHHFNQYRAEGKVKLFEGCDGYANGEQQRDFVYVKDVAKVNLYFLDHPEKSGIFNLGTGRAQSFNDVAVATVNSCRAAEGKPALSLEAMVQQGLVEYVAFPEALKGKYQSFTQADLSKLRSAGYGDEFATVAEGVADYVQFLNGR >NZ_AP021844|1397060:1410067|1409353_1409680_-|WP_152089526.1|DBSCAN-SWA MRALILIIRLIIFFFLLGFAAMNSDPVSLRFFFDTQWQAPLVIILLGFFAAGTALGALSLLGTLFQLRRELSRTRRELARERERQAVLKPVASDAAVQPPPAGEAVAQ >NZ_AP021844|1397060:1410067|1399729_1400527_-|WP_014237027.1|DBSCAN-SWA MVPVLVFDIETIPDVPGLRRLHDLPADLSDDEVAELAFQQRRAQNGSDFLPLHLQRVVTISCALRARDAFKVWSLAAPEIGEGEIIQRFFDGIEKFTPQIVSWNGGGFDLPVLHYRGLIHGVVAPRYWDLGDGDYADSRDFKWNNYISRYHTRHLDLMDLLAMYQPRASAPLDDLAKLMGFPGKLGMDGGKVWQAWQEGKADEIRDYCETDVVNTYLVSTRFRLMRGEVSREEHLAELAFIKAELAKLDKPHWQEFLAAWPEVAA >NZ_AP021844|1397060:1410067|1403396_1403945_-|WP_152089521.1|DBSCAN-SWA MKAIPSAIADVIMLEPLVFGDARGFFMESYNRRRFTELTGADVDFVQDNHSRSARGVLRGLHYQIRQPQGKLVRVAQGAVFDVAVDLRCQSPYFGRWVGAVLSADNQRQMWVPPGFAHGFLVLSETADFLYKTTDYYAPEHERCIAWNDPALAVAWPLEGLEPRLSARDMQGTPFAQADLFD >NZ_AP021844|1397060:1410067|1403941_1404832_-|WP_152089522.1|DBSCAN-SWA MATKPRKGIILAGGSGTRLYPATLAVSKQLLPIYDKPMIYYPLTTLMLAGLRDILIISTPQDTPRFEQLLGNGSQWGINLQYAVQPSPDGLAQAFLIGEAFLDGAPAALVLGDNIFHGHDLATLVQRANDRDSGASVFAYRVNDPERYGVVEFDAQQRALSIEEKPLQPKSHYAVTGLYFYDTDIVAVAKGIKPSPRGELEITDVNRHYLEAGKLNVEIMGRGYAWLDTGTHESLLEAGQFIETIEKRQGLKVACPEEVAWRQRWIDDATLEAQARVLAKNGYGQYLLALLKEHAA >NZ_AP021844|1397060:1410067|1400532_1401432_-|WP_152089518.1|DBSCAN-SWA MYKTLEDFVGNTPLVQLKRLPGDVVAQRGNVILAKLEGNNPAGSVKDRPALSMISHAEQRGEIRPGDTLIEATSGNTGIALAMAAAMRGYRMILVMPENQSLERRQTMRAYGAELILTPRDGGMELARDVAEKMRDEGKGIILDQFANPDNPLAHFEGTGPEIWRDTKGQVTHFVSSMGTTGTIIGTSQFLKKKNPRIQIVGCQPEEGSQIPGIRKWPEAYLPKIYERSRVDRLEYVSQAEAEDMTRRLAREEGLFAGISSGGALAVALRLARELENATIVTIVCDRGDRYLSTGVFPA >NZ_AP021844|1397060:1410067|1408187_1409357_-|WP_172974712.1|DBSCAN-SWA MIEFEYWQLLLFPLFFVLGWAAARIDIKHLVRESRALPRSYFQGLNFLLNEQPDKAIEAFIEVVKVDPQTVELHFALGNLFRRRGETERAIRMHQNLIERVDLSPELKLQALSELGQDFLKAGLLDRAEEVFSRLRGTSRDEEAKRNLLEIYQQEKDWQKAIAIAKEMPDYATQKEIANYYCELAAGEMINSRPDSARQYLDSALSLHRNCVRASVLQGDLLQQAGDLAAAIEAWKRIESQNPAYLAIVARKLQDAYLAQGQRDEGLQLLRGYLASYPSLDLLETVFQLVMDAEGPEAAYRLVRDELRRNPTLLGLDRLLEAQLLGVPPEKRADLELVRNLVHNHTRRLARYRCDNCGFKARHFYWRCPACGGWETYPPRRTEEFDLIP >NZ_AP021844|1397060:1410067|1404821_1405751_-|WP_152089523.1|DBSCAN-SWA MGKPEVSAPILLLGSQGQLGWQLQRDLAPLGPVLALDRRTCDLADLDRLRAVVREQRPRLIVNAAAYTAVDQAEMEPELARRINAEAVGLLAEEAKALDALLVHYSTDYVFDGSKAAPYVESDATAPLGVYGRTKREGEEAMLAVGGRGLIFRTSWVFGARGKNFVKSILRLASERDSLKVVADQVGSPTPAAMIATVTGMVLAQLDGGRAQQGCELYHLVAANPVSWNGFARAIVATAEQTPGFALKLGPEAIAPIPSSEYPLPAPRPLNSRLDCRKLEDRFGLTMPDWQPYLSRMMQLLALKAQNGY >NZ_AP021844|1397060:1410067|1399289_1399733_-|WP_152089517.1|DBSCAN-SWA MKHTFLFLLALLAAFAARAEVVNVDSAEVARLVASGVVLVDIRTEPEWRETGVIPGSRLLTFFDANGRANPAAWLEQLKTVAGPEQPVILICRSGNRTRAVSDFLEQQAGYSKIYNVRQGIRAWIQESRPVTAVAPALAKCSPGRLC >NZ_AP021844|1397060:1410067|1409764_1410067_-|WP_014237016.1|DBSCAN-SWA MTKSELIARLAMRFPQLVAKDADFAVKEILDAMTEALAHGDRIEIRGFGSFALNYRPPRMGRNPKSGDKVSVPEKYVPHFKAGKELRERVDLGLLASLSA >NZ_AP021844|1397060:1410067|1402437_1403385_-|WP_152089520.1|DBSCAN-SWA MHQLPDFSAARILVVGDVMLDRYWFGDVSRISPEAPVPVVKVERSEERPGGAANVARNCASLGARVGLLSVVGNDEAGRILQRQMEEGGIAASLLPDGAIDTTVKLRVIGRQQQLLRIDFETTPSHEVLQAKLAEFEQRLAGVDVVILSDYGKGGLAHIGDMIRLARAAGKKVLVDPKGEDYSKYRGATVITPNRSELRQVVGRWSDEAQLAAKAQQLRSELELDALLVTRSEEGMSLYRDGEALHQPARAQEVFDVSGAGDTVIATLAAMMALGAPWGDAIHLANLAGGVVVGKLGTATVSREELSAALAADAN >NZ_AP021844|1397060:1410067|1397060_1398977_-|WP_014237029.1|tRNA|DBSCAN-SWA MPNITLPDGSIRSFDAPVTVAEVAASIGAGLARAALAGKVDGKLVDTSHLMERDTQLAIVTDKDADGLEIIRHSTAHLLAYAVKELFPEAQVTIGPVIDNGFYYDFAYKRPFTPEDLLAIEKKMAELAKRDIPVSREVWARDDAVKFFLEQGEKYKAELIAAIPADQDVSLYREGEFVDLCRGPHVPSTGKLKVFKLMKVAGAYWRGDSKNEMLQRIYGTAWAKKEDQEAYLHMLEEAEKRDHRRIGKHLDLFHMQDEAPGMVFWHPKGWAIWQEIEQYMRQVYRDNGYQEIRCPQILDRSMWEKSGHWEHYKNNMFTTESEKRDYAIKPMNCPGHVQVFNSDLRSYRDLPLRYGEFGSCHRNEASGALHGLMRVRGFVQDDGHIFCTEDQIEAEVTAFNALVKKVYADFGFDQVAVKLALRPESRVGSDDIWDKAENALRAGLKASGLEWDELPGEGAFYGPKIEFHIKDAIGRSWQCGTMQVDFSMPGRLGAEYVGEDNARHVPVMLHRAILGSLERFIGILVENYAGALPLWLAPVQAVVLNISEKQADFSAEVVKTLRQAGLRAEADLRNEKITYKIREHSLNRLPYQLVIGDKEKEAGLVAVRTRGGQDLGQMPVASLIERLLEEVAARKSTA >NZ_AP021844|1397060:1410067|1406809_1408135_-|WP_152089525.1|DBSCAN-SWA MKVTVVGTGYVGLVSGTCLAEVGNDVLCLDVDPEKIRILNEGGIPIFEPGLLDMVKRNVAAGRLRFTTDVEQAVAHGTIQFIAVGTPPDEDGSADLQYVLAAARNIGRHMGDYKVVVDKSTVPVGTGDKVRAAIADELKARGADIAYSVVSNPEFLKEGAAVEDFMRPDRIVVGAEDERAIELMRALYAPFQRNHERLIVMDVRSAELTKYAANAMLATRISFMNELANLAEKLGADIEQVRQGIGSDPRIGYHFLYPGCGYGGSCFPKDVQALQRTARLDAGMEMKVLNAVEDANDAQKHVLTAKTVKRFGNDLSGRKFAVWGLAFKPNTDDMREAPSRYLLADLFAAGATVVAYDPVAMHEARRIFGDEPRLSYAESPMAALDGADALLIVTEWKEFRSPDFEAIKAKLKQPVIFDGRNLYEPKQVCQFGIDYQAIGRQ >NZ_AP021844|1397060:1410067|1405737_1406799_-|WP_152089524.1|DBSCAN-SWA MILVTGGAGFIGSNFVIDWLAAGGEPVINLDKLTYAGNLENLQGLAGDPRHRFVRGDIADYDLILELLQQNRVRAVVNFAAESHVDRSIHGPEDFIQTNIVGTFRLLEAVRAYWNGLPADDKAAFRFLHVSTDEVYGSLEKEAPAFTEQHRYEPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYHFPEKLIPLIIHNALAGKPLPIYGDGQQIRDWLYVKDHCSAIRRVLEAGRLGETYNVGGWNEKPNLEVVHTLCTMLDELSPRADGASYASQITFVADRPGHDRRYAIDASKLERELGWKPAETFETGIRKTVRWYLDNPQWVHNVTSGAYREWVGRQYGEA |
14 | Prochlorococcus_phage(20.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_2 |
2569244 : 2576915
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021844|2569244:2576915|DBSCAN-SWA ATCAGGCGGTGATGCGGGCCTGCTTGGCCAGCTTGGCCTTGATGCGGCCCTGTTTGAGCTGGGACAGGTGATCGACGAAAACCTTGCCCTGCAGGTGATCCATCTCATGCTGGATGCAGACGGCCAGCAGGCCGTCCGTTTCCAGGGAACAGGTCTTGCCTTCCAGGTCCAGGTAACGCACGGCGATATGCTCGGCCCGCTCGACCTTGTCGTAAATGCCCGGCACCGAGAGACAGCCTTCTTCACCCACCTGTAGTCCGTCGCGGTGGGTGATTTCCGGATTGATCAGCACCAGGAGTTCGTCCTTGGTTTCCGACACGTCGATGACAATCACCTGCTTGTGTACATCGACCTGGGTGGCGGCCAGACCGATACCCGGCGCCTCGTACATGGTTTCCGCCATGTCCCGGGCCAGGGCACGAATGCCGTCGTCAATTTTTTCGACCGGAACCGCCACTTTTTTCAAGCGGGGATCCGGGAAGCGCAAAATAGGGAGTAAAGCCATAAAAAGCTTGCCCGAATAATCTATTACATGCAGAATTTAAACCAAATCCCTAGATTCGGGACATGGGTGTGGACAACAAAAGGGCCGCCCTTGTTCCGGCAGCGCGAGGACGCCACGATGATTCGACACCTGTTCGGCCGCCCTGGCCTGAACCCCGCCCGCATTATAGCCACCCTTCTGTTGGCTGTCGCCGCCTCCAGCGCCAGTGCCCAGGAATCTCCCCGCCTCGCCGACAACGCGCCGGACCGCCATATCGTGGTGCCGGGCGACACCCTGTGGGGCATCGCCGGCAAGTTCATCCAGGAACCATGGCGCTGGCCCGAAATCTGGCGCCTGAACAAGGACCAGATCAAGAATCCCCACCGCATCTACCCGGGCGACGTCATCGTCATGGTCACCGGCGAGGACGGCAAGCCACAGCTCAAACTGGCCAAGTCGCTCAAGCTGCAGCCGCGCGAGTACAGCGAAGCGGTCAAGAACGAAATTCCCACCATTCCGCAAAGCATCATCGAGCCCTTCCTGTCCCAGCCCCTGGTGGTGGACCCCAGCGCCATGGACAAGGAAGCCCGCATCATCGCCACCCAGGAAGGACGGGTCTATCTTGGTGGCGGTGATCAGGCCTACGTGGTCGGGGTACGGGAGCCTTCCGAATTGTGGCAGGTCTATCGTCCCGGCAAGGCCATGCTCGATCCCGACACCAAGGAAGTGCTGGGCCATGAGGCGTTCTACCTGGGCACGGCCAGGCTGATTCAACCGGGAGAGCCCTCCGTCATGGAAATGGTGGAGGTGAAGCAGGAGGTCGGCAAGTTCGATCGACTGATGCCCGCATCCCGCCCCGAACTGATCACCTATGCCCCACGTCGTCCGGAAGCCAAGGTTGAGGCCCGCATCATCGCGGTGTACGGCGGTGTTGGCACCGGTGGGCGCTACTCGGTGGTATCCCTGTCCAGGGGCAGCCGCGACGGACTCGAGGTCGGCCACGTCCTAGCCTTGCTGCGCAGTGAAAAGGTCTATGAACAGCGCAACGAACAGGGCGAGCGGGAGTTGGTCAAGGTGCCGCCCCAGCGCTATGGCCTGGTCTTTGTCTTCAGGACGTTTGAACGAGTTTCCTACGCTCTGGTCATGGATGCTGCCTTGCCCCTGTCTCTGGCCGATCTGGTACGCAACCCCTGAGCCCCCGCCGTGGCTGACCCGGCCCTCACCGCCTGGCTCCGGCTGACGCTGGTTCCGGGCGTCGGGCCGGAGACCCAGCGCCATCTCTTGGCCGCCTTCGGCCTACCGGAACAGGTTTTCTCCGCCCCCCGCAGTGCCCTCAAGCAGGTCGTCGGCAAGAAGGCCGATCTGCTGCTCGATACGGACAATCAGGAAGCGGTGGACCGGGCCCTGGACTGGGCCGACAAGCCGGGCAACCGCATCCTGACCCTGGCCGATCCGGACTATCCCCAGCTGCTGCTCGAATCCGCCGATCCGCCCAGCCTGCTCTATGTGAAGGGCCGGGTGGAACTGCTAAACCGGCCTGCCCTGGCCATTGTCGGCAGTCGTAATGCCACGCCCCAGGGCCTCAAGGATGCCGAGGCCCTCGCTGCCGATCTGGCGGCCCAGGGGCTGACCATCGTCAGCGGCCTGGCCCTGGGCATCGACGGCGCCGCCCATCGGGGCGGGCTGAAAGGGGAGGGCGGCAGTGTCGCCATCATCGGCACCGGGGCCGATCGCATCTATCCCTCGCGGCACAAGGAACTGGCGCTGCAGCTGGCGACCGAAGGCGCCATCGTTTCCGAATTTCCCCTGGGTACGCCGGCGGTGGCCCACAATTTTCCGCGCCGCAACCGCATCATCGCCGGCATGGCCAAGGGCTGCCTGGTGGTGGAAGCCGCCCTGGAAAGCGGCTCCCTCATCACCGCCCGTCTGGCGGCCGAACTGGGCCGGGAAGTGTTTGCCATTCCCGGCTCCATCCATTCGCCGGTGGCCAAGGGCTGTCACCGGCTGATCCAACAGGGGGCCAAGCTGGTGCAGGAAGCCCGGGACATTGTGGAAGAAATCGGTCCGTTCGACCCCCCGGGCTGCAGGCCAGCAAGGACGCCACTTTCCACCGGCAATACGCCAACAACCGTGCCAATCCTCGATCCTGGCCAGGCTGCCGTACTCGATGCCCTCGGCCACGACCCGGCCAATCTGGACCAATTGCTGCAGCGCACAGGCTTGACGACGGAAGCCCTATGCGCCATCCTCGTGACGCTGGAACTGGCGGACCACGTTGCCAGTCTTCCCGGAGGCCGCTACCAGCGGCTTTCCCCCACCTGATACGTTGCCAATGTTCGACATCCTCGTCTATCTCTTCGAAAACTACGTCGATTTCGCCGACTTCAGCAAGTCCGGCAATCAACCCGATTCACCCGATTCCCAGGCCGACACGGCCCTCAGCCGCAAACTCACTGCGGCCGGCTTCTCCGAAGAAGAAATCAGCGAAGCCCTGGAATGGCTCCAGGGCCTCAAGGCCACCCTGCCGACCCGCCAGCTGCAGGCCGATTCCCGTTCCCTGCGGGCCTACACGCCGGACGAAAGCGCCCACCTGGGTGCCGACGCCCTGGGCTTCCTGCATTTCCTGGAACAGGCCAAGGTCCTTTCCGCCGACCTGCGGGAACTGGTCATCGAGCGGGCCATGGCCCTGCCGGACGACCGGTTGTCCCTGGGGCGCTTCAAGGTCATCGTGCTGATGGTCCTGTGGAGCCAGGAGCAAAACCTGGATACCCTCATCGTCGAAGAACTGCTCTCCGAAGCGGAACCCGAACACCTGCATTAAGTCCATCCGTCGGGCCTGCCAGCCGGGGCGACTGAGGGGTCGCACCAGATGGCGCAGGCAGCATGATGGGCGAAGCACGCAATATGCAGGCTTCGCGCGTCAAATTAGTGGCTGCTTGCTTGCCAAGGCCCGTAAGTCCCTCTTATCATCCCCCGCACCTCCCGACCGGGACTGCCAGAGGCCCCTGCCATGGGCAAACAGCTCATCATTGCCGAAAAACCTTCCGTCGCTGCCGACATCGCCAAGGCCCTTGGCGGTTTCACCAAGCATGACGACTATTTCGAGAGCGACAACTTCGTTCTCTCCTCTGCTATCGGCCATCTGCTGGAACTGGTGATCCCCGAGGAATACGAGGTCAAGCGCGGCAAGTGGTCCTTTGCCCACCTGCCCGTGATCCCGCCCCACTTCGAACTGAAGCCGGTGGAGAAAACCGAGTCCCGCCTCAAGCTGCTGACCAAGCTGATCAAGCGCAAGGATGTGGACGGCCTGGTGAACGCCTGTGACGCGGGCCGCGAGGGTGAGCTGATCTTCAATTACATCGCCCGCCACGCCAAGTCCGGCAAGGCCGTGCAGCGGCTGTGGCTGCAGTCCATGACGCCCCAGGCCATCCGCGACGGCTTCGCCCGTCTGCGCCGCGGCGAGGAAATGCAGGGGCTGGGCGATGCCGCCGTGTGCCGTTCCGAATCCGACTGGCTGGTCGGCATCAACGGCACCCGGGCCATGACCGCCTTCAACTCCAAGACCGGCGGCTTCCACCTCACCACCGTGGGCCGGGTGCAGACCCCAACCCTATCCCTGGTGGTGGAGCGGGAACGCAAGATCCGCGAATTCAAGGCCCGTCCCTACTGGGAAGTGGAGGCCACCTTCGCCGCCGCTGCCGGCGAATACAAGGGCAAGTGGTTCGACGAAGCCTTCAAGGGCAAGGACGAGGACGAACACGCCCGGGCCGACCGCCTGTGGGACGAAGCCCGGGCCAAGGCTCTGCAGGCCAAGTGCGAAGGCCAGCCCGGCGAAGTGAGCGAGGAGGCCAAGCCCTCCACCCAACTCTCGCCCCTGCTCTTCGACCTCACCAGCCTGCAGCGGGAGGCTAACAGCCGCTTCGGCTTCTCCGCCAAGAACACCCTGGGCCTGGCCCAGGCCCTGTACGAAAAGCACAAGGTCCTGACCTATCCCCGGACCGACTCCCGGGCCCTGCCCGAGGATTACCTGGGCACGGTGCAGGCCACCCTGCAGATGTTCAACGGCGAAAACCTGACCAAGGGTTCCGACACCTCCGTAGTGGACCGCTACGGCATCTTCGCCAACAAGATTCTCAAGTCCAAGTGGGTGGTGCCGAACAAGCGCATTTTCAACAATGCCAAGATTTCCGACCACTTCGCCATCATCCCCACCACCCAGGCGCCGAAGAATCTCTCCGAGCCGGAACAGAAGCTCTATGACCTGGTGGTCAAGCGCTTCCTCGCCGTCTTCTTCCCCGCCGCCGAATACCTGATCACCACCCGCATCACCCGGGTCGCCGGCGAACCCTTCAAAACCGAAGGCAAGGTCCTGGTGAATCCGGGCTGGCTAGCCATCTACGGCCGCGAAGGCCAGGAGGGCGACGAGGGCAACCTGGTGGCCGTCTCCCAGGGTGAGAAGGTGCAGACCGAAGAAGTGGCGGTCAATCAGAACGACACCCGGCCCCCGGCCCGCTACTCGGAAGCCACCCTGCTCTCCGCCATGGAAGGCGCCGGCAAGATGGTGGACGACGAGGAACTGCGTGCCGCCATGGCCGGCCGCGGCCTCGGCACCCCGGCCACCCGGGCCCAGATCATCGAAGGCCTGATCACCGAACAGTATCTGCACCGCGAAGGCCGGGAACTGATCCCCACCGCCAAGGCCTTCTCCCTCATGACCCTGCTCAACGGCCTGGGCATTTCCGAACTGACCTCGCCGGAACTGACCGGCGAATGGGAATGGAAGCTGGCCCAGATCGAGCGCGGCGATCTTTCCCGCAGCGCCTTCATGCAGGAAATCGAGGAAATGACCCGGCACATCGTGGACCGGGCCAAGAGCTACGACAGCGACACGGTGCCCGGCGACTTCGGCCTGCTCAAGTCGCCCTGTCCCAAGTGCGGCGGTCTCATGCGCGAGACCTACAAGAAATTCCAGTGCGGCGATTGCGACTACGGCCTGTGGAAGATCGTCGCCGGCCGCCAGTTCGAGCCGGAGGAAATCGAGACCCTGCTCACCGAACGCCAGGTCGGCCCCCTGATGGGCTTCCGCAACAAGATGGGGCGGCCCTTCAATGCCCTGATCAAGCTCAACGACAAGAACGAACCGGAATTCGACTTCGGCCAGGACCGCTCCGGCGAGGACGGCGGCGAACCGGTGGACTTCTCCGGCCAGGAAAGCCTGGGGCCCTGTCCCAAGTGCGGCAGCCCTGTCTATGAGCACGGTCTGGCCTACGTCTGCGAAAAGTCCGTGGGTCCGGCCAAGAGCTGCGACTTCCGTTCCGGCAAGATCATCCTGCAACAGGCGGTGGAACGGGAACAGATGCAAAAACTGCTGTCCACCGGCCGCACCGATCTGCTCAAGGACTTCATCTCCGCCCGCACCCGGCGCAAGTTCTCCGCCTTCCTGGTGAAGGGCAAGGACGGCAAGGTCAGCTTCGAGTTCGAGAAACGGGAGCCCAAGGCCCCAGCGGCGAAAAAAACTGCCGCCAAGGCCGAGCCCAAGGCAGCGGCGGAAAAGCCCGCCAAAGCCCCGGCAAAGCGCAAGGCGAAGGAAGCCTGAGGTTAAGCAAGGCAAAAGAAAAGGGCTCGGTGTGAACCGAGCCCTTTTTGCATCCGGACCGCCGCCAGGCGGCGAAGCGGTGAATTTACTTATTCTTCGAGCAGGACGCGCAGCATCCAGGCGTTCTTTTCATGCACTTCCATGCGCTGGGTCAGCAGGTCGGCCGTCGGCTGGTCGTTGGCCTTGTCCACCACCGCGAAGACCTGGCGGGCGGTGCGGGCCACGGCTTCCTGGCCGGCCACCAGCTGGCGGATCATGTCCTTGGCCTTGGGCACGCCGTCTTCCTCGGTGATGGAAGCCAGTTCCACGAAGCGCTTGTAGGAGCCGGGAGCCGGGTAGCCGAGGGCGCGGATGCGCTCGGCGATCAGGTCCAGGGAGTTCCACAGTTCGGTGTACTGGGTCATGAACATGGTGTGCAGGGTCTGGAACATGGGGCCGGTGACGTTCCAGTGGAAATTGTGGGTCTTCAGATAGAGGATGTAGCTGTCTGCCAGCAGGTGGGAAAGGCCGTCGGCGATTTTCTTGCGGTCTTTCTCGGTGATGCCGATATCGATCTTGGTTGCCATGTGGATTCTCCTTTCAAGCTTCTCAATAACCATGCGCCCATTGTGGCCGATGGCGGCCGCCCGGGCCAATGACTTATCCCTATGGATTGAATAGGGACGGGCAATACAGACGGACTACAGTACCTTAACCCCGGGTGGCGGACATTGCAAAATGGCGGCCCGCACCGCATCGATGGCCTGGGGCCGGGGGAAGGTGACGCGCCAGGCCAGCACGATGCGGCGGGAAGGCTCCGGCGCCTTGAAGGGAATGACCCGGGCCAGGGACGGATCCGGCGGCGCCACCTCGACGGCGGAAGCGGGCAGCACGGCAACGCCAGTGCCGCTGGCCACCATCAGGCGGATGGTCTCCAGGGAGCCGCCCTCGTAGGAACGTTCCAGCCCGCCCGGCTCGGTCAGACGCGGACAGGCGGCCACCACCTGGTCGCGGAAACAGTTGCCCTGGCCCAGCACCAGCAGTTCCTCGCCCTTCAGCTCCCCCGCATCCACCGATTTCCGCTCGGCCCAGGGATGGGCCGCCGGCACCAGCATGCGGAAGGGTTCGTCGTACACCGGCTGGGTGACGATGCCCGGCTCGTCGAACGGCTGGGCCACCACGATCACGTCCAGCTCGCCCCGCTTCAGGGATTCGGCCAGCACGTGGGTGAAATTCTCCTGCAGGTAAAGGTGCATGTGGGACACGGCCTGGTGCAGAGCCGGCACCAGCCGGGGCAGCAGATAGGGGCCGATGGTGTAGATCACCCCCAGGCGCAGGGGCCCGCTGAAGGGATCCTTGCCCCGCTGGGCGATTTCCTCGACCCGCTGGGCCTCGTTCAAGACCTTCTCGGACTGCCGCGCCACTTCTTCGCCGATGGGCGTCAACCGCACTTCGGCGGCGCTGCGTTCGAACAGCAGCACCCCCAGGCGCTCTTCCACCTTTTTCAGGGCCACCGACAGGGTGGGCTGGCTCACGTGGCATTTCTCGGCGGCGCGGCCGAAGTGGCGCTCCCGGGCCAAGGACACGATGTAGCGCATTTCCGTCAGGGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021844|2569244:2576915|2572059_2572548_+|WP_152090173.1|DBSCAN-SWA MFDILVYLFENYVDFADFSKSGNQPDSPDSQADTALSRKLTAAGFSEEEISEALEWLQGLKATLPTRQLQADSRSLRAYTPDESAHLGADALGFLHFLEQAKVLSADLRELVIERAMALPDDRLSLGRFKVIVLMVLWSQEQNLDTLIVEELLSEAEPEHLH >NZ_AP021844|2569244:2576915|2572737_2575323_+|WP_152090174.1|DBSCAN-SWA MGKQLIIAEKPSVAADIAKALGGFTKHDDYFESDNFVLSSAIGHLLELVIPEEYEVKRGKWSFAHLPVIPPHFELKPVEKTESRLKLLTKLIKRKDVDGLVNACDAGREGELIFNYIARHAKSGKAVQRLWLQSMTPQAIRDGFARLRRGEEMQGLGDAAVCRSESDWLVGINGTRAMTAFNSKTGGFHLTTVGRVQTPTLSLVVERERKIREFKARPYWEVEATFAAAAGEYKGKWFDEAFKGKDEDEHARADRLWDEARAKALQAKCEGQPGEVSEEAKPSTQLSPLLFDLTSLQREANSRFGFSAKNTLGLAQALYEKHKVLTYPRTDSRALPEDYLGTVQATLQMFNGENLTKGSDTSVVDRYGIFANKILKSKWVVPNKRIFNNAKISDHFAIIPTTQAPKNLSEPEQKLYDLVVKRFLAVFFPAAEYLITTRITRVAGEPFKTEGKVLVNPGWLAIYGREGQEGDEGNLVAVSQGEKVQTEEVAVNQNDTRPPARYSEATLLSAMEGAGKMVDDEELRAAMAGRGLGTPATRAQIIEGLITEQYLHREGRELIPTAKAFSLMTLLNGLGISELTSPELTGEWEWKLAQIERGDLSRSAFMQEIEEMTRHIVDRAKSYDSDTVPGDFGLLKSPCPKCGGLMRETYKKFQCGDCDYGLWKIVAGRQFEPEEIETLLTERQVGPLMGFRNKMGRPFNALIKLNDKNEPEFDFGQDRSGEDGGEPVDFSGQESLGPCPKCGSPVYEHGLAYVCEKSVGPAKSCDFRSGKIILQQAVEREQMQKLLSTGRTDLLKDFISARTRRKFSAFLVKGKDGKVSFEFEKREPKAPAAKKTAAKAEPKAAAEKPAKAPAKRKAKEA >NZ_AP021844|2569244:2576915|2576003_2576915_-|WP_152090175.1|DBSCAN-SWA MTLTEMRYIVSLARERHFGRAAEKCHVSQPTLSVALKKVEERLGVLLFERSAAEVRLTPIGEEVARQSEKVLNEAQRVEEIAQRGKDPFSGPLRLGVIYTIGPYLLPRLVPALHQAVSHMHLYLQENFTHVLAESLKRGELDVIVVAQPFDEPGIVTQPVYDEPFRMLVPAAHPWAERKSVDAGELKGEELLVLGQGNCFRDQVVAACPRLTEPGGLERSYEGGSLETIRLMVASGTGVAVLPASAVEVAPPDPSLARVIPFKAPEPSRRIVLAWRVTFPRPQAIDAVRAAILQCPPPGVKVL >NZ_AP021844|2569244:2576915|2570930_2572049_+|WP_152090172.1|DBSCAN-SWA MADPALTAWLRLTLVPGVGPETQRHLLAAFGLPEQVFSAPRSALKQVVGKKADLLLDTDNQEAVDRALDWADKPGNRILTLADPDYPQLLLESADPPSLLYVKGRVELLNRPALAIVGSRNATPQGLKDAEALAADLAAQGLTIVSGLALGIDGAAHRGGLKGEGGSVAIIGTGADRIYPSRHKELALQLATEGAIVSEFPLGTPAVAHNFPRRNRIIAGMAKGCLVVEAALESGSLITARLAAELGREVFAIPGSIHSPVAKGCHRLIQQGAKLVQEARDIVEEIGPFDPPGCRPARTPLSTGNTPTTVPILDPGQAAVLDALGHDPANLDQLLQRTGLTTEALCAILVTLELADHVASLPGGRYQRLSPT >NZ_AP021844|2569244:2576915|2569244_2569748_-|WP_152090171.1|DBSCAN-SWA MALLPILRFPDPRLKKVAVPVEKIDDGIRALARDMAETMYEAPGIGLAATQVDVHKQVIVIDVSETKDELLVLINPEITHRDGLQVGEEGCLSVPGIYDKVERAEHIAVRYLDLEGKTCSLETDGLLAVCIQHEMDHLQGKVFVDHLSQLKQGRIKAKLAKQARITA >NZ_AP021844|2569244:2576915|2569865_2570921_+|WP_014235926.1|DBSCAN-SWA MIRHLFGRPGLNPARIIATLLLAVAASSASAQESPRLADNAPDRHIVVPGDTLWGIAGKFIQEPWRWPEIWRLNKDQIKNPHRIYPGDVIVMVTGEDGKPQLKLAKSLKLQPREYSEAVKNEIPTIPQSIIEPFLSQPLVVDPSAMDKEARIIATQEGRVYLGGGDQAYVVGVREPSELWQVYRPGKAMLDPDTKEVLGHEAFYLGTARLIQPGEPSVMEMVEVKQEVGKFDRLMPASRPELITYAPRRPEAKVEARIIAVYGGVGTGGRYSVVSLSRGSRDGLEVGHVLALLRSEKVYEQRNEQGERELVKVPPQRYGLVFVFRTFERVSYALVMDAALPLSLADLVRNP >NZ_AP021844|2569244:2576915|2575412_2575889_-|WP_014235930.1|DBSCAN-SWA MATKIDIGITEKDRKKIADGLSHLLADSYILYLKTHNFHWNVTGPMFQTLHTMFMTQYTELWNSLDLIAERIRALGYPAPGSYKRFVELASITEEDGVPKAKDMIRQLVAGQEAVARTARQVFAVVDKANDQPTADLLTQRMEVHEKNAWMLRVLLEE |
7 | Synechococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_3 |
2967016 : 3023959
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021844|2967016:3023959|DBSCAN-SWA CTCATTGAGCAGTCGTTTCATAGGTTAACCGCTTGCCCTTCACACCCTGAAGCAGTTTTTCAGCCCGCTCCTGATCTTCGATACCGAGCGCCTTGCGATTGTTGTAGCGGAAATCGAATTCTGCAAGGTAGCGGTTCAGATGGTGGTGGCCACAGTGCTGATAGACGCCCTTCATGCCGCGCTTGAAGATCGAGAAGAAGCCCTCGATGGTGTTGGTGTGAATGGTGGGATCAATCTTGGATACGTATTCACCCATGCCATGCCGGGTAAAAGCGTGACCAGCGAAGTGCTGACCTACATTTTTGTACTGGCCGGCTTCGTCGGTCATGATCCGTGCTTCCCGGGCGATGTTTGCTTCCAGGATTGGGATCAGGGTCTTGGCTTTGAGGTCGTCCACCACCATGCTGCGGGCCTGGCCGCTAGTGCGGTCAACCAAGGACAGTACCTTGTTCTTGTGGTCGTAGCCGCGCCCCTTCTTCTCGCCCTTTGGCTTCTTGGTGTAGTCACGCCCAATGAAGGTTTCATCTACTTCCACTGGACCACCTTCGGAACCGAACGGGGAGAAGTCGCCGGAGCGCATGGCTTCCCGAATCCGGTGAGACATGAACCAGGCTGACTTGAGGGTGATGCCCAGGGTGCGGTGAAGCTGATTGGCACTGATTCCCTTCTTGCTAGAAGAGATCAGGAAGATGGCTTGGAGCCACAGCCGCATTGGGATGTGGCTAGACTCGAAGATGGTGCCAACTTTCACGGTGAAGGGCTTCCGGCAGTTGTAGCACTTGTAAGCGCCTATGCGAGTGCTCTTGCCCCCCATCTTGCTTATACGTTCCACACAGCCACAGTGAGGGCAGACCGGGCCACTGGGCCACAGACGGGATTCTACGAATTCGTAAGCAGCTTCTTCGTTGTGGAAGTACGGAGCAGATAGAATGGACGCACCCATGACAGCCTCCTTGAATATGCATTCAGTATAGGAGAGGTAAGTGGGTACGTCAAGTATAATATCGCCCTTTTTTTATGGCCGGACGTCGGCCCGGGCCTGCAGCACGGGATGGGGCGGGGCGGAGGGCGAGGCCAGGGTGCGCAGGGCGCCGTAGGCCAGGAAGGCGCCGAGGAGGGCGCACAGGGTGAAGAACAGCGCCAGCTCGCGGCGCAACAGGCGTTCGATCCTGGGCAGGGTGTCCCTGCCGGGCGGTACTAGCGGCATCTGGTTCATTTTTTTCTCCCTGGACTGGCTGCCCGCCTGCGCCGTCTGGCGGCGGGCCGGCAAGGCCAGTAAATACCCTGTTGCTTACATGGATATTACGGCGAGGGAGGGAGGCCGGTGGCGGCAGACGCGGTGCCGCGTCAGACCTGGGCCAGGATTTCCAGGATGTCCGTGGCTTCGGGCAGGGGTTGGGCCACCAGGAGAAAGTCGGCCATGGTGTCCAGTTCGGCCAGGGCCGCTTCCAGGGCCTCGCGGACTTCCGGGTGGTCGAAACCGTGGCCGGCCTGGGCCCGGACGATGGCCGCCGCCAGACCCTGGGCCAGGGCTTCGGCACTGTCCAGTTCCAGGTGGGCGGCGTGGTCCGCCAGGCAGTGGGCCGATTCGGCCAGGGCTTCCCCATCCGGTTCCAGGCAGGCCAGTTCCTCCCGCATCAGGGCCGCCTGCTCGCCGATTTCGCCGGCAAAGAGGCGCAGCAGGGTGGGCGAGACGGCGGGACGCTGGCTTTCCCGGCATTCGGCCAGGCGCTGGGCGACGTGGGCCACCCGTTCGGCGAAGACCGGCAGGCCCAGGTCCCGGGTGTCGCCGAGCAGTTCCAGGGCGGCGGCGATGGCGGCGCGCAGGTAGGGGTCCTCGGCCGCCTCCGGGGCGTCGAGAAGGTCGCCGACGCCGGCCAGGGATTCGGCCAGGTGCAGGGTCTCCGGCAGGTTGAGGCTCATGGCGGCGGCGCAGAGGTCGATCAGGGCCCGCCGGAAGGGAGCCAGATCGCCGCTGCCGCGCCCGTTCCAGGCGGCCACGGCGGCGTTGCCGGCGGCGCTCCAGGCGGCTACCGCCTCAGCGTCCAGGTCCGGGGCCTCGGGCTTGGCCATCAGGGAAATCAGTTCGGTCACCGGCGGCAGCTGGCCGGCACCCTGGCGGATGCACTGCTGCAGTTCCTCGGCCAGGCCGCCCTCGGGGGCTTCGCTGCCATTCAGGGCGGCCCGCATCTGCAGGTCGATGCGGGCGCAGAGGCGGCGGGATTCCGCATCCACTGCCAGGGTGCCGTCGGAGATGGCGCGCAGGAACACTTCGGCACTGTGCCAGAAGGGGGCGAAGTCGCCGCCGGCGGCGGCTTCCAGGTGGCGCACCGCGGCCCGCATCTCCGGCAGTCCGGCCGGGTCCCCCGGCTGTTGCAGCCAGGCCAGCATGCCCCGCTGGTAACGGCTGCGGGCCTGACGCTGGGAGGGGGAGGGCGGGGCAACGCTCATGGCAGGGGCGGAAGGATCGGTGGATGGGGCATTTTACTCCTCTGCCCGGGGACGTCTGTGGTTAGAATGCGGGCATAAATACAAGCGGAGACATCCGTGTTCCACTCCCCCTCGGCTCCCAGAGACCGCCCGGCCTTGCTGTGGCAGGCCCTCCTGTTTTTTCTGCTCCTGCTGCCTGCCGCCGTCCGGGCCCATCCCCTGGTGCTGGATCAGGACGACGGCAGCTTTGCTCTGGTGCCCCACGTGGAAGTGCTGGAAGACCCGGGCGGCAAGCTCGACCTGGCCGCCGTGCGGCAGGCCGCCGCCGCAGGCCGCTTCGCGCCGGCCCATGCCCTGGGCGAATTGAACTTCGGCTATTCCTCCTCTGCCTTCTGGCTGCGCATTCCCCTGGAGTCCCGCCTGCAGCGTTCCAGCCCGTGGCTGCTGGAGATCGCCTTTCCCTCCCTCGACCGGGTGGAGCTATTCCTGCCCCGGGCCGACGGCCGCGTCGACTACCAATTAACCGGGGACCGCCTGCCCTTTGCCGAACGCCCCTATCCCAACCGCAATCTGGTGCTGCCCCTGGAGCTGGCGCCCGGGGAATCCCTCGCCCTCTATCTGCGGGTGGAGTCGGAGGGCAGCCTGACTTTGCCCCTGACCCTGTGGACGCCGGATGCCTTCCGCCTGCACAACCAGGACGCCTACGCCGGCTTCTCCCTCTACTACGGCATGCTGCTGGCCCTGGGCCTGTACAACCTGCTGCTCTTCTTCGCCCTGCGGGAGCGCATCTATCTGGTCTATGTGGCCTTCGCCGTGAGCATGGCGGTGGGCCAGCTGTCCCTCAACGGCCTGGGCAACGAATACATCTGGCCGGCCTTCCCCGCCTGGGGCAATGTGGCCCTGCCCTCGGGCTTTGCCGCCACCGGCTTCTTCGGCGCCATTTTTACCCGCCTCTTCCTCAATACCCGGCACAGCAATCCCCGGGCCGACAAGCTGATCCTGGCCCTGGCCGCCGGCTTTGCCGTGGCCGCCCTGGGGCCGGCCCTGCTGCCTTACCGCTGGGCCGCCATCCTCACCTCCCTGCTGGGTGCCGCCTTTTCCGCCGTGGCCGTGGCGGTGGGCGTCCATGCCCAGCTGCGGCGCCACCCGGGGGCCCGCTACTTCCTCCTGGCCTGGTCCCTGCTGCTGGTGGGGGTGGGCATGATGGCCCTGCGCAACCTGGGCTGGCTGCCCACCACCCTGTTCACTTCCTACGGCATGCAGATCGGTTCGGCCCTGGAGATGCTGCTGCTCTCCTTTGCCCTGGCCGACCGCATCCAGGCCGAGCGCCTGGCCCGGGAACTGGCCCAGGGCGAGGCTCTCCACAGCAAGCAGGACCTGGTCAACGCCCTGCGCAGCAATGAGCAACTGCTGGAGGCCCGGGTGGCGGAACGGACCCGGGACCTGGCCGCCGCCAACGATCGCCTGCTGGCCAACGAGCAGCAGTTGCAGCGCATGGCGCGGCACGATCCCCTGACCGGGCTGGCCAACCGTCTGCTGCTCGACGACCGTATCAGCCACGGTCTGGCGGTGGGGCGGCGCAACGGCACCCGTCTGGCCCTGCTGCTGATCGACCTGGACGGCTTCAAGCCGATCAACGATAAGCATGGTCATGCCGTGGGCGACCAGTTGCTGGTGGTGCTGGCGGACCGCCTGCAACGCTCGGTGCGGGCGGTGGATACGGTGGCGCGCCTGGGCGGTGACGAGTTCGTGCTGGTGCTGGAGGATCTGGCGGCGGTGGAGGACGGGCGCCAGGTAGCGGCCAAGGTGGTGGCGGAAATGAGCCGGCCGGTGGTGCTGGAGGGGCGGGAACTGCTGGTCTCCGCCAGCGCCGGCCTGGCCTTCTATCCGGAGGACGGGGAGGACGCCCAGACCCTGCTCAGGCGGGCCGACGAGGCTATGTACGAAGCCAAGCGGGCCGGCCGCAACACCTTCCGTCAGGTGGGCCAGTAGGGCGTTGCGGGCAGCGTGAACGTGCAAAAGGCCGCCCGTGGGCGGCCTTTTCATTGCGAAAGCCTGTGGGCGGCGCTGCTCAGCCGGCCAGCTTGGCGAAGGCGTCCACCACTTCGGCGGGAGCCTGCACCAGCTCGATCAGCACGCCTTCGCCGCCGATGGGGGATTCCTCGTTGCCCTTGGGGTGCAGGAAGCAGATGTCGAAACCGGCCGCGCCCTTGCGGATGCCGCCGGGGGCGAAGCGCACGCCGTTGGCCGTCAGCCATTCCACGGCCTTGGGCAGGTCGTCGATCCACAGGCCCACGTGGTTGAGGGGGGTGGTGTGCACCGCCGGCTTCTTTTCCGGGTCCAGGGGCTGCATCAGGTCCACTTCGACCTTGAAGGGGCCCTTGCCCATGGCGCAGATGTCCTCGTCCACGTTTTCCCGCTCGGAGACGAAATTGCCGGTGACTTCCAGGCCCAGCATGTCGACCCACAGGGTCTTCAGCTTGTCCTTGGAAGGGCCGCCGATGGCGATCTGCTGGATGCCGAGAACTTTGAAGGGACGCTGGGACATGGAGAGGCTCCTGAAATCGGTGGAATTAAAAATACATAATTATACAGAGCGGACCGCCGCCCAGGGAGAGGGGCGAGCGGTACTCGGGGCCGGAATCCGGGGTGGAGCCCGGGCGGGCTCCCGGGACCATGTTCCAGGGCGGACACCGGCCCCGGGCGGACGGAGGAGGATGCCGCCTCACGCCACCTGTCCGGCACGATGGGGCGCGAAAAAAAAAGCGGAGCCCCGCCGCCGCAGGTGCTCCGCTTCCCTCTGTAACCGGGCGGACCCGGAAGTGATGGCAATCGCCGTTGCTGCCGGCAGGCGTAACGGTCTGCTTATTTCTTCTTGGCCTTGGCGTCCTTGCTGGCGGCGATGTTGCCGACGAAGGTGTTTTCCACCAGGGGCGGAGCATCCACCTGGGGCACGTGGCACTGCACGCAGTTGTGGCGCAGGTGGGTGACTTCGGCGTGCTGCTTGCCTTCCCGGTCGATGAAGTGGCTCTCGCCGATCTTCGGGGCCTTCTTCTCCTTGTACTTCTCCGGACCGTGGCAGGTCAGGCACTGGTTCTCTTCCAGGGTGATCTCGTCGAAGTTGTCCACCGCATGGGGGATCACCGGCGGCTGCTCCTTGTAGGTGCGGGCGATGGGCTGTTGCAGGCCCGGCTTCTTGCCGGCGTAGGCCTTCACCTCGGGGGCCGGGTCGCCGGCGGGAATGTCGGCGCCGCGCATGGTCTTGGGCGCATCGGCGGCCTGGGCCAGGCAGGCGAAGGAGGCGGCCAGGATGGCCAGGGTCAGTTTGTGCAGTCGGTTCATGGTCGGCTCCTCAATGGATGGTCTTGCGATGGTCTGTCTGGCCTTCGGCCTCCCCGGCCGGGGCGCAGCGCTGGGTGTGCTGGTTGAAGCGGCTGCCGAAGACGAAGACGTCCTTGGCGCAAACGTCGATGCAGCGGCCGCAGTTGGTGCAGGCAGAGGCCAGGATGACCGGGCCGGTGCCGTTGGCCTCGCCCTTGAGGGCGGGGCGGATGACCTGGGGTTCGGGGCAGGCGGCGAAGCAGTCCATGCAGTCGTCGCAGTCCTGGCGCCGCCGGGCGCTGACCCGCAGCAGGCTGGTGCGGCCGAGCAGGCTGTAGAAGGCGCCCACCGGGCACAGGTGACCGCACCAGCCCCGGCTCATGATCAGCAGGTCCAGCAGGAAAATGGCGAGGACCACGGTCCAGGCGGCGCCCAGGCCGAAGATGAGGCCCCGGTGCAGCATGGATACCGGATTGATCAGCTCCCAGGCCAGGCCGGCGCCGGCCAGGGGCAGCAGCAGGGTCAGCCCCAGGATCCAGTAGCGGCTGCGGCGGCTGATGTGGGCGCTGCCCTTGAGACCGAGGCGCTCCCGCAGCCAGCCGGCCAGGTCGGTGACCAGGTTCATGGGGCAGACCCAGGAGCAATACACCCGGCCGCCCACCAGCAGGTAGAAGGCGAGCACGATGGCGGCGCCGAGCAGGGCCAGGCCCTCGGGGCGATGGCCGGAGAACAGCACCTGCAGTACCAGCAGGGGATCGGCCAGGGGCAGGGTGTCCAGGGTCAGGCTGTAGCTGAGGTTGCCCTTGACCAGCCACAGGCCGGCCAGGGGGCCGAGCAGGAACAGGCCCAGGATGCCGAACTGGGACAGGCGCCGCAGCAGCAGCCAGCGGTTGGCGCCGAGTCGGCCCTTGGCGGCCAGGGCGGCGGGGAAACGGGGAGAGAGGAAGCTCATCGCGCCGCCTCGTCGGAAAGCCGGTTGGGCAGGCCGCCACCGGGGATGTTCTGGGGTATGGCCGGAACGCCCGGTCCATGGGCATCGGCACCGGGGATGCTCGGCGCCAGGCTATCGACGCCGCTGCCCGGGGTGGCCGGCTTGCCGGGGACGAGGGACGGGCCTCCCTGGCTGGCCGGGTCGAAGTGGCCCTCCAGGCGGGCGCCTTCAGGCATGCGGTCCGGCAGGTCGCCCAGGCCCTTGTCGTCCACCAGGGAATGGCCGGCCTTCTGTTCTTCCTCCCAGCCGACCCGGTAGTGCTGGCCCAGCTCGCCCTTGGCCAGGGGCACGGGCAGGACCTTGATGGCGGCGGTTTCGAGGACGCAGGAGCGCTCGCATTTGCCGCAGCCCGTACAGTGCTCGGAATGCACCGCGGGAATGAACATGCTGTGGCGGCCGGTGCGGGTGTTGGGGCGCAGCTCCAGGGTGATGGCCTTGTCGATCACCGGGCACACCCGGTAGCAGACGTCGCAGCGCAGGCCGAGGAAATTGAGGCAGGTCTCCTGGTCGAGCAGGACCGCCAGGCCCATGCGGGCCTGGTTGATGTCGGTGAGGCCGTGGTCCAGGGCGCCGGTGGGGCAGGCCTTGACGCAGGGGATGTCCTCGCACATCTCGCAGGGCACCTGGCGGGCGACGAAGTAGGGCGTGCCGGTGGAGACCGGCTGCTCCGGCCGGGCCAGGGACAGGGTGCCATAGGGACAGTCGCGCACGCACAGGCCGCAACGGATGCAGGCACCGAGAAAGTCCTCCTCCGCACCGGCACCGGGCGGGCGCAGGGCTGCCGGCGGCAGGGCCCGGGCCTGCTTGGCGTGGAAGCCCAGACCGAGACCCAGCAGCCCGACGCCGCAGGCCATGCGGCCGGCGTCGGCGAAGAACTGGCGCCGGGCCGCCGCAGCCTTGTCGGACTTGGCAGGAGGGGAGTTGGAAAGATCGCTCATGGCGTGACCGGCGGCGGGAACCGACTGGGGGCGGACTTCGCTGTCCGCCCCGCCGGCCTCCCTGATGCCTTATCCGGTTGTGTAGGTGCTTAGGCCTTGACCACCTTGCAGGCGCACTTCTTGAAGTCCGTCTCTTTCGAGATGGGGCAGGTGGCGTCCAGGGTCAGCTTGTTCACCAGCCGGTGCTCGTCGAAGAAGGGCACGAAGACCAGGCCCAGGGGCGGCTTGTTGCGGCCCCGGGTTTCCACCCGGGTGGAAATCTCGCCGCGGCGGGACTGCACCTTGACCGTGTCGCCGCGCTGCAGGCCGCGCTTCTTGGCGTCTTCCGGATGCATGTAGATCCAGGCGTCGGGCATGGCCTTGTACAGCTCGGGCACGCGCCGGGTCATGGAGCCGGTGTGCCAGTGCTCCAGCACGCGGCCGGTACAGAGCCACAGGTCGTACTCGGCATCCGGCTGCTCGGCGGCAGGCTGGTAGGGCAGGGCGAAGACCACCGCCTTGCCGTCGGGGAAGCCGTAGAAGCGCACCTTCTCGCCGGCCTTGACGTAGGGGTCGTAGCCTTCGCGGAAGCGCCACAGGGTTTCCTTGTTGTCCACCACCGGCCAGCGCAGGCCGCGGGCCTTGTGATAGGTGTCGAAGGCGGCCAGGTCGTGGCCATGGCCGCGGCCGAAGGCAGCGTACTCCTCGAACAGGCCCTTCTGCAGGTAGAAGCCGAGGACCTTGCCTTCCTCGTTCTCGAAGCCCTTCAGCTGGTCGGAGACGGGGAACTTGTTCACTTCGCCGTTGGCGTAGAGCACGTCATAGAGGGTCTTGCCCTTGTACTCGGGGGCCTTGTCCAGCAGTTCGGCGGGCCACACTTCCTCCATCTTGAAGCGCTTGGAGAATTCCACGTACTGCAGCACGTCGGAGCGGGCCTGGCCCTGGGGCTTGACCTGCTGGCGCCAGAACTGGGTGCGGCGCTCGGCGTTGCCGTAGGCGCCTTCCTTTTCCATCCACATGGCGGAGGGCAGGATCAGGTCGGCGGCCAGGGCGGAGACGGTGGGATAGACGTCGGAGTGCACCACGAAGGCGGCCGGATTGCGCCAGCCCGGGTAGACCTCGCCGTTGATGTTGGGGCCGGCCTGCATGTTGTTGGTGGTGGTGGACCAGAAGAAGGCCACCTTGCCATCCTTCAGGGCGCGGCTCTGGGCCACGGCGTGCAGACCCACCCAGTCGGGGATGGTGCCGGCGGGCAGCTTCCACAGCTTCTCGGTGATTTCCCGGTGCTTGGGATTGACCACCACCATGTCGGCGGGCAGGCGATGGGCGAAGGTGCCCACTTCCCGGGCCGTGCCGCAGGCGGAAGGCTGGCCGGTGAGGGAGAAGGGGCCGTTGCCAGGCTCGGAGATCTTGCCCACCAGCAGGTGCACGTTGTAGATCATGTTGTTCACCCAGGTGCCCCGGGTGTGCTGGTTGAAGCCCATGGTCCAGTAGGAGACGACCTTCACCTTGGGATCGGCATAGGCCTTGGCCAGGGCTTCCAGGTTTTCCTTGGGCACGCCGGAAATCTCGTGGGTCTTGTCCAGGGTGTACTCGGCGACGAAGGCCTTGAACTCGTCGAAGGAGATGTCGGTGGCCTTGTTGGGGTCGCCCTTGGGTTTGCCGTCCGGGCCCGGGTAGCCGTTGTTGCCGGCGGCCTGTTCCAGGGGGTGGTTGGGACGCAGGCCGTAGCCGATGTCGGTGACGCCCTTCTTGAACTTGACGTGGTTCTTGACGAAGTCCTGGTTCACCGCGCCGTTCTGGATGATGTAGTTGGCGATGTAGTTGAGGATGGCCAGGTCGGACTGGGGCTTGAAGATCAGCTCGTTGTCGGCCAGCTCGCAGGAGCGGTGGGTGAAGGTGGACAGCACGTGAATCTTGACGTGCTTGGCGTTGAGGCGGCGGTCGGTGATGCGGGACCAGAGGATGGGGTGCATCTCCGCCATGTTGGAGCCCCACAGGGCGAACACGTCGGCGTGCTCCGCATCGTCATAGCAGCCCATGGGCTCGTCGATGCCGAAGGTGCGCATGAAGCCAGCCACGGCGGAAGCCATGCAGTGGCGGGCATTGGGGTCCAGGTTGTTGGAGCGGAAACCGGCCTTCCACAGCTTGGCGGCGGCATAGCCTTCCCAGATGGTCCACTGGCCGGAGCCGAACATGGCGATGTTGCGCGGGCCGCCGGCCTTGAGGGCTGCCTTGCACTTCTCGGCCATGATGTCGTAGGCCTGGTCCCAGGAAATGGGGGTGAAGTCGCCGTTCTTGTCGTACTGGCCGTTCTTCATGCGCAGCAGGGGCTGGGTCAGGCGGTCCTTGCCGTACATGATCTTGGAGAGGAAGTAACCCTTGATGCAGTTGAGGCCCCGGTTTACCGGCGCTTCCGGATCGCCCTGGGTGGCCACCACGCGGCCGTCCTTGGTGCCCACCAGGACACCGCAGCCGGTGCCGCAGAAGCGGCAGACGCCCTTGTCCCAGCGGATGCCGTCATTCTTCGGCTGCTGCGCCAGGGCTTCGGATACGCCGGGCACCGCCATGCCGGCGGCGTTGGCGGCGGCGGCGACGGCGCTGCTCTTGATAAAGTCGCGACGGGTCAGGTTCATCTCAGGACTCCTTCTCGGGATCAGGTTCGAAGTGGTGATACACCATGGCCAGGGACATGACGCCCGGCAGCTGCTGTATCGCCTCGTAGGTTTGGGTGGTTTCCCGGTCGCCGTCGCTCTCGATGGTGACGATCATGCGGCCCTCTTCGGAGACGGCGTGGACCTCCACGCCCGCCAAGGTCGCCAGACCCGCCTCGACGGCCGCGATCTGCTGCGGCCCGGCGTTGACCAGAATGCTGGAAATATTCACGGATGATTCCCCCAATCGGCAGGCACGGCTCGAAGGGCCGGCTCAAGGTTCGACCACTCTATGCCCCTACGGAAACCCAGGTTTATGATGTGAATCAAAAGCGAAAAAAAGCCCCCGCAGGATGTAGGTGATTTGAAAAGTCACCGGGCTTCGGTTAGCCTGATCGGGCTTTGCCTGGTGCGGTTTTCCGCCGCATCGGGGTTAATGAGAATGACCCACAACAAGGCCCGCCCCATGTCGGCAGGCCGGCGGTTCGAGTAACAGCCCCGATTCCCCAGAAGGCTTGTTTTCCCACCCACCCATGTCCCGGCCCCTGCCCATTCTCTCGACGCCCGCAGCCGCCCCCCCGGAGGCCGCTCCCCTGACCCGCTACAGCCCCATTCCCTGGGGCCTGGTGATCGTCCTTTCCCTGCTTTTCGTCGTGGTCTGGCTGCTGCCGCCCCTGGGCGGACTCAAGCAGAGCGACACCATCTTCCCCCTGACCCTGCACACGGTGATGGAGAGCTTCTCCTTCGTCGTCTCCGTGCTGGTCTTCGCCGTATCCTGGCATGCCTACAGCCGGGAGCGGGCGGGCAACCTGATGATCCTGGCCTGCGGCTTCCTCGCCGTGGCCCTGCTGGATTTCGGCCATACCCTCTCCTACCGGGGCATGCCCGACTTCGTCACCCCCTCCTCGCCCCAGAAGGCCATCATCTTCTGGCTGGCGGCCCGCTATGTGGCGGCCCTGACCCTGCTCACCATCGCCCTGCGCCCCTGGCAGCCCCTGGCCCGCCCCCGGGACCGCTACCGGCTGATGCTGTGGGCCCTGCTGGTGACCGCCGCGGTGTTCGTCTCGGAGCTTTACCTGCCGGATTTCTGGCCCACCATGTTCGTGCCCGGGGTGGGCCTCACCGGTCTCAAGATCGCCGCCGAATACGGCCTCATCGCCATCCTCGGCGCCACCGCGGTGATCCTCTATCCGAAGACCCAGGGCAAGCCCGCCTTCGACGCCGCCAACCTGTTTACCGCGGTGCTCATCACCATCCTCTCGGAGCTGTGCTTCACCCTGTACTCCAACGTCAATGACGTGTTCCAGCTGCTCGGCCACACCTACAAGGTCATCGCCTATTTCTGGATCTACAAGGCAGTGTTCGTCTCCAGCGTGCGCGATCCCTACCTGCGCCTGAGCCTGGAGATGGCCGAGCGCCAGGCGGCGGAGGCGCGCATCCAGTTCCTCGCCTACCACGACCCCCTGACCGAACTGCCCAACCGCATCCTGGTGCGGGAACGTTTCGAGCGGGCGGTGGAGCGGGCCCGGGACCAGTCCTCCCGGGTGGGCCTGGTCTATATCGATCTGGACAATTTCAAGACGGTGAACGACTCCCTCGGCCACACCCTGGGCGACCTGCTGCTGCAGGCCATCGGCCAGCGCCTGCAGTCCCTGGTGCCGGCCGGCAGCACGGTCAGCCGCCAGGGTGGCGACGAGTTCCTCATCCTGCTGGAAGACCTGGAGCAGTCCCGGCTGGCGGAGAGCCTGGTGAGCCGCATCGTGGAGCAGATGCAGGCGCCCTTCGAAATCCAGGGCCACGACCTGTCCACCTCCGTTTCCATCGGCGTTTCCCTTTTCCCCGACGACGGCGGCGATTTCGACACCCTGCTGAAAAAGGCGGATACGGCCATGTACCGGGCCAAGGGCGCCGGCCGCAACGGCTACCGCTTTTTCGACCGGGAAATGGACAAGGACGTGGGCGAGCGCCTGCGCCTGAGTAACGACCTGCGCCTGGCCCTGGCGCGCAACGAGTTCGTGCTGCACTACCAGCCCCAGATCGATTTGCGCACCCAGGAAGTGATCGGTGCCGAGGCCCTGATCCGCTGGCAGCATCCGGAACTGGGCCTGCTGGCCCCGGGCCGCTTCATCGGCATCGCCGAGGACACGGGCCTGATCGTGCCCATCGGCGAATGGGTGATCCGCATGGCCTGCCATCAGGCCGCCGCCTGGCAGCGGGCCGGCCTGCCGCCCCTGGTGGTGGCGGTCAATCTTTCCGCCGTGCAGTTCATGCGCGGCGACCTGGTGGGCACGGTGGCCAGCGCCCTGGCCACCTCCGCCCTGCCTTCCCGCTGTCTGGAACTGGAACTGACCGAATCGATCCTGATCCAGGATGCGGAGAACATCCTGGGCACGGTGCAGCGCCTCAACGCCATTGGTGTGCAGATGTCCATCGACGACTTCGGCACCGGCTATTCCAGCCTTTCCTACCTGAAGCGCTTCGCCGTGGACAAACTGAAGGTGGACCAGTCCTTCGTGCGCGACCTCTGCAGCGATCCGGACGACGCCGCCATCGTGCGGGCTATCATCCAGCTGGCCCGCAGCCTGGGCCTGAAGACCATCGCCGAAGGGGTAGAAACGGCGGAAATCCTCGCCCTGCTGCAGGAGCTGGGCTGCGACGAAGCCCAGGGCTACTACTTCGCCAAGCCCCTGCCGGCGGACAACTTCAGCGCCTTCCTCAGCCAGCGCCTGTCCTGACGGTCCGGCGCTGCCGCAGCGCCGGCAGCCCCCATTTCCCCCAACTGAAAACTACTGCCGCACTTCCCGGATGTTGTCGGGCGGCAGGAACTCGCCGCTGCTGCGGAAGGGATTGATGTCCAGGCCGCCGCGCCGGGTGTAGCGGGCATACACGGCCAGGGACTGGGGCGCGCAGCGGGCGCTGATGTCGCAGAAGATGCGCTCGACGCACTGCTCATGGAACTCGTTGTGGCCGCGGAAGGACACGATGTAGCGCAGCAGGGCGGCGCGGTCGATGGCCGGGCCCCGGTAGCGCACCACCACCATGCCCCAGTCGGGCTGGCCGGTGACCAGGCAGTTGGATTTGAGCAGGTGGGAATAGAGGGTTTCCTCGACGCTGCGGCCGGCATCGGCCTGCAGCAGTTCCGGCGCCGGCTGGTAGCGGTCGCACTCGATGTCCAGCTCGTCCAGCAGGATGCCCGAGGGGTAATCCACCCGGGGCCGCGGCTGGGCAGCCAGAGGCTCCAGCTGCACCGTCACCTGGCCGCCGGCGGCGGCGGAAAGATCCCGGACCAGGGTGGCGGCCACCGTTTGCGCGTCGGCGAAGGCGCTCTGGTTGAAGGAATTCAGGTACAGCTTGAAGGACTTGGATTCGATCAGGTGCGGCGTCTGCGCCGGGATGCGGAAGGTGCCCAGGGCCACCACCGGCTTGCCCCGGGGATTGAGCCAGGAAATCTCGTAGGCGTTCCACAGGTCCTCGCCGACGAAGGGCAGGCGGGCCGGGTCGATGCCGATCTCGTCCCGCTTCAGCTGGCGGGGAATGGGAAAGAGCAGTTCCGGGGCGTAATGGCAGCGGTACTCGCTGGCCTTGCCGAGAGGGGAGAGGGCGCTGGGATCGATGGTCATGGGGCGGATTTTACCCCCAGGCGGAGGCAGGCCCCAGCATCGGCCGTGAGCGCCGCCTAGAGCAGCACGTAGCCCTGGGGCTGCAGCAGGGGCGGCAGGTCGGTCTCGCCCAGGGCTTCGCCCAGGTCCCCTTCCAGCATGCGCACCATGGCGTCCAGGGGCAGGTCGTTTTCCAGGGTGCCGAAGGGCTCTTCCAGCTCGTCCCCCAGTGCGTCCAGGCCGAAGAAGGTGTAGGCCAGCACCGCCGTCAGCAGGGGGGTGGCCCAGCCGACGGAGCGGGCCAGGCCGAAGGGCAGCAGCAGGCAGAAGAGGTGGGCGGTGCGGTGCAGCAGCAGGGTGTAGGCGAAGGGCAGGGGGGTGAAGCGGATGCGCTCGCAGGCGGCCTGGATGCCGGAGAGGGCGTGCAGGCGCTGGGTCAGGCCCTGGTAGACGATGTCGCCGAGGCCGTCCCGTTGCCGGGCCTGGACCAGGTCGTGGCCGCACTGGCGCAGCAGGGCATCGGCCGGATTGCGGCTCTGGGCAAGGCGCTCGGCCTCGCTGGGAGGCAGGAAGGGTGCCGCTTCCAGGGCTGCGTCCCGGCCCCGCAGGCGGGCGGCCAGGGCGTGGGCGAAGGCCAGGCTGCGGCGCACCAGGAGGCGCCGGGGTTCCGCCTCCAGCACCAGGCTATCCCGGGCCAGGGAGCGCAGTTCGACGATGAGGCCGCCCCACTGCTTGCGTGCTTCCCACCAGCGGTCGTAGCAGGCGCTGTTGCGAAAGCCGAGGAAGATGGAAAAGGCCAGGCCGAGCAGGGCGAAGGGGGCGGCGGAATAGTCGGGGAAGAGGTGGCCGAAGTGCTGGGCGCCCCAGGTGATGAGCACGGCGAAGCTGGTGGTGAAGACGATCTGGGGCAGCACGTGGGGCACCACCGAGCCGCGCCAGATGAAAAAGAGACGGAGCAGGCTGGGGCGTTCGCGGACGATCATGTCGAGGCAGTCGGTGGAGTGGTGGTCGGGGTGGCGCTCGCCAGTGCGGCCAGGCCGGGCAGGGCGGCGCGGGCGTTGGCGCCGGTGGCGGCGATCAGCTCGGCTGTGGGCATGCCCCGCAGCTCGGCCAGGAGGGCGGCGAAGCGGGGCAGGTAGGCGGGCTTGTTGCGCCGGTCCGGGCTCGCCGCGGTGAGAAAGGCCGGGGGAATGTCGGGGGCGTCCGTTTCCAGGACCAGGGCTTCCAGGGGCAGGGTGGCGGCCAGTTCCCGGATGCGGGTGGAGCCGCTGAAGGTCATGGCGCCGCCGAAGCCCAGCTTGAAGCCGAGCTTGATGAATTCGTCGGCCTGCTGCCGGCTGCCGTTGAAGGCATGGGCGATGCCGCCCCGGGGCCGGTAGCGGCGCAGCTGCTTGAGAATGGGGTCCAGGGCCCGGCGCACGTGGAGGATCACCGGCAGGTCGAACTCCACGGCAAGCTGCAGCTGTTCGGCGAAGAAGTGCTGCTGCCGCGCCAGGGCCTCGCCCTGCTGCAGTTCGGGCACGTACAGGTCGAGGCCGATCTCGCCCACCGCCAGCGGCGCCAGGGGGCCATCCCGCTCCTCGGCCAGCCAGCGGCGCAGGGTGGAGAGGTCTTCCTCCCGGGCCGCCGGCGTGTACAGGGGATGGATGCCGTAGGCCGGCGCGCAGCCCGGGTAGGCGAGGCAGCAGGCGCGCACCTCGGCGAAGGTGGCGGCGGCCACTGCCGGCACCACCATGGCCTGTACACCAGCAGTGACGCCATCCTGGAAGATGGCCTCCCGGTCAGGGGCGAATTCCGCCGCGTCCAGGTGGCAGTGGGTGTCGATCAGCATGGAATCTGGCTGTCCGGGCAGGCCGGGGGGAGAGTCCCGCCCTGCCTTACTTCTGGCCGAAGGCAGGTTTGCGCTTTTCCAGGAAGGCGCCCAGGCCTTCGGCGAAGTCGGGATGGACGGAGCAGGCGGCGAAGTTGCCCTGTTCGGCGAACAGCTGCTCCGGCAGGGAATTGCCGCTGGAAGCCTGCAGCAGGGCCTTGGTGCGGGCCAGGGCCTGGCGCGGACCGGCGGCCAGGCGGCGGGCCAGCTTGGCGCTCTCGGCCTCCAGCTCGGCGGCGGGCACCACCCGGTTGATGAGGCCCCACTCCCTGGCCTGGGCGGCATCGAAACGGTCCCCCAGCAGGGCGATCTCGGCGGCCCGCTTGGCCCCCACGGCCCGGGGCAGGAACCAGGTGGCGCCGCCGTCGGGGGAAAGGCCGATGTGGCAGTAGGCCAGGGTGAAATAGGCGTTGTCGGCGGCCACCGCCAGGTCGCAGGCCAGCATCAGGGACAGGCCGAAGCCGGCGGCGGCACCGCTGACCGAGGCCACTACGGGTTTGCCCATGCGGCGCACCTGCAGGGTGGTGGCGTGCACGGCGGCGATGGTCTGTTCGAAGAGGGCCTGGCGTTCCGCCGGGGGCAGGGCCAGCTGGCTGTGGAACCACTTGAGGTCGCCGCCGGCCATGAAGTGCTCGCCGCCGCGCAGGACCACGGCGCCGACGGCCTCGTCGTGCTCGGCCCGGGCGGTGGCGGCGCGCAGGTCCTCGATCATGGCCAGGTTGAGGGCGTTGAGGGCCTCGGGGCGGTTCAGGGTCAGGGTCAGGACCCCGTCCTCCAGGTGGGAAAGCACGGTGCTCATGGGTTGTCTCCTTTGGGTGGTGGTTTTTATGGTTTTATTGCTTTGTCGTTCCGAGGAACGGGAAATCCGGTTCCGGCCGGCGGCCGGAAATGAGGTCCGCCAGGGCCGCGGCGGAGCCGCAGGACAGGGTCCAGCCCAGGGTGCCGTGGCCGGTGTTGAGCCACAGGTTGGGCAGGCGGGTGCGGCCGATGAGGGGCACGTTGGAGGGCGTCACCGGGCGCAGGCCGCACCAGTAGAGCGGGTCGCCGTCGGGGCGCAGCTGGGGGAACAGTTCCAGGGCGCGCCGCAGCAGGGCCTCGCAGCGCACCGGGGTGAGCTCCAGGTTGTGGCCGTTGAACTCCGCCGTGCCGGCCACCCGCAGGCGGTTGCCGAGGCGGGACATGACGATCTTGCGTTCGTCGTCGGTGATGCTGACGCTGGGGGCGACGCTGTCCGGGGAGAGGGCGATGGTGGCGGAATAGCCCTTGCCCGGATAGACGCAGGCCTTGACCCCGGCGGGCTTGAGCAGGGCCGGGGAATAGCTGCCCAGGGCCACCACGTAGGCGTCGGCCAGGAGCAGGTCGCCGCCGGCGACGACGCCGGCCACCCGGCCGCCGGCGCTGGCAATCTTCTCCACCGGGCAGTTGTAGCGGAACTGCACGCCCCGGGCGGCGGCGGCCTCGGCCAGGCGCTGGGTGAAGCGGTGGGCGTCGCCGGATTCGTCGCTGGGGGTGTAGTCGCCACCAGCCAGGCGCCCTTGCACCGCGGCCAGGGCCGGCTCGATGGCGACGCAGCGGGCGGCGTCCACCGGCTCCCGGTCCACCCCGAACTCCCGCATCAGGGCGGCGGCGTGGCAGGCGGCCTCGAACTCGGCGGCCTGGGTGAAGATGTGCAGGATGCCCTGGCAGCGCTGGTCGTAGTCCAGGGGCAGGGTCTGGCGCAGGGCCTGCAGCCGCTGCCGGCTGTAGAGGGCGAGGGCGATGATGTCGCGGATGTTGCGGCGGGTGGCCCCGGGCGGGCAGTTGGCGAGGAAGCGCAGGCTCCAGGCGAAGAGGGCCGGGTCGTAGCGCAGGCGGAAGAGCAGGGGGGCGTCTTCCTTGCCCAGCCACTCCAGGGCCTTGAAGGGGGCCCGGGGATTGGCCCAGGGCTCGGCGTGGCAGACGGAAATCTGGCCGCCGTTGGCGAAGCTGGTTTCCAGGGCGGCGCCGGGCTGGCGGTCCACCACCGTGACTTCGTGGCCGGCCTCGGCCAGGAACCAGGCACTGGTGACGCCGACGACGCCGGCGCCGAGGACGAGAACACGCACCCGGCCTATTCCTCCTGGCGTTCCCGGGTGATGCGCACCACTTCGGGAATCAGGCGCACGGCCCGCAGGACCCGGGCCAGGTGGGCCCGGTTGGCCACCTGCACGGTGAAGTTGAGGGTGGTGTAGAAGCCCGGATCGGGGGCCATGGAGACCTTCTCGATGTTGGAGCCGGACTCGGCGATCTCGGTGGCCACCTTGGCCAGCACGCCGCGGGCGTTGCGGGCGGCGACGTGGATGTCCACGTCGAACAGCTTGCCCGGTTCCGGTTCCCACTCCACGTCGATCCAGCGCTGGGGTTCGGCACTGCGGGACTTGCGGATGACGGCGCAGTCATGGGTGTGCACCACCAGGCCCTGGCCCTTCTTGATGGAGCCGATGATGGGGTCGCCCGGGATCGGGCGGCAGCAATGGGCCAGCTGGATGGCCATGCCCTCGGTGCCGCGGATCACCACCGAGGTGTGGGGTGCCGGTTCCGCGTTGGGCAGGGCCGCCTCGTGGGCCAGCAGGCGGCGCGCCACCACGGCGGCCAGGCGCTTGCCCAGGCCGATGTCGGTGTACACCTCCTTGACGGACTTGCTGCCCCCTTCCTTGAGCACCGCTTCCCAGCTGGCGTCCGGCAGTTCCGAGGGCGTGATGCCGAGGCCGAACAGTTCCTGGTTGAGCAGGCGCTCGCCCAGAGCGGCGGATTCCTCGTGCTGGCGGGTCTTCAGGAAGTGGCGAATCTTGCTGCGGGCGCGGCCGGTCTTCACATACGAGAGCCAGGCCGGATTGGGATTGGCATGGGCGGCGGTGACGATTTCCACCTGATCGCCGCTGTTGAGCTCGCTGCGCAGTGGCATCAGCTCGTAGTTGATCTTGGCGGCGACGCAGCGGTTGCCCACGTCCGTGTGCACCGCATAGGCGAAGTCCACCGGGGTGGCGCCCTTGGGCAGGGAGAATATCTTGCCCTTGGGGGAGAAGACATAGACCTCGTCGGGGAAGAGGTCGATCTTGACGTGCTCGAAGAACTCGGCCGAGTCGCCGGCGGTGCTCTGCAGCTCCAGCAGGGACTGCAGCCAGCGGTGGGTCTGGTACTGCAGTTCGGCGGCGCTCTTCTCCGTGTCCTTGTACAGCCAGTGGGAAGCCACGCCCTCCTGGGCCATGTGGTGCATTTCCTCGGTGCGCAGCTGCACTTCCACCGGCATGCCGTAGGGGCCGATCAGGGTGGTGTGCAGGGACTGGTAGCCGTTGGCCTTGGGGATGGCGATGTAGTCCTTGAACTTGCCCGGCAGGGGCTTGTACAGGGCGTGCAGGGCGCCCAGGCCCAGGTAGCAGCTGGGCACGTCCTTGACCACCACGCGGAAGCCGTAGATGTCCAGCACCTGGGAGAAGGAGAGGCGCTTTTCCACCATCTTGCGGTAGATGGAATAGAGGCTCTTCTCCCGGCCGAAGACCTGGGCCTCGATGCCCGAGTCCCGCATCTTGCTCTGGACCCCGTCGAGAATCTTCGACAGCACCTCGCGCCGGTTGCCCCGGGCCGCCATGACGGCCTTCAGCAGCACCTGGTAGCGCATCGGGTGGGTGTGTTTGAAGGAGAGGTCCTGCAGCTCCCGGTAGACCGTGTTCAGCCCCAGCCGGTTGGCGATGGGGGCGTAGATCTCCAGGGTCTCCAGGGCGATGCGGCGGCGCTTGTCCGGACGCATGCAGCCCAGGGTCTGCATGTTGTGCAGACGGTCGGTGAGCTTGATGAGGATGACCCGCAGGTCCTTGGCCATGGCCAGGAGCATCTTGCGGAAGTTTTCCGCCTGGGCTTCCTGGTAGGAGGAGAACTCGATCTTGTCGAGCTTGGAGAGGCCGTCCACCAGGTCGGCCACGCCCTTGCCGAAGCGTTCGGTCAGCTCCTCCTTGGAGATGCCCGTGTCCTCCATGGTGTCGTGCAGGAGGGCGGCGATGATGGCGGTGGAATCCAGCCGCCATTCGGCAATGGCCCCGGCCACGGCCAGGGGGTGGGTGATGTAGGGTTCGCCGGAAAGGCGCTTCTGGCCCCGGTGGGCGGCTTCGCCGAAGGCAAAGGCCTCCTTGATCTTGGCGATCTCTTCCGGTTTCAGGTAGTCGAGGCTGTCCAGGAAGACCCGGTAGGCCGGGTCGTCGTTGAACGGGTAGGGGGTGGGCGGCGCCGCCGGGTCAGGTGCCGGCGCGAAGGGCGCGGCGGAAGATGCAGACGGTTTGGCCGGGGCGGGGTCGGTTGCAGTATCCATACCGGTTCACCTTACCCGCCCGGGCCCGGGGTCAGGCCTGACCGCGGTTGAGGATTTCCAGGCCGATCTGGCCGGCAGCCAGTTCGCGCAGGGCGATGACGGTGGGCTTGTCCTTGCTCGGTTCCTGCATGGGGGTGGAACCATTGGCGATCTGGCGGGCCCGGTAGGTGGCGGCCAGGGTCATCTGGAAGCGGTTGGGGATTTGTTTCAGGCAGTCTTCAACGGTAATGCGGGCCATGGTCCATCCAATCAAAAAGCGGAAAATTTAGAGCAGCGAGGCGAACAGCGAAGCGTGGCGTTCCTGCTGCACGGGAAGCTTCAAGCGCGTTGCGCGCACCACGGCCAGCAGGTCGCTGAGGGCCGTCTGCAGGTCGTTGTTAATAATAACATAGTCGAATTCCCCCACATGCCGCATCTCGTCGCGGGCAGCGGCCAGGCGGCGGGCGATGACGTCCTCGCTGTCGGTGCCGCGGCCGGCCAGGCGGCGGGCCAGTTCTTCCATGGAGGGCGGCAGGATGAAGACGCCGATGGCGTCGCCGAACACCTTGCGCACCTGCTGGGCGCCCTGCCAGTCGATCTCCAGCAGCACGTCGCGGCCGGCGGCCAGCTGCTGTTCGATCCAGGTGCGCGAAGTGCCGTAGTAGTTGCCGTGCACCTCGGCCCATTCGAGGAACTCGCCCCGGTCCACCCGGGCCAGGAAATCGGCCACGTCGGTGAAGTGGTAGGCCTGGCCGTTCTCCTCCCCGGTGCGGGGCGCCCGGGTGGTGTGGGAGACGGAGAGGCCGATGGCCGGGTCGTTCTGCAGCAGCAGGCGGACCAGGGTGGTCTTGCCGGCGCCGGAGGGGGCGGTGACGATGTAGAGGTGGCCGCTCATGCTGGCATCCCTTTCCTTATTCGATGTTCTGGATCTGTTCGCGCATCTGCTCGATGAGCAGCTTCAGGTCCATGGAGGCCTTGGAGACTTCGCTGAGGACCGACTTGGAGCCCAGGGTGTTGGCCTCCCGGTTCAGTTCCTGCATGAGGAAGTCGAGGCGCTTGCCGGCGTTGCCGCCGGCCTTGAGGATGCGCTCCACCTCGGTGAGATGGGCCTGCAGGCGAGACAGTTCCTCGTCCACGTCGATACGGGTGGCATACAGCACCACTTCCTGGCGCACCCGTTCGTCGTCGGCGCTGCCCAGGGCCTCCACCAGGCGCTGCTTGAGCTTGTCCTGGTAGGCAGCCTGGGCCTGGGGAATGAGGGGGGCAACGGCGGCCACGGTGGCGCGGATCTTGTCCACCCGCTCCTGGATCATGGCGGCCAGCTTGGCGCCTTCCCGGGCCCGGCTGGCGGTGAAGTCCTCCAGGGCCTCCTTCAGGGTGGCCTGGACGGCGGCGTGCAGGGCGGCCGTATCCACCTCCGGTTCGCCCAGCATGCCGGGCCAGCGCAGCACTTCGGCCACCGACAGGGCGGCGGCGTTGGGCAGGGTCTGGCGTACCTGGCCTTCCAGGGCCTGCAACTGGGTCAGCAGGTCGGCGTTGATGGCCAGCTGGCGGTTCTGGCTCTGGCTGGCGACCAGGTTGAGGCGCAGTTCCACCTTGCCCCGGGCCAGCTTGGCGGTGATGGCTTCGCGCAGGGCCGGCTCCAGCACCCGCAGGTCGTCCACGATGCGGAAATGGATGTCGAGGAAGCGGGAATTGACGCTGCGCAGTTCCAGGTGCAGGGAGCCGCCTGCCACTTCCCGGGTTTTGGCGGCATAGCCGGTCATACTGTAGATCATGGAATTCCTTGCTGTGTGGGGTGTGCCGGCCCGGACGGCGGGCCTGGGAGAGGCTTCTTGAGGCTTCTTTACAGTCCGTTGGCAAAGCCTGACAATGCGCGCAGTGCGCCTGAGGCGCTATCTTAGCTTCGGACTTCATGGCGTCACAAGCCAATCACCCCCTCCCTGCCGGATTCCAACTGGAAGACTACCGCATCGAAAAGCAGATTTCGGTCGGCGGTTTTTCCATTGTTTACCTGGCCCACGATGCCAGCGGCAAGGCGGTGGCCATCAAGGAATACCTGCCGGCCAGCCTGGCCCTGCGCTCCGAGGGGCAGACCAAGCCGGTCATTTCCCAGGAGCATCTTTCTGCCTTCCGCTACGGCATGAAATGCTTTTTCGAGGAAGGCCGGGCCCTGGCCAAGCTGAACCATCCCAACGTGATCCAGGTGCTGAACTTTTTCCGCGCCAACGACACGGTTTATATGGTCATGGAATACGAGCGGGGGCGCACCCTGCAGGAATTCATCCAGAAGCACCACGGTCACATCCACGAGAAATTCATCCGCGGCGTGTTCACCCGCATGCTCAACGGCCTGCGCGAAGTGCACACCCACAAGCTGCTGCACCTGGACCTGAAACCGTCCAACATCTACCTGCGGGCCGACAATACGCCGGTGCTGATCGACTTCGGCGCCGCCCGCCAGACCCTGCATTCCGACACCCCCATGCTGAAACCCATGTACACCCCGGGTTTCGCCTCCCCCGAGCACTACTTCAAGCGGGACGAACTGGGGCCCTGGAGCGACATCTATTCGGTGGGCGCCTCCATGTACTCCTGTCTGGCCGGGGCGGCGCCCCAGGCGGCCGATGCGCGCATGGAGAAGGATCAGCTGCAGCCGGCCTCGGTGCGCTGGGAGGGCCAGTATTCGGACCAGCTGCTGGAGACCATCGACTGGTGCCTGTGCCTCAACCACCTGTACCGTCCCCAGAGCGTCTTCGCCCTGCAGAAGGCCCTCACCGAGGCGGTGGACATGCCGGGTCAGGGAGCCAGCAAGGCGGCGGAAAAGGAAGGATGGCTGGGCCATCTGGTGGGCAAGATCAAGGGAATGACTGCTAAATGAAATTCACCATCTACCAGGAAAGCCGCATCGGCAAGCGGCAGAACAACGAGGACCGGATCGCCTACTGCTACTCGCGGGAGGCGGTGCTGATGGTGGTGGCCGATGGCATGGGCGGCCATTACCACGGCGAGGTGGCCTCCCAGATCGCGGTGCAGACCCTGACCTCGGCCTTCCAGCGGGATGCCCAGCCGGAGATCGCCGATCCCTTCCTCTTCCTGCAGAAGGGCATGACCAATGCCCACCACGCCATCCTGGACTATTCCCAGGAGCACCGGCTGAAGGATTCGCCGCGCACCACCTGCGTCGCCTGCCTGATCCAGGACAACATCGCCTACTGGGCCCACGTCGGCGATTCCCGCCTCTACCACATGCGCGACGGCAAGGTGCTGGCGGTGACCCGGGACCATTCCCGGGTGCGCCTGCTGATGGACGAGGGCCTCATCAGCGAGGCCCAGGCCGCCACCCACCCGGACCGCAACAAGGTGTACAGCTGCCTGGGGGGCGAAAACCCGCCGGAAATCGAGTTCTCCCGCAAGACCCCCCTGGAAGTGGGGGATGTCCTGGTGCTGTGCACCGACGGCCTGTGGGGGCCGCTGCCGGCCGATGTCATGGCCGCCTCCCTGAAGGGGGCCAACCTGATGCAGGCCGTGCCCATGCTGCTCAACCAGGCGGAAATCCGCTCCGGCCCCTACGGCGACAATCTTTCCGTGGTGGCGGTGCGCTGGGAGCAGAGCTACAGCGAGGAGGCCTCCAGCACGGTGATGACCCAGACCATGCCCCTGGACGCGGTGACCACCAAGCTCGGCGAATTCGGTCGGGACCCGGCCTACAAGACCGATCTTTCCGACGACGAGATCGAAAAGGCCATCGACGAAATCCGCGCCGCCATCCAGAAATTCTCCAAATAAGGAAGTTCCATGCGTCCCAGCCAACGTGCCGCCGACCAGCTGCGCCAAGTCCGCATCACCCGCCGTTTCACCCGCCATGCCGAAGGTTCGGTGCTGGTGGAAATGGGCGACACCAAGGTGCTGTGCACCGCCAGCATCGAGGAAAACCTGCCGCCCTTCCTGCGCGGCAAGGGCCAGGGCTGGGTCACCGCCGAATACGGCATGCTGCCCCGCTCCACCCACACCCGCAGTTCCCGGGAAGCGGCCAAGGGCAAGCAGACCGGCCGCACCCAGGAAATCCAGCGCCTCATCGGCCGTTCCCTGCGCGCCGTCACCGATCTCAAGGCCCTGGGCGAGCGCCAGATCACCCTGGACTGCGACGTGCTGCAGGCCGACGGCGGCACCCGCTGTGCCTCCATCACCGGCGCCTGGGTGGCCCTGTGGGACGCCTGCCAGTCCCTGGTGGCCGCCGGCAAGCTGAGCGAGAACCCCCTCAAGGAACACGTGGCCGCCATCTCCGTCGGCATCTACAAGGGCACCCCGGTGCTGGACCTGGACTACCCGGAAGATTCCGACTGCGATACCGACATGAACGTGATCATGACCGGCAGCGGCGGACTGGTGGAAGTTCAGGGCACGGCCGAAGGCGAGCCCTTCTCCCGGCAGCAGATGAATGTGCTGCTGGACCTGGCCGAAGCCGGCATCCGCCAGCTCATCCACGCCCAGGAAACCGCCCTGGCGGATTGATTCGGAGCCCGTCATGGCCCAGGAAACCTCCCGCGACCCGATCAAGGCCCTGCTCGACGATCTGGAACAGTCCATCGCCGATTTCGATCAGCGCCTGGGTGGCGTCGAGGAGTCTCCTGCCGTGACCGGTCTGCGTTCTTCCGGGCAGCGCTATCCCGACATCGAACCCGAGGCCAGGCGTCAACTGTCTCCTGCCGCTCCTGTTGCCGTTGCCGGCAATGCCGACGCAACCGCTGTGTCCGAAGCGCCGGCGGTGGACCTGCTGGCCGAACTGGCCCAGGCGGCGGCCTGCCGCAGCGTGGATGATGCGGAGACCCAGCGTCGCCAGCTGGAACTGACCGAGCGCCTGCACCAGGACCTGAAGACCGTCTTCGACTACCTCAACCAGCTCATCCGCCACGCCAACACCCTGAAGCCGGTGCTGCCCCGCAGCTACCGGCTGGATGCGCGCAACAGCTTCGACGGGCTGGCCTGGCATGACGGATTCGTCGATTACCGCAGCACCAGTCGCTTCGACCGCAGCTACTACGAGCAGATTCTCTTCCAGGTGAGCTACCGGGCGCCGGCGCCGCTGGTCGCGGTCTGCGCTGCGGACCAGGCCGCCATCGTGCGCAAGGAGCTGGAACTGGTGAACCTGCGCATCCAGCGAGAAGAGCCGGTGATGCTGCCGGAGGGCGGCCCCGGGGTGCGCTATGTGCTGCCGGATGCCATTCCGCTGCATCTGGCGGTACAGGCGGACTTCGCCAACGATGCCCTGACCTTCCGCTGCCGCAATGCCGGCAATTTCGGCCCTACTGCCTACCGTCTGCCGGGCGGGAGCATCACCCGGCCCCTGCTCGACGGCATCGGCCTGGTGCTGCTGGGCCGCAGCGACACCATGCCCAAGGAACTGCAACGCATTCCCTACCAACGGATCAACTGAGTCCCATGCAAAAGATCGTCCTCGCCTCCAACAATGCCAAGAAGCTCAAGGAACTGTCAGCCCTGCTGACACCCCTGGGCATCCAGCTCATTCCCCAGGGCGAGCTGGGGGTGCCGGAGGCGGAGGAGCCCCACCACACCTTCCTCGAAAACGCCCTGGCCAAGGCCCGCCATGCGGCCCAGCTGACCGGCTTGCCGGCCCTGGCCGACGACTCCGGCCTGTGCGTCAAGGCGTTGGGCGGCGCTCCCGGAGTGCAGTCGGCCCGCTACGCCGGCGAGCCCAAGTCCGATGCCCGCAACAACGAGAAGCTGCTGGCGGCCCTCACTGGCGTGGCCGACCGCCGTGCCCACTTCGTCTCACTGCTGGTGCTGGTGCGCCACGGCGACGACCCCCAGCCCCTGGTGGCCGAGGGCGAGTGGCACGGCGAGATCATCGACCAGTACCGGGGCGAGGGAGGCTTCGGCTACGACCCCCTGTTCTACGTGCCAGCGGAAAAGGCGACGGCGGCCGAACTCTCCGCCGAGGTGAAGAACCGTCTCTCCCATCGTGGCCAGGCCATGGCCCGGCTGCTGGAACGCCTCAAGCTGGAACTGTGAGCCCGGCCGGCGCCGGGAAACCTTCTGGCGCTGCCGCGGTCCAGGGCAGGGTGTTCCGGACTTGCCGCTCCGCAGTGGCCTGACTCAAAGCGCCATTATTCAGACATCGCATCATGCCGCCTGCGGGCGGAAAGGTTTTCTTTCGTGTCGTCCCGTTCTTCCTCCCGCATCATTCCCATCGCCGTCGCCGGCGGCACCCGCGCTGGCGGCAGTCCCCTGCACTTCACCAGTCCTCCGCCCCTCTCGCTCTACATCCACGTGCCCTGGTGCGTGAGGAAGTGCCCCTATTGTGATTTCAATTCCCATGAGGCGCGGGCGGAGAACGACGAGGCCGCCTATGTGGCAGCCCTCGTTGCCGATCTGGAAAGCGCCCTGCCGTCGGTGTGGGGACGCAAGGTATCCACCATTTTCATCGGCGGTGGCACGCCAAGCCTGCTCTCCGGCGAGGCCCTGCACGAACTGCTGAATGCGGTGCGCATGCGTCTGCCCCTGCTGCCCGAAGCGGAGGTGACCCTGGAGGCCAACCCGGGCACCGCCGAGGCGGGCAAGTTCGCCGCCTTCCGGGCCGCCGGAGTGAATCGTCTGTCCCTCGGCATCCAGAGCTTCAACGACCGGCACCTGGAGGCTCTGGGCCGCATCCATGACAGCGCTGAGGCCAGGGCCGCCATCGAGTTGGCCAAAGCCCACTTCGAGCGCTTCAACCTGGACCTGATGTACGGCCTGCCCCAGCAGTCCCAGGCCGAAGCGATGGCAGACCTGGAGATGGCCCTCTCTTTCGCGCCGCCCCATCTTTCCTGCTACCAGCTGACCCTGGAGCCCAACACCCTCTTTGCCGCCCGGCCGCCCCAGCTGCCCGAGGGCGACACCTGCGCCGACATGCAGGACGCCATCGAGGCCCGCCTGGCTGCCGCCGGCTACGTGCATTACGAAACTTCGGCCTTCGCCCGGCCCGACTACCAGTGCCGGCACAACCTCAACTACTGGACCTTCGGCGACTACCTGGGCATCGGCGCCGGGGCCCACGGCAAGCTGACCCTGCCGGACCACAGCGGCTTCTCGGTGCAACGCCAGATGCGCTGGAAACAGCCCAAGCAGTACCTGGAGCAGGTGGCCGCCGGCCAGCCGGTACAGGAGCAGCACGGCGTGGGGGCGGACGAGCTGCCCTTCGAATTCCTCATGAACGCCCTGCGCCTCAACCAGGGCTTCGATCCGGCCCTCTTCGAGCAGCGCACCGGCCTGCCCCTGCTGCTGGTGCGGGGCGAGCTGGAAAAGGCGGCCCGGGAAGGGCTGCTGACCCTGGCACCGGACTGCATCGCACCCACCGAGCGGGGCCGGCGCTTCCTCAACGCCCTGCTGGAACGCTTCCTGCCGGATGCCTGAATAGGCTATGCCGCAGAAGTAGGAAGGACGCCATTCTTCACCACGGCACGGTGAAGAATGGCGTCCTTCTTGTCTTCTGGCTGAGGTGCCTTAGAAACCCTGGGTATAGGTCAGGCGCCAGATGCGGGGTTCTGCCGGGTGGCTGTGCACATCCTCCACTTTCGCTGCTTCGCCGCGCAGTTGGGACTCGTAGTAGTAGTCGATGTCGCTGGCCTTGCGGTTGAACAGGTTGAGCACTTCCAGGGTCAGCTGGCTCTTGGCGGCCAGCTTGTAGCCCACATTGAGATTGACCATCACCGAGCTGCCGGAGCGCACAGAGTCGTCTTCTTTGAGGGCCCGGGGGCCCAGGTAGCGCAGGCGCAGGCCGCCGCGCCAGGGGCCCAGGTCGTGCACGGCGACACCGACGGAGGCGGTGCGTTCGACGGCGCCGGGGACGTGGTTGCCAACGCTACTGTCATCGCGGAAGCGGGCCTTGGAGAGGGCGATGTCCGCATCCAGGGTCAGCCAGTCTCGGGGCGTCCAGTAGTTGGACCATTCCATGCCCTGGCGGTGGCTGGGGCGGCTGGCCTGGGTGGTGCCGGCGTCGCCGACGAAGAGCAGTTCCGAATCCAGGTCCAGGCGCCACAGGGCGACGCTGGTGTTCCAGCCGGGGGCCGGGGCGCTGCGCCAGCCCACTTCCTGGCCCCGGGATTTGACCAGGGCCGGCACCCGGGACATGGGGTCGCCCGGGTTGGAGGGATCGACGCGGATGGTGGTGCCGCGGGCGTCGTTGCTGTGGAAGCCCTGGCCCCAGTTGTAATAGAACTCCTGGTTGGCGAAGGGGCCGAAAATGAGGGACAGCTTGGGGCTGGTGATGCCGTCGTTTTCCTTGCCGGAATTGGCGGCCAGGCTGGAATCCACCTTGAAGCGGTAGCGGTCGTGGCGCAGGCCGGCGACGCTGCGCAGCCAGTCGCTCCACTGGGCGCCCCACTGGCCGTAGAGGCCCAGGCTGCCCTGGTTCACGCTGTCGCTGCGCACGGTGGACAGGCGCTGGCGGGCGGCGGTGCGGTACAGGCCCACGTTGTCGATGTCGTCCTGGCGCCCCTGCACGCCCCAGGTGAAGTCCCCTTCCTTGCCCAGCCACTGCACCGGCTGGCTGCGGCTCCAGCCGAAGCCGCCGTAACGGCGCCGGTCGGCCTGCTCGAACTGGTCGCCGTTGACCGGATCGTCCATGGCGTAGGTGAAGTTGGAGAAGAGGTTGAGCCGGTAGTCCACCAGATAGGTGTTGGCCCGGGTCTGCACCGCCCCGTCCTGCCGCGCCCACTGGCCGGAGAGGGAGAGGCGCCGGGTGTTGCCGCCGGCGGTGGGGTCCAGGCTGCCGTAGCGGTTCACCAGGCCCTGGTCCACGGCCCGTCGCGCCAGCTGGTCGGTGGAGGTCCAGTCGCCGTCGTAGGCCATGAAGGCCAGCGAGTGGCCGTTGTTGCGCGTGCCTTCCGAATAGCGCAGCACGCCGTTGAGGCGCTTGTAGTGCTCCGGTACTTCCCAGGGACCATCGTTGTGGAAGACCTCCACGGCGCCCAGCCAGCGGCCGCCGCCGGCGGTCTCCTTGTCGGCGGCGGTGAGGAGGCGGCGGTAGCCGTTGCTGCCGAGGCCGATGCTGACGTAGTCCTCCGGCAGGGCGCGGCGGTAGTCGATACGGGCGCTGCCGGCGGAGGAGAAGTCCCCGTCCTCGGCGGCGTAGGGGCCCTTCTTGTACTGGATGCGCTCCACCAGCTCGGGGATGAGGAAGTTGAGGTCCAGGTAGCCGTGGCCGTGGGCGTGGGTCGGCAGGTTGATGGGCATGCCGTCGATGGTGACGGAGAAGTCGGTGCCGTGGTCCAGGTTGAAGCCGCGCAGGAAGTACTGGTTGGCTTTGCCGTCGCCGGCGTGCTGGGTGACGATGAGGCCGGGCACGGTTTCCAGCACTTCCGCCGGGCGCAGCAGGGGGCGGTTCTCCAGCTGCTTCGCTGTCACCGTGCCGACGCTGGCGGCGTCGGCGACGCCGATCAGGTCCTGGGCGCCGGCCTTCACTTCGATCACGTCTTATGTAATCCAATTCAGTGTTGCGTATAGCCAAGTTGCTTCTGCTGAGCGGGATTACATCAGGGGATACGGTTTTCTAGTAGCCTAAAGAGAAAAACGAACAAACCGAAAAATTCTGACTATTGTCAGCTGACACTATCGGAATGATGTGGTTATGTGGCGCCCTTCTAATAAGGCCCTGCATAGCCAATGGCACATGATACCAATCCTACGTTGCGTGAAGCCCTTTTCATGTACTGCGAGCGCATCAGTATCCATAAGAAGGGCCACGCACAAGAGAAATATCGAATTAACCTATATTGTCGTTATTCCATTGCTGATCTTCCGATTCGCAATATAACGTCAGTCGATGTGGCGACATTTAGGGATGAGCGATTAGCGGAGATTAACGCACGAACGGGTAGGGCACTTTCCCCTGCTACGGTTCGGCTGGATCTGGCGCTGCTTTCCGACCTGTTTCGCATTGCGAAGAACGAATGGGGTATATGCAACGATAACCCTGTCGCCAACGTCCGTAAGCCAAAACTTCCGCCTGGCCGTGATCGACGCTTGGCTCCTCGTGAAGAACGGATGATCATGAGGCATTGTTCCCAGCGGGGCGCGCATGAGATGAAGGCCATTGTCCAATTGGCATTAGAAACTGCTATGCGTCAGGGGGAGATTCTGGGGGTGTGCTGGGAGCACATCAATCTGAAATCCAGAATTGTTCATCTGCCCGACACCAAGAATGGTTCCAAACGTGATATCCCGTTAAGCATGGAGGCTAGGGATATCCTGGCGGCCCAGAGGGTGAAGCTGTCGGGGCGAGTCTTTAGCTATACGAACAACGGATTGAAGAGCAGTTGGCGAAGCATGATCAAGAGGCTGAATATTCCTGATCTGCATTTCCACGATCTTCGGCACGAAGCAATCTCTCGCTTGATGGAACGAGGTGTCTTCAACCTGATGGAAGTTGCTGCCATCAGCGGACACAAGAGCCTGTCCATGCTGAAGCGATATACGCATCTTCGTGCTCAGCGTTTGGTGCGTAAGCTCGACGCTGGCGCAAACAAGGGGAAGGCTGCGGTCTTGAGCTACCTGGTTCCTTATCCAGCCTTCATCGAGCCCTATGAGAGCCAGGTAAAAGTAACCTTCCCGGACTTTGACGATCTGCATGTGGCAGGGCCATGTCTAAACAGTGCAGTACAGCAAGCTCAGGATGCCCTATTACGGGAAATTTTGGTCTTGATGCGTCAAGGTCGGCCGATCCCGCCGCCAAACAACTACCTAGAACTCCTCGATGAATCCAGGCTCTTTCACCTGGACCCGTTGGCAACCTATGATTCCCTCGCGGATCTTGCCGAGGGCGCGCTGGTTTGAGTTCTGATGACCTTGCGAGGGAGGAGGAGTCCCTACTGGTGTGGATGAGCGGGTTGAAAGCAGGACCTTATCTAACTACCGCTTGAATTGGGGTGACAGTCATCTAGTGGCAAATCCACAAAAAAGTTGCGCTTGAAGTGAACAACTAAGTTCGTGCATACTGAGAATATGTATCAATCTCTTCGCCACCGTTTTCGTGCATATGTGCAGCACCCTATCGGGCGCTGCGCGGTGGTATTGATGCTGTTTGCCTTGGTGGCAGCCAGTGTTCCGTTTGGTGAGATTCATGCCCATGCGGATGGTGATCACGATCATGATCACGGTTACGTCACTGCTGAATTGACGAAGGCATCGCTCTCCGATCCTTCAGACTCTATGGACTCCGATTCTGATTCGACCGGAGCCAAAGTGCTGCATGCACACGGTTCCGTTGTCACTCCTCCGCCCTTGCCGGTGGATGGACTGGGAATCGAGCCATTCATCTTTCCCGCCCGGGACAAGATTACCCTCGCCTACTTGTCGCGGCCTTCTGCGACACCACTTCCCCCCTATCGCCCTCCAATCGCCTGACGCCTAGCGCCGCTTTTTAGCGGCTTGCTTTGTGTCGTCTGTCGATATTGGAGGTTTTCCGTGAACTTGCGTTGTTTGCTGGTTCTGGCTGTAGCCGGAACCTGCGGTATCCCCTTGTCGGGGTATGCCGCTGAATCCTTGCGCCTGGAGGAAGCAGTCTCCCGCGCCTTGGCATCCCACCCCTCACTTGCGGCCGAGGCCGCGCAATTGAAAGCCGTTCAGGCACGCGCTCAGCGTGAAGGCCTGGCAACGCCCTTTATGATCGGCGCCGATGTGGAAAACGTCGGCGGTACTGGAGCCTTTCGGGGGGGGCAATCAGCTGAAACCACGCTACGTATTGGCCGCGTCATTGAACTGGGCGGTAAACGTGAAGCGCGCCAGGCATTGGGTAGCGCTGAAATCAATCAGCAACAGAACCTGTCCGAGGCAACCCGCCTGGATGTCATCAGCCGCACCTCACTCCGCTTCATTTCAGTGCTGGCTGACCAGCAACGGCTGAAATACGCTCAAGAGCAGGTAGGACAAGCCGAGCGCACACGCCGCGAGGTCGCCAATTGGGTAGCCGCTGCCCGCAACCCGGAGTCAGATTTGCGTGCGGCTGAAATCGCCGTTGCTGACGCCGAGCTGGAGCGCACCCGGGCCGAGCACAAGCTGACCTCTGCCAGGTTAACCCTGGCCTCCAGCTGGGGGGTCTTAACACCCGATTTTGAGACGGCTGCAGGCAATCTGCTCGTGCTGCCCAAAGCGGAGTCGCTGGATACCTTGGTGGCTCGTCTGCCGATGACACCAGAGCAACGTGCCGCATTGCTCGAGGCGGATAGTATCGCTGCTCGCAAGCGCTTAGCCGAGGCCGGCGCCAAGCCGGACGTTACCGTCAATCTGGGTGTGCGTCGCCTTGAGGCAACCAGCGATCAGGCATTGATGATGTCGGTATCGATTCCACTCGGCAACCAGGTTCGCTCGGGACTGTCCGTCGCCGAAGCCAATGCGCAACTGATGGCACTGGAAGCTCGCCGCGATGCTCAGCGTTTCGAGCACTACCAGTCGCTGTTTGGAAAGTATCAAGAACTCAATCAGGCCCGTACTGAAGCTGAAACGCTGCAAAAGCACATGCTTCCCAAGGCCGAGGAGGCACTGGCCTTCACCCGGCGCGGCTTCGAAGCCGGCCGCTTCTCCTTTCTTGCCCTGGCACAAGCGCAAAAAACCCTATTCGAACTGCGCCAACGCGCTGTCGATGCTGCTGCTCGCTGCCAGATCCTGATGACCGAGGTGGAACGCCTCACCGCCATTGCCCCGGAACCCACGCCATGAACCGACTATTACCCCTGATTCTCGTGCCTCTGCTGCTGACAGCTTGCGGCAACGATACCCCTCCCTCCGCTGTGGTTGCTGCGGAAAAAGCCAGTGCTGCCGAAGAGTACGAGCGTGGCCCCCATCGCGGCCGGATGCTGCGCCAGGGTGACTTCGCTCTCGAAGTGACCATCTATGAAACCAATGTGCCGCCGCAGTATCGGCTGTATGCCTACCAGAACGGCAAGCCTTTGCCGCCGGCCAGCGTGCAAGCCGCAATCCAGCTCAAGCGCCTGGATGGCGAATTCAACAATTTCACCTTCACGCCGGAAAAAGACTACCTGAACGGCAGCAGTGAAGTCATTGAGCCCCATTCATTCGATGTCGAGGTCAAGGCCCAGCATGCCGGCCAATCCTACAGCTGGGCGTTCCCCTCGTATGAGGGGCGCACCACGATTCCGGCGGCTGCCGCAAACGACGCAGGGGTTAAGGTCGAGAAGGCCGGTCCGACAACAATCCGCAATACAGTGCGGCTGATGGGTGCTGTGATGGTCGATGCGAATCGGCGTGCCGAGATCAAGGCCCGCTTCCCGGGCATCGTACGCGCGGTCAATGTCCAGGAAGGGCAGCGTGTCAGTCGTGGCCAGACGCTGGTGGCGATTGAAGGTAACGACAGCATGCGGACCTATTCCGTTGTCGCACCGTTTGACGGCATCGTCTTGGCGCGCAATACCAACGTCGGCGACGTTGCCGGCAGCAACACCCTGGTTGAACTGGCGGATTTGTCCAGCGTCTGGGTGGAATTACGGGCTCTCGGTGGAGATGCGGAGAAGCTGTCCGTGGGCCAGGAGGTCGAGATTTCCTCGGCCACCGGTGGCAGCCGGGTCACCGGGAAAATCCAGACGCTGCTGCCCCTGGCCTCCGGGCAAAGCGTGGTGGCCCGTGCCAGCATTGCCAACCCTGAAGGGCGGTGGCGGCCGGGTATGGCGGTCTCTGCGGATGTCACCGTGGCGGCACGCCAAGTCCCGCTGGCGGTGAAGGAATCCGGCCTGCAACGCTTCCGTGATTTCACCGTCGTCTTTACCCAGGTAGGGGACACCTACGAGGTCCGCATGCTCGAGCTGGGTGAGCGTGATGGCCGCTACGCCGAAGTGCTGGGCGGGCTGAAGCAAGGTGCTACTTATGTAGCTGAGCAGAGCTTCCTCATCAAAGCCGACATAGAGAAGTCCGGCGCCAGCCACGATCACTAAGGGATTTGCCATGCTAGAACGAATGATTCGTGCGGCAATCGCACATCGCTGGCTGGTCCTGATACTGGTTCTGGGCACCTCCGCACTTGGTGTCTGGAGCTATGGTCGCCTGCCGATCGATGCCGTCCCCGACATTACCAATGTCCAGGTCCAGGTCAATTCCGAGGCCCCCGGCTATTCGCCGTTGGAGGCAGAGCAACGTGTCACCTTCCCGGTAGAAACCGCCCTGGCAGGTATGGCTCGCCTGAAGTACACCCGCTCGATTTCGCGCTATGGACTGTCCCAGGTCACCGTGGTGTTCGAGGACGGTACGGACATCTACTTTGCCCGACAGCAGGTGAGCGAACGTCTGCAACAGGCGTCTTCCCAATTGCCGGCTGGCGTCAAACCGACCTTGGGACCGGTGGCGACAGGGCTGGGTGAAATCTTCATGTATACGGTCGAGGCCACACCAGGGGCTACCAAGGCGGATGGCAAACCCTGGATGCCTACGGATTTGCGAACACTGCAGGATTGGGTGATTCGCCCTCAGCTGCGTAACCTGAAAGGTGTCACCGAGGTCAATACCATCGGCGGCAACGTGCAGCAGTTTCATGTCACCCCCGACCCGGCCAAGATGGTGGCCTACAAGTTAACCATTGATGACCTGCTGCAGGCCATTGAACGTAACAACGCCAATACGGGCGCCGGTTACATCGAACGGGGTGGTGAGCAGAACCTGATCCGCATTCCTGGGCAGGTGGGTGATGAGGCTGGTTTGCGAGAGATCGTGGTGGCAATGCGTGACGGGCTGCCCTTGCGAATTAGCGACATAGCTACGGTCCAGATCGGCTCGGAACTGCGCACCGGTGCCGCAACCAGGGATGGCCGGGAAGTGGTGCTGGGCACGGTATTCATGCTGATTGGTGAGAACAGCCGGGAAGTCGCCATGCGTGCAGCGACCCGCCTCAAGGAAATCGATGCTTCGCTACCGGAAGGGGTCAGTGCGCGTGCGGTTTATGACCGCACCCAACTGGTGGACCGCAGTATTGCCACGGTCCAGAAGAACCTCCTCGAGGGAGCCTTGCTGGTAATCGTGGTTCTTTTCCTGCTGCTGGGCAATATCCGTGCGGCACTGATCACGGCGGCCGTGATCCCGGTTGCCATGCTGATGACCATCACCGGCATGGTGCAGAACCGGGTATCGGCCAACCTGATGAGCCTTGGGGCCTTGGACTTCGGCCTGATCGTCGATGGCGCCGTGATCATCGTCGAGAATTGCCTGCGCCGCTTCGGTGAGCGGCAGCACGCCCTGGGCCGCTTACTGTCCATCGAGGAGCGCTTCCAACTAGCTGCGAAAGCAAGTGCCGAGGTAATCAAGCCTAGCCTGTTCGGTCTGTTCATCATTGCCGCTGTTTATCTGCCGATCTTTGCCCTCAGCGGGGTCGAGGGCAAGACTTTCCATCCCATGGCTATCACTGTGGTCATGGCGCTGGTTGCCGCAATGGTGTTATCCCTGACTTTCGTGCCGGCGGCCATCGCACAGTTCGTCACCGGCAAGGTCGAGGAAAAAGAAACCCGCCTGATGCAGCGGCTGCATGGGATTTACGCTCCTCTGCTGGAGAAGTCCCTATCGCTGCAAAAGCCGGTGATTGGCGCCGCTGCAGTGCTGGTGGTGCTGTGTGGATTGTTGGCGACTCGCCTGGGTACGGAGTTCATCCCCAACCTGGATGAGGGGGATATTGCCCTGCACGCCCTACGCATCCCGGGTACCAGCCTGACCCAGGCTATCGGTATGCAGGCCCAGCTCGAAGCACGGATCAAGCAGTTCCCGGAAGTAGACAAGGTGGTGGGCAAGCTCGGCACGGCAGAAGTGGCCACCGACCCGATGCCGCCTTCTGTGGCCGATACTTTCATTCTGCTCAAGGAACGCAAGGACTGGCCGGACCCGCGCAAGTCCAAGGCTACCCTGGTGGCGGAGCTGGAGGAAGCTGTTCGTGCCATCCCCGGCAACAACTACGAGTTTACCCAGCCGGTACAGATGCGGATGAACGAGTTGATTGCCGGTGTACGTGCGGAAGTGGCGATCAAGGTATTCGGCGATGACCTGCAAGCGCTGACCGCGGTTGGCAAACAGATCGAGAAAGTCGCAGGCAGCATTTCCGGAAGTGCCGACGTGAAACTTGAGCAGGTGACCGGCCTGCCGCTGCTGGTCATCAAGCCGGATCGTGCCGCCCTGGCCCGCTACGGCCTGGCCGTGGCCGACATCCAGGACACCGTATCCGCGGCGATGGGTGGGGCAACGGCTGGCCAGCTTTTCGAGGGGGATCGCCGTTTCGATATCGTGGTGCGTCTCCCCGATGCCCAGCGCCAGGACCCGAAGGCACTGGCAGCGCTGCCCATTGCCCTGCCGGCGACAAGCAGAGCCGATGGAGCTTCGTTGTCGCGGATGCCCGGCGTGGTGCCCTTGAGTGCCGTGGCCACTATTGCGGTAGAGCTAGGGCCCAACCAGGTCAGTCGGGAAAACGGTAAGCGGCGCGTGGTCATCACGTCGAACGTGCGCGGCCGGGACCTCGGCTCCTTCGTGGAGGAACTCCGGGGGAAAGTTGCGGCGGAAGTCGTGCTGCCTGTCGGAAGCTGGGTCGAATACGGCGGCACCTTCGAACAGCTGATCTCGGCCGGCCAGCGTCTGAGCGTCGTGGTTCCCGTGGTCCTGGTCATGATTTTTGGCTTGCTGTTCATGGCCTTCGGATCGGCCAAGGATGCCGCAATCGTGTTCAGCGGCGTACCCCTGGCGCTGACCGGTGGCGTACTGGCCCTGTGGCTGCGCGGTATTCCCTTCTCCATCTCAGCCGGGGTCGGATTCATTGCGTTGTCTGGCGTTGCGGTACTCAACGGCTTGGTGATGATCACCTTCATACGGAAGCTGCGTGAGCTTGGGCAACCGCTACATACCGCTGTGACCGAGGGGGCGCTGACCCGTCTGCGCCCCGTGCTGATGACCGCACTGGTTGCCAGTCTTGGCTTCGTCCCCATGGCCCTCAATGTCGGTACAGGTGCTGAAGTGCAGCGCCCACTGGCAACCGTGGTGATCGGCGGCATCATCTCTTCGACCCTGCTGACCCTCTTGGTGCTCCCGGTGCTGTACCGGCTGATACACCGGAATGAGAACGAGGAGACAGCCGCGTGACCCCTTCCCCCATTCTATTCAACAAGGAGTTTCCTTTGCCATTTCGCCAGTTCAGCGCCACCGGCATTTGCCGGTGGACCGTGGCGCTCCTCGCGTCACTGCTGCCCCTGTGGGCTTTCGCCCACGGGGTCACGGGAGAGGATCAGTCCTTTCTCGAGCAGAACACCGGCCGCAACCTGCTGTTGTTCGCCTACCTGGGAGCCAAGCACATGGTCACCGGGTATGACCATCTGTTGTTCCTGTTTGGTGTGGTGTTCTTTCTGTACCGCATGCGCGACGTCAGCATTTACGTGACCCTGTTCGCCGTCGGACACAGCGTGACCCTGCTGCTGGGGGTGCTGGGCGGTTTCCACGTCAATCCCTATGTCGTCGACGCAATCATCGGCGTCTCCGTGGTTTACAAGGCGCTGGACAACCTGGGGGCATTCAAGCACTGGTTGGGATTCCAGCCCAATACCAAGGCGGCCGTACTGGTCTTCGGCTTTTTCCACGGTTTCGGCCTGGCCACCAAGCTGCAGGACTTCTCGTTGTCCCGCGATGGTCTGGTGCCGAACATGCTGGCCTTCAACGTTGGCGTAGAACTTGGCCAATTGCTGGCACTGGCTGGAATTCTGATCGTCATGGGGTTCTGGCGCCGCAGCACAGCCTTCTCCCGGCAAGCATTCACCGCCAATACCGCACTCATGGCTGCTGGCTTTGTCTTGGTCGGCTACCAACTTACCGGCTATTTCGTTTCCTGATCGAGGTCTTCCTATGTCCAATACTCAAACCCACTCCCTGCCCAGTAGTGCCAGTCTGTTCAAGGCAACCGCGGTAGCCGCAGGTGTTGCCGCCACCTTGCTGGTGACCATGGTGCTTCCTGCGGAATATGGGATGGACCCTACCGGCATTGGCCGTTTCCTCGGCCTCGATGCCCTTAAACAGTCTGCCGGTGCTGAAACAACATCTGTTCTGGCAACCCCGGATGCTATTGCTGGCCCCAATGCAATGCTTGCCGCCAAAGCAGATGCTGCTTTCGGAAAGCAAGCCGGCAGGTCCTTGGATGCCTCCGCCGTTTCATTGGCAGGTGATGGCCCCATGCGTCGCAACACATTCACGGTAACGCTGGCTCCCGGCAAAGGCGCAGAGGTCAAAGCGCACCTCCGGGCTGGTGAAGGCCTGACCTTCCACTGGCAAGCAACCGCCGCGGTGGCCGTGGATATGCACGGCGAAGCACCGAATGCCAAAAATGCCTGGACCAGCTATTCGGTCGAAAGTGCTCAAAAGAGTGCATCCGGCACTTTTGTTGCCCCCTTCGAAGGAAGTCACGGTTGGTATTGGCAAAACCGCGGCACCGAGCCGGTGACGGTATCCATCGAAGCCTCCGGTTTCCAATCCGAGTTGTATCGGCCGTAACGAAGCTTTCTTACACGGCCCCGTTGTATTAACCCCGCGGCTTGCCGCTTTCACATTGGAGATGTACCCCGTGAAAAACAAAACCTTCCTGTCCCTCTCTTTGCTGGTCGGGTCCTTCATGTCGCTTTCCAGCGTTGCCTATGCCCACGGTGTCCACGAAGACAGTGCCGAACCAAAGGCCACGCCCACTGCTTGCCGGCACCTCACCGACACCGAGCATTACGTGGTGGATCTAAAGGACCCCGCAACCCGGGCGCTCAAGACCCGTTGCGATGCCACCAAGAAGCCTGTAACCCCGGTGGCCGAGAAGAAGGACGAAACACCGGATAAGAAGTAACCCCTCCTGATAGTTCATTGTCGCCCGTAGAGACATAGGAAATAGCCATGCTTGAAATCCTCCGACATCGCAGTTTTAGGCATTTGTTTCTCGCCCAAGTCGTTGCATTGGTGGGGACGGGGCTTTTGACCGTGGCCCTGGCATTGCTGGCCTATGATCTGGCAGGCGCCAATGCCGGTGCGGTACTGGGTACCGCACTGGCCATCAAAATGATCGTCTACGTCACGCTTTCGCCTGTAGCGGGGGCTGTTGTCCCTGCGGCATGGCGAAAGCGTGTCTTGGTCGGCCTAGATTTGATTCGAGCGGCGGTGGCATTGCTGCTGCCGTTCGTCACCGAAATCTGGCAGGTCTATGTGCTGATTGCGCTGCTGCAATCAGCCTCAGCCTGCTTTACCCCGCTTTTTCAGTCGCTTATTCCCCAGATTCTGCCGGAGGAAAGCGACTACACCCGCGCGCTCTCCCTGTCGCGGCTGGCCTATGACCTGGAAAGCCTGCTTAGTCCGGCCCTGGCAGCGGCATTGTTGGTGGTCATCAGTTTTCACGGGCTGTTTGCCGGCACCTCCGTCGGCTTTGTTCTATCTGCACTGTTGGTCATGAGCACGGCATTTCCCGTCGTGCCAGAAACCCGTTTGGGAGATGGCCCCTACAGTCGAGCCCTCCGAGGCATGCGGATTTATCTACACACACCGCGACTACGCGGACTTCTGGCGTTGAACCTATGTGCCGCAAGTGGGGCCAGCATGGTTTTCGTGAATACCGTAGTCCTTGTCCGCGAGGTGCTGGGAGGCGGTGAACGGGAGGTGGCATGGGCTCTGGCAGCTTTCGGTGCCGGGTCCATGGCCGTGGCCTTTTCCTTGCCAACATTGCTCGACCGTATGGCGGATCGTCGAATCATGCTGAGCGCGGCATCAGCAATGGTCGTTGTGCTACTGGCGGTAACGGGGGTTTGGTGGAGTACCGGGGGTTTGGGCTGGGCAAGTCTCATTCCCGCATGGGTGGTTCTGGGTATGTCCTACGCAGGCCTGGTAACACCCGGGGGACGACTATTGCGGCGCTCGGCCCAATCGGACGATCTACCCTTTTTGTTTGCCGCCCAGTTTTCACTCTCACACCTGTGTTGGCTTCTGGCTTATCCACTGGCGGGATGGCTTGGAGCGCGGCTGGGATTCGGTGTTGCCCTCTCCGCCCTTAGCGCCATGGCAGCAGTGGGAGGGGCGCTTGCCTGGCGCACTTGGCCAAGGCAGGACCCTGATGTGATTGCCCACCATCATGACGATCTCTCTACCGACCACCCTCATTGGAACGAGTACGCGCTTGGTGGCGGAGGTCGGACTCATGAGCACCGTTTTGTCATCGATGAACTGCATCAGCGATGGCCCCACTAAGCCGGCCACTACGCGATGGTACTGAGTGATTTTCTTCGAGTAGCAGATAGCCACCTGATTGTTGGTCGGGGACTCTTTCACATGCTCAGCCACATGTGGCTCCACATCGAGTTCCGGCTATTTGCCGTCGGAATTGGCGTGCTACTGGTGTTGCTGTTGGTACTGATTTTTATGAGCTGGGAAGAGGAGCGGTGGATTCGTGCTGTAAGGATTTTTGATTTTTTATTTCGAAAACGGAAATAAAAAAGCTCATTAATGGGGTGGTATTCGCATGCCAAAGATGGAGCTTCCACAGCAGGTGGCAAAACTAGATACAACGCCGACATCCGTAGTTGGTCAGTCCCGTATGCAGACGGCCGACTTCTCGAAAATCCTCAATCAGGCGCTTTCCAGGAGCAACACGCCAGCTGACGTAGGCGTAACCGTGCATAGAGACGGAAACAAGAAACCGGGGGACTTCCAGCAAAAAATGCTGGGAATACGGGCCTATAGGCAACAGCTCATTGCTTCAAATATCGCGAATTCCGATACGCCAGGATATAGGGCTATGGATATCGATGTCGAAGATGCTGCCAAGCAGAACCAAATGGGGCTGTTGCCATTAGCAAAATCATCTCCCAGCCATATCAACGGGAGTGCTCATTGGTCCTCTCCGCCGTTCAACCTGAAATACCGCACACCATTTCAGGCCAGTGCAGACGCAAATACCGTAGAAATGGACATTGAGCGCCAGCATTTTGCTGAGAATGCTGTGATGTACCAATTCACCCTGGATCAGGTTGGTGGCGATTTCAAAGAGCTGACTGAGTTGTTTCGAAACCTAAAATAGTTCGTTCAAGCTAGCCTCATGCCAGCCAGGCTGCAAATCCAATGTAGCAACAAATTTTATTTGTAGAGTTATCAACAGCATTGTAGAGTACACTCATGACTACATGGATCGCACTCATTACCAGCCTTCCCACCGAGAATGCCACGGCCCGCATGCGTGCCTGGCGTAGCCTCAAGGCATCGGGTGCCGCCGTCCTCCGGGATGGGGTCTATCTGATGCCGGAGCGGGAGGATTGCCGGAACACACTTGATGCCGTAGCCGCAGATGTTCGTGCTGCAGAAGGTACAGCCCTGGTCGTCCGCCTCGAGGAGCCCAGCGATGGCAACTTTGTGGTCTTCTTCGACCGCAGCGCCGACTTTGCTACTCTGCTGGGGGAGATTGCCACGGCCCGAGACACGCTCGGTCCGGACACGGTAAACGAAGCTCTGAAGCAAGCCCGCAAGCTGCGCAAGGCGTTCTCCAACCTGGTAGCCATCGATTTTTTCCCTGGAGAAGCGCAAAAGCAGGCCGATGAGGCCTTACGTGACCTTGAGCAACGAGCAGCCTGGGCTCTTTCCCCCGATGAGCCGCACCCGGTCAACGACGCTATCTCCCGCTTAAGCATTCAGGACTATCAAAAACGTCGTTGGGCAACGCGACGGCGCCCCTGGGTGGACCGGCTGGCCAGTGCCTGGCTGATTCGCCGCTACATCGATCCCCAGGCCGAACTGCTCTGGCTGGCAACGCCGGCAGATTGTCCGGCCGAGGCTCTTGGTTTTGATTTCGATGGGGCGACGTTCACCCATGTCGGCGCCCGGGTGACCTTCGAAGTGCTGCTTGCCAGCTTCGGCCTGGAAACTCCGGCTCTGCAGCGCATCGGTACCTTGGTCCATTTCCTGGATGTGGGTGGCGTACAACCGCTAGAGGCGGTGGGCATCGAAAGCACCCTGGCCGGCCTACGCGACACCATTCTCGATGATGACCAACTCCTGGCATTGGCCGGCAGTATCTTTGACGGACTACTGGCCTCCTTTGAGAAAGGATCGAAATCATGAGTACGATCCTGACAGCCGCCGATTCCACCTCGCCAGAGTCCAAACCTGCCGAAGTCAGCTTCTGGCAGGCCTTCCTGTTCTGGCTGAAGCTCGGCTTCATCAGTTTTGGCGGGCCTGCCGGGCAGATCGCCATCATGCATCAGGAGCTGGTCGAGCGCCGGCGCTGGATTTCTGAACGCCGCTTTCTGCACGCCCTCAATTACTGCATGGTGCTCCCCGGTCCGGAGGCCCAGCAGTTGGCTACCTATATCGGTTGGTTGATGCACCGCACCTGGGGTGGCATCGTCGCCGGTGGGCTATTCGTGCTGCCGTCGCTGTTCATCCTGATTGGGCTGTCGTGGATCTATATCGCGTTCGGCAATGTGCCCCTGGTGGCCGGCCTGTTCTACGGCATCAAACCGGCGGTTACCGCCATTGTCGTCCAGGCGGCCCACCGCATCGGCTCCAGGGCCCTGAAGAACAATGCCCTCTGGGCCATCGCTGCAGCATCCTTTGTGGCCATATTTGCACTCAACGTGCCGTTCCCAGCCATCGTCGCGGCGGCTGCAGCCATCGGCTACTTTGGCGGCCGTGTCGCGCCGGACAAATTCAAGGCTGGTGGCGGCCACGGCAAAGCGGATAAGTCCTTCGGCCGAGCCCTGATCGACGACGATACGCCGACGCCGGTACATGCCCGGTTCTCCTGGGGCCAGTTGGCGAAAGTCGCGCTTATCGGTGGCTTGCTGTGGCTGGTCCCGATGGGGCTGCTGACCGCCAGCTACGGATGGAGTCATACCCTGACCCAGATGGGCTGGTTCTTCACCAAGGCCGCATTGCTGACCTTTGGTGGTGCTTACGCCGTACTGCCCTATGTTTACCAGGGGGCCGTCGGGAGCTATGGCTGGCTCACCGGTCCCCAGATGATTGATGGTCTGGCCCTTGGCGAAACAACACCGGGACCGCTCATCATGGTGGTGACCTTCGTCGGCTTCGTTGGCGGCTACGTGAAGGCCGTGTTCGGCCCGGATAGCCTCTTCCTGGCCGGTGCGGTGGCGGCCATGCTGGTCACCTGGTTCACCTTCCTGCCGTCCTTCGTCTTCATCCTGATGGGCGGTCCCTTCATCGAAACGACCCACAATGACCTGAAGTTCACGGCGCCGCTCACCGCCATCACGGCCGCCGTGGTCGGCGTTATCCTGAACCTGGCCCTGTTCTTCGGTTACCACGTGCTGTGGCCGAAGGGCTTCGACGGGGCGTTCGAGTGGGTATCGGCACTGATTGCCCTAGGGGCAGCCATTGCCTTGTTCCGCTTCAAGGCGAACGTCATCCATGTCATTGGTGGCTGCGCGGTCATCGGCTTCCTGGTGAAGATGTTCCTGTGAGCCTCGGCATGGGAACGGCACGGTGGGTAGGGGTCGCTGTGCTGGTGACGGCTATCAACGCTGCTGCTGCCGACAGAGGCTGGGTATTGCTCCAGGGCAAGGTGCTGGCCCAAGCCCTGTCCAACCAGGATTTCGGTGATGGCGTCCACTTTGCCTACCAATTTCTCAGTGGTGGGGAGTTGCGCGGCATGAATATGGGCAAGCCTGCCAGGGGAAATTGGCGGGTCATTGGCAATGAGCTTTGCTGGCACTGGACGAGGCCCAAAGAACCAGAGGAGTGTTACCAGGTACGCCAGCGAGGACAGGCTGTCCGTCTCTATTTGGATGGTCAGGAAGTGCTTTCCGGCAACCTGACCCCGTTACCCGCCAATCTGAAGGAGATGCCCCAATGAAATGGATCACGCGAGAAAGACCCAAGATTGATCGTATCGCCTGTCCCTGGCTGATAAGCCGGTTTGTCGACGAGAGTCCGGAATTCCTCTACGTCCCCGCAGGTGAAGTGATGCGCATTGCGGCCGAGACCGGTGCCACCCCCTACGACGTGCCCAACACCGAATTGGGCCACCATGGCGACCAGTGCAGCTTCGATGCCTTCATCGGAAAGTACAAGCTTGAGGATGCCGCACTCAACAAGCTGGCCCTCATCGTGCGCGGCGCCGACTGCGGTCAGCCACAACTGGCCAAGGAAGCGGCTGGTCTATTGGCCATTTCAAAGGGTCTGTCTCTGAATTTCAGCGACGATCACGAGATGCTGGCCCACGGCATGGTCATCTACGATGCGCTCTACGCCTGGTGCGCCGATACCCCGCTGAAGAAAATAGGCCGGTTTCTGGGGTTGAAGTGACTGGCCGCTGGTTGCTGCCTGAAGGTGTAGACCGGGTAGTGCTGCCGCTCCTGGTGGGCAAGGCCCTGCGGGCATTTGCCGATGGCTATGTGGCAGTTCTCCTTCCAGCCTATCTGCTGGCGCTCGGTTTCGGCACCCTGGATGTCGGCATCCTGAGTACAACGACCTTGCTGGGTTCGGCATTCGCCACCCTGGCGGTGGGGGCCTGGGGCCATCGCTTCCATCACCGGAACCTGCTGCTGGGCGCCGCGCTGCTGATGCTGGGGACCGGGCTCTCCTTTGCCTCTTTGTCAGCATTCCTGCCCCTGCTCCTGGTCGCTTTCGTCGGCACCCTCAATCCGAGTTCTGGGGATGTCAGCGTGTTTCTGCCCCTCGAACATGCCCGGTTGGCCGAATCGGGCCAGGGTACTGCCCGCACCACCTTGTTCGCCCGCTACTCCCTGCTTGGAGCCCTGTTTGCTGCGTTGGGGGCGCTGGCCTCAGGCATTCCCCAGCTACTGGTATCGGTGCTGGGGATCGAGCTGCTATCAGGGTTCCGGGTGATGTTCGTGCTCTACGGGCTGGTGGGTGGCACGGTATGGCTGCTGTATCGACGGATGCCGGCACCCCGGCGGGAGTGCGCGGTGGCCGCTCCGCAGGCGCTCGGCGAGTCGAAGGGCGTTGTTGTCCGGCTGGCGCTGCTGTTCTCCCTGGACTCCTTTGCGGGAGGGCTGGCCATCAATGCCTTGATGGCCCTTTGGTTCTTTCAGCGTTTTGAGTTGTCGCTGGCTGCCGCGGGGAGCTTCTTCTTCTGGGCTGGGCTGTTGTCCGCTGTGTCCCAGCTAATCGCACCGAAGGTCGCCGAGCGCATTGGCCTGGTGAATACAATGGTATTCACCCACATTCCCGCCAGCATCTGTCTCATCGCCGCGGCATTTGCCCCGGGTCTCGAGCTGGCATTCGCATTGTTGTTTATCCGGGCGTTGCTGTCTCAGATGGACGTACCGGTTCGAAGCGCTTTCGTAATGGCGGTGGTGACACCGGCCGAGCGTGCAGCTGCGGCAAGTTTTACTGCGGTCCCACGCAGTTTGGCTTCTGCCATCAGCCCAACGATTGGTGGGGCAATGTTTGCAGCGGGATGGCTTGCAGCGCCTCTGGTTGCCTGTGGAGCGTTGAAAATTTGCTATGACTTAATGCTTTGGAAAGCATTTCGACAACGAGACCCATAACGAAGTGGATTTGTCTCTCGGGCCTCAGAAGGAGCCGAAGTCGGCAATCAAAATTGGGCAGATGCAAGGGCCCTTATCGGAATTTCAACAATAGGCTTCATTGGACGGGCAACAGAATAACAAGGCTTTCTAGCTCCAAAGACTCTCGGTTGAACCATGTCACCTTTAGGTCACCGTTGTCTTGCAGTTCCAAGAAGAGGCCGGTTGCGACACGCGGACCATCAAGTTTTTCGTTCCGGATTGCTTCAATCAACCTTGGCATATGGGAAACGAAGAGCGCAATTTGATAAGCAAGTAGGTCGGCGTCGTCAACGGAAAGCGTAAGTCGAACACCGAAATAGTCGTTCGGAATGGGGAGAGCGACCACGATGTCGGTTCGCTCTGGAAGTTTCTTCTTTAGGCGAACAACCTTCTCCGCGAACAACTTCACCTTGTTTGCCGCGCCTACCACGAGCGCCGTCGCCTTTGCGCGGTTCTTCCACGATTCCTTTGCCGCTTCCTTCACGATCTCCGCCATATACAAAGCCGCGGCAGCCGAGAAAAGGCTAATCCACCAACTTGAATCGGCAATCAGGTGAATCCACGACGGTGGCTCAGCAGATAGGAGAGCAACGCGTCCATCTTCGATCTCAAATTCGAATTCTGGACCAACGTCTTCGCCAAATTCCTTGAGAGGTTGAATCGGTACATCTGCAGTGGAGAGCGCTCTCATAGCCTATGAAACCTCGCGTCGAAATTGAGCGGCCTGCGCATCTTTTCGCGCAGGTCCGCTCGAATGTAGTGGTAGGGGGCGTATTGGTATTGATGACACGAAGCGCCCTCAACGACTGAAGTGCCGAAGCATGAGATCGCCGTACCCCCAGACAAATGCACCCCAGATAGCAACGAACGTAATGATCCAGCTAACGGTACCGTGCTGTTTGTTGAACAGGCGGAAGAGTGCCCACGACTCGGAGAATGTTCCACCACGGATGGATTCGAAGAAGTTGTTGATTCTGAGTTGGGCAAAGACGCAGAAGACCGAAGTGATTGCACCGCTGCGCTGAAACCACGAAGCAAGAGGTTCAGATTCCGGCTTTAGCACCGCGGCAGCCGCCAACACCTCCGCCAAAACCGCGACAAAACACAACGACAGGATGAACAGCAATTCCAGTCTAAGGCGGGTCTCAACGGCGATGGTCACTTGTGCCCCTAACGTTGAAGGCAAGGGGCGGCCCACTTGCGGGACGTCCCAGCAGCCGAAGGCTGCGCCTTGAACGTGGTGTTAGGCTCCACCACTCAGCACCTCTGCGATATCTTGACGTGCGACAGCGATGAATGCCTCTTTCAGTTTGAAGAAGTCAGCAACTTCCTTCGCAGGCTGCGCACTATGACTGGTAATGACGTAGTCGGCCAATTTCTTCGCCGCCTCCACGACCATACCGGATGCGACAAGTGAAACCTCCGCGAACTTTCCGTTCAGCATGTTTAGATCGGCAAGGGATGAAATTTTCTCTTCTCTCGCCTGAACTACGAGTCGTTGCGTTTCAACTAGAAACTCCGCGTAAAGTTTGTGGCGAATAGTGCTTTGTTCTTTCGCAAGCGCAAGTTTCCATTCGTGGTTCTTTACACTGCGGCTCGCCAAGTAGTTTATGAACCCACCAATCAACACCCCCGAGAGGGTACCGAGTACGGCGATAGTTTCAGACGGCATAGTGGAGCCTAACAGTTAATAGACGGACCCATGGGTCCGTCTATGCGTAATCGTACGGACATGAGAAAAATGCCGGGAACCTTGAACTAGCGACGATTCTCGCGCCTTGTGGCCGAAATATGATTTACGAAAAAGGAGATGGAGTCGCAAGTATCTATCCATCTTGCATTTGGCTCAAGGCGAGCGTCTGCTCGATGTGAAATTCACAGTATTCATGCGAATACTCACCTAACCTTCCCTTGGACTTTCGCTGATTAAAGAGACTTTTCTATTTTCTGCCCGGAGGTTGGCGATCAAGGAATCCTTCTCTTTCAACAAATCCTCCAGCTCTTGAATACGTTGCTGGAGCGCGTAATTCTCCCGGGCAATCTTCAACAGATTGGACTGCTCCAGTTCCAGAAGCTCCCGGTATTCCTTCTTTCGTTCCTCTGCCTTTGCAAGCGCGGATGTCTTGATCTTCAACTTCGACTGGGCATCAGTGTCGTTGATGCGCTGAATCTCGGACAATATCTCCCGGTGGTACCGATACAGTGTGGAACGCTCCACTCCAGCCTCTTGAGCAACGCTGGCGGCGGTCAGGCGAGACCCAACCCTTACCTGATCGGGATGTCCGCCAACAATGCGCTGTATTGCCAAGTTCAGGGCTTGTCGCGTCAGCTTGATCGAATTTTCCTTATGGGCCTGTAGCGCTTGCGGGTTCCCAGCATTATTGCTCATTGAGGTATTTCCAATTTGGCGATGACATCAAGGGCCTTATTCAGGATTCGCTGGGACCGGGTTTTACCAGGCAGCCCCAGATCCGACATGTCTAAAGCCGTTTGCTGCTGAAGAGCAATTTCCTTCCAGACCGGAAGGTGCTCAGGGCCGATCATCCCGTAGGCGCAATCGACGCACATGTCTGCCTCAAAAACACACAGGCCACCACAACTTGTCCCCTTGGCATTGCCGATGCACCAACTGTGGCCGGTCCCATTTAGGGTGATGCTTGACGAAATCTGTTGCAACAAGGACTGCTTATTCTTGGCAGTTTTGATGGTCTGGCGTAGATCGGCAAGCATGATGTCGCCGTTGGCCAGGGGAGCGTCTGTCATCAGGTAATTGTGTAGAACGCTCTCCTGCTTTTCTGTCTTGCTCCGCAATATTTCCGTGAGTAGGTCTGTATCCGTCTGGTAGGCATCGGAGGCTCCGCTGGTATATAGCAGCGTCATGTCAATGGACCAGTGACCAAAATGCTCTCGGAGATAGTGGAGATCGCCCAGCTCGGCTGAGGCGACGAAATAGGCATAGGTTCTGCGGAATTGATGGGATGAAAGCCGCCAATATTTGCCGTCGTCTCCCAGAATATTGAAATGTGGACAGAAGTTCTGGAGGCGTCGATGAATGGTGCTTTTGCTGAGCACTCCTACTAAATGACCCACACAGGGGGCGCATCCCAGGAAGAGCCCATCCCGGTCCTTTCTGGCGGTGTGCAGGCGTTTGAGTTGGCGTTTGTGGAAGGTCGAGCCCGGAATGGACATGTCGAGTTGCTGTTCAAATTGGGAAATCTGCTCCTCGATCTGAATTGCATACGGTTGTCGATACCACTCCATTACACGAACTGCGGTCTCAACAATTGGCGGGACGAGCCATTTGTGCGGCTTTATTCCAGTCTTGAAGATTGTGCCGTGAAGCCAGATGAGATCAATGCCATCGTCCGCTTTGTCATGAGCGATGCACCCCCTTTTTAGGGATAGCGTTTCGGAATCCCGGATTCCGGAGAACATGGCAATCACGATATAGCAGGAATCCCGGAGATAACTCAGTTCTGCACTGAGGTCACGAGAGCCCTCAAAACCCAATTCCCGGGCTAGTGGTATTCGGATATTGGTGGCTTGGAATCCCTCTTTGTTAGCCACCGCCTGCTCCAGGGCATCCCGAGTGCTTAAGATATGGGGAGCTAGGTTGGTGACGTAATTTATTGCGACTTCGGCTAGCTGGCGAACCGTGCGCTCTGGGATTCGTAGTGTCTTGATGGTTCGACCTTTGGCTCGTTCTCCTGACAGAGTGAAACCGGATTCATCAGGCCATGGATGTGTAGGTAAGGCATCGGCCAGCTTTTCACGTTGAAGATATAGCTCCTCCAAAACTATCAACCGCCTTGCATGCCAGCCCTTGGCTGTCGGTTTGCCTGTTCTCTGATTAATCTTGGCAATGGGGACATATTCCAGGGCTCTTCCCTCTATATCCCTGAACTGTCGCATGCCTCGGGACGCCATCCAGCGCAATAGGGGCGTTAGCGCTACCACATGGGCAATCACGGTTGCCATTCGCGGTCGTTGGCGCCCATCGATAGGGTCTATAAACAAGGACCAGGTAAAGTCCTTGGTACTTTCCAGCAGGGGGGCATGCTGGTGGTCAGTGAGAAATTCACCAGGTGCGATCTCAATGCGCCAGTTGATCCGCTTTTCACTATCCCTAGCGTTGTTTTGCGGGATGTAAGGCCAGAAGTCCCAGATTTCGTCCTCATACCGGGACACCACGACGGACTCACCCAGATCCGTCTTCGCCATAGAGACCGGAGTTCTGGCCTTTTCTGCGGATGAAAGCGCTCGAATGGGCTTCGCAGGATTAGCTAGTGTGCTCATATCAGGGGGGCGTCATTGCGCCAGGCAGGATGCGGTGTGGATTGGGCTCGCTGGCGGGCTTCCGCCACTACGGCAGAATCAAACTGCACCGCAATTTGTTCGTCGATAGTTCTTATGACTGGTCCGAATGTTTTCATCCACTGGTGGGGAGGAATCTTGGGGCGCTCAGCCAGGAGACGGAAATAGAAGCTGTATAGGCGGTAAAGGTCGTCCTCAAACACCACCATGCTGGGGCAGCGAAAGCAGGCCAGGAATTTTCCGCAGACCTGGTCCGCTTCGCGAAATGGATTCCGACAGCGGGCAATCGATGTGTTGTAGCCACCGGTGAGCAGCTCAGTTGCGTTCCGAAGAGGAATCTCTCCATCTGCTGCTAAGCGGATAGCTAAGGTTTCATCTCTGCTGGTGGCCCAGCCCACCATGGCCTGGCCAATGAAGGCATGGTTGCGAACAGCCTCGGGAGTGACTGGTGGAATGTAGCGTTGGATGGTCAGGAGAATGGAACTATGGCCAAGGGCTTGTTGTAGCTTACGTAGGTCGCGATGACGTAGGTAGTAATTCAGCGCAAATGTCGGTCTGAGGCGTGCGATGCTCAGGTTCAAGGGGGCGCCTCGGTCGTCAAATAGCTCATGTCGATTGACAAAGCTTTTGAGTGCATTGCGCACCTGAACTTCGTCGAAACGAACAACATTTCCGCGACGTCGTGAGTACGTGACGTCGGCCAAACGGCAGAGAAAAACGAATGGGCGATCTGCTTCGTCAGCGTCTGACACAAAGCGCTCTGTGTATTGCTGTAGCTGCCGGAGATAATCTCCCACCATTTTGGTAATGGGTGTTGCAGTTTCCTCCGGTGTATTTTTGGGAAGGGATATGGTGCGCGTGGTATACCCACGGCGCTTCTCCAAGACCAGTAAGTCTCGATCCGGAAGAAAGCTGCGCAGGCTATCCCGGCGCATTTCCAGGAGGGGGGTCATATTCACGCCGCAGGTCAAAACGGTGATGACGGCATGAACTGCCAAGACTTGATGAGATGAGAGGGCTTCCGGATTTGCGTTGAATAGGGCCAGATCTTCTGCGCAGGCAGCAATGATCCGCTCTTGCTCCCCTGCAGAGTAACCTTCTCGTCGCGGTATCCGCTTATTGGAGTTGGGATAGGGATTCTTGGGGAAATTCAGGTTAGGGTTGATGGACTCTGGCACATGTTGCCGGCGATTTTTGAGTATTGATTTGATACCGGCATAGGTCGCATTACAGGCGGATACAGACCACGGTAACCCCTTGTTCTTTCCGTGTTGAACGGTCTGTAGGGCCAACCATGCTACAAATTGCATGATGATTAGCCGGTCGACCTGTTCCAGAGTAGTCACTACTTGGCCGGATTGCTCCAGGTCATCCAGGAAGTGCCAGAAGCAACGAATACCGCTATTGAAATAGGTCATCAGTGATTTGCCGGCAGAGACAAAACGCAGAGACCAAATGGCATCTCGCATTTGCTCGGCCAACGGCTCACGGCCTCGGGCCCGGTGGCTTGTGAAGTCGAAGTGATACTCCGCGCCACTGTGTACACAGCGAATGGTGAACGCCCATCCTGGAGGGAGGTCGATGATCTGCTTTTCTGGGGTCAGATCGATGTGGGTATTGCGATCAACCCGCTTACGCCGGATCGCCATCAGACACAATCCTCTTGGAGCAGTTTGCAGATGTCAGCTTGGTAGCCATCCAAGGCTGTGTGATCTACCAGACTGGCTGTGTGTATATAGATTTGTGTCGTTGCCAAACTGCTGTGTCCCATGCGTTCCATGACCCAAAACAGAGCGCTTTCCTTACTCTTGAATTGACTCATCCGGATAAATTCGTAGGTGCCATACGTGTGACGCAGCATGTGTGGTGTGCATTTGATTCCTGATTTCTCTGAGGCCTTCTTGAAGGCGTTGTTTATTCCGCCCAGGCTGATTGGCTCTCCATATGCTGTCAGGAAGACCCGGTTGCTCTCAGGACCAACGTGCCGTTTACGGTAGCGCTTAGCCAGCGAAGATCTCTCATAAATTACGTAGTTCCATAACTGGACAGCTAGGTCATAAGGGACAGATACCCATCGGGGTTTTCTGCCCTTTGTTGTACGTAGCGTCATCGCGAGAGGCTTACCTGGTGGATGGCCAGATGGATTGGGAATGTCCTCCAACTCCAAGGTTGCAACCTCTTCCCTACGAAGTCCCGAAAACCACATCAAGTAGGCCATTAGGCGGTTACGTCGCGGGGTTAGCGCGGTGACGAACAGCCTGGCCTTGTCGGCTGTTAGAAATTTTGGTTGTGGCTTAAATGAACGTAACTTCAGTTCATTGGCTTGGACTTGATTTCCCTTGGCATCAATGTGCGCCAGAAATCCCTTCGGCCGGGATACCCGAACGTCCTCCATATAGAAGGGGAGGGAAGAGATTTTCTGTGAGCGCAAGGCCCATTTGTAGAAGTTGGATACAGCCCCCACACGGTGGTTGACCGTAGAGCGGCGACATCCTCGTGAAAGCATGGAGTCCCGCCAGGCTGCTAGATGAGTGCTTCTTATCCTGTCCCAGGAAATCTCTATGGTTTCCAGGAAGCTGAAGAATTCATAGAGGTGATTTCCATAGGTCTGCCAAGTCTTCGGGGACGATGTTCTTCCCCTAACTACGGCGACGTGGAAGAGATATTCATTTGGCGCTGTGACAAGCGCCATTTGGTCGTCAATGAGAAATGGGATGCCTGGGTATGGCTGTCCGTGTACCGTGAAGGTTGGGTCGGTGAGGAATAAGTGCATATCGATCCAGGCAGTAGTAATCCAGTTGTCCAGAATGGCAACTTGGAGAACCGTAACTGTAAGGACCTTTTTTTTCTACAGAGGCAACACTCTGCAGGTCTCTTAACACGTCCAGGGTCGGCATGCCTGCGGCCCCTGCCTGGAGCGGCAATAGCAATGCGGCTCCCGGCAGGGCGCCGGCCAGCAGGACCGTCCTTATTGCCGCCTGGAGGCGATGGTTTTTGCCAGGTGTGAAGATATTTTTATTTTTCATATCCGAGGGCGAGAGCAGTGGCAGGTGGTTGGATGACAGCGCAGGATAGGGCGGGCGGTTCAGGCCGGCAGGGGCCGGCGGGCGAGGCGCAGGGCGAGCAGGAAGCTGGCGGCGATGATGCATACCAGGGCGAAGCCGAAGGCCAGTTCCTTGCCATCGCTCCAGGCTTCCACCGCGGGGGAGAGCATCTTGGCGGCGCCGAAGGCGGCCACCAGCAGGCTGACGGCGGAGACCACCAGGCCCATGACCCGGGAGGCCACCAGGGCCACCGTGTCGGCCCGGGCGATGAGGCGGGAGATCCACAGGCCGTTGATGCCGTCGGTGATGAGCATGCCCAGCATGAACAGCAGGGAAAGGCCCAGGGCGTGTTCCCAGCCGCCGTGGCGGGTGGCGGTGAAGGCGAAGAGGGCGGCCTGGGACAGGGTGTCGAAGGAGAGGGCGAAGAGGGCGCCGACGGCGGCCACCAGCAGGGGGTGGCCGGCTTCGGTGAGGCGGCCGAGGAAACGGCCCTTGATGCCGGCGGGGCGCACCATCTCGCCCGGAGCGGCGGCCAGCACGGCTCGCAGGTTCACGGCGCCGAGCAGGGTGAGGAAGGCGATGGAGATGAGGCCGCCCACCCATTCGAACCATTCGGGCACCTGCCACACGTCCGCCAGGGTGCCGACCGCCAGGGCGATGGCGATGACGATGGCGCCGTGGCCGAGGGAAAAGAGGGTGCCGCAGTAGCGCGCCAGGCCCGGGTTGCGCCGGCTGTTGAAGCGGGTGAGGCCGTCGATGGTGGCCAGGTGGTCGGCGTCGAAGCCGTGCTTCATGCCCAGGATCAACACCAGGGAAGCGAGGGCGAGCCAGTTGTCGGGCAGGGTATCCACGGGCCGTGAGGGGGCCGCCGGCGGGCGGCAGAAGGTTAGAAGATGTGAAGAATAAACTTTCCGTCACACGATTGGGGTTGACCTGGATCAGAAAGCGGCAGGGAAATCCGGGGCGACGGGCTCGATCCGCTCCACGGCCGGCCGGGCTGGGGCCGTGGGCAGGTGTGCCGGCAGCAGGGTGCCGCCGGGGGGCTTCAGGCGGCGGGGAGGTCCCGGGGCAGGGTCAGGCGGAACACGGCGCCGCCTTCGGGATGGTTGCTGGCGACGATGCGGCCGCCGTGGCGTTCGACGATGCCGTAGGAGATGGAAAGGCCCAGGCCCGTGCCTTCCCCCACCGGCTTGGTGGTGAAGAAGGGGTCGAAGAGGTGGCCCAGGGCTTCGTCGGGAATGCCGGGGCCGTTGTCGTGGAAGGTCAGGCGCAGTTCCCGGTCCCCCAGTTCGCCGCTGATCAGCAGCTCTCCCCCGGGCTGGGCGGCGGTGGCCTGCAGGGCGTTCTGCACCAGGTTCATCACCACCTGCTGCAGCTGCCCCGGGGAGCCCCGCAGGGGCAGTTCCGCCGGCAGCTCGGTGCGCACGGTGAAGCTGGGCGGGGCGCTCTTGGCCACCCAGCGCACCGCCCGCTGCACCACCTCGGCCAGGTTGAAACGTTCCGCCGCCTCCCGGTCCAGGGCGGAGAAGCGCTTCAGGCCGTCCACGATGTCCCGGGTGCGCTCGGCCCCTTCGATCATGCCGTCGATGAGGGGGGACATGTCTTCCAGGATGCGGTCGATGCGCAGTTCGCTGCGCAGCTCCTCCAGTTCCGGGATGCAGTCGCAGCCCCGGTTGTGTACCGCTTCCAGATAGGTTTCCAGGCGCCCGGCGTAGCGCTTTAGGGCCAGCACATTGCCCAGCACGAAGCTGATGGGGTTGTTCAGTTCGTGGGCCACCCCGGCCACCAGGCGGCCGAGGGAGGCCATCTTCTCCGAGTGCAGCAGCTGCTGCTGGGTGCGCTTCAAGTCCTCGTGGGTCTGCCGCAGTTCCCGGTAGGCCCGGCGCAGTTCCCCCACCGGGCGGCCGGTGACCACCATGCCGATCAGCTTGCCGGTGCCGGAGAGGCGGGGCGTCCAGTTGAAGGAAACGGGTACGGCGCCGCCGTCCTCGGCCTGCAGGGGCAGCTCCACGTCCTGGGCGCCGTCGTGGCTCTGGCTGGCGAAGAAGCTGCGGGCCTTGTCCCGGGCCTTGTCGTCGGCGAAGAGGTCGAAGATGGAAGTGCCCTTGAGAGTCGGCTCGTCCCGTCCGGTGTAGCGCTGGAAGGCCGGGTTCACCTCCTCGATGGCGCCGTGGCGGTCGCACACCACCAGGATGTCGGACATGGAGGCGAGGATGCTGGCGATGAAGCGGTGGGACTCCTCCAGGGCGGCGTTGTTCTGCTCCAGGGCCACTTCGTACTGCAGCAGGTCGTTGTAGACCTCGTCCATCTTCTGGATCACCTCGATCCACACCTTCTCGTTCACCCCCTCCAGCAGCTGGGCCGGCTCCGGCAGCAGGCTGCCGTCGAGCAGGGGGCGGGCGGGACGCTTGGCGGTCATGGCGGCGGGACTAGCGGCTGGGCTTCAGGTGCAGGTGGCGATAGCCGTGGCCGTGGGCATGGCTGTGCTTGTGGCTGTGCTGCTTCTGGCTCTCGTCCAGTTCCACGGTGACCAGGTTGAGCTGGCCATGGCGCACGCCGCGCTCCGCCATGAGGGCCTGGGCGAAGGCCCGCACGTCCTCCGTCGGCCCCTTGAGGATGGTGCTCTCGATGCAGTGCTCGTGGTCCAGGTGGGCGTGCAGGGTGGACACGGTGAGGTCGTGGTGGTCGTGCTGGATGGAAGTGAGGCGCTCAGCCAGTTCCCGCTCGTGGTGGTTGTAGACGTAGGAGAGGTTGGCGACGCAGTGGTCCGATTCCTTGCGCGCCTGGCGCCAGGTTTCCAGCTGGGAGCGGAGGATGTCGCGCACCGCCTCTGAGCGGTTGCTGTAGCCCCGGGCGGCGATGAGGGCGTCGAATTCCCGGGCCAGGTCCTCGTCCAGGGAAATGGTGATGCGTTCCATGCGTTCTCCCTCGCCGGCGGGCGGAAGGCCGCGGAATGCGGCAGGCCGGCGCAATGCCCGCCAGTCTACCAAAAGGCACCGGGCTTCACGGCGGCGCCGGGGTCTTTGCTGCAAAAGAAAAGGCCCGCCGGAGCGGGCCTGGGGATGCGGTGCGCCGCCATGGGCAGCGCCTCGATGCTCAGCCGCGTTGCTGCAGGGCGGCGATGCGCTCCTCGATGGGCGGGTGGCTGGAGAAGAGGCTCATCCAGCCCTTGCCGCCGGCGATGCCGGAGGCCGCCATATTGGCCGGCAGGGGCTCGGGGGACAAGCCGCCGAGACGCTGCAGGGCCGCCATCATGGGGCGCGGGTTGTTGCCCATGAGCCGGGCGGCGCCGGCATCGGCCCGGAATTCCCGCTGGCGGGAGAAGTACATGACGATCATGGAGGCGAGGATGCCGAAGACGATGTCGCACACCACCACCGTCACCATGTAGCCGAGCCCGGGGCCGGAGGATTCCTCGTTGTCCTTGCGCAGGAAGCTGTCCACCAGATAGCCGACCACCCGGGCCAGGAAGAAGACGAAGGTGTTCACCACGCCCTGGATCAGGGTCAGGGTCACCATGTCGCCGTTGGCCACGTGGGCCACCTCGTGGGCCAGCACCGCCTCCACTTCCTCCCTGCTCATGGACTGCAGCAGGCCGGTGGACACGGCCACCAGGGAGTTGTTGCGGCTGGGGCCGGTGGCGAAAGCGTTGGCCTCGCCTTCGTAGATGGCCACTTCCGGCATGGGCAGGCCGGCGTTCTTGGCCAGGCGGGCGACAGTGTCCACCAGCCAGGCTTCGGTGGGGTTGCGCGGCTGCTCGATGACCTGGGCGCCGGTGCTCCACTTGGCCATGGGCTTGGACAGCCACAGGGAAATGAAGGAGCCGCCGAAGCCCATCACCGCGGCGAACCCCAGCAGCATGGGCAGGTTGAGGCCGTTGGCGGTGAGGAAGCGGTTGAGGCCCAGCAGATTGATGACCAGGCCCAGGACCACCATGATGGCCAGGTTGGTGGCAAGGAAGATCAGAACGCGTTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_AP021844|2967016:3023959|3022410_3022899_-|WP_014236251.1|DBSCAN-SWA MERITISLDEDLAREFDALIAARGYSNRSEAVRDILRSQLETWRQARKESDHCVANLSYVYNHHERELAERLTSIQHDHHDLTVSTLHAHLDHEHCIESTILKGPTEDVRAFAQALMAERGVRHGQLNLVTVELDESQKQHSHKHSHAHGHGYRHLHLKPSR >NZ_AP021844|2967016:3023959|2977311_2977560_-|WP_014235622.1|DBSCAN-SWA MNISSILVNAGPQQIAAVEAGLATLAGVEVHAVSEEGRMIVTIESDGDRETTQTYEAIQQLPGVMSLAMVYHHFEPDPEKES >NZ_AP021844|2967016:3023959|2971516_2971993_-|WP_014235627.1|DBSCAN-SWA MSQRPFKVLGIQQIAIGGPSKDKLKTLWVDMLGLEVTGNFVSERENVDEDICAMGKGPFKVEVDLMQPLDPEKKPAVHTTPLNHVGLWIDDLPKAVEWLTANGVRFAPGGIRKGAAGFDICFLHPKGNEESPIGGEGVLIELVQAPAEVVDAFAKLAG >NZ_AP021844|2967016:3023959|3012699_3013314_-|WP_014236244.1|DBSCAN-SWA MRALSTADVPIQPLKEFGEDVGPEFEFEIEDGRVALLSAEPPSWIHLIADSSWWISLFSAAAALYMAEIVKEAAKESWKNRAKATALVVGAANKVKLFAEKVVRLKKKLPERTDIVVALPIPNDYFGVRLTLSVDDADLLAYQIALFVSHMPRLIEAIRNEKLDGPRVATGLFLELQDNGDLKVTWFNRESLELESLVILLPVQ >NZ_AP021844|2967016:3023959|3009185_3010550_+|WP_014236240.1|DBSCAN-SWA MSTILTAADSTSPESKPAEVSFWQAFLFWLKLGFISFGGPAGQIAIMHQELVERRRWISERRFLHALNYCMVLPGPEAQQLATYIGWLMHRTWGGIVAGGLFVLPSLFILIGLSWIYIAFGNVPLVAGLFYGIKPAVTAIVVQAAHRIGSRALKNNALWAIAAASFVAIFALNVPFPAIVAAAAAIGYFGGRVAPDKFKAGGGHGKADKSFGRALIDDDTPTPVHARFSWGQLAKVALIGGLLWLVPMGLLTASYGWSHTLTQMGWFFTKAALLTFGGAYAVLPYVYQGAVGSYGWLTGPQMIDGLALGETTPGPLIMVVTFVGFVGGYVKAVFGPDSLFLAGAVAAMLVTWFTFLPSFVFILMGGPFIETTHNDLKFTAPLTAITAAVVGVILNLALFFGYHVLWPKGFDGAFEWVSALIALGAAIALFRFKANVIHVIGGCAVIGFLVKMFL >NZ_AP021844|2967016:3023959|2973714_2974695_-|WP_152090379.1|DBSCAN-SWA MSDLSNSPPAKSDKAAAARRQFFADAGRMACGVGLLGLGLGFHAKQARALPPAALRPPGAGAEEDFLGACIRCGLCVRDCPYGTLSLARPEQPVSTGTPYFVARQVPCEMCEDIPCVKACPTGALDHGLTDINQARMGLAVLLDQETCLNFLGLRCDVCYRVCPVIDKAITLELRPNTRTGRHSMFIPAVHSEHCTGCGKCERSCVLETAAIKVLPVPLAKGELGQHYRVGWEEEQKAGHSLVDDKGLGDLPDRMPEGARLEGHFDPASQGGPSLVPGKPATPGSGVDSLAPSIPGADAHGPGVPAIPQNIPGGGLPNRLSDEAAR >NZ_AP021844|2967016:3023959|3001050_3004209_+|WP_014236233.1|DBSCAN-SWA MLERMIRAAIAHRWLVLILVLGTSALGVWSYGRLPIDAVPDITNVQVQVNSEAPGYSPLEAEQRVTFPVETALAGMARLKYTRSISRYGLSQVTVVFEDGTDIYFARQQVSERLQQASSQLPAGVKPTLGPVATGLGEIFMYTVEATPGATKADGKPWMPTDLRTLQDWVIRPQLRNLKGVTEVNTIGGNVQQFHVTPDPAKMVAYKLTIDDLLQAIERNNANTGAGYIERGGEQNLIRIPGQVGDEAGLREIVVAMRDGLPLRISDIATVQIGSELRTGAATRDGREVVLGTVFMLIGENSREVAMRAATRLKEIDASLPEGVSARAVYDRTQLVDRSIATVQKNLLEGALLVIVVLFLLLGNIRAALITAAVIPVAMLMTITGMVQNRVSANLMSLGALDFGLIVDGAVIIVENCLRRFGERQHALGRLLSIEERFQLAAKASAEVIKPSLFGLFIIAAVYLPIFALSGVEGKTFHPMAITVVMALVAAMVLSLTFVPAAIAQFVTGKVEEKETRLMQRLHGIYAPLLEKSLSLQKPVIGAAAVLVVLCGLLATRLGTEFIPNLDEGDIALHALRIPGTSLTQAIGMQAQLEARIKQFPEVDKVVGKLGTAEVATDPMPPSVADTFILLKERKDWPDPRKSKATLVAELEEAVRAIPGNNYEFTQPVQMRMNELIAGVRAEVAIKVFGDDLQALTAVGKQIEKVAGSISGSADVKLEQVTGLPLLVIKPDRAALARYGLAVADIQDTVSAAMGGATAGQLFEGDRRFDIVVRLPDAQRQDPKALAALPIALPATSRADGASLSRMPGVVPLSAVATIAVELGPNQVSRENGKRRVVITSNVRGRDLGSFVEELRGKVAAEVVLPVGSWVEYGGTFEQLISAGQRLSVVVPVVLVMIFGLLFMAFGSAKDAAIVFSGVPLALTGGVLALWLRGIPFSISAGVGFIALSGVAVLNGLVMITFIRKLRELGQPLHTAVTEGALTRLRPVLMTALVASLGFVPMALNVGTGAEVQRPLATVVIGGIISSTLLTLLVLPVLYRLIHRNENEETAA >NZ_AP021844|2967016:3023959|3010546_3010942_+|WP_014236241.1|DBSCAN-SWA MSLGMGTARWVGVAVLVTAINAAAADRGWVLLQGKVLAQALSNQDFGDGVHFAYQFLSGGELRGMNMGKPARGNWRVIGNELCWHWTRPKEPEECYQVRQRGQAVRLYLDGQEVLSGNLTPLPANLKEMPQ >NZ_AP021844|2967016:3023959|2974784_2977310_-|WP_152090380.1|DBSCAN-SWA MNLTRRDFIKSSAVAAAANAAGMAVPGVSEALAQQPKNDGIRWDKGVCRFCGTGCGVLVGTKDGRVVATQGDPEAPVNRGLNCIKGYFLSKIMYGKDRLTQPLLRMKNGQYDKNGDFTPISWDQAYDIMAEKCKAALKAGGPRNIAMFGSGQWTIWEGYAAAKLWKAGFRSNNLDPNARHCMASAVAGFMRTFGIDEPMGCYDDAEHADVFALWGSNMAEMHPILWSRITDRRLNAKHVKIHVLSTFTHRSCELADNELIFKPQSDLAILNYIANYIIQNGAVNQDFVKNHVKFKKGVTDIGYGLRPNHPLEQAAGNNGYPGPDGKPKGDPNKATDISFDEFKAFVAEYTLDKTHEISGVPKENLEALAKAYADPKVKVVSYWTMGFNQHTRGTWVNNMIYNVHLLVGKISEPGNGPFSLTGQPSACGTAREVGTFAHRLPADMVVVNPKHREITEKLWKLPAGTIPDWVGLHAVAQSRALKDGKVAFFWSTTTNNMQAGPNINGEVYPGWRNPAAFVVHSDVYPTVSALAADLILPSAMWMEKEGAYGNAERRTQFWRQQVKPQGQARSDVLQYVEFSKRFKMEEVWPAELLDKAPEYKGKTLYDVLYANGEVNKFPVSDQLKGFENEEGKVLGFYLQKGLFEEYAAFGRGHGHDLAAFDTYHKARGLRWPVVDNKETLWRFREGYDPYVKAGEKVRFYGFPDGKAVVFALPYQPAAEQPDAEYDLWLCTGRVLEHWHTGSMTRRVPELYKAMPDAWIYMHPEDAKKRGLQRGDTVKVQSRRGEISTRVETRGRNKPPLGLVFVPFFDEHRLVNKLTLDATCPISKETDFKKCACKVVKA >NZ_AP021844|2967016:3023959|2980114_2980948_-|WP_152090382.1|DBSCAN-SWA MTIDPSALSPLGKASEYRCHYAPELLFPIPRQLKRDEIGIDPARLPFVGEDLWNAYEISWLNPRGKPVVALGTFRIPAQTPHLIESKSFKLYLNSFNQSAFADAQTVAATLVRDLSAAAGGQVTVQLEPLAAQPRPRVDYPSGILLDELDIECDRYQPAPELLQADAGRSVEETLYSHLLKSNCLVTGQPDWGMVVVRYRGPAIDRAALLRYIVSFRGHNEFHEQCVERIFCDISARCAPQSLAVYARYTRRGGLDINPFRSSGEFLPPDNIREVRQ >NZ_AP021844|2967016:3023959|2994600_2996535_-|WP_152090900.1|DBSCAN-SWA MGVADAASVGTVTAKQLENRPLLRPAEVLETVPGLIVTQHAGDGKANQYFLRGFNLDHGTDFSVTIDGMPINLPTHAHGHGYLDLNFLIPELVERIQYKKGPYAAEDGDFSSAGSARIDYRRALPEDYVSIGLGSNGYRRLLTAADKETAGGGRWLGAVEVFHNDGPWEVPEHYKRLNGVLRYSEGTRNNGHSLAFMAYDGDWTSTDQLARRAVDQGLVNRYGSLDPTAGGNTRRLSLSGQWARQDGAVQTRANTYLVDYRLNLFSNFTYAMDDPVNGDQFEQADRRRYGGFGWSRSQPVQWLGKEGDFTWGVQGRQDDIDNVGLYRTAARQRLSTVRSDSVNQGSLGLYGQWGAQWSDWLRSVAGLRHDRYRFKVDSSLAANSGKENDGITSPKLSLIFGPFANQEFYYNWGQGFHSNDARGTTIRVDPSNPGDPMSRVPALVKSRGQEVGWRSAPAPGWNTSVALWRLDLDSELLFVGDAGTTQASRPSHRQGMEWSNYWTPRDWLTLDADIALSKARFRDDSSVGNHVPGAVERTASVGVAVHDLGPWRGGLRLRYLGPRALKEDDSVRSGSSVMVNLNVGYKLAAKSQLTLEVLNLFNRKASDIDYYYESQLRGEAAKVEDVHSHPAEPRIWRLTYTQGF >NZ_AP021844|2967016:3023959|2998098_2998500_+|WP_133247329.1|DBSCAN-SWA MYQSLRHRFRAYVQHPIGRCAVVLMLFALVAASVPFGEIHAHADGDHDHDHGYVTAELTKASLSDPSDSMDSDSDSTGAKVLHAHGSVVTPPPLPVDGLGIEPFIFPARDKITLAYLSRPSATPLPPYRPPIA >NZ_AP021844|2967016:3023959|3011390_3012602_+|WP_014236243.1|DBSCAN-SWA MTGRWLLPEGVDRVVLPLLVGKALRAFADGYVAVLLPAYLLALGFGTLDVGILSTTTLLGSAFATLAVGAWGHRFHHRNLLLGAALLMLGTGLSFASLSAFLPLLLVAFVGTLNPSSGDVSVFLPLEHARLAESGQGTARTTLFARYSLLGALFAALGALASGIPQLLVSVLGIELLSGFRVMFVLYGLVGGTVWLLYRRMPAPRRECAVAAPQALGESKGVVVRLALLFSLDSFAGGLAINALMALWFFQRFELSLAAAGSFFFWAGLLSAVSQLIAPKVAERIGLVNTMVFTHIPASICLIAAAFAPGLELAFALLFIRALLSQMDVPVRSAFVMAVVTPAERAAAASFTAVPRSLASAISPTIGGAMFAAGWLAAPLVACGALKICYDLMLWKAFRQRDP >NZ_AP021844|2967016:3023959|3015008_3016919_-|WP_014236246.1|integrase|DBSCAN-SWA MSTLANPAKPIRALSSAEKARTPVSMAKTDLGESVVVSRYEDEIWDFWPYIPQNNARDSEKRINWRIEIAPGEFLTDHQHAPLLESTKDFTWSLFIDPIDGRQRPRMATVIAHVVALTPLLRWMASRGMRQFRDIEGRALEYVPIAKINQRTGKPTAKGWHARRLIVLEELYLQREKLADALPTHPWPDESGFTLSGERAKGRTIKTLRIPERTVRQLAEVAINYVTNLAPHILSTRDALEQAVANKEGFQATNIRIPLARELGFEGSRDLSAELSYLRDSCYIVIAMFSGIRDSETLSLKRGCIAHDKADDGIDLIWLHGTIFKTGIKPHKWLVPPIVETAVRVMEWYRQPYAIQIEEQISQFEQQLDMSIPGSTFHKRQLKRLHTARKDRDGLFLGCAPCVGHLVGVLSKSTIHRRLQNFCPHFNILGDDGKYWRLSSHQFRRTYAYFVASAELGDLHYLREHFGHWSIDMTLLYTSGASDAYQTDTDLLTEILRSKTEKQESVLHNYLMTDAPLANGDIMLADLRQTIKTAKNKQSLLQQISSSITLNGTGHSWCIGNAKGTSCGGLCVFEADMCVDCAYGMIGPEHLPVWKEIALQQQTALDMSDLGLPGKTRSQRILNKALDVIAKLEIPQ >NZ_AP021844|2967016:3023959|3013422_3013785_-|WP_043797805.1|DBSCAN-SWA MTIAVETRLRLELLFILSLCFVAVLAEVLAAAAVLKPESEPLASWFQRSGAITSVFCVFAQLRINNFFESIRGGTFSESWALFRLFNKQHGTVSWIITFVAIWGAFVWGYGDLMLRHFSR >NZ_AP021844|2967016:3023959|2991625_2992534_+|WP_152090387.1|DBSCAN-SWA MAQETSRDPIKALLDDLEQSIADFDQRLGGVEESPAVTGLRSSGQRYPDIEPEARRQLSPAAPVAVAGNADATAVSEAPAVDLLAELAQAAACRSVDDAETQRRQLELTERLHQDLKTVFDYLNQLIRHANTLKPVLPRSYRLDARNSFDGLAWHDGFVDYRSTSRFDRSYYEQILFQVSYRAPAPLVAVCAADQAAIVRKELELVNLRIQREEPVMLPEGGPGVRYVLPDAIPLHLAVQADFANDALTFRCRNAGNFGPTAYRLPGGSITRPLLDGIGLVLLGRSDTMPKELQRIPYQRIN >NZ_AP021844|2967016:3023959|2968362_2969466_-|WP_152090376.1|DBSCAN-SWA MSVAPPSPSQRQARSRYQRGMLAWLQQPGDPAGLPEMRAAVRHLEAAAGGDFAPFWHSAEVFLRAISDGTLAVDAESRRLCARIDLQMRAALNGSEAPEGGLAEELQQCIRQGAGQLPPVTELISLMAKPEAPDLDAEAVAAWSAAGNAAVAAWNGRGSGDLAPFRRALIDLCAAAMSLNLPETLHLAESLAGVGDLLDAPEAAEDPYLRAAIAAALELLGDTRDLGLPVFAERVAHVAQRLAECRESQRPAVSPTLLRLFAGEIGEQAALMREELACLEPDGEALAESAHCLADHAAHLELDSAEALAQGLAAAIVRAQAGHGFDHPEVREALEAALAELDTMADFLLVAQPLPEATDILEILAQV >NZ_AP021844|2967016:3023959|2987149_2987356_-|WP_014235614.1|DBSCAN-SWA MARITVEDCLKQIPNRFQMTLAATYRARQIANGSTPMQEPSKDKPTVIALRELAAGQIGLEILNRGQA >NZ_AP021844|2967016:3023959|2977861_2980063_+|WP_152090381.1|DBSCAN-SWA MSRPLPILSTPAAAPPEAAPLTRYSPIPWGLVIVLSLLFVVVWLLPPLGGLKQSDTIFPLTLHTVMESFSFVVSVLVFAVSWHAYSRERAGNLMILACGFLAVALLDFGHTLSYRGMPDFVTPSSPQKAIIFWLAARYVAALTLLTIALRPWQPLARPRDRYRLMLWALLVTAAVFVSELYLPDFWPTMFVPGVGLTGLKIAAEYGLIAILGATAVILYPKTQGKPAFDAANLFTAVLITILSELCFTLYSNVNDVFQLLGHTYKVIAYFWIYKAVFVSSVRDPYLRLSLEMAERQAAEARIQFLAYHDPLTELPNRILVRERFERAVERARDQSSRVGLVYIDLDNFKTVNDSLGHTLGDLLLQAIGQRLQSLVPAGSTVSRQGGDEFLILLEDLEQSRLAESLVSRIVEQMQAPFEIQGHDLSTSVSIGVSLFPDDGGDFDTLLKKADTAMYRAKGAGRNGYRFFDREMDKDVGERLRLSNDLRLALARNEFVLHYQPQIDLRTQEVIGAEALIRWQHPELGLLAPGRFIGIAEDTGLIVPIGEWVIRMACHQAAAWQRAGLPPLVVAVNLSAVQFMRGDLVGTVASALATSALPSRCLELELTESILIQDAENILGTVQRLNAIGVQMSIDDFGTGYSSLSYLKRFAVDKLKVDQSFVRDLCSDPDDAAIVRAIIQLARSLGLKTIAEGVETAEILALLQELGCDEAQGYYFAKPLPADNFSAFLSQRLS >NZ_AP021844|2967016:3023959|2969562_2971437_+|WP_152090377.1|DBSCAN-SWA MFHSPSAPRDRPALLWQALLFFLLLLPAAVRAHPLVLDQDDGSFALVPHVEVLEDPGGKLDLAAVRQAAAAGRFAPAHALGELNFGYSSSAFWLRIPLESRLQRSSPWLLEIAFPSLDRVELFLPRADGRVDYQLTGDRLPFAERPYPNRNLVLPLELAPGESLALYLRVESEGSLTLPLTLWTPDAFRLHNQDAYAGFSLYYGMLLALGLYNLLLFFALRERIYLVYVAFAVSMAVGQLSLNGLGNEYIWPAFPAWGNVALPSGFAATGFFGAIFTRLFLNTRHSNPRADKLILALAAGFAVAALGPALLPYRWAAILTSLLGAAFSAVAVAVGVHAQLRRHPGARYFLLAWSLLLVGVGMMALRNLGWLPTTLFTSYGMQIGSALEMLLLSFALADRIQAERLARELAQGEALHSKQDLVNALRSNEQLLEARVAERTRDLAAANDRLLANEQQLQRMARHDPLTGLANRLLLDDRISHGLAVGRRNGTRLALLLIDLDGFKPINDKHGHAVGDQLLVVLADRLQRSVRAVDTVARLGGDEFVLVLEDLAAVEDGRQVAAKVVAEMSRPVVLEGRELLVSASAGLAFYPEDGEDAQTLLRRADEAMYEAKRAGRNTFRQVGQ >NZ_AP021844|2967016:3023959|2972797_2973718_-|WP_152090378.1|DBSCAN-SWA MSFLSPRFPAALAAKGRLGANRWLLLRRLSQFGILGLFLLGPLAGLWLVKGNLSYSLTLDTLPLADPLLVLQVLFSGHRPEGLALLGAAIVLAFYLLVGGRVYCSWVCPMNLVTDLAGWLRERLGLKGSAHISRRSRYWILGLTLLLPLAGAGLAWELINPVSMLHRGLIFGLGAAWTVVLAIFLLDLLIMSRGWCGHLCPVGAFYSLLGRTSLLRVSARRRQDCDDCMDCFAACPEPQVIRPALKGEANGTGPVILASACTNCGRCIDVCAKDVFVFGSRFNQHTQRCAPAGEAEGQTDHRKTIH >NZ_AP021844|2967016:3023959|2981004_2981910_-|WP_152090383.1|DBSCAN-SWA MIVRERPSLLRLFFIWRGSVVPHVLPQIVFTTSFAVLITWGAQHFGHLFPDYSAAPFALLGLAFSIFLGFRNSACYDRWWEARKQWGGLIVELRSLARDSLVLEAEPRRLLVRRSLAFAHALAARLRGRDAALEAAPFLPPSEAERLAQSRNPADALLRQCGHDLVQARQRDGLGDIVYQGLTQRLHALSGIQAACERIRFTPLPFAYTLLLHRTAHLFCLLLPFGLARSVGWATPLLTAVLAYTFFGLDALGDELEEPFGTLENDLPLDAMVRMLEGDLGEALGETDLPPLLQPQGYVLL >NZ_AP021844|2967016:3023959|3004962_3005604_+|WP_014236235.1|DBSCAN-SWA MSNTQTHSLPSSASLFKATAVAAGVAATLLVTMVLPAEYGMDPTGIGRFLGLDALKQSAGAETTSVLATPDAIAGPNAMLAAKADAAFGKQAGRSLDASAVSLAGDGPMRRNTFTVTLAPGKGAEVKAHLRAGEGLTFHWQATAAVAVDMHGEAPNAKNAWTSYSVESAQKSASGTFVAPFEGSHGWYWQNRGTEPVTVSIEASGFQSELYRP >NZ_AP021844|2967016:3023959|2983630_2984890_-|WP_152090899.1|DBSCAN-SWA MGRVRVLVLGAGVVGVTSAWFLAEAGHEVTVVDRQPGAALETSFANGGQISVCHAEPWANPRAPFKALEWLGKEDAPLLFRLRYDPALFAWSLRFLANCPPGATRRNIRDIIALALYSRQRLQALRQTLPLDYDQRCQGILHIFTQAAEFEAACHAAALMREFGVDREPVDAARCVAIEPALAAVQGRLAGGDYTPSDESGDAHRFTQRLAEAAAARGVQFRYNCPVEKIASAGGRVAGVVAGGDLLLADAYVVALGSYSPALLKPAGVKACVYPGKGYSATIALSPDSVAPSVSITDDERKIVMSRLGNRLRVAGTAEFNGHNLELTPVRCEALLRRALELFPQLRPDGDPLYWCGLRPVTPSNVPLIGRTRLPNLWLNTGHGTLGWTLSCGSAAALADLISGRRPEPDFPFLGTTKQ >NZ_AP021844|2967016:3023959|2989012_2989978_+|WP_152090386.1|DBSCAN-SWA MASQANHPLPAGFQLEDYRIEKQISVGGFSIVYLAHDASGKAVAIKEYLPASLALRSEGQTKPVISQEHLSAFRYGMKCFFEEGRALAKLNHPNVIQVLNFFRANDTVYMVMEYERGRTLQEFIQKHHGHIHEKFIRGVFTRMLNGLREVHTHKLLHLDLKPSNIYLRADNTPVLIDFGAARQTLHSDTPMLKPMYTPGFASPEHYFKRDELGPWSDIYSVGASMYSCLAGAAPQAADARMEKDQLQPASVRWEGQYSDQLLETIDWCLCLNHLYRPQSVFALQKALTEAVDMPGQGASKAAEKEGWLGHLVGKIKGMTAK >NZ_AP021844|2967016:3023959|3005988_3007326_+|WP_014236237.1|DBSCAN-SWA MLEILRHRSFRHLFLAQVVALVGTGLLTVALALLAYDLAGANAGAVLGTALAIKMIVYVTLSPVAGAVVPAAWRKRVLVGLDLIRAAVALLLPFVTEIWQVYVLIALLQSASACFTPLFQSLIPQILPEESDYTRALSLSRLAYDLESLLSPALAAALLVVISFHGLFAGTSVGFVLSALLVMSTAFPVVPETRLGDGPYSRALRGMRIYLHTPRLRGLLALNLCAASGASMVFVNTVVLVREVLGGGEREVAWALAAFGAGSMAVAFSLPTLLDRMADRRIMLSAASAMVVVLLAVTGVWWSTGGLGWASLIPAWVVLGMSYAGLVTPGGRLLRRSAQSDDLPFLFAAQFSLSHLCWLLAYPLAGWLGARLGFGVALSALSAMAAVGGALAWRTWPRQDPDVIAHHHDDLSTDHPHWNEYALGGGGRTHEHRFVIDELHQRWPH >NZ_AP021844|2967016:3023959|3021026_3022400_-|WP_152090390.1|DBSCAN-SWA MTAKRPARPLLDGSLLPEPAQLLEGVNEKVWIEVIQKMDEVYNDLLQYEVALEQNNAALEESHRFIASILASMSDILVVCDRHGAIEEVNPAFQRYTGRDEPTLKGTSIFDLFADDKARDKARSFFASQSHDGAQDVELPLQAEDGGAVPVSFNWTPRLSGTGKLIGMVVTGRPVGELRRAYRELRQTHEDLKRTQQQLLHSEKMASLGRLVAGVAHELNNPISFVLGNVLALKRYAGRLETYLEAVHNRGCDCIPELEELRSELRIDRILEDMSPLIDGMIEGAERTRDIVDGLKRFSALDREAAERFNLAEVVQRAVRWVAKSAPPSFTVRTELPAELPLRGSPGQLQQVVMNLVQNALQATAAQPGGELLISGELGDRELRLTFHDNGPGIPDEALGHLFDPFFTTKPVGEGTGLGLSISYGIVERHGGRIVASNHPEGGAVFRLTLPRDLPAA >NZ_AP021844|2967016:3023959|2990895_2991612_+|WP_130459832.1|DBSCAN-SWA MRPSQRAADQLRQVRITRRFTRHAEGSVLVEMGDTKVLCTASIEENLPPFLRGKGQGWVTAEYGMLPRSTHTRSSREAAKGKQTGRTQEIQRLIGRSLRAVTDLKALGERQITLDCDVLQADGGTRCASITGAWVALWDACQSLVAAGKLSENPLKEHVAAISVGIYKGTPVLDLDYPEDSDCDTDMNVIMTGSGGLVEVQGTAEGEPFSRQQMNVLLDLAEAGIRQLIHAQETALAD >NZ_AP021844|2967016:3023959|3020022_3020832_-|WP_152090389.1|DBSCAN-SWA MDTLPDNWLALASLVLILGMKHGFDADHLATIDGLTRFNSRRNPGLARYCGTLFSLGHGAIVIAIALAVGTLADVWQVPEWFEWVGGLISIAFLTLLGAVNLRAVLAAAPGEMVRPAGIKGRFLGRLTEAGHPLLVAAVGALFALSFDTLSQAALFAFTATRHGGWEHALGLSLLFMLGMLITDGINGLWISRLIARADTVALVASRVMGLVVSAVSLLVAAFGAAKMLSPAVEAWSDGKELAFGFALVCIIAASFLLALRLARRPLPA >NZ_AP021844|2967016:3023959|2987383_2987992_-|WP_130459829.1|DBSCAN-SWA MSGHLYIVTAPSGAGKTTLVRLLLQNDPAIGLSVSHTTRAPRTGEENGQAYHFTDVADFLARVDRGEFLEWAEVHGNYYGTSRTWIEQQLAAGRDVLLEIDWQGAQQVRKVFGDAIGVFILPPSMEELARRLAGRGTDSEDVIARRLAAARDEMRHVGEFDYVIINNDLQTALSDLLAVVRATRLKLPVQQERHASLFASLL >NZ_AP021844|2967016:3023959|2972310_2972787_-|WP_130459820.1|DBSCAN-SWA MNRLHKLTLAILAASFACLAQAADAPKTMRGADIPAGDPAPEVKAYAGKKPGLQQPIARTYKEQPPVIPHAVDNFDEITLEENQCLTCHGPEKYKEKKAPKIGESHFIDREGKQHAEVTHLRHNCVQCHVPQVDAPPLVENTFVGNIAASKDAKAKKK >NZ_AP021844|2967016:3023959|2992539_2993130_+|WP_014235607.1|DBSCAN-SWA MQKIVLASNNAKKLKELSALLTPLGIQLIPQGELGVPEAEEPHHTFLENALAKARHAAQLTGLPALADDSGLCVKALGGAPGVQSARYAGEPKSDARNNEKLLAALTGVADRRAHFVSLLVLVRHGDDPQPLVAEGEWHGEIIDQYRGEGGFGYDPLFYVPAEKATAAELSAEVKNRLSHRGQAMARLLERLKLEL >NZ_AP021844|2967016:3023959|3023077_3023959_-|WP_014236252.1|protease|DBSCAN-SWA MKRVLIFLATNLAIMVVLGLVINLLGLNRFLTANGLNLPMLLGFAAVMGFGGSFISLWLSKPMAKWSTGAQVIEQPRNPTEAWLVDTVARLAKNAGLPMPEVAIYEGEANAFATGPSRNNSLVAVSTGLLQSMSREEVEAVLAHEVAHVANGDMVTLTLIQGVVNTFVFFLARVVGYLVDSFLRKDNEESSGPGLGYMVTVVVCDIVFGILASMIVMYFSRQREFRADAGAARLMGNNPRPMMAALQRLGGLSPEPLPANMAASGIAGGKGWMSLFSSHPPIEERIAALQQRG >NZ_AP021844|2967016:3023959|3014523_3015012_-|WP_014236245.1|DBSCAN-SWA MSNNAGNPQALQAHKENSIKLTRQALNLAIQRIVGGHPDQVRVGSRLTAASVAQEAGVERSTLYRYHREILSEIQRINDTDAQSKLKIKTSALAKAEERKKEYRELLELEQSNLLKIARENYALQQRIQELEDLLKEKDSLIANLRAENRKVSLISESPREG >NZ_AP021844|2967016:3023959|3007672_3008155_+|WP_083834012.1|DBSCAN-SWA MQTADFSKILNQALSRSNTPADVGVTVHRDGNKKPGDFQQKMLGIRAYRQQLIASNIANSDTPGYRAMDIDVEDAAKQNQMGLLPLAKSSPSHINGSAHWSSPPFNLKYRTPFQASADANTVEMDIERQHFAENAVMYQFTLDQVGGDFKELTELFRNLK >NZ_AP021844|2967016:3023959|3016915_3018586_-|WP_014236247.1|integrase|DBSCAN-SWA MAIRRKRVDRNTHIDLTPEKQIIDLPPGWAFTIRCVHSGAEYHFDFTSHRARGREPLAEQMRDAIWSLRFVSAGKSLMTYFNSGIRCFWHFLDDLEQSGQVVTTLEQVDRLIIMQFVAWLALQTVQHGKNKGLPWSVSACNATYAGIKSILKNRRQHVPESINPNLNFPKNPYPNSNKRIPRREGYSAGEQERIIAACAEDLALFNANPEALSSHQVLAVHAVITVLTCGVNMTPLLEMRRDSLRSFLPDRDLLVLEKRRGYTTRTISLPKNTPEETATPITKMVGDYLRQLQQYTERFVSDADEADRPFVFLCRLADVTYSRRRGNVVRFDEVQVRNALKSFVNRHELFDDRGAPLNLSIARLRPTFALNYYLRHRDLRKLQQALGHSSILLTIQRYIPPVTPEAVRNHAFIGQAMVGWATSRDETLAIRLAADGEIPLRNATELLTGGYNTSIARCRNPFREADQVCGKFLACFRCPSMVVFEDDLYRLYSFYFRLLAERPKIPPHQWMKTFGPVIRTIDEQIAVQFDSAVVAEARQRAQSTPHPAWRNDAPLI >NZ_AP021844|2967016:3023959|2982804_2983596_-|WP_014235617.1|DBSCAN-SWA MSTVLSHLEDGVLTLTLNRPEALNALNLAMIEDLRAATARAEHDEAVGAVVLRGGEHFMAGGDLKWFHSQLALPPAERQALFEQTIAAVHATTLQVRRMGKPVVASVSGAAAGFGLSLMLACDLAVAADNAYFTLAYCHIGLSPDGGATWFLPRAVGAKRAAEIALLGDRFDAAQAREWGLINRVVPAAELEAESAKLARRLAAGPRQALARTKALLQASSGNSLPEQLFAEQGNFAACSVHPDFAEGLGAFLEKRKPAFGQK >NZ_AP021844|2967016:3023959|3013866_3014295_-|WP_043797807.1|DBSCAN-SWA MPSETIAVLGTLSGVLIGGFINYLASRSVKNHEWKLALAKEQSTIRHKLYAEFLVETQRLVVQAREEKISSLADLNMLNGKFAEVSLVASGMVVEAAKKLADYVITSHSAQPAKEVADFFKLKEAFIAVARQDIAEVLSGGA >NZ_AP021844|2967016:3023959|2981906_2982758_-|WP_152090384.1|DBSCAN-SWA MLIDTHCHLDAAEFAPDREAIFQDGVTAGVQAMVVPAVAAATFAEVRACCLAYPGCAPAYGIHPLYTPAAREEDLSTLRRWLAEERDGPLAPLAVGEIGLDLYVPELQQGEALARQQHFFAEQLQLAVEFDLPVILHVRRALDPILKQLRRYRPRGGIAHAFNGSRQQADEFIKLGFKLGFGGAMTFSGSTRIRELAATLPLEALVLETDAPDIPPAFLTAASPDRRNKPAYLPRFAALLAELRGMPTAELIAATGANARAALPGLAALASATPTTTPPTAST >NZ_AP021844|2967016:3023959|2996760_2997930_+|WP_014236230.1|integrase|DBSCAN-SWA MAHDTNPTLREALFMYCERISIHKKGHAQEKYRINLYCRYSIADLPIRNITSVDVATFRDERLAEINARTGRALSPATVRLDLALLSDLFRIAKNEWGICNDNPVANVRKPKLPPGRDRRLAPREERMIMRHCSQRGAHEMKAIVQLALETAMRQGEILGVCWEHINLKSRIVHLPDTKNGSKRDIPLSMEARDILAAQRVKLSGRVFSYTNNGLKSSWRSMIKRLNIPDLHFHDLRHEAISRLMERGVFNLMEVAAISGHKSLSMLKRYTHLRAQRLVRKLDAGANKGKAAVLSYLVPYPAFIEPYESQVKVTFPDFDDLHVAGPCLNSAVQQAQDALLREILVLMRQGRPIPPPNNYLELLDESRLFHLDPLATYDSLADLAEGALV >NZ_AP021844|2967016:3023959|2993274_2994510_+|WP_152090388.1|DBSCAN-SWA MSSRSSSRIIPIAVAGGTRAGGSPLHFTSPPPLSLYIHVPWCVRKCPYCDFNSHEARAENDEAAYVAALVADLESALPSVWGRKVSTIFIGGGTPSLLSGEALHELLNAVRMRLPLLPEAEVTLEANPGTAEAGKFAAFRAAGVNRLSLGIQSFNDRHLEALGRIHDSAEARAAIELAKAHFERFNLDLMYGLPQQSQAEAMADLEMALSFAPPHLSCYQLTLEPNTLFAARPPQLPEGDTCADMQDAIEARLAAAGYVHYETSAFARPDYQCRHNLNYWTFGDYLGIGAGAHGKLTLPDHSGFSVQRQMRWKQPKQYLEQVAAGQPVQEQHGVGADELPFEFLMNALRLNQGFDPALFEQRTGLPLLLVRGELEKAAREGLLTLAPDCIAPTERGRRFLNALLERFLPDA >NZ_AP021844|2967016:3023959|2984886_2987118_-|WP_130459828.1|DBSCAN-SWA MDTATDPAPAKPSASSAAPFAPAPDPAAPPTPYPFNDDPAYRVFLDSLDYLKPEEIAKIKEAFAFGEAAHRGQKRLSGEPYITHPLAVAGAIAEWRLDSTAIIAALLHDTMEDTGISKEELTERFGKGVADLVDGLSKLDKIEFSSYQEAQAENFRKMLLAMAKDLRVILIKLTDRLHNMQTLGCMRPDKRRRIALETLEIYAPIANRLGLNTVYRELQDLSFKHTHPMRYQVLLKAVMAARGNRREVLSKILDGVQSKMRDSGIEAQVFGREKSLYSIYRKMVEKRLSFSQVLDIYGFRVVVKDVPSCYLGLGALHALYKPLPGKFKDYIAIPKANGYQSLHTTLIGPYGMPVEVQLRTEEMHHMAQEGVASHWLYKDTEKSAAELQYQTHRWLQSLLELQSTAGDSAEFFEHVKIDLFPDEVYVFSPKGKIFSLPKGATPVDFAYAVHTDVGNRCVAAKINYELMPLRSELNSGDQVEIVTAAHANPNPAWLSYVKTGRARSKIRHFLKTRQHEESAALGERLLNQELFGLGITPSELPDASWEAVLKEGGSKSVKEVYTDIGLGKRLAAVVARRLLAHEAALPNAEPAPHTSVVIRGTEGMAIQLAHCCRPIPGDPIIGSIKKGQGLVVHTHDCAVIRKSRSAEPQRWIDVEWEPEPGKLFDVDIHVAARNARGVLAKVATEIAESGSNIEKVSMAPDPGFYTTLNFTVQVANRAHLARVLRAVRLIPEVVRITRERQEE >NZ_AP021844|2967016:3023959|2989974_2990886_+|WP_014235610.1|DBSCAN-SWA MKFTIYQESRIGKRQNNEDRIAYCYSREAVLMVVADGMGGHYHGEVASQIAVQTLTSAFQRDAQPEIADPFLFLQKGMTNAHHAILDYSQEHRLKDSPRTTCVACLIQDNIAYWAHVGDSRLYHMRDGKVLAVTRDHSRVRLLMDEGLISEAQAATHPDRNKVYSCLGGENPPEIEFSRKTPLEVGDVLVLCTDGLWGPLPADVMAASLKGANLMQAVPMLLNQAEIRSGPYGDNLSVVAVRWEQSYSEEASSTVMTQTMPLDAVTTKLGEFGRDPAYKTDLSDDEIEKAIDEIRAAIQKFSK >NZ_AP021844|2967016:3023959|2967016_2967958_-|WP_152090375.1|transposase|DBSCAN-SWA MGASILSAPYFHNEEAAYEFVESRLWPSGPVCPHCGCVERISKMGGKSTRIGAYKCYNCRKPFTVKVGTIFESSHIPMRLWLQAIFLISSSKKGISANQLHRTLGITLKSAWFMSHRIREAMRSGDFSPFGSEGGPVEVDETFIGRDYTKKPKGEKKGRGYDHKNKVLSLVDRTSGQARSMVVDDLKAKTLIPILEANIAREARIMTDEAGQYKNVGQHFAGHAFTRHGMGEYVSKIDPTIHTNTIEGFFSIFKRGMKGVYQHCGHHHLNRYLAEFDFRYNNRKALGIEDQERAEKLLQGVKGKRLTYETTAQ >NZ_AP021844|2967016:3023959|2968030_2968231_-|WP_014235630.1|DBSCAN-SWA MNQMPLVPPGRDTLPRIERLLRRELALFFTLCALLGAFLAYGALRTLASPSAPPHPVLQARADVRP >NZ_AP021844|2967016:3023959|3008250_3009189_+|WP_014236239.1|DBSCAN-SWA MTTWIALITSLPTENATARMRAWRSLKASGAAVLRDGVYLMPEREDCRNTLDAVAADVRAAEGTALVVRLEEPSDGNFVVFFDRSADFATLLGEIATARDTLGPDTVNEALKQARKLRKAFSNLVAIDFFPGEAQKQADEALRDLEQRAAWALSPDEPHPVNDAISRLSIQDYQKRRWATRRRPWVDRLASAWLIRRYIDPQAELLWLATPADCPAEALGFDFDGATFTHVGARVTFEVLLASFGLETPALQRIGTLVHFLDVGGVQPLEAVGIESTLAGLRDTILDDDQLLALAGSIFDGLLASFEKGSKS >NZ_AP021844|2967016:3023959|2988008_2988875_-|WP_152090385.1|DBSCAN-SWA MIYSMTGYAAKTREVAGGSLHLELRSVNSRFLDIHFRIVDDLRVLEPALREAITAKLARGKVELRLNLVASQSQNRQLAINADLLTQLQALEGQVRQTLPNAAALSVAEVLRWPGMLGEPEVDTAALHAAVQATLKEALEDFTASRAREGAKLAAMIQERVDKIRATVAAVAPLIPQAQAAYQDKLKQRLVEALGSADDERVRQEVVLYATRIDVDEELSRLQAHLTEVERILKAGGNAGKRLDFLMQELNREANTLGSKSVLSEVSKASMDLKLLIEQMREQIQNIE >NZ_AP021844|2967016:3023959|3010938_3011394_+|WP_014236242.1|DBSCAN-SWA MKWITRERPKIDRIACPWLISRFVDESPEFLYVPAGEVMRIAAETGATPYDVPNTELGHHGDQCSFDAFIGKYKLEDAALNKLALIVRGADCGQPQLAKEAAGLLAISKGLSLNFSDDHEMLAHGMVIYDALYAWCADTPLKKIGRFLGLK >NZ_AP021844|2967016:3023959|2999807_3001040_+|WP_014236232.1|DBSCAN-SWA MNRLLPLILVPLLLTACGNDTPPSAVVAAEKASAAEEYERGPHRGRMLRQGDFALEVTIYETNVPPQYRLYAYQNGKPLPPASVQAAIQLKRLDGEFNNFTFTPEKDYLNGSSEVIEPHSFDVEVKAQHAGQSYSWAFPSYEGRTTIPAAAANDAGVKVEKAGPTTIRNTVRLMGAVMVDANRRAEIKARFPGIVRAVNVQEGQRVSRGQTLVAIEGNDSMRTYSVVAPFDGIVLARNTNVGDVAGSNTLVELADLSSVWVELRALGGDAEKLSVGQEVEISSATGGSRVTGKIQTLLPLASGQSVVARASIANPEGRWRPGMAVSADVTVAARQVPLAVKESGLQRFRDFTVVFTQVGDTYEVRMLELGERDGRYAEVLGGLKQGATYVAEQSFLIKADIEKSGASHDH >NZ_AP021844|2967016:3023959|2998560_2999811_+|WP_014236231.1|DBSCAN-SWA MNLRCLLVLAVAGTCGIPLSGYAAESLRLEEAVSRALASHPSLAAEAAQLKAVQARAQREGLATPFMIGADVENVGGTGAFRGGQSAETTLRIGRVIELGGKREARQALGSAEINQQQNLSEATRLDVISRTSLRFISVLADQQRLKYAQEQVGQAERTRREVANWVAAARNPESDLRAAEIAVADAELERTRAEHKLTSARLTLASSWGVLTPDFETAAGNLLVLPKAESLDTLVARLPMTPEQRAALLEADSIAARKRLAEAGAKPDVTVNLGVRRLEATSDQALMMSVSIPLGNQVRSGLSVAEANAQLMALEARRDAQRFEHYQSLFGKYQELNQARTEAETLQKHMLPKAEEALAFTRRGFEAGRFSFLALAQAQKTLFELRQRAVDAAARCQILMTEVERLTAIAPEPTP >NZ_AP021844|2967016:3023959|3018585_3019710_-|WP_014236248.1|integrase|DBSCAN-SWA MHLFLTDPTFTVHGQPYPGIPFLIDDQMALVTAPNEYLFHVAVVRGRTSSPKTWQTYGNHLYEFFSFLETIEISWDRIRSTHLAAWRDSMLSRGCRRSTVNHRVGAVSNFYKWALRSQKISSLPFYMEDVRVSRPKGFLAHIDAKGNQVQANELKLRSFKPQPKFLTADKARLFVTALTPRRNRLMAYLMWFSGLRREEVATLELEDIPNPSGHPPGKPLAMTLRTTKGRKPRWVSVPYDLAVQLWNYVIYERSSLAKRYRKRHVGPESNRVFLTAYGEPISLGGINNAFKKASEKSGIKCTPHMLRHTYGTYEFIRMSQFKSKESALFWVMERMGHSSLATTQIYIHTASLVDHTALDGYQADICKLLQEDCV >NZ_AP021844|2967016:3023959|3005674_3005941_+|WP_043797801.1|DBSCAN-SWA MKNKTFLSLSLLVGSFMSLSSVAYAHGVHEDSAEPKATPTACRHLTDTEHYVVDLKDPATRALKTRCDATKKPVTPVAEKKDETPDKK >NZ_AP021844|2967016:3023959|3004244_3004949_+|WP_050804349.1|DBSCAN-SWA MPFRQFSATGICRWTVALLASLLPLWAFAHGVTGEDQSFLEQNTGRNLLLFAYLGAKHMVTGYDHLLFLFGVVFFLYRMRDVSIYVTLFAVGHSVTLLLGVLGGFHVNPYVVDAIIGVSVVYKALDNLGAFKHWLGFQPNTKAAVLVFGFFHGFGLATKLQDFSLSRDGLVPNMLAFNVGVELGQLLALAGILIVMGFWRRSTAFSRQAFTANTALMAAGFVLVGYQLTGYFVS |
53 | Bacillus_phage(28.57%) | protease,transposase,integrase | attL 2991252:2991270|attR 3028742:3028760 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
| Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
|---|
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_1 | 11331-11433 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_1
|
1 spacers
spacers of NZ_AP021845_1
>1.1|11367|30|NZ_AP021845|CRISPRCasFinder ATCTCAACCCGCTCTAGGATTCGGCTGACT |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_1
The CRISPR arrays of NZ_AP021845_1 >merge|NZ_AP021845|1|11331-11433|CRISPRCasFinder GCAGGATATACCCCTCATTCGATGGGTGGTTACAGGATCTCAACCCGCTCTAGGATTCGGCTGACTGCTGGATGCACCCCAGATTTGCGGGGTGGTTACAGGT >NZ_AP021845|1|1|11331-11433|CRISPRCasFinder GCAGGATATACCCCTCATTCGATGGGTGGTTACAGG ATCTCAACCCGCTCTAGGATTCGGCTGACT GCTGGATGCACCCCAGATTTGCGGGGTGGTTACAGGT
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_2 | 11530-11698 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_2
|
2 spacers
spacers of NZ_AP021845_2
>2.1|11566|30|NZ_AP021845|CRISPRCasFinder CTGATATTGACAAGGCCTTGGCAGTTGTCG >2.2|11632|30|NZ_AP021845|CRISPRCasFinder GTCAGGAGTATCAGCCAGACAATGAACTTG |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_2
The CRISPR arrays of NZ_AP021845_2 >merge|NZ_AP021845|2|11530-11698|CRISPRCasFinder GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGGCTGATATTGACAAGGCCTTGGCAGTTGTCGGCTGGATGTACCCCACATCTGAGGGGTGGTTACAGGGTCAGGAGTATCAGCCAGACAATGAACTTGGCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG >NZ_AP021845|2|2|11530-11698|CRISPRCasFinder GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGG CTGATATTGACAAGGCCTTGGCAGTTGTCG GCTGGATGTACCCCACATCTGAGGGGTGGTTACAGG GTCAGGAGTATCAGCCAGACAATGAACTTG GCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_3 | 12193-12294 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_3
|
1 spacers
spacers of NZ_AP021845_3
>3.1|12229|30|NZ_AP021845|CRISPRCasFinder GAATATCGGTTCTGCGGTCGCAGATTGGCC |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_3
The CRISPR arrays of NZ_AP021845_3 >merge|NZ_AP021845|3|12193-12294|CRISPRCasFinder GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGGGAATATCGGTTCTGCGGTCGCAGATTGGCCGTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG >NZ_AP021845|3|3|12193-12294|CRISPRCasFinder GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG GAATATCGGTTCTGCGGTCGCAGATTGGCC GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_4 | 307466-307630 | Orphan |
NA
Consensus repeat of NZ_AP021845_4
|
3 spacers
spacers of NZ_AP021845_4
>4.1|307483|31|NZ_AP021845|PILER-CR AAACACGCTGACGGGGAGGGAGGCGCATGGG >4.2|307531|43|NZ_AP021845|PILER-CR TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA >4.3|307591|23|NZ_AP021845|PILER-CR TCGCTGACATCCCTGGAATCCGA |
CRISPR arrays and Neighbor proteins around NZ_AP021845_4
The CRISPR arrays of NZ_AP021845_4 >merge|NZ_AP021845|4|307466-307630|PILER-CR TTGCACGTCGACGTGCAAAACACGCTGACGGGGAGGGAGGCGCATGGGCTGCACGTTGACGTGCATTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGATTGCACGTCAACGTGCATCGCTGACATCCCTGGAATCCGATTGCACGTCAACGTGCA >NZ_AP021845|4|1|307466-307630|PILER-CR TTGCACGTCGACGTGCA AAACACGCTGACGGGGAGGGAGGCGCATGGG CTGCACGTTGACGTGCA TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA TTGCACGTCAACGTGCA TCGCTGACATCCCTGGAATCCGA TTGCACGTCAACGTGCA
>NZ_AP021845.1|WP_152091267.1|305586_306732_-|hypothetical-protein MQAIIAAGVAADNPLAPIFARKATLAKMAECSEVTVYRAMRQLEDAGWISRSEQVRLDDGSMDIGLLSITKKLATLVGLCLDEEVHGSAEESKRSQDIDSEPITVNNKKHTLTQNIAGREGAPTAAALVGNGPKQDGSDAKLCTQMKDGLIAGPIYRGEQRVDPKASVNYQSTRPGFVRIDGRSVAQELVWLIEEKRLTFGALFQLQTLAKQVPGQTLSDFVAYRSERIKQLTTTNDCYRYLKKLISDGIDARYLCAQRAKKEHRVMRRQQRDKAASSRAAWCRARHEMTFLNTQTGVTYRINANHELLEVGENGLPTSRPNLAITSKFIKAVEEGRLVPFRIQEPVINLELGNRRLDEMASKFPWLRRKGKGSPVEEVRA >NZ_AP021845.1|WP_152091266.1|305013_305346_-|hypothetical-protein MSSAVLQGAQVPLVAEFHQGTQVLIRWALWHKQHAYPKAILIGVFRKEDIRLVVYQYGKALCVAVKMADVAPKIRDWREWRQRYATVKGKLIQRGFALVDEREEVSHVAH >NZ_AP021845.1|WP_152091265.1|304361_305027_-|hypothetical-protein MSLTDFDLFGHPVAAPQNVTPPRRRVVSQAAKRAKLARHLEKSNNLVLFDELLDFLKAPLLKQQYDMDATFASDVGTVAVIEVDEDGNTQDEDSALVIPYEAWGEEWVTDSNGLAWSKEGLLFTQVRLFWRSMEELALNNNEQEKWSVLRWIFRPAIWKHYVYDKRIGRSHCFEVHERDETFSFHNCCIAARVDEDTVREGVRRNIPAEVVKAVEKVCKFD >NZ_AP021845.1|WP_152091264.1|303146_304286_-|AAA-family-ATPase MFDLSAILVQVSYSLCEVFGLDPNEVDPSITVDGFEIPDPSTLTDPVAQQYAAFLRAAVPPIDPYYQFRKDLVRDIRYWWLTGEGDVLLLWGPTGSGKTSVFEQWCARLGIPLFMAKGHRRFEPMEAFGQFVGGENGTTPWVDGPVTLAARYGLPCIINEYDRIAADRTIVFNDVFEGRSFPIPGKSGEVVTPQPGFRVAITANTNLVEDLSGNYGTANTHDISILERIVALHVGYPSDDTEAKLLEKELEQFSDDLLSYWFDQEGIKISTPQGMKEGSAINRGEFIQGLLEVAKKIRAQSKDGGNTSDSALERTMSTRILRKWARHSVAQASAPEKLGLSALHLALKKYLSSLSTESTRIALHQAVENVFGVGEVVKP >NZ_AP021845.1|WP_152091263.1|302680_303136_-|hypothetical-protein MSNNKFVGVAYWSVATQVADWLARAVDVLMPLERAGIAVLAYDCLPGGNQLTVTFGEERHQMLVTDGKGRGVVRASEAAAVFVVCLVALRKALGDISVVTDSQETVPALPRQNYPLYADSWQRVLPVAQELGLATGAAFTARHNAVLNNVF >NZ_AP021845.1|WP_152091262.1|301730_302669_-|hypothetical-protein MSAHIIDKLLFLVIIVAAARVLWRFYPYRQVVVEEDSREVVAPEVQTQPEVVQQQMDAAPVSSEREQVTASPLFVRHLNDMATLMVRYRHRKQACTAHLIVWSGSKRKGYSDSYFDLGLIEGQELTEQVIEQSLALAKQQLVDLAEKGKRKRRESKKQKQEAAAVAEVVVAEAAVSEVVVAEVAEAPADVAIVAATEVVEQKAEVPPLVVEDTPPESIKLRKFPSVYRGIIKEIGMMTQNKDGREFETFGVRFETQEGIVDAVFGVNLREALRDAKADVGDQVEILKIGRKTITKGKAPMNLFKIAKLECTA >NZ_AP021845.1|WP_152091261.1|300765_301308_-|single-stranded-DNA-binding-protein MASVNKVILVGNVGADPETRYMPNGDAVCNLRLATTESWKDKNSGEKRELTEWHRIVCYRKLAEITSQYVKKGSQLYLEGRIKTRKWQDKDGQDRYTTEIEMTEMQMLGNRRSGDSDGSGDDRPPRQHSAGGGGNGEPPARERRPAPAYDPMEDDIPFCRVDMNADPAFVKASARVRRVA >NZ_AP021845.1|WP_152091260.1|300423_300684_-|hypothetical-protein MYGAPYLGYGCEIVHSFNTLIDAIKRRQVLRPRSKLAINLAGGDIEVNLLPNGFVELDGKVQPVAVEKEIEDAMQQFGAVLKEVLV >NZ_AP021845.1|WP_152091259.1|300076_300421_-|KTSC-domain-containing-protein MHPNFTPVSSSNIDGYLYMPDRKILLIAFKSGGTYAYEDVEQPVATGFAQASSKGKFFRSDIKDRYATSKLDDMAVANLLGGMGASVPPQPRRKAPRVTLQSLLSRYPMLNAVF >NZ_AP021845.1|WP_152091258.1|298723_299683_-|hypothetical-protein MQKDQSSLAGTLPANRRTIVAMPDPILGSLRWPNRPKLPEGNPCWTYMVEGTREQYAVAVGHVENGRPHPFEVWVLANEQPRCLGAMAKTLSADMRTQDRAWLTRKLEVLASVTGDLAIDLPLGTERLLASSNTAAVARIVQYRLNQLGVANPEEGEATPLVDALLPVRDAGHEGTLSWTADIKNPSSGDDFTVFLPEVQTEDGQHRPVAVRLSGRYPRDLDGLAAILTMDMAVVDVAWIGMKLRKLLDYDEPMGSFLAKTPGTGKTERYSSIVAYLARLILHRYATLGLLTASGYPVVEMGVMVSVPGDASNVVPIAA >NZ_AP021845.1|WP_152091268.1|307728_308742_-|ParB-N-terminal-domain-containing-protein MKQRVIKRPDMPLTALGAPAPGADTSTDKGELPAVSATTQPPSPPLLHLAGAEEVVDIPVSKLRVSPCNARKIRLPKRVSKIAESLKNNGQKDPLYVYPGAGDDEGYFMVLGGETRRLGALQIALPTLKAFVDRKVDPTDALNLTKISNILNDSADECDLDRGMVAIDLLEKGHTQGEVAEVLELESHTHVQRLIKLAGLPKRFIDFGQDYPERFSASLGAYISQAIDRHGEDFAHDLLKAALVDELPHRKIAKAIEAGPSDKQPGQERGKRLRRDGGFDIPTPDAPGGRYDVYKSKTPGLKVLKLQVEVPDELAKDLNEKLTEVLTQFIQTSRDQQ >NZ_AP021845.1|WP_152091269.1|308759_309899_-|AAA-family-ATPase MSDLRPTYAYIGAVEHRLKSAAALLGVSENTLRTTLAESGIEVRRANQDNPNAPAVRLFDLPTIFQIAEYRRAKKLTKGPEGKKPIVIAIEIIKGGTGKTTTAAEVAVQLQLQGLKVLGIDIDIQANFTQLMGYEADLTEDEAAMYGLTEEAIVNGTFATICGPFIERNGRPVDAKAIIKYPFGPSGPAIIPADTFFSDLEHDISKTGGKRELVFQKFFKESLAGNVPGLNVGDFDVVLFDCPPNISFVATNALASADIVIAPVKMESFSVKGLSRLIGEVHTLKAEYGGEVKDPELVILPTYYSTNLPRVGRMQEKLAQYRANTSPVSISQSEEFPKSTDNYMPLTVIKPTCQPVKEYRMFVDHLIKRINEVSKARAS >NZ_AP021845.1|WP_152091270.1|309895_310324_-|hypothetical-protein MNAFGRDGDSRQKEHSNVNYRSRDGRETQGSLSSATRVPVPRGHAPHPIFPPSWSIPTRKPQFTAVFHYCGFPEVAHTLANLETPGPEGAENFVQKKMAECGYMATVGRVKTRKAAFCLPMRLTNNTRAKIMREGNHKDEME >NZ_AP021845.1|WP_152091271.1|310495_311095_+|hypothetical-protein MSESTTPKRSRHSYRERIQEVIQERIALGKPLTHRDILKEAGGGSASTVVEELAKAERSTPATLIGRGAKSLPQRIAALEDALNASLAREKVLEAENQALRESLTSARADVDKLLAGHQDSQRMLLQGVDDLRQMVKAGQGGMASAVIATERQKAAGDDTGDGILWKARHDQLLQRFVALDAKNRKMSSQLHELGVDVD >NZ_AP021845.1|WP_152091272.1|311091_312549_-|RepB-family-plasmid-replication-initiator-protein MRQHQPEQRNLFPTEDLIVPESLQKMRKAVAAIHAIPRNPEDSQNLTNRRVFDGLIIVAQIHCRQRGKEFIQRIRDERVSPLFEVRTSELGKLSGIPGKNYDRIIEEISRIYEMDFEFNVCAEDGETIWENRARLLSSLGVGKNHKRGYIRFAMDPEMLILLLEPNLWASFSLSVMHDLGTSAAYALFQQTYRYINTNQKLTAALPTKTWIELLVGKNRYVKDIDGKEVINYGEFKRRVLNDAIEKVNEVPALTYNIELKEHRQGNRVARLQFKFIPKEPTLQLESTWPEDILTVLKSIGFLDKEITDISQAHSSASVADAICRLKEAEQRLKSEGKAISSRKPYFLGILRNIAAGEDDIDPEKIEAEVRIEMAERAAEERKKKMQDAYDEHRRKRFSAWVTSLSVEDRKQLIADYEASEDFNPVLGKSLKKILTEENRSGLSTLRVWMEKHRSETLAGVFNTPEYQSLEGWMMWKLSGDDAIEA >NZ_AP021845.1|WP_152091273.1|313182_313680_+|hypothetical-protein MMDDDEIKDRQLNALVGGAVRGLVESGADDEAIGAFASTYRQKAAKLLGRPQEALDPPDLTDIIKDAVAQALAAAQVKPKKSRKQNEHFTVSIGGQKTSVTIHKDVIAQLAEAKGSKAEVSRFVREVAKDVPDSVENKSEWIEHRIATIMRFKSESAGNGSSARH >NZ_AP021845.1|WP_152091274.1|313676_314798_-|restriction-endonuclease MFGPKKQTAAAKPNGVDNLIHKLKALPPAIDLIVAATLGGYSFVAIYVDNAKFVGLACILIALIFAVLGISAVTREMKDFKVVEQNSNPEALRMMKTQQFENYLVALFRLDGYQVRPSIDELHRQDDADLIAVKKKETILIQYNHWDEDIVGTKPIQSLHKAAAAVRAQGATAISFGRFSAEAADWARRKGVTLMTMQDVIGMACRLTGLTPEEAAAEPDEEVVVEKAHEVAEVVRGHHRFLFVDFAGLEHGLARLSELLLQHPAYQVIASTLPPLKSMEDIRLSLGECGDRLAGDLEAAQDGRYFAIQKHLQASREGKHAIWLAVDSEPRQFPEGCAELIAVNRAFGFDVSASQRLIEAMVIIDRRSIAGAG >NZ_AP021845.1|WP_152091275.1|314875_315796_-|recombination-associated-protein-RdgC MFFRNLTLYRLPTPWNMDLAKLEEMLARNPFTRCSGSEQQRSGWISPRDKGSLVYAQNRQWLIALCTEQRLLPSSVIQDEVRERAEALEEQQGYAPGRKQLRELKDRVAEELLPRAFTRRRTTFVWIDPVNGWLAVDASALSKAEEVLEQLRMVLDDFPLSLVHTKLSHSSAMADWLAGGEAPVNFTVDRDCELKAVGEEKAAVRYVRHPLDGDGIASEIKAHLAAGKLPTRLALTWNDRISFVLTERLEIKRLGFLDLLMEEAEKNTEHAEEKFDADFALMTGELARFIPSLLDALGGEVVEDRA >NZ_AP021845.1|WP_152091276.1|315816_317793_-|DEAD/DEAH-box-helicase MQPPPDNERNAVEQANGQPLLDRLKRLGVTAWREPLLCLPKLFQDYSSISTLKQALPQNDVVAGPKLFTLLVSEKAVVLSQPKKRLVMTATDGMLSVKIVIFVVLGVDVPTWKAFEEGDKIHLRGVLQNWNGKLQITGPTLIDPQLVGKVIPIYEKRRSVVADGAIYDATRYALEHHLKETIDYLVESYHGLPEADILRRARLKAPSVEVILRAAHQPTSEDEGMRGIAGMRRLAALSVVENARRLKQRDPVPESVVSIPDTLIQQLTEKLPYPLTGDQRRSIGEIVADMASPLPMRRVLSGDVGSGKTLPIMIAALATQHLGHRAVILTPNGLLADQFVKECKALFGEDSLVISVTSGTKKLDLASNPILVGTTALLSRLKGESPPALFCVDEEQKMSVSQKIELTGFASNYLQATATPIPRTTALITHGAMDVSVLKEMPVVKNITTHIVTAGERKRLFDHTRKVLASGGQIAIVYPIVNDDEQEKKSVVAAAVEWEKQFPGLVGMVHGQMKEAEKVAAVNGLKSGNQKIAVVSSVIEIGLTLPSLRSLIVVHAERYGTSTLHQLRGRVARLGGNGYFFLFLPETVAPETMQRLQLLVDHSDGFTLSEKDAELRGYGDLFEDAERQSGNSRSTIFRCVDLTPSEIHAATIHEALPS >NZ_AP021845.1|WP_152091277.1|317776_318742_-|hypothetical-protein MTTTNLATAKEVATAVASLGFQASSETLGAISRNCREDFLEHLSRCITDQDQDGRSKKFIGNLLRCLAPNTINRIKPIFPDATIDMIVPVAKAVPTRFLSAIDAAHDPKHARHEDAKAYLASIFAPPDTHSEEEPPQSSLQQQHDQQEERPVDQAALSRRLAPSGSKKYHSVHVYGSNAALCFNATDWNGAPGVMVDAAMQTGPKTYDWKNAVHVWLDINEVGAVLAVFRRWRKGVEFSAHGAQNDKGFAIEFQGQHFFAKVTAKKAAAGAVRAVKILPSDAMSVSILFLTQLAESYPMIPLNELLATVRATHQIEDAAAA |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
|---|
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_1 | 1.1|11367|30|NZ_AP021845|CRISPRCasFinder | 11367-11396 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11367-11396 | 0 | 1.0 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11566-11595 | 0 | 1.0 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11632-11661 | 0 | 1.0 |
| NZ_AP021845_3 | 3.1|12229|30|NZ_AP021845|CRISPRCasFinder | 12229-12258 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 12229-12258 | 0 | 1.0 |
| NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307483-307513 | 0 | 1.0 |
| NZ_AP021845_4 | 4.2|307531|43|NZ_AP021845|PILER-CR | 307531-307573 | 43 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307531-307573 | 0 | 1.0 |
| NZ_AP021845_4 | 4.3|307591|23|NZ_AP021845|PILER-CR | 307591-307613 | 23 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307591-307613 | 0 | 1.0 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NC_018141 | Legionella pneumophila subsp. pneumophila plasmid pLELO, complete sequence | 116661-116690 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP025492 | Legionella sainthelensi strain LA01-117 plasmid pLA01-117_150k, complete sequence | 38691-38720 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP021284 | Legionella pneumophila subsp. pneumophila strain Allentown 1 (D-7475) plasmid unnamed1, complete sequence | 109995-110024 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP011106 | Legionella pneumophila strain L10-023 isolate Ulm plasmid unnamed, complete sequence | 106875-106904 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP045305 | Legionella longbeachae strain B1445CHC plasmid pB1445CHC_150k, complete sequence | 37393-37422 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP042253 | Legionella longbeachae strain B3526CHC plasmid pB3526CHC_150k, complete sequence | 39581-39610 | 6 | 0.8 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KM389300 | UNVERIFIED: Escherichia phage CBA6 clone ctg7180000000096 genomic sequence | 16175-16204 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KC139562 | Salmonella phage FSL SP-029 hypothetical protein gene, partial cds; hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, and hypothetical proteins genes, complete cds; and DNA topoisomerase 2 gene, partial cds | 2138-2167 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KC139523 | Salmonella phage FSL SP-063 hypothetical protein genes, complete cds; tRNA-Met, tRNA-Trp, tRNA-Asn, tRNA-OTHER, tRNA-Ser, and tRNA-OTHER genes, complete sequence; and hypothetical proteins, DNA polymerase, hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, hypothetical proteins, putative tail fibre, hypothetical proteins, DNA topoisomerase IIs, hypothetical proteins, exonuclease A, hypothetical proteins, deoxycytidylate deaminase, hypothetical proteins, head completion protein, putative tail tube associated base plate protein, baseplate wedge subunit, hypothetical proteins, loader of T4-like helicase, hypothetical proteins, putative membrane protein, DNA ligase, hypothetical proteins, helicase, hypothetical protein, RecA-like recombination protein, hypothetical protein, putative dUTP diphosphatase, hypothetical protein, thymidylate synthase, hypothetical proteins, DNA end protector protein, baseplate tail tube, single-stranded DNA binding protein, hypothetical proteins, regulatory protein FmdB, hypothetical proteins, and base plate hub subunit genes, complete cds | 20576-20605 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | FQ312032 | Salmonella phage Vi01 complete sequence | 154723-154752 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NC_015296 | Salmonella phage Vi01, complete genome | 154723-154752 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NC_023856 | Salmonella phage vB_SalM_SJ2, complete genome | 80074-80103 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | MH427377 | Escherichia phage vB_EcoM Sa157lw, complete genome | 129521-129550 | 7 | 0.767 |
| NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_CP012399 | Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence | 220195-220225 | 7 | 0.774 |
| NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_CP018096 | Chelatococcus daeguensis strain TAD1 plasmid pTAD1, complete sequence | 197951-197981 | 7 | 0.774 |
1. spacer 1.1|11367|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
atctcaacccgctctaggattcggctgact CRISPR spacer atctcaacccgctctaggattcggctgact Protospacer ******************************
2. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
ctgatattgacaaggccttggcagttgtcg CRISPR spacer ctgatattgacaaggccttggcagttgtcg Protospacer ******************************
3. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
gtcaggagtatcagccagacaatgaacttg CRISPR spacer gtcaggagtatcagccagacaatgaacttg Protospacer ******************************
4. spacer 3.1|12229|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
gaatatcggttctgcggtcgcagattggcc CRISPR spacer gaatatcggttctgcggtcgcagattggcc Protospacer ******************************
5. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
aaacacgctgacggggagggaggcgcatggg CRISPR spacer aaacacgctgacggggagggaggcgcatggg Protospacer *******************************
6. spacer 4.2|307531|43|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
ttgccgagcaccaaggggtgatgcgagaggagggctgtgcgga CRISPR spacer ttgccgagcaccaaggggtgatgcgagaggagggctgtgcgga Protospacer *******************************************
7. spacer 4.3|307591|23|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
tcgctgacatccctggaatccga CRISPR spacer tcgctgacatccctggaatccga Protospacer ***********************
8. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NC_018141 (Legionella pneumophila subsp. pneumophila plasmid pLELO, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
9. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP025492 (Legionella sainthelensi strain LA01-117 plasmid pLA01-117_150k, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
10. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP021284 (Legionella pneumophila subsp. pneumophila strain Allentown 1 (D-7475) plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
11. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP011106 (Legionella pneumophila strain L10-023 isolate Ulm plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
12. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP045305 (Legionella longbeachae strain B1445CHC plasmid pB1445CHC_150k, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
13. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP042253 (Legionella longbeachae strain B3526CHC plasmid pB3526CHC_150k, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
14. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KM389300 (UNVERIFIED: Escherichia phage CBA6 clone ctg7180000000096 genomic sequence) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
15. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KC139562 (Salmonella phage FSL SP-029 hypothetical protein gene, partial cds; hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, and hypothetical proteins genes, complete cds; and DNA topoisomerase 2 gene, partial cds) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
16. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KC139523 (Salmonella phage FSL SP-063 hypothetical protein genes, complete cds; tRNA-Met, tRNA-Trp, tRNA-Asn, tRNA-OTHER, tRNA-Ser, and tRNA-OTHER genes, complete sequence; and hypothetical proteins, DNA polymerase, hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, hypothetical proteins, putative tail fibre, hypothetical proteins, DNA topoisomerase IIs, hypothetical proteins, exonuclease A, hypothetical proteins, deoxycytidylate deaminase, hypothetical proteins, head completion protein, putative tail tube associated base plate protein, baseplate wedge subunit, hypothetical proteins, loader of T4-like helicase, hypothetical proteins, putative membrane protein, DNA ligase, hypothetical proteins, helicase, hypothetical protein, RecA-like recombination protein, hypothetical protein, putative dUTP diphosphatase, hypothetical protein, thymidylate synthase, hypothetical proteins, DNA end protector protein, baseplate tail tube, single-stranded DNA binding protein, hypothetical proteins, regulatory protein FmdB, hypothetical proteins, and base plate hub subunit genes, complete cds) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
17. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to FQ312032 (Salmonella phage Vi01 complete sequence) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
18. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NC_015296 (Salmonella phage Vi01, complete genome) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
19. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NC_023856 (Salmonella phage vB_SalM_SJ2, complete genome) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
20. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to MH427377 (Escherichia phage vB_EcoM Sa157lw, complete genome) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
21. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_CP012399 (Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence) position: , mismatch: 7, identity: 0.774
aaacacgctgacggggagggaggcgcatggg CRISPR spacer gatcgcgctgacgggacgggaggcgcatacg Protospacer .* *.**********. ***********. *
22. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_CP018096 (Chelatococcus daeguensis strain TAD1 plasmid pTAD1, complete sequence) position: , mismatch: 7, identity: 0.774
aaacacgctgacggggagggaggcgcatggg CRISPR spacer gatcgcgctgacgggacgggaggcgcatacg Protospacer .* *.**********. ***********. *
| Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DBSCAN-SWA_1 |
72042 : 78469
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021845|72042:78469|DBSCAN-SWA CTCATGCAGAGGCTCCTTCTGTGGCTCCCGCCCGTCGCTCGCGGATCGACGTCCTGACCCAGGTCTCAGCCTCTTCGGGGCTTCCAAGAGAGCCGGTTTCTCGCCGGCGCCGCCTCATTGCTGCGGCTTGCCTCGAGGTGATTTCCACCGCACCGGACTTGATCGCCAGGCTCCGCTTAGCCAGGCAGATGTCGTAGTGATCCCCCTGGTACCAGCGCCGGGCGACACCAATGCAGTCGGCCATTGCATGTACCTCGGCTTCCGAGTCCCCGAGCATGTGACACATGACCATTCGTCCATAACCGGCCCGCATGTTGTCGACGTAGACGGTCATTGAAGACCCTCCGCAATCGCTATGCAGGCCTGCCAATCATTTGAATAAGGCCAGCGGGGTCCAATTTTGCCGTCTCTCATCTGGAATTCACCGAATTTGGCCAGGAATGACCAGACATCATGAGGAGTCTTGTTAGCGACATCGGCGCGGCGATAGTCGCGGAATAGATCGTCAAAACCTTCAATAAGGACGCGACGTATGCTCGAATAATCACAGGAATTGATGTCGTGACGAATAGCTGCAAGGTAATAGGCCAGTGCCTTTGCCCCGTTTGCGCCGGAAGTACGGCAACCGACCTCATGGTATGACCCAAGCCACCACATGGCTTTCGAATCGCCATTGCGCGCCCAGAGTCCGATTTTGAGTAGAACTTCTTGGAAAAGCGGCTCTGGGTAGCCGAACGGTTCAAGATCGAGGATTGAAGCGGTTCCCGCTTGGCCTGTAGGCCAATGGACGCTCCAACGCCAGTTTTTGAACTCAGGCGGATCATCGCGCTGAATTCCCATTTGCAGCACTTCATGCCCCTGCTGCCGGGCCAGGATAAGGTAGTTCCCGGTAAAGATTTTGTCGTCCTCCTCAACGCGGGTCGAGTCGCAAATAGGGCAACGAACCTTGCTGAGGGTACCAGTGACGGAAATGGGAACTCCGCTGTTGATGGCTTGGTTGCTTCCTCGCCATCCACATTCTTGGCATCTGACTGGCCAGTCTTGATTTTCCATGGTTCAGAACAGACTTCCAGTTGTCGCAACCTCGGGCTGCATCGCCGCGGCCGGAACCGTCGAAGCGGGCCTCTTCGAGGCTTTGCTGACTTTCTGGAGGTAGCGTTCAACCAGGCGGCCGAGGTTGGTTGTGTCGTCATCGACCTTCTGCACACCAAGGTGCGGATCTACGATCTGATTAGCCTCGCTGGCCTTGATACCAAGCACGTCCATGATGGGCGGATCGCTGCCCTCCTCGCTGACGAGGAAGAAGGCGGTGACCGGATCTTCCTGACCCTCTCTATCCAAGCGCCAGATGATTTGCTGGTGTATGCCCGGGGACCAATCGAGCTCGCCGAACACCACAACCGAACTGCGGAACTGCAGATCGTCAATGCCTGCTCCTGAGCGGAGAGACATGATCATCACGTCCGTGTCGCCGGTGAGGAACCGATCCTTTTCCTTGTTCTTCTGGGCAGCCGTTTCCGAGCCGGTGTACATGGCCGGGCGAAGGTCTGCGAGTTCTTCAAGCCAGATGTCGTAGACGGCCCGGTGCCACCCAACCAGAAGGACGGGCTCGCCGGCCTCGACCATCAGGCGGACGAACTGTGCTACTGCCTTGGCCTTGGAGAGCCCGGTAGCCTGCCGGACCATCATGTCCAGTTCCCGGGCGGCCTGCCCGCGCTCGACGAAGGTGCCAGTGGTGGCGCGGATGGCCAGCACCCTGGCCAGGTCCTCGATGGACTGCACAGCCTTGGCATCGAAGTCCACATACTCGATGATTCTGGACACCTTTGGCAGCTCGAGGCCCACATCCGATTTGAGGCGTCGTAAGAGGACGTGCTGTTCGCGGAGGTATGTTCCCAGAGCCTTCGGGTTCCCGATGCGGCCCATGTCATCGGTCCATTCTCTGGAGAAGTCAGCGAAACTACCGAGCACCGTGTCGTCGATGAACTGCATGACATTGTGCATTTCAATGCCGTATCCATAAATTGGGGTAGCGGTCAGCCCGAGCCTGTAAGACACGTGGTTCGCTAACACCTTGGCAGCGGCGCCCTTGGCGGTCGACGTGCCCGTGCGCAGGCTTTGTGGCTCATCGAATACGGCTGCTTTGAAGAAGTCCGTAGCAAAGATGTCCGCCCATCCGCCAATTTGCGAGATACGGAAGACATAGACGTCCGCCGGCGGCAGATCGTATGGGGATGCTTTGGTGATGATATGCACCCGGAGGTGGGTGAAAGCGGTGAGCTTGTCCTTCCACTGCTTTTGCATGTGCGGGTCGCAGACGACGGCCGCTGGGAGAGATTGGGCTTCAGCGCACAGGAACGCCGCTGCCGTGTAGGTTTTCCCCAGTCCGCCTTCATCGCCCAGGAGCAGCGAGCGGCGCCGCCGCAGGAGTTCGACGGCCTGCATCTGGTAGTGACGTACCTTCTGCCCTTCGCGAAGGCCTACAACCGCCGGCGGGACGTATTCGGGGAGAAGGATGCGCTCCATCTCGGCCTGCTGCATCTCGAAGTCGAGCCGACCGCCTCGCAGCGCGTTCCGGTCACCGTCCGACATGGCCAGCGGATACCGAGAGAGAAACCAATCGAGGTCCGCCGCGTGCATCAGATCGCGGGGAAAGCGGAAGGGACCGGTTGATTGCTTCGGCACCCGGGGGAAGATGTGCTTCAGGCGAATGGCGACATGCGGCTCCAGGGAGGACATTTCCCAGGCGGTACCGCCTTCGATCAATCGGAGTTCGCCGTAGGTCCGCATCAGATCCACCCCTGGCTCAACGAGGCGGCGAATAACGGCGTTCCTTCGATCTCCGGTGGAAGCCCCATGGAGACATTCGAGGCCAGGATGATCGCGGTCACCTCGGGATAGGTGGCGTACCGGGCCAGTTGTCGAAAAATGTCCATTTTCTTGGACTTGTTCCGCATCTTGCACTCGACCACTACGCCGCCGGCGATCAGGAAATCAGGGATGTCTTTTGGGGACAGGCGTTTTTCCCTTTCATAAGCGATTCCGGCTGCCTTGAGCACTTCGGCGACGCCTTCCTGCAGATGCTTTTCCGATGAGAGGTCGAGCCGGCTGCGCTGCACGAGGCGGATCACGTCGGCGATCACAGGGGGAGTTGTGGACGCCTGGCTCATGGCTGAGTCTCCTCCGGTTTCACCTCGAAGAAGTTGAGCTTCCCTTTCACTTTGCGGAAAGGCAGCGGCTTTGAATCTCCCACCACGAAGGCATGCTCCCCCATGTACCAGCGAGACGTCAGTTGGCCGTCTCTGGCCTTCTCGGGCGGGATGGAGTCGGTCAGCACGGCTTGGCCGACGATTCCACCAAGGTCGTACTGACCCGGGCGAGGAAGCGGAATTTTCGGAAAGCGGCCCTTCACCCACAGGTAGCCTTCCATGTCGAACGTCTGGCCGGCGTGGACGAGAAATGGTCCCCGGTGCTTCGTTGCCCAGGTACGGTTCTCGATGTCCTTGATCTCGCCGGCGGCGACGGCGGCGGCACGTTGCTGGGGGTCGGTCAGGTCTGGGCGCACGATCAGCCAGGCCCAGGGTTGTCGGATGGATAGCGCCTTCATGATCGATCTCAATGGGTGGTGGCCACTGTATTGCTGACCCGCAATTCAACCGGGACGCCGTTGATTTCGGGCGGGAGGCCCTGGATTGGTTCAGCAGCCAATACCAACAGCCGATCGGCGCCGTACAGGATGCGGTAGGGGGCGGCCGGCAGTTTCTCGGCCAGTACCGCCTTCAGATCGTTCTCCGCCGCTTCAAGAACTGGATTCTTGGTTGGCTGCTGCCTCGCATGGGCCAGTTCAGCGCGCATGGCTTCGAGCTCGAGCAGCGCAGCGAATGCGCGGTCTGCCGTTTCCGGGCAGCTGTAGGCGATCATGAAGGAATTGATCAGGCTGCGGAGCAAGCCGCCGGTGTCGTGGTTCTTCACGACCGACATCACGGCAAACTGATCCAGGCGCTCGACAGGGGTCTCCAGGAAGCGCTCCAGGCCCATGCCCTTAGCGATTGCGGTGAAGAGAATGGTCAACGTCCGCCGGCGCTCCTCCGGGGCCAGGGGCTGCCCGTTCGTAGGGCTTGCCAGCTCGCAGCGGAAGCGGTTGGCGAGAGCGACATCGACGCGCTGTTGGATGAGGGTTTCGATGTTGGGAGCTTTGGTCATGATGTTCTCCGATCAGCAGCCGCCGGTGGCGTGACGGTAGATAGAAACTGGCGAACGCCACTGGCGTTTTGAGGACCAAGGATGTCGCCTTCCATCGCCTCCAAGAGCCGTGCAGCGTGGTTATAGCCAATCCGCAGATGGCGCTGGACCATGGAAATGGAGGCATGGCGTCCTTTCAGGACGAGATCGCGGGCTTCCTGGTACAGGGGGTCTAGCCGACTTCCCTTGATGCTTTCATCGATATCGTGTCCGCGGATCATGCGACGTGCTCCCCAGCCAGGCGGGCCGTCAATGGCATCAGCGCCATCCGGCGGCGCTCAAGGCAGAACCACATGATTTCCGCCCCCGGCTTGCCCTGGGCGGCCAGTGCACGGTAATCCGCGATCAACTGGTCGAGGATGGTGGGTTGGACGTGGCCGTAGATTTGGTCGATCGTCGGGAAGTCCTTGAAAATCTCGCCCACCGGTACCGGCGGCTCGGCCATGAGGGCGTAGGCCCACGGCTGTGTGTAGCCGCAATGGGGGCAGATCCAGCCTTGCTCGGTAGCGATCAGGAGGCCGCGGTCACCGCCCTCGGTACTGTGGGTGGCGAGCGATACATCGGCGGCACCCGATTCGTCGTAGGTAATGCCGTCACCGCGATTTGGGCAGGTGAAGGGGTGGATGGGCATGCTGCCGTCCACGTGGCATTGTCGCTCGTTGAGATATTGAACCTGGGCGGGGGTAAAGGGGGCCTGAATCTTCATTGCTTCTGATGCTGTCTTGCGGGAGAGGAGATCGCCTCGTCGATCGAGAGGCAGTAATGCCAGAGGCGGTGCTGGCGGAAAACGCGGCTGATGCTGTCGCATTCCCCAACCTGGAATACCGGCACGCCGGCCATCAAGGCGGCGCCAGCCTCCAGAATTGCCCCCTTGAGGATTTCCCCGGCTTCGCAGTAGAGCACCAGGCGCTCGGAGCCGGCGATTTCGCTTAGGCACCGGTCAGCCAGCTCCTCGTAATCGGCGGTTTGGCCCTCGCCGGCTTCATCGATCCAGGAAGATGCTGTCTTGGTGCCGCTGTCGCGCAGCGCCTTCCAGCGATGGGCATGGACGACTTTGGAGGCGAAATAGACGCCCCCTCGACCTTTGGCGGTACCGGCGGCGATCTCAAGCACGGCTCTCACCTTGGCCAGGTCAAAATGGTTGTCGCTGCGAACCCCCGACTCCATGTCGATCCAGTAGGGCACGTCGCGCCAGGCAGTTGCGGATGCGTGGATGCGTGGCAGCTCGATTTCGAGGACATTGGGGCCCAGCCCGCCGGCGTAGCCGCAGTGCAACGGGATTTGGCCGACGGTCGAGATCGGTGGCCATTGCTCCGGACACTGCCCGGTCCCCTTGGATTCGTCGTAGAGCAGATCAATACGGGACAGGGAACTGGCATCGACCTCGGCAACGAACTTCTGAATGGCCTGGCGAGATCCGTCGTGGAATTGAAGGATCAGGTGCAGGCCAGCCTGGTGCAGCGTGCGGTAGACCGACAGGACCTCTTCCTCGGTGAACTCGGGTCGGCGCGCGTTGATGTTGAGTTGGATGCGCCGGTACCGAGAAATGTCGGCAATGCGCGCCGGGGCGGTGCGTGGATCAAGGATCTCGCGAAAGACTTGCGTGCCGCAGAGATGGGCTGCGGTGAAGGGCAAGCCCAGGGCCAGGAATGCTTCACGCCAGGCAGCAGAGGGGTTCCGTGGCGAGCCTTCCTTCTCCGGGAAATAGAGAATCGCCCACTCGGGCGTGATCGGATTGGTGTAGCGGGCCGACAGGGTAGCCAGTTCGGAGGGTAGGACTTTATCGTCGGCGCCGGTGATGGAAACGAGGTTCAGAGGCATGGAGGAATCCTTGAAGGGTTTCCTGCCATGCTACGGAGCCGCCATAGGCGTTTCTGGATTTTCTTCTTCACTCAGCGACCCTCCTTCGGCCGAGCAAAGGGGACCACGTTTTCCAGCTCACGTTGAACGCCCCCGGCCGCGCGGAATTCATCCCAGAGCACCATGTGGCCCGGGCAATAATGGACATTCGGCGCGACTTCGGTGGCGTGGCTCTCGCACAGGGGCATGTCGCACGTCTTGCCGTCCCCGACCGGGAAGTCGCACAGGAAGTCCCCAACGGACGCGCAGGCGCCGCAGTGAGGGCCGAATTCGCCGCACAGGAACATCGTCCCGCCGTCCTTCATCGGTTGGATGTAGCAGGGCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021845|72042:78469|74830_75211_-|WP_152090994.1|DBSCAN-SWA MSQASTTPPVIADVIRLVQRSRLDLSSEKHLQEGVAEVLKAAGIAYEREKRLSPKDIPDFLIAGGVVVECKMRNKSKKMDIFRQLARYATYPEVTAIILASNVSMGLPPEIEGTPLFAASLSQGWI >NZ_AP021845|72042:78469|75207_75648_-|WP_152090995.1|DBSCAN-SWA MKALSIRQPWAWLIVRPDLTDPQQRAAAVAAGEIKDIENRTWATKHRGPFLVHAGQTFDMEGYLWVKGRFPKIPLPRPGQYDLGGIVGQAVLTDSIPPEKARDGQLTSRWYMGEHAFVVGDSKPLPFRKVKGKLNFFEVKPEETQP >NZ_AP021845|72042:78469|73097_74831_-|WP_152090993.1|DBSCAN-SWA MRTYGELRLIEGGTAWEMSSLEPHVAIRLKHIFPRVPKQSTGPFRFPRDLMHAADLDWFLSRYPLAMSDGDRNALRGGRLDFEMQQAEMERILLPEYVPPAVVGLREGQKVRHYQMQAVELLRRRRSLLLGDEGGLGKTYTAAAFLCAEAQSLPAAVVCDPHMQKQWKDKLTAFTHLRVHIITKASPYDLPPADVYVFRISQIGGWADIFATDFFKAAVFDEPQSLRTGTSTAKGAAAKVLANHVSYRLGLTATPIYGYGIEMHNVMQFIDDTVLGSFADFSREWTDDMGRIGNPKALGTYLREQHVLLRRLKSDVGLELPKVSRIIEYVDFDAKAVQSIEDLARVLAIRATTGTFVERGQAARELDMMVRQATGLSKAKAVAQFVRLMVEAGEPVLLVGWHRAVYDIWLEELADLRPAMYTGSETAAQKNKEKDRFLTGDTDVMIMSLRSGAGIDDLQFRSSVVVFGELDWSPGIHQQIIWRLDREGQEDPVTAFFLVSEEGSDPPIMDVLGIKASEANQIVDPHLGVQKVDDDTTNLGRLVERYLQKVSKASKRPASTVPAAAMQPEVATTGSLF >NZ_AP021845|72042:78469|76240_76504_-|WP_152090997.1|DBSCAN-SWA MIRGHDIDESIKGSRLDPLYQEARDLVLKGRHASISMVQRHLRIGYNHAARLLEAMEGDILGPQNASGVRQFLSTVTPPAAADRRTS >NZ_AP021845|72042:78469|72042_72375_-|WP_152090991.1|DBSCAN-SWA MTVYVDNMRAGYGRMVMCHMLGDSEAEVHAMADCIGVARRWYQGDHYDICLAKRSLAIKSGAVEITSRQAAAMRRRRRETGSLGSPEEAETWVRTSIRERRAGATEGASA >NZ_AP021845|72042:78469|76500_76989_-|WP_152090998.1|DBSCAN-SWA MKIQAPFTPAQVQYLNERQCHVDGSMPIHPFTCPNRGDGITYDESGAADVSLATHSTEGGDRGLLIATEQGWICPHCGYTQPWAYALMAEPPVPVGEIFKDFPTIDQIYGHVQPTILDQLIADYRALAAQGKPGAEIMWFCLERRRMALMPLTARLAGEHVA >NZ_AP021845|72042:78469|76985_78104_-|WP_152090999.1|DBSCAN-SWA MPLNLVSITGADDKVLPSELATLSARYTNPITPEWAILYFPEKEGSPRNPSAAWREAFLALGLPFTAAHLCGTQVFREILDPRTAPARIADISRYRRIQLNINARRPEFTEEEVLSVYRTLHQAGLHLILQFHDGSRQAIQKFVAEVDASSLSRIDLLYDESKGTGQCPEQWPPISTVGQIPLHCGYAGGLGPNVLEIELPRIHASATAWRDVPYWIDMESGVRSDNHFDLAKVRAVLEIAAGTAKGRGGVYFASKVVHAHRWKALRDSGTKTASSWIDEAGEGQTADYEELADRCLSEIAGSERLVLYCEAGEILKGAILEAGAALMAGVPVFQVGECDSISRVFRQHRLWHYCLSIDEAISSPARQHQKQ >NZ_AP021845|72042:78469|72371_73094_-|WP_152090992.1|DBSCAN-SWA MENQDWPVRCQECGWRGSNQAINSGVPISVTGTLSKVRCPICDSTRVEEDDKIFTGNYLILARQQGHEVLQMGIQRDDPPEFKNWRWSVHWPTGQAGTASILDLEPFGYPEPLFQEVLLKIGLWARNGDSKAMWWLGSYHEVGCRTSGANGAKALAYYLAAIRHDINSCDYSSIRRVLIEGFDDLFRDYRRADVANKTPHDVWSFLAKFGEFQMRDGKIGPRWPYSNDWQACIAIAEGLQ >NZ_AP021845|72042:78469|75656_76244_-|WP_152090996.1|DBSCAN-SWA MTKAPNIETLIQQRVDVALANRFRCELASPTNGQPLAPEERRRTLTILFTAIAKGMGLERFLETPVERLDQFAVMSVVKNHDTGGLLRSLINSFMIAYSCPETADRAFAALLELEAMRAELAHARQQPTKNPVLEAAENDLKAVLAEKLPAAPYRILYGADRLLVLAAEPIQGLPPEINGVPVELRVSNTVATTH >NZ_AP021845|72042:78469|78175_78469_-|WP_152091000.1|DBSCAN-SWA MPCYIQPMKDGGTMFLCGEFGPHCGACASVGDFLCDFPVGDGKTCDMPLCESHATEVAPNVHYCPGHMVLWDEFRAAGGVQRELENVVPFARPKEGR |
10 | Ruegeria_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_2 |
219800 : 230598
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021845|219800:230598|DBSCAN-SWA CCTAGTAGCGAGTCTGGTAATCGTCGCCACGGGAGAAGCGGCTGTTCCAGTGCTCCAGGTAGCCCTTGGCCAGCTCAGGGTTCTTCCAGACCACCAGCACGTTCTCGCTGTTCTTGTTCGCCGCCGCGCCGCTGTAATTGAAGCTACCGGTCTCGACCGTCTCCCTGTCCGATACGACGATCTTGTCGTGGTGGATAGGGTAGATCGAGATCGTCCGCACGCGACAGCCAGCATTCACCAAGGCGGAGAGCGCCGCCCGGGCCTTGCCGGACTTGTCCTCGACGACGTTGTTCTTGTAGTCGGAAACGATGGCGACATCGACGCCCCGCCGGCGGGCCGCGATCAGGGCTTCCACCACCGGTGCCGAGGTCAGCGAATAGGTCATCATGCGGATCTCGGAGCGGGCTGATCCAATCACCTTCAGAACCAGCTTTTCGGCGCCTTCATTGGGGCTGAAGGCATATTCGATGGTCCCGCTATGCTGGACCGTCGAGGAATGGTCCTGGAGGCCCTGGACGGCCTTCTCGGCTACATTCAGCCAGTTGGTTGCGCCGGCCTGTCCGGCGAAAAGAACGAGTGCGAGGGCGGCTGCCCTGAGTACGGCATGCTTCATGTTGCTTCCTTTCGAGATTGAGCCGCTTGCGCGGCGGGAACGGTTTTACTCCCCCTTGCCCTTGGCTTGGCGGAGGGTGATGAGTTCTTGGTACCGGGCGATCACTTCGCCGGCCGTGGTGAAGCCCACAGAGATGCCCAGGATGTCGGCCACCGCCTGGCGGACTTCGGCGATAAGCTGGGCTTCCTCCTCGGTCGGCACGCAGGTTGGCCAACCAGAGGAATAGAGCAGGTGAGCAGGCAGCCGGGATTCAGCCTGGACGCGCAGTTCGGCGCGACGCTTGGCATCAGCCAGCCCTTCTCGCTCCTCTGCGGACAGAAGATGCGGGGCCACGTCGCCGAGCCACGTCGCCCCCTGCAGGGCGAACGTCGAGGCGATCTCCAGGCGCACCATCCGGCGGTAGATATCGGCGTTATCCGGGCAGGACGCCGATGCCTTCAAATCGCCGGCCGACCCCATAATGCAAAACGTGCAACTGAGGCGCGTGCTTCCGTAGACGGTGTAGGCCTCGTGCAGGGGCGCCTGCTTGAGCGCGATGTAGTCGAATACTGCTGGCGTCGGCCACTTGATCACTGGATTCCAGTTCATGCCCACGCATCCTCGGGCCGAGAGAAGCGGCTGCGGCGTAGCGATGGGCATTTTTGCTCGAGCAGCACTCTCCGCATGGCGAACGCCCGTTGCGGACAGGATTTTTTGCCCTGGGTACCGCTTCGTTAGCTCCCTGCAGATGATCGATGTCTTGAGCTCGCTGGTGCAGAATCTCATTGATGGGGTTGACCATGGAAGGATGATCCTCACCGTCGAGAGCGATGCGTACCGCTCAATGTTGTTCTTCCAGCGGACTTCCCATCGATCCATCATGTCGCCCGCAGCACGGCGCACGACGATGAGTTCCCATCCCAAGCGCTTTGCCAGGCGTTCGCACAGCGGTAGGCTATCCTTCCATTCGACAGTGCCAAGGTCGCTGTGGATGAGAACACGCGGCCCCGAATGACCAATCTGGTCGAGGTATTCGGAAACGCGAACGGCCATTGCGACACTGTCCTTCCCTGCGCTCACGCCAAGTGCGCAAACTGCGCCTGCCGCCAGCCACTCACTGACCTCCGGTGTCACGGCCACGTCGTACTGGCCTGCCGCGGCGATTGCCGGGGCGAATAAGTCAAGCTGCTGCAGTCGGTCCATCAAGTCGCACTCCGTTGGCTTGAGGGCGCGGCCCTCACTCAAACGTCATGCTACGGATTGACCCCACCCGATTCTGAAATACTTCCCTGTCCGCGATGGACTGTCTGGTGGCCCGTCTTCCGGCCCGAATTCAGGCGGCGAGCGGGAGGCGGAGCTGGTCGCCCGGCTGCGCGGACATGCGGGAGCGCGCCGTCCTCATCCAGCGGCTCATTTCGCTCGAGCGGTGGGCCGTGGTGTTGGACAGACCGCTCTGCCGCGCCTTGATCCTGGCGCCGAAGTCGTAGGCCATGGAGTCGGCCGAAGCCACCCAGTCCAACATCTTCACCTCGGACAGGCTCGCCCCCTTGACCCCGAACAGGTGCAGCCGCGCGCCTTGTGGCAGCTTCCCTTCCAGGGCGGACAGGATGGCGTAGAGGCCGTGTGTCGGGTGATGCAGGTTCCGCCGGCACACCGATCCAACTCCGATGAGCAGAGGCGCATCAAGCCAGGGCTGCCAGCGCTCCCAAACGGCCGTCAGCAGTTCCAGGCTGCGGAGGTAGTCGGATGCCGACCACCCCTGGATGACGGGCACCGGCGGCGGCAACATGTTGGCCACCACCGAGGCTGAGCACGTCTTGGCGAGTTCGTTCTGCCAGGCGTAGACCACGCGCAGCACGCCCTCAAGCAGCGTTGCCGTCGCGTTGATCCGGTAGTCGATCGCCGCCTGGTCCTGGGCGATCTCCGGCTCGCAGCAGAGATCCGGCTGGGACCACCACGACGGGCTCAACAACGACGCCAGTTCAACATACTGCTCGTATGACCAAGGGAATACCCCAGCCATGCCCGGCTGCTTTCCCTTCGCCTTCCACAGCTTCATTGCGGTGAAGCCGGCACTGTCCAGCGCAACATCGAGCTCGGATAGATCGGTGGCCTCCGGCACGCGGAAACAGCCCTTCTCGGCGTCCCAGAAGGCATTCGCGCTCACCATGACGGGGAAGTCTTCATTGAAGGCGTGAAAGGCCAGCTTTCCTCCGCGGTGGGGAATGCCGACCCGCATAACGAGCTCGTCATCCAGGACGGCATTGTGTTGCCCGGTGGGCAAACGCCCCAGTTCCAGCTGATGATCCATGTTGAAGCACTCCGATTGCTCGAGGGCGCGGCCCTCAGTCAGTCGTCATGCTACGGATTGGCCCTGGCCTGTTCCGAAATACTTCCCTGTCCGTTGGGGATCGGCTTCGCTTCGACCTGAATGTACTCAAGGTTGTCCGTTGGGTGCAGGATCAAACGCCCGCGGTAGCCTGGCACCCGATCATCGACCAGGACCCGCAGATGCGGGCCCTTGGCGGACGTGATGGTGCAGTTCCAGATAACCCCGGCCCCGTCGGTGTAGCGCACACGCACGCCACGCCGGGCGGGCACCCCGTAGGTCCTGCGGATGTACTCAAGGCTCATGGCGCCCCCGCTCAGGCGGCCATCGGGGCCTTGATCGCGGCATGCGATTCGTAGCCCTCGAGGGTGATGTCGGCGGGCTCGATGCGCGTGAAGGCACCGGCGACGTCCTCCAGGCGCTCGATCCGCTTGATGTTGTCCGACAGGATCAGCTTCGGGGCTTCGAGGTGTTCCCTGGCCAGCAGCTCGCGCACCTGGTCGAAGTGGTCCTCGTAGATGTGTGCGTTTGTCGCCTGGATCGTGACCACACCAGGCTCGAACCCGGCCAGCCGCGCCATGATCGCCAGGAAGATCGAGGTCGCCGCGATGTTCGCCGGCGCGCCAAGGAACAGATCCCACGACCTGATCGTCATGACCAGGTTGAGGACACGGGGGTTCTCGAAGGCCACGAAACGGTAGTCCATGTGGCAAGGCGGCAGCGCCATCATGTCCAGTTCGGCCACGTTCCAGCCAGAGACGATCACGCGCCGATCAGAGGGATCGGTGAGCAGCTTCTTCAGCGAGTTCTCCAGTTGGTTGATGGTCCGCTGCATGAGCCATTCGGTCTTCTTCCCGTCCTCGTCGACCGCATCGGCGCACATGCGCACTTGGTAGCCCAGGGCCAGGAGCCGATCACGTTCAGCCGGGGTATCCGCGATGCGGCGATCCATCCACTCGGTCCATTGCTTGCCGTAGATTCGCGATAGGTGGTCGTGACCGCGACGGTAGGGACTGGCCAGCCAGGCCGGCGTCTCGTTGGCGTTGATGTCCCAGAAATGGCAGCCCAAGGCGCGGAAGTCGGCGGCGTTGTCGTAGCCCCGGAAAAAGCCCAGCAGCTCGCCGACGATGTTCTTGAAGGGCAGCTTGCGGGTGGTCAGCGCCGGGAAGCCCTGACGAAGGTCGAATTGCACCTGATGCCCCACCAGCGCGCGGCAGAGCTTGTTGGTACGGGTGTTGTACTGATCGACGCCCTGCTCCATCGTCAGGCGCAGCAGCTGGTGGTAGTTTTCCATGGTTCCCTCGCGTAAAAGTAGAAATCGTTTGGACAGAGACAGGATGCCAGATCGATACCCCTTGATCTGGAATCCTTTCCGCTGTCCCTCTTGGATTGGCTGTGTCCGTCAGGCGGCTTTCTTGACCGCCAGCCACGGGGCATTGGCGCGCAGGAATGCCTCCGCCATCACCGGATTGACGCTGTTACCGATCATGCGGACCTGGGTGGCCACGGAGAACTTCCGGCCGTCGTGACCACGGTCGATGATGTAGTCCTCCGGGAAATCCTGGCCGCGCGCCAGTTCGCGGGGCGTAAGCATGCGAAGTCGAATATCGACAATCACATACGGGTCGCCCTTGATGAACACGGTGACCAAGGCCAACCGGTCTCGCGTCGTGATCGTGTTGAGCGGCTTGTCCAGCGCGGACCATTGCCCGCCCTCGCTGTAGTATTCCATCAGGAAGGCGGCGACTTTCAGCGCGCCTGCCTCGTCTTCAGGGGACAGGCCGTTTTCGGTCTCGCCGGCCAGCTCGCATTCCACCAAGGCAGACTTTCCTCCGCCCCCGGCCGTGGCGGTCCCCATCGAATCGTTCGCGGCATGCCCGACACTGTTGCCGAACTGGCGGGACAGGAAAGCTGCCACAAGCGCATGATGCTCCCCTCCAGCGGAGATGGTCTGGATCGGGGCGTTCAGATCCCGGGCGTCGCAGTTGTTGCGCAACTGAGCCAGGTTGAGTGCCACCAGTTGCTGCTGGCTGCCGGTCGTCGTGATCGACGTCATCGACTCATCCAGGCCGCGGCCGATCGTCGCGTTGAAACCGTCGTTTGCCTGCATCATGAAGGCGCTCGACGGGCCGGACGGCTTCGATGGCGACAGCACCGGGGTGGCCAGCATCAGCTCGCCGCGGTGGGCCGTGGTAATCGTGGGCAGAGAGCCCTCAACCGAGTGCATCCGGTCAGACCCTTGATGGGTTGCCGGGACGATGATCGGGGTCGCCACGGCGAATTCTCCGCCCTTGGTCGCCGCCTTCACTGTCCGCAACGGTTCCAGTGCGCTGTGAACCACATCCTGTCCGTTGTAGTGGGCAATCGGAACGATGCTCGGCGTAGCCAAATACTTGTGATTCCCGCCACAGCAGGTCGCCATCGGCTCGTCGACCATGCGGGGGACGTTGTTCATCATGTTGTTCACGATGAACGGCTTCGGGTTGTCCAGGACGAACTTCTTGATCCCCTTGGCGATCCGGCGCTTCGTGGCCGGCGCCAGTTCCTTCTTCCGGCCGAAGATCGACTTGCCCAGGTTGGAGAAGTCGATGCAATCGGCAGCCGGCTTGTAGGCCATCTGCTTTCCGGTGGGACTCTCGAAGTGGATCTGCTCGGGCCAAATGATGGGGTACCCATCCCGGCGCGCGATCATGAACAGCCGCTTGCGAGTGGTATGGCCGCCCAGTTTCGCCGCCACCAGCGCGCGCCACTCGACCACGTAGCCCATCGCCTGAAGCTTCCTGACGAACTTCTTCCAGGTCTCACCTTCCCGCTTGGGGTCGGGGATCAGGTACTGCTGCTGCACCGGCACGCGCTCACCTGGCTCGGCCACAACCTGCCGGACCTTCTTCTTACCCTTCACGACCTCGGTCACCAGCTTGATGACGCGGCCCGTAGCCTTATCCCGCTTGGCGATCAACGGACCCCATTTCAGGATGGCCAGGACGTTCTCGAGCGAAATCACGTCCGGCTTGGCTTGCCCAGCCCAGCGCATCCCCGACCAGGACAAGCCCCTGATCTTCTTCGATCGCGGCTGACCGCCTGCTGCTTGCGAGAAATGCGTGCAATCGGGCGAATAGTGGAACCAGCCCACTTCATCACCCATCGTCGCGCCACGAGGATCCACCTCGTAGGCATCGGCGCAGAAATGCCGCGTGGTCGGGTGATTGATCCGATGGCAGCTCACTGCGTCATCGTTGTGGTTGAAGCAGACATCCGGCGATCTGCCGAAAGCCTTCTCGAAGGCGATCGACATCCCGCCCGCGCCGGCGAACCCGTCAACGATCTTTTTCCGGTGGATGTTCAGCAGCAGTTGCTTGCCCATTGTTCGTGCCCTCTGTGAATGACGATTGGGTCAAGTCTGCCTATCTGAGCCAGGCCTTTCTGAAAAAAGCGGGCAGACCCCGGAGGTGTGCCCGCTTCAATCTGCCTGTCCGTAACATCTGCGCTGCTTATGTCTCCTTGTTGGCCAGCTCCAAAAGCACTGCCGCATGGCAGGCGTCCTCATACGGATCATCCTTCCCACACCAGCAGGCCAGATTCTTCCCAGCAAGCTCGGCCCTGGCTTCGGCTACCAGTTTGGGGTTGAGAGGAGCCAGCGACCGATACAACACGAACGCATGCCGCTTGTCCCGAACGATCTGGCCCTTGGTCGGACCGAACGGCGCTGGTTTCCCCGGCACAAAGGGGTTCCCCCACTTTGTCGTCCGATCCACTTTCACCGTGTTTTCCGGCATGCGCCAGCCTTTGGCGCGCTTTAGTTGCACACGTTCTGGCATGGCCGCTACACCGGGTAGCCGTCGTGAATCTCGCCGTCCAGCGTGCGCCCCGCAGCTTTCTTGCCGATCAGGAACATATCCGGCTCATCGTCGATATGGTCATCTTCACGCGCCTTGCCGACGCGCCGGATGTCCCACTTGCCGTTGAACCAGTACGCGGCGTCGATCGCACGGTCGTTGGGGCCAGCAACCTCGCCTGGCGCCCATTCGCCCCATTGTTTGAACAGATAGGGCACGCCGGCGGCTGCGCACTGATCACGCAGCATCCTCGCCCAAGCCGGAAGGCCTGGCCGAGCACCGATACCGCTCTCGAAACCCTGGACAACCCAGTCAATGCCCGGGTCGTAGGCGGTCCATCCTGCGCGCTCACCACAGTGCGGGCAGAGAGGCACGGTTTCCTCGCCATCCTTTACGGTCTCGATTTCGTCGGCGATCACATAGCAGCTGTCCGGACAAACGTCTTTGCACTGCACGCCCACCGGATTGAGCCATCGGGCCAGATCGATCTCCCCCAATTGGGGCTCGCAGCTTACCCAGCGCACTGCCGCTGGCGTGTCCATCAGCAATGGCACCCGCTCGTCAGCCGCTGCCTGATCTTCCACAGAAACGCCCAGCCAGATACGGGGATGGGGGCCATCAAAGTTCATCACCTGATCCCAGATGCCGTCGGGGTCCGTGCCGCCACCGTGATTGACAGCCGCCCGCGCCCAGGCCTCCCGCCGGTCGGTGCTGAAGTAGTCACGCATGCGCTGCGCGCGCTTGGTGAGCACCTGGAAGATATGGCCGTCCTGTTCGTTGCGGCCATACAGGCAGGCCCACATCACACCCATGATGGTGTCGATCCAATCGTCCGGCACGTCCGGGTGGAACAGATCCGAGTGGGCGCAGACGAACACCTGGCGCTGCCGCCCCCAACGAATCGGCTGGTCAAGCCATTCATGGTTCAGGCGCACTTCGCCGGTCCATACCGGACCGTTTTTGGTGTCGATGGTCAGCCCTGCGCGGGACGGGTGATTCTTGAGGCGGCCGCCGGCCAGCTTCATTGCATAACACAGCCGACAACCACCGGAGGTTACAGAGCAACCCGTTATCGGATTCCAAGTGGCATCGGTCCATTCAATCTTGCTGTTGTCAGCCATGAGCACTCCCTTCAGTTGTTGGTGGAGCCATCTTTACAGCCACCACCTTCCAGTCGCCGAACTCGCGACCGGCTTGCTTCAATCCATCGATCTCCAGCGCGATCTCGCTGGCGTTTTGTTCCGATGCGGCACAGCAAAGCAGCATGGCCGCCGCGAAAGTCGTCACGCGGTCAAGCTGCTCCGCAGCACTGGTCAGCAAAGCGATCTTCTTCTTCACCAACTTGCCGAGTGCGGCGTCGTCGAGGTCGCTCCAGAGTGCCGCAGCCTTCTCTTCCTCGGTAAGTATCAGTTCAGCCACAGACGCCCTCTGCCAGTTCTCTGGGCGTCAGGCCCAGGCTTTCGTTGAACAAGGTCTTTTCCTCGTCCGACAGGGACACGCCTTCGGCAGCGATCACCAACAGGGCGTCGGCCTGAGACCGACCACCCAGCGGGCGGAACTTCACGCCATAGACAGAGGTGCCATGCACCAGGGCCGGGATGATCTCGTCCCAGTTGCCGTCGCTGTCCCGATACACCCAGCGGATGTCTTCCAGGGCAACGTTGAGCGCCTGGATGTGCAGGTGATACAGGAAGCTCACCAGTTCCTCAGCGGAGTTGGTGATACTGGCCCCATGGTTGCCGTCGGTCATGGCCACGACGACTCGCCCATCACGTCCATGCAGGATCGCCACATGGGCCGTAGCCTTGCCCTGGGCCAGCCCGTGACGGGGAACCGCCCCCGCCCGACTCACCACCAGGGGAAACAGGCCACCAATCGGCGGCCGCGTCATCGTTGCTTGCATACCTTTCCCTTCCTAGACGAACAGTGCGCCGCTCTGGCACAGACTGACGACACGCTTGCACTCAGGCACATCCATCCAGGAGATGTGAGCGAAGCGAACCCCAAGGGCTTTGGCCAGTGCCCTGTAGGCCTGAGACCGGTGCCACGGGGTCTTCCCCCGCCACAGCCGGTCAAAGGCGTCATGGGCTTCCCGGCGGGCCTGCATGGTCGGCCCATCAGCCAGCGTGCCGAGCGGGATGTCGGTACCGGGATGGCATCCCACCCGCGCACCGCAGGACGTGCAGCAATAGGCCAAGGGCCATCCATACTCTCGGCCCTTGTAGAAATCGGCGTTGTTGACCAGCTTCACCTCGCCCTTGCAGAAGCGGCAGGTGTCCGGCACGGATACGGGATCACCCTTCACCCGCGCGACAGCCTCGGGCAGATTCACCGTCCGCCCGAACAACTCGTAGGGTTTGAACCGTCTGCGCTTACCGGCCATGGCTTTCACCGTTCGGGCAAGGTTGCAAGGCTCTGGAAGGGCAAATAATCCGCTGCACCGGCAGCATGGCCAGATCGTCGCGGTAGCTCCGCGACCGGTTGAGACGCGGCAGCACAAGCCACCGCCAAACGCCCAGGAGCGATGTTCCCGTACAGACGATCTGCATGGTGACCATGCCGAAGGTGTGCGTCATCACGCCATAGGTCATCCAGGCCGCGTTAGACACCAGGAACAGGACGAACCCATACCCGGACCAGCGATTGTTGAAGGCCAGGAGGAGCGCCCCGGCCGCGCCCCCGATGGCACCAATCCATTCGAGCATGCCGAGGCTCAGCATGATGCTTCCTCCGGCTTGGCCACCGCCATCCGCTGACGGCAGGCCTTCTGGCCAGGGCACGGCATAGACACCCAGCGATAGATGCCAGCGGCCGTCGCATCCAGCGCCAGCACCAGCCCGAGGGCGACCACCCCGGCGGCCTTGAATGATGCCAGGGTCAGGATTGCAGAGAAGCCCCCCCCATCGACGGTGTTTGCGATCCAGCCGACGGCACCGGCGATCATCAGCATGCCGACGAAGAACTGGGCCAGCGCACGCTGGCTGAGGGTGCTGTTGAGAGCTTCCGGGGGGGTGTCCACCACCCAGCTTCCGGTGTGCCCACAAACGACAATGGGGCGATGCCACGCGGCGTAGCGCCAGGTGGGCACGTCGATGTCACGCTTCAGCAGCAGGAGGAGCAGAGCGAATTGGTAGTGCAGGTAATTGCTGATCATGGTGATACTCCAAAGAGGTTGGTATCAGCCCCCCGGTCTTTTTCCATCCGGCTCGCCGAACCGGATCGAAAAAGAGGCCTCCAGGAGGAGAGGAGGCCGAAAGCCTCAAGGTGAGGCCAGCGGGGGAGAACTGAAACCAAGATTACATAGATGGCATCGGCGATTCTGGGAAAAGATCAGCCTGTGCTTTGGATACGTCGGGGTCCGTAAGGGACGCGCTGCGTGCGTTCCACCTCCCCTGCCCTTGATGTCCGGCAGGGTCATTCACGGGACCGATTTGAAGGCGTCCTTCAGCTCGGGCAGCCGGCCGATCTCGGCCCACTTGCCCTGGAAGCGGCCGCGCTCGATGGTGGGCACCATGAACCGCTCTTCCCGGCGGATCACCAGGCCATCCTCGGACAGCACGTCGCACAGTTCAACATCGCGCTTGGAGCCGCCCCAGTCAGGGGCCAGCACATCCATGCGTGGCTGCACCTTCACCGGCGGCATCCCGTTGTAGCAAAGCGGCCAGCGGGAATGCCAGTTGCCCAGGCTCTCGCAGGTGGCGTACCAGAGGGTCATGTCGTCGCTGACCGAATCGAAGTGTCCATCCAGGACAACATGCTCAACCCGGTGGGTCAGCACGGTGTCGGCGTTGATCGCCCAGCGTGTCACCACGGTATAGGCACTCAGGAACTTCTCGGTCTCGATGAAGAGACCGGCCACAGCCTTCTGGCCGACCTTGCGGCAGCCCACCCAGTAGCACTTGTTCGGGTAGGGCCGGCGAATGGTCAGGTCCTGGTCGCCCTTGAGCACCAGGCCCGTCTCCTCGATGGTCAGATCGACGAGTCGAACGGCGGCATCCAGTCCGCCAGGGACGTACAGCTTGGGTAGAACATGGATCAGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021845|219800:230598|220457_221594_-|WP_152091181.1|DBSCAN-SWA MDRLQQLDLFAPAIAAAGQYDVAVTPEVSEWLAAGAVCALGVSAGKDSVAMAVRVSEYLDQIGHSGPRVLIHSDLGTVEWKDSLPLCERLAKRLGWELIVVRRAAGDMMDRWEVRWKNNIERYASLSTVRIILPWSTPSMRFCTSELKTSIICRELTKRYPGQKILSATGVRHAESAARAKMPIATPQPLLSARGCVGMNWNPVIKWPTPAVFDYIALKQAPLHEAYTVYGSTRLSCTFCIMGSAGDLKASASCPDNADIYRRMVRLEIASTFALQGATWLGDVAPHLLSAEEREGLADAKRRAELRVQAESRLPAHLLYSSGWPTCVPTEEEAQLIAEVRQAVADILGISVGFTTAGEVIARYQELITLRQAKGKGE >NZ_AP021845|219800:230598|226211_226538_-|WP_152091185.1|DBSCAN-SWA MPERVQLKRAKGWRMPENTVKVDRTTKWGNPFVPGKPAPFGPTKGQIVRDKRHAFVLYRSLAPLNPKLVAEARAELAGKNLACWCGKDDPYEDACHAAVLLELANKET >NZ_AP021845|219800:230598|219800_220412_-|WP_152091180.1|DBSCAN-SWA MKHAVLRAAALALVLFAGQAGATNWLNVAEKAVQGLQDHSSTVQHSGTIEYAFSPNEGAEKLVLKVIGSARSEIRMMTYSLTSAPVVEALIAARRRGVDVAIVSDYKNNVVEDKSGKARAALSALVNAGCRVRTISIYPIHHDKIVVSDRETVETGSFNYSGAAANKNSENVLVVWKNPELAKGYLEHWNSRFSRGDDYQTRY >NZ_AP021845|219800:230598|221724_222702_-|WP_172974840.1|DBSCAN-SWA MDHQLELGRLPTGQHNAVLDDELVMRVGIPHRGGKLAFHAFNEDFPVMVSANAFWDAEKGCFRVPEATDLSELDVALDSAGFTAMKLWKAKGKQPGMAGVFPWSYEQYVELASLLSPSWWSQPDLCCEPEIAQDQAAIDYRINATATLLEGVLRVVYAWQNELAKTCSASVVANMLPPPVPVIQGWSASDYLRSLELLTAVWERWQPWLDAPLLIGVGSVCRRNLHHPTHGLYAILSALEGKLPQGARLHLFGVKGASLSEVKMLDWVASADSMAYDFGARIKARQSGLSNTTAHRSSEMSRWMRTARSRMSAQPGDQLRLPLAA >NZ_AP021845|219800:230598|227967_228459_-|WP_152091188.1|DBSCAN-SWA MQATMTRPPIGGLFPLVVSRAGAVPRHGLAQGKATAHVAILHGRDGRVVVAMTDGNHGASITNSAEELVSFLYHLHIQALNVALEDIRWVYRDSDGNWDEIIPALVHGTSVYGVKFRPLGGRSQADALLVIAAEGVSLSDEEKTLFNESLGLTPRELAEGVCG >NZ_AP021845|219800:230598|224122_226084_-|WP_152091184.1|DBSCAN-SWA MGKQLLLNIHRKKIVDGFAGAGGMSIAFEKAFGRSPDVCFNHNDDAVSCHRINHPTTRHFCADAYEVDPRGATMGDEVGWFHYSPDCTHFSQAAGGQPRSKKIRGLSWSGMRWAGQAKPDVISLENVLAILKWGPLIAKRDKATGRVIKLVTEVVKGKKKVRQVVAEPGERVPVQQQYLIPDPKREGETWKKFVRKLQAMGYVVEWRALVAAKLGGHTTRKRLFMIARRDGYPIIWPEQIHFESPTGKQMAYKPAADCIDFSNLGKSIFGRKKELAPATKRRIAKGIKKFVLDNPKPFIVNNMMNNVPRMVDEPMATCCGGNHKYLATPSIVPIAHYNGQDVVHSALEPLRTVKAATKGGEFAVATPIIVPATHQGSDRMHSVEGSLPTITTAHRGELMLATPVLSPSKPSGPSSAFMMQANDGFNATIGRGLDESMTSITTTGSQQQLVALNLAQLRNNCDARDLNAPIQTISAGGEHHALVAAFLSRQFGNSVGHAANDSMGTATAGGGGKSALVECELAGETENGLSPEDEAGALKVAAFLMEYYSEGGQWSALDKPLNTITTRDRLALVTVFIKGDPYVIVDIRLRMLTPRELARGQDFPEDYIIDRGHDGRKFSVATQVRMIGNSVNPVMAEAFLRANAPWLAVKKAA >NZ_AP021845|219800:230598|226543_227677_-|WP_152091186.1|DBSCAN-SWA MADNSKIEWTDATWNPITGCSVTSGGCRLCYAMKLAGGRLKNHPSRAGLTIDTKNGPVWTGEVRLNHEWLDQPIRWGRQRQVFVCAHSDLFHPDVPDDWIDTIMGVMWACLYGRNEQDGHIFQVLTKRAQRMRDYFSTDRREAWARAAVNHGGGTDPDGIWDQVMNFDGPHPRIWLGVSVEDQAAADERVPLLMDTPAAVRWVSCEPQLGEIDLARWLNPVGVQCKDVCPDSCYVIADEIETVKDGEETVPLCPHCGERAGWTAYDPGIDWVVQGFESGIGARPGLPAWARMLRDQCAAAGVPYLFKQWGEWAPGEVAGPNDRAIDAAYWFNGKWDIRRVGKAREDDHIDDEPDMFLIGKKAAGRTLDGEIHDGYPV >NZ_AP021845|219800:230598|229974_230598_-|WP_152091191.1|DBSCAN-SWA MLIHVLPKLYVPGGLDAAVRLVDLTIEETGLVLKGDQDLTIRRPYPNKCYWVGCRKVGQKAVAGLFIETEKFLSAYTVVTRWAINADTVLTHRVEHVVLDGHFDSVSDDMTLWYATCESLGNWHSRWPLCYNGMPPVKVQPRMDVLAPDWGGSKRDVELCDVLSEDGLVIRREERFMVPTIERGRFQGKWAEIGRLPELKDAFKSVP >NZ_AP021845|219800:230598|223036_224014_-|WP_152091183.1|DBSCAN-SWA MENYHQLLRLTMEQGVDQYNTRTNKLCRALVGHQVQFDLRQGFPALTTRKLPFKNIVGELLGFFRGYDNAADFRALGCHFWDINANETPAWLASPYRRGHDHLSRIYGKQWTEWMDRRIADTPAERDRLLALGYQVRMCADAVDEDGKKTEWLMQRTINQLENSLKKLLTDPSDRRVIVSGWNVAELDMMALPPCHMDYRFVAFENPRVLNLVMTIRSWDLFLGAPANIAATSIFLAIMARLAGFEPGVVTIQATNAHIYEDHFDQVRELLAREHLEAPKLILSDNIKRIERLEDVAGAFTRIEPADITLEGYESHAAIKAPMAA >NZ_AP021845|219800:230598|222752_223025_-|WP_152091182.1|DBSCAN-SWA MSLEYIRRTYGVPARRGVRVRYTDGAGVIWNCTITSAKGPHLRVLVDDRVPGYRGRLILHPTDNLEYIQVEAKPIPNGQGSISEQARANP >NZ_AP021845|219800:230598|229269_229710_-|WP_152091190.1|DBSCAN-SWA MISNYLHYQFALLLLLLKRDIDVPTWRYAAWHRPIVVCGHTGSWVVDTPPEALNSTLSQRALAQFFVGMLMIAGAVGWIANTVDGGGFSAILTLASFKAAGVVALGLVLALDATAAGIYRWVSMPCPGQKACRQRMAVAKPEEASC >NZ_AP021845|219800:230598|228471_228939_-|WP_152091189.1|DBSCAN-SWA MAGKRRRFKPYELFGRTVNLPEAVARVKGDPVSVPDTCRFCKGEVKLVNNADFYKGREYGWPLAYCCTSCGARVGCHPGTDIPLGTLADGPTMQARREAHDAFDRLWRGKTPWHRSQAYRALAKALGVRFAHISWMDVPECKRVVSLCQSGALFV >NZ_AP021845|219800:230598|227669_227975_-|WP_152091187.1|DBSCAN-SWA MAELILTEEEKAAALWSDLDDAALGKLVKKKIALLTSAAEQLDRVTTFAAAMLLCCAASEQNASEIALEIDGLKQAGREFGDWKVVAVKMAPPTTEGSAHG |
13 | Mycobacterium_phage(22.22%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
| Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
|---|