| Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
|---|---|---|---|---|---|---|---|
| NZ_AP021844 | Azospira sp. I09 | 0 crisprs | DEDDh,DinG,WYL,RT,csa3 | 0 | 0 | 3 | 0 |
| NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 4 crisprs | PD-DExK,DinG,csf5gr6,csf1gr8,csf2gr7,csf3gr5 | 0 | 7 | 2 | 0 |
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
|---|
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
|---|
| Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DBSCAN-SWA_1 |
1397060 : 1410067
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021844|1397060:1410067|DBSCAN-SWA ACTAAGCCGTGCTCTTACGCGCGGCGACTTCTTCCAGAAGCCGCTCGATCAGGCTTGCCACGGGCATCTGCCCCAGGTCCTGACCACCGCGGGTACGCACGGCCACCAAGCCGGCTTCCTTTTCCTTGTCGCCGATGACGAGCTGATAAGGCAACCGGTTCAAGCTATGTTCGCGTATTTTATAGGTAATTTTTTCATTGCGCAAATCCGCTTCGGCACGCAAACCTGCCTGACGCAGGGTTTTCACCACTTCGGCGGAAAAATCGGCCTGTTTTTCCGAAATATTCAGCACCACGGCCTGTACCGGCGCCAGCCACAGGGGCAAGGCACCCGCATAGTTTTCCACCAGGATGCCGATGAAACGCTCCAGGGAACCGAGGATGGCCCGATGCAGCATCACCGGCACATGGCGGGCGTTGTCCTCGCCCACATACTCGGCGCCCAGGCGACCCGGCATGGAGAAATCTACCTGCATGGTGCCGCACTGCCAGGAACGCCCGATGGCATCCTTGATGTGGAACTCGATCTTGGGGCCATAGAAGGCACCCTCGCCGGGCAATTCATCCCATTCCAGGCCGGAAGCCTTGAGGCCGGCCCGCAGGGCATTCTCCGCCTTGTCCCAGATGTCGTCGGAACCGACACGGCTTTCGGGGCGCAGGGCCAGCTTCACCGCCACCTGATCGAAACCGAAGTCGGCATAGACCTTCTTCACCAGAGCGTTGAAAGCCGTCACTTCCGCTTCGATCTGGTCTTCGGTACAGAAGATGTGACCGTCGTCCTGGACGAAGCCGCGCACGCGCATCAGGCCGTGCAGGGCGCCGGAGGCCTCGTTACGATGACAGGAGCCGAACTCGCCATAGCGCAGGGGCAGGTCGCGGTAGGAGCGCAGATCGGAATTGAACACCTGCACGTGCCCCGGGCAATTCATCGGCTTGATGGCGTAATCCCGCTTCTCCGACTCCGTGGTGAACATGTTGTTCTTGTAGTGCTCCCAGTGACCGGACTTCTCCCACATGCTGCGGTCGAGAATCTGGGGGCAGCGGATTTCCTGGTAGCCGTTGTCGCGATAGACCTGGCGCATGTACTGCTCGATTTCCTGCCAGATGGCCCAGCCCTTGGGGTGCCAGAAAACCATGCCCGGCGCCTCGTCCTGCATATGGAACAGATCCAGATGCTTGCCGATGCGGCGGTGATCCCGCTTCTCGGCCTCTTCCAGCATGTGCAGGTAGGCTTCCTGGTCTTCCTTCTTGGCCCAGGCGGTGCCGTAGATGCGCTGCAGCATCTCGTTCTTGGAATCGCCGCGCCAGTAGGCACCGGCCACCTTCATCAGCTTGAAAACCTTGAGCTTGCCGGTGGAGGGGACGTGGGGGCCGCGACAAAGGTCCACAAATTCGCCTTCCCGGTACAGGGAAACATCCTGGTCGGCCGGGATGGCAGCGATCAGCTCGGCCTTGTACTTCTCACCCTGCTCCAGGAAAAACTTGACCGCATCGTCCCTGGCCCAGACTTCGCGGCTTACCGGAATGTCGCGCTTGGCCAGCTCGGCCATTTTCTTCTCGATGGCCAGCAGGTCTTCAGGGGTAAAGGGGCGCTTGTAGGCGAAGTCGTAGTAGAAGCCGTTGTCGATGACCGGACCGATGGTCACCTGGGCTTCGGGAAACAGCTCCTTCACCGCATAGGCCAGCAGGTGGGCCGTGGAGTGGCGGATGATCTCCAGGCCGTCCGCATCCTTGTCGGTGACGATGGCCAGCTGGGTATCACGCTCCATCAGGTGGGACGTATCCACCAGCTTGCCGTCCACCTTGCCGGCCAGGGCGGCACGGGCGAGGCCGGCACCGATGGAGGCAGCAACCTCGGCAACGGTCACCGGAGCATCGAAGGAGCGGATGGAACCGTCGGGCAGCGTGATATTGGGCATGATTTCTCCAGCCACATTAAATTCTGGACAAAAAAAAAGTGCGGACGGATCCGCACTTTTTTTCGACATAGGAACGAAGGCGAACCGCTTAGGCTTTCTGAACCAGGATAGTGCAAGTTCGGAAATTCACGGTACAGCTCCGTTCAAGGTGTTGGTAGGCGCGATTGGATTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGCGCCTGCAAAGAAGGCCGAATTATAACGATCCCAGAGAATCGCGCAAGGGGCTGAGGAGACTTTTTTCGCTTCGCCCCACAAAAAAAATCAGCACAACCGTCCGGGCGAACATTTGGCCAGGGCCGGCGCCACTGCGGTGACGGGGCGCGACTCCTGGATCCAGGCGCGGATGCCCTGGCGCACGTTGTAGATCTTGCTGTAGCCGGCCTGCTGTTCGAGGAAGTCGCTGACGGCCCGGGTGCGGTTGCCGCTGCGGCAGATGAGGATCACCGGCTGCTCCGGACCTGCCACCGTCTTCAGCTGCTCCAGCCAGGCCGCCGGATTGGCGCGGCCGTTGGCATCGAAGAAGGTGAGCAGGCGGCTGCCGGGAATGACGCCGGTTTCCCGCCATTCGGGCTCGGTACGGATATCCACCAGGACCACGCCACTGGCCACCAGCCGGGCCACTTCGGCGCTGTCCACATTCACCACCTCGGCCCTGGCCGCGAACGCCGCCAGCAGAGCCAGGAGGAAGAGGAAGGTGTGCTTCACGCCGCTACCTCCGGCCAGGCGGCCAGGAATTCCTGCCAGTGGGGCTTGTCCAGCTTGGCCAGTTCGGCCTTGATGAAGGCCAGTTCGGCCAGATGCTCCTCCCGGCTGACCTCGCCCCGCATCAGGCGGAAGCGGGTGGACACCAGGTAGGTATTGACCACATCGGTCTCGCAGTAATCGCGGATCTCGTCGGCCTTGCCCTCCTGCCAGGCCTGCCACACCTTGCCGCCGTCCATGCCCAGCTTGCCGGGGAAGCCCATGAGCTTGGCCAGGTCGTCCAGGGGAGCGCTGGCCCGGGGCTGATACATGGCCAGCAGGTCCATCAGGTCCAGGTGGCGGGTGTGGTAGCGGCTGATGTAGTTGTTCCACTTGAAGTCCCGGGAATCCGCATAGTCGCCGTCGCCCAGGTCCCAGTAGCGGGGAGCCACGACGCCGTGGATCAGGCCCCGGTAGTGCAGCACCGGCAGGTCGAAGCCGCCGCCGTTCCAGGAAACAATCTGGGGCGTGAATTTCTCGATGCCGTCGAAGAAGCGCTGGATGATCTCCCCTTCGCCAATCTCGGGAGCCGCCAGGGACCACACCTTGAAGGCATCCCGGGCCCGCAGGGCGCAGGAGATGGTCACCACCCGCTGCAGGTGCAGGGGCAGGAAGTCGCTGCCGTTCTGGGCCCGGCGCTGCTGGAAGGCCAGTTCGGCCACCTCGTCGTCGGAGAGGTCGGCGGGAAGGTCGTGCAGGCGGCGCAGCCCGGGTACATCCGGGATGGTTTCAATATCGAAAACGAGGACGGGAACCATGGGACTCAGGCGGGGAAAACGCCGGTGGACAGATAACGGTCGCCCCGGTCGCAAACGATGGTGACGATGGTGGCGTTCTCCAGCTCCCGGGCCAGGCGCAGGGCCACGGCCAGGGCGCCGCCGGAGGAGATGCCGGCGAACAGGCCCTCTTCCCGGGCCAGGCGCCGGGTCATGTCCTCGGCCTCGGCCTGGGACACGTATTCGAGGCGGTCCACCCGGCTGCGCTCGTAGATTTTCGGCAGATAGGCTTCGGGCCATTTGCGGATGCCGGGAATCTGGCTGCCCTCTTCCGGCTGGCAGCCGACGATCTGGATGCGCGGATTCTTTTTCTTGAGAAACTGGGAAGTACCGATGATGGTGCCGGTGGTGCCCATGCTGGAAACGAAATGGGTCACCTGGCCCTTGGTGTCGCGCCAGATTTCCGGCCCGGTGCCCTCGAAGTGGGCCAGGGGGTTATCCGGATTGGCGAACTGGTCGAGGATGATGCCCTTGCCTTCGTCGCGCATTTTCTCGGCCACGTCCCGGGCCAGTTCCATGCCGCCATCCCGGGGCGTCAGGATCAGTTCGGCGCCGTAGGCACGCATGGTCTGGCGCCGCTCCAGGCTCTGGTTTTCCGGCATGACCAGGATCATGCGGTAACCGCGCATGGCGGCGGCCATGGCCAGGGCGATGCCGGTGTTGCCGGAGGTGGCTTCGATCAGGGTATCGCCGGGCCGGATTTCGCCCCGCTGCTCGGCGTGGGAAATCATGGAGAGGGCCGGGCGATCCTTCACCGAACCGGCCGGATTGTTGCCTTCCAGCTTGGCCAGGATGACGTTGCCGCGCTGGGCGACAACATCGCCGGGCAGGCGCTTCAACTGCACCAGGGGCGTGTTGCCGACGAAATCTTCCAGAGTCTTGTACATGGCTCAGCGTCCGTTGAGGAATTGCACGTAGTCGGCGACGCCTTCGGCCACGGTGGCGAACTCGTCACCATAGCCGGCGCTGCGCAGCTTGCTGAGGTCGGCCTGGGTAAAGCTCTGGTACTTGCCTTTGAGGGCTTCGGGGAAGGCCACGTACTCCACCAGCCCCTGCTGCACCATCGCCTCCAGGGACAGAGCCGGCTTGCCCTCGGCGGCCCGGCAGCTGTTCACCGTGGCCACGGCCACGTCGTTGAAGCTCTGGGCCCGGCCGGTGCCCAGGTTGAAGATGCCGGATTTTTCCGGATGGTCGAGGAAGTACAGATTGACCTTGGCCACGTCCTTCACATAGACGAAGTCGCGCTGCTGCTCGCCGTTGGCGTAGCCGTCGCAGCCCTCGAACAGCTTGACCTTGCCTTCGGCCCGATACTGGTTGAAGTGGTGGAAGGCGACCGAGGCCATGCGCCCCTTGTGGCTCTCGCGGGGGCCGTAGACGTTGAAGTAGCGGAAGCCCACCACCTGGGAACGGACTTCAGGCAAGCGCTGACGGACGATCTGGTCGAAGAGGAACTTGGAGTAGCCGTACACGTTGAGAGGCGCCTCGTACTGCCGCTCTTCCTTGAAAACGCTGCTGCCGCCATAGGTGGCGGCGGAAGAGGCGTAGAGCAGCTGCACGTCCTGCTCCAAGCACCAGTCCAGCAGGGCCAGGGAGTAGCGGTAGTTGTTCTCCATCATGTAGCGGCCGTCGGTCTCCATGGTGTCGGAGCAGGCGCCTTCGTGGAAGATGGCCTCCACGTCGCCGTCGAAGTGGCCGCAGAGCAGGCGCTCGAGAAACTCGCCCTTGTCCAGGTAATCGGCGATCTCGCAGTCCACCAGATTCTTGAACTTGTCCGCCTTGGTCAGGTTATCCACGGCGATGATGCGGGTGATGCCCCGCTCGTTGAGGGCCTTGACCAGGTTGGCGCCGACGAAGCCGGCGGCACCGGTGACGATGTAGTACATGTGAATTCCTTAATTGGCGTCGGCCGCCAGGGCGGCCGACAGTTCCTCGCGACTGACGGTAGCGGTACCGAGCTTGCCCACCACCACGCCACCGGCCAGGTTGGCCAGGTGGATGGCGTCGCCCCAGGGGGCGCCCAAGGCCATCATGGCGGCCAGAGTGGCAATGACCGTGTCGCCGGCACCGGAGACGTCGAACACTTCCTGGGCCCGGGCCGGCTGGTGCAGCGCCTCGCCGTCGCGATAGAGGCTCATGCCCTCTTCGCTACGAGTCACCAGCAGGGCATCCAGTTCCAGTTCGCTGCGCAATTGCTGGGCCTTGGCCGCCAGCTGGGCCTCGTCGCTCCAGCGCCCCACCACCTGGCGCAGCTCGGAGCGGTTGGGGGTGATGACGGTGGCGCCGCGGTACTTGGAATAATCCTCGCCCTTGGGATCCACCAGCACCTTTTTCCCGGCCGCCCGGGCCAGGCGGATCATGTCGCCGATGTGGGCCAGGCCGCCCTTGCCGTAGTCGGAAAGGATCACCACATCCACCCCGGCCAGGCGCTGCTCGAACTCGGCCAGCTTGGCCTGCAGCACCTCGTGGGACGGGGTGGTCTCGAAATCGATGCGCAGCAGCTGCTGCTGGCGGCCGATGACCCGCAGCTTGACCGTGGTGTCGATGGCTCCGTCGGGCAGCAGGGAGGCGGCAATGCCGCCCTCTTCCATCTGCCGCTGCAGGATGCGGCCGGCCTCGTCGTTGCCGACCACGGACAGCAGGCCGACCCGCGCCCCCAGGGAAGCGCAGTTGCGGGCCACGTTGGCAGCACCGCCGGGACGCTCTTCGGAGCGTTCCACCTTGACTACCGGCACCGGTGCTTCGGGGGAAATGCGGGAAACATCGCCGAACCAGTAGCGGTCCAGCATGACATCGCCGACCACAAGAATGCGGGCGGCGGAAAAATCGGGAAGTTGGTGCATGGTGAAGATACTCAGTCGAAAAGATCGGCCTGGGCGAAGGGCGTGCCCTGCATATCCCGGGCCGAAAGACGGGGCTCCAGCCCCTCCAAAGGCCAGGCCACGGCCAAGGCCGGATCGTTCCAAGCGATGCAGCGCTCATGTTCCGGCGCGTAATAGTCGGTGGTTTTATAAAGAAAATCGGCGGTCTCGCTGAGCACCAGGAAACCGTGGGCAAAACCGGGAGGTACCCACATTTGGCGCTGATTATCCGCGGAAAGTACGGCACCAACCCAGCGGCCGAAATAGGGGGATTGGCAACGCAAGTCAACCGCCACGTCGAAAACGGCGCCTTGAGCCACTCGAACCAGCTTCCCTTGAGGCTGGCGAATCTGATAATGCAGGCCCCGCAGCACGCCTCGTGCGGAACGGGAGTGGTTGTCCTGGACGAAATCCACATCGGCCCCAGTCAGCTCGGTAAAGCGACGCCGGTTGTAACTTTCCATAAAAAAGCCGCGAGCATCGCCGAAAACCAGGGGTTCCAGCATGATCACATCGGCAATGGCGCTGGGAATCGCCTTCACGCTGCATGCTCCTTGAGCAGGGCCAGCAGGTATTGGCCGTAGCCGTTCTTGGCCAGGACCCGGGCCTGGGCTTCCAGAGTGGCGTCGTCGATCCAGCGTTGGCGCCAGGCCACCTCTTCGGGACAGGCCACCTTGAGACCCTGGCGCTTCTCGATGGTCTCGATGAACTGCCCCGCCTCCAATAGAGATTCGTGGGTACCGGTATCGAGCCAGGCATAGCCCCGGCCCATGATTTCCACATTGAGCTTGCCCGCTTCCAGGTAATGCCGGTTCACGTCGGTGATTTCCAGTTCGCCCCGGGGCGAGGGCTTGATGCCCTTGGCCACGGCCACGATATCGGTATCGTAGAAGTAAAGGCCGGTGACGGCATAGTGGGACTTGGGCTGCAGCGGTTTTTCCTCGATGGAGAGGGCCCGCTGCTGGGCATCGAACTCGACCACACCGTAGCGCTCCGGATCATTCACCCGGTAAGCGAAGACCGAGGCACCGCTGTCCCGATCGTTGGCCCGCTGCACCAGGGTGGCCAGGTCGTGGCCGTGGAAGATGTTGTCCCCCAGCACCAGGGCCGCCGGGGCACCGTCAAGGAAGGCCTCGCCGATGAGGAAGGCCTGGGCCAGGCCGTCCGGCGAGGGCTGCACCGCGTACTGCAGATTGATGCCCCACTGGGAGCCGTTTCCCAGCAGCTGCTCGAAACGGGGCGTGTCCTGGGGCGTGGAGATGATGAGGATGTCCCGCAGACCGGCCAGCATCAAGGTGGTCAGGGGGTAGTAGATCATCGGCTTGTCGTAGATCGGCAGCAGCTGCTTGGACACCGCCAGGGTGGCCGGATAGAGCCGGGTGCCGGAACCGCCGGCGAGAATGATGCCTTTACGGGGTTTAGTAGCCATTCTGGGCCTTCAGGGCGAGAAGCTGCATCATGCGGGAAAGATAGGGCTGCCAGTCGGGCATGGTCAGACCGAAACGGTCCTCCAGCTTGCGGCAGTCGAGACGGGAATTGAGGGGGCGCGGCGCCGGCAGCGGATATTCGCTGCTGGGAATGGGTGCGATGGCCTCGGGCCCCAGCTTGAGGGCGAAGCCGGGCGTCTGTTCGGCGGTGGCGACGATGGCCCGGGCAAAACCGTTCCAGCTCACCGGATTGGCAGCCACCAGGTGATACAGCTCGCAGCCCTGCTGGGCGCGCCCGCCGTCGAGCTGGGCCAGGACCATGCCGGTGACGGTGGCGATCATGGCCGCCGGCGTCGGGCTGCCGACCTGGTCGGCCACCACCTTGAGGCTGTCCCGCTCGCTCGCCAGGCGCAGGATGGACTTGACGAAATTCTTGCCCCGGGCGCCGAAGACCCAGCTGGTGCGGAAGATGAGGCCGCGGCCGCCCACGGCCAGCATGGCTTCCTCCCCTTCCCGCTTGGTCCGGCCATAGACGCCGAGGGGCGCCGTGGCATCGGACTCCACATAGGGCGCCGCCTTGCTGCCGTCGAAGACGTAGTCGGTGGAGTAATGCACCAGCAGGGCATCCAGGGCCTTGGCCTCCTCGGCCAGCAGGCCCACGGCCTCGGCATTGATGCGCCGGGCCAGTTCAGGCTCCATTTCCGCCTGATCCACCGCCGTATAGGCGGCGGCATTGACGATCAGGCGGGGGCGCTGTTCCCGCACCACGGCCCGCAGCCGGTCCAGGTCGGCCAGGTCGCATGTACGCCGGTCCAGGGCCAGCACCGGCCCCAGGGGCGCCAGGTCCCGCTGCAGTTGCCAGCCCAGCTGGCCCTGGCTGCCCAGGAGCAGGATGGGAGCCGACACCTCAGGCTTCCCCATACTGGCGGCCCACCCACTCCCGGTAGGCGCCGGAGGTGACGTTGTGCACCCACTGGGGATTGTCCAGGTACCAGCGTACGGTCTTGCGGATGCCGGTTTCGAAGGTCTCCGCCGGCTTCCAGCCCAGCTCCCGTTCCAGCTTGCTGGCGTCGATGGCATAGCGCCGGTCGTGGCCGGGCCGGTCGGCAACGAAAGTGATCTGGCTGGCGTAGGAGGCGCCGTCGGCCCGGGGCGACAGTTCGTCGAGCATGGTGCACAGGGTATGCACCACCTCCAGGTTGGGCTTTTCGTTCCAGCCGCCCACGTTATAGGTCTCACCCAGGCGGCCGGCTTCCAGGACGCGGCGGATGGCACTGCAATGGTCCTTCACATAGAGCCAGTCGCGGATCTGCTGGCCGTCGCCGTAGATGGGCAGGGGCTTGCCGGCCAGGGCGTTGTGGATGATGAGGGGAATGAGCTTTTCCGGGAAATGGTAGGGCCCGTAATTGTTGGAGCAGTTGGTGGTCAGCACCGGCAGGCCGTAGGTATGGTGGTAGGCCCGCACCAGATGGTCGGAGGCCGCCTTGCTGGCCGAATAGGGACTGTTGGGCTCGTAGCGGTGCTGCTCGGTGAAGGCCGGGGCCTCCTTTTCCAGGGAACCGTAAACCTCGTCCGTGGACACGTGGAGAAAGCGGAAGGCCGCCTTGTCGTCGGCAGGCAGGCCGTTCCAGTAGGCCCGCACGGCCTCCAGCAGGCGGAAGGTGCCGACGATGTTGGTCTGAATGAAGTCCTCCGGCCCGTGGATGGAACGATCCACATGGCTCTCGGCGGCGAAGTTCACCACCGCCCGCACCCGGTTCTGCTGCAGCAGTTCCAGAATCAGGTCGTAGTCGGCGATGTCGCCGCGCACGAAACGGTGCCGCGGATCGCCGGCCAGTCCCTGGAGGTTCTCCAGATTGCCGGCGTAGGTCAGCTTGTCCAGGTTGATGACAGGCTCACCCCCCGCCGCCAGCCAGTCGATGACGAAATTGCTGCCGATGAAGCCTGCACCGCCGGTCACCAGGATCATGGCGGACTCCTTATTGACGACCGATGGCCTGGTAGTCGATGCCGAACTGGCACACCTGCTTGGGTTCGTACAGGTTGCGGCCGTCGAAGATCACCGGCTGCTTCAGCTTGGCCTTGATGGCCTCGAAATCGGGACTGCGGAATTCCTTCCACTCGGTGACGATGAGCAGGGCATCCGCCCCATCCAAGGCGGCCATGGGGCTCTCCGCATAGCTCAGACGCGGCTCATCGCCGAAAATGCGCCGGGCTTCATGCATAGCCACCGGATCGTAGGCCACCACGGTGGCGCCAGCGGCGAAGAGATCGGCCAGCAGATAACGGCTGGGGGCCTCGCGCATATCGTCCGTATTGGGCTTGAATGCCAGGCCCCAGACGGCGAACTTGCGGCCGCTGAGGTCGTTGCCGAAACGCTTCACGGTCTTGGCGGTGAGCACGTGCTTCTGGGCATCGTTGGCGTCTTCCACCGCATTGAGGACCTTCATCTCCATGCCGGCGTCGAGGCGGGCGGTGCGCTGCAGGGCCTGCACATCCTTGGGGAAGCAGGAACCGCCGTAGCCGCAGCCTGGATAGAGGAAGTGGTAGCCGATGCGCGGGTCGGAACCGATGCCCTGGCGCACCTGCTCGATGTCGGCACCCAGCTTCTCGGCCAGGTTGGCCAGTTCGTTCATGAAGCTGATGCGGGTGGCCAGCATGGCGTTGGCGGCGTACTTGGTCAGTTCGGCGGAACGCACATCCATGACGATGAGGCGCTCGTGGTTGCGCTGGAAGGGCGCATAGAGGGCGCGCATCAACTCGATGGCGCGCTCGTCCTCGGCGCCGACGACGATGCGGTCCGGCCGCATGAAATCCTCCACGGCGGCGCCTTCCTTGAGGAATTCCGGATTGGAGACGACGCTGTAGGCGATATCCGCTCCCCGGGCCTTGAGCTCGTCGGCGATGGCGGCGCGCACCTTGTCGCCGGTGCCCACGGGCACGGTGGACTTGTCCACCACCACCTTGTAGTCGCCCATGTGGCGGCCGATGTTGCGGGCGGCGGCGAGCACGTACTGCAGATCGGCGGAACCGTCCTCGTCCGGCGGCGTGCCGACGGCAATGAACTGGATGGTGCCGTGGGCCACGGCCTGTTCCACATCGGTGGTGAAACGCAGGCGGCCGGCGGCCACGTTGCGCTTCACCATGTCCAGCAGGCCCGGTTCGAAAATGGGAATGCCGCCTTCGTTGAGAATCCGGATCTTTTCCGGATCCACATCCAGGCACAGCACATCGTTGCCCACCTCGGCCAGGCAGGTACCGCTCACCAGGCCCACATAGCCCGTACCGACAACTGTAACTTTCAAATTATTCTCCCTGAAGGGTCATTGATCCGCCCAAACCGACGCATTCATGGGCTCAGGGGATCAGATCGAATTCTTCCGTGCGCCGGGGCGGATAGGTTTCCCAGCCGCCGCAGGCAGGACAGCGCCAGTAGAAGTGGCGCGCCTTGAAGCCGCAATTATCGCACCGATAGCGGGCCAGCCGGCGGGTGTGGTTGTGCACCAGATTGCGCACCAGCTCCAGGTCCGCCCGCTTTTCCGGCGGCACGCCGAGCAGCTGGGCTTCCAGCAGGCGGTCCAGGCCCAGCAGGGTCGGATTGCGCCGCAGTTCGTCCCGCACCAGGCGATAGGCCGCCTCCGGCCCCTCCGCATCCATCACCAGCTGGAATACGGTTTCCAGCAGATCGAGGGAGGGGTAGCTGGCCAGATAGCCGCGCAGCAGTTGCAGCCCTTCGTCCCGCTGCCCCTGGGCCAGGTAGGCATCCTGGAGCTTGCGGGCGACGATGGCCAGGTAGGCCGGATTCTGGCTCTCGATGCGCTTCCAGGCCTCGATGGCCGCTGCCAGATCACCCGCCTGCTGCAACAAGTCGCCCTGCAGGACGCTGGCGCGCACGCAGTTGCGGTGCAGGGAAAGGGCCGAGTCCAGGTACTGGCGGGCGGAATCGGGCCGGGAATTGATCATCTCCCCGGCTGCCAGCTCGCAGTAATAGTTGGCGATTTCCTTCTGGGTGGCGTAGTCAGGCATTTCCTTGGCGATGGCGATGGCCTTCTGCCAGTCCTTCTCCTGCTGGTAGATTTCCAGCAGGTTGCGCTTGGCCTCCTCGTCCCGGGAGGTGCCGCGCAAACGGGAAAACACCTCTTCGGCCCGGTCCAGCAGACCGGCCTTGAGGAAGTCCTGGCCAAGCTCGGAAAGGGCCTGCAGCTTCAGCTCCGGAGACAGGTCCACCCGCTCGATGAGGTTCTGGTGCATGCGGATGGCCCGCTCGGTCTCGCCGCGGCGGCGGAACAGGTTGCCCAGGGCGAAGTGCAGTTCCACCGTCTGGGGATCCACCTTCACCACTTCGATGAAGGCTTCGATGGCCTTGTCCGGCTGCTCGTTGAGCAGGAAATTCAAGCCCTGGAAGTAGGACCGGGGCAGGGCCCGGGATTCCCGCACCAGATGCTTGATGTCGATGCGGGCGGCGGCCCAGCCAAGGACGAAAAACAGGGGAAAGAGCAGTAACTGCCAGTACTCGAATTCGATCATTGGGCCACGGCCTCGCCGGCAGGAGGCGGTTGCACGGCAGCGTCGCTGGCGACCGGCTTGAGCACGGCCTGGCGCTCCCGCTCCCGCGCCAGTTCACGGCGGGTGCGCGACAGTTCGCGGCGCAACTGGAACAGGGTGCCGAGCAGGGACAGGGCACCTAAGGCCGTCCCGGCGGCGAAGAAACCGAGCAGGATGATCACCAGGGGCGCCTGCCACTGGGTATCGAAGAAGAAGCGCAGACTCACCGGATCGCTGTTCATTGCCGCAAAACCCAGCAAGAAGAAGAAGATGATGAGCCGGATGATCAAGATCAGGGCGCGCATAGCCTTATCCGCAAGCAATAAAAAAGGCGGCAACCCTAGGGTTGCCGCCCGCAATCTACCACAGGACAGGGGGTAAGGTCAGCGTCAGGCGGAGAGAGAGGCCAACAGCCCCAGGTCCACCCGTTCGCGCAGTTCCTTGCCTGCCTTGAAATGGGGTACGTATTTTTCCGGAACGCTGACTTTATCCCCGGACTTGGGGTTACGCCCCATTCGCGGCGGCCGGTAGTTCAGCGCGAAGCTGCCGAACCCCCGAATCTCGATACGGTCACCGTGGGCCAGAGCCTCGGTCATGGCATCGAGAATTTCCTTGACCGCGAAATCAGCGTCTTTCGCCACCAGTTGCGGAAACCGCATGGCCAGGCGGGCGATCAGCTCGGATTTGGTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021844|1397060:1410067|1408187_1409357_-|WP_172974712.1|DBSCAN-SWA MIEFEYWQLLLFPLFFVLGWAAARIDIKHLVRESRALPRSYFQGLNFLLNEQPDKAIEAFIEVVKVDPQTVELHFALGNLFRRRGETERAIRMHQNLIERVDLSPELKLQALSELGQDFLKAGLLDRAEEVFSRLRGTSRDEEAKRNLLEIYQQEKDWQKAIAIAKEMPDYATQKEIANYYCELAAGEMINSRPDSARQYLDSALSLHRNCVRASVLQGDLLQQAGDLAAAIEAWKRIESQNPAYLAIVARKLQDAYLAQGQRDEGLQLLRGYLASYPSLDLLETVFQLVMDAEGPEAAYRLVRDELRRNPTLLGLDRLLEAQLLGVPPEKRADLELVRNLVHNHTRRLARYRCDNCGFKARHFYWRCPACGGWETYPPRRTEEFDLIP >NZ_AP021844|1397060:1410067|1403941_1404832_-|WP_152089522.1|DBSCAN-SWA MATKPRKGIILAGGSGTRLYPATLAVSKQLLPIYDKPMIYYPLTTLMLAGLRDILIISTPQDTPRFEQLLGNGSQWGINLQYAVQPSPDGLAQAFLIGEAFLDGAPAALVLGDNIFHGHDLATLVQRANDRDSGASVFAYRVNDPERYGVVEFDAQQRALSIEEKPLQPKSHYAVTGLYFYDTDIVAVAKGIKPSPRGELEITDVNRHYLEAGKLNVEIMGRGYAWLDTGTHESLLEAGQFIETIEKRQGLKVACPEEVAWRQRWIDDATLEAQARVLAKNGYGQYLLALLKEHAA >NZ_AP021844|1397060:1410067|1400532_1401432_-|WP_152089518.1|DBSCAN-SWA MYKTLEDFVGNTPLVQLKRLPGDVVAQRGNVILAKLEGNNPAGSVKDRPALSMISHAEQRGEIRPGDTLIEATSGNTGIALAMAAAMRGYRMILVMPENQSLERRQTMRAYGAELILTPRDGGMELARDVAEKMRDEGKGIILDQFANPDNPLAHFEGTGPEIWRDTKGQVTHFVSSMGTTGTIIGTSQFLKKKNPRIQIVGCQPEEGSQIPGIRKWPEAYLPKIYERSRVDRLEYVSQAEAEDMTRRLAREEGLFAGISSGGALAVALRLARELENATIVTIVCDRGDRYLSTGVFPA >NZ_AP021844|1397060:1410067|1409353_1409680_-|WP_152089526.1|DBSCAN-SWA MRALILIIRLIIFFFLLGFAAMNSDPVSLRFFFDTQWQAPLVIILLGFFAAGTALGALSLLGTLFQLRRELSRTRRELARERERQAVLKPVASDAAVQPPPAGEAVAQ >NZ_AP021844|1397060:1410067|1406809_1408135_-|WP_152089525.1|DBSCAN-SWA MKVTVVGTGYVGLVSGTCLAEVGNDVLCLDVDPEKIRILNEGGIPIFEPGLLDMVKRNVAAGRLRFTTDVEQAVAHGTIQFIAVGTPPDEDGSADLQYVLAAARNIGRHMGDYKVVVDKSTVPVGTGDKVRAAIADELKARGADIAYSVVSNPEFLKEGAAVEDFMRPDRIVVGAEDERAIELMRALYAPFQRNHERLIVMDVRSAELTKYAANAMLATRISFMNELANLAEKLGADIEQVRQGIGSDPRIGYHFLYPGCGYGGSCFPKDVQALQRTARLDAGMEMKVLNAVEDANDAQKHVLTAKTVKRFGNDLSGRKFAVWGLAFKPNTDDMREAPSRYLLADLFAAGATVVAYDPVAMHEARRIFGDEPRLSYAESPMAALDGADALLIVTEWKEFRSPDFEAIKAKLKQPVIFDGRNLYEPKQVCQFGIDYQAIGRQ >NZ_AP021844|1397060:1410067|1403396_1403945_-|WP_152089521.1|DBSCAN-SWA MKAIPSAIADVIMLEPLVFGDARGFFMESYNRRRFTELTGADVDFVQDNHSRSARGVLRGLHYQIRQPQGKLVRVAQGAVFDVAVDLRCQSPYFGRWVGAVLSADNQRQMWVPPGFAHGFLVLSETADFLYKTTDYYAPEHERCIAWNDPALAVAWPLEGLEPRLSARDMQGTPFAQADLFD >NZ_AP021844|1397060:1410067|1404821_1405751_-|WP_152089523.1|DBSCAN-SWA MGKPEVSAPILLLGSQGQLGWQLQRDLAPLGPVLALDRRTCDLADLDRLRAVVREQRPRLIVNAAAYTAVDQAEMEPELARRINAEAVGLLAEEAKALDALLVHYSTDYVFDGSKAAPYVESDATAPLGVYGRTKREGEEAMLAVGGRGLIFRTSWVFGARGKNFVKSILRLASERDSLKVVADQVGSPTPAAMIATVTGMVLAQLDGGRAQQGCELYHLVAANPVSWNGFARAIVATAEQTPGFALKLGPEAIAPIPSSEYPLPAPRPLNSRLDCRKLEDRFGLTMPDWQPYLSRMMQLLALKAQNGY >NZ_AP021844|1397060:1410067|1401435_1402428_-|WP_152089519.1|DBSCAN-SWA MYYIVTGAAGFVGANLVKALNERGITRIIAVDNLTKADKFKNLVDCEIADYLDKGEFLERLLCGHFDGDVEAIFHEGACSDTMETDGRYMMENNYRYSLALLDWCLEQDVQLLYASSAATYGGSSVFKEERQYEAPLNVYGYSKFLFDQIVRQRLPEVRSQVVGFRYFNVYGPRESHKGRMASVAFHHFNQYRAEGKVKLFEGCDGYANGEQQRDFVYVKDVAKVNLYFLDHPEKSGIFNLGTGRAQSFNDVAVATVNSCRAAEGKPALSLEAMVQQGLVEYVAFPEALKGKYQSFTQADLSKLRSAGYGDEFATVAEGVADYVQFLNGR >NZ_AP021844|1397060:1410067|1399289_1399733_-|WP_152089517.1|DBSCAN-SWA MKHTFLFLLALLAAFAARAEVVNVDSAEVARLVASGVVLVDIRTEPEWRETGVIPGSRLLTFFDANGRANPAAWLEQLKTVAGPEQPVILICRSGNRTRAVSDFLEQQAGYSKIYNVRQGIRAWIQESRPVTAVAPALAKCSPGRLC >NZ_AP021844|1397060:1410067|1399729_1400527_-|WP_014237027.1|DBSCAN-SWA MVPVLVFDIETIPDVPGLRRLHDLPADLSDDEVAELAFQQRRAQNGSDFLPLHLQRVVTISCALRARDAFKVWSLAAPEIGEGEIIQRFFDGIEKFTPQIVSWNGGGFDLPVLHYRGLIHGVVAPRYWDLGDGDYADSRDFKWNNYISRYHTRHLDLMDLLAMYQPRASAPLDDLAKLMGFPGKLGMDGGKVWQAWQEGKADEIRDYCETDVVNTYLVSTRFRLMRGEVSREEHLAELAFIKAELAKLDKPHWQEFLAAWPEVAA >NZ_AP021844|1397060:1410067|1402437_1403385_-|WP_152089520.1|DBSCAN-SWA MHQLPDFSAARILVVGDVMLDRYWFGDVSRISPEAPVPVVKVERSEERPGGAANVARNCASLGARVGLLSVVGNDEAGRILQRQMEEGGIAASLLPDGAIDTTVKLRVIGRQQQLLRIDFETTPSHEVLQAKLAEFEQRLAGVDVVILSDYGKGGLAHIGDMIRLARAAGKKVLVDPKGEDYSKYRGATVITPNRSELRQVVGRWSDEAQLAAKAQQLRSELELDALLVTRSEEGMSLYRDGEALHQPARAQEVFDVSGAGDTVIATLAAMMALGAPWGDAIHLANLAGGVVVGKLGTATVSREELSAALAADAN >NZ_AP021844|1397060:1410067|1409764_1410067_-|WP_014237016.1|DBSCAN-SWA MTKSELIARLAMRFPQLVAKDADFAVKEILDAMTEALAHGDRIEIRGFGSFALNYRPPRMGRNPKSGDKVSVPEKYVPHFKAGKELRERVDLGLLASLSA >NZ_AP021844|1397060:1410067|1405737_1406799_-|WP_152089524.1|DBSCAN-SWA MILVTGGAGFIGSNFVIDWLAAGGEPVINLDKLTYAGNLENLQGLAGDPRHRFVRGDIADYDLILELLQQNRVRAVVNFAAESHVDRSIHGPEDFIQTNIVGTFRLLEAVRAYWNGLPADDKAAFRFLHVSTDEVYGSLEKEAPAFTEQHRYEPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYHFPEKLIPLIIHNALAGKPLPIYGDGQQIRDWLYVKDHCSAIRRVLEAGRLGETYNVGGWNEKPNLEVVHTLCTMLDELSPRADGASYASQITFVADRPGHDRRYAIDASKLERELGWKPAETFETGIRKTVRWYLDNPQWVHNVTSGAYREWVGRQYGEA >NZ_AP021844|1397060:1410067|1397060_1398977_-|WP_014237029.1|tRNA|DBSCAN-SWA MPNITLPDGSIRSFDAPVTVAEVAASIGAGLARAALAGKVDGKLVDTSHLMERDTQLAIVTDKDADGLEIIRHSTAHLLAYAVKELFPEAQVTIGPVIDNGFYYDFAYKRPFTPEDLLAIEKKMAELAKRDIPVSREVWARDDAVKFFLEQGEKYKAELIAAIPADQDVSLYREGEFVDLCRGPHVPSTGKLKVFKLMKVAGAYWRGDSKNEMLQRIYGTAWAKKEDQEAYLHMLEEAEKRDHRRIGKHLDLFHMQDEAPGMVFWHPKGWAIWQEIEQYMRQVYRDNGYQEIRCPQILDRSMWEKSGHWEHYKNNMFTTESEKRDYAIKPMNCPGHVQVFNSDLRSYRDLPLRYGEFGSCHRNEASGALHGLMRVRGFVQDDGHIFCTEDQIEAEVTAFNALVKKVYADFGFDQVAVKLALRPESRVGSDDIWDKAENALRAGLKASGLEWDELPGEGAFYGPKIEFHIKDAIGRSWQCGTMQVDFSMPGRLGAEYVGEDNARHVPVMLHRAILGSLERFIGILVENYAGALPLWLAPVQAVVLNISEKQADFSAEVVKTLRQAGLRAEADLRNEKITYKIREHSLNRLPYQLVIGDKEKEAGLVAVRTRGGQDLGQMPVASLIERLLEEVAARKSTA |
14 | Prochlorococcus_phage(20.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_2 |
2569244 : 2576915
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021844|2569244:2576915|DBSCAN-SWA ATCAGGCGGTGATGCGGGCCTGCTTGGCCAGCTTGGCCTTGATGCGGCCCTGTTTGAGCTGGGACAGGTGATCGACGAAAACCTTGCCCTGCAGGTGATCCATCTCATGCTGGATGCAGACGGCCAGCAGGCCGTCCGTTTCCAGGGAACAGGTCTTGCCTTCCAGGTCCAGGTAACGCACGGCGATATGCTCGGCCCGCTCGACCTTGTCGTAAATGCCCGGCACCGAGAGACAGCCTTCTTCACCCACCTGTAGTCCGTCGCGGTGGGTGATTTCCGGATTGATCAGCACCAGGAGTTCGTCCTTGGTTTCCGACACGTCGATGACAATCACCTGCTTGTGTACATCGACCTGGGTGGCGGCCAGACCGATACCCGGCGCCTCGTACATGGTTTCCGCCATGTCCCGGGCCAGGGCACGAATGCCGTCGTCAATTTTTTCGACCGGAACCGCCACTTTTTTCAAGCGGGGATCCGGGAAGCGCAAAATAGGGAGTAAAGCCATAAAAAGCTTGCCCGAATAATCTATTACATGCAGAATTTAAACCAAATCCCTAGATTCGGGACATGGGTGTGGACAACAAAAGGGCCGCCCTTGTTCCGGCAGCGCGAGGACGCCACGATGATTCGACACCTGTTCGGCCGCCCTGGCCTGAACCCCGCCCGCATTATAGCCACCCTTCTGTTGGCTGTCGCCGCCTCCAGCGCCAGTGCCCAGGAATCTCCCCGCCTCGCCGACAACGCGCCGGACCGCCATATCGTGGTGCCGGGCGACACCCTGTGGGGCATCGCCGGCAAGTTCATCCAGGAACCATGGCGCTGGCCCGAAATCTGGCGCCTGAACAAGGACCAGATCAAGAATCCCCACCGCATCTACCCGGGCGACGTCATCGTCATGGTCACCGGCGAGGACGGCAAGCCACAGCTCAAACTGGCCAAGTCGCTCAAGCTGCAGCCGCGCGAGTACAGCGAAGCGGTCAAGAACGAAATTCCCACCATTCCGCAAAGCATCATCGAGCCCTTCCTGTCCCAGCCCCTGGTGGTGGACCCCAGCGCCATGGACAAGGAAGCCCGCATCATCGCCACCCAGGAAGGACGGGTCTATCTTGGTGGCGGTGATCAGGCCTACGTGGTCGGGGTACGGGAGCCTTCCGAATTGTGGCAGGTCTATCGTCCCGGCAAGGCCATGCTCGATCCCGACACCAAGGAAGTGCTGGGCCATGAGGCGTTCTACCTGGGCACGGCCAGGCTGATTCAACCGGGAGAGCCCTCCGTCATGGAAATGGTGGAGGTGAAGCAGGAGGTCGGCAAGTTCGATCGACTGATGCCCGCATCCCGCCCCGAACTGATCACCTATGCCCCACGTCGTCCGGAAGCCAAGGTTGAGGCCCGCATCATCGCGGTGTACGGCGGTGTTGGCACCGGTGGGCGCTACTCGGTGGTATCCCTGTCCAGGGGCAGCCGCGACGGACTCGAGGTCGGCCACGTCCTAGCCTTGCTGCGCAGTGAAAAGGTCTATGAACAGCGCAACGAACAGGGCGAGCGGGAGTTGGTCAAGGTGCCGCCCCAGCGCTATGGCCTGGTCTTTGTCTTCAGGACGTTTGAACGAGTTTCCTACGCTCTGGTCATGGATGCTGCCTTGCCCCTGTCTCTGGCCGATCTGGTACGCAACCCCTGAGCCCCCGCCGTGGCTGACCCGGCCCTCACCGCCTGGCTCCGGCTGACGCTGGTTCCGGGCGTCGGGCCGGAGACCCAGCGCCATCTCTTGGCCGCCTTCGGCCTACCGGAACAGGTTTTCTCCGCCCCCCGCAGTGCCCTCAAGCAGGTCGTCGGCAAGAAGGCCGATCTGCTGCTCGATACGGACAATCAGGAAGCGGTGGACCGGGCCCTGGACTGGGCCGACAAGCCGGGCAACCGCATCCTGACCCTGGCCGATCCGGACTATCCCCAGCTGCTGCTCGAATCCGCCGATCCGCCCAGCCTGCTCTATGTGAAGGGCCGGGTGGAACTGCTAAACCGGCCTGCCCTGGCCATTGTCGGCAGTCGTAATGCCACGCCCCAGGGCCTCAAGGATGCCGAGGCCCTCGCTGCCGATCTGGCGGCCCAGGGGCTGACCATCGTCAGCGGCCTGGCCCTGGGCATCGACGGCGCCGCCCATCGGGGCGGGCTGAAAGGGGAGGGCGGCAGTGTCGCCATCATCGGCACCGGGGCCGATCGCATCTATCCCTCGCGGCACAAGGAACTGGCGCTGCAGCTGGCGACCGAAGGCGCCATCGTTTCCGAATTTCCCCTGGGTACGCCGGCGGTGGCCCACAATTTTCCGCGCCGCAACCGCATCATCGCCGGCATGGCCAAGGGCTGCCTGGTGGTGGAAGCCGCCCTGGAAAGCGGCTCCCTCATCACCGCCCGTCTGGCGGCCGAACTGGGCCGGGAAGTGTTTGCCATTCCCGGCTCCATCCATTCGCCGGTGGCCAAGGGCTGTCACCGGCTGATCCAACAGGGGGCCAAGCTGGTGCAGGAAGCCCGGGACATTGTGGAAGAAATCGGTCCGTTCGACCCCCCGGGCTGCAGGCCAGCAAGGACGCCACTTTCCACCGGCAATACGCCAACAACCGTGCCAATCCTCGATCCTGGCCAGGCTGCCGTACTCGATGCCCTCGGCCACGACCCGGCCAATCTGGACCAATTGCTGCAGCGCACAGGCTTGACGACGGAAGCCCTATGCGCCATCCTCGTGACGCTGGAACTGGCGGACCACGTTGCCAGTCTTCCCGGAGGCCGCTACCAGCGGCTTTCCCCCACCTGATACGTTGCCAATGTTCGACATCCTCGTCTATCTCTTCGAAAACTACGTCGATTTCGCCGACTTCAGCAAGTCCGGCAATCAACCCGATTCACCCGATTCCCAGGCCGACACGGCCCTCAGCCGCAAACTCACTGCGGCCGGCTTCTCCGAAGAAGAAATCAGCGAAGCCCTGGAATGGCTCCAGGGCCTCAAGGCCACCCTGCCGACCCGCCAGCTGCAGGCCGATTCCCGTTCCCTGCGGGCCTACACGCCGGACGAAAGCGCCCACCTGGGTGCCGACGCCCTGGGCTTCCTGCATTTCCTGGAACAGGCCAAGGTCCTTTCCGCCGACCTGCGGGAACTGGTCATCGAGCGGGCCATGGCCCTGCCGGACGACCGGTTGTCCCTGGGGCGCTTCAAGGTCATCGTGCTGATGGTCCTGTGGAGCCAGGAGCAAAACCTGGATACCCTCATCGTCGAAGAACTGCTCTCCGAAGCGGAACCCGAACACCTGCATTAAGTCCATCCGTCGGGCCTGCCAGCCGGGGCGACTGAGGGGTCGCACCAGATGGCGCAGGCAGCATGATGGGCGAAGCACGCAATATGCAGGCTTCGCGCGTCAAATTAGTGGCTGCTTGCTTGCCAAGGCCCGTAAGTCCCTCTTATCATCCCCCGCACCTCCCGACCGGGACTGCCAGAGGCCCCTGCCATGGGCAAACAGCTCATCATTGCCGAAAAACCTTCCGTCGCTGCCGACATCGCCAAGGCCCTTGGCGGTTTCACCAAGCATGACGACTATTTCGAGAGCGACAACTTCGTTCTCTCCTCTGCTATCGGCCATCTGCTGGAACTGGTGATCCCCGAGGAATACGAGGTCAAGCGCGGCAAGTGGTCCTTTGCCCACCTGCCCGTGATCCCGCCCCACTTCGAACTGAAGCCGGTGGAGAAAACCGAGTCCCGCCTCAAGCTGCTGACCAAGCTGATCAAGCGCAAGGATGTGGACGGCCTGGTGAACGCCTGTGACGCGGGCCGCGAGGGTGAGCTGATCTTCAATTACATCGCCCGCCACGCCAAGTCCGGCAAGGCCGTGCAGCGGCTGTGGCTGCAGTCCATGACGCCCCAGGCCATCCGCGACGGCTTCGCCCGTCTGCGCCGCGGCGAGGAAATGCAGGGGCTGGGCGATGCCGCCGTGTGCCGTTCCGAATCCGACTGGCTGGTCGGCATCAACGGCACCCGGGCCATGACCGCCTTCAACTCCAAGACCGGCGGCTTCCACCTCACCACCGTGGGCCGGGTGCAGACCCCAACCCTATCCCTGGTGGTGGAGCGGGAACGCAAGATCCGCGAATTCAAGGCCCGTCCCTACTGGGAAGTGGAGGCCACCTTCGCCGCCGCTGCCGGCGAATACAAGGGCAAGTGGTTCGACGAAGCCTTCAAGGGCAAGGACGAGGACGAACACGCCCGGGCCGACCGCCTGTGGGACGAAGCCCGGGCCAAGGCTCTGCAGGCCAAGTGCGAAGGCCAGCCCGGCGAAGTGAGCGAGGAGGCCAAGCCCTCCACCCAACTCTCGCCCCTGCTCTTCGACCTCACCAGCCTGCAGCGGGAGGCTAACAGCCGCTTCGGCTTCTCCGCCAAGAACACCCTGGGCCTGGCCCAGGCCCTGTACGAAAAGCACAAGGTCCTGACCTATCCCCGGACCGACTCCCGGGCCCTGCCCGAGGATTACCTGGGCACGGTGCAGGCCACCCTGCAGATGTTCAACGGCGAAAACCTGACCAAGGGTTCCGACACCTCCGTAGTGGACCGCTACGGCATCTTCGCCAACAAGATTCTCAAGTCCAAGTGGGTGGTGCCGAACAAGCGCATTTTCAACAATGCCAAGATTTCCGACCACTTCGCCATCATCCCCACCACCCAGGCGCCGAAGAATCTCTCCGAGCCGGAACAGAAGCTCTATGACCTGGTGGTCAAGCGCTTCCTCGCCGTCTTCTTCCCCGCCGCCGAATACCTGATCACCACCCGCATCACCCGGGTCGCCGGCGAACCCTTCAAAACCGAAGGCAAGGTCCTGGTGAATCCGGGCTGGCTAGCCATCTACGGCCGCGAAGGCCAGGAGGGCGACGAGGGCAACCTGGTGGCCGTCTCCCAGGGTGAGAAGGTGCAGACCGAAGAAGTGGCGGTCAATCAGAACGACACCCGGCCCCCGGCCCGCTACTCGGAAGCCACCCTGCTCTCCGCCATGGAAGGCGCCGGCAAGATGGTGGACGACGAGGAACTGCGTGCCGCCATGGCCGGCCGCGGCCTCGGCACCCCGGCCACCCGGGCCCAGATCATCGAAGGCCTGATCACCGAACAGTATCTGCACCGCGAAGGCCGGGAACTGATCCCCACCGCCAAGGCCTTCTCCCTCATGACCCTGCTCAACGGCCTGGGCATTTCCGAACTGACCTCGCCGGAACTGACCGGCGAATGGGAATGGAAGCTGGCCCAGATCGAGCGCGGCGATCTTTCCCGCAGCGCCTTCATGCAGGAAATCGAGGAAATGACCCGGCACATCGTGGACCGGGCCAAGAGCTACGACAGCGACACGGTGCCCGGCGACTTCGGCCTGCTCAAGTCGCCCTGTCCCAAGTGCGGCGGTCTCATGCGCGAGACCTACAAGAAATTCCAGTGCGGCGATTGCGACTACGGCCTGTGGAAGATCGTCGCCGGCCGCCAGTTCGAGCCGGAGGAAATCGAGACCCTGCTCACCGAACGCCAGGTCGGCCCCCTGATGGGCTTCCGCAACAAGATGGGGCGGCCCTTCAATGCCCTGATCAAGCTCAACGACAAGAACGAACCGGAATTCGACTTCGGCCAGGACCGCTCCGGCGAGGACGGCGGCGAACCGGTGGACTTCTCCGGCCAGGAAAGCCTGGGGCCCTGTCCCAAGTGCGGCAGCCCTGTCTATGAGCACGGTCTGGCCTACGTCTGCGAAAAGTCCGTGGGTCCGGCCAAGAGCTGCGACTTCCGTTCCGGCAAGATCATCCTGCAACAGGCGGTGGAACGGGAACAGATGCAAAAACTGCTGTCCACCGGCCGCACCGATCTGCTCAAGGACTTCATCTCCGCCCGCACCCGGCGCAAGTTCTCCGCCTTCCTGGTGAAGGGCAAGGACGGCAAGGTCAGCTTCGAGTTCGAGAAACGGGAGCCCAAGGCCCCAGCGGCGAAAAAAACTGCCGCCAAGGCCGAGCCCAAGGCAGCGGCGGAAAAGCCCGCCAAAGCCCCGGCAAAGCGCAAGGCGAAGGAAGCCTGAGGTTAAGCAAGGCAAAAGAAAAGGGCTCGGTGTGAACCGAGCCCTTTTTGCATCCGGACCGCCGCCAGGCGGCGAAGCGGTGAATTTACTTATTCTTCGAGCAGGACGCGCAGCATCCAGGCGTTCTTTTCATGCACTTCCATGCGCTGGGTCAGCAGGTCGGCCGTCGGCTGGTCGTTGGCCTTGTCCACCACCGCGAAGACCTGGCGGGCGGTGCGGGCCACGGCTTCCTGGCCGGCCACCAGCTGGCGGATCATGTCCTTGGCCTTGGGCACGCCGTCTTCCTCGGTGATGGAAGCCAGTTCCACGAAGCGCTTGTAGGAGCCGGGAGCCGGGTAGCCGAGGGCGCGGATGCGCTCGGCGATCAGGTCCAGGGAGTTCCACAGTTCGGTGTACTGGGTCATGAACATGGTGTGCAGGGTCTGGAACATGGGGCCGGTGACGTTCCAGTGGAAATTGTGGGTCTTCAGATAGAGGATGTAGCTGTCTGCCAGCAGGTGGGAAAGGCCGTCGGCGATTTTCTTGCGGTCTTTCTCGGTGATGCCGATATCGATCTTGGTTGCCATGTGGATTCTCCTTTCAAGCTTCTCAATAACCATGCGCCCATTGTGGCCGATGGCGGCCGCCCGGGCCAATGACTTATCCCTATGGATTGAATAGGGACGGGCAATACAGACGGACTACAGTACCTTAACCCCGGGTGGCGGACATTGCAAAATGGCGGCCCGCACCGCATCGATGGCCTGGGGCCGGGGGAAGGTGACGCGCCAGGCCAGCACGATGCGGCGGGAAGGCTCCGGCGCCTTGAAGGGAATGACCCGGGCCAGGGACGGATCCGGCGGCGCCACCTCGACGGCGGAAGCGGGCAGCACGGCAACGCCAGTGCCGCTGGCCACCATCAGGCGGATGGTCTCCAGGGAGCCGCCCTCGTAGGAACGTTCCAGCCCGCCCGGCTCGGTCAGACGCGGACAGGCGGCCACCACCTGGTCGCGGAAACAGTTGCCCTGGCCCAGCACCAGCAGTTCCTCGCCCTTCAGCTCCCCCGCATCCACCGATTTCCGCTCGGCCCAGGGATGGGCCGCCGGCACCAGCATGCGGAAGGGTTCGTCGTACACCGGCTGGGTGACGATGCCCGGCTCGTCGAACGGCTGGGCCACCACGATCACGTCCAGCTCGCCCCGCTTCAGGGATTCGGCCAGCACGTGGGTGAAATTCTCCTGCAGGTAAAGGTGCATGTGGGACACGGCCTGGTGCAGAGCCGGCACCAGCCGGGGCAGCAGATAGGGGCCGATGGTGTAGATCACCCCCAGGCGCAGGGGCCCGCTGAAGGGATCCTTGCCCCGCTGGGCGATTTCCTCGACCCGCTGGGCCTCGTTCAAGACCTTCTCGGACTGCCGCGCCACTTCTTCGCCGATGGGCGTCAACCGCACTTCGGCGGCGCTGCGTTCGAACAGCAGCACCCCCAGGCGCTCTTCCACCTTTTTCAGGGCCACCGACAGGGTGGGCTGGCTCACGTGGCATTTCTCGGCGGCGCGGCCGAAGTGGCGCTCCCGGGCCAAGGACACGATGTAGCGCATTTCCGTCAGGGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021844|2569244:2576915|2569244_2569748_-|WP_152090171.1|DBSCAN-SWA MALLPILRFPDPRLKKVAVPVEKIDDGIRALARDMAETMYEAPGIGLAATQVDVHKQVIVIDVSETKDELLVLINPEITHRDGLQVGEEGCLSVPGIYDKVERAEHIAVRYLDLEGKTCSLETDGLLAVCIQHEMDHLQGKVFVDHLSQLKQGRIKAKLAKQARITA >NZ_AP021844|2569244:2576915|2575412_2575889_-|WP_014235930.1|DBSCAN-SWA MATKIDIGITEKDRKKIADGLSHLLADSYILYLKTHNFHWNVTGPMFQTLHTMFMTQYTELWNSLDLIAERIRALGYPAPGSYKRFVELASITEEDGVPKAKDMIRQLVAGQEAVARTARQVFAVVDKANDQPTADLLTQRMEVHEKNAWMLRVLLEE >NZ_AP021844|2569244:2576915|2576003_2576915_-|WP_152090175.1|DBSCAN-SWA MTLTEMRYIVSLARERHFGRAAEKCHVSQPTLSVALKKVEERLGVLLFERSAAEVRLTPIGEEVARQSEKVLNEAQRVEEIAQRGKDPFSGPLRLGVIYTIGPYLLPRLVPALHQAVSHMHLYLQENFTHVLAESLKRGELDVIVVAQPFDEPGIVTQPVYDEPFRMLVPAAHPWAERKSVDAGELKGEELLVLGQGNCFRDQVVAACPRLTEPGGLERSYEGGSLETIRLMVASGTGVAVLPASAVEVAPPDPSLARVIPFKAPEPSRRIVLAWRVTFPRPQAIDAVRAAILQCPPPGVKVL >NZ_AP021844|2569244:2576915|2569865_2570921_+|WP_014235926.1|DBSCAN-SWA MIRHLFGRPGLNPARIIATLLLAVAASSASAQESPRLADNAPDRHIVVPGDTLWGIAGKFIQEPWRWPEIWRLNKDQIKNPHRIYPGDVIVMVTGEDGKPQLKLAKSLKLQPREYSEAVKNEIPTIPQSIIEPFLSQPLVVDPSAMDKEARIIATQEGRVYLGGGDQAYVVGVREPSELWQVYRPGKAMLDPDTKEVLGHEAFYLGTARLIQPGEPSVMEMVEVKQEVGKFDRLMPASRPELITYAPRRPEAKVEARIIAVYGGVGTGGRYSVVSLSRGSRDGLEVGHVLALLRSEKVYEQRNEQGERELVKVPPQRYGLVFVFRTFERVSYALVMDAALPLSLADLVRNP >NZ_AP021844|2569244:2576915|2572059_2572548_+|WP_152090173.1|DBSCAN-SWA MFDILVYLFENYVDFADFSKSGNQPDSPDSQADTALSRKLTAAGFSEEEISEALEWLQGLKATLPTRQLQADSRSLRAYTPDESAHLGADALGFLHFLEQAKVLSADLRELVIERAMALPDDRLSLGRFKVIVLMVLWSQEQNLDTLIVEELLSEAEPEHLH >NZ_AP021844|2569244:2576915|2572737_2575323_+|WP_152090174.1|DBSCAN-SWA MGKQLIIAEKPSVAADIAKALGGFTKHDDYFESDNFVLSSAIGHLLELVIPEEYEVKRGKWSFAHLPVIPPHFELKPVEKTESRLKLLTKLIKRKDVDGLVNACDAGREGELIFNYIARHAKSGKAVQRLWLQSMTPQAIRDGFARLRRGEEMQGLGDAAVCRSESDWLVGINGTRAMTAFNSKTGGFHLTTVGRVQTPTLSLVVERERKIREFKARPYWEVEATFAAAAGEYKGKWFDEAFKGKDEDEHARADRLWDEARAKALQAKCEGQPGEVSEEAKPSTQLSPLLFDLTSLQREANSRFGFSAKNTLGLAQALYEKHKVLTYPRTDSRALPEDYLGTVQATLQMFNGENLTKGSDTSVVDRYGIFANKILKSKWVVPNKRIFNNAKISDHFAIIPTTQAPKNLSEPEQKLYDLVVKRFLAVFFPAAEYLITTRITRVAGEPFKTEGKVLVNPGWLAIYGREGQEGDEGNLVAVSQGEKVQTEEVAVNQNDTRPPARYSEATLLSAMEGAGKMVDDEELRAAMAGRGLGTPATRAQIIEGLITEQYLHREGRELIPTAKAFSLMTLLNGLGISELTSPELTGEWEWKLAQIERGDLSRSAFMQEIEEMTRHIVDRAKSYDSDTVPGDFGLLKSPCPKCGGLMRETYKKFQCGDCDYGLWKIVAGRQFEPEEIETLLTERQVGPLMGFRNKMGRPFNALIKLNDKNEPEFDFGQDRSGEDGGEPVDFSGQESLGPCPKCGSPVYEHGLAYVCEKSVGPAKSCDFRSGKIILQQAVEREQMQKLLSTGRTDLLKDFISARTRRKFSAFLVKGKDGKVSFEFEKREPKAPAAKKTAAKAEPKAAAEKPAKAPAKRKAKEA >NZ_AP021844|2569244:2576915|2570930_2572049_+|WP_152090172.1|DBSCAN-SWA MADPALTAWLRLTLVPGVGPETQRHLLAAFGLPEQVFSAPRSALKQVVGKKADLLLDTDNQEAVDRALDWADKPGNRILTLADPDYPQLLLESADPPSLLYVKGRVELLNRPALAIVGSRNATPQGLKDAEALAADLAAQGLTIVSGLALGIDGAAHRGGLKGEGGSVAIIGTGADRIYPSRHKELALQLATEGAIVSEFPLGTPAVAHNFPRRNRIIAGMAKGCLVVEAALESGSLITARLAAELGREVFAIPGSIHSPVAKGCHRLIQQGAKLVQEARDIVEEIGPFDPPGCRPARTPLSTGNTPTTVPILDPGQAAVLDALGHDPANLDQLLQRTGLTTEALCAILVTLELADHVASLPGGRYQRLSPT |
7 | Synechococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_3 |
2967016 : 3023959
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021844|2967016:3023959|DBSCAN-SWA CTCATTGAGCAGTCGTTTCATAGGTTAACCGCTTGCCCTTCACACCCTGAAGCAGTTTTTCAGCCCGCTCCTGATCTTCGATACCGAGCGCCTTGCGATTGTTGTAGCGGAAATCGAATTCTGCAAGGTAGCGGTTCAGATGGTGGTGGCCACAGTGCTGATAGACGCCCTTCATGCCGCGCTTGAAGATCGAGAAGAAGCCCTCGATGGTGTTGGTGTGAATGGTGGGATCAATCTTGGATACGTATTCACCCATGCCATGCCGGGTAAAAGCGTGACCAGCGAAGTGCTGACCTACATTTTTGTACTGGCCGGCTTCGTCGGTCATGATCCGTGCTTCCCGGGCGATGTTTGCTTCCAGGATTGGGATCAGGGTCTTGGCTTTGAGGTCGTCCACCACCATGCTGCGGGCCTGGCCGCTAGTGCGGTCAACCAAGGACAGTACCTTGTTCTTGTGGTCGTAGCCGCGCCCCTTCTTCTCGCCCTTTGGCTTCTTGGTGTAGTCACGCCCAATGAAGGTTTCATCTACTTCCACTGGACCACCTTCGGAACCGAACGGGGAGAAGTCGCCGGAGCGCATGGCTTCCCGAATCCGGTGAGACATGAACCAGGCTGACTTGAGGGTGATGCCCAGGGTGCGGTGAAGCTGATTGGCACTGATTCCCTTCTTGCTAGAAGAGATCAGGAAGATGGCTTGGAGCCACAGCCGCATTGGGATGTGGCTAGACTCGAAGATGGTGCCAACTTTCACGGTGAAGGGCTTCCGGCAGTTGTAGCACTTGTAAGCGCCTATGCGAGTGCTCTTGCCCCCCATCTTGCTTATACGTTCCACACAGCCACAGTGAGGGCAGACCGGGCCACTGGGCCACAGACGGGATTCTACGAATTCGTAAGCAGCTTCTTCGTTGTGGAAGTACGGAGCAGATAGAATGGACGCACCCATGACAGCCTCCTTGAATATGCATTCAGTATAGGAGAGGTAAGTGGGTACGTCAAGTATAATATCGCCCTTTTTTTATGGCCGGACGTCGGCCCGGGCCTGCAGCACGGGATGGGGCGGGGCGGAGGGCGAGGCCAGGGTGCGCAGGGCGCCGTAGGCCAGGAAGGCGCCGAGGAGGGCGCACAGGGTGAAGAACAGCGCCAGCTCGCGGCGCAACAGGCGTTCGATCCTGGGCAGGGTGTCCCTGCCGGGCGGTACTAGCGGCATCTGGTTCATTTTTTTCTCCCTGGACTGGCTGCCCGCCTGCGCCGTCTGGCGGCGGGCCGGCAAGGCCAGTAAATACCCTGTTGCTTACATGGATATTACGGCGAGGGAGGGAGGCCGGTGGCGGCAGACGCGGTGCCGCGTCAGACCTGGGCCAGGATTTCCAGGATGTCCGTGGCTTCGGGCAGGGGTTGGGCCACCAGGAGAAAGTCGGCCATGGTGTCCAGTTCGGCCAGGGCCGCTTCCAGGGCCTCGCGGACTTCCGGGTGGTCGAAACCGTGGCCGGCCTGGGCCCGGACGATGGCCGCCGCCAGACCCTGGGCCAGGGCTTCGGCACTGTCCAGTTCCAGGTGGGCGGCGTGGTCCGCCAGGCAGTGGGCCGATTCGGCCAGGGCTTCCCCATCCGGTTCCAGGCAGGCCAGTTCCTCCCGCATCAGGGCCGCCTGCTCGCCGATTTCGCCGGCAAAGAGGCGCAGCAGGGTGGGCGAGACGGCGGGACGCTGGCTTTCCCGGCATTCGGCCAGGCGCTGGGCGACGTGGGCCACCCGTTCGGCGAAGACCGGCAGGCCCAGGTCCCGGGTGTCGCCGAGCAGTTCCAGGGCGGCGGCGATGGCGGCGCGCAGGTAGGGGTCCTCGGCCGCCTCCGGGGCGTCGAGAAGGTCGCCGACGCCGGCCAGGGATTCGGCCAGGTGCAGGGTCTCCGGCAGGTTGAGGCTCATGGCGGCGGCGCAGAGGTCGATCAGGGCCCGCCGGAAGGGAGCCAGATCGCCGCTGCCGCGCCCGTTCCAGGCGGCCACGGCGGCGTTGCCGGCGGCGCTCCAGGCGGCTACCGCCTCAGCGTCCAGGTCCGGGGCCTCGGGCTTGGCCATCAGGGAAATCAGTTCGGTCACCGGCGGCAGCTGGCCGGCACCCTGGCGGATGCACTGCTGCAGTTCCTCGGCCAGGCCGCCCTCGGGGGCTTCGCTGCCATTCAGGGCGGCCCGCATCTGCAGGTCGATGCGGGCGCAGAGGCGGCGGGATTCCGCATCCACTGCCAGGGTGCCGTCGGAGATGGCGCGCAGGAACACTTCGGCACTGTGCCAGAAGGGGGCGAAGTCGCCGCCGGCGGCGGCTTCCAGGTGGCGCACCGCGGCCCGCATCTCCGGCAGTCCGGCCGGGTCCCCCGGCTGTTGCAGCCAGGCCAGCATGCCCCGCTGGTAACGGCTGCGGGCCTGACGCTGGGAGGGGGAGGGCGGGGCAACGCTCATGGCAGGGGCGGAAGGATCGGTGGATGGGGCATTTTACTCCTCTGCCCGGGGACGTCTGTGGTTAGAATGCGGGCATAAATACAAGCGGAGACATCCGTGTTCCACTCCCCCTCGGCTCCCAGAGACCGCCCGGCCTTGCTGTGGCAGGCCCTCCTGTTTTTTCTGCTCCTGCTGCCTGCCGCCGTCCGGGCCCATCCCCTGGTGCTGGATCAGGACGACGGCAGCTTTGCTCTGGTGCCCCACGTGGAAGTGCTGGAAGACCCGGGCGGCAAGCTCGACCTGGCCGCCGTGCGGCAGGCCGCCGCCGCAGGCCGCTTCGCGCCGGCCCATGCCCTGGGCGAATTGAACTTCGGCTATTCCTCCTCTGCCTTCTGGCTGCGCATTCCCCTGGAGTCCCGCCTGCAGCGTTCCAGCCCGTGGCTGCTGGAGATCGCCTTTCCCTCCCTCGACCGGGTGGAGCTATTCCTGCCCCGGGCCGACGGCCGCGTCGACTACCAATTAACCGGGGACCGCCTGCCCTTTGCCGAACGCCCCTATCCCAACCGCAATCTGGTGCTGCCCCTGGAGCTGGCGCCCGGGGAATCCCTCGCCCTCTATCTGCGGGTGGAGTCGGAGGGCAGCCTGACTTTGCCCCTGACCCTGTGGACGCCGGATGCCTTCCGCCTGCACAACCAGGACGCCTACGCCGGCTTCTCCCTCTACTACGGCATGCTGCTGGCCCTGGGCCTGTACAACCTGCTGCTCTTCTTCGCCCTGCGGGAGCGCATCTATCTGGTCTATGTGGCCTTCGCCGTGAGCATGGCGGTGGGCCAGCTGTCCCTCAACGGCCTGGGCAACGAATACATCTGGCCGGCCTTCCCCGCCTGGGGCAATGTGGCCCTGCCCTCGGGCTTTGCCGCCACCGGCTTCTTCGGCGCCATTTTTACCCGCCTCTTCCTCAATACCCGGCACAGCAATCCCCGGGCCGACAAGCTGATCCTGGCCCTGGCCGCCGGCTTTGCCGTGGCCGCCCTGGGGCCGGCCCTGCTGCCTTACCGCTGGGCCGCCATCCTCACCTCCCTGCTGGGTGCCGCCTTTTCCGCCGTGGCCGTGGCGGTGGGCGTCCATGCCCAGCTGCGGCGCCACCCGGGGGCCCGCTACTTCCTCCTGGCCTGGTCCCTGCTGCTGGTGGGGGTGGGCATGATGGCCCTGCGCAACCTGGGCTGGCTGCCCACCACCCTGTTCACTTCCTACGGCATGCAGATCGGTTCGGCCCTGGAGATGCTGCTGCTCTCCTTTGCCCTGGCCGACCGCATCCAGGCCGAGCGCCTGGCCCGGGAACTGGCCCAGGGCGAGGCTCTCCACAGCAAGCAGGACCTGGTCAACGCCCTGCGCAGCAATGAGCAACTGCTGGAGGCCCGGGTGGCGGAACGGACCCGGGACCTGGCCGCCGCCAACGATCGCCTGCTGGCCAACGAGCAGCAGTTGCAGCGCATGGCGCGGCACGATCCCCTGACCGGGCTGGCCAACCGTCTGCTGCTCGACGACCGTATCAGCCACGGTCTGGCGGTGGGGCGGCGCAACGGCACCCGTCTGGCCCTGCTGCTGATCGACCTGGACGGCTTCAAGCCGATCAACGATAAGCATGGTCATGCCGTGGGCGACCAGTTGCTGGTGGTGCTGGCGGACCGCCTGCAACGCTCGGTGCGGGCGGTGGATACGGTGGCGCGCCTGGGCGGTGACGAGTTCGTGCTGGTGCTGGAGGATCTGGCGGCGGTGGAGGACGGGCGCCAGGTAGCGGCCAAGGTGGTGGCGGAAATGAGCCGGCCGGTGGTGCTGGAGGGGCGGGAACTGCTGGTCTCCGCCAGCGCCGGCCTGGCCTTCTATCCGGAGGACGGGGAGGACGCCCAGACCCTGCTCAGGCGGGCCGACGAGGCTATGTACGAAGCCAAGCGGGCCGGCCGCAACACCTTCCGTCAGGTGGGCCAGTAGGGCGTTGCGGGCAGCGTGAACGTGCAAAAGGCCGCCCGTGGGCGGCCTTTTCATTGCGAAAGCCTGTGGGCGGCGCTGCTCAGCCGGCCAGCTTGGCGAAGGCGTCCACCACTTCGGCGGGAGCCTGCACCAGCTCGATCAGCACGCCTTCGCCGCCGATGGGGGATTCCTCGTTGCCCTTGGGGTGCAGGAAGCAGATGTCGAAACCGGCCGCGCCCTTGCGGATGCCGCCGGGGGCGAAGCGCACGCCGTTGGCCGTCAGCCATTCCACGGCCTTGGGCAGGTCGTCGATCCACAGGCCCACGTGGTTGAGGGGGGTGGTGTGCACCGCCGGCTTCTTTTCCGGGTCCAGGGGCTGCATCAGGTCCACTTCGACCTTGAAGGGGCCCTTGCCCATGGCGCAGATGTCCTCGTCCACGTTTTCCCGCTCGGAGACGAAATTGCCGGTGACTTCCAGGCCCAGCATGTCGACCCACAGGGTCTTCAGCTTGTCCTTGGAAGGGCCGCCGATGGCGATCTGCTGGATGCCGAGAACTTTGAAGGGACGCTGGGACATGGAGAGGCTCCTGAAATCGGTGGAATTAAAAATACATAATTATACAGAGCGGACCGCCGCCCAGGGAGAGGGGCGAGCGGTACTCGGGGCCGGAATCCGGGGTGGAGCCCGGGCGGGCTCCCGGGACCATGTTCCAGGGCGGACACCGGCCCCGGGCGGACGGAGGAGGATGCCGCCTCACGCCACCTGTCCGGCACGATGGGGCGCGAAAAAAAAAGCGGAGCCCCGCCGCCGCAGGTGCTCCGCTTCCCTCTGTAACCGGGCGGACCCGGAAGTGATGGCAATCGCCGTTGCTGCCGGCAGGCGTAACGGTCTGCTTATTTCTTCTTGGCCTTGGCGTCCTTGCTGGCGGCGATGTTGCCGACGAAGGTGTTTTCCACCAGGGGCGGAGCATCCACCTGGGGCACGTGGCACTGCACGCAGTTGTGGCGCAGGTGGGTGACTTCGGCGTGCTGCTTGCCTTCCCGGTCGATGAAGTGGCTCTCGCCGATCTTCGGGGCCTTCTTCTCCTTGTACTTCTCCGGACCGTGGCAGGTCAGGCACTGGTTCTCTTCCAGGGTGATCTCGTCGAAGTTGTCCACCGCATGGGGGATCACCGGCGGCTGCTCCTTGTAGGTGCGGGCGATGGGCTGTTGCAGGCCCGGCTTCTTGCCGGCGTAGGCCTTCACCTCGGGGGCCGGGTCGCCGGCGGGAATGTCGGCGCCGCGCATGGTCTTGGGCGCATCGGCGGCCTGGGCCAGGCAGGCGAAGGAGGCGGCCAGGATGGCCAGGGTCAGTTTGTGCAGTCGGTTCATGGTCGGCTCCTCAATGGATGGTCTTGCGATGGTCTGTCTGGCCTTCGGCCTCCCCGGCCGGGGCGCAGCGCTGGGTGTGCTGGTTGAAGCGGCTGCCGAAGACGAAGACGTCCTTGGCGCAAACGTCGATGCAGCGGCCGCAGTTGGTGCAGGCAGAGGCCAGGATGACCGGGCCGGTGCCGTTGGCCTCGCCCTTGAGGGCGGGGCGGATGACCTGGGGTTCGGGGCAGGCGGCGAAGCAGTCCATGCAGTCGTCGCAGTCCTGGCGCCGCCGGGCGCTGACCCGCAGCAGGCTGGTGCGGCCGAGCAGGCTGTAGAAGGCGCCCACCGGGCACAGGTGACCGCACCAGCCCCGGCTCATGATCAGCAGGTCCAGCAGGAAAATGGCGAGGACCACGGTCCAGGCGGCGCCCAGGCCGAAGATGAGGCCCCGGTGCAGCATGGATACCGGATTGATCAGCTCCCAGGCCAGGCCGGCGCCGGCCAGGGGCAGCAGCAGGGTCAGCCCCAGGATCCAGTAGCGGCTGCGGCGGCTGATGTGGGCGCTGCCCTTGAGACCGAGGCGCTCCCGCAGCCAGCCGGCCAGGTCGGTGACCAGGTTCATGGGGCAGACCCAGGAGCAATACACCCGGCCGCCCACCAGCAGGTAGAAGGCGAGCACGATGGCGGCGCCGAGCAGGGCCAGGCCCTCGGGGCGATGGCCGGAGAACAGCACCTGCAGTACCAGCAGGGGATCGGCCAGGGGCAGGGTGTCCAGGGTCAGGCTGTAGCTGAGGTTGCCCTTGACCAGCCACAGGCCGGCCAGGGGGCCGAGCAGGAACAGGCCCAGGATGCCGAACTGGGACAGGCGCCGCAGCAGCAGCCAGCGGTTGGCGCCGAGTCGGCCCTTGGCGGCCAGGGCGGCGGGGAAACGGGGAGAGAGGAAGCTCATCGCGCCGCCTCGTCGGAAAGCCGGTTGGGCAGGCCGCCACCGGGGATGTTCTGGGGTATGGCCGGAACGCCCGGTCCATGGGCATCGGCACCGGGGATGCTCGGCGCCAGGCTATCGACGCCGCTGCCCGGGGTGGCCGGCTTGCCGGGGACGAGGGACGGGCCTCCCTGGCTGGCCGGGTCGAAGTGGCCCTCCAGGCGGGCGCCTTCAGGCATGCGGTCCGGCAGGTCGCCCAGGCCCTTGTCGTCCACCAGGGAATGGCCGGCCTTCTGTTCTTCCTCCCAGCCGACCCGGTAGTGCTGGCCCAGCTCGCCCTTGGCCAGGGGCACGGGCAGGACCTTGATGGCGGCGGTTTCGAGGACGCAGGAGCGCTCGCATTTGCCGCAGCCCGTACAGTGCTCGGAATGCACCGCGGGAATGAACATGCTGTGGCGGCCGGTGCGGGTGTTGGGGCGCAGCTCCAGGGTGATGGCCTTGTCGATCACCGGGCACACCCGGTAGCAGACGTCGCAGCGCAGGCCGAGGAAATTGAGGCAGGTCTCCTGGTCGAGCAGGACCGCCAGGCCCATGCGGGCCTGGTTGATGTCGGTGAGGCCGTGGTCCAGGGCGCCGGTGGGGCAGGCCTTGACGCAGGGGATGTCCTCGCACATCTCGCAGGGCACCTGGCGGGCGACGAAGTAGGGCGTGCCGGTGGAGACCGGCTGCTCCGGCCGGGCCAGGGACAGGGTGCCATAGGGACAGTCGCGCACGCACAGGCCGCAACGGATGCAGGCACCGAGAAAGTCCTCCTCCGCACCGGCACCGGGCGGGCGCAGGGCTGCCGGCGGCAGGGCCCGGGCCTGCTTGGCGTGGAAGCCCAGACCGAGACCCAGCAGCCCGACGCCGCAGGCCATGCGGCCGGCGTCGGCGAAGAACTGGCGCCGGGCCGCCGCAGCCTTGTCGGACTTGGCAGGAGGGGAGTTGGAAAGATCGCTCATGGCGTGACCGGCGGCGGGAACCGACTGGGGGCGGACTTCGCTGTCCGCCCCGCCGGCCTCCCTGATGCCTTATCCGGTTGTGTAGGTGCTTAGGCCTTGACCACCTTGCAGGCGCACTTCTTGAAGTCCGTCTCTTTCGAGATGGGGCAGGTGGCGTCCAGGGTCAGCTTGTTCACCAGCCGGTGCTCGTCGAAGAAGGGCACGAAGACCAGGCCCAGGGGCGGCTTGTTGCGGCCCCGGGTTTCCACCCGGGTGGAAATCTCGCCGCGGCGGGACTGCACCTTGACCGTGTCGCCGCGCTGCAGGCCGCGCTTCTTGGCGTCTTCCGGATGCATGTAGATCCAGGCGTCGGGCATGGCCTTGTACAGCTCGGGCACGCGCCGGGTCATGGAGCCGGTGTGCCAGTGCTCCAGCACGCGGCCGGTACAGAGCCACAGGTCGTACTCGGCATCCGGCTGCTCGGCGGCAGGCTGGTAGGGCAGGGCGAAGACCACCGCCTTGCCGTCGGGGAAGCCGTAGAAGCGCACCTTCTCGCCGGCCTTGACGTAGGGGTCGTAGCCTTCGCGGAAGCGCCACAGGGTTTCCTTGTTGTCCACCACCGGCCAGCGCAGGCCGCGGGCCTTGTGATAGGTGTCGAAGGCGGCCAGGTCGTGGCCATGGCCGCGGCCGAAGGCAGCGTACTCCTCGAACAGGCCCTTCTGCAGGTAGAAGCCGAGGACCTTGCCTTCCTCGTTCTCGAAGCCCTTCAGCTGGTCGGAGACGGGGAACTTGTTCACTTCGCCGTTGGCGTAGAGCACGTCATAGAGGGTCTTGCCCTTGTACTCGGGGGCCTTGTCCAGCAGTTCGGCGGGCCACACTTCCTCCATCTTGAAGCGCTTGGAGAATTCCACGTACTGCAGCACGTCGGAGCGGGCCTGGCCCTGGGGCTTGACCTGCTGGCGCCAGAACTGGGTGCGGCGCTCGGCGTTGCCGTAGGCGCCTTCCTTTTCCATCCACATGGCGGAGGGCAGGATCAGGTCGGCGGCCAGGGCGGAGACGGTGGGATAGACGTCGGAGTGCACCACGAAGGCGGCCGGATTGCGCCAGCCCGGGTAGACCTCGCCGTTGATGTTGGGGCCGGCCTGCATGTTGTTGGTGGTGGTGGACCAGAAGAAGGCCACCTTGCCATCCTTCAGGGCGCGGCTCTGGGCCACGGCGTGCAGACCCACCCAGTCGGGGATGGTGCCGGCGGGCAGCTTCCACAGCTTCTCGGTGATTTCCCGGTGCTTGGGATTGACCACCACCATGTCGGCGGGCAGGCGATGGGCGAAGGTGCCCACTTCCCGGGCCGTGCCGCAGGCGGAAGGCTGGCCGGTGAGGGAGAAGGGGCCGTTGCCAGGCTCGGAGATCTTGCCCACCAGCAGGTGCACGTTGTAGATCATGTTGTTCACCCAGGTGCCCCGGGTGTGCTGGTTGAAGCCCATGGTCCAGTAGGAGACGACCTTCACCTTGGGATCGGCATAGGCCTTGGCCAGGGCTTCCAGGTTTTCCTTGGGCACGCCGGAAATCTCGTGGGTCTTGTCCAGGGTGTACTCGGCGACGAAGGCCTTGAACTCGTCGAAGGAGATGTCGGTGGCCTTGTTGGGGTCGCCCTTGGGTTTGCCGTCCGGGCCCGGGTAGCCGTTGTTGCCGGCGGCCTGTTCCAGGGGGTGGTTGGGACGCAGGCCGTAGCCGATGTCGGTGACGCCCTTCTTGAACTTGACGTGGTTCTTGACGAAGTCCTGGTTCACCGCGCCGTTCTGGATGATGTAGTTGGCGATGTAGTTGAGGATGGCCAGGTCGGACTGGGGCTTGAAGATCAGCTCGTTGTCGGCCAGCTCGCAGGAGCGGTGGGTGAAGGTGGACAGCACGTGAATCTTGACGTGCTTGGCGTTGAGGCGGCGGTCGGTGATGCGGGACCAGAGGATGGGGTGCATCTCCGCCATGTTGGAGCCCCACAGGGCGAACACGTCGGCGTGCTCCGCATCGTCATAGCAGCCCATGGGCTCGTCGATGCCGAAGGTGCGCATGAAGCCAGCCACGGCGGAAGCCATGCAGTGGCGGGCATTGGGGTCCAGGTTGTTGGAGCGGAAACCGGCCTTCCACAGCTTGGCGGCGGCATAGCCTTCCCAGATGGTCCACTGGCCGGAGCCGAACATGGCGATGTTGCGCGGGCCGCCGGCCTTGAGGGCTGCCTTGCACTTCTCGGCCATGATGTCGTAGGCCTGGTCCCAGGAAATGGGGGTGAAGTCGCCGTTCTTGTCGTACTGGCCGTTCTTCATGCGCAGCAGGGGCTGGGTCAGGCGGTCCTTGCCGTACATGATCTTGGAGAGGAAGTAACCCTTGATGCAGTTGAGGCCCCGGTTTACCGGCGCTTCCGGATCGCCCTGGGTGGCCACCACGCGGCCGTCCTTGGTGCCCACCAGGACACCGCAGCCGGTGCCGCAGAAGCGGCAGACGCCCTTGTCCCAGCGGATGCCGTCATTCTTCGGCTGCTGCGCCAGGGCTTCGGATACGCCGGGCACCGCCATGCCGGCGGCGTTGGCGGCGGCGGCGACGGCGCTGCTCTTGATAAAGTCGCGACGGGTCAGGTTCATCTCAGGACTCCTTCTCGGGATCAGGTTCGAAGTGGTGATACACCATGGCCAGGGACATGACGCCCGGCAGCTGCTGTATCGCCTCGTAGGTTTGGGTGGTTTCCCGGTCGCCGTCGCTCTCGATGGTGACGATCATGCGGCCCTCTTCGGAGACGGCGTGGACCTCCACGCCCGCCAAGGTCGCCAGACCCGCCTCGACGGCCGCGATCTGCTGCGGCCCGGCGTTGACCAGAATGCTGGAAATATTCACGGATGATTCCCCCAATCGGCAGGCACGGCTCGAAGGGCCGGCTCAAGGTTCGACCACTCTATGCCCCTACGGAAACCCAGGTTTATGATGTGAATCAAAAGCGAAAAAAAGCCCCCGCAGGATGTAGGTGATTTGAAAAGTCACCGGGCTTCGGTTAGCCTGATCGGGCTTTGCCTGGTGCGGTTTTCCGCCGCATCGGGGTTAATGAGAATGACCCACAACAAGGCCCGCCCCATGTCGGCAGGCCGGCGGTTCGAGTAACAGCCCCGATTCCCCAGAAGGCTTGTTTTCCCACCCACCCATGTCCCGGCCCCTGCCCATTCTCTCGACGCCCGCAGCCGCCCCCCCGGAGGCCGCTCCCCTGACCCGCTACAGCCCCATTCCCTGGGGCCTGGTGATCGTCCTTTCCCTGCTTTTCGTCGTGGTCTGGCTGCTGCCGCCCCTGGGCGGACTCAAGCAGAGCGACACCATCTTCCCCCTGACCCTGCACACGGTGATGGAGAGCTTCTCCTTCGTCGTCTCCGTGCTGGTCTTCGCCGTATCCTGGCATGCCTACAGCCGGGAGCGGGCGGGCAACCTGATGATCCTGGCCTGCGGCTTCCTCGCCGTGGCCCTGCTGGATTTCGGCCATACCCTCTCCTACCGGGGCATGCCCGACTTCGTCACCCCCTCCTCGCCCCAGAAGGCCATCATCTTCTGGCTGGCGGCCCGCTATGTGGCGGCCCTGACCCTGCTCACCATCGCCCTGCGCCCCTGGCAGCCCCTGGCCCGCCCCCGGGACCGCTACCGGCTGATGCTGTGGGCCCTGCTGGTGACCGCCGCGGTGTTCGTCTCGGAGCTTTACCTGCCGGATTTCTGGCCCACCATGTTCGTGCCCGGGGTGGGCCTCACCGGTCTCAAGATCGCCGCCGAATACGGCCTCATCGCCATCCTCGGCGCCACCGCGGTGATCCTCTATCCGAAGACCCAGGGCAAGCCCGCCTTCGACGCCGCCAACCTGTTTACCGCGGTGCTCATCACCATCCTCTCGGAGCTGTGCTTCACCCTGTACTCCAACGTCAATGACGTGTTCCAGCTGCTCGGCCACACCTACAAGGTCATCGCCTATTTCTGGATCTACAAGGCAGTGTTCGTCTCCAGCGTGCGCGATCCCTACCTGCGCCTGAGCCTGGAGATGGCCGAGCGCCAGGCGGCGGAGGCGCGCATCCAGTTCCTCGCCTACCACGACCCCCTGACCGAACTGCCCAACCGCATCCTGGTGCGGGAACGTTTCGAGCGGGCGGTGGAGCGGGCCCGGGACCAGTCCTCCCGGGTGGGCCTGGTCTATATCGATCTGGACAATTTCAAGACGGTGAACGACTCCCTCGGCCACACCCTGGGCGACCTGCTGCTGCAGGCCATCGGCCAGCGCCTGCAGTCCCTGGTGCCGGCCGGCAGCACGGTCAGCCGCCAGGGTGGCGACGAGTTCCTCATCCTGCTGGAAGACCTGGAGCAGTCCCGGCTGGCGGAGAGCCTGGTGAGCCGCATCGTGGAGCAGATGCAGGCGCCCTTCGAAATCCAGGGCCACGACCTGTCCACCTCCGTTTCCATCGGCGTTTCCCTTTTCCCCGACGACGGCGGCGATTTCGACACCCTGCTGAAAAAGGCGGATACGGCCATGTACCGGGCCAAGGGCGCCGGCCGCAACGGCTACCGCTTTTTCGACCGGGAAATGGACAAGGACGTGGGCGAGCGCCTGCGCCTGAGTAACGACCTGCGCCTGGCCCTGGCGCGCAACGAGTTCGTGCTGCACTACCAGCCCCAGATCGATTTGCGCACCCAGGAAGTGATCGGTGCCGAGGCCCTGATCCGCTGGCAGCATCCGGAACTGGGCCTGCTGGCCCCGGGCCGCTTCATCGGCATCGCCGAGGACACGGGCCTGATCGTGCCCATCGGCGAATGGGTGATCCGCATGGCCTGCCATCAGGCCGCCGCCTGGCAGCGGGCCGGCCTGCCGCCCCTGGTGGTGGCGGTCAATCTTTCCGCCGTGCAGTTCATGCGCGGCGACCTGGTGGGCACGGTGGCCAGCGCCCTGGCCACCTCCGCCCTGCCTTCCCGCTGTCTGGAACTGGAACTGACCGAATCGATCCTGATCCAGGATGCGGAGAACATCCTGGGCACGGTGCAGCGCCTCAACGCCATTGGTGTGCAGATGTCCATCGACGACTTCGGCACCGGCTATTCCAGCCTTTCCTACCTGAAGCGCTTCGCCGTGGACAAACTGAAGGTGGACCAGTCCTTCGTGCGCGACCTCTGCAGCGATCCGGACGACGCCGCCATCGTGCGGGCTATCATCCAGCTGGCCCGCAGCCTGGGCCTGAAGACCATCGCCGAAGGGGTAGAAACGGCGGAAATCCTCGCCCTGCTGCAGGAGCTGGGCTGCGACGAAGCCCAGGGCTACTACTTCGCCAAGCCCCTGCCGGCGGACAACTTCAGCGCCTTCCTCAGCCAGCGCCTGTCCTGACGGTCCGGCGCTGCCGCAGCGCCGGCAGCCCCCATTTCCCCCAACTGAAAACTACTGCCGCACTTCCCGGATGTTGTCGGGCGGCAGGAACTCGCCGCTGCTGCGGAAGGGATTGATGTCCAGGCCGCCGCGCCGGGTGTAGCGGGCATACACGGCCAGGGACTGGGGCGCGCAGCGGGCGCTGATGTCGCAGAAGATGCGCTCGACGCACTGCTCATGGAACTCGTTGTGGCCGCGGAAGGACACGATGTAGCGCAGCAGGGCGGCGCGGTCGATGGCCGGGCCCCGGTAGCGCACCACCACCATGCCCCAGTCGGGCTGGCCGGTGACCAGGCAGTTGGATTTGAGCAGGTGGGAATAGAGGGTTTCCTCGACGCTGCGGCCGGCATCGGCCTGCAGCAGTTCCGGCGCCGGCTGGTAGCGGTCGCACTCGATGTCCAGCTCGTCCAGCAGGATGCCCGAGGGGTAATCCACCCGGGGCCGCGGCTGGGCAGCCAGAGGCTCCAGCTGCACCGTCACCTGGCCGCCGGCGGCGGCGGAAAGATCCCGGACCAGGGTGGCGGCCACCGTTTGCGCGTCGGCGAAGGCGCTCTGGTTGAAGGAATTCAGGTACAGCTTGAAGGACTTGGATTCGATCAGGTGCGGCGTCTGCGCCGGGATGCGGAAGGTGCCCAGGGCCACCACCGGCTTGCCCCGGGGATTGAGCCAGGAAATCTCGTAGGCGTTCCACAGGTCCTCGCCGACGAAGGGCAGGCGGGCCGGGTCGATGCCGATCTCGTCCCGCTTCAGCTGGCGGGGAATGGGAAAGAGCAGTTCCGGGGCGTAATGGCAGCGGTACTCGCTGGCCTTGCCGAGAGGGGAGAGGGCGCTGGGATCGATGGTCATGGGGCGGATTTTACCCCCAGGCGGAGGCAGGCCCCAGCATCGGCCGTGAGCGCCGCCTAGAGCAGCACGTAGCCCTGGGGCTGCAGCAGGGGCGGCAGGTCGGTCTCGCCCAGGGCTTCGCCCAGGTCCCCTTCCAGCATGCGCACCATGGCGTCCAGGGGCAGGTCGTTTTCCAGGGTGCCGAAGGGCTCTTCCAGCTCGTCCCCCAGTGCGTCCAGGCCGAAGAAGGTGTAGGCCAGCACCGCCGTCAGCAGGGGGGTGGCCCAGCCGACGGAGCGGGCCAGGCCGAAGGGCAGCAGCAGGCAGAAGAGGTGGGCGGTGCGGTGCAGCAGCAGGGTGTAGGCGAAGGGCAGGGGGGTGAAGCGGATGCGCTCGCAGGCGGCCTGGATGCCGGAGAGGGCGTGCAGGCGCTGGGTCAGGCCCTGGTAGACGATGTCGCCGAGGCCGTCCCGTTGCCGGGCCTGGACCAGGTCGTGGCCGCACTGGCGCAGCAGGGCATCGGCCGGATTGCGGCTCTGGGCAAGGCGCTCGGCCTCGCTGGGAGGCAGGAAGGGTGCCGCTTCCAGGGCTGCGTCCCGGCCCCGCAGGCGGGCGGCCAGGGCGTGGGCGAAGGCCAGGCTGCGGCGCACCAGGAGGCGCCGGGGTTCCGCCTCCAGCACCAGGCTATCCCGGGCCAGGGAGCGCAGTTCGACGATGAGGCCGCCCCACTGCTTGCGTGCTTCCCACCAGCGGTCGTAGCAGGCGCTGTTGCGAAAGCCGAGGAAGATGGAAAAGGCCAGGCCGAGCAGGGCGAAGGGGGCGGCGGAATAGTCGGGGAAGAGGTGGCCGAAGTGCTGGGCGCCCCAGGTGATGAGCACGGCGAAGCTGGTGGTGAAGACGATCTGGGGCAGCACGTGGGGCACCACCGAGCCGCGCCAGATGAAAAAGAGACGGAGCAGGCTGGGGCGTTCGCGGACGATCATGTCGAGGCAGTCGGTGGAGTGGTGGTCGGGGTGGCGCTCGCCAGTGCGGCCAGGCCGGGCAGGGCGGCGCGGGCGTTGGCGCCGGTGGCGGCGATCAGCTCGGCTGTGGGCATGCCCCGCAGCTCGGCCAGGAGGGCGGCGAAGCGGGGCAGGTAGGCGGGCTTGTTGCGCCGGTCCGGGCTCGCCGCGGTGAGAAAGGCCGGGGGAATGTCGGGGGCGTCCGTTTCCAGGACCAGGGCTTCCAGGGGCAGGGTGGCGGCCAGTTCCCGGATGCGGGTGGAGCCGCTGAAGGTCATGGCGCCGCCGAAGCCCAGCTTGAAGCCGAGCTTGATGAATTCGTCGGCCTGCTGCCGGCTGCCGTTGAAGGCATGGGCGATGCCGCCCCGGGGCCGGTAGCGGCGCAGCTGCTTGAGAATGGGGTCCAGGGCCCGGCGCACGTGGAGGATCACCGGCAGGTCGAACTCCACGGCAAGCTGCAGCTGTTCGGCGAAGAAGTGCTGCTGCCGCGCCAGGGCCTCGCCCTGCTGCAGTTCGGGCACGTACAGGTCGAGGCCGATCTCGCCCACCGCCAGCGGCGCCAGGGGGCCATCCCGCTCCTCGGCCAGCCAGCGGCGCAGGGTGGAGAGGTCTTCCTCCCGGGCCGCCGGCGTGTACAGGGGATGGATGCCGTAGGCCGGCGCGCAGCCCGGGTAGGCGAGGCAGCAGGCGCGCACCTCGGCGAAGGTGGCGGCGGCCACTGCCGGCACCACCATGGCCTGTACACCAGCAGTGACGCCATCCTGGAAGATGGCCTCCCGGTCAGGGGCGAATTCCGCCGCGTCCAGGTGGCAGTGGGTGTCGATCAGCATGGAATCTGGCTGTCCGGGCAGGCCGGGGGGAGAGTCCCGCCCTGCCTTACTTCTGGCCGAAGGCAGGTTTGCGCTTTTCCAGGAAGGCGCCCAGGCCTTCGGCGAAGTCGGGATGGACGGAGCAGGCGGCGAAGTTGCCCTGTTCGGCGAACAGCTGCTCCGGCAGGGAATTGCCGCTGGAAGCCTGCAGCAGGGCCTTGGTGCGGGCCAGGGCCTGGCGCGGACCGGCGGCCAGGCGGCGGGCCAGCTTGGCGCTCTCGGCCTCCAGCTCGGCGGCGGGCACCACCCGGTTGATGAGGCCCCACTCCCTGGCCTGGGCGGCATCGAAACGGTCCCCCAGCAGGGCGATCTCGGCGGCCCGCTTGGCCCCCACGGCCCGGGGCAGGAACCAGGTGGCGCCGCCGTCGGGGGAAAGGCCGATGTGGCAGTAGGCCAGGGTGAAATAGGCGTTGTCGGCGGCCACCGCCAGGTCGCAGGCCAGCATCAGGGACAGGCCGAAGCCGGCGGCGGCACCGCTGACCGAGGCCACTACGGGTTTGCCCATGCGGCGCACCTGCAGGGTGGTGGCGTGCACGGCGGCGATGGTCTGTTCGAAGAGGGCCTGGCGTTCCGCCGGGGGCAGGGCCAGCTGGCTGTGGAACCACTTGAGGTCGCCGCCGGCCATGAAGTGCTCGCCGCCGCGCAGGACCACGGCGCCGACGGCCTCGTCGTGCTCGGCCCGGGCGGTGGCGGCGCGCAGGTCCTCGATCATGGCCAGGTTGAGGGCGTTGAGGGCCTCGGGGCGGTTCAGGGTCAGGGTCAGGACCCCGTCCTCCAGGTGGGAAAGCACGGTGCTCATGGGTTGTCTCCTTTGGGTGGTGGTTTTTATGGTTTTATTGCTTTGTCGTTCCGAGGAACGGGAAATCCGGTTCCGGCCGGCGGCCGGAAATGAGGTCCGCCAGGGCCGCGGCGGAGCCGCAGGACAGGGTCCAGCCCAGGGTGCCGTGGCCGGTGTTGAGCCACAGGTTGGGCAGGCGGGTGCGGCCGATGAGGGGCACGTTGGAGGGCGTCACCGGGCGCAGGCCGCACCAGTAGAGCGGGTCGCCGTCGGGGCGCAGCTGGGGGAACAGTTCCAGGGCGCGCCGCAGCAGGGCCTCGCAGCGCACCGGGGTGAGCTCCAGGTTGTGGCCGTTGAACTCCGCCGTGCCGGCCACCCGCAGGCGGTTGCCGAGGCGGGACATGACGATCTTGCGTTCGTCGTCGGTGATGCTGACGCTGGGGGCGACGCTGTCCGGGGAGAGGGCGATGGTGGCGGAATAGCCCTTGCCCGGATAGACGCAGGCCTTGACCCCGGCGGGCTTGAGCAGGGCCGGGGAATAGCTGCCCAGGGCCACCACGTAGGCGTCGGCCAGGAGCAGGTCGCCGCCGGCGACGACGCCGGCCACCCGGCCGCCGGCGCTGGCAATCTTCTCCACCGGGCAGTTGTAGCGGAACTGCACGCCCCGGGCGGCGGCGGCCTCGGCCAGGCGCTGGGTGAAGCGGTGGGCGTCGCCGGATTCGTCGCTGGGGGTGTAGTCGCCACCAGCCAGGCGCCCTTGCACCGCGGCCAGGGCCGGCTCGATGGCGACGCAGCGGGCGGCGTCCACCGGCTCCCGGTCCACCCCGAACTCCCGCATCAGGGCGGCGGCGTGGCAGGCGGCCTCGAACTCGGCGGCCTGGGTGAAGATGTGCAGGATGCCCTGGCAGCGCTGGTCGTAGTCCAGGGGCAGGGTCTGGCGCAGGGCCTGCAGCCGCTGCCGGCTGTAGAGGGCGAGGGCGATGATGTCGCGGATGTTGCGGCGGGTGGCCCCGGGCGGGCAGTTGGCGAGGAAGCGCAGGCTCCAGGCGAAGAGGGCCGGGTCGTAGCGCAGGCGGAAGAGCAGGGGGGCGTCTTCCTTGCCCAGCCACTCCAGGGCCTTGAAGGGGGCCCGGGGATTGGCCCAGGGCTCGGCGTGGCAGACGGAAATCTGGCCGCCGTTGGCGAAGCTGGTTTCCAGGGCGGCGCCGGGCTGGCGGTCCACCACCGTGACTTCGTGGCCGGCCTCGGCCAGGAACCAGGCACTGGTGACGCCGACGACGCCGGCGCCGAGGACGAGAACACGCACCCGGCCTATTCCTCCTGGCGTTCCCGGGTGATGCGCACCACTTCGGGAATCAGGCGCACGGCCCGCAGGACCCGGGCCAGGTGGGCCCGGTTGGCCACCTGCACGGTGAAGTTGAGGGTGGTGTAGAAGCCCGGATCGGGGGCCATGGAGACCTTCTCGATGTTGGAGCCGGACTCGGCGATCTCGGTGGCCACCTTGGCCAGCACGCCGCGGGCGTTGCGGGCGGCGACGTGGATGTCCACGTCGAACAGCTTGCCCGGTTCCGGTTCCCACTCCACGTCGATCCAGCGCTGGGGTTCGGCACTGCGGGACTTGCGGATGACGGCGCAGTCATGGGTGTGCACCACCAGGCCCTGGCCCTTCTTGATGGAGCCGATGATGGGGTCGCCCGGGATCGGGCGGCAGCAATGGGCCAGCTGGATGGCCATGCCCTCGGTGCCGCGGATCACCACCGAGGTGTGGGGTGCCGGTTCCGCGTTGGGCAGGGCCGCCTCGTGGGCCAGCAGGCGGCGCGCCACCACGGCGGCCAGGCGCTTGCCCAGGCCGATGTCGGTGTACACCTCCTTGACGGACTTGCTGCCCCCTTCCTTGAGCACCGCTTCCCAGCTGGCGTCCGGCAGTTCCGAGGGCGTGATGCCGAGGCCGAACAGTTCCTGGTTGAGCAGGCGCTCGCCCAGAGCGGCGGATTCCTCGTGCTGGCGGGTCTTCAGGAAGTGGCGAATCTTGCTGCGGGCGCGGCCGGTCTTCACATACGAGAGCCAGGCCGGATTGGGATTGGCATGGGCGGCGGTGACGATTTCCACCTGATCGCCGCTGTTGAGCTCGCTGCGCAGTGGCATCAGCTCGTAGTTGATCTTGGCGGCGACGCAGCGGTTGCCCACGTCCGTGTGCACCGCATAGGCGAAGTCCACCGGGGTGGCGCCCTTGGGCAGGGAGAATATCTTGCCCTTGGGGGAGAAGACATAGACCTCGTCGGGGAAGAGGTCGATCTTGACGTGCTCGAAGAACTCGGCCGAGTCGCCGGCGGTGCTCTGCAGCTCCAGCAGGGACTGCAGCCAGCGGTGGGTCTGGTACTGCAGTTCGGCGGCGCTCTTCTCCGTGTCCTTGTACAGCCAGTGGGAAGCCACGCCCTCCTGGGCCATGTGGTGCATTTCCTCGGTGCGCAGCTGCACTTCCACCGGCATGCCGTAGGGGCCGATCAGGGTGGTGTGCAGGGACTGGTAGCCGTTGGCCTTGGGGATGGCGATGTAGTCCTTGAACTTGCCCGGCAGGGGCTTGTACAGGGCGTGCAGGGCGCCCAGGCCCAGGTAGCAGCTGGGCACGTCCTTGACCACCACGCGGAAGCCGTAGATGTCCAGCACCTGGGAGAAGGAGAGGCGCTTTTCCACCATCTTGCGGTAGATGGAATAGAGGCTCTTCTCCCGGCCGAAGACCTGGGCCTCGATGCCCGAGTCCCGCATCTTGCTCTGGACCCCGTCGAGAATCTTCGACAGCACCTCGCGCCGGTTGCCCCGGGCCGCCATGACGGCCTTCAGCAGCACCTGGTAGCGCATCGGGTGGGTGTGTTTGAAGGAGAGGTCCTGCAGCTCCCGGTAGACCGTGTTCAGCCCCAGCCGGTTGGCGATGGGGGCGTAGATCTCCAGGGTCTCCAGGGCGATGCGGCGGCGCTTGTCCGGACGCATGCAGCCCAGGGTCTGCATGTTGTGCAGACGGTCGGTGAGCTTGATGAGGATGACCCGCAGGTCCTTGGCCATGGCCAGGAGCATCTTGCGGAAGTTTTCCGCCTGGGCTTCCTGGTAGGAGGAGAACTCGATCTTGTCGAGCTTGGAGAGGCCGTCCACCAGGTCGGCCACGCCCTTGCCGAAGCGTTCGGTCAGCTCCTCCTTGGAGATGCCCGTGTCCTCCATGGTGTCGTGCAGGAGGGCGGCGATGATGGCGGTGGAATCCAGCCGCCATTCGGCAATGGCCCCGGCCACGGCCAGGGGGTGGGTGATGTAGGGTTCGCCGGAAAGGCGCTTCTGGCCCCGGTGGGCGGCTTCGCCGAAGGCAAAGGCCTCCTTGATCTTGGCGATCTCTTCCGGTTTCAGGTAGTCGAGGCTGTCCAGGAAGACCCGGTAGGCCGGGTCGTCGTTGAACGGGTAGGGGGTGGGCGGCGCCGCCGGGTCAGGTGCCGGCGCGAAGGGCGCGGCGGAAGATGCAGACGGTTTGGCCGGGGCGGGGTCGGTTGCAGTATCCATACCGGTTCACCTTACCCGCCCGGGCCCGGGGTCAGGCCTGACCGCGGTTGAGGATTTCCAGGCCGATCTGGCCGGCAGCCAGTTCGCGCAGGGCGATGACGGTGGGCTTGTCCTTGCTCGGTTCCTGCATGGGGGTGGAACCATTGGCGATCTGGCGGGCCCGGTAGGTGGCGGCCAGGGTCATCTGGAAGCGGTTGGGGATTTGTTTCAGGCAGTCTTCAACGGTAATGCGGGCCATGGTCCATCCAATCAAAAAGCGGAAAATTTAGAGCAGCGAGGCGAACAGCGAAGCGTGGCGTTCCTGCTGCACGGGAAGCTTCAAGCGCGTTGCGCGCACCACGGCCAGCAGGTCGCTGAGGGCCGTCTGCAGGTCGTTGTTAATAATAACATAGTCGAATTCCCCCACATGCCGCATCTCGTCGCGGGCAGCGGCCAGGCGGCGGGCGATGACGTCCTCGCTGTCGGTGCCGCGGCCGGCCAGGCGGCGGGCCAGTTCTTCCATGGAGGGCGGCAGGATGAAGACGCCGATGGCGTCGCCGAACACCTTGCGCACCTGCTGGGCGCCCTGCCAGTCGATCTCCAGCAGCACGTCGCGGCCGGCGGCCAGCTGCTGTTCGATCCAGGTGCGCGAAGTGCCGTAGTAGTTGCCGTGCACCTCGGCCCATTCGAGGAACTCGCCCCGGTCCACCCGGGCCAGGAAATCGGCCACGTCGGTGAAGTGGTAGGCCTGGCCGTTCTCCTCCCCGGTGCGGGGCGCCCGGGTGGTGTGGGAGACGGAGAGGCCGATGGCCGGGTCGTTCTGCAGCAGCAGGCGGACCAGGGTGGTCTTGCCGGCGCCGGAGGGGGCGGTGACGATGTAGAGGTGGCCGCTCATGCTGGCATCCCTTTCCTTATTCGATGTTCTGGATCTGTTCGCGCATCTGCTCGATGAGCAGCTTCAGGTCCATGGAGGCCTTGGAGACTTCGCTGAGGACCGACTTGGAGCCCAGGGTGTTGGCCTCCCGGTTCAGTTCCTGCATGAGGAAGTCGAGGCGCTTGCCGGCGTTGCCGCCGGCCTTGAGGATGCGCTCCACCTCGGTGAGATGGGCCTGCAGGCGAGACAGTTCCTCGTCCACGTCGATACGGGTGGCATACAGCACCACTTCCTGGCGCACCCGTTCGTCGTCGGCGCTGCCCAGGGCCTCCACCAGGCGCTGCTTGAGCTTGTCCTGGTAGGCAGCCTGGGCCTGGGGAATGAGGGGGGCAACGGCGGCCACGGTGGCGCGGATCTTGTCCACCCGCTCCTGGATCATGGCGGCCAGCTTGGCGCCTTCCCGGGCCCGGCTGGCGGTGAAGTCCTCCAGGGCCTCCTTCAGGGTGGCCTGGACGGCGGCGTGCAGGGCGGCCGTATCCACCTCCGGTTCGCCCAGCATGCCGGGCCAGCGCAGCACTTCGGCCACCGACAGGGCGGCGGCGTTGGGCAGGGTCTGGCGTACCTGGCCTTCCAGGGCCTGCAACTGGGTCAGCAGGTCGGCGTTGATGGCCAGCTGGCGGTTCTGGCTCTGGCTGGCGACCAGGTTGAGGCGCAGTTCCACCTTGCCCCGGGCCAGCTTGGCGGTGATGGCTTCGCGCAGGGCCGGCTCCAGCACCCGCAGGTCGTCCACGATGCGGAAATGGATGTCGAGGAAGCGGGAATTGACGCTGCGCAGTTCCAGGTGCAGGGAGCCGCCTGCCACTTCCCGGGTTTTGGCGGCATAGCCGGTCATACTGTAGATCATGGAATTCCTTGCTGTGTGGGGTGTGCCGGCCCGGACGGCGGGCCTGGGAGAGGCTTCTTGAGGCTTCTTTACAGTCCGTTGGCAAAGCCTGACAATGCGCGCAGTGCGCCTGAGGCGCTATCTTAGCTTCGGACTTCATGGCGTCACAAGCCAATCACCCCCTCCCTGCCGGATTCCAACTGGAAGACTACCGCATCGAAAAGCAGATTTCGGTCGGCGGTTTTTCCATTGTTTACCTGGCCCACGATGCCAGCGGCAAGGCGGTGGCCATCAAGGAATACCTGCCGGCCAGCCTGGCCCTGCGCTCCGAGGGGCAGACCAAGCCGGTCATTTCCCAGGAGCATCTTTCTGCCTTCCGCTACGGCATGAAATGCTTTTTCGAGGAAGGCCGGGCCCTGGCCAAGCTGAACCATCCCAACGTGATCCAGGTGCTGAACTTTTTCCGCGCCAACGACACGGTTTATATGGTCATGGAATACGAGCGGGGGCGCACCCTGCAGGAATTCATCCAGAAGCACCACGGTCACATCCACGAGAAATTCATCCGCGGCGTGTTCACCCGCATGCTCAACGGCCTGCGCGAAGTGCACACCCACAAGCTGCTGCACCTGGACCTGAAACCGTCCAACATCTACCTGCGGGCCGACAATACGCCGGTGCTGATCGACTTCGGCGCCGCCCGCCAGACCCTGCATTCCGACACCCCCATGCTGAAACCCATGTACACCCCGGGTTTCGCCTCCCCCGAGCACTACTTCAAGCGGGACGAACTGGGGCCCTGGAGCGACATCTATTCGGTGGGCGCCTCCATGTACTCCTGTCTGGCCGGGGCGGCGCCCCAGGCGGCCGATGCGCGCATGGAGAAGGATCAGCTGCAGCCGGCCTCGGTGCGCTGGGAGGGCCAGTATTCGGACCAGCTGCTGGAGACCATCGACTGGTGCCTGTGCCTCAACCACCTGTACCGTCCCCAGAGCGTCTTCGCCCTGCAGAAGGCCCTCACCGAGGCGGTGGACATGCCGGGTCAGGGAGCCAGCAAGGCGGCGGAAAAGGAAGGATGGCTGGGCCATCTGGTGGGCAAGATCAAGGGAATGACTGCTAAATGAAATTCACCATCTACCAGGAAAGCCGCATCGGCAAGCGGCAGAACAACGAGGACCGGATCGCCTACTGCTACTCGCGGGAGGCGGTGCTGATGGTGGTGGCCGATGGCATGGGCGGCCATTACCACGGCGAGGTGGCCTCCCAGATCGCGGTGCAGACCCTGACCTCGGCCTTCCAGCGGGATGCCCAGCCGGAGATCGCCGATCCCTTCCTCTTCCTGCAGAAGGGCATGACCAATGCCCACCACGCCATCCTGGACTATTCCCAGGAGCACCGGCTGAAGGATTCGCCGCGCACCACCTGCGTCGCCTGCCTGATCCAGGACAACATCGCCTACTGGGCCCACGTCGGCGATTCCCGCCTCTACCACATGCGCGACGGCAAGGTGCTGGCGGTGACCCGGGACCATTCCCGGGTGCGCCTGCTGATGGACGAGGGCCTCATCAGCGAGGCCCAGGCCGCCACCCACCCGGACCGCAACAAGGTGTACAGCTGCCTGGGGGGCGAAAACCCGCCGGAAATCGAGTTCTCCCGCAAGACCCCCCTGGAAGTGGGGGATGTCCTGGTGCTGTGCACCGACGGCCTGTGGGGGCCGCTGCCGGCCGATGTCATGGCCGCCTCCCTGAAGGGGGCCAACCTGATGCAGGCCGTGCCCATGCTGCTCAACCAGGCGGAAATCCGCTCCGGCCCCTACGGCGACAATCTTTCCGTGGTGGCGGTGCGCTGGGAGCAGAGCTACAGCGAGGAGGCCTCCAGCACGGTGATGACCCAGACCATGCCCCTGGACGCGGTGACCACCAAGCTCGGCGAATTCGGTCGGGACCCGGCCTACAAGACCGATCTTTCCGACGACGAGATCGAAAAGGCCATCGACGAAATCCGCGCCGCCATCCAGAAATTCTCCAAATAAGGAAGTTCCATGCGTCCCAGCCAACGTGCCGCCGACCAGCTGCGCCAAGTCCGCATCACCCGCCGTTTCACCCGCCATGCCGAAGGTTCGGTGCTGGTGGAAATGGGCGACACCAAGGTGCTGTGCACCGCCAGCATCGAGGAAAACCTGCCGCCCTTCCTGCGCGGCAAGGGCCAGGGCTGGGTCACCGCCGAATACGGCATGCTGCCCCGCTCCACCCACACCCGCAGTTCCCGGGAAGCGGCCAAGGGCAAGCAGACCGGCCGCACCCAGGAAATCCAGCGCCTCATCGGCCGTTCCCTGCGCGCCGTCACCGATCTCAAGGCCCTGGGCGAGCGCCAGATCACCCTGGACTGCGACGTGCTGCAGGCCGACGGCGGCACCCGCTGTGCCTCCATCACCGGCGCCTGGGTGGCCCTGTGGGACGCCTGCCAGTCCCTGGTGGCCGCCGGCAAGCTGAGCGAGAACCCCCTCAAGGAACACGTGGCCGCCATCTCCGTCGGCATCTACAAGGGCACCCCGGTGCTGGACCTGGACTACCCGGAAGATTCCGACTGCGATACCGACATGAACGTGATCATGACCGGCAGCGGCGGACTGGTGGAAGTTCAGGGCACGGCCGAAGGCGAGCCCTTCTCCCGGCAGCAGATGAATGTGCTGCTGGACCTGGCCGAAGCCGGCATCCGCCAGCTCATCCACGCCCAGGAAACCGCCCTGGCGGATTGATTCGGAGCCCGTCATGGCCCAGGAAACCTCCCGCGACCCGATCAAGGCCCTGCTCGACGATCTGGAACAGTCCATCGCCGATTTCGATCAGCGCCTGGGTGGCGTCGAGGAGTCTCCTGCCGTGACCGGTCTGCGTTCTTCCGGGCAGCGCTATCCCGACATCGAACCCGAGGCCAGGCGTCAACTGTCTCCTGCCGCTCCTGTTGCCGTTGCCGGCAATGCCGACGCAACCGCTGTGTCCGAAGCGCCGGCGGTGGACCTGCTGGCCGAACTGGCCCAGGCGGCGGCCTGCCGCAGCGTGGATGATGCGGAGACCCAGCGTCGCCAGCTGGAACTGACCGAGCGCCTGCACCAGGACCTGAAGACCGTCTTCGACTACCTCAACCAGCTCATCCGCCACGCCAACACCCTGAAGCCGGTGCTGCCCCGCAGCTACCGGCTGGATGCGCGCAACAGCTTCGACGGGCTGGCCTGGCATGACGGATTCGTCGATTACCGCAGCACCAGTCGCTTCGACCGCAGCTACTACGAGCAGATTCTCTTCCAGGTGAGCTACCGGGCGCCGGCGCCGCTGGTCGCGGTCTGCGCTGCGGACCAGGCCGCCATCGTGCGCAAGGAGCTGGAACTGGTGAACCTGCGCATCCAGCGAGAAGAGCCGGTGATGCTGCCGGAGGGCGGCCCCGGGGTGCGCTATGTGCTGCCGGATGCCATTCCGCTGCATCTGGCGGTACAGGCGGACTTCGCCAACGATGCCCTGACCTTCCGCTGCCGCAATGCCGGCAATTTCGGCCCTACTGCCTACCGTCTGCCGGGCGGGAGCATCACCCGGCCCCTGCTCGACGGCATCGGCCTGGTGCTGCTGGGCCGCAGCGACACCATGCCCAAGGAACTGCAACGCATTCCCTACCAACGGATCAACTGAGTCCCATGCAAAAGATCGTCCTCGCCTCCAACAATGCCAAGAAGCTCAAGGAACTGTCAGCCCTGCTGACACCCCTGGGCATCCAGCTCATTCCCCAGGGCGAGCTGGGGGTGCCGGAGGCGGAGGAGCCCCACCACACCTTCCTCGAAAACGCCCTGGCCAAGGCCCGCCATGCGGCCCAGCTGACCGGCTTGCCGGCCCTGGCCGACGACTCCGGCCTGTGCGTCAAGGCGTTGGGCGGCGCTCCCGGAGTGCAGTCGGCCCGCTACGCCGGCGAGCCCAAGTCCGATGCCCGCAACAACGAGAAGCTGCTGGCGGCCCTCACTGGCGTGGCCGACCGCCGTGCCCACTTCGTCTCACTGCTGGTGCTGGTGCGCCACGGCGACGACCCCCAGCCCCTGGTGGCCGAGGGCGAGTGGCACGGCGAGATCATCGACCAGTACCGGGGCGAGGGAGGCTTCGGCTACGACCCCCTGTTCTACGTGCCAGCGGAAAAGGCGACGGCGGCCGAACTCTCCGCCGAGGTGAAGAACCGTCTCTCCCATCGTGGCCAGGCCATGGCCCGGCTGCTGGAACGCCTCAAGCTGGAACTGTGAGCCCGGCCGGCGCCGGGAAACCTTCTGGCGCTGCCGCGGTCCAGGGCAGGGTGTTCCGGACTTGCCGCTCCGCAGTGGCCTGACTCAAAGCGCCATTATTCAGACATCGCATCATGCCGCCTGCGGGCGGAAAGGTTTTCTTTCGTGTCGTCCCGTTCTTCCTCCCGCATCATTCCCATCGCCGTCGCCGGCGGCACCCGCGCTGGCGGCAGTCCCCTGCACTTCACCAGTCCTCCGCCCCTCTCGCTCTACATCCACGTGCCCTGGTGCGTGAGGAAGTGCCCCTATTGTGATTTCAATTCCCATGAGGCGCGGGCGGAGAACGACGAGGCCGCCTATGTGGCAGCCCTCGTTGCCGATCTGGAAAGCGCCCTGCCGTCGGTGTGGGGACGCAAGGTATCCACCATTTTCATCGGCGGTGGCACGCCAAGCCTGCTCTCCGGCGAGGCCCTGCACGAACTGCTGAATGCGGTGCGCATGCGTCTGCCCCTGCTGCCCGAAGCGGAGGTGACCCTGGAGGCCAACCCGGGCACCGCCGAGGCGGGCAAGTTCGCCGCCTTCCGGGCCGCCGGAGTGAATCGTCTGTCCCTCGGCATCCAGAGCTTCAACGACCGGCACCTGGAGGCTCTGGGCCGCATCCATGACAGCGCTGAGGCCAGGGCCGCCATCGAGTTGGCCAAAGCCCACTTCGAGCGCTTCAACCTGGACCTGATGTACGGCCTGCCCCAGCAGTCCCAGGCCGAAGCGATGGCAGACCTGGAGATGGCCCTCTCTTTCGCGCCGCCCCATCTTTCCTGCTACCAGCTGACCCTGGAGCCCAACACCCTCTTTGCCGCCCGGCCGCCCCAGCTGCCCGAGGGCGACACCTGCGCCGACATGCAGGACGCCATCGAGGCCCGCCTGGCTGCCGCCGGCTACGTGCATTACGAAACTTCGGCCTTCGCCCGGCCCGACTACCAGTGCCGGCACAACCTCAACTACTGGACCTTCGGCGACTACCTGGGCATCGGCGCCGGGGCCCACGGCAAGCTGACCCTGCCGGACCACAGCGGCTTCTCGGTGCAACGCCAGATGCGCTGGAAACAGCCCAAGCAGTACCTGGAGCAGGTGGCCGCCGGCCAGCCGGTACAGGAGCAGCACGGCGTGGGGGCGGACGAGCTGCCCTTCGAATTCCTCATGAACGCCCTGCGCCTCAACCAGGGCTTCGATCCGGCCCTCTTCGAGCAGCGCACCGGCCTGCCCCTGCTGCTGGTGCGGGGCGAGCTGGAAAAGGCGGCCCGGGAAGGGCTGCTGACCCTGGCACCGGACTGCATCGCACCCACCGAGCGGGGCCGGCGCTTCCTCAACGCCCTGCTGGAACGCTTCCTGCCGGATGCCTGAATAGGCTATGCCGCAGAAGTAGGAAGGACGCCATTCTTCACCACGGCACGGTGAAGAATGGCGTCCTTCTTGTCTTCTGGCTGAGGTGCCTTAGAAACCCTGGGTATAGGTCAGGCGCCAGATGCGGGGTTCTGCCGGGTGGCTGTGCACATCCTCCACTTTCGCTGCTTCGCCGCGCAGTTGGGACTCGTAGTAGTAGTCGATGTCGCTGGCCTTGCGGTTGAACAGGTTGAGCACTTCCAGGGTCAGCTGGCTCTTGGCGGCCAGCTTGTAGCCCACATTGAGATTGACCATCACCGAGCTGCCGGAGCGCACAGAGTCGTCTTCTTTGAGGGCCCGGGGGCCCAGGTAGCGCAGGCGCAGGCCGCCGCGCCAGGGGCCCAGGTCGTGCACGGCGACACCGACGGAGGCGGTGCGTTCGACGGCGCCGGGGACGTGGTTGCCAACGCTACTGTCATCGCGGAAGCGGGCCTTGGAGAGGGCGATGTCCGCATCCAGGGTCAGCCAGTCTCGGGGCGTCCAGTAGTTGGACCATTCCATGCCCTGGCGGTGGCTGGGGCGGCTGGCCTGGGTGGTGCCGGCGTCGCCGACGAAGAGCAGTTCCGAATCCAGGTCCAGGCGCCACAGGGCGACGCTGGTGTTCCAGCCGGGGGCCGGGGCGCTGCGCCAGCCCACTTCCTGGCCCCGGGATTTGACCAGGGCCGGCACCCGGGACATGGGGTCGCCCGGGTTGGAGGGATCGACGCGGATGGTGGTGCCGCGGGCGTCGTTGCTGTGGAAGCCCTGGCCCCAGTTGTAATAGAACTCCTGGTTGGCGAAGGGGCCGAAAATGAGGGACAGCTTGGGGCTGGTGATGCCGTCGTTTTCCTTGCCGGAATTGGCGGCCAGGCTGGAATCCACCTTGAAGCGGTAGCGGTCGTGGCGCAGGCCGGCGACGCTGCGCAGCCAGTCGCTCCACTGGGCGCCCCACTGGCCGTAGAGGCCCAGGCTGCCCTGGTTCACGCTGTCGCTGCGCACGGTGGACAGGCGCTGGCGGGCGGCGGTGCGGTACAGGCCCACGTTGTCGATGTCGTCCTGGCGCCCCTGCACGCCCCAGGTGAAGTCCCCTTCCTTGCCCAGCCACTGCACCGGCTGGCTGCGGCTCCAGCCGAAGCCGCCGTAACGGCGCCGGTCGGCCTGCTCGAACTGGTCGCCGTTGACCGGATCGTCCATGGCGTAGGTGAAGTTGGAGAAGAGGTTGAGCCGGTAGTCCACCAGATAGGTGTTGGCCCGGGTCTGCACCGCCCCGTCCTGCCGCGCCCACTGGCCGGAGAGGGAGAGGCGCCGGGTGTTGCCGCCGGCGGTGGGGTCCAGGCTGCCGTAGCGGTTCACCAGGCCCTGGTCCACGGCCCGTCGCGCCAGCTGGTCGGTGGAGGTCCAGTCGCCGTCGTAGGCCATGAAGGCCAGCGAGTGGCCGTTGTTGCGCGTGCCTTCCGAATAGCGCAGCACGCCGTTGAGGCGCTTGTAGTGCTCCGGTACTTCCCAGGGACCATCGTTGTGGAAGACCTCCACGGCGCCCAGCCAGCGGCCGCCGCCGGCGGTCTCCTTGTCGGCGGCGGTGAGGAGGCGGCGGTAGCCGTTGCTGCCGAGGCCGATGCTGACGTAGTCCTCCGGCAGGGCGCGGCGGTAGTCGATACGGGCGCTGCCGGCGGAGGAGAAGTCCCCGTCCTCGGCGGCGTAGGGGCCCTTCTTGTACTGGATGCGCTCCACCAGCTCGGGGATGAGGAAGTTGAGGTCCAGGTAGCCGTGGCCGTGGGCGTGGGTCGGCAGGTTGATGGGCATGCCGTCGATGGTGACGGAGAAGTCGGTGCCGTGGTCCAGGTTGAAGCCGCGCAGGAAGTACTGGTTGGCTTTGCCGTCGCCGGCGTGCTGGGTGACGATGAGGCCGGGCACGGTTTCCAGCACTTCCGCCGGGCGCAGCAGGGGGCGGTTCTCCAGCTGCTTCGCTGTCACCGTGCCGACGCTGGCGGCGTCGGCGACGCCGATCAGGTCCTGGGCGCCGGCCTTCACTTCGATCACGTCTTATGTAATCCAATTCAGTGTTGCGTATAGCCAAGTTGCTTCTGCTGAGCGGGATTACATCAGGGGATACGGTTTTCTAGTAGCCTAAAGAGAAAAACGAACAAACCGAAAAATTCTGACTATTGTCAGCTGACACTATCGGAATGATGTGGTTATGTGGCGCCCTTCTAATAAGGCCCTGCATAGCCAATGGCACATGATACCAATCCTACGTTGCGTGAAGCCCTTTTCATGTACTGCGAGCGCATCAGTATCCATAAGAAGGGCCACGCACAAGAGAAATATCGAATTAACCTATATTGTCGTTATTCCATTGCTGATCTTCCGATTCGCAATATAACGTCAGTCGATGTGGCGACATTTAGGGATGAGCGATTAGCGGAGATTAACGCACGAACGGGTAGGGCACTTTCCCCTGCTACGGTTCGGCTGGATCTGGCGCTGCTTTCCGACCTGTTTCGCATTGCGAAGAACGAATGGGGTATATGCAACGATAACCCTGTCGCCAACGTCCGTAAGCCAAAACTTCCGCCTGGCCGTGATCGACGCTTGGCTCCTCGTGAAGAACGGATGATCATGAGGCATTGTTCCCAGCGGGGCGCGCATGAGATGAAGGCCATTGTCCAATTGGCATTAGAAACTGCTATGCGTCAGGGGGAGATTCTGGGGGTGTGCTGGGAGCACATCAATCTGAAATCCAGAATTGTTCATCTGCCCGACACCAAGAATGGTTCCAAACGTGATATCCCGTTAAGCATGGAGGCTAGGGATATCCTGGCGGCCCAGAGGGTGAAGCTGTCGGGGCGAGTCTTTAGCTATACGAACAACGGATTGAAGAGCAGTTGGCGAAGCATGATCAAGAGGCTGAATATTCCTGATCTGCATTTCCACGATCTTCGGCACGAAGCAATCTCTCGCTTGATGGAACGAGGTGTCTTCAACCTGATGGAAGTTGCTGCCATCAGCGGACACAAGAGCCTGTCCATGCTGAAGCGATATACGCATCTTCGTGCTCAGCGTTTGGTGCGTAAGCTCGACGCTGGCGCAAACAAGGGGAAGGCTGCGGTCTTGAGCTACCTGGTTCCTTATCCAGCCTTCATCGAGCCCTATGAGAGCCAGGTAAAAGTAACCTTCCCGGACTTTGACGATCTGCATGTGGCAGGGCCATGTCTAAACAGTGCAGTACAGCAAGCTCAGGATGCCCTATTACGGGAAATTTTGGTCTTGATGCGTCAAGGTCGGCCGATCCCGCCGCCAAACAACTACCTAGAACTCCTCGATGAATCCAGGCTCTTTCACCTGGACCCGTTGGCAACCTATGATTCCCTCGCGGATCTTGCCGAGGGCGCGCTGGTTTGAGTTCTGATGACCTTGCGAGGGAGGAGGAGTCCCTACTGGTGTGGATGAGCGGGTTGAAAGCAGGACCTTATCTAACTACCGCTTGAATTGGGGTGACAGTCATCTAGTGGCAAATCCACAAAAAAGTTGCGCTTGAAGTGAACAACTAAGTTCGTGCATACTGAGAATATGTATCAATCTCTTCGCCACCGTTTTCGTGCATATGTGCAGCACCCTATCGGGCGCTGCGCGGTGGTATTGATGCTGTTTGCCTTGGTGGCAGCCAGTGTTCCGTTTGGTGAGATTCATGCCCATGCGGATGGTGATCACGATCATGATCACGGTTACGTCACTGCTGAATTGACGAAGGCATCGCTCTCCGATCCTTCAGACTCTATGGACTCCGATTCTGATTCGACCGGAGCCAAAGTGCTGCATGCACACGGTTCCGTTGTCACTCCTCCGCCCTTGCCGGTGGATGGACTGGGAATCGAGCCATTCATCTTTCCCGCCCGGGACAAGATTACCCTCGCCTACTTGTCGCGGCCTTCTGCGACACCACTTCCCCCCTATCGCCCTCCAATCGCCTGACGCCTAGCGCCGCTTTTTAGCGGCTTGCTTTGTGTCGTCTGTCGATATTGGAGGTTTTCCGTGAACTTGCGTTGTTTGCTGGTTCTGGCTGTAGCCGGAACCTGCGGTATCCCCTTGTCGGGGTATGCCGCTGAATCCTTGCGCCTGGAGGAAGCAGTCTCCCGCGCCTTGGCATCCCACCCCTCACTTGCGGCCGAGGCCGCGCAATTGAAAGCCGTTCAGGCACGCGCTCAGCGTGAAGGCCTGGCAACGCCCTTTATGATCGGCGCCGATGTGGAAAACGTCGGCGGTACTGGAGCCTTTCGGGGGGGGCAATCAGCTGAAACCACGCTACGTATTGGCCGCGTCATTGAACTGGGCGGTAAACGTGAAGCGCGCCAGGCATTGGGTAGCGCTGAAATCAATCAGCAACAGAACCTGTCCGAGGCAACCCGCCTGGATGTCATCAGCCGCACCTCACTCCGCTTCATTTCAGTGCTGGCTGACCAGCAACGGCTGAAATACGCTCAAGAGCAGGTAGGACAAGCCGAGCGCACACGCCGCGAGGTCGCCAATTGGGTAGCCGCTGCCCGCAACCCGGAGTCAGATTTGCGTGCGGCTGAAATCGCCGTTGCTGACGCCGAGCTGGAGCGCACCCGGGCCGAGCACAAGCTGACCTCTGCCAGGTTAACCCTGGCCTCCAGCTGGGGGGTCTTAACACCCGATTTTGAGACGGCTGCAGGCAATCTGCTCGTGCTGCCCAAAGCGGAGTCGCTGGATACCTTGGTGGCTCGTCTGCCGATGACACCAGAGCAACGTGCCGCATTGCTCGAGGCGGATAGTATCGCTGCTCGCAAGCGCTTAGCCGAGGCCGGCGCCAAGCCGGACGTTACCGTCAATCTGGGTGTGCGTCGCCTTGAGGCAACCAGCGATCAGGCATTGATGATGTCGGTATCGATTCCACTCGGCAACCAGGTTCGCTCGGGACTGTCCGTCGCCGAAGCCAATGCGCAACTGATGGCACTGGAAGCTCGCCGCGATGCTCAGCGTTTCGAGCACTACCAGTCGCTGTTTGGAAAGTATCAAGAACTCAATCAGGCCCGTACTGAAGCTGAAACGCTGCAAAAGCACATGCTTCCCAAGGCCGAGGAGGCACTGGCCTTCACCCGGCGCGGCTTCGAAGCCGGCCGCTTCTCCTTTCTTGCCCTGGCACAAGCGCAAAAAACCCTATTCGAACTGCGCCAACGCGCTGTCGATGCTGCTGCTCGCTGCCAGATCCTGATGACCGAGGTGGAACGCCTCACCGCCATTGCCCCGGAACCCACGCCATGAACCGACTATTACCCCTGATTCTCGTGCCTCTGCTGCTGACAGCTTGCGGCAACGATACCCCTCCCTCCGCTGTGGTTGCTGCGGAAAAAGCCAGTGCTGCCGAAGAGTACGAGCGTGGCCCCCATCGCGGCCGGATGCTGCGCCAGGGTGACTTCGCTCTCGAAGTGACCATCTATGAAACCAATGTGCCGCCGCAGTATCGGCTGTATGCCTACCAGAACGGCAAGCCTTTGCCGCCGGCCAGCGTGCAAGCCGCAATCCAGCTCAAGCGCCTGGATGGCGAATTCAACAATTTCACCTTCACGCCGGAAAAAGACTACCTGAACGGCAGCAGTGAAGTCATTGAGCCCCATTCATTCGATGTCGAGGTCAAGGCCCAGCATGCCGGCCAATCCTACAGCTGGGCGTTCCCCTCGTATGAGGGGCGCACCACGATTCCGGCGGCTGCCGCAAACGACGCAGGGGTTAAGGTCGAGAAGGCCGGTCCGACAACAATCCGCAATACAGTGCGGCTGATGGGTGCTGTGATGGTCGATGCGAATCGGCGTGCCGAGATCAAGGCCCGCTTCCCGGGCATCGTACGCGCGGTCAATGTCCAGGAAGGGCAGCGTGTCAGTCGTGGCCAGACGCTGGTGGCGATTGAAGGTAACGACAGCATGCGGACCTATTCCGTTGTCGCACCGTTTGACGGCATCGTCTTGGCGCGCAATACCAACGTCGGCGACGTTGCCGGCAGCAACACCCTGGTTGAACTGGCGGATTTGTCCAGCGTCTGGGTGGAATTACGGGCTCTCGGTGGAGATGCGGAGAAGCTGTCCGTGGGCCAGGAGGTCGAGATTTCCTCGGCCACCGGTGGCAGCCGGGTCACCGGGAAAATCCAGACGCTGCTGCCCCTGGCCTCCGGGCAAAGCGTGGTGGCCCGTGCCAGCATTGCCAACCCTGAAGGGCGGTGGCGGCCGGGTATGGCGGTCTCTGCGGATGTCACCGTGGCGGCACGCCAAGTCCCGCTGGCGGTGAAGGAATCCGGCCTGCAACGCTTCCGTGATTTCACCGTCGTCTTTACCCAGGTAGGGGACACCTACGAGGTCCGCATGCTCGAGCTGGGTGAGCGTGATGGCCGCTACGCCGAAGTGCTGGGCGGGCTGAAGCAAGGTGCTACTTATGTAGCTGAGCAGAGCTTCCTCATCAAAGCCGACATAGAGAAGTCCGGCGCCAGCCACGATCACTAAGGGATTTGCCATGCTAGAACGAATGATTCGTGCGGCAATCGCACATCGCTGGCTGGTCCTGATACTGGTTCTGGGCACCTCCGCACTTGGTGTCTGGAGCTATGGTCGCCTGCCGATCGATGCCGTCCCCGACATTACCAATGTCCAGGTCCAGGTCAATTCCGAGGCCCCCGGCTATTCGCCGTTGGAGGCAGAGCAACGTGTCACCTTCCCGGTAGAAACCGCCCTGGCAGGTATGGCTCGCCTGAAGTACACCCGCTCGATTTCGCGCTATGGACTGTCCCAGGTCACCGTGGTGTTCGAGGACGGTACGGACATCTACTTTGCCCGACAGCAGGTGAGCGAACGTCTGCAACAGGCGTCTTCCCAATTGCCGGCTGGCGTCAAACCGACCTTGGGACCGGTGGCGACAGGGCTGGGTGAAATCTTCATGTATACGGTCGAGGCCACACCAGGGGCTACCAAGGCGGATGGCAAACCCTGGATGCCTACGGATTTGCGAACACTGCAGGATTGGGTGATTCGCCCTCAGCTGCGTAACCTGAAAGGTGTCACCGAGGTCAATACCATCGGCGGCAACGTGCAGCAGTTTCATGTCACCCCCGACCCGGCCAAGATGGTGGCCTACAAGTTAACCATTGATGACCTGCTGCAGGCCATTGAACGTAACAACGCCAATACGGGCGCCGGTTACATCGAACGGGGTGGTGAGCAGAACCTGATCCGCATTCCTGGGCAGGTGGGTGATGAGGCTGGTTTGCGAGAGATCGTGGTGGCAATGCGTGACGGGCTGCCCTTGCGAATTAGCGACATAGCTACGGTCCAGATCGGCTCGGAACTGCGCACCGGTGCCGCAACCAGGGATGGCCGGGAAGTGGTGCTGGGCACGGTATTCATGCTGATTGGTGAGAACAGCCGGGAAGTCGCCATGCGTGCAGCGACCCGCCTCAAGGAAATCGATGCTTCGCTACCGGAAGGGGTCAGTGCGCGTGCGGTTTATGACCGCACCCAACTGGTGGACCGCAGTATTGCCACGGTCCAGAAGAACCTCCTCGAGGGAGCCTTGCTGGTAATCGTGGTTCTTTTCCTGCTGCTGGGCAATATCCGTGCGGCACTGATCACGGCGGCCGTGATCCCGGTTGCCATGCTGATGACCATCACCGGCATGGTGCAGAACCGGGTATCGGCCAACCTGATGAGCCTTGGGGCCTTGGACTTCGGCCTGATCGTCGATGGCGCCGTGATCATCGTCGAGAATTGCCTGCGCCGCTTCGGTGAGCGGCAGCACGCCCTGGGCCGCTTACTGTCCATCGAGGAGCGCTTCCAACTAGCTGCGAAAGCAAGTGCCGAGGTAATCAAGCCTAGCCTGTTCGGTCTGTTCATCATTGCCGCTGTTTATCTGCCGATCTTTGCCCTCAGCGGGGTCGAGGGCAAGACTTTCCATCCCATGGCTATCACTGTGGTCATGGCGCTGGTTGCCGCAATGGTGTTATCCCTGACTTTCGTGCCGGCGGCCATCGCACAGTTCGTCACCGGCAAGGTCGAGGAAAAAGAAACCCGCCTGATGCAGCGGCTGCATGGGATTTACGCTCCTCTGCTGGAGAAGTCCCTATCGCTGCAAAAGCCGGTGATTGGCGCCGCTGCAGTGCTGGTGGTGCTGTGTGGATTGTTGGCGACTCGCCTGGGTACGGAGTTCATCCCCAACCTGGATGAGGGGGATATTGCCCTGCACGCCCTACGCATCCCGGGTACCAGCCTGACCCAGGCTATCGGTATGCAGGCCCAGCTCGAAGCACGGATCAAGCAGTTCCCGGAAGTAGACAAGGTGGTGGGCAAGCTCGGCACGGCAGAAGTGGCCACCGACCCGATGCCGCCTTCTGTGGCCGATACTTTCATTCTGCTCAAGGAACGCAAGGACTGGCCGGACCCGCGCAAGTCCAAGGCTACCCTGGTGGCGGAGCTGGAGGAAGCTGTTCGTGCCATCCCCGGCAACAACTACGAGTTTACCCAGCCGGTACAGATGCGGATGAACGAGTTGATTGCCGGTGTACGTGCGGAAGTGGCGATCAAGGTATTCGGCGATGACCTGCAAGCGCTGACCGCGGTTGGCAAACAGATCGAGAAAGTCGCAGGCAGCATTTCCGGAAGTGCCGACGTGAAACTTGAGCAGGTGACCGGCCTGCCGCTGCTGGTCATCAAGCCGGATCGTGCCGCCCTGGCCCGCTACGGCCTGGCCGTGGCCGACATCCAGGACACCGTATCCGCGGCGATGGGTGGGGCAACGGCTGGCCAGCTTTTCGAGGGGGATCGCCGTTTCGATATCGTGGTGCGTCTCCCCGATGCCCAGCGCCAGGACCCGAAGGCACTGGCAGCGCTGCCCATTGCCCTGCCGGCGACAAGCAGAGCCGATGGAGCTTCGTTGTCGCGGATGCCCGGCGTGGTGCCCTTGAGTGCCGTGGCCACTATTGCGGTAGAGCTAGGGCCCAACCAGGTCAGTCGGGAAAACGGTAAGCGGCGCGTGGTCATCACGTCGAACGTGCGCGGCCGGGACCTCGGCTCCTTCGTGGAGGAACTCCGGGGGAAAGTTGCGGCGGAAGTCGTGCTGCCTGTCGGAAGCTGGGTCGAATACGGCGGCACCTTCGAACAGCTGATCTCGGCCGGCCAGCGTCTGAGCGTCGTGGTTCCCGTGGTCCTGGTCATGATTTTTGGCTTGCTGTTCATGGCCTTCGGATCGGCCAAGGATGCCGCAATCGTGTTCAGCGGCGTACCCCTGGCGCTGACCGGTGGCGTACTGGCCCTGTGGCTGCGCGGTATTCCCTTCTCCATCTCAGCCGGGGTCGGATTCATTGCGTTGTCTGGCGTTGCGGTACTCAACGGCTTGGTGATGATCACCTTCATACGGAAGCTGCGTGAGCTTGGGCAACCGCTACATACCGCTGTGACCGAGGGGGCGCTGACCCGTCTGCGCCCCGTGCTGATGACCGCACTGGTTGCCAGTCTTGGCTTCGTCCCCATGGCCCTCAATGTCGGTACAGGTGCTGAAGTGCAGCGCCCACTGGCAACCGTGGTGATCGGCGGCATCATCTCTTCGACCCTGCTGACCCTCTTGGTGCTCCCGGTGCTGTACCGGCTGATACACCGGAATGAGAACGAGGAGACAGCCGCGTGACCCCTTCCCCCATTCTATTCAACAAGGAGTTTCCTTTGCCATTTCGCCAGTTCAGCGCCACCGGCATTTGCCGGTGGACCGTGGCGCTCCTCGCGTCACTGCTGCCCCTGTGGGCTTTCGCCCACGGGGTCACGGGAGAGGATCAGTCCTTTCTCGAGCAGAACACCGGCCGCAACCTGCTGTTGTTCGCCTACCTGGGAGCCAAGCACATGGTCACCGGGTATGACCATCTGTTGTTCCTGTTTGGTGTGGTGTTCTTTCTGTACCGCATGCGCGACGTCAGCATTTACGTGACCCTGTTCGCCGTCGGACACAGCGTGACCCTGCTGCTGGGGGTGCTGGGCGGTTTCCACGTCAATCCCTATGTCGTCGACGCAATCATCGGCGTCTCCGTGGTTTACAAGGCGCTGGACAACCTGGGGGCATTCAAGCACTGGTTGGGATTCCAGCCCAATACCAAGGCGGCCGTACTGGTCTTCGGCTTTTTCCACGGTTTCGGCCTGGCCACCAAGCTGCAGGACTTCTCGTTGTCCCGCGATGGTCTGGTGCCGAACATGCTGGCCTTCAACGTTGGCGTAGAACTTGGCCAATTGCTGGCACTGGCTGGAATTCTGATCGTCATGGGGTTCTGGCGCCGCAGCACAGCCTTCTCCCGGCAAGCATTCACCGCCAATACCGCACTCATGGCTGCTGGCTTTGTCTTGGTCGGCTACCAACTTACCGGCTATTTCGTTTCCTGATCGAGGTCTTCCTATGTCCAATACTCAAACCCACTCCCTGCCCAGTAGTGCCAGTCTGTTCAAGGCAACCGCGGTAGCCGCAGGTGTTGCCGCCACCTTGCTGGTGACCATGGTGCTTCCTGCGGAATATGGGATGGACCCTACCGGCATTGGCCGTTTCCTCGGCCTCGATGCCCTTAAACAGTCTGCCGGTGCTGAAACAACATCTGTTCTGGCAACCCCGGATGCTATTGCTGGCCCCAATGCAATGCTTGCCGCCAAAGCAGATGCTGCTTTCGGAAAGCAAGCCGGCAGGTCCTTGGATGCCTCCGCCGTTTCATTGGCAGGTGATGGCCCCATGCGTCGCAACACATTCACGGTAACGCTGGCTCCCGGCAAAGGCGCAGAGGTCAAAGCGCACCTCCGGGCTGGTGAAGGCCTGACCTTCCACTGGCAAGCAACCGCCGCGGTGGCCGTGGATATGCACGGCGAAGCACCGAATGCCAAAAATGCCTGGACCAGCTATTCGGTCGAAAGTGCTCAAAAGAGTGCATCCGGCACTTTTGTTGCCCCCTTCGAAGGAAGTCACGGTTGGTATTGGCAAAACCGCGGCACCGAGCCGGTGACGGTATCCATCGAAGCCTCCGGTTTCCAATCCGAGTTGTATCGGCCGTAACGAAGCTTTCTTACACGGCCCCGTTGTATTAACCCCGCGGCTTGCCGCTTTCACATTGGAGATGTACCCCGTGAAAAACAAAACCTTCCTGTCCCTCTCTTTGCTGGTCGGGTCCTTCATGTCGCTTTCCAGCGTTGCCTATGCCCACGGTGTCCACGAAGACAGTGCCGAACCAAAGGCCACGCCCACTGCTTGCCGGCACCTCACCGACACCGAGCATTACGTGGTGGATCTAAAGGACCCCGCAACCCGGGCGCTCAAGACCCGTTGCGATGCCACCAAGAAGCCTGTAACCCCGGTGGCCGAGAAGAAGGACGAAACACCGGATAAGAAGTAACCCCTCCTGATAGTTCATTGTCGCCCGTAGAGACATAGGAAATAGCCATGCTTGAAATCCTCCGACATCGCAGTTTTAGGCATTTGTTTCTCGCCCAAGTCGTTGCATTGGTGGGGACGGGGCTTTTGACCGTGGCCCTGGCATTGCTGGCCTATGATCTGGCAGGCGCCAATGCCGGTGCGGTACTGGGTACCGCACTGGCCATCAAAATGATCGTCTACGTCACGCTTTCGCCTGTAGCGGGGGCTGTTGTCCCTGCGGCATGGCGAAAGCGTGTCTTGGTCGGCCTAGATTTGATTCGAGCGGCGGTGGCATTGCTGCTGCCGTTCGTCACCGAAATCTGGCAGGTCTATGTGCTGATTGCGCTGCTGCAATCAGCCTCAGCCTGCTTTACCCCGCTTTTTCAGTCGCTTATTCCCCAGATTCTGCCGGAGGAAAGCGACTACACCCGCGCGCTCTCCCTGTCGCGGCTGGCCTATGACCTGGAAAGCCTGCTTAGTCCGGCCCTGGCAGCGGCATTGTTGGTGGTCATCAGTTTTCACGGGCTGTTTGCCGGCACCTCCGTCGGCTTTGTTCTATCTGCACTGTTGGTCATGAGCACGGCATTTCCCGTCGTGCCAGAAACCCGTTTGGGAGATGGCCCCTACAGTCGAGCCCTCCGAGGCATGCGGATTTATCTACACACACCGCGACTACGCGGACTTCTGGCGTTGAACCTATGTGCCGCAAGTGGGGCCAGCATGGTTTTCGTGAATACCGTAGTCCTTGTCCGCGAGGTGCTGGGAGGCGGTGAACGGGAGGTGGCATGGGCTCTGGCAGCTTTCGGTGCCGGGTCCATGGCCGTGGCCTTTTCCTTGCCAACATTGCTCGACCGTATGGCGGATCGTCGAATCATGCTGAGCGCGGCATCAGCAATGGTCGTTGTGCTACTGGCGGTAACGGGGGTTTGGTGGAGTACCGGGGGTTTGGGCTGGGCAAGTCTCATTCCCGCATGGGTGGTTCTGGGTATGTCCTACGCAGGCCTGGTAACACCCGGGGGACGACTATTGCGGCGCTCGGCCCAATCGGACGATCTACCCTTTTTGTTTGCCGCCCAGTTTTCACTCTCACACCTGTGTTGGCTTCTGGCTTATCCACTGGCGGGATGGCTTGGAGCGCGGCTGGGATTCGGTGTTGCCCTCTCCGCCCTTAGCGCCATGGCAGCAGTGGGAGGGGCGCTTGCCTGGCGCACTTGGCCAAGGCAGGACCCTGATGTGATTGCCCACCATCATGACGATCTCTCTACCGACCACCCTCATTGGAACGAGTACGCGCTTGGTGGCGGAGGTCGGACTCATGAGCACCGTTTTGTCATCGATGAACTGCATCAGCGATGGCCCCACTAAGCCGGCCACTACGCGATGGTACTGAGTGATTTTCTTCGAGTAGCAGATAGCCACCTGATTGTTGGTCGGGGACTCTTTCACATGCTCAGCCACATGTGGCTCCACATCGAGTTCCGGCTATTTGCCGTCGGAATTGGCGTGCTACTGGTGTTGCTGTTGGTACTGATTTTTATGAGCTGGGAAGAGGAGCGGTGGATTCGTGCTGTAAGGATTTTTGATTTTTTATTTCGAAAACGGAAATAAAAAAGCTCATTAATGGGGTGGTATTCGCATGCCAAAGATGGAGCTTCCACAGCAGGTGGCAAAACTAGATACAACGCCGACATCCGTAGTTGGTCAGTCCCGTATGCAGACGGCCGACTTCTCGAAAATCCTCAATCAGGCGCTTTCCAGGAGCAACACGCCAGCTGACGTAGGCGTAACCGTGCATAGAGACGGAAACAAGAAACCGGGGGACTTCCAGCAAAAAATGCTGGGAATACGGGCCTATAGGCAACAGCTCATTGCTTCAAATATCGCGAATTCCGATACGCCAGGATATAGGGCTATGGATATCGATGTCGAAGATGCTGCCAAGCAGAACCAAATGGGGCTGTTGCCATTAGCAAAATCATCTCCCAGCCATATCAACGGGAGTGCTCATTGGTCCTCTCCGCCGTTCAACCTGAAATACCGCACACCATTTCAGGCCAGTGCAGACGCAAATACCGTAGAAATGGACATTGAGCGCCAGCATTTTGCTGAGAATGCTGTGATGTACCAATTCACCCTGGATCAGGTTGGTGGCGATTTCAAAGAGCTGACTGAGTTGTTTCGAAACCTAAAATAGTTCGTTCAAGCTAGCCTCATGCCAGCCAGGCTGCAAATCCAATGTAGCAACAAATTTTATTTGTAGAGTTATCAACAGCATTGTAGAGTACACTCATGACTACATGGATCGCACTCATTACCAGCCTTCCCACCGAGAATGCCACGGCCCGCATGCGTGCCTGGCGTAGCCTCAAGGCATCGGGTGCCGCCGTCCTCCGGGATGGGGTCTATCTGATGCCGGAGCGGGAGGATTGCCGGAACACACTTGATGCCGTAGCCGCAGATGTTCGTGCTGCAGAAGGTACAGCCCTGGTCGTCCGCCTCGAGGAGCCCAGCGATGGCAACTTTGTGGTCTTCTTCGACCGCAGCGCCGACTTTGCTACTCTGCTGGGGGAGATTGCCACGGCCCGAGACACGCTCGGTCCGGACACGGTAAACGAAGCTCTGAAGCAAGCCCGCAAGCTGCGCAAGGCGTTCTCCAACCTGGTAGCCATCGATTTTTTCCCTGGAGAAGCGCAAAAGCAGGCCGATGAGGCCTTACGTGACCTTGAGCAACGAGCAGCCTGGGCTCTTTCCCCCGATGAGCCGCACCCGGTCAACGACGCTATCTCCCGCTTAAGCATTCAGGACTATCAAAAACGTCGTTGGGCAACGCGACGGCGCCCCTGGGTGGACCGGCTGGCCAGTGCCTGGCTGATTCGCCGCTACATCGATCCCCAGGCCGAACTGCTCTGGCTGGCAACGCCGGCAGATTGTCCGGCCGAGGCTCTTGGTTTTGATTTCGATGGGGCGACGTTCACCCATGTCGGCGCCCGGGTGACCTTCGAAGTGCTGCTTGCCAGCTTCGGCCTGGAAACTCCGGCTCTGCAGCGCATCGGTACCTTGGTCCATTTCCTGGATGTGGGTGGCGTACAACCGCTAGAGGCGGTGGGCATCGAAAGCACCCTGGCCGGCCTACGCGACACCATTCTCGATGATGACCAACTCCTGGCATTGGCCGGCAGTATCTTTGACGGACTACTGGCCTCCTTTGAGAAAGGATCGAAATCATGAGTACGATCCTGACAGCCGCCGATTCCACCTCGCCAGAGTCCAAACCTGCCGAAGTCAGCTTCTGGCAGGCCTTCCTGTTCTGGCTGAAGCTCGGCTTCATCAGTTTTGGCGGGCCTGCCGGGCAGATCGCCATCATGCATCAGGAGCTGGTCGAGCGCCGGCGCTGGATTTCTGAACGCCGCTTTCTGCACGCCCTCAATTACTGCATGGTGCTCCCCGGTCCGGAGGCCCAGCAGTTGGCTACCTATATCGGTTGGTTGATGCACCGCACCTGGGGTGGCATCGTCGCCGGTGGGCTATTCGTGCTGCCGTCGCTGTTCATCCTGATTGGGCTGTCGTGGATCTATATCGCGTTCGGCAATGTGCCCCTGGTGGCCGGCCTGTTCTACGGCATCAAACCGGCGGTTACCGCCATTGTCGTCCAGGCGGCCCACCGCATCGGCTCCAGGGCCCTGAAGAACAATGCCCTCTGGGCCATCGCTGCAGCATCCTTTGTGGCCATATTTGCACTCAACGTGCCGTTCCCAGCCATCGTCGCGGCGGCTGCAGCCATCGGCTACTTTGGCGGCCGTGTCGCGCCGGACAAATTCAAGGCTGGTGGCGGCCACGGCAAAGCGGATAAGTCCTTCGGCCGAGCCCTGATCGACGACGATACGCCGACGCCGGTACATGCCCGGTTCTCCTGGGGCCAGTTGGCGAAAGTCGCGCTTATCGGTGGCTTGCTGTGGCTGGTCCCGATGGGGCTGCTGACCGCCAGCTACGGATGGAGTCATACCCTGACCCAGATGGGCTGGTTCTTCACCAAGGCCGCATTGCTGACCTTTGGTGGTGCTTACGCCGTACTGCCCTATGTTTACCAGGGGGCCGTCGGGAGCTATGGCTGGCTCACCGGTCCCCAGATGATTGATGGTCTGGCCCTTGGCGAAACAACACCGGGACCGCTCATCATGGTGGTGACCTTCGTCGGCTTCGTTGGCGGCTACGTGAAGGCCGTGTTCGGCCCGGATAGCCTCTTCCTGGCCGGTGCGGTGGCGGCCATGCTGGTCACCTGGTTCACCTTCCTGCCGTCCTTCGTCTTCATCCTGATGGGCGGTCCCTTCATCGAAACGACCCACAATGACCTGAAGTTCACGGCGCCGCTCACCGCCATCACGGCCGCCGTGGTCGGCGTTATCCTGAACCTGGCCCTGTTCTTCGGTTACCACGTGCTGTGGCCGAAGGGCTTCGACGGGGCGTTCGAGTGGGTATCGGCACTGATTGCCCTAGGGGCAGCCATTGCCTTGTTCCGCTTCAAGGCGAACGTCATCCATGTCATTGGTGGCTGCGCGGTCATCGGCTTCCTGGTGAAGATGTTCCTGTGAGCCTCGGCATGGGAACGGCACGGTGGGTAGGGGTCGCTGTGCTGGTGACGGCTATCAACGCTGCTGCTGCCGACAGAGGCTGGGTATTGCTCCAGGGCAAGGTGCTGGCCCAAGCCCTGTCCAACCAGGATTTCGGTGATGGCGTCCACTTTGCCTACCAATTTCTCAGTGGTGGGGAGTTGCGCGGCATGAATATGGGCAAGCCTGCCAGGGGAAATTGGCGGGTCATTGGCAATGAGCTTTGCTGGCACTGGACGAGGCCCAAAGAACCAGAGGAGTGTTACCAGGTACGCCAGCGAGGACAGGCTGTCCGTCTCTATTTGGATGGTCAGGAAGTGCTTTCCGGCAACCTGACCCCGTTACCCGCCAATCTGAAGGAGATGCCCCAATGAAATGGATCACGCGAGAAAGACCCAAGATTGATCGTATCGCCTGTCCCTGGCTGATAAGCCGGTTTGTCGACGAGAGTCCGGAATTCCTCTACGTCCCCGCAGGTGAAGTGATGCGCATTGCGGCCGAGACCGGTGCCACCCCCTACGACGTGCCCAACACCGAATTGGGCCACCATGGCGACCAGTGCAGCTTCGATGCCTTCATCGGAAAGTACAAGCTTGAGGATGCCGCACTCAACAAGCTGGCCCTCATCGTGCGCGGCGCCGACTGCGGTCAGCCACAACTGGCCAAGGAAGCGGCTGGTCTATTGGCCATTTCAAAGGGTCTGTCTCTGAATTTCAGCGACGATCACGAGATGCTGGCCCACGGCATGGTCATCTACGATGCGCTCTACGCCTGGTGCGCCGATACCCCGCTGAAGAAAATAGGCCGGTTTCTGGGGTTGAAGTGACTGGCCGCTGGTTGCTGCCTGAAGGTGTAGACCGGGTAGTGCTGCCGCTCCTGGTGGGCAAGGCCCTGCGGGCATTTGCCGATGGCTATGTGGCAGTTCTCCTTCCAGCCTATCTGCTGGCGCTCGGTTTCGGCACCCTGGATGTCGGCATCCTGAGTACAACGACCTTGCTGGGTTCGGCATTCGCCACCCTGGCGGTGGGGGCCTGGGGCCATCGCTTCCATCACCGGAACCTGCTGCTGGGCGCCGCGCTGCTGATGCTGGGGACCGGGCTCTCCTTTGCCTCTTTGTCAGCATTCCTGCCCCTGCTCCTGGTCGCTTTCGTCGGCACCCTCAATCCGAGTTCTGGGGATGTCAGCGTGTTTCTGCCCCTCGAACATGCCCGGTTGGCCGAATCGGGCCAGGGTACTGCCCGCACCACCTTGTTCGCCCGCTACTCCCTGCTTGGAGCCCTGTTTGCTGCGTTGGGGGCGCTGGCCTCAGGCATTCCCCAGCTACTGGTATCGGTGCTGGGGATCGAGCTGCTATCAGGGTTCCGGGTGATGTTCGTGCTCTACGGGCTGGTGGGTGGCACGGTATGGCTGCTGTATCGACGGATGCCGGCACCCCGGCGGGAGTGCGCGGTGGCCGCTCCGCAGGCGCTCGGCGAGTCGAAGGGCGTTGTTGTCCGGCTGGCGCTGCTGTTCTCCCTGGACTCCTTTGCGGGAGGGCTGGCCATCAATGCCTTGATGGCCCTTTGGTTCTTTCAGCGTTTTGAGTTGTCGCTGGCTGCCGCGGGGAGCTTCTTCTTCTGGGCTGGGCTGTTGTCCGCTGTGTCCCAGCTAATCGCACCGAAGGTCGCCGAGCGCATTGGCCTGGTGAATACAATGGTATTCACCCACATTCCCGCCAGCATCTGTCTCATCGCCGCGGCATTTGCCCCGGGTCTCGAGCTGGCATTCGCATTGTTGTTTATCCGGGCGTTGCTGTCTCAGATGGACGTACCGGTTCGAAGCGCTTTCGTAATGGCGGTGGTGACACCGGCCGAGCGTGCAGCTGCGGCAAGTTTTACTGCGGTCCCACGCAGTTTGGCTTCTGCCATCAGCCCAACGATTGGTGGGGCAATGTTTGCAGCGGGATGGCTTGCAGCGCCTCTGGTTGCCTGTGGAGCGTTGAAAATTTGCTATGACTTAATGCTTTGGAAAGCATTTCGACAACGAGACCCATAACGAAGTGGATTTGTCTCTCGGGCCTCAGAAGGAGCCGAAGTCGGCAATCAAAATTGGGCAGATGCAAGGGCCCTTATCGGAATTTCAACAATAGGCTTCATTGGACGGGCAACAGAATAACAAGGCTTTCTAGCTCCAAAGACTCTCGGTTGAACCATGTCACCTTTAGGTCACCGTTGTCTTGCAGTTCCAAGAAGAGGCCGGTTGCGACACGCGGACCATCAAGTTTTTCGTTCCGGATTGCTTCAATCAACCTTGGCATATGGGAAACGAAGAGCGCAATTTGATAAGCAAGTAGGTCGGCGTCGTCAACGGAAAGCGTAAGTCGAACACCGAAATAGTCGTTCGGAATGGGGAGAGCGACCACGATGTCGGTTCGCTCTGGAAGTTTCTTCTTTAGGCGAACAACCTTCTCCGCGAACAACTTCACCTTGTTTGCCGCGCCTACCACGAGCGCCGTCGCCTTTGCGCGGTTCTTCCACGATTCCTTTGCCGCTTCCTTCACGATCTCCGCCATATACAAAGCCGCGGCAGCCGAGAAAAGGCTAATCCACCAACTTGAATCGGCAATCAGGTGAATCCACGACGGTGGCTCAGCAGATAGGAGAGCAACGCGTCCATCTTCGATCTCAAATTCGAATTCTGGACCAACGTCTTCGCCAAATTCCTTGAGAGGTTGAATCGGTACATCTGCAGTGGAGAGCGCTCTCATAGCCTATGAAACCTCGCGTCGAAATTGAGCGGCCTGCGCATCTTTTCGCGCAGGTCCGCTCGAATGTAGTGGTAGGGGGCGTATTGGTATTGATGACACGAAGCGCCCTCAACGACTGAAGTGCCGAAGCATGAGATCGCCGTACCCCCAGACAAATGCACCCCAGATAGCAACGAACGTAATGATCCAGCTAACGGTACCGTGCTGTTTGTTGAACAGGCGGAAGAGTGCCCACGACTCGGAGAATGTTCCACCACGGATGGATTCGAAGAAGTTGTTGATTCTGAGTTGGGCAAAGACGCAGAAGACCGAAGTGATTGCACCGCTGCGCTGAAACCACGAAGCAAGAGGTTCAGATTCCGGCTTTAGCACCGCGGCAGCCGCCAACACCTCCGCCAAAACCGCGACAAAACACAACGACAGGATGAACAGCAATTCCAGTCTAAGGCGGGTCTCAACGGCGATGGTCACTTGTGCCCCTAACGTTGAAGGCAAGGGGCGGCCCACTTGCGGGACGTCCCAGCAGCCGAAGGCTGCGCCTTGAACGTGGTGTTAGGCTCCACCACTCAGCACCTCTGCGATATCTTGACGTGCGACAGCGATGAATGCCTCTTTCAGTTTGAAGAAGTCAGCAACTTCCTTCGCAGGCTGCGCACTATGACTGGTAATGACGTAGTCGGCCAATTTCTTCGCCGCCTCCACGACCATACCGGATGCGACAAGTGAAACCTCCGCGAACTTTCCGTTCAGCATGTTTAGATCGGCAAGGGATGAAATTTTCTCTTCTCTCGCCTGAACTACGAGTCGTTGCGTTTCAACTAGAAACTCCGCGTAAAGTTTGTGGCGAATAGTGCTTTGTTCTTTCGCAAGCGCAAGTTTCCATTCGTGGTTCTTTACACTGCGGCTCGCCAAGTAGTTTATGAACCCACCAATCAACACCCCCGAGAGGGTACCGAGTACGGCGATAGTTTCAGACGGCATAGTGGAGCCTAACAGTTAATAGACGGACCCATGGGTCCGTCTATGCGTAATCGTACGGACATGAGAAAAATGCCGGGAACCTTGAACTAGCGACGATTCTCGCGCCTTGTGGCCGAAATATGATTTACGAAAAAGGAGATGGAGTCGCAAGTATCTATCCATCTTGCATTTGGCTCAAGGCGAGCGTCTGCTCGATGTGAAATTCACAGTATTCATGCGAATACTCACCTAACCTTCCCTTGGACTTTCGCTGATTAAAGAGACTTTTCTATTTTCTGCCCGGAGGTTGGCGATCAAGGAATCCTTCTCTTTCAACAAATCCTCCAGCTCTTGAATACGTTGCTGGAGCGCGTAATTCTCCCGGGCAATCTTCAACAGATTGGACTGCTCCAGTTCCAGAAGCTCCCGGTATTCCTTCTTTCGTTCCTCTGCCTTTGCAAGCGCGGATGTCTTGATCTTCAACTTCGACTGGGCATCAGTGTCGTTGATGCGCTGAATCTCGGACAATATCTCCCGGTGGTACCGATACAGTGTGGAACGCTCCACTCCAGCCTCTTGAGCAACGCTGGCGGCGGTCAGGCGAGACCCAACCCTTACCTGATCGGGATGTCCGCCAACAATGCGCTGTATTGCCAAGTTCAGGGCTTGTCGCGTCAGCTTGATCGAATTTTCCTTATGGGCCTGTAGCGCTTGCGGGTTCCCAGCATTATTGCTCATTGAGGTATTTCCAATTTGGCGATGACATCAAGGGCCTTATTCAGGATTCGCTGGGACCGGGTTTTACCAGGCAGCCCCAGATCCGACATGTCTAAAGCCGTTTGCTGCTGAAGAGCAATTTCCTTCCAGACCGGAAGGTGCTCAGGGCCGATCATCCCGTAGGCGCAATCGACGCACATGTCTGCCTCAAAAACACACAGGCCACCACAACTTGTCCCCTTGGCATTGCCGATGCACCAACTGTGGCCGGTCCCATTTAGGGTGATGCTTGACGAAATCTGTTGCAACAAGGACTGCTTATTCTTGGCAGTTTTGATGGTCTGGCGTAGATCGGCAAGCATGATGTCGCCGTTGGCCAGGGGAGCGTCTGTCATCAGGTAATTGTGTAGAACGCTCTCCTGCTTTTCTGTCTTGCTCCGCAATATTTCCGTGAGTAGGTCTGTATCCGTCTGGTAGGCATCGGAGGCTCCGCTGGTATATAGCAGCGTCATGTCAATGGACCAGTGACCAAAATGCTCTCGGAGATAGTGGAGATCGCCCAGCTCGGCTGAGGCGACGAAATAGGCATAGGTTCTGCGGAATTGATGGGATGAAAGCCGCCAATATTTGCCGTCGTCTCCCAGAATATTGAAATGTGGACAGAAGTTCTGGAGGCGTCGATGAATGGTGCTTTTGCTGAGCACTCCTACTAAATGACCCACACAGGGGGCGCATCCCAGGAAGAGCCCATCCCGGTCCTTTCTGGCGGTGTGCAGGCGTTTGAGTTGGCGTTTGTGGAAGGTCGAGCCCGGAATGGACATGTCGAGTTGCTGTTCAAATTGGGAAATCTGCTCCTCGATCTGAATTGCATACGGTTGTCGATACCACTCCATTACACGAACTGCGGTCTCAACAATTGGCGGGACGAGCCATTTGTGCGGCTTTATTCCAGTCTTGAAGATTGTGCCGTGAAGCCAGATGAGATCAATGCCATCGTCCGCTTTGTCATGAGCGATGCACCCCCTTTTTAGGGATAGCGTTTCGGAATCCCGGATTCCGGAGAACATGGCAATCACGATATAGCAGGAATCCCGGAGATAACTCAGTTCTGCACTGAGGTCACGAGAGCCCTCAAAACCCAATTCCCGGGCTAGTGGTATTCGGATATTGGTGGCTTGGAATCCCTCTTTGTTAGCCACCGCCTGCTCCAGGGCATCCCGAGTGCTTAAGATATGGGGAGCTAGGTTGGTGACGTAATTTATTGCGACTTCGGCTAGCTGGCGAACCGTGCGCTCTGGGATTCGTAGTGTCTTGATGGTTCGACCTTTGGCTCGTTCTCCTGACAGAGTGAAACCGGATTCATCAGGCCATGGATGTGTAGGTAAGGCATCGGCCAGCTTTTCACGTTGAAGATATAGCTCCTCCAAAACTATCAACCGCCTTGCATGCCAGCCCTTGGCTGTCGGTTTGCCTGTTCTCTGATTAATCTTGGCAATGGGGACATATTCCAGGGCTCTTCCCTCTATATCCCTGAACTGTCGCATGCCTCGGGACGCCATCCAGCGCAATAGGGGCGTTAGCGCTACCACATGGGCAATCACGGTTGCCATTCGCGGTCGTTGGCGCCCATCGATAGGGTCTATAAACAAGGACCAGGTAAAGTCCTTGGTACTTTCCAGCAGGGGGGCATGCTGGTGGTCAGTGAGAAATTCACCAGGTGCGATCTCAATGCGCCAGTTGATCCGCTTTTCACTATCCCTAGCGTTGTTTTGCGGGATGTAAGGCCAGAAGTCCCAGATTTCGTCCTCATACCGGGACACCACGACGGACTCACCCAGATCCGTCTTCGCCATAGAGACCGGAGTTCTGGCCTTTTCTGCGGATGAAAGCGCTCGAATGGGCTTCGCAGGATTAGCTAGTGTGCTCATATCAGGGGGGCGTCATTGCGCCAGGCAGGATGCGGTGTGGATTGGGCTCGCTGGCGGGCTTCCGCCACTACGGCAGAATCAAACTGCACCGCAATTTGTTCGTCGATAGTTCTTATGACTGGTCCGAATGTTTTCATCCACTGGTGGGGAGGAATCTTGGGGCGCTCAGCCAGGAGACGGAAATAGAAGCTGTATAGGCGGTAAAGGTCGTCCTCAAACACCACCATGCTGGGGCAGCGAAAGCAGGCCAGGAATTTTCCGCAGACCTGGTCCGCTTCGCGAAATGGATTCCGACAGCGGGCAATCGATGTGTTGTAGCCACCGGTGAGCAGCTCAGTTGCGTTCCGAAGAGGAATCTCTCCATCTGCTGCTAAGCGGATAGCTAAGGTTTCATCTCTGCTGGTGGCCCAGCCCACCATGGCCTGGCCAATGAAGGCATGGTTGCGAACAGCCTCGGGAGTGACTGGTGGAATGTAGCGTTGGATGGTCAGGAGAATGGAACTATGGCCAAGGGCTTGTTGTAGCTTACGTAGGTCGCGATGACGTAGGTAGTAATTCAGCGCAAATGTCGGTCTGAGGCGTGCGATGCTCAGGTTCAAGGGGGCGCCTCGGTCGTCAAATAGCTCATGTCGATTGACAAAGCTTTTGAGTGCATTGCGCACCTGAACTTCGTCGAAACGAACAACATTTCCGCGACGTCGTGAGTACGTGACGTCGGCCAAACGGCAGAGAAAAACGAATGGGCGATCTGCTTCGTCAGCGTCTGACACAAAGCGCTCTGTGTATTGCTGTAGCTGCCGGAGATAATCTCCCACCATTTTGGTAATGGGTGTTGCAGTTTCCTCCGGTGTATTTTTGGGAAGGGATATGGTGCGCGTGGTATACCCACGGCGCTTCTCCAAGACCAGTAAGTCTCGATCCGGAAGAAAGCTGCGCAGGCTATCCCGGCGCATTTCCAGGAGGGGGGTCATATTCACGCCGCAGGTCAAAACGGTGATGACGGCATGAACTGCCAAGACTTGATGAGATGAGAGGGCTTCCGGATTTGCGTTGAATAGGGCCAGATCTTCTGCGCAGGCAGCAATGATCCGCTCTTGCTCCCCTGCAGAGTAACCTTCTCGTCGCGGTATCCGCTTATTGGAGTTGGGATAGGGATTCTTGGGGAAATTCAGGTTAGGGTTGATGGACTCTGGCACATGTTGCCGGCGATTTTTGAGTATTGATTTGATACCGGCATAGGTCGCATTACAGGCGGATACAGACCACGGTAACCCCTTGTTCTTTCCGTGTTGAACGGTCTGTAGGGCCAACCATGCTACAAATTGCATGATGATTAGCCGGTCGACCTGTTCCAGAGTAGTCACTACTTGGCCGGATTGCTCCAGGTCATCCAGGAAGTGCCAGAAGCAACGAATACCGCTATTGAAATAGGTCATCAGTGATTTGCCGGCAGAGACAAAACGCAGAGACCAAATGGCATCTCGCATTTGCTCGGCCAACGGCTCACGGCCTCGGGCCCGGTGGCTTGTGAAGTCGAAGTGATACTCCGCGCCACTGTGTACACAGCGAATGGTGAACGCCCATCCTGGAGGGAGGTCGATGATCTGCTTTTCTGGGGTCAGATCGATGTGGGTATTGCGATCAACCCGCTTACGCCGGATCGCCATCAGACACAATCCTCTTGGAGCAGTTTGCAGATGTCAGCTTGGTAGCCATCCAAGGCTGTGTGATCTACCAGACTGGCTGTGTGTATATAGATTTGTGTCGTTGCCAAACTGCTGTGTCCCATGCGTTCCATGACCCAAAACAGAGCGCTTTCCTTACTCTTGAATTGACTCATCCGGATAAATTCGTAGGTGCCATACGTGTGACGCAGCATGTGTGGTGTGCATTTGATTCCTGATTTCTCTGAGGCCTTCTTGAAGGCGTTGTTTATTCCGCCCAGGCTGATTGGCTCTCCATATGCTGTCAGGAAGACCCGGTTGCTCTCAGGACCAACGTGCCGTTTACGGTAGCGCTTAGCCAGCGAAGATCTCTCATAAATTACGTAGTTCCATAACTGGACAGCTAGGTCATAAGGGACAGATACCCATCGGGGTTTTCTGCCCTTTGTTGTACGTAGCGTCATCGCGAGAGGCTTACCTGGTGGATGGCCAGATGGATTGGGAATGTCCTCCAACTCCAAGGTTGCAACCTCTTCCCTACGAAGTCCCGAAAACCACATCAAGTAGGCCATTAGGCGGTTACGTCGCGGGGTTAGCGCGGTGACGAACAGCCTGGCCTTGTCGGCTGTTAGAAATTTTGGTTGTGGCTTAAATGAACGTAACTTCAGTTCATTGGCTTGGACTTGATTTCCCTTGGCATCAATGTGCGCCAGAAATCCCTTCGGCCGGGATACCCGAACGTCCTCCATATAGAAGGGGAGGGAAGAGATTTTCTGTGAGCGCAAGGCCCATTTGTAGAAGTTGGATACAGCCCCCACACGGTGGTTGACCGTAGAGCGGCGACATCCTCGTGAAAGCATGGAGTCCCGCCAGGCTGCTAGATGAGTGCTTCTTATCCTGTCCCAGGAAATCTCTATGGTTTCCAGGAAGCTGAAGAATTCATAGAGGTGATTTCCATAGGTCTGCCAAGTCTTCGGGGACGATGTTCTTCCCCTAACTACGGCGACGTGGAAGAGATATTCATTTGGCGCTGTGACAAGCGCCATTTGGTCGTCAATGAGAAATGGGATGCCTGGGTATGGCTGTCCGTGTACCGTGAAGGTTGGGTCGGTGAGGAATAAGTGCATATCGATCCAGGCAGTAGTAATCCAGTTGTCCAGAATGGCAACTTGGAGAACCGTAACTGTAAGGACCTTTTTTTTCTACAGAGGCAACACTCTGCAGGTCTCTTAACACGTCCAGGGTCGGCATGCCTGCGGCCCCTGCCTGGAGCGGCAATAGCAATGCGGCTCCCGGCAGGGCGCCGGCCAGCAGGACCGTCCTTATTGCCGCCTGGAGGCGATGGTTTTTGCCAGGTGTGAAGATATTTTTATTTTTCATATCCGAGGGCGAGAGCAGTGGCAGGTGGTTGGATGACAGCGCAGGATAGGGCGGGCGGTTCAGGCCGGCAGGGGCCGGCGGGCGAGGCGCAGGGCGAGCAGGAAGCTGGCGGCGATGATGCATACCAGGGCGAAGCCGAAGGCCAGTTCCTTGCCATCGCTCCAGGCTTCCACCGCGGGGGAGAGCATCTTGGCGGCGCCGAAGGCGGCCACCAGCAGGCTGACGGCGGAGACCACCAGGCCCATGACCCGGGAGGCCACCAGGGCCACCGTGTCGGCCCGGGCGATGAGGCGGGAGATCCACAGGCCGTTGATGCCGTCGGTGATGAGCATGCCCAGCATGAACAGCAGGGAAAGGCCCAGGGCGTGTTCCCAGCCGCCGTGGCGGGTGGCGGTGAAGGCGAAGAGGGCGGCCTGGGACAGGGTGTCGAAGGAGAGGGCGAAGAGGGCGCCGACGGCGGCCACCAGCAGGGGGTGGCCGGCTTCGGTGAGGCGGCCGAGGAAACGGCCCTTGATGCCGGCGGGGCGCACCATCTCGCCCGGAGCGGCGGCCAGCACGGCTCGCAGGTTCACGGCGCCGAGCAGGGTGAGGAAGGCGATGGAGATGAGGCCGCCCACCCATTCGAACCATTCGGGCACCTGCCACACGTCCGCCAGGGTGCCGACCGCCAGGGCGATGGCGATGACGATGGCGCCGTGGCCGAGGGAAAAGAGGGTGCCGCAGTAGCGCGCCAGGCCCGGGTTGCGCCGGCTGTTGAAGCGGGTGAGGCCGTCGATGGTGGCCAGGTGGTCGGCGTCGAAGCCGTGCTTCATGCCCAGGATCAACACCAGGGAAGCGAGGGCGAGCCAGTTGTCGGGCAGGGTATCCACGGGCCGTGAGGGGGCCGCCGGCGGGCGGCAGAAGGTTAGAAGATGTGAAGAATAAACTTTCCGTCACACGATTGGGGTTGACCTGGATCAGAAAGCGGCAGGGAAATCCGGGGCGACGGGCTCGATCCGCTCCACGGCCGGCCGGGCTGGGGCCGTGGGCAGGTGTGCCGGCAGCAGGGTGCCGCCGGGGGGCTTCAGGCGGCGGGGAGGTCCCGGGGCAGGGTCAGGCGGAACACGGCGCCGCCTTCGGGATGGTTGCTGGCGACGATGCGGCCGCCGTGGCGTTCGACGATGCCGTAGGAGATGGAAAGGCCCAGGCCCGTGCCTTCCCCCACCGGCTTGGTGGTGAAGAAGGGGTCGAAGAGGTGGCCCAGGGCTTCGTCGGGAATGCCGGGGCCGTTGTCGTGGAAGGTCAGGCGCAGTTCCCGGTCCCCCAGTTCGCCGCTGATCAGCAGCTCTCCCCCGGGCTGGGCGGCGGTGGCCTGCAGGGCGTTCTGCACCAGGTTCATCACCACCTGCTGCAGCTGCCCCGGGGAGCCCCGCAGGGGCAGTTCCGCCGGCAGCTCGGTGCGCACGGTGAAGCTGGGCGGGGCGCTCTTGGCCACCCAGCGCACCGCCCGCTGCACCACCTCGGCCAGGTTGAAACGTTCCGCCGCCTCCCGGTCCAGGGCGGAGAAGCGCTTCAGGCCGTCCACGATGTCCCGGGTGCGCTCGGCCCCTTCGATCATGCCGTCGATGAGGGGGGACATGTCTTCCAGGATGCGGTCGATGCGCAGTTCGCTGCGCAGCTCCTCCAGTTCCGGGATGCAGTCGCAGCCCCGGTTGTGTACCGCTTCCAGATAGGTTTCCAGGCGCCCGGCGTAGCGCTTTAGGGCCAGCACATTGCCCAGCACGAAGCTGATGGGGTTGTTCAGTTCGTGGGCCACCCCGGCCACCAGGCGGCCGAGGGAGGCCATCTTCTCCGAGTGCAGCAGCTGCTGCTGGGTGCGCTTCAAGTCCTCGTGGGTCTGCCGCAGTTCCCGGTAGGCCCGGCGCAGTTCCCCCACCGGGCGGCCGGTGACCACCATGCCGATCAGCTTGCCGGTGCCGGAGAGGCGGGGCGTCCAGTTGAAGGAAACGGGTACGGCGCCGCCGTCCTCGGCCTGCAGGGGCAGCTCCACGTCCTGGGCGCCGTCGTGGCTCTGGCTGGCGAAGAAGCTGCGGGCCTTGTCCCGGGCCTTGTCGTCGGCGAAGAGGTCGAAGATGGAAGTGCCCTTGAGAGTCGGCTCGTCCCGTCCGGTGTAGCGCTGGAAGGCCGGGTTCACCTCCTCGATGGCGCCGTGGCGGTCGCACACCACCAGGATGTCGGACATGGAGGCGAGGATGCTGGCGATGAAGCGGTGGGACTCCTCCAGGGCGGCGTTGTTCTGCTCCAGGGCCACTTCGTACTGCAGCAGGTCGTTGTAGACCTCGTCCATCTTCTGGATCACCTCGATCCACACCTTCTCGTTCACCCCCTCCAGCAGCTGGGCCGGCTCCGGCAGCAGGCTGCCGTCGAGCAGGGGGCGGGCGGGACGCTTGGCGGTCATGGCGGCGGGACTAGCGGCTGGGCTTCAGGTGCAGGTGGCGATAGCCGTGGCCGTGGGCATGGCTGTGCTTGTGGCTGTGCTGCTTCTGGCTCTCGTCCAGTTCCACGGTGACCAGGTTGAGCTGGCCATGGCGCACGCCGCGCTCCGCCATGAGGGCCTGGGCGAAGGCCCGCACGTCCTCCGTCGGCCCCTTGAGGATGGTGCTCTCGATGCAGTGCTCGTGGTCCAGGTGGGCGTGCAGGGTGGACACGGTGAGGTCGTGGTGGTCGTGCTGGATGGAAGTGAGGCGCTCAGCCAGTTCCCGCTCGTGGTGGTTGTAGACGTAGGAGAGGTTGGCGACGCAGTGGTCCGATTCCTTGCGCGCCTGGCGCCAGGTTTCCAGCTGGGAGCGGAGGATGTCGCGCACCGCCTCTGAGCGGTTGCTGTAGCCCCGGGCGGCGATGAGGGCGTCGAATTCCCGGGCCAGGTCCTCGTCCAGGGAAATGGTGATGCGTTCCATGCGTTCTCCCTCGCCGGCGGGCGGAAGGCCGCGGAATGCGGCAGGCCGGCGCAATGCCCGCCAGTCTACCAAAAGGCACCGGGCTTCACGGCGGCGCCGGGGTCTTTGCTGCAAAAGAAAAGGCCCGCCGGAGCGGGCCTGGGGATGCGGTGCGCCGCCATGGGCAGCGCCTCGATGCTCAGCCGCGTTGCTGCAGGGCGGCGATGCGCTCCTCGATGGGCGGGTGGCTGGAGAAGAGGCTCATCCAGCCCTTGCCGCCGGCGATGCCGGAGGCCGCCATATTGGCCGGCAGGGGCTCGGGGGACAAGCCGCCGAGACGCTGCAGGGCCGCCATCATGGGGCGCGGGTTGTTGCCCATGAGCCGGGCGGCGCCGGCATCGGCCCGGAATTCCCGCTGGCGGGAGAAGTACATGACGATCATGGAGGCGAGGATGCCGAAGACGATGTCGCACACCACCACCGTCACCATGTAGCCGAGCCCGGGGCCGGAGGATTCCTCGTTGTCCTTGCGCAGGAAGCTGTCCACCAGATAGCCGACCACCCGGGCCAGGAAGAAGACGAAGGTGTTCACCACGCCCTGGATCAGGGTCAGGGTCACCATGTCGCCGTTGGCCACGTGGGCCACCTCGTGGGCCAGCACCGCCTCCACTTCCTCCCTGCTCATGGACTGCAGCAGGCCGGTGGACACGGCCACCAGGGAGTTGTTGCGGCTGGGGCCGGTGGCGAAAGCGTTGGCCTCGCCTTCGTAGATGGCCACTTCCGGCATGGGCAGGCCGGCGTTCTTGGCCAGGCGGGCGACAGTGTCCACCAGCCAGGCTTCGGTGGGGTTGCGCGGCTGCTCGATGACCTGGGCGCCGGTGCTCCACTTGGCCATGGGCTTGGACAGCCACAGGGAAATGAAGGAGCCGCCGAAGCCCATCACCGCGGCGAACCCCAGCAGCATGGGCAGGTTGAGGCCGTTGGCGGTGAGGAAGCGGTTGAGGCCCAGCAGATTGATGACCAGGCCCAGGACCACCATGATGGCCAGGTTGGTGGCAAGGAAGATCAGAACGCGTTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_AP021844|2967016:3023959|2990895_2991612_+|WP_130459832.1|DBSCAN-SWA MRPSQRAADQLRQVRITRRFTRHAEGSVLVEMGDTKVLCTASIEENLPPFLRGKGQGWVTAEYGMLPRSTHTRSSREAAKGKQTGRTQEIQRLIGRSLRAVTDLKALGERQITLDCDVLQADGGTRCASITGAWVALWDACQSLVAAGKLSENPLKEHVAAISVGIYKGTPVLDLDYPEDSDCDTDMNVIMTGSGGLVEVQGTAEGEPFSRQQMNVLLDLAEAGIRQLIHAQETALAD >NZ_AP021844|2967016:3023959|3001050_3004209_+|WP_014236233.1|DBSCAN-SWA MLERMIRAAIAHRWLVLILVLGTSALGVWSYGRLPIDAVPDITNVQVQVNSEAPGYSPLEAEQRVTFPVETALAGMARLKYTRSISRYGLSQVTVVFEDGTDIYFARQQVSERLQQASSQLPAGVKPTLGPVATGLGEIFMYTVEATPGATKADGKPWMPTDLRTLQDWVIRPQLRNLKGVTEVNTIGGNVQQFHVTPDPAKMVAYKLTIDDLLQAIERNNANTGAGYIERGGEQNLIRIPGQVGDEAGLREIVVAMRDGLPLRISDIATVQIGSELRTGAATRDGREVVLGTVFMLIGENSREVAMRAATRLKEIDASLPEGVSARAVYDRTQLVDRSIATVQKNLLEGALLVIVVLFLLLGNIRAALITAAVIPVAMLMTITGMVQNRVSANLMSLGALDFGLIVDGAVIIVENCLRRFGERQHALGRLLSIEERFQLAAKASAEVIKPSLFGLFIIAAVYLPIFALSGVEGKTFHPMAITVVMALVAAMVLSLTFVPAAIAQFVTGKVEEKETRLMQRLHGIYAPLLEKSLSLQKPVIGAAAVLVVLCGLLATRLGTEFIPNLDEGDIALHALRIPGTSLTQAIGMQAQLEARIKQFPEVDKVVGKLGTAEVATDPMPPSVADTFILLKERKDWPDPRKSKATLVAELEEAVRAIPGNNYEFTQPVQMRMNELIAGVRAEVAIKVFGDDLQALTAVGKQIEKVAGSISGSADVKLEQVTGLPLLVIKPDRAALARYGLAVADIQDTVSAAMGGATAGQLFEGDRRFDIVVRLPDAQRQDPKALAALPIALPATSRADGASLSRMPGVVPLSAVATIAVELGPNQVSRENGKRRVVITSNVRGRDLGSFVEELRGKVAAEVVLPVGSWVEYGGTFEQLISAGQRLSVVVPVVLVMIFGLLFMAFGSAKDAAIVFSGVPLALTGGVLALWLRGIPFSISAGVGFIALSGVAVLNGLVMITFIRKLRELGQPLHTAVTEGALTRLRPVLMTALVASLGFVPMALNVGTGAEVQRPLATVVIGGIISSTLLTLLVLPVLYRLIHRNENEETAA >NZ_AP021844|2967016:3023959|2994600_2996535_-|WP_152090900.1|DBSCAN-SWA MGVADAASVGTVTAKQLENRPLLRPAEVLETVPGLIVTQHAGDGKANQYFLRGFNLDHGTDFSVTIDGMPINLPTHAHGHGYLDLNFLIPELVERIQYKKGPYAAEDGDFSSAGSARIDYRRALPEDYVSIGLGSNGYRRLLTAADKETAGGGRWLGAVEVFHNDGPWEVPEHYKRLNGVLRYSEGTRNNGHSLAFMAYDGDWTSTDQLARRAVDQGLVNRYGSLDPTAGGNTRRLSLSGQWARQDGAVQTRANTYLVDYRLNLFSNFTYAMDDPVNGDQFEQADRRRYGGFGWSRSQPVQWLGKEGDFTWGVQGRQDDIDNVGLYRTAARQRLSTVRSDSVNQGSLGLYGQWGAQWSDWLRSVAGLRHDRYRFKVDSSLAANSGKENDGITSPKLSLIFGPFANQEFYYNWGQGFHSNDARGTTIRVDPSNPGDPMSRVPALVKSRGQEVGWRSAPAPGWNTSVALWRLDLDSELLFVGDAGTTQASRPSHRQGMEWSNYWTPRDWLTLDADIALSKARFRDDSSVGNHVPGAVERTASVGVAVHDLGPWRGGLRLRYLGPRALKEDDSVRSGSSVMVNLNVGYKLAAKSQLTLEVLNLFNRKASDIDYYYESQLRGEAAKVEDVHSHPAEPRIWRLTYTQGF >NZ_AP021844|2967016:3023959|2972797_2973718_-|WP_152090378.1|DBSCAN-SWA MSFLSPRFPAALAAKGRLGANRWLLLRRLSQFGILGLFLLGPLAGLWLVKGNLSYSLTLDTLPLADPLLVLQVLFSGHRPEGLALLGAAIVLAFYLLVGGRVYCSWVCPMNLVTDLAGWLRERLGLKGSAHISRRSRYWILGLTLLLPLAGAGLAWELINPVSMLHRGLIFGLGAAWTVVLAIFLLDLLIMSRGWCGHLCPVGAFYSLLGRTSLLRVSARRRQDCDDCMDCFAACPEPQVIRPALKGEANGTGPVILASACTNCGRCIDVCAKDVFVFGSRFNQHTQRCAPAGEAEGQTDHRKTIH >NZ_AP021844|2967016:3023959|2996760_2997930_+|WP_014236230.1|integrase|DBSCAN-SWA MAHDTNPTLREALFMYCERISIHKKGHAQEKYRINLYCRYSIADLPIRNITSVDVATFRDERLAEINARTGRALSPATVRLDLALLSDLFRIAKNEWGICNDNPVANVRKPKLPPGRDRRLAPREERMIMRHCSQRGAHEMKAIVQLALETAMRQGEILGVCWEHINLKSRIVHLPDTKNGSKRDIPLSMEARDILAAQRVKLSGRVFSYTNNGLKSSWRSMIKRLNIPDLHFHDLRHEAISRLMERGVFNLMEVAAISGHKSLSMLKRYTHLRAQRLVRKLDAGANKGKAAVLSYLVPYPAFIEPYESQVKVTFPDFDDLHVAGPCLNSAVQQAQDALLREILVLMRQGRPIPPPNNYLELLDESRLFHLDPLATYDSLADLAEGALV >NZ_AP021844|2967016:3023959|3010546_3010942_+|WP_014236241.1|DBSCAN-SWA MSLGMGTARWVGVAVLVTAINAAAADRGWVLLQGKVLAQALSNQDFGDGVHFAYQFLSGGELRGMNMGKPARGNWRVIGNELCWHWTRPKEPEECYQVRQRGQAVRLYLDGQEVLSGNLTPLPANLKEMPQ >NZ_AP021844|2967016:3023959|2989012_2989978_+|WP_152090386.1|DBSCAN-SWA MASQANHPLPAGFQLEDYRIEKQISVGGFSIVYLAHDASGKAVAIKEYLPASLALRSEGQTKPVISQEHLSAFRYGMKCFFEEGRALAKLNHPNVIQVLNFFRANDTVYMVMEYERGRTLQEFIQKHHGHIHEKFIRGVFTRMLNGLREVHTHKLLHLDLKPSNIYLRADNTPVLIDFGAARQTLHSDTPMLKPMYTPGFASPEHYFKRDELGPWSDIYSVGASMYSCLAGAAPQAADARMEKDQLQPASVRWEGQYSDQLLETIDWCLCLNHLYRPQSVFALQKALTEAVDMPGQGASKAAEKEGWLGHLVGKIKGMTAK >NZ_AP021844|2967016:3023959|3020022_3020832_-|WP_152090389.1|DBSCAN-SWA MDTLPDNWLALASLVLILGMKHGFDADHLATIDGLTRFNSRRNPGLARYCGTLFSLGHGAIVIAIALAVGTLADVWQVPEWFEWVGGLISIAFLTLLGAVNLRAVLAAAPGEMVRPAGIKGRFLGRLTEAGHPLLVAAVGALFALSFDTLSQAALFAFTATRHGGWEHALGLSLLFMLGMLITDGINGLWISRLIARADTVALVASRVMGLVVSAVSLLVAAFGAAKMLSPAVEAWSDGKELAFGFALVCIIAASFLLALRLARRPLPA >NZ_AP021844|2967016:3023959|3013866_3014295_-|WP_043797807.1|DBSCAN-SWA MPSETIAVLGTLSGVLIGGFINYLASRSVKNHEWKLALAKEQSTIRHKLYAEFLVETQRLVVQAREEKISSLADLNMLNGKFAEVSLVASGMVVEAAKKLADYVITSHSAQPAKEVADFFKLKEAFIAVARQDIAEVLSGGA >NZ_AP021844|2967016:3023959|3008250_3009189_+|WP_014236239.1|DBSCAN-SWA MTTWIALITSLPTENATARMRAWRSLKASGAAVLRDGVYLMPEREDCRNTLDAVAADVRAAEGTALVVRLEEPSDGNFVVFFDRSADFATLLGEIATARDTLGPDTVNEALKQARKLRKAFSNLVAIDFFPGEAQKQADEALRDLEQRAAWALSPDEPHPVNDAISRLSIQDYQKRRWATRRRPWVDRLASAWLIRRYIDPQAELLWLATPADCPAEALGFDFDGATFTHVGARVTFEVLLASFGLETPALQRIGTLVHFLDVGGVQPLEAVGIESTLAGLRDTILDDDQLLALAGSIFDGLLASFEKGSKS >NZ_AP021844|2967016:3023959|2973714_2974695_-|WP_152090379.1|DBSCAN-SWA MSDLSNSPPAKSDKAAAARRQFFADAGRMACGVGLLGLGLGFHAKQARALPPAALRPPGAGAEEDFLGACIRCGLCVRDCPYGTLSLARPEQPVSTGTPYFVARQVPCEMCEDIPCVKACPTGALDHGLTDINQARMGLAVLLDQETCLNFLGLRCDVCYRVCPVIDKAITLELRPNTRTGRHSMFIPAVHSEHCTGCGKCERSCVLETAAIKVLPVPLAKGELGQHYRVGWEEEQKAGHSLVDDKGLGDLPDRMPEGARLEGHFDPASQGGPSLVPGKPATPGSGVDSLAPSIPGADAHGPGVPAIPQNIPGGGLPNRLSDEAAR >NZ_AP021844|2967016:3023959|2974784_2977310_-|WP_152090380.1|DBSCAN-SWA MNLTRRDFIKSSAVAAAANAAGMAVPGVSEALAQQPKNDGIRWDKGVCRFCGTGCGVLVGTKDGRVVATQGDPEAPVNRGLNCIKGYFLSKIMYGKDRLTQPLLRMKNGQYDKNGDFTPISWDQAYDIMAEKCKAALKAGGPRNIAMFGSGQWTIWEGYAAAKLWKAGFRSNNLDPNARHCMASAVAGFMRTFGIDEPMGCYDDAEHADVFALWGSNMAEMHPILWSRITDRRLNAKHVKIHVLSTFTHRSCELADNELIFKPQSDLAILNYIANYIIQNGAVNQDFVKNHVKFKKGVTDIGYGLRPNHPLEQAAGNNGYPGPDGKPKGDPNKATDISFDEFKAFVAEYTLDKTHEISGVPKENLEALAKAYADPKVKVVSYWTMGFNQHTRGTWVNNMIYNVHLLVGKISEPGNGPFSLTGQPSACGTAREVGTFAHRLPADMVVVNPKHREITEKLWKLPAGTIPDWVGLHAVAQSRALKDGKVAFFWSTTTNNMQAGPNINGEVYPGWRNPAAFVVHSDVYPTVSALAADLILPSAMWMEKEGAYGNAERRTQFWRQQVKPQGQARSDVLQYVEFSKRFKMEEVWPAELLDKAPEYKGKTLYDVLYANGEVNKFPVSDQLKGFENEEGKVLGFYLQKGLFEEYAAFGRGHGHDLAAFDTYHKARGLRWPVVDNKETLWRFREGYDPYVKAGEKVRFYGFPDGKAVVFALPYQPAAEQPDAEYDLWLCTGRVLEHWHTGSMTRRVPELYKAMPDAWIYMHPEDAKKRGLQRGDTVKVQSRRGEISTRVETRGRNKPPLGLVFVPFFDEHRLVNKLTLDATCPISKETDFKKCACKVVKA >NZ_AP021844|2967016:3023959|2988008_2988875_-|WP_152090385.1|DBSCAN-SWA MIYSMTGYAAKTREVAGGSLHLELRSVNSRFLDIHFRIVDDLRVLEPALREAITAKLARGKVELRLNLVASQSQNRQLAINADLLTQLQALEGQVRQTLPNAAALSVAEVLRWPGMLGEPEVDTAALHAAVQATLKEALEDFTASRAREGAKLAAMIQERVDKIRATVAAVAPLIPQAQAAYQDKLKQRLVEALGSADDERVRQEVVLYATRIDVDEELSRLQAHLTEVERILKAGGNAGKRLDFLMQELNREANTLGSKSVLSEVSKASMDLKLLIEQMREQIQNIE >NZ_AP021844|2967016:3023959|2968030_2968231_-|WP_014235630.1|DBSCAN-SWA MNQMPLVPPGRDTLPRIERLLRRELALFFTLCALLGAFLAYGALRTLASPSAPPHPVLQARADVRP >NZ_AP021844|2967016:3023959|2969562_2971437_+|WP_152090377.1|DBSCAN-SWA MFHSPSAPRDRPALLWQALLFFLLLLPAAVRAHPLVLDQDDGSFALVPHVEVLEDPGGKLDLAAVRQAAAAGRFAPAHALGELNFGYSSSAFWLRIPLESRLQRSSPWLLEIAFPSLDRVELFLPRADGRVDYQLTGDRLPFAERPYPNRNLVLPLELAPGESLALYLRVESEGSLTLPLTLWTPDAFRLHNQDAYAGFSLYYGMLLALGLYNLLLFFALRERIYLVYVAFAVSMAVGQLSLNGLGNEYIWPAFPAWGNVALPSGFAATGFFGAIFTRLFLNTRHSNPRADKLILALAAGFAVAALGPALLPYRWAAILTSLLGAAFSAVAVAVGVHAQLRRHPGARYFLLAWSLLLVGVGMMALRNLGWLPTTLFTSYGMQIGSALEMLLLSFALADRIQAERLARELAQGEALHSKQDLVNALRSNEQLLEARVAERTRDLAAANDRLLANEQQLQRMARHDPLTGLANRLLLDDRISHGLAVGRRNGTRLALLLIDLDGFKPINDKHGHAVGDQLLVVLADRLQRSVRAVDTVARLGGDEFVLVLEDLAAVEDGRQVAAKVVAEMSRPVVLEGRELLVSASAGLAFYPEDGEDAQTLLRRADEAMYEAKRAGRNTFRQVGQ >NZ_AP021844|2967016:3023959|3018585_3019710_-|WP_014236248.1|integrase|DBSCAN-SWA MHLFLTDPTFTVHGQPYPGIPFLIDDQMALVTAPNEYLFHVAVVRGRTSSPKTWQTYGNHLYEFFSFLETIEISWDRIRSTHLAAWRDSMLSRGCRRSTVNHRVGAVSNFYKWALRSQKISSLPFYMEDVRVSRPKGFLAHIDAKGNQVQANELKLRSFKPQPKFLTADKARLFVTALTPRRNRLMAYLMWFSGLRREEVATLELEDIPNPSGHPPGKPLAMTLRTTKGRKPRWVSVPYDLAVQLWNYVIYERSSLAKRYRKRHVGPESNRVFLTAYGEPISLGGINNAFKKASEKSGIKCTPHMLRHTYGTYEFIRMSQFKSKESALFWVMERMGHSSLATTQIYIHTASLVDHTALDGYQADICKLLQEDCV >NZ_AP021844|2967016:3023959|2971516_2971993_-|WP_014235627.1|DBSCAN-SWA MSQRPFKVLGIQQIAIGGPSKDKLKTLWVDMLGLEVTGNFVSERENVDEDICAMGKGPFKVEVDLMQPLDPEKKPAVHTTPLNHVGLWIDDLPKAVEWLTANGVRFAPGGIRKGAAGFDICFLHPKGNEESPIGGEGVLIELVQAPAEVVDAFAKLAG >NZ_AP021844|2967016:3023959|3013422_3013785_-|WP_043797805.1|DBSCAN-SWA MTIAVETRLRLELLFILSLCFVAVLAEVLAAAAVLKPESEPLASWFQRSGAITSVFCVFAQLRINNFFESIRGGTFSESWALFRLFNKQHGTVSWIITFVAIWGAFVWGYGDLMLRHFSR >NZ_AP021844|2967016:3023959|3014523_3015012_-|WP_014236245.1|DBSCAN-SWA MSNNAGNPQALQAHKENSIKLTRQALNLAIQRIVGGHPDQVRVGSRLTAASVAQEAGVERSTLYRYHREILSEIQRINDTDAQSKLKIKTSALAKAEERKKEYRELLELEQSNLLKIARENYALQQRIQELEDLLKEKDSLIANLRAENRKVSLISESPREG >NZ_AP021844|2967016:3023959|2991625_2992534_+|WP_152090387.1|DBSCAN-SWA MAQETSRDPIKALLDDLEQSIADFDQRLGGVEESPAVTGLRSSGQRYPDIEPEARRQLSPAAPVAVAGNADATAVSEAPAVDLLAELAQAAACRSVDDAETQRRQLELTERLHQDLKTVFDYLNQLIRHANTLKPVLPRSYRLDARNSFDGLAWHDGFVDYRSTSRFDRSYYEQILFQVSYRAPAPLVAVCAADQAAIVRKELELVNLRIQREEPVMLPEGGPGVRYVLPDAIPLHLAVQADFANDALTFRCRNAGNFGPTAYRLPGGSITRPLLDGIGLVLLGRSDTMPKELQRIPYQRIN >NZ_AP021844|2967016:3023959|3007672_3008155_+|WP_083834012.1|DBSCAN-SWA MQTADFSKILNQALSRSNTPADVGVTVHRDGNKKPGDFQQKMLGIRAYRQQLIASNIANSDTPGYRAMDIDVEDAAKQNQMGLLPLAKSSPSHINGSAHWSSPPFNLKYRTPFQASADANTVEMDIERQHFAENAVMYQFTLDQVGGDFKELTELFRNLK >NZ_AP021844|2967016:3023959|3005988_3007326_+|WP_014236237.1|DBSCAN-SWA MLEILRHRSFRHLFLAQVVALVGTGLLTVALALLAYDLAGANAGAVLGTALAIKMIVYVTLSPVAGAVVPAAWRKRVLVGLDLIRAAVALLLPFVTEIWQVYVLIALLQSASACFTPLFQSLIPQILPEESDYTRALSLSRLAYDLESLLSPALAAALLVVISFHGLFAGTSVGFVLSALLVMSTAFPVVPETRLGDGPYSRALRGMRIYLHTPRLRGLLALNLCAASGASMVFVNTVVLVREVLGGGEREVAWALAAFGAGSMAVAFSLPTLLDRMADRRIMLSAASAMVVVLLAVTGVWWSTGGLGWASLIPAWVVLGMSYAGLVTPGGRLLRRSAQSDDLPFLFAAQFSLSHLCWLLAYPLAGWLGARLGFGVALSALSAMAAVGGALAWRTWPRQDPDVIAHHHDDLSTDHPHWNEYALGGGGRTHEHRFVIDELHQRWPH >NZ_AP021844|2967016:3023959|2998560_2999811_+|WP_014236231.1|DBSCAN-SWA MNLRCLLVLAVAGTCGIPLSGYAAESLRLEEAVSRALASHPSLAAEAAQLKAVQARAQREGLATPFMIGADVENVGGTGAFRGGQSAETTLRIGRVIELGGKREARQALGSAEINQQQNLSEATRLDVISRTSLRFISVLADQQRLKYAQEQVGQAERTRREVANWVAAARNPESDLRAAEIAVADAELERTRAEHKLTSARLTLASSWGVLTPDFETAAGNLLVLPKAESLDTLVARLPMTPEQRAALLEADSIAARKRLAEAGAKPDVTVNLGVRRLEATSDQALMMSVSIPLGNQVRSGLSVAEANAQLMALEARRDAQRFEHYQSLFGKYQELNQARTEAETLQKHMLPKAEEALAFTRRGFEAGRFSFLALAQAQKTLFELRQRAVDAAARCQILMTEVERLTAIAPEPTP >NZ_AP021844|2967016:3023959|2998098_2998500_+|WP_133247329.1|DBSCAN-SWA MYQSLRHRFRAYVQHPIGRCAVVLMLFALVAASVPFGEIHAHADGDHDHDHGYVTAELTKASLSDPSDSMDSDSDSTGAKVLHAHGSVVTPPPLPVDGLGIEPFIFPARDKITLAYLSRPSATPLPPYRPPIA >NZ_AP021844|2967016:3023959|3012699_3013314_-|WP_014236244.1|DBSCAN-SWA MRALSTADVPIQPLKEFGEDVGPEFEFEIEDGRVALLSAEPPSWIHLIADSSWWISLFSAAAALYMAEIVKEAAKESWKNRAKATALVVGAANKVKLFAEKVVRLKKKLPERTDIVVALPIPNDYFGVRLTLSVDDADLLAYQIALFVSHMPRLIEAIRNEKLDGPRVATGLFLELQDNGDLKVTWFNRESLELESLVILLPVQ >NZ_AP021844|2967016:3023959|3004962_3005604_+|WP_014236235.1|DBSCAN-SWA MSNTQTHSLPSSASLFKATAVAAGVAATLLVTMVLPAEYGMDPTGIGRFLGLDALKQSAGAETTSVLATPDAIAGPNAMLAAKADAAFGKQAGRSLDASAVSLAGDGPMRRNTFTVTLAPGKGAEVKAHLRAGEGLTFHWQATAAVAVDMHGEAPNAKNAWTSYSVESAQKSASGTFVAPFEGSHGWYWQNRGTEPVTVSIEASGFQSELYRP >NZ_AP021844|2967016:3023959|2983630_2984890_-|WP_152090899.1|DBSCAN-SWA MGRVRVLVLGAGVVGVTSAWFLAEAGHEVTVVDRQPGAALETSFANGGQISVCHAEPWANPRAPFKALEWLGKEDAPLLFRLRYDPALFAWSLRFLANCPPGATRRNIRDIIALALYSRQRLQALRQTLPLDYDQRCQGILHIFTQAAEFEAACHAAALMREFGVDREPVDAARCVAIEPALAAVQGRLAGGDYTPSDESGDAHRFTQRLAEAAAARGVQFRYNCPVEKIASAGGRVAGVVAGGDLLLADAYVVALGSYSPALLKPAGVKACVYPGKGYSATIALSPDSVAPSVSITDDERKIVMSRLGNRLRVAGTAEFNGHNLELTPVRCEALLRRALELFPQLRPDGDPLYWCGLRPVTPSNVPLIGRTRLPNLWLNTGHGTLGWTLSCGSAAALADLISGRRPEPDFPFLGTTKQ >NZ_AP021844|2967016:3023959|3011390_3012602_+|WP_014236243.1|DBSCAN-SWA MTGRWLLPEGVDRVVLPLLVGKALRAFADGYVAVLLPAYLLALGFGTLDVGILSTTTLLGSAFATLAVGAWGHRFHHRNLLLGAALLMLGTGLSFASLSAFLPLLLVAFVGTLNPSSGDVSVFLPLEHARLAESGQGTARTTLFARYSLLGALFAALGALASGIPQLLVSVLGIELLSGFRVMFVLYGLVGGTVWLLYRRMPAPRRECAVAAPQALGESKGVVVRLALLFSLDSFAGGLAINALMALWFFQRFELSLAAAGSFFFWAGLLSAVSQLIAPKVAERIGLVNTMVFTHIPASICLIAAAFAPGLELAFALLFIRALLSQMDVPVRSAFVMAVVTPAERAAAASFTAVPRSLASAISPTIGGAMFAAGWLAAPLVACGALKICYDLMLWKAFRQRDP >NZ_AP021844|2967016:3023959|3015008_3016919_-|WP_014236246.1|integrase|DBSCAN-SWA MSTLANPAKPIRALSSAEKARTPVSMAKTDLGESVVVSRYEDEIWDFWPYIPQNNARDSEKRINWRIEIAPGEFLTDHQHAPLLESTKDFTWSLFIDPIDGRQRPRMATVIAHVVALTPLLRWMASRGMRQFRDIEGRALEYVPIAKINQRTGKPTAKGWHARRLIVLEELYLQREKLADALPTHPWPDESGFTLSGERAKGRTIKTLRIPERTVRQLAEVAINYVTNLAPHILSTRDALEQAVANKEGFQATNIRIPLARELGFEGSRDLSAELSYLRDSCYIVIAMFSGIRDSETLSLKRGCIAHDKADDGIDLIWLHGTIFKTGIKPHKWLVPPIVETAVRVMEWYRQPYAIQIEEQISQFEQQLDMSIPGSTFHKRQLKRLHTARKDRDGLFLGCAPCVGHLVGVLSKSTIHRRLQNFCPHFNILGDDGKYWRLSSHQFRRTYAYFVASAELGDLHYLREHFGHWSIDMTLLYTSGASDAYQTDTDLLTEILRSKTEKQESVLHNYLMTDAPLANGDIMLADLRQTIKTAKNKQSLLQQISSSITLNGTGHSWCIGNAKGTSCGGLCVFEADMCVDCAYGMIGPEHLPVWKEIALQQQTALDMSDLGLPGKTRSQRILNKALDVIAKLEIPQ >NZ_AP021844|2967016:3023959|2987149_2987356_-|WP_014235614.1|DBSCAN-SWA MARITVEDCLKQIPNRFQMTLAATYRARQIANGSTPMQEPSKDKPTVIALRELAAGQIGLEILNRGQA >NZ_AP021844|2967016:3023959|2993274_2994510_+|WP_152090388.1|DBSCAN-SWA MSSRSSSRIIPIAVAGGTRAGGSPLHFTSPPPLSLYIHVPWCVRKCPYCDFNSHEARAENDEAAYVAALVADLESALPSVWGRKVSTIFIGGGTPSLLSGEALHELLNAVRMRLPLLPEAEVTLEANPGTAEAGKFAAFRAAGVNRLSLGIQSFNDRHLEALGRIHDSAEARAAIELAKAHFERFNLDLMYGLPQQSQAEAMADLEMALSFAPPHLSCYQLTLEPNTLFAARPPQLPEGDTCADMQDAIEARLAAAGYVHYETSAFARPDYQCRHNLNYWTFGDYLGIGAGAHGKLTLPDHSGFSVQRQMRWKQPKQYLEQVAAGQPVQEQHGVGADELPFEFLMNALRLNQGFDPALFEQRTGLPLLLVRGELEKAAREGLLTLAPDCIAPTERGRRFLNALLERFLPDA >NZ_AP021844|2967016:3023959|2989974_2990886_+|WP_014235610.1|DBSCAN-SWA MKFTIYQESRIGKRQNNEDRIAYCYSREAVLMVVADGMGGHYHGEVASQIAVQTLTSAFQRDAQPEIADPFLFLQKGMTNAHHAILDYSQEHRLKDSPRTTCVACLIQDNIAYWAHVGDSRLYHMRDGKVLAVTRDHSRVRLLMDEGLISEAQAATHPDRNKVYSCLGGENPPEIEFSRKTPLEVGDVLVLCTDGLWGPLPADVMAASLKGANLMQAVPMLLNQAEIRSGPYGDNLSVVAVRWEQSYSEEASSTVMTQTMPLDAVTTKLGEFGRDPAYKTDLSDDEIEKAIDEIRAAIQKFSK >NZ_AP021844|2967016:3023959|3016915_3018586_-|WP_014236247.1|integrase|DBSCAN-SWA MAIRRKRVDRNTHIDLTPEKQIIDLPPGWAFTIRCVHSGAEYHFDFTSHRARGREPLAEQMRDAIWSLRFVSAGKSLMTYFNSGIRCFWHFLDDLEQSGQVVTTLEQVDRLIIMQFVAWLALQTVQHGKNKGLPWSVSACNATYAGIKSILKNRRQHVPESINPNLNFPKNPYPNSNKRIPRREGYSAGEQERIIAACAEDLALFNANPEALSSHQVLAVHAVITVLTCGVNMTPLLEMRRDSLRSFLPDRDLLVLEKRRGYTTRTISLPKNTPEETATPITKMVGDYLRQLQQYTERFVSDADEADRPFVFLCRLADVTYSRRRGNVVRFDEVQVRNALKSFVNRHELFDDRGAPLNLSIARLRPTFALNYYLRHRDLRKLQQALGHSSILLTIQRYIPPVTPEAVRNHAFIGQAMVGWATSRDETLAIRLAADGEIPLRNATELLTGGYNTSIARCRNPFREADQVCGKFLACFRCPSMVVFEDDLYRLYSFYFRLLAERPKIPPHQWMKTFGPVIRTIDEQIAVQFDSAVVAEARQRAQSTPHPAWRNDAPLI >NZ_AP021844|2967016:3023959|2999807_3001040_+|WP_014236232.1|DBSCAN-SWA MNRLLPLILVPLLLTACGNDTPPSAVVAAEKASAAEEYERGPHRGRMLRQGDFALEVTIYETNVPPQYRLYAYQNGKPLPPASVQAAIQLKRLDGEFNNFTFTPEKDYLNGSSEVIEPHSFDVEVKAQHAGQSYSWAFPSYEGRTTIPAAAANDAGVKVEKAGPTTIRNTVRLMGAVMVDANRRAEIKARFPGIVRAVNVQEGQRVSRGQTLVAIEGNDSMRTYSVVAPFDGIVLARNTNVGDVAGSNTLVELADLSSVWVELRALGGDAEKLSVGQEVEISSATGGSRVTGKIQTLLPLASGQSVVARASIANPEGRWRPGMAVSADVTVAARQVPLAVKESGLQRFRDFTVVFTQVGDTYEVRMLELGERDGRYAEVLGGLKQGATYVAEQSFLIKADIEKSGASHDH >NZ_AP021844|2967016:3023959|3023077_3023959_-|WP_014236252.1|protease|DBSCAN-SWA MKRVLIFLATNLAIMVVLGLVINLLGLNRFLTANGLNLPMLLGFAAVMGFGGSFISLWLSKPMAKWSTGAQVIEQPRNPTEAWLVDTVARLAKNAGLPMPEVAIYEGEANAFATGPSRNNSLVAVSTGLLQSMSREEVEAVLAHEVAHVANGDMVTLTLIQGVVNTFVFFLARVVGYLVDSFLRKDNEESSGPGLGYMVTVVVCDIVFGILASMIVMYFSRQREFRADAGAARLMGNNPRPMMAALQRLGGLSPEPLPANMAASGIAGGKGWMSLFSSHPPIEERIAALQQRG >NZ_AP021844|2967016:3023959|3010938_3011394_+|WP_014236242.1|DBSCAN-SWA MKWITRERPKIDRIACPWLISRFVDESPEFLYVPAGEVMRIAAETGATPYDVPNTELGHHGDQCSFDAFIGKYKLEDAALNKLALIVRGADCGQPQLAKEAAGLLAISKGLSLNFSDDHEMLAHGMVIYDALYAWCADTPLKKIGRFLGLK >NZ_AP021844|2967016:3023959|2972310_2972787_-|WP_130459820.1|DBSCAN-SWA MNRLHKLTLAILAASFACLAQAADAPKTMRGADIPAGDPAPEVKAYAGKKPGLQQPIARTYKEQPPVIPHAVDNFDEITLEENQCLTCHGPEKYKEKKAPKIGESHFIDREGKQHAEVTHLRHNCVQCHVPQVDAPPLVENTFVGNIAASKDAKAKKK >NZ_AP021844|2967016:3023959|2984886_2987118_-|WP_130459828.1|DBSCAN-SWA MDTATDPAPAKPSASSAAPFAPAPDPAAPPTPYPFNDDPAYRVFLDSLDYLKPEEIAKIKEAFAFGEAAHRGQKRLSGEPYITHPLAVAGAIAEWRLDSTAIIAALLHDTMEDTGISKEELTERFGKGVADLVDGLSKLDKIEFSSYQEAQAENFRKMLLAMAKDLRVILIKLTDRLHNMQTLGCMRPDKRRRIALETLEIYAPIANRLGLNTVYRELQDLSFKHTHPMRYQVLLKAVMAARGNRREVLSKILDGVQSKMRDSGIEAQVFGREKSLYSIYRKMVEKRLSFSQVLDIYGFRVVVKDVPSCYLGLGALHALYKPLPGKFKDYIAIPKANGYQSLHTTLIGPYGMPVEVQLRTEEMHHMAQEGVASHWLYKDTEKSAAELQYQTHRWLQSLLELQSTAGDSAEFFEHVKIDLFPDEVYVFSPKGKIFSLPKGATPVDFAYAVHTDVGNRCVAAKINYELMPLRSELNSGDQVEIVTAAHANPNPAWLSYVKTGRARSKIRHFLKTRQHEESAALGERLLNQELFGLGITPSELPDASWEAVLKEGGSKSVKEVYTDIGLGKRLAAVVARRLLAHEAALPNAEPAPHTSVVIRGTEGMAIQLAHCCRPIPGDPIIGSIKKGQGLVVHTHDCAVIRKSRSAEPQRWIDVEWEPEPGKLFDVDIHVAARNARGVLAKVATEIAESGSNIEKVSMAPDPGFYTTLNFTVQVANRAHLARVLRAVRLIPEVVRITRERQEE >NZ_AP021844|2967016:3023959|3009185_3010550_+|WP_014236240.1|DBSCAN-SWA MSTILTAADSTSPESKPAEVSFWQAFLFWLKLGFISFGGPAGQIAIMHQELVERRRWISERRFLHALNYCMVLPGPEAQQLATYIGWLMHRTWGGIVAGGLFVLPSLFILIGLSWIYIAFGNVPLVAGLFYGIKPAVTAIVVQAAHRIGSRALKNNALWAIAAASFVAIFALNVPFPAIVAAAAAIGYFGGRVAPDKFKAGGGHGKADKSFGRALIDDDTPTPVHARFSWGQLAKVALIGGLLWLVPMGLLTASYGWSHTLTQMGWFFTKAALLTFGGAYAVLPYVYQGAVGSYGWLTGPQMIDGLALGETTPGPLIMVVTFVGFVGGYVKAVFGPDSLFLAGAVAAMLVTWFTFLPSFVFILMGGPFIETTHNDLKFTAPLTAITAAVVGVILNLALFFGYHVLWPKGFDGAFEWVSALIALGAAIALFRFKANVIHVIGGCAVIGFLVKMFL >NZ_AP021844|2967016:3023959|2968362_2969466_-|WP_152090376.1|DBSCAN-SWA MSVAPPSPSQRQARSRYQRGMLAWLQQPGDPAGLPEMRAAVRHLEAAAGGDFAPFWHSAEVFLRAISDGTLAVDAESRRLCARIDLQMRAALNGSEAPEGGLAEELQQCIRQGAGQLPPVTELISLMAKPEAPDLDAEAVAAWSAAGNAAVAAWNGRGSGDLAPFRRALIDLCAAAMSLNLPETLHLAESLAGVGDLLDAPEAAEDPYLRAAIAAALELLGDTRDLGLPVFAERVAHVAQRLAECRESQRPAVSPTLLRLFAGEIGEQAALMREELACLEPDGEALAESAHCLADHAAHLELDSAEALAQGLAAAIVRAQAGHGFDHPEVREALEAALAELDTMADFLLVAQPLPEATDILEILAQV >NZ_AP021844|2967016:3023959|3021026_3022400_-|WP_152090390.1|DBSCAN-SWA MTAKRPARPLLDGSLLPEPAQLLEGVNEKVWIEVIQKMDEVYNDLLQYEVALEQNNAALEESHRFIASILASMSDILVVCDRHGAIEEVNPAFQRYTGRDEPTLKGTSIFDLFADDKARDKARSFFASQSHDGAQDVELPLQAEDGGAVPVSFNWTPRLSGTGKLIGMVVTGRPVGELRRAYRELRQTHEDLKRTQQQLLHSEKMASLGRLVAGVAHELNNPISFVLGNVLALKRYAGRLETYLEAVHNRGCDCIPELEELRSELRIDRILEDMSPLIDGMIEGAERTRDIVDGLKRFSALDREAAERFNLAEVVQRAVRWVAKSAPPSFTVRTELPAELPLRGSPGQLQQVVMNLVQNALQATAAQPGGELLISGELGDRELRLTFHDNGPGIPDEALGHLFDPFFTTKPVGEGTGLGLSISYGIVERHGGRIVASNHPEGGAVFRLTLPRDLPAA >NZ_AP021844|2967016:3023959|2981906_2982758_-|WP_152090384.1|DBSCAN-SWA MLIDTHCHLDAAEFAPDREAIFQDGVTAGVQAMVVPAVAAATFAEVRACCLAYPGCAPAYGIHPLYTPAAREEDLSTLRRWLAEERDGPLAPLAVGEIGLDLYVPELQQGEALARQQHFFAEQLQLAVEFDLPVILHVRRALDPILKQLRRYRPRGGIAHAFNGSRQQADEFIKLGFKLGFGGAMTFSGSTRIRELAATLPLEALVLETDAPDIPPAFLTAASPDRRNKPAYLPRFAALLAELRGMPTAELIAATGANARAALPGLAALASATPTTTPPTAST >NZ_AP021844|2967016:3023959|3022410_3022899_-|WP_014236251.1|DBSCAN-SWA MERITISLDEDLAREFDALIAARGYSNRSEAVRDILRSQLETWRQARKESDHCVANLSYVYNHHERELAERLTSIQHDHHDLTVSTLHAHLDHEHCIESTILKGPTEDVRAFAQALMAERGVRHGQLNLVTVELDESQKQHSHKHSHAHGHGYRHLHLKPSR >NZ_AP021844|2967016:3023959|3005674_3005941_+|WP_043797801.1|DBSCAN-SWA MKNKTFLSLSLLVGSFMSLSSVAYAHGVHEDSAEPKATPTACRHLTDTEHYVVDLKDPATRALKTRCDATKKPVTPVAEKKDETPDKK >NZ_AP021844|2967016:3023959|2982804_2983596_-|WP_014235617.1|DBSCAN-SWA MSTVLSHLEDGVLTLTLNRPEALNALNLAMIEDLRAATARAEHDEAVGAVVLRGGEHFMAGGDLKWFHSQLALPPAERQALFEQTIAAVHATTLQVRRMGKPVVASVSGAAAGFGLSLMLACDLAVAADNAYFTLAYCHIGLSPDGGATWFLPRAVGAKRAAEIALLGDRFDAAQAREWGLINRVVPAAELEAESAKLARRLAAGPRQALARTKALLQASSGNSLPEQLFAEQGNFAACSVHPDFAEGLGAFLEKRKPAFGQK >NZ_AP021844|2967016:3023959|2987383_2987992_-|WP_130459829.1|DBSCAN-SWA MSGHLYIVTAPSGAGKTTLVRLLLQNDPAIGLSVSHTTRAPRTGEENGQAYHFTDVADFLARVDRGEFLEWAEVHGNYYGTSRTWIEQQLAAGRDVLLEIDWQGAQQVRKVFGDAIGVFILPPSMEELARRLAGRGTDSEDVIARRLAAARDEMRHVGEFDYVIINNDLQTALSDLLAVVRATRLKLPVQQERHASLFASLL >NZ_AP021844|2967016:3023959|2981004_2981910_-|WP_152090383.1|DBSCAN-SWA MIVRERPSLLRLFFIWRGSVVPHVLPQIVFTTSFAVLITWGAQHFGHLFPDYSAAPFALLGLAFSIFLGFRNSACYDRWWEARKQWGGLIVELRSLARDSLVLEAEPRRLLVRRSLAFAHALAARLRGRDAALEAAPFLPPSEAERLAQSRNPADALLRQCGHDLVQARQRDGLGDIVYQGLTQRLHALSGIQAACERIRFTPLPFAYTLLLHRTAHLFCLLLPFGLARSVGWATPLLTAVLAYTFFGLDALGDELEEPFGTLENDLPLDAMVRMLEGDLGEALGETDLPPLLQPQGYVLL >NZ_AP021844|2967016:3023959|2977311_2977560_-|WP_014235622.1|DBSCAN-SWA MNISSILVNAGPQQIAAVEAGLATLAGVEVHAVSEEGRMIVTIESDGDRETTQTYEAIQQLPGVMSLAMVYHHFEPDPEKES >NZ_AP021844|2967016:3023959|2977861_2980063_+|WP_152090381.1|DBSCAN-SWA MSRPLPILSTPAAAPPEAAPLTRYSPIPWGLVIVLSLLFVVVWLLPPLGGLKQSDTIFPLTLHTVMESFSFVVSVLVFAVSWHAYSRERAGNLMILACGFLAVALLDFGHTLSYRGMPDFVTPSSPQKAIIFWLAARYVAALTLLTIALRPWQPLARPRDRYRLMLWALLVTAAVFVSELYLPDFWPTMFVPGVGLTGLKIAAEYGLIAILGATAVILYPKTQGKPAFDAANLFTAVLITILSELCFTLYSNVNDVFQLLGHTYKVIAYFWIYKAVFVSSVRDPYLRLSLEMAERQAAEARIQFLAYHDPLTELPNRILVRERFERAVERARDQSSRVGLVYIDLDNFKTVNDSLGHTLGDLLLQAIGQRLQSLVPAGSTVSRQGGDEFLILLEDLEQSRLAESLVSRIVEQMQAPFEIQGHDLSTSVSIGVSLFPDDGGDFDTLLKKADTAMYRAKGAGRNGYRFFDREMDKDVGERLRLSNDLRLALARNEFVLHYQPQIDLRTQEVIGAEALIRWQHPELGLLAPGRFIGIAEDTGLIVPIGEWVIRMACHQAAAWQRAGLPPLVVAVNLSAVQFMRGDLVGTVASALATSALPSRCLELELTESILIQDAENILGTVQRLNAIGVQMSIDDFGTGYSSLSYLKRFAVDKLKVDQSFVRDLCSDPDDAAIVRAIIQLARSLGLKTIAEGVETAEILALLQELGCDEAQGYYFAKPLPADNFSAFLSQRLS >NZ_AP021844|2967016:3023959|2980114_2980948_-|WP_152090382.1|DBSCAN-SWA MTIDPSALSPLGKASEYRCHYAPELLFPIPRQLKRDEIGIDPARLPFVGEDLWNAYEISWLNPRGKPVVALGTFRIPAQTPHLIESKSFKLYLNSFNQSAFADAQTVAATLVRDLSAAAGGQVTVQLEPLAAQPRPRVDYPSGILLDELDIECDRYQPAPELLQADAGRSVEETLYSHLLKSNCLVTGQPDWGMVVVRYRGPAIDRAALLRYIVSFRGHNEFHEQCVERIFCDISARCAPQSLAVYARYTRRGGLDINPFRSSGEFLPPDNIREVRQ >NZ_AP021844|2967016:3023959|2992539_2993130_+|WP_014235607.1|DBSCAN-SWA MQKIVLASNNAKKLKELSALLTPLGIQLIPQGELGVPEAEEPHHTFLENALAKARHAAQLTGLPALADDSGLCVKALGGAPGVQSARYAGEPKSDARNNEKLLAALTGVADRRAHFVSLLVLVRHGDDPQPLVAEGEWHGEIIDQYRGEGGFGYDPLFYVPAEKATAAELSAEVKNRLSHRGQAMARLLERLKLEL >NZ_AP021844|2967016:3023959|3004244_3004949_+|WP_050804349.1|DBSCAN-SWA MPFRQFSATGICRWTVALLASLLPLWAFAHGVTGEDQSFLEQNTGRNLLLFAYLGAKHMVTGYDHLLFLFGVVFFLYRMRDVSIYVTLFAVGHSVTLLLGVLGGFHVNPYVVDAIIGVSVVYKALDNLGAFKHWLGFQPNTKAAVLVFGFFHGFGLATKLQDFSLSRDGLVPNMLAFNVGVELGQLLALAGILIVMGFWRRSTAFSRQAFTANTALMAAGFVLVGYQLTGYFVS >NZ_AP021844|2967016:3023959|2967016_2967958_-|WP_152090375.1|transposase|DBSCAN-SWA MGASILSAPYFHNEEAAYEFVESRLWPSGPVCPHCGCVERISKMGGKSTRIGAYKCYNCRKPFTVKVGTIFESSHIPMRLWLQAIFLISSSKKGISANQLHRTLGITLKSAWFMSHRIREAMRSGDFSPFGSEGGPVEVDETFIGRDYTKKPKGEKKGRGYDHKNKVLSLVDRTSGQARSMVVDDLKAKTLIPILEANIAREARIMTDEAGQYKNVGQHFAGHAFTRHGMGEYVSKIDPTIHTNTIEGFFSIFKRGMKGVYQHCGHHHLNRYLAEFDFRYNNRKALGIEDQERAEKLLQGVKGKRLTYETTAQ |
53 | Bacillus_phage(28.57%) | protease,transposase,integrase | attL 2991252:2991270|attR 3028742:3028760 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
| Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
|---|
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_1 | 11331-11433 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_1
|
1 spacers
spacers of NZ_AP021845_1
>1.1|11367|30|NZ_AP021845|CRISPRCasFinder ATCTCAACCCGCTCTAGGATTCGGCTGACT |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_1
The CRISPR arrays of NZ_AP021845_1 >merge|NZ_AP021845|1|11331-11433|CRISPRCasFinder GCAGGATATACCCCTCATTCGATGGGTGGTTACAGGATCTCAACCCGCTCTAGGATTCGGCTGACTGCTGGATGCACCCCAGATTTGCGGGGTGGTTACAGGT >NZ_AP021845|1|1|11331-11433|CRISPRCasFinder GCAGGATATACCCCTCATTCGATGGGTGGTTACAGG ATCTCAACCCGCTCTAGGATTCGGCTGACT GCTGGATGCACCCCAGATTTGCGGGGTGGTTACAGGT
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_2 | 11530-11698 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_2
|
2 spacers
spacers of NZ_AP021845_2
>2.1|11566|30|NZ_AP021845|CRISPRCasFinder CTGATATTGACAAGGCCTTGGCAGTTGTCG >2.2|11632|30|NZ_AP021845|CRISPRCasFinder GTCAGGAGTATCAGCCAGACAATGAACTTG |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_2
The CRISPR arrays of NZ_AP021845_2 >merge|NZ_AP021845|2|11530-11698|CRISPRCasFinder GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGGCTGATATTGACAAGGCCTTGGCAGTTGTCGGCTGGATGTACCCCACATCTGAGGGGTGGTTACAGGGTCAGGAGTATCAGCCAGACAATGAACTTGGCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG >NZ_AP021845|2|2|11530-11698|CRISPRCasFinder GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGG CTGATATTGACAAGGCCTTGGCAGTTGTCG GCTGGATGTACCCCACATCTGAGGGGTGGTTACAGG GTCAGGAGTATCAGCCAGACAATGAACTTG GCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_3 | 12193-12294 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_3
|
1 spacers
spacers of NZ_AP021845_3
>3.1|12229|30|NZ_AP021845|CRISPRCasFinder GAATATCGGTTCTGCGGTCGCAGATTGGCC |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_3
The CRISPR arrays of NZ_AP021845_3 >merge|NZ_AP021845|3|12193-12294|CRISPRCasFinder GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGGGAATATCGGTTCTGCGGTCGCAGATTGGCCGTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG >NZ_AP021845|3|3|12193-12294|CRISPRCasFinder GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG GAATATCGGTTCTGCGGTCGCAGATTGGCC GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_4 | 307466-307630 | Orphan |
NA
Consensus repeat of NZ_AP021845_4
|
3 spacers
spacers of NZ_AP021845_4
>4.1|307483|31|NZ_AP021845|PILER-CR AAACACGCTGACGGGGAGGGAGGCGCATGGG >4.2|307531|43|NZ_AP021845|PILER-CR TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA >4.3|307591|23|NZ_AP021845|PILER-CR TCGCTGACATCCCTGGAATCCGA |
CRISPR arrays and Neighbor proteins around NZ_AP021845_4
The CRISPR arrays of NZ_AP021845_4 >merge|NZ_AP021845|4|307466-307630|PILER-CR TTGCACGTCGACGTGCAAAACACGCTGACGGGGAGGGAGGCGCATGGGCTGCACGTTGACGTGCATTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGATTGCACGTCAACGTGCATCGCTGACATCCCTGGAATCCGATTGCACGTCAACGTGCA >NZ_AP021845|4|1|307466-307630|PILER-CR TTGCACGTCGACGTGCA AAACACGCTGACGGGGAGGGAGGCGCATGGG CTGCACGTTGACGTGCA TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA TTGCACGTCAACGTGCA TCGCTGACATCCCTGGAATCCGA TTGCACGTCAACGTGCA
>NZ_AP021845.1|WP_152091267.1|305586_306732_-|hypothetical-protein MQAIIAAGVAADNPLAPIFARKATLAKMAECSEVTVYRAMRQLEDAGWISRSEQVRLDDGSMDIGLLSITKKLATLVGLCLDEEVHGSAEESKRSQDIDSEPITVNNKKHTLTQNIAGREGAPTAAALVGNGPKQDGSDAKLCTQMKDGLIAGPIYRGEQRVDPKASVNYQSTRPGFVRIDGRSVAQELVWLIEEKRLTFGALFQLQTLAKQVPGQTLSDFVAYRSERIKQLTTTNDCYRYLKKLISDGIDARYLCAQRAKKEHRVMRRQQRDKAASSRAAWCRARHEMTFLNTQTGVTYRINANHELLEVGENGLPTSRPNLAITSKFIKAVEEGRLVPFRIQEPVINLELGNRRLDEMASKFPWLRRKGKGSPVEEVRA >NZ_AP021845.1|WP_152091266.1|305013_305346_-|hypothetical-protein MSSAVLQGAQVPLVAEFHQGTQVLIRWALWHKQHAYPKAILIGVFRKEDIRLVVYQYGKALCVAVKMADVAPKIRDWREWRQRYATVKGKLIQRGFALVDEREEVSHVAH >NZ_AP021845.1|WP_152091265.1|304361_305027_-|hypothetical-protein MSLTDFDLFGHPVAAPQNVTPPRRRVVSQAAKRAKLARHLEKSNNLVLFDELLDFLKAPLLKQQYDMDATFASDVGTVAVIEVDEDGNTQDEDSALVIPYEAWGEEWVTDSNGLAWSKEGLLFTQVRLFWRSMEELALNNNEQEKWSVLRWIFRPAIWKHYVYDKRIGRSHCFEVHERDETFSFHNCCIAARVDEDTVREGVRRNIPAEVVKAVEKVCKFD >NZ_AP021845.1|WP_152091264.1|303146_304286_-|AAA-family-ATPase MFDLSAILVQVSYSLCEVFGLDPNEVDPSITVDGFEIPDPSTLTDPVAQQYAAFLRAAVPPIDPYYQFRKDLVRDIRYWWLTGEGDVLLLWGPTGSGKTSVFEQWCARLGIPLFMAKGHRRFEPMEAFGQFVGGENGTTPWVDGPVTLAARYGLPCIINEYDRIAADRTIVFNDVFEGRSFPIPGKSGEVVTPQPGFRVAITANTNLVEDLSGNYGTANTHDISILERIVALHVGYPSDDTEAKLLEKELEQFSDDLLSYWFDQEGIKISTPQGMKEGSAINRGEFIQGLLEVAKKIRAQSKDGGNTSDSALERTMSTRILRKWARHSVAQASAPEKLGLSALHLALKKYLSSLSTESTRIALHQAVENVFGVGEVVKP >NZ_AP021845.1|WP_152091263.1|302680_303136_-|hypothetical-protein MSNNKFVGVAYWSVATQVADWLARAVDVLMPLERAGIAVLAYDCLPGGNQLTVTFGEERHQMLVTDGKGRGVVRASEAAAVFVVCLVALRKALGDISVVTDSQETVPALPRQNYPLYADSWQRVLPVAQELGLATGAAFTARHNAVLNNVF >NZ_AP021845.1|WP_152091262.1|301730_302669_-|hypothetical-protein MSAHIIDKLLFLVIIVAAARVLWRFYPYRQVVVEEDSREVVAPEVQTQPEVVQQQMDAAPVSSEREQVTASPLFVRHLNDMATLMVRYRHRKQACTAHLIVWSGSKRKGYSDSYFDLGLIEGQELTEQVIEQSLALAKQQLVDLAEKGKRKRRESKKQKQEAAAVAEVVVAEAAVSEVVVAEVAEAPADVAIVAATEVVEQKAEVPPLVVEDTPPESIKLRKFPSVYRGIIKEIGMMTQNKDGREFETFGVRFETQEGIVDAVFGVNLREALRDAKADVGDQVEILKIGRKTITKGKAPMNLFKIAKLECTA >NZ_AP021845.1|WP_152091261.1|300765_301308_-|single-stranded-DNA-binding-protein MASVNKVILVGNVGADPETRYMPNGDAVCNLRLATTESWKDKNSGEKRELTEWHRIVCYRKLAEITSQYVKKGSQLYLEGRIKTRKWQDKDGQDRYTTEIEMTEMQMLGNRRSGDSDGSGDDRPPRQHSAGGGGNGEPPARERRPAPAYDPMEDDIPFCRVDMNADPAFVKASARVRRVA >NZ_AP021845.1|WP_152091260.1|300423_300684_-|hypothetical-protein MYGAPYLGYGCEIVHSFNTLIDAIKRRQVLRPRSKLAINLAGGDIEVNLLPNGFVELDGKVQPVAVEKEIEDAMQQFGAVLKEVLV >NZ_AP021845.1|WP_152091259.1|300076_300421_-|KTSC-domain-containing-protein MHPNFTPVSSSNIDGYLYMPDRKILLIAFKSGGTYAYEDVEQPVATGFAQASSKGKFFRSDIKDRYATSKLDDMAVANLLGGMGASVPPQPRRKAPRVTLQSLLSRYPMLNAVF >NZ_AP021845.1|WP_152091258.1|298723_299683_-|hypothetical-protein MQKDQSSLAGTLPANRRTIVAMPDPILGSLRWPNRPKLPEGNPCWTYMVEGTREQYAVAVGHVENGRPHPFEVWVLANEQPRCLGAMAKTLSADMRTQDRAWLTRKLEVLASVTGDLAIDLPLGTERLLASSNTAAVARIVQYRLNQLGVANPEEGEATPLVDALLPVRDAGHEGTLSWTADIKNPSSGDDFTVFLPEVQTEDGQHRPVAVRLSGRYPRDLDGLAAILTMDMAVVDVAWIGMKLRKLLDYDEPMGSFLAKTPGTGKTERYSSIVAYLARLILHRYATLGLLTASGYPVVEMGVMVSVPGDASNVVPIAA >NZ_AP021845.1|WP_152091268.1|307728_308742_-|ParB-N-terminal-domain-containing-protein MKQRVIKRPDMPLTALGAPAPGADTSTDKGELPAVSATTQPPSPPLLHLAGAEEVVDIPVSKLRVSPCNARKIRLPKRVSKIAESLKNNGQKDPLYVYPGAGDDEGYFMVLGGETRRLGALQIALPTLKAFVDRKVDPTDALNLTKISNILNDSADECDLDRGMVAIDLLEKGHTQGEVAEVLELESHTHVQRLIKLAGLPKRFIDFGQDYPERFSASLGAYISQAIDRHGEDFAHDLLKAALVDELPHRKIAKAIEAGPSDKQPGQERGKRLRRDGGFDIPTPDAPGGRYDVYKSKTPGLKVLKLQVEVPDELAKDLNEKLTEVLTQFIQTSRDQQ >NZ_AP021845.1|WP_152091269.1|308759_309899_-|AAA-family-ATPase MSDLRPTYAYIGAVEHRLKSAAALLGVSENTLRTTLAESGIEVRRANQDNPNAPAVRLFDLPTIFQIAEYRRAKKLTKGPEGKKPIVIAIEIIKGGTGKTTTAAEVAVQLQLQGLKVLGIDIDIQANFTQLMGYEADLTEDEAAMYGLTEEAIVNGTFATICGPFIERNGRPVDAKAIIKYPFGPSGPAIIPADTFFSDLEHDISKTGGKRELVFQKFFKESLAGNVPGLNVGDFDVVLFDCPPNISFVATNALASADIVIAPVKMESFSVKGLSRLIGEVHTLKAEYGGEVKDPELVILPTYYSTNLPRVGRMQEKLAQYRANTSPVSISQSEEFPKSTDNYMPLTVIKPTCQPVKEYRMFVDHLIKRINEVSKARAS >NZ_AP021845.1|WP_152091270.1|309895_310324_-|hypothetical-protein MNAFGRDGDSRQKEHSNVNYRSRDGRETQGSLSSATRVPVPRGHAPHPIFPPSWSIPTRKPQFTAVFHYCGFPEVAHTLANLETPGPEGAENFVQKKMAECGYMATVGRVKTRKAAFCLPMRLTNNTRAKIMREGNHKDEME >NZ_AP021845.1|WP_152091271.1|310495_311095_+|hypothetical-protein MSESTTPKRSRHSYRERIQEVIQERIALGKPLTHRDILKEAGGGSASTVVEELAKAERSTPATLIGRGAKSLPQRIAALEDALNASLAREKVLEAENQALRESLTSARADVDKLLAGHQDSQRMLLQGVDDLRQMVKAGQGGMASAVIATERQKAAGDDTGDGILWKARHDQLLQRFVALDAKNRKMSSQLHELGVDVD >NZ_AP021845.1|WP_152091272.1|311091_312549_-|RepB-family-plasmid-replication-initiator-protein MRQHQPEQRNLFPTEDLIVPESLQKMRKAVAAIHAIPRNPEDSQNLTNRRVFDGLIIVAQIHCRQRGKEFIQRIRDERVSPLFEVRTSELGKLSGIPGKNYDRIIEEISRIYEMDFEFNVCAEDGETIWENRARLLSSLGVGKNHKRGYIRFAMDPEMLILLLEPNLWASFSLSVMHDLGTSAAYALFQQTYRYINTNQKLTAALPTKTWIELLVGKNRYVKDIDGKEVINYGEFKRRVLNDAIEKVNEVPALTYNIELKEHRQGNRVARLQFKFIPKEPTLQLESTWPEDILTVLKSIGFLDKEITDISQAHSSASVADAICRLKEAEQRLKSEGKAISSRKPYFLGILRNIAAGEDDIDPEKIEAEVRIEMAERAAEERKKKMQDAYDEHRRKRFSAWVTSLSVEDRKQLIADYEASEDFNPVLGKSLKKILTEENRSGLSTLRVWMEKHRSETLAGVFNTPEYQSLEGWMMWKLSGDDAIEA >NZ_AP021845.1|WP_152091273.1|313182_313680_+|hypothetical-protein MMDDDEIKDRQLNALVGGAVRGLVESGADDEAIGAFASTYRQKAAKLLGRPQEALDPPDLTDIIKDAVAQALAAAQVKPKKSRKQNEHFTVSIGGQKTSVTIHKDVIAQLAEAKGSKAEVSRFVREVAKDVPDSVENKSEWIEHRIATIMRFKSESAGNGSSARH >NZ_AP021845.1|WP_152091274.1|313676_314798_-|restriction-endonuclease MFGPKKQTAAAKPNGVDNLIHKLKALPPAIDLIVAATLGGYSFVAIYVDNAKFVGLACILIALIFAVLGISAVTREMKDFKVVEQNSNPEALRMMKTQQFENYLVALFRLDGYQVRPSIDELHRQDDADLIAVKKKETILIQYNHWDEDIVGTKPIQSLHKAAAAVRAQGATAISFGRFSAEAADWARRKGVTLMTMQDVIGMACRLTGLTPEEAAAEPDEEVVVEKAHEVAEVVRGHHRFLFVDFAGLEHGLARLSELLLQHPAYQVIASTLPPLKSMEDIRLSLGECGDRLAGDLEAAQDGRYFAIQKHLQASREGKHAIWLAVDSEPRQFPEGCAELIAVNRAFGFDVSASQRLIEAMVIIDRRSIAGAG >NZ_AP021845.1|WP_152091275.1|314875_315796_-|recombination-associated-protein-RdgC MFFRNLTLYRLPTPWNMDLAKLEEMLARNPFTRCSGSEQQRSGWISPRDKGSLVYAQNRQWLIALCTEQRLLPSSVIQDEVRERAEALEEQQGYAPGRKQLRELKDRVAEELLPRAFTRRRTTFVWIDPVNGWLAVDASALSKAEEVLEQLRMVLDDFPLSLVHTKLSHSSAMADWLAGGEAPVNFTVDRDCELKAVGEEKAAVRYVRHPLDGDGIASEIKAHLAAGKLPTRLALTWNDRISFVLTERLEIKRLGFLDLLMEEAEKNTEHAEEKFDADFALMTGELARFIPSLLDALGGEVVEDRA >NZ_AP021845.1|WP_152091276.1|315816_317793_-|DEAD/DEAH-box-helicase MQPPPDNERNAVEQANGQPLLDRLKRLGVTAWREPLLCLPKLFQDYSSISTLKQALPQNDVVAGPKLFTLLVSEKAVVLSQPKKRLVMTATDGMLSVKIVIFVVLGVDVPTWKAFEEGDKIHLRGVLQNWNGKLQITGPTLIDPQLVGKVIPIYEKRRSVVADGAIYDATRYALEHHLKETIDYLVESYHGLPEADILRRARLKAPSVEVILRAAHQPTSEDEGMRGIAGMRRLAALSVVENARRLKQRDPVPESVVSIPDTLIQQLTEKLPYPLTGDQRRSIGEIVADMASPLPMRRVLSGDVGSGKTLPIMIAALATQHLGHRAVILTPNGLLADQFVKECKALFGEDSLVISVTSGTKKLDLASNPILVGTTALLSRLKGESPPALFCVDEEQKMSVSQKIELTGFASNYLQATATPIPRTTALITHGAMDVSVLKEMPVVKNITTHIVTAGERKRLFDHTRKVLASGGQIAIVYPIVNDDEQEKKSVVAAAVEWEKQFPGLVGMVHGQMKEAEKVAAVNGLKSGNQKIAVVSSVIEIGLTLPSLRSLIVVHAERYGTSTLHQLRGRVARLGGNGYFFLFLPETVAPETMQRLQLLVDHSDGFTLSEKDAELRGYGDLFEDAERQSGNSRSTIFRCVDLTPSEIHAATIHEALPS >NZ_AP021845.1|WP_152091277.1|317776_318742_-|hypothetical-protein MTTTNLATAKEVATAVASLGFQASSETLGAISRNCREDFLEHLSRCITDQDQDGRSKKFIGNLLRCLAPNTINRIKPIFPDATIDMIVPVAKAVPTRFLSAIDAAHDPKHARHEDAKAYLASIFAPPDTHSEEEPPQSSLQQQHDQQEERPVDQAALSRRLAPSGSKKYHSVHVYGSNAALCFNATDWNGAPGVMVDAAMQTGPKTYDWKNAVHVWLDINEVGAVLAVFRRWRKGVEFSAHGAQNDKGFAIEFQGQHFFAKVTAKKAAAGAVRAVKILPSDAMSVSILFLTQLAESYPMIPLNELLATVRATHQIEDAAAA |
You can click texts colored in the table to view more detailed information
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
|---|
| CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
|---|---|---|---|---|---|---|---|---|
| NZ_AP021845_1 | 1.1|11367|30|NZ_AP021845|CRISPRCasFinder | 11367-11396 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11367-11396 | 0 | 1.0 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11566-11595 | 0 | 1.0 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11632-11661 | 0 | 1.0 |
| NZ_AP021845_3 | 3.1|12229|30|NZ_AP021845|CRISPRCasFinder | 12229-12258 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 12229-12258 | 0 | 1.0 |
| NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307483-307513 | 0 | 1.0 |
| NZ_AP021845_4 | 4.2|307531|43|NZ_AP021845|PILER-CR | 307531-307573 | 43 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307531-307573 | 0 | 1.0 |
| NZ_AP021845_4 | 4.3|307591|23|NZ_AP021845|PILER-CR | 307591-307613 | 23 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307591-307613 | 0 | 1.0 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NC_018141 | Legionella pneumophila subsp. pneumophila plasmid pLELO, complete sequence | 116661-116690 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP025492 | Legionella sainthelensi strain LA01-117 plasmid pLA01-117_150k, complete sequence | 38691-38720 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP021284 | Legionella pneumophila subsp. pneumophila strain Allentown 1 (D-7475) plasmid unnamed1, complete sequence | 109995-110024 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP011106 | Legionella pneumophila strain L10-023 isolate Ulm plasmid unnamed, complete sequence | 106875-106904 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP045305 | Legionella longbeachae strain B1445CHC plasmid pB1445CHC_150k, complete sequence | 37393-37422 | 6 | 0.8 |
| NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP042253 | Legionella longbeachae strain B3526CHC plasmid pB3526CHC_150k, complete sequence | 39581-39610 | 6 | 0.8 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KM389300 | UNVERIFIED: Escherichia phage CBA6 clone ctg7180000000096 genomic sequence | 16175-16204 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KC139562 | Salmonella phage FSL SP-029 hypothetical protein gene, partial cds; hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, and hypothetical proteins genes, complete cds; and DNA topoisomerase 2 gene, partial cds | 2138-2167 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KC139523 | Salmonella phage FSL SP-063 hypothetical protein genes, complete cds; tRNA-Met, tRNA-Trp, tRNA-Asn, tRNA-OTHER, tRNA-Ser, and tRNA-OTHER genes, complete sequence; and hypothetical proteins, DNA polymerase, hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, hypothetical proteins, putative tail fibre, hypothetical proteins, DNA topoisomerase IIs, hypothetical proteins, exonuclease A, hypothetical proteins, deoxycytidylate deaminase, hypothetical proteins, head completion protein, putative tail tube associated base plate protein, baseplate wedge subunit, hypothetical proteins, loader of T4-like helicase, hypothetical proteins, putative membrane protein, DNA ligase, hypothetical proteins, helicase, hypothetical protein, RecA-like recombination protein, hypothetical protein, putative dUTP diphosphatase, hypothetical protein, thymidylate synthase, hypothetical proteins, DNA end protector protein, baseplate tail tube, single-stranded DNA binding protein, hypothetical proteins, regulatory protein FmdB, hypothetical proteins, and base plate hub subunit genes, complete cds | 20576-20605 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | FQ312032 | Salmonella phage Vi01 complete sequence | 154723-154752 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NC_015296 | Salmonella phage Vi01, complete genome | 154723-154752 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NC_023856 | Salmonella phage vB_SalM_SJ2, complete genome | 80074-80103 | 7 | 0.767 |
| NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | MH427377 | Escherichia phage vB_EcoM Sa157lw, complete genome | 129521-129550 | 7 | 0.767 |
| NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_CP012399 | Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence | 220195-220225 | 7 | 0.774 |
| NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_CP018096 | Chelatococcus daeguensis strain TAD1 plasmid pTAD1, complete sequence | 197951-197981 | 7 | 0.774 |
1. spacer 1.1|11367|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
atctcaacccgctctaggattcggctgact CRISPR spacer atctcaacccgctctaggattcggctgact Protospacer ******************************
2. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
ctgatattgacaaggccttggcagttgtcg CRISPR spacer ctgatattgacaaggccttggcagttgtcg Protospacer ******************************
3. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
gtcaggagtatcagccagacaatgaacttg CRISPR spacer gtcaggagtatcagccagacaatgaacttg Protospacer ******************************
4. spacer 3.1|12229|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
gaatatcggttctgcggtcgcagattggcc CRISPR spacer gaatatcggttctgcggtcgcagattggcc Protospacer ******************************
5. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
aaacacgctgacggggagggaggcgcatggg CRISPR spacer aaacacgctgacggggagggaggcgcatggg Protospacer *******************************
6. spacer 4.2|307531|43|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
ttgccgagcaccaaggggtgatgcgagaggagggctgtgcgga CRISPR spacer ttgccgagcaccaaggggtgatgcgagaggagggctgtgcgga Protospacer *******************************************
7. spacer 4.3|307591|23|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 0, identity: 1.0
tcgctgacatccctggaatccga CRISPR spacer tcgctgacatccctggaatccga Protospacer ***********************
8. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NC_018141 (Legionella pneumophila subsp. pneumophila plasmid pLELO, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
9. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP025492 (Legionella sainthelensi strain LA01-117 plasmid pLA01-117_150k, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
10. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP021284 (Legionella pneumophila subsp. pneumophila strain Allentown 1 (D-7475) plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
11. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP011106 (Legionella pneumophila strain L10-023 isolate Ulm plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
12. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP045305 (Legionella longbeachae strain B1445CHC plasmid pB1445CHC_150k, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
13. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP042253 (Legionella longbeachae strain B3526CHC plasmid pB3526CHC_150k, complete sequence) position: , mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
14. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KM389300 (UNVERIFIED: Escherichia phage CBA6 clone ctg7180000000096 genomic sequence) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
15. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KC139562 (Salmonella phage FSL SP-029 hypothetical protein gene, partial cds; hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, and hypothetical proteins genes, complete cds; and DNA topoisomerase 2 gene, partial cds) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
16. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KC139523 (Salmonella phage FSL SP-063 hypothetical protein genes, complete cds; tRNA-Met, tRNA-Trp, tRNA-Asn, tRNA-OTHER, tRNA-Ser, and tRNA-OTHER genes, complete sequence; and hypothetical proteins, DNA polymerase, hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, hypothetical proteins, putative tail fibre, hypothetical proteins, DNA topoisomerase IIs, hypothetical proteins, exonuclease A, hypothetical proteins, deoxycytidylate deaminase, hypothetical proteins, head completion protein, putative tail tube associated base plate protein, baseplate wedge subunit, hypothetical proteins, loader of T4-like helicase, hypothetical proteins, putative membrane protein, DNA ligase, hypothetical proteins, helicase, hypothetical protein, RecA-like recombination protein, hypothetical protein, putative dUTP diphosphatase, hypothetical protein, thymidylate synthase, hypothetical proteins, DNA end protector protein, baseplate tail tube, single-stranded DNA binding protein, hypothetical proteins, regulatory protein FmdB, hypothetical proteins, and base plate hub subunit genes, complete cds) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
17. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to FQ312032 (Salmonella phage Vi01 complete sequence) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
18. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NC_015296 (Salmonella phage Vi01, complete genome) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
19. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NC_023856 (Salmonella phage vB_SalM_SJ2, complete genome) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
20. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to MH427377 (Escherichia phage vB_EcoM Sa157lw, complete genome) position: , mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer
----gaattccgtatcagccagacaaagaaattg Protospacer
*.* *************** *** ***
21. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_CP012399 (Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence) position: , mismatch: 7, identity: 0.774
aaacacgctgacggggagggaggcgcatggg CRISPR spacer gatcgcgctgacgggacgggaggcgcatacg Protospacer .* *.**********. ***********. *
22. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_CP018096 (Chelatococcus daeguensis strain TAD1 plasmid pTAD1, complete sequence) position: , mismatch: 7, identity: 0.774
aaacacgctgacggggagggaggcgcatggg CRISPR spacer gatcgcgctgacgggacgggaggcgcatacg Protospacer .* *.**********. ***********. *
| Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DBSCAN-SWA_1 |
72042 : 78469
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021845|72042:78469|DBSCAN-SWA CTCATGCAGAGGCTCCTTCTGTGGCTCCCGCCCGTCGCTCGCGGATCGACGTCCTGACCCAGGTCTCAGCCTCTTCGGGGCTTCCAAGAGAGCCGGTTTCTCGCCGGCGCCGCCTCATTGCTGCGGCTTGCCTCGAGGTGATTTCCACCGCACCGGACTTGATCGCCAGGCTCCGCTTAGCCAGGCAGATGTCGTAGTGATCCCCCTGGTACCAGCGCCGGGCGACACCAATGCAGTCGGCCATTGCATGTACCTCGGCTTCCGAGTCCCCGAGCATGTGACACATGACCATTCGTCCATAACCGGCCCGCATGTTGTCGACGTAGACGGTCATTGAAGACCCTCCGCAATCGCTATGCAGGCCTGCCAATCATTTGAATAAGGCCAGCGGGGTCCAATTTTGCCGTCTCTCATCTGGAATTCACCGAATTTGGCCAGGAATGACCAGACATCATGAGGAGTCTTGTTAGCGACATCGGCGCGGCGATAGTCGCGGAATAGATCGTCAAAACCTTCAATAAGGACGCGACGTATGCTCGAATAATCACAGGAATTGATGTCGTGACGAATAGCTGCAAGGTAATAGGCCAGTGCCTTTGCCCCGTTTGCGCCGGAAGTACGGCAACCGACCTCATGGTATGACCCAAGCCACCACATGGCTTTCGAATCGCCATTGCGCGCCCAGAGTCCGATTTTGAGTAGAACTTCTTGGAAAAGCGGCTCTGGGTAGCCGAACGGTTCAAGATCGAGGATTGAAGCGGTTCCCGCTTGGCCTGTAGGCCAATGGACGCTCCAACGCCAGTTTTTGAACTCAGGCGGATCATCGCGCTGAATTCCCATTTGCAGCACTTCATGCCCCTGCTGCCGGGCCAGGATAAGGTAGTTCCCGGTAAAGATTTTGTCGTCCTCCTCAACGCGGGTCGAGTCGCAAATAGGGCAACGAACCTTGCTGAGGGTACCAGTGACGGAAATGGGAACTCCGCTGTTGATGGCTTGGTTGCTTCCTCGCCATCCACATTCTTGGCATCTGACTGGCCAGTCTTGATTTTCCATGGTTCAGAACAGACTTCCAGTTGTCGCAACCTCGGGCTGCATCGCCGCGGCCGGAACCGTCGAAGCGGGCCTCTTCGAGGCTTTGCTGACTTTCTGGAGGTAGCGTTCAACCAGGCGGCCGAGGTTGGTTGTGTCGTCATCGACCTTCTGCACACCAAGGTGCGGATCTACGATCTGATTAGCCTCGCTGGCCTTGATACCAAGCACGTCCATGATGGGCGGATCGCTGCCCTCCTCGCTGACGAGGAAGAAGGCGGTGACCGGATCTTCCTGACCCTCTCTATCCAAGCGCCAGATGATTTGCTGGTGTATGCCCGGGGACCAATCGAGCTCGCCGAACACCACAACCGAACTGCGGAACTGCAGATCGTCAATGCCTGCTCCTGAGCGGAGAGACATGATCATCACGTCCGTGTCGCCGGTGAGGAACCGATCCTTTTCCTTGTTCTTCTGGGCAGCCGTTTCCGAGCCGGTGTACATGGCCGGGCGAAGGTCTGCGAGTTCTTCAAGCCAGATGTCGTAGACGGCCCGGTGCCACCCAACCAGAAGGACGGGCTCGCCGGCCTCGACCATCAGGCGGACGAACTGTGCTACTGCCTTGGCCTTGGAGAGCCCGGTAGCCTGCCGGACCATCATGTCCAGTTCCCGGGCGGCCTGCCCGCGCTCGACGAAGGTGCCAGTGGTGGCGCGGATGGCCAGCACCCTGGCCAGGTCCTCGATGGACTGCACAGCCTTGGCATCGAAGTCCACATACTCGATGATTCTGGACACCTTTGGCAGCTCGAGGCCCACATCCGATTTGAGGCGTCGTAAGAGGACGTGCTGTTCGCGGAGGTATGTTCCCAGAGCCTTCGGGTTCCCGATGCGGCCCATGTCATCGGTCCATTCTCTGGAGAAGTCAGCGAAACTACCGAGCACCGTGTCGTCGATGAACTGCATGACATTGTGCATTTCAATGCCGTATCCATAAATTGGGGTAGCGGTCAGCCCGAGCCTGTAAGACACGTGGTTCGCTAACACCTTGGCAGCGGCGCCCTTGGCGGTCGACGTGCCCGTGCGCAGGCTTTGTGGCTCATCGAATACGGCTGCTTTGAAGAAGTCCGTAGCAAAGATGTCCGCCCATCCGCCAATTTGCGAGATACGGAAGACATAGACGTCCGCCGGCGGCAGATCGTATGGGGATGCTTTGGTGATGATATGCACCCGGAGGTGGGTGAAAGCGGTGAGCTTGTCCTTCCACTGCTTTTGCATGTGCGGGTCGCAGACGACGGCCGCTGGGAGAGATTGGGCTTCAGCGCACAGGAACGCCGCTGCCGTGTAGGTTTTCCCCAGTCCGCCTTCATCGCCCAGGAGCAGCGAGCGGCGCCGCCGCAGGAGTTCGACGGCCTGCATCTGGTAGTGACGTACCTTCTGCCCTTCGCGAAGGCCTACAACCGCCGGCGGGACGTATTCGGGGAGAAGGATGCGCTCCATCTCGGCCTGCTGCATCTCGAAGTCGAGCCGACCGCCTCGCAGCGCGTTCCGGTCACCGTCCGACATGGCCAGCGGATACCGAGAGAGAAACCAATCGAGGTCCGCCGCGTGCATCAGATCGCGGGGAAAGCGGAAGGGACCGGTTGATTGCTTCGGCACCCGGGGGAAGATGTGCTTCAGGCGAATGGCGACATGCGGCTCCAGGGAGGACATTTCCCAGGCGGTACCGCCTTCGATCAATCGGAGTTCGCCGTAGGTCCGCATCAGATCCACCCCTGGCTCAACGAGGCGGCGAATAACGGCGTTCCTTCGATCTCCGGTGGAAGCCCCATGGAGACATTCGAGGCCAGGATGATCGCGGTCACCTCGGGATAGGTGGCGTACCGGGCCAGTTGTCGAAAAATGTCCATTTTCTTGGACTTGTTCCGCATCTTGCACTCGACCACTACGCCGCCGGCGATCAGGAAATCAGGGATGTCTTTTGGGGACAGGCGTTTTTCCCTTTCATAAGCGATTCCGGCTGCCTTGAGCACTTCGGCGACGCCTTCCTGCAGATGCTTTTCCGATGAGAGGTCGAGCCGGCTGCGCTGCACGAGGCGGATCACGTCGGCGATCACAGGGGGAGTTGTGGACGCCTGGCTCATGGCTGAGTCTCCTCCGGTTTCACCTCGAAGAAGTTGAGCTTCCCTTTCACTTTGCGGAAAGGCAGCGGCTTTGAATCTCCCACCACGAAGGCATGCTCCCCCATGTACCAGCGAGACGTCAGTTGGCCGTCTCTGGCCTTCTCGGGCGGGATGGAGTCGGTCAGCACGGCTTGGCCGACGATTCCACCAAGGTCGTACTGACCCGGGCGAGGAAGCGGAATTTTCGGAAAGCGGCCCTTCACCCACAGGTAGCCTTCCATGTCGAACGTCTGGCCGGCGTGGACGAGAAATGGTCCCCGGTGCTTCGTTGCCCAGGTACGGTTCTCGATGTCCTTGATCTCGCCGGCGGCGACGGCGGCGGCACGTTGCTGGGGGTCGGTCAGGTCTGGGCGCACGATCAGCCAGGCCCAGGGTTGTCGGATGGATAGCGCCTTCATGATCGATCTCAATGGGTGGTGGCCACTGTATTGCTGACCCGCAATTCAACCGGGACGCCGTTGATTTCGGGCGGGAGGCCCTGGATTGGTTCAGCAGCCAATACCAACAGCCGATCGGCGCCGTACAGGATGCGGTAGGGGGCGGCCGGCAGTTTCTCGGCCAGTACCGCCTTCAGATCGTTCTCCGCCGCTTCAAGAACTGGATTCTTGGTTGGCTGCTGCCTCGCATGGGCCAGTTCAGCGCGCATGGCTTCGAGCTCGAGCAGCGCAGCGAATGCGCGGTCTGCCGTTTCCGGGCAGCTGTAGGCGATCATGAAGGAATTGATCAGGCTGCGGAGCAAGCCGCCGGTGTCGTGGTTCTTCACGACCGACATCACGGCAAACTGATCCAGGCGCTCGACAGGGGTCTCCAGGAAGCGCTCCAGGCCCATGCCCTTAGCGATTGCGGTGAAGAGAATGGTCAACGTCCGCCGGCGCTCCTCCGGGGCCAGGGGCTGCCCGTTCGTAGGGCTTGCCAGCTCGCAGCGGAAGCGGTTGGCGAGAGCGACATCGACGCGCTGTTGGATGAGGGTTTCGATGTTGGGAGCTTTGGTCATGATGTTCTCCGATCAGCAGCCGCCGGTGGCGTGACGGTAGATAGAAACTGGCGAACGCCACTGGCGTTTTGAGGACCAAGGATGTCGCCTTCCATCGCCTCCAAGAGCCGTGCAGCGTGGTTATAGCCAATCCGCAGATGGCGCTGGACCATGGAAATGGAGGCATGGCGTCCTTTCAGGACGAGATCGCGGGCTTCCTGGTACAGGGGGTCTAGCCGACTTCCCTTGATGCTTTCATCGATATCGTGTCCGCGGATCATGCGACGTGCTCCCCAGCCAGGCGGGCCGTCAATGGCATCAGCGCCATCCGGCGGCGCTCAAGGCAGAACCACATGATTTCCGCCCCCGGCTTGCCCTGGGCGGCCAGTGCACGGTAATCCGCGATCAACTGGTCGAGGATGGTGGGTTGGACGTGGCCGTAGATTTGGTCGATCGTCGGGAAGTCCTTGAAAATCTCGCCCACCGGTACCGGCGGCTCGGCCATGAGGGCGTAGGCCCACGGCTGTGTGTAGCCGCAATGGGGGCAGATCCAGCCTTGCTCGGTAGCGATCAGGAGGCCGCGGTCACCGCCCTCGGTACTGTGGGTGGCGAGCGATACATCGGCGGCACCCGATTCGTCGTAGGTAATGCCGTCACCGCGATTTGGGCAGGTGAAGGGGTGGATGGGCATGCTGCCGTCCACGTGGCATTGTCGCTCGTTGAGATATTGAACCTGGGCGGGGGTAAAGGGGGCCTGAATCTTCATTGCTTCTGATGCTGTCTTGCGGGAGAGGAGATCGCCTCGTCGATCGAGAGGCAGTAATGCCAGAGGCGGTGCTGGCGGAAAACGCGGCTGATGCTGTCGCATTCCCCAACCTGGAATACCGGCACGCCGGCCATCAAGGCGGCGCCAGCCTCCAGAATTGCCCCCTTGAGGATTTCCCCGGCTTCGCAGTAGAGCACCAGGCGCTCGGAGCCGGCGATTTCGCTTAGGCACCGGTCAGCCAGCTCCTCGTAATCGGCGGTTTGGCCCTCGCCGGCTTCATCGATCCAGGAAGATGCTGTCTTGGTGCCGCTGTCGCGCAGCGCCTTCCAGCGATGGGCATGGACGACTTTGGAGGCGAAATAGACGCCCCCTCGACCTTTGGCGGTACCGGCGGCGATCTCAAGCACGGCTCTCACCTTGGCCAGGTCAAAATGGTTGTCGCTGCGAACCCCCGACTCCATGTCGATCCAGTAGGGCACGTCGCGCCAGGCAGTTGCGGATGCGTGGATGCGTGGCAGCTCGATTTCGAGGACATTGGGGCCCAGCCCGCCGGCGTAGCCGCAGTGCAACGGGATTTGGCCGACGGTCGAGATCGGTGGCCATTGCTCCGGACACTGCCCGGTCCCCTTGGATTCGTCGTAGAGCAGATCAATACGGGACAGGGAACTGGCATCGACCTCGGCAACGAACTTCTGAATGGCCTGGCGAGATCCGTCGTGGAATTGAAGGATCAGGTGCAGGCCAGCCTGGTGCAGCGTGCGGTAGACCGACAGGACCTCTTCCTCGGTGAACTCGGGTCGGCGCGCGTTGATGTTGAGTTGGATGCGCCGGTACCGAGAAATGTCGGCAATGCGCGCCGGGGCGGTGCGTGGATCAAGGATCTCGCGAAAGACTTGCGTGCCGCAGAGATGGGCTGCGGTGAAGGGCAAGCCCAGGGCCAGGAATGCTTCACGCCAGGCAGCAGAGGGGTTCCGTGGCGAGCCTTCCTTCTCCGGGAAATAGAGAATCGCCCACTCGGGCGTGATCGGATTGGTGTAGCGGGCCGACAGGGTAGCCAGTTCGGAGGGTAGGACTTTATCGTCGGCGCCGGTGATGGAAACGAGGTTCAGAGGCATGGAGGAATCCTTGAAGGGTTTCCTGCCATGCTACGGAGCCGCCATAGGCGTTTCTGGATTTTCTTCTTCACTCAGCGACCCTCCTTCGGCCGAGCAAAGGGGACCACGTTTTCCAGCTCACGTTGAACGCCCCCGGCCGCGCGGAATTCATCCCAGAGCACCATGTGGCCCGGGCAATAATGGACATTCGGCGCGACTTCGGTGGCGTGGCTCTCGCACAGGGGCATGTCGCACGTCTTGCCGTCCCCGACCGGGAAGTCGCACAGGAAGTCCCCAACGGACGCGCAGGCGCCGCAGTGAGGGCCGAATTCGCCGCACAGGAACATCGTCCCGCCGTCCTTCATCGGTTGGATGTAGCAGGGCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021845|72042:78469|74830_75211_-|WP_152090994.1|DBSCAN-SWA MSQASTTPPVIADVIRLVQRSRLDLSSEKHLQEGVAEVLKAAGIAYEREKRLSPKDIPDFLIAGGVVVECKMRNKSKKMDIFRQLARYATYPEVTAIILASNVSMGLPPEIEGTPLFAASLSQGWI >NZ_AP021845|72042:78469|72042_72375_-|WP_152090991.1|DBSCAN-SWA MTVYVDNMRAGYGRMVMCHMLGDSEAEVHAMADCIGVARRWYQGDHYDICLAKRSLAIKSGAVEITSRQAAAMRRRRRETGSLGSPEEAETWVRTSIRERRAGATEGASA >NZ_AP021845|72042:78469|75656_76244_-|WP_152090996.1|DBSCAN-SWA MTKAPNIETLIQQRVDVALANRFRCELASPTNGQPLAPEERRRTLTILFTAIAKGMGLERFLETPVERLDQFAVMSVVKNHDTGGLLRSLINSFMIAYSCPETADRAFAALLELEAMRAELAHARQQPTKNPVLEAAENDLKAVLAEKLPAAPYRILYGADRLLVLAAEPIQGLPPEINGVPVELRVSNTVATTH >NZ_AP021845|72042:78469|76240_76504_-|WP_152090997.1|DBSCAN-SWA MIRGHDIDESIKGSRLDPLYQEARDLVLKGRHASISMVQRHLRIGYNHAARLLEAMEGDILGPQNASGVRQFLSTVTPPAAADRRTS >NZ_AP021845|72042:78469|76985_78104_-|WP_152090999.1|DBSCAN-SWA MPLNLVSITGADDKVLPSELATLSARYTNPITPEWAILYFPEKEGSPRNPSAAWREAFLALGLPFTAAHLCGTQVFREILDPRTAPARIADISRYRRIQLNINARRPEFTEEEVLSVYRTLHQAGLHLILQFHDGSRQAIQKFVAEVDASSLSRIDLLYDESKGTGQCPEQWPPISTVGQIPLHCGYAGGLGPNVLEIELPRIHASATAWRDVPYWIDMESGVRSDNHFDLAKVRAVLEIAAGTAKGRGGVYFASKVVHAHRWKALRDSGTKTASSWIDEAGEGQTADYEELADRCLSEIAGSERLVLYCEAGEILKGAILEAGAALMAGVPVFQVGECDSISRVFRQHRLWHYCLSIDEAISSPARQHQKQ >NZ_AP021845|72042:78469|76500_76989_-|WP_152090998.1|DBSCAN-SWA MKIQAPFTPAQVQYLNERQCHVDGSMPIHPFTCPNRGDGITYDESGAADVSLATHSTEGGDRGLLIATEQGWICPHCGYTQPWAYALMAEPPVPVGEIFKDFPTIDQIYGHVQPTILDQLIADYRALAAQGKPGAEIMWFCLERRRMALMPLTARLAGEHVA >NZ_AP021845|72042:78469|72371_73094_-|WP_152090992.1|DBSCAN-SWA MENQDWPVRCQECGWRGSNQAINSGVPISVTGTLSKVRCPICDSTRVEEDDKIFTGNYLILARQQGHEVLQMGIQRDDPPEFKNWRWSVHWPTGQAGTASILDLEPFGYPEPLFQEVLLKIGLWARNGDSKAMWWLGSYHEVGCRTSGANGAKALAYYLAAIRHDINSCDYSSIRRVLIEGFDDLFRDYRRADVANKTPHDVWSFLAKFGEFQMRDGKIGPRWPYSNDWQACIAIAEGLQ >NZ_AP021845|72042:78469|75207_75648_-|WP_152090995.1|DBSCAN-SWA MKALSIRQPWAWLIVRPDLTDPQQRAAAVAAGEIKDIENRTWATKHRGPFLVHAGQTFDMEGYLWVKGRFPKIPLPRPGQYDLGGIVGQAVLTDSIPPEKARDGQLTSRWYMGEHAFVVGDSKPLPFRKVKGKLNFFEVKPEETQP >NZ_AP021845|72042:78469|78175_78469_-|WP_152091000.1|DBSCAN-SWA MPCYIQPMKDGGTMFLCGEFGPHCGACASVGDFLCDFPVGDGKTCDMPLCESHATEVAPNVHYCPGHMVLWDEFRAAGGVQRELENVVPFARPKEGR >NZ_AP021845|72042:78469|73097_74831_-|WP_152090993.1|DBSCAN-SWA MRTYGELRLIEGGTAWEMSSLEPHVAIRLKHIFPRVPKQSTGPFRFPRDLMHAADLDWFLSRYPLAMSDGDRNALRGGRLDFEMQQAEMERILLPEYVPPAVVGLREGQKVRHYQMQAVELLRRRRSLLLGDEGGLGKTYTAAAFLCAEAQSLPAAVVCDPHMQKQWKDKLTAFTHLRVHIITKASPYDLPPADVYVFRISQIGGWADIFATDFFKAAVFDEPQSLRTGTSTAKGAAAKVLANHVSYRLGLTATPIYGYGIEMHNVMQFIDDTVLGSFADFSREWTDDMGRIGNPKALGTYLREQHVLLRRLKSDVGLELPKVSRIIEYVDFDAKAVQSIEDLARVLAIRATTGTFVERGQAARELDMMVRQATGLSKAKAVAQFVRLMVEAGEPVLLVGWHRAVYDIWLEELADLRPAMYTGSETAAQKNKEKDRFLTGDTDVMIMSLRSGAGIDDLQFRSSVVVFGELDWSPGIHQQIIWRLDREGQEDPVTAFFLVSEEGSDPPIMDVLGIKASEANQIVDPHLGVQKVDDDTTNLGRLVERYLQKVSKASKRPASTVPAAAMQPEVATTGSLF |
10 | Ruegeria_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| DBSCAN-SWA_2 |
219800 : 230598
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021845|219800:230598|DBSCAN-SWA CCTAGTAGCGAGTCTGGTAATCGTCGCCACGGGAGAAGCGGCTGTTCCAGTGCTCCAGGTAGCCCTTGGCCAGCTCAGGGTTCTTCCAGACCACCAGCACGTTCTCGCTGTTCTTGTTCGCCGCCGCGCCGCTGTAATTGAAGCTACCGGTCTCGACCGTCTCCCTGTCCGATACGACGATCTTGTCGTGGTGGATAGGGTAGATCGAGATCGTCCGCACGCGACAGCCAGCATTCACCAAGGCGGAGAGCGCCGCCCGGGCCTTGCCGGACTTGTCCTCGACGACGTTGTTCTTGTAGTCGGAAACGATGGCGACATCGACGCCCCGCCGGCGGGCCGCGATCAGGGCTTCCACCACCGGTGCCGAGGTCAGCGAATAGGTCATCATGCGGATCTCGGAGCGGGCTGATCCAATCACCTTCAGAACCAGCTTTTCGGCGCCTTCATTGGGGCTGAAGGCATATTCGATGGTCCCGCTATGCTGGACCGTCGAGGAATGGTCCTGGAGGCCCTGGACGGCCTTCTCGGCTACATTCAGCCAGTTGGTTGCGCCGGCCTGTCCGGCGAAAAGAACGAGTGCGAGGGCGGCTGCCCTGAGTACGGCATGCTTCATGTTGCTTCCTTTCGAGATTGAGCCGCTTGCGCGGCGGGAACGGTTTTACTCCCCCTTGCCCTTGGCTTGGCGGAGGGTGATGAGTTCTTGGTACCGGGCGATCACTTCGCCGGCCGTGGTGAAGCCCACAGAGATGCCCAGGATGTCGGCCACCGCCTGGCGGACTTCGGCGATAAGCTGGGCTTCCTCCTCGGTCGGCACGCAGGTTGGCCAACCAGAGGAATAGAGCAGGTGAGCAGGCAGCCGGGATTCAGCCTGGACGCGCAGTTCGGCGCGACGCTTGGCATCAGCCAGCCCTTCTCGCTCCTCTGCGGACAGAAGATGCGGGGCCACGTCGCCGAGCCACGTCGCCCCCTGCAGGGCGAACGTCGAGGCGATCTCCAGGCGCACCATCCGGCGGTAGATATCGGCGTTATCCGGGCAGGACGCCGATGCCTTCAAATCGCCGGCCGACCCCATAATGCAAAACGTGCAACTGAGGCGCGTGCTTCCGTAGACGGTGTAGGCCTCGTGCAGGGGCGCCTGCTTGAGCGCGATGTAGTCGAATACTGCTGGCGTCGGCCACTTGATCACTGGATTCCAGTTCATGCCCACGCATCCTCGGGCCGAGAGAAGCGGCTGCGGCGTAGCGATGGGCATTTTTGCTCGAGCAGCACTCTCCGCATGGCGAACGCCCGTTGCGGACAGGATTTTTTGCCCTGGGTACCGCTTCGTTAGCTCCCTGCAGATGATCGATGTCTTGAGCTCGCTGGTGCAGAATCTCATTGATGGGGTTGACCATGGAAGGATGATCCTCACCGTCGAGAGCGATGCGTACCGCTCAATGTTGTTCTTCCAGCGGACTTCCCATCGATCCATCATGTCGCCCGCAGCACGGCGCACGACGATGAGTTCCCATCCCAAGCGCTTTGCCAGGCGTTCGCACAGCGGTAGGCTATCCTTCCATTCGACAGTGCCAAGGTCGCTGTGGATGAGAACACGCGGCCCCGAATGACCAATCTGGTCGAGGTATTCGGAAACGCGAACGGCCATTGCGACACTGTCCTTCCCTGCGCTCACGCCAAGTGCGCAAACTGCGCCTGCCGCCAGCCACTCACTGACCTCCGGTGTCACGGCCACGTCGTACTGGCCTGCCGCGGCGATTGCCGGGGCGAATAAGTCAAGCTGCTGCAGTCGGTCCATCAAGTCGCACTCCGTTGGCTTGAGGGCGCGGCCCTCACTCAAACGTCATGCTACGGATTGACCCCACCCGATTCTGAAATACTTCCCTGTCCGCGATGGACTGTCTGGTGGCCCGTCTTCCGGCCCGAATTCAGGCGGCGAGCGGGAGGCGGAGCTGGTCGCCCGGCTGCGCGGACATGCGGGAGCGCGCCGTCCTCATCCAGCGGCTCATTTCGCTCGAGCGGTGGGCCGTGGTGTTGGACAGACCGCTCTGCCGCGCCTTGATCCTGGCGCCGAAGTCGTAGGCCATGGAGTCGGCCGAAGCCACCCAGTCCAACATCTTCACCTCGGACAGGCTCGCCCCCTTGACCCCGAACAGGTGCAGCCGCGCGCCTTGTGGCAGCTTCCCTTCCAGGGCGGACAGGATGGCGTAGAGGCCGTGTGTCGGGTGATGCAGGTTCCGCCGGCACACCGATCCAACTCCGATGAGCAGAGGCGCATCAAGCCAGGGCTGCCAGCGCTCCCAAACGGCCGTCAGCAGTTCCAGGCTGCGGAGGTAGTCGGATGCCGACCACCCCTGGATGACGGGCACCGGCGGCGGCAACATGTTGGCCACCACCGAGGCTGAGCACGTCTTGGCGAGTTCGTTCTGCCAGGCGTAGACCACGCGCAGCACGCCCTCAAGCAGCGTTGCCGTCGCGTTGATCCGGTAGTCGATCGCCGCCTGGTCCTGGGCGATCTCCGGCTCGCAGCAGAGATCCGGCTGGGACCACCACGACGGGCTCAACAACGACGCCAGTTCAACATACTGCTCGTATGACCAAGGGAATACCCCAGCCATGCCCGGCTGCTTTCCCTTCGCCTTCCACAGCTTCATTGCGGTGAAGCCGGCACTGTCCAGCGCAACATCGAGCTCGGATAGATCGGTGGCCTCCGGCACGCGGAAACAGCCCTTCTCGGCGTCCCAGAAGGCATTCGCGCTCACCATGACGGGGAAGTCTTCATTGAAGGCGTGAAAGGCCAGCTTTCCTCCGCGGTGGGGAATGCCGACCCGCATAACGAGCTCGTCATCCAGGACGGCATTGTGTTGCCCGGTGGGCAAACGCCCCAGTTCCAGCTGATGATCCATGTTGAAGCACTCCGATTGCTCGAGGGCGCGGCCCTCAGTCAGTCGTCATGCTACGGATTGGCCCTGGCCTGTTCCGAAATACTTCCCTGTCCGTTGGGGATCGGCTTCGCTTCGACCTGAATGTACTCAAGGTTGTCCGTTGGGTGCAGGATCAAACGCCCGCGGTAGCCTGGCACCCGATCATCGACCAGGACCCGCAGATGCGGGCCCTTGGCGGACGTGATGGTGCAGTTCCAGATAACCCCGGCCCCGTCGGTGTAGCGCACACGCACGCCACGCCGGGCGGGCACCCCGTAGGTCCTGCGGATGTACTCAAGGCTCATGGCGCCCCCGCTCAGGCGGCCATCGGGGCCTTGATCGCGGCATGCGATTCGTAGCCCTCGAGGGTGATGTCGGCGGGCTCGATGCGCGTGAAGGCACCGGCGACGTCCTCCAGGCGCTCGATCCGCTTGATGTTGTCCGACAGGATCAGCTTCGGGGCTTCGAGGTGTTCCCTGGCCAGCAGCTCGCGCACCTGGTCGAAGTGGTCCTCGTAGATGTGTGCGTTTGTCGCCTGGATCGTGACCACACCAGGCTCGAACCCGGCCAGCCGCGCCATGATCGCCAGGAAGATCGAGGTCGCCGCGATGTTCGCCGGCGCGCCAAGGAACAGATCCCACGACCTGATCGTCATGACCAGGTTGAGGACACGGGGGTTCTCGAAGGCCACGAAACGGTAGTCCATGTGGCAAGGCGGCAGCGCCATCATGTCCAGTTCGGCCACGTTCCAGCCAGAGACGATCACGCGCCGATCAGAGGGATCGGTGAGCAGCTTCTTCAGCGAGTTCTCCAGTTGGTTGATGGTCCGCTGCATGAGCCATTCGGTCTTCTTCCCGTCCTCGTCGACCGCATCGGCGCACATGCGCACTTGGTAGCCCAGGGCCAGGAGCCGATCACGTTCAGCCGGGGTATCCGCGATGCGGCGATCCATCCACTCGGTCCATTGCTTGCCGTAGATTCGCGATAGGTGGTCGTGACCGCGACGGTAGGGACTGGCCAGCCAGGCCGGCGTCTCGTTGGCGTTGATGTCCCAGAAATGGCAGCCCAAGGCGCGGAAGTCGGCGGCGTTGTCGTAGCCCCGGAAAAAGCCCAGCAGCTCGCCGACGATGTTCTTGAAGGGCAGCTTGCGGGTGGTCAGCGCCGGGAAGCCCTGACGAAGGTCGAATTGCACCTGATGCCCCACCAGCGCGCGGCAGAGCTTGTTGGTACGGGTGTTGTACTGATCGACGCCCTGCTCCATCGTCAGGCGCAGCAGCTGGTGGTAGTTTTCCATGGTTCCCTCGCGTAAAAGTAGAAATCGTTTGGACAGAGACAGGATGCCAGATCGATACCCCTTGATCTGGAATCCTTTCCGCTGTCCCTCTTGGATTGGCTGTGTCCGTCAGGCGGCTTTCTTGACCGCCAGCCACGGGGCATTGGCGCGCAGGAATGCCTCCGCCATCACCGGATTGACGCTGTTACCGATCATGCGGACCTGGGTGGCCACGGAGAACTTCCGGCCGTCGTGACCACGGTCGATGATGTAGTCCTCCGGGAAATCCTGGCCGCGCGCCAGTTCGCGGGGCGTAAGCATGCGAAGTCGAATATCGACAATCACATACGGGTCGCCCTTGATGAACACGGTGACCAAGGCCAACCGGTCTCGCGTCGTGATCGTGTTGAGCGGCTTGTCCAGCGCGGACCATTGCCCGCCCTCGCTGTAGTATTCCATCAGGAAGGCGGCGACTTTCAGCGCGCCTGCCTCGTCTTCAGGGGACAGGCCGTTTTCGGTCTCGCCGGCCAGCTCGCATTCCACCAAGGCAGACTTTCCTCCGCCCCCGGCCGTGGCGGTCCCCATCGAATCGTTCGCGGCATGCCCGACACTGTTGCCGAACTGGCGGGACAGGAAAGCTGCCACAAGCGCATGATGCTCCCCTCCAGCGGAGATGGTCTGGATCGGGGCGTTCAGATCCCGGGCGTCGCAGTTGTTGCGCAACTGAGCCAGGTTGAGTGCCACCAGTTGCTGCTGGCTGCCGGTCGTCGTGATCGACGTCATCGACTCATCCAGGCCGCGGCCGATCGTCGCGTTGAAACCGTCGTTTGCCTGCATCATGAAGGCGCTCGACGGGCCGGACGGCTTCGATGGCGACAGCACCGGGGTGGCCAGCATCAGCTCGCCGCGGTGGGCCGTGGTAATCGTGGGCAGAGAGCCCTCAACCGAGTGCATCCGGTCAGACCCTTGATGGGTTGCCGGGACGATGATCGGGGTCGCCACGGCGAATTCTCCGCCCTTGGTCGCCGCCTTCACTGTCCGCAACGGTTCCAGTGCGCTGTGAACCACATCCTGTCCGTTGTAGTGGGCAATCGGAACGATGCTCGGCGTAGCCAAATACTTGTGATTCCCGCCACAGCAGGTCGCCATCGGCTCGTCGACCATGCGGGGGACGTTGTTCATCATGTTGTTCACGATGAACGGCTTCGGGTTGTCCAGGACGAACTTCTTGATCCCCTTGGCGATCCGGCGCTTCGTGGCCGGCGCCAGTTCCTTCTTCCGGCCGAAGATCGACTTGCCCAGGTTGGAGAAGTCGATGCAATCGGCAGCCGGCTTGTAGGCCATCTGCTTTCCGGTGGGACTCTCGAAGTGGATCTGCTCGGGCCAAATGATGGGGTACCCATCCCGGCGCGCGATCATGAACAGCCGCTTGCGAGTGGTATGGCCGCCCAGTTTCGCCGCCACCAGCGCGCGCCACTCGACCACGTAGCCCATCGCCTGAAGCTTCCTGACGAACTTCTTCCAGGTCTCACCTTCCCGCTTGGGGTCGGGGATCAGGTACTGCTGCTGCACCGGCACGCGCTCACCTGGCTCGGCCACAACCTGCCGGACCTTCTTCTTACCCTTCACGACCTCGGTCACCAGCTTGATGACGCGGCCCGTAGCCTTATCCCGCTTGGCGATCAACGGACCCCATTTCAGGATGGCCAGGACGTTCTCGAGCGAAATCACGTCCGGCTTGGCTTGCCCAGCCCAGCGCATCCCCGACCAGGACAAGCCCCTGATCTTCTTCGATCGCGGCTGACCGCCTGCTGCTTGCGAGAAATGCGTGCAATCGGGCGAATAGTGGAACCAGCCCACTTCATCACCCATCGTCGCGCCACGAGGATCCACCTCGTAGGCATCGGCGCAGAAATGCCGCGTGGTCGGGTGATTGATCCGATGGCAGCTCACTGCGTCATCGTTGTGGTTGAAGCAGACATCCGGCGATCTGCCGAAAGCCTTCTCGAAGGCGATCGACATCCCGCCCGCGCCGGCGAACCCGTCAACGATCTTTTTCCGGTGGATGTTCAGCAGCAGTTGCTTGCCCATTGTTCGTGCCCTCTGTGAATGACGATTGGGTCAAGTCTGCCTATCTGAGCCAGGCCTTTCTGAAAAAAGCGGGCAGACCCCGGAGGTGTGCCCGCTTCAATCTGCCTGTCCGTAACATCTGCGCTGCTTATGTCTCCTTGTTGGCCAGCTCCAAAAGCACTGCCGCATGGCAGGCGTCCTCATACGGATCATCCTTCCCACACCAGCAGGCCAGATTCTTCCCAGCAAGCTCGGCCCTGGCTTCGGCTACCAGTTTGGGGTTGAGAGGAGCCAGCGACCGATACAACACGAACGCATGCCGCTTGTCCCGAACGATCTGGCCCTTGGTCGGACCGAACGGCGCTGGTTTCCCCGGCACAAAGGGGTTCCCCCACTTTGTCGTCCGATCCACTTTCACCGTGTTTTCCGGCATGCGCCAGCCTTTGGCGCGCTTTAGTTGCACACGTTCTGGCATGGCCGCTACACCGGGTAGCCGTCGTGAATCTCGCCGTCCAGCGTGCGCCCCGCAGCTTTCTTGCCGATCAGGAACATATCCGGCTCATCGTCGATATGGTCATCTTCACGCGCCTTGCCGACGCGCCGGATGTCCCACTTGCCGTTGAACCAGTACGCGGCGTCGATCGCACGGTCGTTGGGGCCAGCAACCTCGCCTGGCGCCCATTCGCCCCATTGTTTGAACAGATAGGGCACGCCGGCGGCTGCGCACTGATCACGCAGCATCCTCGCCCAAGCCGGAAGGCCTGGCCGAGCACCGATACCGCTCTCGAAACCCTGGACAACCCAGTCAATGCCCGGGTCGTAGGCGGTCCATCCTGCGCGCTCACCACAGTGCGGGCAGAGAGGCACGGTTTCCTCGCCATCCTTTACGGTCTCGATTTCGTCGGCGATCACATAGCAGCTGTCCGGACAAACGTCTTTGCACTGCACGCCCACCGGATTGAGCCATCGGGCCAGATCGATCTCCCCCAATTGGGGCTCGCAGCTTACCCAGCGCACTGCCGCTGGCGTGTCCATCAGCAATGGCACCCGCTCGTCAGCCGCTGCCTGATCTTCCACAGAAACGCCCAGCCAGATACGGGGATGGGGGCCATCAAAGTTCATCACCTGATCCCAGATGCCGTCGGGGTCCGTGCCGCCACCGTGATTGACAGCCGCCCGCGCCCAGGCCTCCCGCCGGTCGGTGCTGAAGTAGTCACGCATGCGCTGCGCGCGCTTGGTGAGCACCTGGAAGATATGGCCGTCCTGTTCGTTGCGGCCATACAGGCAGGCCCACATCACACCCATGATGGTGTCGATCCAATCGTCCGGCACGTCCGGGTGGAACAGATCCGAGTGGGCGCAGACGAACACCTGGCGCTGCCGCCCCCAACGAATCGGCTGGTCAAGCCATTCATGGTTCAGGCGCACTTCGCCGGTCCATACCGGACCGTTTTTGGTGTCGATGGTCAGCCCTGCGCGGGACGGGTGATTCTTGAGGCGGCCGCCGGCCAGCTTCATTGCATAACACAGCCGACAACCACCGGAGGTTACAGAGCAACCCGTTATCGGATTCCAAGTGGCATCGGTCCATTCAATCTTGCTGTTGTCAGCCATGAGCACTCCCTTCAGTTGTTGGTGGAGCCATCTTTACAGCCACCACCTTCCAGTCGCCGAACTCGCGACCGGCTTGCTTCAATCCATCGATCTCCAGCGCGATCTCGCTGGCGTTTTGTTCCGATGCGGCACAGCAAAGCAGCATGGCCGCCGCGAAAGTCGTCACGCGGTCAAGCTGCTCCGCAGCACTGGTCAGCAAAGCGATCTTCTTCTTCACCAACTTGCCGAGTGCGGCGTCGTCGAGGTCGCTCCAGAGTGCCGCAGCCTTCTCTTCCTCGGTAAGTATCAGTTCAGCCACAGACGCCCTCTGCCAGTTCTCTGGGCGTCAGGCCCAGGCTTTCGTTGAACAAGGTCTTTTCCTCGTCCGACAGGGACACGCCTTCGGCAGCGATCACCAACAGGGCGTCGGCCTGAGACCGACCACCCAGCGGGCGGAACTTCACGCCATAGACAGAGGTGCCATGCACCAGGGCCGGGATGATCTCGTCCCAGTTGCCGTCGCTGTCCCGATACACCCAGCGGATGTCTTCCAGGGCAACGTTGAGCGCCTGGATGTGCAGGTGATACAGGAAGCTCACCAGTTCCTCAGCGGAGTTGGTGATACTGGCCCCATGGTTGCCGTCGGTCATGGCCACGACGACTCGCCCATCACGTCCATGCAGGATCGCCACATGGGCCGTAGCCTTGCCCTGGGCCAGCCCGTGACGGGGAACCGCCCCCGCCCGACTCACCACCAGGGGAAACAGGCCACCAATCGGCGGCCGCGTCATCGTTGCTTGCATACCTTTCCCTTCCTAGACGAACAGTGCGCCGCTCTGGCACAGACTGACGACACGCTTGCACTCAGGCACATCCATCCAGGAGATGTGAGCGAAGCGAACCCCAAGGGCTTTGGCCAGTGCCCTGTAGGCCTGAGACCGGTGCCACGGGGTCTTCCCCCGCCACAGCCGGTCAAAGGCGTCATGGGCTTCCCGGCGGGCCTGCATGGTCGGCCCATCAGCCAGCGTGCCGAGCGGGATGTCGGTACCGGGATGGCATCCCACCCGCGCACCGCAGGACGTGCAGCAATAGGCCAAGGGCCATCCATACTCTCGGCCCTTGTAGAAATCGGCGTTGTTGACCAGCTTCACCTCGCCCTTGCAGAAGCGGCAGGTGTCCGGCACGGATACGGGATCACCCTTCACCCGCGCGACAGCCTCGGGCAGATTCACCGTCCGCCCGAACAACTCGTAGGGTTTGAACCGTCTGCGCTTACCGGCCATGGCTTTCACCGTTCGGGCAAGGTTGCAAGGCTCTGGAAGGGCAAATAATCCGCTGCACCGGCAGCATGGCCAGATCGTCGCGGTAGCTCCGCGACCGGTTGAGACGCGGCAGCACAAGCCACCGCCAAACGCCCAGGAGCGATGTTCCCGTACAGACGATCTGCATGGTGACCATGCCGAAGGTGTGCGTCATCACGCCATAGGTCATCCAGGCCGCGTTAGACACCAGGAACAGGACGAACCCATACCCGGACCAGCGATTGTTGAAGGCCAGGAGGAGCGCCCCGGCCGCGCCCCCGATGGCACCAATCCATTCGAGCATGCCGAGGCTCAGCATGATGCTTCCTCCGGCTTGGCCACCGCCATCCGCTGACGGCAGGCCTTCTGGCCAGGGCACGGCATAGACACCCAGCGATAGATGCCAGCGGCCGTCGCATCCAGCGCCAGCACCAGCCCGAGGGCGACCACCCCGGCGGCCTTGAATGATGCCAGGGTCAGGATTGCAGAGAAGCCCCCCCCATCGACGGTGTTTGCGATCCAGCCGACGGCACCGGCGATCATCAGCATGCCGACGAAGAACTGGGCCAGCGCACGCTGGCTGAGGGTGCTGTTGAGAGCTTCCGGGGGGGTGTCCACCACCCAGCTTCCGGTGTGCCCACAAACGACAATGGGGCGATGCCACGCGGCGTAGCGCCAGGTGGGCACGTCGATGTCACGCTTCAGCAGCAGGAGGAGCAGAGCGAATTGGTAGTGCAGGTAATTGCTGATCATGGTGATACTCCAAAGAGGTTGGTATCAGCCCCCCGGTCTTTTTCCATCCGGCTCGCCGAACCGGATCGAAAAAGAGGCCTCCAGGAGGAGAGGAGGCCGAAAGCCTCAAGGTGAGGCCAGCGGGGGAGAACTGAAACCAAGATTACATAGATGGCATCGGCGATTCTGGGAAAAGATCAGCCTGTGCTTTGGATACGTCGGGGTCCGTAAGGGACGCGCTGCGTGCGTTCCACCTCCCCTGCCCTTGATGTCCGGCAGGGTCATTCACGGGACCGATTTGAAGGCGTCCTTCAGCTCGGGCAGCCGGCCGATCTCGGCCCACTTGCCCTGGAAGCGGCCGCGCTCGATGGTGGGCACCATGAACCGCTCTTCCCGGCGGATCACCAGGCCATCCTCGGACAGCACGTCGCACAGTTCAACATCGCGCTTGGAGCCGCCCCAGTCAGGGGCCAGCACATCCATGCGTGGCTGCACCTTCACCGGCGGCATCCCGTTGTAGCAAAGCGGCCAGCGGGAATGCCAGTTGCCCAGGCTCTCGCAGGTGGCGTACCAGAGGGTCATGTCGTCGCTGACCGAATCGAAGTGTCCATCCAGGACAACATGCTCAACCCGGTGGGTCAGCACGGTGTCGGCGTTGATCGCCCAGCGTGTCACCACGGTATAGGCACTCAGGAACTTCTCGGTCTCGATGAAGAGACCGGCCACAGCCTTCTGGCCGACCTTGCGGCAGCCCACCCAGTAGCACTTGTTCGGGTAGGGCCGGCGAATGGTCAGGTCCTGGTCGCCCTTGAGCACCAGGCCCGTCTCCTCGATGGTCAGATCGACGAGTCGAACGGCGGCATCCAGTCCGCCAGGGACGTACAGCTTGGGTAGAACATGGATCAGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021845|219800:230598|229974_230598_-|WP_152091191.1|DBSCAN-SWA MLIHVLPKLYVPGGLDAAVRLVDLTIEETGLVLKGDQDLTIRRPYPNKCYWVGCRKVGQKAVAGLFIETEKFLSAYTVVTRWAINADTVLTHRVEHVVLDGHFDSVSDDMTLWYATCESLGNWHSRWPLCYNGMPPVKVQPRMDVLAPDWGGSKRDVELCDVLSEDGLVIRREERFMVPTIERGRFQGKWAEIGRLPELKDAFKSVP >NZ_AP021845|219800:230598|221724_222702_-|WP_172974840.1|DBSCAN-SWA MDHQLELGRLPTGQHNAVLDDELVMRVGIPHRGGKLAFHAFNEDFPVMVSANAFWDAEKGCFRVPEATDLSELDVALDSAGFTAMKLWKAKGKQPGMAGVFPWSYEQYVELASLLSPSWWSQPDLCCEPEIAQDQAAIDYRINATATLLEGVLRVVYAWQNELAKTCSASVVANMLPPPVPVIQGWSASDYLRSLELLTAVWERWQPWLDAPLLIGVGSVCRRNLHHPTHGLYAILSALEGKLPQGARLHLFGVKGASLSEVKMLDWVASADSMAYDFGARIKARQSGLSNTTAHRSSEMSRWMRTARSRMSAQPGDQLRLPLAA >NZ_AP021845|219800:230598|219800_220412_-|WP_152091180.1|DBSCAN-SWA MKHAVLRAAALALVLFAGQAGATNWLNVAEKAVQGLQDHSSTVQHSGTIEYAFSPNEGAEKLVLKVIGSARSEIRMMTYSLTSAPVVEALIAARRRGVDVAIVSDYKNNVVEDKSGKARAALSALVNAGCRVRTISIYPIHHDKIVVSDRETVETGSFNYSGAAANKNSENVLVVWKNPELAKGYLEHWNSRFSRGDDYQTRY >NZ_AP021845|219800:230598|226543_227677_-|WP_152091186.1|DBSCAN-SWA MADNSKIEWTDATWNPITGCSVTSGGCRLCYAMKLAGGRLKNHPSRAGLTIDTKNGPVWTGEVRLNHEWLDQPIRWGRQRQVFVCAHSDLFHPDVPDDWIDTIMGVMWACLYGRNEQDGHIFQVLTKRAQRMRDYFSTDRREAWARAAVNHGGGTDPDGIWDQVMNFDGPHPRIWLGVSVEDQAAADERVPLLMDTPAAVRWVSCEPQLGEIDLARWLNPVGVQCKDVCPDSCYVIADEIETVKDGEETVPLCPHCGERAGWTAYDPGIDWVVQGFESGIGARPGLPAWARMLRDQCAAAGVPYLFKQWGEWAPGEVAGPNDRAIDAAYWFNGKWDIRRVGKAREDDHIDDEPDMFLIGKKAAGRTLDGEIHDGYPV >NZ_AP021845|219800:230598|220457_221594_-|WP_152091181.1|DBSCAN-SWA MDRLQQLDLFAPAIAAAGQYDVAVTPEVSEWLAAGAVCALGVSAGKDSVAMAVRVSEYLDQIGHSGPRVLIHSDLGTVEWKDSLPLCERLAKRLGWELIVVRRAAGDMMDRWEVRWKNNIERYASLSTVRIILPWSTPSMRFCTSELKTSIICRELTKRYPGQKILSATGVRHAESAARAKMPIATPQPLLSARGCVGMNWNPVIKWPTPAVFDYIALKQAPLHEAYTVYGSTRLSCTFCIMGSAGDLKASASCPDNADIYRRMVRLEIASTFALQGATWLGDVAPHLLSAEEREGLADAKRRAELRVQAESRLPAHLLYSSGWPTCVPTEEEAQLIAEVRQAVADILGISVGFTTAGEVIARYQELITLRQAKGKGE >NZ_AP021845|219800:230598|226211_226538_-|WP_152091185.1|DBSCAN-SWA MPERVQLKRAKGWRMPENTVKVDRTTKWGNPFVPGKPAPFGPTKGQIVRDKRHAFVLYRSLAPLNPKLVAEARAELAGKNLACWCGKDDPYEDACHAAVLLELANKET >NZ_AP021845|219800:230598|222752_223025_-|WP_152091182.1|DBSCAN-SWA MSLEYIRRTYGVPARRGVRVRYTDGAGVIWNCTITSAKGPHLRVLVDDRVPGYRGRLILHPTDNLEYIQVEAKPIPNGQGSISEQARANP >NZ_AP021845|219800:230598|223036_224014_-|WP_152091183.1|DBSCAN-SWA MENYHQLLRLTMEQGVDQYNTRTNKLCRALVGHQVQFDLRQGFPALTTRKLPFKNIVGELLGFFRGYDNAADFRALGCHFWDINANETPAWLASPYRRGHDHLSRIYGKQWTEWMDRRIADTPAERDRLLALGYQVRMCADAVDEDGKKTEWLMQRTINQLENSLKKLLTDPSDRRVIVSGWNVAELDMMALPPCHMDYRFVAFENPRVLNLVMTIRSWDLFLGAPANIAATSIFLAIMARLAGFEPGVVTIQATNAHIYEDHFDQVRELLAREHLEAPKLILSDNIKRIERLEDVAGAFTRIEPADITLEGYESHAAIKAPMAA >NZ_AP021845|219800:230598|227669_227975_-|WP_152091187.1|DBSCAN-SWA MAELILTEEEKAAALWSDLDDAALGKLVKKKIALLTSAAEQLDRVTTFAAAMLLCCAASEQNASEIALEIDGLKQAGREFGDWKVVAVKMAPPTTEGSAHG >NZ_AP021845|219800:230598|224122_226084_-|WP_152091184.1|DBSCAN-SWA MGKQLLLNIHRKKIVDGFAGAGGMSIAFEKAFGRSPDVCFNHNDDAVSCHRINHPTTRHFCADAYEVDPRGATMGDEVGWFHYSPDCTHFSQAAGGQPRSKKIRGLSWSGMRWAGQAKPDVISLENVLAILKWGPLIAKRDKATGRVIKLVTEVVKGKKKVRQVVAEPGERVPVQQQYLIPDPKREGETWKKFVRKLQAMGYVVEWRALVAAKLGGHTTRKRLFMIARRDGYPIIWPEQIHFESPTGKQMAYKPAADCIDFSNLGKSIFGRKKELAPATKRRIAKGIKKFVLDNPKPFIVNNMMNNVPRMVDEPMATCCGGNHKYLATPSIVPIAHYNGQDVVHSALEPLRTVKAATKGGEFAVATPIIVPATHQGSDRMHSVEGSLPTITTAHRGELMLATPVLSPSKPSGPSSAFMMQANDGFNATIGRGLDESMTSITTTGSQQQLVALNLAQLRNNCDARDLNAPIQTISAGGEHHALVAAFLSRQFGNSVGHAANDSMGTATAGGGGKSALVECELAGETENGLSPEDEAGALKVAAFLMEYYSEGGQWSALDKPLNTITTRDRLALVTVFIKGDPYVIVDIRLRMLTPRELARGQDFPEDYIIDRGHDGRKFSVATQVRMIGNSVNPVMAEAFLRANAPWLAVKKAA >NZ_AP021845|219800:230598|227967_228459_-|WP_152091188.1|DBSCAN-SWA MQATMTRPPIGGLFPLVVSRAGAVPRHGLAQGKATAHVAILHGRDGRVVVAMTDGNHGASITNSAEELVSFLYHLHIQALNVALEDIRWVYRDSDGNWDEIIPALVHGTSVYGVKFRPLGGRSQADALLVIAAEGVSLSDEEKTLFNESLGLTPRELAEGVCG >NZ_AP021845|219800:230598|229269_229710_-|WP_152091190.1|DBSCAN-SWA MISNYLHYQFALLLLLLKRDIDVPTWRYAAWHRPIVVCGHTGSWVVDTPPEALNSTLSQRALAQFFVGMLMIAGAVGWIANTVDGGGFSAILTLASFKAAGVVALGLVLALDATAAGIYRWVSMPCPGQKACRQRMAVAKPEEASC >NZ_AP021845|219800:230598|228471_228939_-|WP_152091189.1|DBSCAN-SWA MAGKRRRFKPYELFGRTVNLPEAVARVKGDPVSVPDTCRFCKGEVKLVNNADFYKGREYGWPLAYCCTSCGARVGCHPGTDIPLGTLADGPTMQARREAHDAFDRLWRGKTPWHRSQAYRALAKALGVRFAHISWMDVPECKRVVSLCQSGALFV |
13 | Mycobacterium_phage(22.22%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
| Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
|---|