Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_AP021844 | Azospira sp. I09 | 0 crisprs | DEDDh,DinG,WYL,RT,csa3 | 0 | 0 | 3 | 0 |
NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 4 crisprs | PD-DExK,DinG,csf5gr6,csf1gr8,csf2gr7,csf3gr5 | 0 | 7 | 2 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1397060 : 1410067
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021844|1397060:1410067|DBSCAN-SWA ACTAAGCCGTGCTCTTACGCGCGGCGACTTCTTCCAGAAGCCGCTCGATCAGGCTTGCCACGGGCATCTGCCCCAGGTCCTGACCACCGCGGGTACGCACGGCCACCAAGCCGGCTTCCTTTTCCTTGTCGCCGATGACGAGCTGATAAGGCAACCGGTTCAAGCTATGTTCGCGTATTTTATAGGTAATTTTTTCATTGCGCAAATCCGCTTCGGCACGCAAACCTGCCTGACGCAGGGTTTTCACCACTTCGGCGGAAAAATCGGCCTGTTTTTCCGAAATATTCAGCACCACGGCCTGTACCGGCGCCAGCCACAGGGGCAAGGCACCCGCATAGTTTTCCACCAGGATGCCGATGAAACGCTCCAGGGAACCGAGGATGGCCCGATGCAGCATCACCGGCACATGGCGGGCGTTGTCCTCGCCCACATACTCGGCGCCCAGGCGACCCGGCATGGAGAAATCTACCTGCATGGTGCCGCACTGCCAGGAACGCCCGATGGCATCCTTGATGTGGAACTCGATCTTGGGGCCATAGAAGGCACCCTCGCCGGGCAATTCATCCCATTCCAGGCCGGAAGCCTTGAGGCCGGCCCGCAGGGCATTCTCCGCCTTGTCCCAGATGTCGTCGGAACCGACACGGCTTTCGGGGCGCAGGGCCAGCTTCACCGCCACCTGATCGAAACCGAAGTCGGCATAGACCTTCTTCACCAGAGCGTTGAAAGCCGTCACTTCCGCTTCGATCTGGTCTTCGGTACAGAAGATGTGACCGTCGTCCTGGACGAAGCCGCGCACGCGCATCAGGCCGTGCAGGGCGCCGGAGGCCTCGTTACGATGACAGGAGCCGAACTCGCCATAGCGCAGGGGCAGGTCGCGGTAGGAGCGCAGATCGGAATTGAACACCTGCACGTGCCCCGGGCAATTCATCGGCTTGATGGCGTAATCCCGCTTCTCCGACTCCGTGGTGAACATGTTGTTCTTGTAGTGCTCCCAGTGACCGGACTTCTCCCACATGCTGCGGTCGAGAATCTGGGGGCAGCGGATTTCCTGGTAGCCGTTGTCGCGATAGACCTGGCGCATGTACTGCTCGATTTCCTGCCAGATGGCCCAGCCCTTGGGGTGCCAGAAAACCATGCCCGGCGCCTCGTCCTGCATATGGAACAGATCCAGATGCTTGCCGATGCGGCGGTGATCCCGCTTCTCGGCCTCTTCCAGCATGTGCAGGTAGGCTTCCTGGTCTTCCTTCTTGGCCCAGGCGGTGCCGTAGATGCGCTGCAGCATCTCGTTCTTGGAATCGCCGCGCCAGTAGGCACCGGCCACCTTCATCAGCTTGAAAACCTTGAGCTTGCCGGTGGAGGGGACGTGGGGGCCGCGACAAAGGTCCACAAATTCGCCTTCCCGGTACAGGGAAACATCCTGGTCGGCCGGGATGGCAGCGATCAGCTCGGCCTTGTACTTCTCACCCTGCTCCAGGAAAAACTTGACCGCATCGTCCCTGGCCCAGACTTCGCGGCTTACCGGAATGTCGCGCTTGGCCAGCTCGGCCATTTTCTTCTCGATGGCCAGCAGGTCTTCAGGGGTAAAGGGGCGCTTGTAGGCGAAGTCGTAGTAGAAGCCGTTGTCGATGACCGGACCGATGGTCACCTGGGCTTCGGGAAACAGCTCCTTCACCGCATAGGCCAGCAGGTGGGCCGTGGAGTGGCGGATGATCTCCAGGCCGTCCGCATCCTTGTCGGTGACGATGGCCAGCTGGGTATCACGCTCCATCAGGTGGGACGTATCCACCAGCTTGCCGTCCACCTTGCCGGCCAGGGCGGCACGGGCGAGGCCGGCACCGATGGAGGCAGCAACCTCGGCAACGGTCACCGGAGCATCGAAGGAGCGGATGGAACCGTCGGGCAGCGTGATATTGGGCATGATTT
CTCCAGCCACATTAAATTCTGGACAAAAAAAAAGTGCGGACGGATCCGCACTTTTTTTCGACATAGGAACGAAGGCGAACCGCTTAGGCTTTCTGAACCAGGATAGTGCAAGTTCGGAAATTCACGGTACAGCTCCGTTCAAGGTGTTGGTAGGCGCGATTGGATTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGCGCCTGCAAAGAAGGCCGAATTATAACGATCCCAGAGAATCGCGCAAGGGGCTGAGGAGACTTTTTTCGCTTCGCCCCACAAAAAAAATCAGCACAACCGTCCGGGCGAACATTTGGCCAGGGCCGGCGCCACTGCGGTGACGGGGCGCGACTCCTGGATCCAGGCGCGGATGCCCTGGCGCACGTTGTAGATCTTGCTGTAGCCGGCCTGCTGTTCGAGGAAGTCGCTGACGGCCCGGGTGCGGTTGCCGCTGCGGCAGATGAGGATCACCGGCTGCTCCGGACCTGCCACCGTCTTCAGCTGCTCCAGCCAGGCCGCCGGATTGGCGCGGCCGTTGGCATCGAAGAAGGTGAGCAGGCGGCTGCCGGGAATGACGCCGGTTTCCCGCCATTCGGGCTCGGTACGGATATCCACCAGGACCACGCCACTGGCCACCAGCCGGGCCACTTCGGCGCTGTCCACATTCACCACCTCGGCCCTGGCCGCGAACGCCGCCAGCAGAGCCAGGAGGAAGAGGAAGGTGTGCTTCACGCCGCTACCTCCGGCCAGGCGGCCAGGAATTCCTGCCAGTGGGGCTTGTCCAGCTTGGCCAGTTCGGCCTTGATGAAGGCCAGTTCGGCCAGATGCTCCTCCCGGCTGACCTCGCCCCGCATCAGGCGGAAGCGGGTGGACACCAGGTAGGTATTGACCACATCGGTCTCGCAGTAATCGCGGATCTCGTCGGCCTTGCCCTCCTGCCAGGCCTGCCACACCTTGCCGCCGTCCATGCCCAGCTTGCCGGGGAAGCCCATGAGCTTGGCCAGGTCGTCCAGGGGAGCGCTGGCCCGGGGCTGATACATGGCCAGCAGGTCCATCAGGTCCAGGTGGCGGGTGTGGTAGCGGCTGATGTAGTTGTTCCACTTGAAGTCCCGGGAATCCGCATAGTCGCCGTCGCCCAGGTCCCAGTAGCGGGGAGCCACGACGCCGTGGATCAGGCCCCGGTAGTGCAGCACCGGCAGGTCGAAGCCGCCGCCGTTCCAGGAAACAATCTGGGGCGTGAATTTCTCGATGCCGTCGAAGAAGCGCTGGATGATCTCCCCTTCGCCAATCTCGGGAGCCGCCAGGGACCACACCTTGAAGGCATCCCGGGCCCGCAGGGCGCAGGAGATGGTCACCACCCGCTGCAGGTGCAGGGGCAGGAAGTCGCTGCCGTTCTGGGCCCGGCGCTGCTGGAAGGCCAGTTCGGCCACCTCGTCGTCGGAGAGGTCGGCGGGAAGGTCGTGCAGGCGGCGCAGCCCGGGTACATCCGGGATGGTTTCAATATCGAAAACGAGGACGGGAACCATGGGACTCAGGCGGGGAAAACGCCGGTGGACAGATAACGGTCGCCCCGGTCGCAAACGATGGTGACGATGGTGGCGTTCTCCAGCTCCCGGGCCAGGCGCAGGGCCACGGCCAGGGCGCCGCCGGAGGAGATGCCGGCGAACAGGCCCTCTTCCCGGGCCAGGCGCCGGGTCATGTCCTCGGCCTCGGCCTGGGACACGTATTCGAGGCGGTCCACCCGGCTGCGCTCGTAGATTTTCGGCAGATAGGCTTCGGGCCATTTGCGGATGCCGGGAATCTGGCTGCCCTCTTCCGGCTGGCAGCCGACGATCTGGATGCGCGGATTCTTTTTCTTGAGAAACTGGGAAGTACCGATGATGGTGCCGGTGGTGCCCATGCTGGAAACGAAATGGGTCACCTGGCCCTTGGTGTCGCGCCAGATTTCCGGCCCGGTGCCCTCGAAGTGGGCCAGGGGGTT
ATCCGGATTGGCGAACTGGTCGAGGATGATGCCCTTGCCTTCGTCGCGCATTTTCTCGGCCACGTCCCGGGCCAGTTCCATGCCGCCATCCCGGGGCGTCAGGATCAGTTCGGCGCCGTAGGCACGCATGGTCTGGCGCCGCTCCAGGCTCTGGTTTTCCGGCATGACCAGGATCATGCGGTAACCGCGCATGGCGGCGGCCATGGCCAGGGCGATGCCGGTGTTGCCGGAGGTGGCTTCGATCAGGGTATCGCCGGGCCGGATTTCGCCCCGCTGCTCGGCGTGGGAAATCATGGAGAGGGCCGGGCGATCCTTCACCGAACCGGCCGGATTGTTGCCTTCCAGCTTGGCCAGGATGACGTTGCCGCGCTGGGCGACAACATCGCCGGGCAGGCGCTTCAACTGCACCAGGGGCGTGTTGCCGACGAAATCTTCCAGAGTCTTGTACATGGCTCAGCGTCCGTTGAGGAATTGCACGTAGTCGGCGACGCCTTCGGCCACGGTGGCGAACTCGTCACCATAGCCGGCGCTGCGCAGCTTGCTGAGGTCGGCCTGGGTAAAGCTCTGGTACTTGCCTTTGAGGGCTTCGGGGAAGGCCACGTACTCCACCAGCCCCTGCTGCACCATCGCCTCCAGGGACAGAGCCGGCTTGCCCTCGGCGGCCCGGCAGCTGTTCACCGTGGCCACGGCCACGTCGTTGAAGCTCTGGGCCCGGCCGGTGCCCAGGTTGAAGATGCCGGATTTTTCCGGATGGTCGAGGAAGTACAGATTGACCTTGGCCACGTCCTTCACATAGACGAAGTCGCGCTGCTGCTCGCCGTTGGCGTAGCCGTCGCAGCCCTCGAACAGCTTGACCTTGCCTTCGGCCCGATACTGGTTGAAGTGGTGGAAGGCGACCGAGGCCATGCGCCCCTTGTGGCTCTCGCGGGGGCCGTAGACGTTGAAGTAGCGGAAGCCCACCACCTGGGAACGGACTTCAGGCAAGCGCTGACGGACGATCTGGTCGAAGAGGAACTTGGAGTAGCCGTACACGTTGAGAGGCGCCTCGTACTGCCGCTCTTCCTTGAAAACGCTGCTGCCGCCATAGGTGGCGGCGGAAGAGGCGTAGAGCAGCTGCACGTCCTGCTCCAAGCACCAGTCCAGCAGGGCCAGGGAGTAGCGGTAGTTGTTCTCCATCATGTAGCGGCCGTCGGTCTCCATGGTGTCGGAGCAGGCGCCTTCGTGGAAGATGGCCTCCACGTCGCCGTCGAAGTGGCCGCAGAGCAGGCGCTCGAGAAACTCGCCCTTGTCCAGGTAATCGGCGATCTCGCAGTCCACCAGATTCTTGAACTTGTCCGCCTTGGTCAGGTTATCCACGGCGATGATGCGGGTGATGCCCCGCTCGTTGAGGGCCTTGACCAGGTTGGCGCCGACGAAGCCGGCGGCACCGGTGACGATGTAGTACATGTGAATTCCTTAATTGGCGTCGGCCGCCAGGGCGGCCGACAGTTCCTCGCGACTGACGGTAGCGGTACCGAGCTTGCCCACCACCACGCCACCGGCCAGGTTGGCCAGGTGGATGGCGTCGCCCCAGGGGGCGCCCAAGGCCATCATGGCGGCCAGAGTGGCAATGACCGTGTCGCCGGCACCGGAGACGTCGAACACTTCCTGGGCCCGGGCCGGCTGGTGCAGCGCCTCGCCGTCGCGATAGAGGCTCATGCCCTCTTCGCTACGAGTCACCAGCAGGGCATCCAGTTCCAGTTCGCTGCGCAATTGCTGGGCCTTGGCCGCCAGCTGGGCCTCGTCGCTCCAGCGCCCCACCACCTGGCGCAGCTCGGAGCGGTTGGGGGTGATGACGGTGGCGCCGCGGTACTTGGAATAATCCTCGCCCTTGGGATCCACCAGCACCTTTTTCCCGGCCGCCCGGGCCAGGCGGATCATGTCGCCGATGTGGGCCAGGCCGCCCTTGCCGTAGTCGGAAAGGATCACCACATCCACCCCGGCCAGGCGCTGCTCGAACT
CGGCCAGCTTGGCCTGCAGCACCTCGTGGGACGGGGTGGTCTCGAAATCGATGCGCAGCAGCTGCTGCTGGCGGCCGATGACCCGCAGCTTGACCGTGGTGTCGATGGCTCCGTCGGGCAGCAGGGAGGCGGCAATGCCGCCCTCTTCCATCTGCCGCTGCAGGATGCGGCCGGCCTCGTCGTTGCCGACCACGGACAGCAGGCCGACCCGCGCCCCCAGGGAAGCGCAGTTGCGGGCCACGTTGGCAGCACCGCCGGGACGCTCTTCGGAGCGTTCCACCTTGACTACCGGCACCGGTGCTTCGGGGGAAATGCGGGAAACATCGCCGAACCAGTAGCGGTCCAGCATGACATCGCCGACCACAAGAATGCGGGCGGCGGAAAAATCGGGAAGTTGGTGCATGGTGAAGATACTCAGTCGAAAAGATCGGCCTGGGCGAAGGGCGTGCCCTGCATATCCCGGGCCGAAAGACGGGGCTCCAGCCCCTCCAAAGGCCAGGCCACGGCCAAGGCCGGATCGTTCCAAGCGATGCAGCGCTCATGTTCCGGCGCGTAATAGTCGGTGGTTTTATAAAGAAAATCGGCGGTCTCGCTGAGCACCAGGAAACCGTGGGCAAAACCGGGAGGTACCCACATTTGGCGCTGATTATCCGCGGAAAGTACGGCACCAACCCAGCGGCCGAAATAGGGGGATTGGCAACGCAAGTCAACCGCCACGTCGAAAACGGCGCCTTGAGCCACTCGAACCAGCTTCCCTTGAGGCTGGCGAATCTGATAATGCAGGCCCCGCAGCACGCCTCGTGCGGAACGGGAGTGGTTGTCCTGGACGAAATCCACATCGGCCCCAGTCAGCTCGGTAAAGCGACGCCGGTTGTAACTTTCCATAAAAAAGCCGCGAGCATCGCCGAAAACCAGGGGTTCCAGCATGATCACATCGGCAATGGCGCTGGGAATCGCCTTCACGCTGCATGCTCCTTGAGCAGGGCCAGCAGGTATTGGCCGTAGCCGTTCTTGGCCAGGACCCGGGCCTGGGCTTCCAGAGTGGCGTCGTCGATCCAGCGTTGGCGCCAGGCCACCTCTTCGGGACAGGCCACCTTGAGACCCTGGCGCTTCTCGATGGTCTCGATGAACTGCCCCGCCTCCAATAGAGATTCGTGGGTACCGGTATCGAGCCAGGCATAGCCCCGGCCCATGATTTCCACATTGAGCTTGCCCGCTTCCAGGTAATGCCGGTTCACGTCGGTGATTTCCAGTTCGCCCCGGGGCGAGGGCTTGATGCCCTTGGCCACGGCCACGATATCGGTATCGTAGAAGTAAAGGCCGGTGACGGCATAGTGGGACTTGGGCTGCAGCGGTTTTTCCTCGATGGAGAGGGCCCGCTGCTGGGCATCGAACTCGACCACACCGTAGCGCTCCGGATCATTCACCCGGTAAGCGAAGACCGAGGCACCGCTGTCCCGATCGTTGGCCCGCTGCACCAGGGTGGCCAGGTCGTGGCCGTGGAAGATGTTGTCCCCCAGCACCAGGGCCGCCGGGGCACCGTCAAGGAAGGCCTCGCCGATGAGGAAGGCCTGGGCCAGGCCGTCCGGCGAGGGCTGCACCGCGTACTGCAGATTGATGCCCCACTGGGAGCCGTTTCCCAGCAGCTGCTCGAAACGGGGCGTGTCCTGGGGCGTGGAGATGATGAGGATGTCCCGCAGACCGGCCAGCATCAAGGTGGTCAGGGGGTAGTAGATCATCGGCTTGTCGTAGATCGGCAGCAGCTGCTTGGACACCGCCAGGGTGGCCGGATAGAGCCGGGTGCCGGAACCGCCGGCGAGAATGATGCCTTTACGGGGTTTAGTAGCCATTCTGGGCCTTCAGGGCGAGAAGCTGCATCATGCGGGAAAGATAGGGCTGCCAGTCGGGCATGGTCAGACCGAAACGGTCCTCCAGCTTGCGGCAGTCGAGACGGGAATTGAGGGGGCGCGGCGCCGGCAGCGGATATTCGCTGCTGGGAA
TGGGTGCGATGGCCTCGGGCCCCAGCTTGAGGGCGAAGCCGGGCGTCTGTTCGGCGGTGGCGACGATGGCCCGGGCAAAACCGTTCCAGCTCACCGGATTGGCAGCCACCAGGTGATACAGCTCGCAGCCCTGCTGGGCGCGCCCGCCGTCGAGCTGGGCCAGGACCATGCCGGTGACGGTGGCGATCATGGCCGCCGGCGTCGGGCTGCCGACCTGGTCGGCCACCACCTTGAGGCTGTCCCGCTCGCTCGCCAGGCGCAGGATGGACTTGACGAAATTCTTGCCCCGGGCGCCGAAGACCCAGCTGGTGCGGAAGATGAGGCCGCGGCCGCCCACGGCCAGCATGGCTTCCTCCCCTTCCCGCTTGGTCCGGCCATAGACGCCGAGGGGCGCCGTGGCATCGGACTCCACATAGGGCGCCGCCTTGCTGCCGTCGAAGACGTAGTCGGTGGAGTAATGCACCAGCAGGGCATCCAGGGCCTTGGCCTCCTCGGCCAGCAGGCCCACGGCCTCGGCATTGATGCGCCGGGCCAGTTCAGGCTCCATTTCCGCCTGATCCACCGCCGTATAGGCGGCGGCATTGACGATCAGGCGGGGGCGCTGTTCCCGCACCACGGCCCGCAGCCGGTCCAGGTCGGCCAGGTCGCATGTACGCCGGTCCAGGGCCAGCACCGGCCCCAGGGGCGCCAGGTCCCGCTGCAGTTGCCAGCCCAGCTGGCCCTGGCTGCCCAGGAGCAGGATGGGAGCCGACACCTCAGGCTTCCCCATACTGGCGGCCCACCCACTCCCGGTAGGCGCCGGAGGTGACGTTGTGCACCCACTGGGGATTGTCCAGGTACCAGCGTACGGTCTTGCGGATGCCGGTTTCGAAGGTCTCCGCCGGCTTCCAGCCCAGCTCCCGTTCCAGCTTGCTGGCGTCGATGGCATAGCGCCGGTCGTGGCCGGGCCGGTCGGCAACGAAAGTGATCTGGCTGGCGTAGGAGGCGCCGTCGGCCCGGGGCGACAGTTCGTCGAGCATGGTGCACAGGGTATGCACCACCTCCAGGTTGGGCTTTTCGTTCCAGCCGCCCACGTTATAGGTCTCACCCAGGCGGCCGGCTTCCAGGACGCGGCGGATGGCACTGCAATGGTCCTTCACATAGAGCCAGTCGCGGATCTGCTGGCCGTCGCCGTAGATGGGCAGGGGCTTGCCGGCCAGGGCGTTGTGGATGATGAGGGGAATGAGCTTTTCCGGGAAATGGTAGGGCCCGTAATTGTTGGAGCAGTTGGTGGTCAGCACCGGCAGGCCGTAGGTATGGTGGTAGGCCCGCACCAGATGGTCGGAGGCCGCCTTGCTGGCCGAATAGGGACTGTTGGGCTCGTAGCGGTGCTGCTCGGTGAAGGCCGGGGCCTCCTTTTCCAGGGAACCGTAAACCTCGTCCGTGGACACGTGGAGAAAGCGGAAGGCCGCCTTGTCGTCGGCAGGCAGGCCGTTCCAGTAGGCCCGCACGGCCTCCAGCAGGCGGAAGGTGCCGACGATGTTGGTCTGAATGAAGTCCTCCGGCCCGTGGATGGAACGATCCACATGGCTCTCGGCGGCGAAGTTCACCACCGCCCGCACCCGGTTCTGCTGCAGCAGTTCCAGAATCAGGTCGTAGTCGGCGATGTCGCCGCGCACGAAACGGTGCCGCGGATCGCCGGCCAGTCCCTGGAGGTTCTCCAGATTGCCGGCGTAGGTCAGCTTGTCCAGGTTGATGACAGGCTCACCCCCCGCCGCCAGCCAGTCGATGACGAAATTGCTGCCGATGAAGCCTGCACCGCCGGTCACCAGGATCATGGCGGACTCCTTATTGACGACCGATGGCCTGGTAGTCGATGCCGAACTGGCACACCTGCTTGGGTTCGTACAGGTTGCGGCCGTCGAAGATCACCGGCTGCTTCAGCTTGGCCTTGATGGCCTCGAAATCGGGACTGCGGAATTCCTTCCACTCGGTGACGATGAGCAGGGCATCCGCCCCAT
CCAAGGCGGCCATGGGGCTCTCCGCATAGCTCAGACGCGGCTCATCGCCGAAAATGCGCCGGGCTTCATGCATAGCCACCGGATCGTAGGCCACCACGGTGGCGCCAGCGGCGAAGAGATCGGCCAGCAGATAACGGCTGGGGGCCTCGCGCATATCGTCCGTATTGGGCTTGAATGCCAGGCCCCAGACGGCGAACTTGCGGCCGCTGAGGTCGTTGCCGAAACGCTTCACGGTCTTGGCGGTGAGCACGTGCTTCTGGGCATCGTTGGCGTCTTCCACCGCATTGAGGACCTTCATCTCCATGCCGGCGTCGAGGCGGGCGGTGCGCTGCAGGGCCTGCACATCCTTGGGGAAGCAGGAACCGCCGTAGCCGCAGCCTGGATAGAGGAAGTGGTAGCCGATGCGCGGGTCGGAACCGATGCCCTGGCGCACCTGCTCGATGTCGGCACCCAGCTTCTCGGCCAGGTTGGCCAGTTCGTTCATGAAGCTGATGCGGGTGGCCAGCATGGCGTTGGCGGCGTACTTGGTCAGTTCGGCGGAACGCACATCCATGACGATGAGGCGCTCGTGGTTGCGCTGGAAGGGCGCATAGAGGGCGCGCATCAACTCGATGGCGCGCTCGTCCTCGGCGCCGACGACGATGCGGTCCGGCCGCATGAAATCCTCCACGGCGGCGCCTTCCTTGAGGAATTCCGGATTGGAGACGACGCTGTAGGCGATATCCGCTCCCCGGGCCTTGAGCTCGTCGGCGATGGCGGCGCGCACCTTGTCGCCGGTGCCCACGGGCACGGTGGACTTGTCCACCACCACCTTGTAGTCGCCCATGTGGCGGCCGATGTTGCGGGCGGCGGCGAGCACGTACTGCAGATCGGCGGAACCGTCCTCGTCCGGCGGCGTGCCGACGGCAATGAACTGGATGGTGCCGTGGGCCACGGCCTGTTCCACATCGGTGGTGAAACGCAGGCGGCCGGCGGCCACGTTGCGCTTCACCATGTCCAGCAGGCCCGGTTCGAAAATGGGAATGCCGCCTTCGTTGAGAATCCGGATCTTTTCCGGATCCACATCCAGGCACAGCACATCGTTGCCCACCTCGGCCAGGCAGGTACCGCTCACCAGGCCCACATAGCCCGTACCGACAACTGTAACTTTCAAATTATTCTCCCTGAAGGGTCATTGATCCGCCCAAACCGACGCATTCATGGGCTCAGGGGATCAGATCGAATTCTTCCGTGCGCCGGGGCGGATAGGTTTCCCAGCCGCCGCAGGCAGGACAGCGCCAGTAGAAGTGGCGCGCCTTGAAGCCGCAATTATCGCACCGATAGCGGGCCAGCCGGCGGGTGTGGTTGTGCACCAGATTGCGCACCAGCTCCAGGTCCGCCCGCTTTTCCGGCGGCACGCCGAGCAGCTGGGCTTCCAGCAGGCGGTCCAGGCCCAGCAGGGTCGGATTGCGCCGCAGTTCGTCCCGCACCAGGCGATAGGCCGCCTCCGGCCCCTCCGCATCCATCACCAGCTGGAATACGGTTTCCAGCAGATCGAGGGAGGGGTAGCTGGCCAGATAGCCGCGCAGCAGTTGCAGCCCTTCGTCCCGCTGCCCCTGGGCCAGGTAGGCATCCTGGAGCTTGCGGGCGACGATGGCCAGGTAGGCCGGATTCTGGCTCTCGATGCGCTTCCAGGCCTCGATGGCCGCTGCCAGATCACCCGCCTGCTGCAACAAGTCGCCCTGCAGGACGCTGGCGCGCACGCAGTTGCGGTGCAGGGAAAGGGCCGAGTCCAGGTACTGGCGGGCGGAATCGGGCCGGGAATTGATCATCTCCCCGGCTGCCAGCTCGCAGTAATAGTTGGCGATTTCCTTCTGGGTGGCGTAGTCAGGCATTTCCTTGGCGATGGCGATGGCCTTCTGCCAGTCCTTCTCCTGCTGGTAGATTTCCAGCAGGTTGCGCTTGGCCTCCTCGTCCCGGGAGGTGCCGCGCAAACGGGAAAACACCTCTTCGGCCCGGTC
CAGCAGACCGGCCTTGAGGAAGTCCTGGCCAAGCTCGGAAAGGGCCTGCAGCTTCAGCTCCGGAGACAGGTCCACCCGCTCGATGAGGTTCTGGTGCATGCGGATGGCCCGCTCGGTCTCGCCGCGGCGGCGGAACAGGTTGCCCAGGGCGAAGTGCAGTTCCACCGTCTGGGGATCCACCTTCACCACTTCGATGAAGGCTTCGATGGCCTTGTCCGGCTGCTCGTTGAGCAGGAAATTCAAGCCCTGGAAGTAGGACCGGGGCAGGGCCCGGGATTCCCGCACCAGATGCTTGATGTCGATGCGGGCGGCGGCCCAGCCAAGGACGAAAAACAGGGGAAAGAGCAGTAACTGCCAGTACTCGAATTCGATCATTGGGCCACGGCCTCGCCGGCAGGAGGCGGTTGCACGGCAGCGTCGCTGGCGACCGGCTTGAGCACGGCCTGGCGCTCCCGCTCCCGCGCCAGTTCACGGCGGGTGCGCGACAGTTCGCGGCGCAACTGGAACAGGGTGCCGAGCAGGGACAGGGCACCTAAGGCCGTCCCGGCGGCGAAGAAACCGAGCAGGATGATCACCAGGGGCGCCTGCCACTGGGTATCGAAGAAGAAGCGCAGACTCACCGGATCGCTGTTCATTGCCGCAAAACCCAGCAAGAAGAAGAAGATGATGAGCCGGATGATCAAGATCAGGGCGCGCATAGCCTTATCCGCAAGCAATAAAAAAGGCGGCAACCCTAGGGTTGCCGCCCGCAATCTACCACAGGACAGGGGGTAAGGTCAGCGTCAGGCGGAGAGAGAGGCCAACAGCCCCAGGTCCACCCGTTCGCGCAGTTCCTTGCCTGCCTTGAAATGGGGTACGTATTTTTCCGGAACGCTGACTTTATCCCCGGACTTGGGGTTACGCCCCATTCGCGGCGGCCGGTAGTTCAGCGCGAAGCTGCCGAACCCCCGAATCTCGATACGGTCACCGTGGGCCAGAGCCTCGGTCATGGCATCGAGAATTTCCTTGACCGCGAAATCAGCGTCTTTCGCCACCAGTTGCGGAAACCGCATGGCCAGGCGGGCGATCAGCTCGGATTTGGTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021844|1397060:1410067|1409764_1410067_-|WP_014237016.1|DBSCAN-SWA MTKSELIARLAMRFPQLVAKDADFAVKEILDAMTEALAHGDRIEIRGFGSFALNYRPPRMGRNPKSGDKVSVPEKYVPHFKAGKELRERVDLGLLASLSA >NZ_AP021844|1397060:1410067|1404821_1405751_-|WP_152089523.1|DBSCAN-SWA MGKPEVSAPILLLGSQGQLGWQLQRDLAPLGPVLALDRRTCDLADLDRLRAVVREQRPRLIVNAAAYTAVDQAEMEPELARRINAEAVGLLAEEAKALDALLVHYSTDYVFDGSKAAPYVESDATAPLGVYGRTKREGEEAMLAVGGRGLIFRTSWVFGARGKNFVKSILRLASERDSLKVVADQVGSPTPAAMIATVTGMVLAQLDGGRAQQGCELYHLVAANPVSWNGFARAIVATAEQTPGFALKLGPEAIAPIPSSEYPLPAPRPLNSRLDCRKLEDRFGLTMPDWQPYLSRMMQLLALKAQNGY >NZ_AP021844|1397060:1410067|1401435_1402428_-|WP_152089519.1|DBSCAN-SWA MYYIVTGAAGFVGANLVKALNERGITRIIAVDNLTKADKFKNLVDCEIADYLDKGEFLERLLCGHFDGDVEAIFHEGACSDTMETDGRYMMENNYRYSLALLDWCLEQDVQLLYASSAATYGGSSVFKEERQYEAPLNVYGYSKFLFDQIVRQRLPEVRSQVVGFRYFNVYGPRESHKGRMASVAFHHFNQYRAEGKVKLFEGCDGYANGEQQRDFVYVKDVAKVNLYFLDHPEKSGIFNLGTGRAQSFNDVAVATVNSCRAAEGKPALSLEAMVQQGLVEYVAFPEALKGKYQSFTQADLSKLRSAGYGDEFATVAEGVADYVQFLNGR >NZ_AP021844|1397060:1410067|1406809_1408135_-|WP_152089525.1|DBSCAN-SWA MKVTVVGTGYVGLVSGTCLAEVGNDVLCLDVDPEKIRILNEGGIPIFEPGLLDMVKRNVAAGRLRFTTDVEQAVAHGTIQFIAVGTPPDEDGSADLQYVLAAARNIGRHMGDYKVVVDKSTVPVGTGDKVRAAIADELKARGADIAYSVVSNPEFLKEGAAVEDFMRPDRIVVGAEDERAIELMRALYAPFQRNHERLIVMDVRSAELTKYAANAMLATRISFMNELANLAEKLGADIEQVRQGIGSDPRIGYHFLYPGCGYGGSCFPKDVQALQRTARLDAGMEMKVLNAVEDANDAQKHVLTAKTVKRFGNDLSGRKFAVWGLAFKPNTDDMREAPSRYLLADLFAAGATVVAYDPVAMHEARRIFGDEPRLSYAESPMAALDGADALLIVTEWKEFRSPDFEAIKAKLKQPVIFDGRNLYEPKQVCQFGIDYQAIGRQ >NZ_AP021844|1397060:1410067|1409353_1409680_-|WP_152089526.1|DBSCAN-SWA MRALILIIRLIIFFFLLGFAAMNSDPVSLRFFFDTQWQAPLVIILLGFFAAGTALGALSLLGTLFQLRRELSRTRRELARERERQAVLKPVASDAAVQPPPAGEAVAQ >NZ_AP021844|1397060:1410067|1400532_1401432_-|WP_152089518.1|DBSCAN-SWA 
MYKTLEDFVGNTPLVQLKRLPGDVVAQRGNVILAKLEGNNPAGSVKDRPALSMISHAEQRGEIRPGDTLIEATSGNTGIALAMAAAMRGYRMILVMPENQSLERRQTMRAYGAELILTPRDGGMELARDVAEKMRDEGKGIILDQFANPDNPLAHFEGTGPEIWRDTKGQVTHFVSSMGTTGTIIGTSQFLKKKNPRIQIVGCQPEEGSQIPGIRKWPEAYLPKIYERSRVDRLEYVSQAEAEDMTRRLAREEGLFAGISSGGALAVALRLARELENATIVTIVCDRGDRYLSTGVFPA >NZ_AP021844|1397060:1410067|1397060_1398977_-|WP_014237029.1|tRNA|DBSCAN-SWA MPNITLPDGSIRSFDAPVTVAEVAASIGAGLARAALAGKVDGKLVDTSHLMERDTQLAIVTDKDADGLEIIRHSTAHLLAYAVKELFPEAQVTIGPVIDNGFYYDFAYKRPFTPEDLLAIEKKMAELAKRDIPVSREVWARDDAVKFFLEQGEKYKAELIAAIPADQDVSLYREGEFVDLCRGPHVPSTGKLKVFKLMKVAGAYWRGDSKNEMLQRIYGTAWAKKEDQEAYLHMLEEAEKRDHRRIGKHLDLFHMQDEAPGMVFWHPKGWAIWQEIEQYMRQVYRDNGYQEIRCPQILDRSMWEKSGHWEHYKNNMFTTESEKRDYAIKPMNCPGHVQVFNSDLRSYRDLPLRYGEFGSCHRNEASGALHGLMRVRGFVQDDGHIFCTEDQIEAEVTAFNALVKKVYADFGFDQVAVKLALRPESRVGSDDIWDKAENALRAGLKASGLEWDELPGEGAFYGPKIEFHIKDAIGRSWQCGTMQVDFSMPGRLGAEYVGEDNARHVPVMLHRAILGSLERFIGILVENYAGALPLWLAPVQAVVLNISEKQADFSAEVVKTLRQAGLRAEADLRNEKITYKIREHSLNRLPYQLVIGDKEKEAGLVAVRTRGGQDLGQMPVASLIERLLEEVAARKSTA >NZ_AP021844|1397060:1410067|1403941_1404832_-|WP_152089522.1|DBSCAN-SWA MATKPRKGIILAGGSGTRLYPATLAVSKQLLPIYDKPMIYYPLTTLMLAGLRDILIISTPQDTPRFEQLLGNGSQWGINLQYAVQPSPDGLAQAFLIGEAFLDGAPAALVLGDNIFHGHDLATLVQRANDRDSGASVFAYRVNDPERYGVVEFDAQQRALSIEEKPLQPKSHYAVTGLYFYDTDIVAVAKGIKPSPRGELEITDVNRHYLEAGKLNVEIMGRGYAWLDTGTHESLLEAGQFIETIEKRQGLKVACPEEVAWRQRWIDDATLEAQARVLAKNGYGQYLLALLKEHAA >NZ_AP021844|1397060:1410067|1399729_1400527_-|WP_014237027.1|DBSCAN-SWA MVPVLVFDIETIPDVPGLRRLHDLPADLSDDEVAELAFQQRRAQNGSDFLPLHLQRVVTISCALRARDAFKVWSLAAPEIGEGEIIQRFFDGIEKFTPQIVSWNGGGFDLPVLHYRGLIHGVVAPRYWDLGDGDYADSRDFKWNNYISRYHTRHLDLMDLLAMYQPRASAPLDDLAKLMGFPGKLGMDGGKVWQAWQEGKADEIRDYCETDVVNTYLVSTRFRLMRGEVSREEHLAELAFIKAELAKLDKPHWQEFLAAWPEVAA >NZ_AP021844|1397060:1410067|1405737_1406799_-|WP_152089524.1|DBSCAN-SWA 
MILVTGGAGFIGSNFVIDWLAAGGEPVINLDKLTYAGNLENLQGLAGDPRHRFVRGDIADYDLILELLQQNRVRAVVNFAAESHVDRSIHGPEDFIQTNIVGTFRLLEAVRAYWNGLPADDKAAFRFLHVSTDEVYGSLEKEAPAFTEQHRYEPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYHFPEKLIPLIIHNALAGKPLPIYGDGQQIRDWLYVKDHCSAIRRVLEAGRLGETYNVGGWNEKPNLEVVHTLCTMLDELSPRADGASYASQITFVADRPGHDRRYAIDASKLERELGWKPAETFETGIRKTVRWYLDNPQWVHNVTSGAYREWVGRQYGEA >NZ_AP021844|1397060:1410067|1402437_1403385_-|WP_152089520.1|DBSCAN-SWA MHQLPDFSAARILVVGDVMLDRYWFGDVSRISPEAPVPVVKVERSEERPGGAANVARNCASLGARVGLLSVVGNDEAGRILQRQMEEGGIAASLLPDGAIDTTVKLRVIGRQQQLLRIDFETTPSHEVLQAKLAEFEQRLAGVDVVILSDYGKGGLAHIGDMIRLARAAGKKVLVDPKGEDYSKYRGATVITPNRSELRQVVGRWSDEAQLAAKAQQLRSELELDALLVTRSEEGMSLYRDGEALHQPARAQEVFDVSGAGDTVIATLAAMMALGAPWGDAIHLANLAGGVVVGKLGTATVSREELSAALAADAN >NZ_AP021844|1397060:1410067|1403396_1403945_-|WP_152089521.1|DBSCAN-SWA MKAIPSAIADVIMLEPLVFGDARGFFMESYNRRRFTELTGADVDFVQDNHSRSARGVLRGLHYQIRQPQGKLVRVAQGAVFDVAVDLRCQSPYFGRWVGAVLSADNQRQMWVPPGFAHGFLVLSETADFLYKTTDYYAPEHERCIAWNDPALAVAWPLEGLEPRLSARDMQGTPFAQADLFD >NZ_AP021844|1397060:1410067|1399289_1399733_-|WP_152089517.1|DBSCAN-SWA MKHTFLFLLALLAAFAARAEVVNVDSAEVARLVASGVVLVDIRTEPEWRETGVIPGSRLLTFFDANGRANPAAWLEQLKTVAGPEQPVILICRSGNRTRAVSDFLEQQAGYSKIYNVRQGIRAWIQESRPVTAVAPALAKCSPGRLC >NZ_AP021844|1397060:1410067|1408187_1409357_-|WP_172974712.1|DBSCAN-SWA MIEFEYWQLLLFPLFFVLGWAAARIDIKHLVRESRALPRSYFQGLNFLLNEQPDKAIEAFIEVVKVDPQTVELHFALGNLFRRRGETERAIRMHQNLIERVDLSPELKLQALSELGQDFLKAGLLDRAEEVFSRLRGTSRDEEAKRNLLEIYQQEKDWQKAIAIAKEMPDYATQKEIANYYCELAAGEMINSRPDSARQYLDSALSLHRNCVRASVLQGDLLQQAGDLAAAIEAWKRIESQNPAYLAIVARKLQDAYLAQGQRDEGLQLLRGYLASYPSLDLLETVFQLVMDAEGPEAAYRLVRDELRRNPTLLGLDRLLEAQLLGVPPEKRADLELVRNLVHNHTRRLARYRCDNCGFKARHFYWRCPACGGWETYPPRRTEEFDLIP |
14 | Prochlorococcus_phage(20.0%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations contain specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
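The keyword rule described in the caption above can be illustrated with a short sketch. It assumes a case-insensitive substring match against each protein's annotation text; the function names `matched_keywords` and `is_key_protein` are hypothetical, and the matching logic the tool actually uses may differ.

```python
# Keywords taken from the caption; lower-cased so matching is case-insensitive.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string.

    Assumption: plain substring matching, so 'plate' would also match
    'template'. A stricter tool might match whole words only.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_key_protein(annotation):
    """True if the annotation contains at least one phage-related keyword."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    for label in ("tRNA", "Phage tail fiber protein", "DNA polymerase III"):
        print(label, "->", matched_keywords(label))
```

Under this rule, an entry tagged `tRNA` would be reported as a key protein, which is consistent with the `tRNA` value listed under Key_proteins for DBSCAN-SWA_1.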
DBSCAN-SWA_2 |
2569244 : 2576915
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021844|2569244:2576915|DBSCAN-SWA ATCAGGCGGTGATGCGGGCCTGCTTGGCCAGCTTGGCCTTGATGCGGCCCTGTTTGAGCTGGGACAGGTGATCGACGAAAACCTTGCCCTGCAGGTGATCCATCTCATGCTGGATGCAGACGGCCAGCAGGCCGTCCGTTTCCAGGGAACAGGTCTTGCCTTCCAGGTCCAGGTAACGCACGGCGATATGCTCGGCCCGCTCGACCTTGTCGTAAATGCCCGGCACCGAGAGACAGCCTTCTTCACCCACCTGTAGTCCGTCGCGGTGGGTGATTTCCGGATTGATCAGCACCAGGAGTTCGTCCTTGGTTTCCGACACGTCGATGACAATCACCTGCTTGTGTACATCGACCTGGGTGGCGGCCAGACCGATACCCGGCGCCTCGTACATGGTTTCCGCCATGTCCCGGGCCAGGGCACGAATGCCGTCGTCAATTTTTTCGACCGGAACCGCCACTTTTTTCAAGCGGGGATCCGGGAAGCGCAAAATAGGGAGTAAAGCCATAAAAAGCTTGCCCGAATAATCTATTACATGCAGAATTTAAACCAAATCCCTAGATTCGGGACATGGGTGTGGACAACAAAAGGGCCGCCCTTGTTCCGGCAGCGCGAGGACGCCACGATGATTCGACACCTGTTCGGCCGCCCTGGCCTGAACCCCGCCCGCATTATAGCCACCCTTCTGTTGGCTGTCGCCGCCTCCAGCGCCAGTGCCCAGGAATCTCCCCGCCTCGCCGACAACGCGCCGGACCGCCATATCGTGGTGCCGGGCGACACCCTGTGGGGCATCGCCGGCAAGTTCATCCAGGAACCATGGCGCTGGCCCGAAATCTGGCGCCTGAACAAGGACCAGATCAAGAATCCCCACCGCATCTACCCGGGCGACGTCATCGTCATGGTCACCGGCGAGGACGGCAAGCCACAGCTCAAACTGGCCAAGTCGCTCAAGCTGCAGCCGCGCGAGTACAGCGAAGCGGTCAAGAACGAAATTCCCACCATTCCGCAAAGCATCATCGAGCCCTTCCTGTCCCAGCCCCTGGTGGTGGACCCCAGCGCCATGGACAAGGAAGCCCGCATCATCGCCACCCAGGAAGGACGGGTCTATCTTGGTGGCGGTGATCAGGCCTACGTGGTCGGGGTACGGGAGCCTTCCGAATTGTGGCAGGTCTATCGTCCCGGCAAGGCCATGCTCGATCCCGACACCAAGGAAGTGCTGGGCCATGAGGCGTTCTACCTGGGCACGGCCAGGCTGATTCAACCGGGAGAGCCCTCCGTCATGGAAATGGTGGAGGTGAAGCAGGAGGTCGGCAAGTTCGATCGACTGATGCCCGCATCCCGCCCCGAACTGATCACCTATGCCCCACGTCGTCCGGAAGCCAAGGTTGAGGCCCGCATCATCGCGGTGTACGGCGGTGTTGGCACCGGTGGGCGCTACTCGGTGGTATCCCTGTCCAGGGGCAGCCGCGACGGACTCGAGGTCGGCCACGTCCTAGCCTTGCTGCGCAGTGAAAAGGTCTATGAACAGCGCAACGAACAGGGCGAGCGGGAGTTGGTCAAGGTGCCGCCCCAGCGCTATGGCCTGGTCTTTGTCTTCAGGACGTTTGAACGAGTTTCCTACGCTCTGGTCATGGATGCTGCCTTGCCCCTGTCTCTGGCCGATCTGGTACGCAACCCCTGAGCCCCCGCCGTGGCTGACCCGGCCCTCACCGCCTGGCTCCGGCTGACGCTGGTTCCGGGCGTCGGGCCGGAGACCCAGCGCCATCTCTTGGCCGCCTTCGGCCTACCGGAACAGGTTTTCTCCGCCCCCCGCAGTGCCCTCAAGCAGGTCGTCGGCAAGAAGGCCGATCTGCTGCTCGATACGGACAATCAGGAAGCGGTGGACCGGGCCCTGGACTGGGCCGACAAGCCGGGCAACCGCATCCT
GACCCTGGCCGATCCGGACTATCCCCAGCTGCTGCTCGAATCCGCCGATCCGCCCAGCCTGCTCTATGTGAAGGGCCGGGTGGAACTGCTAAACCGGCCTGCCCTGGCCATTGTCGGCAGTCGTAATGCCACGCCCCAGGGCCTCAAGGATGCCGAGGCCCTCGCTGCCGATCTGGCGGCCCAGGGGCTGACCATCGTCAGCGGCCTGGCCCTGGGCATCGACGGCGCCGCCCATCGGGGCGGGCTGAAAGGGGAGGGCGGCAGTGTCGCCATCATCGGCACCGGGGCCGATCGCATCTATCCCTCGCGGCACAAGGAACTGGCGCTGCAGCTGGCGACCGAAGGCGCCATCGTTTCCGAATTTCCCCTGGGTACGCCGGCGGTGGCCCACAATTTTCCGCGCCGCAACCGCATCATCGCCGGCATGGCCAAGGGCTGCCTGGTGGTGGAAGCCGCCCTGGAAAGCGGCTCCCTCATCACCGCCCGTCTGGCGGCCGAACTGGGCCGGGAAGTGTTTGCCATTCCCGGCTCCATCCATTCGCCGGTGGCCAAGGGCTGTCACCGGCTGATCCAACAGGGGGCCAAGCTGGTGCAGGAAGCCCGGGACATTGTGGAAGAAATCGGTCCGTTCGACCCCCCGGGCTGCAGGCCAGCAAGGACGCCACTTTCCACCGGCAATACGCCAACAACCGTGCCAATCCTCGATCCTGGCCAGGCTGCCGTACTCGATGCCCTCGGCCACGACCCGGCCAATCTGGACCAATTGCTGCAGCGCACAGGCTTGACGACGGAAGCCCTATGCGCCATCCTCGTGACGCTGGAACTGGCGGACCACGTTGCCAGTCTTCCCGGAGGCCGCTACCAGCGGCTTTCCCCCACCTGATACGTTGCCAATGTTCGACATCCTCGTCTATCTCTTCGAAAACTACGTCGATTTCGCCGACTTCAGCAAGTCCGGCAATCAACCCGATTCACCCGATTCCCAGGCCGACACGGCCCTCAGCCGCAAACTCACTGCGGCCGGCTTCTCCGAAGAAGAAATCAGCGAAGCCCTGGAATGGCTCCAGGGCCTCAAGGCCACCCTGCCGACCCGCCAGCTGCAGGCCGATTCCCGTTCCCTGCGGGCCTACACGCCGGACGAAAGCGCCCACCTGGGTGCCGACGCCCTGGGCTTCCTGCATTTCCTGGAACAGGCCAAGGTCCTTTCCGCCGACCTGCGGGAACTGGTCATCGAGCGGGCCATGGCCCTGCCGGACGACCGGTTGTCCCTGGGGCGCTTCAAGGTCATCGTGCTGATGGTCCTGTGGAGCCAGGAGCAAAACCTGGATACCCTCATCGTCGAAGAACTGCTCTCCGAAGCGGAACCCGAACACCTGCATTAAGTCCATCCGTCGGGCCTGCCAGCCGGGGCGACTGAGGGGTCGCACCAGATGGCGCAGGCAGCATGATGGGCGAAGCACGCAATATGCAGGCTTCGCGCGTCAAATTAGTGGCTGCTTGCTTGCCAAGGCCCGTAAGTCCCTCTTATCATCCCCCGCACCTCCCGACCGGGACTGCCAGAGGCCCCTGCCATGGGCAAACAGCTCATCATTGCCGAAAAACCTTCCGTCGCTGCCGACATCGCCAAGGCCCTTGGCGGTTTCACCAAGCATGACGACTATTTCGAGAGCGACAACTTCGTTCTCTCCTCTGCTATCGGCCATCTGCTGGAACTGGTGATCCCCGAGGAATACGAGGTCAAGCGCGGCAAGTGGTCCTTTGCCCACCTGCCCGTGATCCCGCCCCACTTCGAACTGAAGCCGGTGGAGAAAACCGAGTCCCGCCTCAAGCTGCTGACCAAGCTGATCAAGCGCAAGGATGTGGACGGCCTGGTGAACGCCTGTGACGCGGGCCGCGAGGGTGAGCTGATCTTCAATTACATCGCCCGCCACGCCAAGTCCGGCAAGGCCGTGCAGCGGCTGTGGCTGCAGTCCATGACGCCCCAGGCCATCCGCGACGGC
TTCGCCCGTCTGCGCCGCGGCGAGGAAATGCAGGGGCTGGGCGATGCCGCCGTGTGCCGTTCCGAATCCGACTGGCTGGTCGGCATCAACGGCACCCGGGCCATGACCGCCTTCAACTCCAAGACCGGCGGCTTCCACCTCACCACCGTGGGCCGGGTGCAGACCCCAACCCTATCCCTGGTGGTGGAGCGGGAACGCAAGATCCGCGAATTCAAGGCCCGTCCCTACTGGGAAGTGGAGGCCACCTTCGCCGCCGCTGCCGGCGAATACAAGGGCAAGTGGTTCGACGAAGCCTTCAAGGGCAAGGACGAGGACGAACACGCCCGGGCCGACCGCCTGTGGGACGAAGCCCGGGCCAAGGCTCTGCAGGCCAAGTGCGAAGGCCAGCCCGGCGAAGTGAGCGAGGAGGCCAAGCCCTCCACCCAACTCTCGCCCCTGCTCTTCGACCTCACCAGCCTGCAGCGGGAGGCTAACAGCCGCTTCGGCTTCTCCGCCAAGAACACCCTGGGCCTGGCCCAGGCCCTGTACGAAAAGCACAAGGTCCTGACCTATCCCCGGACCGACTCCCGGGCCCTGCCCGAGGATTACCTGGGCACGGTGCAGGCCACCCTGCAGATGTTCAACGGCGAAAACCTGACCAAGGGTTCCGACACCTCCGTAGTGGACCGCTACGGCATCTTCGCCAACAAGATTCTCAAGTCCAAGTGGGTGGTGCCGAACAAGCGCATTTTCAACAATGCCAAGATTTCCGACCACTTCGCCATCATCCCCACCACCCAGGCGCCGAAGAATCTCTCCGAGCCGGAACAGAAGCTCTATGACCTGGTGGTCAAGCGCTTCCTCGCCGTCTTCTTCCCCGCCGCCGAATACCTGATCACCACCCGCATCACCCGGGTCGCCGGCGAACCCTTCAAAACCGAAGGCAAGGTCCTGGTGAATCCGGGCTGGCTAGCCATCTACGGCCGCGAAGGCCAGGAGGGCGACGAGGGCAACCTGGTGGCCGTCTCCCAGGGTGAGAAGGTGCAGACCGAAGAAGTGGCGGTCAATCAGAACGACACCCGGCCCCCGGCCCGCTACTCGGAAGCCACCCTGCTCTCCGCCATGGAAGGCGCCGGCAAGATGGTGGACGACGAGGAACTGCGTGCCGCCATGGCCGGCCGCGGCCTCGGCACCCCGGCCACCCGGGCCCAGATCATCGAAGGCCTGATCACCGAACAGTATCTGCACCGCGAAGGCCGGGAACTGATCCCCACCGCCAAGGCCTTCTCCCTCATGACCCTGCTCAACGGCCTGGGCATTTCCGAACTGACCTCGCCGGAACTGACCGGCGAATGGGAATGGAAGCTGGCCCAGATCGAGCGCGGCGATCTTTCCCGCAGCGCCTTCATGCAGGAAATCGAGGAAATGACCCGGCACATCGTGGACCGGGCCAAGAGCTACGACAGCGACACGGTGCCCGGCGACTTCGGCCTGCTCAAGTCGCCCTGTCCCAAGTGCGGCGGTCTCATGCGCGAGACCTACAAGAAATTCCAGTGCGGCGATTGCGACTACGGCCTGTGGAAGATCGTCGCCGGCCGCCAGTTCGAGCCGGAGGAAATCGAGACCCTGCTCACCGAACGCCAGGTCGGCCCCCTGATGGGCTTCCGCAACAAGATGGGGCGGCCCTTCAATGCCCTGATCAAGCTCAACGACAAGAACGAACCGGAATTCGACTTCGGCCAGGACCGCTCCGGCGAGGACGGCGGCGAACCGGTGGACTTCTCCGGCCAGGAAAGCCTGGGGCCCTGTCCCAAGTGCGGCAGCCCTGTCTATGAGCACGGTCTGGCCTACGTCTGCGAAAAGTCCGTGGGTCCGGCCAAGAGCTGCGACTTCCGTTCCGGCAAGATCATCCTGCAACAGGCGGTGGAACGGGAACAGATGCAAAAACTGCTGTCCACCGGCCGCACCGATCTGCTCAAGGACTTCATCTCCGCCCGCACCCGGCGCAAGTTCTCCGCCTT
CCTGGTGAAGGGCAAGGACGGCAAGGTCAGCTTCGAGTTCGAGAAACGGGAGCCCAAGGCCCCAGCGGCGAAAAAAACTGCCGCCAAGGCCGAGCCCAAGGCAGCGGCGGAAAAGCCCGCCAAAGCCCCGGCAAAGCGCAAGGCGAAGGAAGCCTGAGGTTAAGCAAGGCAAAAGAAAAGGGCTCGGTGTGAACCGAGCCCTTTTTGCATCCGGACCGCCGCCAGGCGGCGAAGCGGTGAATTTACTTATTCTTCGAGCAGGACGCGCAGCATCCAGGCGTTCTTTTCATGCACTTCCATGCGCTGGGTCAGCAGGTCGGCCGTCGGCTGGTCGTTGGCCTTGTCCACCACCGCGAAGACCTGGCGGGCGGTGCGGGCCACGGCTTCCTGGCCGGCCACCAGCTGGCGGATCATGTCCTTGGCCTTGGGCACGCCGTCTTCCTCGGTGATGGAAGCCAGTTCCACGAAGCGCTTGTAGGAGCCGGGAGCCGGGTAGCCGAGGGCGCGGATGCGCTCGGCGATCAGGTCCAGGGAGTTCCACAGTTCGGTGTACTGGGTCATGAACATGGTGTGCAGGGTCTGGAACATGGGGCCGGTGACGTTCCAGTGGAAATTGTGGGTCTTCAGATAGAGGATGTAGCTGTCTGCCAGCAGGTGGGAAAGGCCGTCGGCGATTTTCTTGCGGTCTTTCTCGGTGATGCCGATATCGATCTTGGTTGCCATGTGGATTCTCCTTTCAAGCTTCTCAATAACCATGCGCCCATTGTGGCCGATGGCGGCCGCCCGGGCCAATGACTTATCCCTATGGATTGAATAGGGACGGGCAATACAGACGGACTACAGTACCTTAACCCCGGGTGGCGGACATTGCAAAATGGCGGCCCGCACCGCATCGATGGCCTGGGGCCGGGGGAAGGTGACGCGCCAGGCCAGCACGATGCGGCGGGAAGGCTCCGGCGCCTTGAAGGGAATGACCCGGGCCAGGGACGGATCCGGCGGCGCCACCTCGACGGCGGAAGCGGGCAGCACGGCAACGCCAGTGCCGCTGGCCACCATCAGGCGGATGGTCTCCAGGGAGCCGCCCTCGTAGGAACGTTCCAGCCCGCCCGGCTCGGTCAGACGCGGACAGGCGGCCACCACCTGGTCGCGGAAACAGTTGCCCTGGCCCAGCACCAGCAGTTCCTCGCCCTTCAGCTCCCCCGCATCCACCGATTTCCGCTCGGCCCAGGGATGGGCCGCCGGCACCAGCATGCGGAAGGGTTCGTCGTACACCGGCTGGGTGACGATGCCCGGCTCGTCGAACGGCTGGGCCACCACGATCACGTCCAGCTCGCCCCGCTTCAGGGATTCGGCCAGCACGTGGGTGAAATTCTCCTGCAGGTAAAGGTGCATGTGGGACACGGCCTGGTGCAGAGCCGGCACCAGCCGGGGCAGCAGATAGGGGCCGATGGTGTAGATCACCCCCAGGCGCAGGGGCCCGCTGAAGGGATCCTTGCCCCGCTGGGCGATTTCCTCGACCCGCTGGGCCTCGTTCAAGACCTTCTCGGACTGCCGCGCCACTTCTTCGCCGATGGGCGTCAACCGCACTTCGGCGGCGCTGCGTTCGAACAGCAGCACCCCCAGGCGCTCTTCCACCTTTTTCAGGGCCACCGACAGGGTGGGCTGGCTCACGTGGCATTTCTCGGCGGCGCGGCCGAAGTGGCGCTCCCGGGCCAAGGACACGATGTAGCGCATTTCCGTCAGGGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021844|2569244:2576915|2576003_2576915_-|WP_152090175.1|DBSCAN-SWA MTLTEMRYIVSLARERHFGRAAEKCHVSQPTLSVALKKVEERLGVLLFERSAAEVRLTPIGEEVARQSEKVLNEAQRVEEIAQRGKDPFSGPLRLGVIYTIGPYLLPRLVPALHQAVSHMHLYLQENFTHVLAESLKRGELDVIVVAQPFDEPGIVTQPVYDEPFRMLVPAAHPWAERKSVDAGELKGEELLVLGQGNCFRDQVVAACPRLTEPGGLERSYEGGSLETIRLMVASGTGVAVLPASAVEVAPPDPSLARVIPFKAPEPSRRIVLAWRVTFPRPQAIDAVRAAILQCPPPGVKVL >NZ_AP021844|2569244:2576915|2569244_2569748_-|WP_152090171.1|DBSCAN-SWA MALLPILRFPDPRLKKVAVPVEKIDDGIRALARDMAETMYEAPGIGLAATQVDVHKQVIVIDVSETKDELLVLINPEITHRDGLQVGEEGCLSVPGIYDKVERAEHIAVRYLDLEGKTCSLETDGLLAVCIQHEMDHLQGKVFVDHLSQLKQGRIKAKLAKQARITA >NZ_AP021844|2569244:2576915|2569865_2570921_+|WP_014235926.1|DBSCAN-SWA MIRHLFGRPGLNPARIIATLLLAVAASSASAQESPRLADNAPDRHIVVPGDTLWGIAGKFIQEPWRWPEIWRLNKDQIKNPHRIYPGDVIVMVTGEDGKPQLKLAKSLKLQPREYSEAVKNEIPTIPQSIIEPFLSQPLVVDPSAMDKEARIIATQEGRVYLGGGDQAYVVGVREPSELWQVYRPGKAMLDPDTKEVLGHEAFYLGTARLIQPGEPSVMEMVEVKQEVGKFDRLMPASRPELITYAPRRPEAKVEARIIAVYGGVGTGGRYSVVSLSRGSRDGLEVGHVLALLRSEKVYEQRNEQGERELVKVPPQRYGLVFVFRTFERVSYALVMDAALPLSLADLVRNP >NZ_AP021844|2569244:2576915|2572059_2572548_+|WP_152090173.1|DBSCAN-SWA MFDILVYLFENYVDFADFSKSGNQPDSPDSQADTALSRKLTAAGFSEEEISEALEWLQGLKATLPTRQLQADSRSLRAYTPDESAHLGADALGFLHFLEQAKVLSADLRELVIERAMALPDDRLSLGRFKVIVLMVLWSQEQNLDTLIVEELLSEAEPEHLH >NZ_AP021844|2569244:2576915|2575412_2575889_-|WP_014235930.1|DBSCAN-SWA MATKIDIGITEKDRKKIADGLSHLLADSYILYLKTHNFHWNVTGPMFQTLHTMFMTQYTELWNSLDLIAERIRALGYPAPGSYKRFVELASITEEDGVPKAKDMIRQLVAGQEAVARTARQVFAVVDKANDQPTADLLTQRMEVHEKNAWMLRVLLEE >NZ_AP021844|2569244:2576915|2570930_2572049_+|WP_152090172.1|DBSCAN-SWA MADPALTAWLRLTLVPGVGPETQRHLLAAFGLPEQVFSAPRSALKQVVGKKADLLLDTDNQEAVDRALDWADKPGNRILTLADPDYPQLLLESADPPSLLYVKGRVELLNRPALAIVGSRNATPQGLKDAEALAADLAAQGLTIVSGLALGIDGAAHRGGLKGEGGSVAIIGTGADRIYPSRHKELALQLATEGAIVSEFPLGTPAVAHNFPRRNRIIAGMAKGCLVVEAALESGSLITARLAAELGREVFAIPGSIHSPVAKGCHRLIQQGAKLVQEARDIVEEIGPFDPPGCRPARTPLSTGNTPTTVPILDPGQAAVLDALGHDPANLDQLLQRTGLTTEALCAILVTLELADHVASLPGGRYQRLSPT 
>NZ_AP021844|2569244:2576915|2572737_2575323_+|WP_152090174.1|DBSCAN-SWA MGKQLIIAEKPSVAADIAKALGGFTKHDDYFESDNFVLSSAIGHLLELVIPEEYEVKRGKWSFAHLPVIPPHFELKPVEKTESRLKLLTKLIKRKDVDGLVNACDAGREGELIFNYIARHAKSGKAVQRLWLQSMTPQAIRDGFARLRRGEEMQGLGDAAVCRSESDWLVGINGTRAMTAFNSKTGGFHLTTVGRVQTPTLSLVVERERKIREFKARPYWEVEATFAAAAGEYKGKWFDEAFKGKDEDEHARADRLWDEARAKALQAKCEGQPGEVSEEAKPSTQLSPLLFDLTSLQREANSRFGFSAKNTLGLAQALYEKHKVLTYPRTDSRALPEDYLGTVQATLQMFNGENLTKGSDTSVVDRYGIFANKILKSKWVVPNKRIFNNAKISDHFAIIPTTQAPKNLSEPEQKLYDLVVKRFLAVFFPAAEYLITTRITRVAGEPFKTEGKVLVNPGWLAIYGREGQEGDEGNLVAVSQGEKVQTEEVAVNQNDTRPPARYSEATLLSAMEGAGKMVDDEELRAAMAGRGLGTPATRAQIIEGLITEQYLHREGRELIPTAKAFSLMTLLNGLGISELTSPELTGEWEWKLAQIERGDLSRSAFMQEIEEMTRHIVDRAKSYDSDTVPGDFGLLKSPCPKCGGLMRETYKKFQCGDCDYGLWKIVAGRQFEPEEIETLLTERQVGPLMGFRNKMGRPFNALIKLNDKNEPEFDFGQDRSGEDGGEPVDFSGQESLGPCPKCGSPVYEHGLAYVCEKSVGPAKSCDFRSGKIILQQAVEREQMQKLLSTGRTDLLKDFISARTRRKFSAFLVKGKDGKVSFEFEKREPKAPAAKKTAAKAEPKAAAEKPAKAPAKRKAKEA |
7 | Synechococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
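The keyword-based coloring described in the legend above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: the keyword list is the one quoted in the legend, while the function names, the case-insensitive substring matching, and the example annotations are assumptions made here for illustration.

```python
from typing import List

# Keywords quoted in the legend of the homologous phage analysis.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> List[str]:
    """Return the legend keywords found in a protein annotation.

    Assumption: matching is a case-insensitive substring test; the
    real pipeline may match differently (e.g. on whole words).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_phage_related(annotation: str) -> bool:
    """True if the annotation would be colored under this sketch."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Hypothetical annotations, not taken from the report.
    for text in ("Phage tail fiber protein", "DNA topoisomerase III"):
        print(text, "->", matched_keywords(text))
```

Substring matching is deliberately loose (for example, 'lysin' also matches 'endolysin'), so a stricter word-boundary match may be preferable when false positives matter.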
DBSCAN-SWA_3 |
2967016 : 3023959
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021844|2967016:3023959|DBSCAN-SWA CTCATTGAGCAGTCGTTTCATAGGTTAACCGCTTGCCCTTCACACCCTGAAGCAGTTTTTCAGCCCGCTCCTGATCTTCGATACCGAGCGCCTTGCGATTGTTGTAGCGGAAATCGAATTCTGCAAGGTAGCGGTTCAGATGGTGGTGGCCACAGTGCTGATAGACGCCCTTCATGCCGCGCTTGAAGATCGAGAAGAAGCCCTCGATGGTGTTGGTGTGAATGGTGGGATCAATCTTGGATACGTATTCACCCATGCCATGCCGGGTAAAAGCGTGACCAGCGAAGTGCTGACCTACATTTTTGTACTGGCCGGCTTCGTCGGTCATGATCCGTGCTTCCCGGGCGATGTTTGCTTCCAGGATTGGGATCAGGGTCTTGGCTTTGAGGTCGTCCACCACCATGCTGCGGGCCTGGCCGCTAGTGCGGTCAACCAAGGACAGTACCTTGTTCTTGTGGTCGTAGCCGCGCCCCTTCTTCTCGCCCTTTGGCTTCTTGGTGTAGTCACGCCCAATGAAGGTTTCATCTACTTCCACTGGACCACCTTCGGAACCGAACGGGGAGAAGTCGCCGGAGCGCATGGCTTCCCGAATCCGGTGAGACATGAACCAGGCTGACTTGAGGGTGATGCCCAGGGTGCGGTGAAGCTGATTGGCACTGATTCCCTTCTTGCTAGAAGAGATCAGGAAGATGGCTTGGAGCCACAGCCGCATTGGGATGTGGCTAGACTCGAAGATGGTGCCAACTTTCACGGTGAAGGGCTTCCGGCAGTTGTAGCACTTGTAAGCGCCTATGCGAGTGCTCTTGCCCCCCATCTTGCTTATACGTTCCACACAGCCACAGTGAGGGCAGACCGGGCCACTGGGCCACAGACGGGATTCTACGAATTCGTAAGCAGCTTCTTCGTTGTGGAAGTACGGAGCAGATAGAATGGACGCACCCATGACAGCCTCCTTGAATATGCATTCAGTATAGGAGAGGTAAGTGGGTACGTCAAGTATAATATCGCCCTTTTTTTATGGCCGGACGTCGGCCCGGGCCTGCAGCACGGGATGGGGCGGGGCGGAGGGCGAGGCCAGGGTGCGCAGGGCGCCGTAGGCCAGGAAGGCGCCGAGGAGGGCGCACAGGGTGAAGAACAGCGCCAGCTCGCGGCGCAACAGGCGTTCGATCCTGGGCAGGGTGTCCCTGCCGGGCGGTACTAGCGGCATCTGGTTCATTTTTTTCTCCCTGGACTGGCTGCCCGCCTGCGCCGTCTGGCGGCGGGCCGGCAAGGCCAGTAAATACCCTGTTGCTTACATGGATATTACGGCGAGGGAGGGAGGCCGGTGGCGGCAGACGCGGTGCCGCGTCAGACCTGGGCCAGGATTTCCAGGATGTCCGTGGCTTCGGGCAGGGGTTGGGCCACCAGGAGAAAGTCGGCCATGGTGTCCAGTTCGGCCAGGGCCGCTTCCAGGGCCTCGCGGACTTCCGGGTGGTCGAAACCGTGGCCGGCCTGGGCCCGGACGATGGCCGCCGCCAGACCCTGGGCCAGGGCTTCGGCACTGTCCAGTTCCAGGTGGGCGGCGTGGTCCGCCAGGCAGTGGGCCGATTCGGCCAGGGCTTCCCCATCCGGTTCCAGGCAGGCCAGTTCCTCCCGCATCAGGGCCGCCTGCTCGCCGATTTCGCCGGCAAAGAGGCGCAGCAGGGTGGGCGAGACGGCGGGACGCTGGCTTTCCCGGCATTCGGCCAGGCGCTGGGCGACGTGGGCCACCCGTTCGGCGAAGACCGGCAGGCCCAGGTCCCGGGTGTCGCCGAGCAGTTCCAGGGCGGCGGCGATGGCGGCGCGCAGGTAGGGGTCCTCGGCCGCCTCCGGGGCGTCGAGAAGGTCGCCGACGCCGGCCAGGGATTCGGCCAGGTGCAGGGTCTCCGGCAGGTTGAGGCTCAT
GGCGGCGGCGCAGAGGTCGATCAGGGCCCGCCGGAAGGGAGCCAGATCGCCGCTGCCGCGCCCGTTCCAGGCGGCCACGGCGGCGTTGCCGGCGGCGCTCCAGGCGGCTACCGCCTCAGCGTCCAGGTCCGGGGCCTCGGGCTTGGCCATCAGGGAAATCAGTTCGGTCACCGGCGGCAGCTGGCCGGCACCCTGGCGGATGCACTGCTGCAGTTCCTCGGCCAGGCCGCCCTCGGGGGCTTCGCTGCCATTCAGGGCGGCCCGCATCTGCAGGTCGATGCGGGCGCAGAGGCGGCGGGATTCCGCATCCACTGCCAGGGTGCCGTCGGAGATGGCGCGCAGGAACACTTCGGCACTGTGCCAGAAGGGGGCGAAGTCGCCGCCGGCGGCGGCTTCCAGGTGGCGCACCGCGGCCCGCATCTCCGGCAGTCCGGCCGGGTCCCCCGGCTGTTGCAGCCAGGCCAGCATGCCCCGCTGGTAACGGCTGCGGGCCTGACGCTGGGAGGGGGAGGGCGGGGCAACGCTCATGGCAGGGGCGGAAGGATCGGTGGATGGGGCATTTTACTCCTCTGCCCGGGGACGTCTGTGGTTAGAATGCGGGCATAAATACAAGCGGAGACATCCGTGTTCCACTCCCCCTCGGCTCCCAGAGACCGCCCGGCCTTGCTGTGGCAGGCCCTCCTGTTTTTTCTGCTCCTGCTGCCTGCCGCCGTCCGGGCCCATCCCCTGGTGCTGGATCAGGACGACGGCAGCTTTGCTCTGGTGCCCCACGTGGAAGTGCTGGAAGACCCGGGCGGCAAGCTCGACCTGGCCGCCGTGCGGCAGGCCGCCGCCGCAGGCCGCTTCGCGCCGGCCCATGCCCTGGGCGAATTGAACTTCGGCTATTCCTCCTCTGCCTTCTGGCTGCGCATTCCCCTGGAGTCCCGCCTGCAGCGTTCCAGCCCGTGGCTGCTGGAGATCGCCTTTCCCTCCCTCGACCGGGTGGAGCTATTCCTGCCCCGGGCCGACGGCCGCGTCGACTACCAATTAACCGGGGACCGCCTGCCCTTTGCCGAACGCCCCTATCCCAACCGCAATCTGGTGCTGCCCCTGGAGCTGGCGCCCGGGGAATCCCTCGCCCTCTATCTGCGGGTGGAGTCGGAGGGCAGCCTGACTTTGCCCCTGACCCTGTGGACGCCGGATGCCTTCCGCCTGCACAACCAGGACGCCTACGCCGGCTTCTCCCTCTACTACGGCATGCTGCTGGCCCTGGGCCTGTACAACCTGCTGCTCTTCTTCGCCCTGCGGGAGCGCATCTATCTGGTCTATGTGGCCTTCGCCGTGAGCATGGCGGTGGGCCAGCTGTCCCTCAACGGCCTGGGCAACGAATACATCTGGCCGGCCTTCCCCGCCTGGGGCAATGTGGCCCTGCCCTCGGGCTTTGCCGCCACCGGCTTCTTCGGCGCCATTTTTACCCGCCTCTTCCTCAATACCCGGCACAGCAATCCCCGGGCCGACAAGCTGATCCTGGCCCTGGCCGCCGGCTTTGCCGTGGCCGCCCTGGGGCCGGCCCTGCTGCCTTACCGCTGGGCCGCCATCCTCACCTCCCTGCTGGGTGCCGCCTTTTCCGCCGTGGCCGTGGCGGTGGGCGTCCATGCCCAGCTGCGGCGCCACCCGGGGGCCCGCTACTTCCTCCTGGCCTGGTCCCTGCTGCTGGTGGGGGTGGGCATGATGGCCCTGCGCAACCTGGGCTGGCTGCCCACCACCCTGTTCACTTCCTACGGCATGCAGATCGGTTCGGCCCTGGAGATGCTGCTGCTCTCCTTTGCCCTGGCCGACCGCATCCAGGCCGAGCGCCTGGCCCGGGAACTGGCCCAGGGCGAGGCTCTCCACAGCAAGCAGGACCTGGTCAACGCCCTGCGCAGCAATGAGCAACTGCTGGAGGCCCGGGTGGCGGAACGGACCCGGGACCTGGCCGCCGCCAACGATCGCCTGCTGGCCAACGAGCAGCAGTTGCAGCG
CATGGCGCGGCACGATCCCCTGACCGGGCTGGCCAACCGTCTGCTGCTCGACGACCGTATCAGCCACGGTCTGGCGGTGGGGCGGCGCAACGGCACCCGTCTGGCCCTGCTGCTGATCGACCTGGACGGCTTCAAGCCGATCAACGATAAGCATGGTCATGCCGTGGGCGACCAGTTGCTGGTGGTGCTGGCGGACCGCCTGCAACGCTCGGTGCGGGCGGTGGATACGGTGGCGCGCCTGGGCGGTGACGAGTTCGTGCTGGTGCTGGAGGATCTGGCGGCGGTGGAGGACGGGCGCCAGGTAGCGGCCAAGGTGGTGGCGGAAATGAGCCGGCCGGTGGTGCTGGAGGGGCGGGAACTGCTGGTCTCCGCCAGCGCCGGCCTGGCCTTCTATCCGGAGGACGGGGAGGACGCCCAGACCCTGCTCAGGCGGGCCGACGAGGCTATGTACGAAGCCAAGCGGGCCGGCCGCAACACCTTCCGTCAGGTGGGCCAGTAGGGCGTTGCGGGCAGCGTGAACGTGCAAAAGGCCGCCCGTGGGCGGCCTTTTCATTGCGAAAGCCTGTGGGCGGCGCTGCTCAGCCGGCCAGCTTGGCGAAGGCGTCCACCACTTCGGCGGGAGCCTGCACCAGCTCGATCAGCACGCCTTCGCCGCCGATGGGGGATTCCTCGTTGCCCTTGGGGTGCAGGAAGCAGATGTCGAAACCGGCCGCGCCCTTGCGGATGCCGCCGGGGGCGAAGCGCACGCCGTTGGCCGTCAGCCATTCCACGGCCTTGGGCAGGTCGTCGATCCACAGGCCCACGTGGTTGAGGGGGGTGGTGTGCACCGCCGGCTTCTTTTCCGGGTCCAGGGGCTGCATCAGGTCCACTTCGACCTTGAAGGGGCCCTTGCCCATGGCGCAGATGTCCTCGTCCACGTTTTCCCGCTCGGAGACGAAATTGCCGGTGACTTCCAGGCCCAGCATGTCGACCCACAGGGTCTTCAGCTTGTCCTTGGAAGGGCCGCCGATGGCGATCTGCTGGATGCCGAGAACTTTGAAGGGACGCTGGGACATGGAGAGGCTCCTGAAATCGGTGGAATTAAAAATACATAATTATACAGAGCGGACCGCCGCCCAGGGAGAGGGGCGAGCGGTACTCGGGGCCGGAATCCGGGGTGGAGCCCGGGCGGGCTCCCGGGACCATGTTCCAGGGCGGACACCGGCCCCGGGCGGACGGAGGAGGATGCCGCCTCACGCCACCTGTCCGGCACGATGGGGCGCGAAAAAAAAAGCGGAGCCCCGCCGCCGCAGGTGCTCCGCTTCCCTCTGTAACCGGGCGGACCCGGAAGTGATGGCAATCGCCGTTGCTGCCGGCAGGCGTAACGGTCTGCTTATTTCTTCTTGGCCTTGGCGTCCTTGCTGGCGGCGATGTTGCCGACGAAGGTGTTTTCCACCAGGGGCGGAGCATCCACCTGGGGCACGTGGCACTGCACGCAGTTGTGGCGCAGGTGGGTGACTTCGGCGTGCTGCTTGCCTTCCCGGTCGATGAAGTGGCTCTCGCCGATCTTCGGGGCCTTCTTCTCCTTGTACTTCTCCGGACCGTGGCAGGTCAGGCACTGGTTCTCTTCCAGGGTGATCTCGTCGAAGTTGTCCACCGCATGGGGGATCACCGGCGGCTGCTCCTTGTAGGTGCGGGCGATGGGCTGTTGCAGGCCCGGCTTCTTGCCGGCGTAGGCCTTCACCTCGGGGGCCGGGTCGCCGGCGGGAATGTCGGCGCCGCGCATGGTCTTGGGCGCATCGGCGGCCTGGGCCAGGCAGGCGAAGGAGGCGGCCAGGATGGCCAGGGTCAGTTTGTGCAGTCGGTTCATGGTCGGCTCCTCAATGGATGGTCTTGCGATGGTCTGTCTGGCCTTCGGCCTCCCCGGCCGGGGCGCAGCGCTGGGTGTGCTGGTTGAAGCGGCTGCCGAAGACGAAGACGTCCTTGGCGCAAACGTCGATGCAGCGGCCGCAGTTGGTGCA
GGCAGAGGCCAGGATGACCGGGCCGGTGCCGTTGGCCTCGCCCTTGAGGGCGGGGCGGATGACCTGGGGTTCGGGGCAGGCGGCGAAGCAGTCCATGCAGTCGTCGCAGTCCTGGCGCCGCCGGGCGCTGACCCGCAGCAGGCTGGTGCGGCCGAGCAGGCTGTAGAAGGCGCCCACCGGGCACAGGTGACCGCACCAGCCCCGGCTCATGATCAGCAGGTCCAGCAGGAAAATGGCGAGGACCACGGTCCAGGCGGCGCCCAGGCCGAAGATGAGGCCCCGGTGCAGCATGGATACCGGATTGATCAGCTCCCAGGCCAGGCCGGCGCCGGCCAGGGGCAGCAGCAGGGTCAGCCCCAGGATCCAGTAGCGGCTGCGGCGGCTGATGTGGGCGCTGCCCTTGAGACCGAGGCGCTCCCGCAGCCAGCCGGCCAGGTCGGTGACCAGGTTCATGGGGCAGACCCAGGAGCAATACACCCGGCCGCCCACCAGCAGGTAGAAGGCGAGCACGATGGCGGCGCCGAGCAGGGCCAGGCCCTCGGGGCGATGGCCGGAGAACAGCACCTGCAGTACCAGCAGGGGATCGGCCAGGGGCAGGGTGTCCAGGGTCAGGCTGTAGCTGAGGTTGCCCTTGACCAGCCACAGGCCGGCCAGGGGGCCGAGCAGGAACAGGCCCAGGATGCCGAACTGGGACAGGCGCCGCAGCAGCAGCCAGCGGTTGGCGCCGAGTCGGCCCTTGGCGGCCAGGGCGGCGGGGAAACGGGGAGAGAGGAAGCTCATCGCGCCGCCTCGTCGGAAAGCCGGTTGGGCAGGCCGCCACCGGGGATGTTCTGGGGTATGGCCGGAACGCCCGGTCCATGGGCATCGGCACCGGGGATGCTCGGCGCCAGGCTATCGACGCCGCTGCCCGGGGTGGCCGGCTTGCCGGGGACGAGGGACGGGCCTCCCTGGCTGGCCGGGTCGAAGTGGCCCTCCAGGCGGGCGCCTTCAGGCATGCGGTCCGGCAGGTCGCCCAGGCCCTTGTCGTCCACCAGGGAATGGCCGGCCTTCTGTTCTTCCTCCCAGCCGACCCGGTAGTGCTGGCCCAGCTCGCCCTTGGCCAGGGGCACGGGCAGGACCTTGATGGCGGCGGTTTCGAGGACGCAGGAGCGCTCGCATTTGCCGCAGCCCGTACAGTGCTCGGAATGCACCGCGGGAATGAACATGCTGTGGCGGCCGGTGCGGGTGTTGGGGCGCAGCTCCAGGGTGATGGCCTTGTCGATCACCGGGCACACCCGGTAGCAGACGTCGCAGCGCAGGCCGAGGAAATTGAGGCAGGTCTCCTGGTCGAGCAGGACCGCCAGGCCCATGCGGGCCTGGTTGATGTCGGTGAGGCCGTGGTCCAGGGCGCCGGTGGGGCAGGCCTTGACGCAGGGGATGTCCTCGCACATCTCGCAGGGCACCTGGCGGGCGACGAAGTAGGGCGTGCCGGTGGAGACCGGCTGCTCCGGCCGGGCCAGGGACAGGGTGCCATAGGGACAGTCGCGCACGCACAGGCCGCAACGGATGCAGGCACCGAGAAAGTCCTCCTCCGCACCGGCACCGGGCGGGCGCAGGGCTGCCGGCGGCAGGGCCCGGGCCTGCTTGGCGTGGAAGCCCAGACCGAGACCCAGCAGCCCGACGCCGCAGGCCATGCGGCCGGCGTCGGCGAAGAACTGGCGCCGGGCCGCCGCAGCCTTGTCGGACTTGGCAGGAGGGGAGTTGGAAAGATCGCTCATGGCGTGACCGGCGGCGGGAACCGACTGGGGGCGGACTTCGCTGTCCGCCCCGCCGGCCTCCCTGATGCCTTATCCGGTTGTGTAGGTGCTTAGGCCTTGACCACCTTGCAGGCGCACTTCTTGAAGTCCGTCTCTTTCGAGATGGGGCAGGTGGCGTCCAGGGTCAGCTTGTTCACCAGCCGGTGCTCGTCGAAGAAGGGCACGAAGACCAGGCCCAGGGGCGGCTTGTTGCGGCCCCGGGTT
TCCACCCGGGTGGAAATCTCGCCGCGGCGGGACTGCACCTTGACCGTGTCGCCGCGCTGCAGGCCGCGCTTCTTGGCGTCTTCCGGATGCATGTAGATCCAGGCGTCGGGCATGGCCTTGTACAGCTCGGGCACGCGCCGGGTCATGGAGCCGGTGTGCCAGTGCTCCAGCACGCGGCCGGTACAGAGCCACAGGTCGTACTCGGCATCCGGCTGCTCGGCGGCAGGCTGGTAGGGCAGGGCGAAGACCACCGCCTTGCCGTCGGGGAAGCCGTAGAAGCGCACCTTCTCGCCGGCCTTGACGTAGGGGTCGTAGCCTTCGCGGAAGCGCCACAGGGTTTCCTTGTTGTCCACCACCGGCCAGCGCAGGCCGCGGGCCTTGTGATAGGTGTCGAAGGCGGCCAGGTCGTGGCCATGGCCGCGGCCGAAGGCAGCGTACTCCTCGAACAGGCCCTTCTGCAGGTAGAAGCCGAGGACCTTGCCTTCCTCGTTCTCGAAGCCCTTCAGCTGGTCGGAGACGGGGAACTTGTTCACTTCGCCGTTGGCGTAGAGCACGTCATAGAGGGTCTTGCCCTTGTACTCGGGGGCCTTGTCCAGCAGTTCGGCGGGCCACACTTCCTCCATCTTGAAGCGCTTGGAGAATTCCACGTACTGCAGCACGTCGGAGCGGGCCTGGCCCTGGGGCTTGACCTGCTGGCGCCAGAACTGGGTGCGGCGCTCGGCGTTGCCGTAGGCGCCTTCCTTTTCCATCCACATGGCGGAGGGCAGGATCAGGTCGGCGGCCAGGGCGGAGACGGTGGGATAGACGTCGGAGTGCACCACGAAGGCGGCCGGATTGCGCCAGCCCGGGTAGACCTCGCCGTTGATGTTGGGGCCGGCCTGCATGTTGTTGGTGGTGGTGGACCAGAAGAAGGCCACCTTGCCATCCTTCAGGGCGCGGCTCTGGGCCACGGCGTGCAGACCCACCCAGTCGGGGATGGTGCCGGCGGGCAGCTTCCACAGCTTCTCGGTGATTTCCCGGTGCTTGGGATTGACCACCACCATGTCGGCGGGCAGGCGATGGGCGAAGGTGCCCACTTCCCGGGCCGTGCCGCAGGCGGAAGGCTGGCCGGTGAGGGAGAAGGGGCCGTTGCCAGGCTCGGAGATCTTGCCCACCAGCAGGTGCACGTTGTAGATCATGTTGTTCACCCAGGTGCCCCGGGTGTGCTGGTTGAAGCCCATGGTCCAGTAGGAGACGACCTTCACCTTGGGATCGGCATAGGCCTTGGCCAGGGCTTCCAGGTTTTCCTTGGGCACGCCGGAAATCTCGTGGGTCTTGTCCAGGGTGTACTCGGCGACGAAGGCCTTGAACTCGTCGAAGGAGATGTCGGTGGCCTTGTTGGGGTCGCCCTTGGGTTTGCCGTCCGGGCCCGGGTAGCCGTTGTTGCCGGCGGCCTGTTCCAGGGGGTGGTTGGGACGCAGGCCGTAGCCGATGTCGGTGACGCCCTTCTTGAACTTGACGTGGTTCTTGACGAAGTCCTGGTTCACCGCGCCGTTCTGGATGATGTAGTTGGCGATGTAGTTGAGGATGGCCAGGTCGGACTGGGGCTTGAAGATCAGCTCGTTGTCGGCCAGCTCGCAGGAGCGGTGGGTGAAGGTGGACAGCACGTGAATCTTGACGTGCTTGGCGTTGAGGCGGCGGTCGGTGATGCGGGACCAGAGGATGGGGTGCATCTCCGCCATGTTGGAGCCCCACAGGGCGAACACGTCGGCGTGCTCCGCATCGTCATAGCAGCCCATGGGCTCGTCGATGCCGAAGGTGCGCATGAAGCCAGCCACGGCGGAAGCCATGCAGTGGCGGGCATTGGGGTCCAGGTTGTTGGAGCGGAAACCGGCCTTCCACAGCTTGGCGGCGGCATAGCCTTCCCAGATGGTCCACTGGCCGGAGCCGAACATGGCGATGTTGCGCGGGCCGCCGGCCTTGAGGGCTGCCTTGCACTTCTCGGCCATGATGTCGTAGGC
CTGGTCCCAGGAAATGGGGGTGAAGTCGCCGTTCTTGTCGTACTGGCCGTTCTTCATGCGCAGCAGGGGCTGGGTCAGGCGGTCCTTGCCGTACATGATCTTGGAGAGGAAGTAACCCTTGATGCAGTTGAGGCCCCGGTTTACCGGCGCTTCCGGATCGCCCTGGGTGGCCACCACGCGGCCGTCCTTGGTGCCCACCAGGACACCGCAGCCGGTGCCGCAGAAGCGGCAGACGCCCTTGTCCCAGCGGATGCCGTCATTCTTCGGCTGCTGCGCCAGGGCTTCGGATACGCCGGGCACCGCCATGCCGGCGGCGTTGGCGGCGGCGGCGACGGCGCTGCTCTTGATAAAGTCGCGACGGGTCAGGTTCATCTCAGGACTCCTTCTCGGGATCAGGTTCGAAGTGGTGATACACCATGGCCAGGGACATGACGCCCGGCAGCTGCTGTATCGCCTCGTAGGTTTGGGTGGTTTCCCGGTCGCCGTCGCTCTCGATGGTGACGATCATGCGGCCCTCTTCGGAGACGGCGTGGACCTCCACGCCCGCCAAGGTCGCCAGACCCGCCTCGACGGCCGCGATCTGCTGCGGCCCGGCGTTGACCAGAATGCTGGAAATATTCACGGATGATTCCCCCAATCGGCAGGCACGGCTCGAAGGGCCGGCTCAAGGTTCGACCACTCTATGCCCCTACGGAAACCCAGGTTTATGATGTGAATCAAAAGCGAAAAAAAGCCCCCGCAGGATGTAGGTGATTTGAAAAGTCACCGGGCTTCGGTTAGCCTGATCGGGCTTTGCCTGGTGCGGTTTTCCGCCGCATCGGGGTTAATGAGAATGACCCACAACAAGGCCCGCCCCATGTCGGCAGGCCGGCGGTTCGAGTAACAGCCCCGATTCCCCAGAAGGCTTGTTTTCCCACCCACCCATGTCCCGGCCCCTGCCCATTCTCTCGACGCCCGCAGCCGCCCCCCCGGAGGCCGCTCCCCTGACCCGCTACAGCCCCATTCCCTGGGGCCTGGTGATCGTCCTTTCCCTGCTTTTCGTCGTGGTCTGGCTGCTGCCGCCCCTGGGCGGACTCAAGCAGAGCGACACCATCTTCCCCCTGACCCTGCACACGGTGATGGAGAGCTTCTCCTTCGTCGTCTCCGTGCTGGTCTTCGCCGTATCCTGGCATGCCTACAGCCGGGAGCGGGCGGGCAACCTGATGATCCTGGCCTGCGGCTTCCTCGCCGTGGCCCTGCTGGATTTCGGCCATACCCTCTCCTACCGGGGCATGCCCGACTTCGTCACCCCCTCCTCGCCCCAGAAGGCCATCATCTTCTGGCTGGCGGCCCGCTATGTGGCGGCCCTGACCCTGCTCACCATCGCCCTGCGCCCCTGGCAGCCCCTGGCCCGCCCCCGGGACCGCTACCGGCTGATGCTGTGGGCCCTGCTGGTGACCGCCGCGGTGTTCGTCTCGGAGCTTTACCTGCCGGATTTCTGGCCCACCATGTTCGTGCCCGGGGTGGGCCTCACCGGTCTCAAGATCGCCGCCGAATACGGCCTCATCGCCATCCTCGGCGCCACCGCGGTGATCCTCTATCCGAAGACCCAGGGCAAGCCCGCCTTCGACGCCGCCAACCTGTTTACCGCGGTGCTCATCACCATCCTCTCGGAGCTGTGCTTCACCCTGTACTCCAACGTCAATGACGTGTTCCAGCTGCTCGGCCACACCTACAAGGTCATCGCCTATTTCTGGATCTACAAGGCAGTGTTCGTCTCCAGCGTGCGCGATCCCTACCTGCGCCTGAGCCTGGAGATGGCCGAGCGCCAGGCGGCGGAGGCGCGCATCCAGTTCCTCGCCTACCACGACCCCCTGACCGAACTGCCCAACCGCATCCTGGTGCGGGAACGTTTCGAGCGGGCGGTGGAGCGGGCCCGGGACCAGTCCTCCCGGGTGGGCCTGGTCTATATCGATCTGGACAATTTCAAGACGGTGAACGACTCCCTCGGCCACACCCTG
GGCGACCTGCTGCTGCAGGCCATCGGCCAGCGCCTGCAGTCCCTGGTGCCGGCCGGCAGCACGGTCAGCCGCCAGGGTGGCGACGAGTTCCTCATCCTGCTGGAAGACCTGGAGCAGTCCCGGCTGGCGGAGAGCCTGGTGAGCCGCATCGTGGAGCAGATGCAGGCGCCCTTCGAAATCCAGGGCCACGACCTGTCCACCTCCGTTTCCATCGGCGTTTCCCTTTTCCCCGACGACGGCGGCGATTTCGACACCCTGCTGAAAAAGGCGGATACGGCCATGTACCGGGCCAAGGGCGCCGGCCGCAACGGCTACCGCTTTTTCGACCGGGAAATGGACAAGGACGTGGGCGAGCGCCTGCGCCTGAGTAACGACCTGCGCCTGGCCCTGGCGCGCAACGAGTTCGTGCTGCACTACCAGCCCCAGATCGATTTGCGCACCCAGGAAGTGATCGGTGCCGAGGCCCTGATCCGCTGGCAGCATCCGGAACTGGGCCTGCTGGCCCCGGGCCGCTTCATCGGCATCGCCGAGGACACGGGCCTGATCGTGCCCATCGGCGAATGGGTGATCCGCATGGCCTGCCATCAGGCCGCCGCCTGGCAGCGGGCCGGCCTGCCGCCCCTGGTGGTGGCGGTCAATCTTTCCGCCGTGCAGTTCATGCGCGGCGACCTGGTGGGCACGGTGGCCAGCGCCCTGGCCACCTCCGCCCTGCCTTCCCGCTGTCTGGAACTGGAACTGACCGAATCGATCCTGATCCAGGATGCGGAGAACATCCTGGGCACGGTGCAGCGCCTCAACGCCATTGGTGTGCAGATGTCCATCGACGACTTCGGCACCGGCTATTCCAGCCTTTCCTACCTGAAGCGCTTCGCCGTGGACAAACTGAAGGTGGACCAGTCCTTCGTGCGCGACCTCTGCAGCGATCCGGACGACGCCGCCATCGTGCGGGCTATCATCCAGCTGGCCCGCAGCCTGGGCCTGAAGACCATCGCCGAAGGGGTAGAAACGGCGGAAATCCTCGCCCTGCTGCAGGAGCTGGGCTGCGACGAAGCCCAGGGCTACTACTTCGCCAAGCCCCTGCCGGCGGACAACTTCAGCGCCTTCCTCAGCCAGCGCCTGTCCTGACGGTCCGGCGCTGCCGCAGCGCCGGCAGCCCCCATTTCCCCCAACTGAAAACTACTGCCGCACTTCCCGGATGTTGTCGGGCGGCAGGAACTCGCCGCTGCTGCGGAAGGGATTGATGTCCAGGCCGCCGCGCCGGGTGTAGCGGGCATACACGGCCAGGGACTGGGGCGCGCAGCGGGCGCTGATGTCGCAGAAGATGCGCTCGACGCACTGCTCATGGAACTCGTTGTGGCCGCGGAAGGACACGATGTAGCGCAGCAGGGCGGCGCGGTCGATGGCCGGGCCCCGGTAGCGCACCACCACCATGCCCCAGTCGGGCTGGCCGGTGACCAGGCAGTTGGATTTGAGCAGGTGGGAATAGAGGGTTTCCTCGACGCTGCGGCCGGCATCGGCCTGCAGCAGTTCCGGCGCCGGCTGGTAGCGGTCGCACTCGATGTCCAGCTCGTCCAGCAGGATGCCCGAGGGGTAATCCACCCGGGGCCGCGGCTGGGCAGCCAGAGGCTCCAGCTGCACCGTCACCTGGCCGCCGGCGGCGGCGGAAAGATCCCGGACCAGGGTGGCGGCCACCGTTTGCGCGTCGGCGAAGGCGCTCTGGTTGAAGGAATTCAGGTACAGCTTGAAGGACTTGGATTCGATCAGGTGCGGCGTCTGCGCCGGGATGCGGAAGGTGCCCAGGGCCACCACCGGCTTGCCCCGGGGATTGAGCCAGGAAATCTCGTAGGCGTTCCACAGGTCCTCGCCGACGAAGGGCAGGCGGGCCGGGTCGATGCCGATCTCGTCCCGCTTCAGCTGGCGGGGAATGGGAAAGAGCAGTTCCGGGGCGTAATGGCAGCGGTACTCGCTGGCCTTGCCGAGAGGGGAGAGGGCGCTGGGAT
CGATGGTCATGGGGCGGATTTTACCCCCAGGCGGAGGCAGGCCCCAGCATCGGCCGTGAGCGCCGCCTAGAGCAGCACGTAGCCCTGGGGCTGCAGCAGGGGCGGCAGGTCGGTCTCGCCCAGGGCTTCGCCCAGGTCCCCTTCCAGCATGCGCACCATGGCGTCCAGGGGCAGGTCGTTTTCCAGGGTGCCGAAGGGCTCTTCCAGCTCGTCCCCCAGTGCGTCCAGGCCGAAGAAGGTGTAGGCCAGCACCGCCGTCAGCAGGGGGGTGGCCCAGCCGACGGAGCGGGCCAGGCCGAAGGGCAGCAGCAGGCAGAAGAGGTGGGCGGTGCGGTGCAGCAGCAGGGTGTAGGCGAAGGGCAGGGGGGTGAAGCGGATGCGCTCGCAGGCGGCCTGGATGCCGGAGAGGGCGTGCAGGCGCTGGGTCAGGCCCTGGTAGACGATGTCGCCGAGGCCGTCCCGTTGCCGGGCCTGGACCAGGTCGTGGCCGCACTGGCGCAGCAGGGCATCGGCCGGATTGCGGCTCTGGGCAAGGCGCTCGGCCTCGCTGGGAGGCAGGAAGGGTGCCGCTTCCAGGGCTGCGTCCCGGCCCCGCAGGCGGGCGGCCAGGGCGTGGGCGAAGGCCAGGCTGCGGCGCACCAGGAGGCGCCGGGGTTCCGCCTCCAGCACCAGGCTATCCCGGGCCAGGGAGCGCAGTTCGACGATGAGGCCGCCCCACTGCTTGCGTGCTTCCCACCAGCGGTCGTAGCAGGCGCTGTTGCGAAAGCCGAGGAAGATGGAAAAGGCCAGGCCGAGCAGGGCGAAGGGGGCGGCGGAATAGTCGGGGAAGAGGTGGCCGAAGTGCTGGGCGCCCCAGGTGATGAGCACGGCGAAGCTGGTGGTGAAGACGATCTGGGGCAGCACGTGGGGCACCACCGAGCCGCGCCAGATGAAAAAGAGACGGAGCAGGCTGGGGCGTTCGCGGACGATCATGTCGAGGCAGTCGGTGGAGTGGTGGTCGGGGTGGCGCTCGCCAGTGCGGCCAGGCCGGGCAGGGCGGCGCGGGCGTTGGCGCCGGTGGCGGCGATCAGCTCGGCTGTGGGCATGCCCCGCAGCTCGGCCAGGAGGGCGGCGAAGCGGGGCAGGTAGGCGGGCTTGTTGCGCCGGTCCGGGCTCGCCGCGGTGAGAAAGGCCGGGGGAATGTCGGGGGCGTCCGTTTCCAGGACCAGGGCTTCCAGGGGCAGGGTGGCGGCCAGTTCCCGGATGCGGGTGGAGCCGCTGAAGGTCATGGCGCCGCCGAAGCCCAGCTTGAAGCCGAGCTTGATGAATTCGTCGGCCTGCTGCCGGCTGCCGTTGAAGGCATGGGCGATGCCGCCCCGGGGCCGGTAGCGGCGCAGCTGCTTGAGAATGGGGTCCAGGGCCCGGCGCACGTGGAGGATCACCGGCAGGTCGAACTCCACGGCAAGCTGCAGCTGTTCGGCGAAGAAGTGCTGCTGCCGCGCCAGGGCCTCGCCCTGCTGCAGTTCGGGCACGTACAGGTCGAGGCCGATCTCGCCCACCGCCAGCGGCGCCAGGGGGCCATCCCGCTCCTCGGCCAGCCAGCGGCGCAGGGTGGAGAGGTCTTCCTCCCGGGCCGCCGGCGTGTACAGGGGATGGATGCCGTAGGCCGGCGCGCAGCCCGGGTAGGCGAGGCAGCAGGCGCGCACCTCGGCGAAGGTGGCGGCGGCCACTGCCGGCACCACCATGGCCTGTACACCAGCAGTGACGCCATCCTGGAAGATGGCCTCCCGGTCAGGGGCGAATTCCGCCGCGTCCAGGTGGCAGTGGGTGTCGATCAGCATGGAATCTGGCTGTCCGGGCAGGCCGGGGGGAGAGTCCCGCCCTGCCTTACTTCTGGCCGAAGGCAGGTTTGCGCTTTTCCAGGAAGGCGCCCAGGCCTTCGGCGAAGTCGGGATGGACGGAGCAGGCGGCGAAGTTGCCCTGTTCGGCGAACAGCTGCTCCGGCAGGGAATTGCCGCTGG
AAGCCTGCAGCAGGGCCTTGGTGCGGGCCAGGGCCTGGCGCGGACCGGCGGCCAGGCGGCGGGCCAGCTTGGCGCTCTCGGCCTCCAGCTCGGCGGCGGGCACCACCCGGTTGATGAGGCCCCACTCCCTGGCCTGGGCGGCATCGAAACGGTCCCCCAGCAGGGCGATCTCGGCGGCCCGCTTGGCCCCCACGGCCCGGGGCAGGAACCAGGTGGCGCCGCCGTCGGGGGAAAGGCCGATGTGGCAGTAGGCCAGGGTGAAATAGGCGTTGTCGGCGGCCACCGCCAGGTCGCAGGCCAGCATCAGGGACAGGCCGAAGCCGGCGGCGGCACCGCTGACCGAGGCCACTACGGGTTTGCCCATGCGGCGCACCTGCAGGGTGGTGGCGTGCACGGCGGCGATGGTCTGTTCGAAGAGGGCCTGGCGTTCCGCCGGGGGCAGGGCCAGCTGGCTGTGGAACCACTTGAGGTCGCCGCCGGCCATGAAGTGCTCGCCGCCGCGCAGGACCACGGCGCCGACGGCCTCGTCGTGCTCGGCCCGGGCGGTGGCGGCGCGCAGGTCCTCGATCATGGCCAGGTTGAGGGCGTTGAGGGCCTCGGGGCGGTTCAGGGTCAGGGTCAGGACCCCGTCCTCCAGGTGGGAAAGCACGGTGCTCATGGGTTGTCTCCTTTGGGTGGTGGTTTTTATGGTTTTATTGCTTTGTCGTTCCGAGGAACGGGAAATCCGGTTCCGGCCGGCGGCCGGAAATGAGGTCCGCCAGGGCCGCGGCGGAGCCGCAGGACAGGGTCCAGCCCAGGGTGCCGTGGCCGGTGTTGAGCCACAGGTTGGGCAGGCGGGTGCGGCCGATGAGGGGCACGTTGGAGGGCGTCACCGGGCGCAGGCCGCACCAGTAGAGCGGGTCGCCGTCGGGGCGCAGCTGGGGGAACAGTTCCAGGGCGCGCCGCAGCAGGGCCTCGCAGCGCACCGGGGTGAGCTCCAGGTTGTGGCCGTTGAACTCCGCCGTGCCGGCCACCCGCAGGCGGTTGCCGAGGCGGGACATGACGATCTTGCGTTCGTCGTCGGTGATGCTGACGCTGGGGGCGACGCTGTCCGGGGAGAGGGCGATGGTGGCGGAATAGCCCTTGCCCGGATAGACGCAGGCCTTGACCCCGGCGGGCTTGAGCAGGGCCGGGGAATAGCTGCCCAGGGCCACCACGTAGGCGTCGGCCAGGAGCAGGTCGCCGCCGGCGACGACGCCGGCCACCCGGCCGCCGGCGCTGGCAATCTTCTCCACCGGGCAGTTGTAGCGGAACTGCACGCCCCGGGCGGCGGCGGCCTCGGCCAGGCGCTGGGTGAAGCGGTGGGCGTCGCCGGATTCGTCGCTGGGGGTGTAGTCGCCACCAGCCAGGCGCCCTTGCACCGCGGCCAGGGCCGGCTCGATGGCGACGCAGCGGGCGGCGTCCACCGGCTCCCGGTCCACCCCGAACTCCCGCATCAGGGCGGCGGCGTGGCAGGCGGCCTCGAACTCGGCGGCCTGGGTGAAGATGTGCAGGATGCCCTGGCAGCGCTGGTCGTAGTCCAGGGGCAGGGTCTGGCGCAGGGCCTGCAGCCGCTGCCGGCTGTAGAGGGCGAGGGCGATGATGTCGCGGATGTTGCGGCGGGTGGCCCCGGGCGGGCAGTTGGCGAGGAAGCGCAGGCTCCAGGCGAAGAGGGCCGGGTCGTAGCGCAGGCGGAAGAGCAGGGGGGCGTCTTCCTTGCCCAGCCACTCCAGGGCCTTGAAGGGGGCCCGGGGATTGGCCCAGGGCTCGGCGTGGCAGACGGAAATCTGGCCGCCGTTGGCGAAGCTGGTTTCCAGGGCGGCGCCGGGCTGGCGGTCCACCACCGTGACTTCGTGGCCGGCCTCGGCCAGGAACCAGGCACTGGTGACGCCGACGACGCCGGCGCCGAGGACGAGAACACGCACCCGGCCTATTCCTCCTGGCGTTCCCGGGTGATGCGCACCACTTCGGGAATCAGGCGC
ACGGCCCGCAGGACCCGGGCCAGGTGGGCCCGGTTGGCCACCTGCACGGTGAAGTTGAGGGTGGTGTAGAAGCCCGGATCGGGGGCCATGGAGACCTTCTCGATGTTGGAGCCGGACTCGGCGATCTCGGTGGCCACCTTGGCCAGCACGCCGCGGGCGTTGCGGGCGGCGACGTGGATGTCCACGTCGAACAGCTTGCCCGGTTCCGGTTCCCACTCCACGTCGATCCAGCGCTGGGGTTCGGCACTGCGGGACTTGCGGATGACGGCGCAGTCATGGGTGTGCACCACCAGGCCCTGGCCCTTCTTGATGGAGCCGATGATGGGGTCGCCCGGGATCGGGCGGCAGCAATGGGCCAGCTGGATGGCCATGCCCTCGGTGCCGCGGATCACCACCGAGGTGTGGGGTGCCGGTTCCGCGTTGGGCAGGGCCGCCTCGTGGGCCAGCAGGCGGCGCGCCACCACGGCGGCCAGGCGCTTGCCCAGGCCGATGTCGGTGTACACCTCCTTGACGGACTTGCTGCCCCCTTCCTTGAGCACCGCTTCCCAGCTGGCGTCCGGCAGTTCCGAGGGCGTGATGCCGAGGCCGAACAGTTCCTGGTTGAGCAGGCGCTCGCCCAGAGCGGCGGATTCCTCGTGCTGGCGGGTCTTCAGGAAGTGGCGAATCTTGCTGCGGGCGCGGCCGGTCTTCACATACGAGAGCCAGGCCGGATTGGGATTGGCATGGGCGGCGGTGACGATTTCCACCTGATCGCCGCTGTTGAGCTCGCTGCGCAGTGGCATCAGCTCGTAGTTGATCTTGGCGGCGACGCAGCGGTTGCCCACGTCCGTGTGCACCGCATAGGCGAAGTCCACCGGGGTGGCGCCCTTGGGCAGGGAGAATATCTTGCCCTTGGGGGAGAAGACATAGACCTCGTCGGGGAAGAGGTCGATCTTGACGTGCTCGAAGAACTCGGCCGAGTCGCCGGCGGTGCTCTGCAGCTCCAGCAGGGACTGCAGCCAGCGGTGGGTCTGGTACTGCAGTTCGGCGGCGCTCTTCTCCGTGTCCTTGTACAGCCAGTGGGAAGCCACGCCCTCCTGGGCCATGTGGTGCATTTCCTCGGTGCGCAGCTGCACTTCCACCGGCATGCCGTAGGGGCCGATCAGGGTGGTGTGCAGGGACTGGTAGCCGTTGGCCTTGGGGATGGCGATGTAGTCCTTGAACTTGCCCGGCAGGGGCTTGTACAGGGCGTGCAGGGCGCCCAGGCCCAGGTAGCAGCTGGGCACGTCCTTGACCACCACGCGGAAGCCGTAGATGTCCAGCACCTGGGAGAAGGAGAGGCGCTTTTCCACCATCTTGCGGTAGATGGAATAGAGGCTCTTCTCCCGGCCGAAGACCTGGGCCTCGATGCCCGAGTCCCGCATCTTGCTCTGGACCCCGTCGAGAATCTTCGACAGCACCTCGCGCCGGTTGCCCCGGGCCGCCATGACGGCCTTCAGCAGCACCTGGTAGCGCATCGGGTGGGTGTGTTTGAAGGAGAGGTCCTGCAGCTCCCGGTAGACCGTGTTCAGCCCCAGCCGGTTGGCGATGGGGGCGTAGATCTCCAGGGTCTCCAGGGCGATGCGGCGGCGCTTGTCCGGACGCATGCAGCCCAGGGTCTGCATGTTGTGCAGACGGTCGGTGAGCTTGATGAGGATGACCCGCAGGTCCTTGGCCATGGCCAGGAGCATCTTGCGGAAGTTTTCCGCCTGGGCTTCCTGGTAGGAGGAGAACTCGATCTTGTCGAGCTTGGAGAGGCCGTCCACCAGGTCGGCCACGCCCTTGCCGAAGCGTTCGGTCAGCTCCTCCTTGGAGATGCCCGTGTCCTCCATGGTGTCGTGCAGGAGGGCGGCGATGATGGCGGTGGAATCCAGCCGCCATTCGGCAATGGCCCCGGCCACGGCCAGGGGGTGGGTGATGTAGGGTTCGCCGGAAAGGCGCTTCTGGCCCCGGTGGGCGGCTTCGCCGAAGGCAAAGGCCTC
CTTGATCTTGGCGATCTCTTCCGGTTTCAGGTAGTCGAGGCTGTCCAGGAAGACCCGGTAGGCCGGGTCGTCGTTGAACGGGTAGGGGGTGGGCGGCGCCGCCGGGTCAGGTGCCGGCGCGAAGGGCGCGGCGGAAGATGCAGACGGTTTGGCCGGGGCGGGGTCGGTTGCAGTATCCATACCGGTTCACCTTACCCGCCCGGGCCCGGGGTCAGGCCTGACCGCGGTTGAGGATTTCCAGGCCGATCTGGCCGGCAGCCAGTTCGCGCAGGGCGATGACGGTGGGCTTGTCCTTGCTCGGTTCCTGCATGGGGGTGGAACCATTGGCGATCTGGCGGGCCCGGTAGGTGGCGGCCAGGGTCATCTGGAAGCGGTTGGGGATTTGTTTCAGGCAGTCTTCAACGGTAATGCGGGCCATGGTCCATCCAATCAAAAAGCGGAAAATTTAGAGCAGCGAGGCGAACAGCGAAGCGTGGCGTTCCTGCTGCACGGGAAGCTTCAAGCGCGTTGCGCGCACCACGGCCAGCAGGTCGCTGAGGGCCGTCTGCAGGTCGTTGTTAATAATAACATAGTCGAATTCCCCCACATGCCGCATCTCGTCGCGGGCAGCGGCCAGGCGGCGGGCGATGACGTCCTCGCTGTCGGTGCCGCGGCCGGCCAGGCGGCGGGCCAGTTCTTCCATGGAGGGCGGCAGGATGAAGACGCCGATGGCGTCGCCGAACACCTTGCGCACCTGCTGGGCGCCCTGCCAGTCGATCTCCAGCAGCACGTCGCGGCCGGCGGCCAGCTGCTGTTCGATCCAGGTGCGCGAAGTGCCGTAGTAGTTGCCGTGCACCTCGGCCCATTCGAGGAACTCGCCCCGGTCCACCCGGGCCAGGAAATCGGCCACGTCGGTGAAGTGGTAGGCCTGGCCGTTCTCCTCCCCGGTGCGGGGCGCCCGGGTGGTGTGGGAGACGGAGAGGCCGATGGCCGGGTCGTTCTGCAGCAGCAGGCGGACCAGGGTGGTCTTGCCGGCGCCGGAGGGGGCGGTGACGATGTAGAGGTGGCCGCTCATGCTGGCATCCCTTTCCTTATTCGATGTTCTGGATCTGTTCGCGCATCTGCTCGATGAGCAGCTTCAGGTCCATGGAGGCCTTGGAGACTTCGCTGAGGACCGACTTGGAGCCCAGGGTGTTGGCCTCCCGGTTCAGTTCCTGCATGAGGAAGTCGAGGCGCTTGCCGGCGTTGCCGCCGGCCTTGAGGATGCGCTCCACCTCGGTGAGATGGGCCTGCAGGCGAGACAGTTCCTCGTCCACGTCGATACGGGTGGCATACAGCACCACTTCCTGGCGCACCCGTTCGTCGTCGGCGCTGCCCAGGGCCTCCACCAGGCGCTGCTTGAGCTTGTCCTGGTAGGCAGCCTGGGCCTGGGGAATGAGGGGGGCAACGGCGGCCACGGTGGCGCGGATCTTGTCCACCCGCTCCTGGATCATGGCGGCCAGCTTGGCGCCTTCCCGGGCCCGGCTGGCGGTGAAGTCCTCCAGGGCCTCCTTCAGGGTGGCCTGGACGGCGGCGTGCAGGGCGGCCGTATCCACCTCCGGTTCGCCCAGCATGCCGGGCCAGCGCAGCACTTCGGCCACCGACAGGGCGGCGGCGTTGGGCAGGGTCTGGCGTACCTGGCCTTCCAGGGCCTGCAACTGGGTCAGCAGGTCGGCGTTGATGGCCAGCTGGCGGTTCTGGCTCTGGCTGGCGACCAGGTTGAGGCGCAGTTCCACCTTGCCCCGGGCCAGCTTGGCGGTGATGGCTTCGCGCAGGGCCGGCTCCAGCACCCGCAGGTCGTCCACGATGCGGAAATGGATGTCGAGGAAGCGGGAATTGACGCTGCGCAGTTCCAGGTGCAGGGAGCCGCCTGCCACTTCCCGGGTTTTGGCGGCATAGCCGGTCATACTGTAGATCATGGAATTCCTTGCTGTGTGGGGTGTGCCGGCCCGGACGGCGGGCCTGGGAGAGGCTTCTTGAGG
CTTCTTTACAGTCCGTTGGCAAAGCCTGACAATGCGCGCAGTGCGCCTGAGGCGCTATCTTAGCTTCGGACTTCATGGCGTCACAAGCCAATCACCCCCTCCCTGCCGGATTCCAACTGGAAGACTACCGCATCGAAAAGCAGATTTCGGTCGGCGGTTTTTCCATTGTTTACCTGGCCCACGATGCCAGCGGCAAGGCGGTGGCCATCAAGGAATACCTGCCGGCCAGCCTGGCCCTGCGCTCCGAGGGGCAGACCAAGCCGGTCATTTCCCAGGAGCATCTTTCTGCCTTCCGCTACGGCATGAAATGCTTTTTCGAGGAAGGCCGGGCCCTGGCCAAGCTGAACCATCCCAACGTGATCCAGGTGCTGAACTTTTTCCGCGCCAACGACACGGTTTATATGGTCATGGAATACGAGCGGGGGCGCACCCTGCAGGAATTCATCCAGAAGCACCACGGTCACATCCACGAGAAATTCATCCGCGGCGTGTTCACCCGCATGCTCAACGGCCTGCGCGAAGTGCACACCCACAAGCTGCTGCACCTGGACCTGAAACCGTCCAACATCTACCTGCGGGCCGACAATACGCCGGTGCTGATCGACTTCGGCGCCGCCCGCCAGACCCTGCATTCCGACACCCCCATGCTGAAACCCATGTACACCCCGGGTTTCGCCTCCCCCGAGCACTACTTCAAGCGGGACGAACTGGGGCCCTGGAGCGACATCTATTCGGTGGGCGCCTCCATGTACTCCTGTCTGGCCGGGGCGGCGCCCCAGGCGGCCGATGCGCGCATGGAGAAGGATCAGCTGCAGCCGGCCTCGGTGCGCTGGGAGGGCCAGTATTCGGACCAGCTGCTGGAGACCATCGACTGGTGCCTGTGCCTCAACCACCTGTACCGTCCCCAGAGCGTCTTCGCCCTGCAGAAGGCCCTCACCGAGGCGGTGGACATGCCGGGTCAGGGAGCCAGCAAGGCGGCGGAAAAGGAAGGATGGCTGGGCCATCTGGTGGGCAAGATCAAGGGAATGACTGCTAAATGAAATTCACCATCTACCAGGAAAGCCGCATCGGCAAGCGGCAGAACAACGAGGACCGGATCGCCTACTGCTACTCGCGGGAGGCGGTGCTGATGGTGGTGGCCGATGGCATGGGCGGCCATTACCACGGCGAGGTGGCCTCCCAGATCGCGGTGCAGACCCTGACCTCGGCCTTCCAGCGGGATGCCCAGCCGGAGATCGCCGATCCCTTCCTCTTCCTGCAGAAGGGCATGACCAATGCCCACCACGCCATCCTGGACTATTCCCAGGAGCACCGGCTGAAGGATTCGCCGCGCACCACCTGCGTCGCCTGCCTGATCCAGGACAACATCGCCTACTGGGCCCACGTCGGCGATTCCCGCCTCTACCACATGCGCGACGGCAAGGTGCTGGCGGTGACCCGGGACCATTCCCGGGTGCGCCTGCTGATGGACGAGGGCCTCATCAGCGAGGCCCAGGCCGCCACCCACCCGGACCGCAACAAGGTGTACAGCTGCCTGGGGGGCGAAAACCCGCCGGAAATCGAGTTCTCCCGCAAGACCCCCCTGGAAGTGGGGGATGTCCTGGTGCTGTGCACCGACGGCCTGTGGGGGCCGCTGCCGGCCGATGTCATGGCCGCCTCCCTGAAGGGGGCCAACCTGATGCAGGCCGTGCCCATGCTGCTCAACCAGGCGGAAATCCGCTCCGGCCCCTACGGCGACAATCTTTCCGTGGTGGCGGTGCGCTGGGAGCAGAGCTACAGCGAGGAGGCCTCCAGCACGGTGATGACCCAGACCATGCCCCTGGACGCGGTGACCACCAAGCTCGGCGAATTCGGTCGGGACCCGGCCTACAAGACCGATCTTTCCGACGACGAGATCGAAAAGGCCATCGACGAAATCCGCGCCGCCATCCAGAAATTCTCCAAATAAGGAAGTTCCATGCGTCCCAGCCAACGTGCCGCCGACCAGCTGCGCCAAGTCC
GCATCACCCGCCGTTTCACCCGCCATGCCGAAGGTTCGGTGCTGGTGGAAATGGGCGACACCAAGGTGCTGTGCACCGCCAGCATCGAGGAAAACCTGCCGCCCTTCCTGCGCGGCAAGGGCCAGGGCTGGGTCACCGCCGAATACGGCATGCTGCCCCGCTCCACCCACACCCGCAGTTCCCGGGAAGCGGCCAAGGGCAAGCAGACCGGCCGCACCCAGGAAATCCAGCGCCTCATCGGCCGTTCCCTGCGCGCCGTCACCGATCTCAAGGCCCTGGGCGAGCGCCAGATCACCCTGGACTGCGACGTGCTGCAGGCCGACGGCGGCACCCGCTGTGCCTCCATCACCGGCGCCTGGGTGGCCCTGTGGGACGCCTGCCAGTCCCTGGTGGCCGCCGGCAAGCTGAGCGAGAACCCCCTCAAGGAACACGTGGCCGCCATCTCCGTCGGCATCTACAAGGGCACCCCGGTGCTGGACCTGGACTACCCGGAAGATTCCGACTGCGATACCGACATGAACGTGATCATGACCGGCAGCGGCGGACTGGTGGAAGTTCAGGGCACGGCCGAAGGCGAGCCCTTCTCCCGGCAGCAGATGAATGTGCTGCTGGACCTGGCCGAAGCCGGCATCCGCCAGCTCATCCACGCCCAGGAAACCGCCCTGGCGGATTGATTCGGAGCCCGTCATGGCCCAGGAAACCTCCCGCGACCCGATCAAGGCCCTGCTCGACGATCTGGAACAGTCCATCGCCGATTTCGATCAGCGCCTGGGTGGCGTCGAGGAGTCTCCTGCCGTGACCGGTCTGCGTTCTTCCGGGCAGCGCTATCCCGACATCGAACCCGAGGCCAGGCGTCAACTGTCTCCTGCCGCTCCTGTTGCCGTTGCCGGCAATGCCGACGCAACCGCTGTGTCCGAAGCGCCGGCGGTGGACCTGCTGGCCGAACTGGCCCAGGCGGCGGCCTGCCGCAGCGTGGATGATGCGGAGACCCAGCGTCGCCAGCTGGAACTGACCGAGCGCCTGCACCAGGACCTGAAGACCGTCTTCGACTACCTCAACCAGCTCATCCGCCACGCCAACACCCTGAAGCCGGTGCTGCCCCGCAGCTACCGGCTGGATGCGCGCAACAGCTTCGACGGGCTGGCCTGGCATGACGGATTCGTCGATTACCGCAGCACCAGTCGCTTCGACCGCAGCTACTACGAGCAGATTCTCTTCCAGGTGAGCTACCGGGCGCCGGCGCCGCTGGTCGCGGTCTGCGCTGCGGACCAGGCCGCCATCGTGCGCAAGGAGCTGGAACTGGTGAACCTGCGCATCCAGCGAGAAGAGCCGGTGATGCTGCCGGAGGGCGGCCCCGGGGTGCGCTATGTGCTGCCGGATGCCATTCCGCTGCATCTGGCGGTACAGGCGGACTTCGCCAACGATGCCCTGACCTTCCGCTGCCGCAATGCCGGCAATTTCGGCCCTACTGCCTACCGTCTGCCGGGCGGGAGCATCACCCGGCCCCTGCTCGACGGCATCGGCCTGGTGCTGCTGGGCCGCAGCGACACCATGCCCAAGGAACTGCAACGCATTCCCTACCAACGGATCAACTGAGTCCCATGCAAAAGATCGTCCTCGCCTCCAACAATGCCAAGAAGCTCAAGGAACTGTCAGCCCTGCTGACACCCCTGGGCATCCAGCTCATTCCCCAGGGCGAGCTGGGGGTGCCGGAGGCGGAGGAGCCCCACCACACCTTCCTCGAAAACGCCCTGGCCAAGGCCCGCCATGCGGCCCAGCTGACCGGCTTGCCGGCCCTGGCCGACGACTCCGGCCTGTGCGTCAAGGCGTTGGGCGGCGCTCCCGGAGTGCAGTCGGCCCGCTACGCCGGCGAGCCCAAGTCCGATGCCCGCAACAACGAGAAGCTGCTGGCGGCCCTCACTGGCGTGGCCGACCGCCGTGCCCACTTCGTCTCACTGCTGGTGCTGGTGCGCCACGGCGACGACCCCCAGCCCCTGGTG
GCCGAGGGCGAGTGGCACGGCGAGATCATCGACCAGTACCGGGGCGAGGGAGGCTTCGGCTACGACCCCCTGTTCTACGTGCCAGCGGAAAAGGCGACGGCGGCCGAACTCTCCGCCGAGGTGAAGAACCGTCTCTCCCATCGTGGCCAGGCCATGGCCCGGCTGCTGGAACGCCTCAAGCTGGAACTGTGAGCCCGGCCGGCGCCGGGAAACCTTCTGGCGCTGCCGCGGTCCAGGGCAGGGTGTTCCGGACTTGCCGCTCCGCAGTGGCCTGACTCAAAGCGCCATTATTCAGACATCGCATCATGCCGCCTGCGGGCGGAAAGGTTTTCTTTCGTGTCGTCCCGTTCTTCCTCCCGCATCATTCCCATCGCCGTCGCCGGCGGCACCCGCGCTGGCGGCAGTCCCCTGCACTTCACCAGTCCTCCGCCCCTCTCGCTCTACATCCACGTGCCCTGGTGCGTGAGGAAGTGCCCCTATTGTGATTTCAATTCCCATGAGGCGCGGGCGGAGAACGACGAGGCCGCCTATGTGGCAGCCCTCGTTGCCGATCTGGAAAGCGCCCTGCCGTCGGTGTGGGGACGCAAGGTATCCACCATTTTCATCGGCGGTGGCACGCCAAGCCTGCTCTCCGGCGAGGCCCTGCACGAACTGCTGAATGCGGTGCGCATGCGTCTGCCCCTGCTGCCCGAAGCGGAGGTGACCCTGGAGGCCAACCCGGGCACCGCCGAGGCGGGCAAGTTCGCCGCCTTCCGGGCCGCCGGAGTGAATCGTCTGTCCCTCGGCATCCAGAGCTTCAACGACCGGCACCTGGAGGCTCTGGGCCGCATCCATGACAGCGCTGAGGCCAGGGCCGCCATCGAGTTGGCCAAAGCCCACTTCGAGCGCTTCAACCTGGACCTGATGTACGGCCTGCCCCAGCAGTCCCAGGCCGAAGCGATGGCAGACCTGGAGATGGCCCTCTCTTTCGCGCCGCCCCATCTTTCCTGCTACCAGCTGACCCTGGAGCCCAACACCCTCTTTGCCGCCCGGCCGCCCCAGCTGCCCGAGGGCGACACCTGCGCCGACATGCAGGACGCCATCGAGGCCCGCCTGGCTGCCGCCGGCTACGTGCATTACGAAACTTCGGCCTTCGCCCGGCCCGACTACCAGTGCCGGCACAACCTCAACTACTGGACCTTCGGCGACTACCTGGGCATCGGCGCCGGGGCCCACGGCAAGCTGACCCTGCCGGACCACAGCGGCTTCTCGGTGCAACGCCAGATGCGCTGGAAACAGCCCAAGCAGTACCTGGAGCAGGTGGCCGCCGGCCAGCCGGTACAGGAGCAGCACGGCGTGGGGGCGGACGAGCTGCCCTTCGAATTCCTCATGAACGCCCTGCGCCTCAACCAGGGCTTCGATCCGGCCCTCTTCGAGCAGCGCACCGGCCTGCCCCTGCTGCTGGTGCGGGGCGAGCTGGAAAAGGCGGCCCGGGAAGGGCTGCTGACCCTGGCACCGGACTGCATCGCACCCACCGAGCGGGGCCGGCGCTTCCTCAACGCCCTGCTGGAACGCTTCCTGCCGGATGCCTGAATAGGCTATGCCGCAGAAGTAGGAAGGACGCCATTCTTCACCACGGCACGGTGAAGAATGGCGTCCTTCTTGTCTTCTGGCTGAGGTGCCTTAGAAACCCTGGGTATAGGTCAGGCGCCAGATGCGGGGTTCTGCCGGGTGGCTGTGCACATCCTCCACTTTCGCTGCTTCGCCGCGCAGTTGGGACTCGTAGTAGTAGTCGATGTCGCTGGCCTTGCGGTTGAACAGGTTGAGCACTTCCAGGGTCAGCTGGCTCTTGGCGGCCAGCTTGTAGCCCACATTGAGATTGACCATCACCGAGCTGCCGGAGCGCACAGAGTCGTCTTCTTTGAGGGCCCGGGGGCCCAGGTAGCGCAGGCGCAGGCCGCCGCGCCAGGGGCCCAGGTCGTGCACGGCGACACCGACGGAGGCGGTGCGTTCGACGGCGC
CGGGGACGTGGTTGCCAACGCTACTGTCATCGCGGAAGCGGGCCTTGGAGAGGGCGATGTCCGCATCCAGGGTCAGCCAGTCTCGGGGCGTCCAGTAGTTGGACCATTCCATGCCCTGGCGGTGGCTGGGGCGGCTGGCCTGGGTGGTGCCGGCGTCGCCGACGAAGAGCAGTTCCGAATCCAGGTCCAGGCGCCACAGGGCGACGCTGGTGTTCCAGCCGGGGGCCGGGGCGCTGCGCCAGCCCACTTCCTGGCCCCGGGATTTGACCAGGGCCGGCACCCGGGACATGGGGTCGCCCGGGTTGGAGGGATCGACGCGGATGGTGGTGCCGCGGGCGTCGTTGCTGTGGAAGCCCTGGCCCCAGTTGTAATAGAACTCCTGGTTGGCGAAGGGGCCGAAAATGAGGGACAGCTTGGGGCTGGTGATGCCGTCGTTTTCCTTGCCGGAATTGGCGGCCAGGCTGGAATCCACCTTGAAGCGGTAGCGGTCGTGGCGCAGGCCGGCGACGCTGCGCAGCCAGTCGCTCCACTGGGCGCCCCACTGGCCGTAGAGGCCCAGGCTGCCCTGGTTCACGCTGTCGCTGCGCACGGTGGACAGGCGCTGGCGGGCGGCGGTGCGGTACAGGCCCACGTTGTCGATGTCGTCCTGGCGCCCCTGCACGCCCCAGGTGAAGTCCCCTTCCTTGCCCAGCCACTGCACCGGCTGGCTGCGGCTCCAGCCGAAGCCGCCGTAACGGCGCCGGTCGGCCTGCTCGAACTGGTCGCCGTTGACCGGATCGTCCATGGCGTAGGTGAAGTTGGAGAAGAGGTTGAGCCGGTAGTCCACCAGATAGGTGTTGGCCCGGGTCTGCACCGCCCCGTCCTGCCGCGCCCACTGGCCGGAGAGGGAGAGGCGCCGGGTGTTGCCGCCGGCGGTGGGGTCCAGGCTGCCGTAGCGGTTCACCAGGCCCTGGTCCACGGCCCGTCGCGCCAGCTGGTCGGTGGAGGTCCAGTCGCCGTCGTAGGCCATGAAGGCCAGCGAGTGGCCGTTGTTGCGCGTGCCTTCCGAATAGCGCAGCACGCCGTTGAGGCGCTTGTAGTGCTCCGGTACTTCCCAGGGACCATCGTTGTGGAAGACCTCCACGGCGCCCAGCCAGCGGCCGCCGCCGGCGGTCTCCTTGTCGGCGGCGGTGAGGAGGCGGCGGTAGCCGTTGCTGCCGAGGCCGATGCTGACGTAGTCCTCCGGCAGGGCGCGGCGGTAGTCGATACGGGCGCTGCCGGCGGAGGAGAAGTCCCCGTCCTCGGCGGCGTAGGGGCCCTTCTTGTACTGGATGCGCTCCACCAGCTCGGGGATGAGGAAGTTGAGGTCCAGGTAGCCGTGGCCGTGGGCGTGGGTCGGCAGGTTGATGGGCATGCCGTCGATGGTGACGGAGAAGTCGGTGCCGTGGTCCAGGTTGAAGCCGCGCAGGAAGTACTGGTTGGCTTTGCCGTCGCCGGCGTGCTGGGTGACGATGAGGCCGGGCACGGTTTCCAGCACTTCCGCCGGGCGCAGCAGGGGGCGGTTCTCCAGCTGCTTCGCTGTCACCGTGCCGACGCTGGCGGCGTCGGCGACGCCGATCAGGTCCTGGGCGCCGGCCTTCACTTCGATCACGTCTTATGTAATCCAATTCAGTGTTGCGTATAGCCAAGTTGCTTCTGCTGAGCGGGATTACATCAGGGGATACGGTTTTCTAGTAGCCTAAAGAGAAAAACGAACAAACCGAAAAATTCTGACTATTGTCAGCTGACACTATCGGAATGATGTGGTTATGTGGCGCCCTTCTAATAAGGCCCTGCATAGCCAATGGCACATGATACCAATCCTACGTTGCGTGAAGCCCTTTTCATGTACTGCGAGCGCATCAGTATCCATAAGAAGGGCCACGCACAAGAGAAATATCGAATTAACCTATATTGTCGTTATTCCATTGCTGATCTTCCGATTCGCAATATAACGTCAGTCGATGTGGCGACATTTAGGG
ATGAGCGATTAGCGGAGATTAACGCACGAACGGGTAGGGCACTTTCCCCTGCTACGGTTCGGCTGGATCTGGCGCTGCTTTCCGACCTGTTTCGCATTGCGAAGAACGAATGGGGTATATGCAACGATAACCCTGTCGCCAACGTCCGTAAGCCAAAACTTCCGCCTGGCCGTGATCGACGCTTGGCTCCTCGTGAAGAACGGATGATCATGAGGCATTGTTCCCAGCGGGGCGCGCATGAGATGAAGGCCATTGTCCAATTGGCATTAGAAACTGCTATGCGTCAGGGGGAGATTCTGGGGGTGTGCTGGGAGCACATCAATCTGAAATCCAGAATTGTTCATCTGCCCGACACCAAGAATGGTTCCAAACGTGATATCCCGTTAAGCATGGAGGCTAGGGATATCCTGGCGGCCCAGAGGGTGAAGCTGTCGGGGCGAGTCTTTAGCTATACGAACAACGGATTGAAGAGCAGTTGGCGAAGCATGATCAAGAGGCTGAATATTCCTGATCTGCATTTCCACGATCTTCGGCACGAAGCAATCTCTCGCTTGATGGAACGAGGTGTCTTCAACCTGATGGAAGTTGCTGCCATCAGCGGACACAAGAGCCTGTCCATGCTGAAGCGATATACGCATCTTCGTGCTCAGCGTTTGGTGCGTAAGCTCGACGCTGGCGCAAACAAGGGGAAGGCTGCGGTCTTGAGCTACCTGGTTCCTTATCCAGCCTTCATCGAGCCCTATGAGAGCCAGGTAAAAGTAACCTTCCCGGACTTTGACGATCTGCATGTGGCAGGGCCATGTCTAAACAGTGCAGTACAGCAAGCTCAGGATGCCCTATTACGGGAAATTTTGGTCTTGATGCGTCAAGGTCGGCCGATCCCGCCGCCAAACAACTACCTAGAACTCCTCGATGAATCCAGGCTCTTTCACCTGGACCCGTTGGCAACCTATGATTCCCTCGCGGATCTTGCCGAGGGCGCGCTGGTTTGAGTTCTGATGACCTTGCGAGGGAGGAGGAGTCCCTACTGGTGTGGATGAGCGGGTTGAAAGCAGGACCTTATCTAACTACCGCTTGAATTGGGGTGACAGTCATCTAGTGGCAAATCCACAAAAAAGTTGCGCTTGAAGTGAACAACTAAGTTCGTGCATACTGAGAATATGTATCAATCTCTTCGCCACCGTTTTCGTGCATATGTGCAGCACCCTATCGGGCGCTGCGCGGTGGTATTGATGCTGTTTGCCTTGGTGGCAGCCAGTGTTCCGTTTGGTGAGATTCATGCCCATGCGGATGGTGATCACGATCATGATCACGGTTACGTCACTGCTGAATTGACGAAGGCATCGCTCTCCGATCCTTCAGACTCTATGGACTCCGATTCTGATTCGACCGGAGCCAAAGTGCTGCATGCACACGGTTCCGTTGTCACTCCTCCGCCCTTGCCGGTGGATGGACTGGGAATCGAGCCATTCATCTTTCCCGCCCGGGACAAGATTACCCTCGCCTACTTGTCGCGGCCTTCTGCGACACCACTTCCCCCCTATCGCCCTCCAATCGCCTGACGCCTAGCGCCGCTTTTTAGCGGCTTGCTTTGTGTCGTCTGTCGATATTGGAGGTTTTCCGTGAACTTGCGTTGTTTGCTGGTTCTGGCTGTAGCCGGAACCTGCGGTATCCCCTTGTCGGGGTATGCCGCTGAATCCTTGCGCCTGGAGGAAGCAGTCTCCCGCGCCTTGGCATCCCACCCCTCACTTGCGGCCGAGGCCGCGCAATTGAAAGCCGTTCAGGCACGCGCTCAGCGTGAAGGCCTGGCAACGCCCTTTATGATCGGCGCCGATGTGGAAAACGTCGGCGGTACTGGAGCCTTTCGGGGGGGGCAATCAGCTGAAACCACGCTACGTATTGGCCGCGTCATTGAACTGGGCGGTAAACGTGAAGCGCGCCAGGCATTGGGTAGCGCTGAAATCAATCAGCAACAGAACCTGTCCGAGGCAACCCGCCTG
GATGTCATCAGCCGCACCTCACTCCGCTTCATTTCAGTGCTGGCTGACCAGCAACGGCTGAAATACGCTCAAGAGCAGGTAGGACAAGCCGAGCGCACACGCCGCGAGGTCGCCAATTGGGTAGCCGCTGCCCGCAACCCGGAGTCAGATTTGCGTGCGGCTGAAATCGCCGTTGCTGACGCCGAGCTGGAGCGCACCCGGGCCGAGCACAAGCTGACCTCTGCCAGGTTAACCCTGGCCTCCAGCTGGGGGGTCTTAACACCCGATTTTGAGACGGCTGCAGGCAATCTGCTCGTGCTGCCCAAAGCGGAGTCGCTGGATACCTTGGTGGCTCGTCTGCCGATGACACCAGAGCAACGTGCCGCATTGCTCGAGGCGGATAGTATCGCTGCTCGCAAGCGCTTAGCCGAGGCCGGCGCCAAGCCGGACGTTACCGTCAATCTGGGTGTGCGTCGCCTTGAGGCAACCAGCGATCAGGCATTGATGATGTCGGTATCGATTCCACTCGGCAACCAGGTTCGCTCGGGACTGTCCGTCGCCGAAGCCAATGCGCAACTGATGGCACTGGAAGCTCGCCGCGATGCTCAGCGTTTCGAGCACTACCAGTCGCTGTTTGGAAAGTATCAAGAACTCAATCAGGCCCGTACTGAAGCTGAAACGCTGCAAAAGCACATGCTTCCCAAGGCCGAGGAGGCACTGGCCTTCACCCGGCGCGGCTTCGAAGCCGGCCGCTTCTCCTTTCTTGCCCTGGCACAAGCGCAAAAAACCCTATTCGAACTGCGCCAACGCGCTGTCGATGCTGCTGCTCGCTGCCAGATCCTGATGACCGAGGTGGAACGCCTCACCGCCATTGCCCCGGAACCCACGCCATGAACCGACTATTACCCCTGATTCTCGTGCCTCTGCTGCTGACAGCTTGCGGCAACGATACCCCTCCCTCCGCTGTGGTTGCTGCGGAAAAAGCCAGTGCTGCCGAAGAGTACGAGCGTGGCCCCCATCGCGGCCGGATGCTGCGCCAGGGTGACTTCGCTCTCGAAGTGACCATCTATGAAACCAATGTGCCGCCGCAGTATCGGCTGTATGCCTACCAGAACGGCAAGCCTTTGCCGCCGGCCAGCGTGCAAGCCGCAATCCAGCTCAAGCGCCTGGATGGCGAATTCAACAATTTCACCTTCACGCCGGAAAAAGACTACCTGAACGGCAGCAGTGAAGTCATTGAGCCCCATTCATTCGATGTCGAGGTCAAGGCCCAGCATGCCGGCCAATCCTACAGCTGGGCGTTCCCCTCGTATGAGGGGCGCACCACGATTCCGGCGGCTGCCGCAAACGACGCAGGGGTTAAGGTCGAGAAGGCCGGTCCGACAACAATCCGCAATACAGTGCGGCTGATGGGTGCTGTGATGGTCGATGCGAATCGGCGTGCCGAGATCAAGGCCCGCTTCCCGGGCATCGTACGCGCGGTCAATGTCCAGGAAGGGCAGCGTGTCAGTCGTGGCCAGACGCTGGTGGCGATTGAAGGTAACGACAGCATGCGGACCTATTCCGTTGTCGCACCGTTTGACGGCATCGTCTTGGCGCGCAATACCAACGTCGGCGACGTTGCCGGCAGCAACACCCTGGTTGAACTGGCGGATTTGTCCAGCGTCTGGGTGGAATTACGGGCTCTCGGTGGAGATGCGGAGAAGCTGTCCGTGGGCCAGGAGGTCGAGATTTCCTCGGCCACCGGTGGCAGCCGGGTCACCGGGAAAATCCAGACGCTGCTGCCCCTGGCCTCCGGGCAAAGCGTGGTGGCCCGTGCCAGCATTGCCAACCCTGAAGGGCGGTGGCGGCCGGGTATGGCGGTCTCTGCGGATGTCACCGTGGCGGCACGCCAAGTCCCGCTGGCGGTGAAGGAATCCGGCCTGCAACGCTTCCGTGATTTCACCGTCGTCTTTACCCAGGTAGGGGACACCTACGAGGTCCGCATGCTCGAGCTGGGTGAGCGTGATGGCCGCTACGCC
GAAGTGCTGGGCGGGCTGAAGCAAGGTGCTACTTATGTAGCTGAGCAGAGCTTCCTCATCAAAGCCGACATAGAGAAGTCCGGCGCCAGCCACGATCACTAAGGGATTTGCCATGCTAGAACGAATGATTCGTGCGGCAATCGCACATCGCTGGCTGGTCCTGATACTGGTTCTGGGCACCTCCGCACTTGGTGTCTGGAGCTATGGTCGCCTGCCGATCGATGCCGTCCCCGACATTACCAATGTCCAGGTCCAGGTCAATTCCGAGGCCCCCGGCTATTCGCCGTTGGAGGCAGAGCAACGTGTCACCTTCCCGGTAGAAACCGCCCTGGCAGGTATGGCTCGCCTGAAGTACACCCGCTCGATTTCGCGCTATGGACTGTCCCAGGTCACCGTGGTGTTCGAGGACGGTACGGACATCTACTTTGCCCGACAGCAGGTGAGCGAACGTCTGCAACAGGCGTCTTCCCAATTGCCGGCTGGCGTCAAACCGACCTTGGGACCGGTGGCGACAGGGCTGGGTGAAATCTTCATGTATACGGTCGAGGCCACACCAGGGGCTACCAAGGCGGATGGCAAACCCTGGATGCCTACGGATTTGCGAACACTGCAGGATTGGGTGATTCGCCCTCAGCTGCGTAACCTGAAAGGTGTCACCGAGGTCAATACCATCGGCGGCAACGTGCAGCAGTTTCATGTCACCCCCGACCCGGCCAAGATGGTGGCCTACAAGTTAACCATTGATGACCTGCTGCAGGCCATTGAACGTAACAACGCCAATACGGGCGCCGGTTACATCGAACGGGGTGGTGAGCAGAACCTGATCCGCATTCCTGGGCAGGTGGGTGATGAGGCTGGTTTGCGAGAGATCGTGGTGGCAATGCGTGACGGGCTGCCCTTGCGAATTAGCGACATAGCTACGGTCCAGATCGGCTCGGAACTGCGCACCGGTGCCGCAACCAGGGATGGCCGGGAAGTGGTGCTGGGCACGGTATTCATGCTGATTGGTGAGAACAGCCGGGAAGTCGCCATGCGTGCAGCGACCCGCCTCAAGGAAATCGATGCTTCGCTACCGGAAGGGGTCAGTGCGCGTGCGGTTTATGACCGCACCCAACTGGTGGACCGCAGTATTGCCACGGTCCAGAAGAACCTCCTCGAGGGAGCCTTGCTGGTAATCGTGGTTCTTTTCCTGCTGCTGGGCAATATCCGTGCGGCACTGATCACGGCGGCCGTGATCCCGGTTGCCATGCTGATGACCATCACCGGCATGGTGCAGAACCGGGTATCGGCCAACCTGATGAGCCTTGGGGCCTTGGACTTCGGCCTGATCGTCGATGGCGCCGTGATCATCGTCGAGAATTGCCTGCGCCGCTTCGGTGAGCGGCAGCACGCCCTGGGCCGCTTACTGTCCATCGAGGAGCGCTTCCAACTAGCTGCGAAAGCAAGTGCCGAGGTAATCAAGCCTAGCCTGTTCGGTCTGTTCATCATTGCCGCTGTTTATCTGCCGATCTTTGCCCTCAGCGGGGTCGAGGGCAAGACTTTCCATCCCATGGCTATCACTGTGGTCATGGCGCTGGTTGCCGCAATGGTGTTATCCCTGACTTTCGTGCCGGCGGCCATCGCACAGTTCGTCACCGGCAAGGTCGAGGAAAAAGAAACCCGCCTGATGCAGCGGCTGCATGGGATTTACGCTCCTCTGCTGGAGAAGTCCCTATCGCTGCAAAAGCCGGTGATTGGCGCCGCTGCAGTGCTGGTGGTGCTGTGTGGATTGTTGGCGACTCGCCTGGGTACGGAGTTCATCCCCAACCTGGATGAGGGGGATATTGCCCTGCACGCCCTACGCATCCCGGGTACCAGCCTGACCCAGGCTATCGGTATGCAGGCCCAGCTCGAAGCACGGATCAAGCAGTTCCCGGAAGTAGACAAGGTGGTGGGCAAGCTCGGCACGGCAGAAGTGGCCACCGACCCGATGCCGCCTTCTGTGGCCGATACTTTCATTC
TGCTCAAGGAACGCAAGGACTGGCCGGACCCGCGCAAGTCCAAGGCTACCCTGGTGGCGGAGCTGGAGGAAGCTGTTCGTGCCATCCCCGGCAACAACTACGAGTTTACCCAGCCGGTACAGATGCGGATGAACGAGTTGATTGCCGGTGTACGTGCGGAAGTGGCGATCAAGGTATTCGGCGATGACCTGCAAGCGCTGACCGCGGTTGGCAAACAGATCGAGAAAGTCGCAGGCAGCATTTCCGGAAGTGCCGACGTGAAACTTGAGCAGGTGACCGGCCTGCCGCTGCTGGTCATCAAGCCGGATCGTGCCGCCCTGGCCCGCTACGGCCTGGCCGTGGCCGACATCCAGGACACCGTATCCGCGGCGATGGGTGGGGCAACGGCTGGCCAGCTTTTCGAGGGGGATCGCCGTTTCGATATCGTGGTGCGTCTCCCCGATGCCCAGCGCCAGGACCCGAAGGCACTGGCAGCGCTGCCCATTGCCCTGCCGGCGACAAGCAGAGCCGATGGAGCTTCGTTGTCGCGGATGCCCGGCGTGGTGCCCTTGAGTGCCGTGGCCACTATTGCGGTAGAGCTAGGGCCCAACCAGGTCAGTCGGGAAAACGGTAAGCGGCGCGTGGTCATCACGTCGAACGTGCGCGGCCGGGACCTCGGCTCCTTCGTGGAGGAACTCCGGGGGAAAGTTGCGGCGGAAGTCGTGCTGCCTGTCGGAAGCTGGGTCGAATACGGCGGCACCTTCGAACAGCTGATCTCGGCCGGCCAGCGTCTGAGCGTCGTGGTTCCCGTGGTCCTGGTCATGATTTTTGGCTTGCTGTTCATGGCCTTCGGATCGGCCAAGGATGCCGCAATCGTGTTCAGCGGCGTACCCCTGGCGCTGACCGGTGGCGTACTGGCCCTGTGGCTGCGCGGTATTCCCTTCTCCATCTCAGCCGGGGTCGGATTCATTGCGTTGTCTGGCGTTGCGGTACTCAACGGCTTGGTGATGATCACCTTCATACGGAAGCTGCGTGAGCTTGGGCAACCGCTACATACCGCTGTGACCGAGGGGGCGCTGACCCGTCTGCGCCCCGTGCTGATGACCGCACTGGTTGCCAGTCTTGGCTTCGTCCCCATGGCCCTCAATGTCGGTACAGGTGCTGAAGTGCAGCGCCCACTGGCAACCGTGGTGATCGGCGGCATCATCTCTTCGACCCTGCTGACCCTCTTGGTGCTCCCGGTGCTGTACCGGCTGATACACCGGAATGAGAACGAGGAGACAGCCGCGTGACCCCTTCCCCCATTCTATTCAACAAGGAGTTTCCTTTGCCATTTCGCCAGTTCAGCGCCACCGGCATTTGCCGGTGGACCGTGGCGCTCCTCGCGTCACTGCTGCCCCTGTGGGCTTTCGCCCACGGGGTCACGGGAGAGGATCAGTCCTTTCTCGAGCAGAACACCGGCCGCAACCTGCTGTTGTTCGCCTACCTGGGAGCCAAGCACATGGTCACCGGGTATGACCATCTGTTGTTCCTGTTTGGTGTGGTGTTCTTTCTGTACCGCATGCGCGACGTCAGCATTTACGTGACCCTGTTCGCCGTCGGACACAGCGTGACCCTGCTGCTGGGGGTGCTGGGCGGTTTCCACGTCAATCCCTATGTCGTCGACGCAATCATCGGCGTCTCCGTGGTTTACAAGGCGCTGGACAACCTGGGGGCATTCAAGCACTGGTTGGGATTCCAGCCCAATACCAAGGCGGCCGTACTGGTCTTCGGCTTTTTCCACGGTTTCGGCCTGGCCACCAAGCTGCAGGACTTCTCGTTGTCCCGCGATGGTCTGGTGCCGAACATGCTGGCCTTCAACGTTGGCGTAGAACTTGGCCAATTGCTGGCACTGGCTGGAATTCTGATCGTCATGGGGTTCTGGCGCCGCAGCACAGCCTTCTCCCGGCAAGCATTCACCGCCAATACCGCACTCATGGCTGCTGGCTTTGTCTTGGTCGGCTACCAACTTACCGGCTATT
TCGTTTCCTGATCGAGGTCTTCCTATGTCCAATACTCAAACCCACTCCCTGCCCAGTAGTGCCAGTCTGTTCAAGGCAACCGCGGTAGCCGCAGGTGTTGCCGCCACCTTGCTGGTGACCATGGTGCTTCCTGCGGAATATGGGATGGACCCTACCGGCATTGGCCGTTTCCTCGGCCTCGATGCCCTTAAACAGTCTGCCGGTGCTGAAACAACATCTGTTCTGGCAACCCCGGATGCTATTGCTGGCCCCAATGCAATGCTTGCCGCCAAAGCAGATGCTGCTTTCGGAAAGCAAGCCGGCAGGTCCTTGGATGCCTCCGCCGTTTCATTGGCAGGTGATGGCCCCATGCGTCGCAACACATTCACGGTAACGCTGGCTCCCGGCAAAGGCGCAGAGGTCAAAGCGCACCTCCGGGCTGGTGAAGGCCTGACCTTCCACTGGCAAGCAACCGCCGCGGTGGCCGTGGATATGCACGGCGAAGCACCGAATGCCAAAAATGCCTGGACCAGCTATTCGGTCGAAAGTGCTCAAAAGAGTGCATCCGGCACTTTTGTTGCCCCCTTCGAAGGAAGTCACGGTTGGTATTGGCAAAACCGCGGCACCGAGCCGGTGACGGTATCCATCGAAGCCTCCGGTTTCCAATCCGAGTTGTATCGGCCGTAACGAAGCTTTCTTACACGGCCCCGTTGTATTAACCCCGCGGCTTGCCGCTTTCACATTGGAGATGTACCCCGTGAAAAACAAAACCTTCCTGTCCCTCTCTTTGCTGGTCGGGTCCTTCATGTCGCTTTCCAGCGTTGCCTATGCCCACGGTGTCCACGAAGACAGTGCCGAACCAAAGGCCACGCCCACTGCTTGCCGGCACCTCACCGACACCGAGCATTACGTGGTGGATCTAAAGGACCCCGCAACCCGGGCGCTCAAGACCCGTTGCGATGCCACCAAGAAGCCTGTAACCCCGGTGGCCGAGAAGAAGGACGAAACACCGGATAAGAAGTAACCCCTCCTGATAGTTCATTGTCGCCCGTAGAGACATAGGAAATAGCCATGCTTGAAATCCTCCGACATCGCAGTTTTAGGCATTTGTTTCTCGCCCAAGTCGTTGCATTGGTGGGGACGGGGCTTTTGACCGTGGCCCTGGCATTGCTGGCCTATGATCTGGCAGGCGCCAATGCCGGTGCGGTACTGGGTACCGCACTGGCCATCAAAATGATCGTCTACGTCACGCTTTCGCCTGTAGCGGGGGCTGTTGTCCCTGCGGCATGGCGAAAGCGTGTCTTGGTCGGCCTAGATTTGATTCGAGCGGCGGTGGCATTGCTGCTGCCGTTCGTCACCGAAATCTGGCAGGTCTATGTGCTGATTGCGCTGCTGCAATCAGCCTCAGCCTGCTTTACCCCGCTTTTTCAGTCGCTTATTCCCCAGATTCTGCCGGAGGAAAGCGACTACACCCGCGCGCTCTCCCTGTCGCGGCTGGCCTATGACCTGGAAAGCCTGCTTAGTCCGGCCCTGGCAGCGGCATTGTTGGTGGTCATCAGTTTTCACGGGCTGTTTGCCGGCACCTCCGTCGGCTTTGTTCTATCTGCACTGTTGGTCATGAGCACGGCATTTCCCGTCGTGCCAGAAACCCGTTTGGGAGATGGCCCCTACAGTCGAGCCCTCCGAGGCATGCGGATTTATCTACACACACCGCGACTACGCGGACTTCTGGCGTTGAACCTATGTGCCGCAAGTGGGGCCAGCATGGTTTTCGTGAATACCGTAGTCCTTGTCCGCGAGGTGCTGGGAGGCGGTGAACGGGAGGTGGCATGGGCTCTGGCAGCTTTCGGTGCCGGGTCCATGGCCGTGGCCTTTTCCTTGCCAACATTGCTCGACCGTATGGCGGATCGTCGAATCATGCTGAGCGCGGCATCAGCAATGGTCGTTGTGCTACTGGCGGTAACGGGGGTTTGGTGGAGTACCGGGGGTTTGGGCTGGGCAAGTCTCATTCCCGCATGGGT
GGTTCTGGGTATGTCCTACGCAGGCCTGGTAACACCCGGGGGACGACTATTGCGGCGCTCGGCCCAATCGGACGATCTACCCTTTTTGTTTGCCGCCCAGTTTTCACTCTCACACCTGTGTTGGCTTCTGGCTTATCCACTGGCGGGATGGCTTGGAGCGCGGCTGGGATTCGGTGTTGCCCTCTCCGCCCTTAGCGCCATGGCAGCAGTGGGAGGGGCGCTTGCCTGGCGCACTTGGCCAAGGCAGGACCCTGATGTGATTGCCCACCATCATGACGATCTCTCTACCGACCACCCTCATTGGAACGAGTACGCGCTTGGTGGCGGAGGTCGGACTCATGAGCACCGTTTTGTCATCGATGAACTGCATCAGCGATGGCCCCACTAAGCCGGCCACTACGCGATGGTACTGAGTGATTTTCTTCGAGTAGCAGATAGCCACCTGATTGTTGGTCGGGGACTCTTTCACATGCTCAGCCACATGTGGCTCCACATCGAGTTCCGGCTATTTGCCGTCGGAATTGGCGTGCTACTGGTGTTGCTGTTGGTACTGATTTTTATGAGCTGGGAAGAGGAGCGGTGGATTCGTGCTGTAAGGATTTTTGATTTTTTATTTCGAAAACGGAAATAAAAAAGCTCATTAATGGGGTGGTATTCGCATGCCAAAGATGGAGCTTCCACAGCAGGTGGCAAAACTAGATACAACGCCGACATCCGTAGTTGGTCAGTCCCGTATGCAGACGGCCGACTTCTCGAAAATCCTCAATCAGGCGCTTTCCAGGAGCAACACGCCAGCTGACGTAGGCGTAACCGTGCATAGAGACGGAAACAAGAAACCGGGGGACTTCCAGCAAAAAATGCTGGGAATACGGGCCTATAGGCAACAGCTCATTGCTTCAAATATCGCGAATTCCGATACGCCAGGATATAGGGCTATGGATATCGATGTCGAAGATGCTGCCAAGCAGAACCAAATGGGGCTGTTGCCATTAGCAAAATCATCTCCCAGCCATATCAACGGGAGTGCTCATTGGTCCTCTCCGCCGTTCAACCTGAAATACCGCACACCATTTCAGGCCAGTGCAGACGCAAATACCGTAGAAATGGACATTGAGCGCCAGCATTTTGCTGAGAATGCTGTGATGTACCAATTCACCCTGGATCAGGTTGGTGGCGATTTCAAAGAGCTGACTGAGTTGTTTCGAAACCTAAAATAGTTCGTTCAAGCTAGCCTCATGCCAGCCAGGCTGCAAATCCAATGTAGCAACAAATTTTATTTGTAGAGTTATCAACAGCATTGTAGAGTACACTCATGACTACATGGATCGCACTCATTACCAGCCTTCCCACCGAGAATGCCACGGCCCGCATGCGTGCCTGGCGTAGCCTCAAGGCATCGGGTGCCGCCGTCCTCCGGGATGGGGTCTATCTGATGCCGGAGCGGGAGGATTGCCGGAACACACTTGATGCCGTAGCCGCAGATGTTCGTGCTGCAGAAGGTACAGCCCTGGTCGTCCGCCTCGAGGAGCCCAGCGATGGCAACTTTGTGGTCTTCTTCGACCGCAGCGCCGACTTTGCTACTCTGCTGGGGGAGATTGCCACGGCCCGAGACACGCTCGGTCCGGACACGGTAAACGAAGCTCTGAAGCAAGCCCGCAAGCTGCGCAAGGCGTTCTCCAACCTGGTAGCCATCGATTTTTTCCCTGGAGAAGCGCAAAAGCAGGCCGATGAGGCCTTACGTGACCTTGAGCAACGAGCAGCCTGGGCTCTTTCCCCCGATGAGCCGCACCCGGTCAACGACGCTATCTCCCGCTTAAGCATTCAGGACTATCAAAAACGTCGTTGGGCAACGCGACGGCGCCCCTGGGTGGACCGGCTGGCCAGTGCCTGGCTGATTCGCCGCTACATCGATCCCCAGGCCGAACTGCTCTGGCTGGCAACGCCGGCAGATTGTCCGGCCGAGGCTCTTGGTTTTGATTTCGATGGGGCGACGTTCACCC
ATGTCGGCGCCCGGGTGACCTTCGAAGTGCTGCTTGCCAGCTTCGGCCTGGAAACTCCGGCTCTGCAGCGCATCGGTACCTTGGTCCATTTCCTGGATGTGGGTGGCGTACAACCGCTAGAGGCGGTGGGCATCGAAAGCACCCTGGCCGGCCTACGCGACACCATTCTCGATGATGACCAACTCCTGGCATTGGCCGGCAGTATCTTTGACGGACTACTGGCCTCCTTTGAGAAAGGATCGAAATCATGAGTACGATCCTGACAGCCGCCGATTCCACCTCGCCAGAGTCCAAACCTGCCGAAGTCAGCTTCTGGCAGGCCTTCCTGTTCTGGCTGAAGCTCGGCTTCATCAGTTTTGGCGGGCCTGCCGGGCAGATCGCCATCATGCATCAGGAGCTGGTCGAGCGCCGGCGCTGGATTTCTGAACGCCGCTTTCTGCACGCCCTCAATTACTGCATGGTGCTCCCCGGTCCGGAGGCCCAGCAGTTGGCTACCTATATCGGTTGGTTGATGCACCGCACCTGGGGTGGCATCGTCGCCGGTGGGCTATTCGTGCTGCCGTCGCTGTTCATCCTGATTGGGCTGTCGTGGATCTATATCGCGTTCGGCAATGTGCCCCTGGTGGCCGGCCTGTTCTACGGCATCAAACCGGCGGTTACCGCCATTGTCGTCCAGGCGGCCCACCGCATCGGCTCCAGGGCCCTGAAGAACAATGCCCTCTGGGCCATCGCTGCAGCATCCTTTGTGGCCATATTTGCACTCAACGTGCCGTTCCCAGCCATCGTCGCGGCGGCTGCAGCCATCGGCTACTTTGGCGGCCGTGTCGCGCCGGACAAATTCAAGGCTGGTGGCGGCCACGGCAAAGCGGATAAGTCCTTCGGCCGAGCCCTGATCGACGACGATACGCCGACGCCGGTACATGCCCGGTTCTCCTGGGGCCAGTTGGCGAAAGTCGCGCTTATCGGTGGCTTGCTGTGGCTGGTCCCGATGGGGCTGCTGACCGCCAGCTACGGATGGAGTCATACCCTGACCCAGATGGGCTGGTTCTTCACCAAGGCCGCATTGCTGACCTTTGGTGGTGCTTACGCCGTACTGCCCTATGTTTACCAGGGGGCCGTCGGGAGCTATGGCTGGCTCACCGGTCCCCAGATGATTGATGGTCTGGCCCTTGGCGAAACAACACCGGGACCGCTCATCATGGTGGTGACCTTCGTCGGCTTCGTTGGCGGCTACGTGAAGGCCGTGTTCGGCCCGGATAGCCTCTTCCTGGCCGGTGCGGTGGCGGCCATGCTGGTCACCTGGTTCACCTTCCTGCCGTCCTTCGTCTTCATCCTGATGGGCGGTCCCTTCATCGAAACGACCCACAATGACCTGAAGTTCACGGCGCCGCTCACCGCCATCACGGCCGCCGTGGTCGGCGTTATCCTGAACCTGGCCCTGTTCTTCGGTTACCACGTGCTGTGGCCGAAGGGCTTCGACGGGGCGTTCGAGTGGGTATCGGCACTGATTGCCCTAGGGGCAGCCATTGCCTTGTTCCGCTTCAAGGCGAACGTCATCCATGTCATTGGTGGCTGCGCGGTCATCGGCTTCCTGGTGAAGATGTTCCTGTGAGCCTCGGCATGGGAACGGCACGGTGGGTAGGGGTCGCTGTGCTGGTGACGGCTATCAACGCTGCTGCTGCCGACAGAGGCTGGGTATTGCTCCAGGGCAAGGTGCTGGCCCAAGCCCTGTCCAACCAGGATTTCGGTGATGGCGTCCACTTTGCCTACCAATTTCTCAGTGGTGGGGAGTTGCGCGGCATGAATATGGGCAAGCCTGCCAGGGGAAATTGGCGGGTCATTGGCAATGAGCTTTGCTGGCACTGGACGAGGCCCAAAGAACCAGAGGAGTGTTACCAGGTACGCCAGCGAGGACAGGCTGTCCGTCTCTATTTGGATGGTCAGGAAGTGCTTTCCGGCAACCTGACCCCGTTACCCGCCAATCTGAAGGAGATGCCCCA
ATGAAATGGATCACGCGAGAAAGACCCAAGATTGATCGTATCGCCTGTCCCTGGCTGATAAGCCGGTTTGTCGACGAGAGTCCGGAATTCCTCTACGTCCCCGCAGGTGAAGTGATGCGCATTGCGGCCGAGACCGGTGCCACCCCCTACGACGTGCCCAACACCGAATTGGGCCACCATGGCGACCAGTGCAGCTTCGATGCCTTCATCGGAAAGTACAAGCTTGAGGATGCCGCACTCAACAAGCTGGCCCTCATCGTGCGCGGCGCCGACTGCGGTCAGCCACAACTGGCCAAGGAAGCGGCTGGTCTATTGGCCATTTCAAAGGGTCTGTCTCTGAATTTCAGCGACGATCACGAGATGCTGGCCCACGGCATGGTCATCTACGATGCGCTCTACGCCTGGTGCGCCGATACCCCGCTGAAGAAAATAGGCCGGTTTCTGGGGTTGAAGTGACTGGCCGCTGGTTGCTGCCTGAAGGTGTAGACCGGGTAGTGCTGCCGCTCCTGGTGGGCAAGGCCCTGCGGGCATTTGCCGATGGCTATGTGGCAGTTCTCCTTCCAGCCTATCTGCTGGCGCTCGGTTTCGGCACCCTGGATGTCGGCATCCTGAGTACAACGACCTTGCTGGGTTCGGCATTCGCCACCCTGGCGGTGGGGGCCTGGGGCCATCGCTTCCATCACCGGAACCTGCTGCTGGGCGCCGCGCTGCTGATGCTGGGGACCGGGCTCTCCTTTGCCTCTTTGTCAGCATTCCTGCCCCTGCTCCTGGTCGCTTTCGTCGGCACCCTCAATCCGAGTTCTGGGGATGTCAGCGTGTTTCTGCCCCTCGAACATGCCCGGTTGGCCGAATCGGGCCAGGGTACTGCCCGCACCACCTTGTTCGCCCGCTACTCCCTGCTTGGAGCCCTGTTTGCTGCGTTGGGGGCGCTGGCCTCAGGCATTCCCCAGCTACTGGTATCGGTGCTGGGGATCGAGCTGCTATCAGGGTTCCGGGTGATGTTCGTGCTCTACGGGCTGGTGGGTGGCACGGTATGGCTGCTGTATCGACGGATGCCGGCACCCCGGCGGGAGTGCGCGGTGGCCGCTCCGCAGGCGCTCGGCGAGTCGAAGGGCGTTGTTGTCCGGCTGGCGCTGCTGTTCTCCCTGGACTCCTTTGCGGGAGGGCTGGCCATCAATGCCTTGATGGCCCTTTGGTTCTTTCAGCGTTTTGAGTTGTCGCTGGCTGCCGCGGGGAGCTTCTTCTTCTGGGCTGGGCTGTTGTCCGCTGTGTCCCAGCTAATCGCACCGAAGGTCGCCGAGCGCATTGGCCTGGTGAATACAATGGTATTCACCCACATTCCCGCCAGCATCTGTCTCATCGCCGCGGCATTTGCCCCGGGTCTCGAGCTGGCATTCGCATTGTTGTTTATCCGGGCGTTGCTGTCTCAGATGGACGTACCGGTTCGAAGCGCTTTCGTAATGGCGGTGGTGACACCGGCCGAGCGTGCAGCTGCGGCAAGTTTTACTGCGGTCCCACGCAGTTTGGCTTCTGCCATCAGCCCAACGATTGGTGGGGCAATGTTTGCAGCGGGATGGCTTGCAGCGCCTCTGGTTGCCTGTGGAGCGTTGAAAATTTGCTATGACTTAATGCTTTGGAAAGCATTTCGACAACGAGACCCATAACGAAGTGGATTTGTCTCTCGGGCCTCAGAAGGAGCCGAAGTCGGCAATCAAAATTGGGCAGATGCAAGGGCCCTTATCGGAATTTCAACAATAGGCTTCATTGGACGGGCAACAGAATAACAAGGCTTTCTAGCTCCAAAGACTCTCGGTTGAACCATGTCACCTTTAGGTCACCGTTGTCTTGCAGTTCCAAGAAGAGGCCGGTTGCGACACGCGGACCATCAAGTTTTTCGTTCCGGATTGCTTCAATCAACCTTGGCATATGGGAAACGAAGAGCGCAATTTGATAAGCAAGTAGGTCGGCGTCGTCAACGGAAAGCGTAAGTCGAACACCGA
AATAGTCGTTCGGAATGGGGAGAGCGACCACGATGTCGGTTCGCTCTGGAAGTTTCTTCTTTAGGCGAACAACCTTCTCCGCGAACAACTTCACCTTGTTTGCCGCGCCTACCACGAGCGCCGTCGCCTTTGCGCGGTTCTTCCACGATTCCTTTGCCGCTTCCTTCACGATCTCCGCCATATACAAAGCCGCGGCAGCCGAGAAAAGGCTAATCCACCAACTTGAATCGGCAATCAGGTGAATCCACGACGGTGGCTCAGCAGATAGGAGAGCAACGCGTCCATCTTCGATCTCAAATTCGAATTCTGGACCAACGTCTTCGCCAAATTCCTTGAGAGGTTGAATCGGTACATCTGCAGTGGAGAGCGCTCTCATAGCCTATGAAACCTCGCGTCGAAATTGAGCGGCCTGCGCATCTTTTCGCGCAGGTCCGCTCGAATGTAGTGGTAGGGGGCGTATTGGTATTGATGACACGAAGCGCCCTCAACGACTGAAGTGCCGAAGCATGAGATCGCCGTACCCCCAGACAAATGCACCCCAGATAGCAACGAACGTAATGATCCAGCTAACGGTACCGTGCTGTTTGTTGAACAGGCGGAAGAGTGCCCACGACTCGGAGAATGTTCCACCACGGATGGATTCGAAGAAGTTGTTGATTCTGAGTTGGGCAAAGACGCAGAAGACCGAAGTGATTGCACCGCTGCGCTGAAACCACGAAGCAAGAGGTTCAGATTCCGGCTTTAGCACCGCGGCAGCCGCCAACACCTCCGCCAAAACCGCGACAAAACACAACGACAGGATGAACAGCAATTCCAGTCTAAGGCGGGTCTCAACGGCGATGGTCACTTGTGCCCCTAACGTTGAAGGCAAGGGGCGGCCCACTTGCGGGACGTCCCAGCAGCCGAAGGCTGCGCCTTGAACGTGGTGTTAGGCTCCACCACTCAGCACCTCTGCGATATCTTGACGTGCGACAGCGATGAATGCCTCTTTCAGTTTGAAGAAGTCAGCAACTTCCTTCGCAGGCTGCGCACTATGACTGGTAATGACGTAGTCGGCCAATTTCTTCGCCGCCTCCACGACCATACCGGATGCGACAAGTGAAACCTCCGCGAACTTTCCGTTCAGCATGTTTAGATCGGCAAGGGATGAAATTTTCTCTTCTCTCGCCTGAACTACGAGTCGTTGCGTTTCAACTAGAAACTCCGCGTAAAGTTTGTGGCGAATAGTGCTTTGTTCTTTCGCAAGCGCAAGTTTCCATTCGTGGTTCTTTACACTGCGGCTCGCCAAGTAGTTTATGAACCCACCAATCAACACCCCCGAGAGGGTACCGAGTACGGCGATAGTTTCAGACGGCATAGTGGAGCCTAACAGTTAATAGACGGACCCATGGGTCCGTCTATGCGTAATCGTACGGACATGAGAAAAATGCCGGGAACCTTGAACTAGCGACGATTCTCGCGCCTTGTGGCCGAAATATGATTTACGAAAAAGGAGATGGAGTCGCAAGTATCTATCCATCTTGCATTTGGCTCAAGGCGAGCGTCTGCTCGATGTGAAATTCACAGTATTCATGCGAATACTCACCTAACCTTCCCTTGGACTTTCGCTGATTAAAGAGACTTTTCTATTTTCTGCCCGGAGGTTGGCGATCAAGGAATCCTTCTCTTTCAACAAATCCTCCAGCTCTTGAATACGTTGCTGGAGCGCGTAATTCTCCCGGGCAATCTTCAACAGATTGGACTGCTCCAGTTCCAGAAGCTCCCGGTATTCCTTCTTTCGTTCCTCTGCCTTTGCAAGCGCGGATGTCTTGATCTTCAACTTCGACTGGGCATCAGTGTCGTTGATGCGCTGAATCTCGGACAATATCTCCCGGTGGTACCGATACAGTGTGGAACGCTCCACTCCAGCCTCTTGAGCAACGCTGGCGGCGGTCAGGCGAGACCCAACCCTTACCTGATCGGGATGTCCGCCAACAATGCGCTGTATTGCCAAGTTCAGG
GCTTGTCGCGTCAGCTTGATCGAATTTTCCTTATGGGCCTGTAGCGCTTGCGGGTTCCCAGCATTATTGCTCATTGAGGTATTTCCAATTTGGCGATGACATCAAGGGCCTTATTCAGGATTCGCTGGGACCGGGTTTTACCAGGCAGCCCCAGATCCGACATGTCTAAAGCCGTTTGCTGCTGAAGAGCAATTTCCTTCCAGACCGGAAGGTGCTCAGGGCCGATCATCCCGTAGGCGCAATCGACGCACATGTCTGCCTCAAAAACACACAGGCCACCACAACTTGTCCCCTTGGCATTGCCGATGCACCAACTGTGGCCGGTCCCATTTAGGGTGATGCTTGACGAAATCTGTTGCAACAAGGACTGCTTATTCTTGGCAGTTTTGATGGTCTGGCGTAGATCGGCAAGCATGATGTCGCCGTTGGCCAGGGGAGCGTCTGTCATCAGGTAATTGTGTAGAACGCTCTCCTGCTTTTCTGTCTTGCTCCGCAATATTTCCGTGAGTAGGTCTGTATCCGTCTGGTAGGCATCGGAGGCTCCGCTGGTATATAGCAGCGTCATGTCAATGGACCAGTGACCAAAATGCTCTCGGAGATAGTGGAGATCGCCCAGCTCGGCTGAGGCGACGAAATAGGCATAGGTTCTGCGGAATTGATGGGATGAAAGCCGCCAATATTTGCCGTCGTCTCCCAGAATATTGAAATGTGGACAGAAGTTCTGGAGGCGTCGATGAATGGTGCTTTTGCTGAGCACTCCTACTAAATGACCCACACAGGGGGCGCATCCCAGGAAGAGCCCATCCCGGTCCTTTCTGGCGGTGTGCAGGCGTTTGAGTTGGCGTTTGTGGAAGGTCGAGCCCGGAATGGACATGTCGAGTTGCTGTTCAAATTGGGAAATCTGCTCCTCGATCTGAATTGCATACGGTTGTCGATACCACTCCATTACACGAACTGCGGTCTCAACAATTGGCGGGACGAGCCATTTGTGCGGCTTTATTCCAGTCTTGAAGATTGTGCCGTGAAGCCAGATGAGATCAATGCCATCGTCCGCTTTGTCATGAGCGATGCACCCCCTTTTTAGGGATAGCGTTTCGGAATCCCGGATTCCGGAGAACATGGCAATCACGATATAGCAGGAATCCCGGAGATAACTCAGTTCTGCACTGAGGTCACGAGAGCCCTCAAAACCCAATTCCCGGGCTAGTGGTATTCGGATATTGGTGGCTTGGAATCCCTCTTTGTTAGCCACCGCCTGCTCCAGGGCATCCCGAGTGCTTAAGATATGGGGAGCTAGGTTGGTGACGTAATTTATTGCGACTTCGGCTAGCTGGCGAACCGTGCGCTCTGGGATTCGTAGTGTCTTGATGGTTCGACCTTTGGCTCGTTCTCCTGACAGAGTGAAACCGGATTCATCAGGCCATGGATGTGTAGGTAAGGCATCGGCCAGCTTTTCACGTTGAAGATATAGCTCCTCCAAAACTATCAACCGCCTTGCATGCCAGCCCTTGGCTGTCGGTTTGCCTGTTCTCTGATTAATCTTGGCAATGGGGACATATTCCAGGGCTCTTCCCTCTATATCCCTGAACTGTCGCATGCCTCGGGACGCCATCCAGCGCAATAGGGGCGTTAGCGCTACCACATGGGCAATCACGGTTGCCATTCGCGGTCGTTGGCGCCCATCGATAGGGTCTATAAACAAGGACCAGGTAAAGTCCTTGGTACTTTCCAGCAGGGGGGCATGCTGGTGGTCAGTGAGAAATTCACCAGGTGCGATCTCAATGCGCCAGTTGATCCGCTTTTCACTATCCCTAGCGTTGTTTTGCGGGATGTAAGGCCAGAAGTCCCAGATTTCGTCCTCATACCGGGACACCACGACGGACTCACCCAGATCCGTCTTCGCCATAGAGACCGGAGTTCTGGCCTTTTCTGCGGATGAAAGCGCTCGAATGGGCTTCGCAGGATTAGCTAGTGTGCTCATATCAGGGGGGCGTCATTGC
GCCAGGCAGGATGCGGTGTGGATTGGGCTCGCTGGCGGGCTTCCGCCACTACGGCAGAATCAAACTGCACCGCAATTTGTTCGTCGATAGTTCTTATGACTGGTCCGAATGTTTTCATCCACTGGTGGGGAGGAATCTTGGGGCGCTCAGCCAGGAGACGGAAATAGAAGCTGTATAGGCGGTAAAGGTCGTCCTCAAACACCACCATGCTGGGGCAGCGAAAGCAGGCCAGGAATTTTCCGCAGACCTGGTCCGCTTCGCGAAATGGATTCCGACAGCGGGCAATCGATGTGTTGTAGCCACCGGTGAGCAGCTCAGTTGCGTTCCGAAGAGGAATCTCTCCATCTGCTGCTAAGCGGATAGCTAAGGTTTCATCTCTGCTGGTGGCCCAGCCCACCATGGCCTGGCCAATGAAGGCATGGTTGCGAACAGCCTCGGGAGTGACTGGTGGAATGTAGCGTTGGATGGTCAGGAGAATGGAACTATGGCCAAGGGCTTGTTGTAGCTTACGTAGGTCGCGATGACGTAGGTAGTAATTCAGCGCAAATGTCGGTCTGAGGCGTGCGATGCTCAGGTTCAAGGGGGCGCCTCGGTCGTCAAATAGCTCATGTCGATTGACAAAGCTTTTGAGTGCATTGCGCACCTGAACTTCGTCGAAACGAACAACATTTCCGCGACGTCGTGAGTACGTGACGTCGGCCAAACGGCAGAGAAAAACGAATGGGCGATCTGCTTCGTCAGCGTCTGACACAAAGCGCTCTGTGTATTGCTGTAGCTGCCGGAGATAATCTCCCACCATTTTGGTAATGGGTGTTGCAGTTTCCTCCGGTGTATTTTTGGGAAGGGATATGGTGCGCGTGGTATACCCACGGCGCTTCTCCAAGACCAGTAAGTCTCGATCCGGAAGAAAGCTGCGCAGGCTATCCCGGCGCATTTCCAGGAGGGGGGTCATATTCACGCCGCAGGTCAAAACGGTGATGACGGCATGAACTGCCAAGACTTGATGAGATGAGAGGGCTTCCGGATTTGCGTTGAATAGGGCCAGATCTTCTGCGCAGGCAGCAATGATCCGCTCTTGCTCCCCTGCAGAGTAACCTTCTCGTCGCGGTATCCGCTTATTGGAGTTGGGATAGGGATTCTTGGGGAAATTCAGGTTAGGGTTGATGGACTCTGGCACATGTTGCCGGCGATTTTTGAGTATTGATTTGATACCGGCATAGGTCGCATTACAGGCGGATACAGACCACGGTAACCCCTTGTTCTTTCCGTGTTGAACGGTCTGTAGGGCCAACCATGCTACAAATTGCATGATGATTAGCCGGTCGACCTGTTCCAGAGTAGTCACTACTTGGCCGGATTGCTCCAGGTCATCCAGGAAGTGCCAGAAGCAACGAATACCGCTATTGAAATAGGTCATCAGTGATTTGCCGGCAGAGACAAAACGCAGAGACCAAATGGCATCTCGCATTTGCTCGGCCAACGGCTCACGGCCTCGGGCCCGGTGGCTTGTGAAGTCGAAGTGATACTCCGCGCCACTGTGTACACAGCGAATGGTGAACGCCCATCCTGGAGGGAGGTCGATGATCTGCTTTTCTGGGGTCAGATCGATGTGGGTATTGCGATCAACCCGCTTACGCCGGATCGCCATCAGACACAATCCTCTTGGAGCAGTTTGCAGATGTCAGCTTGGTAGCCATCCAAGGCTGTGTGATCTACCAGACTGGCTGTGTGTATATAGATTTGTGTCGTTGCCAAACTGCTGTGTCCCATGCGTTCCATGACCCAAAACAGAGCGCTTTCCTTACTCTTGAATTGACTCATCCGGATAAATTCGTAGGTGCCATACGTGTGACGCAGCATGTGTGGTGTGCATTTGATTCCTGATTTCTCTGAGGCCTTCTTGAAGGCGTTGTTTATTCCGCCCAGGCTGATTGGCTCTCCATATGCTGTCAGGAAGACCCGGTTGCTCTCAGGACCAACGTGCCGTTTACGGTAGCGCT
TAGCCAGCGAAGATCTCTCATAAATTACGTAGTTCCATAACTGGACAGCTAGGTCATAAGGGACAGATACCCATCGGGGTTTTCTGCCCTTTGTTGTACGTAGCGTCATCGCGAGAGGCTTACCTGGTGGATGGCCAGATGGATTGGGAATGTCCTCCAACTCCAAGGTTGCAACCTCTTCCCTACGAAGTCCCGAAAACCACATCAAGTAGGCCATTAGGCGGTTACGTCGCGGGGTTAGCGCGGTGACGAACAGCCTGGCCTTGTCGGCTGTTAGAAATTTTGGTTGTGGCTTAAATGAACGTAACTTCAGTTCATTGGCTTGGACTTGATTTCCCTTGGCATCAATGTGCGCCAGAAATCCCTTCGGCCGGGATACCCGAACGTCCTCCATATAGAAGGGGAGGGAAGAGATTTTCTGTGAGCGCAAGGCCCATTTGTAGAAGTTGGATACAGCCCCCACACGGTGGTTGACCGTAGAGCGGCGACATCCTCGTGAAAGCATGGAGTCCCGCCAGGCTGCTAGATGAGTGCTTCTTATCCTGTCCCAGGAAATCTCTATGGTTTCCAGGAAGCTGAAGAATTCATAGAGGTGATTTCCATAGGTCTGCCAAGTCTTCGGGGACGATGTTCTTCCCCTAACTACGGCGACGTGGAAGAGATATTCATTTGGCGCTGTGACAAGCGCCATTTGGTCGTCAATGAGAAATGGGATGCCTGGGTATGGCTGTCCGTGTACCGTGAAGGTTGGGTCGGTGAGGAATAAGTGCATATCGATCCAGGCAGTAGTAATCCAGTTGTCCAGAATGGCAACTTGGAGAACCGTAACTGTAAGGACCTTTTTTTTCTACAGAGGCAACACTCTGCAGGTCTCTTAACACGTCCAGGGTCGGCATGCCTGCGGCCCCTGCCTGGAGCGGCAATAGCAATGCGGCTCCCGGCAGGGCGCCGGCCAGCAGGACCGTCCTTATTGCCGCCTGGAGGCGATGGTTTTTGCCAGGTGTGAAGATATTTTTATTTTTCATATCCGAGGGCGAGAGCAGTGGCAGGTGGTTGGATGACAGCGCAGGATAGGGCGGGCGGTTCAGGCCGGCAGGGGCCGGCGGGCGAGGCGCAGGGCGAGCAGGAAGCTGGCGGCGATGATGCATACCAGGGCGAAGCCGAAGGCCAGTTCCTTGCCATCGCTCCAGGCTTCCACCGCGGGGGAGAGCATCTTGGCGGCGCCGAAGGCGGCCACCAGCAGGCTGACGGCGGAGACCACCAGGCCCATGACCCGGGAGGCCACCAGGGCCACCGTGTCGGCCCGGGCGATGAGGCGGGAGATCCACAGGCCGTTGATGCCGTCGGTGATGAGCATGCCCAGCATGAACAGCAGGGAAAGGCCCAGGGCGTGTTCCCAGCCGCCGTGGCGGGTGGCGGTGAAGGCGAAGAGGGCGGCCTGGGACAGGGTGTCGAAGGAGAGGGCGAAGAGGGCGCCGACGGCGGCCACCAGCAGGGGGTGGCCGGCTTCGGTGAGGCGGCCGAGGAAACGGCCCTTGATGCCGGCGGGGCGCACCATCTCGCCCGGAGCGGCGGCCAGCACGGCTCGCAGGTTCACGGCGCCGAGCAGGGTGAGGAAGGCGATGGAGATGAGGCCGCCCACCCATTCGAACCATTCGGGCACCTGCCACACGTCCGCCAGGGTGCCGACCGCCAGGGCGATGGCGATGACGATGGCGCCGTGGCCGAGGGAAAAGAGGGTGCCGCAGTAGCGCGCCAGGCCCGGGTTGCGCCGGCTGTTGAAGCGGGTGAGGCCGTCGATGGTGGCCAGGTGGTCGGCGTCGAAGCCGTGCTTCATGCCCAGGATCAACACCAGGGAAGCGAGGGCGAGCCAGTTGTCGGGCAGGGTATCCACGGGCCGTGAGGGGGCCGCCGGCGGGCGGCAGAAGGTTAGAAGATGTGAAGAATAAACTTTCCGTCACACGATTGGGGTTGACCTGGATCAGAAAGCGGCAGGGAAA
TCCGGGGCGACGGGCTCGATCCGCTCCACGGCCGGCCGGGCTGGGGCCGTGGGCAGGTGTGCCGGCAGCAGGGTGCCGCCGGGGGGCTTCAGGCGGCGGGGAGGTCCCGGGGCAGGGTCAGGCGGAACACGGCGCCGCCTTCGGGATGGTTGCTGGCGACGATGCGGCCGCCGTGGCGTTCGACGATGCCGTAGGAGATGGAAAGGCCCAGGCCCGTGCCTTCCCCCACCGGCTTGGTGGTGAAGAAGGGGTCGAAGAGGTGGCCCAGGGCTTCGTCGGGAATGCCGGGGCCGTTGTCGTGGAAGGTCAGGCGCAGTTCCCGGTCCCCCAGTTCGCCGCTGATCAGCAGCTCTCCCCCGGGCTGGGCGGCGGTGGCCTGCAGGGCGTTCTGCACCAGGTTCATCACCACCTGCTGCAGCTGCCCCGGGGAGCCCCGCAGGGGCAGTTCCGCCGGCAGCTCGGTGCGCACGGTGAAGCTGGGCGGGGCGCTCTTGGCCACCCAGCGCACCGCCCGCTGCACCACCTCGGCCAGGTTGAAACGTTCCGCCGCCTCCCGGTCCAGGGCGGAGAAGCGCTTCAGGCCGTCCACGATGTCCCGGGTGCGCTCGGCCCCTTCGATCATGCCGTCGATGAGGGGGGACATGTCTTCCAGGATGCGGTCGATGCGCAGTTCGCTGCGCAGCTCCTCCAGTTCCGGGATGCAGTCGCAGCCCCGGTTGTGTACCGCTTCCAGATAGGTTTCCAGGCGCCCGGCGTAGCGCTTTAGGGCCAGCACATTGCCCAGCACGAAGCTGATGGGGTTGTTCAGTTCGTGGGCCACCCCGGCCACCAGGCGGCCGAGGGAGGCCATCTTCTCCGAGTGCAGCAGCTGCTGCTGGGTGCGCTTCAAGTCCTCGTGGGTCTGCCGCAGTTCCCGGTAGGCCCGGCGCAGTTCCCCCACCGGGCGGCCGGTGACCACCATGCCGATCAGCTTGCCGGTGCCGGAGAGGCGGGGCGTCCAGTTGAAGGAAACGGGTACGGCGCCGCCGTCCTCGGCCTGCAGGGGCAGCTCCACGTCCTGGGCGCCGTCGTGGCTCTGGCTGGCGAAGAAGCTGCGGGCCTTGTCCCGGGCCTTGTCGTCGGCGAAGAGGTCGAAGATGGAAGTGCCCTTGAGAGTCGGCTCGTCCCGTCCGGTGTAGCGCTGGAAGGCCGGGTTCACCTCCTCGATGGCGCCGTGGCGGTCGCACACCACCAGGATGTCGGACATGGAGGCGAGGATGCTGGCGATGAAGCGGTGGGACTCCTCCAGGGCGGCGTTGTTCTGCTCCAGGGCCACTTCGTACTGCAGCAGGTCGTTGTAGACCTCGTCCATCTTCTGGATCACCTCGATCCACACCTTCTCGTTCACCCCCTCCAGCAGCTGGGCCGGCTCCGGCAGCAGGCTGCCGTCGAGCAGGGGGCGGGCGGGACGCTTGGCGGTCATGGCGGCGGGACTAGCGGCTGGGCTTCAGGTGCAGGTGGCGATAGCCGTGGCCGTGGGCATGGCTGTGCTTGTGGCTGTGCTGCTTCTGGCTCTCGTCCAGTTCCACGGTGACCAGGTTGAGCTGGCCATGGCGCACGCCGCGCTCCGCCATGAGGGCCTGGGCGAAGGCCCGCACGTCCTCCGTCGGCCCCTTGAGGATGGTGCTCTCGATGCAGTGCTCGTGGTCCAGGTGGGCGTGCAGGGTGGACACGGTGAGGTCGTGGTGGTCGTGCTGGATGGAAGTGAGGCGCTCAGCCAGTTCCCGCTCGTGGTGGTTGTAGACGTAGGAGAGGTTGGCGACGCAGTGGTCCGATTCCTTGCGCGCCTGGCGCCAGGTTTCCAGCTGGGAGCGGAGGATGTCGCGCACCGCCTCTGAGCGGTTGCTGTAGCCCCGGGCGGCGATGAGGGCGTCGAATTCCCGGGCCAGGTCCTCGTCCAGGGAAATGGTGATGCGTTCCATGCGTTCTCCCTCGCCGGCGGGCGGAAGGCCGCGGAATGC
GGCAGGCCGGCGCAATGCCCGCCAGTCTACCAAAAGGCACCGGGCTTCACGGCGGCGCCGGGGTCTTTGCTGCAAAAGAAAAGGCCCGCCGGAGCGGGCCTGGGGATGCGGTGCGCCGCCATGGGCAGCGCCTCGATGCTCAGCCGCGTTGCTGCAGGGCGGCGATGCGCTCCTCGATGGGCGGGTGGCTGGAGAAGAGGCTCATCCAGCCCTTGCCGCCGGCGATGCCGGAGGCCGCCATATTGGCCGGCAGGGGCTCGGGGGACAAGCCGCCGAGACGCTGCAGGGCCGCCATCATGGGGCGCGGGTTGTTGCCCATGAGCCGGGCGGCGCCGGCATCGGCCCGGAATTCCCGCTGGCGGGAGAAGTACATGACGATCATGGAGGCGAGGATGCCGAAGACGATGTCGCACACCACCACCGTCACCATGTAGCCGAGCCCGGGGCCGGAGGATTCCTCGTTGTCCTTGCGCAGGAAGCTGTCCACCAGATAGCCGACCACCCGGGCCAGGAAGAAGACGAAGGTGTTCACCACGCCCTGGATCAGGGTCAGGGTCACCATGTCGCCGTTGGCCACGTGGGCCACCTCGTGGGCCAGCACCGCCTCCACTTCCTCCCTGCTCATGGACTGCAGCAGGCCGGTGGACACGGCCACCAGGGAGTTGTTGCGGCTGGGGCCGGTGGCGAAAGCGTTGGCCTCGCCTTCGTAGATGGCCACTTCCGGCATGGGCAGGCCGGCGTTCTTGGCCAGGCGGGCGACAGTGTCCACCAGCCAGGCTTCGGTGGGGTTGCGCGGCTGCTCGATGACCTGGGCGCCGGTGCTCCACTTGGCCATGGGCTTGGACAGCCACAGGGAAATGAAGGAGCCGCCGAAGCCCATCACCGCGGCGAACCCCAGCAGCATGGGCAGGTTGAGGCCGTTGGCGGTGAGGAAGCGGTTGAGGCCCAGCAGATTGATGACCAGGCCCAGGACCACCATGATGGCCAGGTTGGTGGCAAGGAAGATCAGAACGCGTTTCAT
Protein sequences of DBSCAN-SWA_3
>NZ_AP021844|2967016:3023959|3023077_3023959_-|WP_014236252.1|protease|DBSCAN-SWA MKRVLIFLATNLAIMVVLGLVINLLGLNRFLTANGLNLPMLLGFAAVMGFGGSFISLWLSKPMAKWSTGAQVIEQPRNPTEAWLVDTVARLAKNAGLPMPEVAIYEGEANAFATGPSRNNSLVAVSTGLLQSMSREEVEAVLAHEVAHVANGDMVTLTLIQGVVNTFVFFLARVVGYLVDSFLRKDNEESSGPGLGYMVTVVVCDIVFGILASMIVMYFSRQREFRADAGAARLMGNNPRPMMAALQRLGGLSPEPLPANMAASGIAGGKGWMSLFSSHPPIEERIAALQQRG >NZ_AP021844|2967016:3023959|2990895_2991612_+|WP_130459832.1|DBSCAN-SWA MRPSQRAADQLRQVRITRRFTRHAEGSVLVEMGDTKVLCTASIEENLPPFLRGKGQGWVTAEYGMLPRSTHTRSSREAAKGKQTGRTQEIQRLIGRSLRAVTDLKALGERQITLDCDVLQADGGTRCASITGAWVALWDACQSLVAAGKLSENPLKEHVAAISVGIYKGTPVLDLDYPEDSDCDTDMNVIMTGSGGLVEVQGTAEGEPFSRQQMNVLLDLAEAGIRQLIHAQETALAD >NZ_AP021844|2967016:3023959|3018585_3019710_-|WP_014236248.1|integrase|DBSCAN-SWA MHLFLTDPTFTVHGQPYPGIPFLIDDQMALVTAPNEYLFHVAVVRGRTSSPKTWQTYGNHLYEFFSFLETIEISWDRIRSTHLAAWRDSMLSRGCRRSTVNHRVGAVSNFYKWALRSQKISSLPFYMEDVRVSRPKGFLAHIDAKGNQVQANELKLRSFKPQPKFLTADKARLFVTALTPRRNRLMAYLMWFSGLRREEVATLELEDIPNPSGHPPGKPLAMTLRTTKGRKPRWVSVPYDLAVQLWNYVIYERSSLAKRYRKRHVGPESNRVFLTAYGEPISLGGINNAFKKASEKSGIKCTPHMLRHTYGTYEFIRMSQFKSKESALFWVMERMGHSSLATTQIYIHTASLVDHTALDGYQADICKLLQEDCV >NZ_AP021844|2967016:3023959|3004962_3005604_+|WP_014236235.1|DBSCAN-SWA MSNTQTHSLPSSASLFKATAVAAGVAATLLVTMVLPAEYGMDPTGIGRFLGLDALKQSAGAETTSVLATPDAIAGPNAMLAAKADAAFGKQAGRSLDASAVSLAGDGPMRRNTFTVTLAPGKGAEVKAHLRAGEGLTFHWQATAAVAVDMHGEAPNAKNAWTSYSVESAQKSASGTFVAPFEGSHGWYWQNRGTEPVTVSIEASGFQSELYRP >NZ_AP021844|2967016:3023959|2992539_2993130_+|WP_014235607.1|DBSCAN-SWA MQKIVLASNNAKKLKELSALLTPLGIQLIPQGELGVPEAEEPHHTFLENALAKARHAAQLTGLPALADDSGLCVKALGGAPGVQSARYAGEPKSDARNNEKLLAALTGVADRRAHFVSLLVLVRHGDDPQPLVAEGEWHGEIIDQYRGEGGFGYDPLFYVPAEKATAAELSAEVKNRLSHRGQAMARLLERLKLEL >NZ_AP021844|2967016:3023959|2981004_2981910_-|WP_152090383.1|DBSCAN-SWA 
MIVRERPSLLRLFFIWRGSVVPHVLPQIVFTTSFAVLITWGAQHFGHLFPDYSAAPFALLGLAFSIFLGFRNSACYDRWWEARKQWGGLIVELRSLARDSLVLEAEPRRLLVRRSLAFAHALAARLRGRDAALEAAPFLPPSEAERLAQSRNPADALLRQCGHDLVQARQRDGLGDIVYQGLTQRLHALSGIQAACERIRFTPLPFAYTLLLHRTAHLFCLLLPFGLARSVGWATPLLTAVLAYTFFGLDALGDELEEPFGTLENDLPLDAMVRMLEGDLGEALGETDLPPLLQPQGYVLL >NZ_AP021844|2967016:3023959|2980114_2980948_-|WP_152090382.1|DBSCAN-SWA MTIDPSALSPLGKASEYRCHYAPELLFPIPRQLKRDEIGIDPARLPFVGEDLWNAYEISWLNPRGKPVVALGTFRIPAQTPHLIESKSFKLYLNSFNQSAFADAQTVAATLVRDLSAAAGGQVTVQLEPLAAQPRPRVDYPSGILLDELDIECDRYQPAPELLQADAGRSVEETLYSHLLKSNCLVTGQPDWGMVVVRYRGPAIDRAALLRYIVSFRGHNEFHEQCVERIFCDISARCAPQSLAVYARYTRRGGLDINPFRSSGEFLPPDNIREVRQ >NZ_AP021844|2967016:3023959|2977861_2980063_+|WP_152090381.1|DBSCAN-SWA MSRPLPILSTPAAAPPEAAPLTRYSPIPWGLVIVLSLLFVVVWLLPPLGGLKQSDTIFPLTLHTVMESFSFVVSVLVFAVSWHAYSRERAGNLMILACGFLAVALLDFGHTLSYRGMPDFVTPSSPQKAIIFWLAARYVAALTLLTIALRPWQPLARPRDRYRLMLWALLVTAAVFVSELYLPDFWPTMFVPGVGLTGLKIAAEYGLIAILGATAVILYPKTQGKPAFDAANLFTAVLITILSELCFTLYSNVNDVFQLLGHTYKVIAYFWIYKAVFVSSVRDPYLRLSLEMAERQAAEARIQFLAYHDPLTELPNRILVRERFERAVERARDQSSRVGLVYIDLDNFKTVNDSLGHTLGDLLLQAIGQRLQSLVPAGSTVSRQGGDEFLILLEDLEQSRLAESLVSRIVEQMQAPFEIQGHDLSTSVSIGVSLFPDDGGDFDTLLKKADTAMYRAKGAGRNGYRFFDREMDKDVGERLRLSNDLRLALARNEFVLHYQPQIDLRTQEVIGAEALIRWQHPELGLLAPGRFIGIAEDTGLIVPIGEWVIRMACHQAAAWQRAGLPPLVVAVNLSAVQFMRGDLVGTVASALATSALPSRCLELELTESILIQDAENILGTVQRLNAIGVQMSIDDFGTGYSSLSYLKRFAVDKLKVDQSFVRDLCSDPDDAAIVRAIIQLARSLGLKTIAEGVETAEILALLQELGCDEAQGYYFAKPLPADNFSAFLSQRLS >NZ_AP021844|2967016:3023959|3010546_3010942_+|WP_014236241.1|DBSCAN-SWA MSLGMGTARWVGVAVLVTAINAAAADRGWVLLQGKVLAQALSNQDFGDGVHFAYQFLSGGELRGMNMGKPARGNWRVIGNELCWHWTRPKEPEECYQVRQRGQAVRLYLDGQEVLSGNLTPLPANLKEMPQ >NZ_AP021844|2967016:3023959|2988008_2988875_-|WP_152090385.1|DBSCAN-SWA 
MIYSMTGYAAKTREVAGGSLHLELRSVNSRFLDIHFRIVDDLRVLEPALREAITAKLARGKVELRLNLVASQSQNRQLAINADLLTQLQALEGQVRQTLPNAAALSVAEVLRWPGMLGEPEVDTAALHAAVQATLKEALEDFTASRAREGAKLAAMIQERVDKIRATVAAVAPLIPQAQAAYQDKLKQRLVEALGSADDERVRQEVVLYATRIDVDEELSRLQAHLTEVERILKAGGNAGKRLDFLMQELNREANTLGSKSVLSEVSKASMDLKLLIEQMREQIQNIE >NZ_AP021844|2967016:3023959|3020022_3020832_-|WP_152090389.1|DBSCAN-SWA MDTLPDNWLALASLVLILGMKHGFDADHLATIDGLTRFNSRRNPGLARYCGTLFSLGHGAIVIAIALAVGTLADVWQVPEWFEWVGGLISIAFLTLLGAVNLRAVLAAAPGEMVRPAGIKGRFLGRLTEAGHPLLVAAVGALFALSFDTLSQAALFAFTATRHGGWEHALGLSLLFMLGMLITDGINGLWISRLIARADTVALVASRVMGLVVSAVSLLVAAFGAAKMLSPAVEAWSDGKELAFGFALVCIIAASFLLALRLARRPLPA >NZ_AP021844|2967016:3023959|2987149_2987356_-|WP_014235614.1|DBSCAN-SWA MARITVEDCLKQIPNRFQMTLAATYRARQIANGSTPMQEPSKDKPTVIALRELAAGQIGLEILNRGQA >NZ_AP021844|2967016:3023959|2974784_2977310_-|WP_152090380.1|DBSCAN-SWA MNLTRRDFIKSSAVAAAANAAGMAVPGVSEALAQQPKNDGIRWDKGVCRFCGTGCGVLVGTKDGRVVATQGDPEAPVNRGLNCIKGYFLSKIMYGKDRLTQPLLRMKNGQYDKNGDFTPISWDQAYDIMAEKCKAALKAGGPRNIAMFGSGQWTIWEGYAAAKLWKAGFRSNNLDPNARHCMASAVAGFMRTFGIDEPMGCYDDAEHADVFALWGSNMAEMHPILWSRITDRRLNAKHVKIHVLSTFTHRSCELADNELIFKPQSDLAILNYIANYIIQNGAVNQDFVKNHVKFKKGVTDIGYGLRPNHPLEQAAGNNGYPGPDGKPKGDPNKATDISFDEFKAFVAEYTLDKTHEISGVPKENLEALAKAYADPKVKVVSYWTMGFNQHTRGTWVNNMIYNVHLLVGKISEPGNGPFSLTGQPSACGTAREVGTFAHRLPADMVVVNPKHREITEKLWKLPAGTIPDWVGLHAVAQSRALKDGKVAFFWSTTTNNMQAGPNINGEVYPGWRNPAAFVVHSDVYPTVSALAADLILPSAMWMEKEGAYGNAERRTQFWRQQVKPQGQARSDVLQYVEFSKRFKMEEVWPAELLDKAPEYKGKTLYDVLYANGEVNKFPVSDQLKGFENEEGKVLGFYLQKGLFEEYAAFGRGHGHDLAAFDTYHKARGLRWPVVDNKETLWRFREGYDPYVKAGEKVRFYGFPDGKAVVFALPYQPAAEQPDAEYDLWLCTGRVLEHWHTGSMTRRVPELYKAMPDAWIYMHPEDAKKRGLQRGDTVKVQSRRGEISTRVETRGRNKPPLGLVFVPFFDEHRLVNKLTLDATCPISKETDFKKCACKVVKA >NZ_AP021844|2967016:3023959|2993274_2994510_+|WP_152090388.1|DBSCAN-SWA 
MSSRSSSRIIPIAVAGGTRAGGSPLHFTSPPPLSLYIHVPWCVRKCPYCDFNSHEARAENDEAAYVAALVADLESALPSVWGRKVSTIFIGGGTPSLLSGEALHELLNAVRMRLPLLPEAEVTLEANPGTAEAGKFAAFRAAGVNRLSLGIQSFNDRHLEALGRIHDSAEARAAIELAKAHFERFNLDLMYGLPQQSQAEAMADLEMALSFAPPHLSCYQLTLEPNTLFAARPPQLPEGDTCADMQDAIEARLAAAGYVHYETSAFARPDYQCRHNLNYWTFGDYLGIGAGAHGKLTLPDHSGFSVQRQMRWKQPKQYLEQVAAGQPVQEQHGVGADELPFEFLMNALRLNQGFDPALFEQRTGLPLLLVRGELEKAAREGLLTLAPDCIAPTERGRRFLNALLERFLPDA >NZ_AP021844|2967016:3023959|2967016_2967958_-|WP_152090375.1|transposase|DBSCAN-SWA MGASILSAPYFHNEEAAYEFVESRLWPSGPVCPHCGCVERISKMGGKSTRIGAYKCYNCRKPFTVKVGTIFESSHIPMRLWLQAIFLISSSKKGISANQLHRTLGITLKSAWFMSHRIREAMRSGDFSPFGSEGGPVEVDETFIGRDYTKKPKGEKKGRGYDHKNKVLSLVDRTSGQARSMVVDDLKAKTLIPILEANIAREARIMTDEAGQYKNVGQHFAGHAFTRHGMGEYVSKIDPTIHTNTIEGFFSIFKRGMKGVYQHCGHHHLNRYLAEFDFRYNNRKALGIEDQERAEKLLQGVKGKRLTYETTAQ >NZ_AP021844|2967016:3023959|2991625_2992534_+|WP_152090387.1|DBSCAN-SWA MAQETSRDPIKALLDDLEQSIADFDQRLGGVEESPAVTGLRSSGQRYPDIEPEARRQLSPAAPVAVAGNADATAVSEAPAVDLLAELAQAAACRSVDDAETQRRQLELTERLHQDLKTVFDYLNQLIRHANTLKPVLPRSYRLDARNSFDGLAWHDGFVDYRSTSRFDRSYYEQILFQVSYRAPAPLVAVCAADQAAIVRKELELVNLRIQREEPVMLPEGGPGVRYVLPDAIPLHLAVQADFANDALTFRCRNAGNFGPTAYRLPGGSITRPLLDGIGLVLLGRSDTMPKELQRIPYQRIN >NZ_AP021844|2967016:3023959|2981906_2982758_-|WP_152090384.1|DBSCAN-SWA MLIDTHCHLDAAEFAPDREAIFQDGVTAGVQAMVVPAVAAATFAEVRACCLAYPGCAPAYGIHPLYTPAAREEDLSTLRRWLAEERDGPLAPLAVGEIGLDLYVPELQQGEALARQQHFFAEQLQLAVEFDLPVILHVRRALDPILKQLRRYRPRGGIAHAFNGSRQQADEFIKLGFKLGFGGAMTFSGSTRIRELAATLPLEALVLETDAPDIPPAFLTAASPDRRNKPAYLPRFAALLAELRGMPTAELIAATGANARAALPGLAALASATPTTTPPTAST >NZ_AP021844|2967016:3023959|2998560_2999811_+|WP_014236231.1|DBSCAN-SWA 
MNLRCLLVLAVAGTCGIPLSGYAAESLRLEEAVSRALASHPSLAAEAAQLKAVQARAQREGLATPFMIGADVENVGGTGAFRGGQSAETTLRIGRVIELGGKREARQALGSAEINQQQNLSEATRLDVISRTSLRFISVLADQQRLKYAQEQVGQAERTRREVANWVAAARNPESDLRAAEIAVADAELERTRAEHKLTSARLTLASSWGVLTPDFETAAGNLLVLPKAESLDTLVARLPMTPEQRAALLEADSIAARKRLAEAGAKPDVTVNLGVRRLEATSDQALMMSVSIPLGNQVRSGLSVAEANAQLMALEARRDAQRFEHYQSLFGKYQELNQARTEAETLQKHMLPKAEEALAFTRRGFEAGRFSFLALAQAQKTLFELRQRAVDAAARCQILMTEVERLTAIAPEPTP >NZ_AP021844|2967016:3023959|3012699_3013314_-|WP_014236244.1|DBSCAN-SWA MRALSTADVPIQPLKEFGEDVGPEFEFEIEDGRVALLSAEPPSWIHLIADSSWWISLFSAAAALYMAEIVKEAAKESWKNRAKATALVVGAANKVKLFAEKVVRLKKKLPERTDIVVALPIPNDYFGVRLTLSVDDADLLAYQIALFVSHMPRLIEAIRNEKLDGPRVATGLFLELQDNGDLKVTWFNRESLELESLVILLPVQ >NZ_AP021844|2967016:3023959|2968362_2969466_-|WP_152090376.1|DBSCAN-SWA MSVAPPSPSQRQARSRYQRGMLAWLQQPGDPAGLPEMRAAVRHLEAAAGGDFAPFWHSAEVFLRAISDGTLAVDAESRRLCARIDLQMRAALNGSEAPEGGLAEELQQCIRQGAGQLPPVTELISLMAKPEAPDLDAEAVAAWSAAGNAAVAAWNGRGSGDLAPFRRALIDLCAAAMSLNLPETLHLAESLAGVGDLLDAPEAAEDPYLRAAIAAALELLGDTRDLGLPVFAERVAHVAQRLAECRESQRPAVSPTLLRLFAGEIGEQAALMREELACLEPDGEALAESAHCLADHAAHLELDSAEALAQGLAAAIVRAQAGHGFDHPEVREALEAALAELDTMADFLLVAQPLPEATDILEILAQV >NZ_AP021844|2967016:3023959|2998098_2998500_+|WP_133247329.1|DBSCAN-SWA MYQSLRHRFRAYVQHPIGRCAVVLMLFALVAASVPFGEIHAHADGDHDHDHGYVTAELTKASLSDPSDSMDSDSDSTGAKVLHAHGSVVTPPPLPVDGLGIEPFIFPARDKITLAYLSRPSATPLPPYRPPIA >NZ_AP021844|2967016:3023959|3008250_3009189_+|WP_014236239.1|DBSCAN-SWA MTTWIALITSLPTENATARMRAWRSLKASGAAVLRDGVYLMPEREDCRNTLDAVAADVRAAEGTALVVRLEEPSDGNFVVFFDRSADFATLLGEIATARDTLGPDTVNEALKQARKLRKAFSNLVAIDFFPGEAQKQADEALRDLEQRAAWALSPDEPHPVNDAISRLSIQDYQKRRWATRRRPWVDRLASAWLIRRYIDPQAELLWLATPADCPAEALGFDFDGATFTHVGARVTFEVLLASFGLETPALQRIGTLVHFLDVGGVQPLEAVGIESTLAGLRDTILDDDQLLALAGSIFDGLLASFEKGSKS >NZ_AP021844|2967016:3023959|2972310_2972787_-|WP_130459820.1|DBSCAN-SWA MNRLHKLTLAILAASFACLAQAADAPKTMRGADIPAGDPAPEVKAYAGKKPGLQQPIARTYKEQPPVIPHAVDNFDEITLEENQCLTCHGPEKYKEKKAPKIGESHFIDREGKQHAEVTHLRHNCVQCHVPQVDAPPLVENTFVGNIAASKDAKAKKK 
>NZ_AP021844|2967016:3023959|2972797_2973718_-|WP_152090378.1|DBSCAN-SWA MSFLSPRFPAALAAKGRLGANRWLLLRRLSQFGILGLFLLGPLAGLWLVKGNLSYSLTLDTLPLADPLLVLQVLFSGHRPEGLALLGAAIVLAFYLLVGGRVYCSWVCPMNLVTDLAGWLRERLGLKGSAHISRRSRYWILGLTLLLPLAGAGLAWELINPVSMLHRGLIFGLGAAWTVVLAIFLLDLLIMSRGWCGHLCPVGAFYSLLGRTSLLRVSARRRQDCDDCMDCFAACPEPQVIRPALKGEANGTGPVILASACTNCGRCIDVCAKDVFVFGSRFNQHTQRCAPAGEAEGQTDHRKTIH >NZ_AP021844|2967016:3023959|2989974_2990886_+|WP_014235610.1|DBSCAN-SWA MKFTIYQESRIGKRQNNEDRIAYCYSREAVLMVVADGMGGHYHGEVASQIAVQTLTSAFQRDAQPEIADPFLFLQKGMTNAHHAILDYSQEHRLKDSPRTTCVACLIQDNIAYWAHVGDSRLYHMRDGKVLAVTRDHSRVRLLMDEGLISEAQAATHPDRNKVYSCLGGENPPEIEFSRKTPLEVGDVLVLCTDGLWGPLPADVMAASLKGANLMQAVPMLLNQAEIRSGPYGDNLSVVAVRWEQSYSEEASSTVMTQTMPLDAVTTKLGEFGRDPAYKTDLSDDEIEKAIDEIRAAIQKFSK >NZ_AP021844|2967016:3023959|3013866_3014295_-|WP_043797807.1|DBSCAN-SWA MPSETIAVLGTLSGVLIGGFINYLASRSVKNHEWKLALAKEQSTIRHKLYAEFLVETQRLVVQAREEKISSLADLNMLNGKFAEVSLVASGMVVEAAKKLADYVITSHSAQPAKEVADFFKLKEAFIAVARQDIAEVLSGGA >NZ_AP021844|2967016:3023959|2996760_2997930_+|WP_014236230.1|integrase|DBSCAN-SWA MAHDTNPTLREALFMYCERISIHKKGHAQEKYRINLYCRYSIADLPIRNITSVDVATFRDERLAEINARTGRALSPATVRLDLALLSDLFRIAKNEWGICNDNPVANVRKPKLPPGRDRRLAPREERMIMRHCSQRGAHEMKAIVQLALETAMRQGEILGVCWEHINLKSRIVHLPDTKNGSKRDIPLSMEARDILAAQRVKLSGRVFSYTNNGLKSSWRSMIKRLNIPDLHFHDLRHEAISRLMERGVFNLMEVAAISGHKSLSMLKRYTHLRAQRLVRKLDAGANKGKAAVLSYLVPYPAFIEPYESQVKVTFPDFDDLHVAGPCLNSAVQQAQDALLREILVLMRQGRPIPPPNNYLELLDESRLFHLDPLATYDSLADLAEGALV >NZ_AP021844|2967016:3023959|2999807_3001040_+|WP_014236232.1|DBSCAN-SWA MNRLLPLILVPLLLTACGNDTPPSAVVAAEKASAAEEYERGPHRGRMLRQGDFALEVTIYETNVPPQYRLYAYQNGKPLPPASVQAAIQLKRLDGEFNNFTFTPEKDYLNGSSEVIEPHSFDVEVKAQHAGQSYSWAFPSYEGRTTIPAAAANDAGVKVEKAGPTTIRNTVRLMGAVMVDANRRAEIKARFPGIVRAVNVQEGQRVSRGQTLVAIEGNDSMRTYSVVAPFDGIVLARNTNVGDVAGSNTLVELADLSSVWVELRALGGDAEKLSVGQEVEISSATGGSRVTGKIQTLLPLASGQSVVARASIANPEGRWRPGMAVSADVTVAARQVPLAVKESGLQRFRDFTVVFTQVGDTYEVRMLELGERDGRYAEVLGGLKQGATYVAEQSFLIKADIEKSGASHDH 
>NZ_AP021844|2967016:3023959|3014523_3015012_-|WP_014236245.1|DBSCAN-SWA MSNNAGNPQALQAHKENSIKLTRQALNLAIQRIVGGHPDQVRVGSRLTAASVAQEAGVERSTLYRYHREILSEIQRINDTDAQSKLKIKTSALAKAEERKKEYRELLELEQSNLLKIARENYALQQRIQELEDLLKEKDSLIANLRAENRKVSLISESPREG >NZ_AP021844|2967016:3023959|3005674_3005941_+|WP_043797801.1|DBSCAN-SWA MKNKTFLSLSLLVGSFMSLSSVAYAHGVHEDSAEPKATPTACRHLTDTEHYVVDLKDPATRALKTRCDATKKPVTPVAEKKDETPDKK >NZ_AP021844|2967016:3023959|2971516_2971993_-|WP_014235627.1|DBSCAN-SWA MSQRPFKVLGIQQIAIGGPSKDKLKTLWVDMLGLEVTGNFVSERENVDEDICAMGKGPFKVEVDLMQPLDPEKKPAVHTTPLNHVGLWIDDLPKAVEWLTANGVRFAPGGIRKGAAGFDICFLHPKGNEESPIGGEGVLIELVQAPAEVVDAFAKLAG >NZ_AP021844|2967016:3023959|2994600_2996535_-|WP_152090900.1|DBSCAN-SWA MGVADAASVGTVTAKQLENRPLLRPAEVLETVPGLIVTQHAGDGKANQYFLRGFNLDHGTDFSVTIDGMPINLPTHAHGHGYLDLNFLIPELVERIQYKKGPYAAEDGDFSSAGSARIDYRRALPEDYVSIGLGSNGYRRLLTAADKETAGGGRWLGAVEVFHNDGPWEVPEHYKRLNGVLRYSEGTRNNGHSLAFMAYDGDWTSTDQLARRAVDQGLVNRYGSLDPTAGGNTRRLSLSGQWARQDGAVQTRANTYLVDYRLNLFSNFTYAMDDPVNGDQFEQADRRRYGGFGWSRSQPVQWLGKEGDFTWGVQGRQDDIDNVGLYRTAARQRLSTVRSDSVNQGSLGLYGQWGAQWSDWLRSVAGLRHDRYRFKVDSSLAANSGKENDGITSPKLSLIFGPFANQEFYYNWGQGFHSNDARGTTIRVDPSNPGDPMSRVPALVKSRGQEVGWRSAPAPGWNTSVALWRLDLDSELLFVGDAGTTQASRPSHRQGMEWSNYWTPRDWLTLDADIALSKARFRDDSSVGNHVPGAVERTASVGVAVHDLGPWRGGLRLRYLGPRALKEDDSVRSGSSVMVNLNVGYKLAAKSQLTLEVLNLFNRKASDIDYYYESQLRGEAAKVEDVHSHPAEPRIWRLTYTQGF >NZ_AP021844|2967016:3023959|3004244_3004949_+|WP_050804349.1|DBSCAN-SWA MPFRQFSATGICRWTVALLASLLPLWAFAHGVTGEDQSFLEQNTGRNLLLFAYLGAKHMVTGYDHLLFLFGVVFFLYRMRDVSIYVTLFAVGHSVTLLLGVLGGFHVNPYVVDAIIGVSVVYKALDNLGAFKHWLGFQPNTKAAVLVFGFFHGFGLATKLQDFSLSRDGLVPNMLAFNVGVELGQLLALAGILIVMGFWRRSTAFSRQAFTANTALMAAGFVLVGYQLTGYFVS >NZ_AP021844|2967016:3023959|2977311_2977560_-|WP_014235622.1|DBSCAN-SWA MNISSILVNAGPQQIAAVEAGLATLAGVEVHAVSEEGRMIVTIESDGDRETTQTYEAIQQLPGVMSLAMVYHHFEPDPEKES >NZ_AP021844|2967016:3023959|2973714_2974695_-|WP_152090379.1|DBSCAN-SWA 
MSDLSNSPPAKSDKAAAARRQFFADAGRMACGVGLLGLGLGFHAKQARALPPAALRPPGAGAEEDFLGACIRCGLCVRDCPYGTLSLARPEQPVSTGTPYFVARQVPCEMCEDIPCVKACPTGALDHGLTDINQARMGLAVLLDQETCLNFLGLRCDVCYRVCPVIDKAITLELRPNTRTGRHSMFIPAVHSEHCTGCGKCERSCVLETAAIKVLPVPLAKGELGQHYRVGWEEEQKAGHSLVDDKGLGDLPDRMPEGARLEGHFDPASQGGPSLVPGKPATPGSGVDSLAPSIPGADAHGPGVPAIPQNIPGGGLPNRLSDEAAR >NZ_AP021844|2967016:3023959|3013422_3013785_-|WP_043797805.1|DBSCAN-SWA MTIAVETRLRLELLFILSLCFVAVLAEVLAAAAVLKPESEPLASWFQRSGAITSVFCVFAQLRINNFFESIRGGTFSESWALFRLFNKQHGTVSWIITFVAIWGAFVWGYGDLMLRHFSR >NZ_AP021844|2967016:3023959|2982804_2983596_-|WP_014235617.1|DBSCAN-SWA MSTVLSHLEDGVLTLTLNRPEALNALNLAMIEDLRAATARAEHDEAVGAVVLRGGEHFMAGGDLKWFHSQLALPPAERQALFEQTIAAVHATTLQVRRMGKPVVASVSGAAAGFGLSLMLACDLAVAADNAYFTLAYCHIGLSPDGGATWFLPRAVGAKRAAEIALLGDRFDAAQAREWGLINRVVPAAELEAESAKLARRLAAGPRQALARTKALLQASSGNSLPEQLFAEQGNFAACSVHPDFAEGLGAFLEKRKPAFGQK >NZ_AP021844|2967016:3023959|3010938_3011394_+|WP_014236242.1|DBSCAN-SWA MKWITRERPKIDRIACPWLISRFVDESPEFLYVPAGEVMRIAAETGATPYDVPNTELGHHGDQCSFDAFIGKYKLEDAALNKLALIVRGADCGQPQLAKEAAGLLAISKGLSLNFSDDHEMLAHGMVIYDALYAWCADTPLKKIGRFLGLK >NZ_AP021844|2967016:3023959|2968030_2968231_-|WP_014235630.1|DBSCAN-SWA MNQMPLVPPGRDTLPRIERLLRRELALFFTLCALLGAFLAYGALRTLASPSAPPHPVLQARADVRP >NZ_AP021844|2967016:3023959|2987383_2987992_-|WP_130459829.1|DBSCAN-SWA MSGHLYIVTAPSGAGKTTLVRLLLQNDPAIGLSVSHTTRAPRTGEENGQAYHFTDVADFLARVDRGEFLEWAEVHGNYYGTSRTWIEQQLAAGRDVLLEIDWQGAQQVRKVFGDAIGVFILPPSMEELARRLAGRGTDSEDVIARRLAAARDEMRHVGEFDYVIINNDLQTALSDLLAVVRATRLKLPVQQERHASLFASLL >NZ_AP021844|2967016:3023959|2983630_2984890_-|WP_152090899.1|DBSCAN-SWA MGRVRVLVLGAGVVGVTSAWFLAEAGHEVTVVDRQPGAALETSFANGGQISVCHAEPWANPRAPFKALEWLGKEDAPLLFRLRYDPALFAWSLRFLANCPPGATRRNIRDIIALALYSRQRLQALRQTLPLDYDQRCQGILHIFTQAAEFEAACHAAALMREFGVDREPVDAARCVAIEPALAAVQGRLAGGDYTPSDESGDAHRFTQRLAEAAAARGVQFRYNCPVEKIASAGGRVAGVVAGGDLLLADAYVVALGSYSPALLKPAGVKACVYPGKGYSATIALSPDSVAPSVSITDDERKIVMSRLGNRLRVAGTAEFNGHNLELTPVRCEALLRRALELFPQLRPDGDPLYWCGLRPVTPSNVPLIGRTRLPNLWLNTGHGTLGWTLSCGSAAALADLISGRRPEPDFPFLGTTKQ 
>NZ_AP021844|2967016:3023959|3001050_3004209_+|WP_014236233.1|DBSCAN-SWA MLERMIRAAIAHRWLVLILVLGTSALGVWSYGRLPIDAVPDITNVQVQVNSEAPGYSPLEAEQRVTFPVETALAGMARLKYTRSISRYGLSQVTVVFEDGTDIYFARQQVSERLQQASSQLPAGVKPTLGPVATGLGEIFMYTVEATPGATKADGKPWMPTDLRTLQDWVIRPQLRNLKGVTEVNTIGGNVQQFHVTPDPAKMVAYKLTIDDLLQAIERNNANTGAGYIERGGEQNLIRIPGQVGDEAGLREIVVAMRDGLPLRISDIATVQIGSELRTGAATRDGREVVLGTVFMLIGENSREVAMRAATRLKEIDASLPEGVSARAVYDRTQLVDRSIATVQKNLLEGALLVIVVLFLLLGNIRAALITAAVIPVAMLMTITGMVQNRVSANLMSLGALDFGLIVDGAVIIVENCLRRFGERQHALGRLLSIEERFQLAAKASAEVIKPSLFGLFIIAAVYLPIFALSGVEGKTFHPMAITVVMALVAAMVLSLTFVPAAIAQFVTGKVEEKETRLMQRLHGIYAPLLEKSLSLQKPVIGAAAVLVVLCGLLATRLGTEFIPNLDEGDIALHALRIPGTSLTQAIGMQAQLEARIKQFPEVDKVVGKLGTAEVATDPMPPSVADTFILLKERKDWPDPRKSKATLVAELEEAVRAIPGNNYEFTQPVQMRMNELIAGVRAEVAIKVFGDDLQALTAVGKQIEKVAGSISGSADVKLEQVTGLPLLVIKPDRAALARYGLAVADIQDTVSAAMGGATAGQLFEGDRRFDIVVRLPDAQRQDPKALAALPIALPATSRADGASLSRMPGVVPLSAVATIAVELGPNQVSRENGKRRVVITSNVRGRDLGSFVEELRGKVAAEVVLPVGSWVEYGGTFEQLISAGQRLSVVVPVVLVMIFGLLFMAFGSAKDAAIVFSGVPLALTGGVLALWLRGIPFSISAGVGFIALSGVAVLNGLVMITFIRKLRELGQPLHTAVTEGALTRLRPVLMTALVASLGFVPMALNVGTGAEVQRPLATVVIGGIISSTLLTLLVLPVLYRLIHRNENEETAA >NZ_AP021844|2967016:3023959|3011390_3012602_+|WP_014236243.1|DBSCAN-SWA MTGRWLLPEGVDRVVLPLLVGKALRAFADGYVAVLLPAYLLALGFGTLDVGILSTTTLLGSAFATLAVGAWGHRFHHRNLLLGAALLMLGTGLSFASLSAFLPLLLVAFVGTLNPSSGDVSVFLPLEHARLAESGQGTARTTLFARYSLLGALFAALGALASGIPQLLVSVLGIELLSGFRVMFVLYGLVGGTVWLLYRRMPAPRRECAVAAPQALGESKGVVVRLALLFSLDSFAGGLAINALMALWFFQRFELSLAAAGSFFFWAGLLSAVSQLIAPKVAERIGLVNTMVFTHIPASICLIAAAFAPGLELAFALLFIRALLSQMDVPVRSAFVMAVVTPAERAAAASFTAVPRSLASAISPTIGGAMFAAGWLAAPLVACGALKICYDLMLWKAFRQRDP >NZ_AP021844|2967016:3023959|3015008_3016919_-|WP_014236246.1|integrase|DBSCAN-SWA 
MSTLANPAKPIRALSSAEKARTPVSMAKTDLGESVVVSRYEDEIWDFWPYIPQNNARDSEKRINWRIEIAPGEFLTDHQHAPLLESTKDFTWSLFIDPIDGRQRPRMATVIAHVVALTPLLRWMASRGMRQFRDIEGRALEYVPIAKINQRTGKPTAKGWHARRLIVLEELYLQREKLADALPTHPWPDESGFTLSGERAKGRTIKTLRIPERTVRQLAEVAINYVTNLAPHILSTRDALEQAVANKEGFQATNIRIPLARELGFEGSRDLSAELSYLRDSCYIVIAMFSGIRDSETLSLKRGCIAHDKADDGIDLIWLHGTIFKTGIKPHKWLVPPIVETAVRVMEWYRQPYAIQIEEQISQFEQQLDMSIPGSTFHKRQLKRLHTARKDRDGLFLGCAPCVGHLVGVLSKSTIHRRLQNFCPHFNILGDDGKYWRLSSHQFRRTYAYFVASAELGDLHYLREHFGHWSIDMTLLYTSGASDAYQTDTDLLTEILRSKTEKQESVLHNYLMTDAPLANGDIMLADLRQTIKTAKNKQSLLQQISSSITLNGTGHSWCIGNAKGTSCGGLCVFEADMCVDCAYGMIGPEHLPVWKEIALQQQTALDMSDLGLPGKTRSQRILNKALDVIAKLEIPQ >NZ_AP021844|2967016:3023959|3007672_3008155_+|WP_083834012.1|DBSCAN-SWA MQTADFSKILNQALSRSNTPADVGVTVHRDGNKKPGDFQQKMLGIRAYRQQLIASNIANSDTPGYRAMDIDVEDAAKQNQMGLLPLAKSSPSHINGSAHWSSPPFNLKYRTPFQASADANTVEMDIERQHFAENAVMYQFTLDQVGGDFKELTELFRNLK >NZ_AP021844|2967016:3023959|2984886_2987118_-|WP_130459828.1|DBSCAN-SWA MDTATDPAPAKPSASSAAPFAPAPDPAAPPTPYPFNDDPAYRVFLDSLDYLKPEEIAKIKEAFAFGEAAHRGQKRLSGEPYITHPLAVAGAIAEWRLDSTAIIAALLHDTMEDTGISKEELTERFGKGVADLVDGLSKLDKIEFSSYQEAQAENFRKMLLAMAKDLRVILIKLTDRLHNMQTLGCMRPDKRRRIALETLEIYAPIANRLGLNTVYRELQDLSFKHTHPMRYQVLLKAVMAARGNRREVLSKILDGVQSKMRDSGIEAQVFGREKSLYSIYRKMVEKRLSFSQVLDIYGFRVVVKDVPSCYLGLGALHALYKPLPGKFKDYIAIPKANGYQSLHTTLIGPYGMPVEVQLRTEEMHHMAQEGVASHWLYKDTEKSAAELQYQTHRWLQSLLELQSTAGDSAEFFEHVKIDLFPDEVYVFSPKGKIFSLPKGATPVDFAYAVHTDVGNRCVAAKINYELMPLRSELNSGDQVEIVTAAHANPNPAWLSYVKTGRARSKIRHFLKTRQHEESAALGERLLNQELFGLGITPSELPDASWEAVLKEGGSKSVKEVYTDIGLGKRLAAVVARRLLAHEAALPNAEPAPHTSVVIRGTEGMAIQLAHCCRPIPGDPIIGSIKKGQGLVVHTHDCAVIRKSRSAEPQRWIDVEWEPEPGKLFDVDIHVAARNARGVLAKVATEIAESGSNIEKVSMAPDPGFYTTLNFTVQVANRAHLARVLRAVRLIPEVVRITRERQEE >NZ_AP021844|2967016:3023959|3009185_3010550_+|WP_014236240.1|DBSCAN-SWA 
MSTILTAADSTSPESKPAEVSFWQAFLFWLKLGFISFGGPAGQIAIMHQELVERRRWISERRFLHALNYCMVLPGPEAQQLATYIGWLMHRTWGGIVAGGLFVLPSLFILIGLSWIYIAFGNVPLVAGLFYGIKPAVTAIVVQAAHRIGSRALKNNALWAIAAASFVAIFALNVPFPAIVAAAAAIGYFGGRVAPDKFKAGGGHGKADKSFGRALIDDDTPTPVHARFSWGQLAKVALIGGLLWLVPMGLLTASYGWSHTLTQMGWFFTKAALLTFGGAYAVLPYVYQGAVGSYGWLTGPQMIDGLALGETTPGPLIMVVTFVGFVGGYVKAVFGPDSLFLAGAVAAMLVTWFTFLPSFVFILMGGPFIETTHNDLKFTAPLTAITAAVVGVILNLALFFGYHVLWPKGFDGAFEWVSALIALGAAIALFRFKANVIHVIGGCAVIGFLVKMFL >NZ_AP021844|2967016:3023959|2969562_2971437_+|WP_152090377.1|DBSCAN-SWA MFHSPSAPRDRPALLWQALLFFLLLLPAAVRAHPLVLDQDDGSFALVPHVEVLEDPGGKLDLAAVRQAAAAGRFAPAHALGELNFGYSSSAFWLRIPLESRLQRSSPWLLEIAFPSLDRVELFLPRADGRVDYQLTGDRLPFAERPYPNRNLVLPLELAPGESLALYLRVESEGSLTLPLTLWTPDAFRLHNQDAYAGFSLYYGMLLALGLYNLLLFFALRERIYLVYVAFAVSMAVGQLSLNGLGNEYIWPAFPAWGNVALPSGFAATGFFGAIFTRLFLNTRHSNPRADKLILALAAGFAVAALGPALLPYRWAAILTSLLGAAFSAVAVAVGVHAQLRRHPGARYFLLAWSLLLVGVGMMALRNLGWLPTTLFTSYGMQIGSALEMLLLSFALADRIQAERLARELAQGEALHSKQDLVNALRSNEQLLEARVAERTRDLAAANDRLLANEQQLQRMARHDPLTGLANRLLLDDRISHGLAVGRRNGTRLALLLIDLDGFKPINDKHGHAVGDQLLVVLADRLQRSVRAVDTVARLGGDEFVLVLEDLAAVEDGRQVAAKVVAEMSRPVVLEGRELLVSASAGLAFYPEDGEDAQTLLRRADEAMYEAKRAGRNTFRQVGQ >NZ_AP021844|2967016:3023959|3022410_3022899_-|WP_014236251.1|DBSCAN-SWA MERITISLDEDLAREFDALIAARGYSNRSEAVRDILRSQLETWRQARKESDHCVANLSYVYNHHERELAERLTSIQHDHHDLTVSTLHAHLDHEHCIESTILKGPTEDVRAFAQALMAERGVRHGQLNLVTVELDESQKQHSHKHSHAHGHGYRHLHLKPSR >NZ_AP021844|2967016:3023959|3005988_3007326_+|WP_014236237.1|DBSCAN-SWA MLEILRHRSFRHLFLAQVVALVGTGLLTVALALLAYDLAGANAGAVLGTALAIKMIVYVTLSPVAGAVVPAAWRKRVLVGLDLIRAAVALLLPFVTEIWQVYVLIALLQSASACFTPLFQSLIPQILPEESDYTRALSLSRLAYDLESLLSPALAAALLVVISFHGLFAGTSVGFVLSALLVMSTAFPVVPETRLGDGPYSRALRGMRIYLHTPRLRGLLALNLCAASGASMVFVNTVVLVREVLGGGEREVAWALAAFGAGSMAVAFSLPTLLDRMADRRIMLSAASAMVVVLLAVTGVWWSTGGLGWASLIPAWVVLGMSYAGLVTPGGRLLRRSAQSDDLPFLFAAQFSLSHLCWLLAYPLAGWLGARLGFGVALSALSAMAAVGGALAWRTWPRQDPDVIAHHHDDLSTDHPHWNEYALGGGGRTHEHRFVIDELHQRWPH >NZ_AP021844|2967016:3023959|3016915_3018586_-|WP_014236247.1|integrase|DBSCAN-SWA 
MAIRRKRVDRNTHIDLTPEKQIIDLPPGWAFTIRCVHSGAEYHFDFTSHRARGREPLAEQMRDAIWSLRFVSAGKSLMTYFNSGIRCFWHFLDDLEQSGQVVTTLEQVDRLIIMQFVAWLALQTVQHGKNKGLPWSVSACNATYAGIKSILKNRRQHVPESINPNLNFPKNPYPNSNKRIPRREGYSAGEQERIIAACAEDLALFNANPEALSSHQVLAVHAVITVLTCGVNMTPLLEMRRDSLRSFLPDRDLLVLEKRRGYTTRTISLPKNTPEETATPITKMVGDYLRQLQQYTERFVSDADEADRPFVFLCRLADVTYSRRRGNVVRFDEVQVRNALKSFVNRHELFDDRGAPLNLSIARLRPTFALNYYLRHRDLRKLQQALGHSSILLTIQRYIPPVTPEAVRNHAFIGQAMVGWATSRDETLAIRLAADGEIPLRNATELLTGGYNTSIARCRNPFREADQVCGKFLACFRCPSMVVFEDDLYRLYSFYFRLLAERPKIPPHQWMKTFGPVIRTIDEQIAVQFDSAVVAEARQRAQSTPHPAWRNDAPLI >NZ_AP021844|2967016:3023959|3021026_3022400_-|WP_152090390.1|DBSCAN-SWA MTAKRPARPLLDGSLLPEPAQLLEGVNEKVWIEVIQKMDEVYNDLLQYEVALEQNNAALEESHRFIASILASMSDILVVCDRHGAIEEVNPAFQRYTGRDEPTLKGTSIFDLFADDKARDKARSFFASQSHDGAQDVELPLQAEDGGAVPVSFNWTPRLSGTGKLIGMVVTGRPVGELRRAYRELRQTHEDLKRTQQQLLHSEKMASLGRLVAGVAHELNNPISFVLGNVLALKRYAGRLETYLEAVHNRGCDCIPELEELRSELRIDRILEDMSPLIDGMIEGAERTRDIVDGLKRFSALDREAAERFNLAEVVQRAVRWVAKSAPPSFTVRTELPAELPLRGSPGQLQQVVMNLVQNALQATAAQPGGELLISGELGDRELRLTFHDNGPGIPDEALGHLFDPFFTTKPVGEGTGLGLSISYGIVERHGGRIVASNHPEGGAVFRLTLPRDLPAA >NZ_AP021844|2967016:3023959|2989012_2989978_+|WP_152090386.1|DBSCAN-SWA MASQANHPLPAGFQLEDYRIEKQISVGGFSIVYLAHDASGKAVAIKEYLPASLALRSEGQTKPVISQEHLSAFRYGMKCFFEEGRALAKLNHPNVIQVLNFFRANDTVYMVMEYERGRTLQEFIQKHHGHIHEKFIRGVFTRMLNGLREVHTHKLLHLDLKPSNIYLRADNTPVLIDFGAARQTLHSDTPMLKPMYTPGFASPEHYFKRDELGPWSDIYSVGASMYSCLAGAAPQAADARMEKDQLQPASVRWEGQYSDQLLETIDWCLCLNHLYRPQSVFALQKALTEAVDMPGQGASKAAEKEGWLGHLVGKIKGMTAK |
53 | Bacillus_phage(28.57%) | protease,transposase,integrase | attL 2991252:2991270|attR 3028742:3028760 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021845_1 | 11331-11433 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_1
|
1 spacer
spacers of NZ_AP021845_1
>1.1|11367|30|NZ_AP021845|CRISPRCasFinder ATCTCAACCCGCTCTAGGATTCGGCTGACT |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_1
The CRISPR arrays of NZ_AP021845_1 >merge|NZ_AP021845|1|11331-11433|CRISPRCasFinder GCAGGATATACCCCTCATTCGATGGGTGGTTACAGGATCTCAACCCGCTCTAGGATTCGGCTGACTGCTGGATGCACCCCAGATTTGCGGGGTGGTTACAGGT >NZ_AP021845|1|1|11331-11433|CRISPRCasFinder GCAGGATATACCCCTCATTCGATGGGTGGTTACAGG ATCTCAACCCGCTCTAGGATTCGGCTGACT GCTGGATGCACCCCAGATTTGCGGGGTGGTTACAGGT
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein 
MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein 
MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein 
MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
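The spacer records above use a pipe-delimited FASTA-style header. The sketch below parses one such record, assuming (this is not stated in the source, only consistent with the data) that the fields are spacer ID, start coordinate, spacer length, contig accession and detection tool. The names `Spacer` and `parse_spacer` are hypothetical helpers introduced for illustration.

```python
from typing import NamedTuple


class Spacer(NamedTuple):
    spacer_id: str
    start: int      # assumed: 1-based start coordinate on the contig
    length: int     # assumed: spacer length in bases
    contig: str
    tool: str
    sequence: str


def parse_spacer(record: str) -> Spacer:
    """Parse one '>id|start|length|contig|tool SEQUENCE' record."""
    header, sequence = record.lstrip(">").split(maxsplit=1)
    spacer_id, start, length, contig, tool = header.split("|")
    return Spacer(spacer_id, int(start), int(length), contig, tool, sequence.strip())


# Values copied from the NZ_AP021845_1 record.
ARRAY_START = 11331
FIRST_REPEAT = "GCAGGATATACCCCTCATTCGATGGGTGGTTACAGG"
record = ">1.1|11367|30|NZ_AP021845|CRISPRCasFinder ATCTCAACCCGCTCTAGGATTCGGCTGACT"

spacer = parse_spacer(record)

# The declared length matches the sequence, and the spacer begins
# immediately after the first repeat of the array.
print(len(spacer.sequence) == spacer.length)
print(ARRAY_START + len(FIRST_REPEAT) == spacer.start)
```

Both checks hold for this record, which supports reading the second and third header fields as start position and length.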
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021845_2 | 11530-11698 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_2
|
2 spacers
spacers of NZ_AP021845_2
>2.1|11566|30|NZ_AP021845|CRISPRCasFinder CTGATATTGACAAGGCCTTGGCAGTTGTCG >2.2|11632|30|NZ_AP021845|CRISPRCasFinder GTCAGGAGTATCAGCCAGACAATGAACTTG |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_2
The CRISPR arrays of NZ_AP021845_2 >merge|NZ_AP021845|2|11530-11698|CRISPRCasFinder GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGGCTGATATTGACAAGGCCTTGGCAGTTGTCGGCTGGATGTACCCCACATCTGAGGGGTGGTTACAGGGTCAGGAGTATCAGCCAGACAATGAACTTGGCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG >NZ_AP021845|2|2|11530-11698|CRISPRCasFinder GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGG CTGATATTGACAAGGCCTTGGCAGTTGTCG GCTGGATGTACCCCACATCTGAGGGGTGGTTACAGG GTCAGGAGTATCAGCCAGACAATGAACTTG GCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein 
MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein 
MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein 
MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
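The split array listing for NZ_AP021845_2 alternates repeat and spacer segments. As a rough consistency check, the sketch below walks those segments from the array start and recomputes each spacer's start coordinate and the array end. It assumes 1-based inclusive coordinates, and `walk_array` is a hypothetical helper name, not part of any tool mentioned in this record.

```python
from typing import List, Tuple


def walk_array(array_start: int, segments: List[str]) -> Tuple[List[int], int]:
    """Return (spacer start coordinates, array end coordinate).

    Segments alternate repeat, spacer, repeat, ... starting with a repeat.
    Coordinates are assumed to be 1-based and inclusive.
    """
    spacer_starts = []
    pos = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:          # odd positions hold spacers
            spacer_starts.append(pos)
        pos += len(segment)
    return spacer_starts, pos - 1


# Segments copied from the NZ_AP021845_2 array (location 11530-11698).
segments = [
    "GCTGAATGCACCCCTAATCCTGGGGGTGGTTACAGG",   # repeat 1
    "CTGATATTGACAAGGCCTTGGCAGTTGTCG",         # spacer 2.1
    "GCTGGATGTACCCCACATCTGAGGGGTGGTTACAGG",   # repeat 2
    "GTCAGGAGTATCAGCCAGACAATGAACTTG",         # spacer 2.2
    "GCTGGATGCACCCCACATCTGAGGGGTGGTTGCAGGG",  # repeat 3
]

starts, end = walk_array(11530, segments)
print(starts, end)
```

The recomputed spacer starts can be compared with the coordinates in the spacer headers (11566 and 11632) and the end with the reported array location.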
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021845_3 | 12193-12294 | TypeIV-A |
NA
Consensus repeat of NZ_AP021845_3
|
1 spacer
spacers of NZ_AP021845_3
>3.1|12229|30|NZ_AP021845|CRISPRCasFinder GAATATCGGTTCTGCGGTCGCAGATTGGCC |
csf3gr5,csf2gr7,csf1gr8,csf5gr6,DinG,PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_AP021845_3
The CRISPR arrays of NZ_AP021845_3 >merge|NZ_AP021845|3|12193-12294|CRISPRCasFinder GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGGGAATATCGGTTCTGCGGTCGCAGATTGGCCGTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG >NZ_AP021845|3|3|12193-12294|CRISPRCasFinder GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG GAATATCGGTTCTGCGGTCGCAGATTGGCC GTTGGAGGCACCGCTCAGATGAGGGGTGGTTACAGG
>NZ_AP021845.1|WP_152090939.1|10540_11263_+|hypothetical-protein MTPLKITFQVSGGFVPPPYPLHLDALLAYAQTFDALGDVADEPGIPQLRALADDMPIQRFEKDGDWCYMASAVQPEGPVLNDARFYTQRMNQDDYSARVGREHIQHGRHKPGSPMERYQIQLETARGVHRNLLGFYPVQQSATSSGALLTLVGWCIAEKWWVEDRLLNGRITHIGARRRSGHGKIQSIAIEEDNLAMSQWRLRVRPWKLLDDDLEIRAAWKPPYWAPENRGTAFCSSQLI >NZ_AP021845.1|WP_152090938.1|9447_10539_+|type-IV-CRISPR-associated-protein-Csf2 MNSIQIQLNLTSPLYIAYPDNVDKTANVSRTTKLRLMNNGRLYDLPIYPANGFRGGLRRAAAARVVEALSAKEGPVPGDLYLGLTCGASSASPDQTPKTVEEIIRGRGNVYMGLFGGGARLLSSMYRVSDMLPVLQATIETGAVPDYLAELVMPKFQKEGEPAKHAGPWEVMSERTSIRVDDLYRVMRPEEIKAYVKNPLETVAAHQDGVLANKEGRKTDGDTKTDVSNMMGIETVAPGVPFYFCIDLDKDVTPGQVGMLLLSLRDLFQENAFGGWTRCGFGKVRVNQIKIAYDDQDLAWSDFYGTSHFELPDAANVYTSQAQEEIGSLTTAEMASFFEDFSAGKKAEAKAKAKAKKTAPAEA >NZ_AP021845.1|WP_152090937.1|8688_9438_+|hypothetical-protein MRQITASHLTIQAAGIKAIGAVLAGPDHVGQRCAVCGADINPGDPIDKLDLPRTFTNQSSLAIPNGKWRCGACNAIMGNSEFQMGASTILVCSEGVFPIVRKEHRAWAFLTPPKPPFVIAIQNAKQQHVIWRAPVSLSKDLIMVRLGEQIFRLRRQKLLNCVEIAKRIDAARITPGRPVKDAIENPFVNDWKFQSAEGGRLKSWVWKLQAEQKIAPEDFMELTTLNGGEAWALTAFLSATITKPDPLNF >NZ_AP021845.1|WP_172974821.1|8071_8692_+|hypothetical-protein MLAKAADGRAILPPQFFHYGEDGKPLSTGEAEIRTIGSKNWVGVLSKTGNAELFDPCVGIATRVAANHYGSPAKMEVMELEYGLEAAAMPVFYNLSRAAFKRRSAKRRALSTEEIIKEYLLRQLNDEAERFGFDLPPDSALKIQVHHAKEMGMRLNLNTGLSNEYVSLVDANFSMYLELHGMWQIGNLQARGHGLIYRAKPGGVWS >NZ_AP021845.1|WP_152090935.1|5373_7878_-|DEAD/DEAH-box-helicase-family-protein 
MPTLNVPKSACSLLHAGFHPNDVSDEIRLFIRGAIQGLICQAIDDGIPPAELPAAAETGIPMNISFTKGHDRAIRNLSKEKKIREGDAALSYLYAAIARGDAIERRKSVSESPLHPYVAALGLTDRQHQNVFGEALIETLSGKSIGMVEGATGIGKTLGMVAAAAHVLEGRSFGRSLIAVPTLTLLRQFARQHQALADAIPEFPSARFILGKNEFVSVGELRILLDSGTFSEYSSTILGWLGQNGPSPSSDQAIDHRFLISSLIAIAPQFPVDAVRCGNLTDDGDPGMASYRAQFEVDESERLECEIIYCTHAMLAADIRRRMFGARSSEEGLEIRQRHRTIRQEAMGLRAALDDAGNEAYRDAGMDIKNSIDGELFELAAQAVALDAGILPSWQYLLIDEAHLFESNLANTFAFNLSLGRLLQHINAAQAEGAVSAAAAKRASKAMTIIRHAGENNDDINLKSTSPQARDVCAALNELLSIVTGVKPSKTSPTITTLKGQASVIRTALRLATTSVLGRSMLCYSPIRAFPQLSVGRASVSSELAFLWHSCEAGACVSATLYLRRLDKDSASYMAGILNIPTNRMREYPVIRPHWVTAPVAGLWIPESTKNPSGRLWLRPPTRSDKLDTEQYRLREEEWLEDLSAEIRKIQVSAAGGTLVLMTSYTSAKGLAERLADIDGLVVAEQGVSISRQVEGFVNQHSAGKKPLWIAVGGAWTGVDINGKDYGLATPGEDNLLTDLVIPRFPFGTNMSMTHRHRAEQASNVPWDLLDAAMRFKQGLGRLVRREGLPPNRRIYVLDGRMNEPTFDFFMSHLRRIIGIYPVKTLKRSAAIDD >NZ_AP021845.1|WP_152090934.1|4030_5377_+|hypothetical-protein MGKSHQQWREDLRKVMHELQALEDDEASLKGERRTSEEDLGKLKSRIDGLRRHLDDLAAAGCTAEEKLRKAKDRLAGYWPDLAADDHDQERSSPWAHPEWRAARIRVFLAALNLHQAFIEENASKMMANLGIAMDMLQGGIPDPKVRVQALDSLAIACPVISTTFASVPSLCGSMSSEGIGWLLIDEAGQATPQAAAGAIWRARRVVVVGDPLQLEPVVTLPRSVEASLAACNGGVNSRLHPSRTSVQKLADQTTAIGTTVGEGDDAIWVGAPLRVHRRCDEPMFSISNEVAYDGLMVHHKKPAALTWPASYWLDVPGGQGNGNWIPAEGEALRGLIQNLLGQAQVPADDIFLISPFRDVVRELKGMGKAFGLDYRRVGTVHTTQGKEADVVIMVLGGGTAGARDWASSRPNLLNVAASRAKARFYVVGDRKDWSKRRFFDVLSKNLS >NZ_AP021845.1|WP_004883034.1|2592_3792_-|tyrosine-type-recombinase/integrase MAKIKLTKSAVDAAQPQAEAVELRDTLVPGFLCKITPAGRKVFMLQYRTNAGERRKPSLGLYGELTVEQARSLAQEWLAQVRRGGDPAAEKAEARQAPTVKELCTKFMEDYSKKRNKLSTQAGYQAVINRNIIPLLGRKKVQDVKRPEIAGLMEKLSYKQTEANKVFSVLRKMFNMAEVWGYRPDGTNPCRHVPMFPAGKSTHLISDEEMGNLFRQLDKIESEGLENYVIPLGIRLQFEFAGRRSEIIALEWNWVDLQNRRVVWPDSKTGGMSKPMSEEAYRLLSTAPRQEGSRYVLPSPSHAGKHLTTGEYYGGWSRALKAAGATHVGTHGIRHRSATDIANSGIPVKVGMALTAHKTVVMFMRYVHTEDKPVREAAELVANRRKTITGMQGAKEVAA >NZ_AP021845.1|WP_004883035.1|1528_2596_-|DUF1016-family-protein 
MTRRKASVSAPAAPPALLGDIRALIEASRQRVASAVNAELTLLFWRIGQRIHTEVLAGQRAGYGDEILPTLAAQLVRDYGRSFADKNLRRMVQFAATFSDEPIVVTLSRQLSWSHFVALLPLKDPLQRDYYVQMASAERWSVRTLRERIDSMLYERTALSKKPDETITQELAAMRDAQRMSPALVMRDPYILDFLGLRDTWQEGDLEAAIIREMESFLLELGAGFSFLARQKRIQIDDEDFHLDLLFYNRKLRRLVAVELKIGEFKAAYKGQMELYLRWLDKHEREPEEASPLGIILCTGKKSEQIELLELDKSGIHVAEYLTTLPPRAVLGERLQQATERARLQIEQRQPGEKS >NZ_AP021845.1|WP_004883036.1|41_1289_+|hypothetical-protein MKNIFEEINEFSSEKIALFSFGKFCYVFLNKDPIFVKKLLPLIQTSLANESFQADVMRAYTEGCMNEKAAILKEFEAKRDHPNAAKFYGPQLDLVDKRLAIKTIQHLMDYLNNYLNEYPGSLEILNNSYKHIHDEDGVSYIKENYANYRIGCIFYSKHQSIMGRAEMLELKYSKVVEREYEKIGIDIRKEDAQFSKYSLVSLNENIQIFNDKDSQTIRDERIGRHFWIKVPRKLLTSIEELIEKGMLSEIAFRIDYVSDYVPAMEEMEFGAPLRLKISSLPRLSKFYSTDKYENNLWIHHDAEKLSLTFEELMEDFEVAGDDVVTQVIHLEYSSKGDDFFITHLDHEFIVYTLDSYQERLSNANIKGHRKIKTFKIDNSMIPFDINISGDLFLFQVLDSYLKNDDLIREYFEKIN >NZ_AP021845.1|WP_152090940.1|12610_12820_-|hypothetical-protein MSSLPKIDHQENAERNLGIAIDRLDEMRWAVVGVDSPDAECLAKFDEGVAKLKEALTVIRHSPSKTTGR >NZ_AP021845.1|WP_152090941.1|12821_13283_-|hypothetical-protein MNKADLIWWPDNKKGVIINAVAARDRATARFVVTRAIRLVFPMPIMVLASYLISTTLTFPDDAPGWVLTWVEWGPTTLAWACATMTLVVVAMLLIDWRRDRSAAVALACEASALGVDIAKLDGDWVFEALVLPMVRRKGITLPDGSVAHLTEE >NZ_AP021845.1|WP_152090942.1|13293_13818_-|hypothetical-protein MTLASCIGCGCDDLHACVDDLGPCSWIVVDRDAGRGVCSCCESHMERWNAGDRSVLMLVAKITRDGETEPYFIEKNGIGSFPYFLEVEAGDKFSIEWVEMSQEQFEALPQFEHVRLAEKWLEEITAAGDAESEGKMDEAGTHRAEAQRLSEKVAERGFDVLDLIEADELPASLL >NZ_AP021845.1|WP_152090943.1|14138_14459_-|hypothetical-protein MKIDPHEVTSRSIAVVSEVPLIATQWHRHGDHPRVHGMNGSNWELFPDEDDVRYGLLSGGHSGCLLVESGDWIVWNEVFKTYAIFKPDQFEALFSTSAAAIAPSER >NZ_AP021845.1|WP_152090944.1|14759_15089_-|hypothetical-protein MTPNWQPIEALPLIAGMLDDQLHSLHTQVGNLEQCRHRPWVLDGETVNRLQAVFGEQMDSLPVFREQLARWLELPLDEHQRQEINRLNAVLDQMKAAIERILSLAGNIR >NZ_AP021845.1|WP_152090945.1|15091_15607_-|hypothetical-protein 
MEEYSRPATLDDLKALIASLNEQCADYLLIGGYALFAHGYHRATTDIDVLVPATQEAGIKIRSALMVLPDQAAKNIDPAWFDEGENIRVADAFIVDIMLNACGETYETLKKYAETLDVDGVPVRTINLEGLLLTKQTMREKDVSDRIILERALETLKERVSKPESDHGLGL >NZ_AP021845.1|WP_152090946.1|15606_15834_-|hypothetical-protein MRTIGRRKEHPITFSASAELLVEGARFNDEIHRLPTGNTTHIPKGLYRFKSFEEANQHQQDCLVAGMAKIALERK >NZ_AP021845.1|WP_152090947.1|15850_16288_-|hypothetical-protein MEYRDFLKEIPEGLGATEKTLQLWFVIYQWILEHGYADSADSPAFLQHINAFRKCSTQTVATHLRRMSDAKLIKRYVLRRKLSGEAKEELSIGSLLFAPGAESIPTTFVRYCLPGQQCPTEFKSYEAAVSALDGRMNAIRETVRP >NZ_AP021845.1|WP_152090948.1|16290_16860_-|hypothetical-protein MNTLETQNASTVIDMATAVRERASIKVYANGNLVGEISIAEHEAFKLAAKNDRSLYREQALNYLEATFRLVGRIVLSIPENWFVIAVLLALMMPSEFNSLVSAIIANPSTSSTEFLNTVRWALAASIASTALVAVISGESFGLANVFDDRVAMMIRAKFKLPPMCKLFVDAEQILDGLPSHQAPTHFGK >NZ_AP021845.1|WP_152090949.1|16868_17072_-|hypothetical-protein MAEFTILVGDEVVRLTKKEVEALRKSLKTDVLVTPEDWTRSELQSRSQARKKLMDALYSAEKDIILR |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021845_4 | 307466-307630 | Orphan |
NA
Consensus repeat of NZ_AP021845_4
|
3 spacers
spacers of NZ_AP021845_4
>4.1|307483|31|NZ_AP021845|PILER-CR AAACACGCTGACGGGGAGGGAGGCGCATGGG >4.2|307531|43|NZ_AP021845|PILER-CR TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA >4.3|307591|23|NZ_AP021845|PILER-CR TCGCTGACATCCCTGGAATCCGA |
CRISPR arrays and Neighbor proteins around NZ_AP021845_4
The CRISPR arrays of NZ_AP021845_4 >merge|NZ_AP021845|4|307466-307630|PILER-CR TTGCACGTCGACGTGCAAAACACGCTGACGGGGAGGGAGGCGCATGGGCTGCACGTTGACGTGCATTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGATTGCACGTCAACGTGCATCGCTGACATCCCTGGAATCCGATTGCACGTCAACGTGCA >NZ_AP021845|4|1|307466-307630|PILER-CR TTGCACGTCGACGTGCA AAACACGCTGACGGGGAGGGAGGCGCATGGG CTGCACGTTGACGTGCA TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA TTGCACGTCAACGTGCA TCGCTGACATCCCTGGAATCCGA TTGCACGTCAACGTGCA
>NZ_AP021845.1|WP_152091267.1|305586_306732_-|hypothetical-protein MQAIIAAGVAADNPLAPIFARKATLAKMAECSEVTVYRAMRQLEDAGWISRSEQVRLDDGSMDIGLLSITKKLATLVGLCLDEEVHGSAEESKRSQDIDSEPITVNNKKHTLTQNIAGREGAPTAAALVGNGPKQDGSDAKLCTQMKDGLIAGPIYRGEQRVDPKASVNYQSTRPGFVRIDGRSVAQELVWLIEEKRLTFGALFQLQTLAKQVPGQTLSDFVAYRSERIKQLTTTNDCYRYLKKLISDGIDARYLCAQRAKKEHRVMRRQQRDKAASSRAAWCRARHEMTFLNTQTGVTYRINANHELLEVGENGLPTSRPNLAITSKFIKAVEEGRLVPFRIQEPVINLELGNRRLDEMASKFPWLRRKGKGSPVEEVRA >NZ_AP021845.1|WP_152091266.1|305013_305346_-|hypothetical-protein MSSAVLQGAQVPLVAEFHQGTQVLIRWALWHKQHAYPKAILIGVFRKEDIRLVVYQYGKALCVAVKMADVAPKIRDWREWRQRYATVKGKLIQRGFALVDEREEVSHVAH >NZ_AP021845.1|WP_152091265.1|304361_305027_-|hypothetical-protein MSLTDFDLFGHPVAAPQNVTPPRRRVVSQAAKRAKLARHLEKSNNLVLFDELLDFLKAPLLKQQYDMDATFASDVGTVAVIEVDEDGNTQDEDSALVIPYEAWGEEWVTDSNGLAWSKEGLLFTQVRLFWRSMEELALNNNEQEKWSVLRWIFRPAIWKHYVYDKRIGRSHCFEVHERDETFSFHNCCIAARVDEDTVREGVRRNIPAEVVKAVEKVCKFD >NZ_AP021845.1|WP_152091264.1|303146_304286_-|AAA-family-ATPase MFDLSAILVQVSYSLCEVFGLDPNEVDPSITVDGFEIPDPSTLTDPVAQQYAAFLRAAVPPIDPYYQFRKDLVRDIRYWWLTGEGDVLLLWGPTGSGKTSVFEQWCARLGIPLFMAKGHRRFEPMEAFGQFVGGENGTTPWVDGPVTLAARYGLPCIINEYDRIAADRTIVFNDVFEGRSFPIPGKSGEVVTPQPGFRVAITANTNLVEDLSGNYGTANTHDISILERIVALHVGYPSDDTEAKLLEKELEQFSDDLLSYWFDQEGIKISTPQGMKEGSAINRGEFIQGLLEVAKKIRAQSKDGGNTSDSALERTMSTRILRKWARHSVAQASAPEKLGLSALHLALKKYLSSLSTESTRIALHQAVENVFGVGEVVKP >NZ_AP021845.1|WP_152091263.1|302680_303136_-|hypothetical-protein MSNNKFVGVAYWSVATQVADWLARAVDVLMPLERAGIAVLAYDCLPGGNQLTVTFGEERHQMLVTDGKGRGVVRASEAAAVFVVCLVALRKALGDISVVTDSQETVPALPRQNYPLYADSWQRVLPVAQELGLATGAAFTARHNAVLNNVF >NZ_AP021845.1|WP_152091262.1|301730_302669_-|hypothetical-protein MSAHIIDKLLFLVIIVAAARVLWRFYPYRQVVVEEDSREVVAPEVQTQPEVVQQQMDAAPVSSEREQVTASPLFVRHLNDMATLMVRYRHRKQACTAHLIVWSGSKRKGYSDSYFDLGLIEGQELTEQVIEQSLALAKQQLVDLAEKGKRKRRESKKQKQEAAAVAEVVVAEAAVSEVVVAEVAEAPADVAIVAATEVVEQKAEVPPLVVEDTPPESIKLRKFPSVYRGIIKEIGMMTQNKDGREFETFGVRFETQEGIVDAVFGVNLREALRDAKADVGDQVEILKIGRKTITKGKAPMNLFKIAKLECTA 
>NZ_AP021845.1|WP_152091261.1|300765_301308_-|single-stranded-DNA-binding-protein MASVNKVILVGNVGADPETRYMPNGDAVCNLRLATTESWKDKNSGEKRELTEWHRIVCYRKLAEITSQYVKKGSQLYLEGRIKTRKWQDKDGQDRYTTEIEMTEMQMLGNRRSGDSDGSGDDRPPRQHSAGGGGNGEPPARERRPAPAYDPMEDDIPFCRVDMNADPAFVKASARVRRVA >NZ_AP021845.1|WP_152091260.1|300423_300684_-|hypothetical-protein MYGAPYLGYGCEIVHSFNTLIDAIKRRQVLRPRSKLAINLAGGDIEVNLLPNGFVELDGKVQPVAVEKEIEDAMQQFGAVLKEVLV >NZ_AP021845.1|WP_152091259.1|300076_300421_-|KTSC-domain-containing-protein MHPNFTPVSSSNIDGYLYMPDRKILLIAFKSGGTYAYEDVEQPVATGFAQASSKGKFFRSDIKDRYATSKLDDMAVANLLGGMGASVPPQPRRKAPRVTLQSLLSRYPMLNAVF >NZ_AP021845.1|WP_152091258.1|298723_299683_-|hypothetical-protein MQKDQSSLAGTLPANRRTIVAMPDPILGSLRWPNRPKLPEGNPCWTYMVEGTREQYAVAVGHVENGRPHPFEVWVLANEQPRCLGAMAKTLSADMRTQDRAWLTRKLEVLASVTGDLAIDLPLGTERLLASSNTAAVARIVQYRLNQLGVANPEEGEATPLVDALLPVRDAGHEGTLSWTADIKNPSSGDDFTVFLPEVQTEDGQHRPVAVRLSGRYPRDLDGLAAILTMDMAVVDVAWIGMKLRKLLDYDEPMGSFLAKTPGTGKTERYSSIVAYLARLILHRYATLGLLTASGYPVVEMGVMVSVPGDASNVVPIAA >NZ_AP021845.1|WP_152091268.1|307728_308742_-|ParB-N-terminal-domain-containing-protein MKQRVIKRPDMPLTALGAPAPGADTSTDKGELPAVSATTQPPSPPLLHLAGAEEVVDIPVSKLRVSPCNARKIRLPKRVSKIAESLKNNGQKDPLYVYPGAGDDEGYFMVLGGETRRLGALQIALPTLKAFVDRKVDPTDALNLTKISNILNDSADECDLDRGMVAIDLLEKGHTQGEVAEVLELESHTHVQRLIKLAGLPKRFIDFGQDYPERFSASLGAYISQAIDRHGEDFAHDLLKAALVDELPHRKIAKAIEAGPSDKQPGQERGKRLRRDGGFDIPTPDAPGGRYDVYKSKTPGLKVLKLQVEVPDELAKDLNEKLTEVLTQFIQTSRDQQ >NZ_AP021845.1|WP_152091269.1|308759_309899_-|AAA-family-ATPase MSDLRPTYAYIGAVEHRLKSAAALLGVSENTLRTTLAESGIEVRRANQDNPNAPAVRLFDLPTIFQIAEYRRAKKLTKGPEGKKPIVIAIEIIKGGTGKTTTAAEVAVQLQLQGLKVLGIDIDIQANFTQLMGYEADLTEDEAAMYGLTEEAIVNGTFATICGPFIERNGRPVDAKAIIKYPFGPSGPAIIPADTFFSDLEHDISKTGGKRELVFQKFFKESLAGNVPGLNVGDFDVVLFDCPPNISFVATNALASADIVIAPVKMESFSVKGLSRLIGEVHTLKAEYGGEVKDPELVILPTYYSTNLPRVGRMQEKLAQYRANTSPVSISQSEEFPKSTDNYMPLTVIKPTCQPVKEYRMFVDHLIKRINEVSKARAS >NZ_AP021845.1|WP_152091270.1|309895_310324_-|hypothetical-protein 
MNAFGRDGDSRQKEHSNVNYRSRDGRETQGSLSSATRVPVPRGHAPHPIFPPSWSIPTRKPQFTAVFHYCGFPEVAHTLANLETPGPEGAENFVQKKMAECGYMATVGRVKTRKAAFCLPMRLTNNTRAKIMREGNHKDEME >NZ_AP021845.1|WP_152091271.1|310495_311095_+|hypothetical-protein MSESTTPKRSRHSYRERIQEVIQERIALGKPLTHRDILKEAGGGSASTVVEELAKAERSTPATLIGRGAKSLPQRIAALEDALNASLAREKVLEAENQALRESLTSARADVDKLLAGHQDSQRMLLQGVDDLRQMVKAGQGGMASAVIATERQKAAGDDTGDGILWKARHDQLLQRFVALDAKNRKMSSQLHELGVDVD >NZ_AP021845.1|WP_152091272.1|311091_312549_-|RepB-family-plasmid-replication-initiator-protein MRQHQPEQRNLFPTEDLIVPESLQKMRKAVAAIHAIPRNPEDSQNLTNRRVFDGLIIVAQIHCRQRGKEFIQRIRDERVSPLFEVRTSELGKLSGIPGKNYDRIIEEISRIYEMDFEFNVCAEDGETIWENRARLLSSLGVGKNHKRGYIRFAMDPEMLILLLEPNLWASFSLSVMHDLGTSAAYALFQQTYRYINTNQKLTAALPTKTWIELLVGKNRYVKDIDGKEVINYGEFKRRVLNDAIEKVNEVPALTYNIELKEHRQGNRVARLQFKFIPKEPTLQLESTWPEDILTVLKSIGFLDKEITDISQAHSSASVADAICRLKEAEQRLKSEGKAISSRKPYFLGILRNIAAGEDDIDPEKIEAEVRIEMAERAAEERKKKMQDAYDEHRRKRFSAWVTSLSVEDRKQLIADYEASEDFNPVLGKSLKKILTEENRSGLSTLRVWMEKHRSETLAGVFNTPEYQSLEGWMMWKLSGDDAIEA >NZ_AP021845.1|WP_152091273.1|313182_313680_+|hypothetical-protein MMDDDEIKDRQLNALVGGAVRGLVESGADDEAIGAFASTYRQKAAKLLGRPQEALDPPDLTDIIKDAVAQALAAAQVKPKKSRKQNEHFTVSIGGQKTSVTIHKDVIAQLAEAKGSKAEVSRFVREVAKDVPDSVENKSEWIEHRIATIMRFKSESAGNGSSARH >NZ_AP021845.1|WP_152091274.1|313676_314798_-|restriction-endonuclease MFGPKKQTAAAKPNGVDNLIHKLKALPPAIDLIVAATLGGYSFVAIYVDNAKFVGLACILIALIFAVLGISAVTREMKDFKVVEQNSNPEALRMMKTQQFENYLVALFRLDGYQVRPSIDELHRQDDADLIAVKKKETILIQYNHWDEDIVGTKPIQSLHKAAAAVRAQGATAISFGRFSAEAADWARRKGVTLMTMQDVIGMACRLTGLTPEEAAAEPDEEVVVEKAHEVAEVVRGHHRFLFVDFAGLEHGLARLSELLLQHPAYQVIASTLPPLKSMEDIRLSLGECGDRLAGDLEAAQDGRYFAIQKHLQASREGKHAIWLAVDSEPRQFPEGCAELIAVNRAFGFDVSASQRLIEAMVIIDRRSIAGAG >NZ_AP021845.1|WP_152091275.1|314875_315796_-|recombination-associated-protein-RdgC 
MFFRNLTLYRLPTPWNMDLAKLEEMLARNPFTRCSGSEQQRSGWISPRDKGSLVYAQNRQWLIALCTEQRLLPSSVIQDEVRERAEALEEQQGYAPGRKQLRELKDRVAEELLPRAFTRRRTTFVWIDPVNGWLAVDASALSKAEEVLEQLRMVLDDFPLSLVHTKLSHSSAMADWLAGGEAPVNFTVDRDCELKAVGEEKAAVRYVRHPLDGDGIASEIKAHLAAGKLPTRLALTWNDRISFVLTERLEIKRLGFLDLLMEEAEKNTEHAEEKFDADFALMTGELARFIPSLLDALGGEVVEDRA >NZ_AP021845.1|WP_152091276.1|315816_317793_-|DEAD/DEAH-box-helicase MQPPPDNERNAVEQANGQPLLDRLKRLGVTAWREPLLCLPKLFQDYSSISTLKQALPQNDVVAGPKLFTLLVSEKAVVLSQPKKRLVMTATDGMLSVKIVIFVVLGVDVPTWKAFEEGDKIHLRGVLQNWNGKLQITGPTLIDPQLVGKVIPIYEKRRSVVADGAIYDATRYALEHHLKETIDYLVESYHGLPEADILRRARLKAPSVEVILRAAHQPTSEDEGMRGIAGMRRLAALSVVENARRLKQRDPVPESVVSIPDTLIQQLTEKLPYPLTGDQRRSIGEIVADMASPLPMRRVLSGDVGSGKTLPIMIAALATQHLGHRAVILTPNGLLADQFVKECKALFGEDSLVISVTSGTKKLDLASNPILVGTTALLSRLKGESPPALFCVDEEQKMSVSQKIELTGFASNYLQATATPIPRTTALITHGAMDVSVLKEMPVVKNITTHIVTAGERKRLFDHTRKVLASGGQIAIVYPIVNDDEQEKKSVVAAAVEWEKQFPGLVGMVHGQMKEAEKVAAVNGLKSGNQKIAVVSSVIEIGLTLPSLRSLIVVHAERYGTSTLHQLRGRVARLGGNGYFFLFLPETVAPETMQRLQLLVDHSDGFTLSEKDAELRGYGDLFEDAERQSGNSRSTIFRCVDLTPSEIHAATIHEALPS >NZ_AP021845.1|WP_152091277.1|317776_318742_-|hypothetical-protein MTTTNLATAKEVATAVASLGFQASSETLGAISRNCREDFLEHLSRCITDQDQDGRSKKFIGNLLRCLAPNTINRIKPIFPDATIDMIVPVAKAVPTRFLSAIDAAHDPKHARHEDAKAYLASIFAPPDTHSEEEPPQSSLQQQHDQQEERPVDQAALSRRLAPSGSKKYHSVHVYGSNAALCFNATDWNGAPGVMVDAAMQTGPKTYDWKNAVHVWLDINEVGAVLAVFRRWRKGVEFSAHGAQNDKGFAIEFQGQHFFAKVTAKKAAAGAVRAVKILPSDAMSVSILFLTQLAESYPMIPLNELLATVRATHQIEDAAAA |
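The spacer start positions in the FASTA headers of NZ_AP021845_4 (307483, 307531, 307591) can be recovered from the array start (307466) and the alternating repeat/spacer units of the split array record. The following is a minimal Python sketch of that bookkeeping, not the method used by PILER-CR or by the database. It assumes 1-based inclusive coordinates, which is consistent with the reported 307466-307630 span; the function name `spacer_coordinates` is hypothetical.

```python
# Repeat/spacer units of NZ_AP021845_4, copied in order from the split array record.
UNITS = [
    "TTGCACGTCGACGTGCA",                            # repeat
    "AAACACGCTGACGGGGAGGGAGGCGCATGGG",              # spacer 4.1
    "CTGCACGTTGACGTGCA",                            # repeat
    "TTGCCGAGCACCAAGGGGTGATGCGAGAGGAGGGCTGTGCGGA",  # spacer 4.2
    "TTGCACGTCAACGTGCA",                            # repeat
    "TCGCTGACATCCCTGGAATCCGA",                      # spacer 4.3
    "TTGCACGTCAACGTGCA",                            # repeat
]


def spacer_coordinates(array_start, units):
    """Return (start, end) per spacer, 1-based inclusive (assumed convention)."""
    coords = []
    pos = array_start
    for index, unit in enumerate(units):
        end = pos + len(unit) - 1
        if index % 2 == 1:  # units alternate repeat, spacer, repeat, ...
            coords.append((pos, end))
        pos = end + 1
    return coords


ARRAY_START = 307466
coords = spacer_coordinates(ARRAY_START, UNITS)
array_end = ARRAY_START + sum(len(u) for u in UNITS) - 1

for start, end in coords:
    print(f"{start}-{end}")
print("array end:", array_end)
```

The printed ranges can be compared with the Spacer_region values reported for spacers 4.1 to 4.3, and the computed array end with the CRISPR_location column.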
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_AP021845_1 | 1.1|11367|30|NZ_AP021845|CRISPRCasFinder | 11367-11396 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11367-11396 | 0 | 1.0 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11566-11595 | 0 | 1.0 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 11632-11661 | 0 | 1.0 |
NZ_AP021845_3 | 3.1|12229|30|NZ_AP021845|CRISPRCasFinder | 12229-12258 | 30 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 12229-12258 | 0 | 1.0 |
NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307483-307513 | 0 | 1.0 |
NZ_AP021845_4 | 4.2|307531|43|NZ_AP021845|PILER-CR | 307531-307573 | 43 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307531-307573 | 0 | 1.0 |
NZ_AP021845_4 | 4.3|307591|23|NZ_AP021845|PILER-CR | 307591-307613 | 23 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 307591-307613 | 0 | 1.0 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NC_018141 | Legionella pneumophila subsp. pneumophila plasmid pLELO, complete sequence | 116661-116690 | 6 | 0.8 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP025492 | Legionella sainthelensi strain LA01-117 plasmid pLA01-117_150k, complete sequence | 38691-38720 | 6 | 0.8 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP021284 | Legionella pneumophila subsp. pneumophila strain Allentown 1 (D-7475) plasmid unnamed1, complete sequence | 109995-110024 | 6 | 0.8 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP011106 | Legionella pneumophila strain L10-023 isolate Ulm plasmid unnamed, complete sequence | 106875-106904 | 6 | 0.8 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP045305 | Legionella longbeachae strain B1445CHC plasmid pB1445CHC_150k, complete sequence | 37393-37422 | 6 | 0.8 |
NZ_AP021845_2 | 2.1|11566|30|NZ_AP021845|CRISPRCasFinder | 11566-11595 | 30 | NZ_CP042253 | Legionella longbeachae strain B3526CHC plasmid pB3526CHC_150k, complete sequence | 39581-39610 | 6 | 0.8 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KM389300 | UNVERIFIED: Escherichia phage CBA6 clone ctg7180000000096 genomic sequence | 16175-16204 | 7 | 0.767 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KC139562 | Salmonella phage FSL SP-029 hypothetical protein gene, partial cds; hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, and hypothetical proteins genes, complete cds; and DNA topoisomerase 2 gene, partial cds | 2138-2167 | 7 | 0.767 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | KC139523 | Salmonella phage FSL SP-063 hypothetical protein genes, complete cds; tRNA-Met, tRNA-Trp, tRNA-Asn, tRNA-OTHER, tRNA-Ser, and tRNA-OTHER genes, complete sequence; and hypothetical proteins, DNA polymerase, hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, hypothetical proteins, putative tail fibre, hypothetical proteins, DNA topoisomerase IIs, hypothetical proteins, exonuclease A, hypothetical proteins, deoxycytidylate deaminase, hypothetical proteins, head completion protein, putative tail tube associated base plate protein, baseplate wedge subunit, hypothetical proteins, loader of T4-like helicase, hypothetical proteins, putative membrane protein, DNA ligase, hypothetical proteins, helicase, hypothetical protein, RecA-like recombination protein, hypothetical protein, putative dUTP diphosphatase, hypothetical protein, thymidylate synthase, hypothetical proteins, DNA end protector protein, baseplate tail tube, single-stranded DNA binding protein, hypothetical proteins, regulatory protein FmdB, hypothetical proteins, and base plate hub subunit genes, complete cds | 20576-20605 | 7 | 0.767 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | FQ312032 | Salmonella phage Vi01 complete sequence | 154723-154752 | 7 | 0.767 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NC_015296 | Salmonella phage Vi01, complete genome | 154723-154752 | 7 | 0.767 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | NC_023856 | Salmonella phage vB_SalM_SJ2, complete genome | 80074-80103 | 7 | 0.767 |
NZ_AP021845_2 | 2.2|11632|30|NZ_AP021845|CRISPRCasFinder | 11632-11661 | 30 | MH427377 | Escherichia phage vB_EcoM Sa157lw, complete genome | 129521-129550 | 7 | 0.767 |
NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_CP012399 | Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence | 220195-220225 | 7 | 0.774 |
NZ_AP021845_4 | 4.1|307483|31|NZ_AP021845|PILER-CR | 307483-307513 | 31 | NZ_CP018096 | Chelatococcus daeguensis strain TAD1 plasmid pTAD1, complete sequence | 197951-197981 | 7 | 0.774 |
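The Identity column appears to equal (spacer length - mismatches) / spacer length, rounded to three decimals. This relation is inferred from the rows above and is not stated by the database itself. A short Python check under that assumption (the helper name `identity` is hypothetical):

```python
def identity(spacer_length, mismatches):
    """Fraction of matching positions, rounded to three decimals (assumed formula)."""
    return round((spacer_length - mismatches) / spacer_length, 3)


# (spacer length, mismatches, reported identity), taken from rows of the table above
ROWS = [
    (30, 0, 1.0),    # hits on NZ_AP021845 itself
    (30, 6, 0.8),    # spacer 2.1 vs. Legionella plasmids
    (30, 7, 0.767),  # spacer 2.2 vs. Salmonella/Escherichia phages
    (31, 7, 0.774),  # spacer 4.1 vs. Chelatococcus plasmids
]

for length, mismatches, reported in ROWS:
    print(length, mismatches, identity(length, mismatches), reported)
```

Under this reading, a hit with identity 0.8 on a 30-nt spacer corresponds to 24 matching positions.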
1. spacer 1.1|11367|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 11367-11396, mismatch: 0, identity: 1.0
atctcaacccgctctaggattcggctgact CRISPR spacer atctcaacccgctctaggattcggctgact Protospacer ******************************
2. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 11566-11595, mismatch: 0, identity: 1.0
ctgatattgacaaggccttggcagttgtcg CRISPR spacer ctgatattgacaaggccttggcagttgtcg Protospacer ******************************
3. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 11632-11661, mismatch: 0, identity: 1.0
gtcaggagtatcagccagacaatgaacttg CRISPR spacer gtcaggagtatcagccagacaatgaacttg Protospacer ******************************
4. spacer 3.1|12229|30|NZ_AP021845|CRISPRCasFinder matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 12229-12258, mismatch: 0, identity: 1.0
gaatatcggttctgcggtcgcagattggcc CRISPR spacer gaatatcggttctgcggtcgcagattggcc Protospacer ******************************
5. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 307483-307513, mismatch: 0, identity: 1.0
aaacacgctgacggggagggaggcgcatggg CRISPR spacer aaacacgctgacggggagggaggcgcatggg Protospacer *******************************
6. spacer 4.2|307531|43|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 307531-307573, mismatch: 0, identity: 1.0
ttgccgagcaccaaggggtgatgcgagaggagggctgtgcgga CRISPR spacer ttgccgagcaccaaggggtgatgcgagaggagggctgtgcgga Protospacer *******************************************
7. spacer 4.3|307591|23|NZ_AP021845|PILER-CR matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 307591-307613, mismatch: 0, identity: 1.0
tcgctgacatccctggaatccga CRISPR spacer tcgctgacatccctggaatccga Protospacer ***********************
8. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NC_018141 (Legionella pneumophila subsp. pneumophila plasmid pLELO, complete sequence) position: 116661-116690, mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
9. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP025492 (Legionella sainthelensi strain LA01-117 plasmid pLA01-117_150k, complete sequence) position: 38691-38720, mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
10. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP021284 (Legionella pneumophila subsp. pneumophila strain Allentown 1 (D-7475) plasmid unnamed1, complete sequence) position: 109995-110024, mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
11. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP011106 (Legionella pneumophila strain L10-023 isolate Ulm plasmid unnamed, complete sequence) position: 106875-106904, mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
12. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP045305 (Legionella longbeachae strain B1445CHC plasmid pB1445CHC_150k, complete sequence) position: 37393-37422, mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
13. spacer 2.1|11566|30|NZ_AP021845|CRISPRCasFinder matches to NZ_CP042253 (Legionella longbeachae strain B3526CHC plasmid pB3526CHC_150k, complete sequence) position: 39581-39610, mismatch: 6, identity: 0.8
ctgatattgacaaggccttgg--cagttgtcg CRISPR spacer gtgatattgacaaggccttagctcagttag-- Protospacer ******************.* *****.
14. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KM389300 (UNVERIFIED: Escherichia phage CBA6 clone ctg7180000000096 genomic sequence) position: 16175-16204, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
15. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KC139562 (Salmonella phage FSL SP-029 hypothetical protein gene, partial cds; hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, and hypothetical proteins genes, complete cds; and DNA topoisomerase 2 gene, partial cds) position: 2138-2167, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
16. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to KC139523 (Salmonella phage FSL SP-063 hypothetical protein genes, complete cds; tRNA-Met, tRNA-Trp, tRNA-Asn, tRNA-OTHER, tRNA-Ser, and tRNA-OTHER genes, complete sequence; and hypothetical proteins, DNA polymerase, hypothetical proteins, RIIA protector from prophage-induced early lysis, RIIB Protector from prophage-induced early lysis, hypothetical proteins, putative tail fibre, hypothetical proteins, DNA topoisomerase IIs, hypothetical proteins, exonuclease A, hypothetical proteins, deoxycytidylate deaminase, hypothetical proteins, head completion protein, putative tail tube associated base plate protein, baseplate wedge subunit, hypothetical proteins, loader of T4-like helicase, hypothetical proteins, putative membrane protein, DNA ligase, hypothetical proteins, helicase, hypothetical protein, RecA-like recombination protein, hypothetical protein, putative dUTP diphosphatase, hypothetical protein, thymidylate synthase, hypothetical proteins, DNA end protector protein, baseplate tail tube, single-stranded DNA binding protein, hypothetical proteins, regulatory protein FmdB, hypothetical proteins, and base plate hub subunit genes, complete cds) position: 20576-20605, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
17. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to FQ312032 (Salmonella phage Vi01 complete sequence) position: 154723-154752, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
18. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NC_015296 (Salmonella phage Vi01, complete genome) position: 154723-154752, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
19. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to NC_023856 (Salmonella phage vB_SalM_SJ2, complete genome) position: 80074-80103, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
20. spacer 2.2|11632|30|NZ_AP021845|CRISPRCasFinder matches to MH427377 (Escherichia phage vB_EcoM Sa157lw, complete genome) position: 129521-129550, mismatch: 7, identity: 0.767
gtcagga----gtatcagccagacaatgaacttg CRISPR spacer ----gaattccgtatcagccagacaaagaaattg Protospacer *.* *************** *** ***
21. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_CP012399 (Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence) position: 220195-220225, mismatch: 7, identity: 0.774
aaacacgctgacggggagggaggcgcatggg CRISPR spacer gatcgcgctgacgggacgggaggcgcatacg Protospacer .* *.**********. ***********. *
22. spacer 4.1|307483|31|NZ_AP021845|PILER-CR matches to NZ_CP018096 (Chelatococcus daeguensis strain TAD1 plasmid pTAD1, complete sequence) position: 197951-197981, mismatch: 7, identity: 0.774
aaacacgctgacggggagggaggcgcatggg CRISPR spacer gatcgcgctgacgggacgggaggcgcatacg Protospacer .* *.**********. ***********. *
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
72042 : 78469
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021845|72042:78469|DBSCAN-SWA CTCATGCAGAGGCTCCTTCTGTGGCTCCCGCCCGTCGCTCGCGGATCGACGTCCTGACCCAGGTCTCAGCCTCTTCGGGGCTTCCAAGAGAGCCGGTTTCTCGCCGGCGCCGCCTCATTGCTGCGGCTTGCCTCGAGGTGATTTCCACCGCACCGGACTTGATCGCCAGGCTCCGCTTAGCCAGGCAGATGTCGTAGTGATCCCCCTGGTACCAGCGCCGGGCGACACCAATGCAGTCGGCCATTGCATGTACCTCGGCTTCCGAGTCCCCGAGCATGTGACACATGACCATTCGTCCATAACCGGCCCGCATGTTGTCGACGTAGACGGTCATTGAAGACCCTCCGCAATCGCTATGCAGGCCTGCCAATCATTTGAATAAGGCCAGCGGGGTCCAATTTTGCCGTCTCTCATCTGGAATTCACCGAATTTGGCCAGGAATGACCAGACATCATGAGGAGTCTTGTTAGCGACATCGGCGCGGCGATAGTCGCGGAATAGATCGTCAAAACCTTCAATAAGGACGCGACGTATGCTCGAATAATCACAGGAATTGATGTCGTGACGAATAGCTGCAAGGTAATAGGCCAGTGCCTTTGCCCCGTTTGCGCCGGAAGTACGGCAACCGACCTCATGGTATGACCCAAGCCACCACATGGCTTTCGAATCGCCATTGCGCGCCCAGAGTCCGATTTTGAGTAGAACTTCTTGGAAAAGCGGCTCTGGGTAGCCGAACGGTTCAAGATCGAGGATTGAAGCGGTTCCCGCTTGGCCTGTAGGCCAATGGACGCTCCAACGCCAGTTTTTGAACTCAGGCGGATCATCGCGCTGAATTCCCATTTGCAGCACTTCATGCCCCTGCTGCCGGGCCAGGATAAGGTAGTTCCCGGTAAAGATTTTGTCGTCCTCCTCAACGCGGGTCGAGTCGCAAATAGGGCAACGAACCTTGCTGAGGGTACCAGTGACGGAAATGGGAACTCCGCTGTTGATGGCTTGGTTGCTTCCTCGCCATCCACATTCTTGGCATCTGACTGGCCAGTCTTGATTTTCCATGGTTCAGAACAGACTTCCAGTTGTCGCAACCTCGGGCTGCATCGCCGCGGCCGGAACCGTCGAAGCGGGCCTCTTCGAGGCTTTGCTGACTTTCTGGAGGTAGCGTTCAACCAGGCGGCCGAGGTTGGTTGTGTCGTCATCGACCTTCTGCACACCAAGGTGCGGATCTACGATCTGATTAGCCTCGCTGGCCTTGATACCAAGCACGTCCATGATGGGCGGATCGCTGCCCTCCTCGCTGACGAGGAAGAAGGCGGTGACCGGATCTTCCTGACCCTCTCTATCCAAGCGCCAGATGATTTGCTGGTGTATGCCCGGGGACCAATCGAGCTCGCCGAACACCACAACCGAACTGCGGAACTGCAGATCGTCAATGCCTGCTCCTGAGCGGAGAGACATGATCATCACGTCCGTGTCGCCGGTGAGGAACCGATCCTTTTCCTTGTTCTTCTGGGCAGCCGTTTCCGAGCCGGTGTACATGGCCGGGCGAAGGTCTGCGAGTTCTTCAAGCCAGATGTCGTAGACGGCCCGGTGCCACCCAACCAGAAGGACGGGCTCGCCGGCCTCGACCATCAGGCGGACGAACTGTGCTACTGCCTTGGCCTTGGAGAGCCCGGTAGCCTGCCGGACCATCATGTCCAGTTCCCGGGCGGCCTGCCCGCGCTCGACGAAGGTGCCAGTGGTGGCGCGGATGGCCAGCACCCTGGCCAGGTCCTCGATGGACTGCACAGCCTTGGCATCGAAGTCCACATACTCGATGATTCTGGACACCTTTGGCAGCTCGAGGCCCACATCCGATTTGAGGCGTCGTAAGAGGACGTGCTGTTCGCGGAGGTATGTTCCCAGAGCCTTCGGGTTCCCGATGCGGCCCATG
TCATCGGTCCATTCTCTGGAGAAGTCAGCGAAACTACCGAGCACCGTGTCGTCGATGAACTGCATGACATTGTGCATTTCAATGCCGTATCCATAAATTGGGGTAGCGGTCAGCCCGAGCCTGTAAGACACGTGGTTCGCTAACACCTTGGCAGCGGCGCCCTTGGCGGTCGACGTGCCCGTGCGCAGGCTTTGTGGCTCATCGAATACGGCTGCTTTGAAGAAGTCCGTAGCAAAGATGTCCGCCCATCCGCCAATTTGCGAGATACGGAAGACATAGACGTCCGCCGGCGGCAGATCGTATGGGGATGCTTTGGTGATGATATGCACCCGGAGGTGGGTGAAAGCGGTGAGCTTGTCCTTCCACTGCTTTTGCATGTGCGGGTCGCAGACGACGGCCGCTGGGAGAGATTGGGCTTCAGCGCACAGGAACGCCGCTGCCGTGTAGGTTTTCCCCAGTCCGCCTTCATCGCCCAGGAGCAGCGAGCGGCGCCGCCGCAGGAGTTCGACGGCCTGCATCTGGTAGTGACGTACCTTCTGCCCTTCGCGAAGGCCTACAACCGCCGGCGGGACGTATTCGGGGAGAAGGATGCGCTCCATCTCGGCCTGCTGCATCTCGAAGTCGAGCCGACCGCCTCGCAGCGCGTTCCGGTCACCGTCCGACATGGCCAGCGGATACCGAGAGAGAAACCAATCGAGGTCCGCCGCGTGCATCAGATCGCGGGGAAAGCGGAAGGGACCGGTTGATTGCTTCGGCACCCGGGGGAAGATGTGCTTCAGGCGAATGGCGACATGCGGCTCCAGGGAGGACATTTCCCAGGCGGTACCGCCTTCGATCAATCGGAGTTCGCCGTAGGTCCGCATCAGATCCACCCCTGGCTCAACGAGGCGGCGAATAACGGCGTTCCTTCGATCTCCGGTGGAAGCCCCATGGAGACATTCGAGGCCAGGATGATCGCGGTCACCTCGGGATAGGTGGCGTACCGGGCCAGTTGTCGAAAAATGTCCATTTTCTTGGACTTGTTCCGCATCTTGCACTCGACCACTACGCCGCCGGCGATCAGGAAATCAGGGATGTCTTTTGGGGACAGGCGTTTTTCCCTTTCATAAGCGATTCCGGCTGCCTTGAGCACTTCGGCGACGCCTTCCTGCAGATGCTTTTCCGATGAGAGGTCGAGCCGGCTGCGCTGCACGAGGCGGATCACGTCGGCGATCACAGGGGGAGTTGTGGACGCCTGGCTCATGGCTGAGTCTCCTCCGGTTTCACCTCGAAGAAGTTGAGCTTCCCTTTCACTTTGCGGAAAGGCAGCGGCTTTGAATCTCCCACCACGAAGGCATGCTCCCCCATGTACCAGCGAGACGTCAGTTGGCCGTCTCTGGCCTTCTCGGGCGGGATGGAGTCGGTCAGCACGGCTTGGCCGACGATTCCACCAAGGTCGTACTGACCCGGGCGAGGAAGCGGAATTTTCGGAAAGCGGCCCTTCACCCACAGGTAGCCTTCCATGTCGAACGTCTGGCCGGCGTGGACGAGAAATGGTCCCCGGTGCTTCGTTGCCCAGGTACGGTTCTCGATGTCCTTGATCTCGCCGGCGGCGACGGCGGCGGCACGTTGCTGGGGGTCGGTCAGGTCTGGGCGCACGATCAGCCAGGCCCAGGGTTGTCGGATGGATAGCGCCTTCATGATCGATCTCAATGGGTGGTGGCCACTGTATTGCTGACCCGCAATTCAACCGGGACGCCGTTGATTTCGGGCGGGAGGCCCTGGATTGGTTCAGCAGCCAATACCAACAGCCGATCGGCGCCGTACAGGATGCGGTAGGGGGCGGCCGGCAGTTTCTCGGCCAGTACCGCCTTCAGATCGTTCTCCGCCGCTTCAAGAACTGGATTCTTGGTTGGCTGCTGCCTCGCATGGGCCAGTTCAGCGCGCATGGCTTCGAGCTCGAGCAGCGCAGCGAATGCGCGGTCTGCCGTTTCCGGGCAGCTGTAGGCGATCATGAAGGA
ATTGATCAGGCTGCGGAGCAAGCCGCCGGTGTCGTGGTTCTTCACGACCGACATCACGGCAAACTGATCCAGGCGCTCGACAGGGGTCTCCAGGAAGCGCTCCAGGCCCATGCCCTTAGCGATTGCGGTGAAGAGAATGGTCAACGTCCGCCGGCGCTCCTCCGGGGCCAGGGGCTGCCCGTTCGTAGGGCTTGCCAGCTCGCAGCGGAAGCGGTTGGCGAGAGCGACATCGACGCGCTGTTGGATGAGGGTTTCGATGTTGGGAGCTTTGGTCATGATGTTCTCCGATCAGCAGCCGCCGGTGGCGTGACGGTAGATAGAAACTGGCGAACGCCACTGGCGTTTTGAGGACCAAGGATGTCGCCTTCCATCGCCTCCAAGAGCCGTGCAGCGTGGTTATAGCCAATCCGCAGATGGCGCTGGACCATGGAAATGGAGGCATGGCGTCCTTTCAGGACGAGATCGCGGGCTTCCTGGTACAGGGGGTCTAGCCGACTTCCCTTGATGCTTTCATCGATATCGTGTCCGCGGATCATGCGACGTGCTCCCCAGCCAGGCGGGCCGTCAATGGCATCAGCGCCATCCGGCGGCGCTCAAGGCAGAACCACATGATTTCCGCCCCCGGCTTGCCCTGGGCGGCCAGTGCACGGTAATCCGCGATCAACTGGTCGAGGATGGTGGGTTGGACGTGGCCGTAGATTTGGTCGATCGTCGGGAAGTCCTTGAAAATCTCGCCCACCGGTACCGGCGGCTCGGCCATGAGGGCGTAGGCCCACGGCTGTGTGTAGCCGCAATGGGGGCAGATCCAGCCTTGCTCGGTAGCGATCAGGAGGCCGCGGTCACCGCCCTCGGTACTGTGGGTGGCGAGCGATACATCGGCGGCACCCGATTCGTCGTAGGTAATGCCGTCACCGCGATTTGGGCAGGTGAAGGGGTGGATGGGCATGCTGCCGTCCACGTGGCATTGTCGCTCGTTGAGATATTGAACCTGGGCGGGGGTAAAGGGGGCCTGAATCTTCATTGCTTCTGATGCTGTCTTGCGGGAGAGGAGATCGCCTCGTCGATCGAGAGGCAGTAATGCCAGAGGCGGTGCTGGCGGAAAACGCGGCTGATGCTGTCGCATTCCCCAACCTGGAATACCGGCACGCCGGCCATCAAGGCGGCGCCAGCCTCCAGAATTGCCCCCTTGAGGATTTCCCCGGCTTCGCAGTAGAGCACCAGGCGCTCGGAGCCGGCGATTTCGCTTAGGCACCGGTCAGCCAGCTCCTCGTAATCGGCGGTTTGGCCCTCGCCGGCTTCATCGATCCAGGAAGATGCTGTCTTGGTGCCGCTGTCGCGCAGCGCCTTCCAGCGATGGGCATGGACGACTTTGGAGGCGAAATAGACGCCCCCTCGACCTTTGGCGGTACCGGCGGCGATCTCAAGCACGGCTCTCACCTTGGCCAGGTCAAAATGGTTGTCGCTGCGAACCCCCGACTCCATGTCGATCCAGTAGGGCACGTCGCGCCAGGCAGTTGCGGATGCGTGGATGCGTGGCAGCTCGATTTCGAGGACATTGGGGCCCAGCCCGCCGGCGTAGCCGCAGTGCAACGGGATTTGGCCGACGGTCGAGATCGGTGGCCATTGCTCCGGACACTGCCCGGTCCCCTTGGATTCGTCGTAGAGCAGATCAATACGGGACAGGGAACTGGCATCGACCTCGGCAACGAACTTCTGAATGGCCTGGCGAGATCCGTCGTGGAATTGAAGGATCAGGTGCAGGCCAGCCTGGTGCAGCGTGCGGTAGACCGACAGGACCTCTTCCTCGGTGAACTCGGGTCGGCGCGCGTTGATGTTGAGTTGGATGCGCCGGTACCGAGAAATGTCGGCAATGCGCGCCGGGGCGGTGCGTGGATCAAGGATCTCGCGAAAGACTTGCGTGCCGCAGAGATGGGCTGCGGTGAAGGGCAAGCCCAGGGCCAGGAATGCTTCACGCCAGGCAGCAGAGGGGTTCCGTGGCG
AGCCTTCCTTCTCCGGGAAATAGAGAATCGCCCACTCGGGCGTGATCGGATTGGTGTAGCGGGCCGACAGGGTAGCCAGTTCGGAGGGTAGGACTTTATCGTCGGCGCCGGTGATGGAAACGAGGTTCAGAGGCATGGAGGAATCCTTGAAGGGTTTCCTGCCATGCTACGGAGCCGCCATAGGCGTTTCTGGATTTTCTTCTTCACTCAGCGACCCTCCTTCGGCCGAGCAAAGGGGACCACGTTTTCCAGCTCACGTTGAACGCCCCCGGCCGCGCGGAATTCATCCCAGAGCACCATGTGGCCCGGGCAATAATGGACATTCGGCGCGACTTCGGTGGCGTGGCTCTCGCACAGGGGCATGTCGCACGTCTTGCCGTCCCCGACCGGGAAGTCGCACAGGAAGTCCCCAACGGACGCGCAGGCGCCGCAGTGAGGGCCGAATTCGCCGCACAGGAACATCGTCCCGCCGTCCTTCATCGGTTGGATGTAGCAGGGCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021845|72042:78469|76500_76989_-|WP_152090998.1|DBSCAN-SWA MKIQAPFTPAQVQYLNERQCHVDGSMPIHPFTCPNRGDGITYDESGAADVSLATHSTEGGDRGLLIATEQGWICPHCGYTQPWAYALMAEPPVPVGEIFKDFPTIDQIYGHVQPTILDQLIADYRALAAQGKPGAEIMWFCLERRRMALMPLTARLAGEHVA >NZ_AP021845|72042:78469|78175_78469_-|WP_152091000.1|DBSCAN-SWA MPCYIQPMKDGGTMFLCGEFGPHCGACASVGDFLCDFPVGDGKTCDMPLCESHATEVAPNVHYCPGHMVLWDEFRAAGGVQRELENVVPFARPKEGR >NZ_AP021845|72042:78469|74830_75211_-|WP_152090994.1|DBSCAN-SWA MSQASTTPPVIADVIRLVQRSRLDLSSEKHLQEGVAEVLKAAGIAYEREKRLSPKDIPDFLIAGGVVVECKMRNKSKKMDIFRQLARYATYPEVTAIILASNVSMGLPPEIEGTPLFAASLSQGWI >NZ_AP021845|72042:78469|72042_72375_-|WP_152090991.1|DBSCAN-SWA MTVYVDNMRAGYGRMVMCHMLGDSEAEVHAMADCIGVARRWYQGDHYDICLAKRSLAIKSGAVEITSRQAAAMRRRRRETGSLGSPEEAETWVRTSIRERRAGATEGASA >NZ_AP021845|72042:78469|72371_73094_-|WP_152090992.1|DBSCAN-SWA MENQDWPVRCQECGWRGSNQAINSGVPISVTGTLSKVRCPICDSTRVEEDDKIFTGNYLILARQQGHEVLQMGIQRDDPPEFKNWRWSVHWPTGQAGTASILDLEPFGYPEPLFQEVLLKIGLWARNGDSKAMWWLGSYHEVGCRTSGANGAKALAYYLAAIRHDINSCDYSSIRRVLIEGFDDLFRDYRRADVANKTPHDVWSFLAKFGEFQMRDGKIGPRWPYSNDWQACIAIAEGLQ >NZ_AP021845|72042:78469|76985_78104_-|WP_152090999.1|DBSCAN-SWA MPLNLVSITGADDKVLPSELATLSARYTNPITPEWAILYFPEKEGSPRNPSAAWREAFLALGLPFTAAHLCGTQVFREILDPRTAPARIADISRYRRIQLNINARRPEFTEEEVLSVYRTLHQAGLHLILQFHDGSRQAIQKFVAEVDASSLSRIDLLYDESKGTGQCPEQWPPISTVGQIPLHCGYAGGLGPNVLEIELPRIHASATAWRDVPYWIDMESGVRSDNHFDLAKVRAVLEIAAGTAKGRGGVYFASKVVHAHRWKALRDSGTKTASSWIDEAGEGQTADYEELADRCLSEIAGSERLVLYCEAGEILKGAILEAGAALMAGVPVFQVGECDSISRVFRQHRLWHYCLSIDEAISSPARQHQKQ >NZ_AP021845|72042:78469|73097_74831_-|WP_152090993.1|DBSCAN-SWA 
MRTYGELRLIEGGTAWEMSSLEPHVAIRLKHIFPRVPKQSTGPFRFPRDLMHAADLDWFLSRYPLAMSDGDRNALRGGRLDFEMQQAEMERILLPEYVPPAVVGLREGQKVRHYQMQAVELLRRRRSLLLGDEGGLGKTYTAAAFLCAEAQSLPAAVVCDPHMQKQWKDKLTAFTHLRVHIITKASPYDLPPADVYVFRISQIGGWADIFATDFFKAAVFDEPQSLRTGTSTAKGAAAKVLANHVSYRLGLTATPIYGYGIEMHNVMQFIDDTVLGSFADFSREWTDDMGRIGNPKALGTYLREQHVLLRRLKSDVGLELPKVSRIIEYVDFDAKAVQSIEDLARVLAIRATTGTFVERGQAARELDMMVRQATGLSKAKAVAQFVRLMVEAGEPVLLVGWHRAVYDIWLEELADLRPAMYTGSETAAQKNKEKDRFLTGDTDVMIMSLRSGAGIDDLQFRSSVVVFGELDWSPGIHQQIIWRLDREGQEDPVTAFFLVSEEGSDPPIMDVLGIKASEANQIVDPHLGVQKVDDDTTNLGRLVERYLQKVSKASKRPASTVPAAAMQPEVATTGSLF >NZ_AP021845|72042:78469|75656_76244_-|WP_152090996.1|DBSCAN-SWA MTKAPNIETLIQQRVDVALANRFRCELASPTNGQPLAPEERRRTLTILFTAIAKGMGLERFLETPVERLDQFAVMSVVKNHDTGGLLRSLINSFMIAYSCPETADRAFAALLELEAMRAELAHARQQPTKNPVLEAAENDLKAVLAEKLPAAPYRILYGADRLLVLAAEPIQGLPPEINGVPVELRVSNTVATTH >NZ_AP021845|72042:78469|76240_76504_-|WP_152090997.1|DBSCAN-SWA MIRGHDIDESIKGSRLDPLYQEARDLVLKGRHASISMVQRHLRIGYNHAARLLEAMEGDILGPQNASGVRQFLSTVTPPAAADRRTS >NZ_AP021845|72042:78469|75207_75648_-|WP_152090995.1|DBSCAN-SWA MKALSIRQPWAWLIVRPDLTDPQQRAAAVAAGEIKDIENRTWATKHRGPFLVHAGQTFDMEGYLWVKGRFPKIPLPRPGQYDLGGIVGQAVLTDSIPPEKARDGQLTSRWYMGEHAFVVGDSKPLPFRKVKGKLNFFEVKPEETQP |
10 | Ruegeria_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
219800 : 230598
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021845|219800:230598|DBSCAN-SWA CCTAGTAGCGAGTCTGGTAATCGTCGCCACGGGAGAAGCGGCTGTTCCAGTGCTCCAGGTAGCCCTTGGCCAGCTCAGGGTTCTTCCAGACCACCAGCACGTTCTCGCTGTTCTTGTTCGCCGCCGCGCCGCTGTAATTGAAGCTACCGGTCTCGACCGTCTCCCTGTCCGATACGACGATCTTGTCGTGGTGGATAGGGTAGATCGAGATCGTCCGCACGCGACAGCCAGCATTCACCAAGGCGGAGAGCGCCGCCCGGGCCTTGCCGGACTTGTCCTCGACGACGTTGTTCTTGTAGTCGGAAACGATGGCGACATCGACGCCCCGCCGGCGGGCCGCGATCAGGGCTTCCACCACCGGTGCCGAGGTCAGCGAATAGGTCATCATGCGGATCTCGGAGCGGGCTGATCCAATCACCTTCAGAACCAGCTTTTCGGCGCCTTCATTGGGGCTGAAGGCATATTCGATGGTCCCGCTATGCTGGACCGTCGAGGAATGGTCCTGGAGGCCCTGGACGGCCTTCTCGGCTACATTCAGCCAGTTGGTTGCGCCGGCCTGTCCGGCGAAAAGAACGAGTGCGAGGGCGGCTGCCCTGAGTACGGCATGCTTCATGTTGCTTCCTTTCGAGATTGAGCCGCTTGCGCGGCGGGAACGGTTTTACTCCCCCTTGCCCTTGGCTTGGCGGAGGGTGATGAGTTCTTGGTACCGGGCGATCACTTCGCCGGCCGTGGTGAAGCCCACAGAGATGCCCAGGATGTCGGCCACCGCCTGGCGGACTTCGGCGATAAGCTGGGCTTCCTCCTCGGTCGGCACGCAGGTTGGCCAACCAGAGGAATAGAGCAGGTGAGCAGGCAGCCGGGATTCAGCCTGGACGCGCAGTTCGGCGCGACGCTTGGCATCAGCCAGCCCTTCTCGCTCCTCTGCGGACAGAAGATGCGGGGCCACGTCGCCGAGCCACGTCGCCCCCTGCAGGGCGAACGTCGAGGCGATCTCCAGGCGCACCATCCGGCGGTAGATATCGGCGTTATCCGGGCAGGACGCCGATGCCTTCAAATCGCCGGCCGACCCCATAATGCAAAACGTGCAACTGAGGCGCGTGCTTCCGTAGACGGTGTAGGCCTCGTGCAGGGGCGCCTGCTTGAGCGCGATGTAGTCGAATACTGCTGGCGTCGGCCACTTGATCACTGGATTCCAGTTCATGCCCACGCATCCTCGGGCCGAGAGAAGCGGCTGCGGCGTAGCGATGGGCATTTTTGCTCGAGCAGCACTCTCCGCATGGCGAACGCCCGTTGCGGACAGGATTTTTTGCCCTGGGTACCGCTTCGTTAGCTCCCTGCAGATGATCGATGTCTTGAGCTCGCTGGTGCAGAATCTCATTGATGGGGTTGACCATGGAAGGATGATCCTCACCGTCGAGAGCGATGCGTACCGCTCAATGTTGTTCTTCCAGCGGACTTCCCATCGATCCATCATGTCGCCCGCAGCACGGCGCACGACGATGAGTTCCCATCCCAAGCGCTTTGCCAGGCGTTCGCACAGCGGTAGGCTATCCTTCCATTCGACAGTGCCAAGGTCGCTGTGGATGAGAACACGCGGCCCCGAATGACCAATCTGGTCGAGGTATTCGGAAACGCGAACGGCCATTGCGACACTGTCCTTCCCTGCGCTCACGCCAAGTGCGCAAACTGCGCCTGCCGCCAGCCACTCACTGACCTCCGGTGTCACGGCCACGTCGTACTGGCCTGCCGCGGCGATTGCCGGGGCGAATAAGTCAAGCTGCTGCAGTCGGTCCATCAAGTCGCACTCCGTTGGCTTGAGGGCGCGGCCCTCACTCAAACGTCATGCTACGGATTGACCCCACCCGATTCTGAAATACTTCCCTGTCCGCGATGGACTGTCTGGTGGCCCGTCTTCCGGCCCGAAT
TCAGGCGGCGAGCGGGAGGCGGAGCTGGTCGCCCGGCTGCGCGGACATGCGGGAGCGCGCCGTCCTCATCCAGCGGCTCATTTCGCTCGAGCGGTGGGCCGTGGTGTTGGACAGACCGCTCTGCCGCGCCTTGATCCTGGCGCCGAAGTCGTAGGCCATGGAGTCGGCCGAAGCCACCCAGTCCAACATCTTCACCTCGGACAGGCTCGCCCCCTTGACCCCGAACAGGTGCAGCCGCGCGCCTTGTGGCAGCTTCCCTTCCAGGGCGGACAGGATGGCGTAGAGGCCGTGTGTCGGGTGATGCAGGTTCCGCCGGCACACCGATCCAACTCCGATGAGCAGAGGCGCATCAAGCCAGGGCTGCCAGCGCTCCCAAACGGCCGTCAGCAGTTCCAGGCTGCGGAGGTAGTCGGATGCCGACCACCCCTGGATGACGGGCACCGGCGGCGGCAACATGTTGGCCACCACCGAGGCTGAGCACGTCTTGGCGAGTTCGTTCTGCCAGGCGTAGACCACGCGCAGCACGCCCTCAAGCAGCGTTGCCGTCGCGTTGATCCGGTAGTCGATCGCCGCCTGGTCCTGGGCGATCTCCGGCTCGCAGCAGAGATCCGGCTGGGACCACCACGACGGGCTCAACAACGACGCCAGTTCAACATACTGCTCGTATGACCAAGGGAATACCCCAGCCATGCCCGGCTGCTTTCCCTTCGCCTTCCACAGCTTCATTGCGGTGAAGCCGGCACTGTCCAGCGCAACATCGAGCTCGGATAGATCGGTGGCCTCCGGCACGCGGAAACAGCCCTTCTCGGCGTCCCAGAAGGCATTCGCGCTCACCATGACGGGGAAGTCTTCATTGAAGGCGTGAAAGGCCAGCTTTCCTCCGCGGTGGGGAATGCCGACCCGCATAACGAGCTCGTCATCCAGGACGGCATTGTGTTGCCCGGTGGGCAAACGCCCCAGTTCCAGCTGATGATCCATGTTGAAGCACTCCGATTGCTCGAGGGCGCGGCCCTCAGTCAGTCGTCATGCTACGGATTGGCCCTGGCCTGTTCCGAAATACTTCCCTGTCCGTTGGGGATCGGCTTCGCTTCGACCTGAATGTACTCAAGGTTGTCCGTTGGGTGCAGGATCAAACGCCCGCGGTAGCCTGGCACCCGATCATCGACCAGGACCCGCAGATGCGGGCCCTTGGCGGACGTGATGGTGCAGTTCCAGATAACCCCGGCCCCGTCGGTGTAGCGCACACGCACGCCACGCCGGGCGGGCACCCCGTAGGTCCTGCGGATGTACTCAAGGCTCATGGCGCCCCCGCTCAGGCGGCCATCGGGGCCTTGATCGCGGCATGCGATTCGTAGCCCTCGAGGGTGATGTCGGCGGGCTCGATGCGCGTGAAGGCACCGGCGACGTCCTCCAGGCGCTCGATCCGCTTGATGTTGTCCGACAGGATCAGCTTCGGGGCTTCGAGGTGTTCCCTGGCCAGCAGCTCGCGCACCTGGTCGAAGTGGTCCTCGTAGATGTGTGCGTTTGTCGCCTGGATCGTGACCACACCAGGCTCGAACCCGGCCAGCCGCGCCATGATCGCCAGGAAGATCGAGGTCGCCGCGATGTTCGCCGGCGCGCCAAGGAACAGATCCCACGACCTGATCGTCATGACCAGGTTGAGGACACGGGGGTTCTCGAAGGCCACGAAACGGTAGTCCATGTGGCAAGGCGGCAGCGCCATCATGTCCAGTTCGGCCACGTTCCAGCCAGAGACGATCACGCGCCGATCAGAGGGATCGGTGAGCAGCTTCTTCAGCGAGTTCTCCAGTTGGTTGATGGTCCGCTGCATGAGCCATTCGGTCTTCTTCCCGTCCTCGTCGACCGCATCGGCGCACATGCGCACTTGGTAGCCCAGGGCCAGGAGCCGATCACGTTCAGCCGGGGTATCCGCGATGCGGCGATCCATCCACTCGGTCCATTGCTTGCCGTAGATTCGCGATAGGTGGTCGTGACCGCGA
CGGTAGGGACTGGCCAGCCAGGCCGGCGTCTCGTTGGCGTTGATGTCCCAGAAATGGCAGCCCAAGGCGCGGAAGTCGGCGGCGTTGTCGTAGCCCCGGAAAAAGCCCAGCAGCTCGCCGACGATGTTCTTGAAGGGCAGCTTGCGGGTGGTCAGCGCCGGGAAGCCCTGACGAAGGTCGAATTGCACCTGATGCCCCACCAGCGCGCGGCAGAGCTTGTTGGTACGGGTGTTGTACTGATCGACGCCCTGCTCCATCGTCAGGCGCAGCAGCTGGTGGTAGTTTTCCATGGTTCCCTCGCGTAAAAGTAGAAATCGTTTGGACAGAGACAGGATGCCAGATCGATACCCCTTGATCTGGAATCCTTTCCGCTGTCCCTCTTGGATTGGCTGTGTCCGTCAGGCGGCTTTCTTGACCGCCAGCCACGGGGCATTGGCGCGCAGGAATGCCTCCGCCATCACCGGATTGACGCTGTTACCGATCATGCGGACCTGGGTGGCCACGGAGAACTTCCGGCCGTCGTGACCACGGTCGATGATGTAGTCCTCCGGGAAATCCTGGCCGCGCGCCAGTTCGCGGGGCGTAAGCATGCGAAGTCGAATATCGACAATCACATACGGGTCGCCCTTGATGAACACGGTGACCAAGGCCAACCGGTCTCGCGTCGTGATCGTGTTGAGCGGCTTGTCCAGCGCGGACCATTGCCCGCCCTCGCTGTAGTATTCCATCAGGAAGGCGGCGACTTTCAGCGCGCCTGCCTCGTCTTCAGGGGACAGGCCGTTTTCGGTCTCGCCGGCCAGCTCGCATTCCACCAAGGCAGACTTTCCTCCGCCCCCGGCCGTGGCGGTCCCCATCGAATCGTTCGCGGCATGCCCGACACTGTTGCCGAACTGGCGGGACAGGAAAGCTGCCACAAGCGCATGATGCTCCCCTCCAGCGGAGATGGTCTGGATCGGGGCGTTCAGATCCCGGGCGTCGCAGTTGTTGCGCAACTGAGCCAGGTTGAGTGCCACCAGTTGCTGCTGGCTGCCGGTCGTCGTGATCGACGTCATCGACTCATCCAGGCCGCGGCCGATCGTCGCGTTGAAACCGTCGTTTGCCTGCATCATGAAGGCGCTCGACGGGCCGGACGGCTTCGATGGCGACAGCACCGGGGTGGCCAGCATCAGCTCGCCGCGGTGGGCCGTGGTAATCGTGGGCAGAGAGCCCTCAACCGAGTGCATCCGGTCAGACCCTTGATGGGTTGCCGGGACGATGATCGGGGTCGCCACGGCGAATTCTCCGCCCTTGGTCGCCGCCTTCACTGTCCGCAACGGTTCCAGTGCGCTGTGAACCACATCCTGTCCGTTGTAGTGGGCAATCGGAACGATGCTCGGCGTAGCCAAATACTTGTGATTCCCGCCACAGCAGGTCGCCATCGGCTCGTCGACCATGCGGGGGACGTTGTTCATCATGTTGTTCACGATGAACGGCTTCGGGTTGTCCAGGACGAACTTCTTGATCCCCTTGGCGATCCGGCGCTTCGTGGCCGGCGCCAGTTCCTTCTTCCGGCCGAAGATCGACTTGCCCAGGTTGGAGAAGTCGATGCAATCGGCAGCCGGCTTGTAGGCCATCTGCTTTCCGGTGGGACTCTCGAAGTGGATCTGCTCGGGCCAAATGATGGGGTACCCATCCCGGCGCGCGATCATGAACAGCCGCTTGCGAGTGGTATGGCCGCCCAGTTTCGCCGCCACCAGCGCGCGCCACTCGACCACGTAGCCCATCGCCTGAAGCTTCCTGACGAACTTCTTCCAGGTCTCACCTTCCCGCTTGGGGTCGGGGATCAGGTACTGCTGCTGCACCGGCACGCGCTCACCTGGCTCGGCCACAACCTGCCGGACCTTCTTCTTACCCTTCACGACCTCGGTCACCAGCTTGATGACGCGGCCCGTAGCCTTATCCCGCTTGGCGATCAACGGACCCCATTTCAGGATGGCCAGGACGTTCTCGAGCGAAATCAC
GTCCGGCTTGGCTTGCCCAGCCCAGCGCATCCCCGACCAGGACAAGCCCCTGATCTTCTTCGATCGCGGCTGACCGCCTGCTGCTTGCGAGAAATGCGTGCAATCGGGCGAATAGTGGAACCAGCCCACTTCATCACCCATCGTCGCGCCACGAGGATCCACCTCGTAGGCATCGGCGCAGAAATGCCGCGTGGTCGGGTGATTGATCCGATGGCAGCTCACTGCGTCATCGTTGTGGTTGAAGCAGACATCCGGCGATCTGCCGAAAGCCTTCTCGAAGGCGATCGACATCCCGCCCGCGCCGGCGAACCCGTCAACGATCTTTTTCCGGTGGATGTTCAGCAGCAGTTGCTTGCCCATTGTTCGTGCCCTCTGTGAATGACGATTGGGTCAAGTCTGCCTATCTGAGCCAGGCCTTTCTGAAAAAAGCGGGCAGACCCCGGAGGTGTGCCCGCTTCAATCTGCCTGTCCGTAACATCTGCGCTGCTTATGTCTCCTTGTTGGCCAGCTCCAAAAGCACTGCCGCATGGCAGGCGTCCTCATACGGATCATCCTTCCCACACCAGCAGGCCAGATTCTTCCCAGCAAGCTCGGCCCTGGCTTCGGCTACCAGTTTGGGGTTGAGAGGAGCCAGCGACCGATACAACACGAACGCATGCCGCTTGTCCCGAACGATCTGGCCCTTGGTCGGACCGAACGGCGCTGGTTTCCCCGGCACAAAGGGGTTCCCCCACTTTGTCGTCCGATCCACTTTCACCGTGTTTTCCGGCATGCGCCAGCCTTTGGCGCGCTTTAGTTGCACACGTTCTGGCATGGCCGCTACACCGGGTAGCCGTCGTGAATCTCGCCGTCCAGCGTGCGCCCCGCAGCTTTCTTGCCGATCAGGAACATATCCGGCTCATCGTCGATATGGTCATCTTCACGCGCCTTGCCGACGCGCCGGATGTCCCACTTGCCGTTGAACCAGTACGCGGCGTCGATCGCACGGTCGTTGGGGCCAGCAACCTCGCCTGGCGCCCATTCGCCCCATTGTTTGAACAGATAGGGCACGCCGGCGGCTGCGCACTGATCACGCAGCATCCTCGCCCAAGCCGGAAGGCCTGGCCGAGCACCGATACCGCTCTCGAAACCCTGGACAACCCAGTCAATGCCCGGGTCGTAGGCGGTCCATCCTGCGCGCTCACCACAGTGCGGGCAGAGAGGCACGGTTTCCTCGCCATCCTTTACGGTCTCGATTTCGTCGGCGATCACATAGCAGCTGTCCGGACAAACGTCTTTGCACTGCACGCCCACCGGATTGAGCCATCGGGCCAGATCGATCTCCCCCAATTGGGGCTCGCAGCTTACCCAGCGCACTGCCGCTGGCGTGTCCATCAGCAATGGCACCCGCTCGTCAGCCGCTGCCTGATCTTCCACAGAAACGCCCAGCCAGATACGGGGATGGGGGCCATCAAAGTTCATCACCTGATCCCAGATGCCGTCGGGGTCCGTGCCGCCACCGTGATTGACAGCCGCCCGCGCCCAGGCCTCCCGCCGGTCGGTGCTGAAGTAGTCACGCATGCGCTGCGCGCGCTTGGTGAGCACCTGGAAGATATGGCCGTCCTGTTCGTTGCGGCCATACAGGCAGGCCCACATCACACCCATGATGGTGTCGATCCAATCGTCCGGCACGTCCGGGTGGAACAGATCCGAGTGGGCGCAGACGAACACCTGGCGCTGCCGCCCCCAACGAATCGGCTGGTCAAGCCATTCATGGTTCAGGCGCACTTCGCCGGTCCATACCGGACCGTTTTTGGTGTCGATGGTCAGCCCTGCGCGGGACGGGTGATTCTTGAGGCGGCCGCCGGCCAGCTTCATTGCATAACACAGCCGACAACCACCGGAGGTTACAGAGCAACCCGTTATCGGATTCCAAGTGGCATCGGTCCATTCAATCTTGCTGTTGTCAGCCATGAGCACTCCCTTCAGTTGTTGGTGGAGCCATCTTTACAGCCACCACC
TTCCAGTCGCCGAACTCGCGACCGGCTTGCTTCAATCCATCGATCTCCAGCGCGATCTCGCTGGCGTTTTGTTCCGATGCGGCACAGCAAAGCAGCATGGCCGCCGCGAAAGTCGTCACGCGGTCAAGCTGCTCCGCAGCACTGGTCAGCAAAGCGATCTTCTTCTTCACCAACTTGCCGAGTGCGGCGTCGTCGAGGTCGCTCCAGAGTGCCGCAGCCTTCTCTTCCTCGGTAAGTATCAGTTCAGCCACAGACGCCCTCTGCCAGTTCTCTGGGCGTCAGGCCCAGGCTTTCGTTGAACAAGGTCTTTTCCTCGTCCGACAGGGACACGCCTTCGGCAGCGATCACCAACAGGGCGTCGGCCTGAGACCGACCACCCAGCGGGCGGAACTTCACGCCATAGACAGAGGTGCCATGCACCAGGGCCGGGATGATCTCGTCCCAGTTGCCGTCGCTGTCCCGATACACCCAGCGGATGTCTTCCAGGGCAACGTTGAGCGCCTGGATGTGCAGGTGATACAGGAAGCTCACCAGTTCCTCAGCGGAGTTGGTGATACTGGCCCCATGGTTGCCGTCGGTCATGGCCACGACGACTCGCCCATCACGTCCATGCAGGATCGCCACATGGGCCGTAGCCTTGCCCTGGGCCAGCCCGTGACGGGGAACCGCCCCCGCCCGACTCACCACCAGGGGAAACAGGCCACCAATCGGCGGCCGCGTCATCGTTGCTTGCATACCTTTCCCTTCCTAGACGAACAGTGCGCCGCTCTGGCACAGACTGACGACACGCTTGCACTCAGGCACATCCATCCAGGAGATGTGAGCGAAGCGAACCCCAAGGGCTTTGGCCAGTGCCCTGTAGGCCTGAGACCGGTGCCACGGGGTCTTCCCCCGCCACAGCCGGTCAAAGGCGTCATGGGCTTCCCGGCGGGCCTGCATGGTCGGCCCATCAGCCAGCGTGCCGAGCGGGATGTCGGTACCGGGATGGCATCCCACCCGCGCACCGCAGGACGTGCAGCAATAGGCCAAGGGCCATCCATACTCTCGGCCCTTGTAGAAATCGGCGTTGTTGACCAGCTTCACCTCGCCCTTGCAGAAGCGGCAGGTGTCCGGCACGGATACGGGATCACCCTTCACCCGCGCGACAGCCTCGGGCAGATTCACCGTCCGCCCGAACAACTCGTAGGGTTTGAACCGTCTGCGCTTACCGGCCATGGCTTTCACCGTTCGGGCAAGGTTGCAAGGCTCTGGAAGGGCAAATAATCCGCTGCACCGGCAGCATGGCCAGATCGTCGCGGTAGCTCCGCGACCGGTTGAGACGCGGCAGCACAAGCCACCGCCAAACGCCCAGGAGCGATGTTCCCGTACAGACGATCTGCATGGTGACCATGCCGAAGGTGTGCGTCATCACGCCATAGGTCATCCAGGCCGCGTTAGACACCAGGAACAGGACGAACCCATACCCGGACCAGCGATTGTTGAAGGCCAGGAGGAGCGCCCCGGCCGCGCCCCCGATGGCACCAATCCATTCGAGCATGCCGAGGCTCAGCATGATGCTTCCTCCGGCTTGGCCACCGCCATCCGCTGACGGCAGGCCTTCTGGCCAGGGCACGGCATAGACACCCAGCGATAGATGCCAGCGGCCGTCGCATCCAGCGCCAGCACCAGCCCGAGGGCGACCACCCCGGCGGCCTTGAATGATGCCAGGGTCAGGATTGCAGAGAAGCCCCCCCCATCGACGGTGTTTGCGATCCAGCCGACGGCACCGGCGATCATCAGCATGCCGACGAAGAACTGGGCCAGCGCACGCTGGCTGAGGGTGCTGTTGAGAGCTTCCGGGGGGGTGTCCACCACCCAGCTTCCGGTGTGCCCACAAACGACAATGGGGCGATGCCACGCGGCGTAGCGCCAGGTGGGCACGTCGATGTCACGCTTCAGCAGCAGGAGGAGCAGAGCGAATTGGTAGTGCAGGTAATTGCTGATCATGGTGATACTCCAAA
GAGGTTGGTATCAGCCCCCCGGTCTTTTTCCATCCGGCTCGCCGAACCGGATCGAAAAAGAGGCCTCCAGGAGGAGAGGAGGCCGAAAGCCTCAAGGTGAGGCCAGCGGGGGAGAACTGAAACCAAGATTACATAGATGGCATCGGCGATTCTGGGAAAAGATCAGCCTGTGCTTTGGATACGTCGGGGTCCGTAAGGGACGCGCTGCGTGCGTTCCACCTCCCCTGCCCTTGATGTCCGGCAGGGTCATTCACGGGACCGATTTGAAGGCGTCCTTCAGCTCGGGCAGCCGGCCGATCTCGGCCCACTTGCCCTGGAAGCGGCCGCGCTCGATGGTGGGCACCATGAACCGCTCTTCCCGGCGGATCACCAGGCCATCCTCGGACAGCACGTCGCACAGTTCAACATCGCGCTTGGAGCCGCCCCAGTCAGGGGCCAGCACATCCATGCGTGGCTGCACCTTCACCGGCGGCATCCCGTTGTAGCAAAGCGGCCAGCGGGAATGCCAGTTGCCCAGGCTCTCGCAGGTGGCGTACCAGAGGGTCATGTCGTCGCTGACCGAATCGAAGTGTCCATCCAGGACAACATGCTCAACCCGGTGGGTCAGCACGGTGTCGGCGTTGATCGCCCAGCGTGTCACCACGGTATAGGCACTCAGGAACTTCTCGGTCTCGATGAAGAGACCGGCCACAGCCTTCTGGCCGACCTTGCGGCAGCCCACCCAGTAGCACTTGTTCGGGTAGGGCCGGCGAATGGTCAGGTCCTGGTCGCCCTTGAGCACCAGGCCCGTCTCCTCGATGGTCAGATCGACGAGTCGAACGGCGGCATCCAGTCCGCCAGGGACGTACAGCTTGGGTAGAACATGGATCAGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021845|219800:230598|220457_221594_-|WP_152091181.1|DBSCAN-SWA MDRLQQLDLFAPAIAAAGQYDVAVTPEVSEWLAAGAVCALGVSAGKDSVAMAVRVSEYLDQIGHSGPRVLIHSDLGTVEWKDSLPLCERLAKRLGWELIVVRRAAGDMMDRWEVRWKNNIERYASLSTVRIILPWSTPSMRFCTSELKTSIICRELTKRYPGQKILSATGVRHAESAARAKMPIATPQPLLSARGCVGMNWNPVIKWPTPAVFDYIALKQAPLHEAYTVYGSTRLSCTFCIMGSAGDLKASASCPDNADIYRRMVRLEIASTFALQGATWLGDVAPHLLSAEEREGLADAKRRAELRVQAESRLPAHLLYSSGWPTCVPTEEEAQLIAEVRQAVADILGISVGFTTAGEVIARYQELITLRQAKGKGE >NZ_AP021845|219800:230598|223036_224014_-|WP_152091183.1|DBSCAN-SWA MENYHQLLRLTMEQGVDQYNTRTNKLCRALVGHQVQFDLRQGFPALTTRKLPFKNIVGELLGFFRGYDNAADFRALGCHFWDINANETPAWLASPYRRGHDHLSRIYGKQWTEWMDRRIADTPAERDRLLALGYQVRMCADAVDEDGKKTEWLMQRTINQLENSLKKLLTDPSDRRVIVSGWNVAELDMMALPPCHMDYRFVAFENPRVLNLVMTIRSWDLFLGAPANIAATSIFLAIMARLAGFEPGVVTIQATNAHIYEDHFDQVRELLAREHLEAPKLILSDNIKRIERLEDVAGAFTRIEPADITLEGYESHAAIKAPMAA >NZ_AP021845|219800:230598|227967_228459_-|WP_152091188.1|DBSCAN-SWA MQATMTRPPIGGLFPLVVSRAGAVPRHGLAQGKATAHVAILHGRDGRVVVAMTDGNHGASITNSAEELVSFLYHLHIQALNVALEDIRWVYRDSDGNWDEIIPALVHGTSVYGVKFRPLGGRSQADALLVIAAEGVSLSDEEKTLFNESLGLTPRELAEGVCG >NZ_AP021845|219800:230598|228471_228939_-|WP_152091189.1|DBSCAN-SWA MAGKRRRFKPYELFGRTVNLPEAVARVKGDPVSVPDTCRFCKGEVKLVNNADFYKGREYGWPLAYCCTSCGARVGCHPGTDIPLGTLADGPTMQARREAHDAFDRLWRGKTPWHRSQAYRALAKALGVRFAHISWMDVPECKRVVSLCQSGALFV >NZ_AP021845|219800:230598|227669_227975_-|WP_152091187.1|DBSCAN-SWA MAELILTEEEKAAALWSDLDDAALGKLVKKKIALLTSAAEQLDRVTTFAAAMLLCCAASEQNASEIALEIDGLKQAGREFGDWKVVAVKMAPPTTEGSAHG >NZ_AP021845|219800:230598|229974_230598_-|WP_152091191.1|DBSCAN-SWA MLIHVLPKLYVPGGLDAAVRLVDLTIEETGLVLKGDQDLTIRRPYPNKCYWVGCRKVGQKAVAGLFIETEKFLSAYTVVTRWAINADTVLTHRVEHVVLDGHFDSVSDDMTLWYATCESLGNWHSRWPLCYNGMPPVKVQPRMDVLAPDWGGSKRDVELCDVLSEDGLVIRREERFMVPTIERGRFQGKWAEIGRLPELKDAFKSVP >NZ_AP021845|219800:230598|226211_226538_-|WP_152091185.1|DBSCAN-SWA MPERVQLKRAKGWRMPENTVKVDRTTKWGNPFVPGKPAPFGPTKGQIVRDKRHAFVLYRSLAPLNPKLVAEARAELAGKNLACWCGKDDPYEDACHAAVLLELANKET 
>NZ_AP021845|219800:230598|224122_226084_-|WP_152091184.1|DBSCAN-SWA MGKQLLLNIHRKKIVDGFAGAGGMSIAFEKAFGRSPDVCFNHNDDAVSCHRINHPTTRHFCADAYEVDPRGATMGDEVGWFHYSPDCTHFSQAAGGQPRSKKIRGLSWSGMRWAGQAKPDVISLENVLAILKWGPLIAKRDKATGRVIKLVTEVVKGKKKVRQVVAEPGERVPVQQQYLIPDPKREGETWKKFVRKLQAMGYVVEWRALVAAKLGGHTTRKRLFMIARRDGYPIIWPEQIHFESPTGKQMAYKPAADCIDFSNLGKSIFGRKKELAPATKRRIAKGIKKFVLDNPKPFIVNNMMNNVPRMVDEPMATCCGGNHKYLATPSIVPIAHYNGQDVVHSALEPLRTVKAATKGGEFAVATPIIVPATHQGSDRMHSVEGSLPTITTAHRGELMLATPVLSPSKPSGPSSAFMMQANDGFNATIGRGLDESMTSITTTGSQQQLVALNLAQLRNNCDARDLNAPIQTISAGGEHHALVAAFLSRQFGNSVGHAANDSMGTATAGGGGKSALVECELAGETENGLSPEDEAGALKVAAFLMEYYSEGGQWSALDKPLNTITTRDRLALVTVFIKGDPYVIVDIRLRMLTPRELARGQDFPEDYIIDRGHDGRKFSVATQVRMIGNSVNPVMAEAFLRANAPWLAVKKAA >NZ_AP021845|219800:230598|229269_229710_-|WP_152091190.1|DBSCAN-SWA MISNYLHYQFALLLLLLKRDIDVPTWRYAAWHRPIVVCGHTGSWVVDTPPEALNSTLSQRALAQFFVGMLMIAGAVGWIANTVDGGGFSAILTLASFKAAGVVALGLVLALDATAAGIYRWVSMPCPGQKACRQRMAVAKPEEASC >NZ_AP021845|219800:230598|222752_223025_-|WP_152091182.1|DBSCAN-SWA MSLEYIRRTYGVPARRGVRVRYTDGAGVIWNCTITSAKGPHLRVLVDDRVPGYRGRLILHPTDNLEYIQVEAKPIPNGQGSISEQARANP >NZ_AP021845|219800:230598|226543_227677_-|WP_152091186.1|DBSCAN-SWA MADNSKIEWTDATWNPITGCSVTSGGCRLCYAMKLAGGRLKNHPSRAGLTIDTKNGPVWTGEVRLNHEWLDQPIRWGRQRQVFVCAHSDLFHPDVPDDWIDTIMGVMWACLYGRNEQDGHIFQVLTKRAQRMRDYFSTDRREAWARAAVNHGGGTDPDGIWDQVMNFDGPHPRIWLGVSVEDQAAADERVPLLMDTPAAVRWVSCEPQLGEIDLARWLNPVGVQCKDVCPDSCYVIADEIETVKDGEETVPLCPHCGERAGWTAYDPGIDWVVQGFESGIGARPGLPAWARMLRDQCAAAGVPYLFKQWGEWAPGEVAGPNDRAIDAAYWFNGKWDIRRVGKAREDDHIDDEPDMFLIGKKAAGRTLDGEIHDGYPV >NZ_AP021845|219800:230598|219800_220412_-|WP_152091180.1|DBSCAN-SWA MKHAVLRAAALALVLFAGQAGATNWLNVAEKAVQGLQDHSSTVQHSGTIEYAFSPNEGAEKLVLKVIGSARSEIRMMTYSLTSAPVVEALIAARRRGVDVAIVSDYKNNVVEDKSGKARAALSALVNAGCRVRTISIYPIHHDKIVVSDRETVETGSFNYSGAAANKNSENVLVVWKNPELAKGYLEHWNSRFSRGDDYQTRY >NZ_AP021845|219800:230598|221724_222702_-|WP_172974840.1|DBSCAN-SWA 
MDHQLELGRLPTGQHNAVLDDELVMRVGIPHRGGKLAFHAFNEDFPVMVSANAFWDAEKGCFRVPEATDLSELDVALDSAGFTAMKLWKAKGKQPGMAGVFPWSYEQYVELASLLSPSWWSQPDLCCEPEIAQDQAAIDYRINATATLLEGVLRVVYAWQNELAKTCSASVVANMLPPPVPVIQGWSASDYLRSLELLTAVWERWQPWLDAPLLIGVGSVCRRNLHHPTHGLYAILSALEGKLPQGARLHLFGVKGASLSEVKMLDWVASADSMAYDFGARIKARQSGLSNTTAHRSSEMSRWMRTARSRMSAQPGDQLRLPLAA |
13 | Mycobacterium_phage(22.22%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
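
The keyword-based highlighting described for the prophage regions can be illustrated with a minimal sketch. It assumes that a protein is flagged when its annotation text contains one of the listed keywords as a case-insensitive substring; the exact matching rule used by the page is not stated, and the function name `phage_keyword_hits` and the example annotations are hypothetical.

```python
# Keyword list copied from the prophage-region description on this page.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keyword_hits(annotation, keywords=PHAGE_KEYWORDS):
    """Return the keywords found in a protein annotation string.

    Assumption: matching is a case-insensitive substring test. The real
    pipeline may use a stricter rule (for example whole-word matching).
    """
    text = annotation.lower()
    return [kw for kw in keywords if kw.lower() in text]


if __name__ == "__main__":
    # Hypothetical annotations, not taken from the report above.
    for annotation in (
        "phage tail fiber protein",
        "phage terminase large subunit",
        "DNA methyltransferase",
    ):
        hits = phage_keyword_hits(annotation)
        label = "phage-related" if hits else "no keyword match"
        print(f"{annotation}: {label} {hits}")
```

Substring matching is the simplest choice but can over-match (for example, 'plate' also occurs in 'template'), so a whole-word test may be preferable if false positives matter.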
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|