Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP048345 | Escherichia coli strain 124 plasmid p124_A, complete sequence | 0 crisprs | NA | 0 | 0 | 2 | 0 |
NZ_CP048348 | Escherichia coli strain 124 plasmid p124_D, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP048347 | Escherichia coli strain 124 plasmid p124_C, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP048346 | Escherichia coli strain 124 plasmid p124_B-OXA181, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP048344 | Escherichia coli strain 124 chromosome, complete genome | 9 crisprs | RT,csa3,PD-DExK,cas5,cas6e,cas1,cas2,cas3,DEDDh,c2c9_V-U4,DinG | 0 | 17 | 9 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1262 : 51907
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP048345|1262:51907|DBSCAN-SWA GTCACCTCAGGGGAGGCAGTGTACGCAGGATCTCCGCAGCATCCCGCCCGTCACCTGTGAAAGGCACTGCCAGCGTGGCAGCCATATCCAGTGCAAACACTCTGGTATAAACCTCCATCGAACGTGGATCCCTGTGACCAGCCAGTGCCTGGATGACTTTCCGGGGCTGGCGGTGATAGAGCATGTGCATGATATAGCTGTGCCGGAAGGTGTGTGGTGTGACCGGAATCGAAAAGTGTACCCCGTCAGCTTCGGCCCGTCTGACAGCCTGCTTCAGCCAGTTGCGCATGGTTTCGTCGGTCACGGCCCATAATGGTTCACGACGACGGGGCCGGGTGGTGATCATCCAGCTTTCCATCTGCCTGACATAGCTTATATCTGTCAGCGGAACCAGGCGCACTTCATCTTTTGGCGGGCGCCCGCGTCGCGCACGCACTTTTTCGGACAGGATCCGCACAAACGGTCTTACTCCATCCAGGTCAAATGATTCCGGTGTCAGCATCCGGGCTTCGCCAATACGCATTCCGGTATTCCAGAGGGTGGCGAACAGCATATGGTGACGCTGATCCGGCATATAGAAAAGAAGGGCACTCACTTCCGGGGCCAGCAGGTAGGCCGGTGTGGCCCCCGTGGAAGTGGCCATTCTTCTCAGGGAGAGCGCTGTTGCAAAATCCACCCCCGGCGCAATGGGTAACAGGGAGACGCGTTCTGGTGAATTCTGCAGGGGAATGACATTGTTCATGAGTTATTTTTCACGTAGCACACGCACTGTTTATGGATACAGGTATTCTGACCTGAACGCCCGGATCCTTCCACGACAATAAATCGCAGCGCGCGGATCCTTTTGTTTGCTTATCACGCATTATTGCGGGTAAAACCAAATACGTAAAGGTACGTTTTGCAGCAGGGATCCCGTCTTAAGCCAGTCGTGGCGCGGGCTGAGGGCTGTTTTACGGTTTTTGGGAGAAATGGGGTAATGCCAGGGCAAAGATGATCGCAAGGAAAGGAGAAAAATACACAGAGAGGGGAGTTGCGCTACGTGAATTCTGGCTAGGTACACCCCTGTAACCCTGCCAGTTTTCACGCAACGCAATGTGGTGGCTGATTACTGTCAGTTGAAATGACGGCTGGCATCCTGATGCTGACTTAATATACGAATAATAACAATTTCAGTATCAGTCTGCAGAAAATAAACCATATGCCGTTCAACGGGTAGTGCGAATGTGTATTCCCCAGTTCAGGAGGGGGAGTACCGATATTATGATTACTCATAACCTGAAAAATGTCGGATATCTGATTAATATATTTATCAGCCTACTCTTCACCGAAATGGTGGTACCCGTAATGCTAGATATCCTCCAGATCCTGACTGGCTTTAGGTGTGAGTTTAACGGTTTTCACCGGAAGCCCTCGCGCCAGCTTTCACTTTTTTCAGAAACGCATTTTTTTCCCACGACACAGGTTCACCGCTGCTCAAGCCTTCTGCCAGTAAATCCCGCAGGACACCTGGAGACGTGACTCTGCCTGTTTCTCTCGAAGGAGACGAAGTGACTCGCGGATTACCTCACTCTGTGTACGATAATCACCTGATTCTATGAGAGATTCTATGTACTCCCAGAGTTCATCTCCGAGAGCAACTGTCATTGTTCTGGCCATATCAAGCCTCGTGATTGTAAGTTTAAATTTTACATGCAGTTTATTCGATACTCAAAGGTGCTTCCAGTACAGAACATTTGGGAGAATTGGGGTAATGCCTGGAACAGACAGATACGGAGAAAAACCCTGCAGATAACCTTATACGACAACTGAAGGTGTCTGATGCCATATTCTTAGCTACTTCTCTGTCTTCAGGCGGCTTAGTCACAACATAAGACCGATGCATAATAAGCATATTTTTTCATGTTATGGTCTTGTTCATGCATGAACTATTATCTATTCAGTCACTTTTAAAGTTTTATATCATACTGCATATGATATATATCATCTCCTATATTCAGACTAAAAATATTATGACGATGGTCAAATGAATTTAGTGAAAACGCTACCTAAAACAATATGCTTATGATATAACTTATCAAGAATATGCGACAATCTTCTTGAAAGGTTACGGAGAAGAGTCAGATAATGAAGTGTTATAACTCATTCAGCTTTTTTATGTTCTCTTTATAGTAAAACCAGGAGATTAAGATGAATCAGAAGTGGAAGACGTTAATTATTTCATTATTAACACCATCAAGTATTATATCAGGGATTGCGTTGATAGTTCTCTGGGGATATTTCAGTCGCCTGGGCCGACTAGATATTTTCTTTGATGTAATGAATATTAAAAGCATCTTGGTATTGGTTTGTTGTGCAACTATACTATCATTGGCTATGATAGTATTTATCTTTTTTATAACCTCTTTTTTTATACCCGTAGTTATTCCTCAGGATATAAATAATCTTCCTGCTTACAATAAGATTCAGGGTAATTTTTTATCTGTGATGATGTTATCTGGCATGTTCCCCATGGCCTTTATATATGTGTTATACTGTGCATTTGATTTTGATCAGAATGTTAAAGATAACTCTGGCTGGTTATCTATTGCAAGTATGGGTGTACTGATAGCTGTTATTTTAGCCATAGTAAATAAGAGGTATCTTGAATATGACTTATCGTTTAAAAACAATAGAATGAAACTGTTGAGACGTGTTCAGATCTATCTTGTCATTCCGTTGTCTATTGCTTTACTGGTGCATCTTCAGTTAATTCCTCTGGAGATAGTTTTTAGTAATATTGCAACTTCAGATAAGAGTGTTAATTTCAAAGTGATAGCTGAACTGGCTTTCATGTCATATTTTATCTTTCTTCTGACAATGCTTCCTGGAGTCATTTATCTGAAAATGAATCCACAGCATAAATTTTCAAAAAGAATTAGCTATTCATTTATCGCATCCTTGATGTTATTATTAATAATATCGACTCAAATAACCGTGTTACCAGTGATTTTTACCCATTCCGTTATTAAGCTATCAGGTATAAGTGACTTCAAAATACATAGCTACATTATTAAAACAAGTGAATATCCTGAAGAGTTTTTTTCCAATGCAGCATGGGATAAAAAAAATATAAAACCGGGGGATTATTACTCTGTGCAGGCTGTTTCTATGTTTACGACAAACCAGTTTATTCTCCTTTGTCCTAAGGATATTATAAAGTTTTATCGGGAAAGCTGGAAGTTCGAGTTGCTTAATGTAGACTTCGATATAAATACCAGGAAGAAACTCCAGGAAGAAGCTGCATATTGCGTACCAATTTCTGCAATATCTGTAAAACGATGGGATATGCCTCTTCAGGGCTCGAAACCCTCAAGTTAGAAGGCAACTAGGGCGACTGTATTCTGTGACGAAATGAATGGCTGGCTCACTATAATAATGAGCGAACCCATCAGGGAAAAATGTGCAATGGGCGGACGCCAATGGAAACGTTACTCGATGGAAAACGCATCTGGGCCGAGAAAAATTTAAGCCAGATGTAATCTGACAGATACCTGTATAAATAACCGATAACTGTCAGATCAGGTCTGAGCTAATACATTTCAATAAAGGTACTTGCTCGCGCTCTGTCATTTTCTGAAACTCTTCATGCTGCATTTCGATTTTCCTTCTAAGATCATAACTCGCATACCTGTTCTTTTTATGTATATCTATATCTATTTTTGTGACGAGTTCAAACCACAATGTTCGCGCTCCCGGTGATGCTGCCAACTTACTGATTTAGTGTATGATGGTGTTTTTGAGGTGCTCCAGTGGCTTCTGTTTCTATCAGCTGTCCCTCCTGTTCAGCTACTGACGGGGTGGTGCGTAACGGCAAAAGCACCGCCGGACATCAGCGCTATCTCTGCTCTCACTGCCGTAAAACATGGCAACTGCAGTTCACTTACACCGCTTCTCAACCCGGTACGCACCAGAAAATCATTGATATGGCCATGAATGGCGTTGGATGCCGGGCAACTGCCCGCATTATGGGCGTTGGCCTCAACACGATTTTACGTCACTTAAAAAACTCAGGCCGCAGTCGGTAACCTCGCGCATACAGCCGGGCAGTGACGTCATCGTCTGCGCGGAAATGGACGAACAGTGGGGATACGTCGGGGCTAAATCGCGCCAGCGCTGGCTGTTTTACGCGTATGACAGGCTCCGGAAGACGGTTGTTGCGCACGTATTCGGTGAACGCACTATGGCGACGCTGGGGCGTCTTATGAGCCTGCTGTCACCCTTTGACGTGGTGATATGGATGACGGATGGCTGGCCGCTGTATGAATCCCGCCTGAAGGGAAAGCTGCACGTAATCAGCAAGCGATATACGCAGCGAATTGAGCGGCATAACCTGAATCTGAGGCAGCACCTGGCACGGCTGGGACGGAAGTCGCTGTCGTTCTCAAAATCGGTGGAGCTGCATGACAAAGTCATCGGGCATTATCTGAACATAAAACACTATCAATAAGTTGGAGTCATTACCGGAGCCGCATTATTTTCGCTTTATGAATCTAAAGGGTGGTTAACTCGACATCTTGGTTACCGTGAAGTTACCATCACGGAAAAAGGTTATGCTGCTTTTAAGACCCACTTTCACATTTAAGTTGTTTTTCTAATCCGCATATGATCAATTCAAGGCCGAATAAGAAGGCTGGCTCTGCACCTTGGTGATCAAATAATTCGATAGCTTGTCGTAATAATGGCGGCATACTATCAGTAGTAGGTGTTTCCCTTTCTTCTTTAGCGACTTGATGCTCTTGATCTTCCAATACGCAACCTAAAGTAAAATGCCCCACAGCGCTGAGTGCATATAATGCATTCTCTAGTGAAAAACCTTGTTGGCATAAAAAGGCTAATTGATTTTCGAGAGTTTCATACTGTTTTTCTGTAGGCCGTGTACCTAAATGTACTTTTGCTCCATCGCGATGACTTAGTAAAGCACATCTAAAACTTTTAGCGTTATTACGTAAAAAATCTTGCCAGCTTTCCCCTTCTAAAGGGCAAAAGTGAGTATGGTGCCTATCTAACATCTCAATGGCTAAGGCGTCGAGCAAAGCCCGCTTATTTTTTACATGCCAATACAATGTAGGCTGCTCTACACCTAGCTTCTGGGCGAGTTTACGGGTTGTTAAACCTTCGATTCCGACCTCATTAAGCAGCTCTAATGCGCTGTTAATCACTTTACTTTTATCTAATCTAGACATCATTAATTCCTAATTTTTGTTGACACTCTATCATTGATAGAGTTATTTTACCACTCCCTATCAGTGATAGAGAAAAGTGAAATGAATAGTTCGACAAAGATCGCATTGGTAATTACGTTACTCGATGCCATGGGGATTGGCCTTATCATGCCAGTCTTGCCAACGTTATTACGTGAATTTATTGCTTCGGAAGATATCGCTAACCACTTTGGCGTATTGCTTGCACTTTATGCGTTAATGCAGGTTATCTTTGCTCCTTGGCTTGGAAAAATGTCTGACCGATTTGGTCGGCGCCCAGTGCTGTTGTTGTCATTAATAGGCGCATCGCTGGATTACTTATTGCTGGCTTTTTCAAGTGCGCTTTGGATGCTGTATTTAGGCCGTTTGCTTTCAGGGATCACAGGAGCTACTGGGGCTGTCGCGGCATCGGTCATTGCCGATACCACCTCAGCTTCTCAACGCGTGAAGTGGTTCGGTTGGTTAGGGGCAAGTTTTGGGCTTGGTTTAATAGCGGGGCCTATTATTGGTGGTTTTGCAGGAGAGATTTCACCGCATAGTCCCTTTTTTATCGCTGCGTTGCTAAATATTGTCGCTTTCCTTGTGGTTATGTTTTGGTTCCGTGAAACCAAAAATACACGTGATAATACAGATACCGAAGTAGGGGTTGAGACGCAATCGAATTCGGTATACATCACTTTATTTAAAACGATGCCCATTTTGTTGATTATTTATTTTTCAGCGCAATTGATAGGCCAAATTCCCGCAACGGTGTGGGTGCTATTTACCGAAAATCGTTTTGGATGGAATAGCATGATGGTTGGCTTTTCATTAGCGGGTCTTGGTCTTTTACACTCAGTATTCCAAGCCTTTGTGGCAGGAAGAATAGCCACTAAATGGGGCGAAAAAACGGCAGTACTGCTCGGATTTATTGCAGATAGTAGTGCATTTGCCTTTTTAGCGTTTATATCTGAAGGTTGGTTAGTTTTCCCTGTTTTAATTTTATTGGCTGGTGGTGGGATCGCTTTACCTGCATTACAGGGAGTGATGTCTATCCAAACAAAGAGTCATCAGCAAGGTGCTTTACAGGGATTATTGGTGAGCCTTACCAATGCAACCGGTGTTATTGGCCCATTACTGTTTGCTGTTATTTATAATCATTCACTACCAATTTGGGATGGCTGGATTTGGATTATTGGTTTAGCGTTTTACTGTATTATTATCCTGCTATCGATGACCTTCATGTTAACCCCTCAAGCTCAGGGGAGTAAACAGGAGACAAGTGCTTAGTTATTTCGTCACCAAATGATGTTATTCCGCGAAATATAATGACCCTCTTGATAACCCAAGAGGGCATTTTTTACGATAAAGAAGATTTAGCTTCAAATAAAACCTATCTATTTTATTTATCTTTCAAGCTCAATAAAAAGCCGCGGTAAATAGCAATAAATTGGCCTTTTTTATCGGCAAGCTCTTTTAGGTTTTTCGCATGTATTGCGATATGCATAAACCAGCCATTGAGTAAGTTTTTAAGCACATCATCATCATAAGCTTTAAGTTGGTTCTCTTGGATCAATTTGCTGACAATGGCGTTTACCTTACCAGTAATGTATTCAAGGCTAATTTTTTCAAGTTCATTCCAACCAATGATAGGCATCACTTCTTGGATAGGGATAAGGTTTTTATTATTATCAATAATATAATCAAGATAATGTTCAAATATACTTTCTAAGGCAGACCAACCATTTGTTAAATCAGTTTTTGTTGTGATGTAGGCATCAATCATAATTAATTGCTGCTTATAACAGGCACTGAGTAATTGTTTTTTATTTTTAAAGTGATGATAAAAGGCACCTTTGGTCACCAACGCTTTTCCCGAGATCTCATCTATTGAAACAGCTTGATAGCCTTTTTCAACAAACAATATTCGTGCTGAGTTAACCAGTGATTGATAGGTACTCTTAAAATTTTCTTGTTGATGATTTTTATTTTCCATGATAGATTTAAAATAACATACCGTCAGTATGTTTATGGTATCATGATGATGTGGTCGTGACAATCTTAAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCTTTTAAGCCATGCAACGCGCCCTGCGAGACGTAATCGCCATAGCTGGAGATGCCGAGCGCAAAAAGGATCAAGGCTATGGCAGACGGCAGCGTGAAGCCAGCCCAAGCAGCCAGCGCCCCGCTGTATCCAGCCCGAGACAGTCCTACCGCTATGCCGACCTGGCTGCTTGCAGGCCCTGGCAAGAACTGACAAAGCGCGACCAAGTCAGCATAGCTCCGTTCGGAGAGCCAGCGCCGCCGTGTGACAAATTCGGCGCGGAAGTAGCCCAAGTGCGCAATGGGGCCGCCAAAAGATGTCAATCCAAGCCGCAGAAAAATAAGAAAGACCGACCATGGTCTGCTGTCATCGGTAGGGTTATTCGTCATACTTTCGCCTTCATGATCTGCAACGAGTTGATCAATAATAAGCGAAATTCGATAACGAAATTCGATATAAATCTAGAAAAAAATACCTCTATGTGTACTACGCAGTTTTAGCTGTGGCTTTCACAGGAGCACGCTTACTTACGGCTTAGCGTGCTTTATTTTCCGTTTTCTGAGGCGATCCCTAGGAGCTCGGATCTCAGGACGAAGGTCTCCGCGAATGTCCGGTCGATCCGCGCGACGTCCCAGGCGGGCGTTCCCTTGGCGGACATCCACGCCGCAGCGTCGTGCATCAGCCGCACAACCTCGTCGATATCACCCGAGCAGGCGACCCGAACGTTCGGAGGCTCCTCGCTGTCCATTCGCTCCCCTGGCGCGGTATGAACCGCCGCCTCATAGTGCAGTTTGATCCTGACGAGCCCAGCATGTCTGCGCCCACCTTCGCGGAACCTGACCAGGGTCCGCTAGCGGGCGGCCGGAAGGTGAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCGACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCGGATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAATTATGTGCTTAGTGCATCTAACGCATAGTTGAGCGGCGGGCGCAGCCCGTCCGCTTGAACGCCGAGTTAGGCATCAGATGCCCTCGGCGCGGGTCGATGCACTTTTCGCACATGCCGCTCAACGCAAGATTCTCTCAATCGTTGCTTTGGCATATCGAACGAACGCGGCCGTCTCTTCGACGCGCATTGCTAGGTCGTCGTCCTCGCTACCCAGGTACGCCGCGCGTGCCTTGCAGATGAGGGGCCGATGCTCGGCAGGCAAACGCTCCGATACCCATGCGGCAGCAACGTCCTTAGGAGCAATGAGACCAGTTGAAGCGCTGTACCAAATGCGAGCAAGAGCAAGAACGACGTTCCGCTCGTCACCCTTCCAATCCGACTCTGCATTCCACTGGGCAATAGTGTCGAAAAGCGCCTTGGAGAAATGCTCCTTCGGCACCGGCTCGAAAAACGTGGCTGCGGATGGGCCTAGAAGCGCAAGGCTGTGTTGCCTCGCCTTGGTCAGCAAAATCGCAAGATCGTGATCCAGAACGGCAGGCTCGAACGTTCCGGAAAGGATGTCGTGGCGGAGCCACTCACCGAACTGAAGCTCACGCCGCGCCGGATAGCGCCAAGGCACTACTTCGCTTCGAGCGACAACAGTTAGCTCCAGCGGTCGCCATGTTCCGCCATCGCCTGGCGGTGATGAGACTTTCAGCAAATCGAGCATTAGCGCCTGCCGGAGCGAATCGTTAGGTGCGGCGCTGACGGTCACGAGCAAGTCTATGTCGCTGTCCGGCTTCAGCCCTCCATCGATCGCAGATCCGAACAGGTGGATTGTGTCCAGTGTCGCAGCCAGATGGCGCTCGATCACCGCGCGAGCGTGGGACAGCTGCTTGAAAACTTGTGCAGGGAAAAATTCACCCATGATGCCTAACGTTAAGTTCAGCGGCAGCTTTTAAGTTGCGGCTTTGTGGAATACTTTTGCGCAGCAAAACCACAAAGACGCGACTTAAAAGCTGTCCAAGGAGCGAAGCGACTGGTGCTGCAACGCATTGTTAGCCTTTTTTCCAAATCTGGTATGTATAATTTATATTAGACATAAAAAACTGTTCAAAAACCAAATTGAAATTCTCAGGCATTATAGGGAATTTGATATCACCTTCGACTTCAACGTGAACAGTAGACAAATGAATTATATCTGCTTTTTCAATAAGGCTATTATAGATTTGACCCCCGCCAGAGACATATACATGATCTGTAACTTTTGATAGCTCTTTCAAAGCATTTTCTATTGAAGGAAAAACTAGGACGTTTTCATTTGAGCTTGAAATTCCGTTCTTTGACACTACTGCATATTTGCGATTTGGAAGAACACCCATAGAGTCAAATGTTTTTCTTCCGACAAGGAGCCATTGATTATATGTGAGCGCTTTAAAGAGTAGTTGCTCACCTTTTACTGACCACGGGATATCAGGACCACTACCGATTACGCCATTTTCTGACACTGCAGAAATCAATGATATTTTCAATTTAACTCCCTTAATGGCTAACTTTGTTTTAGGGCGACTGCCCTGCTGCGTAACATCGTTGCTGCTCCATAACATCAAACATCGACCCACGGCGTAACGCGCTTGCTGCTTGGATGCCCGAGGCATAGACTGTACAAAAAAACAGTCATAACAAGCCATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTACTTGCATTACAGTTTACGAACCGAACAGGCTTATGTCCACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCCGGCAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGACCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATTTCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGCTGTCGCGTGCACGGGCATGGTGGCTGAAGGACCAGGCCGAGGGCCGCAGCGGCGTTGCGCTTCCCGACGCCCTTGAGCGGAAGTATCCGCGCGCCGGGCATTCCTGGCCGTGGTTCTGGGTTTTTGCGCAGCACACGCATTCGACCGATCCACGGAGCGGTGTCGTGCGTCGCCATCACATGTATGACCAGACCTTTCAGCGCGCCTTCAAACGTGCCGTAGAACAAGCAGGCATCACGAAGCCCGCCACACCGCACACCCTCCGCCACTCGTTCGCGACGGCCTTGCTCCGCAGCGGTTACGACATTCGAACCGTGCAGGATCTGCTCGGCCATTCCGACGTCTCTACGACGATGATTTACACGCATGTGCTGAAAGTTGGCGGTGCCGGAGTGCGCTCACCGCTTGATGCGCTGCCGCCCCTCACTAGTGAGAGGTAGGGCAGCGCAAGTCAATCCTGGCGGATTCACTACCCCTGCGCGAAGGCCATCGGTGCCGCATCGAACGGCCGGTTGCGGAAAGTCCTCCCTGCGTCCGCTGATGGCCGGCAGCAGCCCGTCGTTGCCTGATGGATCCAACCCCTCCGCTGCTATAGTGCAGTCGGCTTCTGACGTTCAGTGCAGCCGTCTTCTGAAAACGACAATGGAGGTGGTAGCCGAGGGTGTGGAAACACCCGACTGCCTTGCGTGGTTGCGGCAGGCGGGTTGCGACACGGTGCAGGGTTTCCTGTTCGCCAGGCCGATGCCGGCGGCGGCCTTCGTCGGCTTCGTCAACCAATGGAGGAACACCGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACGGGCGCATTTCGCCCGGGATCACCATAATAAAATGCTGAGGGCGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGTACCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTTCTCCTATCCCCGGGAACACATCAATCTCACCGGAGAATATCGCTACCAAAGCCTTAGCGTAGGATTCCGCCCCTTCCCGCAAACGACCCCAAACAGGAAACGCAGCTGAAACGGGAAGCTCAACACCCACTGACGCATGGGTTGTTCAGGCAGTACTTCATCAACCAGCAAGGCGGCACTTTCGGCCATCCGCCGCGCCCCACAGCTCGGGCAGAAACCGCGACGCTTACAGCTGAAAGCGACCGGGTGCTCGGCGTGGCAAGACTCGCAGCGAACCCGTAGAAAGCCATGCTCCAGCCGCCCGCATTGGAGAAATTCTTCAAATTCCCGTTGCACATAGCCCGGCAATTCCTTTCCCTGCTCTGCCATAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCAGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCACCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAGGGAATAAGGGCGACACGAAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTACGGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTTTATAAACCGTGGAGCGGGCAATACTGAGCTGATGAGCAATTTCCGTTGCACCAGTGCCCTTCTGATGAAGCGTCAGCACGACGTTCCTGTCCACGGTACGCCTGCGGTAAATTTGATTCCTTTCAGCTTTGCTTCCTGTCGGCCCTCATTCGTGCGTTCTAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGTACCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCACGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCAACGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCGGCCTCAGCATTTTATTATGGTGATCCCCCGGGCGAAATGCGCCCGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCTTAACAAGTTGTTTGGGTGTTCGAATTTCAACAGGTAAGTTAGTTGCTAGAACCCATGGCTCCTTTGCCGACGCTGAGTAGATTTTAGGTGACGGGTGGTGACAATGAGTCCGTGTCGAGCGCTGATTTTTTCGGCCTTTAGAGCGAGATTTATACAATAGAATTTGGCATGAGATTGGATTGCTTTTAGTCAGCCTCTTATAGCCTAAAGTCTTTGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGTTTCCAGTTTTCCGCTCCTAGGTCTGCATATTGTACTTTTCCTCTTACTCGACTTAACCAGTACCAACCCAGCTTCTCAACGGATTTATACCATGGCACTTTAAAGCCAGCATCACTGACAATGAGCGGTGTGGTGTTACTCGGTAGAATGCTCGCAAGGTCGGCTAGAAATTGGTCATGAGCTTTCTTTGAACATTGCTCTGAAAGCGGGAACGCTTTCTCATAAAGAGTAACAGAACGACCGTGTAGTGCGACTGAAGCTCGCAATACCATAAGTCGTTTTTGCTCACGAATATCAGACCAGTCAACAAGTACAATGGGCATCGTATTGCCCGAACAGATAAAGCTAGCATGCCAACGGTATACAGCGAGTCGCTCTTTGTGGAGGTGACGATTACCTAACAATCGGTCGATTCGTTTGATGTTATGTTTTGTTCTCGCTTTGGTTGGCAGGTTACGGTAAAGTTCGGTAAGAGTGAGAGTTTTACAGTCAAGTAATGCGTGGCAAGCCAACGTTAAGCTGTTGAGTCGTTTTAAGTGTAATTCGGGGCAGAATTGGTAAAGAGAGTCGTGTAAAATATCGAGTTCGCACATCTTGTTGTCTGATTATTGATTTTTCGCGAAACCATTTGATCATATGACAAGATGTGTATCCACCTTAACTTAATGATTTTTACCAAAATCATTAGGGGATTCATCAGAGCATGATATCAAAACGCTCTGAGCTGCTCGTTCGGCTATGGCGTAGGCCTAGTCCGTAGGCAGGACTTTTCAAGTCTCGGAAGGTTTCTTCAATCTGCATTCGCTTCGAATAGATATTAACAAGTTGTTTGGGTGTTCGAATTTCAACAGGTAAGTTAGTTGCTAGAACCCATGGCTCCTTTGCCGACGCTGAGTAGATTTTAGGTGACGGGTGGTGACAATGAGTCCGTGTCGAGCGCTGATTTTTCGGCCTTTAGAGCGAGATTTATACAATAGAATTTGGCATGAGATTGGATTGCTTTTAGTCAGCCTCTTATAGCCTAAAGTCTTTGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGTTTCCAGTTTTCCGCTCCTAGGTCTGCATATTGTACTTTTCCTCTTACTCGACTTAACCAGTACCAACCCAGCTTCTCAACGGATTTATACCATGGCACTTTAAAGCCAGCATCACTGACAATGAGCGGTGTGGTGTTACTCGGTAGAATGCTCGCAAGGTCGGCTAGAAATTGGTCATGAGCTTTCTTTGAACATTGCTCTGAAAGCGGGAACGCTTTCTCATAAAGAGTAACAGAACGACCGTGTAGTGCGACTGAAGCTCGCAATACCATAAGCCGTTTTTGCTCACGGATATCAGACCAGTCAACAAGTACAATGGGCATCGTATTGCCCGAACAGATAAAGCTAGCATGCCAACGGTATACAGCGAGTCGCTCTTTGTGGAGGTGACGATTACCTAACAATCGGTCGATTCGTTTGATGTTATGTTTTGTTCTCGCTTTGGTTGGCAGGTTACGGTACCAAGTTCGGTAAGAGTGAGAGTTTTACAGTCAAGTAATGCGTGGCAAGCCAACGTTAAGCTGTTGAGTCGTTTTAAGTGTAATTCGGGGCAGAATTGGTAAAGAGAGTCGTGTAAAATATCGAGTTCGCACATCTTGTTGTCTGATTATTGATTTTTCGCGAAACCATTTGATCATATGACAAGATGTGTATCCACCTTAACTTAATGATTTTTACCAAAATCATTAGGGGATTCATCAGCGTATAGTGTTTTGCAGTTTAGAGGAGATATCGCGATGCATACGCGGAAGGCAATAACGGAGGCGCTTCAAAAACTCGGAGTCCAAACCGGTGACCTCTTGATGGTGCATGCCTCACTTAAAGCGATTGGTCCGGTCGAAGGAGGAGCGGAGACGGTCGTTGCCGCGTTACGCTCCGCGGTTGGGCCGACTGGCACTGTGATGGGATACGCGTCGTGGGACCGATCACCCTACGAGGAGACTCTGAATGGCGCTCGGCTGGATGACGAAGCCCGCCGTACCTGGCTGCCGTTCGATCCCGCAACAGCCGGGACTTACCGTGGGTTCGGCCTGCTGAATCAATTTCTGGTTCAAGCCCCCGGCGCGCGGCGCAGCGCGCACCCCGATGCATCGATGGTCGCGGTTGGTCCGCTGGCTGAAACGCTGACGGAGCCTCACGAACTCGGTCACGCCTTGGGGGAAGGATCGCCCGTCGAGCGGTTCGTTCGCCTTGGCGGGAAGGCCCTGCTGTTGGGTGCGCCGCTAAACTCCGTTACCGCATTGCACTACGCCGAGGCGGTTGCCGATATCCCCAACAAACGGTGGGTGACGTATGAGATGCCGATGCTTGGAAGAGACGGTGAAGTCGCCTGGAAAACGGCATCGGATTACGATTCAAACGGCATTCTCGATTGCTTTGCTATCGAAGGAAAGCCGGATGCGGTTGAAACTATAGCAAATGCTTACGTGAAGCTCGGTCGCCATCGAGAAGGTGTCGTGGGCTTTGCTCAGTGCTACCTGTTCGACGCGCAGGACATCGTGACGTTCGGCGTCACCTATCTTGAGAAGCATTTCGGAACCACTCCGATCGTGCCTCCGCACGAGGCCGTCGAGCGCTCTTGCGAGCCTTCAGGTTAGAGGCCGTCGACAATGATAATCTGGATCAACGGACCTTTCGGCGCCGGAAAGACGACGCTCGCTAAGCGGCTGCGCGATCGGCGTTCCAAATCGCTGATCTTTGACCCCGAGGAAATCGGGTTCGTGGTGAAAGAAACGGTCCCCATGCCAGCGAGCGGAGACTATCAGGATCTCCCCTTGTGGAGGGGACTTACGATCGCGGCGGTCAGGGAGATTCGAAGGAATTACTCGCAGGACATCATCATCCCAATGACGCTCGTGCACCCGGACTATCTGACTGAGATACTCGACGGGGTAAGGCGGATCGACGATCAGCTGCTGCACATCTTTCTGACGCTCAACGAGGACCTATTGCGTCACCGGATCGCGAACCAGACCATGCATCCTGACCCGAATCGAAATGCGGAGATTCGAGAGTGGCGATTAGCGAATGTCGCCCGATGCTTGGCCGCAAGGGAACGGCTTCCATGCACAACCCGTGTTCTCGATAGTGGTGCACACACCAGCGATGAACTCGCAGCGATGGTGCTCGACGGAATCGATGGGCGCACCTGATCGCCTTCGACGCCTGCGCAAAGCGTAGCGCGAGGGTGGCGGGCTCACGACCAAACGCCCAGAGGTCGATCATCGCAGGGATGTTTGGCTTTGTGGTGCGGACGACGGGACTCGAACCCGTACTCTCACAGAGAAGCAGATTTTCGTACCACCTCGACTTTCGCCGCCGTCTGATGACGTTCGTGGTCTGGACTGTCCCTTCGCCATTGCCCGAAGGCTTTAGGCGCCGCCCGTCCAGTCTCTACACCTTCCCCCGAAGGGGCTTGGCTCGGGATTGGCTTAGGGTATTGCCCGTTAGCGTTCCCCGACTTTGAGCGGTTCTACTCCGCGGATTTCCCCGCGGGCACTCCAATTTTAAAGTCTGCTGCGTCTACCGATTTCGCCACGTCCGCCTTTTTTCGCCGTTCCTAGCGCTCGTGCGATGCACCTATGTTGCACCTAGCGCCGAATCGTTCTTCGTCATCCTGAAAAACCACGTCTCCTAAAGCCTTGCATAGCTTATCTTTTCTCCACCACGAACTTTTTTGTGGGATGGTAGAAAAAAAGACTTTTTAAGTCCGCTGGCTTGCCAGGCCTTGTTAGCTTGTACGGTCATGGTTATCGGGTAAAGAATATTGACGGCATCGCTGGTGTCGGTGGCTGAAAAGCCGGCTCCCATCAGGGCAATAGCCATTTCAGATGCAGGCGTACAGGGCAATGGTCAACAGCTACAGCCTGTCTGACGATTCCGGCGTCATGGCTGCGGCGGCTATCACGCATTTTTTGTTCGGTCAGGCGGTGTTTTCGTACCTCAATGGTTGGAGCGTGTTGATCGGACCTGGTACAGGTTTGGACAGCACGGGCTGCAAATACGCAAGGGATTTAATGGGCCTGGTGGCGTTCACGGCTTTTATCGTGACGTTTCTGTTCAGGGGCTACTCATAATCTCGTGGCTCGGCGGTTCCCGGCACACCATGACAGTAAGGAAGGACCCTGTGTCTCAACTCTCCCAGCTTCGAAGCCCCGCCGCCGTGCAGGCTGCCATCGATGAGTTCGTGCAACTGGGCCGCACGAAATTCCTGGCGCGCCACGGCTACGGCAAGTCCCGCGACTTCCTGGTACGTGATCCGAAGACCGGCACCGATTGCGATTCCAAGGCCATCGCCGGTGTGGCCTTCGGCAAGCAATTTCCCGAGCAGGGCCCGCTCACTGCTGACAGCTTCTCCGGTGGCGAGGCGACCGTCGTTCCGGCGCTGACGCGGCTCGGGTTTCGCATCATTCGCATCGGCGAAGACTGGTCCGAAGAAGAGGTCCTGGCCACGGTCGAAGACTATTTCGACATGCTGCGTGCCGAGGCGGCTGGGGAGCCGTACAACAAGTCCGAGCACAACCAGGCACTGCGCCAACTGCTGAACGGTCGCAGCAAGTCTTCAGTCGAGCTCAAGCACCAGAACATTAGCGCCGTACTCGATGCCCTGGGCCTGCCCTATATCAACGGCTACAAGCCACGCGGCAACAGCCAACTGCTGCTGCGTAAATCCGTACACGCCTACGTTCTGGAACATCAGCAGACGGTCGGCGCTCTTGTCGATGCCCTGGAGGAGGTAAAACTTCCGGGTGACAAAACCTACCGAGCGGCTTTGGTAGAACCACCCGCCCGTGAAGTGCTTGTGCGTACCCCGGCATCTCTACGGCAACGCCTACCGCGAAAGTTCGATTATGCCGCTCGCGATGAAGCCAACCGCAAGCTGGGCCGGGCAGGGGAGCAGTGGGTGATTGGCTACGAACAGCAACGCCTGACCGAGCTCGGCCACCCAGAGCTTTTTCAGCGGCTGGATTGGGTGTCCGACACCCAGGGAGACGGTGCGGGGTTCGACATCCTGTCGTTCGAAGAGGACGCCCATGAGCGCTTCATCGAGGTGAAAACCACCAATGGCGGGGTAGGCTCGTCTTTCTTGGTCAGCCACAACGAACTCGAATTCTCCAAGGAGGCGGGCGATCAATTCCATCTGTATCGCGTGTTCCAGTTTCGGGACGGTCCGCGCCTGTTCACGCTACCCGGCGACCTCAGCCAACATGTGCATCTCAAGCCGACGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCGGATTGAATATAACCGACGTGACTGTTACATTTAGGTGGCTAAACCCGTCAAGCCCTCAGGAGTGAATCATGACCGTAGTCACGACCGCCGATACCTCCCAACTGTACGCACTTGCAGCCCGACATGGGCTCAAGCTCCATGGCCCGCTGACTGTCAATGAGCTTGGGCTCGACTATAGGATCGTGATCGCCACCGTCGACGATGGACGTCGGTGGGTGCTGCGCATCCCGCGCCGAGCCGAGGTAAGCGCGAAGGTCGAACCAGAGGCGCGGGTGCTGGCAATGCTCAAGAATCGCCTGCCGTTCGCGGTGCCGGACTGGCGCGTGGCCAACGCCGAGCTCGTTGCCTATCCCATGCTCGAAGACTCGACTGCGATGGTCATCCAGCCTGGTTCGTCCACGCCCGACTGGGTCGTGCCGCAGGACTCGGAGGTCTTCGCGGAGAGCTTCGCGACCGCGCTCGCCGCCCTGCATGCCGTCCCCATTTCCGCCGCCGTGGATGCGGGGATGCTCATCCGTACACCGACGCAGGCCCGTCAGAAGGTGGCCGACGACGTTGACCGCGTCCGACGCGAGTTCGTGGTGAACGACAAGCGCCTCCACCGGTGGCAGCGCTGGCTCGACGACGATTCGTCGTGGCCAGATTTCTCCGTGGTGGTGCATGGCGATCTCTACGTGGGCCATGTGCTCATCGACAACACGGAGCGCGTCAGCGGGATGATCGACTGGAGCGAGGCCCGCGTTGATGACCCTGCCATCGACATGGCCGCGCACCTTATGGTCTTTGGTGAAGAGGGGCTCGCGAAGCTCCTCCTCACGTATGAAGCGGCCGGTGGCCGGGTGTGGCCGCGGCTCGCCCACCACATCGCGGAGCGCCTTGCGTTCGGGGCGGTCACCTACGCACTCTTCGCCCTCGACTCGGGTAACGAAGAGTACCTCGCTGCGGCGAAGGCGCAGCTCGCCGCAGCGGAATGAGCGAACGTCGATATAGCCCGCTCGCGACGCTGTTCGCGGCGACCTTTCTCTTCCGGATCGGCAACGCGGTGGCGGCCCTCGCGCTTCCATGGTTCGTCCTGTCTCATACAAAGAGCGCGGCCTGGGCGGGCGCCACGGCCGCTAGCAGCGTCATCGCGACCATCATCGGCGCGTGGGTTGGTGGTGGCCTCGTCGATCGGTTCGGGCGCGCGCCCGTCGCATTGATCTCGGGTGTGGTGGGCGGCGTGGCCATGGCGAGCATCCCACTGCTCGATGCCGTTGGCGCCCTCTCGAACACTGGGCTGATCGCTTGCGTGGTGCTCGGTGCCGCGTTCGACGCACCCGGTATGGCCGCGCAGGACAGTGAGCTGCCCAAACTCGGCCACGTCGCCGGGCTCTCCGTTGAGCGCGTCTCGTCACTGAAAGCGGTGATCGGGAACGTCGCGATTCTAGGTGGCCCGGCCCTTGGGGGGGCCGCAATCGGCCTGCTTGGCGCTGCGCCAACGCTCGGGCTGACGGCGTTCTGCTCCGTCCTTGCAGGTCTGCTCGGCGCGTGGGTGCTTCCCGCGCGTGCCGCTCGGACGATGACCACGACGGCGACTCTCTCCATGCGCGCCGGCGTCGCTTTTCTCTGGAGCGAACCCCTGCTGCGCCCTCTCTTTGGTATAGTGATGATCTTCGTGGGCATCGTTGGCGCCAACGGCAGCGTCATCATGCCTGCGCTGTTTGTAGATGCAGGACGCCAAGTAGCAGAGCTCGGGCTGTTCTCCTCAATGATGGGGGCTGGTGGTCTCCTTGGCATTGCCATTCATGCGTCGGTCGGCGCCCGGATATCAGCGCAGAACTGGCTGGCGGTGGCATTTTGTGGCTCTGCGGTGGGCTCGCTTCTGCTTTCACAGTTGCCAGGCGTGCCGGTGCTGATGTTGTTGGGCGCGCTCGTGGGACTGCTGACCGGCTCAGTCTCTCCCATTCTCAACGCTGCCATCTACAACCGCACGCCGCCAGAACTTCTCGGCCGGGTACTCGGCACGGTCTCGGCGGTGATGCTGTCAGCCTCGCCCATGGTTATGCTTGCGGCCGGCGCGTTTGTCGACCTTGCTGGTCCGCTCCCTGGCCTCGTTGTATCGGCCGTGTTTGCGGGGCTCGTGGCTCTACTCTCGCTCCGTCTTCAATTTGCTACAATGGCGGCGGCAGCCACAGCCTCCGCCCCAACCCATACAGAAGGTGAACACTGATGCCCCGCCCCAAGCTCAAGTCCGATGACGAGGTACTCGAGGCCGCCACCGTAGTGCTGAAGCGTTGCGGTCCCATAGAGTTCACGCTCAGCGGAGTAGCAAAGGAGGTGGGGCTCTCCCGCGCAGCGTTAATCCAGCGCTTCACCAACCGCGATACGCTGCTGGTGAGGATGATGGAGCGCGGCGTCGAGCAGGTGCGGCATTACCTGAATGCGATACCGATAGGCGCAGGGCCGCAAGGGCTCTGGGAATTTTTGCAGGTGCTCGTTCGGAGCATGAACACTCGCAACGACTTCTCGGTGAACTATCTCATCTCCTGGTACGAGCTCCAGGTGCCGGAGCTACGCACGCTTGCGATCCAGCGGAACCGCGCGGTGGTGGAGGGGATCCGCAAGCGACTGCCCCCAGGTGCTCCTGCGGCAGCTGAGTTGCTCCTGCACTCGGTCATCGCTGGCGCGACGATGCAGTGGGCCGTCGATCCGGATGGTGAGCTAGCTGATCATGTGCTGGCTCAGATCGCTGCCATCCTGTGTTTAATGTTTCCCGAACACGACGATTTCCAACTCCTCCAGGCACATGCGTAAACGGAGGTGTGCAGAGTCCCTGCGGCAGGCGACGAACACGACCGTCGTCGATTAGTACCGGTACGGTCGGTGGTATCGAAGTCTTGATCACCACTCAGGTCTACGGCTTACAAATGGTGACCATCCCGATACTTGCGTCAGAGCACCGGGCCGATTCTTTGACAGTGAATCACTCCCGTAAGGTTGTGCCGGTGTGGGTGTCCCGGGTCGAGACGATACTCCGCCAATGCGCCCAGCAAACAACCTGGCCATCGCAGGTGGTGGGGAGCGGTGTGGCGGATGAGTTGGACAAGTTGGTGTAGCAGCACGAGCACGGCGAGATAACATCGCAGGAGTTCGACATGCTCAAGAGACAGCTGATTGCGAATCGCGATGCAGATTCATAACCCGATTGCGGGTTGGCTTCACTCCACCATCACCGAGCAGACTAGCACGGCGGGCTCTGTTGCAAAGATTGGCGGCAGTCAGAGGTAGGCTGTCGCTCTGCGCCGATCAGGCGGCTGCTGCGAAATGGTGGTTGAGCATGCCCATGGCCTCCGTCAGCGCCGAGGGCCCAATGCCAAAAGCTCTCTCCACAAGGCGCACCTCGCCCCTGATGCCGGGCTGCAGGCACCAGGGGCGAGCCTGTCCTTTGCGCAGGGCTCGCATGACTTCGAATCCCTTGATCGTGGCATAGGCCGTGGGGATCGATTTGAAACCGCGCACCGGCTTGATCAGTATCTTGAGCTTTCCGTGATCGGCCTCGATCACGTTATTGAGATACTTCACCTGCCGGTGGGCCGTCTCCCGGTCCAGCTTTCCTTCGCGCTTCAATTCGGTGATCGCTGCACCATAGCTCGGCGCTTTGTCGGTATTGAGCGTGGCAGGCTTTTCCCAGTGCTTCAGGCCTCGCAGGGCCTTGCCCAGGAACCGCTTCGCTGCCTTGGCGCTGCGGGTCGGCGACAGGTAGAAATCGATCGTGTCGCCCCGCTTGTCGACTGCCCGGTACAGGTAGGTCCACTTGCCCCGCACCTTGACGTAGGTTTCATCCAGGCGCCAGCTCGGATCAAAGCCACGCCGCCAGAACCAGCGCAGCCGCTTCTCCATCTCCGGGGCGTAGCACTGGACCCAGCGATAGATCGTCGTATGGTCGACCGAAATGCCGCGTTCCGCCAGCATTTCCTCAAGGTCGCGATAGCTGATCGGATAGCGACAATACCAGCGCACCGCCCACAGGATCACATCACCCTGGAAATGGCGCCACTTGAAATCCGTCATCGTTCCGTCCGTCCAATCTCCGCCAAGCATGCTCAAGCTTCACGATTTTTGCAACAGAGCCCACACGAGTATTGAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTACGCCCAGACACATTTCAAAGGCCCGGCATCCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGTAGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGCCTCAGCATTTTATTATGGTGATCCCTGGGCGAAATGCGCCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCTTTAAGCGTGCATAATAAGCCCTACACAAATTGGGAGTTAGACATCATGAGCAACGCAAAAACAAAGTTAGGCATCACAAAGTACAGCATCGTGACCAACAGCAACGATTCCGTCACACTGCGCCTCATGACTGAGCATGACCTTGCGATGCTCTATGAGTGGCTAAATCGATCTCATATCGTCGAGTGGTGGGGCGGAGAAGAAGCACGCCCGACACTTGCTGACGTACAGGAACAGTACTTGCCAAGCGTTTTAGCGCAAGAGTCCGTCACTCCATACATTGCAATGCTGAATGGAGAGCCGATTGGGTATGCCCAGTCGTACGTTGCTCTTGGAAGCGGGGACGGACGGTGGGAAGAAGAAACCGATCCGAGTACGCGGAATAGACCAGTTACTGGCGAATGCATCACAACTGGGCAAAGGCTTGGGAACCAAGCTGGTTCGAGCTCTGGTTGAGTTGCTGTTCAATGATCCCGAGGTCACCAAGATCCAAACGGACCCGTCGCCGAGCAACTTGCGAGCGATCCGATGCTACGAGAAAGCGGGGTTTGAGAGGCAAGGTACCGTAACCACCCCATATGGTCCAGCCGTGTACATGGTTCAAACACGCCGGGCATTCGAGCGAACACGCAGTGATGCCTAACCCTTCCATCGAGGGGGACGTCCAAGGGCTGGCGCCCGCCCCGCCCTCATGTCAAACGTTGGGCGAACCCGGAGCCTCATTAATTGTTAGCCGTTAAAATTAAGCCCTTTACCAAACCAATACTTATTATGAAAAACACAATACATATCAACTTCGCTATTTTTTTAATAATTGCAAATATTATCTACAGCAGCGCCAGTGCATCAACAGATATCTCTACTGTTGCATCTCCATTATTTGAAGGAACTGAAGGTTGTTTTTACTTTACGATGCATCCACAAACGCTGAAATTGCTCAATTCAATAAAGCAAAGTGTGCAACGCAAATGGCACCAGATTCAACTTTCAAGATCGCATTATCACTTATGGCATTTGATGCGGAAATAATAGATCAGAAAACCATATTCAAATGGGATAAAACCCCAAAGGAATGGAGATCTGGAACAGCAATCATACACCAAAGACGTGGATGCAATTTTCTGTTGTTTGGGTTTCGCAAGAAATAACCCAAAAAATTGGATTAAATAAAATCAAGAATTATCTCAAAGATTTTGATTATGGAAATCAAGACTTCTCTGGAGATAAAGAAAGAAACAACGGATTAACAGAAGCATGGCTCGAAAGTAGCTTAAAAATTTCACCAGAAGAACAAATTCAATTCCTGCGTAAAATTATTAATCACAATCTCCCAGTTAAAAACTCAGCCATAGAAAACACCATAGAGAACATGTATCTACAAGATCTGGATAATAGTACAAAACTGTATGGGAAAACTGGTGCAGGATTCACAGCAAATAGAACCTTACAAAACGGATGGTTTGAAGGGTTTATTATAAGCAAATCAGGACATAAATATGTTTTTGTGTCCGCACTTACAGGAAACTTGGGGTCGAATTTAACATCAAGCATAAAAGCCAAGAAAAATGCGATCACCATTCTAAACACACTAAATTTATAAAAATCTAATGGCAAAATCGCCCAACCCTTCAATCAAGTCGGGACGGTAAAAGCAAGCTTTTGGCTCCCCTCGCTGGCGCTCGGCGCCCCTTATTTCAAACGTTAGACGGCAAAGTCACAGACCGCGGGATCTCTTATGACCAACTACTTTGATAGCCCCTTCAAAGGCAAGCTGCTTTCTGAGCAAGTGAAGAACCCCAATATCAAAGTTGGGCGGTACAGCTATTACTCTGGCTACTATCATGGGCACTCATTCGATGACTGCGCACGGTATCTGTTTCGGACCGTGATGACGTTGATAAGTTGATCATCGGTAGTTTCTGCTCTATCGGGAGTGGGGCTTCCTTTATCATGGCTGGCAATCAGGGGCATCGGTACGACTGGGCATCATCTTTCCCGTTCTTTTATATGCAGGAAGAACCTGCATTCTCAAGCGCACTCGATGCCTTCCAAAAAGCAGGTAATACTGTCATTGGCAATGACGTTTGGATCGGCTCTGAGGCAATGGTCATGCCCGGAATCAAGATCGGGCACGGTGCGGTGATAGGCAGCCGCTCGTTGGTGACAAAAGATGTGGGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACGGGCGCATTTCGCCCGGGATCACCATAATAAAATGCTGAGGCACGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTATTTTTCGGGGATCTGATTGCCCTCTGGCAATATCATTCAGCACGCCATAGTCGGCATCATGGTCCATTCGCCAGAAAACCGAAGCACCGGCATCAGCCAGGCGTGGAGAAAACTGGTATCCCAGCAGCCAGAAAAGGCCAAAGACAAGTTCGCTGGCACCTGCTGTATCGGTCATAATTTCGGTTGGATTCAGCCCGGTCTCCTGTTCCAGAAGACCTTCCAGCACAAAGATAGAGTCCCTCAGCGTCCCCGGTATAACGATGCCATGAAAGCCGGAATACTGATCGGACACAAAGTTGTACCAGGTGATCCCTCTGTTATTACCAAAGTATTTGCGGTTCGGTCCGGCATTGATTGTTCTGACTGGCGTAACAAAGCGCATTCCATCTGCAGATGCCACTTCTCCTCCACCCCATATCTGTGCCAGTGGCAGCGTTGCCTGAAAATCAACCAGTCTGGCATTAGCGCTGGTGATAGTTTCAGCCCGCAGATAGTTCGCTTTTGTCCAGTTCAGCCGGTGTCGGGTCAGTGCAGGAACATTTGATCTGATCAGTGGTTCCAGACCGATATTGCAGGCTTCAGCCATCAGCACGGCGCTGATGCTGACGGGCAGATCATCAACTCTGGCACTGGCTTCACTAGCATGGAAAAACTCATCAGCAAATCCGGTATGGGCGTTAATTTCGAGCAGCAACTCCGTTAAATCCACCGGAGGGAGTAGATCACTGATCATTTTGCTCAGTCGTTTCAGACTGTCCGGCTCATCAAGACTGGCGAGGGGAGAAATTGTCAACCGGGGCTTCGGGCCAGAAACATCGAGTTCGACAGCCTCATTTTCGCAAAGACGTGCAGCAACCTGTCTGTAACGACTATCAAGCTGATGACCCAGAGATTTTATTGCTTCCTGCGGGTCTGTCGGGTGCCCCAAAGAACGATAAACCTTAATCCGGTTTGCCTGCCAGTCAGCACCCTGTAGTAATCTTGCACGAGGATCTCCCCACCGGTTACTGCCGGTAACGTAGACATCCCTCCGCCTCAGACTATCCTGCAGTTTACTGAGAAAGCAGAGCGTGTATCCCCTGCGGGTGATATGTTTTTCCTTGTTAATCACCAGCCGTTTCCATGACCGACTGATAATTTCCGTTGGTGCGTCGTCAAAAAACTGCCGCCGTGAGCTGAACTCCCGGCTGAGGTAGTCACAGGCATTCAGAGTGGTAACCCCGGCAGGTGCGGATGAAAATTTAACGGTATTCAGCAGATGGGGCAGGAAACGACGAACGCGCCCGTACTGCTCCACCATTTCTTCATGAAAATTATCGTCTGAGGGCCGGGCAATTTCACGGACAAGCGTGATGATTTCAGCCAGCTTTTGCCTTGGGATGTAGCTGAACACCTCAGCACGAATCGATTCGTCCGGTGTTTCTTCTTTCAGCAGGTACGAACATGCGCTGGCGAGCGCCAATGCAGATTTATCCAGATCCTTCAGCGAGCGGAGCCGTTTTTTCTGCCCAATCTTTCTGGCGTCACGGATGATAACGGCCAGCATGGCGTCCAGAACGTCCAATGCATCATCCAGCGCCAGCGTTTCCCATGCAAGGACAAAGGCAACCAGAACCGCCATCCTTTTCTGCGGTGACATCCTGGCAATATTGAACACCGAAGTCATACCAGCATAACGTGCGAGATTTTTCAGGCGCACAGCCGGGAGTGTACTCAGGTTTTCAGCATGCAGGCCAAAATCGTTCAGAGTTTTCCAGCGTTCAATTGCTTCATTAAACGCCGGACCACTGATGGTCACAGGGCCCTTTTTCAGTGATTCCAGTAAAGACAGGCGGCTGCAATCAGTTGGCCCCAGCAGCATCTCCAGCTGTGAACGCTGTTCGGCTGACGGTATCAGTGCCAGTTTGTTCCACAGGCGCAACGTCGCCTTTTCCCTTACCTCTGAAATCAACCGGGTCAGCGTAGTGGCTCCGGGGAGAATAATACGATGTTGCATAAGCCACCCTGTCGCCAGATCGAAAAGCAGGCCAGGACGTTCGTTGCTTATCCAGCTCCGGGTATATAAAAGACGGGTAAGGCGAAATGTCCAGGGCCAGGCAAATTCACGATACTGATAGTGCTGACGTATCAGCGCTGCATGCTCACGGCGGGTATTTTCCCTCTGACCGTATTCTGCAAGAACGGTGATATCACGAATCCCGAGCTGTCTGGCGGTAAAATGCCGGACGCCGGAAGGAATATGACCTGGGACCTACGTGCGCCCGCACCGACACCCTCACACCTTCGAGCTACTGTTGCCATTAAGGGGTCGTTTCGTGGTGCTGAATTTTGACGATCGGGGTACCGTCACCCATCGGGCGATATTGGGGGAAACCTGTACGGTGCTGGAGATGGCCGCAGGAACCTGGCATGCCGTGCTGTCGCTGGATACCGGTGGCATAATTTTTGAAGTAAAACACGGTGGCTATCAACCCGTGGCTGCCGATGACTATGCGCACTGGGCTCCAGCGGAAGGAGAACCAGGAACCACGGAGCTTATGGCCTGGTATGCGCAAGCGCAGGTGGGCGACAGCACTTTTGCCGTCTAAGGCGATAAACAAAAACGGAATGAGTTTCCCCATTCCGTTTCCGCTATTACAAACCGTCGGTGACGATTTTAGCCGCCGACGCTAATACATCGCGACGGCTTTCTGCCTTAGGTTGAGGCTGGGTGAAGTAAGTGACCAGAATCAGCGGCGCACGATCTTTTGGCCAGATCACCGCGATATCGTTGGTGGTGCCATAGCCACCGCTGCCGGTTTTATCCCCCACAACCCAGGAAGCAGGCAGTCCAGCCTGAATGCTCGCTGCACCGGTGGTATTGCCTTTCATCCATGTCACCAGCTGCGCCCGTTGGCTGTCGCCCAATGCTTTACCCAGCGTCAGATTCCGCAGAGTTTGCGCCATTGCCCGAGGTGAAGTGGTATCACGCGGATCGCCCGGAATGGCGGTGTTTAACGTCGGCTCGGTACGGTCGAGACGGAACGTTTCGTCTCCCAGCTGTCGGGCGAACGCGGTGACGCTAGCCGGGCCGCCAACGTGAGCAATCAGCTTATTCATCGCCACGTTATCGCTGTACTGTAGCGCGGCCGCGCTAAGCTCAGCCAGTGACATCGTCCCATTGACGTGCTTTTCCGCAATCGGATTATAGTTAACAAGGTCAGATTTTTTGATCTCAACTCGCTGATTTAACAGATTCGGTTCGCTTTCACTTTTCTTCAGCACCGCGGCCGCGGCCATCACTTTACTGGTGCTGCACATCGCAAAGCGCTCATCAGCACGATAAAGTATTTGCGAATTATCTGCTGTGTTAATCAATGCCACACCCAGTCTGCCTCCCGACTGCCGCTCTAATTCGGCAAGTTTTTGCTGTACGTCCGCCGTTTGCGCATACAGCGGCACACTTCCTAACAACAGCGTGACGGTTGCCGTCGCCATCAGCGTGAACTGGCGCAGTGATTTTTTAACCATGGGATTCCTTATTCTGGAAGATACGAAATAACAACAACATGAATAGTCCCTAAATTCCACGTGTGTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACACATTGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGATAGTTTTCCGACAACTTCATGATTACATATCTTGCGGGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACGGGCGCATTTCGCCCGTGGGATCACCATAATAAAATGCTGAGGCCGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCACCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCGCCGATATGATCCAACTGATAAAGGAATTTGACGCTCAGGGCGTGGCAGTCCGGTTCATTGATGACGGGATCAGTACCGACGGTGATATGGGGCAAATGGTGGTCACCATCCTGTCGGCTGTGGCACAGGCTGAACGCCGGAGGATCCTAGAACGCACGAATGAGGGCCGACAGGAAGCAAAGCTGAAAGGAATCAAATTTGGCCGCAGGCGTACCGTGGACAGGAACGTCGTGCTGACGCTTCATCAGAAGGGCACTGGTGCAACGGAAATTGCTCATCAGCTCAGTATTGCCCGCTCCACGGTTTATAAAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGGTAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTTCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGTGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCTGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCTTGATCGAGGTGGCCAGCCGCAGGATTTCGTCCCAATGGGCGCGGACGTGCTTGATGTTGAGCGTGCCGCCGATCATCGGCTTGAGCGCGTCATAGGCGGCATCGCCCTTCGGGATGTAGAGCTTGGTGTCGCCCAGGTCGCGGATGCGCGGCGCGAAGCGGAAGCCCAAGAGGTGCATCAGGGCGAAGACGTGATCGGTGAAGCCCGCCGTGTCGGTGTAGTGCTCCTCGATCCGCAGGTCGGATTCGTGGTACAGCAGGCCGTCGAGCACGTAGGTTGAGTCGCGCAGGCCGACATTGACCACCTTGGTGTGGAATGGCGCGTATTGGTCGGAGATGTGGGTGTAGAAAGTCCGTCCTGGGCTGCTGCCATATTTTGGGTTGATGTGCCCCGTGCTCTTTGCCTTGCTAGCGGTTCGGAAATTCTGTCCGTCCGATGATGATGTGGTGCCATCGCCCCAGTGCCCGGCAAAGGGATGCCGAAACTGAGCGTTGACCAGTTCAGCCAACGCTGTCGAGTACGTTTCGTCGCGGGTATGCCAGGCTTGCAGCCAAGCGAGCTTCGCGTAGGTCGTGCCGGGGCAGGACTCGGCCATCTTGGTCAGGCCCAGGTTGATCGCGTCGGCCAGGATCGTGGTCAACAACAGGTTCTTGTCCTTGGCCAGATCGCCCGATTTCAAGTGCGTGAAGTGCCGGGTGAAGCCCGTCCACTCATCGACTTCGAGCAGCAGTTCGGTGATCTTGACGTGCGGCAGGACCATGGCTGTCTGGTCTATCAGCGCCTGCGCGGTGTCGGGCACCGCCGCATCCAGCGGCGTGATCTTCAAGCCCGACTCGGTGATGATGGCATCCGGCAGGTCGTTGGCTGCCGCCATGCGGTTGACGGTGGCAAGTTGTGCTTCCAGCAGCGTCAGCCGCTCATGCAGATATTGTTCGCAGTCGGTGGCCACGGCCAGCGGCAATTCGCTGGACTGCTTGAGGCTGGTGAACTTCTCGGGCGGTACCAGGTAGTCCTCGAAGTCCTTGAACTGGCGTGAACCCTGCACCCAGATGTCGCCCGAGCGCAGGGAGTTCTTCAACTCGGACAGCGCGCACAGTTCGTAGTAGCGCCGGTCGATGCCGGCGTCGGTCATCACCAGTTTCTGCCAGCGCGGCTTGATGAAGCCGGTCGGTGCATCGGCTGGCAGCTTGCGGGCGTTGTCGGTGTTCATGCCGCGCAGCACCTCAATGGCATCAAGCACGTTTTTGGCGGCGGGCGCGGCCCGCAGCTTGAGCACGGCAAGGAATTCCGGTGCATAGCGGCGCAGGGTGGCGTAGCTCTCGCCGATGCGATGCAGGAAATCGAAGTCATCGGGTTGCGCGAGCTTCTGCGCCTCGGTGACGCTCTCGGCAAAGGAATCCCAGGACATGACGGCCTCGATGGCGGCAAACGCATCGCGGCCTGATTGCTTGGCGTCGATCAGCGCCTGACCGATGCGCCCGTACAGACGTACCTTGGCGTTGATGGCCTTGCCTGACGCCTGGAACTGCTGCTGATGCTTATTCTTGGCAGCGTTAAACAGCTTACCCAGGATGCGGTCGTGCAGGTCGATGATTTCGTCGGTGACGGTGGCCATGCCCTCGGTGGCCAGCGCCACGAGAGTGGCGTAGCGCCGTTGCGGCTCGAATTTGGCCAGGTCGGCGGGTGTCATCTGGCCGCCCTCGCGGGCAATCTTGAGCAGGCGGTTCTGGTGAACCAGCCGCTCGATGCCGGTAGGCAGATCGAGTGCCTGCCATGCCTTGAGGCGTTCGATGTGTTCCAGCATATGCCGCGAATTTGGCTTGGCCGGAGACTGGCGCAACCAAGCCAACCAGGTCGTCTTGCCGTTGTCCCGGCGCTTGAGCAGATCGTCGAGGCGGCGGCGATGCGCGTCCGCCAGTGGTTCGGCCAAGGCGTCGTAGATGCGCCGGTTAGCACGGGTGATCGCCTCGGCACTCGCCCGCTCGACGGCGTTGAGGGCGGGCAGAATGACCGACTGCCGCCGCAGGTGCCCGATCAAGGCGCTGGCCAGCACGATGCCTTTGTCGGTTTGCATCGCCAGCTCGGTCAGCATCTGGACGGCCTGCCGGTAATGGCTCATGGTGAAGGGCCGGAAACCGAACACGGTTTGCAGCTCGCTCAGGTGCTCGCGCCGGGTCTGCTCCCGCTGGCCGTACTCGTTCCAGCTTTCGACGCCGACCTTGAGCTGGTCGGCGACCAGCTTCAACAAGGGCGGGAACGGTAGTTCATCGACGCCCAGGATGACGCCGGGAAAGCGCAGGTAACAGAGCTGCACCGCGAAGCCCAGCCGATTGGCTGGCCCGCGCCGCTGTCGGATGATCGAGAGGTCGGTATCGTTGAATGTGTAATGTCGGATCAGGTCGTCCTTGGAGTCCGGCAACGCCAGCAGGCTTTCCCGCTCGGCGGCGGACAGGATGGAACGACGTGGCATATTTACTGATCCGTTCTCAAGTATTGATACAGGGTTTCGCGACTGATTCCGAATTCACGAGCAAGCTTGGTCTTTTGCTCGCCAGCCTCGACACGTTGGCGCAGTTCGGCAATACGCTCAGACGACAGGGATTTCTTCCTGCCACGGTAAGCCCCGCGTTGCTTGGCGAGCGCAATACCCTCGCGCTGACGCTCGCGGATCAGGGCGCGCTCGAACTCGGCGAACGCGCCCATCACCGAGAGCATCAGGTTCGCCATCGGAGAGTCTTCGCCAGTAAAACTGAGGTGTTCCTTGACGAATTCGATATGCACGCCGCGTTGTGTCAGCGTTTGCACGATCCGGCGCAAATCATCGAGATTGCGCGCCAGGCGATCCATGCTATGCACCACCACGGTGTCGCCGGTGCGGGCGAAGCTTATCAGCGCTTCCAGTTGCGGACGCTTGACATCCTTGCCGGATGCCTTGTCGCTAAAAGCGCGATCAACCTTGACGCCTTCCAGTTGCCGTTCCGGGTTCTGGTCGAAGGTGCTGACCCTGATATACCCAATGCGCTGTCCAGTCATGGAATTCCCTAAGTAGTTATATACTATGAAAAGCGTGGTAATGCTGAAAACTATATCAAAGAAGCCAAATACGACATGGCGGTGGGTCATCTCTTGCTAAAGTCATTTTGGGCGAATGAAGCCGTGTTTCAAATGATGATGCTTTCATATAACCTATTTTTGTTGTTCAAGTTTGATTCCTTGGACTCTTCAGAATACAGACAGCAAATAAAGACCTTTCGTTTGAAGTATGTATTTCTTGCAGCAAAAATAATCAAAACCGCAAGATATGTAATCATGAAGTTGTCGGAAAACTATCCGTACAAGGGAGTGTATGAAAAATGTCTGGTATAATAAGAATATCATCAATAAAATTGAGTGTTGCTCTGTGGATAACTTGCAGAGTTTATTAAGTATCATTGCAGCAAAGATGAAATCAATGATTTATCAAAAATGATTGAAAGGTGGTTGTAAATAATGTTACAATGTGTGAGAAGCAGTCTAAATTCTTCGTGAAATAGTGATTTTTGAAGCTAATAAAAAACACACGTGGAATTTAGGTTAGACTATAAATAGAAAAAGGTGTTTTGACAATGACAGAGATTATCAAAGATTTATACCAATTTACAGAGGTAATGGAGCCGATTAAGCTTTCAATGCACCAATACTTATTGATGACAAATGAGCCTGTTCTCATTCAGACCGGAGCCGTATCACAAGCGCAAACTACCATTCCTAAGTTGCAAGAGTTGCTCGGTGAACGCAAAATAAAATACATTCTAATTTCTCATTTTGAATCAGACGAATGTGGCGGACTTGCTTTGGTTCTCAAAGAACATCCCGAAGCTGTCGCGGTTTGTTCCGAAACCACAGCAAGACAACTGATGGGGTTTGGTATTACTAACAATGTGCTTATTAAAAAGCCGAATGAAATATTTGCCGGAGATGATTTTGAATTTCAAACTATCAGTTACCCATCTGAAATGCATATGTGGGAAGGACTTTTATTTTTTGAAAAGAAACGTGGTATTTTCTTTAGCAGCGACCTCATGTTCGGCATGGGTGAGAATCACGGACAAGTTATCGAAAGCAGTTGGGACGCAGCGGTAAAATCAAGCGGCGCAGATACATTGCCCAATCAAGAATCCGGACAAAAGCTATCCTCGGATTTGAGTGAGATTGAACCTAAGTTCGTTGCTTCCGGTCATGGGTTTTGCATTACGATTGTAGGATAAATCAAGCAAATAGACCTTGCAAAATAAGCATTTGCAAATATACCACTTCATAAAACCGTATACTGTAATTATCAAGATTTTGAACGATATTGAAAAACAGTAACTTAAAGGCTCGATATATTTATAATATATCGAGCTTTTTTCATGTCTAAAAACTTGGGGGTATATAGTGAAAAATACCGTCCCAACGTCCCAAAGGCTGATAAGTAGGCGCATTATATGGGTTTAACGTACACATCACCCGCTTCCAGTGAGTTGCGTAGTAGGCGATAGACCAAAAACTCATAGCGATCTACATCAAGCGTTTTAAATGTTTTGCCCTCTTTACTAAACAAATATCGGCGTAAACCTTTAGGAATAATCTCAGTCGGAAATGAATTAGGGTCAGTTTGCCTTGGTGATTTTTCTGTGCGCAATAAGTTTTGTAAAAACGCGATAGCTTCAAGCAAAGGAGAGTCTTCTACACGTCCGGCAAAATCCAGATCAGTAAAAAGTTGCCTTAAGTTACGTTTGAATGTGGCGGATAATTTTGTGTAATGTGACCATTCAAATGCCGTTTTGTCGAAAGCAATATTGCGTAAGTAATCAGCAACTAATGGGAATCTCTCTTGTTCAAGCAATGCATAGGCTTTTTCTTTAATAATGGAAAAAGGTGTGTCATCGGTGATGGTGTCATCCGTAAACAGGCTCAATACATGACCCGCAGCCTGTAAATTTTTCGCTGCATTGGTAACCGCATTATTCATTGCTTCTTCAGCGGCACGCTTGGCCTGTTTCTCGTATTGATCGACCCAATGGAGCAATGCTTCAATTAAGTTGTCATTAATTTGCTGAAACCGATGAAAAGCAAAAAACAGCAGATATAACTGTGCAGTCTCTTTTTTCATGCGTTGAAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCGGTAAATCCATGCTGGCCCTGCAACTGGCCGCACAGATTGCAGGCGGGCCGGATCTGCTGGAGGTGGGCGAACTGCCCACCGGCCCGGTGATCTACCTGCCCGCCGAAGACCCGCCCACCGCCATTCATCACCGCCTGCACGCCCTTGGGGCGCACCTCAGCGCCGAGGAACGGCAAGCCGTGGCTGACGGCCTGCTGATCCAGCCGCTGATCGGCAGCCTGCCCAACATCATGGCCCCGGAGTGGTTCGACGGCCTCAAGCGCGCCGCCGAGGGCCGCCGCCTGATGGTGCTGGACACGCTGCGCCGGTTCCACATCGAGGAAGAAAACGCCAGCGGCCCCATGGCCCAGGTCATCGGTCGCATGGAGGCCATCGCCGCCGATACCGGGTGCTCTATCGTGTTCCTGCACCATGCCAGCAAGGGCGCGGCCATGATGGGCGCAGGCGACCAGCAGCAGGCCAGCCGGGGCAGCTCGGTACTGGTCGATAACATCCGCTGGCAGTCCTACCTGTCGAGCATGACCAGCGCCGAGGCCGAGGAATGGGGTGTGGACGACGACCAGCGCCGGTTCTTCGTCCGCTTCGGTGTGAGCAAGGCCAACTATGGCGCACCGTTCGCTGATCGGTGGTTCAGGCGGCATGACGGCGGGGTGCTCAAGCCCGCCGTGCTGGAGAGGCAGCGCAAGAGCAAGGGGGTGCCCCGTGGTGAAGCCTAAGAACAAGCACAGCCTCAGCCACGTCCGGCACGACCCGGCGCACTGTCTGGCCCCCGGCCTGTTCCGTGCCCTCAAGCGGGGCGAGCGCAAGCGCAGCAAGCTGGACGTGACGTATGACTACGGCGACGGCAAGCGGATCGAGTTCAGCGGCCCGGAGCCGCTGGGCGCTGATGATCTGCGCATCCTGCAAGGGCTGGTGGCCATGGCTGGGCCTAATGGCCTAGTGCTTGGCCCGGAACCCAAGACCGAAGGCGGACGGCAGCTCCGGCTGTTCCTGGAACCCAAGTGGGAGGCCGTCACCGCTGATGCCATGGTGGTCAAAGGTAGCTATCGGGCGCTGGCAAAGGAAATCGGGGCAGAGGTCGATAGTGGTGGGGCGCTCAAGCACATACAGGACTGCATCGAGCGCCTTTGGAAGGTATCCATCATCGCCCAGAATGGCCGCAAGCGGCAGGGGTTTCGGCTGCTGTCGGAGTACGCCAGCGACGAGGCGGACGGGCGCCTGTACGTGGCCCTGAACCCCTTGATCGCGCAGGCCGTCATGGGTGGCGGCCAGCATGTGCGCATCAGCATGGACGAGGTGCGGGCGCTGGACAGCGAAACCGCCCGCCTGCTGCACCAGCGGCTGTGTGGCTGGATCGACCCCGGCAAAACCGGCAAGGCTTCCATAGATACCTTGTGCGGCTATGTCTGGCCGTCAGAGGCCAGTGGTTCGACCATGCGCAAGCGCCGCCAGCGGGTGCGCGAGGCGTTGCCGGAGCTGGTCGCGCTGGGCTGGACGGTAACCGAGTTCGCGGCGGGCAAGTACGACATCACCCGGCCCAAGGCGGCAGGCTGACCCCCCCCACTCTATTGTAAACAAGACATTTTTATCTTTTATATTCAATGGCTTATTTTCCTGCTAATTGGTAATACCATGAAAAATACCATGCTCAGAAAAGGCTTAACAATATTTTGAAAAATTGCCTACTGAGCGCTGCCGCACAGCTCCATAGGCCGCTTTCCTGGCTTTGCTTCCAGATGTATGCTATTCTGCTCCTGCAGCTAATGGATCACCGCAAACAGGTTACTCGCCTGGGGATTCCCTTTCGACCCGAGCATCCGTATGAGACTCATGCTCGATTATTATTATTATAGAAGCCCCCATGAATAAATCGCTCATCATTTTCGGCATCGTCAACATAACCTCGGACAGTTTCTCCGATGGAGGCCGGTATCTGGCGCCAGACGCAGCCATTGCGCAGGCGCGTAAGCTGATGGCCGAGGGGGCAGATGTGATCGACCTCGGTCCGGCATCCAGCAACCCCGACGCCGCGCCTGTTTCGTCCGACACAGAAATCGAGCGTATCGCGCCGGTGCTGGACGCGCTCAAGGCAGATGGCATTCCCGTCTCGCTCGACAGTTATCAACCCGCGACGCAAGCCTATGCCTTGTCGCGTGGTGTGGCCTATCTCAATGATATTCGCGGTTTTCCAGACGCTGCGTTCTATCCGCAATTGGCGAAATCATCTGCCAAACTCGTCGTTATGCATTCGGTGCAAGACGGGCAGGCAGATCGGCGCGAGGCACCCGCTGGCGACATCATGGATCACATTGCGGCGTTCTTTGACGCGCGCATCGCGGCGCTGACGGGTGCCGGTATCAAACGCAACCGCCTTGTCCTTGATCCCGGCATGGGGTTTTTTCTGGGGGCTGCTCCCGAAACCTCGCTCTCGGTGCTGGCGCGGTTCGATGAATTGCGGCTGCGCTTCGATTTGCCGGTGCTTCTGTCTGTTTCGCGCAAATCCTTTCTGCGCGCGCTCACAGGCCGTGGTCCGGGGGATGTCGGGGCCGCGACACTCGCTGCAGAGCTTGCCGCCGCCGCAGGTGGAGCTGACTTCATCCGCACACACGAGCCGCGCCCCTTGCGCGACGGGCTGGCGGTATTGGCGGCGCTAAAAGAAACCGCAAGAATTCGTTAACTGCACATTCGGGATATTTCTCTATATTCGCGCTTCATCAGAAAACTGAAGGAACCTCCATTGAATCGAACTAATATTTTTTTTGGTGAATCGCATTCTGACTGGTTGCCTGTCAGAGGCGGAGAATCTGGTGATTTTGTTTTTCGACGTGGTGACGGGCATGCCTTCGCGAAAATCGCACCTGCTTCCCGCCGCGGTGAGCTCGCTGGAGAGCGTGACCGCCTCATTTGGCTCAAAGGTCGAGGTGTGGCTTGCCCCGAGGTCATCAACTGGCAGGAGGAACAGGAGGGTGCATGCTTGGTGATAACGGCAATTCCGGGAGTACCGGCGGCTGATCTGTCTGGAGCGGATTTGCTCAAAGCGTGGCCGTCAATGGGGCAGCAACTTGGCGCTGTTCACAGCCTATCGGTTGATCAATGTCCGTTTGAGCGCAGGCTGTCGCGAATGTTCGGACGCGCCGTTGATGTGGTGTCCCGCAATGCCGTCAATCCCGACTTCTTACCGGACGAGGACAAGAGTACGCCGCAGCTCGATCTTTTGGCTCGTGTCGAACGAGAGCTACCGGTGCGGCTCGACCAAGAGCGCACCGATATGGTTGTTTGCCATGGTGATCCCTGCATGCCGAACTTCATGGTGGACCCTAAAACTCTTCAATGCACGGGTCTGATCGACCTTGGGCGGCTCGGAACAGCAGATCGCTATGCCGATTTGGCACTCATGATTGCTAACGCCGAAGAGAACTGGGCAGCGCCAGATGAAGCAGAGCGCGCCTTCGCTGTCCTATTCAATGTATTGGGGATCGAAGCCCCCGACCGCGAACGCCTTGCCTTCTATCTGCGATTGGACCCTCTGACTTGGGGTTGATGTTCATGCCGCCTGTTTTTCCTGCTCATTGGCACGTTTCGCAACCTGTTCTCATTGCGGACACCTTTTCCAGCCTCGTTTGGAAAGTTTCATTGCCAGACGGGACTCCTGCAATCGTCAAGGGATTGAAACCTATAGAAGACATTGCTGATGAACTGCGCGGGGCCGACTATCTGGTATGGCGCAATGGGAGGGGAGCAGTCCGGTTGCTCGGTCGTGAGAACAATCTGATGTTGCTCGAATATGCCGGGGAGCGAATGCTCTCTCACATCGTTGCCGAGCACGGCGACTACCAGGCGACCGAAATTGCAGCGGAACTAATGGCGAAGCTGTATGCCGCATCTGAGGAACCCCTGCCTTCTGCCCTTCTCCCGATCCGGGATCGCTTTGCAGCTTTGTTTCAGCGGGCGCGCGATGATCAAAACGCAGGTTGTCAAACTGACTACGTCCACGCGGCGATTATAGCCGATCAAATGATGAGCAATGCCTCGGAACTGCGTGGGCTACATGGCGATCTGCATCATGAAAACATCATGTTCTCCAGTCGCGGCTGGCTGGTGATAGATCCCGTCGGTCTGGTCGGTGAAGTGGGCTTTGGCGCCGCCAATATGTTCTACGATCCGGCTGACAGAGACGACCTTTGTCTCGATCCTAGACGCATTGCACAGATGGCGGACGCATTCTCTCGTGCGCTGGACGTCGATCCGCGTCGCCTGCTCGACCAGGCGTACGCTTATGGGTGCCTTTCCGCAGCTTGGAACGCGGATGGAGAAGAGGAGCAACGCGATCTAGCTATCGCGGCCGCGATCAAGCAGGTGCGACAGACGTCATACTAGATATCAAGGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACGGGCGCATTTCGCCCGGGGATCACCATAATAAAATGCTGAGGCCGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCTACCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCTAACAGCGGATGTTCGCGATCACTGGACAAACTTCTGAGTGAAATGCTCGCCGGCAATATCAGTCGTTTCATCTGGCTTCGCAACTTCGAGGTTGGTAACAACTCGGCTGCTGCTAACCGTTTGCTCGACAGGCTCGAATTTCTGCGTACCCTGAATATCAATCATAGTGCTTTACCAGCATACCTGCCCATCGCATTGCCCGGCTGCGTCGGCAGGGTGAACGCTACTTCACCGACGGTTTGCGTGACATCACTTCGGACCGCCGCTGGGCGATCCTTGCCGTCTGTGTTGTGGAGTGGGAAGCGGCGATTGCTGATGCCATAGTCGAAACCCATGACAGGATCGTAGGAAAAACGGCGGGAAGCGAAGCGCCAGCATGACGAAACAATTTCCGGCTCTAAAGCCACACTCACGGATACGATCCGTACCTTCACCGCGCTGGGAGCTTCGTTGCTTGAGGCCCGCAGTGACGGAACCCCGCTGGAGATGGCTGTCGCCAGTTCGGTTGCATGGGACCGGCTCGCTCAACTGGTAGCGACAGGGACTCAACTCAGCAACACGCTAGCCGATGAGCCTCTTGCATATGTCGGGCAGGGATACCATCGCTTTCGTCGTTATGCGCCCCGCATGTTGCGCTGTCTGAAGCTCGAAGCCGCGCCGGTCGCCGGACCATTGGTAGCAGCAGCTTTGTCGATCGGAGAGATGAAAGGTGTTGCATCGCCAGAAAGGCGTTTCCTGCGGCCCAGCTCCAAATGGAACCGTCATTTACGAGCTCAGGAAAAAGGAGATACCCGTCTTTGGGAAGTGGCGGTACTCTTTCACCTCCGGGATGCTTTTCGTTCCGGAGATGTCTGGCTCGCTCATTCGCGCCGCTATGGTGACCTCAAGCAGGTACTGGTGCCGATGATCGCGGCGCAGGAAAATGCAAAACTGGCCGTGCCTTCCAACCCACAGGATTGGCTGGCAGACAGAAAGGCGCGACTCACGATCGCTCTTAAGCGGCTGGCCCGGGCTGCCCGTAACGGCACTATTCCGCACGGTAGCATAGAAGATGGAACGTTGCGGATCGACAGGTTGACAGCAGACGTGCCGGATGGTGCCGAGGCACTCATACTGGATCTGTATCGCCGAATGCCGTCCGTTCGGATTACCGACATGCTGCTTGAAGTTGATGCAGCCCTTGGTTTCACAGATGCGTTTACCCATCTGAGAACCGGGGCTCCATGTCGCGACCGGATCGGTCTGCTCAACGTCCTGCTCGCTGAAGGGCTCAATCTGGGCCTGCGTAAGATGGCGGAAGCTACAAACACGCATGATTACTGGCAGCTCTCACGCCTTGCCCGCTGGCATGTTGAAAGCGAAGCCATGAACGTGCATTGGCAATTGTGGTACTGCGCAGGGTAAACTGCCGATGTCACGCGTCTGGGGATGGGCACGTCAGCATCGAGCGATGGTCAGTTTTTCCCGACAGCGCGGCATGGCGAAGCCATGAACATGGTCAATGCCAAATATGGTTCTGTTCCCGGCCTCAAAGCGTATACTCACGTAAGCGACCAGTTCGCGCCATTCGCTTGTCAGTCGATCCCGGCGACCGTGAGCGAGGCACCGTATATTCTCGATGGACTACTGATGAACGAGGTCGGTCGCCATGTTCGCGAACAGTATGCCGATACAGCAGGATTCACCGACCATTTGTTCGGAGCCAGTAGCCTGCTCGGCTACAATCTCGTTCTGCGAATCAGGATCTGCCATCGAAGCGGTTGTACGTATTTAATCCCGATACGACCCCCAGGGAGTTACGCAAGTTGGTAGGTGGAAAAGCCGGGAGGATCTTATCGTTGCGAACTGGCCTGATATTTTCCGTTGTGCCGCGACGATGACCGCTGGCAAAATCAGGCCCAGCCAACTCCTGCGCAAGCTCGCTTCTTACCCACGACAAAACAACCTTGCAGTTGCGCTTCGTGAAGTTGGTCGTATTGAACGGACCCTTTTCATTATTGAGTGGATCCCCGGATACGGACATGCAGCGGCGTGCTCAGATCGGTCTTAACAAGGGAGAGGCCCACCATGCGCTCAAAAATGCGCTCCGTATCGGGAGGCAGGGGGAAATTCGCGATCGCACGACAGAGGGGCAGCACTACCGAATCGCTGGGCTCAATTTATTGACTGCGGTGATCATTTACTGGAATACCGTCCATCTTGGTCATGCCGTCACGGAGCGGCGGAACGAAGGGTTGGATGTTCCCCTGAATTTCTTCCCCCACATATCCCCATTGGGCTGGGCGCACATTCTACTGACTGGCGAATATCTTTGGCCCAAGGAACCGAAAGCTTAG
Protein sequences of DBSCAN-SWA_1 >NZ_CP048345|1262:51907|40876_41437_-|WP_000147567.1|DBSCAN-SWA MTGQRIGYIRVSTFDQNPERQLEGVKVDRAFSDKASGKDVKRPQLEALISFARTGDTVVVHSMDRLARNLDDLRRIVQTLTQRGVHIEFVKEHLSFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGAYRGRKKSLSSERIAELRQRVEAGEQKTKLAREFGISRETLYQYLRTDQ >NZ_CP048345|1262:51907|5859_6483_-|WP_000088605.1|DBSCAN-SWA MSRLDKSKVINSALELLNEVGIEGLTTRKLAQKLGVEQPTLYWHVKNKRALLDALAIEMLDRHHTHFCPLEGESWQDFLRNNAKSFRCALLSHRDGAKVHLGTRPTEKQYETLENQLAFLCQQGFSLENALYALSAVGHFTLGCVLEDQEHQVAKEERETPTTDSMPPLLRQAIELFDHQGAEPAFLFGLELIICGLEKQLKCESGS >NZ_CP048345|1262:51907|8609_9314_+|WP_001067855.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP048345|1262:51907|47105_47909_+|WP_001082319.1|DBSCAN-SWA MNRTNIFFGESHSDWLPVRGGESGDFVFRRGDGHAFAKIAPASRRGELAGERDRLIWLKGRGVACPEVINWQEEQEGACLVITAIPGVPAADLSGADLLKAWPSMGQQLGAVHSLSVDQCPFERRLSRMFGRAVDVVSRNAVNPDFLPDEDKSTPQLDLLARVERELPVRLDQERTDMVVCHGDPCMPNFMVDPKTLQCTGLIDLGRLGTADRYADLALMIANAEENWAAPDEAERAFAVLFNVLGIEAPDRERLAFYLRLDPLTWG >NZ_CP048345|1262:51907|26489_27074_+|WP_001137892.1|DBSCAN-SWA MPRPKLKSDDEVLEAATVVLKRCGPIEFTLSGVAKEVGLSRAALIQRFTNRDTLLVRMMERGVEQVRHYLNAIPIGAGPQGLWEFLQVLVRSMNTRNDFSVNYLISWYELQVPELRTLAIQRNRAVVEGIRKRLPPGAPAAAELLLHSVIAGATMQWAVDPDGELADHVLAQIAAILCLMFPEHDDFQLLQAHA >NZ_CP048345|1262:51907|34854_35730_-|WP_000239590.1|DBSCAN-SWA MVKKSLRQFTLMATATVTLLLGSVPLYAQTADVQQKLAELERQSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAAAAVLKKSESEPNLLNQRVEIKKSDLVNYNPIAEKHVNGTMSLAELSAAALQYSDNVAMNKLIAHVGGPASVTAFARQLGDETFRLDRTEPTLNTAIPGDPRDTTSPRAMAQTLRNLTLGKALGDSQRAQLVTWMKGNTTGAASIQAGLPASWVVGDKTGSGGYGTTNDIAVIWPKDRAPLILVTYFTQPQPKAESRRDVLASAAKIVTDGL >NZ_CP048345|1262:51907|45070_45922_+|WP_000240536.1|DBSCAN-SWA MVKPKNKHSLSHVRHDPAHCLAPGLFRALKRGERKRSKLDVTYDYGDGKRIEFSGPEPLGADDLRILQGLVAMAGPNGLVLGPEPKTEGGRQLRLFLEPKWEAVTADAMVVKGSYRALAKEIGAEVDSGGALKHIQDCIERLWKVSIIAQNGRKRQGFRLLSEYASDEADGRLYVALNPLIAQAVMGGGQHVRISMDEVRALDSETARLLHQRLCGWIDPGKTGKASIDTLCGYVWPSEASGSTMRKRRQRVREALPELVALGWTVTEFAAGKYDITRPKAAG >NZ_CP048345|1262:51907|9888_10092_-|WP_000376616.1|DBSCAN-SWA MDSEEPPNVRVACSGDIDEVVRLMHDAAAWMSAKGTPAWDVARIDRTFAETFVLRSELLGIASENGK >NZ_CP048345|1262:51907|25251_26490_+|WP_000004159.1|DBSCAN-SWA MSERRYSPLATLFAATFLFRIGNAVAALALPWFVLSHTKSAAWAGATAASSVIATIIGAWVGGGLVDRFGRAPVALISGVVGGVAMASIPLLDAVGALSNTGLIACVVLGAAFDAPGMAAQDSELPKLGHVAGLSVERVSSLKAVIGNVAILGGPALGGAAIGLLGAAPTLGLTAFCSVLAGLLGAWVLPARAARTMTTTATLSMRAGVAFLWSEPLLRPLFGIVMIFVGIVGANGSVIMPALFVDAGRQVAELGLFSSMMGAGGLLGIAIHASVGARISAQNWLAVAFCGSAVGSLLLSQLPGVPVLMLLGALVGLLTGSVSPILNAAIYNRTPPELLGRVLGTVSAVMLSASPMVMLAAGAFVDLAGPLPGLVVSAVFAGLVALLSLRLQFATMAAAATASAPTHTEGEH >NZ_CP048345|1262:51907|17887_18754_-|WP_162676759.1|transposase|DBSCAN-SWA MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELYRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVKALLQS >NZ_CP048345|1262:51907|43610_44315_+|WP_001067855.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP048345|1262:51907|13155_14169_+|WP_000845048.1|integrase|DBSCAN-SWA MKTATAPLPPLRSVKVLDQLRERIRYLHYSLRTEQAYVHWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDLLGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER >NZ_CP048345|1262:51907|21866_22058_-|WP_000951934.1|DBSCAN-SWA MAIALMGAGFSATDTSDAVNILYPITMTVQANKAWQASGLKKSFFLPSHKKVRGGEKISYARL >NZ_CP048345|1262:51907|22359_23487_+|WP_014837927.1|DBSCAN-SWA MSQLSQLRSPAAVQAAIDEFVQLGRTKFLARHGYGKSRDFLVRDPKTGTDCDSKAIAGVAFGKQFPEQGPLTADSFSGGEATVVPALTRLGFRIIRIGEDWSEEEVLATVEDYFDMLRAEAAGEPYNKSEHNQALRQLLNGRSKSSVELKHQNISAVLDALGLPYINGYKPRGNSQLLLRKSVHAYVLEHQQTVGALVDALEEVKLPGDKTYRAALVEPPAREVLVRTPASLRQRLPRKFDYAARDEANRKLGRAGEQWVIGYEQQRLTELGHPELFQRLDWVSDTQGDGAGFDILSFEEDAHERFIEVKTTNGGVGSSFLVSHNELEFSKEAGDQFHLYRVFQFRDGPRLFTLPGDLSQHVHLKPTGTVANSRW >NZ_CP048345|1262:51907|50999_51383_+|WP_162676763.1|transposase|DBSCAN-SWA MGTSASSDGQFFPTARHGEAMNMVNAKYGSVPGLKAYTHVSDQFAPFACQSIPATVSEAPYILDGLLMNEVGRHVREQYADTAGFTDHLFGASSLLGYNLVLRIRICHRSGCTYLIPIRPPGSYASW >NZ_CP048345|1262:51907|20842_21385_+|WP_000587837.1|DBSCAN-SWA MIIWINGPFGAGKTTLAKRLRDRRSKSLIFDPEEIGFVVKETVPMPASGDYQDLPLWRGLTIAAVREIRRNYSQDIIIPMTLVHPDYLTEILDGVRRIDDQLLHIFLTLNEDLLRHRIANQTMHPDPNRNAEIREWRLANVARCLAARERLPCTTRVLDSGAHTSDELAAMVLDGIDGRT >NZ_CP048345|1262:51907|19969_20830_+|WP_000557454.1|DBSCAN-SWA MHTRKAITEALQKLGVQTGDLLMVHASLKAIGPVEGGAETVVAALRSAVGPTGTVMGYASWDRSPYEETLNGARLDDEARRTWLPFDPATAGTYRGFGLLNQFLVQAPGARRSAHPDASMVAVGPLAETLTEPHELGHALGEGSPVERFVRLGGKALLLGAPLNSVTALHYAEAVADIPNKRWVTYEMPMLGRDGEVAWKTASDYDSNGILDCFAIEGKPDAVETIANAYVKLGRHREGVVGFAQCYLFDAQDIVTFGVTYLEKHFGTTPIVPPHEAVERSCEPSG >NZ_CP048345|1262:51907|2123_2249_-|WP_032156742.1|DBSCAN-SWA MPWHYPISPKNRKTALSPRHDWLKTGSLLQNVPLRIWFYPQ >NZ_CP048345|1262:51907|42009_42651_+|WP_000134999.1|DBSCAN-SWA MTEIIKDLYQFTEVMEPIKLSMHQYLLMTNEPVLIQTGAVSQAQTTIPKLQELLGERKIKYILISHFESDECGGLALVLKEHPEAVAVCSETTARQLMGFGITNNVLIKKPNEIFAGDDFEFQTISYPSEMHMWEGLLFFEKKRGIFFSSDLMFGMGENHGQVIESSWDAAVKSSGADTLPNQESGQKLSSDLSEIEPKFVASGHGFCITIVG >NZ_CP048345|1262:51907|51592_51907_+|WP_162676764.1|transposase|DBSCAN-SWA MQRRAQIGLNKGEAHHALKNALRIGRQGEIRDRTTEGQHYRIAGLNLLTAVIIYWNTVHLGHAVTERRNEGLDVPLNFFPHISPLGWAHILLTGEYLWPKEPKA >NZ_CP048345|1262:51907|24349_25255_+|WP_000219391.1|DBSCAN-SWA MTVVTTADTSQLYALAARHGLKLHGPLTVNELGLDYRIVIATVDDGRRWVLRIPRRAEVSAKVEPEARVLAMLKNRLPFAVPDWRVANAELVAYPMLEDSTAMVIQPGSSTPDWVVPQDSEVFAESFATALAALHAVPISAAVDAGMLIRTPTQARQKVADDVDRVRREFVVNDKRLHRWQRWLDDDSSWPDFSVVVHGDLYVGHVLIDNTERVSGMIDWSEARVDDPAIDMAAHLMVFGEEGLAKLLLTYEAAGGRVWPRLAHHIAERLAFGAVTYALFALDSGNEEYLAAAKAQLAAAE >NZ_CP048345|1262:51907|49732_49984_-|WP_162676761.1|DBSCAN-SWA MSVALEPEIVSSCWRFASRRFSYDPVMGFDYGISNRRFPLHNTDGKDRPAAVRSDVTQTVGEVAFTLPTQPGNAMGRYAGKAL >NZ_CP048345|1262:51907|23523_24228_+|WP_001067855.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP048345|1262:51907|34475_34808_+|WP_001393253.1|DBSCAN-SWA MRPHRHPHTFELLLPLRGRFVVLNFDDRGTVTHRAILGETCTVLEMAAGTWHAVLSLDTGGIIFEVKHGGYQPVAADDYAHWAPAEGEPGTTELMAWYAQAQVGDSTFAV >NZ_CP048345|1262:51907|37395_38256_+|WP_000027057.1|DBSCAN-SWA MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW >NZ_CP048345|1262:51907|22081_22309_+|WP_000248278.1|DBSCAN-SWA MVNSYSLSDDSGVMAAAAITHFLFGQAVFSYLNGWSVLIGPGTGLDSTGCKYARDLMGLVAFTAFIVTFLFRGYS >NZ_CP048345|1262:51907|11605_12394_-|WP_000503573.1|DBSCAN-SWA MGEFFPAQVFKQLSHARAVIERHLAATLDTIHLFGSAIDGGLKPDSDIDLLVTVSAAPNDSLRQALMLDLLKVSSPPGDGGTWRPLELTVVARSEVVPWRYPARRELQFGEWLRHDILSGTFEPAVLDHDLAILLTKARQHSLALLGPSAATFFEPVPKEHFSKALFDTIAQWNAESDWKGDERNVVLALARIWYSASTGLIAPKDVAAAWVSERLPAEHRPLICKARAAYLGSEDDDLAMRVEETAAFVRYAKATIERILR >NZ_CP048345|1262:51907|1262_2003_-|WP_001066942.1|integrase|DBSCAN-SWA MNNVIPLQNSPERVSLLPIAPGVDFATALSLRRMATSTGATPAYLLAPEVSALLFYMPDQRHHMLFATLWNTGMRIGEARMLTPESFDLDGVRPFVRILSEKVRARRGRPPKDEVRLVPLTDISYVRQMESWMITTRPRRREPLWAVTDETMRNWLKQAVRRAEADGVHFSIPVTPHTFRHSYIMHMLYHRQPRKVIQALAGHRDPRSMEVYTRVFALDMAATLAVPFTGDGRDAAEILRTLPPLR >NZ_CP048345|1262:51907|10219_11059_-|WP_000259031.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >NZ_CP048345|1262:51907|4813_5008_-|WP_086258557.1|DBSCAN-SWA MAASPGARTLWFELVTKIDIDIHKKNRYASYDLRRKIEMQHEEFQKMTEREQVPLLKCISSDLI >NZ_CP048345|1262:51907|27566_28331_-|WP_001389365.1|transposase|DBSCAN-SWA MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >NZ_CP048345|1262:51907|12524_12998_-|WP_001389366.1|DBSCAN-SWA MKISLISAVSENGVIGSGPDIPWSVKGEQLLFKALTYNQWLLVGRKTFDSMGVLPNRKYAVVSKNGISSSNENVLVFPSIENALKELSKVTDHVYVSGGGQIYNSLIEKADIIHLSTVHVEVEGDIKFPIMPENFNLVFEQFFMSNINYTYQIWKKG >NZ_CP048345|1262:51907|46229_47045_+|WP_001043265.1|DBSCAN-SWA MNKSLIIFGIVNITSDSFSDGGRYLAPDAAIAQARKLMAEGADVIDLGPASSNPDAAPVSSDTEIERIAPVLDALKADGIPVSLDSYQPATQAYALSRGVAYLNDIRGFPDAAFYPQLAKSSAKLVVMHSVQDGQADRREAPAGDIMDHIAAFFDARIAALTGAGIKRNRLVLDPGMGFFLGAAPETSLSVLARFDELRLRFDLPVLLSVSRKSFLRALTGRGPGDVGAATLAAELAAAAGGADFIRTHEPRPLRDGLAVLAALKETARIR >NZ_CP048345|1262:51907|47908_48745_+|WP_000480968.1|DBSCAN-SWA MFMPPVFPAHWHVSQPVLIADTFSSLVWKVSLPDGTPAIVKGLKPIEDIADELRGADYLVWRNGRGAVRLLGRENNLMLLEYAGERMLSHIVAEHGDYQATEIAAELMAKLYAASEEPLPSALLPIRDRFAALFQRARDDQNAGCQTDYVHAAIIADQMMSNASELRGLHGDLHHENIMFSSRGWLVIDPVGLVGEVGFGAANMFYDPADRDDLCLDPRRIAQMADAFSRALDVDPRRLLDQAYAYGCLSAAWNADGEEEQRDLAIAAAIKQVRQTSY >NZ_CP048345|1262:51907|50057_50975_+|WP_162676762.1|transposase|DBSCAN-SWA MAVASSVAWDRLAQLVATGTQLSNTLADEPLAYVGQGYHRFRRYAPRMLRCLKLEAAPVAGPLVAAALSIGEMKGVASPERRFLRPSSKWNRHLRAQEKGDTRLWEVAVLFHLRDAFRSGDVWLAHSRRYGDLKQVLVPMIAAQENAKLAVPSNPQDWLADRKARLTIALKRLARAARNGTIPHGSIEDGTLRIDRLTADVPDGAEALILDLYRRMPSVRITDMLLEVDAALGFTDAFTHLRTGAPCRDRIGLLNVLLAEGLNLGLRKMAEATNTHDYWQLSRLARWHVESEAMNVHWQLWYCAG >NZ_CP048345|1262:51907|51405_51621_+|WP_162676768.1|transposase|DBSCAN-SWA MVANWPDIFRCAATMTAGKIRPSQLLRKLASYPRQNNLAVALREVGRIERTLFIIEWIPGYGHAAACSDRS >NZ_CP048345|1262:51907|49589_49736_-|WP_162676760.1|DBSCAN-SWA MIDIQGTQKFEPVEQTVSSSRVVTNLEVAKPDETTDIAGEHFTQKFVQ >NZ_CP048345|1262:51907|11052_11400_-|WP_000679427.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >NZ_CP048345|1262:51907|7882_8476_-|WP_000428546.1|DBSCAN-SWA MENKNHQQENFKSTYQSLVNSARILFVEKGYQAVSIDEISGKALVTKGAFYHHFKNKKQLLSACYKQQLIMIDAYITTKTDLTNGWSALESIFEHYLDYIIDNNKNLIPIQEVMPIIGWNELEKISLEYITGKVNAIVSKLIQENQLKAYDDDVLKNLLNGWFMHIAIHAKNLKELADKKGQFIAIYRGFLLSLKDK >NZ_CP048345|1262:51907|6564_7770_+|WP_001089068.1|DBSCAN-SWA MNSSTKIALVITLLDAMGIGLIMPVLPTLLREFIASEDIANHFGVLLALYALMQVIFAPWLGKMSDRFGRRPVLLLSLIGASLDYLLLAFSSALWMLYLGRLLSGITGATGAVAASVIADTTSASQRVKWFGWLGASFGLGLIAGPIIGGFAGEISPHSPFFIAALLNIVAFLVVMFWFRETKNTRDNTDTEVGVETQSNSVYITLFKTMPILLIIYFSAQLIGQIPATVWVLFTENRFGWNSMMVGFSLAGLGLLHSVFQAFVAGRIATKWGEKTAVLLGFIADSSAFAFLAFISEGWLVFPVLILLAGGGIALPALQGVMSIQTKSHQQGALQGLLVSLTNATGVIGPLLFAVIYNHSLPIWDGWIWIIGLAFYCIIILLSMTFMLTPQAQGSKQETSA >NZ_CP048345|1262:51907|3448_4618_+|WP_001072355.1|DBSCAN-SWA MNQKWKTLIISLLTPSSIISGIALIVLWGYFSRLGRLDIFFDVMNIKSILVLVCCATILSLAMIVFIFFITSFFIPVVIPQDINNLPAYNKIQGNFLSVMMLSGMFPMAFIYVLYCAFDFDQNVKDNSGWLSIASMGVLIAVILAIVNKRYLEYDLSFKNNRMKLLRRVQIYLVIPLSIALLVHLQLIPLEIVFSNIATSDKSVNFKVIAELAFMSYFIFLLTMLPGVIYLKMNPQHKFSKRISYSFIASLMLLLIISTQITVLPVIFTHSVIKLSGISDFKIHSYIIKTSEYPEEFFSNAAWDKKNIKPGDYYSVQAVSMFTTNQFILLCPKDIIKFYRESWKFELLNVDFDINTRKKLQEEAAYCVPISAISVKRWDMPLQGSKPSS |
41 | Salmonella_phage(35.29%) | integrase,transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
88549 : 95979
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP048345|88549:95979|DBSCAN-SWA GCTATATTTTTCCGCCTACGCCTTTAAACTTCTCAATAAACGAGACGATTTTCTGGAAAACTGCCTGTTTTTTCGTTTTATATTGCGGGTTTAACGGACTAAGTTTTGGTAATGTCTCGTTTAATTCTGTGCCATTTTCGGTGGCGTATTCGCGTTTTAAAGACGTGCGAATATAGCGTTTCGCCGCCTCTTCATTGAGATTTTCTTCTTTAATCAACGCTTCTGCTTCACGTAGCTGTTCGTGTTGAGCAAAGGTAAAGAACGCGTCAATGATGCTGGCTTTGTCTGGTAAATCATCCAGGTTCGTTTGCTGAATAAAATCGACCACCAGGCCCTCTTTCGCCCGGTTCCCCAGGCTTGAACGAATTAAGCGTTTGACCTCTTCGATCATTTCGCCCTTGCCTTTATTTTGTCTGTTATGTTCGAAAATCAGTCCAAGGATATAATCCAGGTTTATTTCCTGAGACTTCAGCAAATCGACCTCAAAAACCACGTCATCCCAGTCAGTGGTTGATTTCTCTTTTTTCTCAGCTTCTTTCTCACGGCGCTGCCAGTCGCGAATATCGTTATAGGCAGAACGATAATCCTGAATCTTGCGATCAGCAGGGAGACGAATTGTTTGCAATTCAGCGAGCTTTTCATCATCCACATAATGTTCTGCTTTGAATTTTTCTACTGCAACAGGATCGCTAAGATCGATTTGTTGCAGGGCTTTCAGCGTGGCAAATTCATCATAGTTTTGCAGGATGTTCTCGGTACGCAGGTATTCACCAAACAGTTTTACGAAGTCTTTCTTCTCTTTTTCACTTTCAATACTGGCAGGGTCAGGGAACCGTTGTTCCAGTTCTGCAACTACTGCCATAAAGCCGCGCTTAGCTTCACCAGTAGCAGCATCAGTAAAGCCTTCCATATACTCTGCATAACTCTTTTCTAACACTACATTTTTGGTGTTTTTGTCACCAAACAGCGTTATGGCATCAATAGTTGAGCGTTCCAGATCCCGGAAAGTGACGATATTACCGAAGGTTTTAGTGGCGTCATAAATGCGGTTGGTGCGGGAAAATGCCTGCATCAGGCCGTGAAAACGTAAGTTTTTATCGACGAATAGCGTGTTCAATGTTGGAGCATCGAAGCCGGTTAAGAACATCCCCACGACAATTAGCAGATCGATATCCTGATTTTTAACCCGCTGGGCTAAATCACGATAGTAGTTCTGAAAACCGTTACTGTCGGTGCTAAAGTTAGTTTTAAAATGGCTGTTATATTCACGAATTGCAGCGTCCAGAAACTCTTTAGCACTGCTGTCCATTGCGCTGGTATCAAAAGTTTCATCGGAAATCTCACCAATGGCATTTTGTTCTTCATTGGCGGCAAAGGAGAAGATTGTCGCAATACGCAGCGGTTTATAGGTAGCCGATTTATTAGCGGCTTCTTCTTGTAACCGTTTAAACGTCGCATAATAGGCTTTCGCGGCATCCACGCTGCTCACTGCCAACATAGCATTAAAACCTTTTGAGCCAGGGAAGGTACGGTGAGTTTTCTGGCGGAAATTATTCAGAATATATTGCGTAATTTCCTGTATACGCATGGGATGAAGAAACGCCTGCTGATTTTCAGCCGCACTCAGTTTTTTCTCGTCGGTTTCTGTCTCTAAAGACTTAAACTGTGGCCGCACATCGTTGTAGTCCACCTTGAATTTAAGCACTTTTTCATCTCGAATCGCATCGGTAATAACATATGAATGCAATTCACGACCAAATACGCTGGCGGTTGTTTCTGAGCCTAAGGCGTTTTCCGGGAAAATAGGGGTACCGGTAAAACCAAACTGATAATAGCGTTTGAATTTCTTCTTCAGGTTTTTCTGTGCTTCTCCAAACTGGCTGCGGTGGCATTCATCAAATATAAACACCACTTGCTGATTGTATACAGGCAGGTCGCTTTCTGCTTTCATCAGGTTATTGAGTTTCTGAATAGTGGTGACGATAATTTTGTTATCATCCTTATCCAGATTTCGTTTAAGACCTGCGGTATTTTCCGATCCATTGACACTGTCTGGCGAAAAACGCTGATATTCCTTCATGGTCTGGTAATCGAGGTCTTTCCTGTCGACCACAAAGAAGACTTTATCAATAAAGTCCAGTTCTGTTGCCAGACGCGCGGCTTTAAAGCTGGTCAGGGTTTTACCAGAACCGGTAGTGTGCCAGATAAAGCCACCACTTTCGGGGGTAGACCAGTTTTTCGCTTTATAGGAGCTGTTGATTTTCCATAAGATACGCTCAGTGGCGGCAATCTGGTACGGTCGCATCACCAGTAGCGTCTGGCTACTGTCAAAAACGCTGTAGTTCACCAGAACATTCAGCAGAGTATGTTTCTGGAAAAAGGTAGCGGTAAAGTCTTTGAGGTCTTTAATCAGCGTGTTGTCTGATTTTGCCCAATTCATGGTGAAGTCAAAACTGTTTTTATCGCGCTTTGTCGTGTTGGCAAAATAATGGGTATCGGTGCCGTTAGAAATGACAAACAGTTGCAGATACTTAAACAGGGAATTTTCGCTGTTAAAACTCTCTTTACTGTAACGATGTATCTGGTTGAAAGCCTCACGAATCGCCACCCCGCGTTTTTTTAGTTCGATTTGCACCAGCGGTAAACCATTAACCAGGATCGTGACGTCATAACGGTTAGCATGAGAACCCGTCTGTTCAAACTGCTGGATAATCTGCACCTTATTGCGCATGAGATTCTTTTTATCTATCAAATAGATGTTCTCAAGACGCTCGTCATCAAAAATAAAGTCGCAAATATAGTCGATATGGATTTTACGGGTCTTATCCAGAATGCCATCGCTCGGGTTATCCAGATACTGCTCCGTGAAACGCCGCCACTCGCTGTCATTAAACACCACACCATTGAGGTTCTGAAGCTGTTCCCGAACATTGGCCAGCATCGCCGACTGTGATTTTACGGATATAAATTCATAGCCCTGATTCCGCAGGTCCTGAATCAGTTCTCGTTCCAGGTCCGATTCGCTCTGGTAGCTGTCGCCTGTTGGCTCAGCTTTGATGTACTTATCAAGGACGATAAAGTTATTGGATTCAGCAATGGTGTGTGTCTGATGAGTCATAGCGCATCCTTTGTGCCGTCTGGCAAGGGCCGGAAGGGAGTTAAGGGTGACTTCCGGCACGTAAAAAATAGTCTATATACAGACCGGATGTTAAGGTGGCCCGGTCGGTAGCAACGGTCAATTAATTACTGACAGTTTCAGGTTTTGGGAAACTGAACAGTAAATCACGGTAGTATTCGTATTGTTTCTGGCGCAACTCGATTTCACGCGGAAGACCTTCGGTGATGGAGTTAGTCAGTGTGTCGAATTTGTCGAGTATTTCGACAATGCGAGCTTGTTCCTTAAGTGATTTTTCGTGATCTTTAGGATATGGAACCGGAATCATAATTTTTGAAAAACCATTAATGAGTAGTGTATTAACTTTTGTTCTGGCTACATACTTTGCTTTTTCAGAAATAAACGAATCGGTTTGCATGTAATAGGAAATAAATTTTGGATTCAAAGAATGTCGAAAAGCATAACAGTGATCATGAATAGCGATATCGTCATCCCCAAGCCATGCCACTGCTTTACCAACGTCTTCTACAGTCTCCCCCACGTCAGTTATCACGACATCTCCATGTTTGGCATAGCGTAATGACGCTGCCATATCAGCTCTAACCTGTGATAACGAATGAGTTGTGTAAACACCATATCGTGTATATATCTCACCATAATGGATTACACTGATACCACCATCTTCTACATAATCTGCTTTAGTAAAACGTTTTCCACGAATAAACTCACCAATTTCCCCCAAAGCTTTCCACTCAACCTCACCTTCTTTAAAAGTCAGCAATTGGTCGCGGTAGTAGTTGTACTGTTTTTTACGCATGTTAAGCTCAGCGGTAAGCTCAGCGGTAAGCTCAGCGGTAAGTGCAGTAAACTTATCCAGAATCCGAACGATTTCAGACTGGATGGCAAGGGACTTTTCCGGATTATCCGGGCAGGGGATGGGGATCTTAATATTTTTGACAATCTGCGCATTTATGTTTGTCTGGGACCCTGTTCCAAGGGATTTGATATATGTGTATTGGCTACACAAGAAGTGAAATACATATCTATAATGAGCAACTTCTTCATTAAGTTGAATATTTGCGCACGCTTGATTTGTTGTCATTGGAATTTTGTTTATGCCGATTTTCCCCACAGTTGCCCCATACATAGCAACAATGACACAATTCTTTGGTATCCATTTTGCACTAGAGTTTTTAACTCCAGACTCAGTTATTTTTACCTCGGTATCCCATATATCACAAAAGTTTACTTCTTGAGTTCTCAACCAAGGAATGTCGCCATCATAAAATTCTGATACGCCAGTTTTAGGGGTTCCTCCAGATGATATCTTTATAGAAATATCCTCAAGGGTTTTCCACTCAACCTCAACCCCATCCAGCAATTTTTCCAGATAACTCAACTCGCTCATTTCTGCACCTCGCAGCCTTCAATTTCAGCCACAATCGCATCAATATCTTTACGCAACTGGTCGATTTTGCTGACCGTAGTTTTAAGCTCTGCATTTAGCTCAGCAATATTGATAATTTCGCGGTTATCTTTCGCTTCTACATAGCAGCTCACCGACAGGTTATAGTCATTAGCGACAACGGTCTCAAACGCGACAGATTTCGCCAGATGAGCAACATCTTCCTTGCTGGCAAATACCTGCATAATCTGTTCGATATGGGCATCGGTCAGGATATTGTTGTTGGTCTCTTTTTTGAATAGTTCGCTGGCATCAATAAACTGAACTTTGGTATCCGTTTTATGTTTAGACAACACCAGAATATTGACGGCAATGGTGGTGCCAAAGAACAGGTTCGGTGCGAGTGAAATCACGGTTTCGACATAGTTATTGTCAACCAGATACTGACGGATTTTCTGCTCCGCGCCGCCACGGTAAAAAATGCCCGGGAAGCAGACAATCGCAGCACGACCTTTGGCAGAAAGATAGTTCAGCGCATGTAATACAAACGCAAAGTCAGCTTTGGATTTGGGGGCCAGAACGCCAGCCGGGGCAAAACGTTCATCGTTAATCAGCGTCGGGTCATCGCTGCCAATCCATTTCACCGAATACGGCGGGTTAGAAACGATGGCATCAAACGGTTTTTCATCTCTGAAGTGCGGTTCAGTCAGCGTATTGCCCAGCTTGATATCAAACTTGTCGTAGTTGATGTTGTGCAAAAACATGTTCATACGCGCCAGGTTATAGGTCGTATGGTTGATTTCCTGACCAAAAAAACCTTCTTCGATGATATGGTTATCAAACTGTTTTTTCGCCTGCAACAACAGTGAGCCGGAACCCGCTGCCGGGTCGTAGATTTTGTTAACGCTGGTCTGCCCGTGCATAGCCAGTTGTGCAATCAGCCTGGAGACGTGCTGCGGTGTAAAGAACTCACCGCCTGACTTACCGGCATTTGCCGCATAGTTAGAAATCAGGAACTCATAGGCATCACCGAACAGGTCAATCTGATGTTCGTTGAAGTCACCAAGTTTTAACCCTTCAACCCCTTTCAGAACCGCAGCCAGGCGGGCATTTTTATCTTTAACGGTGTTACCCAGGCGGTTACTGGTGGTATCGAAATCAGCAAACAAACCTTTGATGTCAGCTTCTGAAGGATAACCGTAAGCAGAACTTTCGATAGCAACGAAGATGCTGTTTAAATCTGCATTCAATCTGTCATTGGTATTTGCTTTCGCAGCTACGTTGCAGAAAAGCTGGCTTGGGTAGATGAAGTAGCCTTTGGTTTTGATGGCATCGTCTTTAATGTCATCAGTAATTACGCTGTCATCCAGTTTCGCATAACAGATACTGTCATCACCGGCTTCAATATAACTGGAAAAATTTTCGCTGATAAAACGGTAGAAAAGTGCGCCCAGAACGTATTGCTTAAAATCCCATCCATCGACCGAACCCCTGACATCGTTAGCAATTTGCCAGATTTGACGATGAAGCTCTGCACGTTGTTGAATACTTGTCATTTTCATCCACTTATTTCAGGCTTATGTAATTGGCGGTGATTCTACAGCAACTTGGATGCTTTAGCAGTTCGGACATTAGGCTACGAATGACCTGCCTAGAGGTTTGTTAAGCCGCAAAGTGCTGGTGCTTTATGCCTGTGAAGTTTATAATTGTGTACACATAACGAGTACACGAGGTGTTTATGCAATCCATTAACTTCCGTACCGCGCGCGGCAACCTTTCTGAAGTGCTCAACAATGTTGAGGCCGGGGAAGAGGTTGAAATCACCCGCAGAGGCCGTGAGCCAGCAGTAATTGTCAGCAAGGCTACTTTCGAAGCCTACAAAAAAGCGGCGCTGGATGCTGAATTTGCATCCCTGTTTGACACCCTGGACTCCACCAACAAGGAACTGGTTAACCGATAATGAGGCATATATCACCGGAAGAACTTATTGCGCTTCATGATGCGAATATAAACCGCTACGGCGGCCTGCCGGGAATGTCTGATCCGGGCAGGGCAGAGGCCATTATCGGGAGAGTTCAGGCCAGAGTTGCCTACGAAGAGATCACCGACCTTTTCGAAGTCTCCGCCACCTACCTAGTGGCTACAGCGAGAGGGCATATATTCAATGATGCCAATAAGCGTACCGCGCTAAACAGTGCGCTGTTATTTCTACGCCGTAACGGGGTGCAGGTATTTGATTCACCTGAACTGGCAGACCTTACCGTAGGGGCTGCGACCGGAGAGATATCTGTATCTTCTGTCGCCGACACGTTACGTAGATTGTATGGTTCTGCGGAGTAGATTAATGGCACGCAAATACAACAAATTGTCCCGTGAAGCGTTAAAGATGCTTCTTGATGGCGTGAGTCGCCGCGAGGTAAAGCAATACCTGGCTGGTAAGCAAATTGGTGCCAGGACCGCTATTGCTGTGTTATGCCGTCAGGAAATGGTTGTGCTTAAACAGAGAATGCCGGGCAGCAGATAAAGCCCAATCAGTGATGAAAGGTGTGATGTGAAAGCCGTAATTACTCCCTTTGTACAAAAAGAGCTTGGCGTCGCCACATTCAAAGTGGATCAGGAAGTCAGAAAGCTGGTGGAGGCTGGCCGTAAATTTATTATGGAGCCGGTGCCGCGTGAGTTAATCGAGCACATGGACGACGGCCTCGTTGTTTCCGAGCAAACTATGGCAACAAATGAGGCGTTGCAGCCGTTTTTTAACAGCGATGAACTGTTTCGCCGTATTGGTGGAATTGACTCGCTGGTAGCGTGGTTGCGCAGGAAAGAGGGGCAGGTCAGGCTGGGCCATGTCGTTCTCGCCGGACAGCAGGCTTACCATGAAAGCACTGGAAATGGCATGGGAAACCCGTGGTAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP048345|88549:95979|95619_95979_+|WP_001513660.1|DBSCAN-SWA MKAVITPFVQKELGVATFKVDQEVRKLVEAGRKFIMEPVPRELIEHMDDGLVVSEQTMATNEALQPFFNSDELFRRIGGIDSLVAWLRRKEGQVRLGHVVLAGQQAYHESTGNGMGNPW >NZ_CP048345|88549:95979|94806_95028_+|WP_001190712.1|DBSCAN-SWA MQSINFRTARGNLSEVLNNVEAGEEVEITRRGREPAVIVSKATFEAYKKAALDAEFASLFDTLDSTNKELVNR >NZ_CP048345|88549:95979|88549_91666_-|WP_031311986.1|DBSCAN-SWA MTHQTHTIAESNNFIVLDKYIKAEPTGDSYQSESDLERELIQDLRNQGYEFISVKSQSAMLANVREQLQNLNGVVFNDSEWRRFTEQYLDNPSDGILDKTRKIHIDYICDFIFDDERLENIYLIDKKNLMRNKVQIIQQFEQTGSHANRYDVTILVNGLPLVQIELKKRGVAIREAFNQIHRYSKESFNSENSLFKYLQLFVISNGTDTHYFANTTKRDKNSFDFTMNWAKSDNTLIKDLKDFTATFFQKHTLLNVLVNYSVFDSSQTLLVMRPYQIAATERILWKINSSYKAKNWSTPESGGFIWHTTGSGKTLTSFKAARLATELDFIDKVFFVVDRKDLDYQTMKEYQRFSPDSVNGSENTAGLKRNLDKDDNKIIVTTIQKLNNLMKAESDLPVYNQQVVFIFDECHRSQFGEAQKNLKKKFKRYYQFGFTGTPIFPENALGSETTASVFGRELHSYVITDAIRDEKVLKFKVDYNDVRPQFKSLETETDEKKLSAAENQQAFLHPMRIQEITQYILNNFRQKTHRTFPGSKGFNAMLAVSSVDAAKAYYATFKRLQEEAANKSATYKPLRIATIFSFAANEEQNAIGEISDETFDTSAMDSSAKEFLDAAIREYNSHFKTNFSTDSNGFQNYYRDLAQRVKNQDIDLLIVVGMFLTGFDAPTLNTLFVDKNLRFHGLMQAFSRTNRIYDATKTFGNIVTFRDLERSTIDAITLFGDKNTKNVVLEKSYAEYMEGFTDAATGEAKRGFMAVVAELEQRFPDPASIESEKEKKDFVKLFGEYLRTENILQNYDEFATLKALQQIDLSDPVAVEKFKAEHYVDDEKLAELQTIRLPADRKIQDYRSAYNDIRDWQRREKEAEKKEKSTTDWDDVVFEVDLLKSQEINLDYILGLIFEHNRQNKGKGEMIEEVKRLIRSSLGNRAKEGLVVDFIQQTNLDDLPDKASIIDAFFTFAQHEQLREAEALIKEENLNEEAAKRYIRTSLKREYATENGTELNETLPKLSPLNPQYKTKKQAVFQKIVSFIEKFKGVGGKI >NZ_CP048345|88549:95979|95027_95408_+|WP_001216034.1|DBSCAN-SWA MRHISPEELIALHDANINRYGGLPGMSDPGRAEAIIGRVQARVAYEEITDLFEVSATYLVATARGHIFNDANKRTALNSALLFLRRNGVQVFDSPELADLTVGAATGEISVSSVADTLRRLYGSAE >NZ_CP048345|88549:95979|91787_93071_-|WP_001617890.1|DBSCAN-SWA MSELSYLEKLLDGVEVEWKTLEDISIKISSGGTPKTGVSEFYDGDIPWLRTQEVNFCDIWDTEVKITESGVKNSSAKWIPKNCVIVAMYGATVGKIGINKIPMTTNQACANIQLNEEVAHYRYVFHFLCSQYTYIKSLGTGSQTNINAQIVKNIKIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELTAELNMRKKQYNYYRDQLLTFKEGEVEWKALGEIGEFIRGKRFTKADYVEDGGISVIHYGEIYTRYGVYTTHSLSQVRADMAASLRYAKHGDVVITDVGETVEDVGKAVAWLGDDDIAIHDHCYAFRHSLNPKFISYYMQTDSFISEKAKYVARTKVNTLLINGFSKIMIPVPYPKDHEKSLKEQARIVEILDKFDTLTNSITEGLPREIELRQKQYEYYRDLLFSFPKPETVSN >NZ_CP048345|88549:95979|95412_95592_+|WP_001513661.1|DBSCAN-SWA MARKYNKLSREALKMLLDGVSRREVKQYLAGKQIGARTAIAVLCRQEMVVLKQRMPGSR >NZ_CP048345|88549:95979|93067_94624_-|WP_001553856.1|DBSCAN-SWA MTSIQQRAELHRQIWQIANDVRGSVDGWDFKQYVLGALFYRFISENFSSYIEAGDDSICYAKLDDSVITDDIKDDAIKTKGYFIYPSQLFCNVAAKANTNDRLNADLNSIFVAIESSAYGYPSEADIKGLFADFDTTSNRLGNTVKDKNARLAAVLKGVEGLKLGDFNEHQIDLFGDAYEFLISNYAANAGKSGGEFFTPQHVSRLIAQLAMHGQTSVNKIYDPAAGSGSLLLQAKKQFDNHIIEEGFFGQEINHTTYNLARMNMFLHNINYDKFDIKLGNTLTEPHFRDEKPFDAIVSNPPYSVKWIGSDDPTLINDERFAPAGVLAPKSKADFAFVLHALNYLSAKGRAAIVCFPGIFYRGGAEQKIRQYLVDNNYVETVISLAPNLFFGTTIAVNILVLSKHKTDTKVQFIDASELFKKETNNNILTDAHIEQIMQVFASKEDVAHLAKSVAFETVVANDYNLSVSCYVEAKDNREIINIAELNAELKTTVSKIDQLRKDIDAIVAEIEGCEVQK |
7 | Escherichia_phage(57.14%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_1 | 611229-611368 | Orphan |
NA
Consensus repeat of NZ_CP048344_1
|
1 spacers
spacers of NZ_CP048344_1
>1.1|611278|42|NZ_CP048344|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around NZ_CP048344_1
The CRISPR arrays of NZ_CP048344_1 >merge|NZ_CP048344|1|611229-611368|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >NZ_CP048344|1|1|611229-611368|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>NZ_CP048344.1|WP_001375265.1|610136_611177_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >NZ_CP048344.1|WP_001295551.1|609428_610064_+|NAD(P)H-binding-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >NZ_CP048344.1|WP_000037608.1|608782_609301_-|protein/nucleic-acid-deglycase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >NZ_CP048344.1|WP_000449030.1|608359_608803_+|YhbP-family-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >NZ_CP048344.1|WP_000189314.1|608006_608309_-|DNA-damage-response-exodeoxyribonuclease-YhbQ MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >NZ_CP048344.1|WP_000908554.1|607516_608020_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >NZ_CP048344.1|WP_001375267.1|606998_607523_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >NZ_CP048344.1|WP_000421305.1|605794_606790_-|U32-family-peptidase MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >NZ_CP048344.1|WP_001301318.1|604907_605786_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >NZ_CP048344.1|WP_000130392.1|603694_604702_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >NZ_CP048344.1|WP_000646033.1|611381_611957_-|divisome-associated-lipoprotein-YraP MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >NZ_CP048344.1|WP_001158034.1|611966_612557_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >NZ_CP048344.1|WP_000246837.1|612576_612972_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >NZ_CP048344.1|WP_000249160.1|612929_614966_-|penicillin-binding-protein-activator MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >NZ_CP048344.1|WP_000809262.1|615030_615891_+|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >NZ_CP048344.1|WP_000816988.1|615933_617025_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >NZ_CP048344.1|WP_001323952.1|620557_621313_-|galactosamine-6-phosphate-isomerase MERGTASGGASLLKEFHPVQTLQQVENYTALSERASEYLLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLREDQLISFRSEEINETECERVTNLIARKGGLDLCVLGLGKNGHLGLNEPGESLQPACHISQLDARTQQHEMLKTAGRPVTRGITLGLKDILNAREVLLLVTGEGKQDATERFLTAKVSTAIPASFLWLHSNFICLINT >NZ_CP048344.1|WP_000534351.1|621313_622105_-|PTS-N-acetylgalactosamine-transporter-subunit-IID MGSEISKKDITRLGFRSSLLQASFNYERMQAGGFTWAMLPILKKIYKDDKPGLSAAMKDNLEFINTHPNLVGFLMGLLISMEEKGENRDTIKGLKVALFGPIAGIGDAIFWFTLLPIMAGICSSFASQGNLLGPILFFAVYLLIFFLRVGWTHVGYSVGVKAIDKVRENSQMIARSATILGITVIGGLIASYVHINVVTSFAIDSTHSVALQQDFFDKVFPNILPMAYTLLMYYFLRVKKAHPVLLIGVTFVLSIVCSAFGIL >NZ_CP048344.1|WP_000544489.1|622094_622898_-|PTS-N-acetylgalactosamine-transporter-subunit-IIC MHEITLLQGLSLAALVFVLGIDFWLEALFLFRPIIVCTLTGAILGDIQTGLITGGLTELAFAGLTPAGGVQPPNPIMAGLMTTVIAWSTGVDAKTAIGLGLPFSLLMQYVILFFYSAFSLFMTKADKCAKEADTAAFSRLNWTTMLIVASAYAVIAFLCTYLAQGAMQALVKAMPAWLTHGFEVAGGILPAVGFGLLLRVMFKAQYIPYLIAGFLFVCYIQVSNLLPVAVLGAGFAVYEFFNAKSRQQAQPQPVASKNEEEDYSNGI >NZ_CP048344.1|WP_000098025.1|622936_623413_-|PTS-N-acetylgalactosamine-transporter-subunit-IIB MSSPNILLTRIDNRLVHGQVGVTWTSTIGANLLVVVDDVVANDDIQQKLMGITAETYGFGIRFFTIEKTINVIGKAAPHQKIFLICRTPQTVRKLVEGGIDLKDVNVGNMHFSEGKKQISSKVYVDDQDLTDLRFIKQRGVNVFIQDVPGDQKEQIPD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_2 | 654568-654685 | Orphan |
NA
Consensus repeat of NZ_CP048344_2
|
1 spacers
spacers of NZ_CP048344_2
>2.1|654608|38|NZ_CP048344|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around NZ_CP048344_2
The CRISPR arrays of NZ_CP048344_2 >merge|NZ_CP048344|2|654568-654685|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >NZ_CP048344|2|2|654568-654685|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>NZ_CP048344.1|WP_000460519.1|653236_654547_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >NZ_CP048344.1|WP_000401598.1|651877_653209_+|HAAAP-family-serine/threonine-permease MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >NZ_CP048344.1|WP_000622115.1|650238_651603_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >NZ_CP048344.1|WP_001375219.1|649777_650167_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >NZ_CP048344.1|WP_000861734.1|647469_649764_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >NZ_CP048344.1|WP_001297162.1|646227_647436_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >NZ_CP048344.1|WP_000107720.1|644870_646202_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >NZ_CP048344.1|WP_000548347.1|643859_644849_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >NZ_CP048344.1|WP_000104211.1|642822_643761_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >NZ_CP048344.1|WP_000145820.1|642289_642634_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >NZ_CP048344.1|WP_001295544.1|654758_654923_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >NZ_CP048344.1|WP_000633577.1|654945_655647_-|pirin-family-protein MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >NZ_CP048344.1|WP_001041010.1|655751_656648_+|LysR-family-transcriptional-regulator MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >NZ_CP048344.1|WP_001198780.1|656698_657055_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN >NZ_CP048344.1|WP_000384145.1|657296_657662_-|DUF805-domain-containing-protein MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >NZ_CP048344.1|WP_000531204.1|657954_658941_-|glutathione-S-transferase-family-protein MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >NZ_CP048344.1|WP_000603618.1|659010_659493_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >NZ_CP048344.1|WP_000096086.1|659588_659888_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >NZ_CP048344.1|WP_000785722.1|659877_660282_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >NZ_CP048344.1|WP_000031415.1|660284_660590_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_3 | 1027798-1028314 | Orphan |
I-E
Consensus repeat of NZ_CP048344_3
|
8 spacers
spacers of NZ_CP048344_3
>3.1|1027827|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >3.2|1027888|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT GCTGATGGTCTGGGAGTGTCCATCGGGCAACT >3.3|1027949|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >3.4|1028010|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >3.5|1028071|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >3.6|1028132|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >3.8|1028254|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around NZ_CP048344_3
The CRISPR arrays of NZ_CP048344_3 >merge|NZ_CP048344|3|1027798-1028314|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP048344|3|1|1027798-1028314|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP048344|3|3|1027798-1028314|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP048344|3|1|1027798-1028314|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>NZ_CP048344.1|WP_001199979.1|1026786_1027458_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP048344.1|WP_001288227.1|1026507_1026648_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >NZ_CP048344.1|WP_001679366.1|1025621_1026494_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >NZ_CP048344.1|WP_000036723.1|1024263_1025562_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP048344.1|WP_000210878.1|1022538_1024176_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >NZ_CP048344.1|WP_001071648.1|1021519_1022311_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >NZ_CP048344.1|WP_000254738.1|1021113_1021449_+|endoribonuclease-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >NZ_CP048344.1|WP_000581937.1|1020865_1021114_+|type-II-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >NZ_CP048344.1|WP_000226815.1|1018553_1020788_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >NZ_CP048344.1|WP_000046812.1|1017204_1018506_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >NZ_CP048344.1|WP_000039683.1|1028951_1030430_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >NZ_CP048344.1|WP_060621186.1|1030456_1031734_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFIFLLFQKIRTADSAPAMASSK >NZ_CP048344.1|WP_000021330.1|1032052_1032838_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >NZ_CP048344.1|WP_000059312.1|1032907_1034362_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >NZ_CP048344.1|WP_000147666.1|1034383_1035793_+|MFS-transporter MTGRCLFGFSGEKPFLLPDNEGVKMNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >NZ_CP048344.1|WP_001324445.1|1035770_1036550_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >NZ_CP048344.1|WP_001324446.1|1036546_1037407_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >NZ_CP048344.1|WP_001130266.1|1037554_1038130_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >NZ_CP048344.1|WP_000109532.1|1038146_1038407_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >NZ_CP048344.1|WP_001295150.1|1038397_1039669_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_4 | 1050699-1051277 | Unclear |
I-E
Consensus repeat of NZ_CP048344_4
|
9 spacers
spacers of NZ_CP048344_4
>4.1|1050729|31|NZ_CP048344|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >4.2|1050790|31|NZ_CP048344|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >4.3|1050851|31|NZ_CP048344|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >4.4|1050912|31|NZ_CP048344|CRISPRCasFinder CCCGTCACCGACGCGCAGTGGCGCTACCGTG >4.5|1050973|31|NZ_CP048344|CRISPRCasFinder GGATCTAACGCGCTGTAAAAATTCCGTGCTT >4.6|1051034|31|NZ_CP048344|CRISPRCasFinder TGCGGATTACCGGCAAAACATGGGAGCAAAC >4.7|1051095|31|NZ_CP048344|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >4.8|1051156|31|NZ_CP048344|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >4.9|1051217|31|NZ_CP048344|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >4.10|1050729|32|NZ_CP048344|PILER-CR,CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >4.11|1050790|32|NZ_CP048344|PILER-CR,CRT ACGGACAAAATATATATTGATTTGCGAATTAT >4.12|1050851|32|NZ_CP048344|PILER-CR,CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >4.13|1050912|32|NZ_CP048344|PILER-CR,CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >4.14|1050973|32|NZ_CP048344|PILER-CR,CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >4.15|1051034|32|NZ_CP048344|PILER-CR,CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >4.16|1051095|32|NZ_CP048344|PILER-CR,CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >4.17|1051156|32|NZ_CP048344|PILER-CR,CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >4.18|1051217|32|NZ_CP048344|PILER-CR,CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_CP048344_4
The CRISPR arrays of NZ_CP048344_4 >merge|NZ_CP048344|4|1050699-1051277|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP048344|4|4|1050699-1051277|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTG AGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTT TGTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAAC CGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP048344|4|2|1050700-1051277|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP048344|4|2|1050700-1051277|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>NZ_CP048344.1|WP_000063176.1|1050309_1050603_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >NZ_CP048344.1|WP_000144861.1|1049389_1050313_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >NZ_CP048344.1|WP_000281446.1|1048742_1049393_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >NZ_CP048344.1|WP_000085051.1|1048014_1048761_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >NZ_CP048344.1|WP_000956458.1|1045011_1045164_+|type-I-toxin-antitoxin-system-Hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >NZ_CP048344.1|WP_000039842.1|1044012_1044747_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >NZ_CP048344.1|WP_001290706.1|1042226_1043939_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >NZ_CP048344.1|WP_000211954.1|1040427_1042227_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >NZ_CP048344.1|WP_000987944.1|1039746_1040112_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >NZ_CP048344.1|WP_001295150.1|1038397_1039669_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL >NZ_CP048344.1|WP_000490426.1|1051358_1052396_-|alkaline-phosphatase-isozyme-conversion-aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >NZ_CP048344.1|WP_000372108.1|1052647_1053556_+|sulfate-adenylyltransferase-subunit-CysD MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >NZ_CP048344.1|WP_001090386.1|1053557_1054985_+|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >NZ_CP048344.1|WP_001173673.1|1054984_1055590_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >NZ_CP048344.1|WP_124039059.1|1055639_1055963_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHLLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >NZ_CP048344.1|WP_000517476.1|1056156_1056468_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >NZ_CP048344.1|WP_000246138.1|1056486_1057197_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >NZ_CP048344.1|WP_001374730.1|1057196_1057676_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >NZ_CP048344.1|WP_000568943.1|1057672_1058722_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >NZ_CP048344.1|WP_001374723.1|1058702_1059464_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_5 | 1569992-1570109 | Orphan |
NA
Consensus repeat of NZ_CP048344_5
|
1 spacers
spacers of NZ_CP048344_5
>5.1|1570023|56|NZ_CP048344|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around NZ_CP048344_5
The CRISPR arrays of NZ_CP048344_5 >merge|NZ_CP048344|5|1569992-1570109|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >NZ_CP048344|5|5|1569992-1570109|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>NZ_CP048344.1|WP_000332037.1|1568764_1569895_-|ribonucleotide-diphosphate-reductase-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >NZ_CP048344.1|WP_000135040.1|1568510_1568765_-|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >NZ_CP048344.1|WP_000301049.1|1567806_1568457_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >NZ_CP048344.1|WP_072163405.1|1565072_1565387_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >NZ_CP048344.1|WP_000768974.1|1563777_1564854_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >NZ_CP048344.1|WP_000948732.1|1562414_1563773_+|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >NZ_CP048344.1|WP_000857251.1|1560513_1562142_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >NZ_CP048344.1|WP_060621281.1|1559264_1560524_-|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVMNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >NZ_CP048344.1|WP_001000370.1|1558077_1559268_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTAKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >NZ_CP048344.1|WP_001374259.1|1556985_1557885_-|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVKGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAKRSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >NZ_CP048344.1|WP_001075164.1|1570128_1572414_-|ribonucleoside-diphosphate-reductase-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >NZ_CP048344.1|WP_001220074.1|1573109_1576862_+|AIDA-I-family-autotransporter-adhesin-YfaL/EhaC MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >NZ_CP048344.1|WP_000990756.1|1576989_1577712_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >NZ_CP048344.1|WP_001281225.1|1577858_1580486_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >NZ_CP048344.1|WP_000012305.1|1580634_1582323_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >NZ_CP048344.1|WP_001295211.1|1582319_1582943_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >NZ_CP048344.1|WP_122633159.1|1583086_1587481_+|alpha-2-macroglobulin-family-protein MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >NZ_CP048344.1|WP_001104488.1|1587481_1589131_+|DUF2300-domain-containing-protein MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >NZ_CP048344.1|WP_001567753.1|1589135_1589912_+|YfaP-family-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >NZ_CP048344.1|WP_000786548.1|1589985_1591170_-|acetyl-CoA-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_6 | 2174874-2174997 | Orphan |
NA
Consensus repeat of NZ_CP048344_6
|
1 spacers
spacers of NZ_CP048344_6
>6.1|2174917|38|NZ_CP048344|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around NZ_CP048344_6
The CRISPR arrays of NZ_CP048344_6 >merge|NZ_CP048344|6|2174874-2174997|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >NZ_CP048344|6|6|2174874-2174997|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>NZ_CP048344.1|WP_000212657.1|2174433_2174739_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >NZ_CP048344.1|WP_000716929.1|2172703_2174308_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >NZ_CP048344.1|WP_000587555.1|2171879_2172692_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >NZ_CP048344.1|WP_001069997.1|2171090_2171876_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >NZ_CP048344.1|WP_001310861.1|2170425_2171094_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >NZ_CP048344.1|WP_001297805.1|2169723_2170362_+|YdhW-family-putative-oxidoreductase-system-protein MNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >NZ_CP048344.1|WP_001678907.1|2167608_2169711_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >NZ_CP048344.1|WP_001070230.1|2166961_2167588_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >NZ_CP048344.1|WP_000528342.1|2166296_2166506_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >NZ_CP048344.1|WP_001295403.1|2164328_2165741_-|pyruvate-kinase-PykF MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >NZ_CP048344.1|WP_000534291.1|2175311_2176568_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >NZ_CP048344.1|WP_001174942.1|2176608_2177982_-|multidrug-efflux-MATE-transporter-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >NZ_CP048344.1|WP_001373655.1|2178196_2178838_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >NZ_CP048344.1|WP_000098911.1|2178877_2180026_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >NZ_CP048344.1|WP_001182363.1|2180316_2181528_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >NZ_CP048344.1|WP_060621248.1|2181640_2182573_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVVLELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >NZ_CP048344.1|WP_000190982.1|2182569_2183595_-|HTH-type-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >NZ_CP048344.1|WP_000102278.1|2183893_2183983_+|stress-response-protein-YnhF MSTDLKFSLVTTIIVLGLIVAVGLTAALH >NZ_CP048344.1|WP_000701040.1|2184148_2185318_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >NZ_CP048344.1|WP_000007283.1|2185463_2186045_-|superoxide-dismutase-[Fe] MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_7 | 2896012-2896103 | Orphan |
NA
Consensus repeat of NZ_CP048344_7
|
1 spacers
spacers of NZ_CP048344_7
>7.1|2896038|40|NZ_CP048344|CRISPRCasFinder GCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGT |
CRISPR arrays and Neighbor proteins around NZ_CP048344_7
The CRISPR arrays of NZ_CP048344_7 >merge|NZ_CP048344|7|2896012-2896103|CRISPRCasFinder CCACCTTTTTTACCTGCTTCAGATGCGCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGTCCACCTTTTTTACCTGCTTCTGATGC >NZ_CP048344|7|7|2896012-2896103|CRISPRCasFinder CCACCTTTTTTACCTGCTTCAGATGC GCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGT CCACCTTTTTTACCTGCTTCTGATGC
>NZ_CP048344.1|WP_001347171.1|2894569_2895898_+|pyrimidine-utilization-transport-protein-G MAMFGFPHWQLKSTSTESGVVAPDERLPFAQTAIMGVQHAVAMFGATVLMPILMGLDPNLSILMSGVGTLLFFFITGGRVPSYLGSSAAFVGVVIAATGFNGQGINPNISIALGGIIACGLVYTVIGLVVMKIGTRWIERLMPPVVTGAVVMAIGLNLAPIAVKSVSASAFDSWMAVMTVLCIGLVAVFTRGMIQRLLILVGLIVACLLYGVMTNLLGLGKAVDFTLVSHAAWFGLPHFSTPAFNSQAMMLIAPVAVILVAENLGHLKAVAGMTGRNMDPYMGRAFVGDGLATMLSGSVGGSGVTTYAENIGVMAVTKVYSTLVFVAAAVIAMLLGFSPKFGALIHTIPAAVIGGASIVVFGLIAVAGARIWVQNRVDLSQNGNLIMVAVTLVLGAGDFALTLGGFTLGGIGTATFGAILLNALLSRKLVDVPPPEVVHQEP >NZ_CP048344.1|WP_001028095.1|2894054_2894549_+|pyrimidine-utilization-flavin-reductase-protein-F MNIVDQQTFRDAMSCMGAAVNIITTDGPAGRAGFTASAVCSVTDTPPTLLVCLNRGASVWPVFNENRTLCVNTLSAGQEPLSNLFGGKTPMEHRFAAARWQTGVTGCPQLEEALVSFDCRISQVVSVGTHDILFCAIEAIHRHATPYGLVWFDRSYHALMRPAC >NZ_CP048344.1|WP_001001184.1|2893453_2894044_+|malonic-semialdehyde-reductase MNEAVSPGALSTLFTDARTHNGWRETPVSDETLRELYALMKWGPTSANCSPARIVFIRTAEGKERLRPALSSGNLQKTLTAPVTAIVAWDSEFYERLPLLFPHGDARSWFTSSPQLAEETAFRNSSMQAAYLIVACRALGLDTGPMSGFDRQYVDDAFFAGSTLKSNLLINIGYGDNSKLYARLPRLSFEEACGLL >NZ_CP048344.1|WP_001323674.1|2892643_2893444_+|pyrimidine-utilization-protein-D MKLSLSPPPYADAPVVVLISGLGGSGSYWLPQLAVLEQEYQVVCYDQRGTGNNPDTLAEDYSIAQMAAELHQALVAAGIEHYAVVGHALGALVGMQLALDYPASVTVLVCVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYPADWMAARAPRLEAEDALALAHFQGKNNLLRRLNALKRADFSHHAVRIRCPVQIICASDDLLVPSACSSELHAALPDSQKMVMRYGGHACNVTDPETFNALLLNGLASLLHHREAAL >NZ_CP048344.1|WP_001126787.1|2892249_2892636_+|pyrimidine-utilization-protein-C MPKSVIIPAGSSAPLAPFVPGTLADGVVYVSGTLAFDQHNNVLFADDPKAQTRHVLETIRTVIETAGGTMADVTFNSIFITDWKNYAAINEIYAEFFPGDKPARFCIQCGLVKPDALVEIATIAHIAK >NZ_CP048344.1|WP_001345643.1|2891545_2892238_+|peroxyureidoacrylate/ureidoacrylate-amidohydrolase-RutB MTTLTARPEAITFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQNGWDEQYVEAGGPGSPNFHKSNALKTMRKQPQLQGKLLAKGSWDYQLVDELVPQPGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDATHQAGPEFVQKAALFNIETFFGWVSDVETFCDALSPTSFARIA >NZ_CP048344.1|WP_001345642.1|2890454_2891546_+|pyrimidine-utilization-protein-A MKIGVFVPIGNNGWLISTHAPQYMPTFELNKAIVQKAEHYHFDFALSMIKLRGFGGKTEFWDHNLESFTLMAGLAAVTSRIQIYATAATLTLPPAIVARMAATIDSISGGRFGVNLVTGWQKPEYEQMGIWPGDDYFSRRYDYLTEYVQVLRDLWGSGKSDFKGDFFTMNDCRVSPQPSVPMKVICAGQSDAGMAFSAQYADFNFCFGKGVNTPTAFAPTAARMKQAAEQTGRDVGSYVLFMVIADETDDAARAKWEHYKAGADEEALSWLTEQSQKDTRSGTDTNVRQMADPTSAVNINMGTLVGSYASVARMLDEVASVPGAEGVLLTFDDFLSGIETFGERIQPLMQCRAHLPALTQEVA >NZ_CP048344.1|WP_001295606.1|2889528_2890167_-|HTH-type-transcriptional-regulator-RutR MTQGAVKTTGKRSRTVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQILDIWLAPLKAFREDFAPLAAIKEYIRLKLEVSRDYPQASRLFCMEMLAGAPLLMDELTGDLKALIDEKSALIAGWVKSGKLAPIDPQHLIFMIWASTQHYADFAPQVEAVTGATLRDEVFFNQTVENVQRIIIEGIRPR >NZ_CP048344.1|WP_001299828.1|2885526_2889489_+|trifunctional-transcriptional-regulator/proline-dehydrogenase/L-glutamate-gamma-semialdehyde-dehydrogenase MGTTTMGVKLDDATRERIKSAATRIDRTPHWLIKQAIFSYLEQLENSDTLPELPALLSGAANESDEAPTPAEEPHQPFLDFAEQILPQSVSRAAITAAYRRPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAGMVQGLLQEFSLSSQEGVALMCLAEALLRIPDKATRDALIRDKISNGNWQSHIGRSPSLFVNAATWGLLFTGKLVSTHNEASLSRSLNRIIGKSGEPLIRKGVDMAMRLMGEQFVTGETIAEALANARKLEEKGFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYSRAQYDRVMEELYPRLKSLTLLARQYDIGINIDAEEADRLEISLDLLEKLCFEPELAGWNGIGFVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGLEGYPVYTRKVYTDVSYLACAKKLLAVPNLIYPQFATHNAHTLAAIYQLAGQNYYPGQYEFQCLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLENGANTSFVNRIADTSLPLDELVADPVTAVEKLAQQEGQTGLPHPKIPLPRDLYGHGRDNSAGLDLANEHRLASLSSALLNSALQKWQALPMLEQPVAAGEMSPVINPAEPKDIVGFVREATPREVEQALESAVNNAPIWFATPPVERAAILHRAAVLMESQMQQLIGILVREAGKTFSNAIAEVREAVDFLHYYAGQVRDDFANETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQGIAILLEAGVPPGVVQLLPGQGETVGAQLTGDDRVRGVMFTGSTEVATLLQRNIASRLDAQGRPIPLIAETGGMNAMIVDSSALTEQVVVDVLASAFDSAGQRCSALRVLCLQDEIADHTLKMLRGAMAECRMGNPGRLTTDIGPVIDSEAKANIERHIQTMRSKGRPVFQAVRENSEDAREWQSGTFVAPTLIELDDFAELQKEVFGPVLHVVRYNRNQLPELIEQINASGYGLTLGVHTRIDETIAQVTGSAHVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPKAGGPLYLYRLLANRPESALAVTLARQDAEYPVDAQLKAALTQPLNALREWAANRPELQALCTQYGELAQAGTQRLLPGPTGERNTWTLLPRERVLCIADDEQDALTQLAAVLAVGSQVLWPDDALHRQLVKALPSAVSERIQLAKAENITAQPFDAVIFHGDSDQLRALCEAVAARDGAIVSVQGFARGESNILLERLYIERSLSVNTAAAGGNASLMTIG >NZ_CP048344.1|WP_001678465.1|2883596_2885105_-|sodium/proline-symporter-PutP MAISTPMLVTFCVYIFGMILIGFIAWRSTKNFDDYILGGRSLGPFVTALSAGASDMSGWLLMGLPGAVFLSGISESWIAIGLTLGAWINWKLVAGRLRVHTEYNNNALTLPDYFTGRFEDKSRILRIISALVILLFFTIYCASGIVAGARLFESTFGMSYETALWAGAAATILYTFIGGFLAVSWTDTVQASLMIFALILTPVIVIISVGGFGDSLEVIKQKSIENVDMLKGLNFVAIISLMGWGLGYFGQPHILARFMAADSHHSIVHARRISMTWMILCLAGAVAVGFFGIAYFNEHPAVAGAVNQNAERVFIELAQILFNPWIAGILLSAILAAVMSTLSCQLLVCSSAITEDLYKAFLRKHASQKELVWVGRVMVLVVALVAIALAANPENRVLGLVSYAWAGFGAAFGPVVLFSVMWSRMTRNGALAGMIIGALTVIVWKQFGWLGVYEIIPGFIFGSIGIVVFSLLGKAPSAAMQKRFAEADAHYHSAPPSRLQES >NZ_CP048344.1|WP_001151437.1|2896526_2897123_+|NAD(P)H:quinone-oxidoreductase MAKVLVLYYSMYGHIETMARAVAEGASKVDGAEVVVKRVPETMPPQLFEKAGGKTQTAPVATPQELADYDAIIFGTPTRFGNMSGQMRTFLDQTGGLWASGALYGKLASVFSSTGTGGGQEQTITSTWTTLAHHGMVIVPIGYAAQELFDVSQVRGGTPYGATTIAGGDGSRQPSQEELSIARYQGEYVAGLAVKLNG >NZ_CP048344.1|WP_001143120.1|2897143_2897371_+|hypothetical-protein MPTQEAKAHHVGEWASLRNTSPEIAEAIFEVAGYDEKMAEKIWEEGSDEVLVKAFAKTDKDSLFWGEQTIERKNV >NZ_CP048344.1|WP_001044313.1|2897408_2898650_-|bifunctional-glucose-1-phosphatase/inositol-phosphatase MNKTLIAATVAGIVLLASNAQAQTVPEGYQLQQVLMMSRHNLRAPLANNGSVLEQSTPNKWPEWDVPGGQLTTKGGVLEVYMGHYMREWLAQQGMVKSGECPPPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAFSEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQQCSLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYYEGFPMDQVAWGEIKSDQQWKVLSKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRTSAPKITVLVGHDSNIASLLTALDFKPYQLHDQNERTPIGGKIVFQRWHDSKANRDLMKIEYVYQSAEQLRNADALTLQAPAQRVTLELSGCPIDANGFCPMDKFDSVLNEAVK >NZ_CP048344.1|WP_000097602.1|2898941_2900201_-|YccE-family-protein MSSNIHGISCTANNYLKQAWNNIKNEHEKNQKYSITLFENTLVCFMRLYKEIRRQKAEDYIPCLECDSLEKEFEEMQNDNDLSLFLRTLRTNDTETYSGVSEGITYTIQYVRDIDIVRVSLPGRGSESITDFKGYYWYGFMEYIENINACDDVFSEYCLDDENMSIQPEWINTPGISDLDTGIDLSGISFIQSEINKTYGLKYAPVDGDGYCLLRAILVLKEHEYSWALGSHKTQKQVYEEFIKIVDKQTIEALVDTAFNDLREDVKTLFGVNLQSDNKIQGQGGFLSWSFLSFKKEFIDSCLNDKKCILHLPEFIFNDNKARLVLDTDPEQKVNEVKNFLTALSDSICSLFIVNSNVASISLGNESFSTDDDLEYGYLINTGNHYDVYLPPELFAQAYELNNKERNAQIDFLTRYAIY >NZ_CP048344.1|WP_000420629.1|2900460_2901381_+|curved-DNA-binding-protein MELKDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQMWQHRNDPQFNRQFHHSDGQSFNAEDFDDIFSSIFGQHARQSRQRPATRGHDIEIEVAVFLEETLTEHKRTISYNLPVYNAFGMIEQEIPKTLNVKIPAGVGNGQRIRLKGQGTPGENGGPNGDLWLVIHIAPHPLFDIVGHDLEIVVPVSPWEAALGAKVTVPTLKESILLTIPPGSQAGQRLRVKGKGLVSKKQTGDLYAVLKIVMPPKPDENTAALWQQLADAQSSFDPRKDWGKA >NZ_CP048344.1|WP_000024560.1|2901380_2901686_+|chaperone-modulator-CbpM MANVTVTFTITEFCLHTGISEEELNEIVGLGVVEPREIQETTWVFDDHAAIVVQRAVRLRHELALDWPGIAVALTLMDDIAHLKQENRLLRQRLSRFVAHP >NZ_CP048344.1|WP_000209869.1|2901778_2902378_-|molecular-chaperone-TorD MTTLTAQQIACVYAWLAQLFSRELDDEQLTQIASAQMAEWFSLLKSEPPLAAAVNELENCIATLTVRDDARLELAADFCGLFLMTDKQAALPYASAYKQDEQEIKRLLVEAGMETSGNFNEPADHLAIYLELLSHLHFSLGEGTVPARRIDSLRQKTLTALWQWLPEFVVRCRQYDSFGFYAALSQLLLVLVESDHQNR >NZ_CP048344.1|WP_001062101.1|2902374_2904921_-|trimethylamine-N-oxide-reductase-TorA MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTPRRATAAQAATDAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELDKYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGWQSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANWWCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAYTLYSENLYDKNFLANYCVGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGLPGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMCIFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEARNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPDLEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQKYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGKEPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWYDPDKGGEPGALCKYGNPNVLTIDIGTSQLAQATSAHTTLVEIEKYNGAVEQVTAFNGPVEMVAQCEYVPASQVKS >NZ_CP048344.1|WP_001323677.1|2904920_2906093_-|pentaheme-c-type-cytochrome-TorC MRKLWNALRRPSARWSVLALVAIGIVIGIALIVLPHVGIKVTSTTEFCVSCHSMQPVYEEYKQSVHFQNASGVRAECHDCHIPPDMPGMVKRKLEASNDIYQTFIAHSIDTPEKFEAKRAELAEREWARMKENNSATCRSCHNYDAMDHAKQHPEAARQMKVAAKDNQSCIDCHKGIAHQLPDMSSGFRKQFDELRASANDSGDTLYSIDIKPIYAAKGDKEASGSLLPASEVKVLKRDGDWLQIEITGWTESAGRQRVLTQFPGKRIFVASIRGDVQQQVKTLEKTTVADTNTEWSKLQATAWMKKGDMVNDIKPIWAYADSLYNGTCNQCHGAPEIAHFDANGWIGTLNGMIGFTSLDKREERTLLKYLQMNASDTAGKAHGDKKEEK >NZ_CP048344.1|WP_001120112.1|2906222_2906915_+|two-component-system-response-regulator-TorR MPHHIVIVEDEPVTQARLQSYFTQEGYTVSVTASGAGLREIMQNQPVDLILLDINLPDENGLMLTRALRERSTVGIILVTGRSDRIDRIVGLEMGADDYVTKPLELRELVVRVKNLLWRIDLARQAQPHTQDNCYRFAGYCLNVSRHTLERDGEPIKLTRAEYEMLVAFVTNPGEILSRERLLRMLSARRVENPDLRTVDVLIRRLRHKLSADLLVTQHGEGYFLAADVC |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_8 | 3184577-3184721 | Orphan |
NA
Consensus repeat of NZ_CP048344_8
|
1 spacers
spacers of NZ_CP048344_8
>8.1|3184629|41|NZ_CP048344|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around NZ_CP048344_8
The CRISPR arrays of NZ_CP048344_8 >merge|NZ_CP048344|8|3184577-3184721|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >NZ_CP048344|8|8|3184577-3184721|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>NZ_CP048344.1|WP_001091569.1|3183227_3184511_+|putative-acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >NZ_CP048344.1|WP_000533646.1|3182022_3183093_+|tyrosine-type-recombinase/integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >NZ_CP048344.1|WP_001303849.1|3181826_3182045_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >NZ_CP048344.1|WP_000545745.1|3181619_3181787_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >NZ_CP048344.1|WP_000120065.1|3180774_3181377_-|hypothetical-protein MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >NZ_CP048344.1|WP_000763365.1|3180342_3180564_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >NZ_CP048344.1|WP_001395510.1|3179962_3180244_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >NZ_CP048344.1|WP_023148020.1|3179760_3179952_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >NZ_CP048344.1|WP_072126246.1|3179605_3179788_+|DUF1317-domain-containing-protein MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >NZ_CP048344.1|WP_001372450.1|3178928_3179609_+|YqaJ-viral-recombinase-family-protein MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP048344.1|WP_001372426.1|3184744_3187006_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >NZ_CP048344.1|WP_001036475.1|3187188_3188622_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >NZ_CP048344.1|WP_001372427.1|3188697_3189750_-|4-oxalomesaconate-tautomerase MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >NZ_CP048344.1|WP_000679972.1|3189933_3190887_+|LysR-family-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >NZ_CP048344.1|WP_000815449.1|3190927_3191923_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >NZ_CP048344.1|WP_000213425.1|3192077_3192896_+|pyridoxal-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >NZ_CP048344.1|WP_000891692.1|3192896_3193955_-|molybdenum-ABC-transporter-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >NZ_CP048344.1|WP_000604034.1|3193957_3194647_-|molybdate-ABC-transporter-permease-subunit MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >NZ_CP048344.1|WP_000101993.1|3194646_3195420_-|molybdate-ABC-transporter-substrate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >NZ_CP048344.1|WP_000891515.1|3195586_3195736_-|multidrug-efflux-pump-accessory-protein-AcrZ MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP048344_9 | 3690114-3690267 | Orphan |
NA
Consensus repeat of NZ_CP048344_9
|
1 spacers
spacers of NZ_CP048344_9
>9.1|3690167|48|NZ_CP048344|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around NZ_CP048344_9
The CRISPR arrays of NZ_CP048344_9 >merge|NZ_CP048344|9|3690114-3690267|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >NZ_CP048344|9|9|3690114-3690267|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>NZ_CP048344.1|WP_000952760.1|3688239_3689979_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >NZ_CP048344.1|WP_032283079.1|3687509_3688295_-|putative-lateral-flagellar-export/assembly-protein-LafU MTTIKLIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >NZ_CP048344.1|WP_001226155.1|3686383_3687439_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >NZ_CP048344.1|WP_001059874.1|3685934_3686387_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >NZ_CP048344.1|WP_001295202.1|3685361_3685628_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >NZ_CP048344.1|WP_001293003.1|3683547_3685005_+|cytosol-nonspecific-dipeptidase MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >NZ_CP048344.1|WP_001291992.1|3682828_3683287_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >NZ_CP048344.1|WP_000189539.1|3681492_3682737_-|esterase-FrsA MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >NZ_CP048344.1|WP_000174677.1|3681033_3681435_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >NZ_CP048344.1|WP_000749881.1|3679939_3680995_+|phosphoporin-PhoE MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >NZ_CP048344.1|WP_000006256.1|3690296_3690794_-|REP-associated-tyrosine-transposase-RayT MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >NZ_CP048344.1|WP_000009291.1|3690969_3691728_-|C40-family-peptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >NZ_CP048344.1|WP_001225679.1|3692019_3692760_+|murein-L,D-transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >NZ_CP048344.1|WP_000333380.1|3692730_3693498_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >NZ_CP048344.1|WP_000284050.1|3693703_3694282_-|D-sedoheptulose-7-phosphate-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >NZ_CP048344.1|WP_000973093.1|3694521_3696966_+|acyl-CoA-dehydrogenase-FadE MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >NZ_CP048344.1|WP_000532698.1|3697008_3697482_-|C-lysozyme-inhibitor MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >NZ_CP048344.1|WP_001118055.1|3697635_3698406_+|2-oxoglutaramate-amidase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >NZ_CP048344.1|WP_000978828.1|3699844_3700294_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >NZ_CP048344.1|WP_001087742.1|3705206_3706559_+|membrane-protein MNSNVLTQTIVTGSDPRGLPEFSAIREEINKASHPSQPELNWKLVESLALAIFKANGVDLHTATYYTLARTRTQGLAGFCEGAELLAAMVSHDWDKFWPQGGPARTEMLDWFNSRTGNILRQQISFAESDLPLIYRTERALQLICDKLQQVELKRVPRVENLLYFMQNTRKRLEPQLKSNTENAAQTTVRTLIYAPETQASSTPEAVVPPLPGLPEMKVEVRSLTENPPQASVIKQGSTVRGFIAGIACSVAVASALWWWQVYPVQQQLLQVNDTAQGAATVWMASPELENYERRLQQLLDTSPVQPLETGMQMMRVADSRWPESLQQQQASTQWNEALKTRAQSSPQLRGWLQTRQDLHAFADLVMQREKEGLTLSYIKNVIWQAERGLGQETPVESLLTQYHDARAQKQNTDTLEKQINERLEGVLSRWLLLKNNVMPEAATGTTAEK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP048344_7 | 7.1|2896038|40|NZ_CP048344|CRISPRCasFinder | 2896038-2896077 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
NZ_CP048344_6 | 6.1|2174917|38|NZ_CP048344|CRISPRCasFinder | 2174917-2174954 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
NZ_CP048344_9 | 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder | 3690167-3690214 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
NZ_CP048344_9 | 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder | 3690167-3690214 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP048344_9 | 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder | 3690167-3690214 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP048344_9 | 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder | 3690167-3690214 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP048344_1 | 1.1|611278|42|NZ_CP048344|CRISPRCasFinder | 611278-611319 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
NZ_CP048344_3 | 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028193-1028224 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
NZ_CP048344_4 | 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder | 1050729-1050759 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
NZ_CP048344_4 | 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder | 1050729-1050759 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
NZ_CP048344_4 | 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder | 1050729-1050759 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
NZ_CP048344_4 | 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder | 1050912-1050942 | 31 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18007 | 7 | 0.774 |
NZ_CP048344_4 | 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder | 1051095-1051125 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
NZ_CP048344_1 | 1.1|611278|42|NZ_CP048344|CRISPRCasFinder | 611278-611319 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
NZ_CP048344_3 | 3.6|1028132|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1028132-1028163 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
NZ_CP048344_4 | 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder | 1050912-1050942 | 31 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97498-97528 | 8 | 0.742 |
NZ_CP048344_4 | 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder | 1051095-1051125 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
NZ_CP048344_4 | 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder | 1051095-1051125 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
NZ_CP048344_4 | 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder | 1051095-1051125 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
NZ_CP048344_4 | 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder | 1051095-1051125 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
NZ_CP048344_4 | 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT | 1050729-1050760 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
NZ_CP048344_4 | 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT | 1050729-1050760 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
NZ_CP048344_4 | 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT | 1050729-1050760 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
NZ_CP048344_4 | 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT | 1050729-1050760 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
NZ_CP048344_4 | 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT | 1050912-1050943 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
NZ_CP048344_4 | 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT | 1050912-1050943 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
NZ_CP048344_4 | 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT | 1051095-1051126 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
NZ_CP048344_4 | 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT | 1051095-1051126 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
NZ_CP048344_4 | 4.17|1051156|32|NZ_CP048344|PILER-CR,CRT | 1051156-1051187 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
NZ_CP048344_1 | 1.1|611278|42|NZ_CP048344|CRISPRCasFinder | 611278-611319 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
NZ_CP048344_4 | 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder | 1050729-1050759 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
NZ_CP048344_4 | 4.2|1050790|31|NZ_CP048344|CRISPRCasFinder | 1050790-1050820 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
NZ_CP048344_4 | 4.2|1050790|31|NZ_CP048344|CRISPRCasFinder | 1050790-1050820 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
NZ_CP048344_4 | 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder | 1050912-1050942 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405905 | 9 | 0.71 |
NZ_CP048344_4 | 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder | 1050912-1050942 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248363-2248393 | 9 | 0.71 |
NZ_CP048344_4 | 4.8|1051156|31|NZ_CP048344|CRISPRCasFinder | 1051156-1051186 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
NZ_CP048344_4 | 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT | 1050912-1050943 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
NZ_CP048344_4 | 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT | 1051095-1051126 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
NZ_CP048344_4 | 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT | 1051095-1051126 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
NZ_CP048344_4 | 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT | 1051095-1051126 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
NZ_CP048344_4 | 4.17|1051156|32|NZ_CP048344|PILER-CR,CRT | 1051156-1051187 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
NZ_CP048344_3 | 3.1|1027827|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT | 1027827-1027858 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
NZ_CP048344_4 | 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT | 1050729-1050760 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
NZ_CP048344_4 | 4.11|1050790|32|NZ_CP048344|PILER-CR,CRT | 1050790-1050821 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
NZ_CP048344_4 | 4.11|1050790|32|NZ_CP048344|PILER-CR,CRT | 1050790-1050821 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
NZ_CP048344_4 | 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT | 1050912-1050943 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
1. spacer 7.1|2896038|40|NZ_CP048344|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 6.1|2174917|38|NZ_CP048344|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
3. spacer 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
4. spacer 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
5. spacer 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
6. spacer 9.1|3690167|48|NZ_CP048344|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
7. spacer 1.1|611278|42|NZ_CP048344|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
8. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
9. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
10. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
11. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
12. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
13. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
14. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
15. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
16. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
17. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
18. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
19. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 3.7|1028193|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
26. spacer 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
27. spacer 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
28. spacer 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer agcgtcaccgacgcgcagggccgctaccaac Protospacer **************** * *******.
29. spacer 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
30. spacer 1.1|611278|42|NZ_CP048344|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
31. spacer 3.6|1028132|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
32. spacer 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttca Protospacer ***************** ***** *. ..
33. spacer 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
34. spacer 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
35. spacer 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
36. spacer 4.7|1051095|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
37. spacer 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
38. spacer 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
39. spacer 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
40. spacer 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
41. spacer 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
42. spacer 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
43. spacer 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
44. spacer 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
45. spacer 4.17|1051156|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
46. spacer 1.1|611278|42|NZ_CP048344|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
47. spacer 4.1|1050729|31|NZ_CP048344|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
48. spacer 4.2|1050790|31|NZ_CP048344|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
49. spacer 4.2|1050790|31|NZ_CP048344|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
50. spacer 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttca Protospacer ******.********** ***** *. ..
51. spacer 4.4|1050912|31|NZ_CP048344|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacatcaccgacgcccagtggcgcgacgtcc Protospacer *.********** ********* ** .
52. spacer 4.8|1051156|31|NZ_CP048344|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
53. spacer 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
54. spacer 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
55. spacer 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
56. spacer 4.16|1051095|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
57. spacer 4.17|1051156|32|NZ_CP048344|PILER-CR,CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
58. spacer 3.1|1027827|32|NZ_CP048344|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
59. spacer 4.10|1050729|32|NZ_CP048344|PILER-CR,CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
60. spacer 4.11|1050790|32|NZ_CP048344|PILER-CR,CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
61. spacer 4.11|1050790|32|NZ_CP048344|PILER-CR,CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
62. spacer 4.13|1050912|32|NZ_CP048344|PILER-CR,CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1058702 : 1071885
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP048344|1058702:1071885|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGCCTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTCCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCACAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTCATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP048344|1058702:1071885|1066639_1067548_-|WP_000847985.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >NZ_CP048344|1058702:1071885|1061425_1062418_+|WP_000081550.1|DBSCAN-SWA MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >NZ_CP048344|1058702:1071885|1065380_1066643_-|WP_000590392.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >NZ_CP048344|1058702:1071885|1064745_1065384_-|WP_001278994.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >NZ_CP048344|1058702:1071885|1062511_1063876_-|WP_000104456.1|DBSCAN-SWA MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >NZ_CP048344|1058702:1071885|1067713_1068511_+|WP_001272549.1|DBSCAN-SWA MSAGQRKEQKLIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >NZ_CP048344|1058702:1071885|1063964_1064741_-|WP_001136918.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >NZ_CP048344|1058702:1071885|1059457_1060084_+|WP_000254708.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >NZ_CP048344|1058702:1071885|1060223_1061363_+|WP_001272592.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR >NZ_CP048344|1058702:1071885|1058702_1059464_+|WP_001374723.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >NZ_CP048344|1058702:1071885|1068561_1069218_-|WP_001141340.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >NZ_CP048344|1058702:1071885|1069323_1071885_-|WP_001272924.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV |
12 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1434291 : 1445320
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP048344|1434291:1445320|DBSCAN-SWA GTTAACCCAGTAGCCAGAGTGCTCCATGTTGCAGCACAGCCACTCCGTGGGAGGCATAAAGCGACAGTTCCCGTTCTTCTGGCTGCGGATAGATTCGACTACTCATCACCGCTTCCCCGTCGTTAATAAATACTTCCACGGATGATGTATCGATAAATATCCTTAGGGCGAGCGTGTCACGCTGCGGGAGGGGAATACTACGGTAGCCGTCTAAATTCTCGTGTGGGTAATACCGCCACAAAACAAGTCGCTCAGATTGGTTATCAATATACAGCCGCATTCCAGTGCCGAGCTGTAATCCGTAATGTTCGGCATCACTGTTCTTCAGCGCCCACTGCAACTGAATCTCAACTGCTTGCGCGTTTTCCTGCAAAACATATTTATTGCTGATTGTGCGGGGAGAGACAGATTGATGCTGCTGGCGTAACGACTCAGCTTCGTGTACCGGGCGTTGTAGAAGTTTGCCATTGCTCTCTGATAGCTCGCGCGCCAGCGTCATGCAGCCTGCCCATCCTTCACGTTTTGAGGGCATTGGCGATTCCCACATATCCATCCAGCCGATAACAATACGCCGACCATCCTTCGCTAAAAAGCTTTGTGGTGCATAAAAGTCATGCCCGTTATCAAGTTCAGTAAAATGCCCGGATTGTGCAAAAAGTCGTCCTGGCGACCACATTCCGGGTATTACGCCACTTTGAAAGCGATTTCGGTAACTGTATCCCTCGGCATTCATTCCCTGCGGGGAAAACATCAGATAATGCTGATCGCCAAGGCTGAAAAAGTCCGGACATTCCCACATATAGCTTTCACCCGCATCAGCGTGGGCCAGTACGCGATCGAAGGTCCATTCACGCAACGAACTGCCGCGATAAAGCAGGATCTGCCCCGTGTTGCCTGGATCTTTCGCCCCGACTACCATCCACCATGTGTCGGCTTCACGCCACACTTTAGGATCGCGGAAGTGCATGATTCCTTCTGGTGGAGTGAGGATCACACCCTGTTTCTCGAAATGAATACCATCCCGACTGGTAGCCAGACATTGTACTTCGCGAATTGCATCGTCATTACCTGCACCATCGAGCCAGACGTGTCCGGTGTAGATAAGTGAGAGGACACCATTGTCATCGACAGCACTACCTGAAAAACACCCGTCTTTGTCATTATCGTCTCCTGGCGCTAGCGCAATAGGCTCATGCTGCCAGTGGATCATATCGTCGCTGGTGGCATGTCCCCAGTGCATTGGTCCCCAGTGTTCGCTCATCGGGTGATGTTGATAAAACGCGTGATAACGATCGTTAAACCAGATCAGGCCGTTTGGATCGTTCATCCATCCGGCAGGTGGCGCGAGGTGAAAATGGGGATAGAAAGTGTTACCCCGGTGCTCATGAAGTTTTGCTAGTGCGTTTTGCGCCGCATGCAATCGAGATTGCGTCATTTTAATCATCCTGGTTAAGCAAATTTGGTGAATTGTTAACGTTAACTTTTATAAAAATATAGTCCCTTACTTTCATAAATGCGATGAATATCACAAATGTTAACGTTAACTATGACGTTTTGTGATCGAATATGCATGTTTTAGTAAATCCATGACGATTTTGCGAGAAAGAGGTTTATCACTATGCGTAACTCAGATGAATTTAAGGGAAAAAAATGTCAGCCAAAGTATGGGTTTTAGGGGATGCGGTCGTAGATCTCTTGCCAGAATCAGACGGGCGCCTACTGCCTTGTCCTGGCGGCGCGCCAGCTAACGTTGCGGTGGGAATCGCCAGATTAGGCGGAATAAGTGGATTTATAGGTCGGGTCGGTGATGATCCTTTTGGGGCGTTAATGCAAAGAATGCTGCTAACTGAGGGAGTCGATATCACGTATCTGAAGCAAGATGAATGGCACCGGACATCCACGGTGCTTGTCGATCTGAACGATCAAGGGGAACGTTCATTTACGTTTATGGTCCGCCCCAGTGCCGATCTTTTTTTAGAGACGACAGACTTGCCCTGCTGGCGACATGGCGAATGGTTACATCTCTGTTCAATTGCGTTGTCTGCCGAGCCTTCGCGTACCAGCGCATTTACTGCGATGACGGCGATCCGGCATGCCGGAGGTTTTGTCAGCTTCGATCCTAATATTCGTGAAGATCTATGGCAAGACGAGCATTTGCTCCGCTTGTGTTTGCGGCAGGCGCTACAACTGGCGGATGTCGTCAAGCTCTCGGAAGAAGAATGGCGACTTATCAGTGGAAAAACACAGAACGATCAGGATATATGCGCCCTGGCAAAAGAGTATGAGATCGCCATGCTGTTGGTGACTAAAGGTGCAGAAGGGGTGGTGGTCTGTTATCGAGGACAAGTTCACCATTTTGCTGGAATGTCTGTGAATTGTGTCGATAGCACGGGGGCGGGAGATGCGTTCGTTGCCGGGTTACTCACAGGTCTGTCCTCTACGGGATTATCTACAGATGAGAGAGAAATGCGACGAATTATCGATCTCGCTCAACGTTGCGGAGCGCTTGCAGTAACGGCGAAAGGGGCAATGACAGCGCTGCCATGTCGACAAGAACTGGAATAGTGAGAAGTAAACGGCGAAGTCGCTCTTATCTCTAAATAGGACGTGAATTTTTTAACGACAGGTAGGTAATTATGGCACTGAATATTCCATTCAGAAATGCGTACTATCGTTTTGCATCCAGTTACTCATTTCTCTTTTTTATTTCCTGGTCGCTGTGGTGGTCGTTATACGCTATTTGGCTGAAAGGACATCTAGGGTTGACAGGGACGGAATTAGGTACACTTTATTCGGTCAACCAGTTTACCAGCATTCTATTTATGATGTTCTACGGCATCGTTCAGGATAAACTCGGTCTGAAGAAACCGCTCATCTGGTGTATGAGTTTCATCCTGGTCTTGACCGGACCGTTTATGATTTACGTTTATTACCCGTTACTGCAAAGCAATTTTTCTGTAGGTCTAATTCTGGGGGCGCTCTTTTTTGGCCTGGGGTATCTGGCGGGATGCGGTTTGCTTGACAGCTTCACCGAAAAAATGGCGCGAAATTTTCATTTCGAATATGGAACAGCGCGCGCCTGGGGATCTTTTGGCTATGCTATTGGCGCGTTCTTTGCCGGCATATTTTTTAGTATCAGTCCCCATATCAACTTCTGGTTGGTCTCGCTATTTGGCGCTGTATTTATGATGATCAACATGTGTTTTAAAGATAAGGATCACCAGTGCGTAGCGGCGGATGCGGGAGGGGTAAAAAAAGAGGATTTTATCGCAGTTTTCAAGGATCGAAACTTCTGGGTTTTCGTCATATTTATTGTGGGGACGTGGTCTTTCTATAACATTTTTGATCAACAACTTTTTCCTGTCTTTTATGCAGGTTTATTCGAATCACACGATGTAGGAACGCGCCTGTATGGTTATCTCAACTCATTCCAGGTGGTACTCGAAGCGCTATGCATGGCGATTATTCCTTTCTTTGTGAATCGGGTAGGGCCAAAAAATGCATTACTTATCGGTGTTGTGATTATGGCGTTGCGTATCCTTTCCTGCGCGCTGTTCGTTAACCCCTGGATTATTTCATTAGTGAAGCTGTTACATGCTATTGAGGTTCCACTTTGTGTCATATCCGTCTTCAAATACAGCGTGGCAAATTTTGATAAGCGCCTGTCGTCGACGATCTTTCTGATTGGTTTTCAAATTGCCAGTTCGCTTGGGATTGTGCTGCTTTCAACGCCGACTGGGATACTCTTTGACCACGCAGGCTACCAGACAGTTTTCTTCGCAATTTCGGGTATTGTCTGCCTGATGTTGCTATTTGGCATTTTCTTCTTGAGTAAAAAACGCGAGCAAATAGTTATGGAAACGCCTGTACCTTCAGCAATATAGACGTAAACTTTTTCCGGTTGTTGTCGATAGCTCTATATCCCTCAACCGGAAAATAATAATAGTAAAATGCTTAGCCCTGCTAATAATCGCCTAATCCAAACGCCTCATTCATGTTCTGGTACAGTCGCTCAAATGTACTTCAGATGCGCGGTTCGCTGATTTCCAGGACATTGTCGTCATTCAGTGACCTGTCCCGTGTATCACGGTCCTGCGAATTCATCAAGGAATGCATTGCGGAGTGAAGTATCGAGTCACGCCATATTTCGCTATCAGGATTCTGTGTGATGGTTACATCGCCCGGCCCAGGGCTGTTTAGTCATCAGCGCTTTCTGACAGTGCTGAGATTTCAACCTGTTGCAGTAAAAATGAGTAGATATAAGGCAAGTGTGCTGCCAAACCCATCTTTTACGGGGTGAAGGTAGATTTCGTTTGAAGGGTATCTGGTGTCCCCTGCAGACATCTACTTGACGAGGCAGGGGATTGATTGGAATGGTGTTTTTTAGATGTGAGAAATATTTTACCCGCTATTTTACCCATTGGCGCGGCTTAAGAGCTTATTTTTGAATTCACAATGGTCACGATATAACCATCTTGCTCGCCCGTGGATAACTTTGGCTTTAGGCAGGTCGCCGGACTTAATCCGGTCGTAGATGAAGGTTTTACCGAAGCCAGTATCAGCCATGATGAATTTCAAATCAACCAGTGAATCAGGTTGTAGTTCGTGTTGCATGAGTGCTATCTCCGAATAGGGAATCGAACCTGCAAATCAGGAAATAAAAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATGTTGGTTACGTCTTATTCGATACGCACTCCTGGTATTTCGCCTTTTGATATTGCTAAGTCATAAATTTGCGCGGCACTATATCCATCCCTCATCCATGAATCTAAGGCGCGAACAGCCTCGCTACGCTTTTTATCTTCTCTCTCATTTTTGATATCAACGAGGACATCAACGCAATTAAGGCAAATGTGGATTTTGTCCTTACATTCGATCATGGCGGCTTTACCATGATTTCCGCCACACAGTGAGCATAAATCTTCAGGGTCTGTCTGGTATTTCTGTAACGTTAGATGGTTGAATGTTGAACAGGCCATAATCATCTCCATAAAACAAAACCCGCCGTAGCGAGTTCAGATAAAAGAAATCCCCGCGAGTGCGAGGATTGTTATTCACCTTTAACGGCAAGTTGCAGGTTAGCCACGGTTAACCTCCTGCGGCGGTTCTGGTAGCGGCATCCAGTGTGACGGTTTCCACGACGCCCCAGGAATGATCCACCCATCATTAGCGTCAGGATGCCCCGGGATGTAAGTCGCCCATTTCATTCGCCAGTCACCTTTCCTATCAAACTCCTTGGCAACAAGAACGGCTGTTTTGGTATCCGGCATTCGCTCACTACAGCTTATCCAACCATCCGGATTTACCGGATAGTTGCGCATTGCGACCTTTAACGCCTCATAGAACCAACCCTTCAGATTGTTGAACTGATGACCATTTAGAGGGATGCATTCAGTAAGCATGTTGTGTAATCTCCATGCCGCTTCGTTTACTTCGCTGGATGGCAGGGGAGGCAACTTGTAAGTCTGTTTTACAGGTTCTGCACCATGAAGCATGGCGGCGCGGAGTTTCTGTATCTCCCGTGCCATTATTGCGTCCTGACGAGGAGTGAACATATCACCTGCAATAATTCTGTTTAGTTCCTCGTCAGTGAACTCATAATCATCAGGAACATTGTGAACCTCAGCATAAAGAGGCTTTTTTGTAGAAATGGTCATCGTTAAACCCCCTCCGCACTTACCAGTTCGTTTCGCAAAAGATAATCCATCGCCCTATCAGGTAATTTGCAATCAGGTTTTGCTTTTTTCAGTTGGCTGACCAATTGTTTAACCAGCATTGTTAATTCGATCACCTGGTAGCGCGGCAATGGTGAATTATCGGATTTGCCCTGACTGTCATCACTGCATGAATGCCCTTCCAGCCAGGCCAATGCTTGTCGCATGAAATACGCAATATGTTTGCCGTGGTAATCGTCTTCATCGATGTGAAAAGCGATACTGCGGATGTATTCAATAGCGTTTTCAATGGCCTCCGACGCTATTGGTGCAGGTGAGGCAGCATAAACAGGAATAACGTCCGATTGATCTTTATTGCTTTCATCCGTCAAAGCCCAGAATAATTTCCCGGCCGGATGTTTGAAAATATAAGCAACTGGTTTTGCTTCCAGCGATGCCAGCGCGATACGCGCCAGCTCAGCCGCTTCGTCATCTTGAACGCAATACCATGCGTTTCCAGATGCTAATTGTTCCAGACGTTCTTTGGGAATAGTGCTCATGATGCCTCTCCTTTACCTGCTTCGGTGACGTTTATTCCAGCGTTGTGCAGTGCCTCAAGAACCTGATGCTGCCTGTAAACCATTTCAGTGTGGTAAGGCTCGTCAAAATCGACGCGATGCAACATGCTATAGCATTGCGGAAGCACAACCTCCCGCAACTGCAGCTCAGCAATGCGCTTCTCTGCGGCTTCCAGCTCAACACGCAGTTTCCCTACCGTTAGCGCAATTTCCTCGTTCTCCTGATCGCGGAGTTTGATGTATTGCTGGTTTCTTTCCCGTTCATCCAGCAGCGCCAAGACGGTAGCCGGATTGGCTGCGGCGATGAATTCAGCATTGGCCTGCTGTTCCATTTGGAAATCTTCATCGAAACCGCTTTCTGGATGCGCTCCTTCAATTCTGCAAATGGGAAGATATCCAACAACTTCACGATGAATTAGCGCATCACCAGCATCAAATCTCTCCTCTCCATATTCGAGCGACCACACACCACACGTTGCTTTCTCTGCCTTTTCACGCAGTGCCTGGTAATTAATTTTGCTCACTGGTTGCCTCCTTTGCGAAGCTGGTCGGCGAACAAACGTACACCAGACGCTTCACTGCGTAGAAACTTAACGGCATAATCAAAACCACCTCGTTCTGCGTCGTCTGCTCCGTTGTCGAGGTTATCTGCGTACATCTCTACCCCCTGCGCCCGTACTTCAGCCAGGAAAGCATCGGTGGCTGGGGTTTCCGTGAAGTTGTCCTCCCAACCGTAGTACTCCTGACGACAGAAGTTATTAAATTCCTTCTCCGACTGTTTAAGCGCCGCATTCTCCGCTGCTAGCGCCGCGCACTTGGCCTCCGCTTCAGCAAATTTACGCACCAGATATTCAGCGTTTGTTTCGTTAACCTTTAAATCACATGGGATGCATTTACCTTTCAGAAATCCATCCATCTCAATTAGTGACATTTGTTTCATTTCTTCCCACTCCGCCACATCGCATTCAGATATTTGTTTTGATTAACTGATGGAAAAGAATTTCTCTTAAGCAATTCCTCTATCGATGGCATTGGCTTTACGCGTTGGCGAATAATCATTTCTGCCGGAAAAATGCCGGGATTGTATGCAAGTCCTCTCATTATTTACTCTCCACGAACTGGTCAATAGCCATGCTAAGTGACTCACCTAAAGTCTCGATATGCTGCTGAATATCCTGTAGCGTCTGCGCCTGAGATACACAAAGGAGCTTGCTGATGCTAACGAGACTATCGAAAGCCTCCGTGCTGATGTTTCTGCTGGTCGTAAGCGCCTGCAAGTCGCCGCCACCTGTGCAAAGTCAACGGCCGGAGCCAGCAGCATGGGCGATGGAGAAAGCCCAAGACTTACAGCAGATGCTGAACTCAATTATTACCGTCTCCGAAGTGGAATCGACAAGATAACCGCGCAGGTTAACTACCTGCAGGAGTACATCAGGACGCAATGCCTTCGATGATAGCGATAATTTTACTCATCATCCTTCACATCTGTCTCTGTAGACAGGGTGGTGATCACTTCTGGAGTGAATCCAGATTAAACATCTCATTGCTGATGCTTGAAGTTGAGCATCTGGCGCGCGGTAAGGGGCTGCGTTGAGATAAGAGCCAGTCATTACAAAGCCTATCTACGGGTGGGCTTGATAATGAAACCGTGATTTACATCCCTCACAATCCAGGTATGTAAAGCTGGATTATGCGAGAACGGATTTAACTAAATCTGTGCACCACCAGTTACGGCAGTACCACGAAGCAACCCAAGCCAGTAAGTGGGGAAATAACACTGGCAGCCACTGAAAGATGAACCTCCAGCCTTATGGCAAAAAAGATTCTTTGTGGTGGCGGACTGATGGAAAGACATCGGTTATTGCAGAGGCCATTCAATGAGTGGTCTCGACAATGGCTTATACCCTACACGGGATAACTTAACTGATATCCCTTTTAACGGATAAACGGAGCCAACAATGGCAGAGATTATTCCCATGACTGAAGAACAGAAATTCCAGTTAGAGATTTACAAACTGGTCATGAACCAGAACGCAGCCGCAGAAGAAGCATTTCAATTCATTGGCACTGACGAACTGAAGCTTGAGCTATTCAAAATTCACTTCCAGTCAGGTGGCGCTAATTCGGATATCACGATCCGCACATTTGAAGCGGTGCGTAAATCGAAGGAAGCGTTAGACCTGTTCACTACCGGAGCATAAACATGGCAACTCAAGGTTTCGACAACCCATCCAAATTCCGCGATGAATGGGATAAGCAAGCAGAAGGGAAATAATCAATATGGCGACTGAGAAAAAGAATGTCGGTCGCCCTTCGGATTACCTACCGGAGGTGGCTGATGATATCTGTGCGCTGCTTGCCTCCGGGGAACCAACTCCGCAAGTTCTCAGTAACGCCCAACGCTCATATGATGCCTTGATGACCGACACTCTGGTTGTTCCTTCAATGCGACGACGTGGAGATTTTCCTGTAGGACAGGGTAATAAATATGACGTGTTTACATCTGACTGATATTATCCAGGCGATCTCCCTCTGATTGATGGCGATATCCCAAACGCATAGGTGAATAAATGCCGATTCAGCAACTTCCGCTTATGAAAGGTGTCGGCAAAGACTTCCGAAACGCCGACTATATCGACTAGCCAAGCAGATGAAAAAAATGGTAGATAGCGATATATACACACCAACCTCCTTAGTTTTGAGCATGACGAAATTAACCAGCCATCAGGCTGGTATTAGTCACATTCTATTTCTTTTGTCGAGGTCCATACATAAGGGTCTGAGCAAACAACCTCTCCATTTATCATGACGTCATAGCCCATTAGATATGAGTTACCACCAACAATCTGAGCGGCTATGACGCTAATGCTGGCGGCGCAGGCTGCACCAAAAACGAAACCAATTAATATATTTTTCATTTTAATTCCAAATAGTTATAGGATGAAAGTTATGAGCATAGAAGAACGCCTGAACAACATTGAGTTGAACCAAACCCTGCTTGACCAGCGACTTTCAGATCTTGAACTTAAAGATCTGGATGCGCAAATATCGGAAGCAGAAGCCAAGCTCTCCAGCCTAAACCACCGCAAGAAGCAAATCCGCAACAGAATTACTCAGGGACGCGGAAGCTGTTGAGGTGGGATGCTAGGTCTCTATCGTTAAAATCAAGGCTGCTAATCATTTCATTGTAAATAGCGTTTTTATCTTCCATTGGCAGTCTTGAGTAAACCAGACACAGAGCATATTTCAGGGAGTTTAGCTCTTTCTCTAGCTCTTCCTTGCTTGATGTTTTTGACTTAATAAACTGTTTTTTATTCATTTTGCATCCTTACCATACATGGTTTTCAGTGTTTCAATCAGCGCATCCCTGAATTTGTCAGCCTCTTTCTGAGCAAATTCATTGCTATTAAGCGATCTACCACAAACAGCATCTTCGATAATCTGTATTATTTCAGCATTCATGGAACGCTTGTTATGTTGCGCCCTGGCTTTAACCTTTGCCTTTAACTCTTTGGAAATCCTGATATTTATTTGCGGCTCTTCGCGTGACATACCACCTCCATAGCATTTTGGTGATATTACTATTGCATCACTGCGATCACAATGGTATAACGGTTATACCAAATTGATTGGAGGTAATATGATAGTCAAGTCAGACGCACCAAAGTACCCTTTGCGCATCCCATTAGAGGTTAAGTTAGCAATCGAGAAGTCAGCGAAAGAAAATGGTCGCTCAATAAATACCGAGATGGTAATGCGGTTAGTGGATAGTTTAAGGCGGGATAGTTCTAAAGGTAATCTAGCAAAAAGTTGAAGCCCCAACTGCGGTAACAGTCAGGGCTTCGTTATCAACAAATCGGCTTAGGAAATATTGACATGAAAAGTATAGCAAAGGCACAAAACGATTTCACCATCTTCAAATTCGGCGACAGTGAAATCCGCGTCATCAACAAGTGCGGTGAGCCGTGGTTTGTAGCTAAAGATGTTTGTGATGCTTTAGCTTTGACTAACTCACGCAAGGCGCTTACTGCACTTGATGACGATGAAAAGGGAGTAACTTTAAGTTACACCCTTGGTGGTGGGCAGAATCTAAGCATTGTTAGCGAATCAGGTATGTATACATTGGTTCTGCGCTGCCGCGATGCTGTCAATAAAGGTTCAGTCCCGCACAAATTCCGCAAGTGGGTAACAGCAGAAGTTCTGCCTTCAATTCGCAAACATGGCGAGTATGTGAAAGGCAAGAAAACCACTGTTGAGGAAAGAACACCGCTACGCGATGCAGTAAACATGCTGGTAGGAAAGAAAGGACTTCGCTATGACGATGCATACAATATGGTTCATCAGCGTTTTGGTATTGACAGCATTGATGAACTTTCAATTGAACAAATCCCGCAAGCCGTAGAGTACATCCGCAGGGTAGTGCTTGAAGGTGAGTTCATTGGCAAACAAGAGAAGAAAACAAACGAGCTTTCTGCAAAAGAAGCAAACAGCCTTGTATGGTTATGGGATTATGCCAACCGCTCACAGGCATTATTCCGCGAACTGTATCCGGCGCTAAAACAAATTCAATCGAACTATTCCGGCAGATGCTACGACTACGGTCATGAATTCTCGTATGTTATCGGAATGGCGAGAGACGTTTTAATAAACCACACACGAGATGTTGATATTAATGAGCCAGGCGGACCGACGAATCTTTCCGCATGGATGAGACTTAAGAATAAAGAATTACCTCCTTCAGTACATAACTACTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP048344|1434291:1445320|1441749_1442019_+|WP_162676756.1|lysis|DBSCAN-SWA MRLRYTKELADANETIESLRADVSAGRKRLQVAATCAKSTAGASSMGDGESPRLTADAELNYYRLRSGIDKITAQVNYLQEYIRTQCLR >NZ_CP048344|1434291:1445320|1439411_1439654_-|WP_106504540.1|DBSCAN-SWA MRNYPVNPDGWISCSERMPDTKTAVLVAKEFDRKGDWRMKWATYIPGHPDANDGWIIPGASWKPSHWMPLPEPPQEVNRG >NZ_CP048344|1434291:1445320|1438703_1438904_-|WP_001163428.1|DBSCAN-SWA MQHELQPDSLVDLKFIMADTGFGKTFIYDRIKSGDLPKAKVIHGRARWLYRDHCEFKNKLLSRANG >NZ_CP048344|1434291:1445320|1443674_1443884_-|WP_001036007.1|DBSCAN-SWA MNKKQFIKSKTSSKEELEKELNSLKYALCLVYSRLPMEDKNAIYNEMISSLDFNDRDLASHLNSFRVPE >NZ_CP048344|1434291:1445320|1439992_1440547_-|WP_060621064.1|DBSCAN-SWA MSTIPKERLEQLASGNAWYCVQDDEAAELARIALASLEAKPVAYIFKHPAGKLFWALTDESNKDQSDVIPVYAASPAPIASEAIENAIEYIRSIAFHIDEDDYHGKHIAYFMRQALAWLEGHSCSDDSQGKSDNSPLPRYQVIELTMLVKQLVSQLKKAKPDCKLPDRAMDYLLRNELVSAEGV >NZ_CP048344|1434291:1445320|1442845_1443076_+|WP_162676750.1|DBSCAN-SWA MATEKKNVGRPSDYLPEVADDICALLASGEPTPQVLSNAQRSYDALMTDTLVVPSMRRRGDFPVGQGNKYDVFTSD >NZ_CP048344|1434291:1445320|1435940_1436855_+|WP_001274871.1|DBSCAN-SWA MSAKVWVLGDAVVDLLPESDGRLLPCPGGAPANVAVGIARLGGISGFIGRVGDDPFGALMQRMLLTEGVDITYLKQDEWHRTSTVLVDLNDQGERSFTFMVRPSADLFLETTDLPCWRHGEWLHLCSIALSAEPSRTSAFTAMTAIRHAGGFVSFDPNIREDLWQDEHLLRLCLRQALQLADVVKLSEEEWRLISGKTQNDQDICALAKEYEIAMLLVTKGAEGVVVCYRGQVHHFAGMSVNCVDSTGAGDAFVAGLLTGLSSTGLSTDEREMRRIIDLAQRCGALAVTAKGAMTALPCRQELE >NZ_CP048344|1434291:1445320|1444205_1444379_+|WP_001549438.1|DBSCAN-SWA MIVKSDAPKYPLRIPLEVKLAIEKSAKENGRSINTEMVMRLVDSLRRDSSKGNLAKS >NZ_CP048344|1434291:1445320|1442006_1442159_+|WP_032181221.1|DBSCAN-SWA MPSMIAIILLIILHICLCRQGGDHFWSESRLNISLLMLEVEHLARGKGLR >NZ_CP048344|1434291:1445320|1443880_1444117_-|WP_001549440.1|DBSCAN-SWA MSREEPQINIRISKELKAKVKARAQHNKRSMNAEIIQIIEDAVCGRSLNSNEFAQKEADKFRDALIETLKTMYGKDAK >NZ_CP048344|1434291:1445320|1441501_1441666_-|WP_122641387.1|DBSCAN-SWA MRGLAYNPGIFPAEMIIRQRVKPMPSIEELLKRNSFPSVNQNKYLNAMWRSGKK >NZ_CP048344|1434291:1445320|1439012_1439312_-|WP_122633558.1|DBSCAN-SWA MACSTFNHLTLQKYQTDPEDLCSLCGGNHGKAAMIECKDKIHICLNCVDVLVDIKNEREDKKRSEAVRALDSWMRDGYSAAQIYDLAISKGEIPGVRIE >NZ_CP048344|1434291:1445320|1443514_1443700_+|WP_122641402.1|DBSCAN-SWA MSIEERLNNIELNQTLLDQRLSDLELKDLDAQISEAEAKLSSLNHRKKQIRNRITQGRGSC >NZ_CP048344|1434291:1445320|1442523_1442766_+|WP_000807785.1|DBSCAN-SWA MAEIIPMTEEQKFQLEIYKLVMNQNAAAEEAFQFIGTDELKLELFKIHFQSGGANSDITIRTFEAVRKSKEALDLFTTGA >NZ_CP048344|1434291:1445320|1441085_1441505_-|WP_077896452.1|DBSCAN-SWA MKQMSLIEMDGFLKGKCIPCDLKVNETNAEYLVRKFAEAEAKCAALAAENAALKQSEKEFNNFCRQEYYGWEDNFTETPATDAFLAEVRAQGVEMYADNLDNGADDAERGGFDYAVKFLRSEASGVRLFADQLRKGGNQ >NZ_CP048344|1434291:1445320|1440543_1441089_-|WP_122641386.1|DBSCAN-SWA MSKINYQALREKAEKATCGVWSLEYGEERFDAGDALIHREVVGYLPICRIEGAHPESGFDEDFQMEQQANAEFIAAANPATVLALLDERERNQQYIKLRDQENEEIALTVGKLRVELEAAEKRIAELQLREVVLPQCYSMLHRVDFDEPYHTEMVYRQHQVLEALHNAGINVTEAGKGEAS >NZ_CP048344|1434291:1445320|1444441_1445320_+|WP_122641400.1|DBSCAN-SWA MKSIAKAQNDFTIFKFGDSEIRVINKCGEPWFVAKDVCDALALTNSRKALTALDDDEKGVTLSYTLGGGQNLSIVSESGMYTLVLRCRDAVNKGSVPHKFRKWVTAEVLPSIRKHGEYVKGKKTTVEERTPLRDAVNMLVGKKGLRYDDAYNMVHQRFGIDSIDELSIEQIPQAVEYIRRVVLEGEFIGKQEKKTNELSAKEANSLVWLWDYANRSQALFRELYPALKQIQSNYSGRCYDYGHEFSYVIGMARDVLINHTRDVDINEPGGPTNLSAWMRLKNKELPPSVHNY >NZ_CP048344|1434291:1445320|1434291_1435725_-|WP_000194515.1|DBSCAN-SWA MTQSRLHAAQNALAKLHEHRGNTFYPHFHLAPPAGWMNDPNGLIWFNDRYHAFYQHHPMSEHWGPMHWGHATSDDMIHWQHEPIALAPGDDNDKDGCFSGSAVDDNGVLSLIYTGHVWLDGAGNDDAIREVQCLATSRDGIHFEKQGVILTPPEGIMHFRDPKVWREADTWWMVVGAKDPGNTGQILLYRGSSLREWTFDRVLAHADAGESYMWECPDFFSLGDQHYLMFSPQGMNAEGYSYRNRFQSGVIPGMWSPGRLFAQSGHFTELDNGHDFYAPQSFLAKDGRRIVIGWMDMWESPMPSKREGWAGCMTLARELSESNGKLLQRPVHEAESLRQQHQSVSPRTISNKYVLQENAQAVEIQLQWALKNSDAEHYGLQLGTGMRLYIDNQSERLVLWRYYPHENLDGYRSIPLPQRDTLALRIFIDTSSVEVFINDGEAVMSSRIYPQPEERELSLYASHGVAVLQHGALWLLG >NZ_CP048344|1434291:1445320|1436926_1438174_+|WP_124039053.1|DBSCAN-SWA MALNIPFRNAYYRFASSYSFLFFISWSLWWSLYAIWLKGHLGLTGTELGTLYSVNQFTSILFMMFYGIVQDKLGLKKPLIWCMSFILVLTGPFMIYVYYPLLQSNFSVGLILGALFFGLGYLAGCGLLDSFTEKMARNFHFEYGTARAWGSFGYAIGAFFAGIFFSISPHINFWLVSLFGAVFMMINMCFKDKDHQCVAADAGGVKKEDFIAVFKDRNFWVFVIFIVGTWSFYNIFDQQLFPVFYAGLFESHDVGTRLYGYLNSFQVVLEALCMAIIPFFVNRVGPKNALLIGVVIMALRILSCALFVNPWIISLVKLLHAIEVPLCVISVFKYSVANFDKRLSSTIFLIGFQIASSLGIVLLSTPTGILFDHAGYQTVFFAISGIVCLMLLFGIFFLSKKREQIVMETPVPSAI |
19 | Enterobacteria_phage(25.0%) | lysis | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1698304 : 1707746
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP048344|1698304:1707746|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTAGTGGAACATGACAGCGGCGTGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTGCAAAAATGGTCGCGGGCGCGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGGTGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAATAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCATTGAAGTCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAATGCGCTTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGAATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCGATGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGACGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCATCGCCAGTTGCAGCGTTCCGCCGCCTGCTGCGCGCGACTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCCAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGCGTTACCCTCGTCGTTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTAAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGTACAGAACAACTCAATGGTGATGCCTTTACCGACCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACGCTTCTTCTTTTTCCGTAGGCTGGGCGGTAGTACAAAGTCGTTGATAACTTAACACAAGCATCACGCGATGACGGCACATACCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCTGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGGGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGTGTCATGGCGGCTATTTGTGCGCCGACATTTACCAGCGCCTGGGCCGTATCGTGGTCATAGCAAGGTGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTTGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGGATACTGTTGATCCACTCAGGAGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGATCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP048344|1698304:1707746|1704612_1706613_-|WP_001374182.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENSALIATFSDGVRTQLANGQALKEAQCSCGASGMCRHRVMLVLSYQRLCTTAQPTEKEEAWDPAIWLEELATLPDATRKRAQALVGKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQMRSEHVTSSDDPFANDEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHVRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >NZ_CP048344|1698304:1707746|1703515_1703986_+|WP_001295430.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >NZ_CP048344|1698304:1707746|1704026_1704488_-|WP_001295429.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >NZ_CP048344|1698304:1707746|1701067_1702753_+|WP_001295431.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >NZ_CP048344|1698304:1707746|1702749_1703469_+|WP_000598641.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP048344|1698304:1707746|1706609_1707746_-|WP_001292773.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIAAMTPGELASWLAENLQS >NZ_CP048344|1698304:1707746|1698304_1699231_+|WP_000569361.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPCGTLHFQDLLEEA >NZ_CP048344|1698304:1707746|1699235_1699967_+|WP_000783120.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >NZ_CP048344|1698304:1707746|1699947_1700055_-|WP_001216963.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >NZ_CP048344|1698304:1707746|1700114_1700846_-|WP_001240401.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2265540 : 2283285
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP048344|2265540:2283285|DBSCAN-SWA GATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGACGGCACGCTGTGGAACTGGTAAGGAGGAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGGTGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGACGAGATTAAAGTGCATTAGTTCAGATGCAAATTGACATTTTGTGGCACAGGGTAGAGCTAGCGTGGTTGTCCGCTTTGTGCCAAGAGCGGACTTTGCAAAATGGGGGTTATTTCAATCAAAACGTAACGTCACAATCAGCCGACGCTCTCTCGCCATTTATAATTAGTAACTTTATCATTTTCGCTTATTTTTTTAGATATAGAGCGCGGCTCTCTTCCTAGATACTCAGATATTTCTATCGGGGACAAATCAAAATCAACAAGCATGACTCTAAGTTTTTCCATTTCCTTTAAAGTCCAAGGCTTGCCATAATTTTCATAAAGAGATACCTTATGCTCCCTGATAGTTCGCTTTCTCTGAGTTTCACTTTCCCTCTGAAAAATCTCGCTTTTAAATTGTGAACGGAACTCTTTACAGAAGTTGTTATCAGTTTTTTTGTTATATAGATTGCTAAAAATAAGCTGGGATGCCGGGTCTAAATCAGGTGTGTTAATAATGAAAACTTTGACCTTTTCCATATAGGGATATTCAATTTTACCCAATATGCTTAATTGCTTTATATTAAAAAAACCTCTCAAATCAAAAGATTTAATGAGCTTTGATTGGATTACACTTTCAATCCTGCTTGCATTTAAATAACAGTACTTTGCCATTGGAAGGGCCCATACTACCAGTTGAGGAATATATTTTTCTAATGCAATGCTCTGCCAATCAGTATCAAAGGTTTGGCTTTTTGCTAGCATGTTTTTCGGCAAATCAGAATACATTGTGGTAGATGCCCATATCTTATAATCATTCGCCAAGGCTTGATAGTATTTTGTGTGGTTATGAATTTTATAAGCTGACATGAAGCGATAAACGTCTTCGTCGTGACCAGCGTCGTAAATTGTTCGATTTCCTCTAAGATAACCGTCGTAATGTTCGGTTATCCTTCTACCTACATTACAACTTACCCCAACGTAAACCACACGACTGAAAAGTCCTTTATGGACAATAAGATAAACTCCGCTACAGCCAGATTTCCTGGCCTCTGATAGAGAACCTAAAAATCTCCATTCCATAATTAAATCCATAATTATTGCTACTGTTTTTGTTTATCATTATTTTCGTGAAACTTCAACAATTTTATCCAAAAGCTAAGGGCAAAGACTTATATAATTATACTTGTCATCGTTAGCGATTATATAGAGTAGTGGCGCTGACCTGCTCCCTGGTGATTCACACAGAATGCTGTTAGTAATGTCCGTTCCTCGCTCTCAGCGGACCTTCAGCTCAGTGATATCGTCCGCTCTGTGCAAAGAGCGGACGTTGGTATGCAAGAGCCCTCCAAAAGTTGATGGTTGGTTTGCAGGGGGGCTTAAAGAAACTGCACTTATCAAGTTGAAGTTCTGTATTCAGCGAAATCGTAGCACTCTGACGATAAGTAACTCCGGTACTCCGCTCTTCGATGAAGAACCAGAGTAATCCCCCCGAAAAACCAGCGCATCAAAATTGGATCTTCAGCGGTAGCTTATCGGCTATCGGAAGTACAGGTGTGGATTCGTGGTGAATTGCTTTGATAATAAACGATTAATACGGAAAAACGCATTAATCATTTATTAGCTTTTAGTAAACCACAATTTATTCCGTTTTACATATCATAGTAGTCGATTGGAGAATATAGTTTCTGGGAATGTACTCTTCAAAGTGTTCGTCCTTTTTAAATACATGAACTACATTTGGGAATAATTGATAGTCAACAGGGTGTATAGCGTTTGGATTATGGTACATGTACATGGCTGTACACCATGGTTCTTGATAGTTAGGGTCACTTACATCGGCTGAAAATGGATGTGGGGCTGCATCCTGATCAGTTTTAACACCACTGACGTACACTTTGAATCCACTCGCCTCTACACCTGCAAGAATTCCCATCCGGTTAAACTTAGGTATGGTTGCTTGAGTAGTGAGTAAAACGGCAGAAACATAATTATTTTGTTCTGAGCCAAAAAAGTTCGACTTGATACTTCTATTTTCATCTGTATGTCTTTCAATAGAAATGCCTGACTCAATATCAATCCCGTACAAATAGCTATGCAAGGCTTCGCTTGAGAAGGCCATGGACATTCTTTTTGAATAATCCTGCATTGCTATGACAAATGGTTTGTTCTTTGTATGGTTGAGTTCCCAGTAATGAACTTTCTCTGGCTCAGGGCAATGCCGGACTTTTTTTAATAAACTTCTTGCAAACTTAAAAGGCATGACATTTAGAACATGTTTTCTTAATTCATCCATCTGTTCATCGTTAATGACTTTTCTTTCAAGAGGGGCTTCTGCTTCAGCAATGCTTACAGCCTCTACAGCAATTTCCACTCCAAATTTAGATAGCAGAAAATCTGGTTGATTGTATTCTCTATTCATTTCAAAGTCGAGTTCATAAAATACAGCGTTCAAATATAATTCAAATAACCTTGAATTAAATGCATCACTTTGAAAATCCCTTATAAATATTCCATCAGGATCTTTGAACCAGTATGCAAGTTCCTCAAGAACAATATATGCAGGGAAATGAAGAGGGTCTTCGAGGAGCATTTTTATATAAACATTCCTTTTTTTCGCTGGGACCTTACTCAAGAATAATGAAAAAGGTTTGGTTGATTCATCGCCTTGCATGAATGTACCATTTTGGTGCTGCGCCAGCATCTTTGGTATGTCATCGTTCAAATTATTAAGCAAGACATCCATTGAATCAAATGAAGCCAAGACGTTTATTGCTCTGAATTTTTTATCTAAATCCCGACCTAAGACTATTGCGTTAAAATCTTTATCAATATTGCATATGATTATTGTGGATAACAATGTTATCCCATTCCCCTCATATTTAAACCAGCGTATCTCCTCAGAAAATGTCTTAAGGTAAGGTGAGCGACCGTAAAAATAAATATCAAATTGTTCTTTGCTGATCTCACTGAAGTGTAATCCTGCGTTCATACCAATTCCTTTTCAATGAATAATTGGCCTTTAGGAGTGATTCCCTTTGTCTTTAATTCAGTTCTAACTAGTTCTTTAATCCAATAGCCTAAGCTCATCATGCAGTTGGATCATAAGACAACGCCCTATAGTGCTCGTGATACTATAGGGCATCTGACCACACTGTTAACTGGAGTAACGACTATGGCAGGAATACAGCATAACCAAACTCACCCCAAACTTACATAGCGCTTTCTGGCCGTGAGCATAACAAGGTCCACTCCTCGCTCATAAGGGACAACCATACTCAAATCTCCCACATTGCAGGAGATTTGAGTATGAACACGTCACCGTGGAACAAAGACCGTATCATAGGCCAAAAAAGACCACTTCAGATATCTCATATCTGGGGTATCCGAATCCGACTTGAACTGGAAGGTAAAACTCGCGATTTAGCTCTGTTCAACATGGCCCTGGATAGTAAGCTTCGAGGCTGTGATCTGGTCAAACTCAAAGTATCTGATGTTGCATATGGTGGCTCTGTTTCAAGCAGAGCAACGGTGTTGCAACAGAAAACCGGTAGCCCTGTTCAATTTGAGATAACCAAAGGGACAAGAGAAGCTGTTGCTGCATTGATACAGCTTAGCAATTTGCACAGTAAAGACTTCTTGTTTCGGTCTAGGGTCGGAACTAACCAGCACATTTCAACCCGGCAATACAACCGAATCTTTCATGGGGGGGTAGAAAAGCTTGGTCTCGAAGATTCGCTTTACAGCACACATTCCATGAGAAGAACAAAACCTTACCTGATCTACAAGAAAACCAAGAATCTCCGGGTGATCCAACTTCTGTTGGGTCATAAGAAACTGGAAAGCACAGTCCGTTATCTGGGCATTGAAGTCGATGATGCGTTAGAGATTTCTGAATCGATTGAAGTCTAAGGTTGTCAGGGCTGCAACAGCAGCCCTGTGCCATAAGCGGAAGTATTTAACAACTATCAGTGTTGTTCAACAGATAAAGGGGCACTTGATTTTTTCTGTTCTCAGGAAATGATAAAAGCGCGTCGGTTCAAGCCTGCTTAACGGGAGTTTGTTAATCCTGTTGCCGTGACGTTTTGACACCATTATGATGGGGAGACACTTAATGTATGAAGGTTCCGCCACTTATACCTGTCCAACAACTGCCTCGGATGTTTCTTTGTATGAATAAGTGGTAATGAGTAGTGAATCGCTAACAGTCACCCGAACAATCGGTGCCTGCAATTAATTCTATATTCTAAACGAGGGGGAGATTATTACACATGAAATTTAAGGACAAGAACCTTAAGGCTCTCGCGGAATGTATCATAGGAGATAATAAGGCATTTCTGTATCGTTCAAGCAGTCACATCACTGAATTTTTCCAGGACTGCGGCATGGATGTTACTCATGACGGATCCACTCGGTGGAAATGGACGGCCCAGAGGCTTGAAGAACTTCTTTATGAGCCACAGTCAAAGCCACATACTTTGCCGGAAAGGTTTGTTCATGTGCTCAGAACTTTAATGTTAAAAGAAGATGCAATGGATGACGATCCAGGAAGATTAAAGGCGCTTGAAGAACTGAACAAGCCTTTGATGCGGGAAGGCTATGAGGCATTCTATGGTGACGATCGCCTTTTGTATATACGCCATACCGATACCAAAACGGTTTCAGTCAGTAATAACCCTCATCGGCCCTTAACGCCTCACGAAGTAGAATGCAGAAGGTTACTGACCGCGTTTCTTGATACCTGCTCAGAAGATGAGTTAATAGAAGATATTCTCCTTCCTTTATTCCGGCAACTTGGTTTTCACCGGATAACAGCAGTGGGACATAAAGATAAAGCGCTGGAATACGGGAAAGACATCTGGATGAAGTTCACACTGCCAACTCAGCATGTTCTTTATTTCGGCATTCAGGCAAAAAAAGGTAAGTTGGATGCGTCCGGTGCCAGCAAATCTACGAATTCAAACGTGGCAGAAATCTTCAACCAGGTACTGATGATGCTTGGCCATGAAATATTTGACCCAGAAACAAATAGAAAGGTGCTGGTAGATCATGCCTTTATCGTTGCTGGCGGAGAAATTACTAAACAGGCGAGGAACTGGCTGGGCGGGAAACTTGATGCCAGCAAAAGAAGCCAGATAATATTTATGGACCGGGAAGACATTCTTAATTTATATACTGTAAGTAATGTACCTCTGCCAACAGGTGCTCTCATCTCTGATGATGCCGTTAAGAACGATGATATTCCTTTCTAATCAGAAGTACGTCTTTTTCTGAAAGAATACGTGATAGGTAGCCACACCACACCTTTAGTGACCCCTTAATCTGGTAATATAACAGCCCGTATGAATGTCCGCGGCATCGCGGGCTGAAATTTATTAAAAATACTTATTCATCAAGCTGGAGTAGTTTGCCGAGTAACTGTAAACGCCCAACTTAACCGGACCATTCACTTTTAGATTGCTACCAGCAAACCAACTTCCGTTTCTCGCTCAAAGCGGACTAGAAGGTTAGCTTGCGTCGGACTTGGCGTATTTAAAGAAGTGCTGGTGGTAACTGGTTGTTGTGTTCCATTTCTACAAAACAAAATCACAGAAACTATACCCAATAGTTATATTGAATCAATGATGAGACAGCCTCATATTTATCAGAACTGGTGTACGTCCAATACAGGAGGTTGTCGTGCTGGTTCTCAAATATGCGCTAGCTATTGCGGCTGTAATGGCAATTTATTGTCTTGCTATTGTTCTTACGGATCGCCTTTCTGATTGATTTTATATTGGCGAGGTGACGGGAGTTAAGTAGAATTGCTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGTTTAACATTAATCTGAGGCTCAATCTATGAACGGCAAATCTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAATGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCAGGCGGGGGAGAATCCCTCGCCACCTCTGATGTGTCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATTGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGAGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGAAGAGCAGGCTGAATGGATAATTCAGTGTTACCGCAGGCGCGGATACGAGGTTAAGAAAGCTCTTAGTCTCGACTACCGTCACTGGATAATCTCAGTCAGATTGCCTTACTCCGAACGCCCACCGCGTCCGTCCCGTACATTCCAGCAACGCATCTGGAGGTAACGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCTATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCCGGCAGTTAACAGGAAGCGAACATTGATATCCTTCAAGCCCTGAAGGAGGCTGTTCCTGGTACGTTCAGACTGTTGTTGAGCTGGAAATCGCAACGGAGGAAGAAACTTCGTTGCTGGAAGTCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTAATACAACAACTGCACCGGATATTGAATGGCCAGTAGCACCTATAGGGTAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP048344|2265540:2283285|2268375_2268486_+|WP_001360138.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC >NZ_CP048344|2265540:2283285|2272709_2272901_-|WP_001083281.1|lysis|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL >NZ_CP048344|2265540:2283285|2279071_2280052_+|WP_023147794.1|DBSCAN-SWA MKFKDKNLKALAECIIGDNKAFLYRSSSHITEFFQDCGMDVTHDGSTRWKWTAQRLEELLYEPQSKPHTLPERFVHVLRTLMLKEDAMDDDPGRLKALEELNKPLMREGYEAFYGDDRLLYIRHTDTKTVSVSNNPHRPLTPHEVECRRLLTAFLDTCSEDELIEDILLPLFRQLGFHRITAVGHKDKALEYGKDIWMKFTLPTQHVLYFGIQAKKGKLDASGASKSTNSNVAEIFNQVLMMLGHEIFDPETNRKVLVDHAFIVAGGEITKQARNWLGGKLDASKRSQIIFMDREDILNLYTVSNVPLPTGALISDDAVKNDDIPF >NZ_CP048344|2265540:2283285|2265540_2265867_+|WP_000598292.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >NZ_CP048344|2265540:2283285|2266072_2267287_+|WP_001295394.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >NZ_CP048344|2265540:2283285|2273169_2273412_+|WP_072163420.1|DBSCAN-SWA MRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >NZ_CP048344|2265540:2283285|2269820_2270057_-|WP_001296941.1|DBSCAN-SWA MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >NZ_CP048344|2265540:2283285|2274956_2275895_-|WP_124039036.1|DBSCAN-SWA MDLIMEWRFLGSLSEARKSGCSGVYLIVHKGLFSRVVYVGVSCNVGRRITEHYDGYLRGNRTIYDAGHDEDVYRFMSAYKIHNHTKYYQALANDYKIWASTTMYSDLPKNMLAKSQTFDTDWQSIALEKYIPQLVVWALPMAKYCYLNASRIESVIQSKLIKSFDLRGFFNIKQLSILGKIEYPYMEKVKVFIINTPDLDPASQLIFSNLYNKKTDNNFCKEFRSQFKSEIFQRESETQRKRTIREHKVSLYENYGKPWTLKEMEKLRVMLVDFDLSPIEISEYLGREPRSISKKISENDKVTNYKWRESVG >NZ_CP048344|2265540:2283285|2280571_2280679_-|WP_122083109.1|DBSCAN-SWA MLTGAFLYLPLVFMPEADSLKHPQQFYLTPVTSPI >NZ_CP048344|2265540:2283285|2268505_2269786_-|WP_000877001.1|integrase|DBSCAN-SWA MKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >NZ_CP048344|2265540:2283285|2281469_2281748_+|WP_023147795.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR >NZ_CP048344|2265540:2283285|2267298_2268318_+|WP_000836058.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >NZ_CP048344|2265540:2283285|2270144_2272616_-|WP_001372999.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >NZ_CP048344|2265540:2283285|2274398_2274821_+|WP_001373616.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK >NZ_CP048344|2265540:2283285|2280723_2280936_+|WP_001013632.1|DBSCAN-SWA MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIMTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP048344|2265540:2283285|2272897_2273086_-|WP_000854559.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >NZ_CP048344|2265540:2283285|2273392_2274358_+|WP_000054501.1|DBSCAN-SWA MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >NZ_CP048344|2265540:2283285|2281151_2281403_+|WP_000980999.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >NZ_CP048344|2265540:2283285|2283159_2283285_+|WP_072163404.1|tail|DBSCAN-SWA MEIATEEETSLLEVWKKYRVLLNRVNTTTAPDIEWPVAPIG >NZ_CP048344|2265540:2283285|2278109_2278712_+|WP_023147793.1|integrase|DBSCAN-SWA MNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDSKLRGCDLVKLKVSDVAYGGSVSSRATVLQQKTGSPVQFEITKGTREAVAALIQLSNLHSKDFLFRSRVGTNQHISTRQYNRIFHGGVEKLGLEDSLYSTHSMRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESIEV >NZ_CP048344|2265540:2283285|2276442_2277792_-|WP_001678529.1|DBSCAN-SWA MNAGLHFSEISKEQFDIYFYGRSPYLKTFSEEIRWFKYEGNGITLLSTIIICNIDKDFNAIVLGRDLDKKFRAINVLASFDSMDVLLNNLNDDIPKMLAQHQNGTFMQGDESTKPFSLFLSKVPAKKRNVYIKMLLEDPLHFPAYIVLEELAYWFKDPDGIFIRDFQSDAFNSRLFELYLNAVFYELDFEMNREYNQPDFLLSKFGVEIAVEAVSIAEAEAPLERKVINDEQMDELRKHVLNVMPFKFARSLLKKVRHCPEPEKVHYWELNHTKNKPFVIAMQDYSKRMSMAFSSEALHSYLYGIDIESGISIERHTDENRSIKSNFFGSEQNNYVSAVLLTTQATIPKFNRMGILAGVEASGFKVYVSGVKTDQDAAPHPFSADVSDPNYQEPWCTAMYMYHNPNAIHPVDYQLFPNVVHVFKKDEHFEEYIPRNYILQSTTMICKTE |
21 | Enterobacteria_phage(21.43%) | integrase,lysis,tail | attL 2257930:2257943|attR 2284545:2284558 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2689945 : 2696675
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP048344|2689945:2696675|DBSCAN-SWA ATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGTACCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGTACTGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCGGGCGCGGACGGCATCACGGATTTTTCGTGGCCCGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCCGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCCGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCCGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAATACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGGAATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP048344|2689945:2696675|2692079_2693144_+|WP_001678641.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >NZ_CP048344|2689945:2696675|2694168_2695311_+|WP_000741339.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA >NZ_CP048344|2689945:2696675|2695424_2696675_-|WP_000444487.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >NZ_CP048344|2689945:2696675|2693563_2693803_+|WP_000488406.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >NZ_CP048344|2689945:2696675|2693942_2694179_+|WP_000088653.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >NZ_CP048344|2689945:2696675|2689945_2690227_-|WP_162676751.1|DBSCAN-SWA MTGSKMAKVGLREQNRLSGANRNTLIAGGIMANTAEIFNFPVPDVAQKERRVADLDDGYTRIANELLEAVMLVRINTAPASGLSGCHAQNIWL >NZ_CP048344|2689945:2696675|2693297_2693516_+|WP_001678640.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL >NZ_CP048344|2689945:2696675|2690940_2691252_+|WP_162676757.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIAPDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >NZ_CP048344|2689945:2696675|2691924_2692083_+|WP_000149533.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI |
9 | Enterobacteria_phage(37.5%) | integrase | attL 2683879:2683902|attR 2695378:2695401 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2825482 : 2867030
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP048344|2825482:2867030|DBSCAN-SWA GTTACTTACGGTCCGTAAACGGGCTGCCCGGACAGGGAATCGATAACTGCTCTCCCATTTTATCCTCTTCAAGCTGGTGCTTTATGTAATCCTGTATCTTCGCCGTGTTCTTACCCACTGTATCGACGTAGTCCCCCCTGCACCAGAACTCCCTGTTCCTGTATTTGAATTTCAAATCACCAAACTGCTCGTAAAGCATCAGACTGCTTTTCCCTTTCAGATATCCCATAAAGCCGGATACGCTCATTTTGGGCGGGATCTCCACAAGCATATGGATATGATCTGCACAGCATTCAGCTTCCAGAATCCGTACACTTTTCCACTCACACAGCTTTCTCAAAATACTGCCTGTTGCTCTACGCTTCTCTCTGTAGAACACCTGTCTTCGGTATTTTGGCGCAAAAACTATGTGATATTTACAGTTCCATCGGGTGTGCGCTAAGCTCTTTTCGTTCCCCATTTGAACCCCTTTTGATTTCTTGTTTGACTCTTGCAGTTGCCAGACCGCAAGGTGTTTTAACAAATCCGAGGATCTTAGTATGAATATGGAAGAAATTGTGGCCCTTAGTGTAAAGCATAACGTCTCGGATCTACACCTGTGCAGCGCCTGGCCCGCACGATGGCGTATTCGCGGGAGAATGGAAGCTGCGCCGTTTGACGCGCCGGACGTCGAAGAGCTACTGCGGGAGTGGCTGGATGACGATCAGCGGGCAATATTGCTGGAGAATGGTCAGCTGGATTTTGCTGTGTCGCTGGCGGAAAACCAGCGATTGCGCGGCAGTGCGTTCGCACAACGGCAAGGTATTTCTCTGGCGTTACGGCTGTTACCTTCGCACTGCCCGCAGCTCGAACAGCTTGGCGCACCACCGGTATTGCCGGAATTACTCAAGAGCGAGAATGGCCTGATTCTGGTGACGGGGGCGACGGGGAGCGGCAAATCTACCACGCTGGCGGCGATGGTTGGCTATCTCAATCAACATGCCGATGCGCATATTCTGACGCTGGAAGATCCTGTGGAATATCTCTATTCGGGGAGCTACACGCGACAACCAGGAATGCAGCCGTAACTGCAGCAACGACGGGCAAAATGCGCATGGGATTTTCCTTGCTGTATTTTTGTTAAGTGTAGATGACAACAGGAAAAAAAGAGAAAGAAAGGAGGCCCAATATCCTGGGCCTCATCGTCAGTTATTGCAGCTTTTCAAGAATGCGCCAGGCCGCCTCGACACGGACAGGGTTAGGATAGCTTTTGTTTGCCAGCATCACGATGCCAAGGTTTTTTTCTGGAACGAAGGCTACGTAGCTGCCAAATCCACCAGTGGAGCCCGTTTTATGCACCCATGAGGCTTTCACTGCGGGGGCGGGCGGGTTTACCTCAACGGCGGGAAGCGCTGCCAATGCCACTTTGCTGTCGCTGCCGTTGATGATCGAATCAGCTTTCAGCGGCCAGTTCAGCATCTCCCAGCCTAATCCCTGGTACATATCGCCAATACGCCAGTAGCGAGACTGCGCAAGCGCAATGCCCTGCTGGAGCGTTTTCTCCTGAACGTGGCTGGCATCCATGTTGGCCTGAACCCAGCGGGCCATATCAATAACGCTGGATTTCACGCCATAGGCTTCGGCGTCAAGTTGTCCCGGAGAAACGTGTACGGGCTTCCCTTCGCGATAGCCCCAGGCATAATCTTTTTGTTCGTTCTGCGGAACCGTAATCCAGGTATGCGCCAGTTTTAATGGTTGCAGGACGCGTCTGGTCATTGCCTCTTCGTAACTCATTCCTGAGGGTTTCACCGCCAGCGCGCCAAACAGACCAATGCTGGAGTTAGCGTAAAGTCGCTTAGCGCCCGGAGTCCATTGCGGCTGCCAGTTTTGATAAAAATGCAGTAATGCGGCTTTATCCCTAACGTCATCGGGGATCTGCAGCGGTAGGCCGCCTGCCGTATAGGTGGCTAAGTGCAGCAGGCGGATACCCTGCCACTGTTTGCCTGTCAGTTCTGGCCAGTATTTCGTGACCGGATCGCTGAGCTTAATTTCGCCGCGGGCGATAGCATCGCCGCCCAACACGCCGTTAAACGTCTTACTAACCGATCCTAGCTCAAACAGCGTTTGCTGCGTGACTGGGTGGTTATTGGCGATATCGGCTTTACCCCAGGTGAAATAATAGGGTTTTCCCTGGTAGATAACGGCAACGGCCATACCCGGAATAGCCTGCTCCTGCATCAACGGGGTGATGGTGCGATTAACGATATCGGCAATCTGTTGTTCTGTTTTTGCGGCAGCAAATGTGGAGAAAGAGGCTGTCAGCAGCAGAGCGCAGCATAACGATTTTTTCATCATGAAATCAGTTCCGTAATTAAAAGCAAAAAGGTGTCCGGGCCCGTCAGACGCAATCAGTGTGTTTGATTTGCACCGTGTTGACAAACGGTTAAATTTAGCAGCAGATATAAGTTTTTCCTAAATTCCACGTGTGTTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACACATCGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGATAGTTTTCCGACAACTTCATGATTACATATCTTGCGGTTTTGATTATTTTTGCTGCAAGAAATACATACTTCAAACGAAAGGTCTTTATTTGCTGTCTGTATTCTGAAGAGTCCAAGGAATCAAACTTGAACAACAAAAATAGGTTATATGAAAGCATCATCATTTGAAACACGGCTTCATTCGCCCAAAATGACTTTAGCAAGAGATGACCCACCGCCATGTCGTATTTGGCTTCTTTGATATAGTTTTCAGCATTACCACGCTTTTCATAGTATATAACTACTTTTTCAGAAAGCAAGGTAGTATTTGTTACAAAGAAAAAGTAGTCGTATTCGGAACCTTCTAAAAGTGATAATTGTGCTCTTTCTTTTTCTGGTTTCAGTACGCGAGATACGACAAATCTTCTGTCTTTTTCCCATTTAACTAATTTTGTATACAGTTCTGTAGTTTCTCTACCTTCTTCTCCTTTAACGAATACAATTGATGAATTCGTTGCTTGTGAGGTGAGTGTAGAATAACTTTTGGCTTTAATTAAATATTTGCATCCAAGAGATTCTATCGTTTCGATAATTTTTTCATCAAAGTAGCCACTATCCATTCGAAATAAAATTTCTAAATCGTCTGATTTGATGTTAGCAACAATTTCTTTGATCATTTCCGCAGCACCGTTTGCAGTGTAAGTATTGCCACTTCTTACAAATCCGGTAACATATGCTTTTAATTCGTCGCAAAATGCAAATTGGATATTGTAGCATCGGTTTCCCAGTTTCTTAGGATTATATCCTTTTGACGCACCTTCTTGATGACCTTCTACGTTAATTACACTACTATCAATATCAATCGTAATGGATGTCAATTTACTTTTAGTGAGCAGTTTTTTAAAGACTTTAAAATTAATGTCTCTAAACATTTGGGTTGTCTTGAAGTTGAAGTTTCCTAGAAACCGTGACACTGTTTCAGGTTCTTTTACGGAAATATCAAACTCGTTGACGAGGGGATCATTTTGAAGTAGCTTTAGACGTTCTAACTTATCAATGCCAATGAAGTGACCGCAGAGCATGGTCTTTATATGATTCATCTTGATTTTATTTGTTGAGTCATTATCAAATACGAGGTCATTTTCAATAAAATCAAAAATCCCATTGCTTTTTGCATTCTCAAGGAGCAGAAAAAGACCTGCATTTGATGTTAGATTCTTAGCTTTGAAATCAATTTTATTAATCATAATTAGAACCCCTTTTTACTACTTTTCTTACTATTATTTTACCATATATCGAGTCATAAAAGCTGATAATTTAACATATTTTTGAGCACTTTTCTTTCACCCAATGGGTGAAAGCTGAATTTCGAAGGAATGCATATTTATCAAGGCTTTGATTATGCTTTTTGAAGTACTGACGTAGAATCTAGGCAAATCAAAAGGGGTTTTAATAACTGGCTCAAAGCTGAAAGCTTTCCGGAACCCCCAGCCTAGCTGTAATGCCAGTCAGTTAAGCAACTGACTGGCTCTTTTTCGGGGCTGTGGGGTATTTCCAGGGCCTCTCCTTTACCACTCTCGGGAAGGCCCTTTCCCTTCTTGTCGGTAATTTCACAAGTTGTCCCATACTTGCAAGATCGCGCATCAGCTCCGGTATACGTCCCGGTGAAGCGCCCTGCAATGTCATCAGCATTCTCATCACCATTCCACATGATTCTGAGAAACTCAGTTGATTCGGCCAGTAACCTTTCAGATGTTCCGCCATTTTAATCATCTGATATCTCACCAGATTATAAGCCAGTAAGACACCCCACAGCTCTTGCTCCACAAGCTCCGGTTTTTTACTTCTCAGCGTCAGCCTGCTCAGTTGCATCGTCTGTTTTATCTCCCTGTATCCCAGTTCGATTTCCCAGCGATGACTGTACAGATCCGCCATTTCTCCTCCGGGGAAGCGCATGGCGTCCGTCATCGACGTCAGCAGATGGCAGACTTTTCCTTTGCGCGTCACGGTCAGCAGGCGGGCTGTCACTTCATTTCCCAGTCCCGGCCACTTTTTTCGTGCCTGCGGGCTGGTTTTCAGCTTCACCAGATGATCGCCTTTACCCAGTTTTCTGATCTCTTCATATTGCGCTCCCTTTCTGAGAGGTATCATCCAGTGGCGGTGTTCTCCCGCCAGGCTCCAGGCATTTAACAGTCCCAGTGAGTAATAACCTTTATCCATTAACGTCAGAGTGTTATCGCCGGTTTGTTCTATAAGTTGCTCAGCAAGCTCATTTTCGCTGTTCTTCATCGTGCCGAAGGCTGCAGCCGTCAGCAGATGGCTGGTCAGTTCCATCTGGCAGACCATTTTGACCTGCGGGTAGAGCGCCGGGTTCCCGGCATGTGTCTGGCGGGGGAAGGCTGCATCGTTCTCTGGTGTATCCGGTGTGCGCCAGAACACACCATCGATGGCCAGCAGGGTCAGGCCGCACCAGTGCGGATGCGGCGTGGCGTTATGCCAGAGCTGCGCTGTTTTCGTGAACACGCGGCGGACAGCCTCACTTCCCAGGCGCTGGCGGGCCTGAATAACGGCACTGGGGGCAACGAAGGGGCGATTGCCCGGCAGCATGATGTCCAGGCGATTCACAATCTGGTGAAGAGGTTCTTTACGCTCAAGCGCCATGCCAACAATACACCAGACCATCATTTCGAGGGGAAGACGGCGCTTGCGTAGCGTTACAGTACCTGATTCGGCAAGGCAACGAGAGATGAGTTCGGGGTCGAGGTAATCCCCCAGAGAAGTCAGTGGGTTACGCAGAGAATCGTAACGGGATACCAGATCAAGAGCCTGTCCAATGTGCATAAAAAAATCCGGAAACAAGTGAGCGTTTCCGGATTCTTACACAGCCACTGGATCGGTCAACTGATCCTTAACTGATCGGCATTACAGCCTAGCTGGGGGTTTTCTGTGCACAAAAAACCCCTGTAAAAAACTTACAGGGGTATAAGGCTTAGCCTAACTTGCGTCTGTTGCATGGTGCCGGGTGCCTCCCGGTGAATTCAGTCGGTGTCACTGAACCCGCGTAGGCTTCGCTCATAACATAATAAATGCTATGTACACCAGTCGCCCCACCGCACAGGGGGATTCACCACACAGCGCACTTTTTAACAAATATCCCTCCGGCCAGACAATAATAAACATAATGAATTGTGATCTTCTTAACGGTTTTAAAGTGTTACAGATAATATGCCAAGTAATTGCTTGTTTTTTCATCAATAGGAAGACCACACCAACAACCCCAGCTATCAGCCAGAACCGACATTATCCGGCTGATAAAATCTACCATCACTCCAACACCAATCACCGCCTGTGCCAGATCGCGTTTCTCAAACTTTTTAATCTCTGTTGCCACTCTGCGGGTTTTCTTTTTGAATTTTGAAAATACCAAATATCGTGACGTTTCTTTGGGGGATGAGCTATCAAGCGGGAACGATCTGCCTACAGAGAAAGTCAGCCAGACCACTCTGCAAATTATCCCCGAAAGGCTCTGTGGCTGATATGCGCCGGGCATGGCGCAATGGGCCAGTGGTGTCAACGACGGATGAAAAGTGATCCACTTATATCTCCACCAACGGCCCAATATTGATCCACCGTTTTACTCAGGATTAGCTTCTGCTATAACCCCGGCCTTTCGTTTCTGTCTGAGTCGATAGCTTTCTCCTTTGATTTGAACGACATGTGAGTGGTGTAAGATACGGTCCAGCATCGCTGAGGTCAGTGCTGCATCACCGGCGAACGTTTGATCCCACTGCCCGAACGGCAGATTGGATGTCAGGATCATTGCGCTCTTTTCGTAACGTTTAGCGATGACCTGGAAGAACAGCTTTGCTTCTTCCTGACTGAACGGCAGATAGCCTATTTCATCAATGATGAGCAGACGGGGGGCCATTACTCCACGCTGAAGCGTCGTTTTATAACGGCCCTGACGTTGTGCCGTAGATAACTGAAGTAACAGATCTGCTGCTGTTGTGAAGCGAACTTTGATACCTGCACGGACTGCTTCATAGCCCATCGCTATTGCCAGATGGGTTTTCCCCACACCTGATGGCCCCAGTAATACGATATTTTCATTACGTTCTATGAAGCTGAGTGAGCGTAACGACTGGAGTTGCTTCTGCGGTGCTCCGGTGGCGAATGTGAAGTCATACTCTTCGAACGTTTTCACCGCCGGGAAGGCTGCCATTCGGGTATACATCACCTGTTTACGTTGATGACGTGCCAGTTTTTCTTCATGAAGCAGATGCTCCAGGAAGTCCATATAACTCCATTCCTGGTCTACTGCCTGTTGTGACAGCGCAGGCGCTGCGCTTATAAGGCTTTCCAGTTGCAACTGCCCGGCGAGCGCCATCAGTCGTTGATGTTGCAGTTCCATCATCACGCCACTCCTCTGCAGAATGAGTCGTAGATGGAGAGTGGATGATGCAGGGGGTGTTTGTCGAAGTTCACCAGATTTTCATCAAGATGCACGTCATACTCTTTTTTCTCCGGAGGCAGTGCCAGCATGGACTGCTGCTCTTCGAGCCAGCGATCGCAGGGACGTGCCTGGATTGTTTCATGCTTTCGTTGGTTAGCGACATCGTGCAGCCAGCGCAGACCGTGGCGGTTGGCTGTTTCAACATCGACAGTGATCCCCATCGGGCGCAGGCGAGTCATTAGTGGGATGTAAAAACTGTTACGGGTGTACTGCACCATCCGTTCCACCTTACCTTTAGTCTGTGCCCTGAAGGGGCGACACAGTCGGGGAGAGAAGCCCATCTCCTTGCCGAACTGCCACAGCGAAGGATGGAACCGGTGCTGACCGGTCTGATATGCGTCACGTTGCAGAACCACAGTTTTCATATTGTCATACAACACTTAGCGCGGCACACCACCAAAGAAGCGGAACGCATTACGATGGCAGGTCTCCAGCGTGTCATAACGCATATTGTCAGTGAATTCGATGTACAGTATTCCGCTGTATCCGAGAACAGCAACGAACACGTGAAGCGGTGAGCGACCATTACGCATAGTGCCCCAGTCAACCTGCATCTGTCGTCCGGGTTCAGTTTCGAACCGAACGGCAGGCTCCTGCTCCTGAGGAACCGAGAGAGAACGAATGAATGCCCTGAGAATGGTCATTCCGCCACGATATCCCTGGTCTCTGATCTCGCGAGCGATTACCGTTGCCGGGATTTTGTAAGGATGAGCATCGGCGATGTGTTGACGAATATAATCCCGGTATTCATCCAGGAGTGAAGCAACAGCAGGTCGCGGCGTATATTTTGGCGGCTCAGATTTTGCCTGCAAATAACGTTTAACGGTATTGCGGGAGATCCCCAGTTCTCTGGCAATCGCCCGGCTACTCATTCCCTGCTTGTGCAGGATTTTAATTTCCATAACTGTCTCAAAAGTGACCATAAGCTCTCCTGAATCAGGAGAGCAGATTACCCCCTGGATCTGATTTCAGGCGTTGGGTGTGGATCACTATTGCACCGTTCGTGACAACAGAGAAAGTCAGCCAGACCACTCTGCAAATTATCCCCGAAAGGCTCTGTGACTGATATGTGCCGGGCATGGCGCAATGGGCCAGTGGCAGTGTGTGATGGTGGCCCTTACTGGATTTGAACCAGCGACCAAGCGATTATGAGTCCTCTTACCACTGAGCTAAAGGGTTGGAGAACGCAATATCACCTGCCTTATATGATTACACCCAGAATTTCCCGGACTGTCTTGTCAAAACATTCAGTCTCCAATTCCCACCAATAGCAAGACGGTCACTATGACAGCATCTCCGGAATGAGCTCAGGGCACGGGGGTTTACCGAGGTTACTTTTCCAACGAAGTTTCTCAAACGCAGGCGTGATAACATTCAAACTTAGGATCTCAGTTGCTATCTTTTCCCATGTCTTACGGATTAATGTCATCGGTTGCATACAAATCGCTACAAGCTGCATCAAGGCACATGCTTCATATGTACAGTACACCAAGATGGATAGGTGTTTCACCTAATCCTAGCTCATTTCAGGCAGGCTCAACAGGCTGCAGCCCGCATGTTTAAGGGCGCAGGTTATCAGCAGGGTATAAGCTGACCGTAAGCGTACAGCGAGGGCCGTATTGACGGGGATGTGTTATTCAGCTGGCAGTGCTATGCGCCACGGAAGCAGTTCGCTGACCCGGTTGACCGGCCAGTCTGCTATGACGCCAAGCACATGGCGAAGGTAGCTTTCTGGATCCACGTCATTCAGTTTGCACGTCCCGATCAGGCTGTACAGTAGCGCTCCCCGCTCACCACCATGGTCAGAGCCGAAGAACAGGAAGTTTTTACGACCCAGACTGACCGCCCGCAGGGCATTTTCAGCGATGTTGTTGTCGATTTCCACCCAGCCATCGTTCGCATAGTACGTCAGTGCCGACCACTGGTTAAGTGCGTACGCGAACGCCTTCGCCAACTCTGAGTGTCGCGACAGGGTCTTCATCTTTTCACGCAACCAGCTTTCCAGGGATTTCAACAGCGGTTTCGTTTTTCGCTGACGTTCAGCAAGCCGCTGCTCTGCCGGTATTCCCCTTATATCCGCCTCTATGGCGTACAACTGACCGATCTGTTCCAGGGCTTCTTCCGTCAGTGCTGACGGGATGCGGACGTGCACATCGTGGATCTTTCGGCGGGCATGAGCCCAGCAGGCAGCTTCCGTTATCCCACCATTGCGATACAGCTCGTTGAACCCGGCGTACGCATCCGCTTGCAGCACACCGCTGAAGCAAGCAAGATGAGTCTGCGGATGGATGCCTTTTCTGTCCGGGCTGTAAGCGAACCACACTGCAGGTGCCAACGCTGACCCGGCATTGCGGTCATCACGAACATACGCCCACAACCGCCCGGTCTTCGTCTTCTTATTACCCGGCATCAGTACCTGGACCGGGGTATCATCGGCATGGAGTTTGCCGTCAGTCATGACATAGCCATGAAGCGCCTCTTCCAGCGGAGACAGCAGCCGGCAGCATGCATCCACCCAGCCCGACAGCAGTGAACGGCTCAGCTCCACACCTTGCCGGCCGTATATTTCTGACTGGCAATACAGCGGGGTGTGCTCTGCATACTTCGAGGTCAGCACGCGGGCCAGCAGCCCCGGTCCGGCGATACCCCGCTCGATGGGCCGCGAAGGTGCAGGTGCCTGCACGATGGCATCGCACTGAGTACAGGCATGTTTTTCCCGTACCGTCCGGATAACCCGGAATGCGCTACGCATCAACTCCAGCTGTTCGGCGGTATCCTCGCCCAGATAGCTCAGTGAACCGCCGCAGTTCGGGCAGCACGGCGCCGCAGGCAACAGTCGCTTTTCGTCACGGGGTAGTGATTCAGGGAACGGCTTACGGGTGCGGGTCTGACGCAACGGACGCTGTACTGCCGGGTCATACACCCTACCAGTCAGCGTATCGCTCTCTTTCTGAAGCCGGTTCAGATCGGCTTCCATTTGTGCGATACGGCGGGAGACTTTTTCGGAACGACTGCCGAAGTTCATCCGGCGGAGTTTATCCAGCTGCGCCTGCAGATGGTCTATTTCGCGCTCCCGGTTGCTCAGCTTTTCCTGCAGGGCGTGGATCAGCGCTTCCTGTTCGGCCAGGCGCTGTTTTAGCAGGAAGATGTCGTCAGAAGAGATGTCGTTCATAAGCCCGTATTTTACCGGGCTTATTCTGTGACAACCAGGATAAAGAGATTTACAGCATGGTCAGGGAGGTCAGCAGCCGCTTGGGCTGTCGCCAGTCGATACCTTCCAGCAGCATCGCCAGCTGCGCCTGCGTAAGGAACACTTTGCCATCACGGGCTGACGGCCAGGCGAAGCGCCCACGCTCCAGCCGTTTGGTCAGGAGGCACAGTCCGTCACCGGTGGACCACAGCAGTTTAACCTGACTGCCGCTGCGGCCCCGGAAAATGAAAACATGGCCGGACATGGGATCGTCTTTCAGCGCCGTCTGTACTTTCGCAGCCAGGCCGTTGAAGCCATTTCTCATATCGGTGATACCGGCAACCAGCCAAATTTTGGTCCCGGAAGGTAACGGGATCATCGCTTCAGTTCCTGTATCAGCAGAGTCAGGAGCTTTTCGCTGACATTGCCATTGAAGCGGAGCGTCCCGTGCCGGAACGTTACCTCACAGCTGATACTGAGGGTTTCCGGGTCCTCTGCGAGCGATTCTGGCTGTTCGGCAGCTGCATCGAGAGTCACAGGAAGTAGCTGGGGGCTCTCTGAAGAAGGTAATAGCAGCTTTCCCTCGCGCCATTGTTGTCGCCATTTGAACAACAGATTGGCGTTAATGCCATTTTCAAGAGCAAGTTTTGAGATGGATATCCCGGGTTCACAGGAGGCAGCAACGAGCTGCTGTTTAAATTCGGGAGAATAATTAGGGCAGCCTTTTCGCCTGCCGGGAGTCACATTTTTCTGCATATCTGACACTTTGGTTCCCACTACTTATTTGGTGGACACCACTTTGTCTAATTCGTCAGATTCTGACCAGACGGTTCAGGCTGTACGCTTACAGCTGACCGGATACATTCCGGCAAAGAAAAACCTTGCAGCGATGAAAGCCGTTTCGGTAATGCAATGACGATAAAGCTGTCCTGTATATATGTGCTTCGCCTCAAACGCTTGCCGTTTTGGTATTGTGCACATGCCGTCTGAACGATGTGGAGCCAGAAAAATGGATGCGTTATGTCATTGAGCATATCTAGGACTGGCCGGCAAACCGGGTACACGATCTGTTGCTCTAAATAGTTGAGCTGGCCTCTCAGTAAATATCAATACGGTTTTGGTGAGCCGCTTACCACTGAAGCATCACTTCGGCAGGTAGATTTCTGCAGGCAGAATGGCGCTTACCTTAGCGATAAGCGCGTAAATGTGGTTATTCAATACCTGTGGTGACTGTAAAAGTGCGCGTTTGCTGCGGTGCAACCTGAATCAGCGTGCCATTACGTTGCGCGGCAAGATACCCCTCAGGTCGACAGGTTGCTGGTAATGCAAAGGCGGCTACCTGTTGCTCGCCGTTATAAAGGATCCAGCGTGTCACATAATTTAGTTCAGCACTGGAGAAACGAGTAACAAACGTAGTGCCATCGGGAGCGATCATGCGAAACTCTGGCTGATCTGTGTAAGCGTCCAGTTTATCTGCAAAGAAGACAATTTCTGGATCATAAAATTCCGGTTGATTCAGCGTCGACAGAGAGGATTCCCCCTGCATAATCCGTTGATTAAACGCCAGCCACTGAGGGGTGGGATTAACATGCGAAGGTACTGATTCACGCAACCTTAATATTTCGTCCGGTATATTCTGGCTGAACGTAGCATTTGGTATATATGCATAATTCATGTGGCACATATATTGTAGTGGCATATCTACAGAAGCCAGATTGGTTACGGCCATCTTAATATCGAACAGTGTAGAGGATTTGTGAAGGACCACTGTTGGCTGAGCCAGATAATGATGCCCGAACCCCATTACATACTCGTAACGCCCGGTAAGGCGTAACATATCTCCCTCTAATTCCATCCATGCTTCATCCATCGCGGCACAGGCCATTTCACCGTGTAGCAGATGAGTATCTTCCGCAGATGGGCAGCCATTAGCCAGCAAACCTGAATGAAAGGCAAAACAGCCATAGGTCTCTATCACCTCTGTCGCCGGTTTAGGCTGGCGAAACATATTGCACATGGTGAGACTGTGTCCATCAAATTGCGCATCCCAAATCATCTGCCCCATCCAGGGAAGAATAATTAAATGTCCACGACTGTTTGCAATTTTAAGCCCCTCGACACCACTGTCATAGCGAAAAGACGTGACAGTAAAATCACTATTTTCCAGCAAGATACGAGGTTTTTCGCCAAAAAGCGCCCGCCACAAATAAATACGCGTACTCATAACGATTCTCCTCAGGACTCTGTGACTTCAGCCAGTGCATTACGTACTTTGCTTTCACGCCAGAAGTAGACACCGACATAGACAAAACAGAGCATAGAAACCAGGAATGAAAGCTGTAGTGAGTGGAACATATCTGCAATATACCCCTGAATTGCCGGAACCACCGCTGCGCCAACAATAGCCATAACAATGACTGCTCCTGCCATTTCTGTATGTTCGTTATCAACTGTATCCAGTGTTCCTGCATAGATCGTCGCCCAGCAAGGGCCAAACAAAACACTTACCAGGACGGCGACATAGACCGCGCTGAAACTTGGAGCCAGTGCAACATATGCCAGAAACAGCGCCCCTATAACGGAATAGAGAATTAGGACTTTTTCCGGATTAAAACGTGTCATAAGGATGTTGGCTATAAACTTGCCAATAAAGAAGCAGGCAAAGCTGTAGACCATGAAGTTTGAAGCATCACGTTCGTTGATATCGCCCAACTCCAGTGCCAGACGGATGGTAAATGACCATACTGCGACCTGCATACCCACATAAAGGAATTGAGCCACAATACCACGACGAAAGCGCGAATTTTTAGCCAGATAGCGCAGCGTATCCATTGCTGACGGGCGTTTATGGTGACTTGTCTGTGCCACTTTACAGGTTGGGAAGCGGGTTAAAAGGAACAACACCATGACCACAACCAGAATCATAATCATATACTTATACGGTTCAAGGGTGTTCTCTAACATCAGCACCTTAAAGTTGTGAATTTGCTCGGCGTTCATTCCGGACATCTGCTTCTCAAGGCTTTCCCCCTCGGAGAAAACCAGATATTTGCCCAATAAAATACCAGACGCAGCACCAATCGGATAAAAGGTCTGGCTGATATTGAGCCGCAATGTGGCATAGGCTTTTGGACCGATCATTGAACTGTATGTGTTCGCTGCAGTTTCAAGGAAACTCAGGCCAATCGCAATCGCAAAAATAGCTGCAAGGAACATAGTGTAGGTTGCCATATGCGAGGCAGGGAAAAAAAGTGTACAACCACCAATATACAGCGTCAGGCCAATTAAAATTGCCACCTTATAACTGGTCTTTTTAATCACAAGGGATGCTGGTATTGCAATTAAAAAATAACCTCCATAAAATGCGCTCTGCACCAATGCTGAAGCAAAGTTACTTAGCGAAAATACACTTTTGAATTGAGTGATTAATATGTCATTTAATGCAGCTGCGCATCCCCATAGCGGGAATAAACACGATAACAAAATAAACTGGAACAAGGGAGTCTTATTCAGATACCCATCAGGCATCTGAATGATGTTTTTATCGTTCATAGTGCTACCTTTAACTGTGCAGGATGATTATTCGTTTAAGGTTAAAAATTCATTAAATTGTTCAATACTCGGATAAGATGATTGCGTACCTTTCCCTGTGACGCTGAAAGCGGCAAAGAGAGCGGCTTTTTTCAAAGCGGCTTCAACATCACCGCTTTGAACATAATAATGGGAAAAGCAACCAATAAATGCGTCACCAGCGCCACTAGTATCAACAGCATTTACTTTGAATGCAGGAACATGGACTTCCTGATCGCGGGTCATCCATAATGCGCCTTTTTCGCTCATGGTAACAATAATATTGTTCAGCCCTTTATCAACTAACGAACGTGCGGCCAAACGAATATGATCATAAGTATCAACCGACATACCGGTTAATATTTCCAGTTCTGTTTCATTCGGAATAAAGAAATCACATTTGCAGGCATAAGACATATCTAACTCACGCAATGCCGGAGCCGGATTTAATAACACTTCAATACCATTTTTCTTACCAAACTCAATCGCGTGGTAAACTGTTTCCAGTTGAACTTCCAGTTGTAAAACGATCAATTTGCATTTTTTCAGATCTTCTGCAGCTCGATCGATATCTTCCGGGGAAAGAAATTTATTCGCTCCCTTAATTATTAATATACTATTGCTCGAGTTGGCATTAACAAAGATCGGTGCAACACCACTGCTGGTACAGGGGACCTTCTCAACATAAGTGGTATTAATTCCCCATGATTCGAGATTACGAATAGTATTATCCGCAAAAATATCATCACCTACTTTAGTCAGCATCAGGACTTTTGAATTCAATTTAGCCGCCGCCACCGCTTGATTAGCACCTTTCCCACCACATCCGATTTTGAAGGCAGGTGCTTCCAGAGTTTCTCCTTCTTTAGGCATCTGATTAGTGTAAGTAATGAGATCCACCATATTGGAACCAATAACTGCAATGTCCATTTCACTACCTCTTATAAACTTTCGCATAACAATGGTATTTAAATAACATTAGCATGTTACTTTTGCATCATTTGTGACTGAGATCGCGATTAGCACATCAACCCGATGTTTATTTAATAGACTTCCAGTCTCATCACTCAGGCCAACACTATCTAATCATAAGCAACCTAACAAGATTAGTGCCCAAAACTCAGCAGCCTATACCCTTTTCATTTCAAAGGGGCGGTCGTATAGTATGGTAATGAAAACAATGTTTACTAACGCCAAAATGTTATTTTTATAACATTCTTACGGAGAGAGAGTTGATGGAAACGAAGCAAAAAGAGCGTATCCGACGTTTGATGGAACTGCTTAAGAAAACCGACAGAATCCATTTGAAAGACGCAGCGCGAATGCTGGAAGTTTCTGTAATGACTATTCGTCGCGATCTCCATCAGGAAGATGAACCTCTGCCACTGACCCTACTGGGTGGCTATATTGTAATGGTGAATAAACCCGCGCCATCCATGCCAGTAATCCATGACGTTCCAAAAAATCATCGTGATGACTTACCTATTGCAATTCTGGCTGCCGGAATGGTTAATGAAAATGATCTGATCTTCTTTGATAATGGCCAGGAGATACCACTCGTTATAAGCATGATCCCGGATGCAATCACCTTCACCGGTATCTGTTACTCACATCGCGTCTTTGTTGCGTTGAATGAAAAGCCTAATGTAACAGCAATACTTTGTGGTGGTACGTATCGTGCCAGAAGTGATGCTTTTTACGATGCCAGTAACTCTTCGCCATTAGACTCTCTCAATCCGCGAAAAATATTTATTTCCGCCAGCGGTGTGCATAATCACTTTGGCGTCAGCTGGTTTAACCCTGAAGATCTTGCCACTAAGCGTAAAGCGATGAACCGTGGACTACGGAAAATTTTGCTCGCCCGCCACGCGTTGTTCGATGAAGTGGCCTCTGCCAGCCTCGCACCGATCTCTGCATTTGACGTTCTGATTAGCGATCGTCCGTTACCGGCAGATTATGTTACGCACTGCCAGAATGGTTCTGTAAAGATCATTACACCTGATTCAGAAGACGAATGACTTACTGAAAAAACACCACAATCTTGTTAAACATCGTCGGATTGGACTGATTACGTTGCACTTTCACCACATATTCCAGCTTATCTATTTGGCTTATCACCTACTCCAGACGCTGGTCATCCTTGACCAGTAGCCAGATATGGCTTTTGTCGCAGTCCTGAATCGGCAGGCAGAGAATGTCTTCAACGTTAAGAGCGCGACGGGCAAAAGCCCACAAAGGTGAGTCATTACACCCGGATGGTAAACAGGCACTTCCTTGTTGTAGAAGCCGTTATATTGTCCATAACGTCTGGGTTTTTAATGCATACTGGGTAACAACCGGACGAATCTTCGAATATCACTGGCGCTGCCAGCCGGTTTGAATTCATCTCACAACCCTGCATAGAGCGAATCTCCTGCCTGCACGTCACTCCACTCCATGGTATCAACTTCACACTCTTTATCTGCGGCTAGTTTCAGCCACCAGATAAGCATCGACTATGAAAGATGAGCCATGACAACATAACGTTGGTAACGCTCTGACGCCTTAATGGAAGATGCCTGCCACCATAGGGAATGTAAACGACTGAAGTGTGGCCTTTAATGCCGTGAACGGCTCATGGTCTCCTGGCACGGTTGCCGCCCCAACCTGTAACAACATTCCACAGTACAATGTCTGTCAGAGTCAGAGCCTCCCATGCTTGTTGTAGTAACTCTACCAGTGGATTTGCCCCTATATTTCCAGACGCCTGTTATCACTTAACCCATTACTGGCTTGCTGCCGTAGATATTCCCGTGGCGAGCGATAACCCAGTGCACTATGCGGATGCCATTCGTTATAATGCTCGAACGCCTCTGCAAGGTTCTTTGCTGCCGTTAACCCGTCTGGTTTGGGCATGACACTGATGTAGTCACGCTTTATCGTTTTCACAAAGCTCTCTGCTATTCCGTTACTCTCCGGACTCCGCACCGCCGTGCTCTTCGGTTCAAGCCCCAACATCCGGGCAAACTGCCGTGTTTCATTAGCCCGGTAGCATGAACCATTATCCGTCAGCCACTCTACTGGAGACGCCGGAAGCTCGTTGCCGAAGCGGCGTTCCACCGCTCCCAGCATGACGTCCTGTACTGTTTCACTGTTGAAGCCGCCCGTAGTGACCGCCCAGTGCAGTGCCTCACGGTCACAGCAGTCCAGCGCGAACGTGACTCGCAGTTTTTCTCCGTTATCACAGCGGAACTCGAACCCGTCAGAGCACCATCGCTGATTACTTTCTTTCACAGCCACTCTGCCGGTATGTGCCCGTTTCGATGGCGGTACAGCAGGTTTTCGCTCAAGCAACAGCGCATTCTGGCGCATGATCCGGTAAACACGTTTGGCATTGATCGCAGGCATACCATCAAGTTCTGCCTGTCTGCGAAGCAGCGCCCATACCCGACGATAACCATACGTGGGCAGCTCTCCGATAACATGGTGTATACGGAGAAGCACATCCGTATCATCAGTGTGACGACTGCGGCGGCCATCCATCCAGTCATCGGTTCGTCTGAGAATGACGTGCAACTGCGCACGCGACACCCGGAGACAACGGCTGACTAAGCTTACTCCCCATCCCCGGGCAATAAGGGCGCGTGCGCTATCCACTTTTTTGCCCGTCCATATTCAACGGCTTCTTTGAGGAGTTCATTTTCCATCGTTTTCTTGCCGAGCAGGCGCTGGAGTTCTTTAATCTGCTTCATGGCGGCAGCAAGTTCAGAGGCAGGAACAACCTGTTCTCCGGCGGCGACAGCAGTAAGACTTCCTTCCTGGTATTGCTTACGCCAGAGAAATAACTGGCTGGCTGCTACACCATGTTGCCGGGCAACGAGGGAGACCGTCATCCCCGGTTCAAAGCTCTGCTGAACAATTGCGATCTTTTCCTGTGTGGTACGCCGTCTGCGTTTCTCCGGTCCTAAGACATCAATCATCTGCTCTCCAATGACTAGTCTAAAAACTAGTATTAAGACTATCACTTAAATAAGTGATACTGGTTGTCTGGAGATTCAGGGGGCCAGTCTAACCAGTTACGAACATCCTTCCTCAAAATTGTTGTCATATCTCGCATGGAAGAAAAGATCCTGGCTAAGGAGCAACAAACAACGTATTGCGGAACTTGCATATTTTTCCTGTAACTAGTGTATTACCACATATGGTAATAGCTACCTGTGTGGTTTCGCTGGATAGCAAGGGGATTTATTCGCAAGTAAAATGCCTGATAAAATACACGAATCTAGTAATCATCAATATTTACTCTGGTCGAATGACGCGTGAAGTGGACTGCCAGCAGACGCGGCCAGTGGTCCACCGCCTGCTGAACAAAACGCCAGATATCTCTCGGCTCTGAAAGTAACGCTTCGGTTATTTGCACGGAATACTACTCCTTCAGACTCTGTTAAGTTTTGTTTGTTAAACCGGTGCAGACCTGCAGGAAAGCATGCCAGCACCGGCACTGTACGATATAAACATCCGGTACCGGGGATACGAATGGAATGACGAATACGCCAGAAAAGGGATAACAACCTTCCTCATAATGGTGAAATCATTCGCTATCGGTTACACGGTACGCGATGTCGCCAAAGGCAGCTGGATCGACGAATCCACGGTCACGCTACCGAAAGCGCCGCCGCTTAACACCCTGCCTCGGGCGACCAAAGTGCCGGAGCCGCAGCAGCCGCAGGAAGATTACACCTTTGAAGGTTACCGCAACGCCGACGGCAGCGTGGGCACCAAAAACCTGTTGGGTATTACCACCAGCGTGCACTGCATGGCAGACGTTGAGGACTACGTGGTTAAAATTATCGAACGCGACCTGCTGCCGAAATACCTGAGCATCGACGGCGTGGTCGACTTGAACCACCTCTACGGCTGTGGCGTAGCGATTAATGTACCGGCCGCCGTGGTGCCAATTCGCACCATCCATAATATTGCGCTGAACCCAAACTTCGGTGGCGAAGTGATGGTGGTGGGCATGCAGTGCGGTGGCAGCGACGCGTTCTCCGGCGTTACCACTAACCCCGCTGTCGGCTACGACTCTGACCTGCTGGTGCGCTGCGGCGCAACGGTGATGTTCTCCGAAGTCACTGAAGTACGCGACGCCATTCATCTGTTAACGCCACGCGCCATCAATGAAGAAGTGGGCAGGCGTCTGCTCGAAGAGATGGCCTGATACGATAACTATCCCGATATGGGCAAAACCGACCGCAGCGCCAACCCTTCGCCGGGCAACTAAAAGGGCGGCCTCGCCAACGTGGTAAAGAAAGCACTCGGCTCCATTGCTAAATCGGGTAAAACCGCAATTGTTGAAGTGCTGTCGCCCGGTCAACACCCGACTAAACGCGAATTAATTTACGCCGCGACGCCAGCCAGAGATTTTGTCTGTGGCACGCAACAGGTGGCTTCGGGTATCACCGTGCAAGTGTTTACGACCGGCCGTGGTACGCCGTACGGCCTGATGGAGGTACCCGTCATTAAAATGGCGACCCGCACCGGGCTGGCGAACCACTGGTTTGATTTAATGGATATTAACGCAGGCACTATCGCTACCGGCGAAGAAACCATTGAAGAGGTGGGCTGGAAGTTGTTCCACTTTATTCTCGACGTCGCCAGCGGGAAGAAGAAAACCCTCTCGGATCAATGGGGATTGCATAACCAACTGGCAGTGTTTAACCCGGCACCGGTGGCCTGATATTCTCTTCATACATTAAGTTGTATTATGCCCGATAACGCTTGTTTATCGGGCATAGTGAATCACAGCGAAGACGCGAGCTCCCCGACCAGAATCACTTCAACCCCAGCCTTTCGCAAGCCTTCCAGACTATCCGCAGGAATGCCTTCATCAACAATGATCATGTCGATACGTTGAGTATCAATGATCTTATGTAAACTGGAACGATTGAACTTACTGGAATCGGTGACCACGATGATCCGTTCCGCAACTTCGCACATCCGGCGGTTTAAACGGGCTTCATCTTCATTATGCGTGCTGACGCCGCGCTCCAGATCGATCGCATCTACACCAAGAAACAGCATATCGAAGTGGTAATTTTGCAGCGATTGCTCAGCCTGATCGCCGTAAAAAGATTGCGACTGACGGCGCAAATGCCCGCCGGTCATCAGCAGCTCAACGCCTTCCGCTTCCAGCAACGCATTAGCCACGTTCATACCGTTGGTCATCGCAATTACGTCAGTGTGCTTGCGCATCAGACGAGCAATCTCAAAAGTGGTGGTCCCGGAATCGAGGAGCACCCGATGACCTGGCTGAATCAACTCAACGGCAGCTTTCGCAACGCTGCGTTTCATCGCGGTGTTCAGTGCGCTTTTGTCTTCCACTGATGGCTCGACTGACGGCGTCGTGCTATCGCAGATCAACGCGCCACCATAGGCACGCACAGCGATTCCCTGCTTTTCCAGAAACGCCAGATCGTTGCGGATCGTCACAGTAGATACGCCATACAATGCCGACAGATCGTTAACCTGCACACTCCCTTGCTGTCGCAGACGCTGAATGATCTGTTCTCGTCGCTCGCTGGTGCCTGTCACTCGCTTCTCACCTGAAGCGTCGGTATTACTCATAGTAAGTCCTTTCGTAAAACTTTCGTTTCATTTCGTTTTGCCTATTAACGCCTTTCTATTAAGCAAATGCAAGCCCACCTTGCCCATTGGCGCAAGCTACTCTCGTTTCACTGACTTTCATTATGTTTCTTTTGTGAATCAGATCAGAAAACTATTATCTTTCGTTTTATTTTTATCTCACCATGACGCAGTATCAACTGAAACAAAACGAAAGATTAATATCGCAGCAATCTGAACTGGAGAGGAAAGTGAAACATCTGACAGAAATGGTGAGGCAGCACAAAGCGGGCAAAACAAATGGAATTTATGCCGTTTGTTTTGCCCGCTTTGTGCTGCATGGGGCGAGCGATGTGCCGGATGAGTATGTTCGTCGCACCATTGGGCCAGGCGTCTGCAAAGTCAACGTTGCAACCGAGTTGAAGATCGCCTTCTCTGACGCTATCAAAGCTTGGTTTGCTGAAAATCAGCAGAGCAACGATCCGCGCTTTTACATGCGGGTTGGCATGGACGCCATGAAAGAGGTGGTCAGAAGCAAAATCGCCGTCTGCGGCTCGGCAAATCGATTACGGCTACCGGCGGAGGCCTGATCCAACAGCGTATTACCTCAATATTTCAAAATAATTATAAGTCCCACAAATATGAAGGCGCGTCCTTAAACCGGGTAGTGCCTTCCATTATCCTAAAATTCGAGGAGCCCTATATGACACAAAAAAAATCTTTTAAATCAAAATTATGGGAGTTTTTACAAAGTCTGGGGAAAACCTTTATGTTCCCGGTTTCGCTTCTTGCCTTTATGGGATTGCTGCTGGGTATCGGTAGTTCAGTCACCAGCCCTTCCACCATTACTAGTTTTCCCTTTCTGGGCGGCGAATTTACCCAGTTGACCTTTGGCTTTATCGCTATGGTCGGTGGCTTTGCTTTTACCTATCTGCCGCTGATGTTTGCCATGGCGATCCCCATGGGGCTTGCCAAGCGCAACAAAGCGGTCGCTGCCTTTGCCGGGTTCGTTGGCTACATGCTGATGAACATGAGCATTAATTATTACCTGACGGCTACCCACCAGCTTGCCGACCCCGCCACCATGAAACAGGTAGGACAATCGATCGTGCTGGGCATTCAAACCCTGGAGATGGGGGTATTAGGTGGCATTGTGGTTGGGGTTATCACCTATTTTCTGCATGACCGTTTTCAGGACACGGTTCTGCATGACGCCTTCGCCTTCTTTAGCGGCATTCGTTTCGTGCCGATTATTACCGCGCTCACCCTGTCGCTGGTGGGTCTGTTCATTCCCATGCTGTGGGAATACGTCGCGCTGGGCATCGCGGGCATTGGGCATATCATCCAGAGCACCAGCGTTTTCGGCCCCTTCCTCTACGGCGTAGGCGTGCTGCTGCTTAAACCTTTTGGTCTGCACCACATCCTGCTGGCGATGGTGCGTTTTACCCCAGCAGGCGGCATTGAAATGGTAAATGGCCATGAGGTCGCCGGGGCGCTGAATATCTTCTACGCCGAGCTCAAAGCCGGCCTGCCGTTTAGCCCGCACGTTACCGCGTTTCTGTCACAAGGGTTTATGCCGACCTTTATCTTCGGTTTACCCGCCGTGGCTTACGCCATCTACCGCACCGCGCGTCCGGAAAATCGGCCGGTCATTAAGGGGTTGCTGCTTTCCGGCGTGCTGGTTTCCGTCGTCACCGGTATTTCAGAGCCGATTGAGTTCCTGTTCCTGTTTATCGCCCCCGCGCTTTACGCCTTCCATATCGTCATGTCTGGCCTGGCGCTGATGGTAATGGCCCTGCTGGGAGTGACCATCGGCAATACCGACGGCGGCATTCTGGATCTGCTGATTTTCGGCGTGATGCAGGGAATGTCGACCAAATGGTATCTGCTGTTCCCGGTTGGTATTGCCTGGTTTGCCATCTACTTCTTTGTCTTCCGCTGGTACATCCTCAAACACAACATCAAAACGCCGGGCCGCGAGGTGGATGTTCAGGGGGCACAGCAAGCCGTCGAGGCGAACACCCGCGCGCGCGGAAAATCAAAATACGATCACGAGCTTATCCTACGTGCGCTCGGAGGTAAAGAGAACATTGAGTCGCTTGATAACTGTATTACCCGCTTGCGTCTGGTGGTGAAAGATATGGGCCTTATCGATCAGCAGGCGCTGAAAGCGGCAGGCGCGTTGTCAGTGGTGATGCTTGATGCGCATAGCGTGCAGGTGATCATCGGACCGCAGGTACAGAGCGTCAAAACCGGCATTGAAGCCTTAATTTAACAGGAGGAGTGATGTTTGATTTCGACAAAATCATTGAGCGTCAAAATGATAAGTGCCGTAAATGGGACCATACCTTTGTTTGCTCGCGTTTCGGTGACGTCCCGGAGTCCTTTATCCCCCTATGGATAGCCGATATGGATTTCACCTCACCACCTGCGGTGATTGACGGTTTCCGGCGCATCGTGGAGCACGGCACCTTTGGTTATACCTGGTGCTTTGACGAATTCTACGACGCGGTCATTGCCTTCCAGCGCAAACGTCATCAGGTTGAGGTGGAAAAGTCGTGGATCACGTTGACCTACGGCACCGTATCCACGCTGCACTACACGGTTCAGGCATTCTGCAAACCGGGTGACAGCGTGATGATGAACACGCCGGTCTACGATCCTTTTGCGATGGCGGCACAGCGCCAGGGCGTGCAGGTACTGGCTAACCCGCTGCGCGTGGAGGAAAACCGCTATCAGCTTGATTTTAATCTGATAGAAGAACAGCTCAAAACCCACCGTCCAACGCTGTGGTTCTTCTGCTCGCCACATAACCCGTCCGGCAGGATCTGGCGCGAGGAAGAAATACGCCAGGTGTCCGATCTCTGTCAACGCTACGGCACGATTCTGGTGGTCGATGAGGTTCACGCTGAACACATTCTGGATGGCAAATTCGCCAGTTGTCTCACCTCTGGCTGTGCCGCCCAGGACAACCTGATCGTGCTCACATCGCCCAACAAAGCGTTCAATTTGGGCGGGCTGAAAACCTCCTACTCCATGATTCCAGACGACTCGCTGCGCCAGCGCTTCCGCCAGCAGCTCGAGAAGAACTCCATTACCTCGCCCAATTTGTTCGGGGTATGGGGAATCATTCTGGCCTATCAACACGGTCTGCCCTGGCTCGACGCGCTGAACGGTTATCTGCAAGGCAACGCCCGGTATCTGGCGGATGCCCTCCAGACCCACTTCCCGGCGTGGAAGATGATGAACCCGGAATCGTCGTATCTGGCGTGGATAGACGTAAGCGCGGATGAGCGTAGCGCAACGCAGCTAACCCAACATTTCGCACGGCAGGCAGGCGTGGTCATAGAAGACGGCAGCCACTATGTACAAAACGGCGAAAACTACCTGCGGATTAATTTTGGCACCCAGCGCTACTGGCTGGAGCAGTCCATTAACCGAATGCTGAAAAATGACAAATAAGGATCTTACCCCGATGAAGAAAGTGCTCACTCTCTCACTGCTGGCTCTCTGCGTTTCTCATGGTGCAGCGGCAGCAAACTACGCGCTCAATAACGACAATATTGCCCTCTTGTTTGATGATACAAACTCAACGGTCGTGGTGAAGGACAACAAGGCTAACCATCCGCTCACGCCGCAGGAGTTGTTCTTTCTGACGCTGCCGGATGAGAGTAAAATCCACACCGCGGATTTCAAAATCAAGCACGTCGAAAAGCAGGATAACGCGATTGTCATCGACTTTACGCACCCGGATTTTAACGTCACGGTGAAGCTGAACCTGGTGAAGGGAAAATACGCCAACATCGGCTACACCATTGCCGCCGTGGGGCAGCCGCGCGACGTCGCTAAAATCACCTTCTTCCCGACCCAAAAACAGTCTCAGGCCCCTTACGTAGACGGCGCAATCAATAGCTCTCCGATCGTTGCGGACTCGTTCTTTATCCTGCCGGATAAACCGATCGTGAATACCTACGCCTATGAAGCCACCACCAATCTCAACGTAGAGCTGAAAACGCCGATTCAGCCAGAGGCGCCGGTCAGCTTTACTACCTGGTTCGGCACTTTCCCGGAAACCAGCCAGCTGCGCCGCAGCGTGAACCAGTTTATTAATGACGTACGTCCACGCCCATACAAGCCTTATCTGCACTACAACAGCTGGATGGATATCGGCTTTTTCACTCCCTACACTGAACAGGATGTGCTGGGGCGTATGGACGAATGGAACAAGGAGTTCATTACGGGCCGCGGCGTGGCGCTGGACGCCTTCCTGCTGGATGATGGCTGGGACGATCTGACCGGACGCTGGCTATTTGGCACGGCATTCAGAAACGGTTTTAGCAAAGTACGGGAGAAAGCCGACAGCCTGCACAGCTCCGTTGGGCTATGGCTTTCACCGTGGGGTGGCTACAACAAACCGCGCGACGTTCGCGTTTCGCATGCAAAAGAGTATGGGTTCGAAACCGTGGACGGCAAACTGGCGCTGTCGGGAGCGAACTACTTTAAAAACTTCAATGAGCGGATCATCAAGCTTATCAAAAACGAGCACATCACCTCGTTTAAACTCGACGGGATGGGTAACGCCAGTTCGCATATCAAAGGCAGCTCGTTCGCCTCAGATTTCGATGCATCAATCGCCCTGCTGCACAATATGCGCAGCGCAACCCCGAATCTGTTTATCAACCTGACCACCGGCACCGACGCCAGCCCGTCCTGGCTGTTCTACGCTGATTCTATCTGGCGTCAGGGAGATGACATCAACCTGTATGGTTCCGGTACGCCGGTGCAGCAGTGGATGACCTACCGCGATGCCGAGACGTACCGCTCCATTGTCCGTAAAGGCCCTCTGTTCCCGCTGAACTCGCTGATGTACCACGGGATAGTCAGCGCCGAGAATGCCTATTACGGGTTAGAGAAGGTGCAAACGGACAGCGACTTTGCCGATCAGGTCTGGAGCTACTTCGCGACCGGCACCCAGCTGCAGGAGCTGTATATTACCCCGTCCATGCTGAACAAGGTGAAGTGGGATACGCTGGCGAAGGCTGCAAAATGGTCGAAGGAAAATGCCAGCGTGCTGGTTGATACCCACTGGATTGGCGGCGACCCAACGGCGCTTGCCGTGTACGGCTGGGCATCCTGGAGCAAAGACAAAGCCATTCTCGGTTTGCGCAACCCATCGGATAAGCCACAGGCCTACTATCTGGATTTGGCTAAGGATTTCGAAATACCGACAGGAGACGTGGCGCAGTTTAGTCTGAAAGCGGTATACGGCAGCAATAAAACCGTGCCCGTTGAGTATAAAAACGCGACGGTGATTACGTTGCAGCCGCTGGAAACGCTGGTGTTTGAGGCGGTGCCCGTTAACTAAACGCTTGTCCCAATGAGCAGACCGGGTAAGGCGCAAGCGCCACCCGGCAAAACCGGCAGCAGGGGCTTATTCCCCCTGCTGTTCCAGCGCATACTTATACAACGCATTCTTCTTCACTCCGTGGATTTCTGCCGCCAACGCCGCCGCTTGCTTCAACGGCAGCTCAGCCTACAACAGCGCCAGCGTACGCAGCGCATCGGCGGGCAGTTCGTCATCCTGGGCTTTATGGCCTTCAATAATCAGCACCATCTCGCCTTTGCGAGGGTTTTCATCTTCTTTGATCCACGCCAGCAGTTCGCCGACCGACGTGCCGTGGATGGTTCCCAGGCGGGTGTTCATCACATAACGGTATAGGGCATCATTCAGCGGAGCCATTAACAGGCAGCACGCGTCAACCCAGTTGGAGAGTAACGCCGTCATTGGGTTATTTCTTTCCATAAAAATACGCAAACATGCCCCATACCTCCTGTTCCAACAGCTCAACCTTTCTTAATCATTGCCGAAATTGCGCCACAGTCATGAAACTTATCAAAACCAGTGGTGCCTTTGATATATTCAGTAACCAGAGTACCAATAATCTTAACCGTTTCTTTGTTCGTTGTGATAAAGCCCAGTGGCATCGTTACAGGCAATATTTCGGGTTGAGAAATATGATAGATAACGGCAGATAACTCATCCTCAAGATGAAGGATCTCAACATTACCATCTAACAGCCCCTCAACTAGTCCTTTAGCGTACTCATGGAATTGCCCAAGTTCATACTTCCACTGTCTGGCAACCCGAGCATCAGCTGCATCCCGGACAGGTACGTACAGCACCGGTTTATACAGCTGCTTATCCTTCGTAGCCGACTTGCTAAGAAAATTGCTCAGGCTTGAGATAAATCGAATGCTTTTTGACCAGTAGGGATCTGATGGTGCCACCCAGATGAACGGGCTACCTCCATAATCAGTTTTATCTCTGATACGCTGATTGAGCGCTGGTGCCAGAGCTTTCTGCAGCTTTTCCCAGTTGTTTTTAGAATAAACCGGACCTAAAAAAGTCGTTCTGCCTGAGGACGAGTATTCTTCCAGTGCTTGCATATCTTTGATAAACAACGGATTAAGTCGATTCGTTTTAGAGTCAAACAGCTTAAGCGTAAGGTCATCGTTAATGGATTGCATGGCCTTAAAAATACTCATGGTATTGTTATGCCGTGATAACCTGGCTATCTCAACAGCCATTAACTCATGAACACATTCCGAACTATTTAACTGTTCATAGTCTTCACGCTCGACAGCCTTACCATCGCTATCCATTTTAAATTCCGTTGTTAAACGGAGTGGTATTTGCCGCAGCTTATTAACAGAATTAACGGTATAGATAGAAGGTTGTTCTTTTCCGACCTGAACCTGCAGGTGAGACAGGTAACCTTTATCCAGGATTTGATTAAACTGGTATTGAAAGTGCTTCTCAACCAGTTTCTCACTTTGAGAAATGTTGTTGTCATCAACCTTTAATGTTACCCCTCCCCTCCAGTGGTCCTGAAAAGCCTGGTTGCTGAAGCAACTAAATGTGGCTAAATCAAAGGATAAGGTACAGCCACTGTCCTGAATAGTTGAAAGATGCGGTATCTCGCCAACCTCTGAAAAATAGCCATTTGATTTGTCTGTTAAGGCCAGTCTGCCGTCTGAGGAGAGCGTCAATTCGCCACGCTGCACAAGATCGGAAATCGCCTCCTCTGTCTCACGGTGGGTCAGGCCGAAATAGGTTGCTATTTGTGCTTTGCTCATCGAAGCTACATGAACAAGTCTCAGCACAAACTCCCGGATAAATGGCAGCCCCTTCTGGGAAACATAGGAAAACTGAATGTTAAACCTCTGTGCTGGCAGCAGGAAGTCAACCTCATGATAGGTAACTTTGTTATCAGATATCATTTCTTTTTGCCTCCCTGCTGAGCAGAGAGGAACCTGTATCCAGCTTCCTGACCTCGCTCAGCCATATAACTTACAACGTAGCCTAATGGCAATTCTTTATTGTTTCCCTTCCAGATATCGGCATTACCCACAATCAACAGTCGATCCATTGCGCGTGACATCGCGACATTTATACGGTTTGGAACGCGTAAGAAACCTGGGCTATGCTGCTTATCCGAACGCGTCAGTGACAGGATAATGATCCGATTTTCCTTTCCCTGATAACTGTCAACAGTGTCAATTTTAACAATGTCCTTAAATCCCTCGCTCCAGATTTCCTGATTGAATTTCTGACGGAGTAACCGCTTTTGTTCGGCATACATACATATCACGCCGATAGCGGCTTCATCTTTGCTAACAAGTTTTGAAAGCTTAGCGACAAATTCTTCATTCTCTGACACCTGTTTAAGAACAGAAATAATCTCGTCAGCTTCACATCGGTTGTAAATGCTTGTTCCGCGATCTTCAAGATGATGTGCTCGGTGGCCCTGATTAGCAGTATCAAGCCAGGTTACAACGCTACGTAACGCTTCCGGAGCTTGCTGATAGACATCCGGAATTGCCCGTACTCCATTCAGAAGCTTCCCGTCATAAAACGTCTTCGATACGAGATTACCAATCGGTGGAGCCATACGATACTGGGTCATCAAAGCTGCACTCGTCTGCGCACCATAAGCAGAGTTGAAGGCTCGGGCAAAGTCACTTCGTAATACCTCGTCAATTTCAGTGCGGGAGTTATTGATACCCAGCTTCCTCGCTAATGCCGCCTTGTGGGCATCTGAGTACAATGGAGGAAGCTGCATGTGGTCACCCACCAACAGGACACGCCGGGCTGACTGCATTGCAATGGCCAGCTCACTTGAAATCGAGCGCGCCGCCTCATCAATAATCACCCAGTCATAGATATTCTCCTGAATGCCAATGTGCCCTTGTCCAATACCAACGCATGTGCCAGCCACTAACTGCCTGGAGCGCGAATAAAACTCGTCCAGGTTCACTCGCTCTCCCGACATAGCATCCTGCATATCACGGGATATTTTAGCTAATGCTTTTACTCGTCTTGCCTCATCAGGTCTGACACCATACTCTGTACATAGCTTGGAAATCAGAATGTCTTTTGCTGCTGAGACCTTCACGCCATTGTCCAAGTTAATCCCATACTCCTGACTCAGCTTAGAGCGGATGGAAAAATCGAGTTCGACTGCAATATCTTTCAGTTCATTGCTCTCATTTGAATCCGTTAAATTGTTAACCTGATAGAGCAATTTCTCAAGGTGATCGATTTGTCTGAACAAATTGAGCTCTGCAAGAACAACACCCGAAATAAATCCCGGCTCCAGACCAATAGCTTCACTCAATGCTTCAACACGGTACTTAATTTCAGCATTGAAGAGTTCGCGCTTCTCTGTTGTGATTGCGTGTGAGTACACATCTTTTAAGCCAGGGGAAACGGCTCCCTCTCGGTTGCTGAACCTGACAACGTCCAACTCTGTACCAAGCCGGGAACAATGCTTTCTGATACGCTCGGCCGCTGTATTCACAGCCTCGTGTGACTGGCTGACCAGTAAAATGCGTTTGGTATTCTGTTTCTCGATCAGGTAGTGAACAAAGGCCGCGATGAACTCGGTTTTACCGGTCCCCGGTGGCCCCTGAAGCAGGGAGAGAGGGCCATTATTGACCAGTTTGTTAAACGCCTTTCTTTGCTGTTCATTCAGACTGATCTTGTTTCCGTGCTGATCTTCACGGTCGTATCTGGCAAAATCGGTATCGCTGAGAGTGATACCATAATTTTGAGCCGCCTGTTTGCAGGATGGATCGAACAAGTCAATTAAGTCAGGCAGCACACTTTCTCGATCTAGGAGACGTTCCAGTGCGCGTTTACGTTTCTGATAAGATGCACGAGTCGGCCTTGTACGGAAGAAGACAATATCAGAATCCTTCAGCTTGAAGGCTGCTGAACTAACTTTGACAAGACGAATCTCTTTGAGCTCTGACTTCTTAAGTGACACTTCACCAATAAATCGCTCAACACCTTCCTGATCGACCTGTAAGGCTTCGACTTCATCACTACTTCTGAAAGCACCGAGTGGATCAACATCAGCAGAGTAAGGGAGAAGAAGCTCTCCATGAGCATCTGCAACCGGGACCACTTCACCGCTGATTTCAATGTTTGGATAAGATTCTGTTTCAGTATCCAGAATGGCGCGCCACAGTTTCACTGTTGGGATTTCCAGCACTTCTCTTAAAGAAGGTTCAAGCGTCTGCTTATCAAGCCTTGCAAAGGTATCTTTTAGCTGAAGCGTAAGTGGCTCTTGTACCTGAACATCTTCAGTTGCGGCAATCAACTCGATTGCCCGCGCAAAAGATTCTTCTTCATTAAGCAATACTGTCAACGCCGACAGATCCTGCGGACTGCCGGGAATAATCTTGATGCCAGTATCAATTTCGAACTGGCTTTCATCAATATCTTGCTTACGAATTGTGACGCGAGCCCGTGGCCTGAAGCCATGAACCAGCGTCTTCTGATCTTTGTTGAACACCGCAGTAAAACTGCCACCAATCCCAGAGAAAGTCACATTTACTTCAGCTGGCGCTTTAGGATTTGACTTAACTTTGACATACAGATGCCCGTTATCAGGCAGAATAGAGATTATTTCATCGGCATTTCCTGCAGTAATCTCTATCAGGTCTTGTTCTGGCACCAGGTCATTACTATCTATCGCTTTTTTAAAGCGGCCTAAATCCTTAAACCCAAAAACCGGATCTTCCAGCTCTGCTCGGATAGCATTCGCGATTGTCGGATAAATGTCTGATTCCAGCCCCCATGACATGCCTAGCAACTCGCAGGACATCTTCATCACTGCATAGTTGTCACGTTCAAAAGAAGTACAATTATCAATGTATTCAGGGCTGTAACTGTGATTTTTAGGTTCGTCACCGGACGGTGAAAAATCGGGGATATCGATGAGAAAAAGCAAACGGCTCTGTGTCTCAAATATCACATTGCCAGGGTGGATGTCCCCGTGAGAAACACCCAGACCATGTAAGTGCTCAACGGCGGCGACAAACTTACCAATGAGGTCGATTTTTTCATCATCGGGGACCGCTATTTTATCCCAGGTTTCTCCCTGTACCTGATCCGTGACCATATATAGGCTTGATGATTTTGATGCGATACCGAATTCACGAATTTGAGGTAGATACGTTGTTTTAACTGAAGAAAGCCGCTCAACCTGCTTTAAAAACTTCAGAACCTGGAAATTAATTGACGGATCGTATCCCTGTCCCCCAACATTCAGCCAAGCCTTCACGAGCCTTCCTTTCGAGATATAGACTTCTTTGTCGACTGTTTCGACCTGAAACTGAAAGCCATCGTCTTCCGGGTATTGACGAGCGTGATTGATAGCATGTCGGTAAGGGTCAAGCTCAGTATCATCAAATGTTGGAATATCCTTGCCAGCAGGTTCGGCTTGTTTCAGGGCATCAAAGAATTCTGTTGCAGAAGTGAATTTTGCGGCAACGGCATCCCGTAATACAGATGAATACCAGTGCTGGCTGTTTAGCATGTTGTCCTGTACTTTCTCCAGACTCTTTGGCGACATGCGCATACCGCTAAATAAGTGCCAGGCGACAAGGCCCAGAGTATGAACATCTTGCTGAAAAGGCGTAAGCTCGCCCTTATCAAGCATGTCTTTTACGTGGACTGCACCTACGGATAAAAGCTTACGGTAATCGCCAACCGTTCCGGCTGGCTGGTGGTAAGCCGAAATAAAGTTCGAAAGAGCAACTTCTTTTGAAGGTGAAATCCACAGACTGTGATCGGCGACATCCCTGTGCGCAATTTTCATCTCATGGAGATCACTAAACTTTGCAATGAGCAGTTTCACCACATTCAAGCGGTCCATATCAGAAAAGTTCTTACCATATTTTCCGATAAACTCATTAAACCGGACATGACCCGGCGGAACTTCGTAGACTTCGCTATACTCGGCCGTCACCTCGTCTTTCTGAAAACTCGTCAAAGACCTGAGACAGTGGTTATACAGGTCACGATTCTGGTGATTGATGTGCTGTAACACTTCGCGCTCACGTGAAACAATCTGAGCTCGTCCTTCCGGGGTATTCGCTTTCGTTCCCGTTATATTTCTAAAATTCCATACTCTGAGTAGCGCTTCGCTGTTCGTTGATATCTCAGATTTTGCCAGATATTCTCGATACACCTTTTTAGGGTGTTCGAAAATCATATCATTAGCTTCATAGCCATTAACTCTCAAGGCTTTTGGTGCCGTCTGAGGCCCTAGGAACAAATCATCGAAAAGATGGAAATCCTTGTTAAGAACCTTGGTAGCTGGGTGCGGTTTGAAATAGTTATTGAAGCTCCCGCGATCGGCAAACTTCAGGAAATCTTTTAACGAGATCGTATGGCGCCGTTGTTCCTCCGGCAGCGCGCTGAAATCTGCATTGCCGGTCATCACAACAAAAAAATGAACAATCGGGATATAGCCTTTGTTCGTAAAACGGTCTACCAGACGCTTGAGTTTTTTGTCCAGCATGAATTTTTTGCTACGAGTCACGCTTACTGGCGAGCGCCCCATGTTCTTATCGCCTTTAAACCAAGTATCTCCGCGTGCGGTTACAGGCTGATGGTTCCAGTCTTTCAGTTCAACAATGATCACGTTGCAGTGTGTTACAATAACTAAATCGAATTCCCCCTCTTTTTTTGCCTCAACAAATCGAAAACCTGCATAACCTTTCCAAGGGAACATTTCATTGCCGATAAAACCATAGCTTTTTAGCTGCTCACTAATTGAACCGCTACGGAAAGGCTTGTCAGGCTTAGATACGTTGACTGAAAAAGCAGCTTTTATTTTCTCGATAGCCAATACTTCCTGTTCTTGTAAGCCACCATCCCACATTTCTACTTCCAACGGTGTTCTCCTTAAATTTCAGTAAAAATAAGGTCATTTATTCTGGTCAATATGGCAGGTTTCGATGTTTATAGTTCTGAATACTATGGTGTTTTACACGTTCATTCTGGTTATTATACGTAGAAAAGAATTTATCGGGCAACGCCGAGATGACGCGGCCCCGTTAAACCGTGTAGTGGTAGATGATGAGATCAGATACAGGGATCAACTGACGGTGAGCGCAGTCCCCACATTTTATCCTGAGTTTACCGCTATTCCCGGCTGCCATTCGTTAGCACAGGCCCGCAAGTACCCTGATTTGCCGCTGGTTTTGCTTTCCCATCGAAGGGCCCATACATCATCACGCCCGCGAAACAGTCGACAAAATAACGCAACTTTCTCATCCGTGGATAATACGGAAACGCACTGCACAGGACTCTGCGGTTTACGTCACCATTCAATCCCATAAGCTTCAAGTTATGTTTGCCCTCAGTGCAGCTAATTCATCACTGTCAGATTTATGAACCATATTCGTTACCTGTATCCGATCCACTGAGGTACCACGGCAGAAATTCGGGTCCCCTACACTGATGACAAAGCGCCCCGCCCTCTTCTGCCTATTAACTCAAGCACCTCCCAACAACTTACAGAAACTTAATTCTAATACCTAATACAACTGTAATTCAGTTATGTCGTGGTCGGCCCCGGTTGCTTTTTTCGGGAAGCCTGCTACGGCCCAGAATAATACCGTGTTCTTTCTGATCCTGTTTAGCCAGGTAGCTAAGGGCGTAACGTAAACCGTCTACCGCAGACTTATCACTATAGTGAATCACGTGATCGATACGCACCGGGTATTTGTCTTTAGTACTGCACAGGTGAAAATAACCCTCCCCTTCCGTGATCCGACTCCAGATATCTCCCAGTTGCCTGGATATCCGATAAAATTTCTTGTGACGTTGCCCATCCAGGTAGCCGATAAAATGAATATGAAGCCCTTTGTTTGGTGTATATTCCATAACCCAGTAATATCCCGCCAACATCGTCTGGGTTTCGCTAAGTAAACGGTATATTTCCATACACATACTGTGCTTACAGGAATGCCCGAAACTGGGCGTGTCTTTCCTGTAGGCAAAATCAATTCTGAAAGGTAACAGTTTAGAAAAACGTTGAAACATACCATCCATGTGTTCATTCACGTCTTTCAGAATCATGAAGTCCATTTCGTAATTGGGATTAGCATTATACATTTTATAGTCCTTACTTTATAAAAGTTACAGGAAGGTGGAATTACATAAATACTGAGAGTACAGAAACGCCGTACCGCGGCATTAATAAACGAGGTAACCAGCAGCACAGACAACCTGATAAAACTGTTTCGTATTACCTCATTACTCCAAAGAGGTAATATTTCAGAGTACAGTAAGACTCACTTAAACTGAGGACCAGTACGAATTACAATGGTAAAAACACACCTCATAGAATTGAGTTTAAGGAAGTACCATTCAATCACTTACAGGAAGTAATCTGACATTCACAAAAGCCAGTATCGGGTAGCCACAGGCAACACCGCTATTAATACTTCTGTACCATTAAGTAAAAACAGCGACCCAGCTTAAGGCCTCTACAGATATTTAACTTCACCATAAAAATGATGGTATACAGACCATACCCCACACACCATAAGAATATTATTTCATACAGGAAAGATTATATTTTTTCATACCTATAAACTGTAACAGAGCCATAACATACTCTAATAGTGTTCCGGTACGTAACCAGTGTTAAGTGCGTACAAGACAACATAACAGATCACATAACTTCCACTAATAATACTAATTAACCTAAATTATTTATTGAACACGAAAACAATGAAGGCCCCACCTCACCTCAGACTAAAGCCGACAATATCCTGAGCAACACTCTATCCAATAGTAAATAGTGGTTACACCAGTTATATTTACTGACTGACCTGTACCAGAGTTTTCTAATACAACGCTGACATGGTAAAGACATTAGTCTCCTCCGCAAGTAGCAGAGGAGACTATATGTACTCAACGATTAAGTCGTTCAGAGGTCAAAGTTAGGTGCGACACACAACGTTTTTCCCTCAATAACCGGTACAACTCTGTTTTGTTCATACAGTAAATCCAGTAACCAGTTGATTTTATCCTTTTTCCGGAAACGGTTCGGACCACGCTGTAAAATATCATTTTTTTTCATACAGAGGATCCCCTTCTCAATACAATAGCTTTTTATCCAGTTGAAAAGTTCAAGCTCCTCTGGAATGAGTCGCACAGGTACGGTCAGGGCAGAGTTGTCAAAAGTTAACGGATTAGACAACCGCACATACTCATTACCGTACCATATTGCTAATTCTCTCGCCATTTCTGCAGTGTAAGGGGAAATCTCCCCCTCTTCACCGCTCGAATGGTAAATAAGTCCTGCCAGTCTTGCCATATACTCTGCATTTTTAGCAGCATATTCCCGGCAATGTCTTAAAGGCCCCAATCCCCCCAGCTTCGATTCCACATCATTGTAATAATCCGTCCAGATTCTGGCAGCCTGAGGAGAAAAGTGAAGGCAACGTCGTTCACCACTCATCGCCAGACTTTCATCAATAAGCTCATTGATCCTCTCTTCAAACAAATCCTGATACTGTGATGAATAATTATCTCCGGTTATTATCCTTGTCCCCTGCGTTGATGTTGGTTGACACATCAAAAACCTTGCATGATGTCCTGACGTTTTCACAATTTCTTTTTTTCGCGTACAAAAACCTTTGTGGTAAACATCAGGCTGAATCATCACCGATATCGTCAGTCTTGGCTCCTTCAAATTAATTCCGGGAGATGATTTCCTGTCGATGAAAAGAGAACCTCCATCCCACAAAGTGTTAATAATTCCCAGTTTACTCATGGCCCGGCTGTCAAAAATTACCCCCCCTTCACTGGATACAAGAGCAAAAGAGCGATTGCTATCGGAGTAATATTTTAACATTCCCTCTATCGTTGTCTCATTAAAAATTGTTCGACGTATCTGCGGCGGAACAGGAGGTTTATTCAGATGCGTTTCAAGCTCTGATTCTGTTGCCTTGTAATCTTTACCGGCACGAATCTCTTTATGAAATTTTGATTCCAGCGCTTTTTGTTTTTGCTCCCATATTTCCTTTTCTGTACTGTAATTCTCAACCAGTTTCGCGTATTCATCCGCCAGGGCTTCATCCCTGAGATAAAATGCTTTCATAAACACTTTATCCACGGTCGTTTTCCTTTCACCGGAATCAGCCAGAATCAGAGAGTAAAGATTAACAGGCCCATGTAAATTTCCAGGTCTGCACACGTCAATCTGATTCTGACAGGCAATTGAGATCGCTGTTAATGCGGATGTTGCCACCATAGCCAAAGGTGCCTGTGTATTTTTTTGAGTTTCAATTATTGCATTTCTCACCAGCGGTGGTAGTGCATATATCGGATAAGGATTTTCTGGTGCAAGTAAGCACATAAAAGCCTCTCTTTTCATTAATGGGTTAGTAAAATAGCAGCAAATCCCGCTACAATAGCGAAAATAGCTGCTATTCAGACCTGATGCTCTGAGAAATACAGAGCATTTGTAGATCTGATGTTTAATAAACCGATAAGGTTAATTCAGTGAAGACTGTATTATTCACCACGAAATATCATTCATATTTCTCATTCTTAATACCTCCGGAAATTATTTCACTAATACCTTTCCCCCGGTCTTTTCCCTTTCTCCAGAACCATAGTGTAAATATTAACGGCCCCACGTAAATTTCTGGGACTGCATACGTCAACCTGATTCTGACAGGCAATTGACATCGCTGTTAATACTGACATTGCAACCAGAGTCAGAGGTACTTGTTTATTGTATTGAGCTTTAATTATTGCCTTCCCTATCATTGATGGTCATGCATATACTGAATACGGCATATCCGGTTTGATGTAAGGCATAAATGCTTCCTTTTTCGTAATACGTTTCGGTTATAGCAGCAATTACCGCTACAATAGCGACAATAGCGACAATAGCGACAATAGCTGCTCTTTCAACAATGAATGCTGGAAATTCTGTTTATCAATTAAACTTCTGGCTAAATTGTTCTTATGCTTCTTTTTCCTCCTGAGGTGATCACGTGATGACAAAAACTTATCTGCTCATGGTCATATCTGAAGAAAGCGTATGTCTCACTTCGTCATTAACACCTTTGACTGCTGATCTGGCAGGTACAGTATGTAAAGTAAGCCATTCGTCAATAGCTGATGAACGCCAGCCCACAGAACCACTACCAAGACGTACCGGCCTGGGGAAGGTTGCATCGTAGTATTTCGATAACGGATTCATTTTTTCGTAAATTGTCGAACGAGATATACCTAACAATTGGCTCAGTTCAGGCATCCGTAAGATGCGTGAAGGAGTCCTGCAGTGTCCGGATTGATCTGGCATTTTATCCCCCATAGTGTGGTGAACTGTCGTTACAGTTTGTATTCCCCGGGTAAATTTGTCTGCGTAAAAAAATGGGTTGAGAAAAAAGACAATAAATAATTAAAAATCAATATGATAGAAAACGACAGCATTAGTGTTATTTTACTTTTCCCTTATCGCTGTCGATTCTCATTATCATTCATGGCAAAAAAACAGCAAAAGAGGCCCTGGTAATGACTCCTGTCAGTAGTACATGTGTTTTCGATAAATGAACCAATGGCTTTTCGTGAATTTCCAGGGAGCGTAAAAAGCAAATCGTTTTCCGGGCAGGTCGTTTGATGTGAGTACGAAGTGTCAGATTGTTGCGCTCAATGCGCCGGGTAAATATCTTGCCGACAAGATGCTTATCCTGTGGCATTTCTCTGGTATAACTGCTCCGGTTGTCTCTGGTTATCATGCCTGCGGAGAATGGCTTCAGGAATTCCGGCAACTCACGGCATGTTTCATCAGTGCGAGGACCAAAAGTGTAAGCCAGCACACCGTCAGCTTTGGTCTTATACGCGTACCAGTGCCACTGCTGACGAGCTTTGTTTTCGACAAAACTCCATTGTTCATCAAGTTCACAGATAAGTGCCACATCGACACTGGCGGCTGGAGATATGGTTACTTTACGTGGTGAGCGTTTTTTAAAGTGAGAATGACGGTGTTGATATCCACCTTCAGCGTTCTGGAGCTATCGCGAACCCCGGCTCCGTTATGAACCATTTCAACAATTTGCTCTTTAATGCCTGGTTTTCGGGTTTCGTAGATGTAATTCAGTTGAAATACACGTTTGCAGGATTAGCACTGGAAACGCTCATGATCTGAGGTGCTGTGATCATAACGGTACACTTTATAGGAATTACAGCGGGGAGAATATACTGTAACTGTTGCCATATGGTCTCCAGATACCAATAGAATACAACATTAATCTATCGTCAGAAGGCATCACCGGGGCCTTCTGACATAATCTGTTAATACGTGTACTATTACCCTGTTCAGAAAATATTGATTCAAAAATGAAAACCAGTTAACAGAAAAGCAAAATGATATAATGTTAAAATTTTATATAGTGCAATAAAAGGAGAATGTTATGTATAATTTTATCACTATAATGTATGATGTCTTTTCATGTTTTGGTGTTCTGGCTAAAAACCAGAATAGCCGTGACATCCGAAATATTAAAAATTTTTCCTCACATCAACATTCACTGGGCGACATGTTTGATGAATTAATAAACATTATTGATAAAGAACAAGTATTGAGTAAAGAACAACGAAAAGTTATATTTAGGAGATATGAAGATCTCTATGTTAAGCTAATGCACTATTCTGTTTTTACAGACAAAACACATCAAATAATAAAACAAAAATATTTTAATGACATTGTACCAATGATTCTCGCACTCGACATCAGGAACACATATCGCCCGGATAATGAGATGGCATTTTACTATCATATTCATTCTTTTCTCACTCAGATACCGGATAATGAGGATGATATATATCATGCTGCAAGGACATATCTGCGAAATTACGTTAAGTTATGTTTATCCGGATACACGCCAGCGAATGCGCATTTCAAAGATATCTTTGATGGCGTATATGAATTCATTCGTAATATTCGCAAAAACAGTACACCAGGAAAAACAAAACTTATCGCAACTATCAACACATGCAAAGAAACCTGTAAACATCTGCTTTATTTAAGTAATGAAGACAAGGAAAAAATAATTTCTGACTTAGATAAAGTTCAGGTTGCATGTTATTATCTCACTATATTACTGGCTTTCGAAAGACGAACTTCATTAACAAGCACCCTGACAACTTTATATAAAATGCTGATAAGCGAAAGAGAAGTTTCAGAATATGAATGCCAGTTATTATATTTAACCAACCCAATAGATGTAATGAATATACTGAACAAATACATATATTACTTTCCTAATGAGAACTCACCATTTTATACACTGAAAATTGACAGTGCATTATCGTGGGATGCCATTGACGCAATACGAGACTATAGTATTTCTGATATTTATCTTTATCCTGAACAAAAAACAATAAATTGTGTCGTTGAGATTGAAAACATTGTCTTTGGCGGTTACATTTATACATTGAACAACGGCGTCACATTACAAAACATAGAAAACTCTTTAAAAGATTCTTCATGCCATTATGTCTTAAATGGCTATACAGAATTTGTTAACTGTTTGAGACAACTTACTTCAGGAAAGACTGAAAGTGTTCATCGCACCATCAATAAACTGAACTATGAGAAATTACCTTTTGGATTTATCATTGCCGCGTTTGCTATACTAAAGATAGCATTTAAAATAAAATTCAGTAAAAATCATGTAAATATCCGAGCATTATTAAATGACATCAATTATTTTATGACTTATCAGGGCGAGTCCATTAACCTTATTTCACTGGATCACGAATACCCAGAGTCCTGTCTTCAAAATGACACAAACACATATTTATTAGGAAGAGTAATATTTCTGTATAACTCAATGATTTATAAGTTCATAAACTGTCAGGAACATGAAACCAATAACATTCACTCAGCTATGATAAATAACCTATTACAGGAAGTTGATATAGCCCTTGGTAAAATAAATGACATTATAGACAGCAGAAACATATCAGCCCCCCATGAACTGGCAAATATTCTTACCCGCGAAAAAATACTTACAACACGGGAAAAAAAAGGAAACCTGATAAGCCTGTTTGATGGATTCACTTTATTCCATTGTGTTGGAATGATAACCTTTCTTATCCATTATCTCAGAACACCTGAAGAAAAAGTTGAAAATATATTTATGTTATATGGTGCAGATAAAAACAATAAACTACGCAGAAGACTGATTTATGACGCACTAGGAATAATTCAGTCTCAGCAGGAGTGAAGAGTTAAACAGGCAAATTATTTTTTATAAAAAGGGGATTGTTAAGTAATCCCCTGTTAATCAAAATACGCTTTCCTGACGATCTGATAATTATCAGCTCAATACATTTGACATAAAAGCGGTTTCTTAATTACTACTGCTACGATCACATATCAAATCATGATCGTACAATATACATTTTCACTATTATTTATTCACAATTTGAGACATCACATAATCCGCCCATGCCTGCATCATCGGTACACGCTGCTCCAGTAGATCAGTACGATGATATGCCGCCTCAACCTTATTTTTCAGCGTATGAGCGAGCGCCCTTTCCGCCAGATCCCGCGAATACCCCTGTTCGCTACACCAGTCCCTGAATGTTGAGCGAAAACCATGTGCCGTGGCAACTCGCCCCGGAATGTCACTGACGGCTTTCTTTTTACGTAGAAAACTTGTCAACACCATATCGGAAAGGATCTGCTGCTTTCTGGGTGAAGGGAACACCAGTTCATCATGCAGGCCACGTATATTTTCCAGAATGTAAATAGCCTGCCGGGATAAAGGAACACGATGCTGTAGCCTAGCTTTCATTCTTTCTGCAGGTATAGTCCATACCCGCTTATGAAAATCAATTTCAGCCCAGCGCATTCCCCTGGCTTCGCCCGAGCGAGTTGCTGTAAGTATCACCATTAATAACAGTGCGCGGGTAACATTATAAGGTTCATCGGTATACACACTGGTCGCCACAAAAAGCGGTAACTGCCTCCAGGGCATTGCGGGTTGGTGTTCATCACGTCCTCTTGTCTGCTGAGGAAGCAAATGGTCAACCACATCAACAGGATTTGCTACACAAAAACCGTGCGCCCATCCCCACTGCATAACAACATGAATGCGCTGTTTAACCCGGCTTGCCGTTTCTGACAAGGTTAACCAGACTGGACGCAGTGTTTCTGCCACATCCGCAGCCGTAATCGAATCCAGCGTTTTTGCTCCCAGTTGAGGAAACGCGTAATTCTCAAGCGTCGATAACCACTGCCTTACATGCTTTGGATTTTCCCATCCAGGAGACAGTTCTGCATGTACACGCCTGGCTGCATCGGCAAATGTTGGGATAGCGACTTTCTCAGATTCAGCCTTTTTAATCTCCAGAGGATCATCACCTGCAGCAAGTTGCTCTCGCATTATCCGGGCAGTACGTGCAGCTTCAGCAATACTGACCTCTGGGTAAGTTCCCAATCCAGCATTACGTCTTTTTTGTGTCACCGGACTTACATAACGAAAAACCCATTTCCCCCGCCCCTTTACTGAAGAAGGATGAAGGGTCAGTCCGGTAATTCCCCCATGGGGCAATGGTTTGTCATCAGGTTTGATATGTCTTGCTTTCGTATCCGTCAATACTGCCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP048344|2825482:2867030|2846278_2846617_+|WP_001544449.1|DBSCAN-SWA MKHLTEMVRQHKAGKTNGIYAVCFARFVLHGASDVPDEYVRRTIGPGVCKVNVATELKIAFSDAIKAWFAENQQSNDPRFYMRVGMDAMKEVVRSKIAVCGSANRLRLPAEA >NZ_CP048344|2825482:2867030|2848313_2849489_+|WP_032180015.1|DBSCAN-SWA MFDFDKIIERQNDKCRKWDHTFVCSRFGDVPESFIPLWIADMDFTSPPAVIDGFRRIVEHGTFGYTWCFDEFYDAVIAFQRKRHQVEVEKSWITLTYGTVSTLHYTVQAFCKPGDSVMMNTPVYDPFAMAAQRQGVQVLANPLRVEENRYQLDFNLIEEQLKTHRPTLWFFCSPHNPSGRIWREEEIRQVSDLCQRYGTILVVDEVHAEHILDGKFASCLTSGCAAQDNLIVLTSPNKAFNLGGLKTSYSMIPDDSLRQRFRQQLEKNSITSPNLFGVWGIILAYQHGLPWLDALNGYLQGNARYLADALQTHFPAWKMMNPESSYLAWIDVSADERSATQLTQHFARQAGVVIEDGSHYVQNGENYLRINFGTQRYWLEQSINRMLKNDK >NZ_CP048344|2825482:2867030|2862415_2862712_-|WP_001303889.1|DBSCAN-SWA MPDQSGHCRTPSRILRMPELSQLLGISRSTIYEKMNPLSKYYDATFPRPVRLGSGSVGWRSSAIDEWLTLHTVPARSAVKGVNDEVRHTLSSDMTMSR >NZ_CP048344|2825482:2867030|2834300_2835839_-|WP_124039032.1|transposase|DBSCAN-SWA MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYCQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTPVQVLMPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGIPAEQRLAERQRKTKPLLKSLESWLREKMKTLSRHSELAKAFAYALNQWSALTYYANDGWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVSELLPWRIALPAE >NZ_CP048344|2825482:2867030|2843688_2843826_-|WP_154761740.1|DBSCAN-SWA MQITEALLSEPRDIWRFVQQAVDHWPRLLAVHFTRHSTRVNIDDY >NZ_CP048344|2825482:2867030|2831368_2831545_-|WP_000179884.1|DBSCAN-SWA MATEIKKFEKRDLAQAVIGVGVMVDFISRIMSVLADSWGCWCGLPIDEKTSNYLAYYL >NZ_CP048344|2825482:2867030|2825482_2825941_-|WP_021513032.1|transposase|DBSCAN-SWA MGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREKRRATGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKGKSSLMLYEQFGDLKFKYRNREFWCRGDYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPCPGSPFTDRK >NZ_CP048344|2825482:2867030|2826669_2827815_-|WP_000976514.1|DBSCAN-SWA MMKKSLCCALLLTASFSTFAAAKTEQQIADIVNRTITPLMQEQAIPGMAVAVIYQGKPYYFTWGKADIANNHPVTQQTLFELGSVSKTFNGVLGGDAIARGEIKLSDPVTKYWPELTGKQWQGIRLLHLATYTAGGLPLQIPDDVRDKAALLHFYQNWQPQWTPGAKRLYANSSIGLFGALAVKPSGMSYEEAMTRRVLQPLKLAHTWITVPQNEQKDYAWGYREGKPVHVSPGQLDAEAYGVKSSVIDMARWVQANMDASHVQEKTLQQGIALAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVEVNPPAPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPVRVEAAWRILEKLQ >NZ_CP048344|2825482:2867030|2828138_2829401_-|WP_000608644.1|transposase|DBSCAN-SWA MINKIDFKAKNLTSNAGLFLLLENAKSNGIFDFIENDLVFDNDSTNKIKMNHIKTMLCGHFIGIDKLERLKLLQNDPLVNEFDISVKEPETVSRFLGNFNFKTTQMFRDINFKVFKKLLTKSKLTSITIDIDSSVINVEGHQEGASKGYNPKKLGNRCYNIQFAFCDELKAYVTGFVRSGNTYTANGAAEMIKEIVANIKSDDLEILFRMDSGYFDEKIIETIESLGCKYLIKAKSYSTLTSQATNSSIVFVKGEEGRETTELYTKLVKWEKDRRFVVSRVLKPEKERAQLSLLEGSEYDYFFFVTNTTLLSEKVVIYYEKRGNAENYIKEAKYDMAVGHLLLKSFWANEAVFQMMMLSYNLFLLFKFDSLDSSEYRQQIKTFRLKYVFLAAKIIKTARYVIMKLSENYPYKGVYEKCLV >NZ_CP048344|2825482:2867030|2865827_2867030_-|WP_000279872.1|integrase|DBSCAN-SWA MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQKRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPTFADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADVAETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQTRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEARGMRWAEIDFHKRVWTIPAERMKARLQHRVPLSRQAIYILENIRGLHDELVFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQGYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK >NZ_CP048344|2825482:2867030|2851560_2851767_-|WP_024167628.1|DBSCAN-SWA MAPLNDALYRYVMNTRLGTIHGTSVGELLAWIKEDENPRKGEMVLIIEGHKAQDDELPADALRTLALL >NZ_CP048344|2825482:2867030|2858937_2859501_-|WP_032180014.1|DBSCAN-SWA MYNANPNYEMDFMILKDVNEHMDGMFQRFSKLLPFRIDFAYRKDTPSFGHSCKHSMCMEIYRLLSETQTMLAGYYWVMEYTPNKGLHIHFIGYLDGQRHKKFYRISRQLGDIWSRITEGEGYFHLCSTKDKYPVRIDHVIHYSDKSAVDGLRYALSYLAKQDQKEHGIILGRSRLPEKSNRGRPRHN >NZ_CP048344|2825482:2867030|2844705_2845143_+|WP_001335688.1|DBSCAN-SWA MVKKALGSIAKSGKTAIVEVLSPGQHPTKRELIYAATPARDFVCGTQQVASGITVQVFTTGRGTPYGLMEVPVIKMATRTGLANHWFDLMDINAGTIATGEETIEEVGWKLFHFILDVASGKKKTLSDQWGLHNQLAVFNPAPVA >NZ_CP048344|2825482:2867030|2853304_2858263_-|WP_001333439.1|DBSCAN-SWA MWDGGLQEQEVLAIEKIKAAFSVNVSKPDKPFRSGSISEQLKSYGFIGNEMFPWKGYAGFRFVEAKKEGEFDLVIVTHCNVIIVELKDWNHQPVTARGDTWFKGDKNMGRSPVSVTRSKKFMLDKKLKRLVDRFTNKGYIPIVHFFVVMTGNADFSALPEEQRRHTISLKDFLKFADRGSFNNYFKPHPATKVLNKDFHLFDDLFLGPQTAPKALRVNGYEANDMIFEHPKKVYREYLAKSEISTNSEALLRVWNFRNITGTKANTPEGRAQIVSREREVLQHINHQNRDLYNHCLRSLTSFQKDEVTAEYSEVYEVPPGHVRFNEFIGKYGKNFSDMDRLNVVKLLIAKFSDLHEMKIAHRDVADHSLWISPSKEVALSNFISAYHQPAGTVGDYRKLLSVGAVHVKDMLDKGELTPFQQDVHTLGLVAWHLFSGMRMSPKSLEKVQDNMLNSQHWYSSVLRDAVAAKFTSATEFFDALKQAEPAGKDIPTFDDTELDPYRHAINHARQYPEDDGFQFQVETVDKEVYISKGRLVKAWLNVGGQGYDPSINFQVLKFLKQVERLSSVKTTYLPQIREFGIASKSSSLYMVTDQVQGETWDKIAVPDDEKIDLIGKFVAAVEHLHGLGVSHGDIHPGNVIFETQSRLLFLIDIPDFSPSGDEPKNHSYSPEYIDNCTSFERDNYAVMKMSCELLGMSWGLESDIYPTIANAIRAELEDPVFGFKDLGRFKKAIDSNDLVPEQDLIEITAGNADEIISILPDNGHLYVKVKSNPKAPAEVNVTFSGIGGSFTAVFNKDQKTLVHGFRPRARVTIRKQDIDESQFEIDTGIKIIPGSPQDLSALTVLLNEEESFARAIELIAATEDVQVQEPLTLQLKDTFARLDKQTLEPSLREVLEIPTVKLWRAILDTETESYPNIEISGEVVPVADAHGELLLPYSADVDPLGAFRSSDEVEALQVDQEGVERFIGEVSLKKSELKEIRLVKVSSAAFKLKDSDIVFFRTRPTRASYQKRKRALERLLDRESVLPDLIDLFDPSCKQAAQNYGITLSDTDFARYDREDQHGNKISLNEQQRKAFNKLVNNGPLSLLQGPPGTGKTEFIAAFVHYLIEKQNTKRILLVSQSHEAVNTAAERIRKHCSRLGTELDVVRFSNREGAVSPGLKDVYSHAITTEKRELFNAEIKYRVEALSEAIGLEPGFISGVVLAELNLFRQIDHLEKLLYQVNNLTDSNESNELKDIAVELDFSIRSKLSQEYGINLDNGVKVSAAKDILISKLCTEYGVRPDEARRVKALAKISRDMQDAMSGERVNLDEFYSRSRQLVAGTCVGIGQGHIGIQENIYDWVIIDEAARSISSELAIAMQSARRVLLVGDHMQLPPLYSDAHKAALARKLGINNSRTEIDEVLRSDFARAFNSAYGAQTSAALMTQYRMAPPIGNLVSKTFYDGKLLNGVRAIPDVYQQAPEALRSVVTWLDTANQGHRAHHLEDRGTSIYNRCEADEIISVLKQVSENEEFVAKLSKLVSKDEAAIGVICMYAEQKRLLRQKFNQEIWSEGFKDIVKIDTVDSYQGKENRIIILSLTRSDKQHSPGFLRVPNRINVAMSRAMDRLLIVGNADIWKGNNKELPLGYVVSYMAERGQEAGYRFLSAQQGGKKK >NZ_CP048344|2825482:2867030|2849502_2851392_+|WP_000757210.1|DBSCAN-SWA MKKVLTLSLLALCVSHGAAAANYALNNDNIALLFDDTNSTVVVKDNKANHPLTPQELFFLTLPDESKIHTADFKIKHVEKQDNAIVIDFTHPDFNVTVKLNLVKGKYANIGYTIAAVGQPRDVAKITFFPTQKQSQAPYVDGAINSSPIVADSFFILPDKPIVNTYAYEATTNLNVELKTPIQPEAPVSFTTWFGTFPETSQLRRSVNQFINDVRPRPYKPYLHYNSWMDIGFFTPYTEQDVLGRMDEWNKEFITGRGVALDAFLLDDGWDDLTGRWLFGTAFRNGFSKVREKADSLHSSVGLWLSPWGGYNKPRDVRVSHAKEYGFETVDGKLALSGANYFKNFNERIIKLIKNEHITSFKLDGMGNASSHIKGSSFASDFDASIALLHNMRSATPNLFINLTTGTDASPSWLFYADSIWRQGDDINLYGSGTPVQQWMTYRDAETYRSIVRKGPLFPLNSLMYHGIVSAENAYYGLEKVQTDSDFADQVWSYFATGTQLQELYITPSMLNKVKWDTLAKAAKWSKENASVLVDTHWIGGDPTALAVYGWASWSKDKAILGLRNPSDKPQAYYLDLAKDFEIPTGDVAQFSLKAVYGSNKTVPVEYKNATVITLQPLETLVFEAVPVN >NZ_CP048344|2825482:2867030|2860321_2861755_-|WP_000335695.1|DBSCAN-SWA MCLLAPENPYPIYALPPLVRNAIIETQKNTQAPLAMVATSALTAISIACQNQIDVCRPGNLHGPVNLYSLILADSGERKTTVDKVFMKAFYLRDEALADEYAKLVENYSTEKEIWEQKQKALESKFHKEIRAGKDYKATESELETHLNKPPVPPQIRRTIFNETTIEGMLKYYSDSNRSFALVSSEGGVIFDSRAMSKLGIINTLWDGGSLFIDRKSSPGINLKEPRLTISVMIQPDVYHKGFCTRKKEIVKTSGHHARFLMCQPTSTQGTRIITGDNYSSQYQDLFEERINELIDESLAMSGERRCLHFSPQAARIWTDYYNDVESKLGGLGPLRHCREYAAKNAEYMARLAGLIYHSSGEEGEISPYTAEMARELAIWYGNEYVRLSNPLTFDNSALTVPVRLIPEELELFNWIKSYCIEKGILCMKKNDILQRGPNRFRKKDKINWLLDLLYEQNRVVPVIEGKTLCVAPNFDL >NZ_CP048344|2825482:2867030|2831788_2832568_-|WP_032262852.1|DBSCAN-SWA MMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQVMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLRQKRKAGVIAEANPE >NZ_CP048344|2825482:2867030|2835888_2836236_-|WP_000612591.1|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >NZ_CP048344|2825482:2867030|2840662_2841445_+|WP_001295538.1|DBSCAN-SWA METKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVNKPAPSMPVIHDVPKNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVHNHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPADYVTHCQNGSVKIITPDSEDE >NZ_CP048344|2825482:2867030|2845205_2846030_-|WP_000072197.1|DBSCAN-SWA MSNTDASGEKRVTGTSERREQIIQRLRQQGSVQVNDLSALYGVSTVTIRNDLAFLEKQGIAVRAYGGALICDSTTPSVEPSVEDKSALNTAMKRSVAKAAVELIQPGHRVLLDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAEQSLQNYHFDMLFLGVDAIDLERGVSTHNEDEARLNRRMCEVAERIIVVTDSSKFNRSSLHKIIDTQRIDMIIVDEGIPADSLEGLRKAGVEVILVGELASSL >NZ_CP048344|2825482:2867030|2843988_2844624_+|WP_032180018.1|DBSCAN-SWA MVKSFAIGYTVRDVAKGSWIDESTVTLPKAPPLNTLPRATKVPEPQQPQEDYTFEGYRNADGSVGTKNLLGITTSVHCMADVEDYVVKIIERDLLPKYLSIDGVVDLNHLYGCGVAINVPAAVVPIRTIHNIALNPNFGGEVMVVGMQCGGSDAFSGVTTNPAVGYDSDLLVRCGATVMFSEVTEVRDAIHLLTPRAINEEVGRRLLEEMA >NZ_CP048344|2825482:2867030|2842157_2843386_-|WP_121372241.1|transposase|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQASNGLSDNRRLEI >NZ_CP048344|2825482:2867030|2846730_2848302_+|WP_000192271.1|DBSCAN-SWA MTQKKSFKSKLWEFLQSLGKTFMFPVSLLAFMGLLLGIGSSVTSPSTITSFPFLGGEFTQLTFGFIAMVGGFAFTYLPLMFAMAIPMGLAKRNKAVAAFAGFVGYMLMNMSINYYLTATHQLADPATMKQVGQSIVLGIQTLEMGVLGGIVVGVITYFLHDRFQDTVLHDAFAFFSGIRFVPIITALTLSLVGLFIPMLWEYVALGIAGIGHIIQSTSVFGPFLYGVGVLLLKPFGLHHILLAMVRFTPAGGIEMVNGHEVAGALNIFYAELKAGLPFSPHVTAFLSQGFMPTFIFGLPAVAYAIYRTARPENRPVIKGLLLSGVLVSVVTGISEPIEFLFLFIAPALYAFHIVMSGLALMVMALLGVTIGNTDGGILDLLIFGVMQGMSTKWYLLFPVGIAWFAIYFFVFRWYILKHNIKTPGREVDVQGAQQAVEANTRARGKSKYDHELILRALGGKENIESLDNCITRLRLVVKDMGLIDQQALKAAGALSVVMLDAHSVQVIIGPQVQSVKTGIEALI >NZ_CP048344|2825482:2867030|2863823_2865641_+|WP_042002566.1|DBSCAN-SWA MYNFITIMYDVFSCFGVLAKNQNSRDIRNIKNFSSHQHSLGDMFDELINIIDKEQVLSKEQRKVIFRRYEDLYVKLMHYSVFTDKTHQIIKQKYFNDIVPMILALDIRNTYRPDNEMAFYYHIHSFLTQIPDNEDDIYHAARTYLRNYVKLCLSGYTPANAHFKDIFDGVYEFIRNIRKNSTPGKTKLIATINTCKETCKHLLYLSNEDKEKIISDLDKVQVACYYLTILLAFERRTSLTSTLTTLYKMLISEREVSEYECQLLYLTNPIDVMNILNKYIYYFPNENSPFYTLKIDSALSWDAIDAIRDYSISDIYLYPEQKTINCVVEIENIVFGGYIYTLNNGVTLQNIENSLKDSSCHYVLNGYTEFVNCLRQLTSGKTESVHRTINKLNYEKLPFGFIIAAFAILKIAFKIKFSKNHVNIRALLNDINYFMTYQGESINLISLDHEYPESCLQNDTNTYLLGRVIFLYNSMIYKFINCQEHETNNIHSAMINNLLQEVDIALGKINDIIDSRNISAPHELANILTREKILTTREKKGNLISLFDGFTLFHCVGMITFLIHYLRTPEEKVENIFMLYGADKNNKLRRRLIYDALGIIQSQQE >NZ_CP048344|2825482:2867030|2851871_2853308_-|WP_000622487.1|DBSCAN-SWA MISDNKVTYHEVDFLLPAQRFNIQFSYVSQKGLPFIREFVLRLVHVASMSKAQIATYFGLTHRETEEAISDLVQRGELTLSSDGRLALTDKSNGYFSEVGEIPHLSTIQDSGCTLSFDLATFSCFSNQAFQDHWRGGVTLKVDDNNISQSEKLVEKHFQYQFNQILDKGYLSHLQVQVGKEQPSIYTVNSVNKLRQIPLRLTTEFKMDSDGKAVEREDYEQLNSSECVHELMAVEIARLSRHNNTMSIFKAMQSINDDLTLKLFDSKTNRLNPLFIKDMQALEEYSSSGRTTFLGPVYSKNNWEKLQKALAPALNQRIRDKTDYGGSPFIWVAPSDPYWSKSIRFISSLSNFLSKSATKDKQLYKPVLYVPVRDAADARVARQWKYELGQFHEYAKGLVEGLLDGNVEILHLEDELSAVIYHISQPEILPVTMPLGFITTNKETVKIIGTLVTEYIKGTTGFDKFHDCGAISAMIKKG >NZ_CP048344|2825482:2867030|2839436_2840357_-|WP_000350265.1|DBSCAN-SWA MDIAVIGSNMVDLITYTNQMPKEGETLEAPAFKIGCGGKGANQAVAAAKLNSKVLMLTKVGDDIFADNTIRNLESWGINTTYVEKVPCTSSGVAPIFVNANSSNSILIIKGANKFLSPEDIDRAAEDLKKCKLIVLQLEVQLETVYHAIEFGKKNGIEVLLNPAPALRELDMSYACKCDFFIPNETELEILTGMSVDTYDHIRLAARSLVDKGLNNIIVTMSEKGALWMTRDQEVHVPAFKVNAVDTSGAGDAFIGCFSHYYVQSGDVEAALKKAALFAAFSVTGKGTQSSYPSIEQFNEFLTLNE >NZ_CP048344|2825482:2867030|2838092_2839409_-|WP_000998346.1|DBSCAN-SWA MNDKNIIQMPDGYLNKTPLFQFILLSCLFPLWGCAAALNDILITQFKSVFSLSNFASALVQSAFYGGYFLIAIPASLVIKKTSYKVAILIGLTLYIGGCTLFFPASHMATYTMFLAAIFAIAIGLSFLETAANTYSSMIGPKAYATLRLNISQTFYPIGAASGILLGKYLVFSEGESLEKQMSGMNAEQIHNFKVLMLENTLEPYKYMIMILVVVMVLFLLTRFPTCKVAQTSHHKRPSAMDTLRYLAKNSRFRRGIVAQFLYVGMQVAVWSFTIRLALELGDINERDASNFMVYSFACFFIGKFIANILMTRFNPEKVLILYSVIGALFLAYVALAPSFSAVYVAVLVSVLFGPCWATIYAGTLDTVDNEHTEMAGAVIVMAIVGAAVVPAIQGYIADMFHSLQLSFLVSMLCFVYVGVYFWRESKVRNALAEVTES >NZ_CP048344|2825482:2867030|2837067_2838081_-|WP_000107485.1|DBSCAN-SWA MSTRIYLWRALFGEKPRILLENSDFTVTSFRYDSGVEGLKIANSRGHLIILPWMGQMIWDAQFDGHSLTMCNMFRQPKPATEVIETYGCFAFHSGLLANGCPSAEDTHLLHGEMACAAMDEAWMELEGDMLRLTGRYEYVMGFGHHYLAQPTVVLHKSSTLFDIKMAVTNLASVDMPLQYMCHMNYAYIPNATFSQNIPDEILRLRESVPSHVNPTPQWLAFNQRIMQGESSLSTLNQPEFYDPEIVFFADKLDAYTDQPEFRMIAPDGTTFVTRFSSAELNYVTRWILYNGEQQVAAFALPATCRPEGYLAAQRNGTLIQVAPQQTRTFTVTTGIE >NZ_CP048344|2825482:2867030|2841446_2841545_-|WP_001387298.1|DBSCAN-SWA MISQIDKLEYVVKVQRNQSNPTMFNKIVVFFQ >NZ_CP048344|2825482:2867030|2836232_2836613_-|WP_021566758.1|DBSCAN-SWA MQKNVTPGRRKGCPNYSPEFKQQLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >NZ_CP048344|2825482:2867030|2829666_2831013_-|WP_000483766.1|transposase|DBSCAN-SWA MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >NZ_CP048344|2825482:2867030|2861973_2862171_-|WP_032141622.1|DBSCAN-SWA MIGKAIIKAQYNKQVPLTLVAMSVLTAMSIACQNQVDVCSPRNLRGAVNIYTMVLEKGKRPGERY |
32 | Stx2-converting_phage(37.5%) | integrase,transposase | attL 2864455:2864469|attR 2870646:2870660 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3156380 : 3183093
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP048344|3156380:3183093|DBSCAN-SWA TATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTATAAAAAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGCCCCAATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTTAATCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCATTAACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTTTTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTTTACCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTTCCTTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTATTGGCGATCCGGCACATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTCGTCAGGTCGAGTTTTGGTTTGCCGGCTGTCACGCCAGCCTGTTGCTTGTTTCGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAACAGTACACACATGTCGCGGTGGTTACGTCCATCACGTTCACGCATCAGACGGATATCGTTAGCCCACCCTGCAAAATTCGGTTTTCTGGCTGATGGCGCGATGGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGTTCTGAATCGCAGCATCCGCTTTCACCACAGGAAGGTCGTTTTCTGGCAGGTCAGAGGATTCGCCAGAATTCTCGGACGAATAAGGTTTTATATTGTCTTTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAGTGCCTTTACCTGATTTGGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCTGATTCGGGTAATGTTGACCATTCACTGACCACATTATTAATGCCTATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTTACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGACCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGATATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCTATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_7 >NZ_CP048344|3156380:3183093|3173199_3174075_-|WP_162676758.1|DBSCAN-SWA MDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGTYPNQVNTKDKLTKDNIKPYSSENSGESSDLPENDLPVVKADAAIQNGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKPKLDLTNTDWIYGVDL >NZ_CP048344|3156380:3183093|3181826_3182045_+|WP_001303849.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >NZ_CP048344|3156380:3183093|3174333_3174564_-|WP_001067458.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >NZ_CP048344|3156380:3183093|3164422_3164629_-|WP_001228702.1|DBSCAN-SWA MRKLKMMLFGASLIMVVGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQLLDETFSVSGTELSATKQH >NZ_CP048344|3156380:3183093|3164241_3164394_-|WP_001139678.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >NZ_CP048344|3156380:3183093|3179605_3179788_+|WP_072126246.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >NZ_CP048344|3156380:3183093|3179760_3179952_+|WP_023148020.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >NZ_CP048344|3156380:3183093|3170717_3171170_-|WP_000825400.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH >NZ_CP048344|3156380:3183093|3177562_3177769_+|WP_000233576.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >NZ_CP048344|3156380:3183093|3163479_3163890_+|WP_000079508.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >NZ_CP048344|3156380:3183093|3170523_3170625_-|WP_072157016.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >NZ_CP048344|3156380:3183093|3181619_3181787_+|WP_000545745.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >NZ_CP048344|3156380:3183093|3169259_3169622_-|WP_001372483.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >NZ_CP048344|3156380:3183093|3176226_3177054_+|WP_000210934.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >NZ_CP048344|3156380:3183093|3162240_3162801_-|WP_001372490.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >NZ_CP048344|3156380:3183093|3169122_3169263_-|WP_000971068.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >NZ_CP048344|3156380:3183093|3177844_3178141_+|WP_000995439.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >NZ_CP048344|3156380:3183093|3168659_3169037_-|WP_001204777.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >NZ_CP048344|3156380:3183093|3163189_3163423_+|WP_000105084.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >NZ_CP048344|3156380:3183093|3166827_3167787_-|WP_000592543.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >NZ_CP048344|3156380:3183093|3164845_3165343_-|WP_001372488.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >NZ_CP048344|3156380:3183093|3165342_3165558_-|WP_000839582.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP048344|3156380:3183093|3156380_3157670_+|WP_001356070.1|DBSCAN-SWA MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >NZ_CP048344|3156380:3183093|3178928_3179609_+|WP_001372450.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP048344|3156380:3183093|3172211_3172505_-|WP_000145917.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >NZ_CP048344|3156380:3183093|3160681_3161350_+|WP_000239881.1|DBSCAN-SWA MVKLMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >NZ_CP048344|3156380:3183093|3182022_3183093_+|WP_000533646.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >NZ_CP048344|3156380:3183093|3160355_3160532_-|WP_072163407.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE >NZ_CP048344|3156380:3183093|3157728_3158205_+|WP_000767389.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >NZ_CP048344|3156380:3183093|3169901_3170072_-|WP_000224914.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >NZ_CP048344|3156380:3183093|3175480_3176230_+|WP_000389051.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >NZ_CP048344|3156380:3183093|3172501_3173203_-|WP_001372464.1|DBSCAN-SWA MKNIAAQMVNFDREQMCRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >NZ_CP048344|3156380:3183093|3158950_3160282_+|WP_001753290.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >NZ_CP048344|3156380:3183093|3171166_3171727_-|WP_000720581.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >NZ_CP048344|3156380:3183093|3180774_3181377_-|WP_000120065.1|DBSCAN-SWA MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >NZ_CP048344|3156380:3183093|3169618_3169909_-|WP_001372487.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA >NZ_CP048344|3156380:3183093|3170071_3170527_-|WP_001372486.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >NZ_CP048344|3156380:3183093|3180342_3180564_+|WP_000763365.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >NZ_CP048344|3156380:3183093|3178146_3178932_+|WP_000100847.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >NZ_CP048344|3156380:3183093|3167979_3168504_+|WP_000780581.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >NZ_CP048344|3156380:3183093|3179962_3180244_+|WP_001395510.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >NZ_CP048344|3156380:3183093|3174602_3175358_+|WP_000259990.1|DBSCAN-SWA MVVFSQQPFSFDGIKPWLYIWTMKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG |
42 | Enterobacteria_phage(48.48%) | integrase,lysis,tail | attL 3158296:3158310|attR 3183167:3183181 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
4066874 : 4073433
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP048344|4066874:4073433|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCAGTCGTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATAACCGAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTTCCTCTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP048344|4066874:4073433|4067831_4068599_+|WP_000175457.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >NZ_CP048344|4066874:4073433|4066874_4067831_+|WP_000684856.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >NZ_CP048344|4066874:4073433|4072608_4073433_-|WP_000594911.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >NZ_CP048344|4066874:4073433|4069156_4069414_-|WP_000177060.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV >NZ_CP048344|4066874:4073433|4070465_4071617_+|WP_001254876.1|transposase|DBSCAN-SWA MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >NZ_CP048344|4066874:4073433|4071987_4072560_+|WP_000227281.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >NZ_CP048344|4066874:4073433|4071536_4071887_-|WP_000747102.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
4490113 : 4508632
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP048344|4490113:4508632|DBSCAN-SWA ATCACATCCACATAATTTGCTGCCCTGACGGCAACGGGTGCGGCCTCACGGCGTGGACTTCTCCCGGCTTCACGATGTATCTCTGTACCGACTCATAAGTGATGAACGTGGCGCTGCAATTCACGTTCTGGCACTGGTGATAACGCTCTTTTGTCGTGTCAGTGATATAGCGGCTTGTACGCGCATGTGCGGCATGCTGGCATAAAGGACAATGAAACATCGCGAGCACCTCTTCCGGTTTTGTTGATAGTGCCATTTTAGTTAAATTATCATTATAAAACAAAAAGATAAACAAAAGACATCACTCATAATCTTCTGTTTCGTACTCCACATCAGAAACCGTCGCGGCGAATCAGCGTAGTGACGCCTGACTCGTTAAGCAGGTCAGCATCGGTGCCGGACTCCTGCAAATCCCAGAATACAGATGCGCTGATGCCGGTAACACCGTTCACCCCGACATTGGACAGCGTTTTATGCCAGCCCTGCTCCTGGTCGATTTTAGCGCGCAGCCCCAGCGCACGGGCGGTGGCATACGCGGTGGCGGTGGTACTGGTGACCGTATCCCATGCGAGGAAATCCGGCCAGATGACCATCAGCTCACGCTGGCTGAAATTCTGGCGGTAGGCTTTCACCTCGGAAATGGTTTTACAGCCCCATGCGCTGATATACCCGAAAGCGCGCAGCTTCTGACAGACTGATGCCAGTGCAACAGCCACCTCTTTGGTATCCAGCCCCGGCACGCCGAGAATACGCGGTTTAACACCGGTTACCGACTCCGCCGCCAGCAGGGCTTTCAGTCCGGTGTACTGACCGTTTTCGTCGGTGGTGCCGATGATATTGGAAACGGTCTGCGCAAGTTTCGCTTCCTCGTCGTCGCCGGTGCCGTCTTCCACACGCACGACAACGGTGACCGGTTTTGACTGGTCGGCGATGGCCTGCAACGATGCCGCCAGCGTCCCTTTTTTACCGGCCTTTGCAATTGCGCTCTGCACATTGGTAATCAGCACTGGTTTATTGAGGGGGAAGATTTCCGCATCCGCATCGCTGGCCGTGCAGACCATGCCAACAATGGCGGTGGATACGGTGGAAATGACGCGGGTGCCGTCGTTAATCTCCAGCACCTGCACGCCGTGATGATAGTCACTCATCCGTTTAACTCCGTGGTTAATGGGTGCAACTATTTTCTGTTGGGCAGTGCATGAGACGCTATTTGACCTGGCTGGTCAGTGGATGAAACAACAGATAAAGAAAAGGCAGGCAATTCGCCCGCCTGTCCTGATTTGTACTCACTCATTTTCCGACTGACAATTTACATAGCCAAAACGCTATCAAATCTGACAGTCTGCTTTGAGCGAGGAGCAGAGGTTAGTTTTAGTTAACCAAAATGATAAAAAGCAGTAGAAAAATCCGCTCATTACGTTATGGTTATAAGCCGCACATAATCATCCGAGCCAAATCCTCTTGATTTCAAAGAATTAGCACTTGCTCTTCACTAGAAACTATGGTTCTGACTTCACGCTCAAATATTGGATCCATTATCATTTTTCTATTTGTTGGAATAGTAAATTTACCGAGTACCATGACAATGCTACTCAACGGAAATAATTTTGGAGTAACATTTCCTAATTGAATCGTAGGATATATATTAAAGCCCCCTTTTATCCTCATTGGCGAACCTTCAATGAAGTGTCGATAATTAAATTTCGCCATAGGGATTCTCGCCACATGCACTTTCATAATTACAGCAACCTGAGCTATTTTTACTTTCTGGAGGTAAGTTGACAATCGTTCTAACTGCGTTGTTTCATAGAAAAAATCAGACCACGTTACAGTACTTGTTCCAGTAACAATTTTTATGCTGTTTCTCTTGTTGGGGTGAGCTTTTCCATATTCATAGATCTCCCGTAAGCTATCCATGTTCTTCACATAAGGGGCTTTCCGACCTCTCTTGACATAAACGCGGTCTGTAGTATCAGAAGGTGGATTTGCCTGCATAGCTGCAGCTTTTCTGCTTAATTTAGAGATATCATCTTCATCGAGTATATGTATACGGAATATATTATTACCTGTGGCAAGCGCACGGCTGATATCATGACTGGAATCACTTGCATAAATTGTATTCAATCCTGAAGTTCCATACTGGCATAATTTGTCATGCGTGAAATTTTTTTCCAGTCTAAAGAAGGCGGGGATATGCAAGGTTTTATTAAAACTCCGACGAATATGGCTTTTGACATACACCAAATGAGCAGAGCAATTTGGGTCAGGACATTTGAGTGGATAGGGATTAACACTATAATTAAAAGTATTTACATTTACTAAAATCTGATTGTTATCAATAGCCTGAGTTATACGAATACCACTCATTACTCCTCCATAAAGATAGGCGCAAGCATTTAGTAATAAACCATTAACATGCCGAGATCATACCCGATCATTGAAATGTAAACACTCCCCTAAGACGGCTGCTGGCATAATATTTTATGACATCAGCAATGCCCACCTCTGGCACAGAGTGGACTGTCAGATTAGGCTTTACTCTGTGCCATAGATATGTAAGCCCACACTAGAGCTCATACAACTTATTGCGGCATTTCCGGCCATTCAGGATTTGCAGGATCCACACGACTGACCAGAACACTATAGCGTTCCCATGCCTCCAGTCGACTACGCTCCTCATCTGTTGCCATATTCAGCCTGACCGCGCGCTCCAGCGGCAAAATCACGGATTCAGCTTCGGAAAGCAAAGCTGCCTTATGTAATTCCGCCAGTTTCTGCTGTTCGTCTGCCGTATAAATCCGCTTAATCACGGCACCATCCTTAAACATCCATTTACCTGAGTCGTCAGCACGTCGGTTGGAGGTAATATCAGGAACCTCAACAACGCTAAAACCTTCAGGATTAAGCGTTGAAGCATCTCTGGTGATAGCGACAATAATATTATTTTCATCGTAAACAATCTTTATTGTGTCTGGCTGAAAGTTCTTCACTTCCTCATACCAGTTTTTTCCGTCCTCAGAGTAAAGCCAGATAACTCCGTGTTTCTTTGTTAACTCATACTGTTCCAGTGTTTTAGCGTTACCCGCTTTTATGTTCTTTAAGTGCATCATATTAAACGCTCGCTACATTATACCAGGTGCCATTTATATACTTTTGAACGGGTCTGTAATAAACGCCCGCTATATTATCGGCAGAGTTGGACCCTGTATCCTGAACATTAATACCAGACAATACATGACCTGACGGGCACTGGAAATTCCATGTTTGCCAGTTGTTCACTCCATAATATTGCTGTGACCCAAGTCGAACATCTTTCACATATCTGGAATCAAAATTGCCATAGTTGCCAGGAATAACTTGCGAGCCACAAAGCCAGTTACCGTTATTATCCATGTACGCCTGACCATCGGTGCCATTGGCTGTCCTTGAGTTATTAATCATGTAGATGCCAAATTGCTTATTTCCCAGACCGCCAATCATAAATTTGCGGTCGGCATGGTCCTGACGGAGCAAAGCCTGAGCACCATCAGTGGATACCGCATTACGTCCAAAAATAACATTCTGGTCACGCATATGAATCCACATGCCTGTGCTGCTGTTAATTGCAAGGCGGTTTGCGTACACCCATGCGTTAGTTGTTATATCTCCTGTAACATCCAGAGCGTGCCCCATAGTTATGCGACCAGTTCTGAGATTCAACGAAAAGGGGCGTAGTGGCCCGATATCACCATTTTCCCCCTCATTCTCTCGTGTGGGGATAATATGCAGGCATTCTTCAGAACGGCGAAAAATAGCACCAAAGGATGAATTAAAGATTCTCAGTGCATTGACTGTCGATATTTTTACTTCACTGCTGAAAAGGGCTTTAACAAGGACAGACAAAGCATCCCATTTAAGATTCATCAGGTCTTTTGTTGTGGTGCCCTGACGGCTTCTCCATTTGAAATATTCATTGCCGTTGTCGCCTGTTTCAAACCACATGTATGAATCAGTGTCACCATCGGCATCATTTTTAAATCCAATCTTCGCCCAGTCAGTATTTCGAATCCAGGCAAGGATTGAGTCGTTTTCAAAAGTAAGTCCACCGGACAAGGTATCGCCATTTTTTTGCACGGCGTTCCCGGCTCGGTTTACCGTTTCCTGTAAACCGAGATATTCGATAACGGCGGCAACGGTCGATTTAGCCAGAATACCGTGAGCGCGCTGGTTTCAGGTTCATACTCAATCACCGCCCCGTCAGGGAAACGGATATGCAGGGCATCCGCCGACGCAGACGGCGCGGGGTTATCGCCGGAATAAATCCCCGGCAGAACGAACGCCGTGTCGAGTTCACCACCCACGGCCAGAATCAGCACCTGTTCCCCCACGGAAGGTGCCCACCATGTGCGCGAACGCCCGGCGCGATGGGTCAGCCACTGAAGCCAGTCGGTGCACATGCCGCCGGTCTGCACACGGCAGCGACCGGCATTAAGGTCGGTTTCGACGATAATGCCGGTGCGGATCATGTTGCGCAGTGCGCGCGCGAGTTCCTGAATATTTGCGAGAGTGTTCATGCGTGTGAGATTGCACAATATATAAAAGTTATGCTATCTGGATTCATTTGTAGAACGACCATACAACATTCGAGGAGAGCGTAATGTTCAGTGATAATGTGACTAATGCGTGGTGGTTTATCTCTTTGTATCTATTTTTATTAATAGCATTAACATTTGTTACCTTTGGTAAAAGTAATCTTATGAGGTTTATTGCACATCATTTCAATCTTGAGTATTCAGACAGAAAGTTAAAAATGCTCGACAAAAAATGGCGCGACATTCAACTATTTAAAATAATTAACGGAATCAATGTATCAGGCATCGAAGATGTGAGAATGATACAGCAGGGGCTGATTGATGGAAAACTAAAAACATCGTATTTTTTCCTTACTCGCTTCTGGGGTGACATAACAAAACCACCACACATAATTAAAACAACAATTGTAATTCTGGCCAGTATTATTTATATTCTCTTCGCATGTTATATACACAATGAACAATCCGCTATAGTAAGGGATGCCATAGGTATACCATATAAAAATATGATGTACTATGTTTATAGTGACAAAGTTCTTTTATCCTTCAAAAATAAAACGGTTGAATTTAATAAAACTTATAGCCTTGCCGATTGCAAGAGTCTGCAAAACGTATTTATAAAAGACACACTTCCCGAGATCGCCTGCAATAAGCTCTTACAGCTAAACAAGGAGGACTCCGAATGGTTAAGTCAGGAGATTAAAGATAATAACAGCTACAGAAAAACATTATTAATAATGTCCCTAACCTATTTCATTTCAGGTCTGCTTATATTCCTGTCATATACAAAATTCCTTTACGCCAATAAGAAGGTTTTAGAATACAAAGCATCAAATAAAAACCACTCATAAACCTCTAAATATTGAGCGACCAGCACGGCCGCTCAATGCTTAATTGCGCATCAGCCTCTGCCTGGATAAAACTAACGCTCAAGGTGAGCCAGGATAATCTCTTCAATCATCTGCACATCCTCACCGGTAAAGCCGAGCAGAGGACGCGCCGGATAATCAATTTTCTTACCGTCTTTCCGGTTTTCTTCCGACAGACCGAACTGATGCACACTGGCGATTTTCGGTGACTTCCCGCCGTAAAACTCCATTGATGCCTGTTCCGGGCTGGCGCGGATATGCAAAAAACGACTGGTGATAAGTTTCGCAAACATTTTTCGCTTAACACGACCGGTCTTTTTTCTGGCGCTCTGCTGCTGGCGTGGCGCGTAGGGTGTGCCGTCCGGAGTTTTCTGTGCCATCACCCGACGCTGCTGACTCTGCCGCAGGCGTTTCGCCAGTTCGGCACTCAGTCGCCGACGCCCTGACGGTGACAGCGATTCAATCAGTCCGGTCAGCCGGTCTTCAAAACGCTTAAACTCATTCATCCCACTTGCTCACCAGTTCGCCATTGATATACAGCTCCATCGGGCGGGTGACCGGCTCCGGCGGCGGAGGTTCCGGGATATTCTTCACATGCAGCGCGCCGTCCACCTCACTGACCAGCGTGCGCTCGGTCAGCATCAGGCTGATACTGATATCAAAGCTGCTGTCATTGTTGATGTCCGCATAAAACGTGAAGCCCTTTTTCTGGCCTGCGTCGGTGGTCATGATGTCGGGCTGATTTTCCCGCAGCCACGCTAGCACCGGCACGATGAGCAGGTCAAAATCACCGGTAAAGTCGGTCACAATGACATTGAGCGTGTAACGCTTTTCAAATGACAGCGACCTCGCCAGTGTGGAGGCAATACTCCCGTTATCCACGAATATCCGCAGCATCTCGGGACTGGTTTTCAGCACCGTGACAGCATCAGTCAGCGCCCTGCGCAGGCTGTCGGGTTTGAGCATCGTTTTCGTCCTGACAGTGTTTAATCATTTTTACCTGGCTGGCACAGCGTGCCAGCGCGTTCTCAAGCTGCCGGATATCAGCACTTAAATCGCCGTTCGTCTCCGGGTCACTGCCCGGCATCGGGCAAAGGCTCACTTTCGGGCAGGCGTTGTGGACAATCACTGGCGTCGGCGCAGGCCGGGCGCTGGTGCAACCGGCGCACAGCATCAGGCAGGTCAGCGCCGTACCAGCGGCGAAAATCTTCGTTTTCATTAAGTAACCTCGTGATGGTTTTCTCGCGCTGTGCTTCACGCTTCGCGGCGTTCTCCAGCTCCTGACGCAGTGCCACCTGCGCCAGCTCGTTTTTGTCTGCCCTGGTAAGGGCAACATGAAGCTGATTTTTCAGCATGGTGATGGTCGTCTGCTGTTCACTGGCGACGTTGTTCGCCCTGTCCAGCGAGGCGCGCAGGCTGGCATTTTTGTGTTTCACCAGAAACAGACCGGCCACCGCCAGTGATAACAACACGACCAGCACAATCATCAGCTTTGACATGGTTCCCGCCCCTCAAAACGCTGACAGCAGGCCGTACGTATCAGCCGGAAGAACACCGATGCCACGAGATAAATCAGCGCGGTAAAAATCCCCCCGGCAGCGACCAGCGAGATAAACGTCGCCACCATCACTACCAGAGCCACTGACCGCCTGCGCCACGGCACCGGCTGCAAAAACAGCGCCGTGACAATCTTCACGGCCAGCGATTCCGGCGGCAGCTCCCGCCCGTAACGTTCCAGTACATACTCAGTGGCATACACGCCGACACCACCGGCAACCACACAGATAACCGTCGCCAGAATCGCCCAGGCGGCGACAAAACTGACGGCCACGCTCTGCGGGTAAATCAGGGACAGTGCCAGCATCAGCGCCAGCGACACGTTCAGCATCAGTGAAAGGGATAATTTCTTCATGGTGTTTACTCCGTTTACCGGTCTGGCAAAAAGCCTGCGTGCTGCCGTGCATCACAGCTCACCGATTTACGTCAAACGTAACATTCTGGCCTCAACGTTTATCCCACACCCGTGGCTTTCTCAGCAGGATTTCAGCCGCTTTGTGCTGGATTTTCTGGTATTCGGTAATGCGTTTCTGGAAAAGCGTTACAGCACCACCGGTAAGGTCATCAGACTGGAAACCTCACCGGCAAAATATACCCGCCGTGGCGTGGAGGAGGATGTTTACTGGTGGGTGCCGTCCTTCAACGAGCCGACACCTTTCGCGCCCGGCTCCGTGTTTCACCTGCTGGAGCCGGATATTAATCAGGAGCTGTACGGTCTGCCGGAATATCTCAGCGCCCTTAACTCTGCCTGGCTGAATGAGTCGGCCACGCTGTTCCGCCGCAAGTATTACGAAAACGGCGCTCATGCCGGATATATCATGTACGTCACTGATGCCGTGCAGGATCGCAACGATATCGAAATGCTTCGCGAAAACATGGTGAAGTCGAAAGGCCGCAACAACTTTAAAAACCTGTTTCTCTATGCCCCGCAGGGGAAAGCTGACGGCATTAAAATTATCCCGCTCAGTGAAGTGGCAACGAAGGACGATTTTTTTAATATCAAAAAAGCCAGCGCCGCTGACCTGCTGGACGCGCACCGCATCCCCTTTCAGTTGATGGGCGGCAAGCCGGAGAACGTCGGGTCGCTGGGTGATATTGAGAAAGTAGCAAAGGTCTTTGTCCGCAATGAGCTTATCCCGTTACAGGACAGGATCCGCGAGATAAACGGCTGGCTCGGTCAGGAGGTCATCCGATTTAAAAACTACTCACTGGACACTGACAACGGCTGAACATCGCCGCCTGCGGGCGGCTTTTTTACACCCCGTCATCACGCCCTCACACGTTCGCCACTGTACAAAACACCCCGCAGACACACCAACGCCCCGGCAGGCCGACTAAACGCCATCACGACGCGCTCAGACGCTGAAAAAATAAAATCAGCACCACCGCCAGCGCGCAGTGCTTTCCCCGCCTCGCCCGCCCGCTTCATGGGTCGGTTTTGATGCAATTCCAAAAGCCGTCCAAACTCTCTTAGGCTAAATGTCCAACGAGAAAATAGTTCTTTGAATGTGAATGCATTTTAATGCAGAGTTATGCCCAGCATTTTTGTACACTTCGATGTATCAAATGCGCTGCAAACGATCAAATATGGATGTTTTATCAAGCATCCCCCAAAAGATATTTACATCATCCCATGAGGTTAAGATGGATAACAAAATCGTAGAAATTGAGACAAATAAGCTTGATTTTGACCCTAAAAACCCACGTTTCTTTCGTCTCAATGATGCCAGTAACGCTGCAACAGTCATTGAGGAAATGTTAGATGACGAAAGTGTCCACGATCTAATGCTATCAATCGGTCAGCAAGGTTACTTTCCTGGAGAACCTTTATTGGCAGTAAAAAGCAATGGAAACTACATCGTGGTTGAGGGAAACAGACGCTTAGCTGCTGTAAAGTTGCTCAATGGAGATCTGCTTCCTCCAAAAAGAAAACTTAAAGGTGTGCAAGAAATCATTGATGATACTACCAATAAACCTAAGAAGCTTCCCTGCATCATTTATGAAAACCGAGAGGATGTACTGAGATATATCGGTTATCGTCATATAACTGGGGTCAAAGAATGGGACTCATTATCTAAAGCCAAATACCTTAAAGAGTTATGTGATACTTTTTATTCACATGAGCCTAAAGAGATAGTATTAAAAAATCTGGCTCGTGAGATTGGGAGTAAACCACATTATGTTGCAACACTTCTCACTGCACTGAACTTATATGAAGTCGCGCATGACCATGAGTTTTTTAATTTACCCATGAAGGCTTCTGACGTGGAATTTTCATATATAACCACAGCTTTGGGATATTCAAAAATCACAAACTGGTTAGGTCTACAGGATAAAAAGGATTTTTTAGACCCAAATTTAAATGAAGAAAACCTTAAGCGTTTATTCTCTTGGTTTTTTGTGCCTGACCAACAAGGTAGAACCATCATCGGTGAGTCTCGAAGAATAAAAGATATTGCAGCAGTGGTTGAGAAACCCGAAGCAATTGAAATTCTCATGAAAAGTTCAAACTTGGATGAAGCATATCTATATACCAGCGGAGAAAGAGAAGCATTAGATAAAGCACTAAACGCAGCTAGTGTTAAATTAAGAGTAGTTTGGGATATGCTACTTAAAGCTAAAGAATTAACATTAGAGCATGAAGAGGCTGCATCTGAAATTTTTGAGATGTCAAAAAATATTAGAAATCAGATCAGAAGCAAAAGGGAGGATGATTGAGATTATGATTACAAATCTTGATTCAATGCCTTCTAATGAGCCTTATTTATGGGCTGATTATATTGAGATATTGGCCTTAACTAATATCGACAGGTCATTCAGTCGAGGAGACCTATATAGCACACTGCAAGCTCAACCCGAAGCAGTACTAGCTGAAACAGATGAAGCAGAAGAAGAGGGCGTTTATGATGTTGATGATGAAAATGATACGCCTGTACGCAAGAGAACAAAACGAAGTGTTAGTCGAGCATATACTGACAGAAAGTGGAGCTATGCGATAGGCTTCATACGACAACGCATTGATTTATTTGGGGATAGTTACCCTTTTACTTTATCAGAAGACAACGATACTGTAGAGTTACGTGATATATCAGAAAAGCCACTGGAACATTTAGAAAGACTATATTTAGCTTTACTAATCTGTGCTAACATAAAATATGTCAACATAATGAGCAGAAGAGAGATAACGCGCAGTTTTGAACTAATTAGTTTACCTATTTTTGAAAGCCTAATGCCTAGCGGTAGCATAATAAAAGCATGCTGGGCTTCTGGTGGTCAAGCGGCCCCTTACACTGGAACTCTATATAATAAATTTAAGAGTATTGCTTCCGATATCCGTTGCACAGCGAACTTCAAAGAACGAGATTTCAGTCGAGGAAATAGTGGTGACGGAGGCCTTGACATAATTGCCTGGCATCCAATGGGAGATCAACGAGATGCCATCCCTATTTCTTTTGTTCAGTGTGGCTGTTCTCAAGAAGAGTGGGAAGCGAAGCAGCTTGAGGCCTCACCTGCGATGCTCTACAGTAAATTCCCCGTAGCTCACCGATGGGCAACTTATTATTTCTTACCTCAAGATCTACGATGGATAGATGGTGAGTGGGCGCATAAAAATAAGTTAGGCGATGCTATTTTTGTTGATCGCCTAAGATTAATCAATTTAACCAGAGCATCTGATAATATTGATCACAGTCAAAATATTAGCTATCTAGATATCATCCTTGATCCTTCCAGCGCGATCGCTGCTTAATCCCATAAATCTGGAAGGTTTCTAGCAACCGCCTCAAACAATGGAGGCGGTACTGCATTACCTACCACAGTATATTTCATATTAATAGAAGCCCGTTCAGTTTCTGGGAAAATTAAATCCCCAAAACCTTGTAAAATAGCAGCCTCTCGATAGCTAAACCTACGAGCTGGTGCATCCGAAGTAAATTGCCACTTATCAGGTCCCAATTTTTCTAATGTTGGACTTATTGGATGTAGAGGCATATGTCTAGGATTTGCAACAATTGTTTTAGATATCTGATCCCAATCTTGCCTACGGTTTCGCGATAGATAATACCAATGAAAATCGGCGTCATAAAACTCGCCAACAGGCCAAACAGGCATATGCCCAATAGCATCACGAATTGTGGAATATGGTGTCAAACCATCACCATGTGTTGGTTTTGGAAATTTGTATGTAATACCGTAGTCCTTTCGTATTCCTACGATAAAGATTCGCTTCCTATCTTGGGATACCCCATAATGGGACGCATTCAGAATTTGCGAGCTTACTGTATAACCTGCTTCTTCGAAAACTTTGAATTGATCCTTTAATAAATGCTCAAAGTTACGCCTTACCATACCAGAGACATTCTCTACAATGAATGCTTTTGGCTTAATTTTACTCAAAGCACGGGCAAACTCTAAATATAGTGTATTAATCTTTCTATCTGCCTTCCTTGCCCCACCTTGACTAAATCCTTGGCAAGGATAGCATCCGATGAGCAACTCAGCAGAAGGGAACGACTGGAGCCCTGAGATATCGCCCAAAATGTAGTCAGTTTCAGGATGGTTTTCTAAGTAAACGTCCCTTGCGTAAGGCAAAATATCATTTGCCATAAGCACATTGAAACCTGCGTTCAAAACTCCCGCATCAGAACCACCACATCCAGAAAAAAGTGAAACTACAGTTGGCATTGACCCCTCCTAAAAACCGACCGCGTATTATAGCGAAACACCCCGTTGGGAAAAGCTAGATTTTGCCAAGTCTTGATATTCTCACGTTTTAGTAGTTGTGGCCATCTTTAACGAGAAAAAGATAAAATTGACTTCTCATTAATTTTCAATAGGTTTAATTGTAAGCTCAAACTAACGCCTCGCGACACTCGTTATTCAACCCCGCCAGCCCTGAAAACAAGTTTCACGACTGGCGGCGTTCTCTATCGTCTGCGTTGTGGTGGCGCAACTCTGGACTGACCGATATGGTTAAACCGCCCGTAATTATCCCGGACTATTTCGGCACACCCGACCAGCTCATCGGGCGTCAGATTTTCGTTGACCATAATCCGCTGTAAACGCTGAACAATAGCCATCAGCTTGATATTTTTAGTTTTATGGTGCGGTATCTCGCCTGGTATTCTGTGCATTATCCAAGCCACCCGTTTTGCTGTGCACGCTCCATCTGTTCATCTGAATAGTTCCATGCTCCATCCGTGGCAACCATTGCCCCGCCAGACATCCCCGTCTCTGGTTCATACATAACAGCAAGGCCGAGCTGATGCATAATTTCATGATTAATTCTGAATACCAGACCACGCTCACTAAGTTCTTTCCAGTTCACAATCTCACATGCGCCTGTATTAAGCCGCTCAATACTTAGCAAGACATAATCTTCCAGCCAGTCTGACAGGTCAGTAACATCTGTTATCCGGGCTTCAACCTTTCGCCCCGTATACACACCCTGCACCCATTCATGCAAAATCAACGTGTCCCCGCGCTCATAATTACGGTCATTTTTCCGAAACTCTGCGCGTTTCTTTCCTTCCAGCACAAGGTCAAAATATTTTGCGTGCAGCTTTACCTCGTGAATTTTTGCCATTATGTCCACTCCATTACTGTTGAGAATCCCGGCCACTCATCAGCGACCGGATACGTGAATTTTTTCCCGTCATAATTTACGGTCGCTCCACGCGCCAGCGCCTCAAGCTCCCATCGCTGAGGCCTGATACCGTTCTGAGCAAGGTCAACGCGGATACGGGTGATTTGCATTCGTTCCGACCGGGTCAGTCTGGCCGACGGTGCAATTTCATGTGTTTTTAACGGGCTTCCGTTTCTTTGTTGACGATTTGGTGTTTTCAGGCCGTGTTTTAATGCTCCCCTGAGCGCCCTCACGACCTCCGGGTCATTCCATTCGATAACACCGTCATCAACCAGATTTAGCACTGCTGCGGCGTGCTCAGAAGGTGTGGGAGCCGGTAACGAAGCATCACTACCGGTGAGCTTTGTGATGTGGCGAATCATGGCGTCAAGATGAGAAGAAAAGCGCGTTGCTGCGTCGGCCTGTGCTTCGGTTCTGACCTGTTGCAGCAGTAATGCGTATTTACCGCACTGATTTTCAGAAACTGTATGCATGACTTTCTCCAGGCAAAAAGAAGCCCCGCACAATTAAGTGCGTTAAAAACTCTGGTTAATTACTTAATGCAGATATTGCTCTGGTTTTACCGACGTCAGGATTGTCGGTGCATACTCAAACAGGCTGAATAATTCACGTAATGCACGGAATAAGGCATCACGCCAGTAACATGATTCTTCATTAATTCGCCAGTATGGCTGGTTGAATTCTTTTTCAGTCAATCCGGCATGCATAAATAAAGTACGGCGCTGACTGACAGTTAAAAAGCTAATATATGCATACTCACTTGCACCGACCTGACGGCGTTTTGAGAATGCCCCACGCAATTCATCAATTGCACATACCAGTCGTTCACGTTCGACGTCGTTCATTTCTTCAAAACGCATCGTTGCGTGACGCTGTTTTAACTGTGCATGAAAGCAAACCGTTAGCCGTTCGCGCTCCATCATCTGATTATAATAATCACATGTATCCTGCCAGCGAGGGACGGCAAGATGCTTGCCAATTATCCGGCGCATAGCTGCTGGCTGTTTTTCAACGAGATTGAGCGTCATCACTGTCATTTCCAGCCCCTCCGGCTTTTCAGAAAGGTCAGAGCCTTTTTTAACGGACTCTGTTTTTTGGTGCGGATAATGATTCCCTTACGCCCCTTACCGTGGGTGATGGTGAAGTCAATCGCCCTGGGGCTTTCGTTACGCAGTAACTGAGCAATACAACGCGGTTCACTCATAATCACAACCCCATCCACAAAAGCCATGCATCACGCTGTTCAACTGGTCGGTTATAAAACGCCTCTCGTACAGCGCGATTAAACTCTGGAATGAAAACCCACTTCTCACCGACACGAGCGTTCGGCTTACTTGGATCACGAAGCTCAATAACTGGCAACTTATTCTCTTTTACCATCTTGACTACAGCCGTTTCTGGCTTACCAAGTAACTCTGCAAACTTAACCGTATGTACCGCATCAATCGGGTACTGAATCACATAGTCATTGACTTCCATTGATTAGCCCTTTTTGCTTTCGTGTTACCCTTATTAGATCCAGTCCCTTCTAGGTCGCACCTGTCCTTTCTAGGGACTGGCTAACACACTCAAAAGGTCACCAATACACAACCTTTTGACGGGAATATAAGTCACCAATAGGTTACTGTCAAATGCAGACATTCGAAAAACTGAAAGCGATTAGGAAAGCAGAAGGCTTAACACAGGCGAAATTCAGCGAAATTAGCGGGATAGCTCTAGGAACAGTCAAAAATTACGAAAGTGGGCATAAAGACCCTGGTCTCAGCATCGTTATGCGAGTCACAAATACGCCTTTATTTAAAAAATATACGCTCTGGTTAATGACTGGTGATACGTCACCACAAGCTGGTCAGATCGCGCCGGCTCTCGCACACATTGGGCAAAAACCAACAGAATCAGACCACTCCGAAAAACAGACTGGTTAACACTCTATAAACATTACATTTTCACCATTTGTTACCAAGATGGTGAATACAGCGTCAGAGGGCTTTCTTATGTCAATTAAGAAGCTCGATGATGGACGCTATGAAGTGGACATTAGACCTCGCGGTCGCGACGGAAAACGCATCCGCAGGAAATTTGAAAGAAAAGCTGAGGCTGTAGCATTTGAGCGATACACAATCGCCTACGCCAGCCAGAAAGAATGGGCAGGTCAGCGAGCAGATCGCAGAACTTTGAGTGAGTTGCTGAACATCTGGTGGAAATATCACGGGCAAAACCACGAGCATGGAACAAAAGAGTTTAATCATCTGCTCAAAACCATCAGCGGCATAGGTGATATACCAGTGAGCCGGATGAGCAAAAGAGCTTTGATGGATTATCGTTCCATGCGACTACGTGATGGTATCAGTGCCGCAACGATAAACCGTGACATGTACCGATTATCCGGCATGTTCACAAAATTAATTCAATTGGATGAATTTTCCGGGCAACACCCAATTCACGGACTGCCGCCACTGGCGGAGGCCAACCCTGAAATGACGTTCCTGGAAAAAGCAGAAATCGAAAAACTGTTAAATGTTTTGGATGGTGATGACTTACTTGTCGCACTTTTATGTCTGAGCACTGGAGGAAGATGGACGGAAGTTGCCACGCTAAAACCAGCACAGATTACAAATTGCAGGGTTACCTTCCTGAAAACCAAAAACGGTAAAAAGCGAACCGTGCCGATTTCTGAGGAACTGGAGAAAAAAGTTAAAGAGGAGGCCAGCGCTAAATTATTCAAAGTTGATTATGAGAAGTTTTGCGGGATTTTACGCAGAGTGAAGCCAGATATACCACCCAATCAGGCAACCCACATCCTGCGGCATACATTCGCAAGCCATTTCATGATGAATGGGGGCAATATAATCGCACTGCAACAGATTCTGGGACATGCGAGCATTCAGCAGACGATGGCCTATGCGCACCTTGCGCCTGACTACCTGCAAAATGCCGTCGCGCTGAATCCTCTAAAAGGCGGAGTGACGTTATAAATTTCCCTTCTGAGTGTCCACATAGTGTCCACACTCTCAGAACTTTGTAGCCCTTCCAGTCCCTTATAGGTTTTCTTAAGTTACTGTTTTCTTACGGAAACCGATGTAAGTGATTGATAAAAAAAACCCCCACATCATGTGGGGGAAGACAGGGATGGTGTCTATGGCAAGGAAAACAGGGTTTACTACTGGGAACGTGAGTTGCTACTACTCAATAGCTTCAACGATGAACTTTTTTGCCATTGCGTCACGTCGCGCAACTGCTCCATTCGTTGTTGATGTTTCTCGTTTAAAACCGCTTGCTGCTCCGGCGTTAACAGGCGATACATTTGGTTGCGGACTTTTGCCATCTCAACCTGACGAGCAATTTGCTCATTCGCCATTTTTTCTGCCTGTGCGCGCACAGCGTTTTCATCAAAATTTTCTGCGGTGACAAGGCGATGCATTGTCTCCAGTTCGCTAACATTAACAGGAGGCTGTTCGTGCCGGGCCTGTTGCATAAGATCTCGCATCTGCTGACGCTGATGTTCGGTTAAACTTATGCCGTCGAACATATGGCTCTGCGTACTGCGCTGCGTAAGTTCTTCACCCGGATGCCAGTTATCGCCTGAACCGACTTCAGCAGCGTGGCTTAATGAACTGACTGCCAGCGTTGAGGCCATGACGGCAGCGGTAACTATGCGCATCATTTGCTCCCAAAATCTTTCTGTCGCGATTCAACGATAGAGAGTTTACGATTCAGGCTGCAAACATGCGTCAGGGGGTGTAAAACAACGTAAAGTCATGGATTAGCGACGTCTGATGACGTAATTTCTGCCTCGGAGGTATTTAAACAATGAATAAAATCCTGTTAGTTGATGATGACCGAGAGCTGACTTCCCTATTAAAGGAGCTGCTCGAGATGGAAGGCTTCAACGTGATTGTTGCCCACGATGGGGAACAGGCGCTTGATCTTCTGGACGACAGCATTGATTTACTTTTGCTTGATGTAATGATGCCGAAGAAAAATGGTATCGACACATTAAAAGCACTTCGCCAGACACACCAGACGCCTGTCATTATGTTGACGGCGCGCGGCAGCGAACTTGATCGCGTTCTCGGCCTTGAGCTGGGCGCAGATGACTATCTCCCGAAACCGTTTAATGATCGTGAGCTGGTGGCACGTATTCGCGCGATCCTGCGCCGTTCGCACTGGAGCGAGCAACAGCAAAACAACGACAACGGTTCACCGACACTGGAAGTTGATGCCTTAGTGCTGAATCCAGGCCGTCAGGAAGCCAGCTTCGACGGGCAAACGCTGGAGTTAACCGGCACTGAGTTTACCCTGCTCTATTTGCTGGCACAGCATCTGGGTCAGGTGGTTTCCCGTGAACATTTAAGCCAGGAAGTGCTGGGCAAACGCCTGACGCCTTTTGACCGCGCTATCGATATGCACATTTCCAACCTGCGTCGTAAACTGCCGGATCGTAAAGATGGTCACCCGTGGTTTAAAACCTTGCGTGGTCGCGGCTATCTGATGGTTTCTGCTTCATGATAGGCAGCTTAACCGCGCGCATCTTCGCCATCTTCTGGCTGACGCTGGCGCTGGTGTTGATGTTGGTTTTGATGTTACCCAAGCTCGATTCACGCCAGATGACCGAGCTTCTGGATAGCGAACAGCGTCAGGGGCTGATGATTGAGCAGCATGTTGAAGCGGAACTGGCGAACGATCCGCCCAACGATTTAATGTGGTGGCGGCGTCTGTTTCGGGCGATTGATAAGTGGGCACCGCCAGGACAGCGTTTGTTATTGGTGACCACCGAAGGCCGCGTGATCGGCGCTGAACGCAGCGAAATGCAGATCATTCGTAACTTTATTGGTCAGGCCGATAACGCCGATCATCCGCAGAAGAAAAAGTATGGCCGCGTGGAACTGGTCGGTCCGTTCTCCGTGCGTGATGGCGAAGATAATTACCAACTTTATCTGATTCGTCCGGCCAGCAGTTCTCAATCCGATTTCATTAACTTACTGTTTGACCGCCCGTTATTACTGCTGATTGTCACCATGTTGGTCAGTACGCCGCTGCTGTTGTGGTTGGCCTGGAGTCTGGCAAAACCGGCGCGTAAGCTGAAAAACGCTGCCGATGAAGTTGCCCAGGGAAACTTACGCCAGCACCCGGAACTGGAAGCGGGGCCACAGGAATTCCTTGCCGCAGGTGCCAGTTTTAACCAGATGGTCACCGCGCTGGAGCGTATGATGACCTCCCAGCAGCGTCTGCTTTCTGATATCTCTCACGAGCTGCGCACCCCACTGACGCGTCTGCAACTGGGTACGGCGTTACTGCGCCGTCGTAGCGGTGAAAGCAAGGAACTGGAGCGTATTGAAACCGAAGCGCAACGTCTGGACAGCATGATCAACGATCTGTTGGTGATGTCACGTAATCAGCAAAAAAACGCGCTGGTTAGCGAAACCATCAAAGCCAACCAGTTGTGGAGTGAAGTGCTGGATAACGCGGCGTTCGAAGCCGAGCAAATGGGCAAGTCGTTGACGGTTAACTTCCCGCCTGGGCCGTGGCCGCTGTACGGCAACCCAAACGCCCTGGAAAGTGCGCTGGAAAACATTGTTCGTAATGCTCTGCGTTATTCCCATACGAAGATTGAAGTGGGCTTTGCGGTAGATAAAGACGGTATCACCATTACGGTGGACGACGATGGTCCTGGCGTTAGCCCGGAAGATCGCGAACAGATTTTCCGTCCGTTCTATCGGACCGATGAAGCACGCGATCGTGAATCTGGCGGTACAGGTTTGGGGCTGGCGATTGTTGAAACCGCCATTCAGCAGCATCGTGGCTGGGTGAAGGCAGAAGACAGCCCGCTGGGCGGTTTACGGCTGGTGATTTGGTTGCCGCTGTATAAGCGGAGTTAA
Protein sequences of DBSCAN-SWA_9 >NZ_CP048344|4490113:4508632|4492694_4493222_-|WP_000972099.1|tail|DBSCAN-SWA MMHLKNIKAGNAKTLEQYELTKKHGVIWLYSEDGKNWYEEVKNFQPDTIKIVYDENNIIVAITRDASTLNPEGFSVVEVPDITSNRRADDSGKWMFKDGAVIKRIYTADEQQKLAELHKAALLSEAESVILPLERAVRLNMATDEERSRLEAWERYSVLVSRVDPANPEWPEMPQ >NZ_CP048344|4490113:4508632|4504747_4505728_+|WP_000023386.1|integrase|DBSCAN-SWA MSIKKLDDGRYEVDIRPRGRDGKRIRRKFERKAEAVAFERYTIAYASQKEWAGQRADRRTLSELLNIWWKYHGQNHEHGTKEFNHLLKTISGIGDIPVSRMSKRALMDYRSMRLRDGISAATINRDMYRLSGMFTKLIQLDEFSGQHPIHGLPPLAEANPEMTFLEKAEIEKLLNVLDGDDLLVALLCLSTGGRWTEVATLKPAQITNCRVTFLKTKNGKKRTVPISEELEKKVKEEASAKLFKVDYEKFCGILRRVKPDIPPNQATHILRHTFASHFMMNGGNIIALQQILGHASIQQTMAYAHLAPDYLQNAVALNPLKGGVTL >NZ_CP048344|4490113:4508632|4502034_4502241_-|WP_000554771.1|DBSCAN-SWA MHRIPGEIPHHKTKNIKLMAIVQRLQRIMVNENLTPDELVGCAEIVRDNYGRFNHIGQSRVAPPQRRR >NZ_CP048344|4490113:4508632|4499819_4500857_+|WP_000570053.1|DBSCAN-SWA MIEIMITNLDSMPSNEPYLWADYIEILALTNIDRSFSRGDLYSTLQAQPEAVLAETDEAEEEGVYDVDDENDTPVRKRTKRSVSRAYTDRKWSYAIGFIRQRIDLFGDSYPFTLSEDNDTVELRDISEKPLEHLERLYLALLICANIKYVNIMSRREITRSFELISLPIFESLMPSGSIIKACWASGGQAAPYTGTLYNKFKSIASDIRCTANFKERDFSRGNSGDGGLDIIAWHPMGDQRDAIPISFVQCGCSQEEWEAKQLEASPAMLYSKFPVAHRWATYYFLPQDLRWIDGEWAHKNKLGDAIFVDRLRLINLTRASDNIDHSQNISYLDIILDPSSAIAA >NZ_CP048344|4490113:4508632|4495605_4496058_-|WP_001695629.1|DBSCAN-SWA MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKTPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLER >NZ_CP048344|4490113:4508632|4502240_4502693_-|WP_001600138.1|DBSCAN-SWA MAKIHEVKLHAKYFDLVLEGKKRAEFRKNDRNYERGDTLILHEWVQGVYTGRKVEARITDVTDLSDWLEDYVLLSIERLNTGACEIVNWKELSERGLVFRINHEIMHQLGLAVMYEPETGMSGGAMVATDGAWNYSDEQMERAQQNGWLG >NZ_CP048344|4490113:4508632|4504384_4504678_+|WP_001192857.1|DBSCAN-SWA MQTFEKLKAIRKAEGLTQAKFSEISGIALGTVKNYESGHKDPGLSIVMRVTNTPLFKKYTLWLMTGDTSPQAGQIAPALAHIGQKPTESDHSEKQTG >NZ_CP048344|4490113:4508632|4500853_4501792_-|WP_001143634.1|DBSCAN-SWA MPTVVSLFSGCGGSDAGVLNAGFNVLMANDILPYARDVYLENHPETDYILGDISGLQSFPSAELLIGCYPCQGFSQGGARKADRKINTLYLEFARALSKIKPKAFIVENVSGMVRRNFEHLLKDQFKVFEEAGYTVSSQILNASHYGVSQDRKRIFIVGIRKDYGITYKFPKPTHGDGLTPYSTIRDAIGHMPVWPVGEFYDADFHWYYLSRNRRQDWDQISKTIVANPRHMPLHPISPTLEKLGPDKWQFTSDAPARRFSYREAAILQGFGDLIFPETERASINMKYTVVGNAVPPPLFEAVARNLPDLWD >NZ_CP048344|4490113:4508632|4491579_4492479_-|WP_000014361.1|DBSCAN-SWA MSGIRITQAIDNNQILVNVNTFNYSVNPYPLKCPDPNCSAHLVYVKSHIRRSFNKTLHIPAFFRLEKNFTHDKLCQYGTSGLNTIYASDSSHDISRALATGNNIFRIHILDEDDISKLSRKAAAMQANPPSDTTDRVYVKRGRKAPYVKNMDSLREIYEYGKAHPNKRNSIKIVTGTSTVTWSDFFYETTQLERLSTYLQKVKIAQVAVIMKVHVARIPMAKFNYRHFIEGSPMRIKGGFNIYPTIQLGNVTPKLFPLSSIVMVLGKFTIPTNRKMIMDPIFEREVRTIVSSEEQVLIL >NZ_CP048344|4490113:4508632|4490113_4490332_-|WP_000468308.1|DBSCAN-SWA MFHCPLCQHAAHARTSRYITDTTKERYHQCQNVNCSATFITYESVQRYIVKPGEVHAVRPHPLPSGQQIMWM >NZ_CP048344|4490113:4508632|4494748_4495534_+|WP_060621266.1|DBSCAN-SWA MFSDNVTNAWWFISLYLFLLIALTFVTFGKSNLMRFIAHHFNLEYSDRKLKMLDKKWRDIQLFKIINGINVSGIEDVRMIQQGLIDGKLKTSYFFLTRFWGDITKPPHIIKTTIVILASIIYILFACYIHNEQSAIVRDAIGIPYKNMMYYVYSDKVLLSFKNKTVEFNKTYSLADCKSLQNVFIKDTLPEIACNKLLQLNKEDSEWLSQEIKDNNSYRKTLLIMSLTYFISGLLIFLSYTKFLYANKKVLEYKASNKNHS >NZ_CP048344|4490113:4508632|4497038_4497464_-|WP_060621267.1|DBSCAN-SWA MKKLSLSLMLNVSLALMLALSLIYPQSVAVSFVAAWAILATVICVVAGGVGVYATEYVLERYGRELPPESLAVKIVTALFLQPVPWRRRSVALVVMVATFISLVAAGGIFTALIYLVASVFFRLIRTACCQRFEGREPCQS >NZ_CP048344|4490113:4508632|4498753_4499827_+|WP_000368931.1|DBSCAN-SWA MDNKIVEIETNKLDFDPKNPRFFRLNDASNAATVIEEMLDDESVHDLMLSIGQQGYFPGEPLLAVKSNGNYIVVEGNRRLAAVKLLNGDLLPPKRKLKGVQEIIDDTTNKPKKLPCIIYENREDVLRYIGYRHITGVKEWDSLSKAKYLKELCDTFYSHEPKEIVLKNLAREIGSKPHYVATLLTALNLYEVAHDHEFFNLPMKASDVEFSYITTALGYSKITNWLGLQDKKDFLDPNLNEENLKRLFSWFFVPDQQGRTIIGESRRIKDIAAVVEKPEAIEILMKSSNLDEAYLYTSGEREALDKALNAASVKLRVVWDMLLKAKELTLEHEEAASEIFEMSKNIRNQIRSKREDD >NZ_CP048344|4490113:4508632|4496050_4496518_-|WP_000917174.1|tail|DBSCAN-SWA MLKPDSLRRALTDAVTVLKTSPEMLRIFVDNGSIASTLARSLSFEKRYTLNVIVTDFTGDFDLLIVPVLAWLRENQPDIMTTDAGQKKGFTFYADINNDSSFDISISLMLTERTLVSEVDGALHVKNIPEPPPPEPVTRPMELYINGELVSKWDE >NZ_CP048344|4490113:4508632|4505913_4506414_-|WP_001223800.1|DBSCAN-SWA MRIVTAAVMASTLAVSSLSHAAEVGSGDNWHPGEELTQRSTQSHMFDGISLTEHQRQQMRDLMQQARHEQPPVNVSELETMHRLVTAENFDENAVRAQAEKMANEQIARQVEMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRDVTQWQKSSSLKLLSSSNSRSQ >NZ_CP048344|4490113:4508632|4507258_4508632_+|WP_000580417.1|DBSCAN-SWA MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQRLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS >NZ_CP048344|4490113:4508632|4493223_4494225_-|WP_060621264.1|DBSCAN-SWA MQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNDADGDTDSYMWFETGDNGNEYFKWRSRQGTTTKDLMNLKWDALSVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEECLHIIPTRENEGENGDIGPLRPFSLNLRTGRITMGHALDVTGDITTNAWVYANRLAINSSTGMWIHMRDQNVIFGRNAVSTDGAQALLRQDHADRKFMIGGLGNKQFGIYMINNSRTANGTDGQAYMDNNGNWLCGSQVIPGNYGNFDSRYVKDVRLGSQQYYGVNNWQTWNFQCPSGHVLSGINVQDTGSNSADNIAGVYYRPVQKYINGTWYNVASV >NZ_CP048344|4490113:4508632|4503289_4503790_-|WP_000217670.1|DBSCAN-SWA MTVMTLNLVEKQPAAMRRIIGKHLAVPRWQDTCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDVERERLVCAIDELRGAFSKRRQVGASEYAYISFLTVSQRRTLFMHAGLTEKEFNQPYWRINEESCYWRDALFRALRELFSLFEYAPTILTSVKPEQYLH >NZ_CP048344|4490113:4508632|4496625_4497051_-|WP_000040662.1|lysis|DBSCAN-SWA MSKLMIVLVVLLSLAVAGLFLVKHKNASLRASLDRANNVASEQQTTITMLKNQLHVALTRADKNELAQVALRQELENAAKREAQREKTITRLLNENEDFRRWYGADLPDAVRRLHQRPACADASDCPQRLPESEPLPDAGQ >NZ_CP048344|4490113:4508632|4506563_4507262_+|WP_001033722.1|DBSCAN-SWA MNKILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLLDDSIDLLLLDVMMPKKNGIDTLKALRQTHQTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRRSHWSEQQQNNDNGSPTLEVDALVLNPGRQEASFDGQTLELTGTEFTLLYLLAQHLGQVVSREHLSQEVLGKRLTPFDRAIDMHISNLRRKLPDRKDGHPWFKTLRGRGYLMVSAS >NZ_CP048344|4490113:4508632|4503959_4504232_-|WP_000453534.1|DBSCAN-SWA MEVNDYVIQYPIDAVHTVKFAELLGKPETAVVKMVKENKLPVIELRDPSKPNARVGEKWVFIPEFNRAVREAFYNRPVEQRDAWLLWMGL |
21 | Escherichia_virus(26.32%) | integrase,lysis,tail | attL 4489956:4490002|attR 4505844:4505890 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|