Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 1 crisprs | NA | 1 | 1 | 0 | 0 |
NZ_AP021859 | Alteromonas sp. I4 | 5 crisprs | WYL,DEDDh,DinG,RT,csa3,Cas9_archaeal,cas3 | 1 | 0 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021860_1 | 17-129 | Orphan |
NA
Consensus repeat of NZ_AP021860_1
|
1 spacers
spacers of NZ_AP021860_1
>1.1|58|31|NZ_AP021860|CRISPRCasFinder TTGCCCCATGTGGGACACTTCCGCCCACTGT |
CRISPR arrays and Neighbor proteins around NZ_AP021860_1
The CRISPR arrays of NZ_AP021860_1 >merge|NZ_AP021860|1|17-129|CRISPRCasFinder GAATGTTGAATAAATTCAAGTGTCCAATATAGGATACATAGTTGCCCCATGTGGGACACTTCCGCCCACTGTGAATGTTGAATAAATTCAAGTGTCCAATATAGGACACATAG >NZ_AP021860|1|1|17-129|CRISPRCasFinder GAATGTTGAATAAATTCAAGTGTCCAATATAGGATACATAG TTGCCCCATGTGGGACACTTCCGCCCACTGT GAATGTTGAATAAATTCAAGTGTCCAATATAGGACACATAG
>NZ_AP021860.1|WP_155016802.1|344_1313_-|tyrosine-type-recombinase/integrase MLMLSLEDVPLAISPDDRQRVIDNLQEFKDEVWHLKSENTKRAYQSDFKQYLSFCMDNGMPALASDWRITRESCRSYLKYMMASKLKHHTIRRKIASIRYFIGVSELADPWKHSKLFTEFTNNTLKAKPSRQGQAKPLRVNLIDKFTSQLDLDNLLELRDAVIFNVAIDTIFRASNLLAIDISHIKFSQNKVFAPRSKTDRTGKGHYGYISQTSIELIKRWMEAGNISTGPLFRTLSPKHTVRDEGMQYHALISRYRTIARRIMIEDRFSCHSTRVGGVVTMFENGVSLDEIQKAGGWSSQAMPLHYAEEYDVAKTGMARLR >NZ_AP021860.1|WP_155016728.1|1553_2414_+|hypothetical-protein MNNDRPWQCFIEQQLDTFLSSLGNADWVSLYPTTLDKERLAESGESAALMAMRKVIKPGAMRRFERDFAEYRKEFEKSLWSVYLSKHLKKFLSAVPESDYHPDAAPLTDDKIESWFNLQPKDLYKTLRIWIPQRYVKQFQNRYRAYKHREVRNIKVFDISAKSKAILERYRDEIEANSLDEAIERCFSVNYRTRENDPESTVAKSAIANTIMFGNDVYFDDLMQRLSNNDRQKLALIIERSFKAGWNAAKANRVRKGDPKQQALDEFDLMQKLTAFLPAENVDSGQ >NZ_AP021860.1|WP_162360026.1|2671_3829_-|putative-DNA-binding-domain-containing-protein MQKLVEQLIKNKSEGYWWDFKLKHHSNLLELLHDVLCLANIIYEGERFIIFGVSDDFNVIGLNDDDIRHKQADILNFLRTKSFAYHKIPSVKIDTIQVDGKELDVLSIKDENYKPYFLTRDETKKGITIRAGTIYSRLGDSNTPKDSCANPYEVEAMWRQRFGLDKKASERFYDVLVDFKNWKYDGISKAFYDIDPDYTIEIGGNEGSGGKFWWEESLFEKPDRFYYHLRYKGVELYKLLVVRFNSENLQLPFPDVEYITYPEKNDGCETEVYCDIFFYLENSIEYSLFKHIRALEVSEITKKSFTTPIETQMKPRIIELPFLIFNSEYSLKTACQKLVENYNDFLRVKSESNEIKNSSDEMRKRYITERLFTEWAYSIVHENST >NZ_AP021860.1|WP_155016730.1|3977_4733_-|hypothetical-protein MKDQLDSVIPIFHEDFQTEKINQIGSGVLILFRAYYFILTAGHVIDEQKSGHLLIPGVTRHLTGIRGSFSHFNPIIGRKNDLVDVGYFKLETDFGLELSKVFEVVTEQDMFLAPEYAEDTIFSLCGYPYRKSKIENDQVNNEIFSYSALHAKAEEYEKHNCKQPYQIVMKFNRKKAVDSYSGKKEISPLPHGISGGGIFIWPKIFESLTPIDRKLTGICHTYKQSEHLFIGTNLLLIINFILHNNPELAKN >NZ_AP021860.1|WP_162360027.1|5163_6498_+|alpha/beta-hydrolase 
MVSTIRFLIVAAVMFSIISCGSVPYLEKSGHALIPPAEMDLSFDDYVAHSTAEIRTAMQGRQDPLVFQGSYSLDDAVSMRAPYSIPVDSTVCTGGTGGEDKGFLLIHGLTDSPYLMKGLANSMRKAYPCSTIRAIVLPGHSTIPGDSNHSSDWDGSNDSQLMTYKKWLKSTNFGIRSFDNKEHVKSLYVLTFSTGAPLLIQHLSKHKEEKLKGAVLISAAIKAKSKAAFLAPLAQYIVPWSTVYPEEDAVRYETFSTHAAAEFYWLTRELLEEEYRFKLPLFIAISADDNTVSAQAALRYFCAAETDSKQMLWYQYASSERPLNSYRLASGPCGEDIIAREIGKNGFELPSYYKSFSHTSLSVPESDPHYGKEGAYRQCKDYFKDDKLEKFEQCKDPDEANYVIGETTERLYEENKGKRVRRGVYNPDYPYMEQQILEFIESIR >NZ_AP021860.1|WP_155016732.1|6551_7628_+|MBL-fold-metallo-hydrolase MRKLILNVLTALTVTACAYKPNHYLPVENNEPGKGDLYGRFLGVTNLYFSDGHDAIMIDAYIGHRTLPGLFFFYDMETDPENVELILDRAEITDIDRVFIAHSHFDHALDVATIVKLHPGIVVSGSLNTRAILEDDKQITRIVDLAEGKSAGSKSEPVDSDETVALFDNVVGGNERYQGNFKVTVFESPHVKKEPHQRWVESAINYVSDGDIFKEPGTSYSYYIDHPQAKMLVVPSAGYPSSFNDIEADVVFLGIGLLSNWVGKPGLYREPPFKYVEQYWQKTVVDTCASVVVPIHWDSPFTALSVEPRAPLDIFDSITKSVAALERVAETMRGCDGKPVEIVFPRGFKRFKVPVNSY >NZ_AP021860.1|WP_155016733.1|7724_8738_-|hypothetical-protein MANDGVRLLQLIDELNDAFDVFEANLDLETTAYFADIAPIDKNMQKSYQEGRVCLLVEHYFGNEAIDKCVRALRQYKRPDETVSGRFAGQFPGILFAKNGETVQRDVAQINIIKSEIQACVQDQRRRKRGKRMETYHARNHQEKHEFLHRYLPNAISYQLYRHIDVISVASDDETLSLKTIGFYWGNKNTDKYLSLDQANHYIDKSREMSVSSSIRMEIKEKLANSNLSHRFCLRRTRNDTINISLYFGVGENGGPVTSRTVIPQPIIITDYDSIPKIGLVKHQEPGRRSEIRTGHQWELLDSKLKLYRRPKTPDEFKKDADRALLEVDGLTNTAKT >NZ_AP021860.1|WP_155016734.1|9124_10018_-|hypothetical-protein MPKLTPAQRRLRNALERGVCYKSLDFDYLCSRAPHRAGVIIEGDSWVDYPRKYIISGPSINLGHRFEQTTEYLDTVNVLRIGSNGDTAVGMTTGKQFALMKKILKKNGKHIQLMLFSAGGNDIVGESDLDPLIKEFDAALHHGWEDVIEKELFDEKLDEILEAYLRMIELYKSLAPAASIVTHTYDCVNPSPQGAAFFWNLIKTKSWVWPTMQKRNIPVEWRAPIIQYMLSLFSLRIQALQNHPSAVGRFYVVDTQGTLDPTSKADWLNEIHATPAGYRKIFNKMYPIFKHLLPVLP >NZ_AP021860.1|WP_155016735.1|10355_12710_+|hypothetical-protein 
MRQYPPRLQQLVAAITATGFTAIMLALGIILLTLVDQIKDILIQVDQSDHVRYRIGFILGGLSFSLACWWSARFVLDVMAASHSKNSPESCITGNYKVLAPSAGISPFLALWLPRFYAAVIPIVVLIAGACNELWILFLLSTFGVLVPALLFVICRRAFISWAFNVQTPALFRIFGDTKSWGLLASIMYLLVTVWALANPYSLGDFFGAYFVVFWGLSSILFCLIVSFYGLVPWLIKKVFGPLVKAQQTAFEIQQQAHPNVVLSRPLLLQLSESQLVQPPHPVSLPVFLLAVLISWLVDTDNHEVRRVYLKEQVSYAYADFSEAWKTFSINLSKSDLYLKPSTNETEQRVRKKPVFFVASQGGGLRAAYWSAVGMGYLEARIPGFSQHVFSLAGVSGGSVGNSFYAASLNQPVVQNQCLALTKDLHLACGLEHALGTDYLSPVLTSFLYNDLLYRFFPLSSLPFMVRDRAEVLETSWEKGFARVFSSNNMQAKLQSLYQSDDDNWLPLLMSMGAHQESGTRLVTAPFPIEQDIFINQYNTYDLMACDNGIGINCDMRLSTVALNAARFPFVTPAGTLAKKSENGSNIPWKEKDHIIDGGYVENYGLMATRFMISHLMANNQFSVVENGQSIELVPVVIIFANDMDLTTEVFNPSRKRPYRNGNSLALNEVTNPLQGLLTTRSGRSVQSLTELIDFQHRLGGKQIATQITFPDAGSNEAPVIMNNVVVFHLQNDASNVNVPLGWWLSDQSQTYMSDQYRTQGKRAHQAIESLSRVIKTSSNAESP >NZ_AP021860.1|WP_155016736.1|12855_13092_-|hypothetical-protein MTILRIDNYPELKFIFWDWSPNEIDEKIAFALIEKRWPYISHKNLNSEEVSLIQRLAKTYGNGLLLVGQAQTIVQSTE |
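The array record for NZ_AP021860_1 above lists the array both as one merged sequence and as alternating repeat and spacer segments. The following is a minimal sketch, not part of the original report, of how 1-based inclusive coordinates for each segment can be recovered from the array start position (17). The function name `segment_coordinates` is hypothetical, and the assumption that segments strictly alternate repeat/spacer/repeat is taken from this CRISPRCasFinder record only; the PILER-CR records further down may be segmented differently.

```python
from typing import List, Tuple


def segment_coordinates(array_start: int, segments: List[str]) -> List[Tuple[str, int, int]]:
    """Assign 1-based inclusive coordinates to alternating repeat/spacer segments.

    Assumes the first segment is a repeat and that segments alternate
    repeat, spacer, repeat, ... (an assumption, see the lead-in).
    """
    coords = []
    pos = array_start
    for i, seg in enumerate(segments):
        kind = "repeat" if i % 2 == 0 else "spacer"
        end = pos + len(seg) - 1
        coords.append((kind, pos, end))
        pos = end + 1
    return coords


# Segments copied from the NZ_AP021860_1 record (array location 17-129).
segments = [
    "GAATGTTGAATAAATTCAAGTGTCCAATATAGGATACATAG",
    "TTGCCCCATGTGGGACACTTCCGCCCACTGT",
    "GAATGTTGAATAAATTCAAGTGTCCAATATAGGACACATAG",
]

coords = segment_coordinates(17, segments)
for kind, start, end in coords:
    print(kind, start, end)
```

With these inputs the spacer interval comes out as 58-88 and the last repeat ends at 129, which agrees with the spacer header `1.1|58|31` and the array location 17-129 given in the record.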
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860.1 | 123606-123636 | 0 | 1.0 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860.1 | 123750-123780 | 0 | 1.0 |
1. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to position: 123606-123636, mismatch: 0, identity: 1.0
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ttgccccatgtgggacacttccgcccactgt Protospacer *******************************
2. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to position: 123750-123780, mismatch: 0, identity: 1.0
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ttgccccatgtgggacacttccgcccactgt Protospacer *******************************
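The Spacer_Info values in the table above, such as `1.1|58|31|NZ_AP021860|CRISPRCasFinder`, appear to pack several fields into one pipe-delimited string: a spacer label, a start coordinate, a length, the contig accession, and the detection tool. That reading is inferred from this report (start 58 and length 31 match the Spacer_region 58-88), not from any format specification. Under that assumption, a small parser could look like the sketch below; `SpacerId` and `parse_spacer_id` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class SpacerId:
    spacer: str   # e.g. "1.1" (array.spacer label as shown in the report)
    start: int    # 1-based start coordinate on the contig
    length: int   # spacer length in nucleotides
    contig: str   # contig accession
    tool: str     # detection tool named in the identifier

    @property
    def end(self) -> int:
        # Inclusive end coordinate, so a 31 nt spacer starting at 58 ends at 88.
        return self.start + self.length - 1


def parse_spacer_id(text: str) -> SpacerId:
    """Parse 'label|start|length|contig|tool', with or without a leading '>'."""
    spacer, start, length, contig, tool = text.lstrip(">").split("|")
    return SpacerId(spacer, int(start), int(length), contig, tool)


sid = parse_spacer_id("1.1|58|31|NZ_AP021860|CRISPRCasFinder")
print(f"{sid.contig} spacer {sid.spacer}: {sid.start}-{sid.end} ({sid.length} nt, {sid.tool})")
```

The same function accepts the FASTA-style headers used in the spacer listings, since a leading `>` is stripped before splitting.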
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 58-88 | 0 | 1.0 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 123606-123636 | 0 | 1.0 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 123750-123780 | 0 | 1.0 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 123462-123492 | 2 | 0.935 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 123534-123564 | 2 | 0.935 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 123678-123708 | 3 | 0.903 |
NZ_AP021860_1 | 1.1|58|31|NZ_AP021860|CRISPRCasFinder | 58-88 | 31 | NZ_AP021860 | Alteromonas sp. I4 plasmid pAltI4, complete sequence | 130-160 | 4 | 0.871 |
1. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 58-88, mismatch: 0, identity: 1.0
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ttgccccatgtgggacacttccgcccactgt Protospacer *******************************
2. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 123606-123636, mismatch: 0, identity: 1.0
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ttgccccatgtgggacacttccgcccactgt Protospacer *******************************
3. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 123750-123780, mismatch: 0, identity: 1.0
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ttgccccatgtgggacacttccgcccactgt Protospacer *******************************
4. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 123462-123492, mismatch: 2, identity: 0.935
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ctgccccatgtgggacacttccgcccactgg Protospacer .*****************************
5. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 123534-123564, mismatch: 2, identity: 0.935
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ctgccccatgtgggacacttccgcccactgg Protospacer .*****************************
6. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 123678-123708, mismatch: 3, identity: 0.903
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ctgccccatgtggggcacttccgcccactgg Protospacer .*************.***************
7. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: 130-160, mismatch: 4, identity: 0.871
ttgccccatgtgggacacttccgcccactgt CRISPR spacer ctgccccatgtggggcacttccgcctactgg Protospacer .*************.**********.****
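The Mismatch and Identity columns for the hits above are consistent with a simple ungapped, position-by-position comparison in which identity = 1 - mismatches / spacer length, rounded to three decimals (for example 1 - 2/31 ≈ 0.935). The sketch below reproduces those numbers from the spacer and protospacer sequences listed in the alignments. It is an illustration of that arithmetic, not the database's actual scoring code, and the function name `compare` is hypothetical.

```python
from typing import Tuple


def compare(spacer: str, protospacer: str) -> Tuple[int, float, str]:
    """Return (mismatches, identity, match_line) for two equal-length sequences.

    The match line uses '*' for identical positions and '.' for mismatches.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length (ungapped comparison)")
    a, b = spacer.lower(), protospacer.lower()
    match_line = "".join("*" if x == y else "." for x, y in zip(a, b))
    mismatches = match_line.count(".")
    identity = round(1 - mismatches / len(a), 3)
    return mismatches, identity, match_line


spacer = "ttgccccatgtgggacacttccgcccactgt"

# Protospacer sequences copied from the alignments above.
protospacers = [
    "ttgccccatgtgggacacttccgcccactgt",
    "ctgccccatgtgggacacttccgcccactgg",
    "ctgccccatgtggggcacttccgcccactgg",
    "ctgccccatgtggggcacttccgcctactgg",
]

results = [compare(spacer, p) for p in protospacers]
for mismatches, identity, match_line in results:
    print(mismatches, identity, match_line)
```

For the four distinct protospacers this yields 0, 2, 3 and 4 mismatches with identities 1.0, 0.935, 0.903 and 0.871, matching the table.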
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021859_1 | 115001-115125 | Orphan |
NA
Consensus repeat of NZ_AP021859_1
|
2 spacers
spacers of NZ_AP021859_1
>1.1|115026|25|NZ_AP021859|PILER-CR TGGGTTGTTTTGTCGGGTGAATATC >1.2|115076|26|NZ_AP021859|PILER-CR GATTGAAATATCCTATCTGCTATCAT |
CRISPR arrays and Neighbor proteins around NZ_AP021859_1
The CRISPR arrays of NZ_AP021859_1 >merge|NZ_AP021859|1|115001-115125|PILER-CR AAATAGCCGCGAACGCATAAATTTTGGGTTGTTTTGTCGGGTGAATATCAATTTAACCGCGAACGCATAAATAAGATTGAAATATCCTATCTGCTATCATAAAGTAACCGCGAACGCATAAATTT >NZ_AP021859|1|1|115001-115125|PILER-CR AAATAGCCGCGAACGCATAAATTTT GGGTTGTTTTGTCGGGTGAATATCA ATTTAACCGCGAACGCATAAATAAG ATTGAAATATCCTATCTGCTATCATA AAGTAACCGCGAACGCATAAATTT
>NZ_AP021859.1|WP_155013288.1|113264_114737_+|AMP-binding-protein MSFLSVQAKLRPFKQAVRDLTSGRSWTYLEWDTFVNKCSQWLSLQGLACGDRLVCIAKNCAELVALHFACEQSGVIFVPLNWRLSSDELHTLIADCTPQLIVGDEMANQLELAYFELKTLNNAVAELNGEIQPRNHRALPSLILYTSGTTGKPKGVMHSYDTIMETTLNMALLGQVDEYSTFLCETPMFHVIGLISCVRPALYQGGKILISDGFKPSRTLARLTDKNLLITHYFCVPQMANSLRQEPKFNPSSLYNLKALLTGGAPHPAVQIRQWLNDDIPIVDGYGMSEAGTVFGMPFDIATIDVKAGCVGIPTHRLEVRLADTEGQPVADGVAGEIQLKGLNLFIGIWRQPALFKACFTDDGWFKTGDVAIRDNDGFYFIVDRIKDMFISGGENVYPTEIESVVLKLDSVLECALVGVPDDRWGEVGCLFVVAKSKHSTIEQQEIMDELEQCLAKYKLPKYIQFVDSLPRNGGGKVMKHRLKAMFHSD >NZ_AP021859.1|WP_073324062.1|112417_113254_+|p-hydroxycinnamoyl-CoA-hydratase/lyase MSAEREEDTVAVKVENQIAWVSFNRPEKRNCMSPKLNRQMMRVLDDLEFREDVSVLVLTGEGSAWSAGMDLKEYFRETEAQGLGGTRQAQRESYGWWRRLRWYQKATIAMVNGWCFGGGYGPLFACDLAFAAEEAQFGLSEINWGILPGGGAAKVVAELMPLRKAMYHAMMGENIDGKTAEEWGLVNEAVPAEQLRARVTDVANVLLQKNQVALKATKDAVRRVKEMTYDNAEDYLVRAQEAANSFDNHGRKEGIKQFIDDKTYKPGLGAYDKSKQKS >NZ_AP021859.1|WP_073324060.1|110983_112381_+|aldehyde-dehydrogenase MNFERKNPLTGEVASQSIAMQAHEMAGIAERAQQGFEQWSAYGPNARRAILNKAAAALESRQNDFVEAMMTEVGATAGWAMFNLGLAVSMMREAASLTTQIGGETIPSDKPGCLAMAIRQPVGVVLGIAPWNAPIILGVRAISTALACGNAVILKASELCPRTHSLIIEALETAGFPDGTVNIVTNSPADAGEVVGALIDQPLVKRINFTGSTEVGRIIAQRAGANLKPVLLELGGKAPMLVLDDADLDEAVKAAAFGAFMNQGQICMSTERLVVDESVADSFAAKFAQKVSGMATGDPREGNTPLGAVVDQKTVSKVNALIDDAVAKGAKIIAGQKSDSVLMAATVIDHVTDDMKIYREESFGPVVAIIRAKDEADAIRIANDSEYGLSAAVFTKDSARGLRVARQIQSGICHVNGSTVHDEAQMPFGGVGASGYGRFGGKAGIDQFTELRWITIETEKGHFPI >NZ_AP021859.1|WP_155013287.1|109619_110942_+|MFS-transporter MSTNPKQQIDQSEIKGFQLFVILMCILLNALDGFDVLAISFASPGIASDWNVSRGALGIVLSMELVGMAIGSISLGNLADRLGRRPTILLCLSLMTIGMGACAFVNSLNYLLVFRFLTGLGVGGMLASTNALVAEFANAKYRNLAVILMATGYPIGAIVGGYISTELLALYNWKVIFVFGGAVTGSFLVICWLLLPESIDFLASRQPQNALQRINKILKRIGHNPIASLPKLENQQASSGFSTLITNNLRAVTGLLVIAYFAQIMTFYYILKWIPKIVVDMGYEPTSAGTVLVWANVGGAVGSLIFGVIASRLKLRPLLIAIMLCAFVMVSVFGLGPQTLLQLSVVSAATGFFTNSAVVGLYALMAQSFPAEVRASGTGVVIGIGRGGAALGPIVAGYLFQSGYGLFDVSVAMGFGAVVAAIAIFSLGPVLKKYQLSSQV 
>NZ_AP021859.1|WP_073324056.1|108406_109432_+|AraC-family-transcriptional-regulator MPGKYEVGTAANHYISQLYHEALKNKLDVTSMLNTLGLTEEVFDKPELRVKTEKLATFQNLIWQAMQDESMGLGASPVPAGSYFMMGRLTVNQPTLHKALNLAVRFYGMVTKAFTINLTVDGDTAFLGFKLHSPERDPQHMFAEILLLAMHRYASWLIADSLPLIECYFDYPTPAHISEYSYLFPGGHTFESDKLGFAFPARYLKRDVKQNDASLKLFMKRCPQEIFQRYEADYSLTTELQRLLWKNLKGGVPSIEAAAAMMNMTKRTMMRKLKSEGTSYQQLKDQVRLDKAVTLLTKYNLPINQISESVGFSEPAVFTRAFLNWTGDSPSHFREKNAVED >NZ_AP021859.1|WP_155013286.1|105369_107844_-|TonB-dependent-receptor MCLFTPYLFTKSAFQIFKDQAFNNTQGDRKSANFFKRSLVATSIGLSFLSSTAIAQQTDTAPEAESEVQLEVITVNARRRAESMQETPIAVSAFSVKELERRGIENTQDLDRVTPSLQFATSGQLSGNNSAAVVFIRGVGQLDPTSSVDPGVGIYVDDVYMGRSAGGAMDFKDIQSVEVLRGPQGTLFGRNTIGGAVLVKTAEPSDVFGGKARLRIGDDNLREAFVAVDLPITTDLLSRFSLGTRKRDGYVTRVYDGQDLGNDDTYSVNGTIQYTPSDTFKITLKGDFTKEDENGSPFVFAGVNESAPVAAIVSVAAGCPGATIPFAPLAPGDAGFGAPNVPNIDDERCANDFQHKGEFTNGGTAPVESTLKGWGLSAAMEWEYSKTITLKSISAFRSTEWTGIRDADNTPFDMLTTDVTSDSEQFSQEFQLIYDNDKVSGITGLYYFDETSDDKLSILLAFPPSPPVIGSLLNGGPGTRDYQVINLETESFAVFSEWAYELSNDWSISAGLRYTEDDKGFQGAIMNLFPATQPDPTTLPTKATSEGGPLFIFNTPFADTYSATTGSASVRYKVQENINTYLSYSSSFKSGGFNSRYNAPTPGNLPISFGEEEVSSWEIGVKADITNDFRVNAAAFMSEYSDIQLIFRQGVVPLLFNAGSASIDGVELEFTYIPTNSLLIEGGFSYLRDKIDSITEVNGAQATITPDNSLPLTPEWQGNLGASYSTELGNNYELTTRLDVSYTASQYFDSSNTDIVAQNDGVTYVTASVKLDDLVNYWDLTFGVNNLTDERYIEQGNASLATLGYAEVIYARPRNWFLSFSTEF >NZ_AP021859.1|WP_155013285.1|103501_105223_-|tannase/feruloyl-esterase-family-alpha/beta-hydrolase MTKHNIFHAFTLAALSSTLLACNSSNNDISPVEIPQLSPATAANLSGNCNDLAASMSALANTTITSSSEVASGELMVAGKDIPAHCLVTGSMFERVSDIDGNVYAIRFEMRLPLNWNGRFYHQGNGGIDGSVVTAVGDAGPGNLSNALYQGFAVLSSDAGHSGALGPAFGVDPIARLDYGYKAVEKLTPMAKELISIAYGKGPDRSYFGGCSNGGRHTFNTLARMPDEYDGYLAGAPGFRLPYAAIANIFGAQRYFSVATDPSDISTGFTAEERNMVATAALVKCDDLDGINDGLIGDVEACQSVFSLDDVPSCSSERDGTCLSSQQKEALSPIFSGAVTASGEAFYAPFPFDTGIASPDYNFWDFFAPLVLDSGGVGLIWGVPVADPATFNGPEFALTGSIDDMLTSIESTDDVYTEAASSFMIPPNNAEALSAVRDRGAKIMVYHGVSDAIFSALDTINWYNNLTANHNDDASDFARLYLMPGMGHCSGGAAVDQVDLLTPLVAWVESGIEPEGLVATARGAGNPGGENPAIPASWAADRTRPLCAYPTVARYNADAGNGDVESAESFSCQ 
>NZ_AP021859.1|WP_155013284.1|102456_103407_+|2Fe-2S-iron-sulfur-cluster-binding-domain-containing-protein MFEVIVTNKQPLTASVCRLQLAAVDDSALPAWQAGAHIDVHLPNGIIRQYSLCGGVDTKHYEIAILNEPNSRGGSKYIHDQLQQGDVLTISAPKNLFPLVQGTHKTLLIAAGIGITPMLAMAEQLHAEDTPFELHYCAREQQHAAYYDRISNSDYARNCHFHFSLGNSQNRLNPYRLLADYNQDTQLYICGPNLFIQDVISAAEQHGWPTANIHREFFAAEAIDHSQDQRFEVVINSTGQVLQVAEDVSILNVLEDNGMFIPVACEEGVCGTCLTGLLEGEADHKDVFLSADEKQKMNQITPCCSRAKSKRLVLDL >NZ_AP021859.1|WP_155013283.1|101386_102469_+|Rieske-2Fe-2S-domain-containing-protein MHPQSYPLNTWYVAATPDEISDKPFARQICSIKLVFFRNSQQKIVAVEDFCPHRGAPLSLGFVENGQLVCGYHGLRMGDDGKTQSMPNQRVAHFPCIKHFAVIERHGFVWIWPGDQTLADDSLIPELHWANNPDWGYGGGLYHIKCDYRLMIDNLMDLTHETYVHASSIGQKEIDESPVSTKMEGQTVVTSRFMDNVMAPPFWQAALRANDLADDVAVDRWQICRFSLPSHIMIEVGVAHAGKGGYDAPKSHKASSIVVDFITPETEHSIWYFWGMARDFKPEDQALTQTIQQGQGAIFAEDLEVLERQQRNLLDYPDRSLLKLDIDAGGVQARRMIERVIKQEQAASASTNAGEKQCSK >NZ_AP021859.1|WP_155013282.1|100416_101118_-|FCD-domain-containing-protein MPREGQAVISILRDKIVSGVFPAGERLAEIPTAELLGVSRTPVRIAFRALAQEGLLIKLPRRGYQVRKVTNDEILGAVEVRGVLEGLAARQAAEKGLTEDTRVQLAECLKNADAIFEKGYLTEEDIEQYNVINKQFHDLIINASGNPAIQSAMQLNEHLPFASVNALVFNPKQLDREFRRFNFANMQHHVVFDALLKRQGARAEAVMKEHAHATLSQVDLCESPDSRTCNPSK >NZ_AP021859.1|WP_155013289.1|115138_115423_+|transcriptional-regulator MIFSINNTKQLGKAAQLTRKVQGLDQFAAASMSENGITFLSEFENGKQTVELGRVLRVLSTLGIKVTIDIPVDEASLTPKQQQQLAKIINEANL >NZ_AP021859.1|WP_155013290.1|115419_116688_+|HipA-domain-containing-protein MKRALDVYIDKTQVGKLTDENNIWAFEYTTGWLTSQHRHPLSPHITLEAGKQIDGSSFRPVQWFFDNLLPEEKARELLARHVKVPVEDAFQLLKEAGAESAGAITLMPEGEEVAPGTVHKLTYEEVNQRILNLPQVPLNRAERKRMSVAGAQHKMLVIYRHGELLEPSGFFPSTHILKPQHSSPEVYYHTVRNEWFVMTLAGLCGLEVPPVDIRYLPEPVYLIERFDRAGEYPHQHRRHVLDGCQLLNLGPHMKYPNSNAGSLNKLAELTRMKARTKIDIFRWALFNALVGNGDAHLKNLSFFINKEDVVMTPHYDLLSTAIYEAPHKHMDHQLSQQMGDAQYLGQLTVPNILAFAEELQLPTKLAKRELDRLIGKIEQEAIPLMQQVQDAPAHPGKGGEIRMIKEIYYNCIKEMVTRLTKA >NZ_AP021859.1|WP_155013291.1|116777_117206_+|DUF3010-family-protein 
MRVCGVELAGNDANIALLNLENDLIQIPDCRTRKLSLQKAATAHELKYFQKSFAQLVQDYKIETIVIRQRPMKGKFAGGAIGFKLEAALELLNGVQVIVMPPTEIKAALKENHMFIEFGDTGLKGFQKSAFETALAYISKHL >NZ_AP021859.1|WP_155013292.1|117299_117662_+|hypothetical-protein MLKRTAILAPLFFSAHLSAQTWIEDAQGQRYLLGDQLVSTTTEAYNYCTSKGLTPANVAQMRRALRQGKVTVDFVSVPVSEDAKTPFIKDKHWIAKHEQGRIRVRNSSSFDSALPLCAGR >NZ_AP021859.1|WP_162359724.1|118444_119167_+|PEP-CTERM-sorting-domain-containing-protein MNNVVKGLGVLLTLASMTANAVLINDNSFAQAGFKDTSTGLVWMDFGINNGQSYNHVSSQLGAGGDYSGWRLPSAEEVYMLWDHVANLDEVEADFESPDYYGAGQLYAWDYNSRVVGGDDSVWDNIFNIMGFNSASGTDYMERSSAIGFFMGHHGLASVKFHDAIDKVGFPAFTHKDEVALRDDGAYSDFFLGLAHENYSTMLVRTATVPEPGAFVAFATAIVALSWRRRQGRFGRKTRL >NZ_AP021859.1|WP_073324076.1|119339_119885_-|hypothetical-protein MKYQKQLDRLNSGTMSRHELAVMKKNAKALVEKGDSDAVAILDAIDYSKPADDYILFMGFCPGADFSQRLDIEWKKHGICRFDYLESESQLNRWNTLCAGDLVILKKREKFGESMKLYGYGRIKRIAYDEENTRYFEMDWSAQEQEIEVPLMGCNSTVDVKSMLEVEKQMPDNFWQWLNKE >NZ_AP021859.1|WP_155013294.1|120562_123034_+|glycoside-hydrolase-family-9-protein MIKSVLYSAIASALVTAPIVTSAAVPALNDKAYFSQPSLDIVVFSNWYNGLFGDSKISGVELVHFGERIATNGDVRLSATPEQWDPIPTFVERKVDNNTNTISATLSYPEFDFTYTISASPIENGVEITLSSPRPVPAELVGKAGFNMEFLPANYMETSFLADGKPGTFPLYPTGVKEIIGQHEPAPLASAKQLVLAPESDTKRVMIESSAPLTLYDGRAKAQNGWYVVRGVLPENKQGELLTWRITASTDDAWLRDPMIAHSQVGYLPGQTKRAVIELDKHAPVNGMAELLKVNADGSKAVVKKAAPGKVEDYTRYQYATFDFTDVTEPGLYQLRYKGTTTASFPIAEHVLDAAWYPTLDHYFPVQMDHVLVNEAYRVWHGASHLDDALQAPVNHEHFDLYAQGPTTDTQYKPGEHIPGLNVGGWYDAGDYDIRTQTQYRTVRFLVQAFEEFGIDRDTTLVDYDRKYVDIHVPDGKPDLLQQIEHGTLALLAQFKAVGHAIPGIIVPDISQYTHLGDGLTMTDNLIYNASMADTESNGIESGVFDDRWAFTSKSTPLNYGSMAALAAASRTLQGYKPVLAKESLDTAIAAWASEADKQPDLFRVGNTTGGGLEEEKLKAAVELLVTTGDTQYKHAVTALLPHIEEHFGRSAVLAVRALPFMDNAYKKRIRAAAEAYKPKLEAITSKNPFGVVITEYGWAGNGTVLDMAVTQYYLHQAYPDLYSSDLIYRSLDYLYGTHPDSDISFVSNVGTVSKKVAYGMNRADYSFISGAIVPGVLILKPDLPENMENWPFLWGENEYVIDLGASYLFTVNAALKLAGRQP >NZ_AP021859.1|WP_155013295.1|123152_124235_+|electron-transporter-RnfD 
MLNRFITLCALVCLHVSVAAFAKVVPATDAGYVYTGRIDFANASAPYLTWPGSSVKARFSGESLSVTLKDDNGKNYYNVIVDGNDAFPFVIEAKQGEHTYWISNTLGAGEHTVEIYKRTEGEEGGTHFLGISIDDDAALLAPPGRPTRRIEIYGDSISSGMGNVAPYNGPDNLPRDKNHYLSYGAIAARTLGAELHTISQSGIGIMVSWFNFIMPQFYDQLSAVGNNDSQWDFSTWTPQVVVINLMQNDSWLVPDPKRISPTPTEPQIIAAYQAFVKSVRAEYPNAQIICALGSMDATKAGSPWPGYVEAAVANLTIEGDSRLSTVVFPFNGYGQHPRVNQHTSNAELLTQAIQQVTGWR >NZ_AP021859.1|WP_162359727.1|124231_124978_-|CPBP-family-intramembrane-metalloprotease MEQGKAVSSIIILSVFSAAIFATRFLQPQITAPYVKYAVPYLCWSLAILLAGVLLRRKGPLLKVVGITHQPLIGVAAALLFSLPMLIGFSVFFEFATPSMTTLLTKSLIPGFFEELFFRGFLVGSLIAIAGWRFLPAALIGAVIFGMGHWFQGATLVQAATAALFTAIGGLWFAWLFYRWGHNLWIVITLHTLMNAYWVLWQVDSTAIGGQAANLCRMGTIVLSIAGTEWWVRRSRKRSLTPADSPET >NZ_AP021859.1|WP_155013297.1|125097_125910_-|TauD/TfdA-family-dioxygenase MASDTITITPLTRNIGAEIGNIDLTKPITAEVEDQLKAAIAEHQVIFFRDQQITHEQHMAVGQIFGDLIVHPGAKGIDGYEKIVAIHADKDSKYIAGDNWHSDLSCNELPPMGSMLYIHTLPEVGGDTLFSSMYAAYDALSPAMQQYLEGLQAEHDANHVYHAIYGDYGTAYPCNVHPVVRTHPVTGKKAIFVNASYTTRILGVSKNESDGILAMLYELAKDPNFQVRFSWQPHSIAIWDNRCTQHFAVWDYFPDTRSGYRVTIGGDKPY |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021859_2 | 1657863-1658090 | Orphan |
NA
Consensus repeat of NZ_AP021859_2
|
2 spacers
spacers of NZ_AP021859_2
>2.1|1657924|22|NZ_AP021859|PILER-CR GTTTAGAAATAATGAGAGGGGA >2.2|1658007|23|NZ_AP021859|PILER-CR AGTTTAGAAATAATAAGAGTGGG |
CRISPR arrays and Neighbor proteins around NZ_AP021859_2
The CRISPR arrays of NZ_AP021859_2 >merge|NZ_AP021859|2|1657863-1658090|PILER-CR AGTGGCGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTAGTTTAGAAATAATGAGAGGGGAAGTGGTGGACGTTTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTTAGTTTAGAAATAATAAGAGTGGGAGTGGTGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTCTTTTTTA >NZ_AP021859|2|2|1657863-1658090|PILER-CR AGTGGCGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTA GTTTAGAAATAATGAGAGGGGA AGTGGTGGACGTTTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTT AGTTTAGAAATAATAAGAGTGGG AGTGGTGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTCTTTTTTA
>NZ_AP021859.1|WP_155014261.1|1656658_1657708_+|mechanosensitive-ion-channel MEASTNELVMALNTYWQAFIVKLPAIALGLVIMVIMVTLAGRVAAALIKPVGYLSASPLLRSVSQRAISLIIILLALYIFLKLSGLTEFAVAIMSGTGVLGLILGFAFRDIAENMIASLLLTVQRPFRINDVVQINNYTGVVQKVTTRATTLVDFDGNHIQIPNAVIYKGTIKNLTANPKMRGQFTIGIGYDADCQLAQKRALSLISEHPNVLVEPEPLVLVDGLGSSTINLKVYFWIDVESTSVVKMASVLMRDLVNLFTEKGISMPDDAREIVFPEGVPVLVQQGTEEQQYTDPDQSVRQNEQALTNRAEPNVESQSFLDDVSSENDDIRRQADEARAPEQGTNILS >NZ_AP021859.1|WP_155014260.1|1654770_1656405_-|response-regulator MNLKLARQTRVLVIDDQVLAKGYLKYSLEELGFQNIEYADRASNALSAIRRHHYDLIVCSYDLKNEQDGYFLYDQLKEHNELPPSTAFVFISADTTADIVHSIVELQPDDFLAKPFTVRELDRRLGRLLTRKKALKPVYQLIEQSTLDKALLELENFLTEPKNSEFFPLALKLKGELLLACGHYEEAREFYQAIINVQNFTWAQLGLIKSYISLDQDEEAEKLVIELALKQDSMLAAYDLLAALQIKHHEFEDALESVEVASAISPRNLHRHAKALDLSRLTHDYESQFEAAKKIVKIAKNSIHDKPEIYLNVARSGIDFAMTAEEEHTKRLIKQSTEYLKQLKSNFPKADIDDQLKVIDARMLYLEDEVDNARALLNQLNNDAWETESIEGLLDKAKAFHEVGIQDHALKILDMIERRCNNDPAQSDLFLQYVQQEKVEKTEIKLSPKALNNNAVIQYQRGDLQKALETFRQAFTIMPKNPSIALNLLQATAINLKESSSDAAKESLNSTLIHNCLKTIESGRLSEEQEQRYQRVKSVLKDLT >NZ_AP021859.1|WP_155014259.1|1652216_1654556_-|diguanylate-cyclase MNQALIEKYRLHRQEYETQLTQYVQSIAELFDVPVAFLGLSDNETLWIKAKCGTEATEAPLKQAICHLVIDSNDLIYIEDTQKDPRTQKMPAVAGSPFVRFYAGVPVKINNDLVGSFCIIDVKPRQLSSKELRALHNFGTHLGQHCELIFKHLSSEDEHELLNNSPAALIRWQVRPSLSVSYLSPNLTKLLGVALPEDASLFKLEDVIHHDDTDHFLFTVRNHQQGAEICECDFRVKVPGNRSVWYKLVSRAIFDNENLIAIQGLLLNNTEQKYLENRILDTNERMRLLLEASGLGTSDWDIENDTLRVNNRICHMLKLHPDSVDTQSMFWMQLVHPADKDRLISQIGNSLKEPNGVVDIEYRLRNAEGQYLWIETYGKVVTRNEMGRATRFAATHRNITEKKLAELHADKQRRLLGFINHAQNLFVAQKDLQLACEQIFPELIDLAESAYGFIGKMETEDGVPCLQIYAISDVSWSESSKAEYKKFKKGELRFTNLSNLFGHVVTSNAPVIANKPMMHNASKGTPAGHPVLKRFLGLPIQRENKVVGMIGLANKLEDYSQKDVEFLTPLTETLAYLFKAVEVEQARYEAEERLSYLAATDPLTGLMNRRAYFDKIQQFAAENNEPHCLAIVDIDNFKTLNDTYGHPVGDQVLRAVAKVMQHNIRNNDMVSRLGGEEFGIYIDTKDNAVCHQILQAILEEIKQLSFDTDQGQLNNVTVSIGATMFKKGPHAVDTTYFDIAMKQADQALYEVKRNGKASINWFSPSNALKKDGLITGLKS >NZ_AP021859.1|WP_155016569.1|1651636_1652182_+|GNAT-family-N-acetyltransferase 
MEQPESERLTYRLLDETDGEFLWELDQCEAVMRFINGGRKTSRKETNDIFIPRMLSYRNAPAGWGLWQASLKDGDKTALGWILVRPHGYFTEQRDDSVIELGWRFKQDWWGKGYATEAARAVMTYMQTLGARSFSAIALPENTASIHIMKKLGMTFSHTEHYQDQVFNDDIVVYHTDATKP >NZ_AP021859.1|WP_155014258.1|1650662_1651628_+|GTP-3',8-cyclase-MoaA MLQDTFGRRFYYLRLSITDVCNFRCQYCLPDGYEGQAQTFLSVNEIDTLVKGFAGMGTCKIRVTGGEPTLRKDVSDIVAACAATPGIQKVAMTTHGGKLASHAKSLADAGLSQVNISLDSLDPAQFALISGQDKLNAVLDGVNEALIQGMSVKVNTVLLRPFAESQLSTFLDWLKDMPVTVRFIELMETGQHKAFFKSHHTSAEPFLTRLKAAGWEVLQRGADAGPAVELQHPDYAGRFGFIMPYSSDFCSSCNRLRVTALGKLHLCLFSDNGLSLRDYLLRGDVSGLQGFITEKLADKKVSHYLQDGVTGITKNLSMLGG >NZ_AP021859.1|WP_155014257.1|1648706_1650491_-|ATP-binding-cassette-domain-containing-protein MRPTRLPVNPDTPMQWHVFARLWPYLLEFKQRVALALLCLVAAKIASIGLPFVLKHTVDNLNQEPIATLAVPIALVVAYGTLRLINVLLGEVRDTLFGRVTERAMRRIGLEVFEHLHRLDLTFHLSRQTGGLSRDIERGTSGISFLMRFMVFNIGPTLLEIALVVGVLLTQYGMSFAMIILCSVVAYVWFSMKATDWRTEFVRQANLADSTSNTRAIDSLLNYETVKYFNNEQYEARQYDTNLANWEQARRKNRLSLFALNGGQAFIIAASMTFMMLLAALEVSRDNMTIGDFVLINAFTMQIFMPLNFLGFVYREIRGSLANIENLFNLLGTVPTVADAPSAGKLEINQSRIHFDNVSFYYRAERRILNHVNFTIEPGTKVAIVGESGAGKSTLVKLLFRFYDPVSGAVRIDGQDISTVTQHSLRQHIGIVPQDTVLFNDTIGENIRYGRPDATEQDIQQAIRLAHLEQFIASLPDGLNTQVGERGLKLSGGEKQRVAIARAILKRPAIMVFDEATSSLDSQSEQAILSALREVAEGHTSMVIAHRLSTIIDADKILVMQQGQIVEQGTHSELLSQQGVYAGLWQAQQKQASD >NZ_AP021859.1|WP_073322631.1|1648491_1648707_-|DUF3820-family-protein MDPQQLKLTINQIMPFGKYAGRKLIHLPEPYLVWFAKQGFPEGKLGQQLALMYEIKLNGMESMLTPLIDND >NZ_AP021859.1|WP_155014256.1|1648144_1648480_-|hypothetical-protein MVYQRKRMQQQRIDQQSQQLHLCIAKKLLANPDMMEAVTARLHQRYQDKLMGYGSYLHWQAILAEFPHPEHFIAAITASDSTTTRLRRATIFTGVLNEKERSDCLAAPSQQ >NZ_AP021859.1|WP_073322635.1|1646788_1648132_+|3-deoxy-7-phosphoheptulonate-synthase-class-II 
MNNWQPDSWRKKPILQQPEYDDKAELAQVEKTLSSYPPLVFAAEARELRRQLGQVCEGKGFLLQGGDCAESFSEFNAPKIRDTFKVLLQMAIVLTFAGRCPVTKVARMAGQYAKPRSSDFETKDGITLPSYRGDIINSFEFSEAARRPDPQRLIEAYHRSSATLNLLRAFAQGGLADLHEVNRWNMAFVENNPLKEQYQDIARRIQDSLEFMDVIGLNASNTPTLHETSLFTSHEALLLNYEEALTRIDTLTGKPYDCSAHMVWIGERTRQLDHAHIEFFRGIHNPIGVKVGPTMEEDELIRLIDALNPNNEAGRLTLITRMGADKLEANLPRLLRRVKAEGRNVVWSSDPMHGNTFSASSGYKTRNFDAILSEIRQFFAAHDAEGTYAGGIHLEMTGQHVTECTGGAYQISDDDLAEAYKTQCDPRLNADQVLEMAFLVSDHLRIK >NZ_AP021859.1|WP_155014255.1|1645978_1646731_+|ATP-binding-cassette-domain-containing-protein MTRKLRAEHICLTNRFKRLSIVHASSGIVCLLGANGAGKSSLLEVLAGLTPATEGEVLWGGQPTAQRSLAELAVERGYLAQKPSIQFELTGRDCLQFFNDHTQQQIPGMLIEKLGLTTLLDKVYTHMSGGEQQRIFIARTLLQVWQPLMDGNALLILDEPLQSLDIRHQHALMCWLADLGIRGNQIVMSCHDVNIANTFADTVWLARSGELLASGPVEEVMTLDNLWRTFDCHFDFLEREPRGVFVPVSV >NZ_AP021859.1|WP_155014262.1|1658616_1660026_+|glutamate--tRNA-ligase MAVVTRFAPSPTGYLHVGGARTALYSWLYAKSQGGEFVLRIEDTDIERSTEEAKQAILDGMQWLGLTWDRGPYYQTERFDRYKAIIQTMLEEGKAYKCFMPADELDAIREAQKERGEKPRYPGTWRDRTEHPEGQPYVIRFKNPQEGSVVFDDHVRGRIEISNSELDDLIIQRSDGTPTYNFCVVVDDWDMGITHVVRGEDHINNTPRQINILKALNAPVPEYAHVSMILGDDGKKLSKRHGAVSVMQYRDDGYLPQAVKNYLVRLGWSHGDQEIFSEQEMIELFSLDAIGQSASAFNTEKLIWLNQHYIKTLPGSEVAEHAKWHFEQLNVDLSAGPALEDVIAIQADRVKTLKELAEISLYFYKDFEDFDANAAKKHLRPVAKEPLQVVQEKLEALADWTPETIHAAINGAAESLGVGMGKVGMPLRVAATGGGNSPSLDVTLHLLPKAKVVERINKALTFIANRENS >NZ_AP021859.1|WP_155016570.1|1661311_1662256_+|TRAP-transporter-substrate-binding-protein-DctP MPAQATTLNVVTALSQNDPIYQGLLRFKQAVEQGSDNQIKVRLFVGSQLGNDNDILEQAMAGAPVAVLVDAGRLSFYQPEIGVLSAPYLIDNVEQLNVLVQSPMFEQWANALATQSGIKVLGFNWWQGERHVLTNKPVFTPDDLDGVRLRTIGAPVWISTIRAMGATPTPLSWAEVYSGLQQRVIDGAEAQHAGTYGARLYEVIGYVNKTRHIHLISGLVASNHWFKRLSKAHQNLVQKSALEAGEFATSLVQARQSEIEQALAAAGVEIVEPDIDAFKHATQQVYTELGYENVYQRIQQYLAEQMGVDIKESH >NZ_AP021859.1|WP_155014263.1|1662255_1662801_+|TRAP-transporter-small-permease-subunit 
MWAQIERGIAVMLLAAIVLLVLLAAILRTAGYPIIWSVDIAQLLFAWLSVIAANQALRQGSHARLDILMNRLRLINRLRLTLALNLISMSCMLVVAVFGFQLVGINPARTLGSTAIPYAWVTAALPAGAVLMLVTLLQQSVRVFHCLRKPGDTLQNPPAFLASVLTPDTHHSDKLAKESLS >NZ_AP021859.1|WP_155014264.1|1662797_1664069_+|TRAP-transporter-large-permease-subunit MSGVVFLILLLLGLPLAFTLIASGMVYFAQNPELPSAVAVQRMVAASQSFPLLAVPFFILAGHVMNCSGITKRLIHVSNLLVAWISGGLAHVTIVLSALMGGVSGSAIADAAMQARILGEPMQASGLTKGFSAATITVSALITACIPPSIGLILYGYMGNVSIGKLFLAGLIPGLLLTLVLMLVVYLQARKKGFAPTQANPPSFKQILEAINQSKWALLFPVLLIVTIRFGIFTPSEAGAFAVLYACVVGRFAYQELTLSDITTSLSESVSDIGMIMLIILASGVVGYAIAYEQLPVSLTLAVTQVTEQPQLILLMSLVILLVVGVVMEGTITVLLLTPILVPLMQSVGVDPVHFGILMLIMVTLGGTTPPVGIAMYAVCNILRCSTTDYVKAAVPLFTAVLALVVVLALYPPLVLFIPELLF >NZ_AP021859.1|WP_155014265.1|1664083_1665292_+|D-galactonate-dehydratase-family-protein MKIRDVKVIVCSPGRNFVTLKIVTDEGIYGIGDATLNGREKSVVSYLEDYIAPALIGKDPHRIEDIWQFFYRGAYWRRGPVGMTAIAAVDTALWDIKAKVAGLPLYQLLGGRSRDKIMVYTHANGADIPATLDAVGKAIEDGYKAIRVQSGIPGVKSTYGVAKEGQKYEPADADLPTESVWSTEKYLNFAPKLFAAVREQYGDDIHLLHDVHHRLSPIEAARLGKSLEPYHLFWMEDPVAAENQQGFKLIREHTTTPIAVGEVFNSIHDCQALIQNQWIDYIRSTVAHAGGITQLRRIADLASLYHIRMGCHGATDLSPVCMGAALHFDYWVPNFGIQEHMPHSELMESVFSVSYKFDDGFFTPGETPGHGVDIDEELAKKYPYKRACLPVNRLEDGTLWHW >NZ_AP021859.1|WP_155014266.1|1665355_1665772_-|hemin-receptor MDAKTISLVQSTFQQVVPIAGTAASLFYTKLFELDPSLKPMFKSDITEQGKKLMQMIGVAVNGLNNLDALVPAVEQLGSRHVGYGVQDSHYDTVGTALLWTLNKGLAEDFTPEVEAAWTEVYTLLASVMKEASKTQVA >NZ_AP021859.1|WP_155014267.1|1666123_1667044_+|DUF2817-domain-containing-protein MTMYPIGTPGTPWDENEKRQWLELQSVKRSYAEEVLTKLTALPETLTRVQYGALPYDTERYPLYALLSKSPTDGAPWVLITGGVHGYETSGVQGAILFANEYSKAYNGKVNFVVVPCVSPWGYETINRWNPKAVDPNRSFKPESPAAESQLLMDFVNSLPFDITLHVDLHETTDTDNSEFRPALAARDAIEQKTWNIPDGFYLVADTQQPCIALQEAMIHEVKQVTHIAPADDSGRIIGETLLSEGVIGYNKKALFLCGGFTNAPMCSTTEVYPDSPSATDEICNLAQVAAIGGALRYLLSDANAQ >NZ_AP021859.1|WP_155523163.1|1667129_1667303_+|hypothetical-protein MQFVALLYDGISQRFVRVEAQDEKAFFSALDKQYPCYVCLWHSYEATEVNASVPQQV >NZ_AP021859.1|WP_073322593.1|1667415_1668354_+|tRNA-dihydrouridine(16)-synthase-DusC 
MRVYLAPMEGVVDHLMRDMLTRVGGFDLCVTEFVRVVDQKLPHKTFYRLCPELHNDCKTPSGVPVKIQLLGQHPEWLAENAMTAVELGSPGVDLNFGCPAKTVNKSKGGAVLLQYTQQLHDIVYAVRQAVPAHLPVTAKIRLGYEDKSLAIDNAVAIDEAGASELVVHARTKTEGYRPPAYWDWIKKIKAVTRLPVIANGEIWNHDDAVRCMQASGCDDLMIGRGALAMPNLARHIRGEEAPMAWQDLSQLLIDYSGYEIFGDKGRYYPNRIKQWCGYLKRQYPQAETLFSNIRRLQKADEIVNVLRQSAHL >NZ_AP021859.1|WP_155014268.1|1668521_1669940_+|amidohydrolase-family-protein MNNTHKRLSIMMLCCCAFWLTACSEDKAAKTEQAAKVSIDKNPFPSRYTPLAGEPTLITNVTILDGIGNKIDKGMVYFADGKIVEIGETLSVPNGVRTIDGQGKWVTPGIIDVHSHLGVYPNPSTHSHSDGNEIVKPVTANVWAEHSVWPQDPGFGRALAGGVTSLQILPGSANLFGGRSVVLKNVPHRTMQEMKFPDAPYGLKMACGENPKRVYGKRGGPSTRMGNVAGYRQAWSDAQDYQRKWDQYEADYEAGKNPKAPKRDLNLETLAGVLDGDIRVHMHCYRADEMAVMMDVMKEFNYQIYSFQHAVEAYKISDILAENNVCSAMWADWWGFKMEAYDGIRENVPMVHNAGACAIVHSDSDLGIQRLNQEAAKAWADGRRAGIDIPQEDAWIWLSANPAKSLGIFDKTGSLESGKNADLVMWTANPFSTYARAEKVYIDGGLAYDLNDPQSWPVADFELGQVGEGDSK |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021859_3 | 3836907-3836999 | Orphan |
NA
Consensus repeat of NZ_AP021859_3
|
1 spacers
spacers of NZ_AP021859_3
>3.1|3836936|35|NZ_AP021859|CRISPRCasFinder TTAGGTCACGGGGCAAGCCCGGGATGACGGGGAGT |
CRISPR arrays and Neighbor proteins around NZ_AP021859_3
The CRISPR arrays of NZ_AP021859_3 >merge|NZ_AP021859|3|3836907-3836999|CRISPRCasFinder GGGCAAACCTTAGCTCCGAGGCTTTTTTATTAGGTCACGGGGCAAGCCCGGGATGACGGGGAGTGGGCAAACCTTAGCTCCGAGGCCTTTTTA >NZ_AP021859|3|1|3836907-3836999|CRISPRCasFinder GGGCAAACCTTAGCTCCGAGGCTTTTTTA TTAGGTCACGGGGCAAGCCCGGGATGACGGGGAGT GGGCAAACCTTAGCTCCGAGGCCTTTTTA
>NZ_AP021859.1|WP_155015717.1|3836345_3836498_-|lasso-RiPP-family-leader-peptide-containing-protein MNKQDDRNNKLTYHAPQLQRAGSLQDLTQGFSGSVPDEQGGHTKRNPFQG >NZ_AP021859.1|WP_155015716.1|3835231_3836215_-|hypothetical-protein MSPNTQQELASLSSTPLHGVTLALNKTTNNTNTPFALDVSFNPVTPLSNPKTAWTLTDKGERYESNITLNREEDCFQLHIDCEGQGTFQLQDNQLTIDWQANGTGPAHYLQTLGLSLFLELQGHLCLHANTLVKNNRAQLFLAPSRTGKSTLTTLLTTLGYTLTTDDMAALYHTNEQYEVYPSWPKVRLWPDSAAMLENHLTTQPVQQKKVHERFAKQEISFAAQDTHTATPVTAMYYLNRVDEQPQSANPLTITPINPSAALIILMQNSMLGDAYRGLGIEQSRIIALAALLLRVPFYRVTYPSGLEQLPDIAKHLDEWLQSEQAK >NZ_AP021859.1|WP_084526282.1|3833804_3835025_-|HD-domain-containing-protein MNKRRYQKLVAIYLLASLIWIIGSDWLLAQFIGDFNDSYVLSAGKGVAFVTLMSGLLYVLLMRLNSAERQVASAATDSTEQLVLNKIPTFFQHIPMLTYAVNWDGKTLRTLWVSDNLYDLLGYTQEEALQPGWWEKCVHPDDRLRALEESKTILANGGGDHYYRVKHAKGHYVYFHDELRRVESVSATCFVGIWRDISTEESALEQVQEYSTKLEKTILGTITAISHMVELRDPYTAGHESRVGELASAIALEMGLDMDTQYGLRIAGLLHDIGKISIPAEYLTKPTRLTDAEFEIIKSHACNGYNILKNIAFPWPVAEVAYQHHERLDGTGYPRGLKGDEILLEARIITVADVIESMATNRPYRHALGIKKALQEIEQHAGKLYDPEVCNAALTLFRQKNYQLGD >NZ_AP021859.1|WP_155015715.1|3830006_3833462_+|hypothetical-protein 
MKTCLKSFAAVVKYTAFLPLLFVISACGGGGDTSTPSPNPNPPVTPKAAISLGSYTLQTEVRDTNGNVVSNCNCGSAVVLPSGEIWVAFEQYPLVEDYGYGVLLTTQAQSTSSFTADYTYYEASVSVAEGSLASPGTFNANQFSFSLTVKGQSVNVEAVKSTDATSIDLFSLSSGSGELEFSDDPYSRISVTANGNVNGEVLGCPISDGELADVSNSSHVFSLSLSFDSCSNDSLSNQSFFGAAFTLPGISNHGLQLLFADDNGYFRQAYAEKLADTGSADYTTESAPSISSLTVTGAETVTLHWYSEDQTSIPTTGTSYTVYASTQPDFQPLPSHLVHVTGNALSADIDSLASDTEYYFKVIAKTEDKQILISSEIAAKTYKSSPVQAQGTTIFQAEDNNLTLTSASSDTLVFTLSAGANVPAPGDFIFAKNADDELLFKQVTSVSSNNNEATVLVTQPSLSELIPSAELSDYTSLGTVSGFAPDSRKVVPTASGKYPANSSPTKAERWGNSISNTAYYKPYKKAHLTQNGESVNVTFDGDKITLDIDGVAGSISLDPEIDIAPSVYSDFAWAGPLEIARAESRLDATVDLSLSVDSSISVDLLDFEKKIDVPSLTFKNTKIYAIGPIPVYQEIIFTLHAVISSTSSAGIDVSTGLHRKYKLTAGASYNGLTEEWDGILNGPTLLSQSSTASAAVHIGANLELRIVPKIELRFYTVAAPYFSIEPYASAQIEVGNGVTFDSLSGNTQVTPTQFNTFNVELGLECFVGFEFGILVPNKFSLKSKNVCPFVEPNVLWAVPEFQFDVDTESTTSDFATISALVSNDLGYNELDASSATWDIYPNDFILIPHSDDPLKADVQLTSNANEEGEYTILFSATDKLGEIGRQYDSGKIRFGDCITEINEAYTSMDQLPVDESLRLSQYACYHTPAQNEFGFRAENALFARSYFQVNEEVYEGPLSPLEVLNAGEFRYKKHVGILDIPFFTVAESRYVVYGNYGREIDGIYTKYREVIFKALSDEPFERLRRTASGDDKPYFVPKQSVEAELTYNVGIPSYNLTYFNGLDDTDTRGAIYYIYKYRCAQRTKETEGYTIYGELVYESYIEMPENDDTCLSNAEAKRLNSIDPIRDNYFYLDYKHRYSDLLEIIFAIKDE >NZ_AP021859.1|WP_155015714.1|3827740_3829144_-|undecaprenyl-phosphate-glucose-phosphotransferase MNENGGFIRNNINQFAFVYRLIDILIIQLCLAVSAFAYIGKFDVRYFVLGLISNMAYLFFAELFVLYRSWRNGSFKEMLFYTVSSWLLTLFPLFLFLFFTKQTEYFSRVTLGLWIISTTTLLCLWRAIFRQFLISIRKKGYNTRSVGIIGLTKRGLDLANEIVNYPESGYKLTAVFDERAPERLDGKFLHKLEGGIADGVRLAQEGKIEILFVALPLTNKERIENILKELGDTTVDVHIVPDVFTFNLLHSRMGHVGEIQTISVYDSPMRGGYSIMKRMEDLFLAFSILTVIALPMLIIAAIIKITSPGPVLFKQDRYGMDGKKIKVWKFRSMRVMDNGEVVKQATKNDPRVTKFGAFLRRTSLDELPQFFNVIQGTMSVVGPRPHAVSHNEEYRKKVAYYMLRHKMKPGITGWAQVNGWRGETDTVEKMEMRIKYDLEYIRNWSIWMDFKIVIFTIFRGFVGKNVY >NZ_AP021859.1|WP_155015713.1|3826803_3827751_-|sulfotransferase 
MFTKRIFIFSLPRSGSTLLQRYIMSCENIETTSETWLLLSLLYSIEVDGIKAEYNQRKLTEAVEDFFRYKKIEKKQYYESIIDFYFKLYNTQYISEENCSKIIVEKTPRLSLVSDKLIECSKNSHFVFLWRNPVDIINSMCETWAKGNWNIFHYYIDLYKSQKNNVETYLKYKNRSNVHSIKYEDFVSDENLREKLLDDIGLEYSELNINKAPLFGKMGDKTGIQKYKSISSPKKENKIGSILRYFWIKRYIRYLKEIGFDDIYDTGEIIKSYKISWRIDVLIKDVFSFSYGYIYLATEPRVYLEKLKGRKGILG >NZ_AP021859.1|WP_155015712.1|3826024_3826807_-|glycosyltransferase MRLVIITITYNNLNGLMKTNESLEVQSDQDYIQIVIDGGSTDGTSNYVKNMRERASFYYASERDKGIYDAMNKGLLKYKSLASNNNDYILFLNSGDYLYSKESIKNIKEKLIDLDSDLLLFDVYEDIYGKLYYKKSRDKDWIPKGMPTSHQAMLFNSNIFKEYKFDDGMRFSGDYDLVCYCYINNKNIKKTNKAVSVFDKTGVSEVNRIQALKENYVVRRKTLKMSFLNSIFLYFIHYIHTKLKIYLPGFTRYLRGLTRE >NZ_AP021859.1|WP_155015711.1|3825600_3826032_-|hypothetical-protein MSKKILAYASRGGHWVQLKRIIDGSDFKLTTISTLGNDEADYIVPDFSRSTWYRFLGVLISISKIVFRNNPDIVITTGAAPGVIIALLYRVKGTKVIWIDSIANSKKLSLSARIVRPFVTVVLTQWEYLADESKNILYKGAVF >NZ_AP021859.1|WP_162359943.1|3825127_3825604_-|glucuronosyltransferase MILVTVGVQLPFKRLIKKTLEIAKINADVDFVIQCGDHQHAELQNVKFISSVSEDDFNELISKCEFVVSHAGMGTILKALTLRKKIILVPRLASLNEHRNNHQIDTLRAFKQKPGIFPCMNVEDLLTIYNECKFAAPNFQESNNEKLRELRAYLFGLI >NZ_AP021859.1|WP_155015709.1|3823997_3824897_-|hypothetical-protein MSIEKIQAGLTVEETFITRILLFFSFIFFVFSKKVDFVVLFLRYRVFFLSVFFILNILQLVKQSNIAFGALKHIYHEEIILVYVIALLYIYCNKKSSIFTMLLIALTPAFSFKNTGFFLSLLLLVYILLILNNEKRVKLSVSLTISSVLVIAFTGVGFLLYDYILPYLPSGSPEVRLETYSFRINTFLENVFFGDYAGSNMLLRIGYLFWGFDIPSHSDVLDILAFFGLFGAFVFYYPVFLSIFRAAYTNSHDLTYLFIIAGFVFLMAFNPIINQPKLIVVYYYILATAYEKFRFKIKL >NZ_AP021859.1|WP_155015718.1|3837325_3837520_-|hypothetical-protein MTLKASDSASPARRYILKNKEKRDEEESSFDALEKRYQALKKAHEGDAERASAMVERAYALVRE >NZ_AP021859.1|WP_155015719.1|3837612_3839007_-|hypothetical-protein 
MQLPQFFKQVLHISTGFYKSLSQSNLIFGVVFLFLSATNYCVYMSFFTLSWPSEFESIIRHRDIAVFNVVLGAALLLSLILRFSTQHKDQDSWRYLSIVTFSLVGSVVVALITAENRLQMPAIWALAISLFHVGYWSWRREHNERVNYETKMENMTTDLTNYVNTMPKEDAFRLLGETTKAKFNFLYTLDLMAKQVNSIEQQEKLAEYAKEEFKTGLQALCQIASLWTISKTRIYEGNVMVALPSSDAPNAPNAREAFENGKYFFHDMTTLDTVEHCCKQVLYIVPQLTTSISSNNSQPTPTEPFMLPVGLKSEIHPGSIKGAPECVEKNEVVSIDINEIENSLPDNYVGQRLNEIKEYYKLQTDWQSVLCIPLHIKGLKNTADDFTNGVNPPANGVINLYRSEKGSVKSPELFYELTAPLVQLLENLLYLYLAHTPDDGSYSPEFCTFPRAEKDEGIGDIPTG >NZ_AP021859.1|WP_139241518.1|3839197_3839359_-|lasso-RiPP-family-leader-peptide-containing-protein MKSQQAVVTEQSKKAYAQPKLDKAGSLAELTQGFSGSVPDEQGGHTKRNPFQG >NZ_AP021859.1|WP_155015720.1|3839371_3839974_-|hypothetical-protein MRKFIIAMAMTLFASFQANASYVQGVTGADMVGISVSVDFANGSSESAIWQALTATQGGAFGASDWALILDGDSFGDFDPVSNTFYGLFTFFNGPFDVVSITIDILSEGFVFDTAFFDASANGSGPGHEFVSSDPQATASYSNLVEDELYGTMTISAFIAAGSGLAFQTDTDAFGEVPAPAGLLLVAFGLLALRATRRSK >NZ_AP021859.1|WP_139241519.1|3840254_3840416_-|lasso-RiPP-family-leader-peptide-containing-protein MTSIEQRDAHAEKREYSKPELQKAGSLAELTQGFSGSVPDEQGGHTKRNPFQG >NZ_AP021859.1|WP_155015721.1|3840430_3841045_-|hypothetical-protein MKKLILAGILAFFACQANAGFVQGVTGEDMVGMEVTATFADGSSETATWGAVAPGAGGVFGLIPWGVLLDGDSFGDFDPVTGDLFGGFLMMNFYDFDMVSLSFNALAAGFVFDTAYFDASANGSGPGRELVSSNADVFAVYSDNYMDELFGTMTLLSTSQVVVASGEMQVFLTDTDQISVPAPAGFAMIALALMGMRIARKSSK >NZ_AP021859.1|WP_155014205.1|3841742_3842780_+|IS110-family-transposase MYKDNVIAIDIAKSVFQVCVFDKHNQIKSNQEIRRQKLMAWLAKQPASIVALEGCGSSHYWAKVAEKLGHTPLQIPTRFVKKFVEGQKTDKNDAIAIGIAARQPNLKPVAVKSDEQLALQACEKMRKHYQDVAISTSNMMRSILYEFGIVIPQGESALKSKLPDILEDAENGLPMMLRQPLHQQFQLWLSLKERINEATKYLRVQLRTHTICNELQKLDGIGPVNALNLYLALGTKGESFKNGREAAACIGLTPKQHSSGGKVVMLGISKHIAKKQLRANLIQGALAKIKVVAKRPPKNTREVWMKQLIERRGLRRAAVALANKMVRVAWAMVHHQQPYKSPQAI >NZ_AP021859.1|WP_155015722.1|3844323_3846102_+|lasso-peptide-isopeptide-bond-forming-cyclase 
MTALFAMVAADHHAIMDLFDEASRHATLALNWQNEHCLVGNYCYRGNARTQLLSQFSLPQGNIIANASAPLSNSELSEQFSTQPAAISNAITGPHTVFTWHNPHKLLFASRDPLNQHALYYGKVGGVTVISSEASFIAKLMPKQPSLNTTALSCWLAGQPNPALCLYNEINTLPLGTSLSVSPQGNVTEHTFWDIDPQNKLAPTSDGAYRETFLDLLKQCVSSHIHPSDSLVVSQMSGGMDSTSITALANELLTEPRSCRALSHLYSHSASCDESDNIKAMYQKLGLVDPIQITVDAGAHRDFMSLYPTDYDSPGTVLSPRYHQECEIIQAAGGHRLLTGNGGDEMCWGHASAYTERLFKGEFGVIAEVLKACKQTGMARWPVARSLFVKPMIPQWLLNSAYALKGYKPSDIPAWLTPEAAKLATDASKIPNPFNERKQPVGYARYQALKTTSTYNSVRSYQKVGWQYGIDVAHPFFDPRMAEFSFAVPGKQLIRGPYPKWLLRNAMQNHLPESVCWNVKKVTFDNHFGQLVKDNAKPLRELLSDTRLASLGLVDNDVLLNAFDAAVGGNGVSVHVDLLYAILTQRWIQQHH >NZ_AP021859.1|WP_155015723.1|3846131_3846593_-|lasso-peptide-biosynthesis-B2-protein MLKSFSKYRALAPDQRRWFRRCWWQFALWHIRIQYFPYHWWKARIFSELNSESGHSLPFSLSEAIRLSEMAARHHIFPINCLRRCVVQQQLLAQYGYDLALHFGVAKQDARLKAHCWLTHNGQLINDGLEVVNTYTELKLAAEQSQHILASLR >NZ_AP021859.1|WP_073320074.1|3846585_3846855_-|PqqD-family-protein MSAYQLKPELLLQKVADEMVLLEPESGEYFTLNNVGADMLEQLQQGKSAQQIAQYIADIYDVTAEQAEQDFQVLMHDLVQANLAEAGVA |
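
The array record for NZ_AP021859_3 above gives an array location (3836907-3836999), a spacer start (3836936), and the repeat/spacer/repeat sequences. A minimal sketch of how these fields can be cross-checked against each other follows. It assumes the coordinates are 1-based and inclusive, which is an assumption consistent with the numbers shown and not something the record states; the function name `array_is_consistent` is hypothetical and is not part of CRISPRCasFinder.

```python
def array_is_consistent(start, end, left_repeat, spacer, right_repeat, spacer_start):
    """Check a single-spacer array record for internal consistency.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    # The repeat + spacer + repeat lengths should cover the whole reported span.
    total_length = len(left_repeat) + len(spacer) + len(right_repeat)
    span_ok = total_length == end - start + 1
    # The spacer should begin immediately after the left repeat.
    spacer_ok = spacer_start == start + len(left_repeat)
    return span_ok and spacer_ok


# Values copied from the NZ_AP021859_3 record.
LEFT_REPEAT = "GGGCAAACCTTAGCTCCGAGGCTTTTTTA"
SPACER = "TTAGGTCACGGGGCAAGCCCGGGATGACGGGGAGT"
RIGHT_REPEAT = "GGGCAAACCTTAGCTCCGAGGCCTTTTTA"

if __name__ == "__main__":
    print(array_is_consistent(3836907, 3836999, LEFT_REPEAT, SPACER, RIGHT_REPEAT, 3836936))
```

The same check can be applied to the other single-spacer arrays by substituting their coordinates and sequences.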
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021859_4 | 3854466-3854541 | Orphan |
NA
Consensus repeat of NZ_AP021859_4
|
1 spacers
spacers of NZ_AP021859_4
>4.1|3854489|30|NZ_AP021859|CRISPRCasFinder CCCAGCGATATTCGAGTTCTTAAAAAGCCA |
CRISPR arrays and Neighbor proteins around NZ_AP021859_4
The CRISPR arrays of NZ_AP021859_4 >merge|NZ_AP021859|4|3854466-3854541|CRISPRCasFinder CACATCCGTCGTCCCGGAATCGTCCCAGCGATATTCGAGTTCTTAAAAAGCCACACATCCGTCGTCCCGAAATCGT >NZ_AP021859|4|2|3854466-3854541|CRISPRCasFinder CACATCCGTCGTCCCGGAATCGT CCCAGCGATATTCGAGTTCTTAAAAAGCCA CACATCCGTCGTCCCGAAATCGT
>NZ_AP021859.1|WP_155014205.1|3852964_3854002_+|IS110-family-transposase MYKDNVIAIDIAKSVFQVCVFDKHNQIKSNQEIRRQKLMAWLAKQPASIVALEGCGSSHYWAKVAEKLGHTPLQIPTRFVKKFVEGQKTDKNDAIAIGIAARQPNLKPVAVKSDEQLALQACEKMRKHYQDVAISTSNMMRSILYEFGIVIPQGESALKSKLPDILEDAENGLPMMLRQPLHQQFQLWLSLKERINEATKYLRVQLRTHTICNELQKLDGIGPVNALNLYLALGTKGESFKNGREAAACIGLTPKQHSSGGKVVMLGISKHIAKKQLRANLIQGALAKIKVVAKRPPKNTREVWMKQLIERRGLRRAAVALANKMVRVAWAMVHHQQPYKSPQAI >NZ_AP021859.1|WP_073320090.1|3851542_3852481_+|hypothetical-protein MKALAIASLFGATLATSAVAQNLTELDKQLNIMSGVIDTALKQDTRKEGVRYRSIEATYLAKQGVIFTIHTGGRGMMFDFDFGDLMSVIPTPPSAPDAPTVTVVADGMHVESHGDYEFIVEQDWGETAERVVRQVEKIVRKTDEKLREFRSDRREIEWEIRELERRNRDLEFELRAADNERKREIEGEMKELKAELDRLQSRQQELQDYAQELAEEKKAELAKQREMQEKAYKTFLANFEGSVGDTLCSFGAGLRELPDDEHISFILKNFGKGEDGKAQDRLYIFNKKSVKSCVAEKITPSDLLAKAEVYVF >NZ_AP021859.1|WP_155015726.1|3850869_3851520_+|hypothetical-protein MANYESDYLFGQWLDNALTNEERDAFEALCLSDKAFAAQVETATQLSVAAEQFTPPPMPAWDKNATFVAPDKPKWWQWQGLPVMSMAMSALAIVMVVSGFSVQVSEGKLTMGFQQGPSDEQVAALVNQKLNDYQQANQAMFTQYVAALQQQQQENSTQLTQYLLTSSRQERREDFAELIKFINQQRDDDQRYYARQLTQLQREINSLDDGYPALTE >NZ_AP021859.1|WP_155015725.1|3850334_3850877_+|sigma-70-family-RNA-polymerase-sigma-factor MFEKRDVQLIEQALKGHKKAWFALIKRYESAIYQYGVRMTGNPHDAADLMQDIFIAVFRSLSNYRGEGSFKAWLFRIAHFRCIEFYRRKRPDSPLDEDDELSCERPCPEHNLMTDSTSQALTAAMQRLPLAQRAVIELKFFGQFTFDEIADQLGLSSNTVKSRLYSALSKLKLDLEVEHG >NZ_AP021859.1|WP_073320080.1|3849350_3850259_+|hydrogen-peroxide-inducible-genes-activator MKWPNLKHLHYLVTLHQEQHFHRAAQRCNVSQSTLSTAIQNLEEHFGSQLLEREHKTFVFTSLGLDVVERSKVILQEAGELVEYAQNAGNWQRGKLKLGVIPTIAPFLFEAMLGAFRTFLPEIQLELQEDTTANLTRQLTDGSLDLLVLALPMETPGCKQMVLGHDPFHLIAHKDLANDLPSPLDISSLPKKSIFLLQQEHCMTGHAVSACNLQHTDQISSLAASSLYTLVQLANSKLGYTFLPELALNQDLLKNTQLTSFPAEEKAFREIGLVWRAGTTRMRLFRRVGEIISPLLPVPTLK >NZ_AP021859.1|WP_162359946.1|3846858_3848547_-|ATP-binding-cassette-domain-containing-protein 
MFDSLRLHYGLKVFQSYKWSFIMVVALMVTETAVSLSVPYLIGQQSQSFLQEVTILNNNHLKYLYLWVALFAAQAALRFLSTYNVNLVGARLMAELSCRLYDHIQVLPIQYFQENKRGDVLSMLTNDLAVVSFFASSVLTNLIPNILVLIGAGILMYMIEPTIALLICFLVPLIYIILKLVSRGMQPISRALVQRQADSLAQASENIGAISLIKAFNKEEAESSKFKRRSNEILALRARQFRLQALLSPLIQFLASLCILVVVIMSVLKFNSGALGIPDLISLLLYGIVFTRPLGSLAGLYGQLQQVIGASERLLHVYHLESEPRDEHGTPMTIEHGDIVFDSVAFGFPQRGTILKDVSFHVKAGQNMLIYGQNGGGKTTMLHLLMRFYQPATGKILIDGQNIQQATTASLRHAIGLVSQDVLLLNGSIFDNLTYGLANPAADDVYKAASQAGLDHLIMRLPQGYDTQVGEGGVRLSGGQRQRIALARALLMKPKILLLDEPTSMLDEQARLSFKEEFHGLFAQFTVIMISHDPTLSDVADVVYQLENGTLQRQSIRSDYQE >NZ_AP021859.1|WP_073320074.1|3846585_3846855_-|PqqD-family-protein MSAYQLKPELLLQKVADEMVLLEPESGEYFTLNNVGADMLEQLQQGKSAQQIAQYIADIYDVTAEQAEQDFQVLMHDLVQANLAEAGVA >NZ_AP021859.1|WP_155015723.1|3846131_3846593_-|lasso-peptide-biosynthesis-B2-protein MLKSFSKYRALAPDQRRWFRRCWWQFALWHIRIQYFPYHWWKARIFSELNSESGHSLPFSLSEAIRLSEMAARHHIFPINCLRRCVVQQQLLAQYGYDLALHFGVAKQDARLKAHCWLTHNGQLINDGLEVVNTYTELKLAAEQSQHILASLR >NZ_AP021859.1|WP_155015722.1|3844323_3846102_+|lasso-peptide-isopeptide-bond-forming-cyclase MTALFAMVAADHHAIMDLFDEASRHATLALNWQNEHCLVGNYCYRGNARTQLLSQFSLPQGNIIANASAPLSNSELSEQFSTQPAAISNAITGPHTVFTWHNPHKLLFASRDPLNQHALYYGKVGGVTVISSEASFIAKLMPKQPSLNTTALSCWLAGQPNPALCLYNEINTLPLGTSLSVSPQGNVTEHTFWDIDPQNKLAPTSDGAYRETFLDLLKQCVSSHIHPSDSLVVSQMSGGMDSTSITALANELLTEPRSCRALSHLYSHSASCDESDNIKAMYQKLGLVDPIQITVDAGAHRDFMSLYPTDYDSPGTVLSPRYHQECEIIQAAGGHRLLTGNGGDEMCWGHASAYTERLFKGEFGVIAEVLKACKQTGMARWPVARSLFVKPMIPQWLLNSAYALKGYKPSDIPAWLTPEAAKLATDASKIPNPFNERKQPVGYARYQALKTTSTYNSVRSYQKVGWQYGIDVAHPFFDPRMAEFSFAVPGKQLIRGPYPKWLLRNAMQNHLPESVCWNVKKVTFDNHFGQLVKDNAKPLRELLSDTRLASLGLVDNDVLLNAFDAAVGGNGVSVHVDLLYAILTQRWIQQHH >NZ_AP021859.1|WP_155014205.1|3841742_3842780_+|IS110-family-transposase 
MYKDNVIAIDIAKSVFQVCVFDKHNQIKSNQEIRRQKLMAWLAKQPASIVALEGCGSSHYWAKVAEKLGHTPLQIPTRFVKKFVEGQKTDKNDAIAIGIAARQPNLKPVAVKSDEQLALQACEKMRKHYQDVAISTSNMMRSILYEFGIVIPQGESALKSKLPDILEDAENGLPMMLRQPLHQQFQLWLSLKERINEATKYLRVQLRTHTICNELQKLDGIGPVNALNLYLALGTKGESFKNGREAAACIGLTPKQHSSGGKVVMLGISKHIAKKQLRANLIQGALAKIKVVAKRPPKNTREVWMKQLIERRGLRRAAVALANKMVRVAWAMVHHQQPYKSPQAI >NZ_AP021859.1|WP_155015727.1|3854690_3854987_-|GIY-YIG-nuclease-family-protein MSERYPAVYILSNFTRTVLYVGVTSNLPQRVYQHKMSMASGFCSRYNVKDLVYYEMHEEMYAAITREKQLKRWRRSWKEKLITQKNPQWLDLYPLIVG >NZ_AP021859.1|WP_073320095.1|3855129_3856398_-|inorganic-phosphate-transporter MDFLQSYGMILIILAAAVGFVMAWGIGANDVANAMGTSVGSKALTIKQAIIIAMIFEFAGAYLAGGEVTSTIRKGIIDTAYFVDIPEYLVLGMISSLFAAGLWLAVASYLGWPVSTTHSIVGAIIGFTAVGVSMDAVEWSKVGGIVGSWIVTPAISGVIAYLIFMSAHKLIFETDKPFHYARKYVPFYMAFAGFVMSLVTIKKGLKHVGLDLSPTTGYVLSVVLAVIIAFIGKWLISRQAYSHSEDADLQRANVEKVFALLMVVTACCMAFAHGSNDVANAIGPLAAVVSVVSNGGEIGSSSSLAPWILPLGGLGIVAGLALFGHRVIATIGEGITHLTPSRGFAAEMAAACTVVIASGTGLPISTTQTLVGAVLGVGLARGVSALNLGIIRNIVISWVVTLPAGAILSILCFFTLKAIFGV >NZ_AP021859.1|WP_073320098.1|3856714_3857431_-|phosphate-signaling-complex-protein-PhoU MHQVALNTHISDRFNLELENLRNSVLTMGGEVEQQLIDTLKAISTNNPGLAEKVILNDLKVNSMEMQIDEECVRIIAKRHPTASDLRLIMTISKAITDIERMGDEIERIAKLVTKQKIPASESIKSSMLQIGQQVTAMMRGTFDAFARQDERAALHVYDQDNRIDSEYKKLLTFTTGEMSRSGEDMEDWLEILWALRSLERIGDRCKNICEYIVSLTSGKDVRHTPLESLQQKLDDLT >NZ_AP021859.1|WP_073320101.1|3857445_3858252_-|phosphate-ABC-transporter-ATP-binding-protein MLKLFERERLDLEGLSPEQTAIEVRDLNLRFGQKHVLHDINMRIPKHRITALIGQSGCGKSTLIACFNRMNDLILNSQTSGEIVIEGRNINHKKENLSLLRSQVGMVFQRPNPFPMSIYDNVCYGLRLQGIKQRRQLDDAVERALHEAALWEEVKDRLFDSAMTLSGGQQQRLVIARALALKPSILLLDEPTSALDPLTTLFIEELMGELKKRCTIVIVTHNMQQAARVSDYTAFLHQGELVEYSDSDTLFTMPDKKQTEDYITGRYG >NZ_AP021859.1|WP_155015728.1|3858257_3859847_-|phosphate-ABC-transporter-permease-PstA 
MGKWSVRQFLSNRQQQSFVISLGAFCASLLLVALVCVLALIALRGSDYFWPRPVHSLTYVDQAGKEHKVYGQVGVGHASSQYSAGTQRLWLIRYSDTRYPYGNQLILETPAIQNLAVAKDAADMLLADGTRVFAKPVSVDMPENQAQPLSSLAQAQARVDLLQQDVDQIRTQHLAPIHRRLAELDIRAVAEDAPARERLSAEFREWQTKVLEREAQIAEFRLNVQFSDGAPFSVALNELDQLTYTGQLTTWGKLGVAADGVWTFLSESPKQANTAGGVFPALFGTVLMIFIMTILVTPFGVMAAIYLNEYAPDNSMTAVIRICVSNMAGVPSIVYGVFGLGFFVYMVGGQIDELFFSDRLPAPTMGTPGVFWAALTMAILTLPVVIVATEEGLRRVPDRLKAGSYALGATKLETIWHTILPIASPGIMTGVILAIARAAGEVAPLMLVGAVKFAPNLPFDGEFPYLHLDRQFMHLGVLIYDGAFHSQTDMRSASMMFASCLLLLLVVFVLNILAVILRKRLRQRYLRGY >NZ_AP021859.1|WP_155015729.1|3859839_3861987_-|ABC-transporter-permease-subunit MNVAAEQGSVLKKRRRRDTAARVIISGFGGIVLLTMVILIWHLFSQAASIAMSPDADIQTEIPVLPSGRYLYVGDMDSGQAAIIDGPGCRLTLARLEADTLTARQSIRRPCSHTLTTLQVQGQPYAVDISTSGQVRLLPVPTVGTAQTLGSFGTPLASELSFAIPEAVWAEHTDWTLGVGEQWLIMVVNTAQSQLIQWVNRQDPANIMRHTLPSSHPVALLPDSKQVVQIRDRELRFYNEKHQQINQISLTKAVDKVFTFVKNRSLFVTHPDSTVSRYTVFNDKGTLRYQRTYVLALRKQEQPVAIYPHASVNGLAMVTNQQQLLLINRVTGEIVERRGLPIQPTGVSWFDNRAYVFSDAVMIKLRIQHLAGLSTADSLLTPQIYEGYKEADQLWQTTSATDYQETKMNLVPLLIGSFKASGLALLIAIPLALGAAVYTAYFARPKVRDNMKPAIEMLEAIPSVLIGFIAAIWLAPLAERFLFSFAVFLFTVPFSLLLIAFVQHKVARNLPSEVRNVAELILPVLGIVGLGYISIEWAPQLLFYLLEVNDFDFITDATGVPVGKTTILVAIALGFAISPSIYSLAEDAISGVPASLRQASYALGATRLQTLRRVVLRVAFPGIMAAIMLGFGRAFGETMIVLMVTGNTPIADWDLFAGLRALTANLAIELPEAELDSMHYKVLFLTACVLFTFTFVVNTLAELLRQRLRRNASYG >NZ_AP021859.1|WP_073320107.1|3862127_3862637_+|glycine-cleavage-system-protein-R MKPVIITVIGKDRPGLVDAVAKKVYQFGGNWQGSSFAHMAGQFAGFVEVLVPAEQHQALIDALNTLDGLQVQSQSVTDTLEQPDEMLRIEVMGNDRAGIVQELTNVLHGFNLNILHFASTCESAPNWGSQMFKAQLRVGVSADLDRDDLQEALEAVANDLVVDITTTLS >NZ_AP021859.1|WP_155015730.1|3862761_3864834_+|polyphosphate-kinase-1 
MESTDLYYPKELSWLAFNERVLQEAADKNNPAVERIRFLGIYSNNLDEFFRVRVSDVKRQIIIAQNDGNELEAQHQRKLLEQIQQKVMALSKKFDTIHKDVVKALARYNIYILQKHELTDYQREWVRNYFVNKVLRHIAPILIDKKTDLLSRLNGNAVYLYVALRREGRSPRFAAVQVPTGEVPRFFLIPPQRSRKNKHIILLDDMIQLSMEDIFRGFVKFDTLESYSFKMTRDAEYSINDEIDESYVEKMSESMKQRLIAEPVRVIHDQDMPEDMVEDLQKRLKVTKLDTLHSAGHYRNFKDFIGFPNPGREYLEHPPLPAIDTKDFSAYNTVFDAISDHDILLYYPYHRFLHFTEFVRQAAFDPSVKSIRINIYRVASHSRIISSLIDAVDNGKKVTVIVELRARFDEEANIEWSKRMTDAGIRVVLGVPSLKIHSKLCIVSREERGKLMHYAHFGTGNFNEKTAKIYTDYSLFTKNQELAEEGNAVFDLISNPYRRYKFQHLQISPLNARTKIQSLIRQEIQYLKEGHKAGITFKINNLVDNELIDDLYRASQAGVKIRGIVRGMCSLRPGIKGLSENIKIISVVDRFLEHPRVMIFNGGGNRKVYISSADWMMRNMDNRIEVGAPVYDDALQQRIVDIMEIQFRDTMKAREIDKEQVNQYVRRGNRKKLRSQEEIYDYLKQLEEKK >NZ_AP021859.1|WP_073320113.1|3864830_3866363_+|exopolyphosphatase MSPDPSSFASVESREASKVAALDIGSNSFHLVVARIVAGSVQILHRVKQKVRLAEGLNEDNVLQGEAKQRGLDTLKIIADSLKGFEPDSVRIVATHTLRRAVNAKEFIEEALQVLPYPIEVISGTEEARLIYSGVAHTNHDAGKRLVVDIGGGSTEFIIGEGLSPLLLRSLQMGCVSYTQRFFANGELKAKAFDKAITAAEQEMEPIEERYRRLGWQRCIGTSGTIKAIYNLVQQTQKEGEHDVPVTLKALKSLMKQFIEAGHIDKLNFPEMTEDRRPVIPAGLCVLIGLFKALKIEALEYSPAALREGVLYQMEDELHNSDIRSRTASSLATRYDVDIDQANLVLNTTLTLFNHCQKAWKLRHPDYRAILGWAALLHEVGFQINTRGVQRHSAYILQNVDMPGFNQAQQELLATLVRFHRKKIRIADIPSFTQYDKEDVYRLIVMLRLGVLLNIKRQESFLPEFEVDVEKDKLSVTFPAEWLPSKPIMTADLEREKGYQSAVGIELDVS >NZ_AP021859.1|WP_155015731.1|3866441_3867596_-|diguanylate-cyclase MMDYDYVVAQTLLLGIVALIGVVFTFSLPVPNHKTKASSGVFARLFLAACWLVEVFQTVRFSGYEDIGRVGFYVMSLSAAYMLMMTIVKRYGHSLNRQQITLVLLHLVGVALCSLLLQAGYLPQWTANTVILLSVAFPVWQAIRRVKYYLATNSLGDKVLYAVLSTVFYTLLAILPIYLIFFDASIVHHHSLTFAILLVFMLVFMLSFAVSVLHSLVNRLHTQVHTDPLTGAKNRHFFYEIAPKLSAHALRNNEILSVVACDIDHFKAINDKHGHVVGDIALKRFCKIIQDELRAEDTLIRMGGEEFLVLSPHCDRNQATELAERLRKVISETEIEAKGVNLMLTASFGVIEMTHNSEFFSSVKEADQALYNAKAAGRNQVITV |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021859_5 | 4714383-4714471 | Orphan |
NA
Consensus repeat of NZ_AP021859_5
|
1 spacers
spacers of NZ_AP021859_5
>5.1|4714411|33|NZ_AP021859|CRISPRCasFinder AGACGGTGAGGTTGTTTTGAAATCGCGGCCTTG |
CRISPR arrays and Neighbor proteins around NZ_AP021859_5
The CRISPR arrays of NZ_AP021859_5 >merge|NZ_AP021859|5|4714383-4714471|CRISPRCasFinder TTGAATAGATCCCGGGACAAGCCCGGGAAGACGGTGAGGTTGTTTTGAAATCGCGGCCTTGTTGAATAGATCCCGGGGCAAGCCCGGGA >NZ_AP021859|5|3|4714383-4714471|CRISPRCasFinder TTGAATAGATCCCGGGACAAGCCCGGGA AGACGGTGAGGTTGTTTTGAAATCGCGGCCTTG TTGAATAGATCCCGGGGCAAGCCCGGGA
>NZ_AP021859.1|WP_155016257.1|4713304_4713913_+|TetR-family-transcriptional-regulator MRRTKEEAEQTRAAILDAAVDVFSSQGVARATLEQIAKSANVTRGAVYWHFKNKTDIFMALYDELHKPFIQELVDGLEKSYDDPLRQLEKVCCDLTVRLEEDPHLQRVLSLFLLKCDYSGSLQICQEKTRLAKEEKQATLEKFFAKAQAQGTLSADLDPKTLTMALNCFFRGIVVEYLENPDEFSLKEMAPKLFGVFFGKWR >NZ_AP021859.1|WP_073317402.1|4712044_4713178_-|efflux-RND-transporter-periplasmic-adaptor-subunit MIKKLVLAALSVSVIAVTGCSQESGGQQAAGGQGAPSGVPVNVVTVEQQNVSTTLELPGRVSAFRQSHVRPQVTGVITQRLFEQGTVVEKGQQLYQIDDLQYKAALNSAKADVASANANVKTLKAKAARYKDLMKVNAISGQEYDDVVAQLDQAMAAVSVAEAQVALAEVNMDYTKVYAPISGRISRSFYTEGALVTANQTDPLATITQLDPVYVDVQVSSEQALGLQMALRDKGSLTVDLTIPGSHQSLEGLKGTVEFSEVIVNESTGSVTIRARFPNPDNILLPGLYVRATIHLSDTAALVVPQRATIRQPDGSLSVWVVNGDNPELRSIGVLQAFDGNWQVSNGLSAGEQIIVAGYHKLRPGAKVMPIPLSKDA >NZ_AP021859.1|WP_155016256.1|4708921_4712029_-|efflux-RND-transporter-permease-subunit MARFFIDRPVFAWVLAIITMLAGVMAITSLPIQQYPTVAPPSVTISASYPGASAQTVENAVTQVIEQRLTAIDNLRYFHSSSANGRMTITLTFEPEADPDIAQVQTQNKVQGAVSQLPSSVQQMGVTVTKSNNSFMMAVGFYSEDDSIDQYELGDILLSKFRDPISRVDGVGSVRAFGAQKAMRIWLDPQRLYSYNLTPQDVQSAIAVQNTDVSAGELGGLPAISGQEINATIQAQSRLQTVDDFERILLRVNADGSQVRLRDVARVELGSESYGVISRYKRHPAAGMALSLASGANALDTIDRVKARVEDLKSNLPAGVKVIYPIDNGPFIELSIKSVVQTLLEAIVLVFLVMLLFLQNWRATLIPTIAVPVVLLGTFAVLYAFGFSINVLTMFGLVLAIGLLVDDAIVVVENVERIIHEEGLSPKEATKKSMTQITSALVGIAAVLSTVFIPMAFFSGSAGAIYRQFSITIVSAMVLSVLVAIILSPSLCATFLRAADAEGEAKTGFFGWFNRTFNKGRDRYQGATRFMANRLARFVALYTLLVAGMVVIFMRLPGSFLPNEDQGFLMMMLNTPAGSSAERTLESVAKVEDHFLEKEGDLVDHMFTVTGFSFAGSAQSSALGFIRLKDWSERTEPGTSVEAVAGRAYPALAQVIDASAFAFFPPPIRELGNASGFDMQLLDVAGRGHEALMQARNQLLGAAAQNPKLVGVRPNGLNDVPQYKIDIDSEKATALGVSLSDINSTLQIAWGSSYVNDFIDDGRIKKVYLQADAPHRMMPDDLNKWYLRNSAGDMVPFSAFASSKWTYGSPQLERFDGVSSVNLQGSAAPGISSGEAMQEMEKLVAEMLPEGFEIAWSGLSYEERAAGSQAGFLYAISILVVFLCLAALYESWAVPFSVILIVPLGILGAVVAAYLFNLSNDVYLQVAFLTTIGLAAKNAILIVEFAKVLQEEEGKTVMEAVTMAAKQRFRPILMTSMAFILGVTPLAIANGPGAASQNAIGITVIGGMFAATFLAIFFVPMFYVLISKFSRPKQS >NZ_AP021859.1|WP_155016255.1|4707451_4708603_-|hypothetical-protein 
MLKLSFIIILCAWIWSPTKAIANEVVMSVICFDCNESYAKNLAKEHATPFIECDQNNDYFEFNSTQSCYSQPKRIIVFDGLYKTAYPYRLSHSNQGGPLNTLRLNIDDFQIESGTHSLLTDIANARLDYEERMETFVADTLSRIPNEEFNNLVLSSSVNDQCSDDPGVAAFKRAMSQSDTTALKTWFQIQQNVINDSVFGFLPFDISALSFSYQSPPTPYGSLGITGQFDVDPDASVITVVYTNISSTLRSTVVEGAQIESSAVVYKIGKQENFPSLVEVNVEPDLSRVDGVPLSDIAGGRVADVDQTKPVSKCVYDYIKDVYETEQRMAPVGGGLGGSTVGNRGGSGSGEMGPALTYEGSCVVDTFSGGEHTGTFKINCADL >NZ_AP021859.1|WP_073317384.1|4704917_4706774_-|GNAT-family-N-acetyltransferase MTVALPPRPDWSQLIKSGCRVFVGGNAGVPYALIDDLIANSKAYSDIELVHMLALGDNRWAKEEYRQLFKVNTFFIHGDEVRRAVDEGRADYTPVFLSEMSSLFSDGTLPLDTALVMVSPPDEFGYCSLGVSVDICMSAARHANKVVAQINPQMPRTAGHSYLHISEFAAVIEADQPLQEIEAPPIDSVTERIGQYVAMLVEDGATLQFGVGKIPSATLKYLERHKDLGIHSEMLSDSIMEIIASGAISNRKKTFHPGKVVTSFCIGSRKLYDFVNNNPHIEFYPSSYVNKPTNIAKNDNMIAINSALEVDLTGQVVADSLGFDFYSGIGGQVDFVSGASMSKGGKPIIALPSTAKNETVSRIVPYITEGSGVVTSRGNVHYIVTEYGIASLRGKSIRERALELIRVAHPKFRAKLLAEVRQNYWVPHYQQKYPTDIPELGAIQLHKMVVNGEKFYLRPLNPADERRLQEFFYSHTKETLRLRYNYDPKQMSREKSCNLVSVDQSSDAALCIVRQEGSRITIHAVGRFYYNEHDNTCEAAFVTRETQQGKGMASKLLTTLIDIAQKRNINKMLAFCRADNKPMIAIFEHHGFKRLFSGDPSEVELALPLQEASQEKSA >NZ_AP021859.1|WP_155016254.1|4703988_4704921_-|histone-deacetylase-family-protein MTIKIFRGKDCVHHDVSGEHPEHPDRLYAIDDQLLSSGLDMVCQHADAKPVKRENLALAHDPYYVDSIFQRAPKTGVIWLEQDTGMTPITLSAALYAAGAGCDAVDWVMDGENRQAFCAVRPPGHHAEYDNAMGFCLFNNIAVAARYAVKKYDLSRVAIVDFDVHHGNGTEHIIAGDQRIMMCSSFQHPFYPHSGSPVSASNILCAPLEAGANGEAFRKAVSYWFDALINYQPQLILISAGFDAHAEDHMGQLRLREDDYHWVSQQLRKVADKVCHGRIVSMLEGGYNLSALGRSVVAHIKGLHGDDTSH >NZ_AP021859.1|WP_155016701.1|4703388_4703955_+|hypothetical-protein MLVLPVCFLALPFAANANAGVPMLFLAMPALLMSLLPIIFIESVYCAQRLSLSFGQSLKTVSISNLASTLVGIPVTWLLLVGVQIATSGGRAYGIDSPVEKVLAVTWQAPWLIPYETDLHWMIPAAGLVLLVPFYFASWWSEFWIAKKLNTLLPTSDIKLTVRNANRITYCLLAGWPIASWLANVAMK >NZ_AP021859.1|WP_155016253.1|4702534_4703062_+|thioredoxin MSSRTTTILVGVWATAILVALLIANSNQMQDFDPDASLAQAASQQDFDSAFTGMLQEAGVSNGSIVHLSADSNCFCNDLSKGHQYDITQSLADKGYEFHTLSLSENPAISKLISHFPALAVIDNNGNLRYVGPYATGFGCFTGNDLVDDIARIATTEQYFGATVNTEARGCFCNV 
>NZ_AP021859.1|WP_155016252.1|4701056_4702535_+|chemotaxis-protein MFSWVKEGHQIFRVILIVQLVISVVIGLITGELMIAFWLGIPIIALPLYLSYANPESEISGHAVGIGVQLMTALHIHQAFGLIEIHFEIFVLLAMLAYFRNWRIIATSTATVAVHHILFFFMQAGGSGVFIFEENHITFSILLLHAAFALAEGLTLMYMTKRSHEDGVGGALLESAIADIIRDKESLNLAVKIDKSVPVMRTFDELLDAIRQLVSNAAKLADDVADTSAFMQNATRELSEHAQQSHQEIGSISAASEEIAVTMQDTSERTNAANDITQEAKANTSESRTSVESTKTTISSLRDRLNSAAQTNQELNERCASISDSMRSITAVAEQTNLLALNAAIESARAGEHGRGFAVVADEVRTLAIRSKESADEISTITEQLVASTASSVTQMNQCIELVDEAVSASDRAATHMQGIESKIQAASDNMMEVATSAVEQETASSSIAASTAKIYELATQEARTAAELEQKSQSLATLCQTLQTMVRRFVV >NZ_AP021859.1|WP_155016251.1|4699829_4700801_+|zinc-transporter-ZntB MSNADQAFLWAYDIGADGTIATVDQAAITTPVAPNTYRWVHLQSDESDAEQLLDTLALPSSVADSLMALQTRPRVLPIKEGALIFLRGINANPGADPDDMVSLRLWLTPNLMVTARRQNRRLMSVQDTREMIESGEAPATTAELLVTLLTRIADRIHDKIEDIDEQLAQYETADALNKQDRQQLAMLRRQTAIIRRHLAPQRDALDTLIRLPNLINDSLIFELRDQADRMTRYVEDLDLARERSLVLQDELRNQIADQQGIRMYVLSMITAIFLPLSFLTGVFGMNVAGLPGTEAPDAFTTLMMAMGGIAVVMLIAMLWKRWL >NZ_AP021859.1|WP_155016258.1|4714512_4715037_-|hypothetical-protein MKTLMKFATLVMAMTLAACASQPAYRAAENGGYGYSETKLTDTQYRVYFKGKGSDKTKAMDYAMLRAAEITLDQGYDWFVVANRETMVDREKVSMEPEIGFSKRYTRVTDCGLVTCRTSYYPESTLSTGIYVGGREKSVIESALDIQLGRGTRPDNSASFDARQVKENLSPKDE >NZ_AP021859.1|WP_155016259.1|4715055_4715796_-|hypothetical-protein MKITDEQLSAFLDNELDDEQMALVRDAIAADETLCDRMATLSMVDHVVKRAAEQATTGPVPEHIVARCDSASESNVVSFADRKAEQTAQQPTPNSDTRWLRGMAMAASVALVGLLGWQQLMGDQPGDAGQWQQIAAVLDSQTSGSRYSAGDVTVMPQLSFVHQDGALCRQFTVSGQSRNDAVIACKKDGSWQQRTLVPMTPTNGQAGEYQTATSAHELDKVLDTMIKGAPLNREQEQQAIQSNWQQ >NZ_AP021859.1|WP_073317410.1|4715792_4716314_-|RNA-polymerase-sigma-factor MTKTQSEQLKAMLPVLRRFAYSLTGSMADADDLLQNTVEKLLTKPVPDDVELLAWSYRICRNLWIDEYRANKVRQAAVHNPELQQAEVDATAQITSDITLKQVESAMATLPDDQREVLSLVAVQGLSYQDTANVLSVPSGTVMSRLARARSKLAQILFLEKGTKGPNGNEVTA >NZ_AP021859.1|WP_155016260.1|4716616_4718026_+|S8-family-serine-peptidase 
MKLSLSTKLLSRSLIAAAVVLSPISLVNAQVLPSVTRSVTQPIEDLTRRLPANRVTERLTRPEKPALPELLPALTSTLTADLSNALLPVKQAISVVDSLQQTVLREEITPQGELAIAREWVVYTSEADLAWFEQSPFSITKQRYVALLDSWLVNIQVPDAFNSLNRIKAALPAHLQSQLGRNHVYLTQSNAAEANEESAAVKESAAVKESAATTSSVETKPLCETPARIGMLDSAIEDTHPLLSTLQVREHTFIDASLPLSRAHGTAVAGILQQQLAHNSQVVNAAVFYARTQVSQGASLFDLVSGLDWLASQQVPVINMSLTGPDNPLLAKAITGLSSKGVTLIAAVGNAGPAAPPLFPAAYPDVIGVSAVDAQGNIYRWANQGEQVALSAPGVSVLTARVKGETGPETGTSIASPAVAGWIAQWRTCQSGSDATKTAIPPALRQQLQDKGEPGWDPVFGEGAWLPAK >NZ_AP021859.1|WP_155016261.1|4718043_4718940_-|DUF560-domain-containing-protein MKTTLIFAAITAVSAPAVATVTYKGELQAGAQYDSNVTVTELDRASNQSDYAGYLKGKLSADWQATDALSFNAGVNHQRTQYQDATDFNLAITTWNVGAGLKNRLGKWGVHSYLADAALDGNDFMRYQQSGVSWQNSITAKTYLHISADYLQKRFDTVPERDANGGQLSTQWFYMPDMEGQMINVGYTYQYEDADIDRLDFTGHAGQLSWTYPTQWWGTPTSLKAQYQFAYRDYREAEAFLSVTQRTDRQHTLGVSASYQFTKHVAFDIGAKYADYQSNLSIADYQENQANVGVTAKF >NZ_AP021859.1|WP_155016262.1|4719020_4719842_-|hypothetical-protein MKTFTFTSLILAMGLAHGAAQAQSVESNVTFNQSTAADAQTSASTDPTTASSVTLSFANQSNVQANSNANEQVADESAAQSEEATTEVAESAEGTAEQSTETVSESEETMAETGQQATASADTALEQSTDIVTDLPVAELSQSLQGSLQGSLNVVADAGGSISSTLNGSGDAINSAVNGTLQNTIDAATAAQVADAASVEQVVSSAVSATVAQNVSTSAVTQVSDTVNSTVTNSVTSAVDGAVEGAVEGAVEGAVSSTVNNAVAANISSILGN >NZ_AP021859.1|WP_155016263.1|4719987_4720314_+|TfoX/Sxy-family-protein MSANQFANSLHDVFSTFGPIHLKRMFGGHGVFSQGKMFALVVSDQLYIKVDAGMKQALEARGYSPFTYVRQGKTIALSYMEAPEEIYDDPDDACNWAHQAYTAALAGK >NZ_AP021859.1|WP_155016264.1|4720331_4720784_-|hypothetical-protein MATFYNKQHGEFSLIMNQDVVLTNAVGPWNLECIEQFGIDYATSVYSAKVSRWADIIMLQGESLLVPDAEKDLQVRIARAVETGLSHVAVVMCKSEVKTTAKLQMKRLYRNLPAELAFFETVQEAIGWITEHGYRCEAPVIEAFFDDAPK >NZ_AP021859.1|WP_155016265.1|4720919_4721483_-|CBS-domain-containing-protein MHHLTLCQTEAIDTLAHPEVYDHVELASSALSIFTDFHEHQPLVIDGNVKAIELERLMRQSHVKMKLVLDKHDQFVGIVTLADITEQKILQRVVQLGLPRSELLVVDMMQPKAALQAFDYHELKVASVKDVVDTLQDNGKMHCLVIDKQQHEIRGVISVSDIARILRVPLDIQSQPSFAALSHIIAA >NZ_AP021859.1|WP_155016266.1|4721538_4721847_-|hypothetical-protein 
MKAWLLILTLIALAGLPGSGAAVQSGLHGQWSSVDSNLTEAESAEVEQGIESDQQGDGPDVISSANGIRFAPQASTVPGQLQTQLSGSFYSGYAIRAPPVFS |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_AP021859_2 | 2.1|1657924|22|NZ_AP021859|PILER-CR | 1657924-1657945 | 22 | NZ_AP021859.1 | 1658091-1658112 | 2 | 0.909 |
1. spacer 2.1|1657924|22|NZ_AP021859|PILER-CR matches to position: 1658091-1658112, mismatch: 2, identity: 0.909
gtttagaaataatgagagggga CRISPR spacer
gtttagaaaaaataagagggga Protospacer
********* ***.********
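
The mismatch count and identity reported for spacer 2.1 can be reproduced with a position-by-position comparison. The sketch below is illustrative only: it assumes an ungapped alignment of two equal-length sequences and identity defined as matching positions divided by spacer length, rounded to three decimals. The database does not document its exact formula, and the function name `compare_spacer` is hypothetical.

```python
def compare_spacer(spacer, protospacer):
    """Return (mismatches, identity) for an ungapped, equal-length comparison.

    Identity is assumed to be matches / length, rounded to 3 decimals.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length for an ungapped comparison")
    # Count positions where the two sequences differ.
    mismatches = sum(a != b for a, b in zip(spacer, protospacer))
    identity = round((len(spacer) - mismatches) / len(spacer), 3)
    return mismatches, identity


if __name__ == "__main__":
    # Sequences copied from the self-targeting hit listed above.
    print(compare_spacer("gtttagaaataatgagagggga", "gtttagaaaaaataagagggga"))
```

For the 22-nt spacer above, two mismatches give 20/22, which rounds to the 0.909 shown in the table.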
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
888747 : 896718
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021859|888747:896718|DBSCAN-SWA AATGAAGACTGTATTTGTCGCTGGTCATAGAGGAATGGTTGGTGCAGCAATTGTTAGAAATCTTGAAAAACGTGGTGGGGTTTCGATTATTACCCGTGCCCGAAATGAGTTGGATTTAACAAATCAACAAGCTGTATCAACGTTTTTTGCAGAGAATAAGATAGACGAAGTGTACTTAGCTGCCGCAAAAGTAGGCGGCATTCACGCGAATAACACGTATCCAGCTGAATTCATTTATGAAAACCTGATGATTGAAGCCAATATTGTTCATGCTGCACATATGAACAATGTTCAGAAATTGCTGTTTTTGGGCTCTAGCTGTATTTATCCCAAATTGGCTGAACAACCCATGACCGAGAAAGCCTTACTTACTGGCACTCTGGAAGAAACCAATGAGCCGTATGCGATTGCGAAAATCGCCGGTATTAAGCTATGCGAAAGCTATAATCGTCAGTATGGTCGCGATTACCGGTCGGTAATGCCTACCAATTTGTATGGGCCAAACGATAACTTCCACCCTGAAAACTCCCATGTTATTCCAGCCTTGCTTCGCCGTTTTCACGAAGCAGCACTACGCGGTGATGCTGAGGTTGTAGCCTGGGGCAGTGGTAAACCTATGCGCGAATTCTTGCATGTTGATGATATGGCGGCAGCCAGCGTACACGTAATGGAGCTGGATAAAGCTACTTATGATGACAATACACAGCCTATGCTAAGCCATATTAACGTGGGCACAGGTGAAGACTGCACTATCAAACAATTGGTGGAAACCGTTGCCAAGGTTACCGGATTTAAAGGTGACATTGTTTGGGATACAACTAAGCCAGACGGCGCACCCAGAAAGCTAATGGATGTATCTCGATTGCACGCACTGGGTTGGAAGCATACTTACAATCTGGAGAAAGGCCTTAGCAATGCCTATGAATGGTTTATTGCAAACCAGGATAATTTCAGAGGTTAAAGTGGTTATAGATGCGGCACAATAGTGAAGAATTTCAATGAAATTAGTTAGTAAAATTAAGAACAAAATTGACGCATTGGGTCTTGAGAAGCACTACGCAGACTTTAGTATTGACTTACTAAGTGCAGAAGAGTTTAGAGGCTGGGCAAGAAAAGCTGGGGATATATCAAACACTTCCTGCTATGTTAAATTGTACTCTGGTGACAATGTGATCGCCGAGGGAAAAGCTAATCAATACCGTGATGATCTGCATGATTTGGGCTTTGGTAATGGTTGCAAAGGCTTCAACTTAAAAGTCAACTGGCGTGCACTGGATGCTGGTGAAAATAAATTGTCGTTGTTTATAGATGAACACAAAGTTAAGGTTATTCGTCTACCTGTAACGATTGCCGAGTTTGTTAGTTTGGCAATACAAGAGCAAAATAGACGCTAGATGCATGTAGTAGTTATTGATCCTGGATTTGAAGACCTTCATTCTCATCATGCAAATGTGAATGAAGGTTTGCACTTGTATTTCAATAATCAAAAAGCAACTTCGCTATGTACGTTGGCATCCACGCGAATAAAACTCACTACAGACGAATCTCTGATTCCAAAACATGTACAGCCACATTTTTCGACCCCTTGTTACACAAATCGACTAAGGCCCTTGGATAGTGAGCAAGAGCAGCTTTTAGCAGAGCAGTTTGCCAGGGAGCTTACATCCGCATCAGATAAAAATATCATTCAAACTTCAAGTTTCCTCGTATTTCATACCTTTTATAGTTTCCATGTGTTAGGTCTAGCAATTTGGTTGCGGCAAATTGGTGATCGTTTTAGTGGCGGCATCATTTTATGTGGCATGTTCTTCCCTGGAACCTCCCGGCTGTCAGCTAAAGTAAACATTGTTGAATTTCATAGATATTTACGATTTAAGTTAGCTGTTACCTATTTAAAAGGTGTTTCCCAGAAACAGTTGCTAGTG
GTTGCGACAAGCTGTTCAGATTATATAGCTTCTTATGAATCGTTGTTTGATTGCGAAGTTGCGCTTCATCCTATTGTTACTTACAGAGATGGAATTCAGCGAAATCCGCGAAATCCAGATATGCCAAAGTCAGTACTCTTATATGCAGGGAGTGTTAAGCAAGATAAAGGGCTTCAATTCATATTAGACGTTACCGAAAAATTACTTACAGATTTCCCTGCAGTTAATTTTGTCTTCCATTTAAATACTTTGTCTCCTGGGATTCGTGATTTCCCCGATGCGGAAATCACGCTAAATAAACTCGCCCAACAATATCGAAATTTAGAGTGTGTCTTTGATTATCTAAATTATGAAGCCTTTCAGAATTTATTGGATCAGGCTGATGCAATCATTTGTCACTACGATCCCGCAGTGTATCGCCTTAAAACCTCAGGATTATTATGGGATTCTATGTCAAGAGAGCACATGGGAGTGATCTGTTCAAACGATAGTTGGCTTGCCAGAGAGCTTTCAGCTGTCGGAGGAAAGCCGTTTACATTTAACCATAACAAGTATGATAGTTTGCAAAATGCCATTACGCGTTGGCTTGATTGCAGCGCCCCCTATATTCAACCAAACCAATATTTTAAGACTCTTCTTAAGAGTTTCCCAGAATGGGTTGAACTGCAATGTTCAAGCTTGCACAAAGAATCTAAGACTGTTTAAATGCGGCGCTAGCCTGAGCGCGAAATGGAAAAATATATGAATAAAGCGATCTTCCACTACCATTTATTTAAAAACGCAGGGACATCCTTGGATGCTAGTTTAAAAGAAAACTTTGATGGTGATAAATGGGTTACTGCTGAGTTTCCAGCGCAACCTGCGAGAAATCGAGATCTTGCACGTCAATGGGTGAGTGAAAATCCTCAAGCATTTTGTTTCTCGTCACACACAGCGTTTTTTCCACCCCCAGAAGTGCCTGGTCGGGTTATTTTGCCAGTAATTTTTGTAAGACATCCAATTGACCGAATTGCTTCGGCGTATGCCTTTGAAAAGAAGCAGGGAGGCAATGGCTTTGGGGCGGTACTTGCCCGTAACACTTCATTCAGCGGTTACTGTGAGTCAAGGTTAGCATTAGGATACGACCGCCAGTGCAAAAATTTTCATGTCGACAGGTTTGCTTCGATGTTCGCTCCTGCTTTTGGTTCAGAGTTAGATAGAGCTTTAAAAGCATTTGAGGTACTACCTTTTGTTGGTTTAGTCGAAGAGTTTGATAAATCTTTGCGTAGGTTAGAAGACTGGTTAAAATCAGAAGGTTTTGACGATATTGCTATAGCGCCCAAAGAGCATAATGTTTCGAGAGATAACAAAGCCACGCTTGAAGAAAAACTTGATGCATTAGAGGAAGAAATTGGCTCTGAAATGTTTGTCAGACTTCTCGCAGAGAATGCTGACGACATGATCTTGTATGAAAAAGTTAAAGCGAGTTATGAGCAGTAAAGTATACTGTGCTACCGACTACCCTGGCAGACACTTCCTTTAACTTTGCTGTCAAAAAGCATTATACTTCCCGCTGTGAAGGATATGTTTATCTTTTCTAATTATTGAATCGTGATGCTGTTGGGAAGTGCGTATGAGAAAAGGAATAATATTAGCAGGTGGATCTGGGACGAGACTACACCCGTTAACCAAAGTCGTTAGTAAGCAACTAATGCCTGTTTACGATAAGCCCATGATATATTACCCGTTATCTACATTAATGATGTCGGGTATTCGAGATATTCTTATCATTACTACACCACAAGAACAAAGCAGATTTGTAGATTTGCTGGGTGATGGCAGGGCATGGGGTTTGAACCTGCAATATGCAGTTCAACCTTCTCCAGATGGCCTGGCCCAAGCATTCCTAATTGGTGAAGAGTTTATTGGCAACAATAGTTGTTCATTGGTCTTAGGTGATAACATCTATTATGGTCACGATCTAAGAGTTTCCTTAAAAAACGCTTACGGTCAGACTAATG
GCGCCACAGTGTTCGGGTACCACGTAACTGATCCTGAACGATATGGGGTTGTTGATTTCGACAATGACTGGAATGCGCTTTCTATAGAAGAAAAGCCTGCGAAACCTAAATCTAATTATGCCGTAACCGGCCTTTACTACTACGACAATCGCGTAGTGGACTTTGCCAAAGAGGTTAAACTTTCTCCTCGTGGCGAATTAGAAATCACGGATCTGAACAATATGTATCTTCAGGACGGCTCACTTAAAGTAGAACTAATGGGCCGCGGTTCTGCGTGGTTAGATACAGGTACACTGGATAGCTTATTAGATGCAGCTAATTTTGTGGCTGCCATTGAAAAACGGCAGGGTCTGAAGATTTGTTGCCCCGAAGAAGTGGCGTATCGCATGGGGTACATCGACGCCGAGCAGTTAGAAAAACTGGCTGCACCGCTTAAGAAGAGCGGCTACGGCGATTACCTGTTAAAAGTAATTCACGACCGGGTTAAATAAATACCGCGTTTTTCCTGCAGCCGGTTTGAATGAATTTTTAGGTGTAACAATACAATGAAAGTAATTGAAACCGACATCCCTGATGTCAAAATTATAGAGCCACGTGTGTTTGGTGACGAACGTGGCTTTTTTATGGAAACCTTTCGTACGGATTGGTTCAAGTCTAAATGCGCAGACGTTGATTTTGTTCAGGATAATCACTCTAAATCCAAGCAAGGTATTTTGCGCGGATTGCATTACCAGCTTCAGCAAACTCAGGGCAAGTTAGTACGTGTTGTATCTGGTGAAGTATTTGACGTTGCCGTAGACATGCGTAAAGAGTCAGAAACATTTGGGAAGTGGGTTGGCGTTTACCTGTCGGCCGAGAACAAGCGTCAGTTATGGGTACCTGCGGGCTTTGCCCATGGCTTTTATGTCACCAGCGAATCGGCTGAGTTTGTGTATAAGTGTACGGATTACTACCACCCAGAATCCGAAGTGTCGGTAAATTATAATGACCCGACCATTGGTATTGAATGGCCGCTGGTGAATGGCGAAGCGCCGTCATTATCGGGCAAAGATGAAAACGGCACCGCATTCATTGATGCGCCAACGTTTTAAGGCCACTAGTTAAAGGAATAATGATGTCACAACTGGTAGTCATTGGGAAAAGCGGTCAGCTCGCATGGGAAATCGCGCGGCTAGTGCCGGATGCAGTGTGTCTAGGCCGTGATGATATTGATATTACAACTGCAGAAAGTATTGCTGATAAGCTCGACACGTTAGCACCTGATGCGGTAATTAATGCCTCTGCTTACACGGCAGTTGATAAGGCCGAAAGTGATGAAGCTAATGCGTATTTGTTAAATCAAACCGCGGTGGCAAACCTGGCCAACTACTGTAAAGCGAACAACGTATTCTTTGTGCACGTGTCTACCGACTATGTGTTTAATGGCGAAAAAGGCTCGCCCTATACGGTTGATGACACCATTGCGCCGCAGGGCATGTACGGCAAAACCAAGGCCGCCGGTGAAGCTGAAGTGAAAAGTATTTTACCTGAGCACAGTGCGATTATTCGCACATCCTGGGTGTATTCGGCGCATGGCAATAACTTTGTAAAAACTATGCTGCGGTTGATGGCAGAAAAGCCTCAATTGGGTGTCATTGATGACCAATTAGGTAGCCCGACCTGGGCGAAGGGGTTAGCGCAAGCCTGTATTGAGGCCGCCACACAACAACACACAGGTGTATTCCATTGGTCTGACGAAGGCGTGTGTAGCTGGTATGACTTTGCTGTTGCGATACAGCAACTGGGTCTAGAGAAAGGGCTGTTAAGTCAGGCCGTACCAATAAACCCTATCCCAAGCAGCGCCTATCCTACGCCAGCCAAACGTCCTCACTACAGTGTGCTGGATAAAACACTGAGCCGTGAAACTTATACAACCCCTCTTATACACTGGCGCGAACAGCTTAGCGCCATGATGGATGAGCTGGTGAAATAGCAAGAGAATTGACTAACATGACAGCAAA
AACAATTATTGTAACTGGCGGCGCGGGCTTTATTGGTTCCGCCGTAGTTCGTCACCTAATTAACGATACCGACCACACGGTGGTGAACCTCGATAAGCTGACTTACGCAGGTAACCTGGAATCACTGAAAGAAATCGACCAAAGCGACCGTTACCATTTTGAGCAGGTAGACATTTGCGACGGACCGGAAGTTAAACGCGTACTGGACACATACCAGCCTGATATTATTATGCATCTTGCAGCAGAGAGTCATGTTGACCGCTCTATCGATGGCCCGGGTGAATTTATCCAGACCAACGTAGTCGGCACCTATACGCTATTAGAGCAAGCACGTAGCTTCTACGCCACACTAAGCGATGATAAAAAAACAGGTTTTAAGTTCCACCATATCTCGACCGACGAGGTCTACGGTGATTTACCTCACCCTGATGAAGTAGAAGCAGGCACAGAGTTGCCTTTATTTACGGAAGAAACATCGTACGAACCAAGCTCGCCGTATTCGGCATCGAAAGCTGCGTCTGACCATTTGGTTCGCGCCTGGTTACGCACGTTCAAACTGCCAACCGTCGTTACCAACTGTTCAAATAACTATGGCCCGTATCACTTCCCGGAAAAACTCATCCCGCTAATGATACTCAATGCCCTGGCGGGTAAACCACTGCCGGTATATGGAAAAGGTAATCAAATCCGTGACTGGCTCTATGTAGAAGACCACGCCCGCGCATTAGTGGTAGTCGCTACAACTGGCGCTATTGGCGAAACCTACAACATTGGCGGTCACAACGAAAAGCAAAACATCGATGTGGTACACACCATCTGCGATATTCTTGACGAAGTGAAACCGAAAGCTGAAGGTAGTTATCGCGACCAAATTACCAGCGTTGCTGATCGCCCCGGACACGATATGCGCTACGCCATCGATGCCAGCAAGATTCAAAAAGAGCTGGGCTGGGTGCCCCAGGAAACCTTCGAATCTGGTATTAAGAAAACTGTAGAGTGGTACCTGAACAACGAAGCCTGGTGGAAAGCCGTGTTAGATGGCAGTTACCAGGGTGAACGACTGGGTCAAAATCAGTAACGATAACTGAAAAGGTCTTAACGATGAGTACGTATCTAGTAAGAAGGCATAAATAGGATGAAAGTACTGATCGTTGGCGGTTTTGGTTTCATCGGAAAGCATCTCATCGAGAATGCAATAACCAAGAATATTAGTTTTACGGTAATAGCCCGAAAGGATCCCGATGTTCAGTGGGTAAAGTACGCATATATTCTGGAAAATTCACTTGATGATAAAGCGTTGCAAAATCTGGCCGCTGAACATGATGCGCTAGTATATCTGGCTTCGTCTTCCATTCCTGCCACAGGGTCCTTCCTGAAAGAGTTTCCTACGAATGTAGAGCCCGCCGTTCAGTTAGTTGAACGGCTTACCTATCATAACCCGGAGCTAAAAGTAATATACCTGTCCAGCGGTGGCCAGGTATATGGAAATGGTTATAAGCGGCCGATGAAGGAGTCTGACAATTGCGAGCCAATGAGCCCCTATGGATTTGGTAAGTTGATGACCGAGCAGTCATTGAGCTACTTACATAGATCACGAGGGACCAAAATAGCCATTCTTCGAGTAGCTAATCCAGTTGGTAAGTGGCAGGTAGGGTTAAGACAGGGCTTGGTTAATGTTGTTTATCAAAGTCTGATGCTTAAAGAGCCGCTAAAGATATTTGGTACAGGCGCTGAATTAAGAGACTACATTGATGTGGATGAGCTTGCACTGCTTATTATTAAAGTTGCTTCAATGGATTTTGATTTAGAAACCTGGAACGTTGGCTCTGGCGTTGGAACGGCAACTATCCAGTTAGTTGAAAAGATCGAGAGTTTTCTTGGTATCGAAGGAGAAAAGGTCTTCCTTCCCAGAAGGCCTGTTGATCCTGAATCGGCGGTTTTGAACTGCGAAAAGATAGAAAAACAACTAGGCTGGAAGGCAAGAATGTCAATTGATGACG
TGTTGGAAAAGACACTCAACAGCAAATTGAAAACGGCCTCATTTTAA
Protein sequences of DBSCAN-SWA_1 >NZ_AP021859|888747:896718|892283_893162_+|WP_155013757.1|DBSCAN-SWA MRKGIILAGGSGTRLHPLTKVVSKQLMPVYDKPMIYYPLSTLMMSGIRDILIITTPQEQSRFVDLLGDGRAWGLNLQYAVQPSPDGLAQAFLIGEEFIGNNSCSLVLGDNIYYGHDLRVSLKNAYGQTNGATVFGYHVTDPERYGVVDFDNDWNALSIEEKPAKPKSNYAVTGLYYYDNRVVDFAKEVKLSPRGELEITDLNNMYLQDGSLKVELMGRGSAWLDTGTLDSLLDAANFVAAIEKRQGLKICCPEEVAYRMGYIDAEQLEKLAAPLKKSGYGDYLLKVIHDRVK >NZ_AP021859|888747:896718|893216_893762_+|WP_155013758.1|DBSCAN-SWA MKVIETDIPDVKIIEPRVFGDERGFFMETFRTDWFKSKCADVDFVQDNHSKSKQGILRGLHYQLQQTQGKLVRVVSGEVFDVAVDMRKESETFGKWVGVYLSAENKRQLWVPAGFAHGFYVTSESAEFVYKCTDYYHPESEVSVNYNDPTIGIEWPLVNGEAPSLSGKDENGTAFIDAPTF >NZ_AP021859|888747:896718|891412_892150_+|WP_155013756.1|DBSCAN-SWA MNKAIFHYHLFKNAGTSLDASLKENFDGDKWVTAEFPAQPARNRDLARQWVSENPQAFCFSSHTAFFPPPEVPGRVILPVIFVRHPIDRIASAYAFEKKQGGNGFGAVLARNTSFSGYCESRLALGYDRQCKNFHVDRFASMFAPAFGSELDRALKAFEVLPFVGLVEEFDKSLRRLEDWLKSEGFDDIAIAPKEHNVSRDNKATLEEKLDALEEEIGSEMFVRLLAENADDMILYEKVKASYEQ >NZ_AP021859|888747:896718|890140_891376_+|WP_155013755.1|DBSCAN-SWA MHVVVIDPGFEDLHSHHANVNEGLHLYFNNQKATSLCTLASTRIKLTTDESLIPKHVQPHFSTPCYTNRLRPLDSEQEQLLAEQFARELTSASDKNIIQTSSFLVFHTFYSFHVLGLAIWLRQIGDRFSGGIILCGMFFPGTSRLSAKVNIVEFHRYLRFKLAVTYLKGVSQKQLLVVATSCSDYIASYESLFDCEVALHPIVTYRDGIQRNPRNPDMPKSVLLYAGSVKQDKGLQFILDVTEKLLTDFPAVNFVFHLNTLSPGIRDFPDAEITLNKLAQQYRNLECVFDYLNYEAFQNLLDQADAIICHYDPAVYRLKTSGLLWDSMSREHMGVICSNDSWLARELSAVGGKPFTFNHNKYDSLQNAITRWLDCSAPYIQPNQYFKTLLKSFPEWVELQCSSLHKESKTV >NZ_AP021859|888747:896718|893785_894643_+|WP_155013759.1|DBSCAN-SWA MSQLVVIGKSGQLAWEIARLVPDAVCLGRDDIDITTAESIADKLDTLAPDAVINASAYTAVDKAESDEANAYLLNQTAVANLANYCKANNVFFVHVSTDYVFNGEKGSPYTVDDTIAPQGMYGKTKAAGEAEVKSILPEHSAIIRTSWVYSAHGNNFVKTMLRLMAEKPQLGVIDDQLGSPTWAKGLAQACIEAATQQHTGVFHWSDEGVCSWYDFAVAIQQLGLEKGLLSQAVPINPIPSSAYPTPAKRPHYSVLDKTLSRETYTTPLIHWREQLSAMMDELVK >NZ_AP021859|888747:896718|888747_889707_+|WP_155013753.1|DBSCAN-SWA 
MKTVFVAGHRGMVGAAIVRNLEKRGGVSIITRARNELDLTNQQAVSTFFAENKIDEVYLAAAKVGGIHANNTYPAEFIYENLMIEANIVHAAHMNNVQKLLFLGSSCIYPKLAEQPMTEKALLTGTLEETNEPYAIAKIAGIKLCESYNRQYGRDYRSVMPTNLYGPNDNFHPENSHVIPALLRRFHEAALRGDAEVVAWGSGKPMREFLHVDDMAAASVHVMELDKATYDDNTQPMLSHINVGTGEDCTIKQLVETVAKVTGFKGDIVWDTTKPDGAPRKLMDVSRLHALGWKHTYNLEKGLSNAYEWFIANQDNFRG >NZ_AP021859|888747:896718|889744_890140_+|WP_155013754.1|DBSCAN-SWA MKLVSKIKNKIDALGLEKHYADFSIDLLSAEEFRGWARKAGDISNTSCYVKLYSGDNVIAEGKANQYRDDLHDLGFGNGCKGFNLKVNWRALDAGENKLSLFIDEHKVKVIRLPVTIAEFVSLAIQEQNRR >NZ_AP021859|888747:896718|895806_896718_+|WP_155013761.1|DBSCAN-SWA MKVLIVGGFGFIGKHLIENAITKNISFTVIARKDPDVQWVKYAYILENSLDDKALQNLAAEHDALVYLASSSIPATGSFLKEFPTNVEPAVQLVERLTYHNPELKVIYLSSGGQVYGNGYKRPMKESDNCEPMSPYGFGKLMTEQSLSYLHRSRGTKIAILRVANPVGKWQVGLRQGLVNVVYQSLMLKEPLKIFGTGAELRDYIDVDELALLIIKVASMDFDLETWNVGSGVGTATIQLVEKIESFLGIEGEKVFLPRRPVDPESAVLNCEKIEKQLGWKARMSIDDVLEKTLNSKLKTASF >NZ_AP021859|888747:896718|894660_895749_+|WP_155013760.1|DBSCAN-SWA MTAKTIIVTGGAGFIGSAVVRHLINDTDHTVVNLDKLTYAGNLESLKEIDQSDRYHFEQVDICDGPEVKRVLDTYQPDIIMHLAAESHVDRSIDGPGEFIQTNVVGTYTLLEQARSFYATLSDDKKTGFKFHHISTDEVYGDLPHPDEVEAGTELPLFTEETSYEPSSPYSASKAASDHLVRAWLRTFKLPTVVTNCSNNYGPYHFPEKLIPLMILNALAGKPLPVYGKGNQIRDWLYVEDHARALVVVATTGAIGETYNIGGHNEKQNIDVVHTICDILDEVKPKAEGSYRDQITSVADRPGHDMRYAIDASKIQKELGWVPQETFESGIKKTVEWYLNNEAWWKAVLDGSYQGERLGQNQ |
9 | Enterobacteria_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
2866728 : 2892555
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021859|2866728:2892555|DBSCAN-SWA CATGAATAGCATATCACTGATTGACCGGCCTGAACGACGCCGTTTAGAAAAAATCGTTCAACGTAGTAAAGATAAGCGCTTGAGTCGTAGAGCAAATGCAGTGCTTTTGGTTCACAAAGGCCAATCTCGTAAGCATGTGGCGGCACTGCTGAGTGCAGCGCGTTCATCGGTTAATCGCTGGTGTAAGTGGTATGAAGAATCAGGTATAGAAGGTTTAAAAGATCTGGAGCAAGGCAAACCCGCTTATTTACCCGGGGGTGCAATAGTCCAGATACTGTTCTTGTTGATTCCATGGACACCGCAAGAGCTAGGTTATCAGCGCAGTCGCTGGAGTTCTGAATTACTGGCGAAGGTAATTCATGAAAATACAGGCATTCGTATACACAGTTCGACATTACGACGATGGATGCCAGAACTAGGCATTGTTTGGCGAAGAGCTGCACCGACTTTACGCATACGAGATCCGCATAAGGAAGAGAAACTAGCCGCGATTAAAGCGGCACTGGATAAATGTTCGGCTGATCACCCTGTATTTTACGAGGATGAAGTGGATATTCATCTGAATCCGAAAATAGGGGCAGATTGGGGATTTAGAGGAAAACAGCGGTTGGTTCCGACACCAGGACAAAATGAAAAATACTATCTGGCTGGAGCGTTAAACGCCAAAACGGGCAAAGTGCTTTATGTTGGCCGACAGTCAAAATGTTCAGAGATATTTATCCGGCTAATGGAACACCTAAGAAAAACGTACCGGAAAGCAAAAACTATTACGTTGATAGTGGATAACTACATCATCCACAAAAGCAAAAAAACGCAGGCATGGCTGAAGCAGAATCCTAAATTTACCCTGCTATTCCAGCCAGTTTACAGCCCATGGGTGAACAAGATTGAAAAGCTATGGCATGCTTTGCATGAAACCATAACTCGCAATCATAAATGCACAGAAATGTGGCAGTTGTTGCAAAAAGTACGGCGATTTATGGAAACCGCATCACCATTCCCAGGCAATAAGCATGGGCTGGCCAAGGTGTAGCAGAGTTAGGATCAGTTATTTAGCTGTTTACAACAAACACTGCGGCAACGGCAAAAAACCAACAAAGCGACAGAACCAGCACCATTTTGAGCATGCTTGAACTTGCCTTCATCCGAAACAAAGTGAAGCCTGAATAGGCTGCTGGAAGCAATGCGCCAATCCCGCCCAGCGTGAGCCCTTTGACGAGACCAACAACGGGGGCCCATTCGCTTTAAATAAAAACCATTTCAGTAATGAATGCAAAAACACCCAACGCGATACAGCCTAACAACAAATAAAATGCCCTGTATTCGCACTTTTTATATTCTGTACTCGTCATTCTCGGCCTCCTGCCCTAACCGTCACGCATTGGACTTCAGAATGAAACATACCAATCGCTTGCGTTACTGCCCGTGAACAACAGGCACTATCCAGTCAGCCCATGCCTGCCATTTAAATACAGTGTGCATTGCAAGTGCTAACAAGCAAAACAAAGCAAACGACAACAACGTGCTGCCATGCAACCGCTTGGTTGTTTTCCAGTCGTACATCATTAAGGTCGCATAAACCATAAACATAATTACAAACATGGAATACCCGGGCAACGAGAGTAAATGAACTGCTCGATTAAGGCCTTCGGGCATCATTTGAATAGCCGCTGCAATCATAAACCGCTTGTGGTGGGCAGGCTGTTTGACAGATAACACACCCAACACATAAAGCACGGCGAATAGCGTCCATTGAAGAAAAGCGCCACCAAGAAAGCTGCCAGCCCTTAGCCTGGCCTGAGCGTCGTCGACGCCAAACACGCCAAAACCCGCCATCCGGTGATAAAATTCAATGGCCATAAGCGGACCGGTGATCAATATTCCAATAACCAAAATCACGCTCAGTGTGCCGATGATTTTATG
ACTGGCGTACTTGCCGGCAGCAGATAAGCGGGTTTGATTGAGCAGTAATACATACCAGGCCGCACTCATTGCGCCGTGTAGACCTGTCCATAGCGATATTCTGGCAATGTTGTTGGGGTTATTTGTCCAGTTAAGCGCAAAACCGCCGAAAACCACAACGGTGATGGCAAGGGCAAAGACGCCAAAAAAGCGGTTGTCCAAGGGTGATACCGCCGCTGATTTACTACTCACAATACTAGACTCCATGTCAAAAACGTCCTGTTCGCAGGCATTTTAGATATTTTGGCTATTCATAATGCACTGATATTAATACATAAACCAGTTGATTTTGTGACGGAAACTAAGCAGAACAGGCTCGTAATCGAGTAGAACCTTGTTTAACAACAGTCATCACGCATAGAGGGTTGGAATAATTGGTCCAGTCCCCCTGGCATATAGAGTCTGCTGAATACTGTTGAAGCCATCCAGCCTGTAAGTCTTCGGAAGGTAGCTCGGATGTTGAACCTCTGGCGATTATTCAGCACCAGGGCCAATACTGTCAGATACACCACCGCCAAAGGTTTTGTCTTGTGTAACTTGTATATCCTGGCTCCTGTTGAGCAGCCACACGCCTATTGCTGCCAGAAACGGAACCAGCCAAACCAATAACATTTGTGCGCCTTTCTGGAAAGTATCCAAATCATCACGTTTCATTAAGAATACAGAAACGCTTGCATTCAAAATAAATAGAATTGCCATCACGACATACAAAATTTCAACGTCCATTGAATCACCTCTTCCTGGCGAGATCTTTAAAAATGGAACTCTTCTCGAATTAAAACTCAACACTACCAATAATCGCTAGCAGGCCAATCGGGAACAATAGAACCCACTGCAGCAACTTCATCAAAGCCTTAGCTTGAATAAGTAGTGATGCCTGCGCATAAACAGAAAAAAACATGGCGCTATACACCTATGGAGCGCCATGTTTGTGTCCCTCTTGAGCGCTTTGTTAGACATCTTCGTCAACCGCTGCATCAAAAGAGCCTGGCATTTGCTCTTCGCATGTTTCCAGAGCAATTATGTCTTGTTCATTCCATTGGTCTAATCGATTTAAAACACTGGTATAGCATGCAGAAGCTCTGACATTAATTGCTAAATAAGTATTGTCCCAACGTTCCAGTGAACAATCTAATTCACGAATTATTTTGATGTATTTTTCTTGTTGTTCCCTAGTTACTTTTTTGTCGAAAATAATTCTAAGGGTGCTATTGCCACTTCTTTTAACCACTTTCTCAATCTCCAAGATTCCGTCTTCGTCGGGTTTAGCTTCAACCTCATCTTTAAAATTGAGTGAATAAGAGAAAAATGGGACGTTTTCTAAACGATAGATATTATTACCGAGAGGCTCGGCCCACATCGATTCCCCACCAATCATCCAATGGTTTGGAAGAGAAACATGCACTTTCATTAAACCGTCGTTATTCACTCTGAAGCCTAACACCTAAATAAGGGGCAAATGCACGTAGGCTAAAATTAGAACGAAGCGACTGAGCCTACGTGTATTTGTCCCAGCAAGTTTGCGAGCGAGTTTAAGCGCTTGTTAGCTCAGCACCTTGCGCCTTCCTTGCGATAAATGCGACTGACCCCGAGTACACATCCATAAATATGTGCGCCAATATAGCAACAACAATACTTTCGAAATACAAATATATACCGCATAGCAAAACACCCACAACAGCTGTTTTTATGACATGCTTCCAACCAAGATATATATGCCAAAAGCCGAAAATGATGCTACTACCAATAACGGCAACTAGCCCGCCTGTATGTTGCTCAATGAAGCTAAACAAATACCAACGGAAAATAAGCTCTTCGCACACACCAGCACTAAATGAGACCAGCAATGTAAAAAATAAAAACTCTTTACGTGATGATGGCAGTATTTCTACAAATGACTCGCCACCGTCTTCAAATGCGTTTAAAACCTGAAGTCTGACACTTTTATCTTTATTGATTGA
ATAAAGCACATATTTCATATATGCAATAAACACCGCAAACATAAAGAGCGCCAAATAACCTTTCCAAGTAGTAGACGGTAGGTATTGTGGAGGTTCTATTTTTAAGACACCATTTAAAAAGCAAAAAAATAAGAATCCAGTGACAGCCCACAACATGAAAGAAGTTTTTACGTATTCGGTACACTTATTTCTACATTTATATTTTTCCAAGACTAAATCGAACAATGGAAATAGAAGTATTGTAGACAGAAATGCAATTTCCATCGTTTGCACTGCAATCATCCTTGTGTAGAGCTAAATTTTCACTAACGGGCGCAAGCTCGCAGGGTATAATGGCGAAGCCGCCCCGCGGGCGACATAATTTTTTTGTTAAGTACTTGTGCCATTACCACACTACAACTTTTTTTGGCTCAAAGCCACTAACCATACAACTACCCAACACTAGTTCGCCGCGTTCAGCAGCTGTAAATGCCTCTTCTGTGGGAAAGCCCATTACATAGTGCACAATTTTTGGTGGTCCATTCAAGTCGATAAACGTGAAGTCTAAAGGGATAAGGCGTCGAAACAATGAATACAAACCAGGCGATAACTTTAGTTTTTTGATTTCTTCTAAAAATGAAAGCAGCGATAAAAACAAATAAGCATAAAAAGCCTAGATTACGATAAGCCAAAAAGCTCCATAAGATACTACTGAGAACAAGAATCGTTTTGCTTCTTTACTTTGAAAAATCATGTGAATCTACATCAGCACCTAACGGCCCAACAACGGGCGCTGGCACGTAGGGCACAATAGCGAAGCGGCCGTGCGAGCTTGCGTCCCAGCGAGCGAAGCGAGAGCCTTGAGTGGTTTGTTATGTCCGTTTTCCAAAATCATCCTTGAATTTCTTAAATTTGAATTCGTTTATAGAGATGAGGATCGTAGCATAAGCGATAACAACAATAACTGCTCCCATAATTTGTTGGTTCTGAGACCATTCTAGGTATTGCATGATTCTAAATGTAACAAACATGACAATTACACCGCGGAAAAGCATCCAGACCTGAGACTTATCTTCTTTGTACTCATTGTATAATTCTCGCATTGCATTATCGGATTCCACCGCTTGAATATGAGCAGCCTTTGAAGGGTGTTTTTCAATCGTATCCAAGAGTAGTGAATAGCGTTCGGGAAATTTTTCCTTGTCTAAGCTTTCCTTAATGTCATACAGCTCTTCTAAATCATATTTAGAGAAATCGGGCTCCATTTTTTCTCCTAGGACATAACGGCCCACAAAACGGGCAAAAATACGTAGGCTATACTATTGAGTGAAGCGAAGGGAGCCTACGTGTTTTTGTCGCAGAGAGCTTACGAGCGAGTTTATGCGATTGTTAGGTTAACTACTCTAAAAAAGCGCCTTGCAGCAAGAGCCTATTAACATTGTGATGTATGACTACAAGCCCAATGGCGCAAATTGCAAGAAACACAATTGTTGCCAAAATATACTCTACCCAAGCTAACCACTTAATTCTATTTTTATGCTCGAGAATTAACTCTTTTCTGTAAGCGCTTGATAGTAGAAAAGGCCAAGCAAGATATACAAAGCGAGGTCCGCTGGCTAATTCTGCAAAAAAATCTAAAACGTCGCCTAAAACGGGAATTCTGGATAGCACGATAGCTACAGCGAGAAATACCCCGAAAACAAGTAAGCTAATTACAGCAACCAAAATTTCCATTTATAGGTTTACCTACCCCCACTTAGGGGCACAAATACGCAGGCTATAATACGGAGCGAAGCGGAGGGAGCCTGCGTATTTGTGTCCCAACAAGCGCCAGCGAGTGCCTACAGTGGATTGTTATGCACGTCTTTGCCTGATACGTAATCTGTGTATAGCAATAGCCCCAATACTGCCACCAGGCCAAGCAAAAATAAACTAGAAGCACTGATTGACCTAGTAGCCATTGAATATACGGAAACTGCACACAAAAGAAACACATGAGCTGAGCAAACAATATATGACCA
TCGTATTCTTCGTTTTAGGGCAGACACTGAAATGATGTTAGCTAGTAATAGCAATGCAAATAAAGCCGCAGTTTTTGGAATAGCATCGACGTTTATAAAGTCAACGGCGACCAACGTCGTTAGCGAGAGGATAAGCAGACCTGAAATAATGGCGTAGACAACAGAGAGAAAATTAAAGGCCAAAACAACCTGTCTCATGGTTCACCTCATTCCTTAAAGTGCATAACGAGCGCATATGCGTAGGGCAAAATAGCGAAGCGGCCCTGCGTGTTTGTGTCCCAGCGAAGCGCGTTTATTGGTTTGTTATATGCTTCCGCATAACAACTTTCCGTGCGAAAACTTTTCAAGCTTTTTGGACACTCGAACAGACACAATGATTTCATTAAACAAGTGCTCAGTGTATACATCATGAGAAATAATTCTCACCTGACAACCTATAAAACTCTTCTTCTTTGATAGTTTCAGTTCGTCCAAATCAAGCACAAGGCCAAAACTTGGAAAGCTGCTCAAAAGCCATAAACTATCGCTACCAACTACAATTTGAACATTGTATTTCCGCCACACCCGAAAACTTTTCGCCTTTGACGCCATTCATTGCTTCTTTCCTAAGAAAACCACTCGCACCGAGATTGATAGAAATCAGCCCGAGAAGACTGGAAGCTAACCCCAGTATAGGAACCAAATAATCCAAAAATAAATCCTCAGTATTTCAAAGGCATATAGCACCCTAAGAAGGGGAAATTACATGTGGGCTAAAATACGGAGCGAAGCGACGGGAGCCCACATGTAATTGTCCCACTTGCTTGGCTTGTTATGCATCATTCAACGAGAGAAACTGCCCCGTCATCATACTTTGATATTTTCACCGTATATACTTTGCTTTTCCACGCTTTGCGTGATTCTAAAGAAACGTTTGCATGTTCTAAAGTTGTGCTGACTGAAAGTAAATAAAAATTATCTTTTTCAACAAAATCAATAAATTTCCAACCTTTTGATTTTGGTAATTCATATAACGCTGGGACTTCGTATAGATCTCCGTTACTTACGATTACAAAAAGTCTCGCGTTAGGGCAAGACTCAAAAAGGCCACCGCACTCTGATATTTCCTCCCATGTTTCGATAACCAAAACATTCGGGAAAGTGTTACTTTTAAATACAACTTCAGATGTGGAAAGAATCAGGGCTAGCTCTAAATTTTCACTTTTGCTCAAAGTTACTGCATGTACCTGAATAGAGTAGAAAAGAGCTAACCAAACTAAAAATTTCACGAAATGTCCTTGGATGCATAACGCCCGCATAACGGGCAAAAATACGTAGGGTACAATTACGAGCGAAGCGAGGGAGCCTACGTGTTAGTGTCCCGCTGCCTTGGTTTGTTAGGTTACCTTGTTAAACAAAACAGTCCGCTTTCATTAGCCCAACTTTCTATTTGAGTTGCTTTCTCTGGATTGCGAAGATAAAAAACATTGGTATCGTCCCAGCTTTCAAAGTACACAGCCTCTAGCTCCGGTAATACGACCTTATAAAATTGAGAGCTGTCCACGTTATGTGGTGTGCACATCAAATAATTTACTTTCGCTGTGTGAGAAGTAAATGTGCGAAATAATGGCAATGCTTTATGCCAGCGCCCCTTGAAAGTAAAATTCAGAATTTCAGTGTTACTCACGAGATTTTCAGCGAATTTAATAAACTTTGAATCTCTTTCTCGTTGCTCTTCTTCGGAGACATCTGACAGAAACTTTAGTGCATCATCATGATCTAGCCAGCGGTCAAATACTGAAACAGCCATATAGGTATATTCAGAGGTGTCTTCTACAGCGACTAACTTGTCTAAATCAGGAAACTTTTCTCGCAGTTCCATCTGAGCATTCTTATCTAAAACTGAAAATCTACGCATGTTTCTGCTGGTAACCTAACACCCCGCTCTAACGAATCCCCTCATAAATTAAAGCAGATTCAATAATATAGACACGCATGGTGCAATTTGATCTAATTCATTT
CGCCAAAAACATATCAGAACAGTACACCATGCGCGATATCTCTATATTACATGATTTACTTAAAAATCAATGCCCTAATTTGCATCAAAAGCGTCTTTCGTTTCTTATGGTTGCCGTGCAATCATTACTCGACGGACAACAACTTTCACTCACTGAACTAGGTCGTAATATCTCAGGGCCAGTCTCGGCCAAACACAATATTAAACGCATTGACCGATTGCTCGGTAATCAGGCCCTTTATTCTGAGAGGCTCGATATCTATCGATGGCATGCGAACTTATTGTGTGGAGCCAACCCAATGCCGATTGTGTTGATAGATTGGTCTGATGTGCGTGAGCAAATGCGACATCAAACACTACGTGCTTCCATTAGTTTTGAGGGCCGTTCGGTTATACTCTATGAGCGAGTATTCCCTTTCTCTCAGTACAACTCTCCGGTAAGCCATAACCCCTTTCTCAGAGAGTTAGCAACAATACTGCCTAAGCGCTGCTGTCCTCTGATTATCACCGATGCGGGTTATCGCAATACCTGGTTTCGGGAAGTAGAAAAGTTAGGTTGGTTTTGGCTCGGCAGGATCCGCGGAGAAGTGGGCTTTCGAGAGAGCGGTCAGTCAAAGTGGCGCAGTAACAAAACCTTTTACCCGTCAGCAAACGACAAAGCGCGTTACTTAGGGTCTGGTGACTTAGGTCGGAAAAGCCCAATTGAAGCGTACCTTCACCTATTTAAGGCAAGATCAAAAGGCCGCAAAGACCAACGTTCGTCAAAAGCAGGAAGACATCACAATGCGCAACAAAATTACCGGGACAGCAGCAAAGAACCCTGGTTGCTCGCCACAAATCTACCCGCTGAATCAATGACGTCTAAGCAACTGGTAAACCTTTATGCCAAACGCATGCAAATTGAAGAAAGCTTCAGAGATATCAAAAGCCCCCAGTATGGCTTGGGGTTGCGTCACAGTAATACTCGCTGTACCAAGCGCTTCGATATTCTATTGCTGATAGCGATGCTGGCTGAGTGGGTGTTAAGGCTCATCGGTTTTATTGCAACAAAGCATAATTGGGCACGACAATTTCAGGCAAACACCATCAGGAATAGACCTGTTTTATCGCTCATTCGTCTGGGCAGAGAAGTAAGAAAACGCAGTCAGCACTATCAAATAAAAGAACACGATATTCGATGGGCGATCAGGCATTACATTGAACGAATTCATGAAACCGGAATGCCGAAATTATGAGGGGATCCCCCAGCGCCCGCATAACGGGCAAAAATACGTAGGCTATAATAACGAGCGAAGCGAGGGAGCCTACGTGTTTTTGTCCCTAACGAGCGCAGCGAGTGTTAATGCGTTTGTTATAATCCAACCATGCCTTGCAATGCGCATATGCATGCGTATACTTGATGGTGAAATTAGTTGGAGTGATAAAATGCAAGATTTATATGACATCCATGCCCCCAAAAAAGCGACAAATCTAAGCTTAAATAGCGACTTATTGCAAAAAGCTCGAAGTTTAAAAGTTAACCTGTCTGCGACTTTAGAGCAAGCACTTAGAGACAAGCTAAAAAGTATTGAAGCAGAGAAATGGAAAAAAGAAAATAAAGCTGCCATAGCTGCATACAACGAGTTTGTTGCTGAGAACGGTTGTATAGGTGATGAGTACCGGAACTTCTAATGGCGCAATTTGATGTTTACAGAAACCCAAGCAAGAAAACTAGCAAAGCTTACCCTTTTCTTGTTGATGTTCAAAACTCTGTAATTGACCAATTGGCAACAAGGCTGGTTGTACCGTTAACAACTAGCAATACTAAAAACAGTTTTTACATGAAGAAGCTGACGCCGGAGATAGAATTCGAGGGCACAACATACTTATTTCTAGCACAACAACTTAGTTCTATACCCGAAGACGTGCTAAAGGATCGTATTGGTTCGTTGGAGCAATCTAGAGAACTGCTAATAGACGCAATAGACTTTGCCATCACTGGAATATAACTTTTCAATAACG
GGCGCAGACATGTGGGGCATAATGGCGCAGCCGCCCCGCGTGTATGCGTCCCAGCGAACGAAGTGAGTGAGTTTATTGAGTTGTTAGGCCGTGCTGATAAGTTAGTCACGGCTATTAATAAAACGTTGTAACTCATCTGAGAAGCCACTAATTTGTTTAGCTTCTAGAAGGCCGTCTGATTTCATCAATGCAAGTTGCAGCATAGCTTTAGATCTTGCATTGTAATTGAATATGTTAGCGATACGTTTGTCGCTGTTTTCCATCAATTTATACAAATAAAGATATTTTGCATGATTTGAAAGTTCGGGATCATCGATTCCTTTACGCACATCGTCCAAAACTTGCTGGCAGCGTTTATCTAATGCATCTGCCTTAACTTGTTTGAAGTGTTTCCAATCTGCCTCTGAAATCATGTTTTACCTCTATAGGCCTAACACCCCGCTTAGGGGCAATAACACGTGGGCTAAAATTTGGAGCGAAGCGACTGGGCCCACGTGTTATTGTCCCAGGGAGCTTGCGACCGATTACAGCGGTTTGTTAGCTGCCAACGCGCCAGCTTTTTGCTTGGCATATGAAAGTAACGATATGCCAATGAAGTAGTAGATACCTGCCTGAAATACCGCTAGTAAACCATACTGACTGTAGTAGCCAGTAAAAATACCAATAATTGGAAGTAGCATATACATTACAGCTGCAACTCTGAAGCCAAAAGGACTAATTAGTCCATTGCTAGCATTCAAATTGAAAATGAGTTGAAAACACTTAATAAACAGCCATGCATTTAAGATTGCACAAACAGTTGTAATTAGTACGCCGAATAGAAATTTCTCTTCATTTACAGCCAGAGGCATCACGATTACGCCCAGCAAGCACAGCAAATTTAATAAAAAGAGAACTATTCCTGAAGGGATTCTAACCCATGGGCCTAGCATCGACTTCTCCATAAGTGTCCTTGGCAGCTAACGCCTAAATATGGGGCACAAACAGGCAGGCTATAATGGCGAAGCCGCCTGCGTGTTTGTGTCCCAGCCCGCCTGCGGGCGACATAATTTTTTTGTTAGAAGCCTGCACGCCGTTGCTCCCTACGAGGATCATGTGGATTTACATTACCACCAAATGTTTGTTTATACACCTTGCCTAAATCTGAAGTGATTTCAAACGTTACTAGGTAAGGTGAGCCCAGAGCTGAACGCCAATAGTGATACTGCTTTTGATATTGCGCATTAAAACTAAAAGTAACATTCGCTGACTTAAGCCATGATGAGGGAATTTTATGAGAATTCAGCATTTTAATAAGCAATTCGCACATGCTCTTAGCAAATGGATCAAGGCAATTTTTAACACCGCTATGTTCTACTTCTAACAGATTGATTGAGATAGAAGATATGTTTTCTTGCTTTGCAATTAAGCATAGTTGCCCTATAGCCCAGTAACCAAGATGATCGTTATTGCGGTTATTTAGCATATGAGCAAAGTTTCTAACAATTCCCTTGAACTCTGAACGACGAGCCATGATTCTCCTTGGCTTCTAACACCCCGCTTAGGGGCTATAACACGTGGGCTAAAATTAGGAGCGAAGCGACTGAGCCCACGTGTTATTGTCCCAGGGAGCTTGCGACCGACTACAGCGGTTTGTTAGTTGCCACTTTTTTCCCGACGTAATTTGAAGCTAATATTGCCCGCACTCATTTTTTTCCCGTTGACAAAAACGTTTACAAAGAAAGCCATTTGTTCTTCGGTTAGCTTTGTGACTGTGAAAATTTCAACCTTATTTGGGTGAGTGACAACAATTCGATTTTCTTCCAGCTCATAAGGTGCTTGGGAGCTTTCCCCTTGTGCCGCAGATGTAAAAATACCTTTTTCAAAATTGAAGAAGTCATTTTCCCCTGCTGGAAGTCGCGAATTCAACTCCGAAGTGCACCACTCGCTAAATTAGGCAGCCTTGCGTAATTCATTGAATATCGCGACAGGTTGTCTGAAGCCGAGACACTTTCTCGGTC
TGGCATTTAATGCCCGTTCAATCTCAGCAATTTGCTTGTCCGTTACTGTCCGCAAATCGGTACCTTTACGTATGTATTGGCGAAGCAAACCGTTAAAGTTTTCATTCAGTCCGCGCTCCCAGGATGAATAAGGGTTCGCGAAGTAAATATCCGTTTTAAGCTTCTCTGCGACCAGCTCGTGGTCACAGAATTCGCTGCCGTTGTCCGCCGTGATAGTTCGGACATGACTGCGATATTTCCACAACATGCCAACCATGGCACGGGCCACATCGGCTGCGCTTTTCGCTGGCACTTTGCGTATCAGATACAGCTTACTTTTGCGCTCCACCAGGCTAACAATCGCGCCCGTTCCTTGCTTGCCTAATACCGTGTCAGCTTCCCAATCACCGAACCGCTTCTTCTTGTTCACGATAGCGGGACGATGCTCTATGCCAACGCGATTTGGGATAATCACACGCTTCGCGCGGCTACCTTTACGGTATCTTCGACGGCTCTGACGTAACTGTTTATACAATTTACCGCCGCGAAATTTATCACGCTGGACGTAGCCATAAATCCACTCGTGACTGACGTGATGACCGATGATTTTACCTACACCAGCTATCTGCTCAGGGCTCCACTTTTCGTTAAGCCCGAACTCAACAAAAGTAATCGTCAGTTCAGATATCCTGTACTTTACGGCCTGACAACGTCGTTTTAATGCCTTGGCCTGAGCGACTTCTGGTAGGTAATGGCTATCTCGAATTCGATTGCGATTTACTTCCCGGCTGATGGTGCTATGACTTACTCTCAGGCGTTTTCCTATTTCCCGGTAACTGAAACCCTCGCGCAAGTAGGCCTCTATCTGGTATCGTTGTCCTTCGATCAACTGCTTGTAACTCATGGTAACTACTCTTGTTTTTTGGCGATTACAGAGTACCACCAACAGGCAGTTGATCTCTCCTCACCATTTAATCAAGAGTGGTGCACTTATTATCTGAATTTGCGGTTCCCCCAAAGTTCCCATAACAATGGCCTCTATATGCCATTTACCTGCAATTTCGACATCTGTAACATCTAGTGCCATTGCCGAAAATGGAAGCAGGAACAAAACAATTAAAACAAATTTGTGCATGAAAAATCCCTAGTACTTATGAGTGGCAACTAACGCCCGCATAACGGGCAAAAACACGTAGGCTACAATACCAAGCGAAGCGCAGGGAGCCTACGTGTTTTTGTCCCTAGCAAGCGCAGCGAGCGTTAATGCGTTTGTTAGAAGACGATGTTTCTACTACCTTTGGTTGCTCAAAAGCTAAAACCGAAACACACTCGTTTAAACCAATAATTAAGTATTCTTTTGGCTGATATAGCTTACCCGTAACGAAGTGATCTCTAGTTTTTTCCAGAGCATTCCAGCCACCGTCGATTACTTCAAAACACCAACCAGTTTTTAAGTTACATTCACTCCAAAACTCGCCTAAATCACCTTCATCAAGCACTCTGAAACCATAAATAGCATCGAAATAAACTTTTATACTCCAATTCTTCGATTTAACAGCGATGTTTAAGCATTCTTTGTCGAATGTAATTTCGTCAATGTCTACTGTAGCGTCTGATTCGAAATCAGTCTCTACTTTTACAACCTTAGGCTCATAAATCATCGCTAGTCTTCTAACTTTCCAATAACGGGCGCAGACATGTGGGGCATAATGGCGCAGCCGCCCCACGTGTATGCGTCCCAGCGAACGCAGTGAGTGTGTTGATTGGCTTGTTAGGCTTTACCACTTTCGCTTATTTAACCATCATGTTGCATCAGCGCCCAATCTTTATCTTGGTTTTCCATAACGAAACACAATGGTTCCATTTTTTTATTACCTATAGAGTATGTTGTCACAAAAACACAACCGCCGGACTCTCTAGCTATAACACCTGTGACAAGTGAGTTAGCACCTTCAATTCCATACATGCCAATCTTTTTACGAAAATCTTTTACAATTTTCTTTATAGTTTTTGACTTTGGT
GCGTTGAAATACCAAAAATGCCAAGAGCTCCATCGAATCATAATGCATTTATACTTCTCGTGTTCTTCTTGATTTAGTAACCGCGCCACAGAATATAAAGACCCTTGTGGTGTGAAGGATAAAAACTCTTGAATGAACCTAGAAATTTCCTTTTCCATAATAAAACCAGCTGTCTATGTGAGAAGCCTAACACTTAAATATGGGGCACAAGCAGGCAGGCGATAATGGCGAAGCCGCCTGCCTGTTTGTGTCCCAGCCCGCCCCGGCGGGCGACATAATTTTTTTGTTATACGTGCGAGCTCCGAACCTTCGCGGCTTGCCGCCACATCCTGCTGATAAAACCGCAACGTATAACACCTTGAATTACCTTAAAACAACAACCTACAAAATACCAGACGCACATTTGATTCTGTAAACAACCTGGAATGCTGACTGTTGGTGTTGCTGACTATCTAAAACCGCGATGATAAAGCTGTTGAAACAACTGCAAATGGTGGGTGCAAAAACTACCTAGCAAACAGCCCTACCTAGCCAAAAATCTAATCCTTTGAAGCCTTAAAACAACTAGCACGAATCGGCGTATAACGCCCTGCTTAGGGGCAAAAATACGCAGGCTAGAATTAGGAGCGAAGCGACTAGGAGCCTGCGTGTTTTTGTCCCAGTGAGCGCTAGCGAACGACTACAGCGGCTTGTTATGTTTACTTGCCTTTTAAGTGCGAGACAATTAAAGACAGCTGAACCAGGAGCAAACCAGCCAATATTTGATCGAATTTTAGATAGGCTGATATTAAGAATTTTAAATTTTTTTGTTCGTTGTACTGCTCAGTTTTAAGACCATTTACCATCATCAACGAGACTGCTCGTAAAGCTTCAGGCTCTGTAGAGTTTTGTATTTCTTTCTCTAAAAGGTCAAGCCTTTCAATATTTAAAGGAATAACACTGTTCTGTTCCATTTTAGAAGGAAGAAAAACCAGTGAGTAACCTACGTAAGCGACTAAGGCTATGCTAGATAACAATAAAGGCCACTTTGACAAAATTTAACTCCGTAAACATAACATGGGTTTGTTAAGTGGCGCACATGCATGTGCCAGATAAAGGAACTGATGTATCTTGCGTCCACTTCAACAACTTTGTTATACGATTGCGAGTCACTATACTACATCTACCCTCTAATGCTGTTCCGTACACCTAGTTAACTTTCCATGTTCCTTTACGCCGTACGAGCGGCTTTTTCGAAGTTCGACATCCACACTTTTAGCGCCAGCGATGTCGCTATCAAGCCCCGTCTTATACGTCATCGCTGGCGCTAAAAGCGTGACATAAAAGCTCGAACGTTACCACTGCACTTCGGGCCAGTTTACTTGACGTACTACGAGTAAATTCCTATGGGCTCTTACGCTCGTACCGTATAACGCTAAGCTTTGGGGCGCTAGCACGTGGGCTAAAATCCGAACGAAGTGAGTCAGCCCACGTGTTAGCGTCCCAGCGACCGGAGGGAGTGCCAACAGCGTTTTGTTAGGCGTACAAAGAAACGTACGCAGCTATGCTGCCACAACTAGCCAAATAGACCGCGTGTTTAAATAATTGAATTTAAGATAAAACAACCCTGTGCGAAAAGCCAGATGTTAACGGTAATACTGGATAAAAGAGCGAGTTTAGATAACTTGTATTGTTAAATGTTGGCGTTGCGGTTTTTATGAAAAGCACGTTACTAAAGTCGCTTAAACAGCAATATGTGCTCGCCGCAAAGACAACCTAGCGAAACAACCTTGGTGCTGGAATGTCACAGTTAAACCTACTTTTTAATGCTGTAAGTAAATGACGCCTAACATGGGTTTGTTAAGTGGCGCACATGCATGTGCCAGATAAAGGAACTGATGTATCTTGCGTCCACTTCAACAACTTTGTTATACGATTGCGAGTCACTATACTACATCTACCCTCTAATGCTGTTCCGTACACCTAGTTGCATATTCTGGCTGATCATGAACACCTATTCT
GGCCGATGGTGAACACTGATTCTGGCGCATCGTGAACACCTATTCGGGTCTGATGGTGAACAGTTTTGCCCTGACGCCAGAATCACTGTTCACGATCCAGAATAGATTTCCCGCATTGATATTTTTGACTGCTAACCTGTTGCCTTTTGCTTTAGGAAACAGGGATGCCAACCAAGAGATTATCAATGCGACAACTTCGAGAAATTCTTCGCCTGAAATTGCAGGCAGATCTCAGTATCCGGCAAATCCACCGTAGTTTACGAGTGAGTGTTGGTGCCGTTTCCAAGGTGCTTAGCAAAGCGAATGAGATGAACCTGAGCTGGCCTGACGTTAACCAGCTTGATGATGTACAACTGGCAAGTCGCTTCTATCCAGAAGCGGATACACGTCAGTCTGGGCAGTTTGAGATGCCGGACTGGCGGGATGTGCATCAGGAGCTTACCCATAAAGGCGTAACCAAACACCTCCTGTGGGAAGAGTACACAGAACAATATCCCAACCGCAGCTACAGTTATCCGCAGTATTGCCACCATTATCAGGTCTGGCAAACGTTACAGCGCCGCTCAATGCGGCAGGTGCACAAGGCCGGTGAAAAGCTGTTTGTGGATTATGCGGGGCAAACCGTACCGATAATCAGTGCCAGTACCGGTGAAGTTCGTCAGGCCCAGGTGTTTGTTGCGGTGATGGGCGCTTCCAACCAGACCTTCGCGGAGGGCACCTGGACACAGAGTCTGCCGGACTGGCTGGGTAGCCACACCAGAGCGTTTAGCTTCTTCGGCGGTGTGCCGCAGTTAGTGATACCGGATAATTTAAAAAGTGGAGTGAGTAGAGCCTGCCGCTATGACCCGGATGTCACGCCAGCCTATCAGCAGTTAGCGGCTCACTATGGCTGTGCGATTGTTCCGGCGAGACCTTATAAGCCAAAGGATAAAGCCAAGGCTGAAGTCGGCGTACAAGTGATAGAACGTTGGATATTAGCCCGGTTACGCCATTACACCTTCTTCTCACTGGCAGAGCTGAATACGTGTATTGCCGCCCTCCTTAAAGACGTGAATAACCGTCCCTTCAAGCAACTCAATGGCAGCCGGCAATCGTGGTTCGACAGCATCGATAAACCAGCACTCGCGCCGTTACCCAAACTGGCTTATCAATATACCGACATCAAAACGGTCAAGGTCAATATCGATTACCATATCCAGTATGATGCGCACCTGTACTCCGTGCCTCATCATCTGGTTGGGGAGCGAATAGATGTGCATGCCAGTAATACCTTAATCACCCTGTACTTCCACAATAAAGTCGTCGCCAGCCATCCCAGACAATACCGCCATGGGATGAGTACCGTTCCGGCGCATATGCCTGCACGCCATCAGAAGCATCAAAGCTGGACGCCGGGCAGACTCATGAACTGGGCGAAGGATGTCGGGGATGAGGTACTTGCCTGGGTAAAACATCAACTGGCCAGCAAATCACATCAGGAGCAGGCATACCGCGTGTGTCTGGGCTTGCTCAATCTCTCACGCCAATATCCGCCACAACGACTCAATAAAGCCTGCGCTATCGCCAATCAACAGCATCTGTACCGGCTTAAACAGGTGAAAGCCATCCTGACCTCGAATCAGGACAAGCTGTATCAGCACAATCAGGACGAGTCGCAAAACCATCTGCCACAGACTCATGAGAACATCCGCGGCCCACAGAGCTTCCACTAACGCAGGAATATACAATGACCAACACCACCTTAATGCTGCTCAGACAGCTAAAACTCACCGGCATGGCCGATGCCCTGACCATGCAACAGTCGCAGCCGAATAACTACGATAACTTAAGCTTCGAAGAGCGACTGCAATTACTGGTCGATGCCGAACATCTTGAGCGCGGTCAGCGAAAACAGCAACGGCTGCTAAAGGCTGCGAAGCTCAAACTGCATGCTACCGCCAGGGACATTGATTACACCCATCCGCGCGGTTTAAAGCAGGCGCACATGGCCAGCCTGCTGC
AATGTGAATGGGTGCATAAACACCAGAACTTACTGCTCACCGGGCCGTGCGGTTGCGGGAAAACGTACTTGGCCTGCGCCGTGGCTCACACCGCCTGCATGAAAGGCTACAGCGTCAGGTATTACCGGTTATCCAGACTTATGCTCGAATTGAGCCAGGCAAAAGCCGATGGTACTTACAGCAAAATGCTACAACAACTGGCCAGGATAGACATCCTCATCCTGGATGACTGGGGCCTTGAGCCACTAAAAGCTGCCCAGCGCAACGACCTGATGGAAATCATGGACGACCGCAACAACCACTGCTCAACCATTATCATCAGCCAGCTACCCACAGACCAATGGTATCAATCCATCGGTGATAACACGCTGGCTGATGCCATCCTAGACCGGCTGATGCACAACGCGCACCGGATAAAACTAAAGGGAGAATCAATGCGAAAAACACGCTCAGAGTTGACTGATGGTGAACACTTAGCGTAAAAATGGGCTTCGCCCAATGCGGGAAATCTTAGGTGTTCACGATGCTCCAGAATCCGTGTTCACGTTCAGCAGAATACGCACTAGTTAACTTTCCATGTTCCTTTACGCCGTACGAGCGGCTTTTTCGAAGTTCGACATCCACACTTTTAGCGCCAGCGATGTCGCTATCAAGCCCCGTCTTATACGTCATCGCTGGCGCTAAAAGCGTGACATAAAAGCTCGAACGTTACCACTGCACTTCGGGCCAGTTTACTTGACGTACTACGAGTAAATTCCTATGGGCTCTTACGCTCGTACCGTATAACGCCCGCATAACGGGCAAAAATACGTAGGCTACAATTACGAGCGAAGCGAGGGAGCCTACGTGTTTTTGTCCCCAGCGAGCGCAGCGAGTGGTTAATGCGTTTGTTATACGCTCAGTCGTAAAACCAACCTTCCGATTTATAAAAACTGCAAAATGCATCAACGAGATATTGCTGTTTACCGCTATTTATCGCGTCGCGTTTTAGAATGTAACCGTGAATGACTTTAGTTGCATTGGGAATTGTAATCCATTCATACATATTATCCTGCTCACTTATAATTCCTTGCAGTTTCTTTAGCGCAGTTTCTGCACTCTCTTTATTGAAATAGCTTTCGTCCATCCGAACATCTTCCGGTATATCTGTATTTACTGGGCAAATATTTTCACTGAATACCTGACTACTAAATAAGAGTAAAATTAAACCAAAAGATCTCATAACTTCCCTCTTGAGCGTATAACACTTAAATATGGGGCACAAACACGTAGGCTAAACTGCGAAGCAGAGCCTGCGTGTTTGTGTCCCAGCCCGCCTGCGGGCGACATAATTTTTTTGTTAGGTGCGTACTAAAACACCCTTTATTTGTTGCCACTAAATACAATCTAAGCCTACAGTGCAAACAGTTACTAGCGTAAAAAACGGCCAAACAACGCCTGTAATTATGGAAGCAATGGATGCCATTACATTACGAGTTTTTGACTTTATATTTTGAGACAGTTTTACAAATAAGGCGGCTAAACCCAAGGGTACAGAATACATAATAAAAAGGCTTATAAGTAGCTCCCTTTCTCGGCTTTGTATTACGTGAGGAGCTGTGCCTTCATACCCGCTAATGGTAAAAAGACTGAAATAAAACAATAAAGCCGCCAACAGTCCCTTATCTTTGTAGGTGTTAGACGCCCAAATTGAGCTGAATAGAACTGTAAAGCAAACAACCGTAGAAGTTATAAGAGCTTCGAACAAATCCATTTTCCTTGTAGCACCTAACGCCTAAATATGGGGCACAAACATGTAGGCTAAACTGCGAAGCAGAGCCTGCGTGTTTGTGTCCCAGCCCGCCCCGGCGGGCGACATAATTTTTTTGTTAGGTGTCACTTTAGATCGAAGAAACTGCAAGAACAAAATTTAATAGCCCCCATAGCTCAAAGAAGATAATGACACCTATACCTATAAGGGCTACTGTTCTTCCATAAAACGAATAGACTTTATCTCTTAT
ATTCTTTACACAAAAGCATAAAATTGGAAATGGCATTACCCAAAGTGACCACATACCAATATAGTAAAAAAGCATGGGAACAGATCTTAAATTCTCAGTAAGCCCCAATCCATAAAAGGCAAAAAAGTAAATAATACAGAAGCATACCAAAGCTAGATACACATACGAAATCCATCTCATCACACTGGCACCTAACACCCACATAAGGGGCAGTAACACATGGGCTAAAATCCGAGCGAAGCGACAGAGCCCATGTGTTACTGTCCCAGCGAGCTTGCGAGCGGCTTAATGTGATTGTTATGCGCTATTGACTTACAACGCGCAAGTACCATAATTAGGTATATATTGGTACTTGGAGAAACCAGATGGCTAAAAACACCAGCATTACCCTTGGAGACCATTTTGATGGCTTTATTGCTAGCCAAATTCAAACTGGTCGGTATGGTTCAGCAAGTGAGGTGATTCGTTCAGCTCTTCGCTTGCTAGAAACACAGGAAACCAAATTAAACACTCTGCGTCAATTACTTGTAGCAGGTGAAGAAAGCGGCGAAGCTGAATATGACCTTGAACATCTGATATCAGAACTAGACGGTGAATTAAAAGAGTGAAACCATTTAAGTTAACCGTACTAGCAAAATCAGATCTAAAAGATATTGCATTGTTTACGCAACGGAAATGGGGCCGAGAACAGCGTAATATATATCTTAAACAATTCGACGATTCATTTTGGATGTTGTCTGAAAATCCTGATATTGGTAAAAGCTGCGATGAAATAAGAGATGGATACAAAAAATTTCCTCAAGGTAGCCATGTTATTTTCTACAAACAAACGGGCAGTCAGGAAATTCTGATCATAAGAATCCTTCATAAAAGTATGGATGTAAATCCTGTACGCTTTGGCGCATAACATGGGTTTGTTAAGTGGCGCACATGCATGTGCCAGATAAAGGAACTGATGTATCTTGCGTCCACTTCAACAACTTTGTTATACGATTGCGAGTCACTATACTACATCTACCCTCTAATGCTGTTCCGTACACCTAGTTAACTTTCCATGTTCCTTTACGCCGTACGAGCGGCTTTTTCGAAGTTCGACATCCACACTTTTAGCGCCAGCGATGTCGCTATCAAGCCCCGTCTTATACGTCATCGCTGGCGCTAAAAGCGTGACATAAAAGCTCGAACGTTACCACTGCACTTCGGGCCAGTTTACTTGACGTACTACGAGTAAATTCCTATGGGCTCTTACGCTCGTACCGTATAACATCCCAAGAAGGGGCAACTACACGTGGGCTATAATACGGAACGAAGTGACGGGAGCCCACGTGTAATTGTCCCGCTTGCTTGGTTTGTTATAAGGCTTACCACTGATGCTTAGTTGCAACTACGACTGCCCCTAAATAATGGGCAAACCCAATAGCTCTAATCGCTCTTTCATTGAAAATTGGTATGGCGGCTTCTTTCTCTGTCTTAAATACAAGTTGTGCTAGCGGAATATATTTGTACAAAAAACTTTGGTACTTAACCAATATAAATTCCATATGTCGTGGCACAAACACTAACCAAGTACCAAATACAATAAAGGTGACGATGCGCCAAATTACTTCTTCCATTGAAGGGCCTTATAACGCTTTACCAAGGGGCAAATGCGAACGTTGTTGGGCGTAGCCCAGTGAGTATTTGTCCCGCAAGCGAAGCGCGCATTCTTCGGTAACTTGTTATATGGCATCTTATGCTGTCCGTTTAATCCTTGAAACTATTATGTTTGCTAGTGAGAAAATTGAACACCAAACAACCCCCAATACCCAAGCCATAAAACCGAGCATTACTGCTCCTAGTACGCCAGGTTCGTAACTAGGATCTAGGTACATCACAACCTCATATCGAATAAGTGCGAACGCCCAAAAAGCCAATATGCCAGAGAGAAAACCAAAGAGGCTAAATCTATAAGCTAAAACCGCAGAAGTCAGAGCTCCTAAAATCGGTATTAAATAATCTAA
GGTTGAAGCTACAGTCATGCAATATAACGCCCTGCTTAGGGGCGGAAATACATGGGCTAAAATTAGGAGCGAAGCGACGCAGCCCATGTGTTTGCGTCCCAGCCCGCTTGCGGGCGACTACAGCAGATTGTTATGTACGTGCTTAAACAACATTATACCTTGCCAAAACATACAGAGCAGTGAAGCCAATTATGCAACCCAACAACCCCCAAATTATCATATTTCTTACATAAAGCATTTGAGGCTTATACCAACTCTTCAAAAATATTTCTGGGATATGGTAAATGCAAGCTTTGCTTAAATAGCCTGAGCTTTTAAACGTTTCTTCGTTTATGGCAAATCGAGCGAGAATAAACCCAACGAAAGCACTTAAAAACGAAATTAGTAAAAATATGAATAAAGCTAGAATAATTTTTCCCTGAGTACATAACACCCACATAAGGGGCAGTAACACATGGGCTAAAATCCGAGCGAAGCGACAGAGCCCATGTGTTACTGTCCCAGCGAGCTTGCGAGCGGCTTAATGTGATTGTTATGCGCTATTGACTTACAACGCGCAAGTACCATAATTAGGTATATATTGGTACTTGGAGAAACCAGATGGCTAAAAACACCAGCATTACCCTTGGAGACCATTTTGATGGCTTTATTGCTAGCCAAATTCAAACTGGTCGGTATGGTTCAGCAAGTGAGGTGATTCGTTCAGCTCTTCGCTTGCTAGAAACACAGGAAACCAAATTAAACACTCTGCGTCAATTACTTGTAGCAGGTGAAGAAAGCGGCGAAGCTGAATATGACCTTGAACATCTGATATCAGAACTAGACGGTGAATTAAAAGAGTGAAACCATTTAAGTTAACCGTACTAGCAAAATCAGATCTAAAAGATATTGCATTGTTTACGCAACGGAAATGGGGCCGAGAACAGCGTAATATATATCTTAAACAATTCGACGATTCATTTTGGATGTTGTCTGAAAATCCTGATATTGGTAAAAGCTGCGATGAAATAAGAGATGGATACAAAAAATTTCCTCAAGGTAGCCATGTTATTTTCTACAAACAAACGGGCAGTCAGGAAATTCTGATCATAAGAATCCTTCATAAAAGTATGGATGTAAATCCTGTACGCTTTGGCGCATAACACCCCGCTTAGGGGCAAAAATACGTGGACTATAATATCGAGCGAAGCGAGGACAGCCCACGTGTTTTTGTCCCAGCACGCTCCGCGTGCGTATACAGCGGTTTGTTATATTTCAACGATGTCAAAAACATCCCCTTCTTTGACTAAATTAAGTGAGAAAGTGTCCCAGATAGTATTTTCAATTTCACCATTACTTCCGGGTCTTTTGATCATTTTAAATGTACCGTTGAAAATACATTCAATGAGATGGCGAATTCCTTTTTCGCGAGATGGAAGGCCATATTCAGAAGCTTCAATATTTCCATGTGTATGAAAAGAGCCATTGAGTAAGGATATCGAATCATCACCTTCGTAATTCCAGATTTCAATGACTAATGTGTCAAAGTCTCTGGGTAGTCCTACATGGATAGCGTCACCAACAAGTTGGTATTCTATTTTCCCATCAATTAGCTCATCAACGATGCTGTTAAGTGCTGGATGCATTGAGTTTACCTGAAATATAACGCCCCAAGAAGGGGCAATTACATGTGGGCTAAAATACGGAGCAAAGCGACGGGAGCCCACATGTAATTGTCCCACTTGCTTGGCTTGTTATGTTTCTAATACTCCTCTCCCTCATCCCCTACAAGTCTGGCCCCAAGTAATTTAGCTAACTTTTTGGCTTTCCCTATTTGGGCCTTGTCGCTACCTAAAGTAATGCTGCCATTTGAGAATGTGAAATAATACTTTCTGCGTAAAATTGGACTAGTCCAAACGCAACCAATTGGAGTTTGGATTTCAATGGTTTCACCCGTTTCCGGGTTCATTGCTTTAGCAACGTGCTGCACAGTTAATGAAGAATCGCTTTCGCAAATCGCC
AACCATTCACTTATAGAAATCTGGTTATCACGCTCAATATGTAAACTATAAGAAGCCATAATATTCCAGCTTATTAATAATAAAATCCCAACTAAAATTTTACGCATAGTTTTCGAGAAACATAACGCCTAAATATGGGGCACAAACACGTAGGCTAAACTGCGAAGCAGAGCCTGCGTGTTTGTGTCCCAGCCCGCCCCGGCGGGCGCCATAATTTTTTTGTTAGGTGCGTGCATGGTTGAAAAAATGCTCCGCTTGCTTTGCCAAAACTGGCAACTGACCAAAATCAGTTACGATGCTCGAAGTTACCTGCCATTCCTCAGGATGTCCGTGCAGGTCACTTAGCTGAAAAGAAAGACGAACCTCTCCTAACGATGAGCAAGTAGCAGAAATACTAAATTGGTTTTCTAAAGACTGCCATTCGATTGCCGATTTCCATGGTTTGGAAAATGAAGCGAGTTTATTTAACCAATTAGAAATGCCGGATTCATCAGTATATGTGCACACTCGCACCGAAGCTGACAGAATACCTGACAAATGAACCGTAACGTACTCTGACTCAGGCATAGAGAATCTGAGTGAGTTGTTCGATATCGATGAATCTATAGAAAATTCCATTTATCTCCAGAGCACCTAACACCCCGCTTAGGGGCAAAAATACGTGGGCTATAATACCGAGCAGAGCGAGGACAGCCCACGTGTTTTTGTCCCAGGGAGCTTGCGACCGACTACAGCGGTTTGTTATACGCGTTTTTCAACGAAGTGCATACCCCAACGATTTGATGCTGAAAGCATGAACCCAAAATAAACTATAGAAAATACGCAGCCCAAAAATGCTTCACTACCAACGCCCGAATTCAGATAATAAGTGCACCACTCATGGTTAAACGGTGACGAGAGATCAACTGCCTGTTGGTGGTACTCTGTAATTGCCAAAAAACAAGAGTAGTTACCATGAGTTACAAGCAGTTGATCGAAGGACAACGATACCAGATAGAGGCCTACTTGCGCGAGGGTTTCAGTTATCGAGAGATTGGAAAGCGTCTTAATGTGAGTCACAGCACCATTAGTAGAGAAGTGAAACGTAATCGAATTCGAGATAGCCATTACCTACCAGAAGTCGCTCAGGCCAAGGCATTAAAACGACGTTGTCAGGCCGTAAAGTACAGGATATCTGAACTGACGATTACTTTTGTTGAGTTCGGGCTTAACGAAAAGTGGAGCCCTGAGCAGATAGCTGGTGTAGGTAAAATCATCGGTCATCACGTCAGTCACGAGTGGATTTATGGCTACGTCCAGCGTGATAAATTTCGCGGCGGTAAATTGTATAAACAGTTACGTCAGAGCCGTCGAAGATATCGTAAAGGTAGCCGCGCGAAGCGTGTGATTATCCCAAATCGCGTTGGCATAGAGCATCGTCCCGCTATCGTGAACAAGAAGAAGCGGTTCGGTGATTGGGAAGCTGACACGGTTTTAGGCAAGCAAGGAACGGGCGCGATTGTTAGCCTGGTGGAGCGTAAAAGTAAGCTGTATCTGATACGCAAAGTGCCAGCGAAAAGCGCAGCCGATGTGGCCCGTGCCATGGTTGGCATGTTGTGGAAATATCGCGGTCATGTCCGAACTATCACGGCGGACAACGGCAGCGAATTCTGTGACCACGAACTGGTCGCAGAGAAGCTTAAAACGGACATTTACTTCGCGAATCCATATTCATCCTGGGAGCGCGGACTGAATGAAAACTTTAACGGTTTGCTTCGCCAATACATACGTAAAGGCACCGATTTGAGGACAGTAACGGACAAACAAATTGCTGAAATTGAACGGGCATTAAATGCCAGACCGAGAAAGTGTCTCGGCTTCAGGCAACCTGTCGCGATATTCAATGAATTACGCAAGGCTGCCTAA
Protein sequences of DBSCAN-SWA_2 >NZ_AP021859|2866728:2892555|2872470_2872848_-|WP_155015098.1|DBSCAN-SWA MRQVVLAFNFLSVVYAIISGLLILSLTTLVAVDFINVDAIPKTAALFALLLLANIISVSALKRRIRWSYIVCSAHVFLLCAVSVYSMATRSISASSLFLLGLVAVLGLLLYTDYVSGKDVHNNPL >NZ_AP021859|2866728:2892555|2877174_2877594_-|WP_155015104.1|DBSCAN-SWA MEKSMLGPWVRIPSGIVLFLLNLLCLLGVIVMPLAVNEEKFLFGVLITTVCAILNAWLFIKCFQLIFNLNASNGLISPFGFRVAAVMYMLLPIIGIFTGYYSQYGLLAVFQAGIYYFIGISLLSYAKQKAGALAANKPL >NZ_AP021859|2866728:2892555|2889897_2890275_-|WP_155015113.1|DBSCAN-SWA MHPALNSIVDELIDGKIEYQLVGDAIHVGLPRDFDTLVIEIWNYEGDDSISLLNGSFHTHGNIEASEYGLPSREKGIRHLIECIFNGTFKMIKRPGSNGEIENTIWDTFSLNLVKEGDVFDIVEI >NZ_AP021859|2866728:2892555|2871501_2871894_-|WP_155015097.1|DBSCAN-SWA MEPDFSKYDLEELYDIKESLDKEKFPERYSLLLDTIEKHPSKAAHIQAVESDNAMRELYNEYKEDKSQVWMLFRGVIVMFVTFRIMQYLEWSQNQQIMGAVIVVIAYATILISINEFKFKKFKDDFGKRT >NZ_AP021859|2866728:2892555|2887254_2887557_+|WP_155015112.1|DBSCAN-SWA MKPFKLTVLAKSDLKDIALFTQRKWGREQRNIYLKQFDDSFWMLSENPDIGKSCDEIRDGYKKFPQGSHVIFYKQTGSQEILIIRILHKSMDVNPVRFGA >NZ_AP021859|2866728:2892555|2868143_2868896_-|WP_155015095.1|DBSCAN-SWA MESSIVSSKSAAVSPLDNRFFGVFALAITVVVFGGFALNWTNNPNNIARISLWTGLHGAMSAAWYVLLLNQTRLSAAGKYASHKIIGTLSVILVIGILITGPLMAIEFYHRMAGFGVFGVDDAQARLRAGSFLGGAFLQWTLFAVLYVLGVLSVKQPAHHKRFMIAAAIQMMPEGLNRAVHLLSLPGYSMFVIMFMVYATLMMYDWKTTKRLHGSTLLSFALFCLLALAMHTVFKWQAWADWIVPVVHGQ >NZ_AP021859|2866728:2892555|2881348_2881684_-|WP_155015110.1|DBSCAN-SWA MSKWPLLLSSIALVAYVGYSLVFLPSKMEQNSVIPLNIERLDLLEKEIQNSTEPEALRAVSLMMVNGLKTEQYNEQKNLKFLISAYLKFDQILAGLLLVQLSLIVSHLKGK >NZ_AP021859|2866728:2892555|2890391_2890757_-|WP_155015114.1|DBSCAN-SWA MRKILVGILLLISWNIMASYSLHIERDNQISISEWLAICESDSSLTVQHVAKAMNPETGETIEIQTPIGCVWTSPILRRKYYFTFSNGSITLGSDKAQIGKAKKLAKLLGARLVGDEGEEY >NZ_AP021859|2866728:2892555|2887015_2887258_+|WP_014978218.1|DBSCAN-SWA MAKNTSITLGDHFDGFIASQIQTGRYGSASEVIRSALRLLETQETKLNTLRQLLVAGEESGEAEYDLEHLISELDGELKE >NZ_AP021859|2866728:2892555|2879870_2880290_-|WP_155015108.1|DBSCAN-SWA 
MIYEPKVVKVETDFESDATVDIDEITFDKECLNIAVKSKNWSIKVYFDAIYGFRVLDEGDLGEFWSECNLKTGWCFEVIDGGWNALEKTRDHFVTGKLYQPKEYLIIGLNECVSVLAFEQPKVVETSSSNKRINARCAC >NZ_AP021859|2866728:2892555|2866728_2867760_+|WP_155016623.1|transposase|DBSCAN-SWA MNSISLIDRPERRRLEKIVQRSKDKRLSRRANAVLLVHKGQSRKHVAALLSAARSSVNRWCKWYEESGIEGLKDLEQGKPAYLPGGAIVQILFLLIPWTPQELGYQRSRWSSELLAKVIHENTGIRIHSSTLRRWMPELGIVWRRAAPTLRIRDPHKEEKLAAIKAALDKCSADHPVFYEDEVDIHLNPKIGADWGFRGKQRLVPTPGQNEKYYLAGALNAKTGKVLYVGRQSKCSEIFIRLMEHLRKTYRKAKTITLIVDNYIIHKSKKTQAWLKQNPKFTLLFQPVYSPWVNKIEKLWHALHETITRNHKCTEMWQLLQKVRRFMETASPFPGNKHGLAKV >NZ_AP021859|2866728:2892555|2874678_2875884_+|WP_155016624.1|transposase|DBSCAN-SWA MRDISILHDLLKNQCPNLHQKRLSFLMVAVQSLLDGQQLSLTELGRNISGPVSAKHNIKRIDRLLGNQALYSERLDIYRWHANLLCGANPMPIVLIDWSDVREQMRHQTLRASISFEGRSVILYERVFPFSQYNSPVSHNPFLRELATILPKRCCPLIITDAGYRNTWFREVEKLGWFWLGRIRGEVGFRESGQSKWRSNKTFYPSANDKARYLGSGDLGRKSPIEAYLHLFKARSKGRKDQRSSKAGRHHNAQQNYRDSSKEPWLLATNLPAESMTSKQLVNLYAKRMQIEESFRDIKSPQYGLGLRHSNTRCTKRFDILLLIAMLAEWVLRLIGFIATKHNWARQFQANTIRNRPVLSLIRLGREVRKRSQHYQIKEHDIRWAIRHYIERIHETGMPKL >NZ_AP021859|2866728:2892555|2885548_2885872_-|WP_155015111.1|DBSCAN-SWA MRSFGLILLLFSSQVFSENICPVNTDIPEDVRMDESYFNKESAETALKKLQGIISEQDNMYEWITIPNATKVIHGYILKRDAINSGKQQYLVDAFCSFYKSEGWFYD >NZ_AP021859|2866728:2892555|2889250_2889493_+|WP_014978218.1|DBSCAN-SWA MAKNTSITLGDHFDGFIASQIQTGRYGSASEVIRSALRLLETQETKLNTLRQLLVAGEESGEAEYDLEHLISELDGELKE >NZ_AP021859|2866728:2892555|2876074_2876320_+|WP_155016625.1|DBSCAN-SWA MQDLYDIHAPKKATNLSLNSDLLQKARSLKVNLSATLEQALRDKLKSIEAEKWKKENKAAIAAYNEFVAENGCIGDEYRNF >NZ_AP021859|2866728:2892555|2884376_2885132_+|WP_155014142.1|DBSCAN-SWA MTNTTLMLLRQLKLTGMADALTMQQSQPNNYDNLSFEERLQLLVDAEHLERGQRKQQRLLKAAKLKLHATARDIDYTHPRGLKQAHMASLLQCEWVHKHQNLLLTGPCGCGKTYLACAVAHTACMKGYSVRYYRLSRLMLELSQAKADGTYSKMLQQLARIDILILDDWGLEPLKAAQRNDLMEIMDDRNNHCSTIIISQLPTDQWYQSIGDNTLADAILDRLMHNAHRIKLKGESMRKTRSELTDGEHLA >NZ_AP021859|2866728:2892555|2880424_2880808_-|WP_155523171.1|DBSCAN-SWA 
MEKEISRFIQEFLSFTPQGSLYSVARLLNQEEHEKYKCIMIRWSSWHFWYFNAPKSKTIKKIVKDFRKKIGMYGIEGANSLVTGVIARESGGCVFVTTYSIGNKKMEPLCFVMENQDKDWALMQHDG >NZ_AP021859|2866728:2892555|2889489_2889792_+|WP_155015112.1|DBSCAN-SWA MKPFKLTVLAKSDLKDIALFTQRKWGREQRNIYLKQFDDSFWMLSENPDIGKSCDEIRDGYKKFPQGSHVIFYKQTGSQEILIIRILHKSMDVNPVRFGA >NZ_AP021859|2866728:2892555|2890872_2891268_-|WP_155014787.1|DBSCAN-SWA MEFSIDSSISNNSLRFSMPESEYVTVHLSGILSASVRVCTYTDESGISNWLNKLASFSKPWKSAIEWQSLENQFSISATCSSLGEVRLSFQLSDLHGHPEEWQVTSSIVTDFGQLPVLAKQAEHFFNHART >NZ_AP021859|2866728:2892555|2869641_2870100_-|WP_155015096.1|DBSCAN-SWA MKVHVSLPNHWMIGGESMWAEPLGNNIYRLENVPFFSYSLNFKDEVEAKPDEDGILEIEKVVKRSGNSTLRIIFDKKVTREQQEKYIKIIRELDCSLERWDNTYLAINVRASACYTSVLNRLDQWNEQDIIALETCEEQMPGSFDAAVDEDV >NZ_AP021859|2866728:2892555|2876751_2877063_-|WP_155015103.1|DBSCAN-SWA MISEADWKHFKQVKADALDKRCQQVLDDVRKGIDDPELSNHAKYLYLYKLMENSDKRIANIFNYNARSKAMLQLALMKSDGLLEAKQISGFSDELQRFINSRD >NZ_AP021859|2866728:2892555|2869163_2869415_-|WP_073324816.1|DBSCAN-SWA MDVEILYVVMAILFILNASVSVFLMKRDDLDTFQKGAQMLLVWLVPFLAAIGVWLLNRSQDIQVTQDKTFGGGVSDSIGPGAE >NZ_AP021859|2866728:2892555|2878582_2879533_-|WP_155015107.1|transposase|DBSCAN-SWA MSYKQLIEGQRYQIEAYLREGFSYREIGKRLRVSHSTISREVNRNRIRDSHYLPEVAQAKALKRRCQAVKYRISELTITFVEFGLNEKWSPEQIAGVGKIIGHHVSHEWIYGYVQRDKFRGGKLYKQLRQSRRRYRKGSRAKRVIIPNRVGIEHRPAIVNKKKRFGDWEADTVLGKQGTGAIVSLVERKSKLYLIRKVPAKSAADVARAMVGMLWKYRSHVRTITADNGSEFCDHELVAEKLKTDIYFANPYSSWERGLNENFNGLLRQYIRKGTDLRTVTDKQIAEIERALNARPRKCLGFRQPVAIFNELRKAA >NZ_AP021859|2866728:2892555|2874031_2874547_-|WP_155015101.1|DBSCAN-SWA MRRFSVLDKNAQMELREKFPDLDKLVAVEDTSEYTYMAVSVFDRWLDHDDALKFLSDVSEEEQRERDSKFIKFAENLVSNTEILNFTFKGRWHKALPLFRTFTSHTAKVNYLMCTPHNVDSSQFYKVVLPELEAVYFESWDDTNVFYLRNPEKATQIESWANESGLFCLTR >NZ_AP021859|2866728:2892555|2870221_2870923_-|WP_073324818.1|protease|DBSCAN-SWA 
MQTMEIAFLSTILLFPLFDLVLEKYKCRNKCTEYVKTSFMLWAVTGFLFFCFLNGVLKIEPPQYLPSTTWKGYLALFMFAVFIAYMKYVLYSINKDKSVRLQVLNAFEDGGESFVEILPSSRKEFLFFTLLVSFSAGVCEELIFRWYLFSFIEQHTGGLVAVIGSSIIFGFWHIYLGWKHVIKTAVVGVLLCGIYLYFESIVVAILAHIFMDVYSGSVAFIARKAQGAELTSA >NZ_AP021859|2866728:2892555|2888011_2888263_-|WP_073325466.1|DBSCAN-SWA MEEVIWRIVTFIVFGTWLVFVPRHMEFILVKYQSFLYKYIPLAQLVFKTEKEAAIPIFNERAIRAIGFAHYLGAVVVATKHQW >NZ_AP021859|2866728:2892555|2879593_2879764_-|WP_155523170.1|DBSCAN-SWA MHKFVLIVLFLLPFSAMALDVTDVEIAGKWHIEAIVMGTLGEPQIQIISAPLLIKW >NZ_AP021859|2866728:2892555|2882814_2884362_+|WP_155014623.1|transposase|DBSCAN-SWA MPTKRLSMRQLREILRLKLQADLSIRQIHRSLRVSVGAVSKVLSKANEMNLSWPDVNQLDDVQLASRFYPEADTRQSGQFEMPDWRDVHQELTHKGVTKHLLWEEYTEQYPNRSYSYPQYCHHYQVWQTLQRRSMRQVHKAGEKLFVDYAGQTVPIISASTGEVRQAQVFVAVMGASNQTFAEGTWTQSLPDWLGSHTRAFSFFGGVPQLVIPDNLKSGVSRACRYDPDVTPAYQQLAAHYGCAIVPARPYKPKDKAKAEVGVQVIERWILARLRHYTFFSLAELNTCIAALLKDVNNRPFKQLNGSRQSWFDSIDKPALAPLPKLAYQYTDIKTVKVNIDYHIQYDAHLYSVPHHLVGERIDVHASNTLITLYFHNKVVASHPRQYRHGMSTVPAHMPARHQKHQSWTPGRLMNWAKDVGDEVLAWVKHQLASKSHQEQAYRVCLGLLNLSRQYPPQRLNKACAIANQQHLYRLKQVKAILTSNQDKLYQHNQDESQNHLPQTHENIRGPQSFH >NZ_AP021859|2866728:2892555|2876319_2876637_+|WP_155015102.1|DBSCAN-SWA MAQFDVYRNPSKKTSKAYPFLVDVQNSVIDQLATRLVVPLTTSNTKNSFYMKKLTPEIEFEGTTYLFLAQQLSSIPEDVLKDRIGSLEQSRELLIDAIDFAITGI >NZ_AP021859|2866728:2892555|2877707_2878163_-|WP_155015105.1|DBSCAN-SWA MARRSEFKGIVRNFAHMLNNRNNDHLGYWAIGQLCLIAKQENISSISINLLEVEHSGVKNCLDPFAKSMCELLIKMLNSHKIPSSWLKSANVTFSFNAQYQKQYHYWRSALGSPYLVTFEITSDLGKVYKQTFGGNVNPHDPRREQRRAGF >NZ_AP021859|2866728:2892555|2873468_2873918_-|WP_155015100.1|DBSCAN-SWA MKFLVWLALFYSIQVHAVTLSKSENLELALILSTSEVVFKSNTFPNVLVIETWEEISECGGLFESCPNARLFVIVSNGDLYEVPALYELPKSKGWKFIDFVEKDNFYLLSVSTTLEHANVSLESRKAWKSKVYTVKISKYDDGAVSLVE >NZ_AP021859|2866728:2892555|2872953_2873241_-|WP_155015099.1|DBSCAN-SWA MASKAKSFRVWRKYNVQIVVGSDSLWLLSSFPSFGLVLDLDELKLSKKKSFIGCQVRIISHDVYTEHLFNEIIVSVRVSKKLEKFSHGKLLCGSI >NZ_AP021859|2866728:2892555|2872027_2872363_-|WP_073324821.1|DBSCAN-SWA 
MEILVAVISLLVFGVFLAVAIVLSRIPVLGDVLDFFAELASGPRFVYLAWPFLLSSAYRKELILEHKNRIKWLAWVEYILATIVFLAICAIGLVVIHHNVNRLLLQGAFLE >NZ_AP021859|2866728:2892555|2878285_2878558_-|WP_155015106.1|DBSCAN-SWA MNSRLPAGENDFFNFEKGIFTSAAQGESSQAPYELEENRIVVTHPNKVEIFTVTKLTEEQMAFFVNVFVNGKKMSAGNISFKLRREKSGN >NZ_AP021859|2866728:2892555|2891604_2892555_+|WP_155015024.1|transposase|DBSCAN-SWA MSYKQLIEGQRYQIEAYLREGFSYREIGKRLNVSHSTISREVKRNRIRDSHYLPEVAQAKALKRRCQAVKYRISELTITFVEFGLNEKWSPEQIAGVGKIIGHHVSHEWIYGYVQRDKFRGGKLYKQLRQSRRRYRKGSRAKRVIIPNRVGIEHRPAIVNKKKRFGDWEADTVLGKQGTGAIVSLVERKSKLYLIRKVPAKSAADVARAMVGMLWKYRGHVRTITADNGSEFCDHELVAEKLKTDIYFANPYSSWERGLNENFNGLLRQYIRKGTDLRTVTDKQIAEIERALNARPRKCLGFRQPVAIFNELRKAA |
35 | Acidithiobacillus_phage(40.0%) | protease,transposase | NA |
Homologous phage analysis in the prophage region: a bacterial protein is colored when its annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
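The keyword-based highlighting described above can be approximated with a short script. This is a minimal sketch, not the database's own implementation. It assumes the header layout seen in the protein list (`contig|region|start_end_strand|protein_id|[annotation]|DBSCAN-SWA`) and uses plain case-insensitive substring matching. The function name `matched_keywords` is hypothetical.

```python
# Keywords quoted from the page's description of colored proteins.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(header):
    """Return the phage-related keywords found in a FASTA header's annotation.

    Assumed layout (hypothetical, inferred from the listing):
    >contig|region|start_end_strand|protein_id|[annotation]|tool
    The annotation field is optional; headers without it yield an empty list.
    """
    fields = header.lstrip(">").strip().split("|")
    # Everything between the protein ID (index 3) and the trailing tool name
    # is treated as free-text annotation.
    annotation = " ".join(fields[4:-1]).lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in annotation]


if __name__ == "__main__":
    headers = [
        ">NZ_AP021859|2866728:2892555|2866728_2867760_+|WP_155016623.1|transposase|DBSCAN-SWA",
        ">NZ_AP021859|2866728:2892555|2870221_2870923_-|WP_073324818.1|protease|DBSCAN-SWA",
        ">NZ_AP021859|2866728:2892555|2872470_2872848_-|WP_155015098.1|DBSCAN-SWA",
    ]
    for h in headers:
        print(h.split("|")[3], matched_keywords(h))
```

Substring matching is deliberately crude: a short keyword such as 'head' or 'tail' can hit unrelated annotations, so a word-boundary match may be preferable for real use.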
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|