Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_020562 | Sphingomonas sp. MM-1 plasmid pISP1, complete sequence | 1 crisprs | csa3 | 0 | 1 | 0 | 0 |
NC_020544 | Sphingomonas sp. MM-1 plasmid pISP3, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_020542 | Sphingomonas sp. MM-1 plasmid pISP0, complete sequence | 0 crisprs | WYL | 0 | 0 | 26 | 0 |
NC_020563 | Sphingomonas sp. MM-1 plasmid pISP4, complete sequence | 1 crisprs | NA | 0 | 1 | 1 | 0 |
NC_020543 | Sphingomonas sp. MM-1 plasmid pISP2, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_020561 | Sphingomonas sp. MM-1, complete sequence | 6 crisprs | csa3,cas9,cas1,cas2,DinG,DEDDh,RT,WYL | 0 | 12 | 6 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020562_1 | 159505-159594 | Orphan |
NA
Consensus repeat of NC_020562_1
|
1 spacers
spacers of NC_020562_1
>1.1|159529|42|NC_020562|CRISPRCasFinder ACCGACCCCAAAACTCACCGAAGGCTACCCCAACACTCACGC |
CRISPR arrays and Neighbor proteins around NC_020562_1
The CRISPR arrays of NC_020562_1 >merge|NC_020562|1|159505-159594|CRISPRCasFinder GACTCGACCCCAAAACTCACTTTTACCGACCCCAAAACTCACCGAAGGCTACCCCAACACTCACGCGACTCGACCCCAAAACTCACTTTT >NC_020562|1|1|159505-159594|CRISPRCasFinder GACTCGACCCCAAAACTCACTTTT ACCGACCCCAAAACTCACCGAAGGCTACCCCAACACTCACGC GACTCGACCCCAAAACTCACTTTT
>NC_020562.1|WP_015460618.1|158476_159388_-|replication-initiation-protein MTRASSPVNNGKAKIALGDDTALTLSQKGRGNPFDPANYGEIVKPGELVDIVELSPLTLADRRIYNLLIANAWERISEPVIHRIAKTALKGTHQGNERIESSLLRLMGTIAIVTIRKGGKSYKRRVQLLGSSDESLEKDGFLHYRIPEELIEILRNSEVYARLKTQVMYCFESKYALCLYEMIERRIGLEYKQSEEFTIAELRGLLNVPEGKLERFADFNKYCLKVAQEEINKLCPFWVEFTPIKKGRKVERVSMMWLPKTMSGRRDAQNLIDQHSIVRRAKLRGDIPEMPVLVDFSAPAAQR >NC_020562.1|WP_015460617.1|158138_158420_+|hypothetical-protein MAIARKPNSKPKSPMDEAAADAFIAGAAKPKAEPIATEADEAGQGAEPRKSPVMLRFDRALLAKVDAAAKRRGISRSAWIQFTVSRALDAGEG >NC_020562.1|WP_015460616.1|157477_158113_+|AAA-family-ATPase MILAVGNTKGGVGKTTLAVNLAVARALAGRDLLLVDGDEQGTALTFTELRADRLGQAGYTAVALTGAALRSQVRQLAAKYDDIIIDVGGRDTGSLRAALTVADTLLVPVQPRSFDVWALDQVAALVAEAREINEGLRAVAVLNGADAQGADNEAALEMIGDIEGIEVLPTSIVRRKAFPNAAAEGRAVGEQSPRDAKAIDELAALVSAVFV >NC_020562.1|WP_007685994.1|155815_156508_+|hypothetical-protein MQVLDTVGWVGDGDDTDFFLAIERTFDLRLRSNLPWTTFGEVRDHVVAHVAAYSGGGTTCATQMTFYRLRRALGLGRHVGPDAPLAPLIGGKLRQAFSDLEADTDLKMPATRAGWLGIVSGLCFAVAVAILAFTTLAPPLRIFAAGASAYAGLWLRHLDRRRLPRRCDTIGDLARLVTEQNRGRLARDGARLTAPEIWRIIQQLAAEESGIDPDLIGSETTFFRAKVRAA >NC_020562.1|WP_007685993.1|155053_155650_-|recombinase-family-protein MRVGYARVSTSDQNPELQLDALRRAGCERVFTEKASGARDDRPELARILEDVLRAGDTLVVWKLDRLARSLKKLIATAEDLEREKIGLVSLTESIDTTTPGGMLTFHVFGAIAQFERALIRERTTAGLVEARRQGRKGGRPSAMRPSDVAAARAMMKEGTLPVRDIAKRMGVSVATLYRYAGKRGSGASIKEAATAHG >NC_020562.1|WP_001389365.1|152685_153450_-|IS6-like-element-IS6100-family-transposase MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >NC_020562.1|WP_007687861.1|150753_151512_-|esterase MVALSISRQAEYPPTGKPAKTGSRSSRADAVGPDAASASHRQELEANMSRDNAIVMRYDNPDIPSGRDIVYLHGRGSTEREAGFALPLFGRANVRSYRGPLPQGPGFAWFENAGIGVALPSSLSGETSKVGDWIAADTGRQRPWLCGFSNGAAMAASLLLSNPGAYSGLIMIGGCFAVEDGDLPDNGLLDKPVLFCRGQFDDVIPRHKFEQAEAYLSGPSGARATFIPYEGGHELPLPIKAAVQGWLGAESR >NC_020562.1|WP_001389365.1|149286_150051_-|IS6-like-element-IS6100-family-transposase MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >NC_020562.1|WP_007682395.1|148826_149189_+|hypothetical-protein MPAPALDEEPVLPPGFEDLLEFVPHWIGETAQERWDIRARATMAEITRFYDVLLSRSEAILDHVETFPLDAMPAPTLRLFRLQLALAHAAMSVELHKQPRAHNSPYPHQVRILRTAEPTL >NC_020562.1|WP_007682394.1|148140_148827_+|EthD-domain-containing-protein MHSIKILATIPRRKDISEQQFHDHWRHPHGTLSKKIACLRGYVQSHRIVSPLLPDTQLAYDGITELWYDSLDDALNMGKDPAHRKYNIPDEPLFVDMDGLKFTFFEEDIIRSRPAVDDPDDAAVQWSPTEWSVSVKILQLVKADGNPAWAGDQDKALGDRIGAFRHVRSFAIDAVHKGTSPFIGARELWWPTLSDFERGVAGDRAAFDALLAQAGQHYTMLASAERVI >NC_020562.1|WP_015460621.1|161333_161558_-|conjugal-transfer-protein-TraD MARRERTRHLIELGGLVQKAGLVELADDDRATLYGALLDCTARVQGDDAGNVLALWKRRGKRAFDAEAEGAGNG >NC_020562.1|WP_015460622.1|161595_161901_-|conjugal-transfer-protein-TraD MRKVRDYDAELRALNDKAKALKARKVQQLGELVTSTGADALDLDTLAGALLAAVEAADANEKEAWRSRGAAFFQGRGRKAGRRTGGNGEGARQTGAGKEQA >NC_020562.1|WP_015460623.1|162073_165208_+|Ti-type-conjugative-transfer-relaxase-TraA MAIYHFSAKVISRANGSSAVASAAYRAAERLHDDRLGRDHDFSNKAGVVHSEILAPEGAPERLNDRATLWNEVEAGEKRKDAQLAREVEFSIPRELNQQQGIQLARDFVEKQFVERGMVADMNVHWDMGKDGQPKPHAHVMLSMREVGPEGFGQKVREWNSTALLQEWRVAWADHVNERLAELDIDARIDHRTLEAQGIDLEPQHKIGPAASRMPEQGLEAERVEDHARIARENGEKIIARPEIALDAIARQQATFTRRDLAQFAFRHSDGKDQFDQVMSAVRSSPELVALGRDGKGEDRFTSRDMIAAEQRLERAAEGLAIDRGHGVADAHVTRALASAEGRGLDLSAEQRGALAHITGDKGLASVVGYAGSGKSAMLGVAREAWEAQGYQVRGAALSGIAAENLEGGSAIASRTIASMEYQWEQGRELLGPRDVLVIDEAGMIGTRQMECVLSHAEQAGAKVVLVGDPEQLQAIEAGAAFRAVTERHGWAEITEIRRQCEDWQRDATKALATGRAGEAIHAYEAHGMVQAAETRELARADLVDRWDAERIAAPDQSRIILTHTNAEVRDLNLAARDRLRDAGELGPDVRVSAERGARDFATGDRIMFLKNERGLGVRNGTLGKVEQVSPERMAVKLDDGRSVAFDLKDYAHVDHGYAATIHKSQGVTVDRAHVLATPGMDRHSAYVALSRHRDGVQLHYGRDDFGDDRRLVRTLSRERAKDMASDYGRDRDAEIRAFADRRGLSGEIRLPERAERSPVEILGPRAGTMRQMGEDPRTVRDAGDRGAGAGQAAAERQPRRGMFDGFRPAPQRPAPESTPAGEREKAAPKRGMFDGLKLSAAPLKGAERAPVPADRGQGRDYARAVERASRSAEAVLQARASGAPVLEHQKVALERTTQALDQIRPGASRDLASAMQRDPALLREAAAGRSGPMIEAMAQEARVRADPNLRADRFVERWQGLKQERDRLYRAGDMAGRERTGKEMAGMAKSLERDPQVELVLRNRTRELGLEIGMGRGRGMNSGDLGRELARDLGIGMGRGMSR >NC_020562.1|WP_015460630.1|165215_165938_+|hypothetical-protein MMDEDNYRNNGRAGDDPQAAFEQLRGEVALVRLAVEGLARARESIEIPDYQPTLANTEKILLALTQRVDVIAKSPAMKLTPETMGERVNASVASATGELHNLVNSTRSDMSEAARELRGLIGTTRARWQQDRWLFWIGLGGVVLGILLYALLAGLIARAMPDSWQLPERMATRALAEPTLWDAGTHLMQRASPASWEGIVAAANLARDNRETIEACGAAAAKAKKTVRCTIEVKPANNDR >NC_020562.1|WP_007683476.1|167085_168447_-|hypothetical-protein MKRGHDLTGLMKFATRPEWADDLHDALDDHLGPVLTQFDIDSDELPGIIGDHWAMTLWGCAFEDLVTRVFEPDGRNIVDEYLKRRGWNEAGPNKLYMRALKTSVMSVYEVSAIEPGVGFLARDLIRGGDPVQVRERTASRTLGPWDRIGVRIIPVSGHRILAGGLLSFTAEATSALLEALRLGQGKRGPRAKLVIDDDQLRDLAPLISMVWLFDILPRMLEPVAIPTLHNADGEEVVFHRVRFPFTRGTTQALIGDRLDTVPALQRETSHFWNWLGTRTKQGKKGTGQMAWGVSMEDGTPVLGNLELKGRALILSVTSAERAERGVALVTQALGALVGTPLTEIETIEQAMAARQEGRTVSEPAPDIPVEVATPLVHGMLDRQYRTLLDEPVPMLGDKTPRQCAGSKAGRDQLATWLKHLENLSGRHADIDDPMATYDFGWIWQELGIEELRR >NC_020562.1|WP_007683474.1|168443_169037_-|recombinase-family-protein MTRAPYLIGYARVSKGDEQSNAAQRRALDAAGCRRVFEEIASGGRWDRPKLLEMIGQLRDSDVVVVWKLDRLSRSLKDLLHIMERIEAAGAGFRSLTEAIDTTTAAGRMMMQMVGSFAEFERAMIRERTSAGLAQARAEGRIGGRRRKLGEKQRREIAESVISGRKSGAEMARLYHVSEPTVSRIVAAHRQTMELPA >NC_020562.1|WP_007683471.1|169183_172114_+|Tn3-family-transposase MTTRQRAALLMLPDDEAAIVKHYSLSGEDMTAIDTARTPATRLGYALQLCCLRYPGRHLRHGELLPAVMLDHIAEQVGVDAKVIADFARRTPTRYDQLAAIKTRFGFSDLSRPHRVELRTWLTNEAASIIDGRALLGRLLDEMRARRIVIPGVSVVERMAAEAIHQAETDLVAAIDGGLGHEMRQQLDALIDDKVHDRQSRLSWLREPEPRVASASLLEIVEKITLIRGTGISAFSPDVRHEPRLGQFAREGVRYTAQAFQQMRPARRRVVLLATLRELEATLTDAAIDMFIALVGRAHLRARKRLEQRVAVSGREGRERMLRIARVLEAISQAARAGGDVAAAVDAVASLDIIDADAAIIRRTASPHRNEVLDEIAAEYRAFKRMGPSFVRAFDFQGRAGMQPLRDAMAILADLDGDWRRALPDDVPLGHVEHRWRRHVMTAGGIDRTHWEMATYSALSNALASGGIWVPTARVHRALSVLLAPPASPVPKPAFSLGDPHAWLDERAARLDSALREVARDLDKRDPPLFAGERLRFPKDPKEDPGQDEGRQLALTCYGMVPATRITDVLSQVQRWTGFIQHFGHVSTGLPPADERAFLATLIAEATNLGLSRMAEVCGVASRRALLRMQTWHMREETFRAALASLTDAIHAEPLAAWFGSGHRASADGQAYYLGGAGEAGGTVNAHYGRDPVVKIYTTITDRYAPLHQTVIAGTAGEAIHALDGILGHESSADITALHTDGGGVSDIVFAVMHLLGLDFEPRIPRLSDRQLYGFEPARRYGRLAPLFGRRLGRDLIVSHWAEIAEVIAAMRDRTVTPSLILKKLSAYRQQNSLAAALREVGRIERTLFTLRWFDDTDLRRTVTAELNKGEARNSLARAVAFHRLGRFRDRGLENQQTRAAALNLVTAAIILFNCRYLGRAVDELRHRGTPVDPAMLSRLSPLGWDRINLTGDYIWSESLDLDADGLMPLLIKPLP |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NZ_CP005192 | Sphingobium sp. MI1205 plasmid pMI3, complete sequence | 31147-31188 | 0 | 1.0 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NZ_CP005087 | Sphingobium sp. TKS plasmid pTK3, complete sequence | 20315-20356 | 0 | 1.0 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NC_020562 | Sphingomonas sp. MM-1 plasmid pISP1, complete sequence | 159529-159570 | 0 | 1.0 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NZ_CP005193 | Sphingobium sp. MI1205 plasmid pMI4, complete sequence | 18238-18279 | 0 | 1.0 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NC_020563 | Sphingomonas sp. MM-1 plasmid pISP4, complete sequence | 33102-33143 | 0 | 1.0 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NZ_CP005088 | Sphingobium sp. TKS plasmid pTK4, complete sequence | 56281-56322 | 0 | 1.0 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NZ_AP017658 | Sphingobium cloacae strain JCM 10874 plasmid pSCLO_4, complete sequence | 34773-34814 | 1 | 0.976 |
NC_020562_1 | 1.1|159529|42|NC_020562|CRISPRCasFinder | 159529-159570 | 42 | NZ_CP047220 | Sphingobium yanoikuyae strain YC-JY1 plasmid unnamed3, complete sequence | 56938-56979 | 2 | 0.952 |
1. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NZ_CP005192 (Sphingobium sp. MI1205 plasmid pMI3, complete sequence) position: , mismatch: 0, identity: 1.0
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaacactcacgc Protospacer ******************************************
2. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NZ_CP005087 (Sphingobium sp. TKS plasmid pTK3, complete sequence) position: , mismatch: 0, identity: 1.0
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaacactcacgc Protospacer ******************************************
3. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NC_020562 (Sphingomonas sp. MM-1 plasmid pISP1, complete sequence) position: , mismatch: 0, identity: 1.0
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaacactcacgc Protospacer ******************************************
4. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NZ_CP005193 (Sphingobium sp. MI1205 plasmid pMI4, complete sequence) position: , mismatch: 0, identity: 1.0
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaacactcacgc Protospacer ******************************************
5. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NC_020563 (Sphingomonas sp. MM-1 plasmid pISP4, complete sequence) position: , mismatch: 0, identity: 1.0
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaacactcacgc Protospacer ******************************************
6. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NZ_CP005088 (Sphingobium sp. TKS plasmid pTK4, complete sequence) position: , mismatch: 0, identity: 1.0
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaacactcacgc Protospacer ******************************************
7. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NZ_AP017658 (Sphingobium cloacae strain JCM 10874 plasmid pSCLO_4, complete sequence) position: , mismatch: 1, identity: 0.976
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccccaaaactcacgc Protospacer ********************************* ********
8. spacer 1.1|159529|42|NC_020562|CRISPRCasFinder matches to NZ_CP047220 (Sphingobium yanoikuyae strain YC-JY1 plasmid unnamed3, complete sequence) position: , mismatch: 2, identity: 0.952
accgaccccaaaactcaccgaaggctaccccaacactcacgc CRISPR spacer accgaccccaaaactcaccgaaggctaccctaaaactcacgc Protospacer ******************************.** ********
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020561_1 | 157341-157431 | Orphan |
NA
Consensus repeat of NC_020561_1
|
1 spacers
spacers of NC_020561_1
>1.1|157367|39|NC_020561|CRISPRCasFinder TCCCCCAACATAACAGGAGCGCGCTGCGCGCGCCCGGCT |
CRISPR arrays and Neighbor proteins around NC_020561_1
The CRISPR arrays of NC_020561_1 >merge|NC_020561|1|157341-157431|CRISPRCasFinder GGGGAGGGAGGCTGGCGAAATGGCGCTCCCCCAACATAACAGGAGCGCGCTGCGCGCGCCCGGCTGGGGAGGGAGGCTAGCGAAATGGCGC >NC_020561|1|1|157341-157431|CRISPRCasFinder GGGGAGGGAGGCTGGCGAAATGGCGC TCCCCCAACATAACAGGAGCGCGCTGCGCGCGCCCGGCT GGGGAGGGAGGCTAGCGAAATGGCGC
>NC_020561.1|WP_015456897.1|156273_157122_+|SDR-family-oxidoreductase MTEVYGRSDEELATIPIALAPGLFAGKVVVVSGAGSGIGRAVAHWFARLGAKLVLCGRKAEKLEATAAGLSRYAAETLVHPLSIRDPEAVAAMFDAAWAHFGRVDILVNNAGGQFPQAAIDFSPKGWAAVIDTNLNGTWYMMQAAARKWRDAGLPGSIVNVATVIWRGMPGVAHTCAARAGVIYGSKTVAIEWAPLNIRVNCVSPGIIATEGMAVYSDEARAEMPNTNLMRRFGQVEDIANAVCYLAGDAGGFITGEVLTIDGGNQLWGDQWTIPKPDFFRV >NC_020561.1|WP_051128686.1|155062_156274_+|acyl-CoA-dehydrogenase-family-protein MKEGRRELDLGTNIFPGGFALTSEQQEILDTASAFARDRFAPLQQRMDDEEWWPPEAMPELGRMGFLGVTAPARFGGADSDFFTSGLIAQGLARWNHSIALSYVAHENLCLNNIARNASEEVKARYLPGLCDGSAIGALGLTEPGAGSDALGSMATTARREGGKYLLNGRKLYITNGPVADVILVYARTDKEAGTKGISAFIVEKGFKGFKVAQKLDKMGFRGSTTAELVFDDCEVPAENLVGVENRGVGIVMSGLDLERAVVAMLNVGMAERALDLAIDYARTRTQFGRPIGEFQLVQGKLAEMYVGVETMKALCYRTLAECNAIGEDGGGRGEIHKLTAAAILHAAETCTRVISDSVQIHGGVGYMREAEINRLYRASKLLEIGAGTSEIRKLIIAGELLR >NC_020561.1|WP_015456895.1|153476_154952_-|ATP-grasp-domain-containing-protein MTAFPFESVLIANRGEIAARLARTVKALGLRALLVAHRVDEGSPALALADDVRWIEGPTPVAAFLDIPQIIAAARDMGAGAIHPGYGFLSENAGFARAVAAAGMIFVGPEPDTIELMGDKVRARAFVERHGFPVAPSAIEDDDPATFVERARALGAPILIKPSAGGGGKGMRIVRDMAVLEQEIARGRSEGERYFGDGRLFVERYIERPRHIEVQVLGDAHGNVVHLFERECSLQRRFQKIVEEAPSPALTPQERERICETAAGIARAAGYRNAGTVEFIYGQGEFYFLEMNTRLQVEHPVTEAITGIDLVEQQLRIAAGQPLAFDQTAVTRSGHAIELRICAEDSARDFAPTTGPVLRLAAPAGARFDGGVSEGGRISAAFDPMIGKLIVHGEDRAEAIARADRALAGLVLLGLKTNIGYLRRLMGDPAVIAGDIHTGLIGERTELAAEPVADEATLARLVAIAARHVPELVREAAEIPAMHAAIGGWRN >NC_020561.1|WP_015456894.1|153054_153477_-|acetyl-CoA-carboxylase-biotin-carboxyl-carrier-protein-subunit MPGFFLIDGVAHPAALAPADLKAPPPEEAIVARDGDHIWVHVDGAAHELVWQDPITHFEEESASGGDDVARAPMPGSVIQVAVTDGDSVAEGEIMMVIESMKLETAIKAPRDGVVMTVHRAIGQTFERDAALITLEAIAL >NC_020561.1|WP_041865032.1|151447_153055_-|methylcrotonoyl-CoA-carboxylase MRRIHSRIDTSGTTYQANRAHNLRMVAELREKQEAVRNVRPQRDRDRLDRQGKMFLRDRLEALLDPGTPFLELSTLAANMAYDGDVPGAGQLSGIGVVSGREVVIHADDASVKGGAWYPLSVKKIVRTLDIAIENRLPVVHLCDSAGGFLPLQAEFFADRYHAGRIFRNQSILSKMGVPQVAVVMGHCTAGGAYIPALSDYNVIVRGTGAIFLGGPPLVKAATGEEVTVEELGGADMHTSVSGTADYPASSERHAIAIAREIVGRFTRAEKAQVDWAEPEPPYYDPQELYGILPQDSRTTFDMREVIARIVDGSRFHEYQPRYGETLVCGFARIWGYQVGILANNGVLFNDSSLKGAHFIQLCDKNRTPLIFLQNITGFMVGREYERRGISKDGAKMIMAVSGASVPKFTVNCNGAFGAGVYGMSGRAFDSRFLFSWPQGQTSVMGAEQAANVLTDIKLRQLARNGDTLTAEQIDAIRDPVIEGYKREQSAYYATSEIWDDGLLDPVDTRNALGIAISAALNAPIEDPHYGVFRL >NC_020561.1|WP_015456892.1|150459_151431_-|alpha/beta-hydrolase MTVETHSASLHLVDPELRAALDAFPTFDLNEDLLPVMRAQGFGVDVPPPQGPAAGVAVERITVPGRDGEPDVSCLLYTPPGRTGQSGAYLHIHGGGYVLGDAAMSELSNRSLAAAIGCILLSVDYRLAPETRWPGAVEDCYAALGWLHANANRLGVDHQRIAIGGESAGGGHAASLALVARDRGEYRIRHQHLIYPMIDDRTGSTVPALPYAGDFVWTAASNAFGWSALLGHPAGTGEPPRNAVPARVEDLSGLPPTFLGTAALDLFVGENLDYGRRLIAAGVPTELVVAPGAYHGFNGFAPDAAVSRGFNSASLEALRRAIG >NC_020561.1|WP_015456891.1|149339_150341_-|NADP-dependent-oxidoreductase MGDLMQAMVLDEFGGPEVLHIATIERPRAAPGNVVVEVAYAGVNPADWKAREGWLSRYFQYQFPFVVGFDAAGIVAEVGEGVTGLKVGDRVVTASNQGIGERGSYAQFVASIEERCVKLPDHVALVDAAAMPTAAITAWEAVFDVGGTEAGSIVLVNGGAGGTGSYAIQLARMAGARVAATCGPANMDYVRGLGAELAIDYRQGDVADAVRAWAPEGVDLVVDTVGQGSLLEAVEFTRKGGVIAPIATLIADEPTIDPARAEARGVRVVPTISSHANQPRQLAALVAALAEGSIHAPEITLMPLDQAGEAHRKIQAGHVRGKIVLVVNEALGR >NC_020561.1|WP_015456890.1|148839_149304_-|nuclear-transport-factor-2-family-protein MSLQYLIDKDAIEQVYVRYCEIVDAKTFDDMHEVFTEDATGDYTQALGPGVISPDRASLIASMHANLGPDSNCGATHHNVGNFRVRVDGDHAHAKVHYYAEHLGQGDYAGEQYSMWGQYEDDLVRTVDGWRVKARVYTCAISRGPAAVTSARVG >NC_020561.1|WP_015456889.1|146905_148723_-|DUF885-family-protein MDRRSFLVSSGALVLGAALPAPLFAKTDADGALNALLDSFFYESLEDSPEAATSQGLDKGQRAALKSKLSDYSTSGRAKRLVRAKDQAARLARVDRAALSSLGHVNYDVTEYMLAQDIKGLGKYPFGSVDGIWSPYAISQLGGAYQGVPDFLDSQHGIENKADADAYLARLDAFATVLDQDSERQRAEAAYGAVAPDFSLDLTIAQLEALRGKPAAETVLVQSIARRTKEKGIAGDWAAQAAKIVSGKIFPALDRQIALVKQLRATASSDAGVWRLPEGAAFYADALANSTTTTLSPEEIHQIGLEQVAELTARIDTILKAEGMTQGTVGERLTALNADPKQLYPNTDEGRAALLASLNADIHKMTALLPRAFSTLPKAPIEVRRVPVFIQDGAPNGYYNPAALDGSRDAIYYINLKDTHDWPKYGLPALTFHEAVPGHHLQGSLAQETQGIPILRRQTFFSAYGEGWALYAEGVAEELGAYGDDRLGIAGSLQSLLFRAVRLVIDTGIHAKRWTREQATDYMVANTGFPRPRSLREVERYCVWPGQACSYKVGHNKWVELRKRAEAELGDRFDLAWFHDVLLDGAMPLTILEARVNERIAARKA >NC_020561.1|WP_187294044.1|144733_146833_-|PBP1A-family-penicillin-binding-protein MRDDDPYDLEWTEPEEDRRAPASPRTTDRKPPQRGAAAPRAFWKRWRFWKRVAQAGALIFVLLVGWLAITAPLSRSLKPIAPPSITLLSSDGKPIARKGAIIDRPVVVADLPPHVPQAFMAIEDRRFYSHWGIDPRGIARAAWRNTVAGGVREGGSTITQQLAKVAFLDSDRTAARKLREVLIAFWLEARLSKDEILSRYLSNVYFGDNVYGLRAAALHYFNRQPEKLNVAQAAMLAGLLKAPSRLSPAVNLKGARERQRVVVAAMADAGFLTPAEAAGVPPASLNLRPLKMLPSGTYFADWALPAARDNAGAVYAEQEVKTTLDSRIQRAAEAAVRRAGLGKAQVALVAMRPDGSVVAMIGGKNYADSPFNRATQARRQPGSTFKLFVYLAAIRHGLTPDSLVEDEPITIAGWSPKNNDGRYRGKITLREAFARSSNVAAVRIASEVGMDNVIRAARDLGITSPLAADDATLALGTSGVTLLELTSAYAAIAANAYPVKAHALPDKERSWYDAFWDRPRAFDGETRAMLLDLLGAAVREGTGRSATLAIDAFGKTGTSQDNRDAIFVGFSGDLVAAVWVGNDDNSPLGGIAGGGLPARIWRDFMSRVVDGAAPPVVEREPAPAAEPDPIGDLIENQVDNLSIAVNGAIGDVDVGLRVGPDGLTISANPGNNRPPEERRGPGPAIAPPPPVPEPVPNGQ >NC_020561.1|WP_015456898.1|157497_157971_-|Lrp/AsnC-family-transcriptional-regulator MVKKAGAPFDIDGLDEKIIAALRCNGRIATRDLATEVGVKEATVRAHLRRLEDNDIVRVVAMRDLAALGYNCVSAVGIQVRGRPAADVAAELAEMEQVITVAVAIGIHDLEVQLVARDVHELDQLLTGVIAKVRGVDQIFPSVALKVMKYVSEWAPF >NC_020561.1|WP_015456899.1|158566_159439_+|helix-turn-helix-transcriptional-regulator MHEDQAGAIEQHFAVGDFRLDVLSQPDTGPFTRTHLVDYPSIAYLPTGQGEDPVRGCFGEPRSHRSFVPFGAAVLVPANLAVHVQSTGYAERRLLICRFDPDIFESLTGLGANASGDELAACIDVRDAAVLATLERLSIAVSRPSTAREMLVRGLGMVLLAELTRHFELVRERGFHRAGTLAPWQLKRIDQRLADESKPVPSVSELASLCGIGRRHLMRAFKATRGSTVMEHVERTLFARAARMLGETTIPVKSLAVSLGYERQGSFSAAFRRRFGETPRDYRARASAGR >NC_020561.1|WP_015456900.1|159486_160734_+|MFS-transporter MKPHRAEGTPQRLRAADIGLIAMLAFVVMFEGFDISLTSVVLPFVGKAYGVDAEGLGRSLSVIGLGAIAAWFVIRLSDRFGRRPVLLLSAGAFSIGSLATILMPTIESYTLVQALTRIALVSQIATAYLIVSESLPPALRGRAAGLLGACGSFGAALPAALLATALDTSLSWRGLFLVGGAPLLILPLLWFRLGETPAFTARKAAPSNALEELRMLVAPGLRRRFVAMSLLWLIVNFSAVVSTFFFTFYVLNERGWTAADLALIAPFGLGSAFFGYLAAGFLMDGIGRRATAALFFVANGLLVMICYAATGWLAIAACYVGIQAMLGTWTICFTLNAELFPTHVRAAANGWCHNLIGRWGMVGTPLLIGWLSRLWGSVGTTCFWLGLSCFAALPVILFALPETRGRNLSTEESDA >NC_020561.1|WP_015456901.1|160730_161477_+|SDR-family-oxidoreductase MNRMVGKVALVTGAASGIGRASAVRLASEGAIVICADRNMAGAEETASGLSGASAVQFDAASAASCRDLVAHVVARHGKLDVLCNIAGIGGFGHAAEISDESWDQLVAINLSSIFHLTKAALPHLEKTQGNIVNMASASGLVGAAYASAYSATKAGVVGYTRTVAIEYAARQVRVNAICPGGVDTPLIAGGMGDIEGVDFALILRMSPKMAPLAQPEDVAAAVAFLASDDARFITGIMLPVDGGQTAG >NC_020561.1|WP_015456902.1|161461_162049_-|EthD-domain-containing-protein MMKSIGFLPRLAGIARPDFRNYYETRHAPLADSYFHFAGYVRNHIVDGQEPGFDCISEFWTADPAAIATLLAGEAGERMRADERNFADSPNIRPALAEPAPTGRLVPLGPRTVQFLGGHDNARLIAAVAASAGAEALTLDFLTPFDAASRAPCDALLIREGTAAAAPSLPSGWTLLASLQVVAEGALPISHQPAV >NC_020561.1|WP_015456903.1|162182_162596_-|Rrf2-family-transcriptional-regulator MLSQRTRYAIRALLHLGDRYGEGPVQLPEIAEAQNIPAKFLTVILSEMKRAGLVETLRGKEGGYWLARPPEEITYGEIVRLTRGSLALVPCAARLAYHPCENCVDEATCRLRAVMLSVRDETANILDRVSLSEKMAV >NC_020561.1|WP_015456904.1|162892_163942_+|Glu/Leu/Phe/Val-dehydrogenase MTAPWDFPDYDDHEGVHLFRDQASGLTAIIAIHSTALGPAAGGTRFWHYPNRADAITDALRLSRGMSYKNAMAGLPMGGGKGVILADRNRTKTPEMLAAFGRAVESLGGRYVTAEDVGITDADMVEVRKQTTHVAGLPVGSDAAGGDPGPFTSLGVFLGVKAAIRRALKRDDVAGVHVAIQGVGSVGGGLARRLAAEGARLTLADVDAARAERLAEELGAKTVAAGDIARVEADVFSPCALGAILDEASIPLLSVPVVAGGANNQLATKEDGARLHARGVLYAPDYVINGGGIINVGLEYLGGADRAEVERRIGHIPGRLEQIWQESAETGDPSAEVADRIARRLIGRH >NC_020561.1|WP_107394544.1|164199_164994_+|PEPxxWA-CTERM-sorting-domain-containing-protein MFVGGAAYADTTVVPASSLTSSGNYYTDNIGDIVVMTGGGNAPGIGNPSGRNDDGFSGPIDLGFNFTLYGNTYSSLYINNNGNVSFGAGISAYVPTGPTGANAPLVSVFFGDVDTRGANSGVVHYQLDTPGQLIVTWDNVGRYNGRSDLLNSFQLVLRSDDFVIPTNEGQIGFFYKNMGWDQTDTSQVAAIGFGDGAGNATILEGSLSSGLNRVVQNKYIWFNANLEPVPSGVPEPTTWAMMLIGFGVVGVSMRRRQRVRVAFA >NC_020561.1|WP_187294045.1|165051_165933_-|DMT-family-transporter MRGILLRIGSVVMFGIMQAAMKLAGEHGVIAIEMVFYRSIFGLPIVLAWLAIGPGFATIRPNRPRAHVWRSIIGLSGITLNFTALILLPLADATTIGFTAPIFATILSALLLHEHVGRHRWLAVAIGFLGVVVITRPGAASGLPAIGILVALGGAVGTSAVTVTLRQLGSTETVGAIVFWFFVGCAIVGGIGTAIWGSGHDAATFGLLTIGAWAGAAAQLLMTASLRAAPVSTVAPFDYLQIIIAISLGWLIWATGPSLATLAGAAMIAGSGLYTAYREHRLRRDSVAATPPV >NC_020561.1|WP_015456907.1|166102_166819_+|cytochrome-c-biogenesis-protein-CcsA MHIFANPNRFLGIARPLTPWLGWGGAVLTAIALLSGLFLTPPEQLQGESVRIMYVHVPSAWLGMGGWTGIAVASLMQLVWRHPLAAVAARAVALPGALFTAICLVTGSIWGRPTWGTWWEWDGRLTSMLVLLFLYIAYIALAGATADRAGGSRVAAIFGLVGAINIPIIKYSVDWWNTLHQTASITLTKNTIDPSILWPLPIALIGFSMLFGAIVLMRMRALLAEARIEARLKRMADA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020561_2 | 1587617-1587795 | Orphan |
NA
Consensus repeat of NC_020561_2
|
2 spacers
spacers of NC_020561_2
>2.1|1587674|20|NC_020561|PILER-CR CTTCGGGGCAGGCGCTGGCT >2.2|1587751|6|NC_020561|PILER-CR CCGGCG |
CRISPR arrays and Neighbor proteins around NC_020561_2
The CRISPR arrays of NC_020561_2 >merge|NC_020561|2|1587617-1587795|PILER-CR GGCCGGGCGCGGCGTCGGCGCAGGCGCGGCGGCGCGCGGCGCGGCCGGCTTCGGGGCAGGCGCTGGCTTGGCGGCCGGCGCGGGGGTGGGCGCAGGCGCGGGCGCCTCCGCCACCGGGGCCGGCGTGGAGGCGGGAGCCGGGGTGGGCGCGGGCGCGGCGGCGGGAGCCTCGGCCACGG >NC_020561|2|1|1587617-1587795|PILER-CR GGCCGGGCGCGGCGTCGGCGCAGGCGCGGCGGCGCGCGGCGCGGCCGGCTTCGGGGC AGGCGCTGGCTTGGCGGCCG GCGCGGGGGTGGGCGCAGGCGCGGGCGCCTCCGCCACCGGGGCCGGCGTGGAGGCGG GAGCCG GGGTGGGCGCGGGCGCGGCGGCGGGAGCCTCGGCCACGG
>NC_020561.1|WP_015458262.1|1584827_1585259_-|30S-ribosome-binding-factor-RbfA MRRNETPEGKSVRVLRVGEQVRHALADILMRGDVHDDVLASHTVSVTEVRMSPDLRHATAFVKPLLGADEEKVLKALRTNTAYLQSEVARRVNTKYAAKLKFLADESFDEGSHIDALLRRPEIARDLDPDDAGGDGGEADRDG >NC_020561.1|WP_015458261.1|1583961_1584543_-|thymidine-kinase MAKLYFYYASMNAGKSATLLQADFNYRERGMETMLFTAAIDDRYAPGRISSRIGLEAEAFPFDVATDLRGEVESELARRPLACVLVDEAQFLTRDQVFQLASICDDLGIPVLAYGLRTDFRAELFEGSAHLLALADALVEIKAICECGVKATMNLRTDAMGRAVREGAQTEIGGNDRYVALCRRHFMERMRNG >NC_020561.1|WP_015458260.1|1583408_1583969_-|GNAT-family-N-acetyltransferase MADAARLLVPLVEGDARLVPLEERHREALRAACAADADIWTIYNVSYDPDHFDASFDALMANPARLGFAILQDDAVIGMTAYLGVDAGKGLLEIGNSYIAPAARGTGLNGRIKRLMIDHAIACGFRRIEFRIDARNGRSMAAVEKLGGVKEGVLRQERITWNGHLRDTVLYSILADEWRARFTAGS >NC_020561.1|WP_015458259.1|1582937_1583375_-|hypothetical-protein MTRHPCTVALLALMLASPAMAGQAPDPEAAPAAREAAIPFLGSESINDYRVEGRDTLYIQDIRGRWYKAELMGNCLDLDLAEVIGFDTGGTSSFDRFSTIVVRGRRCPLKSLVASPAPPPARGKTHAHHHGGKAPQSDPPEDDQG >NC_020561.1|WP_015458258.1|1581983_1582880_-|tRNA-pseudouridine(55)-synthase-TruB MDGWIIIDKPVGIGSTQVVSAVKRVLRQGGYGKHKVGHGGTLDPLASGVLPIAVGEATKLSGRMLDADKAYDFTIGFGTETDTLDAEGKAIATSDVRPPRAAVEAVLPRFTGAIDQVPPAFSALKVDGARAYDLARAGEEVVLKSRAVTIHDLRLSAWDGAGATLSARVSKGTYIRSLARDIAYALDTVGHVTMLRRTKAGPFTLDQAISLDKLEESAKGHALEDILLPLTAGLDDIPALAVSPDQARALREGRKLIGIAKHQGLHLAVSGQVPVALVEVSGPEIRVVRGFNIRDVEG >NC_020561.1|WP_015458257.1|1581708_1581978_-|30S-ribosomal-protein-S15 MTITAARKAELIATHARGEGDTGSPEVQVAILSERIANLTEHFKTHAKDNHSRRGLLMLVNKRRSLLDYLKREDAGRYADIVAKLGLRK >NC_020561.1|WP_015458256.1|1579186_1581511_-|polyribonucleotide-nucleotidyltransferase MFNIKKQEIQWGGQTLTLETGRVARQADGAVVATLGETVVLCAVTAARSVKEGQDFFPLTVHYQEKYFSSGRIPGGFFKRERGATEKETLVSRLIDRPVRPLFPEGFYNEINVIAQVLSYDGENEPDILAMIAASAALTLSGVPFMGPIGAARVGYKDGEYILNPTDAQVAEGDLDLVVAGTHDAVMMVESEAKELSEDVMLGAVMFGHREMQKVIDAIIDLAEAAAKDPWELAAQPDTSAMKAKLKKLVGKDIAAAYKLINKSDRSNALNAARAKAKEAFADASPQDQMVASKLVKKLEAEIVRTAILKDGRRIDGRDTKTVRPIVAEAHFLPRAHGSALFTRGETQSISTCTLGTKDAEQMIDGLNGLRYEHFMLHYNFPPYSVGEVGRFGAPGRREVGHGKLAWRALHGVLPTKEEFPYTIRLTSDITESNGSSSMATVCGGSLALMDAGVPIKRPVSGIAMGLILEGKDFAVLSDILGDEDHLGDMDFKVAGTSEGITSLQMDIKIAGITEEIMKVALHQASDGRAHILGEMAKALDHTRTELSAHAPRIETMTVPKEKIRDVIGTGGKVIREIVAQTGAKVDIEDDGTVKISSSDLDKIEAAKNWIIGIVAEPEVGKVYTGKVVNLVDFGAFVNFMGGRDGLVHVSEIKNERVAKVSDVLSEGQEVKVKVLEVDQRGKVRLSMRVVDQETGAELEDTRPAREPREGGDRGPRGDRGDRGDRGDRGDRRREGGDRGPRRDRGDRGPRRERDNDDGPAPEFAPAFLKRDDD >NC_020561.1|WP_015458255.1|1578826_1579099_+|hypothetical-protein MNRMIKLTAIAAFAALAACGGKGDDSLAANVEQAYDNQADQLDAIADNTTNDAQADAIEDQADTLRQEGDNRADAIDAADVNAAATHNGL >NC_020561.1|WP_015458254.1|1577703_1578123_+|large-conductance-mechanosensitive-channel-protein-MscL MLKEFKAFINRGNVLDLAVAVIIGAAFSKIVSSLTDDIIMPVVGKLFGGLDFSGYFIRLGEIPANFAGSANSYADLKKAGVPLLGYGEFITVAVNFLIVAFIIFLIVRAVNRAIPLEGPADTPDVAVLKEIRDELKKRP >NC_020561.1|WP_015458253.1|1576968_1577604_-|NUDIX-hydrolase MNHDPVQKHSAAGHPLPDDADQPAEILWQGRFIEARRKGKWEYVGRARGIGAAVILAVDDGHVLLVEQYRVPLGAPCLELPAGLVGDDVAGEPIETAAGRELEEETGYRAGRLENAGCFAASPGMVSETFTLIVARDLVRVGPGGGVEGENIVVHRVPLDEVADFVAERRRAGVMMDVKLLLLLGAGLIGSTLPDGRQAPATPLAPMLRGH >NC_020561.1|WP_015458264.1|1588054_1588858_-|DUF448-domain-containing-protein MASNEHPSAIAPTHRPSRAKPKGGPRAGGKHAESAPDAGGEDVVDTGHGPERRCVLSGDHGPRDGLIRLALGPDGTVAPDVRAKAGGRGAWIAVDRVALETAIAKGKLKGALARAFKTASFLIPDDLPAQIERALERAALDRLGLEARAGNLVTGSERIVDAARKGTVALLLHARDAAADGTRKLDQALRVGLDMEGTDTRGLVIPASRAILSMALGRENVVHIALVAPAAAARVSDALGRWRGFIGRNGSAEPCDTPSQGPSALRN >NC_020561.1|WP_015458265.1|1588844_1590452_-|transcription-termination/antitermination-protein-NusA MATAISANRAELLAIADAVAREKLIDREIVIEAMEDAIQRAARARYGAENDIRAKIDPRSGDMRLWRVVEVVEQVDDYFKQVSVADAQKLQPGAAVGDFIVDPLPPIEFGRIAAQAAKQVIFQKVRDAERERQYDEFKDRAGEIITGVVKRVEFGHVVVDLGRAEGVIRRDQQIPREVLRVGDRVRSLILSVRRENRGPQIFLSRAHPDFMKKLFAQEVPEIYDGIIEIKAAARDPGSRAKIGVISHDGSIDPVGACVGMKGSRVQAVVQEMQGEKIDIIPWSPDTATFVVNALQPAQVARVVIDEEEERIEVVVPDDQLSLAIGRRGQNVRLASQLTGKAIDILTEADASEKRQKEFVQNSEMFQNELDVDETLAQLLVAEGFGSLEEVAYVEADEIASIEGFDEELAAELQSRAQEALDRREQANRDERRALGVEDDLADLPYLTEAMLVTLGKAGIKTLDDLADLATDELVQKKRAEPRRRNENAPKRAEDKGGVLAEYNLTEEQGNEIIMAARAHWFADEAQEDAAADGEQ >NC_020561.1|WP_015458266.1|1590461_1590992_-|ribosome-maturation-protein-RimP MADADIAALTKLIEPEAQALGLALVRVAMFGGKSDPTLQVMAERPDTRQLDLADCEALSRRISDVLDAADPIEEAYRLEVSSPGIDRPLTRLKDFEDWAGFDARIKVAPPLDGRKQFDARLDGLEGETVKVYAERVGEVAIPFGRIASAKLILTDALLKATAPLSTEGADRISKEG >NC_020561.1|WP_015458267.1|1591198_1592431_+|class-I-SAM-dependent-methyltransferase MWLLDRMLSGIVKRGVLHVTYADGTEKAYGTATPGWAEIRIRFTDKGAPNFIARNPRLGAAEAWMDGRLTVEGDDVRGLIDLLRGNAPWEKGGDKLKASFWREQLQSILARLDRINWERRSRRNVAHHYDLNGRLYDLFLDKDRQYSCAYFTDPGNSLEQAQADKKAHIAAKLDLKPGQKVLDIGCGWGGMALYLHRVADVDVLGITLSEEQLAVARRRAQEAGVADRVKFELIDYREVQGQFDRIVSVGMFEHVGPPHYRTFFDKCRTLLAEDGVMLIHTIGRMGKPSTTDAFTAKYIFPGGYIPALSEVVSASERSKLILSDLETLRVHYAWTLDIWYDRTVAARAEIEALYDARFYRMWLFYLAGAAAAFRHGGMCNYQLQYIRRRDALPYTRDYIAEAERELRAKA >NC_020561.1|WP_015458268.1|1592523_1593672_+|PQQ-dependent-sugar-dehydrogenase MHRPTSFILPAIALLAACGGAGEEGNAAAPATAAAADKPFVATVVADFDSPWAMTFLPDGRMLVTEKAGRMLLVSADGKAATPLAGIPAVDSEGQGALMDVVLHPKFAENRLVYFSFSEKGEGGKGVALARGTLAEGPAPALRDVQVIFRASPYVEGDGHYSGRIAFAPDGHLFFTNGERQKFDPAQDPKSTLGKVLRLNDDGTPAKGNPLAARGFHPAIWSYGHRNLLGLAFDAQGNLWEQEMGPRHGDELNLILPGRNYGYPIVSNGDHYDGRPIPDHDTRPDLEAPKVYWKPAISPAGLMIYSGDMFPEWKGSAFIGAMNMPGLVRVALDGTSAAKADQWDMDGQRIREVEQGPDGAIWLLEDGLRGSQGRLLRLTPRR >NC_020561.1|WP_015458269.1|1593696_1594356_-|hypothetical-protein MRVVIAAPVLMGLALSGCGPKALTLPDDPIDRAATCGVVAALGARAAGGGNVAAALPFDRQAGIMHYALLAGAEGKSFDQSRAAAVAARMPQLEAGISAGKWQDLAPACAAAYPQTQEPAGGPIDLPQDALRAETGCYALGAFLNKTLGGPTSAYKDRLAEFTPMNRALDAKIGAGIAARGLKPDAAVALRSEALATMVKLGPPAGVMASCVARFTPNG >NC_020561.1|WP_015458270.1|1594644_1595214_+|TMEM165/GDT1-family-protein MEALLTSTALVALAEIGDKTQLLAIVLATRFKRPWPIVAGILVATLANHFLAALIGSNVAALLDGTWFRYLVAFSFIAMAAWTLIPDKLDDVETKPARFGAFMTTVIAFFLVEMGDKTQIATVALGARFHDVIAVTAGTTLGMMIANVPAVFLGNELVKRVPMRVVHAIAALLFLAIGLWLVAQTAGWL >NC_020561.1|WP_015458271.1|1595284_1595692_-|DUF1636-domain-containing-protein MLTRVADGPAVVVCNSCRHSAASREDGEGVRGGARLAEALRAVQATDPDTAHIAIQEMPCLFACSEHCTVHIRAPGRTGYVLGRFAPTGDAARAILDYAVRHAASEEGRVPLREWPEGVKGHFIVRVPPPGFVAD >NC_020561.1|WP_015458272.1|1595706_1597665_-|TonB-dependent-receptor MLRSVLLTSFLLSAPAFAEVQVAPVPDAATPPDYDGGAIIVTATRAPIAIDRLASSVTVLDKAAIDRAQDIGVTELLWRTPGVTVSRNGGYGTVTSVRIRGAEAEQTVVVIDGVKLNDPSSTSGGYNFANLLVGDAQRIEVLRGPQSILWGSQAIGGVVNVVTAMPEKDLEASFDVEAGSRDTVNARAGLGGRTGPLAWRIGGNVFTTDGISAIRADQGGGERDGYSNRSLTGRAELEIADGVSADVRGYYSRGRTEIDGFAGDTAEYGINREFVGYAGLNVALLDGRFRNRFAFGYTDTDRDNYDPTRQRQQTFDAAGRNRRFEYQGSFAITDTWTALFGVENERSRFRTVSPAASLAIPVPDPVRGHAGITSLYAQLTGEVLPGLTVNGGVRHDDHKTYGGKTQFAGGAAWSLPTGTVLRASYAEGFKAPTLYQLYSEYGNTTLSPERARGWEAGIEQHLFGDALTLGATWFDRRTKDQIDFYSCPFPAPTDPDEIDPLCLTPAGDARFGYYLNIARTRSRGIEATASLKLSDRLLVDGNYSWIDAENRDTGKWLSRRPRNAANGSISYQWPFGLTTGAAVRWAGKSYDDAGNNRRLDDYTLVDLRAEYDLGGGVRLFGRIENLFDEDYQTVYRYGTLGRSVYGGVRARF >NC_020561.1|WP_015458273.1|1598012_1598789_-|ABC-transporter-ATP-binding-protein MVTIRAESLGVALGRRAVLANVDADLAPGRLIGVIGPNGAGKSTLVRALLGLVPLSGGGVRVDGQPVARLPRAALARRIAYLPQGQTLHWPLTVERLVALGRLPHLAPLSRIGEADVAAIDRAIEQADIGHLRGRVATELSGGERARVLLARALAVEAPALIADEPLAALDPGHQLEVMALLRRQADAGALVVAVLHDLSLAAGHCDRLLLLHHGRLVADGPPDRVLTADRLADVYGVRAWIGEVEGRRLVVPISHHG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020561_3 | 1616584-1618020 | TypeII |
NA
Consensus repeat of NC_020561_3
|
21 spacers
spacers of NC_020561_3
>3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT CGGGCAAGACGGTTGGGCGACGCGCGTTTG >3.2|1616686|30|NC_020561|CRISPRCasFinder,CRT GAAGTTCGCCGGGTCTACGCACGCGCTTTC >3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CCTATGTCCGTAACAACCCGGACGTGGCCG >3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR AGTGATGACTGACATCGCAACGATAGCGGC >3.5|1616884|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CGAACGTCGCCCTGTAACAACAGCCCTGAA >3.6|1616950|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR AGCCCGCTGCAAAGGCGGATTCCGCGACGC >3.7|1617016|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CCGAGTTGCTCGACAGCCAACGCGCTTTAG >3.8|1617082|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR GTATCTGTGCGCCAGTCGTACATTGTTGAC >3.9|1617148|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CCTTCCACGCGTCAAGCTCACCTTCGAACC >3.10|1617214|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR TTTGGCGAAGTCCGCCCACATATGCGCGCA >3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CGCGGCGAGACCCACGTCAACAACCTGCTG >3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR GCCCATCCCGAGCTCGCGCTTGTAGCGCAT >3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR GATTCTTGCCGCGATGGCGGCGGCCCAGGC >3.14|1617478|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR ACTCGCTGCGAGGGGACGGGGAGAGGAAGG >3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CCCCCAGGGCGCATAGCCAAGCCGGCCCAC >3.16|1617610|44|NC_020561|CRISPRCasFinder,CRT AAGATATCACACAGGCGGTATTGCTGGAGGCGGTATTGCTGGTT >3.17|1617690|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR GAAACATTCGATGCGCCAGATCCAGATGAT >3.18|1617756|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CTATGTTGACGCGCAGTTCGGTTTGGCCAA >3.19|1617822|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CCAGCGGACGGACGCATATGGGCAAGCGGC >3.20|1617888|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR CCTGTGCGCCCGCGCGAGGATGACCATAAT >3.21|1617954|31|NC_020561|CRISPRCasFinder,CRT,PILER-CR TCCTTTTACGCGATGAGGGCAGTGAGCCCGG |
cas2,cas1,cas9 |
CRISPR arrays and Neighbor proteins around NC_020561_3
The CRISPR arrays of NC_020561_3 >merge|NC_020561|3|1616584-1618020|CRISPRCasFinder,CRT,PILER-CR,PILER-CR AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCGGGCAAGACGGTTGGGCGACGCGCGTTTGAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCGAAGTTCGCCGGGTCTACGCACGCGCTTTCAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCCTATGTCCGTAACAACCCGGACGTGGCCGAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCAGTGATGACTGACATCGCAACGATAGCGGCAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCGAACGTCGCCCTGTAACAACAGCCCTGAAAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCAGCCCGCTGCAAAGGCGGATTCCGCGACGCAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCCGAGTTGCTCGACAGCCAACGCGCTTTAGAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCGTATCTGTGCGCCAGTCGTACATTGTTGACAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCCTTCCACGCGTCAAGCTCACCTTCGAACCAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCTTTGGCGAAGTCCGCCCACATATGCGCGCAAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCGCGGCGAGACCCACGTCAACAACCTGCTGAGCCCACCATCGGCAAATCGGTAGGGAAACCACGGCGCCCATCCCGAGCTCGCGCTTGTAGCGCATAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCGATTCTTGCCGCGATGGCGGCGGCCCAGGCAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCACTCGCTGCGAGGGGACGGGGAGAGGAAGGAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCCCCCAGGGCGCATAGCCAAGCCGGCCCACAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCAAGATATCACACAGGCGGTATTGCTGGAGGCGGTATTGCTGGTTAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCGAAACATTCGATGCGCCAGATCCAGATGATAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCTATGTTGACGCGCAGTTCGGTTTGGCCAAAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCCAGCGGACGGACGCATATGGGCAAGCGGCAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCCCTGTGCGCCCGCGCGAGGATGACCATAATAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCTCCTTTTACGCGATGAGGGCAGTGAGCCCGGAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC >NC_020561|3|2|1616584-1618020|CRISPRCasFinder AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGGGCAAGACGGTTGGGCGACGCGCGTTTG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GAAGTTCGCCGGGTCTACGCACGCGCTTTC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTATGTCCGTAACAACCCGGACGTGGCCG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGTGATGACTGACATCGCAACGATAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGAACGTCGCCCTGTAACAACAGCCCTGAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGCCCGCTGCAAAGGCGGATTCCGCGACGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCGAGTTGCTCGACAGCCAACGCGCTTTAG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GTATCTGTGCGCCAGTCGTACATTGTTGAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTTCCACGCGTCAAGCTCACCTTCGAACC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TTTGGCGAAGTCCGCCCACATATGCGCGCA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGCGGCGAGACCCACGTCAACAACCTGCTG AGCCCACCATCGGCAAATCGGTAGGGAAACCACGGC GCCCATCCCGAGCTCGCGCTTGTAGCGCAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GATTCTTGCCGCGATGGCGGCGGCCCAGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC ACTCGCTGCGAGGGGACGGGGAGAGGAAGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCCCCAGGGCGCATAGCCAAGCCGGCCCAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AAGATATCACACAGGCGGTATTGCTGGAGGCGGTATTGCTGGTT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GAAACATTCGATGCGCCAGATCCAGATGAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CTATGTTGACGCGCAGTTCGGTTTGGCCAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCAGCGGACGGACGCATATGGGCAAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTGTGCGCCCGCGCGAGGATGACCATAAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TCCTTTTACGCGATGAGGGCAGTGAGCCCGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC >NC_020561|3|1|1616584-1618020|CRT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGGGCAAGACGGTTGGGCGACGCGCGTTTG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GAAGTTCGCCGGGTCTACGCACGCGCTTTC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTATGTCCGTAACAACCCGGACGTGGCCG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGTGATGACTGACATCGCAACGATAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGAACGTCGCCCTGTAACAACAGCCCTGAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGCCCGCTGCAAAGGCGGATTCCGCGACGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCGAGTTGCTCGACAGCCAACGCGCTTTAG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GTATCTGTGCGCCAGTCGTACATTGTTGAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTTCCACGCGTCAAGCTCACCTTCGAACC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TTTGGCGAAGTCCGCCCACATATGCGCGCA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGCGGCGAGACCCACGTCAACAACCTGCTG AGCCCACCATCGGCAAATCGGTAGGGAAACCACGGC GCCCATCCCGAGCTCGCGCTTGTAGCGCAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GATTCTTGCCGCGATGGCGGCGGCCCAGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC ACTCGCTGCGAGGGGACGGGGAGAGGAAGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCCCCAGGGCGCATAGCCAAGCCGGCCCAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AAGATATCACACAGGCGGTATTGCTGGAGGCGGTATTGCTGGTT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GAAACATTCGATGCGCCAGATCCAGATGAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CTATGTTGACGCGCAGTTCGGTTTGGCCAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCAGCGGACGGACGCATATGGGCAAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTGTGCGCCCGCGCGAGGATGACCATAAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TCCTTTTACGCGATGAGGGCAGTGAGCCCGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC >NC_020561|3|2|1616716-1617609|PILER-CR AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTATGTCCGTAACAACCCGGACGTGGCCG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGTGATGACTGACATCGCAACGATAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGAACGTCGCCCTGTAACAACAGCCCTGAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGCCCGCTGCAAAGGCGGATTCCGCGACGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCGAGTTGCTCGACAGCCAACGCGCTTTAG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GTATCTGTGCGCCAGTCGTACATTGTTGAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTTCCACGCGTCAAGCTCACCTTCGAACC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TTTGGCGAAGTCCGCCCACATATGCGCGCA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGCGGCGAGACCCACGTCAACAACCTGCTG AGCCCACCATCGGCAAATCGGTAGGGAAACCACGGC GCCCATCCCGAGCTCGCGCTTGTAGCGCAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GATTCTTGCCGCGATGGCGGCGGCCCAGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC ACTCGCTGCGAGGGGACGGGGAGAGGAAGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCCCCAGGGCGCATAGCCAAGCCGGCCCAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCAAGATATCACACAGGCGGTATTGCTGGAGGCGGTATTGCTGGTTAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GAAACATTCGATGCGCCAGATCCAGATGAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CTATGTTGACGCGCAGTTCGGTTTGGCCAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCAGCGGACGGACGCATATGGGCAAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTGTGCGCCCGCGCGAGGATGACCATAAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TCCTTTTACGCGATGAGGGCAGTGAGCCCGG >NC_020561|3|3|1617654-1618020|PILER-CR CCTATGTCCGTAACAACCCGGACGTGGCCG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGTGATGACTGACATCGCAACGATAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGAACGTCGCCCTGTAACAACAGCCCTGAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC AGCCCGCTGCAAAGGCGGATTCCGCGACGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCGAGTTGCTCGACAGCCAACGCGCTTTAG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GTATCTGTGCGCCAGTCGTACATTGTTGAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTTCCACGCGTCAAGCTCACCTTCGAACC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TTTGGCGAAGTCCGCCCACATATGCGCGCA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CGCGGCGAGACCCACGTCAACAACCTGCTG AGCCCACCATCGGCAAATCGGTAGGGAAACCACGGC GCCCATCCCGAGCTCGCGCTTGTAGCGCAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GATTCTTGCCGCGATGGCGGCGGCCCAGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC ACTCGCTGCGAGGGGACGGGGAGAGGAAGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCCCCAGGGCGCATAGCCAAGCCGGCCCAC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGCAAGATATCACACAGGCGGTATTGCTGGAGGCGGTATTGCTGGTTAGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC GAAACATTCGATGCGCCAGATCCAGATGAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CTATGTTGACGCGCAGTTCGGTTTGGCCAA AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCAGCGGACGGACGCATATGGGCAAGCGGC AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC CCTGTGCGCCCGCGCGAGGATGACCATAAT AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC TCCTTTTACGCGATGAGGGCAGTGAGCCCGG AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC
>NC_020561.1|WP_015458291.1|1616195_1616525_+|CRISPR-associated-endonuclease-Cas2 MQADEVRFMWLMVFFDLPTRTKPQRRRANRFRQFLKKDGYIMLQFSVYARVCRGQDAVDKHVRRVRTSLPKEGSVRTLQVTDRQYGRMELMLGIAPKTEEIGSSQMVLL >NC_020561.1|WP_015458290.1|1615291_1616194_+|type-II-CRISPR-associated-endonuclease-Cas1 MAWRGLHISNPARLSHRSRQIVVDPEGGSEILTFPVEDVAWIILDTPQVTLTGSLLSALAENGVAMVVPDARHHPAGMLLSFHQHHAQSAIAHSQIAMTQPLRKRLWQKLVVAKIENQAAVLRGIGHDYADTLSAMAARVGSGDPDNLEAQAARAYWQRLFADFWRHDEDRRNGLLNYGYAVVRAALARACAASGLLPAFGVHHRSRANPFNLVDDLLEPFRPAVDRLARLRALQEERDELDVADRRHMAGILGENIAIGEEHLTMLAATEAVAASLVQAIDGGNAALLNTPALPLARRG >NC_020561.1|WP_015458289.1|1612172_1615334_+|type-II-CRISPR-RNA-guided-endonuclease-Cas9 MSGMVFGIDLGIASCGWAVLRQPQRDGDPGEIVDLGSWMFDVPETDKERTPTNQVRRGNRLLRRVIRRRAQRMVEIRRLFHDHGLLAGHAPEALKRAGLDPWDLRARSLDKVLEPAALAVALGHIAKRRGFKSAARRKEANTAGDDQKMLKALEATHERLGRYRTIGEMFARDPDFESRKRNRDGMFDRTQGRDDLLHEVGEIFKAQRRLGSALATAELEQAFTAIAFRQLPIQDSERLVGLCQFEPKEKRAARFAPSFERFRLLQRLTNLRVVTVEGERPLTADEIAAAAADLGRTAKLSVKEVRKRIGLAADHRFAAFKADEEDRDIIARTGEALHGTYRLRKALGEGLWAEMLPGQLDAIAHALSFFETQDVILKELDKLDLPAGVRDAIATGLDAGAFARFKGTGHISARAARALLPHLEAGLRYDQACTKAGYDHAASRWAKREQVADKAAFNRLVTDMGAEIANPIARKSLTEALKQLWAMRNRWGLPDAIHIELARDVGNSLEKRREIERAIEKNTAARERERGEARDLLGIDDVSGDTLLRYRLWKEQAGRCPYTDAPIPPGAIIATDNSFQVDHILPWSRFGDDSFANKVLCATAANQRKKGQTPCEWITAAQGEEGWATFVARIEGNAAFRGPKKRNYVLKNAKEAEERFRARNLNDTRYAARLLAEAVKLFYPEGERQDKGGVRRVFTRPGGLTAALRHAWGVEALKKRDGKRVDDARHHALDALVVAAIGEGEVQRLTRSYQEWEQQGLARPLRRVDPPWGDFHSFRREVKDAYDGIFVARPERRRARGEGHAATIRQVRERDGAAVVFERKAIADLSEKRLADIKDPERNQAIVEAIRQWIVDGRPADRLPRSPAGDEIRKVRLRTKGKPAVQVRGGAADRGEIVRVDVFTKPNKKGKNEFYLVPIYPHQVMNKAEWPTPPMRAVVAYKDESEWTLLDENFGFLFSLFPRSYVEVTKPGGEVLSGYFQGMDRSTGAISLFNHRDSRSLTDDSGNSTRGIGAKTLLTMKKYSVDRFGKRAEVKSEVRTWHGVACTYPTPPG >NC_020561.1|WP_144062010.1|1611246_1611915_+|hypothetical-protein MPFTQDELHELPTVISAPRFATYLQAMGNYREKALELYEWNLALSSALIVPLQVCEIAIRNGIAEGIELVHGATWPWSNGLIRSLPRPKKRFHYIPADDLKACAARLPTTGKIIAELKFAFWENIFTVGQETRIWNKHFRTCFPGAPAQQTISQCRITAYNDLRGIRHLRNRIAHHEPVFTRNIADDYQRIHDMIAWRNPVAAAWMDGKQTVLGLLGQRPQP >NC_020561.1|WP_015458287.1|1610738_1610930_+|type-II-toxin-antitoxin-system-Phd/YefM-family-antitoxin MAITTFPSRALSRHIGQVKRAARNGPVFITERRRVAYVLLSIEDYQRLLSDGEGEAAADGASP >NC_020561.1|WP_015458286.1|1610145_1610712_+|recombinase-family-protein MALIGYARVSTADQKLSLQQDALAHAGCERIFDDQASGAKADRPGLAEALAYLRSGDTLVVWKLDRLGRSMRHLIDAVDALAARGIGFRSLTEHIDTTTPGGMLVFNIFGALAQFERDLIRERTQAGLSAARERGSRGGRRPVVTPDKLRKARQHIAAGLTVREAAARLRIGKTALYKALESDRNDMA >NC_020561.1|WP_144062009.1|1609430_1609904_+|hypothetical-protein MQKMIKMPKFLFLSFFVFVSIFLIVILFLSSELLKDPCYALSLDKKDGILYNVLPANYCIPSSSLTIYGSLHEDNGNMQFTGSPRNSSEKISAILNISEQAAFDGMKGTMVPCIKNKNSHIVFENISVSGRLIKPDSQSIYRKNIILAERIMCLNHE >NC_020561.1|WP_187294040.1|1606278_1609443_+|hypothetical-protein MTSDIATSRVISDTDALNRTTSYQRDSFGRVTRVTAPEGNYTQFTYDARGNVTQTRSVGKSGSGLADIVKSAVYPATCGNAITCNKPTSTTDARGKVTDYTYDATHGGVLTVTAPAAPNGVRPQSRYSYTPMQAYYKNSAGSLVASGGTGILNTYVLTSTSTCKTTASCAGGADEVKTTLGYGAQVAGTANNLLPVTTSSGSGDGALTATASVTYDSIANTLTVDGALSGTADTIRYRYDAARRVVGTVSPDPDGAGALKHRATRYTYRPDGLVSQVESGTVASQSDADWAAMAVLDKAQISYDVNGRKVKEELYGGVTLEAVTQTSYDALGRVDCVAQRMNKAVFGSLPASTCTLGTAGADGPDRIAKTIYDAASQVTKVQTAYGTSLQRDEVTNTWSNNGKLLTVADAKGNKTTYEYDGFDRLSKTRFPSPTTPGTSSTTDYEQLGYDAGSNITSRRLRDGTSIAFSYDNLSRATSKDLPGTELDVSYAYDNLGRVTTATDTASNFVGAAYDALGRMTAQSSALGVFGMAYDLAGRRTKLTYPDNFYVNYDYLVTGEMTAVRESGATSGAGLLATFAYDDRGRRTSLTRGNGTVTSYGYDNASRLSQLTQNLTGTASDFTQTFTYNVGGQLTRQDRSNDLYSWTQHVNLNRSYTVNGLNQYSAITGVAPAPAYDARGNLTNGGTGTYAYSSENYLISGPGVTLSYDPSGRLLQTAGSVTKRFAYDGANLAAEYSSTGVLQQRYVHGSDVDEPLVWYEGSGTTDRRWLHADERGSVIAVSDSAGNTIAINAYDEYGIPQSTNLGRFQYAGQTWLPELGLYYYKARIYSPTFGRFLQTDPIGYNAGMNIYAYANSDPVNLVDDSGNSPTNGVNLYDILNSLIRNDNRSRRFGVEYAQKLTYWPSMGFTYYSSYFRGERNQANVPSCSYCNVITHSHYTDFNVAGNENLSPDDINLSESIGKPIWGIMPNSTVKAYDPSSDMLYTLVKIDSSGNSLGAFSFGDLKGDIITKVTELKDGTFKVSYQTRNGGMGQIRVGLEGSQCSKSKDGGTVCKK >NC_020561.1|WP_015458284.1|1605677_1605923_-|hypothetical-protein MNENREAWCLRPAVLAADRGLRDAYADAIRTGVERSTLISYRKRWARLRNRSSDEPRYLIGSYRALAEELASLSDQARMGR >NC_020561.1|WP_041864840.1|1604891_1605674_-|serine/threonine-protein-phosphatase MALAPPPSAAPVRPGSVEGGLVYAIGDVHGCYDQLCGLLGRVMEDIAGRGAGRRPILIFCGDYIDRGPQSAEVLDALCWLDRRAGFELHLLKGNHEQALLDFLEMPEDGEGWLEFGGVATLASYGVAPPAADLGPQDFRRARNELLDRMPAGHLRLLQRLELIVSLGDYAFVHAGIRPGIALDRQDEDDLLWIRRDFLDAAGPHEKIIVHGHSWADARPDIGPHRIGIDTGAYQTGVLTALRLEDGGIQAIQFGAEERLS >NC_020561.1|WP_015458293.1|1618109_1621502_-|class-I-SAM-dependent-DNA-methyltransferase MDKADRVESFIDRWRGGEGGAERANYALFLVELVDLLDLPRPDPAEATRDRNDYVFERAVRRTDRDGKESIGRIDLYRRGCFVLEAKQSRWKNQAKEVQVPAAQLPLPAFAEPEILGRRNAARNWDVLMHNAREQAEQYARALEPDHGWPPFLIICDVGHCLELFADFSGQGKNYRQFPDRAGFRIYLDDLRDEAVRRMLRAIWLDPHSLDPARKSAAVTREIARRLAKVSKALEDRGHAPEKVAHFLMRCLFTMFSEDVGLLERGCFTQLLEESTATPASFAPLLEDLWRVMDKGGFSPVLHRPVRHFNGKLFADASAIPLQREDIGELLAAARHDWTQVEPAIFGTLLEQALDPGDRRQLGAHYTPRAYVEQLVVATVIAPLRAQWERVVLGTVERERVDHPGRAIGAVREFHAQLAQTRVLDPACGTGNFLYVALELMKQLEGEVLETLAALGGQEALALETMSVDPRNFLGLEINPRAAAIAELVLWLGYLQWHLRGGGAISDPVLQSFGNIACRDAVLAHDPERPKADGSGTERPNARPPEWPEADYIVGNPPFIGGKDLRSRLPAGYVEALWRAHPHINRSADFVMYWWDRAAELLTQKGTRLKRFGFVTTNSITQEFSRRVIAKRIEGRVPLHLVLAIADHPWTKASRDAAAVRIAMTVAEAGAGDGQLRTVVAEVALDSDQPVISMTVTDGRINADLTIGANLMEVVALRANGGLGSRGVSLHGAGFIVSPQEAEHLGLGRADGLERHIRPYRNGRDLAGRARGAMVIDLFGLDEATVRRRFPDVFGHVWRRVKPERDTNNRATYRDNWWIFGEPRRDLRPALEGLPRYIATVETAKHRVFQFLDAAILPDNMLVCIASDDAFHLGVLSSRIHVTWALRAGGWLGVGNDSRYSKSRTFDPFPFPEATAALRARIADVAEELDSTRKTVLAVQGDLTLTGLYNLRDKIERHEPLDMVEQDQRVRGRIDIICALHAHLDRVVAEAFGWPTDLADEDIVARLVALNAARHQEERNGIVRWLRTDYQLGRAGIEQLGLKVDVPDRIAAHHSSSSIRKPAFPRDAIGQTAAVLEALRSAPLLSAEAIADRYSNGHKALPRIGATLSALTRLGHVAAEGGDYSLRRAA >NC_020561.1|WP_015458294.1|1621636_1622326_-|hypothetical-protein MTFWEWIAVNKEQLGILITGAGVPLLLWQVTQSGRQERRRLRRRHAAARSTLPLTLSAICAYAGRAGAELRPMFYFYRGRGPHLEFTPPVASDQIIAAIERMIEAASKEEIAHRLADIASRMQVLSARMNGLVVSPSVFRSLVGELILDAAEIDALASSLFAFARRHTEKAPPPLTKSDIRNALHRIGCDEERDSEIYTALGETPQWQALPPWWRRLGNRFHKPILAEY >NC_020561.1|WP_084673633.1|1622336_1623329_-|MobA/MobL-family-protein MLRDWREQWAEIQNRHLRRHLGPDTPQVTHLSLDGQGVDREPMQHLGPTASAIERKGERSERGDINRDIHAANAERAAWKVRKREIEDELVRRTPHQPSSPQSLQAELRTLRDAMVAERAKWQAEVAAIGKPAVLKPYEVRRAILDPARTRLAQAERDLSATRERVQRLSTRRMQLAHWVKNPQRMIWAKIREVHAIDRARRDVARAKAGLRLREQWLGSEQGRAYVLAQVDRSHAAAKPLLGRRRTLARKIARASKRIERVDKLQQKLRVAEKLGVGAIARPVHVRSPDQLIRSIDQTVMRMARSFSPQQQQHALQQVRAIGRVIGLEW >NC_020561.1|WP_015458296.1|1623356_1623818_-|MobA/MobL-family-protein MAQYRFSAQVISRRDGRSAVAAAAYRAGERLHDERLDMPFDYARRDGVEHSEILLPEGAPARFADRHIVWNAVEAVERRSDAQVAREVQLSLPHELTFEQRLELVRDFARTAFTDRGMIADIALHRPDRHGDERNFHAHILLTTRAIAGESFG >NC_020561.1|WP_144062011.1|1623956_1624268_-|hypothetical-protein MVRRFQLYSACAEPGSRATGPSPSAAPCSGAKGARADLSPIVTGGTADNPRSRANALAWFAAWNARTDRSWTMHPVTIDGGTHAANATDAYRTGLRLLFRPDE >NC_020561.1|WP_015458297.1|1624323_1625310_-|nucleotidyltransferase-domain-containing-protein MTGRSIAAPLQTLFAELLQQAETTDPAGSVYERTRDGITYLYAKLPVGTTRVDRFLGRADDAAAVALAEAMRQGAAQARERRSLVAMLKRGGLAGPDRRLGAALDAIAYAGLFRGGAVLVGTAAYMMSGPLVGHLLPAPTLMTGDLDLATASLALSADPPERMEAILRRADPSFQAIMPLDPGNPASRFRSGDGYLVDLVTPQRSRADPNPKPLKALEAGAAPLQHLAWLIADPVASVALWGAGIPVTIPQPARFAVHKLILAQRREGAHRLKRAKDLAQAQALMAALQRFDPFLLEDALDSARAMGKAGWADPIDRSLKEIARSANP >NC_020561.1|WP_041864841.1|1625486_1625708_+|helix-turn-helix-transcriptional-regulator MMTTIPSPQALGAAVRTARKAAGLRQDELAGVAGVGTRFIVELEAGKPTLQLGKVLAVLAALGLTLHLDGGPA >NC_020561.1|WP_015458299.1|1625790_1626015_+|hypothetical-protein MRIGRAGTIDELDGEAWARFATDAGITFPFFRRRVSALTERIEAAIAGGEDVADVAELRERTMLRARLVWQTTG >NC_020561.1|WP_015458300.1|1626087_1626438_+|WGR-domain-containing-protein MSTIANLACPVHLEAIDSARNMARGYSLWMSRDLFGEWVVETRWGRIGARGQSQVVSFVDGAAARAYVRSVLRRRAGLRRRGGVGYRLVAPCPFSPSSMKLENLMGLEGLIEGLEG >NC_020561.1|WP_015458302.1|1628225_1630091_-|phosphomethylpyrimidine-synthase-ThiC MADIPARTEMTVTTGPIRGSRKIHVGPLGVAMREIDLEPSSGEPPLRVYDCSGPYTDPQARIDIMAGLPELRRDWIRGRGDVEEYAGRAVKPEDNGLSGAIGRNGAVQPFPNVRQRPLRAKAGANVSQMHYAKRGIITPEMEYVAVRENLGREMLKDKLVRDGQDWGASIPDYVTPEFVRDEVARGRAIIPSNINHPESEPMAIGRNFLVKINANIGNSAVASSVAEEVEKMVWAIRWGADTVMDLSTGRNIHDTREWILRNSPVPIGTVPIYQALEKVGGIAEDLTWEIFRDTLIEQAEQGVDYFTIHAGVRLPYIPLTARRVTGIVSRGGSIMAKWCLAHHRESFLYEHFDEITEIMKAYDIAYSLGDGLRPGSIADANDEAQFAELYTLGELTKRAWEQDVQVMIEGPGHVPMHKIKENMDKQLEACGEAPFYTLGPLTTDIAPGYDHITSGIGAAMIGWYGTAMLCYVTPKEHLGLPDRDDVKVGVVTYKLAAHAADLAKGHPAAKLRDDALSRARFEFRWRDQFNLSLDPDTAEQYHDQTLPAEGAKTAHFCSMCGPKFCSMKITQEVRDFAAKQNQPADSFLAAEAAEAGMAEMSKVFKETGGELYMGAGGREHD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020561_4 | 1787810-1787889 | Orphan |
NA
Consensus repeat of NC_020561_4
|
1 spacers
spacers of NC_020561_4
>4.1|1787833|34|NC_020561|CRISPRCasFinder GCCCCGTCAGTATAACAATCTTCGGAATTATGCT |
CRISPR arrays and Neighbor proteins around NC_020561_4
The CRISPR arrays of NC_020561_4 >merge|NC_020561|4|1787810-1787889|CRISPRCasFinder GACGTTGGTTGCGGGGACAGGATGCCCCGTCAGTATAACAATCTTCGGAATTATGCTGACGGTGGTTGCGGGGACAGGAT >NC_020561|4|3|1787810-1787889|CRISPRCasFinder GACGTTGGTTGCGGGGACAGGAT GCCCCGTCAGTATAACAATCTTCGGAATTATGCT GACGGTGGTTGCGGGGACAGGAT
>NC_020561.1|WP_015458458.1|1786139_1787471_+|ferric-reductase-like-transmembrane-domain-containing-protein MRLTRLKLVPVALTLVLVAAWLLSLRTGALTGGFWALRHELIYLTGILAIGFMAAGVVLAARPVQIEGALGGLDKFYRLHKWFGVGGLLLALAHWLLEIIPRWMVGQGWLVRPSRLRASGPAADANLLDSLRGVATELGEIALYILIVLVLLALWKKFPYRWFFKAHRLMAPIFLVLVFHAVVLMDRSYWTAPLGPLMIVLLAAGTVAATTALFRRIGYSRRAAGVITRLVTYPGNAVLDVAVDVGTAWPGHQAGQFVFLKTDDREGAHPFTISSAWHNDGHLLFNIKGLGDYTRKLPDLLRVGQPVTIEGPYGRFDFGGECARQVWIAGGVGITPFIARLQALAQARQERDIDLFYSTGAPDEDFVGQVRDLTEKAGIRFNLLVTPRDGFLTLDRLADLVPDWIEADIWFCGPAAFGRSLYVAMTSRGLPGSQFHQELFEMR >NC_020561.1|WP_015458457.1|1784791_1785907_-|VIT1/CCC1-transporter-family-protein MAEPNALPRYRSNLQGEVDGAAIYAALAESEADPKLAEVFRRLAAVEQAHGDFWRKRIEANGANFRPSPSTRARILAWLARRFGPAFVLPTLAANETRDSAAYDNQPEARGAGLPADERSHALLMRAAAGKGGLSGPTLALLEGRHRGGGNTLRAAVLGANDGLVSNMSLVMGVAGAAAAQQTLLLTGLAGLVAGACSMAMGEWLSVTSSRELYQSQIATEAEELREVPDEEREELVLIYQAKGIDESQARALADKLLSNEGTALDTLAREELGIDPDQLGGSAWTAATWSFLLFSAGAIVPVAPFLFLSGRTALIASLGASGVALALIGAGTSLFTGRSALFSAVRQLIIGLAAAGVTYGAGAIVGISLG >NC_020561.1|WP_144062015.1|1783671_1784682_+|cupin-domain-containing-protein MLDRARFMRDGTTMRADPLSDVLDLADARCVLTGTLVAGGGWARKFNRSDAVKFLAVVRGTCWLSTEADTADPARFEAGDVVITNGAPAIILASTAEWLANAPSTPLERDAEGNLRAGEGSEFTMIGGLLEVDKQRCGFLRESLPPMVHVNGQRGEAAKLRWLLTELAEETQRKRAGSTTAITHLAKLLFVEALRLHIEATKSDRSGWLTALDDRRISIALRGIHAEPSHAWNLEKLAKLSGMSRTSFAVRFRDVVGVPPLTYVLNWRMRLAERELSETDHSVADIAWSIGYGSESAFSNAFSRSTGVSPGRFRKEAMHTYSERRRKVDRSAIDVD >NC_020561.1|WP_015458455.1|1782542_1783610_-|aldo/keto-reductase MPLDHFITLGRSGLRVSPLCLGTMTFGEDFGWGASEAESHAMLSEYRNRGGNFIDTANIYTAGHSEEIVGNYLRQSDLRRDGIVLSTKFYCSLFPGDPNGGGAGRKALIQQCEASLKRLQTDYIDVYWLHNWDQTAPVEETLRGLDDLVTAGKIRYVGFSDVPAWKTAEAQTIAHFRGWAPIIALQLEYSLLERTSEGELFPMAQGMGMGVMPWSPLKSGFLSGKFRRGDAGHVDTRRTAMVGVPSEADYDIIEAVADVASELGVSSASVALAWVRSRAGVSSTLIGARRVDQLKANLDSLDVTLSSEQMKTLDDISRPKLNFPAENNETLAPMLAFPGLTVDGRTLPSMARLSA >NC_020561.1|WP_144062013.1|1781301_1781598_-|helix-turn-helix-domain-containing-protein MMRDNLGSAISISEVASLCRLSLCYFVRAFTNTVGIAPYAWFVQQRIVCAKGLLADTALPLVQVALECGFSDQAHFTKAFAKASGITPAKWRRQICTS >NC_020561.1|WP_015458453.1|1780605_1780989_+|hypothetical-protein MSDEKVPDEAMRRIALALVEHCVRNTRLEDLHAGTVPDSLIGDYSDVKVVTPYGEIPWTQASRISDAEMKALMIDIVNKVYTFLTHLEDVVVLRDSARWNRPEHDPALLAVAKRRAAARGADDERKE >NC_020561.1|WP_041864851.1|1779130_1780486_+|ImmA/IrrE-family-metallo-endopeptidase MILIDGEPAWPVPGETEALVEIQIDDLLAHLTDFWKPLMLRQVYPIDAAPSRPSTLRSIAEAEWEHMQPEAAAAEDEAITRFEEAHDLSHAFAGLYGLPPFWMMRSGEDYILESSRALWRLPFDDVRASLNATGDWICARLHEADAERWQDAIAAWQERDAGDAAGLLAWSTGLDRDLATSLLKEGALEPPQNFNDAANDNDELRIAARMAGALPADQIREIIGLARQFAGHEAEALKALAADAQAMIAERFPHAKPFEQGEAAARFVRERLSITADRAVKVFEMATSLGIELRHNPAEPPSLDGLAIWGPRHGPGVFLNEASGRILGRDDRDVEASLGARVTMAHELCHLLLDGEHALSAIEVLKARMPAGVEQRAKSFAGEFLVPTDIAAEFWHRAKRPVDRAGLDAVVRELIEIYEVTRSVAAWKVEHAARRHAVDLSATLDSVAPHR >NC_020561.1|WP_015458451.1|1777379_1778294_+|HEPN-domain-containing-protein MKTELDHLPVNKRRELDRVIQIIFEEFEDALGQPTGPRKLGRILKIILYGSYATGRWVHEPHTERGYRSDFDLLIIVNQKELTDRAEYWEKAEERLDRETMILNRLRTPVNFIVHTLQEVNDGLAHGRYFFMDLARDGIALYQVDNSELHEPRPKTPQQAYDMAKEYFDQWFDLAVSSRMLFQFAYDNKQFPDAAYNLQQACERLYYCVLLVYTFYTPYSHNIKFLRTRAEKISARLLDAWPRETRKQEAYFNKLKDAYVKARHSKHFKMTEEEFAFLAERVEVLGTIVNELCQERLSELRAQL >NC_020561.1|WP_015458450.1|1776909_1777362_+|type-II-toxin-antitoxin-system-VapC-family-toxin MKGWLLDTHIVSALANPNGASSVKAWATAQPEHRMYLSVLTLAEYDKGIHNLEPDHPDRSRYVAARDALAERFSNRLLSIDDAIVFRWGAISGEVKRRIGQSPPMIDALLAATAIEHDLFLVTRNIKDTRHSGAVIFNPWEDEPSRFPLT >NC_020561.1|WP_015458449.1|1776616_1776913_+|type-II-toxin-antitoxin-system-Phd/YefM-family-antitoxin MGTATRKGDRDQSVPGGTWKLEDAKARFSEVVRRAQSEGPQRVTVRGREAVVVMSVDELDRLMPKDADKPAFVPFLESLGLDGLDLEREIDRGRDVAL >NC_020561.1|WP_144062016.1|1793365_1793728_+|hypothetical-protein MNQHFKEPFLHKYTWYMHIRTFERPGFFRSSDALKPSETGAAAHMSLHQSNNEKEPTAPHPSLRKLGDRFTDLERRVSNPENRVTAAGDRASMPTAPHRQLPISNFIAILSQLTEKYKRI >NC_020561.1|WP_015458460.1|1793904_1794333_-|CBS-domain-containing-protein MTIATILGGKGHDVISVSTGTRVAEVVSLIASKRIGAVPVMDGASVAGILSERDIIYKLQSDGAAILDWPVERVMTAPAITVTGDVPVLHALSLMTKRRIRHLPVVEDGRLAGLVSIGDLVKARIDRIEAEATAMRDYIQGV >NC_020561.1|WP_015458461.1|1794412_1794709_+|hypothetical-protein MEAMTEHRPPHSDPRRLSPSWSHYWRLMRWMVLAAIVAVAAALYYLHVEGGLVSIHMVIATIAGVGASVLLGAGLMLLVFMSSGSGHDEDVGGRKDRP >NC_020561.1|WP_015458462.1|1794705_1795098_+|acyl-CoA-thioesterase MTDATSAHHPRDPILRVVPRPGDINSNGHIFGGWVLSQMDIAGGIVAHRETKGATATVAIDSMAFIAPILLHDLISVYAEVERRGRTSLAIRIEVIATRDAGAQEVKVTEGLFTFVALDENHRPRPLPPR >NC_020561.1|WP_015458463.1|1795113_1795902_-|dioxygenase MRQPSFFIPHGGGPCFFMDDPAGMWTRMEAFLAGFVAGLPERPKALLVVSGHWEEDAFTVQDGARPGLLYDYYGFPPHTYQLHWDAPGAPDVARRAAGLLADAGFATARDAERGWDHGVFVPMKVAVPGADIPTAQLSLRKDLDPAAHIAAGRALAPLRDEGVLIVGSGMSFHNMRVRDGDATGPADIFDAALTAAATDPDPEARARRLSAWSMLPHARFAHPREEHLIPLMVAAGAGGDDPAAHVFADHVIGWKVSGYRFG >NC_020561.1|WP_015458464.1|1796070_1796625_+|(2Fe-2S)-binding-protein MTRFTVNGQPVHYRMDPETPLLWALRDASNLTGTKYGCGAGLCGACTVHIDGAAVRSCQVPIGSIEGSFVTTIEALSRDRSHPVQQAWVAESVPQCGYCQSGIIMAVAAMLEKNPNPSDADIDAEITNICRCGTYPRIRRAIRHAARVAAGGETIAAAPPPGIDPEDAARAIPALTPPKPTGKE >NC_020561.1|WP_084673635.1|1796717_1797632_+|RcnB-family-protein MCAWARSMDMVRGTLVALLLAATAATPALAQSQGWHGQGSQGRSWQGQTGGRGDIGRPGGRMEGPAQRPDTARPAPSANASPRWNGSIARGNSAEGARGPQRPAPQRAGLSGNEDGRDRAALRRGDSWSADPRTRPGSQPADSRYRDRTIRYGDRDAINGRPGNNWNDRDRPGRDNRWDNRERWDDRRDRDRRDNQWDNRNRWSQGWRNDNRYDWQHYRQSNRYIYRLPVYYGPAGHGGYRRWAPGYRLPGVYYVRSYWISDPWYYRLPPIYGPYRWVRYYDDVLLIDTTTGLIEDVIPGFFWR >NC_020561.1|WP_015458466.1|1797852_1798284_+|RcnB-family-protein MRKLIISALIAATAMPLAAQAQTAELKRDRQDIRMAQRYGDRHDVRDAKREYREDWQDYRRNHRDVYRRPAYVGPRGYVYRPVAVGARLGAPYYASRYVISDPYRYRLPKPTGVNRWVRYGNDVLLVNVRTGRVIEAHRAFFW >NC_020561.1|WP_015458467.1|1798387_1800226_+|DUF885-domain-containing-protein MAVRHLLLASAVSLLAALPANAEAQSPAPVAAASQSAHDQLHALFHASDEASLKRNPINAIFRGDLRYADHLGDYVSDAYYDAERAAAEDDLKRLHAIDRASLDATDQIAYDVFEWQTQETLKNLTPEMLALTAVRPIDHFTGFHTFYPSFASGQGAAPFKTLADYENNLKRHKEYVALLDRSIERFRQGMASGVVQSKMTVRNMIDQLDEIIALGVEGSTFYGPVKKFPEGISAADQARLKTAYAAAIRDELIPAHIRLRDFLKNEYLPVAREAVGISAMKGGDKVYLAAIEQLTTLPLTPDYVHQLGLSEVARIRSQMEAIKTQVGFKGTLAEFFHHLRTDPKFKPKSKQQLVDGYYAIGKRVDARIPEQFSTIPKTPLEIRYYEPYREKTQAGGSYEPGMYDPKDPSKNRPGIFYFNTYDLPSRTTPGMETLYLHEGAPGHHFQISLAQENDRLPAFMRYGGNTAYAEGWGLYAETLWKELGMETDPYERFGGLDDEMLRAMRLVVDSGIHAKGWTREQAIKYMLDNSGMGETDATAEVERYIAWPGQALAYKIGQLTMSRLKAKAQAELGARFDPREFHAQILMTGALPMTVLEKKIDGWIASVKAAN >NC_020561.1|WP_015458468.1|1800586_1800805_-|hypothetical-protein MRDRPGRWRTVRAVLQPAGQGAVGAEPVPAYLLACDIQTAPIEREAISIAEAVRWASAQPWPVDLYLHDDGG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020561_5 | 2786536-2786630 | Orphan |
NA
Consensus repeat of NC_020561_5
|
1 spacers
spacers of NC_020561_5
>5.1|2786559|49|NC_020561|CRISPRCasFinder GACAGTGGACACGAGCGGCGATCGCATGCTGGGCACGCTGCTGGGCGGT |
CRISPR arrays and Neighbor proteins around NC_020561_5
The CRISPR arrays of NC_020561_5 >merge|NC_020561|5|2786536-2786630|CRISPRCasFinder GCCGCCGGCGCGCTGCTGGGCCGGACAGTGGACACGAGCGGCGATCGCATGCTGGGCACGCTGCTGGGCGGTGCCGGCGGCGCGCTGCTGGGCCG >NC_020561|5|4|2786536-2786630|CRISPRCasFinder GCCGCCGGCGCGCTGCTGGGCCG GACAGTGGACACGAGCGGCGATCGCATGCTGGGCACGCTGCTGGGCGGT GCCGGCGGCGCGCTGCTGGGCCG
>NC_020561.1|WP_015459337.1|2784885_2786157_+|CoA-transferase MEAGATGAKPQVLAGVKVLDLSRVLAGPWCTQILADFGADVIKVEMPGRGDDTRGWGPPFLDPAPDEPGPGESAYYLSCNRNKRSLALDLSTPEGAAIVRRLAAEADILVENFKVGGLARYGLDYQSLRAVNPRLVYCSITGFGQDGPYADQAGYDFVAQAMGGLMSITGEPDGPPTKVGVAITDITTGIYATVSILVALRHAESTGQGQHIDCSLLDTQISMLANQAMSWLVGGVVPGRLGNAHPTIVPYRLFDAADGSVVVAVGNDGQFRSLCAALGRPDLGTDDRFARNAARVANRDVLEPVLEGLIATRSAAEVIAMLKENGIPGGPVNRIDQIFGDPFVAARGSVHNFVREDGVAVPTVAYPARLSETPADYRRRAPYLGEHSSEILGEWLGIGTSELAGLRGDGVIRDRPGPDGEVP >NC_020561.1|WP_015459336.1|2783835_2784795_+|1-phosphofructokinase-family-hexose-kinase MRRIATLTLNPAIDGACEAERVFPTHKIRTNNERYDPGGGGINVARVVARLGGEAEAYYLAGGVTGAVLDSLIDKAGIARTRIDIHDHTRVSLAVHERASGQEFRFVPEGPLVGDAEWQAALDRLTVAECDYLVVSGSLPRGVPDDFYARTRAAMAPRGVKLVVDTSGAALARTLVDGGIFLMKPSQGELEQLIGRKLADVAAIAEAASAFVAGGQVEHVAVTMGHRGAVLVNAGGAFLLPAVPVEARSAVGAGDSFVGAMTLGFARGWSAAEAFRYGLAAGTAAVLTPGTDLCCREDVERILASVPEPEALTIGASAG >NC_020561.1|WP_084673659.1|2780327_2783564_-|TonB-dependent-receptor MKRSFLRGYLYSATAFASIATACAVPASAQTSEQLYDFNIPSQSLGGALNAFARASHQQITFDPAAVREKQSPALTGQYSARDALDRLLANSGLTVRVGRTGIFIVEKPATPRPSKAEAQNPDLTPAETLDIVVTAQKREEKILDVPIAVSAFSGTQLDRQKIESGADLVRGVPNLNFSKSFSSMYNIGIRGIGTKALNSSSDPGVAVSYNNTPLIRNRLFEQEFFDTSRLEVLRGPQGTLYGRNATGGVVNIFPALPTGEFEGELKGEVGNYETRRVSGMLNIPLTDTFSIRGAGAYTKREGFDYNEFTRNRVNGRDLWSTRLSAQWEPSDRFKANVIWEHFNEDDDRSRTGKALCTTDPGPEMVGSTIVPDRLRSRLSQGCLPGSLYDDAAFGVPNAASQTALYNAQSIVIGIDPNTFASIPLVKAGDPYAGIVQSRNPRRISTAIDLTFKAKNDIVQLNLELKIGESLTLISQSGYSKDRWYSSQDYNRFASNPIFGNTKGLYNVLFEPYADDGPLPDGFYTDSQLGSSDRLLTMDLNRTRTRQWTQEIRLQSDFNNRFNFNVGANYLNFKTTDDYYAFSNLFNLTTDFVYLQDLSKAFGSPPTISFLNCPAASEDPACQYAPYKDKNPLNSIDDLGHNYFLSRNNVKTKSFAIFGEAYYQATDDIKITLGARYTNDKKYQTQIPSQLLLSSSIITGGTVNYGYPALPDIDQGWSRFTGRAVVDWKPNISFTDDTLVYGSVSRGYKGGGANPPRVDFDPRIVQYIPLSDRYKPEGLTAFEIGTKNLLANNTISLNATAFFYDYDNYQISQVTDRITYTENFDAQTWGLELEATWQPNRNFRFNSSLGYLDSKLKKNAKSIDVMDRTQGNPDWTVVRPWLQVPTNCVAPTKYVEKVLSTFPSELALAALCPGSTGIGSYNPNIPPETTVPYWQYLGFTYDPLTEAPNAGRGFDADLGGNELPNSPHLTFNVGAQYTFFLDSDDWELTFRGDYYRQSKSYARVYNTEFDKLKGWGNLNISVSFARPKDQLAFQLYVKNVLNDQPITDVFLSADDIGMPANTFYLDPRIIGFNITKKF >NC_020561.1|WP_144062054.1|2779509_2780052_-|hypothetical-protein MKKLSLICSAALIMAGMSSAANAVTIRKAGNSMVLSGPITTTILGVSTTCTVTAVYDVPEMAGDGHTTFSHSLSTDPSHGHTVNLRSFSMSGGTGCSLATLHGTPTISVSPTTVTISGINATAIGGLITCAGSISGTYTHPGSPPPPNARVTFLNQTVGACTFSGTLTAAAGEFDIDATP >NC_020561.1|WP_144062053.1|2778064_2779378_+|efflux-RND-transporter-periplasmic-adaptor-subunit MAHIPERLKRMDRRSLGWLVVAAGGLFVLALWWRSPGQPEAGGETTEQTMVVQPRPFTASISFAGTIKAGEGTGIVAPFDGTVKEMGFAYGNPVAPGQMLAVLDVSELEQSRNEAESAYLKAMQAARDMEGWASGPEVSRARRAVESARFDLADTERKLAETKTLLDRGLVARTEYDGMLQQLRTQRTAVASANEDMRVALERGSGPNRRVAMLELANARARLAVLNAQFTGAVIRAKDAGIMVRPPANKLAVAAENDVHVGARVSRGQLIGVNARAGDLIVTFNIEEADVALLRLGQRLMVTGAGFMGLALPGKIDAIAGEASNPGGVTTPGKAIFTATASLDPLAPDQAARVRVGMTANIAVMTYNAAAALVVPPSAVRGAAPDTFILVRNQRTGKDSPAKVQIGQVGPDGVEIVSGLKPGDTIVWEDAQSFPSQ >NC_020561.1|WP_144062052.1|2776637_2778089_+|TolC-family-protein MLIACLCGAIAPDLASAQKIAPIAAVRSPPANPVPSGQPVPLTLAETVALGLRDNRTIKSAYLQRVAQKFDLFVADTLFLPKLNLSADIAHQRVGGTTFNTSSVGAAGTWLTPIGTRVQFSWDRRDQLDSGRTGHSDTAALSFTQPLLRGAGTKVNMAPVRIARLQEEINKLSLKSTVIDTVTGIIQAYRRLSQAQSQVELAELSLERTRDLLETNRALIAAGRMAAADIVQTESGVANQEVAVLQARQQLASAQLALLQLLAVDPRTNVVAADEPDAEQADIDLDRVVDLGLSSRVDILGQRLALEQTRIDLAVARNNRLWDLSIGGSVSRQRVDDPILGRLDPPTDHNVGVQLSIPIGDFSYRQREIGATTSLRTAELRYQDLTQSVETQIRDAVQTVEASWQQLAAARRARALAARALELQQEKLKVGRASNFEVLSFQADLRTADTQELTARIGYLNALTSLDQQIGNTLETWRISLND >NC_020561.1|WP_015459331.1|2775277_2776480_+|ABC-transporter-permease MTQRAPATSGIPLAEIIGEAFANLRVQGRRSALALLGILIGTASIVALLNIGHIAQLETLKLFRHLGVDTVQLQATPTGEMPPGFDPDVVAQLPARDPDVLRAVPIITGRASISAGRQTTDAGIVGMPPAFAATVGLAPRLGRLFRPIDNCSPVALVGKGTAEKLSAPGAELLPGAAIIVGNYGFTVIGILMPTALEAINPSDYNESVIVPLACSRRVVAGGVPNIVLAKLRPTADPDIVGQRLSAMLANPRSAIQVISARTYIKTMNAQKAVHSRMLAAVGAISLLVGGIGVMNVMLMGVMERRREIGLRAAIGATPRDLRTMFIVESATLAVAGGLFGALLGLLATYFVARSSGWTFSIAYYVLPLGTGVAGLVGLIFGLYPAITASRLKPIEALRAE >NC_020561.1|WP_051128741.1|2774609_2775281_+|ABC-transporter-ATP-binding-protein MKEVEKAYGVAANPIPVLKGISFSIENGSFCAILGPSGSGKSTLLNIIGLLDHPDRGEVLLGDNAVNFASAEETARLRNRLLGFVFQSFQLLPRLRAWENVALPLLYRGIPKADRRPKALALLDRVGLGHRADHLPSELSGGQCQRVALARALIGDPQLILADEPTGSLDSGTSLEMMDLLKDLSRRLAVTIVMVTHDRQLAERCDRRIELLDGQVIADTVAM >NC_020561.1|WP_144061970.1|2773204_2773965_+|IS5-family-transposase MARHLFWLSDEAWAAIEPHLPHGRPGKPRVDDRTVISGILHVLKTGCRWRDVPAAYGPPTTIYNRYNRWASRGIWQRLFEKIAGAGPVPDELSIDSTHVKAHRSAAGSKKGEWQEAIGRSRGGRTCKVHCLADDRGRPVAIALTPGNVADISMAVPLLSVTAPARRLIGDKAYDANSLRRWLAERRIKAVIPSTASRRTPYPLNRRIYRRRNVIERLFCRLKNWRRIATRYDRYATNYLAAIALVATIAEWIK >NC_020561.1|WP_015459329.1|2771215_2773117_+|ribonucleoside-diphosphate-reductase-subunit-alpha MDLSGSNNEAGASDVATTLEATRAEAGTDSPHGVLKRPYPVEVDHGRDALLTDFGKETLKDRYLLPGESYQDLFVRVASAYADDAAHAQRLYDYISKLWFMPATPVLSNGGTGRGLPISCYLNSVDDSLQAITEIWNENVWLASRGGGIGTYWGNVRGIGEPVGLNGKTSGIIPFVRVMDSLTLAISQGSLRRGSAACYLDISHPEIEEFLEIRKPSGDFNRKALNLHHGVLLTDAFMEAVRDGREWELTSPKDGSVRGKVDARSLFQKLVETRLATGEPYIVFADTVNRAMPKHHRELGLKVSTSNLCSEITLPTGRDHLGNDRTAVCCLSSLNLETWDEWNGDKQFIEDVMRFLDNVLTDYIDRAPPEMARAKYSAMRERSVGLGVMGFHSFLQARGLPFEGAMAKSWNLRMFKHIAAKAQEASMLLASERGACPDAEDRGVMERFSCKMAIAPTASISIICGGTSACIEPIPANIYTHKTLSGSFSIKNVHLQKLLQAKSKDSDAVWNSILEQGGSVQHLDFLNQEEKDTFKTSFEIDQRWLLELAADRTPYIDQATSLNLFIPADVEKWDLLMLHFRAWELGIKSLYYLRSKSIQRAGFAGGVEADNTPDLKKIELATTTDYDECLACQ >NC_020561.1|WP_144062055.1|2786713_2787616_-|hypothetical-protein MHDAGAMPFDQWVMLAFPLAGIALTLWLWKTAERRLWWKLAAGFALFLGLLTVALPYADHDRVQARAIAGEVTTVEGPINGHRRWTERSFAGSSRGVGVTTFDRYKETTYEYFYIGDTPFTFIVGGYPSHASFTNAADPPVAIADGMWARAKFFRDDWYNDERRITWLELAPAPPAGARPIFPASVPRAPPAKAGSNLPPDFAAFWEGFAAAVGRGDAAAVRPLVAFPFHFDSHELGADEFGSLWMSLFAAPLRPCIAAAAPVREGDRYVIFCAGYGYYFAKTASGWKLAEFLADGEAMQ >NC_020561.1|WP_041865409.1|2787733_2788327_-|arylesterase MTLFVTFPALAADKLVVAFGDSLMAGYQLKPGEGFAPRLEAALRRSGIPARVHNAGVSGDTTAQGTARLGWVLGGLKARPDLVIVELGANDMLRGLPNAQTRANLDAILAELKRRRIPAMVAGMQAAPNLGQAYAREFNAIHPALARKYQVPLYPFFLQGVATNKALLLKDGMHPNPRGVDVIVANILPSVRKALGR >NC_020561.1|WP_015459341.1|2788406_2789099_+|ABC-transporter-ATP-binding-protein MSAANIVIEARNVTLALGRGEARVEILRGIDLSIAEGETVALLGPSGSGKSSLMAVLSGLERADAGQVHVAGADFAAMDEDRLARARRGRIGIILQAFHLLPTMTALENVAVPLELAGQADAFARARVELEAVGLGHRTGHYPAQLSGGEQQRVAIARAVAPRPAILFADEPTGNLDARTGAAIMDLLFGRQRETGETLLVITHDPALAHRCGRVIEMLDGRIVSDSRAA >NC_020561.1|WP_015459342.1|2789095_2791591_+|FtsX-like-permease-family-protein MKLAWALALRDLRGGFAGLRLLAICLFLGVMALAGVGSLSSAITSELALQGQSILGGDVQMSIVQRTADPGERAAFAAAGRVSETIRMRAMASRPDGAQAVLAELKGVDGAYPLYGDFRLAPGALGARPRGKEVAIAPALADRLAVKPGDMVRIGDAELRVIGLIAEEPDRVGEGFTFGPAALVDMDGLAATGLVQPGSLYTSRYRIRLPDGQDAANVAKQIADRFPGAGWEVQDRSNAAPGTRRFIGRLGQFLMLVGLTALAVAGIGVGNGVTSYLEGKRNAIATLKVLGASSRTIFLSYLIQIGLVAGAGILAGVVAGSLVPSAVVALAGDALPVQPHFAIHARPLLLAALYGLLIALLFVLAPLARARAVTAASLFRGGVETARRPAFPVLAAMAITLAAIVALAVGTAREPLFAAWFVAAVAGLLLLLTLIGWAVRRIAARLPRPRRPLLRLAIANLHRPGAQTGRLVVALGLGLTLFATLAVIETNLSGQIDSTVPAKAPSFFALDIPVDDIDRFRALVAARAPGAEVRTVPSLRGPVVSFGGKRVADLDTLPEGAWILRGDRGLTYSATPPEGSRVVEGQWWPPDYSGPPLVSLDVEAARILGLKVGDEITVSVLGVEVPATIASLREIKWDTMGFNFVLVYSPGVLEGAPHSYMATIAMPEKGEAALNREITRQFPSVSLIRVKEVIGQVADVLGQLSTAVRSAASVALAAGIAVLVGAIAASRRSRIYDSVLLKLLGATRRQVLAAQAIEYAILASILSLLAALFGALAGWYVVTGVFELDWAPDWMVVGATLAIGGFGTLALGLLGSLPALAARPARALREL >NC_020561.1|WP_015459343.1|2791607_2792387_-|peptidoglycan-editing-factor-PgeF MTQAIDPIRAASLGDIPHAFLGRRGGVSMGIHAGLNVGLGSDDDRDAIRENRRRAVAAVLPDAQLVTLHQVHSADAVKVGAPFPDDARPHADALVTDRPGLLLGILTADCVPVLFADSKAGVIGAAHAGWKGAIGGVTDATIAAMEAIGADRGRIVAAIGPCIARASYEVDEAFLRRFAEDDAENERFFTDGVRARHYQFDIEAYVTARIAAAGIGRVEALGLDTYADPDRFYSFRRATHRGEPGYGRQISLIGLPPHA >NC_020561.1|WP_015459344.1|2792445_2793504_-|SAM-dependent-methyltransferase MTSPSCEERLARLIRAVGPIPIAQFMAEANGAYYASRDPLGAAGDFVTAPEISQMFGELIGLWLADLWQQAGEEPACYVELGPGRGTLAADATRAMRAVGLQPAVHFVETSPALRAAQAERFANAAWHDDLSTLPAGKPLLLVANEFFDALPIRQFVRTVNGWRERMVAHGPDGFVPVPGEVPVDALVPDRLRDAPAGSILESAPMGTAIARDVAGRIAEQGGAAIIIDYGYAGRAAGDTFQAVHAHAYADPFARPGTRDLTAHVDFSAIRQAGEAEGVRVHGPVGQGAWLEAIGIGARTAALSRGSPTRAEEIEAARHRLTDASEMGELFKVMAFVAPGWPEPAGFGAPPA >NC_020561.1|WP_015459345.1|2793520_2794387_-|prolipoprotein-diacylglyceryl-transferase MILTFLADATAALRFDQLGLSPVALDLGFFQLRWYSLAYIAGILIGWWYLLKLLDQPGAPMARRHADDMVFYATLGILIGGRLAYVTFYQPEIWQHPLDVLKLWEGGMSFHGGVIGVSLGIILLARKYQLNWLRIHDYVACCVPFGLFFGRLANFVNGELWGRAADVPWAMIFPRGGDVARHPSQLYEAGLEGILLFAVLWFLFWKTDARYQPGKLVGTFLLGYGLSRFCVEYFREPDAQLMEFAARTHLSMGQWLTVPMILGGLYLILTARGRRQRVEPVAGDQSVA >NC_020561.1|WP_015459346.1|2794484_2796158_+|acyl--CoA-ligase MQAVMDAVTGPGGLVEITHDARGFAMAAKLPATLPDLFRFACGQYGPETALVAGKERLTYADLDMWSERLARSLAGGHGIRKGDRVAIAMRNAPAWIVAYMAAAKAGAIVTLINGWWTPEELAHSLQLSTPSLVIADGPRAQRIADTGIEVRVADLLIDLPIAQALAPLIDGVAEGDLPAVSPEDDATLLFTSGSTGQCKGAVSTHHAVTTATYCFVALTATLLGAFYGGDRNNLPGAPAALVTVPLFHVTGEIPVFVASIVIARKLVLMPKWDATEALRLIEAEKVSYFVGVPTMSLELMQHPDRGRYDLSTLLDIAAGGAPRPAAHVPRLMEAFPQSNPMMGYGLTETNAVGCTNCRGNYAAKPSSTGPAQAPFVHVAIYDDDGNALPPGERGMIGIASAANIRGYWNNPEATAAAFTADGHFLTGDVGYLDEDGYLFIVDRAKDIVIRGGENISCIEVEAALYAYPDVAEASVFGLPDERLGEIVGAVVRMRRGGAVDAVTLLEFLGGHLARFKLPAHLWFSDDPLPRLGTGKIDKRALRERFTRQMEADARAA >NC_020561.1|WP_015459347.1|2796233_2798450_+|xanthine-dehydrogenase-family-protein-molybdopterin-binding-subunit MAISRRNFLVGGGAGAGLLLAWGLWPRSYRPNLVASPGEAIFNAFLKIGEDGHVAVVVPQAEMGQGVWTSLPQVLADELGADWRTIAVEPAPISPLYANDFLIGEAAQGMLPDLLKGVGGWAARQYAIRSALMVTGGSSSIRGFETRFREAGAVARALLCTAAAKRWDADWRACDTAVGFVTRGEDRLRFGELAAEAASLDAPGGVALRAPGAGGLSGRSVPRIDLPSKVDGSARYAGDVRLPDMVFAAVRHGPHGATRLTGVDKAAAEKVPGVIAVVQNPGWAGAVATNGWAAERALDAMRPRFTTDGPFPDSDSIDQALNAALDGGEATRFVAVGDVDAAFVGKQGLKVDYSVPLAVHAAMEPLAATARLIGDRMEVWMPTQAPGLARAAVARALDMSEGQVTIYPMLVGGGFGRKIENDAAVQAAIIAREVRRPVQLTWSRRDDIQQDRFRPAARARMAAALGERGEVVGWQARIAAPAAMASMQSRLMAGGGDPGAKAELSAVEGALPPYAIPAIAIDHLPVDIGIPTGIWRSVANSYTAFFTECFIDELARSAGIEPLSFRMQMLGGNPRLAHCLTTVTAMGGWDGGMPGGNQGLACHSSFGSHVAMLVEAHVGEDQRIVVDRVAAAVDCGRIIHPDIVLQQIEGGIVWGLAAAFGATTGFARGMAEARNFDALNLPLLAGTPDIRVELIPSKEAPGGVGEIAVPPVAPAVANAIFAATGQRLRSLPLAIGGQ >NC_020561.1|WP_015459348.1|2798446_2799454_+|ferrochelatase MNPPADHPAVPQRRIGVLLVNLGTPDAPDASSVRRYLRDFLSDPRVVEIPRLIWQPILHGLILPTRPKKSAHAYAQVWRPDGSPLAAITRAQAAALAGAFGPDVIVDHAMRYGRPAIGDRIRALVAAGCDRILLAPLYPQYSAATTATANDRAFATLAAMRFQPAIRTLPPYFDHPDHIAALKAGIEGALAALDFVPEAIVASFHGMPERTLRLGDPYHCQCQKTARLLGEALGRELIVTFQSRFGRAKWLEPSTDVTLAALPGRGIRKVAIVAPGFAADCLETLEELAIRGRDGFLAAGGEKFAYLPCLNDSGAGIEMLKKLLGAELEGWRAGL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020561_6 | 3573522-3573620 | Orphan |
NA
Consensus repeat of NC_020561_6
|
1 spacers
spacers of NC_020561_6
>6.1|3573548|47|NC_020561|CRISPRCasFinder CGGGGTGGCCGCGGAATAAGGGATGGAAAGGGCCGGCGGCGGTTCGG |
CRISPR arrays and Neighbor proteins around NC_020561_6
The CRISPR arrays of NC_020561_6 >merge|NC_020561|6|3573522-3573620|CRISPRCasFinder CCGCCGGCCTTCCGTTTCAGCCGCGCCGGGGTGGCCGCGGAATAAGGGATGGAAAGGGCCGGCGGCGGTTCGGCCGCCGGCCTTCCGTTTCAGCCGCGC >NC_020561|6|5|3573522-3573620|CRISPRCasFinder CCGCCGGCCTTCCGTTTCAGCCGCGC CGGGGTGGCCGCGGAATAAGGGATGGAAAGGGCCGGCGGCGGTTCGG CCGCCGGCCTTCCGTTTCAGCCGCGC
>NC_020561.1|WP_015460051.1|3572440_3573493_+|ribonucleotide-diphosphate-reductase-subunit-beta MPLLQASRTYKPFEYPWAFEYWKRQQQLHWLPEEVPLGEDCRDWAQKLDQSERNLLTQIFRFFTQADVEVQDCYHDKYGRVFKPTEIKMMLTAFSNMETVHIAAYSHLLDTIGMPETEYSAFLQYKEMKDKHDYLSQFGVDTDEDIARTLAMFGGFTEGLQLFASFAMLMNFPRFNKMKGMGQIVSWSVRDESLHCDGIIRLFHAFVKERNCLTPAVRDDILDQCQKTVRLEDAFIDLVFEMGPVPGMTPKDIKKYVRYIADWRLGQLGFKPIYMIDEHPLPWLAPLLNGVEHANFFETRATEYSKAATRGNWGEVWDAFDRRKAAHNGPAANEDAGGEDMFSRAGVAAE >NC_020561.1|WP_015460050.1|3572071_3572335_+|hypothetical-protein MTAPGWTLIVPIAVFNIVIGLWTLRDAARHNHYIKSRIGANDPLFEEHSRHPDFPGLKEVSSARAKGIILLLSGVVLLMLLYLPWAG >NC_020561.1|WP_015460049.1|3571365_3571770_+|DUF805-domain-containing-protein MEWMLLPLKRYADFNGRSSRREFWMFAALHALVALLFYVPLSGIFFRGMAGVLPATLGVIVPLLGLYVAVMFVPGLAVQVRRFHDLGRPGWMVLIGFVPVVGVFAILYFMCLPGTSGPNRYGADPVAEDVAIRP >NC_020561.1|WP_015460048.1|3570056_3571148_-|DNA-polymerase-IV MGQPERPAVTRKIIHIDMDAFYASVEQRDSPELRGRPVAVGGSSARGVVAAASYEARRYGVRSAMPSVTATRKCPELVFVRPRFDVYKAVSRQIREIFAEYSDLVEPLSLDEAYLDVTANRQQLPSATATAEAIRARILAETGLTASAGISYNKFLAKLASDQNKPNGQCVITPAQGEAFVAGLEVGRFHGIGPRTAEKLNRFGIHTGADLRAKDAEWLRRHFGKSGAWYHAIARGIDDRPVTPDRPRKSSGSETTYFEDLATAEAVENGVRAMADEVWGWCERTRAAARTVTVKVKYADFQQITRSRTLPATIDSQAMLHAVSVDLVRTIFPLVKSVRLLGVTLSNFEDEQSAAQAQLAFVL >NC_020561.1|WP_015460047.1|3569287_3569992_-|2OG-Fe(II)-oxygenase MTRHRIEAIDRQAIAAGLDGGGWALLPGLLDPAGCADMAGLYDRPAGFRSTVTMARHGFGRGEYRYFAYPLPPLVETLRAAFYRLLAPIANRWQERMGLAARFPEEHRDFLAHCHAAGQARPTPLMLRYGPGDHNCLHQDLYGEHVFPLQAAILLSAPGADFTGGEFVLTEQRPRMQSRVEVVPLAQGDAVVFAVNQRPIAGGRGDYRVTMRHGVSSVRSGRRHMLGIILHDAA >NC_020561.1|WP_041865516.1|3568635_3569283_-|DNA-oxidative-demethylase-AlkB MSAGTDLFDAEPRDQALSPGAMVLGGFARDMDRDLLAAIEGVLADAPPRHLVTPGGRRMSVAMSNCGGVGWVSDRRGYRYDPIDPESGRRWPAMPDIFTDLAIRAAAAAGFAGFVPDACLINRYEPGARLSLHQDRDERDRAAPIVSVSLGLPATFLWGGEKRSDRPRRIRIVHGDVTVWGGPARFAFHGVEPVADGAHPLTGRARYNLTFRKVF >NC_020561.1|WP_015460045.1|3567735_3568551_-|molybdate-ABC-transporter-substrate-binding-protein MQSFGRRAILALAGAFALAGSLAPAAIAAPADEPAIAAAADLNAALPQIADLFRRKTGRTVKLTFGASGNLTQQILNGAPFQLFLSADESYVARLAEAGRTVDGGTLYATGRIGLFTPRGSPVKADGRLADLAAAIRDGRLRKFAIANPEHAPYGRAAREALTTAKLWDAIQPRLVLGENVAQATQFATSGSADGGIIPLSLAMTPQVQAAGRFALIPAEWHKPLRQRAVLMKGAGETARAFYAFMQSPEAHKLLDHYGFTLPRTGQSKPR >NC_020561.1|WP_015460044.1|3567066_3567735_-|molybdate-ABC-transporter-permease-subunit MDWTAFALSLKLAGWTAALLLPIGLVASRALAFHARRSRPLFEAAVALPLVLPPTVLGYYLLVAFGGASPLGKLWTDLFGHGLAFSFHGLLAASVLINIPFAVQPMQRAFEALPADIREAAWVSGLTPWATFWRIELPLAWPGVLSAFVLTFAHTLGEFGVVLMVGGSIPGETRTAALAIYDRVQAFDNQAAGAMSLLLLLISIIAILIVHGLSGRIGRRRG >NC_020561.1|WP_187294010.1|3566354_3567074_-|ATP-binding-cassette-domain-containing-protein MAEGLGVSLAMARPVPIAVDFTCAPGELVALIGPSGAGKTTILRAIAGLDRAAAGRIACRGETWLDSAAGIRLPPHCRRVGLVFQSYALFPHLTAIGNVAAAIEGRPRGERLRRAAELLALVHLDGLEQRRPAELSGGQQQRVALARALAREPEALLLDEPFSAVDRRTRRRLREELAELRGRVRAPIILVTHDLDEATALADRLVVIDQGAMLQQGRPADVLAAPASERVRAALDLEG >NC_020561.1|WP_015460042.1|3565688_3566102_-|nucleoside-diphosphate-kinase-regulator MTKTDIPPSTRPPLHIIDSEYDAIAGIAMRAEHSQPELARLLMAELDRAEICDAASLPPDTAAMHSRISFIDEGSGASRTVELVYPQEADIEAGKISILTHVGAGLIGMRAGSSILWPDRDGRERRLKIVRIERPAP >NC_020561.1|WP_051128854.1|3574603_3575254_+|glutathione-S-transferase-family-protein MITVHHLENSRSHRILWLMEELGLDYAIERYKRRDRLFSPPEYERLHGLGKAPVITDGGRVVAESGAIIEYVIEVHGGGRLRPPVGSDDWVRYLQWMHLIEGSVMLPYIMGIYLEMLGPAGAPIHERIHGEIDRHFGFMERELSGRDHVVGDALTGADIQAAFVMEAASLRGMLDPYPALRRYLALMQARPAYRRALEKGGAHDLDDLRKGWQGRD >NC_020561.1|WP_015460054.1|3575276_3575717_-|DUF2834-domain-containing-protein MTMKELFYVAIGLVAIALTIYPNRHLLSRRAGGVSALEGFYYLIAIAALLVGWYFNFRFMREYGDEATWANWVRLLFVNPASASGGQDLLFANAVLFIPWTIVDGRRAGMKWNWIWFPMSAVTSFAFAMALFLALKERQLRWKAEA >NC_020561.1|WP_015460055.1|3576026_3578162_-|NAD-dependent-DNA-ligase-LigA MTTPPFPTDALAAAERLAWLAAEIARHNALYHDNDAPEISDAEFDALVRENNAIEAAFPHLVRADSPSRAVGSTPSGPLAKVTHAKAMLSLDNAFADEDVAEFVERIRRFLRLADDVPVAMTAEPKIDGLSCSIRYENGRLVQAATRGDGQVGEDVTPNVLTIADIPHRLPAGAPDLFEVRGEVYMAKADFRALNARLLAEAPDPEKARQFANPRNAAAGSLRQKDAAVTAARPLRFLAHGWGEVSALPADSQYGVMRAIAGWGLPVSDALVLVDSVAAMLAHYRAIETERADLPFDIDGVVYKVDRLDLQERLGFVARAPRWAIAHKFPAEQAQTTLRAIDTQVGRTGKITPVARLEPVTVGGVVVTNATLHNADEIERLGVRPGDRVVVQRAGDVIPQIVANLTREEPRAPWHFPTQCAECGSALAREEGEVDWRCTGGLICPAQRVERLRHFVSRHALDIEGLGLTHIEAFFRDGLIHSPADIYRLHERREALIARERWAETSVDNLIRAIDARRTPPLDRLLFALGIRHVGEVTARDLARRYSTWEALTAMIDAARARRAELVQAVGETDEKFRARTAKELAAIVETAGVGPEVAQALVDFFDEPHNQEVLADLLAQVTIEPVIHQTRASEVSGKTVVFTGSLETMSRDEAKAQAEALGAKTAGSVSSKTDLVVAGPGAGSKLKKAAELGIRVIDEAEWQAIVAAAG >NC_020561.1|WP_015460057.1|3578653_3579418_-|response-regulator-transcription-factor MRVLLVDDEVLALDRLKALFANVDGAEVVGQAMTGEEALEAIVTLKPDLVILDIQMPGRNGLRTAADIDVDPRPEIVFVTAHEHYAPDAFDVDAADYVLKPIRFDRLRQAVERARRRRVLREQAERVDVLEEQVQTLRSSAAESRDDAAFWIPERHGQRRVPLETINWIEAARDYVLLHTEMRSHMLRTTMSALEEKLAGSGLIRVHRSAFVRPERVMEVRRANRSIALVLEDGAEVQVGPSYSQVVDSALGLN >NC_020561.1|WP_015460058.1|3579490_3580687_-|histidine-kinase MELALRDESVILGQNGSHTHRGVTTFMPAAKSDARWADAVPLTIGLWLFMLLVFMPGIIARHPGDWVGVAIDSSTVCLSIGLGLLLFILFRGTADWQGGPRLVLMVAATIGMALASTIFDLKFTDWGARNLGGNWLAIPVDFKRASQSLLNYLCVFSVNVALFQFSFSRRRSLTRERQLAAAETAARQAELEALRLQLNPHFLFNTLNAISSLIVTRRNEDAEEMTDKLSSFLRASLACNPTELVPLEEELDLMADYLSIEAVRFGERLRVEISCTPEARAVHVPGLLIQPLVENAVKYGVARSAQPVTIAIDAVVDEGDLCIVITNDGGAGLPSVKSTATGVGLRNVRRRLAALYGERASLVAEPVGAGFLARICLPIDKDVVAALLHRQQGLPLPR >NC_020561.1|WP_015460059.1|3580918_3582208_+|L,D-transpeptidase-family-protein METSRTVGGRLRRSGRLVRWLAAAGATAMTTLALAGEPMTMGAGPAEASTAAPAAMAAAPTPAERWRPTDVAALLEEIDAAPGEGLDAAPYGGDAIRREMASGQGGAALDALADAAALRLAGDYLNGRVADRAGFDWHIERTDADPARLQAGLRQALAAGQVRPWLRSLLPADPRYAALREALAATPPADAGRRDRLAANMERWRWLPRDLGADHIYVNVPSYTLDLVDDGKPVSSYTVVVGAPATPTPQIAMAASSVVVNPWWNVPASIIRSSRLRPGAVNPARGYEFYPVGGGRYAVRQRPGPGNALGRIKIDMPNAHAIYLHDTPAKAYFDKPSRAFSHGCIRVKDIDRLAEEMVRLDHGRTADIERGLAGRTTTTVKLDTARPVWLVYFTAQAGPDGKVAMLEDPYNRDPRLIARLNGPMRLASR >NC_020561.1|WP_015460060.1|3582317_3583745_-|DEAD/DEAH-box-helicase MTFADLGLSDELLRAVAEAGYDEPTPIQAQAIPPVLMMKDLIGIAQTGTGKTASFVLPMIDILAHGRSRARMPRSLILEPTRELAAQVAENFEKYGKYHKLSMALLIGGVNMGDQVAALEKGVDVLIATPGRLMDLFQRGKILLTGCSLLVIDEADRMLDMGFIPDIEEICTKLPAQRQTLLFSATMPAPIKKLADRFLNNPKRIEVARVGTANASIEQKLVECQPRAKREVLRNLLSADDVRTAIIFCNRKTTVRELTTSLQRHGFHASQIHGDMDQSERLRELDRFKNGEINILVASDVAARGLDIKGVSHVFNFDVPWHPDDYVHRIGRTGRAGATGKAFTLVTPDDAEAVENIEKLAQQKIPRIGEAKPARAPAAAAEEKPARRARGAKAKPAEAEAKRADTEPKRADAEPKRAKAEDQPRREEKPRREERPARAAAAAPRHERRPADDGPGEGWNGPIPSFLDFGFGTRS >NC_020561.1|WP_015460061.1|3583856_3584303_-|hypothetical-protein MIGAELLSALLLSSGAAPPPESAAVTARFAQLTIRESVIIRVPTRGRQAIAPIEWKEGKGPKCLPMSEVAGATAVEEDSVDIILRGGGRVRAEFEDECPALDYYNGFYIRPTEDRRICAGRDSIHARSGGECQIRRFRTLTPVEGKKK >NC_020561.1|WP_015460062.1|3584408_3585857_+|FAD-binding-oxidoreductase MVSSPPDPAFLECLANRLGPRGFTADPADIDPWTIDWRGRVRGSAVALLSPADTTETADIVAMCAAAGVPLVPQGGNTSMVAGATPPANGSALILSTRRMRAIRSISAADGVAVVEAGVVLADLHDAAAVHGLRFPLSLAAKGSATIGGLVSTNAGGTQVLRFGPMRSLVLGIEAVLPDGSRFDGLSALRKDNRGYDLRQLLTGAEGTLGIVTAASLRLVPAIGRRAVAWAGLDSPQAALALLRRLEAATGEAVESFELVPDDALDLVIRHIPGSRAPLGGAHRWHALIEATAPQGAADPADALGQVLGQAMADGGVGDATIAASEAQAEALWRLRESISDAERADGMAAKHDISVPVSAMPDFILSARVAVEAAFPGTRVIAFGHLGDGNVHFNVRAPAGIPATGAEGMAWLAETGAAVSRMVNDLTVAAGGSISAEHGIGQTKLAEYARLADPARLAAQQAIKAALDPRWLMNPGKLVPR >NC_020561.1|WP_015460063.1|3585975_3586752_+|SapC-family-protein MASAPPSGLPLFYNQLQPLSSSLHADYVLRQRDSVPFLAGVHAVPLTVEEFGLAQRHYPIVFSSGPNPVPLALMGLNEGVNMFVGEDGKLAGDAYIPAYVRRYPFMLAKLQPNSEELSLCFDPTSDTVGQGGEGAALFADGQPSDATKGILGFCEQFEQAGQRTAAFMQELVDLKLLIDGEVSIQPEGAPQPFIYRGFQMIAEDKLRELRGDQARKLIQSGLLALVYAHLFSLSLIRDLFARQLQAGKVPAQQPQLQV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP020908 | Rhizobium etli strain NXC12 plasmid pRetNXC12b, complete sequence | 234486-234515 | 4 | 0.867 |
NC_020561_3 | 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT | 1616620-1616649 | 30 | NZ_LR594668 | Variovorax sp. SRS16 plasmid 3 | 336113-336142 | 5 | 0.833 |
NC_020561_3 | 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT | 1616620-1616649 | 30 | NZ_LR594673 | Variovorax sp. PBL-E5 plasmid 3 | 515499-515528 | 5 | 0.833 |
NC_020561_3 | 3.2|1616686|30|NC_020561|CRISPRCasFinder,CRT | 1616686-1616715 | 30 | JQ680373 | Unidentified phage clone 2209_scaffold64 genomic sequence | 36556-36585 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NC_007764 | Rhizobium etli CFN 42 plasmid p42c, complete sequence | 235268-235297 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013597 | Rhizobium sp. N741 plasmid pRspN741b, complete sequence | 299039-299068 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NC_021907 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1b, complete sequence | 237012-237041 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013501 | Rhizobium esperanzae strain N561 plasmid pRspN561a, complete sequence | 299405-299434 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013507 | Rhizobium sp. N1341 plasmid pRspN1341b, complete sequence | 299039-299068 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013518 | Rhizobium sp. N113 plasmid pRspN113a, complete sequence | 299405-299434 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013491 | Rhizobium sp. N6212 plasmid pRspN6212a, complete sequence | 299408-299437 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013496 | Rhizobium sp. N621 plasmid pRspN621a, complete sequence | 299408-299437 | 5 | 0.833 |
NC_020561_3 | 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616818-1616847 | 30 | NZ_CP013591 | Rhizobium sp. N871 plasmid pRspN871a, complete sequence | 299408-299437 | 5 | 0.833 |
NC_020561_3 | 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616752-1616781 | 30 | NC_010463 | Enterobacteria phage Fels-2, complete genome | 14569-14598 | 6 | 0.8 |
NC_020561_3 | 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616752-1616781 | 30 | KT630647 | Salmonella phage SEN8, complete genome | 10144-10173 | 6 | 0.8 |
NC_020561_3 | 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616752-1616781 | 30 | NC_019488 | Salmonella phage RE-2010, complete genome | 19297-19326 | 6 | 0.8 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | NC_018022 | Mycolicibacterium chubuense NBB4 plasmid pMYCCH.01, complete sequence | 447174-447203 | 6 | 0.8 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | MF063068 | Pseudomonas phage Noxifer, complete genome | 179629-179658 | 6 | 0.8 |
NC_020561_3 | 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617346-1617375 | 30 | NZ_CP017563 | Paraburkholderia sprentiae WSM5005 plasmid pl1WSM5005, complete sequence | 123439-123468 | 6 | 0.8 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP017076 | Novosphingobium resinovorum strain SA1 plasmid pSA1, complete sequence | 577381-577410 | 6 | 0.8 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP046333 | Cupriavidus metallidurans strain FDAARGOS_675 plasmid unnamed3 | 981029-981058 | 6 | 0.8 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_007974 | Cupriavidus metallidurans CH34 megaplasmid, complete sequence | 66193-66222 | 6 | 0.8 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_004808 | Streptomyces rochei plasmid pSLA2-L DNA, complete sequence | 139150-139179 | 6 | 0.8 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP038146 | Streptomyces sp. S501 plasmid unnamed, complete sequence | 56420-56449 | 6 | 0.8 |
NC_020561_3 | 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT | 1616620-1616649 | 30 | NC_019388 | Thermus oshimai JL-2 plasmid pTHEOS02, complete sequence | 24066-24095 | 7 | 0.767 |
NC_020561_3 | 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT | 1616620-1616649 | 30 | NZ_CP010824 | Thermus aquaticus Y51MC23 plasmid pTA16, complete sequence | 12430-12459 | 7 | 0.767 |
NC_020561_3 | 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT | 1616620-1616649 | 30 | NC_016586 | Azospirillum lipoferum 4B plasmid AZO_p2, complete sequence | 508762-508791 | 7 | 0.767 |
NC_020561_3 | 3.2|1616686|30|NC_020561|CRISPRCasFinder,CRT | 1616686-1616715 | 30 | NZ_CP017563 | Paraburkholderia sprentiae WSM5005 plasmid pl1WSM5005, complete sequence | 829111-829140 | 7 | 0.767 |
NC_020561_3 | 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616752-1616781 | 30 | NC_049453 | Klebsiella phage ST13-OXA48phi12.1, complete genome | 21071-21100 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | JX163858 | Caulobacter phage phiCbK, complete genome | 80456-80485 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | KY555147 | Caulobacter phage Ccr34, complete genome | 127815-127844 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | KY555145 | Caulobacter phage Ccr29, complete genome | 131672-131701 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | KY555143 | Caulobacter phage Ccr2, complete genome | 127171-127200 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | KY555146 | Caulobacter phage Ccr32, complete genome | 127375-127404 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | KY555142 | Caulobacter phage Ccr10, complete genome | 126696-126725 | 7 | 0.767 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | NZ_LS974446 | Rhizobium selenitireducens ATCC BAA-1503 isolate T2.30D-1.1_plasmid plasmid 1, complete sequence | 154923-154952 | 7 | 0.767 |
NC_020561_3 | 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617346-1617375 | 30 | NC_017958 | Tistrella mobilis KA081020-065 plasmid pTM3, complete sequence | 242733-242762 | 7 | 0.767 |
NC_020561_3 | 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617346-1617375 | 30 | NZ_CP031752 | Rhodobacter sphaeroides strain EBL0706 plasmid p.A, complete sequence | 170913-170942 | 7 | 0.767 |
NC_020561_3 | 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617346-1617375 | 30 | NZ_AP022334 | Methylosinus sp. C49 isolate Methylosinus sp. C49 plasmid pMSC49b, complete sequence | 141743-141772 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP016613 | Ralstonia solanacearum FJAT-91 plasmid unnamed1, complete sequence | 576650-576679 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021449 | Ralstonia solanacearum strain SEPPX05 plasmid pSEPPX05, complete sequence | 2035492-2035521 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049794 | Ralstonia solanacearum strain 204 plasmid unnamed, complete sequence | 583048-583077 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049788 | Ralstonia solanacearum strain B2 plasmid unnamed, complete sequence | 1882998-1883027 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP039340 | Ralstonia solanacearum strain UW386 plasmid pUW386, complete sequence | 868075-868104 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_016113 | Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCAT, complete sequence | 1250244-1250273 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_016113 | Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCAT, complete sequence | 85518-85547 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP012940 | Ralstonia solanacearum strain UW163 plasmid unnamed, complete sequence | 1919859-1919888 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP012944 | Ralstonia solanacearum strain IBSBF1503 plasmid unnamed, complete sequence | 1923526-1923555 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049792 | Ralstonia solanacearum strain 203 plasmid unnamed, complete sequence | 364524-364553 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP010871 | Confluentimicrobium sp. EMB200-NS6 strain EMBL200_NS6 plasmid pNS6002, complete sequence | 36532-36561 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP015851 | Ralstonia solanacearum strain YC40-M plasmid, complete sequence | 329400-329429 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_AP014687 | Bradyrhizobium diazoefficiens strain NK6 plasmid pNK6c, complete sequence | 100889-100918 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022791 | Ralstonia solanacearum strain SL3103 plasmid unnamed, complete sequence | 1955851-1955880 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022482 | Ralstonia solanacearum strain HA4-1 plasmid HA4-1MP, complete sequence | 1030745-1030774 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP047139 | Ralstonia solanacearum strain CFBP 8695 plasmid unnamed, complete sequence | 85299-85328 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP051295 | Ralstonia solanacearum strain CIAT_078 plasmid megaplasmid, complete sequence | 1896252-1896281 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP047137 | Ralstonia solanacearum strain CFBP 8697 plasmid unnamed, complete sequence | 74680-74709 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP026091 | Ralstonia solanacearum strain IBSBF 2570 plasmid unnamed, complete sequence | 83370-83399 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_014309 | Ralstonia solanacearum CFBP2957 plasmid RCFBPv3_mp, complete genome | 62380-62409 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP023013 | Ralstonia solanacearum strain T110 plasmid unnamed, complete sequence | 52411-52440 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021653 | Ralstonia solanacearum strain RS 488 plasmid unnamed, complete sequence | 67761-67790 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_014310 | Ralstonia solanacearum PSI07 plasmid mpPSI07, complete sequence | 64548-64577 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022762 | Ralstonia solanacearum strain T95 plasmid unnamed, complete sequence | 94237-94266 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049790 | Ralstonia solanacearum strain 202 plasmid unnamed, complete sequence | 68602-68631 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP020716 | Cnuibacter physcomitrellae strain XA(T) plasmid unnamed1, complete sequence | 201984-202013 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022766 | Ralstonia solanacearum strain T78 plasmid unnamed, complete sequence | 53642-53671 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021763 | Ralstonia pseudosolanacearum strain RS 476 plasmid unnamed, complete sequence | 89259-89288 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP026093 | Ralstonia solanacearum strain SFC plasmid unnamed, complete sequence | 83364-83393 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021767 | Ralstonia solanacearum strain RS 489 plasmid unnamed, complete sequence | 67788-67817 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP015116 | Ralstonia solanacearum strain EP1 plasmid unnamed, complete sequence | 212409-212438 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP016555 | Ralstonia solanacearum FJAT-1458 plasmid plas1, complete sequence | 1782211-1782240 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP012688 | Ralstonia solanacearum strain UY031 plasmid unnamed, complete sequence | 67761-67790 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052069 | Ralstonia solanacearum strain FJAT91.F50 plasmid Plas1, complete sequence | 53379-53408 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP016915 | Ralstonia solanacearum strain CQPS-1 plasmid unnamed, complete sequence | 663425-663454 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP016905 | Ralstonia solanacearum strain KACC 10709 plasmid unnamed1 | 1092381-1092410 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP025986 | Ralstonia solanacearum strain RSCM plasmid p-unname2, complete sequence | 285048-285077 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022769 | Ralstonia solanacearum strain T60 plasmid unnamed, complete sequence | 54851-54880 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP023017 | Ralstonia solanacearum strain SL3022 plasmid unnamed, complete sequence | 66589-66618 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_017585 | Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCATT, complete sequence | 563292-563321 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_017585 | Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCATT, complete sequence | 1727600-1727629 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022773 | Ralstonia solanacearum strain T42 plasmid unnamed, complete sequence | 57473-57502 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022783 | Ralstonia solanacearum strain SL3755 plasmid unnamed, complete sequence | 52461-52490 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP014703 | Ralstonia solanacearum strain KACC 10722 plasmid, complete sequence | 94237-94266 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022760 | Ralstonia solanacearum strain T98 plasmid unnamed, complete sequence | 67114-67143 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022789 | Ralstonia solanacearum strain SL3175 plasmid unnamed, complete sequence | 67114-67143 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022795 | Ralstonia solanacearum strain SL2330 plasmid unnamed, complete sequence | 52466-52495 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052071 | Ralstonia solanacearum strain FJAT454.F1 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_017575 | Ralstonia solanacearum Po82 megaplasmid, complete sequence | 83336-83365 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022771 | Ralstonia solanacearum strain T51 plasmid unnamed, complete sequence | 94247-94276 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022777 | Ralstonia solanacearum strain T11 plasmid unnamed, complete sequence | 94266-94295 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022799 | Ralstonia solanacearum strain SL2064 plasmid unnamed, complete sequence | 94237-94266 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP009763 | Ralstonia solanacearum OE1-1 plasmid unnamed, complete sequence | 53345-53374 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP023015 | Ralstonia solanacearum strain T25 plasmid unnamed, complete sequence | 52450-52479 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022779 | Ralstonia solanacearum strain SL3882 plasmid unnamed, complete sequence | 54851-54880 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052075 | Ralstonia solanacearum strain FJAT448.F1 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052085 | Ralstonia solanacearum strain FJAT15353.F8 plasmid Plas1, complete sequence | 70942-70971 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052095 | Ralstonia solanacearum strain FJAT15340.F1 plasmid Plas1, complete sequence | 53393-53422 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052105 | Ralstonia solanacearum strain FJAT15252.F1 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP026308 | Ralstonia solanacearum strain IBSBF 2571 plasmid unnamed, complete sequence | 83336-83365 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021765 | Ralstonia pseudosolanacearum strain CRMRs218 plasmid unnamed, complete sequence | 89264-89293 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052077 | Ralstonia solanacearum strain FJAT445.F50 plasmid Plas1, complete sequence | 55213-55242 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052087 | Ralstonia solanacearum strain FJAT15353.F50 plasmid Plas1, complete sequence | 70942-70971 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052097 | Ralstonia solanacearum strain FJAT15304.F6 plasmid Plas1, complete sequence | 53393-53422 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052115 | Ralstonia solanacearum strain FJAT1463.F50 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052127 | Ralstonia solanacearum strain FJAT1303.F50 plasmid Plas1, complete sequence | 70942-70971 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052079 | Ralstonia solanacearum strain FJAT445.F1 plasmid Plas1, complete sequence | 55213-55242 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052089 | Ralstonia solanacearum strain FJAT15353.F1 plasmid Plas1, complete sequence | 70942-70971 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052093 | Ralstonia solanacearum strain FJAT15340.F50 plasmid Plas1, complete sequence | 53393-53422 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052101 | Ralstonia solanacearum strain FJAT15304.F1 plasmid Plas1, complete sequence | 53393-53422 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052099 | Ralstonia solanacearum strain FJAT15304.F50 plasmid Plas1, complete sequence | 53393-53422 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052107 | Ralstonia solanacearum strain FJAT15249.F50 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022781 | Ralstonia solanacearum strain SL3822 plasmid unnamed, complete sequence | 53640-53669 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052117 | Ralstonia solanacearum strain FJAT1463.F1 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052125 | Ralstonia solanacearum strain FJAT1452.F1 plasmid Plas1, complete sequence | 55213-55242 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022793 | Ralstonia solanacearum strain SL2729 plasmid unnamed, complete sequence | 57474-57503 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022785 | Ralstonia solanacearum strain SL3730 plasmid unnamed, complete sequence | 57470-57499 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022787 | Ralstonia solanacearum strain SL3300 plasmid unnamed, complete sequence | 54818-54847 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022756 | Ralstonia solanacearum strain T117 plasmid unnamed, complete sequence | 55706-55735 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052129 | Ralstonia solanacearum strain FJAT1303.F1 plasmid Plas1, complete sequence | 52215-52244 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052121 | Ralstonia solanacearum strain FJAT1458.F1 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052123 | Ralstonia solanacearum strain FJAT1452.F50 plasmid Plas1, complete sequence | 55213-55242 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052131 | Ralstonia solanacearum strain FJAT1303.F8 plasmid Plas1, complete sequence | 70942-70971 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP011998 | Ralstonia solanacearum strain YC45 plasmid, complete sequence | 89661-89690 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052073 | Ralstonia solanacearum strain FJAT448.F50 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052081 | Ralstonia solanacearum strain FJAT442.F50 plasmid Plas1, complete sequence | 55213-55242 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052083 | Ralstonia solanacearum strain FJAT442.F1 plasmid Plas1, complete sequence | 55213-55242 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052109 | Ralstonia solanacearum strain FJAT15249.F1 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052091 | Ralstonia solanacearum strain FJAT15340.F6 plasmid Plas1, complete sequence | 53393-53422 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052111 | Ralstonia solanacearum strain FJAT15244.F50 plasmid Plas1, complete sequence | 53768-53797 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052103 | Ralstonia solanacearum strain FJAT15252.F50 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052119 | Ralstonia solanacearum strain FJAT1458.F50 plasmid Plas1, complete sequence | 68443-68472 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052113 | Ralstonia solanacearum strain FJAT15244.F1 plasmid Plas1, complete sequence | 53768-53797 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | MT316461 | Streptomyces phage Galactica, complete genome | 65519-65548 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP012477 | Arthrobacter sp. ERGS1:01 isolate water plasmid unnamed2, complete sequence | 47264-47293 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_AP014705 | Methylobacterium aquaticum strain MA-22A plasmid pMaq22A_1p, complete sequence | 1523942-1523971 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | MN284893 | Mycobacterium phage LilMcDreamy, complete genome | 68798-68827 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP019036 | Massilia putida strain 6NM-7 plasmid unnamed1, complete sequence | 61645-61674 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_023316 | Streptomyces sp. 14R-10 plasmid pZL1, complete sequence | 119713-119742 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP025016 | Rhizobium leguminosarum strain Norway plasmid pRLN4, complete sequence | 160562-160591 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_LR134452 | Tsukamurella tyrosinosolvens strain NCTC13231 plasmid 10, complete sequence | 335987-336016 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 99240-99269 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 99639-99668 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 100038-100067 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 100437-100466 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 100836-100865 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 101235-101264 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 101634-101663 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 102033-102062 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 102432-102461 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | CP053919 | Serratia marcescens strain LY1 plasmid unnamed1, complete sequence | 102831-102860 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP022363 | Azospirillum sp. TSH58 plasmid TSH58_p03, complete sequence | 266255-266284 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | JN564907 | Burkholderia phage AH2, complete genome | 12157-12186 | 7 | 0.767 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | MN813697 | Mycobacterium phage Noelle, complete genome | 29848-29877 | 7 | 0.767 |
NC_020561_3 | 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1616752-1616781 | 30 | NZ_CP014683 | Kozakia baliensis strain NBRC 16680 plasmid pKB16680_2, complete sequence | 79945-79974 | 8 | 0.733 |
NC_020561_3 | 3.9|1617148|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617148-1617177 | 30 | MN034485 | Leviviridae sp. isolate H2_Bulk_34_354 hypothetical protein (H2Bulk34354_000001) gene, partial cds; and hypothetical protein (H2Bulk34354_000002) and RNA-dependent RNA polymerase (H2Bulk34354_000003) genes, complete cds | 2356-2385 | 8 | 0.733 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | NC_019410 | Caulobacter phage CcrKarma, complete genome | 127890-127919 | 8 | 0.733 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | NC_019407 | Caulobacter phage CcrMagneto, complete genome | 126091-126120 | 8 | 0.733 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | KY555144 | Caulobacter phage Ccr5, complete genome | 127063-127092 | 8 | 0.733 |
NC_020561_3 | 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617280-1617309 | 30 | NC_019411 | Caulobacter phage CcrSwift, complete genome | 126654-126683 | 8 | 0.733 |
NC_020561_3 | 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617346-1617375 | 30 | NZ_CP047174 | Rathayibacter sp. VKM Ac-2760 plasmid unnamed1, complete sequence | 144341-144370 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_018022 | Mycolicibacterium chubuense NBB4 plasmid pMYCCH.01, complete sequence | 111022-111051 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP039340 | Ralstonia solanacearum strain UW386 plasmid pUW386, complete sequence | 522758-522787 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_015583 | Novosphingobium sp. PP1Y plasmid Mpl, complete sequence | 106028-106057 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP050083 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence | 302835-302864 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_012811 | Methylorubrum extorquens AM1 megaplasmid, complete sequence | 667360-667389 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 2314793-2314822 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049733 | Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence | 293804-293833 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP024310 | Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence | 1511283-1511312 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP045120 | Rubrobacter sp. SCSIO 52909 plasmid unnamed1, complete sequence | 7563-7592 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP023064 | Sinorhizobium sp. CCBAU 05631 plasmid pSS05631b, complete sequence | 1327479-1327508 | 8 | 0.733 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_LR594663 | Variovorax sp. RA8 plasmid 2 | 303045-303074 | 8 | 0.733 |
NC_020561_3 | 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617544-1617573 | 30 | NZ_CP015092 | Pelagibaca abyssi strain JLT2014 plasmid pPABY3, complete sequence | 896-925 | 8 | 0.733 |
NC_020561_3 | 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617544-1617573 | 30 | NZ_CP049032 | Fluviibacterium aquatile strain SC52 plasmid pSC52_4, complete sequence | 34385-34414 | 8 | 0.733 |
NC_020561_3 | 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617544-1617573 | 30 | NZ_CP031601 | Roseovarius indicus strain DSM 26383 plasmid pRIdsm_03, complete sequence | 7104-7133 | 8 | 0.733 |
NC_020561_3 | 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617544-1617573 | 30 | NZ_CP004395 | Celeribacter indicus strain P73 plasmid pP73B, complete sequence | 9878-9907 | 8 | 0.733 |
NC_020561_3 | 3.19|1617822|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617822-1617851 | 30 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1700388-1700417 | 8 | 0.733 |
NC_020561_3 | 3.21|1617954|31|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617954-1617984 | 31 | NZ_LR594668 | Variovorax sp. SRS16 plasmid 3 | 245642-245672 | 8 | 0.742 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049794 | Ralstonia solanacearum strain 204 plasmid unnamed, complete sequence | 325911-325940 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049792 | Ralstonia solanacearum strain 203 plasmid unnamed, complete sequence | 107387-107416 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP049790 | Ralstonia solanacearum strain 202 plasmid unnamed, complete sequence | 325739-325768 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP016915 | Ralstonia solanacearum strain CQPS-1 plasmid unnamed, complete sequence | 920551-920580 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052085 | Ralstonia solanacearum strain FJAT15353.F8 plasmid Plas1, complete sequence | 324206-324235 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052087 | Ralstonia solanacearum strain FJAT15353.F50 plasmid Plas1, complete sequence | 324206-324235 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052127 | Ralstonia solanacearum strain FJAT1303.F50 plasmid Plas1, complete sequence | 324206-324235 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052089 | Ralstonia solanacearum strain FJAT15353.F1 plasmid Plas1, complete sequence | 324206-324235 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP052131 | Ralstonia solanacearum strain FJAT1303.F8 plasmid Plas1, complete sequence | 324206-324235 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 5366445-5366474 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021813 | Sinorhizobium meliloti strain M270 plasmid psymA, complete sequence | 17996-18025 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP045074 | Paracoccus kondratievae strain BJQ0001 plasmid unnamed1, complete sequence | 8483-8512 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NZ_CP021819 | Sinorhizobium meliloti strain M162 plasmid psymA, complete sequence | 354253-354282 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_020548 | Azoarcus sp. KH32C plasmid pAZKH, complete sequence | 547335-547364 | 9 | 0.7 |
NC_020561_3 | 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617412-1617441 | 30 | NC_009620 | Sinorhizobium medicae WSM419 plasmid pSMED01, complete sequence | 959399-959428 | 9 | 0.7 |
NC_020561_3 | 3.14|1617478|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617478-1617507 | 30 | NC_000914 | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence | 391584-391613 | 9 | 0.7 |
NC_020561_3 | 3.21|1617954|31|NC_020561|CRISPRCasFinder,CRT,PILER-CR | 1617954-1617984 | 31 | MN035828 | Leviviridae sp. isolate H3_Bulk_Litter_17_scaffold_1122 RNA-dependent RNA polymerase (H3BulkLitter171122_000001) and hypothetical protein (H3BulkLitter171122_000002) genes, complete cds; and hypothetical protein (H3BulkLitter171122_000003) gene, partial cds | 1228-1258 | 9 | 0.71 |
1. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP020908 (Rhizobium etli strain NXC12 plasmid pRetNXC12b, complete sequence) position: , mismatch: 4, identity: 0.867
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagcgg- Protospacer .****.**************** * *****
2. spacer 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT matches to NZ_LR594668 (Variovorax sp. SRS16 plasmid 3) position: , mismatch: 5, identity: 0.833
cgggcaagacggttgggcgacgcgcgtttg CRISPR spacer tgggcaagacggtcgggcggcgcgcggtcg Protospacer .************.*****.****** *.*
3. spacer 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT matches to NZ_LR594673 (Variovorax sp. PBL-E5 plasmid 3) position: , mismatch: 5, identity: 0.833
cgggcaagacggttgggcgacgcgcgtttg CRISPR spacer tgggcaagacggtcgggcggcgcgcggtcg Protospacer .************.*****.****** *.*
4. spacer 3.2|1616686|30|NC_020561|CRISPRCasFinder,CRT matches to JQ680373 (Unidentified phage clone 2209_scaffold64 genomic sequence) position: , mismatch: 5, identity: 0.833
gaagttcgccgggtctacgcacgcgctttc CRISPR spacer gtagttggccgggtctacgcacgcgctgct Protospacer * **** ******************** ..
5. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_007764 (Rhizobium etli CFN 42 plasmid p42c, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
6. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013597 (Rhizobium sp. N741 plasmid pRspN741b, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
7. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_021907 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1b, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
8. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013501 (Rhizobium esperanzae strain N561 plasmid pRspN561a, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
9. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013507 (Rhizobium sp. N1341 plasmid pRspN1341b, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
10. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013518 (Rhizobium sp. N113 plasmid pRspN113a, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
11. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013491 (Rhizobium sp. N6212 plasmid pRspN6212a, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
12. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013496 (Rhizobium sp. N621 plasmid pRspN621a, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
13. spacer 3.4|1616818|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013591 (Rhizobium sp. N871 plasmid pRspN871a, complete sequence) position: , mismatch: 5, identity: 0.833
agtgatgactgacatcgcaacg-atagcggc CRISPR spacer ggtgacgactgacatcgcaacgaagagtgg- Protospacer .****.**************** * **.**
14. spacer 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_010463 (Enterobacteria phage Fels-2, complete genome) position: , mismatch: 6, identity: 0.8
cctatgtccgtaacaacccggacgtggccg- CRISPR spacer cctatgtccgggacaacccggaca-agctgc Protospacer ********** .***********. .**.*
15. spacer 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KT630647 (Salmonella phage SEN8, complete genome) position: , mismatch: 6, identity: 0.8
cctatgtccgtaacaacccggacgtggccg- CRISPR spacer cctatgtccgggacaacccggaca-agctgc Protospacer ********** .***********. .**.*
16. spacer 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_019488 (Salmonella phage RE-2010, complete genome) position: , mismatch: 6, identity: 0.8
cctatgtccgtaacaacccggacgtggccg- CRISPR spacer cctatgtccgcaataacccggaca-agctgc Protospacer **********.**.*********. .**.*
17. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_018022 (Mycolicibacterium chubuense NBB4 plasmid pMYCCH.01, complete sequence) position: , mismatch: 6, identity: 0.8
cgcgg-cgagacccacgtcaacaacctgctg CRISPR spacer -gtgatccagaccgaggtcaacaacctgctg Protospacer *.*. * ***** * ***************
18. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to MF063068 (Pseudomonas phage Noxifer, complete genome) position: , mismatch: 6, identity: 0.8
cgcgg-cgagacccacgtcaacaacctgctg CRISPR spacer -gtgaccaagacctacgtcaacaacctgatg Protospacer *.*. *.*****.************** **
19. spacer 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017563 (Paraburkholderia sprentiae WSM5005 plasmid pl1WSM5005, complete sequence) position: , mismatch: 6, identity: 0.8
gcccatcccgagctcgcgcttgtagcgcat CRISPR spacer ggcggtttcgagctcgcgcttgtagcccat Protospacer * * .*..****************** ***
20. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017076 (Novosphingobium resinovorum strain SA1 plasmid pSA1, complete sequence) position: , mismatch: 6, identity: 0.8
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtcgatgccgcgatggcggcggtccagcc Protospacer *.*. *****************.**** *
21. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP046333 (Cupriavidus metallidurans strain FDAARGOS_675 plasmid unnamed3) position: , mismatch: 6, identity: 0.8
gattcttg---ccgcgatggcggcggcccaggc CRISPR spacer ---tcccgagtccgcgatggcggcggaccaggc Protospacer **..* *************** ******
22. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_007974 (Cupriavidus metallidurans CH34 megaplasmid, complete sequence) position: , mismatch: 6, identity: 0.8
gattcttg---ccgcgatggcggcggcccaggc CRISPR spacer ---tcccgagtccgcgatggcggcggaccaggc Protospacer **..* *************** ******
23. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_004808 (Streptomyces rochei plasmid pSLA2-L DNA, complete sequence) position: , mismatch: 6, identity: 0.8
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gacgcacgccgcgatcgcggcggccgaggc Protospacer **. * .******** ********* ****
24. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP038146 (Streptomyces sp. S501 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.8
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gatctccgccgcgatggcggcggctgaggc Protospacer ***....*****************. ****
25. spacer 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT matches to NC_019388 (Thermus oshimai JL-2 plasmid pTHEOS02, complete sequence) position: , mismatch: 7, identity: 0.767
cgggcaagacggttgggcgacgcgcgtttg CRISPR spacer ggggccagacggttgggcgacgcggaaagg Protospacer **** ****************** . *
26. spacer 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT matches to NZ_CP010824 (Thermus aquaticus Y51MC23 plasmid pTA16, complete sequence) position: , mismatch: 7, identity: 0.767
cgggcaagacggttgggcgacgcgcgtttg CRISPR spacer ggggccagacggttgggcgacgcggaaagg Protospacer **** ****************** . *
27. spacer 3.1|1616620|30|NC_020561|CRISPRCasFinder,CRT matches to NC_016586 (Azospirillum lipoferum 4B plasmid AZO_p2, complete sequence) position: , mismatch: 7, identity: 0.767
cgggcaagacggttgggcgacgcgcgtttg CRISPR spacer cgggccagacggctgggcgacgcggtcgag Protospacer ***** ******.*********** . *
28. spacer 3.2|1616686|30|NC_020561|CRISPRCasFinder,CRT matches to NZ_CP017563 (Paraburkholderia sprentiae WSM5005 plasmid pl1WSM5005, complete sequence) position: , mismatch: 7, identity: 0.767
gaagttcgccgggtctacgcacgcgctttc CRISPR spacer gttaatcgccgggtcaacgcacgcgctaac Protospacer * . ********** *********** *
29. spacer 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_049453 (Klebsiella phage ST13-OXA48phi12.1, complete genome) position: , mismatch: 7, identity: 0.767
cctatgtccgtaacaacccggacgtggccg- CRISPR spacer cgtatgtccgtgacaacccggaca-aactgc Protospacer * *********.***********. ..*.*
30. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to JX163858 (Caulobacter phage phiCbK, complete genome) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaacctgctg Protospacer * .******* ***************
31. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KY555147 (Caulobacter phage Ccr34, complete genome) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaacctgctg Protospacer * .******* ***************
32. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KY555145 (Caulobacter phage Ccr29, complete genome) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaacctgctg Protospacer * .******* ***************
33. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KY555143 (Caulobacter phage Ccr2, complete genome) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaacctgctg Protospacer * .******* ***************
34. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KY555146 (Caulobacter phage Ccr32, complete genome) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaacctgctg Protospacer * .******* ***************
35. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KY555142 (Caulobacter phage Ccr10, complete genome) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaacctgctg Protospacer * .******* ***************
36. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LS974446 (Rhizobium selenitireducens ATCC BAA-1503 isolate T2.30D-1.1_plasmid plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.767
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer ccgaccaagacccacatcatcaacctgctg Protospacer * . *.********.*** **********
37. spacer 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_017958 (Tistrella mobilis KA081020-065 plasmid pTM3, complete sequence) position: , mismatch: 7, identity: 0.767
gcccatcccgagctcgcgcttgtagcgcat CRISPR spacer gatgatctcgacctcgcgcttgtagcgctc Protospacer * . ***.*** **************** .
38. spacer 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP031752 (Rhodobacter sphaeroides strain EBL0706 plasmid p.A, complete sequence) position: , mismatch: 7, identity: 0.767
gcccatcccgagctcgcgcttgtagcgcat CRISPR spacer gctgatcccgagctcgcgctggaagcggtc Protospacer **. **************** * **** .
39. spacer 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_AP022334 (Methylosinus sp. C49 isolate Methylosinus sp. C49 plasmid pMSC49b, complete sequence) position: , mismatch: 7, identity: 0.767
gcccatcccgagctcgcgcttgtagcgcat CRISPR spacer gcccatcccgggctcgcgcttttgcagcgc Protospacer **********.********** *. **..
40. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016613 (Ralstonia solanacearum FJAT-91 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
41. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021449 (Ralstonia solanacearum strain SEPPX05 plasmid pSEPPX05, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgtaccaggc Protospacer ...* **************** ******
42. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049794 (Ralstonia solanacearum strain 204 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
43. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049788 (Ralstonia solanacearum strain B2 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
44. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP039340 (Ralstonia solanacearum strain UW386 plasmid pUW386, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
45. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_016113 (Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCAT, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccggcgtgccgcggtggcggcggccccggc Protospacer * *******.************ ***
46. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_016113 (Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCAT, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gccgccggccgcgatggtgggggcccaggc Protospacer * . *. **********.** *********
47. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP012940 (Ralstonia solanacearum strain UW163 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
48. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP012944 (Ralstonia solanacearum strain IBSBF1503 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
49. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049792 (Ralstonia solanacearum strain 203 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
50. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP010871 (Confluentimicrobium sp. EMB200-NS6 strain EMBL200_NS6 plasmid pNS6002, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccttcttgccgcgatggaggctgcccgagt Protospacer *************** *** ****..*.
51. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015851 (Ralstonia solanacearum strain YC40-M plasmid, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
52. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_AP014687 (Bradyrhizobium diazoefficiens strain NK6 plasmid pNK6c, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gagcggcgccgcgatggcggcggccgaggg Protospacer ** . .****************** ***
53. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022791 (Ralstonia solanacearum strain SL3103 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
54. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022482 (Ralstonia solanacearum strain HA4-1 plasmid HA4-1MP, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
55. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP047139 (Ralstonia solanacearum strain CFBP 8695 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
56. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP051295 (Ralstonia solanacearum strain CIAT_078 plasmid megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
57. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP047137 (Ralstonia solanacearum strain CFBP 8697 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
58. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026091 (Ralstonia solanacearum strain IBSBF 2570 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
59. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_014309 (Ralstonia solanacearum CFBP2957 plasmid RCFBPv3_mp, complete genome) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
60. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP023013 (Ralstonia solanacearum strain T110 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
61. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021653 (Ralstonia solanacearum strain RS 488 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
62. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_014310 (Ralstonia solanacearum PSI07 plasmid mpPSI07, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
63. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022762 (Ralstonia solanacearum strain T95 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
64. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049790 (Ralstonia solanacearum strain 202 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
65. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP020716 (Cnuibacter physcomitrellae strain XA(T) plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gccccttgccgcggtggcggctgcccagta Protospacer * ..*********.******* ******
66. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022766 (Ralstonia solanacearum strain T78 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
67. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021763 (Ralstonia pseudosolanacearum strain RS 476 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
68. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026093 (Ralstonia solanacearum strain SFC plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
69. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021767 (Ralstonia solanacearum strain RS 489 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
70. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015116 (Ralstonia solanacearum strain EP1 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
71. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016555 (Ralstonia solanacearum FJAT-1458 plasmid plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
72. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP012688 (Ralstonia solanacearum strain UY031 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
73. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052069 (Ralstonia solanacearum strain FJAT91.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
74. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016915 (Ralstonia solanacearum strain CQPS-1 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
75. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016905 (Ralstonia solanacearum strain KACC 10709 plasmid unnamed1) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
76. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP025986 (Ralstonia solanacearum strain RSCM plasmid p-unname2, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
77. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022769 (Ralstonia solanacearum strain T60 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
78. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP023017 (Ralstonia solanacearum strain SL3022 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
79. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_017585 (Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCATT, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccggcgtgccgcggtggcggcggccccggc Protospacer * *******.************ ***
80. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_017585 (Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCATT, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gccgccggccgcgatggtgggggcccaggc Protospacer * . *. **********.** *********
81. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022773 (Ralstonia solanacearum strain T42 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
82. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022783 (Ralstonia solanacearum strain SL3755 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
83. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP014703 (Ralstonia solanacearum strain KACC 10722 plasmid, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
84. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022760 (Ralstonia solanacearum strain T98 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
85. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022789 (Ralstonia solanacearum strain SL3175 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
86. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022795 (Ralstonia solanacearum strain SL2330 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
87. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052071 (Ralstonia solanacearum strain FJAT454.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
88. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_017575 (Ralstonia solanacearum Po82 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
89. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022771 (Ralstonia solanacearum strain T51 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
90. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022777 (Ralstonia solanacearum strain T11 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
91. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022799 (Ralstonia solanacearum strain SL2064 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
92. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP009763 (Ralstonia solanacearum OE1-1 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
93. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP023015 (Ralstonia solanacearum strain T25 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgtaccaggc Protospacer ...* **************** ******
94. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022779 (Ralstonia solanacearum strain SL3882 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
95. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052075 (Ralstonia solanacearum strain FJAT448.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
96. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052085 (Ralstonia solanacearum strain FJAT15353.F8 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
97. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052095 (Ralstonia solanacearum strain FJAT15340.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
98. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052105 (Ralstonia solanacearum strain FJAT15252.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
99. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026308 (Ralstonia solanacearum strain IBSBF 2571 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
100. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021765 (Ralstonia pseudosolanacearum strain CRMRs218 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
101. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052077 (Ralstonia solanacearum strain FJAT445.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
102. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052087 (Ralstonia solanacearum strain FJAT15353.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
103. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052097 (Ralstonia solanacearum strain FJAT15304.F6 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
104. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052115 (Ralstonia solanacearum strain FJAT1463.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
105. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052127 (Ralstonia solanacearum strain FJAT1303.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
106. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052079 (Ralstonia solanacearum strain FJAT445.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
107. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052089 (Ralstonia solanacearum strain FJAT15353.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
108. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052093 (Ralstonia solanacearum strain FJAT15340.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
109. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052101 (Ralstonia solanacearum strain FJAT15304.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
110. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052099 (Ralstonia solanacearum strain FJAT15304.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
111. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052107 (Ralstonia solanacearum strain FJAT15249.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
112. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022781 (Ralstonia solanacearum strain SL3822 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
113. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052117 (Ralstonia solanacearum strain FJAT1463.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
114. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052125 (Ralstonia solanacearum strain FJAT1452.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
115. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022793 (Ralstonia solanacearum strain SL2729 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
116. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022785 (Ralstonia solanacearum strain SL3730 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
117. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022787 (Ralstonia solanacearum strain SL3300 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
118. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022756 (Ralstonia solanacearum strain T117 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
119. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052129 (Ralstonia solanacearum strain FJAT1303.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
120. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052121 (Ralstonia solanacearum strain FJAT1458.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
121. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052123 (Ralstonia solanacearum strain FJAT1452.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
122. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052131 (Ralstonia solanacearum strain FJAT1303.F8 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
123. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP011998 (Ralstonia solanacearum strain YC45 plasmid, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
124. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052073 (Ralstonia solanacearum strain FJAT448.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
125. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052081 (Ralstonia solanacearum strain FJAT442.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
126. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052083 (Ralstonia solanacearum strain FJAT442.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
127. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052109 (Ralstonia solanacearum strain FJAT15249.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
128. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052091 (Ralstonia solanacearum strain FJAT15340.F6 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
129. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052111 (Ralstonia solanacearum strain FJAT15244.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
130. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052103 (Ralstonia solanacearum strain FJAT15252.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
131. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052119 (Ralstonia solanacearum strain FJAT1458.F50 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
132. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052113 (Ralstonia solanacearum strain FJAT15244.F1 plasmid Plas1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tgcccatgccgcgatggcggcgcaccaggc Protospacer ...* **************** ******
133. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to MT316461 (Streptomyces phage Galactica, complete genome) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggccggtgccgcgaaggcggcggccaaggc Protospacer *... ******** ********** ****
134. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP012477 (Arthrobacter sp. ERGS1:01 isolate water plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ctttgccgccgcgatcgcggcggccgaggc Protospacer ** ..******** ********* ****
135. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_AP014705 (Methylobacterium aquaticum strain MA-22A plasmid pMaq22A_1p, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtcggcgacgcgatggcggcggccgaggc Protospacer *.*. .* **************** ****
136. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to MN284893 (Mycobacterium phage LilMcDreamy, complete genome) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtcgaggccgcgatggcggcggcgctggc Protospacer *.*. ***************** * ***
137. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP019036 (Massilia putida strain 6NM-7 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattc---ttgccgcgatggcggcggcccaggc CRISPR spacer ---tcagacggccgcgctggcggcggcgcaggc Protospacer ** . ****** ********** *****
138. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_023316 (Streptomyces sp. 14R-10 plasmid pZL1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gccgctggccgcgatggcggcggccgtgcc Protospacer * . ** ****************** * *
139. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP025016 (Rhizobium leguminosarum strain Norway plasmid pRLN4, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gctggaagccgcgattgcggcggcccgggc Protospacer * * ******** **********.***
140. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LR134452 (Tsukamurella tyrosinosolvens strain NCTC13231 plasmid 10, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cacgcgcgccgcgaaggcggcggccgaggc Protospacer *. * .******* ********** ****
141. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
142. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
143. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
144. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
145. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
146. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
147. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
148. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
149. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
150. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to CP053919 (Serratia marcescens strain LY1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ggtgaacgccgcgaaggcggcggaccaggc Protospacer *.* .******* ******** ******
151. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022363 (Azospirillum sp. TSH58 plasmid TSH58_p03, complete sequence) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer caccgtcgccgcgatggcggcggccggggc Protospacer *.. *.****************** .***
152. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to JN564907 (Burkholderia phage AH2, complete genome) position: , mismatch: 7, identity: 0.767
gattctt----gccgcgatggcggcggcccaggc CRISPR spacer ----cctgaaggccgcgatggcgtcggccgaggc Protospacer *.* ************ ***** ****
153. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to MN813697 (Mycobacterium phage Noelle, complete genome) position: , mismatch: 7, identity: 0.767
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cttcatcgccgcgacggcggcgtcccaggc Protospacer *. *.*******.******* *******
154. spacer 3.3|1616752|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP014683 (Kozakia baliensis strain NBRC 16680 plasmid pKB16680_2, complete sequence) position: , mismatch: 8, identity: 0.733
cctatgtccgtaacaacccggacgtggccg CRISPR spacer cctatgtccgtaataacacggacaagtgga Protospacer *************.*** *****. * .
155. spacer 3.9|1617148|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to MN034485 (Leviviridae sp. isolate H2_Bulk_34_354 hypothetical protein (H2Bulk34354_000001) gene, partial cds; and hypothetical protein (H2Bulk34354_000002) and RNA-dependent RNA polymerase (H2Bulk34354_000003) genes, complete cds) position: , mismatch: 8, identity: 0.733
ccttccacgcgtcaagctcaccttcgaacc-- CRISPR spacer tattccaggcgtcaagctcacct--agatcag Protospacer . ***** *************** ..*.*
156. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_019410 (Caulobacter phage CcrKarma, complete genome) position: , mismatch: 8, identity: 0.733
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaatctgctg Protospacer * .******* ********.******
157. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_019407 (Caulobacter phage CcrMagneto, complete genome) position: , mismatch: 8, identity: 0.733
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaccctgctg Protospacer * .******* ******* *******
158. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to KY555144 (Caulobacter phage Ccr5, complete genome) position: , mismatch: 8, identity: 0.733
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaccctgctg Protospacer * .******* ******* *******
159. spacer 3.11|1617280|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_019411 (Caulobacter phage CcrSwift, complete genome) position: , mismatch: 8, identity: 0.733
cgcggcgagacccacgtcaacaacctgctg CRISPR spacer accctgaagacccaggtcaacaatctgctg Protospacer * .******* ********.******
160. spacer 3.12|1617346|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP047174 (Rathayibacter sp. VKM Ac-2760 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.733
gcccatcccgagctcgcgcttgtagcgcat CRISPR spacer gtcggggccgagcttgcgcttgtagcgcgc Protospacer *.* . *******.*************..
161. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_018022 (Mycolicibacterium chubuense NBB4 plasmid pMYCCH.01, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cgcggatgccgcgatggccgcgacccaggc Protospacer .. ************ ***.*******
162. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP039340 (Ralstonia solanacearum strain UW386 plasmid pUW386, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cgctaacgccgcggcggcggcggcccaggc Protospacer ..* .******..***************
163. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_015583 (Novosphingobium sp. PP1Y plasmid Mpl, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ctatcttgacgcgatggcggcggccgggct Protospacer ***** **************** .* .
164. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP050083 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccgtctttccgcgatggcggcgggccgctc Protospacer **** *************** **. *
165. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_012811 (Methylorubrum extorquens AM1 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gccgagcgcctcgatggcggcggcgcaggc Protospacer * . .*** ************* *****
166. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atatagggccgcgttgccggcggcccaggc Protospacer . * ****** ** *************
167. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049733 (Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccgtctttccgcgatggcggcgggccgctc Protospacer **** *************** **. *
168. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP024310 (Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer caaggctgccgcgatggcggccgaccagga Protospacer * .*************** * *****
169. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP045120 (Rubrobacter sp. SCSIO 52909 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.733
-----gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccaaagg-----gccgcgagggcgccggcccaggc Protospacer *. ******* **** **********
170. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP023064 (Sinorhizobium sp. CCBAU 05631 plasmid pSS05631b, complete sequence) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer caaggctgccgcgatggcggccgaccagga Protospacer * .*************** * *****
171. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LR594663 (Variovorax sp. RA8 plasmid 2) position: , mismatch: 8, identity: 0.733
gattcttgccgcgatggcggcggcccaggc CRISPR spacer gccgccgagcgcgatggcggcggccccggc Protospacer * . *. . ***************** ***
172. spacer 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015092 (Pelagibaca abyssi strain JLT2014 plasmid pPABY3, complete sequence) position: , mismatch: 8, identity: 0.733
cccccagggcgcatagccaagccggcccac CRISPR spacer ggggcagggcgcatagccatgccagccctg Protospacer *************** ***.****
173. spacer 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049032 (Fluviibacterium aquatile strain SC52 plasmid pSC52_4, complete sequence) position: , mismatch: 8, identity: 0.733
cccccagggcgcatagccaagccggcccac CRISPR spacer ggggcagggcgcatagccatgccagccctg Protospacer *************** ***.****
174. spacer 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP031601 (Roseovarius indicus strain DSM 26383 plasmid pRIdsm_03, complete sequence) position: , mismatch: 8, identity: 0.733
cccccagggcgcatagccaagccggcccac CRISPR spacer ggggcagggcgcatagccatgccagccctg Protospacer *************** ***.****
175. spacer 3.15|1617544|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP004395 (Celeribacter indicus strain P73 plasmid pP73B, complete sequence) position: , mismatch: 8, identity: 0.733
cccccagggcgcatagccaagccggcccac CRISPR spacer ggggcagggcgcatagccatgccagccctg Protospacer *************** ***.****
176. spacer 3.19|1617822|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.733
ccagcggacggacgcatatgggcaagcggc CRISPR spacer ccagcggacggacgcatacggcggcccgat Protospacer ******************.** . **..
177. spacer 3.21|1617954|31|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LR594668 (Variovorax sp. SRS16 plasmid 3) position: , mismatch: 8, identity: 0.742
tccttttacgcgatgagggcagtgagcccgg CRISPR spacer tgcagcgccgcgaagagcgcagtgagcccgg Protospacer * * . ***** *** *************
178. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049794 (Ralstonia solanacearum strain 204 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
179. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049792 (Ralstonia solanacearum strain 203 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
180. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP049790 (Ralstonia solanacearum strain 202 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
181. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016915 (Ralstonia solanacearum strain CQPS-1 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
182. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052085 (Ralstonia solanacearum strain FJAT15353.F8 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
183. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052087 (Ralstonia solanacearum strain FJAT15353.F50 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
184. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052127 (Ralstonia solanacearum strain FJAT1303.F50 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
185. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052089 (Ralstonia solanacearum strain FJAT15353.F1 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
186. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP052131 (Ralstonia solanacearum strain FJAT1303.F8 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer atcggacggcgcgatggcggcggccgaggc Protospacer . . .* **************** ****
187. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cggcgacgtcgcggtggcggcggcccaggc Protospacer . . .*.****.****************
188. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021813 (Sinorhizobium meliloti strain M270 plasmid psymA, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cgcgaaagccgcgatcgcggcggccaaggc Protospacer .. ******** ********* ****
189. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP045074 (Paracoccus kondratievae strain BJQ0001 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer tccagtcgccgcgatggcggcggaccagat Protospacer . *.**************** ****..
190. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021819 (Sinorhizobium meliloti strain M162 plasmid psymA, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cgcgaaagccgcgatcgcggcggccaaggc Protospacer .. ******** ********* ****
191. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_020548 (Azoarcus sp. KH32C plasmid pAZKH, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer ccgcgcggcctcggtggcggcggcccaggc Protospacer . . *** **.****************
192. spacer 3.13|1617412|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_009620 (Sinorhizobium medicae WSM419 plasmid pSMED01, complete sequence) position: , mismatch: 9, identity: 0.7
gattcttgccgcgatggcggcggcccaggc CRISPR spacer cgcgaaagccgcgattgcggcggccaaggc Protospacer .. ******** ********* ****
193. spacer 3.14|1617478|30|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to NC_000914 (Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence) position: , mismatch: 9, identity: 0.7
actcgctgcgaggggacggggagaggaagg CRISPR spacer tgtgcctgcgaggggaaggggagaggcgcc Protospacer * *********** ********* .
194. spacer 3.21|1617954|31|NC_020561|CRISPRCasFinder,CRT,PILER-CR matches to MN035828 (Leviviridae sp. isolate H3_Bulk_Litter_17_scaffold_1122 RNA-dependent RNA polymerase (H3BulkLitter171122_000001) and hypothetical protein (H3BulkLitter171122_000002) genes, complete cds; and hypothetical protein (H3BulkLitter171122_000003) gene, partial cds) position: , mismatch: 9, identity: 0.71
tccttttacgcgatgagggcagtgagcccgg CRISPR spacer ccgaagaacgccatgcgggcagtgagcccga Protospacer .* **** *** **************.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
641969 : 650998
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_020561|641969:650998|DBSCAN-SWA TTCATGCCGCAAGCTCCGCCTTCAGGCCGGCGCGATCGACCTTGCCATTGGCGTTGCGGGGAAGATCCGCCCGCCAGACGATCGCGCGCGGCTGCATGAAATTGGGGAGATCGCGCTTCAGCGCATCGCGCAGCGCCGCCTCCTGCGCGGGATCGCCCCTGAGGACGAGGAGGATCGCCTCGCCCAGCCGATCGTCCGCCACGCCGATGGCGACGGCTTCCGCCGCAAGCCCGGTGGCGATGGCGGCTTCCTCCACCTCCGTCGGGCTGACGCGGTTGCCGGACGTCTTGATCATCTCGTCATCGCGGCCGACGAAGTGGAGCAGCCCCTCCCCGTCCACCGCCACCGTATCGCCGGACCAGACGGCGGCGCCGCCATAAAGCGAATGGGAGGGCGCGGGGCGGAAGCGGTGCGCGGTCCGCTCCTCATCGCGCCAATAGCCTTGCGCCACCAGCGGGCCGGCATGGACCAGCTCGCCCGGCTCGCCGGCTTCGGTGGGGCTGCCGTCGGCGCGGACCACCATCACCTCGGCGAAGGGAATGGCGCGGCCAATGGATTCCGGGTGGGCATCGACCAGGGCGGGATCGAGATAGGTGGAACGGAACGCCTCCGTCAGCCCATACATCGAATAGAGATCGGCATTGGGGAAGAGCGCGCGCAGACGGCGGATCAGCGGCACCGGCAGGCGGCCGCCCGAATTGGTCAGGCGGCGCAGGCGGGCGGCGGTTTCGGCCGGCCAGTCCGCCTCGGCAAGCTGCACCCACAAGGGCGGCACGCCGGCCAGCGTGGTGATGGCGTGGCGTTCCACCGCGCGGATCACATCGCGCGCGGTGAGGTAATCGAGCGGGTGGACGCAGCCCCCCGCCGCCCAGGTGGAAAGCAGCTGGTTCTGGCCGTAATCGAAGCTGAAGGGCAGCACGGCGAGCGTGCGGTCATCCGGCCCAAGCCGCAGATAATGCGCGACGGAAACGGCGCCCAGCCACAGATTGGCGTGGCTCAGCATCACCCCCTTGGGCCGGCCGGTGGAGCCGGAGGTATAGAGGATGGCGGCCAGACCGGCCGGATCGGCCGAGGACGGCGGCAGGCCGGACTGGGCGGCGAGCGTGGCGGCGCCGGCCTCGTCCTCGATCAGCAGCGCGCAATCGGCGGGGCGGTCCCCCGGCTCCAGCGTCGCGACGCGGCTGGGGCCGGCGATCAGCAGCGCGGCCCCGCTATCGGCGAGGATGTGGGCAAGCTGGGCGCGCTTCAGCAGCGGGTTGACGGGGACGTGAACGAAGCCGGCCCGCGCGCAGGCCAGCGGCAGCAGGCTCGCCAGCCGCGTCTTCGCCAGCCATGAGGCGACGCGCGCGCCGGGGGCGAGACCGCGATCGGCCAGCGCGCGGGCCAGCGCGCCCACCGCGCGATCGAGCCCCGTATAATCCAGCGCGCCAGCCCGATCGACCAGCGCCAGCGCACCGGCCGCGCCCATCAGCGGCAGATGATCGAGCGGGCGGGGATCGGGATTGGGATCGGTCGCGGGCACCCCTCATGCTCCTTCGCGCTGATGCTGGGCCGGAGGCGATAGCGCGCCCACGGCCCGGTGGAAACCCGGCCGATGCTTGTCCCGCGCGTGCGAAGCGGCTAGGCGACCGCCGTTCGGGTTCGCCAGAAGACGAGGAATTAGATGGTCGGCGATGGGATCGATGTCGAAACGGCGGTTCGCGGCGTGCTGGGCGACGTGCTCGGGCTGGGCGATGCGCGCACGGCCGCGCTCGAACCCGGAACACCGCTGTTCGGCGCCATGCCGGAACTCGATTCGATGGCGGTGGCCGGCCTGCTGACGGAGCTGGAGGACCGGCTGGGCATCATCATCGACGATGACGAGGTGGATGGCGAGCTGCTGGAAAGCTTCGGCGCGCTGGTGGCCTTCGCCAAGGCCAAGGTGGCGAAGGCGGCCTGAGGCCGCCTCCCCCCATATCCTCAGGCCGCGGCCGTGGCCTTCGCCTCGAAATCGGCATGGGCCAGGAAGCGTTCCGCATCCAGCGCCGCCATGCAGCCCGTGCCCGCCGCCGTCACCGCCTGGCGATAGACCTTGTCCATCACGTCGCCGCAGGCGAACACCCCGGCCACGCTGGTCCGCGTGCTGCCCGGCTCCACCGCGATATAGCCGTCGCTGTCGAGCGCGATATGGCCGCGGAAGAGATCGGTGGCCGGCGAATGGCCGATCGCGACGAAGCCGCCGTCGACATCGAGGCTGCTGACCTCACCCGTCACCGTATCGCGCAGCGAGAGCGCGACGAGGCCCTCGGGATCGCCGCCGCCCACGAACTCCGCGACTTCCTTGTTCCACAGCACCTTGATGCGCGGATGGGCGAAAAGGCGGTCCTGGAGAATCTTTTCCGCGCGCAGCGAATCGCGGCGGTGGATCAGCGTCACGTCCGGGCTGTGGTTGGTCATGTAGATCGCTTCCTCGACCGCGGTGTTGCCGCCGCCGATCACCGCGACCTTCTTGCCGCGATAGAAGAAGCCGTCGCAGGTGGCGCAGGCGGAAACGCCCTTGCCCTGCAGCTTCAGCTCGCTGGGCAGGCCCAGCCAGCGCGCCTGGGCGCCGGTGGCGATCACCACCGTATCGCCTTCATAGACGGTGCCGCTGTCGCCGATGAGGCGGAAGGGGCGCTTCGAGAAATCGACCTCGACGATCGTGTCCCACATCATCCGCGCGCCGACATGCTCGGCCTGGCGCTGCATCTGCTCCATCAGCCACGGGCCCTGGATGACGTCCGCGAAGCCCGGATAATTCTCCACATCGGTGGTGATGGTCAGCTGGCCGCCCGGCTGGATGCCCTGCAGCACGATCGGCGCCATCCCGGCGCGCGCGGCATAGATCGCCGCCGAAAGGCCGGCCGGCCCCGAACCGAGGATGAGCATGCGGGTGGAATGGGTCGCGGTCATGGATCGCCTTTGCTCGTGAATCTCAATATCTCGCGCCCGAGATAGGGCGTGGGGGGGCCACACCGCAAGGAATGGAAATTCTCGCGGCCCGGAAAGGCTGGCTCGGGCGCCGGCCGGAAAACATCGCTGGAAGGAGGAAGGAAGCGTGGTGGGCGCGACAGGGATTGAACCTGTGACCCCACCCGTGTGAAGGGTGTGCTCTACCGCTGAGCTACGCGCCCACGCTTCGCTTGAGGCCGATCATCGGGGCCAGCGGGGCTGGCGATCGGCGTTGCGGGGAGGCGCTTAGGCGGTCGCCGCGGACTTGTCCAGCCCCATCGGCGAGATATCCCCAGATCAACGCGACATCGCGCGCGATTGCGGAGAAGCGGCCGGCCCGATCCCCACCGGGCCGGCCCCGATCCTTAGTTGACCGCGTCCTTGAGGCCCTTGCCCGCCTTGAACTTCGGCTGGCTGGACGCCTTGATCTTCATCGGCTCGCCGGTGCGCGGGTTGCGGCCGGTGGATGCCTTGCGCTTGGAGACCGAGAAGGTGCCGAAGCCGACGAGGCGAACCTCATCCCCCTTCTTGAGCGCGCCGCTGATCGTATCGAAAACCCCTTCCACGGCCTTCGAAGCGTCGTTCTTGCTGAGGCCTGTCGCCTCGGCGACCGACGCGATCAGTTCTTGCTTGTTCATGCCCGAGGAACCCCCTTGAATAAGGTGCGATTATTTGAGTCGCGGCCAAGGCCGGGCGCGCATCTAAGAACAGCGCCGATAGGCGTGTCAAAAGGAAAGCGCGCCTAAATAGCGGGTTTTTCGAGGTTCGGGCCGATTTGGCGCGGCTCGATTCGGTGCCGGAGCGAAAAAAGAAAGGCGACGACCCGGTAAAAACCGGATCGCCGCCCCTCGTTCGCCTTGGCGAATCAGTGGTGAATCGCAGCCCCGGCCTCGCCGGCAAGATGCGGCACGGGCGGCGCGGCCAGATCGTCCGCTTCCGTCCAGTCGATCGGCGAAACCGGCTGGGCCAGCGCCAGGGCCAGCACCTCATCGACATGCTTGACCGGCGTGATCTTCAGCCCTTCGCGGATATTGGCGGGGATTTCCGCCAGATCCTTCTCATTCTCCTGCGGGATCATCACGTGGGTGATGCCGCCGCGCAGCGCCGCCAGCAGCTTTTCCTTGAGGCCGCCGATCGGCAGCACCCGGCCGCGCAGCGTGACTTCGCCCGTCATCGCCACCTCGCGCCGCACCGGAATGCCGGTGAGCGTGGAGACGATCGCGGTGACGATGCCGATGCCGGCGGAAGGGCCGTCCTTGGGCACCGCGCCCTCGGGCAGGTGGATATGCACGTCCTTGCGGCCGAACAGGCTGGGCTTGATGCCATAAGCCGGCGCGCGGGCCTTTACGAAGGAGAAGGCGGCCTGCACGGATTCCTTCATCACGTCGCCCAGCTTGCCGGTGGTGGCGATGGCGCCCTTGCCCGGCACGGTGACGGCCTCGATCGTCAGCAGCTCGCCGCCGACCTCCGTCCAGGCGAGGCCGGTGACGGCGCCGATCTGGTCCTCCTCCTCGCCCACGCCGAAGCGGAACTTGCGGACGCCGGCGAACTCGGAAAGATTTTCGGGCGTGATCTCGACCGTTTCGGCCTTGCCTTCGAGGATGCGGCGCAGCGCCTTGCGGGCGAGCTTCGCGATCTCCCGCTCCAGCGTGCGGACGCCGGCTTCGCGGGTATAATAGCGGATCAGGTCGCGCAGGCCGGCGTCGGTGAGGATGAACTCACCGGCCTTCAGCCCGTGCGCCTCGATCTGCTTGGCGATCAGGTGCGCCTTCGCGATCTCGACCTTCTCGTCCTCGGTATAGCCTTCGAGCCGGATGATCTCCATGCGGTCGAGCAGCGGCTGCGGCAGGTTCAGCGAGTTGGCGGTGGTGACGAACATCACGTCCGACAGGTCGACATCGACCTCCAGATAATGATCCTGGAACTTGCTGTTCTGTTCGGGGTCGAGCACCTCCAGCAGCGCGGACGCGGGATCGCCCCGAAAATCCTGGCCGAGCTTGTCGATCTCGTCGAGCAGGAAGAGCGGGTTGGACGTGCCCGCCTTCTTGAGGTTGGTGACGACCTTGCCCGGCAGCGAGCCGATATAGGTGCGGCGATGGCCGCGGATCTCGGCCTCGTCGCGCACGCCGCCCAGCGACTGGCGCACGAATTCGCGCCCCGTCGCCTTGGCGATCGAGCGGCCGAGCGAGGTCTTGCCCACGCCGGGCGGGCCGACGAGGCACAGGATCGGCCCTTTCAGCTTGTTGGTGCGCGCCTGGACCGCGAGATATTCGACGATCCGCTCCTTCACCTTCTCCAGCCCGTAATGATCGGCATCAAGCACGGCCTGGGCGACGGCGATATCCTTCTTGAGCTTCGACTTCTTGCCCCAGGGCAGGCCGAGCAGCACATCCAGATAATTGCGGACGACGGTCGCTTCGGCCGACATCGGCGCCATGGTGCGCAGCTTCTTCAGCTCCGCATTCGCCTTTGCGCGGGCCTCCTTCGAAAGCTTGAGCTTGGTGATCTTCTGCGCCAGCTCGGCCAGCTCGTCGCCGCCCTCGCCTTCCTCGGCATTGCCCAGCTCGCGCTGGATCGCCTTGAGCTGCTCGTTCAGATAATATTCGCGCTGGGTCTTCTCCATCTGCCGCTTCACGCGGCTGCGGATCTTCTTTTCCACCTGGAGGACGCCCAGCTCGCCTTCCATGAAGGCGAAGGCCATTTCGAGCCGCTTGGCGGGATCGAGCTCGACGAGCAGGGTCTGCTTGTCGGCGACCTTGATGTTGATGTTGGACGACACCGCATCGGCCAGCCGCGCCGGATCCTCGATCTGGCTGAGCTGGACGGCGGTTTCGGCCGGCATCTTCTTGTTGAGCTTCGCGTAATTTTCGAACTGCTCGACCACCGAGCGCATCAGCGCCGCGATCTCGTCACCCTCGGCCGGCTGATCGTCGATCAGGTCCACCGAGGCGGTCAGATGATCGCCGTCGGCATCGAGCGTGGCGAGCGCCGCGCGCTGGCGTCCCTCCACCAGCACGCGCACGGTGCCGTCGGGCAGCTTCAGGAGCTGGAGAACGGTGGCGACGACGCCGATATCGTAGAGCGCGTCGCGATCGGGATCATCCTCGGACGGATCGAGCTGGGCGACCAGGAAGATCGCCTTGTCCGCCGCCATCGCGCTTTCCAGCGCGGCCACCGATTTGTCGCGCCCCACGAAGAGCGGAACGATCATCTGCGGGAACACGACGATGTCGCGCAGCGGCAGGACGGGGAGGATTTCGGTCATGCAAACTCCGATGCGGGGCGGAAAGGTCGAGGCCGCCCGCGTGATTCCTTGGTCACTTATCTGTTATATATATGGGCGCGGCACCGTCCCGCATCAACGGGCGCACGGAATCCTCTTCATATCGGATCTGAAACAAAATCCTTCCGCGGCAATAATCTCCTTGCGTCGCAGCGTCTTGGTGCGGGGAAGGCGGCGACGCCCTCCCCCGCCCGCCTCAGTGCCGCGTCTGCGTCATCCGATCGACCCAGGCGATGCCCAGCGCGGAGAGGATGAAGGCCATGTGGATGATCGCCTGCCACATCACGCCTGTCTCCGTGACGCGCGCGCCGGGCATGCCGATGGCGCTCGCCTCGATGAAGGTCCGCAGCAGATGGATCGAGGAGATGCCGATGATCGCCATGGCGAGCTTGACCTTCAGCACGCCCGAATTGACGTGGCTCAGCCATTCGGGGTTGTCCGGGTGCCGTTCCAGCCGCAGCCGCGAGACGAAGGTTTCATAACCGCCGACGATCACCATCACCAGCAGGTTGGATATCATCACCACGTCGATCAGGCCGAGCACGACGAGCATGATCTGCTGCTCCCCGAAATCGGCGGCGTGGGTGACGAGGTGCCACAACTCCTTGAGGAACAGGAAGACATAGACGCATTGCGCGACGATCAGCCCGACATAGAGCGGAAGCTGCAGCCAGCGCGACGAGAAGATCAGCAACGGCAGCGGCCGGAGCCGCGCGGGGGTTTCGATCTGGGGCGTGGGATCGGGAAGACTGGCCATCGGTCCTTGGAAACATGGTTGACGGGATGGACACCCTGTAGAGACCATCCGCGCCGGTTTCTGTTTCAAAGCTTTGCGGGCCGAAGTTGGCGTGCGTCCTTTTCCGTCACCCCAGCGCAAGCTGGGGTCTCAGGATGCAAGTCCGATGTCATCCGCCAGATCATGCCAGCATGGATTGACCTTCTCGATAAGCGCCACCTTCCAGGCACGCTTCCATGCCTTGAGCGCCTTCTCACGCGCGATCGCTTCCTCGATCGAGGAATGTGGCTCGGCAAGTACGAGCCTCTGCAGATTGTAGCGCCGGCAAAAGGCCGATCCCAGGCCTTCGCGATGCTGCGCCACTCGTGCCGCCAGATGCGCGGTTACACCGATATACAGCACGCCGCGCGGCTTGTTGGTCATGATATAGGTCCAGCCGCCGCGCTCCATGATCGGAATCTAGCGGCAACCTCATGAGATGCCAGCCTGCGCTGGCATGACGAAGAAAGAGAACGGAGCGTCTCCGCCCCTACCGTTCCGGGCGCAGGCGATCGTCGGCCAGGTCGCTGAACTTGGTGAACTTGCTCTCGAACTTCAGCGTCACGGTCGCGGTGGCGCCGTGGCGGTTCTTGGCGATGACAAGCTCGGCCAGGCCGTGGGCATCGGCCATCCGCTTCTGCCAGGTATCGAAGGCGGAGACCGCGCCGGCATCGTCGCCGGCATCGACCGAGGCTTCCTTCGGTTTCTCGAAATTGATGTAATATTCCTCGCGATAGACGAACCAGACGATATCGGCGTCCTGCTCGATCGATCCGGATTCGCGCAGGTCCGAAAGCTGCGGCCGCTTGTTCTCGCGGCTTTCGACCGCGCGGCTGAGCTGGGAGAGCGCGATCACCGGAACGTGCAGTTCCTTGGCGAGCTGCTTGAGGCCGCGGCTGATTTCCGAAATTTCCTGCACGCGGTTGCCGTCGCGCGAATTGCCGGAGCCCTGGAGGAGCTGGAGATAGTCGACCACGATCAGGTCGATGCCGCGCTGGCGCTTCAGCCGGCGGGCGCGGGTGCGCAGCGCCGCGATGGTGAGGCCGGGCGTGTCGTCGATATAGAGCGGCAGATTCTCCAGCTCCGCCGCCGCGCGGGCGAGGTTGCGGAAATCCTGGTGGCTGATGTCGCCCATGCGCAGCTTCTGCGAGCTGATCTCCGCCTGTTCGGACAGGATACGGGTGGCGAGCTGGTCCGCCGACATTTCGAGGCTGAAGAAGGCGACGCCGGCGCCCACCGACCTGGCATCGGGAATGCCGTCCTCCCGGTCGCGCACGCGGCGTATCGCGGCGCTGAAGGCGATGTTGGTGGCGAGCGAGGTCTTGCCCATGCCGGGGCGGCCGGCGAGGATGATGAGGTCCGAACGGTGGAGGCCGCCGATCCGGCTGTTCACGCTGTCGAGGCCGGTGGTGATGCCGGAGACATGGCCGCCCGAATTGAGCGCGCGCTCGGCATTTTCGACCGCCAGCTTGGTGGCCTGGCCGAAGCTCTTGACGCTGCCCTGCTCGCCGCCTTCCTCCGCCACGCGGTAGAGCGCGACCTCGGCCGCCTCGATCTGGCTCTTGGGATCGACCGACTCGCTGGTGTCGAGCGCGTTTTCCACCATGTTGCGGCCGACGCCGATCAGTTCGCGCAGCAGCGCGAGATCGTAGATCTGGGTCGCGAAATCGCGCGCGCCGATCAGCGCGGCGCCGCTGCCGGTCAATTGCGCGAGATAGCCGGGGCCGCCCAGCTCGCGCATCGCCTCGTCCCCTTCGAACATCGGGCGGAGCGTGACGGGGTTGGCGACCATGTTGCGATCGACCAGCTTGAGGATCGCATCATAGACGCGGCCATGAACGGGTTCGAAGAAATGATCCGAACGCAGGCGGACCTGCACGTCCTCGCACAGGCGGTTATCGATCATCAGCGCGCCGAGGAGGGCGGCTTCCGCCTCCACGTTGCGCGGCAATTCGGCGGCATCCGCGGAAGCCGCGGGCATCAATCGGACAGGTTCGGCCAT
Protein sequences of DBSCAN-SWA_1 >NC_020561|641969:650998|641969_643436_-|WP_051128789.1|DBSCAN-SWA MGAAGALALVDRAGALDYTGLDRAVGALARALADRGLAPGARVASWLAKTRLASLLPLACARAGFVHVPVNPLLKRAQLAHILADSGAALLIAGPSRVATLEPGDRPADCALLIEDEAGAATLAAQSGLPPSSADPAGLAAILYTSGSTGRPKGVMLSHANLWLGAVSVAHYLRLGPDDRTLAVLPFSFDYGQNQLLSTWAAGGCVHPLDYLTARDVIRAVERHAITTLAGVPPLWVQLAEADWPAETAARLRRLTNSGGRLPVPLIRRLRALFPNADLYSMYGLTEAFRSTYLDPALVDAHPESIGRAIPFAEVMVVRADGSPTEAGEPGELVHAGPLVAQGYWRDEERTAHRFRPAPSHSLYGGAAVWSGDTVAVDGEGLLHFVGRDDEMIKTSGNRVSPTEVEEAAIATGLAAEAVAIGVADDRLGEAILLVLRGDPAQEAALRDALKRDLPNFMQPRAIVWRADLPRNANGKVDRAGLKAELAA >NC_020561|641969:650998|643631_643907_+|WP_015457347.1|DBSCAN-SWA MVGDGIDVETAVRGVLGDVLGLGDARTAALEPGTPLFGAMPELDSMAVAGLLTELEDRLGIIIDDDEVDGELLESFGALVAFAKAKVAKAA >NC_020561|641969:650998|645303_645576_-|WP_015457349.1|DBSCAN-SWA MNKQELIASVAEATGLSKNDASKAVEGVFDTISGALKKGDEVRLVGFGTFSVSKRKASTGRNPRTGEPMKIKASSQPKFKAGKGLKDAVN >NC_020561|641969:650998|645803_648203_-|WP_015457350.1|DBSCAN-SWA MTEILPVLPLRDIVVFPQMIVPLFVGRDKSVAALESAMAADKAIFLVAQLDPSEDDPDRDALYDIGVVATVLQLLKLPDGTVRVLVEGRQRAALATLDADGDHLTASVDLIDDQPAEGDEIAALMRSVVEQFENYAKLNKKMPAETAVQLSQIEDPARLADAVSSNINIKVADKQTLLVELDPAKRLEMAFAFMEGELGVLQVEKKIRSRVKRQMEKTQREYYLNEQLKAIQRELGNAEEGEGGDELAELAQKITKLKLSKEARAKANAELKKLRTMAPMSAEATVVRNYLDVLLGLPWGKKSKLKKDIAVAQAVLDADHYGLEKVKERIVEYLAVQARTNKLKGPILCLVGPPGVGKTSLGRSIAKATGREFVRQSLGGVRDEAEIRGHRRTYIGSLPGKVVTNLKKAGTSNPLFLLDEIDKLGQDFRGDPASALLEVLDPEQNSKFQDHYLEVDVDLSDVMFVTTANSLNLPQPLLDRMEIIRLEGYTEDEKVEIAKAHLIAKQIEAHGLKAGEFILTDAGLRDLIRYYTREAGVRTLEREIAKLARKALRRILEGKAETVEITPENLSEFAGVRKFRFGVGEEEDQIGAVTGLAWTEVGGELLTIEAVTVPGKGAIATTGKLGDVMKESVQAAFSFVKARAPAYGIKPSLFGRKDVHIHLPEGAVPKDGPSAGIGIVTAIVSTLTGIPVRREVAMTGEVTLRGRVLPIGGLKEKLLAALRGGITHVMIPQENEKDLAEIPANIREGLKITPVKHVDEVLALALAQPVSPIDWTEADDLAAPPVPHLAGEAGAAIHH >NC_020561|641969:650998|649107_649407_-|WP_015457352.1|DBSCAN-SWA MERGGWTYIMTNKPRGVLYIGVTAHLAARVAQHREGLGSAFCRRYNLQRLVLAEPHSSIEEAIAREKALKAWKRAWKVALIEKVNPCWHDLADDIGLAS >NC_020561|641969:650998|643927_644875_-|WP_187294048.1|DBSCAN-SWA MLILGSGPAGLSAAIYAARAGMAPIVLQGIQPGGQLTITTDVENYPGFADVIQGPWLMEQMQRQAEHVGARMMWDTIVEVDFSKRPFRLIGDSGTVYEGDTVVIATGAQARWLGLPSELKLQGKGVSACATCDGFFYRGKKVAVIGGGNTAVEEAIYMTNHSPDVTLIHRRDSLRAEKILQDRLFAHPRIKVLWNKEVAEFVGGGDPEGLVALSLRDTVTGEVSSLDVDGGFVAIGHSPATDLFRGHIALDSDGYIAVEPGSTRTSVAGVFACGDVMDKVYRQAVTAAGTGCMAALDAERFLAHADFEAKATAAA >NC_020561|641969:650998|648417_648978_-|WP_015457351.1|DBSCAN-SWA MASLPDPTPQIETPARLRPLPLLIFSSRWLQLPLYVGLIVAQCVYVFLFLKELWHLVTHAADFGEQQIMLVVLGLIDVVMISNLLVMVIVGGYETFVSRLRLERHPDNPEWLSHVNSGVLKVKLAMAIIGISSIHLLRTFIEASAIGMPGARVTETGVMWQAIIHMAFILSALGIAWVDRMTQTRH >NC_020561|641969:650998|649486_650998_-|WP_015457353.1|DBSCAN-SWA MAEPVRLMPAASADAAELPRNVEAEAALLGALMIDNRLCEDVQVRLRSDHFFEPVHGRVYDAILKLVDRNMVANPVTLRPMFEGDEAMRELGGPGYLAQLTGSGAALIGARDFATQIYDLALLRELIGVGRNMVENALDTSESVDPKSQIEAAEVALYRVAEEGGEQGSVKSFGQATKLAVENAERALNSGGHVSGITTGLDSVNSRIGGLHRSDLIILAGRPGMGKTSLATNIAFSAAIRRVRDREDGIPDARSVGAGVAFFSLEMSADQLATRILSEQAEISSQKLRMGDISHQDFRNLARAAAELENLPLYIDDTPGLTIAALRTRARRLKRQRGIDLIVVDYLQLLQGSGNSRDGNRVQEISEISRGLKQLAKELHVPVIALSQLSRAVESRENKRPQLSDLRESGSIEQDADIVWFVYREEYYINFEKPKEASVDAGDDAGAVSAFDTWQKRMADAHGLAELVIAKNRHGATATVTLKFESKFTKFSDLADDRLRPER |
8 | Tupanvirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1047135 : 1072457
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_020561|1047135:1072457|DBSCAN-SWA GGTGCTGACGGACGCGAAGGTCAAGGCGGCGAAGCCGAAGGACAAGCCCTACAAGCTGGGAGATTCAGGCCAGCTCTATCTCTACGTCAGCCCGGCCGGCGGTCGGCACTGGCGCATGAATTATGTGGCCCCGTCGACGCAAAAGCAAAAGACGCTGAGCTTCGGCTCCTACCCTACCATGACGCTCGCGGAGGCGCGGGCCGCGCGGGATGCCGCGAAGAAGATCCTCAGCACCGGGCGCGACCCGGCGATCGAGCGCCGCGTGGCCAGAAAGGAGCAGGCCCAATCCGACGCGAACACCTTCGAGTCGGTCGCCGAAAAATGGTTCGAGTTGAACAGCGGCTGGTCGCTGGAGAAGCTGCGCGAATATCGCGCGGCGAATAGCGACAAATGGTCGTGGAAGGCTGCCCGGCATTGGACGAAAAAGCCCGCGCCATGGTCGGCGGTCCATAGCGCCGACGTCCTGAAAAGCCTCGAAAGCGACGTCTTCCCGTCGATCGGCAGTCTGCCGATTCGGTCGTTGGAGGCGCGCGCGCCTCTGCTTCTCGAAGTGCTGCAGGAGGTAGAGGCTCGCGGCGCGATCGAGACCGCGCATCGCCTTCGGCAGCGGATTTCCGGGGTATTCGTCTATGGTATCGCTGCCGGCCTGTGCGGGGCTGACCCGGCGGCGAGCCTGGGCAAGGCGCTCGCGAAGAAGCCGCGCTCGAAGCCGCAGCCCTCGGTCATCGACGGCATCCAGAAGCAGGAGGATCGCCTGCGGGCCATCAAGGACATGCTGGCGAAATGCGAAGCAGAGCGCTGCCGGGCGAGCACCAAGCTCGCTTTGCGCTTTCTGGCGTTCACCGCCGTCCGGTCGAACGAACTGCGCTTTGCCTCGTGGGCGGAGTTCGAGGGGATCGACTGGGGCAACCCGCATGCCCCCGCGCCCGAGGCGCTCTGGCGCATCCCGGCGGCGCGGATGAAGGGCGACGACGAGCGGAAGGCCGAAGAGTTCGGCGACCATCTGGTGCCGCTGGCGCCCCAGGCGGTGGCCGTCCTGCGCGTGATGTGGCTGCTATCGGCTGGCCTGCCCTATGTCTTCCCCGGTGAGCGCCACCTGCACAAGCCGATCAGCGAAAACACGCTGCGCGCGCTGCTGATCCGCGCCGGCTATTATCAGCGCCATGTGCCGCATGGATTTCGGGCAGCCTTCTCGACCTATATGAATGACCGGCCGCTCTCCGAGCGGAAAGATGGCGACCGGGAGGTCATCGACCTGATGCTCGCGCATGTCCCTGAAGGAAAATCCGGCTCGGAGACGGCCTACAATCGGGCAGGATATATGGACCGGCGCCGCGAGCTTGCTTGCGAGTATGCCGACCTGATCGGCGCCGATCTCTGCGATCCGGCAGAGCATCTGGGCAAGCCGATCAGATATGCGGCAACGGGGCCGGGGCGGCTGGCCGCTTAGGCTGCCTGCTTGACGGCCTCCAGCCAGGCGCGGACCTCGGCTTCGCTCCACCGCGAGCCGAACCCGCCCGGCTTATAGGGCTTCGGGAACTTGCCCTCTCGGATCAGCTTGTAGATCCGCGTCCGCTTCAGCCCCGCGATCTGCTCGACGCGGGCGAAGGGAATCAGGCGGTCATTAGAATTGTCGGCCGGCGGCAGCGCCGCGAGCACGGCAAGATTTTGCTCCCGCGCAGTCATTGAGCGTTGGTGATTCACGCGACACACCTCCCGATCACCCCTCGATGATCCCAATCCTTGCGATAGACGGCCTGCCCGCGGCGAAGCTGCCAGCAGGACGTGTTGACGTTCCCAGCCCAGACGATCCCCACGGCCCGGCCATATCGATCTGTCGTCACCTGCTCGAACCGGATGGGTCCGAGGGTCAGGGCGGCCGCTAGGCTGGCCTTGCTGGTGATCGGATCGCCGGGGGCACAGATCCGCCCGCGCCGGCAGTGTCCGGGCATCTCCGGGGCGTCGATCCCGAGCAGGCGGATGCGCTGATCGCCGCAGCGAACGGTATCGCCATCGGTGGCCACGCACCCGGTCAGCGCCACGGCGGCGGCTAGAGCGAGCATCATGGGTGAGGGTCCTTGTGCGAATAGTCTCGAATAGCGCCCCACGTCGCGATGACATACTGGTGCCGGCCGGGATGTCCCGCAGGACGGATGCACCGATGATCTGCATCCGGCGATGGCGGGATGTGGAAGCAAAGGTTGTCGGCCGGCACGCTGTAATCGTCGCTCGGCCAATAGAATCGACCCTTCCCGCTCATCTTCCCTCGTCCTTCATGAGGTGGACGTCCTTCCGGGAGCGGATCAGGCCGCTGGCGACGCTGTCCCCGCACCACACGGAGATGTCGCCATCCATGAAGCGGCGCGTGATCGGGTCGGCATCACCGCCGACAAGGCGCAGGGCCACGCTGGGGCAGCCGCCCAAGCCGGCCGGGCTGTAATCCCAACCCTCGGTGAGGCCGATGATCTCCCATTCGGTGCCGTCCCACCATATGAAGCGATCGCCGGGCTTGGCCTCGATCGTCCCGGCCGATCCTGTCAGGCTGAACGCCAGCAGCGCATTGACCTCCGCCTCGCTCAGCCCCGCCGCGATCTTCGCGATCTCATCCATTTCCTTGCCCCTTGAGATGCTCGCCGCGCTCGATGGCATCGGCCAATTCAACGCAAAGCAATTGTCCGTCGCGCTTGAACGGGTCGCCCCAGAATGCGAGGATTTCTGGCGTGCGTGATTGACTGGCGAGCCTCCGCAGCCACGCCACCACCGCATCCCGCTCGTTGGGCTGGATCATCACCGCCCTCCTGCCGCAGCGATGGCTGCACGGGCGTCTCCCCGATACAGTTCCGCCACATCATCCCAGGGCTCCATGCGGAAATTTACCGGATGACCTTTTGCGGCGGACAGCACTTTGCCCGTATGGTCCGACCGTTCCTTTTCGCAGGCGTAGAGCGCCCTCGCCACCCGCTCGACCACATCCTCGGGCTGTGCTGGGGCGAGCTTTTCGTCGTAGGCGTTCAGCAGATCCGCGACGATCTCCGCCGGATCGTCCGTCGGCGCCCAATCCTTCAACAGCGGGTGCGAGCGATTGGATTCGATCGTTTCCATCAGCGATCGGCCGACAGCTTCCATGTCGTCGGCATCGAACTGGTGCACGGATTGCACAGGTTGCGCCGGGGTGGGGGATGTGAGGGTAACTTCGGCGGGAGCAATCGCCTCCCGAAGTTCGCTCCATGGCGACTTCGTGCCGTAGCGTCCATTTTCGCGGATGAAGGGAGACAAAGCAGGATCGATAGAGCACTCTACGTAGCGCTTTGCTGCTCGCACCATCTCCCGCAGCGCGGTGTCATCGGTCATCAGGCCCGACCTCCTTCCTGATATCGCCCTTCGGCCCGCGCGATCCACTCCTTCATCATGCTGATGGTGTCAGCGCGATCAGCGTTGCTGATGTAGTTGACCCGACCACCGTCGATCTGGCCGAACTCCGCGACCAGCAGAACGAATGCGATCTTGCGCGGACGGGCGTCTCCGTTGAACTGCTGGTCGAGCGCCCCGGCTATCGCGTTCATGCCGGCCCGATAGCGCGCCTGGATGGGGTCAGGCCGTGACATGCTTCACCGCCCACATGACCGCTTCCTCGATCTTGGTCATCTGACTGTCCTTTCAGGGAGAGGATGGCGGCGGCGATGTCGCGGCACGCCGAAATGTCGTCGCAGACTTCACCGGGCCGAGCATCGCCGGGGTGTGGCATCCGCCATTGTTCGTGCCGATCGCGGGCTACTTCGGCGGCCGCTTCCAGCGCCGCGATGCGGTGGCGGGCGAAGGCTTGGACGATGGGAGCATCATCACATGAGCCGCGTCTAATGTTCGCCGCTTGGTGATTTCGACCGTTCAGCTTGGCCCAAGCCGCCGCCGCCTCGCGATCAGCCTGCGTCACCGGCGCTCGGTCGGATTTCGTCATATCGCGCCCTCCGTTGCGAACATCCCGATCTGGCAAACCCCGGTCTGGCCGGCGAGCCAGTCGTGATAGGCTTCCGTGACGGCGCCCGATCCGGGAAACAGATCCACGAACTCGTCGTCAGGCCGCATATTGAGCCAGTCGAAAATCCATCTGCAAACGGCGGCGGGCTTCGCGCCGGTAAAGCCACGCCGAAGCGTGATGCTCTCCGCGATCGCAGGAGCCTCGATCCAGTCACGCTGCGTCGGCTGAGAGACCGGGATCGGCCTGCCTCCCCACAGGATGATCGGCTCCCATGCCCAGGCGCGGGTGACGTTCTTCTTGAACGAAGCGAATGGCTTGACCCACGCCCCGATGCGTGCATCCGGCGGGCACATCGGCAGGATCGTGCGCAGTGATGGCAGACTGAGGGACATGGCCCAGCCGTCGAACTCTGCCGTCATCCGTTCGATCAGCGCTTCGTGCGCCGTAGGGTCATCATAGTCTGCGGCGTCGGGGTGAAGGTGGCCATAGAACTTTTCGGCCACGCCCAGATAGGGCGGATCGGCATAGGCAAACCGCAGCGCTCGGTCGGATGGCGCGCGCGTCATGCTGCGATCCTTTCGTAAACGGCCGCCAGCGCAGCTTCCGTGCGCAGGATCGCGCGGCCTATCGCTTCGGGGATCTGTGGGACGACGGCATCGCCGAACGCTTCGACGATGAGGGAGGCGGCAGAAGTCCCTTTTGGACCGCCGACCGCAACGCGAGTGCCAGCCACCCAGGCGGATAGCCCATCATCCACCCGTATGTGACGGGCAAGGTCATCGAGGGACCAGTCAGCCCGTGATCCGACAGCACCGCGGCGATCTGGCGAGCATATTGCCATTTCTGGTCGCGCAGGAAGGTCGGGACGTTCGGAGATTTCACCTTGCCGTTGCTCGTCACCCCCAAGCTGCCCTTTCGGTCCGATGCCAATGGTATCGGCATCGTTGCCCGCTGCAACTGGTGGCGCACCGTGCCCATCCTGCCCGTGTCTCCGTGGCCACCCGCCTTCATGTCCGAGGCTCGCGGCGTGGAGAGCATCCCCGCATGCCGGTTGTTCTGCCCCTGTAGCTGCGCCAGCACATCGCCCCGGAAGCCCCGATCCGCATCCGTCTTGCGCGGGGTGGCGATCAACGCCCAGCCCTGACACGACGGCCATTTCTCCATGCTGGGATTGTGCATGTTCGCCATGGCCGTTGGCGTATGCAGTAAGCTGCCCGACATCGCACCCGATGAGCCAGGACCGGGGGCGTTCATGGTTGGCCCCGATGTCTCCAGCACGAACCACGAACGGCCAGCAGGCGTAGCCGATTGCCTCCAGAGCAGAGAGCACGGCGTCCGCGCCCCGAGTTCGGAGATTAGCGCTGTTCTCAAGAGCGAACCAACGAGGGCGGCACTCTCCGATGAGGCGGACGGCTTCGAAGTAGAGGCCCGACCTTTCGCCTTCGACACCTTTTCCCTTGGTGTTGGCGCTACTGATGTCCTGGCAGGGCGGGCTTCCGACGATGACGGCTGGAAGGTATCCGAAATCGTCGAGAAGTCGGCCTGCGCTGAGGGATCGAACGTCATCATAGACCTTCACGCCGGGGTTGTTTTCGGAATAGAGCGCTCGCCGCCACGGGATCACCTCGCAGGCCGCCATCGTGCGATAGCCGGCCCGGTGCATGCCGAGAGACCAGCCCCCGGCGGCGGCGCTGAACAGGTCCAGCACGCGCATCACCGCTCCCCCTCCCTCGATGCCGGCACCTGATCGAAGTCATCCCAATGTTTGTTCAGGATGTCCGCGACGCGCTGGGCTACCTCGTCGGGCTTCTCCAAGTAGAGCGCCATGATCAGCATCGGGAACCGAAGGCCGATCGATGTGCCTGTCTCTGTTTTGGTCGGCGGCTTCTGATAGTGGACGGCGATGCACGGTCCGCGCAGATCCTCGTCATAGAGGTTGCGCAGGTAGGAAGTCGAATCGACCGTGAAGCACTGGTGCGCCGAATCGTCGGACACGGGGCGGCAGGAGTGGCCGCCGCCCCGATCCGGCGCCGGCTGCGGCGGACGGCTGTGTGACACGCTACTCATTCGCCGCACCCCCGCCCGACCTGGCCGGCATCGCGGGCGTCCACGAAGTCCTGCCACTGCCGCCAGCCCTGCGGGCAGTGGAATCCCCACTCGCGCAGTTTCGGCCCGGTCATGAACAGCGAGACCGCCCGCCCGCCCGGTGGGACGACCAGCCGGTGGGCATCGGTCGCGCGGCGGGTGACGATGCTGCCGGCCCCACGGTAGAAGCTGCCGGCCGGCGTAATCTCGATATAGACGCCATCGATCACATAGGACGTATTGTCCCATGGGTGATCGTGCAGCGCGCGATTGTCGTCGCTGTGCAGGATCTCGTGCAGGTAGACGTTGCAGCCCTCATTGCGCGGGACGATCCACCAGCGGCGCAGATAATCGTCGCCGATCACGAAATCCGGCCGGCGCTGCATCTTGGCGCGAGCCCAGGCCTGGAGATCGTCCAGATGGATGCCTCCTAGCGAGCCATGTTCGATTACCATTGGCGTTCGTCCCAGCAGGTGTCGCACTCAGCCCAGCGGTTGCCCGAGGACCGGCGAAGCTCGTGGCAGGTGGAGCAGCCCGGAAGCGTGATCAGCGACCGGAGCCAGCGGCGGAGGGTGGAGAGGAGCTTCATGCGGCCACCTGCTTGCGCTGGTCCTGGAGCCATTCGCCGATGCGGGGCCATTCGGCCTCGATCTCAGCCCAGCGCGGCCAACGGCCATTGTGGACGCGGAACTGACCGGGCCACTGGCTGGCCTTGGTGAACTGGTCGCGCTCGGCGCGGGCGACCTCGTTCCAGCGCCGGAAGAAATCCATGGTCTCGGCCGGGACGTGGGCGTGCTGGCTCGGCGGGGACTTGATGAGGTCCGCAACGAAGTGCGTTCCCAGGTGCATGACGGTGCGCGGTGCCGGCAGCTGGAGGCAGACCGGCTTGTAGGTCACGGCCCCGCGGAGGTCGACATAGGCCAGTTCGACCGGCTTGGCGAAGCCGTAGCGCGCCAGAACCTCCATGTCGGCCGGCGGGCGCCGCTGGTCGATGTCGGCGGCCAGCAGCGCGGCGGTCGCATCCCATTCGGCCTCGGGCAGTTGGGCAGCCGCCCAGGCCCCGAAGGCCTTCCGGACGATCTTGCTCAGCAGCTCGCCATCCTGTGTCTTGCGATAGATCGCGGCCGGTGGCGGGATCGGGACGCCTGCCTGCCCGAAGGGATAGGTGATGGCCGGGCGCGGATAGGAGCCCAGCTTGGACGGGGCGATCCTCTGCCCATACATCAGCGCGTGCTTGCGCTCTTCCAGAGTGGTCACGGCATGGCCTCCAGTTCCCGGCGGTGCGGAGACCGGCGGGTGAATTCTTCGGCCATCGCGCGCATGTCGAGGCCGGTGATCTGCTCGAAAGATTCTTCGCCCGCCTCGTGCTGGAGCCGGTGGTGGGCCTGGCACAGGCTCAGGGTCCAGCGGTCGGAGGGCTTGAGCGCCATCCCGCCACCGGTGCCCCGGCGGACGTGGGCCACCTCGATCGGCCCCTGCTGGCATCCCTTGACGGAGCACGCATGGGACCGAACGAACCTGACGTGAGCGGGGCATCGCCGCCCGTCGTTCTCCCGACCGGATTTCGATCGACGTTTCGGCGGAAGCATCAGCCGAGCACCGCCCAGGCGACGATCGCGATCGTGATCGCGTTCATGACCGCGAAGGCGGCCCATTCGCCCGGCAGAGCGGGCCGGGGCGCGAAGAGCCAGTGCCAGGCGCCGCGGGTGATCGGCATCTTGGCCGCCCGCCGTTCGGCTTCCCGGCGGACCGCGCTGCCGCTGTGGATCGCATGCGGATGGATGCCGGCCATCACCGGGCAACCTGCGCTTCGCGACGCTCGGCAGCCTCTCGGGCAGCGCGTGCGGCAAGATAATCGGTCTGCGCCCGGGTGAGGATCCGGCGGGCCTCGCTCTCGCGGATGTAGGCGGAGGCCAGTTCGGACGATACCACCGAGAGGGAGGCCGCCTCGACCGGCTCCGTCGACCGTCGCGCGGTCATCTCGCTGCCTCCAACACCGAGGCGAGATACTTCCGGGCACCCTCCTCGCTGGCGTAGAACTTGCCCGCCGCCGCCGGCCCGAGGAGGAGCAGGCCCGCCGTCTCGGTATCGGTATCCTTTTCCAGCCGATAGCCGTCGGAACCTTCCAGGTGGACAGCCCAACCGGCAATGCAATGCGCCGTGCCGCAGTGCCACTGATCCATGACCAGCGCGTCCGGTTCGGCGAGTGCCGCTTTTGCGACTTCGCGCAGCCGGGCATCGGCATCCGATTTGGCGGCGACAGGGCGTCCGCGCACGGTGAGATTGCCCCACGCGGTGATGCTGGTGCCGCTGAGGTCGAGCGAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCGGAGGTCGAGCGAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCGGAGGTCGAGCGAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCTGAGGTCGAGCGATCCGCCGATGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCTGAGGTCGAGCCAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCTGAGGTCGAGCCAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGTCGCTGAGGTAGAGCCAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCTGAGGTCGAGCCAGCCGCCGACGCTGAGGCCCTCGGGCAGCGCGGTGATGCTGGTGCCGCGGAGGTCGAGCGATCCGCTTACACTCTTCAGCCCGGAAATATCATCGCTGCTTCTGGGGCTGAGATCGCCCTTATAGTGACCTTCGTATCGCATGAAACGTCCTCCAACGAGCGTCTCGCGTCGGCACCGCGCCGAGGTCTGCTCGTGGATTTGACATACCGCAACGGTATGTTTCCGGTCAAGATCGAAATACCGTTGTGGCATATTTCGATATGCCGCCGCGGCGCTACAAAACGGAACCGGAATGATCCAGCTTCAAATTGAAGGGGGAACACCAATGGAACTTCAAATTGGGAAATATACGGCGCGGATTTCCGTCTTACCTCCTATCCATCCGAAGTTTCGCATCACCCCGGAGCCTGCGGCGCCAAATTCCACCGATGGCGAGGAAGAGCAGATCGAGGCGGAGCCGATATCGGGCTTCACCTGCATCATTGGATATACAAGCGGTGGTGGCGAGGTGAGCGAAAGGCTCATTACATGCCGCTCTCTGAGAGCGCATGGGACCGGGCTCACCATTGGCGCGATCTGCCACGCAGCCAAGGGGTTTCGCGCTTTCCGGGTGGATAGAATAAGCGCGGTCTCGGACCCGCACACGGGCGAGGTTATAGGGGACGGCAGTTATTTTAGCCGGTTCGCTGTCAAGTCGTCAGACAAGGCGAGAGCGCCTAGCGGCGGTTGGGGCCTCACTCCCAGCCGGATGCGGACGCTTGTGGCTGGACTGAATGTCCTCTCCTTCATCGCTCGATGCGATGGCTATTGGCACGCTCTCGAAGATGAGCCTTTGGCAAATTTCATTGAGAGGCTCTGGATGCGCAAGGACTGGGAGGGGCAACCTCCGGTCGCGGCCATACTGGATCATGCTAAGCGACTTGCTCCAGATTCCTCGATATTTTTCGCATCTCTTGAGACGATTTTCGGCAGTCGCAGCAGCACCCGATTGCTAATTTCTTCAGTGCGCGAGTTGATTGAAGCCGACGGCATTATCCAGGATCAAGAGACCAACTGGGTCCTCGCCATGCAGCAGCACCAAGAAGAATATTACGCCCGAAAAGGGCTACTCGGGTCGATATGAACCCACTACGAGCGCAATGATCCGCACCTCTTCGATGTCATCCTCGTCGTGGCCGATCTCGATCGGCCGCTGGAAGGCGGGATTGTTGGACCGCGGAATGAGCCACTCCCTCCCGTCACCGTCGATCCGGTATTCCTTCACGGTCGTCTCGAAGTCCTGACTGACGCGCCGTCGCTGCACGACCACGCGCTTTCCGTTCGGGATCGCCGCCTCACCCCAATATGCGACGCATTCGACGATCGTGCCGTGGGGATATACAATATCCATGCTGTCCCCCTCGACGCGCAGCCCAAACCTGTCTCGGACGGGCGCGGCGACGTCAGCTCTGCCCGTGAAGACCTCCCATTCATCGGGCTCATATTCCCAGACCTCTTTCCACACGCCGGCAGCCACCACCCCTTTGACATACAGACGCGGCCCCAACGCCACCAACGGGCCCTCAGCGAACAATTCCCCCGGCTCAACTCCAAGCGCCGCGGCGAGCCGCAGTATATCAGCCATGTCCGGGGTCCGAGCACCGCTTTCCCACCGCTGAATGGTGGGCTGCTCCACGCCGACTGCCTCCGCCAAGGCGCTCTGTGTCATTTTACGCAGGTTGCGCAGCTTCTTGAGCTGGAGTGAATACCGCATTGGCATATATTGCCCGCTCGGCGCCGGCAACGTAACGTCCAGAACGGAATAAACGCTTGCATTGCAGTATACCATGTCGGTATATCCATCAGTATGAAGCTGGTCGACTATCTCGAAGCGCATAGCCTTACCCACGAGCAGTTCGCGGGCATCCTTGGCTGCAGCCAGCCGACGGTGACCAGATTCGTTCATGGCATCCGGCGGCCGTCTAGGTTCCTCATGCGCAGGATCGCTGAGGCCACGGGCGGGGTGGTCACCCCAAACGATTTCCTCGATGAATCTCCGCCCCCTTTCCAGGACCCACCCCGTCGCCACCGTCGAACGGCACCCGGGATGGCCACAGCATGACCGATCTCGAAAGGATTACGCCGATCGAGGCGGTCTCCGAGGATCGGGTGCGCGAGATCCTCCACGAGGAGGTGACGCCGACGCTCGATAGAATTCTGGCGGAGCTTCGGGCTCTGCGAGGGGAGGATTGAGGGATGGTGGACGAACTGAACTGCTCGTTCTGCGGAAAATCCAACGAAGAGGTGGCCCATCTGATAGCCGGCACGAATGCCCTCAGCGTGGATGCCTGCGCCAAAATCGTGGCCGATGGCACATCTCGCGAGGCGGCCGCCGCGGCGACCGCCTCCAACCAGCGGCTGGAGTGTCTTCGGCTCGCAATGAGCCGGGACTTTATCGATCCCGTAGCGGTCGCGCGGGAATTCTACAAATTCGTGCAGAGCGGGACGCCCGCCCCCAAAGGAGCGGGCGATGCTTGAATCATTTGGGCAGGCGCCTGCCCGAAGCCGCGTGATAGCAGCGGGCATACAAATCGAGATAGGCATCGACCCTGTCTTCCTTGGATTCGGCTTCCGGAAGCAGGTTTCGCAGGAACTTCGCCATCTCCCAGGCAACCTGGGCCGGACTGCCGCCGTCGGGGTGGTCGCACGTGACTTGAACCTTATCGGACATGTTGGCTCCTTCTCGGTCGTGTGTGGCAACCCGACTGTAGCCGAAGCCGGGGTCGTAACAAGCGGCCCCGGCGGAGGGCTAGCCTGATGTTCGTATCCCCGATCCCGAGCCTGATCTTCCCGACGCTTCGCGGGTCGATGGCCTTCCATCCGGTCGCCTCGCGGCCGGCGGATGAGCGCGAGGCCCGCGCGCTGCGGGTCTGCGATCCGGTTAAATGCCCCTCTTTCCATGGTGATCGTAATGGCTGACGCTCACAGCAATGTCGTTCCGCCCGCGCAACCTCTGACGGAAGCAGAGTTCCGCAACGCCTGGCTGCAATCGCTGGCGCGGCTGTGCGCTGCCCACGGCGACGGGCGCGTCGCCCTGGCGCTTGGCGTGTCCGAGCGGCACCTGCGCAACCTGAAGAGCGGCGCCAGCCTGCCGGCTGCGGATCGGATCTGGAACCTGTTGGCGCTGGACCCGTCCGCCCACGACGAGATCGACGCCAGGTATCAGGTGAAGAATTCGCCGATCGATGCGCTTTGCTCCACCGATCCGCTGACGCGCGACATCATCGCGCTCGCGAACGAGGTGGCACAATCGGAGGACCCGTCCAGCCCGGGCGGTGTCGTCGTCACCGATCATGAGCTGCTCCAGAAGGACGAGCACCGGATGCGTCGCATCCACAACACGCTCGGCAGCTGGCTGAAGCGGATCGAAGCGCTGCGGCGCCCGGCCATGAAGGCGGTCGCCTGACCACAGAATTGAGGACCTGATCCGGGGCGGCCCGGGGGGAGATTGAGAATGACGGTGATGGAAGGCGCGGCGCCGACGATGGTCGGTTCCGCGGGCGATGATTTGCGTCCGGCCGCATGGCCGTTCGGGGGCCTCGGGATGTTCCGATACGGCGCGATCATCGCCGATCCGCCGTGGTATTTCCGGAATTACTCGTTTGCCGGCGAGACGAAGAACCCCGTCGCCCGCTATGCGTGCATGTCGACGGACGAAATTGCCGCCCTGCCGGTGAGCCAGCTGGCGGCGCCGGACTGCGCACTGTTCATGTGGGCAACCGCGCCGATGCTGCCGGATGCTATCCGGCTGATGAAAGAGTGGGGCTTCACCTTCAAGAGCGCGGGCGCCTGGGCGAAGCAGTCCGCCACCGGCGAAAAATGGGCCTTCGGCACGGGATATTGCTTCCGCTCGGCGTCCGAATTCTACCTCCTCGGGACGATCGGCAAGCCGAAGGTGCTATCGCGGTCCATCCGCAATCTGATCGTCGCGCCGGTGCGCGAGCACAGCCGGAAGCCTGACGATCTGCATCGTGATGTCGAGCAACTCTATGCCGGGCCTTATGCCGAGTTGTTCGGCCGTGAGCAGCATCCCGGCTGGGATGTCTGGGGAAACCAGACGGACAAGTTCGGGGGAGCGGCCTGATGCGCCCGGTCCCCGACGACTTCGCCGCGCTGGCGGCGACGATGAGCCGCGCACAGATGCAGGCGCATTATCGCGTCCGATCGAGCACGCTATCGGCGTGGTATGTGGCGGCCGGCCTCCGGCCTCCCGTGCCCAGCCAGCAACGGCCTGCGCCTGCTGACTTCGCCGAGCATGGGCGCCGCCCCAGCGCCGAATTGCGCGAGCGATATGGCTGCAGCAACGAACTGCTGGCCCGCTGGCGCAAGCAGCACGGCATCAGCATGGCTGACGGCCCCACCGCCCGGCCGGTGCCCGAGGATTTCGCCATCCGAGCGCGGAGCAGCACCAACCGCGAGCTGGCGGAACATTATGGCGTCGGCCGGGCGCTGATCTCACGCTGGCGAGCCAAGTGCGGGCTGTCGGGCGGCATATCGACCTATCGCTGGAAGGTGCCGACACCGACCCAGGTTGGCGCGCGCGACTCTTCGCTTGCCGGCCGTGCCGCCGATCACCTGCGCCTCCCGCGCGCTGGCAGCTGGGTGGTGTTCCGCTGCGACGCGGCCGGCACGGCCGATCCTTCGGGCGGCCACTTCCGCGTCGGCACCCGCCTGATGACCGAAGACGAGATGATCCAGTTCGCGGAACGGAAGGGCTTCGCGGCCTTCCCGGCCGACGCCCTGGGTTCCAGCCGCCAGCACGGGGCGGCCCTGTCGTGACCGTGCACCTCAACCGAAAGGACAATCAGATGGCCGAAACCGGCGGCGCCGACCAGTTGCGCCTGTTTATCGAACGCATTGAGCGGCTGGAGGAGGAGAAGAGGGGCCTCTCCGACGACATCAAGGATGTCTACCTGGAGGCAAAATCCCAAGGCTGGGATGCCAAGACGATGCGCGCCATCGTCCGGCTGCGGAAGATGGAGAAGCATGCGCGCGACGAAGCCGAGGCGCTGCTCGAAACCTATAAAGCCAGCCTGGGGCTCTGACGGTGGGACCGCTCGTCCTCGGCATCGCCTGCCTGCTTATCGGCTGGATCATGGGCGCCCGTTGGCGAGCGGGCCGCCTTCCGGCTGCGCCGTCGATATCACCGTCCGCCGCCGGGCGGGCACTTGCCGCCCGCAGCCGGGAGGAACTGGCGCGGCGGCGGGAAGACAAGACGCGGCAGCTGCAGCTTGAGATCGAGCGGAGCCGCCGGTGAGCGCCGCCCGCGAGTGGGTGCCGCACCATGGCGGCCCGAACCCTTTCCATGATCCCGATCAGCCCGATCGGGAGGAGCCGGACATCATGATCCGATACCGCAACGGGCGCGTGTTCGGCCCGATCCAGCCGGCGTCACGACGCTGGGCCGCCTGGACCGTGCGGCCGGGCCGGAGCGACTGGGATATCGTCGCCTGGAGGCTGGCGTGAGCGCGGCCCTCCTCTGCCGTCTGATCGAGGCGGGGACACCGGCAGCGCTTGTTGCTGAGGTCGCCGCCGCCCTTGCTCAGGCTGATGCTGCTTCCCTCGCACTGCGCATGCGTCGGCAGGCCGATGCCGCACGTCAGCAGAAGAGGCGCGCAGAAGGTAAGCCCAAGGCCGATGCCGATGCCGATGCCGATGCAGAATCACGTGACGTCACAGGACATCACGTTATGTCACGTGACATACCCGAAGATAGCGTGACGTCACGTGACATTGCCGAAACTCCGGAAAATGTCCCCCCACACCCCCCTAAAAATATCTCTCTTAGAGAAGAAAACCCCGTCCCCCTAAAGGGGGACCTCCCCCTTTTCGGATCACCCGAAATCGACCGGGTCGTCAGTGATTGGAACGCCATGGCGGGCCGCACCGGGCTGAAATCGATCCGGACGGTCACCGCCGAGCGCCGCGGTCGCATCCTCGCCAGGCTGAGGGAGCACGGCCCGGATAGCTTCACCGAGGCGATCGCCGCGATCGAGCGATCTCGCTTCTGCCGTGGGCAGAACGACCGGCGGTGGCGGGCCGATTTCGACTTCCTGCTTCAACCCCGCAGCTTCGTGAAATTGATCGAGGGCAGCTATGAACCCGCTAACGACCGCTACGCCGAAACGGACATCGCCAACCCAATGGTCCGAGTCGCGGCTACCCGCCGTGCTCGCCGATCCGCTGAGGCGGGCGGCGGGTTCTGAGTTTGGCTATGTGCCGCTCGTCTCCGCCGATGTGGCGGCGACGATCCCCGCCGCTTTGGCGGCTTTGAAGGCCGGCATGGCGCCCTCAACCGAGGAGCAGATCGAGGGGCTGATGGGTAGCATTGCGCTGCTTTACCCGGCCGCCAAGGTCACCGAGAAGGAGGCCGAAGCCCGGCTGGACCTCTACATCGACCTGCTCCAGGATATCCCGTTCGACATCCTGTCGGCCGCGTTCAAGAGCGCTGCGCAGACGAGCCGGTTCTTTCCCACCGTGGCGGAGATACGGGAAGCTGCCCTGCCAGCCCGGCGGGAGCGGCTCAACAAGATCAATGGGCTCAAGGCGTTGGCGCTGAAGCACCGGCTAGAAGGCGAGCAAAGCCGGCAGGCCCAGGCGCCGATGTCGGCGGCAGAGATCGAGGAAGCCAACGCGATCTTCCAGCGGCTGGGCATCCGGACCCGTTATGCGGCCGACGGAACATCATATGAAATCGAACGCGGCAACACCGGCGATCAGGAAGCGAAGGCGGCATAATCACAATGGGAAAAGCTACTCGGGTCGACAATGAGCGCAGCGCGGCGATGGCGGCCGCCCGCGCTGCTGCCATCCTCAAGGGCGACCGGACGGTCAGTGTCGCGCCTCCGCCGCCGGCCAAGCGCGCCCGCCGCGCGCCTGCCCAGAAGCCGATCGATGCGACACCGGAGCGCATCTCCATGGCCGAGCCGGATGGCAAGAATGGCGTCCGCTTCCACGACACGGTCGAGGCGATGATCGACAAGGCCGGTCAGCGCGCTCTCATCACGCGCCGCTTTGCCGATGCTCAGATTGACCGGTGGCTAAAGCAAAAGCGCCTGACCTATGCGCAGTGGTATGCGGCTGACTGGTATCGGAATCAGTATGCGTTGGCGGGCATCGAGGGTCGGGTGGTGGCGCAATACGATATCACGCATCCAGGGGCCGAAGGGAGCAGCTACGGCCTGCCAGCCAATGAACGACAGTTGCGCGCCCGCCAGCGATGGCGGCAGGGAAGGGCCTTGCTGCCGGAGAACATGGTGGACCTTGTTGACCGGCTGGTGATCCACGACGTCGCCCCGGCCCTTTCCAGCGGCCGGATGCGCGATCGATATGCAGCCCGCATAGGCCGCGCGCTGGACCCGCTGGCGGACTGGCTTTCGGCGCCAGCGTCCGCTTGACAGGTGTCCCGTTTCGGATAGTGTGTTCGATAGTTGTTCTAATTGCGCCCGCAGCCATTCCCCCGGCTCGCGGGCGCAATCCGTTTCAGGGCTTTCGCAGGTCGGGCGCGAGCCGCGTGCCTGATGCGGGCCGGAAGCCCCTCGCCCGGCCCGCTCCCCCTTTCGTTTGACCGGAGGGCCTACTCCACAGGCGACCGAGGCGGCGCCTGCGATGCTTCCGCTGCTGCACACGCGATGCAGTCAACCCGGCGCTCTCCACGGCTGGCGAGCCGACCACTGGCCGGGAAGCGGAGGAACGCCTCAACCTTCACGCCGCGATCGACAGGCGGACACCGAGCGCACGGGTGAGCTTGACCAGCGTGTCGAGTGTCGGGTTGCCGTTTTCACCGAGCGCCTTGTAGAGTGCCTCACGCGAAAGGCCGGTTTCCTTCGCCAGCGCGGTCATGCCGCGTGCACGAGCCACATCGTTCAGCGCGGCGCGGATGATCGCCGGCTCGCCATCCTCGAATGCCGCTTCGAGATAGAGGCGGATGTCCTCTTCGTCGTTGAGAAATTCGGCGGCGTCCCACCGTTCCGTCTTGATAGCCATCGTCATTCCTCCAGATCTTCCACCATCGCCAGCGCGCGGGCGATGTCGCGGTCCTGCGATTTTTTGTCCCCGCCGCAAAGCAGGACGATCACCGTGTTGCCCCGCTCCACGAAATAGACGCGGTAGCCGGTGGCGTAGTCGATCTTCATCTCGCGCAGGCCCTTGGTCAGGTTGCGATGCTGACCGGGATTGCCCAGCGCCAACCGGGTGATGCGGACCTCGATACGGGCCTTGGCGGCGCGATCCTTGAGGCCCTTGAGCCAGTCGGCGAATTCGGCGGTGCGGCGGACTTCGATCATGTGTTGACTATAGTTCACGGGATGCGGGCTGTCAACTATAGATTACGAATTAAGGGGTGCTGCCATGCTCGTCCTGCTCCCCATGTCCATGTGGAAGGCGCACGTCCACGGCGCCGGCATCTGCATCAGCCTGTGGCAGGACGAGTGCGGCAACCGCTTCTGGAAGCGGATCGGGCTGCGCTGATGGCCATGGTCAGCGTCACCAACGACCTCACGGTCAACACCGCCTATGTAGCGTCGGTGTCGTGGGATCGTGGAGATACCTACACCACGCTGGTCATCACCATGGCTGATGGGACTGCCCATCGGGTCAAGCACCAGGGCGGGCCATATGGGGTGGATGCCTATGACGCGGAGCGAAGATTGCTCGCGGCGGCGGAGAGATGATGGGTCGGCTCAACACGCTGCCCAGCCGGCCCGCCTGCCAGCGGTGCCGCAAGCCGATAGTTCCGCCCGGCACGCCTCCGCCTCCGTCATCTGCGGACGATTAATGCCCAGCCGCCCGCCCGGCCTCTACGCCAAGCCGAAAAGCAAACCGTGGGAAAGGCAGAGCGCAGCATCTGCCCAGCGCATCCGGGGGCGGCGCGGCCAGCAGATCAGGGCGGCGCACCTTGCCGGTGAACCTCTATGCCGCGAATGCCTGAAGCATGGTCGAGTGACCGAAGCCGTCATCGTTGATCACACCCTTTCGCTGGCCGAAGGAGGCGAGGACGTGCCTGACAACCGCCAGAGCCTCTGCAAGGCATGTAGCGATGCGAAGACGGCCCAGGAGGCGGCGCGCGGGCGCAGGCGCGCCCAGCGGCCGGCTTGACCCGGAGGGGGAGGGTCAAAGTCATGGGCCGATGTGCCGGACACCGCGCCCCCAGTCTTTTTTTCGCGCGCCCGAATTAAACTTCCGGGCCAAATTAAATTTCGGAGACATCGCCATGAAGCCTGGACGGAAGGCCGAGGCACCTTCCACGAAGGCGGCTCGCGGCACGCTGCGCCCGTTCCGCGACGGGCTGAAGAACGAGCTTGTCGTTCCGGGCGATCCTCCACTGATGCCGGACTATCTGACGGCGGAAGCGCAGGACGTCTGGCAGGAAGAGATCGGCCGCGTCATGGCCGCCGGCGTGGCCGAGATCGATAGTTCGCTGTTTGCGCGCTACTGCTCGCTGGAAGCGCTGGTCCGGCAGTCGTTCAACGCAGGCCGAGAGCCGCCGCCAGCGTCATATCTGACCGTCCTGCGCCAGTATGCCGAGCTGCTCGGTATCGCCGGTCGCAAGAGCCGCGTCGGCAAGATCGCCGATGATCCGACGAAAACCAGAAACCCGTTCGCGCGCAACGGCCACCGCGCGAAGGCGTAAGCGGCCGGCGTTCCCGGATAGCGGGCACGCCCGGAACTACGCATCGATCGCGCTGGAATATGCGAGGGCGGCGGCGGCGGACACGGATCAGGTTCGCCACTGCAAATGGGTTCGGCTGGCGGCGCAACGGCATCTGGACGATCTGGAGCGCGCGAAGTCTAACGTGTGGGGCTATCGCTTCGACCCTTGGCACGCGAACGACATATGCGATTTCATCGAGAACCTGCCCCATATCGAGGGAAATTGGTGCAAGTGCCCGCGCGCAGATGACGGCATGCATCTTGATCGGTGCGGCAAGATCGACCTTGAGCCGCCGCAGATATTCATCCTCACCACCGTATTCGGGTGGCGCCGCAAGGATAACGGACTTCGTCGATTCACGGTGGTTTACGAGGAGGTTGCGCGCAAGAACGCCAAGTCGACGAAGACGGCGGGCGTATCGCTCTATTGCCTCTGCTGCGAGAACGAGACCGGACCGCAGGTTTTGACGGCGGCCACGACGTTCGATCAGGCGAAGAAGGTTTTTCATCCGGCCAAGCGGATGGTCGAGAAGACGCCGGAGTTGCAGGAGGCATTCGGCGTCACGGCCTGGGCGAAGTCGATCACGTGCGGCGACAACGGCGGATATATGCAGCCGCTGCACGCCAAGTCGAAGACGCAGGACGGGCATAACCCGCATCTCGTCACCATGGACGAGCTGCACGCCCATGCTGATCGCGGTCTTTATGACGTGATGCGCTCGGCCTTTGGTGCCAGGAAACAACCGCTTCTCTGGCAGATCACGACGGCCGGCTCGAATGTCCATGGCGTTTGCTACGAGCAGCGGACAATGGCGACGAAGGTGCTGGAGCGCTCGGTCATCGCCGAGCACATATTCGGTATCATCTTCACGCTGGACGGTCCCAAGGATTTCACGCCGGAGCGGAAGGTCGGCGACGATCCGTATGACGAGCGGAACTGGATCAAGGCGAACCCGCTGCTCGGATCGGCGGTCCAGCTGGACGAGTTGCGGCAATATGCCATCGAGGCGAAGAACAGCCCTTCGGCCGAGGGCGAGTTCTTCACCAAGCGCCTCAACAAGTGGATCGGCGCCGCGAGCGCCTGGCTCAATGTCAGCCAGTGGATTGCCTGTTCCGATCCGTCGCTGAAGCTGTCCGATTTTCGCGGGCTTGATTGCTATATCGGCGCCGACCTGGCCGATAAGGACGACATTACGGCGGTCGCTCTTGCGGCGCTCCACCCCGACGGCCGGCTGTTGCTCAAGACATGGTTCTTCCTGCCGGAAGCCGCGCTGGCGCGCGACGATCCTGCGTCCAAGCAGATCGTCGAGCTTTATCGCCAGTGGAAGGATGGCCGGTGGCTTTGGACGACGCCGGGCAATTTCGTCGATCACAACAGGGTCGAGCGCCTGATCCGGCGGCTCCAGAAAGTGCTGAGCGTGAAGCGGATCACGTTCGATCAGTTCGCCGCTGCTCAGGCGATGGCGTCCCGGTTGAACGAGGACCTTGGCGACGGCGACGGTGAACTGGCAGCGATCCTGTCGAAGAACGCGGCCAACGTGACGGACCCGGCCAAGGATCTGGAAGCCAGGGTAAAAGGCGGGCCGCATCTGCTGTGCCATGATGGAAATCCGGTGATGACGTGGATGGCCGGCAACGCCGTGGTGGACCGGCGGGTCAACAACACGATCCTGCCAAAGAAGGAAACGCCGATGTCGCAGAACAAGATCGACGGCATCGACGCTGCCATCAACGCCATGGCGCCGATGCAGCTTCCGCCACCGGCTCCCGTCGTGTCCCCATGGGACGATCCCGATTTCTCGCTGGTGGCGGCATGAAGCTGTTCGGCTGGGAGATCGGCCGCGCGGAGACGCGCTCGGCCTCGATCGAGAACCCGCGGGTCAAGGCCGACGCCGAGGGTATTATCCGCTATTTCGGCGGGGGCGATGCGCTCGATATCGGTCCGGTGAGCGTCGAGCAGGCGATGCAGGTGCCCGCCGTCTTCGCGGCCGTCAACTTCCTCACCCGCACGCTCGCCGCGCTGCCGCTGCATGCATGGCGGAAGGTCGGCGACCAGCCCGAGCGCATAAAGGGCGGTATCGAGACGCTGGTCCACGAAGCGCCGAACACGGAATGGACCAGCTTCGGCCTTCGCCAGTATTTCTGGAATCAGGTGTTCACGCGCGGGCGCGGGCTGCTCTGGATCGAGCGCGGGCTGAACGGCCAGCCCGTCGCCCTGCGGCCGATCAACGTGACGCGGACCGCCGTCGAGATGGTCGGCACGGGGCGCGTCTATTCGCTGGATGGCCGCCGCTATGCCAGCAGCGAGGTCATCGACCTGCCGTTCGCGCTGAAGGACGATCAGGTTGGCGTCGTCGGGCCGATCGCCCGGTGCGCCGGCGCGATCCGGCTGGCGCTCAACATGCAGGTCTATGGCAGCAAGTTCTTTGCCGGCGGCGGCGTCCCGCCGCTGGCGCTGACCGGGCCGTTGCCGACCGGCGCGGAAGCCCAGCGCCGGGCCATGGCCGACGTTCAGCGCGCGATCGATACCGCGCGCGCGACCGACAAGCCGGTATTTCCGATCCCGCCGGGCTACGACCTGAAGCCCGTCGCGGTCGATCCCGACAAGGGGCAGATGACGGAGGCGCGCCGCTTCCAGCTGGAAGAGATCGCGCGGGTCTTCCAGCTGCCGCCGGTCTTCCTGCAGGATTTGTCGAAAGGCACGTTCAGCAACACCGAGCAGCAGGACCTGTTCTTCGTCAAGCATCTGATCGGCCAGTGGGCCGAGGCGCTCGAACAGGAAATGAACCTGAAGCTGTTCGGGCAGATGAACGGCCGCCGCTATGTCGAGCATAATCTCGACGGGCTTCAGCGCGGCGACTTCAAGAGCCGGATCGAGGGCCTCGCGCGCGCGATCAACACGGCACAGCTGACGCCGGACGAAGCCCGTGAACTGGAAGGCCGCCCGTCCAAGCCGCACGGCGACAAGCTCTACCTTCAGGGCGCCACGGTCCCGCTCGGATCGCAGCCATTGAAGCCCGCTACTGGAGAACCCAATGACGGTGAAGACCGAAACCCGGACGCTGAGCCGCAGTCCTGAGCTTCGCGCGACCGGCACGGGCCGGACGCTGACGGGCTATGCTGCGGTGTTCAATTCGCCTGCCGATATCGGCGGCGCGTGGATCGAGACGATCGAGCGCGGCGCGTTCACCCGCGCTCTCGAAGGCGATATCGTCGCGATCATCGGACATGACCGTAACCGTGTCATCGGCCGCACCGGCGCCGGCACGCTTCGGCTGAGCGAGGATGACAAGGGGCTGCGTTTCGAGATCGACCTGCCTGACACCACCGACGGCCGCGACGTCGCCGTATCGGTCGAGCGCGGCGATATCGGCGGCATGTCCTTCGGTTTCTCGGTGACCCGGCAGCAATGGGACGAGACCACCGAGCCGCCACAGCGGACGATTCAGCAGGTCGAGCTTTACGAGATCACGGTGACCGCGTTCCCGGCCTACCCTGACACCAGCGTCGGCCTCCGCTCGCTGGACGATGCACGGAAGGAAGCGGCCATCGCGGCGCGGGAGGCGAAGCGCTCCCATATCGATCGCCGGCTTCGGATGCGCGCCCAGCTGGATATTCGCTCGCGCGCCTGAGTTTCTCCGTCCCTCGCGGGACGCGCCTGTATCGCCCTTCGGAAAGGCATGCAGCCCGGTGAGGCCGGGTCATTCCCAAGGATCAGAAAATGAGCAAGCTCAAGGAGCTGCGCGAAAAGCGCGCGCGGCTGATCACCGAGGCGCGTTCGCGCCTGGACGAAATCACCGCCAACACCGACGAAAGCCGTGCCGCCGAGCTGGAAGCCCAGCACGATGCGGCCATGGCCGAGATCGACAAGATCGAGAAGCAGATCGAGCGCGAGGAGCGCATGGCCGAGCTAGAGGCCAACGCCCAGCGCCAGAACGAGGAGGAGCGCGAGCGTCGTCGCCCGACCGAGCCGAACAACCGGCAGCCGGCCGGGGACAATGGCGAGGGCGAGCCGGAATATCGCCACGTCTTCGCCAAGATCATGTGCGGCGTCGAGCTGGCCGACCTGTCCACCGAGGAACGCGCCGTGCTGCGGCAGGGCGTCACCAAGTTCAAGGACGGCGAGCAGCGCGCGCAGGTGACCGGCACGAACACGGCGGGCGGCTATACCGTCCCGACCGAACTGGCGGCCGAGATCATCAAGAGCATGAAGCTGTGGGGTCCGATGTATGACGAAGCGATCTGCCGCGTCGTCAACCACTCCAGCGGCCATCCCTGGGCGATCCCGACGGTCGACGACACGGCGAGCACCGCCGGTGCGCATACCGAGGGCTCGGCGCTGACCGACGACGGCGGCAAGGACGTCACCTTCGGCCAGAAGGTGCTGGGCGCGTATGCGTTCGACACCGAGTTCGTGCGGTGGAGCTGGGAACTGGACATGGATTCCATCTTTTCGATGGAAGCCCTGCTCGGCGAACTGCTCGGCGAACGCCTCGGCCGCATCGCCAACACCCAGCTTACCTCCGGCAACGGCACGACGGCGCCGAACGGTATCGTCACCGCGTCGACGCTGGGCAAGACGGCCGCCGCTGCCGCCGCAATCACCTCGGACGAGGTGCTGGATCTGTTCCACTCGGTAGACCCGGCCTATCGCACCTCGCCGAAGGCGCGATGGATGTTCAACGACGCCACGCTGCTCGTCCTGCGCAAGCTGAAGGACGGTGACGGCAACTATCTGTGGCAGATGGGGGACATCAAGACCGGCGCGCCCGACACGCTCTGGTCGAAGCCCTACTCCATCAACCAGGCGATGGACTCGGTTGCGGCGTCGAAGAAGCCGATCGTCTTCGGTGACTTCGGCAAGTATTTCGTCCGCAAGGTCGGTTCGCCGATCATCGGCGTGCTGCGCGAGCGCTTCTGGCCGGACATGGGCATCGCCGGCCTGATCCGCTTCGACGGCGAGCTGGGCGACACCGCCGCCATCAAGCATCTCGCCACGCCTGCCAGCTAAGGCTGGTCAGCGACAGGGACCGGGGCGGGCCAACAACCCGCCCCGTCTTCGCTCATGAAAATCAAGATGCTCACGTCGATCTCGGGGCAGGACTTCGCGCTGTCGCCGGATGACGAAACAGATCGCTTCACGACCAGCGAGGCAACCCGCCTCATTGACGCCGGCTATGCCGTCCCGGTTGCCGACAAGGCGACCGAAAAAGCGGTGAAGACGGCGCCCCCCGAAAAGCGCGCGGCCAAGCGCGCGGCCAAGGAGTAACCGATCGTGGCCGACAATCTCACGACGCCGGTGGCAGACGGTTCCGTCCTCGGGACCAAGGATATCGGCGGCGTCCACCTCCCCAAGAATGTGATCGTCGATCAGGCCGGCGCCGATGCGATCGGCCTTGTCGCGTCCGATCCGGCGGCGAACAGCGTCCTCGGCCGCCTGAAGGCCATCTTCGATCGGTTGGGCGATGCGCTCTCTGTCACCATAGCGTCGCTGCCGCTGCCGACGGGTGCGGCCACCTCGGCCAAGCAGGACGATATCCTGACGGCGCTTGCCGGTCCCTTCCCGGTGACAGGCGATTTCTACCCCGAGACCCAGCCGGTGAGCGCCGCCAGCCTGCCCCTGCCCGCCGGGGCGGCGACGGCCACGAAGCAGGACGCGGCCACAGCCGCGATCGAAGCGCTCGCGCCACAGGGGCCGGCCTTTGCGATCACGCCTCATGCCAGCGATCCGCTGGAGCGCGAGATAGGTGCTATCTCCATCGTCACCGGCGGGGACATCACCTATCGCTATCCCGGCGAGGGTGGCGACCGCACGATCACGCTGCCCGCTGGCTTCTTCCCGCTGCGCGCGAGCCATATCCGCGACAGCAGCACCGCGAGCGGTCTGACGGGCTTCTGAGCGATGCTCGCGCACCTGATCGTCACCGGCATCGTGCCGACGGGCGGCGACGCTCCGCCGGCAGATGACGGCGAGCCGGTTTCGCTGGTCGAGGCCCGGCGGCAGTGCCGCTTCCTCGACAACGACACGACGCATGACGCGACGCTGGATCATCTGCGCAAGGCCGCGCGGTCCTACGTCGAGGCATATACCGGGCTGTCGCTGATCGAGCGTACCGCTCAGCTGCGCTTCACGCGCTGGGCGGACATCGCCCGGCTGCCGGTCGCCCCGGTTTCGGCGGCGACGATCCAGTATATCGGGGCGGATAACATCACCGCCACGCTCGATCCCGTCGATTATGACCTGTTCGTTGACGGCTACGAATCGGGCGTCGACCTTCTCGCACCGGCACCATCCCTCGGCCGGGGGCGCGCACCGATCACGGTGCAGGTATCCACCGGCTTCACGCCGTCGTCAGTCCCCCTTCACGTCAAGCAGGCGATCCTGCTGCTGATTGCGTGGTGGTTCGACAATCCCACCGCGATCGTTACCGGGCCTGCGCTCGGAGAACCGCCCCACGCGGTGGCGGCGCTGCTCTGCAACGACAGGCTGTTCGCATGAACCTGGCAGGCCGCCTTCGCCATCGCGTCAGCATCTGGCGCCTTGTCGATATCGACGACGGCAAGGGCGCCTATGTGCGCCAGTGGACGCAGGTCGCCGAGGTGAACGCCGAGGTGATCGGCCAAGGTGGTCGCGAGGCGCTGATCGATCGGTCGATGCAGGGGATCGGCCAGTATCGCGTCACCATCCACTGGCGCGACGATATCCGCACCAACGACCAGGTGCGCTATCGCGGGCGGAATCTGAACATCCGGTCGATTGAGCCGGACGCGACCGATCGGGTGTGGCTGTCCATGATGTGCGACACCGACGCGGAGGCCATCGGCTGA
Protein sequences of DBSCAN-SWA_2 >NC_020561|1047135:1072457|1057785_1058040_+|WP_015457757.1|DBSCAN-SWA MKLVDYLEAHSLTHEQFAGILGCSQPTVTRFVHGIRRPSRFLMRRIAEATGGVVTPNDFLDESPPPFQDPPRRHRRTAPGMATA >NC_020561|1047135:1072457|1070706_1070898_+|WP_144061995.1|DBSCAN-SWA MLTSISGQDFALSPDDETDRFTTSEATRLIDAGYAVPVADKATEKAVKTAPPEKRAAKRAAKE >NC_020561|1047135:1072457|1048580_1048820_-|WP_015457740.1|DBSCAN-SWA MTAREQNLAVLAALPPADNSNDRLIPFARVEQIAGLKRTRIYKLIREGKFPKPYKPGGFGSRWSEAEVRAWLEAVKQAA >NC_020561|1047135:1072457|1049736_1049922_-|WP_015457743.1|DBSCAN-SWA MIQPNERDAVVAWLRRLASQSRTPEILAFWGDPFKRDGQLLCVELADAIERGEHLKGQGNG >NC_020561|1047135:1072457|1056231_1057062_+|WP_144061992.1|DBSCAN-SWA MIQLQIEGGTPMELQIGKYTARISVLPPIHPKFRITPEPAAPNSTDGEEEQIEAEPISGFTCIIGYTSGGGEVSERLITCRSLRAHGTGLTIGAICHAAKGFRAFRVDRISAVSDPHTGEVIGDGSYFSRFAVKSSDKARAPSGGWGLTPSRMRTLVAGLNVLSFIARCDGYWHALEDEPLANFIERLWMRKDWEGQPPVAAILDHAKRLAPDSSIFFASLETIFGSRSSTRLLISSVRELIEADGIIQDQETNWVLAMQQHQEEYYARKGLLGSI >NC_020561|1047135:1072457|1058877_1059384_+|WP_015457760.1|DBSCAN-SWA MVIVMADAHSNVVPPAQPLTEAEFRNAWLQSLARLCAAHGDGRVALALGVSERHLRNLKSGASLPAADRIWNLLALDPSAHDEIDARYQVKNSPIDALCSTDPLTRDIIALANEVAQSEDPSSPGGVVVTDHELLQKDEHRMRRIHNTLGSWLKRIEALRRPAMKAVA >NC_020561|1047135:1072457|1049921_1050485_-|WP_015457744.1|DBSCAN-SWA MTDDTALREMVRAAKRYVECSIDPALSPFIRENGRYGTKSPWSELREAIAPAEVTLTSPTPAQPVQSVHQFDADDMEAVGRSLMETIESNRSHPLLKDWAPTDDPAEIVADLLNAYDEKLAPAQPEDVVERVARALYACEKERSDHTGKVLSAAKGHPVNFRMEPWDDVAELYRGDARAAIAAAGGR >NC_020561|1047135:1072457|1060786_1061023_+|WP_015457763.1|DBSCAN-SWA MAETGGADQLRLFIERIERLEEEKRGLSDDIKDVYLEAKSQGWDAKTMRAIVRLRKMEKHARDEAEALLETYKASLGL >NC_020561|1047135:1072457|1053780_1054452_-|WP_144061991.1|DBSCAN-SWA MTTLEERKHALMYGQRIAPSKLGSYPRPAITYPFGQAGVPIPPPAAIYRKTQDGELLSKIVRKAFGAWAAAQLPEAEWDATAALLAADIDQRRPPADMEVLARYGFAKPVELAYVDLRGAVTYKPVCLQLPAPRTVMHLGTHFVADLIKSPPSQHAHVPAETMDFFRRWNEVARAERDQFTKASQWPGQFRVHNGRWPRWAEIEAEWPRIGEWLQDQRKQVAA >NC_020561|1047135:1072457|1050484_1050739_-|WP_015457745.1|DBSCAN-SWA MSRPDPIQARYRAGMNAIAGALDQQFNGDARPRKIAFVLLVAEFGQIDGGRVNYISNADRADTISMMKEWIARAEGRYQEGGRA >NC_020561|1047135:1072457|1049390_1049744_-|WP_015457742.1|DBSCAN-SWA MDEIAKIAAGLSEAEVNALLAFSLTGSAGTIEAKPGDRFIWWDGTEWEIIGLTEGWDYSPAGLGGCPSVALRLVGGDADPITRRFMDGDISVWCGDSVASGLIRSRKDVHLMKDEGR >NC_020561|1047135:1072457|1051085_1051676_-|WP_015457746.1|DBSCAN-SWA MTRAPSDRALRFAYADPPYLGVAEKFYGHLHPDAADYDDPTAHEALIERMTAEFDGWAMSLSLPSLRTILPMCPPDARIGAWVKPFASFKKNVTRAWAWEPIILWGGRPIPVSQPTQRDWIEAPAIAESITLRRGFTGAKPAAVCRWIFDWLNMRPDDEFVDLFPGSGAVTEAYHDWLAGQTGVCQIGMFATEGAI >NC_020561|1047135:1072457|1070904_1071528_+|WP_015457778.1|DBSCAN-SWA MADNLTTPVADGSVLGTKDIGGVHLPKNVIVDQAGADAIGLVASDPAANSVLGRLKAIFDRLGDALSVTIASLPLPTGAATSAKQDDILTALAGPFPVTGDFYPETQPVSAASLPLPAGAATATKQDAATAAIEALAPQGPAFAITPHASDPLEREIGAISIVTGGDITYRYPGEGGDRTITLPAGFFPLRASHIRDSSTASGLTGF >NC_020561|1047135:1072457|1051672_1052824_-|WP_015457747.1|DBSCAN-SWA MRVLDLFSAAAGGWSLGMHRAGYRTMAACEVIPWRRALYSENNPGVKVYDDVRSLSAGRLLDDFGYLPAVIVGSPPCQDISSANTKGKGVEGERSGLYFEAVRLIGECRPRWFALENSANLRTRGADAVLSALEAIGYACWPFVVRAGDIGANHERPRSWLIGCDVGQLTAYANGHGEHAQSQHGEMAVVSGLGVDRHPAQDGCGSGLPGRCAGAATGAEQPACGDALHAASLGHEGGWPRRHGQDGHGAPPVAAGNDADTIGIGPKGQLGGDEQRQGEISERPDLPARPEMAICSPDRRGAVGSRADWSLDDLARHIRVDDGLSAWVAGTRVAVGGPKGTSAASLIVEAFGDAVVPQIPEAIGRAILRTEAALAAVYERIAA >NC_020561|1047135:1072457|1054986_1055175_-|WP_015457753.1|DBSCAN-SWA MTARRSTEPVEAASLSVVSSELASAYIRESEARRILTRAQTDYLAARAAREAAERREAQVAR >NC_020561|1047135:1072457|1058174_1058456_+|WP_015457758.1|DBSCAN-SWA MVDELNCSFCGKSNEEVAHLIAGTNALSVDACAKIVADGTSREAAAAATASNQRLECLRLAMSRDFIDPVAVAREFYKFVQSGTPAPKGAGDA >NC_020561|1047135:1072457|1062146_1062716_+|WP_041864800.1|DBSCAN-SWA MLADPLRRAAGSEFGYVPLVSADVAATIPAALAALKAGMAPSTEEQIEGLMGSIALLYPAAKVTEKEAEARLDLYIDLLQDIPFDILSAAFKSAAQTSRFFPTVAEIREAALPARRERLNKINGLKALALKHRLEGEQSRQAQAPMSAAEIEEANAIFQRLGIRTRYAADGTSYEIERGNTGDQEAKAA >NC_020561|1047135:1072457|1068664_1069261_+|WP_015457775.1|head,protease|DBSCAN-SWA MTVKTETRTLSRSPELRATGTGRTLTGYAAVFNSPADIGGAWIETIERGAFTRALEGDIVAIIGHDRNRVIGRTGAGTLRLSEDDKGLRFEIDLPDTTDGRDVAVSVERGDIGGMSFGFSVTRQQWDETTEPPQRTIQQVELYEITVTAFPAYPDTSVGLRSLDDARKEAAIAAREAKRSHIDRRLRMRAQLDIRSRA >NC_020561|1047135:1072457|1054448_1054625_-|WP_187294031.1|DBSCAN-SWA MALKPSDRWTLSLCQAHHRLQHEAGEESFEQITGLDMRAMAEEFTRRSPHRRELEAMP >NC_020561|1047135:1072457|1062763_1063375_+|WP_041864801.1|DBSCAN-SWA MAAARAAAILKGDRTVSVAPPPPAKRARRAPAQKPIDATPERISMAEPDGKNGVRFHDTVEAMIDKAGQRALITRRFADAQIDRWLKQKRLTYAQWYAADWYRNQYALAGIEGRVVAQYDITHPGAEGSSYGLPANERQLRARQRWRQGRALLPENMVDLVDRLVIHDVAPALSSGRMRDRYAARIGRALDPLADWLSAPASA >NC_020561|1047135:1072457|1061231_1061444_+|WP_015457764.1|DBSCAN-SWA MSAAREWVPHHGGPNPFHDPDQPDREEPDIMIRYRNGRVFGPIQPASRRWAAWTVRPGRSDWDIVAWRLA >NC_020561|1047135:1072457|1052823_1053177_-|WP_144061990.1|DBSCAN-SWA MSSVSHSRPPQPAPDRGGGHSCRPVSDDSAHQCFTVDSTSYLRNLYDEDLRGPCIAVHYQKPPTKTETGTSIGLRFPMLIMALYLEKPDEVAQRVADILNKHWDDFDQVPASREGER >NC_020561|1047135:1072457|1059432_1060062_+|WP_144061993.1|DBSCAN-SWA MTVMEGAAPTMVGSAGDDLRPAAWPFGGLGMFRYGAIIADPPWYFRNYSFAGETKNPVARYACMSTDEIAALPVSQLAAPDCALFMWATAPMLPDAIRLMKEWGFTFKSAGAWAKQSATGEKWAFGTGYCFRSASEFYLLGTIGKPKVLSRSIRNLIVAPVREHSRKPDDLHRDVEQLYAGPYAELFGREQHPGWDVWGNQTDKFGGAA >NC_020561|1047135:1072457|1065189_1065609_+|WP_015457772.1|DBSCAN-SWA MKPGRKAEAPSTKAARGTLRPFRDGLKNELVVPGDPPLMPDYLTAEAQDVWQEEIGRVMAAGVAEIDSSLFARYCSLEALVRQSFNAGREPPPASYLTVLRQYAELLGIAGRKSRVGKIADDPTKTRNPFARNGHRAKA >NC_020561|1047135:1072457|1063966_1064263_-|WP_015457769.1|DBSCAN-SWA MIEVRRTAEFADWLKGLKDRAAKARIEVRITRLALGNPGQHRNLTKGLREMKIDYATGYRVYFVERGNTVIVLLCGGDKKSQDRDIARALAMVEDLEE >NC_020561|1047135:1072457|1053173_1053650_-|WP_015457749.1|DBSCAN-SWA MVIEHGSLGGIHLDDLQAWARAKMQRRPDFVIGDDYLRRWWIVPRNEGCNVYLHEILHSDDNRALHDHPWDNTSYVIDGVYIEITPAGSFYRGAGSIVTRRATDAHRLVVPPGGRAVSLFMTGPKLREWGFHCPQGWRQWQDFVDARDAGQVGRGCGE >NC_020561|1047135:1072457|1061025_1061235_+|WP_041864799.1|DBSCAN-SWA MGPLVLGIACLLIGWIMGARWRAGRLPAAPSISPSAAGRALAARSREELARRREDKTRQLQLEIERSRR >NC_020561|1047135:1072457|1060061_1060757_+|WP_015457762.1|DBSCAN-SWA MRPVPDDFAALAATMSRAQMQAHYRVRSSTLSAWYVAAGLRPPVPSQQRPAPADFAEHGRRPSAELRERYGCSNELLARWRKQHGISMADGPTARPVPEDFAIRARSSTNRELAEHYGVGRALISRWRAKCGLSGGISTYRWKVPTPTQVGARDSSLAGRAADHLRLPRAGSWVVFRCDAAGTADPSGGHFRVGTRLMTEDEMIQFAERKGFAAFPADALGSSRQHGAALS >NC_020561|1047135:1072457|1061440_1062184_+|WP_144061994.1|DBSCAN-SWA MSAALLCRLIEAGTPAALVAEVAAALAQADAASLALRMRRQADAARQQKRRAEGKPKADADADADAESRDVTGHHVMSRDIPEDSVTSRDIAETPENVPPHPPKNISLREENPVPLKGDLPLFGSPEIDRVVSDWNAMAGRTGLKSIRTVTAERRGRILARLREHGPDSFTEAIAAIERSRFCRGQNDRRWRADFDFLLQPRSFVKLIEGSYEPANDRYAETDIANPMVRVAATRRARRSAEAGGGF >NC_020561|1047135:1072457|1064753_1065074_+|WP_041864802.1|DBSCAN-SWA MPSRPPGLYAKPKSKPWERQSAASAQRIRGRRGQQIRAAHLAGEPLCRECLKHGRVTEAVIVDHTLSLAEGGEDVPDNRQSLCKACSDAKTAQEAARGRRRAQRPA >NC_020561|1047135:1072457|1072124_1072457_+|WP_015457780.1|head|DBSCAN-SWA MNLAGRLRHRVSIWRLVDIDDGKGAYVRQWTQVAEVNAEVIGQGGREALIDRSMQGIGQYRVTIHWRDDIRTNDQVRYRGRNLNIRSIEPDATDRVWLSMMCDTDAEAIG >NC_020561|1047135:1072457|1069350_1070640_+|WP_015457776.1|capsid|DBSCAN-SWA MSKLKELREKRARLITEARSRLDEITANTDESRAAELEAQHDAAMAEIDKIEKQIEREERMAELEANAQRQNEEERERRRPTEPNNRQPAGDNGEGEPEYRHVFAKIMCGVELADLSTEERAVLRQGVTKFKDGEQRAQVTGTNTAGGYTVPTELAAEIIKSMKLWGPMYDEAICRVVNHSSGHPWAIPTVDDTASTAGAHTEGSALTDDGGKDVTFGQKVLGAYAFDTEFVRWSWELDMDSIFSMEALLGELLGERLGRIANTQLTSGNGTTAPNGIVTASTLGKTAAAAAAITSDEVLDLFHSVDPAYRTSPKARWMFNDATLLVLRKLKDGDGNYLWQMGDIKTGAPDTLWSKPYSINQAMDSVAASKKPIVFGDFGKYFVRKVGSPIIGVLRERFWPDMGIAGLIRFDGELGDTAAIKHLATPAS >NC_020561|1047135:1072457|1063682_1063964_-|WP_015457768.1|DBSCAN-SWA MAIKTERWDAAEFLNDEEDIRLYLEAAFEDGEPAIIRAALNDVARARGMTALAKETGLSREALYKALGENGNPTLDTLVKLTRALGVRLSIAA >NC_020561|1047135:1072457|1053643_1053784_-|WP_187294030.1|DBSCAN-SWA MKLLSTLRRWLRSLITLPGCSTCHELRRSSGNRWAECDTCWDERQW >NC_020561|1047135:1072457|1048834_1049197_-|WP_144062106.1|DBSCAN-SWA MLALAAAVALTGCVATDGDTVRCGDQRIRLLGIDAPEMPGHCRRGRICAPGDPITSKASLAAALTLGPIRFEQVTTDRYGRAVGIVWAGNVNTSCWQLRRGQAVYRKDWDHRGVIGRCVA >NC_020561|1047135:1072457|1055171_1056080_-|WP_015457754.1|DBSCAN-SWA MRYEGHYKGDLSPRSSDDISGLKSVSGSLDLRGTSITALPEGLSVGGWLDLSGTSITALPEGLSVGGWLYLSDTSITALPEGLSVGGWLDLSGTSITALPEGLSVGGWLDLSGTSITALPEGLSIGGSLDLSGTSITALPEGLSVGGSLDLRGTSITALPEGLSVGGSLDLRGTSITALPEGLSVGGSLDLSGTSITAWGNLTVRGRPVAAKSDADARLREVAKAALAEPDALVMDQWHCGTAHCIAGWAVHLEGSDGYRLEKDTDTETAGLLLLGPAAAGKFYASEEGARKYLASVLEAAR >NC_020561|1047135:1072457|1058457_1058649_-|WP_015457759.1|DBSCAN-SWA MSDKVQVTCDHPDGGSPAQVAWEMAKFLRNLLPEAESKEDRVDAYLDLYARCYHAASGRRLPK >NC_020561|1047135:1072457|1050693_1051089_-|WP_144061989.1|DBSCAN-SWA MTKSDRAPVTQADREAAAAWAKLNGRNHQAANIRRGSCDDAPIVQAFARHRIAALEAAAEVARDRHEQWRMPHPGDARPGEVCDDISACRDIAAAILSLKGQSDDQDRGSGHVGGEACHGLTPSRRAIGPA >NC_020561|1047135:1072457|1071531_1072128_+|WP_015457779.1|DBSCAN-SWA MLAHLIVTGIVPTGGDAPPADDGEPVSLVEARRQCRFLDNDTTHDATLDHLRKAARSYVEAYTGLSLIERTAQLRFTRWADIARLPVAPVSAATIQYIGADNITATLDPVDYDLFVDGYESGVDLLAPAPSLGRGRAPITVQVSTGFTPSSVPLHVKQAILLLIAWWFDNPTAIVTGPALGEPPHAVAALLCNDRLFA >NC_020561|1047135:1072457|1067442_1068708_+|WP_015457774.1|portal|DBSCAN-SWA MKLFGWEIGRAETRSASIENPRVKADAEGIIRYFGGGDALDIGPVSVEQAMQVPAVFAAVNFLTRTLAALPLHAWRKVGDQPERIKGGIETLVHEAPNTEWTSFGLRQYFWNQVFTRGRGLLWIERGLNGQPVALRPINVTRTAVEMVGTGRVYSLDGRRYASSEVIDLPFALKDDQVGVVGPIARCAGAIRLALNMQVYGSKFFAGGGVPPLALTGPLPTGAEAQRRAMADVQRAIDTARATDKPVFPIPPGYDLKPVAVDPDKGQMTEARRFQLEEIARVFQLPPVFLQDLSKGTFSNTEQQDLFFVKHLIGQWAEALEQEMNLKLFGQMNGRRYVEHNLDGLQRGDFKSRIEGLARAINTAQLTPDEARELEGRPSKPHGDKLYLQGATVPLGSQPLKPATGEPNDGEDRNPDAEPQS >NC_020561|1047135:1072457|1054783_1054987_-|WP_015457752.1|DBSCAN-SWA MAGIHPHAIHSGSAVRREAERRAAKMPITRGAWHWLFAPRPALPGEWAAFAVMNAITIAIVAWAVLG >NC_020561|1047135:1072457|1065550_1067446_+|WP_084673621.1|terminase|DBSCAN-SWA MIRRKPETRSRATATARRRKRPAFPDSGHARNYASIALEYARAAAADTDQVRHCKWVRLAAQRHLDDLERAKSNVWGYRFDPWHANDICDFIENLPHIEGNWCKCPRADDGMHLDRCGKIDLEPPQIFILTTVFGWRRKDNGLRRFTVVYEEVARKNAKSTKTAGVSLYCLCCENETGPQVLTAATTFDQAKKVFHPAKRMVEKTPELQEAFGVTAWAKSITCGDNGGYMQPLHAKSKTQDGHNPHLVTMDELHAHADRGLYDVMRSAFGARKQPLLWQITTAGSNVHGVCYEQRTMATKVLERSVIAEHIFGIIFTLDGPKDFTPERKVGDDPYDERNWIKANPLLGSAVQLDELRQYAIEAKNSPSAEGEFFTKRLNKWIGAASAWLNVSQWIACSDPSLKLSDFRGLDCYIGADLADKDDITAVALAALHPDGRLLLKTWFFLPEAALARDDPASKQIVELYRQWKDGRWLWTTPGNFVDHNRVERLIRRLQKVLSVKRITFDQFAAAQAMASRLNEDLGDGDGELAAILSKNAANVTDPAKDLEARVKGGPHLLCHDGNPVMTWMAGNAVVDRRVNNTILPKKETPMSQNKIDGIDAAINAMAPMQLPPPAPVVSPWDDPDFSLVAA >NC_020561|1047135:1072457|1057044_1057767_-|WP_187294032.1|DBSCAN-SWA MVYCNASVYSVLDVTLPAPSGQYMPMRYSLQLKKLRNLRKMTQSALAEAVGVEQPTIQRWESGARTPDMADILRLAAALGVEPGELFAEGPLVALGPRLYVKGVVAAGVWKEVWEYEPDEWEVFTGRADVAAPVRDRFGLRVEGDSMDIVYPHGTIVECVAYWGEAAIPNGKRVVVQRRRVSQDFETTVKEYRIDGDGREWLIPRSNNPAFQRPIEIGHDEDDIEEVRIIALVVGSYRPE >NC_020561|1047135:1072457|1047135_1048584_+|WP_015457739.1|integrase|DBSCAN-SWA MLTDAKVKAAKPKDKPYKLGDSGQLYLYVSPAGGRHWRMNYVAPSTQKQKTLSFGSYPTMTLAEARAARDAAKKILSTGRDPAIERRVARKEQAQSDANTFESVAEKWFELNSGWSLEKLREYRAANSDKWSWKAARHWTKKPAPWSAVHSADVLKSLESDVFPSIGSLPIRSLEARAPLLLEVLQEVEARGAIETAHRLRQRISGVFVYGIAAGLCGADPAASLGKALAKKPRSKPQPSVIDGIQKQEDRLRAIKDMLAKCEAERCRASTKLALRFLAFTAVRSNELRFASWAEFEGIDWGNPHAPAPEALWRIPAARMKGDDERKAEEFGDHLVPLAPQAVAVLRVMWLLSAGLPYVFPGERHLHKPISENTLRALLIRAGYYQRHVPHGFRAAFSTYMNDRPLSERKDGDREVIDLMLAHVPEGKSGSETAYNRAGYMDRRRELACEYADLIGADLCDPAEHLGKPIRYAATGPGRLAA >NC_020561|1047135:1072457|1064446_1064650_+|WP_015457770.1|DBSCAN-SWA MAMVSVTNDLTVNTAYVASVSWDRGDTYTTLVITMADGTAHRVKHQGGPYGVDAYDAERRLLAAAER |
45 | Rhizobium_phage(30.77%) | head,portal,integrase,capsid,protease,terminase | attL 1043612:1043628|attR 1061164:1061180 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1874484 : 1924690
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_020561|1874484:1924690|DBSCAN-SWA GATGGCGGGCAAGGACGAGGAAAACGGCACCGGCCTCGGCGTTGCCACGCGCACCCGCACGCGCACCAAGAAGCCCCAGCCCTACAAGGTGCTGATGCTGAACGACGATTATACGCCGATGGAATTCGTCGTCCTCTGCCTGCAGCGCTTCTTCCGCATGAGCCTGGAGGACGCCACCCGCGTGATGCTCCACGTCCACCAGAAGGGTGTCGGCGTGTGCGGGGTGTTCAGCTATGAAGTCGCGGAAACCAAGGTCAGCCAGGTGATCGATTTCGCCCGGCAGAACCAGCATCCGCTGCAATGCACGCTGGAAAAGGCCTGATCGCGGCGGATCGGCAGCGACAAAACGCCCTCCTCCTCCCAGGCATAGCCCGTCGCCCCGGGCGACAAGCTCCCACCCGTCATTCCCGCGAAAGCGGGAGCGCAGGCCAGCGATGCGGTTGGTTCGATAGCTCCCGACACAATGGATTCCCGCCTGCGCGGGAATGACAAGGTGTGGGGGCGAAGGCGGGGAACCCGATACTGTCGGAGGCTGAGGACGGACGACGACAGGAAATGACGACAGGAAATACGGCCCGAGGCGCTCAGTCGTCGTTCTTCACCATCTCGGCGCCCGTCGCCATCATCGCCTTGGCGGCTTCGGCGGCGGCGTAGATGAAGCCGTTGGTGGGATAGGCGGTCATGCCCAGGCCGCGGCTCGGCCCCCACCAGACGCGCACCTTGCTCCAGTCGCCGGCCTCGGACACGTCGACGGCGGTGACATCGCGTTCCACCTGGCCGCGGCGGCCGTTGATGATCGACCAGTTGGCGTGGGTGAGCTTCACGGTGCGGGCATCGACCACCTCGCTCACGGTCGCGACGTGACCGGCGGGCATCGCCCGGAAGGCGCGGAAGGACAGCACCGATCCCACCTTGGGGGTCTGGCCGCGCTGATATTTGCCGGCGGCCTGCGCCCACCAGTCGGCCGCGCGGCCGAAAAGCTGGATGCCGGAGATCATCCGCGCGAAGGGCGCGCACTGCCAATATTGGCTCTGGGCGGCCGCCGGCGTCGCGGTCAGCGTCATCGACGCCAGCAGCATCGCGGCCATTCCGGCCCAGAAGGATGTGCGCATCATTTAAGTCCCCGCGTGTCCCGTCGCCTTGCTAGGCGGTCTTTGTGCGTCGCGGAAAGACCAGAAAGTTAACAAAACCGGCCTTCTGCGGCGGGCCGCGCTCACCCGGCGACGAGTTGAGCACGGCCCGGCCGATCAATCCTCCTGCTTGAACACCTCGCCGCCCTTCATCACGAAGCCGACCGTCTTCAGCACCTTCACGTCGGCCAGCGGATCGCCCGCCACCGCGATCAGGTCGGCCGCCTTGCCCGGCTCGATCGATCCCACCTCGTCCTTGAGGCCGAGCAGGTCGGCCGCGTTGACCGTCGCCGCCTGGATCGCGCCCATCGGGGTCAGCCCGTGCTTCACCAGCCATTCGAACTCGTCGGCGTTGCGGCCGTGCTTGGAGACGCCGGCATCGGTGCCGAAGGCGATCTTCACGCCGGCCGGCACCGCTTTTTCCAGCGCCTTGCCGGTGATCGAGATCCGCCAGTCGATCTTGGCGCGGACGGCGGGCTCATAGGCGTTGGGATCGGCGGCGATCCGTTCCAGATAGCCGTTCACGGTGGAGAGCGTCGGCACGTAATAAGCGCCCGTCTTGCGGAAGGCGGCGATGTCCGCATCGTCCATCAGCGTGCCATGCTCGATCGAATCGGCGCCGGCGGCGAGCGCCAGCGCGATCCCGTCCGCGCCATGGGCATGCACGGCCACCTTCTTGCCATACATGTGCGCCGTCTCGACGATCGCCTTCGCCTCGTCGTCGAACATCTGCTTGCCCAGCCCCGCGCCGATGCGGCTGTTGACGCCGCCGGTGGTGGCGATCTTGATGACGTCCGCCCCCCGGCCGATCTGGATGCGCACCGCGTGCCGGCAGGATTCGACGCCGTCGCACAGATTCTCGGGCGATCCGGCATGTTCGCGCAGGTCGTCGTTCAGGCCCAGCCGCTCGTCCATGTGGCCTGACGTGGTGGAGATCGACATGCCAGCATCGACGATGCGCGGGCCGACCGCCCAGCCCTGCCGGATCGCATCGCGCAGCGCCAGCGTCACGCCCGAGCCGTCGCCCAGGTTGCGGACGGTGGTGAAGCCGGCGTTCAGCGTCTTGCGGGCGTTCCAGGCGGCTTCATAGGCATGCATCGGCACGCCTTCGGTGAAGCTCGCCAAAAGCCCGTCCTTGCCCGCGCGATCGGATTCGAGGTGGACGTGGGAGTCGATCAGCCCAGGCAGGACATAGCGATCCTTGAGGTCGATCAGCCGCGCGCCGGCGGGTGCCGCGACGAAGCCGTCGCGCACCTCGGCCACCTTGCCGGCACGCACGACGATGGTGGCGTTGCGACGCGGCGCCTGGCCCGGACGATCGAGCAGGGTGCCGGCATGGATCACGATATCGCCGGCCTGCGGATCGGCGAAGGCGGGCGCGGCCACGAGCATCGCGGTGGAAACGGCCAACAGCAAACGCATGAAATCATCCCCCTTTTCGGTTATATCGCCTTGCTTATCGCCTCTCCGCGCCCAACGCCATGGCGTGCGGGAGGCGACGCGCGGATTGAAAAGCGCCCCGCCCCCCGGCTATGGCGCGCGCCGATCCCTTCAACGAAAGCCCCTGCCATGTCCGATACGCCCCCCGATCGCCTGTCGACCAATCCGAAGAGCCCCTATTTCGATGCCGATGCGCTCCAGCGCGGCGTGGGCATCCGCTTCAAGGGCGCGGAAAAGACCAATGTCGAGGAATATTGCGTGTCCGAAGGCTGGGTGCGCATGGCCGTGGGCAATTCGCGCGACCGCCACGGCAATCCGATGACGATCAAGGCCAGCGGCCCGGTCGAGCCCTATTTCCGCGACGTGACGGAAGGCGGCGAATCCGCCGAGGCAAGCCCTTCGGACAACTGATCGACTCGTCTTTGAATCGCTTCGGCGCATCGCAGGCCCTGCGATGTTGCATCGCAACAGAAAAAATGTTCCCCTTCGATCCGCTTCGCGGGTCCACGGTTAAGCTCCCGGCAATCACCGGGGCGCAGGATCATGGGCCTGTAGCGTAGTGGATCGAGGGGGTTGGCATATGGGCACCCAGGCGAGACCGGCAGACGGGGCGATGGCCCTGGCCGAGATGAAGGAGTTTGCGGGCTTTCCCGCCGGCACCCAGCGCTATATCCGCCGATCGCTCGATATCGGCCTCGATCGCGAGAATGCGATGGTCCGCTGGTCGCGCGACATGGTGGAAGCCGCCAGCATCCGCGCCCAGATACGCATCTATGAGCGGCTCGACACGATCCGCGCGCTGGTGCCTGACGATAGCGGCCTCGACGCGGTCGAGCCCTTCTTCGCGCCGCTGGTGATCGTATCGGCCTTCGATCTCGGCCAGGATCGGCTGCCGGCCTTCTCCGCCTACCGCTTCCTCTACGAACGGCTGATCGGGGCGCATGTCCGCCCCTGGCTGCCGGGCGCCTTCTGCGCCGCCGCCGCCCTGCCCCATCTCCACCCCGAAAAACGCCGGCTGCTGCTCCAGTCGATCAGCGAGGCGGCCGCCACCGCGCCCGGCTGGTCCAGCCGCGAACCCTGCTTCTTCCCGGAATGGGTGGAAAAGGTGGAAAGCCTGGCCTGATCGGGCCGGGCGCATTCCCGGAAAAGAGAAGGGCGGCCCCGTCGGAGCCGCCCTTTCCTGTTTCCGAAGAACCGGAACCAGCCGAATCAGCGCATCGTCTGCGCCAGTTCCTTGTTGATCCACAGCGCCAGCAGCGTGCACACCGCGCCGGACAGCAGATAGGCGCCGGCCGCCAGCAGGCCGAAATTGCTGGCGAGCAGCAACGCCACCAGCGGCGCGAAGCCCGCGCCCACCAGCCAGGCGAGGTCCGACGTCAGCGCCGATCCGGTATAGCGGTTGGCGGGCGAGAAGCTGGAGGAGACCGCGCCGGAGGACTGGCCGAAGGCGAGGCCCAGCAGGATGAAGCCCAGCACCATGAAGGCCAGCTCGCCCAGCTCGCCGCCGTTCAGCAGCTGCGGGGCGAAGCCGCTATAGGCGGCGATGGCGGCGGCGGAGAGCGCCAGCAGCGTGCGGCGGCCGACGCGATCGGCGATCAGCCCCGAAGCGATGATCGCCACCACGCCGAAGCCGGCGGCGATCGCCTCGATCACCAGGAAGCGATCGGGCGCGTTCTGGGTGTAGAGGAACACCCAGGAGAGCGGGAACACCGTGACCATGTGGAACATGGCGAAGCTCGCCAGCGGGGCGAAGGCGCCGATGATGATCGTGCGGCCTTCCTCGCGCACCGTCTGCATCACCGGCGCGGGCTCCAGATCGCGATTCTCGTAGAGGCGGGCATATTCCGGCGTCACCACGATGCGCAGCCGGGCGAACAGCGCCACCACGTTGATCGCGAAGGCGACGAAGAAGGGATAGCGCCAACCCCAGGCGAGGAAATCCTCCGCCGGCAGCACATAGACGAAGAAGGCGAACAGCAGGCTGGCGACGATCAGGCCCAGCGGCGCGCCGAGCTGCGGGATCATCGCATACCAGCCCCGCCGGTTCTGCGGCGCGTTGAGCGCGAGCAGCGAGGGCAGGCCATCCCAGGCACCGCCCAGCGCCAGCCCCTGGCCGAAGCGGAACAAGGTGAGCAGCAGGGCGGCGGTGGCGCCCATCTGCTGGTAGCTGGGCAGGAAGGCGACCGCCGCGGTCGACCCGCCGAGCAGGAACAGGGCGATGGTGAGCTTGACGCCCCGGCCGTAGGCGCGGTCGATCGTCATGAACAAAAGCGATCCCAGCGGCCGGGCGATGAAGGCCACCGCGAAGATCGCGAAGGAATAGAGCGTGCCCGTCAGCGGATCGACATAGGGGAAGACCAGCTTCGGGAACACCAGCACCGAGGCGATGGCATAGACGAAGAAATCGAAGAATTCCGACGTCCGGCCGATGATGACGCCAATCGCGATCTCACCCGGCGAGACATGGCTGTGCCGCGAGTTGATTTCGCTGAGATCGCGCTCCGCGGTTCTGGAGCCAACCATCTCTGCACTCATCAAACACTCCCAAACCAAGCGCGCGCCTGATCGGCGTTGGGCCGTCCGCGTGGACCGGCCGCCGCGTTTCTGCCAAGCTGGCGTTTCGGCCAAGCACGAATCGCCCGCGACGGAAGCGCCGCAGACTTTACGCCTCTTTGCTTCCGTTCGCGAACAAGGGGATTGGACAAAATGTCCTATGTCGTTTGCCGCGTCGCAGCACTAGGCGCCCGGACCATGAACCAGCCAGCCCTTTTCCCGGCGCGGGCCATGCGCGCCGCGCGTTCCGGTGCCCTGATCGGCGCCGTCGTCGCCCTTGGCGCCCTTCTCGGCGGTTGCAACGCCGTGGTGCTCGATCCCGCCGGCGACGTCGCGAAGCAGCAGGCCAACCTGGTCGTCATCTCGACGGTGCTGATGCTGCTGATCATCGTGCCGGTGATGGCGCTCACCGTGCTGTTCGCCTGGCGCTACCGCCAGTCCAACAAGGAAGCGCGCTACGAGCCGGAATGGGATCATTCCACCCAGCTCGAGCTGCTGATCTGGGCGGCGCCGCTGCTGATCATCATCTGCCTGGGCGCGATCACCTGGACGAGCACCCATCTGCTCGATCCCTATCGCCCGGTCTCGCGCGTGGCGAAGGACCAGCCCGTGCCCGCCGACGCCCGGCCGCTGGAGGTCGAGGTGGTGGCGCTCGACTGGAAATGGCTGTTCATCTACCCCGAATATGGCATCGCCACGGTGAACGAGCTCGCCGCGCCGGTCGATCGCCCGATCAATTTCCGCATCACCTCCTCCTCGGTGATGAATTCCTTCTACATCCCCGCGCTGGCCGGCCAGATCTATGCGATGCCGGGGATGGAGACGAAGCTGCACGCCGTGCTCAACAAGGCCGGCAATTATGAAGGCTTCTCCGCCAATTACAGCGGCGCGGGCTTCTCGGGCATGCGCTTCCGCTTCCACGGCCTCGACGATGCCGGCTTCGCCGCCTGGGTCGCCAAGGCGCGCGCGGAGAAGGACCGGCTCGACCGCAAGGCCTATCTGGAGCTTGAGCGGCCGAGCGAGAACGTGCCCGTCCGCCGCTTCGCCGCGGTCGACGCCGATCTCTACGACGCGGTCGTCAACATGTGCGTCGAGCCCGGCAAGATGTGCATGAGCGAGATGATGGCGATCGACGCCAAGGGCGGCCTCGGCAAGGAAGGCATCCGCAACGTCCGCCAGCTCACCTACGACAAGCATGCCCGCCGCGGCGCGGTGCTGGCGCCCCAGCCGATGATGGTGGGCGCGGTCTGCGCCCCGCCGCAGCCGGTGAAGCAGGCGGCGGCCGAAAGGCCGGCCCCCACCGGCGCGCCGCTCACCGGCGCCGGCCTGCCGCGCCCCGGCGTGATCGGCGGGCCGCCACGCGCCTCCGCCGACGCGGTCGTCCCCCGCGCGAACAATAGCTGATTGCCCCGGAAATCCTGATGACGGTCGAGTCCACCATGGAAAACCAATATGCCGCGCTCAGCCCGATCTTCGGTCGCCTGTCGCTGGAATCGCTGCCGCTGCACGAACCGATCCTGGTCGCGACCTTCGCGGCCGTGGCGCTGGGCGGCATAGCGCTGGTCGGCGCGCTCACCTATTTCCGCCTCTGGGGCTATCTCTGGAAGGAATGGTTCACCACCGTCGATCACAAGCGGATCGGCATCATGTACATGATCCTGGGCCTGGTGATGCTGCTGCGCGGCTTCGCCGATGCCGTCATGATGCGCCTCCAGCAGGCGATGGCGTTCAACGGGTCCGAGGGCTATCTGACCGCGCACCATTATGATCAGGTGTTCACCGCCCATGGCGTGATCATGATCTTCTTCGTGGCGATGCCCTTCGTCACCGGCCTGATGAACTATGTGGTGCCGCTGCAGATCGGCGCGCGCGACGTTTCCTTCCCCTATCTCAACAATTTCAGCTTCTGGATGACGACGGCGGGTGCCGTGCTGGTGATGTTCTCGCTGTTCATCGGCGAATTCGCGCGCACCGGCTGGCTGGCTTATCCGCCGCTGTCCAACATCGGCTACAGCCCGGATGTCGGGGTCGATTACTATATCTGGGCGCTGCAGATAGCGGGCGTCGGCACATTGCTGTCCGGCGTCAACCTGGTGGCGACGATCGTGAAGATGCGCGCGCCGGGCATGTCGATGATGAAGATGCCGGTCTTCACCTGGACGGCGCTGTGCACCAACGTGCTGATCGTGGCGGCCTTCCCGGTGCTGACCGCGGTGATGGCGCTGCTCTCGCTCGATCGCTATGTGGGCACCAACTTCTTCACGAACGATTTCGGCGGCAGCCCGATGATGTACGTGAACCTGATCTGGATCTGGGGCCACCCGGAGGTCTACATCCTGATCCTGCCGCTGTTCGGCGTCTTTTCCGAAGTCACCTCCACCTTCACCGGCAAGCGGCTGTTCGGCTATACGTCGATGGTCTACGCCACGGTGGTCATCACCATCCTGTCCTATATCGTGTGGCTGCACCACTTCTTCACGATGGGATCGGGCGCCAGCGTCAACAGCTTCTTCGGCATCACCACCATGGTGATCTCGATCCCCACGGGGGCCAAGCTCTTCAACTGGCTGTTCACCATGTATCGCGGCCGCATCCGCTTCGAACTGCCGATGATGTGGACGGTGGCGTTCATGCTGACCTTCGTGGTCGGCGGCATGACCGGCGTTCTCCTTGCCGTGCCGCCCGCGGATTTCGTGCTCCACAACTCGCTGTTTCTGATCGCGCACTTCCACAACGTGATCATCGGCGGCGTGCTGTTCGGCCTGTTCGCGGCGATCAACTACTGGTTCCCCAAGGCCTTCGGCTTCCGGCTCGACCCCTTCTGGGGCAAGGTCTCCTTCTGGGCCTGGGTGGTGGGCTTCTGGCTGGCCTTCATGCCGCTCTACGTGCTCGGCCTGATGGGGGTGACGCGGCGGATGCGCGTGTTCGACGATCCCTCGCTCCAGATCTGGTTCGTGATCGCCGCCTTCGGCGCCGCCCTGATCGCGATCGGCATCGCCGCCATGCTGGTGCAGTTCGCCGTCAGCTACCTGAAGCGCGACCAGCTTCGCGACACGACGGGCGATCCGTGGAACGGCCGCACGCTGGAATGGTCCACCTCCTCGCCGCCGCCGGACTACAACTTCGCCTTCACGCCGGTGATCCACGATCTCGATGCCTGGTATGACATGAAGAGCCGCGGCTATGTGCGCCCCACCGGCGGCTACCGGCCGATCCACATGCCGAAGAACACCGGCACCGGCGTGATCCTGGCCGCGCTCAGCCTCGCCTGCGGCTTCGGGCTGGTCTGGTATATCTGGTGGCTGGCGGCGCTGAGCTTCGCGGGCGTGCTCGCGGTCGCGATCGGCCACAGCTTCAACTACAAGCGCGATTTCTACATTCCTGCGGAAACCGTGGAAAAGACGGAAGACGAGCGGACGCGCCTGCTCGCGGCGGGAGCATAAGATAATGGCCGACACCAGCCTCACCATGACCCAGTCGGGCGCGCCCCGCTTCCACCTGGAGGAGGAGCATCACCACGCCGAAGGCGGCAGCACGATGCTCGGCTTCTGGATCTACCTGATGAGCGACTGCCTCATCTTCGCGATCCTGTTCGCCTGCTACGGCGTGCTCGGCGGCAATTACGCCGCCGGCCCTTCGCCGCGCGACCTGTTCGATCTGCCGCTCGTCGCGCTCAACACGACGATGCTGCTCTTCTCCTCGATCACCTACGGCTTCGCCATGCTGGCGATGGAGAAGGGCGCGATCGGGCGGACGCAGGGCTGGCTGGCCATCACCGGCCTGTTCGGCGCGGCCTTCCTCGGCATCGAGCTCTACGAGTTCGCGCACCTGATCCACGAGGGCGCCACGCCCCAGCGCAGCGCCTTCCTCTCCTCCTTCTTCACGCTGGTCGGCACCCACGGCCTCCACGTCACCTTCGGCATCGTCTGGCTGGTGACGCTGATGGTGCAGGTGGCGCGGCGCGGGCTGATCCCGGCCAACCGCCGGCGGCTGATGTGCCTCAGCCTGTTCTGGCACTTCCTCGACGTCATCTGGATCGGCGTCTTCACCTTTGTCTATCTGATGGGAATGCTGCGATGAGCACGGACGCCCACGGCGATCATGCCGGCCACCACGAGGATCATGGCCATGGCGACGCGCACGGCCACGGCACGCTGCGCGATTACGTAACCGGCTTCATCCTCGCCGCGATCCTGACGGCCATTCCCTTCTGGCTGGTGATGACGGACGCGCTGGGCGACAACCAGCTCACCGCGCTCGTCATCATGGGCTTCGCGGTGGCGCAGGTGGTGGTCCACATGATCTACTTCCTCCACATGAACGCGCGGTCCGAAGGCGGATGGACGATCATGGCGCTGATCTTTACTATCGTCCTCGTCGTCATCGCACTGACGGGATCGCTGTGGGTGATGTACCACCTCAACACCAACATGATGCCCATGTCGCCGCACGACATGAGCCAGATGCCTTGACCGCCGCCGGCGCGGGAGAGGACATAGCAAAGCACAAGAACGGGAAATCGGGCCGGTCCGCCGCGTCGCTCGCGCTCATCGGCCTCCTCGTCCTGCTGGGGGTCGCGGGCCTTGCGGGGCTGGGCCTCTGGCAGGTGCAGCGGCTCGCCTGGAAGGAGGCGCTGATCGCCCGGGTCGACGCCCGCGTCCACGCCGCGCCGGTGCCCGCCCCCGGCCCCGCCGCCTGGGCCGGCATTTCAGCCGCCGGCGACGAATATCGCCGCGTCGCCCTCCACGGCCGCTTCCGCCACGATCGCGAGACGCTGGTGCAGGCCGTCACCGATCTCGGCAGCGGCTATTGGGTGCTCACCCCGCTTGCCGACGACCGCGGCTTCACCGTCCTCGTCAATCGCGGCTTCGTGCCGCCCGACCGGCGCGATCCCGCCACCCGCGCCGCCGGCAACCCGGCCGGGCCGGCCAGCGTCACCGGCCTCCTCCGCATCACCGAGCCCAGGGGCGGCTTCCTGCGCGGCAACGATCCCGCCGCCGATCGCTGGCATTCGCGCGATGTCGCCGCCATCGCCGCCGCCCGCCATCTGCGGCCAGGCCCCATCGCCCCCTATTTCATCGATGCCGATGCCGCGCCCAATGCCGGCGGCTATCCCGTGGGCGGGCTCACCGTCGTCCGCTTCCCCAACAACCATCTCCAATATGCGATCACCTGGTTCGTGATGGCGCTGCTGCTGGCGGGCGCCGGCGCGGTCGTCGCCCGGCAGGAGATGCGCGCGCGGCGGGCCGCATGAACGCCGTCTGGGACAAGGCGCGCATGGCGATGGCCCCGGTGGAAGGGCCGGCGTCGATCGGCGTCAACAACATGCTGCTGCTGCTCCAGCTCCGCTGGATCGCGGTGCTCGGCCAGCTCGGCACCATCGCCGTCGTCCATGGCGTGATGGGCATTCCGCTGCCGCTGGGGCCGTTGCTGCTGATGCCGGCGCTGCTGATCGCCATCAACCTCCTGAGCCGCCCCTTCCTGCGCGAGCGCAAGCACGTCACCAATGGCGAGCTGTTCAGCGCCTTCCTCGTCGATGTCGCGGCGCTGACCTGGCAGCTCCACCAGACGGGCGGCCTCACCAATCCCTTCGCCTGGCTGTTCCTGCTCCAGGTCGTGCTGGGCGCGATCCTGCTCAAGCCCTGGTCGAGCTGGGCGATCGTCGGCGTGACCACGCTCTGCCTCGCCTGGCTGATGGGCCATTATCGCCCGCTGGCCCTGCCGCCCGGCCATCAGGACGACCTGTTCGGCCTCTATCTGCAGGGCGGGCTGGTCTGCTTCGGCCTGATCGCGATCCTGCTGGTGGTGTTCGTCACCCAGATCAGCCGCAACCTGCGCGAACGCGATGCCTCGCTCGCCGATGTCCGCCAGCAGGCGGCGGAGGAGAATCACATCGTCCGCATGGGCCTGCTCGCTTCCGGCGCCGCGCACGAGCTCGGCACGCCGCTCTCCTCCCTGTCGGTGATCCTGGGGGACTGGCAGCGCATGCCGGAGCTCGCCGGCAATCCCGATCTCGCGCAGGACATCGCCGACATGCAGGCCGAGGTGCAGCGCTGCAAGGCGATCGTCAGCGGCATATTGATGTCGGCGGGCGAGGCGCGCGGCGTGGCGCCCGCCGTCACCACCATGCGCCGCTTCCTCGACGATATCGTCGCGGACTGGCGATCGAGCCGGCTGGCCGGCACCGTCGACTATGACGATCGCTTCGGCGCGGACGTGCCGATCGTCTCCGATCCCGCGCTGAAGCAGGTGATCGGCAACGTCATCGACAATGCGGCCGAAGTCTCGCCCCACTGGATCGGCATCGTCGCCCGGCGCGAGCAGGATCTGCTGGTGCTTTCGGTCAGCGATCGCGGCCCCGGCTTCAGCCCGGACATGCTGAACGGCTTCGGCCAGCCCTATCGGTCCAGCAAGGGGCGGCCGGGCGGCGGGCTCGGCCTGTTCCTGCTCGTCAATGTGCTGCGCAAGCTCGGCGGGCGGGCGGAGGCGCAGAACCGCCCCGGCGGCGGCGCCACCGTCACCATCATGCTGCCCTTGTCGGCCATCGCCTATGCGCCGAAGGAATCCTCTCGGGAGATCGCGCCATGACCGGCCCGGAAAGACTGCTGGTGATCGTGGAGGACGATGCCGCCTTCGCCCGCACGCTCCGCCGCTCCTTCGAACGGCGCGGCTATGCCGTGCTGTCCGCCGCCAGCCATGACGCGCTGGTGGCCCTGCTCGCCGATCACGATCCCGGCTATGCCGTGGTCGATCTCAAGCTTGGCGGCGCTTCCGGCCTGGCCTGCGTGCAGGCGCTCCACGCGCATGATCCGGATATGCGGATCGTCGTCCTCACCGGCTTCGCCAGCATCGCGACGGCGGTGGAGGCGATCAAGCTCGGCGCCTCGCACTATCTCGCCAAGCCTTCCAACACTGACGATATCGAGGCCGCCTTCGATCGTGCCGACGGCGACGCCGCAACCCCTGTCGACGGCCGCCAGACATCGATCAAGACGCTGGAGTGGGAACATATCCACCAGACGCTGGTGGATACCGACTTCAACATATCGGAAGCCGCCCGCCGGCTCGGCATGCATCGCCGCACGCTGGCGCGGAAGCTGGAGAAGCGGCAATTGCGCTGAAGACGTGCCGGAAAGCGGCCGCCTTCGTATCCGAAAGATGTTTCAGGCTTGTTGCGATGGATTCGCATCGTTGCGGCAAAGCATCATGATGGAAAAATCGGACGAGCCGAAGTGAGATTGGGCCGGAAAGTGGCTCGACAATCGCTATTTAATATGATGGCATCAGCCTATGTCTGATCGCCCTCGGACCCCACTTCTCGATCTGGTGCGCACACCGGATGAACTGCGGCGATTGAGCCCCGATCAACTTGCCACCCTTGCGTCCGAGTTGCGTGCGGAGATGATATCGGCGGTGGGTGTGACGGGGGGGCATTTGGGGTCTGGCCTGGGTGTGGTGGAGCTGACGGTGGCGCTCCATTATGTGTTCGACACGCCCAGGGACGTGCTGATCTGGGACGTGGGCCACCAGGCCTATCCGCACAAGATATTGACCGGCAGGCGTGATCGCATCCGCACCTTGCGGCAGGGCGGCGGGCTTTCGGGTTTCACCAGGCGGTCCGAGAGCGAATATGATCCGTTCGGGGCGGCGCACAGCTCGACCTCGATCTCGGCGGCGCTCGGCTTTGCGGTGGCGAACAAGCTGGCGGGCGCGGCCGGCAAGGCGATCGCGGTGATCGGCGACGGGGCGATGTCGGCGGGCATGGCCTATGAGGCGATGAACAACGCCGCCCAGGCCGGCAACCGGCTGGTGGTGATCCTGAACGACAATGACATGTCGATCGCGCCGCCGGTGGGGGGGCTGTCCGCCTATCTCGCCCGGATCGTCTCGTCGCGGCCGTTCCTGTCCCTGCGCGATCTCGCCAAGAAGGTGGCGCGCCGGCTGCCGCGCCCGCTGCACGACGTCGCCAGCAAGACCGACCAGTTCGCCCGCGGCATGACGATGGGCGGCACGCTGTTCGAGGAGCTGGGCTTTTATTATGTCGGGCCGATCGACGGCCATAATCTCGATCACCTGATCCCGATCCTGGAGAATGTCCGCGACGCCTCCGAAGGGCCGATCCTGGTCCATGTCGTGACCCAGAAGGGCAAGGGCTATGCGCCGGCGGAGGCGGCGGCCGACAAATATCATGGCGTGCAGAAGTTCGATGTGGTGACGGGCGCGCAGGCCAAGGCGCCGCCGGGGCCGCCGGCCTATACCGCCGTCTTCGCCGATGCGCTGGTGGCCGAGGCGAAGCGCGACGAGACGATCTGCGCGATCACCGCGGCGATGCCGTCGGGCACCGGGCTCGACCGGTTCGAGAAGGCGTTTCCCGATCGCTGCTTCGACGTCGGCATTGCCGAACAGCATGCGGTGACCTTCGCCGCCGGCCTGGCCGCGCGCGGCATGCGGCCGTTCTGCGCGATCTATTCCACCTTCCTGCAGCGCGCCTACGACCAGGTGGTGCACGACGTGGCGATCCAGAACCTGCCGGTGCGCTTCGCGATCGACCGGGCGGGGCTGGTGGGGGCGGACGGCGCCACCCATGCCGGCAGCTTCGACGTCACCTATCTCGCCTCGCTGCCCAATTTCGTGGTGATGGCGGCCGCCGACGAGGCGGAGCTGACCCACATGGTCCACACCATGGCGCTGCACGATACGGGGCCGATCGCGGTGCGCTATCCCCGGGGCAACGGCACCGGCGCCGCCATCCCCGCCACGCCCGAGCGGCTGGAGATCGGCAAGGGCCGGCTGGTGCGCGAGGGCAAGACGGTGGCGATCCTCTCGCTCGGCACGCGGCTGGCGGAGGCCGAGCGCGCGGCGGACCAGCTGGAGGCCCTGGGCCTTTCCACCAGCGTCGCCGACCTGCGCTTCGCCAAGCCGCTGGACGAGGCGCTGATCCGCCGGCTGCTGTCCACCCATGAGGTGGCGGTCACCATCGAGGAAGGCGCGATCGGCGGCCTCGGCGCGCATGTGCTGACGCTGGCCTCCGACGCCGGGTTGATCGACGGCGGCCTGAAGCTGCGCACCATGCGCCTGCCCGACATCTTCCAGGATCAGGACAAGCCCGAGCGCCAGTACGAACAGGCCGGCCTCGACCACAATGCCATCACCGCAACCGTCCTCGCGGCCCTCAGAAAAAATAGCCTCGCCGTCGCGGGGGATGCCGGTTGATCGGCGCGCGGCTGTTGCTGGCCGCGCTGCTGGCGGCGCCCGCCGCCGATCCGTCCGGGTCGATCGAGATCGCCGTCACCGGCGTCCGCACCGCCGAAGGGCGCGTCCATGTCGATATCTGCCCCGAGGCGCATTTCCTGAAAGAGGATTGCCCGTGGTCGGGCGAGGCGCCGGCGCGGATCGGCGCCACCGTCGTCGTGGTGCGGGGCGTACCGCCGGGCCGCTATGCCGCGCAGGGCTTCCATGACCGCAACGGCAACGGCAAGGTCGATCGCAACCTGATCGGCATTCCCACCGAGGGCATCGGCTTTTCCAACGACGCGAAGATCCGTCTCGGCCCGCCCAAATTCGCCGATGCCGCCTTCGATCACGGGCCGGGCGACCAGAGGATCGCCTTCCGCCTGCGCCATATGGCGGGCTGAAGCAGGGCGTGGCGCGGCTGGCCCCAGCAACGATGCCGGCGCGGGCGAACGGGCCGGTCGCCCGCGCGGTGGACGCCGCCTGCCGGCGGCCGATGGCGGTGCTGGTCGCCGCCCTCCTCCTCGCGCTGGCCGCCGGCGCCTATGCCGCCACCCATTTCGCGATGACGACGGACAGCACGGCGCTGATCTCGCCCGATGTCGGCTGGCGGGTGAACGAGCGCCGGCTCGACGCCGCCTTCCCGCAGAATGGCGACGCGATACTGGTGGTGGTGGATGGCGCGACGGCGGAGCTGGCGGAGACCGCCTCCGCCGCGCTCGCCGATCGGCTGGCGGTCGATCGCGCCCATTTCCGGGGCGTCACCCGGCCCGACGGCGGCCAATTCTTCGCACGCGAGGGGCTGCTCTTCCGGTCGCAGGCCGATGTCGCCGCCGCGTCGGCCCGGATGGTGGAGGCCCAGCCCTTCCTCGGCCCGCTCGCCGCCGATCCCAGCCTGCGCGGCATCGCCGATGCGCTCGGCACGATGATCTCCGGCGTCGACCGGGGCGAGGCCGATATCGCCCGGATCGACCGGCCGATGGCGGCACTGGCAGACGCGCTGGAGGCGCAGGCGGCCGGCCGGCCGGCCTATTTCTCCTGGCAGGCGCTGCTCGGCGACGGGGACAAGGAGGGCGCGCTGGAGGCCCCGCGCCGCCGGCTGATCCTGGTCCGCCCGATCCTCGATTATGGCGCGCTCCAGCCCGGCATGGCCGCGAGCGACGCGATCCGTGCCGCCGCCCGCGCGCTCGCCCTCGATCCCGCCCATGGCGTCACCGTTCGCCTCACCGGATCGGTGCCGCTGTCGGACGAGGAATTCTCCTCGCTCGCCGACAAGGCATGGCTGGTGGCCATGGTGATGATCGCCGCCATGCTCGGCACATTATGGCTCGCCACCCGCTCCGGCCGTCTGGTCGCGGCGATCATGCTCACCACGCTCGCCGGCCTCGTCGTCACCGCCGCGATCGGGCTGATCGCCGTCGGCCGGTTCAACCTGATCTCGGTCGCGTTCATCCCGCTGTTCGTGGGGCTGGGCGTCGATTTCGGTATCCAGATCGCCGTGCGCTTCCAGGCCGAACGCCATGGCGGCGCGAGCCCCGCCGATGCGCTGCGGGGGGCCGCCACCGCGCTCGGCGCGCCGCTGCTGCTTGCCGCCGGCGCCGTCTGCCTCGGCTTCCTTGCCTTCCTGCCCACCGATTATGTGGGGATCGCCGAACTGGGCATCATCTCCGGCATCGGCATGATCGTGGCGCTCGCCTTCAGCGCCAGCTTGCTGCCGGCGCTGATCCTGCTGCTGCGCCCCGGCCGGCCCCGCGCCGAGGTGGGCACGCCCGCGCTGGCCCCGGCCGACGCCTTCCTGATCCAGCGGCGCAAATTGGTGCTGGGCCTGTTCGGCCTGTCGATGGTGGTGAGCATCATCGCGCTGCCGGCGGTGCGGTTCGATTTCAACCCGCTCCACCTCAAGGCCCCCGATGCCGAGGCGATGGCGACGCTGACCGATCTGATGCACGATCCCGATCGCAACCCCAATGTGATCGACATCCTCGCGCCCGATCTCCCCGCCGCCCGCCGGCTCGCCGCGCGGCTGGAAGCCTTGCCCGAGGTCGGCCGGGTGATGACGATCGAAAGCTTCGTGCCCGAAGGCCAGGCGGAAAAGCTCGCCACGATCGCCGATGCGCGCCTGCTGCTCGATCTCACGCTCGATCCGCTGGAGCCGCTGCCGCCGCCAAGCGACGCCGATACGGTCGCCGCCCTCCGCCGCACGGCCGCCGCGCTCGCCGCCCATGGCGAAAACCCCAATGCCCGCCGCTTGGCCAAGGCCCTCTCCACGCTGGCCGGCGCGCCGCCCGAGGCCCGCGGCAACGCCCGTGCGATGCTGGTGCCGCCGCTGGAGGCGATGCTCGCCCAGCTGCGCGCCGCCCTGTCCGCCGAACCCGTCACGCTGGCCGATATCCCCGCCGATCTGAAGCGCGACTGGCTGGCCCCGAAGGGCGGCGTGCGCGTGCAGGCGGTGCCGCGCGCGGCCGGCAACGACAATGAAGCGCTCGCCCGCTTCACCCGCGCGGTGCGCGCGATCGCGCCCGATGCCACCGGCGTCGCCATATCCACCCAGGAAGGCGCGCGCACCGTCGCCCATGCCTTCGTCCATGCCGGCCTGCTGGCGCTGGCGGCGATCAGCCTGCTGCTCTTCGCCGTGCTGCGCGACCTGCGCGAGGTGGCCTTCACCCTCGCGCCCGTCGTCCTTTCGGGATTCCTCACGCTCGGCACCTGCGTGCTGATCGGCCAGCCGATCAACTTCGCCAACATCATCGCCTTCCCGCTGCTGTTCGGGGTGGGCGTGGCCTTCCACATCTATTTCGTGATGGCCTGGCGGGCCGGCACCGCCGATCTCCTGCAATCCAGCCTCGCGCGCGCCATCTTCTTCTCGGCGCTCGCCACCGGCACCGCCTTCGGCAGCCTCTGGCTGTCCAGCCATCCCGGCACCGCCAGCATGGGCAAGATCCTGATGCTGTCGCTCGCCTGGACCCTGGTGTGCGCGCTGATCTTCGAGCCCGCCCTGCTCGGGCCGCCGCGAAAGGATCAGCGTTGAGCGCCGCCGTCGATCGCGCGCAGCCGCCGGGCGACGCCGGCCACGCCATCCGTGGCGACGGTGCGGGCAAGGTCCGCGCGCTGCACGGCAAGCTGGCTCACCCCTTGCGCGATCACGTCGATGATCCGCCATTCCCCGCCCGACTGTCGCATCCGATAGAGCAATGTATCGCCGCCGCTGGTGCCGGTGCGGATCGTCACCTTCACGATCCGGCTGCTGTCCCGCACGGTAACGTCGGGATCGACGATGAAGCGCTCCCCGCCATAGCTCGCGAAATTCCGCGCGAGCGACAGGGCCGAATGCCGCGTCAGCGCCGCGATCGCCGCCTGCCGGTCCGCCGCCGGGCTGGCGGCCCATTTCGGGCCGATCACCAAAGCCGTGATCGCCGGCATGTCGTAATAAGCCCGCACCGCCGCCTCGAACCGGTCGGTGCGCTGGCCCAGCGGCAGCCGCGCCTTCATGATCGCGACCACCTGATCGTTATAGGCCGCCACGCGCGCGGCCGGATCGGCGGCCTGCGCCCGGACCGCGACGGGCGCCAGCGGCATGGCGGCAAGCGCCAGAAGAAAGGCGGGGCGGAACATCGGCGTCTCCATCCGCGACGGGCAGGCCGCAACGATGTGGCCCGGCCCGGCGGCGGCGGCAAGCCCCTGCCGCCGCGATCATGGCAGGAACGGCACGGAACGCCTTGCCGCGATTTAACTTTTGGGAATGAGCCGGCTCCGCTATCTGTTGGCGCAGCCTCGTAAAAGGAGGACTCGGTGGTTAAGCTCGTCCTTGCCCAGCCCCGCGGTTTCTGCGCCGGCGTGATCCGCGCGATCGAGATCGTGGACAACGCCCTCGATCGGGTGGGCGCCCCCGTCTACGTCCGGCACGAGATCGTCCATAATCGCCATGTGGTCGATACGCTGCGCGCCAAGGGCGCTGTGTTCGTGGAGGAATTGTCCGACGTTCCCGACGGCGCCGTCACCGTGTTCAGCGCCCATGGCGTCGCCCGCGCGGTGGAGAAGGAAGCGCGCGATCGCGGGCTGCCGGTGCTGGATGCCACCTGCCCGCTGGTCAGCAAGGTCCACATCCAGGGCCGCCGCTATGTCGCCGCCGGCCGCACGCTGGTGCTGATCGGTCATGCCGGCCACCCGGAGGTGGAGGGCACGCTGGGCCAGATCGACGGCACCGTCCACCTGGTCGGATCGGCGGAGGACGTGGCCGCGCTCGACATCGCCGACGACAGCCCCGTCGCCTATGTCACCCAGACGACGCTGTCGGTGGACGACACGCGATCGGTGATCGATGCGCTGAAGGCGCGCTTCGCCGACATCACCGGCCCCGGCACCGCCGACATCTGCTACGCCACCCAGAACCGGCAGACCGCCGTGCGCGACCTCTGCCGCGTGGTCGATATGCTGCTGGTGGTGGGATCGGCCAACAGCTCCAATTCCAACCGCCTGCGCGAAATCGGCGTGGAGCTGGGCCTGCCCAGCCACCTCGTCGCGGATGGCGACGCGATCGACCCCGCCTGGCTCGAGGGGGTGGAGCGGATCGGCCTCACCGCCGGGGCCTCGGCGCCCGAGGATCTCGTCCAGGGCGTGATCGCCGCGATCCGCGGCCATGTTCCCATCACCGTCGAAACGCTCGACGGGATCGAGGAGGATCTTCATTTCCGCCTGCCGCCGGCGCTCGATCGGCTGGCCCGGCGCGATGTGGTTGCAGAGGAGGCCTGAGCAAGATGGCGGCCATACCCTTTTCCCAGGCGGCGCGGATCGGCGGCTATGTGCTGGGCCGCAAGCTCCGGCGCGTCGAACGCTATCCGCTGGTGCTGATGCTGGAGCCGCTGTTGCGCTGCAACCTCGCCTGCAAGGGCTGCGGCAAGATCGACTATCCGGACGAGATACTGAACCAGCGCCTTTCCTACGACCAGTGCATGGCCGCGATCGACGAATGCGGCGCCCCCGCCGTATCGATCGCCGGCGGGGAGCCGCTGCTCCATCGCGAGATGCCGCGCATCGTCGAAGGCTATATCGCGCGCAAGAAGTTCGTCATCCTCTGCACCAACGCGCTGCTTCTGAAGAAGAAGATCGATCAGTATCGCCCGTCGCCCTTCTTCACCTGGTCGATCCACCTCGACGGCGATCAGGTGATGCACGATCGCTCGGTCTGCCAGGACGGCGTCTACGAGGTCGCGCGCGATGCGATCCTGCTCGCCAAGTCCAGGGGGTTCCGCACCCAGATCAACTGCACCGTGTTCGACGGCGCCGATCCCGATCGGCTCGCCGCCTTCTTCGACGACATGATGGCGATCGGGCTCGACGGCATCACCGTCTCGCCCGGCTATGCCTATGAACGCGCGCCGGACCAGCAGCATTTCCTCAACCGCGAAAAGACGCGTCAGCTCTTCCGCGACGTGTTCCGCCATGATCCCAAGCGGAAATGGGCCTTCACCAACTCGCCCCTGTTCCTGGATTTCCTGGCCGGCAACCAGACCTACGAATGCACCCCCTGGTCGATGCCGCTGCGCACCGTCTTCGGCTGGCAAAAGCCCTGCTACCTGCTCGGCGAAGGCTATGTGCAGACCTTCCGCGAGCTGATGGACGACACGGACTGGGAGAATTACGGCGTCGGCAAATATGAAAAATGCGCCGACTGCATGGTCCATTGCGGATTCGAGGGCACCGCGACCACGGACGCCGTCCGCCATCCGCTCAAATTCCTGAAGGCCGCCCGCCACATCCGGACCGAAGGCCCGATGGCCCCGGATATCGACCTCAGCCGCCAGCGCCCGGCCGAGAACGTCTATTCCAGCCATGTCGAGCGCGAACTGGCCCTCATCCGGCAATCGCAGCCCGAAGCCGGCAAGCATGTGACGGCGGCGTGGCGGTAATGACGGCCGCCACGCGACCTTCGAAGGTTCATTTCACTCCGTCGGCCCGGTAGATCACCCCATCCTTCATCACGACCGCGAAGTTGCGATCCGGGTCGCCGATCAGGTCGATATTCGCCAGCGGATCGCCGTTCACCAGGATCAGGTCGGCCAGCGCGCCGGCCTCGACCACGCCCAGCTTGCCCTTATAGGGCGCGCGTTCGCCGGACAGGGCGAGGAGCTGGGCGTTGTCGTGCGTCACCATCCGCAGGATCTCGGCCGGGGTGAACCATTGCTTCAGCCGGAGGATATAGGCGTTCTGTTCCCGATTCTCCTCGGGCCGGAACAGGAAGTCCGTGCCCCAGGCGAGCTTCACATGATATTTCTTCGCCCAGTTCCAGGCATGGTCCGAACCTTCGCGCACCGCCATCTTCTTGGCGACGCGATCAGGCGGATCGTTCGGCAGGGGCGGCGGCAGGTTCTGGAGGGAAAGCCATGCACCGCGATCGGCGATCAGCTTGATCGTCGCTTCGTCGAGCAGCTGGCCATGCTCGATGGATTTCACCCCCGCATCCAGCGCACGGCGCACCGCGCGCGACGTATAGGCGTGGACGGCGACATAGGTGCCCCAATCCTCGGCGGCGGCCACGGCGGCCGAAAGCTCCTCGGGCAGATATTGGGTCACGTCGATCGGATCGTAATCGGAGGATGAACCTCCGCCCGCCATCAGCTTGATCTGGCTCGCCCCGAAGCGGAGATTCTCGCGCGTAGCGGCGAGCACCTCGTCCCGCCCGTCGGCGATGAAGGTGGCGCCGAGCTCCTCGGCGCGCGATGGCTTGCCGAAGAAGCGGCGCGAGCGTTCCGTGGGCAGGCGGAAATCGCCGTGGCCGGACGTCTGGCTGATCACCGCGCCCGACGGCCAGATGCGCGGCCCCTGATATTTGCCGCGATCGATCCCGGCCTTCAGCCCGAAAACCGGGCCGCCCATGTCGCGCACGGCGGTAAAGCCGCGCAGCAGCATGGCCTTGGCCTGATCGGCGGCGGCGGCCTCGGCTGCCTGCGGGGTCAGATCCGGCGACATGAGCTTCGCCATGTCGAGCGCGCCGAAGGTCAGGTGCACATGGACGTCGATCAGCCCCGGCATCAGCGTCCGGCCGCGCCCCTCGATCACCCGAGCCCCGGCGCCCGCCTTGGCCGCCGGCCCGATCGCCGTGATGACATTGCCCTTCACCGCGACATTGGTGGGCGGGGACAGGCGATCAGACTGCCCGTCGAACACCCGGACATCGCGGAACAGCGTTTCCGGGGCATCCTGCGCGTGGACCGCCGCGCCCGTCGCCATCGTCATGAAAAGCGCCGCCGCCGGCCGCCAATGCTTCCAGTGCATCATTGCCTCTGCTCCCAGAACCGTTCGCCTTCATTCATGCGCGCCATCAGCCGTTCGAGCGACCGGAAGGCCGCGCCCACATCGCGCGCGGTGCGGATCAGGGCGGGCAGCTGGCCGGGGTTGCGGGCCAGCGATGCCGTCACCGGGCCGATCGCCATGCCGCCGCCGGGCTGCATCCCCACCCAGGCGGCGTGCGGCAGATCATGGTCGGCGGTATCGGATATCACCCGCAGGAAGGCGAAGGGCAGGCCGTGCCGGGCCGCCACGCGGGCGGCGACGTGCGATTCCATATCGACGATGATCGCGCCCGTTCCGGCATGGAGCGCGCGCTTGACGGCCGCTTCGGCGACCAGCGTGCCATTGGCATGGACTTCGCCGATCCGCGCCTCGGGCAGGCGGCGGGCGAGTGCCTCCACCGTCGCCGGATCGCCGCCGATCGTCCAGTCGCCGACGGCGAGGTCGGGGGCGAGGCCGCCGGCTATCCCCATGCTCCAGATCGCGCGCGCGGAACGCGCATGCGCCTCCAGCGCTGCTTCCAGCGCCGCCGCGTCGCCGCCGCCGGGGATCGGCGCCAGCCCGTCGCGCGCGATGATCCGGGCCTCGCGCTTCAGCCCCGTCGCCACCAGGATCGTCACATGCCCACCATCACGCGGCGCGCGTTGCCGCGCTTGAGATTGCGGTATCGCGCCATCGCCCAGAGCGGGAAATAGCGCGGATAGCCGTGGTAGCGCAGGTAGAAGACGCGCGGGAAACCGCCCCCGGTGAAATGTTCCTGCGGCCACAGCCCATCCGTGCCCTGATGGCGCATCAGCCAGCCGACGCCGCGCTGGACGGCGGCGCCATCCACCTCGCCCGCCGCCATCAGCCCGATCAGCGCCCAGGCCGTCTGCGAGGCCGTGGAGGGCGCGGGGCGGTGGCCGGTGCGATCCAGCGCATAGCTGTCGCAATCTTCGCCCCAGCCGCCATCGGGATTCTGGATCGCCTCCAGCCAGGCGACTGCCCTGCGCACGATCAGTTCCTGCGGATCGACCCCCGCCGCGTTGAGCGCGCAGAGGGCGGACCATGTGCCGTAGACATAGTTCACCCCCCAGCGGCCGAACCAGCTGCCATCGGCCTCCTGCTCGCGCGCGAGATAGGCGAGCGCCGCCCGCATGCGGGGGCTTTCCGCCGGCTCGCCGAGTTGCGCCAGCATCGATATGCAGCGCGCGGTGACATCCACGGTCGGCGGATCGAGCAGCGCGCCATGATCGGCGAAGGGGATGTTGTTGAGATATTCCGCCATATTATCGGCATCGAAGGCGCCCCAGCCGCCATTGCGGCTCTGCATGCCTTCCACCCATTCGCGGCCGCGCGCGATCGCCTGATCCTGCCCGTCGCCGGGCACGATCCGACCGCGCGCGCGGTCCATCGCCATCGCCACCACGGCGGTATCGTCCAGATCGGGATAATGATCGTTGCGATACTGGAAGGCCCAGCCGCCCGGCCGCACATCGGGCCGCTGTTCCGCCCAGTCGCCCTTCACGTCCAGCACCTGCAGCGGCTTGAGCCAGGCGAGTCCCTTCGCCGCCCGCGCTTCCGCCGCGTCGCCGCCTGCCTCCATCAGGGCATGCGCCGCCAATGCGGTATCCCAGACGGGCGAGACGCAGGGCTGGCAGTAGATCTCGTCCGTCCCGTCCTCGGGATCGAGCACCAGCAGCTTTTCGACCGAGGCGCGGGCGATGGCGCGATCGGGGTGATCGGGCGGATAGCCGAGCGCATCATACATCATCACGGCATTGGCCATGGCGGGATAGATCGCGCCCAGCCCGTCCTCCCCGTTCAGCCGTTCGCGCGTCCAGGCGACGCAGCGATCGATCGCGCGGCGGCGCAAGCCGGCGGGCCAGAAGGGACGCACGCCCTTCAGCAGCTTGTCGAGCGCGAGGAAGCCTTCGGTCCACAGCCTTCCGGTGCCCGCCGCGCGGCTGGCGATCGGCGCCGGTTCCTTGCTGCGATACAGTTCGTCGACATGGGTGCCGCGCGGATTGCGGGCGCGCGGGCGGAGCGCCATCAGCACCAGCAGCGGCACGATCACGGTGCGCGCCCAATAGGACATCTTCGACAGATGGATCGGGAACCAGCGCGGCAGCAGGATCATCTCCACCGGCATTTCCGGCACCGCCGCCCACGGCCCCGCGCCATAGAGGGCAAGCTGGATACGGGTGAAGACATTGGCGGCTTCCGCCCCGCCGGCCGCGAGCACCGCCGCGCGCGCCCGCGCCATATGGGGCGCATCTCCGTCATCGCCGATCAGCTTGAGCGCGAAATAGGCCTTCACCGTGGCGCTCAGGTCCATCGCGCCGCCGTGGAACAGCGGCCAGCCGCCGTCCCCGTTCTGGATGCGGCGGAGATAGCGGCCGACCTTCGCCTCCAGCGCCAGATCCTCCGGTTCGCCCAGATAATGGCGCAGCAGCAGATATTCGGCCGGGATGGTGGCATCGGCCTCCAGCTCGAACACCCAATGGCCGTCGTCGCGCTGGGCGGCGATCAGTGCCACCGCCGCGCGATCGACGGCCTTCTCCACCGCGGCCAGCGGATCGACGGGAACAGGATTGCCGGCTTCAGACTGATACATGCGCGCGTGCCACCTCGCCTGCCAGATGGGCGGCCGTCCGCCCGGATCGGATCGCCCCTTCGATCGTCGCCGGCAGGCCGGTCCGTGTCCAGTCGCCCGCCAGCATCAGATTGGCCCAGGCCGTGGCCGCCCCCGGCCGCCGCGCATCCTGATCGGGCGTGGCGGCGAAGGTGGCGCGGCGTTCGCGCACGATCTGCCAAGGCGGCAGCGGCGCATCGATCCCGGCGGCGCGGGCCACCTCCGCCCACAGCCGGGCGGCGAGCGCGCCGCGATCCTCCTCGCACAACCGGTCCGCGGCGCTGATCGTCACCGAGATCCTGTCGGGAAAGGCGAAGATCCATTCCGCGGTGCCGCCGATCACCCCCAGCATCGGCGGCAAGGCGGGCGGCGGCGGAAAGGCGAAATGGGCGTTGACGATGGCATGGTGGCGATCGGGCACGGTGAGGCCCGGCACGAGATCGGCCGCCACCCATGCCGGCACCGCGAGGATCACCGGATCGTCGGTGGCGACGGCATCCGCCCCGTCGGCGAAATCCAGCCGATCCACCCGATCGCCTGCGAAGCGCAGCGCCTTCAGCCGCCGGCCCATGCGGATATCGGCGCCATGCGACCGGAGCCACGCGATCGCCGGATCGACGAAGGCGGCCGCCAAGGTGGGATGGGCGATGCGGGGGCGGCAGGCGCGGCCGCCCTTGGCCAGCGTCTCGCGGATCACGGCGGCCGCCAGCCCGGCCGATGCCTCCGCCGGCGGGGTGTTCATCACCGCCACCATCAGCGGCGCGATCAGCCTTTCCCACAGGGGCGTATCGGTGGCGATCACATCGCCGATCCGCGCATCCGCATCGGCGCCCAGCAGGCGGGCGAGCGGCAGATAATCGGCAACACCCGTGCCCGGCACCCGCCGGCCGGGGACGAACAGCCACCAGGGCAGCGGCCCGCCATTGGGCGCCACCCGCCAGCGTTCGCCATCGCGCAGGTCGAAAAAGGCAAAGCTCGCCTCCATCGGCCCGGCCAGCCGATCGGCCGCGCCGATCGTGGCGAGATGATCGGCCACGGCGGCATTGCCCGACAGCACCAGATGGTTGCCGTTATCGATGGTGAGGCCGAGCTGGGGATCGTGATAGGAGCGGCAGCGCCCGCCGGCGCGCGGCCCCGCTTCGGACAAGGTGACGGCGTAGCCCGCCTTGGTGAGCGCGATCGCCGCCGCCAATCCGGCCAGCCCCGCGCCGATCACATGAGCCCGCGCCATCGCCGCCCTACCGCAGCAGCCTGAGGCGGATCATCATCCCCAGGATCGCCAGCCGGTTGTGCCGCACCCGGCGGCGCGGCGGCGCCCAGCCCGCCGCCTCCATCCGATCCAGCAGCACGCCATAGGCGCCCGCCATCAGCCGGGGGGCGATCAGGTGGCCGCGCGGCCGGGCCGCCAATATGCGCCGCGCGGCATCGAAATGCTGGCGCGCCTCCGCCGCCACCGCCCGGCAGGCGCGATCGATCCGGGGATCGGCGGTGATCGCGGCGATATCGCCCAATGGCAGCCCGGCCGCCGCCAGCGCCTCGGCCGGCAGATAGACCCGGCCGATCGCCGCATCCTCATCCACGTCGCGCAGGATGTTGGTGAGTTGCAGGGCGCGGCCGAGATGGTGGGCCAGCGCCAGCCCCGGCGCTTCCTCCATGCCGAACACCCGCACCGAAAGCCGCCCCACGGCGGATGCCACGCGATCGCAATAGAGATCGAGATCGGCCGCGTCGGGGCAGCATATGTCGCCCGCCACGTCCATCGCCATGCCGGCGATCACCGCATCGAAATCCGCCCGCGCGAGATCGAAGCGGCGGACGGCCGGCGCCAGATAATGCGCCTGCCCCGGATCGCCGCCAGCATAGAGCCGGGCGATATCGCCGCGCCACGCCTCCAGCGCGGCGGCGCGGGCGGCACGATCCCCGCGCTGGTCGTCGGCGATATCGTCCACCTCGCGGCAGAAGGCGTAGATGGCATACATCGCCTCGCGTTCGGGGCGCGGCAGCACGCGCATGCCGGCATAGAAGGAGCTGCCGGCCGCCCGGCCCTGCGTGGCGAGGCTGGTGGCGGTCATGCCCGCCCGCCCCGCGCGATCATCCGGCCCAGTGCCGCGCGTCCAGCGATCATCAACGCCTCCATCCTGCCATGGTGGACCTTTTCGGCCAGCGGATCGCGACGGCGCAGCCGCGCCGCAAGGCTTTCCGCCAGCCGCTGGATCGCCGCCACCTCCATGCCCAGCCGCCGATCGGCGATGGCGCGGGCGAAACCGGCCGACCGGGCGAGCAGCGCCTCCGCCTCACCCGCCGCCTCGCCGATCGCCGCCTTCAGCGCCGGGGCCGCTCGGGGGGCAGCGAGCATATCGACGGTGGCGCCATGCGCGGCCAGCCGATCGGCGGGCAGATAGACCCGATCGATCGCGCGATAATCCTTCCCGCAATCCTGCAGGTGGTTGATGACCTGCAGCGCCGCGCACAGCGCATCCGACGCCGGCCACAGCGCGCGATCCTCGCCATGCACATCCAGCACGAAGCGGCCGACCGGCATCGCCGAATAGCGGCAATAATCGATCAGCGCGTCCCAATCGGCATAGCGGTTCACCGTCACGTCGCGGCGGAACGCCTCCAGCAGATCGAGCGCGTGGCCCGCATCCAGCCCGCGCCCGGCCAGCGCCGCGCGCAGCGCCCCCGCAGCGGGATCGGCATCGCTTTCGCCGGTCAGGCCCGCGCGCATCGCCTCCAGCCGGGCGAGCTTTTCGACAGGCGACGCCGTGGGGTGATCGGCCACGTCGTCCGCCGCCCGCGCGAAGCGGTAGAAGGCCATGATCGGCGCGCGATGCTCCGGCCGGATCAGCAGCGAGGCGACCGGGAAATTCTCGTCCTTATGGCCCTTGCCCGATGCCAGCGCCGCCGCGCCCGCCGCCCTTGGCGCCTGTTCCGCCAACGCGGTCATGCGCCCAGCCGCGCCTGCGCGCGCCCCTTCCACATGCCGCCGCGCCCGCGTGCGTGCGCGACGGCCGACAGCGCGGTGGCGCCGGCATAGAAAGCCGCGATCGCCGGCAGCAGCGGCCCCCACAGGGGCGACCGGCCGTAGAAACGCAGCATCGGCTGGAAGGCGATCGCCATCAGCGCCCAGGCGGCGATCCCCATCGCCCGGCCCGGCCCGGCGGCGAATGCCGCGGCCAGCGGCGGCACCAGGAAGACCAGCGCCAGGCCCAGCAGCGTGCCTGCCAGCAGCCAGGGCGAATAACGGAGCTGGGCATAGGCCGACCGCGCGATCATGGCGCCGATCTCCCGCCAGCCGCCATAGGGGCGGATGCTCACCGATCGGCGCGTCAGCATCAGGCGGATCGGCCCCTGCCGCTTCATCGCCCGGCCCAGCGCGCAATCGTCGATGATATGGGCGGCGATGGCGGGGATGCCGCCCGCCCGCGCCAGCGCATCGGCGCGCGCCAGCATGCAGCCGCCGGCAGCGGCGGCAACCGGCGATCCCGCCCGGTTCACCCGCGCGAAGGGATAGAGCATCTGGAAGAACAGCACGAAGGCCGGGATCAGCGCGCGTTCGGCGGCATTGGCGGTGCTGAGCCGCGCCATCAGCGAACAGAGCGCCAGCCCCTCCGCCTCGCCGCGCGCCACCAGCGACCGGAGCGTATCGGGGGCATGGGCAATATCGGCGTCGGTCAGCCACAGCCAGCGCGCGCCGCCGGCCCGGGCGATGCCCTGCTCCATCGCCCAGAGCTTGCCCGTCCAGCCGGCGGGCGGCGGGCTGCCGGGCACGATCTCCAGCCGGTCCCCCCGGCCCGTGGCGGCGGCCGCTTCCCGCGCGATCCGGGCGGTGCCGTCGCTGCTGCCGTCGTCCACCAGCAGAATACGGAAATCGCCGGGATAATCCTGCGCCAGCAGGCCGGCGATCGTCCGGCCGATCACCGCCGCCTCGTCGCGCGCCGGCACCACCGCCGCCACCGCCGGCCAGTTCGCCGGATCGGGGGCAGCCGCCTCGTCCGTCTCGGCCGCGCGCCAGAAACCGCCATGCCCCGCCAGCAGATAGATCCACAGGCCCAGGCTGATGGCGGCCAATATCGTCATGCGATCATGCCCTCGCGCCGGAACCAGGCGATCGCATCGCCGATCGCGTCGCGATAGGGCCGGGCGCGATAGCCCAGCTCCGCCGCCGCCCTGGCCGAGCTGTAGAACATGGCGTGGCGCGACATCTTCAGCGCATCGCGGGTGAGGAAGGGCTCGCGGCCCGACAGGCGCGACACCAGCTCCGCACCCCAGGCCAGCGGATAGAGCGGCCCGCGCGGCAGGTTGAGGCGCGGGGGCCGGCGGCCGGTGAGGCCCGCAATGTCCGTCAGCATCGTCCGCAAGCTCACGTCCTCGCCGCCCAATATGTAACGGCGCCCGGCCCGCCCCCGCTCGAACGCGGCGACATGGCCTTCGGCCACGTCGTCAACATGGACCAGGTTCAGCCCCGTGTCGACGAAGGCCGGCATCCGGCCCCGCGCGGCCTCGATCAGGATGCGGCCGGTGGGCGTGGGCTTCACGTCGCGCGGGCCGATCGGGGTGGAGGGGTTGACGATCACCGCAGGCAGGCCGCGCGCCGCCACCATCGCCTCCACCAGCCGTTCGGCGGCGACCTTGCTGCGCTTGTAGGCGCCGATCGCCGCCGCCTCGTCGAGCGGCCGGTCCTCGTCCGCCGGGCGGCCGGCGTCGGGCGCCAGCGTGGCGACGCTGCTGGTATAGACGATGCGCGGCGTGCCGGCGGCCAGCGCCGCCTCCATCACGGTGCGCGTGCCTTCGCGATTGTTGCGGACGATCTCCTCCGGGTCCGCCGCCCACAGGCGATAGTCGGCGGCGACGTGCGCCAGCCCGTCCACGCCCCGGAGCGCGGCGCGCATGGCGGCAGCGTCGCGGATGTCGCCGCGCACGATCTCGCCCGGAAAGTCCCTGAGATTCGCAGGCGCGCTCGTTTCGCGCGCGAGGCCGCGCACCGCATAGCCGGCGCCGGCGAAGGCCCGCGCCACCGCCGCGCCCACGAAGCCGGAAACGCCCGTCACCAAAATGGTCTTCCCGCGATCGGCCAGCATTGCCTCCTCGGTCGCCCGGTTCCGGCCCAGGCCTGCGACGAAGTATGTGAAAAGATGGCGGCGCGGTCTACAGTCGATCGCGTGACGATGACCGCGCTCCTGCTGCCGCAATGCCTCGCCGTCGCCGGCATCCTTTATACATTGGCCGCGACGATCCTTGCCGGACGCTGGAAATCCGCGCCGATACCGCCGGAAAACGGCCCGCCGGTCACGATTCTGAAGCCGCTCCACGGCGCCGAGCCGCTGCTGGCCGAAAATCTCCGCAGCTTCGTGGAGCAGGATTATCGCGGCGCGGTCGAGATCGTCTGCGGCGTGCACGACCCGGCCGATCCGGCGGCTGCCGTGGCCCGCGCGATCGGCGGACCGGTGCGGGTGCGCGCCGATCGCGCGCGCCACGGCAGCAACGGCAAGATATCCAATCTGATCAACATGATGTCCGACGCCAGCGGCGACATCATCATCCTGAGCGACAGCGATATCGCGGTGCCGCCCGATTATATCTCCCGCATCGTCGCCACGATCGCGCCTGCCGGCACGATCGCCACCTGCCTCTATGCCGGGCGGGGCGATGCCGGTGCCTGGTCCCGCATCGCCGCCGCCGGCATCAGCTGGCAGTTCCTGCCGTCCGTCATCGTCGGCCTCGCCACCGGGCGCGCCCGGCCGTGCATGGGATCGACCATCGCGATGCGCCGCGAAACGCTGGACCGCATCGGCGGCTTCGCCCCCTTCGCCGACGTGCTGGCGGACGACCACGCCATCGGTGCCGCCGCCCGCGCGGCGGGCTGCGAAGTGGCGATACCGCGCCTCATCGTCACCCATGGCTGCGCCGAGACGAGCCTTGCCGCCCTCGCCCGCCACGAGCTGCGCTGGAACGCCACCATCCGCGGGCTCGACCCATGGGGCTATGCCGGCAGCATCGTCACCCATCCGCTGCCGCTCGCTCTTCTCGGCCTGAATTTGTGGCTGGTCGCCGCAGCGCTGGCCGCGCGCCTCGCCCTCGCGCTCCGCATCGACCGCCTCGCCGGCCGCCGCACCGCGCCGATCGCGCTGTTGCCGCTGCGCGACATCCTTTCCTTCATCCTCTTTCTCGGCGCCTTCGCCGTCCGCTCAATCGATTGGCGGGGAGGAAGATTTCGGCTAGGGAAGGATGGTCGGATGTCGGCGGATACGGAGTATCTGACGTGATGCGCTCGCTTTTTCTGCAAGCCCCCTCCTTCGACGGTTATGACGGCGGCGCCGGCGCGCGCTACCAGATGAAGCGCGAGGTCCGGTCCTTCTGGTATCCCACCTGGCTCGCCCAGCCGGCGGCGCTGGTGGAGGGATCGAAGCTGATCGACGCCCCCGCCCACGACCTTTCCTTCGACGACATCAAGCACGAAGCCTATGCGCGCGACCTGGTGATCCTCCACACCTCCACCCCTTCCTTCCGGCAGGACGTGAAGACGGCGGAGATGCTGAAGGCGCTGAATCCCGACCTGAAGATCGGCCTGATCGGCGCCAAGGTGGCGGTGCAGGCGCAGGAAAGCCTGGCGGCGTCCGAGGCGATCGACTTCGTCGCCCGCAACGAATTCGACTTCACGATCAAGGACGTCGCCGACGGCCAGAACTGGGCATCGATCAAGGGCATCAGCTACCGCAACGCGCAGGGGGTGATCGTCCATAATGACGACCGGCCGGTGCTGGAGGATATGGACGCGCTGCCCTTCGTCAGCCCGATCTACAAGCGCGACCTCGTGATCGAGAAATATTTCGGCGGCTATCTGAAGCATCCCTATGTGAGCTTCTACACCGGGCGCGGCTGCAAGAGCCGCTGCACCTTCTGCCTGTGGCCGCAGACCGTGGGCGGCCATAATTACCGCACCCGATCGATCGGCCATGTGATCGAGGAGGTGAAATATGTGATGCGGGAAATGCCGCAGGTGAAGGAGATATTCTTCGACGACGACACGCTGACCGACAACGCGCCGCGCGTCGAGGCGCTGGCCCGCGAGCTGGGCAAGCTGGGCGTCACCTGGTCCTGCAACGCCAAGGCCAACGTGCCCTATGATACGCTGAAGGTGATGAAGGACAACGGGCTGCGCCTGCTGCTGGTCGGCTATGAAAGCGGCAACCAGAAGATCCTGCACAATATCAAGAAGGGCCTGCGCGTCGATGTCGCCCGGCAGTTCACCAAGGATTGCCACGCGCTGGGCATCGTCATCCACGGCACCTTCATCCTGGGCCTGCCCGGCGAGACGAAGGAGACGATCGAGGAAACGATCCGCTACGCGCAGGAGATCAACCCCCACACCATCCAGGTATCGCTGGCCGCGCCCTATCCGGGCACCTTCCTCTACCGGCAGGCGACGGAGAATGGCTGGTTCGACGGGACGGACCATCTGCTGACCGACCATGGTAACCAGATCGCCCAGTTGAGCTACCCGCACCTGAACAGCACGGAGATCTTCGCGTCGGTCGAGGATTTCTACAAGCGCTTCTACTTCCGCCCGCGCAAGATCGGCGCGATCGTGGGCGAGATGATCCGCGACCAGGACATGATGAAGCGGCGGCTGCGCGAAGGGGTGGAATTCTTCCGCTTCCTGCGCCAGCGCAAGGAAGCGGTGGCCTGAAGGGGGGCGGGACGATGCGCTTTCCCGCCCACGCAGCCGCCCCGGCCGGCACTTTGTCCCGCCCCCCCGCCCATGGGCCGGGCTGGGCCGGCACCGAGCCGGGCGAGGCCGCGCCCGCGCCGCACCCGGCCAGGAGGCTGGTGATAACCGCCGACGATTTCGGCGCGTCGATCGCGGTGAACCGCGCCGTGGAGCGCGCGCATCGCGAAGGCGTGCTGACCGCCACCAGCCTGATGGTGGCGGGCGAGGCCGCGACCGACGCGATCGCCACCGCGCGCAAGCTGCCCATGCTGGGCGTGGGCCTGCATCTGGTGCTGGTCGACGGCCGGCCCACCTTGCCGCCCGATCGCGTGCCCGATCTGGTCGATGGCGACGGCCGGTTCCGCGCCAACATGGTGCGCGCGGGCGTGGACTTCTTCTTCCGCCCCGCCGTGCGGCGCCAGTTGGCCGAGGAGATCGAGGCGCAGTTCATCGCCTTCGGCGCGACCGGGCTGAAGCTGGATCATGTGAACGCCCACAAGCATTTCCACCTGCACCCGACCATCGCCGGCCTGATCGTGGAGATCGGCGGGCGTTTCGGCCTGCGCGCCGTGCGCGCGCCGGTGGAACCGCCCGAGCCGCTGGCCGAGGTGGAGCCGGCCAGTCCCGGCCGGCTGGCCGACCGCGTTGCCGCGCCATGGGCGCGGACGCTGCAGGCGCGCTTCGCGTCGGCGGGGCTGATCGTGCCCGATCAGGTGTTCGGCCTGCGCTGGTCCGGCCATATGCATGTCGGGCGGCTGGCGGGCCTGATCGAGCATCTGCCGCCGGGCCTCACCGAAATCTATCTCCATCCCGCGACCGAGGCCGGCTTTGCCGGCCATGCGCCGGGTTACGACTATGAGGCCGAACTGGCGGCGCTGGTGGACGAGCGGACGCGCGCCGCGCTTCGCCTGTCGGGCGCGCGGCTGGGCAGCTTCACCGATTTCGAGGAAGAAAGGGTCGCGGCCTGATGGAGAGTATGAGCGCCCTTGCCCCCCCCGCCGCCGCCGGGGGCGCCCCGGCGCCGCTGGGCAGCGAGACGCACAAGACGCTGTTCTGCCGCACCCTGCTGGATACCCATGATCCCTACCGCCCCGCGCTGATCGAATGGCCGAAGCTGGACGCGGACACGCAGGGCAAGATCACCTCGCTGCCGATCTGGGACATCGCCGTCGCCACCGAGGGCCGGGCCGGCATGAACGTGCGCACCTTCGGCGAGGCGGTTTCCGACCCCCTCCTCAAGGAAGCGATCATGATGAACGCCTTCGAGGAGAGCCGCCACAAGCTGGTGCTGGCGGATATGGTGCAGGCCTATGGGATCGAACTGGCGCCCGAGCCCGAATATCGCCGCCCGCGCGATCCCGAATTCGCCTTCATGCGCACCGGATATTCCGAATGCATCGACAGCTTCTTCGGATTCGGCCTGTTCGATGTCGCCCGCAGATCGGGCTTCTTTCCGCCGGAGCTGGTCGACACGTTCGAGCCGGTGATGCGCGAGGAAGGGCGCCATATCCTGTTCTTCGTCAACTGGGTGGCCTGGTGGCGGCGCAACATGCCCTGGTGGCGCCGGCCTTGGTTCGAGCTGAAGGTGATCGCGGTGTGGATCGTGCTGATCCTCGAGCGGATCGACATGGCCAAGGGCATGGGCAGCAACACCAAGGCGCAGGAGAATAATTTCACGCTCAACGGTTCCAAGGAACTGGGCGTGGAAATCAGCTTTCCCGAGCTGGCCCGCATCTGCCTTGCCGAAAACGACCGGCGGCTGGCCCCTTATGACCGCCGCCTGATCCGGCCGCGCTTCGTGCCCGGCGCGATCCGCTTCGTGCTGCGCTTCATGCGATCGTGAAGGTCCCGTTCCGCTGGCCGCTGCTGGCGGCGACCGCGATCGGGCTCGCCATCGCGCTGTGGGCGATCGGCCGGGCGGGGCTGGGCGACATCATGGCCGCCGCCGGCCGGCTGGGCATCGGCGGCTTCCTGCTGCTGATCGCCTGTTCCTTCGCGGTGCTGGGCCTGCTGGGCGCGGCGTGGCTGACGGCGATGCCGGACGCACCTTTCCGCCGCCTGCCGCTCTTCACCTGGGCACGGACGACGCGCGAGGGGGCGAGCGACCTGCTGCCCTTTTCCCAGATCGGCGGCATCGTGGTCGGGGCGTGGACGCTGATCGGCCGCGGCCTGCCGGCGACGCGGGTCTATGCCTCGATCATCGTCGATCTCACCACCGAGATGGCGGCGCAATTGCTGTTCACCCTGTTCGGCCTGTGGATGCTGGGGGCGATCCTGCTGGACGCCGATGCGATGCGATCGCTGCGCACGCTGGCACTGATCGGCGCGGGCGTGGCCGTGGCGGTGACGATCGCCTTCGCCCTTCTGCAGGTGCCGGCGCTGCGCTTCCTCGCCTTCCTCGCCCGGCGGATGCTGCCCAGGGCGGAGGTGGCGGTGGATGCGGTGGTGGCCGAGCTGACGCGCTATTACCGGGTGCGGCGCGCGATCCTCGCCTCCTTCTTCTTCAACCTGCTCGCCTGGGCGGGGAGCGCGGCCTCGGCCTGGCTCACCCTGCGGCTGATGGGGGAGCATCAGACGATCTGGCACATCATAGCGCTGGAAAGCCTGATCTTCGCGCTGCGCAGCGCCGCCTTCGTGGTGCCGGGCGCGATCGGCATCCAGGAGGCGGGTTATATCCTGCTCGGCCCGATATTCGGCATCGGGCCGGAGGCGGCGGTGGCGCTGTCGCTCGTCAAGCGGGCGCGGGACATCGCGATCGGCGTGCCGGCGCTCCTCATCTGGCAGATGGGCGGCGTGCGATCGGGCCTCAGGAAAAGCGCCTGAAACCCGGACGCCTTATCCCTCGTCACCCATGCGCAGCGCGGCGATGAAGGCTTCCTGCGGGATGCTGACCGAACCATATTCGCGCATCCGCTTCTTGCCTTCCTTCTGCTTTTCCAGAAGCTTGCGTTTGCGGGTCGCGTCGCCGCCATAGCATTTGGCGGTCACGTCCTTGCGCAGCGCCGCGATCGTCTCGCGCGCGATCACCTTGCCGCCGATCGCCGCCTGGATCGGGATCTTGAACATGTGGCGGGGGATCAGGTCCTTCAGCCGCTCGCACATGTGGCGGCCCCGCGCCTCGGCGGCGGCGCGGTGGACGATCATGCTGAGCGCATCGACCGGCTCGTTGTTGACGAGGATCGACATCTTCACGAGATCGCCCTCGCGATAGCCGATCTGGTGATAGTCGAAGCTGGCATAGCCGCGCGTGATCGATTTCAGGCGATCGTAGAAATCGAACACCACTTCGTTCAGGGGCAGCTCGTAGGTCACCTGCGCGCGGCCGCCCACATAAGTCAGGTTCTTCTGGATGCCGCGCCGATCCTGGCAGAGCTTGAGGATGCTGCCCAGATATTCGTCGGGCACGTAGATCGTCGCCTCGATCCACGGCTCCTCGATCATTTCGATCTTGTTGGGATCGGGCATGTCGGCGGGGTTGTGCAGCTCGATCGTCCGCGCCTCCGCCTCGCCCGCCGAACGGGTGAGCAGCAGGCGATAGACCACCGACGGCGCAGTGGTGATGAGGTCGAGATCATATTCGCGGGTCAGCCGCTCCTGGATGATCTCCAGGTGGAGCAGGCCGAGGAAGCCGCAGCGGAAGCCGAAGCCCAGCGCGGCGGACGTCTCCATCTCGAAGCTGAACGAAGCGTCGTTGAGGCGCAGCTTGGAGATCGATTCGCGCAATTTCTCGAAATCATTGGCGTCGACCGGGAAGAGGCCGCAGAACACCACCGACTGCACTTCCTTGAAGCCGGGCAGCGGCTCTGCCGCCGGCTTCTTCGCATCGGTGATGGTATCGCCGACGGCGGTCTGCGACACTTCCTTGATCTGGGCGGTGATGAAGCCGACCTCGCCGGGGCCGAGATCGCCGAGCTGCTCGATCTTCGGGCGGAAGCAGCCGACGCGATCGACCAGATGGGTGGTGCCGGCCTGCATGAACTTGATCTGCTGGCCCTTGCGGATCACGCCGTCGATCACGCGGATCAGGATGACGACGCCCAGATAGGGATCATACCAGCTATCCACCAGCATCGCCTTGAGCGGCGCGGTCGCATCGCCCTTGGGCGGCGGGATCTTGGCGACGATCGCCTCGAGAATGTCGTCGATGCCGATGCCCGACTTGGCCGACGCCAGCACCGCGCCCGATGCATCGAGGCCGATCACCTCCTCGATCTCCTCGCGCACCTTCTCGGGCTCGGCGGCCGGCAGGTCGATCTTGTTGATGACGGGCACGATCTCGTGATCATGCTCGATCGACTGGTAGACGTTGGCCAGCGTCTGCGCCTCCACGCCCTGCGCCGCGTCCACCACCAGCAGCGCGCCCTCGCAGGCGGCGAGGCTGCGCGAGACCTCATAGGCGAAGTCCACGTGCCCCGGCGTGTCCATCAGGTTCAGGACATAGCTCTGCCCGTCCTTCGCGGTATAATCGAGGCGCACGGTCTGCGCCTTGATGGTGATGCCGCGCTCCTTCTCGATATCCATGTTGTCGAGGACCTGGGCGGACATCTCGCGGTCCGAAAGCCCACCGGTGCGCTGGATCAGGCGGTCGGCCAGCGTCGACTTGCCATGGTCGATATGCGCGATGATCGAGAAATTGCGGATACGGTCGAGCGGGGTCACGGAAAGGGCCGATCTTCTGGGATGGATGGAATTGGCGCCGCCGATAGCATCAAGCAGGCCGCTTGCCAAAGATATGTGAAATCCGCCGGCATGTGACCCGCGTCACAAAATCCGTTAAGGGGTTGGTGTTAACGCGCAGGCGTTATCGGGGGTCGCATAACTGGGAGGCATTTGTTGTGAAGAAGATCATGTTGCTCGCTGGCGCGTTGGCCAGCGTGGCCCTGCTGCCGCCCGCGGCCCAGGCGCAATCCTCGGCCCGCAAGCAGCAGACCGCCAAGCAGGCGGAGATCCCCGTCTGCACCCGCAAGCTGGGCACCATCGCCATCGTCGAGCCCGAGAATCAGTGGTGGCGTGAACTGAGCCTCGGCAGCCCCGAGGCGATCATCAAGACGTTCGTCCAGAAATCGGGCTGCTTCACGCTGGTCAATCGCGGCCGATCGCTCGCCAGCCGCAACATGGAGCGCGCGCTGGCCGATTCCGGCGAGCTTCAGGCCAAGTCCAATATCGGCAAGGGCCAGGTGAAGGCGGCCGATTATTTCCTGCAGCCGGATATCGTGAGCTCCAACAACAATGCCGGCGGCAATGCGCTGGGCGGCCTGCTGGGCGGTGTCCTCGGCCGCAACACGTTCGGCGCGCTGGCCGGCGGCCTTTCGATCAAGAAGAAGGAAGCCAACGTCACGCTTTCGGTCGTCAACGCCCGCACCACCGAGGAAGAGGCGCTGATCGAAGGCTATGCCCGCAAGCAGGATCTGGGCTTCGGCGCCGGCGCCGGCCTGTTCAGCGGCGGGGCCTTCGGTGCCGGCGGCGGCGGTTATGAGAATACCGAGATCGGCCAGGTGATCGTGCTCGCCTATCTCGATGCCTATACCAAGCTGGTCACCCAGCTCGGCGGCCTGCCCGCCGACGCATCGGCCGCGGCGCCCAAGGCGGCCAACTGATCGGGCAGGCGGGCGACCCCGAGGTCGCCCGCCTGTTTGCGGTCACCAGGCGAAAAGACCCGCCAGCCAGTAGAAGAGCGCGCCGATCGCGGCCGCCGCCGGCAGCGTCACCACCCAGGCGATGACGATGCGGCTGGCGATGTTCCAGCGCACGGCCGACATGCGCCGTGCCGATCCCACGCCGACGATCGCGCCGGTGATGGTGTGCGTGGTCGATACCGGCACCCCCAGCCATGTCGCGCCGAACAGGGTGAGCGCGCCGCCCGTTTCCGCGCAGAAGCCCTGCGCCGGGGTGAGGCGGGTGATCTTCGATCCCATGGTGTGGACGATCTTCCATCCGCCCATCAGCGTGCCCAGCCCCATCGCCGCCTGGCAGCTTATCACCACCCAAAAGGGCACGTGAAATCCGCCGCTCAGCATCCCCTGCGAATAAAGCAGCACGGCGATGATCCCCATCGTCTTCTGCGCGTCATTGCCGCCATGGCCGAGCGAATAGAGCGATGCCGAGACGAGCTGGAGCTTGCGGAACCAGCGATCGGCAAGCGTGGGCGTGGCCTTCACGAACGCCCATGAGGTGATGAGCACCAGCACCAGCGCCAGCACCAGGCCGATGGTGGGCGACAGCACGATGGCGGCGGCCGTCTTGAACACGCCGCTCCACACCACCGCGTGCAGCCCCGCCTTGGCAGTGCCGGCGCCCAGAAGGCCGCCGATCAGCGCGTGGCTGCTGCTGGAAGGGATGCCCAGCCCCCAGGTGATGAGGTTCCACGATATCGCCCCCATCAGCGCGCCGAAGATCACATGGGCATCGACGATCTCCGCGCTGACGATGCCGCGCCCCACCGTCTCGGCGACATGCAGGCCGAAGAAGAGGAAGGCGATGAAGTTGAAGAAGGCGGCCCAGACCACCGCATATTGCGGGCGGAGCACCCGGGTGGAGACGATCGTCGCGATCGAGTTGGCGGCATCGTGCAGGCCGTTCAGGAAATCGAACAGCAAGGCGACGCCGATCAGCCCGATCAGCAGCGGCAGCGAAAGAGCAGCGGCTTCCATGATGATTGTCCGGCGCGATCAGGCGTGATCGATCACGAGACCCTGGATCTCGTTGGCCACGTCCTCGAACCGGTCGACGATCTTTTCCAGATGGCTGTAGATTTCGCGCCCGACGATGAATTCCAGCGGCTTGCTGTCGATATGGGCGCGGTAGAGCGCGCGCAGGCCGGCATCGTGGATCTCGTCGGCATGGCCTTCGATCCTGACCAGCCGTTCGGTCAGCTCATGCAGCCGCCCGCCATTGGTGCCGACGGACCGCAGCAGCGGCATCGCCTCCACCGTGATCCGCGCAGCCTCCACGATGATACCGGCCATGTCGCGCATCTGCGGCTCGAACCGGGTGATCTCGTAGAGGGTGATCGCCTTGGCGGTCTTGTTCATCTGGTCGATCGCATCGTCCATCACCCCGATCAGGCTGGTGATGGCGCTGCGGTCGAACGGGGTGATGAGGATACGGCGCACATCCTGCAGCACCTCACGGATGATCTCGTCCGCTTCATGCTCGCGCTCGAAGATCTCGCGGACATGGTCCGCCATGTCGTCGCCGCCTTCCAGCAGCCGCGCGAGCGCGTCGGCGCCGGCGACCAGCGTGGCGGCATGATCCTCGAACTGTTCGAAGAAGCGCCCCTGTTTCGGCATCAAGGCCTGGAACCAGCGCAGCATCGGTATCGTAGCTCCCCAATTGCGGACGCAGGCGAGGCCGCGCTCCAGCAGGCCGGCGGGCGCGACGGGCGCCCGGAAACCGGCGATCAGTTGTTTCAGTTGCGGCTCGGCGACGGCATCCGCGGCCTCGGCCAGGGTGAACCATCGCGTCTCGCGCTCATCCTGCTCCGGCCATTCTTCCGCCTGCTTCAGCACCGCCAGCGGGAAGACCGCGACGGTGGCGTTGCGCGATTCGCCATTATTCTTTCTCTTGCGATAGCGATAGGTGCCGAGCGCGGACGGGCAGGGAATGCCCGATACGCCCGCTTCCTCATAAGCCTCGTGCGCGGCGGCGCGGTGCGGGTCGAGCCCGCGGATCGGATTGCCCTTGGGAATCACCCAGCGCCCCGTATCGCGGGAGGTGATGAGCATCACGCGCGCCGATCCGTCCGCCTCGATACGGTAGGGCAAGGCAGCGATCTGGCGGATGGGACGGGCTCCTTCGGGAATGCGCGGCTGTCATTGAAACGTAATATTGGCAAATGCAATCGCCAGCCGCCCGCCGGACGGCCCTTGTGCCGCGCCCGCACCGCGTTACCATCGTCCAAAATACTGGAGAGGCGAACGATATGCGGCATATCGCGATCATCGGTTCGGGCCCGGCGGGCTATTATACGGCGGAAGCCTGCCAGAAGCAGTTCGGCGACCAGGTGAGCATCGACATCATCGATCGTCTGCCCGTGCCCTACGGCCTCATCCGGTTCGGCGTCGCGCCGGATCACCAGTCGATCAAGGCGGTATCCCAGCGCTACGAGCAGGTATCGCTCAACCCCAATGTCCGGTTCGTCGGCAACATCACCGTCGGCGGCGAGGTGACGGTGGCGGAGCTGATCGCCCTCTATGACGCCGTGATCCTCGCCACCGGCGCACCCGTCGATCGCCCGCTCGGCATCCTCGGCGGCGACCTGCCCGGCGTCATCGGGTCGGCAGCCTTCGTCGGCTGGTATAACGGCCATCCCGATTTCGCCGGGCTCAACCCGCCACTCGACTGCGAGGGCGCGGTGGTGATCGGCAACGGCAATGTCGCGCTGGACGTGACGCGCATCCTCGCCAAGGCGCCGGCCGAGTTCGTCGGGTCGGATATCGTCGCCCACGCCTTCGAGGCGCTGGGCAATTCGGCCATCCGCAGGGTGACGATGGTGGGCCGCCGCGGCCCGCACGAAATGCAGATGACGCCCAAGGAGCTGGGCGAGCTCGGCCACCTCCAGCGCGCCGTGCCGCATGTCGACCCCGCCGATCTGCCGCCCGTGGAGGCCGATGCCGCGCTCGAGCCGGGCCAGCGCAAGGCGATGGGCCATCTGCGCGGCTTCGTCTCGCTCGCCCAGGACAAGGAGGTCGCGATCGACTTCGATTTCTTCGCCAAGCCGGTCGCGATCGAGGGCGACGGCCGCGTCGAGCGCCTGATCGTCGAGCGCACCCGGCCGATCGGCGACGGGCAGGTGGAAGGCACGGGCGAAACCTATGCGATTCCGTGCGGGCTGGTCGTATCCTGCATCGGCTATCGCACGCCGCCCATTCCCGGCGTGCCCTATGACGAGAGCGCCGGGCGATTCGCCAACAGCGAGGGCCGCGTGCCCGTGGACGGCGGCAACCTCTACGCCGTCGGCTGGGCCAGGCGGGGGCCCACCGGCACGATCGGCACGAATCGTCCCGATGGCTACAAGATCGCCGAGGAGATCGCCGCCGATCATCCCGCCCCCGCCAACAAGGCCGGCGGCAAGGGGCTGGACGCGCTGCTGGCCGAGCGCGGGGCGGACAAGGTGAGCTTCGCCGACTGGCAGAAAATCGAGGCGGCGGAGACCGCGCGCGCCCGCGACGGAGCCCCGCGCGAAAAATTCGTGGTGGTTGCGGACATGCTGGCTGCACGAGGCGCTTGAAAGCCGGAAAAATCCCCCCTATAGGCGCCCGACTTGGCAAGCAGTCGCGCCTTCCCGAGAGGCACAAAACAGCGATTGCACTCCGCGTCGGGGAGTAGCTCAGCCTGGTAGAGCACTGTCTTCGGGAGGCAGGGGCCGGAGGTTCGAATCCTCTCTCCCCGACCATTAATTTCAATAGCTTCGGTGGAAAACGGGCAGGCCTTCGGGCCGGGCCCGTCCGAGTTATGTCCGCGTTACGCCGATCCGCCCGATAACGAGAAAGCCCCCGCACCGGCTGGACCCGGGGCGAGGGCTGAAACTGGCTACCGTGCGGGCGGCCTCGATCTGGATAGCACAGCCGCCGCCGCCAGCCAACGCATTCGGCGCCCGATCCTCTCGCTGAGGTTCGGGGCATGAGCATTCGTATCATGTCGGCAGTGTGGGGCCTGAAGCTGGGTGACAGCGACAAGCTGGTACTGCTCGCGCTCGCCGATCAGGCGAACGACGATGGCGTCTGCTGGCCCTCCATGGCTTCGCTCGCGGCAAAGTGCAGCAAGTCGGATCGGACGGTTCAGGCGGCCATCAAGTCGCTTGTGGACGCCGGCCACCTTTCCCGCGTCGAGCGGCCGGGAAAAGGGGTCCGCTATACCGTCCACCCCCGAAGCGATTTCACCCCCGAAGCGGCTTCACCCCCGAAGGGAACGACGTCCACCCCCGAAGCGGCTTCGGACAAACCATCAAGAACCATCACTCTTTCTCAGAAGACTTCGTCTTCTTCGAAAGCGCGCGCGAAGAAATCGGCTGTTGAGCCGTTCGCGGTTCCTGACTGGGTTCCTGCCGATGCCTGGAATGGCTGGCTGGAGATGCGGCGGCAGGAGGGCAAGCGCCCGACTCCCCGGGCCCTCGAACTTGCGATCGAGGAACTGAGAAAGCTGGCGGACGCAGGCCACCCGCCCGGTGCGGTGCTCGACCAATCGACACTCCGCCAGTGGACTGGCCTCTTCCCGATCAAGGACAATCGCAATGAGCAACATCACCAATCTTTCGGATCGTCGTCCCACCGCCCTCAGCACGGAAGCGAGGGAGTGGGGAAAACAGTGCATGCGGCAAATCGCGCGATCGCCAGCCTTTCTGGCTCTGAAGGCGGGTGACGAGGCGACGGCAGCGGAAGCGGCGCGTGGCATGCGTTCCGAGATCGTGGACGCCTGGCGTGTGGCTGACGAGGCCATGCGGCCGACACCACCGGCAACGATCATCGCCAAGTTGACGCACGTTCTGGCGCTGTGTGCGGGCGTGGGCATGAGTGCCGACGATCGGGGCGAGTGGCTGGCGGCGGCTGCAACGGCGCTCGATGGCATCCCGCCGGATCTGCTCGCCATCGGCATCGAGGCTGCGCGCCGGACTGCGGATCACCCGAGCAAGATCGTGCCGGCCATCAACGCGGCCATTGGCGTGTATTGGACTGACAGGCGCGACGAATTGCGGATGTGCACGCGGCTGGGTGCGCTCACGAAGGCCGATGCCGTCGGGATTGCCGATACCATTGGCGCCAGCGGCGATGGTATCCACCCGAGCGAGGTTGCCGAAACCAACCGCCTGATGCGCAAGCTGGGCCTGCGGCAGCGCTACCGGCCGGATGGCGCCGGCTACCAGTTGGAGCGCGGCCGGCCCGACCCGGCCGGCGATGCTCAGCCCGGCGATGCGCCCATGCCGGCGGATACCGAGGTGCGGCGGCATGAGGGGCCGGCCCGCAATCCGACGATCAACGATTACGTGGCCTTGGGCGTGGACCGGGAAACGGCCGCGCGGTTCGTGGCGGAGCGCGCCGGCAAGCACGGGACCAATCACGATGGCTGATTGGCCGTACAACACGGCGGCATGGAAGCGGCTGCGGCTTGCCCACCTCACGCGCTTCCCGATGTGCGAGGAATGCGAGCGCGTGGGCCGGCTGGTGCCTGCCAACACCGTGGACCATCGCCATGCGATCAGCGACGGCGGCGCGCCCTTCCCGGGGCACGATGGCCTCGCCAGCTACTGCCCGTCCTGCCACGGCGCGAAGACGGCGCGCGGATCGGAGGCCGGCGCGATCCGTTCGAGCAAGCCCCGCAAGGGATGCAACCCGGACGGCACCCCGCTCGACCCCGCGCACCCGTGGCACGGAAAATCGCTCAGGGCTGGGGCTGTAAGACCGACACCCGAGACAAATACTCAATTAGTTTCGGGGGATCGCGGCCGGCATGGGTAAGAGGGGGCCGGGTGCTGGTCGGCTGCGCGCGGCGGCGCGGCGACCGGGGGCCAATCGCGTGCGCCATCCATGGTCCAAGCGGGGCATGCCGCCGGAGGAACAGGTGCTCGCCTTCCTCCGTTCGCTACCGATCGTCTCGGGCCTGAAGGCGGGCGAGAAAATGGAGCTGCTCGAATTTCAGGAGCGGTTCGTGCGCGCCGTCTATGGCCCGGCGGACGATGAGGGCCGCCGGCTGGTGCGGCTGGCCGCGCTGTCCGTCGGTCGCGGCAACGGCAAGTCGGCGCTGCTCGCCGGCCTGTCGCTCGCCCACCTGCTCGGGCCGATGGCCGAGCCGCACGGCGAATGCTACGCGGCTGCTCTCGATCGCGAACAAGCGGGCGTGCTCTACCGCATGGTCCGGGGGTATATCGAGGAAACCCCGTGGATGGCGGCGGCGGTGAATATCCGCGACTGGCACAAGTCGATCGAGGTGGAGACCACCCGTTCGACCTGGACGGCGCTCACGTCGGACGCCCGCAAGGCGCACGGCCTGGCGCCGTCGTTCTGGATCGCCGACGAGGTGGCGCAATGGCGCTCACGTGAGCTGTGGGACAACCTCGCGACCGGCATGGGCAAGCGCAAGCATGCGCTTGGCGTGACGATCTCGACTCAGGCCGCCGACGACCTCCATTTCTTTTCGGAGATGCTGGACGCGGATCCGGACCCGTCGATCTATGTCCAGCTTCACGCGGCGGCCAAGGAATGCGCGCTCGACGATCGCGAGGCATGGGCGGCTGCGAACCCGGCCCTGGGGGCCTTCCGTGACGAACGTGAATTTGAGCTGGCGTCCGAACGGGCATCGCGCATGCCGTCGTTCGAGCCTGCGTTCCGGCTGCTCTACCTCAACCAGAGGATCGCGGCCGAGGGCCGATTCCTGAACCCGCTGGACTGGGATGCCAATGGCGACCCGTTCGACCCAGCGGAGTTGGAGGGCAAGCGTTGCTATGGCGGGCTGGACCTGTCGAGCACGCGCGACCTGACCGCGCTGGCGCTGTGGTTTCCCGACGAGGGCAAGTTGCTCGCCTGGCACTTCGTGCCGGCGGACACGCTCAGGGAGAGGGTGGAGAGGGACCGGGTGCCCTATGACCGCTGGGCGGCGGAGGGATGGATGGAAACCACCGTTGGCCGCGCGACCGATCGAACCGCCGTGGCCCGGCGGCTGGCCGACATTCGGCAGATGTACGACGTGCAGGGCATCGCCTTCGATCGCTGGCGGTTCGAGGATTTGGGCAAGCTGTTATCGGACGAGGGAATCGAACTGCCGCTGAAAGAGTTCGTGCCGGGGTTCAAATCCTACGCCCCGGCCGTGGATGCGTTCGAGCGCGCGGTGCTGGAAAAGCGCATGCAGCACAATGGATCGCCAATCCTCCGCTGGCAGGCCGGCAACGTAATTGTGGAGAAAGACCCGGCCGGCAACCGCAAGCCCACCAAGGCCAAGAGCCGCGACAAGATCGACGGCATCGTTTCCGCGATCATGGCGTGCGGGCTGGCCGCGACGGATGAGGGACCGGCGGTGTACCGGGGCGCGGGATTGGTGTGGATTTAGACCCGCAGCACACCCCGTTTTGCATAGACTGACTGTGGATGATTGCCAGGCCGATTCGCGCTGACGAAGCTCGGGCTGCCGGTATTGAAGATATACTCGTGGCCGGTTCCGTCGTGGCACTTCAAGAGGGGCGCCTCATAAGCAAGAACGGTGTAAACCGTGGTCCCTTCCTCTTCGCCTATGCCGGTAGCAATCTCATATTTCTCGCCGACTTCAAACATCTTCACCTCTCATTTTCAACATGGCGCGATCGTAGTCGCCCTTATAGAAAGTGGCTCCGTTGTGGCCGCCATCTATGCTGAATACAATCCGCTGGGCGAGTTCATCTTCCAAGCCGAGCTTGCGCGCCACTTCATTCGTACCGACACCATAGATTGCGTCGTCGGGCTCGCCGGCAAGCAGATTGTTGATGTAATTTTCCACGCTTGCCGCCGCGTCGTTGTAGTCACGCTGCTTGCGCCTTCTGGCGTCATTTGCGCCTCGATAGACGTCGCGCGGGACACTTATCGGGTACTGTTTCACCGGAATTTCTCCGCGAGAATGAGGCGGATCGCTTCCGGCCGGCTGGGCTTGGGATCGGGTTGAGCCCCGATCCATGCATCAAGGGCGCCTAGCTGGTCTGGTTGAATGCGGACGGTAACGGGCAATCCCTTGCCAGTCGGTGCAGGACCACGCCGTTTTCGTGACGGCAGAGTTTCTTGACCAGCCGATTTAACCATGCCATCACAGTAGCGAGCCGAACGCGAGGCTGCAACCTCACGTCCGGCTCTAACCGCAACCACCTGCCTAGAGGTGCTTATGGCTATTCACGCCCCTATCACGGGCGCGCTGTCGTGCGCGACCGTAGTTTCCGATTATCCCGCCGATTTCTGGCGGACCCTCGTCAATCTGGATCGGCTTCGGCTGGAAGGTGTGATCGAAATGGCGATCAGCCTGCTTGACGCGGATGACGGCGACCCGGATCTGGAACCCAACGGCGATGAACTGGATGGGACTGGCGGCGAAGACGATTTTTGCAATCATTCGGCTTACATGTCCGGCCCCGGCTGCCCGATCGCGGATCCGGATGTTGGGATCGACGACGTCCCACGTGATCGGGAAGCACCGCTGCGCCCCAAATATGGGATCGACCAGTCGGCCGGGCCGATCAACGAGGTGCAAGCTGCCCGCGAGTGGCAGGAGGCTCAACGGAGGCGATACGCGATCGGCTGAGGCGGCGCTGTTGTGATACGTACTCGCGAGAGTATTTTTGTTGACGCAGGGGTGCGGCAAGTGCAAAACGAACTCATTGGAGATCGTTTTATGAACCAGCTGACCGTACCCCGCGTTCGTTTCCGCCACGCTGCGGCGGCTGTCGGCGTTACGAAAAAGACGCTGCGCAATTGGCTGGTTCGCGGCCAAGTCCAGATTGAAACCGAGTCGGAAGGCTGGAACGAGTTCTCGCTGATTGACCTTGCCGAACTGACCCTCACGGCCGAGCTTGTAAGGTATGGGGTTGGTATCCTTCCGGCCTCAATGGCGGCCCGCAGCCAAATCTCCCTGTCCACGCACCTGCTCGCGTCTTACAAGAACACACCCGCACAGGCGTTCATGTTCGCATTCCGGGGCGCGCGGCTTTTCATGCAGTCGCCGAGAGCGGGGCTAACGCATTTCAAACACGCGCGGGATAACGAAGGTGCGCCGGCCGATTGGGCGGGTGCGAGCCTTCTGACCATCGATATTCAAACGCTTATTGAAGAGATGCTCGATCGACTTGCCGCCGCCATGGGCGCGGACGATGGCGCCGACGAGGGCGACGAGTAATGCTCGCTTCCTCCATTATTTCACGAATTGCGCGCGTGCTCCTGGGGGTGGCCCTGTTCTGGGCGCCCCTCGGCGCGTTCGCGCAACAATCGCCCACGCTGGGAAGCGTCGGCTGTCTCCGTGCCGGGGCTCAGTGCCCCGGCCAGATAGAGGATATCGCATGAAGACGAGTGATCTGATCGAACAGCGGGCGGCGATCGTCGCTCGCATGACCGACGCCCACCAGGCGGATAACGGCGAAGCCTTTACCGCCGCCGAAACCGAGTTGCGCGCACTGGACGCCAAGATTGGCCGCGCCCGCGCGATCGACGACGCGGAGCGCAATGAGCCGGGCCGGCCGATCAACGGCGACGGCCGGCTCGACGGCGAGATCCGTTCGCGCTTCAACGTGTCGCGGGCGCTCGCCGGCGCGGCCGGCCTTGCCGTCGATTGGGGTTTCGAGCGCGAGGTGCAGGGCGAGCTTGCCAAGCGCGCCGGCCGTTCGGCCGAAGGCATTTTCATCCCGACCGAGGTTTTCGAAACCCGCGTGCTGACCACCGCGACCGGGACCGAGCTGGTGCCGACCGAGCACCGCCCGGATCAGTATATCTCCGCGCTGGTCGCCTCGTCCGTCGTGCGTGGACTGGGTGCCCGGGTGCTGTCGGGCCTTGTCGGCAATCTCAGCATCCCGCGCGAGACGGATAGCCCGGCGATCGGATGGGTTGCGGAGAATACCGCGCTGACCGCCGACGACGCCAATTTCGACGCGGTGACGCTCTCCCCCAAGCATGCCGGCGCGCTCAGCGAGTGGAGCCGCAACATGCTGTTGCAGGCCAGCCCCGACGTGGAGGCTCTGCTGCGCCAGATGCTCGCCCGCAACCTCGCGCTGGCGATCGACAAGGCTGCGATCCAGGGCGGCGGCGCCAATGAGCCCAAGGGCGTGCTCGCCACCGCCGGCATCGCGACGCAGGCCTATGCGACCGATCTTTTCACCACCACGGCCGAGATGATCGCCAAGGCGGACATTGCGAATGTCGGCATGCCGCGTCGCAGCTTCCTGACGACCAACCTCATCAAGAAGATCTGCTCGCTGGAGCTGGACGCGAACAAGCTGCCCGTGGGCATCCCGCCGATCTTCCACAATGAGGCGGTGACGTTCTCCAACCAGGTGCCGACCAACCTCGGCGGCGGCGACGAGCACGGCCTGATCTATGCCGACTGGTCCGAGCTGCTGATCGGTATCTGGTCGGAAATCGACATTCTCGTGAATCCGTTCGAAAGCACGGCCTACAGCAAGGGCAATGTCTCCATCCGCGCCATGGCGACGGTGGACAGCGCCGTGCGTCACCCGGCCGCGTTCGTGTCGGCCACGGGCGTCGAAACCACCAGCGTGGGCATCGCCTGATGGTGGCCGGGGGCGATATGGAGCGGCGGGCCTTCACCGAGGTTCGCACGGCCGGGCGGCGTATCGAGGGATATGCCGCCACCTTCGGCACGGTGGCCAATCTCGGCCGCTTCACCGAAACCATTGCCCCCGGCGCCTTTCGCGAGGCGCTGGCCGGCGACGTGCTGGCCATGCTCGACCATGATCCCGGCAAGGTGCTGGGGCGGACCCGTTCGGGCACGCTGCGCCTGTCGGAAGATAGTCGCGGCCTCGCTTTCTCGCTCGACCTGCCCGACACGCAAGCCGGCCGCGACGTGCTCGCGCTGGCCGATCGCGGCGACCTGGGGGGCATGTCCTTTGGCTTCACGGTGCCCAAGGGCGGTGAGGCGTGGCAGGGCAATCGCCGCTCGCTCAACCGCGTGGGCTTGCGCGAAATCTCGATCGTGCAGGCGTGGCCGGCTTATCCCGACACGGAAATCGCCCTGCGCTCGCTCACGGCCGGCGCGGAGGCCAACCGCCGCGCGCGCCGGCTCATTCTCGCGGAGCTTGGCGCATGGGCATGATCGAACGCTTTGCCGCATGGGCCGGCTATGAAAAGCGCGCGGGCGATGATCCGAGCTGGGCGGCGCTCGCCCCCGGCATCGGCGCCATGGCCGGCATGTCGGCGCGCGCCGCTGAGAACCTGTCCGCCGTGCTCGCTTGCACGGGCGTTATCGCCAGTTCGCTCGCCTCGATCCCGGCGCTGATCTATCGCCGCGAGGGCGATGGCCGGACGGAGGTTTCGGGCCACCCGCTCGCCCGCATCACCCGCGAAGGCGTCACGGCCGCGATGACGTGGCCCGAGTTCATCGAGCATCTGGTCGCCTCGACGCTGCTAACCGGCAATGGCCTGGCAGAGATCCTGCGTGGACCGGGCGGTGAGCTTTCCGGCCTGCGCCATATCCCTTGGGGGATGGTAACTGTGGCCGAGCTATCGAGCGGCCGGCTGGCCTATGACGTAAGCGACGGCCGGGGCCGTACTTGGCGCCTGCTGGCCGGTGAGGTGATCCACCTTCGCGACCGGACGGATGATGGCCTGATCGGCCGCTCGCGTCTCAGTCGCGCCGGCGATGCCTTGGCCGGTGCGATGGCCGCGAACGATTTTGCGCGTAGCTTCCTCAACAATGGCGCGCAGCCGAGCGGCGTGCTCGAAATGCCGGGCGTGCTGACGGCGGATCAATTCACGCGGCTGCGCACGCAGATGGCTGAACGCCATGCCGGCGCCAAGAAGGCCGGCAATGTGATGATCCTCGACGGCGGCGCGCAGTGGAAGGCGTCGCAAATTTCGCCCGAGGATGCCGAGCTGCTTGAAAGCCGCAAGTTCGCGGTCGAGGAGATTTGCCGCATCTATCAGGTGCCGCCGCCACTTGTGCAGGACTACAGCCACAACACCTTCACCAACTCCGAAACGGCTGGCCGCTGGTTCGCCATGTTCACGCTCGCCCCGTGGGCGCGCAAGATCGAGGCCGAGTTCGCGCGCAGCGTTTTCCCGGCCGGTAGTGGCCTCGAAATGGAGCTGGACCTGTCGGGCTTCCTGCGCGGCGATCCGGCGACGCGGTGGAATGCCCACAAGATCGCGATCGACGCGGGCGTGCTCGACGCGGACGAGGTGCGCCAGGTGGAGGGGTGGAACCCCCGCACCAAGCAGGACAAGCAGGAGGTGCCGGCCTGATGCTCGCCACGTTCATCGCCGCTGGCGTGATCCCGACCGCCCCCGCGCCGGTTACGGCCGATCTGGTCACGCTGGCCGAGGCCAAGGAATATTGCCGGATCGACGGCGCAGCCGCATATGAGGACGCAACCTTGTCGATCCTGATCGCTGCGGCATCGGGCACGGTGCGCGATTACGCCGCCGGCTGGGACGGAACGGGCGAGGTTCCCGCGCGCCTGAAGCTGGCCGCCCTTGAGCTTATCGCCGCCCATTTCGACACCCGTGGCGATGTGCCCGACGTGCACCTTGAGCGCATCCTTGGGCCGTATCGGGAGCACAACGTCTGA
Protein sequences of DBSCAN-SWA_3 >NC_020561|1874484:1924690|1879837_1881010_+|WP_041864857.1|DBSCAN-SWA MRAARSGALIGAVVALGALLGGCNAVVLDPAGDVAKQQANLVVISTVLMLLIIVPVMALTVLFAWRYRQSNKEARYEPEWDHSTQLELLIWAAPLLIIICLGAITWTSTHLLDPYRPVSRVAKDQPVPADARPLEVEVVALDWKWLFIYPEYGIATVNELAAPVDRPINFRITSSSVMNSFYIPALAGQIYAMPGMETKLHAVLNKAGNYEGFSANYSGAGFSGMRFRFHGLDDAGFAAWVAKARAEKDRLDRKAYLELERPSENVPVRRFAAVDADLYDAVVNMCVEPGKMCMSEMMAIDAKGGLGKEGIRNVRQLTYDKHARRGAVLAPQPMMVGAVCAPPQPVKQAAAERPAPTGAPLTGAGLPRPGVIGGPPRASADAVVPRANNS >NC_020561|1874484:1924690|1875730_1877035_-|WP_015458543.1|DBSCAN-SWA MRLLLAVSTAMLVAAPAFADPQAGDIVIHAGTLLDRPGQAPRRNATIVVRAGKVAEVRDGFVAAPAGARLIDLKDRYVLPGLIDSHVHLESDRAGKDGLLASFTEGVPMHAYEAAWNARKTLNAGFTTVRNLGDGSGVTLALRDAIRQGWAVGPRIVDAGMSISTTSGHMDERLGLNDDLREHAGSPENLCDGVESCRHAVRIQIGRGADVIKIATTGGVNSRIGAGLGKQMFDDEAKAIVETAHMYGKKVAVHAHGADGIALALAAGADSIEHGTLMDDADIAAFRKTGAYYVPTLSTVNGYLERIAADPNAYEPAVRAKIDWRISITGKALEKAVPAGVKIAFGTDAGVSKHGRNADEFEWLVKHGLTPMGAIQAATVNAADLLGLKDEVGSIEPGKAADLIAVAGDPLADVKVLKTVGFVMKGGEVFKQED >NC_020561|1874484:1924690|1903906_1905004_+|WP_144062129.1|DBSCAN-SWA MTALLLPQCLAVAGILYTLAATILAGRWKSAPIPPENGPPVTILKPLHGAEPLLAENLRSFVEQDYRGAVEIVCGVHDPADPAAAVARAIGGPVRVRADRARHGSNGKISNLINMMSDASGDIIILSDSDIAVPPDYISRIVATIAPAGTIATCLYAGRGDAGAWSRIAAAGISWQFLPSVIVGLATGRARPCMGSTIAMRRETLDRIGGFAPFADVLADDHAIGAAARAAGCEVAIPRLIVTHGCAETSLAALARHELRWNATIRGLDPWGYAGSIVTHPLPLALLGLNLWLVAAALAARLALALRIDRLAGRRTAPIALLPLRDILSFILFLGAFAVRSIDWRGGRFRLGKDGRMSADTEYLT >NC_020561|1874484:1924690|1874484_1874805_+|WP_015458541.1|protease|DBSCAN-SWA MAGKDEENGTGLGVATRTRTRTKKPQPYKVLMLNDDYTPMEFVVLCLQRFFRMSLEDATRVMLHVHQKGVGVCGVFSYEVAETKVSQVIDFARQNQHPLQCTLEKA >NC_020561|1874484:1924690|1898706_1899969_-|WP_015458563.1|DBSCAN-SWA MARAHVIGAGLAGLAAAIALTKAGYAVTLSEAGPRAGGRCRSYHDPQLGLTIDNGNHLVLSGNAAVADHLATIGAADRLAGPMEASFAFFDLRDGERWRVAPNGGPLPWWLFVPGRRVPGTGVADYLPLARLLGADADARIGDVIATDTPLWERLIAPLMVAVMNTPPAEASAGLAAAVIRETLAKGGRACRPRIAHPTLAAAFVDPAIAWLRSHGADIRMGRRLKALRFAGDRVDRLDFADGADAVATDDPVILAVPAWVAADLVPGLTVPDRHHAIVNAHFAFPPPPALPPMLGVIGGTAEWIFAFPDRISVTISAADRLCEEDRGALAARLWAEVARAAGIDAPLPPWQIVRERRATFAATPDQDARRPGAATAWANLMLAGDWTRTGLPATIEGAIRSGRTAAHLAGEVARAHVSV >NC_020561|1874484:1924690|1886896_1888819_+|WP_015458554.1|DBSCAN-SWA MSDRPRTPLLDLVRTPDELRRLSPDQLATLASELRAEMISAVGVTGGHLGSGLGVVELTVALHYVFDTPRDVLIWDVGHQAYPHKILTGRRDRIRTLRQGGGLSGFTRRSESEYDPFGAAHSSTSISAALGFAVANKLAGAAGKAIAVIGDGAMSAGMAYEAMNNAAQAGNRLVVILNDNDMSIAPPVGGLSAYLARIVSSRPFLSLRDLAKKVARRLPRPLHDVASKTDQFARGMTMGGTLFEELGFYYVGPIDGHNLDHLIPILENVRDASEGPILVHVVTQKGKGYAPAEAAADKYHGVQKFDVVTGAQAKAPPGPPAYTAVFADALVAEAKRDETICAITAAMPSGTGLDRFEKAFPDRCFDVGIAEQHAVTFAAGLAARGMRPFCAIYSTFLQRAYDQVVHDVAIQNLPVRFAIDRAGLVGADGATHAGSFDVTYLASLPNFVVMAAADEAELTHMVHTMALHDTGPIAVRYPRGNGTGAAIPATPERLEIGKGRLVREGKTVAILSLGTRLAEAERAADQLEALGLSTSVADLRFAKPLDEALIRRLLSTHEVAVTIEEGAIGGLGAHVLTLASDAGLIDGGLKLRTMRLPDIFQDQDKPERQYEQAGLDHNAITATVLAALRKNSLAVAGDAG >NC_020561|1874484:1924690|1911296_1912046_+|WP_041865292.1|DBSCAN-SWA MLLAGALASVALLPPAAQAQSSARKQQTAKQAEIPVCTRKLGTIAIVEPENQWWRELSLGSPEAIIKTFVQKSGCFTLVNRGRSLASRNMERALADSGELQAKSNIGKGQVKAADYFLQPDIVSSNNNAGGNALGGLLGGVLGRNTFGALAGGLSIKKKEANVTLSVVNARTTEEEALIEGYARKQDLGFGAGAGLFSGGAFGAGGGGYENTEIGQVIVLAYLDAYTKLVTQLGGLPADASAAAPKAAN >NC_020561|1874484:1924690|1896740_1898720_-|WP_015458562.1|DBSCAN-SWA MYQSEAGNPVPVDPLAAVEKAVDRAAVALIAAQRDDGHWVFELEADATIPAEYLLLRHYLGEPEDLALEAKVGRYLRRIQNGDGGWPLFHGGAMDLSATVKAYFALKLIGDDGDAPHMARARAAVLAAGGAEAANVFTRIQLALYGAGPWAAVPEMPVEMILLPRWFPIHLSKMSYWARTVIVPLLVLMALRPRARNPRGTHVDELYRSKEPAPIASRAAGTGRLWTEGFLALDKLLKGVRPFWPAGLRRRAIDRCVAWTRERLNGEDGLGAIYPAMANAVMMYDALGYPPDHPDRAIARASVEKLLVLDPEDGTDEIYCQPCVSPVWDTALAAHALMEAGGDAAEARAAKGLAWLKPLQVLDVKGDWAEQRPDVRPGGWAFQYRNDHYPDLDDTAVVAMAMDRARGRIVPGDGQDQAIARGREWVEGMQSRNGGWGAFDADNMAEYLNNIPFADHGALLDPPTVDVTARCISMLAQLGEPAESPRMRAALAYLAREQEADGSWFGRWGVNYVYGTWSALCALNAAGVDPQELIVRRAVAWLEAIQNPDGGWGEDCDSYALDRTGHRPAPSTASQTAWALIGLMAAGEVDGAAVQRGVGWLMRHQGTDGLWPQEHFTGGGFPRVFYLRYHGYPRYFPLWAMARYRNLKRGNARRVMVGM >NC_020561|1874484:1924690|1913117_1914230_-|WP_041864859.1|DBSCAN-SWA MRQIAALPYRIEADGSARVMLITSRDTGRWVIPKGNPIRGLDPHRAAAHEAYEEAGVSGIPCPSALGTYRYRKRKNNGESRNATVAVFPLAVLKQAEEWPEQDERETRWFTLAEAADAVAEPQLKQLIAGFRAPVAPAGLLERGLACVRNWGATIPMLRWFQALMPKQGRFFEQFEDHAATLVAGADALARLLEGGDDMADHVREIFEREHEADEIIREVLQDVRRILITPFDRSAITSLIGVMDDAIDQMNKTAKAITLYEITRFEPQMRDMAGIIVEAARITVEAMPLLRSVGTNGGRLHELTERLVRIEGHADEIHDAGLRALYRAHIDSKPLEFIVGREIYSHLEKIVDRFEDVANEIQGLVIDHA >NC_020561|1874484:1924690|1921450_1922674_+|WP_015458586.1|capsid|DBSCAN-SWA MKTSDLIEQRAAIVARMTDAHQADNGEAFTAAETELRALDAKIGRARAIDDAERNEPGRPINGDGRLDGEIRSRFNVSRALAGAAGLAVDWGFEREVQGELAKRAGRSAEGIFIPTEVFETRVLTTATGTELVPTEHRPDQYISALVASSVVRGLGARVLSGLVGNLSIPRETDSPAIGWVAENTALTADDANFDAVTLSPKHAGALSEWSRNMLLQASPDVEALLRQMLARNLALAIDKAAIQGGGANEPKGVLATAGIATQAYATDLFTTTAEMIAKADIANVGMPRRSFLTTNLIKKICSLELDANKLPVGIPPIFHNEAVTFSNQVPTNLGGGDEHGLIYADWSELLIGIWSEIDILVNPFESTAYSKGNVSIRAMATVDSAVRHPAAFVSATGVETTSVGIA >NC_020561|1874484:1924690|1922691_1923216_+|WP_144062131.1|head,protease|DBSCAN-SWA MERRAFTEVRTAGRRIEGYAATFGTVANLGRFTETIAPGAFREALAGDVLAMLDHDPGKVLGRTRSGTLRLSEDSRGLAFSLDLPDTQAGRDVLALADRGDLGGMSFGFTVPKGGEAWQGNRRSLNRVGLREISIVQAWPAYPDTEIALRSLTAGAEANRRARRLILAELGAWA >NC_020561|1874484:1924690|1919483_1919708_-|WP_015458582.1|DBSCAN-SWA MFEVGEKYEIATGIGEEEGTTVYTVLAYEAPLLKCHDGTGHEYIFNTGSPSFVSANRPGNHPQSVYAKRGVLRV >NC_020561|1874484:1924690|1877633_1878176_+|WP_015458545.1|DBSCAN-SWA MGTQARPADGAMALAEMKEFAGFPAGTQRYIRRSLDIGLDRENAMVRWSRDMVEAASIRAQIRIYERLDTIRALVPDDSGLDAVEPFFAPLVIVSAFDLGQDRLPAFSAYRFLYERLIGAHVRPWLPGAFCAAAALPHLHPEKRRLLLQSISEAAATAPGWSSREPCFFPEWVEKVESLA >NC_020561|1874484:1924690|1892626_1893586_+|WP_015458558.1|DBSCAN-SWA MVKLVLAQPRGFCAGVIRAIEIVDNALDRVGAPVYVRHEIVHNRHVVDTLRAKGAVFVEELSDVPDGAVTVFSAHGVARAVEKEARDRGLPVLDATCPLVSKVHIQGRRYVAAGRTLVLIGHAGHPEVEGTLGQIDGTVHLVGSAEDVAALDIADDSPVAYVTQTTLSVDDTRSVIDALKARFADITGPGTADICYATQNRQTAVRDLCRVVDMLLVVGSANSSNSNRLREIGVELGLPSHLVADGDAIDPAWLEGVERIGLTAGASAPEDLVQGVIAAIRGHVPITVETLDGIEEDLHFRLPPALDRLARRDVVAEEA >NC_020561|1874484:1924690|1884155_1884860_+|WP_107394679.1|DBSCAN-SWA MGLLVLLGVAGLAGLGLWQVQRLAWKEALIARVDARVHAAPVPAPGPAAWAGISAAGDEYRRVALHGRFRHDRETLVQAVTDLGSGYWVLTPLADDRGFTVLVNRGFVPPDRRDPATRAAGNPAGPASVTGLLRITEPRGGFLRGNDPAADRWHSRDVAAIAAARHLRPGPIAPYFIDADAAPNAGGYPVGGLTVVRFPNNHLQYAITWFVMALLLAGAGAVVARQEMRARRAA >NC_020561|1874484:1924690|1888815_1889241_+|WP_015458555.1|DBSCAN-SWA MIGARLLLAALLAAPAADPSGSIEIAVTGVRTAEGRVHVDICPEAHFLKEDCPWSGEAPARIGATVVVVRGVPPGRYAAQGFHDRNGNGKVDRNLIGIPTEGIGFSNDAKIRLGPPKFADAAFDHGPGDQRIAFRLRHMAG >NC_020561|1874484:1924690|1923206_1924364_+|WP_015458588.1|portal|DBSCAN-SWA MGMIERFAAWAGYEKRAGDDPSWAALAPGIGAMAGMSARAAENLSAVLACTGVIASSLASIPALIYRREGDGRTEVSGHPLARITREGVTAAMTWPEFIEHLVASTLLTGNGLAEILRGPGGELSGLRHIPWGMVTVAELSSGRLAYDVSDGRGRTWRLLAGEVIHLRDRTDDGLIGRSRLSRAGDALAGAMAANDFARSFLNNGAQPSGVLEMPGVLTADQFTRLRTQMAERHAGAKKAGNVMILDGGAQWKASQISPEDAELLESRKFAVEEICRIYQVPPPLVQDYSHNTFTNSETAGRWFAMFTLAPWARKIEAEFARSVFPAGSGLEMELDLSGFLRGDPATRWNAHKIAIDAGVLDADEVRQVEGWNPRTKQDKQEVPA >NC_020561|1874484:1924690|1907416_1908292_+|WP_015458571.1|DBSCAN-SWA MESMSALAPPAAAGGAPAPLGSETHKTLFCRTLLDTHDPYRPALIEWPKLDADTQGKITSLPIWDIAVATEGRAGMNVRTFGEAVSDPLLKEAIMMNAFEESRHKLVLADMVQAYGIELAPEPEYRRPRDPEFAFMRTGYSECIDSFFGFGLFDVARRSGFFPPELVDTFEPVMREEGRHILFFVNWVAWWRRNMPWWRRPWFELKVIAVWIVLILERIDMAKGMGSNTKAQENNFTLNGSKELGVEISFPELARICLAENDRRLAPYDRRLIRPRFVPGAIRFVLRFMRS >NC_020561|1874484:1924690|1901681_1902818_-|WP_015458566.1|DBSCAN-SWA MTILAAISLGLWIYLLAGHGGFWRAAETDEAAAPDPANWPAVAAVVPARDEAAVIGRTIAGLLAQDYPGDFRILLVDDGSSDGTARIAREAAAATGRGDRLEIVPGSPPPAGWTGKLWAMEQGIARAGGARWLWLTDADIAHAPDTLRSLVARGEAEGLALCSLMARLSTANAAERALIPAFVLFFQMLYPFARVNRAGSPVAAAAGGCMLARADALARAGGIPAIAAHIIDDCALGRAMKRQGPIRLMLTRRSVSIRPYGGWREIGAMIARSAYAQLRYSPWLLAGTLLGLALVFLVPPLAAAFAAGPGRAMGIAAWALMAIAFQPMLRFYGRSPLWGPLLPAIAAFYAGATALSAVAHARGRGGMWKGRAQARLGA >NC_020561|1874484:1924690|1917506_1917902_+|WP_015458580.1|DBSCAN-SWA MADWPYNTAAWKRLRLAHLTRFPMCEECERVGRLVPANTVDHRHAISDGGAPFPGHDGLASYCPSCHGAKTARGSEAGAIRSSKPRKGCNPDGTPLDPAHPWHGKSLRAGAVRPTPETNTQLVSGDRGRHG >NC_020561|1874484:1924690|1902814_1903819_-|WP_015458567.1|DBSCAN-SWA MLADRGKTILVTGVSGFVGAAVARAFAGAGYAVRGLARETSAPANLRDFPGEIVRGDIRDAAAMRAALRGVDGLAHVAADYRLWAADPEEIVRNNREGTRTVMEAALAAGTPRIVYTSSVATLAPDAGRPADEDRPLDEAAAIGAYKRSKVAAERLVEAMVAARGLPAVIVNPSTPIGPRDVKPTPTGRILIEAARGRMPAFVDTGLNLVHVDDVAEGHVAAFERGRAGRRYILGGEDVSLRTMLTDIAGLTGRRPPRLNLPRGPLYPLAWGAELVSRLSGREPFLTRDALKMSRHAMFYSSARAAAELGYRARPYRDAIGDAIAWFRREGMIA >NC_020561|1874484:1924690|1875067_1875595_-|WP_187294056.1|DBSCAN-SWA MRTSFWAGMAAMLLASMTLTATPAAAQSQYWQCAPFARMISGIQLFGRAADWWAQAAGKYQRGQTPKVGSVLSFRAFRAMPAGHVATVSEVVDARTVKLTHANWSIINGRRGQVERDVTAVDVSEAGDWSKVRVWWGPSRGLGMTAYPTNGFIYAAAEAAKAMMATGAEMVKNDD >NC_020561|1874484:1924690|1920285_1920699_+|WP_015458584.1|DBSCAN-SWA MAIHAPITGALSCATVVSDYPADFWRTLVNLDRLRLEGVIEMAISLLDADDGDPDLEPNGDELDGTGGEDDFCNHSAYMSGPGCPIADPDVGIDDVPRDREAPLRPKYGIDQSAGPINEVQAAREWQEAQRRRYAIG >NC_020561|1874484:1924690|1912088_1913099_-|WP_015458575.1|DBSCAN-SWA MEAAALSLPLLIGLIGVALLFDFLNGLHDAANSIATIVSTRVLRPQYAVVWAAFFNFIAFLFFGLHVAETVGRGIVSAEIVDAHVIFGALMGAISWNLITWGLGIPSSSSHALIGGLLGAGTAKAGLHAVVWSGVFKTAAAIVLSPTIGLVLALVLVLITSWAFVKATPTLADRWFRKLQLVSASLYSLGHGGNDAQKTMGIIAVLLYSQGMLSGGFHVPFWVVISCQAAMGLGTLMGGWKIVHTMGSKITRLTPAQGFCAETGGALTLFGATWLGVPVSTTHTITGAIVGVGSARRMSAVRWNIASRIVIAWVVTLPAAAAIGALFYWLAGLFAW >NC_020561|1874484:1924690|1883053_1883686_+|WP_015458549.1|DBSCAN-SWA MADTSLTMTQSGAPRFHLEEEHHHAEGGSTMLGFWIYLMSDCLIFAILFACYGVLGGNYAAGPSPRDLFDLPLVALNTTMLLFSSITYGFAMLAMEKGAIGRTQGWLAITGLFGAAFLGIELYEFAHLIHEGATPQRSAFLSSFFTLVGTHGLHVTFGIVWLVTLMVQVARRGLIPANRRRLMCLSLFWHFLDVIWIGVFTFVYLMGMLR >NC_020561|1874484:1924690|1886190_1886727_+|WP_015458553.1|DBSCAN-SWA MTGPERLLVIVEDDAAFARTLRRSFERRGYAVLSAASHDALVALLADHDPGYAVVDLKLGGASGLACVQALHAHDPDMRIVVLTGFASIATAVEAIKLGASHYLAKPSNTDDIEAAFDRADGDAATPVDGRQTSIKTLEWEHIHQTLVDTDFNISEAARRLGMHRRTLARKLEKRQLR >NC_020561|1874484:1924690|1884856_1886194_+|WP_015458552.1|DBSCAN-SWA MNAVWDKARMAMAPVEGPASIGVNNMLLLLQLRWIAVLGQLGTIAVVHGVMGIPLPLGPLLLMPALLIAINLLSRPFLRERKHVTNGELFSAFLVDVAALTWQLHQTGGLTNPFAWLFLLQVVLGAILLKPWSSWAIVGVTTLCLAWLMGHYRPLALPPGHQDDLFGLYLQGGLVCFGLIAILLVVFVTQISRNLRERDASLADVRQQAAEENHIVRMGLLASGAAHELGTPLSSLSVILGDWQRMPELAGNPDLAQDIADMQAEVQRCKAIVSGILMSAGEARGVAPAVTTMRRFLDDIVADWRSSRLAGTVDYDDRFGADVPIVSDPALKQVIGNVIDNAAEVSPHWIGIVARREQDLLVLSVSDRGPGFSPDMLNGFGQPYRSSKGRPGGGLGLFLLVNVLRKLGGRAEAQNRPGGGATVTIMLPLSAIAYAPKESSREIAP >NC_020561|1874484:1924690|1877182_1877464_+|WP_015458544.1|DBSCAN-SWA MSDTPPDRLSTNPKSPYFDADALQRGVGIRFKGAEKTNVEEYCVSEGWVRMAVGNSRDRHGNPMTIKASGPVEPYFRDVTEGGESAEASPSDN >NC_020561|1874484:1924690|1893591_1894743_+|WP_015458559.1|DBSCAN-SWA MAAIPFSQAARIGGYVLGRKLRRVERYPLVLMLEPLLRCNLACKGCGKIDYPDEILNQRLSYDQCMAAIDECGAPAVSIAGGEPLLHREMPRIVEGYIARKKFVILCTNALLLKKKIDQYRPSPFFTWSIHLDGDQVMHDRSVCQDGVYEVARDAILLAKSRGFRTQINCTVFDGADPDRLAAFFDDMMAIGLDGITVSPGYAYERAPDQQHFLNREKTRQLFRDVFRHDPKRKWAFTNSPLFLDFLAGNQTYECTPWSMPLRTVFGWQKPCYLLGEGYVQTFRELMDDTDWENYGVGKYEKCADCMVHCGFEGTATTDAVRHPLKFLKAARHIRTEGPMAPDIDLSRQRPAENVYSSHVERELALIRQSQPEAGKHVTAAWR >NC_020561|1874484:1924690|1906442_1907417_+|WP_084653392.1|DBSCAN-SWA MRFPAHAAAPAGTLSRPPAHGPGWAGTEPGEAAPAPHPARRLVITADDFGASIAVNRAVERAHREGVLTATSLMVAGEAATDAIATARKLPMLGVGLHLVLVDGRPTLPPDRVPDLVDGDGRFRANMVRAGVDFFFRPAVRRQLAEEIEAQFIAFGATGLKLDHVNAHKHFHLHPTIAGLIVEIGGRFGLRAVRAPVEPPEPLAEVEPASPGRLADRVAAPWARTLQARFASAGLIVPDQVFGLRWSGHMHVGRLAGLIEHLPPGLTEIYLHPATEAGFAGHAPGYDYEAELAALVDERTRAALRLSGARLGSFTDFEEERVAA >NC_020561|1874484:1924690|1896108_1896744_-|WP_015458561.1|DBSCAN-SWA MTILVATGLKREARIIARDGLAPIPGGGDAAALEAALEAHARSARAIWSMGIAGGLAPDLAVGDWTIGGDPATVEALARRLPEARIGEVHANGTLVAEAAVKRALHAGTGAIIVDMESHVAARVAARHGLPFAFLRVISDTADHDLPHAAWVGMQPGGGMAIGPVTASLARNPGQLPALIRTARDVGAAFRSLERLMARMNEGERFWEQRQ >NC_020561|1874484:1924690|1916070_1916808_+|WP_015458578.1|DBSCAN-SWA MSIRIMSAVWGLKLGDSDKLVLLALADQANDDGVCWPSMASLAAKCSKSDRTVQAAIKSLVDAGHLSRVERPGKGVRYTVHPRSDFTPEAASPPKGTTSTPEAASDKPSRTITLSQKTSSSSKARAKKSAVEPFAVPDWVPADAWNGWLEMRRQEGKRPTPRALELAIEELRKLADAGHPPGAVLDQSTLRQWTGLFPIKDNRNEQHHQSFGSSSHRPQHGSEGVGKTVHAANRAIASLSGSEGG >NC_020561|1874484:1924690|1878262_1879588_-|WP_015458546.1|DBSCAN-SWA MSAEMVGSRTAERDLSEINSRHSHVSPGEIAIGVIIGRTSEFFDFFVYAIASVLVFPKLVFPYVDPLTGTLYSFAIFAVAFIARPLGSLLFMTIDRAYGRGVKLTIALFLLGGSTAAVAFLPSYQQMGATAALLLTLFRFGQGLALGGAWDGLPSLLALNAPQNRRGWYAMIPQLGAPLGLIVASLLFAFFVYVLPAEDFLAWGWRYPFFVAFAINVVALFARLRIVVTPEYARLYENRDLEPAPVMQTVREEGRTIIIGAFAPLASFAMFHMVTVFPLSWVFLYTQNAPDRFLVIEAIAAGFGVVAIIASGLIADRVGRRTLLALSAAAIAAYSGFAPQLLNGGELGELAFMVLGFILLGLAFGQSSGAVSSSFSPANRYTGSALTSDLAWLVGAGFAPLVALLLASNFGLLAAGAYLLSGAVCTLLALWINKELAQTMR >NC_020561|1874484:1924690|1924363_1924690_+|WP_015458589.1|head,tail|DBSCAN-SWA MLATFIAAGVIPTAPAPVTADLVTLAEAKEYCRIDGAAAYEDATLSILIAAASGTVRDYAAGWDGTGEVPARLKLAALELIAAHFDTRGDVPDVHLERILGPYREHNV >NC_020561|1874484:1924690|1900806_1901685_-|WP_051128721.1|DBSCAN-SWA MTALAEQAPRAAGAAALASGKGHKDENFPVASLLIRPEHRAPIMAFYRFARAADDVADHPTASPVEKLARLEAMRAGLTGESDADPAAGALRAALAGRGLDAGHALDLLEAFRRDVTVNRYADWDALIDYCRYSAMPVGRFVLDVHGEDRALWPASDALCAALQVINHLQDCGKDYRAIDRVYLPADRLAAHGATVDMLAAPRAAPALKAAIGEAAGEAEALLARSAGFARAIADRRLGMEVAAIQRLAESLAARLRRRDPLAEKVHHGRMEALMIAGRAALGRMIARGGRA >NC_020561|1874484:1924690|1914370_1915678_+|WP_015458577.1|DBSCAN-SWA MRHIAIIGSGPAGYYTAEACQKQFGDQVSIDIIDRLPVPYGLIRFGVAPDHQSIKAVSQRYEQVSLNPNVRFVGNITVGGEVTVAELIALYDAVILATGAPVDRPLGILGGDLPGVIGSAAFVGWYNGHPDFAGLNPPLDCEGAVVIGNGNVALDVTRILAKAPAEFVGSDIVAHAFEALGNSAIRRVTMVGRRGPHEMQMTPKELGELGHLQRAVPHVDPADLPPVEADAALEPGQRKAMGHLRGFVSLAQDKEVAIDFDFFAKPVAIEGDGRVERLIVERTRPIGDGQVEGTGETYAIPCGLVVSCIGYRTPPIPGVPYDESAGRFANSEGRVPVDGGNLYAVGWARRGPTGTIGTNRPDGYKIAEEIAADHPAPANKAGGKGLDALLAERGADKVSFADWQKIEAAETARARDGAPREKFVVVADMLAARGA >NC_020561|1874484:1924690|1889273_1891865_+|WP_015458556.1|DBSCAN-SWA MPARANGPVARAVDAACRRPMAVLVAALLLALAAGAYAATHFAMTTDSTALISPDVGWRVNERRLDAAFPQNGDAILVVVDGATAELAETASAALADRLAVDRAHFRGVTRPDGGQFFAREGLLFRSQADVAAASARMVEAQPFLGPLAADPSLRGIADALGTMISGVDRGEADIARIDRPMAALADALEAQAAGRPAYFSWQALLGDGDKEGALEAPRRRLILVRPILDYGALQPGMAASDAIRAAARALALDPAHGVTVRLTGSVPLSDEEFSSLADKAWLVAMVMIAAMLGTLWLATRSGRLVAAIMLTTLAGLVVTAAIGLIAVGRFNLISVAFIPLFVGLGVDFGIQIAVRFQAERHGGASPADALRGAATALGAPLLLAAGAVCLGFLAFLPTDYVGIAELGIISGIGMIVALAFSASLLPALILLLRPGRPRAEVGTPALAPADAFLIQRRKLVLGLFGLSMVVSIIALPAVRFDFNPLHLKAPDAEAMATLTDLMHDPDRNPNVIDILAPDLPAARRLAARLEALPEVGRVMTIESFVPEGQAEKLATIADARLLLDLTLDPLEPLPPPSDADTVAALRRTAAALAAHGENPNARRLAKALSTLAGAPPEARGNARAMLVPPLEAMLAQLRAALSAEPVTLADIPADLKRDWLAPKGGVRVQAVPRAAGNDNEALARFTRAVRAIAPDATGVAISTQEGARTVAHAFVHAGLLALAAISLLLFAVLRDLREVAFTLAPVVLSGFLTLGTCVLIGQPINFANIIAFPLLFGVGVAFHIYFVMAWRAGTADLLQSSLARAIFFSALATGTAFGSLWLSSHPGTASMGKILMLSLAWTLVCALIFEPALLGPPRKDQR >NC_020561|1874484:1924690|1909284_1911108_-|WP_015458573.1|DBSCAN-SWA MTPLDRIRNFSIIAHIDHGKSTLADRLIQRTGGLSDREMSAQVLDNMDIEKERGITIKAQTVRLDYTAKDGQSYVLNLMDTPGHVDFAYEVSRSLAACEGALLVVDAAQGVEAQTLANVYQSIEHDHEIVPVINKIDLPAAEPEKVREEIEEVIGLDASGAVLASAKSGIGIDDILEAIVAKIPPPKGDATAPLKAMLVDSWYDPYLGVVILIRVIDGVIRKGQQIKFMQAGTTHLVDRVGCFRPKIEQLGDLGPGEVGFITAQIKEVSQTAVGDTITDAKKPAAEPLPGFKEVQSVVFCGLFPVDANDFEKLRESISKLRLNDASFSFEMETSAALGFGFRCGFLGLLHLEIIQERLTREYDLDLITTAPSVVYRLLLTRSAGEAEARTIELHNPADMPDPNKIEMIEEPWIEATIYVPDEYLGSILKLCQDRRGIQKNLTYVGGRAQVTYELPLNEVVFDFYDRLKSITRGYASFDYHQIGYREGDLVKMSILVNNEPVDALSMIVHRAAAEARGRHMCERLKDLIPRHMFKIPIQAAIGGKVIARETIAALRKDVTAKCYGGDATRKRKLLEKQKEGKKRMREYGSVSIPQEAFIAALRMGDEG >NC_020561|1874484:1924690|1920789_1921290_+|WP_015458585.1|DBSCAN-SWA MNQLTVPRVRFRHAAAAVGVTKKTLRNWLVRGQVQIETESEGWNEFSLIDLAELTLTAELVRYGVGILPASMAARSQISLSTHLLASYKNTPAQAFMFAFRGARLFMQSPRAGLTHFKHARDNEGAPADWAGASLLTIDIQTLIEEMLDRLAAAMGADDGADEGDE >NC_020561|1874484:1924690|1905003_1906428_+|WP_041864858.1|DBSCAN-SWA MRSLFLQAPSFDGYDGGAGARYQMKREVRSFWYPTWLAQPAALVEGSKLIDAPAHDLSFDDIKHEAYARDLVILHTSTPSFRQDVKTAEMLKALNPDLKIGLIGAKVAVQAQESLAASEAIDFVARNEFDFTIKDVADGQNWASIKGISYRNAQGVIVHNDDRPVLEDMDALPFVSPIYKRDLVIEKYFGGYLKHPYVSFYTGRGCKSRCTFCLWPQTVGGHNYRTRSIGHVIEEVKYVMREMPQVKEIFFDDDTLTDNAPRVEALARELGKLGVTWSCNAKANVPYDTLKVMKDNGLRLLLVGYESGNQKILHNIKKGLRVDVARQFTKDCHALGIVIHGTFILGLPGETKETIEETIRYAQEINPHTIQVSLAAPYPGTFLYRQATENGWFDGTDHLLTDHGNQIAQLSYPHLNSTEIFASVEDFYKRFYFRPRKIGAIVGEMIRDQDMMKRRLREGVEFFRFLRQRKEAVA >NC_020561|1874484:1924690|1908315_1909272_+|WP_051128819.1|DBSCAN-SWA MAATAIGLAIALWAIGRAGLGDIMAAAGRLGIGGFLLLIACSFAVLGLLGAAWLTAMPDAPFRRLPLFTWARTTREGASDLLPFSQIGGIVVGAWTLIGRGLPATRVYASIIVDLTTEMAAQLLFTLFGLWMLGAILLDADAMRSLRTLALIGAGVAVAVTIAFALLQVPALRFLAFLARRMLPRAEVAVDAVVAELTRYYRVRRAILASFFFNLLAWAGSAASAWLTLRLMGEHQTIWHIIALESLIFALRSAAFVVPGAIGIQEAGYILLGPIFGIGPEAAVALSLVKRARDIAIGVPALLIWQMGGVRSGLRKSA >NC_020561|1874484:1924690|1917987_1919487_+|WP_144062130.1|terminase|DBSCAN-SWA MPPEEQVLAFLRSLPIVSGLKAGEKMELLEFQERFVRAVYGPADDEGRRLVRLAALSVGRGNGKSALLAGLSLAHLLGPMAEPHGECYAAALDREQAGVLYRMVRGYIEETPWMAAAVNIRDWHKSIEVETTRSTWTALTSDARKAHGLAPSFWIADEVAQWRSRELWDNLATGMGKRKHALGVTISTQAADDLHFFSEMLDADPDPSIYVQLHAAAKECALDDREAWAAANPALGAFRDEREFELASERASRMPSFEPAFRLLYLNQRIAAEGRFLNPLDWDANGDPFDPAELEGKRCYGGLDLSSTRDLTALALWFPDEGKLLAWHFVPADTLRERVERDRVPYDRWAAEGWMETTVGRATDRTAVARRLADIRQMYDVQGIAFDRWRFEDLGKLLSDEGIELPLKEFVPGFKSYAPAVDAFERAVLEKRMQHNGSPILRWQAGNVIVEKDPAGNRKPTKAKSRDKIDGIVSAIMACGLAATDEGPAVYRGAGLVWI >NC_020561|1874484:1924690|1919700_1920009_-|WP_084673637.1|DBSCAN-SWA MKQYPISVPRDVYRGANDARRRKQRDYNDAAASVENYINNLLAGEPDDAIYGVGTNEVARKLGLEDELAQRIVFSIDGGHNGATFYKGDYDRAMLKMRGEDV >NC_020561|1874484:1924690|1881045_1883049_+|WP_015458548.1|DBSCAN-SWA MENQYAALSPIFGRLSLESLPLHEPILVATFAAVALGGIALVGALTYFRLWGYLWKEWFTTVDHKRIGIMYMILGLVMLLRGFADAVMMRLQQAMAFNGSEGYLTAHHYDQVFTAHGVIMIFFVAMPFVTGLMNYVVPLQIGARDVSFPYLNNFSFWMTTAGAVLVMFSLFIGEFARTGWLAYPPLSNIGYSPDVGVDYYIWALQIAGVGTLLSGVNLVATIVKMRAPGMSMMKMPVFTWTALCTNVLIVAAFPVLTAVMALLSLDRYVGTNFFTNDFGGSPMMYVNLIWIWGHPEVYILILPLFGVFSEVTSTFTGKRLFGYTSMVYATVVITILSYIVWLHHFFTMGSGASVNSFFGITTMVISIPTGAKLFNWLFTMYRGRIRFELPMMWTVAFMLTFVVGGMTGVLLAVPPADFVLHNSLFLIAHFHNVIIGGVLFGLFAAINYWFPKAFGFRLDPFWGKVSFWAWVVGFWLAFMPLYVLGLMGVTRRMRVFDDPSLQIWFVIAAFGAALIAIGIAAMLVQFAVSYLKRDQLRDTTGDPWNGRTLEWSTSSPPPDYNFAFTPVIHDLDAWYDMKSRGYVRPTGGYRPIHMPKNTGTGVILAALSLACGFGLVWYIWWLAALSFAGVLAVAIGHSFNYKRDFYIPAETVEKTEDERTRLLAAGA >NC_020561|1874484:1924690|1894771_1896109_-|WP_144062128.1|DBSCAN-SWA MHWKHWRPAAALFMTMATGAAVHAQDAPETLFRDVRVFDGQSDRLSPPTNVAVKGNVITAIGPAAKAGAGARVIEGRGRTLMPGLIDVHVHLTFGALDMAKLMSPDLTPQAAEAAAADQAKAMLLRGFTAVRDMGGPVFGLKAGIDRGKYQGPRIWPSGAVISQTSGHGDFRLPTERSRRFFGKPSRAEELGATFIADGRDEVLAATRENLRFGASQIKLMAGGGSSSDYDPIDVTQYLPEELSAAVAAAEDWGTYVAVHAYTSRAVRRALDAGVKSIEHGQLLDEATIKLIADRGAWLSLQNLPPPLPNDPPDRVAKKMAVREGSDHAWNWAKKYHVKLAWGTDFLFRPEENREQNAYILRLKQWFTPAEILRMVTHDNAQLLALSGERAPYKGKLGVVEAGALADLILVNGDPLANIDLIGDPDRNFAVVMKDGVIYRADGVK >NC_020561|1874484:1924690|1891855_1892449_-|WP_015458557.1|DBSCAN-SWA MFRPAFLLALAAMPLAPVAVRAQAADPAARVAAYNDQVVAIMKARLPLGQRTDRFEAAVRAYYDMPAITALVIGPKWAASPAADRQAAIAALTRHSALSLARNFASYGGERFIVDPDVTVRDSSRIVKVTIRTGTSGGDTLLYRMRQSGGEWRIIDVIAQGVSQLAVQRADLARTVATDGVAGVARRLRAIDGGAQR >NC_020561|1874484:1924690|1899976_1900810_-|WP_015458564.1|DBSCAN-SWA MTATSLATQGRAAGSSFYAGMRVLPRPEREAMYAIYAFCREVDDIADDQRGDRAARAAALEAWRGDIARLYAGGDPGQAHYLAPAVRRFDLARADFDAVIAGMAMDVAGDICCPDAADLDLYCDRVASAVGRLSVRVFGMEEAPGLALAHHLGRALQLTNILRDVDEDAAIGRVYLPAEALAAAGLPLGDIAAITADPRIDRACRAVAAEARQHFDAARRILAARPRGHLIAPRLMAGAYGVLLDRMEAAGWAPPRRRVRHNRLAILGMMIRLRLLR >NC_020561|1874484:1924690|1883682_1884078_+|WP_015458550.1|DBSCAN-SWA MSTDAHGDHAGHHEDHGHGDAHGHGTLRDYVTGFILAAILTAIPFWLVMTDALGDNQLTALVIMGFAVAQVVVHMIYFLHMNARSEGGWTIMALIFTIVLVVIALTGSLWVMYHLNTNMMPMSPHDMSQMP |
48 | Pseudomonas_phage(18.18%) | head,portal,capsid,tail,protease,terminase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2554304 : 2607859
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_020561|2554304:2607859|DBSCAN-SWA GATGTTGATTGTGGAGACTATTGCCAAGATACGCAGGGAGCACAGGGACGGTAAGCCGATCAAAGAGATTGCGCGTGATTTACGGTTGTCGCGCAACACGGTGCGCAAGGCGATCCGTGCTCCGGAGGCGGATTTCAGCTACGAGAGGAAGGAGCAGCATCGTCCGCAGACCGGTCCATTTCGCGAACGGTTAGATGAGTTGCTGGCGGAGAACGAAGAGCGCCCCCGGCGCGAGCGACTGCGGCTGACGCGGATTCATGATCTGCTGGAACGTGAGGGGTTCACCGGCTCCTACGATGCGGTGCGGCGCTATGCGGCCCGCTGGAAGCAGGAGCGCCACGCCGGTGGCAGCGGGGATATGAGCAAGGTGTTCATCCCGCTCATGTTCCGGCCTGGCGAGGCCTACCAGTTTGATTGGAGCCACGAGGACGTGGAGATCGCCGGCAAGCCGATGCGGGTTAAGGTGGCGCATATGCGGCTATGCTGGTCGCGGGCGCCCTTTGTGCGGGCTTATCCGCGTGAGACCCAGGAGATGGTGTTTGACGCCCATGCCAGGGGCTTTGCTTTTCTCGGCGGGGTGCCGACGCGCGGCATCTACGACAACATGAAGACCGCGGTGACGACGGTGTTCACTGGCAAGGAGCGGGTGTTCAACCGGCGCTTCCTGATCATGACGGATCATTATGGCGTTGAGCCGGTGGCCTGTAGCCCGGCGGCAGGCTGGGAGAAGGGACAGGTCGAGAACCAGGTCCAGACCGGCAGGGAACGGCTGTTCAAGCCACGTCTGCGGTTTGCCAGCATGGAAGAGTTGAACGCATGGCTGGAGGCCGAGTGTCGCCGATGGGCCGAGCGCTATGCCCATCCGGATATGGAAGATATGACCATCGCCCAGGCACTGGAGATGGAACGACCCTCCCTACAGCCGCTCACCACGCCTTTTGACGGCTTCTTCGAGAGCGAACATGTGGCGAGCTCGACCTGCCTCGTCAGCTTCGATCGCAACCGTTACTCGGTCATGGCCGTTGCTGCCCGGCATGCGGTGCAACTGCGCGCCTATGCCGACCGGGTCGTCATCCGTTGTGCCGGCAAGGTGGTCGCCGAGCATGCCCGCCTGTTCGGCCGCAATCAGACGAAGTTCGATCCCTGGCACTATCTGCCGGTCCTGATCCGCAAGCCAGGCGCATTGCGCAACGGCGCTCCCTTCCAGGACTGGGATCTTCCGCCGGCCCTGGCCCAGCTGCGCCGCAAGCTGGGCAAAAGCGATGACGCAGACCGACGCTTTGTACGGGTACTGGCAGCGGTGCCCGAGGATGGCCTGGAGGCAGTCGAAGCTGCCGTGCGCGAAGCCATGGCGGCGGGCACGGCCAATGACGAGGTCATTCTCAACATCCTGTCGCGCCGACGCGAACCACAGCCTGTGCAGGCGATCAATGTTGTCGTCGATCTCAGGCTCAAGCATCCGCCCATTGCCGATTGCGCGCGCTACGATACGGTGCGAGGCCTCAATGCAGCGGCATGAGATGTTGGCAGCCCTCAAGGGGCTGGGCCTGAAGGGCATGATCGCCGCGTTCGACGATGCCGTCACCAATGGCATCCGCCGTGACCGGACCGCCATGGAGATGCTTGGCGATCTGCTACGCGCCGAAACGGCCCACCGTGAAGCCGCCTCGATCCGGTATCGCATGACTGCGGCCAGGCTGCCGGCCATCAAGGATCTCGACGGCTTTGTCTTCGCCGACACACCGATCAACGAGAGCCTGGTGCGTTCGCTCCATGCCGGCTCGTTCCTGCCGGAACGGCGCAATATCGTGCTGGTTGGTGGCACCGGCACCGGCAAGACGCATCTCGCGCTCGCCATCACCGCTGCGGTGGTCCGCGCCGGGGCCAGGGGCCGGTTCTTCAATACCGTCGATCTGGTCAATCGTCTGGAGGAAGAAACCCGGCAGGCCAAGGCCGGCAGCCTGGCCGCCCAGATGGCCCGCCTGGACGTCGTGGTTCTGGACGAGCTCGGGTATCTGCCGTTCGCCCGGTCAGGAGGCCAGATGCTGTTCCATCTGATCAGCAAACTCTACGAAAAGACCTCGGTGATCATCACCACCAATCTCGCCTTCGGCGAATGGCCTAGCGTCTTCCAGGATGCCAAAATGACGACGGCGCTGCTGGACCGTGTCACGCATCATTGCGACATCATCGAAACCGGCAACGACAGCTGGCGGTTCAAAAACCGAAGCTAAGAGACGGGAAAAACCCCTTCCCCCGGACCCCCATCCCCGGCGGTTACCCCGGTGGGTCCACAGGTACCTCCCGCGGACGCCGCCGCCAGCGCTCCTGACGGCGCGCGTCGGCGTCCGCGGGTCCCTCCTATGGACTACCGGGATAACCGCCGGAACTCACGAAAAAGGGGGTCAATGTTGGGCGCCGATAGGGGGTCAACTTTGCGCGCCGGTTGACACGCTTGACGACGGGCGCGTCCTTTCTCCAAGCAACTTTGCCCGTGGCAACGGCGCCCAGAAATGCTTCATCATTTACAGCTTCAGCTGTGCCGCGGATCGCGAATAGTTCACGAAGCCCCGCAAGATCGAGGATACGTTCGCTGAGCAGTATCTGCGCCACCTCGGCAGCCTTCGTGCTCTGATCTCGAAAGATGAAGGCCTCATCGTCGATGCGCATCAGTTTCTCGAATATCTCGAGATCTTTCGCTGGATCGTCAGTTGCCGGCAACAGCGCGCCCAGGACGCAGGCCCTAACCAGAACGAGTGGTTTACGTCCCTTCCAGTATGATCCCAACGCTGTCAGGGTCTGTGACTGAACGGCCTTGCGTTCCTTCTGCGCTTCTACGGAGATCTTCTGAGCCGGCCAAAGGCGCTCAATCAAGGCAGGCGCGTCTTGAAGCGAAAGTGGCTGGACCGTCGCGGCAATGGTCTGGTTCGTCATCACAGGTCCTCGAAACTGATCTGTCGCTTGTTGCCGGCTTCGACAGCGCCGGCGCCCTCCGTGAGTCCGATGCGGATCGCCTTGCGCCATCCCTTGTCGATGTCCTCGGCCCGCCCGGTGGCGTGGACGGCCATCGAATAGAGCCACCAGCGCTCTTCGGGACGAAGACCGGTCCATGCGTCGATGGCGGCGGGGATAAGGCTGGCCTCGCAGCTCTCGACCGCCCAGACCAGTACGCAGAGCTCACGGCCGAGAAGCCGGTGAACGTCATTGTCACCCGACGTCCACTTGCCGGATCCCATCTTACGACCCTTCAGACGCTCGTTGAAGGCAGTCCGCACTGCTGCGGAGATCTGCCGCCAACGCTTCTGGTCGAGACGGCACCGAACGACGCGATCCGAACGCATGTTAGTCTGCGGATCATACTGCGGACCCGTGATGCCGAAGTCCTCGATGATCTCGACGAGGTCGCGACCGTTGGCCGGGATGCGGACGACGAAGCGCTGGGGATCGATCTCGTTGGTCGGCACCCCATAGCCGATCGTCGCCATGCTCTTGCTGGCCATGCTCATTTTCCTTTCGTGACACCCGGACCGAAATCCGGCTCACGCCTGTTCGATGTCGGTTCCTGTGTAGATTTCCTCGATCGCGCCCATGAACGCGTCGATCTCGGCGGCGCGATCGACGGATGCCGACTTCCACTGCACCTTCACCTCGGCGAGCTCGTTCCCGAGCTGAGCGCGGGCGAACTTCGCCACTTCCTCGATGGCACCGGGCTTCACGGGCACGTCGCCGCTGAAGCTGGTGCTGACGTTGTTTCCGCCGGAGCCGACGATGACGACGACGCTGCGGAACGTGATGCCCGCCTCTTTGGCCTTCGTGACGGCGGCGAAGACCTTCTCTGTCGTGCCCAGGCTGATCGGCTTGCGAATCGTGGCAGGCTTTTCGCCGTCGATCATCTGACCGGCGCCGTCGCGCCGATTCACGCGGAATTCCTTCCGAATGACGACGCCTTCCGCTTCCGCGTGAGCATAGATCGTGACGTCAGTCTTGCCGTCTAATTTGATGGGACCGTCGTACACAGCTCCCTCGGCAGGGTTCGTGCCGTCGATGTTCCAGCGGATCGTGCCGGTGGGCTTCACCGTCAGCTCAACCTCGTAGCCGCCGCCGACGGGCTTCGGTTCGTGGGTCAGATTGAGGCTATTCAGCCATTCCGTGGCCGCGCCCTTCTGGTGGTTGCCGTCCGGATCGACCGCTAAGAAGAAGCGCTTCATATCGGTCGTGCGCACGGTGGTGTCGTCCAGAACCGTGGAGCTTTCCGACACGTCCGACGAGTTCGACCAATAGATGACGGGGTTGGATCCAGCGTTCTTCGGGCTGATGAACAGCGTCGCCTCGCCGGTCTTCTCATCGTAGTTGAGGACGTCGACGTTGACGCTCGTCTTCTCCTTCTCGAACGGACCCTTCTCGATCCAGCCGGTGCCGCTGTCCCGCCAGTCGCCCCGCGACAGGGCGATCTTGCGGAGCTCGTCCAGCCCGCCGCGGGGAAGCCACAGCCAGCGCACGTTAGAGGCTGCGCGATCCTGCACGTCGGCCCAGCGGGCACGCCGCTCGCTTGCGGGCCACAGCATGTCCTGCGCGCGATCCCGTAGCTTGGCGTAGTTCTCCTCCACGGAGAGGATCAGCTTCTGCGAGTTGACGTCGGTCAGCGCGCCTTCGGTCGCAGCCTCGCCGTCATAGCCGGCGCCATCCGTCCGGCGATGCGCGTCGAGCTTCAGCGCGGTGCTGACGAGACCGTCGGTGCCGCCCGGGACGCGCATCGGGAACCAGATCTTGTTCAGCGTCGAGGCGATCGTGGCGTACACGTCGATCTCGGCGTCCGTCTGGCGCTTCTCCAGATCGGCTCGGTTCTGGCTGTTGGGTCCCTCCGCTTGGATAACCTTTTCCACGGCATAGGCCGTGCGGACCTTCTCCTCGACACTGGCGAGGCTGTGGTCGTCGCCACTGACGATCGCGAAGTTGTTCTTCTCGGTCGTGCCCTCGAACAGCATCCGCGCCTTCTCGGGCGGGATCTGCGCGTCGGGTGACAGGACGATGAGGAGACGCTCCCCCTTGGTGTCGATGTCCTCGAGCTTCGGAAGAGCGAGGACGCGGCCGTAGGCGTTCTTCCGCTTGGGGGTGAAGAGCTCGATCAGGCGGCGTGCGAGATCCGCATCGATCTTCGGCTGGGCGGTCGTACGTGCGGTGTTCTCGATCTTCTTGGTCAGATTCTCATTCTTGGAGAAGATCCAAGCGCCGTTCTCGCGCTTGTGGAGGTACCAGGAGCGCTGCACCAGCGCGTCGAAGCCAGCGATGATGTCGCTCTCAGGCCGGCCGGGCGCGATGAGGTAGGCGCGGATCGTGTCTTCCTGCAGGCCCTTCACGGCGTCGTTGGCCTCGGCGAGCGATGCCATCAGGATGAGGCGGGCCGCCTGGCTGGCTGCGTCGTTTCCGGCGTTGGCGTCGATCGCCTCGGCATGTGCGTCGCTCGCACCTGTGTCGACGATGTCCTGCGTGATCGCGGCATCGAGATTGTAGATGCTGTTGACGAAGTCCCGCGTTTCGCGCTCCGCAAGGTCGAGATGCTGCGGGCCGAGCAGGTAGACGTCGTTGTACTGCCGGCCTTGGGCGGCTTTGATCATGAGCGCGGCGATGCGCATCAAGCCGCGCGTCTGGCGGAAGTTTTCGTTGTCCTTGAACGTGGCGACCACGGTCTTGAGTCCGGGATGGAAGGGGTACGTCGCCGAAACCTCGTCGGCGATCTTCTCCGCTGACTTGGTGATCATGTTCGCCGTGACCGCGTCGGACAGCAGCTGGCCGTATGTCTCGGCAACGGTGGTGATCAGGCCTCCATCCGGTTCCTCGGCAAGCAGACGCTTGCGAAGGATGCGGTAGATCTCGTCGGATCCGAGCTCCACCGGAGTGATCGGCTTGGTCTGGCGGGTCGCCTCGGCCGAAATCTGCGAGATGATCTTCGAGAGCTCGCCGCTTGCCTTGTACTGGCCAACGAGCGTCGAGAGCACGATGCACAGGCGGGGCAGTTTCGTGGCGGCCGAAAGCAGGTTGGCCAGCGCGTGGGAGGTGACGTCGCCGAGATGGCCGCCGCCGACCGAGATGGTGACGGCGTTCTCCAGATACGGAGGCAGCTCATCGATCAAGATGAGCGTCGGTTCGTCACCGAACAATGCCCGCCAATCATCCTCGTTCGGGGCCTTGGCGCCGTTACTCCAGAACTTGGAGAACAGCTCCTCCTTGCCGAGCTGGCGCGCGACCGAACCCCACAGATAGACGTCGTGCGGGATGTTGCGCCCGGAGATTACAACCACCTTGGCCGGGACGGGAGCGAAGCCCTTCACGATCGCGGGATCGACGTCGGCAGCCAGTTGGGGATGAGCGGCTAGGTAGCCTGCAGCGAGCATCGAGTGCGTCTTGCCGCCACCCATCGCCTGCCGCAGCTCGAAGGCCGCCTGGTCATTGAGCCCGGCAAGCCGCTGCATCGTCATGCGCAGAAGCGTTGCCATGCCTTCCGTGACGTACGTCTTGTCGAAGAAGGTGCGGGCGTCCGCCTCCGAGTGACCAACGAGGTCCGCAAGATCCTCGATCTGCTTGGTGAGGGCGACCTCAATCGCATCCTGCTTGAACTTGCAGACGTCGAGCACAGTCTGGAGCATGTTATGGACCTTCGTTCAGCTGGCGGTTGGCGTGGCGGTATCGCTGGGAGCGGTTGAGGCGAGATGCTTGCGGACGGCGTCTTCCAGATCGACGATCCGGTCGCGCTTCGAGATGATGACCTTCAGCAATCTCGTTCCGACGTCATCGGGACAGGGGTTGGCGGAGGAGAGCCAGTAGCGCACCGCGCGATCCGACACGCCGAGGTCAGCGGCGATGGGCGCCTGCCATCGCTCGCCGTACAGTGCCTCTCCGACCAGGCGGAGCAGATCAAGGTTCGAATTTTCCCCCATGCTTCCGGGGTTCTAGGAAGCGAAAAGGCTTTGGCCAACCGGTTTTTCGTGAGGAGGCCAATCCATCTACACTCTTCCACCGTGAGAACGAGTAGCCTGAAGCGGATTCGCCCTATGCGTGTAACCACGCGTAAGGTACAAACCCGGAACGGTGGCGCTGGGGGAGGTAGTTGTCTTGGCCGATTTCACGCATTTGGAAGATCAGATCCGCGATTGCTATGGCCGTGTCGTCTATACCCACAAGACACACGAGAAGATGGCCGACCGCTGCAGTCGGACGCTGCGGCGGTTCAAGATTGCACAGATCGTACTTACTGCGGTCACGGCGTCCGGCGCGTTCTCGGTCGTCTTTCTTGACGACACCTTGCTGAAGATCGCGACTGCGGTGGCGTCGATCGCGAGCCTCATCATCTCGGGATACATGAAGGGATTCGATCCCGGCGCCACGGCGCAGAAGCACAGGGATGCGGCCGCAAGCATGTGGCCAATCCGGGAGTCATATCTTTCCCTCCTCACCGACCTTCGCATGAAGCGGATCTCGGACGATGAAGCAGTCAAGGAGCGTGACGCGCTTCAGGCCAAGCTGGCCGCCATCTACAGGGGCGCGCCACAGACCACGGGTGACGCCTACACCGATGCCCAGGACGCACTGAAGAACAAGGAAGATCTCACCTTCTCCGATGCCGAGATCGACTGCTTCCTCCCAACCTCGCTTCGGAAGACCGCCGCCTGATGGCGCAGCCGAGCCGAACCGGGGAGGGGCTGATTCAGGATCTGGCCGAGGTTCTGGAAATCCCGGTTTCGCGCTACGAGTCAGCCGACCGCAGTTACAGGTCGGTCAGCCGGTGGCTCGATCGGCCGGAATCCCGGTTCGCCCACGTTGATCTGGACGTATACACGCAGGGCTCGTTCCGGCTGGGTACTGCCATTCGGCCGCTGAATGGCGAAGAGCACTACGATCTCGACATCGTCTGCGAGTTCCGCATCAGCAAGGCGCACACGACACAGAAGCAGCTGCACGACGATCTCGGGCATGAGCTCGAGTTGTATGCGGCTAAGCATGGGATGCAGGAGCCGTCACCCTGGCAACGATGCTGGACATTGAACTACGCGGACGAAGCCCAATTCCACATGGATGTGCTGCCATCCGTCCCTGACGCTCAACGGCAGAGGGTGCTGCGAGAGGCGCGATCTCTCCCGCTCGACTACGTGGACCAGTCGGTGTCGATCACCGACACCGAACACGCGAACTTCCGACGACTTTCCGACGAATGGCCGGCCAGCAATCCCAACGGATATGCTGACTGGTTTCAGTCCCGCATGAAGCCGGCTTTCGAGAGTCGCCGAAAGGCGATTATGCTGGCCGAAGCGAAGGCGGACGTGGCTGACATCCCCGTTTACCGGGTGAAGACGCCGCTCCAGTCGGCCATCCAGATCCTGAAGCGTCATCGAGACATGCGCTTCGCCGACGAACCCGAGCGCCGCCCGACGTCGATCGTCATCACCACGCTCGCGGCGCATGCCTATCAGCAGGAGACGACCATCTCGGGCGCGCTCCTGAGCATTCTCCAGCGCATGGACAGCTACATCGTCCAGCAGGACAACGGGTACTGGATCGCCAACCCGTCGGATCCGCGCGAGAACTTCGCGGACGCGTGGAACGAGGATGCGGATCGCCGCGACGCATTCTATGACTGGCTGGAGACGGCGCGCGCCGACTTCAGCCTGGCCGCCAGCCAGCAGGATCCCTCTGCGTTCGTGGACGCTTTGGCGCCTCGCATCGGCCGAAGCCTGGTGGAGGCAGCCGTTGCGCGACGGAGCCGTCCGTCGACAGGTGGAATGTCGTTCGTCAGCAGGGCCAGCCGGGCGCTGCAGCGTGTGATCGACGCTCCGCACCGCAAGCCTGCAACATGGCCCACTGTGCGCTCGGGCACCGTCGAGATCATCGTGGCGACCGCGATGCGGGATGGTTTCCGTCCGCGGATATTCACGAGCGACGATGCGCCTGTTCCACCCGGTGCCAAGCTGCGCTTCGACGCGCGGACCGATGTGCCTCGCCCGTTTCGCGTCTACTGGCAGGTGGTGAACACCGGCGCGGCAGCCACCGCGGCGCGGAACCTGCGGGGGGGATTTGACGAAGTCACTGTGGTGCCCGGTGTCCTGTCGAGAACCGAGGACGCCAAGTATCCCGGATCCCACAGCATCGAGTGCTTCGTCGTGAAGGATGGCTATCTGGCGGCTCGAAGCGGGCCGTTTCTCGTGAACATCGGATAGAAGACGGAAGTGGATCGGGGATGGGTGTCGGCGAGGATTTTTCGCGCTTCAAGGACGGCTACAACATCGATGCGTCGACGATGGCTTCGATCAGCTATCGCTATCGGCGCATAACCAGGCAGCTCAACAAGGACTTCTGGAACACCGAGTCCGAAACGACCCATAGTCTCTACGTGGGATCGTACGGTCGCGACACCGCAGCCCGCGGCCTCAGTGATCTCGACGTCGGGTTCGTTCTCCCCAACTCCCTATACCACCAGTACAACGCCCATCTGGGGAACGGCCAATCGGCACTTCTGCAGGCCGTCAAGCGCTCGATCCAGAAGACCTATACGACGTCGGAGAGCTTCGGGGACGGGCAGGTCGTGGTGGTCAGCTTCACCGACGGCATCACGTTCGAGATCCTGCCCGCCTTCGACAACTCCGATGGGGACAGCTGGACTTATCCCAACGCGAACGGCGGCGGCTCGTGGAAGACCTGCAATCCTCGTGCGGAAATGCGGGCGGTGGACACGCGAAGCCTCCTCACCAACAGGAACCTGAAATATCTGTGCCGCATGATGCGGGTGTGGCGTGACGTGCATTCCGTTCCGATGAGCGGAATGCTCATCGACACCCTGGCGTATCAGTTCATCGAAGGATGGGCGCACCGCGACAAGTCGTTCTTCTATCATGACTACATGGCGCGCGACTTCTTTCTCTACCTATCCCAGCGTGACACCACCCAGTCGTATTGGCGAGCGCCCGGCAGCGGGGCGCTGGTCGCTCGCAAGGGTGCCTTCGAGCGGAAGGCTGCGAGCGCGTATGCGAAGGCACTCGAAGCAATCACCTATGACGCCAACGGACATGATTGGTCACGTCGACAGAAGTGGCGCGAGATCTTCGGCACCCTCTTTCCGGCGTAGTGCTTTACAGATTGCCGATGCGATCTTGCCTCCCCTCCTTCACCACCTTTCACCGAAAAGGTGCTGAACGTGAGAAAGCGATGCCGGGACCGTCGGGTCTGAGCAGGTGACGCCCAGGTAGGTGGCAGCACGGGTCGCGCCGACATAGATGTACTTGTCGAAGAGATCAGGCTCTCGTTGTTCCAGCTTGTCGACATCCACGAAGAAGACGGCCTCGAACTCCAATCCTTTGATATGCTCCACCTCGAACACTCGAACATCGTTTTCCTGCCCCATCACCTGACCATCCGGACAAGCTACCGCGCGGATGTTTCTCGACTCGAGCCTTTCGCTCAATCCGGCAGCGATGGGCGCCATCGAGGCGCTCTCATTCACCAGGATCGCGATAGACGGTAGTTGGCCCGAGAAATCTTCTATTTCGCAGATCCTGTCGCCCAGCCACTCAATCAGCTCGGCCGAACTGGCCAGATCGGCCCCGAAGACTGGCTTCACACCTTCGTTCTCAAGGTGTTCGGGCATCCTCGTTGTGGAGCGGCGTGCAAAATTGACCCCCTTAGCGGGGTGATCGGCGTCTAAAATTGACCCCCATCATTCCAGAGTGTGAGGCGTCGCGGCTTGGGCTTCCAGGCGGCGAGGACGGGGATGTTGATTGTGGAGACTATTGCCAAGATACGCAGGGAGCACAGGGACGGTAAGCCGATCAAAGAGATTGCGCGTGATTTACGGTTGTCGCGCAACACGGTGCGCAAGGCGATCCGTGCTCCGGAGGCGGATTTCAGCTACGAGAGGAAGGAGCAGCATCGTCCGCAGACCGGTCCATTTCGCGAACGGTTAGATGAGTTGCTGGCGGAGAACGAAGAGCGCCCCCGGCGCGAGCGACTGCGGCTGACGCGGATTCATGATCTGCTGGAACGTGAGGGGTTCACCGGCTCCTACGATGCGGTGCGGCGCTATGCGGCCCGCTGGAAGCAGGAGCGCCACGCCGGTGGCAGCGGGGATATGAGCAAGGTGTTCATCCCGCTCATGTTCCGGCCTGGCGAGGCCTACCAGTTTGATTGGAGCCACGAGGACGTGGAGATCGCCGGCAAGCCGATGCGGGTTAAGGTGGCGCATATGCGGCTATGCTGGTCGCGGGCGCCCTTTGTGCGGGCTTATCCGCGTGAGACCCAGGAGATGGTGTTTGACGCCCATGCCAGGGGCTTTGCTTTTCTCGGCGGGGTGCCGACGCGCGGCATCTACGACAACATGAAGACCGCGGTGACGACGGTGTTCACTGGCAAGGAGCGGGTGTTCAACCGGCGCTTCCTGATCATGACGGATCATTATGGCGTTGAGCCGGTGGCCTGTAGCCCGGCGGCAGGCTGGGAGAAGGGACAGGTCGAGAACCAGGTCCAGACCGGCAGGGAACGGCTGTTCAAGCCACGTCTGCGGTTTGCCAGCATGGAAGAGTTGAACGCATGGCTGGAGGCCGAGTGTCGCCGATGGGCCGAGCGCTATGCCCATCCGGATATGGAAGATATGACCATCGCCCAGGCACTGGAGATGGAACGACCCTCCCTACAGCCGCTCACCACGCCTTTTGACGGCTTCTTCGAGAGCGAACATGTGGCGAGCTCGACCTGCCTCGTCAGCTTCGATCGCAACCGTTACTCGGTCATGGCCGTTGCTGCCCGGCATGCGGTGCAACTGCGCGCCTATGCCGACCGGGTCGTCATCCGTTGTGCCGGCAAGGTGGTCGCCGAGCATGCCCGCCTGTTCGGCCGCAATCAGACGAAGTTCGATCCCTGGCACTATCTGCCGGTCCTGATCCGCAAGCCAGGCGCATTGCGCAACGGCGCTCCCTTCCAGGACTGGGATCTTCCGCCGGCCCTGGCCCAGCTGCGCCGCAAGCTGGGCAAAAGCGATGACGCAGACCGACGCTTTGTACGGGTACTGGCAGCGGTGCCCGAGGATGGCCTGGAGGCAGTCGAAGCTGCCGTGCGCGAAGCCATGGCGGCGGGCACGGCCAATGACGAGGTCATTCTCAACATCCTGTCGCGCCGACGCGAACCACAGCCTGTGCAGGCGATCAATGTTGTCGTCGATCTCAGGCTCAAGCATCCGCCCATTGCCGATTGCGCGCGCTACGATACGGTGCGAGGCCTCAATGCAGCGGCATGAGATGTTGGCAGCCCTCAAGGGGCTGGGCCTGAAGGGCATGATCGCCGCGTTCGACGATGCCGTCACCAATGGCATCCGCCGTGACCGGACCGCCATGGAGATGCTTGGCGATCTGCTACGCGCCGAAACGGCCCACCGTGAAGCCGCCTCGATCCGGTATCGCATGACTGCGGCCAGGCTGCCGGCCATCAAGGATCTCGACGGCTTTGTCTTCGCCGACACACCGATCAACGAGAGCCTGGTGCGTTCGCTCCATGCCGGCTCGTTCCTGCCGGAACGGCGCAATATCGTGCTGGTTGGTGGCACCGGCACCGGCAAGACGCATCTCGCGCTCGCCATCACCGCTGCGGTGGTCCGCGCCGGGGCCAGGGGCCGGTTCTTCAATACCGTCGATCTGGTCAATCGTCTGGAGGAAGAAACCCGGCAGGCCAAGGCCGGCAGCCTGGCCGCCCAGATGGCCCGCCTGGACGTCGTGGTTCTGGACGAGCTCGGGTATCTGCCGTTCGCCCGGTCAGGAGGCCAGATGCTGTTCCATCTGATCAGCAAACTCTACGAAAAGACCTCGGTGATCATCACCACCAATCTCGCCTTCGGCGAATGGCCTAGCGTCTTCCAGGATGCCAAAATGACGACGGCGCTGCTGGACCGTGTCACGCATCATTGCGACATCATCGAAACCGGCAACGACAGCTGGCGGTTCAAAAACCGAAGCTAAGAGACGGGAAAAACCCCTTCCCCCGGACCCCCATCCCCGGCGGTTACCCCGGTGGGTCCACAGGTACCTCCCGCGGACGCCGCCGCCAGCGCTCCTGACGGCGCGCGTCGGCGTCCGCGGGTCCCTCCTATGGACTACCGGGATAACCGCCGGAACTCACGAAAAAGGGGGTCAATGTTGGGCGCCGATAGGGGGTCAACTTTGCGCGCCGGTTGACACATCCTCGTCGATGCCTTTCCGTCCTTGGCCGCCAAGGCGCTGGCGAGTTCGTTGAGACGTCGGCTCTGTCTGTAGGCCACGTCGATCTCGCGGACGTCGATATCAGGGAAGATCCAGCGAAGGTGATCGACTGAGCGGGCGCCCCAGACTGTTAGGCGTTGATTGAAGTCGCCACATGCAAAGAACGAGTTCGTCCAAGGACTTGCCAGCGCCGCCATGCAGGCGAGCTGGACCGGCGAGAAATCCGTCGCCTCATCGACAAGGATTTGGTTGCGCTGCAAGCGGCTGACATCGCTCAGGATCTGCGGTGTGCGATCCCCCAGGCGCCGCATGAGGTTAGTGTCGTCAAGCATGTCGCGGGCATTGCGCAGCATGGCGAGCAGGATGACGTCGACCTCGTGCGGCGAAACCTCCCCTCCAGCGATGTCGCTTTCCTGGTACCATCGACCGACCGCTAATCGTGCTCGGCGGAAGGTTCGATAACGTGCCGGCACGCCCCTGACGAAGGCAGTCGGCGAGGCAACTAGCCGTCGAGCAGCTCTCTGCACGACGACTAACTCGCCGATGTCGGCCATCTCCGGCAGCATCATACCACGATCCGTCAGCCACTGGAGGACCCGGCCGTTGCGGCTGGTACGTGACACCGGTCGCTTGCGAGCCTCGGCCACAGCCCGCCCTCGCAGCGCCTTTAGGAAGGCGTCCTGCGCGGCCCGGCGCCCCTGTTGCGGTACCTCATCTTCATCATCGCCCTCCGCGTCCTCGTCGTCTTCAACGTCAGGAGAGAGCGTCACGATGAACCGGGCGAGGTCGTCGAGCAGCGTAGGGTCGGCAGCCACCTGCCTGGCCAGCGGACGGCGTAGTCGCTGCCGAATTTCGTCGTTCAGGCTCGACGATGTCTTACGAAGCTCCTCGAACACGGCAGACACAGCGGCAATCGCTGCCACCGGGCGGGTGGACGCGCTCGATAGTAGCCCCGCTACTCGGCTTCCGATGCGGGCGATCGTCGGATCTGCTGATCCCCCGATCGCGGCAGCGTGTGCCGAGATTTGCTCGAGGAAGATGGCGGACGAGAAGGCATTGAAATCCTCGAACCAGGCGATCTGGTCGGTGATCGTGGCTGGCTGGAGGGGGTCCACTCGTTCGTTGATGACGAGCGTTCCGCCAGTGGCCGAACGCAAGATCGGAAGGGAACGCCGGGCCAGATCCCGCCGGTGGTCGCTCCAGGTTCGGATGCGCTGCTCGGGTGCTGGAACGCCCTCCCGCGCGAACGCCTCCTTGACGTACTGCTTTAGCAGCTCGGTGGGCGTGAACATGAGCCACGACTGATTATGGGGCGGTCCGTTTGCATCCGCCGTTTCGACCAGATCCTGTTCCTCCTCCGAGAGGAACTGGAAGTCTACCTTCTGGCGGAGACGCCGGACCAGCGTCGTGGTCTTGCCGGTGCCGGGGGGACCAAGAATGGCGAGGCGGCTGTCGAGGGGAAGCCGAAAGATTTCGTCTTGGAACTGGTCCAGGATCGGCTGATCCCGCAGCTGCATGGCGGTAAGAACCGAACGCCGCAGACCTTCCACCACGTTGACGGCTTCGTCGTCATCCGCAAGTTGCCGCTCGAGGGCATCCAGATCGTCATCTGCGTACCCCGCCGCCGCGAGAAGCTCGCGAAGCGAAACAATAGTTAGAGGGCCGTGTCCCTCTGCCTGGAAAACAGTTGGCTGGGAGTCCCACATCCCGCCCGGGCACGTTGGCTTGAATACCGCTTTTTCGACCACTTCGAGGTCTACAGCTCCCGAGGGCAATCGGATGCGCGCGAAGTCTCCCACGGGCAATGCCGCAAGGCGTCCTTTCGGCCCAAGGTTGCTACACAGCTGCACACCGCCGCAGGTGACTGGCGTCGCACGGCAGATGTAGAGCGTCTCACGTCGATCGTTCTCATCTGCCACTACAACCCGAGCGATCGCGGGTTCATCCCTCAGGTGTCGATACGCAGACGCAAGTTCTCCTTGCACACGTCCGAGATTCTCGACCGCTCTCGAGGCAGTAAGCTCGTTGATCCCCGCGAACGCATCCACACCGGCGGGGCGCTGTCCGATTAGGGATGCCGCGGCGCTGACGATCTGCTCGAAGGTGCCGAGCGCATCTTCGGCCAAGTCCCGAAGTATTACCGACCGGTCGCTCTGTTCGTCCTCTTTAGCGCTTGTGGCCACATCGCTCCCCCGTGATCCCCGGGGAAGAGTTGAGCGGAAACAACAGATTGCCGCAAGCTGATATCGACCGATCTCTCACCCGCCACCTTCGTGCTGAGGCAAGGATCATCGGGATGAAGGACATCGGCGTTCTACGTCACACTCGTTCGATTCGCTCCACGGCCAGCAGATCGAAGCCATCCCGAACGCCGACGACATTCACGCGCATGCCCAGCAGGCGCCGTGCTTTCCATCCTGCGTCCAGCCGCCAGGTTCCTCCGCCCGCCACCTCCAGTACCAGTTCTCCGCGCTGCTCCAGCAGGATTCCGGTGATCTCGTGGCGTGTCCCCCGCGGCATTTCGCCTCCTCAGCAGCGATGGTCGCGCCATAGGGGCGGCCAGGTGAGTACCGTTGCGCACGATGATTGCCGCGCCAAGCCATGAAGGTGCGAGACTGCGGGTCCCACCGCACATAACCCGCAAGTTCGGCTGGATGTGATGCGGTTCGGCTCGGAGGCGGTTCGCCGACAACGGGACCGAGGCCGGATTTTGGAGCTTCGATCGCGTCCGCGGATGCGTGTGGCGCTCACCGTCCTCGACGGGTCGCTGGTCGTTCCGGGGGTATTTCGTCCGGCGTAGGGTCGGCCGGGGCCGGCTCACCGCAGCCGAGCTCGGTGTCGAGGCTGGCGAACGGAACCGTTGCTCGCGGAAACAGCACGAGACGGCTGCGCAGCTCCACGTGCAGAAGGCTGACGCGGGCGTCGCAGTAGTCGCTGCGCCTCGCGGCCGCCTGATCGAACACGAACGCCGTGGCGCGGCCATCGGCGAGCGCGACGACCTTCCAATAGCCGGACGGCACGCGCTGATAGGGGCCGCCGGTCGGAAGAGGCTTCATCATCCGCTCGAATAGCGGGCCCGTGTAGACGTAGACGGGCACCCCGAGCCGAGTTGCCAGCGCGCGCTCCCTGTCCTCCAGCCGAACCCAAGGCCCCTGGTTGAGCGCCGAACTCTGGGGCGTGATGTTCGACAGGATGTTCGTGTCCGCTGCATGCGGGGTGCCGGAGAAGGACGACAGGGGCGCTTGATGGCCTCGGTCGATGTGGAGCGCGCCACTGGCGCCATCGTACGCATCCGGGGTCAGGGTCTCGTCCGATGACAGCCAGGGATCACGCCGCCAGTCGCGATCGCCCGATACCCCGATGCCCTCCCTGGTGACGCGGTAGGCGACCCAGTCCGCGAGCTTGGTCAGGTCGTTGGACGAGAGGGTGTAGATCTCGCGAACGACGATGTCGTTCGTGGCGGGCGCCCCGATGGGACACCCCTGCAGGCAGTGGAACGTATGGAGCTCCGATGCGCGGCTTTGTGCAGCGGCCGCGAATGGAGTCGCGAGCCCGACCAAGAATGCCGCGGCACCGGCAATCCTGGCCGTCAGGCGCGATGTGCGGATCATGGTTGCGGATGTCACGCGCCGAACCTGTCACGGAAGCGTTCGTGGTTGCGGAGCGCCTCCGGCCTATAAGGTCCCTCGCCGCCCTGACGAGGGACGCTGGGCAGTTCGAACCGCTCCTCGACAGAAGGGTGCAGGGTGGCCGTTTCGATCGGCTGCCGCCGCTCGACCTTCCAGGTCAGCGCGCGCGTGAGCCATCGCAGGGGCGCGGGAGTACGGGCCCGGATGGTGTCCGCCATGCCAGCGACCTCGTCGTGCTGGAGCCCTGCGGCTGACGGGAAGAGCCGCAGCCGGCTGTCGTCGACCAAGAGCGGATGGGGGACCGAACGCGCCTCTTCGAGCATCCACTGCAGCGAGATGTCGGACAGCCTCGACTCCGACTCGGGGTAGCTGCCGCCGATGTCGGAATGGTTCCCGGCGAACCATAGCTGGATGAAGGGCTTGGCTTCGCCCTCGACTTCGGGCGGTCGCTCGACACCCTTTCCCCAGCCCCATTTCAGACGCGGGAAGTCGGCGCGGTTCTCGTCGATCGCTATGGCGTGGCGCGCGTAGAGGACCGACCGGCTGAGCAGCCGGTCGAAGTTGCGCCCGCTCCACTGCGCGAGATGCATGCTGGGCCACCGTCGTCCAGGAAGCGGAGGCCACATGATCTTGAGCGCCGTGCGCAGATAACCCCACAGCGTCCAGCCCCCGATGACGGCGGCGGCGGCAAGGAAGGACCATGTCCAGCGAATGCCGAACGCCCAGTGCGCGACCCCGGCGAGCGTCGCGGCGGCCGCGGCGACGAGCAGCGCCACTAGGGCGGCGAGGGCGATCCTGAGCGGCCCCTTGGCGCCCAGGGAGGCGACGGTGTCGAACACGCCGACGAAGTAGGGCGCCACGTTGGCCTCTCCGCCGGCATCCGAACCGAAGTCGTGCCGGAACCGACGGGCGAGCTCGTCCCGCTCCTCCTCGTACTTCGCGCGGTCGTGGCCGGCGCCGTGCTCGTAGACGCGGATCACCGCCCGCCCGGAGATCTCACGGACCCGTAGCCGGAATCGGGGTAGCTCACCTTCGGGCACCTTGGTGGGCACGCCGCAGAGCATGAGGACGTTGGCGATGCACCGTGCGGTGTAGGCGCCGCGGCTGAAGCCGAAGAGCCAGATCCGATCGCCCGGCGTGTAGTGGTTGACGATGAACTCGTAGCAGTCCGCGATGTTGGTGGTGATGCCGCGACCGGTCACCGAGGACAGCAGCTTAGTGATGCCGCGCCGGATGGAGGTGATGCCGGTGGCGCTGGTGTCCGTGCCGAGGCCGGGATCGTAGAACGCGACCTGCAGGCGGGGATCGATCTGGCTGTCAGGGCCGGGACGGGCGGCGCGGTACATCTTGTAGATGTTGCTGAGCCGCTGCTCGGGCCTGACGCCACCGTCCTGCCCGGTGCCATCGGAGAATATGACGATGTTCTTGGGCACGGTACTCCCCTTCGCCGTCACCGTATCCCATTTGTTCCTTACAGCATAGTGTGGCAGCGCATGGAGGGGCACGTCGACAGGACGAATACGCACGTGGTTTGGCCGGCGACACCGTCGGCTACTCCTGGGTAACTTCGCGGACGGTGAAGTAGTCTACTTCGTAGCTGGCGTTCGGCGAGGCGTTCCCGTTGTTCAGGTAGACGAGAGGCGAGAAGAAGGCCGCGTCTGCAGGCAGGCGAAAGACCGACGCCGCCGTCTTGCCGATGATCGCCTTCCTGTCCTGCCAGGCCCGATCAGAGAACGAGAGATTGGCTCCCCAGGGATAGAAGCCATAGGAGGTGCCGTTGGGGATCCGAACTCCCGCTGCGTTGAAGACGACGATGCCGAGATAGGCGCCCGTCCTGTCGGGGTCGCCGTCGCCTTGGCGGATGCCGAGTGCGATCTCGAACTTCTGGTCGTCCGAAACGATCGGATAACGTGCGGTGGACTGGAACGCCACGTTGCCGCCAGGGATGCTGTAGGCGACGCCGGATCGGCCACCGCGACTTGCTGCCCACCGCATAGGCGCAAGCCGCCCGACGAGCGCTTCCTCATAACCCTTGAGGCCCTTGTCGAACGCCTCGTTGTCAGAGCACGCCGAGTAGTCGGTGGCTGACCGCGTGAGGGAGCCGAGATCTCCGATGAGCCCGGCCTCGGCGGCGTGCAGGCAAAGTGACGGATCCGCCTGCGCCCCGCTGGAGGTGTCGCGGCACGCGACGTCGCGTTGCTCCGTCACCGCCCCGCAGCCGCTCGTGTTGTTCCAGCCGCGATCCTCCCAAGCACATCCCGAGATCTGCTCCGCCTCGTCTAGAACGTCCGGTTTTGGGCCGGTGCAGATGCCGTCGTCCGGTGCGGGAATGCCGTCGGAGCGCAGGCACTGGACGGCGCGCGTGCGGGTCGCCGGATCGGCGCAGGTCGACGACCAGGCTCCATAGTCGCCGGTCCGCCAGTCGAAGGTGCATCCGTCGCGGCGTTCGGCGAAGTCGGTGGTGGCTGGCTTCGGCAACCGGCAGCGCGCCTCCTCGGCCGGCTTGTCGTCTGTCATGCATGTGACCGCGCGCGAACGAGGCGCCCGATCCGTGCAGGCGGCGAATCCGGTCCATTGCCACGCCCCCTCGACCCAGCGGCCATCGTCCTCAACTGGCTGATCGGGGATATGTGTCGCGTTCGTGGGTTTCGCGACCACTCGCCGCATCACCACCACGTCGCCGGGGGACGACGATTGCGCCAACGCTCCTGGCGCGGGAGCGCAGGCGAGTGTTGCGATCAGGCTCGCCTTCCAGAGGTTCTGCGTGACGGGCATGTGTTCTGCTCCTCGTCGGCCCCGTTCAACTGGTCGGGACAGCATTGTGCTGGTTCTAGATTGTCGGTTCAGCCGGCCATCCAGCAGCCGGATGGGGACGTATTTCCTCGCCCGATCGCATTCCGTCGCGACGTGGGGAACGGAGACTGGGGGCGGTGCTCCTCACGCTCCCATATCGCGCGCTGGATGGTGCGCGGCATCACGGCCGTTCGCGTCTGGCGGTCCACTCACCCTGCGCGCCGCTCCGCGTGGATCGGGATCTGGACCAGGCGCCGCCCGGAGGTGGCGCATCGTGTGCGTCTTGGTTTTAGGATGGTGGGAGCGCTTTCGGCAAGCGTCTCAACGGACGTTTCCGCCTGCGCAAGTGTGGCATTGAGGGCGGAAAGGGGGAAACGCGATACGGTCTGAGGTCCGTCGTGCTTTCGCTAACTATACGCCAGCTATATGCGAAAGTCAGGAATCGCTCTCGAAACCCTACGCTAGGTGCGCCTAATCGCGTGTCCCTTGGGGCGCCCCGAAGGTCGGTGGCAGCCAGTGAAGGGGCATATCGATGAAGTTCCTTGTTACGGTCAGCGCAACTGCATTGTTCGCGTGGCCTCTCGCAGCTCAAGCACAATCGACTGCCGAAGATAAGGTGGCGGCAGAAGCCGTACCGCCCTCGGTGCTCACGGTCAGTGGGTCGGCGACACTCGCCTCTGATTACCGCTTCCGCGGCGTGTCCCAGTCCGATCAGGAGATGGCGGTTCAGGGCGGTCTGACGATCGCGCACGACAGCGGCTTCTACGTCGGTGCATGGGCTTCGAACCTCGCGGGCTGGGGCACCTTCGGTGGTGCCAACATGGAACTCGACCTGATCGGCGGCTTCAAGGCGCCGCTTTCCGATAACGCAACACTCGATATTGGCCTCACTTGGTACATGTATCCGGGCGGCGCCGACAAAACGGACTTCGCCGAGCCCTATGCCAAGCTGACCGGCACTACCGGGACTGCAACGCTGACGGCCGGTGTGGCGTACGCGCCCAAGCAGCAGGCGCTTGGACGCTGGTACGACACCGGCACCGAGGCGGCGGCAGGCATCTACAACAATCCCGGTGCCAAGGACGACAATATCTATCTCTGGGGTGATGCCGCAGTTGGCATCGCGGGCACGCGGATCACGGCGAAGGCGCATATCGGCCACAGCTGGGGCCAGGACGGACTCGGGCCGAATGCGACGGCGGTCGCCCCCACCGGCGAGTATTGGGACTGGTCACTTGGCGCCGACGTGACGTGGAGGAACCTCACCTTCAACGTGTCCTACGTCGACACCGACATCTCCGTGTCCGAGGCCAGCCGTCTGCGCCCGAGCTTCAGCAAGGGCCAGGACGGCACCGGCAACATCGCCGGGAGCACGGTCGTCATGTCCCTGACCGCAGCCTTCTGAGGGGGAGGACACGGATGCAGGATCTGCTCTGGATCGCGATCATGGCCGGATTGGTGGCGGCGACGCTCGCCTACGTCCGCCTGTGCGACAATGCGTGAGGTGACGCGATGACGCTCGACCTCTGGCTCGCAGCGCTGACGGCGCTTGGCCTTCTCGCCTATCTCGTGGCGGTGCTGATCCGCCCCGAACGCTTCTGAAGGGGGCGACATCATGACACTTCAGGGTTGGATCCTGATTCTTGGCTTTGTTGCCATCCTGCTCGCGCTGGCCAAGCCGGTCGGCGCGTGGCTGTTCGCACTTTACGAGGGGCGTCGTACGCCGCTGCACCTCGTGCTCGGGCCGTTCGAACGCGGCTTCTACAGGCTGTCCGGCATCGACCCGACGCAGGAACAGGGCTGGCGTCGCTATGCGGTGCACATGCTGCTGTTCAACGCCGCGTTGATGTTCTTCAGCTACGCGGTGCTACGGCTGCAGGCGTTCCTGCCGCTGAACCCGCAAGGCCTGGGGCCGGTGAGCGAGCATCTCTCGTTCAACACGGCCATCAGCTTCACCTCCAACACGAACTGGCAGAGCTATGGCGGCGAGGCGACCATGTCGAACCTCAGTCAGATGCTGGGGCTGACGATCCACAACTTCCTCTCGGCGGCGACAGGCATCGCGCTCGCTTTCGCCCTGTTCCGCGGCTTCGCGCGGCGAGAGGCGAAGACGGTGGGCAACTTCTGGGCCGACGTGACCCGCATCACGCTCTACCTGCTGCTGCCGCTCTGCGTCGCGCTCACGATCTTCTACATCGCCTCGGGCGTGCCGCAGACGCTGGGTGGCGTGGTCGACGTCACCACGCTTGAGGGCGCGCGCCAGTCGATCCTTCTCGGGCCGGTGGCCAGCCAGGAAGCGATCAAGATGCTCGGCACCAATGGTGGCGGCTTCTTCAATGCCAATTCCGCGCATCCGTTCGAGAATCCGACCGCGTTGACCAACTTGGTGCAGATGCTGTCGATCTTCGTGGTCGGCGTGGGGCTGACATGGTGCTTCGGCAAGGCGGTGGGCAACACGCGGCAGGGCTGGGCGATCCTGTCGGCGATGATGATCCTGTTCCTGGCCGGAACGACGATCACCTATTGGCAGGAAGCCGCCGGTAACCCGGTGCTCCACAGCCTCGGCGTGGAGGGCGGCAACATGGAGGGCAAGGAGGCCCGCTTCGGCATCGCCGCTTCCGCGCTGTTCGCTGTCATCACGACCGCCGCCTCGTGCGGCGCGGTCAATGCCATGCATGACAGCTTCACCGCGCTGGGCGGCATGATCCCGCTGTTCAACATGCAGCTGGGCGAGGTCGTGATCGGCGGCGTCGGCGCCGGCATCTACGGCTTCCTGCTGTTCGCCATCCTTGCGGTGTTCGTCGCCGGGCTCATGGTCGGGCGTACGCCGGAATATGTCGGCAAGAAGATCGAGGCGCGCGAGGTGAAGCTCGCGGTGCTCGCGATCGCCGTGCTGCCGCTCATCATCCTTGGCTTCACCGCCCTCTCGTCCGTTGCGGATCAGGGGCTTGCCGGCCCGCTCAACAAGGGGCCGCACGGCTTCAGCGAGATCCTGTACGCCTTTACCTCCGGCGTCGCGAACAACGGCTCGGCCTTCGCCGGCCTGACGGCCAACACGCCTTGGTACAACGGCCTCCTGGGTGTCGCGATGTGGCTCGGGCGCTTCTTCATCATCGTGCCGATGTTGGCCATCGCCGGCAGCCTGGCGGCCAAGAAGTACACGCCTGAGAGCGCAGGCTCGTTCCCGACGACAGGCGGCCTGTGGGTCGGCCTGCTGGTGGGCATCATCCTGATCCTGGGCGGTCTCACCTTCCTGCCGAGCCTCGCGCTTGGTCCTATCGCCGATCATCTCGCGATGATCCGCGGTCAACTTTTCTGATCGGGGGCGCCGACATGGCCCAGAAAACCCAAACACCATCCCGGACCAAGAGCCTGTTTACGGCCGACCTCGTCGTGCCCGCGATCAGGGCATCGTTCACCAAGCTCAACCCACGCGAGCTCGTCCGCAACCCCGTGATGTTCGTCACCGCCGTCGTGGCCGCGCTGATGACCGTCCTGCTCGTGATCGGGCAGGACGATCTCTCGACCGGCTTCAAGATGCAACTTGTCGTATGGCTGTGGCTGACTGTCCTATTCGGCACCTTCGCCGAGGCGCTTGCCGAAGGCCGGGGCAAGGCGCAGGCGGCATCACTGCGCGCCACCAAGGCGGAACTCACCGCCAAGCGGCTGAAGGGCGACGGGCGCCAGTACGACAATGTGGCCGCCAGCCAGCTCAAGATCGGCGACATTGTGCTGGTCGAGACCAATGATCTGATCCCGTCGGACGGTGAGGTGGTCTCCGGCGTGGCGTCGGTCAACGAGGCGGCGATCACCGGCGAAAGCGCGCCGGTAATCCGCGAGGCGGGTGGTGACCGTTCGGCTGTGACCGCCGGCACGCGCGTCATCTCGGACGAGATCCGCGTCAAGGTGACGGTGGAGCCCGGCAAGGGCTTCCTCGACCGCATGATCGCGCTGGTCGAGGGTGCCGAGCGCCAGAAGACGCCGAACGAGATCGCGCTTACCCTGCTGCTGGTAGGGCTCACGATCATCTTCCTGATCGCGGTCGGTACGATCCCAGGCTTCGCAAGCTATGCCGGCGGCAGCATTCACATCGCCATCCTGGCGGCGCTGCTCATCACGTTGATCCCGACCACGATCGCAGCCCTGCTGTCGGCGATCGGCATCGCAGGCATGGACCGGCTCGTACGCTTCAACGTGCTCGCGAAGTCCGGCCGTGCGGTCGAGGCGGCCGGCGATGTGGATGTGCTGCTGCTCGACAAGACCGGCACGATCACGATCGGGGACCGTCAGGCAAGCGAGTTCCGGCCGGTCGGCGGTGTCGCGCCCGAAGCGCTCGCGGAAGCCGCGCTGCTGGCTAGCCTCGCGGACGAGACGCCAGAAGGCCGCTCGATCGTGGTGCTGGCGCGTGACCGCTTCCTGGTGCCCACCGCCGTGCTGCCGGATGGTGCGGAAGTCATCCCCTTCACCGCCCAGACCCGCATCTCGGGCGTGAGGATTGGCGGCGCTCTAATCCAGAAGGGCGCGGTGGATTCGGTTCTCCGGGCCAATCCCGGCCTGGGCGAGACGGCGGGCGCGACCGAGCTTCGGCGGATCACCGACGAGATCGCGCGCGCCGGCGGTACTCCGCTGGCGGTCGCTCGGGACGGCCGGCTGCTCGGCGCGATCTTCCTCAAGGACGTGGTCAAGGCGGGCATCCGCGAGCGCTTCGGTGAGTTGCGGGCCATGGGTATTCGCACGGTGATGATCACCGGCGACAACCCGCTCACCGCCGCGGCCATCGCCGCCGAGGCGGGCGTGGACGACTTCCTCGCTCAGGCGACGCCGGAGGACAAGCTGGAGCTGATCCGCAAGGAACAGCAAGGCGGCAAGCTGGTCGCCATGTGCGGCGACGGCACCAACGACGCGCCGGCGCTGGCGCAGGCCGACGTCGGCGTTGCGATGAACACCGGCACCCAGGCAGCGCGCGAGGCGGGCAACATGGTCGACCTCGACAGCGATCCGACCAAGCTGATCGAGGTCGTCGGCCTCGGCAAGCAGCTGCTGATGACGCGCGGGGCGCTCACCACCTTCTCGGTGGCCAACGACGTGGCCAAGTATTTCGCCATCATCCCCGCCATGTTCGTGGCGCTCTATCCTGGGCTCGGGGTGCTCAACGTCATGGGACTGGCGACGCCGCAATCGGCGATTCTGAGCGCGATCATCTTCAATGCGCTCATCATCCCCCTCTTAGTGCCCCTAGCGCTGAAAGGCGTCGCGTACAGGCCGATGGGTGCCGGGCCGCTGCTGGCGCGCAATCTCGCCGTCTACGGCCTGGGAGGTCTCGTCGCGCCGTTCATCGGCATCAAGATCATCGACCTCGTGGTCGGCGGGCTCGGTCTCGCGTAAGGAGTGACAGTCATGGGCAAGGATTTTACTTCAGCACTGCGGCCCGCGATCGTCATGACGATCCTGTTCGCCGCTCTTCTCGGGATCGCCTATCCGCTGGCGATGACCGGCATCGGTCAGGCAATCTTCCCGAGCCAAGCCAATGGGAGCCTGGTGCGGGATGCCGGCGGCAAGGTGATCGGCTCGACGGTGGTCGGTCAGGCCTTCACGTCGGATCGATACTTTCAGACACGGCCCTCGGCCGCCGGGGAGGGCTATGACGGACTCGCCTCCTCTGGCTCGAACCTCGGCCCCACGTCGCAGGCGCTGGTCGACCGGGTCAAGCCGGACATCGAGAAGCGCCGCGCCGAAGGCGTCACCGGACCAGTGCCGGTCGACCTCGTCACCGCCAGTGGTTCCGGACTCGATCCCGACCTCTCGCCGGAAGCGGCCCTCGCACAGGCGCCGCGGATCGCGAAAGTGCGTAACCTGCCGGTCGAGCGCGTCCGTTCGCTCGTGACCGATCAGCTGGAGACGTCGGTGCTTGGCGCGCCGCATGTCAACGTGCTGGCCCTCAACCGCTCGCTAGACGCGCTCACGCGCTGATGGCGCTGTCCGGCGACGGCAGACCCGACCCTGACGCCCTCCTGCGCGCTGCCGCGCAGGAGGGCAGAGGCCGCCTCAAGATCTTCCTCGGCGCCGCCCCCGGCGTCGGAAAGACCTACGAGATGCTGTCGGACGGGGCTGCCCAGCGCCAAGAGGGGCGGGATGTCGTGGTCGGGGTCGTCGAAACTCACGGTCGTGTAGAGACTGAGGCGCTCGTTCGAGGTCACGAGCTGATTCCCCGCCGGGAAGTTCCCTACCAGGGTCGCATTCTCCACGAGATGGATCTCGACGCGTTGCTCGAACGGGCGCCTGAGCTGGTGCTGGTGGATGAGCTAGCCCATACCAACGCGCCGGGCAGCCGGCATCCCAAGCGTTACCAGGATGTCGAGGAGCTGCTCGCTGCCGGCATCGACGTCTACACCACTGTCAACATCCAGCACATCGAAAGCCTCAACGACGTCGTCGCCTCGTTCACGCGAGTCCGGGTACGTGAAACCGTGCCTGACGGCATCCTCGAAATGGCGGATATCGAGGTGGTCGATATCCCCCCCGACGAACTGATCGAGCGGCTGAAGGCCGGCAAGGTCTATCTCCCACGTGAAGCGACCCGCGCGCTCACCCACTTCTTTTCGAAGTCCAATCTTTCGGCACTCCGCGAACTGGCGCTGCGGCGGGCTGCTCAGGCGGTGGATGCCCAGATGCTCGAGCATGTCCGGGCACTGGGCGTGGGCGGCACCTGGGCCGCGAGCGAGCGGATCGTAGTCGCAGTCAGCGAGCTTCCGGGGGCCGATGGTCTGGTCCGCGCCGCGAAGCGGATCGCGGACGCGCTCCACGCGCCCTGGACGGCGGTCTACATCGAAACGCCTCGCGCGCAGACGTTCGGCGCGGGTGAGCATCGATCTCTCGCCGCGGTCATGAACCTTGCCACCCAGCTTGGCGGGGTGGTCGCCACGGTGCCCGCGACGTCGATCGTGGCAGGCCTCAAGGCGTACCTGACCGACGCACGCGCAACCCAGCTGATCGTGGGTAAGTCGCAACGGTCCCGCTGGTTCGAGCTGCGTCACGGTTCCGTCGTCGATCGTCTCGTACGCGAAACGCCGGGGGTAGCAGTCCACGTTCTGCCGCTTGAGTCTCCACCGCCACGGACTGGTGGCTTTCGCATCCACAATGCCTGGGGTTCGCGCAGCGGCTATGCATGGACTGCTGCGATGGTGGCGGGTGTCACAGCGATCGCCAGTGGCCTGTTCCACGTGCTCGATCTGGGCAATGTGGCCCTGCTCTACCTGCTGCCGGTCATGGCCGCCGCTACCTTGTTCGGGCTGCGAACCGGCCTGTTCGCAGGCCTCACGTCCAGCCTGGCCTATAACTTCTTCTTTCTGCCTCCCACGGGCACGCTGACGGTCAACAATCCCGAGAATGTGATCTCGATCCTGGTGCTGCTCGGCGTTGCTATCGCGACCAGTCAGCTGACCGCACGCGTCCGCGCCCAGGCCGATATGGCGTCATCGAGCGCGCGTACGAATGCGGCATTAGCCGGCTTCCTGAGGCGGCTCACTGGGATCGGCGATCCAATGGAGCTGGCACGCGCGATCTGCGAGGATATCGCGCGCCTCTTCGATCTGCGTGTGGTGCTGCTTGTCCCAAACGGCGGAGGTCTGTCGCTCCAGGCGGCCAGCCACCCGGGGTGCGACCTCGAGACGATGGAACTGGCAGCCGCACGATGGGCATTCGATACCAGCGCGCCGGCGGGCAGGGGATCGGGAACCCTGGCGTCGTCCGACTGGTATTTTCAACCGCTGCGCGCGGGCGACAGGACACAGGGCGTGCTTGGGCTGGCGAAGGAGGATGGGGGCGATCCGCTGCGGGCGGACCAGCTGCCGTTGCTGACGAGCCTTGTCGATCAGGCCGCACTGGTGATGGAGCGCTTCCGTCTCGAAGAGGAGATGCGCGATGTCGAGTCAGTCCGCACCCGTGACCGATTGCGACAGGCACTGTTGTCGTCGGTGAGCCACGACCTGCGCACGCCCCTGACCGCCGTGATCGCCGCGGCCGACCAGCTCGATCATGGCGCCACGCCAGATTTGATCGGAACGATCAAGGCCGAGTCCGCACGGCTCAACCGCTTCGTGTCCAACCTGCTCGACATGGCGCGTGTCGAGGCGGGCGCGCTGAAGCTCAACATCGAAGCCATCGATCTCAGCGACGCTGTCACAGGCGCGGCTCATGATGCGCGTCGCGCGCTGGAAGGCCATCCCGTACGGCTGGATGTCCCGCCGGATCTGCCGCTGGTGCGCGCCGACCCGCAGCTGCTTCATCACTGTCTTCTGAATCTGCTCGATAATGCCGGCCGCTACGGCGATCCCGGCACGGAGATCGTGATCGAAGGGCGTCACCTCTTCGGTCACATCCGCCTTGCCGTGCTGGACAATGGTCCCGGCTTGCCTCCGGGCCGCGAGGCGGAGGTCTTCGAGACCTTCCGCCGGTTCGAGGGATCGGACCGCGCGATCGGCGGAACCGGGCTCGGCCTCGCCATCGTCAAGGCGTTCGCGGAGGCCATGGGCATGTCGGTCGAAGCATCCAACCGCGAAGACGGCAGCGGCGCCTCGTTCGCGCTGATGTTCCCATCCCCCCTCATCGTGCGCGACGTCGCACAGGGAAGCGTTTGATCGGATGTCCGCCAAGGTTCTCGTGATCGACGATGATGCCGCCATCCGCCGGCTGCTACGCAACACGCTCGAACGCGCGGGCTACGCGGTCATCGAGGCTGTGAACGGACGGGATGCGCTTGGGCAGGCTGCCTCGCAGCATCCCGATGCCATTCTGCTCGACCTGGGTCTGCCGGACCGCGATGGCCTCAGTCTCATCCCGCTCCTGCGAACCGACAACGGCGTGCTCCTCGTGGTTTCGGCTCGCGAGGCCACCGACGAGAAGGTGAGCGCGCTCGATCTGGGAGCGGACGATTACGTTACCAAGCCCTTCGACACGGAAGAGCTGCTCGCCCGCCTTCGTGTCGCGCTTCGGCATCGGAGCATGTCGGAAACGGCGCCCAAGGTCATCCGGAAGGGCGACCTGAGCATCGACCTTGACCGTCGCGTGGTGTGCCGCGCTGGGGAGGAACTGCACCTCACCCGCAAGGAGAATGACGTGCTGGCGGTGCTTGCACGCCATGTCGGCAGGGTCGTCACCCATGAGCGGATCATCGCAGCCGCATGGGGCGCGGACGAGGATCCGCGCATCGAGTACCTCCGCATCGTGATCCGCAACCTGCGCCAGAAGCTGGAGGCACCCGGTCCGGTCGGCAGCGTCATCGCCAACGAGCTTGGGGTCGGGTACCGGCTCCGAGCGGATTAGGCGCGTGCGGTCTTCACGCGGGCGCATGCGAAAGTCGGACTAACAGTCGCGAGTCGGGAAGTCCGCATCGTTCCTCCCCGCTCCGAAGCCAGGTTCCGATACCAATGCCCGCGTTCTTTCGCCTGGGCTGTTCCCGACGGGGCAGCCCGGAACACCAAGCGAGGGCCATGATCTCTTCCGGAAGGAGGCGACGAGCCGGTCCGAATGAACATGGGTTGGAGTCGATCCGCGACACCCGGCGAAGCGCCTCTTCCGGGCGATCGCCTGCTCCATCGCTTGGCGGTATTGGCAGGCACTAGTTCGATGCGCGGGCGGGACACCGGGATGTGCGGCCTCCGGAAGCGGTGGTCCGGTAGCCGCGCTCGGGGCTGCTAGTGGCGCGTGCTGCGCGCCTTCTTTCCGCGCTCGCACGCCAGGGCCAGGAACTCACTCAAGCCTTCCGCAGGAGCGGTCTCCGCCCGGTATACCACGGTGCGGCCCACCTTCTCCGAGGACACGAGGCCCGCACGCTGGAGAACGGCGAGGTGCGCCGACATGGTGTTGGGGGTCGTGCCCACCGCCGCCGCGATGTCGCTCGATGCCATGCCTTCCGGAAGGGCTTCCACCAGACGGCGGAACGTGGCGAGCCTGGTCGCTTGGGACAGCGCGGACATCACCTGAAGGGCATGCATCTTTTCCACTCCTGATACTCTAGGTCCGCTCGGCGCCCCAAGCCAAGCGTCCTCGCTACAAAAGCGTTAGCGTGGCGCTAACACTACAATAATTCGCGAAGTGTTGTAATGTTCCGTCGGTGGTCCATATTGTGGTGTGCCGGCGGTCGTCGCCACTTCCGGCACACCTACATCGATGATGATGGAGTGAACCATGAACGCGATCACGGGAAGCATAGAGGCTTCACCTCAGACCGACCTGGACGTCGTCCATTCCGCGCCCGCCTGTGGGCAGTTCCCGGACCCGCTCGAAGCGGACGATGCCTCGGACGAGATCGTCGTCACCACGCGGCGTGCCATCGGGCGTGTCGCCCTCGTGGCGGCCGAGGTGGGAATGCGCTTCCAGCGCGAATCCGTTCCCTACGATCCCATGAGCTGGATGCTGGCGCCGCGCCGCGTGTTCGACGGAGCGGCCCCCGTCGACGCATGTCTCGACCGGGACGACTGCATGCGCGGCGTCCTGGTCCATGGCCTTGGGCTGGGACTGGATGTCGACCGCGCGGCGATCGACACGCTCATGTCATCCGATGACGATGACTTCGATGAGCATGAGTTCGGGCACCTGTACGATGGCCAGTTCGGCGGCGCCGGGCGCTCGAAGCGCGAACGGACGGGTCGTGGCACGCGCCTGCGCCTCTACACCGCGACGATCGCCGAGACCCGTGACAACGTCATGACCCAGGCCTTCCACGCTTCGGTCGCCCGCAACTCGGGCGAAATCCGCGCGCGCCTCGCGGGCCGCTTCGGTCCCGACCTTGCGGACGCAGCCGACATCCGCATCGGCGTGCACGCGGCGTCTCCCCTGGTGGTGGCTCTGGTGCCGGGAGCCGTAATCGAGATGATCCGCCAGATGGATCGCGATTGCGGCACCGTCGCCGCGCGCACCTTCGCCGTCGACATCCAGCAGTGCATCCAGGCGTAGTCGGACCAACCGCCTGACTATGCTCGCGGGCGAAGGGCGCGGCCGTTGCCGCGGCGCGCGAACAGGCTCTTCGATCCACCCCGACTGAACCTCCGCCTCACGGAGACCTTCATGGATACCCAGACCAAGGTCGGGCGCATTGACGCGCCCGCCGCCGGAGGCGTGCGCCTCCTTCATCCCCTGCGATTGCAGGAAGGGCCGACCGGCGAAGGATCCGGGCGGGGCCCGTACATCGGCGTCGCCGTCGATTGCGAGACGACGGGCCTCGACTGGAAGGCAGGCCGCATCATCGAACTGGCTCTTCGCCGCGTGCGCTACGATCGCGACGGCACCATCACGGACATCGATCGCGCGTACGAGTGGCGGGAGGATCCCGGCGAACCGCTGACCGAGGAGGTCTCGCGCCTCACCGGTCTCACCGACCAGGACCTGGCCGGCGAGGAGATCGACACGGATGCCGCCGTCCGGCTCCTCCGGTCCGCCTCGTTCGTGGTCGCCCACAATAGCGCCTTCGACAGGCGCTGGATCGAGGCGCGGCTGCCGGAGGCCGCCGGGCTTCGCTGGTGCTGCTCCATGGCCCAGGTGGATTGGCGCGGCCGTGGCTTCGACGGCAAGGCCCTGGGCTATCTGCTGGTGCAGAACGGCTTCTACTTCTGCGGCCACCGGGCGGCGAACGACGTCGACGCGATGATCGAGATGCTGCGCCACCGCGATGGCCGGGGCCGCACGGCCTTGGCGGAGATGATCGAGCGCGGCTCGGCGCCCTCCTGGGTCGTGCGGGCGAGCGGCGCGCACTTCGATCTGAAGGATGCGCTGCGCGCCCGCGGCTATCGTTGGTCAGCGGACCTGAAGGTCTGGGCAAAGGAGGTCGCGGACGACGACCTCGTGCCCGAGCAGCTCTGGCTCGCCGGCAACGTCTACTCCGCGGACGCGAAGGCTAAAGCGCTCTGTCCCGAACTCGTTCGCGTCACGCCGCGCACGCGCTTCCTCTGACCGGGCGTGAGCCCACCCAATCGAAACAGGAAGGATGACCATGAGCAGGAACAAGATCACGCGAGCCAATAAGCCCGTCTCGACTGCGCTCGCGATCCCGACCGCGGCGGAGCTCGTGCCGATCGTCGGATCGGAGCGGGCGCTGGCGAAGCTGCCTGGCACCGGCGGAGCGGTCGCGCGGGAGGATGGCACCGTGCTGGTCACCGGACGCGGCTACGCAACCCGCGACGGGGTCATCCCTGCGGCGGAGATCTACTTCGTCGATGGGCTGAGGCCGGCGGGGGAGGGTGTCTGGCTGGGCGAGGCCGACAAGGTCGCGTGGCGCGACGAGACGACCGGCTACGAGTGTATCATGCTGCGGGCCACCCGCGGCGGCTACCTGAGCGGTTACGTCGGCGTTCCGCGCGATCACCCGCTTTGGGGATGGGAGCACGGGGCAGTCGGGCCCGATCTCGGCGTCGAGGTGCACGGCGGGCTGACCTACTCGCGCATCTGCGAGGATGGACCCTCGCCGGAGCGCCGGCTGGTAGAGGAGGCCCGCCGGATCTGCCATGTGCCGACGCTGCCTAGCCAGTACGAACCGGTGATTCACGCGACGGGCCACCGTCCCGGCGACGTCCACGCATGGTGGTTCGGGTTCGACTGCAACCACGCCTACGATCTGGTGCCCGAAGCCGACCGTCAGCCGCAGCCGTTCCTCGGCGCGGAGGTGGGGGCCGAGTACCGCGACGACGCCTATGTCATCCAGGAGATCCTCAATCTTGCCGCGCAGTTGCGGGCGGTCGCCGACGGTGTGCCGGCGCCGCCACGCCAGGGACCGCCGCTGCCGCCCATCGGCCTCGATCCCCACGCGGGAGGCTGACCATGGATCCGAGCGACTGGCATTTCGGCGACCTTCCCTGGGGGTGGTGGCTCCGCGCCCGTGACGGCGTGCTGGAGGATCAGGACGGGCGGACATGGTGCACGGTTCGCGACGCCTTCTGGCACGGTGAACTCGCGATGCCTTCGAGCAACGTCGTCCGCGAGCAGGTCGAGCTGCTGCAGCGGGTTCTCACGGCGATCCATGGGCGCTGGCTCGGCGGGGCCGAGCGCCAGCAGGACCTGTTCGACGGGAGCATGGTGTTCTGGCGCTTCTATCCGTGCTGGCTCGCCTCGATCGGACTTATCGAGCCTCGTGCCCGGGGCACGGTCCTGGAGGCACCGCTCACGCCGAAGGGCAGATCGGTGATGCTGATGCTGCAGGCCACGCGCGATCCCGCGTGGGAGGCGCTGCCGATGTCGGAGGTGCTTGAGGCGGTCAGCGCCGCCGAGCGCGGGACCGCCGACGAGGCTCGCGAGCAGGCATTGCGGGACTTCGAGCGGAGCCTCGGCAGACGTCGTCATCTCTTCGCGCGCGAACAAGTCGGCCGATCGCATCTGGTCACCCTCACCGGCGTTGCCATTGATGCAAGGATGCCGACGCTGCGCGTCATGTGGTCGCAGGCGTTTGCGACGGAGCGGGCGCGCGACGATCTCTTCGCCTGGCTGGCCACCAGGGTGGACCGCTGGGACGACTGGGGGTTGCTTGCCTACCGGAAGGGTGCGGACGCTCTGACGAGGCACCTCTTCTCGCTCGTTCTCCTCGACGGATCCACCTCGTCATGATGGCGCGCGAACCAGCTGGTGATCGAACGCGCACGGCGTCGGTCGGGCCTGATCGCATCCACGAGCTGGCGAGGCGTCGGGCGTGCGACGAGGCGCTCGTGATCGATAGGCTTGTGGAAACACTGCGGCTTGCCTCGTTTCGCTCGTTCCTCGCCTCGACCGTGGTGTCCATGTCCGCCATCGTCCCCTCCGTATTGGACATGGTCGGATCGGACGTGCCTTCGGCCCTCCAGCGGATACGGCCGGGGCACCTGTGGCCGCGGAGCACGAGCCGCGCCGGTCGTTCTCCCGCTTCCTCCGCCTTGGGCAGGAAGGACTTGGTCTGGCCGATGCGGATCGGTGATGCGGTCATGGCGGATGGCATCCTCGCGTGGGTGGAGGCTGCGATCCTCGGATCCTCGCTCGACATCGTCCTGCGCGCCGGGGGGGTCGAGCTGGCGACCTACGCGGGGGTGGCGCGCCTCCAGGTCGACGACCGCCTACCGGACACGGTGCTCAGCGCATGCGAGGGGCGCCCGCTGGACCAGATCGTCGACCACCCGCTGCTCCGCGGACGTGGATACGTCGTGGATGGGGCCTATCAGGCTCGGGACGCGTCCGTGCTGACGTTCGACGTCGGTCGGCGGAGCCTGGAGATGCCGTGGCGCCCGTGACCTGCTGTTCCTGTAACCTCCACGTCCATTGCTGATCTTGCTCCACCGTCGAATTGACTTGCGGCTCACATGGGTTCGTGCTCAACTTTCCCTGTATGTTCTCATCAATCCCCAGCGCCGGATGTAGTTCATGACCCTACAGTCCTCCCGCGTCGCAGCAGACCCCTTCTTGCTCCGATTCGCCACGCCGCGGTCCGCCGACGCGGAGATGCCGGGACGCTATTCGCCCGATCTGGGTGTCTGGGTGATCGATCGTGACGGTGGAGAGGTTCCGATCATCGAGGTGGCCGGCGGCTCGCTGGTCGCGACCCAGAGCAAGACCATGACCCATGTGGAGGTCGACGACGACGACCCGGCCCGGTTCGGGTCGATGGAGACGGGCACCTCCACCAGGGTGCGTCAGGAAGCCGACGACGAGGATGCTTCGCTCTGCCTCCCCGAGCTGACGACCAAGACCGACGTCCAGCAGGAGCGGGACGACGAGACGGTGACGGCCTACTGGTGAGGATCGTCCCGGCTTGATCCTGATCGTCACCAACAAGCGAGACGTGACGACTGACTTCGTCGTCATGGAGATGCGCCGGCGGGGTCTTCCGTTCGTTCGGCTCAATACGGAGGATCTGCCGCAGCACCAGGTCGCGATGATCGATGGCGATCCGTCCGAGCTGACGCTCACGGGCGCCTGCGGTTCACTTCGTCTCTCCGATGTTACGGGCGCCTACTATCGTCGACCAGGCAGCTTCGAGGTCGCCGGCTCCGCTCCCGTCTCTGAATACGTGGTGGCCGAATGGTCCGCTGTGCTGCGCAGCCTCTGGAACGCGCTTGAGGGCCGCTGGCTGAACTCGCCCTTCTCGATCCTGCGAGCCGAGGACAAGCCGCGACAGCTCGCGGCCGCGCGTCGCGTCGGCCTGCGTATCCCTGGCACGCTGGTGACCAATGACTTTGCTCTGGCGCGCGATTTTCTGTCGGGCGGGCCCATGGTGGCGAAGCCACTCCGCCATGCGCTCATCGACGACGGTGAGGTTGGCAGCGTGATCTTCACCAATCGGGTTGAACGCCTCCACGACGCCGACGCCGAAGGGTTCGGACGAGCGCCCATGATCCTGCAGCGGGAGATCGTGAAACGCGCGGACGTCCGCGTCGTGGTGGTGGGCGGAGCGGTGTTCGCCACCCGGATACTCTCCCAGGCGTATGACGAGACGCAGGTGGACTGGCGGCGGGGCGTGCGTCAGGACCTCGACCACGAGCCCCTCGGACTGCCGCCCGACATAGTTTCAGGCTGCCTCGCCGTGACCCGAGATCTCGGTCTGCGCTTCGCCGCGATCGACCTCGTCGAGGACGGGCAGGGCGCCTTCTGGTTCTTGGAGGCCAATCCGAACGGTCAGTGGGCCTGGATCGAGCAGAAGACGGGCGCGCGGATCACGTCCGCGATTGTCGATGTCCTCGCGTCCAAGGCGCGGGTATGAACTGGCGTCGCCTCGTCGATCTCGTGTTTCCCTACATCGAGCCTCTTACGACCGCCGAGAAGATCGCGGAGGAGCAGCGGCTTCGACGGGACATCGCGGCCATCGAGGCCGCGGACTTCACCCGAAGCGACGAGCGGGCACTCGACGAGGCACAGAAGGTCTCCGCCACTGAAATCGAGCGCGTCCGCACCGCAGAAGGTAAGGCCACAACCTATCTGGCGGTGCTGGCAGCACTCGTACCGGTGATCATCACATTGCAGGCAGCCAACTGGGAGAAGAAGGCTGGTCCCGCGCCGGACGCTGCCCGCCTGTTCGTTCTGGCGGTGGCAACCGTCTATGTGGCGGCGGCCGGCTTTCACGCCTTCAAGACGCTCCAGGTTCAGGGCTTCCAGCGCGTCGGGGAGGCCGAGATCGCAGCGGCGTGGCAGTCCCAGAAGCCGCTGCGCAGGCTCACGCGCGGCACGCTGCTGGCAACGCGCCGGTCGAGGGATGCGGTCAACGCGAAGATCACCCGGATTCGGGTGACGCACGAGCATCTGCTTCGCGCATTCGGTACGTTCGTCCTGCTGCTGCTCCTCGATCCGCTGTTCTACGCCATGGGCTTCCGAAACGCGGCACCGGATGCACCAGCCGCCAAGAACGTCGTGGTGGAGCACCGACGGGTAGCGGCGCCGCCATCGACAGCGCCACCTCCTCCGTCGCAGCGGCAGTCACCAGCTGGATCCTTGGAGCGAACATCTGCTCGGCCGCAATCGGTCGGGGCCTCGCCGACTGTTGCGCGAAAGGTCGAATCTGCGGCGGATCAGGCGCAGGATTGTCCTGCTGGAGCCAAGGAGTGCTCGGGGCCGAGATAGTTCCGGCGGCGGCCTCACGACTGTCCTGGCTTTACCAAGAGGAGCATGGCGCGGGCTTACAAGACAGCATGTTGCTTGCCGGTGGCTGGCTGCCCGCAGCGCCCTGGCTGAGGCGGACGGCGCAGATCTACCCACGGTGCTTCTCGGCGAGCGAGATGATCGTTAACGGGCGATCTCTTTCAGCTCGTCAGACATCCCAGAACCGACGCAAAACCGGGCAATGACACCATAGGTTGTGCGAATGACAATGACAGGTTGTGCCAGCGTCTAAGTCATTGAAAAAGTGGTGGACGCACTAGGGCTCGAACCTAGGACCCGCTGATTAAGAGTCATTCATAGGCGCTCCACGCCATACCCCGCATCACCACGAAACCGATTGTCTTACAAGCATTTATTGGGCACTACGAGGCGTGGCCGGGGTTGGCCACGCTGACCCCGGCACCGTCTTAACCGTCTTAATTCCCGTCTTCGGACGGGTGGGAGCCGAGCCGATGTCTGACCTGATTGAGCTGACCGACAAATATCTGAAGGCCCTGCCGCCCCCGACCGGCGGCCAGACCGTGGTGCGCGATACGCTGCCCGGCTTCTTCGTCGTCGTGGGCAAGCGCACCAAGACCTTCACCGTGCAGTGCGACGTGAAGGATGACCTCGGGCGGCGGCGCACCAAGAAGGTCGCGCTGGGGCAGGTGGGCGACCTGACCGTTGCCCAGGCCCGCGCGAAGGCAAAGGCGACGCTGGGCGCGCTACAGGTTGCCGGCAAGTTCGAGGAGCGCCGGAAGGAATGGACGTTGGGCGAGGCATGGGACCACATGCGCGACTTCGACCTTCCGGCGAAGGGATCACGCCCCCGGACGGTTGACGGCTATGAAAAGACGATCCGCCGCCTCATGGGCGACTGGCTCAAGGTGCCGCTCCGTAAGTTCGCGGAGAAGCCGCATCTGGTGACGGAGCGCCACCGCACCATCACCCGCGACAACGGCCCCTATGCCGCTAACCACTTCGGGCGAGCGTTCCGGCGGCTATATAACTACGCGCAGGCGAAGCTCGACCGCACGTTGCCGGTCACGGCATGGTCGAAGGTCATCACCTTCAACCGCGAATATCGCCGCAACACGGGCATGGGTGATGGGGATCTTGGGCATTGGTTCACCCAGCTTGCGGACATCCCGAACGGGGTGCGCCGGGAATTTCACCTGTTCACGCTGCTGTCCGGGTCGCGCCCCGATGCGCTGAGCAGGGCGGAATGGCGGCATCTGGACGTGAAGCGGCGGGCACTGCACATCCCGGCTCCGAAGGGCGGGGAGGATCGCGCCTTCGACATCCCGCTGTCCCGGCCCATGCTGGCGTCGCTGGCGCGTGCCCGGCGGATCGGCAGCAAGCTGGCCCCGCGCCAGTCTGAGACGTTCATCTTCCCGGCCGCAAAGAGCAAGGCTGGCCATATCGTGGAATGGAAGGAAGATCGCGATGTGCTGGGCAAGTGGGGCGTGGACCTGCGCCAGTCCTATACTATTGCCGCCGAAACGCTGGACGTTTCGGAGCGCACTTTGAAGCGCCTGCTGAACCATGCGACGCAGGACGTGACGATGGGCTATGGTGATCGCGACCGCATGTGGCCCCGGCTGCTGGAAGAACAGGCGCGCATTTCGGCGCACCTGATGGCCTACGCCAAGCGCAGTTAGTCAGTTTTTACGCTGCACTACGTTTGGCCCCGTTAAAATGCTTGCCGCGCGCGACGTGGCGAACCATATTGTCATCGTCTGGTTGAGGTAGCGCGCAAGCGCAGGCCCATCGGGGGAACGACCCCCGGCCATGACCGGCGGGCACCACTACGCCCCCAGCCGCTCTGGAGGCCCTTCGCGGGCCGCGTGGACTTAGCGGGCCGACGGAACCAACTCAGCGGATGGCCGGAGCGCCAACGCTGATCCGTCGGAGCTTCTCATGACCGCTGCACTTGATGATTCATGGGGCCAGCTTGCCGAGCGGGTCGCGGGCATCGTGACACGCCATCTTGCTGACCTGCCGCCCCGCCGCGTGTCGCCGTGGTTCACCCCGGACGACGCCGCCGACTATCTCCGCCTGACCCGGCGGGGCCTCGAAGATATGCGCGCAAAGGGCACCGGCCCGCGCTTCCACAAGGTCAATGATCGCGTCGTGCGCTACCATGTGCGCGACCTTGACGCTTGGCTTCTGGGCGAGGGAGGCAAGCGTGGGCGTAGCTGAATACGAAATGCCCGCCCCGTGCAAGGGGGCGGGCGAGGGCGCTACTTACCGAGCCGCAGCCCCTTCATTTAGCGACCATCCTTGGCTCTGGGAAATCAGCAACGGGGACTACGAGACGCTATACTATGCGGCGCGGGAGGCCGTCCGTTTCGCGGAAGCCCTGTTCGATCTCGACCGCGAGTATGAGGATGGTCGGGACTTGGTCGGCCGCATTCAGGCCCGATGGGACGCTACCCCGGAAATGTTGCGCGCCTGCCTGCCGCCGATGATGAATGTGGCGGTGCGGCGCGAAGCTGAGCGGTTGCTGCTTTGGGAAAGCCAGATTGCCGAAGCACGGTTCCAGGCCGTGAAGGGGATGGTGGCCCATGGCTGACGCCTTGCCCAGCACCGACACGGTCGGGGCCGCGATGAACTATATCGGCACGGGGATGCGGCTGGTCGTATTGCGGCCGGACAGCAAGAAGCCGGTAAGCGAAAAGGGGTGGCAGAACGCTGAGCCAAAGGGCCGTGATTTTGAGGACGGCAAGAATATCGGTGTGCAGCTTGGCGCAAAGTCGGGCCATCTTGTTGACATTGATTTTGACTGTGCGGAAGCGCGCGCACTGTCCGGGCTGGGCTGCTTCTTCGCAGATCTGCCGGCATTCCGCCGGGCCAGCCTGTCGGCGGACGCCCCCGGCCATAGAATTGTGGTATGTGCGGATGCGCCCGATGCGGTGATGCAGTTCGCCTTCACCAAGAATCCCGAACAGGAAGCCATCGCGGACCTTGGGCTTACCAAGTCCGTGATTTTGGAATTGCGAGCGGGCAAGGGCTATACTGTGTTCCCGCCCTCGGTGATTGAGGGCGACCGGCTGGTCTGGAACCCACGGGTCAGCGCGGACGTGCCGACTATGGCATGGGAGGAATTGCGCCTTAAGGCGGGCATCCTTGCTTTCGCCGCGTTCGCCGCCGCCTGTTATCCCCCGGAAGGCGGGCGCGACAATTTCTGCCTCTCTCTGGCGGGCGCTCTTATTCATGCGGGCGTTGCCGCCGAAACGGCTGAGGATATTATCGCCGCCATTGTCGCGCTTAAGGGTGACAACCCGCGCGACCGGCGCGGCAAGGCCATCGCCACGGCTGAGAAGCGCGATGCCGGGGAGCCGGTAACGGGCCTGCCCGCATTTCTCGAAAATATCGGGATGCGGGCGTGTGAGAAGCGGCTGAGGGATTGGCTGGGGATGGCCGCCGCAGAGGCCGGGGAGCCGCTGCCCCCGGACGCCATCCTTATCGGCAGGCCCGATACCCATGCGGTCCTCGCGGAAATTGAGGAAATGCTGATCGCCAAGAGCGGGCGCGTATATCGGCGCGGCTCTGATCTTGTGCGCGTCAGCACCTTGGAAGAGCCGGTGCGCGACGGTGACGAGATTGTTCGCCACGCGGGGTTGGTAGAATTGCGTTCGGCCTCTCCGGCATGGCTCGCCATTGAGGCCAGCCGCGTAGGCAATTTCGCGCAGCGAAGCGGCAACAAGATCGTGCCCGTCGCGCCTCCGGCTGGCCTCATGGCCATGCTGGGGGCCGTTGCCGACGAAAGCCGCTTCCCGCCATTGCGGGGCCTCTCCATGACGCCGACGCTGCGCTGTGAGCAGCCGGGCTACGACCCGGAAAGCAGGTTGTTCCTCGCCTTCCCGCCGGGCATGTTCCCGCCCGGCAATATGACGCCGACGCAGGCAGAGGCGGAGGCCGCGCTGGTGCGGCTGGCGCATCCTCTCCGGGGCTTCCCCTTCGTCGCCGATGCCGACCGCTCAGTGGCGCTGTCCGGCATGATCGCTGCCGTAATCCGGGGGGAGATGCGGACCTGCCCTCTGCACCTCATTGACGCGCCCGCGCGCGGCACCGGCAAGACCAAGCTCGCTGAGATTATCGGCATCATGGGTACGGGCGTTCCGCCATCAGGCGTCACGCATAGCGATGACGGCGACGAGAACGAGAAGCGGCTGGTCGCCATTCTGCGGACGGGCGACCCGGTTATCCTGATTGACAATGTCTCGTCTGATTTAGAAGGCGACTTCCTGTGCGCCATGCTCACCAGCGAGACGGTGCAGGCGCGCATTCTCGGCCAGAGCGAACGGGTGCGGCTTTCTACGCGCGTGCTCACGCTGGCGACCGGCAACAACATCCGTATGCGCGGCGATATGGCGCGGCGGGCAGTTAGGTGCCGCCTGGACGCCCACATGGCCAATCCCGATGAGCGGAGCTTTGACTTTGACGCGGTCGCGGACGTGCGCGAGGCTCGCCCGGCGCTGGTTACCGATGCGCTCACGGTCGTCCGCGCATTTGTCGCGGCGGGGAAGCCTGCCACCGTGCCGCCCTTCGGCAGCTTTGAGGATTGGGACTTGGTGCGCGGCGCGCTGGTTTGGCTGGGCCACGCGGACCCGGCTGAGACGCGGGCAGCGGTAAAGGAAGATGACGCTGACGTAGAGGAGAAGGTTGAGCTTCTGCGGCTGCTTCACGAACATGTCGGCATTGGCGCGCGATTTACGATGGCGGAATTGGGATCGGCGTCTCGCCGCGAGGCACTGCGGACGGCGCTGGCTCGGATGCTTGATCGCGGCGTGTGGGATAGCAGGCGCGCGGGCAGGCTGCTGCGGCGGCACAAGGACGTGCCGTTTCTGGGGGTTACGCTCCGCGCCCGCCCGAACACGGCAAATGTGCAGGAATGGTGGCTGGCCGGGGAGCCGGAGGAAGCGTTGCTCGATTATCGGGGGGAGCCGGCATGTCCCTTCTAGCGCATCGTCCTGACACTATCCCGGGTTTTCCGGGTTTTCCGGGAACCGCACTAGCCCCAAGGAGCTTCGGCTTGTTTTCTGTTGTTGTTAACAACAACAAGAATAGCCGTAGTAGGAGTAGGGGTGGGCAGCATTCCCGGAAAACCCGGAAAACCCGGCGTCGCCCCCCGGTTTCCTCGCCCATCCCGCCTTCGTGTCAGCTAAAAAAGGGACCGCGACGGTCCTTTTCGGGCGCTCCCCCGGCCGTTTCCCCGGACCCGCTGACCAGCAATCAGCAGGTATCGCCACGCAAACCACTGGTTTTGCAGAAATCATCGAGCAGAAACCTCAGATTAGGACGGTAAAAAGGACGGCTCAGCACATGGAAATGCGCTATGGAACATAACATGAACATTGAAGCCAGCCCCCCGGCCACCGCCACCCTGCCGACGACGCGCGACGACCGCTACGGCACGGTGACGATCAACGCCCCCGGCCTCGCCATCATCAGCGACATGGCGAGGAACGGCTACCCTACCACCAGCATCGCCTCTGCCCTGGGCATGTCGCCGCGCATCCTGCGCGAATGCCGGAAGCGCCAGCCCGAAGTCGAGGAGGCATGGGCGACGGGCCTCGCGGGCCTCGAACAGGAGCTTGTGCATAGCCTGCTGGTTGCGGCCCGGAAGGGCGCGATTGCCGCCGCCATGTTCCTGCTCAAGACCCGCCACGGCTACCGGGAGACGGGCCAGACCGACGCCAGCCCCAAGGTCGCGGTGCAGATCAACCTGCCCGCCGCGATGGAGGCCAAGGCTTACGCGGCGATGGTCGAGGCGGAGGCCCGAATGGTGGGCGACGGGGGCAGCGATGGCGACGCCTAAGCTCACCACCCCGACGCCGTGGCAGGAACGGGTGCTGGCGGTGCCGAAGACCTGAACCTCGCCCTGCTGGGCGGGCGCGGCTCCGGCAAGACGACCGCGCTGGCGCTGCTCGTCCTGCGCCACTGCGTCCAGTACGAGGACAAGGCCCGCGTCCTGATCCTCCGGGCCACCTACAAGTCGCTGGCGAACCTGTGGGACGAGCTCGAAGCCCTGTTCCGCGACGCCTTCCCCGGCGGCATCACCAGCAACCGCGCCGATTTCGTGATCCGCTGCCCGAACGGCGCGGTCGTCACGCTGGGCAACCTGTCGTCGCAGAAGGACGTGGTGAAGTGGCAGGGGCAGGAAGCCAATCTGCTGGCCGTCGATGAGATCACCAACTTCACGACCCTGCGCCACATCAACATGCTGCGCGCCAACCTGCGCGGCCCGGCGGGCATCCCCACCCGCATGATCGTGCTGGGCAACCCCGGCGGGCCGCTGCACGCCACCATCGCCCGGATGCACGTGCATGGTCGGGTGCCGTGGAAGCCCTACGAGCTTCCCGATGGCTCCCGCTGGGTATATGCCCCCTCCACCTACATCGACAATCCGACCATCGACACGGAGCGTTACGCCCGGTCGATCATCGCGTCGGCAGGCGGCGACCGCGCGCTGGCGTCGGCGTGGTTGGAGAACAACTGGAACGACCTTGCCGGGGCGTTCTTCGCGGACGTGTTCGGCGACCATCTCATCATTCCCGACGATCCCGGTTTCCGGGTGCCGAAGGGCAAGGGCCACGGCTGGTATAGCTGCGTCGCGCTCGATTGGGGCTGGTCCGCGCCGACCGCCGCCGTGCTGGCCGTCCATGCCCGGCGTCCGGGTCTTATTGGACCCGGAGGGCGGGTATTCCCGCAGGGATCGTGGATCATCGTGGACGAAGTGCATAGCGCCCGCTCCGACGACCCCTCGCTGGGCAAGAGCTGGCCGCCGCAGATGGTGGCGGAGGAAGTGCTGGCGGCTTGCGAGCGGTGGGGCATCCGCCCGCATGGCGTGGGCGACGACGCGCGGGGATTGCAGAACGACACGCTGCTCGAACAGCTTGGACGGCACGGCCTGCACCTGACCAAGCCCACCAAGGATCGCATTTCCGGCTGGGTGAAGGTGAAGTCCCTCATGGCCGCCGCCCGCGACGGCGACCCGGACACGCCCGGCCTGTGGATGTCCGAGCGGTGCCGCTTCGGGCTGGAAACCCTGCCGCTGCTGCCCCGCGACGATGTGCGGATGGAGGACGTGGACACGTCGGCCAACGACCACTTCGCCGATGCCGTCCGCTACCTCGTCAACTCCGCCCCCCGGATTGTCACGTCTGGCCGCGTCATCGGCCATTACTATTGAGAAGGAGACTACCCCATGCCGACCCCTCTGGACGAAAGCACCCCTGTTCGCGGTGACGTGCCGCTGTCGATGCACCCCGACGCCCTGCTGAACGTCAGCGACACCCTCAACCGTCCCGACACCCTGGGCATTCCCGCCCTGTCCGCCGCGCGTGAGGCCCTGCGGCTCTGCTACGACTGCTATGGGCGGCTGAATGACGCGGAGCGGGACTTGCAGGCCGTTGCGGAGCCTGCCTTGCGCCGCCAGTATCCCGCCGACCGGGGCGGGCACACGGAAGTCAGCGGCAACGTCCGCATGGTCAACGGCAAGCCGACCCGCATCGTGGACGCGGAAGAGTTCGTCACCGCCGCTGAACAGGCGCTGGCCCGCGTCTCCCCGGCGGTTGACCGTCGCATGGCGGAGTTGAAGGGCTACCGCGACACGCTGGCGCAGCGGGTAGCGACCGCGCTGGACGTGCCCGCCCGCAAGTCGCCGGAAGGGCTGGCACTGGCCTCCGAAGTCCGGGCGCACATCAAGGCGATGAAGCAGCCCGCCGCGCGGGGCAAGTTCGTGCTGGACGCGGTGGAGGCGGGCGACTTGCCCACCGTCGCCGCCGTGCTGCACGCCCCGGCCTTCCTCTCCGGGCTTGATGTGGGCACCCATGCCCTTGTCCGCTCGCGGGCGGCGGCGCGCTTCGCGCCGGTGGATAGTGCCCAGCTTGACGCCGCCGAAGTCGCCATCGCTCAGGTCGCGGCGGCGGGCAGTGCGACCACCCGGCGCTTCGGTGCCGTGCTGGCGCTGCGGACCACCCCCGCCGTGAAGGCCGCGAAGTCGGTGAAGGCGCTGGCCGGGGCGGGCGCGTGACGCTCTCCGTCGCCGCCGCCAATCGCATTGCCCGCGCCGCTGCCGCACGTCGCCAGCGCGACGAAGCCCGCCGCCTCGCGGCGCTGGCCGTGCGCGGGGCCTACGACCCGCCCCGCTGGGTGCTGGACCGGCTCACGTCCGGCGACAGGATGGAATACGAGGCCGCGCGCGACGAGGCGCGGAAGGGCAACGTGTGACCAAGCCCAAGCCATACAAACTCGGCCCCAAGCATCCGGGCCGGAGCCGGTTAGCCGGGGGTCATTCGCCAGATACTCAGGTTCGGCCCTGTATCACCTGCGGCAAGCCCTTCCCCAGCGAAGGGCCGCACAACCGCATGTGCCACGATTGTCGCTACTACCGGGACACCAATCCCTACGAGCCATAATCTGCGGCCCCTCTCAAGAAGTGTGTTGCCGAAATAACACACCATTGGGCAACTGGAAGTCTCCAGAAAAGGAGACAACCCCATGCCCACCAATCCCCGCCCCGACCGGCTGCACAATCGCCCGCATGGTTGGGGCCGCATCCTCGTGCTGGACACCCCGGCCAATTATAGCCCCGACCAAGCTGACGCCCATGCCCGGCTGATGACTGCACTGCGCGCGGGCGATGCCCCCGACGAAGCGGACGTTCGCCTTCTCCGGGCCGATTGGGATGGGCACCTAGAGGGCCTCTCGCAGCGCGCGGAAGCGTGGCGGTGGGGAGCATGACAGTGGCCGCTGGGATAGCGGTGCGCGTCCTGACCTTCCTGTTGGTCGTCCTGTTGGTCGTCCTGTTCGCCTTAATGCTGGACATGCGCCACGACCGGACCGGTAGGGAGGATTGAGGAGAGGCGCTAGGTTGGGGACTCATTTGATCCACTCGGCGATGGTGGCGACCAGTGCGATGGCGGCGAGGTAATTGGTGGCATATCGGTCATATCGCGTGGCGATGCGGCGCCAGTTCTTGAGCCTGCAGAAGAGGCGCTCGATAACATTCCTGCGGCGATAGATGCGCCTGTTCAGCGGGTAGGGCGTGCGGCGTGAGGCAGTGGACGGAATCACGGCCTTGATGCGCCGCTCGGCGAGCCAGCGGCGCAGGCTGTTGGCGTCATAGGCTTTGTCGCCGATCAGGCGTCGGGCCGGAGCGGTGACGCTTAGCAGCGGCACCGCCATGCTGATGTCGGCGACATTGCCGGGTGTCAGGGCGATGGCAACCGGTCGGCCGCGATCATCGGCCAGGCAGTGGACCTTGCAGGTCCGCCCACCGCGCGAGCGGCCGATCGCTTCCTGCCACTCCCCCTTTTTGAGCCCGCTGCCGAGCGGTGCGCCTTCACATGCGTGCTGTCGATCGACAGTTCGTCCGGCACCGGCCCGGCACCAGCGATCTTCTCGAACAAGCGCTGCCAGATGCCGCGCGATGCCCACCGGTTGTAGCGATTGTAGATGGTGGTCGGCGGCCCATATGCAGCCGGAACATCGCGCCACCGGCAGCCTGTCTTGAGCACATGCAGGATGCCAGAGATCACCGTCCGATCGTCGACCCGAGGCTTGCCCGGGCGCCCATGCGGAAGATGTGGCTCGATCGCCGCCCACGCCTCATCCGACAACCAAAACAAATGCCGTGCCATCGCCGCTCCTGCTCGAACCTATCGCAGGCAGATATTTACCAGAAAGTATCAACAACTTGTATGGGTTCTCAACCTAGTGCTGCGGCTGGAACAGGAAGAACACGCCGACCGAGTGCGCCAGCGCCAGAAGCGCGCGAGGATGCGGCCTTGCTTGATATGCCGGATATACTGGCTCGCCTTTGCCGCCCTCTCGTTGTTTGCGGCCGAGCCGCTGGGGGGAGGGGCCTCCGTCGAGGCAGGCGAAGTGTGGCGGTGGGGAGGGTGAAATGAAGCAGGCACTGGTGACAACGGCTATGATGCTGCTGGGCGGCTGCTCCATGCGCCCGGACGGTCTGAACTGGCTCCCGCCCGAAGCCCACGGGTCGCAACCGGACTTGCCCGTTGCGGGCGGCGGGACGGCCGACCGGCTGCTGATCGTCCGTGATGGGCGGACGGTGGCGAGCGTGCCCACCGAGCCGGGGCGCGGGGGCGTGGTGTATTTGCCACCGAAGTAGGGAACAGCCGAAAGGCTCGGAAGCGCAAGCGTGAAAAATATTTGCTCACCCCTTGAAGAGCGAGCTTCTGTTCCTACTTTGTGCTCGTCCCCGGAATTGGCCGACGGATACGATCGTTGCAACGTGCAACCCGCCATATGATGGCCGCGATTACGGCTGTGCCGTAGCCGAGCCACCCGGCGAGTTCGATACTCGGGTCATTGAGCATAGCTCGGACCTCCTCGATCATACTATACCACTTAGCGTTTCAGACCCGCACCGAAGAGCGGCTCTACTCGGGGCCTAACAGGGGTGGATTGTGGTGTGCGTCCGGGGGCATCCAGTTAATTCTGGGGTTTCGGGTCAAGACTGGCCCCCGGACTCTGCTAATGCTACGACTGGCGTATGAAGACGCCTAGTTGGTTCCGCGTCCGCGGATACTCTCACTTTGACGTGCCAGTGAAAGTCGCTTTTGCGGAAGGCTTATGCCCGGAGCAGGTGATCCGGCACCCTTGGTCGCCTCTGATCCACTACATAAAGACTGAGAAGCGCTACAAGGCGAGGGAACGCAAGACTGTCCCCAAGGAGCGGCCCATCATGTTCGCATCCCATCGGGATGCCTGCATTCTCAGCAAATACTCAGCGGAGCTAGTTGCGCTGCTTGACCGATGGTATCGAGACAATTCGCTGGACGAGACTGTGATCGCCTATCGCTCACTCGGACTCTCCAACTATCACTTTGCCCGACGTGTTCAGGACTATGTGCGCTCCCAACCATCACTGACGGTCATGTGCTTCGACGTGACGGGTTTCTTTGACAATCTCGACCACGGCCGGTTGAAGGAGCGTCTCCGGTGGATATTGGGCGGCGGCGAGATTCCGGGGGACTGGTACGCCATCCTCAAGGCGATTACCTGCTATCATTACGTCAACCTGGACGACCTTAAGAAACACGATCACTTGGCCCAGCGCATCAAGGAGCGAGGGCACCATCCGCTTGCCACCATCGGGCAAATCAAGGCCCTTGGCGTGCCGATCAACAAGAACCCCAACAAGGTCGGCATCCCGCAGGGGACGCCCATCAGCGCCAGCTTCTCGAACCTGTACATGACCGCGCTGGACATGGAGCTTGCAGTGGAGGCGACGAAACGGGGCGCGCTATATCAACGCTACTCCGACGACATCCTGATCGCCTGTCATCCGACATCCGCAAATACCCTTGAGAAACTGGTCGAAGATAGGCTGCTCGCGTGGGGGCTTGCGCTACAGAAGGCGAAGACGGAGCGGGTGACGTTAGCGGGCGCCAGCACGTTGACCTTTCAATATTTAGGCTATCAGTTGGGGCACGTCGAAGCGAAACTACGGCTCAAGTCGCTATCGCGCCAGTGGCGGACGGTGAAGCGAGCACTCAAGAAAACGGAGCGTGTCGGCTCGAGCGCAGTAGCGGCTGGGAAGGCGAAGCAGATTTACACGCGCAAGCTCAACGTCCGCTTCACCGACGCGGGACCGCGCAATTTTCTGGCCTACGCGGATCGCTCAGCCGATACCCTGGAATCAGGATCGCTGCGTAAGCAGGTGAAGCGGCTGCGGCGGCACGTGCAGGGGGAGATTGCTCGGTTGAAGGGCAAGCCTCCGAAAAAACCTTGAGGCCCTGCAACCGTCTTAATTCCCGTCTTTCTAGGACGGTGGCCCCTCCGGGTCTGCACTAAGTCGTTGAAAAGATTGGTGGACGCACTAGGGCTCGAACCTAGGACCCGCTGATTAAGAGCCGAAATCAGAGGGTCCTTGCGCTTCCGCATGGGTCCAAAAGCGGCAGAACTCTGCGATTCATAGTCCGTATTGTTCTTGAAATGTTCGCCAGTATTCACTACAAAAAGCTACCTAAGGTAGCAGGGACATATGGCGAACAAAGTTGGTTTGACGGATGCTCGAATCGCAGGTTTGAAAGCGCCTGCAAGCGGGCAAATTGAGGTCGCTGATGGCATCGTGACCGGTCTGCGGCTCCGAATGGGAGCGAGCGGCACCAAAACTTATATCTTGCGTAAGCGCGTTCAAGGTAAATGGTTGAACGTGACTATCGGGCGGCATGGCCCGAATTTCACTCTTGCACACGCCCGTCGGAAGGCGCGGGACCTGCTCGTCGACGTTGAGCAAGGCAAAAGCATCGCCAGAAAGCCGGGAGCGAAGAGGAAGGGATCGAAGGGTGTCGGCACCGTCGCCGAGCTATACGAGACATATCTGGCTCAACAGATCGTCGGCAAAAAGCGGAGCGCGAATGAGTTCGACCGGGTTTTCCGCAAGTACATCGAACCTGAGCTAGGCGACCGCCTCGCCGATTCGATCACCCGAAGCGACGTCAGCCGCTTCGTAGAGAAGATCGCATTTGAGCGGGGCAAGGAAACCCTGACGATGGCTCGCATCGTTTATCGGCACCTTTCGACGTTTTACTCATGGGCGCTCACCAGACTTGAACATTTGCCAGCCAATCCGTGCCGGGACGCCTGGCGCCCAAAAAGGAGCGAGCCTCGCGACCGGGTGCTCAGCGATCGAGAGGTCGCTGCACTGTGGCAAGCCGCCGTCGAGGATGGCTATCCGTTTGGTCATCTTGTGCAGATGTTGATTCTCACAGCCCAGCGCCGGGGAGAGGTGCTCGACGCCACTTGCGACGAGTTCGACTTCAAGGGGAAAGTTTGGACTGTACCAGGAGATAGAGCGAAAAACGGCAAGGCAAATTTGGTGCCTTTATCCGCACAGGCCCTCGAGGTCGTCACCGACATCTTCGCGGCCGCCGGGGTCGCGCCTGAAGACGCTCACAAGCAATCCCAAATTCTATTGGCATCCAAGGTGACCAGCACAAACAGTGTCAGTGGGCTGTCAAAGGCCTGGAAGCGGATAAGGGCAAGCGTGGACGAGAAACTCGGCTATGAAGCCGCTCATTTTACCATGCATGACATTCGCCGGACGGTGGCGACCGGACTGCAGAGGATTGGAATACCGCTGGTCGTTTCAGAGGCCGTTCTCAATCATCAGTCCGGCTCGGCAATGGCTGGTGTCGCCGGGGTTTATCATCGGCATCAGTACACGAATGAAAAACGCGAAGCTCTCGCACTGTGGGGCAGGGAGGTCCTACTGATCGTCGCGAAATATCCGCCGCAGGATAGTCAGGAAGAATGACTCTCCACCCCGAGCCAGTCGGCGCGTATCCGATCCATGCCCGCGACAACGCCTTCAACGGTCACTGTATATCCAATGAAGTCCTCACGAACATCATCGCCTTCGATAATCCAGACATCGTCAGAAGTGGTAGTCAGGATCATTCCGCGTCTGCCACCGCTTAGCGTTCCTGACACTCGTATCCGACCCGCGCTCACAACGTCCCCCGTTCAAAAACGCCAATGGCGCAGCACTTCGCGCACATAGGCCGGCGTCTCGCCATTGCGAGGAATACCGCCTGCGCGCTCGACGGCTCCTGGACCTGCGTTGTAGGCAGCGAGGGCCAGATGAACCACCCCGAACTTGTCCAGCATCTGGCGTAGGTATCGGGCCGCACCCAAGATGTTCGCCTTGGGATCGAAGCGATTTGAAACGCCAAGGTCCCGCGCTGTCCCTGGCATCAATTGGCCCAAACCTGCGGCGCCAGCCTTGCTGACGGCCATGGGATTATATCGTGATTCCGTCCATACAAGAGCGTCGAGAAGGCCGCTTGGTAGCGAGTATTGAGCCTCAGCAGCGTAAACATGAGGAAGGTAGCTTGCCCGACGGAAGCCCGATGGCGCGCTGTCTTGGGGCTTTGGATATCGATAGGGCAGATAGGTTCGACCCGCATCGGTAGCCGGCGGCTCGGACGAGGCCGTTGGAGGCTTTCTCCAAATGCCATGTTCGACGAGACGAAAGCCATCGGTTCCTTCTGCAACGCGGAAGCCGTCGGTCGTCAACTCCGAGGCAACGGTCATCTGAGGGGGCTCCATCTCCTGCGCGTGAGCGGAGAGGACGGACCCAAATCCTGTCGTTGCCCCTGCAACTACTGCCCATTTCAGTTTCATTCCCGAATCCTTTGGTGGTCGAATCGGGAAAGAACATATATAGAACATCCGATGTAGGAAAATGAAACGTCAGCGACGCAGAGCGGAGGCTGCGCGGGTGGAGGCACGTTACGATGGAGATGCCCCCATGCACCAACACCGATCATCCCAATTCCAAGGTCTGCAACTGGAAAACCGCGCGCGCAAGATAGTTGAGCAGCTGGGCGGTGCCTGGTCGCGTTCGCGCGGAATGTGCTGTTGCCCAGCCCACGACGACCGCACTCCTTCGCTAAGCATCACACTCGGAAAGCGCGCAATCCTTGTTCATTGCTTTGCCGGCTGCACGAACGAGGCGGTGATAGAAGCTATGGCTGGGCTCGGAATACGAATTGCGGACCTGTTCGATGGCACGAGTGGTCCGATCGTGGCCGAACCGCGCGAGGAAGTCGCCAATCGCAATGCGCTGAGGCTCTGGCGTGAGGCGTCGACCATCGCTGGCAGCCCCGCCGAAAAGTACCTCGTGTCCAGAGGCATCACGATTTCTTCGCCGGAGCTGCGTTTCCACCCGCATATGCCGCTGGGGCCAAAGGGTGCTGTCCGGTTTCTACCCGCGATGGTGGCGGCCGTGCGCAATGATGCCGGAATTCTGGCGCTGCACCGCTCCTTCCTTGACCTCGACAAAACCAGCGTGGCTTCTTTCGATCAGCCCAGGCGTGCGTTAGGCAGCCCCGGTTCTGGCGCGGTGCGTTTCGCGTACCCCAATGGGGGACGCCTTGGCTGTCAATCGGCGTCCAAACGGGACCCCCGATCGGCGCGCAAAAGGGACCCCCTCTCAAGGATGGGGCGACGGTCGAGGAACGCTCCTTGCGCTGCGCGCGGCGTAGGGAGGGCGGAGCCCGACCGGAGACGCGCGCAGCGCAAAGCATCTTAATCCCGGCATGCGGTGGGATCAGTTGCGGTTCTTGAAGCGCCAGCTGTCGTTGCCGGTCTCGATGATGTCGCAATGATGAGTGACGCGATCGAGGAGCGCGGTGGTCATCTTGGGATCGCCGAACACGGTGGGCCATTCCCCGAACGCCAGATTGGTGGTGATGACGACGCTGGTTTGCTCATAAAGTTTGCTGACCAGATGGAATAGCAACTGCCCGCCTGAGCGGGCGAACGGCAGATAGCCAAGCTCGTCCAGAACGACGAGATCGAGCCGGGAGAGCTGGGCGGCAAGCGCGCCACTCTTGCCGATCCGGGCCTCTTCTTCGAGGCGGGTCACCAGATCCACGGTGTTGAAGTAACGCCCCCGCGCCCCGGCGCGGACGACATTGGCGGTGATGGCGATGGCCAGATGGGTTTTGCCGGTGCCTGTGCCGCCGACCAGGACGACATTGCGCCGTGCAGGCAGGAACGCGCCGCTGTGTAGCGAACGGACCAGCGCCTCGTTGATCGGCGTCCCCTCGAAGTGGAAGGCATCGATGTCCTTCACCACCGGCAGCTTGGCGGCTGCCATCCGGTATCGGATCGAGGCTGCGTGGCGATGGGTTGCCTCCGCGCGCAGCAGATCGGTCAGTATCTCCATCGTCGTGCGCTGCCGCTGAAGGCCGGTGGTGACGGCCTCATCGAACGCGCCGGCCATCCCCTTGAGACCGAGCCCGCGCATCGCGTCGATCATGTCATGCCGCTGCATGGAAGTTCCTCAGTTGGTCGTAACGGGCACAGTCTGCGATCGGCGGATGCCGCAGGGCGTTGTCCTCCGAAGTGACGATCGTCAGCGGTCGCGGCGGTTCGCGGCGCCGCGCCAGGATATTGAGGATCAGATCATCGCTCGCCGTGCCGGTCGCCAGCGCCTCCCGCACGGCGGCCTCGACGAGTTCCAGACCGTCGGTCAGCACGGCCGCCAGCACGCGCACGAACCGCCGATCGGCATCATCGCCATTGCCCAGCTTGCGGCGCAACCGAGCCAGTGCTGGCGGCAGCTCCCAGTCCTGGAAAGGGGCGCCGTTGCGCAAAGCGCCGGGCTTGCGGGCCAGCACCGGCAGATAGTGCCAGGGATCATAGATCGTGCGGTTACGCCCGAAGAAGCGGGGATGTTCGGCAACGACCTCATCATCGCATCGGACGACAATGCGGTCGGCATAGGCCCGGACCTGCACCGTGCGTCGTGCCACCGTCGAGAGAACCGAGTATCGGTTGCGATCGAAGCTGATCAGGCAGGTGCCCGACACGCCATGCTCGCTCTCGTTGAAGCCGTCGAACGGCCCCAGCATCGGTTGCAATGCGGGGCGTTCCATCTCCAGCATCTGCGCCACGGTCTGTTCGCCCTGTTCAGGATGCGCCTGCCGTTCGGCCCAGCGCCGGCACTCGGCTTCCAGCCAGCCATTGAGCTCCTCAAGGCTGGCGAACCGCAAACGGGGCTGGAAGAAGCGCCCTCGGATCGTCTGCACCTGGTTCTCGACCTGGCCCTTCTCCCAGCCCGCCGCCGGCGAGCAGGCCGTGGGCTCGACCATATAATGGTCGGCCATGATCAGGAACCGCCGGTTGAACACCCGCTCCTTGCCGGTGAACACGCTCGTCACCGCCGTCTTCATATTATCGTAGATGCCGCGCTTGGGCACGCCACCGAAAAAGGCAAAGCCGCGCGCATGCGCGTCAAACAGCATCTCCTGGCTCTCGCGGGGATAGGCCCGGACATACACCGCCCGCGATGCGCACAGCCGCATATGCGCGACCTTCACCCGCATCGGCTTGCCGTCGATCTCGACATCCTCGTGGCTCCAGTCGAACTGGTAGGCCTCGCCGGGCTGGAACAGCATCGGAATAAAGGCCGTCGTCCCATCGCCGACATTCCTGCGCCGCTCGACCTTCCACCGTGCCGCATAGCGACGCACGGCATCGTAGGAGCCTTCGAAACCCTCACGCACCAAAAGGTCGTGGATACGGGTCATCCGAAGCCGGTCACGTCGGCCCAGCAACTCATTCTCTTCCAGCAGCACATCAAGACGATCCTGAAACGGACCGATCCGCGGTAGCGGCTGAACCTTGCGTCGATAATCAAATGCGCCTTCCGGCGCCCGGATCGCCTTGCGGATCACCTTCCGCGACACATGCAGATCACGCGCGATCGCCTTGATCGCCTTCCCGCCGGCGAACTCTCGCCGGATCCGAACCACTGTCTCCAAAACCAACAT
Protein sequences of DBSCAN-SWA_4 >NC_020561|2554304:2607859|2566563_2567292_+|WP_015449229.1|DBSCAN-SWA MQRHEMLAALKGLGLKGMIAAFDDAVTNGIRRDRTAMEMLGDLLRAETAHREAASIRYRMTAARLPAIKDLDGFVFADTPINESLVRSLHAGSFLPERRNIVLVGGTGTGKTHLALAITAAVVRAGARGRFFNTVDLVNRLEEETRQAKAGSLAAQMARLDVVVLDELGYLPFARSGGQMLFHLISKLYEKTSVIITTNLAFGEWPSVFQDAKMTTALLDRVTHHCDIIETGNDSWRFKNRS >NC_020561|2554304:2607859|2567426_2569658_-|WP_187293999.1|DBSCAN-SWA MAEDALGTFEQIVSAAASLIGQRPAGVDAFAGINELTASRAVENLGRVQGELASAYRHLRDEPAIARVVVADENDRRETLYICRATPVTCGGVQLCSNLGPKGRLAALPVGDFARIRLPSGAVDLEVVEKAVFKPTCPGGMWDSQPTVFQAEGHGPLTIVSLRELLAAAGYADDDLDALERQLADDDEAVNVVEGLRRSVLTAMQLRDQPILDQFQDEIFRLPLDSRLAILGPPGTGKTTTLVRRLRQKVDFQFLSEEEQDLVETADANGPPHNQSWLMFTPTELLKQYVKEAFAREGVPAPEQRIRTWSDHRRDLARRSLPILRSATGGTLVINERVDPLQPATITDQIAWFEDFNAFSSAIFLEQISAHAAAIGGSADPTIARIGSRVAGLLSSASTRPVAAIAAVSAVFEELRKTSSSLNDEIRQRLRRPLARQVAADPTLLDDLARFIVTLSPDVEDDEDAEGDDEDEVPQQGRRAAQDAFLKALRGRAVAEARKRPVSRTSRNGRVLQWLTDRGMMLPEMADIGELVVVQRAARRLVASPTAFVRGVPARYRTFRRARLAVGRWYQESDIAGGEVSPHEVDVILLAMLRNARDMLDDTNLMRRLGDRTPQILSDVSRLQRNQILVDEATDFSPVQLACMAALASPWTNSFFACGDFNQRLTVWGARSVDHLRWIFPDIDVREIDVAYRQSRRLNELASALAAKDGKASTRMCQPARKVDPLSAPNIDPLFREFRRLSR >NC_020561|2554304:2607859|2561395_2561971_+|WP_144062144.1|DBSCAN-SWA MGEVVVLADFTHLEDQIRDCYGRVVYTHKTHEKMADRCSRTLRRFKIAQIVLTAVTASGAFSVVFLDDTLLKIATAVASIASLIISGYMKGFDPGATAQKHRDAAASMWPIRESYLSLLTDLRMKRISDDEAVKERDALQAKLAAIYRGAPQTTGDAYTDAQDALKNKEDLTFSDAEIDCFLPTSLRKTAA >NC_020561|2554304:2607859|2602557_2603832_+|WP_015459188.1|integrase|DBSCAN-SWA MANKVGLTDARIAGLKAPASGQIEVADGIVTGLRLRMGASGTKTYILRKRVQGKWLNVTIGRHGPNFTLAHARRKARDLLVDVEQGKSIARKPGAKRKGSKGVGTVAELYETYLAQQIVGKKRSANEFDRVFRKYIEPELGDRLADSITRSDVSRFVEKIAFERGKETLTMARIVYRHLSTFYSWALTRLEHLPANPCRDAWRPKRSEPRDRVLSDREVAALWQAAVEDGYPFGHLVQMLILTAQRRGEVLDATCDEFDFKGKVWTVPGDRAKNGKANLVPLSAQALEVVTDIFAAAGVAPEDAHKQSQILLASKVTSTNSVSGLSKAWKRIRASVDEKLGYEAAHFTMHDIRRTVATGLQRIGIPLVVSEAVLNHQSGSAMAGVAGVYHRHQYTNEKREALALWGREVLLIVAKYPPQDSQEE >NC_020561|2554304:2607859|2585994_2586762_+|WP_144062146.1|DBSCAN-SWA MPTAAELVPIVGSERALAKLPGTGGAVAREDGTVLVTGRGYATRDGVIPAAEIYFVDGLRPAGEGVWLGEADKVAWRDETTGYECIMLRATRGGYLSGYVGVPRDHPLWGWEHGAVGPDLGVEVHGGLTYSRICEDGPSPERRLVEEARRICHVPTLPSQYEPVIHATGHRPGDVHAWWFGFDCNHAYDLVPEADRQPQPFLGAEVGAEYRDDAYVIQEILNLAAQLRAVADGVPAPPRQGPPLPPIGLDPHAGG >NC_020561|2554304:2607859|2577287_2579339_+|WP_015459164.1|DBSCAN-SWA MAQKTQTPSRTKSLFTADLVVPAIRASFTKLNPRELVRNPVMFVTAVVAALMTVLLVIGQDDLSTGFKMQLVVWLWLTVLFGTFAEALAEGRGKAQAASLRATKAELTAKRLKGDGRQYDNVAASQLKIGDIVLVETNDLIPSDGEVVSGVASVNEAAITGESAPVIREAGGDRSAVTAGTRVISDEIRVKVTVEPGKGFLDRMIALVEGAERQKTPNEIALTLLLVGLTIIFLIAVGTIPGFASYAGGSIHIAILAALLITLIPTTIAALLSAIGIAGMDRLVRFNVLAKSGRAVEAAGDVDVLLLDKTGTITIGDRQASEFRPVGGVAPEALAEAALLASLADETPEGRSIVVLARDRFLVPTAVLPDGAEVIPFTAQTRISGVRIGGALIQKGAVDSVLRANPGLGETAGATELRRITDEIARAGGTPLAVARDGRLLGAIFLKDVVKAGIRERFGELRAMGIRTVMITGDNPLTAAAIAAEAGVDDFLAQATPEDKLELIRKEQQGGKLVAMCGDGTNDAPALAQADVGVAMNTGTQAAREAGNMVDLDSDPTKLIEVVGLGKQLLMTRGALTTFSVANDVAKYFAIIPAMFVALYPGLGVLNVMGLATPQSAILSAIIFNALIIPLLVPLALKGVAYRPMGAGPLLARNLAVYGLGGLVAPFIGIKIIDLVVGGLGLA >NC_020561|2554304:2607859|2570279_2571143_-|WP_015459159.1|DBSCAN-SWA MIRTSRLTARIAGAAAFLVGLATPFAAAAQSRASELHTFHCLQGCPIGAPATNDIVVREIYTLSSNDLTKLADWVAYRVTREGIGVSGDRDWRRDPWLSSDETLTPDAYDGASGALHIDRGHQAPLSSFSGTPHAADTNILSNITPQSSALNQGPWVRLEDRERALATRLGVPVYVYTGPLFERMMKPLPTGGPYQRVPSGYWKVVALADGRATAFVFDQAAARRSDYCDARVSLLHVELRSRLVLFPRATVPFASLDTELGCGEPAPADPTPDEIPPERPATRRGR >NC_020561|2554304:2607859|2557842_2560947_-|WP_015459151.1|DBSCAN-SWA MLQTVLDVCKFKQDAIEVALTKQIEDLADLVGHSEADARTFFDKTYVTEGMATLLRMTMQRLAGLNDQAAFELRQAMGGGKTHSMLAAGYLAAHPQLAADVDPAIVKGFAPVPAKVVVISGRNIPHDVYLWGSVARQLGKEELFSKFWSNGAKAPNEDDWRALFGDEPTLILIDELPPYLENAVTISVGGGHLGDVTSHALANLLSAATKLPRLCIVLSTLVGQYKASGELSKIISQISAEATRQTKPITPVELGSDEIYRILRKRLLAEEPDGGLITTVAETYGQLLSDAVTANMITKSAEKIADEVSATYPFHPGLKTVVATFKDNENFRQTRGLMRIAALMIKAAQGRQYNDVYLLGPQHLDLAERETRDFVNSIYNLDAAITQDIVDTGASDAHAEAIDANAGNDAASQAARLILMASLAEANDAVKGLQEDTIRAYLIAPGRPESDIIAGFDALVQRSWYLHKRENGAWIFSKNENLTKKIENTARTTAQPKIDADLARRLIELFTPKRKNAYGRVLALPKLEDIDTKGERLLIVLSPDAQIPPEKARMLFEGTTEKNNFAIVSGDDHSLASVEEKVRTAYAVEKVIQAEGPNSQNRADLEKRQTDAEIDVYATIASTLNKIWFPMRVPGGTDGLVSTALKLDAHRRTDGAGYDGEAATEGALTDVNSQKLILSVEENYAKLRDRAQDMLWPASERRARWADVQDRAASNVRWLWLPRGGLDELRKIALSRGDWRDSGTGWIEKGPFEKEKTSVNVDVLNYDEKTGEATLFISPKNAGSNPVIYWSNSSDVSESSTVLDDTTVRTTDMKRFFLAVDPDGNHQKGAATEWLNSLNLTHEPKPVGGGYEVELTVKPTGTIRWNIDGTNPAEGAVYDGPIKLDGKTDVTIYAHAEAEGVVIRKEFRVNRRDGAGQMIDGEKPATIRKPISLGTTEKVFAAVTKAKEAGITFRSVVVIVGSGGNNVSTSFSGDVPVKPGAIEEVAKFARAQLGNELAEVKVQWKSASVDRAAEIDAFMGAIEEIYTGTDIEQA >NC_020561|2554304:2607859|2604041_2604611_-|WP_144062042.1|DBSCAN-SWA MTVASELTTDGFRVAEGTDGFRLVEHGIWRKPPTASSEPPATDAGRTYLPYRYPKPQDSAPSGFRRASYLPHVYAAEAQYSLPSGLLDALVWTESRYNPMAVSKAGAAGLGQLMPGTARDLGVSNRFDPKANILGAARYLRQMLDKFGVVHLALAAYNAGPGAVERAGGIPRNGETPAYVREVLRHWRF >NC_020561|2554304:2607859|2574486_2575359_+|WP_015459162.1|DBSCAN-SWA MKFLVTVSATALFAWPLAAQAQSTAEDKVAAEAVPPSVLTVSGSATLASDYRFRGVSQSDQEMAVQGGLTIAHDSGFYVGAWASNLAGWGTFGGANMELDLIGGFKAPLSDNATLDIGLTWYMYPGGADKTDFAEPYAKLTGTTGTATLTAGVAYAPKQQALGRWYDTGTEAAAGIYNNPGAKDDNIYLWGDAAVGIAGTRITAKAHIGHSWGQDGLGPNATAVAPTGEYWDWSLGADVTWRNLTFNVSYVDTDISVSEASRLRPSFSKGQDGTGNIAGSTVVMSLTAAF >NC_020561|2554304:2607859|2606344_2607859_-|WP_015460387.1|transposase|DBSCAN-SWA MLVLETVVRIRREFAGGKAIKAIARDLHVSRKVIRKAIRAPEGAFDYRRKVQPLPRIGPFQDRLDVLLEENELLGRRDRLRMTRIHDLLVREGFEGSYDAVRRYAARWKVERRRNVGDGTTAFIPMLFQPGEAYQFDWSHEDVEIDGKPMRVKVAHMRLCASRAVYVRAYPRESQEMLFDAHARGFAFFGGVPKRGIYDNMKTAVTSVFTGKERVFNRRFLIMADHYMVEPTACSPAAGWEKGQVENQVQTIRGRFFQPRLRFASLEELNGWLEAECRRWAERQAHPEQGEQTVAQMLEMERPALQPMLGPFDGFNESEHGVSGTCLISFDRNRYSVLSTVARRTVQVRAYADRIVVRCDDEVVAEHPRFFGRNRTIYDPWHYLPVLARKPGALRNGAPFQDWELPPALARLRRKLGNGDDADRRFVRVLAAVLTDGLELVEAAVREALATGTASDDLILNILARRREPPRPLTIVTSEDNALRHPPIADCARYDQLRNFHAAA >NC_020561|2554304:2607859|2600451_2600679_+|WP_015459186.1|DBSCAN-SWA MKQALVTTAMMLLGGCSMRPDGLNWLPPEAHGSQPDLPVAGGGTADRLLIVRDGRTVASVPTEPGRGGVVYLPPK >NC_020561|2554304:2607859|2560962_2561238_-|WP_015459152.1|DBSCAN-SWA MGENSNLDLLRLVGEALYGERWQAPIAADLGVSDRAVRYWLSSANPCPDDVGTRLLKVIISKRDRIVDLEDAVRKHLASTAPSDTATPTAS >NC_020561|2554304:2607859|2582594_2583275_+|WP_015459167.1|DBSCAN-SWA MSAKVLVIDDDAAIRRLLRNTLERAGYAVIEAVNGRDALGQAASQHPDAILLDLGLPDRDGLSLIPLLRTDNGVLLVVSAREATDEKVSALDLGADDYVTKPFDTEELLARLRVALRHRSMSETAPKVIRKGDLSIDLDRRVVCRAGEELHLTRKENDVLAVLARHVGRVVTHERIIAAAWGADEDPRIEYLRIVIRNLRQKLEAPGPVGSVIANELGVGYRLRAD >NC_020561|2554304:2607859|2563532_2564417_+|WP_015459155.1|DBSCAN-SWA MGVGEDFSRFKDGYNIDASTMASISYRYRRITRQLNKDFWNTESETTHSLYVGSYGRDTAARGLSDLDVGFVLPNSLYHQYNAHLGNGQSALLQAVKRSIQKTYTTSESFGDGQVVVVSFTDGITFEILPAFDNSDGDSWTYPNANGGGSWKTCNPRAEMRAVDTRSLLTNRNLKYLCRMMRVWRDVHSVPMSGMLIDTLAYQFIEGWAHRDKSFFYHDYMARDFFLYLSQRDTTQSYWRAPGSGALVARKGAFERKAASAYAKALEAITYDANGHDWSRRQKWREIFGTLFPA >NC_020561|2554304:2607859|2588327_2588702_+|WP_015459174.1|DBSCAN-SWA MTLQSSRVAADPFLLRFATPRSADAEMPGRYSPDLGVWVIDRDGGEVPIIEVAGGSLVATQSKTMTHVEVDDDDPARFGSMETGTSTRVRQEADDEDASLCLPELTTKTDVQQERDDETVTAYW >NC_020561|2554304:2607859|2586764_2587544_+|WP_015459172.1|DBSCAN-SWA MDPSDWHFGDLPWGWWLRARDGVLEDQDGRTWCTVRDAFWHGELAMPSSNVVREQVELLQRVLTAIHGRWLGGAERQQDLFDGSMVFWRFYPCWLASIGLIEPRARGTVLEAPLTPKGRSVMLMLQATRDPAWEALPMSEVLEAVSAAERGTADEAREQALRDFERSLGRRRHLFAREQVGRSHLVTLTGVAIDARMPTLRVMWSQAFATERARDDLFAWLATRVDRWDDWGLLAYRKGADALTRHLFSLVLLDGSTSS >NC_020561|2554304:2607859|2554304_2555822_+|WP_015449228.1|transposase|DBSCAN-SWA MLIVETIAKIRREHRDGKPIKEIARDLRLSRNTVRKAIRAPEADFSYERKEQHRPQTGPFRERLDELLAENEERPRRERLRLTRIHDLLEREGFTGSYDAVRRYAARWKQERHAGGSGDMSKVFIPLMFRPGEAYQFDWSHEDVEIAGKPMRVKVAHMRLCWSRAPFVRAYPRETQEMVFDAHARGFAFLGGVPTRGIYDNMKTAVTTVFTGKERVFNRRFLIMTDHYGVEPVACSPAAGWEKGQVENQVQTGRERLFKPRLRFASMEELNAWLEAECRRWAERYAHPDMEDMTIAQALEMERPSLQPLTTPFDGFFESEHVASSTCLVSFDRNRYSVMAVAARHAVQLRAYADRVVIRCAGKVVAEHARLFGRNQTKFDPWHYLPVLIRKPGALRNGAPFQDWDLPPALAQLRRKLGKSDDADRRFVRVLAAVPEDGLEAVEAAVREAMAAGTANDEVILNILSRRREPQPVQAINVVVDLRLKHPPIADCARYDTVRGLNAAA >NC_020561|2554304:2607859|2599043_2599286_+|WP_015459185.1|DBSCAN-SWA MPTNPRPDRLHNRPHGWGRILVLDTPANYSPDQADAHARLMTALRAGDAPDEADVRLLRADWDGHLEGLSQRAEAWRWGA >NC_020561|2554304:2607859|2572712_2573936_-|WP_015459161.1|DBSCAN-SWA MPVTQNLWKASLIATLACAPAPGALAQSSSPGDVVVMRRVVAKPTNATHIPDQPVEDDGRWVEGAWQWTGFAACTDRAPRSRAVTCMTDDKPAEEARCRLPKPATTDFAERRDGCTFDWRTGDYGAWSSTCADPATRTRAVQCLRSDGIPAPDDGICTGPKPDVLDEAEQISGCAWEDRGWNNTSGCGAVTEQRDVACRDTSSGAQADPSLCLHAAEAGLIGDLGSLTRSATDYSACSDNEAFDKGLKGYEEALVGRLAPMRWAASRGGRSGVAYSIPGGNVAFQSTARYPIVSDDQKFEIALGIRQGDGDPDRTGAYLGIVVFNAAGVRIPNGTSYGFYPWGANLSFSDRAWQDRKAIIGKTAASVFRLPADAAFFSPLVYLNNGNASPNASYEVDYFTVREVTQE >NC_020561|2554304:2607859|2579929_2582590_+|WP_187294061.1|DBSCAN-SWA MSGDGRPDPDALLRAAAQEGRGRLKIFLGAAPGVGKTYEMLSDGAAQRQEGRDVVVGVVETHGRVETEALVRGHELIPRREVPYQGRILHEMDLDALLERAPELVLVDELAHTNAPGSRHPKRYQDVEELLAAGIDVYTTVNIQHIESLNDVVASFTRVRVRETVPDGILEMADIEVVDIPPDELIERLKAGKVYLPREATRALTHFFSKSNLSALRELALRRAAQAVDAQMLEHVRALGVGGTWAASERIVVAVSELPGADGLVRAAKRIADALHAPWTAVYIETPRAQTFGAGEHRSLAAVMNLATQLGGVVATVPATSIVAGLKAYLTDARATQLIVGKSQRSRWFELRHGSVVDRLVRETPGVAVHVLPLESPPPRTGGFRIHNAWGSRSGYAWTAAMVAGVTAIASGLFHVLDLGNVALLYLLPVMAAATLFGLRTGLFAGLTSSLAYNFFFLPPTGTLTVNNPENVISILVLLGVAIATSQLTARVRAQADMASSSARTNAALAGFLRRLTGIGDPMELARAICEDIARLFDLRVVLLVPNGGGLSLQAASHPGCDLETMELAAARWAFDTSAPAGRGSGTLASSDWYFQPLRAGDRTQGVLGLAKEDGGDPLRADQLPLLTSLVDQAALVMERFRLEEEMRDVESVRTRDRLRQALLSSVSHDLRTPLTAVIAAADQLDHGATPDLIGTIKAESARLNRFVSNLLDMARVEAGALKLNIEAIDLSDAVTGAAHDARRALEGHPVRLDVPPDLPLVRADPQLLHHCLLNLLDNAGRYGDPGTEIVIEGRHLFGHIRLAVLDNGPGLPPGREAEVFETFRRFEGSDRAIGGTGLGLAIVKAFAEAMGMSVEASNREDGSGASFALMFPSPLIVRDVAQGSV >NC_020561|2554304:2607859|2556664_2557237_-|WP_015459149.1|DBSCAN-SWA MTNQTIAATVQPLSLQDAPALIERLWPAQKISVEAQKERKAVQSQTLTALGSYWKGRKPLVLVRACVLGALLPATDDPAKDLEIFEKLMRIDDEAFIFRDQSTKAAEVAQILLSERILDLAGLRELFAIRGTAEAVNDEAFLGAVATGKVAWRKDAPVVKRVNRRAKLTPYRRPTLTPFFVSSGGYPGSP >NC_020561|2554304:2607859|2595876_2596359_+|WP_015459181.1|DBSCAN-SWA MEHNMNIEASPPATATLPTTRDDRYGTVTINAPGLAIISDMARNGYPTTSIASALGMSPRILRECRKRQPEVEEAWATGLAGLEQELVHSLLVAARKGAIAAAMFLLKTRHGYRETGQTDASPKVAVQINLPAAMEAKAYAAMVEAEARMVGDGGSDGDA >NC_020561|2554304:2607859|2584301_2584907_+|WP_144062145.1|DBSCAN-SWA MAAEVGMRFQRESVPYDPMSWMLAPRRVFDGAAPVDACLDRDDCMRGVLVHGLGLGLDVDRAAIDTLMSSDDDDFDEHEFGHLYDGQFGGAGRSKRERTGRGTRLRLYTATIAETRDNVMTQAFHASVARNSGEIRARLAGRFGPDLADAADIRIGVHAASPLVVALVPGAVIEMIRQMDRDCGTVAARTFAVDIQQCIQA >NC_020561|2554304:2607859|2593102_2595502_+|WP_015459180.1|DBSCAN-SWA MADALPSTDTVGAAMNYIGTGMRLVVLRPDSKKPVSEKGWQNAEPKGRDFEDGKNIGVQLGAKSGHLVDIDFDCAEARALSGLGCFFADLPAFRRASLSADAPGHRIVVCADAPDAVMQFAFTKNPEQEAIADLGLTKSVILELRAGKGYTVFPPSVIEGDRLVWNPRVSADVPTMAWEELRLKAGILAFAAFAAACYPPEGGRDNFCLSLAGALIHAGVAAETAEDIIAAIVALKGDNPRDRRGKAIATAEKRDAGEPVTGLPAFLENIGMRACEKRLRDWLGMAAAEAGEPLPPDAILIGRPDTHAVLAEIEEMLIAKSGRVYRRGSDLVRVSTLEEPVRDGDEIVRHAGLVELRSASPAWLAIEASRVGNFAQRSGNKIVPVAPPAGLMAMLGAVADESRFPPLRGLSMTPTLRCEQPGYDPESRLFLAFPPGMFPPGNMTPTQAEAEAALVRLAHPLRGFPFVADADRSVALSGMIAAVIRGEMRTCPLHLIDAPARGTGKTKLAEIIGIMGTGVPPSGVTHSDDGDENEKRLVAILRTGDPVILIDNVSSDLEGDFLCAMLTSETVQARILGQSERVRLSTRVLTLATGNNIRMRGDMARRAVRCRLDAHMANPDERSFDFDAVADVREARPALVTDALTVVRAFVAAGKPATVPPFGSFEDWDLVRGALVWLGHADPAETRAAVKEDDADVEEKVELLRLLHEHVGIGARFTMAELGSASRREALRTALARMLDRGVWDSRRAGRLLRRHKDVPFLGVTLRARPNTANVQEWWLAGEPEEALLDYRGEPACPF >NC_020561|2554304:2607859|2592723_2593110_+|WP_041864905.1|DBSCAN-SWA MGVAEYEMPAPCKGAGEGATYRAAAPSFSDHPWLWEISNGDYETLYYAAREAVRFAEALFDLDREYEDGRDLVGRIQARWDATPEMLRACLPPMMNVAVRREAERLLLWESQIAEARFQAVKGMVAHG >NC_020561|2554304:2607859|2583646_2583946_-|WP_015459168.1|DBSCAN-SWA MHALQVMSALSQATRLATFRRLVEALPEGMASSDIAAAVGTTPNTMSAHLAVLQRAGLVSSEKVGRTVVYRAETAPAEGLSEFLALACERGKKARSTRH >NC_020561|2554304:2607859|2598572_2598773_+|WP_015459184.1|DBSCAN-SWA MTLSVAAANRIARAAAARRQRDEARRLAALAVRGAYDPPRWVLDRLTSGDRMEYEAARDEARKGNV >NC_020561|2554304:2607859|2565059_2566577_+|WP_015449228.1|transposase|DBSCAN-SWA MLIVETIAKIRREHRDGKPIKEIARDLRLSRNTVRKAIRAPEADFSYERKEQHRPQTGPFRERLDELLAENEERPRRERLRLTRIHDLLEREGFTGSYDAVRRYAARWKQERHAGGSGDMSKVFIPLMFRPGEAYQFDWSHEDVEIAGKPMRVKVAHMRLCWSRAPFVRAYPRETQEMVFDAHARGFAFLGGVPTRGIYDNMKTAVTTVFTGKERVFNRRFLIMTDHYGVEPVACSPAAGWEKGQVENQVQTGRERLFKPRLRFASMEELNAWLEAECRRWAERYAHPDMEDMTIAQALEMERPSLQPLTTPFDGFFESEHVASSTCLVSFDRNRYSVMAVAARHAVQLRAYADRVVIRCAGKVVAEHARLFGRNQTKFDPWHYLPVLIRKPGALRNGAPFQDWDLPPALAQLRRKLGKSDDADRRFVRVLAAVPEDGLEAVEAAVREAMAAGTANDEVILNILSRRREPQPVQAINVVVDLRLKHPPIADCARYDTVRGLNAAA >NC_020561|2554304:2607859|2579351_2579924_+|WP_015459165.1|DBSCAN-SWA MGKDFTSALRPAIVMTILFAALLGIAYPLAMTGIGQAIFPSQANGSLVRDAGGKVIGSTVVGQAFTSDRYFQTRPSAAGEGYDGLASSGSNLGPTSQALVDRVKPDIEKRRAEGVTGPVPVDLVTASGSGLDPDLSPEAALAQAPRIAKVRNLPVERVRSLVTDQLETSVLGAPHVNVLALNRSLDALTR >NC_020561|2554304:2607859|2557236_2557803_-|WP_015459150.1|DBSCAN-SWA MASKSMATIGYGVPTNEIDPQRFVVRIPANGRDLVEIIEDFGITGPQYDPQTNMRSDRVVRCRLDQKRWRQISAAVRTAFNERLKGRKMGSGKWTSGDNDVHRLLGRELCVLVWAVESCEASLIPAAIDAWTGLRPEERWWLYSMAVHATGRAEDIDKGWRKAIRIGLTEGAGAVEAGNKRQISFEDL >NC_020561|2554304:2607859|2596377_2597733_+|WP_015459182.1|DBSCAN-SWA MAGTGAGGAEDLNLALLGGRGSGKTTALALLVLRHCVQYEDKARVLILRATYKSLANLWDELEALFRDAFPGGITSNRADFVIRCPNGAVVTLGNLSSQKDVVKWQGQEANLLAVDEITNFTTLRHINMLRANLRGPAGIPTRMIVLGNPGGPLHATIARMHVHGRVPWKPYELPDGSRWVYAPSTYIDNPTIDTERYARSIIASAGGDRALASAWLENNWNDLAGAFFADVFGDHLIIPDDPGFRVPKGKGHGWYSCVALDWGWSAPTAAVLAVHARRPGLIGPGGRVFPQGSWIIVDEVHSARSDDPSLGKSWPPQMVAEEVLAACERWGIRPHGVGDDARGLQNDTLLEQLGRHGLHLTKPTKDRISGWVKVKSLMAAARDGDPDTPGLWMSERCRFGLETLPLLPRDDVRMEDVDTSANDHFADAVRYLVNSAPRIVTSGRVIGHYY >NC_020561|2554304:2607859|2555808_2556537_+|WP_015449229.1|DBSCAN-SWA MQRHEMLAALKGLGLKGMIAAFDDAVTNGIRRDRTAMEMLGDLLRAETAHREAASIRYRMTAARLPAIKDLDGFVFADTPINESLVRSLHAGSFLPERRNIVLVGGTGTGKTHLALAITAAVVRAGARGRFFNTVDLVNRLEEETRQAKAGSLAAQMARLDVVVLDELGYLPFARSGGQMLFHLISKLYEKTSVIITTNLAFGEWPSVFQDAKMTTALLDRVTHHCDIIETGNDSWRFKNRS >NC_020561|2554304:2607859|2591008_2592196_+|WP_015459177.1|integrase|DBSCAN-SWA MSDLIELTDKYLKALPPPTGGQTVVRDTLPGFFVVVGKRTKTFTVQCDVKDDLGRRRTKKVALGQVGDLTVAQARAKAKATLGALQVAGKFEERRKEWTLGEAWDHMRDFDLPAKGSRPRTVDGYEKTIRRLMGDWLKVPLRKFAEKPHLVTERHRTITRDNGPYAANHFGRAFRRLYNYAQAKLDRTLPVTAWSKVITFNREYRRNTGMGDGDLGHWFTQLADIPNGVRREFHLFTLLSGSRPDALSRAEWRHLDVKRRALHIPAPKGGEDRAFDIPLSRPMLASLARARRIGSKLAPRQSETFIFPAAKSKAGHIVEWKEDRDVLGKWGVDLRQSYTIAAETLDVSERTLKRLLNHATQDVTMGYGDRDRMWPRLLEEQARISAHLMAYAKRS >NC_020561|2554304:2607859|2597748_2598576_+|WP_051128732.1|DBSCAN-SWA MPTPLDESTPVRGDVPLSMHPDALLNVSDTLNRPDTLGIPALSAAREALRLCYDCYGRLNDAERDLQAVAEPALRRQYPADRGGHTEVSGNVRMVNGKPTRIVDAEEFVTAAEQALARVSPAVDRRMAELKGYRDTLAQRVATALDVPARKSPEGLALASEVRAHIKAMKQPAARGKFVLDAVEAGDLPTVAAVLHAPAFLSGLDVGTHALVRSRAAARFAPVDSAQLDAAEVAIAQVAAAGSATTRRFGAVLALRTTPAVKAAKSVKALAGAGA >NC_020561|2554304:2607859|2587540_2588197_+|WP_041864904.1|DBSCAN-SWA MMAREPAGDRTRTASVGPDRIHELARRRACDEALVIDRLVETLRLASFRSFLASTVVSMSAIVPSVLDMVGSDVPSALQRIRPGHLWPRSTSRAGRSPASSALGRKDLVWPMRIGDAVMADGILAWVEAAILGSSLDIVLRAGGVELATYAGVARLQVDDRLPDTVLSACEGRPLDQIVDHPLLRGRGYVVDGAYQARDASVLTFDVGRRSLEMPWRP >NC_020561|2554304:2607859|2589659_2590517_+|WP_015459176.1|DBSCAN-SWA MNWRRLVDLVFPYIEPLTTAEKIAEEQRLRRDIAAIEAADFTRSDERALDEAQKVSATEIERVRTAEGKATTYLAVLAALVPVIITLQAANWEKKAGPAPDAARLFVLAVATVYVAAAGFHAFKTLQVQGFQRVGEAEIAAAWQSQKPLRRLTRGTLLATRRSRDAVNAKITRIRVTHEHLLRAFGTFVLLLLLDPLFYAMGFRNAAPDAPAAKNVVVEHRRVAAPPSTAPPPPSQRQSPAGSLERTSARPQSVGASPTVARKVESAADQAQDCPAGAKECSGPR >NC_020561|2554304:2607859|2605629_2606358_-|WP_015459191.1|DBSCAN-SWA MQRHDMIDAMRGLGLKGMAGAFDEAVTTGLQRQRTTMEILTDLLRAEATHRHAASIRYRMAAAKLPVVKDIDAFHFEGTPINEALVRSLHSGAFLPARRNVVLVGGTGTGKTHLAIAITANVVRAGARGRYFNTVDLVTRLEEEARIGKSGALAAQLSRLDLVVLDELGYLPFARSGGQLLFHLVSKLYEQTSVVITTNLAFGEWPTVFGDPKMTTALLDRVTHHCDIIETGNDSWRFKNRN >NC_020561|2554304:2607859|2569851_2570052_-|WP_015459158.1|DBSCAN-SWA MPRGTRHEITGILLEQRGELVLEVAGGGTWRLDAGWKARRLLGMRVNVVGVRDGFDLLAVERIERV >NC_020561|2554304:2607859|2601255_2602305_+|WP_144062041.1|DBSCAN-SWA MFASHRDACILSKYSAELVALLDRWYRDNSLDETVIAYRSLGLSNYHFARRVQDYVRSQPSLTVMCFDVTGFFDNLDHGRLKERLRWILGGGEIPGDWYAILKAITCYHYVNLDDLKKHDHLAQRIKERGHHPLATIGQIKALGVPINKNPNKVGIPQGTPISASFSNLYMTALDMELAVEATKRGALYQRYSDDILIACHPTSANTLEKLVEDRLLAWGLALQKAKTERVTLAGASTLTFQYLGYQLGHVEAKLRLKSLSRQWRTVKRALKKTERVGSSAVAAGKAKQIYTRKLNVRFTDAGPRNFLAYADRSADTLESGSLRKQVKRLRRHVQGEIARLKGKPPKKP >NC_020561|2554304:2607859|2592455_2592737_+|WP_015459178.1|DBSCAN-SWA MTAALDDSWGQLAERVAGIVTRHLADLPPRRVSPWFTPDDAADYLRLTRRGLEDMRAKGTGPRFHKVNDRVVRYHVRDLDAWLLGEGGKRGRS >NC_020561|2554304:2607859|2599424_2600185_-|WP_144061970.1|transposase|DBSCAN-SWA MARHLFWLSDEAWAAIEPHLPHGRPGKPRVDDRTVISGILHVLKTGCRWRDVPAAYGPPTTIYNRYNRWASRGIWQRLFEKIAGAGPVPDELSIDSTHVKAHRSAAGSKKGEWQEAIGRSRGGRTCKVHCLADDRGRPVAIALTPGNVADISMAVPLLSVTAPARRLIGDKAYDANSLRRWLAERRIKAVIPSTASRRTPYPLNRRIYRRRNVIERLFCRLKNWRRIATRYDRYATNYLAAIALVATIAEWIK >NC_020561|2554304:2607859|2564456_2564936_-|WP_015459156.1|DBSCAN-SWA MPEHLENEGVKPVFGADLASSAELIEWLGDRICEIEDFSGQLPSIAILVNESASMAPIAAGLSERLESRNIRAVACPDGQVMGQENDVRVFEVEHIKGLEFEAVFFVDVDKLEQREPDLFDKYIYVGATRAATYLGVTCSDPTVPASLSHVQHLFGERW >NC_020561|2554304:2607859|2575466_2575556_+|WP_041864902.1|DBSCAN-SWA MTLDLWLAALTALGLLAYLVAVLIRPERF >NC_020561|2554304:2607859|2571154_2572594_-|WP_041864900.1|DBSCAN-SWA MPKNIVIFSDGTGQDGGVRPEQRLSNIYKMYRAARPGPDSQIDPRLQVAFYDPGLGTDTSATGITSIRRGITKLLSSVTGRGITTNIADCYEFIVNHYTPGDRIWLFGFSRGAYTARCIANVLMLCGVPTKVPEGELPRFRLRVREISGRAVIRVYEHGAGHDRAKYEEERDELARRFRHDFGSDAGGEANVAPYFVGVFDTVASLGAKGPLRIALAALVALLVAAAAATLAGVAHWAFGIRWTWSFLAAAAVIGGWTLWGYLRTALKIMWPPLPGRRWPSMHLAQWSGRNFDRLLSRSVLYARHAIAIDENRADFPRLKWGWGKGVERPPEVEGEAKPFIQLWFAGNHSDIGGSYPESESRLSDISLQWMLEEARSVPHPLLVDDSRLRLFPSAAGLQHDEVAGMADTIRARTPAPLRWLTRALTWKVERRQPIETATLHPSVEERFELPSVPRQGGEGPYRPEALRNHERFRDRFGA >NC_020561|2554304:2607859|2585018_2585900_+|WP_015459170.1|DBSCAN-SWA MDTQTKVGRIDAPAAGGVRLLHPLRLQEGPTGEGSGRGPYIGVAVDCETTGLDWKAGRIIELALRRVRYDRDGTITDIDRAYEWREDPGEPLTEEVSRLTGLTDQDLAGEEIDTDAAVRLLRSASFVVAHNSAFDRRWIEARLPEAAGLRWCCSMAQVDWRGRGFDGKALGYLLVQNGFYFCGHRAANDVDAMIEMLRHRDGRGRTALAEMIERGSAPSWVVRASGAHFDLKDALRARGYRWSADLKVWAKEVADDDLVPEQLWLAGNVYSADAKAKALCPELVRVTPRTRFL >NC_020561|2554304:2607859|2603819_2603975_-|WP_172595472.1|DBSCAN-SWA MILTTTSDDVWIIEGDDVREDFIGYTVTVEGVVAGMDRIRADWLGVESHSS >NC_020561|2554304:2607859|2588715_2589663_+|WP_015459175.1|DBSCAN-SWA MILIVTNKRDVTTDFVVMEMRRRGLPFVRLNTEDLPQHQVAMIDGDPSELTLTGACGSLRLSDVTGAYYRRPGSFEVAGSAPVSEYVVAEWSAVLRSLWNALEGRWLNSPFSILRAEDKPRQLAAARRVGLRIPGTLVTNDFALARDFLSGGPMVAKPLRHALIDDGEVGSVIFTNRVERLHDADAEGFGRAPMILQREIVKRADVRVVVVGGAVFATRILSQAYDETQVDWRRGVRQDLDHEPLGLPPDIVSGCLAVTRDLGLRFAAIDLVEDGQGAFWFLEANPNGQWAWIEQKTGARITSAIVDVLASKARV >NC_020561|2554304:2607859|2575569_2577273_+|WP_015459163.1|DBSCAN-SWA MTLQGWILILGFVAILLALAKPVGAWLFALYEGRRTPLHLVLGPFERGFYRLSGIDPTQEQGWRRYAVHMLLFNAALMFFSYAVLRLQAFLPLNPQGLGPVSEHLSFNTAISFTSNTNWQSYGGEATMSNLSQMLGLTIHNFLSAATGIALAFALFRGFARREAKTVGNFWADVTRITLYLLLPLCVALTIFYIASGVPQTLGGVVDVTTLEGARQSILLGPVASQEAIKMLGTNGGGFFNANSAHPFENPTALTNLVQMLSIFVVGVGLTWCFGKAVGNTRQGWAILSAMMILFLAGTTITYWQEAAGNPVLHSLGVEGGNMEGKEARFGIAASALFAVITTAASCGAVNAMHDSFTALGGMIPLFNMQLGEVVIGGVGAGIYGFLLFAILAVFVAGLMVGRTPEYVGKKIEAREVKLAVLAIAVLPLIILGFTALSSVADQGLAGPLNKGPHGFSEILYAFTSGVANNGSAFAGLTANTPWYNGLLGVAMWLGRFFIIVPMLAIAGSLAAKKYTPESAGSFPTTGGLWVGLLVGIILILGGLTFLPSLALGPIADHLAMIRGQLF >NC_020561|2554304:2607859|2561970_2563512_+|WP_015459154.1|DBSCAN-SWA MAQPSRTGEGLIQDLAEVLEIPVSRYESADRSYRSVSRWLDRPESRFAHVDLDVYTQGSFRLGTAIRPLNGEEHYDLDIVCEFRISKAHTTQKQLHDDLGHELELYAAKHGMQEPSPWQRCWTLNYADEAQFHMDVLPSVPDAQRQRVLREARSLPLDYVDQSVSITDTEHANFRRLSDEWPASNPNGYADWFQSRMKPAFESRRKAIMLAEAKADVADIPVYRVKTPLQSAIQILKRHRDMRFADEPERRPTSIVITTLAAHAYQQETTISGALLSILQRMDSYIVQQDNGYWIANPSDPRENFADAWNEDADRRDAFYDWLETARADFSLAASQQDPSAFVDALAPRIGRSLVEAAVARRSRPSTGGMSFVSRASRALQRVIDAPHRKPATWPTVRSGTVEIIVATAMRDGFRPRIFTSDDAPVPPGAKLRFDARTDVPRPFRVYWQVVNTGAAATAARNLRGGFDEVTVVPGVLSRTEDAKYPGSHSIECFVVKDGYLAARSGPFLVNIG |
50 | Lactococcus_phage(23.08%) | transposase,integrase | attL 2600640:2600657|attR 2605969:2605986 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
3093961 : 3134667
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_020561|3093961:3134667|DBSCAN-SWA CATGGGCAAGCTAACAGCGAACGAAGTGAAGGCCGCGTTGGGAAAGCCCGGATCGTATCAGGACGGCGACGGCCTTTTTCTGAAAGTCGACCAACGGGGCGGCGCGGCATGGCGACTGCGCATCCAACATGAGGGTAAGCGGCGCGACATAGGCCTCGGCAGCGCAAAACTCGTCACTCTCGCAGCTGCACGGGCGAAGGCAGCGGAGGCGCGTAAGGCGATTCGCGAGGACAAGCGCGATCTCATTGCCGAGAAGCGCGAAGCAAAGGCCGCGGCGGTCACGTTCCGAGAAGCCGCACTGGCGCTACATGAAGCTCACAAACATCAATGGGGCAACGCAAAGCACGGGGAGCAATGGCTAGCGACTCTCGAAAGCTACGCCTTCCCCTCGCTTGGTCGGAAGCAGGTCGGGTCCGTCGCCGCGGGCGATATCATCGCGGCGATCGCCGGGGTGTGGACGTCCAAGCCGGAAACCGGGCGAAGGGTACGCCAGCGGATTTGCGCGGTGCTCGACTACGCACATGCGCGCGGATGGAGAGCAAGCGAGGCACCGGTGCGGGCGCTTATGGCTGGCAAGGGGCTGCCCAAGCAGGCGGGCGGCAAGCATCACCCTGCAATGCCGTATCAAGACGTGCCTGCGTTCCTCACCCGACTTCGCGCAACCGGCGGCGTCTGGGGGCGGCTTGCGCTCGAGTTCGTGATTTTCACGGCCGCCAGAAGCCAAGAAGTGCGGCTGGCGACGTGGGACGAGTTCGACATACAGAACGGCCTCTGGACTGTTCCGGCAGAGCACATGAAGATGAGGCGCGAGCATATCGTGCCGCTTTCTGCGCAAGCGCTGGCGGTGCTGCGAAGCGCTCAGGCAGTGCGGGTCAGCGGAACCACGCTTGTGTTCCCTGGCGCGAACGGCACCACAATGTCCGACATGACGCTGCTCGCCGTTCTCCGGCGGATGAAAGAACCGACGACGGTGCACGGCTTTCGCTCCTCGTTTCGCACGTGGGTCGCCGAGAAAACCAACTTTCCGGGCGAGGTCGCCGAAGCTGCGCTAGCCCACCAAAACCGCAACGAGGTCGAACGGGCATACCAGCGTGGCGCCTTGCTGGAAAAGCGGCGCAAGCTGATGGAAGCGTGGGGTGCCTTCTGTGAGGCCGGTTCCGAAAAGGTGCTACCATTCAGACAGGCGGCGCAGGCCTAAGCCCGTGTCTGCCTGCGGCTGACGGCAATCAGGATCGCCAACGGCGAAGGACGTCACGCACGTACGCTGGCGTCTCGCCATTCAACGGAATCCCGCCGGCGCGTTCGACGGCCCCCGGACCGGCATTGTAGGCGGCAAGCGCCATGTGCACCGTCCCGAAGCGGTCCAGCATTTGCCGGAGATATCGGGCAGCGCCGGCAAGATTCGCCAGCGGGTCGAAACGATTGGAAACACCCAGATCCTTTGCCGTGCCCGGCATCAACTGCCCTAGCCCGGCAGCGCCCACCCTGCTGATAGCGAAGGGATTGTAGCGGGATTCTGTCCAGATCAGCGCGTCCAGCAGCCCCGCGGGCAAACCGTATTTGGCTTCAGCAGCATAGACATGGGGAAGGTATGCTGCGCGACGAAAGCCGTTCGGCCCGGAGAGCCCCGTTGTTTTTCCGCTGGCCAGCCGGTCGGTTCCTTTGGAGCCGACCGTGACTGGCGTTTGCCCCGATTCCTGCACGGCAACCGCGGAGGGCTCTTCCCAGACGCCATGCTCCACAAGCTGGAAGCCGTCGGCCGTCTGCGCCAAGCGATAACCTTGGTTTCTTTCATCAATGGCGGCATCCACGCGAACAGCGGGCGCGGATGCTTGGCCGTGCGCCGGCACCGGCGCCGCCGCCACCAAGCCGAGGGCGATCGATACAACTCGCAGGCTCATATCCCGATTCCTTGTCTGGTCGAATCGAGAGAGAACATATAGAGAACATGGGTTGTAGGAAAACGCATCAGCGCGGGTGCGGAGAGGAGACAGCGCTGAGAAGGCATGCCGAACGGAGAAACCATGCCCCAACGCCATCAACCGGAGCGTCTCGAGCGCCGCGCGCGGGAGATCATCGACAGTCTCGGCGGAACATGGCGTCGCAACCGCGGGATGTGCCGCTGCCCCGCCCATGATGATCGCACCCCGTCGCTGAGCGTGAGTCTCGGCCGGAGCGCCATTCTGTTCCACTGCTTTGCCGGCTGCTCGAACGGCGAGGTCATTGCGGCCCTCGGCCTCCGCGGTGTCCGCGCCCGCGAGCTGTTCGATGGCTCGGGCGAACCATTGTCCGAGGCGCCTCGCAAGGAAACGGCGGACCGCAATGCCTGCCGGCTCTGGCGCGGAGGTGATGTACTTAGGGGAAGCCCGGCCGAGGTTTACCTGCTGAAGCGGCGGCTGACGCAGTTTTCATCCGATTTGCGTTTCCACTCCAGGACCCCGCTCGGGCCGCGCGGCTCGGTCCGGTTCCTGCCCGCGATGCTGGCAGCCGTCCGCACCGACCTCGGCGTCATCGCAGTTCATCGGACATTTCTCGACCCGTTGACGGGCCGCCTCGCCGGTTTCGAGCGACCCAAGCGGGCGCTGGGAAGCCTGCAAAACGGGGCGGTGCGTCTTGCCGCGCCACGTCGAGGGCGCCTCGGTTTGGCTGAGGGAATCGAGACGGCACTGTCGGCGATGCAGCTCTTCGGGGTCCCGTGCTGGGCAACACTCGGGAACGAGCGCTTCGGGCTGGTCACCATCCCGGAGAGCGTGCGCGAGCTTCATCTCTTCGTCGATAACGACCCGGGCGGGGACCTAGCCGAGGAGCGCGCCCGCGAAGCCTACGCCTGCGACGGTCGTCGGATCGTCGCCACGCGGCCGACCAACCTCAACGAGGACTGGAATGACGTCCTGATGAGGATGGCACTGGCCGCAAGCTGATTGGGCCAGAGGAGAGGACGCCCGGGCTCCGGCGAGGCTGCAGGTTCGTGAGCAGCCTCGTCAGGAGGTCCCATGTCCCTTGCATCACCGGCATTCGCCTTTCCCTGTCTCAGCGAGCCCTTGGTTCTCACAGCGGCGCGCGACATCGCCGAGCTCATCGAAGCAGGCCAGGCGCTCTCGCGTGCCGGCCTGAATGCGATCCTCAACCGTCTGTTCGGCGGCAGCGATGCCGAGGGCCGATGGTCGGTGCGCGATGCCCATGCAGCGATCGAGCTCGCCCAAGTGCTCTGGTTACGCGCCCATGCCGGACTCACCCCCGCATCGCCAGCCGACAGCGCTCACGAGGCATTCACGCTGATCGAAACCTTGCTGCCAAGCCAGACCGTCCGAAGCGAGGAGCAGATAGAACTCCAGCAGTTCTCGACGCCGCCACGCATCGCCTGGCTTCTCGCGCGAGCCTGCGCGCTCCGCGCGGGCGAGGCGGTACTCGAGCCATCTGCTGGCACCGGCATGCTGGCGACCTGGGCGGCGAAAGCAAACGCCCGTCTGATCCTCAACGAGATATCTCCGCTCCGCCGGGATTGCCTCGCCTGCCTGTTCCCGCAGGCCACCGTGAGCGGACATGACGGCGAGCTCATCGACGAGCTTCTTGACCCACGTCAGGTGCCGACGGCTATTCTCGTCAATCCACCCTTTTCGCACGGGATCGAACGCGGACACGACGGGCATACGGGGAGCCGTCACCTGCGCTCGGCCTGGAAGCGGCTCGCCCCCGGCGGGCGACTGGCCGCGGTCATGCCCGAGTGGTTCGATGTCGGTGGGTTCCTCTGCGCGGTGCGAGAACCAGTCACGGTGCAGCTCAATGTTGCGGTGGAGCGCGGCTTCTCGAGGAACGGCACCTCGATCACAACGCGCCTGCTGGTGCTCGACAAGATCGAAGCCGGCGCAGATCCGGCGGTGGCTCGCACCAACGACTTCGCCGCCTTGTGTGAGCTGATCGATGCGATCCCGGCCCGACCCTTCGCGACGGCGCCTCAGTCCTCCGCCTCATTGGTCCTTGCTCCGTTTCGGCTCGCGAGTTTGGCGGCCCGGCCGCGGCCAGTGCCGCCCCGGCACGCGGCGCCGCCAGCTCCAAGCGAACCCTGTGCGTACGAGGCGCTCGAAGCGCCGGCTCCGATCGCCGACCAGGTTGGGCACTACCTCCCTTATCGGCCGAGCCGCATAGCGATTGCGGGTGCGACCGAGCATCCGACGCCGCTGGTTGAATCGGTCGCGATGGGCTCAATCACCGCTCCGGTCCCGACCGAAGTCCCGCTGCTTCCCGCCGGGGTCATTGCGCGAGGTCTGCTCTCGGCTGCCCAGGCCGAGACCCTGATCTACGCCGTTAGCGCCCATGGGCGCGACCTGCCCGGCCGCTTCGTCCCGCAGGACAAGGGCTGTTCGCTCGCAACGAGCGCCGAAGGCGCCGTCTACCGCATGGGCTACTTCCTCGGCGACGGGACAGGAGCGGGGAAAGGACGGCAGGTTGCCAGCGTGATCCTCGATCGCTGGCTGCGCGGCGAGCGGCGTCACATCTGGATTTCGAAGAACGAGGCGCTGCTCGAGGACGCCCGCCGCGACTGGACCGCGCTCGGTGGTCTCCCCATCGACATTCAGCCGCTCCGGCAATGGAAGCTCGGTCTGCCGGTGACGATGGGGGACGGCATCTTGTTCGTCACCTATCCGACGCTGCGCTCGGGGAGGTCCGACGCCACCCGCCTCGACCAGCTCCTTGAATGGGCTGGGGACGATTTCGAAGGCGTGATCGTATTCGACGAGGCGCATGCGATGGCCAATGCCGCGGGCGGCGAAGGCTCGCGCGGCAAGGTCAAGGGCTCGGAGCAGGGAGTGGCCGGCGTGCGGTTGCAGAACCTGCTGCCGCGGGCGCGAGTGCTCTACGCTTCCGCGACGGGCGCATCCGACGTCAACAATCTCGCCTATGCAACGCGTCTTGGGCTCTGGGGTCCGGAGACGGCCTTCGCCAACCGAGAGGCCTTCGTGAGTGACATCCGTGACGGCGGCATTGCCGCGATGGAGCTCGTCGCCAGGGACCTGAAGTCGCTCGGGCTCTACACGGCCCGCGCTCTGTCGTTCGCCGGCGTCGAGTATGAGATCCTGGAGCACCAGCTCACGCCTCCGCAGATTGCGGTCTATGACGCTTACGCCGAAGCTTGGGCCATCTTATGCGCAGCTCGCCATAAGATCGCTTATCGCGAGGAAGCGGTAATGCGGAGGAACGCGGCGTAGTCCTGCAAAGCTGCGGAACATTTCGCCACGTCCTTCACATTATTCTATAGGGTGTCGAGCCATCCCATCAGATTGGCATCTCGCTTCCACGAAGGTGGCACGTTGCTCGCCGCCGGGGCGCCAACCTGTTCCAGCGCCTTTCGTTTGGTCTCCAGATTGGCCTGAGCATAGTGATTGGTCGTATCGAGACTGACGTGCCCGAGCCAACTGCGGATGACGGTGATATCGACTCCGGCGGCAACGAGGTGGACGGCGGTCGCGTGCCGGAAGCTGTGAGGCGTAACGTGTTTTGACTGGAGGGTCAGCGTCGATTTCGCGGCTTGTTCCACGTAGGCGTTGAGCTTGAACCGCACCCCTGACGCCCCTAGCGGTTCACCGTATCGATTGACGAAGATCCGCTCATCGGGCGCACGCGGCTGTCGCTCCAGTAGCTTCCTGAGCAATGCCACGGTTTCTGGCCAGAGCGGGCAGATGCGTTCCTTGCGCCCCTTGCCGTAAAGACGCACGAAGTTCGGTGCATCGAACCGGATTGCTTCGGGACAGAGGTCAAGCGCCTCTTGGATTCGCGCGCCACTGTTATAGAGGAACGAGAGCAGTACATGGTCGCGCAGCCCTTCGAGTGTCGATCGGTTGGGCTGGGCGAGGATGGCCTCGACCTCCTCGGGCTCGAGGTAGCACGGCGCGGAGGTGGGCTCCCGCTTCAACGGAACAGCCAGGACCTCTGAGCACTGCGCGATGTATTCCGGATTCTTGTCCGCCACGAAGCTGAAGAAGCTGCGGATGGCGGCCAGTCGGCAGTTCCGCGTGCCGATCGTGGTCTTGCGGCCATGCTCGGTATGATGAAGGAACGCGCGCACCTCGCCGGCCGAGACGTCGGTGAGCGTCAGCCGCGCGACCCCGCAGCCTTTTCGCTCCGCGACGAACCGAAGCAGCAGCCGCCAGGTGTCGCGATAAGAGCGGATCGTGTGGATTGACGCGCTGCGCTGCTCGGCCAGCCACTCCTGGAAGAACGCCCGCAACAACGCGGGCAATGGATTGCCTTTCCTCATGGCCGCGCCTCCGTGACAAGGCACGGGGCGCCGAGCGCGCGGAACCGTTCACTGGCTTCCTGCAACAGATCCTGCGTGACGGTGATGTAGACCAGCGTGGAGTGGAGATCCCGGTGCCCCATGTAGGTCGAGAGGAAGTGCAGTTTTTCCTGAGGGTTAATGCCGGACCGGTACCACTGGAGGATGCGGTTCACCACCATCGAGTGACGAAGGTCATGGACCCGCGGCCCGGTCCGGCCCGAAGCGGGCTTGAGCCCAGCACGGCGCATGACGTTGGTAATCATTGTCGTAACCGCCTCGGGCCTGTAGCGGTCATTTAAGTGGGCATGCCAGAACAGCCCTGATTTCGGGTTCTGTGGGCCGCCAGCGCGCCGCCTCGCATCGATGTACGCGCGCAGTTCGACCGCGACACTGTCGGATAGAGGCAAGATCCTGGTCTTGTAGAACTTCGTTTCCCGGATCGTGATCGTGCTTGATTGCAGGTCCACGTCACCAAGATCGAGCCATGCCAGCTCGCTTCGCCGTAGCCCGGCACAATAGGCCAGCATGATCATGGTGTAGAGGGTCAACGGCCGCAGCGGAGCGTCCGGCGACGGATAAGTCCGTGCGGTATCGAGCATACGCCGAACGTCAGCCGGGCTGAAGATATGCGGTTGCCGATGCTCCCGCGCTACTTCCCGCTCTGGCCGGGGATTGAAGCGCTTTGGCGGGATAGTCGGATCGAGGCGGAACCGCGCCTTGGTCAGGATGCGCGCCAGCTTCTGGCATTCAGCCGCGTGGTTGCGGGTCGGCTTGGCAGCCGCCCAGCTCGCAATCATTGCCTCAAGTGGTTGCTCCGCGAGGTCGGGACGGGCCTGAAGGAACCGATCGAACCGCAGCAGCCAGTGAGCCTGCGCTTCATATTGATATCCCCGGCTGCGCATCAGCATGACATGGTCTTGCATGAAGTCACCCAGCACGCTGCCGAAGGGCGCAGGCCGTCGTAACGCGGCCAAGGATTCGTCGGGATTCGGGGAAGCCAAGGCCCGCCAGATCGGCTTGCTTTGCTTGACGTTGTACCGACGGCGAAGTGCAGCAACAGGGTTGTCGGCGATCAGCCCGATTTCGACGAGGTGGTCGAGGAAGCGGTCGACAATGCAGACCTGATTGAGCAGCGTCGACAATCGCCAACGTTTTTGCATCTCCTTCAGCCAGGCGTCGAGCATCTGCCGGTCCACCGCCGGATGCCGGCGGGCAACATCTTCGAAGGTGCAAAGGAACCAGCGATATGTCGGTACGCTTCCCGGCCGGAACTGCGATTTTACCAGGAAGGCGTCGACGACGGTGCGATCGGGATCGTGCCAGGCGCTCATGACAGCACCTCCATTCCAGGCACCTCGAGTGCCACGGCTCGGAGATCATCTGTTGCCAGTTTGAGATAAGTATTGGTGGATTCGGTGGATCGATGCCCGAGCACGTCGCCGATGATCTTTTGCGGGACCGATGCCCGCAGCATTTCGACCGCACGTGCGTGGCGGAAGACATGCGGCCCCCGCTTTCCTGCTGGCACTACGCCTGCGGCGGCCAACCGACCGCGGATCATGCCGTACAGGTTCGTCATTGCGATATAGGGTGCGCAGGATCGGACGAAGATTTCCCGCACTTCAACCTGGGGCCGCCCAAGGCGCAGATAATCCAGCAGCGCTTCACCAACAGTCACCATTAGCGGCATGTACGAGTACGCGTTGGTCTTGGTGTGGCAGATCCGGAGGGATTCTGCGCGCCAGTCCACGTCATCGAGCCGAAGGCGGCATATCTCACCTTCGCGCAGCCCATACGTGGCAAGCAGCTGAAGTATCGCATAATCGCGTAGTCCGCGTGGCGATCTGTCCTCCTGCGTTGTCGCCAGAACCGCGGCGATTTGGCTCCTTTCCAGCGTCGACGGCACATCTTCGTAGGCATAGAGCATGGGGCCGATGATGTGTGGCGTCAGATCGGTCGGGATGCAACCCGTCCGATGCAGATGGCGAACCACCGAACGGAGACGCTCAGCGACATCGGCCAATGATTTGCGCCGTAAACCAGGCGCGCGCATGTCCATGTAGAGATCGATGTCCACGATGCTCAACGTTTCGAGGCTGGCGGCACCGGCCCGGTCGAACTGCCATCGCAGGAAGTTTCGCGCCTCCCACATCAGCGCCGCAATGGACGCGCTTGCCAAACCGCGCTCCTCGCGCAGCCATGCCTCGTATTCGCGGCAAATTTCATGTCGATGCTCGTCATCGGGACCGATCATTTCTGCGTCCGGGGGCCAATTGCCTTGAGCAAGCCGGAGGAGCTTGGCGATCGCGGTGCGGGGCAACATGTGCCAACGCGCACTGGGAGGCCGACCGTACTGGATCTCAAAATCCTGAACCGCATAGCCAAAATACTGATCGACCTGCTGCGGCGTCACAGTCTCGACCTGTATATCGCACTCGGCCAGATAATCGAGAAACGCGCGGGCGTAGAGACGGTGGTTCGCCACGACCACGGGATTGTAATTCTGTGTGGTGAGCGAATTCGAGAGTTCGGTGATCAACTCGTCATGCAACTTCAACATGATTGCCTCCTCAGCTGGTCCAGGGACCGCCGAGGTTCGCAACCAATATAATGCGCAGCAACTGCGCACACTATCATGGGAATTCGCCGGTTACGCCGCGCTCCTCCGCATTACCGCTTCCTCGCGATAAGCGATCATTCACGCGAACCTGCGCGAGGCGCTCGAGGCGACCCGTATTGTCGATGCCGACAGTGGCGACACGCTCAACAGCGGAGCCAAGTCCGCAGCGCTATCGATATTCGAGGGCACCAAGCAGCGCTTCTTCGCCCAGCTCCTCCTGTCGATGAAGCTGCCCAGCCTGCTGCCCGCGATCGAAACCGCACTCGCCGACGGGAATTCCGTGGTCGTCCAGCTGGTTTCGACCGCCGAGGCCATGCTCGACCGGCGCTTGGCGGAACTTTCGGACGAGGAGCGCGAGGCGCTGGAGATCGACCTCTCCCCGCGCGAATACGTGATCGACTACCTGGCGAAGAGCTTTCCGGTTCGCCTGATGGCCGTGTTCACGGACGAAAACGGCACTGCGCGTTCCGAGCCGATGGTCGATGATGAAGGCCGCCCGGTGCTGTGCCGTTCGGCACTCGCCGCGCGTGACCGGATGATCGAGCAACTCTGCGCGCTACCGCCGATCGCCACCGCGCTCGATGCGATCATCGAGCGGTTCGGGGTCGATCAGGTCGCCGAGGTGACCGGCCGCCGCCGCCGGCTGATCGTCGGGCCGGACGGGCGGCAGAAGCTCCAGGCCCGCAGCCCGCGCGCCAATGTCGCTGAAACGCAGGCCTTTATGGACGGCGTCAAGCGCATCCTGGTGTTCTCCGACGCGGGCGGCACGGGGCGAAGCTACCACGCCGATCTTGCAGCGAAGAACCAGGCCCGCCGCGTGCATTTCCTGCTCGAGCCGGGCTGGCGCGCCGATGCGGCGATCCAAGGGCTCGGGCGCACCAACCGCACCAACCAGGCCTCGGCGCCGCTGTTCCGTCCGGTGACCACCGATGTTCGCGGCGAGCGGCGGTTCATCTCGACAATCGCCCGCCGCCTCGACAGCCTTGGCGCGCTCACCCGCGGACAGCGCCAGACGGGCGGGCAGAATCTCTTCGATCCGGCCGACAACCTCGAAAGCGTTTACGCCAAGGAGGCGCTCAATCGCTGGTTCGGGCTGCTGTTCACTGGCAAGCTCGAGGCGATCGGGCTGGCCAGGTTCCAGGAGTTAACCGGGCTCCGGATCGAAGGGCCCGACGGCGGCATGGTCGACGATCTGCCGACGATCCAGCGCTGGCTTAACCGCATCCTCGCGCTCTCGATCGCCTTGCAGAACGCCATTTTCGACGAGTTCATTGCCCTCGTCGAAGCGCGCGTCGATGCGGCACGGCAGGCCGGCACGCTCGATCTCGGCGTCGAGACGATCGCCGTCGAAAGCTTCGAGATCCTCTCGGACACTCTGCTTCGCACCGATGCCCTGTCGGGCGCGACGACGCACCTGCTCGAGTTGGAGATTGCCCGCGCCCTGAAGCCTCTGACACTGGCGAGGCTGCACGAACAATACGGAATCGACGATCCACAACACCGCCCCCTGCGCAACAGCCGCTCCGGCCGCGTCGGACTTCTCGTGCCGGCCCGAAGCATGTTGACCGACGAGGGTGCGCGGATCGCGCAGTTCGAACTGCATCGTCCGCTGAAGCGGGGATATCTCACCGCTGACCAGCTGGACGAGAGCAGTTGGGAGCCTGTCGACCAGAACGAATTCCGGCGCCTGTGGCAGGCGGAGGTCGACGAAGCGGCATCGAGCCGCAAGCATGAGCGGCTCCATCTCGCGACCGGCCTGCTGCTGCCGGTCTGGGACAAGTTGCCGACCGACCATGTGCGGGTCAGCCGGATTTGCGCGGGTGACGGCCGTTCGCTTCTCGGCCGCGAAGTTCCGATCAACTGCCTGTCCGAGCTTTGCCGGGCTCTCGGAATGGATCAAGGACCTACTATCTCCGCCGATGAACTGGTGGCGTCGGTCCTGGCGACCGGCCGTCCGCTCGAGATCGGAGGAAGGGAGCCACTGACGCTCAAGCGGAGCCTCGTGAACGGGAGGCAGCGACTTGAACTGACGGGATGGAGCGCCGCCCGCCTCGACTGGTACAAAGCGCAAGGCTGCTTCACGGAGATCATCCGTTACCAGACCCGCCTGTTCGTTCCGCTCGGTGAAAGCGCTTCGATCATTACGTCGCTGCGCTCGGCCTGAGCGCGCCACGCTCGGGCGTTTTGGCGTTGTGGACGCTGCGTGGACCGGCGATGGCGGGCTGCGCGTGAACTACGTTCCGAGGCGGTGCGGATCGTTGGCAAACGGCGGTTTGCCATAGGCCTGCCGGAACGACGCGATTAGCTCGGCCTCCACCTCGACGGGCACTTGCCCCGGGGTCTGCTTCCACGCGACGAGAAGCTGGTCAACGCGTGGCAACTGCCAGATCAGACGTCCACCCCAGTGACCTACAGGCTTCCCCGCGCCGAAGTCGGCGAACTGTGTGAGGCGGCGATTCAGCTGATCTGCTTTGCCGATATAGACCACCTCGGCATCATCGACCCAGTTAGCGGCGAGCGCTTCGTGCGAGACGGTCGGGTCTTTGCCCTTGAACCATCCCCCGCAGCTTTGCTCGGCAAACGTGAGGGGTTTGCTGCCGCCGTAGGTTATTACATACACCCCGCCCGACGAAGGACAGGCACTTGCCCGGACCTCCCCGAAGGGAACCCAACCGGCGAATCCACCCGCTTCCAATGCGGGGCGCGTAAACTCCACGAGTGCCATACAGGACTAACCCCAGGTGAGGTTTTCGATTTTCTTGTTGAGCAGGTCATTGGGAAACGCACGAATCAATAGAGACTCGATGTCGCCGATCATCCCTTCGGAAACCTGATAGGCGCTCACATACTCTGCCAGCTCATGAAGCCGAACATTGATCGGTCTGATCTGACGCCGCATTTCGTCCGAGGTACGAAAATCCTGGCGCCGTTCTGGGTGCTGCACGCGCAAGATCCGTTGAAGGTTTGGATCGCGGTCCCGGTTATAGGCCAGATTTATCTCCTTCCAGAGCGACTGGGCCTTCGTCTTGCCAGTGTATAGTGCTCGGCCACGACTGTCGTAGAAGATGTAGACGCCGCGATGGCGTTCTAGCTCGCGCTTCAGACCCGTAAGGTAAGGATGCTCGTTTTCGCCTTTCCCGGGGTCGAAGATGCGACAATTAGAACCATTGCCCACCGTCGTTCTCTGAATTTGGAAAAATTCGACGATCGGTTTGATGGCCTCTGACTCAGTCCGGCGCTTCGATGCCTTCACTACGGCGGTAAGAAGGCCAACCATTTGGCGCGTTGTGACCATCTCTCGCGAGCGCCAATTCGCAAGTCCGGTTATGCTGATGCCAAGATACTCGGCAACGGCACGATCCGTTGGTGCGGGCGAGCCCGCAACCGCGATCGAGCGGCGCACCTGATCGATTAGTTCTTTTCCGTTCACCCCAACCCCCAGCCGTCACCTTAGCTTCAAAGGCCGCGAATGAAAGAAGCGCTTGTGCGCCTTCCGCGCAAAACCGGTTCCTCTTGACCGCGCCTTCGGGTCAAAGGGCTAACAGCGGCTCGCGTGAACTCTGAGCGAGCAGCTCAACGATGAACGTCCGCGCAGGTGGCGTTTCGAGCAGGTCCGTCTCGGCGGCCGTGTGGTTGAGCCATTGCATGACCTGGCGCCGGTGAATGATCGCGCCATGCCGCTCCTGGTAGCGCGCGACCTCGGGATTGGCGGCGACGGTCACGATCCGGTAGCAGACCGGCCAGTCTCCCATTGCCGGTTCCCAGATGCCGGCGAGATAGAAGAAGTTTCCGCCATCGAGCGTGACCCGGTATGTCTTGTCGCCGACGGACATCCGGAATTCGGATGCCGGGATCAGGCAGCGGTGGCTCGGGAAGGTTTTGCCTTCGGACCGCACGAAGCGAAAGGACACGCCATCGCTGAAGCGGGGGTTGGAACCCCAGACCGCCTCGATCATCTCGATCTCCTGGGGATCCTCCGGATTTCGCCGGATGATCGGGCGGCGGCTGTCGAGCGGCGCGTCCGAATCGAAGGGTGTCGGAGTCGGATGCATGGCGAGAACATACAAGGAACACGATCTTCGGACAAGACCGACGGAGGCTGGGGGAGCAGATGTGCAATGACTATAAGCTCGAGGTCGACATCGCCTCGATTGCCGAGGACTTCGACAACCTCAGCATCAAGATCAGGATGCCGGAAGGCGCGCCCAACGTGCCCGCGCGCGAGGACATCAAGATCTCCGACACCGCGCCTATCGTGCGCGGCGTCGAAGGGGAAAGAGGCGTCGGCGAGCTGGTCAACCGGCGCTGGAGCTGGCCGGGACCGGGCGGCAAACCGGTCTATAATTTCCGCTCGGAAGGGCGCGAGTTTACCTCGAACCGGTGCCTGATCCTTGCCGACGGCTTCTACGAGTTCACCAAGCCGGACGACCCCAAGCAGAAGCGGCAGAACAAGTGGCTGTTCACGCTGCGTGACCACCCCTGGTTCTGCATCGCCGGCATCTGGCGGCCGCATGCGGAGGTCGGGGAAGTGTTCACGATGCTCACGACCGATCCGGGCGAGGACGTCGCGCCGTATCACTCTCGCCAGATCATTCCCCTGCCGCGCGAGCGATGGGCCGACTGGCTCGATCCGTCGGTGCCGGCGGAGGAGGTCCTGCAGGTGCTTCCCAAGGGCAGCCTTGCGGTGCGACGCGTCTACCCGCCCGAAGCGGCGCAGGCGGCCCTCCTCTAAAGCTAGTGGAAGTTCCGCGACCTGATCTTGATTCCCTCGGTCGGCTGGTCCGGGACGATGCCCGAGCCAAGGCGTAGATCGGGTATGTAGATGTCGCGCGGCGAACGGCCGAGCGACCGTCCGGCCGGGTCGCGCGGATCGGGGCCCATGCCGTGCTTGGCCACGTCAGCCCGGCTCACCCGGTAGACCGTCGCCACATCCTCCCGCTCGCCGACGCCCGCCAGCAGGCCCGACATGTCTTCGAGCCGCTGCGCCACGACATGGATTACGTCGCCTTCGCGCTGGACGCGACCGCGCACGCCCATCATCGAGCTTCCGAGCACGACCCGGCGATGCTTCTCGAACAGGTCGGGCCAGACCACCAGGTTGACGACATCGGTCTCGTCCTCGAGCGTGATGAACATCACCCCCTTGGCCGATCCGGGCTTCTGCCGGACAAGGACAAGCCCGGCGATATTGACCATGCGCCCGTCCTTCATGTCGCGCAGCGCCGAGCACGGCGTCATTGCGCGCCTGGCGAGTTCGTCGCGCAGGAACGCCAGCGGATGCGCGCGAAGCGACAGCCCGATGGCGCGGTAGTCTTCGACGACCTCCTGGCCTTCGGTCATCGCGGCGAGCGGCACCGCCGGTTCCGTCGCCTCGGGGCGCAGCCGCCGCTCGCGTTCGTCCGCCGCGGCAAAGAGCGGCAGCGGCGCGTTGCCGAGACCCTTGACCTCCCACAGACCGGCACGCCGCGAGGGTGCGATCGATCCGAACCCGTTTGCCTGGCCTATCCGGTCGAGCGCGCCCGCACCGACGCCGGCTCGGCGCTGGATCTCCTCGACCGAAGCGTAAGGCCGGTCGCCGCGCGCGAGCACGATCGCGGCGCCGTCCTCGTTGCGCAGGCCGCGCACCTGCGACAGCCCGAGGCGCACCGCCATGAACCGGCCCTTGGAGGGCTCGAGCGTGCAGTCCCAGCGGCTCGCGTTGACATCGACCGGCCGCACTTCGACCCCGTGCGCGCGGGCATCGCGGACGATCTGCGCGGGCGCGTAGAAGCCCATCGGCTGCGCGTTGAGCAAGGCTGCGCAGAACACGTCCGGATGGTGGCACTTCATCCAGGACGAGGCGTAGGCGATCAGGGCAAAGCTCGCTGCGTGGCTCTCGGGGAAACCGTAGCTGCCAAAACCCTCGATCTGCTTGAAGGTCTGCTCGGCGAAGGCGCGATCATAGCCGCGCGCGATCATGCCATCGATGAGCTTGTCGCGGAAATGGCTGATTCCGCCGGTCAGCTTGAACGTCGCCATCGCCCGCCGCAGCAGGTCGGCCTCGCCGGGGGTGAACCCGGCGCACTCGATCGCGACGCGCATCGCCTGTTCCTGGAACAGCGGCACGCCAAGCGTCTTCTCGAGCACGCGGCGCAGCTCCTCCTTCGGGTAGGTGACCGGCTCCCTGCCCTCACGGCGCCGGAGATAGGGATGGACCATGTCACCCTGGATCGGACCAGGGCGGACGATCGCGACCTGGACGACAAGATCGTAGAACGTCCTTGGCTTGATCCGCGGCAGCATCGCCATCTGCGCGCGGCTTTCGATCTGGAACACGCCGAGCGTGTCGGCCTTGCGGATCATGGCGTACGTTGCCGAATCCTCGGGCGGGATCGTGGCGAGGTCGAGCTCCAGCCCCTTGTCCTCCTCGAGCATCGCGAACGCCCGGCGCATGCAGCCGAGCATCCCGAGGCCAAGCACGTCGACCTTCATGAAGCCGAGCGCGTCGATATCGTCCTTGTCCCACTCAATGATCTGCCGGTCGGGCATTGCCGCCGGCTCGATCGGCACCAGCTCGTCGAGCCGGCCGAGCGTCAGCACGAACCCGCCCGGATGCTGCGAGGTATGCCGCGGCGTGTCGATCAGCTCCCGCGCCAGCTCGAGCGTGAGAAGCAGCCGCCGGTCGGCGAGGTCGAGGTTGAGCTCCTCGGCGTGGCGCTCGTCGACGCCTTCGCGGCTCCAGCCCCAGACCTGGCTGGCGAGGCCCGCGGTCACATCCTCGCTGAGCCCTAGCGCCTTGCCGACCTCGCGCACGGCGCCGCGGGCGCGGAAGCGCGAGACGACCGCGGTCAGCGCGGCGCGCTCGCGGCCGTAGGTCTCGTAGATCCACTGGATCACTTCCTCGCGCCGCTCGTGCTCGAAATCGACGTCGATGTCGGGCGGCTCGCGCCGCTCGGCCGAGACGAAGCGCTCGAACAGCAACTCGGATTTGACCGGATCGATCGAGGTGATGCCGAGCACGAAGCACACCGCCGAGTTCGCCGCGCTGCCGCGTCCCTGGCACAGGATATTCTTCGAGCGCGCGAACGCGACGATCGAATGGACCGTCAGGAAATACGGCGCATAACGCAGCTCGGCGATCAGCTTGAGCTCGTGCTCCAGCTGCACGCGCACCTTGTCGGGCAGCCCGCCGGGATAGCGCTCCGGCGCCTTCTCCCAGGTGAGCCGCTCGAGTTCCTGCTGTGGCGAGAGCCCGCTCGGACCCACCTCGTCGGGATAGGTGTATTCGAGGTCCGAGAGCGCGAATTGGCACGCCCGCGCGATTTCGACGCTGCGGTGCACCGGGCCGACATCCCCGAGGTAGCGCTGGAACAGCCGCTCCATCTCCTCGGCGCTTTTGAGGTGGCGGTCGGCAAAGCGCTCGCGCCGCCGGCCGAGCCGGTCGATCGTGCACTTCTCGCGGATGCACGAGACAACGTCCTGCAGCATCCGCCGCTCGGGCGAATGATAGAGCACGTCGCCGGTGGCGACGGTAGGCACCCCGGCGGCCGCAGCCTGCGCAGCCAGATCGCGCAGCCGGATCGCGTCGCGCGGACGCCGGCGCAGCGTCAGCGCCATGTACGCCCGTGGGCCGAACAGCCGCCGGCAGCGCGCGAGCGCCGCCTCGTTTGCGGCGTCCGCCCGGTCGGGAACGAGGATCGCGATCAGCCCCGCCGCCCAGTCCTCGAGATCCGACCAGTCGAGGTCGCAGCCGCCCTTGCCGGCGCGGCCCTTGCCGATCGAAAGCAGCCGGCAGAGACGCCCGTAGGCGTCGCGGTCGGTGGGATAGACGAGCAACGATGTGCGGCAGACGAGATCGAGCCGGCAGCCGACGATCGAGCGCACGCCGGTCACCTTCTCCGCGTCCATCGCCCGGACGAGGCCGCCGAGCGTATTGCGGTCGGTGACCCCGAGCGCCGGCAGGCCGAGCAGCGCCGCCGCGGCGAACAGCTCTTCCGGGCTCGACGCACCGCGCAGGAAGCTGAAGTGCGTGGTCACCTGCAGCTCGACGTAGCGCGCACCGTCCAGGGGATAGGCGCCGGTCACGCGAAGGCTCCGTGGATGAACCACTGCATCGGCCCGGTCGCGGGCTTCTCGCCGTCGCCGAGCCGGAACAGCCAGTAGCGCCCGCCGCCGAGCGTCTCGACCTGGAAATAGTCGCGCACGGTATAGGGCGCATCGGCCTCGGTGCCGCCGTCACGCCACCACTCGCCGTGAAGCCGCTCGGGCCCGTCGGCACGCACCACCCGGTGGCGCTGGCCGCGCCACACGAACAGCCGCGGCGGATGATCGGGGAGCAGCGCCATCACCTCGATCCGCTCGGGCGGATCGAGCATCCGCGCCGGCCGCGGCAGGTCGTCCTCCCACACAATTGCTGCGGGCGGGGCCAATGCCGGGATCGAGCCCACCTCGCGCTCGGGCATGGCGCTTTCGCGCGGCGACGCGCGATATAGGCAGCGCTGCCCGAAACGGTTCGCCAGCGCGTCGACCAGCGCGGGCAGGTCCGGGCCACGAGCGGCGGGGCTGGCAAGCCCGTCGGCCTGCGACGGCGCGAGCGGCTCGGAAAGCGGACAGGCGAGCGTCATCCCCTCGATGCCGAGGCCCGGCTCGATCTCGCCGATGCGCGCGTTCAGCAGCCGCGCAAGGTGCGGCTCGTCGCGCGTCGGCTTGGCGGTCCCGACGCGAACCGCCTGGACATGGCCGTCGACGCGCTCGAACAGCAGGTCGAGCCGGCGTGCCCCGCGGCCTTTGGCAAGGAGCTGGGCCGCCAGATCGCGCACCAGGTCGCCGATGACCTGCTCGAACGCCTCCGGAGTGCCGATCGGCTCAAGCAGGCGCCGCGTTGCGCGCGGCACTTCGCTGGCGACCACCGGATCGATCGGCTCGGGCAGAGTGCCGAGCGCCTGGTCAAGACGGCGGTAGAGCTGGCGCCCGAAGCGCTTGGCGAGCGGCGCGCGCGGCGCGGCGATCAACTGCTCGATGCGCGAGAACCCCATGCGCACGAGCTCGGCCGCGACGCCGGGCTCGATCCGCAGCGCCGAGACCGGCAGCAGCTCGAGCGCCTGCACAGTCCGGCGAGGTTCGACATGGATGGGCCGCGCGCCGCGGACATGGCGCGCCACCGCATGAGCGCAGCCGGCGGTATCCGCGACTGCGACATGCGCAGCCAGACCCGACGCGCGAACCCGGCGGTAGAGGTCCTTGGCCAGCGCGATCTCGCCGCCGAACAGGTCCTCGCAGCCCGTGATGTCGATCCACAGGCCGTCGGGCGGATCGGACGCGACCTGCGGCGAGTAGCGCCGGGCTGCCCACAGCGCGAGCCGCCGCAGCGCCTCGCGGTCGCCATCGAGGTCCGCATCCACCACTTCCAGATCCGGCACCAGCGCGCGAGCCTTGGTCACCGTCATGCCCGGAACGACGCCCATCGCCAGCGCGCGCGCATCGGGGCTGGCGATCACGCGCCGGCCATGATCCGGCAGGACGGTGGCGAGCGACGGGGCGTCGTCAGGCGGCGGCTTGCCGCGCTTCCGCCGCAGCCGGTCCGTCGGCCAGTGCGGCAGGAAGAGCGATACGACCCTGCGCATCGCAGGCCTCCACTATCCAGTGTTTGGGGTCGCCGCCGCGCGCCCGCTCGAGTGTCACCGACCAGCGCGGGCGACCGAGGCTCGGCAGGCCCATTTCCTCACTGGGGGCTGCACGCACCCGCCAGCGGGTCGTCGCCGCGCTCCCCGCTGCCGCCGCGTCAACCGGCGAGGCCCGGCGGAAGACGAAAGCCGGCACGCCGGATTGCTCTGCCGCGAGCTGGAGCCGGCGCGAGGCCGTGAGCGAAAGCTTTCCTGTCTCTCCGACGACGCCGCCGAGGCCCGCATGGCGCAGGCATTCCTCCATCGCGATGAGCACGTTGGTGTCGCTGCCCGCCTCGACAAAGATCACTCGGTCGGGGTGGAGCCCGGCAAGATGAAGCGCGGGCGCAAACAGATCCCGCCAGCGCAGGCACCATAGCACCGTGCCTTCGGTGCGGGCGAGGATGCCGGCGAGGAACACGGTCGCCGCAGCATCGTCGCTGAGGTCCGGGCTGCCGGCGATTTCGTGGAGCGCGCCCGTGGCAATCCCGCCGGCGGGCAGATGGCCGTCGATGGCTTCAACGCCGAACGGCAGCGTCTCGTGGCGGGCGCCCACGCTCTCGATCCGCGCGATCTTGGCGCGCAACTCGGCAATGGCGGCAGGCTGACGCATGGCATCCCAAAGACTCGGAAGGGCGGAAGAATGTTCCCTCTATGTTCTCACCCTCGCCCGCGTGGAGTCAAGCGCGAAGAAACCGTCGCAGCGACCGCCGGGGAATGGACGGAAGTGGGCGCAGCTAGCCCGCGGCAGGGGGAGCGAGGGTGCCGAGCCAAGCCACGACCGCGAGGATCGCGGCGCCAGCCGCCGCCTCCAGGCCCACGCTGAGCCGCAGGGCGCTGACTGCACCCGCGGTGTCGCCGCCCGCAAGTACCTGCCCGAGCCGCGGCGTATGCCGGAACCGGTTGGCGGCGGCGAGGCAGACCATCAGCAGGAACAGGGCGATCTTGAGCAGCAGGAGCCGGCCATAGCCAGTCGCGGCAATTTCGCCGATACGGTCGGGCCCGACCAGGACCCAGCTGTTGACCACCCCGGTCAGCAGCAGGACGGCCACCAGCACCGACCCCAATCCGGAGAAACCGTGCAGTCCCTGGTGAAGCGCTTCGCGCTTCCCAGGCGCATCCGGCGAGCCGAGCAGCAGCGCGAACCCGGCGAGCGCCCCCAGCCAGACGGTTGCGGCGAGGAGGTGGAGAATGTCGCTGCCGAGGTGGATCCACCCGAGCGTGCCTTCGGTCGCGGCGGCGTGGCCCAGCCACGCGAGCGAAGCTGTCGCGCCAGCGCCGAGCGTGGCCGTCGCGACCAATGTTGCCCGGCCCGGGCTCGCCAGGATCACCACCAGCAGCGCGGCAAGCGCCAGCACGGCTCTGGCGGCAGCCGCCTGGCCGAGGGTCATGTAGGAGACGACCGAGCCCATCGCCTCGAGAGTGAACCCGCGAGCGAGCGAGCCTTCGAACAGGCTTGCCTGCGCCGCGATCGCCAGCGCGGAGCCAAGGGCCAGCAGCGCGGCTCCGCCCGCCAGGACCCACCGGGCCCAGGGCAGGCGTGACGGCGACGCGGCCCCCGCGGCGGGCAGCGCGTGGAGCAGGAACAGCGACAGGCCGAACAGGATCGTCGCTCCTGCATATTGGACGAACCTGAGCCCGATGATCGCAGGCTCGATCACCCGTCAGGCGACCTTGAAGTTGAACGTGCCGCTCATCTTGTGCCCGTCGGCCGAGGCCGCGGTCCACACGACCCGGTAGTCGCCGCGCCGCAGCGCCGATGCTGGGGTAGCGACGACACGCTTGCCGTCGCGCAGGACCGTCACGTTCACCGGCACCTTCATGCGGTGCGCCGGCATCGTCAGCTCGACCTTCGAGAAACGCGGCACCAGCCGCTCGTTGAAGGTGAGAGTGATCGTGCGCGGCGCCGTGCGCAGCGTGGCGTTTGCCGCCGGACTGGACGATTCAAGCTTCGCGTGAGCGAAGGCGGGAGCGGCGCTGAGCAGCGCCGCCGCTCCGCCGAGCGCAATCAATGAAGAGACTATTCGAGACATTACAAACCTCCTTGCCACGTTCCCCGGATCAGCATTCGAAAATCATAGGTATTCCGCCGTGCATTCATGCTGAACCTGCCCGACACCCGTCATTCAGCCCGGCCTTTCGTCCGTGCGCGACCATCGCGGGCGGGCGCGGCCGAACGGCTCGAACCTATTGCCCGCTCATATTATGCCCGGCGTGCTCGTCAGCCTCGGCGGCAGGCTCGGGCGTTGGCGTCTGCGTCGGACGCGGCGAAGGCGTAGAGCGGGCCGGAGCCGGACGGCTGGTGGCCGCCGGCGCGGGCGCGGCAGGCTCGGCCGCCGAGGCCCGGGTCGCGGAAGGCATCGGCTCGCCGTTCAGCATCGCCCGGGTCATCTCCGCCTCCTCCTGCTGGCCCGCGCGGGTCTTGTTGACCTGTTCGCGCACCGCACCGGTGACCCCGCTTGCGAGAGCCGCGTCGGACAAGGCCACGCCGCCCTCGTGATGGGCGAGCATCTTGCGCATGAACGTCTCGCTGATGTCGGCGCCCGAGGCCGCCATCATTTCCTGCTGCATCTCGGTCATCGCGGCCTGGAAGGGCTGGCCGCTCTGCGGGTTGGGCGAGCCCTCCTTGACCAGCTTGCGCAGCGCCTCGACCTCTGCTGACTGCTTGTCGATCGATTCCTGCGCCATCTGGCGGACGTGGTCGCTCACATTCTGCTGCAGCCCGATCCGCGACATGTCGATCGCCCCCTGGTGATGCACGATCATCTTGCGCACCCAGCTGTCGCCGACGTCGGTGCCGACGGCCGCCATCATCTCGTCGTTCATCTTCTTTTCGGCCGCGGCGAACGGATTGTTCGGATCGACCGCGTCAGCGGCATCCGCCTGTTCGGTAGCGGCGGGTTCGGCTTCCTCGTTCGATCCGCACCCGGCGGCGGCAAGCGCCGCGCCGAGGGCAATCGCCGTTGAGATTATCCTGCGCATTGATCTGACTCCTTTCCCGAAGTGGCTTCCTGGCTGATGGAGGGGCGCTTGTTCACCGCTCAGCCGACGGCGAGAAATCCAGCTTGCCTGTCCTCTTGGTCATGCAACGAACGGCTCGGCCCGGCCGGCACCGTCGAAGGCCATGACCTCGAACGGGTCCTTCGAACCGTCCGCCATTTCCATTCCGGGCGAGCCGCGCGGCATGCCTGGGACCGCGAGGCCCTTGATGCCCGCGGGCCTGGTCTCGAGCAGCTTGGCAACGTGCTCGAACGGCACGTGACCCTCGATCGCATAGCCGCCCACGATGGCGGTGTGGCAGGAGCGCAGACTGTCGGGGACGCCATACTGCTTCTTGACCGCGTCGATGTCGGGGCGGTCGACGACGCTCACCTGGTAACCGGCCTTGGACGCGAGATCGGCCCAGGCCTTGCAGCAGCCGCAGCTCGGGTCGCGGTAGACGGTCATCTCGGAGGCGAGGGTCCGGCCGGGCGAGGGAGCTCCCTCCGGCGCCTCGGCGGCCTGCTGCGATGCCGGCGTTCCGGCGCGGCCGCACCCGGTGGCAACGAGGCCCGCGAGCGCGGCCATGCCGCCGACGATGCTTCGGCGCGTAACTCTCAGGGTGGTTCGATCCATCATAAGTCTCCAGCTGGCAGCGTCACAGACGCTCCATGATCCGCTTCATCTGCGCGATTTCCCGTTCCTGCGAGGCGATGATCCCGTCGGGGCCGAAGCAGAGCTCGCGGATTTCGGGATCGTCGAGCGGCGCCTCGCGGCACATCAGGATCGCCCCCGAATGGTGCGGGATCATCGAGCGGACGAACTGCTTGTCGCCGACGAGCGCCTGGGTGCGGACCGCCGCGAACGCGAGCACGAACACAGCGGCCAGCGCGGCGTAAAGGACCAGGTTGAGCCGCTTGTTGGGATACATCGAGCCCATCATCAGCAGCATCAGCACGCCCATCGGCATCGCCATGGTGAGCGCCATGTAGAAGAAGTTGATGTTGTGGATGAACTCGCCCCAGCCGCGGATCATCTCGAACATCACCAGGTACATGATGAGGGTGCTGATCGCGAGATTGAGCGCGAGCATCCAGTAGGGCTTCGCGCCGGATGAAGCGCCATGCTGCGGATGGTGGTCGATCATCACAAATCCTTTCGCTCAGGTCATTCGTTCCTCGTCAGAACCACCAGCGGATGCCGGTCAGCACGCTCCAGCCGCCGGTTTCCTCGCCGGCGGCGCGCAGCAGCCGCCGGGTCTCGCCGAACGCGCGCCGGTACTGCACCCCGATGTAGGGCGCGAACTCGCGGCGGATGTCGTAGCGCAGCCGCAGCGCGGCCTCAGCTTCCGTCAGCCCCGAACCGAGCCTCAGCTCGGGCACGTCCTGGGCCGAGAAGTTGACCTCGACGTTGGGCTGCAGGATCAGCCGCTGGGTGATGCGCTGGTCGTAGCTGGCCTCGGCCCGGGCCATCAGCTCGCCCTTGTGCGAGAGGAACAGCGTTCCCTCGACATCGAAGAAGCTCGGCGCCAGCCCCTCGACGCCGACGACCGCGTAGGCGCGCGACGGGTCGGGCTCGAAGTCGTAGCGCAGACCCCCCTGCACGTTGAAGTAGGGGCCGATCGCGCGGCTGTAGAGTGCCTGGACCTCGGCCTGCTCGACGGGCCCGCCGAACGTGCCGTCGCCCTCGCTTTTCAGCCACAGGCGGTTGATGTCGCCACCGTACCAGGCCGCCGCATCCCATTCGAACCCGTCGCGCCCCTCGCGCAACTGGGCCTCGGCGACATTGAACATCACCTGGTAGAGCTTCTGCCCGCCGTGGAATTGCTGCAGGTGATGCCGCCCCATCTCCATCGCGTCGCTGCCGTAGACGGCGTCGGCGGCTGCTTCCGCCGGGATCGGCGGCGGAGGCGCGCTGCCCGCCGGGAGGTCGGTGCCGGACGGCCCGGCAGCGGCGCCGGGGGATTCAGCAGCCGGCATCGCATGGCCGGCGTGCGGATCGGCCGGCTGCGCGGGCTCTTCCGCGGGAGCGGCCATCGAGTGCCCCGCGTGCGGGTCCGCGGCCTGCGGCTCGGGACTTGCCGCCGGGGCCGGCATCGCGTGTCCGGCGTGAGGATCGTCTTCTTCCGCTGTCTCAGCGGGCGGTGCGCCCGTGTCATGGTCGGCGTGAGGATCGTCCGCGGACGGCGGTGGTTCCTGCTGCGGTGCCGGGGTCATGACATGGCCGGCGTGGGGATCCGGCTCCGCCGGGGCGCCGGGCTGGGCCGGCGCGGCGTGGCCTTCGTGCTGGGCGAGCGCGGGCGTGCCGAGGGCGAGGCCGGAGGCCCCGAGGGCCAGAGCGAAGAGACGCCGGCTCATTGACCGCCTCCCTGTCCGGCTTCGTCGCGGACCGTGACGACCCGCATCATGCCTGCGGTCATGTGATAGAGATTGTGGCAGTGAAACGCCCAGTCGCCGACCGCATCGGCGGTCAGGTCGAAAGTCATCTTGCCGCCCGGCGGCACGTTGACTGTATGCTTGCGCGGACCATGGGCGCCGTGGCCGGTCACGAGCTCGAAGAAATGCCCGTGGAGGTGGATCGGGTGCGGCATCATCGTGTCGTTGACCAGCGTCACCCGGACCCGCTCGTTGAGCCGGAACGGGATCGGCTCGGCCGGCTCGCTCAGCTTCACGCCGTCGAACGACCACATGTAGCGCTCCATGTTGCCGGTGAGGTGGATCTCGAGCGCGCGCGACGGCGCGCGCACGTCCGGGTTCCGCTCGAGCGCCATCAGGTCGCGGTAGACCAGCACCCGGTGGCCAAGGCCTTCGAGACCGACAGGCGGCTCGCCGGTGCGGTCGCTCGGCATGGGCGAGATCGACTGCACGCCCGGTCCGGGCTTGACCTGCGGGGCGTTGCTGAAGTCCCGCATATTGTGCTCCATCCCGGCAGCGCCGTGGTCCATCCCCCCCATCTCTCCAGTGGTCGCGGCAGGAGCCGCTTCGTGCCCCATCGCCGCGTGATCGGCCGCCGTCGGCGCAGCGGCTCCCTGACCCATCGCCGCGTGGTCGGCGGCTGCGGGCGGCGAGGCGCCATGCCCCATAGCGGCATGATCGACGCCGGTCGCCGGCGTGCCGGGGTGCCTCATGGCGGCATGATCGGCGGCAACCGCTGCGGCTACCGCGGTGGTCGCCGGGCCGGTCCAGCCGGTGGCCTTCCAGAGGTCGCGCGAGGCATTTTGCTCGGCCGACGGATCGACGCCGCGCACGGCGGCCGGATTGGGCGCGTTCGGCGCGTCCATGCCCGTGTGATCCATGCCGCCCATCCCCATGTCCTTCATGGTGAGCAGCGGGCGCGGCCGCAGCGGCGGGACGGGCGCCGACATGCCCTCGCGCGGGGCGAGCGTGGCGCGGCCGAGCCCGGAGCGGTCGATCGCTTCGCTGACGAACGTATAGGCGCGGTCGGCGGGCGTGACGATCACATCGTAGGTCTCGGCGACCCCAATCTGGAACTCGTCGACCACCACCGGGCGCACGTTCTGGCCGTCGGCTTGGACAACCGTCATCGCCAGGTCCGGGATGCGGACGTTGAAATTGGTCTGTGCCGAAGCGTTGATGATCCGCAGGCGCACCCGCTCGCCGGGCGTGAAGAGCGCGGTCCAGTTGTCGAACGGGCCGTAGCCGTTGACCGTGAAGTGGTAGGTCGAGCCGGTCACGTCCGAGATGTCGGTCGGGTCCATGCGCATCGCCGCCCATTGCTGGCGTTCCGACGCCGGCAGGTCGCGCCCGGCGAGAAGGCCCGACAGGGTCAGCCGCTGCGTGTTGAAGTAGCCGCCGCCCATCTGCTTGAGCTTGCGGTAGATCATGGCGCCGGTCATCCGGCTGTGGTCGGCGAGCACGATCACGTGCTCGCGGTCGTAGGCGACCGGATCTGCGCCGGCCGGGTCGATCACGATCGGCCCGTAGACGCCGTCCTGCTCCTGGTAGCCCGAGTGGCTGTGGTACCAGTAGGTGCCATTCTGGCGGACCGGGAACTCGTAGACGAAGGTTTCGCCCGGATTGATGCCGGGGAAGCTGACGCCCGGAACCCCGTCCATCGCGAACGGCACGAGGAGGCCGTGCCAGTGGATCGACGTCTGGGCGTCGTCGTGATGGCCGGCGGGAAGGGCGTTGGTCACCCGCAGCCGCACGTTCTGGCCTTCCCTGAGGCGCAGCAGCGGCCCCGGGACGGTCCCGTTGATGGCGACGGCGGGGCTGGTCTTCCCGTCCACCGTCACCGCCGCGTGACCGATCGACAGCGCGATGTCGGTGCCGCTGAGGACGGGAAGCGGGCGGGCGATGCCGGCCGAGACCGGCTGCGCCCATGACGGCATCGCGCTGGCAAGACCGGCGCCGCCACCGCAGATGGCCGCCGCGCGCAGGAACGCGCGCCGCTCGAGGGTGAGCGATTCAAACATGGATGATCCTTCGCTGGCCACGTGAACTGCCGAAGATTGGCGGGGCCGGACGAGCCCCGCCAACCCCGTTACGGCTCAGTGGTGCTGGTGCGCCGGCTGCGGCGTGCCCGCCGACCCGGAATGCTCAGCATGGGCACCGGCCCCGGCGGCTCCGCCGTGAGACGAATGGTCCATCTGCTCCATGTCCTTGCAGCAGCACTCCTTGCCCTCCGACTTCATCTTCTCGCAGCAGGCCATCGGCTTTGCCTCGGGCGCTTCCTGCGCATGAGCAATGGCGGGGAGTGCGATCGCCACGGCGACCGCAGTCAAAAACGTCTTCATTGTCCAACCTTTCGTGAAATCGTTCGAAATGATCGATTTCACGCGAGGCGCGGAGGCGGTGTCGGCAACTCGGCTAGATAGCTCGGCCCGCTCTCGGCGAGCGCTGTTCCGCTCGTGGAGCCGGCGAGCGGCGCCGGCTCGAGCGTGACCACCGGCGGCACCACGATGGCCGTGCACATGGCCGCGCAACAGACCTTGCCGCTCGAATGTCCCTTCTGGTCCGACTGCTTCTCGTGGCAGTCACCGCCCTGCATCATCTGCGCGTGATGGTCGGCCGGCGCTGTCGCCATCGCCGCACCGCCCTGGAGCGCCAAAGGCGCCCACAGCAGGGCTATGGCAGCGAGCGCTGCGAGAAACCGTCCGAGGGTCATGCGGGGCAGATTTGCGCTCATCGGCTGATTGGTTCAACCATACCTTTCGCTGATGGTTCCGGTTTCGTCACCACGTTTGGTCCGGCGGATCGACTCACTATATAGCCCCCGTACCATAGTACGGAGTCAAGCATGACAGCGATGACGATCTCGCAGCTGGCGAGGAGCGGCGACGTCGGCGTCGAGACCGTGCGCTACTATCAGCGCCGCGGCCTGCTCGCGGACCCCCGTCCGCAGAAGAGCCGAACGTCCGGCACCCGCCACTATGGCGAGGACGAGGCGCGCCGTTTGCGGTTCATCCGCTCGGCCCAGAACGCCGGCTTCACGCTCGAGGAGATCCGCGAGCTTCTTGACCTCGACAGCAACGGCGACCGCGCGCGGGCCCGCGAGATGGCGACGGCCCGGATCGAGGCGCTCGATGCGCGAATTGCCGAACTTCAGCGGGCGCGGCAGGCGCTCGCCTCGCTCGCCCGCGACTGCGCCGCCGCGAAGAAGGGGCCGTGCCCGATCATCGCCTCGTTCGAAGCGTGCTGAGCGGCCCCGGCTCGTTGCGGGCCAATGCCCGCGCTCACTCGCAGCTCGTAGACCGCAGGGCGCCATCACAGCCTCACGCTTCTAAGCCGTAGCGAATTGCCGATGACCGCCACCGAGCTGAACGCCATCGCCGCCCCAGCGATGATCGGGCTGAGCAGCAGCCCGAACCACGGATAGAGGATCCCGGCGGCGATCGGCACACCGGCGGCGTTGAACACAAACGAGAAGAAAAGGTTCTCGCGGATGTTGCGCATCACCGCGCGGCTGAGGCGCCGCGCGCGCACGATCCCGCCGAGGTCGCCCTTGACCAGCGTGACCGCAGCGCTCTCCATGGCCACGTCCGTTCCTGTTCCCATGGCGATGCCGACGTCGGCCGCGGCCAGCGCGGGCGCGTCGTTGATGCCGTCGCCCGCCATCGCCACGCGGCGGCCCTCGCCCCGCAGCTTTTCGACCATCGCCTGCTTCTGCTCGGGCAGGACTTCGGCGATGACCTCGTCGATGCCGCCGATCCGGCGGGCCACCGCATCGGCGGTCCGGCGATTGTCGCCGGTCATCATCACCAGGCGCACGCCGTCGCGGCGAAGCGCCTGCACCGCCTCGACCGCGGTGTCCTTGATGGGATCGGCCACGACCAGCAGGCCGGCGAGCTGCCCGTCGACCGCCACGAACATCACGCCTTGCCCCTCTGCCCGGAACTGGTCCGCCGCCTTCTCCAGCGCGCTTCCATCGACACCGACCATCTCGAGCAGCGCCCGGTTGCCGAGCGCGACGTCGCGCCCGCCAATCCTCCCGGTCACGCCCTTGCCGGTGTGGGACTGGAACTCGGCCGCCTGCGCGAGCTCGAGCCCCCGTTCCTCGGCGCCGGAGACGATGGCCGCCGCGAGCGGATGCTCGCTGCCCTTTTCGAGGGCCGCCGCAAGCCCGAGCACTTCTGCCTGGTCGAACCCCGATGCGGTGACGACCTCGACGAGCTTGGGCTTGCCAACCGTTAGCGTGCCGGTCTTGTCGACCACCAGCGTGTCGACCTTCTCCATCAGCTCGAGCGCCTCGGCGTTCTTGACCAGCACACCGGCGCCGGCGCCGCGCCCCGTGCCGACCATGATCGACATCGGCGTGGCGAGGCCGAGCGCGCAGGGACAGGCGATGATCAGCACGGCCACGGCGTTGACCAGCGCGTGGCTCAGCCGCGGCTCGGGGCCGACAAGCGCCCAAACGATGAAGGCGAGCACGGCGACCGCCACGACCGTCGGCACAAACCAGCCGGAAACCTTGTCGGCGAGCGCCTGGATCGGCGCCCGCGAGCGCTGGGCGTCGGCGACCATGCGGACGATCTGCGACAGCATCGTGTCGCGGCCGACGCGTTCCGCGCGCATCATGAGGCCGCCGGTGCCATTGACGGTGGCGCCGGTGATCCTGTCGCCGGGGGTCTTCTCGACCGGGACCGGCTCGCCGCTGATCATCGACTCGTCGACCGAGCTGCGGCCCTCGACGACCACGCCGTCCACGGGAACCTTCTCGCCCGGGCGTATGCGCAGCAGGTCGCCGACTTGGACGTGCTCGAGCGGGACGTCCTCCTCCGCGCCGTCCGCCGCGACCCGGCGCGCTGTCTTGGGCGCCAGCCCGAGCAGCGCGTGGATGGCCCTGCCAGTCGCGCTGCGCGCCCGCAGTTCGAGCACCTGGCCGAGCAGGACCAGCGTCACGATGACCGCTGCGGCTTCGAAGTAGACCGGCACCAGCCCGCCCATCGTCCGCAGCGATTCCGGGAAGATCCCGGGAGCGACCGCGGCGACGACGCTGTAGAGGTAGGCGACCCCGACGCCGAGGCCGATCAGGGTGAACATGTTCAGGTGGCGCGACCTGAGCGAGGCCCAGAAGCGCTCAAAGAACGGCCAGCCGCCCCACAGCACGACCGGCGTCGCGAGCGCGAGCTGGACCCACATCGACGCGCGCATCGGCAGGAGCTCGAGGCCGAACACCTCCGCGCCGAGCGTCAGCACGACCAGCGGAACCGCGAGCAGCGCGCTGCCCCAGAACCGCCGGGTCATGTCGATCAGCTCCGGATTGGGGCCTTCGTCGGCGACCGGCTCGAGCGGCTCGAGCGCCATCCCGCAGATCGGGCACGTGCCCGGCCCGTCCCGGCGAACCTCGGGATGCATCGGGCAGGTCCAGATGCTCCCCTCGGCAGCCTGCGGCAGCGCGCCGAGGGCCGGGTTTGTCACCGCCGGGTCATGCTCCGGCGGGTTCAGATAGCGGTCCGGGTCGGCTTTGAACTTGTCGAGGCAGCGTTCGCTGCAGAAGAAGTAGGCGTGGCCGCCGAGCTCATGCCGGTAGCGCGCGGTCTGCGGATCGACCGTCATGCCGCAGACGGGGTCGCGCATGACCGCGGCAGCCTCTCCGGCATGATGGTGGGAATGGTCGTGGTGCTGGCACTGGGACATCGGAAGGCTCCTGTCGCTTGGCGTGCATTTCGCAGTACGACTCGAGTTTACCTTCCGTACCGTGGTACGGAGTCAAGCGACGAGTGACACGCCCACGTCCCCGCGGGCGGCGATGCTGTCGAAGGCACAACGCCCGAGCGCCGGCCTGCTCCCGCTCGGAGCCGTATCCGATCCGCATCTATGGTTCGCCGATTCAATTCGATAAATATGGAAATGACTTGACGTGAGGGACCGGAACCCTCATTTCGATGAAAGTCAAAGAGAGGATCCCGCCAATGGACGCACAGTCGGCAGTCTCGGCCTTGGGAGCGCTGGCGCAGGAACACCGCCTCGCGTTGTTCCGGTTGCTGGTACAAGCGGGTGAGGACGGGATGCCGGCCGGCGCCATCGCCGACGCTTTGGGCGTCCCGAACTCGTCGCTGTCCTTCCATCTCGCGCAGCTGAGCAAGGCCGGGCTGGTGCTTCAGGAAAGGCGGCACCGCTCGATCATCTACCGGGCCGACTATGGCGCGATGAACGATCTTCTCGACTACCTGATGGAGAATTGCTGCGCTGGTGCCGATTGCGGCTCCACTCCCGGCTGCGCCGTCGAAAATCCTGAAACCCAACCCGAAAGGAAATCCGCATGAAGCGTCTGCACGTGCATGTCGGCGTCGAGGACCTCGACCGCTCGATGTCGTTCTATTCGACCCTGTTCGGCGCGCAGCCCGTCGTGGTGAAGAGCGACTACGCGAAGTGGATGCTCGACGATCCCCGCGTGAACTTTGCCATCTCGTCGGGCCAGCATGCCGCCAAGGGTATCGAGCATCTCGGCATCCAGGTCGAGAGCAACGACGAGCTCGCGGAGGTCTATGGCCGTCTCAAGGCGGCTGATGGTCCGGTGCTGGAGGAAGGCGCCACCACCTGCTGTTATGCGAAAGCCGAGAAGAGCTGGATCGCCGATCCCGACGGCGTGGTCTGGGAAGCCTTTTTCACCAACGGCGAGGCGACTGTATACGGTGACAGCCCTGCGCTCGGCGCGCTTTCCGGCAACGCCGCCGAGAACGCCTGCTGCGCCCCCGCCATGCCGGCACCGCAGGTGGCCTGCTGCAAATGAGCGTGACCATCTATCACAATCCCGCCTGCGGAACCTCGCGCAACACGCTGGCGCTGATCCGCGCCACCGGCTCCCAGCCGGAGGTGGTGCATTATCTCGAAACGCCGCCGAGCCGCGAGGAACTGGTCTCCTTGATAGAGGGCATGGGCATCGGCCCGCGCGACTTGCTGCGGCAGAAGGGCACCCCCTATGCCGAGCTGGGCCTCGACGATCCGGCGCTCACCGACGACCAGCTCGTTGACGCGATGATCGCGCACCCGGTCCTGATCAACCGACCGATCGTCGTGGGACCGAAGGGCGTCAAGCTCTGCCGCCCGTCCGAGGAGGTGCTGTCGATCCTCGACCGGCCGCTCGAGGCGGATTTCGTCAAGGAAGACGGCGAGGTGGTGCCCGCCAATGGCTGACCAGACTGTCGCGGCGAAGCCCGCCCTGACGACCTTCGAGCGCTATCTATCGCTTTGGGTGGCACTCTGCATCGGGGTCGGCGTCGCGCTCGGTGCGGCGCTCCCCGGCGTGTTCGCAGCCGTTGCCGCGGCGGAGGTGGCGCGGGTCAACCTCCCCGTCGCGCTGCTCGTCTGGTTGATGATCATTCCCATGCTGCTGAAGATCGACTTCGGTGCCCTCGGGAAGGTGCGCCAGCACATGCGCGGGGTGGGCGTCACCCTTTTCATCAACTGGGCGGTGAAGCCCTTCTCGATGGCGCTGCTCGGCACGGTGTTCATCGGCTGGCTCTTCCGGCCGCTGCTTCCGGCCGACCAGATCGAAAGCTACATCGCCGGCCTGATCCTGCTCGCCGCGGCGCCGTGCACGGCGATGGTGTTCGTCTGGTCCAATCTCTGCGAAGGCGAGCCCCATTACACGCTGAGCCAAGTCGCGCTGAACGACCTGATAATGGTCTTCGCCTTCGCGCCGCTGGTCGGCCTGCTCCTCGGCGTCGCCTCGATCACCGTGCCGTGGGACACGCTGCTGCTGTCGGTGCTGCTCTACATTGTCGTGCCGGTCATCGTTGCCCAGCTCATCCGCCGCCAACGCCTCCGCGCCGGCGGTCAGGCGGCGCTCGACGCCTTGCTCCGCCGCCTTAGTCCGGTGTCGCTTGTGGCGCTTCTCACCACGCTGGTGCTGCTGTTCGGCTTCCAGGGCGAGGCAATCCTCGCGCAGCCGCTCGTCATCGCGCTGCTGGCCGTGCCGATCTTGATCCAGGTCTATTTCAACGCCGGGCTCGCCTATTGGCTGAGCCGCCGGTTCGGAGTCGCCTGGTGTGTGGCCGCCCCCGCCGCGCTGATCGGCGCAAGCAACTTCTTTGAGTTGGCGGTCGCGGCCGCAATCAGCCTGTTCGGTCTGCAGTCGGGCGCAGCCCTCGCGACGGTTGTCGGCGTGCTGGTCGAGGTACCCGTGATGCTGTCAGTCGTCGCGGTCGTGAAGCGAACGCGCGGCTGGTACGAGCGCCAAGCCCCCATTTCTTCAACATGATCGCACTCCTCTGCGCGATCCAATGCGATCGAGCAGAGGAAAGGGGGCTGTTGCCCTGATCGAGCCCGCTGGGCGTCGATAGTCGACGCGCCGGACTCGCAATCAGGAGACAGTCCATGATCAAGACGATTCCGCTGAACAAGCTCGTTCGGTCGCCCCGCAATGTCCGCCGGCACGCCGACTGTGCCGCCGATGCCGAGCTGAAGGCCAGCATTGCCGCCCACGGCCTTCTGCAGAACCTCGTCGTGCGTCCTGCCGCGAAGGGCAGGTTCGAGGTCGAGGCCGGCGAGCGCCGTCGCCGTGCGATGCTTGCCCTTGTCGACGACAAGGTCCTGCCGCGCGGGCATGAGGTCACCTGCCTAGTCCTTGAGAACGACGACAGTGCCGTCGAAACCAGCCTCGCCGAGAATTTCCACCGCCTGGCGATGAACCCCGCCGACGAGGCGCAGGCTTTCGCGTCGCTCGTGGAAAGCGGGGTTTCCGTCGAGGATGTCGCTCGCCGCTTCGGGCTCACCGTACGCTTCGTCGAGGGTCGCTTGCGCCTCGCCCAGCTTGCGCCTGTCGTCTTCGAAGCGCTCGCGGCCGGCGAGATTACGCTCGACATCGCCAAGGCCTTCGGCGCCACGTCCGATCAGGAGATTCAGGCCCGCGTCTTCGAGCAGGCCTCCTCGGGCTACTACGCGCCGAGCCCGGACAGCATTCGCCGAATGGTGCTCTCGGGCACCGTGCGGGGCAGCGATCCGCGCGCCCGGCTCGTCGGCCGCGATGCCTATGTTGCGGCCGGCGGCCGCATCGAACGCGAGCTGTTTGACGATGACGACAGCGAGTCCTGGGTCGATGTGGCGCTTCTGGAAAGTCTCGCCCAGGCACAGATGGAAGAGCAGGCCAAGGCGATTGCCGCGGAGCAGGGCCTCGCCTGGGTCCGGCCAACGCTCGACTCCTACGCGAGCCACGACCTCGTCGACGAACTGGTCCGGCTTCCCGCCGAACCGGCGCCGCTGACCGAGGCCGAACTTGCCCGCCTCGAGGCGCTTGACGCCTCCTGGGACGAGCACGCGGCGATCGTCGAGGACGAGGACAGCGCCGAGGAAGCCGTGGCCGCGGCCGAAGCGGCGATCGAAGCGATCGAGCGCGAGTGCCAGGAGCTTCGCAACCGCCCCCCTGTGCTCGCGCCCGAGCTCAAGGCCGAGGCAGGCATGATCCTCACGCTCTCGCGCGACGGCACGCCGGTGCTCCAGCCGGTCTTCTATGGCGAACGCCACGTCGCCGCCGGAGCCGAGGATGAGGGCATCGAGATCGTTCCGGCTGACGCAGGCGAAGGCAAGCGCCGTTCGGCTCTGTCAAAGCGGCTCGTCGACGAGCTCGCCATGCAGCGCCGCGACATCCTGTCGCTGCACATCGCCTCCGACCCTGGCCTGGCGCTCGACCTGCTGGTCTTCTCGCTCGCCGATGCCGACACGCACGACTGGCGTTCGCGCGCGTCGACCACCTTGCGCGGCGGGGTGCCGGCCGGGCCCATCGTCGGCTTCGAGGCCAAGGATGCCCCGGCGAGCGCGGCTCTGGCCGAGCTCAAGGCCGGGCTCGACGAGAGCTGGCGGGCCGGCAAGGACGTGTCGGCGCGGTTCGACCATTTCCGGGCGCTGTCCGATGAATCCCGCGCCGCCTGGCTTGGCTTCGTCGTCGCGCGCACGCTCGAGGCCAGCCTCAACATGGCTGGCGAGAGGCGGATCGCATTCCAGGACCATCTCGGCTGTCTCATCGGCATCGACACGGCCCAATGGTGGCGTCCGACCGCGGCCAATTACTTCGATCGGGTGTCGAAGCAGGTGATCCTCGACGCGCTCGCCGACGTCGGCGGCACGGAGCTTTCCTCCCGCTTCGCATCGGTCAAGAAAGCCGACCTCGCGATGAGCGCCGAGCGCGTTTTCGCCGGAACCTACATCACCGAGGTCGAAGTCCGCGAGCGCGCGCTCGCCTGGGTGCCCGAGGTCATGCGCTTCGCGAGCCCTCTGCCGGACGAGCCCGAACAGGCATCCTCGCCCGGGAACGAGGCCGATGGCGCGCCCGCCGACGGCTTGAGCGACGGGCAGGAGGACCCGCGCGAGCTGGCTGCCTGACCTCCTCCTGACAGGCACGCCGCTCGTTCCGGTGAAGCGGTCCCCTGCCTCCGGGCAGGGGGCCGCTTCCTTCCTGGGCAGAGAGCGGAGCGGGCTTTGCGGCTTGCGGCCTCGCGTGGCGATCACGCCCGCGGGCCGGATCAGGAGACGCATCATGGCCTTTCGCAAGACCGACCGCGCCGGGCAGTCGCCCGCTGCGCGCATCACTCAGGAAATCGTTGCGCGGCTCGAAGCCGGCACCAGACCGTGGATCAAGCCCTGGCGCGGCGTAGCCGTTTCGCGTCCATTGCGCGCCTGCGGCACGCCTTATCGCGGCATGAACGTGTTCTGGCTGTGGATGGTCGCCGACATGTGCGGCTACACCTCGCCGTTCTGGATGACCTACAACCAGGCGCAGAAGCTCGGGGGCCAGGTGCGCAAGGGCGCGAAGTCCACCATCGCAATCTTCTACAAGAGCTACACCAAGGAAGTTGAAGCGGGCGAGCCCGGGGAAACCGCCGAAGAATCCCGGCGCGTCCTCAAGGCCTATCCCGTGTTCAACGCCGACCAGGTCGACGGCCTTCCCGAACGATTCCATCCCGCCGCCGCGCTGGAGCTGGTCGAGCCTGCCGGGAGGGAGCAGGAGCTTGACGCTTTCTTTGCCCGCATTCCGGCCGTGCTTCGCCACCAGGGCGACGAGGCCTATTACGAGCCGATCGCGGACCGCATCACCATGCCGCCTGCGCAGCTCTTTGGCGGCTTCGACCACTATTATGCGACGCTCGCGCACGAGCTGTCGCACTGGACCGGCCATGCCAGCCGCCTCGACCGCGATCTCAAGAACCGGTTCGGCACCGCGGCCTACGCCGCCGAAGAGCTGGTCGCCTTATCTGGACAGTCTGCACCGCATGGCACTTTGCAGTAGGATGTGGTGATGGACGCCGACGGTAGGCAGTTATCAGGCTGGCCGTAGCGCTGACCCAAGAGCTGGCGGAGGCCAGCCTGATAACTGACGGGCCATAGGGCGCAACGGTGCGAGGTCAGCTTCCGGGTGGGGGGTCCGGGGGGAGGGTTTCGCGCTGATGTCGATCATCTCTGTCTGCGAGCGTCGCGAGGCCAAGGGTCTCGATGCGCATGTGCCGCTGATCTTGCGCGACGACGCGCTTTACGATCCCGATCTGGATCGCTTCTTTCTCGACCTGCCGCTGTCGGGCGTCCGCTCGCGGCACTCGCTTCGCGCCTGTGCCTACGATGTCGCGGTCTGGCTTCGCTTTCTCGATGCCTTTGGCAAGACCGTGTGGACTGCGACCCGCGACGATGTGGTCGCCTATCATCGTGAGCGACGCCGCGACGAGGCTGATAACCGGATCACGGCGGCAAGCTGGAACCGGGCCGTCGCCAGTCTCGATCGCCTCTACCGCTGGGGCGAGCAGCAAGGGCTGATCGCCGAGGCGCCGTTCAGCCGCCGTGCCGTGTGGCGACCGGCGCAGGGCGGCCGTCGCGGCATGATCGCGGCGCGCAACGACGCCTATGAACGTGTCGCGAGGCGGTCGGATGTTCGGTTCGTCCCGATGGACGACTACCGCATTTTCCGTGAGGTCGGCCTGCGCGGTCTCGCCCCTGACGGCACCGAGCGCCCTGGCGCGCGCGATCGGAACGGGCTGCGCAACGCGCTGTTCGCCGATCTTCTCGTCACCACCGGCCTGCGCCTCGAAGAGGCGTCGTGCCTGCTCGCTGCCGAGCTCACGGCGATCGACCGCGAAGACGGTGATGGACAACAACTTTGGCTGCCTCTCCCGCCGCCGCTCACCAAGGGCGACCGAGGACGCAGCGTTCTGGTCCCGCGCCGGCTGCTTCGTCAGATCGCCGCCTATGTCGCCGTCGAACGGGCCGCAGGCGTTGCCAAGTTTGCCGCGCGCGATGGCGCGGCCAGTTTTGAGCGACCGATCCCTGTCACTCGCGCCGGTCTCGACCGCATGCGGGATATCTGCACCCCGGAGGAACGATGCCGCCTGATCCTGTGTGATGAGGATGGAACGCTCCGGGAGCCGGCGGCGCTATGGCTGACCGAGGTCGGGCAACCTGTCCGTCCCAACTCGTGGGAGGTGATCTTCACTCGGGCCTGCAAGCGGTGCGCGGAGAACGGTTTTCCGCTGTCGATCAGCCCGCACCAGCTTCGCCACACCTTCGCAGTCCATATGCTCGCGTTGCTGATCCAGCAGCGACTACGCGAAGCCGCATTGCCGGCGGGGGCGGTAGAGAGCTACCGGCTGATCTTGGGCGACCCGCTGCAACAGGTGCAACGCCTGCTCGGCCACGCGAGCCTTACGACCACCTATATCTACCTCGACCATATCGCGACCCGCGCCGATACGGTCGACGCAGCCGTAGAGGAGCTACTCGCGCTGCTGCCGGGACCGCAGGGCGCATGAGCGGGCGTCCCCGCAGGGGCCGGCCTGTCGCTTTCGCTCCGATCACCCCGGAGGCTGTGCAGCCCGATCCGGTGCTCGGCCTCAAGTTCACCATCGAGGCGCGGCATGGCGGGACCGTCCTGGTCGATATGACCGGGCTCGAGCCTCGTCCGCTCGCCATTGCCTTTGCCGGCGCGCTCCATCGGTCAGCGGCACTCGGCGGATCAATCGGTGCGGCCAGCGTCATCAAGCAGTATCTCCAGGTCTATCGTCACTTCTTCGCCTGGCTTGCCGACGAAGCGCCGAAGGTGGCCGGCATCAGTGATCTCCACGCGGCCCATATCGATGGTTTTTCCTCCACGCTCGAGCGACTCGGGAAGAGCGCGATCCACCGGCACATAATCGTGGGCAAGGTGATCAACACATTGCGCATGATGGAGGCAGATCGGCCCGACCGGATCGCGCCCGACCTGCATGAGCGGCTTCGTTACACGTTGGCCACTTCGGCAGGTCGCTCGACCCCGCGCGATGCCTACAGTCCGTTCGTCGCTCGCGCACTGCGCGACGCCGCAAGGGCCGATATCGAAGCGATGTTCCGCCGTTTCGGCGCCGATGACGACCACACCGATGAGGGCGATCCGATCATCGCCAGGGCGCGCGCTGAAGTCGAAGCGGTCATCGCGCGGCAGGGCTTTATCGTCGCAGACCATCCCGCGCTGAAGAGCCTCTATTTCATGCGCGCACGGCGCGGCCTGCCGATCAGCACGCTGGTCGATGACCTGCATCGCCGCCATCATCTTCACGTCACCGATATCCCCCCATTGTTCGTGCTACTTTCGCTCGATACTGGCCTGGAGCCCGAGTGCTTGAAGGCGTTGACGGTTGATTGCCTGGCCAACCCATCCGCTGGCACTGTCGAGTTGCGTTACCTGAAGCGACGCGCCCGCGCTGCCGAGCACAAGAGTATGCGCGTGCGCGACGGCGGCATCGGCACGCCAGGTGGCCTTATCCGCCGGCTGATCGAGGTCACGGCAGTTGCGCGCCAACATCTGTCCGCTGATTGTCTCTGGGTCTACCACAACGCTGGCGGGCTGCACGCCGGTATTCGTCATCCGCGCGAACGTCTCGATGCGTGGGTGGCTCGGCACCGGATTGTCGATGACGACGGCAAGCCGCTCTACCTTCTGCTTTCCCGCCTTCGCAAGACCCACAAGGCGCTGTGGTACACCAAGACCGAAGGGCATATGGCTCGGTTCGCGGTCGGGCATAGCCGTGAGGTCGCCGCGCGCCATTACGCGGATCTACCCTCGCTCCGGCCCCTGCACGAGGCCGCGGTCGCCGATGCCTTCCGCGAGGCGGTCGCCGCTGCGATGCCGACCATACTCCCGCCCACTGCGGAGCAGGCACTGCGCGAAGGGCCCGAACAGGCCGCGCCGCTGATGCCGCCAGATACGGTCGATCCGCTGCTCGACGGCGAACAGGATGTCTGGCTCGCCGCTTGTGCGGGCTTCCATCGCAGTCCCTTCGCCGAGGTTGGTTCGCCCTGTTCTCAGCCTTTCTGGGGGTGTCTCCATTGCCCCAACGCTGTCATCACCGCGCGCAAGCTGCCCGCGATCCTGGCCTTCCTCGCATTCGTCGAAGATCAACGGCAGGGCCTGCCTGGTTCTGACTGGATGGCCAAGTTCGGGCAGGTTCATGCCCGGATAGTCAACCAGATTCTGCCAGCATTTTCCGATGCGGTCGTCGCCGAAGCGCGGCTGCGCGCAACCGACGAGCATCTCTATCTCCCGCCCGAGACGCGCGCATGACGGCCCCCGCCTATGTCCCGGCGCTCGCGTTCGATGATCGGCCCGTGTTGACGATTGTGCCGCTCAAGCCGGGCCATTCCCGTGAGGCACTGTCGCGTGTCGGTGACCCGAGCTGGGATCTTGGACCCGCTGTTTTCCGCGAGAACGCCCGGCGCTGCCATGTCACCGTGCATTTCGACGTGCTTGAACATGCCGATGTGCAGGCGACGATGCGCGCCTATCTCTATGCCCGTCTCAACGTCGATCTCCCCGGCTACCGTCCGAAGCTATTGCCCGCCTGTATCCGGCAAGCGTTCAATCGGGCGCGGCGCTTCTTCGGTTTTGCGCGCGGACGACTGGGTGTGCTCGACCTTAGCCGCATCGATCCCGCGCTGATCGATGCTTATGCCCGCCATCTGCGTGATGATCCCGCAAGGAGGCCCGTCATCGTTGGCCACCTTCTCGAGGTGATCATCGATCTATACCATTACCGCGACCATCTTCCCGGTGGCGGCCTGTCGTTTGAGCCTTGGGAAGGACGGGCTCCTGCGCGCGTCGCTGGCTATCGGCATGTCCGGGAGAACCGCACACCGCGTATGCCCGAGGGGATCATCACCCCGCTGCTGGCCTGGTCGCTGCGCTATGTCACCGTCTTCGCGCAGGATATTTTCGCGGCCCGGCGTGAGCTTGATCGCCTCGAAGCCCAACGTGACCGCTGGTTTGCTGCCGAACGGGGACTGGCACACGCCGAGCGTCGTCAACGGCAGCGGATGCGATTGCAGCGCTATTTCCGGTCGCGCGCGCGGCAAGGCCGAGGCGTACCGGTCTGGGTAAGCCCACCCAACGGATCTGCCTCCATCGATCCGGCCAGCGGGACGACCATGCCAGTCGTCAACGTGCAGCTTCTCCATCTCCACGTCGGGGTCGACGCCGCCGCAGAACCGGCGATGCACCTTACTCTCGCTGGCGGTGCTTCCGATCTCATTGAAGCTGCCATCGCGGATATGGGCGTCGAACCTGGCGGTATGGATACGCCGATCTCGATTGATCCCGATCGTGGTTTACCATGGCGGCCCCGTTTCGATGCCAAGTCCCTTATCCTCGAAGAGCGCATGCTGCAGGCGGCAGCCTATGTGGTCTGCGCTTATCTGACCGGGATGCGCGATTGCGAGGTGCAGGCGATGCGTCGGGGATGCCTGACCCTCAAGCGGAGTGAGGATGGCGTCGTGTCACGGCACCGCGTGCGTTCCACCGCCTACAAGGGCAAGGGTAGCGGGGGAGCGCCGGCCGAGTGGGTGACCATTGCTCCTGTCGCTGATGCGATCAGTGTGCTCGAGCTATTGTGCAAGCGTGCGGCCGACGCTCGCGGCCTTGATACACTCTGGCCCGTGCTGTCACTGACGCGGAGCCGCAAGTCGCACGTTTCCGCGGAGATCGTCCGTCAGATCAACGCCTACCGTGACCATCTCAACACGCTGTTCGGTTCGGACGAGGCGCCCGCTATTCCGCCAGGACCGGACGGCAAGCCCTGGCGGATTACGACCCGCCAGTTTCGCCGCACCATTGCATGGCATATCGCCAACCGCCCCTTCGGAACGGTGGCTGGCATGATCCAGTACAAGCATGCCTCCGTTGCCGCCTTCGAGGGCTACGCAGGAAGCAGCGCCTCCGGGTTCCGCGCCGAGGTGGAGACGCAGCGCGGACTCGGCCAGCTCGACGACTTGCTCGATTATTTCGACAGGCGGCGAGGTGGGGCCTCCCTTTCCGGACCAGCGGGCCGGCGCATCGAGCGCGTCCTCGATGAGACGGCCACGCAACTCAGCCCATTGCCAGCGATGATCGCCGACCGGTCCCGTCTGCGCACCATGCTCGCCAGCCTTGCCCGCACCCTGCATGTCGGCACGCTGGCCGATTGCTTCTTCGATCCGGCAACAGCCTTGTGCCTCAAGCGTGCGACAGATTCCTCGGCGAACCGGCCGTTGACAGCACTTTGCGAACCGACCCGCTGTCCGAACGCCTGCATCGCCGAACGTCACCGGCCCATCTGGGAGCGGAGCGCCACCGAGGCGCGGATGTTGCTGCGCGAGAAGCGCCTGCCTGGATTGCAACGAGCAGCGCTCCAGCACGAGGTTGACCGGATCGAGGGCGTGCTGGCCCAGATCGCGCCTGAAGGCGCGACGCCGCCCTTTCGGGCGGCGGTAGAGGGAGAGGAGGGCCTGGAGATGGGGTGA
Protein sequences of DBSCAN-SWA_5 >NC_020561|3093961:3134667|3100212_3101568_-|WP_009823940.1|integrase|DBSCAN-SWA MSAWHDPDRTVVDAFLVKSQFRPGSVPTYRWFLCTFEDVARRHPAVDRQMLDAWLKEMQKRWRLSTLLNQVCIVDRFLDHLVEIGLIADNPVAALRRRYNVKQSKPIWRALASPNPDESLAALRRPAPFGSVLGDFMQDHVMLMRSRGYQYEAQAHWLLRFDRFLQARPDLAEQPLEAMIASWAAAKPTRNHAAECQKLARILTKARFRLDPTIPPKRFNPRPEREVAREHRQPHIFSPADVRRMLDTARTYPSPDAPLRPLTLYTMIMLAYCAGLRRSELAWLDLGDVDLQSSTITIRETKFYKTRILPLSDSVAVELRAYIDARRRAGGPQNPKSGLFWHAHLNDRYRPEAVTTMITNVMRRAGLKPASGRTGPRVHDLRHSMVVNRILQWYRSGINPQEKLHFLSTYMGHRDLHSTLVYITVTQDLLQEASERFRALGAPCLVTEARP >NC_020561|3093961:3134667|3125068_3126142_+|WP_015459658.1|DBSCAN-SWA MADQTVAAKPALTTFERYLSLWVALCIGVGVALGAALPGVFAAVAAAEVARVNLPVALLVWLMIIPMLLKIDFGALGKVRQHMRGVGVTLFINWAVKPFSMALLGTVFIGWLFRPLLPADQIESYIAGLILLAAAPCTAMVFVWSNLCEGEPHYTLSQVALNDLIMVFAFAPLVGLLLGVASITVPWDTLLLSVLLYIVVPVIVAQLIRRQRLRAGGQAALDALLRRLSPVSLVALLTTLVLLFGFQGEAILAQPLVIALLAVPILIQVYFNAGLAYWLSRRFGVAWCVAAPAALIGASNFFELAVAAAISLFGLQSGAALATVVGVLVEVPVMLSVVAVVKRTRGWYERQAPISST >NC_020561|3093961:3134667|3119987_3120275_-|WP_144062063.1|DBSCAN-SWA MKSIISNDFTKGWTMKTFLTAVAVAIALPAIAHAQEAPEAKPMACCEKMKSEGKECCCKDMEQMDHSSHGGAAGAGAHAEHSGSAGTPQPAHQHH >NC_020561|3093961:3134667|3116135_3116624_-|WP_015459648.1|DBSCAN-SWA MIDHHPQHGASSGAKPYWMLALNLAISTLIMYLVMFEMIRGWGEFIHNINFFYMALTMAMPMGVLMLLMMGSMYPNKRLNLVLYAALAAVFVLAFAAVRTQALVGDKQFVRSMIPHHSGAILMCREAPLDDPEIRELCFGPDGIIASQEREIAQMKRIMERL >NC_020561|3093961:3134667|3128413_3129163_+|WP_015459660.1|DBSCAN-SWA MAFRKTDRAGQSPAARITQEIVARLEAGTRPWIKPWRGVAVSRPLRACGTPYRGMNVFWLWMVADMCGYTSPFWMTYNQAQKLGGQVRKGAKSTIAIFYKSYTKEVEAGEPGETAEESRRVLKAYPVFNADQVDGLPERFHPAAALELVEPAGREQELDAFFARIPAVLRHQGDEAYYEPIADRITMPPAQLFGGFDHYYATLAHELSHWTGHASRLDRDLKNRFGTAAYAAEELVALSGQSAPHGTLQ >NC_020561|3093961:3134667|3105588_3106287_-|WP_144062062.1|DBSCAN-SWA MNGKELIDQVRRSIAVAGSPAPTDRAVAEYLGISITGLANWRSREMVTTRQMVGLLTAVVKASKRRTESEAIKPIVEFFQIQRTTVGNGSNCRIFDPGKGENEHPYLTGLKRELERHRGVYIFYDSRGRALYTGKTKAQSLWKEINLAYNRDRDPNLQRILRVQHPERRQDFRTSDEMRRQIRPINVRLHELAEYVSAYQVSEGMIGDIESLLIRAFPNDLLNKKIENLTWG >NC_020561|3093961:3134667|3113235_3114159_-|WP_015459644.1|DBSCAN-SWA MIEPAIIGLRFVQYAGATILFGLSLFLLHALPAAGAASPSRLPWARWVLAGGAALLALGSALAIAAQASLFEGSLARGFTLEAMGSVVSYMTLGQAAAARAVLALAALLVVILASPGRATLVATATLGAGATASLAWLGHAAATEGTLGWIHLGSDILHLLAATVWLGALAGFALLLGSPDAPGKREALHQGLHGFSGLGSVLVAVLLLTGVVNSWVLVGPDRIGEIAATGYGRLLLLKIALFLLMVCLAAANRFRHTPRLGQVLAGGDTAGAVSALRLSVGLEAAAGAAILAVVAWLGTLAPPAAG >NC_020561|3093961:3134667|3132453_3134667_+|WP_015449375.1|integrase|DBSCAN-SWA MTAPAYVPALAFDDRPVLTIVPLKPGHSREALSRVGDPSWDLGPAVFRENARRCHVTVHFDVLEHADVQATMRAYLYARLNVDLPGYRPKLLPACIRQAFNRARRFFGFARGRLGVLDLSRIDPALIDAYARHLRDDPARRPVIVGHLLEVIIDLYHYRDHLPGGGLSFEPWEGRAPARVAGYRHVRENRTPRMPEGIITPLLAWSLRYVTVFAQDIFAARRELDRLEAQRDRWFAAERGLAHAERRQRQRMRLQRYFRSRARQGRGVPVWVSPPNGSASIDPASGTTMPVVNVQLLHLHVGVDAAAEPAMHLTLAGGASDLIEAAIADMGVEPGGMDTPISIDPDRGLPWRPRFDAKSLILEERMLQAAAYVVCAYLTGMRDCEVQAMRRGCLTLKRSEDGVVSRHRVRSTAYKGKGSGGAPAEWVTIAPVADAISVLELLCKRAADARGLDTLWPVLSLTRSRKSHVSAEIVRQINAYRDHLNTLFGSDEAPAIPPGPDGKPWRITTRQFRRTIAWHIANRPFGTVAGMIQYKHASVAAFEGYAGSSASGFRAEVETQRGLGQLDDLLDYFDRRRGGASLSGPAGRRIERVLDETATQLSPLPAMIADRSRLRTMLASLARTLHVGTLADCFFDPATALCLKRATDSSANRPLTALCEPTRCPNACIAERHRPIWERSATEARMLLREKRLPGLQRAALQHEVDRIEGVLAQIAPEGATPPFRAAVEGEEGLEMG >NC_020561|3093961:3134667|3121203_3123576_-|WP_015459654.1|DBSCAN-SWA MSQCQHHDHSHHHAGEAAAVMRDPVCGMTVDPQTARYRHELGGHAYFFCSERCLDKFKADPDRYLNPPEHDPAVTNPALGALPQAAEGSIWTCPMHPEVRRDGPGTCPICGMALEPLEPVADEGPNPELIDMTRRFWGSALLAVPLVVLTLGAEVFGLELLPMRASMWVQLALATPVVLWGGWPFFERFWASLRSRHLNMFTLIGLGVGVAYLYSVVAAVAPGIFPESLRTMGGLVPVYFEAAAVIVTLVLLGQVLELRARSATGRAIHALLGLAPKTARRVAADGAEEDVPLEHVQVGDLLRIRPGEKVPVDGVVVEGRSSVDESMISGEPVPVEKTPGDRITGATVNGTGGLMMRAERVGRDTMLSQIVRMVADAQRSRAPIQALADKVSGWFVPTVVAVAVLAFIVWALVGPEPRLSHALVNAVAVLIIACPCALGLATPMSIMVGTGRGAGAGVLVKNAEALELMEKVDTLVVDKTGTLTVGKPKLVEVVTASGFDQAEVLGLAAALEKGSEHPLAAAIVSGAEERGLELAQAAEFQSHTGKGVTGRIGGRDVALGNRALLEMVGVDGSALEKAADQFRAEGQGVMFVAVDGQLAGLLVVADPIKDTAVEAVQALRRDGVRLVMMTGDNRRTADAVARRIGGIDEVIAEVLPEQKQAMVEKLRGEGRRVAMAGDGINDAPALAAADVGIAMGTGTDVAMESAAVTLVKGDLGGIVRARRLSRAVMRNIRENLFFSFVFNAAGVPIAAGILYPWFGLLLSPIIAGAAMAFSSVAVIGNSLRLRSVRL >NC_020561|3093961:3134667|3101564_3102797_-|WP_015449381.1|integrase|DBSCAN-SWA MLKLHDELITELSNSLTTQNYNPVVVANHRLYARAFLDYLAECDIQVETVTPQQVDQYFGYAVQDFEIQYGRPPSARWHMLPRTAIAKLLRLAQGNWPPDAEMIGPDDEHRHEICREYEAWLREERGLASASIAALMWEARNFLRWQFDRAGAASLETLSIVDIDLYMDMRAPGLRRKSLADVAERLRSVVRHLHRTGCIPTDLTPHIIGPMLYAYEDVPSTLERSQIAAVLATTQEDRSPRGLRDYAILQLLATYGLREGEICRLRLDDVDWRAESLRICHTKTNAYSYMPLMVTVGEALLDYLRLGRPQVEVREIFVRSCAPYIAMTNLYGMIRGRLAAAGVVPAGKRGPHVFRHARAVEMLRASVPQKIIGDVLGHRSTESTNTYLKLATDDLRAVALEVPGMEVLS >NC_020561|3093961:3134667|3099211_3100216_-|WP_009823939.1|integrase|DBSCAN-SWA MRKGNPLPALLRAFFQEWLAEQRSASIHTIRSYRDTWRLLLRFVAERKGCGVARLTLTDVSAGEVRAFLHHTEHGRKTTIGTRNCRLAAIRSFFSFVADKNPEYIAQCSEVLAVPLKREPTSAPCYLEPEEVEAILAQPNRSTLEGLRDHVLLSFLYNSGARIQEALDLCPEAIRFDAPNFVRLYGKGRKERICPLWPETVALLRKLLERQPRAPDERIFVNRYGEPLGASGVRFKLNAYVEQAAKSTLTLQSKHVTPHSFRHATAVHLVAAGVDITVIRSWLGHVSLDTTNHYAQANLETKRKALEQVGAPAASNVPPSWKRDANLMGWLDTL >NC_020561|3093961:3134667|3105090_3105573_-|WP_144062160.1|DBSCAN-SWA MEFTRPALEAGGFAGWVPFGEVRASACPSSGGVYVITYGGSKPLTFAEQSCGGWFKGKDPTVSHEALAANWVDDAEVVYIGKADQLNRRLTQFADFGAGKPVGHWGGRLIWQLPRVDQLLVAWKQTPGQVPVEVEAELIASFRQAYGKPPFANDPHRLGT >NC_020561|3093961:3134667|3123851_3124205_+|WP_015459655.1|DBSCAN-SWA MDAQSAVSALGALAQEHRLALFRLLVQAGEDGMPAGAIADALGVPNSSLSFHLAQLSKAGLVLQERRHRSIIYRADYGAMNDLLDYLMENCCAGADCGSTPGCAVENPETQPERKSA >NC_020561|3093961:3134667|3096953_3099167_+|WP_015459635.1|DBSCAN-SWA MSLASPAFAFPCLSEPLVLTAARDIAELIEAGQALSRAGLNAILNRLFGGSDAEGRWSVRDAHAAIELAQVLWLRAHAGLTPASPADSAHEAFTLIETLLPSQTVRSEEQIELQQFSTPPRIAWLLARACALRAGEAVLEPSAGTGMLATWAAKANARLILNEISPLRRDCLACLFPQATVSGHDGELIDELLDPRQVPTAILVNPPFSHGIERGHDGHTGSRHLRSAWKRLAPGGRLAAVMPEWFDVGGFLCAVREPVTVQLNVAVERGFSRNGTSITTRLLVLDKIEAGADPAVARTNDFAALCELIDAIPARPFATAPQSSASLVLAPFRLASLAARPRPVPPRHAAPPAPSEPCAYEALEAPAPIADQVGHYLPYRPSRIAIAGATEHPTPLVESVAMGSITAPVPTEVPLLPAGVIARGLLSAAQAETLIYAVSAHGRDLPGRFVPQDKGCSLATSAEGAVYRMGYFLGDGTGAGKGRQVASVILDRWLRGERRHIWISKNEALLEDARRDWTALGGLPIDIQPLRQWKLGLPVTMGDGILFVTYPTLRSGRSDATRLDQLLEWAGDDFEGVIVFDEAHAMANAAGGEGSRGKVKGSEQGVAGVRLQNLLPRARVLYASATGASDVNNLAYATRLGLWGPETAFANREAFVSDIRDGGIAAMELVARDLKSLGLYTARALSFAGVEYEILEHQLTPPQIAVYDAYAEAWAILCAARHKIAYREEAVMRRNAA >NC_020561|3093961:3134667|3093961_3095158_+|WP_015459632.1|integrase|DBSCAN-SWA MGKLTANEVKAALGKPGSYQDGDGLFLKVDQRGGAAWRLRIQHEGKRRDIGLGSAKLVTLAAARAKAAEARKAIREDKRDLIAEKREAKAAAVTFREAALALHEAHKHQWGNAKHGEQWLATLESYAFPSLGRKQVGSVAAGDIIAAIAGVWTSKPETGRRVRQRICAVLDYAHARGWRASEAPVRALMAGKGLPKQAGGKHHPAMPYQDVPAFLTRLRATGGVWGRLALEFVIFTAARSQEVRLATWDEFDIQNGLWTVPAEHMKMRREHIVPLSAQALAVLRSAQAVRVSGTTLVFPGANGTTMSDMTLLAVLRRMKEPTTVHGFRSSFRTWVAEKTNFPGEVAEAALAHQNRNEVERAYQRGALLEKRRKLMEAWGAFCEAGSEKVLPFRQAAQA >NC_020561|3093961:3134667|3096074_3096881_+|WP_144062158.1|DBSCAN-SWA MCRCPAHDDRTPSLSVSLGRSAILFHCFAGCSNGEVIAALGLRGVRARELFDGSGEPLSEAPRKETADRNACRLWRGGDVLRGSPAEVYLLKRRLTQFSSDLRFHSRTPLGPRGSVRFLPAMLAAVRTDLGVIAVHRTFLDPLTGRLAGFERPKRALGSLQNGAVRLAAPRRGRLGLAEGIETALSAMQLFGVPCWATLGNERFGLVTIPESVRELHLFVDNDPGGDLAEERAREAYACDGRRIVATRPTNLNEDWNDVLMRMALAAS >NC_020561|3093961:3134667|3095186_3095861_-|WP_015459633.1|DBSCAN-SWA MSLRVVSIALGLVAAAPVPAHGQASAPAVRVDAAIDERNQGYRLAQTADGFQLVEHGVWEEPSAVAVQESGQTPVTVGSKGTDRLASGKTTGLSGPNGFRRAAYLPHVYAAEAKYGLPAGLLDALIWTESRYNPFAISRVGAAGLGQLMPGTAKDLGVSNRFDPLANLAGAARYLRQMLDRFGTVHMALAAYNAGPGAVERAGGIPLNGETPAYVRDVLRRWRS >NC_020561|3093961:3134667|3110917_3112459_-|WP_015459642.1|DBSCAN-SWA MRRVVSLFLPHWPTDRLRRKRGKPPPDDAPSLATVLPDHGRRVIASPDARALAMGVVPGMTVTKARALVPDLEVVDADLDGDREALRRLALWAARRYSPQVASDPPDGLWIDITGCEDLFGGEIALAKDLYRRVRASGLAAHVAVADTAGCAHAVARHVRGARPIHVEPRRTVQALELLPVSALRIEPGVAAELVRMGFSRIEQLIAAPRAPLAKRFGRQLYRRLDQALGTLPEPIDPVVASEVPRATRRLLEPIGTPEAFEQVIGDLVRDLAAQLLAKGRGARRLDLLFERVDGHVQAVRVGTAKPTRDEPHLARLLNARIGEIEPGLGIEGMTLACPLSEPLAPSQADGLASPAARGPDLPALVDALANRFGQRCLYRASPRESAMPEREVGSIPALAPPAAIVWEDDLPRPARMLDPPERIEVMALLPDHPPRLFVWRGQRHRVVRADGPERLHGEWWRDGGTEADAPYTVRDYFQVETLGGGRYWLFRLGDGEKPATGPMQWFIHGAFA >NC_020561|3093961:3134667|3106968_3107589_+|WP_015459640.1|DBSCAN-SWA MCNDYKLEVDIASIAEDFDNLSIKIRMPEGAPNVPAREDIKISDTAPIVRGVEGERGVGELVNRRWSWPGPGGKPVYNFRSEGREFTSNRCLILADGFYEFTKPDDPKQKRQNKWLFTLRDHPWFCIAGIWRPHAEVGEVFTMLTTDPGEDVAPYHSRQIIPLPRERWADWLDPSVPAEEVLQVLPKGSLAVRRVYPPEAAQAALL >NC_020561|3093961:3134667|3130666_3132457_+|WP_037465845.1|DBSCAN-SWA MSGRPRRGRPVAFAPITPEAVQPDPVLGLKFTIEARHGGTVLVDMTGLEPRPLAIAFAGALHRSAALGGSIGAASVIKQYLQVYRHFFAWLADEAPKVAGISDLHAAHIDGFSSTLERLGKSAIHRHIIVGKVINTLRMMEADRPDRIAPDLHERLRYTLATSAGRSTPRDAYSPFVARALRDAARADIEAMFRRFGADDDHTDEGDPIIARARAEVEAVIARQGFIVADHPALKSLYFMRARRGLPISTLVDDLHRRHHLHVTDIPPLFVLLSLDTGLEPECLKALTVDCLANPSAGTVELRYLKRRARAAEHKSMRVRDGGIGTPGGLIRRLIEVTAVARQHLSADCLWVYHNAGGLHAGIRHPRERLDAWVARHRIVDDDGKPLYLLLSRLRKTHKALWYTKTEGHMARFAVGHSREVAARHYADLPSLRPLHEAAVADAFREAVAAAMPTILPPTAEQALREGPEQAAPLMPPDTVDPLLDGEQDVWLAACAGFHRSPFAEVGSPCSQPFWGCLHCPNAVITARKLPAILAFLAFVEDQRQGLPGSDWMAKFGQVHARIVNQILPAFSDAVVAEARLRATDEHLYLPPETRA >NC_020561|3093961:3134667|3116658_3117834_-|WP_015459649.1|DBSCAN-SWA MSRRLFALALGASGLALGTPALAQHEGHAAPAQPGAPAEPDPHAGHVMTPAPQQEPPPSADDPHADHDTGAPPAETAEEDDPHAGHAMPAPAASPEPQAADPHAGHSMAAPAEEPAQPADPHAGHAMPAAESPGAAAGPSGTDLPAGSAPPPPIPAEAAADAVYGSDAMEMGRHHLQQFHGGQKLYQVMFNVAEAQLREGRDGFEWDAAAWYGGDINRLWLKSEGDGTFGGPVEQAEVQALYSRAIGPYFNVQGGLRYDFEPDPSRAYAVVGVEGLAPSFFDVEGTLFLSHKGELMARAEASYDQRITQRLILQPNVEVNFSAQDVPELRLGSGLTEAEAALRLRYDIRREFAPYIGVQYRRAFGETRRLLRAAGEETGGWSVLTGIRWWF >NC_020561|3093961:3134667|3126258_3128259_+|WP_015459659.1|DBSCAN-SWA MIKTIPLNKLVRSPRNVRRHADCAADAELKASIAAHGLLQNLVVRPAAKGRFEVEAGERRRRAMLALVDDKVLPRGHEVTCLVLENDDSAVETSLAENFHRLAMNPADEAQAFASLVESGVSVEDVARRFGLTVRFVEGRLRLAQLAPVVFEALAAGEITLDIAKAFGATSDQEIQARVFEQASSGYYAPSPDSIRRMVLSGTVRGSDPRARLVGRDAYVAAGGRIERELFDDDDSESWVDVALLESLAQAQMEEQAKAIAAEQGLAWVRPTLDSYASHDLVDELVRLPAEPAPLTEAELARLEALDASWDEHAAIVEDEDSAEEAVAAAEAAIEAIERECQELRNRPPVLAPELKAEAGMILTLSRDGTPVLQPVFYGERHVAAGAEDEGIEIVPADAGEGKRRSALSKRLVDELAMQRRDILSLHIASDPGLALDLLVFSLADADTHDWRSRASTTLRGGVPAGPIVGFEAKDAPASAALAELKAGLDESWRAGKDVSARFDHFRALSDESRAAWLGFVVARTLEASLNMAGERRIAFQDHLGCLIGIDTAQWWRPTAANYFDRVSKQVILDALADVGGTELSSRFASVKKADLAMSAERVFAGTYITEVEVRERALAWVPEVMRFASPLPDEPEQASSPGNEADGAPADGLSDGQEDPRELAA >NC_020561|3093961:3134667|3117830_3119912_-|WP_015459650.1|DBSCAN-SWA MFESLTLERRAFLRAAAICGGGAGLASAMPSWAQPVSAGIARPLPVLSGTDIALSIGHAAVTVDGKTSPAVAINGTVPGPLLRLREGQNVRLRVTNALPAGHHDDAQTSIHWHGLLVPFAMDGVPGVSFPGINPGETFVYEFPVRQNGTYWYHSHSGYQEQDGVYGPIVIDPAGADPVAYDREHVIVLADHSRMTGAMIYRKLKQMGGGYFNTQRLTLSGLLAGRDLPASERQQWAAMRMDPTDISDVTGSTYHFTVNGYGPFDNWTALFTPGERVRLRIINASAQTNFNVRIPDLAMTVVQADGQNVRPVVVDEFQIGVAETYDVIVTPADRAYTFVSEAIDRSGLGRATLAPREGMSAPVPPLRPRPLLTMKDMGMGGMDHTGMDAPNAPNPAAVRGVDPSAEQNASRDLWKATGWTGPATTAVAAAVAADHAAMRHPGTPATGVDHAAMGHGASPPAAADHAAMGQGAAAPTAADHAAMGHEAAPAATTGEMGGMDHGAAGMEHNMRDFSNAPQVKPGPGVQSISPMPSDRTGEPPVGLEGLGHRVLVYRDLMALERNPDVRAPSRALEIHLTGNMERYMWSFDGVKLSEPAEPIPFRLNERVRVTLVNDTMMPHPIHLHGHFFELVTGHGAHGPRKHTVNVPPGGKMTFDLTADAVGDWAFHCHNLYHMTAGMMRVVTVRDEAGQGGGQ >NC_020561|3093961:3134667|3120271_3120625_-|WP_015459652.1|DBSCAN-SWA MSANLPRMTLGRFLAALAAIALLWAPLALQGGAAMATAPADHHAQMMQGGDCHEKQSDQKGHSSGKVCCAAMCTAIVVPPVVTLEPAPLAGSTSGTALAESGPSYLAELPTPPPRLA >NC_020561|3093961:3134667|3107591_3110921_-|WP_015459641.1|DBSCAN-SWA MTGAYPLDGARYVELQVTTHFSFLRGASSPEELFAAAALLGLPALGVTDRNTLGGLVRAMDAEKVTGVRSIVGCRLDLVCRTSLLVYPTDRDAYGRLCRLLSIGKGRAGKGGCDLDWSDLEDWAAGLIAILVPDRADAANEAALARCRRLFGPRAYMALTLRRRPRDAIRLRDLAAQAAAAGVPTVATGDVLYHSPERRMLQDVVSCIREKCTIDRLGRRRERFADRHLKSAEEMERLFQRYLGDVGPVHRSVEIARACQFALSDLEYTYPDEVGPSGLSPQQELERLTWEKAPERYPGGLPDKVRVQLEHELKLIAELRYAPYFLTVHSIVAFARSKNILCQGRGSAANSAVCFVLGITSIDPVKSELLFERFVSAERREPPDIDVDFEHERREEVIQWIYETYGRERAALTAVVSRFRARGAVREVGKALGLSEDVTAGLASQVWGWSREGVDERHAEELNLDLADRRLLLTLELARELIDTPRHTSQHPGGFVLTLGRLDELVPIEPAAMPDRQIIEWDKDDIDALGFMKVDVLGLGMLGCMRRAFAMLEEDKGLELDLATIPPEDSATYAMIRKADTLGVFQIESRAQMAMLPRIKPRTFYDLVVQVAIVRPGPIQGDMVHPYLRRREGREPVTYPKEELRRVLEKTLGVPLFQEQAMRVAIECAGFTPGEADLLRRAMATFKLTGGISHFRDKLIDGMIARGYDRAFAEQTFKQIEGFGSYGFPESHAASFALIAYASSWMKCHHPDVFCAALLNAQPMGFYAPAQIVRDARAHGVEVRPVDVNASRWDCTLEPSKGRFMAVRLGLSQVRGLRNEDGAAIVLARGDRPYASVEEIQRRAGVGAGALDRIGQANGFGSIAPSRRAGLWEVKGLGNAPLPLFAAADERERRLRPEATEPAVPLAAMTEGQEVVEDYRAIGLSLRAHPLAFLRDELARRAMTPCSALRDMKDGRMVNIAGLVLVRQKPGSAKGVMFITLEDETDVVNLVVWPDLFEKHRRVVLGSSMMGVRGRVQREGDVIHVVAQRLEDMSGLLAGVGEREDVATVYRVSRADVAKHGMGPDPRDPAGRSLGRSPRDIYIPDLRLGSGIVPDQPTEGIKIRSRNFH >NC_020561|3093961:3134667|3114162_3114531_-|WP_041864946.1|DBSCAN-SWA MSRIVSSLIALGGAAALLSAAPAFAHAKLESSSPAANATLRTAPRTITLTFNERLVPRFSKVELTMPAHRMKVPVNVTVLRDGKRVVATPASALRRGDYRVVWTAASADGHKMSGTFNFKVA >NC_020561|3093961:3134667|3112379_3113111_-|WP_015459643.1|DBSCAN-SWA MRQPAAIAELRAKIARIESVGARHETLPFGVEAIDGHLPAGGIATGALHEIAGSPDLSDDAAATVFLAGILARTEGTVLWCLRWRDLFAPALHLAGLHPDRVIFVEAGSDTNVLIAMEECLRHAGLGGVVGETGKLSLTASRRLQLAAEQSGVPAFVFRRASPVDAAAAGSAATTRWRVRAAPSEEMGLPSLGRPRWSVTLERARGGDPKHWIVEACDAQGRIALPAALADGPAAAEARQAAA >NC_020561|3093961:3134667|3106387_3106813_-|WP_051128846.1|DBSCAN-SWA MIEAVWGSNPRFSDGVSFRFVRSEGKTFPSHRCLIPASEFRMSVGDKTYRVTLDGGNFFYLAGIWEPAMGDWPVCYRIVTVAANPEVARYQERHGAIIHRRQVMQWLNHTAAETDLLETPPARTFIVELLAQSSREPLLAL >NC_020561|3093961:3134667|3120736_3121138_+|WP_015459653.1|DBSCAN-SWA MTAMTISQLARSGDVGVETVRYYQRRGLLADPRPQKSRTSGTRHYGEDEARRLRFIRSAQNAGFTLEEIRELLDLDSNGDRARAREMATARIEALDARIAELQRARQALASLARDCAAAKKGPCPIIASFEAC >NC_020561|3093961:3134667|3124201_3124672_+|WP_015459656.1|DBSCAN-SWA MKRLHVHVGVEDLDRSMSFYSTLFGAQPVVVKSDYAKWMLDDPRVNFAISSGQHAAKGIEHLGIQVESNDELAEVYGRLKAADGPVLEEGATTCCYAKAEKSWIADPDGVVWEAFFTNGEATVYGDSPALGALSGNAAENACCAPAMPAPQVACCK >NC_020561|3093961:3134667|3114685_3115480_-|WP_015459646.1|DBSCAN-SWA MRRIISTAIALGAALAAAGCGSNEEAEPAATEQADAADAVDPNNPFAAAEKKMNDEMMAAVGTDVGDSWVRKMIVHHQGAIDMSRIGLQQNVSDHVRQMAQESIDKQSAEVEALRKLVKEGSPNPQSGQPFQAAMTEMQQEMMAASGADISETFMRKMLAHHEGGVALSDAALASGVTGAVREQVNKTRAGQQEEAEMTRAMLNGEPMPSATRASAAEPAAPAPAATSRPAPARSTPSPRPTQTPTPEPAAEADEHAGHNMSGQ >NC_020561|3093961:3134667|3103170_3105021_+|WP_144062159.1|DBSCAN-SWA MLDRRLAELSDEEREALEIDLSPREYVIDYLAKSFPVRLMAVFTDENGTARSEPMVDDEGRPVLCRSALAARDRMIEQLCALPPIATALDAIIERFGVDQVAEVTGRRRRLIVGPDGRQKLQARSPRANVAETQAFMDGVKRILVFSDAGGTGRSYHADLAAKNQARRVHFLLEPGWRADAAIQGLGRTNRTNQASAPLFRPVTTDVRGERRFISTIARRLDSLGALTRGQRQTGGQNLFDPADNLESVYAKEALNRWFGLLFTGKLEAIGLARFQELTGLRIEGPDGGMVDDLPTIQRWLNRILALSIALQNAIFDEFIALVEARVDAARQAGTLDLGVETIAVESFEILSDTLLRTDALSGATTHLLELEIARALKPLTLARLHEQYGIDDPQHRPLRNSRSGRVGLLVPARSMLTDEGARIAQFELHRPLKRGYLTADQLDESSWEPVDQNEFRRLWQAEVDEAASSRKHERLHLATGLLLPVWDKLPTDHVRVSRICAGDGRSLLGREVPINCLSELCRALGMDQGPTISADELVASVLATGRPLEIGGREPLTLKRSLVNGRQRLELTGWSAARLDWYKAQGCFTEIIRYQTRLFVPLGESASIITSLRSA >NC_020561|3093961:3134667|3115579_3116113_-|WP_041865451.1|DBSCAN-SWA MDRTTLRVTRRSIVGGMAALAGLVATGCGRAGTPASQQAAEAPEGAPSPGRTLASEMTVYRDPSCGCCKAWADLASKAGYQVSVVDRPDIDAVKKQYGVPDSLRSCHTAIVGGYAIEGHVPFEHVAKLLETRPAGIKGLAVPGMPRGSPGMEMADGSKDPFEVMAFDGAGRAEPFVA >NC_020561|3093961:3134667|3129320_3130670_+|WP_015449377.1|integrase|DBSCAN-SWA MSIISVCERREAKGLDAHVPLILRDDALYDPDLDRFFLDLPLSGVRSRHSLRACAYDVAVWLRFLDAFGKTVWTATRDDVVAYHRERRRDEADNRITAASWNRAVASLDRLYRWGEQQGLIAEAPFSRRAVWRPAQGGRRGMIAARNDAYERVARRSDVRFVPMDDYRIFREVGLRGLAPDGTERPGARDRNGLRNALFADLLVTTGLRLEEASCLLAAELTAIDREDGDGQQLWLPLPPPLTKGDRGRSVLVPRRLLRQIAAYVAVERAAGVAKFAARDGAASFERPIPVTRAGLDRMRDICTPEERCRLILCDEDGTLREPAALWLTEVGQPVRPNSWEVIFTRACKRCAENGFPLSISPHQLRHTFAVHMLALLIQQRLREAALPAGAVESYRLILGDPLQQVQRLLGHASLTTTYIYLDHIATRADTVDAAVEELLALLPGPQGA >NC_020561|3093961:3134667|3124668_3125076_+|WP_015459657.1|DBSCAN-SWA MSVTIYHNPACGTSRNTLALIRATGSQPEVVHYLETPPSREELVSLIEGMGIGPRDLLRQKGTPYAELGLDDPALTDDQLVDAMIAHPVLINRPIVVGPKGVKLCRPSEEVLSILDRPLEADFVKEDGEVVPANG |
35 | Ralstonia_phage(12.5%) | integrase | attL 3084562:3084577|attR 3140470:3140485 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
3621454 : 3628698
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_020561|3621454:3628698|DBSCAN-SWA CATGCGGCTTTTCGGACGGAAGGGCGGCGGCGGGGCCGTGCGCCCCGCGCTGGCGCGATCGGGCGGCTGGACGATGGCGGCATGGCCGCAGGAAGCCGGCTGGCCGCGCGGATATGACGCGCAGGCGCGCGATGCGGTGATCGCCAACCCGGTGGCGCAGCGGGCGATGCGGCTGGTGGGCGACAGCGTGCGATCGTTGAGCCTGGTCGTGACGGGCGGGCCGGAGAAGGCGGCCGCGCTGGCGGGGGCGGCGATGGAGCCGCTGGCGGTACACCTGCTGCTGCACGGCGACGGATATGCCGGCATCGGCTGCGATGCGGCGGACGAGCCCATGCGCCTGCACGTGCTGCGGCCCGAGCGGATGACGGCGGAAACCGACGCGCGGGGCTGGCCGGTGGCCTATCGCTATCAGGTGGGGCAGGCGGTGGAGCGGCTGCCCGTCACCGATCCGCTGGGCCGGCCGGGCATCATCCACCTGCGCGCCTTCCACCCGCTGGACGACCAATATGGGCTGGGCTGCCTGGAGGCGGCGGCGGGCGCGGTGGCGATCCACAATGCGGCGACGCGGTGGAACAAGGCGCTGCTGGACAATGCCGCCCGCCCATCCGGCGCGCTGGTATACGACCCCGGCGAGAAGGGCGCGGAGCTGAGCCCCGAGCAGTTCGACCGGCTGAAGGCGGAGATGGAGGCGGGCTTTGCCGGCGGCACCAACGCCGGCCGGCCGATGCTGCTGGAAGGGGGATTGCGCTGGCAATCGCTGAGCCTTTCGCCCGCCGACATGGATTTCGTGGCGCTGAAGGCCAATGCCGCGCGCGAGATCGCCCTGGCCTTCGGCGTGCCGCCGATGATGCTGGGCCAGCCGGGCGACAATACCTACGCCAATTATGCCGAGGCCAACCGCGCCTTCTGGCGGCTGACCGTGCTGCCGATGGCGGCGCGCATCCTGGACGGGATCGGCGAGGCGCTGGGCCATTGGTGGCCGGGCATCGCCCTCGAGGTGGACATGGACCGGATCGTCGAGCTGCACGGCGACCGCGAGCCGCTGTGGCGGATGGCGGCGGCGGCCGATTTCCTGACGGCCGAGGAAAAGCGCGAGATGCTGGGATTCGGGAGGACGCCATGATGGGGACCGACATGATCGCCCGGCTGGTGGAGCAGGCCGAGGCGGAGGGCGCGAGCCTGGTGACGCTGCGCGCCCTGGCCGAGGAAGCGAGCGAGATCGGCGCCGAGCGCGCCATGGAGCGGCTGGGGCTGGCCGATGCCGCCGCGCGCGCGGACATCGGCGAGCTGCGCCAGCTTCTGGGCGCGTGGCGCGATGCCAAGCGGACGGCAAGGAACGAGGTGATCGGATGGGCGATACGGATCGCGCTGGCGCTGCTGCTGCTGGGGCTGGCGGTGAAGACCGGGCTGATCGGGCTGGCGCGGGGGTGAGGAGCATGAGCGCGGTTTCCGCCGGGCTTTCCTGTCCCCCGGCCCCTCAAGGCGAAGGGGCGATCAGGTTCGCGGGCTATGCGGCGATATTCGACCGGGTGGACCGGGGCGGGGACGTGGTGCGGGCGGGCGCGTTCCGGCGCGCGGTGGAGGCCGGGCCGAAGGGCGTGCCGCTGCTGTGGCAGCACGACCCCGCCCGGCCGATCGGGCGGATCGACTATCTGGCCGAGGACGGCCACGGCCTGCGGGTGATCGGGCGGCTTTCCGGCCGGTCGCCGGCGGCGCGCGAGGCGGCGGCGCGGATCGGCGACGGCAGCGTGCGGGGCCTGAGCTTCGGCTATCGCGTGCGGGGAAGCGCGCAGAGAACAAACAGGGAACTTTTCGACCTCGACCTGGTGGAAGTTTCGCTGGTGACATTCCCGATGCAGCCGCGCGCGCGGGTGCATGCGGTGGAGCGGATATGACCGAGGAGACCAATATGGACGGCACTTACGACGTGAAGGCGGATGCGCTGGAGGCGAGCTTCGACGCGGCGGCGCGCGCCGACGAGCTGGCCGCGCTGCGCGGCGAACTGGCGGCGGTGAAGGCGCGGATCGACGCCCAGGCCGCCGCCGCGACCCGGCCGGCGCTGGCGGCGACGGGGGTGAAGGGCGATGCCTCGCCCGAGCGGCGCGCCTTCGTCGACGAATATCTGCGCCGGGGGGCGGACAGCGGCGTCAAGAGCCTAGCCGGCACCAGCCCGGCCGAGGGCGGCTATGCCGTGCCGCGCGAGATCGACGAGGTGATCGACAGCACGCTGAAGGCGATATCCCCCATCCGCGCCATCGCCAATGTGGTGAAGGTGGGATCGGCCGGCTATCGCAAGCTGGTGGCGACGGGCGGCTTCGCTTCCGGCTGGGCCGCCGAGGGCGCGGCCCGGCCGGCGACGGCGACCCCCGGCTTCACCGAGATCGTGCCGCCGATGGGCGACCTCTACGCCAATCCCGCCGCCAGCCAGGCGATGCTGGACGATGCCCGGTTCGACGTCGAGGCGTGGCTGGCGGGCGAGATCGCCACCGAATTCGCCCGCGCGGAGGGGGCGGCCTTCGTTTCCGGCAACGGCGCCGACAAGCCCCGCGGCTTCCTGGCCGGCCCGATCACCAACGAGGCGGACGGCGCGCGCGCCTTCGGCACGCTGCAATATGTGCCGAGCGGGGCGGCGGGCGCCTTCGCCGCCAGCAACCCGCAAGACCGGCTGGTCGACCTCATCCAGGCGCTGCGCCCGCCCTATCGCCAGGGCGCGGTGTTCGTGATGAATTCGGCCACGCTTTCGGCGATCCGCAAGTTCAAGACCGCCGACGGCGCCTTCCTGTGGCAGCCGGGCCTGGCCGAAGGGCGGCCGGACACGCTGCTCGGCTATCCGGTGGTCGAGGCCGAGGACATGCCCGACATCGCCGCCAACGCGCACGCCATCGCCTTCGGCAATTTCCGCGCCGGATATCTGGTGACGGAACGCGCCGAGACGCTGATCCTGCGCGATCCCTACACCAACAAGCCCTTCGTCCACTTCTACGTCAGCAAGCGCGTGGGCGGCGCGGTGAGCAATTCGGAAGCGATCAAGCTGATGAAGTTCGCCAGCGCCTGAGGACGAAGACGCGCCGCGCGGGTTTCCCCTTCCCCTGGCCCGCGCGGCGCGGTTGTTTTCCCCCCTGTTCGGAGACAGATATGGCAGACGCCTTCGCCGCCAGCGCGGACGCGGCCTATGCGCCCGCGATGCGCGCGGCCGCCATCGTGCCGCATGACAGCAACCCCCTGAACGACATCACCAAGGCCATCTATGTGGGCGCCGGCGGCGACGTGGCCGTCCGCGCCGCGCGCGACGGCGCGGACCATGTGTGGAAGGCGGTGCCGGCGGGATCGATCCTGCCGATCCGCGCCAGCCATGTGCGGGCCACCGGCACCACCGCCGGCGACCTGCTGGGCCTTTACTGAGCGGACGGCCGGCCATGCGCCTGGGCCTTGGCTTCGCGATCGCGCTGTTCCGGCGGCGCGACGCAACCGGCGGGGGCAGCCAGCCTCCGCTTTCCGTGCTGCTGGCCGAGGACGGGCTTGCCCTGGCCCTGGAAGACGGCGGCGCCTTCATCCTGGAGTGATCCATGCCATCGACGAAGATATCCGCGCTGCCGATCGCAGGGGCGATCGGCGGGGGCGAGCTGCTGCCCGCCGTGCAGGGCGGCGGCAATGTTGCGGTGACGCCGGCCATGCTGCGCCGGCACGCGCTGGCGGGCACGCCGCCGCTGATCGGGAGGCCGTTCGCCAAGCTGATGGTGGACGGCGACAGCAAGGCCGGCGAGGCGCCGGCCGCCGCCCGCTGGGTGGCGGCGCGCACGCCCCTGAACGTGTGGATCATGCCGGTGAGCTTCAGCGTCGGCGGATCGACATCGGGCACGCAGGCCGGCACGGGGCTGACCAACCCCGACCGCATGGCGGCGATGACGGCGGGCGTGGCGGCCGAGACGGCGGCCGACTGGATCGTCGACATGCTGCTGACCATCGGCACCAACGACGTGGTGCTATCGGGCCTTGCGGCGGACACCATCCTGGCCAACGTCCGCAAATATCACGACGCCTTCCGCGCGGCGGGCGGGCGCTTCCTGATCCTGATGGGGGTGGACCCGCGATCGGGGCTTTCCGCGGCGATGGCGCGCCAGATCGTGGCCGTGAACCGCGCCTATGCCGATTATTGCATGACGGTGCCGGACGCGATCTTCTGCGACACCGCGCCCTGGTGGCTGGACCCGGCGCCCACGAACGCCGCCTTTTCGCCGATCGGGGGCAGCACGGGCAACGCCTTCAGCATGGCGGCGGACGGGCTGCACGGCAGCGCTTATGGCAGCTACCGCAAGCAATTCGCCCTGGGGCCGATATTGCGGGCGATCTACCGCCCGCGCGATCCGATACCGTTGAACGCGGGCGACAGCTTCGACGCGACGACGGCGCCGCGCGGCAACATGCTGGGCGCCAATGCCCGCTGCGTGGCACTGGGCGGCACCAGCAGCATCGTCAATTCGGGCACCGGCGCCATCGCGGGGACCCCGCCCCTGGGCTGGACGGCGACGGGCACGCTGACCGGCGACCTGGGCGTGGCCTTTTCGACCGCAACCTGCGCGCCGCTGGAAGCCTATACCGGATCGAGCGGCTGGACGGCGGTGCGCATCGCCTTCACCGGCACGCCGGGCGAGGCAGGGTCGCTGACGCTGAGGCTGACCACCGCTTCCGCCCAGCAGGCGGGGATGCTGCTGGCAGGATCGGCGCTGCTGAGCGGCAACGCGCTGGTCGGCTGCCACGGCATCAGCGTGACCACGGCCAACATCACCCCGACCGTGAACAACATTCTGGGCGCGACCGGCACGCTGGGGGCGCCGGACCAGTTGCCGCAGATCGACGGATTGATCGCGCTGGACCTGATGGCCCAGCCGACCGCCGGCACCAACAGCGGCCTTTCCGTGGCGATCCGCTGGCGCGCGGGCGTGGCGATGAGCGGATCGATCGACCTGATCGGCGCGACATGGCGGCGGATCGACCCGATCCCCGCGGCGGCGGCCTGAGGGGAGGGCGGATGAGCATCTATCTGAAAGACCCCGACGCGACCGTCGATTATCTGATCGACTGGGGGGAGCGGCTGGAGGGCCGCAGCGTGACCGCGAGCGACTGGGCGGTGGTGCCGACCGAGGCGGGCGGGATCATCATCGCCGACGATGCGGTGGACGGCACCCGCACCCGGGCGACGCTGAGCGGCGGCGTGCCGGGCCATGTGTATCGCGCGACCTGCCAGGTCGAGCTTTCCGACGGGCGGATCGACGAGCGTTCGATCACCCTGCGCGTGGAGGAACGCTGATGCCGGCGGAGCAGGCGACGCCGCGCACGGCGGTGAGCGTGGAGGAAGCGCGGGCCTGGCTGCGGATCGACGGCGAGGCGGAGGATGCGGCGCTGGCCAGACTGATCCGGGCGGCGAGCGGCCTGTGCGAGCAGTTCATCGGCCAGGCGCTGCTGACCCGGCTGGTGAGCGAGACGGTGCCGGCCCGCGGCGACTGGCAAAGGCTGGCGCAAAGGCCGGTGCGCAGCATCGCCGAGGTGGCCGGATTGCCGGCGGAAGGGGCGGCCTTCGCCCTGCCGGTCGATGCCTATGCCATCGATATCGACGCGGCCGGCGACGGCTGGGTGCGGATCGACCGGCCCGGCGCGGCCGGCCGCGCGATCATCACCTACGAGGCCGGGATGGCGGCGGACTGGAACGGCGTGCCCGAGCCGATCCGCCAGGGGATATTGCGGCTGGTGGCGCATCTGCACGCCCATCGCGATGCCCCGGACGATGCCGGCCCGCCGGCGGCCGTGGCCGCGCTGTGGCGGCCGTTCCGCAGGATGCGGCTGTGAGCCGGCCGGGCGAGCTGAGCGGGCGGCTGCGCGAGCGGATCGCCATCCTGCGCCGGGACGAGGCGCGCGATGCGCTGGGCGGGGCGGACGGCGGGTGGCACCCGATGGGCGATGCCTGGGCCGCGATCGAGCCGGCGGGCACCGGCGCGCCGGTGGCGGGCGAGGCGATCCGCGCGCGGCCGCGCTGGCGGATGACGGTGCGGGCCGGGGCCGGGCTGATGCCCGGCGACCGCATCCGCTGGCGCGGCGCCGTGCTGCGCGTGCGGCAGGCGACCGCCGATCCGCGCCTGCCCGATCGCATCCTGGCCGAAGCGGAGGAGGAGCCATGAACCGGCGGATCGACGCCGCCGCCCGGCAGCGGGTGGCGATGCTGATCGCGGCGTTGCGCGAAACGGCGGAGGCGGAGCTGCCGGCCGGGCTGGCGGTGGAAGGCCGCGACGACGGCATCGCCATTTCCGGGCCGGGACTGGCGCGGCGGCTGGCCTATGACGGCAGCTTGCGCGGGCTCGCCTTCGCCATGCGGGGCATGAGGAGACGGGGATGAGCGCCATGGCATTGCAGGCGGCGCTGGTGACGGCGCTTTCCGCCCGGCTAGGCACGATGGTGAGCGGCATATTCGACGGGCCGCCGGTGCGGGCGGCCTTTCCCTATGTTTCGATCGGCGGCTGGGCGAGCGGCGACTGGAGCCACAAGACGGGGCGGGGCCGCGAGCACCGGCTGGCGGTGACGATCTGGGACGATGGCGGCCGGCCGGGACGGCTGCACGGGCTGATGGCCGAGGCCGAGGCGGCGATCGAGGGGATCGGGGAGATTGCGGGCGCCGAGCTGGCGAGCATCGCCTTCGTCCGATCCCGCATCATCCGCGACGCCGACGGGCCGTGGGCGGGGATCATCGAATATCGGGTGCGGGTGGCGGGATGACCGCCGCCGGCCGGCGGGACGGACAGGCGGCCCTGGGGTGCGGAGCCCACGGGGCGGAAGAAGGAGACCCATATGGCGGTGGAGAAGGGCAGCGCGTTCCTGCTGAAGATCGGCGATGGCGCGGCGACGCCGGCCTTCGCGACGGTGGCGGGGCTGCGCACGACGCAGATGTCGATCAACGGCGAGGCGGTGGCGATCACCACCAAGGATTCGGGCGGCTGGCGCGAGCTGCTTTCGGGTGCGGGCGTGCGATCGGTTTCGGTGGCGGGCAGCGGCGTCTTTACCGGATCGGCGGCGGAAAGCCGGCTGAAGGCCAATGCGCTGGCCGGGATGATCGACGATTATCGCCTGAGCTTCGAGGGCGGCGAGCAGATGCAGGGGCGCTTCCTGCTGACCCGCCTGGATTATAGCGGCGATTATAACGGCGAGCGCACCTATACGCTGGCCCTGGAAAGTTCCGGCCCGGTGGTGTCGCTGTGA
Protein sequences of DBSCAN-SWA_6 >NC_020561|3621454:3628698|3627622_3627841_+|WP_015460104.1|DBSCAN-SWA MNRRIDAAARQRVAMLIAALRETAEAELPAGLAVEGRDDGIAISGPGLARRLAYDGSLRGLAFAMRGMRRRG >NC_020561|3621454:3628698|3627837_3628218_+|WP_015460105.1|DBSCAN-SWA MSAMALQAALVTALSARLGTMVSGIFDGPPVRAAFPYVSIGGWASGDWSHKTGRGREHRLAVTIWDDGGRPGRLHGLMAEAEAAIEGIGEIAGAELASIAFVRSRIIRDADGPWAGIIEYRVRVAG >NC_020561|3621454:3628698|3626482_3626761_+|WP_041865534.1|DBSCAN-SWA MSIYLKDPDATVDYLIDWGERLEGRSVTASDWAVVPTEAGGIIIADDAVDGTRTRATLSGGVPGHVYRATCQVELSDGRIDERSITLRVEER >NC_020561|3621454:3628698|3626760_3627297_+|WP_015460102.1|head,tail|DBSCAN-SWA MPAEQATPRTAVSVEEARAWLRIDGEAEDAALARLIRAASGLCEQFIGQALLTRLVSETVPARGDWQRLAQRPVRSIAEVAGLPAEGAAFALPVDAYAIDIDAAGDGWVRIDRPGAAGRAIITYEAGMAADWNGVPEPIRQGILRLVAHLHAHRDAPDDAGPPAAVAALWRPFRRMRL >NC_020561|3621454:3628698|3627293_3627626_+|WP_015460103.1|head,tail|DBSCAN-SWA MSRPGELSGRLRERIAILRRDEARDALGGADGGWHPMGDAWAAIEPAGTGAPVAGEAIRARPRWRMTVRAGAGLMPGDRIRWRGAVLRVRQATADPRLPDRILAEAEEEP >NC_020561|3621454:3628698|3624588_3624855_+|WP_015460099.1|DBSCAN-SWA MADAFAASADAAYAPAMRAAAIVPHDSNPLNDITKAIYVGAGGDVAVRAARDGADHVWKAVPAGSILPIRASHVRATGTTAGDLLGLY >NC_020561|3621454:3628698|3622889_3623348_+|WP_015460097.1|head,protease|DBSCAN-SWA MSAVSAGLSCPPAPQGEGAIRFAGYAAIFDRVDRGGDVVRAGAFRRAVEAGPKGVPLLWQHDPARPIGRIDYLAEDGHGLRVIGRLSGRSPAAREAAARIGDGSVRGLSFGYRVRGSAQRTNRELFDLDLVEVSLVTFPMQPRARVHAVERI >NC_020561|3621454:3628698|3625019_3626471_+|WP_015460100.1|DBSCAN-SWA MPSTKISALPIAGAIGGGELLPAVQGGGNVAVTPAMLRRHALAGTPPLIGRPFAKLMVDGDSKAGEAPAAARWVAARTPLNVWIMPVSFSVGGSTSGTQAGTGLTNPDRMAAMTAGVAAETAADWIVDMLLTIGTNDVVLSGLAADTILANVRKYHDAFRAAGGRFLILMGVDPRSGLSAAMARQIVAVNRAYADYCMTVPDAIFCDTAPWWLDPAPTNAAFSPIGGSTGNAFSMAADGLHGSAYGSYRKQFALGPILRAIYRPRDPIPLNAGDSFDATTAPRGNMLGANARCVALGGTSSIVNSGTGAIAGTPPLGWTATGTLTGDLGVAFSTATCAPLEAYTGSSGWTAVRIAFTGTPGEAGSLTLRLTTASAQQAGMLLAGSALLSGNALVGCHGISVTTANITPTVNNILGATGTLGAPDQLPQIDGLIALDLMAQPTAGTNSGLSVAIRWRAGVAMSGSIDLIGATWRRIDPIPAAAA >NC_020561|3621454:3628698|3628290_3628698_+|WP_015460106.1|tail|DBSCAN-SWA MAVEKGSAFLLKIGDGAATPAFATVAGLRTTQMSINGEAVAITTKDSGGWRELLSGAGVRSVSVAGSGVFTGSAAESRLKANALAGMIDDYRLSFEGGEQMQGRFLLTRLDYSGDYNGERTYTLALESSGPVVSL >NC_020561|3621454:3628698|3624869_3625016_+|WP_170112669.1|DBSCAN-SWA MRLGLGFAIALFRRRDATGGGSQPPLSVLLAEDGLALALEDGGAFILE >NC_020561|3621454:3628698|3622575_3622884_+|WP_015460096.1|DBSCAN-SWA MGTDMIARLVEQAEAEGASLVTLRALAEEASEIGAERAMERLGLADAAARADIGELRQLLGAWRDAKRTARNEVIGWAIRIALALLLLGLAVKTGLIGLARG >NC_020561|3621454:3628698|3621454_3622576_+|WP_015460095.1|portal|DBSCAN-SWA MRLFGRKGGGGAVRPALARSGGWTMAAWPQEAGWPRGYDAQARDAVIANPVAQRAMRLVGDSVRSLSLVVTGGPEKAAALAGAAMEPLAVHLLLHGDGYAGIGCDAADEPMRLHVLRPERMTAETDARGWPVAYRYQVGQAVERLPVTDPLGRPGIIHLRAFHPLDDQYGLGCLEAAAGAVAIHNAATRWNKALLDNAARPSGALVYDPGEKGAELSPEQFDRLKAEMEAGFAGGTNAGRPMLLEGGLRWQSLSLSPADMDFVALKANAAREIALAFGVPPMMLGQPGDNTYANYAEANRAFWRLTVLPMAARILDGIGEALGHWWPGIALEVDMDRIVELHGDREPLWRMAAAADFLTAEEKREMLGFGRTP >NC_020561|3621454:3628698|3623362_3624508_+|WP_041865533.1|capsid|DBSCAN-SWA MDGTYDVKADALEASFDAAARADELAALRGELAAVKARIDAQAAAATRPALAATGVKGDASPERRAFVDEYLRRGADSGVKSLAGTSPAEGGYAVPREIDEVIDSTLKAISPIRAIANVVKVGSAGYRKLVATGGFASGWAAEGAARPATATPGFTEIVPPMGDLYANPAASQAMLDDARFDVEAWLAGEIATEFARAEGAAFVSGNGADKPRGFLAGPITNEADGARAFGTLQYVPSGAAGAFAASNPQDRLVDLIQALRPPYRQGAVFVMNSATLSAIRKFKTADGAFLWQPGLAEGRPDTLLGYPVVEAEDMPDIAANAHAIAFGNFRAGYLVTERAETLILRDPYTNKPFVHFYVSKRVGGAVSNSEAIKLMKFASA |
13 | Geobacillus_phage(25.0%) | head,portal,capsid,tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020563_1 | 33078-33167 | Orphan |
NA
Consensus repeat of NC_020563_1
|
1 spacers
spacers of NC_020563_1
>1.1|33102|42|NC_020563|CRISPRCasFinder GCGTGAGTGTTGGGGTAGCCTTCGGTGAGTTTTGGGGTCGGT |
CRISPR arrays and Neighbor proteins around NC_020563_1
The CRISPR arrays of NC_020563_1 >merge|NC_020563|1|33078-33167|CRISPRCasFinder AAAAGTGAGTTTTGGGGTCGAGTCGCGTGAGTGTTGGGGTAGCCTTCGGTGAGTTTTGGGGTCGGTAAAAGTGAGTTTTGGGGTCGAGTC >NC_020563|1|1|33078-33167|CRISPRCasFinder AAAAGTGAGTTTTGGGGTCGAGTC GCGTGAGTGTTGGGGTAGCCTTCGGTGAGTTTTGGGGTCGGT AAAAGTGAGTTTTGGGGTCGAGTC
>NC_020563.2|WP_015460621.1|31113_31338_+|conjugal-transfer-protein-TraD MARRERTRHLIELGGLVQKAGLVELADDDRATLYGALLDCTARVQGDDAGNVLALWKRRGKRAFDAEAEGAGNG >NC_020563.2|WP_015460622.1|30770_31076_+|conjugal-transfer-protein-TraD MRKVRDYDAELRALNDKAKALKARKVQQLGELVTSTGADALDLDTLAGALLAAVEAADANEKEAWRSRGAAFFQGRGRKAGRRTGGNGEGARQTGAGKEQA >NC_020563.2|WP_015460623.1|27463_30598_-|Ti-type-conjugative-transfer-relaxase-TraA MAIYHFSAKVISRANGSSAVASAAYRAAERLHDDRLGRDHDFSNKAGVVHSEILAPEGAPERLNDRATLWNEVEAGEKRKDAQLAREVEFSIPRELNQQQGIQLARDFVEKQFVERGMVADMNVHWDMGKDGQPKPHAHVMLSMREVGPEGFGQKVREWNSTALLQEWRVAWADHVNERLAELDIDARIDHRTLEAQGIDLEPQHKIGPAASRMPEQGLEAERVEDHARIARENGEKIIARPEIALDAIARQQATFTRRDLAQFAFRHSDGKDQFDQVMSAVRSSPELVALGRDGKGEDRFTSRDMIAAEQRLERAAEGLAIDRGHGVADAHVTRALASAEGRGLDLSAEQRGALAHITGDKGLASVVGYAGSGKSAMLGVAREAWEAQGYQVRGAALSGIAAENLEGGSAIASRTIASMEYQWEQGRELLGPRDVLVIDEAGMIGTRQMECVLSHAEQAGAKVVLVGDPEQLQAIEAGAAFRAVTERHGWAEITEIRRQCEDWQRDATKALATGRAGEAIHAYEAHGMVQAAETRELARADLVDRWDAERIAAPDQSRIILTHTNAEVRDLNLAARDRLRDAGELGPDVRVSAERGARDFATGDRIMFLKNERGLGVRNGTLGKVEQVSPERMAVKLDDGRSVAFDLKDYAHVDHGYAATIHKSQGVTVDRAHVLATPGMDRHSAYVALSRHRDGVQLHYGRDDFGDDRRLVRTLSRERAKDMASDYGRDRDAEIRAFADRRGLSGEIRLPERAERSPVEILGPRAGTMRQMGEDPRTVRDAGDRGAGAGQAAAERQPRRGMFDGFRPAPQRPAPESTPAGEREKAAPKRGMFDGLKLSAAPLKGAERAPVPADRGQGRDYARAVERASRSAEAVLQARASGAPVLEHQKVALERTTQALDQIRPGASRDLASAMQRDPALLREAAAGRSGPMIEAMAQEARVRADPNLRADRFVERWQGLKQERDRLYRAGDMAGRERTGKEMAGMAKSLERDPQVELVLRNRTRELGLEIGMGRGRGMNSGDLGRELARDLGIGMGRGMSR >NC_020563.2|WP_015460630.1|26733_27456_-|hypothetical-protein MMDEDNYRNNGRAGDDPQAAFEQLRGEVALVRLAVEGLARARESIEIPDYQPTLANTEKILLALTQRVDVIAKSPAMKLTPETMGERVNASVASATGELHNLVNSTRSDMSEAARELRGLIGTTRARWQQDRWLFWIGLGGVVLGILLYALLAGLIARAMPDSWQLPERMATRALAEPTLWDAGTHLMQRASPASWEGIVAAANLARDNRETIEACGAAAAKAKKTVRCTIEVKPANNDR >NC_020563.2|WP_007683476.1|24224_25586_+|hypothetical-protein MKRGHDLTGLMKFATRPEWADDLHDALDDHLGPVLTQFDIDSDELPGIIGDHWAMTLWGCAFEDLVTRVFEPDGRNIVDEYLKRRGWNEAGPNKLYMRALKTSVMSVYEVSAIEPGVGFLARDLIRGGDPVQVRERTASRTLGPWDRIGVRIIPVSGHRILAGGLLSFTAEATSALLEALRLGQGKRGPRAKLVIDDDQLRDLAPLISMVWLFDILPRMLEPVAIPTLHNADGEEVVFHRVRFPFTRGTTQALIGDRLDTVPALQRETSHFWNWLGTRTKQGKKGTGQMAWGVSMEDGTPVLGNLELKGRALILSVTSAERAERGVALVTQALGALVGTPLTEIETIEQAMAARQEGRTVSEPAPDIPVEVATPLVHGMLDRQYRTLLDEPVPMLGDKTPRQCAGSKAGRDQLATWLKHLENLSGRHADIDDPMATYDFGWIWQELGIEELRR >NC_020563.2|WP_007683474.1|23634_24228_+|recombinase-family-protein MTRAPYLIGYARVSKGDEQSNAAQRRALDAAGCRRVFEEIASGGRWDRPKLLEMIGQLRDSDVVVVWKLDRLSRSLKDLLHIMERIEAAGAGFRSLTEAIDTTTAAGRMMMQMVGSFAEFERAMIRERTSAGLAQARAEGRIGGRRRKLGEKQRREIAESVISGRKSGAEMARLYHVSEPTVSRIVAAHRQTMELPA >NC_020563.2|WP_007683471.1|20557_23488_-|Tn3-family-transposase MTTRQRAALLMLPDDEAAIVKHYSLSGEDMTAIDTARTPATRLGYALQLCCLRYPGRHLRHGELLPAVMLDHIAEQVGVDAKVIADFARRTPTRYDQLAAIKTRFGFSDLSRPHRVELRTWLTNEAASIIDGRALLGRLLDEMRARRIVIPGVSVVERMAAEAIHQAETDLVAAIDGGLGHEMRQQLDALIDDKVHDRQSRLSWLREPEPRVASASLLEIVEKITLIRGTGISAFSPDVRHEPRLGQFAREGVRYTAQAFQQMRPARRRVVLLATLRELEATLTDAAIDMFIALVGRAHLRARKRLEQRVAVSGREGRERMLRIARVLEAISQAARAGGDVAAAVDAVASLDIIDADAAIIRRTASPHRNEVLDEIAAEYRAFKRMGPSFVRAFDFQGRAGMQPLRDAMAILADLDGDWRRALPDDVPLGHVEHRWRRHVMTAGGIDRTHWEMATYSALSNALASGGIWVPTARVHRALSVLLAPPASPVPKPAFSLGDPHAWLDERAARLDSALREVARDLDKRDPPLFAGERLRFPKDPKEDPGQDEGRQLALTCYGMVPATRITDVLSQVQRWTGFIQHFGHVSTGLPPADERAFLATLIAEATNLGLSRMAEVCGVASRRALLRMQTWHMREETFRAALASLTDAIHAEPLAAWFGSGHRASADGQAYYLGGAGEAGGTVNAHYGRDPVVKIYTTITDRYAPLHQTVIAGTAGEAIHALDGILGHESSADITALHTDGGGVSDIVFAVMHLLGLDFEPRIPRLSDRQLYGFEPARRYGRLAPLFGRRLGRDLIVSHWAEIAEVIAAMRDRTVTPSLILKKLSAYRQQNSLAAALREVGRIERTLFTLRWFDDTDLRRTVTAELNKGEARNSLARAVAFHRLGRFRDRGLENQQTRAAALNLVTAAIILFNCRYLGRAVDELRHRGTPVDPAMLSRLSPLGWDRINLTGDYIWSESLDLDADGLMPLLIKPLP >NC_020563.2|WP_007682112.1|19932_20187_-|hypothetical-protein MLKLHDFCNRAGARILWCTPVFGQAVGTQHIDEILAVWYPTHKTFLDLSDAPGAKESYRLRGACVAYAVIHRCSGSNSPLDGNG >NC_020563.2|WP_015460628.1|18956_19847_-|haloalkane-dehalogenase MSLGAKPFGEKKFIEIKGRRMAYIDEGTGDPILFQHGNPTSSYLWRNIMPHCAGLGRLIACDLIGMGDSDKLDPSGPERYAYAEHRDYLDALWEALDLGDRVVLVVHDWGSVLGFDWARRHRERVQGIAYMEAVTMPLEWADFPEQDRDLFQAFRSQAGEELVLQDNVFVEQVLPGLILRPLSEAEMAAYREPFLAAGEARRPTLSWPRQIPIAGTPADVVAIVRDYAGWLSESPIPKLFINAEPGSLTTGRIRDFCRTWPNQTEITVAGAHFIQEDSPDEIGAAIAAFVRRLRPA >NC_020563.2|WP_001389365.1|18032_18797_+|IS6-like-element-IS6100-family-transposase MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NZ_CP005192 | Sphingobium sp. MI1205 plasmid pMI3, complete sequence | 31147-31188 | 0 | 1.0 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NZ_CP005087 | Sphingobium sp. TKS plasmid pTK3, complete sequence | 20315-20356 | 0 | 1.0 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NC_020562 | Sphingomonas sp. MM-1 plasmid pISP1, complete sequence | 159529-159570 | 0 | 1.0 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NZ_CP005193 | Sphingobium sp. MI1205 plasmid pMI4, complete sequence | 18238-18279 | 0 | 1.0 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NC_020563 | Sphingomonas sp. MM-1 plasmid pISP4, complete sequence | 33102-33143 | 0 | 1.0 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NZ_CP005088 | Sphingobium sp. TKS plasmid pTK4, complete sequence | 56281-56322 | 0 | 1.0 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NZ_AP017658 | Sphingobium cloacae strain JCM 10874 plasmid pSCLO_4, complete sequence | 34773-34814 | 1 | 0.976 |
NC_020563_1 | 1.1|33102|42|NC_020563|CRISPRCasFinder | 33102-33143 | 42 | NZ_CP047220 | Sphingobium yanoikuyae strain YC-JY1 plasmid unnamed3, complete sequence | 56938-56979 | 2 | 0.952 |
1. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NZ_CP005192 (Sphingobium sp. MI1205 plasmid pMI3, complete sequence) position: , mismatch: 0, identity: 1.0
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagtgttggggtagccttcggtgagttttggggtcggt Protospacer ******************************************
2. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NZ_CP005087 (Sphingobium sp. TKS plasmid pTK3, complete sequence) position: , mismatch: 0, identity: 1.0
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagtgttggggtagccttcggtgagttttggggtcggt Protospacer ******************************************
3. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NC_020562 (Sphingomonas sp. MM-1 plasmid pISP1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagtgttggggtagccttcggtgagttttggggtcggt Protospacer ******************************************
4. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NZ_CP005193 (Sphingobium sp. MI1205 plasmid pMI4, complete sequence) position: , mismatch: 0, identity: 1.0
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagtgttggggtagccttcggtgagttttggggtcggt Protospacer ******************************************
5. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NC_020563 (Sphingomonas sp. MM-1 plasmid pISP4, complete sequence) position: , mismatch: 0, identity: 1.0
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagtgttggggtagccttcggtgagttttggggtcggt Protospacer ******************************************
6. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NZ_CP005088 (Sphingobium sp. TKS plasmid pTK4, complete sequence) position: , mismatch: 0, identity: 1.0
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagtgttggggtagccttcggtgagttttggggtcggt Protospacer ******************************************
7. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NZ_AP017658 (Sphingobium cloacae strain JCM 10874 plasmid pSCLO_4, complete sequence) position: , mismatch: 1, identity: 0.976
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagttttggggtagccttcggtgagttttggggtcggt Protospacer ******** *********************************
8. spacer 1.1|33102|42|NC_020563|CRISPRCasFinder matches to NZ_CP047220 (Sphingobium yanoikuyae strain YC-JY1 plasmid unnamed3, complete sequence) position: , mismatch: 2, identity: 0.952
gcgtgagtgttggggtagccttcggtgagttttggggtcggt CRISPR spacer gcgtgagttttagggtagccttcggtgagttttggggtcggt Protospacer ******** **.******************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
12403 : 23488
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_020563|12403:23488|DBSCAN-SWA ATCAGGCGGCTGCTGCGAAATGGTGGTTGAGCATGCCCATGGCCTCCGTCAGCGCCGAGGGCCCAATGCCAAAAGCTCTCTCCACAAGGCGCACCTCGCCCCTGATGCCGGGCTGCAGGCACCAGGGGCGAGCCTGTCCTTTGCGCAGGGCTCGCATGACTTCGAATCCCTTGATCGTGGCATAGGCCGTGGGGATCGATTTGAAACCGCGCACCGGCTTGATCAGTATCTTGAGCTTTCCGTGATCGGCCTCGATCACGTTATTGAGATACTTCACCTGCCGGTGGGCCGTCTCCCGGTCCAGCTTTCCTTCGCGCTTCAATTCGGTGATCGCTGCACCATAGCTCGGCGCTTTGTCGGTATTGAGCGTGGCAGGCTTTTCCCAGTGCTTCAGGCCTCGCAGGGCCTTGCCCAGGAACCGCTTCGCTGCCTTGGCGCTGCGGGTCGGCGACAGGTAGAAATCGATCGTGTCGCCCCGCTTGTCGACTGCCCGGTACAGGTAGGTCCACTTGCCCCGCACCTTGACGTAGGTTTCATCCAGGCGCCAGCTCGGATCAAAGCCACGCCGCCAGAACCAGCGCAGCCGCTTCTCCATCTCCGGGGCGTAGCACTGGACCCAGCGATAGATCGTCGTATGGTCGACCGAAATGCCGCGTTCCGCCAGCATTTCCTCAAGGTCGCGATAGCTGATCGGATAGCGACAATACCAGCGCACCGCCCACAGGATCACATCACCCTGGAAATGGCGCCACTTGAAATCCGTCATCGTTCCGTCCGTCCAATCTCCGCCAAGCATGCTCAAGCTTCACGATTTTTGCAACAGAGCCAGATGGCCTATGCGAACCGCGATACCGGCAACGTGCTGGGTGCCGCGGAAACGGCGGGGATGACCGAGTATCTCGATCATCCCGCGATCAATCCCAAGCAATCGAAACACTGGGCGCTGGACATCGCCGTCGCTTTTCCCACGCTGCACATAAACCTTTCGCCCGGCGGTTTCTGGTCGCACAGGTTCTGGCCGATCAGCGTCAATCGCACGCGGTGGGAAGCCGAGTTTCATGTACCCAAGCCACTGACCGGGCGGGAAAAGCTGCAACAGGAACTCTATATCACGCGCATCGCGGAAATCCTGCTGGAGGACGTGACGAACACCGAACGCACCCAGCGCGGCATCGAGGCCGGCGCATACGATACGATGCAGTTGCAGGATGGCGAAGTACAGATCCGCCACAACCTTCACCACGTCGATCGCTGGGTGAACGCCAGGACCGTGCGCGAAGCGCTGGGCATCTGACAGGCAGCGCCCTTCACTTGCCCGTCAGTCAAAAATGGGGTCCTCAATGTCCGGATTGGAGCAACGCAGCTATATCGTGACGGGCGCAGGCGGCAGTATCGGCCGCGCTGCCGCCCTTATTCTTGCCCGTCGGGGCGCGAATGTGGTTGTCGCCGATATCGTCGAAGCCGCAGCCGAGGAAACGGTCAGACTGATACGCGCGGAAGGTGGCAACGCCGTCTACAGCCATTGCGACGTCAGTATCGAGGACAACGCGTGGTCGACCGTGGAACTCGCCATCACCGAATTCGGTCGCCTTGATGGCGCATTCAACAATGCGGGGCTGCCGCCGGTCAACGTGCCGCTTCATCAACTGACGAGCGCCGATTTCCAACGATCCCTGGATATCAACGTGCTCGGCGTTTTTCATTGCATGAAACATCAGATCGCAGCGATGCTGAAAGCGGGGAGCGGCGGTTCGATCGTGAACACGGCGTCGGTTGGCGGCACCGTCGCGATCCCCGCACATGGCGAGTACATCGCAGCCAAACATGCCGTTATCGGGCTGACGCGGATCGCTGCCGTCGATTACGGGCAACAGGGAATCCGCGTGAACGCAGTTCTACCGGGCGCTATCCGCACTCCGATGCTGATGTCGGTGATCGAGAACGAGCCTTCGCACGCGGAATTCCTGAAAACCGCCCACCCGATCGGTCGATATGGCGAACCGGCGGAGATCGGCGAAACGGCAGCCTGGTTGCTGTCGGATGCCTCTTCGTTCGTCACCGGCGCCGCCATTGCTGCCGATGGCGGCTACACCGCAATCTGAACTGCCAGAACCGTTGGGAACCGGACTATGAGCGCAATACGAATGATGGCGGTCTTCGCACGGAGAGTGGGAATGACGCCTCAGGCGTTTCACGATCACTGGCGCTATCCCCACGGCTCGCTGGGACAAGGCATCCGGATTATCAAGCGCTATGTCCAGAGCCATCAGGTCGACGTCGATTTCCTCGATGCCGATCAGAACCGCTACGAAGGCATCGCCGAAGTCTGGTTCGACAGCGAAGCGGACGCTGCCGCCTTCGTGACCGATCCTGATTATGTGGCCTATGCGCTGAAAGATGAGCCGAATTTCGTCGACATGGACAACTTCTTTGCCGTTTTCGCACGCGAAGAGGTTCTCAAAAGCCGCGCCGAAGCGCTGCGCGGTGACGAGGCGGCCTATAACTGGTCGATCGACCGACGACCGAACTTCGCCAAGCTTACCCAGGTGGTGTTCGAGGAAGGCGGCACGGCCTGGGCCGGAGACGACGATTTCGCACTTGGCGAGCGTCTTGGCGCATTTCGCCATACCCGGTGCCATCCGATCGAAAAGCTCGTCCTCCCCGGCCCCTCGGTGATCGGTGTGCGCGAACTTTATTGGCCGACGCGGACATTGGCAGAACAGGGCATAGCTGCAGCAGGTGACGCTTTCCATGCGCTGTTTAGCCGAGCGCCAAAGGCACAAACGATCTTCTCGATAAGTGAGCGCTACGTCTGACGGTGCAGGTTAACCGCCCCTGCGCCTCAAGACGCTTCTCAACCAGACGCCGCTCCTGCGCAGCCCATGGTAGAGCCAGAATGCGATCCATACCGAGCTTCAGCGGCCCTTCCAGGCGCTTCTGCAAATAGCGGCCGATCTCGAACGGGTTAGTCTTCGCCTGCCCTTCCACCACGGTCTTGCCGTCGGGATCGAGAACACAGATGCTCGTGCTGCGCTGGCTGACATCCAGACCCACGCTGTATGTTGCTTCGTGCCGGATAGGCGAGCCGTCCGATCAGGCGCAGGATGTTGCGCTCTGCCGGCGATTGTGGCGCATTGCGCAGCCAGGATAGCCATGTCGCGGCCTGCCCCGGTCGTCGGGCTAGCAAATCGTCCAGACCCTGAGCCAGCCAGCGCGGACGACGGCGATGCTGGCCGATGCGCGAAATCGTCGAAGCCATTTTCTATCTGCTACGGGCAGGATGTCCCTGGCGGCTTTTGCCCGACAGTTTTCCGCCATGGCGCACGGTATACCAGTGGTTCTGCACCTTGCGCGACGATGGGGTATTCGAAAGCCTCAATCATCATCTCGTTCGGATCGATCGCATCCGGACAGGGCGAGAACCAGCACCATCCGCGGCAGTCATCGACAGCCAGAGCGTGAAGACCACCGAAGCAGGCGGACCTCGTGGTTACGACGCAGGCAAGAAGACCATGGGACGCAAACGCCATGCAATGGTCGATACCGATGGTCGCGCCCTTATCATACTGGTCCATCCTGCCGATGTGCAGGACCGCGATGGCGCGGTGCCCTTGCTCCAGCAGTCTCACCAGCGGCATCCCTTCGTCGCGCGCGCCTATGCCGACAGCGCCTACAACAGCGATCGCGTCCGGGACGCCACCTCCATTACGATCGAAATCGTCCGCAAATTCGCCGATCAGACCGGCTTCGTCGTCCATCCCAGACGATGGATCGTCGAACGCACCTTCGCATGGATCAACCGCAATCGCCGTCTGGCCAAAGACTTCGAGCGGACCATCAAATCCGCAACCGCGCTCCTCTATGCCGCCGCTGCCATCGTCCTCATCCGACGCATCGCTCGTTACCCATGAGATTCAAGACAGACTCTTAACAAAATTTCGTTACTTGACGGATTTCTTGGACTTAGGGTTGCAGTGCGCCGCATGTAGATGGCTGTTCGTGGAGAAAGACAACGTGATACGTATGTTGGGTGAATTTTGGCCAGATATATTCGTGATCTTACTGGTCTGTTCGCTCGTATCATTTGCAAACCAAAAATTCGGAATATTTGTCGCTACTCCGATAACATTGGGATCGACTGCATTAACGCTATTGCTCGGAGCCGCTGGGGCGACATGATTCTACTCTTATCTTCCTTTTGGCCTTGGCTTGTTCCAGCAATTGCGTGGATCGGATTTTTCACCGGTCGCTATTTTAGAAGAAAGAGCATAGCGAAAAACACATGATTGCCGCAACTCTGGCGGCTTCGCGCGCCTACGCCGGGCCGGGTTACTCCCGGCCGTGAAAGCGGCATCCATCTGCGGCCTGCACGTCAAGCGTGTCGGCGCTCCTTCGCAGCCGCCAACATCTCGCGCCGAAGGATTGCGTTCATGCGAGCCTGATAGCCCTTGCCCTGGGATTTGAGCCAAGCAAGCACATCGGCGTCGAGCCGCGCCGTTACCTGCTGCTTGATCGGCTTATAGAACCGTCCACGCTCGGCGGTCTTCCAGAAATCCTCTGTCAGCGTCGGGGCATCACTATAGTCGATACCGCTATCCGGCATCGCCTGCAGGGCATCCAGCTCGGCCTTCTGTGCCTCGGTCAAGGGGGGCAGATTGCCCGGATCGAGCTGGAAGCGGACGGTATCAGCGTCTTTCTTCTTCATAACGTCGTCTTTCCTTTCGATCGGCTTTTCGGGCCGAAATGATATGGATGATCTCCACATATTCGCCGTCGTCGTCATCCTCAGCGACGGTGTGGGCCACCAGCAGGAGTAAATGTCCTTCGACCAGGCCCAAGGTCTGCCATCGTTGCTCGCCGCCTTCGATCCGATCATGCACGCTAAGCGCAAAGGGGTCTGCGAAGGCGCTGGCGGCCGTCTCGAAACTGATGCCATGCTTTCTCAGATTGCTCTCGGCTTTGGCTGGGTGCCAGGAAAAGCGCATGGATAACATGACGGGAATATAATTATATTTTTGTCGTTATTCAAGGCGTATCCTTGACCGGCGCTGACCACCCGATCCGCTCCAACGTGTCCGTCAATGTTGATCGCGGCACGCTGAACGTGCGAGAAACCGACGCTTTGCTGGCCCCATTGGACAGAGCGGCAACTATCAACTGCATCTTTTCCGCGGCGATCGTCGGCGGTCGTCCACCTTGTCGGCCCCGCCGCTTTGCGGCGGCCAGGCCCGCCATGATCCGCTCCCGCGTCAACGCTCGCTCGTATTGGGCGAGCGCGCCGAACAGCGAGAACAGCAACTCGCCATGCGGCGTGGTGGTATCCATCTGTTCGGTCAGCGACCGGAAGGCGATGCCGCGTTCCTTGAGGTCGGTGACGATCGCCAGCAGGTGCGGCAGCGAGCGGCCGAGCCGATCGAGCTTCCAGACAACCAGCGTGTCGCCGGCGTTCATGTAGTCGAGGCAGGCTTTCAGGCCGGCGCGATCGTCGCGCGCACCGGAAGCCTTGTCGGAGTAGAGGTGCCGTTGATCGACACCAGCGGCCGTGAGCGCATCGCGTTGCAGATCGACCGGCTCTGTTGCAAAAATCGTGAAGCTTGAGCATGCTTGGCGGAGATTGGACGGACGGAACGATGACGGATTTCAAGTGGCGCCATTTCCAGGGTGATGTGATCCTGTGGGCGGTGCGCTGGTATTGTCGCTATCCGATCAGCTATCGCGACCTTGAGGAAATGCTGGCGGAACGCGGCATTTCGGTCGACCATACGACGATCTATCGCTGGGTCCAGTGCTACGCCCCGGAGATGGAGAAGCGGCTGCGCTGGTTCTGGCGGCGTGGCTTTGATCCGAGCTGGCGCCTGGATGAAACCTACGTCAAGGTGCGGGGCAAGTGGACCTACCTGTACCGGGCAGTCGACAAGCGGGGCGACACGATCGATTTCTACCTGTCGCCGACCCGCAGCGCCAAGGCAGCGAAGCGGTTCCTGGGCAAGGCCCTGCGAGGCCTGAAGCACTGGGAAAAGCCTGCCACGCTCAATACCGACAAAGCGCCGAGCTATGGTGCAGCGATCACCGAATTGAAGCGCGAAGGAAAGCTGGACCGGGAGACGGCCCACCGGCAGGTGAAGTATCTCAATAACGTGATCGAGGCCGATCACGGAAAGCTCAAGATACTGATCAAGCCGGTGCGCGGTTTCAAATCGATCCCCACGGCCTATGCCACGATCAAGGGATTCGAAGTCATGCGAGCCCTGCGCAAAGGACAGGCTCGCCCCTGGTGCCTGCAGCCCGGCATCAGGGGCGAGGTGCGCCTTGTGGAGAGAGCTTTTGGCATTGGGCCCTCGGCGCTGACGGAGGCCATGGGCATGCTCAACCACCATTTCGCAGCAGCCGCCTGATCGGCGCAGAGCGACAGCCTACCTCTGACTGCCGCCAATCTTTGCAACAGAGCCAAAAATGTCGTCGCTCATGTCGGTCTCTCCTATTGTCCTGCGCGGGTGATCGGCGTCGCCGCGTCTGTGGTTTGTTCGGATCTTAGAAAATGAGCCGGTTCCGGATTATGCTGGGCGCAATCGCCGGACAAACGCCGCAATCGCCGCGCCAATCTCGTCCGGACTGTCCTCCTGGATGAAATGGGCGCCTGCGACCGTGATTTCGGTCTGGTTTGGCCATGTGCGGCAGAAGTCGCGTATTCGGCCCGTGGTCAAGGATCCCGGCTCGGCGTTGATGAAGAGTTTCGGAATCGGGCTTTCGCTGAGCCAGCCGGCATAGTCCCGGACGATCGCGACCACGTCGGCCGGGGTGCCTGCGATCGGGATTTGGCGAGGCCAAGACAGGGTCGGTCGACGGGCTTCGCCGGCGGCGAGGAAGGGCTCGCGATAGGCGGCCATCTCCGCTTCGCTTAAGGGGCGCAGGATCAATCCGGGGAGAACTTGTTCGACAAAAACATTGTCCTGCAACACCAATTCTTCGCCCGCCTGCGAGCGAAAGGCCTGAAACAGATCGCGATCCTGTTCGGGAAAATCCGCCCATTCGAGCGGCATGGTGACCGCTTCCATATAGGCAATCCCCTGTACACGCTCGCGGTGGCGGCGGGCCCAGTCGAAGCCGAGGACGGACCCCCAGTCATGCACGACCAGAACAACCCTGTCCCCGAGATCGAGCGCCTCCCACAGCGCGTCGAGATAGTCACGATGCTCGGCATAGGCATAACGCTCGGGCCCCGACGGATCGAGCTTGTCCGAATCGCCCATGCCGATCAGGTCACAGGCGATCAGCCGTCCCAGCCCGGCGCAATGCGGCATGATATTGCGCCACAGATAGGACGACGTCGGATTGCCGTGCTGGAAGAGGATCGGATCGCCGGTCCCTTCATCGATATAGGCCATGCGCCGGCCCTTGATCTCAATGAATTTCTTCTCGCCAAATGGCTTTGCGCCGAGGCTCATCGATATTCTCCTTGAGCGATTTTCTGGTCTGCCCGGTCGGCGATGCATATCGCATGATGAATATCATGTCAATCGAGGAATCGTTCTAGCCATTGCCGTCGAGCGGAGAGTTGCTGCCGGAGCATCGATGGATGACGGCGTAGGCCACGCAGGCGCCTCTCAGGCGGTAGCTTTCCTTAGCCCCGGGCGCATCCGAAAGATCGAGGAAGGTTTTGTGAGTCGGATACCAGACCGCGAGAATCTCATCGATATGTTGAGTGCCCACGGCCTGGCCAAACACGGGCGTGCACCACAGGATCCGCGCGCCGGCTCTGTTGCAAAAATCGTGAAGCTTGAGCATGCTTGGCGGAGATTGGACGGACGGAACGATGACGGATTTCAAGTGGCGCCATTTCCAGGGTGATGTGATCCTGTGGGCGGTGCGCTGGTATTGTCGCTATCCGATCAGCTATCGCGACCTTGAGGAAATGCTGGCGGAACGCGGCATTTCGGTCGACCATACGACGATCTATCGCTGGGTCCAGTGCTACGCCCCGGAGATGGAGAAGCGGCTGCGCTGGTTCTGGCGGCGTGGCTTTGATCCGAGCTGGCGCCTGGATGAAACCTACGTCAAGGTGCGGGGCAAGTGGACCTACCTGTACCGGGCAGTCGACAAGCGGGGCGACACGATCGATTTCTACCTGGCCTCTGTACATTATTCGGACACAGATTCACGGTAGCGGCTTGATGAGCAGCGGCATGAGGCCATCGGCATCGAGGTCGAGGCTCTCGGACCAGATATAATCGCCGGTGAGATTGATGCGGTCCCATCCCAGCGGCGACAATCGGGACAGCATGGCGGGATCGACAGGTGTGCCGCGGTGCCGCAGTTCATCGACGGCACGACCGAGATAGCGGCAGTTGAAAAGAATGATGGCGGCCGTGACGAGGTTGAGCGCTGCCGCGCGGGTCTGCTGGTTCTCCAGGCCACGGTCGCGAAACCGCCCGAGCCGATGGAAGGCGACAGCGCGGGCCAGGCTGTTGCGTGCCTCGCCCTTGTTGAGTTCGGCGGTGACCGTCCGGCGCAGATCGGTGTCGTCGAACCAGCGCAGCGTGAACAGTGTGCGCTCGATCCGTCCGACCTCCCGAAGTGCCGCAGCAAGACTGTTTTGTTGACGATAGGCGGACAGCTTCTTCAAAATCAGCGACGGTGTGACAGTGCGGTCGCGCATCGCCGCGATGACCTCCGCAATTTCGGCCCAATGGCTGACGATCAGGTCCCGACCGAGGCGGCGCCCGAACAGCGGTGCGAGCCGACCATAGCGCCGCGCGGGCTCGAAACCATAGAGTTGTCGATCCGACAGGCGCGGAATGCGCGGCTCGAAATCGAGGCCGAGCAAATGCATGACGGCGAACACGATATCGGAGACGCCGCCACCATCGGTGTGGAGGGCGGTGATGTCCGCGCTGCTTTCGTGGCCGAGGATGCCGTCGAGTGCGTGGATCGCCTCGCCCGCCGTCCCGGCGATGACCGTCTGATGAAGCGGTGCGTATCGATCGGTGATGGTGGTGTAGATCTTGACCACGGGGTCGCGACCGTAATGTGCGTTGACCGTTCCTCCGGCTTCGCCGGCACCGCCGAGATAATAAGCCTGCCCATCGGCCGATGCGCGATGCCCCGAACCAAACCAGGCGGCGAGCGGTTCGGCATGGATAGCGTCGGTCAGGCTGGCGAGCGCGGCGCGAAACGTTTCCTCGCGCATATGCCACGTCTGCATACGCAACAGTGCGCGACGCGAGGCGACACCGCAAACTTCGGCCATGCGCGACAGACCAAGGTTGGTCGCCTCCGCGATCAAGGTGGCAAGAAAGGCGCGTTCGTCGGCGGGAGGCAGACCGGTCGAGACATGGCCGAAATGCTGGATGAAGCCGGTCCATCGTTGAACCTGCGACAGCACATCCGTGATACGGGTGGCAGGCACCATGCCGTAGCAGGTGAGCGCAAGTTGCCGCCCTTCATCCTGACCAGGGTCTTCTTTGGGGTCTTTCGGAAACCGCAGCCGCTCGCCGGCAAAAAGGGGGGGATCGCGTTTGTCCAGATCGCGGGCGACCTCGCGCAGGGCGCTATCAAGCCTTGCCGCCCTCTCGTCGAGCCAGGCGTGCGGATCGCCGAGCGAGAAAGCGGGCTTCGGTACGGGGCTGGCCGGTGGGGCAAGCAGCACACTGAGCGCGCGATGGACGCGCGCGGTCGGCACCCAGATACCACCGGATGCCAAAGCATTGGAAAGCGCGCTGTAAGTCGCCATCTCCCAATGGGTGCGATCGATCCCGCCTGCGGTCATGACATGCCGACGCCAGCGGTGCTCGACATGGCCAAGCGGCACGTCATCGGGCAACGCCCTGCGCCAGTCTCCGTCGAGATCGGCGAGGATCGCCATCGCATCGCGCAACGGCTGCATTCCGGCCCGGCCCTGGAAATCAAAAGCGCGCACGAACGACGGCCCCATCCGCTTGAAGGCGCGATATTCCGCCGCGATCTCGTCCAGAACCTCGTTCCGGTGCGGTGAGGCGGTACGTCGGATGATAGCGGCGTCGGCGTCGATGATGTCGAGGGATGCCACGGCATCGACCGCCGCGGCGACATCGCCGCCAGCCCGGGCCGCCTGGCTGATCGCTTCCAGAACCCTGGCGATGCGTAACATCCGTTCGCGGCCCTCGCGGCCGGAAACGGCCACGCGCTGTTCGAGCCGTTTGCGGGCGCGCAGATGGGCCCGCCCCACGAGCGCAATGAACATATCGATCGCCGCATCGGTCAGTGTCGCCTCCAGCTCGCGCAGCGTCGCGAGCAGCACGACCCGCCTGCGGGCCGGGCGCATTTGCTGGAAAGCCTGTGCCGTGTAGCGCACACCCTCCCGGGCGAATTGCCCCAGGCGCGGTTCGTGGCGAACATCAGGGGAGAAAGCCGATATGCCCGTCCCACGAATCAGCGTGATCTTTTCGACGATTTCCAAAAGCGAAGCGGATGCGACACGCGGTTCTGGCTCGCGCAGCCAGGACAGACGGCTCTGCCGGTCATGCACCTTGTCATCGATCAGCGCGTCGAGTTGCTGACGCATCTCATGGCCAAGGCCGCCATCGATCGCGGCCACGAGATCCGTTTCCGCCTGATGTATCGCCTCGGCCGCCATGCGCTCCACGACGCTGACACCCGGGATCACGATCCGACGCGCGCGCATCTCATCCAGAAGCCGGCCGAGCAGCGCGCGTCCGTCGATGATGCTCGCCGCCTCGTTTGTCAGCCATGTCCGCAATTCCACGCGGTGCGGGCGGCTCAGATCGCTGAAGCCGAAGCGCGTCTTGATCGCGGCGAGCTGATCGTAGCGCGTGGGCGTGCGTCTGGCGAAATCAGCTATGACCTTTGCATCCACCCCGACCTGCTCGGCAATATGGTCGAGCATGACCGCAGGCAGCAGTTCCCCGTGGCGCAGATGCCGGCCCGGGTAGCGCAGGCAACAAAGCTGCAGGGCGTAGCCGAGTCGGGTGGCGGGCGTGCGCGCGGTGTCGATCGCGGTCATGTCCTCACCTGACAGACTATAGTGCTTCACGATCGCCGCCTCGTCGTCGGGCAACATCAGCAGGGCAGCGCGCTGCCGCGTCGTCAT
Protein sequences of DBSCAN-SWA_1 >NC_020563|12403:23488|20557_23488_-|WP_007683471.1|transposase|DBSCAN-SWA MTTRQRAALLMLPDDEAAIVKHYSLSGEDMTAIDTARTPATRLGYALQLCCLRYPGRHLRHGELLPAVMLDHIAEQVGVDAKVIADFARRTPTRYDQLAAIKTRFGFSDLSRPHRVELRTWLTNEAASIIDGRALLGRLLDEMRARRIVIPGVSVVERMAAEAIHQAETDLVAAIDGGLGHEMRQQLDALIDDKVHDRQSRLSWLREPEPRVASASLLEIVEKITLIRGTGISAFSPDVRHEPRLGQFAREGVRYTAQAFQQMRPARRRVVLLATLRELEATLTDAAIDMFIALVGRAHLRARKRLEQRVAVSGREGRERMLRIARVLEAISQAARAGGDVAAAVDAVASLDIIDADAAIIRRTASPHRNEVLDEIAAEYRAFKRMGPSFVRAFDFQGRAGMQPLRDAMAILADLDGDWRRALPDDVPLGHVEHRWRRHVMTAGGIDRTHWEMATYSALSNALASGGIWVPTARVHRALSVLLAPPASPVPKPAFSLGDPHAWLDERAARLDSALREVARDLDKRDPPLFAGERLRFPKDPKEDPGQDEGRQLALTCYGMVPATRITDVLSQVQRWTGFIQHFGHVSTGLPPADERAFLATLIAEATNLGLSRMAEVCGVASRRALLRMQTWHMREETFRAALASLTDAIHAEPLAAWFGSGHRASADGQAYYLGGAGEAGGTVNAHYGRDPVVKIYTTITDRYAPLHQTVIAGTAGEAIHALDGILGHESSADITALHTDGGGVSDIVFAVMHLLGLDFEPRIPRLSDRQLYGFEPARRYGRLAPLFGRRLGRDLIVSHWAEIAEVIAAMRDRTVTPSLILKKLSAYRQQNSLAAALREVGRIERTLFTLRWFDDTDLRRTVTAELNKGEARNSLARAVAFHRLGRFRDRGLENQQTRAAALNLVTAAIILFNCRYLGRAVDELRHRGTPVDPAMLSRLSPLGWDRINLTGDYIWSESLDLDADGLMPLLIKPLP >NC_020563|12403:23488|17087_17396_-|WP_007682056.1|DBSCAN-SWA MLSMRFSWHPAKAESNLRKHGISFETAASAFADPFALSVHDRIEGGEQRWQTLGLVEGHLLLLVAHTVAEDDDDGEYVEIIHIISARKADRKERRRYEEERR >NC_020563|12403:23488|13742_14504_+|WP_007682039.1|DBSCAN-SWA MSGLEQRSYIVTGAGGSIGRAAALILARRGANVVVADIVEAAAEETVRLIRAEGGNAVYSHCDVSIEDNAWSTVELAITEFGRLDGAFNNAGLPPVNVPLHQLTSADFQRSLDINVLGVFHCMKHQIAAMLKAGSGGSIVNTASVGGTVAIPAHGEYIAAKHAVIGLTRIAAVDYGQQGIRVNAVLPGAIRTPMLMSVIENEPSHAEFLKTAHPIGRYGEPAEIGETAAWLLSDASSFVTGAAIAADGGYTAI >NC_020563|12403:23488|12403_13168_-|WP_001389365.1|transposase|DBSCAN-SWA MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >NC_020563|12403:23488|15162_15456_-|WP_145907151.1|DBSCAN-SWA MGLDVSQRSTSICVLDPDGKTVVEGQAKTNPFEIGRYLQKRLEGPLKLGMDRILALPWAAQERRLVEKRLEAQGRLTCTVRRSAHLSRRSFVPLALG >NC_020563|12403:23488|14549_15218_+|WP_081440685.1|DBSCAN-SWA MAVFARRVGMTPQAFHDHWRYPHGSLGQGIRIIKRYVQSHQVDVDFLDADQNRYEGIAEVWFDSEADAAAFVTDPDYVAYALKDEPNFVDMDNFFAVFAREEVLKSRAEALRGDEAAYNWSIDRRPNFAKLTQVVFEEGGTAWAGDDDFALGERLGAFRHTRCHPIEKLVLPGPSVIGVRELYWPTRTLAEQGIAAAGDAFHALFSRAPKAQTIFSISERYV >NC_020563|12403:23488|15638_16313_+|WP_007682041.1|transposase|DBSCAN-SWA MREIVEAIFYLLRAGCPWRLLPDSFPPWRTVYQWFCTLRDDGVFESLNHHLVRIDRIRTGREPAPSAAVIDSQSVKTTEAGGPRGYDAGKKTMGRKRHAMVDTDGRALIILVHPADVQDRDGAVPLLQQSHQRHPFVARAYADSAYNSDRVRDATSITIEIVRKFADQTGFVVHPRRWIVERTFAWINRNRRLAKDFERTIKSATALLYAAAAIVLIRRIARYP >NC_020563|12403:23488|18032_18797_+|WP_001389365.1|transposase|DBSCAN-SWA MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >NC_020563|12403:23488|17427_17988_-|WP_029987071.1|DBSCAN-SWA MFATEPVDLQRDALTAAGVDQRHLYSDKASGARDDRAGLKACLDYMNAGDTLVVWKLDRLGRSLPHLLAIVTDLKERGIAFRSLTEQMDTTTPHGELLFSLFGALAQYERALTRERIMAGLAAAKRRGRQGGRPPTIAAEKMQLIVAALSNGASKASVSRTFSVPRSTLTDTLERIGWSAPVKDTP >NC_020563|12403:23488|19932_20187_-|WP_007682112.1|DBSCAN-SWA MLKLHDFCNRAGARILWCTPVFGQAVGTQHIDEILAVWYPTHKTFLDLSDAPGAKESYRLRGACVAYAVIHRCSGSNSPLDGNG >NC_020563|12403:23488|18956_19847_-|WP_015460628.1|DBSCAN-SWA MSLGAKPFGEKKFIEIKGRRMAYIDEGTGDPILFQHGNPTSSYLWRNIMPHCAGLGRLIACDLIGMGDSDKLDPSGPERYAYAEHRDYLDALWEALDLGDRVVLVVHDWGSVLGFDWARRHRERVQGIAYMEAVTMPLEWADFPEQDRDLFQAFRSQAGEELVLQDNVFVEQVLPGLILRPLSEAEMAAYREPFLAAGEARRPTLSWPRQIPIAGTPADVVAIVRDYAGWLSESPIPKLFINAEPGSLTTGRIRDFCRTWPNQTEITVAGAHFIQEDSPDEIGAAIAAFVRRLRPA >NC_020563|12403:23488|16774_17107_-|WP_007682049.1|DBSCAN-SWA MKKKDADTVRFQLDPGNLPPLTEAQKAELDALQAMPDSGIDYSDAPTLTEDFWKTAERGRFYKPIKQQVTARLDADVLAWLKSQGKGYQARMNAILRREMLAAAKERRHA |
12 | Escherichia_phage(28.57%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 6641
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_020542|0:6641|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NC_020542|0:6641|2233_3100_-|WP_015449216.1|DBSCAN-SWA MPLAASHRAKAIVEALGGTWRGTRGECRCPAHDDHGPSLSVRLGERAILFHCFAGCDTRDVLTALRRRKLHDAVPLTMPRPKAMADHRALALRLWKASQPIAGSPAADYLAARGLAPPYPRCLRYNPRTIVGAGDQRRFFPAMIAAVENDLGVVAVQRTCLDLADILHKPLSKPKIALGLLGNAAIRLAPAGEELGLAEGIEDALSAMAWFGTPTWALGGVERLGLVAIPERVKRIIVYGDRGAAAAAMLKKARPHLTAHGRELVLRLPERHADWNDAWRVRRAAEAA >NC_020542|0:6641|3162_3489_-|WP_015449217.1|DBSCAN-SWA MINFAKFEIIGRIGEIDARPKVTLLSVCANHRRKGDDDEWQEDSHWNRVSVFSEGQRKHIADRAQVGDLVRIAGRLKDSSYERDGVTHYTTDRIVEEFGILAAKGMPG >NC_020542|0:6641|5438_6641_+|WP_015449219.1|DBSCAN-SWA MAASLDIEGTQDLLSVSTLAQRTSSVLERLRDSARSARADERREPTFPIGKAAELVGRTAAAIREAEKDGRLPPPPRTENNRRVGYTLAQLNDMRGLFGTRPWRAATDPCCVIAVQNFKGGVGKSTLSVHLAQYLAIKGYRVALIDCDSQASATTLFGYVPDLDLTEEDTLYPFLRHDDMEALDYALRKTHFDGLELVPANLRLFQSEYEIAARMARGQGNLIDRMAQGIASIADRFDVVVLDPPPALGAISLSVLRAANALVVPVPPTVMDFSSTAAFLAMLDETIETLADRGLAPSLQFLRFVASKVDENKSMQKELLNLMRTLFGHAIVRTPLKDSAEIDNATARLMTVYELDGPVTSSAVRNRCLAYLDGVNSEIEVDIRSMWPSHLARLRKEGLA >NC_020542|0:6641|4155_5244_+|WP_041865596.1|DBSCAN-SWA MRVAAALQAKGGDEFAKPGSIVEVKFVKGQSLSLTASRLLALMILTAGGDAWEDRPHKMRKADIRRGHKGNERISDMLEELHRTLFAVDDKSWRGKKATLRFSLISSSREEVEDEEGAEAGWIEWEFTPEARKLIQESETYAVLNRQAVLGFRSTYALKLYEIGALRLHRRQSLWKGDMTALRALLGIAPDVYKDFAQLRRKVLEKAKAEIDQLAHFRVEWREIRQGRTVTEIEFRFEPKDAPEQIATVDEIARHSAGRKARREDEVETVAVEAVTQAAVAALVSKDKTGAGEVTFPNGTIRFGSDTLAAIGRSAGGGWDIDLIADAYRAQMGERLAKLKGAKLIASWTGFCESFLARRGRP |
4 | Pseudomonas_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
12205 : 18440
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_020542|12205:18440|DBSCAN-SWA GTCAGTTGCCGGGTCCGATGACGAGGCAATCGCGCGCGGCCTGAACGATGTCAGCGGTGACCTCCTCGACGCGGTTGAGGGTCATGATCCGCCGCATCTGCGCGAACAGGCGCTCCATGAGCCGGAAGTTGCCGCGCGTGATGCGGATCACGGCTGCCTGCGCTTCGATCGCGTCGAGTTGGGCCGGGTCGAAGCTGATCCCGAAATCGCCGGCGTGGGTCGCGAGCAGCAGCCGCATTTCGGTCTCGGTAAGCGGTTTGAACTCGTGGACGAAGCCGATCCGCGAGTAGAGCTGGGCATAGCGGGCGAGGCGCTTCTCCAAGCCCGGCATACCCATCAGGATCAGCCCGAAGCCGTGGCGATCGGCCATGTCGCGGAGATGTTCGAGCGACTTTATGGTCAGGCGGTCGGCCTCGTCGACGATGACGAGGGGGCAGGCGATGCGGGCGGCGTCATGGGTGATTTCGTCCTGCGAGCCGCCCGCGACCGTCAGCCGGGCATAGCCCAGCTTAATGAGGTTGAGGCCCAACACCGCGTCGATCGTCTTGGGCGTGTTGCTGACCGACACGGTGTAGAAGACGGCGCGGCAGCGAGCGACCTTTTCGCCAAGGAGCGCGGCAATGGGACGGAGCGCGGCATATTCCCCCAGGTCGGGGAAGCTGGAAAACTCGCGCGCCGATCGCGTCTTTCCGACGCCCGGCCGGCCATGACACACGCCGATATAGCGGTAGTGCGCGCAGGCTTCGGCAAACTCGACGAACCGGGCATGTTCGCGGGTCGCCAGGAACGGCTGCTCGACCAGGAACTCAATCGTCAGCGGCGTAGAGCCGGAGCCCTCGGTAGGTGGTGGGCCGGGAAGCCGTATCCTCTTGGTCGCCATCGAAGGTCTCCGTGGGGGCAGGCACCTGCTTGTCGCGCTCCTTCGCGCCGCGCCGGGCGCGCTGGATCGCGCGCAACGACGGGTGCCCAGCGTGCTCGGAGCTGAGCGCACGGCAGACGAACCGGCCCTGGTGGTAGACGTGGATCTCGGTCAGGTCGCGGGGGTCATAGACCGCCTCGACCTGCTCGCCGACGAAGGCCGCGAGCGTGGGTTCGACATAGCGCCTGCCCATCAGACGGATGCCGTCGCGCAGCACTTTACGGGGCTTGGGCACGTGCACGAGCAGCATGTCGAGCTGTTCAAGGCTGTCGGGCATTGCGGGCAGGAACCCGCCCTTCTGCCAGCGCGTGATCGGCGGTTCGCCGGTGCTGCCATGCGGGCGACGATGATAGACGCCGCACACGAACGCCTCGAAGCGGGCGCGCAGCTCGTCGAGCGTGAGCACTGGCGCCGACAGCGGCTTGCCCGCGATCAGGTGGCCGGGCAGGTCGGGCAGGAACATGTCGTTGATGGTGCGGAACAGCCGCTCGATCTTGCCGCGCCCGCGAGGACGCCCCGGCAGCGAATGGATGAGGCGGATTTTGAGGGCGATGCACGCCTGCTCGATATGCTCGGAGATGAAATCCGAGCCGTTGTCGACGTAAAGCTGTTCGGGAATGCCGCTGACGATCCATTCGGGATTGGGCTTGCGCCAGATCGCCTGCCGCAAGGCGAGCGCCGTGTTGAGGGCACTGGGGGCATCGAGGCTGAGGAAGTAACCGGCGATGGCTCGGCTGTGATCGTCAACGATGACGGTCAGCCAAGGACGCACCGGGGTTCCGGCATCATCGAGCACGAGAATGTCGAGAACGGTATGATCGGCCTGCCACATCTCGTTCGAGGTCGCGGCTTCGCGCCGATGCACTAGCTCGTGCTGGTCGCGGTAGACGGCCGGATCGGAGGCTGCGGCGATCTGGCTTGCCGGTATCGCCCTGACGACACGCGCGACGGCCGCATAGCTGGGGGTTCGGTGTCCATGCGCGATGGCGAGTTCCTGCACCTTTCGGTGAATGGCGGCGACCGGGGGTCGCGGGCGCTTGGTGGCGAGAGTGCGGGTCAGTTCGACCAGATGTTCCGGCAGGTGCAGCCTGCCCCGATCGTTGCGCGGCAGACGCGCGAGACCGGCAAGTCCTTCGGCACGGTAGCGCCCGAGCCAGCGCTGCAACGCCCGCTCGCTCAGCGTGCCGGTGCGGGCGAGGTCCGCGAGCGGGATGCCATCGGCGAGGTGCGGTTCAAGGACGCGGTAGCGCTCGATCGCCAGCGGCGGGACAAGCGCCGGATGCGTCACCGCCGACGGTCCGTGTCGCAGGTAAGGAAAACGGAGATTTGCCTGCCCATCGCCATGCGAGTCCGCTCGCGTTGCCAAACCGGCCTGTTCGACGTAAATCTACCCGGTAAAGAAAACTAGGGCAACATAGACCTGCCGATGACGACTTTCTTCCCAAACCGCGCCCGATCGGGGCGCCACCGGCCCTACGCATGAAAATCGGTTACGCCCGCGTATCGACCGCCGAGCAGAACCTGGACCTCCAGCGTGATGCGCTGAAGGCTGCCGGCTGCGAGAAGGTCATCACCGACAAGGCCTCCGGGGCGACTGCCGCTCGCCCTGGATTGGAAAAGGTGAAGGAGCTGCTTCGCGCCGGCGACACACTGGTGGTCTGGCGCCTCGGCAGGCTCGGCCGCTCGCTCCGTGATCTGATCGGATGGATGACCTACCTCGACGAGGAAAAGGTCGGGCTGCTGAGCCTGCACGAGGCGATCGACACGACCACCACGTCGGGCAAGCTTACCTTCCACCTGTTCGGGGCATTGGCGGAGTTCGAGCGCAACCTGATCCGCGAGCGGACCCAGGCCGGTCTCACCGCAGCCCGCGCTCGCGGCAAGAAGGGTGGCCGGCCGGCCGCACTCGGCAAGGACAAGCGCGACCTGCCCGTCAGGCTCTACCACGAGAACACGATGCCGATCGCCAAGATTTGCTCGATGCTCGGCATCTCCAAGCCAAAACTCTACGCCTATGTGCGATCGGCCGAGACCAAGCCGGTAGCCGCGTAGCGACGCTCGCCGACAAATAGCGCGATCAGAATGCCGCCACTTAGCGCGAGCGTTTACACCGTTCGCGGTCCCGATCGGCCCAGCGTCCCCCGGAACCGTTCGTTATCGGGAAGGCATCTAAGCGGTCTGCACCATCCTTCATCTAAAGTTCTCCAAGCGCTTCTGAGGATTTTCGGGCGCACCAAAAGCACTTGCTCCTCCAGTGGCTGGAGCATGTAACCGCAATTTGCTCTCCTGCGGACGGGGGGCTGGAGGTGGCTTTTTGACGGCCCGTGTTGAAGCAATCGGTATGTCTCGACGCTCATTAGTGGCGCGGCTGGTGGCCTGTGGAACCGGCTTGGTGCTCGGTTCTCCAACGCTCGCGCGACAGACTTCCCGGTCTGCGGGCTCGTGGAACTTCGGTGCGGCTCACCTCGAATGGGAGCCGCTGGAAGTCGGCACGGGTGATACCCTCCTTGTTTCGGCACAAGTGCGCGGAGTGCCGGTGCGGGCGGTGCTCGACAGTGGCAGCGGCGCGTCGATCATGAGCACGGCGCTCGCGGCGAAGCTCGGCTTGAACGATGGTGAGCGGCGCATGATTAGCGGGCTGAGCGCCAAGGCCCCGGTGCTGCTGGTCCGCGACATCGACGTACAGCTCGCCCGCGAAACCCGTCGCTTACCCTTTGCCGTCGTTGGTGATCTGAGTTCAGTGTCGGCGGCCTTCGGACGACCGATCGATATCCTCCTCGGTGCCGATATGTTCACGGGTAGCTGCATCGCGCTCGATTTCGCGAAAAGGCGTATGGCGGTCGTCAAGTCAGGCACGTTTCTCGCCGGTCCCGACTGGCGCGCTGTCGCGCTCGGGCGCGGTGCCAAGCAGGAGCTGTTCATTCGAGCTTCCGTCTCGGGTTTGCCTCCCGTGCCCTTGATGATCGATGTGGAGCGGCGTGCAAAATTGACCCCCTTAGCGGGGTGATCGGCGTCTAAAATTGACCCCCATCATTCCAGAGTGTGAGGCGTCGCGGCTTGGGCTTCCAGGCGGCGAGGACGGGGATGTTGATTGTGGAGACTATTGCCAAGATACGCAGGGAGCACAGGGACGGTAAGCCGATCAAAGAGATTGCGCGTGATTTACGGTTGTCGCGCAACACGGTGCGCAAGGCGATCCGTGCTCCGGAGGCGGATTTCAGCTACGAGAGGAAGGAGCAGCATCGTCCGCAGACCGGTCCATTTCGCGAACGGTTAGATGAGTTGCTGGCGGAGAACGAAGAGCGCCCCCGGCGCGAGCGACTGCGGCTGACGCGGATTCATGATCTGCTGGAACGTGAGGGGTTCACCGGCTCCTACGATGCGGTGCGGCGCTATGCGGCCCGCTGGAAGCAGGAGCGCCACGCCGGTGGCAGCGGGGATATGAGCAAGGTGTTCATCCCGCTCATGTTCCGGCCTGGCGAGGCCTACCAGTTTGATTGGAGCCACGAGGACGTGGAGATCGCCGGCAAGCCGATGCGGGTTAAGGTGGCGCATATGCGGCTATGCTGGTCGCGGGCGCCCTTTGTGCGGGCTTATCCGCGTGAGACCCAGGAGATGGTGTTTGACGCCCATGCCAGGGGCTTTGCTTTTCTCGGCGGGGTGCCGACGCGCGGCATCTACGACAACATGAAGACCGCGGTGACGACGGTGTTCACTGGCAAGGAGCGGGTGTTCAACCGGCGCTTCCTGATCATGACGGATCATTATGGCGTTGAGCCGGTGGCCTGTAGCCCGGCGGCAGGCTGGGAGAAGGGACAGGTCGAGAACCAGGTCCAGACCGGCAGGGAACGGCTGTTCAAGCCACGTCTGCGGTTTGCCAGCATGGAAGAGTTGAACGCATGGCTGGAGGCCGAGTGTCGCCGATGGGCCGAGCGCTATGCCCATCCGGATATGGAAGATATGACCATCGCCCAGGCACTGGAGATGGAACGACCCTCCCTACAGCCGCTCACCACGCCTTTTGACGGCTTCTTCGAGAGCGAACATGTGGCGAGCTCGACCTGCCTCGTCAGCTTCGATCGCAACCGTTACTCGGTCATGGCCGTTGCTGCCCGGCATGCGGTGCAACTGCGCGCCTATGCCGACCGGGTCGTCATCCGTTGTGCCGGCAAGGTGGTCGCCGAGCATGCCCGCCTGTTCGGCCGCAATCAGACGAAGTTCGATCCCTGGCACTATCTGCCGGTCCTGATCCGCAAGCCAGGCGCATTGCGCAACGGCGCTCCCTTCCAGGACTGGGATCTTCCGCCGGCCCTGGCCCAGCTGCGCCGCAAGCTGGGCAAAAGCGATGACGCAGACCGACGCTTTGTACGGGTACTGGCAGCGGTGCCCGAGGATGGCCTGGAGGCAGTCGAAGCTGCCGTGCGCGAAGCCATGGCGGCGGGCACGGCCAATGACGAGGTCATTCTCAACATCCTGTCGCGCCGACGCGAACCACAGCCTGTGCAGGCGATCAATGTTGTCGTCGATCTCAGGCTCAAGCATCCGCCCATTGCCGATTGCGCGCGCTACGATACGGTGCGAGGCCTCAATGCAGCGGCATGAGATGTTGGCAGCCCTCAAGGGGCTGGGCCTGAAGGGCATGATCGCCGCGTTCGACGATGCCGTCACCAATGGCATCCGCCGTGACCGGACCGCCATGGAGATGCTTGGCGATCTGCTACGCGCCGAAACGGCCCACCGTGAAGCCGCCTCGATCCGGTATCGCATGACTGCGGCCAGGCTGCCGGCCATCAAGGATCTCGACGGCTTTGTCTTCGCCGACACACCGATCAACGAGAGCCTGGTGCGTTCGCTCCATGCCGGCTCGTTCCTGCCGGAACGGCGCAATATCGTGCTGGTTGGTGGCACCGGCACCGGCAAGACGCATCTCGCGCTCGCCATCACCGCTGCGGTGGTCCGCGCCGGGGCCAGGGGCCGGTTCTTCAATACCGTCGATCTGGTCAATCGTCTGGAGGAAGAAACCCGGCAGGCCAAGGCCGGCAGCCTGGCCGCCCAGATGGCCCGCCTGGACGTCGTGGTTCTGGACGAGCTCGGGTATCTGCCGTTCGCCCGGTCAGGAGGCCAGATGCTGTTCCATCTGATCAGCAAACTCTACGAAAAGACCTCGGTGATCATCACCACCAATCTCGCCTTCGGCGAATGGCCTAGCGTCTTCCAGGATGCCAAAATGACGACGGCGCTGCTGGACCGTGTCACGCATCATTGCGACATCATCGAAACCGGCAACGACAGCTGGCGGTTCAAAAACCGAAGCTAA
Protein sequences of DBSCAN-SWA_2 >NC_020542|12205:18440|13010_14411_-|WP_015449225.1|integrase,transposase|DBSCAN-SWA MTHPALVPPLAIERYRVLEPHLADGIPLADLARTGTLSERALQRWLGRYRAEGLAGLARLPRNDRGRLHLPEHLVELTRTLATKRPRPPVAAIHRKVQELAIAHGHRTPSYAAVARVVRAIPASQIAAASDPAVYRDQHELVHRREAATSNEMWQADHTVLDILVLDDAGTPVRPWLTVIVDDHSRAIAGYFLSLDAPSALNTALALRQAIWRKPNPEWIVSGIPEQLYVDNGSDFISEHIEQACIALKIRLIHSLPGRPRGRGKIERLFRTINDMFLPDLPGHLIAGKPLSAPVLTLDELRARFEAFVCGVYHRRPHGSTGEPPITRWQKGGFLPAMPDSLEQLDMLLVHVPKPRKVLRDGIRLMGRRYVEPTLAAFVGEQVEAVYDPRDLTEIHVYHQGRFVCRALSSEHAGHPSLRAIQRARRGAKERDKQVPAPTETFDGDQEDTASRPTTYRGLRLYAADD >NC_020542|12205:18440|14602_15175_+|WP_015449226.1|DBSCAN-SWA MKIGYARVSTAEQNLDLQRDALKAAGCEKVITDKASGATAARPGLEKVKELLRAGDTLVVWRLGRLGRSLRDLIGWMTYLDEEKVGLLSLHEAIDTTTTSGKLTFHLFGALAEFERNLIRERTQAGLTAARARGKKGGRPAALGKDKRDLPVRLYHENTMPIAKICSMLGISKPKLYAYVRSAETKPVAA >NC_020542|12205:18440|12205_13084_-|WP_009824028.1|DBSCAN-SWA MATKRIRLPGPPPTEGSGSTPLTIEFLVEQPFLATREHARFVEFAEACAHYRYIGVCHGRPGVGKTRSAREFSSFPDLGEYAALRPIAALLGEKVARCRAVFYTVSVSNTPKTIDAVLGLNLIKLGYARLTVAGGSQDEITHDAARIACPLVIVDEADRLTIKSLEHLRDMADRHGFGLILMGMPGLEKRLARYAQLYSRIGFVHEFKPLTETEMRLLLATHAGDFGISFDPAQLDAIEAQAAVIRITRGNFRLMERLFAQMRRIMTLNRVEEVTADIVQAARDCLVIGPGN >NC_020542|12205:18440|17711_18440_+|WP_015449229.1|DBSCAN-SWA MQRHEMLAALKGLGLKGMIAAFDDAVTNGIRRDRTAMEMLGDLLRAETAHREAASIRYRMTAARLPAIKDLDGFVFADTPINESLVRSLHAGSFLPERRNIVLVGGTGTGKTHLALAITAAVVRAGARGRFFNTVDLVNRLEEETRQAKAGSLAAQMARLDVVVLDELGYLPFARSGGQMLFHLISKLYEKTSVIITTNLAFGEWPSVFQDAKMTTALLDRVTHHCDIIETGNDSWRFKNRS >NC_020542|12205:18440|16207_17725_+|WP_015449228.1|transposase|DBSCAN-SWA MLIVETIAKIRREHRDGKPIKEIARDLRLSRNTVRKAIRAPEADFSYERKEQHRPQTGPFRERLDELLAENEERPRRERLRLTRIHDLLEREGFTGSYDAVRRYAARWKQERHAGGSGDMSKVFIPLMFRPGEAYQFDWSHEDVEIAGKPMRVKVAHMRLCWSRAPFVRAYPRETQEMVFDAHARGFAFLGGVPTRGIYDNMKTAVTTVFTGKERVFNRRFLIMTDHYGVEPVACSPAAGWEKGQVENQVQTGRERLFKPRLRFASMEELNAWLEAECRRWAERYAHPDMEDMTIAQALEMERPSLQPLTTPFDGFFESEHVASSTCLVSFDRNRYSVMAVAARHAVQLRAYADRVVIRCAGKVVAEHARLFGRNQTKFDPWHYLPVLIRKPGALRNGAPFQDWDLPPALAQLRRKLGKSDDADRRFVRVLAAVPEDGLEAVEAAVREAMAAGTANDEVILNILSRRREPQPVQAINVVVDLRLKHPPIADCARYDTVRGLNAAA |
5 | Burkholderia_phage(20.0%) | integrase,transposase | attL 6711:6726|attR 27718:27733 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
31125 : 40571
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_020542|31125:40571|DBSCAN-SWA CATGAACGACAAGCTCGAACTCGACATTCCAGTGTTGTTGCCGGACCTTCCTGACGCGGCCGATGCCTGCCTGGACCGTCTCGTATCAACCCTGTCAAAGCGCGAAGGTGTCGAGCGGGCGCATGTGATCTGTCTCGACGGCAACACGCCAGCGGCGCTGTGCATCCATTTCGATGCCGCCAAGCTGCCGCTGCCACGGTTGCGCGAAATGGTGCGCGCCGCAGGTGCCGAAATCACCGAGCGCTACGGCCATGCGATCTGGCAGGTGACGGGGATCAACCACGAACGTCGGGCGCGCACGGTCAGCGATGCGCTGTGCGCCTTGCCCGGCGTGGTCCAGGCAAGCGCCAGCACCAGCGGGTCTGTCCGGGTCGAATATGATCGCCGCGAAACCACCGAAGAGGAGGTGCGCGCTGCGCTTCGCAAGCTGAAGGTGAGGGTGGGCAAGCGGGTCGCTGCCGATGAACATGCCGGCCACGACCACGGCCCCGGGGGCCACGGCGCGGGCGAAGAGAAGCACGGCCCCGGCGATGGCCACGACCATAGCCACGCCGAATTTCTCGGCCCCAATACCGAGCTGATCTTCGCGCTCGCCTGTGGCGCGCTGCTGGGGATCGGCTACGCGATCGAGAGGCTGGTCGCTGGCGCGCCCGAATGGCTGCCGACCGCCTGCTATATCGCAGCCTATTTCTTCGGCGGATTCTTCACGCTGCGCGAGGCGATCGACAACCTCCGGATGAAGAAGTTCGAGATCGACACGCTGATGTTGGTTGCTGCGGCCGGCGCCGCTGCGCTGGGCGCATGGGCCGAAGGTGCGCTGCTGCTGTTCCTGTTCAGCCTCGGCCATGCGCTTGAGCATTATGCAATGGGCCGCGCCAAGAAGGCGATCGAGGCGCTGGCGAAGCTCGCGCCCGAAACCGCGACCGTGCGACGCGGCGGGCAGACCAGCGAAATCCCGGTCGAGCAGATGGTCGTCGGGGACATTGCCATCGTCCGCCCGAACGAGCGCCTGCCTGCCGACGGCTTCGTCATCAAGGGCACCAGCGCGATCAACCAGGCCCCGGTCACGGGTGAAAGCATCCCGGTCGACAAGGTGCCGGTCGCAGATGCCGCAGCGGCACGCGCCAAGCCCGATGCAGTCGACGCGGAAAGCCGGGTTTTTGCTGGTACGATCAACGGCGGCGGCGCGATCGAGATCGAGGTGACACGCCGTTCCAATGAAAGCGCGCTTGCCAAGGTCGTGAAGATGGTGAGCGAAGCGGAGACGCAGAAGTCGCCGACGCAGCGCTTCACCGACCGGTTCGAACGGATCTTCGTGCCCGCCGTCCTGGTCCTGTCAGTGCTCCTGCTGTTCGCATGGGTGGTCGTCGACGAGCCGTTCCGCGACAGCTTCTATCGCGCGATGGCGGTGCTGGTGGCGGCAAGCCCGTGCGCGCTCGCGATCGCAACGCCGAGCGCGGTCCTGTCGGGCGTTGCCCGTGCAGCGCGGGGCGGCGTGCTCGTGAAGGGCGGTGCACCGCTCGAAAACCTCGGGTCGCTCAAAGCGATCGCTTTCGACAAGACGGGCACGCTGACCGAAGGTCGTCCGCGCATCACTGATGTCGTGCCCGTCGATGGCGCCGATGAAGGCGAGTTGCTGGCGCTGGCTGTCGCCGTCGAAGCGTTGAGCGACCATCCCCTGGCCCAAGCGATCGTCAAGGACGGCCGCGAACGTCTGAACGATCGTGCCGTGCCGACTGCCGGCGACCTCAAGAGCCTGACCGGTCGGGGCGTCACCGCGAGCGTGGATGGCGAGACGGTGTGGATCGGCAAGGCCGAGATGTTCGGAAGTGAGGGCATTCCGGCGCTGGGGCAGGGCGCGCAGGACGCAATCGCGAAGCTGCGCGAGAACGGCCGCACGACAATGGTGGTCCGCAAGGGGGACCGCGACCTGGGCGCGATCGGCCTGATGGACACGCCTCGCGAAGCTGCCAAGACCGCGTTGCGTCGGCTGCACGAGATGGGCGTGTCGCGGATGATAATGATCTCCGGCGATCACCAGAAAGTCGCCGAAGCGATCGCCGACGAAGTCGGGATCGACGAAGCGTGGGGCGACCTCATGCCCGAAGACAAGGTTGCCGCGATCAAGAAGCTCGCCGGCGAAGACAAGGTCGCGATGGTGGGCGACGGCGTGAACGATGCGCCGGCGATGGCAAGCGCCACGGTCGGCATCGCGATGGGCGCGGCGGGGTCGGACGTTGCGCTGGAGACAGCCGACGTGGCGCTGATGGCCGACGATCTGGCGCACCTGCCATTCGCAGTCGGTCTTAGCCGCCATACCCGTGGAATCATCCGGCAGAACGTCTTCGTCAGCCTGGGTGTCGTCGCCTTCCTGGTGCCTGCTACCATCCTCGGCCTCGGCATTGGGCCAGCGGTGGCGGTTCATGAGGGATCGACGCTGCTCGTCGTCATCAATGCGCTCAGGCTGCTCGCCTACCGCGATCCGGCAGGGAAGGCGGCATGAAGTGGCTGATCGCCTTCGACCTCGACGGCACGCTTGCCGAGAGCAAGCGGCCGTTATCGGAGGATATGGCCGCAATCCTCGCCCGATTGCTCGCCATCACGGATGTGGCGGTGATCTCGGGCGGGGACTGGCCGCAGTTCGAAAAGCAGATCGCCTCGCGCCTTCCTGCCGGCGTCGCGCTCGACCGTCTCTGGTTGATGCCGACCACAGGCACGAAACTCTATCGGTTCATCAATGGCGCTTGGCGCGCGGTCTATGCCGAACTGTTCGACGACGCGGAGAAGGCGAAGATCCGCACGGCCTTCGATCAGGCGCTGACCGATGCCGGTCTCGCCGACGAACGTATCTGGGGCGAACGGATCGAGGACCGGGGCAGCCAGATCACCTTTTCGGGCCTGGCGTCACCTCAAGCTTTGTACGCTGAACAACGATTTTCCGGACATAATGTGAACATTGGTGTCAGGCAGCGGCGCGAAGGTACTGGTAGAGGGTTTCGCGGCTGATGCCCATGTCGCGCGCGATGGTCGCCTTGCGTTCACCGGCGGCAACGCGGCGGTGCAGATCGGCAATCATCTCATCCGAGAGCGACCGTTTCCGCCCCCGATAGGCGCCGCGCTGCCGAGCGATCGCAATGCCTTCGCGTTGCCGTTCGCGGATCAGGGCGCGCTCGAACTCAGCAAACGCACCCATGACCGAGAGCATCAGATTGGCCATCGGGGAGTCCTCCCCCGAGAAGGACAGGCTTTCCTTCTCAAACTCGATCCGGATACCTCGCTTGGTGAGGCCCTGGACCAGCTTGCGCAGGTCATCCAGATTGCGGGCCAACCGATCCATGCTGTGGACGACGACGGTATCGCCTTCGCGGGCGAAAGTGAGCAGAGCCTCAAGCTGGGGGCGGTTGACGTCCTTGCCCGATGCCTTGTCGGTGAAGGTCCGATCGAGCGACTGACCTTCCAATTGGCGATCCACATTCTGATCGAACGTGCTGACCCGGACATAGCCGATCCGTTGCCCCTTCACTGCGCACTCTCTCCATCAAAATGTCAGGTTGAAATCTATGATACGCAACGACATTCGTCAACAAATTCCGTTTAGTACCCTAATCTGACAGAAACCGCGGTAGCGCCCTGACGTCCGGTTAGGCTATACTCCAAGCTGACATCACAAAATCAAATCCCTCAAAGATGGCGTCAATTGCCTTGATGGGGTAAAAACTGGAGTGCTTCCAGGATGGGACATTCGGCAACGTCATCTCCAGTGCATCTGGCTAACGTGGTCGCCAGCGTCACCTCTATCTTCTGCAAATCTGCAATGCGGCGGCGGACATCTTCAAGGTGGAGTTCGGTTCTCGCCATAACCTCCGCGCAGGTTTGTGCACCGGTATCGGTGAGCGACAACAATGACCGTATCTCATCCATTGCATAGCCAAGGTCGCGCGCGCGCAGGATGAACTGCAACCTTTGCACCAGTTCCGGCGAATAGACGCGGTAGCCGTTGGAACTGCGAGGCGGCCCCGGCAGCAATCCCACCTTCTCGTAATAGCGGATGGTTTCGAGATTGCAGCCTGTTTTCCGGGCAAGCTGGGCACGCAAGATGCCGACCTGTTGCTCCATGAAAATACCTCTTGAGTCTGTAGTGGCTACAGAGCCTACACTGCGAATCACTCAGTTGGAACAAGGGATTCAAGCCATGGTCTCAACGCCGGAAGCGGGACAGCCAGCCCTCACCGAAAACCACGAGCCGAAGCAGGCAAACTGGGTTGCGGCGGGCGCATTGATCGGCGCGGGGCTCGCCTCGGCTTGCTGCGTCGTCCCGCTACTGTTGGTTATGCTCGGAATTTCCGGCGCGTGGATCGCCAACCTGACGGCGCTCGAACCTTACAAGCCCTATGTCGCAGGCGTAACGCTCGCGCTGCTCGGCTACGGCTTCTGGCATGTCTATTTCAAGCCGAAACCGCCCTGTGAAGACGGTTCCTACTGCGCTCGTCCACAATCGGCTTGGACCACCAAGGCGGTGCTGTGGCTGGGCCTTGCCGTCGCCATCCTGGCGCTCACCATCGACTGGTGGGCACCCTGGTTCTATTGAAAGGAAATACCCATGAAAAAGACAGTATGTATCGCTATGGCCGTGCTGGCTATGGCTGGCGGCGGGGTCGCCTATGCAGTTAGTGGCACGGCGCAAGATCGCCCCGCGGCTACCGCAACCGCTCAGAAGCAAACCACCTTTGCCATAGAGAACATGACCTGCGCCACCTGCCCGATCACCGTGAAGAAGGCGATGGAAGGCGTCGCCGGGGTAACGGCGGTCACGGTCGATTTCGCAGCCAAGACCGCGCGCGCAACCTATAACCCGCGCCGCACCAATGCTGCTGCGATTGCCGCTGCATCAACCAACGCGGGTTATCCGGCGCGCGCTATTCAAAACTGAGCGGGGCGCCGCCAGCGCCTCAAGAAAGCGATAAGCAATGAACGACTGTTGCAACCGCCCCGGGCAGGAAGGATTTGACGTGGCCGTCATCGGCGCGGGTTCTGCCGGGTTCTCCGCCGCGATCGCCGCTGCCGATCTGGGCGCGAAAGTGGCGCTCGTCGGTCACGGCACGATTGGCGGCACCTGTGTCAATGTTGGCTGCGTTCCTTCCAAGACGCTGATCCGCGCCGCCGAAGCGGTGCATGGTGGGCTTGCCGCCGCGCGCTTTCCCGGCCTTGGGGGCGCTGTCCAGATGGATGACTGGTCCGTATTGGCGGCGTCGAAGGACGATCTTGTCACGACGCTGCGCCAAAAAAAATATGTCGATCTGCTGCCCGCCTATGACGGTGTGAGCTATATCGAGGGCAAGGCACGCTTTGCCGATGGTGCGCTGATTGTCGGCGATGCGCCCATGAAGGTCGGCAAGGTCATATTGGCGATGGGTGCGCACGCCGCCGTGCCGCCGATTCCGGGGATGGACAGCGTGCCCTACCTAACCAGCACATCGGCGCTGGCGCTGGATCGCTTGCCCAAATCGCTGCTGGTGATCGGTGGCGGGGTGATCGGGGTAGAACTGGGACAGATGTTTTCGCGTCTGGGCGTTGATGTCACCATCTGCTGCCGAAGCCGCCTGCTCCCCGAAATGGACCCGGAAGTGAGTGCCGCGCTGAAAAACTATTTGGAAGCCGAGGGCGTGCGGGTTTGCGCAGGTGTCGGCTATCAGCGTATCGCCCAAACACAGAGCGGGGTCGAATTGACCTGCGAGGGGCATTGTGACACCGTCGCGGCGGAACAGGTGCTCATCGCCACCGGACGCCGACCCAATAGCGACGGGCTTGGGCTGGAGGAGCGCGGCATAGTGCTTGCCCGTAATGGTGGAATCGTCGTCGATGACCACCTCGAAACGTCGGTTCCAGGCATTTACGCGGCGGGCGATGTAACGGGACGGGACCAGTTCGTCTACATGGCCGCCTATGGCGCAAAACTGGCCGCGCGCAACGCGGTGACGGGCAACCAATACCGCTACGACAATTCCTCCATGCCATCCGTCGTCTTCACCGACCCGCAAGTCGCCAGCGCCGGCCTCACTGAAACGACAGCACGGGCGCAAGGCCTGGACATCAAGGTTTCGCTGCTCCCGCTCGATGCTGTGCCAAGGGCACTGGCAGCACGCGATACGCGGGGTCTCATCAAACTGATTGCCGACAAGGCGAACGACCGTTTGCTGGGCGGCCAGATCATGGCACCCGAAGGCGCGGATTCGATCCAGACACTGGTGCTGGCGATCAAACACGGCATGACCACCCTGGAATTGGGTGCAACGATATTTCCTTACCTGACCACTGTGGAAGGCCTCAAACTGGCGGCCCAAACGTTTGATAAGGATGTCGCCAAACTGTCTTGCTGCGCCGGGTGATTGCTCGGGGATCAACCGGCAACGAATGGCCTCCCTCCAGTGCAAGCCGCTTTGGTGTCAATTTTGTTATTCGCCTCAGCGGGAGAGCAAAATGGCACGACGCCGACTTCTGACCGGAGACGAGCGCCGGCGCCTGTTCGATCCTCCTGTCCAAGAAACCGCGATCATTGGGCATTATACCCTTTCTGCGGAAGATGTTGAATTGGTTGGGCGCCGCTATGGTCCAGCAAATCGCCTCGGTCTGGCTGCACAAATCGCTTTGATGCGACATCCCGGCTTTGGTCTGCAACCCGAGATCGGGCTTCCCGACGTCATTCTTCAGTACCTCGCGGCACAGTTATTCGTCGATCCTTCCTCCTTCTCTGCATATGGTCAGCGCGCGCAAACCCGTACCGATCATGCCGATCTCGTGGCGCGTTATCTTGGCATACGCCCGTTTCGACGCGGCGACCTGGCACTTGCCCTGAATCTTGCCGCGCAAGCCGCCGAGTATACAGACAGAGGTGAACCGATTGTTCGCGCCCTCATGGTTGGCCTGAAGGGTGAGCGGTTCATTCTTCCGTCAGGCGACACACTGGAACGCGCCGGTCTTGCTGGCCGGGCACGCGCACGCAAAGCTGCCGCAGCCGCAATCGTCGAAGGCCTCAGCTCTGCTAAACTGACACGGCTAGACGAACTCGTAATCAACAACCCGGATTTCGGCATGACACCGCTGGCGTGGTTGCGTAATTTCGAAGAAGCCCCGACTGCGGCCAATATCAATGGCTTGCTTGAGCGCCTGCGCTATGTTCGCGGCATAGGTATCCACCCGGTAGTTGGGGGCGCCATTCCGGAATTCCGCTTTGCCCAATTTGTCCGCGAGGGCGGCGTGGCACCGGCATTCCTGCTTTCGGATTACAGCGTCAATCGCAGGCGGGCGACGTTGACGGCCGCAGTGATCGACCTTGAGGCCAGACTTGCCGATGCCGCGATCCAAATGTTTGACCGACTTATCGGCGGCATGTTCACGCGCGCGCGGCGTGGGCGCGAGCGTCGCTACCAAGATAGTATTCAGTCGGTGGGGCAACTCATGCGGCTGTTTGGCGCCACGATTACAGCACTTGATGAGGCTGTCCAGAATGGCGGCGATCCGCTCGAATTGATTGACGAAGCGGTGGGCTGGCACCGGCTTGTTGCGGCAAAGGCCCAAGTAGATGCCCTTGCTGATCTTGCCGGCGAGGACGCACTGGTAACGGCAACCGAGCGTTACGCCACGCTACGGCGTTTCAGCCCGGCATTTCTGGACGCCTTCACCTTCAAGGCGTCTGGAACAGGGACGGCACTGATCAAAGCCATCGATGTCATTCGCGATGCGAACACACGAAAGTCGCGCGACTTTCCCGATGGCGTTCCACTGCCATTCCCCAATCGGCAGTGGAAGCGTCTCATCACCGAAAGCGGCCGTATCGACCGCCGACGTTATGAAATTGCGATCATGGCAACCTTGCGTGATCGTTTGCGCGCCGGTGATGTATGGATCGAGGGAACCCGCAACTATCAGCGCTTCGATGCCTATTTGCTGGGTCGGCGCGACGCCGCCAAAGTGGCGGATGTGCTTCCGTTCGATTCAAATGCTGCATCCTACCTCGCTGACCGGGCACGAAATCTTGACTGGCGGCTGCGCCGATTTGCCAAGCAGTTGAAAACAAACAAGCTTGAGGGAGTGTCGCTCGAACGAGACCGGCTCAAGCTTCAGCAAATGCCGCCTGTCACCCCACCGGAAGCTGAAGCCCTCGATCGCAAGCTCGATACCCTGCTTCCCCGCGTGCGCATCACCGAGCTGCTGCTTGAAGTCGCCGAACGCACTGGTTTTTTGAACGCATTCCGTGACCTGCGCTCAGGCAAGGAGCACGACAACCCCAGCACGGTACTCGCCGCAATTCTGGCTGATGGCACCAACCTCGGGCTGGAGCGGATGGCCAATGCCAGCGAAGGCGTCAGCTATGCCCAACTCGCATGGACCCACAACTGGTATCTTTCACCCGAGAACTATCAGGCCGCGCTGGCCATGATCATCTCAGCCCATCACGAATTACCCTTCGCGCGGCATTGGGGCGCTGGCACCAGTTCGTCGTCCGATGGCCAGTTCTTCCGGTCGGGGCGGAGCCGTTCAGGGGCGGCGGACGTCAATGCCAAATATGGCGCCGAACCTGGCGTGAAAATCTATTCTCACCTTTCCGATCACTTCGCATCATTCGGATCACGAATCATGTCCGCGACGGCAGGTGAAGCGCCTTACGTGCTCGACGGGCTTGTCCTGGGCGCCGGCAACCTTCCGTTGCATGAGCACTATACCGATACCGGCGGCGCCACCGATCATGTTTTCGCACTCTGCCACCTACTCGGGTTCCGCTTCGCGCCCCGGCTGCGCGATATTGGCGACCGCAAGCTGGGTTCGATCGCTGCGCCATCGACATACAAGGGCATCGAAAATCTGATGGGCCGCACCATCAAAACGGCAGCGATCGAGGCCGATTGGGATGACATCGTCAGGATTGTCGCCTCAATCAAGGATGGCACGGTGGCGCCGTCAGTAATCTTGCGAAAACTTGCCGCCTACAAACGCCAGAACAGGCTGGATTTTGCATTGGCTGAACTGGGCCGTATCGAGCGCACTTTGTTCACGCTCGATTGGCTTGAACAACCGGAACTGCGACGTGCCTGTCAGGCCGGTCTCAACAAAGGCGAGGCGAGGCACACGCTTGCCGCCGCCATCTATACCAACCGGCAGGGTCGGTTCACCGATCGCTCGCTGGAAAATCAGGAATTTCGCGCATCTGGGCTGAACCTGCTGATTGCGGCGATTTCCTACTGGAACACGGTCTATCTCGACCGGGCCGCCCAGCACCTCAACGCTGTCGGCACGACGTTCGATGCGGCACTGCTTGCGCACCTTTCTCCGATGGGCTGGGCGCACATCAGTCTGACCGGCGATTACCTCTGGGAGCAGGCCAGGCGACTTCCAGCAGGTGAATTCCACCCACTCAACGAGCCAATGGCGCGGTTGAAGCGTGTAGCGTAG
Protein sequences of DBSCAN-SWA_3 >NC_020542|31125:40571|35724_36054_+|WP_004213249.1|DBSCAN-SWA MKKTVCIAMAVLAMAGGGVAYAVSGTAQDRPAATATAQKQTTFAIENMTCATCPITVKKAMEGVAGVTAVTVDFAAKTARATYNPRRTNAAAIAAASTNAGYPARAIQN >NC_020542|31125:40571|31125_33627_+|WP_004212886.1|DBSCAN-SWA MNDKLELDIPVLLPDLPDAADACLDRLVSTLSKREGVERAHVICLDGNTPAALCIHFDAAKLPLPRLREMVRAAGAEITERYGHAIWQVTGINHERRARTVSDALCALPGVVQASASTSGSVRVEYDRRETTEEEVRAALRKLKVRVGKRVAADEHAGHDHGPGGHGAGEEKHGPGDGHDHSHAEFLGPNTELIFALACGALLGIGYAIERLVAGAPEWLPTACYIAAYFFGGFFTLREAIDNLRMKKFEIDTLMLVAAAGAAALGAWAEGALLLFLFSLGHALEHYAMGRAKKAIEALAKLAPETATVRRGGQTSEIPVEQMVVGDIAIVRPNERLPADGFVIKGTSAINQAPVTGESIPVDKVPVADAAAARAKPDAVDAESRVFAGTINGGGAIEIEVTRRSNESALAKVVKMVSEAETQKSPTQRFTDRFERIFVPAVLVLSVLLLFAWVVVDEPFRDSFYRAMAVLVAASPCALAIATPSAVLSGVARAARGGVLVKGGAPLENLGSLKAIAFDKTGTLTEGRPRITDVVPVDGADEGELLALAVAVEALSDHPLAQAIVKDGRERLNDRAVPTAGDLKSLTGRGVTASVDGETVWIGKAEMFGSEGIPALGQGAQDAIAKLRENGRTTMVVRKGDRDLGAIGLMDTPREAAKTALRRLHEMGVSRMIMISGDHQKVAEAIADEVGIDEAWGDLMPEDKVAAIKKLAGEDKVAMVGDGVNDAPAMASATVGIAMGAAGSDVALETADVALMADDLAHLPFAVGLSRHTRGIIRQNVFVSLGVVAFLVPATILGLGIGPAVAVHEGSTLLVVINALRLLAYRDPAGKAA >NC_020542|31125:40571|36091_37510_+|WP_004213247.1|DBSCAN-SWA MNDCCNRPGQEGFDVAVIGAGSAGFSAAIAAADLGAKVALVGHGTIGGTCVNVGCVPSKTLIRAAEAVHGGLAAARFPGLGGAVQMDDWSVLAASKDDLVTTLRQKKYVDLLPAYDGVSYIEGKARFADGALIVGDAPMKVGKVILAMGAHAAVPPIPGMDSVPYLTSTSALALDRLPKSLLVIGGGVIGVELGQMFSRLGVDVTICCRSRLLPEMDPEVSAALKNYLEAEGVRVCAGVGYQRIAQTQSGVELTCEGHCDTVAAEQVLIATGRRPNSDGLGLEERGIVLARNGGIVVDDHLETSVPGIYAAGDVTGRDQFVYMAAYGAKLAARNAVTGNQYRYDNSSMPSVVFTDPQVASAGLTETTARAQGLDIKVSLLPLDAVPRALAARDTRGLIKLIADKANDRLLGGQIMAPEGADSIQTLVLAIKHGMTTLELGATIFPYLTTVEGLKLAAQTFDKDVAKLSCCAG >NC_020542|31125:40571|35316_35712_+|WP_014072602.1|DBSCAN-SWA MVSTPEAGQPALTENHEPKQANWVAAGALIGAGLASACCVVPLLLVMLGISGAWIANLTALEPYKPYVAGVTLALLGYGFWHVYFKPKPPCEDGSYCARPQSAWTTKAVLWLGLAVAILALTIDWWAPWFY >NC_020542|31125:40571|37601_40571_+|WP_015449233.1|transposase|DBSCAN-SWA MARRRLLTGDERRRLFDPPVQETAIIGHYTLSAEDVELVGRRYGPANRLGLAAQIALMRHPGFGLQPEIGLPDVILQYLAAQLFVDPSSFSAYGQRAQTRTDHADLVARYLGIRPFRRGDLALALNLAAQAAEYTDRGEPIVRALMVGLKGERFILPSGDTLERAGLAGRARARKAAAAAIVEGLSSAKLTRLDELVINNPDFGMTPLAWLRNFEEAPTAANINGLLERLRYVRGIGIHPVVGGAIPEFRFAQFVREGGVAPAFLLSDYSVNRRRATLTAAVIDLEARLADAAIQMFDRLIGGMFTRARRGRERRYQDSIQSVGQLMRLFGATITALDEAVQNGGDPLELIDEAVGWHRLVAAKAQVDALADLAGEDALVTATERYATLRRFSPAFLDAFTFKASGTGTALIKAIDVIRDANTRKSRDFPDGVPLPFPNRQWKRLITESGRIDRRRYEIAIMATLRDRLRAGDVWIEGTRNYQRFDAYLLGRRDAAKVADVLPFDSNAASYLADRARNLDWRLRRFAKQLKTNKLEGVSLERDRLKLQQMPPVTPPEAEALDRKLDTLLPRVRITELLLEVAERTGFLNAFRDLRSGKEHDNPSTVLAAILADGTNLGLERMANASEGVSYAQLAWTHNWYLSPENYQAALAMIISAHHELPFARHWGAGTSSSSDGQFFRSGRSRSGAADVNAKYGAEPGVKIYSHLSDHFASFGSRIMSATAGEAPYVLDGLVLGAGNLPLHEHYTDTGGATDHVFALCHLLGFRFAPRLRDIGDRKLGSIAAPSTYKGIENLMGRTIKTAAIEADWDDIVRIVASIKDGTVAPSVILRKLAAYKRQNRLDFALAELGRIERTLFTLDWLEQPELRRACQAGLNKGEARHTLAAAIYTNRQGRFTDRSLENQEFRASGLNLLIAAISYWNTVYLDRAAQHLNAVGTTFDAALLAHLSPMGWAHISLTGDYLWEQARRLPAGEFHPLNEPMARLKRVA >NC_020542|31125:40571|34817_35240_-|WP_014072603.1|DBSCAN-SWA MEQQVGILRAQLARKTGCNLETIRYYEKVGLLPGPPRSSNGYRVYSPELVQRLQFILRARDLGYAMDEIRSLLSLTDTGAQTCAEVMARTELHLEDVRRRIADLQKIEVTLATTLARCTGDDVAECPILEALQFLPHQGN >NC_020542|31125:40571|34086_34647_-|WP_004213255.1|DBSCAN-SWA MKGQRIGYVRVSTFDQNVDRQLEGQSLDRTFTDKASGKDVNRPQLEALLTFAREGDTVVVHSMDRLARNLDDLRKLVQGLTKRGIRIEFEKESLSFSGEDSPMANLMLSVMGAFAEFERALIRERQREGIAIARQRGAYRGRKRSLSDEMIADLHRRVAAGERKATIARDMGISRETLYQYLRAAA |
7 | Salmonella_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
43987 : 54331
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_020542|43987:54331|DBSCAN-SWA GTCAGTCATCTTGTTCTCCAATCCGGGCGCGAATGAAGCGGACGCGCTCGGGAACCGGGGCGCGGGGGAGGGCTACCAGCTCGTAGCGAAGGCTGGCATAGACCGTGACCATTGCCTGATAGGTGGCCTCGGCTTCCTCCGGGCTTTGTTTCCTCTCCGCATCCCCGACGAATATCTCGGGCCAAGGCGGCGCGATGAACACCCGGCGCTGATAGCGGAACAGGTCGGCGGCGCGATCGACAGCGGCCGGGACGGGCAAACTGCACAGGTGCAGATAGCCGACAACATCGGGCACGCCACGATCGAAGATGACCGGCCCCGCGCGATCATGGGCCTCATGCCAGGACCGCAACTCCCAGCCCAGCATCAATTCCGCGAAGGCCGCGCGATCGGCCCAGGGGAGGGCGGTGCCGCCGATTGCGAGCTGATCGCGGATGATGGCGCGGCCGGCTTCCGGCATGTGGTCGAAACCTTCGGCCGCCAGCGCTGCAATGAGGGTGCTTTTGCCGGAGCCAGGGCCGCCGGTGATGATATGAAAATCGGGCAGATCAGCCATTCTCCTCGCCTTCCGCTTCCGCGTCGAAGGCGCGTTTGCCCCGCCGCTTCCAGAGCACCAGCCTGTCGCGATCCTCGCGCGCCTGCGCGGCGAGTTCGAGCATCGCGCCGTAGAGAGTGGCGCGATCGTCGTCGGCCAGTTCGACCAGGCCGGCTTTCTGGACGAGGCCGCCCAGCTCTATGAGATGGCGGGTGCGTTCGCGGCGGGCCACGACCCATCCTCTCGTATCGGTGCGGCGCTGGGCGGCCTCAATCCTGCGTCTCGCCTGCGCCAGTCTGGTTTCCGCCTTGTGCGTTGCCGCCAGCGCGTCGGGAAGCCTTGCGCCCGCGTCCCTGAAAGAAGGCCGCGCCCTTCACGCGCCACGCCTCCCTTTCGTCCGCGCTGGCGCTTTCGGCAGCGGCGAGCAGCGCGCCCGCCAGCGTGTCGAGGTCGAGCGCATCGGCCCCGGTGGACGTGACCAGCTCGCCGAGCTGCTGGACCTTCTTCACCTTGAGCGCGCGCGCCTTGTCATTGAGCGCCCGCAGCTCCGCGTCATAGTCGCGCACCTTCCGCATCATCTCTCCCTCGCGCCCATGAACCAGAGCCGCAGCCTAGCACGATCGTGCCGGAAAGGTAGTGCCCATGCTAGAAAATGCGGCCAAGTCAATCATGCGAACGGCAGAGAGTGTGAGTGCGCGCTTATACGTCGTTGCCGACGTGCGCTCAGAAAGGCAGTATAGCGGCCGTCATGGCGATCTACCATTTCTCGGCCAAGGTCATCAGTCGCGCCAACGGATCGAGCGCCGTTGCCAGTGCAGCCTATCGTGCGGCTGAGCGGCTGCACGATGACCGGCTCGGGCGCGATCATGATTTCTCGAACAAGGCCGGCGTCGTCCATTCGGAAATCATGTTGCCCGAAGGTGCGCCAGAGCGTTTAAACGATCGCGCGACCTTGTGGAATGAGGTTGAGGCCGGCGAGAAGCGGAAGGACGCTCAGCTTGCCCGCGAGGTCGAGTTCTCGATTCCGCGCGAGTTGAACCAGCAACAGGGCATCCAGCTTGCCCGCGATTTTGTCGAAAAGCAGTTCGTGGAACGCGGCATGGTCGCGGACATGAATGTGCATTGGGACATGGGGAAGGACGGCCAGCCCAAGCCGCACGCGCATGTCATGCTGTCGATGCGCGAGGTAGGGCCGGAGGGCTTCGGCCAGAAGGTGCGCGAGTGGAACAGCACGGCGCTCTTGCAGGAGTGGCGCGAGGCGTGGGCCGATCATGTCAACGAGCGGTTGGCCGAGCTGGATATTGATGCCCGCATAGACCATCGGACGTTGGAGGCGCAGGGCATCGACCTTGAGCCGCAACACAAGATCGGGCCGGCGGCGTCGCGTATGCCCGAACAGGGGCTTGAGGCCGAGCGGGTCGAGGACCATGCCCGGATCGCGCGCGAGAATGGCGAAAAGATCATCGCCAATCCGCAAATCGCGCTCGATGCTATCACGCGGCAACAGGCGACGTTCACGCGGCGCGACCTGGCGCAGTTCGCGTTCCGCCACAGCGACGGCAAGGATCAGTTCGACCAGGTGATGAGCGCGGTGCGAACCTCGCCCGAGCTGGTGGCGCTGGGGAAGGACTGGCGCGGCGAGGACCGCTTTACTTCCCGCGATATGATCGAGACGGAGCAGCGGTTGGAGCGCGCCGGCGATCGGCTTGCCGATCGGACGGGGCATGGTCTCCCGGCAACGTCGCAACCAAGGGGGCTGGACGCTGCCGGGAGTGGTGGCCTCGCGCTGGGGGACCAGCAACGGGACGCGCTGGCGCATATCACCGGCAAGAACGACCTGGCAATCGTGGTCGGCTATGCCGGCACCGGCAAATCGACCATGCTGGGCGTGGCGCGCGACGAATGGGAGCGGGCGGGCTATCAGGTGCGGGGCGCGGCCTTGTCGGGGATCGCGGCCGAGGGGCTTGAGGGCGGTTCCGGCATCGCCTCGCGCACGATCGCGAGCATGGAATATCAGTGGGACCAGGGCCGCGAGCTGCTAGGCCCGCGCGACGTGCTGGTGATCGACGAGGCCGGCATGATCGGCACGCGCCAGATGGAGCGCGTATTGTCTGAGGCCGAGAGGGCAGGGGCGAAGGTGGTCCTTGTCGGCGACGCCGAGCAGTTGCAGGCGATCGAGGCCGGCGCTGCGTTCCGCTCGCTTGCCGAGCGCCACGGCGCGGCTGAGATCAACGAGGTTCGCCGGCAGCATGAGGACTGGCAGCGTGACGCCACGCGGGCGCTGGCGACGGGCCGCACCGGCGAGGCGATCCATGCCTATGCCGAGCACGGCATGGTCCACGCGGCCGAGACGCGCGAGGCGGCGCGGGCCGAGCTGATCGACACATGGGACGCGCAGCGCCGCGCCGATCCGGACAAGAGCCGGATCATCCTCACCCACACCAATGCCGAGGTGCGCGATTTGAACCTGGCTGCGCGCGATCGGCTGCGCGACGCCGGCGACCTGGGGCAGAACGTGCGCGTGTCGGCCGAGCGGGGCGCGCGCGACTTCGCGGCGGGCGACCGCATCATGTTCCTGAAGAACGAGCGCGGGCTAGGGGTGAAGAACGGCACGCTGGGGCAGGTCGAGCGGGTATCGCCCGACAGCATGGCCGTTCGCCTTGACGATGGCCGGAGCGTCGCGTTCGACCTGAAAGATTATGCCCATGTCGATCATGGCTATGCTGCGACCATCCACAAATCGCAGGGCGTGACGGTCGATCAGGGGCATGTGCTGGCGACGCCGGGGATGGACCGCCATTCGGCCTATGTGGCGCTGTCGCGGCACCGTGACGGGGTGCAGCTCCATTACGGCCGTGACGACTTCGCCGACGATCGCCGGCTTGTCCGCACGCTGGGGCGCGAGCGGGCGAAGGACATGGCATCGGACTATCCGATGTTCCGTGATCCCGATCAGGAGATTCGTGCTTTCGCTGACCGGCGCGGGCTGTCGGGCGAGATCCGCGTTGCGGATGCGCCCGAGCGTCAGGGCGTGGAGCTTCTTGGTCCGCGCGCGGGCACCATGCGCCAGATGGGCGAAGATCCACGCACGCGCGAGATCGGCCACCGTGATGCCAGCGGCCGCGGCGCTGGCGACGCGAAGGCCGCGGCCGAGCGACAGCCGCGGCGGGGTATGTTCGACGGCTTGAAGCTGTCGGCCGACCCGGCGAAGGCGGCCGAGCCAGCACAGGGCGCGAAGGAAAAGGCGGCTCCCAAGCGTGGCATGTTCGACGGGCTGAAACTGCCGTCGCGCACGCCAGCGCCGACGCCGGCGAAGGAAATCCCAGCGCGCGCTGAGCAGGGCCAGGACCGCGATTTCCGGCGTGCGGTCGAGCGCGCCTCGCGTTCGGCCGAAGCGGTGTTGCAGGCGCGGGCATCGGGCGGGCCGGTGCTGGAGCATCAGAAGGTCGCGCTCGAGCGCGCAACCGAGGCGCTGGACCAGGCCAGGCCGGGAGCCTCGCAGGATATGGCAGCAGCGATGAAGCGCGATCCCGCCTTGCTGCGCGACGCGGCGGCGGGGCGCAGCGGCCCGATGATCGAGGCGATGCGCCAAGAGGCGCGGGTGCGGGCCGATCCGAACTTGCGCGCCGACAGGTTCGTGGAACGCTGGCAACAACTCTCCCAGGACCGCGACCGCCTCTATCGCGCTGGCGACACGACAGGCCGCGATCGGGCAGGCAAGGAAATGGCGGGCATGGCGAAAAGCCTTGAGCGCGATCCTCAGGTGGAATCGATCCTGCGCAATCGTACGCGCGAGCTGGGGCTTGAGATCGGCATGGGGCGGGGCCGGGACATGGGCGGGGGCGATCTAGGTCGCCAGTTGACGCATGAACTCGGGATCGGCCGCGATCATGGCTTAAGCCGGTAAGGAGAAACGAGAATGGGCGATGATGATGGTTTCGACGACGCCGACGATCCGGCGCGGGCGTTCGCGCGGGTCGAGGATCGGTTGGCGTCGGTGCATGGCGAGGTGGCGTTGCTGCGCGCCGCGATCGAGGGACTGACCGCCGCGCGCGAGAATATCGAGATTCCCGATTATGAACCGACGCTGGAACGCACCGAAAAGATATTGGTGGCGCTCGCCCAGCGGATCGACCCCATTGCCAAAAGCCCGCTTCTGTCGATGACGCCCGATTCTATGGCGAGCCAGATCGCGACGGCGGCTACGGCCGCTCGCCGTGAGGACGCGCGGCTTGTCGCCGAGGCGCGCGCGGGGCTGGACCAGGCCGCGCGCGAGATCGGTAATCGGCTCGCATCGGCACGGCACGGCGACGAGCAAAATCGCTGGTTGTATGTGATGGGCGCTTGCGGCGTGGTGCTGGGGCTGTTGCTCTATGCCTTCCTTGCCGGTCCGATCGCGCGGTGGACGCCCGATAGCTGGCGTTGGCCCGAGCGCATGGCAATCCGCGTCCTCAATGAGCCTGGCCCTTGGGGGGCGGGGCAGCGGTTGATGCAGGCGGCCGATGCGGAGAGCTGGGCGCTGATCGTCGCGGCCTCGCCGCTGACGGATGCCAATCGCGAGACGGTGCAGAAATGCCGTGAACGGGCCGCCAAGGCGAAAGAGCCGGTGCGCTGCACGATCGAGGTGAAGCCATAATAGGAAAATGGCTGAATTCAGCAGCAATTCACTCCCAATGGCGAACCATGTCCGTGACGGTTTATGGAGGATACCATGCGTGCTCTCATCACAGCATTATTACTTCTTGGCCCATGCAGCGCTGCATACGGCCAGATTGCCCCGCGCGCGCCCTCTGGTCCTGAACGAATACCCGGCGCCATCATTGATGTGCAAGTGCGCTTTGATCAGCAAGCTGCTGGACGCGAACTCGGCGAGACTCGCAAGCGCATAAAACGCGGGCGGGACAGCGGTGATCTATCGAAGCAAGAAGCTCGCGCGCTGCGTAAGGAAGCGGATCAAATCGGCACATTGGCAGAACGCTATGGTCGGGATGGCATTTCCGATTCCGAACGCCGGGAGTTGGATATGCGTTCCCGAGCACTGGGAGGCTTGACCGAAGCACGGCGCGCGCAAGGAGGTAATAAAGTCCCCTGACTCTCCGAAAATTTTCGGCGAAAAAATTAATCGTCTAACATGATTTCATAAGATGAGCAGAAATGCTCTACCAAGTCTTTCCAATTGTCATTGTAAAGAGCATCTCTAAGCTTGATTAGGGTGCCGCCTCTTGAGATTTCCCAAATCTCTATTATTGGTAGATAGAAGCATCCTTTCACGTTTCGTAACATGATAACGTTGAAGAGAATCCGGGCGGCTCTGCCATTGCCATCTTCGAAGGGATGAATGGCGTTTACACCAGAAAACGCAAGAATAGCTGCCCATAGTGGTGAGGAATCTTCGCATTCCTGAATTGCGCGATCAAGATTTTCAATCAAAGATGGGATCATATGCGGAGGCGGGTATATGCCGCGATTTCCGAACTTGTCCTTGTTGGTGCCTACTTCTTTCGATCGCATCCTTTTCTGCGAGACAGCTATTAATTCGATCGTTGCATCCAAAGCCCCAAGGATTCCCGTGGATTGTGCAGAAGCAACGCTCTGTGCAATGGTTATTTCGGTAGAGGATGTCCGGGATTTTTGGACGCCGCCGTCCATAAAGTTGAGATAGGGACTGAGCGGGTGTTGTAGCTCCTCGACCCTGGATTTGATATCCTGTCGAGCGGAATTATGAATCTTCCGGATTGCGGCCAGGGGATCGGTTCCTCTCTTGTCCGCAAGATCGGCCTCTACCATCCGCAGTCGATCCCGCAATTCAGGCTTCAGGGTCGCCAGCGCTGGCGGCTTGAAGGGGGGCACATAGACATCGCCGATATGGTATTCGGCATTGTTACCCAATGGATCGTTCATTCTAGACCTCAAGCCTGAGGATCGCGGCCTCAGTGGATTTGTGATGGACCATGCCCTGGGATACCGTCACTACTCGATCTGCGGCCTGAATCGACTGCTGCCGATGTGCGATCACGACCCGCGTAATGTTCAGGCGGGAAAGCGCTTCAAGCACTTTTGCCTCATTCTCGGGATCGAGATGTGCCGTGCCCTCGTCCATGAACAATATGGCGGGTCGGGCATAAAGCGCCCTTGCCAACAACACCCGCTGCTTTTGTCCGCCCGATAGCACTGACCCCATGTCTCCGATCAACGAATCATACCGAAGCGGCATCGCATGGATTTCGTCATGAACAGCCGCCAGCCTTGCGACCTCTTCCACCCGTTCCATGTTGATTTCGGGATCGAAGAAAGCAATGTTTTCGGCGATACTCCCGGCGTAAAGCTGGTCATCTTGCGAGACAGACCCGATCGCCTGGCGGTAGACCTGCTTGGAAAAGCTCTTGATCGACCGGCCACCGACGAGAACCTCTCCGTGAAAAGGCTCAAACAGGCCCATTATAACCTTCATCAACGTCGTTTTCCCGCCGCCGGAAGGACCGATCAGCGCAATCATTTCGCCAGGATGGATGGTTAGATTTACGCCTTTGAGGACACTTTGTTCGCCAAACCCGTAACGGAAATGGACGTTGCGAATTTCCAAAGTCTGGCTGAAATCTGGACACCTCGAAAAACTCGTGGCGCCTGACGGCCTGAAGTGATTCACTGCCTTTGCGCGATAGCGGAGGCGATGATGGCAGGGCAGCCGGGTTTCTTCGATCTTTCGGACCGATATGAGGCGTTGAGCGCGGCGGGTGATCCGCTAGAGCGTCTGGCGGCAGTCGTGGACTTCGAGGTTTTCCGGGGACCGCTGGTAGCGGCGCTGCGCCGTAGTCCTCGCGGCAAAGGTGGCCGCCCGCCGTTTGATCCGGTCCTCATGTTCAAGATCCTCGTGCTCCAGGCGCTCTACTCGCTTTCGGACGAAGCGACCGAGTTCCAGATCAAGGACCGCCTCTCTTTCCAGCGTTTTCTGGGGCTTGGCCTGGATGGCACCGTGCCCGATGCGACGACAGTGTGGCTGTTCCGGGAGCGTCTGGTGAAGGCCAAAGCCATCGACAGGCTATTTGCCCGCTTCGATGCCGCTCTCACAGACCGAGGGTACCTTGCCATGGGCGGGCAGATCATCGACGCGACCGTTGTGCCGGCTCCCAAGCAGCGGAACACCGAGGAGGAGAAGGCGGCCATCAAGGAGGGCCGGATACCCGAGCGCTGGAAGGATAACCCTGCGAAGATCCGGCAAAAGGATCGCGACGCCCGCTGGAGCGTCAAGTACACCAAGGCCAAGATCAAGGAAGGCGCGGACCCCAAGGCCTTCAAGCCGGTCGATCTGGCCATCCCGATGTTCGGCTACAAGAACCATATCGGCATCGACCGGGCGCATGGCCTGATCCGCACCTGGGATGCCAGCGCCGCCAATGCCCACGATGGCGCAAGGCTGCCAACGCTGATTAGTCCGGACAACACGGCCGCGGGGGTATGGGCCGATACGGCCTACCGCTCGAAGAAGAACGAAGCCTTTCTCGCCAAGGGCATGTTCACCAGCCACATCCACCAGCGCAAGCCACACCGCCGAGCGATGCCTAAGCGTATTGCACGCGCCAATGCCAAGCGGTCGGCGGTCCGCTCCGCCGTCGAGCACGTCTTTGCCGGGCAAAAGCACCGCATGGGTCTGGTGGTGCGCACCATCGGTATTGCTCGTGCCCGCATCAAGATCGGCATGACCAACCTTGCCTATAACTTCCAGCGCCTGGCCTGGCTCGAGGGGCGAATTGCGCCCGCGTGACGCCAAAGACGGCCCGCGCAGCCGCCTCTCCAGCAAAAATCAGCAGATTACTCACCGAAACCGAGGCTCATATGCCGCTCGTGAGCCTTCACGCCTCCATCATGCCGAAATCAGACGGTTCTTCGAGGTGTCCATCTGGCTGATCGACATTGCCGACCGAGTTGGCGTCTTCCTGCTTCGACAGCGCAATATCGGCGACCCGTCCCAGATGCACCTGCATTATCTTGTAATTCAATAATTGCTCGATAAGTCGCATGGAAGCATCGAGAAATTGTCCTTTATATGCTTGGAACGCAAAGAGCATTCCAAGGGACATATTTCCTGCGAGAACAAGGCTGACGCCCAAATAAACGAATGATATTTGTTCGATGACAAGGACAAGTTGTCCGAGGCTGTCGAATCCTGCTGTAGTGCGACCCAGCTTGATTTGCGCATTGACGGCATCGGCTTTCGTTTTTTGCCAGAGCCTTTGGCGGTTGCCTTCCTGACCGAAGGCCTTGATCGCAGCTATGCCCCGGATGCTTTCGATAAACGCGCTCTGCTCTCGTGCGGCTGTCGCAATTGCATCAATGTTCTTGAACCGAAGTGAACTCAGGAAAGCCAATCGCAAGCTCGCATAAAGCAGGAAGAGCGTTGAGGTGACGAGTGCCAGAAGTGGCGCGTAGACCACCATCAATGTCAGCGTCGCCAACGCCATGATGCCATCGATAAATCCCGTCACCATCCCGTCGCTAACGAGTTGGGTGATGGGCTGCGTGGACCCGAAGCGGGATACGATGTCCCCGATGTGCCGCCGCTCGAACCATTGCAGTGGGAGGCGTAGATGTAGGTGCTTGCAATCGTAGCGGCGTTTGCTTGCAAACAGAGCCTTTTTTTGCTCGGATTATCTGACAACCGGCTCGGCGATCTGCTCGCGCTATCTGCCAATGTGCTCGCCATCATCCTGCGCTGCTCGCAATCATAAAGGTCTGCTCACATCCCCTGCCAGCGAAACAGGGCGTCAGCGCGGCACCCGGATGGAACCTTCCGCGATCCCCTCGGCGACATTGGAGAGCATGACGGCGTAGCGATGCAAACCGGGGAAGCGACCAGCGTTCTTGTCGATTTCACCGCCGTATCCGGCGTCGAGCAACGCCTTGAAGTCGGGCATCACGTCGAACATGCCGCCGAGCAGATCGATATCGGAGGCCCCGGTAAGCAACAGGGCGTTCGCATGGGCGTCGAGACGCTTCATCGTTTCCAGCATTTCGGTCCTGGTCATCGCATCCTGTCCGCTTGCGGGGTTCCATCGCCTAGCACGACGCGCCGGCCCTGTTCGCGAAAACCTTCCTTTCTGACGACCTCGCGGGCGGGCCATGACGACAACGCATTTCTGCGGCCCGCCGCTTGTCAGAACTGTTCTGGAAACCTGCTTCGCAAAAGGGGTGACTGCACGAAAGTGACTTATCTGCACGCTAAGAGCGAACGGACCGGGAACGCTGGGACTTTGTGTTCTTCTCCGTGGCTTCGAGGGCTTTGTAGAGGGCGGTTTTGCCGATCTTGAGGCGCGCGGCGGCCTCGCGAACTGTGAGGCCGGAAGCGATATGTTCGCGCGCCTTGCGGAGCTTGTCAGGTGTGACCACTGGCCGCCGCCCACCGGGGCGACCTCGCTCGCGAGCGGCCTTGAGGCCTGCATGGGTGCGCTCCCGGATCAGATCGCGCTCAAACTGGGCAAGCGAGCCGAAGATGTTGAACACCAGCATCCCGCCCGAAGTGGTGGTGTCGATGTTCTCGGTGAGCGAGCGGAACCCGATACCTCGCGTCGCTAGCTCGCCGACCTTCTCAATCAGATGGCTCATCGAGCGCCCAAGACGGTCGAGTTTCCAAACCACCAGCGTGTCGCCGCTGCGCAGATAGGCGAGCGCCTCGGCCAGGCCGGGCCGATCGGCTTTTGCACCAGATGCGTGATCGTCGAATATCCGGTCGCACCCGGCCGCGTTCAGCGCGTCGAGCTGGAGTGAGAGCTTCTGGTCTGCCGTCGAGACGCGCGCATAGCCGATCAACGCCAC
Protein sequences of DBSCAN-SWA_4 >NC_020542|43987:54331|53737_54331_-|WP_013039114.1|DBSCAN-SWA MALIGYARVSTADQKLSLQLDALNAAGCDRIFDDHASGAKADRPGLAEALAYLRSGDTLVVWKLDRLGRSMSHLIEKVGELATRGIGFRSLTENIDTTTSGGMLVFNIFGSLAQFERDLIRERTHAGLKAARERGRPGGRRPVVTPDKLRKAREHIASGLTVREAAARLKIGKTALYKALEATEKNTKSQRSRSVRS >NC_020542|43987:54331|48440_49157_+|WP_015449241.1|DBSCAN-SWA MGDDDGFDDADDPARAFARVEDRLASVHGEVALLRAAIEGLTAARENIEIPDYEPTLERTEKILVALAQRIDPIAKSPLLSMTPDSMASQIATAATAARREDARLVAEARAGLDQAAREIGNRLASARHGDEQNRWLYVMGACGVVLGLLLYAFLAGPIARWTPDSWRWPERMAIRVLNEPGPWGAGQRLMQAADAESWALIVAASPLTDANRETVQKCRERAAKAKEPVRCTIEVKP >NC_020542|43987:54331|43987_44542_-|WP_015449237.1|DBSCAN-SWA MADLPDFHIITGGPGSGKSTLIAALAAEGFDHMPEAGRAIIRDQLAIGGTALPWADRAAFAELMLGWELRSWHEAHDRAGPVIFDRGVPDVVGYLHLCSLPVPAAVDRAADLFRYQRRVFIAPPWPEIFVGDAERKQSPEEAEATYQAMVTVYASLRYELVALPRAPVPERVRFIRARIGEQDD >NC_020542|43987:54331|49232_49613_+|WP_144062182.1|DBSCAN-SWA MRALITALLLLGPCSAAYGQIAPRAPSGPERIPGAIIDVQVRFDQQAAGRELGETRKRIKRGRDSGDLSKQEARALRKEADQIGTLAERYGRDGISDSERRELDMRSRALGGLTEARRAQGGNKVP >NC_020542|43987:54331|51194_52283_+|WP_015449243.1|transposase|DBSCAN-SWA MAGQPGFFDLSDRYEALSAAGDPLERLAAVVDFEVFRGPLVAALRRSPRGKGGRPPFDPVLMFKILVLQALYSLSDEATEFQIKDRLSFQRFLGLGLDGTVPDATTVWLFRERLVKAKAIDRLFARFDAALTDRGYLAMGGQIIDATVVPAPKQRNTEEEKAAIKEGRIPERWKDNPAKIRQKDRDARWSVKYTKAKIKEGADPKAFKPVDLAIPMFGYKNHIGIDRAHGLIRTWDASAANAHDGARLPTLISPDNTAAGVWADTAYRSKKNEAFLAKGMFTSHIHQRKPHRRAMPKRIARANAKRSAVRSAVEHVFAGQKHRMGLVVRTIGIARARIKIGMTNLAYNFQRLAWLEGRIAPA >NC_020542|43987:54331|49639_50422_-|WP_084673707.1|DBSCAN-SWA MNDPLGNNAEYHIGDVYVPPFKPPALATLKPELRDRLRMVEADLADKRGTDPLAAIRKIHNSARQDIKSRVEELQHPLSPYLNFMDGGVQKSRTSSTEITIAQSVASAQSTGILGALDATIELIAVSQKRMRSKEVGTNKDKFGNRGIYPPPHMIPSLIENLDRAIQECEDSSPLWAAILAFSGVNAIHPFEDGNGRAARILFNVIMLRNVKGCFYLPIIEIWEISRGGTLIKLRDALYNDNWKDLVEHFCSSYEIMLDD >NC_020542|43987:54331|44534_44753_-|WP_015449238.1|DBSCAN-SWA MARRERTRHLIELGGLVQKAGLVELADDDRATLYGAMLELAAQAREDRDRLVLWKRRGKRAFDAEAEGEENG >NC_020542|43987:54331|53283_53529_-|WP_015449414.1|DBSCAN-SWA MLETMKRLDAHANALLLTGASDIDLLGGMFDVMPDFKALLDAGYGGEIDKNAGRFPGLHRYAVMLSNVAEGIAEGSIRVPR >NC_020542|43987:54331|50423_51104_-|WP_144062183.1|DBSCAN-SWA MEIRNVHFRYGFGEQSVLKGVNLTIHPGEMIALIGPSGGGKTTLMKVIMGLFEPFHGEVLVGGRSIKSFSKQVYRQAIGSVSQDDQLYAGSIAENIAFFDPEINMERVEEVARLAAVHDEIHAMPLRYDSLIGDMGSVLSGGQKQRVLLARALYARPAILFMDEGTAHLDPENEAKVLEALSRLNITRVVIAHRQQSIQAADRVVTVSQGMVHHKSTEAAILRLEV >NC_020542|43987:54331|52371_53148_-|WP_144062184.1|DBSCAN-SWA MFASKRRYDCKHLHLRLPLQWFERRHIGDIVSRFGSTQPITQLVSDGMVTGFIDGIMALATLTLMVVYAPLLALVTSTLFLLYASLRLAFLSSLRFKNIDAIATAAREQSAFIESIRGIAAIKAFGQEGNRQRLWQKTKADAVNAQIKLGRTTAGFDSLGQLVLVIEQISFVYLGVSLVLAGNMSLGMLFAFQAYKGQFLDASMRLIEQLLNYKIMQVHLGRVADIALSKQEDANSVGNVDQPDGHLEEPSDFGMMEA >NC_020542|43987:54331|45269_48428_+|WP_015449240.1|DBSCAN-SWA MAIYHFSAKVISRANGSSAVASAAYRAAERLHDDRLGRDHDFSNKAGVVHSEIMLPEGAPERLNDRATLWNEVEAGEKRKDAQLAREVEFSIPRELNQQQGIQLARDFVEKQFVERGMVADMNVHWDMGKDGQPKPHAHVMLSMREVGPEGFGQKVREWNSTALLQEWREAWADHVNERLAELDIDARIDHRTLEAQGIDLEPQHKIGPAASRMPEQGLEAERVEDHARIARENGEKIIANPQIALDAITRQQATFTRRDLAQFAFRHSDGKDQFDQVMSAVRTSPELVALGKDWRGEDRFTSRDMIETEQRLERAGDRLADRTGHGLPATSQPRGLDAAGSGGLALGDQQRDALAHITGKNDLAIVVGYAGTGKSTMLGVARDEWERAGYQVRGAALSGIAAEGLEGGSGIASRTIASMEYQWDQGRELLGPRDVLVIDEAGMIGTRQMERVLSEAERAGAKVVLVGDAEQLQAIEAGAAFRSLAERHGAAEINEVRRQHEDWQRDATRALATGRTGEAIHAYAEHGMVHAAETREAARAELIDTWDAQRRADPDKSRIILTHTNAEVRDLNLAARDRLRDAGDLGQNVRVSAERGARDFAAGDRIMFLKNERGLGVKNGTLGQVERVSPDSMAVRLDDGRSVAFDLKDYAHVDHGYAATIHKSQGVTVDQGHVLATPGMDRHSAYVALSRHRDGVQLHYGRDDFADDRRLVRTLGRERAKDMASDYPMFRDPDQEIRAFADRRGLSGEIRVADAPERQGVELLGPRAGTMRQMGEDPRTREIGHRDASGRGAGDAKAAAERQPRRGMFDGLKLSADPAKAAEPAQGAKEKAAPKRGMFDGLKLPSRTPAPTPAKEIPARAEQGQDRDFRRAVERASRSAEAVLQARASGGPVLEHQKVALERATEALDQARPGASQDMAAAMKRDPALLRDAAAGRSGPMIEAMRQEARVRADPNLRADRFVERWQQLSQDRDRLYRAGDTTGRDRAGKEMAGMAKSLERDPQVESILRNRTRELGLEIGMGRGRDMGGGDLGRQLTHELGIGRDHGLSR >NC_020542|43987:54331|44790_45096_-|WP_015449239.1|DBSCAN-SWA MRKVRDYDAELRALNDKARALKVKKVQQLGELVTSTGADALDLDTLAGALLAAAESASADEREAWRVKGAAFFQGRGRKASRRAGGNAQGGNQTGAGETQD |
12 | Ochrobactrum_phage(20.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
60030 : 62388
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_020542|60030:62388|DBSCAN-SWA TTCACAGTGCTTTCCGGTTGAGGCGCAGCGCGTTGGTGACCACGCTCACCGAAGAGAGCGCCATCGCCGCCCCCGCGATAATGGGCGAGAGCAGAATACCGAACAGCGGATAGAGCGCACCCGCCGCTACGGGCACGCCCGCGACATTGTAGATGAAGGCGAAGACGAGATTCTGGCGGATGTTCGACATGGTCGCTTGGCTGAGCCGCCGCGCCCGGACGATGCCAGTCAGGTCGCCCTTGAGCAGCGTGACGCCAGCGCTCTCGATCGCGACGTCGGTGCCGGAACCCATGGCGATGCCGACGTCGGCGGCGGCCAACGCGGGGGCGTCGTTGACACCGTCGCCGGCCATGGCGACGACCCGCCCCTCGCGCTTGAACCTGGCGACCACGGCGCTCTTCTGATCGGGCAGCACCTCGGCTTCGACCTCGTCGATGCCCAGGCGTCGGGCGACCGCTTCCGCGGTCGTGCGGTTGTCGCCGGTCAGCATGACCACGCGAATGCCCTCCGCCTTGAGCGCGGCGAGCGCTTCTGGTGTCGTGGCCTTCACCGGATCGGCGATCGCGAAGGCGCCGCCGACTGTGCCGTCGACCCCGATGAAGATCGCGGTGGCGCCATCCCGGCGCAGCGCGTCGGCCTGGCTGGCGAGCGCATCGGTCGCAACCCCTTCGTCGGCAAGGAACTGCGCATTGCCGAGGACGATGCGCCGCCCTTCGACCGTGCCGAGCGCGCCGCGCCCGGTCGGCGAGTCGAAGTCGATGACGTCGCTCGTGACGATGCCGCGATCCTTCGCGGCCTCGACGATTGCCAACGCGAGCGGATGCTCGGACGCCCGCTCGACCGAGGCGGCAAGACGCAGCAGCTCGGCTTCGTCGAAGCCGGGCGCCGGCACGACCTGGGTGACGGCAGGCCGGCCTTCGGTCAGCGTGCCTGTCTTGTCGACGACGAGAGTGTCGACCTTCTCCATATGCTCGAGTGCTTCGGCATTTTTGATGAGGACACCGAGCCCGGCGCCGCGACCGACCCCGACCATGATCGACATCGGCGTTGCCAGCCCCAGTGCGCAGGGACAGGCGATGATCAGCACGGCGACCGCCGCCACCAGCCCATAGGCGAAGCGCGGCTCGGGACCCCAGATGCCCCAGGCGATGAACGCGACGGCCGCGACCGCGATGACCACGGGCACAAACCAGCCCGACACCTGATCGGCCATGCGCTGGATCGGCGCGCGCGAGCGCTGCGCCTCAGCGACCATCTGGACGATGCGTGCGAGCATGGTATCGCGGCCGACCTTGTCGGCGACGATCACCAGCGCACCGGTCTGGTTGAGCGTGCCGCCGATCACTGTGTCGGCCTTTGCCTTGGTGACGGGCATGGATTCGCCGGTGACCATCGACTCGTCGAGCGAGGAGCGGCCGTCCTCGACTACGCCGTCGACCGGCACCTTCTCGCCCGGCCGCACCCGCAGGCGGTCGCCCACCGCAACCAGATCAAGGCTGATTTCCTCTTCGTTCCCATCAGAACCGATCCGGCGCGCGGTCTTTGGTGCAAGGTTGAGCAGCGCCTTGATCGCGCCCGAGGTGCGTTCCCGCGCGCGCAGTTCGAGCATCTGGCCGAGCAGCACGAGGACGGTGATCACCGCGGCCGCCTCGAAATAGACGGCAACCATGCCGTCCTCGCCGCGGAAGGCCGGCGGGAACAGCTGAGGCGCGAGCGTCGCGATGACGCTGTAGATCCAGGCGACCCCGGTCCCCATCGCGATCAGGGTGAACATGTTGAGGTTGCGGGTCTTGAGCGAGGCCCAGCCACGCTCGAAGAACGGCCAGCCCGCCCAGAGCACGACAGGCGTCGCCAGCACGAACTGGATCCATACCGAGGTGGACATCGGCACGAGGCGATGGATCGCGGGAAAGACATGCGCGCCCATCTCGAGGATCAGCACCGGGAGAGCCAGCACGAGGCCGACCCAGAAGCGCCGCGTGAAATCGACTAGCTCGTGGCTGGGGCCGCTGTCGGCGGTCACGGTCGCGGGTTCGAGCGCCATGCCGCAGATCGGACAGGACCCGGGATGGTCCTGGCGGATCTCGGGATGCATCGGGCAGGTCCAGATCGTGCCTTCGGGCGCCGCGACCGGCGGCGTCGGCGGGCCGAGATAGCGCTCGGGATCGGCGATGAATTTGGTCCGGCAGCCCGCGCTGCAGAAATGATAGCTTTCACCGCTATGCTCGGCGTGATGCGCCGTGGTCGCCGGGTCGACGGTCATGCCGCAGACCGGGTCCTTGACGCCGGTCGCCGCTTTAGCGGTGCCGTGGCCGCCGCAGCAACCGCCGCCATGCGCCGCTCCATGTGTCTCGTTCAT
Protein sequences of DBSCAN-SWA_5 >NC_020542|60030:62388|60030_62388_-|WP_015449247.1|DBSCAN-SWA MNETHGAAHGGGCCGGHGTAKAATGVKDPVCGMTVDPATTAHHAEHSGESYHFCSAGCRTKFIADPERYLGPPTPPVAAPEGTIWTCPMHPEIRQDHPGSCPICGMALEPATVTADSGPSHELVDFTRRFWVGLVLALPVLILEMGAHVFPAIHRLVPMSTSVWIQFVLATPVVLWAGWPFFERGWASLKTRNLNMFTLIAMGTGVAWIYSVIATLAPQLFPPAFRGEDGMVAVYFEAAAVITVLVLLGQMLELRARERTSGAIKALLNLAPKTARRIGSDGNEEEISLDLVAVGDRLRVRPGEKVPVDGVVEDGRSSLDESMVTGESMPVTKAKADTVIGGTLNQTGALVIVADKVGRDTMLARIVQMVAEAQRSRAPIQRMADQVSGWFVPVVIAVAAVAFIAWGIWGPEPRFAYGLVAAVAVLIIACPCALGLATPMSIMVGVGRGAGLGVLIKNAEALEHMEKVDTLVVDKTGTLTEGRPAVTQVVPAPGFDEAELLRLAASVERASEHPLALAIVEAAKDRGIVTSDVIDFDSPTGRGALGTVEGRRIVLGNAQFLADEGVATDALASQADALRRDGATAIFIGVDGTVGGAFAIADPVKATTPEALAALKAEGIRVVMLTGDNRTTAEAVARRLGIDEVEAEVLPDQKSAVVARFKREGRVVAMAGDGVNDAPALAAADVGIAMGSGTDVAIESAGVTLLKGDLTGIVRARRLSQATMSNIRQNLVFAFIYNVAGVPVAAGALYPLFGILLSPIIAGAAMALSSVSVVTNALRLNRKAL |
1 | uncultured_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
68812 : 70880
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_020542|68812:70880|DBSCAN-SWA CATGGACCATTCGTCCCACATGAACACCCGGAAGATGGGTGCCTATTGGAGCCTTGCCGTTCAGACCGTCATCAGCGGCGTCATCATGTATCTGGTGATGTTCGTCATGATCGACGGGCTCGACAGCTTCTACAACAACCTCAACATGCTCTACATGACGCTGATGATGGTCGCACCGATGGTGGTGCTGATGATCCTCGCCATGCGACACATGTTCGTGTCGAAGGCCGCCAACATCGCGCTGATCGCCGCGTCGCTGGTCGCCTTCTTCGGCAGCTTTGCACTCATCCGCACCCAGACCACGATCGGCGACACGGCCTTTCTGCGCTCGATGATCCCGCATCATTCCGGGGCGATCCTGATGTGCCAGGAAGCAAAGCTCAGCGATCCGGAAGTCATCCGGCTTTGCGAAGCGATCAAGCGCTCGCAGCGGCAGGAAATCGACGAGATGAAGGCCATACTCGCGCGGCGCTGAGTGGCACGGTCGGGCTACGCCCGGATACGTGCGCCCGGCCAGGTATGTGAGCCGGCCGATGGTCGACACGAGGCGCGGGGCGGACGAGATTGCCAGGGAGGTGGTGCGGCTTGCGCCGTTGTTCCTGCGACAGGCTGGCGGCCATGTCCTGACGCTGCTCGATCCTGCCGGGATCGTGCTGAGCTATAATGAAGAAGGCGAGCGTGCCGAATGCTGGCCCCTCGATCGCGTCCTGGGACAGCCCCATGACCTGTTCTACCCGCCCGACGAGATTGCGGCCGGCCGCCCGTGCGCAGACCTGGCGGCCGCCCTTCACGAAGGCACGCTGGAGCGCGAAGCGTGGCGGGTCTGCGAGAACGGAGCGGAGTATCTCGCGCGCCTGACGATCAGCGCCCTGTTCGAAGGCGACGCACATCGAGGCTTTGCCTGCATCAGTCGCGATGTCACCGACGAGGCGGCGGTCCGGGCGTCAATCGAGACCCGCGAGCAGCATCTCCAGTCGATCCTGGCGACCGTCCCCGATGCGATGATCATCATCGACGAGACCGGCGACATCACGTCGTTCAGCGCGGCCGCCCAACGCCTGTTCGGCTATTCGGAAACCGAGCTCGTCGGCCGCAACGTCTCCTGCCTGATGCCCCAGCCCGATCGCGACCGGCATGACGAATATATCGCGCATTATCTCCAGACCGGCGAGCGCCGGATCATCGGCCTCGGCCGCGTCGTGGTCGGCCAGCGCCGCGACGGCTCGACCTTCCCGATGCAGCTGTCGGTTGGCGAGGCCGGTGAAGACGGGCAGCGGTTGTTCACCGGCTTCATCCGGGATCTCACCGCCAAGGAGCAGGATGAGCTCAGGCTCAAGGAACTGCAGGCAGAGCTGGTCCATGTCTCCCGGCTGAGCGCGATGGGCACGATGGCCTTGACCCTCGCGCATGAGCTCAACCAGCCGCTTGCCGCGGTCGCGCTCTATCTCGAGACGATCCGCGACATGCTCGACGAGCGGGACGACGAACCCTTCGTGTCGCTACGGTCGGTGATGGATGATGCGGCACAGGAAACGCTGCGGGCCGGCCATATCGTCCGGCGCCTGCGCGATTTCGTCGCCCGCGGCGAGGTCGACAAGAGCCTCCACGATCTGCCGCAGGTTGTCGCCGAGGCGAGCGAGCTCGCGCTGGTCGGTGCACGCGAGCGCGGCATCCGCAGCTTCTTCGCGGTCGATCCCGCCGCGACGCCGGTTCTCGTCGACAGGGTGCAGATCCAGCAGGTGCTGGTCAACCTGATGCGCAACGCCATCGAGGCGATGGCGGCGTGTCCTGTTCGCGACCTCAAGGTAGCGACCAGGTTGCGCCCTGACGGGCTGATCGAGGTGACCGTGGAGGATACCGGCCCCGGCATCGCCGACGAGGTCCGGGAGCAGCTCTTCACGGCGTTCAAGTCGACAAAGGCCGACGGCATGGGCCTCGGCCTTTCGATCTGCCGGACCATCATCGAAGCGCATGGGGGCCGTATCTGGATGGAGCGCCCCGACCGCGGCGGCGCGCGCTTTCATTTTACCTTGATCCATGCGCGGGCGGAGGAGGAACATGGGGGATAG
Protein sequences of DBSCAN-SWA_6 >NC_020542|68812:70880|69344_70880_+|WP_015449254.1|DBSCAN-SWA MVDTRRGADEIAREVVRLAPLFLRQAGGHVLTLLDPAGIVLSYNEEGERAECWPLDRVLGQPHDLFYPPDEIAAGRPCADLAAALHEGTLEREAWRVCENGAEYLARLTISALFEGDAHRGFACISRDVTDEAAVRASIETREQHLQSILATVPDAMIIIDETGDITSFSAAAQRLFGYSETELVGRNVSCLMPQPDRDRHDEYIAHYLQTGERRIIGLGRVVVGQRRDGSTFPMQLSVGEAGEDGQRLFTGFIRDLTAKEQDELRLKELQAELVHVSRLSAMGTMALTLAHELNQPLAAVALYLETIRDMLDERDDEPFVSLRSVMDDAAQETLRAGHIVRRLRDFVARGEVDKSLHDLPQVVAEASELALVGARERGIRSFFAVDPAATPVLVDRVQIQQVLVNLMRNAIEAMAACPVRDLKVATRLRPDGLIEVTVEDTGPGIADEVREQLFTAFKSTKADGMGLGLSICRTIIEAHGGRIWMERPDRGGARFHFTLIHARAEEEHGG >NC_020542|68812:70880|68812_69286_+|WP_015449253.1|DBSCAN-SWA MDHSSHMNTRKMGAYWSLAVQTVISGVIMYLVMFVMIDGLDSFYNNLNMLYMTLMMVAPMVVLMILAMRHMFVSKAANIALIAASLVAFFGSFALIRTQTTIGDTAFLRSMIPHHSGAILMCQEAKLSDPEVIRLCEAIKRSQRQEIDEMKAILARR |
2 | Klosneuvirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
75163 : 77866
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NC_020542|75163:77866|DBSCAN-SWA CATGACCATCAGGATGGCCCTGCTCGGCAGCGCCGCCATCGTCGCGGGCTACGCCGCGCCGGCTCTCGCTCAGGCCGATCCGACACCGCAGCCAAAAGCACAAGTCCTGGGGGATGACAGCGACATCGTCGTTACCGGCTCACGAATCCGTCGGCAGGATCTGGCCGGGGTTGGACCGGCCACCGTGGTCACCGCCGAGCAGATCCAAAATACCGGCATCGTCAATATCGAAACCGCCTTGCAGCGTTTGCCGGCCAATGCCGGCTTCGCGGGCAACCAGACATCGGCCTACTGGACCAACAACGGTTATGGCACTGCTCAGGTCAATCTGCGCGGCCTCGGCATCAAGCGCACACTCGTGCTCCTCAACGGCCGACGCCTCGTCGCGGGCGGCACCGGCGCCAATTCCTCACCCGATCTCAACATGATCCCGGTCGTGGCGCTGGCACGCACCGATGTCCTCAAGGATGGCGCGTCCGCTATCTACGGGGCCGATGCCATGGCCGGCGTGGTCAACCTTGTCACGCGCACCGATTATGAAGGCCTGGGTCTCAGCGTCCGACAAGGCATTACCGAGCGCGGCGACGGCTCCGATCTCACCGCGGACCTACTTTGGGGCATTCGCAACGATCGTGGCGGGTTCATGGCAGCAGTCACCTACCAGAAGACCAGCGCCGTCAACATGGCCTCGCGTGCGCCCTGCTCGCTCGCTGAAACGACGCCGGGCTCGCTGAGCTGCGTCAACAGCGCCTCGACGATCGGCGGACGCGCGGTGCTGCCCAACGGCCAACAGATCAACTTCAACCAGGTACCCGGAGGGAACGGGAACTTCTACGAGCCTTACAGTCCCGCCAAGCACAACTTCAACTCGAACCCGTTTCTCAATGCGGTCAGCCCAGTCGAGCGCGTCAGCACGGCCTTCTTCGCGGACTATGCGCTGACCGACGGTATCCAGGCGTTCGGCGAGTTCCTCTATACGTTCCGCAAGTCGAATCAGATCGCGACACCCGGCACGCTGCGCAATCTCTCGATACCAGCCAGCAATCCGACCAATCCGACCGGCCAGAACCTGGTCCTGGCCCAGCGCCGCCTCGCCGAACCCGGCGCGCGCCACTTCTTCCAGGAGACCGATACCTGGCAAGGCACCTTCGGGCTGCGCGGCAAGCTGGCGAACGACTGGGCCTGGGAAGTCGCGGGCAGCTTCGGGCGTAACACGGCCGTCGACGGCTCGACCAACATCGCCAATCTCGAGCGCGTCGCGAACACGCTCGACAGGAGCAAATGCTCCAGCACGGCGGGCGCGGCCATCCCCTGCGGTGACTATCTTGGGTTCGGAGACCTGACACCTCAGGTTCTGGATTATATTCTGTTCACCTCGCGCGATCGCGGAGGCAACGAACTCGGCACGGTCACGGCTGACCTCAACGGCGATCTCTTCTCCCTTCCAGCCGGCGCGGTGTCCTTCGCCACCGGCGTGGTCTATCGGCGCGAAAAAGGCTGGCGCGATCCCGACCCGTTGACGGTGCTCGGCGTGGCGAACGTCAATCAGCAGGATCCTATTTCGGGCTCGAGCACCGCCAAGGAAGCCTATCTCGAGCTATCGGTGCCGGTGCTTGCCAACACAGCTTTCGCCAAGGCGCTCACGCTCGATGGCGCAGTCCGCTACTCAGACTATAATCTCTTCGGGAGCGACTGGAATTACAAGCTCAGTGCCGACTGGGTAGTCAATGACAGCATCCGTCTGCGCGGCACTTATGGGACGGGCTTTCGCATTCCCAACGTGCCGGAGCTTTTCGGCGGCGTCTCGGAAGGCAATCTGACCACGACCGATCCCTGCTCGCGCTACACCTCCAGCGGCAACGTGACCTTGATCGCCAATTGTCAGGCGTCCGGTGTGCCGGCCAACTATACGCAGCTCGGCACCACGATCCTCACCACAGTGGGCGGTAACCAGAGCCTTCGGCCCGAAAGCTCCACGACCTGGACCGTTGGTACGGTGATATCGCCGCGCGGCATCATCCCAGGGTTGTCGCTGACGGCAGACTGGTTCGACATCAAGATCAAGGACGCGATCCGGGCCATTCCCGGCTCGACCAAGCTCGCAGTCTGCTATGCGAGCCAGAACCTGTCGCACCCGTTCTGCAGCGACTTTACGCGCAGCGCGCTGACCGGCGAGGTCACTTACCTCTCCGCCCAGCCCATCAACACCGGCCGCGAGGAGATGAATGGTCTCGATCTGGGTCTGGTGTACAGTGGTGCGGTGGGCGAGGTGAAGATCTCCTTGGATCTCAACATGACCTATCTCAACAAATATGTCGTAAACCCCTTCCCCGGCGGCGCGCCGATCTATTTCGACGGGTTTATCGGGGGCGGCAATGGCGGCTATCCGAAATGGCGTGGTTATGGCGTGCTGACGGCGGAAAAGGACGGCATCAGCGCGACCTGGTCGACACAATGGATCGGCAAGGCGACCGACTTCAACGCATCGGCCGGCGACATCGGCTACCGCACGCCGAACGTCTTCTACCACAATCTTCAGCTTGCCTTCGCGATCGACGAGAAGACACGTTTCCAGATCGGGGCCGATAATCTGTTCGACCGCAAGGCGCCCTATATCCAGAGCTTCACCGATGCCAACACCGACACGATGACCTATGATCTGCTCGGACGGCGCTTCTACGCCGGCTTCCGAACCGCGTTCTGA
Protein sequences of DBSCAN-SWA_7 >NC_020542|75163:77866|75163_77866_+|WP_015449259.1|DBSCAN-SWA MTIRMALLGSAAIVAGYAAPALAQADPTPQPKAQVLGDDSDIVVTGSRIRRQDLAGVGPATVVTAEQIQNTGIVNIETALQRLPANAGFAGNQTSAYWTNNGYGTAQVNLRGLGIKRTLVLLNGRRLVAGGTGANSSPDLNMIPVVALARTDVLKDGASAIYGADAMAGVVNLVTRTDYEGLGLSVRQGITERGDGSDLTADLLWGIRNDRGGFMAAVTYQKTSAVNMASRAPCSLAETTPGSLSCVNSASTIGGRAVLPNGQQINFNQVPGGNGNFYEPYSPAKHNFNSNPFLNAVSPVERVSTAFFADYALTDGIQAFGEFLYTFRKSNQIATPGTLRNLSIPASNPTNPTGQNLVLAQRRLAEPGARHFFQETDTWQGTFGLRGKLANDWAWEVAGSFGRNTAVDGSTNIANLERVANTLDRSKCSSTAGAAIPCGDYLGFGDLTPQVLDYILFTSRDRGGNELGTVTADLNGDLFSLPAGAVSFATGVVYRREKGWRDPDPLTVLGVANVNQQDPISGSSTAKEAYLELSVPVLANTAFAKALTLDGAVRYSDYNLFGSDWNYKLSADWVVNDSIRLRGTYGTGFRIPNVPELFGGVSEGNLTTTDPCSRYTSSGNVTLIANCQASGVPANYTQLGTTILTTVGGNQSLRPESSTTWTVGTVISPRGIIPGLSLTADWFDIKIKDAIRAIPGSTKLAVCYASQNLSHPFCSDFTRSALTGEVTYLSAQPINTGREEMNGLDLGLVYSGAVGEVKISLDLNMTYLNKYVVNPFPGGAPIYFDGFIGGGNGGYPKWRGYGVLTAEKDGISATWSTQWIGKATDFNASAGDIGYRTPNVFYHNLQLAFAIDEKTRFQIGADNLFDRKAPYIQSFTDANTDTMTYDLLGRRFYAGFRTAF |
1 | Acinetobacter_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
84652 : 85678
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NC_020542|84652:85678|DBSCAN-SWA TTCAGGCGAGGTCGAGGACGATCCGGCCCTCGATGTCGCCATGATGCATGCGGGAGAAGACGTCGTTGATGTTCTCCAGCTTGTCTGCATGGACCGTGGCCTTGACCTTGCCCTCGCCGGCGAACGCCAGTGCCTCGAGCAGATCGAGCCGCGTGCCGACGATCGAGCCGCGCACGGTAATGCCGTTCAGCACGGTGTCGAAGATCGACAACGGGAAGTCGCCCGGCGGCAGGCCGTTGAGCGCGACCGTGCCGCCGCGCCGGACCATGCCGAGCGCCTGCTGGAACGCCTTGGGCGAAACGGCGGTCACCAGCGCCCCGTGCGCGCCGCCAATGGCTTTCTTGAGCGCTGCCGAAGGGTCCTCGTTGCGCGCGTTGACCGTCAGTGTGGCGCCAAGGCGTGTAGCGAGGTCGAGCTTGCTATCGTCGATGTCGACCGCGGCCACATTGAGACCCATGGCTCGGGCATATTGCACCGCCATGTGGCCGAGCCCGCCAATGCCGGAGACGACCACCCACTCGCCGGGTCGGGCCTCGGTGGCCTTCAATCCCTTGTAGACGGTGACGCCCGCGCAGAGGATCGGCGCAATGTCGAGGAAGTCGACATTGTCGGGAAGGTGACCAACATAGTTGGGATCGGCGAGGACATATTCGGCAAAGCTGCCATTGACCGAATAGCCGGTGTTCTGCTGTTCGTGGCAAAGCGTCTCCCAGCCACCCAGGCAATGCACGCAGTGCCCGCAGGCGGTGTAGAGCCAGGGCACACCGACCCGGTCGCCTTCCTTCACATGGGTGACGCCGGCACCAACGGCGGCGACATGCCCGACGCCCTCATGGCCAGGAATGAAGGGCGGGTTGGGCTTGACCGGCCAGTCCCCTTCTGCCGCGTGCAGGTCGGTATGGCACACGCCGGTTGCCGCAATCTTGACCAGGACCTGCCCGGGGCCGACCGTCGGGATCGGCGCGTCCTCGATGACCAGGGGCTTGCCAAATTCGCGGACGACCGCCGCCTTCATGGTTTTCGCCAT
Protein sequences of DBSCAN-SWA_8 >NC_020542|84652:85678|84652_85678_-|WP_004212676.1|DBSCAN-SWA MAKTMKAAVVREFGKPLVIEDAPIPTVGPGQVLVKIAATGVCHTDLHAAEGDWPVKPNPPFIPGHEGVGHVAAVGAGVTHVKEGDRVGVPWLYTACGHCVHCLGGWETLCHEQQNTGYSVNGSFAEYVLADPNYVGHLPDNVDFLDIAPILCAGVTVYKGLKATEARPGEWVVVSGIGGLGHMAVQYARAMGLNVAAVDIDDSKLDLATRLGATLTVNARNEDPSAALKKAIGGAHGALVTAVSPKAFQQALGMVRRGGTVALNGLPPGDFPLSIFDTVLNGITVRGSIVGTRLDLLEALAFAGEGKVKATVHADKLENINDVFSRMHHGDIEGRIVLDLA |
1 | Tupanvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
93081 : 94992
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NC_020542|93081:94992|DBSCAN-SWA ATCAGGCTTCGACCACGAGCGGCGTGCCCGTCCCCTTCGCCTGGCACGCGAGCACATAGCCTTGCGCCTTGTCGGCGGGCGCAAGGCCGGATTCGACCTCCATCGTCACTTCGCCCTGCAGCAGCTTGACCACGCAGGCGCCGCATGTGCCCGCACGGCATGCATAGGGGATCTCGACGCCCGCGCCCTCGGCCGCCTCCAGCACGGTCTCGTCCGCGGGCAAGGCCGCCGACACCCCCGACACGGAGAAGGTCACCGTGCTCGGCGCCACCTCGGCGCTCGGGGCCGGCTTATCCGCTGGCGGGGGCGCGGGCTTGACCTCGAGATCCTCGTGGTCGGCCGGCAGCGAGGCCGGACCGAACGCCTCGGTGTGCAGCTGCGCTTCGGGGACGCCGAGTTCCGCCAGCACGCCGCGCATCGCGCCCATCATCGCCGGCGGGCCGCACATATGGATGCGCCGGCTCGCGATCTCCGGCACCGCGGCCAGGATCATCTCGCGCGTGATCGGCCCTTCGGGGCCCATCCAGACGGTGCCGGGCGCGCGCTGCATCGCGGCGACGACATGGAGGTTGGGAAAACGGCGTTCGAGCCGCTCGAGCTCGTCGCGGAACACGAATTCCTCGGTCGACCGTGCGCCGTAGAAGAAGAAGATATCACCCTTCCACGCGGTGTCGGTGAGATAGCGCAGCACCGACATCATCGGGGTGATGCCGACCCCGCCCGCGATCAGCACGATGCTCTGGGCATCCGTTCCCGTGAAGGTGAAGGCGCCGAACGGTCCGCTGACCTTCAGCAGATCGTCGGCGACGACCTTGTCGTGCAGGTATCGCGAGACCACGCCCTGCTCCTCGCGCTTGACCGTCAGTTCGACATAGGCCCGCTGGGTCGGCGAAGAGGCGATCGTGTAGGAACGGCGCGCGGTCTTGCCCGCTTCGGGCTCTACCTCGACCTGCAGGAACTGGCCCGGCAGGAAGTCGAACGGCAGCCGGTCGGCGGTCGGGTCCGCGAGCCGGAAGGTCAGCACGCTCGGCGTCTCTCGCACGATCTGGACCACGCGCAGCTGCCCCGCCCAGCTTTTGGGCTTGCGCAGCGACGCCCCAGCGGGTGCGGCCGCATTGCTCGGCGCGAGCCCGGCGCTGTCGGCAACGGGTGCAGGCGCGGGCGCAACCCTGGTAGCGGGAGGGGCAGCCTTCGGCTCCGGCTTTGCTGGAACCTTGGCGGTGGCGACACCGCCGGCGATCGCCCCGATCCGCCGCAGGCGGAAGATCTGCAGCGCGATAAGCGTGGCGCTCACCAGCCCCAGGAACAGCATGAGAAGGAGGTGCGAGGGCGAGATGCCGAACCAGTGATCGGCATGGCCGGGCGCGGCCTGGTCGATGTCGAGCTGATCGCGCAGCCAGGCAAGCCCGACCTCCCGGGGCGGCTGCAAGCCCCCGATCGCGCCCTGCGCGGCGGTGCCGGAGCGGAACAGGTCGGTTCCCTCGCGCAGGCGCCGCGCCGCATCGAGCCGGGCCGAGACCGTGGTCGCACGCGCGCCGTCGGCCGAGGCGGCGTTGATCATCGCCAGACCCGTGCTTACCCGTTCGCCGGCCTGCGCAGCGATCCGCTGCCTTGCCGCTTCGTCAAGCGCGGGAAAGGCGAGCAGCTGCGAGATGAAGCCGGTCCGGCGCGATTCGTGGCCGTGCTCATTCTCCCCTTCCCCATTGATCATGCGATCCATCATGGGGTTCATCATCTTCGCCATGTCCGACGACCCCGGCTCTGCCGGTGCGGACATGCCCCCGCCGGCCGGCATGCCTGCGCCATGATGCGCGGCATGATCCTCGCCCTCCTGCGCGCCCGCGCTCGCGGACAGGCTGAGAGCGATTGCGCAGCCGAGGCTCAAGCGTCGAAGGCTGATCCGCCTGGACAT
Protein sequences of DBSCAN-SWA_9 >NC_020542|93081:94992|93081_94992_-|WP_015449268.1|DBSCAN-SWA MSRRISLRRLSLGCAIALSLSASAGAQEGEDHAAHHGAGMPAGGGMSAPAEPGSSDMAKMMNPMMDRMINGEGENEHGHESRRTGFISQLLAFPALDEAARQRIAAQAGERVSTGLAMINAASADGARATTVSARLDAARRLREGTDLFRSGTAAQGAIGGLQPPREVGLAWLRDQLDIDQAAPGHADHWFGISPSHLLLMLFLGLVSATLIALQIFRLRRIGAIAGGVATAKVPAKPEPKAAPPATRVAPAPAPVADSAGLAPSNAAAPAGASLRKPKSWAGQLRVVQIVRETPSVLTFRLADPTADRLPFDFLPGQFLQVEVEPEAGKTARRSYTIASSPTQRAYVELTVKREEQGVVSRYLHDKVVADDLLKVSGPFGAFTFTGTDAQSIVLIAGGVGITPMMSVLRYLTDTAWKGDIFFFYGARSTEEFVFRDELERLERRFPNLHVVAAMQRAPGTVWMGPEGPITREMILAAVPEIASRRIHMCGPPAMMGAMRGVLAELGVPEAQLHTEAFGPASLPADHEDLEVKPAPPPADKPAPSAEVAPSTVTFSVSGVSAALPADETVLEAAEGAGVEIPYACRAGTCGACVVKLLQGEVTMEVESGLAPADKAQGYVLACQAKGTGTPLVVEA |
1 | Synechococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
98454 : 102890
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >NC_020542|98454:102890|DBSCAN-SWA GCTAAGCCGGTTGCAGCGGCCGTAGCGGCCTGAACTTGCCCGCGCCGATCTTGGCGCTGCTGCGCCAGAGGTAATCGCCGGTCAGGTTGATGTGCTCCCAGCCGAGCGGCGACAGGTACTGCAACAGGCCGTCATCGACGGCTTGACCGTGGCCACGCAAGGCGTTCGCGGCCCGCTCCAGATAGACCGTGTTCCATAGCACGATGGCCGCCGTCACCAGGTTGAGGCCGCTGGCCCGGTAGCGCTGCTGCTCGAAGCTGCGGTCGCGGATTTCCCCCAGGCGGTTGAAGAACACGGCGCGGGCCAGCGCGTTGCGCGCCTCGCCTTTGTTCAGCCCGGCATGCACGCGGCGGCGCAGCTCGACGCTTTGCAGCCAGTCCAGGATGAACAGTGTGCGCTCGATGCGTCCCAGCTCACGCAGGGCGACGGCCAGGCCGTTCTGGCGCGGGTAGCTGCCAAGCTTCCTGAGCATCAGCGAGGCCGTCGCCGTGCCCTGCTTGATCGAGGTGGCCATCCGCAGGATTTCATCCCAATGGGCGCGGACGTGCTTGATGTTGAGCGTGCCGCCGATCATCGGTTTCAACGCCTCGTAGGTGGCATCGCCCTTCGGGATGTAGAGCTTGGTGTCGCCCAGGTCACGAATGCGCGGGGCGAAGCGGAAGCCCAGCAGGTGCATCAACGCGAAGACGTGGTCCGTGAACCCTGCCGTGTCGGTGTAGTGCTCCTCGATCCGCAGGTCGGATTCGTGATACAGCAGGCCGTCGAGCACGTAGGTCGAGTCGCGCACGCCGACGTTCACGACCTTGGTGTGGAACGGCGCGTACTGGTCGGAGATATGGGTGTAGAACGTCCGCCCTGGGCTGCTGCCGTATTTCGGATTGATGTGGCCGGTGCTCTCGGCCTTGCTGCCGGTGCGGAAGTTCTGGCCGTCCGACGATGACGTGGTGCCGTCGCCCCAATGCTCGGCGAAGGGATGTCGGAACTGCGCGTTGACCAGCTCGGCCAGTGCCGCCCCGTAGGTTTCGTCGCGGATGTGCCAGGCTTGCAGCCAGGCCAGCTTGGCGTAGGTCGTGCCGGGGCACGATTCCGCCATCTTGGTCAGGCCCAGGTTGATCGCGTCGGCGAGGATCGTGGTCAGCAACAGGTTTTTGTCCTTGGCCAGGTCGCCTGACTTCAGGTGGGCGAAGTGCCGGGTGAAGCCCGTCCATTCGTCTACCTCCAGCAGCAATTCGGTGATCTTGACGTGCGGTAGGATCATCGCCGTCTGGTCGATCAGGGCCTGTGCGGTATCGGGCACCGCCGCATCGAGCGGCGTGATCTTCAGGCCCGACTCCGTGATGATGGCGTCCGGCAGCTCGTTGGCCAGCGCCATGCGGTTGACGGTGGCGAGCTGCGTTTCCAGCAGCGTCAGCCGGTCATGCAGGTACTGGTCGCAATCGGTGGCCACGGCCAGCGGCAATTCGCTGGCCTGCTTGAGGCTGGCGAATTTCGCGGGCGGCACCAGGTAGTCCTCGAAGTCCTTGAACTGGCGCGAGCCTTGCACCCAGATGTCGCCGGAGCGCAGCGCGTTCTTCAGCTCCGACAGCGCGCACAGTTCGTAGTAGCGCCGGTCGATGCCGGTGTCGGTCATCACCAGCTTCTGCCAGCGCGGCTTGATGAACTCGGTCGGCGCGTCGGTGGGCACCTTGCGGGCGTTGTCGCTGTTCATGCTGCGCAGCACCTCGATGGCGTCGAGTACGTCCTTGGCGGCGGGCGCGGCCCGCAACTTGAGCACGTCGAGAAATTCCGGCGCGTAGCGGCGCAGCGTGGCGTAGCTCTCGCCGATGCGGTGCAGGAAATCGAAGTCCTCGGGTTGCGCGAGCCGCTGCGCTTCGGTGACGCTCTCGGCGAAAGCATCCCAGGACATGACGGCCTCGATGGCGGCGAACGGATCGCGGCCCGCTTGCTTGGCCTCGATCAGCGCCTGGCCGATGCGCCCGAACAGCCGCACCTTGGCATTGATCGCCTTGCCGGATGCCTGGAACTGCTGCTGATGCTTGTTCTTGGCGGCATTGAACAGCTTGCCCAGGATGCGGTCATGCAGGTCGATGATTTCGTCGGTGACGGTGGCCATGCCCTCGATGGCGAGCGCCACCAGGGTCGCGTAACGCCGCTGCGGCTCGAACTTCGCCAGGTCGGCGGGCGTCATCTGGCCGCCCTCGCGGGCGATCTTGAGCAGCCGGTTCTGGTGAACCAGCCGCTCGATGCCGGAGGGCAGGTCGAGCGCCTGCCACGCCTTGAGGCGTTCGATGTGTTCCAGCATGTGCCGCGAGTTCGGTTTGACCGGGGATTGCCGCAGCCAGGCCAGCCACGTCGTCTTGCCGTTGTCGCGGCGCTTGAGCAGATCGTCGAGGCGACGGCGATGCACGTCCGTCAGCGGCTCAGCCAAGGCGTCGTAGAGACGCCGGTTGGCGCGGGTAATCGCTTCGGCGCTCGCCCGCTCGACGGCGTTGAGGGCGGGCACAATGACCGACTGCCGCCGCAGGTGCTCGATCAAGGCTCTGGCCAGCACGATGCCCTTGTCGGTTTGCATGGCCAGCTCGGTCAGCAACTGGACAGCCTGCCGGTAGTGGCCAATCGTGAACGGCTGGAAGCCGAACACCGTTTGCAGCTCGACCAGGTGCTCGCGTCGGGTCTGCTCACGCTGCCCGTACTCGTCCCAGCTTTCGATGCCGACCTTGAGCTGGTTGGCGACCAGTCTCAGCAATGGCGGGAACGGTGGCTCATCAGCGCCAAGGATGACGCCGGGAAAGCGCAGGTAGCAGAGCTGCACCGCGAAGCCCAGCCGATTGGCCGGGCCGCGCCGCTGCCGGATGATGGAGAGGTCGCTTTCGCTGAACGTGTAGTGACGGATCAACTCATCCTTGGTGTCCGGCAACGCCAGCAGGCTTTCGCGCTCGGCGGCGGAGAGGATCGAACGGCGGGGCATGCGGTTTCCTTCTTCTTGAAAACGTAGGTTTGTGACAAGCCCGCCAAGGCAACCGGCGCGGCACGGGAATCAAGGCATTGCGATTCTCGAAAATAGTTCTTGAAATTCTATTCTTGATTGCATATCATCTCAACGAGTTTCGATAAGAAAGGATGCCCATGCGACCGTCTGTTGTGCTTGACATGAAGCGAAGCGCAGTGCGTGAAGCGGTAGGCCGCTTTCGCGCCGCGAACCCGCGCGTCTTCGGCTCGGTGCTGCATGGCACCGACCGGGATGGCAGCGACCTCGACCTGTTGGTCGATGCGCTGCCCGGTGCCACGTTGTTGGACTTGGGCGATTTGGAAGAAGAACTGAAATCGCTGCTCGGCGTTGACGTCGATCTGCTGACTCCCGGCGACCTGCCGCCGAAGTTCCGGGCCAAGGTGCTCGCGGAGGCGCAACCGATATGAGCGAGAACCGCCTGCCCGATTACCTCGACCACATTCAGCAGGCCGCAACCGATGCGCGCAGCTTCGTGGAAGGGATGGCCAAGGACGACTTCTTGGCCGACAAGCGCACCCAGCAGGCCGTCATCATGAGCCTGATCGTCATCGGCGAGGCGGCCACAAAGGTGATGGATGGCTACGTCGAGTTCACCCAGGCAGGCTCTGTTGCAAAAATCGTGAAGCTTGAGCATGCTTGGCGGAGATTGGACGGACGGAACGATGACGGATTTCAAGTGGCGCCATTTCCAGGGTGATGTGATCCTGTGGGCGGTGCGCTGGTATTGTCGCTATCCGATCAGCTATCGCGACCTTGAGGAAATGCTGGCGGAACGCGGCATTTCGGTCGGCCATACGACGATCTATCGCTGGGTCCAGTGCTACGCCCCGGAGATGGAGAAGCGGCTGCGCTGGTTCTGGCGGCGTGGCTTTGATCCGAGCTGGCGCCTGGATGAAACCTACGTCAAGGTGCGGGGCAAGTGGACCTACCTGTACCGGGCAGTCGACAAGCGGGGCGACACGATCGATTTCTACCTGTCGCCGACCCGCAGCGCCAAGGCAGCGAAGCGGTTCCTGGGCAAGGCCCTGCGAGGCCTGAAGCACTGGGAAAAGCCTGCCACGCTCAATACCGACAAAGCGCCGAGCTATGGTGCAGCGATCACCGAATTGAAGCGCGAAGGAAAGCTGGACCGGGAGACGGCCCACCGGCAGGTGAAGTATCTCAATAACGTGATCGAGGCCGATCACGGAAAGCTCAAGATACTGATCAAGCCGGTGCGCGGTTTCAAATCGATCCCCACGGCCTATGCCACGATCAAGGGATTCGAAGTCATGCGAGCCCTGCGCAAAGGACAGGCTCGCCCCTGGTGCCTGCAGCCCGGCATCAGGGGCGAGGTGCGCCTTGTGGAGAGAGCTTTTGGCATTGGGCCCTCGGCGCTGACGGAGGCCATGGGCATGCTCAACCACCATTTCGCAGCAGCCGCCTGA
Protein sequences of DBSCAN-SWA_10 >NC_020542|98454:102890|101579_101870_+|WP_001247892.1|DBSCAN-SWA MRPSVVLDMKRSAVREAVGRFRAANPRVFGSVLHGTDRDGSDLDLLVDALPGATLLDLGDLEEELKSLLGVDVDLLTPGDLPPKFRAKVLAEAQPI >NC_020542|98454:102890|102125_102890_+|WP_041865600.1|transposase|DBSCAN-SWA MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVGHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >NC_020542|98454:102890|98454_101421_-|WP_003100881.1|transposase|DBSCAN-SWA MPRRSILSAAERESLLALPDTKDELIRHYTFSESDLSIIRQRRGPANRLGFAVQLCYLRFPGVILGADEPPFPPLLRLVANQLKVGIESWDEYGQREQTRREHLVELQTVFGFQPFTIGHYRQAVQLLTELAMQTDKGIVLARALIEHLRRQSVIVPALNAVERASAEAITRANRRLYDALAEPLTDVHRRRLDDLLKRRDNGKTTWLAWLRQSPVKPNSRHMLEHIERLKAWQALDLPSGIERLVHQNRLLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASGKAINAKVRLFGRIGQALIEAKQAGRDPFAAIEAVMSWDAFAESVTEAQRLAQPEDFDFLHRIGESYATLRRYAPEFLDVLKLRAAPAAKDVLDAIEVLRSMNSDNARKVPTDAPTEFIKPRWQKLVMTDTGIDRRYYELCALSELKNALRSGDIWVQGSRQFKDFEDYLVPPAKFASLKQASELPLAVATDCDQYLHDRLTLLETQLATVNRMALANELPDAIITESGLKITPLDAAVPDTAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKSGDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDETYGAALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYIPKGDATYEALKPMIGGTLNIKHVRAHWDEILRMATSIKQGTATASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERAANALRGHGQAVDDGLLQYLSPLGWEHINLTGDYLWRSSAKIGAGKFRPLRPLQPA |
3 | Salmonella_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_11 |
108000 : 108594
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >NC_020542|108000:108594|DBSCAN-SWA TGTGGCGTTGATCGGCTATGCGCGCGTCTCGACGGCAGACCAGAAGCTCTCACTCCAGCTCGACGCGCTGAACGCGGCCGGGTGCGACCGGATATTCGACGATCACGCATCTGGTGCAAAAGCCGATCGGCCCGGCCTGGCCGAGGCGCTCGCCTATCTGCGCAGCGGCGACACGCTGGTGGTTTGGAAACTCGACCGTCTTGGGCGCTCGATGAGCCATCTGATTGAGAAGGTCGGCGAGCTAGCGACGCGAGGTATCGGGTTCCGCTCGCTCACCGAGAACATCGACACCACCACTTCGGGCGGGATGCTGGTGTTCAACATCTTCGGCTCGCTTGCCCAGTTTGAGCGCGATCTGATCCGGGAGCGCACCCATGCAGGCCTCAAGGCCGCTCGCGAGCGAGGTCGCCCCGGTGGGCGGCGGCCAGTGGTCACACCTGACAAGCTCCGCAAGGCGCGCGAACATATCGCTTCCGGCCTCACAGTTCGCGAGGCCGCCGCGCGCCTCAAGATCGGCAAAACCGCCCTCTACAAAGCCCTCGAAGCCACGGAGAAGAACACAAAGTCCCAGCGTTCCCGGTCCGTTCGCTCTTAG
Protein sequences of DBSCAN-SWA_11 >NC_020542|108000:108594|108000_108594_+|WP_013039114.1|DBSCAN-SWA MALIGYARVSTADQKLSLQLDALNAAGCDRIFDDHASGAKADRPGLAEALAYLRSGDTLVVWKLDRLGRSMSHLIEKVGELATRGIGFRSLTENIDTTTSGGMLVFNIFGSLAQFERDLIRERTHAGLKAARERGRPGGRRPVVTPDKLRKAREHIASGLTVREAAARLKIGKTALYKALEATEKNTKSQRSRSVRS |
1 | Salmonella_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_12 |
116832 : 117813
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >NC_020542|116832:117813|DBSCAN-SWA CTCAATATTGGTCCAGGTCGCGGCCATGGGCGACTTCGCCGTGGAAATGCCGGGATATTCCCGCGGCATAGGATTCGACATCGGTTTCCCCGTCATTGCCAGGCGCGAAATGCGTGAGGATGACCTTCTTGACGCCCGCGGCGGCGGCGATTTCGCCGATCTGCTCCCCGGTCAGGTGGTCTTCGCGCATATGTTCCATCAGCGGCGGAATCTGATCGGCGGAGAAGCCCGGATAGCTTCGCAGCAATTTTTCCATGCGGGCAATGTCGATCACCTCGCAGACAAGATAATCGGCATCCTTCGCCAGGATCTTCAGGCGTTCGGACGGTCCGGTGTCGCCGCTGAAGACATAGACCTTGCCTCCCGCCTCCACGCGATAGGCATAGGAACGGGACGACCGCGCCTCGATCGATCCGGCCGGATAATGATAGTGATCGACGCCGATGGCGAACACCCGCACCCGATCGTCCTTGTAGACCAGCCGCGGTTCATTGAGGTCAAGCGGCAGGTCATGGCCGACCGCCGATCCGGCGAGGGGCGGCTTGACCGGGCCGCCGACCGTCACCGGGGCCAGCTCCACCGGCCGAAAGGCGGCGAGCGTCCCGGCGACGAGCTGCGCCGATCCCGGCGGTCCGTATATCTCCAGCGGAGGCGCGTTGGCGAGAAGCCAGCGGTCGATCATCAGCACGGGAACATCGGCGACGTGATCGATATGATGGTGGGTTAGGAACAGGGCCTTGACCGACCCCAGCCCCAGCTTGCTGGCCGCCATCTGCCTCAAAAGGCCGTCGCCCGCATCGATGAGATAGGTTTGCCCGCCGACGACGATCGCATTGGCGGGCTGGGAGCGTTCCAGCCGCGCTATCGGCCCGCCGGCCGTGCCCAGCAGGATGAGGCGCGGCGGCTGCGCAGGCGGCGTTTCCGCCCGCGCAGCCACGGTTCCGGCTAGTGCGAATATCAATATTGGGAGCAGATGCTTCAT
Protein sequences of DBSCAN-SWA_12 >NC_020542|116832:117813|116832_117813_-|WP_007687842.1|DBSCAN-SWA MKHLLPILIFALAGTVAARAETPPAQPPRLILLGTAGGPIARLERSQPANAIVVGGQTYLIDAGDGLLRQMAASKLGLGSVKALFLTHHHIDHVADVPVLMIDRWLLANAPPLEIYGPPGSAQLVAGTLAAFRPVELAPVTVGGPVKPPLAGSAVGHDLPLDLNEPRLVYKDDRVRVFAIGVDHYHYPAGSIEARSSRSYAYRVEAGGKVYVFSGDTGPSERLKILAKDADYLVCEVIDIARMEKLLRSYPGFSADQIPPLMEHMREDHLTGEQIGEIAAAAGVKKVILTHFAPGNDGETDVESYAAGISRHFHGEVAHGRDLDQY |
1 | Pandoravirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_13 |
123113 : 125490
Sequences of DBSCAN-SWA_13
Nucleotide sequences of DBSCAN-SWA_13 >NC_020542|123113:125490|DBSCAN-SWA CATGCACGCCAGCGCCGCGATCCTGCGCCATGCGCAAGCCCCTTTCGCCATCGAGCCATTGGACCTGCCCGAACCCGGCGGGAACGAGATCCTCGTCCGCGTCGCCGGTGTCGGCATGTGCCACACCGACCTGGTGGTCAGGCATATCCCCGCCGAATGGGCGCCCCTGCCCGCCGTGCTGGGCCATGAAGGATCCGGCATCGTCGAGGCGGTCGGCCCCTCCGTCACGCGCTTCGCCGTCGGGGACCATGTGGTCCTGACCTATGATTCATGCGGCTGGTGCGACAATTGCCATGGCGGCACGCCATCCTTCTGCTCCGAATTCAGCGAGCGGAACGAACTGGGGCTGCGGCCCGGATCGCATCGGTCGGGCCGGGATTCGGACGGGGTCGAATTGCAGACGCGCTGGTTCGGCCAGTCGTCCTTCGCCACCCATTCGCTGGCGACCGAGCGCAATGCGGTCAAGGTGTCGCCGGACCTGCCCATAGAATTGCTGGGGCCGCTGGGCTGCGGCATCCAGACCGGGGCGGGCGCTGTCCTCAACGCCCTGCAGGTCCGTTTCGACAGCAGCGTCGTCATATTCGGGACAGGCGCGGTCGGCATGGCGGCGATCATGGCGGCGCGCATCGCCGGGGCGCGGAAGGTGATCGCGGTCGACCTGCACGAAAACCGCCTGCAACTCGCGCAGGAACTGGGGGCGACGCATCGCATCTCCGGCCGCGATCCCGAACTGGTCGAGCAGATCATGTCCATCACCGGGGGCGGCGCGACCCATGCGCTGGACACGACCGCCGCGCCCGCCGTCATCGCCGGCGCCCTGGCGGCCCTGCGCGCGCGGGGGAAGCTGGGGCTGGTGGGCGGCGGCGGCGCGCCGCTGGACCTGCCGCAGGAGGCGCTGATGAAGGGGCAGCAAATTTCCTTCATCATCGAGGGCAATTCGGTGCCCCAATTGCTGATCCCGCAACTGATCGAACTCTGGTCGGAGGGGCTTTTTCCCTTCGACCGGCTGATCGCCCGATATCCGCTGGAGAAGATCAACGATGCCGAGCGCGACCTTGCATCCGGTTCGGTCATCAAGCCGGTCCTGCTGCCGGGCCGATGACGCGACGAACCGCTGACACCCACTACGAAGAAGGCACGAGCCAGTGACGACAGAGACCCAAATCCAGGAAAAGCCCGTTCCCGAAACCACCTTCCCCTGGGAGCGCGGCACTTATGATCCGCCGCCGGCCTATGCCTGGCTGCGCGAGCACGAGCCGGTCCGGCGCGTCGTCCTGCACGACGGCACGCCGGCCTGGCTCGTCACCCGCTATAACGACGTCCGGAGCATCCTGGCCGATCCCAGGGTCAGTTCGAACCAGAATCTGCCCGGCTTCCCCCAGATCGAACTGTTGCCGCGCCCCAGCGAGGAGGAAAGCACCTTCCTCAACATGGACGCGCCGCGCCACACGCTGTTCCGGCGGCTCATCTCCAAGCATTTCATCGTGAAGAAGCTGGAGGTCATGCGGCCGCGCATCCAGGCGCTGGTCGACGAGCATATCGACCATATCATCGATCGTTCGGAGCCGTTCGATTTCGTGGAGGAGATCGCCCTGCCCGTCCCCTCCACGGTGATCGCCTGGCTGCTGGGCGTGCCGCCGTCGGACCATCCCTTCTTCAACCGGGAGACGGAGGCGCTGCTGGCCGCCAGCCTGGGGACGGAGGAAGCCATAGAGCGCGCCACGGAGGCCTATGCGAACATCAACGACTATGTCGACCGGCTGATCGCCGAGCGGGAGAAGCTGGACGATCCGGGCGACGACATATTGGGCGACCTGGTGCGCGCCAGCCGGGAGGGCCAGATCGAGCGGCGGGACGTGCTGAACACCGCATGGCTGCTGCTGGTCGCGGGTCATGACACGACCGCGAACATGATCGGCCTGGGCATGCTGACGCTGCTGGAGCATCCGGATCAACTGGCGCAGCTTCAGGCGGAACCGGCCCTGATCCCGGACGCCATAGAGGAACTGCTGCGCTATCTGACGGTCGTGCACCTCATCATCCTGCGCATCGCGACGGAGGATATAGAGATTGGCGGGGTGACGATCCCTGCGGGCGAAGGGATCATCCCGCTCAACTTCGCGGCGAACCGCGACGACGGGCATTTCCCGGATGCGGCGAAGTTCGACATACGCCGCCGCCCGCGCGACCATGTCGCCTTCGGCTATGGCGTGCACCAGTGCATCGGTCAGGCGCTGGCCCGCATCGAATTGCAGATCGTGTTCGAAACGCTGCTGCGGCGCATTCCGAACGTGCGGCTGGCGACCGATCCGGCCGACATCCAGTTCAAGAGCCATGCGAGCATCAACGGCATCGCCCGGCTGCCGGTGGCGCTCTAG
Protein sequences of DBSCAN-SWA_13 >NC_020542|123113:125490|124257_125490_+|WP_007687826.1|DBSCAN-SWA MTTETQIQEKPVPETTFPWERGTYDPPPAYAWLREHEPVRRVVLHDGTPAWLVTRYNDVRSILADPRVSSNQNLPGFPQIELLPRPSEEESTFLNMDAPRHTLFRRLISKHFIVKKLEVMRPRIQALVDEHIDHIIDRSEPFDFVEEIALPVPSTVIAWLLGVPPSDHPFFNRETEALLAASLGTEEAIERATEAYANINDYVDRLIAEREKLDDPGDDILGDLVRASREGQIERRDVLNTAWLLLVAGHDTTANMIGLGMLTLLEHPDQLAQLQAEPALIPDAIEELLRYLTVVHLIILRIATEDIEIGGVTIPAGEGIIPLNFAANRDDGHFPDAAKFDIRRRPRDHVAFGYGVHQCIGQALARIELQIVFETLLRRIPNVRLATDPADIQFKSHASINGIARLPVAL >NC_020542|123113:125490|123113_124214_+|WP_007687828.1|DBSCAN-SWA MHASAAILRHAQAPFAIEPLDLPEPGGNEILVRVAGVGMCHTDLVVRHIPAEWAPLPAVLGHEGSGIVEAVGPSVTRFAVGDHVVLTYDSCGWCDNCHGGTPSFCSEFSERNELGLRPGSHRSGRDSDGVELQTRWFGQSSFATHSLATERNAVKVSPDLPIELLGPLGCGIQTGAGAVLNALQVRFDSSVVIFGTGAVGMAAIMAARIAGARKVIAVDLHENRLQLAQELGATHRISGRDPELVEQIMSITGGGATHALDTTAAPAVIAGALAALRARGKLGLVGGGGAPLDLPQEALMKGQQISFIIEGNSVPQLLIPQLIELWSEGLFPFDRLIARYPLEKINDAERDLASGSVIKPVLLPGR |
2 | Synechococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_14 |
136929 : 142602
Sequences of DBSCAN-SWA_14
Nucleotide sequences of DBSCAN-SWA_14 >NC_020542|136929:142602|DBSCAN-SWA GTCATGCGCTTTCTCCCTCCCGCTCCAGCACGGGGCCAAGGCCCATAGCCTGGCGAAGGAAGCGCGCCGCATGGTCCAGTTCGCCCCGCGCCGGAGCGATGACCCCGCCCAGCGTCAGGAAGCCGTGCATCTGGTCAGGCAGGTGTCGGTGCTCGCTGCCCCGCACCTCGCGCTTCAGCCGCCCGGCCAGCGCGATGGTGTCGTCGCGCAAGGGATCGTGGCCGACCGACAGGATGAGCGTGGGCGGCAGGGCGCCGAAATCCGGATGGCGCGCCGGCGATGCGCGCCAGTCGCCCGCCAGCGCCACATCGTCCAGATAAAGGTCGCGGAACCAGCGAAGCGTCGAGGATGTGAGCGGATAGCCGCTGCTGAATTCGTCCATCGACGCGGTTCTGCGGCCCATGTCGCCGGCGGGGTAGAAGAGGGCCAGCGCGGATGGCGTCGCTTCGTCCCTCAGCGCGAGCGCCGCGACCAATGCCAGCGTCGCGCCCGCGCTGTCGCCCGCAACCGCCAGCCCCGCCGTTTCGATGCCCAGCCCGGCCACCGCCTCAGGCACATGGCGCAGCGCGGCGACGGCGTCCTCCACCGCCGCAGGAAAGGGATGTTCGGGCGCCAGCCGGTAATCGACCGCGACCAGGATCGCGCCCGCCTGCCTGGCCAGCAGCCGGCAGGTGGAATCATGGGTGTCCAAATCCCCCATGACCCAGCCGCCGCCATGCAGGTAGACGATCAGCGGGCGCGGCCCGATCGCCGCCTCCGGCTCGTATATCCGCGTCCGCAGCGGATTGCCGCCCGCCCCGATCGTCAGATCCCTGGCGCGGACATCGCGTTCCGGTTCGAAGCCTGCGCGCAGCGACGTGGCGCGATAGCCTTCCCGCGCCTGTTGCGCCGTGCCCTGTTCGAGCGGAGGCAGGGCCGCCGCGCGATATTGATCGACGACGCCCTGCGCCCCCGGTTCCAGGGGGACGCGAAGTTCGATTGCCTCTGTCATTGTCCAACCACTTTCTGCACGACGCCGCCATCGACCAGGAATTCGGCGCCGGTGACATAGGATGCTTCATCCGAAGCGAGGAACGCGACGACCTGGGCCACTTCGTCCATGCCCCCCGGCCGTCCCAGCGGGATAAGCGACGTGTCGACCTTCAATCCCGCGGTCATGGGCGTGCTGATCATGCCGGGCAGCACCGCGTTCACCCGGACGCCCGCCGCCGCGAAGTCGAGGGCGGCCGCCTTGGTGAGGCCGATGACCCCGAATTTGGAGGCGACATAGCCGTAACGTTCGCGATTGCCCCTGACGCCGTTGATCGAGGAGATGTTGATCACCGATCCGCCGCCGGCCGCCGCCATGCGCGGAATGCAGGCGCGCATGCCGTAGAAGACGCCGCCCAGATTGACGGCGATCTGCCTTTCCCATTGGTCGTCGGACGTGGTTTCCAGCGCGCCGCCCCCGCCCACGCCCGCATTGTTCACCAATATGTCGATCCTGCCGAACCGTTCGACGCATCGGTCCGCCGCCGCCCCCCAGCTTTCGCCGCAGGACACGTCCAGGCGGAGATAGTCGGCGGCATCGGCGCCCATCTCCCCGGCCAGCAGCCGGCCCTCCTCGTCGATCACGTCGGTGACCAGAACCCTGGCGCCCTCGGCGACGAAGCGGCGGACATAGGCCGCGCCCAGGCCACGGGCGGCGCCCGTGATCAGGGCGGTCTTCCCCGTCAGGCGGCTCATTCCGCCGCCTGCGTCGTCAGCTCGCCCAAGGGGGTGAAGCGATAATCGCCCTCGGCCGGCCCGCGCGTCATGTTCCAGAACTCCAGCAGTTCGAACGGCCAGACGGCGGAAACGCGGCCGGTCTTGTTCCTGTACCAGCTCGTGACGCCTTCCGCGCCCCATGCCCGCGCGCGGTTGGCGCGGTCGATCTTCCCCACGAAGGCATCGGTCGCGGCGTCGGTGGGTTCCATGTCCTTCAGCCGCTTCACGACCAGGGTGCGGATGCACTCCAATATATATTCGATGCTGCATTCGACGATGAAGATCGCGCTGCCGTTCGCCGCCACGGCCGAATTGGGACCGCCGCAGATGAAGAAATTGGGGAAGCCCGGCACCGTCATGGTGGCATAGGCCCGCGGATCGCCGTCCCAATATTCGTGCACCTCCACGCCGTCGCGCCCGCGCACCTCCATGTCGGCCAGGAACTCGTCCGCGCGGAAGCCGGTGGCATAGGCGATGACGTCCACCTCATGCAGCACGCCGTCCGCCGTGACGACGCCTTCGGGCACGATCCGCTCTATTCCTTCGGTCACCAGGTCGACATGCGGCTTGCGCAGCGTGGCCGACCATTGGCCATTGTCGCGCAACAGGCGCTTGGCCCCCGCCGGATAGGTGGGAATGACCTTGTCGATCAGGTCCGGCCTGTCGGCGAGCTGGCGGCGGAATTCGGCCTCCAGTTCCTGGCGGAAGGCATGGTTGATGGCGCCGATGGAACCCTCTTCGCTCCAGCCTTCATCGGCGGTGACGGTGGGCATCCGGCCTTCCGTCGCGAGCCAGAACTGCCAGAAGCGATACCAGCGCGCATAGCCCGGAATGTGCCGGAACAGCCACATCAGTTCCGGGCTGATGTCGTCCAGATAGTCGGCCGTGGGCAACAGCCAGGGCGCGCTGCGCTGGAACACCTCCAGCTTGCCGACCTCGTCGGCGATCGCCGGCGCGATCTGATAGGCGCTGGCGCCGGTCCCGATGATCGCCACCCGCTTGCCCTTCAGGCTTACGTCATGGTTCCAACGGGCGGAATGCATGCTTGCGCCCGCGAAGCTGTCCATGCCCTCGATATTGGGCAGCTTGGGCGAGTTGAGCTGGCCGGTGCCGCTGATGACGACATTGGCGCGGTGGACCCTGACCTCCCCGTCGGTGACGGCCGTCACCTCCCACTCGCCCGCTTCCTCGTCATAGATCACCGATCGGACGAGGGTATTGAAAACGATGCGGTCGCGCACGCCATAGTCATGCGCCACTTTCAGGACATAGTCCTGGATCTCCGACTGGCGCGAGAATTTCTGCGGCCAGTCGGTCTTCTGCGCGAAGCTGAAGCTATAGGCATAGTTCGACGTGTCGAGACGGCAGCCCGGATAGGTGTTTTCCCACCATGTCCCGCCGATCTCTGGATTCTTTTCCAGCACGACATGCTCCACCCCGGCCTGCTTGAGGCGATAGGCGGCGACGACGCCGGTGATGCCCGCGCCGATGATCAGCACCTTGAACGGCCGGTCGGGATCGAGTTCGGACAGGGTCCACCGGGGCGCGCCCACGTCATGGTCGGCCACCTCATGGATCAGAAGATCCCGGTAATCGTCGAAATCGCGGGTGATGAGGAAGCTGACGATCGCCCGCAGCTCCGCATCGTCGGTGTGGCCGGGCGTCTGGATCTCGCCGTCGCGATACCGCCTGATCGCGTCGAAGGCGAGCCGCCTCGCTTCCTTCTGCTGTGCTTCGTCAAGGCCGCCCTGCGGCATCGGGATGAAGGATTTGCGCGTATGCGGCGGCTTGAGCGATGGGGGAACCAGCGACGTGTCGCCCGTCAGCTTCGCCAGCGCGGCCAGCAGGGCGGGCAGTTCCGCCTTCTCCAATATACGCTCCAGTTCGGCATCGTCGATGTCGATCGGAAGATAGGTGTCGAACGCATCTGCATGGTCGATGGGGTTGGCGGACATGGTCTTTCCTCTCTGGCCGCGAACCTGCGGCATATGCGTGGCGGCGCGTGAACCGGGCCGTGAACTAGGCATCGAACATGATGACCTGGCGAAGCTGGTCGCCCGATGCCAGGGCGTCGAAGGCGGCGTTGATGTCCTCCAGCCCGATGCGGCGCGAAATCAGCCGCTCGACCGGCAGCCGGCCTTCGCGCCACAGCCTTTCCAGGAACGGAATGTCGCGCCGGGGAACCGACGAACCAAGATAGCTGCCGACGATGCGGCGCGCCTCGGCGGTCACCTGCAAGGGCGATATCTGCGCCATGGCGCCGGGCGCGGGCAGGCCGACCGTCACGGTCGTGCCGCCGGGCGCGGTGATGGCGAAGGCGGTTTCGAACGCCTTGGGATGGCCGGCGCATTCCATGACGATGTCGGCCCTGATCCCCCGGTCGACGGCGTCCTGCGGCGACAGCGCCTCGTCGGCGCCCAGCGAACGGGCTAGGGCGAGCTTGTCCGGAAGCGCGTCGATCGCGACGATCCGGCGCGGACGCAGCGCCTTCGCGGTGATGAGCGCGGCCATGCCGACGCCGCCCAGGCCGATGACCGCCACCACGTCCGCCTCGGTAGGCCGCGCCTCGTTGAGGAGCGCGCCGCCGCCGGTGAGCACGGCGCAGCCCAGCAGCGCCGCCACGTCCGGCGGAACGTCGTCGCCCACCGGGACGGCGGAACTGCGATGCGCGACCAGGTGATCCGCGAAGGCGGAGACCCCCAGATGGTGGAGGACGGGGTGGCCGCATTCGCTGAGGTGCACGGCGCCGTCGAGCAGCGTCCCCGCCCCGTTGGCGGCGGTGCCGCGGGTGCAGGGAATGCGCCCTTCGGTCCGGCATGCGTCGCAGGCGCCGCAGCGGGGCATGAAGGTGAGGACGACGCGCTGTCCCAGGGCGAGGTCGTCGACATCCGGTCCCAGGGCGACGACGCGCCCGCTGGATTCATGCCCCAGCAGCATGGGCAGCGGGCGCCTGCGGTTTCCGTTGATGACGGACAGGTCCGAATGGCAGAGGCTGGCCGCCTCGACCTTGACCAATATCTCGCCGGGGCCGGGCGGCGTCAGGTCCAGCGTGCCGATCCGGATCGGCCGGCTGTCGGCATAGGGCCGCGGCGCGTCCGACGTTTCCAGGATCGCGCCCCGGATCGTCATCGACCCGTCGCTCCGTCCATCCATGGCGGGGGATGCTGCGCTCATGCGATCAGACCCCGGCCATGGAGCCGCCGTCCACGACAAGGGCGGTTCCGGTGATCCAGGACGCGTCGTCGGACGCCAGGAAACTGACCGCGCCGGCAATGTCCTCCGGCGTGCCCAGCCGACCGAGCGGATGACGCCGGGCGAGATCCTCCCGCGTGAACGCCTCGCCATGATGCGCCAGCACCGCATCCACCATCGGCGTGTCGATCACGCCCGGGCAGATGCAGTTCACCCGGACTCCCGATGCCGCGAAATCGAGCGCCATGCACTTCGTCATTCCGACGACCGCGCTCTTGGAACTGTTGTAGGCGACCGCGCCCGGCGTGCCCTTCAGCCCCGCGACGGAGGCGATGTTGATGATCGATCCCCTGGTCTGCCGAAGGTGGGGCACGCAATGCCGGGCGATCAGATAGGCGCTGGTCGTGTTCACTTCCAGCACCCGCGTCCACCGCGCTTCGTCGATGTCGAGCACGGTGCCGCCCGCCCACATGCCCGCCGCATGCACGACCACGTCCAGCCGCCCGGTCCATCCGACGATGGAGGCGACGGCGAATTCGATCGAGGCGGCGTCGCCGACGTCCAGCGTGGCCGCCCGCGCCCTGCCCTGTCCGCCAATCGCCTCGGCGACCTGGCCGGCGGCCCGCGCATCGATGTCGGTGACGAGGACGGACGCGCCCTGCTCCGCGAAGCGCCTGGCAATGGCTTTTCCGACGCCCCCCGCGGCGCCGGTTACGAGAACGGATTTATCCTGCAT
Protein sequences of DBSCAN-SWA_14 >NC_020542|136929:142602|136929_137919_-|WP_015449279.1|DBSCAN-SWA MTEAIELRVPLEPGAQGVVDQYRAAALPPLEQGTAQQAREGYRATSLRAGFEPERDVRARDLTIGAGGNPLRTRIYEPEAAIGPRPLIVYLHGGGWVMGDLDTHDSTCRLLARQAGAILVAVDYRLAPEHPFPAAVEDAVAALRHVPEAVAGLGIETAGLAVAGDSAGATLALVAALALRDEATPSALALFYPAGDMGRRTASMDEFSSGYPLTSSTLRWFRDLYLDDVALAGDWRASPARHPDFGALPPTLILSVGHDPLRDDTIALAGRLKREVRGSEHRHLPDQMHGFLTLGGVIAPARGELDHAARFLRQAMGLGPVLEREGESA >NC_020542|136929:142602|140693_141803_-|WP_015449282.1|DBSCAN-SWA MTIRGAILETSDAPRPYADSRPIRIGTLDLTPPGPGEILVKVEAASLCHSDLSVINGNRRRPLPMLLGHESSGRVVALGPDVDDLALGQRVVLTFMPRCGACDACRTEGRIPCTRGTAANGAGTLLDGAVHLSECGHPVLHHLGVSAFADHLVAHRSSAVPVGDDVPPDVAALLGCAVLTGGGALLNEARPTEADVVAVIGLGGVGMAALITAKALRPRRIVAIDALPDKLALARSLGADEALSPQDAVDRGIRADIVMECAGHPKAFETAFAITAPGGTTVTVGLPAPGAMAQISPLQVTAEARRIVGSYLGSSVPRRDIPFLERLWREGRLPVERLISRRIGLEDINAAFDALASGDQLRQVIMFDA >NC_020542|136929:142602|138649_140629_-|WP_015449281.1|DBSCAN-SWA MSANPIDHADAFDTYLPIDIDDAELERILEKAELPALLAALAKLTGDTSLVPPSLKPPHTRKSFIPMPQGGLDEAQQKEARRLAFDAIRRYRDGEIQTPGHTDDAELRAIVSFLITRDFDDYRDLLIHEVADHDVGAPRWTLSELDPDRPFKVLIIGAGITGVVAAYRLKQAGVEHVVLEKNPEIGGTWWENTYPGCRLDTSNYAYSFSFAQKTDWPQKFSRQSEIQDYVLKVAHDYGVRDRIVFNTLVRSVIYDEEAGEWEVTAVTDGEVRVHRANVVISGTGQLNSPKLPNIEGMDSFAGASMHSARWNHDVSLKGKRVAIIGTGASAYQIAPAIADEVGKLEVFQRSAPWLLPTADYLDDISPELMWLFRHIPGYARWYRFWQFWLATEGRMPTVTADEGWSEEGSIGAINHAFRQELEAEFRRQLADRPDLIDKVIPTYPAGAKRLLRDNGQWSATLRKPHVDLVTEGIERIVPEGVVTADGVLHEVDVIAYATGFRADEFLADMEVRGRDGVEVHEYWDGDPRAYATMTVPGFPNFFICGGPNSAVAANGSAIFIVECSIEYILECIRTLVVKRLKDMEPTDAATDAFVGKIDRANRARAWGAEGVTSWYRNKTGRVSAVWPFELLEFWNMTRGPAEGDYRFTPLGELTTQAAE >NC_020542|136929:142602|141852_142602_-|WP_015449283.1|DBSCAN-SWA MQDKSVLVTGAAGGVGKAIARRFAEQGASVLVTDIDARAAGQVAEAIGGQGRARAATLDVGDAASIEFAVASIVGWTGRLDVVVHAAGMWAGGTVLDIDEARWTRVLEVNTTSAYLIARHCVPHLRQTRGSIINIASVAGLKGTPGAVAYNSSKSAVVGMTKCMALDFAASGVRVNCICPGVIDTPMVDAVLAHHGEAFTREDLARRHPLGRLGTPEDIAGAVSFLASDDASWITGTALVVDGGSMAGV >NC_020542|136929:142602|137915_138653_-|WP_015449280.1|DBSCAN-SWA MSRLTGKTALITGAARGLGAAYVRRFVAEGARVLVTDVIDEEGRLLAGEMGADAADYLRLDVSCGESWGAAADRCVERFGRIDILVNNAGVGGGGALETTSDDQWERQIAVNLGGVFYGMRACIPRMAAAGGGSVINISSINGVRGNRERYGYVASKFGVIGLTKAAALDFAAAGVRVNAVLPGMISTPMTAGLKVDTSLIPLGRPGGMDEVAQVVAFLASDEASYVTGAEFLVDGGVVQKVVGQ |
5 | Acanthamoeba_polyphaga_mimivirus(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_15 |
156747 : 160427
Sequences of DBSCAN-SWA_15
Nucleotide sequences of DBSCAN-SWA_15 >NC_020542|156747:160427|DBSCAN-SWA CTTAGGCGGCGCGGCTATCCGGCGGCAGATTGAGGGCCTTGCGGCCAGCGGAAGCGGCTGCTCGATCCCAGAGGAAATCGCCCGAGAAAGCAATATGCTCCCAACCTACTGGGGACGTATGGGCGAGTAGATCATCAGGCACCGCGACCGATGTTGATCGTAGATGCTGGGCGGCGGCATCCATGTAGATCGTGTTCCAATAGACGATCGCGGCGACGAGCAGATTGAGGCCGGACGCCCGATATTGCTGCGCCTCGTGGCTGCGGTCGATGATGCGGCCCTGGCGGAATGTGTAGATCGCCTGCGTCAGGGCATGGCGCTGCTCGCTGTTGTTGAGACCGGCATGGCATCGCCGGCGCAGATCGGGATTTTCTAGCCAGTCCAGCATGAACAGCGTCCGCTCGATCTTGCCGATTTCCTGTAGCGCGACATCGAGCTGGTTCTGCCGTTCGTAGGCAGCGAGCTTTCGCAGCATGACCGATGGCGCGACATGGCCGGCCTTGATCGACCCTACGAGGCGCAGCACGTCTTCCCATTGCTCACCGATGATGTCGGTGCGGATACGCTTGCCCAACAGCGGGGTGATGGACGGATAGGCCGTGACCGGCGCGATCGGCGCGAGCCGCCTGTCGGGGAAGTCGCGCAGGCGCGGGCAGAAGCGAAATCCCAGCATCGCGCACAGGGCGAAGACATGATCGGTCGCCCCGCCGGTGTCGGTGTAATGCTCGACGATCCTGAGATCGGTGCCGTGGCTGGTGAGGCCGTCGAGGACATAGGGCGCCTCATGCGTCGCGGCCGAGATCACTTTGACGTGGTAGGGCGCGCGCTGATCAGAATTATGGGTGTAGAAGCTGAAGCCATGATCGACGCCATAGCGGGCGTTGATATCGCCGCCCGACGCGCCGCGCTTCGCGGCCCTGAAGAACTGGCCATCGGAACTGGACGTGGTGCCGTCGCCCCATACTCGTGAGAATGGCAGGCGGTGGTGCGCGTTGATGAGGAGGGCCAGCGCCGCGCGGTAGCTGTCGTCGCGGATATAGGCGTCATGGGTCCAGAGGAGCTGATCGCGCGTGACGCCCTGACTCGCCGCCGCCATGCGCGACAGGCCGAGATTGGTCGCATCGGCGAGGATCGTGGCGAGCAGCGCGTTTTCATTGGGACACGGCTCGCCGGTACGCAGGTTGGTGAACGCCGCAAGAAAGCCGGTTTCCTGCGCCACCTCATGCAGTAGCTCGGTGATGCGGATGCGTGGCATCATGGCATCGAGCCGGTCGGCCAGCGCTTCGGCGTCGGGCGTCGCGATCGTGCGAACGGGGGATATCTGGAGGCGGTCGTCGCGATATCGCACACCCTCCAGAGCGTCGCGCTTCAGGCGCTGGGCAAATTTCTTAAGCCGCCAGTCAAGTTCGCGGCCCCGCTGTTCGAGCCATTCGCCGGCTGTGGGTGGCAGACCGAGAGCCGACACGATCGGCTTCGCCTGGGGTTCGCTGAGCAGGTAGCTGTCGAACCGGCGGTATCCTGCCGATCGCTCCACCCAGACATCCCCGGAACGCAGCTTGTTGCGCAGATGCGCGATCGTCGCGATTTCCCAGAGACGCCGGTTGATCTTGCCGTCATCGCCCACGACGATTTTCCGCCATTCCTTGCGGAAGGGCATCGGAGCGTCGGCAGGCAGATCGCGTTTGCCCGACCTGTTGAGGTCGCGCAGCATCTCGATGGCGGCGATCGTTTTTGCACTGCCCTTTCCGGCCCTGAACTGGAGCGCCTCAAGCAGGTCGGGGGCGAACTTGCGCAACGTCGCATAGCGGTCGGCCGCCACCGTCAGCGGATCAAGGTTGGCGGTTTCCGCGATTGTCGCCACTTCTGGCCTCGCCTTCAGGAGGGTGTTCCACCCGACCGATGCGTCCAGGGCCTCCATTGGATCTTCGCCGGTATCGACCGCATCGGTCAATGCGTCGATTGTCCTGCGAAAGATCAGCATCAGCCGCGCCACATTCTTTGACGTGGTGGCATAGCTGCGCGCCTGGGCGTTCTTCGCGCGGGTGAAGATGCTGCCGATCAGCTTGTCCGCCATCTCCAAGGCGTTATCGGTCAGTCGTTCCTCGAGGTCGAGCAGGAACGCAACCAGCGTAGCGCGCCGTCGGGAGGGAATATATCGCTCGATCATATAGGCAGGCGAAGCGCGGCCTTCCCTGACATATTGCCGAAATCGGTCGGCATGGATGCGGCCAGCCACGTCCGGGGAGATGCCGATCTTCCGCACTTGCCGCAGGCGGTCGAGAATCTGGCGGATGTGATCGGGCCTGGCCGCAACGGGAATGGTCTTGAGCGAAGCGAGTTGGCTCATGCCGCCGGCGTCGTCAAAGACCTGGTCGAGGGCGTGGACCTGTTCGGCGCTAAGATCAGCGATCAGGGCGTAGGCAGCCTGCTTGCGGGCACGCGCGCGTCCCGCACTACTGGCGCGTTCGATGGTGGAGATGGACGGCAGTAGAATCCGGGCCTCGCGAAGGGCGGTGACAACGCCGATCGCTATCGTCATCCCCTTGTCGGTCGCCCATGCCGTTTTCGCGGCCGCTTCGATCATGAAGGGAATGTCGGCGCGGGTTGGTCCACGAAGGCCCGCTCGCATCGCCAGCTCGCGGGCATGGTCCGTCATCGTCTGATCCCGCGCTGCATAGTTGGCCAGGTCGGTTACGTGTAGACCAAGTTGTTCCGCGACGAAGGCGGCGAGATCATGGGGTATCGCTCCCTTGTCCTGTATCAACTGCGCCAGTGTGATGCCCGGGTGCCGCAAGAGCGCGAGTTGCAGCGCGACACCCAACTGGTTCCGTCGCTCCCGTCGTGCGCCGATGATCTCGATGTCCGAAGGCTCGAAGGTGTAGAGCCGGGCTAGATGGTCGCGCTCGCTCGGGATGGCGAGGATCTGATCGCGTTCGCTCTCGGTCAGGAGTTGATGCTTACGCTTCGTCATTCCGATTCCCGTCCACAAAAGGTCATCTGGGCCTATGGACAGCCATGAGTAAAGAGACATAGTATGTGGACGGATATGGATGGGGCGAAGCGATCGCCCTTCAGATCGTCCACAGAACGACCGTTTGGGAGACGTGCGGCATGGGTGATATTCTGGGCTATGCCCGCGTCAGCACCGGCGATCAGGACGTTGCGGGCCAGACCATGCGTCTGGAGAAGGCTGGCGCCATCAAAATCTTCACCGATGTCATTTCAGGTAAGAGCATGGAACGGCCCGGTCTGGCTGAGCTAATCGCCTATGCCCGCAAGGGTGACACGCTGGCGGTGGTCCGCCTCGATCGGCTTGGGCGCTCGCTTGCCGAACTTCTCACCACGGTTGAGACGCTACGCGGTCAGGGCATCGCGCTCCTGAGCCTTGAAGAAAAGATCGACACTTCGTCAGCTGCCGGCGAGCTCATCTTCCATGTGTTTGGGGCCATCGCCCATTTTGAGCGACGGCTGATTTCTGAGCGAACCAGAGATGGTATTGCCGCCGCCCGTGCTAAGGGCAAACAGCCTGGCCGTCAGCCGCTGGACATGTCCAAAGTGGATGCGGCCATCAAGCTGGTCGAAGCTCGTATCTCGCCTACCGAAGCAGCGCGGCAACTCGGCATCGGCCGATCGACCATTTATCGCGAAATGCGTAGGATGGGAATCGAACGGCCCGCCTGA
Protein sequences of DBSCAN-SWA_15 >NC_020542|156747:160427|156747_159717_-|WP_015449292.1|transposase|DBSCAN-SWA MTKRKHQLLTESERDQILAIPSERDHLARLYTFEPSDIEIIGARRERRNQLGVALQLALLRHPGITLAQLIQDKGAIPHDLAAFVAEQLGLHVTDLANYAARDQTMTDHARELAMRAGLRGPTRADIPFMIEAAAKTAWATDKGMTIAIGVVTALREARILLPSISTIERASSAGRARARKQAAYALIADLSAEQVHALDQVFDDAGGMSQLASLKTIPVAARPDHIRQILDRLRQVRKIGISPDVAGRIHADRFRQYVREGRASPAYMIERYIPSRRRATLVAFLLDLEERLTDNALEMADKLIGSIFTRAKNAQARSYATTSKNVARLMLIFRRTIDALTDAVDTGEDPMEALDASVGWNTLLKARPEVATIAETANLDPLTVAADRYATLRKFAPDLLEALQFRAGKGSAKTIAAIEMLRDLNRSGKRDLPADAPMPFRKEWRKIVVGDDGKINRRLWEIATIAHLRNKLRSGDVWVERSAGYRRFDSYLLSEPQAKPIVSALGLPPTAGEWLEQRGRELDWRLKKFAQRLKRDALEGVRYRDDRLQISPVRTIATPDAEALADRLDAMMPRIRITELLHEVAQETGFLAAFTNLRTGEPCPNENALLATILADATNLGLSRMAAASQGVTRDQLLWTHDAYIRDDSYRAALALLINAHHRLPFSRVWGDGTTSSSDGQFFRAAKRGASGGDINARYGVDHGFSFYTHNSDQRAPYHVKVISAATHEAPYVLDGLTSHGTDLRIVEHYTDTGGATDHVFALCAMLGFRFCPRLRDFPDRRLAPIAPVTAYPSITPLLGKRIRTDIIGEQWEDVLRLVGSIKAGHVAPSVMLRKLAAYERQNQLDVALQEIGKIERTLFMLDWLENPDLRRRCHAGLNNSEQRHALTQAIYTFRQGRIIDRSHEAQQYRASGLNLLVAAIVYWNTIYMDAAAQHLRSTSVAVPDDLLAHTSPVGWEHIAFSGDFLWDRAAASAGRKALNLPPDSRAA >NC_020542|156747:160427|159857_160427_+|WP_011607925.1|DBSCAN-SWA MGDILGYARVSTGDQDVAGQTMRLEKAGAIKIFTDVISGKSMERPGLAELIAYARKGDTLAVVRLDRLGRSLAELLTTVETLRGQGIALLSLEEKIDTSSAAGELIFHVFGAIAHFERRLISERTRDGIAAARAKGKQPGRQPLDMSKVDAAIKLVEARISPTEAARQLGIGRSTIYREMRRMGIERPA |
2 | Salmonella_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_16 |
171179 : 182075
Sequences of DBSCAN-SWA_16
Nucleotide sequences of DBSCAN-SWA_16 >NC_020542|171179:182075|DBSCAN-SWA TTCAGGCATCGTCGATGCCCGTGACGCTGGGATCACGGAGCGGCCGCAGCTCGCCGCGCGCGACCATGTCGGGGATGGTGAAGGCGTAGCGCCCGAGCATGTTGATGTGTTGGGAGCCGAGCGGCGAGAGGCGCGCGACGTCCTCGTCGAGGATCACATGACCTTCGGTGCGGAGCTGGTTGAGCGCGGCGTCGATGTAGATCGTGTTCCAGAGCACGACCAGATTGACGACGAGCCCAAGCGCGCCGAGTTGATCTTCTTGCCCTTCGCGGTAACGCTGACGAAGCTCACCGCGCTTGCCGTGGAATACGGTGCGGGCGAGCTGGTGACGGCTCTCGCCCCGGTTGAGCTGGACCAGGATGCGTCGCCGGTAGGTTTCGTCATCGATGAACCGCAGCATGTAGAGCGACTTGATGAGGCGGCCAAGTTCCTGGAGCGCGCGCGCTAGCTTGGTCGGGCGATCGTTGGTCTGAAGGACACGGGTGAGCCCAGTCGCGCGCACGACGCCGAGTTTCAGCGATCCGGCCAGGCGCAGCAGGTCATCCCAGTGCTCGACGATCAGCTTGGTGTTGATCCTGTTCGACGCGAGGTCGTCCAGCACACCGTAATCGGCCTTTCCATCGACACGCCAGAACCGCGCGCCGCCGATATCGGCGATGCGCGGCGAGAACTGGAGCCCGAGGAGGTAGAAAACGCCGAAAATCGTGTCGGTATAGCCGGCCGTATCGGTCATGATTTCGCTTGGCCGCAATTCGGTTTCCTGGTCGAGCACGACCGCCAGCAGATGGAGGCTGTCGCGCAACGTACCGGGGACCGTCACGGCATTCAGCCCGGAAAAACGGTCGGAAACCAGATTGTACCAGGTGACGCCGCGTTCGCGCCCGAAATAGCGCGGGTTCGGGCCGGCGTGGATCGTGCGGACCGGCACGGTGAATCGCAGCCCATCGGCCGAGGCGACCTCGCCGCCGCCCCAGGCACGGGTCAGTGGAATCCTGTTATGCGCGGCGACCAGTAGCGCGTTGGCCGCCGTCAGCGTTTCGGCCCGCATGAAGTTCTGCTTCACCCAGCTCAGCCGCGACCGGCGCAAGGCGGGGACCTCCGAGCGGACAAGCGGCTCGAACCCGGTATTGGTTGCCTCCGCGACAAGCACTGCACAGATCGTGGTCGCTATATCCTCAGCGCGCGCATTGCCCTCACTGGCATGAGTGAACGCTGCCCCGAACCCCGTTCGCGCGTGCATCTCCAGGATCAGCTCGGGGAGATCGAGCCGCGGCAGTCGCGCATCGATCGCATTATGCAGGTTGGTGAGGCTCAATGGCTCGTCGATCTTATCGAGCGGTGCGATTGATAGGTCGGCGGCGCTCGCGGTCTTCGTAATTGTCACCGCATCGTTGTCCGGCACGCGCGCGGCTGTTTCGCGGTAGGCAAGATCGAGCCGCGCCGAAAGTTTGGCGATTTCCTCGTCGCCAGCGACCGCCACGCCGACCGTGCGGCATACCGTCGGTCGCGCCGCCTCCCAGCCAGCTCCCGTCAGTAGGCCCTTGCGAGGATCGGCATAGCGAAGTGAGCGAACGGGAAAGACGTCGCGGCGACGGATGGCGCGGCGTAGTCGGTCGAGCGTACAAAGCCGGTATCCGATCAGATCGAATGCACCGTCCGCGGTCTTCAGCTGGCGGGCCCAGGCCTTGGGTACGAAAGATGTTGGTACCGGTCCACGGCGCTTTTCACCCTGGTGAACTCTCCGCAGGTGATCGACCGCGTCCAACAGCGGCTGCCCCGCTGGCGCCGCCGCAAGATCGAGCCCAGCAAGCATCTTCGGCAGATAGCGCATCGTGCCAGTCTGCTGGCGCAACTCGACGAAGTAGCTCTCGTCGGTCGGCCGGGCGAGCAGGTTCACCTGCGCGACGGCCTCGGTCAGCGCTGCCCGGTCGACGAGCGCGAACACGGCTGCGCGGACCTCGGTGTCGCCGATACTGTCATCGAGCAGCACCGCCCCGACATCGCGAAGCCGGAGCGATGCGGCGTCGAGATCGCGCAGTGAGCGCATCCGCGCCTCCTTGGCGTTGATGCGCGCATTGGAGAACATCTTGGTCGACACGGCGTCGAACAGGTCGATGACATCGTCGCTGGCCGACGCTTCAAGCGTCCGGATGAAGGCAACCAGGGTCGCCGCGCGGCGATCGTCCGGCAACCGGGCGACGGCCTGCGCTTTCGCGGCGCCGGCAAATCGCGCTAATGCGGCGGCCTTGCCGGGCGGTACGCGGTCGAGTTCGGGCAAGCCTGTGGTGAATGTACGGATTTCGGTCAGTCGGTCGATGGCCCGGCTGATCTCAGGGCCGCTTTGCAGATATGGCCCATCACGGAGTCGATCGAGCGGGCTTTGCCGTCCATCTTCGGCAACAGCGACGAGCGTATCGAGCCGGGTACGTTGCTCTGACGTCAGCTTGTCGACCAGGCGGCGATGGACGTGCGCGGCGACTCGCGTGCGGACCCGTGCGATATCGCGCTCGAGGACGGACAATCCCGGTAACATCACCTTGAACTCGAGCAGCCATGTCGCGGCTGCGTCGAACAGGGCCGACGGTCGATCGGTGCCCGTCCAGCACAGCGCATACAGGAAACGATGCAATCGGAACGCGACGCCGGGATCGGAGAAGACTCGGTAGCCATAATGAATGCGGATGCGCGGGCCGTGCCGCCATCGTCCCTTGGTCGCGCAATATCGCGCCATTAGTTCGGCCGATCCGTCGATCGCCAGTTGGTCGCCAGCAAATCGCGTGACGGACGCAGGTATTTGCGCCGGATCTTCGAGAAAAGTGCCGAGCAGCCTCAACGAGCCTAGCTGCACCGCGACACCGAGCCGGTTGTGGTCGCCGCGGTGTGCGCCGATGAACGCCCGGTCCGCATCGTCGAGATGAAAGTGCCGCGCCAGCTGTTCCGAGGTCGGATCGCCTACGAAGCGACCATATCGCAGTGCCTGGTCGTCCGAGAGAAAGCTGACTGGCACCGCTTCACGCTATGCCGCTGCCGCGGATCTGGTCCCGCGCTTGGCCAACAGTTCGCTGGCCCGCTCTCGCGGCTGTCCTTTGGCGTCGACATAGGCATAGAGTGTCGACACCGATACTCCGAGCTGGACAGCGACGTCGCGCGCGGCATTGTCGCGGTCTGCCATCAACGCCATTGCGGCCTTCAACTTCTGTTTCGTCATCACCCGTGGCCGCCCGCCCGCGCGGCCACGGGCCCTGGCCGCAGCCAGCCCTGCCATCGTGCGCTCGTGGATCAGGTCGCGCTCGAACTCGGCCAGGGTTGCGAAGATGCCGAACACTAACCGGCCCGTGACGGTCGTCGTATCGATATCGCCGGTCAGCACCTTCAGGCCGACGCCGCGCTTCTGCAGATCTCCGACCAGCCCGACGACGTGAGGCAGCGACCGTCCAATTCGGTCGAGCTTCCAGACAACAAGAGCGTCGCCATCACGCATGATATCGATTGCCTTGTCGAGCCCAGGCCGTTCGGTCGTACGGCCCGAGCAAACGTCGTCGAAGATCCGGTTGCATCCGGCACGATCGAGCGCATCACGTTGCAGATCGAGATTCTGCTCGCCCGTCGACACCCGCATGTACCCGATCAGCATCGACGCCCGCTCCCTCTTGCATCAATCTATCGAACGATGGGTTTACCGGCTATAGGTTTCTGGACGGGTAGATGTGAGATTGAAGGCCGTCCAGTGGGTCGATCGCGTTCGTCGGACCGCTTCGCATCAAACCGGGGTTTATTGGAAAAACCAGACGGCCGCTTCCGCCCCCTTGCGGACATTCTGTGCAGACTGTAGCATCGTCGACATACGCTGGCTGATCATCATCGCTTCGTTCGGTTACGTGATGACAGACGCATCCGTCTCAGCACAGCCGATGCGGCAGCCTAGCTGGGATGAGGTGAAGGAGACGGCCAAGGCCTGCTCTCTGGTGGTTCACCGCATTGATGTTCCTGCGAGCCCGGTCGGTTCGGGAGAATGGGCTTACTACATCGACCGGCAATCGACCTCGAGGCAGCGGACGTGTTTCTACAAGCGACTGGGCATCGGAGAAGTCGAGAAGACGATGCGGGAGTTTGACTTCGCTGGCGCGAGGCAGTCTGGCTGTCCGGCGTCAGCGCCCATGGCACAGATTAGGGGTTGAGATTTAAGGAGGGTTTGGGCTTCGTCGTAGTGACGAAGGAACGAAGATGAAGCCCAAACCCTCCTTGCGAAAATCGCCAGCAAAGGCCTCTGCCGAAGCCGTGGTGAAAGCCATTCGCCGACAGACGCGGCGGCATTTCTCGGCCGAGGACAAGATCCGCATCGTGCTCGACGGATTGCGCGGCGAGGACAGCATCGCCGAGCTGTGTCGGCGTGAAGGCATCGCCCAGAGCCTCTACTACACCTGGTCGAAAGAGTTCATGGAGGCCGGCAAACGCCGGCTCGCGGGCGATACCGCCCGTTCTGCGACCACGGGCGAGGTGCAGGATCTGCGCCGTGAAGCCCGTGCCCTGAAGGAATGCGTTGCTGACCTGACGTTGGAAAACCGCCTGCTCAAAAAAAGCATGATCGCGGATGGGGGCGACGACGAATGAGGTACCCCGCCGCAGAAAAGCTGGAGATTATCAGGATCGTCGAGCAGTCGCACCTGCCGGCCAAGCACACTCTGGACAAGCTGGGTATTCCTCGCCGGACGTTCTACCGCTGGTATGACCGCTTCCTTGAGGGCGGTCCTGAAGCCCTGGAGGATCGCCCCTCGGCGCCGAGCCGGGTGTGGAACCGCATCACCGAGGATATCCGCGCGCAGATCGTCGAGATGGCGCTCGAGGCGACCGAGCTTTCCCCTCGCGAACTGGCGGTGCGCTTCACCGACGAGAAGCGCTATTTCGTGTCGGAAGCCACGGTTTACCGCCTGTTGAAGGCCCATGACCTGATCACCAGTCCAGCCTATACCGTGATCAAGGCCGCTGACCAGTTCCACACGAAGACCACCCGGCCGAATGAGATGTGGCAAACCGACTTCACCTACTTCAAGATCATCGGGTGGGGCTGGATGTACCTGTCGACCGTGCTCGACGACTACTCGCGCTACATCATTGCCTGGAAACTGTGCACCAACATGCGCGCCGAGGACGTCACTGACACGCTGGACTTGGCCCTCAAGGCATCGGGCTGCGACAGCGCCACGGTCCTGCACAAGCCCCGGCTACTATCCGACAACGGCCCCAGTTACATCGCGGGCGAACTCGCCGAGTACATCGAGGCCCAGCAGATTGTCCGGCGTCAGCGCCCATGGCACAGATTAGGGGTTGAGATTTAAGGAGGGTTTGGGCTTCGTCGTAGTGACGAAGGAACGAAGATGAAGCCCAAACCCTCCTTGCGAAAATCGCCAGCAAAGGCCTCTGCCGAAGCCGTGGTGAAAGCCATTCGCCGACAGACGCGGCGGCATTTCTCGGCCGAGGACAAGATCCGCATCGTGCTCGACGGATTGCGCGGCGAGGACAGCATCGCCGAGCTGTGTCGGCGTGAAGGCATCGCCCAGAGCCTCTACTACACCTGGTCGAAAGAGTTCATGGAGGCCGGCAAACGCCGGCTCGCGGGCGATACCGCCCGTTCTGCGACCACGGGCGAGGTGCAGGATCTGCGCCGTGAAGCCCGTGCCCTGAAGGAATGCGTTGCTGACCTGACGTTGGAAAACCGCCTGCTCAAAAAAAGCATGATCGCGGATGGGGGCGACGACGAATGAGGTACCCCGCCGCAGAAAAGCTGGAGATTATCAGGATCGTCGAGCAGTCGCACCTGCCGGCCAAGCACACTCTGGACAAGCTGGGTATTCCTCGCCGGACGTTCTACCGCTGGTATGACCGCTTCCTTGAGGGCGGTCCTGAAGCCCTGGAGGATCGCCCCTCGGCGCCGAGCCGGGTGTGGAACCGCATCACCGAGGATATCCGCGCGCAGATCGTCGAGATGGCGCTCGAGGCGACCGAGCTTTCCCCTCGCGAACTGGCGGTGCGCTTCACCGACGAGAAGCGCTATTTCGTGTCGGAAGCCACGGTTTACCGCCTGTTGAAGGCCCATGACCTGATCACCAGTCCAGCCTATACCGTGATCAAGGCCGCTGACCAGTTCCACACGAAGACCACCCGGCCGAATGAGATGTGGCAAACCGACTTCACCTACTTCAAGATCATCGGGTGGGGCTGGATGTACCTGTCGACCGTGCTCGACGACTACTCGCGCTACATCATTGCCTGGAAACTGTGCACCAACATGCGCGCCGAGGACGTCACTGACACGCTGGACTTGGCCCTCAAGGCATCGGGCTGCGACAGCGCCACGGTCCTGCACAAGCCCCGGCTACTATCCGACAACGGCCCCAGTTACATCGCGGGCGAACTCGCCGAGTACATCGAGGCCCAGCAGATGAGTCACGTCCGTGGCGCCCCGTTGCATCCGCAAACGCAGGGCAAGATCGAGCGCTGGCACCAGACCTTGAAGAATCGCATCCTGTTGGAACATTACTTCCTGCCCGGCGACCTCGAAGCCCAGATCGAAGCGTTCGTGGAGCACTACAATCACCAGCGCTACCACGAGAGCCTGAACAACGTGACGCCCGCTGACGCCTACTTCGGCAGGGCTCCCGCTATCATCAAGCGTCGCGAAACCATCAAGCAAAAAACCATCGAATATCGCCGCTTGCTTCACCGCAAGCTCGCCGCCTAACATCAACCCCCAGACGAGGCTCGCTCTCCGCTAATCCACGCCCCGATCTGTGCCAAATGTCTCGACGACGGACACCGCCGAATTCTAGCATCTCCTGCAGCGTCCACAGGATAAGCCGCTCGTCCTCAACCAGCAGCAGCACCTTGGCGTCCGACATGCGACGTCCCCCGTTCCTCCTTTTCCGATGCCGCGGCAGGAGTTTGTATCGCAATACGAACGAAAATAGGTGCGGGTTGGTTCCCCGGTCCCGTCCTTGCACGTCGTAACCGACCGATCCTAGGCCGTTCTACTGCGTGCGTGGTTGGAGTTTTGTTCCGAGATGGCGTGAGCGACCAAGTTCGGCATGGCGGTTTGCGCTGCGGGTCTTGAGCAGATCCTGGCGCGAAAGGATGCCGAGCACGCGCCTGGTTCCGGGCTCGATGATCGGAATGCGACCGATGCCGGATTCGACGATGAGATCGGCGACGACGCCGGTGGGCGTATCGGGATAGGCAACGGGCTGTGCTGCGTCCGAGATGGCGTCGATGAGCGGCGTTTGATCGTCGTCGCGCTCGCCCTGCCAGCGCAGGGCATCGGTGCGAGAGACGAGACCCAGCAATCTTCCTTGCGGGTCGGTCACCGGATAGCTGCGATGCACGGCTTTGGTTGCGAAGAAGGATACCGCCGCCTCCACGGTCATCGTGCCCGGTAGGGTCGCCGGATCGCGCGTCATGATCTGACCGGCCTGGATGAGGTCGAGCGGATCGACCGTATATTCCTGCAGGATATGCCGTCCGCGCCGCGCGATTTTCTCGGTGAGGATCGATCGCCGCATCAGCAGCACGCTGATGGCATAGGCGCCGCCCGCAGCCGCGATCGTGCATGGGATGGCGCTGAAGTGCCCGGTGAGTTCGACCGCGAAGATCGCGCCCGTCATCGGCGCACGCATCGCGCCGCTCATGATTCCCGCCATGCCGATCATGGCCCAGAACCCCGGATCGCCGGGCAGGAACTGCCCGAGCAGGAAGCCGGCTGCGCCGCCCAGGATGAGGAGCGGGGCAAGAACGCCGCCAGACGTGCCTGACCCCAGCGCGACCAACCAGACGATGGCCTTCACGACGAGGAGCGCAGCAACGACCCGGAGCGATAACGAGCCATTGAGGAGAGCCTCGATGCTCGCATAACCGGCGCCAAGCACATGTGCATCGATGAGACCACCCAGACCAACGACCACCGCACCGATCGCCGGCCACCACATCCAGTGGAATGGCAGGCGATGGAAGAGATCCTCGATACGGTAGAGGGATGTCGATAGTAGAGCGGCTTCAAGACCGACGACGAGGCCGATCCCGCCGGAAGCGAGCAGGGCCCAGCTTTCCTGCGGAGCCATCGCCGCCATTGGAAACATCGGTCCCGATCCGAGCAGAAGCGGACGCCAGGCGAAGGAAACAAGTGCTGCAACGAGGACGGGGACGAAGCTGCGCGGTTTCCACTCGAACAGGAGCACTTCGACGGCCAGCAAGATCGCCGCGAGCGGCGTACCGAAGATGGCGGTCATGCCGGCTGCCGCACCGGCCACGAGGAGCGTCTTGCGCTCGGCCGCGCTGAGATGGAAGCATTGGGCGAACAGCGAACCAATGGCCCCGCCCGTCATGATGATCGGTCCTTCCGCGCCGAAGGGTCCGCCACTGCCGATCGAGATCGCTGAGGACAGGGGTTTGAGCAGCGCCACCTTGAGCGACAGTCGGCTCTCGCCGAAGAGGATCGTTTCGATCGCCTCGGGAATGCCATGGCCGCGAATCTTGTCCGATCCGAAACGCGCCATCAGACCCACGATAAGGCTGCCGATGATCGGAATCACGACCACCAGCCAGCCCACCGATGCGTCGGTTATCACGGAGTGATCGGCGGAGAGTCGACCGAACCAGAAGAGATTGGTGGCAATCGCAATGAGTTTGACCAGCACCCATGCGCCGAAGGCACCGCCCGTCCCGACCACGACGGCCATGACCGCCAGCATGACCATGCGGCGATCGACACTATGGTCGGCGAGTTGGTGGCCTTCAACGGAACGTGGGGGTGTCAGCGACATGAATCGATATTATCCTGCAAAAGTAGCTCGAGCCCATTATATCGCATTGCGATGTATTTCAATCAGTGAGGATCAATTGAGGCAAAAGGATAAATTGCTGGGGGATGCCGACTATGTGGCGCTGGCCTCATTTCGCCACGCCATTCGCCGCTTCCAGGCGTTCAGCGAGGAAAAGGCCATCGAAGTCGGGCTGACGCCGCAACAGCACCAGGCTCTCCTCGCGATCCGGGGCTGTCCGCCGGATGAAGCCACCGTCGGCCATGTCGCCGAACGTCTGATATTAAAGCCGCATAGCGCCACGGGCCTGATCAATCGCCTCGAAGCCTTGGCGCTTATTACGCGCGAGGCGGCTGCTACCGATCGCAGGCGGGCGCTGCTACGCCTTACCCCCAAGGCTTATGCCTTGCTCGACGCCCTGTCGGCTGTCCATCGCGAGGAAATTCAGCGCTTGCGGCCGGTCTTCACCGGCATTTTCGAACAATTGGGATGAGCAAGCAGCATTTAATGCAATTATGATGTATTGAAACAGAAATAACATTCATTTGTCTTATGTTATTCTAACCGGCACAACGCGATCTGCAGAGTGAAGTTGCTTCACGACAACAGCGCCCGGAGAGGATCCTGCGAAAGCAGGCCGGAGCGCGGGAAGGGAGCCAGAAACCGAGGATAAGTGCATGACGCAGACGGAGTATGGCGATCGGTGCAGGATGTGGCCGCATCGCTGGACGGCGGCGCAGGCGTCTCGGTTTAGCGGCTACTTCACCAGGTTAGACGCCGGCTCGCGGCCCCCCCTGCCGCAGCCGGCGTCTTTCCGAAGTTCGGTCAGGCCGGTCACTTACAAAATATTTGGGCCGCAAACTGGCCACTCAGCCGATCTGGCATCGTACTTCAAGTAACAGAGGGGCGTTGCTTCAATGACTGAACCCAAAGATCAGGCATCTCGCCTTCAGACAGTCGCCGCCGCGCACGGCGTTGCCAGCGATGTGGCCTTTGGAGCTGTCGCTCCGCCGCTTTATCTTTCGAGCACCTATGAGTTTGCCGGCTACGATCAACCGCGCTCCTATGACTATGGTCGGGCGGGTAATCCCACCCGCGATCTTCTCGGGCAGGCACTGGCGAAGCTTGAAGGTGGCGCAGGCGCGGTCATCACGGCGAGCGGCATGGCCGCGCTCGATCTTCTCGTCGGGAGAATCGGGCCGGGCGATCTTATCCTCGCACCGCATGATTGCTACGGCGGTACGATGCGCCTGTTGAAGGCGCGCGCGAACCGAGGGCATTGCGTCGTCCGGTTCGTTGATCAGGGGGACGAGAGCGCCTTTGCCGCCGCGCTGGAGGACATTCCGGCACTGGTGCTTATCGAGACGCCGAGCAATCCCTTGATGCGGGTCGTCGATATCGCAGAGCTTGCCGCAAAATCGCGCTCGGCGGGGGCTGCGGTTGCCGTCGATAACACGTTTCTTTCACCGGCTATCCAGCAACCGATCGCGCTGGGCGCGGACTATGTCATCCATTCCACGACCAAATATCTCAACGGCCATTCCGACGTGATCGGCGGCGCAGTTGTCGCGGCTGATCCGGTGCAGGTCGATGATTTGCGTAATTGGGCAAATGTCGTGGGGAGCACAGGCGCGCCGTTTGATGCCTGGCTAACCTTACGGGGCCTTCGCACGCTATTTGCCCGGATGGAGCAGCAGCAGCGCAACGCGATGATCGTCGCGCAATATCTCGACCGACATACGGCGGTGTCCCAAGTCCACTATCCAGGTCTCGTCGCCCATCCCGGCCACCGGATCGCCTCACGCCAGCAGCGCGGTTTCGGGGCGATGTTGAGCTTTGAACTCGCAGGAGGGGTGGAAGCGGTTCGGCGGTTTGTCGCTGCGGTCGGCTATTTCACGCTCGCGGAATCGCTGGGTGGGATCGAGAGCCTGGTCGCGCATCCGGCGACCATGACCCATGCCGATATGGGCGAGGAGGCGCGCTCAAGAGCGGGCATCAGCGACAGCCTGCTACGGCTTTCGATTGGCCTCGAAGCCGAACAAGATCTGATCGCCGGCCTCGATCAGGGATTGGCGGCATGCGCGGGATGA
Protein sequences of DBSCAN-SWA_16 >NC_020542|171179:182075|178203_179988_-|WP_015449309.1|DBSCAN-SWA MSLTPPRSVEGHQLADHSVDRRMVMLAVMAVVVGTGGAFGAWVLVKLIAIATNLFWFGRLSADHSVITDASVGWLVVVIPIIGSLIVGLMARFGSDKIRGHGIPEAIETILFGESRLSLKVALLKPLSSAISIGSGGPFGAEGPIIMTGGAIGSLFAQCFHLSAAERKTLLVAGAAAGMTAIFGTPLAAILLAVEVLLFEWKPRSFVPVLVAALVSFAWRPLLLGSGPMFPMAAMAPQESWALLASGGIGLVVGLEAALLSTSLYRIEDLFHRLPFHWMWWPAIGAVVVGLGGLIDAHVLGAGYASIEALLNGSLSLRVVAALLVVKAIVWLVALGSGTSGGVLAPLLILGGAAGFLLGQFLPGDPGFWAMIGMAGIMSGAMRAPMTGAIFAVELTGHFSAIPCTIAAAGGAYAISVLLMRRSILTEKIARRGRHILQEYTVDPLDLIQAGQIMTRDPATLPGTMTVEAAVSFFATKAVHRSYPVTDPQGRLLGLVSRTDALRWQGERDDDQTPLIDAISDAAQPVAYPDTPTGVVADLIVESGIGRIPIIEPGTRRVLGILSRQDLLKTRSANRHAELGRSRHLGTKLQPRTQ >NC_020542|171179:182075|171179_174185_-|WP_015449303.1|transposase|DBSCAN-SWA MPVSFLSDDQALRYGRFVGDPTSEQLARHFHLDDADRAFIGAHRGDHNRLGVAVQLGSLRLLGTFLEDPAQIPASVTRFAGDQLAIDGSAELMARYCATKGRWRHGPRIRIHYGYRVFSDPGVAFRLHRFLYALCWTGTDRPSALFDAAATWLLEFKVMLPGLSVLERDIARVRTRVAAHVHRRLVDKLTSEQRTRLDTLVAVAEDGRQSPLDRLRDGPYLQSGPEISRAIDRLTEIRTFTTGLPELDRVPPGKAAALARFAGAAKAQAVARLPDDRRAATLVAFIRTLEASASDDVIDLFDAVSTKMFSNARINAKEARMRSLRDLDAASLRLRDVGAVLLDDSIGDTEVRAAVFALVDRAALTEAVAQVNLLARPTDESYFVELRQQTGTMRYLPKMLAGLDLAAAPAGQPLLDAVDHLRRVHQGEKRRGPVPTSFVPKAWARQLKTADGAFDLIGYRLCTLDRLRRAIRRRDVFPVRSLRYADPRKGLLTGAGWEAARPTVCRTVGVAVAGDEEIAKLSARLDLAYRETAARVPDNDAVTITKTASAADLSIAPLDKIDEPLSLTNLHNAIDARLPRLDLPELILEMHARTGFGAAFTHASEGNARAEDIATTICAVLVAEATNTGFEPLVRSEVPALRRSRLSWVKQNFMRAETLTAANALLVAAHNRIPLTRAWGGGEVASADGLRFTVPVRTIHAGPNPRYFGRERGVTWYNLVSDRFSGLNAVTVPGTLRDSLHLLAVVLDQETELRPSEIMTDTAGYTDTIFGVFYLLGLQFSPRIADIGGARFWRVDGKADYGVLDDLASNRINTKLIVEHWDDLLRLAGSLKLGVVRATGLTRVLQTNDRPTKLARALQELGRLIKSLYMLRFIDDETYRRRILVQLNRGESRHQLARTVFHGKRGELRQRYREGQEDQLGALGLVVNLVVLWNTIYIDAALNQLRTEGHVILDEDVARLSPLGSQHINMLGRYAFTIPDMVARGELRPLRDPSVTGIDDA >NC_020542|171179:182075|180082_180478_+|WP_015449310.1|DBSCAN-SWA MLGDADYVALASFRHAIRRFQAFSEEKAIEVGLTPQQHQALLAIRGCPPDEATVGHVAERLILKPHSATGLINRLEALALITREAAATDRRRALLRLTPKAYALLDALSAVHREEIQRLRPVFTGIFEQLG >NC_020542|171179:182075|176553_177916_+|WP_144062188.1|transposase|DBSCAN-SWA MKPKPSLRKSPAKASAEAVVKAIRRQTRRHFSAEDKIRIVLDGLRGEDSIAELCRREGIAQSLYYTWSKEFMEAGKRRLAGDTARSATTGEVQDLRREARALKECVADLTLENRLLKKKHDRGWGRRRMRYPAAEKLEIIRIVEQSHLPAKHTLDKLGIPRRTFYRWYDRFLEGGPEALEDRPSAPSRVWNRITEDIRAQIVEMALEATELSPRELAVRFTDEKRYFVSEATVYRLLKAHDLITSPAYTVIKAADQFHTKTTRPNEMWQTDFTYFKIIGWGWMYLSTVLDDYSRYIIAWKLCTNMRAEDVTDTLDLALKASGCDSATVLHKPRLLSDNGPSYIAGELAEYIEAQQMSHVRGAPLHPQTQGKIERWHQTLKNRILLEHYFLPGDLEAQIEAFVEHYNHQRYHESLNNVTPADAYFGRAPAIIKRRETIKQKTIEYRRLLHRKLAA >NC_020542|171179:182075|174194_174812_-|WP_015449304.1|DBSCAN-SWA MLIGYMRVSTGEQNLDLQRDALDRAGCNRIFDDVCSGRTTERPGLDKAIDIMRDGDALVVWKLDRIGRSLPHVVGLVGDLQKRGVGLKVLTGDIDTTTVTGRLVFGIFATLAEFERDLIHERTMAGLAAARARGRAGGRPRVMTKQKLKAAMALMADRDNAARDVAVQLGVSVSTLYAYVDAKGQPRERASELLAKRGTRSAAAA >NC_020542|171179:182075|180902_182075_+|WP_015449312.1|DBSCAN-SWA MTEPKDQASRLQTVAAAHGVASDVAFGAVAPPLYLSSTYEFAGYDQPRSYDYGRAGNPTRDLLGQALAKLEGGAGAVITASGMAALDLLVGRIGPGDLILAPHDCYGGTMRLLKARANRGHCVVRFVDQGDESAFAAALEDIPALVLIETPSNPLMRVVDIAELAAKSRSAGAAVAVDNTFLSPAIQQPIALGADYVIHSTTKYLNGHSDVIGGAVVAADPVQVDDLRNWANVVGSTGAPFDAWLTLRGLRTLFARMEQQQRNAMIVAQYLDRHTAVSQVHYPGLVAHPGHRIASRQQRGFGAMLSFELAGGVEAVRRFVAAVGYFTLAESLGGIESLVAHPATMTHADMGEEARSRAGISDSLLRLSIGLEAEQDLIAGLDQGLAACAG |
6 | Enterobacteria_phage(25.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_17 |
187487 : 189720
Sequences of DBSCAN-SWA_17
Nucleotide sequences of DBSCAN-SWA_17 >NC_020542|187487:189720|DBSCAN-SWA GATGTTGATTGTGGAGACTATTGCCAAGATACGCAGGGAGCACAGGGACGGTAAGCCGATCAAAGAGATTGCGCGTGATTTACGGTTGTCGCGCAACACGGTGCGCAAGGCGATCCGTGCTCCGGAGGCGGATTTCAGCTACGAGAGGAAGGAGCAGCATCGTCCGCAGACCGGTCCATTTCGCGAACGGTTAGATGAGTTGCTGGCGGAGAACGAAGAGCGCCCCCGGCGCGAGCGACTGCGGCTGACGCGGATTCATGATCTGCTGGAACGTGAGGGGTTCACCGGCTCCTACGATGCGGTGCGGCGCTATGCGGCCCGCTGGAAGCAGGAGCGCCACGCCGGTGGCAGCGGGGATATGAGCAAGGTGTTCATCCCGCTCATGTTCCGGCCTGGCGAGGCCTACCAGTTTGATTGGAGCCACGAGGACGTGGAGATCGCCGGCAAGCCGATGCGGGTTAAGGTGGCGCATATGCGGCTATGCTGGTCGCGGGCGCCCTTTGTGCGGGCTTATCCGCGTGAGACCCAGGAGATGGTGTTTGACGCCCATGCCAGGGGCTTTGCTTTTCTCGGCGGGGTGCCGACGCGCGGCATCTACGACAACATGAAGACCGCGGTGACGACGGTGTTCACTGGCAAGGAGCGGGTGTTCAACCGGCGCTTCCTGATCATGACGGATCATTATGGCGTTGAGCCGGTGGCCTGTAGCCCGGCGGCAGGCTGGGAGAAGGGACAGGTCGAGAACCAGGTCCAGACCGGCAGGGAACGGCTGTTCAAGCCACGTCTGCGGTTTGCCAGCATGGAAGAGTTGAACGCATGGCTGGAGGCCGAGTGTCGCCGATGGGCCGAGCGCTATGCCCATCCGGATATGGAAGATATGACCATCGCCCAGGCACTGGAGATGGAACGACCCTCCCTACAGCCGCTCACCACGCCTTTTGACGGCTTCTTCGAGAGCGAACATGTGGCGAGCTCGACCTGCCTCGTCAGCTTCGATCGCAACCGTTACTCGGTCATGGCCGTTGCTGCCCGGCATGCGGTGCAACTGCGCGCCTATGCCGACCGGGTCGTCATCCGTTGTGCCGGCAAGGTGGTCGCCGAGCATGCCCGCCTGTTCGGCCGCAATCAGACGAAGTTCGATCCCTGGCACTATCTGCCGGTCCTGATCCGCAAGCCAGGCGCATTGCGCAACGGCGCTCCCTTCCAGGACTGGGATCTTCCGCCGGCCCTGGCCCAGCTGCGCCGCAAGCTGGGCAAAAGCGATGACGCAGACCGACGCTTTGTACGGGTACTGGCAGCGGTGCCCGAGGATGGCCTGGAGGCAGTCGAAGCTGCCGTGCGCGAAGCCATGGCGGCGGGCACGGCCAATGACGAGGTCATTCTCAACATCCTGTCGCGCCGACGCGAACCACAGCCTGTGCAGGCGATCAATGTTGTCGTCGATCTCAGGCTCAAGCATCCGCCCATTGCCGATTGCGCGCGCTACGATACGGTGCGAGGCCTCAATGCAGCGGCATGAGATGTTGGCAGCCCTCAAGGGGCTGGGCCTGAAGGGCATGATCGCCGCGTTCGACGATGCCGTCACCAATGGCATCCGCCGTGACCGGACCGCCATGGAGATGCTTGGCGATCTGCTACGCGCCGAAACGGCCCACCGTGAAGCCGCCTCGATCCGGTATCGCATGACTGCGGCCAGGCTGCCGGCCATCAAGGATCTCGACGGCTTTGTCTTCGCCGACACACCGATCAACGAGAGCCTGGTGCGTTCGCTCCATGCCGGCTCGTTCCTGCCGGAACGGCGCAATATCGTGCTGGTTGGTGGCACCGGCACCGGCAAGACGCATCTCGCGCTCGCCATCACCGCTGCGGTGGTCCGCGCCGGGGCCAGGGGCCGGTTCTTCAATACCGTCGATCTGGTCAATCGTCTGGAGGAAGAAACCCGGCAGGCCAAGGCCGGCAGCCTGGCCGCCCAGATGGCCCGCCTGGACGTCGTGGTTCTGGACGAGCTCGGGTATCTGCCGTTCGCCCGGTCAGGAGGCCAGATGCTGTTCCATCTGATCAGCAAACTCTACGAAAAGACCTCGGTGATCATCACCACCAATCTCGCCTTCGGCGAATGGCCTAGCGTCTTCCAGGATGCCAAAATGACGACGGCGCTGCTGGACCGTGTCACGCATCATTGCGACATCATCGAAACCGGCAACGACAGCTGGCGGTTCAAAAACCGAAGCTAA
Protein sequences of DBSCAN-SWA_17 >NC_020542|187487:189720|188991_189720_+|WP_015449229.1|DBSCAN-SWA MQRHEMLAALKGLGLKGMIAAFDDAVTNGIRRDRTAMEMLGDLLRAETAHREAASIRYRMTAARLPAIKDLDGFVFADTPINESLVRSLHAGSFLPERRNIVLVGGTGTGKTHLALAITAAVVRAGARGRFFNTVDLVNRLEEETRQAKAGSLAAQMARLDVVVLDELGYLPFARSGGQMLFHLISKLYEKTSVIITTNLAFGEWPSVFQDAKMTTALLDRVTHHCDIIETGNDSWRFKNRS >NC_020542|187487:189720|187487_189005_+|WP_015449228.1|transposase|DBSCAN-SWA MLIVETIAKIRREHRDGKPIKEIARDLRLSRNTVRKAIRAPEADFSYERKEQHRPQTGPFRERLDELLAENEERPRRERLRLTRIHDLLEREGFTGSYDAVRRYAARWKQERHAGGSGDMSKVFIPLMFRPGEAYQFDWSHEDVEIAGKPMRVKVAHMRLCWSRAPFVRAYPRETQEMVFDAHARGFAFLGGVPTRGIYDNMKTAVTTVFTGKERVFNRRFLIMTDHYGVEPVACSPAAGWEKGQVENQVQTGRERLFKPRLRFASMEELNAWLEAECRRWAERYAHPDMEDMTIAQALEMERPSLQPLTTPFDGFFESEHVASSTCLVSFDRNRYSVMAVAARHAVQLRAYADRVVIRCAGKVVAEHARLFGRNQTKFDPWHYLPVLIRKPGALRNGAPFQDWDLPPALAQLRRKLGKSDDADRRFVRVLAAVPEDGLEAVEAAVREAMAAGTANDEVILNILSRRREPQPVQAINVVVDLRLKHPPIADCARYDTVRGLNAAA |
2 | Escherichia_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_18 |
198629 : 202626
Sequences of DBSCAN-SWA_18
Nucleotide sequences of DBSCAN-SWA_18 >NC_020542|198629:202626|DBSCAN-SWA CATGGCCAGCCATGCCATGCCTGCCGTCCAGGCGCCGACATCATTGACCGGGCAATCGGTCGATGAACTCATCTTGGCCAGCCAATCGCTGGATGACGGAACGATGAGGACGCAACTGTCGGTCCCGACCGCGCATTGCGGCGGATGCATGGCGAAGATCGAGCGCATCCTTGGTAACCTCGAGGGCGTCGTAGCGGCTCGGGTCAATCTATCGACGCGGCGTGTGACCGTCACATGGCGGCAAGCGCAAACCGCGAGTGCACCGCCTTTGCTGGCAACGCTGAACGAGGCGGGCTTCGAGGCAAATCTTCTCAGCCAACCGGACGAACGCGCCGATCCCGAAAAGCGCCGGCTGATCGTCGCGACAGCGGTCGCCGGCTTCGCGGCGATGAACATCATGCTGCTTTCGGTGTCGGTCTGGTCAGGCGCCGACCCCGCTACGCGGCAGCTGTTCCATCTCATCTCGGCGCTGCTCGCGCTTCCCGCGGTATTCTTTTCGGGGCGCATCTTCTTCTTGTCGGCATGGTCGGCATTGCGCGCGGGACGCACCAATATGGATGTGCCCATCTCGATCGGCATCGTCCTGACACTGGGTCTCAGCATCTACGACACGCTGCATTTCGGACGCCATGCCTATTTCGACGCGGTCGTGACGCTGATCTTTTTCCTGCTTGTCGGACGAACGCTCGACCATGCCATGCGCGACAAGGCCCGTTCGGCCGTGCTCGGGCTGACGCGAATGACGCCTGCGGGCGCGAATGTAATCGGCACGGATGGATCGCGCGCTTTTAGACCGCTCGAGGATGTCTCGGTGGGTGATGTCATCCTCGTCGCGCCAGGCGAGCGTGTTCCGCTGGATGGTCTTGTCCTTGCGGGCGAGGGCGATCTCGACACTGCCGCGGTCACGGGCGAGGCGATGCCGGTGCCGGTACGACCGGGAAGCACGCTCGTTTCGGGCATGCTGAATCTCAATGGTTCGCTCAAGGTGCGTGTGACCAACAGTCACGCGCAGTCCTTTCTATCCGAAATGGTGCGAATGATGGAAGCGGCCGAGCAGGGCAGAGCACGCTATCGCCGTCTCGCCGACCGTGTTGCGGCCTATTACTCGCCGATCATCCATTCGCTGGCCTTGGCCGTGTTTGCCGGGTGGTTCCTTGACACAGGCGATTGGCATCGGGCGCTGACCATTGCCATTTCGGTGCTCATTGTGACCTGTCCCTGCGCGCTTGGTCTCGCCATACCCATGGTTCAGGTCGCCGCGGCGAAGCGCTTGTTCGAGCACGGCATCGCGCTCAAGGATGGAAGCGCGCTCGAGCGGCTTGCACAGGTCGATACCGTGATTTTTGACAAGACCGGCACGCTCACTCATGGCGATTTGCGAGTGAGCAAGATCGCCATCGATGAACCCTATCGCGCAATCGTGATGGCGCTGGCGTCGCGTTCCAACCATCCCGTCGCACGCGCGATTGCCGCCAACGGCGGCGCCTTGTCCGGCCTCGATCTGGACAGTTTTTCCGAGCTGCCGGGCCGGGGTTTGGAGGGCCAGCGGAACGGCCATCTGTTTCGCCTGGGGCGTGCCGATTGGGCTTTGAATCCAGGCACGGGCGAGAGTGGCGGTTTCCAGACCGTGTTCTCGGTCGATGGTGAGCTTGCCGGCCATTTTGCCTTCTTGGACGTTGAAAAGACGAGTGCGCGGGAAGCGGTGGCCAAGCTTGACGCGCTTGGGCTGCCTATCGAACTGCTTTCGGGCGATCACATTGCTTCTGTGTCGGGGTTCGCGAACGCGATCGGTATCGCCCACTGGCGTGCCGGATTGCTGCCGCAGCATAAGGTGGAGCGGCTTCAGGCGCTGGCGGACAGTCAGCGCCTGACCTTGATGGTGGGTGATGGGCTGAACGATGGACCGGCACTGGCGGCGGCGCACGTTTCGATGGCTCCTTCCAATGCCGCCGACATCGGTCGGGCGGCCGCGGATATTGTTTATCTCGGCCAGGATCTCGATGCGGTTCCGCGCGCTGTGCAGATCGCGCGCGCGGCCCGGCAACGTGTCCGCCAGAACCTGGCGCTTTCCGTCGGTTATAACCTTTTGGTGATTCCTGTGGCCATGGCGGGCTATGTCACCCCACTGCTTGCTGCCGTCGCCATGTCGCTTTCGTCGATTTCGGTCGTTGCGAACTCTCTGCGAATTCCCGCAGCACGCGGATCACGGCTTAGCAGGCGGGCGCCGGCACCGACCCTCGTTCCCCTGGCCGCTACCCGATGAGCGGGATCCTGTGGCTGGTGCCAGTCGCGCTGCTGATGGGCCTTGCTGGGCTGGGCGCCTTTCTTTGGTCGATGCGCACCGGGCAGTATGATGATCTGGATGGAGCGGCCGAGCGAGTCGTGACCGACGCCAAGTCAGACCAGCCTCTTGTCGAACCTGATGATTGGCGACCCGAAACCGAGGGCGAGGCAAATCGTTTGCGATAAGGATGGTGGTGTGGACACCATTGGGCAAAGGACGAAGTGATGATGCAGGCACCGAGTTGGAATTGGGTGGGCGTCCTTGCCTTCATCTGCGGGGCGGCGGTCATCATTCCCAGCCTTGTCGCGCTTGCACTCAGTCTTGCCGCTGTTGCCCGAAACCTCCTGTGAGGCTGGCATCGGTCTGACGGAGGGATAGCACGGCAAGCGGCTCAACCGCAGGTGATCGAGAATGTCGGCACGGGGATAATTCGGGGAAATGCCGGTGGAGAAGCTCGGGCGTCTTGCCCGGGAGCGGCGGCCCGGCGACGTAACCATCCGTTGGGTGCCCGTGAGCGAAGTTACCAAGGGATGTTTCGTCAATGACTGAACCCAGGAATCCTCCGCCCCGGCCACAAACGATAGCCGCAGCGCACGGCGTTGCCAGCGACCCTGCCTATGGGGCGGTTGCCCCGCCGCTTTATCTTTCGAGCACTTATGAATTTGCCGGCTATGCTCAACCGCGTCCCTATGATTATGGCCGTTCCGGAAATCCGACCCGCGATCTCCTTGCGGAGGCGCTCGCGAAGCTGGAGGGCGGTGCCGGGGCGATCATGACGTCGAGTGGTATGGCGGCTCTCGACCTTCTGGTCGGACGGATGGGCCCCGGCGATCTCATACTCGCGCCCCATGATTGCTATGGCGGCACGATGCGCTTGCTGAAGGCGCGCTCGAACCGCGGCCATTGTGTTGTCCGGTTCGTCGACCAGGGCGACGAGAAAGCTTATGCGGCCGCGCTGGAGGATGTGCCGACGCTGGTCCTGGTCGAGACGCCGAGCAATCCCCTGATGCGGGTTGTCGACATCGAGGCGCTGGCGACCATGGCGCGCGCGGCGGGTGCCGCGGTGGCCGTCGACAACACATTTCTCTCGCCGGCCATCCAGCAGCCGATCGCCTTGGGCGCCGACTATGTCGTTCACTCCACGACAAAGTTTCTCAATGGCCATTCCGATGTGATCGGGGGAGCGGTCGCTGCGGCCTATCCGGCGCAAGTCGAGGACCTCCGCCATTGGGCGAACATCGTCGGGCTGACCGGGGCGCCGTTCGATGCTTGGCTGACACTGCGCGGCCTTCGCACACTGTTTCCACGGATGGACCAGCAACAGCGTAGCGCCATGATCGTTGCCCAATATCTGGAGCGGCATTCGGCAGTGACGAAGGTGCATTACCCTGGACTTCCTGGGCATAGCGGACATCGGATCGCTTCGCGCCAACAGCGCGGCTTCGGCGCAATGCTGAGCTTCGAGCTCGCCGGAGGGGTGGACGCGGTGCGCCGGTTCGTGGCGACCGTCGGCTTTTTCACCGTCGCCGAATCGTTCGGCGGGATCGAGAGCCTCGTTGCACATCCGGCGACCATGACCCCTGCCGATATGGGCGAAGAGGCGCGTGCAAGGTCGGGTATAGGCGACGGCCTGCTGCGGCTTTCGGTCGGTCTTGAGGCTGAAGAGGATCTGATCGCCGGCCTCGAACGAGGACTGGCGGCTTGCGCAGCATGA
Protein sequences of DBSCAN-SWA_18 >NC_020542|198629:202626|200887_201097_+|WP_004212718.1|DBSCAN-SWA MSGILWLVPVALLMGLAGLGAFLWSMRTGQYDDLDGAAERVVTDAKSDQPLVEPDDWRPETEGEANRLR >NC_020542|198629:202626|198629_200891_+|WP_015449327.1|DBSCAN-SWA MASHAMPAVQAPTSLTGQSVDELILASQSLDDGTMRTQLSVPTAHCGGCMAKIERILGNLEGVVAARVNLSTRRVTVTWRQAQTASAPPLLATLNEAGFEANLLSQPDERADPEKRRLIVATAVAGFAAMNIMLLSVSVWSGADPATRQLFHLISALLALPAVFFSGRIFFLSAWSALRAGRTNMDVPISIGIVLTLGLSIYDTLHFGRHAYFDAVVTLIFFLLVGRTLDHAMRDKARSAVLGLTRMTPAGANVIGTDGSRAFRPLEDVSVGDVILVAPGERVPLDGLVLAGEGDLDTAAVTGEAMPVPVRPGSTLVSGMLNLNGSLKVRVTNSHAQSFLSEMVRMMEAAEQGRARYRRLADRVAAYYSPIIHSLALAVFAGWFLDTGDWHRALTIAISVLIVTCPCALGLAIPMVQVAAAKRLFEHGIALKDGSALERLAQVDTVIFDKTGTLTHGDLRVSKIAIDEPYRAIVMALASRSNHPVARAIAANGGALSGLDLDSFSELPGRGLEGQRNGHLFRLGRADWALNPGTGESGGFQTVFSVDGELAGHFAFLDVEKTSAREAVAKLDALGLPIELLSGDHIASVSGFANAIGIAHWRAGLLPQHKVERLQALADSQRLTLMVGDGLNDGPALAAAHVSMAPSNAADIGRAAADIVYLGQDLDAVPRAVQIARAARQRVRQNLALSVGYNLLVIPVAMAGYVTPLLAAVAMSLSSISVVANSLRIPAARGSRLSRRAPAPTLVPLAATR >NC_020542|198629:202626|201453_202626_+|WP_015449328.1|DBSCAN-SWA MTEPRNPPPRPQTIAAAHGVASDPAYGAVAPPLYLSSTYEFAGYAQPRPYDYGRSGNPTRDLLAEALAKLEGGAGAIMTSSGMAALDLLVGRMGPGDLILAPHDCYGGTMRLLKARSNRGHCVVRFVDQGDEKAYAAALEDVPTLVLVETPSNPLMRVVDIEALATMARAAGAAVAVDNTFLSPAIQQPIALGADYVVHSTTKFLNGHSDVIGGAVAAAYPAQVEDLRHWANIVGLTGAPFDAWLTLRGLRTLFPRMDQQQRSAMIVAQYLERHSAVTKVHYPGLPGHSGHRIASRQQRGFGAMLSFELAGGVDAVRRFVATVGFFTVAESFGGIESLVAHPATMTPADMGEEARARSGIGDGLLRLSVGLEAEEDLIAGLERGLAACAA |
3 | uncultured_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_19 |
223805 : 227105
Sequences of DBSCAN-SWA_19
Nucleotide sequences of DBSCAN-SWA_19 >NC_020542|223805:227105|DBSCAN-SWA AGTGAGGACGATCGGGAACACCATCCTGTTTTCAGCGACCGACCTGATGCGGTTCGTCGGCTGCGCGCATGCGACAGCGCTCGACCTTGCCTATATGCGCGGCGAGCCGCTCACTCCCCGGGAAGATACCGAGGATGCAGCCCTGCTTCAGAAACAAGGCGATGCCCATGAGGCCGCCCATTTGGAGAAGCTGAAAGACGCGGGCAACGGGGTAGTCGAGATTGCGCGTGGCGATCTTGCACAGAATGCAGACGAGACGCGAGCGGCCTTGGCGCAAGGCTCGCAGATCATCTTTCAAGGGGCTTTTCTCGCCGAGCGTTGGGGTGGGTGGTCCGATTTTCTCGAGCGAGTCGAGCGACCATCATTGCTGGGGCCGTTCAGCTACGAAGTGACCGACACCAAGCTGAAGCGCAAAGCCCATCCCAAGCATGTGCTGCAGCTCGTGCTCTATTCGGACCTTTTGGCGGAGATCCAGGGCGTGATGCCTGAGCGCGCGCATGTGCAGCTAGGAGATGGCACGCGCGCGACGCTGAGGTTGGCCGACTATGCCCATTATGCGCGCGGTGCGCGGGCTAAGCTCGAAGTCTTTGTTGCTTCCCCTGAGCCAACGCGGCCGATCCCCTGCGCGGATTGTTCGCTGTGCCGATGGGCCGACCATTGCGACGCCGTCTTCACCAGCCAGGACAGCCTGTTCCAAGTCGCCAATATTACCCGCGGCCAAGTGAAGAAGCTCGAGGCGTCCGGCATCGAAACCATGGCCGCGCTGGCTCGGCATGACGGCCCGGTGCGCGGCGTTGCGAGTGCGACGGCGGAGAAGCTTGTCGGCCAGGCCAGACTGCAGCACGCGCGCAAAACCGGCGAACCCGCCTTCGAGTTGCGTCCGGCACAGCCAGGCAAAGGCTTTGACCTGCTCCCTCGGCCGCAGGCGGGCGATCTCTTCTACGACATCGAGGGCGACCCCCATTACGAGGGAGGCCTTGAATATCTGCACGGCGTGTGGGCCGATGATAGTTTCCATGCCTTCTGGGCGCATGACCATGCCGCCGAAGCGCAAGCGCTCGAACGGTTGCTTGCGTTTTTTCGTAATCGCCTCACGGCTTATCCGCAGGCCCGCATCTATCATTATGCACCCTATGAGATCACCGCGCTGCGGCGTCTCACCACGCGCTATGGTATCGGCGAGGCCTTTCTCGACCGCCTCATGCGCGAACGCCGCTTTGTCGACCTCTATGCGGTCGTGCGCGGCGCGCTCATCGCTTCCGAACCGAGCTACTCCATAAAGGCGCTGGAGGCCTTTTATGGCTTGAAGCGCGAGGGCGAGGTCAAGACAGCGGGCGGGTCGGTCGTTGCTTATGAAAATTGGCGCGAGACGGGCGATCAGCAAATCCTCGACGAGATCGAGGACTATAATCGGATCGATTGCCAGTCGACCCAGCTACTGCGCGATTGGCTAGCGGGCATCCGGCCCGATGGTCCCTGGCCCGTGCTTGCGCAGGACGCGGCCGAGCAGGAGGTGGTCGAGGACGAGGAAACGACGGCGCTGCGCGATCGCCTGGCCGCATCCTCTCTGACTCCAGAGCGTCAGGAATTGCTGTTCAATCTGGGGCTGTTCCACAAGCGTGAAGCCAAGCCTGCTCAGTGGACGGTTTTCGACAGCGCGGCGCGCGATGAGGAAGAGCTTGTCGACGATCTCGACGCCTTGGCGGGACTCGAGGCGATATCCGGGATCGAACCCATCAAGCGCTCGGTGATGCGCACCTATCGCTTTCCGTCTCAGGAAACCAAATTGCGCGAAGGCGGCAAGGCCACTGTGCCCGGGATTGACGGGCCGCCGTCAACCGTTGCGATTGAGGCGCTGGATCGCGATGCGTGCACGATCACCTTGAAGGTCGGCGTGGCAAGGGCCGAACTCCTCACCGATCGGCTGACCCTGCATCCGGATTGGCCGCTCGATACCAAGGTGTTGGCCGCAGCGGTGCGCGACGTCATCGAGGATCAATGCGGGCCGCGGCGCTATCGTGCCGTCGATGATCTGCTGTCGGGCGCCGCCCCGCGCCTGAACGGCATCGACGGCGATATCCTCTGCGGCGGGGAACCGGTGGCGGGCGCCATCGCCGCGGCGCAAGCCATGGATCAGACATTGCTCCCGATCCAGGGTCCGCCGGGAACGGGAAAGACGCACGTCACGGCCCGGGTGATCCTGGCGCTGGTCAAAGCAGGAAATCGGGTCGCCGTCGCCTCGAACAGCCATGAAGCGATCCGCAATGTCCTGCTCGGCTGCCTGCGCGCGCGCGAGGAAGAAGGCGGCACCTTTCCCGTGTCGTTCGCCCACAAAGTTTCGAGCGGTGATGATGGCTATGCTAGCGATTGTCCCGTTCACCGCGCCACGGCCAATGATGACCTGATCCTCGCCCGCGCCAATGTGGTCGGCGGCACGGCCTTCTTCTTCGCGCGCGATGAGAATGTGCAGGGCTTCGATTGGTTGTTTGTCGATGAGGCGGGGCAAGTGGGCCTCGCCAACTTGGTCGCCATGGGTCGTGCCGCGCGCAATATCGTGCTGGTCGGCGATCCGCGTCAGCTGCCTCAGGTCATCCAGGGCGCGCATCCTGCGCCCGCCAACCTGTCATGCCTGGAATGGATGCTGGGCGAGCATGCCACCGTCCCTCCCGATCGGGGCATATTCCTTGCCGAGACCCGGCGGATGCATCTCGCGGTTTGTGATTTCATTTCAGACCAGGTCTACGAGGGACGCCTTGCCAGCCATGGGGATACCAAGCGCCAGAGCGTTACTGGAACGGCCTGGCCCACGGCCGGCGCCTTCTGGGTGCCGGTCGTCCATGACGGTAATGCCCAGTTTGCCGCTGAGGAAGTTGCGGCGATCGGCGCGGCAATCGAGAATCTCCTGCAAGGGAGCTGGACCGACAAGAACGGCGCCACGCGCCCTATCGGCCCCGGCGACATCATCGTCGTCGCCCCCTATAATGCACAGGTCAACGCGCTGCGGGCAGGGCTGCCGAGCAGCATCCGCGTCGGCACGGTCGACAAGTTCCAGGGCCAGGAAGCCCCTATATGCCTGGTCTCCATGACGGCTTCCTCGGCCGACGAGACGGCCCGCGGCATGGAGTTTCTCTTCTCGCTCAACCGCATCAATGTCGCGGTTTCACGCGCCAAAGCGCTTGCGCTCGTTTTTGGCAGCGACCGCTTGCGCGAGGCCAACTGCAGCAGCATCGAGCAGATGCGGCTCGTCAACACGCTCTGTGCCCTTCCGCCGTTTTCTGCGCCTCTCAGTACGGGATCTTGA
Protein sequences of DBSCAN-SWA_19 >NC_020542|223805:227105|223805_227105_+|WP_015449351.1|DBSCAN-SWA MRTIGNTILFSATDLMRFVGCAHATALDLAYMRGEPLTPREDTEDAALLQKQGDAHEAAHLEKLKDAGNGVVEIARGDLAQNADETRAALAQGSQIIFQGAFLAERWGGWSDFLERVERPSLLGPFSYEVTDTKLKRKAHPKHVLQLVLYSDLLAEIQGVMPERAHVQLGDGTRATLRLADYAHYARGARAKLEVFVASPEPTRPIPCADCSLCRWADHCDAVFTSQDSLFQVANITRGQVKKLEASGIETMAALARHDGPVRGVASATAEKLVGQARLQHARKTGEPAFELRPAQPGKGFDLLPRPQAGDLFYDIEGDPHYEGGLEYLHGVWADDSFHAFWAHDHAAEAQALERLLAFFRNRLTAYPQARIYHYAPYEITALRRLTTRYGIGEAFLDRLMRERRFVDLYAVVRGALIASEPSYSIKALEAFYGLKREGEVKTAGGSVVAYENWRETGDQQILDEIEDYNRIDCQSTQLLRDWLAGIRPDGPWPVLAQDAAEQEVVEDEETTALRDRLAASSLTPERQELLFNLGLFHKREAKPAQWTVFDSAARDEEELVDDLDALAGLEAISGIEPIKRSVMRTYRFPSQETKLREGGKATVPGIDGPPSTVAIEALDRDACTITLKVGVARAELLTDRLTLHPDWPLDTKVLAAAVRDVIEDQCGPRRYRAVDDLLSGAAPRLNGIDGDILCGGEPVAGAIAAAQAMDQTLLPIQGPPGTGKTHVTARVILALVKAGNRVAVASNSHEAIRNVLLGCLRAREEEGGTFPVSFAHKVSSGDDGYASDCPVHRATANDDLILARANVVGGTAFFFARDENVQGFDWLFVDEAGQVGLANLVAMGRAARNIVLVGDPRQLPQVIQGAHPAPANLSCLEWMLGEHATVPPDRGIFLAETRRMHLAVCDFISDQVYEGRLASHGDTKRQSVTGTAWPTAGAFWVPVVHDGNAQFAAEEVAAIGAAIENLLQGSWTDKNGATRPIGPGDIIVVAPYNAQVNALRAGLPSSIRVGTVDKFQGQEAPICLVSMTASSADETARGMEFLFSLNRINVAVSRAKALALVFGSDRLREANCSSIEQMRLVNTLCALPPFSAPLSTGS |
1 | Beihai_Nido-like_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_20 |
231013 : 238926
Sequences of DBSCAN-SWA_20
Nucleotide sequences of DBSCAN-SWA_20 >NC_020542|231013:238926|DBSCAN-SWA CGTGAACGTTTTTGATCTCGATACTGACCTAATAGAGCGTTATGAAAATTTCGCGCGTTCGTTCACTTCGATCCGCGCAGACGACATTAGCGAACAGGTCAACGCGATTTACGCCGGGCAGAAATTCTGGCCCGAACCCCTGATCGGTCTGAACCCCCATTTCCTCGAAGGCCGCGGCCTCGTCGATCTTGCCAAGGACGGCGTGGTCGATCCGGATTTACCCAAGGTGTTCGCCGTAGGAGCCGGACGCAGCCCGATCAGCCTGCATCGCCATCAGGACGAAGCGCTCATGAAAGCGCTTCAGGCCAAAAATTATATCGTCACCACCGGCACCGGCTCGGGCAAGTCGCTATGCTTTTTTGTGCCGATCATTGACCGCATCCTGAAGGCGCGTCGCGCTGGCGAGGCCCAGCGCACGCGGGCGATCGTCATCTACCCGATGAACGCGCTCGCCAATTCGCAGCAGGAGGAGCTTGAGAAGTTCATCGGGGAGTGCGGCCTGTCCGATGGGATGAAGCCGACATTCGCCCGCTACACCGGCCAGGAGAAAGAAGCCGAGCGCAAGCGCGTGGCTGACGAAAAGCCGGACATCATTCTCACCAACTTCATGATGCTCGAACTGCTGATGACTCGGCAGGACGAACTCGACCAGCAGGTCATCGCCAATATGGCGGGCCTGGAGTTTCTGGTTCTCGACGAACTCCATACCTATCGCGGCCGACAGGGGGCGGATGTTGCGCTGCTGGTGCGACGGGTGCGCGAACGCATGGGAGCGCAGAAAATGCTCTGCGTTGGCACCTCGGCGACCATGGCGTCCGGGGATGAGGATGCGGGCCGCAAGGCGGTGGCGTCTGTCGGCACAACCCTTTTCGGCAGCCCCGTGCATCCGGATGATGTCATCACCGAAAGCCTGAGACGCTGCACCCAAGGCATCGCCAACGAGGCCACCCTTAAATCTGCGATCGTCAACTCCGCTGGCCTTCCATCCACCGCAGAGGCCTTCAAAGTCGATCCCCTCGCGATCTGGATAGAAACCAATATTGGGCTTGAGGTCGGCGAAACACTGAAGCGGGCGAAGCCCTCAACCCTCACTGATGCATCACACCGCCTGTCCGAGGAGACAGGGATCGAGGCGGAGGCTTGCCGCAAGGCCCTCCAGGATCGCCTGATCGCCATGAGCGATTTCAAGGATGACCGCGACCAGGCTTTCATGGCGTTCAAGCTGCATCGCTTCCTATCCGGCGCTGGCGACGCCCATGCGACGCTCGAAGCTGCTGGGAAGCGCCGGGTGGCACTGGCGGCGGAAAAGTTCGACCGCGACAATAGCGAGGCCCGCCTCTATCCGGTCTTCTTCTGCCGCGAATGTGGACAGGAGGTGCATAGCGTCACCATTGACGATGATGGCACTGTCATCGCGCGCCCGATCGATCAGACCCCGAGAGACGGGGTTGATGAAAGCGGGGCGGAGCACGGCTTTCTCGTGCCCGACGCCAAGGGTGATCTGAATTTCGCCGGTTCTATCGATGATTATCCTGATGACTGGATTGAGCTGACGGCATCGGGCGAACCGCGGCTCAAATCCTCCCATCGCGGCAAGCATGACGGCCGCCTGATGATGCTGGGCGCTAACGGGCGGCGCGATGAGCATGGCGTTCGCTCGTGGTTCTTCCCCGGTAAATTCCGCTTCTGTCCGCATTGCCGCAATCAACCGCCGCCAGGCGCTCGCGACATCAACAAATTGGCTGGCCTCTCGGCCGAAGGTCGTAGTTCGGCCACGACGCTCATCACGTCGGTGGCCCTCGACTGGATGGAACGGGGCCACATCCCGTCTGAAAAGCATCGCCGCAAACTGCTCGGCTTCACCGATAACCGGCAGGATGCCGCACTCCAGGCGGGACACTTCAACGACTTTATCTTCGTCACACTCCTGCGCGGCGCCATGCTGCGTGCCGTCTTCAAAGCCGGTGAGGATGGTTTGAGCCATGAGCAATTTGGCGATGTCCTGCGCCGTGCGCTCGGGTTCGATCCCGAAAGCGTCGATCGCCGAGCGGAATGGATGCTCGATCCGAACCCGGCCAGCTTCGCGGCGCTGGACGAGGCCAAGAAGGCGGTAAACCGCGTGCTGGCCTATCGCCTCTGGAACGATCTGCGGCGGGGCTGGCGCTATACTAATCCCAACCTCAATCAGTTGGACCTGCTGCGCGTCGAATATCCCGGCATCGCCATGCTCGCCGGCGACGATCGCGCCTGCGCCGGCATGGACCAGGAACAGCTCAATGACGGCCAGCGCCGTGGCTTTGAGCTGCTTTCCCAAATCTCCAGTGACGTTCGGCGCGAAATGTTCCGTCTGATGTTCGACGCCATGCGGCAAGGCCTCGCCGTTGCCGTCGATGCCCTGGACCTCAATGAGATTGAGCAGGTCGCGCAGAAATCCCGCCAGCTTCTCAAAGACCCCTGGGCAATCAGCCGCGAAGAAGAGAACAAGGATCTTGTGTGCCAAACAACGCTTGTGCTCGGCACACAAGGCGGCAAGGCGGACAGGATTATCCGGGTTTCCGCGCGCAGCGGTCTTGCCAAAGAATTGGCGAAGTTGGCGGGGGCGCTGCCCCTTGCCGATCGCGAACCGTTGATCGAAGCCATGTTGATGGCGGCCGCGCGCCACCAGATCGTGCGCCAGTTTTCCGCCGGTCCGCTCACAGGTTGGCGGCTCGCGCCGAGCGCCTTGCAGCTCCGTGCAGGGAAGGGGACACCAAGCCCGAGCCAAGACAATGCGTTCTTCCGCCAGCTCTATAGCGATATAGCCGCGCGCCTCGCCACACCCGGCGGGCTGCCCCTGGCCTTTGAAGCGCGCGAGCATACAGCGCAGGTCGATTCCCTCCTGCGCGAAATGCGTGAGTGTCGGTTCCGGTACGGCAAGAGCGACCATGAACGCATGAAGGAAATTGCGGGCGATGCGCGCGTTCTCAATGAGAAGCGTGACTTTCTGCCCTTGCTCTATTGCTCACCCACCATGGAACTGGGCGTAGATATATCGCAGCTCAATGTCGTGTACCTGCGCAACGCGCCGCCAACCCCGGCCAATTATGCGCAACGCGCAGGCCGTGCTGGCCGCAGCGGACAGGCCGCGCTGGTCATCACCTACTGCGCCGCGCAAAGCCCACACGACCAATATTATTTCGAGCGCCGGACCGATCTCGTCGCCGGCATCGTTCGCGCACCCGCGATCGACCTCATCAATCGCGACCTGCTTGCCGCGCATGTCCATGCCGAATGGCTCGCTGCCGCTCAGGAGGGGCTGGGCAAGTCGATCCCGGAAAACCTCGATATGACCGAGACCGATCTGCCGATCTGCGAACGGCTGCAAAAAGCTTTTGCCGTGGCCAGCGCGGACGTCGATGCCCGTGATCGTGCCGAGCGGGTCGTCGTCTCGGTCCTGCCACAGGAAGGCGCAGCCGAAATCGGCGAAGCCTCGGACTTTGTCGGCCATCTCTGGAGCAACGCGGCAACGGCGTTCAACACGGCGTTCAAGCGCTGGCGAACACTCTACAATTCCGCCCATGAAGAACGAAAGGCCGCCAGCGAACTCGGCAATCAAACCGGGCTGTCGGCGCAGGAACGTCAGGAGGCTCGGACCCGCTATCTGGCGGCCGATCGTCAGGTGCAACTGCTGGAAACAGGCGCATCGTCCACCAGCTCGGACTTCTATGCCTATCGCTATCTGGCGACCGAAGGCTTCCTACCTGGCTATAATTTCCCGCGCTTGCCGCTCTACGCCTTCATCGACGGGGAAAAGTCATCGACGGTGCTCCAGCGTCCGCGCTTCCTGGCGATCGCGGAGTTTGGTCCGAACAGCCTCGTCTATCATGAGGGCAAGGCCTATCGCTGCAATCGCGCCAAGCTGCCTGCCGGAACGCGGACCACAGACAACAAGCTGACGACGATGACTGTGCGATGCTGTCATTCGTGCGGCGCGGCCCATAGCGTGGAAACGCAGGAGCGCTGCATTGCTTGCGGCGATGTGCTGACCGAGGAAGGGCGGCTTTCCAAGCTCTATCGGATCGAGAATGTCGATGCCGTGCCCGGTGCGCGCATTACCGCCAATGACGAGGATCGTCAGCGGCGCGGCTTTGAAATCCGCACGATTTTCGAGTGGGAACCGATGCGCCAAGACCAGCTTCTGCTGAAGTCTGACGACAGCCCGCTGGCGGTATTGCGCTACGGACCGCAGACTCGCGTATCCCGCGTCAATCTCGGCCTGCGCCGCCGCGCCAAGAAGGAAGATACCGGCTTCGACATCGACACGATTTCGGGCCGCTGGCTCAAGAATGAGACAAAAGGTGAGGATGAGGGCGACGATCCCAAAAGCGCCATTCGTCAAACCATCGTGCCTCTGGTCGAGGACACGAAGAACGCGCTGCTGCTGCAATTCGATCGCCAGCTCGACCTCGATGAAGCGCAGATGGCGACCCTCCAGCACGCCCTGATCCGCGCCATCGAGACAGAGCATGTCCTGGAAACCGGCGAGCTTCTGGGCGAGCCGTTACCGACCCGCGACGATCGCAAGGCGATCCTCTTCTATGAGGCGTCCGAAGGCGGGGCAGGGGTTCTCAAGCGGCTGATGGACGGCGCGGAGCGGTGGCATCGTATCGCGGACGTGGCGCTCGACCTGATGCATTACCGTCGCGACAACGGCGAGCTGGTCGAGGCCAACGAACCTTGTGTTGCTGGTTGCTACCGCTGCCTGCTGTCCTATTACAACCAACCGGACCATGAACTGATCGACCGGCGCGATCCTGCGGTGATTGACGTGCTCGATCGGCTAGCTCGCAGCGAGAATGACTGGCCTGCCGATGCTCCCGCCGGTGCCGGGACGCATGATCCATGGATGGCGGCGCTGGCGCAGTGGGGCGCGCCGACGCCCTCCACGGAAACCATTGGCGGCGAGACCTATCATCTTTGCTGGCCGGGGCTGATGGTCATGGCTGTGCCCGGTCCGCCGCCGCCAGCACTGGTGACGCGCTGCGCCGAAATTGGCCGCGACTTGATCGCGCTGCCGGCAACCCCTGACGAAACCATGCCGCCTGACCTTGCCGCGGCGCTCGGAGTATCCGCATGAACCTGCCCGCCTTCCAGACTGGCCAGCTCCAGACGGGCGATTTGATCCAGGCCCGTGGGCGGGAATGGATCGTGCTTGGTAAACCGGATGAGGGGCTTGTTCGCGTGCGGCCCCTCTCGGGGTCCGAAGAAGACGCCATCATCATCGCCCCCAGTCTTGAACGGATGCCCGTTCGTCCCGCCACCTTCGATCTGCCGGCGGCGGATCATCCCGATACGCAGGACGCCGCCCGTCTGCTGGCCGACGCGCTACGCCTGTCTCTCCGTCGCGGCGCAGGGCCTTTCCGCAGCGCCGCGCATTTGGGCGTCGAGCCGCGCGCCTACCAGCTCGTCCCCTTGCTCATGGCACTGAGACTCGACGTGAAACGGCTGCTCATTGCCGATGACGTTGGTATCGGAAAGACCGTTGAGGCGGGCATGATCCTCCGCGAGATGATCGACCGAGGCGAGGTCGAGCGCTTCTCCGTGTTGTGTCCGCCGCATCTCGTGGATCAGTGGGTCGGCGAGCTGGCGGAAAAGTTCGATATTGATGCCGTGGCCGTCACATCGGCGCGCGCCCGTTCGCTGGAACGCGGACTGGCGCTCGGCGACACCATCTTCGGCGTCTATCCCTACACCGTCGTCAGCCTCGACTACATCAAGGCCGACAGCCGGCGGGAAAGCTTCGCGCAGGCTTGTCCTGGCTTCGTCATAGTTGACGAGGCCCACACCTGCGTCGGCGGTAGCGAGAAGGGCACACAGCAACGCTACGCTCTGCTCGAACGGCTCGCGGAAGACGCCACGCGCCACCTGCTCCTGCTCACAGCGACCCCGCATAGCGGCAATCAGGATGCCTATGCGCGTCTGCTCAGTCTGTTGCACGGCGACCTCGCGGACGGCCCCGAGGCGGGCGATCAGAACGCACGGCGCCGCTATGCAGACCGGCTCGCCAAACACTTTGTCCAGCGTCGCCGCCCCGATATTGCCGACAAGTGGGGGGATGCGGGCGCTTTTGCGAAGTCGATGAAAGCGGATGCGCCCTACACTCTGACCGGCGATTTCCAGAACTTCCAGGAAGACGTGCTGGAATACTGCCTTGGTGTCGCAACCCGTGCCGATGGGGAACGGGCGCGGCGGCTCGCTTTCTGGGGAACACTCGCGCTAATGCGCTGCATCGGCTCGTCGCCGGCGGCAGCCCTCAGCGCCTTGCGCAACCGGCTCGGAGGCATGGCGGAGGAAGCCGCACTCGCGCCAGTCATGTTTGATGACGACGGTGATGAGCTGGCCGACACCGACATTGAACCCGCGACCGCGCTGGACCGCGAGGAGCTGGCCGAGCTGAAAGCGCTCATCACGCGCGCGGAAGGACTGGCGGCCCGCTTTGAGGATGATCCCAAATTTCGCGAACTCGTCCGGCAGGTGAAAGACCTGACGGCCAAGAAGGGTGCCCGGCCCGTCATCTTCTGTCGCTTCATCACAACGGCCGAAGCTGTGGGCGAAGCGCTGCGCGCCCAGTTCAAGAAGCACGAAGTCGAAATCGTCACGGGCCGCCTGACGCCCGAGGAACGCCGTGGGCGCGTCGAGGCGATGGTGGATCACCCCAATCGCATCCTCGTCGCCACCGATTGTCTGTCCGAAGGGATCAACCTCCAATCGCTGTTCAACGCGGTGATCCATTACGACCTCAACTGGAACCCGACACGGCACCAGCAACGCGACGGGCGCGTCGATCGTTTTGGGCAGCCCGAAGATACGGTCTGGTCGGTGATGATGTTCGGCTCAAACTCGATCATCGACGGTGCGGTCATCAAGGTCATCACCGAGAAGATGAAGCGCATCCAACAGGCGACTGGCGTGATCGTGCCGGTGCCGGAGGATTCCGCAAGTGTCTCCAACGCGCTGATGCAGGCGATGCTGCTGCATTCCAGCAAACCGCGCGCGCAGGGACTGTTCGATTTCGGCGACGCCGAGGAACGGCTTGAAATGGAGTGGAAGGACGCGGAGGAAAACGCGCGTAAAAGCCAGACGCGGTATGCCCAGGCCGCGCTCAAGCCCGACGAAGTTCTCCCCGAATGGCACCGACTGCGCGAACTTCTCGGCGGCCCCGATGAAGTAGAGCGGTTCACCCGCCGGGCACTGGCGCGCATCGATACGCCGCTCGGTACGGTCGGGAGCCACTACCGGGTCAGCTATGAGGATTTGCCGAAACAGCTTGGCGAGCGCCTTGCGGCCCGCAATCTGACGGGCACCCGCGCCATCGGATTTGCCGACAAGCTGCAACCCAATGTGGCCCATGTCGGCCGCACGCATCCGCTGGTCGCCATGCTCGCCGAAACCATGGCCGAAGGGGCGCTCGATCCCGGCGGTGTCGAAGGCAAGGCCACGCTCGGCCGCTGCGGTGCCTGGCGCTCTACGGCGGTGAACAAACTCACCACCGTCCTGTTGCTGCGCCTGCGCTTCAAGCTGACGACCAGCGGTCGAAAGACACTGTTGGCCGAAGAAGCCACCGCGCTGGCTTTCGCCATGGGTGACAATACCGCCATCGCACAAGGCCCCGACGCCCTGGCTTTGCTCGAACCCGATGCCAGCGGCGACATCGCGCCGCCCGTCATCGCAAGGCAGGTCGATCAGGCGCTAACCCGCGTGCCCGAATATGATGTCGCGATCAGCGCTTATGCCCAGGATCGTGCTGCCCGGCTTTCAGAGGATCATGACCGCGTGAAGTTCGCAACAAGAGGCGAGGGGGCCACCACCGAAGTGGAAGCTGTTCTGCCTGCTGACGTGATCGGCCTCTATGTCCTCGTGCCGGAGGCGAACTGA
Protein sequences of DBSCAN-SWA_20 >NC_020542|231013:238926|231013_236128_+|WP_015449354.1|DBSCAN-SWA MNVFDLDTDLIERYENFARSFTSIRADDISEQVNAIYAGQKFWPEPLIGLNPHFLEGRGLVDLAKDGVVDPDLPKVFAVGAGRSPISLHRHQDEALMKALQAKNYIVTTGTGSGKSLCFFVPIIDRILKARRAGEAQRTRAIVIYPMNALANSQQEELEKFIGECGLSDGMKPTFARYTGQEKEAERKRVADEKPDIILTNFMMLELLMTRQDELDQQVIANMAGLEFLVLDELHTYRGRQGADVALLVRRVRERMGAQKMLCVGTSATMASGDEDAGRKAVASVGTTLFGSPVHPDDVITESLRRCTQGIANEATLKSAIVNSAGLPSTAEAFKVDPLAIWIETNIGLEVGETLKRAKPSTLTDASHRLSEETGIEAEACRKALQDRLIAMSDFKDDRDQAFMAFKLHRFLSGAGDAHATLEAAGKRRVALAAEKFDRDNSEARLYPVFFCRECGQEVHSVTIDDDGTVIARPIDQTPRDGVDESGAEHGFLVPDAKGDLNFAGSIDDYPDDWIELTASGEPRLKSSHRGKHDGRLMMLGANGRRDEHGVRSWFFPGKFRFCPHCRNQPPPGARDINKLAGLSAEGRSSATTLITSVALDWMERGHIPSEKHRRKLLGFTDNRQDAALQAGHFNDFIFVTLLRGAMLRAVFKAGEDGLSHEQFGDVLRRALGFDPESVDRRAEWMLDPNPASFAALDEAKKAVNRVLAYRLWNDLRRGWRYTNPNLNQLDLLRVEYPGIAMLAGDDRACAGMDQEQLNDGQRRGFELLSQISSDVRREMFRLMFDAMRQGLAVAVDALDLNEIEQVAQKSRQLLKDPWAISREEENKDLVCQTTLVLGTQGGKADRIIRVSARSGLAKELAKLAGALPLADREPLIEAMLMAAARHQIVRQFSAGPLTGWRLAPSALQLRAGKGTPSPSQDNAFFRQLYSDIAARLATPGGLPLAFEAREHTAQVDSLLREMRECRFRYGKSDHERMKEIAGDARVLNEKRDFLPLLYCSPTMELGVDISQLNVVYLRNAPPTPANYAQRAGRAGRSGQAALVITYCAAQSPHDQYYFERRTDLVAGIVRAPAIDLINRDLLAAHVHAEWLAAAQEGLGKSIPENLDMTETDLPICERLQKAFAVASADVDARDRAERVVVSVLPQEGAAEIGEASDFVGHLWSNAATAFNTAFKRWRTLYNSAHEERKAASELGNQTGLSAQERQEARTRYLAADRQVQLLETGASSTSSDFYAYRYLATEGFLPGYNFPRLPLYAFIDGEKSSTVLQRPRFLAIAEFGPNSLVYHEGKAYRCNRAKLPAGTRTTDNKLTTMTVRCCHSCGAAHSVETQERCIACGDVLTEEGRLSKLYRIENVDAVPGARITANDEDRQRRGFEIRTIFEWEPMRQDQLLLKSDDSPLAVLRYGPQTRVSRVNLGLRRRAKKEDTGFDIDTISGRWLKNETKGEDEGDDPKSAIRQTIVPLVEDTKNALLLQFDRQLDLDEAQMATLQHALIRAIETEHVLETGELLGEPLPTRDDRKAILFYEASEGGAGVLKRLMDGAERWHRIADVALDLMHYRRDNGELVEANEPCVAGCYRCLLSYYNQPDHELIDRRDPAVIDVLDRLARSENDWPADAPAGAGTHDPWMAALAQWGAPTPSTETIGGETYHLCWPGLMVMAVPGPPPPALVTRCAEIGRDLIALPATPDETMPPDLAAALGVSA >NC_020542|231013:238926|236124_238926_+|WP_015449355.1|DBSCAN-SWA MNLPAFQTGQLQTGDLIQARGREWIVLGKPDEGLVRVRPLSGSEEDAIIIAPSLERMPVRPATFDLPAADHPDTQDAARLLADALRLSLRRGAGPFRSAAHLGVEPRAYQLVPLLMALRLDVKRLLIADDVGIGKTVEAGMILREMIDRGEVERFSVLCPPHLVDQWVGELAEKFDIDAVAVTSARARSLERGLALGDTIFGVYPYTVVSLDYIKADSRRESFAQACPGFVIVDEAHTCVGGSEKGTQQRYALLERLAEDATRHLLLLTATPHSGNQDAYARLLSLLHGDLADGPEAGDQNARRRYADRLAKHFVQRRRPDIADKWGDAGAFAKSMKADAPYTLTGDFQNFQEDVLEYCLGVATRADGERARRLAFWGTLALMRCIGSSPAAALSALRNRLGGMAEEAALAPVMFDDDGDELADTDIEPATALDREELAELKALITRAEGLAARFEDDPKFRELVRQVKDLTAKKGARPVIFCRFITTAEAVGEALRAQFKKHEVEIVTGRLTPEERRGRVEAMVDHPNRILVATDCLSEGINLQSLFNAVIHYDLNWNPTRHQQRDGRVDRFGQPEDTVWSVMMFGSNSIIDGAVIKVITEKMKRIQQATGVIVPVPEDSASVSNALMQAMLLHSSKPRAQGLFDFGDAEERLEMEWKDAEENARKSQTRYAQAALKPDEVLPEWHRLRELLGGPDEVERFTRRALARIDTPLGTVGSHYRVSYEDLPKQLGERLAARNLTGTRAIGFADKLQPNVAHVGRTHPLVAMLAETMAEGALDPGGVEGKATLGRCGAWRSTAVNKLTTVLLLRLRFKLTTSGRKTLLAEEATALAFAMGDNTAIAQGPDALALLEPDASGDIAPPVIARQVDQALTRVPEYDVAISAYAQDRAARLSEDHDRVKFATRGEGATTEVEAVLPADVIGLYVLVPEAN |
2 | uncultured_Mediterranean_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_21 |
243832 : 245540
Sequences of DBSCAN-SWA_21
Nucleotide sequences of DBSCAN-SWA_21 >NC_020542|243832:245540|DBSCAN-SWA CCTAGCGGGCCACCGGAATATCCGACAACCGCGTGGTCCAGGCGGGACTCTTCTGGCTGCGCTTCGTGTCAAACTCACGGGCACCAAGACCCTGCGCCGCCAACCCGATTGTCCCTCGGCCGAACCGTGCGTTGATACCGTCCACGGCTACAAGCAAATCCGACGCGCGCGGGTCGGCCGCAGCAAACAGGTCGCCCTGCGATGCCGCGACCGGGCCAAGTTCCTCCAGCAATACACCGCATTTCGTATAGGCCCTGCCGGGCTCGAACAGCGCCCCTGTCATCCCCGTTGCACGCGCGACGATAATGCGCGGGTCATTGGTCGCCGGTGACAACCGCACTGATCGTGACGCGGAAGGGGAATTAGCCTTGAACCGTGAACCGTGCGCAAATGCGATCAGCCGCGTCGCAACGAGCTGCTGCGCGCGGATTTTTTCCGCTGCTCGCCATGCGTGCCGCGCTACGGCCTCTTGTAAAGATGGCAAGTCCGTGACCGGGGCACCAAAGGAGCGCGTCACCGCCGTCCCCTTCAAGGCTTCAGGCTCGGGATGGAAATCGTCGCACTCGGTCCCGTTCAGTTCCCGGACCAATCGTTCGAGAACTACGGTGCCGACATCGCGGGCAACCGACGGGGGCAGCGCGGCTAGGTCCGCCGTGGTTCTCACCCCCAGCGGACGCAGCTTGGCAGCCAGCGCGCGCCCAACGCCCCAGACATCCTCTATAGGCCACTCCCGAAACAACCGCTCCCGCAAGGCTAGGTCGTGCAGATCCACAACCCCGCCCCAAATCTTCTCTGTGGCCTTGGCGAGCGCATTGGCGACCTTGGAAAGCGTTCGCGTCGGCCCAAGGCCGATCCGCGTCGGCAGACCCACCGAACGCAATATGGCGGCGCGAAGCCGGTGCGCCGAAGCCACGTCACCAAGGCCGGAGGGCAGGGGAGGAAGCCGGAAAAAGCTCTCGTCGATCGAGTAGATCTCCAAGATGTCGCTATGCTCCGCGATGACGGCATTGAAGCGGCGATTGAGGTCGGCATAGAGTTCGTAATTGCTGCTGCGAAGCTGGATGCCGTGCCGCCTGATCATATCACGCATTTTGAACACCGGCTCACCCATCCGGACGTGCAAAGCCTTGGCTTCCTCGCTGCGCGATACGGCGCATCCGTCATTGTTCGACAGCACAATGACCGGCACGTTCCGCAATGCAGGATCGAAAAGCCGCTCCGCGCTGACATAGAAATTGGCGACATCGACAATGGCCCAGGTCATGCGTGCCTCACCGATGGAAGTCGCGGACACTGGCGCGGACCACGCCCCAAATCTCGACGGTCTCGTCGGCGATGGTCGCGGGATAGGTCATCCTGACATTGCGTGATTCCAGATAGGACACACCTTGCCGAACGGCGAGCTGGCGCACGACAAACCCGCCATCGACCACGGCAATTACGATATGGCCGTTGCCCGGCGTTACGTCGCGATCGACCACAACCACGTCATTATCATAGAGGCCGGCATCGATCATGCTCGCTCCGCTGATACGGAAAACAAAGCTGGCGGCCCGGTCGAGCCGCAGCAATTCCACCAGATTGATGGCGTCATCCGCCCAATCCTGGGCAGGGCTTGGAAAGCCGGCTCCCGCCGCGCCGATGATCCGAAGCCGAAGGCCCTGCGGAACCGCCGCGATCGGGACCGGCTGCATCGACGGCAGCAGCAT
Protein sequences of DBSCAN-SWA_21 >NC_020542|243832:245540|243832_245095_-|WP_015449358.1|DBSCAN-SWA MTWAIVDVANFYVSAERLFDPALRNVPVIVLSNNDGCAVSRSEEAKALHVRMGEPVFKMRDMIRRHGIQLRSSNYELYADLNRRFNAVIAEHSDILEIYSIDESFFRLPPLPSGLGDVASAHRLRAAILRSVGLPTRIGLGPTRTLSKVANALAKATEKIWGGVVDLHDLALRERLFREWPIEDVWGVGRALAAKLRPLGVRTTADLAALPPSVARDVGTVVLERLVRELNGTECDDFHPEPEALKGTAVTRSFGAPVTDLPSLQEAVARHAWRAAEKIRAQQLVATRLIAFAHGSRFKANSPSASRSVRLSPATNDPRIIVARATGMTGALFEPGRAYTKCGVLLEELGPVAASQGDLFAAADPRASDLLVAVDGINARFGRGTIGLAAQGLGAREFDTKRSQKSPAWTTRLSDIPVAR >NC_020542|243832:245540|245102_245540_-|WP_015449359.1|DBSCAN-SWA MLLPSMQPVPIAAVPQGLRLRIIGAAGAGFPSPAQDWADDAINLVELLRLDRAASFVFRISGASMIDAGLYDNDVVVVDRDVTPGNGHIVIAVVDGGFVVRQLAVRQGVSYLESRNVRMTYPATIADETVEIWGVVRASVRDFHR |
2 | Salmonella_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_22 |
250032 : 251121
Sequences of DBSCAN-SWA_22
Nucleotide sequences of DBSCAN-SWA_22 >NC_020542|250032:251121|DBSCAN-SWA GTCACGCGGGCGCAATTCGCCCCTCGAGCCAGGCCAGGCGCTGGAAGTTATAGGCAAGGTTGGTCATGCCGATCTTGATGCGGGCACGAGCAATACCGATGGTGCGCACCACCAGACCCATGCGGTGCTTTTGCCCGGCAAAGACGTGCTCGACGGCGGAGCGGACCGCCGACCGCTTGGCATTGGCGCGTGCAATACGCTTAGGCATCGCTCGGCGGTGTGGCTTGCGCTGGTGGATGTGGCTGGTGAACATGCCCTTGGCGAGAAAGGCTTCGTTCTTCTTCGAGCGGTAGGCCGTATCGGCCCATACCCCCGCGGCCGTGTTGTCCGGACTAATCAGCGTTGGCAGCCTTGCGCCATCGTGGGCATTGGCGGCGCTGGCATCCCAGGTGCGGATCAGGCCATGCGCCCGGTCGATGCCGATATGGTTCTTGTAGCCGAACATCGGGATGGCCAGATCGACCGGCTTGAAGGCCTTGGGGTCCGCGCCTTCCTTGATCTTGGCCTTGGTGTACTTGACGCTCCAGCGGGCGTCGCGATCCTTTTGCCGGATCTTCGCAGGGTTATCCTTCCAGCGCTCGGGTATCCGGCCCTCCTTGATGGCCGCCTTCTCCTCCTCGGTGTTCCGCTGCTTGGGAGCCGGCACAACGGTCGCGTCGATGATCTGCCCGCCCATGGCAAGGTACCCTCGGTCTGTGAGAGCGGCATCGAAGCGGGCAAATAGCCTGTCGATGGCTTTGGCCTTCACCAGACGCTCCCGGAACAGCCACACTGTCGTCGCATCGGGCACGGTGCCATCCAGGCCAAGCCCCAGAAAACGCTGGAAAGAGAGGCGGTCCTTGATCTGGAACTCGGTCGCTTCGTCCGAAAGCGAGTAGAGCGCCTGGAGCACGAGGATCTTGAACATGAGGACCGGATCAAACGGCGGGCGGCCACCTTTGCCGCGAGGACTACGGCGCAGCGCCGCTACCAGCGGTCCCCGGAAAACCTCGAAGTCCACGACTGCCGCCAGACGCTCTAGCGGATCACCCGCCGCGCTCAACGCCTCATATCGGTCCGAAAGATCGAAGAAACCCGGCTGCCCTGCCAT
Protein sequences of DBSCAN-SWA_22 >NC_020542|250032:251121|250032_251121_-|WP_015449243.1|transposase|DBSCAN-SWA MAGQPGFFDLSDRYEALSAAGDPLERLAAVVDFEVFRGPLVAALRRSPRGKGGRPPFDPVLMFKILVLQALYSLSDEATEFQIKDRLSFQRFLGLGLDGTVPDATTVWLFRERLVKAKAIDRLFARFDAALTDRGYLAMGGQIIDATVVPAPKQRNTEEEKAAIKEGRIPERWKDNPAKIRQKDRDARWSVKYTKAKIKEGADPKAFKPVDLAIPMFGYKNHIGIDRAHGLIRTWDASAANAHDGARLPTLISPDNTAAGVWADTAYRSKKNEAFLAKGMFTSHIHQRKPHRRAMPKRIARANAKRSAVRSAVEHVFAGQKHRMGLVVRTIGIARARIKIGMTNLAYNFQRLAWLEGRIAPA |
1 | Burkholderia_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_23 |
255007 : 258870
Sequences of DBSCAN-SWA_23
Nucleotide sequences of DBSCAN-SWA_23 >NC_020542|255007:258870|DBSCAN-SWA GTCAGAGCATCGCGGCGTCGGGGCGTCCCTCAGACGCGGAGCAAAGTCGTTCGGTCGCCACCTCTTGCCGAGAACCATCGCCGAAGATCACGACGACACGCGGGCCAAGGACAAGACGCGCGGGACGGCCATCAACTTCAACGGCAAGGCTGGCCTCTGGCGTTTGCTCCCCTGGTCGCGCGACTGGCTCAGTCCCTCCCCGCGGTTCCGGACGAGCGACGGTCACTGCCGCGGCCTGCATCGGCGTTGCCTCGATGCTGGGAATGTCCGCCTTACCCTTCAGCCCGGCAATGAACCGCGCGCCTTCCGCCAATGTCGCGCCGCTCTCATCCCAGGCGCCCACATAGGCCTCGACTTCCGCAGGATAATCCCTGGCTGTGCGATGCAGGTCATAGAGCAGCCGGATCGGCGCGCGCGGGAGCGCCTCCCTGAGATAGGCAGGCATGTCACCATAGGCCGCATAGAGCGAGACGAACTGCTTGGAGCGTCCCAGCGCAGCAGCGATTTCGACCTGGCTCATGCCGCCCTTGAGCATACGGGCAATCGCATGAGCGAGCTCGACCGAAGACAGGTTCGCACGCTGATCGTTTTCAACGATCTGATCGGCCAGCAGATGGGTTCCCTCGCCCTCGGCCACGATAATCGCCGGGATCGTCACCCGGCCCGCGAGCTCGGAGGCACGAAAGCGCCGCTCGCCCAACCGGATGCGATACAGGCCAGCGGCGTCGGCGGGAGCAACCGTGATCGGCTGCAAGACCCCGCGGGCCGCTATGGACGCCGCAAGCTCGGCCAGGGCGTCGTCGTCGAAACTGCGCCGGGGATTGCCGGGGTCGGGGATGACCCGCTCCAAAGGAATATCCTCGACCCGGCGCGCCATTTCATGCGCGCCGACCAACAGGCCAAACTCCTCGATGCGTTCATCGAACTTACCCATCAGGCGGCGGCCGCTGGCGCTGAACCCAGTCGCCGTTCGATCTCGGCGAGGATCGGCCGAATTTCCTCGGCCGCCGCCTTGGCGCCGCCACCCCCTTCCATCCAGACCGGTCGATGATTTTCCGCGGCATGCTTATAGGTCGGTCGCGCCTTGATGAAGGCAGGGAACATCAGCCGCTCGCCCACCGCGTGGGCGAGCTTGACGGCATTGTCTATTTCGCGCCGGTCGAACGGGTTGACCATTGATGGCAGCAATCCGAGGAAATCGATCTTGCGCCCTGCCCCCGCCGCCTCGGCCTTGCGCAGCGCGGTCATCCTTTGCGCGCAGTCTGCTCAAGCAGACTTTTCACGGCCATCCCCCCGGTCAGGGGGATGGCCGTGCCGATCAGTATTCTTCGGCGAGTAGGATCGTCAGAACCCGAGTGGTGACGTCGGGATTGGCTGGATCGGGTGAGCCCCAGGCAAGCTCGAGATCATAATAATCGATCTTCCAGTAGAGAATGGCGCTGTCATATTCGAAGCGACCAAAATCATGCTCGCCCAATGGATCGTTGTCGGAATTGAAGCTGTCATAATTTCGCACCGTGCGCAGGATCTCAGCGCGTCTGCGAAATCCGCGGAAGAGGGACACATCCCCGACCAGCGCGGCAACGCCCGCGGTCATGACGATCTGGTTTCGGCCAGGGCTGCAAATAGATCTCCGCAGGCTGTCATTCAGTGCGGCGATGCGGTCGATCCGCTCGGCTCGGTCTGCCGATTGTTGAGAAGGCTGCGACATGGCTGTGACTCCGGTCTGGATCAGCCCCGCCCATCGGGGCCTTCCGACAGGCGCCCCAAAGGGCGGCGTCATAGCGGGGTCGTGCACCCGGAGGGTCGCAACGCAGTGGAGAAGCCCGCCTATGCGGGCTTGCTCGGCCCCGCGCGCCGACCAGGAAGCCGACAACGGAAAAGGACCCAATGGCGTGGACACCTGTCCGGTGTCGCACCCGTGTGATTGGCGCTCTCGGACTGCAGAGACTCACCCTCTCGATTACGCCCATCTCAAGCCGAAGTTCGAGATCGGTGGGAATGCCGAAGCAGCAGGCAAATGATGCCACCACCGCTCACGGCAGGGATCGTCACCCTAATGGGCTGAGACTTGGAAATCCAAGTCTCAGGGCGAAGCCTAGAGCCCGATCCGGGTTCGCCGGAGCCGCCCGTTCAGCTCTTCACGGTTCTCCGGAAATTGCCGAGAACAGAGCACAGCTTCTGGCTTTTACGCTTCAGATCAGCGAGCCTTTCTTCCGCTTCGGCTTGCTCGCAAAGCAGCATAAGGCGTTCAGCCTGCTGTCCGCGGATCGAGACATGGCGCAAGATCACGATCCATTTGCCGGTTCGCTCAAGCGCGATCTTCGCCTGACCATGGGCAAGAAACGACCGTAGGCCCTCGAGTGTGCGGAGATCCGAAAGGGCCTTGGCGGCCGCCTTTCCTTCTTGAGCGAAAGCGCCTTCTGCCTCGATAAGTTTGCTGAGGTCATCGAACCGCTGACCGATCAAATGCCGCAGCGCGTCCTTGGATGACAGGTTTTGCCGAAACCCGTCATTTCCAACTTAGCGATCCAAGCCCATCGGCCCTTCAAATACCGCACCAAGCCGCGAGATTGCACCACCCGATGAGTTCATCGATCCAGTGATCGCTGGTGGTGCCTCTCGGCTTCAGATCGATCCCATGCCCTCTGCGGGCGGCGTGCGATGCGGTGGGGCTGCTCAGACCGCGTCGGATTTTTGCCGGCGGATGGCGAGGGCCAGCACGCCATACCCCAGGAGTCCGGAGACAAGTGATCCAATGAGAACGCCAAGCTTCGCCTCCGCCTGAAGCATCGGCGCGCCCGGAAAGGCCAGGAGCGTGATGAAAATGCTCATCGTAAAGCCAATCCCGCATAAGAGAGCGACGCCGAGCATATGCAAACGGGTCGCATGGGCTGGCATATGCGCGATGCCAAGCCGGATCGCGAGTATCGAAACCCCGAGCACGCCAGCGACCTTGCCAAGCAGCAGACCCAGCGCTGCCCCAAGCGTCACCGGTGCGGCAAAGGCGCCCGCCGGGAGTGCTAGCACCGGCACACCGGCATTGGCGAAGCCGAAGATCGGCACGACCAGGTAGCCAACCGGCAAATGAAGCCCATGTTCGAGACGATGAAGTGGACTGCGCGCGTCGCCATCAGGCCGCGCGGGTGTGCGGTCCATAAGGATGGTGAACGCGAGAATAACCCCGGCGAGCGTGGCATGGATGCCCGAGCGATAGACGAAGAACCACAGGGCTCCGCCCAGGATCAGATAGGGTAGGAGGCGACGCACACGGAGGCGATTCAGGCCGTAAAGCATGCCGGTGAATATGCCGGCACCGGCAAGGTCAGGCAGAGACAGGTCAGCGGTATAGAAAAGCGCGATGATAAGGACAGCACCCAGGTCATCGATGATCGCCAGCGCCGCGAGGAATATCCTGAGCGATGCGGGGACGCGATTGCCCAGGAGCGCGATCACGCCGAGCGCGAAAGCGATGTCGGTAGCGGCAGGGATCGCCCAGCCTGCCGCAGTTGGCCCCTGGTTGAAAGCAAGGTAGACGAGCGCGGGCACGGCCATGCCGCCGGCGGCAGCGATGCCCGGCAGGACGCGCCTCGGCCAGGTGGATAGCTGGCCGTCCAGCATCTCACGCTTAATTTCCAGCCCGACGAGCAGGAAGAAGAGGGCCATCAGGCCATCGTTGATCCAGTGCGCCAGCGAAAGCGGACCGAGGTTTGAATGAAGGACCGCCTCATATCCGGAGCGCAAGGGCGAGTTCGCGATGATGAGTGCCAGCGCAGCGGCCGCCATCAACACCAGGCCCGCAGAAGACTGGCTGTTGAGAAAGGCGCGAATGGCGCTGGGGTGTAAGACACGGAAAGTCTTCAT
Protein sequences of DBSCAN-SWA_23 >NC_020542|255007:258870|256324_256717_-|WP_021245272.1|DBSCAN-SWA MSQPSQQSADRAERIDRIAALNDSLRRSICSPGRNQIVMTAGVAALVGDVSLFRGFRRRAEILRTVRNYDSFNSDNDPLGEHDFGRFEYDSAILYWKIDYYDLELAWGSPDPANPDVTTRVLTILLAEEY >NC_020542|255007:258870|255007_255940_-|WP_015449366.1|DBSCAN-SWA MGKFDERIEEFGLLVGAHEMARRVEDIPLERVIPDPGNPRRSFDDDALAELAASIAARGVLQPITVAPADAAGLYRIRLGERRFRASELAGRVTIPAIIVAEGEGTHLLADQIVENDQRANLSSVELAHAIARMLKGGMSQVEIAAALGRSKQFVSLYAAYGDMPAYLREALPRAPIRLLYDLHRTARDYPAEVEAYVGAWDESGATLAEGARFIAGLKGKADIPSIEATPMQAAAVTVARPEPRGGTEPVARPGEQTPEASLAVEVDGRPARLVLGPRVVVIFGDGSRQEVATERLCSASEGRPDAAML >NC_020542|255007:258870|257685_258870_-|WP_015449370.1|DBSCAN-SWA MKTFRVLHPSAIRAFLNSQSSAGLVLMAAAALALIIANSPLRSGYEAVLHSNLGPLSLAHWINDGLMALFFLLVGLEIKREMLDGQLSTWPRRVLPGIAAAGGMAVPALVYLAFNQGPTAAGWAIPAATDIAFALGVIALLGNRVPASLRIFLAALAIIDDLGAVLIIALFYTADLSLPDLAGAGIFTGMLYGLNRLRVRRLLPYLILGGALWFFVYRSGIHATLAGVILAFTILMDRTPARPDGDARSPLHRLEHGLHLPVGYLVVPIFGFANAGVPVLALPAGAFAAPVTLGAALGLLLGKVAGVLGVSILAIRLGIAHMPAHATRLHMLGVALLCGIGFTMSIFITLLAFPGAPMLQAEAKLGVLIGSLVSGLLGYGVLALAIRRQKSDAV |
3 | Leptospira_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_24 |
261877 : 262780
Sequences of DBSCAN-SWA_24
Nucleotide sequences of DBSCAN-SWA_24 >NC_020542|261877:262780|DBSCAN-SWA GTCATGCTGCCAACTCCTCGCGCTGGGGCTGCTGCGCCCACATCCAGTCGGCGGCTTGCTGGGCCTTGCTGGCGGCGGTGAAGATCGCCCGTGGATCTTTCTTCAGAGCACTGAGCCACGATGCCACATAGGCGGCGTGATCGGGGCGCGGATGATGGGCAATGCCGAGGTCCGCCAGGACGAACGATGCGGTGAGTTCGGCGCAGGCTTCTTCGATCGGCAGCCAATCGCGCTTGAACCGGTCCTGGAAGTTGCGGTCGAGCCGATGCTTTGCCCCTGTGGCGTGGCCGGACTCGTGCAGCAGGGTGCCCATGTGAGCGGCTGCATCGTGAAACGCCGAGAAAGGCGGCATGAAGATCTGGTCGAGATCGATGCGATAATGGGCATCGTAGGCGCCTTCGGTGATCGGAATGTTCAGCGCCGCGACGAATGCTTCGGCATGGGCGAGGCGTTCGGTCTCCGGCAATAGGGTGACCGGCGCAGGCTCATAACCTTCGACCTGCGCGAGGTTGAACACGGTGAAAGCGCGAGCGAACATACGACCGGGCCGCTGATCGCCATCGTCATGGTCTTGATCGACGTTGGACGCAGCCTGCTTCCAGAAAACGACTGTCGTGCCGCGTTCGCCCTTGCGAACCTGGGCGTCGACCGCCTGCCACTGCCGATAGGTGCCCCACAGGCCGCTGGCATAGCCGCTCACCTGTGCGGCCGCCCACAACGCGAGCACGTTCACGCCCCGGTAGCGCTTATGGGATGAGAAGTTCTCGGGCCGCGTGACGGCGGCGCCATCATGATGCCAGGGCATGCGCCAGGTGCCCGCGCCCGCCTCGATGGCGGCAATGATCTCGGTGGTGATGCGGGTATAAACGTCGGCGCGCTCGGCCGGCGCTGATGAGCGGGACAT
Protein sequences of DBSCAN-SWA_24 >NC_020542|261877:262780|261877_262780_-|WP_015449374.1|DBSCAN-SWA MSRSSAPAERADVYTRITTEIIAAIEAGAGTWRMPWHHDGAAVTRPENFSSHKRYRGVNVLALWAAAQVSGYASGLWGTYRQWQAVDAQVRKGERGTTVVFWKQAASNVDQDHDDGDQRPGRMFARAFTVFNLAQVEGYEPAPVTLLPETERLAHAEAFVAALNIPITEGAYDAHYRIDLDQIFMPPFSAFHDAAAHMGTLLHESGHATGAKHRLDRNFQDRFKRDWLPIEEACAELTASFVLADLGIAHHPRPDHAAYVASWLSALKKDPRAIFTAASKAQQAADWMWAQQPQREELAA |
1 | Caulobacter_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_25 |
268333 : 269080
Sequences of DBSCAN-SWA_25
Nucleotide sequences of DBSCAN-SWA_25 >NC_020542|268333:269080|DBSCAN-SWA CCTACTGCAAAGTGCCATGCGGTGCAGACTGTCCAGATAAGGCGACCAACTCCTCGAACGCATAGGCCTGATCGCCGAAGCGCTTGCCGAAGGATCGGGCAAGCCGGTCTGGATGACCCGTCCAATGACCAGCCTCATGCGCGAGCGTGGCCGCCCATGCAGCACGCGTGTCGAACTGTTCGACGGGCGGCATCGTGATGAGGTCTGCGCGCAGGTTATAATAGGCGCGATCGCCTCCAGTCCGCAGCCGCGCCGGCAGGCGATCGACGAAGCGTTGTGCCCGCGCACCGAGCCGGTCGCCCGGTGGGACGAGCGATAGCGGTTCTGGATAGAAATCGGACGGCAGCGCGTCGATCTGGTCTGCATTGAAGACCGCATAGGACCGCATCACCCGGCGATGCTCATCGACCAGCTCGCCGGGACGTTCGCTGGAATCGACCTTCTTCGAATAGGCCTTGTAGAAGATTGCAAATTGCGCCTGCTCACCGGCACGGACCTGCCCGCCAAGGGTCTGCGCCTGGCGGTAGGTCATCCAATGCCGTGACCGATAGCCGCATTGCTCGGCAGCCAGCCACAGCCAAAAGGTGTTCATGCCGCGATACGGCTCGCCATTGGCCCGCAGCGGACGCCCCTGCGCCGCACACGAACGCCAGGGCCGCACCCAGGGTCGCACACCGGCATCGAGCCGGTCGATGATGGCGCACGTGATGGTCTCGGCCGGCGAGGGCCGGCCCGATGATCGAGTCAT
Protein sequences of DBSCAN-SWA_25 >NC_020542|268333:269080|268333_269080_-|WP_015449378.1|DBSCAN-SWA MTRSSGRPSPAETITCAIIDRLDAGVRPWVRPWRSCAAQGRPLRANGEPYRGMNTFWLWLAAEQCGYRSRHWMTYRQAQTLGGQVRAGEQAQFAIFYKAYSKKVDSSERPGELVDEHRRVMRSYAVFNADQIDALPSDFYPEPLSLVPPGDRLGARAQRFVDRLPARLRTGGDRAYYNLRADLITMPPVEQFDTRAAWAATLAHEAGHWTGHPDRLARSFGKRFGDQAYAFEELVALSGQSAPHGTLQ |
1 | Caulobacter_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_26 |
272135 : 275721
Sequences of DBSCAN-SWA_26
Nucleotide sequences of DBSCAN-SWA_26 >NC_020542|272135:275721|DBSCAN-SWA TCTATAGGGTGTCGAGCCATCCCATCAGATTGGCATCTCGCTTCCACGAAGGTGGCACGTTGCTCGCCGCCGGGGCGCCAACCTGTTCCAGCGCCTTTCGTTTGGTCTCCAGATTGGCCTGAGCATAGTGATTGGTCGTATCGAGACTGACGTGCCCGAGCCAACTGCGGATGACGGTGATATCGACTCCGGCGGCAACGAGGTGGACGGCGGTCGCGTGCCGGAAGCTGTGAGGCGTAACGTGTTTTGACTGGAGGGTCAGCGTCGATTTCGCGGCTTGTTCCACGTAGGCGTTGAGCTTGAACCGCACCCCTGACGCCCCTAGCGGTTCACCGTATCGATTGACGAAGATCCGCTCATCGGGCGCACGCGGCTGTCGCTCCAGTAGCTTCCTGAGCAATGCCACGGTTTCTGGCCAGAGCGGGCAGATGCGTTCCTTGCGCCCCTTGCCGTAAAGACGCACGAAGTTCGGTGCATCGAACCGGATTGCTTCGGGACAGAGGTCAAGCGCCTCTTGGATTCGCGCGCCACTGTTATAGAGGAACGAGAGCAGTACATGGTCGCGCAGCCCTTCGAGTGTCGATCGGTTGGGCTGGGCGAGGATGGCCTCGACCTCCTCGGGCTCGAGGTAGCACGGCGCGGAGGTGGGCTCCCGCTTCAACGGAACAGCCAGGACCTCTGAGCACTGCGCGATGTATTCCGGATTCTTGTCCGCCACGAAGCTGAAGAAGCTGCGGATGGCGGCCAGTCGGCAGTTCCGCGTGCCGATCGTGGTCTTGCGGCCATGCTCGGTATGATGAAGGAACGCGCGCACCTCGCCGGCCGAGACGTCGGTGAGCGTCAGCCGCGCGACCCCGCAGCCTTTTCGCTCCGCGACGAACCGAAGCAGCAGCCGCCAGGTGTCGCGATAAGAGCGGATCGTGTGGATTGACGCGCTGCGCTGCTCGGCCAGCCACTCCTGGAAGAACGCCCGCAACAACGCGGGCAATGGATTGCCTTTCCTCATGGCCGCGCCTCCGTGACAAGGCACGGGGCGCCGAGCGCGCGGAACCGTTCACTGGCTTCCTGCAACAGATCCTGCGTGACGGTGATGTAGACCAGCGTGGAGTGGAGATCCCGGTGCCCCATGTAGGTCGAGAGGAAGTGCAGTTTTTCCTGAGGGTTAATGCCGGACCGGTACCACTGGAGGATGCGGTTCACCACCATCGAGTGACGAAGGTCATGGACCCGCGGCCCGGTCCGGCCCGAAGCGGGCTTGAGCCCAGCACGGCGCATGACGTTGGTAATCATTGTCGTAACCGCCTCGGGCCTGTAGCGGTCATTTAAGTGGGCATGCCAGAACAGCCCTGATTTCGGGTTCTGTGGGCCGCCAGCGCGCCGCCTCGCATCGATGTACGCGCGCAGTTCGACCGCGACACTGTCGGATAGAGGCAAGATCCTGGTCTTGTAGAACTTCGTTTCCCGGATCGTGATCGTGCTTGATTGCAGGTCCACGTCACCAAGATCGAGCCATGCCAGCTCGCTTCGCCGTAGCCCGGCACAATAGGCCAGCATGATCATGGTGTAGAGGGTCAACGGCCGCAGCGGAGCGTCCGGCGACGGATAAGTCCGTGCGGTATCGAGCATACGCCGAACGTCAGCCGGGCTGAAGATATGCGGTTGCCGATGCTCCCGCGCTACTTCCCGCTCTGGCCGGGGATTGAAGCGCTTTGGCGGGATAGTCGGATCGAGGCGGAACCGCGCCTTGGTCAGGATGCGCGCCAGCTTCTGGCATTCAGCCGCGTGGTTGCGGGTCGGCTTGGCAGCCGCCCAGCTCGCAATCATTGCCTCAAGTGGTTGCTCCGCGAGGTCGGGACGGGCCTGAAGGAACCGATCGAACCGCAGCAGCCAGTGAGCCTGCGCTTCATATTGATATCCCCGGCTGCGCATCAGCATGACATGGTCTTGCATGAAGTCACCCAGCACGCTGCCGAAGGGCGCAGGCCGTCGTAACGCGGCCAAGGATTCGTCGGGATTCGGGGAAGCCAAGGCCCGCCAGATCGGCTTGCTTTGCTTGACGTTGTACCGACGGCGAAGTGCAGCAACAGGGTTGTCGGCGATCAGCCCGATTTCGACGAGGTGGTCGAGGAAGCGGTCGACAATGCAGACCTGATTGAGCAGCGTCGACAATCGCCAACGTTTTTGCATCTCCTTCAGCCAGGCGTCGAGCATCTGCCGGTCCACCGCCGGATGCCGGCGGGCAACATCTTCGAAGGTGCAAAGGAACCAGCGATATGTCGGTACGCTTCCCGGCCGGAACTGCGATTTTACCAGGAAGGCGTCGACGACGGTGCGATCGGGATCGTGCCAGGCGCTCATGACAGCACCTCCATTCCAGGCACCTCGAGTGCCACGGCTCGGAGATCATCTGTTGCCAGTTTGAGATAAGTATTGGTGGATTCGGTGGATCGATGCCCGAGCACGTCGCCGATGATCTTTTGCGGGACCGATGCCCGCAGCATTTCGACCGCACGTGCGTGGCGGAAGACATGCGGCCCCCGCTTTCCTGCTGGCACTACGCCTGCGGCGGCCAACCGACCGCGGATCATGCCGTACAGGTTCGTCATTGCGATATAGGGTGCGCAGGATCGGACGAAGATTTCCCGCACTTCAACCTGGGGCCGCCCAAGGCGCAGATAATCCAGCAGCGCTTCACCAACAGTCACCATTAGCGGCATGTACGAGTACGCGTTGGTCTTGGTGTGGCAGATCCGGAGGGATTCTGCGCGCCAGTCCACGTCATCGAGCCGAAGGCGGCATATCTCACCTTCGCGCAGCCCATACGTGGCAAGCAGCTGAAGTATCGCATAATCGCGTAGTCCGCGTGGCGATCTGTCCTCCTGCGTTGTCGCCAGAACCGCGGCGATTTGGCTCCTTTCCAGCGTCGACGGCACATCTTCGTAGGCATAGAGCATGGGGCCGATGATGTGTGGCGTCAGATCGGTCGGGATGCAACCCGTCCGATGCAGATGGCGAACCACCGAACGGAGACGCTCAGCGACATCGGCCAATGATTTGCGCCGTAAACCAGGCGCGCGCATGTCCATGTAGAGATCGATGTCCACGATGCTCAACGTTTCGAGGCTGGCGGCACCGGCCCGGTCGAACTGCCATCGCAGGAAGTTTCGCGCCTCCCACATCAGCGCCGCAATGGACGCGCTTGCCAAACCGCGCTCCTCGCGCAGCCATGCCTCGTATTCGCGGCAAATTTCATGTCGATGCTCGTCATCGGGACCGATCATTTCTGCGTCCGGGGGCCAATTGCCTTGAGCAAGCCGGAGGAGCTTGGCGATCGCGGTGCGGGGCAACATGTGCCAACGCGCACTGGGAGGCCGACCGTACTGGATCTCAAAATCCTGAACCGCATAGCCAAAATACTGATCGACCTGCTGCGGCGTCACAGTCTCGACCTGTATATCGCACTCGGCCAGATAATCGAGAAACGCGCGGGCGTAGAGACGGTGGTTCGCCACGACCACGGGATTGTAATTCTGTGTGGTGAGCGAATTCGAGAGTTCGGTGATCAACTCGTCATGCAACTTCAACAT
Protein sequences of DBSCAN-SWA_26 >NC_020542|272135:275721|274488_275721_-|WP_015449381.1|integrase|DBSCAN-SWA MLKLHDELITELSNSLTTQNYNPVVVANHRLYARAFLDYLAECDIQVETVTPQQVDQYFGYAVQDFEIQYGRPPSARWHMLPRTAIAKLLRLAQGNWPPDAEMIGPDDEHRHEICREYEAWLREERGLASASIAALMWEARNFLRWQFDRAGAASLETLSIVDIDLYMDMRAPGLRRKSLADVAERLRSVVRHLHRTGCIPTDLTPHIIGPMLYAYEDVPSTLERSQIAAVLATTQEDRSPRGLRDYAILQLLATYGLREGEICRLRLDDVDWRAESLRICHTKTNAYSYMPLMVTVGEALLDYLRLGRPQVEVREIFVRSCAPYIAMTNLYGMIRGRLAAAGVVPAGKRGPHVFRHARAVEMLRASVPQKIIGDVLGHRSTESTNTYLKLATDDLRAVALEVPGMEVLS >NC_020542|272135:275721|273136_274492_-|WP_009823940.1|integrase|DBSCAN-SWA MSAWHDPDRTVVDAFLVKSQFRPGSVPTYRWFLCTFEDVARRHPAVDRQMLDAWLKEMQKRWRLSTLLNQVCIVDRFLDHLVEIGLIADNPVAALRRRYNVKQSKPIWRALASPNPDESLAALRRPAPFGSVLGDFMQDHVMLMRSRGYQYEAQAHWLLRFDRFLQARPDLAEQPLEAMIASWAAAKPTRNHAAECQKLARILTKARFRLDPTIPPKRFNPRPEREVAREHRQPHIFSPADVRRMLDTARTYPSPDAPLRPLTLYTMIMLAYCAGLRRSELAWLDLGDVDLQSSTITIRETKFYKTRILPLSDSVAVELRAYIDARRRAGGPQNPKSGLFWHAHLNDRYRPEAVTTMITNVMRRAGLKPASGRTGPRVHDLRHSMVVNRILQWYRSGINPQEKLHFLSTYMGHRDLHSTLVYITVTQDLLQEASERFRALGAPCLVTEARP >NC_020542|272135:275721|272135_273140_-|WP_009823939.1|integrase|DBSCAN-SWA MRKGNPLPALLRAFFQEWLAEQRSASIHTIRSYRDTWRLLLRFVAERKGCGVARLTLTDVSAGEVRAFLHHTEHGRKTTIGTRNCRLAAIRSFFSFVADKNPEYIAQCSEVLAVPLKREPTSAPCYLEPEEVEAILAQPNRSTLEGLRDHVLLSFLYNSGARIQEALDLCPEAIRFDAPNFVRLYGKGRKERICPLWPETVALLRKLLERQPRAPDERIFVNRYGEPLGASGVRFKLNAYVEQAAKSTLTLQSKHVTPHSFRHATAVHLVAAGVDITVIRSWLGHVSLDTTNHYAQANLETKRKALEQVGAPAASNVPPSWKRDANLMGWLDTL |
3 | Thermus_phage(50.0%) | integrase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|