Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_AP017372 | Halorhodospira halochloris strain DSM 1059 | 11 crisprs | Cas9_archaeal,c2c9_V-U4,DEDDh,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,DinG,RT,cas6,WYL,csx16,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7 | 1 | 26 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_1 | 306593-306723 | Orphan |
NA
Consensus repeat of NZ_AP017372_1
|
1 spacer
spacers of NZ_AP017372_1
>1.1|306629|59|NZ_AP017372|CRISPRCasFinder TACGAATCCCCCGGAGCCATTTGCTACCCCGGTGTGGAGGACGCCGTGAATCCATCCCT |
CRISPR arrays and Neighbor proteins around NZ_AP017372_1
The CRISPR arrays of NZ_AP017372_1 >merge|NZ_AP017372|1|306593-306723|CRISPRCasFinder GGAGGCTTCATGGCGCCATCCCGGCGCCAAGACCTCTACGAATCCCCCGGAGCCATTTGCTACCCCGGTGTGGAGGACGCCGTGAATCCATCCCTGGAGGCTTCATGGCGCCATCCCTGGCGCCAAGACCT >NZ_AP017372|1|1|306593-306723|CRISPRCasFinder GGAGGCTTCATGGCGCCATCCCGGCGCCAAGACCTC TACGAATCCCCCGGAGCCATTTGCTACCCCGGTGTGGAGGACGCCGTGAATCCATCCCT GGAGGCTTCATGGCGCCATCCCTGGCGCCAAGACCT
>NZ_AP017372.2|WP_096407439.1|303040_306493_+|AAA-family-ATPase MRLTRLEIQTLPGIEPGFAITDFGPGLNLVTGPNGSGKSSLIRALQALVVEPGPDDHFAIAVAASFSGDGQWTVRRTGQQQVWELDGRPAHSPRLPRRDVLRCYWLTMEDLLVADERDDRLGAELRRSLAGGYDLVALRNEEQFRVGKQNGVREAKALREAQAEQRRVEAEYADLYRQEAQLPELDEQIDAARRAAGRSEQLRQALALLQARRRRQQVEAVLADFATGLERLRGDEMERLNKLEQRRLNLEHELRRQGERREQARDQLAAAGFGDTPRPQAVELDSARDELEEARRAQEQLDQEQRQLERARASEQRARAELAGTANTEQSNPESANLISPQALHEAESLARRLQRCQAQQADLQGKLEGMPAATFATSHVELYWQAAHALRVWLAVGGFDARLLYAVLALALAGCGAAGVGAYQVEDWIALGGSVLAGLGIAAAGWLAPSDDRRAAQQRFSETGLQAPSRWRADAVRERLQELETARADLQRRQADAERAAELRNALQQVDKELAELEAERHELGHRLGFDPQLTAAALDRFVRLSEQLDRARDEGAAAQENCRRLEKTLEKSLARVRQHLAASGVECSAATGGLVELESALHELRERSRRAESAEREYHQAHSEQQRLERELDELASERGQLFEQAGLEEDQRDELARRCEQHPAWREQRDRLREAAAVEAERRSALASDEQLLSRVDSGDEQGLQAELEQAENEAAELEKLRDQASTIRTRLHDAGRDWRLEQAMAATESRQAALHERFAEGLFAEAAQVLLDNVEREHRQHHEPQVLSDARDRFRRFTRHNYDLCLGEDFTFFARDLSQQVDRDLGELSTGTRMQLLLAVRLAWADQLEQDRESLPLFLDEALTASDEERFVLIGATLGQMAREEGRQVFYLSARRHELPLWQRAVGELPHVIDLAQIRFGAASDSAQVDFALPERDPVPAPGEQEPAYDYAQRIGVPAIAPRAPAGSMHIFHLLRDDLPLLHRLLEDWRTMTLGQFEGLLDSSAAPAVIAAQTERQRLRGRCSVARLWLNAWRYGRGTPVDRTVLERSAAVSATFIDRVAELCDGLSGDGEALIEALRNGEVRRFQSSKIDELAQWLEDEGYIDPATPLDAEARERQVVQDAAMIASVDEIRRVVAWLEAGLAAA >NZ_AP017372.2|WP_096407437.1|301724_303044_+|DNA-repair-exonuclease MPVKILASADLHLGRRPSRLPESLRSSSRELGPAGVWSKMVDAALEAEVDVVVLAGDVVEHENDFYEAYRELYQGVQRLSAGGIKVYAVVGNHDVHVLPRLAGQIENIELLGRAGEWESVTLQVKSERVTFWGWSFPRPQVTYSPLAGQRLERGAGPNIGVLHCDRGQSSSPYAPVAEGELVAAGVDCWLLGHTHAADDCSYDYKSAYSGDGLGGYLGGYLGSPVGTDPGEPGAHGPWLITIEGGQIGRIEQWPLAALRWEELRVDLSAITAAEQARERVLEVARDLDQQLQDAPPQAVGLRVVLGGQSDLGSEAVALFGDEARDHLLNGAAGTHYFIERVQLATRPSIDLAELAADSDPPGLLARQLLLLERPLDDPQRQQLIAEARRRLQERAREARWQEISPATIDDEQAAEWLRQAGLRILERLRAQQPQEEAEQ >NZ_AP017372.2|WP_096407434.1|299373_301641_-|UPF0149-family-protein 
MKDHEYPPYEESISFHSQIRRLVEDDDLQDEEVLDGLLSIVTSDLPAELEIQIDSPRPESVAELYEAIEDHGPEIPKNLFIDMAWRLASRQEDIWQRLALYWALDTLWELRHIVLLCFLSHAQDQTLSPYVRARLPIIAHWLPDEECAELVADIIDDAPTDGSSAKPVFEQQASVDGIYLSIPEGGALQHLAITGHNEGEHFLGLVELEVDSGFYSVETLCGLDQRQLSEELELLQIDSPLGEATPDIAIALLNNALASQLESEEPPPASLIDLVTIFALQGQIAPEPISTRQWLDILDPDNKLESLTPQKRGRLINQSAKWAEQFPIVDTWFDDNPESASIIDAYSSPNKRELELRRHLDVDRRPWWAEHCFRSALALQQGWNQDTWMSFAAVGKALLDGRELRKIPIFDSILSATLEAHERGCRCSGSSKLDEDLDGDPFSDPSLRPESLHIPDKLRQSLEGFYNREHAKAVWDSGFMGLHGYLFAIATHPEPISPSEWLGPLLNPDDQSQAGVAANKAEFSEIIGNLLQLYNVINSQVFEGVAELPEGCTLKSEPMDNFHPDAPVSQWARGFRTARRCFGHLLDWLDEAREDMPQNSQERENWEMEVAEVCGFSTMALEIFADRQKAERVCRGAQEDNEKTTIENMAKFAHTTFYESFSDIAIFAGTLRRDIDSDDDEQGSMGMPAGTAANASEKGEPRLDQIPFSTPSTQPPQQPGPQQPARSNKVGRNEPCPCGSGKKYKRCCGDPRNSH >NZ_AP017372.2|WP_096407432.1|297114_299355_+|copper-translocating-P-type-ATPase MAEQSLELVIEGMTCASCVARVERMATRLPGVHSASVSLPTERATISFDPAQVDSEQIIEAISKGGFKATVRRDQERSMPDSARELGSLWRDLWLAVALTIPLVAVAMGPMLLPGLDSAMQQVLAERSWLWVEWLLVTPVLLWAGRRFFARGAPALLRLHPEMNSLVMLGTSAAWLYSTTVLLWPELFPEQARGVYFEAVGVIITLVLVGRYFELKSRGQASQAIRRLLELQVPSARVIRQGREEEVEVKRLEPGDQVVVRPGERLPVDGRIIEGSSYIDESMVTGEPVPVARGVGDEVVGGTVNRSGSFTFAASRVGADTVLGQIVRMVEGAQASKPPIQSLVDRVAGWVVWAAIALAIGAALSWSLLGFGIDHALVVAAAVLLIACPCAMGLATPMAIMVGTGRGAEQGILFRRGAAFQASAGVNVVVMDKTGTLTVGRPVLTDIEPADGFMADDVLVQAAAVEGRSEHPLAEAIVAAAHARNLGVAEVADFAAVPGYGVQGKVDGEEIVVGARRFLNQLQIMVPRGLHERAAELAKQGRTPLFVAIGGRVAALLAVADPIKEGSKPAINALHSMGLRTVMLTGDDRETADAVARQIGITEVKADVLPADKEAVIGEMQSKGLRVAFVGDGINDAPALARADVGVAVGTGTDVAIEAGDVVIMAGDPRSMARGLSLARRTFRTIRQNLFWAFVYNITLMPVAAGVLYPLWGVLLSPMMAAAAMSLSSLLVVSNSLRLRRVELVR >NZ_AP017372.2|WP_096407429.1|296660_296873_+|heavy-metal-associated-domain-containing-protein MEAVKIRIEGMSCSHCEASVREVLETLPGVEQVIEVSAEAQQAQVKGRPDPALVAQRLEEIGFAGMVTDD >NZ_AP017372.2|WP_096407427.1|295798_296674_+|ion-transporter 
MFSSGDNNSQHNYSAGLRGRIQWLVETPWFQNTIIVLICINGVTLGLVTSDDIKAWAGGLIPLINQVIIGVFVVEVALRIVAWGPRFFRGPWNLFDFFVVAIALVPDGGAYSVLRALRILRLLRLISQVGRLRIIVESLLRALPGIGWIGVLLLLVYYVFAVMGTELYGESFPEFFGTVGLSMYTLFQVMTLESWSEAIARPVMEQYAGAWFYFVTFILVSAFTVLNLFIGIIVNSMQSLHWEEEEEKRMESEGKAHTEREEMLHQIKEMNAKIDRLERRLSNNGERDGSG >NZ_AP017372.2|WP_096407424.1|295239_295698_-|Hsp20/alpha-crystallin-family-protein MAMMRYDPLNTLRQLQTDLDRIFSAGSQGMLGTPSENGESASNWMPAVDIAEDDKAYHVHVDLPGVDAENIDVAMDNGMLTIKGYREDNKSEDGPNWKRVERVRGTFFRRFTLPENVDADNIQARCRNGVLEVAVPKREEQPGKRIKVEQAS >NZ_AP017372.2|WP_096410284.1|294686_295124_+|universal-stress-protein MSEIVVGLDGSEGSQRALEWAVDEARLRSTGVRAVYVIDRRYLDSELGVLVAQPASELEAEAHGIVDRAIESLSAADDVAIDKHVLHAKDHGVVGTLLDQIGADAQLLVVGSRGHGGFAGLLLGSVSHQILQHAPCPVVVVPYRR >NZ_AP017372.2|WP_096407422.1|292509_294348_-|carbon-starvation-protein-A MSAIWLAMAALALYIFGWLWYSRYLANHIYRLDPNFITPAHRYRDGVDFVPTNKWILWGHHFTSVAGAAPIIGPAIAIYWGWGPALAWVALGTVFAAGVHDFGALVLSNRHRGQSIGTMANRIIGRRAKILFLFIILILILMVNAVFAWAIANLFINNPSAVLPVILQIPVAIWIGYKVLRRGGNLLLPSIIALALMYGTAVVTTYVEWLQIDLVRWFGGEGASTFVFGLEATPASFLIWILALLGYVYVASTLPVWKLLQPRDYINAQQLIVGLAILYLGLLLTQPQVTAPIYNNAAETSWFPLLFITIACGAISGFHGLVASGTSSKQLDREPDARTVGYLGALGEGILALIAIIAVATVFASQSEFLDSYSSFAKAGDVGIGNFVEGASVLASGVGIPSEIAATIVALIIVCFAATTLDSAVRLLRYIIGELGNEYRVHHLTRRHIGTSLAIGMTALLALVPDGGQGVGSGGYLLWPLFGTSNQLLAGITLMLISLWLFRQGRNPLPTLVPMIFLLAMTIWALTQQLVLDWSGVGEADAQWLLFALGAIILGFAVWILLEAIRLFYHREELEALRDPADETAEETNDSAENGRPTGKGQQTEQTDKGES >NZ_AP017372.2|WP_096407419.1|292238_292508_-|hypothetical-protein MAQKLRERLAAFASGYDRMLRLGHEAEVRRELAEREDLIMLMLFSETMGLPNPASYYTLELYPALIESYHQWHKRMGMEHSPLDHVRCC >NZ_AP017372.2|WP_096407442.1|306887_307202_-|hypothetical-protein MQNKTFLATLLASAFALASTSALAFDAAEGEDAGDPWAEPAGEEMEQDEGEAAQDPWGQPEEEAAEDPWGQPEGEAADPFGEPAEEGDTEGGEDLDDFEGGQQW >NZ_AP017372.2|WP_096410285.1|307795_311476_+|PD40-domain-containing-protein 
MGCFTVLTAAEREAVELPRFPSLSPDGEEIVFSWGGDLWRVGSDGGEATRLTAHQFDDLYSSWSGNGQWLVFNSMRDGYLNLYRMRRDGSELSQLTYSDRFIRSPDYSEDSDGEPVITFSSYLEGDVYREQRPYSLSPQGGEHSRLLEAFGSEPRLSPNGERVVFTRGGYYHGWNRRHYQGPEARNIWVYDFASEQFSAITSRDGDDGRARWLDDETLIFMSDREDRTVNLYRVALTENGSCDSQQDSADEPCEPISAAQAERLTPFDERDVRYFDVAPDAEKAVLQVWDSLYTLDLADPEAEPVKLSLRAGEVGRDKHELRRIDRDVTEAALSPDGQVMAYIAYGRVYVRNLDEHSPTRRVSPPNHARHKDLAWSPDGLTLYFTSDADGSESIYAARVLLTRDEIEQAYQQPGYELPTAAIDELPATRAPIAEEEADQQRPDEPERQQPERQPRQEDQRPADAENDSGASGVDEDPFGPHEPPDPIDPQPDPDPADPDPMGPDPLDPQPDPVEPEPSEPVADPDTVPEDELTDEVAEDADVEGLLDPERWHEAVQFVVSPLIADEQSSDRQASPSPDGNYLAFRRGRGDLKIKDLSSGEIEKLVPGWDSSIEWRWSPDSRYIAYSQNDLNFAANIFVVPVDGSHEPVNITRHPRNDLNPRWSQDGRKLAFISNRSNETYDIYRVYLDRGLERYSPRDITRYYRDSRRAAGQLEPLPVDLDERAAKLEELEEQPAELDLENAWRRVERVTATPVNEYALEITPGGDRYTFNRSGEGLMLRSWDGSESKRLGGVASVQQLSLTGDRLVYVSGGRAGVVKLDNAKHERPDISDRLRIDLREQSLQKFHEAARVIEEGFYRPDMKGLDWQGLVADYESLIERARTPSEFSDIANQLMGELAASHMGVNNPGDYIQRREPSGRLGIEHERVELADGVSGYRVLSLVPEGPAAEEPMPLRPGDVITAVEQQRFAGDESLLQVLRGRVGKELLITFRRPDDGPNVERQALITPISFSELARLKYDDFKRRSRNKVAELSEGRLGYVHIQAMNHVSLERFQADLYAAAHGKEGLIIDVRNNGGGHTTDRILTSLMSPVHAFTLPAGADENETGHYPQDRLDAPRYTAPANMLANEKSYSNAEILAHAFRTLNRGTLVGEQTYGGVISTGSRTLIDGATVRRPFRGWYLPDGTDMEHNGAQPDIHIEQRPEDEVAGRDRQLEKAVEDLLERLDS >NZ_AP017372.2|WP_162549277.1|311614_311755_+|hypothetical-protein MNPSLEASWRHPWRQDLHTGVAGCGSRGVLEATPETDLRCPNMCAY >NZ_AP017372.2|WP_096407444.1|311767_312817_-|alpha/beta-fold-hydrolase MSIKRFFMRMPTPILVGLLIVIILLGASVACSLGSGGQEQAAETSADGENADPPRFPRWDVEGRDWPGRESSRFVEINGINWHYQVYGDGPVLLLVHGTAAASHSWHPLIAELAEHFTVINLDLPGHGFTSRPDAERFVMTEMAADLGDLLDHIGYQPELVVGHSAGAALLARMVVDGHISPQALISINGSFIRRQGPIGRFFAPVGRWIFESDRAANFFAGRVEDQQTVADALERMGTNLDERQVELYTRLVRTPGHIGSALRMMARWQLYELEPHLSKLDLPVVLVAGEEDGLVDPDEAVDVANRMPRASVIRLDGLGHFAHEEDPARTLEIIFNIADAKLQESFAR >NZ_AP017372.2|WP_096407447.1|312818_313982_-|methionine-adenosyltransferase 
MDEHYLFTSESVSQAHPDKIADQISDTILDAVLEADPHGRVACETAVKTGMVLLFGELTTAAEVDFETLVRDKVCELGYNHSQLGFDGNTCAVINALGQQSPDIALGVDRTDPEQQGAGDQGLMFGYATDETETLMPAPIQYAHRLMQRHSQLLQETTLQWLRPDAKAQVTFSYADGQPQAIDTVVLSTQHAADVDLETVREAVIEQIVKPVLPQQWLSAETRFLINPTGRFVVGGPLGDAGLTGRKVVVDTYGGVARVGGGCFSGKDPSKVDRSAAYACRYVAKNIVAAGLARRCEVQLSYAIGIAEPVSINVETFGTGKLPRAKLVELVRNHFDLRPYGIIRSLDLLRPIYAKTAAHGHFGREETGFTWERTDMAQTLADSAANI >NZ_AP017372.2|WP_096407449.1|314215_315226_+|type-I-glyceraldehyde-3-phosphate-dehydrogenase MTINVAINGYGRIGRNVLRALYESGRNDEIRIVAINDLGDAETNAHLTRYDTAHGRFPGDVKVEGGDLVVNGDRIKVCAERNPADLPWGDLGVDVVMECTGLFTSKEKAGAHIQAGAKKVLISAPGGKDVDGTVVYGVNQGVLTSGHEVISNASCTTNCLAPMVKAIQDKIGVEQGLMTTIHAYTNDQVLTDVYHSDLRRARSATHSQIPTKTGAAAAVGLVLPELNGKLDGFAIRVPTINVSLVDLTFTASRDTSVDEVNQVVKAAAGGELSGVLAYNEDPLVSIDFNHNPASSVFDATLTRAMGSRLIKACAWYDNEWGFSNRMLDTTVAMMRA >NZ_AP017372.2|WP_096407452.1|315358_316078_-|Crp/Fnr-family-transcriptional-regulator MEPNDCRNCEIRSLALFGEISSEGVDQFAEQTYQVQYPAGATIYEQGDKPEAAFTLREGIIKLVRNSGTDRSQIVRLLVKGDLMGIEGIFEEPYRQSAIALTPVRVCYLPLPMLDRMRTEEPRFTEALLGRWRRALNEVEELAVELGTRKAEERVAAFLLHWQEKAEHDDDNWMPFPLSRTELGQMLGLRVETVSRVLARWKREGIFEERSSRLRLLEPDCLDQLLAQGTETATAAHRE >NZ_AP017372.2|WP_096407455.1|316667_317558_-|phosphoribosylaminoimidazolesuccinocarboxamide-synthase MQNNNALYSSTLTSLELLHSGKVRDIYAIDEDRLMIVATDRLSAFDVILPDPIPGKGAVLTRLSNFWFRHTAGIADNHLLDDDPHEFLTPQESELLGDRAVVVKRLRPLPVEAIVRGYLAGSGWQSYQQDGTVSGVALPAGLQQSQRLPQPIFTPSTKAQLGEHDEAISFAQTAELIGEELAEQIRTISLRVYEHACIHAEKCGLIIADTKLEFGLGEDGQPVIIDELLTPDSSRFWPADAWQPGTTPPAFDKQFIRDHLETLGWNKEPPAPSLSADVIAKTAEKYSEAERRLVVS >NZ_AP017372.2|WP_096407457.1|317581_319036_-|deoxyribodipyrimidine-photo-lyase 
MPQTAIIWLRRDLRLQDQPAFAAATKIADYVLPLYIHAPHEERPQAGAASRWWLHHSLSSLRQELRERGSDLFLDSGSSTSTLMRWAQANSASLVLCTAISEPWAEERDNKTAAELAQAGIELRITADGLLTDPHAIRNRSNTPYRAFTPFWRQVRGQLNPPQAKPAPTSLPPPPGHAHNSSAELEQLNLLDRVRWYDKFADYWQPGSTAASHRLARLSPEFFAAYPDERDFPAQPGTSLLSPHIHFGELSIREVWHQAAHSQPENHSGPANHSGVETYLAELGWREFAYHLLTQQPNLHSYPVDRRFAAMPWRDDPDNSLYSAWHLGQTGIPLVDAGMKELWATGWMHNRVRMVVGSFLVKNLRLPWQLGEEYFRDTLVDWDLASNSMGWQWVAGCGADAAPYFRIFNPVRQGERFDPEGEYVRRWLPQLGALNKKQIHQPWTAPAATLDSAGIRLGKDYPWPITDLQSSRREALEAFQSIKG >NZ_AP017372.2|WP_096407462.1|319400_320219_+|co-chaperone-DjlA MQLYRIIQNWLGRILGALAGGLAAGPLGIALWLGIVLGFLAGYGVDVWVRVTQVVGLVWSRCGLGFDQRVFIGTTAMVMGYVAKHDGRVSEAEISAARRVLNELPLDELGRKRAITVFNRGKDPGAPLRWILLMLRTVGRRRPEELARFLDFQLRVAAADGLPDAGREKLLRWIWRHVGVSGVDLDARLDGMRRGKLNRTVRPTIDHAYKLLGVSRNASSEQVRKAYRRAISKSHPDRMVGNGHSEQEIEEASERTRQIRAAYEAIREVRGS |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_2 | 346492-346615 | Orphan |
NA
Consensus repeat of NZ_AP017372_2
|
2 spacers
spacers of NZ_AP017372_2
>2.1|346529|14|NZ_AP017372|PILER-CR CCCTCTCCAATCCA >2.2|346580|17|NZ_AP017372|PILER-CR GCAAATCCCCCTGATCA |
CRISPR arrays and Neighbor proteins around NZ_AP017372_2
The CRISPR arrays of NZ_AP017372_2 >merge|NZ_AP017372|2|346492-346615|PILER-CR TTGTGCCTGTCCCCTTCCTCACTGTGCCTGTCCCTCTCCAATCCATCTAAATCTCTGTGCCTGTCCCCCTGCAAATCCCCCTGATCATTGTGCCTGTCCCTTTGATCTCTGTGCCTGTCCCCCT >NZ_AP017372|2|1|346492-346615|PILER-CR TTGTGCCTGTCCCCTTCCTCACTGTGCCTGTCCCTCT CCAATCCATCTAAA TCTCTGTGCCTGTCCCCCTGCAAATCCCCCTGATCAT TGTGCCTGTCCCTTTGA TCTCTGTGCCTGTCCCCCT
>NZ_AP017372.2|WP_109962906.1|344549_346085_-|glycogen-synthase-GlgA MPAYPSARERLEGPIKYLSLGSGAGSVKGQAQTSSTKGQAQSEGQAQISQSEGQAPGGGQAQPKGQAPTPSKGQAQSPAPSKGQAQSSNRYHKQKIIEGRLPGSEVSVWLLDDPELFERVGSPYATASGEPWPDNHLRFYWLSRVAAAIAAGEVLDWQADILHANDWQSALAPVFLQDYSESHQERPRTVFSIHNLAYRGIFSADVFAQLELPAAMWNPERLEFYGELAFIKGALTLSDAITTVSPTYAREIQTPAFGWGLDGLLRSRSSDLHGIINGVDTTTWDPATDPHLAANYSAPDPQAKAKNRQAIAVEIGLDDDPQSPLLGFIGRLVEQKGIDLILGALPRLLASGARLAILGSGDNTLERALLQAAQAHPGRVGVSIGYDEGQAHRIEAGSDIFLMPSRFEPCGLNQLYSLRYGTPPLVNPTGGLADTVLDVDAHAGGNGFCTAAADAGSLAATVERALSYWQDQEAWQKIQARGMSADYSWDRSADAYVDLYERIRATGWQRR >NZ_AP017372.2|WP_096407499.1|343968_344565_-|DedA-family-protein MAAEIIIEILEHLGLIGIFIAMIFIAPETLMPFLGYAASQGDYHPLAALAAASLGSTFGSTLIYYAARWLDRERMIWWLTLGGRWYLFKRSDIAAMDKVFSRHGALIVFFGRFLPTVRSVVSVPAGLLPMPMPKFLLFTFLGSTAWNSLLVLGGYTAGANWERMVEYLGTFGTLITFAFIALIIGFVLFRLRTLTLGK >NZ_AP017372.2|WP_162549281.1|343528_343972_-|rhodanese-like-domain-containing-protein MSLNSMDVNQKYRCPAIALLLTIAFLALPHPAHSNYYSVAPDPGDLLQFNGVLVDIREPYEWQQTGIVEGSKTITYRHTEDFIEHLEPHLNQELRPIALICRTGNRTRQAAHLLSQKVDAPVINIEGGIFRLMHLGYRPVPYQDEEP >NZ_AP017372.2|WP_096407493.1|340689_341673_-|Rpn-family-recombination-promoting-nuclease/putative-transposase MTNNHHDPAYKRFFSQPVMIKDLLVEYVGEDWVKELDFSTLEKQNGSYAADDYRDRHDDLIWRVRWGKEWLYVYLLLEFQSDIDQFMAVRMMTYLGLLYQDLIAQGKLTSDGRLPPVLPVVLYNGQRRWSAATDIDSLIERIPGGLSAYRPQMRYMLLDEGALLSKDNSPELHSLVHALFRLEHSRTPDDMRSIVATLSKWLVKPEQRPIRREFAIWIQRVLLRRKPFADSKLFDWEEVQDLEEVNEMLAERMNEWEREWKQEGRLEGRQEGLLAGEGKSLLLLLEQKFGKEAAEQYRPRVEQADEPTIQQWLINILTANSIEEVFR >NZ_AP017372.2|WP_162549280.1|337940_338105_-|hypothetical-protein MAGEGKSLLLLLEQKFGKEAAEQYRPRVEQADEPTIQQWLINILTANSIEEVFR >NZ_AP017372.2|WP_096407491.1|336969_337461_-|ammonia-forming-cytochrome-c-nitrite-reductase-subunit-c552 MTDLLDAKGDDFWGSNFHDYRKAIDQEEHTIGCTNCHDPDDKMRLTLTSVPLKEYLERQGKEWQEKSTQKMRSLVCAQCHSEYYFETEEHGTAGKVHFPWDNGKDPLDMYEFKSDGDPERDGFAGQFVDWTHAVSKAPMLKVQHPEYEMYQVSIAGDRFPASA >NZ_AP017372.2|WP_096407488.1|336507_336900_+|hypothetical-protein 
MTSRSATITAILVGFAIAISGCQALWPAGDAKPELGEQQEIHQTLNAMDPIEYGEYKLDIEFGSKGWIAEREGEYFMGGSLDTSDTSEGMTLTLQQTHQYNEQIGWVELDGRGTELVLEYQEDPQSLNLQ >NZ_AP017372.2|WP_162549279.1|333660_335085_-|WD40-repeat-domain-containing-protein MATLLGVAQGASSERRLWWITSTIVGVVALPLALGLYQLDQNRTEMDSGLWSEARELDSAESYADYLENCTTCEREGRAEEKLQAAQDDERLWSQARDSDSEESYADYLENCTTCERKERAEEKLQAAQDDERLWSQARDSDSGESYADYLENCTTCERKERAEEKLQAAQEASEKWVFEGHDLKVKDVTVADHTVYSGGEDGIIRAIDSDTGEEQRVFEGHNGTIHSLAVSGETVYSSDSRGIVRATYAGDSGHTIGAYGAEAGEELWVFDGHDGMVRGVATDGNTVYSAGASMPLGGDEDETVRAILEGVEYWVFEGHDREVRDVAVDDDTVYSASADGTVRAIDSSTGNEHWVFEGHGASPVLGVEADGDTVFSAGMDNTVRAIDADTGSEQWIFEGHSSGVRSVAASGDTVYSASGGRGGDNSVRAIDAHTGKEQWVFEAHEGTVNGVAVKGDTVYSASDDGTVRAITPP >NZ_AP017372.2|WP_096407483.1|332551_333151_-|hypothetical-protein MRKITFPLAALAAPLSLINLSSASASSLEGPYIGLGTAIATSYHYELEADVTTGLLNQSDTNTVELRSGHLGSMGSSFDILLGGGATTESIYYGFELFYSAGNSDEELLEADLESDDVEATATVEVQDGYGVSLRLGYLHTSRSMAYLKTVYTEREFEGTLDIRAGDNSESFSESGNSVDSELALVSSCSVRTCLCHFV >NZ_AP017372.2|WP_096407481.1|331436_331919_-|RDD-family-protein MSEAQNHQALTQESDKKLYGGFWIRVGAALIDMLVLLIPMLLLSYLLLVLIAPTTHEEELFYQGIDSVLAFAIWLVYTAGFHSSTWQATLGKRALGLKVTSLEGNRISFGHAAGRYVAEILNVLTLGIGYIMVGLTSRKQGLHDMVAGTYVVRTEDRGPF >NZ_AP017372.2|WP_096407506.1|348458_349724_+|glucose-1-phosphate-adenylyltransferase MQENASPRYVSRLTRNTLALILAGGRGTRLKHLTQWRAKPAVPFGGKFRIIDFPLSNCVNSGIRRIGVLTQYKAHSLIRHIRQGWSSLRADFSEFVELLPAQQRIETSWYLGTADAVYQSLDIVRMHNPELVLILAGDHVYKMDYGPLLAYHVEKGADVTVGCIEVPLDEASAFGLMNINEDNQVVRFEEKPADPTPMPGSQTHSLASMGIYVFNREFMFKALGVDARTSSEHDFGKDIIPSLIDKAQVYAYPFRDPATGDQSYWRDVGTVDAFWRANLELVEVTPELNLCDREWPIWTFQEQLPPAKFVFDEDQRRGMVVDSMVSGGCIVAGAYLRRSVLFSSVVVDERTKVQDSVILPEARIEPGCRISNAVIDKHCRIEAGTVIGEDPEEDARRFHVTDSGVVLVTPDMLGQEIHVVY >NZ_AP017372.2|WP_096407509.1|349710_351477_+|glycoside-hydrolase 
MLSIEKSSTTGPSPDKVRVVLCWHMHQPSYVNPASGDYELPWTYLHGIKDYTDMAAHLEANPQARAVVNFSPILIEQIEDYAEQIKGFLASGERLRDPLLNALAQPVISADPEHRRSILEQCRRINRPRLVDPYPQYRQLMEFADLLDQQPTMLRYLDESFHEDLVTWYHLAWLGETVRGSEPLAKRLIEKGHGYSVHERRELLALIGEQLSGLLPRYRKLAEQGRVELSMTPYGHPILPLLQDLQSALEAWPDAPMPEQVTAYPGGEERARWHLEHGREVFERAFGQAPHGCWPSEGALSEPTVRLLSECGFKWAASGSGVLENSLNGNGVEEQQRNGHWHRAYIFQGEASGAGENSVEPTRCFFRDDGLSDAIGFVYSDWHGDDAVANLVVKLEEIAVASKDPGNTVISIIMDGENAWEHYPANGYYFLSGLYEKLSEHPRLHLTTFAEAIEQVEPIALDRLVAGSWVYGTLSTWIGEVDKNRAWELLVAAKQAYDSQIDKLEGPARDRAERQLAICESSDWFWWFGDYNPPDVVRDFDHLFRIQLAALYQCLGLEPPQELDHRFTHIGTGSPQMGGVMRQGRLES >NZ_AP017372.2|WP_096407511.1|351473_353033_+|4-alpha-glucanotransferase MSGRGLTEQRRAGVLAHLSSLPGGPGNGDLGAHSRYFVDWLANCGFSVWQMLPLGPTHEDLCPYQCLSVHAADPGFIDLQQLVEAGYLSAEQAIPPTDLSRSELLNWRYQRLRDARAGFVARHGQNGKGQAQGEEGQAPPPPPPPETNSELRELRQFRACHSHWLEDYALYMALRRENEFRPWWEWPQPLRDRQPQALEEARERLGEELNQVVFEQFIFFRQWAALRAYAAEKGVLLFGDMPIFVAHDSAEVWAQREYFDLGADGQPLSVAGVPPDYFAADGQRWGNPHYNWQRMAEDGFKWWLQRLETQLELFDFVRLDHFRGLAAYWSIPVEAETARDGHWEPAPGHDLLSAVAQRFGQIPLVAEDLGIITDDVVALREQFALPGMKVLQFAFDSDSANPYLPHNHTADSVVYTGTHDNDTTMGWYADLEPWVTERMHSYLGHPNEPMPWPLVRASLASVSGLAILPLQDLLALGSDHRMNIPGVAEGNWRWRFEWEWLPDDLSGWLWELNYLYGRV >NZ_AP017372.2|WP_096407514.1|353393_355943_-|alpha-glucan-family-phosphorylase 
MKENIFTLEVQPNIPPNLSRLEELAEDLYYSWDRHVRALFVQLDPELWEACGHNPKVFLRRIAQHKLEEAAQDEAYIADYNRTLSAYDTYHEQAALTSKVAPYIDPDNDLVAYFCAEFGFHESVPLYSGGLGILAGDHCKAASDLRLPLVAVGLLYRQGYFSQTIDHEGNQQAHYAPSSISELPITPCLDDDGEQVQVSVDAPGREIHLRVWQMRAGHVLIYLLDSEVPENDAADRAITYQLYGGDAHMRILQELCLGLGGVRALRKLGISPSVWHINEGHSAFQIVERCRELISEGYDSATAIEAVASETVFTTHTPVPAGHDIFEPEMVAEHLAPNLADTDIPIEDILALGNGQKGFDMTSLALRGSRFHNGVSAIHGGVASQMEQHIWPEIPAQENPITSITNGIHVPTFLAQEWANLFDQRWHAWRNQLLNEDFWKVVDELPDHRFWSMRRSLKSELLRDVYQRVLKRCQRNGMSDAMIERMTSNISNPDPDLLVIGFARRFATYKRALLIFYEIDRLKELLNDPQRPVILIFAGKAHPHDEKGQAMIRRIHELSLDPDLIGKIILLEDYDMAQARKLVTGVDVWLNNPEYPLEACGTSGQKAAINGVLNLSVLDGWWDEGYEKGNGWAILPHSAGFDPEYRDREEARDLLNLLSDEVIPLYFNRGNSGYATEWVKMSKAAMRTTLPRFNAQRMVMDYVSELYAPARAQSKILQADSLSGAQELARWKERVREHWGGTWLERIDAAPTSLLHGESLPIRVKAHLNGLSCDDVTIECRFSAMEEPRDAASTVRYQLQPEGETEDGMPVFAIDIEPRFDGLQYYRICMYPTHPLASHPFEFGGLRWL >NZ_AP017372.2|WP_096407516.1|356933_357164_-|Rpn-family-recombination-promoting-nuclease/putative-transposase MADHPANPRDALLKATLETPERAAVVLRESLPDKVRERLSDDLPTPLPGSYVDPSPQETHSDRLFEAQMMASQPGL >NZ_AP017372.2|WP_096407519.1|357185_357506_-|hypothetical-protein MRQNTALKILNSQADEGRAVFTRRDLDSLFRSDRTKARKAGIARLVEAGWLKPAARGGGVYVYPPGLPQDGYTPERIARALRRGEYNYISLESALSEWGALTRNRQ >NZ_AP017372.2|WP_096407521.1|358004_358421_-|PIN-domain-containing-protein MRVFLDASAIIYLLEGDGQTRDATRQVLLELERGSDETPVLMASALSRLECRVRPLRESDTQALERLDGFFDDPGLSVIALDTAVLDRATELRAQYRLRTPDAIQAACLLTVDPRGAFVTGDGDFEKVPGLHVYRIPH >NZ_AP017372.2|WP_096410287.1|358420_358696_-|type-II-toxin-antitoxin-system-Phd/YefM-family-antitoxin MENVISAQEIKRRGISAVDQALKNGPVHVIQRNRPRYVILSEESYQRLSEGAQARKRLWDRLLGDDEAYGAARNRAELDRELQSEREGWRD >NZ_AP017372.2|WP_096407525.1|359166_359364_+|DUF2283-domain-containing-protein MKLQYFEDTDTLYIEFQSRAISETRDLDENTILDLDSEGNVCAITFEHASQRTDVNHLHVEGLAA >NZ_AP017372.2|WP_162549282.1|359702_360023_-|hypothetical-protein MYSYIQEKDVRQALEQTRPDRAEELVMTVAEEWIKRGEKRGEKRGQKRGSHQTATKTLLRQIERKFGAEAKEASRARVERAALGELEMWLDRILDAERIEDVFAED |
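
The spacer records in these tables use FASTA-style headers such as `>2.1|346529|14|NZ_AP017372|PILER-CR`. The sketch below shows one way to split such a cell into structured records. The field meanings (spacer ID, start coordinate, spacer length, contig, detecting tools) are inferred from the listed values rather than from any documented format, and the names `Spacer` and `parse_spacers` are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class Spacer(NamedTuple):
    spacer_id: str          # e.g. "2.1" (array number, then spacer index)
    start: int              # assumed: start coordinate on the contig
    length: int             # assumed: spacer length in nucleotides
    contig: str             # e.g. "NZ_AP017372"
    tools: Tuple[str, ...]  # tools that reported the spacer
    sequence: str


def parse_spacers(cell: str) -> List[Spacer]:
    """Parse a flattened 'Spacer_info' cell into Spacer records."""
    records = []
    # Each record starts with '>'; the first split element is empty.
    for chunk in cell.split(">")[1:]:
        header, _, seq = chunk.strip().partition(" ")
        spacer_id, start, length, contig, tools = header.split("|")
        records.append(
            Spacer(
                spacer_id,
                int(start),
                int(length),
                contig,
                tuple(tools.split(",")),
                seq.strip(" |\n"),  # drop the trailing table-cell delimiter
            )
        )
    return records


cell = (
    ">2.1|346529|14|NZ_AP017372|PILER-CR CCCTCTCCAATCCA "
    ">2.2|346580|17|NZ_AP017372|PILER-CR GCAAATCCCCCTGATCA |"
)
spacers = parse_spacers(cell)
for s in spacers:
    # Sanity check: the declared length should match the sequence length.
    print(s.spacer_id, s.start, s.length == len(s.sequence))
```

For the two NZ_AP017372_2 spacers, the declared length agrees with the sequence length, which supports reading the third header field as the spacer length.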
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_3 | 540570-540664 | Orphan |
NA
Consensus repeat of NZ_AP017372_3
|
1 spacer
spacers of NZ_AP017372_3
>3.1|540594|47|NZ_AP017372|CRISPRCasFinder CGGATGGTGGCGCCGGCGAAGTTTTAGAGGTTCCCTCCAGGAGTGGG |
CRISPR arrays and Neighbor proteins around NZ_AP017372_3
The CRISPR arrays of NZ_AP017372_3 >merge|NZ_AP017372|3|540570-540664|CRISPRCasFinder TTCACGGCGTCCTCCACACCGGGGCGGATGGTGGCGCCGGCGAAGTTTTAGAGGTTCCCTCCAGGAGTGGGTTCACGGCGTCTTCCACACCGGGG >NZ_AP017372|3|2|540570-540664|CRISPRCasFinder TTCACGGCGTCCTCCACACCGGGG CGGATGGTGGCGCCGGCGAAGTTTTAGAGGTTCCCTCCAGGAGTGGG TTCACGGCGTCTTCCACACCGGGG
>NZ_AP017372.2|WP_096407957.1|539517_540375_-|bifunctional-methylenetetrahydrofolate-dehydrogenase/methenyltetrahydrofolate-cyclohydrolase-FolD MPAQILDGKAIAAERRSMVARSVDERSAQGKRPPGLAVILVGSDPASAVYVRNKRRACDEAGLLSRSYDLPAETSEAELLQQIDQLNADEQIDGILVQLPLPGHINAQTVIERIDPQKDVDGFHPENMGRLITRLPGLRPCTPHGVMTLLEHTGVDLAGLDAVVIGQSNIVGRPMALELLNARCTITICHSRTKDLAARVNAADLVVASVGSPGLVRGDWIARGAIVIDVGINRRADGKLTGDVDFDEACEKASWITPVPGGVGPMTVATLLENTLEAAKLREQI >NZ_AP017372.2|WP_096407955.1|539069_539450_-|PilZ-domain-containing-protein MSANERRHFSRVEFQAPAQLATQAGSVHDVEILDISMRGALVRLSSGTLPPIELCSEGNRFTLKISLSEIDTIEMEVEAAHCHEHEIGLRCVRIDLDSIMHLRSLIEANLGDPDLVNRELANLIED >NZ_AP017372.2|WP_096407951.1|537644_539030_+|DNA-repair-protein-RadA MSRRTRPHYVCQDCGASQPQWVGQCPECGEWNTLEEHIEPARNVAANPVTAARSPGLAASAGEVSALAEVSTAPEPRLSTSVDELDRVLGGGLVPGSVVLIGGDPGIGKSTLLLQTLAALSRHYPSLYATGEESLQQVALRARRLGVADAPLQLMAETSVETILATAQQLRPEALVIDSIQTVHSAALSSAPGSVSQVRDSAAQLVRWAKETGTALILVGHVTKEGAIAGPRVLEHMVDTVLYFESDQGSRYRLLRAVKNRFGAANELGLFAMTEDGLRQVRNPSAIFLSRHECAVSGSAIVVSREGSRPLLLEVQALVADSSLAQPRRVAVGIEQSRLSLLLAVLQRHGGVVTAGEDVFINVVGGVRIHETAGDLPVLAAVLSSMRNRPLPMNSVLFGELGLAGEVRPVPGGEERLAEAAKHGFTLAVVPEKNAPRKGIKGMEIHPVRRLEQAFEVLFSN >NZ_AP017372.2|WP_096407949.1|536536_537634_+|alanine-racemase MSREACALIDLDAVRDNLRVARAVAARSRVMAVIKSDGYGHGLVRVAQAIGEDVDAFAVTDLDEALALRRAGFNQRIVLLQGPFEAAEIPLAAAEQLELVIHSAWQIEAIEQAQVSAALQLWLKVDTGMHRLGFQADEVAAAWRRLTAIPAHTVNPEIGFMTHLACADDRDDTMTDRQIEAFEEACKDFGGPLSAANSAGLLGWLESHFDWVRPGIMLYGVSPFSDRQPLDFPLRPAMTLRGRIIAVKHLGAGQKVGYGATWSCPEDMPIGIVSIGYGDGYPRHAMHGTPVDVAGRRASLVGRISMDMLAVDLRGMVTLPAPGDPVTLWGEQPRPESVADSAGTIAYELFCRTPSRVRRVYLDDQ >NZ_AP017372.2|WP_096407946.1|535154_536540_+|replicative-DNA-helicase 
MQHDGAATNAESLKVPPHDLEAEQAVLGGLMLDNSAWDQIADRLHEEDFYRREHRLVYRAMAELADGGQPMDVVTLSGRLRQQGRLDDAGGLQYLGGISRETPSAANIRAYADIVRERSVLRQLIRAGSDVAAAAFEPQGRDSETLLDYAEQTIFAIAEQTGRHRQGFVGMRELMPQVIDRIDALYRTQEAVTGLPTGFDDLDHLTSGLQNGDLVIVAGRPSMGKTTFAMNIVEHVVMHRKLPVAVFSMEMPAEALAMRMLASLGRVHLQRVRSGRLQDDDWPRLTSTMSLLAEAPLFVDDSPGLSPTDVRARSRRLQREHDGLGLIVVDYLQLMQSSGLRENRAGELSEISRGLKALAKELNAPVIALSQLNRSLEQRPNKRPIMSDLRESGAIEQDADLIAFIYRDEVYHEDSPDKGVAELIIGKQRQGPIGTVRLTFLGEYTRFENFAEDIYGGGIPG >NZ_AP017372.2|WP_096407944.1|534522_534975_+|50S-ribosomal-protein-L9 MELILLEKVANLGDLGDRVRVRPGFGRNYLLPYGKAKPATAENIRYFEERRAELEKQAREALEAAQSRLEKLQATPLTIKAKSGEQGKLFGSVAPGDIAAAAEQAGVELAKREVRMPDGPIRVTGEYDVQVQLHTDVVGAVRVVVEGQEP >NZ_AP017372.2|WP_096407941.1|533562_534447_+|hypothetical-protein MKAFAAFILRGPFQAATVMIAASLLPFLAVIAMGVLSLVTLRQGLQQGLFAAALAGGMLAALLWAMAGTYEPALRIVIEQWLPVLVLAEVLRRTVSLPLTLFVWAGLGALTVAGFHVVVDDPMAHWLAVTEQFLAATGAEQLPEETEAFLREDLLPIMTGLWVVNLMSVVLIGLLLGRWVQAIMFNPGGLREEFYRLDLGRSAAFVALVVLLAAVFSGPGPIYDLALVLAAAFIVQALAATHALMGKRNWSAAWLVPVYLVIPFLYMPMALLGIGEALFQWRRRLLGDGSGGAA >NZ_AP017372.2|WP_096407938.1|533325_533550_+|30S-ribosomal-protein-S18 MGRFFRRRKYCKFTAEGVKEIDYKDLNTLKNYITDTGKIVPSRITGTNARYQRQLSRAIKRARYLALLPYTDRH >NZ_AP017372.2|WP_096407936.1|532883_533243_+|30S-ribosomal-protein-S6 MRHYEIVFMVHPDQSDQVPAMLERYRSIVESNGGTIHRLEDWGRRQLAYPINKLIKAHYVLMNVECGQEELDELTSAFRFNDAVIRNMVLARDEAVTEPSPLLKGGEKREERRDYAEEE >NZ_AP017372.2|WP_096407933.1|531719_532502_+|23S-rRNA-(guanosine(2251)-2'-O)-methyltransferase-RlmB MANRLGSEAEERAKSHSLIYGRHPVREAATYDPAGVVAIWVDQALRRDPKLERLFNKLKKQGVTFYRVKRRELDEMVGGANHQGVVLSYRGAAVRGEAELNDLLDSARDPLLLVLDRVQDPHNLGACLRSAAAAGAAGVVAPRDHAASLSPAVHKVAAGAVQSVPFFQVTNLARALANMQQAGLVTIGAAGDGAQTLYSLELRGAIALVMGGESEGLRRLTRKNCDYVAAIPMPGSIESLNVAVAAGVVLFEAVRQRSNC >NZ_AP017372.2|WP_162549320.1|541344_542148_+|phosphodiesterase 
MLCPDEPLRVLHISDLHLGDDPQWSYQGVRPWERLTEALIGVDPDCLGAQGLSRAPFDLVVVTGDLAHDQGESVYAKLSEQLAALKVPVLVLPGNHDDPEGFQRIFTDSGQVSYCREYFAGGWRILCLNSQVPGQITGRLGGQQLNALEQDLQQNQDLPTLIALHHAPVEVGTPWLDVQRLEDGESFLELVERYPQVRGVVFGHVHQDFAERRQSGLRLLAAPAVSIQFEPGSAVFAVEPSPPGVRWLELCSNGSLQSEVWWLEGCD >NZ_AP017372.2|WP_179948771.1|542204_542891_-|Bax-inhibitor-1/YccA-family-protein MSEQYSNSRAATATGREQAQQQALATNRLIRNTYILLAITLAFSAVTAGIAVLTDAPRLNIFVVLGGFFGLLFLTQYLRNSAWGLASIFALTGFMGYTLGPVINLYLGLPNGGETVMMAFGGTAAIFLGLSGYALASRRDFSFMRGFLFAGILVAFVAAIAAYFLQMPGLSLAVSVMFMILMSGLILYQTSEMVNGGETNYIMATITLYIAIYNLFTSLLHLLGLAND >NZ_AP017372.2|WP_096407963.1|543036_543231_-|hypothetical-protein MADEESPSTQQHGSLLSSSHNRHNSYEHKIEAIRDLIEDDPERAVAVIKLWLEGTQNSGKEEKS >NZ_AP017372.2|WP_096407965.1|543357_543828_-|flagellar-basal-body-associated-FliL-family-protein MPKSIYVLLTAATVLLTITLGFVLAIATGWITPPGMQQYDSDPASTEVDYDDAQYVELEPSLTVNFGDGERLRYLEADVQVQTSQDEVVEALERHSAAIRDELIMLFSEQSPEDLNDVEGREELRNRSEEIINGILEKRGVEGRIDDVFFTEFVMQ >NZ_AP017372.2|WP_096407968.1|543860_545045_-|hypothetical-protein MTLFAIVVLILLLALREVCSNRIRRLSHAQPHSTRRWRIARSWHTFALGLAAAAFLPTFVQQPELPILSEAHSLLSHTWPLFLIASGASVGLAIRIVNPQIKREIRRRQASIERRNRAQYGMNPERLSRGLRMWILDHGPAFDYRFDVETPDGVGNIVIGAEEGNFMIYVLPAEHAREGYATALQRSSKIAEHLDARGIVWIPDDKIKKAQTGDEHLAFVMRGSIVEVFRWIERTNEARRRNRERQEQRRNRALRSAQGEGIQWGSITEAEAMKKHDREAWERFARKTPIHPDMRDRVYRRHGARCAYCGFTMDPGRGQWEVIVSDYDHICRYPAKTRLVPYGIKPATSYEMPDCEQCHIEAPGHFEACISRLAPIHTRCKRERQEGKQDTAAD >NZ_AP017372.2|WP_096410302.1|545377_546673_+|trigger-factor MQVSIETTEGLGRRMTVQVPAERVEQEIERRLKDMAGRMKMDGFRPGKVPVKMVRKQYGEHVRQEVVNELLRQTYSDALKEQDLRPAGAPQVTPKQDESGQDLIYEASFEVLPQIEITGIEQIKVERPQVEVTDADVDNVLDRLRQQHADYEEVDRPAAQGDRVEIDFHGTVDGEEFQGNKAEDAAIIIGAGQLPEDFEQALVGAAAGTELTVEHTFPQGGDSPVAGKTAAFQVSVKRVEQANLPELDDAFAARLGVESGLNDLRDAVRANLENERDQAVRQRVKRQVMDQLAELNPVELPKSLIDGEIQALREQSGGASEGGMPETERDAYEEIARRRVQLGLLVNELVRSQQIQLDKERMMRELRQMAAQSGQDPNEALQQYAQNRRMMESLEASIIEEQAVDWLLEQVQTEERGMSFDELLNRDGNVS 
>NZ_AP017372.2|WP_096407970.1|546695_547340_+|ATP-dependent-Clp-endopeptidase-proteolytic-subunit-ClpP MSVEQHSSAPDIYNTGLVPMVVEQSPRGERAYDIFSRLLKERVIFLVGPVEDYQANLLVAQLLFLESENPDKDVHLYINSPGGSVTAGLAIYDTMQFIKPDVATLCVGQAASMGALLLAAGAEGKRYALPNSRMMIHQPLGGFQGQATDIDIHAREILSMRERLNAILSRHTGQDIETIRNDTDRDNFMTAEAAANYGLVDKVLESRTSSGKPA >NZ_AP017372.2|WP_096407973.1|547454_548738_+|ATP-dependent-Clp-protease-ATP-binding-subunit-ClpX MSDRKQGKGEDSGKLLYCSFCGKSQHEVRKLIAGPSVFICDECVDLCNDIIREELQESAEAEGEGLPKPHEINRALDEYVVGQEHAKKVLSVAVYNHYKRLEGHVDRDEVELTKSNILLIGPTGSGKTLLAETMARLLNVPFTIADATTLTEAGYVGEDVENIIQKLLQKCDYDVEKAQHGIVYIDEIDKVSRKADNPSITRDVSGEGVQQALLKLIEGTTASVPPQGGRKHPQQEFVQVDTTNILFVCGGAFAGLDKVIRERSEKGGIGFSAEIKGEKERASVGDTLRTVEPSDLVNYGLIPEFVGRLPVVATLDELDEEALVEILKEPKNALVKQYRKLFEMEGVELDLRDDALRAVANKAMERKTGARGLRTIIEQVLLETMYELPSMDNVSKVVVDESVIKGENQPYIVYATPECTKAASSDE >NZ_AP017372.2|WP_096407976.1|548873_551336_+|endopeptidase-La MVSKAQSPQQTQSENHTQAQAPLLPLRDVVVYPHMVIPLFVGRERSINALESAMESDKRIFLVAQRNAEVDEPAGGDLYSYGTVATILQMLKLPDGTVKVLVEGGERAQLVELLESDDYLAAKLSAVAEPESDPEDRELEVLARSAMSHFEQYVKLNKKIPPEILSSLAGIEEPGRLADTIAAHMALKVEEKQAILEMEKPSQRLEHLMGLIESEIDVLQLEKRIRGRVKQQMEKSQREYYLNEQMKAIQKELGELEDVPNEVEELERKIEESGMPQQALDKSRQELNKLKMMSPMSAEATVVRNYLDWIVSLPWKEKSRVRLDMKRAQKVLDEDHYGLDKVKERILEYLAVQRRVRKLKGPILCLVGPPGVGKTSLGQSIARATNRKFSRMSLGGVRDEAEIRGHRRTYIGSLPGKIVQNLSKVGKRNPLFLLDEVDKMAMDFRGDPASALLEVLDPEQNYSFNDHYLEVDFDLSDVMFVCTANTMNIPEPLLDRMEVIRLPGYTEQEKVAITKRHLLPKQMKANGLRKGELDLKDSAMRDIIRHYTREAGVRNLEREVATICRKVVKGLVEDEAKKRQSKGVQVTSRNLDKYLGVRRYRYGRAESEDRVGLATGLAWTEVGGELLTIEVAVVPGKGKATHTGQLGEVMKESIDAAMTVVRSRARTLGIQPEFYAQHDYHIHVPEGAIPKDGPSAGIGMCVALVSSLTGIPVRASVGMTGEITLRGEVLPIGGLKEKLLAALRGGIETVLIPAENEKDLADVPKEVKSKLDIRCVRWIDEVFDVALLQRPEPLAEESVSDEDETSQRSKVSENGSVRPH >NZ_AP017372.2|WP_096407978.1|551639_551912_+|HU-family-DNA-binding-protein MNKSELIEAVADSADLSKAAASRAVDAMVESITDALKEGDQVTLVGFGTFSVRERAARTGRNPQTGETIEIPASKVPGFKPGKALKDAVN |
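
The neighbor-protein records follow a header pattern like `>NZ_AP017372.2|WP_096407957.1|539517_540375_-|product-name`. As a minimal sketch, the headers can be pulled out of a flattened cell with a regular expression. Reading the third field as `start_end_strand` is an assumption based on the values shown, and `NeighborProtein` and `parse_neighbor_headers` are hypothetical names.

```python
import re
from typing import List, NamedTuple


class NeighborProtein(NamedTuple):
    contig: str      # versioned contig accession, e.g. "NZ_AP017372.2"
    protein_id: str  # e.g. "WP_096407957.1"
    start: int       # assumed: gene start coordinate
    end: int         # assumed: gene end coordinate
    strand: str      # "+" or "-"
    product: str     # hyphen-joined product name, kept as written


# Matches only protein headers; spacer and array headers lack the
# start_end_strand field and are therefore skipped.
HEADER = re.compile(r">([^|\s]+)\|([^|\s]+)\|(\d+)_(\d+)_([+-])\|(\S+)")


def parse_neighbor_headers(cell: str) -> List[NeighborProtein]:
    """Extract neighbor-protein headers from a flattened table cell."""
    return [
        NeighborProtein(contig, pid, int(start), int(end), strand, product)
        for contig, pid, start, end, strand, product in HEADER.findall(cell)
    ]


cell = (
    ">NZ_AP017372.2|WP_096407955.1|539069_539450_-|PilZ-domain-containing-protein MSANERRHF "
    ">NZ_AP017372.2|WP_162549320.1|541344_542148_+|phosphodiesterase MLCPDEPLRV"
)
proteins = parse_neighbor_headers(cell)
for p in proteins:
    print(p.protein_id, p.start, p.end, p.strand, p.product)
```

Keeping the product string unmodified avoids mangling names that legitimately contain hyphens or slashes, such as `Hsp20/alpha-crystallin-family-protein`.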
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_4 | 760082-760660 | TypeI-E |
I-E
Consensus repeat of NZ_AP017372_4
|
9 spacers
spacers of NZ_AP017372_4
>4.1|760111|32|NZ_AP017372|CRISPRCasFinder,CRT CAGCGACAATTAACCGGCATTCCTGGCAAAAT >4.2|760172|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR CTCCGACGCTGCTCTCCTCAGCTTCGGCTTGG >4.3|760233|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR GTAAGTACCCCGACGCGGAGCCGTCGCACTAC >4.4|760294|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR CGCGCTAATGCACTGCTGGATTTGCAAACTGA >4.5|760355|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR GAGCTTAGACGATGTGATGCGCGCTAATGCGC >4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR CGCGTCTTTAGCCGCCGCCTCTGCGCCTTCTT >4.7|760477|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR TCTCGGCTAACGTTTTCCTCATGCCTCGATCG >4.8|760538|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR AGCGGCACGTTTACCATGCCCGAAGATGAAAT >4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR CAATCTTAGCACTGTCAAGATCGACGGACTGG |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,c2c9_V-U4 |
CRISPR arrays and Neighbor proteins around NZ_AP017372_4
The CRISPR arrays of NZ_AP017372_4 >merge|NZ_AP017372|4|760082-760660|CRISPRCasFinder,CRT,PILER-CR GCGTTCCCCGCGCCTGCGGGGATGAACCGCAGCGACAATTAACCGGCATTCCTGGCAAAATGCGTTCCCCGCGCCTGCGGGGATGAACCGCTCCGACGCTGCTCTCCTCAGCTTCGGCTTGGGCGTTCCCCGCGCCTGCGGGGATGAACCGGTAAGTACCCCGACGCGGAGCCGTCGCACTACGCGTTCCCCGCGCCTGCGGGGATGAACCGCGCGCTAATGCACTGCTGGATTTGCAAACTGAGCGTTCCCCGCGCCTGCGGGGATGAACCGGAGCTTAGACGATGTGATGCGCGCTAATGCGCGCGTTCCCCGCGCCTGCGGGGATGAACCGCGCGTCTTTAGCCGCCGCCTCTGCGCCTTCTTGCGTTCCCCGCGCCTGCGGGGATGAACCGTCTCGGCTAACGTTTTCCTCATGCCTCGATCGGCGTTCCCCGCGCCTGCGGGGATGAACCGAGCGGCACGTTTACCATGCCCGAAGATGAAATGCGTTCCCCGCGCCTGCGGGGATGAACCGCAATCTTAGCACTGTCAAGATCGACGGACTGGGCGTTCCCCGCGCCTGCGGGGATGAACCGT >NZ_AP017372|4|3|760082-760660|CRISPRCasFinder GCGTTCCCCGCGCCTGCGGGGATGAACCG CAGCGACAATTAACCGGCATTCCTGGCAAAAT GCGTTCCCCGCGCCTGCGGGGATGAACCG CTCCGACGCTGCTCTCCTCAGCTTCGGCTTGG GCGTTCCCCGCGCCTGCGGGGATGAACCG GTAAGTACCCCGACGCGGAGCCGTCGCACTAC GCGTTCCCCGCGCCTGCGGGGATGAACCG CGCGCTAATGCACTGCTGGATTTGCAAACTGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GAGCTTAGACGATGTGATGCGCGCTAATGCGC GCGTTCCCCGCGCCTGCGGGGATGAACCG CGCGTCTTTAGCCGCCGCCTCTGCGCCTTCTT GCGTTCCCCGCGCCTGCGGGGATGAACCG TCTCGGCTAACGTTTTCCTCATGCCTCGATCG GCGTTCCCCGCGCCTGCGGGGATGAACCG AGCGGCACGTTTACCATGCCCGAAGATGAAAT GCGTTCCCCGCGCCTGCGGGGATGAACCG CAATCTTAGCACTGTCAAGATCGACGGACTGG GCGTTCCCCGCGCCTGCGGGGATGAACCGT >NZ_AP017372|4|1|760082-760659|CRT GCGTTCCCCGCGCCTGCGGGGATGAACCG CAGCGACAATTAACCGGCATTCCTGGCAAAAT GCGTTCCCCGCGCCTGCGGGGATGAACCG CTCCGACGCTGCTCTCCTCAGCTTCGGCTTGG GCGTTCCCCGCGCCTGCGGGGATGAACCG GTAAGTACCCCGACGCGGAGCCGTCGCACTAC GCGTTCCCCGCGCCTGCGGGGATGAACCG CGCGCTAATGCACTGCTGGATTTGCAAACTGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GAGCTTAGACGATGTGATGCGCGCTAATGCGC GCGTTCCCCGCGCCTGCGGGGATGAACCG CGCGTCTTTAGCCGCCGCCTCTGCGCCTTCTT GCGTTCCCCGCGCCTGCGGGGATGAACCG TCTCGGCTAACGTTTTCCTCATGCCTCGATCG GCGTTCCCCGCGCCTGCGGGGATGAACCG AGCGGCACGTTTACCATGCCCGAAGATGAAAT GCGTTCCCCGCGCCTGCGGGGATGAACCG CAATCTTAGCACTGTCAAGATCGACGGACTGG GCGTTCCCCGCGCCTGCGGGGATGAACCG >NZ_AP017372|4|2|760143-760659|PILER-CR 
GCGTTCCCCGCGCCTGCGGGGATGAACCG CTCCGACGCTGCTCTCCTCAGCTTCGGCTTGG GCGTTCCCCGCGCCTGCGGGGATGAACCG GTAAGTACCCCGACGCGGAGCCGTCGCACTAC GCGTTCCCCGCGCCTGCGGGGATGAACCG CGCGCTAATGCACTGCTGGATTTGCAAACTGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GAGCTTAGACGATGTGATGCGCGCTAATGCGC GCGTTCCCCGCGCCTGCGGGGATGAACCG CGCGTCTTTAGCCGCCGCCTCTGCGCCTTCTT GCGTTCCCCGCGCCTGCGGGGATGAACCG TCTCGGCTAACGTTTTCCTCATGCCTCGATCG GCGTTCCCCGCGCCTGCGGGGATGAACCG AGCGGCACGTTTACCATGCCCGAAGATGAAAT GCGTTCCCCGCGCCTGCGGGGATGAACCG CAATCTTAGCACTGTCAAGATCGACGGACTGG GCGTTCCCCGCGCCTGCGGGGATGAACCG
>NZ_AP017372.2|WP_096408413.1|759667_759967_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MAMLVVVTEAVPPRLRGRLAIWLLEVRAGVYVGDVNRRVREMIWEQVNALVEDGNVVMAWSSRHESGFEFQTCGKNRRVPVDYEGLRLVRFAPDPEAEG >NZ_AP017372.2|WP_096408410.1|758738_759665_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTTEFVPLKPIPIKDRVSMIFVGRGQLDVRDGAFVVVDEVNGERMHIPVGSVACLLLEPGARISHAAVKLAATVGTLLIWVGEAGVRLYSAGQPGGARSDKLLYQARLALDEKLRLKVVRRMYALRFQEEPPERRSVEQLRGIEGARVRKMYKVLAQKYGVEWKGRSYDPNEWDNADPVNKCLSAATSCLYGVCEAAILAAGYAPAIGFLHTGKPQSFVYDVADIVKFETVVPAAFRVAAQNPAQPDRAVRIACRDSFRDTHVLQRLIPLIEDLLEAGGIDPPPPAPEAQPPAIPEPKSIGDHGHRSK >NZ_AP017372.2|WP_096408408.1|758032_758734_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MFLSRVHINPQALTPKNLMPVLEGDSYRNHQLLWRLFTEEDERPFLFRQEFEHSFDSSSGKPRGLPLFYVLSRVEPQADSELFSCEVKSFEPKLSAGQQLAFKLRANPVVAKREEGRKNSRHHDVLMDAKRAAKDNGVTDKVAIRCYMDEAAQSWLANKGRSEKAGYTLQSAPEVSGYQQHVHRRKGRDIRFSSVDFQGILTVNDPERFAQSLAEGIGRSRAFGCGMWMVRRV >NZ_AP017372.2|WP_096408405.1|757295_758033_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MNYLVFRLYGPLASWGEAAVGPTRPSASYPGRSAILGLLAAALGIRREEEATLAQLRDNVTLAVKQCSAGTLLRDYHTAQVPSHDKKAVWLTRRDELGVAKDKLNTILSAREYRSDGYWVVAIRLSDEAPWTLDEMAEALRHPRFMLYLGRKSCPLAAPLHPRVVSAGGVREALSEEFPGFTGSKMEDDEKRRLGIDAEVSFAWEGDAGDILPQETRYPYDEPLHRGRWQFASRSEHWHQTREES >NZ_AP017372.2|WP_096408403.1|756247_757285_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MSTFIQLHLLTSYPPANLNRDDLGRPKTARMGGVDRLRVSSQSLKRTWRTSELFEDALVGHVGTRTKRLGTEVYEALTGAGIAEKKSLEWARAIANVFGKIQKSGTEIEQLAHLSPEERQGVDELVATLIQEQRAPTEDELKLLRKNPHAADIGLFGRMLAAHPAFNVEAACQVAHAITVHPVAVEDDYFTAVDDLNFGEEDMGAGHIGETGFAAGLFYSYVCINRDQLIDNLSGDVELADKAIAALTEAAVKVSPKGKQNSFGSRAYASYVLVEKGRQQPRSLSVAFLKPVYGQDQAGTAIKALEGQRESFEKVYGPCAEGHYVLNAVAGEGSLDELKAFLVQN >NZ_AP017372.2|WP_096408400.1|755613_756222_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSRSNINYQVLREAEARSSVYQWWQRVSRAVEADGEGGLPAFSTAVRPALRRAKTPDDALLTEGFRLLWFAVPDNLKAPRNMPALGCVAAVLAEVREMDQQKSFAAAMGSQVEKTGKPRVSELRFQQLQQSHDLEELQRRLRRAVALLGKKVHVLSLADNIMQWHREKSGHPDYRPDRRLPVRWATDYFTELASYQKAAATN 
>NZ_AP017372.2|WP_096408398.1|753976_755617_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MNLIDEPWLPFRLRSGAIEYGPPCELAREDVVDLAPPRADFHGAAWQFLIGLLQTTCPPDDLEEWQAWWADPPTAEQLQEHFARVRHAFNAFGDAPLFMQELDPMEDARSASVASLLIEAPGDQGIKFNTDHFIKRGFGEAMCPRCASLALFTMQVNAPAGGSGYRTGLRGGGPLTTLVLPDDSQAPLWQKLWLNVLNADDLGGGEPDFTDGSVFPWLAATRVSKQAGTEITPEEVHPLHAYWAMPRRFRMHKEEAECRCQVCGAETTEVVREVRAKNYGHNYGGAWVHPLTPYRQDPKKPDEPPLSTKGQQGGLGYRHWEALVLEDTRNHQNLPARVVLDYQEKAEALRDFGSVSQHARLWVFGYDMDNMKARGWYATYMPLLAIPKEQGLRDRFLEWIDAMVQAASDAAWLLRSTVKSAWYSRPKDASGDFSFIDQRFWEGTESAFYSHLHQLAERLPEQDGAFMPEDVARRWHMTLYETALELFDELSLAGDAEALDMKRIVAARNELGKRLWRNKTMKTLRTWAGMEEGVGKSKDKAAKEEA >NZ_AP017372.2|WP_096408395.1|750875_753662_+|CRISPR-associated-helicase/endonuclease-Cas3 MESLPAYFRYWAKIPKERGFGWDACHLLPYHALDVAATGKYLLDSDEELLERFSAAVQMAPDVFRRLLVFSLALHDLGKFARSFQSLAAIDGVDLVEPDPRYVYRSRHDALALAYWKHYGQECLRNPETGNEWLDAPSELTGRQSLAFWLSVAFGHHGKPVDMEKAALDLAFSPEDKAAAWGFVEDAAALLEPSFPHAQLSDKHWRDHVLKPASWELAGFGVLADWLGSDQSVFGHRAESMPLATYWHEYALPGAEQVVERSGLRGHKEMVAFPGFSQMFGFEPAPLQSWAESVPLADGPQLFLLEDITGAGKTEAALTLAHRLLAAGHRNGVYFALPTQATSNAMYTRVGAVYRDFYSRDSQPSLVLAHGARQLRDDFTRSILPEMAPDTPYTPDDEGGLAQCSQWLADSRKKALLADVGVGTVDQALLGVLPRRHQSLRLLGLARKVLVVDEVHAYDTYTGTLLERLLEAHARHGGSAILLSATIPQSMRRRFLEAWQRGREGGQALQPASEAFPLATHLYSEGLDETPVAARTSSERDLPVDFVHSEEEALSRVVEAARSGRCACWIRNTVDDAIGAYQALRESLPEPDKALLFHARFTMGDRQRIENDALRLFGKESGNAERAGRVLIATQVVEQSLDLDFDVLVSDLAPVELLIQRAGRLYRHARTPDGDLLLSGTDQRESPVFHVHAPEWNDEPDAEWVRRALIGTSYVYPDFGMLWLTMRVLRERGAIRLPAEARLLLEAVYAPEVDVPEGLQRASDEALAEQLSHRSMAGFNVLDLSKGYSGKSVEGGWSDDEEIGTRLSDEPSVQVVLVRVDENQRVKPWNSDTAHPWAMSTVQLRKSQADRLPSLPEELGHEIELLREEVRSLRYARFWLPADERAANHAAYDSLLGAVIPRKGGEQEASTVGTVPHSGSSSNENEEH >NZ_AP017372.2|WP_096408393.1|750180_750744_-|nucleoside-deaminase MYIPEFNITLPGWLHEMLSGELQQLPGDEAQMRFVISLAIENIRQESGGPFAAAVFDSSGNLLAPGLNLVTSLHCSILHAEIIALALAQQRIGSHDLSDAGRSHHTLVTSAEPCAMCLGAIPWSGVSRVVFGALDADVREIGFDEGTKPDHWKEALATRGIEVRGEVLRSEAARLLQAYSEKGGPLY >NZ_AP017372.2|WP_096408390.1|749927_750140_+|hypothetical-protein 
MFVFTLAIIIMAALALLSGIAILFYSRSGNSTSSGREFSMAVFFVSALNFVSNSLVFGVVLGVNAMVGFY >NZ_AP017372.2|WP_096408415.1|763849_764542_+|DUF4338-domain-containing-protein MLRLHEQGKITLPPSRLRKRRRRATFPPTPATDPQPLLNTPVNMMPKPTFHIVQGNAAQSRCWNEYIARYHYLGYTPLDGHQIRYNVYAGEQLVALLGFGASAWKLADRERFIGWSSEQRERNLSLVVNNTRFLILPWVQVRGLASKILGLAARQLPLDWQQRYGFQPVLLETFVEWPRHTGTCYKAANWQWVGRTTGRGKKSTSHKQRLPTKDIWLYPLRRDFANRLCS >NZ_AP017372.2|WP_096408418.1|764624_765014_+|type-II-toxin-antitoxin-system-VapC-family-toxin MDIVADTNIFLAVALNEPDRDRIITLTADASALAPEILPYEIGNALSAMVKRRQLSYSEALEAEKSVRRIPVRLVSTDIRSSLQLALDQDIYAYDAYFLQCAQALSCPLLTLDRRMRQVARELGIRVLE >NZ_AP017372.2|WP_162549345.1|765234_765834_-|hypothetical-protein MFLNPPPNMYGFWQPTTAELPIDDWARHEFAHARCGDRRLQERLITVARDFAAHSQADTPEACGTRARTKAAYRFLANPRASMQQLIRSHAQASAGRCRHHDVVLAVQDTTTLNYSAPTITEGLGPIGSRADGAQGLIVHDTMAFSTEGTPLGLIDVYAWARHCEDRGLRRLSGDCYLPYRSNVANQPREERASAEWPC >NZ_AP017372.2|WP_096408425.1|766204_767125_+|DUF1016-family-protein MPRYWVIAPIDSQPADFFEKVWRFDIEKEVISIGWSQFGDVSGMSRDELAKVVAHHYPEKPQQTKGLITNMVWSFCHKIEPGDVVIARRGRKILAAVGTVREKAFYKAGKNPDVDHRLFLPVTWHQEPRDKDFGAVVFPMPTLAEIDETQYQSLVEGSGLEVAKSEDGETYENQAEFVLEKYLEEFIVSNFSGIFKGELEVYVDEDGNTGQQYTTDIGSIDILAEDRRNNSLVVIELKKGRPSDQVVGQIMRYMGWVKKNLALEDQKVRGLVICRGEDQRLSYALEMVDHVDIRYYKVSFSLTERP >NZ_AP017372.2|WP_096408428.1|767487_767928_+|NfeD-family-protein MISAWNIWLASAIGLLLVDLLLFGGASGVLLAMAGMALFGMGAALLGLSWEFQILSAALSGVLLIPLALKALKKLTPGELSQSLDDPRLRGQQFKVYTDSGGQARVTVFGDEFMARPSSIDQSLKDGSLVRIVRFEGNTAIVTPND >NZ_AP017372.2|WP_096410315.1|767972_768938_+|paraslipin MTTLLIVLLAVLLIIIIIKGLVIVPQRHAMVIERLGRYHRTLNAGLNLIIPILDQPRPITIVRYRDNQKTINTEKKIDLREVVLDFPKQEVITKDNVGVRIDGVLYYQIMDAQAAIYGAENLVLAVQTLAQTSLRSEIGRMELDQIFESRQEINARLQNTMDDAGNKWGVKVNRVEIRDIDIPDDIREAMNKQMAAERARRAEVREAEGYKQAEILKAEGDKEAAVQRAEGEKRAIQQILEAAAGTEGLEARDAMRYLIAQEYMETLPKVAQEGERVFIPLEATSLMGSVGGIRELLGPTTGAAAASSSSSSGAGSGGSGG >NZ_AP017372.2|WP_096408430.1|769016_769352_+|transposase 
MVRFGAFHQLLKLKAEEAGAWAVEAPTRQIKPSQTCHACGQQEKKPLSQRWHSCPCGTSCSRDENAARVLLAWLERSLSGREPADAWREVRPGHPLDESALPSKRETHAVA >NZ_AP017372.2|WP_096408433.1|769400_769673_+|HigA-family-addiction-module-antidote-protein MLVEEFLRPMQITQRELADAIHVPYQRVNELVNQKRGITPSTALRLARFFGVSADFWLNLQVRWDLYKTQQVEKDELAEIQDVTHWQKMA >NZ_AP017372.2|WP_096408436.1|770279_771059_-|hypothetical-protein MEVASAFVAWFYDILAFFGYTHPVHPIFVHITIGLVVAAMVFALIALVPQYNRYAITARDCVTFAFISAVPTMLVGLMDWVHYFGGHLSSLFKIKITLALILIPLLGLAVYLHSKLNIRSILLHIVYLAGFVNIVLLGYYGGELIHASATPHAETAADEDPDRDPDAVTYSQVSRIMQNQCVHCHSRHNDLGGLDLSSYDALMEGGDSGAVVEPGEPQESLLVLMLDGSEEPLMPLGGPELPQSDIDTISKWVEKGAER >NZ_AP017372.2|WP_096408438.1|771076_771571_-|SsrA-binding-protein-SmpB MTAVSKKAGKSKAGGGNVIAVNRKAGFDYFIEERLEAGLALEGWEVKSMREKRVNLTESYVLVRRGEAWLVGCNITPLSTASTHIRPDPTRTRKLLLHRREISRLAGSVDRAGYTVVPLQLYWKRGKAKLEIGLAKGKQKQDKRADKKEKDWQRQRERLLKHKV |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_5 | 761958-763756 | TypeI-E |
I-E
Consensus repeat of NZ_AP017372_5
|
29 spacers
spacers of NZ_AP017372_5
>5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT CGGACTCGACCTCCTCCATCGAGCCGTAACTC >5.2|762048|32|NZ_AP017372|CRISPRCasFinder,CRT GTATGAGCGACAGCAATCTTAGCACTGTCAAA >5.3|762109|32|NZ_AP017372|CRISPRCasFinder,CRT GCACAGACGACTACTGAATCTTACGGTTTCCA >5.4|762170|32|NZ_AP017372|CRISPRCasFinder,CRT ATAATGCAAGACATGCTAACCGATGCAAATCC >5.5|762231|32|NZ_AP017372|CRISPRCasFinder,CRT ATATCCTTAACGCCCTCCGAGACTACTAACCA >5.6|762292|32|NZ_AP017372|CRISPRCasFinder,CRT GTCGGGGCTGTCTTAGTGAGGCCCGACCGGAC >5.7|762353|32|NZ_AP017372|CRISPRCasFinder,CRT TACCCTCATCTTCCGATGAGACTAACATATCC >5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT GACGACGAAACCATTCGCGCTAGCGAAGAATA >5.9|762475|32|NZ_AP017372|CRISPRCasFinder,CRT AGTTGGGTGCTGAGCTTGTCCCTGCAATGCTT >5.10|762536|32|NZ_AP017372|CRISPRCasFinder,CRT CGTTCAGCTGCTCGCGGACACGCTCTTCGTCA >5.11|762597|32|NZ_AP017372|CRISPRCasFinder,CRT CTGCCCATGGAATATGAGCCGGATCGCCATTG >5.12|762658|32|NZ_AP017372|CRISPRCasFinder,CRT TCCCGCGTCTATACCGACAAGAGTTTGGGCGC >5.13|762719|32|NZ_AP017372|CRISPRCasFinder,CRT TTTGCCGCGGAACACCCTGAAGCCACGCCAGA >5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT ATAACCGGCGGCGGTGAGCCGTCAGATGAGTG >5.15|762841|32|NZ_AP017372|CRISPRCasFinder,CRT AGTTTAGACCCGAGCGAGTACGGACAGCAGGC >5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT GTTGAGTTGCAAACCACCGACCTGCCTACAGA >5.17|762963|33|NZ_AP017372|CRISPRCasFinder,CRT TTCGCCGGTAGAAAGCTGATTTTCAAGCGCGAC >5.18|763025|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR TATCACTGGTTGTACGGCGCACCGCTGCTTGC >5.19|763086|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR ATTACGGCACGGGGCGATCAGGGAAACGGGTC >5.20|763147|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR GCGCAACAGTTCATCACCGTATGACGTGTACG >5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR CCCTGGCGCCCGGACGATGCCCGTGTCTATCA >5.22|763269|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR ATGAACCGATCACCAGCCTTGTCCCACGGCAA >5.23|763330|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR TCCTTGAGTCTTTGTGGAGATACACTAATGGA >5.24|763391|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR GCGATGGAGCTGTTTGGCGCGCGCTACTTTAG 
>5.25|763452|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR GGCCGATGGCAAGTGCGGATAGAGGATTTGAG >5.26|763513|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR AAAGGCTGGTTAGGTGGCATCAGAGCCATTAA >5.27|763574|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR TCCGGTAGGGGCATAGGACGTAAAGCGAACCC >5.28|763635|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR TGTTCAGCACAGCCTTGTTGCTTGAACTCTCG >5.29|763696|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR TCATCGGTAGTCATTAAATCTGCTACTCGTAT >5.30|762169|33|NZ_AP017372|PILER-CR GATAATGCAAGACATGCTAACCGATGCAAATCC >5.31|762230|33|NZ_AP017372|PILER-CR GATATCCTTAACGCCCTCCGAGACTACTAACCA >5.32|762291|33|NZ_AP017372|PILER-CR GGTCGGGGCTGTCTTAGTGAGGCCCGACCGGAC >5.33|762352|33|NZ_AP017372|PILER-CR GTACCCTCATCTTCCGATGAGACTAACATATCC >5.34|762413|33|NZ_AP017372|PILER-CR GGACGACGAAACCATTCGCGCTAGCGAAGAATA >5.35|762474|33|NZ_AP017372|PILER-CR GAGTTGGGTGCTGAGCTTGTCCCTGCAATGCTT >5.36|762535|33|NZ_AP017372|PILER-CR GCGTTCAGCTGCTCGCGGACACGCTCTTCGTCA >5.37|762596|33|NZ_AP017372|PILER-CR GCTGCCCATGGAATATGAGCCGGATCGCCATTG >5.38|762657|33|NZ_AP017372|PILER-CR ATCCCGCGTCTATACCGACAAGAGTTTGGGCGC >5.39|762718|33|NZ_AP017372|PILER-CR GTTTGCCGCGGAACACCCTGAAGCCACGCCAGA >5.40|762779|33|NZ_AP017372|PILER-CR GATAACCGGCGGCGGTGAGCCGTCAGATGAGTG >5.41|762840|33|NZ_AP017372|PILER-CR GAGTTTAGACCCGAGCGAGTACGGACAGCAGGC |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,c2c9_V-U4 |
CRISPR arrays and Neighbor proteins around NZ_AP017372_5
The CRISPR arrays of NZ_AP017372_5 >merge|NZ_AP017372|5|761958-763756|CRISPRCasFinder,CRT,PILER-CR,PILER-CR GCGTTCCCCGCGCCTGCGGGGATGAACCTCGGACTCGACCTCCTCCATCGAGCCGTAACTCGCGTTCCCCGCGCCTGCGGGGATGAACCGGTATGAGCGACAGCAATCTTAGCACTGTCAAAGCGTTCCCCGCGCCTGCGGGGATGAACCGGCACAGACGACTACTGAATCTTACGGTTTCCAGCGTTCCCCGCGCCTGCGGGGATGAACCGATAATGCAAGACATGCTAACCGATGCAAATCCGCGTTCCTCGCGCCTGCGGGGATGAACCGATATCCTTAACGCCCTCCGAGACTACTAACCAGCGTTCCCCGCGCCTGCGGGGATGAACCGGTCGGGGCTGTCTTAGTGAGGCCCGACCGGACGCGTTCCCCGCGCCTGCGGGGATGAACCGTACCCTCATCTTCCGATGAGACTAACATATCCGCGTTCCCCGCGCCTGCGGGGATGAACCGGACGACGAAACCATTCGCGCTAGCGAAGAATAGCGTTCCCCGCGCCTGCGGGGATGAACCGAGTTGGGTGCTGAGCTTGTCCCTGCAATGCTTGCGTTCCCCGCGCCTGCGGGGATGAACCGCGTTCAGCTGCTCGCGGACACGCTCTTCGTCAGCGTTCCCCGCGCCTGCGGGGATGAACCGCTGCCCATGGAATATGAGCCGGATCGCCATTGGCGTTCCCCGCGCCTGCGGGGATGAACCATCCCGCGTCTATACCGACAAGAGTTTGGGCGCGCGTTCCCCGCGCCTGCGGGGATGAACCGTTTGCCGCGGAACACCCTGAAGCCACGCCAGAGCGTTCCCCGCGCCTGCGGGGATGAACCGATAACCGGCGGCGGTGAGCCGTCAGATGAGTGGCGTTCCCCGCGCCTGCGGGGATGAACCGAGTTTAGACCCGAGCGAGTACGGACAGCAGGCGCGTTCCCCGCGCCTGCGGGGATGAACCGGTTGAGTTGCAAACCACCGACCTGCCTACAGAGCGTTCCCCGCGCCTGCGGGGATGAACCGTTCGCCGGTAGAAAGCTGATTTTCAAGCGCGACGCGTTCCCCGCGCCTGCGGGGATGAACCGTATCACTGGTTGTACGGCGCACCGCTGCTTGCGCGTTCCCCGCGCCTGCGGGGATGAACCGATTACGGCACGGGGCGATCAGGGAAACGGGTCGCGTTCCCCGCGCCTGCGGGGATGAACCGGCGCAACAGTTCATCACCGTATGACGTGTACGGCGTTCCCCGCGCCTGCGGGGATGAACCGCCCTGGCGCCCGGACGATGCCCGTGTCTATCAGCGTTCCCCGCGCCTGCGGGGATGAACCGATGAACCGATCACCAGCCTTGTCCCACGGCAAGCGTTCCCCGCGCCTGCGGGGATGAACCGTCCTTGAGTCTTTGTGGAGATACACTAATGGAGCGTTCCCCGCGCCTGCGGGGATGAACCGGCGATGGAGCTGTTTGGCGCGCGCTACTTTAGGCGTTCCCCGCGCCTGCGGGGATGAACCGGGCCGATGGCAAGTGCGGATAGAGGATTTGAGGCGTTCCCCGCGCCTGCGGGGATGAACCGAAAGGCTGGTTAGGTGGCATCAGAGCCATTAAGCGTTCCCCGCGCCTGCGGGGATGAACCGTCCGGTAGGGGCATAGGACGTAAAGCGAACCCGCGTTCCCCGCGCCTGCGGGGATGAACCGTGTTCAGCACAGCCTTGTTGCTTGAACTCTCGGCGTTCCCCGCGCCTGCGGGGATGAACCGTCATCGGTAGTCATTAAATCTGCTACTCGTATGCGTTCCCCGCGCCTGCGGGGATGAACCG >NZ_AP017372|5|4|761958-763756|CRISPRCasFinder GCGTTCCCCGCGCCTGCGGGGATGAACCT 
CGGACTCGACCTCCTCCATCGAGCCGTAACTC GCGTTCCCCGCGCCTGCGGGGATGAACCG GTATGAGCGACAGCAATCTTAGCACTGTCAAA GCGTTCCCCGCGCCTGCGGGGATGAACCG GCACAGACGACTACTGAATCTTACGGTTTCCA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATAATGCAAGACATGCTAACCGATGCAAATCC GCGTTCCTCGCGCCTGCGGGGATGAACCG ATATCCTTAACGCCCTCCGAGACTACTAACCA GCGTTCCCCGCGCCTGCGGGGATGAACCG GTCGGGGCTGTCTTAGTGAGGCCCGACCGGAC GCGTTCCCCGCGCCTGCGGGGATGAACCG TACCCTCATCTTCCGATGAGACTAACATATCC GCGTTCCCCGCGCCTGCGGGGATGAACCG GACGACGAAACCATTCGCGCTAGCGAAGAATA GCGTTCCCCGCGCCTGCGGGGATGAACCG AGTTGGGTGCTGAGCTTGTCCCTGCAATGCTT GCGTTCCCCGCGCCTGCGGGGATGAACCG CGTTCAGCTGCTCGCGGACACGCTCTTCGTCA GCGTTCCCCGCGCCTGCGGGGATGAACCG CTGCCCATGGAATATGAGCCGGATCGCCATTG GCGTTCCCCGCGCCTGCGGGGATGAACCA TCCCGCGTCTATACCGACAAGAGTTTGGGCGC GCGTTCCCCGCGCCTGCGGGGATGAACCG TTTGCCGCGGAACACCCTGAAGCCACGCCAGA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATAACCGGCGGCGGTGAGCCGTCAGATGAGTG GCGTTCCCCGCGCCTGCGGGGATGAACCG AGTTTAGACCCGAGCGAGTACGGACAGCAGGC GCGTTCCCCGCGCCTGCGGGGATGAACCG GTTGAGTTGCAAACCACCGACCTGCCTACAGA GCGTTCCCCGCGCCTGCGGGGATGAACCG TTCGCCGGTAGAAAGCTGATTTTCAAGCGCGAC GCGTTCCCCGCGCCTGCGGGGATGAACCG TATCACTGGTTGTACGGCGCACCGCTGCTTGC GCGTTCCCCGCGCCTGCGGGGATGAACCG ATTACGGCACGGGGCGATCAGGGAAACGGGTC GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGCAACAGTTCATCACCGTATGACGTGTACG GCGTTCCCCGCGCCTGCGGGGATGAACCG CCCTGGCGCCCGGACGATGCCCGTGTCTATCA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATGAACCGATCACCAGCCTTGTCCCACGGCAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCTTGAGTCTTTGTGGAGATACACTAATGGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGATGGAGCTGTTTGGCGCGCGCTACTTTAG GCGTTCCCCGCGCCTGCGGGGATGAACCG GGCCGATGGCAAGTGCGGATAGAGGATTTGAG GCGTTCCCCGCGCCTGCGGGGATGAACCG AAAGGCTGGTTAGGTGGCATCAGAGCCATTAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCGGTAGGGGCATAGGACGTAAAGCGAACCC GCGTTCCCCGCGCCTGCGGGGATGAACCG TGTTCAGCACAGCCTTGTTGCTTGAACTCTCG GCGTTCCCCGCGCCTGCGGGGATGAACCG TCATCGGTAGTCATTAAATCTGCTACTCGTAT GCGTTCCCCGCGCCTGCGGGGATGAACCG >NZ_AP017372|5|2|761958-763756|CRT GCGTTCCCCGCGCCTGCGGGGATGAACCT CGGACTCGACCTCCTCCATCGAGCCGTAACTC GCGTTCCCCGCGCCTGCGGGGATGAACCG GTATGAGCGACAGCAATCTTAGCACTGTCAAA 
GCGTTCCCCGCGCCTGCGGGGATGAACCG GCACAGACGACTACTGAATCTTACGGTTTCCA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATAATGCAAGACATGCTAACCGATGCAAATCC GCGTTCCTCGCGCCTGCGGGGATGAACCG ATATCCTTAACGCCCTCCGAGACTACTAACCA GCGTTCCCCGCGCCTGCGGGGATGAACCG GTCGGGGCTGTCTTAGTGAGGCCCGACCGGAC GCGTTCCCCGCGCCTGCGGGGATGAACCG TACCCTCATCTTCCGATGAGACTAACATATCC GCGTTCCCCGCGCCTGCGGGGATGAACCG GACGACGAAACCATTCGCGCTAGCGAAGAATA GCGTTCCCCGCGCCTGCGGGGATGAACCG AGTTGGGTGCTGAGCTTGTCCCTGCAATGCTT GCGTTCCCCGCGCCTGCGGGGATGAACCG CGTTCAGCTGCTCGCGGACACGCTCTTCGTCA GCGTTCCCCGCGCCTGCGGGGATGAACCG CTGCCCATGGAATATGAGCCGGATCGCCATTG GCGTTCCCCGCGCCTGCGGGGATGAACCA TCCCGCGTCTATACCGACAAGAGTTTGGGCGC GCGTTCCCCGCGCCTGCGGGGATGAACCG TTTGCCGCGGAACACCCTGAAGCCACGCCAGA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATAACCGGCGGCGGTGAGCCGTCAGATGAGTG GCGTTCCCCGCGCCTGCGGGGATGAACCG AGTTTAGACCCGAGCGAGTACGGACAGCAGGC GCGTTCCCCGCGCCTGCGGGGATGAACCG GTTGAGTTGCAAACCACCGACCTGCCTACAGA GCGTTCCCCGCGCCTGCGGGGATGAACCG TTCGCCGGTAGAAAGCTGATTTTCAAGCGCGAC GCGTTCCCCGCGCCTGCGGGGATGAACCG TATCACTGGTTGTACGGCGCACCGCTGCTTGC GCGTTCCCCGCGCCTGCGGGGATGAACCG ATTACGGCACGGGGCGATCAGGGAAACGGGTC GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGCAACAGTTCATCACCGTATGACGTGTACG GCGTTCCCCGCGCCTGCGGGGATGAACCG CCCTGGCGCCCGGACGATGCCCGTGTCTATCA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATGAACCGATCACCAGCCTTGTCCCACGGCAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCTTGAGTCTTTGTGGAGATACACTAATGGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGATGGAGCTGTTTGGCGCGCGCTACTTTAG GCGTTCCCCGCGCCTGCGGGGATGAACCG GGCCGATGGCAAGTGCGGATAGAGGATTTGAG GCGTTCCCCGCGCCTGCGGGGATGAACCG AAAGGCTGGTTAGGTGGCATCAGAGCCATTAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCGGTAGGGGCATAGGACGTAAAGCGAACCC GCGTTCCCCGCGCCTGCGGGGATGAACCG TGTTCAGCACAGCCTTGTTGCTTGAACTCTCG GCGTTCCCCGCGCCTGCGGGGATGAACCG TCATCGGTAGTCATTAAATCTGCTACTCGTAT GCGTTCCCCGCGCCTGCGGGGATGAACCG >NZ_AP017372|5|3|762141-762900|PILER-CR GCGTTCCCCGCGCCTGCGGGGATGAACC GATAATGCAAGACATGCTAACCGATGCAAATCC GCGTTCCTCGCGCCTGCGGGGATGAACC GATATCCTTAACGCCCTCCGAGACTACTAACCA GCGTTCCCCGCGCCTGCGGGGATGAACC GGTCGGGGCTGTCTTAGTGAGGCCCGACCGGAC GCGTTCCCCGCGCCTGCGGGGATGAACC 
GTACCCTCATCTTCCGATGAGACTAACATATCC GCGTTCCCCGCGCCTGCGGGGATGAACC GGACGACGAAACCATTCGCGCTAGCGAAGAATA GCGTTCCCCGCGCCTGCGGGGATGAACC GAGTTGGGTGCTGAGCTTGTCCCTGCAATGCTT GCGTTCCCCGCGCCTGCGGGGATGAACC GCGTTCAGCTGCTCGCGGACACGCTCTTCGTCA GCGTTCCCCGCGCCTGCGGGGATGAACC GCTGCCCATGGAATATGAGCCGGATCGCCATTG GCGTTCCCCGCGCCTGCGGGGATGAACC ATCCCGCGTCTATACCGACAAGAGTTTGGGCGC GCGTTCCCCGCGCCTGCGGGGATGAACC GTTTGCCGCGGAACACCCTGAAGCCACGCCAGA GCGTTCCCCGCGCCTGCGGGGATGAACC GATAACCGGCGGCGGTGAGCCGTCAGATGAGTG GCGTTCCCCGCGCCTGCGGGGATGAACC GAGTTTAGACCCGAGCGAGTACGGACAGCAGGC GCGTTCCCCGCGCCTGCGGGGATGAACCGGTTGAGTTGCAAACCACCGACCTGCCTACAGAGCGTTCCCCGCGCCTGCGGGGATGAACCGTTCGCCGGTAGAAAGCTGATTTTCAAGCGCGACGCGTTCCCCGCGCCTGCGGGGATGAACCG TATCACTGGTTGTACGGCGCACCGCTGCTTGC GCGTTCCCCGCGCCTGCGGGGATGAACCG ATTACGGCACGGGGCGATCAGGGAAACGGGTC GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGCAACAGTTCATCACCGTATGACGTGTACG GCGTTCCCCGCGCCTGCGGGGATGAACCG CCCTGGCGCCCGGACGATGCCCGTGTCTATCA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATGAACCGATCACCAGCCTTGTCCCACGGCAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCTTGAGTCTTTGTGGAGATACACTAATGGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGATGGAGCTGTTTGGCGCGCGCTACTTTAG GCGTTCCCCGCGCCTGCGGGGATGAACCG GGCCGATGGCAAGTGCGGATAGAGGATTTGAG GCGTTCCCCGCGCCTGCGGGGATGAACCG AAAGGCTGGTTAGGTGGCATCAGAGCCATTAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCGGTAGGGGCATAGGACGTAAAGCGAACCC GCGTTCCCCGCGCCTGCGGGGATGAACCG TGTTCAGCACAGCCTTGTTGCTTGAACTCTCG GCGTTCCCCGCGCCTGCGGGGATGAACCG TCATCGGTAGTCATTAAATCTGCTACTCGTAT >NZ_AP017372|5|4|762996-763756|PILER-CR GATAATGCAAGACATGCTAACCGATGCAAATCC GCGTTCCTCGCGCCTGCGGGGATGAACC GATATCCTTAACGCCCTCCGAGACTACTAACCA GCGTTCCCCGCGCCTGCGGGGATGAACC GGTCGGGGCTGTCTTAGTGAGGCCCGACCGGAC GCGTTCCCCGCGCCTGCGGGGATGAACC GTACCCTCATCTTCCGATGAGACTAACATATCC GCGTTCCCCGCGCCTGCGGGGATGAACC GGACGACGAAACCATTCGCGCTAGCGAAGAATA GCGTTCCCCGCGCCTGCGGGGATGAACC GAGTTGGGTGCTGAGCTTGTCCCTGCAATGCTT GCGTTCCCCGCGCCTGCGGGGATGAACC GCGTTCAGCTGCTCGCGGACACGCTCTTCGTCA GCGTTCCCCGCGCCTGCGGGGATGAACC GCTGCCCATGGAATATGAGCCGGATCGCCATTG GCGTTCCCCGCGCCTGCGGGGATGAACC ATCCCGCGTCTATACCGACAAGAGTTTGGGCGC 
GCGTTCCCCGCGCCTGCGGGGATGAACC GTTTGCCGCGGAACACCCTGAAGCCACGCCAGA GCGTTCCCCGCGCCTGCGGGGATGAACC GATAACCGGCGGCGGTGAGCCGTCAGATGAGTG GCGTTCCCCGCGCCTGCGGGGATGAACC GAGTTTAGACCCGAGCGAGTACGGACAGCAGGC GCGTTCCCCGCGCCTGCGGGGATGAACCGGTTGAGTTGCAAACCACCGACCTGCCTACAGAGCGTTCCCCGCGCCTGCGGGGATGAACCGTTCGCCGGTAGAAAGCTGATTTTCAAGCGCGACGCGTTCCCCGCGCCTGCGGGGATGAACCG TATCACTGGTTGTACGGCGCACCGCTGCTTGC GCGTTCCCCGCGCCTGCGGGGATGAACCG ATTACGGCACGGGGCGATCAGGGAAACGGGTC GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGCAACAGTTCATCACCGTATGACGTGTACG GCGTTCCCCGCGCCTGCGGGGATGAACCG CCCTGGCGCCCGGACGATGCCCGTGTCTATCA GCGTTCCCCGCGCCTGCGGGGATGAACCG ATGAACCGATCACCAGCCTTGTCCCACGGCAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCTTGAGTCTTTGTGGAGATACACTAATGGA GCGTTCCCCGCGCCTGCGGGGATGAACCG GCGATGGAGCTGTTTGGCGCGCGCTACTTTAG GCGTTCCCCGCGCCTGCGGGGATGAACCG GGCCGATGGCAAGTGCGGATAGAGGATTTGAG GCGTTCCCCGCGCCTGCGGGGATGAACCG AAAGGCTGGTTAGGTGGCATCAGAGCCATTAA GCGTTCCCCGCGCCTGCGGGGATGAACCG TCCGGTAGGGGCATAGGACGTAAAGCGAACCC GCGTTCCCCGCGCCTGCGGGGATGAACCG TGTTCAGCACAGCCTTGTTGCTTGAACTCTCG GCGTTCCCCGCGCCTGCGGGGATGAACCG TCATCGGTAGTCATTAAATCTGCTACTCGTAT GCGTTCCCCGCGCCTGCGGGGATGAACCG
>NZ_AP017372.2|WP_096408413.1|759667_759967_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MAMLVVVTEAVPPRLRGRLAIWLLEVRAGVYVGDVNRRVREMIWEQVNALVEDGNVVMAWSSRHESGFEFQTCGKNRRVPVDYEGLRLVRFAPDPEAEG >NZ_AP017372.2|WP_096408410.1|758738_759665_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTTEFVPLKPIPIKDRVSMIFVGRGQLDVRDGAFVVVDEVNGERMHIPVGSVACLLLEPGARISHAAVKLAATVGTLLIWVGEAGVRLYSAGQPGGARSDKLLYQARLALDEKLRLKVVRRMYALRFQEEPPERRSVEQLRGIEGARVRKMYKVLAQKYGVEWKGRSYDPNEWDNADPVNKCLSAATSCLYGVCEAAILAAGYAPAIGFLHTGKPQSFVYDVADIVKFETVVPAAFRVAAQNPAQPDRAVRIACRDSFRDTHVLQRLIPLIEDLLEAGGIDPPPPAPEAQPPAIPEPKSIGDHGHRSK >NZ_AP017372.2|WP_096408408.1|758032_758734_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MFLSRVHINPQALTPKNLMPVLEGDSYRNHQLLWRLFTEEDERPFLFRQEFEHSFDSSSGKPRGLPLFYVLSRVEPQADSELFSCEVKSFEPKLSAGQQLAFKLRANPVVAKREEGRKNSRHHDVLMDAKRAAKDNGVTDKVAIRCYMDEAAQSWLANKGRSEKAGYTLQSAPEVSGYQQHVHRRKGRDIRFSSVDFQGILTVNDPERFAQSLAEGIGRSRAFGCGMWMVRRV >NZ_AP017372.2|WP_096408405.1|757295_758033_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MNYLVFRLYGPLASWGEAAVGPTRPSASYPGRSAILGLLAAALGIRREEEATLAQLRDNVTLAVKQCSAGTLLRDYHTAQVPSHDKKAVWLTRRDELGVAKDKLNTILSAREYRSDGYWVVAIRLSDEAPWTLDEMAEALRHPRFMLYLGRKSCPLAAPLHPRVVSAGGVREALSEEFPGFTGSKMEDDEKRRLGIDAEVSFAWEGDAGDILPQETRYPYDEPLHRGRWQFASRSEHWHQTREES >NZ_AP017372.2|WP_096408403.1|756247_757285_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MSTFIQLHLLTSYPPANLNRDDLGRPKTARMGGVDRLRVSSQSLKRTWRTSELFEDALVGHVGTRTKRLGTEVYEALTGAGIAEKKSLEWARAIANVFGKIQKSGTEIEQLAHLSPEERQGVDELVATLIQEQRAPTEDELKLLRKNPHAADIGLFGRMLAAHPAFNVEAACQVAHAITVHPVAVEDDYFTAVDDLNFGEEDMGAGHIGETGFAAGLFYSYVCINRDQLIDNLSGDVELADKAIAALTEAAVKVSPKGKQNSFGSRAYASYVLVEKGRQQPRSLSVAFLKPVYGQDQAGTAIKALEGQRESFEKVYGPCAEGHYVLNAVAGEGSLDELKAFLVQN >NZ_AP017372.2|WP_096408400.1|755613_756222_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSRSNINYQVLREAEARSSVYQWWQRVSRAVEADGEGGLPAFSTAVRPALRRAKTPDDALLTEGFRLLWFAVPDNLKAPRNMPALGCVAAVLAEVREMDQQKSFAAAMGSQVEKTGKPRVSELRFQQLQQSHDLEELQRRLRRAVALLGKKVHVLSLADNIMQWHREKSGHPDYRPDRRLPVRWATDYFTELASYQKAAATN 
>NZ_AP017372.2|WP_096408398.1|753976_755617_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MNLIDEPWLPFRLRSGAIEYGPPCELAREDVVDLAPPRADFHGAAWQFLIGLLQTTCPPDDLEEWQAWWADPPTAEQLQEHFARVRHAFNAFGDAPLFMQELDPMEDARSASVASLLIEAPGDQGIKFNTDHFIKRGFGEAMCPRCASLALFTMQVNAPAGGSGYRTGLRGGGPLTTLVLPDDSQAPLWQKLWLNVLNADDLGGGEPDFTDGSVFPWLAATRVSKQAGTEITPEEVHPLHAYWAMPRRFRMHKEEAECRCQVCGAETTEVVREVRAKNYGHNYGGAWVHPLTPYRQDPKKPDEPPLSTKGQQGGLGYRHWEALVLEDTRNHQNLPARVVLDYQEKAEALRDFGSVSQHARLWVFGYDMDNMKARGWYATYMPLLAIPKEQGLRDRFLEWIDAMVQAASDAAWLLRSTVKSAWYSRPKDASGDFSFIDQRFWEGTESAFYSHLHQLAERLPEQDGAFMPEDVARRWHMTLYETALELFDELSLAGDAEALDMKRIVAARNELGKRLWRNKTMKTLRTWAGMEEGVGKSKDKAAKEEA >NZ_AP017372.2|WP_096408395.1|750875_753662_+|CRISPR-associated-helicase/endonuclease-Cas3 MESLPAYFRYWAKIPKERGFGWDACHLLPYHALDVAATGKYLLDSDEELLERFSAAVQMAPDVFRRLLVFSLALHDLGKFARSFQSLAAIDGVDLVEPDPRYVYRSRHDALALAYWKHYGQECLRNPETGNEWLDAPSELTGRQSLAFWLSVAFGHHGKPVDMEKAALDLAFSPEDKAAAWGFVEDAAALLEPSFPHAQLSDKHWRDHVLKPASWELAGFGVLADWLGSDQSVFGHRAESMPLATYWHEYALPGAEQVVERSGLRGHKEMVAFPGFSQMFGFEPAPLQSWAESVPLADGPQLFLLEDITGAGKTEAALTLAHRLLAAGHRNGVYFALPTQATSNAMYTRVGAVYRDFYSRDSQPSLVLAHGARQLRDDFTRSILPEMAPDTPYTPDDEGGLAQCSQWLADSRKKALLADVGVGTVDQALLGVLPRRHQSLRLLGLARKVLVVDEVHAYDTYTGTLLERLLEAHARHGGSAILLSATIPQSMRRRFLEAWQRGREGGQALQPASEAFPLATHLYSEGLDETPVAARTSSERDLPVDFVHSEEEALSRVVEAARSGRCACWIRNTVDDAIGAYQALRESLPEPDKALLFHARFTMGDRQRIENDALRLFGKESGNAERAGRVLIATQVVEQSLDLDFDVLVSDLAPVELLIQRAGRLYRHARTPDGDLLLSGTDQRESPVFHVHAPEWNDEPDAEWVRRALIGTSYVYPDFGMLWLTMRVLRERGAIRLPAEARLLLEAVYAPEVDVPEGLQRASDEALAEQLSHRSMAGFNVLDLSKGYSGKSVEGGWSDDEEIGTRLSDEPSVQVVLVRVDENQRVKPWNSDTAHPWAMSTVQLRKSQADRLPSLPEELGHEIELLREEVRSLRYARFWLPADERAANHAAYDSLLGAVIPRKGGEQEASTVGTVPHSGSSSNENEEH >NZ_AP017372.2|WP_096408393.1|750180_750744_-|nucleoside-deaminase MYIPEFNITLPGWLHEMLSGELQQLPGDEAQMRFVISLAIENIRQESGGPFAAAVFDSSGNLLAPGLNLVTSLHCSILHAEIIALALAQQRIGSHDLSDAGRSHHTLVTSAEPCAMCLGAIPWSGVSRVVFGALDADVREIGFDEGTKPDHWKEALATRGIEVRGEVLRSEAARLLQAYSEKGGPLY >NZ_AP017372.2|WP_096408390.1|749927_750140_+|hypothetical-protein 
MFVFTLAIIIMAALALLSGIAILFYSRSGNSTSSGREFSMAVFFVSALNFVSNSLVFGVVLGVNAMVGFY >NZ_AP017372.2|WP_096408415.1|763849_764542_+|DUF4338-domain-containing-protein MLRLHEQGKITLPPSRLRKRRRRATFPPTPATDPQPLLNTPVNMMPKPTFHIVQGNAAQSRCWNEYIARYHYLGYTPLDGHQIRYNVYAGEQLVALLGFGASAWKLADRERFIGWSSEQRERNLSLVVNNTRFLILPWVQVRGLASKILGLAARQLPLDWQQRYGFQPVLLETFVEWPRHTGTCYKAANWQWVGRTTGRGKKSTSHKQRLPTKDIWLYPLRRDFANRLCS >NZ_AP017372.2|WP_096408418.1|764624_765014_+|type-II-toxin-antitoxin-system-VapC-family-toxin MDIVADTNIFLAVALNEPDRDRIITLTADASALAPEILPYEIGNALSAMVKRRQLSYSEALEAEKSVRRIPVRLVSTDIRSSLQLALDQDIYAYDAYFLQCAQALSCPLLTLDRRMRQVARELGIRVLE >NZ_AP017372.2|WP_162549345.1|765234_765834_-|hypothetical-protein MFLNPPPNMYGFWQPTTAELPIDDWARHEFAHARCGDRRLQERLITVARDFAAHSQADTPEACGTRARTKAAYRFLANPRASMQQLIRSHAQASAGRCRHHDVVLAVQDTTTLNYSAPTITEGLGPIGSRADGAQGLIVHDTMAFSTEGTPLGLIDVYAWARHCEDRGLRRLSGDCYLPYRSNVANQPREERASAEWPC >NZ_AP017372.2|WP_096408425.1|766204_767125_+|DUF1016-family-protein MPRYWVIAPIDSQPADFFEKVWRFDIEKEVISIGWSQFGDVSGMSRDELAKVVAHHYPEKPQQTKGLITNMVWSFCHKIEPGDVVIARRGRKILAAVGTVREKAFYKAGKNPDVDHRLFLPVTWHQEPRDKDFGAVVFPMPTLAEIDETQYQSLVEGSGLEVAKSEDGETYENQAEFVLEKYLEEFIVSNFSGIFKGELEVYVDEDGNTGQQYTTDIGSIDILAEDRRNNSLVVIELKKGRPSDQVVGQIMRYMGWVKKNLALEDQKVRGLVICRGEDQRLSYALEMVDHVDIRYYKVSFSLTERP >NZ_AP017372.2|WP_096408428.1|767487_767928_+|NfeD-family-protein MISAWNIWLASAIGLLLVDLLLFGGASGVLLAMAGMALFGMGAALLGLSWEFQILSAALSGVLLIPLALKALKKLTPGELSQSLDDPRLRGQQFKVYTDSGGQARVTVFGDEFMARPSSIDQSLKDGSLVRIVRFEGNTAIVTPND >NZ_AP017372.2|WP_096410315.1|767972_768938_+|paraslipin MTTLLIVLLAVLLIIIIIKGLVIVPQRHAMVIERLGRYHRTLNAGLNLIIPILDQPRPITIVRYRDNQKTINTEKKIDLREVVLDFPKQEVITKDNVGVRIDGVLYYQIMDAQAAIYGAENLVLAVQTLAQTSLRSEIGRMELDQIFESRQEINARLQNTMDDAGNKWGVKVNRVEIRDIDIPDDIREAMNKQMAAERARRAEVREAEGYKQAEILKAEGDKEAAVQRAEGEKRAIQQILEAAAGTEGLEARDAMRYLIAQEYMETLPKVAQEGERVFIPLEATSLMGSVGGIRELLGPTTGAAAASSSSSSGAGSGGSGG >NZ_AP017372.2|WP_096408430.1|769016_769352_+|transposase 
MVRFGAFHQLLKLKAEEAGAWAVEAPTRQIKPSQTCHACGQQEKKPLSQRWHSCPCGTSCSRDENAARVLLAWLERSLSGREPADAWREVRPGHPLDESALPSKRETHAVA >NZ_AP017372.2|WP_096408433.1|769400_769673_+|HigA-family-addiction-module-antidote-protein MLVEEFLRPMQITQRELADAIHVPYQRVNELVNQKRGITPSTALRLARFFGVSADFWLNLQVRWDLYKTQQVEKDELAEIQDVTHWQKMA >NZ_AP017372.2|WP_096408436.1|770279_771059_-|hypothetical-protein MEVASAFVAWFYDILAFFGYTHPVHPIFVHITIGLVVAAMVFALIALVPQYNRYAITARDCVTFAFISAVPTMLVGLMDWVHYFGGHLSSLFKIKITLALILIPLLGLAVYLHSKLNIRSILLHIVYLAGFVNIVLLGYYGGELIHASATPHAETAADEDPDRDPDAVTYSQVSRIMQNQCVHCHSRHNDLGGLDLSSYDALMEGGDSGAVVEPGEPQESLLVLMLDGSEEPLMPLGGPELPQSDIDTISKWVEKGAER >NZ_AP017372.2|WP_096408438.1|771076_771571_-|SsrA-binding-protein-SmpB MTAVSKKAGKSKAGGGNVIAVNRKAGFDYFIEERLEAGLALEGWEVKSMREKRVNLTESYVLVRRGEAWLVGCNITPLSTASTHIRPDPTRTRKLLLHRREISRLAGSVDRAGYTVVPLQLYWKRGKAKLEIGLAKGKQKQDKRADKKEKDWQRQRERLLKHKV |
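
The spacer entries in the Spacer_info column (for example `>5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT`) follow a FASTA-like layout. A minimal sketch for reading them is shown here. The field meanings (spacer ID, start coordinate, length, contig, detecting tools) are an assumption inferred from the records themselves, not taken from the database documentation, and the names `Spacer` and `parse_spacers` are hypothetical.

```python
import re
from typing import NamedTuple, Tuple


class Spacer(NamedTuple):
    spacer_id: str          # e.g. "5.1" (array number, spacer number) - assumed meaning
    start: int              # assumed: start coordinate on the contig
    length: int             # assumed: spacer length in nucleotides
    contig: str             # contig accession
    tools: Tuple[str, ...]  # tools that reported the spacer
    sequence: str


# Header pattern: >ID|start|length|contig|tool[,tool...] followed by the sequence.
_RECORD = re.compile(r">(\d+\.\d+)\|(\d+)\|(\d+)\|([^|\s]+)\|(\S+)\s+([ACGT]+)")


def parse_spacers(text):
    """Return all spacer records found in a whitespace-separated dump."""
    spacers = []
    for match in _RECORD.finditer(text):
        sid, start, length, contig, tools, seq = match.groups()
        spacers.append(
            Spacer(sid, int(start), int(length), contig, tuple(tools.split(",")), seq)
        )
    return spacers


# Two records copied from the NZ_AP017372_5 spacer list.
sample = (
    ">5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT CGGACTCGACCTCCTCCATCGAGCCGTAACTC "
    ">5.17|762963|33|NZ_AP017372|CRISPRCasFinder,CRT TTCGCCGGTAGAAAGCTGATTTTCAAGCGCGAC"
)
parsed = parse_spacers(sample)

# Sanity check: the declared length should match the actual sequence length.
for spacer in parsed:
    assert len(spacer.sequence) == spacer.length
```

Comparing the declared length with the sequence length is a cheap way to catch records that were truncated or merged during extraction.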
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_6 | 2199793-2199897 | Orphan |
NA
Consensus repeat of NZ_AP017372_6
|
1 spacer
spacers of NZ_AP017372_6
>6.1|2199827|37|NZ_AP017372|CRISPRCasFinder GGGGCGGGTGGTGGCGCCGGCGAAGTTTTAGAGGTGC |
CRISPR arrays and Neighbor proteins around NZ_AP017372_6
The CRISPR arrays of NZ_AP017372_6 >merge|NZ_AP017372|6|2199793-2199897|CRISPRCasFinder CCTCCAGGGATGGATTCACGGCGTCCTCCACACCGGGGCGGGTGGTGGCGCCGGCGAAGTTTTAGAGGTGCCCTCTAGGGATGGATTCACGGCGTCCTCCACACC >NZ_AP017372|6|5|2199793-2199897|CRISPRCasFinder CCTCCAGGGATGGATTCACGGCGTCCTCCACACC GGGGCGGGTGGTGGCGCCGGCGAAGTTTTAGAGGTGC CCTCTAGGGATGGATTCACGGCGTCCTCCACACC
>NZ_AP017372.2|WP_096410004.1|2198263_2199610_-|HAMP-domain-containing-protein MGSLFLKLFLWLWLTGIIIAGAFVVSWHHWAPASSLPSQAELEQVAEEISNLYAEEGGWGAVHGYLRSLSRDQKLRFVLLGQDELATRGMQRRMLRGLSQQDREILLDTAEDRGRLNGMLYKRVVVDFAGAHDFYLIALHPVEGLGGLPVWLRAVIALAVTGGLAGGLAAYLSRPLRRLRRASQALASGDLHARVPVVERGGDEIAALGRDFNAMAERLESLVEAQNRLLRDISHELRSPLSRLQVALELARRETSGDSNALAKMENDIERMDQLIGQLLTLARLESGAGASNMESVDLHELIGQVCEDAQFEAQASGCNVLKEDGPHLQITGDRHLLRSALENVLRNALRYTPAGGVINVAWQRDQDGIWVSVIDSGPGVSEERLNDLFEPFVRLSAARERDSGSCGLGLAIVRRAVQAHGGSVAAHNRPQGGLEVRFWLPLKGPAS >NZ_AP017372.2|WP_162549497.1|2197968_2198124_-|hypothetical-protein MNPSLEASWRHPWRQDLHTGADGGAGEVFRGSPVLVMFSPFGVISACALPN >NZ_AP017372.2|WP_096410003.1|2194859_2197997_+|PAS-domain-S-box-protein MGRDKVEQRTSDFSQNLLDKLPVAVCSLDSAGCFTYLNTAACQLLGYPDESALLGVNFSAVIDAERTPLPADTVTERLVAATDEPLCADTTLWLQPRVGAALPVEVEVAALRSAETAAGEVAVTLRECRTVQDITEQRARERFQQQLVAILDSTSDLVTLHGSDGSLIYINEALRARFGLQQLPSCPSVEEAIRRRHPPWAADLLLNEGLPTARREGLWQGQTAFLDADGQEIPASQVIIGHRDSSGEVVQLSTIIRDISDLKRTQQEAVESEQRFRLIAEHTRDVFWLRSEQKALYVNPAYERIWGQTREHFYANPNSFLESVHPEDRKEVETAINQAIAAGEEINIRYRIVRPDGDVRWMEARSSPFKISGTEETLRAGVARDVTDEALAMRRLEQLVAILDNTSDIVALHGEDGTLLYLNAAGRAKAGIDNQAVDGMLGIADDGSMPPLSLEEAISRFHPPWAADLLLNEGLPTVRREGLWQGETAILKADGTQAPTSQVLIGHRNSYGELTYISTILRDISDQKKAQQEVAESERRFRLIAENIRDAFWLRTDAQTLYVNPAYERIWGQPVESFYANPKAFLESIHPEDRYGVEKALDQAISEREEFNALFRIIRSNSEVRWIQMHSLPVPGRESEGMRAGIARDVTEREQALIQLREANRTKSEFLNAVSHDLRTPLNAIIGFTDLLADSELDAHQREQVKLCQAAGRTLLGLIDTLLDLSRLKAGRMTLQKEAFLLRKFLAERMPMLSQQAEDKGLNLQCSVDNGLPDRLQGDTTRLSQVLFNLVSNAIKFTDSGYVRVHFSRYDSKRLQVSVQDNGPGIPDEFQERIFEPFDRGTDAIKHLQGSGLGLAISRQLVNLMGGEIWLHSTPGQGSTFFFTAELPEDESEPAVDPAQTAPESANGAPADRENVAGTRVLVAEDEPTNILLIQALLERCGAEATVAENGQEALDIWQQAEQEFDLILLDMQMPILDGAQTVKSLREAEVAQNRVYTPVAMLSAHASTEVREQCLQSGADTYMTKPVRLDSLTDLLSWAKRRQR >NZ_AP017372.2|WP_109962895.1|2190516_2194395_-|PAS-domain-containing-protein 
MENKKGSKCGCLSAKCKLDLLQQVSGIGFWEYDLCTGQVNADAQCQSLYGYDNSESERSLATDYALWREHVHPEDLSWVEEEVQAAIANGQPWRFTYRIYRSDGQLCWLHSSGHVERDDQGEPCLLIGFEQDVTAQKLQEQALEQAHERLQRAEEIVSLGHWISYPATGELIWSPMTYQLCGFPAEAEPPGWEEFIARIPEQDRHQVAEYQLQSKADTDQSQGEYRIVHPNGRVVWVREIANRWQDENGRWIIQGTIQDITEQREALERSEARRQRLEAILETIPDVALTETDLEGTIREASRSAERIFGYSREELLGSDICMLHDPAEHARVREGIARLQRTAQGYSVECELIRGTGERFQAQLSVAPLLNERGEVVGKIGACIDLSAQFADEQRLRMAQEAAGFGVWDWDLAADQVYWDEACWRMLGYDPEQQSILTFADWQKFVHPEDLERVQPIVESHLAAGSPFTIELRYRCADGSWLWVQGRGQTLRRGADGSPTYMVGTHVDVQTLKETEFALRRSELELTEAKRIARLGHWLYDIKSGDVHWSSEIYEIIGLEPAETAMDWDTFLSRVPPEDHPELYEAIERTLNRGDPYELEHHLMSVDGGRRIVQARGYAEFDAAGNPQLLRGTAQDVTEQRVLQRELAEREAHYRDLVENQPLMIERFLPDTTVTYANPALGDCLGVEPEALIGQRWLDYLPAEERENIEAHLAGFTPSHPVSQFENSMPGKNGMQLWTMWTNRAFFDEKGELSHFQSVGVDITERRRAEQAEQQLREQLETRQKELEAIFAAARSVSLIKTDLNSVIEEASTGAEVLFGYSREELIGRHVSLLHTAEDIERLADYVERLFKDHEPIRMETDLVHRDGSRFPALFTVHPITDKHGELVATLGVSLDMSAQKRVEQELADTIRAKDTFLSAVSHDLRTPLNALMGFLELLDDPQLSAEQRSEYMEQCRQGAQRLLGLIETLLDLARLEAGRLELRPRATELPALIDNQVAMLRSRACEKGLSLDYSIAEAVPRWVEVDDTRLGQLLSNLLSNAVKYTQQGGVELEVRTVDNTRVSFAVHDTGPGLTEQQQKEIFTAFDRGGYRGTSQGYGLGLAIVSELINLLDGELSLSSTPGQGSTFAFTIPLPRASEPSPEGESSIAAGESDAEPTASSARRPLNILVADDEPANVMLAQALLLKLGCEVVCAQSGTEALAAWQAEDFDMLLLDLKMPDLDGDQVARNVRAEEHEQGRQPTRIVLCTAYAYSEVESLINESGCDAYLGKPLDRSALSSLLDWVASSYAK >NZ_AP017372.2|WP_096410000.1|2189753_2190440_+|response-regulator-transcription-factor MKPPHILLIEDERSSRSICLNALLRAEYQATVALTAREGLRLLRSTHFDLVLLDLNLPDADGLSLASTLHGSHPELPIIMMTVRTAAEQRAAGLEAGAVDYLSKPFHQTELIHRVRRALAAGPPTPATQITFGPWLLEVEQRSLHHAEGFELELTLGEARLIEFLLRARGRPVNRDQLAEAVARSREGNPKSVDVLVSRLRRKLEDKTRGLRHIVAVPGLGYRIEVGE >NZ_AP017372.2|WP_096409999.1|2189424_2189757_+|Hpt-domain-containing-protein MEIIDIDSALERLDGDRELCRELLLDFYIDYYEVDRILRDRLRDGATEDARQLLHNLRGAAGNLGMGRLETVAKTLSQQIRLGAVEVSALNNFSEALHRVLSELEHTKRL >NZ_AP017372.2|WP_162549496.1|2186024_2188763_-|PAS-domain-containing-protein 
MATLNVNEEYPGESLDVALCRFRPDTTLTFVNAAYARLFGGGSGSLLGQRWIEWVPVSARPRVWSVLAQLGPAAPFQTYTHEVCLVDGGVMPYQWTDVALFGPDLEVTEIQSVGHPVDGGCSSDATAAEAATCYSGGEQLLVESEWRARIISEAMEEVVWLRAGDQMLYVNSSYERIWGRSIEELMANPDSFLDAVHPGDRERIICSWAACKAGDVRFDETYRIVRPDGEVRWVHAVNSAPFASAGYTACSVGSARDVTAQIEIEQALYQANHDLRVAEQIARLGHWISDLHEGTLTWSPVTFQLCGFDVSRSPPDFDAFLERVHPEDRPKVAESQLAREPQRERTEAEYRIIHTDGRVVWVRELAQRDKGEEGQPILRGTIQDITAMKEAQATARQRQEELEGIFKAATSVGLVKTDLASSVIDVSAGAEQLFGYSREELIGQPVNCLHITEDQADLAKWVGHLVCEQRELNFETRLMRKSGESFPARLFIHPIQDEHGSVVATLGVTFDISDLKAVQQQLEDASRIKTTFLRAVSHDLRTPLNALISYVELLGNADLSREQRQNFVKRCQEAGERQAQLIDSLLDIARLQSGNIQLKRVPMELHQLLEEQCRLLLQRAQEEGLTLEWSIAEDVPGWVIGDPTRLVQVLSNLVENAIKYTDRGRVCVEVQAEGGELVGFAVKDSGPGLSEAQQQTMFNAFDRLGYDGPVAGSGLGLAIAKELAHRLGDGLWVESAPGQGSTFGFTAHLPPCAPAQKVEDKIEEQLPESNSLGQSLRVLVAEDEQTSAVVVPLRLQRLGCEVTLVKRGDAALEAARQGVFDLLLLDLSMPGLNGIEVARVLRSEEDARPGVRRVPKVLCTAYSREEIEQEFDLVEVDALLEKPIREKHLRQLISRVRLGLDPAEAGGRPYGV >NZ_AP017372.2|WP_096409997.1|2184599_2185778_-|hypothetical-protein MRAGYEDSPAILGLGANIQVGDRLTFDVYDEEGRLLMRRGKQILSQNQLRRIFHDGRVELSGRSLSKLGHSRFTHPGGPAVRRGNPVQEDLSPHERLHACAQALQRNYERIRAGERDFLPRLRQVVERLQRLIDLDTDAALGVAHLSRAYPEHILQPLRQAIVADIVARAAGCCEGYRNSLIGAALSADIGMLELRAVLDQQSTDISQAQRKALVEHPERSAQILREAGLDDDNWLRAVMEHHERLDGSGYPRGIRGDSVCDIAGLLMVANVYMAMVTPRAHRSARPPKEVLRELFCEADHLYPAHYAQYIVRELGIYPPGTVVELETGDVGVVTRRAGKWARPWFYSLQRAAGRRLRAFECNLTEEDLEIVRSYRPEDIKVPIPECAPWGY >NZ_AP017372.2|WP_096409996.1|2182155_2184612_-|PAS-domain-S-box-protein 
MGLLKGTLAYADWLWVQGRGQTLRRGADGSPTYMVGTHVDVQTLKETEFALRRSELELTEAKRIARLGHWLYDIKSGDVHWSSEIYEIIGLEPAATAMDWDTFLSRVPPEDHPELYEAIERTLNRGDPYELEHHLMSVDGGRRIVQARGYAEFDAAGNPQLLRGTAQDVTEQRVLQRELAEREAHYRDLVENQPLMIERFLPDTTITYANPALAAYVQTEPDRLIGQRWIDLFPVDEQKRAQAHLTSLTPQQPVGRLENSLTGADGLRYWILWTNRAFFDDSGTLSHFQAVGVDITARRRAEQAEQQLREQLETRQKELEAIFAAARSVSLIKTDRDTVIQEVSCGTEALFGYSRSELIGQHVSMLHVQEHVEARQLDQSSLPIRLETKMRRRDGTTFMAHLAVHPILDADDQIIAALGVSFDISDQKRVEQELADAIRAKDTFLSAVSHDLRTPLNALMGFLELLDDPQLSPEQRSEYMEQCRQGAQRLLGLIETLLDLARLEAGRLELRPRATELPALIDNQVAMLRSRACEKGLSLDYSIAEAVPRWVEVDDTRLGQLLSNLLSNAVKYTQQGGVDLEVCAVDDTRVSFAVHDTGPGLSEQQQKEIFTAFDRGGYRGTSQGYGLGLAIVSEFINLLDGELSLSSTPGQGSTFAFTIPLPRASEPSPEGESSIAAGESDAEPTASSARRPLNVLVADDEPANVMLAQALLIKLGCEVVCAQSGTEALAAWQAEDFDMLLLDLKMPDLDGDQVARSVRAEEHEQGRQPTRIALCTAYAYSEVESLISESGCDAYLGKPLDRSALSSLLDWVASGLQR >NZ_AP017372.2|WP_096409995.1|2178538_2182030_-|PAS-domain-S-box-protein MGPPPFARIAALLAVSLSGVFCGAGVLSLAEYGGMTQEGGGMDGSSAKPNKLTAGFLHQLLDSQAVAVCALDSGGRFTYLNPAACRRLGHSDDTALLGERFDTVIDTERAEPSAAALTEQLTAVAVTGEPLSIYSVLWLHPCARTSFPVIIEASPLNSDETEERGVVVTFRDATVQHHALNQALQRAEQAERIGAIGHWIHYPESGQLIWSLMTYELFGFDPDGSKPDWGAFIARVPEADRGQIAESQFKADPSRRSCEGEYRIVHPGGRTLWVREIAQRLKDENGQSIIQGTIQDITKQRKALERSEAQRQRLAAILRTVPDVALIETDLEGKVSELNRSAELMFGYSREAFLGSDIYMLHDPAEHAQVREGIARLQRTAQGYSVECELIRGTGERFRAQLSVAPLLNERGEVVGKIGACIDLSAQFAREQRLRMAQEAAGFGVWEWDVETDRAHWDEASWRMLGYDPEQQGTLTYAQWQALVHPEDLERILPEFEHHLAAGTPFTIEFRYRCADGSWLWVQNRGQTLRRAADGSPIYMVGTHVEIQQLKETERALAKSEQRFRDVTLAAGEYIWEIDPEGRYTFITSPAEPLLGRPVEAIIGCSVFDFMPDDEAERVHGLLQAWADERSAWRDLEHVSLRPDGSLVHQRVSGLPILDEDGNLTGFRGTGRDITAEKEAERAQKRLTERLRLATSAAELGIWEYDLKSGRLECDECMCRLYGIDPATFGHASEGWVEAPESKSLDTTVFGNTFEDWAETLLPDSRDSTVAALNEAVASRTPFDIQMEIRRADDGSCRTLHGHAQVICDASGIPVRIVGISRDITAEQEYRRQLAAAKERFAGIFEQTGSGVAVYRPVDEGRDFECIEINPASERLDQITRDEVIGRRLTDCFPGVVEMGLLAALQRVARTGVPEELPLASYQDKRITAWRENRIFRLSSGEVVAVYDDRTEIKQAQQESERARKQLANLTAQLPGFIYQYRLWPDGRHAFVYANGRAEQIYGVTPEQAIEYPDHLFEVIYEADRGEFYRSIERSAQALTPWYQTFRIHHSSKGTAWLEGNSMPERLADGSTLWHGYIHDITDRVRAEQELAQSKARLEEFFNQSISG
FFFMMLDEPIDWQGATEEQKEALLDYALTHVVGCQNPYIFGGGFKNKHIYWVVKS >NZ_AP017372.2|WP_096410006.1|2199927_2200668_-|response-regulator-transcription-factor MTRILLVDDDQELTAMLSDYLTGDGFEVVTAYDGQKALEKVDTAGPDIIVLDIMLPVYDGFEVLRRLRQSHHDQPVLMLTARGDDVDTVVGLELGADDYLPKPCNPRVLVARLRALLRRTQSEVASSAEQLQVGDLCLDLGQRRATLRGCESAAITPLELTDAELDLLACLLRRVGQAVDKDKLSREALNRPLTPYDRSIDWHISNLRRKLGPFDDGSERIKTVRGVGYQYVSGKGTYKHSPGASL >NZ_AP017372.2|WP_162549498.1|2200642_2200795_-|hypothetical-protein MNPSLEASWRHPWRQDLYTGATTGQLSGSGRAFEVPITRKSKFDPDSACR >NZ_AP017372.2|WP_096410007.1|2200903_2201332_-|hypothetical-protein MRKQILSTGILCLTAALFSPSVLASDSKGELEQSWERGAMQERIIQRLDLSDEQRDKLLEIRNRHLDKMHEEMKEVLTGEQLEKFLDLRESAEQRLRQGGGGDWRGDSRSGSNNRSDSGRGEFGRGESGRGEPGRDGSGRGN >NZ_AP017372.2|WP_096410008.1|2201553_2204997_-|pyruvate-carboxylase MAQFHKILIANRGEIAIRVMRAANELGKRTVAVYAQEDKLGLHRFKADEAYQIGEGMGPVEAYLSIDEIIRVAKMAGADAVHPGYGLLSENPRLVDACERAGITFIGPRAETMRALGDKASARHVAIAAGVPVIPASEVLGEDMAAARRWADEIGYPLMLKASWGGGGRGMRPILGPEELEAKVLEGRREAEAAFGSGEGYLEKVIERARHVEVQVLGDTHGGLYHLYERDCTVQRRNQKVVERAPAPYLTPEQRAEVCELGLKVARHVDYQNAGTVEFLMDMDTGSFYFIEVNPRIQVEHTVTEEVTGIDIVKAQIRISEGEHLDAATGKADQGEIWLNGHAMQCRVTTEDPQNNFIPDYGRITAYRSATGMGIRLDGGTAYAGGVITRYYDSLLVKVTAWAPTPTEAISRMDRALREFRIRGVSTNIPFVENLIKHPVFLDNTYTTKFIDTTPELFEFDKRRDRATRLLTYLAEITVNGHPEVIDRPRPAAGIPLPTPPKAQGEPLPGTRNLLEEQGPQGLVDWLAGRKELLLTDTTMRDAHQSLLATRMRTFDMVRVAPAYAANLPQLFSVECWGGATFDVAYRFLQECPWQRLRQIREAMPNVMTQMLLRGSNGVGYTNYPDNVVRAFVHQAAASGVDIFRVFDSLNWVENMREAMDAVLETGKVCEGTICYTGDILDPGRAKYDLKYYVKMGKELRDAGAHMLGVKDMAGLLKPEAARVLFPALKEEVGLPIHFHTHDTSGIAGATILAAADVGVDVADVAMDSFSGNTSQPVFGSIVEALRHTERDTGFDMENVRAISNYWEQVRAHYAAFETGQQSPSSEVYLHEMPGGQFTNLKAQARSLGLEERWHEVAQAYADANQIFGDIVKVTPSSKVVGDMALMMVSQGITREQVEDPAVDVNFPDSVIDMLRGNLGQPPGGWPQGIQQKVLKGEQPLQDRPGKYLEPLDLEEARQQASEALDGAEIDDEDLNGYLMYPKVFTEYMRRSQRYGPVSALPTRNFFYGMEPGEEISVDIEYGKTLEIRLMTVSEPGDDGNRRVFFELNGQPRTVHVADSKAKAQVVQTPKAESDNPAHVGAPTPGVVAAVAATPGQRVKAGDLLLTIEAMKMEMGLHAERDGEIKAVHVQPGSQIEAKDLLIEFAE 
>NZ_AP017372.2|WP_162549499.1|2205139_2205304_+|hypothetical-protein MNPSLEASWRHPWRHEASTPGRMVAPAKLLEAPCKHHPAKQVTHRFFHSFCGQL >NZ_AP017372.2|WP_096410009.1|2205644_2205941_+|hypothetical-protein MSFQWRTYTFETPWEALVGAWAEISDINNLQISTVRSLATDCDAATRHISQEDANDSLPRIYDVGTHAEEQEVPFDPDDHITALMEAAERALHELGEA >NZ_AP017372.2|WP_096410010.1|2205977_2206529_-|RNA-2',3'-cyclic-phosphodiesterase MDRRQRLFFALWPDDDLRSAICSRVPSGHGGRPVARDNLHLTLAFIGAADTAYAECLAEAAQAVRFEPFAFELQGLGCFGNGKVLWLGNVKPREPLERLAQDLSAVLQPCGFEPEARPFCPHVTVVRKPKAPLNLGPIEPVLWEVERFCLVSSIPAQGGVKYEVVRSYCAAKDHARSGTGDLP >NZ_AP017372.2|WP_096410011.1|2206485_2206707_-|Txe/YoeB-family-addiction-module-toxin MKLIFSENAWEDYLYWQKTDKKILNRINRLIKEIKREPFEGVGKPEPLKHSSTARRVNPYGSASTPLLRAVAR >NZ_AP017372.2|WP_096410012.1|2206703_2206955_-|type-II-toxin-antitoxin-system-prevent-host-death-family-antitoxin MDAISYTAARANLAKTMEQVCEDHSPVIITRSKSQSVVMISLEDYEALQETAYLLRAPKNARRLLESVVELEQGGGQEKALFE >NZ_AP017372.2|WP_096410013.1|2207932_2209705_+|DUF262-domain-containing-protein MDTRGDENSALQVKVQSFEELTAAGRELKLDDYQRGFVWDEQRVRQLIDDLAEFANQQLSNSKATQPAYAYYMGTVLLSRESEQPGERGNSAYVIDGQQRLAALSLLWSAAQEGSEVPPAMAFSYRDSRSQAQLQAAYKTIMTGLDRTGLKATHRTRELFKDVELFRHITLTVVTTGSIDEAFTFFDSQNSRGVPLHTTDLLKAHHLRAIRNHAPHQSSQAEPIQRDSARRWEGMQQASNEKNSPAGEDPVHRLFNYYLWRARNWFGPLSQRPLYPSRKALQKTFKQNAYPPRELENRLRGRPPEAVNGAEGNKLAEGGIDRVACFPCTARVHHSVVKWEPHQKEWDMEVSLPSLGTAPHNLPFTLRQPIAEGAGFFLYAQRYERLLAHMETPPQVEHVPGQRSGDDWTDFRHLYQKVVLELSHYLRQAFLLASMLYIDRFGTCRLYEFALWLEYILGAERLIKASIFQSSSRSVLERNDAGSESIGNLLDFIAVNEIPDPVIRALQEDRAADKALEKALQDNISEGSFTFGEHSVRDRYIKAVSRYFNQADKDRDKEKLPENDQERRDWILERRNWITDMLRSGGINHG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_7 | 2237053-2237231 | Orphan |
NA
Consensus repeat of NZ_AP017372_7
|
2 spacers
spacers of NZ_AP017372_7
>7.1|2237089|35|NZ_AP017372|CRISPRCasFinder TAGTTGTTGTTGCTATTCTTGAAGACCAATCCATT >7.2|2237160|36|NZ_AP017372|CRISPRCasFinder CTTCCTCTTTTTGACTTTCTAGGCGAGCGCGTGTAA |
WYL |
CRISPR arrays and Neighbor proteins around NZ_AP017372_7
The CRISPR arrays of NZ_AP017372_7 >merge|NZ_AP017372|7|2237053-2237231|CRISPRCasFinder GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACTAGTTGTTGTTGCTATTCTTGAAGACCAATCCATTGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCTTCCTCTTTTTGACTTTCTAGGCGAGCGCGTGTAAGTCTGAATCCGGCCCTGCTTGGGAAGGGAACAAGAC >NZ_AP017372|7|6|2237053-2237231|CRISPRCasFinder GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC TAGTTGTTGTTGCTATTCTTGAAGACCAATCCATT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CTTCCTCTTTTTGACTTTCTAGGCGAGCGCGTGTAA GTCTGAATCCGGCCCTGCTTGGGAAGGGAACAAGAC
>NZ_AP017372.2|WP_096410032.1|2236023_2236422_-|YjbQ-family-protein MRTTITVTTHQREELVDITEPIRRAVAEAEVSDGLLALYVQGATAAIMIQENWDASVPRDAVNLLQQLVPRGVWEHDAQDGNGDSHLKAGLIGPSETIPIINGKMGLSTWQGIFLCEFDGPRRERTVVCTLN >NZ_AP017372.2|WP_162549502.1|2235494_2235935_-|DUF488-domain-containing-protein MHTLYTIGYERTYLEAFIATLQRASVATIIDVRASPHSRRREFAFKHLARELPGAGIGYESWPVLGAPQAARDAAKAGDAQRFYQLYASHLEEPKTQDALHSLAERAVTEAVALLCYERDPAECHRLLIAERLERSHKLASHHLAG >NZ_AP017372.2|WP_096410030.1|2234415_2234907_-|BrxE-family-protein MRAIEVLAELRALVGYLGEEHGWWGSQFFARSSRTFLMPVFPRSLPLSQYQGVTVAAARAHDERIGVGRIVHLFRMPELHEQAAAAVLRDATGIDQVLAHLGSREEAMQRLSALAYPVEPNEGPVLVGGWDEDLALSLGKMAGHYAAAIHENRRAYPYLRHAE >NZ_AP017372.2|WP_096410029.1|2233633_2234419_-|DUF1819-family-protein MSEARLYTTRLQAGLGLVDETLALLELYRSGMSVRELYTAALDSGRFPTMTARRLLNLVQEGFAPRYMEDPEVAAILKRLAEYWQRDELIQLFMLYTARANCILADFIREVFWPRYMAGFDELSRDDATAFVEAAVREGRTQKPWAPSTVRRVASYLLGTCTDFGLLGNCRLPPRPIRPVRIHPRVATYLAYNLRGLGFADRQIIRHPDWGLFGLEGDDVRQQLKRLAPEGHFIVQSAGDVTQITWGYRTMGEAVNALAGH >NZ_AP017372.2|WP_096410028.1|2233080_2233653_-|DUF1788-domain-containing-protein MPSLDINFNELMERIRRGREFGHASFEPIFYLVFSPEEILSVKRKMHAWTSRLANEGWEVHTFSIAQAVDEILSNAPMRQTWLMADRRKPVDWDKTNSSLANAIANGALQQRLEATLEPLEGNQHAILLVTDLEALHPYMRIGVIEGQLQGRFQVPTVFFYPGIRTGDTRLKFLGFYPEDGNYRSVHVGG >NZ_AP017372.2|WP_109962896.1|2230107_2233065_-|BREX-system-P-loop-protein-BrxC 
MAIKDLFDPSRDIYRSIEKVIAFGVSQEERLKKEIAEYVVTDAIDEQFNDLLRKMQAAMDAGGENEVGVWVSGFYGSGKSSFTKYLGLAFDESVTVDGVPFRQHLQDRLKSSSTRQLLNNVAKRFPAAVLMLDLATEQVSGATMAEVSSVLYYKVLQWAGYSRNLKVAYLERRLKQEGRYEEFLEMFREKTNGEDWSGYRNDELVIDSLIPEIAHELYPQLFPTQQAFNTESTDVVRFENDRVQEMLEIVREASGKEYVIFIIDEVGQYVGSRPQMILNLDGLAKNIKAQGQGKVWIIGTGQQTLTEDVPGASVNSQELFKLKDRFPININLQADDIKEICYRRLLGKSAEGSRQIGELFDQYGQALRQSTKLEDARAYGADFDRQTFIDLYPFLPAHFDILLHLLGSLARSTGGIGLRSAIKVIQDILVDETGNRTPVADRPVGWLANNVILYDALEKDIERAFPTIHKAVANVYKTHYVASELHQRVAKTVAVLQILGNLPITRRNVASLMHPDATQPSEATEVEGAIEDLIGDSYVPFGEQNGGLRFFSEKLNDIEQERSKLPIRQAERKRLINIALGEAFSPLPTTQLGGSLSVQSGLKVQNGGLPTSLAGERNSIQTLVELVDPADYEAARTRLTDESVTRNAEQQVLLIGRYPTEIDDLTAEIHRSQEIANKYRNEPDQEVKEYCKSQQDRAARLQGELQRQLKRSLVQGSFIFRGQVTAVETLSSDLIEAARKHLAEVATQVFDRYSEAPVCANTDLAERFLRVGNLNGVTSQLDPLGLVQTVNGQPQINTQNRAIVSIRDHVDKVGQVEGKSLTKRFSEAPFGWSQDTLRYLVAAMLMAGVIKLRVGGRDVTVNGQQAQEALKTNNAFKNVGVSLRDDSPSNEMLALAAERLTEFTGESVVPLEDEISRTAMGLFPKLQQRFAQLSAKLTSLELPGGERLGNLTQAMVEMQEADASDAPQRLWALHTSRLASWRSAA >NZ_AP017372.2|WP_162549501.1|2228118_2230128_+|IS4-family-transposase MLEHRLRDLGRIECYPVESDEDLRLWEELLEAEHFLGSGPLVGRRLRYLVRSENFGDVAALAFSAPALRLGARDGWIGWSDVTRAEHLDRVVCNTRFLVRGHLRVSGLASHVLGQILRRLPEDWAAKYGEPPVLVETFIDRSRHRGGCYRAANFIYIGDTAGRGRNDRYHEGGAGAKAVYLYPLCSDWRRRLGAPEQPPCTPDIDDWARHEFAHVSLGDQRLQQRLIRVGRALAAQPTASLPQACGNRAATQAAYRLFAHPRVTMNSILGSHYQATVSRCHAEPVVLAVQDTTTLNYVAHPLSEAGFGPIGSRADGAHGLIVHDTLAINPSGTPLGLIDVQAWARETEDHGLRRLGSDEWTLDNKESGKWTDSHQRASELQQQLDTGTRVVSVADREGDLFELLTAATDPERADLLVRAKHDRPLADGSGRLFGHMKALDAAGVQELTLPKRGNQKARTTRMAVTFDRVTLQPPKNKRGQEPVTLDVIRTTEINPPKGAQPVTWTLLSTVPVETLEDACERLAWYTKRWQIEVYHRTLKSGCRIEERQLGSADSLEACLGVDLVVAWRVSLLTHQSREDPNAPCTVFFTPDQWKALWVRTGAEGIPEDNDEPTLREATRTVATLGGFLGRRSDGEPGAQALWKGLQRLDDITEMFCIMIERARSGRAPP >NZ_AP017372.2|WP_096410026.1|2226023_2227760_+|hypothetical-protein 
MACLSLTREIQHERRVVQPEPDRNGGARLTDLGERSVRIEAASLHALDLLDAPRLEHLDLTGCRPGLFLALARCPHLQRIDLPAGEPGAVIHWDQHGVIDNEAVIYGAVEHLDLCGRGYAFGLPSTGAAARSWQGARICTTPASWFAATEQALIWLGSECTPVKLKIPQPARSIAIHGPGVEAVEAREDAALEAIDLHQALDLRQITLASPLYGLAIERAERLERVLASGEALQLKYCGDLLSGVLLEGTWSHALLADTTIRDDTAPVLEQVVVRGGERPIGQGQARHVHPWLPENRRTPLLASEIPLLLEAAQAGERRASTALIRWAEAVPRRNVLFALQTLYSLLERPDPPFERIWQARQTLARRFSSSQRPRPEWGWNLPEDLIEEALRMDLRLFARCRGQVTATMRLDVHLRNAPRSQVLRILAAIAADRGVPTAERSVALDLLREALAVAARGFIARPRGAEYQGPPSDLGPLVRVIIEQADRSMADDLLTWGERVVTLSKRVRYLGEFAAHGHAPSRAAALAIGLQCPPSGQHRRWGAEQAQAIRQQAMAAALTPPRSDRLADLNANQEARS >NZ_AP017372.2|WP_096410025.1|2224733_2225732_-|WYL-domain-containing-protein MTDTILRQWTMLQAIPRYPRGVSAPGLHERLFSEGYDIRLRTVQRDLNTLSLEFPLLCERQGNEQRWSWRPDAPVLDVPGLSPAAALAFRLAELHLSGLLAPEALRALQPHFEAAKRVLAHGGSQLANWPDRVRVISRSQPLLAPPIDQEVYDRVCQGLLEGRQLYAQYRTRSRGNELKSYRVHPFALVSRDPVTYLVATLRDYTDVRQLALHRVEAAELLEDPVVPPEGFDVDAYIEEERAFDLPEQGEPIDLELRISAGVAEHLGEAPLATDQVIEACDDGWCTLWASVPLTAQLRWWLLGFGQAVQVLEPQALREEIAAELRAAAHAYD >NZ_AP017372.2|WP_096410024.1|2223766_2224438_-|SOS-response-associated-peptidase MCGRFALTTPMAEIAANYFDISGVEEFTPSFNIAPGLAIATIRTGESGAVECSWARWGFRPRWADQHAPQPINARAEKAATSRYFREAFERRRCLIPASGWYEWRQENGTKQPYYITLKEEDAERVIFLAGLWEPLDEPPGACCVVLTEPAAPALETIHPRQPVVLDPACRWEWLSPERTTRTAVRQASRRLPAARLQYWRVSTAVNRPQNDGQELIQSGGSA >NZ_AP017372.2|WP_096410033.1|2237968_2238214_-|class-I-SAM-dependent-methyltransferase MADQYEQLYFDLVHGWLRDYLPSGGGALVLDIGAGSGRDAAWLAEQGHDVVAVEPAAELRQEAQRRHPDEWISWLGNMVPI >NZ_AP017372.2|WP_162549503.1|2239331_2240291_+|hypothetical-protein MNKTSPHEHRIPINRKESFYTGTIFPMLAAEGGFEALRRIITEVYPASLDIPLLKKPWRQGWDMQFFTEYSLKESIVPDTPNKMFENIELSGSKETPDIVLYFYPNDTQASPLRGTLVGIEAKMFTQPSLPDLEGQISAQRQILQQMSEKLDNCDLYQLVLIPEETIKKYDEQKLSDSVQEGRFQGWLTWNSVLDAWQKANGDYASPAGLFAKDLEFALENFDSLVSEVSRSGQNCDAKLTGKAIYNGFKEGSTDFTYKIMGCSGGLNGNRLNNHIEKSEWQYQEYEVSNQHEPFNHNWFSIDDFVKKIDNRCPKSDDH >NZ_AP017372.2|WP_096410036.1|2240280_2241786_+|DUF2779-domain-containing-protein 
MTIDLTQKASTAPRLSKSRFIAGWQCPLRLWYAVHHPELAPPPDDRQQAIFDRGHKIGELAQQRYLGGRLVAADFRHIEAAIDETNALMAKPEVPVLYEPAILHRNVLTRVDILARFASGWDIIEVKSSTRAKEVFRVDLAVQYWILRGAGVPIDRAGLLLLNRDYVYPGGEYDLQSLFRFEELTEQCQARQGWVEEQVERFQAIVAGASPPAIEPGEQCTTPYTCPFTSHCWRDREQAANPITLLPNLASSRVASLREKGIEAIEDLPPDYRLTDVQQRVRQATLSGLSWQSSGLKAALEKVSWPLFYLDFEAAMMALPPYAGMRPYDPVPFQYSCHIQRRPYGSLEHQEFLATEDGDPRTLLAESLLDTLGDSGSIIVYSGYEQATINRLAQALPDQAGRLRALIPRLVDLLAIVRNHYYHPDFRGSFSIKKVLPALVEGMDYLDMEVADGEAAGRAWQQMLASEDTAEQERLAAALRAYCRQDSLAMYRLREALMELT >NZ_AP017372.2|WP_096410037.1|2241823_2242126_-|hypothetical-protein MPDDGTGSLPLLRHIILNDNKSSTNKLALLRTLCRIADGADGLAVVDDNDKIKLHMGLVALTWIRLFMPMLRASIPQLPKHQEAQKGWVLPETVSVTALG >NZ_AP017372.2|WP_096410039.1|2242497_2243277_-|hypothetical-protein MNYTNFWIDGRSFEKARNHFRAAVADPKARPNQKVVLDLNEHAYMLGMELTHGYPFIRNDGLSMTIHDHELTPRENSYNIIDLINIRFTERALKGAYTNLVYHEEAQQFEEKLSNLNVQRREDALKFSEEVCRWGRGMRVWGRLNQHYSQSQLGEAISSWLSSVRNYDSYIDPISQGVSIRGLGVSFASKHLRLLDPSRFAVLDSVLSEGLGYALTPQGYNLFMNDLVKIKNDYLQEWRLCDIEASIFALVRQRVITTG >NZ_AP017372.2|WP_109962898.1|2244148_2244493_+|IS1634-family-transposase MVDRYRSLADIERGFRALKSTLQIAPVHHRLPDRMRAHALICFLALILYRVLRMRLKANKSEYSVERALEALESVQWHRVKINGESHTGVSVSNLQRKLFKDMEVKPPKQATTA >NZ_AP017372.2|WP_096410042.1|2244945_2245599_+|MarR-family-transcriptional-regulator MKSQDIGLLLKLVALRSREGHGHDTHASKGAKALPDDWRDWALDDVGSDLCQESMPGLDDDQLLSRYSVRALAEETGISKSQVSLALQRCLEVGLVRKERSTGVPRANVRALLKFIVHGVRYVFPAKPGEITRGIATTFAAPVLEGQLYSAGELPMVWPDARGNSKGQAIEPLFKSVPFAVRRDPELYAMLALVDAIRLGHPRESKVAAERLAEYLE >NZ_AP017372.2|WP_096410043.1|2245602_2246307_+|hypothetical-protein MSLFDDQRAMLRRVAEGLGTELRDQVAFVGGCTTGLLLTDAFTREQVRSTDDVDLIISVMTYAHLNRFKEALKTKGFKDPSPMDGEMPICAMKLGELRIDLIPDHDEVLGFSNHWYPLALKTAEPVSLGGDLTIRVVTPPLFIATKLEAYKGRGESDPLSSNDIEDILNLVDGRPELLDEVRAADSALQAYIAAELSELLGKDDFSYAVQSQAGDPDREALLFERLEILTGVRG >NZ_AP017372.2|WP_096410044.1|2246306_2247005_+|SpoIIE-family-protein-phosphatase 
MQARWFSQQGRERARNSDAAAVGQQGQHLLAVLVDGAEKGPRGAELARHWADTVMQALAEASTRSQATVGARLRQAHAQLRHDFLHDIASYCMVSLDLETLAMHVWHCGDCRVGLRRPTKTRWLTTPHLLVHQPGLPSSCSPEEQERREQQLTRSLNARRFCPPENHVFSLCQDQTLLLSTDGYWQEHLEAGTPRDCLQDDASLLTLPVRPGSLAHVEQASDTDNLRYVSPA >NZ_AP017372.2|WP_096410046.1|2249040_2249862_+|type-IV-toxin-antitoxin-system-AbiEi-family-antitoxin MSRQKRDNLKRLLEAVPAGFLVDSAWLERHGIGRRSTYAYVKNGWLTRVHRGVFRRPAPNAPKTGVIDWKVCLLSMQYVMGYDVHVGGTSALGQHGFDHYLHLGSNVPVRVYGDAIPTWLVRLPLSAPIETRRTSLFVDRALGLTKDNKDAATILSWDWQLRISSPERAVMEAMDELPTHETFHNLDRIFESLTTLRPRTLSALLHSCKKIKVKRLFFVFADRHDHPWRKRLDAEEFNLGSGDRALVSGGRMHPRYRIMVPEDFVKPEVSDGA |
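The neighbor-protein records in these tables appear to share one header layout: contig accession, protein accession, `start_end_strand`, and a hyphen-joined product name, separated by `|`. The sketch below splits such a header into typed fields. The field order and meanings are inferred from the records shown here, not from any documented format, and the names `NeighborProtein` and `parse_neighbor_header` are hypothetical.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    contig: str        # e.g. "NZ_AP017372.2"
    protein_id: str    # e.g. "WP_096410025.1"
    start: int         # assumed to be the start coordinate on the contig
    end: int           # assumed to be the end coordinate on the contig
    strand: str        # "+" or "-"
    description: str   # product name, kept hyphen-joined as in the record


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Split one '>contig|protein|start_end_strand|description' header."""
    contig, protein_id, location, description = header.lstrip(">").split("|", 3)
    start, end, strand = location.split("_")
    return NeighborProtein(contig, protein_id, int(start), int(end), strand, description)


# Header copied from the NZ_AP017372_7 neighbor list.
example = parse_neighbor_header(
    ">NZ_AP017372.2|WP_096410025.1|2224733_2225732_-|WYL-domain-containing-protein"
)
print(example.protein_id, example.start, example.end, example.strand)
```

Keeping the description unmodified avoids guessing which hyphens are separators and which belong to the product name.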
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_8 | 2254998-2255176 | TypeIII |
NA
Consensus repeat of NZ_AP017372_8
|
2 spacers
spacers of NZ_AP017372_8
>8.1|2255035|34|NZ_AP017372|PILER-CR CCTGCGGGCGGCTTTCTTTACCTCCCTTCGGCAC >8.2|2255106|35|NZ_AP017372|PILER-CR TTTTTGACACCGGCTAATCGTCTACGCGATAGCCC >8.3|2255034|35|NZ_AP017372|CRISPRCasFinder CCTGCGGGCGGCTTTCTTTACCTCCCTTCGGCACT >8.4|2255105|36|NZ_AP017372|CRISPRCasFinder TTTTTGACACCGGCTAATCGTCTACGCGATAGCCCT |
CRISPR arrays and Neighbor proteins around NZ_AP017372_8
The CRISPR arrays of NZ_AP017372_8 >merge|NZ_AP017372|8|2254998-2255176|PILER-CR,CRISPRCasFinder GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCCTGCGGGCGGCTTTCTTTACCTCCCTTCGGCACTGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACTTTTTGACACCGGCTAATCGTCTACGCGATAGCCCTGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC >NZ_AP017372|8|5|2254998-2255176|PILER-CR GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACC CTGCGGGCGGCTTTCTTTACCTCCCTTCGGCACT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACT TTTTGACACCGGCTAATCGTCTACGCGATAGCCCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC >NZ_AP017372|8|7|2254998-2255176|CRISPRCasFinder GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCTGCGGGCGGCTTTCTTTACCTCCCTTCGGCACT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC TTTTTGACACCGGCTAATCGTCTACGCGATAGCCCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC
>NZ_AP017372.2|WP_096410412.1|2253796_2254171_-|DUF2384-domain-containing-protein MTARKQHQLGAAGLRAYPNIARAWGLTETQAARLLGAPESTYRRWKRNPERASLDVNHLERLSLILGIHKNLHILLPREDAANSWVRRPNTNPLFAGHTPLERMLGGQVGDLVAVRQHLDGARG >NZ_AP017372.2|WP_096410048.1|2253105_2253744_-|RES-family-NAD+-phosphorylase MVTSRFPPIALFEGIAGDPADLDALNELEGLTANRLREEAGEIHLIKQEDRRYGPRWSPIMAALCYPRPSRFTDCSFGVYYCADNERTAVAETRYHRERFQAESNEPPMAVEMRVYIAELDADLLDLRGDTNLATSYLDPDSYANSQRLGAIARMHDHYGLAYPSVRDQEGGDCAAVFRPPALGPTRQGKHFEYRWDGQRITAVVELRETNY >NZ_AP017372.2|WP_162549504.1|2251801_2252302_+|cyclin-dependent-kinase-inhibitor-3-family-protein MTLCPGKIGPGRVHPWQRKLDDDIESIVQWGASRVVTLMENSELVSFGVGDLGARIRERLGDHCWHHLPIIDGSVPSAKAEKNWEPIADDLHSCLGAGERICIHCLGGLGRTGVIACRLLVELGFSPDEALGRVRQARPGAVETKEQLDYVTRLPELPAVQKRISN >NZ_AP017372.2|WP_096410411.1|2250797_2250959_-|IS3-family-transposase MSDEQLHQEIRALIQGSAFTGEGHRKVWAKLRQLRGVYTSRKRVLRVMREHEH >NZ_AP017372.2|WP_096410046.1|2249040_2249862_+|type-IV-toxin-antitoxin-system-AbiEi-family-antitoxin MSRQKRDNLKRLLEAVPAGFLVDSAWLERHGIGRRSTYAYVKNGWLTRVHRGVFRRPAPNAPKTGVIDWKVCLLSMQYVMGYDVHVGGTSALGQHGFDHYLHLGSNVPVRVYGDAIPTWLVRLPLSAPIETRRTSLFVDRALGLTKDNKDAATILSWDWQLRISSPERAVMEAMDELPTHETFHNLDRIFESLTTLRPRTLSALLHSCKKIKVKRLFFVFADRHDHPWRKRLDAEEFNLGSGDRALVSGGRMHPRYRIMVPEDFVKPEVSDGA >NZ_AP017372.2|WP_096410044.1|2246306_2247005_+|SpoIIE-family-protein-phosphatase MQARWFSQQGRERARNSDAAAVGQQGQHLLAVLVDGAEKGPRGAELARHWADTVMQALAEASTRSQATVGARLRQAHAQLRHDFLHDIASYCMVSLDLETLAMHVWHCGDCRVGLRRPTKTRWLTTPHLLVHQPGLPSSCSPEEQERREQQLTRSLNARRFCPPENHVFSLCQDQTLLLSTDGYWQEHLEAGTPRDCLQDDASLLTLPVRPGSLAHVEQASDTDNLRYVSPA >NZ_AP017372.2|WP_096410043.1|2245602_2246307_+|hypothetical-protein MSLFDDQRAMLRRVAEGLGTELRDQVAFVGGCTTGLLLTDAFTREQVRSTDDVDLIISVMTYAHLNRFKEALKTKGFKDPSPMDGEMPICAMKLGELRIDLIPDHDEVLGFSNHWYPLALKTAEPVSLGGDLTIRVVTPPLFIATKLEAYKGRGESDPLSSNDIEDILNLVDGRPELLDEVRAADSALQAYIAAELSELLGKDDFSYAVQSQAGDPDREALLFERLEILTGVRG >NZ_AP017372.2|WP_096410042.1|2244945_2245599_+|MarR-family-transcriptional-regulator 
MKSQDIGLLLKLVALRSREGHGHDTHASKGAKALPDDWRDWALDDVGSDLCQESMPGLDDDQLLSRYSVRALAEETGISKSQVSLALQRCLEVGLVRKERSTGVPRANVRALLKFIVHGVRYVFPAKPGEITRGIATTFAAPVLEGQLYSAGELPMVWPDARGNSKGQAIEPLFKSVPFAVRRDPELYAMLALVDAIRLGHPRESKVAAERLAEYLE >NZ_AP017372.2|WP_109962898.1|2244148_2244493_+|IS1634-family-transposase MVDRYRSLADIERGFRALKSTLQIAPVHHRLPDRMRAHALICFLALILYRVLRMRLKANKSEYSVERALEALESVQWHRVKINGESHTGVSVSNLQRKLFKDMEVKPPKQATTA >NZ_AP017372.2|WP_096410039.1|2242497_2243277_-|hypothetical-protein MNYTNFWIDGRSFEKARNHFRAAVADPKARPNQKVVLDLNEHAYMLGMELTHGYPFIRNDGLSMTIHDHELTPRENSYNIIDLINIRFTERALKGAYTNLVYHEEAQQFEEKLSNLNVQRREDALKFSEEVCRWGRGMRVWGRLNQHYSQSQLGEAISSWLSSVRNYDSYIDPISQGVSIRGLGVSFASKHLRLLDPSRFAVLDSVLSEGLGYALTPQGYNLFMNDLVKIKNDYLQEWRLCDIEASIFALVRQRVITTG >NZ_AP017372.2|WP_162549505.1|2257333_2257504_-|hypothetical-protein MATPYEKLKSLANAEQYLKPGVTFKQLDEIAYAICDNEAARQLNEGSSQKTEKIVR >NZ_AP017372.2|WP_096410049.1|2257731_2259141_+|DUF262-domain-containing-protein MTVVVASCTVAKLFSGETFEASDGTLIEGNLHLPEYQRPYRWGEAQIRRLLEDLRRYFCPPHPGSSPAHLFYLGSIILHQDGEGRLNIIDGQQRLTTMALLMWQQAPGSEPKLRYESPLSHAQIRKNQEWLKQQENWNRAWLKLERINITVVVTRSEDDAYCFFETQNTGGVRLSGPDIIKAHHLRATPRSRQDRYARLWESLGDLNPVVDAVLKARSWNALNFRHVPSRREPLSVRETVVTELAENTGEGHADVAYGLTATSRTPDGAVVQVAHADGYAMRQPLNAGINSIHYLEYFESLRRILLTNHREPDLDSFHNFYQGLIVGRQGCSYLKKLYDSCLLLYASHFGRSQLFEASLRLFRVVYAPRVTNEKTVKEATASKFVRENPVFDWILMSYTHEQCMERLRLFEVKVSAKNLGQSDDGVKKRFVQAVNEWFSLELPKDRMAEQYDDALQKAIKSTLEVVNHG >NZ_AP017372.2|WP_096410050.1|2259133_2261071_+|DUF262-domain-containing-protein 
MDNCVLTQVQAPAAILDEDIAFVIPSYQRPYVWPDDAVVKLFDDIFRAWQWDACSNYYIGTVLTAPISHIEGAAYELIDGQQRITTLMLIALAFRVTGQETALNPLAERGNAPRLTFAIREQVQALLGYWSGLDGYQYPGEDAVKTNPYLTRLDDALNVLKQLVGAIEKDRRIELAGYIHTNVQWVNNTMPGSMDLNRLFATMNTRGVQLEQSDILKSMLLRRISTDKSRYEAIWQACEQMDNYFERNVRQIFGGTDWSDMLPEHLRHFDPARFLLSEEQNLEAERPGSGLTISQLADSSYPDENEGSSALDLDDTVYCQSIIGFPLLLMHALRIYTARSGYADIGGRLHSERLIDSFSVLVDASETEVKEFVECLWEVRYQFDRWVVKWVERSDEDERQLRLTDINRSPSNGNWYLTRSVKKLFELVPLQSVRHFTGEHSAQYWLTPFLGLLCMETNPSENTVLDIMERIDNQLSLAEASQKQASFELLSGDVSAQRSVAAIIEYLKTPCGTKFEHYWFQKLEYLLWKRDHSQDEKVLNYRIVSRNSIEHVYAQNEEFKNEMKRDYLDAFGNLVLLNPSENSSYGYQSVNKKKADFKDRSHYNGSYDSLKLKEIFSLMGQGEWSPNLVEAHQEAMFELISCHYS >NZ_AP017372.2|WP_162549506.1|2261142_2261535_-|hypothetical-protein MQHFRFLLELIFELLGALLVAVSLVGDLIGALSEKGKNRTLGQLQRPYSPELARAGLKVTKQSGHKGSCKPKMTGSEFESVKKLLARGVPPNHVAKALGRCTAGRRRQRMLSVRFFPFSERAPTGVSNST >NZ_AP017372.2|WP_096410051.1|2261526_2261805_+|hypothetical-protein MLQRKLDSAHQERNQAEHERHRLEKDYASLQSQVDRHKERIEQLQAEVMQERERHQEAQELARYHQEQSQALMEVLRRGDGTGEEMEGGGYS >NZ_AP017372.2|WP_096410052.1|2261964_2262939_+|L,D-transpeptidase-family-protein MWLCSPTICTAAAAALLAPALAGPSTASAGPGDRERALHRYEAPAQVEVVGGVYHVPVKPDEALAEVAEREGVGVERLRAANPHTATENASERALRIPARHVLPDTPREGLVIDVAGMRLFHYPEETDAVEVFPISTGREGWPTPVAMKTEVAERLENPAWYPPESIRDSRAASDESGSLPRMVPPGAENPLGEHVLILEVDGYLVHGTNEPHSIGERTSHGCARMHPQDIEHLFERVKAGTPVRFVDQPFRIGRSARGEVWVESHPSAPDGSNPELDRRFVRALPDIAGEGVAINGARLIEAVNDQDGIAVRVSVGEEGAGKR >NZ_AP017372.2|WP_096410053.1|2263122_2264055_+|acetoin-utilization-protein-AcuC MTGVPLRIATDKRLGAYHFGPGHPFGPGRMAAFLEALDELELAYEALPLAEADTATLTRFHAREYVERVQSLAGTGAPLDLGDTPAVPGIDGAAKRVVGTVAAAVDDLLAGRVRRAFVPIAGLHHGQRDRASGFCVYNDCGVALETLLAAGVAPVAYVDIDVHHGDGVYDSFETDPRVIFADIHQDGRTLFPGTGAAEAQGKGAAHGTKLNVPLPPGADDDAFVEAWERIEAHLERHQPKVIVMQCGADGLAGDPLASLRYTSKTHASAARRLRVLTERWAEGRLLALGGGGYDLSNIAAAWTAVVREIA >NZ_AP017372.2|WP_162549507.1|2264182_2264623_-|DUF488-domain-containing-protein 
MHTLYTTGYERTDLDTFIVTLQRASVATVIDVRASPHSRRREFAFKHLARELPGAGIGYESWPVLGAPQAARDAAKVGDAQRFYQLYASHLEEPKPKDALHSLAERAVTEAVALLCYERDPAECHRLLIAERLERSHKLASHHLAG >NZ_AP017372.2|WP_096410055.1|2264730_2265279_-|peptide-methionine-(S)-S-oxide-reductase-MsrA MRVSRSITVGGGCFWCIEGVFQQLPAVHQAISGYAGGESPDPSYREVCSGRTGHAEVVQVNFDPEQVEERALMELFFAIHDPTLHNRQGPDVGSQYRSIILYADKEQRQTAEAVIKEIGAAGEYSAPIVTELVPLTTFYPAEEMHQRYYEAAPEAPYCRSMIAPKIAKARERFPRLFDGWAN >NZ_AP017372.2|WP_096410056.1|2265315_2265723_-|YjbQ-family-protein MRKTITVTTHQREELVDITEPIRRAVAEAEVSDGLLALYVQGATAAIMIQENWDASVPRDAVNLLQQLVPRGVWEHDSQDGNGDSHLKAGLIGPSETIPIINSKMGLSTWQGIFLACEFDGPRRERTVVCTLIAM |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_9 | 2266356-2266887 | TypeIII |
NA
Consensus repeat of NZ_AP017372_9
|
7 spacers
spacers of NZ_AP017372_9
>9.1|2266388|39|NZ_AP017372|PILER-CR AGACCCTATAGTTCGTTGAAATATATCAATCGCTTCCCT >9.2|2266459|40|NZ_AP017372|PILER-CR AGACCTGGACCTTGACAGGATTTTGGTTGCCTACACTATC >9.3|2266531|40|NZ_AP017372|PILER-CR AGACGGCGTCCTTTTGGACGTAGGGCACCGCCTTCATCAC >9.4|2266603|39|NZ_AP017372|PILER-CR AGACGCGCACTCCGGGCTTATTTCGTCACGCTCGAGTTG >9.5|2266674|37|NZ_AP017372|PILER-CR AGACCCCACCAGTGCGTAAGAGTTGTCATACGCGGAT >9.6|2266743|37|NZ_AP017372|PILER-CR AGACAATTTTATGTTGAACCATTCCGTGTCCTGGTCC >9.7|2266812|40|NZ_AP017372|PILER-CR AGACAATTCCTTTCTGATAGCTGTTAACGATTCAACAGCC >9.8|2266392|35|NZ_AP017372|CRISPRCasFinder,CRT CCTATAGTTCGTTGAAATATATCAATCGCTTCCCT >9.9|2266463|36|NZ_AP017372|CRISPRCasFinder,CRT CTGGACCTTGACAGGATTTTGGTTGCCTACACTATC >9.10|2266535|36|NZ_AP017372|CRISPRCasFinder,CRT GGCGTCCTTTTGGACGTAGGGCACCGCCTTCATCAC >9.11|2266607|35|NZ_AP017372|CRISPRCasFinder,CRT GCGCACTCCGGGCTTATTTCGTCACGCTCGAGTTG >9.12|2266678|33|NZ_AP017372|CRISPRCasFinder,CRT CCCACCAGTGCGTAAGAGTTGTCATACGCGGAT >9.13|2266747|33|NZ_AP017372|CRISPRCasFinder,CRT AATTTTATGTTGAACCATTCCGTGTCCTGGTCC >9.14|2266816|36|NZ_AP017372|CRISPRCasFinder,CRT AATTCCTTTCTGATAGCTGTTAACGATTCAACAGCC |
csx16,csx1,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7 |
CRISPR arrays and Neighbor proteins around NZ_AP017372_9
The CRISPR arrays of NZ_AP017372_9 >merge|NZ_AP017372|9|2266356-2266887|PILER-CR,CRISPRCasFinder,CRT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCCTATAGTTCGTTGAAATATATCAATCGCTTCCCTGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCTGGACCTTGACAGGATTTTGGTTGCCTACACTATCGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACGGCGTCCTTTTGGACGTAGGGCACCGCCTTCATCACGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACGCGCACTCCGGGCTTATTTCGTCACGCTCGAGTTGGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCCCACCAGTGCGTAAGAGTTGTCATACGCGGATGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACAATTTTATGTTGAACCATTCCGTGTCCTGGTCCGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACAATTCCTTTCTGATAGCTGTTAACGATTCAACAGCCGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAGAGT >NZ_AP017372|9|6|2266356-2266883|PILER-CR GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACCCTATAGTTCGTTGAAATATATCAATCGCTTCCCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACCTGGACCTTGACAGGATTTTGGTTGCCTACACTATC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACGGCGTCCTTTTGGACGTAGGGCACCGCCTTCATCAC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACGCGCACTCCGGGCTTATTTCGTCACGCTCGAGTTG GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACCCCACCAGTGCGTAAGAGTTGTCATACGCGGAT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACAATTTTATGTTGAACCATTCCGTGTCCTGGTCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA AGACAATTCCTTTCTGATAGCTGTTAACGATTCAACAGCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTA >NZ_AP017372|9|8|2266356-2266887|CRISPRCasFinder GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCTATAGTTCGTTGAAATATATCAATCGCTTCCCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CTGGACCTTGACAGGATTTTGGTTGCCTACACTATC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GGCGTCCTTTTGGACGTAGGGCACCGCCTTCATCAC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GCGCACTCCGGGCTTATTTCGTCACGCTCGAGTTG GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCCACCAGTGCGTAAGAGTTGTCATACGCGGAT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC AATTTTATGTTGAACCATTCCGTGTCCTGGTCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC AATTCCTTTCTGATAGCTGTTAACGATTCAACAGCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAGAGT >NZ_AP017372|9|3|2266356-2266887|CRT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCTATAGTTCGTTGAAATATATCAATCGCTTCCCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CTGGACCTTGACAGGATTTTGGTTGCCTACACTATC 
GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GGCGTCCTTTTGGACGTAGGGCACCGCCTTCATCAC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GCGCACTCCGGGCTTATTTCGTCACGCTCGAGTTG GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCCACCAGTGCGTAAGAGTTGTCATACGCGGAT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC AATTTTATGTTGAACCATTCCGTGTCCTGGTCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC AATTCCTTTCTGATAGCTGTTAACGATTCAACAGCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAGAGT
>NZ_AP017372.2|WP_096410056.1|2265315_2265723_-|YjbQ-family-protein MRKTITVTTHQREELVDITEPIRRAVAEAEVSDGLLALYVQGATAAIMIQENWDASVPRDAVNLLQQLVPRGVWEHDSQDGNGDSHLKAGLIGPSETIPIINSKMGLSTWQGIFLACEFDGPRRERTVVCTLIAM >NZ_AP017372.2|WP_096410055.1|2264730_2265279_-|peptide-methionine-(S)-S-oxide-reductase-MsrA MRVSRSITVGGGCFWCIEGVFQQLPAVHQAISGYAGGESPDPSYREVCSGRTGHAEVVQVNFDPEQVEERALMELFFAIHDPTLHNRQGPDVGSQYRSIILYADKEQRQTAEAVIKEIGAAGEYSAPIVTELVPLTTFYPAEEMHQRYYEAAPEAPYCRSMIAPKIAKARERFPRLFDGWAN >NZ_AP017372.2|WP_162549507.1|2264182_2264623_-|DUF488-domain-containing-protein MHTLYTTGYERTDLDTFIVTLQRASVATVIDVRASPHSRRREFAFKHLARELPGAGIGYESWPVLGAPQAARDAAKVGDAQRFYQLYASHLEEPKPKDALHSLAERAVTEAVALLCYERDPAECHRLLIAERLERSHKLASHHLAG >NZ_AP017372.2|WP_096410053.1|2263122_2264055_+|acetoin-utilization-protein-AcuC MTGVPLRIATDKRLGAYHFGPGHPFGPGRMAAFLEALDELELAYEALPLAEADTATLTRFHAREYVERVQSLAGTGAPLDLGDTPAVPGIDGAAKRVVGTVAAAVDDLLAGRVRRAFVPIAGLHHGQRDRASGFCVYNDCGVALETLLAAGVAPVAYVDIDVHHGDGVYDSFETDPRVIFADIHQDGRTLFPGTGAAEAQGKGAAHGTKLNVPLPPGADDDAFVEAWERIEAHLERHQPKVIVMQCGADGLAGDPLASLRYTSKTHASAARRLRVLTERWAEGRLLALGGGGYDLSNIAAAWTAVVREIA >NZ_AP017372.2|WP_096410052.1|2261964_2262939_+|L,D-transpeptidase-family-protein MWLCSPTICTAAAAALLAPALAGPSTASAGPGDRERALHRYEAPAQVEVVGGVYHVPVKPDEALAEVAEREGVGVERLRAANPHTATENASERALRIPARHVLPDTPREGLVIDVAGMRLFHYPEETDAVEVFPISTGREGWPTPVAMKTEVAERLENPAWYPPESIRDSRAASDESGSLPRMVPPGAENPLGEHVLILEVDGYLVHGTNEPHSIGERTSHGCARMHPQDIEHLFERVKAGTPVRFVDQPFRIGRSARGEVWVESHPSAPDGSNPELDRRFVRALPDIAGEGVAINGARLIEAVNDQDGIAVRVSVGEEGAGKR >NZ_AP017372.2|WP_096410051.1|2261526_2261805_+|hypothetical-protein MLQRKLDSAHQERNQAEHERHRLEKDYASLQSQVDRHKERIEQLQAEVMQERERHQEAQELARYHQEQSQALMEVLRRGDGTGEEMEGGGYS >NZ_AP017372.2|WP_162549506.1|2261142_2261535_-|hypothetical-protein MQHFRFLLELIFELLGALLVAVSLVGDLIGALSEKGKNRTLGQLQRPYSPELARAGLKVTKQSGHKGSCKPKMTGSEFESVKKLLARGVPPNHVAKALGRCTAGRRRQRMLSVRFFPFSERAPTGVSNST >NZ_AP017372.2|WP_096410050.1|2259133_2261071_+|DUF262-domain-containing-protein 
MDNCVLTQVQAPAAILDEDIAFVIPSYQRPYVWPDDAVVKLFDDIFRAWQWDACSNYYIGTVLTAPISHIEGAAYELIDGQQRITTLMLIALAFRVTGQETALNPLAERGNAPRLTFAIREQVQALLGYWSGLDGYQYPGEDAVKTNPYLTRLDDALNVLKQLVGAIEKDRRIELAGYIHTNVQWVNNTMPGSMDLNRLFATMNTRGVQLEQSDILKSMLLRRISTDKSRYEAIWQACEQMDNYFERNVRQIFGGTDWSDMLPEHLRHFDPARFLLSEEQNLEAERPGSGLTISQLADSSYPDENEGSSALDLDDTVYCQSIIGFPLLLMHALRIYTARSGYADIGGRLHSERLIDSFSVLVDASETEVKEFVECLWEVRYQFDRWVVKWVERSDEDERQLRLTDINRSPSNGNWYLTRSVKKLFELVPLQSVRHFTGEHSAQYWLTPFLGLLCMETNPSENTVLDIMERIDNQLSLAEASQKQASFELLSGDVSAQRSVAAIIEYLKTPCGTKFEHYWFQKLEYLLWKRDHSQDEKVLNYRIVSRNSIEHVYAQNEEFKNEMKRDYLDAFGNLVLLNPSENSSYGYQSVNKKKADFKDRSHYNGSYDSLKLKEIFSLMGQGEWSPNLVEAHQEAMFELISCHYS >NZ_AP017372.2|WP_096410049.1|2257731_2259141_+|DUF262-domain-containing-protein MTVVVASCTVAKLFSGETFEASDGTLIEGNLHLPEYQRPYRWGEAQIRRLLEDLRRYFCPPHPGSSPAHLFYLGSIILHQDGEGRLNIIDGQQRLTTMALLMWQQAPGSEPKLRYESPLSHAQIRKNQEWLKQQENWNRAWLKLERINITVVVTRSEDDAYCFFETQNTGGVRLSGPDIIKAHHLRATPRSRQDRYARLWESLGDLNPVVDAVLKARSWNALNFRHVPSRREPLSVRETVVTELAENTGEGHADVAYGLTATSRTPDGAVVQVAHADGYAMRQPLNAGINSIHYLEYFESLRRILLTNHREPDLDSFHNFYQGLIVGRQGCSYLKKLYDSCLLLYASHFGRSQLFEASLRLFRVVYAPRVTNEKTVKEATASKFVRENPVFDWILMSYTHEQCMERLRLFEVKVSAKNLGQSDDGVKKRFVQAVNEWFSLELPKDRMAEQYDDALQKAIKSTLEVVNHG >NZ_AP017372.2|WP_162549505.1|2257333_2257504_-|hypothetical-protein MATPYEKLKSLANAEQYLKPGVTFKQLDEIAYAICDNEAARQLNEGSSQKTEKIVR >NZ_AP017372.2|WP_096410057.1|2267048_2267339_-|CRISPR-associated-protein-Csx16 MTTWFVSRHPGAAAWAERQGIEVDRFVEHLDWAAVERGDAVIGTLPVHIAAMICQRGAAYWHLSLELPLDMRGKELSEDDMELAGARIERFHVEKK >NZ_AP017372.2|WP_096410058.1|2267478_2268606_+|TIGR02584-family-CRISPR-associated-protein MATEGKNTLLCIAGLTPQVVTETLYAITIESQGALPDRLEIITTTEGRRRLLLTLLSKDGGHGYLDRFYQDYGLDRANLAFDESCVHVIHGLDGEPLADIVTEQDNCAAADLIHERIRQLTQQTQKLHVSIAGGRKTMGFYAGYSLSLYARPSDRLSHVLVNAPFESHPSFFYPPPQPLTLQLPGRNDIISTAEAQVRLADLPFVRLREELGEDLPYAGLSFSEAVERAQQVITPAQLALDLAERTANLQGQVIKLSPTHFVWLTWFADRARREKPPLRFDHEAAKELERYIDWLDGSNSPLHESLHSAREELESEGCSNYFERTRSRLNKALAERSGLPARAVARYQIHACSNRPQSTYALRLTPEQIRMVGEP 
>NZ_AP017372.2|WP_162549508.1|2268630_2268822_-|hypothetical-protein MLIHKMLKAYPPVVPQHQEGLPGMHVCLERLANMDPEGHNIHSIVLRMHEPILIGQMVAQQQL >NZ_AP017372.2|WP_162549509.1|2269236_2269980_+|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MVKWQARYPKAPHPFVLGLSLNSGGQVSAGEKLSLGVTLLGRATGTIPYWVHVLQAAGEQGLGPQRVPLALETVHQECGPGDGDWALVYLPGETFEPQPAQHPKPPPVPNRVRLRLHTPLRVRRGGRHVSAQELAFHDLFRTLLRRLSMLSQFHGPGPLEGDPRTLVEIARGIAWQKTDWRWHDWQRFSARQGRRVPMGGVIGEALLDGNDLVFIWSLLWFGQWVHASRGASMGLGRYEIISEDAIS >NZ_AP017372.2|WP_096410060.1|2269976_2272607_+|type-III-A-CRISPR-associated-protein-Cas10/Csm1 MSTEQIKTSWRTQDHVVLGALIHDIGKLFERGDLLDSYRNDEDMLQAYCPFQIRGRYFSHKHAVHTLAWAERLAERIPALDPEHLGIGTDHWLNLAARHHKPSSALEALIKRADDLASQERDPLGIDARFISRKVRLEPILERVTLEADPGRARTTETRVPLTPMEPGSPYFPEHAHKMDPPMNWDREKCAWVSQQDLGDAYARLGQDLLGQLEQLPTAEAPPSAIIGTLLTLLERYTAQVPSATNTAHPDISLFDHLRVTAAIAEGLYTYHQDQGDGLENVEKRDQTAKWALVCGDLSGIQRFIYRITSRGAARALRGRSLYLQLLTDGLASRMRRELDLHAPAQIYASGGKFFLLIPSTRVDQARQVAATINDELLAPFQGQLRLGLGTAHLAPNHFRAGHMGERWQATIDDLHRDRTRPWAGRMAHPPEKEDQDFFAPESPSEDGHCHACGRDDPPGDICDRGEGRRLCQQCNDLEQLGLAIRHASAITWHEPGTQRRGWELPGTGRVIRLPSREDPESLPLAAGDVLERLEGWPELAEARPGIAYSARFIGRWQESCGESELEELAASSQGIKRLGILRMDVDNLGQIFARGLRFGSTTETESSADMGSLSRTATLSRQLHWFFSTHLTRLLEQAEAPAQIMYAGGDDLFIVGAWHAMPELAVRIQHDLQRFASHNPVFSLSGGIELVGGRYPIGHAAELAGIQEEHAKGHRRSDKEGQTRDKCALAFLHTPVGWEQMEHVEGVREQLERFLEATGNRAVLGYLRRAVADMEGLQRRYAQGRWSDTELHALVEAQRWRWQLLSRLRRLRRRHQHHTEAVGAIDRLQEVFIEQQQPHKPPDPHLLGLPGRWVELKYRQSGNYLPDRGEEVHAP >NZ_AP017372.2|WP_096410061.1|2272603_2273053_+|type-III-A-CRISPR-associated-protein-Csm2 MNAANANHPRHKQQGPSGSDPATIRGFIEDDQADQLVATAERLGQDMGKQVSTSQVRNIFSSIKRLEMREQQHSPSGDAPLSPNVRRELLLLKPRLAYATARENRLKPLHDAVTTALDVVAQQGDQNALRRLSAFYEAIVAYHQYHGGK >NZ_AP017372.2|WP_096410062.1|2273055_2273859_+|type-III-A-CRISPR-associated-RAMP-protein-Csm3 
MTDVSQYATLQSKVFLRGELRAETGLHIGGSETGLGIGGADSVVVRDPLDHTPYVPGSSLRGKLRSLLERARGLEGANGNAEGGFALGKNNAGVPGRDPSTALAQLFGITADQNARGPSRLIVRDARLTPDSYQALMDAPGTDMPMTEVKTEVSIDRITSAAMPRQLERVPAGARFDFELVITVMVADDRQQWLNLILEGLDLLQDDTLGGNGSRGYGRISVDLRELLERDSQAYREGREAIPITDLDIPPALQGHPEATSPSTATA >NZ_AP017372.2|WP_096410063.1|2273873_2274869_+|type-III-A-CRISPR-associated-RAMP-protein-Csm4 MRCYRLHFRAPLHIDDRGTGYYEASDPFVRSDTVSAALLTTWGQLDPENATARAAKPPFRVSSAMPWLEGTPLLPRPVPHRAAPAPQGDPALAKVTKGVQWLSPRLWHRIWHEGWQQALHPDTVCTPQKEIALARDEASEPSPAWAQERRPRLNVDRITDGPVEGQLFEFGRIHFLPSAGLYLLAEHADETARQGFEAALSLLGDTGLGADRNAGNGQFTWEPAADFQERLGVRQTEPGESGVLVSLANPGLSERQWAGDERSAYDITTRGGWIANYGIRRARVRMLTEGSFLSVTIQGRVLDVTPRALASELPHPIYRDGRALMLRPEEG >NZ_AP017372.2|WP_096410064.1|2274869_2276369_+|hypothetical-protein MTDIRGARPETENVCIEVVTPLHIGDGESLIKDADFVQERPGHPFRVIDKAGLERRLAEQGGDEVEAYLAHQEMPGLQDLVTLAGGASHAPGYDLPPHEPGHAPASPEIRSTIKDAWLRPLLPGSALKGALRTAWIAQHLRDQVIQPRAQELNKPPRFAAARFLGRLTSAPAHAGSGRPGPNSDAFRVLRPRDAQAPRSALSWVDIRIAKSPRDGKVGWHVTTRSGRRQVDDWSQATALNAEALAPGTVLATQISWDGLLCANESAWRATGNEHIALPRGFCELRDVLIRHARHQIQREKKDLFAWELKAAYRTWQQLEQQLEQAIQQGGAPLRLGFGIGWLGMTGDWLSDETFHTVLAETHWKVKQPHRFPKTRRLVVERGQPQAPLGWVILWPADSGPPPTGQDPEEQKDREDPGDAGHPWVNTKIAELQKAHNSSLEEVLRGKKLAQACQLIDDLETRSEVLADIRRRWQERGWWNDPRGRAMKQARQIYGELTGE >NZ_AP017372.2|WP_162549510.1|2277575_2278202_+|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MLNPALRWMPTLLPVLHNLQLKRSKLKLQRISLVNTDGLAASGADNKINVTAEELLKINELATPQHPPHPPEFITIRVQQHPLRLRRKNRYVGSEQFDPGVFISALLRRASMLNSITSQATETDFRYLTQLGRSIGLNRSELHWFDWHRHSTPQDRRVPMGGLLGEFQLDSVPEEIWPWVWLGQWLHVGKGAVMGMGRYQLAEYAADN |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_10 | 2276804-2277052 | TypeIII |
NA
Consensus repeat of NZ_AP017372_10
|
3 spacers
spacers of NZ_AP017372_10
>10.1|2276841|33|NZ_AP017372|PILER-CR,CRT GTTTCCTGAGCGTTGGCTCCGCAGTCGCAGCAG >10.2|2276911|34|NZ_AP017372|PILER-CR,CRT CCCCGTGACATAGAACATTGCAATTGGTTGAGTA >10.3|2276982|34|NZ_AP017372|CRT GATCTGTATCTTGAATGGGCACTCCCACCCGACC |
csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,csx1,csx16,cas2,cas1 |
CRISPR arrays and Neighbor proteins around NZ_AP017372_10
The CRISPR arrays of NZ_AP017372_10 >merge|NZ_AP017372|10|2276804-2277052|PILER-CR,CRT AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACGTTTCCTGAGCGTTGGCTCCGCAGTCGCAGCAGAGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCCCCGTGACATAGAACATTGCAATTGGTTGAGTAAGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACGATCTGTATCTTGAATGGGCACTCCCACCCGACCAGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAAAC >NZ_AP017372|10|7|2276804-2276981|PILER-CR AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GTTTCCTGAGCGTTGGCTCCGCAGTCGCAGCAG AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCCCGTGACATAGAACATTGCAATTGGTTGAGTA AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC >NZ_AP017372|10|4|2276804-2277052|CRT AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GTTTCCTGAGCGTTGGCTCCGCAGTCGCAGCAG AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CCCCGTGACATAGAACATTGCAATTGGTTGAGTA AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GATCTGTATCTTGAATGGGCACTCCCACCCGACC AGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAAAC
>NZ_AP017372.2|WP_096410064.1|2274869_2276369_+|hypothetical-protein MTDIRGARPETENVCIEVVTPLHIGDGESLIKDADFVQERPGHPFRVIDKAGLERRLAEQGGDEVEAYLAHQEMPGLQDLVTLAGGASHAPGYDLPPHEPGHAPASPEIRSTIKDAWLRPLLPGSALKGALRTAWIAQHLRDQVIQPRAQELNKPPRFAAARFLGRLTSAPAHAGSGRPGPNSDAFRVLRPRDAQAPRSALSWVDIRIAKSPRDGKVGWHVTTRSGRRQVDDWSQATALNAEALAPGTVLATQISWDGLLCANESAWRATGNEHIALPRGFCELRDVLIRHARHQIQREKKDLFAWELKAAYRTWQQLEQQLEQAIQQGGAPLRLGFGIGWLGMTGDWLSDETFHTVLAETHWKVKQPHRFPKTRRLVVERGQPQAPLGWVILWPADSGPPPTGQDPEEQKDREDPGDAGHPWVNTKIAELQKAHNSSLEEVLRGKKLAQACQLIDDLETRSEVLADIRRRWQERGWWNDPRGRAMKQARQIYGELTGE >NZ_AP017372.2|WP_096410063.1|2273873_2274869_+|type-III-A-CRISPR-associated-RAMP-protein-Csm4 MRCYRLHFRAPLHIDDRGTGYYEASDPFVRSDTVSAALLTTWGQLDPENATARAAKPPFRVSSAMPWLEGTPLLPRPVPHRAAPAPQGDPALAKVTKGVQWLSPRLWHRIWHEGWQQALHPDTVCTPQKEIALARDEASEPSPAWAQERRPRLNVDRITDGPVEGQLFEFGRIHFLPSAGLYLLAEHADETARQGFEAALSLLGDTGLGADRNAGNGQFTWEPAADFQERLGVRQTEPGESGVLVSLANPGLSERQWAGDERSAYDITTRGGWIANYGIRRARVRMLTEGSFLSVTIQGRVLDVTPRALASELPHPIYRDGRALMLRPEEG >NZ_AP017372.2|WP_096410062.1|2273055_2273859_+|type-III-A-CRISPR-associated-RAMP-protein-Csm3 MTDVSQYATLQSKVFLRGELRAETGLHIGGSETGLGIGGADSVVVRDPLDHTPYVPGSSLRGKLRSLLERARGLEGANGNAEGGFALGKNNAGVPGRDPSTALAQLFGITADQNARGPSRLIVRDARLTPDSYQALMDAPGTDMPMTEVKTEVSIDRITSAAMPRQLERVPAGARFDFELVITVMVADDRQQWLNLILEGLDLLQDDTLGGNGSRGYGRISVDLRELLERDSQAYREGREAIPITDLDIPPALQGHPEATSPSTATA >NZ_AP017372.2|WP_096410061.1|2272603_2273053_+|type-III-A-CRISPR-associated-protein-Csm2 MNAANANHPRHKQQGPSGSDPATIRGFIEDDQADQLVATAERLGQDMGKQVSTSQVRNIFSSIKRLEMREQQHSPSGDAPLSPNVRRELLLLKPRLAYATARENRLKPLHDAVTTALDVVAQQGDQNALRRLSAFYEAIVAYHQYHGGK >NZ_AP017372.2|WP_096410060.1|2269976_2272607_+|type-III-A-CRISPR-associated-protein-Cas10/Csm1 
MSTEQIKTSWRTQDHVVLGALIHDIGKLFERGDLLDSYRNDEDMLQAYCPFQIRGRYFSHKHAVHTLAWAERLAERIPALDPEHLGIGTDHWLNLAARHHKPSSALEALIKRADDLASQERDPLGIDARFISRKVRLEPILERVTLEADPGRARTTETRVPLTPMEPGSPYFPEHAHKMDPPMNWDREKCAWVSQQDLGDAYARLGQDLLGQLEQLPTAEAPPSAIIGTLLTLLERYTAQVPSATNTAHPDISLFDHLRVTAAIAEGLYTYHQDQGDGLENVEKRDQTAKWALVCGDLSGIQRFIYRITSRGAARALRGRSLYLQLLTDGLASRMRRELDLHAPAQIYASGGKFFLLIPSTRVDQARQVAATINDELLAPFQGQLRLGLGTAHLAPNHFRAGHMGERWQATIDDLHRDRTRPWAGRMAHPPEKEDQDFFAPESPSEDGHCHACGRDDPPGDICDRGEGRRLCQQCNDLEQLGLAIRHASAITWHEPGTQRRGWELPGTGRVIRLPSREDPESLPLAAGDVLERLEGWPELAEARPGIAYSARFIGRWQESCGESELEELAASSQGIKRLGILRMDVDNLGQIFARGLRFGSTTETESSADMGSLSRTATLSRQLHWFFSTHLTRLLEQAEAPAQIMYAGGDDLFIVGAWHAMPELAVRIQHDLQRFASHNPVFSLSGGIELVGGRYPIGHAAELAGIQEEHAKGHRRSDKEGQTRDKCALAFLHTPVGWEQMEHVEGVREQLERFLEATGNRAVLGYLRRAVADMEGLQRRYAQGRWSDTELHALVEAQRWRWQLLSRLRRLRRRHQHHTEAVGAIDRLQEVFIEQQQPHKPPDPHLLGLPGRWVELKYRQSGNYLPDRGEEVHAP >NZ_AP017372.2|WP_162549509.1|2269236_2269980_+|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MVKWQARYPKAPHPFVLGLSLNSGGQVSAGEKLSLGVTLLGRATGTIPYWVHVLQAAGEQGLGPQRVPLALETVHQECGPGDGDWALVYLPGETFEPQPAQHPKPPPVPNRVRLRLHTPLRVRRGGRHVSAQELAFHDLFRTLLRRLSMLSQFHGPGPLEGDPRTLVEIARGIAWQKTDWRWHDWQRFSARQGRRVPMGGVIGEALLDGNDLVFIWSLLWFGQWVHASRGASMGLGRYEIISEDAIS >NZ_AP017372.2|WP_162549508.1|2268630_2268822_-|hypothetical-protein MLIHKMLKAYPPVVPQHQEGLPGMHVCLERLANMDPEGHNIHSIVLRMHEPILIGQMVAQQQL >NZ_AP017372.2|WP_096410058.1|2267478_2268606_+|TIGR02584-family-CRISPR-associated-protein MATEGKNTLLCIAGLTPQVVTETLYAITIESQGALPDRLEIITTTEGRRRLLLTLLSKDGGHGYLDRFYQDYGLDRANLAFDESCVHVIHGLDGEPLADIVTEQDNCAAADLIHERIRQLTQQTQKLHVSIAGGRKTMGFYAGYSLSLYARPSDRLSHVLVNAPFESHPSFFYPPPQPLTLQLPGRNDIISTAEAQVRLADLPFVRLREELGEDLPYAGLSFSEAVERAQQVITPAQLALDLAERTANLQGQVIKLSPTHFVWLTWFADRARREKPPLRFDHEAAKELERYIDWLDGSNSPLHESLHSAREELESEGCSNYFERTRSRLNKALAERSGLPARAVARYQIHACSNRPQSTYALRLTPEQIRMVGEP >NZ_AP017372.2|WP_096410057.1|2267048_2267339_-|CRISPR-associated-protein-Csx16 
MTTWFVSRHPGAAAWAERQGIEVDRFVEHLDWAAVERGDAVIGTLPVHIAAMICQRGAAYWHLSLELPLDMRGKELSEDDMELAGARIERFHVEKK >NZ_AP017372.2|WP_096410056.1|2265315_2265723_-|YjbQ-family-protein MRKTITVTTHQREELVDITEPIRRAVAEAEVSDGLLALYVQGATAAIMIQENWDASVPRDAVNLLQQLVPRGVWEHDSQDGNGDSHLKAGLIGPSETIPIINSKMGLSTWQGIFLACEFDGPRRERTVVCTLIAM >NZ_AP017372.2|WP_162549510.1|2277575_2278202_+|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MLNPALRWMPTLLPVLHNLQLKRSKLKLQRISLVNTDGLAASGADNKINVTAEELLKINELATPQHPPHPPEFITIRVQQHPLRLRRKNRYVGSEQFDPGVFISALLRRASMLNSITSQATETDFRYLTQLGRSIGLNRSELHWFDWHRHSTPQDRRVPMGGLLGEFQLDSVPEEIWPWVWLGQWLHVGKGAVMGMGRYQLAEYAADN >NZ_AP017372.2|WP_096410067.1|2278248_2279460_+|TIGR02221-family-CRISPR-associated-protein MHTLVSFIGRTRRPEQGYERIAYNFPDGAVQNGIAFIGNGVAQYTKPDRLVILGTSGSMWDQVIVDYPEVKLGEEKDLALSDSVDNQATTAEQLSEVAAALSETASFTVDLRLIPETPGMEQTWEILHTLVDATSGSDRLTIDITHGFRHLPMVAMMAALYRRTLDDSQSFSVDALWYAQLPPGAKEAEMHNIVGILALADWMEAIQHSRTTGDLSRVAELLREEAPEIAENLAQGSFKETIHQGTQARGPYRKARKTLSETTLPGPAGLFQPILEDQISWVDGQHLHVRQAAHARSALERKDYLRAALYGYEAFVTQLTREHHSIEQLDHHEKRKAASDAFAESCQGRSKEDPKCRAFHQLRQLRNALAHGDQPKHADVQAALHSPQALHKLLSEALDRLLP >NZ_AP017372.2|WP_096410068.1|2279594_2280182_-|NUDIX-domain-containing-protein MGKGKVLVVPRQDLPDSWLPHEGALRSTWGEVKEVISTAGTLWLERSQAEYDHAYKQLIAYVRLRDSQGSYAVYKRQGSEQRLHGLWSVGLGGHVDEGDCSAADDSDAAKALERAAYRELEEELNGFTPERLEFLGLINEEKTEVGLVHLGMVWEAVAGIDRPKPGAELGEMGWRSPGQLPEDELEYWSRLAMRL >NZ_AP017372.2|WP_096410069.1|2280181_2282353_-|hypothetical-protein 
MNTLVATLGTTWQVLPEIFAYTNPGAAPLYEHSSAAGDIQEERKDYGLRPVQSLWIITTEGGIEEWQNLCQWQQYLPEPIEMRCWYIKGIEELFTPGENRAMADLIYRVALHARLHTRRNNSCLYFALAGGRKTMSAELQQAAHLFGADALLHVVDRFAKQEREQFNSLSFQSLCQPLPKEFADSIRPLVTNGDLPGNEALLNSFQELDCEFEERFPLPGFGDQQRFSIESVPAQDDLHAWVKERQKRAEALLANYRLQVSAHEKLGNFRALYGLSPQTVEKLRQTRLGCDQHKQEQELAWLRRLPKAELHCHLGGILDSAGMIRVAEAMADDLAAEDRRNREFAIWRRDMETAIRSGNIPYLKSLLPGGSLSGKPLRDNFSVTQPLSVAALLYAFRDNPQLLDSLLYGVYQQPSQFTAVGIESYEELGNLQGSGLLQSEKTLRAAMAELGAICRRERIGYLELRCSPLNYVRGDLDKDDVVRILVEEAERIEDCDVRLLFIASRHRDPEQTKEHIELALHWFDNSKSFRERFVGFDLAGAEHAMQPAAMREYFLPLHERVVRMTIHAGEGERAENIWEAVYELSADRIGHGLTLKEYPDLIDRFRDRRIALEMCPSSNRQIRGYYHPHYSPQEERKYPLSSYLEAGLRVSINTDNPGISRTSLSEEFLTAAQMTPGGLSAWHILQIIRNGYQAAFCGQEQRRSHLIEAEKKIIEAVQDGAIE >NZ_AP017372.2|WP_096410070.1|2282532_2282814_+|CRISPR-associated-endonuclease-Cas2 MARKLFLAAYDVRCPQRLTKSVRVIKGYASGGQKSAYECWLTQAEQEELHLQMANVIDPRVDQFALLPLEPRKPLVTLGAAEEPADPDFFYFG >NZ_AP017372.2|WP_096410071.1|2282827_2283628_+|hypothetical-protein MSHAEPRTIYINADRVNVRHEDSALRVNRPGKAATYIPIVRIGRAVIRGCGGEELLGACLALARAGVVIHFQDGNGQQSAWLQPSGEPKNQPAQELAALIGEHTALGPYHWWRDAQRRHCWSMVFRHSPKGDFHSGCKRLEKYLRKLSPLHWIDHEIEALSRDLRSWLQAEIHRRGWNSVCRVLAAQGEDLESELYRCLYIPLLWRFVRWRRQQSLEISEYKRLEFVELQLANPIPRQLYRHLHALTEEYYVSWHKMSKNKVQADE >NZ_AP017372.2|WP_096410072.1|2283620_2283932_+|CRISPR-associated-endonuclease-Cas2 MSEHPVNHLVCYDIRDPRRLRRVHRKMKEWGTPLQYSVFYCRLVPSARQQLAEVLRHEIDERVDDVRIYALQNRAQGTYQGPAPLPVGLILPGLYLKEQFPGQ >NZ_AP017372.2|WP_096410073.1|2284042_2285029_+|CRISPR-associated-endonuclease-Cas1 MGTLYIDRRGTKLDYAHKALLIREPDKQPRSVPLNLLERLVVIGNVELTSNVLTNLGASGIGVTFMPARGQNRSSFMRSESHGDSTRRLGQYELATTQPNDPVWAIKLIRLRLASQHRLLHQALIHRPEQRQPIFCALEEIDRMRSHLRHSSQSLTLEQSRGYEGSATAAFFRGYTSLFPESLGFKSRNRRPPRDPVNAILSLGYALAHGDALRATMASGLDPAIGFLHQPAWGRDSLACDLTEIARSRVEQLTWHLFANRSLRAGDFSTDSDGEGVRLRKSARCNFFACWEAHAKLHRRWQKRAANTIASHCLHLGKSLNPGNSEYD >NZ_AP017372.2|WP_096410074.1|2285021_2285252_+|helix-turn-helix-transcriptional-regulator 
MTEFELLVRQQARSKKIPMAEVARRSRLSRQSLYNICNCTSHPKLQTFVDLAHALDISPMVLLEAYLQSADKEEQP >NZ_AP017372.2|WP_162549511.1|2286555_2286975_-|hypothetical-protein MSERVSQSTTEANAGQPPNFCKLFVHPKLGQILVLLDEGAEQGPEVRVCCRPSGVSVCTATYRYPDTPEGLLEAQGDFEAFDDEQAFEVARQMFVQMAAKGACDAHQQVADVVIIPDPAAMEVRELDLALANFSQQIIQ |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP017372_11 | 2285423-2285815 | TypeIII |
NA
Consensus repeat of NZ_AP017372_11
|
5 spacers
spacers of NZ_AP017372_11
>11.1|2285459|35|NZ_AP017372|PILER-CR,CRISPRCasFinder,CRT CTGTTCTGTTCCCCTTGGTTGAGTATCATTCTGCT >11.2|2285530|38|NZ_AP017372|PILER-CR,CRISPRCasFinder,CRT GTTTCGATCAGCGGGTCATCAAAACCCTTCCGGCGAAT >11.3|2285604|35|NZ_AP017372|PILER-CR,CRISPRCasFinder,CRT CGGCTTCCGTCTGAGCGACTTCCCGCTCCATCTCC >11.4|2285675|34|NZ_AP017372|PILER-CR,CRISPRCasFinder,CRT GCCATATCATCGTCAGTATCCTCCCTGAAGCCTA >11.5|2285745|35|NZ_AP017372|PILER-CR,CRISPRCasFinder,CRT TCCAACTCTTTGAGGACCTTGCTTGGGTTGAATTT |
cas1,cas2,csx1,cas6,csm5gr7 |
CRISPR arrays and Neighbor proteins around NZ_AP017372_11
The CRISPR arrays of NZ_AP017372_11 >merge|NZ_AP017372|11|2285423-2285815|PILER-CR,CRISPRCasFinder,CRT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCTGTTCTGTTCCCCTTGGTTGAGTATCATTCTGCTGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACGTTTCGATCAGCGGGTCATCAAAACCCTTCCGGCGAATGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACCGGCTTCCGTCTGAGCGACTTCCCGCTCCATCTCCGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACGCCATATCATCGTCAGTATCCTCCCTGAAGCCTAGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGACTCCAACTCTTTGAGGACCTTGCTTGGGTTGAATTTGTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC >NZ_AP017372|11|8|2285423-2285815|PILER-CR GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CTGTTCTGTTCCCCTTGGTTGAGTATCATTCTGCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GTTTCGATCAGCGGGTCATCAAAACCCTTCCGGCGAAT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CGGCTTCCGTCTGAGCGACTTCCCGCTCCATCTCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GCCATATCATCGTCAGTATCCTCCCTGAAGCCTA GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC TCCAACTCTTTGAGGACCTTGCTTGGGTTGAATTT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC >NZ_AP017372|11|9|2285423-2285815|CRISPRCasFinder GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CTGTTCTGTTCCCCTTGGTTGAGTATCATTCTGCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GTTTCGATCAGCGGGTCATCAAAACCCTTCCGGCGAAT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CGGCTTCCGTCTGAGCGACTTCCCGCTCCATCTCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GCCATATCATCGTCAGTATCCTCCCTGAAGCCTA GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC TCCAACTCTTTGAGGACCTTGCTTGGGTTGAATTT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC >NZ_AP017372|11|5|2285423-2285815|CRT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CTGTTCTGTTCCCCTTGGTTGAGTATCATTCTGCT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GTTTCGATCAGCGGGTCATCAAAACCCTTCCGGCGAAT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC CGGCTTCCGTCTGAGCGACTTCCCGCTCCATCTCC GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC GCCATATCATCGTCAGTATCCTCCCTGAAGCCTA GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC TCCAACTCTTTGAGGACCTTGCTTGGGTTGAATTT GTCTGAATCTGGCCCTGTTTGAGAAGGGATTAAGAC
>NZ_AP017372.2|WP_096410074.1|2285021_2285252_+|helix-turn-helix-transcriptional-regulator MTEFELLVRQQARSKKIPMAEVARRSRLSRQSLYNICNCTSHPKLQTFVDLAHALDISPMVLLEAYLQSADKEEQP >NZ_AP017372.2|WP_096410073.1|2284042_2285029_+|CRISPR-associated-endonuclease-Cas1 MGTLYIDRRGTKLDYAHKALLIREPDKQPRSVPLNLLERLVVIGNVELTSNVLTNLGASGIGVTFMPARGQNRSSFMRSESHGDSTRRLGQYELATTQPNDPVWAIKLIRLRLASQHRLLHQALIHRPEQRQPIFCALEEIDRMRSHLRHSSQSLTLEQSRGYEGSATAAFFRGYTSLFPESLGFKSRNRRPPRDPVNAILSLGYALAHGDALRATMASGLDPAIGFLHQPAWGRDSLACDLTEIARSRVEQLTWHLFANRSLRAGDFSTDSDGEGVRLRKSARCNFFACWEAHAKLHRRWQKRAANTIASHCLHLGKSLNPGNSEYD >NZ_AP017372.2|WP_096410072.1|2283620_2283932_+|CRISPR-associated-endonuclease-Cas2 MSEHPVNHLVCYDIRDPRRLRRVHRKMKEWGTPLQYSVFYCRLVPSARQQLAEVLRHEIDERVDDVRIYALQNRAQGTYQGPAPLPVGLILPGLYLKEQFPGQ >NZ_AP017372.2|WP_096410071.1|2282827_2283628_+|hypothetical-protein MSHAEPRTIYINADRVNVRHEDSALRVNRPGKAATYIPIVRIGRAVIRGCGGEELLGACLALARAGVVIHFQDGNGQQSAWLQPSGEPKNQPAQELAALIGEHTALGPYHWWRDAQRRHCWSMVFRHSPKGDFHSGCKRLEKYLRKLSPLHWIDHEIEALSRDLRSWLQAEIHRRGWNSVCRVLAAQGEDLESELYRCLYIPLLWRFVRWRRQQSLEISEYKRLEFVELQLANPIPRQLYRHLHALTEEYYVSWHKMSKNKVQADE >NZ_AP017372.2|WP_096410070.1|2282532_2282814_+|CRISPR-associated-endonuclease-Cas2 MARKLFLAAYDVRCPQRLTKSVRVIKGYASGGQKSAYECWLTQAEQEELHLQMANVIDPRVDQFALLPLEPRKPLVTLGAAEEPADPDFFYFG >NZ_AP017372.2|WP_096410069.1|2280181_2282353_-|hypothetical-protein 
MNTLVATLGTTWQVLPEIFAYTNPGAAPLYEHSSAAGDIQEERKDYGLRPVQSLWIITTEGGIEEWQNLCQWQQYLPEPIEMRCWYIKGIEELFTPGENRAMADLIYRVALHARLHTRRNNSCLYFALAGGRKTMSAELQQAAHLFGADALLHVVDRFAKQEREQFNSLSFQSLCQPLPKEFADSIRPLVTNGDLPGNEALLNSFQELDCEFEERFPLPGFGDQQRFSIESVPAQDDLHAWVKERQKRAEALLANYRLQVSAHEKLGNFRALYGLSPQTVEKLRQTRLGCDQHKQEQELAWLRRLPKAELHCHLGGILDSAGMIRVAEAMADDLAAEDRRNREFAIWRRDMETAIRSGNIPYLKSLLPGGSLSGKPLRDNFSVTQPLSVAALLYAFRDNPQLLDSLLYGVYQQPSQFTAVGIESYEELGNLQGSGLLQSEKTLRAAMAELGAICRRERIGYLELRCSPLNYVRGDLDKDDVVRILVEEAERIEDCDVRLLFIASRHRDPEQTKEHIELALHWFDNSKSFRERFVGFDLAGAEHAMQPAAMREYFLPLHERVVRMTIHAGEGERAENIWEAVYELSADRIGHGLTLKEYPDLIDRFRDRRIALEMCPSSNRQIRGYYHPHYSPQEERKYPLSSYLEAGLRVSINTDNPGISRTSLSEEFLTAAQMTPGGLSAWHILQIIRNGYQAAFCGQEQRRSHLIEAEKKIIEAVQDGAIE >NZ_AP017372.2|WP_096410068.1|2279594_2280182_-|NUDIX-domain-containing-protein MGKGKVLVVPRQDLPDSWLPHEGALRSTWGEVKEVISTAGTLWLERSQAEYDHAYKQLIAYVRLRDSQGSYAVYKRQGSEQRLHGLWSVGLGGHVDEGDCSAADDSDAAKALERAAYRELEEELNGFTPERLEFLGLINEEKTEVGLVHLGMVWEAVAGIDRPKPGAELGEMGWRSPGQLPEDELEYWSRLAMRL >NZ_AP017372.2|WP_096410067.1|2278248_2279460_+|TIGR02221-family-CRISPR-associated-protein MHTLVSFIGRTRRPEQGYERIAYNFPDGAVQNGIAFIGNGVAQYTKPDRLVILGTSGSMWDQVIVDYPEVKLGEEKDLALSDSVDNQATTAEQLSEVAAALSETASFTVDLRLIPETPGMEQTWEILHTLVDATSGSDRLTIDITHGFRHLPMVAMMAALYRRTLDDSQSFSVDALWYAQLPPGAKEAEMHNIVGILALADWMEAIQHSRTTGDLSRVAELLREEAPEIAENLAQGSFKETIHQGTQARGPYRKARKTLSETTLPGPAGLFQPILEDQISWVDGQHLHVRQAAHARSALERKDYLRAALYGYEAFVTQLTREHHSIEQLDHHEKRKAASDAFAESCQGRSKEDPKCRAFHQLRQLRNALAHGDQPKHADVQAALHSPQALHKLLSEALDRLLP >NZ_AP017372.2|WP_162549510.1|2277575_2278202_+|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MLNPALRWMPTLLPVLHNLQLKRSKLKLQRISLVNTDGLAASGADNKINVTAEELLKINELATPQHPPHPPEFITIRVQQHPLRLRRKNRYVGSEQFDPGVFISALLRRASMLNSITSQATETDFRYLTQLGRSIGLNRSELHWFDWHRHSTPQDRRVPMGGLLGEFQLDSVPEEIWPWVWLGQWLHVGKGAVMGMGRYQLAEYAADN >NZ_AP017372.2|WP_096410064.1|2274869_2276369_+|hypothetical-protein 
MTDIRGARPETENVCIEVVTPLHIGDGESLIKDADFVQERPGHPFRVIDKAGLERRLAEQGGDEVEAYLAHQEMPGLQDLVTLAGGASHAPGYDLPPHEPGHAPASPEIRSTIKDAWLRPLLPGSALKGALRTAWIAQHLRDQVIQPRAQELNKPPRFAAARFLGRLTSAPAHAGSGRPGPNSDAFRVLRPRDAQAPRSALSWVDIRIAKSPRDGKVGWHVTTRSGRRQVDDWSQATALNAEALAPGTVLATQISWDGLLCANESAWRATGNEHIALPRGFCELRDVLIRHARHQIQREKKDLFAWELKAAYRTWQQLEQQLEQAIQQGGAPLRLGFGIGWLGMTGDWLSDETFHTVLAETHWKVKQPHRFPKTRRLVVERGQPQAPLGWVILWPADSGPPPTGQDPEEQKDREDPGDAGHPWVNTKIAELQKAHNSSLEEVLRGKKLAQACQLIDDLETRSEVLADIRRRWQERGWWNDPRGRAMKQARQIYGELTGE >NZ_AP017372.2|WP_162549511.1|2286555_2286975_-|hypothetical-protein MSERVSQSTTEANAGQPPNFCKLFVHPKLGQILVLLDEGAEQGPEVRVCCRPSGVSVCTATYRYPDTPEGLLEAQGDFEAFDDEQAFEVARQMFVQMAAKGACDAHQQVADVVIIPDPAAMEVRELDLALANFSQQIIQ >NZ_AP017372.2|WP_096410076.1|2287123_2288041_-|cation-diffusion-facilitator-family-transporter MKEFLSRENTSLTVSVIIAALFAAAGIGLGLWMDSLMILFDGAYSLISLVLSMLALYVARLVRQPGNRHFPFGYAALEPLVIAVKGVTITLLCLVSLASALHALLTGGSQIDLDIAIAFTMIGLIACFSCTVYLRWSLARNESGLVAADFEQWRMDTVLSVAILLGFAAAYALERTAWADWAVYADPAMVALVAGYFIWVPLRMTSAAVRELVLAAPPAAMREEVLQATSDLGLPSEAVRMTKVGPYLVLELLVTPSEHTSPEALRFGLYRRLAHIEARPVVLMRASSGADGSWPWLDTPGERPW >NZ_AP017372.2|WP_096410077.1|2288135_2289080_+|LysR-family-transcriptional-regulator MRIEQIESVLAVVESGSVAAAARRLGQSRTTVSTAISALEDELGVTLFERSGNRLELSPVGGAILTDCRRLQQVADQIRSRCLHHLSGAESRLCIARDDALPESLWRELLRRLKERYPQTSVSVYVAPPQELPALVERQSVDVAYGLIPPSLSFGYHHLREIADVLMHTVAAAEHPLARMPRVTQDDLVLHTEVTLAYMGTSTLVAESPETANYLAFTQFEIMRDVVMEGSGWADLPLPLIAEPLNRGELRVIRHPEATWWMTLSALETDQAHGRPVVTWMGNALEACFTQWGLAEPAVSTPAVSADLNEQKEP >NZ_AP017372.2|WP_096410078.1|2289160_2289373_-|hypothetical-protein MNDRDYFAAHTYVSWQDAENVLRNAGNRKPSVDDVIAARARMRYAEADAMLALRAESSEVEGSLSDNSSG >NZ_AP017372.2|WP_096410079.1|2289512_2290355_-|hypothetical-protein 
MSALNWQAFIALTILAGFLLPLVFISYKKALQVGLGVVLFWLAWTLGLSALFVGYSLSGQLASFQLTLIVLVGLAGVATVYYYNRWKECSARLLEENEALRTSMRDLQDNFSAAAPAKGVRPTRLVRGTKQHRQELLQQVRSAQKRIIILSGWVTRYGFDNTMRRALRNAAKRGVKIYIGWGYKSRQEVAQSSGEATPAEKGLIELARNQKDSMTLAHFKNHSKLLLVDSACTIGSFNWLSNAFSVNDELSVIIEDPGFVEEMWTSVSKDIKRNAISEAL >NZ_AP017372.2|WP_096410080.1|2290580_2290940_-|hypothetical-protein MNQLGWADLQELLCRSQGQLDAVVFSGGEPTMQPAIFDAVKAVRDLGFKVVLHTAGSYPQLLQEALPWVDWVAMDIKGEWAHYPEVTGAANSAEKARESVEAVKASGVAYELRVLEGVG >NZ_AP017372.2|WP_162549512.1|2291015_2291165_-|hypothetical-protein MIETLHAGGVDNFDVLLRGSAGYPPQLYDAEHPLALLYCGGRKDLLASP >NZ_AP017372.2|WP_096410081.1|2291382_2292354_-|hypothetical-protein MKLIKFTPEKADVPWSGEYRVAQVNPRIRMVNLHELELEDCSGSLKVLYLAPMDEFGQPCPPWEGSVVRVTVTLTQALHGGWYNRVDKLEQIDEYSTLQLLPHRLCPVPGLLYRLYEVVNREITNPALRRFLERVFADQKRTRAFVSKPASVDCHHVEPGGLLKHSLQIVHGLDMLTWGHQNGVSRQCLLVSALLHDLGKVARDVLGMLPFQAREHASLNKLLLEKELSLLKEEDLEAWLLLHYMFSAIEGITDGNRVPGVGLLLALDRFSAAEDASRRAFESLPRYRQIASLKPSNGGPSRSFYRPRSEALQLGERCAGAVM >NZ_AP017372.2|WP_162549513.1|2292350_2292617_-|hypothetical-protein MTQIKGPGFSAYIVPAAVEEMAQVLLSAFASEDSDVLRAAVGAREGQPLQEALEDWLLLQLDLEDQAVAQILFRLFVDRLQRRVEVLT >NZ_AP017372.2|WP_096410083.1|2293315_2295238_+|threonine--tRNA-ligase MPNITLPDGSVKSFDNPPTIHEIATSIGSKLAKDAVAGRIDGELVDLTCTVDRDAQVEIVTAKDDDGLEIIRHSTAHLMAQAVKQLHPEMQVTIGPTVENGFYYDFAGEHSISEDQLEAIEQRMSELAEADQPVEREVWDRQAAKEFFLEQGETYKAQIIDELPEGEAVSVYRQGDFVDLCRGPHVPSTGKLKAFKLTKVAGAYWRGDQNNEMLQRLYGTAWGDRKQLKAYLQRLEEAEKRDHRRLARSLDLFHVQEESPGMVFWHPRGWQLYLTVESYIRDLMRNNGYHEVRTPMLVDRSLWERSGHWEMFASNMFVTESESRDYAVKPMNCPCHVEIYKQGLKSYRELPLRLAEFGSCHRNEPSGTLHGLMRVRGFVQDDAHIFCTEEQIQSEVRAFIDLVHTAYRHFGFNEVIIALSTRPDERVGDDAVWDKAEQALAQALEDHGLNYTVQPGEGAFYGPKIEFSLRDCLERVWQLGTIQVDFSMPGRLGAQYVDEDGERRTPVMLHRAILGSLERFIGILIEHYGGALPTWLAPVQVAVLNITDRQADYAQQIAASLREYGFRADVDLRNEKIGYKIREHTLQKVPYMLVLGDREMDTQTVAVRMRDGTDLGSMGYEELVARLQQDISHPGCNTED |
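The spacer records in the tables above appear to share a pipe-delimited header of the form `array.spacer|start|length|contig|tools`. The sketch below parses one such header under that assumption. The class name `SpacerHeader`, its field names, and the function `parse_spacer_header` are hypothetical and are not part of the database. The inclusive end coordinate is taken to be `start + length - 1`, which agrees with the `Spacer_region` values listed for spacer 6.1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpacerHeader:
    """Fields inferred from the header layout; the names are illustrative only."""
    array_number: int
    spacer_number: int
    start: int
    length: int
    contig: str
    tools: tuple

    @property
    def end(self) -> int:
        # Inclusive end coordinate, assuming start is the first spacer base.
        return self.start + self.length - 1


def parse_spacer_header(header: str) -> SpacerHeader:
    """Parse a header such as '>6.1|2199827|37|NZ_AP017372|CRISPRCasFinder'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-delimited fields, got {len(fields)}")
    spacer_id, start, length, contig, tools = fields
    array_number, spacer_number = spacer_id.split(".")
    return SpacerHeader(
        array_number=int(array_number),
        spacer_number=int(spacer_number),
        start=int(start),
        length=int(length),
        contig=contig,
        tools=tuple(tools.split(",")),
    )


if __name__ == "__main__":
    parsed = parse_spacer_header(">6.1|2199827|37|NZ_AP017372|CRISPRCasFinder")
    print(parsed, parsed.end)
```

The tool list is kept as a tuple because some spacers are reported by several predictors (for example `PILER-CR,CRISPRCasFinder,CRT`).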
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_AP017372_6 | 6.1|2199827|37|NZ_AP017372|CRISPRCasFinder | 2199827-2199863 | 37 | NZ_AP017372.2 | 478561-478597 | 0 | 1.0 |
NZ_AP017372_6 | 6.1|2199827|37|NZ_AP017372|CRISPRCasFinder | 2199827-2199863 | 37 | NZ_AP017372.2 | 871648-871684 | 2 | 0.946 |
1. spacer 6.1|2199827|37|NZ_AP017372|CRISPRCasFinder matches to position: 478561-478597, mismatch: 0, identity: 1.0
ggggcgggtggtggcgccggcgaagttttagaggtgc CRISPR spacer
ggggcgggtggtggcgccggcgaagttttagaggtgc Protospacer
*************************************
2. spacer 6.1|2199827|37|NZ_AP017372|CRISPRCasFinder matches to position: 871648-871684, mismatch: 2, identity: 0.946
ggggcgggtggtggcgccggcgaagttttagaggtgc CRISPR spacer
ggggcagatggtggcgccggcgaagttttagaggtgc Protospacer
*****.*.*****************************
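The `Mismatch` and `Identity` columns appear to follow from an ungapped, position-by-position comparison of the spacer with its protospacer, with identity equal to (length − mismatches) / length. The following is a minimal sketch under that assumption. The function name `compare_spacer` is hypothetical, and the database's own alignment procedure is not described here and may differ.

```python
def compare_spacer(spacer: str, protospacer: str):
    """Return (match_line, mismatches, identity) for two equal-length sequences.

    '*' marks a matching position and '.' a mismatch, mirroring the
    alignment lines shown in the report.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("ungapped comparison requires equal lengths")
    if not spacer:
        raise ValueError("sequences must not be empty")
    match_line = "".join(
        "*" if a == b else "."
        for a, b in zip(spacer.lower(), protospacer.lower())
    )
    mismatches = match_line.count(".")
    identity = (len(spacer) - mismatches) / len(spacer)
    return match_line, mismatches, identity


if __name__ == "__main__":
    spacer = "ggggcgggtggtggcgccggcgaagttttagaggtgc"
    proto = "ggggcagatggtggcgccggcgaagttttagaggtgc"
    line, mismatches, identity = compare_spacer(spacer, proto)
    print(line)
    print(mismatches, round(identity, 3))
```

For the second hit of spacer 6.1 this gives 2 mismatches over 37 positions, that is 35/37 ≈ 0.946, which matches the reported identity.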
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_AP017372_4 | 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760599-760630 | 32 | NZ_CP032687 | Rhizobium sp. CCGE531 plasmid pRCCGE531b, complete sequence | 159105-159136 | 7 | 0.781 |
NZ_AP017372_4 | 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760599-760630 | 32 | NZ_CP032692 | Rhizobium sp. CCGE532 plasmid pRCCGE532b, complete sequence | 425982-426013 | 7 | 0.781 |
NZ_AP017372_4 | 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760599-760630 | 32 | NC_020061 | Rhizobium tropici CIAT 899 plasmid pRtrCIAT899b, complete sequence | 159137-159168 | 7 | 0.781 |
NZ_AP017372_5 | 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT | 761987-762018 | 32 | CP000662 | Rhodobacter sphaeroides ATCC 17025 plasmid pRSPA01, complete sequence | 247056-247087 | 7 | 0.781 |
NZ_AP017372_5 | 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT | 761987-762018 | 32 | NZ_CP009112 | Rhodococcus opacus strain 1CP plasmid pR1CP1, complete sequence | 402167-402198 | 7 | 0.781 |
NZ_AP017372_5 | 5.9|762475|32|NZ_AP017372|CRISPRCasFinder,CRT | 762475-762506 | 32 | NZ_CP045721 | Pantoea eucalypti strain LMG 24197 plasmid unnamed1, complete sequence | 180184-180215 | 7 | 0.781 |
NZ_AP017372_5 | 5.9|762475|32|NZ_AP017372|CRISPRCasFinder,CRT | 762475-762506 | 32 | NZ_CP022517 | Pantoea vagans strain FBS135 plasmid pPant1, complete sequence | 206637-206668 | 7 | 0.781 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_KY349138 | Mycolicibacterium sp. CBMA 213 plasmid pCBMA213_2, complete sequence | 49120-49151 | 7 | 0.781 |
NZ_AP017372_5 | 5.29|763696|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763696-763727 | 32 | MK504443 | Lactobacillus phage 521B, complete genome | 31167-31198 | 7 | 0.781 |
NZ_AP017372_5 | 5.35|762474|33|NZ_AP017372|PILER-CR | 762474-762506 | 33 | NZ_CP045721 | Pantoea eucalypti strain LMG 24197 plasmid unnamed1, complete sequence | 180183-180215 | 7 | 0.788 |
NZ_AP017372_5 | 5.35|762474|33|NZ_AP017372|PILER-CR | 762474-762506 | 33 | NZ_CP022517 | Pantoea vagans strain FBS135 plasmid pPant1, complete sequence | 206637-206669 | 7 | 0.788 |
NZ_AP017372_4 | 4.3|760233|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760233-760264 | 32 | NZ_CP041653 | Streptomyces sp. RLB1-9 plasmid pRLB1-9.1, complete sequence | 122375-122406 | 8 | 0.75 |
NZ_AP017372_4 | 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760599-760630 | 32 | NZ_CP054028 | Rhizobium sp. JKLM19E plasmid pPR19E01, complete sequence | 1308792-1308823 | 8 | 0.75 |
NZ_AP017372_5 | 5.10|762536|32|NZ_AP017372|CRISPRCasFinder,CRT | 762536-762567 | 32 | MN586006 | Mycobacterium phage Bachome, complete genome | 46095-46126 | 8 | 0.75 |
NZ_AP017372_5 | 5.10|762536|32|NZ_AP017372|CRISPRCasFinder,CRT | 762536-762567 | 32 | NZ_CP021777 | UNVERIFIED_ORG: Enterobacter cloacae strain AR_0053 plasmid unitig_2, complete sequence | 15741-15772 | 8 | 0.75 |
NZ_AP017372_5 | 5.11|762597|32|NZ_AP017372|CRISPRCasFinder,CRT | 762597-762628 | 32 | NZ_CP054609 | Paenibacillus cellulosilyticus strain KACC 14175 plasmid unnamed1, complete sequence | 467359-467390 | 8 | 0.75 |
NZ_AP017372_5 | 5.11|762597|32|NZ_AP017372|CRISPRCasFinder,CRT | 762597-762628 | 32 | MK675901 | Shewanella phage S0112, complete genome | 29865-29896 | 8 | 0.75 |
NZ_AP017372_5 | 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763208-763239 | 32 | KX961385 | Bordetella virus LK3, complete genome | 10463-10494 | 8 | 0.75 |
NZ_AP017372_5 | 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763208-763239 | 32 | KY000220 | Bordetella phage FP1, complete genome | 46123-46154 | 8 | 0.75 |
NZ_AP017372_5 | 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763208-763239 | 32 | KY000221 | Bordetella phage CN1, complete genome | 45629-45660 | 8 | 0.75 |
NZ_AP017372_5 | 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763208-763239 | 32 | NC_047877 | Bordetella phage CN2, complete genome | 47895-47926 | 8 | 0.75 |
NZ_AP017372_5 | 5.22|763269|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763269-763300 | 32 | MG757154 | Streptomyces phage Bing, complete genome | 10964-10995 | 8 | 0.75 |
NZ_AP017372_5 | 5.24|763391|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763391-763422 | 32 | NZ_CP015043 | Rhodovulum sp. P5 plasmid pRGUI04, complete sequence | 45717-45748 | 8 | 0.75 |
NZ_AP017372_5 | 5.26|763513|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763513-763544 | 32 | NC_028795 | Enterobacter phage E-3, complete genome | 2696-2727 | 8 | 0.75 |
NZ_AP017372_5 | 5.26|763513|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763513-763544 | 32 | NC_016974 | Providencia stuartii plasmid pMR0211, complete sequence | 149759-149790 | 8 | 0.75 |
NZ_AP017372_5 | 5.37|762596|33|NZ_AP017372|PILER-CR | 762596-762628 | 33 | MK675901 | Shewanella phage S0112, complete genome | 29864-29896 | 8 | 0.758 |
NZ_AP017372_4 | 4.1|760111|32|NZ_AP017372|CRISPRCasFinder,CRT | 760111-760142 | 32 | NZ_CP016453 | Sphingobium sp. RAC03 plasmid pBSY17_1, complete sequence | 303921-303952 | 9 | 0.719 |
NZ_AP017372_4 | 4.2|760172|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760172-760203 | 32 | NZ_CP017076 | Novosphingobium resinovorum strain SA1 plasmid pSA1, complete sequence | 51753-51784 | 9 | 0.719 |
NZ_AP017372_4 | 4.8|760538|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760538-760569 | 32 | NZ_CP015203 | Rhodococcus sp. 008 plasmid pR8L1, complete sequence | 672277-672308 | 9 | 0.719 |
NZ_AP017372_5 | 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT | 761987-762018 | 32 | NC_013858 | Azospirillum sp. B510 plasmid pAB510d, complete sequence | 272189-272220 | 9 | 0.719 |
NZ_AP017372_5 | 5.12|762658|32|NZ_AP017372|CRISPRCasFinder,CRT | 762658-762689 | 32 | MN032972 | Leviviridae sp. isolate H2_Rhizo_Litter_7_scaffold_10692 sequence | 627-658 | 9 | 0.719 |
NZ_AP017372_5 | 5.12|762658|32|NZ_AP017372|CRISPRCasFinder,CRT | 762658-762689 | 32 | MN033187 | Leviviridae sp. isolate H2_Rhizo_Litter_49_scaffold_9067 RNA-dependent RNA polymerase (H2RhizoLitter499067_000001) gene, complete cds; and hypothetical protein (H2RhizoLitter499067_000002) gene, partial cds | 578-609 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_AP022319 | Burkholderia sp. THE68 plasmid BTHE68_p1, complete sequence | 1456123-1456154 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP026091 | Ralstonia solanacearum strain IBSBF 2570 plasmid unnamed, complete sequence | 1643147-1643178 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NC_014309 | Ralstonia solanacearum CFBP2957 plasmid RCFBPv3_mp, complete genome | 498224-498255 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP026093 | Ralstonia solanacearum strain SFC plasmid unnamed, complete sequence | 1643282-1643313 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP012940 | Ralstonia solanacearum strain UW163 plasmid unnamed, complete sequence | 377406-377437 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP012944 | Ralstonia solanacearum strain IBSBF1503 plasmid unnamed, complete sequence | 1523646-1523677 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NC_017575 | Ralstonia solanacearum Po82 megaplasmid, complete sequence | 1642871-1642902 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP026308 | Ralstonia solanacearum strain IBSBF 2571 plasmid unnamed, complete sequence | 1642817-1642848 | 9 | 0.719 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP051295 | Ralstonia solanacearum strain CIAT_078 plasmid megaplasmid, complete sequence | 375047-375078 | 9 | 0.719 |
NZ_AP017372_5 | 5.17|762963|33|NZ_AP017372|CRISPRCasFinder,CRT | 762963-762995 | 33 | JQ067087 | Pseudomonas phage PaMx11, complete genome | 14969-15001 | 9 | 0.727 |
NZ_AP017372_5 | 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763208-763239 | 32 | MN694560 | Marine virus AFVG_250M172, complete genome | 44240-44271 | 9 | 0.719 |
NZ_AP017372_5 | 5.27|763574|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763574-763605 | 32 | NC_009620 | Sinorhizobium medicae WSM419 plasmid pSMED01, complete sequence | 296578-296609 | 9 | 0.719 |
NZ_AP017372_5 | 5.36|762535|33|NZ_AP017372|PILER-CR | 762535-762567 | 33 | MN586006 | Mycobacterium phage Bachome, complete genome | 46094-46126 | 9 | 0.727 |
NZ_AP017372_5 | 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT | 761987-762018 | 32 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 805892-805923 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NC_019849 | Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence | 123441-123472 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP019586 | Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence | 1582839-1582870 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NC_017326 | Sinorhizobium meliloti SM11 plasmid pSmeSM11d, complete sequence | 123245-123276 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NC_017323 | Sinorhizobium meliloti BL225C plasmid pSINMEB02, complete sequence | 1358525-1358556 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP021828 | Sinorhizobium meliloti strain KH35c plasmid psymB, complete sequence | 1188349-1188380 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP021820 | Sinorhizobium meliloti strain M162 plasmid psymB, complete sequence | 461741-461772 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP021831 | Sinorhizobium meliloti strain HM006 plasmid psymB, complete sequence | 368399-368430 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP021814 | Sinorhizobium meliloti strain M270 plasmid psymB, complete sequence | 1689565-1689596 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP021795 | Sinorhizobium meliloti strain USDA1157 plasmid psymB, complete sequence | 846546-846577 | 10 | 0.688 |
NZ_AP017372_5 | 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT | 762414-762445 | 32 | NZ_CP021806 | Sinorhizobium meliloti strain T073 plasmid psymB, complete sequence | 898532-898563 | 10 | 0.688 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_AP022319 | Burkholderia sp. THE68 plasmid BTHE68_p1, complete sequence | 790797-790828 | 10 | 0.688 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP050100 | Rhizobium leguminosarum bv. trifolii strain 9B plasmid pRL9b3, complete sequence | 164525-164556 | 10 | 0.688 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP025017 | Rhizobium leguminosarum strain Norway plasmid pRLN5, complete sequence | 145251-145282 | 10 | 0.688 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP053443 | Rhizobium leguminosarum bv. trifolii strain CC275e plasmid pRltCC275eC, complete sequence | 127274-127305 | 10 | 0.688 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP044308 | Escherichia coli strain C27A plasmid pC27A-3, complete sequence | 91247-91278 | 10 | 0.688 |
NZ_AP017372_5 | 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT | 762780-762811 | 32 | NZ_CP018232 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed4, complete sequence | 146709-146740 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | MF399199 | Acinetobacter baumannii strain D46 plasmid pD46-4, complete sequence | 109511-109542 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | MF399199 | Acinetobacter baumannii strain D46 plasmid pD46-4, complete sequence | 176712-176743 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KT601170 | Staphylococcus sciuri strain wo28-3 plasmid pwo28-3, complete sequence | 7774-7805 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KX982169 | Staphylococcus sciuri strain wo27-9 plasmid pWo27-9, complete sequence | 51627-51658 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KX982171 | Staphylococcus sciuri strain wo28-1 plasmid pWo28-1, complete sequence | 55935-55966 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP040051 | Acinetobacter baumannii strain VB16141 plasmid unnamed1, complete sequence | 65937-65968 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP040051 | Acinetobacter baumannii strain VB16141 plasmid unnamed1, complete sequence | 104115-104146 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KX426227 | Acinetobacter lwoffii strain ED23-35 plasmid pALWED1.1, complete sequence | 135629-135660 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_AP014650 | Acinetobacter baumannii strain IOMTU433 plasmid pIOMTU433, complete sequence | 54973-55004 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_AP014650 | Acinetobacter baumannii strain IOMTU433 plasmid pIOMTU433, complete sequence | 93153-93184 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | CP033569 | Acinetobacter pittii strain 2014N21-145 plasmid p2014N21-145-1, complete sequence | 191947-191978 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP012007 | Acinetobacter baumannii strain Ab04-mff plasmid pAB04-1, complete sequence | 56063-56094 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | CP040054 | Acinetobacter baumannii strain VB35179 plasmid unnamed1, complete sequence | 52236-52267 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | CP040054 | Acinetobacter baumannii strain VB35179 plasmid unnamed1, complete sequence | 126675-126706 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP050433 | Acinetobacter baumannii strain PM194229 plasmid pPM194229_1, complete sequence | 118555-118586 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP050386 | Acinetobacter baumannii strain VB82 plasmid pVB82_1, complete sequence | 129578-129609 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KU744946 | Acinetobacter baumannii strain A297 (RUH875) plasmid pA297-3 clone Global clone 1 (GC1), complete sequence | 92231-92262 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KU744946 | Acinetobacter baumannii strain A297 (RUH875) plasmid pA297-3 clone Global clone 1 (GC1), complete sequence | 170225-170256 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP020596 | Acinetobacter baumannii strain HWBA8 plasmid pHWBA8_1, complete sequence | 154076-154107 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_CP040260 | Acinetobacter baumannii strain P7774 plasmid unnamed1, complete sequence | 135714-135745 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_KT779035 | Acinetobacter baumannii strain D4 plasmid pD4, complete sequence | 102224-102255 | 10 | 0.688 |
NZ_AP017372_5 | 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT | 762902-762933 | 32 | NZ_MK323043 | Acinetobacter baumannii strain Acb-45063 plasmid pAb45063_b, complete sequence | 3661-3692 | 10 | 0.688 |
NZ_AP017372_5 | 5.19|763086|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763086-763117 | 32 | NC_016626 | Burkholderia sp. YI23 plasmid byi_1p, complete sequence | 704027-704058 | 10 | 0.688 |
NZ_AP017372_5 | 5.24|763391|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 763391-763422 | 32 | NZ_CP044079 | Paracoccus yeei strain FDAARGOS_643 plasmid unnamed2, complete sequence | 64512-64543 | 10 | 0.688 |
NZ_AP017372_5 | 5.40|762779|33|NZ_AP017372|PILER-CR | 762779-762811 | 33 | NZ_AP022319 | Burkholderia sp. THE68 plasmid BTHE68_p1, complete sequence | 1456122-1456154 | 10 | 0.697 |
NZ_AP017372_4 | 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760416-760447 | 32 | MG065659 | UNVERIFIED: Campylobacter phage C5, complete genome | 37923-37954 | 11 | 0.656 |
NZ_AP017372_4 | 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760416-760447 | 32 | MG065655 | UNVERIFIED: Campylobacter phage C2, complete genome | 5920-5951 | 11 | 0.656 |
NZ_AP017372_4 | 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760416-760447 | 32 | MG065666 | UNVERIFIED: Campylobacter phage A12a, complete genome | 4290-4321 | 11 | 0.656 |
NZ_AP017372_4 | 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760416-760447 | 32 | KJ190158 | Escherichia phage vB_EcoM_FFH2, complete genome | 111492-111523 | 11 | 0.656 |
NZ_AP017372_4 | 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760416-760447 | 32 | MG065654 | UNVERIFIED: Campylobacter phage C15, complete genome | 4317-4348 | 11 | 0.656 |
NZ_AP017372_4 | 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR | 760599-760630 | 32 | NZ_CP022605 | Ochrobactrum quorumnocens strain A44 plasmid unnamed1, complete sequence | 589538-589569 | 11 | 0.656 |
NZ_AP017372_5 | 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT | 761987-762018 | 32 | NC_020062 | Rhizobium tropici CIAT 899 plasmid pRtrCIAT899c, complete sequence | 1330843-1330874 | 11 | 0.656 |
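
The last two columns of the table above appear to be related: in every row shown here, the identity value equals (spacer length - mismatches) / spacer length, rounded to three decimals (for example, 32 nt with 9 mismatches gives 0.719). The sketch below reproduces that arithmetic. It is an inference from the listed values, not a documented formula of the tool that generated this report; the function names `count_mismatches` and `identity` are hypothetical. It handles ungapped pairs only, so gapped alignments (those containing `-`) are out of scope.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count differing positions between two equal-length, ungapped sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length (ungapped only)")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer_length: int, mismatches: int) -> float:
    """Assumed relation: fraction of matching positions, rounded to 3 decimals."""
    return round((spacer_length - mismatches) / spacer_length, 3)


# Sequences copied from the first alignment entry for spacer 4.9 (NZ_CP032687 hit).
spacer = "caatcttagcactgtcaagatcgacggactgg"
protospacer = "cacgctcggcactatcaagatcgacggattgc"

mismatches = count_mismatches(spacer, protospacer)
print(mismatches, identity(len(spacer), mismatches))  # prints: 7 0.781
```

The same helper can be used to spot-check any table row, e.g. `identity(33, 8)` for the 33 nt PILER-CR spacer 5.37.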
1. spacer 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032687 (Rhizobium sp. CCGE531 plasmid pRCCGE531b, complete sequence) position: , mismatch: 7, identity: 0.781
caatcttagcactgtcaagatcgacggactgg CRISPR spacer cacgctcggcactatcaagatcgacggattgc Protospacer ** **..*****.**************.**
2. spacer 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032692 (Rhizobium sp. CCGE532 plasmid pRCCGE532b, complete sequence) position: , mismatch: 7, identity: 0.781
caatcttagcactgtcaagatcgacggactgg CRISPR spacer cacgctcggcactatcaagatcgacggattgc Protospacer ** **..*****.**************.**
3. spacer 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NC_020061 (Rhizobium tropici CIAT 899 plasmid pRtrCIAT899b, complete sequence) position: , mismatch: 7, identity: 0.781
caatcttagcactgtcaagatcgacggactgg CRISPR spacer cacgctcggcactatcaagatcgacggattgc Protospacer ** **..*****.**************.**
4. spacer 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT matches to CP000662 (Rhodobacter sphaeroides ATCC 17025 plasmid pRSPA01, complete sequence) position: , mismatch: 7, identity: 0.781
cggactcgacctcctccatcgagccgtaactc CRISPR spacer aggcctcgacctcctccatcgagcggaggcgc Protospacer ** ******************** * ..* *
5. spacer 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP009112 (Rhodococcus opacus strain 1CP plasmid pR1CP1, complete sequence) position: , mismatch: 7, identity: 0.781
cggactcgacctcctccatcgagccgtaactc CRISPR spacer ccggctcgacgtcctccatcgagacgtgaagc Protospacer * *.****** ************ ***.* *
6. spacer 5.9|762475|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP045721 (Pantoea eucalypti strain LMG 24197 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.781
agttgggtgctgagcttgtccctgcaatgctt CRISPR spacer actggtatgctgagcatggccctgcaatgctg Protospacer * * * .******** ** ************
7. spacer 5.9|762475|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP022517 (Pantoea vagans strain FBS135 plasmid pPant1, complete sequence) position: , mismatch: 7, identity: 0.781
agttgggtgctgagcttgtccctgcaatgctt CRISPR spacer actggtatgctgagcatggccctgcaatgctg Protospacer * * * .******** ** ************
8. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KY349138 (Mycolicibacterium sp. CBMA 213 plasmid pCBMA213_2, complete sequence) position: , mismatch: 7, identity: 0.781
ataaccggcggcggtgagccgtcagatgagtg- CRISPR spacer ctcggcggcggcgttgggccgtcagatg-gtgt Protospacer * . ******** **.*********** ***
9. spacer 5.29|763696|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MK504443 (Lactobacillus phage 521B, complete genome) position: , mismatch: 7, identity: 0.781
tcatcggtagtcattaaatctgctactcgtat CRISPR spacer tcgctggtaatcattaagtctgctactcccat Protospacer **...****.*******.********** .**
10. spacer 5.35|762474|33|NZ_AP017372|PILER-CR matches to NZ_CP045721 (Pantoea eucalypti strain LMG 24197 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.788
gagttgggtgctgagcttgtccctgcaatgctt CRISPR spacer gactggtatgctgagcatggccctgcaatgctg Protospacer ** * * .******** ** ************
11. spacer 5.35|762474|33|NZ_AP017372|PILER-CR matches to NZ_CP022517 (Pantoea vagans strain FBS135 plasmid pPant1, complete sequence) position: , mismatch: 7, identity: 0.788
gagttgggtgctgagcttgtccctgcaatgctt CRISPR spacer gactggtatgctgagcatggccctgcaatgctg Protospacer ** * * .******** ** ************
12. spacer 4.3|760233|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP041653 (Streptomyces sp. RLB1-9 plasmid pRLB1-9.1, complete sequence) position: , mismatch: 8, identity: 0.75
gtaagtaccccgacgcggagccgtcgcactac CRISPR spacer ccaagaaccccgacgcgaagccgtcgttcggc Protospacer .*** ***********.********. * .*
13. spacer 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP054028 (Rhizobium sp. JKLM19E plasmid pPR19E01, complete sequence) position: , mismatch: 8, identity: 0.75
caatcttagcactgtcaagatcgacggactgg CRISPR spacer cgcggttagcactgtgaagatggacggatcgg Protospacer *. ********** ***** ******..**
14. spacer 5.10|762536|32|NZ_AP017372|CRISPRCasFinder,CRT matches to MN586006 (Mycobacterium phage Bachome, complete genome) position: , mismatch: 8, identity: 0.75
cgttcagctgctcgcggacacgctcttcgtca CRISPR spacer tgttctcctgctcgcggacacgctcagcgatg Protospacer .**** ****************** ** ..
15. spacer 5.10|762536|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021777 (UNVERIFIED_ORG: Enterobacter cloacae strain AR_0053 plasmid unitig_2, complete sequence) position: , mismatch: 8, identity: 0.75
cgttcagctgctcgcggacacgctcttcgtca CRISPR spacer cgttcagctgctcgctgatacgcccgtgcaaa Protospacer *************** **.****.* * *
16. spacer 5.11|762597|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP054609 (Paenibacillus cellulosilyticus strain KACC 14175 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ctgcccatggaatatgagccggatc--gccattg CRISPR spacer ggacccattgaatatgagacggatcgggcgat-- Protospacer .***** ********* ****** ** **
17. spacer 5.11|762597|32|NZ_AP017372|CRISPRCasFinder,CRT matches to MK675901 (Shewanella phage S0112, complete genome) position: , mismatch: 8, identity: 0.75
ctgcccatggaatatgagccggatcgccattg CRISPR spacer cttccgatggaatatgagccggagagatgctg Protospacer ** ** ***************** * ...**
18. spacer 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to KX961385 (Bordetella virus LK3, complete genome) position: , mismatch: 8, identity: 0.75
ccctggcgcccggacgatgcccgtgtctatca CRISPR spacer gtcggtggcccggacgatgcccgtttcaatct Protospacer .* * ***************** ** ***
19. spacer 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to KY000220 (Bordetella phage FP1, complete genome) position: , mismatch: 8, identity: 0.75
ccctggcgcccggacgatgcccgtgtctatca CRISPR spacer gtcggtggcccggacgatgcccgtttcaatct Protospacer .* * ***************** ** ***
20. spacer 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to KY000221 (Bordetella phage CN1, complete genome) position: , mismatch: 8, identity: 0.75
ccctggcgcccggacgatgcccgtgtctatca CRISPR spacer gtcggtggcccggacgatgcccgtttcaatct Protospacer .* * ***************** ** ***
21. spacer 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NC_047877 (Bordetella phage CN2, complete genome) position: , mismatch: 8, identity: 0.75
ccctggcgcccggacgatgcccgtgtctatca CRISPR spacer gtcggtggcccggacgatgcccgtttcaatct Protospacer .* * ***************** ** ***
22. spacer 5.22|763269|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MG757154 (Streptomyces phage Bing, complete genome) position: , mismatch: 8, identity: 0.75
atgaaccgatcaccagccttgtcccacggcaa CRISPR spacer tagtaccgctcgccagccttgtcccacgtaag Protospacer * **** **.**************** *.
23. spacer 5.24|763391|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015043 (Rhodovulum sp. P5 plasmid pRGUI04, complete sequence) position: , mismatch: 8, identity: 0.75
gcgatggagctgtttggcgcgcgctactttag CRISPR spacer tgcatggggctgcttggcgcgcgctacatgaa Protospacer ****.****.************** * *.
24. spacer 5.26|763513|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NC_028795 (Enterobacter phage E-3, complete genome) position: , mismatch: 8, identity: 0.75
aaaggctggttaggtggcatcagagccattaa CRISPR spacer gtcagtcggataggtggcatcagagccatcaa Protospacer . .*..** *******************.**
25. spacer 5.26|763513|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NC_016974 (Providencia stuartii plasmid pMR0211, complete sequence) position: , mismatch: 8, identity: 0.75
aaaggctggttaggtggcatcagagccattaa CRISPR spacer tgaggctggttcggtggcatgagagctgatta Protospacer .********* ******** *****.. * *
26. spacer 5.37|762596|33|NZ_AP017372|PILER-CR matches to MK675901 (Shewanella phage S0112, complete genome) position: , mismatch: 8, identity: 0.758
gctgcccatggaatatgagccggatcgccattg CRISPR spacer gcttccgatggaatatgagccggagagatgctg Protospacer *** ** ***************** * ...**
27. spacer 4.1|760111|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP016453 (Sphingobium sp. RAC03 plasmid pBSY17_1, complete sequence) position: , mismatch: 9, identity: 0.719
cagcgacaattaaccggcattcctggcaaaat CRISPR spacer gtggtataattaaccggcattcatggccaagc Protospacer * *.*************** **** **..
28. spacer 4.2|760172|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017076 (Novosphingobium resinovorum strain SA1 plasmid pSA1, complete sequence) position: , mismatch: 9, identity: 0.719
ctccgacgctgctctcctcagcttcggcttgg CRISPR spacer tcgcctcgctgcgctcctcaccttcggctccg Protospacer .. * ****** ******* ********. *
29. spacer 4.8|760538|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015203 (Rhodococcus sp. 008 plasmid pR8L1, complete sequence) position: , mismatch: 9, identity: 0.719
agcggcacgtttaccatgcccgaagatgaaat CRISPR spacer agcggcacgtagaccatgcccgaacgccacgc Protospacer ********** ************ .. * ..
30. spacer 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_013858 (Azospirillum sp. B510 plasmid pAB510d, complete sequence) position: , mismatch: 9, identity: 0.719
cggactcgacctcctccatcgagccgtaactc CRISPR spacer cttcgtcgacctccagcatcgagccgtagcgg Protospacer * ********* ************.*
31. spacer 5.12|762658|32|NZ_AP017372|CRISPRCasFinder,CRT matches to MN032972 (Leviviridae sp. isolate H2_Rhizo_Litter_7_scaffold_10692 sequence) position: , mismatch: 9, identity: 0.719
tcccgcgtctataccgacaagagtttgggcgc CRISPR spacer cttcgtgtcaataccgacaagagtttctggac Protospacer ...**.*** **************** * .*
32. spacer 5.12|762658|32|NZ_AP017372|CRISPRCasFinder,CRT matches to MN033187 (Leviviridae sp. isolate H2_Rhizo_Litter_49_scaffold_9067 RNA-dependent RNA polymerase (H2RhizoLitter499067_000001) gene, complete cds; and hypothetical protein (H2RhizoLitter499067_000002) gene, partial cds) position: , mismatch: 9, identity: 0.719
tcccgcgtctataccgacaagagtttgggcgc CRISPR spacer cttcgtgtcaataccgacaagagtttctggac Protospacer ...**.*** **************** * .*
33. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_AP022319 (Burkholderia sp. THE68 plasmid BTHE68_p1, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer cacgcaggcggcggtgagccgtcaggtcaggt Protospacer .* *******************.* **
34. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP026091 (Ralstonia solanacearum strain IBSBF 2570 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
35. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_014309 (Ralstonia solanacearum CFBP2957 plasmid RCFBPv3_mp, complete genome) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
36. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP026093 (Ralstonia solanacearum strain SFC plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
37. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP012940 (Ralstonia solanacearum strain UW163 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
38. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP012944 (Ralstonia solanacearum strain IBSBF1503 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
39. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_017575 (Ralstonia solanacearum Po82 megaplasmid, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
40. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP026308 (Ralstonia solanacearum strain IBSBF 2571 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
41. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP051295 (Ralstonia solanacearum strain CIAT_078 plasmid megaplasmid, complete sequence) position: , mismatch: 9, identity: 0.719
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gagaccggcggcgctgagccgtcggacgtgct Protospacer . .********** *********.**.* *.
42. spacer 5.17|762963|33|NZ_AP017372|CRISPRCasFinder,CRT matches to JQ067087 (Pseudomonas phage PaMx11, complete genome) position: , mismatch: 9, identity: 0.727
ttcgccggtagaaagctgattttcaagcgcgac CRISPR spacer ggtgccggtaacaagctgattttcaagctgccc Protospacer .*******. **************** *
43. spacer 5.21|763208|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MN694560 (Marine virus AFVG_250M172, complete genome) position: , mismatch: 9, identity: 0.719
ccctggcgcccggacgatgcccgtgtctatca CRISPR spacer gggcagagcccgggcgatgcccgtgtccatga Protospacer ..* ******.*************.** *
44. spacer 5.27|763574|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NC_009620 (Sinorhizobium medicae WSM419 plasmid pSMED01, complete sequence) position: , mismatch: 9, identity: 0.719
tccggtaggggcataggacgtaaagcgaaccc CRISPR spacer gctgccgcgggcatgggacgtaacgcgaaccg Protospacer *.* .. ******.******** *******
45. spacer 5.36|762535|33|NZ_AP017372|PILER-CR matches to MN586006 (Mycobacterium phage Bachome, complete genome) position: , mismatch: 9, identity: 0.727
gcgttcagctgctcgcggacacgctcttcgtca CRISPR spacer ttgttctcctgctcgcggacacgctcagcgatg Protospacer .**** ****************** ** ..
46. spacer 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 10, identity: 0.688
cggactcgacctcctccatcgagccgtaactc CRISPR spacer gggacacgacctcctccatcgtgcgcaggcgg Protospacer **** *************** ** ..*
47. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_019849 (Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
48. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP019586 (Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
49. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_017326 (Sinorhizobium meliloti SM11 plasmid pSmeSM11d, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
50. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_017323 (Sinorhizobium meliloti BL225C plasmid pSINMEB02, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
51. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021828 (Sinorhizobium meliloti strain KH35c plasmid psymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
52. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021820 (Sinorhizobium meliloti strain M162 plasmid psymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
53. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021831 (Sinorhizobium meliloti strain HM006 plasmid psymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
54. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021814 (Sinorhizobium meliloti strain M270 plasmid psymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
55. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021795 (Sinorhizobium meliloti strain USDA1157 plasmid psymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
56. spacer 5.8|762414|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP021806 (Sinorhizobium meliloti strain T073 plasmid psymB, complete sequence) position: , mismatch: 10, identity: 0.688
gacgacgaaaccattcgcgctagcgaagaata CRISPR spacer cttggcgaaaccattcgcgcgatcgaagggcc Protospacer .*.*************** * *****...
57. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_AP022319 (Burkholderia sp. THE68 plasmid BTHE68_p1, complete sequence) position: , mismatch: 10, identity: 0.688
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer gacgccgggcgcggtgagccgtcagattcgcc Protospacer . .**** ***************** *.
58. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP050100 (Rhizobium leguminosarum bv. trifolii strain 9B plasmid pRL9b3, complete sequence) position: , mismatch: 10, identity: 0.688
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer ggtgccggcggcggtgaaccgacagattcgat Protospacer . .*************.*** ***** *
59. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP025017 (Rhizobium leguminosarum strain Norway plasmid pRLN5, complete sequence) position: , mismatch: 10, identity: 0.688
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer ggtgccggcggcggtgaaccggcagattcgat Protospacer . .*************.*** ***** *
60. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP053443 (Rhizobium leguminosarum bv. trifolii strain CC275e plasmid pRltCC275eC, complete sequence) position: , mismatch: 10, identity: 0.688
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer ggtgccggcggcggtgaaccggcagattcgat Protospacer . .*************.*** ***** *
61. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP044308 (Escherichia coli strain C27A plasmid pC27A-3, complete sequence) position: , mismatch: 10, identity: 0.688
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer actgacggcggcggggagccgccagatgtacc Protospacer *. . ********* ******.****** ..
62. spacer 5.14|762780|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP018232 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.688
ataaccggcggcggtgagccgtcagatgagtg CRISPR spacer ggtgccggcggcggtgaaccggcagattcgat Protospacer . .*************.*** ***** *
63. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to MF399199 (Acinetobacter baumannii strain D46 plasmid pD46-4, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
64. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to MF399199 (Acinetobacter baumannii strain D46 plasmid pD46-4, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
65. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KT601170 (Staphylococcus sciuri strain wo28-3 plasmid pwo28-3, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatacctgcaaaccaccaacctgcatacaac Protospacer * .***********.****** ****.
66. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KX982169 (Staphylococcus sciuri strain wo27-9 plasmid pWo27-9, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatacctgcaaaccaccaacctgcatacaac Protospacer * .***********.****** ****.
67. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KX982171 (Staphylococcus sciuri strain wo28-1 plasmid pWo28-1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatacctgcaaaccaccaacctgcatacaac Protospacer * .***********.****** ****.
68. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP040051 (Acinetobacter baumannii strain VB16141 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
69. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP040051 (Acinetobacter baumannii strain VB16141 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
70. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KX426227 (Acinetobacter lwoffii strain ED23-35 plasmid pALWED1.1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
71. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_AP014650 (Acinetobacter baumannii strain IOMTU433 plasmid pIOMTU433, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
72. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_AP014650 (Acinetobacter baumannii strain IOMTU433 plasmid pIOMTU433, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
73. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to CP033569 (Acinetobacter pittii strain 2014N21-145 plasmid p2014N21-145-1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
74. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP012007 (Acinetobacter baumannii strain Ab04-mff plasmid pAB04-1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
75. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to CP040054 (Acinetobacter baumannii strain VB35179 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
76. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to CP040054 (Acinetobacter baumannii strain VB35179 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
77. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP050433 (Acinetobacter baumannii strain PM194229 plasmid pPM194229_1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
78. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP050386 (Acinetobacter baumannii strain VB82 plasmid pVB82_1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
79. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KU744946 (Acinetobacter baumannii strain A297 (RUH875) plasmid pA297-3 clone Global clone 1 (GC1), complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
80. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KU744946 (Acinetobacter baumannii strain A297 (RUH875) plasmid pA297-3 clone Global clone 1 (GC1), complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
81. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP020596 (Acinetobacter baumannii strain HWBA8 plasmid pHWBA8_1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
82. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_CP040260 (Acinetobacter baumannii strain P7774 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
83. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_KT779035 (Acinetobacter baumannii strain D4 plasmid pD4, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
84. spacer 5.16|762902|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NZ_MK323043 (Acinetobacter baumannii strain Acb-45063 plasmid pAb45063_b, complete sequence) position: , mismatch: 10, identity: 0.688
gttgagttgcaaaccaccgacctgcctacaga CRISPR spacer caatagttgcaaaccacagacctaccttaaac Protospacer ************* *****.*** *.
85. spacer 5.19|763086|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NC_016626 (Burkholderia sp. YI23 plasmid byi_1p, complete sequence) position: , mismatch: 10, identity: 0.688
attacggcacggggcgatcagggaaacgggtc CRISPR spacer cgaacggcacgaggcgatcatggaaattaccc Protospacer ********.******** *****. . .*
86. spacer 5.24|763391|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP044079 (Paracoccus yeei strain FDAARGOS_643 plasmid unnamed2, complete sequence) position: , mismatch: 10, identity: 0.688
gcgatggagctgtttggcgcgcgctactttag CRISPR spacer cacatggtgctgtttggcccgcgctatcgcgg Protospacer **** ********** *******.. ..*
87. spacer 5.40|762779|33|NZ_AP017372|PILER-CR matches to NZ_AP022319 (Burkholderia sp. THE68 plasmid BTHE68_p1, complete sequence) position: , mismatch: 10, identity: 0.697
gataaccggcggcggtgagccgtcagatgagtg CRISPR spacer tcacgcaggcggcggtgagccgtcaggtcaggt Protospacer .* *******************.* **
88. spacer 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MG065659 (UNVERIFIED: Campylobacter phage C5, complete genome) position: , mismatch: 11, identity: 0.656
cgcgtctttagccgccgcctctgcgccttctt CRISPR spacer gattcgtttaaccgccgcctcagcgccttggg Protospacer .. . ****.********** *******
89. spacer 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MG065655 (UNVERIFIED: Campylobacter phage C2, complete genome) position: , mismatch: 11, identity: 0.656
cgcgtctttagccgccgcctctgcgccttctt CRISPR spacer gattcgtttaaccgccgcctcagcgccttggg Protospacer .. . ****.********** *******
90. spacer 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MG065666 (UNVERIFIED: Campylobacter phage A12a, complete genome) position: , mismatch: 11, identity: 0.656
cgcgtctttagccgccgcctctgcgccttctt CRISPR spacer gattcgtttaaccgccgcctcagcgccttggg Protospacer .. . ****.********** *******
91. spacer 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to KJ190158 (Escherichia phage vB_EcoM_FFH2, complete genome) position: , mismatch: 11, identity: 0.656
cgcgtctttagccgccgcctctgcgccttctt CRISPR spacer gattcgtttaaccgccgcctcagcgccttggg Protospacer .. . ****.********** *******
92. spacer 4.6|760416|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to MG065654 (UNVERIFIED: Campylobacter phage C15, complete genome) position: , mismatch: 11, identity: 0.656
cgcgtctttagccgccgcctctgcgccttctt CRISPR spacer gattcgtttaaccgccgcctcagcgccttggg Protospacer .. . ****.********** *******
93. spacer 4.9|760599|32|NZ_AP017372|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022605 (Ochrobactrum quorumnocens strain A44 plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.656
caatcttagcactgtcaagatcgacggactgg CRISPR spacer acatcttaacactttcaagatcgactcctgaa Protospacer ******.**** *********** . ..
94. spacer 5.1|761987|32|NZ_AP017372|CRISPRCasFinder,CRT matches to NC_020062 (Rhizobium tropici CIAT 899 plasmid pRtrCIAT899c, complete sequence) position: , mismatch: 11, identity: 0.656
cggactcgacctcctccatcgagccgtaactc CRISPR spacer tctcctcgaccttctccatcgaggcgttgaaa Protospacer . ********.********** *** .
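The identity values in the match records above appear consistent with an ungapped comparison: (spacer length minus mismatches) divided by spacer length, for example 22/32 for a 32-nt spacer with 10 mismatches. The sketch below is a minimal Python illustration under that assumption; the database's own scoring procedure is not stated here, and the function names `count_mismatches` and `identity` are hypothetical.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count positions where two equal-length sequences differ (ungapped)."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str) -> float:
    """Fraction of matching positions, assuming identity = matches / length."""
    matches = len(spacer) - count_mismatches(spacer, protospacer)
    return matches / len(spacer)


# Spacer 5.14 and the protospacer reported for NZ_CP053443 (record 60).
spacer = "ataaccggcggcggtgagccgtcagatgagtg"
protospacer = "ggtgccggcggcggtgaaccggcagattcgat"

print(count_mismatches(spacer, protospacer))
print(f"{identity(spacer, protospacer):.3f}")
```

Under the same assumption, 11 mismatches over 32 nt gives 21/32, and 10 mismatches over a 33-nt spacer gives 23/33, which agree with the 0.656 and 0.697 values listed above after rounding to three decimals.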
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
340689 : 389155
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP017372|340689:389155|DBSCAN-SWA TTTACCGGAAGACCTCTTCTATGCTGTTAGCGGTCAGTATATTGATAAGCCACTGCTGGATTGTCGGCTCATCAGCCTGCTCGACCCTAGGGCGGTATTGCTCCGCTGCTTCCTTACCAAACTTCTGTTCGAGTAGCAAAAGAAGGGATTTACCTTCACCAGCCAATAGGCCTTCTTGGCGACCTTCCAGCCGACCCTCCTGTTTCCACTCCCTCTCCCACTCATTCATACGTTCAGCTAGCATCTCATTTACCTCCTCAAGGTCCTGCACTTCTTCCCAATCGAAGAGTTTTGAATCAGCGAAGGGTTTACGGCGCAGCAGCACGCGCTGTATCCAGATGGCAAACTCTCGTCGGATAGGGCGCTGCTCAGGTTTAACCAGCCACTTGCTGAGGGTAGCGACGATCGAGCGCATATCATCCGGCGTGCGGCTGTGTTCAAGCCTGAACAGGGCGTGAACCAAGCTATGCAACTCGGGTGAGTTATCTTTGCTGAGAAGCGCTCCCTCATCGAGAAGCATATAGCGCATCTGAGGCCGGTAGGCAGACAGACCGCCGGGGATGCGCTCGATCAGAGAATCAATATCGGTGGCCGCCGACCAGCGCCGCTGTCCATTATATAGCACCACGGGCAGAACAGGTGGCAGCCTCCCATCACTGGTGAGTTTACCTTGCGCGATAAGATCCTGATAGAGCAGCCCCAGATAGGTCATCATGCGCACTGCCATGAACTGATCTATGTCGCTTTGAAACTCCAGCAGCAGGTAGACGTAAAGCCACTCTTTGCCCCAGCGTACGCGCCAAATCAGATCATCGTGGCGATCGCGATAGTCATCGGCAGCGTAGGAACCGTTCTGCTTCTCCAGGGTAGAGAAGTCGAGCTCTTTTACCCAATCCTCGCCGACGTACTCAACGAGCAGGTCCTTAATCATAACCGGCTGAGAGAAAAACCTCTTGTAGGCCGGGTCGTGATGATTGTTTGTCAAGTAAGCCTCCGCCGGAGCTGAGTAAATATAGAATCCTGCCTAGATAAATTGCACCATCGCGCCAACGTGGTCCACAGCCTCGCCGACTAATAGCTCCTACCTCCCCCACCTCGTGCCGATCGCTCCGCTATCTCGTTACTTCGCCAGCCAAGCCCGGCTGAAGCGCGAAATCCAGCGCTTCTGCCTCCAGCCTTAGTGGCCCTACCTCCCCTGATCTCGTGCCGATCGCTACGCTATCGCGTTACTTCGCCAAGCCCGGCTGAAGCGCGCATCCTTGCGCGCTCAGCCGGCGCTCGCATCCATGCGAGCGCCCTATGGGGCTCTTTGGCTGCGTAACGCTGCTTCGCTAAAGGCACGATGTCGGGGAGGTAGGGCCCCTAAGGCTGGAGGCTGTCAAAATTATCCAAGGTCAAGCGTGTCAACATCACCTCAGGTTGACCGCTTTTCACCGGTTTAATTTGTCCGGTTATCACAGCGCGGGTCTTAAATCTTAAAGACGCTCATGCCCGAGCAAGAAAATGTGTTATGCAAGGCCCAATAAAAACGAGGCGCCAGGGATGGCAGCTTCCCTGCCCACACATCCCTAGCGCGCCCATCCCATTGCCAGCATCCGTGCCCTGACTAGCGTAATCCTTCACGCCTCCATGTCTCGACATGCCCCCAGCCCCGCTGGCAATCCTTTGCCTTGACCTTAAAGCAGGGCGCACACATCCATGTGCTTTGGCCACTGCTAAGGGCTGCTATGCGCTTCCTGCGCGGCGCGGTGGGAGCGGCTTCCATGCCCCGAATGCAGGATCAATACTACTCCCAAATCGTAACCTGGCCAATGGGAAAAGTTCCCTATTTATATGGGAATTTTCTGAAATTTAGCAGCTAGTGGTTATCAGGATGTGAACTATATGGTGGCATAATGTGAATGGTGGACAGATTTATTACCGGA
GGGAGAAGTTGACCATGGCCAAGATTGCCCACGAACCAGTAAAACGCGCCATGAGCCGCATTCGTGAACTAAGTGCTGACGAGGAAGCCCGCCGGCTCGCTTTCGTCAGAGAACGGGCTCTACGCGACGAGGTTTCGCAGCTCAATGAGGCTAGACAAGAAGGTCGACAAGAGGGAATAAAAGAGGCTAGACAAGAAGGTCGACAAGAGGGAATAAAAGAGGGTCGGCAAGAAGGAATAAAAGAGGGACAACAAAGAGGACGACAAGAGGCTAAAGCCGAAACCGCCCGAAACCTTATAAAAACTAACGCGCTAACAGATCAGCAGATCGCTCAGGCAACCGGTTTAACACATGAGGAAATAGCTCAACTGCGTGCTGAGCGGCAGGGCTGACACTTATGTCGTACGCATATCATCCGGTTTGCGGCTGTGTTCAAGTTTTTTATCGGTTAATGGTGCCCCGGGCAGGATTCGAACCTGCGACCTATCGCTTAGGAGGCGATCGCTCTATCCGGCTGAGCTACCGGGGCAAGGTCGGTGGCGTGATGGTTTGGGGTAACCGCCGGTTTTAGCCGCCAACTGAAGCCCGGCCAGTCTAGCGGTTTTTTGTGCTATTCCGCAAGACAGGCTTGTAAAGTCGATAGGCAGGTCAGCTAAGGCAATATAATCACTGCGGGGACCTCTAAACCCCCCCTGTAAAAAACCCCCGAGTCTTGCCGAACCGGTGTAAAAGAGCCGTACAGCCATCCCTAGTGCTTGATGGCTCTATCCTCCCTGGAGCTTAGACCTTCACGCTGAGGCAGAAGGGGTCCCGAGTTATTCTTTAGAGAAGCCGCAGAGGCGCCCCCCGAGACCCGGAGGAGCCTCAGAGACGTCCCAAAGATGCCCCAAAGGCGTCATTGCGCTACCCATAGGCTTAGGGCTCTTCATCTTGATAAGGCACCGGCCTATAACCCAAGTGCATTAACCGAAATATCCCTCCTTCGATATTTATAACCGGGGCATCAACCTTTTGGGACAGCAGATGGGCCGCTTGCCGAGTGCGGTTACCGGTTCGGCAGATAAGCGCAATAGGGCGGAGTTCCTGATTTAAGTGAGGCTCTAGATGCTCTATAAAATCCTCGGTGTGGCGGTAGGTTATGGTCTTTGACCCCTCCACGATGCCAGTCTGCTGCCATTCGTAAGGCTCACGAATATCGACCAATACGCCGTTAAATTGCAATAGATCGCCTGGATCAGGGGCTACCGAGTAATAGTTGCTGTGTGCCGGATGCGGTAAGGCAAGAAAGGCTATGGTTAAGAGCAGAGCGATTGCTGGGCAGCGGTATTTTTGGTTGACATCCATCGAATTTAAGCTCACTTGCCTAACGTCAACGTCCTCAGACGGAAGAGTACAAAGCCGATAATCAGAGCTATGAAAGCGAATGTAATCAAGGTGCCAAAGGTGCCCAAATATTCTACCATCCTTTCCCAGTTAGCGCCGGCTGTATAGCCACCAAGGACTAATAGGGAGTTCCAAGCGGTACTGCCGAGAAAGGTAAAGAGCAGGAACTTTGGCATAGGCATGGGCAAGAGGCCTGCGGGAACCGAGACAACGCTGCGCACGGTAGGCAAGAAGCGGCCAAAAAAGACGATCAAGGCGCCGTGACGCGAAAAGACCTTATCCATTGCCGCAATGTCGCTCCTCTTGAATAGATACCAGCGGCCACCTAGAGTTAGCCACCAAATCATCCGTTCCCGGTCAAGCCAGCGGGCCGCATAGTAAATCAGCGTCGAGCCGAAGGTTGAGCCGAGACTGGCCGCTGCCAGCGCCGCTAAGGGGTGGTAATCCCCTTGCGAGGCGGCGTAACCGAGAAAAGGCATCAACGTCTCGGGGGCGATGAAGATCATGGCAATGAAGATGCCAATCAGCCCGAGGTGTTCAAGGATTTCTATGATTATCTCCGCTGCCATCCTGTAGCCCTGATTCTCTCGTACAAATCAACGTAGGCATCAGCTGAC
CGATCCCATGAGTAATCGGCGCTCATGCCGCGAGCCTGTATCTTTTGCCAAGCCTCTTGATCCTGCCAATAGCTGAGGGCGCGTTCCACGGTCGCGGCTAATGAGCCGGCATCGGCTGCCGCCGTGCAAAAGCCGTTGCCACCGGCGTGCGCATCTACATCGAGCACCGTATCAGCAAGTCCGCCGGTGGGGTTGACCAGCGGGGGAGTGCCGTAGCGCAGCGAGTAGAGCTGGTTGAGACCGCAGGGTTCGAAGCGAGAAGGCATTAGAAAGATATCACTCCCGGCCTCGATGCGGTGTGCCTGTCCCTCATCGTAACCGATGGATACGCCGACCCGACCAGGATGGGCTTGAGCTGCCTGGAGCAATGCTCGCTCCAGGGTGTTATCGCCAGAGCCGAGAATGGCCAGGCGAGCGCCGCTGGCTAACAGCCGGGGCAATGCGCCGAGGATCAAATCGATCCCCTTTTGTTCAACCAAGCGGCCGATAAAGCCGAGCAGGGGGGATTGGGGGTCATCGTCAAGGCCGATTTCTACCGCGATGGCCTGACGGTTTTTGGCCTTGGCTTGAGGGTCAGGGGCAGAGTAGTTGGCGGCTAGATGCGGGTCGGTAGCCGGATCCCACGTTGTGGTGTCAACGCCATTGATAATGCCGTGCAGATCACTGCTGCGCGAGCGCAGCAGGCCGTCGAGTCCCCAGCCGAATGCCGGGGTTTGAATCTCGCGGGCGTAGGTTGGGCTGACTGTGGTGATTGCATCACTGAGGGTTAGCGCGCCCTTGATAAAGGCCAACTCGCCGTAGAACTCCAGCCGTTCGGGGTTCCACATTGCAGCGGGCAACTCGAGCTGAGCAAATACGTCAGCCGAGAAAATGCCGCGGTAGGCCAGGTTGTGGATGCTAAAGACCGTCCGTGGGCGCTCTTGGTGGCTCTCGCTATAGTCCTGCAAAAATACCGGGGCTAGTGCGCTTTGCCAGTCGTTGGCGTGGAGGATATCGGCCTGCCAATCGAGCACCTCCCCGGCGGCGATGGCCGCCGCTACCCGCGACAACCAGTAGAAGCGCAAGTGGTTATCCGGCCACGGCTCCCCGCTGGCGGTGGCATATGGCGATCCAACCCGCTCGAAAAGCTCGGGATCGTCGAGCAGCCATACAGACACCTCGCTGCCCGGGAGCCGGCCTTCAATGATCTTTTGCTTGTGGTACCGGTTGGAGGATTGTGCCTGTCCCTTTGAGGGTGCCGGGGATTGTGCCTGTCCCTTTGAGGGGGTCGGTGCCTGTCCCTTTGGTTGTGCCTGTCCCCCTCCCGGTGCCTGTCCCTCTGATTGGGAGATTTGTGCCTGTCCCTCTGATTGTGCCTGTCCCTTTGTGGAGGATGTCTGTGCCTGTCCCTTTACACTCCCGGCGCCACTGCCGAGGGAGAGGTACTTGATCGGCCCTTCAAGGCGTTCGCGGGCGCTGGGGTAAGCTGGCATCAGAAGACGCACGTCGTGGCCGCGGTCGGCTAGCGCACAGGTGAGTCCGTAAGCGACGTCGCCCAGACCGCCGGTTTTAGCCAGTGGCCAGGCTTCTGCGGTGACAAAGAGAATTTTCAAGGCCTAGTCTCCCGTCCCCAGTGCTCCGTCATCATGGCTACTGCTTACCAGCACCACGGCTGCCAGAGGTGGCAAGGTCAGAGAGACAGAGTAGGGCTGACCGCTCCACGGGATCTCTTCAGCCATAACTCCGCCACCGTTGCCGAGGTTGGAGCCGCCGTAGCAGGCGGCGTCACTGTTTAATCGCTCTTGCCAAAAGCCCGGCTGAGGGACGCCAAGGCGGTATCCGAAGCGCGGAACCGGGGTGAAGTTGAGCGCGACTAGGGAGTAGTCATGGTGTTTTTCTTTGTGCCTGTCCCCTTCCTCACTGTGCCTGTCCCTCTCCAATCCATCTAAATCTCTGTGCCTGTCCCCCTGCAAATCCCCCTGATCATTGTGCCTGTCCCTTTGATCTCTGTGCCTGTCCCC
CTCGCTTCGGGCATAACGGTAGTAGCTGAGGACGGACTGGTCGCTGTCGTGGCAATCGGCCCAGGCGAAGCCGCGGTGGTCGAAGTCGTAGTGGTGGAGGGCTGGGCAGCTGGAGTAGAGGGCGTTGAGGTCGGCGATCAGCTGTTGGATGCCGCGGTGGTATTCGGAGCATAGTTCGTGCCAGTCGAGGCCTCGGGTCTCTGACCACTCGTGGCGTTGGGCGAACTCGCAGCCCATGAAGAGCAGCTTCTTGCCGGGGTAGGTGAACTGGTAGGCGTAGAGGAGGCGAAGGTTGGCGAAGGCCTGCCAGTCGTCGCCGGGCATGCGTTGGAGGAGGCTGCCCTTGCCGTGCACTACCTCGTCGTGGCTAAACGGCAGGACGAAGTTCTCGGTGAAGGCGTAGAGCATGCCGAAGGTTAGCTGATCGTGGTGATAGCGCCGGTGGACCGGATCCTCGCGCATGTAGGTCAGGGTGTCGTGCATCCAGCCCATGTTCCACTTCATGGTAAAACCTAGGCCGCCGAGATAGGTCGGGCGGGTAACCTGCGGCCATGAGGTCGACTCCTCGGCTAGTACACAGGTGCCGGGTTGTTCGCCGTGGGTGACGCTATTCATCTCGCGTAGAAACGAAATCGCCTCAAGATTCTCATTGCCGCCGTAGATGTTCGGGATCCAGTCGCCGTCCTCGCGCGAATAATCGAGATAGAGCATGGAGGCGACGGCATCGACCCGGAGCCCGTCGAGGTGGAAGGTCTCTAGCCAGTAGAGGGCGCTGGCGAGCAGGAACGAGCGCACCTCATTACGCCCGAAGTTGAAGATCAGCGTTCCCCAGTCGCGATGCTCACCCTGGCGCGGGTCGGCATGCTCATACAGCTGCGTGCCATCAAACCGGGCCAGGGCCCATTCGTCGCGCGGGAAATGGGCCGGCACCCAATCGAGGATAACGCCGATGCCGTTACAGTGGCAGTGATCGACGAAATAGCGAAAGTCATCAGGCGAGCCGAAACGGCTGGTAGGCGCAAAATAGCCGGTCGTCTGATACCCCCACGAACCGTCAAAGGGGTGCTCGGTGATCGGCAGCAGCTCGATGTGGGTAAAACCGAGGGCGCGGACGTGGTCGACTAGGCGGTGGGCGAGGGTGCGGTAGTCGAGGAACTGACCGTGTTCATCGAGTTGCCACGAACCTAGGTGAACCTCGTAGATAGACATTGGCTGGTGCTGCCAAAAATCCGGGTGCCGGCGGCGGTGCTCCATCCACTCCGCATCCTGCCAGCGATAGTTATCAGGCGGCTCGACAATGGCAGCCGTATTCGGGCGCAGCTCGAAACGCCGCCCGTAAGGATCGGTCTTGGTTAAGACCTCTCCGCTATCGCGATTGCGGATGGCGAACTTATACAGCGCCCCGGGTTCGACAGCCGGGATAAAAATTTCCCAGATCCCGGCCTCAGGGCGCACCTGCATCGGATGGCGAGCGGCGTCCCAGTCATTGAAATCGCCGATCACACTGACCCGATCAGCATTCGGAGCCCACACCGCAAAGCGCACCCCGCGGCTGCCCTGGTGCTCAACCTGGTGGGCACCGAGAAAATTCTGGGTATGCCAGTGACGCCCCTCGCTAAAGAGGTGCAGATCGAAATCGGCGAGCGTCGGGGCCTGCTCATCGCTGGAGCTAGGGGCAACAGCGCCGGGTGACGAAGGTGTTGTTGAGCGCGGCGAAGAGGTCGGTTTACTCACGGCGTTATCCAAATCCAAAAGGCTGTAACAAAAGTTGTGTTCCTAAACTACTAACATGGTAATAGAATTAACTGAAAGATACACACCAGTCCCCACAAGGAGTCTCCGGCCATGCAAGAGAACGCCTCGCCGCGCTACGTTAGCCGCCTGACGCGGAACACCTTAGCCCTGATTCTCGCCGGTGGTCGCGGGACTCGCCTCAAACATCTCACCCAATGGCGCGCCAAACCGGCGGTGCCATTTGGCGGCAAGTTTCGCATAATCGA
CTTTCCGCTCTCCAACTGCGTTAACTCCGGGATTCGTCGCATCGGAGTGCTGACACAATATAAGGCCCACTCGCTGATTCGCCACATTCGCCAGGGCTGGAGCTCGCTGCGCGCCGATTTCAGCGAGTTCGTCGAACTTTTACCGGCCCAGCAGCGTATCGAGACCTCTTGGTATTTGGGCACCGCCGATGCCGTCTACCAGAGCCTGGACATCGTGCGCATGCACAACCCGGAGCTAGTGCTGATCCTCGCCGGTGACCACGTCTATAAGATGGACTACGGTCCGCTGCTCGCCTACCACGTCGAGAAGGGCGCCGATGTCACCGTCGGCTGCATCGAGGTGCCACTGGACGAGGCCAGCGCCTTCGGGTTGATGAACATCAACGAAGATAATCAGGTGGTGCGTTTCGAAGAGAAGCCGGCCGACCCCACCCCGATGCCCGGCTCGCAGACCCACTCCCTGGCCTCAATGGGCATCTACGTCTTCAATCGCGAGTTCATGTTCAAGGCGCTGGGAGTGGATGCGCGCACCAGCTCCGAGCACGACTTTGGTAAAGACATCATCCCCTCGCTCATCGACAAGGCCCAGGTCTACGCCTATCCTTTCCGTGACCCGGCTACCGGTGATCAGTCATACTGGCGCGATGTCGGAACAGTGGACGCCTTCTGGCGGGCCAACCTGGAGCTCGTCGAAGTCACCCCGGAGCTCAACCTCTGCGACCGCGAGTGGCCTATCTGGACGTTCCAAGAGCAGTTGCCGCCGGCCAAATTCGTCTTCGACGAGGATCAACGCCGCGGCATGGTCGTCGACTCCATGGTCTCGGGTGGCTGTATCGTCGCCGGGGCCTACCTGCGCCGCTCAGTGCTCTTCTCCTCAGTGGTGGTTGATGAGCGGACCAAGGTGCAGGATTCGGTGATCCTCCCTGAGGCGCGCATCGAGCCGGGCTGTCGTATCAGCAACGCGGTTATCGACAAGCACTGCCGCATCGAGGCCGGCACCGTTATTGGTGAGGATCCGGAGGAGGATGCCCGGCGTTTCCATGTTACTGATTCTGGGGTGGTTCTGGTTACCCCAGATATGCTTGGTCAGGAGATCCATGTTGTCTATTGAAAAATCATCGACCACCGGGCCGTCCCCGGACAAGGTACGGGTTGTACTGTGCTGGCACATGCACCAGCCGAGCTACGTCAATCCGGCTAGCGGCGATTACGAATTGCCGTGGACGTACCTTCACGGCATTAAAGACTATACCGACATGGCCGCACACCTTGAGGCCAATCCCCAGGCTCGCGCCGTGGTCAACTTCTCGCCGATCCTGATTGAACAGATTGAGGACTACGCCGAGCAGATCAAGGGTTTCCTGGCGAGCGGCGAGCGCTTGCGGGATCCGCTGCTGAACGCGCTCGCGCAGCCGGTGATTTCCGCCGACCCTGAGCATCGGCGCAGCATCCTCGAGCAGTGCCGGCGCATTAACCGCCCGCGACTAGTCGACCCCTATCCGCAGTATCGGCAGCTGATGGAGTTTGCCGATTTGCTCGACCAGCAGCCGACTATGCTGCGCTATCTCGATGAGAGCTTCCACGAAGATCTGGTCACTTGGTACCACCTGGCCTGGCTGGGAGAGACGGTGCGCGGCAGTGAGCCGCTGGCCAAGCGGCTGATCGAAAAGGGGCATGGATACTCGGTCCACGAGCGGCGCGAGCTCCTGGCCCTGATTGGCGAGCAGCTCTCGGGACTCTTGCCGCGCTATCGTAAGCTGGCTGAGCAGGGGCGGGTTGAGCTGTCGATGACCCCCTACGGCCACCCCATCTTGCCGCTGCTCCAGGACCTCCAATCAGCCCTAGAGGCGTGGCCGGATGCGCCGATGCCGGAACAGGTTACGGCCTATCCTGGGGGCGAGGAGCGGGCCCGCTGGCACCTCGAGCACGGGCGGGAGGTCTTCGAACGCGCCTTTGGCCAGGCGCCGCACGGGTGCTGGCCGTCGGAAGGGGCGTTGAGC
GAGCCGACCGTGCGGTTGCTGAGTGAGTGCGGCTTCAAGTGGGCGGCTAGCGGCAGCGGGGTGCTGGAAAATAGCCTCAATGGCAATGGCGTCGAGGAGCAGCAGCGTAATGGCCACTGGCACCGCGCCTACATCTTCCAGGGCGAGGCTAGCGGGGCTGGCGAGAACAGCGTCGAGCCGACGCGGTGCTTTTTCCGCGACGACGGGCTATCCGATGCTATCGGCTTCGTCTACTCCGACTGGCACGGGGATGACGCCGTCGCCAATCTCGTCGTCAAGCTCGAGGAAATCGCTGTTGCTAGCAAAGATCCCGGCAATACGGTGATCAGTATAATCATGGATGGCGAGAACGCCTGGGAGCACTACCCGGCGAACGGCTACTACTTCCTCTCCGGTCTTTACGAAAAGTTGAGCGAACATCCCCGGCTCCACTTGACCACCTTCGCTGAGGCGATTGAGCAGGTAGAGCCGATCGCCCTGGACAGGCTGGTCGCTGGCAGCTGGGTCTACGGCACCCTCTCGACCTGGATCGGTGAGGTCGACAAGAACCGCGCTTGGGAGCTGCTCGTCGCCGCCAAGCAGGCCTACGACAGCCAAATCGATAAACTCGAGGGTCCCGCCCGGGATCGCGCCGAGCGGCAGCTGGCAATCTGCGAAAGCTCGGACTGGTTCTGGTGGTTCGGTGACTACAACCCGCCCGACGTGGTGCGCGACTTCGACCACCTCTTTCGCATCCAACTCGCCGCGCTCTACCAGTGTCTCGGCTTAGAGCCGCCCCAAGAGCTCGATCACCGCTTCACCCACATCGGCACCGGTAGCCCGCAGATGGGCGGGGTCATGCGTCAAGGTCGTCTCGAATCATGAGTGGGCGTGGCCTAACAGAGCAACGACGGGCTGGCGTACTTGCACACTTGAGCTCCTTGCCGGGTGGGCCGGGCAACGGCGACCTAGGCGCACACTCGCGCTACTTCGTAGATTGGCTAGCCAATTGCGGCTTCAGCGTCTGGCAGATGCTACCGCTGGGACCGACCCACGAAGACCTCTGCCCCTACCAGTGCCTGTCCGTCCATGCCGCCGACCCGGGCTTCATCGACCTGCAACAACTCGTCGAAGCCGGCTACCTCAGCGCCGAGCAAGCCATACCGCCGACCGACCTCAGCCGCAGCGAACTCCTCAACTGGCGCTACCAACGCCTGCGCGACGCCAGAGCCGGCTTCGTCGCCCGCCACGGCCAGAACGGAAAGGGACAGGCACAGGGAGAAGAGGGACAGGCACCGCCACCACCACCGCCACCAGAAACAAACTCCGAGCTACGCGAGCTCCGGCAGTTTCGTGCCTGTCACTCTCACTGGTTGGAGGATTACGCCTTGTACATGGCGCTGCGCCGGGAGAACGAGTTCAGGCCCTGGTGGGAGTGGCCGCAGCCGTTGCGTGATCGGCAGCCGCAGGCGTTGGAGGAGGCGCGCGAGCGCCTGGGGGAGGAGTTAAATCAGGTCGTATTCGAGCAATTCATCTTTTTCCGCCAGTGGGCAGCGCTACGCGCCTATGCGGCAGAAAAGGGAGTGCTGCTATTCGGCGACATGCCGATCTTCGTCGCCCACGATAGCGCCGAGGTATGGGCGCAGCGCGAGTACTTCGACCTTGGCGCAGATGGCCAACCGCTGAGTGTTGCCGGCGTGCCGCCCGACTACTTCGCCGCCGATGGCCAGCGCTGGGGCAACCCCCACTACAACTGGCAGCGGATGGCAGAAGACGGCTTCAAGTGGTGGCTGCAGCGCCTGGAGACCCAGCTCGAACTATTCGACTTCGTGCGCCTCGACCACTTCCGCGGCCTAGCCGCCTACTGGTCAATCCCGGTTGAGGCAGAGACCGCCCGCGATGGCCACTGGGAACCGGCCCCAGGCCACGACCTACTTAGCGCAGTGGCCCAGCGCTTTGGCCAGATCCCGCTGGTTGCCGAGGACCTGGGCATCATCACCGACGATGTCGTGGCCCTGCGC
GAGCAATTTGCCCTGCCGGGAATGAAAGTCCTGCAGTTCGCCTTCGACAGCGACTCGGCAAACCCCTACCTGCCGCACAACCACACAGCAGATAGCGTGGTCTACACCGGCACCCACGATAACGACACAACCATGGGCTGGTACGCCGATCTGGAGCCGTGGGTCACCGAGCGCATGCACTCATACCTCGGCCACCCCAACGAGCCGATGCCGTGGCCACTAGTGCGCGCCTCACTGGCCTCAGTTTCCGGCCTGGCCATCCTGCCGCTGCAAGACCTGCTCGCCCTGGGATCGGATCACCGCATGAACATCCCCGGCGTGGCCGAAGGCAACTGGCGTTGGCGCTTCGAGTGGGAGTGGTTACCGGATGACCTCAGCGGTTGGCTCTGGGAACTAAATTATCTTTATGGCCGCGTTTGAGTTGATGTGGCTTAAATCTTGGTTGATTCGCCCAGGATGCCAGTTTGATATGTCGCTAACTTGCAACAGGCAACAGGCGCCTAGAACAGCTCCCCATTGGGAGCCTCAATAAATTCTCCCCGGAGCCATATCCCCCTGGCCATATCCCCGGAGCCATTATCTGCCCCGGCGTGGAGGCTTCATGGCGCCAGGGATGGCGCCATGAAGCCTCCATGAAGCCTCCAGGGATGGATTCACGGCGTCCTCCACACCGGGGCAGATAATGGCTCCGGGGATATGGCCAGGGGGGATTTTTCAGAGGCAACCCCCGGGCGCCGCGCACTCCCAAGATAAAGCAGCTGCTTGAAAATGCCGGGCGTCTCACAACCATCGCAAACCACCAAACTCAAAAGGGTGGCTGGCCAACGGATGGGTCGGATACATACAGATCCGGTAGTACTGCAAACCGTCGAAGCGCGGCTCGATGTCGATAGCAAAAACCGGCATGCCATCCTCCGTTTCACCCTCCGGCTGTAGCTGATAGCGCACCGTAGAGGCCGCATCCCGCGGCTCCTCCATGGCGCTGAACCGGCACTCGATGGTGACATCATCACAGCTCAGGCCGTTAAGGTGGGCCTTGACCCGTATCGGCAGCGATTCGCCGTGGAGCAGGCTTGTGGGTGCAGCGTCGATCCGCTCCAGCCAAGTCCCACCCCAGTGCTCGCGCACCCGCTCCTTCCAGCGCGCTAGCTCCTGCGCCCCGCTTAGGCTATCGGCTTGAAGAATCTTGCTCTGCGCGCGGGCCGGGGCATAGAGCTCGCTGACATAGTCCATGACCATGCGCTGGGCATTAAAACGCGGCAATGTCGTGCGCATCGCCGCCTTAGACATCTTCACCCATTCAGTCGCGTAGCCACTGTTGCCACGATTGAAATAGAGCGGAATCACCTCATCAGAGAGGAGGTTGAGTAAATCGCGCGCCTCTTCCCGGTCCCGATACTCAGGATCAAAGCCTGCCGAATGAGGGAGGATCGCCCAACCGTTGCCTTTCTCATAGCCCTCGTCCCACCAGCCGTCGAGCACCGACAGGTTAAGCACGCCGTTGATTGCCGCCTTTTGCCCCGAAGTGCCGCACGCCTCTAGCGGATACTCAGGATTGTTCAGCCAGACATCGACACCGGTTACCAGCTTACGCGCCTGCGCCATGTCGTAGTCTTCGAGCAGGATAATCTTACCGATCAGATCCGGGTCGAGACTCAGCTCGTGGATCCTGCGGATCATCGCCTGACCCTTCTCATCATGGGGGTGGGCCTTACCGGCGAAGATCAAGATAACCGGTCGCTGCGGGTCGTTGAGCAGCTCCTTGAGGCGATCTATCTCATAGAAAATCAGTAGCGCCCGCTTGTAAGTAGCAAAACGCCGTGCAAAGCCGATGACCAGCAGATCCGGATCTGGATTGCTGATATTGCTGGTCATCCGCTCAATCATGGCATCTGACATACCGTTACGCTGGCAACGCTTCAATACTCGCTGATAGACATCGCGCAACAGCTCCGATTTAAGGCTGCGGCGCATACTCCAGAAACGGTGATCAGGCAACT
CGTCAACCACTTTCCAGAAATCTTCGTTAAGCAGCTGATTGCGCCAAGCGTGCCAGCGCTGATCAAAGAGATTGGCCCACTCCTGCGCGAGAAAAGTCGGCACATGGATGCCGTTGGTTATCGAGGTTATGGGGTTCTCCTGAGCGGGTATCTCAGGCCAGATATGCTGCTCCATCTGTGAGGCAACACCACCGTGGATAGCGCTGACGCCGTTATGAAAGCGCGAGCCGCGCAGCGCTAGGCTAGTCATGTCAAACCCTTTCTGGCCGTTGCCCAGAGCAAGGATATCTTCGATGGGGATGTCAGTGTCAGCCAAGTTGGGCGCTAGGTGCTCAGCTACCATTTCCGGTTCGAAGATATCGTGGCCGGCGGGTACCGGTGTGTGAGTAGTGAACACGGTCTCGCTGGCGACTGCTTCAATAGCTGTGGCCGAGTCATACCCCTCACTAATCAACTCGCGGCAGCGCTCGACAATCTGAAAGGCTGAGTGGCCTTCGTTGATATGCCAAACACTCGGAGAGATGCCGAGCTTGCGTAGCGCGCGGACACCACCGAGGCCTAGGCAAAGCTCCTGGAGGATGCGCATATGCGCATCGCCGCCGTAGAGCTGATAGGTGATAGCGCGGTCAGCTGCATCATTTTCCGGGACCTCACTGTCGAGCAGATAGATGAGCACATGCCCGGCACGCATCTGCCATACCCGCAGATGTATCTCCCGACCGGGGGCGTCGACGCTAACCTGCACCTGCTCGCCGTCGTCATCGAGGCACGGTGTTATCGGCAGCTCGGAGATGCTGCTCGGGGCGTAATGGGCCTGCTGATTGCCTTCGTGATCAATGGTCTGTGAGAAGTACCCCTGACGGTAGAGCAGCCCCACAGCTACTAACGGCAGACGCAGATCGCTAGCCGCCTTGCAGTGATCTCCGGCGAGGATGCCGAGGCCGCCGGAGTAGAGCGGGACACTTTCGTGGAAGCCAAACTCGGCGCAAAAATAGGCTACCAGATCGTTATCTGGGTCGATGTACGGCGCCACTTTGCTGGTCAGCGCCGCCTGCTCGTGGTAGGTATCGTAGGCCGACAGGGTGCGATTGTAGTCGGCAATGTAGGCCTCATCCTGAGCCGCCTCCTCGAGCTTATGCTGAGCAATACGGCGCAAGAAGACCTTGGGGTTGTGCCCGCAGGCCTCCCAGAGCTCGGGGTCGAGCTGCACAAAAAGCGCTCTAACGTGCCGATCCCAAGAGTAGTAGAGATCCTCAGCCAGCTCTTCGAGGCGGCTTAGATTAGGGGGGATATTGGGTTGAACTTCAAGGGTGAAGATGTTCTCTTTCATGGAGGTCTTAGTGTCTCGGTTGGTCCTAGGTGCTTGTTGCTGACAAGTGGTCAGCTGGTAGCGGGTAGTTTACTAGTCTTAGGATGGCGTGGCTAAGGGAATTTTTGGGGGCGCGTGACCTCCGGTATTCCGCCGGCGCTACCATCCGCCCCGGTGTGGAGGACGCCGTGAATCCATCCCTGGAGGCTTCATGGCGCCATCCCTGGCGCCAAGACCTCCACACCGGGGCGGATGGTAGCGCCGGCGGAGTTTTTAGAGGCCACCAATACTAGCTAGTTGCTGGCGAAGATATCCTCTATATTAGCTTCGTAATATGATAAGTCTATCGAAGTCCTCCGGCACCTCGATCTGCCCTTTCAGGATGCGGTTGAAGAATCCGACCCATGAAAACAACCAAATACTTCGATGCAACCAGACCCCGTCCTGACCGAGCCAGGATTGAGGACGGATGGATTCTACGGGGAAGTGGATGGAAATCCCGGAGGCATAGGCGATGCGGCTGACCAGTGCAGAGCGCGAGGTCATCCTGGATGTGGCGCGCGAGGTTCTCGGTGACGACGTCAAGGTTCGCCTGTTCGGGTTGATGTTTCCTGGGTCGAGTCTCTTAAAGACGATCCGGAGCGGGCTGAGCGACTTGATGCCTTTGTGGCGCGTTATGGACGTATGCAGG
ATACGATCGGCGATCGGCTTATCCCGAGCCTGATGCGAGATATATTGGAAACGCCTGGCTCTACGCTGGACAATCTAAACCGGATGGAGAAACTGGGTATATTGGCTGCGGTTTCCGATTGGGTCGAGGCCAGGAATTTGCGGAACCGGCTTGTCCATGAGTATATGCGCGATGCCGAAGAGTTTGCTTCGGCTGTCAATCGTGCTCGTGAATGCGTGTCATTGCTGATATCGACCTACAACAATATCCTTGATTATGCAGCTCGACAGCTTGAGCCACCTGATGGCTACATGTGGCCTGAGAGATTAGTCTAATCCTTACTATAAGCCAGGTTGGCTTGCCATCATCTGCGCCTCGAAGAGGCGATCGCTATGCGTCTCCTGCGGGCTCGGATCGACGTAGCTGCCCGGGAGGGGCGTGGGAAGATCATCGCTGAGACGCTCCCGCACCTTGTCGGGAAGACTCTCCCGCAACACTACGGCAGCTCGCTCCGGCGTCTCTAGGGTCGCCTTCAACAGGGCATCGCGAGGATTAGCTGGGTGATCGGCCATCGGCTCTATTCAGCCATTCGATCACTGCCGATTCCGGGTGAGGGCGCCCCACTCAGAGAGAGCGGACTCAAGGCTGATGTAATTGTATTCGCCCCTGCGCAGGGCCCGGGCAATACGCTCCGGTGTGTAACCATCCTGAGGCAATCCGGGTGGGTAGACATAGACCCCACCTCCACGGGCAGCGGGCTTTAGCCAGCCGGCTTCGACCAGCCGAGCGATCCCTGCCTTTCGGGCCTTGGTGCGATCCGAGCGGAACAGGCTATCGAGATCGCGCCGAGTAAAGACGGCGCGGCCCTCATCGGCTTGGGAGTTGAGGATCTTGAGCGCAGTATTCTGACGCATGGCAAGCACCCGCGGAGTACTAGACACAGAGACTAAAAAAAATGGCTTTCGTGTCCAGTTGCTGCCGTCCAAGGAACTCGGGGCGGGGGGGGGCGGGAGACCGAATCGTCAACGAGACCCCCGGTGGGGGTGGCTGCGTGCCGCCGCCGGGCCTCCATACCCTCCAGTTGCGGTCTTGACAACCGTTGGCTGCTAAATTTTAGAAAAATCCCATATAAATAGGGAACTTTTCCCATGTACATGGCCATGTTACGTGAGCAGTATCGTTCCTGCATTCGGGGAATGGAAGCCGCCCTACCGCGCCGCGCAGGAAGCGCTCGGATGCCCTAGGTAGTGAACATCCGCTCGGCAAATTTTTTCTCCGGCTCGCCTTGAACTTCGACGTGGAGCAGTATCCAGGTTGGCTTGCCATCATCGGCGTAAACCTCGACAAGTTTGTCGGCATAATTTCACGGAAAAAGGGGTCTGTCCCCTTTTTGCGCTGCCTGTCCGCCGGGGTTAGTGCGGTATCCGGTAGACGTGCAGGCCGGGGACTTTCTCGAAGTCGCCATCCCCCGTGACGAATGCGCCCCTCGGGTCTACCGTTAGCAGGCAAGCTGCCTGGATCGCATCGGGTGTGCGCAGCCTGTATTGGGCGCGCAGCTCCGTAGCTCTGTCCAGCACGGCGGTGTCTAGCGCTATCACCGAGAGCCCCGGGTCGTCGAAGAAGCCATCCAGTCGCTCCAGTGCCTGCGTGTCGGATTCTCGCAGCGGGCGCACCCGGCACTCTAGCCGGCTCAAGGCTGAGGCCATCAGCACCGGCGTTTCGTCAGAGCCGCGTTCCAGTTCCAAAAGCACCTGCCGGGTGGCATCCCGGGTCTGCCCGTCTCCCTCCAGCAGATAGATGATGGCGCTGGCGTCCAGAAAAACACGCATTAATCGCGCCAGCCTTCGCGCTCCGATTGTAGTTCACGGTCCAGCTCCGCACGGTTGCGCGCTGCGCCGTATGCCTCGTCATCCCCTAGCAGGCGATCCCATAGCCTTTTTCGGGCCTGGGCACCCTCCGATAAGCGCTGGTAAGACTCCTCGCTCAGAATTACGTAGCGGGGGCGGTTGCGCTGGATGACG
TGCACCGGCCCGTTTTTCAACGCCTGATCCACCGCGCTGATGCCACGGCGCTTGATCTCTTGTGCGGAAATGACGTTTTCCATCAGTACGACCTCGAGCACTTAATTTAGTATCCATTTTAGCACCGAAATCGAGCAGGGAGGCTCTGGCTCTTAGCTCCCGGGGTAGCGGACGGCCCGGGGTTACGCCCAAAGGAATCCAGCCGAACCGTTCGGCACGATCCAGGTTATCGAGGGCAGGGCCTACGCGCTCGCCGTCCGTCGAGAAGTCCTCCGGCACCTCGATCTGCCCTTTCAGGATGCGGTTGAAGAATCCGACCCATGAAAACAACCAAATACTTCGATGCAACCAGAACCCGTCCTGACCGAGCCAGGATTGAGGACGAATGGATTCTACGGACCATAGAAAACCCCGAGCATGAGCACGTCCAAGAAGATCTACGAGTGCGGCGATGGGCCCGGATTCCTGAGGCTGGGGGTCGCTACCTTCGGGTAATTTTGCTTCCGGACCGAGAGACGGTCCATAATGCTTTTTTCGATAGAGGTTTCAGGCCATGAAATTGCAGTATTTCGAGGATACGGACACCTTGTACATCGAGTTCCAGAGCCGAGCGATCTCGGAAACCAGAGACCTGGACGAAAACACGATTCTTGACCTAGATTCGGAAGGGAACGTTTGCGCAATTACCTTTGAGCATGCCAGTCAGCGTACTGACGTAAACCACCTGCATGTAGAAGGCCTAGCTGCATAACAAAGGGTTAGACAATGGATCTGTGCGACCTATCTTAGGGTTAAGGCAAAGGATTGTCAGGCGGGGTAGGGTATGTCGAGACAGGGAGGCGTGAGGGATTACGCTAGTCAAGGCACGGATGCTGTAATGGGACGAGCGCGCTGGGGATCTGTGAGCAAGGATGCTGCCATCCCTAGAGCCCCAAGTCGTGCACTTTGGCGGCGGAGCGAGTGCAGCAAGGGGGCCTTCTCAAAGGGGCACACCAAGCCGCCAAGGCCTATTTATTACATGCTTGGGCCCTTGGAAAGCTCTTACGACGTTGGTTAGCAGGGATCGGGCTGCCAGCCTTGCGGGCCATACTAGTCTTCAGCGAAGACGTCCTCGATGCGTTCGGCATCAAGGATCCGGTCGAGCCACATCTCCAGTTCTCCTAGCGCTGCACGCTCAACCCGAGCCCGTGACGCCTCTTTGGCTTCCGCCCCGAACTTGCGCTCAATTTGCCGAAGTAGCGTCTTGGTGGCCGTCTGGTGACTGCCTCTCTTCTGACCTCTCTTCTCTCCTCGCTTCTCTCCTCGCTTGATCCACTCTTCAGCTACCGTCATCACCAACTCCTCCGCTCGATCCGGTCGCGTCTGCTCCAGAGCCTGACGTACGTCCTTCTCCTGGATGTAACTATACACCCTCGCAATATACACCAACAGCGGCTTCTCCAGTGGATGCCCGGGCGGCAGATCGCGCAATAGCTTGATCAGATCGTCTCGATCCAGGTCCTCGACAAACGCCCAGGCGTCGACTTGTGCTTGATGAGCACGTAGAGAAAGACCGGGCGACCATCGCGCATCTGCGCCTCGAAGAGGCGATCGCTATGCGTCTCCTGCAGGCTCGGATCGACGTAGCTGCCCGGGAGGGGCGTGGGAAGATCATCGCTGAGACGCTCCCGCACCTTGTCGGGAAGGCTCTCCCGTAACACTACGGCAGCTCGCTCCGGCGTCTCCAGGGTCACCTTCAGCAAGGCATCGTGAGGATTAGCTGGGTGATCGGCCATCGGCTCTATTCAGCCATTCGATCACTGCCGGTTCCGGGTGAGGGCGCATCACCTCGGAGACAGCATTTGTGTCGAGCAGGATCATCCTATATCTGGTGTTCTGCGAGGTAATCCGGGTCTTGTACATCGCGCTGCACCACGGCAGTACGCACCAGCACGCTCTACCTGCTGGTGATACCGCTCTGTAGGCTCCGGGCCGAACTTCTCCTGGAGCTGTAAGAGCAG
CAGCTTACCCTGGCCCTCCTGCCTCCAATGCTCGTGCCACTTATCCACCCGCTCAGCCAGCATTTTATGCACCTCCTCCAACTCCTGCGCTCGATCAATGAGCTCCCGATCCGGCTCCGGCACATGTTTGCGCCGCAGATAGACCCGACGGAACCAAGTGAGAAACGCTCTCCTTCCGGACGAAGCCAGTGAGCAGGTCGCTGACCATCCGCGGATAGGAGAAGAGCATCTTGTAGGTGTTGTCGTTTTCCTGGGGCACCGTCAGCCCCGCCGAGGCGACGTCCTCATGGACTTCCTCGCCGCCGATACGCAGGCATCGGGTGCCGCGGCCATTGGCGATATTGTCGCCGGGGCAATAGTAACTTGGCGTCGCGTTGCGGCCGACGAGGAGCTTGCGTGCGCAGCACCCGCAGGTGGCGAGTCCCTGGAGGAGCGCCCCGATTTATTCCGTAGGCAGGTATATGATCAGTTAGGTCTTGACAACGTAATCGCCGGGATTCCGCTTATGGTATGGACTCTCGACGTAGCGCACGCGGTCGGTCAACTCTAGAGTTCGTTAGATAACCGCCTCGAAGTTTGCCAGGCGCTCAGCAGTAATGACCAGTCCCTGGGCGGCCAGCGTTTCAAGATAGTCACTGACTGATTTCTCTGGGTTCTTGAGGGCCTCTCGGTGAGTCTTAGCCACGTTGACGACTGCGCCTTCGTGCAAATCCATTTGTCGCTCTACGAAAATGTCAGGGTGAAGCGCTTCGATGTCGTACTGGCCAAGCGCTTCGGCCGGGAAGTCGTCCAGGTTGAATGTAATGATATTCTGTGCGCCGACACGAATCGCGGCCGCCAAGACGTGCGGTCATCCGGGTCGGGAAGATCCAAGCAACTCTCGAGTTGCTCATACCCTTCAACCAAGCAATCCGGAACCGCCTTATTCATCAGCGCCCGCGTCCGAACAAGCTGGTTTTGGAGTTCCGGACGGTCCCGCAACACACTATGGATCCATTCATCGTGAATCCTCTCCGTCCATTTCGCAGCAAAGAACCCCGAAACCGCTAAGCGTAGAAGAAAATCCCGTAATGGCGCGGGGTAGAGCACGCAGGCATCATAAATGACGGTGGTGCGCATTCAGTAGCCCATGTTGTACTCTTGTGCTTGCTCCACCAGCTCATCTAGCGCCGCTTTGCTGCTCTCATCACGCTGGGCTTTGTACGCCATCAAGTGCTCGTACCGGATGCGCCGGTGCGTGCCCACGCGGGTAAAGGGGATGGCGCCCTGTTCAAGCAGCTTAACCAGGTGCGGGCGCGAAACATTCAATAGATCGGCAGCCTGCTGAGTGGTGAGCTCCGCATGGGTCGGAACAATGGTAACCGCGTTGCCTTGCGCCATCTCCGCAAGCAGATCCCGCAGCAGAGCGAGGGCATGCCTTGGCAGCACCAGATCCCTGCCATCCAGTTGGACCCGCGCCCTGTCGGCCTCCGGGAGCTGAGTCAGCAACCTGCTGATTTCCTCCGCACTTGCTCGCGCAAGCGTTGCCGTAGCCTGATCCGGCAACGGCTGACGATCATCGACGGTGTTCATCACGAAGGGCTCCTCCAAGATACTGCATGCCTTTCACTCGGATCATATTCGAAACAAACGAAAGGCGCAATGCTCACGGGAAATCCGCACGGAAGGGCACCATGTATCACCGGGGATATGCGCCCCCGTGCGTAAAACGCACGCTAGGATGCTGCCGGTCGAAACGGCGTATGGCGCAGCTTAACGGCCTGGCGTTTGCTGCGTTCACCGCGCCGATCGTAGCCGGCGGTGGTTTGGGGGCTGCTGTGCCCCATCAGGTGTTAATCGGTCTCCCGATCCACGGCCCGCGGGCGATCGGATCGTCATCGAGACCCCCTGTGGGGGCGTCTGCGTGCCGCCGCCGGGTCTCCATACCCTCCAGTTGCGGTCTTGACAACCGTTGGCTGCTAAATTTTAGAAAAATCCCATATAAATAGGGAACTTTTACC
ATGTACATGGCCATGTTACGTGAGCAGTATCGTTCCTGCATTCGGGGAATGGAAGCCGCCCCACCGCGCCGCGCAGGAAGCGCTCGGATGCCCTAGGTAGTTGTCTAAGTACACGGATCTGTGCGGCCTATCTTAGGGTTAAGGCAAAGGATTGTCAGGCGGGGTAGGGTATGTCGAGACAGGGAGGCGAGAGGGATTACGCTAGTCAAGGCACGGATGCTGTAACGGGATGAGCGCGCTGGGGATGTGTGAGCAAGGATGCTGCCATCCCTAGAGCCCCAACTCGTGGGCTTTGGCGGCGGAGCGAGTGCAGCAAGGGGGTCTTCTCAAAGGGGCACACCAAGCCGCCAAGACCCATTACGTGCTTGGCCCCTTGGAAAGCTCTTACAACGTTGGTTAGCAGGGATCGGGCAGCCAGCCTTGCGGGTCATACTAGTCGTCAGCGAAGACGTCCTCGATGCGTTCGGCATCAAGGATCCGGTCAAGCCACATCTCCAACTCCTCGAATTCAGCATGCTCCACCCGAGCACGCGATGCCTCCTTAGTTGTCGGGCCAAACTTGCGCTCGATCAAACGTAGGAGTGTCTCGGCCGCATGCTCCTGACGGCCCTCCTGACGGCCCTCCTGAAGGCCCTCTTGGCGGCCCTCCTGACGGCCCTTTTTGATTCCGGCACGCTCAAAACTAGTCACGTATGGCATCTGGTAATCCTCCTCCAATTCATGGGCCTCGGTGATAAACGCCTCCTCCAGGTCCTCGGGCAACTGTATCAACCAGTCGATGAGACGCATAAGCTCCTTAGTCTCATCCTTACTGTAGCCACGCTCGAGCAGCATGCGGGTCAGCCCCAGCTTAACTTCTTTGCGCTGGTGCGGATCGTGCTCGTTCTTCGCGGCCAGCTGGGCTAGAACCACACAGGCAAAGGGATTGAGATCAGCCTCCAGCTCCGCCCAGCGCTGCTGCCAGTCTAAAAGTTTAGCCACCGGGAAGCTAAACTCTAGGCGGCAGCCCAGCCGCTCGTAGCGATAAGTATCCGAGCGAAAGCTAGGGGAGGTGTCCGTGAGCACCGCTACACTGACGATATCCCGCTGGTAGCGATCATAAAGCCGATAGTGATAGGTGAACATCCGCTCGGCAAATTTTTTCTCCGGCTCGCCTTGAACTTCGACGTGGAGCAGTATCCAAGTTGGCTTGCCATCATCGGCGTAAACCTCGACAAGTTTGTCGGCATACCGCCGACCCGAATCAGCATCCCGAATAATCTGCTGAAGCTCCTTATCGCGAAAATGAGATCCCTTCGACCAGTCGATCTGTTCATGGATCTGCGGGAAGAGCAGCGCCATGGCGTGTTCAAGGTAATACTCCAGCGCCTCCTTCCACGGCGAGTCGTAGTCGCTTTTCTCACTATCCGTATGTGATGTCTGCGTCATTTCAAAGCTGGTTCCAGTCCAAGCCAAGCCAATTAAGATCAATATACCTGTCTATAGTCGGGGCGACTGCAGCAAAAGACAAGCCGTCAGGGAGAGGAGGGCGCCAACAGAGTGGTGGTGGCCGCCCAAAAGCCACCACCCCATAAGCGCGTCGTAAGCTGCACTAGCTGTCTTGGTGTTGTGCTAGGCTATATTGCAAGCATTAAAGAGGTGATCACCCACTGGCTCAAGCGTCACTTAATGCATTGACCGATGAGCAGATCGCCCAGGCGACCGGATTGAGCGTTGAGGATATCGCCAAGCTACGCGCCGATGGGCAGGGCGAGCACTGACTGTAGTGCTTAAAAATTATTAAGCGCTCTGAACTGTTCTAGTGAGGCATTCCAGAGCAATGCGTGGTGGCGGAGTAGTTCTCAATGCCGGAGGCGAAGACTATCCGGCGCACCTCAGAGTGTTCTTTTAGGTAGGAGTGCGGACTTTGTTCTAATTGAGCGATGCCGAGCGCCAACGACCCCGCCAAGCCACTGTACGAGACCTTGTGCGATCAGACAGCTGGCGTGCTGAAACAGCGG
CTTGCCTTGCTCCCCTCTGGCGGAGCAGGTCTGAACCGCAAAGGCGATCTAGTTGAAGCGATCTACGTATCCACGCCGTGAACAACCAAGTGCAAGCTTTACTCGGGTATTGGAAAGCACATAGCAAATGGCCACTCAGCTCAACCTTTACAACAGAAAGCAACAGGTATAGCTTATGGCCTATAATCAAGGCAAGAAAGCGGAGCATGTGGCAAATCATTTTTACTGAGCGCTTTGAAGGGTGGCTTCGCGGGCTTAGCGATGATGATCGGGTAGCTGTTCTTGCGGTCATCAATCTTCTGAAGGAGAGAGGGCCTGAGCTGCCAAGGCCTCATGCAGACACAGTTAACGGTTCGCATTACACGAACATGAAAGAGCTCAGGATCCAGAGCCAGGGGCGACCGCTGCGAGCGTTTTTGCGTTTGATCCGCTGCGCCAGGGCGTTGTGTTGTGCGCAGGAAACAAAGGCGGGAAGGATAAACGGTTTTATAAAGACATGATCCCGGTGGCTGACAATGAGTTTGCAGCCCACCTAGAGGGACTAAACAGGAGAGACAACGATGGCACGCACTCTTGAAGATTATCTTGCTGACGAAAAACCGGAGGTCGTGCAGCGCGCAAAGGAGAAGGCCGACGAGATGCGCCTGGAGATGCATCTGGCCGAAATTCGTCAAAAATGCGAAATGACTCAAGCACAGCTAGCAAGAATTATGGATGTGAAACAGCCGACCGTTGCCGGCCTTGAGCGTGAGGGTAAAGATCTGCGGCTGAGTACACTAAAGCGCTATGTAGAGAGCTTGGGTGGGCGTGTGCGCCTGGATATTGAGCTTCCGGATGGCACCCATAACGACTTTCCGGTCTGACCTCCCTCCGCTTTAGCTGTAGCGTCTTAAGCGAACCAAGGGCGCTGAATCACCTCGCAATTGAAGTGCATATGTACTAGCTGCTGGCGAAGATATCCTCTACGCTCTCGGCATCGAGAATTCGGTCAAGCCACACCTCCAACTCCTCGAATTCAGCATGCTCCACCGAGCACGCGATGCCTCCTTAGTTGCCGGGCCAAACTTGCGCTCGATCAAACGTAGGAGTGTCTCGGCCGCATGCTCCTGACGGCCCTCCTGACGGCCCTTTTTGATTCCGGCACGCTCAAAACTCGTCACATATGGCATCTGGTAATCCTCCTCCAATTCATGGGCCTCGGTGATAAACGCCTCCTCCAGATCCTCAGGCAACTGTATCAACCAGTCGATGAGACGCATAAGCTCCTTAGTCTCATCCTTACTGTAGCCACGCTCGAGCAGCATGCGGGTCAGCCCCAGCTTGACTTCTTTGCGCTGGTGCGGATCGTGCTCGTTCTTCGCGGCCAGCTGGGCTAGAACCACACATGCAAGAAAGACAAGCCGCTAGGGAGAGGGCGATGTCGACAGCGTTGCGGTGGCCGCCCACAAGGCCACCTCAGGATGCCCCGCACAACGCCTCCGGCGGCTCGAAGCCGATATCATCCGGTTGCCGGCTCGCGGCGAAGAAGTCCCGATACCACCGCACCGTGCGCTCGACGCCCTCGCGCCAGGGCACCTGCGGCTCGTAGCCGAGCTCCTCCCGGCACTTGGTCAGATCCGGGCAGCGGCACCGTCTCCGGTGTCGGGATTGACGCATCGAATAGATCCATGGTGGCGGCGCAGTTATCGCTGCCGGAGGTGATGACCATCCGGCCAACTCAGCGCGTTGGAGCACGCTTCGGCGGAGCCAGACCGCCGCACTCTGTTAACGCGCGGCTTATGGCTGGCGTTATGTGCGGTTGATACCCGTGGCGTTACAGGTTACACTTAAGGCATGATTCGAAGCTCCAAGCACAAGGGCTTGGCCGATGTTGATGCACAATCCCCCCCACCCTGGTGCAGTGTTGCGTGAGCTTTGTCTAGAGCCCATGGGCCTCTCTGTAACGGCCGCTGCAGAAGCTTTGGGAGTAAGTCGAAAAACCTTGAGTGCTGTGTTGAACG
GCAAAGCTGGCATTAGTCCAGAGATGGCCATTCGGCTTTCTATTGCGTTTGATACTTCGGCAGAGAGTTGGCTGAACCAGCAGTCCCAGTATGAGCTCTGGCATGCTGAGCAGCATCGTAAAGAGCTCAAGGTAAAGAAGCTTGTCGCCGCCTGACTCAACATCCTCACTGCAGCCACGTTCGAGCAGCATGCGGGTCAGCCCCAGCTTAACTTCTTTGCGCTGGTGCGGATCGTGCTCGTTCTTCGCGGCCAGCTGGGCTAGAACCACACAGGCAAGAAAGAGAGGCCGCTAGGGAGAGGACGATGTCGACAGCGTGGCGGTGGCCGCATAAAAGACCACCTCTCACTTAAAGGAGGCCATCAAGCCGCCAAGGATTGATGATTTTCAGGCCGCTTCCAGAGAAATCAGACTCGTTGCGAGTAGCGACTGCCATGCCGTTACGACGGGCTATGGCGGCGATTTGCCCATCCGGGGCCGATAGAGGACGGCCGCTCCGGCGTCGAGATGCCATTATGAGCCCGTAATAACGGGCAGACGCCTCATCAAAGTCGAGGACGCGTTGTGCGAAGGCCTTGTGGATGAAGGTCTCAAAGCGCGCGCGAAGGTCTTCACGGCGCTGACTCTCGGGCATGGCATGAAGGCCGTACTCGATCTCGGCGATGGTGATAGAGCTCACATATAGTGAACCGCCATCGGCTCTATTTACCCATTCGATTACTGCCGATTCCGGGTGCGGGCGCATCACCTCGGAGACAACATTTGTGTCAAGCAGGATCATCCTTTCGGCGTGACCGGGTCGTGAGGGTCATGGTGCGGAAGCTCCAGCTCCACACCGTGCGCGGGACCGAACGTAGTCCGGGCGAGATCGCCCAAGCGCTCTGGCCCCGACACGGCCGACTGCAGGATGCGCCTCACTTCCTCCTCCATAGATACGCCGTGTCGGTCGGCCCGGGCCCGCAGGAGCCGTAGCGTTTCCTCATCGAGACGGCGGACGCTGATGCTGCTCATAGCCTGCTTCCCCTTGATTTCCGGCTGCTAGCACATGATAGCAAAATGCTGAGCATAAGCCCAAAAGAATGCGGTCGGCTCGAGATGGACTAATCCCGTCAATACTGCGGGCACGCTGCCCGTTCAAGCCGAGCAGCATCCGGGTGACGTTATGATCAGCGGCTGCGGGAAGTGGATGGAAATTCCGGAGGCATAGGCGATGCGGCTGACCAGTGCGGAGCGCGAGGTCATCCTGGATGTGGCGCGCGAGGTTCTCGGTGACGACGTCAAGGTTCGCCTGTTCGGGTCGCGTGTTGATGATAGCGCACGTGGTGGCGACATCGATTTGTTAATCGAGTGCGACGAGCAGGTCCAGAACCGCGCGTCAGCGGCGAGTCGGGTAGCGGCGCTGCTGCAAATGCGGCTCGGTGATCAGCGTATCGATGTCCTGATTCTGGATCCGTCGACCACACGACTTCCGGTCCATGACGAGGCCCTTCGAGAAGGAGTGGAGATTTGAGCCGGCAGGTCTAGAGACATCCCCTAAGAACTTCCCCCCAGGCCTAGCTCCCTCCTAGAACGTGCTATCCGAATAACCGAAGAGACGAAAATCTTCGGCGAAGAGCCGTCTGACGATATCGGCACTCTCATCATCGTAGTAGTTCTTGCATTTGGTCGCGGCCCCGGTCGGAGCAGGGCCAGAGCTTTGCAACTGCAGCTGCGACGAATCCCCGCCGATACTATCCATAATAGTTCTTAGATCATGCTCGAGGCTCTCGACGTAGCCAACGTAATCGATGCGCTCAATGCCTATAGGCTCAATTATGCGGTGCTGAGGCATCCAGTGGATGTTCGCCAAGAGCCCCCCACGCTCCAGAAAGCGGCAGAAACCCAAAAAAGAGGCCTCGCCATTGCCATAGGCCTTGATCTCGCCACCAAAGCGGCGCATCCAGCTGTCTGCATTCGATTTGCGAGCGAATTTCTGCAGATAGGCTGAGAGGGTCCTATGGT
AAGGATTACGGACTACGGTGAATATATAGTGATCATTAAGTACGGTATCGATCTGCTGCTGCGATAGATTGCTGAGCTTGTCGTAGGAGGACTTGCCCTGCCTGTCCTCATGCCCAGAGGGAAAGTGATAATCAAGGGTGCTGGATATTGTGCTGTTAGCAGCTTTCGGAATGCGAATATAGGTGTACCCATGCCGCATGGATACGCCAATGCGCGCATCTACCGAGCGTAGCTGATAGCGCCACAACTGAGGCCGGTTACGATAAAACGGTTTCGTGGCGCAGCGTAGATCCTTGTGTAACTTGGGGATCGAAACACCCAACGTGCGTTTGTGCGGGCTAGAAGGATAGACATATCAAGAATTCGCCGGAGTTAAGAATTCGCCTGGGGACATGCTGCATGATTGCATTAAGCTAATCGTAATCAGGAGCTCTCACGCCAGAGTCGCACCATCCTAGCATAGCTATGCTGGGAGTTTTTTAAAGACACCCATTAGGCCACCAACTTGGCCAGGCCAGCACCTGTCCGTAATCCTGCTCTAAGCGCGCTCTGATTTTAGCAGCAAGACTGCCAGACGTTGGTGCAGCGCGATCATCAGCAAGCGCCCATTTTGAATGGACTATATCGATAGTTACCGGTGCCCTTGTAATGCCGAGGACGCGGAGCATCGCCAGGCGGTGACTGCCCTCGCGGCACTTTACAAGCCTACCGTCACGGTCGATTCGTACAGTGATCTCATGTCCAGGTTTGCCGCCGAGAGCCGACTGCTCTTTATAGCCGTCGTGGCTCAGTTCGAGCAGCAGTTTCCACTGCCTGTCAAAGTATGAATCCAGCTCAGCCACACTTTGCATGCGAGGATTCTTGATGTAACCAAGTCTCTCCAGCTGCTCGCGCATCTGGCCATAGCAATCCGTATCTTTGTACTTGCCCCCAGAAAGCCATATCTCGCGGATTTGAGCCTCGCGGAAAGGTTCGAGTGCCTGTCCCCCTAAATCCCATGTTCCGCCACATATCCAGAAAGGCCAGCGCTTGAGGTACCCCGGCCATTCGTAAAACATCTTCAGGTACTCAATATTCAGAGGGTTGGTCTCAATAATGACCGAGCCTCTGCGCGCTAGCATTAGCCACCGCGAACTATGGTGCAGGGTCTTGGATAGGGGGTTCATTTTGTCCAATGTGGGTAACCAATACTTTGCTTAACCAATGTTACCAGCGCTCACCAGCAAGTGCGTGACGGTATAACTCATCGTAGTCCTGGGCAACACGCTCAATACGAAAATGGTCGATTACTCGCTGCCGACACGCTCCTCTATCAAGGGTTTGGGCTTTATGAATGGCCTGTTCGAGTGCCTCGGGCTCAGGCTCAGAGATGATTCCACTAATGCCGTCCTCGACCAACTCGGCAACTGAACCGCGCTGCGTGGCTACCACCGGCGTGCCGACAGCAAGGCTCTCGATAAGAACTAGGCCGAAGGGTTCGTCCCACTTGATCGGGAAGAGCAGCGCGTCAGCGTGCTGTAGCAGGTGAAGTTTGCGCTCACCGCCTACTGTTCCAGCGACCTTTAACGAGCGCACAAAGGGTAACCAAGTAGCGGGGTACTTGAGGTGCCAACCGCCGGCGAGCACTACTGGCTGCCCGCGGCGGGCAACGCGGATAGCTAAATCGGCCCCTTTCTTGCGTCGGCTGACCTTGCCGAGAAAAAGGGCATAGCCTCCTTTGCCTGGACCTAGCGGGTACTCATCTACGGGAAGGCCGTTATAGACGAATACAGAGTGTCCGTGTCGTGTAGCGTGGTTTGCGGACAGATAGACTCGGTTGGCAAGTGGCAAGGCGCCGGTGCGGTGGTTGCCGCGAACCGTAACAACGAACGGTACACCTAGGCAGTCCAGATCCTCACCCGTTGCCCTGTCTGTCTCGGGATCAGTGGCGTGAAAATGGATAATGGTTTGGCCGGAGGCCGATGCCTCGATAGCTTCCTTCAAGGGTGTCCCGCACAATC
TATCAAGCCGATAGTGCTCGAAGGTCTCGGTTGCGTGTCCACCTTGAGCAACCACGACCACCCGGTAGCCAAGCTTACTTTGTGCTGTCGCGAGCCACTCTATGCTGCGCGGCATGCCGCCGTCGCATGGTGCTCCTGGTACTGCCTTTGCAGCCAAATGGATGATGGTCGAAGGGGAGTTCGTCACAGCCGCTCCGCTGGTGTTTGCTAAAAATAATGCTTAGCATAATAGCAAGCTGTCAGTTTAAGAGCTAAGCAATTTGTCGATTTGCTATTATCGATTTTGGATATGCTTGTTAAGTAAATGAGGGTCATGCCTCATAAGAACATCCAAGTCCTGTTCAGTATTGCTTGCTGAATATGTAGGCAGCCCAACCAAGTCAATCTTTTTGGCGTGGCTATACCGCAAATACTCACGGTAAATTTCTAATGCGCTTTCTTTTCGTAGTTCTCCAGGTTTTCGCTGTTCACGGATATGCTCAGGACAGTTTGTAAGTATAATACGGCTTGGTTTGGGAGCGAAAAAAGTGGCGACTCGACTTAGGGAAGAGTACGCTAGCGGCATGTTTGACCTGAGCCCAAAAAAATTGAAATCAACAAAAAAACGGTCAAGCAGTAAGTGTTTTTTTCGGCTCCGCAACGCAAACACAAAAAAGAAATGAATCAGCGCAGTTATGCTAACAAGCCCGGAAAGCCGTTCTTCAGCAATATTCTTTTTTATTTCCAGCCTTTTCATTTTTGACCTTACAAAAAACTTTGTGACAAAATTCCTGCGATAAAGGTCTTTAAACTTATAAATGAGAAACGCATTTCCAAAATTTTCTTTCAATGCAAATATTAAAGCGGTCTTTCCGCTGCCGTCCGGCCCAGCAATAACAGTCGTACGCCTTTTAAATAGAAACTTACGTCTTTTTTTGTGTATGTAAGATTTTGCCGAGTGAAGCCAATCCTTTTCGGGCGTTACATTTGCGTCTTGCAGGTACTTTATTGCTTTATTGTTTTGGTCTTCACTGATGCTCCCACTAAGGGCTTGGTCGACAATGCCTAAAATTAGTTGTGCCTGTTTTGTTTTTAACTTCTCGAGATCGGTTTTATATCTCTGAAGGCGAGCTATTACTTCAGGTGAGCTTATGTCTTTGTTTTTGTGGAAAAGATGCGTAATGTATAAAAGAGATCCAATATTTGTGTCTAAAGATGAGTTGCTCTGAGAGTTAACGTTAATATGTTTCAATAACTCTTGGGTTCTAACATAAAGCAAAGCGCCATATGATGATTTCTTAACCTCAACTTTTGACCATACATCAATATCTACGCTTGCCTCAAGCTTGCGGAGAGTAATTTTGGTCTTGCTATTTTTGCTGTAACAGACGTCTATTAGGCACGCATGTCTCTTTGCTTCCTTAAAAATACACTCTATAAAATTAATCTCGTGCCGCGGATCAACAATCAGATCATAGTCACCGGTTTCAGCCTCTTCTGCGGTTAGTGGCTTATATCTGAGTAACGAAACATGCTTAAGATTGCTGTCCAAAGCGGCTCTAAGGATGCTAACTGGGTGATCAGCCGATGATGTAAGGTGCTGTGTGTTGTGATTCATATTTATTCCATCGCAGAGCTACATCATGGATATTTTTATAAAATGTTGAAAGCATCTAATTTTATGGGCCATGAACGTCTGTTGTATGTTTAATTATTACATGCGGTTCACGAGCTGAGTGTAATAAAAGTACTCTCTAATTGTCAAAGGTGTGCAAGGGTCCATCTTTTTATTCGTTCTAGTTTGTGGATTAGATTGCTCTCCTCGGGTGAGGTTTCGCGTCCCCAGCCATTGATTATGTGTGAAAACAGTTCGCAACTAAAAATTTTTACTGCGGTAACGATGTCAAGCTGGGGTCGTTTGGAATCGATAAAACTTGATGCCAGACGCTCGGAAGAAATCCAAAAGTCTTTTAGTTTTGCTTGGCTGATTTTCTGTTTGCTGCGAGTTTCTCTATAGTATA
AATACATTATGTCAAAGACATGGGATCTGTAACAGGAATGCTCCCAATCTATGAAGCGAGGCCTCCCTGAGAATAGCAGTATGTTCTTTGTGCTTAAATCTCCGTGGGAAAATACCTTTTCAATTTTGGCGTCTTCTAGGCTTCTATATACTTCATTGTACCAATGGCCATAGTCCAAGACTTTTTTGTTGTTGGCCGATAACTCTATTAGCATATCATATATATGGTTCGGGCTGTGTAATATCTTTTGGGGTGAGCAAACACCTTGCGAGGTGATTTTTTGGAGTTTAGCAATAAGCTGGATAACTTCTGATTGGATATGAAAATTCTTGATGATTGAGCTAATCTGGTCGTCAGCCATCTCACCAATATGACATTGTAGGTTAGCGTCATACCGTAGTTCTGGTATTAGTTTAGCTGATGTAAGTTGTCTCAAACAGCTTGGATCATCTATACTTGATCTTAATGAATCTAAATCTGTGTCTGAAGATAAAAAACAAACTGCTTTGTTTTCGAAGTCAAAGATAATTCGGCGGCTTTTTGGTTCGCGGACGTATGCAAAAAGGCCGATATCAGGGAGGGTTATAATCTTATCTGATGAGTAAAGATCTGCATTAATAAGTGTTTGAGTCAAAATGCATCGTGTAAGTGATGAGTAATGGGCTGCTGTTGGTGTGGCGGCAAAGCTGGTTGATATACGAGGAATTACCGCTCTTCCGTTGCGGTATGCAAATTTCCAGTCGGCGAAACTATGCTTATTAAGATTGATATAAAAGCCACGCTTTAGTTTGGCAAAGTTAGCCGAACTGATATTGATCTGATTGTTTGATGTTTGGCTCATGGTTCACGTTCATGCATCATTTAAGCAATCAATGTGTATGTTCGCATTGGTTCTTAGCATAGTAATCTCGCAGTTTGCTGCCAATGTAAAAGCTAGCCAAGCAAACACCGACGAAGAAGAAGACTGTTCGGCCGAATGATTGGTGAACCCTGACTGTAGCTAGTAGCATGATGGCAAGAGAGGTGGCCGAGCTAATCGCAAATATCGCTGGAAATGCGGCTCGGCCGTAGTGCCTAAGTGAGGTTGCTGCCCATGTAAAGAGCCCAAGATATGTGCCGAAGTATAGTATTGCCCCGATAGCTCCAAGAGCATATAAAACTTCAATATACAGATTATGAAAATGCACTTGTCCAGGTCGGGCTTCCTGAACAGGTTCGTAATTCATCTCGGTGCCCCATCCGAGCAGTGGTCGCTCCTTCCACGCCTCTATGCCAACACGCGGGAACTCAATTCGGAGAGCAACCGATCTGCTTACTGGATCTTCGGCTGCAGCATCAGGATCAGTCAAATATTGCTGAATAGTTTCAACTGTTTCTATTGCACGATCTTTCGCGTGGTGAGTTTGAAATAAAGCAAGTGTAGCTAAAACGATCGTTATACATGATGCTACGGCTACAATAGATACAACCTTCCAGCGTGATTTGCTGCTTGATGGTAGAAAAATAGATAACAAAAAAGAAGCGGCAATAATCCCGCCGACGCTACCTATCCATGCGTTCCTCGAGCCGGTAATAATAAGAGTCCATGCGACAAATGATAATAATAGCAAATATCCCAATATTGTCACGATTGACGATAAAGAAAACAAGGAGTTCTGGGAGTAGGCTCTATAAATAACACGTATAACTGCTACAAAAAAGCCTATAAGAGCAAAACTAAGTGCCATGGCAATCTGATTAGGGTTGCCAGGCCCGAACAAACGCCTGCTGTCCCAATCCAATGCAAACAAACTATTCCAGTCTAAATCTTGCAAGAACCAAATTGATAGCCCAGTCAACGCGGCTAATAAAAAGAGTCTTATCCGGTATATGTTGTCTCTAACGAACAACGCACCAATTATGAATATCAGCGCTGAAGTGCGAAGCAGATCACCGGTTTCGGGTCCTTGTTTAACTGGGACATCAAGTTGCTCATACAATGATATGCCATGTCGCAAGCC
AAGGTAGGCAATGAAGACAACACTTGCCCAGAAAACCAAATTTCTACGGAGGACAGGCCAGAATTCTCGTAAAGCTACAAAAAAGGCCAATAATAGTATGCCCCAAGCTAGTCTGTTGATGCTGCCACCGAGCCAAGTCCCTGTAAAAACAATTATCAGCGCTGATACAAATCCAACTATATCGGCTATATGAAACCGAGTGCATTGTGGAAGCCAGACATCTGTTTTGTAATTATTAAAAAAACTAATCATGGTTAGATTTAGTTTGCTGCGGGCGGCACAAAAACAAGTGAGCATGTTGGGTTAGAGAGGTTATAATGCCTATGTCTGTATTGTGAATTAGGCTGGAATTGCCTTTTTTGTGTTCAATCGCTTCGAGTTGAAAGCCACTCGATTGCAGCGCATCTTTTATTTCCTGAAAAAAAGGTGATGCTTGCTTGTGCAGTTCGAAAAAAATGAAGCAATAAGCTCCGCTAGCGAGTAATTTTTGCATGCCTCTAAACGCATATCCTTCATATCCTTCTATATCCATACGTACTATGATTGGCGTATGGGTATTTTTTCTGTGCTTGTTAATAAGATCGTCTACTGTAGTTATCCATACTTCTTCGTAGGATCTTTCTGTGTTGTCTGAACTTGTGAAATTGTGCGTATTAGATGTCTTGCCTAATGACAATTTGCCTAGTTTATTTGATTCGCCAATTCCGCAGTGAATGGGGGTAATGTTTGTATATTTGTTTAGTGCTATATTGTACTCTAAGCGCTCAATGTTGCTTTGCTCAGCTTCTATTGCAAGAATGGTAGAATCCCCGCGTAGTGAATTTGCACTAATGAGGCTGTAGTATCCAATATTCGCTCCAATATCCAAAACCAATGGGGGCTGACTAAAGTTTGAGTCGATTTCCTTTATTCGGTTCTGGAAGTATACAGTGCTAGAGTGTTCCCGCGTTTTATGAATTGCAAGTTCATGTTCTATCTTATGTTGAGAATGCGGATCTATATTCAGCCGCATCTTGCTGCCGTGAATGGTGCAATAAACATTGTTATCGGGATCAGATAAACGTTTCAGGCGACGCAGCTTGAGTTTGTCTAGCCTTTTTCTGAGGCTTTTCGAAGCTCTGATCCTTGGTGTGAGATCGCGTAATCTGCTTAAAATTGATGCCATTGGTGAAAAGCCCTAAGCTTGTACCATAATGTGTTGTAAATGAGCGAGTCAACAATCATTCTTTTTTTGTAAAAGCCTAAGCGGATAGTATCTGCTGATCCAAGGGTGGCGCTTAAGCATTGACATGCTCCAAGGTTTCTTGTCGTTTTCGCCTGGAAAAGCTACTAGGCGACAATTGTTAGGCAAGTTACCTTGATGCTTGGGTAGCCAAGCTGCACGATATATGCCTTCGTTTCTCGTTAGTATTGATAGTCGGTCTCTGGCGCGTATATATATCCAAGCCTGGTCAGTGCCGCGAATTTTAGCATCTACAGTCTCATTCGGCGATTTTGTTGGGTCAAAATCGTACCATAGGTCATTAGCATATCCGCACTGCACAACGAGGATTGATGAATTAACCGGGCATCGATAAGGCCAGTAACCACGTCTCAATCGGTGTTTTATCGCGGAAAATCCGCGACGTACCTTCATTCCATCGCCAATTGCCAATGGCTGATTAAGCTCGAAAAGGTTGGAAAGGTCTCCTGTAACAACAACGTCCAGGTCTAAATACATAAACTTATCTTCATTGATCAACTTTTCATGTTCTGATGAAAAAATCCATAGCTTAGCGTAATTGCCTTTTGTTTTGTCAATTAAATATCGACATTTATTCGGTAGTTCTATTGGCTCGATTTGTGCGTCTAGGCCCGAAGGGTCATCAGTAACACAGATCATACGAAAAGGCTGGGTGTGATGAATGCTTAGCATCTCATGGAGCGCATTAACATGGGCTGATGAGTACGGAGCCCCTTCAGATGGTGTTCCCCAGCGAAAGCATACAAAGACTAACATG
ATTTTGAGGGCGCCTTATTTGAAAAAAACAGTATCAAATATAGCATCAATATGCTCCAATTGTGTATTAAACTGGGCACGCTAAACTGCTGCAAAGCTTTATGTCCAATGACTTTCGTGTCCATTTATTCATATAATAAACCTACTGTAACCAGACTGAAGTAACTTTATATAAGTGGCTGTGAATTAGCTATACGTCGGCCTCGCGAAGTGGTTATTCATGCTCCGGATGCCACATTATTCCAACCCAATGGTGGGAGCGATGACGTACAGCTTCAATTGTGCCATCGTCAGACCACGCCAGTGGAAGCATGTCATCGCCCAGATCAGTCTCACGGATAGCGAGAAAGTGGAAGCTGTTAACTGTGGGGTTACAGTTTATTTTTTCCACGTCGTCTGATCTATCAGTGATTCCCTCTTTGCCGCATAGGTATAAGTCGAGCACATCACCTGTAATGATAGCATGGCGACAGGCGACATGCCCCGAAATCGGCGTTAAACGCCCACTCTGGTAGTGGTTCAACATCTGCATGCCGCGACAAATGCCAAGAACTGGTACAGAGTAACCGATGCTGTACTCAAGGGCTGCTCTCTCAAGCGCGTCCCGCGCCGGTGCCGAACCGATATCATTGCCGCCAGATAGCACAATTGCCTCTGGGGCAAGTTTACACAGATACTCTGCGAGACCATTGCTCGGAACTCCGCTACAAAGAGGAATTGGCAAGAAGCCTAATTCCCAGAGCATAGCTGCCAACCGCACATCAAGGGTGTCGCGATTTTCTGCGCGGCTTGGAAGTGAGTCAATTCGTTGTGTTAGGGCAGCACGTTTAATCAAGCGTTAATCATCTCTACCGATTTGGTCTCGCGGGTTAGTTGATCACTGAACACTTAGTAATCCCCACCAGTAGGTTCAGGAGGTTCACTCTAGCGTTACGCGCCGGCGGTACTCCTGGTGCGCTGTAACCATAAACTCAGTTATGAAGGCGCTAAATATTGCTAGCATGCCCCCTAAGACAATCCCCAACATGATTATCAAACCTGTTCTGGGTTCTTCCTCTTCCAGGTCCTCCTTTGTGAGGTCTAACAAGTCTTGCCTAGCTTGCTCCCAAGCAGTTTTAAGTTCCTTGCGCTGGTCCAGTAGACTTTCATAGAGCCACACGCTACCGGCCACATCAGCAATCCCGGCGTCATCCGGCAGCTCGGTTAGGCGGGCTTCTAACTTGTGTACTGCCCGATCTACGTGTTCCAGTCTTGATTTGATTTCTGTAATCCTTTTCTCAGTCTCTTTTTTCTTGGTCGTCTTCTCTGTATCGATCTTCTCTTCCACCGCCTCTTCGGTATCTGTCGGTTCGGGCTGGGTAGCCGTCTGCTGGTAGACTGCATACACTACCGTGGCGACGACAACTAACAGGAAGACCACCGCCATGATGCGCCAGCGGCGCAGCATTGTAATGTAAGGTCGATCAGGGATATCTCGTCATCCTGAACTGGCCGCCGGGTCGATCCATGACCGGTTTGCCCAGTTTGTGCACCTTGATCTTCGGTAGAGGCCATCTTTTCCTTCTTGTCTTATGTGAGCAAATAAGGTGGGTTATCCGGACTAGGCTAGGTGTCACAGGTAGCGCTACGACGGCTACATTAGCCACGTTAAGCTACAAATAACATCGAATTATTGCAAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAACGAGTGAGGCCAAAAAAGGCAGAAAAGAAGCAGGTTTGTTGTTGCCTTTGTTGTTGCGTCTATCAGTTAGTAAGGCGGTAAGCCTCTGATGAGCCATGAGGTTCATTGGGCGTAATGTGTATAATGCTGAGTTCTCGTAACTGCCAGGGGAGGAGTCAGCTTTGGTAACCCGTAAAAGTGCTCTAAAGTTCCTGGGCGATACCGCTGTCCTGTGGGTCGATCCACAGCTGATCAC
ACACCATAGAGGTAGTAAGTTTCCACTCACTGCTCAAATTCGCGCTAAGTACAAGAGCAAATTGCGGCGTCCAGTGACTTGGGCTAGTAGGCGCTGGCACCCCTTTTTCGTGAGGGAGGCTTGGCTACCTCCAGCTGAACCAATCGAAAGCGAGGGCAAGTATCGAAGAGTAAGCGATCTTATTGCAAATAGACAAAGGCTGGAAAGAAGTGATTGGTATCAAAGCATGATCGACCGGGTGCGCACTAAGGGGTCAGTGCGCTACAAGTTTCAGAAGATGTATAGTGAGTCTGAAGTAGACGCTTTCTTCAATGAGCATGTTGGGCCACTGATCGAAAGCATGGCGAAAGAGGGGTACAGGGAAGATCTGGCTCCCGAGCATGGTACTGTGTTAGTTGGTGAAGACGGAAGGCTGTATAAGACAGGCGGTGGAAGCCACCGTTTCTATGTGGCGCGCGAATTGGGTGTGAAAAGCGTGCCCGTTACGGTTGCTAGCGTCCACGAAGATTGGGTTCGAGCCCAGGGTGTTGTACCCGATCGCCACGGGCTGCGTGAACTGCCAAAGCTGCTAAGATCTCTGGGGAACTGCACTGACTAGCAGCAGGGTAAACCTCATTACTGTGAGGTGCGAAGCCCCAAAGGAGCTTGGATATGGTCAATTAACCTAGATGTCCAACGGGATACATTGCAGTTTGCAATCACAGGCGCTCAAAGCGCAGCACTGAGCAATGAACCCTTTATGTGCACAAAGCTACTCCTGTGACCGGCAAGTAAGTTCCGAGACGGCTGGATTCAGAAGTGGTAATGCATAATCCAGGATCAGCCTTTGGAAAAAACGAAAACTTTTTCGCCTCTGCGTAACGCATACTTGACTACATCTTCTCCCTTTGCAGTTACTTCCCTTTCAAGCTTCAAGCCTCCGCGCTCAATTCTGTCCTTGAAATCTCTGCCATAAAACCGCACATGGTCTTTTTGCCCGAAGTGAAGCAGACGGCCATGTTTTGTATCGATGTCCTCATTCTCATAAGTGGTGTCCCAGCCTTCTATTATCGGAACCATGCATATTAGCAAGCCATCTTTTCTTAAAATGCGACTTATTTCGCTAGCACTCCTTAGGTCATCTACATGTTCTAAAACATGATTGGCAATGACCAATTTTACTGAGGAGTCTGGCAGGTTTATGTCTTCTATATTTAATATTAAGTCTGCAGTCTGTTTAATATCGGCTGTTGAATAATTATTCCATCTCTCTCTTAGCCGTTTTTCTAGACCTTGCTCTGGTGCGAAATGAATGACCTTATCATCAACAGCTATTTCATCGGCGATCAGGTTTCTTTCAAGCGCAAGCCATAAAAGCCTATGCCTTTCTTTGCTGTAACATTTGGGACATCTGATATCTATCCTAGGAGGACTTCCAGCTATACCAAAAAATCCTTGATATCCACATATGCAGCAAATTCTTTCAGCTATTGGTCTCAAGGACTTCAATGCCTTACGGTATTGCTTGATATTCCAGAGCAAGTTATGCAAAAATATTCTTTTTATATAACATCTAACCTTGTTTTTCATGATGGAAACCGTTGTTTGTGGTAGAAACTTTATTCAGATTAAATGCCGCTATTAGTGGTTTGTTTTCAGGAACTGTTGCCCCAAATAAGTGACAATTCGAAAAACGTATTTACTGGTTCTAGGCTAGTCGTAACGCTTCTGTGATAGTTAAAACCGACTTCTTATGTTTACAGCAGGATCAGTTGGCTTTAATCCCCGTAATTACATTTTACAAAATTTCCGGCACCTTCGCAATCTGTGTGGGTGACACCCCCATCTACTCACGCAACCGCCGATCTACATATTGTGCCATAGGGTGAGGCAGGCAGATGGGTGAGCTTGCCAGCGATCAGGTAGCAGATGGAGATCAGATTCTTCGTGGTGCGAAATCCTCGTGCACGGCCCTTGGCAACTTGAATCTGACCGTTCATGCTTTCGACGGGGCCGTTTGA
TAAGCGAGAGTCAAAGGCGTTGAGGATGCCCTGCCAGTGAGCCTTGAGCGTTAACGCTAGGCGCTTGAACGGCTCCAGCCGGCAGCGCCGAGCCCAACTGTACCAGGCGTTGAGGTTTTGCTCAGCCTCTCTGCGAGCACCACTGCCGGTCAGTATTTCGCGCAGTGCCTCCTTGAGCCGCCAGGCGCGGGCTGTTTTCAGCCTCGTCCGTGACAGGTAATGCAGTTCGGTGAGCTGCTTGAGCGTGTGCCGCGAGGCATCCTTAAGCCAGACCCAGCGAGTGCGCTTGAGCGCAGGCTCCAGGCGCACCTCGGCGCGGCGGACCTCGTCGACTGCCTGATTAGCCAGCTGCACCACATGGAAGGCGTCGAAGGTAATCGCTGCCTCGGGTAGATACCGCTGCACGCCGGCGATGTAGGCCTTGCTCATATCGATGCATACCGCCTCGACAGAGTCCCTATCGCCGCCGTGATCGTGTAGATCCGCCACAAACGCCCCAATGGTCGACTGGTTCCGCCCTTCGCAGGCAAAGAGTAGGCGTCGGTCATCAAGGTCCTGGAAGACAGTGATATAGTTATGCCCGCGCCGCGAGGCCGTCTCGTCGATGCCGACAGCGCGAACGTTGCTGAAGTCCTCCTGAGCACGGGCTGCTTCCACGTAGTGGTTGAGGATTCGCCAAATTGGGTCATCACCTGTTAGCGGCTTTGGAAAACCGTACAAAATTGGCGGAGGATTCAGACATACTAATACGTTACTGAGATCCACCGCCATGTCCTTTTCCTTACAGGTGGTAAATTAACTCTCTGCTTTGGCTTTTTTGCTCGGAGACTTGCGCTTAGCTTCTCGGGTGTTTCTGCGTTCTTGAGCTTCCTTGTAGCGATATGACTCGCCGTCTATGGCGAGCACTTCGGCGTTATGCAGCAGTCTATCGATCAGGGATACGACACATGCAGCACTCGGGAAGACCTCGCCCCACTCGCTAAAGGGGCGATTGGTGGTGATGATAATAGAGTTCTTCTCGTACCGGCGGCTGACCAATTCGAAGAGCAGATCGGCATAACGCGTCGAGTAGGAGAGATACCCTAGCTCGTCGATGACCAGTAAGCGTGGCCTAGTGAAGCGGTTAAAGCGCTGCTGGAGTAGCCTCTCACTGTCTTGTGAGGCCAGCTCGGCGAGAATGGTGCTGGCGTTGACAAAGAGCGCGGTATAGCCGGCGATCACCGCCTGGTAGGCGATATTGCGGGCTATCATGGTTTTGCCGACGCCATTACTGCCGAGCAGGACGACATTACCGGCCTCGGGGATGAATTCCAGGCTCATCAGGGCATTGATAGCGCCGCGGTCACAGCTAGTAGGCCACGACCAATCGAATTCGCTCATCGGCTTGAAACGCCCTATATGTGCTTCGCTCAGGCGGCGCTCGAGAGAGCGGCGGGTGCGCTCTTGCTCTTCCCAGCACAGCATCTGCTCGATCCACTGCTTATCCTCGATCTCTTCCCAGTGGGCGAGGATGCCATGCAAGCGCAGGGCCTTGGCCCGCTCAATCATTTGCTGCTCAGGTGTTTTCGGGGTCATTGTCGTCTTTCTCATAGTTAGGGGTTAGGGCGTCGTAGCCATCGAGCCCGCGCAACCGGATGCTTGGCTCACGTTGACGTAGCTGCTCGGGTAAGCGCAGAGGCAGCGATGGCGGGCCAGGGGTTTGATCGCGGCGTTGTTCGAGGATCTGGCTGAGGGCATGTGGGTGGGGTACACCACGCTCGAGAGCGTGCTCAATGGCGGCATCGAGCTCGCTGGCGCCGTAGTGATCGAGCAGACGAAGTAGCGCAGCTGTGACGCTGCCGAGGCTATTGGTGCGCGTTGCCGCTTGCGAGAGAAACTCCTTGGCGCGCGGTACAGAGCTGCTAAGCCGGTCGGTGCCACGATGTAGTCGAGCAGCTTGCTTCTCTTGGGCTAGCTTATCGATATGGCGCGGGTCCTCTATTTGAGCGTGGCGATCATAG
CTGCGTGGATGCGAGGCTATAACGTTCTCGCCATCAAGCACCTGGATCCTCTCCAGAGTTGCGCTGACGCTGAGTGTTTTGCGGACATATTTGTGCGGCACCGAGTAGTCGTTAAGGTCAAAGCGCACATAAGGGGTTTTGCCTACCTTTACAGGCACGTTCTCATAGCAAGGAAATGGCTCGCTGGGTAAGGCACGTAGGTAAGGCTGCTCCTCGGCAAATGCCTGGCGAACACTGCTATTTTGCTCCTGATCGCAGCGCCGCTCGGCGGCAATGCTTACGCACCAGTGCTGCGCCTGCTCGTTCAGATCATCAAGGCAGTTAAACTGCCTACCGGGCCAGAAGCTGGAACGCACGTAGCGGATTGCCCGCTCGACACGGCCCTTTTCGTTGCCTCGGGCTACGGCTACTGGCCGTGGCTCAAAGCCGTAGTGGGCGCTAAGATCGAGTAGCTGCGGATTAAAGCGGATCTGTTCGGCGTAGCGCTCCAGCACCGCGCTTTTGAGGTTATCGTAGAGCAGGCACCTGCACACCCCCTGCCAGGCCTCGAAGGCGGCGACGTGGCCACGCAAAAAGTTGGCCGTCGATGAGCCGAGGTAGAAGCGCAGAAAGATCCAGCGCGAATAGCTCAGGACCATGACAAATGCCATCAGTGGGCGCTTAGCCCGACCTATGGGCAGTTTGCCGAAATGGCCCCAATCGACCTGTGCCTGCTCCCCGGGCAGGGTACGCAGGCGCATATACGCCTCAGCGACCGGTTTCGGCCGGTACTGGCTTATCAGGTGGCGAAAGTGATCAGGGCCGCCCGAATAACCACGCTCGCGGGCCATGTCATAGAGCCGGCTAGCTGGTATTTTGGGGTAATCTGCCAGGGTCTGCTCGATCCAGGGTACATAGGGATCAATGATCGATCCTCGCCTAGTGCGTTGTGCACTAGGGATACCGGCCTTTGCCAGCACCCTATGGACGACATCGGTGTGGACATTGAGTTGTCGGGAAATGGTGCCGACGCGCCAGTGCTCGGCGTAGTGATAGCGCATGATCTGCGCCTCAAGCTCTTTGGATATAGCCATATCCTTACACTCCATATGCTCGGCGAGATGAGCGCAGAAACATGCGCCTCGGGCTTAGCGCAAGGCAATTCTCGCCGGATCTCAGGGACCCGAGGGGCGCAACGGCTCGCGGCGAGAATAGCGGCGCAGGGTATTTGTGCGGACAAATACCGAGCACGGGCGAGCGCAGAAGTCACAGCGCGGGGCGCTCTCTGTTAGTCGGCGCGCTTGCTGATAATCGGGCCGCTCAACAGGCCGTGCTCGAGCTTGGTTATCTGGCTGCGAGAGCTTAAGTGAAGCATCACTATCAACAGGTGTGGAACCCTGATGCGTCACTTTTTTGTCTCGGCTTAGGGATAGGGATAGGGGTCGAGTCCGGGCCCTGGCTCGATAACGACTCTGGCGCTCTGCATGCTTGGCTTTGCCTCGCCTAGAGTCTTGATAACGCCTACCCGCCTCGCGCAGGGACTGGCGGCGGGCAGCAGTTGCGCATTGCTGGCTACAGTAGCGATTGCCGCGATCGCATTGACTGCAGATCATGACCTGTCGATTGCAGCAATCGCAAAAGAAAAGTCGGCGGGTCAATGCTTTAGGCTGGTTCATCGGGGCGCTTATCCAGGTAAATTGCAAACACTGGACAACGGCGCTAAACCCGAGGATACTGATATGTGCCGTGTCGCAGGACGCGGTGGGCGCGGCATCAGGAGGGTTGTCAGCCTCTGGTTACCTGATGTCGCGTCGCCCTTACTACTTGCCTACCGATTCTCACCCTCTTATAAGCAAATCTCCAGTACTTTTAGCTGCTTAAGCAATAGCTCTGTGGACTTGTGGACGAGCGCTTCGCGCCGCCCTGAGCCCTACGGGCCGTGTGGACAGCCCGTGGACAACCCTGCGGGTTGCCCACCGCCTGCCCACACTCTCAGGGCTCTCGCCCACAGGGCCCACAGA
GCCAGCTACTGGTTTATACGGAAAAGAAGGGACAAGCGAAAGCAGAAATCGACACAGAGCGTGGTAGTTAGTCTTAGCTTTTAGCTACCCAGTAGTTATGCGTGTTCTTGAATCCTCCGCCAAATATGTACGGATTTTGGCAGCCAACAACATCACCAACACCCAGGTAGTTAGCGACCTTCTGGACCGACATCTGCTGGCAGAGGGTAATGACCATCGCCTCGAACAACTGCGAGAACCCGCTGCCTGGACGCGCCCAAGGCACCTCGACCTGAGTGGTCTTCCCGCAGTCCTGACATCGCACCCTGGGCACGGCGGCATGGATATAGGCCTGGTGCTCGAAGAAATGCAGGTGCTGCCAGGAACGCTGACGGGTGTCATGAACTGCTTGAGCCTCGGCGCCACAGGCCGGGCAGGCGAAGCGGCTGCCGCTGGTGAAGGTGATATCGAAGTCGATGCGCTTGGCCTGCGCATCAAAGCGGATGTCACTGACTTTCCAGGGTGCACTCAACCCAAGCGCGGCGGTGAACAGGGCGTGCTCAGTAGCCATGGCAACTCCTCGCAAAAATGGTGTGGCGAGGCACAACATATAGGATCACACCCAGCGTTGACTACCTCGTCTTAAGCTGCTGTCACCCACACAATTCGCGAAGGGACCAAATTTCCGACCAGCTCTCTTCAATAGAATCCTGCCGGTTAAGGTTCCAGGTTTCTCTAATGGTGCCTATCATCTCGTCCATTTCACCACGCATACATTGAGAAAGTCTTGACGACAGCTCATTGAAATCCCTGGCATTAGCACCAGGGAGAGCCACATGATGTTCAAAAAGTAATTCTTTCTCATGTTCGACATTAACGTGTACTACTGGTTTGCCCAAAGCGATTGCTTCCATCATCACGCTGGAAATATCCGTTATCAAACAATTCATTCGCGAAAGCAACTCAAGAGAATCAACAATGCCATCTGCAGGCAAAATCACCATGCGGCTGAGTTGTTTAAAAGGCAAATTAAGATTATGGCAACGCACCAAGGGGTGAAGCTTGAGGTAAAGTGTCATCTTATTTGCTTCTAAATACTCGTCGAATCGTTCCCAGTCAAAGCCGTATATCTCATTAATACGTGGTTTTCTTGTTGGTGCATACAATTTGTGAAAATCACCTGCGTGCTTTTTCAGCACAGACTCTATTGAGTCTGGCAAAACTACCTTTTCATCTCCGTTTCTTAGGTTCATGGCTCTGTAGAAGCGTGGATAACCTAAGGCTCTTATCTCAGGGACAGCACACTTGTCATTCATCTCCTTCCAGTAGCGCATGTATGACTCAACCGCGCCCTGAGCTATCGAGAAATGAGCATAGGGAGCCATAACCGTTTTGGGGTGATTGTTAAGATTCTTGGTTGTAAGTAATCCGGTATATTCGTGTCCTTTAGTTATCATGCCGTGGTAAAGCTTGATTATAAGCTGGTTCCTGATTTTGCCAACACGCCGCAAACCCGATAGCGTTATTTTGCTTGTAAATGAGTCATTTTGCCAAAAAATGGCCTGGCTGTTATAAGTGCCCAAAAGACATCTTTCTTCTTGTAGATGAAATCGATGCATTTATGGTTGTGGGCTCCCCGGTGTATTATTGCTTCAACTTCTTCAGATCTATTTGTGGTTCTTGCCACCACTTTTATGGCTTTATAATCTTTTCTTTCCAGAACGCGAATAAGGGGAAGGGTGTCTGAAGACGGTGCCGACGCCCGCAATGTAATGACAGCGGAATAGCGGTCAACTCGCACTAAAAGGGCTATAACCGATATGCTGTACGAAAGGGCTAAGTAGTAGAGGGCTTTAACCATGTGACGCGTTCTAGGTTCCGGGCTAAGTCGTGTTAGTAAGTGCCTGCAAAGCATGCACCACAATTGGCTTTTCTGTTTATTATAGGGATGAGAAATAATAGATTAAGATTATCGGTCTTGGTTACAAGAATCAAGCACTGCAGTATTGCGGAGGGCCATG
ATGCTCCAAGGTTATATGTTCTTAGGGCGTGGCATAAGGGCACCTTTGAAGTCTCCCCCGGAGTCAGTCAACATCCACGGGTGGGGGCGCCGTGGATCCATCCGTGGATGTCATGGCGCCATCCCAGGCGCCATAACACCCACACACTCGGTAGTTGAGTGACTCCGGAGGAGTTTTCGGAGATTACTAGAGGCAATAGGCACTTAATAGTCGGCTTTTTGAGGTGCCCACGAGTAAGTGCCGAAACTCTAGAGTGTTATGTAACTCACCGATCTGATACCTTCGACATAACACAAACAGCCTGAAGCACAACATCGGTTCACCTGAATAGGTTTGGAGCTGGAGTGACCCATTCTAAGACGAAAGTGCGAAACCCCATATGGATTCATGCGTTAAAGTCATACTACTCATTCAGGGAGAAGCTTCACAAGCATGGTTAGGAAGATAATTTTCGGCGTCCTGATTACCATTTTAATTGCAATCGCAATCGCCTTCACAGCCTTAGGCTGCTCCGAGGAATCTGACCATTGGCGGGCCGCGTCGCCTATGCCAGAACAAAGGGGACAGCATGCTATCGCGGTTCACGACGGTTCCCTCTATTTCTTCGGCGGTACGGTGACGGAAGATGTCCTAGCAGAAGGTGTCTTCAGCTACGAATATAGCACCGATACTTACACGGAGGGTCTTGCCTCCATTCCCACTAAGCGAAGCCGGCTTCAGGCTAGCACTGTGGGCGATTTCATCTACGTAATCGGCGGTTGGGATAAATCCCGCTATCTCGGTACGGTTGAGAAGTATGACCCCTTAAAAGATTCCTGGACCACCGGGCTTGAGCCTATGCCCACCATCCGGCGCGATGCTGCTCAAGTGGTCCACCACGGCAAGATCTACGTGATCGGCGGACGGGAGAAGGGCAAGCAGGGATCCACAGCCAATGAGGTCTACGACCCTGATGAGGATGCTTGGCGCAGCTTGGCCCCGATGCCGACCTACCGGCGCCAAATCACCGCCGAAGTACATGATGGCGTGATCTACGTCTTCGGAGGGGAAGATAACGAGACAAAGGACATGGACGTGGTCGAGGCATACCATATCGACGAGGACCGCTGGGAAACCGGATTGGCACCAATGCCAACTGGGCGCCACGAACCTGCCTCTGCAATCGACTCAGAGCGCGGCCAAATTTATGTCACTGGTGGCGACGTGGGGTCCCGCGACGTTACTGGTGCCCACGAACGCTACGATATCGACACCGATACTTGGAGCACCCTTGACCCCCTTCCAATACCCCGCTACAACTTCCATGGCCGCTGGCTCGACGGCGCAGCCCATTACCCAGGTGGCCGCGTCCCCGCGGGCGACGGTGTCCGAGATTACCACATTTATGATCCAGGCCAAGACTGAAGGGAACTATCCCCTGACTGTCAGGCTAGCACGTCCTTGACAAAAAACGTCGTGTTGCACAACTACAGCACGCACCCCCAAGTTAATCAGGTGATTAATGATTGACCTGATTTGCGAACCTGTCGAGCCTGTGGAGGAGCAGTCGACGCAATAGACCGGCAAAGAAAAGAGGCTCCACGGATTCCATGATGCTAATAAATCCCTCCTCTGCTCCAAATCTGACATCATGCGCCGCCCCCCAGAAACAAAATAGCCCCAAGCTTTCGGCACACGCTCGGGATGGCTGAGCGGGCGTACTCGCTTGGGGCGAATGTAGGCAAGCTGTGCCGCATCCCTGTCCTATAGTCAAGATCTCGGCATAGGCGCGGGGGTCGGTTTGGTGCAATACCTACGAAGGTGCCCAAAAATCTTTCGCTGCAGCCAAGTCCGCTTCCGAATCGACCTCAGCCCACCCATTCTCAACAAACGCCGCCCGCGCCTCCCAGCCGCTATCGATCAGATGCTGCAGGAAGCTGGTCATGTACATGTTGTCGTAGTCCTTGCCGTCGTAGGTGGCATCGCGGTCCATGGCGCGCCAGACGGCGGGGAGTTGGGGG
ACGAGGTCGGCACGGACCTTGAGAAGCCCCATGTACTGGCCCTGGATCTCGTCGTAGCTCGACGGCTTCTTGCCCAGTTCGGTGATCCGGTTGCCGTCGGTAAGCTTGAGCGTCTCGGCATCCGCCAGCGGGTCGTCCATCCGAGCCTCCCAGTAGCGGCGCCAGGCGCGATCGACGGTGAGGCAGACCGGGGCGTTGCATGCGAGCAGGGCGTTGAGCACCCGCGGCTCGTAAACAATATCGCCGTAGGCGATGATCACCTCGTCGCTACCAGCCATGACCGACTCGGCGGCGAAGAGGGTGGCGACCATGTTGGTTTGATCGAAGCGCTCGTTGATGTGCAGGGTGATATCCGGGCGCTGCAGCCACTCGGCCCGGTAGCCGCCGACGACGTGGATATCTTCGATACCAGCGCCGCGCAGGACTTCGAGTTGGTGCTCAAGCAGAGGCTTGCCCTCAAGCTCGACCATGCACTTCGGCCGATCGTCGGTGAGCGGGCGCAGCCGGGTGCCTTGCCCGGCGGCCAAGATGATCGCTCTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP017372|340689:389155|375826_376612_-|WP_162549288.1|DBSCAN-SWA MLVFVCFRWGTPSEGAPYSSAHVNALHEMLSIHHTQPFRMICVTDDPSGLDAQIEPIELPNKCRYLIDKTKGNYAKLWIFSSEHEKLINEDKFMYLDLDVVVTGDLSNLFELNQPLAIGDGMKVRRGFSAIKHRLRRGYWPYRCPVNSSILVVQCGYANDLWYDFDPTKSPNETVDAKIRGTDQAWIYIRARDRLSILTRNEGIYRAAWLPKHQGNLPNNCRLVAFPGENDKKPWSMSMLKRHPWISRYYPLRLLQKKNDC >NZ_AP017372|340689:389155|360091_360388_-|WP_096407530.1|transposase|DBSCAN-SWA MADHPANPHDALLKVTLETPERAAVVLRESLPDKVRERLSDDLPTPLPGSYVDPSLQETHSDRLFEAQMRDGRPVFLYVLIKHKSTPGRLSRTWIETI >NZ_AP017372|340689:389155|361732_362185_-|WP_096407535.1|DBSCAN-SWA MNTVDDRQPLPDQATATLARASAEEISRLLTQLPEADRARVQLDGRDLVLPRHALALLRDLLAEMAQGNAVTIVPTHAELTTQQAADLLNVSRPHLVKLLEQGAIPFTRVGTHRRIRYEHLMAYKAQRDESSKAALDELVEQAQEYNMGY >NZ_AP017372|340689:389155|348458_349724_+|WP_096407506.1|DBSCAN-SWA MQENASPRYVSRLTRNTLALILAGGRGTRLKHLTQWRAKPAVPFGGKFRIIDFPLSNCVNSGIRRIGVLTQYKAHSLIRHIRQGWSSLRADFSEFVELLPAQQRIETSWYLGTADAVYQSLDIVRMHNPELVLILAGDHVYKMDYGPLLAYHVEKGADVTVGCIEVPLDEASAFGLMNINEDNQVVRFEEKPADPTPMPGSQTHSLASMGIYVFNREFMFKALGVDARTSSEHDFGKDIIPSLIDKAQVYAYPFRDPATGDQSYWRDVGTVDAFWRANLELVEVTPELNLCDREWPIWTFQEQLPPAKFVFDEDQRRGMVVDSMVSGGCIVAGAYLRRSVLFSSVVVDERTKVQDSVILPEARIEPGCRISNAVIDKHCRIEAGTVIGEDPEEDARRFHVTDSGVVLVTPDMLGQEIHVVY >NZ_AP017372|340689:389155|365575_365995_-|WP_096407543.1|DBSCAN-SWA MVLAQLAAKNEHDPHQRKEVKLGLTRMLLERGYSKDETKELMRLIDWLIQLPEDLEEAFITEAHELEEDYQMPYVTSFERAGIKKGRQEGRQEHAAETLLRLIERKFGPATKEASRARWSMLNSRSWRCGLTEFSMPRA >NZ_AP017372|340689:389155|369819_370800_-|WP_162549286.1|DBSCAN-SWA MTNSPSTIIHLAAKAVPGAPCDGGMPRSIEWLATAQSKLGYRVVVVAQGGHATETFEHYRLDRLCGTPLKEAIEASASGQTIIHFHATDPETDRATGEDLDCLGVPFVVTVRGNHRTGALPLANRVYLSANHATRHGHSVFVYNGLPVDEYPLGPGKGGYALFLGKVSRRKKGADLAIRVARRGQPVVLAGGWHLKYPATWLPFVRSLKVAGTVGGERKLHLLQHADALLFPIKWDEPFGLVLIESLAVGTPVVATQRGSVAELVEDGISGIISEPEPEALEQAIHKAQTLDRGACRQRVIDHFRIERVAQDYDELYRHALAGERW >NZ_AP017372|340689:389155|359702_360023_-|WP_162549282.1|DBSCAN-SWA 
MYSYIQEKDVRQALEQTRPDRAEELVMTVAEEWIKRGEKRGEKRGQKRGSHQTATKTLLRQIERKFGAEAKEASRARVERAALGELEMWLDRILDAERIEDVFAED >NZ_AP017372|340689:389155|346208_346565_-|WP_109962907.1|DBSCAN-SWA MQGDRHRDLDGLERDRHSEEGDRHKEKHHDYSLVALNFTPVPRFGYRLGVPQPGFWQERLNSDAACYGGSNLGNGGGVMAEEIPWSGQPYSVSLTLPPLAAVVLVSSSHDDGALGTGD >NZ_AP017372|340689:389155|356933_357164_-|WP_096407516.1|transposase|DBSCAN-SWA MADHPANPRDALLKATLETPERAAVVLRESLPDKVRERLSDDLPTPLPGSYVDPSPQETHSDRLFEAQMMASQPGL >NZ_AP017372|340689:389155|379432_380182_-|WP_096407575.1|DBSCAN-SWA MKNKVRCYIKRIFLHNLLWNIKQYRKALKSLRPIAERICCICGYQGFFGIAGSPPRIDIRCPKCYSKERHRLLWLALERNLIADEIAVDDKVIHFAPEQGLEKRLRERWNNYSTADIKQTADLILNIEDINLPDSSVKLVIANHVLEHVDDLRSASEISRILRKDGLLICMVPIIEGWDTTYENEDIDTKHGRLLHFGQKDHVRFYGRDFKDRIERGGLKLEREVTAKGEDVVKYALRRGEKVFVFSKG >NZ_AP017372|340689:389155|377531_378059_-|WP_096407571.1|DBSCAN-SWA MLRRWRIMAVVFLLVVVATVVYAVYQQTATQPEPTDTEEAVEEKIDTEKTTKKKETEKRITEIKSRLEHVDRAVHKLEARLTELPDDAGIADVAGSVWLYESLLDQRKELKTAWEQARQDLLDLTKEDLEEEEPRTGLIIMLGIVLGGMLAIFSAFITEFMVTAHQEYRRRVTLE >NZ_AP017372|340689:389155|359166_359364_+|WP_096407525.1|DBSCAN-SWA MKLQYFEDTDTLYIEFQSRAISETRDLDENTILDLDSEGNVCAITFEHASQRTDVNHLHVEGLAA >NZ_AP017372|340689:389155|373482_374865_-|WP_162549287.1|DBSCAN-SWA MISFFNNYKTDVWLPQCTRFHIADIVGFVSALIIVFTGTWLGGSINRLAWGILLLAFFVALREFWPVLRRNLVFWASVVFIAYLGLRHGISLYEQLDVPVKQGPETGDLLRTSALIFIIGALFVRDNIYRIRLFLLAALTGLSIWFLQDLDWNSLFALDWDSRRLFGPGNPNQIAMALSFALIGFFVAVIRVIYRAYSQNSLFSLSSIVTILGYLLLLSFVAWTLIITGSRNAWIGSVGGIIAASFLLSIFLPSSSKSRWKVVSIVAVASCITIVLATLALFQTHHAKDRAIETVETIQQYLTDPDAAAEDPVSRSVALRIEFPRVGIEAWKERPLLGWGTEMNYEPVQEARPGQVHFHNLYIEVLYALGAIGAILYFGTYLGLFTWAATSLRHYGRAAFPAIFAISSATSLAIMLLATVRVHQSFGRTVFFFVGVCLASFYIGSKLRDYYAKNQCEHTH >NZ_AP017372|340689:389155|353393_355943_-|WP_096407514.1|DBSCAN-SWA 
MKENIFTLEVQPNIPPNLSRLEELAEDLYYSWDRHVRALFVQLDPELWEACGHNPKVFLRRIAQHKLEEAAQDEAYIADYNRTLSAYDTYHEQAALTSKVAPYIDPDNDLVAYFCAEFGFHESVPLYSGGLGILAGDHCKAASDLRLPLVAVGLLYRQGYFSQTIDHEGNQQAHYAPSSISELPITPCLDDDGEQVQVSVDAPGREIHLRVWQMRAGHVLIYLLDSEVPENDAADRAITYQLYGGDAHMRILQELCLGLGGVRALRKLGISPSVWHINEGHSAFQIVERCRELISEGYDSATAIEAVASETVFTTHTPVPAGHDIFEPEMVAEHLAPNLADTDIPIEDILALGNGQKGFDMTSLALRGSRFHNGVSAIHGGVASQMEQHIWPEIPAQENPITSITNGIHVPTFLAQEWANLFDQRWHAWRNQLLNEDFWKVVDELPDHRFWSMRRSLKSELLRDVYQRVLKRCQRNGMSDAMIERMTSNISNPDPDLLVIGFARRFATYKRALLIFYEIDRLKELLNDPQRPVILIFAGKAHPHDEKGQAMIRRIHELSLDPDLIGKIILLEDYDMAQARKLVTGVDVWLNNPEYPLEACGTSGQKAAINGVLNLSVLDGWWDEGYEKGNGWAILPHSAGFDPEYRDREEARDLLNLLSDEVIPLYFNRGNSGYATEWVKMSKAAMRTTLPRFNAQRMVMDYVSELYAPARAQSKILQADSLSGAQELARWKERVREHWGGTWLERIDAAPTSLLHGESLPIRVKAHLNGLSCDDVTIECRFSAMEEPRDAASTVRYQLQPEGETEDGMPVFAIDIEPRFDGLQYYRICMYPTHPLASHPFEFGGLRWL >NZ_AP017372|340689:389155|385245_386211_-|WP_096407579.1|DBSCAN-SWA MHRFHLQEERCLLGTYNSQAIFWQNDSFTSKITLSGLRRVGKIRNQLIIKLYHGMITKGHEYTGLLTTKNLNNHPKTVMAPYAHFSIAQGAVESYMRYWKEMNDKCAVPEIRALGYPRFYRAMNLRNGDEKVVLPDSIESVLKKHAGDFHKLYAPTRKPRINEIYGFDWERFDEYLEANKMTLYLKLHPLVRCHNLNLPFKQLSRMVILPADGIVDSLELLSRMNCLITDISSVMMEAIALGKPVVHVNVEHEKELLFEHHVALPGANARDFNELSSRLSQCMRGEMDEMIGTIRETWNLNRQDSIEESWSEIWSLRELCG >NZ_AP017372|340689:389155|367822_368122_+|WP_096407553.1|DBSCAN-SWA MRLTSAEREVILDVAREVLGDDVKVRLFGSRVDDSARGGDIDLLIECDEQVQNRASAASRVAALLQMRLGDQRIDVLILDPSTTRLPVHDEALREGVEI >NZ_AP017372|340689:389155|382168_383686_-|WP_096409949.1|transposase|DBSCAN-SWA MAISKELEAQIMRYHYAEHWRVGTISRQLNVHTDVVHRVLAKAGIPSAQRTRRGSIIDPYVPWIEQTLADYPKIPASRLYDMARERGYSGGPDHFRHLISQYRPKPVAEAYMRLRTLPGEQAQVDWGHFGKLPIGRAKRPLMAFVMVLSYSRWIFLRFYLGSSTANFLRGHVAAFEAWQGVCRCLLYDNLKSAVLERYAEQIRFNPQLLDLSAHYGFEPRPVAVARGNEKGRVERAIRYVRSSFWPGRQFNCLDDLNEQAQHWCVSIAAERRCDQEQNSSVRQAFAEEQPYLRALPSEPFPCYENVPVKVGKTPYVRFDLNDYSVPHKYVRKTLSVSATLERIQVLDGENVIASHPRSYDRHAQIEDPRHIDKLAQEKQAARLHRGTDRLSSSVPRAKEFLSQAATRTNSLGSVTAALLRLLDHYGASELDAAIEHALERGVPHPHALSQILEQRRDQTPGPPSLPLRLPEQLRQREPSIRLRGLDGYDALTPNYEKDDNDPENT 
>NZ_AP017372|340689:389155|376826_377447_-|WP_096407569.1|DBSCAN-SWA MIKRAALTQRIDSLPSRAENRDTLDVRLAAMLWELGFLPIPLCSGVPSNGLAEYLCKLAPEAIVLSGGNDIGSAPARDALERAALEYSIGYSVPVLGICRGMQMLNHYQSGRLTPISGHVACRHAIITGDVLDLYLCGKEGITDRSDDVEKINCNPTVNSFHFLAIRETDLGDDMLPLAWSDDGTIEAVRHRSHHWVGIMWHPEHE >NZ_AP017372|340689:389155|380442_381384_-|WP_109962847.1|transposase|DBSCAN-SWA MAVDLSNVLVCLNPPPILYGFPKPLTGDDPIWRILNHYVEAARAQEDFSNVRAVGIDETASRRGHNYITVFQDLDDRRLLFACEGRNQSTIGAFVADLHDHGGDRDSVEAVCIDMSKAYIAGVQRYLPEAAITFDAFHVVQLANQAVDEVRRAEVRLEPALKRTRWVWLKDASRHTLKQLTELHYLSRTRLKTARAWRLKEALREILTGSGARREAEQNLNAWYSWARRCRLEPFKRLALTLKAHWQGILNAFDSRLSNGPVESMNGQIQVAKGRARGFRTTKNLISICYLIAGKLTHLPASPYGTICRSAVA >NZ_AP017372|340689:389155|357185_357506_-|WP_096407519.1|DBSCAN-SWA MRQNTALKILNSQADEGRAVFTRRDLDSLFRSDRTKARKAGIARLVEAGWLKPAARGGGVYVYPPGLPQDGYTPERIARALRRGEYNYISLESALSEWGALTRNRQ >NZ_AP017372|340689:389155|365173_365476_+|WP_096407540.1|DBSCAN-SWA MARTLEDYLADEKPEVVQRAKEKADEMRLEMHLAEIRQKCEMTQAQLARIMDVKQPTVAGLEREGKDLRLSTLKRYVESLGGRVRLDIELPDGTHNDFPV >NZ_AP017372|340689:389155|387159_388017_+|WP_162549289.1|DBSCAN-SWA MPEQRGQHAIAVHDGSLYFFGGTVTEDVLAEGVFSYEYSTDTYTEGLASIPTKRSRLQASTVGDFIYVIGGWDKSRYLGTVEKYDPLKDSWTTGLEPMPTIRRDAAQVVHHGKIYVIGGREKGKQGSTANEVYDPDEDAWRSLAPMPTYRRQITAEVHDGVIYVFGGEDNETKDMDVVEAYHIDEDRWETGLAPMPTGRHEPASAIDSERGQIYVTGGDVGSRDVTGAHERYDIDTDTWSTLDPLPIPRYNFHGRWLDGAAHYPGGRVPAGDGVRDYHIYDPGQD >NZ_AP017372|340689:389155|358004_358421_-|WP_096407521.1|DBSCAN-SWA MRVFLDASAIIYLLEGDGQTRDATRQVLLELERGSDETPVLMASALSRLECRVRPLRESDTQALERLDGFFDDPGLSVIALDTAVLDRATELRAQYRLRTPDAIQAACLLTVDPRGAFVTGDGDFEKVPGLHVYRIPH >NZ_AP017372|340689:389155|370887_372210_-|WP_096407561.1|DBSCAN-SWA 
MNHNTQHLTSSADHPVSILRAALDSNLKHVSLLRYKPLTAEEAETGDYDLIVDPRHEINFIECIFKEAKRHACLIDVCYSKNSKTKITLRKLEASVDIDVWSKVEVKKSSYGALLYVRTQELLKHINVNSQSNSSLDTNIGSLLYITHLFHKNKDISSPEVIARLQRYKTDLEKLKTKQAQLILGIVDQALSGSISEDQNNKAIKYLQDANVTPEKDWLHSAKSYIHKKRRKFLFKRRTTVIAGPDGSGKTALIFALKENFGNAFLIYKFKDLYRRNFVTKFFVRSKMKRLEIKKNIAEERLSGLVSITALIHFFFVFALRSRKKHLLLDRFFVDFNFFGLRSNMPLAYSSLSRVATFFAPKPSRIILTNCPEHIREQRKPGELRKESALEIYREYLRYSHAKKIDLVGLPTYSASNTEQDLDVLMRHDPHLLNKHIQNR >NZ_AP017372|340689:389155|378539_379211_+|WP_096407573.1|DBSCAN-SWA MVTRKSALKFLGDTAVLWVDPQLITHHRGSKFPLTAQIRAKYKSKLRRPVTWASRRWHPFFVREAWLPPAEPIESEGKYRRVSDLIANRQRLERSDWYQSMIDRVRTKGSVRYKFQKMYSESEVDAFFNEHVGPLIESMAKEGYREDLAPEHGTVLVGEDGRLYKTGGGSHRFYVARELGVKSVPVTVASVHEDWVRAQGVVPDRHGLRELPKLLRSLGNCTD >NZ_AP017372|340689:389155|358420_358696_-|WP_096410287.1|DBSCAN-SWA MENVISAQEIKRRGISAVDQALKNGPVHVIQRNRPRYVILSEESYQRLSEGAQARKRLWDRLLGDDEAYGAARNRAELDRELQSEREGWRD >NZ_AP017372|340689:389155|374857_375778_-|WP_096407567.1|DBSCAN-SWA MASILSRLRDLTPRIRASKSLRKRLDKLKLRRLKRLSDPDNNVYCTIHGSKMRLNIDPHSQHKIEHELAIHKTREHSSTVYFQNRIKEIDSNFSQPPLVLDIGANIGYYSLISANSLRGDSTILAIEAEQSNIERLEYNIALNKYTNITPIHCGIGESNKLGKLSLGKTSNTHNFTSSDNTERSYEEVWITTVDDLINKHRKNTHTPIIVRMDIEGYEGYAFRGMQKLLASGAYCFIFFELHKQASPFFQEIKDALQSSGFQLEAIEHKKGNSSLIHNTDIGIITSLTQHAHLFLCRPQQTKSNHD >NZ_AP017372|340689:389155|340689_341673_-|WP_096407493.1|transposase|DBSCAN-SWA MTNNHHDPAYKRFFSQPVMIKDLLVEYVGEDWVKELDFSTLEKQNGSYAADDYRDRHDDLIWRVRWGKEWLYVYLLLEFQSDIDQFMAVRMMTYLGLLYQDLIAQGKLTSDGRLPPVLPVVLYNGQRRWSAATDIDSLIERIPGGLSAYRPQMRYMLLDEGALLSKDNSPELHSLVHALFRLEHSRTPDDMRSIVATLSKWLVKPEQRPIRREFAIWIQRVLLRRKPFADSKLFDWEEVQDLEEVNEMLAERMNEWEREWKQEGRLEGRQEGLLAGEGKSLLLLLEQKFGKEAAEQYRPRVEQADEPTIQQWLINILTANSIEEVFR >NZ_AP017372|340689:389155|343528_343972_-|WP_162549281.1|DBSCAN-SWA MSLNSMDVNQKYRCPAIALLLTIAFLALPHPAHSNYYSVAPDPGDLLQFNGVLVDIREPYEWQQTGIVEGSKTITYRHTEDFIEHLEPHLNQELRPIALICRTGNRTRQAAHLLSQKVDAPVINIEGGIFRLMHLGYRPVPYQDEEP >NZ_AP017372|340689:389155|368176_368812_-|WP_162549284.1|DBSCAN-SWA 
MRHGYTYIRIPKAANSTISSTLDYHFPSGHEDRQGKSSYDKLSNLSQQQIDTVLNDHYIFTVVRNPYHRTLSAYLQKFARKSNADSWMRRFGGEIKAYGNGEASFLGFCRFLERGGLLANIHWMPQHRIIEPIGIERIDYVGYVESLEHDLRTIMDSIGGDSSQLQLQSSGPAPTGAATKCKNYYDDESADIVRRLFAEDFRLFGYSDSTF >NZ_AP017372|340689:389155|366480_366768_+|WP_096407547.1|DBSCAN-SWA MLMHNPPHPGAVLRELCLEPMGLSVTAAAEALGVSRKTLSAVLNGKAGISPEMAIRLSIAFDTSAESWLNQQSQYELWHAEQHRKELKVKKLVAA >NZ_AP017372|340689:389155|366068_366347_-|WP_162549247.1|DBSCAN-SWA MLQRAELAGWSSPPAAITAPPPWIYSMRQSRHRRRCRCPDLTKCREELGYEPQVPWREGVERTVRWYRDFFAASRQPDDIGFEPPEALCGAS >NZ_AP017372|340689:389155|343968_344565_-|WP_096407499.1|DBSCAN-SWA MAAEIIIEILEHLGLIGIFIAMIFIAPETLMPFLGYAASQGDYHPLAALAAASLGSTFGSTLIYYAARWLDRERMIWWLTLGGRWYLFKRSDIAAMDKVFSRHGALIVFFGRFLPTVRSVVSVPAGLLPMPMPKFLLFTFLGSTAWNSLLVLGGYTAGANWERMVEYLGTFGTLITFAFIALIIGFVLFRLRTLTLGK >NZ_AP017372|340689:389155|388405_389155_-|WP_096407583.1|holin|DBSCAN-SWA MRAIILAAGQGTRLRPLTDDRPKCMVELEGKPLLEHQLEVLRGAGIEDIHVVGGYRAEWLQRPDITLHINERFDQTNMVATLFAAESVMAGSDEVIIAYGDIVYEPRVLNALLACNAPVCLTVDRAWRRYWEARMDDPLADAETLKLTDGNRITELGKKPSSYDEIQGQYMGLLKVRADLVPQLPAVWRAMDRDATYDGKDYDNMYMTSFLQHLIDSGWEARAAFVENGWAEVDSEADLAAAKDFWAPS >NZ_AP017372|340689:389155|369092_369734_-|WP_162549285.1|DBSCAN-SWA MLARRGSVIIETNPLNIEYLKMFYEWPGYLKRWPFWICGGTWDLGGQALEPFREAQIREIWLSGGKYKDTDCYGQMREQLERLGYIKNPRMQSVAELDSYFDRQWKLLLELSHDGYKEQSALGGKPGHEITVRIDRDGRLVKCREGSHRLAMLRVLGITRAPVTIDIVHSKWALADDRAAPTSGSLAAKIRARLEQDYGQVLAWPSWWPNGCL >NZ_AP017372|340689:389155|344549_346085_-|WP_109962906.1|DBSCAN-SWA MPAYPSARERLEGPIKYLSLGSGAGSVKGQAQTSSTKGQAQSEGQAQISQSEGQAPGGGQAQPKGQAPTPSKGQAQSPAPSKGQAQSSNRYHKQKIIEGRLPGSEVSVWLLDDPELFERVGSPYATASGEPWPDNHLRFYWLSRVAAAIAAGEVLDWQADILHANDWQSALAPVFLQDYSESHQERPRTVFSIHNLAYRGIFSADVFAQLELPAAMWNPERLEFYGELAFIKGALTLSDAITTVSPTYAREIQTPAFGWGLDGLLRSRSSDLHGIINGVDTTTWDPATDPHLAANYSAPDPQAKAKNRQAIAVEIGLDDDPQSPLLGFIGRLVEQKGIDLILGALPRLLASGARLAILGSGDNTLERALLQAAQAHPGRVGVSIGYDEGQAHRIEAGSDIFLMPSRFEPCGLNQLYSLRYGTPPLVNPTGGLADTVLDVDAHAGGNGFCTAAADAGSLAATVERALSYWQDQEAWQKIQARGMSADYSWDRSADAYVDLYERIRATGWQRR 
>NZ_AP017372|340689:389155|366961_367393_-|WP_096407549.1|DBSCAN-SWA MILLDTNVVSEVMRPHPESAVIEWVNRADGGSLYVSSITIAEIEYGLHAMPESQRREDLRARFETFIHKAFAQRVLDFDEASARYYGLIMASRRRSGRPLSAPDGQIAAIARRNGMAVATRNESDFSGSGLKIINPWRLDGLL >NZ_AP017372|340689:389155|381408_382188_-|WP_096409948.1|DBSCAN-SWA MTPKTPEQQMIERAKALRLHGILAHWEEIEDKQWIEQMLCWEEQERTRRSLERRLSEAHIGRFKPMSEFDWSWPTSCDRGAINALMSLEFIPEAGNVVLLGSNGVGKTMIARNIAYQAVIAGYTALFVNASTILAELASQDSERLLQQRFNRFTRPRLLVIDELGYLSYSTRYADLLFELVSRRYEKNSIIITTNRPFSEWGEVFPSAACVVSLIDRLLHNAEVLAIDGESYRYKEAQERRNTREAKRKSPSKKAKAES >NZ_AP017372|340689:389155|360770_361163_+|WP_162549283.1|DBSCAN-SWA MRNALLPDEASEQVADHPRIGEEHLVGVVVFLGHRQPRRGDVLMDFLAADTQASGAAAIGDIVAGAIVTWRRVAADEELACAAPAGGESLEERPDLFRRQVYDQLGLDNVIAGIPLMVWTLDVAHAVGQL >NZ_AP017372|340689:389155|372353_373454_-|WP_096407563.1|DBSCAN-SWA MSQTSNNQINISSANFAKLKRGFYINLNKHSFADWKFAYRNGRAVIPRISTSFAATPTAAHYSSLTRCILTQTLINADLYSSDKIITLPDIGLFAYVREPKSRRIIFDFENKAVCFLSSDTDLDSLRSSIDDPSCLRQLTSAKLIPELRYDANLQCHIGEMADDQISSIIKNFHIQSEVIQLIAKLQKITSQGVCSPQKILHSPNHIYDMLIELSANNKKVLDYGHWYNEVYRSLEDAKIEKVFSHGDLSTKNILLFSGRPRFIDWEHSCYRSHVFDIMYLYYRETRSKQKISQAKLKDFWISSERLASSFIDSKRPQLDIVTAVKIFSCELFSHIINGWGRETSPEESNLIHKLERIKRWTLAHL >NZ_AP017372|340689:389155|349710_351477_+|WP_096407509.1|DBSCAN-SWA MLSIEKSSTTGPSPDKVRVVLCWHMHQPSYVNPASGDYELPWTYLHGIKDYTDMAAHLEANPQARAVVNFSPILIEQIEDYAEQIKGFLASGERLRDPLLNALAQPVISADPEHRRSILEQCRRINRPRLVDPYPQYRQLMEFADLLDQQPTMLRYLDESFHEDLVTWYHLAWLGETVRGSEPLAKRLIEKGHGYSVHERRELLALIGEQLSGLLPRYRKLAEQGRVELSMTPYGHPILPLLQDLQSALEAWPDAPMPEQVTAYPGGEERARWHLEHGREVFERAFGQAPHGCWPSEGALSEPTVRLLSECGFKWAASGSGVLENSLNGNGVEEQQRNGHWHRAYIFQGEASGAGENSVEPTRCFFRDDGLSDAIGFVYSDWHGDDAVANLVVKLEEIAVASKDPGNTVISIIMDGENAWEHYPANGYYFLSGLYEKLSEHPRLHLTTFAEAIEQVEPIALDRLVAGSWVYGTLSTWIGEVDKNRAWELLVAAKQAYDSQIDKLEGPARDRAERQLAICESSDWFWWFGDYNPPDVVRDFDHLFRIQLAALYQCLGLEPPQELDHRFTHIGTGSPQMGGVMRQGRLES >NZ_AP017372|340689:389155|351473_353033_+|WP_096407511.1|DBSCAN-SWA 
MSGRGLTEQRRAGVLAHLSSLPGGPGNGDLGAHSRYFVDWLANCGFSVWQMLPLGPTHEDLCPYQCLSVHAADPGFIDLQQLVEAGYLSAEQAIPPTDLSRSELLNWRYQRLRDARAGFVARHGQNGKGQAQGEEGQAPPPPPPPETNSELRELRQFRACHSHWLEDYALYMALRRENEFRPWWEWPQPLRDRQPQALEEARERLGEELNQVVFEQFIFFRQWAALRAYAAEKGVLLFGDMPIFVAHDSAEVWAQREYFDLGADGQPLSVAGVPPDYFAADGQRWGNPHYNWQRMAEDGFKWWLQRLETQLELFDFVRLDHFRGLAAYWSIPVEAETARDGHWEPAPGHDLLSAVAQRFGQIPLVAEDLGIITDDVVALREQFALPGMKVLQFAFDSDSANPYLPHNHTADSVVYTGTHDNDTTMGWYADLEPWVTERMHSYLGHPNEPMPWPLVRASLASVSGLAILPLQDLLALGSDHRMNIPGVAEGNWRWRFEWEWLPDDLSGWLWELNYLYGRV >NZ_AP017372|340689:389155|363045_364041_-|WP_096410288.1|DBSCAN-SWA MTQTSHTDSEKSDYDSPWKEALEYYLEHAMALLFPQIHEQIDWSKGSHFRDKELQQIIRDADSGRRYADKLVEVYADDGKPTWILLHVEVQGEPEKKFAERMFTYHYRLYDRYQRDIVSVAVLTDTSPSFRSDTYRYERLGCRLEFSFPVAKLLDWQQRWAELEADLNPFACVVLAQLAAKNEHDPHQRKEVKLGLTRMLLERGYSKDETKELMRLIDWLIQLPEDLEEAFITEAHELEEDYQMPYVTSFERAGIKKGRQEGRQEGLQEGRQEGRQEHAAETLLRLIERKFGPTTKEASRARVEHAEFEELEMWLDRILDAERIEDVFADD |
43 | Escherichia_phage(28.57%) | transposase,holin | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1511921 : 1555736
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP017372|1511921:1555736|DBSCAN-SWA GGTGGAGAGAAGCCATCGCAAGAGTGGGAGCCTGGAAGATGCTGAGCTTGAAGCCCTGCGCAGGGTGGCTAATGTAAGAATCAAGGCAATCATTGATATCGCTTACCTGACTGCTCTACGCAAAGGCGATCTGCTTAACCTGCGCTTATCGGATCTGTACGACGATTGGATTCAATGCCGAGTCGGCAAGACAGGTCAGGAGGTCAGGATTGGTTGGAGTGACGCGTTGCGAGAAGCTGTTAGCCGAGCTCGTAAATTGGCGCGCCGGGGAGGATTCGAACCCCCAACCTTCTGATCCGTAGTCAGAGCCTTAAACAGCCCTTAAACTTCTGATAAAACACAGGTTATGCCTTTCTGATATCTACAAAAGAAACACTCAAAACCCTACGTAAGCCTTTGATAAATAGGAGCCTGGTTGCATTTTTGTAGACAAGTTATCACGCTTCATGCTCGTATAGCTTATCCATGATCAAGTTTGCGGCCTGTGCTAAGGAGCCAAAGCAATGTCTTAAGCGCACAGCACGATCAGTAGTCTAGGAAGGTTTCACTTTCTTTGCGGTTTTGTTTATTAGGGCCACAGTAGCCGGAGCTATATATCTCCAATCCGGGCTCTGAGCGTCGTGTGCCACCTCTTCTGCGCCAGGCTCCCCTATTTGCCAGAGAGTTACCGCACCCCAGCACGTGTTGACCAGCGACCACCTTGGCCGGATAATTCAACTACTCAATCAATGAGAGTGTCCAGAATCTGCTCAGCAGGACGGGATGAGACAGGCTAATTCACCCCACCGCCACCTCTAATTGGGCTCTGAGGTGGTAACTTGCTTGAAGAGACGACGGAATGAAGGCTGTTCCCGGATACATTCTCGTCCCGCAAGTACGGTAGTATAGTAGGCATGACGATCCCGAACTGGAACCATCAGGGGGTCCTCCCCCCTTACGTAGGGAGTCAGACTGGTTCCGATGGCCGCTCTCCCTATCCGACGACACTTGTCGAAGTGTTAGAGCACTTTGGCACTTCCCCTGAGCGATGCAAAGTCCTTCGTGGGTTTCTCGACTATCGGCAGGAGCTGTACAGCATCGGGGTTAAGCAGGGATTCCAGTGGGTGAACGGAAGCTTCGCGGAGAATGTGGAGATTCTCGAGGAGCGCCCCCCAGAGGACGTGGACGTAGTGACTTTTTTTGCGGTGCCTTCGGGAGAGAGCCAGCAAACGCTCTTGGAAAAAAACGCGGACGTATTCCGTCCAGCCTCAGTAAAACGGCGGCATGGTGTCGATGGTTATCCAGTTCCTTGGGATGGCGGCGACTGGCAGAGGCTAGTGAAGCATGCGAGCTATTGGTACAGCCTTTGGTCTCATCGGCGTAACCACCTCTGGAAAGGATTCGTTCAGATCCCGTTGGAGCCTTCTGAAGAAGTCCCACGAGCGGTTTTGGATAACATCACGTCGGCTATCGGGGAGAATAAATAATGACGCGCCCTTCTGAACATGAGCAACTCTCTGCCGAGAAGGCGCAGCTCGAAAGTTTCCTCGCTGATGTTCCGCAGGAGAGTGCAATCGAGCGGATAGCACTAGAGGATCGCTTGGAAAGTGTTCGCGTTCGGCTCGAGGAGCTCGCCACCAATAGTAGCGAGGCAGCCCGGGTGAGTCTGACCTTCTCAGGCAAGCCCGTGCTGAACTCCGAGGGTATTTTGGCCGATTTCAGTGCAGCGGCTTTGCAGAAATTCGAACAGATGGTGGCAACGGTGGGTGCTTATCTACGCACAGGCGAACTTAGGTCCTCGGGGCCATTGCCAGCACGGCAAGAACACCGCCTTATGATCACAGGGACCACCGTGGGATCTTTTGGCTTTCATCTAGAAGAGAGGAGCGAACCTGAACAGCCGGAGATGGGGTTAGAAGATAGCTCGGTCCGAAAGGCTTTGGA
CCGAGTAACTCGTGTGATAGAAGGAGCTCTAGAGTCCGAAGAGGCACTAGCTGAGGAGGTCTCCGATCTTGATTCTCGCTCAGTGCGAAGTCTGCATGAATTTATGAACACAGTCGCAAGTCGTGATGCCTGGTTCCGGCTGGAGGCCGTCGGTCGTCGCGTGGAATTCCCCGACATGGAGCATCTAAGCCGTACTGTGGAGTGGCTTCAAGAGGACAATATCCATGAAGAAGAAGCGGAGCTGCGGGGCACTCTCGAAGGTTATTTGCCACACACGCGCACCTTCGAGCTTGCAGAAGACGAACTAGGCCTCGTGAAGGGTAAAATAGCTCCAGCGATTGACGATCCTCATGATTTACAGCCCTACCTGAAGAAGCGGGTTACTTTGAGCGTGCGGAAAACAGCTCTTGGAGGCGGTAAGCCCCGATTTGCTCTTGTATCCCCTCCCAGTTCGCCGGACGAGGATGCCGACGTGTAAACTAGCCGTCGTAGCCTTCTCTAGCTGAGCCAACGCTGCTCGGCATCTTCCACGCGGAGGCGGGTTTCCAGCGTGTATAGCTGGATGTCGAGCATGTCGAGCGGCTTGGCTTCCTCGCAGGGCGTACTTGGCGGCTTTGATCAGCGGATTTTTGGGTTCAAGTCCGAGTTGTTGCTATGGCTATTCATTGCGTTGGCTTCCTTCTCCCCACTTCATTGCACCCAGCCAGAGGAGCTTGCCTAATGCGGGTGCTGCTGTGGAACAAATGAAAACATACGCCGCAGGACTCCACCCAAACCAGTAGCAGATAAAGCCGGTCACGATGCTTACCACGGATAGCACGAACGCAGAGCTAAGAATAACTGGGACAAGAGAACTCCTGGGCGGACCTATTAAGAAGGCTGTCATAGCGACGACGAATAAAGAAAGGAGCAATGAGCGTACGAGAAGATCTATCAAGCTTTCGCCTTCAAACCACGGCAAGCCAATCGTATCAACTATAACTAGATTGCTAGACATAAGCGCAACGGCAAATACCCCGAGACTGCCCCCAAAACGCTCAGATAGATCACGAAGCGTATCTGGTGATATCAAACCCGACCTCCACTGTCATACCAGTTTATTCAAAAGACGCGATCTTTGCGCGTATTTTACTGATGCCGTATAACGTCATCCATAATCCCGTATTAGGACGCGATAGCATCATTTTACCTATGAAGCCCGTACTTCGATCCACGAACGTCAAACGCCGGCCAGGTACCTTCGAGGATATCAACAGATGAAGCCGCTATACTCACGCTCTCTATTGATCCCGACTTATAAACTACCGTGTTGTTTTGGTCTGTCACGATTTCTGCTACTATTATCTGATTCGCGTCACAAACAATCGCTAAATTCGGATTGTGGGTTACTATTATAACTTGCCGATGCTCTGCGGCAGATTTGATAGCACTCACAACCCGGATAATTCACGACAGGTAGGAAATAAAGGTTGGTGCGGCGCTTTATTTCCTTCATAATCAGAGCTGTACGAACACGGAAAACAAAAGGAAACGCCGCACCATGGGTGAAACGATACCTCTCTTCGAACCATCTTTCAATAAGTCTCTGCGGGTCGAGACTCGACCGGAACGCCTTAGCTCTGAGGCGGGAGTAATGCTCCAGCGCGAGGCTTTTGAGCGTACTGGAATCATCTCCTGGATGAGCGCACGACTAGAGGATTCCCGTGATCAAAGTCGTGTAAAGCATTCACTTAGCGAGCTGCTGCGCGATAGCTTAACCATGATGGGCCAAGGCTGGGATGACCAGCGTGATGCCAGCACCTTGGGCAACGATCCTATGCTCGCAGCATGCGGCAGCGATGGTCGTGGCGAATCCGTCATTGATCGGGGCCTTGCTTCGCAGCCAACTATCTCTCGGCTGTTAGACATGCTGGCGCACGAACATAACCTAGATGTGCTCAATAAGGCGCTTCTCAAGCTAGTCGAGCAGCGGATACGCTCCCGTAACAAAGGGCGCCGCG
CGCGAACCATGACGATTGATGTCGATGGTGTCCCATTGCAGGCTCATGGTCAGCAGCCGGGCAGCGCTTTCAGCGGGCACACCGGACAACGTCAACATTACCCGCTGTTCGCTCTGTGTGCAGAGACCGGAGATATGATTGGAGTGCTGTTGCGCCCAGGTAATGCAGGCCCGGCGAGCGAGGCGGCGGCTTTGGTCCCGCGGCTAGTCGAGTACTTGCGCACTCATGTGGCTGAGCAAGTTCGGGTGCGACTGGATGCGGGGTTTGCCGACGCTAACACCCTCGCGGCTTTGGATGCGCACAAGATTGAGTTTATCGCCCGATTGCCTAGGAACCAGGCCCTTGAGCGCCACTTCGAACCACACCGCTACCGGTGTGGCCGCCCTTCAAAAAGAGAGCGTGAGTGGACTAGGGAGATCAGCTATCAAGCAGGCACCTGGGATCATCCCCGGCGTGTAGTCATGGTTATTCGAGAACGCCCTGGGGAACTATTCCGCGATGCTTTCTATCTGGTCACCAACGTTTGCGGCAGTCAGCGCTTTGTACGCAGCCACTATCAGCGGCGCGGCAAAGCAGAGGCACATTTTGGCGAGTTTAAAGATGTGGTTGGCGAGTCGCTTCCTTGCACCTGCCGGGGCAAGGCGACAGAACAACAGGTGTTAGCCCGCAGCCAGGCGCTTCTAGGGCTTAGAGCATTGGCTTACCAGCTGCTTCACATTCTACGCGAGAGCATGGAGCGTGCTACAGGTGATGGCTGGACCTTACGCCGACTGCGAGAACAAGTGCTCAAAAGCGCGGTTCGGGTGCTGCGTCATGCTCGCAAGTTGCAAGTCATTTTAGAGCGCCGCTGTGCTAAGCACTGGCGCGCTTTGCTGCGACGTCTGCCCAAAGAGTACCCAGCTCCCGGATAGATTTGATCCTTGAGCTTAAAATCAACTAGGCTCAAGCCGCTGGAAGCAACCCGGCGGGGGGTCGCTGTGTCTATAAAGCGCATTTTCGGCTCTTAGTAACCACTAGAGAGGGTCAAATCACGAGCAGTATAGCCACTTTACAGCCAAATATTGCTCGGGTTGCGGCTATTTTCTGTTCAGTGTTGATATATACCTACTCTTGTAGGGGGTAAACTCGGCTTCTCATGAATAACGCGGGTTTTATGTCAGATATCCTCCCCATACACTTGCAACTCTACTCCGTACCAATGACCATAGCAAATCCAACTGAGAGGGTGCGACAAGACGCGACACTTCCGCTAAGATTCTGAAAGGCAATGCGCCTTAGAGATTCAGTCACTCACCTCGAGCATCTCTTCGCTCACCCTGACTGTTCGAGTGGAGGCGCCGGTCACACGACGCCCCCTCACGGATCCCAGCGGGCGGATTTCCCGCACTGGGCTCCTCAGGTGAATCTCATGAGGCCATTGCCACCGCGGCTCGTTTCTTCGTCTTGGCTCCCCCTAGGAGCAGCACCATGGCGCGGCACTCGGAAGTTCATCAGTTTGGTGTATAACTTATCGATTAGCGAAAGTGCCGCTTTCAAAAGTTTCGCTACACCGCTTCTTCGAGGCGCTGCATCTTCGGCTTCGGCACCTCCAGGTGCCTGATACAGCTATAAATCCCCCATTGAACACACTATACTATTGAAGCAAGTTAGCGAAAGCCCTACGCACGTACCACTACGGGCTGTGCACCCAATCTGAACCGTTGTCCGTTCAAAATCAAAGCCATATATGACGCGAGCAATTTGTTCAGCAAATGAACTGATGAACTTCCGGGCACTATGCGCCTCAGCCATGTGCTCGGTGAACCCCCGGCGGGCGCGTGTTCTCCTTGATCCGCACCACCTGCCCCACGCCGGCGCCCGGCTGTGCCCCGCCGTCAGGGCCCATACGTGGAACGCATTTATGATTCAGGACACTAGGGCTAGCTGACAAGAGTGCGAGGGCCAATAACCGTTCTGCATGAATTTGACGGGTTGGGGGAGGGTGATGTTAGCTTTTACAC
TCGGGGGTACTTATCAAGCGTTGCCTGTCAAGCGCGATAGCGATGGTATTGGGAAAAGAACTACGCATGCCGGCCCCCTAGCCCCCTAGGCAATAGAGGAGATGACAAATGACTGATCCCCCCCCCGCGCGCCACTAGCAAGCAGCGCATCAGTGCGGGCCTCCAGCCGCTCGACGAGATCTTGGATGCGGTCTCCTGCCCAGCGCGTCTTCCACATCCGAGGCGGCCCCCGGCTGCGGCAAGACAACGCTGGGCCTGCATTTCCTGACCGCCGAGACGGACGCCAGCGCCGTCCACACCCGGAACTCCTCAAAAAGCAAAACCTTTTAAATCCGAAAAAATTTCAGAAGAGGAAACTTATGAAGTTATTACCCCCTATTGCTCTTGCCATAATTGTCTTCATTCAAGGTTGCGGTGGTTCAAATTCAGAGGAATATGACTCAAAATTGATTTACAATTTTGTTGCTTCCTGCTCTCAGCAAGGACAGGCTTCCGTTGAGCAGTGTGGTTGTATGATGGACGAAATCAGAAAAAATGTTTCACAGGATAAATTCATGGAGTACGAAACAAACATGCTTTCTGGGGGCAAGATCCCAGATGAATTTAATAAAATCATCAGCAGCGCTAGGTCAGTTTGTAGGTGACCATCGCAGCAGCTGCTCCCGCTCTGTAGCGTTTCCAAGGGCTCTTGCTAGCGACATAGCGGGAGCAGCTGGTGGTGTGGGGCTGGTCTTTCTACACAACTCCGCTGGCGTCAGGGCGGAGCACGGCCTGAGCGCACGATTTTGAGCGTCAGGCAGAACGTGTGGGTGATGTCATCGAGCTTATCAGCGTCTACCTTAAGGACTCGGCCGAAAAACTAGAGGATATCCATAGCAGTTTCTGGGTCTACCGCCTCTCGGAATATAATGAGAAGGTCGAGAATGTGCGCGAGTGAACGATTTGATAAAGGAGAAAAACCATGGGTGCTGTAGCAACCGCAACCGTCGTTGTAGCTGCGCTGATCATCATCGCCGCCTGGGTGTGGTTCAATTATCACCTCCAGCAAGCATTCGAGCAAATCAGCGCGAATAGCCGTCATTTTTCGCCAGGTCTCGTTTGGCTCAACCTAATCCCGTTTTTCAACATCGCCTGGACGGCCGTTTTGGTGGTGATGCTCGATCGCGGCTATCGAAAAGAGTATCCCAGTCAGGCCCATGAGGTCCTCGGTCACATCATCTGTCAAGACCTGACTGGTCATATACCAGCTTACGAAATAACTCGGGGTGCTCCTCGTACCACCGCTTGAGCTTTTGGACGGGGGTGATGTGCCCGAGTGCGCGCTGGGGAATGTGGTGATTGTAGAGCCGGCGATCGTGCCAGAGCATCGAGTCGAGATCGGCGGCCGAGTCGAAGTGGGTCGTGCGCAGGATCTCGTTGACCCGGCCGTTGAAGCGCTCCACCAGCCCGTTGGTCTGGGGGCGGCGCGGCGGCGTGAGGCGGTGCTCGATGTCGTAGCGGGCGCACAGCGCGTCGAGGACATGGCCCAGGTCTTCCAGAAGGTCCGAGCGCCGCCGCTCGGGGTCAGGCGACCCCCCTGATCACGTCCAGACGAGTTGTGGGGAGTCGCCAGAGCATGGGTGGGCATTTACAAAACCCCTCTACGACATATACGTTTACCTTGTGTGCCCTCCCCCCAGGCCATACTAAATACCGGTGCAGGATTAATCCCGGATTATCTTGATTGATAGCCTTAGTAACCGTTAAAGGAGCACCCTAAGTCATGACAGCCAATCAGCTTGTAGTCCGGACAACCGTCATCAGCTGCTTCCTTTTTATTGGTGGATGTGGCCAACAACTCCCAACGTGTGATAGCAATGATACAATTGAAATAATTACGGACCTTATAAGTGAGGATGTGGCCTCTGGCAGAGCTGGAGTAGGTCCCACGGATGTTAACGTTCGAATAGATGATATCGAGCTGATAGATGACGGCGAAACGCGCCGCCGTTGCGAAGTCA
CATATCTAGCTTCAAATATTCCCATAGAGGAGGTTGACGAGTCACACAGCAGGATCATGACCTATGATATTTATCAGGAAGGTGATGATAATATTTGGGAAGTTGTTGATATCAGTTATGAAAGGTTCAAGTAAAAAGCAGAACCAGAATAACAAGTCACCACAAATGCCATCACGGTGCTGTGACGAGCGTGGCGACCCCCGTGATCACGTCCAGGCGAGTTGTGGGGAGTCGCCAGGCTTGTTCTGGCCAAACGATTCCTAACACCAATTTTAACCAACCGTACCTTCTCTCATCTCTGCCAAGCCAGGCCTACCGATCCCTTATGGCTACGACTCCACCACCTCATACGCCGCCATCTGCACAAACATCCGCGTCGCTATCTCAAACGCCTGCCCCTCATCAATACAATTTAGGCATATCGACAACAAAAAATAATCCAGCTGGATGACTAGCTCTGGCGGCCCCTTGGGGGGTCACCAGAGCGTCTGCCAATTACAAACCACCCCTTTCAGCTAGACTGGTATAGCCTTCGGCGGAGACGATGATGTGATCAATAAGCCTGACATCGATAAGCTCGAGTGCCTGACGAAGCCTCTCAGTGATCCTCTCATCCGCAACCGAGGGCTCCAGATGGCCGCTCGGGTGATTGTGGACTAGGATTACGGCAGCAGCTTGACGTCTTAGGGCCTCTTTGAGCACTTCGCGTGGATGTATTGTGGCCCCGTCAATAGTGCCCCGGAAAAGCTCCTCAACACCCAATATGCGGTGCTTGGTATCGAGATATACAACGCCGAAGACCTCATGATCTTTACCAAGAAGCTGGGTTTGCAGGGCCTGGTAAGCGGAACTCGGACAGCTCAGAGATCGACCCTTAGCCAGTCTACGACGGGCAATTCTCTGCGCTATCTCGAGAAGCTCATCGCCTGTCACTGGCTCAGATACAACGTAAGTGCCGCTCTTCTCCCCGGTTTGGAGCTTGGCGTGGATCATAGCAAGTTTCTCCTTTGCGTAGTGCCACATCACGGGCAAAGGCACACATCGGCCCTTACAGGGCTAAGTGCCTGCCCGGCCATTGGCGGTTAGGATTAAGGTGTAGCGAAGGTAGGAGGGACTTAGTGGCTAAGCAGTTGCCCCGTTACATCGAAGAGCGAGTTCTTTAATTCACTTAGCTGATCGATGCGTATAGCTTGCGGGAAAAGGTGATCGACGCGGTGCTGGATACCGATTCCGATCATCTCAAGCTGTGCCGATTGGGCACGGCTGATCATCGACAGTGTGCTATCCGTATCGTTCGGCTCACCGTCGGTTAGAACCAGAACCACCTTACGAGGCTCAGAACGTGCGAGCAGATCTGCCGCTGCAAACCACAGCGCCCCTGTCATCGGAGTGCTGCCACGCGCCTTTTGCACGAAGGCTCCAGTTCTAGTGCTTACCCTGTCATCGTGGGCAAGCACTTGAGTGACATAGCTGCTTTGCCCCCTTAGTCCTGGGAATACCGAGACCGCCCTAGAGACACCGTTTATAGACTCTAGCGCCAGGGCTAGAGACATAGCCGCATCTAGGGCCAAGCGATCCGGCCCACCCTGCATCGAGAGCGAGAGATCGATGAGAAGATGCAGTGCGGTGTTAGGGGCAGGTTGCGCTCTGCTTGTCCTGAATATCCGGCTATCCGCAACACCCGCCCGGTGCAGATGTTTCGCACTTAAACGCCGGCCAGAGCGTGCGGTACGTGAACGGGTCAATCGTTGAGACTGAACCAGCCCTTGCAGCTGCGCATTTAGGTGCGCTGAATGGACCTTGACCCGCTCCAAGGCCTCCTTGCCTAGCTCTGCATCGCCCTGATAGGACTGCACTGTTGGCAGCAGCTGAGTGTCATCGCTAGAGTGGCTTTGCAGGATGTTGCCAACCGCCTCGAATAGATCCTCCGGCAGAGCATCCTTATCGGCTTGCAGTGCTGCCTGTGCGGCTTGAGCTGCCAGACTTGAGTCTTGCTCTGA
AGAGCAACCTTGCTGCGATGTCGAGCCTTGCTGTAACGTGCTGTCGGCCGGCTGCTCATGATGACCCTCATCACCAGAGTCGGCATCAGTAGCGGCACTAGCTTCGGCATCAGCACCCTCATCTGCAGACGCACCGGCGCAATCCTCACCACTTGCCAGCTGCCCCTCTTGGGAGTCATCCTTCCCTGATGCCTGATCATCCGCTGCGTGATCATCGGCGGGTTGATTATCAGGCTGATCAGCCGCGTTAGCTTGCTCAGGTGATTGTCCCTGTGAGAGAGACTCAAGCAGGGAGATAATCCGCTGCGCTAAAGCCATCGTCTCGGCTGTACAGGTGAGTCCCGGAATCTCGGCTAGAAGCCCGTGTAACTGCCTCACGAACTGCTGACCGAATACCTGTGAGAGCACCTCTTCTGATTGCTGAGAGTGCATCTGCAATGCGCTTTGCCGACGATAGTGATAACGGCCAATCAACAGTACCGCGTTTCCAAGCACCTGCCCCGGTGGATCTTGATCACTGGGAGGTTGCATCCGACCTGCTTCAATCAAATGCTCCAGCACCGCATCGAGGGTCTTCCTCGTCCCTGGATAAGTGGTAATCATCGCCGCCTCAATGCGGATATCCTCGATCACTCCCTCCAGAAGCTTCCCTAAGGGCTGGGGATGACGCGCCTGAGTGAAGTCGGTGAAGCGCACATGACCGGCTTCATGGGCCAAGAACCCATAAGCCAACGTCCTGCTCTGCGCCTCTTCGGATAGACCCGGGATGTTAATGCTCTGCCCGTCAGTGAACGCATTCTGACCGCCAACCCGAACCTTGACGCCAAACTTACGCCCATAGGCGGCAGCCACTATCGGTAGAGCATCGTTGAGTGTACTAAGTCGCATAATCGCCTCCTAAAAACGAAAAAGGCGCAGCAGCCGGGCTAACCGGCCACTGCGCCGTGGTAGAGGTTGAGCTGATTGGTATCGGCTCGCTAGAAAAAGAAGCTTGGCACCGGCTGAGGGTCGGTTATTACCGGCCTGTGCGAGGGAGTGCTATCGCCGCTGTTCTCGCCCTCTTGCTCAGGCGTTGTGTCGGCCCCACTATCCTCAAGACCGGCATCCGATGAATCGATCTGCTCAGCGCTTTTGTCCGTTGGCAGTGTGGGCAGTGGCAAGCTGAGTGCATCGATCTGACCGGCACCATGACGGGCCATGCGCTCGGCATCGGATAGCAACAGGGCTAGGCCCATCCCTTCGTTGAAAAGGGAGCCCTGGATAGGGGCTGTAGCCGGCAACCGGTGCAGCCATTGATCGACGCTACCGAGTACTGGCTCGATACGCCTGTCGATAAAGCTCAGACAGGCCAGCTTGTTACGGATCTTTCTGAAGGTGCCTAAGGCACGGCGGTTGAGCTTATCCTTGCCCTCGAAGCTATCAGTTAAATCCCGGGCCATTTGCTCGACCTCGCCAAAGACACCGTCACCGAGTTCGGCTACCTCCTCGTCTAAGGTGCCAGGCACGATAGCCGGCTTTATCTCAACGAGCTGATAGCCAAATCGCAGCCGTGCAGCCACACTACTTGCTGGCTCCACGGCCCGGCGAATCGCCGTCTCGAAACCAGGCAGAGTTTGAATCCAATCCTCTAGGTCCCGGTCGTAAGTGGACAGAAAGTTATCACGCTCGGCATCGAACTGAGCCTTAATCTGATCGAGCTCCTCAGCCAGAGCGGTGGTCTGCTCCTTAGCAACGGCAAATCCGCCGAGAAAGCGGGTGCCAACCCGCAGACAGGAGCGTTCTGCCGATTGCTTAAGCCGATGGAAGGCCTTGAGCTGTTCCGGGTCACAGACGCGCTTGGAACCGAGAGATACCAAATCTTCGGGCGGTATCTCCCCATCATCCAAGTGTAGATCCTCTGCGCGCAGCTTCTTACGCCCTGACCAGATCCGTATATCCAGCACTACCAAAGTTAGCTGATCGGTGATTTCAGTGATGTTAATCATCGTCGACCTCCATAG
AGAGTAGATCGCCGCACAGCACGACACACCAACTCGCGGGGTGACGAGCTGGTGTGTTGTGGCAGCGGTATCGGCGAATGTGGGTGAGTAAGACGGGGGTATAGGTGAATGGGTGGAACGGCTCAGCAATTGCTGCTTTGCCAGTAGTCGCCGAACACATCGGCAGCTATCCGGTGAATGGCCTCGCGCTGCTCGGCCTCTGCACGTGCTGTCAGTGCCTGGTTTAACGAGTACTCGAAAACGTTAGGCGCCCCCTTAAACGTGGCGGCTAAGGTTGCCCAGCGCACCAGAGTCCTTGTGCTCATGGTCACCGTCAGCTGACCAGCCTCACTATCTGAGCCCATGAACAAGCGCCTTACCTCACCGGCGACACTTAGCATCCTGCTGACTATCTCCTCGGGCAGGCTCGGCACGGCAGCTTGGAGCATCTCCTTCTCAGCTTCCGGATCCGGGTACTCAACGTGGATAACCCGAAAGCGGTCCATGAATGCCACATTTTGGCGTAGAACGCCCTGATACAGCCCGGTGGTATCGCCAGAGCCGAGCGAGTTGCCGGTGGCAATGAGGCGAAACTGCGGATGGGGATGGATCACCTCACCACCATTCTCAGCAATCACAAGCGGTTGCCCCTCTATGATGTCATTAAGCCCGGCCAATTCCGCAGGATCCATCATGTCGAGCTCGTTTAGTATGAGGATGTGACCGTCACGGGCTGCGACTGCCAGCGGCCCGTGAACAAAGTCGGTTGAGCCCTTAACCAGCACGAACTGCCCTATCAGTGACTGAAACTCCAAGCGGCCATGGCATGTTACCGATTGGACGGGCCAGTTAAGTCGGGCAGCTATTTGGCTGATCAAGGTGCTCTTACCTGAGCCGGTCGGACCGCAGAGGAAAAGCCCATCACCGGCGCTGTGATGCAGAAAGGCGAGCACATCGCCTAATAGCCGGTGGCGAAATACATAGCTAGATTTCAGTGGGGGGATGTTCGGATGTGTCGGATCCGAATAGCCTGGCACTTCAAGTCCTGGATGTGCGGGGACATTGAAGGTTTGGGACACGTTAAATTGCGTTAAAGCAGACATGGAATTCTCCTTTGTCATCAGTAAAGTAGACAAGAGCACCACCACTACCCCAATGGCTGGGGCAGTGGCGGCAGGGTTGATTCAGGGTTAAGAAGACGGTGGCTGTGAGCCGTTACCACCTTGCTTAGCGGATGCTGGGCGGTTTTGCTCATTAAGGCTTGCAGGCTGTGGCTGTGGGATTTTTGAGAGCTGCTCTGAGAGTGAATTACTCTCTTGTGGCTGAGTAGCTGAGGATTGAGCTGAGTAGAACTCCTCGGCTATAGCACTACCCTCCTCTGGGGACTCCTCGAAAGCGCCATTGGCCAGGCACTGCTTCGCAGCCCGATCCAATGCTTTTTGATCAAAGCCCATGGCCTCACGCTCTTGTTGAAGGCGATCGGCTTCAGCTAGGGTCTCTTCCAGTGTCCACCCTTCTCGCAGGGTGAGGTCGACGTAGAAAACCGGTCTGCCAAAGGACTGCCTAGTCGATTTGCCCCTTAGGCGCAGCTCAAGGGGAAGGTAGGCCAATCGATTGCAGGATATCGCCTGGAAATAGCTCAATCGGGCCTCTAATGTCCTGATGGAGTTAAATCCGGTTGTCCTAAAGATAAAGCTACCGAGTGGGTCTTCCTGCTCGCCGATGAGCACACTAAGCCGGGCAAAAGGCTTACAACGCCCTCCTGCGGCCAACTCACAGCCATCCGGTGATGGGCAGGGCAACTCAACTAGCCCCTCTTTGGTCCGTCGACGACAGGTCTCCCCATCCCCCATGCAAAGGGGGCGACCCTGGTCTCGATCGAACAGGGCGTATGAGGCACGCAGGTTGAGATTCGGCTCACTGAAGAGCATGCGCACCGGCAAACGGCGCAGCTTCGCCTCGCCCTGCTGCTCGCGCAGCTGAGTATCGAGTGGGTGGATTATCCAGTCCCCGGCAGACTG
GATCTGGGAGGTGATGGTGAACTGATCATCCATCTGCGGCAATCGCTTACCATCGCGCTCGACTACCCGACCGATGGTGATCCGCCCGAGTACCGGTGGAGTTATAGCTAAGCCTTTGATCATCTTGGATCTCCTCTAAACGAAAAAACCCGCCTTAGCACGTACCCCGTAATCAGGGGGTACGTGCTAGACGGGTCTGATGGTGGTAATTTAAGTTGAACTAATCGCGGATCGTGAACCGCCTCGAGCCCGGCACCTGCTGTTTGTAGCGTGCTACCAGCTCAGGATAGTCGGCCTCAAGAGCCTTGGTATCGACTCTATTGCTCGGGGCGGATTGCTTCCAGGTCACGGTACCGGTGGGAAAGATAGCCTTCTCCGCCTCGCCCATTGCCGCCTGGATGCTCTGCTTTAGCACATCCTCACGGCGCCGACGTTCAGCTATCTCTTCACGCACTCGCTGCAACTGAGCAAAGCTGCGGCCAAGCTCAGCATCCTCCTGAAAGTCAACAACTGTCCCTGTATGGCTCTCCTGGTAAAGCTCGCGTAGGGCTTGAGCCGAACTTTCGGAACCATCGGGCCGTGGCGGAGTATCCGCCTCGACACAGGACCAGAAGGCTTGCTCCAGGGCGATCAACCGCTCTATCACATCCTCATCGCGCTCGATGCGGTGTATTTCCAAGTTCTGACCACCGATAAGAACAGCAACGTCAGCCGCCCTTTGCCCAGTCACCGCTAACTGGTGCTGAACCTGCAGCTGCACGTGTTGCGGTACTCCATTGCGCCAACGCCTCGCCTCCATTACACCCGTGGTCTTGATCTCCAAGATTTGGATATCAGGATCACCGACCACCTCACGGTCTAAGTTACATAGCATCCAAGGATGCTCGGGGTGCTGGAGGATGGCGTTGACGCGCCGCACCTTACGCCCGGTGACCTGGGCGTAAGCATCGGCTACGGTCGGCTCAAGGAGCTGCCCCCACCAGGCAGGATTTTGCGGATCGACTGACTCTGCCGATTGCTCGACCTGCCGTCCGCTCTTATCAAGCCAAACCTCCAGTGGCGAGGAGTACGGATGCATGCCAATTACAGCCGCTGCCTCAGATGCCCCAATTCCACCACGACGGATCGACAACCACGTCGCACGATCTAGATCCCTTGTCTCGACCAGACGATGCGCCTGACCCCAGTGCTCCGGAGCACGGCTACGCTGGCCACGGCTTTCAAGACACACAACGCCTTCAGTGGCTACCGTCATGGCGCACCTCCTCCCCACCGGCGCTAGAGCGAATCGGCAGCGTGAATCGATATCCACAGGCCATGCAGATGTGTGTATCGAGGACCGGACCGTCGAGTTGAGCGCCGATTCTCGCCCCACTGAGCCCACCTGCTACACCACCTAACAGACCTCCAATCAAAGCTCCCGAAGCAGTGCCCGTCGGGCCACCTGCCGCAGCCGCTGTAGCTCCCATCCTGCTGCCATGTATAGCACCAGTTATGCCGGAAGCTAATCCGGCACTTGCACCTGTGACTGCTCCTACAGTCTGACCGAGACGACGTTCTTCGACGTCTCTCGACCAGCATTTTGGACACGAATACATACGTCACCTCCACTTTGCTTTAGAAACCGCAAACCCCTCTGGCTTGCACGGAGATGATGTATATTTTAAATTTTGATGAGTCCAAGGTTTGAGTTAGGTCGACGTGAGTTAGGTTTTTGTCTTGCTAGGGTTACCCGGATGATTTAATTAGGCTGCACTTGAGTTACGTTATTATATAGACTTTTCTTATAATTTCATCAACCTAAACATAACCTTAGGCTATTCATGGCGAGTGTTTAAAAGAGATTCTAATTTAGGTAAATGTTAAGATCTATATATAGAACAGCTGATACGGCTGACACAGCTATTACAGCTGATACAGCGATCACAGAGGATACTACAGTTACAGCTGACACAGCTGATACAGCTAATGCAGCGGATACAGATACGCATCCATACACA
CAGTGCACGATATATCTAAAAAAACTAAGAACATAATCAGCATTTCTGCTATATAATTAAACTTCAGCCTAGTAGGCCAACTAAAACTCACTAAGATTCCCCTCCCCAAAGGGGAATCGATTCTTATCGATTACGACGAATGATTACACCCCACGAAGAGTAGATCCAGCAATAGCACAATCAACTCAGATACTTGCGCACTAAAAAAAGCAAGCATCTGGATCTGATACATGCTGTATCCAGATGCTTGCTTTCATGCGTCCTATCGTGCCTTACAACCCTCTTAGCAACCAAGCCTCAAAGATCGGCTCTTTTAAGATAGCCGATCTATTTGTTAACCACAAACCGCAAAAGGTGAAGACAATGAAAACCCAACCCAGATACCATCCACTAAACCGAAACTGGCGCCTTCACTACGAACCTACCTACCAAGGACTACCTGTGATGTGTGAGAGGCATGATCACAGCCCTCTAGTCGAGAATTTTTTAGAGAAAACGCTCACACTCTTTCAACGCACAAGAGAACAACACCCAAGAACTTTCGCACTAAGATTCGATCTCTACTTTCCAGCCGACTTCGACCTAGCAGCGATTAACCATGGCAGTGACATCATGAAAACCTTTTGGCGCTACCTAGACTCGCAATTTAATACCGCCTCTTTAGCGCATTCACCGAAAACCGAGTACATCTGGGTGCGTGAAATAGGCCCACAATCAGACAAACCCCACTTTCACGTACTTCTAATGCTCGATGCTAACGCCATCTTAAACCTTGGCAATCCAGCGCCTAGTCCTGATGGCACTTACTCCGACAACACACTCGCTCATCGGATTATCCGCGCCTGGCTGGGCGCTTTAGCTTATCCTGCTCTAGATCCTCTAGGTTCGTTGGTATATTTCCAGAAAGATTCTCACACTGGAGAGTTTATACGCTGGCTTCTTGATAGATCTGACGATTATAACTGGACAAGACTATTCTATTTAACCAGCTATTTATTCAAGACATACTCCAAGCCTGTCGGCCAAGGTATTCGCGCGTTCGGCTCATCTCGTCTCTACCGAAGGACTCCAGCTTAGACTCATGTACTTGTCGCGTTTGATATTTTTTTAGTACTTAGATATTTACCTTTTTGACTGAATTGATAAACGGGGATAGCTGGATTTGGCATAAGGAGCTTAAGGCTATCCCCGATTTGTCTTTTGTAAAGCCACACCAAGCCTCTCACAGATTTTAAGAGTAAAGCACTACGTATGATTCATTTAGCCTATAAATCGCTCATATCGGCTATGCGTAGCTCTCACAGGCATTTATGTTTGCTCTTCTCGATGGTTATTTAAAACATGGACATGTTTCGATTATAACTAGATTGCGGTTTTACCTAGCCCGGTAATTAACGATTTGTTTAAAAAACTACACACTGACTCTCTAAGCGATTTCTTACATTCATCAGTAGAGTAGCATAGATGCATCCCACTCAAATGCTCACAGCAGCTGTGCGTAGCTCACAGAGGCATTTCTAACTGCTAGGTTCGATTATTATTCACTTGAAACAAGATTTACTTGAAACAAGATTTAGTGGCTTTTTTGATTGAATTTCCACTTGACTTTTTCATCACACCGCACAATATTAGCGATGCTTTGCGGCACTTGGCATACCAACGCAATGCCTGCTCGAAAGGTCATTTTCGAAACGCACGGATTATTTAAAAGCACTCGATCACGCTATCCTCAGACAATTCCATCACGAGTTGATGTTACGCCCAACAAGCTGTGACCCTTTTGGACACTGGCGTCACGAGCTCAGACAACTTCAACGGTATCATTAAAAATTATAAATGAGCGCACTAACAGCGGTTACTTAACCTTTTTTATAGCTGTATGGTTTTTTTCAAAAATTTAAAAACTCTCGCATCAGCGATTGATTATCGCGCAACCTCCAAACACCATTTTCTATTCCACGCAAACCCTCAAATTCTGGTTGATCTATAACCTCCCAGAA
AGCATCAACGAAGACTTTATCTTTTGCATCAAAAGGTTCTTGTGGAACTGTTAGCGATGCACACTGGATACACATCATCTCATACTCCTCATCGTCACAGGCGTCGCTGAACACTTTATGAACGTAAACCCAATCCGGCTTAGTTCGCCGATCAGCATATCCCCTGGATGTTTGTTTCCATGTTAGTATCCACTGATTTGTAGGGCGCATGTCTGTTGGAGCTAAATGCACCTCTTTTTTAATATCATCTTGAACAGGCTGCGGAATGGCATCAGTGTAGTCGAATTCACCATCTACGTACCAAGCAATATAAACCATCCCATCGTAGTCCGGATCATAATCGAAAATACTCTTTTTGGCTCTTATCAGAGGAACATGAATTAATTGATTAACTATGGCCTTTCTATGCTTTTCTCTCAGCTCCTCTAGACGGTCTCGCTTTGATCGCTCGTCCGGATATGATTTCTTCGCCTTTTGGAGGTACTCCCAAAAAAGACGGCTTGCTTCTTCAAGCAAAGACTCTTGGGAAAGGCGAACGCATAGCTCCTCATTCCCTGATAAGCTGAGCCCATTGTTAGATAAATTGGCACTCCCAACGATTACACTGCTGTCGCCAATGTAGAGTTTCGAGTGAAGACCATCCAAAAACCATAATTTTTCCCAGCCAATCTCTCTTGCAAGCTCCGCGATAGACTCTGGATTACTTCCGAGAGTTGGTGATACAATTACATACTCTAGCGCTCCTGGATCTACGAATTCTCTCCAGTTTGCACCTATATAAGCTACAGCGATTCGTTTAGGGGCAATCTTTTCCAGTTCTTTTTTATTTTAGGTGTACCAATCAGCATTCTTAATTCCAGTTGCAATTTGGTAGACGATGAATTTCAATAGACTGTATCATGTGAAAAGCTTTGGACATATTTGCAGCATTCCAAAAAACTTGCACCCTGGCTACGCATTGGACTTATGAGCGATGTTTACGATTTTGATACAGGTAAGCTTAATTAACCCTATACAGCCTCTCAAAATAGCTGTGCATGGCTTATAAGAGTATTTACGCTGCATGTTTTCAGTCTAAGCCTTGTTTGATTGATATATTGTGCCTTTAGTTATTCACCTGATTAACTAGACTGATTTTACCCCATGTTTAACGGTTATCTTGAACTTTGTCACCGATTAATCGATTTCTACTTCGCAGAGCGCCTGTGAGTGCTTTTGTTGTTTTTGATGCAGGTTGGCTTATTTTTGTTTGTCAAACCTCTCATAAAAGCTGTGCGTAGCTCACAGAGGTATTTTTAACTGATAAACTCGATACTTATCCACCCGAAACAAGACTTTAGCACAACCCCTTCACTAACCTACAGGATCGGACCTATCTGAAAACTAGAATCACTTCCCTCACTCTGTCACAGCACCACTCGATTGACTCTAACCACCTCTAGCCTCTCACCAGACAACCCATAGACACCCTTTAAGTCACCTAACCCTTCAGCACCCACACCTCTACTCATCACCCTTAAACGAACAGAGTAACAACCACTGCTTGGCCTACCCAGAGTCAGTCTAGCATCCCTCACCTGAGCGGATCAGCAGTGAGAACAAATTCGACTAAATCGCATGATCACACCCTCACCAAACCCATCAACTACGATCATCTACTACGGCACTGCTAATGGTATGAGCGAAACCCTTCTCCCTTGCCATTGCACTGACTCAAAACCCTAAAACGCTCAGGCAATGGCAGCACAGCCGGATCGCTCACTTTCACCTACATTTAACCATGAGGACAGAAACAATGTTAGCGTCTACCAAAGATCCTCTCCAGTATACTGCACTTCACTACGACAGTGAGTTCAACGGCTTGCCCGTAGACCAAGGTCAAGGGCCACTAATCGCCCGTCATCTCGGTCTACTCGAAGACTTACTCCAACGAGCTAGAAACGCGCACGATAACTCCCTAGCTGTGCGCTTCGACCTGCACTGCCCGAATACCATAGCCA
TCTCAGCGACTCTCAACCAAGGCAACGGTCTTGTCAGCCTTTTCTGGAGTAATCTCTACGAGCAACTCTTTAACGCTCAACCTGCAGCGCCCTTCGATCTGCACTTTGCCTGGACACGTGAGCACGATCCTCACACCGGACAACAGGCCACTTACAAATCCCTAATCTTACTAAACGCTCGCGCCTTTCATGGGCTCTACAGTAACGAGCAGGCGCAGGATATAGGCACAAGTGACAGTCTCGCCGGATGCATCCTGCGAGCCTGGGCTAAATCCTTACGTATCTCTGACCCACCACCGCCTGAGCTAGTCTCCTTTCCAACAGATCCGCTAAGCGGCAAGACCCAGGTCATGCTCTTGAACAGATACGATCATAACGCCTGGCGAGAGCTTTTCACTCAATCGAGCAGCCTATGCAAATACGAAGGTAAGCCGCTCGGTCGGGTCTTCTGCGCATTTAGAACTAGCAACCGCCAATAGCTCAAGCACAAAGATGCCTTGCGGCTCAACCAGCCTCTAATACAAGGGAGGAAAGTCGAGCGAGGTACTTTCCTCCCGCTCATGTTAAAGGCTTACCGGGTAAAGCGCTAAGCCATACATCATGACGATGAACTCGTTAAAAGCCACATCAACTCATCATCCTATGACAGCAGCGCCGTAGTCCTACCCACATCAAATCAAGGAGATCCCAACCATGACAGATAACGACCTGCTCGAGCGCATATACGCGCAGATGCACCAGCCTATCCCCACCACAGCAAAAGCACAGCCAAGCACCGGCTTCACAAACAAGCAGACCACAAACGATACCGCAGCACCAGGATCTACCCAAGATACGCCATACACACCGGCGCCGGTTCACACTGAACAACTACACAGCGCGAATCTCGCCAACTCACCCCACAACGACGATATCGATCCGACCACATTGATCCAGGCGGTTATAGAGCATCGCCAAATCCATCAACTCACCCAAGAACAACTCGCCACTGAGCTTGGGATCAATATAAGCACCCTCAGAGACTGGGAGCAGGGCCGACGCCAGCCCAAAGGACCATCCCAAGCCCTGTTACGTCTATTCATCACTTCATCTAGAAGCTAACCGAGCTAGAAATTTCTACTGGCTCAGCATCCAAACCAGATGCCCCTTCTCCCGTAACACCACACAACCGAGACCTCGATGATTGCGGCTGCCCCTGCATATGGGGCGCAGCCGCGACTGCCTGGAAACGCCTGTTCTTCCTAAAGAGCATCGCATAGGCCCGAAACCATGGCGACTCAACCGTTTTACGTTAACTAGAACAAAGGCACCAATATGCTATAGCCAAGAAGCGCTCACACTTTCGCATCAACTCCTCAAATTCCTTTCATCTTTGGAGAGCTCTCTAGCCAGCCCCCTGCTCGGTCCATTCCCAAGGACGGGGAATGGACCGGGGAACGGGGGGCTGGCTTTTTTACTTTCTGCTCAATCTCAAATACTCCACCTCGCGCCTGCCACTGCTAACACTCAGACGATCCCTTGAGCACTGCTTGAGGCTAGCACGTCGACTAAAGATCGTTCATCAATCCAAAACGCCAGACGACAGAGTACCGTGACTTAATTTCAGTTACCTAAGGAAGAGAGACAGGATCACTTCTAGTGCGAGCCGCCTGCACGCTACCTGATAGGATCACCCTGTGGGCATCTAGCCTGACGGTCCAGAGTATCGGCTGCGTGAACCGAGGAGGCGAAAATCACAATTCTCTGGTTCACTGATAATTACCACGCTCGTAGGCACTGTAGCACTCAACCCAAAACCGCACAGTGCCTTACCCCCCTAACATATTGGGGAACTTTTTTGGGAGAATTTGGGAACAATTTTGAGGTATTGGAGATCGATACACACATAGACTGAAAGGTTGAGCCTGATATGCTGCCACCGACATTGGCAAGCAACTAATCTCAGAGGAACTTTAGACAACCAAGGAGTATGTAGGAAAAGTACCAATGGCTCGCAAGCA
TCAACCGAGGTGGTATAAACGCGCCCGCAAGGTCATCCAGAACCAGCATTCCCGGTGGCAGCTGATCGATCCGCATCACCCCTCCGGACTAGAGTTTGACCCTTACAAAGTAGAGACCGCCCGCCTAATTGCCTTAGTTGACCTCGAGGAACTTGTACACAAGCTCCTCAAAATCCCCAATGATGGGAAATCAATCTGGGACGAATTCCAGCCAAGCAGCGCAGCCAGTTCCTCCAAAGGCTTACCATTCATGCCCCATGAGGATAGCCTAATTGCTCAGGCACAGCGCATCTTCAAAGCGCTATACCAGCCGACTATCTGGGGCATAGAACCTCTCCTGCAAGAGTGGAAACCCCTAAATATCAACCCTCTCATTATGAAAGCTGATGACGAGCTCAGACCCCATACACAGCACCACGCTACAATGCAGAACCAGCGTGGCCGTTGGTTCCCCTCATCCAAGGAGAGTGTATCGGAACTGCTAGAACGCCTCCAAAAGAGCAGCCATAGCCTACGCAAGTTTGCCGAGGACCCGTCAATACGCAACCGGTTACGCAGTGCGCGCAAGCGCAGTAAGAAAATCGACGAATACCTCCGAGATTGCTTTCGCTGCAACCCCAAGCCGCGGCTCATGATACGGGTCGACTTAGGAGCGCATGAACTCCGTCCTCACCATTTCACCCCTGAGGGCCGAATCTGGTTAAACCCAGACAATCTGCAGCTCCATGCCGAGACTCTTGAAGAGCTGATCATGAGAAGAATTCTCGCGCGCCACAAGCCGCAACGAGATTGGCTCAACCGGTGGACAGGCTACCTCGTAAAGCGTCAATACGATTTTGACAAGGGGTATTATTGGCACCTAATTATGTTCTTCGACCCCAATACCGTAGGTAGTCACCCTGCGACCATGGAAAACGATCTGAGCGAGGCGTGGAAAGAAGTTACCCGCGAGCAAGATCTCACAGGATTGTGCTGGGGAGCCAATCGCGATATTAAAGCGCGCACCCCGGAAAGAGGCAGCCTATTGCTGCCGAAGAGCAAAAACCTTCCTATAAGGGAACTCGCTTTCTATCTCGGTGATCTCGACTACATCGCCCACGTCCACCGATGGGACAATCGCCATAACCTGCGCCATCCTCAATCCCCATCCAAAAAGTCTGCCCCCAAGCGCAGCCGCTCAACTGTTAAGAACTCCGCGAAACAGCCGAAAAAAACTCTGATAGAAATGGGCTCCGAGCGCTTATCGCCGCTTGACACCGGTAACGAGCAAAGCCAAGGGCAGGGCTTCGAGACAGGCTCTGAAAAGGGTGACGATTCCCCGCAGGATCCACCCGGTAACGCTTAGCTCTCCCCTCGCGACCATGCCTCCCAACTACAGTATTCATTATGCATATCCCTCACCGCCTCTCGCCAACTCTCGAATACGCGCTCTCTCATACCTGGTGATTAGGAATATCCGCATCTCTAATATCTATATTCCGCCTTTCCTCAACACAGTCCAAAGCCTGATCGCTTCCCGTGTAGCTGCGAACTAGCGCATCGATTACCTCGCTTCGGCTGCCAAGACCGAGTTCTTCCTTGAAATTATCCAAACTATCCCGGATCGCGGCATCCACCGTCAGCTCAAGCCGCACCTGGCCCTGCTGTTGCCTCTTTTTACGCAGCCGCTGTTGACGTTCGCTCTGACTCATGCCCACCTTCTCAATTGCGCTGATCTTCTAGCCAGAACCTAAAGAAGAAGTCGCCTATTGTCGGCCGGTGACGCCTTAAACCTCCATCCCGCGTAACAGGAGACGCACCTGCAGCGCTATTCAAGTGACCGGTGACGAAACAAAACGCGCTTTGAGCGAGCAACAACGCCGAGCGCCCCTGTAAGCAGTGGGCAGCTGACAGCGCCGCTGGGCGACCCAGTCGAATATACATCGAAGGTTTGCAGCGGCTAGCCGAGATAGCTGCCGGGGGCTTCGTACCTCGTCATGAATTCAATATCGCAGCCAAAA
GATACGCCATTAGCCCTAAGCCAATGTGGAACAGCGATGAGCGCAAGACCATGCGCGGTATCAAGGTGAGGTTTCAGAACGTAACTACACGGCGCGTCGAAAAAGGCGAAAACGCCCTGCCCAGTCATGAATTGGCGCAAGGTGGTCAAATCTTGCAGCGGTTGCGCCGTAGCACTATCTGGGCGGCGCAAGGCGCGCAGAAAAGTGGCATCAAACAGCCGTGAACTCAGCATTTGATCAAGCCGGCCTCCAGCGCGAACCAAAACCCGATTCCGCGAAAACTCCAACAACCCCAAGACTATCAGCAATCCGACCGCGACTATCAGCAGCATAACCAGGGTCTCGACGCTCCCGCTCGAGAGCACCCGATCAAACATCTGTAGCATGAACAGCGGCGGGACCAGCATCAGCAGGTTGATAAAGAAGCTGAATGCGGCCACCGCGATTATGGATTGTTTGAATTGGCCGAGGCACTCGCGCAACTCGCCCTTGGCCTTAGCGCCGGCTCCTTGAATACTCTGAGGATTGGCCATGAGCCTCTCTATTTGTAGGGATAATTGGAACTTGCTTGGTAACTTGGCTTAACCTGGACCTTAACGGTTCAGTTTTTAAAAAGGCGAGCAGCAGTTTAACTACAAGATTATCTTGTGGCTAGTGCTTAGGCTTATCAGTGCTTAATCTAGCGCCTGATAATCAGGCTTTTGACAGAAGACGGCGCATCCAGTTCAAGGAACGCGGCAATCCCCAACGCGCTGCTTTTGCCTGTTCTCCCAGCATCCGCAGAACTTCATGCGCCTGGGTGCGCGAAAGAGCTAAACCTGCAACAGGCTCAGAAGCTTCATAGGATCCTGGCTCTGCTGGCAACTGATACCCAAGCAGACCCAGTCGCAGCTGATCACCATCGACTTGAATACGTATATCACTAATCAAATACGAAGCTGCCCTGCTGCTCACACAATCATGCCTTTCTTCTTTTTTCTGCTCCTCAGACCCCTCACTGCCAGAAATCTGTTGCCGATGGCCAGCTATCTCAGAGCGTGCAGCTATGTGCTCAAGAGCCATTACCGTGTCATCAGCTGGCGCGCCTTCTCCGGCCGGATGGCTTGCCACGCTCGTTGGTCGTTGTGGCCTTAATCATTCGTTGGGCTACTTGGTCCGCCGCGAAGCCCAGGACAGCGGTCGCCGGCAGACACGGAGAGCTGGCTCTCCCTCAACCGAGGCGCATGGCCCCTGGTGACAAAGCTGTTGGATCTACCCCCAATCGGCGCTCAACGCGTATCGAATAGGGTACGGCCAACACAAATGCCAGCGGATGTGGCTATTTAACGTAAAGGCGGGATTATACGTACTACTGAGTGAGCATCAGCACCAGGAGGCGAGCCGCTGCAGTCAGGCTCACCAGTGGATTGACCCCGGAAGTGTGCATCTTGGGAAGGTCAGAGGTCATCGCGAGCCCCATCGGGATCAGGGCACAAAAGTCTCAATCGAGGCGAAAGAGTCTCGGCGGGAGGTGCCCCAATTAGGACCCGCGATGACCCGAGATCACACTAACACCATACAAAGGTGAGTGAAACCCTGGTAATAGCCATTTGAACACATCAGTAAGGCCGTCCAATCAAACGTTATTGCGACGCAATAGATTTGTGAAGCCGTTCAGTTAACGCTGAGAGGTGGTAATGTCCGGTAATTAGCATTACGGCAATTTATGCCTATGCGCCGTACCGCATATAGGTTCACGGTGGCTAATACCTTGCCTGCACTCCCGCACCGGGCCAGGGTGGGCGGATTTACCGCATTCCTTGATTCAGGTTGTGTTGTTCAAAATTAAATCAGTGAGACTGCTAGCCTAGATGTGTCTCCCTTCTAAAAGACTGGAGGTCTCTATGCCCCAGCTAAGACGCCTTTATCCAGCAGCCATAGCACTCTCGAGCGCCGTGTGCACGGGCTCCCTGTCCGCTGCGGAGCGGTTCACCGGAGGATATATCGGTGTAGAAGGGGAC
TTGTTATCTTCCGTTACCTTTGAGCGGGACTTCGATGATGGTTTCGCCGGGGAAGTAGTTGAAGATTTTGTAGAATATGAAGATCCAACAGATTTCTTAGAAAATGATGATCAGCAGGAAGGGTTCGCAAGTGCTCTTATGTACACATCGCATGAAGCTGGAATTGAGGAGGTAGTTCCCGGAGGAAGTGTAATCTTAGGCGGCGGAATGCAGCAGGATGGCTTTTATTCTGGCATCGAGGCCCGTTACCACTTCGGCGGCGTGGATGAAGAATTTGAAGATGATCTGGAAGAGAGCCTTGAGTTTGAGGATGGCTATTCGATCTCTGCTCGATTTGGGACTCTGCTACGCGAAGGTAATGTAATGCTCTATGGAAGCTTGGGGTACGCAACTAGGGAGGTGACTTACGAAGTTTTTGAGGACGATGAAAATAGCGATAGTAACGATCATTCTGGCTTTCGTGCAGGCATTGGTCTTGAATATCGGCCCGATTATCCGCCGATCTTTGTTCGCCTTGAGACTTCGCGAACTGATTTCGGGGATGAAACTTATGAAGACGAGGATGATGGTAACGAGTTTGATTTTGCCGATATCGATGACTTGGTCGAGCACGCCGCGCACCTCGGGGTAGGATATCGATTCTAGGATTTAATTTGAGTTCAACGTATAAACTTCTCGCCTCATCTGTAATGATCGCAAGCCTAGCTTGCTGATCTAAATCAGCGGCGGAGACTCGTCAAGGAATGCCTTATGAGACCCGCCAATCCCAGAACTCCTCCCGTTTGCAGTGGCAGTATAGGGCAGCCAGCTCGAGCACCTAGCCTATTTTAATGCGCCGGTAAGCGTGGGACGAAGTGTGTCAGGATGATGCCTGTGTTGTAAGTAAAGGCGGAAAAGAGACTAGCCATGTACTCTGCTCGCGAGGATATTCTCTCATGCATTGACCAAGCACTCTTCTTTCGCCAGCGCAAGGCGACGGAGTATCCCAACGACTGTCGGAACCAACTTTCAGTCGCTGAACTCCAGCGCCTTAGGGTCTACGTCTCAGAGCTACCCGAAGCGCACGACTTGTTCATCAAGTGGGATACCGCTTGGGCTAAGGTCGAGGATGAAGCGATCTACTTCTTCGAGGAAGTCCTTGAATCAACTGGTGAACGGACGGACATTTTCGCCCGCTACGGCTTCCATGGAAGGGAGGACCCGGAATCTTTTTGTGAGAGGCTTGCAGCACGCTTAGACGATTTTGCTCTACGCGGAGAGGAAGGCGGCACCGCATAAGCCCAAAAGGAGCAGCAGCAAGATCAGATCCGCAAGGAGCGTAAGCGTGCTAAAGAACTTGAGAGCGAGCTTCGACGCAAGGAACAGGCGCTTGCTGAGACAGCGGCTCTATTAGCGCTGCGAAAAAAAGCCAATGCGATCTGGGGAACGGAAGACGAGGACGCATGATCAGTGCCTCAGATCGCGAAACTGCTGTGAAGCTGATCGACGAAGCGCGTGCGAACGGGGCTCGGCTCGAGCCGGCCTGTCGGGAGCTAGGTATAACGCCGCGCACTTACCAGCGCTGGAAGCGTGTCGACGAAGGAAGCGGCGTCAAGGCCGACCAAAGGCCATTGACGCCGCGGCCGACGCCTCCCAATGCGCTGACCACAGAGGAGAAAGCGGCGATACTTGAAGCTTGTCATAGGCCGGAGCACGCCGACTTGCCGCCGGCACAGATCGTAGCCCGACTAGCCGATGAAGAGGGGATCTATATAGCCTCAGAATCAAGCTTCTATCGCGTACTACGCGCCAACGCGGAGCAGCGCCATCGCGGTCGCGCAAAAGCGCCGATACGGCGTAAGCGTCCGACCAGCTACCGAGCCGATCAACCCAACACGGTTTGGTCGTGGGATATCACATGGATACCCGGCCCAGCAAAAGGTATCTTCTTGTATTTGTTCATGATTATCGACATTTACTCGCGTAAGATAGTCGGTTGGGAGGTGCATGAAAACGAGACC
GGAGCTGCGGCGGCAGCCCTGCTTGAGCAGACCGTGCTAGCCGAGGGGTGTTTAACGCGCCCCTTGGTATTGCACTCGGATAACGGTTCACCGCTTAAGGGAGCCACTATGCTAGAGACCATGCGGCGGCTGCAAATTGAGACCTCATTCAGTCGTCCGCGCGTATCCAACGACAACCCCTACTCAGAGGCACTGTTCAGGACATGCAAGTACGTCCCTAGCTTCCCTTCTCGTGGTTTCTCTGGGCTCAAGGATGCCCGCACGTGGGTAGCCAATTTCGTGCAATGGTACAACCATCACCACCGCCACAGCAGCATAAAGTACGTCACACCGCAGCAGCGCCACCTGGGACTTGATCAAGAGATCTTAGAAGAGCGCCAAAAGCTCTATGAGAAAGCCAAGGAACGCAACCCTCAGCGCTGGAGCGGAGAAACTCGGGATTGGAGTCCCGTCGGCCCCGCCTGGTTGAACCCTGAACTTGACAGCCAGGGGGTCCAGGATAAGGAGCTTTCCAAATAATGATTGAGGCGACAACTACCTTGACAAACGCCGCTCCTGATAGGCCAAGGTTTCTGCACAGTGCCCAAGTTCACCGAGGCCTCCCCTGCCCCAATGCAAGGGCGCATCAATGCTCTATGGCAATTATATCCCAGGTTTACGATCTCAGAAGACAGCGTTACCCTGTAACGATAATAGAGCTGCTTACCTAAACGGGGTTTAAATGTGATCGGCTATCACCACCGTAACCCTTACCAAGGCTACCCGCTTCTTGAGCAACAGGATGATGTAGAGCTAGTGCGTATTGACCCTGAGAAGAACGTGTGGCGATTTTACTATCTCTGGCTAGCACCAGATTTTTTCTGCTCAGTAAGACTGGTGCGCTTCTGGGGAAGAATCAGCACTAGTGGAGGACAGCACCGGGTAGAACCATTTGATGATATCGAGCAGGCGCGCGATGCCCTCGCAAAGATCGTTAATCAGAAGCTGCGCCGGGGATATAGATACAAAGGTGCGCTTTAGAGAGATCCTTGAAGCGCACGGGCGTAAGCCTGGCGTGCGGTGCGGGTAATCGCCAGCTGGCGGTTAAGCCGCTCGATCTCCTGATCTGCCACACGCAGGTTGATTAGCTGCTGACGGGCGTTTTCACTCATCTCATCAATCTTGTACTCGGTACCATCTATGGTGACAGTACGCTGCTGCGTCTCTTCTTCAGCCATCGTTTCCTCCTAGTTGAATGTCCGGTCTAACTCTGTGGCTCACGATTGTAACGGCTGTTGATTATAGTTCCTGATAAGGTGGCAGTACATCACTCATCAAAGGCTGCAAGATCAATTACCGCTGCCGCTTGAACTGCATACTCTACGAAGAGCTTCTCACAGGCGCGATGGTTCTGGCGCATAACCAATTCCGACTTGACTTGCAGGCCCGACGGCGCATTTACTCGCTCACGCTGGGACAACTGATCCCGCGTTATTCATGAGAAGCCGAGTTTAACCCTTAGAAGAGATGTTGCTTGAACAGGCTGAACAAGAGAGATTTGCCCCCAAAACCACTGACACAAAAGCGCCATTCCAGACTACCCTTAGGTACACCGGATTCACGATCGTTCAACTCTTCACTCCTCCGACTCCTTCGACCGGGAGCCTTTGCTCGAGTACTTCAACTTCCTAATTTCTTCCTGGGTCAGCGCGGGAGGGTCCCCGAGCCTGATATTTCTATTCAGTTCGACCATCAGAGCCACGATATCCTCTTTATCTGCCTGACTCCTACCTGACTTAATCGCCGTCATCAGGCTCTCCATTGGTAGCGAGAGAACCAACCCGCCCTCATCGAGCTGGACCTGGACACCAAGGCGGCGGAAAACCGCCAGCAATGCTTCCAGTGTGCTTGTCCTAACCTTGTCGACATCTCTAAGCGTCTCGACCCTATTGATGGTTGGTCGCGAAACCTGCGAAGCCTTGGCTAGGTCCGATTGGCTCATATTGAAAAGCGCGCGGAGA
GTCCGAAGCACCACCGAAACCTGTTCAGCGTAACTTGGCAGATGCTCTTGACCTTTTCTGTCCTTGTGATTCATGATGAATATCATTACTTGCTTATGAATTTTGATGAACCATTTTAGCACCTCATGAAACACATTATGCCACCAATCGGGCTCGTTACAAACATGCTCACGGGGCTTACCGCGATTGAGGAAGCGTCGTCATCACCATGAGTTGACCCAACAGCAGACATGAAGAGCTTGCCACCACTCAAAATGATGCGTGAACGGAGCCGCAAAGATACGCCACACGCAAAGCGCGAGATGAAGTGCTCAAGGCTCTGTGCAAACAGAACGAGCATCGTATGGAGTTTGTGCCGGAGGATGAAATGCTATTAACTGACAGAGAAATCAGACAGCTCACCGGGCGGAAGCGGTTCGCCGCACAGTGCCGCGCCCTTGGCAGGATGGGACTGCCACATGATAGAGACCCGGATGGCCGTCCGCTAGTATTGCGGGAGGTAGTTATGGAAAGATTAGGTGGAGAGAAGCCATCGCAAGAGTGGGAGCCTGGAAGATGCTGAGCTTGAAGCCCTGCGCAGGGTGGCTAATGTAAGAATCAAGGCAATCATTGATATCGCTTACCTGACTGCTCTACGCAAAGGCGATCTGCTTAACCTGCGCTTATCGGATCTGTACGACGATTGGATTCAATGCCGAGTCGGCAAGACAGGTCAGGAGGTCAGGATTGGTTGGAGTGACGCGTTGCGAGAAGCTGTTAGCCGAGCTCGTAAATTGGCGCGCCGGGGAGGATTCGAACCCCCAACCTTCTGATCCGTAGTCAGAGCCTTAAACAGCCCTTAAACTTCTGATAAAACACAGGTTATGCCTTTCTGATATCTACAAAAGAAACACTCAAAACCCTACGTAAGCCTTTGATAAATAGGAGCCTGGTTGCATTTTTGTAGACAAGTTATCACGCTTCATGCTCGTATAGCTTATCCATGATCAAGTTTGCGGCCTGTGCTAAGGAGCCAAAGCAATGTCTTAAGCGCACAGCACGATCAGTAGTCTAGGAAGGTTTCACTTTCTTTGCGGTTTTGTTTATTAGGGCCACAGTAGCCGGAGCTATATATCTCCAATCCGGGCTCTGAGCGTCGTGTGCCACCTCTTCTGCGCCAGGCTCCCCTATTTGCCAGAGAGTTACCGCACCCCAGCACGTGTTGACCAGCGACCACCTTGGCCGGATAATTCAACTACTCAATCAATGAGAGTGTCCAGAATCTGCTCAGCAGGACGGGATGAGACAGGCTAATTCACCCCACCGCCACCTCTAATTGGGCTCTGAGGTGGTAACTTGCTTGAAGAGACGACGGAATGAAGGCTGTTCCCGGATACATTCTCGTCCCGCAAGTACGGTAGTATAGTAGGCATGACGATCCCGAACTGGAACCATCAGGGGGTCCTCCCCCCTTACGTAGGGAGTCAGACTGGTTCCGATGGCCGCTCTCCCTATCCGACGACACTTGTCGAAGTGTTAGAGCACTTTGGCACTTCCCCTGAGCGATGCAAAGTCCTTCGTGGGTTTCTCGACTATCGGCAGGAGCTGTACAGCATCGGGGTTAAGCAGGGATTCCAGTGGGTGAACGGCAGCTTCGCGGAGAATGTGGAGATTCTCGAGGAGCGCCCCCCAGAGGACGTGGACGTAGTGACTTTTTTTGCGGTGCCTTCGGGAGAGAGCCAGCAAACGCTCTTGGAAAAAACGCGGACGTATTCCATCCAGCCTCAGTAAAACGGCGGCATGGTGTCGATGGTTATCCAGTTCCTTGGGATGGCGGCGACTGGCAGAGGCTAGTGAAGCATGCGAGCTACTGGTACAGCCTTTGGTCTCATCGGCGTAACCACCTCTGGAAAGGATTCGTTCAGATCCCGTTGGAGCCTTCTGAAGAAGTCCCACGAGCGGTTTTGGATAACATCACGTCGGCTATCGGGGAGAATAAATAATGACGCGCCCTTCTGAAC
ATGAGCAACTCTCTGCCGAGAAGGCGCAGCTCGAAAGTTTCCTCGCTGATGTTCCGCAGGAGAGTGCAATCGAGCGGATAGCACTAGAGGATCGCTTGGAAAGTGTTCGCGTTCGGCTCGAGGAGCTCGCCACCAATAGTAGCGAGGCAGCCCGGGTGAGTCTGACCTTCTCAGGCCAGCCCGTGCTGAACTCCGAGGGTATTTTGGCCGATTTCGGTGCAGCGGCTTTGCAGAAATTCGAACAGATGGTGGCAACGGTGGGTGCTTATCTACGCACAGGCGAACTTAGGTCCTCGGGGCCATTGCCAGCACGGCAAGAACACCGCCTTATGATCACAGGGACCACCGTGGGATCTTTTGGCTTTCATCTAGAAGAGAGGAGCGAACCTGAACAGCCGGAGATGGGGTTAGAAGATAGCTCGGTCCGAAAGGCTTTGGACCGAGTAACTCGTGTGATAGAAGGAGCTCTAGAGTCCGAAGAGGCACTAGCTGAGGAGGTCTCCGATCTTGATTCTCGCTCAGTGCGAAGTCTGCATGAATTTATGAACACAGTCGCAAGTCGTGATGCCTGGTTCCGGCTGGAGGCCGTCGGTCGTCGCGTGGAATTCCCCGACATGGAGCATCTAAGCCGTACTGTGGAGTGGCTTCAAGAGGACAATATCCATGAAGAAGAAGCGGAGCTGCGGGGCACTCTCGAAGGTTATTTGCCACACACGCGCACCTTCGAGCTTGCAGAAGACGAACTAGGCCTCGTGAAGGGTAAAATAGCTCCAGCGATTGACGATCCTCATGATTTACAGCCCTACCTGAAGAAGCGGGTTACTTTGAGCGTGCGGAAAACAGCTCTTGGAGGCGGTAAGCCCCGATTTGCTCTTGTATCCCCTCCCAGTTCGCCGGACGAGGATGCCGACGTGTAAACTAGCCGTCGTAGCCTTCTCTAGCTGAGCCAACGCTGCTCGGCATCTTCCACGCGGAGGCGGGTTTCCAGCGTGTATAGCTGGATGTCGAGCATGTCGAGCGGCTTGGCTTCCTCGCAGGGCGTACTTGGCGGCTTTGATCAGCGGATTTTTGGGTTCAAGTCCGAGTTGTTGCTATGGCTATTCATTGCGTTGGCTTCCTTCTCCCCACTTCATTGCACCCAGCCAGAGGAGCTTGCCTAATGCGGGTGCTGCTGTGGAACAAATGAAAACATACGCCGCAGGACTCCACCCAAACCAGTAGCAGATAAAGCCGGTCACGATGCTTACCACGGATAGCACGAACGCAGAGCTAAGAATAACTGGGACAAGAGAACTCCTGGGCGGACCTATTAAGAAGGCTGTCATAGCGACGACGAATAAAGAAAGGAGCAATGAGCGTACGAGAAGATCTATCAAGCTTTCGCCTTCAAACCACGGCAAGCCAATCGTATCAACTATAACTAGATTGCTAGACATAAGCGCAACGGCAAATACCCCGAGACTGCCCCCAAAACGCTCAGATAGATCACGAAGCGTATCTGGTGATATCAAACCCGACCTCCACTGTCATACCAGTTTATTCAAAAGACGCGATCTTTGCGCGTATTTTACTGATGCCGTATAACGTCATCCATAATCCCGTATTAGGACGCGATAGCATCATTTTACCTATGAAGCCCGTACTTCGATCCACGAACGTCAAACGCCGGCCAGGTACCTTCGAGGATATCAACAGATGAAGCCGCTATACTCACGCTCTCTATTGATCCCGACTTATAAACTACCGTGTTGTTTTGGTCTGTCACGATTTCTGCTACTATTATCTGATTCGCGTCACAAACAATCGCTAAATTCGGATTGTGGGTTACTATTATAACTTGCCGATGCTCTGCGGCAGATTTGATAGCACTCACAAGAGTGTTGTAAATTGTTTCGTTGTCTAGGTTGCCCTCGGGCTGGTCTATTAAAAGAGGCGTGCCCTCAGGGCTCAGAACGAGGTGAAACACTAAGAGCAAAACGCCTCTTTCACCTGGCGATAAC
CTTTCAATTCTCTTTTCTCCCCATCGAATCTCATATATTGCGGCGAACTTTTCGAGCCCGTACAAGAAATTATACAGCTCATGGCGGGATGCGCTTCCTTTGATTTGGTCGTCTGGGTTGCGCGTTGGCTGGTGACCTTCATCCTCTCGCAAATCAGCTCGAAGTGCTTGTTCTAGCGCCTCCGCTACTTTTGTTAACGTTCCGGCTGTATTATCAAAACACTCTTCAACTAGACCACTAGCCTTCTGTCGCCCTTCTTCGGATCCTGAAAATGTTCCGCGGCGCCCTTGATTGATATAGGAAAGGAACCCGTCAACTAATCCGACGTCTATGATTGATGCTGCAAAATTCAGTGGTACCTCGTCTACACTCAGCTCCTCTTGCCACAACCGCTGCGCAACAGGCTGATATAATTCCTCCAGCTCCTTTCGCTCATTCTCGATGGCCTCAGCAATGCGCTGGGTAATGCGGGCACGTTCCCGTCGTGCGTCTTCAAGGCGCTGGTCTAGAGTTCCATCCTGTATAACAGAGATTTCCCGTCTTAGCTTCCGCAAGCCAGACTCTTCGTCATTTAGCAGACCCCAAACTCGTTTCTTCCAGCGCGCTCTATTTGCAAGCTCGCTATCGAACTCACGCTCTATGGCGCTCTGATCCCTTTGTAGCGCTTGCAGCTCCTCGTTGCAACGTGCGCACCACGCTAACAGCCCCTCCCCTTTTTCGCTGCCCAAAATAACATGCACTCGTTCCAGAGCTCTGTTCGCTTGCTCCTGTTTATCCCAGGTTTTGTCCCATTGAATTGATAGGGAAATAAAACTTTCCGGGTCCAGATCGACAGAATAAAAATCCTCACAGTATTTCTGAATCGTGTTTTCAATATGAGCTTTTACGTTCTCACCTACGCGGCTAAGGCGAATCGCACTTTCGCTGTATTGTCTCCATGCTTTCTCAATTCGGCGCACTCTTTTTAGCCTTGCGTCGAGATCTCTAGCTCGTTTCCTTAGGCGTGCTGCGTGTTCCTCGACCTGCTTATCAGTCGCATTTATCGCTGCGCGTGACTTCACGGACAACCTAGGCTTTCCTTGCCAAGCCTCCCGTAATTCCTCTAGTCTCTGGTTCTCAAGATACTTGAGCCGCGCTATCCATTCGCTCGAAAGACGATCCTCGAGATCGACAATCTTTTCGTTTACTACTTTTAGCTCACTACGGGCCTGCTCTGCACGGTCGCGAAACGGCCGAACACTCTCCGAGACTAGCTCTCGAAAGTTATTAGCTTTCTGCCGCTTATCTTCTGGCAGGTGTCGAAAAACTACTTGCTCAAGTTCGCGCGAGAACCTCACCGGTCCTTCATCGCTGATCTCAGTGCAAATATTTTCTATATAACTTTGAGGGAGATACTCGACCTTCCTAGGTTTACTATGGTCAGTCTTTTCATCTAGGCCGATAGAAATAACCTCTCCAGAGGCCCAGGTTACTTCTGTTTCAAATTTCGAGGCAAGGTTTTGCGGTCGTTTGCAAAATCGATCCCGATTTAGAAAAGAAAACTCTTCCTCCGCGTGCGAGTTGCCAGCCAGAGCCGCACAGTCTAAAAGAGCGCTTTTGCCAGATCCCTTGTTACCAATGATAGCTACAAGGCCAGGGTTAACAGGGAGTTCGACATCAAACCATCCCGCATTTTCAGTGCTGGACTTCGGGCGAATAGCGATTTGATCAATGTAACGTGATGGGTTGCTTGCTAGGTCCTGAAGCTTGGGTGGCGTTGTACCGACGTACGAGCGAGCGGAAGGCTCATTGACACACTGACGAAGACCAGAAAATGTGAGCTCGCAGCACAGCCATGCGTCTCGTCCGAGAGGAAACTGGAAAAGATCCTCGGTCTTGTGAGAATCAGACGTAGGAATGCATAATTTGAGATCACGGAACTCTTCTCGGAATTGTTCTTTGCTTGGGGATTTTTCCCCAATTGCCCACCGGCGGGTGTTCTCATTGCTACTAAAAAGAAG
GTGTGACGCTTGAAGAAGACGCTTCCGAGTAGAGTGCGCCTGCCCATGCCAGCTAAGTCTCGAAAGATCCTCGTCAGGTGGTACTGCGATTAGGTATCGTCCCGAAAAGATGCTGGGGCTTTGTTGGAGAGCGCTCTGGACGTCACTTAAAGCGACGCTTGCCTGTTGCATTCCAATTTTGAGGGCGTCTTCCTTCGAGAAGGTCTCATGTTGGCTTTGTAGTAATTCGCCGTACTGGGCCAACATTTTCTTTTTTAAGGGCCAGCGCTCGGTTGGTGACTCGGGACCTGTTTCTACGTCAAATTTTAGTTGGTTAAGGAAGCGGTCCTCGATGTCATCCTCACAAACTTCTTCAGAAAAAATGACATGCAGATTGATACGCCTGCCTTCAATTAACTGGTCTAGTCGAAGCTCAACGTTGGGCAGCAGCAGAAGCTCGCACCCGTAAACAATGTATTCTTCGGCGCAAATGTCTGCAAGCGGAGTAGGGTCGTTTTGAAGACGCTTCAGTTTCCTAAAGCCTTCAATCGTAAAGTAGTCGGTGACGCCAATTACAGAAACTTTATTGTGATATGCTTTCTCAAAGAGTGCTGCAGCGTAAGCAATAAAATCTGTCCCAAACTGGTTATTAAGGTAAGAAAATGGGGTGTGAACTTGGAGGTCCCATTTGTGCCATTGAGAGCCGCGTCCTTTGCTGCTAGCGCGAATATCTGACATCGTTTCTACCTGTATTTTTTCCTACGATCTTACTGGAACAGCCCACCTTATGATGTCCCCGACTGTCGAGTCGGCCCGAGGAACCTCCCCCCCGAGACGCTTCAAGAAACGTGCGTGACATTCTCGCATCAAACGGCTCAGATCATCGATCCACTCGGCCAGCATACCTGTAAAATGGCTTTGTCGTACTCACTGGATCCTCCTGATGTACGCTTGTCGCAGTGCCCACGACCCGATGACGTGACCCCTTCGCTCCACGGCCATTACAGCCGCCTCGTCACTACTACGGATCACTCCGCCCCTGCGCCTCGCATCGGTACTCGAGACCTCGGGATGATTGTCCCTCGGTCGGCTCCCTTGGCATCGAGAAAACAGGTTACCACGTTCTATAACCAAGCCTGGAAAACCGTCACGCCGCATATACACCGGACGCCGCGCCCAGCCAATCATCAGGCACTCGCTGGGCTCGTCCTGGGCCCTTGACACGGCCACGGTACTTATATCCTTTAGATGCCTCATCAGCGGTTCACTCTCGTTCGTCTCGGTGATCCACACCTGACAGGTCGAGCCTGCCTTTTCCCCAACGCTCACCACCACTACTCTTAACCGCAGCAGCTTGGGGTGGTTTGTGCATCCTCCTGAAAGGCGGCACCGAGGGGCCTACCCTCATCTTGGATACAGCATGCAGCCGTCACGTCCCTGCCCACGGCTGCTTCGTGGCACACGTCAAGCGGTGTTTATCAAAATGGTTATCGCCTCAAGCGATTTCAGCCGCCTTGAGATGGCACGCATCCCAGCGTACCATGCGGTCGAATGCCGGAACAGGGTGCGAGGTGCCGAAGGGGATGACTCAACAGGATACACTGGATGTCGCCGAGTCAATCGAGGCATTTCTTGCGCGCTGGCAGGGCGTGACCGGCACTGAGCAGGCCAACTCCCAACTTTTTCTCACCGAGCTCTGCCACGTCTTGGGCCTACCTACCCCGGAGCCTGCTGGGGATGATCCGCAGCTCAACGCCTATGTCTTCGAGCGCCGGCTCCATGAGCACGCCCCGGATGGTTCGACGCGTACTCGGCGGATTGATCTCTACCGACGCGGCTGTTTTGTTCTTGAGTCTAAAAAACTGCGCCAGGGTGAGCACACTGCCGGCTGGGACAAAGCACTTACTCGAGCCTTTCAGCAAGCCGAGGGCTATGCCCGAGCCTTACCAGCATGGGAGGGACGACCACCTTTCTTGTTAATAGTAGATGTGGGTCGGGTGCTCGAGTTTTATTCCGAGTTT
TCCCGCAGTGGCGGTAACTATGTACCGTATCCGAATCCCAATCAGCACAGAATACGTCTAGAGGATCTTCGAGATCCGGAGATACGAGAGCGTCTTCGTCTCGTTTGGCTCGATCCAGACGCCTTAGACCTATCCAAACATGCGGCGCGTGTTACCCAGCGAATTAGTGGCGACTTGGCAAAGCTCGCCCGTTCTTTGGAGAAGGAAGGGTATGAGGTTGAAAGAGTCTCTAGGTTCCTGATGCGGGCGATGTTCACCATGTTCGCTGAGGACATTGGCCTCATACCTAATGGCCGCTATCAGGAAATGCTTAGGGAGGTGCGTCAAAGGCCTGAGGTGTTCCCGGATGCGATGCGCGCTTTGTGGGAACGTATGCGTGATGGTGGCTTTGAATCTCGCTGGTTGGTACGTATTCCGCAGTTCAACGGGAATCTTTTTGATGAGATCAATCCCATACCTCTAAATGAAGAGCAAATCGACTTACTAATCAAAGCGGCCGCCCACGATTGGACCGAGGTGGAACCGGCGATATTCGGCACATTGCTGGAGCGAGCTTTAGATCCAAAGGAGAGGCACAAGCTCGGGGCACATTACACACCTCGCTCTTATGTCGAACGCCTGGTAATGCCGACCGTCATAGAACCGTTGCGTGAAGAGTGGGCCGATGTGCAGGCGCAGGCTGCGATGCTCATCTCTCAGGGCAAAGAAAAGGACGCCCTTGCGGCTGTGGAACGCTTCTTCGGTAGGCTTCTAGAGGTCCGAGTTCTTGATCCGGCATGCGGCTCGGGCAACTTCCTTTATGTCAGCATGGAGGCCCTTAAGCGTCTTGAGGGTGAGGTCTTAGAGATGCTCTCCGAGCTTCGAGGTGGGCAAGCCTGGTTTGATGTCGAAGGTATGACAGTAGACCCTCACCAATTCCTCGGCCTCGAGATAAATCCATGGGCAGCACATATTACAGAAATGGTGCTCTGGATCGGCTATGTGCAGTGGCACTACCGCATACATGGCCGTGTGGACAACCTACCTGAACCCATTCTGCGCCGGTTTCACAACATCGAGCACCGAGATGCGCTGATCGAGTACGACGATGTGGAGCCCATGCTCGATGAACACGGTGAGCCGGTCACGATCTGGGACGGCAGCACCAGACCCTCGCCGGTTACGGGTGAGCCGATTCCCGACGAGACCTGCCGCAGAGCAGTCTACCGCTATATCAATCCCCGTCGCGCTGCATGGCCGGAGGCTGACTATATCGTCGGCAACCCGCCGTTCATTGGCGCTTCGACCATGCGCCGTGCCTTAGGAGATGGCTACGTGGATGCTGTACGTGTTGTCTACAAAGGTGAGGTGCCGGAGTCAGCGGATTTCGTCATGTACTGGTGGCACCTAGCAGCTGGAAAAGTGCGACAGGGGCAGGCTCAGAGATTCGGCTTTATCACCACTAACAGCTTGCGGCAGACTTTCAACCGGCGTGTGTTGGAACCGCACTTGAATGACGCCAAAAGGCCCCTGTCCCTGCTATTTGCCATTCCTGATCACCCTTGGGTCGATAGCAGTGATGGAGCGGCGGTAAGGATCGCCATGACCGTTGGTGCAGCTGGTGAGCAGGCCGGGGTGCTGCGGCAGGTGACGCAAGAGCAGGAAGCCGATGATGAGGCCCGGAACGTGAATCTGTCCCGCAGGGAAGGCCGCGTGTTTGCAGATCTAACCATTGGGGCGAATGTGGCGGGGGCAGTATTGTTGCACGCCAATCGCAACGTGAGCCAAAGGGGATTCGAACTGGGCAACTCAGGCTTTATTGTTACACCTGAAGAAGCCAAATCATTGCAATACGACGACCATTTACCTATCGCAGAAATCATCCGTGCATATCGAAATGGTCGAGATCTCACTCAAAGCCCACGCGGGGTCATGGTTATCGACCTGTTTGGGTGGTCTCTCGATGAAGTCCGTAAACGATACCCCGGTGTGTATCAATGGATATTGGAGCGGGTTAAACC
CGAACGTGAGCACAATCGTGATCCGAGGCTCAGAGAGCGGTGGTGGTTGCATCGTCGTTTGCGTGAGGATTTGAGAATCATGCTGAGCGGCGTTAACCGCTACATCGCAACTGTCGAAACCACCAAGCACCGCGTTTTCACGTTCCTAGATGCCAACATCCTGCCGGATAACATGTTGATAAATATTGCCATTGATGACGCTTGGTGCCTTGGTATTCTTTCCTCTCGGCTGCATGTGACATGGACGCTTGCTGCAGGAGGAACCCTAGAAGATCGTCCCCGTTACAATAAAACCCGCTGCTTTGAAACCTTCCCATTCCCCGATGCGAAATCGGGGGGCAGCGCTAATATCCGCGACTTGGCAGAACGCCTAGATGGCCACCGCAAACGCCAACAGGCAGAGCATCCTGACATTACCCTGACCGGCATATACAACGTCATGGAGAAGGTTCGCGCCGGAGAGAAGCTGACCAAGAAGGAGCGCACGATACACGAGCAGGGTTTGGTCTCCGTGCTCGCAGAGCTTCACGAAGAGCTTGACCGAGCGGTATTTGCCGCCTATGGCTGGGATGATCTTGCGGACTCTCTCATTGGCCGCCCAGGTGCAACTACGCCGCGCTCGGACAAGCCCGAGGATCAGGAAAAGGCTGAGCAGGAGCTTTTGCAGCGCCTCGTGGATCTCAATGCGAAGCGGGCTGCTGAGGAGGCGCGCGGGCATGTGCGCTGGCTGCGCCCCGAGTTCCAAGCGCCGGACGAGGCGCATAAGGAGACGAGCGAGGTCGATCATAAGCCGGGTCAGGCTGAGGTAGTCGCTGAGCCTGTTGCGGAAACCAGCGGCAAAAAACATTCCTTCCCCAAAGACACCGGCAGGCGTATAAGAGCAGTTCGCAATGCCCTCGCCGAAGGACCTCAGACCACGGAGAGCATCGCTGCACGCTACAGCCACCGCCCGCGCAAGGCAGTCCGGGCGGCACTCGAAGCGCTCGAGGCCATTGGCATGGCAAGCCACGAGGGAGAGTTGTGGTGGGAAACATAGGCTTGACATTTGCTTGGAAGTAAAGAGGATTTATAAATCACCATGGCAGTACCCGACTACCAAAGCATTATGCTCCCATTGCTGCGCCTAGCAGAAGATGGACAAGAGCACTCCTTGCGGGCAGCTATTCAGTCCCTTGCCGATAACTTCGAGCTTCAGGATTCGGAACGCAAGGAACTTCTACCGAGCGGGCGGCAGTTCAAATTTGACAACCGTGTTGGCTGGGCACGCACGTATTTGAAAAAAGCAGGACTCTTGGATGTTGTTGGCTGCCAAAATCCGTACATATTTGGCGGAGGATTCAAGAACACGCATAACTACTGGGTAGCTAAAAGCTAAGACTAACTACCACGCTCTGTGTCGATTTCTGCTTTCGCTTGTCCCTTCTTTTCCTTATAAACCAGTAGCTGGCTCTGTGGGCCCTGTGGGCGAGAGCCCTGAGAGTGTGGGCAGGCGGTGGGCAACCCGCAGGGTTGTCCACGGGCTGTCCACACGGCCCGTAGGGCTCAGGGCGGCGCGAAGCGCTCGTCCACAAGTCCACAGAGCTATTGCTTAAGCAGCTAAAAGTACTGGAGATTTGCTTATAAGAGGGTGAGAATCGGTAGGCAAGTAGTAAGGGCGACGCGACATCAGGTAACCAGAGGCTGACAACCCTCCTGATGCCGCGCCCACCGCGTCCTGCGACACGGCACTTATCAGTATCCTCGGGTTTAGCGCCGTTGTCCAGTGTTTGCAATTTACCTGGATAAGCGCCCCGATGAGCCAGCCTAAAGCATTGACCCGCCGACTTTTCTTTTGCGATTGCTGCAATCGACAGGTCATGATCTGCAGTCAATGTGATCGCGGCAATCGCTACTGTAGCCAGCAATGCGCAACTGCTGCCCGCCGCCAGTCCCTGCGCGAGGCGGGTAGGCGTTATCAAGACTCTAGGCGAGGCAAAGCCAAGCATGCAGAGCGCCAGAGT
CGTTATCGAGCCAGGGCCCGGACTCGACCCCTATCCCTATCCCTAAGCCGAGACAAAAAAGTGACGCATCAGGGTTCCACACCTGTTGATAGTGATGCTTCACTTAAGCTCTCGCAGCCAGATAACCAAGCTCGAGCACGGCCTGTTCAGCGGCCCGATTATCAGCAAGCGCGCCGACTCACAGAGAGCGCCCCGCGCTGTGACTTCTGCGCTCGCCCGTGCTCGGTATTTGTCCGCACAAATACCCTGCGCCGCTATTCTCGCCGCGAGCCGTTGCGCCCCTCGGGTCCCTGAGATCCGGCGAGAATTGCCTTGCGCTAAGCCCGAGGCGCATGTTTCTGCGCTCATCTCGCCGAGCATATGGAGTGTAAGGATATGGCTATATCCAAAGAGCTTGAGGCGCAGATCATGCGCTATCACTACGCCGAGCACTGGCGCGTCGGCACCATTTCCCGACAACTCAATGTCCACACCGATGTCGTCCATAGGGTGCTGGCAAAGGCCGGTATCCCTAGTGCACAACGCACTAGGCGAGAATCGATCATTGATCCCTATGTACCCTGGATCGAGCAGACCCTGGCAGATTACCCCAAAATACCAGCTAGCCGGCTCTATGACATGGCCCGCGAGCGTGGTTATTCGGGCGGCCCTGATCACTTTCGCCACCTGATAAGCCAGTACCGGCCGAAACCGGTCGCTGAGGCGTATATGCGCCTGCGTACCCTGCCCGGGGAGCAGGCACAGGTCGATTGGGGCCATTTCGGCAAACTGCCCATAGGTCGGGCTAAGCGCCCACTGATGGCATTTGTCATGGTCCTGAGCTATTCGCGCTGGATCTTTCTGCGCTTCTACCTCGGCTCATCGACGGCCAACTTTTTGCGTGGCCACGTCGCCGCCTTCGAGGCCTGGCAGGGGGTGTGCAGGTGCCTGCTCTACGATAACCTCAAAAGCGCGGTGCTGGAGCGCTACGCCGAACAGATCCGCTTTAATCCGCAGCTACTCGATCTTAGCGCCCACTACGGCTTTGAGCCACGGCCAGTAGCCGTAGCCCGAGGCAACGAAAAGGGCCGTGTCGAGCGGGCAATCCGCTATGTGCGTTCCAGCTTCTGGCCCGGTAGGCAGTTTAACTGCCTTGATGATCTGAACGAGCAGGCGCAGCACTGGTGCGTAAGCATTGCCGCCGAGCGGCGCTGCGATCAGGAGCAAAAGAGCAGTGTTCGCCAGGCATTTGCCGAGGAGCAGCCTTACCTACGTGCCTTACCCAGCGAGCCATTTCCTTGCTATGAGAACGTGCCTGTAAAGGTAGGCAAAACCCCTTATGTGCGCTTTGACCTTAACGACTACTCGGTGCCGCACGAATATGTCCGCAAAACGCTCAGCGTCAGCGCAACTCTGGAGAGGATCCAGGTGCTTGATGGCGAGAATGTTATAGCCTCGCATCCACGCAGCTATGATCGCCATGCTCAAATAGAGGACCCGCGCCATATCGATAAGCTAGCCCAAGAGAAGCAAGCTGCTCGACTACATCGTGGCACCGACCGGCTTAGCAGCTCTGTACCGCGCGCCAAGGAGTTTCTCTCGCAAGCGGCAACGCGCACCAATAGCCTCGGCAGCGTCACAGCTGCGCTACTTCGTCTGCTCGATCACTACGGCGCCAGCGAGCTCGATGCCGCCATTGAGCACGCTCTCGAGCGTGGTGTACCCCACCCACATGCCCTCAGCCAGATCCTCGAACAACGCCGCGATCAAACCCCTGGCCCGCCATCGCTGCCTCTGCGCTTACCCGAGCAGCTACGTCAACGTGAGCCAAGCATCCGGTTGCGCGGGCTCGATGGCTACGACGCCCTAACCCCTAACTATGAGAAAGACGACAATGACCCCGAAAACACCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_AP017372|1511921:1555736|1513385_1514321_+|WP_096409495.1|DBSCAN-SWA MTRPSEHEQLSAEKAQLESFLADVPQESAIERIALEDRLESVRVRLEELATNSSEAARVSLTFSGKPVLNSEGILADFSAAALQKFEQMVATVGAYLRTGELRSSGPLPARQEHRLMITGTTVGSFGFHLEERSEPEQPEMGLEDSSVRKALDRVTRVIEGALESEEALAEEVSDLDSRSVRSLHEFMNTVASRDAWFRLEAVGRRVEFPDMEHLSRTVEWLQEDNIHEEEAELRGTLEGYLPHTRTFELAEDELGLVKGKIAPAIDDPHDLQPYLKKRVTLSVRKTALGGGKPRFALVSPPSSPDEDADV >NZ_AP017372|1511921:1555736|1523965_1524925_-|WP_096409509.1|DBSCAN-SWA MSALTQFNVSQTFNVPAHPGLEVPGYSDPTHPNIPPLKSSYVFRHRLLGDVLAFLHHSAGDGLFLCGPTGSGKSTLISQIAARLNWPVQSVTCHGRLEFQSLIGQFVLVKGSTDFVHGPLAVAARDGHILILNELDMMDPAELAGLNDIIEGQPLVIAENGGEVIHPHPQFRLIATGNSLGSGDTTGLYQGVLRQNVAFMDRFRVIHVEYPDPEAEKEMLQAAVPSLPEEIVSRMLSVAGEVRRLFMGSDSEAGQLTVTMSTRTLVRWATLAATFKGAPNVFEYSLNQALTARAEAEQREAIHRIAADVFGDYWQSSNC >NZ_AP017372|1511921:1555736|1532575_1532944_+|WP_096410365.1|DBSCAN-SWA MHQPIPTTAKAQPSTGFTNKQTTNDTAAPGSTQDTPYTPAPVHTEQLHSANLANSPHNDDIDPTTLIQAVIEHRQIHQLTQEQLATELGINISTLRDWEQGRRQPKGPSQALLRLFITSSRS >NZ_AP017372|1511921:1555736|1549405_1552879_+|WP_096409533.1|DBSCAN-SWA 
MTQQDTLDVAESIEAFLARWQGVTGTEQANSQLFLTELCHVLGLPTPEPAGDDPQLNAYVFERRLHEHAPDGSTRTRRIDLYRRGCFVLESKKLRQGEHTAGWDKALTRAFQQAEGYARALPAWEGRPPFLLIVDVGRVLEFYSEFSRSGGNYVPYPNPNQHRIRLEDLRDPEIRERLRLVWLDPDALDLSKHAARVTQRISGDLAKLARSLEKEGYEVERVSRFLMRAMFTMFAEDIGLIPNGRYQEMLREVRQRPEVFPDAMRALWERMRDGGFESRWLVRIPQFNGNLFDEINPIPLNEEQIDLLIKAAAHDWTEVEPAIFGTLLERALDPKERHKLGAHYTPRSYVERLVMPTVIEPLREEWADVQAQAAMLISQGKEKDALAAVERFFGRLLEVRVLDPACGSGNFLYVSMEALKRLEGEVLEMLSELRGGQAWFDVEGMTVDPHQFLGLEINPWAAHITEMVLWIGYVQWHYRIHGRVDNLPEPILRRFHNIEHRDALIEYDDVEPMLDEHGEPVTIWDGSTRPSPVTGEPIPDETCRRAVYRYINPRRAAWPEADYIVGNPPFIGASTMRRALGDGYVDAVRVVYKGEVPESADFVMYWWHLAAGKVRQGQAQRFGFITTNSLRQTFNRRVLEPHLNDAKRPLSLLFAIPDHPWVDSSDGAAVRIAMTVGAAGEQAGVLRQVTQEQEADDEARNVNLSRREGRVFADLTIGANVAGAVLLHANRNVSQRGFELGNSGFIVTPEEAKSLQYDDHLPIAEIIRAYRNGRDLTQSPRGVMVIDLFGWSLDEVRKRYPGVYQWILERVKPEREHNRDPRLRERWWLHRRLREDLRIMLSGVNRYIATVETTKHRVFTFLDANILPDNMLINIAIDDAWCLGILSSRLHVTWTLAAGGTLEDRPRYNKTRCFETFPFPDAKSGGSANIRDLAERLDGHRKRQQAEHPDITLTGIYNVMEKVRAGEKLTKKERTIHEQGLVSVLAELHEELDRAVFAAYGWDDLADSLIGRPGATTPRSDKPEDQEKAEQELLQRLVDLNAKRAAEEARGHVRWLRPEFQAPDEAHKETSEVDHKPGQAEVVAEPVAETSGKKHSFPKDTGRRIRAVRNALAEGPQTTESIAARYSHRPRKAVRAALEALEAIGMASHEGELWWET >NZ_AP017372|1511921:1555736|1518202_1518487_+|WP_096409500.1|DBSCAN-SWA MKLLPPIALAIIVFIQGCGGSNSEEYDSKLIYNFVASCSQQGQASVEQCGCMMDEIRKNVSQDKFMEYETNMLSGGKIPDEFNKIISSARSVCR >NZ_AP017372|1511921:1555736|1520958_1522731_-|WP_096409506.1|DBSCAN-SWA MRLSTLNDALPIVAAAYGRKFGVKVRVGGQNAFTDGQSINIPGLSEEAQSRTLAYGFLAHEAGHVRFTDFTQARHPQPLGKLLEGVIEDIRIEAAMITTYPGTRKTLDAVLEHLIEAGRMQPPSDQDPPGQVLGNAVLLIGRYHYRRQSALQMHSQQSEEVLSQVFGQQFVRQLHGLLAEIPGLTCTAETMALAQRIISLLESLSQGQSPEQANAADQPDNQPADDHAADDQASGKDDSQEGQLASGEDCAGASADEGADAEASAATDADSGDEGHHEQPADSTLQQGSTSQQGCSSEQDSSLAAQAAQAALQADKDALPEDLFEAVGNILQSHSSDDTQLLPTVQSYQGDAELGKEALERVKVHSAHLNAQLQGLVQSQRLTRSRTARSGRRLSAKHLHRAGVADSRIFRTSRAQPAPNTALHLLIDLSLSMQGGPDRLALDAAMSLALALESINGVSRAVSVFPGLRGQSSYVTQVLAHDDRVSTRTGAFVQKARGSTPMTGALWFAAADLLARSEPRKVVLVLTDGEPNDTDSTLSMISRAQSAQLEMIGIGIQHRVDHLFPQAIRIDQLSELKNSLFDVTGQLLSH 
>NZ_AP017372|1511921:1555736|1535782_1536376_-|WP_096409520.1|DBSCAN-SWA MANPQSIQGAGAKAKGELRECLGQFKQSIIAVAAFSFFINLLMLVPPLFMLQMFDRVLSSGSVETLVMLLIVAVGLLIVLGLLEFSRNRVLVRAGGRLDQMLSSRLFDATFLRALRRPDSATAQPLQDLTTLRQFMTGQGVFAFFDAPCSYVLKPHLDTAHGLALIAVPHWLRANGVSFGCDIEFMTRYEAPGSYLG >NZ_AP017372|1511921:1555736|1538749_1539121_+|WP_096409525.1|DBSCAN-SWA MYSAREDILSCIDQALFFRQRKATEYPNDCRNQLSVAELQRLRVYVSELPEAHDLFIKWDTAWAKVEDEAIYFFEEVLESTGERTDIFARYGFHGREDPESFCERLAARLDDFALRGEEGGTA >NZ_AP017372|1511921:1555736|1542361_1542655_+|WP_096409492.1|integrase|DBSCAN-SWA MERSHRKSGSLEDAELEALRRVANVRIKAIIDIAYLTALRKGDLLNLRLSDLYDDWIQCRVGKTGQEVRIGWSDALREAVSRARKLARRGGFEPPTF >NZ_AP017372|1511921:1555736|1511921_1512215_+|WP_096409492.1|integrase|DBSCAN-SWA MERSHRKSGSLEDAELEALRRVANVRIKAIIDIAYLTALRKGDLLNLRLSDLYDDWIQCRVGKTGQEVRIGWSDALREAVSRARKLARRGGFEPPTF >NZ_AP017372|1511921:1555736|1520338_1520836_-|WP_096409505.1|DBSCAN-SWA MIHAKLQTGEKSGTYVVSEPVTGDELLEIAQRIARRRLAKGRSLSCPSSAYQALQTQLLGKDHEVFGVVYLDTKHRILGVEELFRGTIDGATIHPREVLKEALRRQAAAVILVHNHPSGHLEPSVADERITERLRQALELIDVRLIDHIIVSAEGYTSLAERGGL >NZ_AP017372|1511921:1555736|1552921_1553218_+|WP_096409534.1|DBSCAN-SWA MAVPDYQSIMLPLLRLAEDGQEHSLRAAIQSLADNFELQDSERKELLPSGRQFKFDNRVGWARTYLKKAGLLDVVGCQNPYIFGGGFKNTHNYWVAKS >NZ_AP017372|1511921:1555736|1536536_1536953_-|WP_162549426.1|DBSCAN-SWA MASHPAGEGAPADDTVMALEHIAARSEIAGHRQQISGSEGSEEQKKEERHDCVSSRAASYLISDIRIQVDGDQLRLGLLGYQLPAEPGSYEASEPVAGLALSRTQAHEVLRMLGEQAKAARWGLPRSLNWMRRLLSKA >NZ_AP017372|1511921:1555736|1515383_1516754_+|WP_096409498.1|transposase|DBSCAN-SWA MGETIPLFEPSFNKSLRVETRPERLSSEAGVMLQREAFERTGIISWMSARLEDSRDQSRVKHSLSELLRDSLTMMGQGWDDQRDASTLGNDPMLAACGSDGRGESVIDRGLASQPTISRLLDMLAHEHNLDVLNKALLKLVEQRIRSRNKGRRARTMTIDVDGVPLQAHGQQPGSAFSGHTGQRQHYPLFALCAETGDMIGVLLRPGNAGPASEAAALVPRLVEYLRTHVAEQVRVRLDAGFADANTLAALDAHKIEFIARLPRNQALERHFEPHRYRCGRPSKREREWTREISYQAGTWDHPRRVVMVIRERPGELFRDAFYLVTNVCGSQRFVRSHYQRRGKAEAHFGEFKDVVGESLPCTCRGKATEQQVLARSQALLGLRALAYQLLHILRESMERATGDGWTLRRLREQVLKSAVRVLRHARKLQVILERRCAKHWRALLRRLPKEYPAPG 
>NZ_AP017372|1511921:1555736|1554218_1555736_+|WP_096409535.1|transposase|DBSCAN-SWA MAISKELEAQIMRYHYAEHWRVGTISRQLNVHTDVVHRVLAKAGIPSAQRTRRESIIDPYVPWIEQTLADYPKIPASRLYDMARERGYSGGPDHFRHLISQYRPKPVAEAYMRLRTLPGEQAQVDWGHFGKLPIGRAKRPLMAFVMVLSYSRWIFLRFYLGSSTANFLRGHVAAFEAWQGVCRCLLYDNLKSAVLERYAEQIRFNPQLLDLSAHYGFEPRPVAVARGNEKGRVERAIRYVRSSFWPGRQFNCLDDLNEQAQHWCVSIAAERRCDQEQKSSVRQAFAEEQPYLRALPSEPFPCYENVPVKVGKTPYVRFDLNDYSVPHEYVRKTLSVSATLERIQVLDGENVIASHPRSYDRHAQIEDPRHIDKLAQEKQAARLHRGTDRLSSSVPRAKEFLSQAATRTNSLGSVTAALLRLLDHYGASELDAAIEHALERGVPHPHALSQILEQRRDQTPGPPSLPLRLPEQLRQREPSIRLRGLDGYDALTPNYEKDDNDPENT >NZ_AP017372|1511921:1555736|1535276_1535534_-|WP_096409518.1|DBSCAN-SWA MSQSERQQRLRKKRQQQGQVRLELTVDAAIRDSLDNFKEELGLGSRSEVIDALVRSYTGSDQALDCVEERRNIDIRDADIPNHQV >NZ_AP017372|1511921:1555736|1541149_1541302_-|WP_162549248.1|DBSCAN-SWA MSQRERVNAPSGLQVKSELVMRQNHRACEKLFVEYAVQAAAVIDLAAFDE >NZ_AP017372|1511921:1555736|1519605_1519977_+|WP_096409503.1|DBSCAN-SWA MTANQLVVRTTVISCFLFIGGCGQQLPTCDSNDTIEIITDLISEDVASGRAGVGPTDVNVRIDDIELIDDGETRRRCEVTYLASNIPIEEVDESHSRIMTYDIYQEGDDNIWEVVDISYERFK >NZ_AP017372|1511921:1555736|1527081_1527441_-|WP_162549422.1|DBSCAN-SWA MYSCPKCWSRDVEERRLGQTVGAVTGASAGLASGITGAIHGSRMGATAAAAGGPTGTASGALIGGLLGGVAGGLSGARIGAQLDGPVLDTHICMACGYRFTLPIRSSAGGEEVRHDGSH >NZ_AP017372|1511921:1555736|1537726_1538488_+|WP_162549427.1|DBSCAN-SWA MPQLRRLYPAAIALSSAVCTGSLSAAERFTGGYIGVEGDLLSSVTFERDFDDGFAGEVVEDFVEYEDPTDFLENDDQQEGFASALMYTSHEAGIEEVVPGGSVILGGGMQQDGFYSGIEARYHFGGVDEEFEDDLEESLEFEDGYSISARFGTLLREGNVMLYGSLGYATREVTYEVFEDDENSDSNDHSGFRAGIGLEYRPDYPPIFVRLETSRTDFGDETYEDEDDGNEFDFADIDDLVEHAAHLGVGYRF >NZ_AP017372|1511921:1555736|1545467_1548560_-|WP_096409531.1|DBSCAN-SWA 
MSDIRASSKGRGSQWHKWDLQVHTPFSYLNNQFGTDFIAYAAALFEKAYHNKVSVIGVTDYFTIEGFRKLKRLQNDPTPLADICAEEYIVYGCELLLLPNVELRLDQLIEGRRINLHVIFSEEVCEDDIEDRFLNQLKFDVETGPESPTERWPLKKKMLAQYGELLQSQHETFSKEDALKIGMQQASVALSDVQSALQQSPSIFSGRYLIAVPPDEDLSRLSWHGQAHSTRKRLLQASHLLFSSNENTRRWAIGEKSPSKEQFREEFRDLKLCIPTSDSHKTEDLFQFPLGRDAWLCCELTFSGLRQCVNEPSARSYVGTTPPKLQDLASNPSRYIDQIAIRPKSSTENAGWFDVELPVNPGLVAIIGNKGSGKSALLDCAALAGNSHAEEEFSFLNRDRFCKRPQNLASKFETEVTWASGEVISIGLDEKTDHSKPRKVEYLPQSYIENICTEISDEGPVRFSRELEQVVFRHLPEDKRQKANNFRELVSESVRPFRDRAEQARSELKVVNEKIVDLEDRLSSEWIARLKYLENQRLEELREAWQGKPRLSVKSRAAINATDKQVEEHAARLRKRARDLDARLKRVRRIEKAWRQYSESAIRLSRVGENVKAHIENTIQKYCEDFYSVDLDPESFISLSIQWDKTWDKQEQANRALERVHVILGSEKGEGLLAWCARCNEELQALQRDQSAIEREFDSELANRARWKKRVWGLLNDEESGLRKLRREISVIQDGTLDQRLEDARRERARITQRIAEAIENERKELEELYQPVAQRLWQEELSVDEVPLNFAASIIDVGLVDGFLSYINQGRRGTFSGSEEGRQKASGLVEECFDNTAGTLTKVAEALEQALRADLREDEGHQPTRNPDDQIKGSASRHELYNFLYGLEKFAAIYEIRWGEKRIERLSPGERGVLLLVFHLVLSPEGTPLLIDQPEGNLDNETIYNTLVSAIKSAAEHRQVIIVTHNPNLAIVCDANQIIVAEIVTDQNNTVVYKSGSIESVSIAASSVDILEGTWPAFDVRGSKYGLHR >NZ_AP017372|1511921:1555736|1540859_1541060_-|WP_096409528.1|DBSCAN-SWA MAEEETQQRTVTIDGTEYKIDEMSENARQQLINLRVADQEIERLNRQLAITRTARQAYARALQGSL >NZ_AP017372|1511921:1555736|1518804_1519131_+|WP_162549421.1|DBSCAN-SWA MGAVATATVVVAALIIIAAWVWFNYHLQQAFEQISANSRHFSPGLVWLNLIPFFNIAWTAVLVVMLDRGYRKEYPSQAHEVLGHIICQDLTGHIPAYEITRGAPRTTA >NZ_AP017372|1511921:1555736|1531638_1532322_+|WP_162549425.1|DBSCAN-SWA MLASTKDPLQYTALHYDSEFNGLPVDQGQGPLIARHLGLLEDLLQRARNAHDNSLAVRFDLHCPNTIAISATLNQGNGLVSLFWSNLYEQLFNAQPAAPFDLHFAWTREHDPHTGQQATYKSLILLNARAFHGLYSNEQAQDIGTSDSLAGCILRAWAKSLRISDPPPPELVSFPTDPLSGKTQVMLLNRYDHNAWRELFTQSSSLCKYEGKPLGRVFCAFRTSNRQ >NZ_AP017372|1511921:1555736|1526063_1527098_-|WP_096409511.1|DBSCAN-SWA 
MTVATEGVVCLESRGQRSRAPEHWGQAHRLVETRDLDRATWLSIRRGGIGASEAAAVIGMHPYSSPLEVWLDKSGRQVEQSAESVDPQNPAWWGQLLEPTVADAYAQVTGRKVRRVNAILQHPEHPWMLCNLDREVVGDPDIQILEIKTTGVMEARRWRNGVPQHVQLQVQHQLAVTGQRAADVAVLIGGQNLEIHRIERDEDVIERLIALEQAFWSCVEADTPPRPDGSESSAQALRELYQESHTGTVVDFQEDAELGRSFAQLQRVREEIAERRRREDVLKQSIQAAMGEAEKAIFPTGTVTWKQSAPSNRVDTKALEADYPELVARYKQQVPGSRRFTIRD >NZ_AP017372|1511921:1555736|1542210_1542405_+|WP_162549420.1|DBSCAN-SWA MLLTDREIRQLTGRKRFAAQCRALGRMGLPHDRDPDGRPLVLREVVMERLGGEKPSQEWEPGRC >NZ_AP017372|1511921:1555736|1528206_1528920_+|WP_162549423.1|DBSCAN-SWA MKTQPRYHPLNRNWRLHYEPTYQGLPVMCERHDHSPLVENFLEKTLTLFQRTREQHPRTFALRFDLYFPADFDLAAINHGSDIMKTFWRYLDSQFNTASLAHSPKTEYIWVREIGPQSDKPHFHVLLMLDANAILNLGNPAPSPDGTYSDNTLAHRIIRAWLGALAYPALDPLGSLVYFQKDSHTGEFIRWLLDRSDDYNWTRLFYLTSYLFKTYSKPVGQGIRAFGSSRLYRRTPA >NZ_AP017372|1511921:1555736|1522820_1523828_-|WP_096409508.1|DBSCAN-SWA MINITEITDQLTLVVLDIRIWSGRKKLRAEDLHLDDGEIPPEDLVSLGSKRVCDPEQLKAFHRLKQSAERSCLRVGTRFLGGFAVAKEQTTALAEELDQIKAQFDAERDNFLSTYDRDLEDWIQTLPGFETAIRRAVEPASSVAARLRFGYQLVEIKPAIVPGTLDEEVAELGDGVFGEVEQMARDLTDSFEGKDKLNRRALGTFRKIRNKLACLSFIDRRIEPVLGSVDQWLHRLPATAPIQGSLFNEGMGLALLLSDAERMARHGAGQIDALSLPLPTLPTDKSAEQIDSSDAGLEDSGADTTPEQEGENSGDSTPSHRPVITDPQPVPSFFF >NZ_AP017372|1511921:1555736|1525012_1525966_-|WP_179948759.1|DBSCAN-SWA MIKGLAITPPVLGRITIGRVVERDGKRLPQMDDQFTITSQIQSAGDWIIHPLDTQLREQQGEAKLRRLPVRMLFSEPNLNLRASYALFDRDQGRPLCMGDGETCRRRTKEGLVELPCPSPDGCELAAGGRCKPFARLSVLIGEQEDPLGSFIFRTTGFNSIRTLEARLSYFQAISCNRLAYLPLELRLRGKSTRQSFGRPVFYVDLTLREGWTLEETLAEADRLQQEREAMGFDQKALDRAAKQCLANGAFEESPEEGSAIAEEFYSAQSSATQPQESNSLSEQLSKIPQPQPASLNEQNRPASAKQGGNGSQPPSS >NZ_AP017372|1511921:1555736|1543824_1544760_+|WP_096409530.1|DBSCAN-SWA MTRPSEHEQLSAEKAQLESFLADVPQESAIERIALEDRLESVRVRLEELATNSSEAARVSLTFSGQPVLNSEGILADFGAAALQKFEQMVATVGAYLRTGELRSSGPLPARQEHRLMITGTTVGSFGFHLEERSEPEQPEMGLEDSSVRKALDRVTRVIEGALESEEALAEEVSDLDSRSVRSLHEFMNTVASRDAWFRLEAVGRRVEFPDMEHLSRTVEWLQEDNIHEEEAELRGTLEGYLPHTRTFELAEDELGLVKGKIAPAIDDPHDLQPYLKKRVTLSVRKTALGGGKPRFALVSPPSSPDEDADV 
>NZ_AP017372|1511921:1555736|1541458_1541917_-|WP_162549419.1|DBSCAN-SWA MNHKDRKGQEHLPSYAEQVSVVLRTLRALFNMSQSDLAKASQVSRPTINRVETLRDVDKVRTSTLEALLAVFRRLGVQVQLDEGGLVLSLPMESLMTAIKSGRSQADKEDIVALMVELNRNIRLGDPPALTQEEIRKLKYSSKGSRSKESEE >NZ_AP017372|1511921:1555736|1529732_1530488_-|WP_162549424.1|DBSCAN-SWA MDGLHSKLYIGDSSVIVGSANLSNNGLSLSGNEELCVRLSQESLLEEASRLFWEYLQKAKKSYPDERSKRDRLEELREKHRKAIVNQLIHVPLIRAKKSIFDYDPDYDGMVYIAWYVDGEFDYTDAIPQPVQDDIKKEVHLAPTDMRPTNQWILTWKQTSRGYADRRTKPDWVYVHKVFSDACDDEEYEMMCIQCASLTVPQEPFDAKDKVFVDAFWEVIDQPEFEGLRGIENGVWRLRDNQSLMREFLNF >NZ_AP017372|1511921:1555736|1540566_1540863_+|WP_096409527.1|DBSCAN-SWA MIGYHHRNPYQGYPLLEQQDDVELVRIDPEKNVWRFYYLWLAPDFFCSVRLVRFWGRISTSGGQHRVEPFDDIEQARDALAKIVNQKLRRGYRYKGAL >NZ_AP017372|1511921:1555736|1519057_1519381_-|WP_096409502.1|transposase|DBSCAN-SWA MEDLGHVLDALCARYDIEHRLTPPRRPQTNGLVERFNGRVNEILRTTHFDSAADLDSMLWHDRRLYNHHIPQRALGHITPVQKLKRWYEEHPELFRKLVYDQSGLDR >NZ_AP017372|1511921:1555736|1533829_1535188_+|WP_096409517.1|DBSCAN-SWA MARKHQPRWYKRARKVIQNQHSRWQLIDPHHPSGLEFDPYKVETARLIALVDLEELVHKLLKIPNDGKSIWDEFQPSSAASSSKGLPFMPHEDSLIAQAQRIFKALYQPTIWGIEPLLQEWKPLNINPLIMKADDELRPHTQHHATMQNQRGRWFPSSKESVSELLERLQKSSHSLRKFAEDPSIRNRLRSARKRSKKIDEYLRDCFRCNPKPRLMIRVDLGAHELRPHHFTPEGRIWLNPDNLQLHAETLEELIMRRILARHKPQRDWLNRWTGYLVKRQYDFDKGYYWHLIMFFDPNTVGSHPATMENDLSEAWKEVTREQDLTGLCWGANRDIKARTPERGSLLLPKSKNLPIRELAFYLGDLDYIAHVHRWDNRHNLRHPQSPSKKSAPKRSRSTVKNSAKQPKKTLIEMGSERLSPLDTGNEQSQGQGFETGSEKGDDSPQDPPGNA >NZ_AP017372|1511921:1555736|1539285_1540362_+|WP_096409526.1|transposase|DBSCAN-SWA MISASDRETAVKLIDEARANGARLEPACRELGITPRTYQRWKRVDEGSGVKADQRPLTPRPTPPNALTTEEKAAILEACHRPEHADLPPAQIVARLADEEGIYIASESSFYRVLRANAEQRHRGRAKAPIRRKRPTSYRADQPNTVWSWDITWIPGPAKGIFLYLFMIIDIYSRKIVGWEVHENETGAAAAALLEQTVLAEGCLTRPLVLHSDNGSPLKGATMLETMRRLQIETSFSRPRVSNDNPYSEALFRTCKYVPSFPSRGFSGLKDARTWVANFVQWYNHHHRHSSIKYVTPQQRHLGLDQEILEERQKLYEKAKERNPQRWSGETRDWSPVGPAWLNPELDSQGVQDKELSK |
36 | Rhizobium_phage(16.67%) | transposase,integrase | attL 1526332:1526346|attR 1549191:1549205 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1627203 : 1637991
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP017372|1627203:1637991|DBSCAN-SWA GCTATCCAGCCGGGAGTAGCTGCTTCTCGGCGAGCCACTGCGAATTAAATAGCCGTGAAGAGTAGCGCGCCCCGCCATCACAGAGCACCGTCACAATTGTATGCCCTGGCCCAAGCTCTTTGGCTAATCGCAGCGCTGCACTAACGTTAACCCCGCTCGAACTGCCTACGAACAAACCCTCTTGGTACAGTAGCCGATAGACCATCGGCACAGACTCTGTATCCGGGATACTGTATGCAGTATCTATCGGGGTGTCCTGGAGATTGGCGGTTACCCTACTACTGCCTATCCCCTCGGTGATCGAACTGCCGGAGCTGGAACTCGGAGTGCCGTGATAGACGTAGTTATATAGGGAACTACCGGCCGGGTCAGCGAGTACAATGCGCACCTTGTCCGAGCGCTCTTTGAGGGCACGCGCAACCCCGCCCAGCGTTCCTCCGGTACCGGTAGCAGCGACAAAGGCATCTATCTGACCGGCGCATTGCTCCCATATCTCGGGCCCGGTTGTAGAGTAATGGGCATCACGGTTGGCGGTATTATCGAACTGATTTGCCCATATCGCCCCCTCCATCTCTGCAGCCATCCGCCCGGCTATCTTCTGATAGTTGTTGGGGTCACTATAAGGAGCGGGAGGCACCAGGTGGACCTCAGCGCCAAGGGTCTTCAGCAGGTCGATCTTCTCTTGGGTTTGAGTCTCGGGGATTACAATGATACAGCGGTAACCGCGGGCGTTGCAGATATGCGCCAAACCAATGCCGGTATTGCCGGCGGTGCCCTCCACTACTATCCCACCGGGGCGCAGCTGACCAGCCCTTTCGGCGTTATTTATTATCCCCCAAGCGGCCCTATCCTTAACCGAGCCGCCGGGGTTTAGCCACTCTGCCTTTGCGAGTATTTCGCACCCTGTCTCGGCACTTAATGCAGACAACCGAATCAGCGGGGTCCTACCTATCGCCCCAACAAAGCCATCGCAGATGTCCATACCAAACCCCTTATCTGATGCCAGTATTGATATATATCATACTGGCATCAGACTAGGTTACGCTACTAATAGATGGCAAAGCCGATATTAATTCACGGCCTCTCCGATGGGGGTACCTCAAGGTCGGGCTGATCGGACCCTTCTTTCTCTGGCGGGATCAGATCCTGCTTACTCACACCGAGAGCCAGGGAGGTAGCACTGGCAATGAAGATCGACGAGTAAGAACCGACTAGCACCCCTACCAGCAAGGCAACAGCGAACCCGCGAATGATCTCACCGCCTAAAAAGATGAGGGCACATACCACTAGCAAGGTGGTAAACGAGGTCACCAAGGTGCGCGAGAGGGTCTGGTTTATCGCCATATTCATGACGAACTCTGTCGTCCCCTTGCGAACCTTGAGGAAGTTCTCGCGGATACGATCAGAGACAACAATAGTGTCATTGAGCGAATAGCCGATAGTTGCCAGCAGTGCCGCTAGCACCGTCAAGTCGAAGGTCATCCGGGTCAGCGAGAAGAAGCCTATGATGATCACTATATCGTGCAGCAGGGCTGCTACGGCACCTACCGCAAAGCGGTACTCGAAGCGAAAGGCGACGTAGATCAAGACACCAAGCAGGGCCAAACCTAGGGCCATCAAACCCTGCTCAGTTAGCTCCTCGCCGATCTGCGGACCGACAAACTCCACTCGGCGCAAATCGACACCATCATGAGCAGCGCTGAGCTTATCGAAGATCTGCTCGGATATTTGCGCCTGCTCAACGCCATCCATAGGCGGAACACGGATCATGATCTCGCGCGGTGAGCCGAAGTGCATTACCTGAGCATCTTCGAACTCAGTCTCGGCGAGTATCGCCTCCACTTCGGATACCTCAATCGACTCCTCGTAACCTACTTCGACTAGAGTCCCGCCGGTAAAGTCTATTCCCAGGTTCAACCCCTG
CCAAAGTAAGGAGCCGATAGAGATCACCAACAGCAGCGCCGAAAGGCCAAAGCCGATGCGCCGATGGCCGAGGAAATTGAAATTTGGATCCTGTTTAAAGATCTCCATAGTTAGATCTTAAGCTCCGTGACCTTGCGCCCACCATAGATCAGATTGATCACAGCGCGCGTGCCAACAATGGCTGTAAACATGGAGGTTGCCAAACCGATAGCGAGGGTGACCGCAAAGCCCTGCACTGGACCTGTTCCAAATATAAACAGCATTGCAGCGGCTATGAGCGTAGTCACGTTGGCATCGGCGATGGTTGAAAGTGCTTTTGAATAGCCACGGTCAATGGCTGTCTGAATTGGCGTTCCGGCCTTCAACTCTTCGCGGATGCGCTGGAATATGAGCACGTTAGCGTCTACAGCCATACCGACGGTCAGGACGATACCAGCAATGCCCGGCAATGTCAGTGTAGCCTGCAACATAGATAGTACGGCTACTATGATTATCAGGTTGGTGAACAGCGCAAGATTGGCAACTAATCCAAACACCCGGTAATAGACGGCCATGAAGGCGACGACCAGGAAGAACCCGATGACAACGGCCATAAAGCCGCGCTCGACATTCTCGGCACCCAAGCTCGGGCCGATAGTCCGCTCTTCAACAATATCTATAGGGGCAGCCAAGGCACCAGCCCGCAGCAGCAATGCCAGATTACTCGCCTCTTCGCTTGAGTCTATCCCGGTGATGCGGAATCGCCCACCCAAGGCTTCGCGGATCCGAGCAACGCTGATCACCTCTTCTATCTCGCGGCTAACCCGCTGCGGCTCACCATCGACCAGTTCCGTCTCGGTAACCCGCTCAATGAACAGAACCCCCATATGATCGCCGACACGCTCACGGGTAGCCCGGTTCATGATCCTGCCGCCAGTACCGGAGAGGCGGATGTTTACCTCGGGTGAACCGGTCTGCTGATCATAACCTGATGAGGCATCGATGATCCGCTCTCCGGTAACAATGTAATCACGCTCGAGCAGGACATAATCACCAGCATGTTCAACATTGCGCGTGGGGAAGAGTTCCGAGCCCGCTGGTACGGCATCAGGGTCATCCGGGTCGCCAGTAAACGGGAAGTCGCTGTGATCTACGAAGCGGAACTCAAGGGTGGCAGTAGCACCAAGTATCTCTTTCGCCTGCGCTGGGTCTTGCACCCCAGGCAACTGTACCACTATCCGATCAGATCCCTGGCGCTGTACAATCGGTTCAGCCACACCCAGCTCATCAACCCGGTTGCGCAGCGTAGTGACGTTCTGCTGCACAGCGAGGTTGATCATCTCCTGCTGATCATCTTCGTCAAGCTTGACGATGAGCAGATAATCGCCATTGCTCGGCTGCTTATCTATCTCTATTCCTTCAACCTGCTCGCTGATTAACCCTTCGGCGCGGGCGCGCAAATCCTCGCTGGTGAAGATCAGCCGCAGATCCTGCTCATCACTCACCTCAATGCTGGAATAGCGGATTCGCTCTTCGCGCATGAGGGTGCGGATATCCCTCTCGTAACCCTCAAGGGTGTCGTTGATCATCGCATCGAGATCGACCTGGAGCAGGAAATGGACCCCGCCACGCAGATCCAGCCCCAAGAACATCGGCTCAGCCCCTATGGCACGCAAGAACTGCGGCGTGGCCGGAACCAAGGTCATGGCTACCGAATAGTCGCGGCCGAGTTTGGCAGAGATCATGTCAGAGGCGCGCATCTGCCGATCCGTAGAGTCGTAGCGCAACACCCAGCGCTCTTCTAGGACCTCATGCACCGCCGGGCTCAAGCCTTCCTCTTCAAGGAAAGAGAGAACCTCGGACTGGGTCTCCTCGGCAGGGATCTCACCATCCTCGGTAGTTATTACCACCGAAGGGTCTTCACCAAAGAGGTTCGGCAGGGCGAATATTACCCCTACCGTCAGGACGGTGAGAACTAATATGTATTTCCAGAGTGGATAGCGGTTCAAGAGCTACTTGCC
CTTCTTGTTTTCCAGTTCCTTGATCGTGCCCTTAGGCATCACGTTGGCCACCGCATGCTTCTGGATCTGCACCTGCAGACCGTTAGCTACCTCTACTGTGGCAAAGTTCTCGCCGACCTGAGTAATGCGCCCGAGCAAACCACCGTTGGTAACCACCTCATCACCCTTGGTCAAATCCTCGACCATTTGGCGGTGCTCTTTAGCCCGCTTACTCTGCGGACGTATCAGCAGGAAATAAAACAGCGCGATCATGCCCACCAGTAGCATGATGCTAAACCATATATCACCCTGCTGAGCCTCATTCTGCGCCCAGGCACTGGAAATCAAGAAATCCATATAGCTAGCTCCGTTATATTACAAAATTCTCTTATAGCTTACCTAAACAGCGCCCTATGATTGCTGTTCGGTCTCTGTCTCCCGCGAGTCATCGCCGCTGCGCGCCTTATCCAGCGTTCCTTGAGGTAACACCTCACCAACCATCTCGCGCTTAATGCTCCACTCTACCCCCTGCGCAACCTCCAAGCGAACGAAATCATCACTAACTCGGGTGATGCGACCAAGCTCGCCCCCCGCGGTGACAACCTCATCGCCCACTTGTAAGCCCTCAACCATTTCCTGATGTTCCTTCATCCTTTTGCGCTGCGGACGAATTAGCAGAAAATAGAAAATAGCAATCAGGAGCGCCGTGAAGACCAGGAAAAAGACCAGGTCACCCGTGCCCGGCTCCTGTTGCGCATATGCCGGTGCAATGAAAAAATCCAGCAACATCAACCCACTCCCTTATTCAATAGCCTCATGATCGAGTCACCTCTGGGCTCCCGCCGAACAGCTCAGAGGTGCATTATGTCATACGCGCGGGGGCTTTTGCTCGCGTGCAGTATAGAAATCATCAACGAAGGATGCCAGCCTACCCTCACTAATCGCTGCACGCAGCCCGGCCATTAGAGTCTGGTAATAGCGCAGATTATGCAATGTTGCCAACCTTGAACCGAGCATCTCGTTGCAGCGTTGCAGATGACTTAGATACGCCCGTGAAAAATGCCTGCAAGTATAACAATCACATGCCGGATCGACAGCCCGCGTATCCTGCCTATAGCGGCTGTTGCGCAAGCGCAATAGCCCTTGTGAGGTAAACAGAAAGCCGTTGCGCGCATTGCGCGTCGGCATCACACAGTCGAACATATCTATACCGCGGCGCACACACTCTACCAAATCTTCCGGCTTGCCAACCCCCATCAAGTAACGCGGCTGCTTAGCAGGTAACTGTGGTTCGAGGTGCTCGAGCACGGCATCGCGCTCAGCGCTTGGCTCACCGACTGAGAGGCCGCCGATGGCATAGCCATCAAAGCCTATATCCAGCAAACCTGCCAACGAAGCGCTACGCAACTGCGGATATACCCCACCCTGCACAATACCAAAAAGTGCTGCTGCGCTCTCACCATGGGCATCACGACTGCGCTGCGCCCAGCGCAAAGACAGCTCCATGGAACGCTTCACCTCATCCCACTGCGCCGGATAGGCCGGACATTCATCAAATATCATGACAATGTCTGAACCGAGCTCCCGCTGCACAGCCAGCGACTCTTCAGGGCCCATAAAGACCTTAGCCCCATCAACCGGCGAGCGAAACTCGACACCGGCCTCACTAACCCGGCGCAACTTGCCCAGACTGTAGACCTGAAAACCTCCCGAGTCGGTCAATATCGGCCCCGACCAGCCCATAAAGCTGTGCAAGCCGCCATGCTGGCGAATGATCTCGGTGCCTGGTCGCAGCCAAAGATGGAATGTATTGGCTAAAATGACCTCGGCACCCAGCCCGCTAACCTCCTCGGGCGTCAAGCCTTTGACCGTCCCGTAGGTGCCAACCGGCATGAACGCAGGTGTCTCGATCGTCCCGCTCGGGAGTCTCACCCTGCCCAGTCGTGCTGCGCCATCATTCGCGAGGAGTTCAAAAATGGCCCTGCGGCCAACGCCATTAACACTCAAACCGGTATATAGCT
CCCTGCCTCGCTAAATCCACTCGACTACACCGGTAGCCAAATCGTACTTTGCCCCGATGACTTGCAACCGCCCTTGTTGAATAGCTCGCTCAAGTAGCTCAGACTTAGCACTCAACCGCTCGGCTGTGGAGCGTACATTGGCTATTGTCGCCTCCTCAATGATCTCGGCATCTGCCGTTGAGGTGTCGCCAAGCAGAGGCCGAACCGAGGGCTGCAACTCATCGATGATCTCCTGCAAGGCATCTGAAGGCGCCCTGCTTTGGCTACGCAAGGAATCAACCGTTGCCCCTATTGCGCCACAGCAAGAATGCCCCAAGACTACAATCAGCGGCACAGATAAGACCTCTGCAGCGTATTCAAGGCTGCCGATCTGGCCCCTGCCAGCGTAATTGCCAGCCACCCTGACTACGAACAGATCGCCTACTCCCTGGTCGAACACCTCTTCGGCCGGCACCCTTGAGTCTGCGCAGCCGAGGATTGCGGCAAAAGGCTGTTGCTTGGCAGCTACCTCCAAGCGCCGCTGATGGGTCATACGCTCATGCAAGCAAAGCTCGCCGTGCACAAACCGCTGATTGCCTTCTTTAAGTCTTGCCAAGGCAGCAGATGGAGTTAACATGTAGAATATTCCTCATGCATCAACGGGGGGAGGAGGACGATGTTACCACTTTTTGGCTTATCAGCATTACTAGCTGATGTGGGCTTGGTGAAATTAGCAAAGCCAATTTTTGATGCTCCCTCTGGCCGTTAAACCTAAAACCGTTCAGAATATGCGCTTATAAGTCAGCCGAGGAGTATCACGTTTGCCAAGCCGGGCACAGCTGAGCATAGTCGAGAGAGTTCTGGCTACTGACGTAACAGCGGTCATCGTCCTAGATGAAGATGGCGCGCTGGTATACGCCAACAATATGGCCTACCGGATGCTGGGCATCTATCTGCCCGATGGCTCCACTACTGAAGACCGTGCAGACACAGCTGACGATAATGACACCGCTGAACTCACGCAGCAGTTGCCCGACGAAATAGGCTCCTTTCGCTGGATTGCCAGCACCGGCACCCCGTTGCGTGATGTTAGGTTAACCCTCAAGCAATCCTCCGCGGGCCAGCGGGTATTCTCTATAAATGCGTCTCCCTTGCCAAGAAATGCTCATGAGACGCCGTGTGTCGCCTTATCTATACACGACTTTACCGAGCAGTATAAGTACTACGAAGAGGCCCTGCACAAGAGCCAGGAACGTCTGCAGCTGGCTACCGAAGCGGCCGGTATCGGGATCTGGGAGCGCGACCTTACCAACGATACGTTCCATTGGGATGAGCGCATACCCACCATCTACGGTATAAAATATGCCGATCTGCCAAGCAATTATGAGCAATGGCGCAAGATAGTGCTCGCAGAAGACTTGCCCGCACTAGAGCGCTCCTTCCGCCAAGCGAGGATCAACCACACGCGCTTCGAGACCGATTTCCGCATCCACGCCGGGAGCGGCGAAATACGCCACATCAGGGGCTTCGGCCAGTACATATATGACAACTCTGGAAACGCAATAAAACTTGTCGGAGTGAACGAAGACATTACCGAGCGCAAGCAAGTTGAGCATGAGCTTGCCCAGAGCAAGGAGAGATTGGAGGAGGCACAGCGAATAGCCCGCCTAGGCCACTGGATCGCCTCAACCGATAAAGGCCTGCTCGAGGTGGACCTGTGGTGGTCCCCGGTTATATACGAGATCTTTGGTCGTAACCCGGAAAGCTTTAAACCCGATTTTAAATCCTACCTCAGGGCCGTACATCCAGAAGACCGGGGAGAGGTAATAGAGCTACTGCGCCGCCTGCGCGCTGATCACGATCACAGGAGTGAGCATCGGATAATTCGCACCGACGGCGTCACCAAGTGGGTGCGTGCAATCGCTCGGACTGAAGATCAGTCTCCGGTTAACGCTCGCCGAATACTCGGCACAATCCAGGACATAACCGAGCAAAGAGAACTCCAACAAGAGTTGGAGTAC
CGCGCCTCCCATGATCCTCTTACAGACTTATTCAATCGTGCTAAACTCCAGCAGCAACTGCGGGCCGCGCAAGCAGCCTACGAACGGCATGGAACACCTTTCGCGCTAGTGATCGCCGACGTTGATCACTTTAAAGCAGTAAATGATCGTTGGGGACATTCGGTAGGAGATGCTGTGCTCTGGGAAATAGGCCGCCGCATGTCGAGCCAATTACGTGAAACCGATTTTCTCGGCCGCTGGGGAGGCGAAGAATTTCTTATTCTAGCCAACCACACCGATGAGTCTGGTGCAGCGCAGCTTGCCGATCGCATTCGTGAGTCTATATGCGGCTCTGAAATTCAGGGCATAGGCACCATAACCACCAGCTTTGGAGTGGCAGCTACAGAGCCAGGCACCTACCTAGAGGTGCTGGAAAACCGTGCCGATAAGGCTCTCTACGCCGCCAAGGAGCGAGGCCGCAACACCGTAATGAAGTATTCGCAGCTCTGACACGAAATTTCATCGCAACAAAAGGCGCACACCCGCCATGATAAACAGCCCTTCCCCCGAGGACTGTGAACGCATAGCCAGCCGCATGGATGGTTTTTTGTATCGCTGTAGATACGATGAAGAGCTAACCATGCTCTTTATCATCGGCGCAGTCCAGGCAACTACCGGCTATCCGCGCGAATCCTTACTAAACAATTGTGAACTGTCATACGCCCACTTAATCAACCCTGACGACCAGGCCATGTGTCGTGGCTCTCGCATCCGAGCAATAATGGACGATAGCGAATGGAGTATCGACTATCGCATTTGGCACCCGCAGGAGGGCTGGCGCTGGGTAAATGAGCGCGGCGCGGCGCTGCTCGATGACAACGGTGAAGTCGCCTATCTTGAGGGTGCTATAGTTGATATCTCCGCCCGCAAGGAGGCCGAACAAGCCCTGGCCAGGACAACTCACGCCCTGCGCGAACGCTTCAAGGAGGTGGCCTGCCTCGGTGAGGTGGCTGCACTCACCAATAAAGATATATCCATAGACGAAACGCTTACTGCGATAGTCGGCCTTCTGCCGAGCGGCTGGCAGTACCCAAGCCGCTGTGAGGCAATCATAAATTACCGCGGCAAGCGGTTTGCAACCAGCCGACGAGAACCTCACGAGACGCGCCTTGAGGCCCCGATTAGGGCAAGCGGTGAAAATGTCGGCCAGATCACCGTATGGTATGTCTTACCGCCCCCTGATGCCAATCAGCCATTTCTTAACGAAGAGAGCGTCCTCTTAGAGCGTACCGCGATCATTATTGGCCAGTACCTAACCCGCAAGCACGCCGAACGTGAGCGCGAGCAGCTGATGCGGCTAATCGAGGCAAGCCCTGACTACATCGGCATGAGCAATCCAAAGGGGATGATCCTCTATCAGAACCCTGCTTTATCAAAACTAACCGGCTTCGACACCGCTGCTGATGACCTATGGTTGAGAGATGCCCACCCGGACTGGGCCGCCTCCAAGCTCTACCAAGAGGCCCTGCCCGCTGCTGCTCAGCATGGCCAATGGCAGGGAGAGACAGAGATATTTGATGGCCAGGGAGGCACGATCCCAGTCTCCCAAACTATAGTAGCCCATACCGACTCTAGGGGTGATGTTGAAATCTTCTCGACCATAATGCAGAACCTATCAGCAAGCCGTGAAGCCAACCGGGTCAAGAGTAACTTCCTTGCTGCGGTAAGCCATGACCTGCGTACCCCTTTAAATGCCATCATAGGCTACTCAGACTTGCTCTGCCGTAGCGAGCTCGATAACAAACAGCGCGAGTTTGTCGCATTGACTCGGCGCGCGAGCGATAAACTGCTCCTGCTCATCGACATGCTGCTCGACCTATCGCGCTTGGAACACGGTAAGCTCAAGCTGCGTCGTGAGGACTTCGACATAGTCGATTTCGTCACCTCTCAGGTAGAGCTTGTTAAACCACGAGCTGAAAATAAGGGGGTAGCGGTCAAGCAGCATATAGCGCCGGAGATAC
CGCGTTACGTAGAAGGAGATGCCGCTCGAGTCGGCCAAATTATCAGTAACCTGCTTGGCAATGCGGTCAAATTTACCGATACCGGGGAAGTGACCATCAAACTCGCGCAGGCCTCCCCAGACTGGATCCACATTCAGGTCAGCGACAACGGGCCTGGCATCCCCGAGGAGGAGCGCCAGAGAATATTCGACTGGTTCTCACAGGGCCATATGGGCACCGAACAGCGCGAAGGCACTGGTTTGGGGCTCAAGATATGCCTGGAATTGGTCAGACTGATGGGAGGGTCAATTGATGTCGACGATAACCCGGCTGGCGGGGCTATATTTACAGTCATCATCCCTCTACCCAGGGCAGCCGGCGAAGATAATCGAAGATCATCGCAGGCCGACTCTGCTGATAACCCGGCCACTGCCCAATCTAGGCAAGTCAAGCCGCCAGAAAGCTCGACGGAAGAACTGCGCAGGACCGACGGCGGGAAACTAAATTTACTTGTTGCCGAAGATGAACCGACCAATGCCCTGCTCTTATTGCGCTTGCTCGAGCAGTATGGGGTGCAGGCTGACTTGGTAAATGATGGCCAACAAGCCTATGAGAAGGGCTCAAGTGGCGATTACGACGGCATATTGATGGATGCCCAAATGCCACTTATGAGTGGGGAAGAGGCGATAGCCGCTATAAGGTCGTACGAGTCTGAACACAGTAAAGAGCCTATCCCTATCATAGTCATCTCTGCGCACGCTATTGAAGAGGTCAAAAATTCCGCGCTTGATGCGGGGGCGGATCAATACATAACTAAACCGATCAGCTTCAATGCCCTGGGTAATGTCCTGAAGAGATTGGCTGGTGTCCAAGCAATCCAGTTATAA
Protein sequences of DBSCAN-SWA_3 >NZ_AP017372|1627203:1637991|1628276_1629233_-|WP_096409594.1|DBSCAN-SWA MEIFKQDPNFNFLGHRRIGFGLSALLLVISIGSLLWQGLNLGIDFTGGTLVEVGYEESIEVSEVEAILAETEFEDAQVMHFGSPREIMIRVPPMDGVEQAQISEQIFDKLSAAHDGVDLRRVEFVGPQIGEELTEQGLMALGLALLGVLIYVAFRFEYRFAVGAVAALLHDIVIIIGFFSLTRMTFDLTVLAALLATIGYSLNDTIVVSDRIRENFLKVRKGTTEFVMNMAINQTLSRTLVTSFTTLLVVCALIFLGGEIIRGFAVALLVGVLVGSYSSIFIASATSLALGVSKQDLIPPEKEGSDQPDLEVPPSERP >NZ_AP017372|1627203:1637991|1635651_1637991_+|WP_096409600.1|DBSCAN-SWA MINSPSPEDCERIASRMDGFLYRCRYDEELTMLFIIGAVQATTGYPRESLLNNCELSYAHLINPDDQAMCRGSRIRAIMDDSEWSIDYRIWHPQEGWRWVNERGAALLDDNGEVAYLEGAIVDISARKEAEQALARTTHALRERFKEVACLGEVAALTNKDISIDETLTAIVGLLPSGWQYPSRCEAIINYRGKRFATSRREPHETRLEAPIRASGENVGQITVWYVLPPPDANQPFLNEESVLLERTAIIIGQYLTRKHAEREREQLMRLIEASPDYIGMSNPKGMILYQNPALSKLTGFDTAADDLWLRDAHPDWAASKLYQEALPAAAQHGQWQGETEIFDGQGGTIPVSQTIVAHTDSRGDVEIFSTIMQNLSASREANRVKSNFLAAVSHDLRTPLNAIIGYSDLLCRSELDNKQREFVALTRRASDKLLLLIDMLLDLSRLEHGKLKLRREDFDIVDFVTSQVELVKPRAENKGVAVKQHIAPEIPRYVEGDAARVGQIISNLLGNAVKFTDTGEVTIKLAQASPDWIHIQVSDNGPGIPEEERQRIFDWFSQGHMGTEQREGTGLGLKICLELVRLMGGSIDVDDNPAGGAIFTVIIPLPRAAGEDNRRSSQADSADNPATAQSRQVKPPESSTEELRRTDGGKLNLLVAEDEPTNALLLLRLLEQYGVQADLVNDGQQAYEKGSSGDYDGILMDAQMPLMSGEEAIAAIRSYESEHSKEPIPIIVISAHAIEEVKNSALDAGADQYITKPISFNALGNVLKRLAGVQAIQL >NZ_AP017372|1627203:1637991|1631515_1631893_-|WP_096409597.1|DBSCAN-SWA MLLDFFIAPAYAQQEPGTGDLVFFLVFTALLIAIFYFLLIRPQRKRMKEHQEMVEGLQVGDEVVTAGGELGRITRVSDDFVRLEVAQGVEWSIKREMVGEVLPQGTLDKARSGDDSRETETEQQS >NZ_AP017372|1627203:1637991|1627203_1628184_-|WP_096409593.1|DBSCAN-SWA MDICDGFVGAIGRTPLIRLSALSAETGCEILAKAEWLNPGGSVKDRAAWGIINNAERAGQLRPGGIVVEGTAGNTGIGLAHICNARGYRCIIVIPETQTQEKIDLLKTLGAEVHLVPPAPYSDPNNYQKIAGRMAAEMEGAIWANQFDNTANRDAHYSTTGPEIWEQCAGQIDAFVAATGTGGTLGGVARALKERSDKVRIVLADPAGSSLYNYVYHGTPSSSSGSSITEGIGSSRVTANLQDTPIDTAYSIPDTESVPMVYRLLYQEGLFVGSSSGVNVSAALRLAKELGPGHTIVTVLCDGGARYSSRLFNSQWLAEKQLLPAG >NZ_AP017372|1627203:1637991|1631116_1631461_-|WP_096409596.1|DBSCAN-SWA 
MDFLISSAWAQNEAQQGDIWFSIMLLVGMIALFYFLLIRPQSKRAKEHRQMVEDLTKGDEVVTNGGLLGRITQVGENFATVEVANGLQVQIQKHAVANVMPKGTIKELENKKGK >NZ_AP017372|1627203:1637991|1633135_1633741_-|WP_096409598.1|DBSCAN-SWA MLTPSAALARLKEGNQRFVHGELCLHERMTHQRRLEVAAKQQPFAAILGCADSRVPAEEVFDQGVGDLFVVRVAGNYAGRGQIGSLEYAAEVLSVPLIVVLGHSCCGAIGATVDSLRSQSRAPSDALQEIIDELQPSVRPLLGDTSTADAEIIEEATIANVRSTAERLSAKSELLERAIQQGRLQVIGAKYDLATGVVEWI >NZ_AP017372|1627203:1637991|1631971_1633081_-|WP_096410370.1|tRNA|DBSCAN-SWA MFELLANDGAARLGRVRLPSGTIETPAFMPVGTYGTVKGLTPEEVSGLGAEVILANTFHLWLRPGTEIIRQHGGLHSFMGWSGPILTDSGGFQVYSLGKLRRVSEAGVEFRSPVDGAKVFMGPEESLAVQRELGSDIVMIFDECPAYPAQWDEVKRSMELSLRWAQRSRDAHGESAAALFGIVQGGVYPQLRSASLAGLLDIGFDGYAIGGLSVGEPSAERDAVLEHLEPQLPAKQPRYLMGVGKPEDLVECVRRGIDMFDCVMPTRNARNGFLFTSQGLLRLRNSRYRQDTRAVDPACDCYTCRHFSRAYLSHLQRCNEMLGSRLATLHNLRYYQTLMAGLRAAISEGRLASFVDDFYTAREQKPPRV >NZ_AP017372|1627203:1637991|1633925_1635614_+|WP_096409599.1|DBSCAN-SWA MPSRAQLSIVERVLATDVTAVIVLDEDGALVYANNMAYRMLGIYLPDGSTTEDRADTADDNDTAELTQQLPDEIGSFRWIASTGTPLRDVRLTLKQSSAGQRVFSINASPLPRNAHETPCVALSIHDFTEQYKYYEEALHKSQERLQLATEAAGIGIWERDLTNDTFHWDERIPTIYGIKYADLPSNYEQWRKIVLAEDLPALERSFRQARINHTRFETDFRIHAGSGEIRHIRGFGQYIYDNSGNAIKLVGVNEDITERKQVEHELAQSKERLEEAQRIARLGHWIASTDKGLLEVDLWWSPVIYEIFGRNPESFKPDFKSYLRAVHPEDRGEVIELLRRLRADHDHRSEHRIIRTDGVTKWVRAIARTEDQSPVNARRILGTIQDITEQRELQQELEYRASHDPLTDLFNRAKLQQQLRAAQAAYERHGTPFALVIADVDHFKAVNDRWGHSVGDAVLWEIGRRMSSQLRETDFLGRWGGEEFLILANHTDESGAAQLADRIRESICGSEIQGIGTITTSFGVAATEPGTYLEVLENRADKALYAAKERGRNTVMKYSQL >NZ_AP017372|1627203:1637991|1629235_1631113_-|WP_096409595.1|DBSCAN-SWA 
MNRYPLWKYILVLTVLTVGVIFALPNLFGEDPSVVITTEDGEIPAEETQSEVLSFLEEEGLSPAVHEVLEERWVLRYDSTDRQMRASDMISAKLGRDYSVAMTLVPATPQFLRAIGAEPMFLGLDLRGGVHFLLQVDLDAMINDTLEGYERDIRTLMREERIRYSSIEVSDEQDLRLIFTSEDLRARAEGLISEQVEGIEIDKQPSNGDYLLIVKLDEDDQQEMINLAVQQNVTTLRNRVDELGVAEPIVQRQGSDRIVVQLPGVQDPAQAKEILGATATLEFRFVDHSDFPFTGDPDDPDAVPAGSELFPTRNVEHAGDYVLLERDYIVTGERIIDASSGYDQQTGSPEVNIRLSGTGGRIMNRATRERVGDHMGVLFIERVTETELVDGEPQRVSREIEEVISVARIREALGGRFRITGIDSSEEASNLALLLRAGALAAPIDIVEERTIGPSLGAENVERGFMAVVIGFFLVVAFMAVYYRVFGLVANLALFTNLIIIVAVLSMLQATLTLPGIAGIVLTVGMAVDANVLIFQRIREELKAGTPIQTAIDRGYSKALSTIADANVTTLIAAAMLFIFGTGPVQGFAVTLAIGLATSMFTAIVGTRAVINLIYGGRKVTELKI |
9 | uncultured_Mediterranean_phage(57.14%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2011382 : 2058504
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_AP017372|2011382:2058504|DBSCAN-SWA GTTATTTAACCACGCGCAGGTTTGGGCGTTTGCCTCCACCAGAGTTGGAACTTGATGAGCCAGGGGCGTCGTCGCCACCATCCGGATCTTGTCCGGCATTGACCTCATGTGTAGCCTCCCCGCTCTGATTAGTCTCTTCCTCCATCTCATCCTCGGCCCCGAACATCATGCCTTGGCCGTTTTCGCGCGCGTAGACCGCCATAACAGCACCCACCGGAATGGTCACCCCACGAGCAACGCCACCAAACCTGGCACTAAAGCGTATAACATCATTACCCATATTGAGGCCTTGAACAGCTCGTGGGGAGACATTCAAAACCAACCTGCCATCTTCTGCATACTCGGTCGGCGCATCAATATCTGAGCGGCTAGCATCAACCAACAGGTAAGGGGTCTTATCGTTATCGGCAATCCACTCGTAGATTGCCCTTACCAAATAAGGACGACTTGAGTTCATAGTTCGCGCTCCTATCTCAAACTACCGGGCCATATTTAGCTCATCAGTAGTTAGGCTTGCCTTGAACCAATTTGTCAAAAACTGCTCCTCGGCATATTTTTTTACCGCGCCAGCTTCATCAGGTAACTCAACTCCCGCTATAGGCAGACGCCATAGCAGCGGCAGAATGGTGATATCCATCACAGTCAATTCATGGCTCAGGAAGTACTGCTGATCTTCAAAAAGCTCTGCTGCTGCAGTTAAGCTCTCGCGCAATGACCTGCGGGCACTCTGCGCTTTGGATTTTGACGATGTCAAGATCCGATCATAGTCCGCATACCAATCACGCTGCATACGGTATACAACCAATCGCGACTTTGCCCGAGAAATCGGATCAACCGGCATCAAGGGTGGATGTGGGTAGCGTTCGTCCAAATACTCGATGATTATGTCTGGCTCGTAAAGGGTAACATCCCTATCCACTAGCGTGGGCACATCTCCATAAGGATTTTGCTCCAACAACTCCACGGCAGGGCAGTCCGGATCAACCTTAATGCGCTCTGCATCCACGGCCTTAACTGCCAAAGCGAGTCGCACGCGATGACAATGAATACACAGCTCTCCAGTGTAGAGGCGGATCACTTCCCTTCTCCTAACGCAAAGGCCCGGGATGCTCTTATGCACCCCGGGCTACAAACAGCCGCCATTGGTTAATGGTCAAGGCGTTTACTGCTTTTTCAGTTCACGCCAATACTCTTTGTACAAGAGATAGAAGACGGTTGTCATAACCAGTAGGAAGAGAATAACCCAAATCCCCATCCTCTCCCTATCTGCCCTTATCGGTTCAGCAGCATAGGCAAGAAACGCTGTCAATTCGGCCGTCATCGCCTCATATTCGCGTCTGCTCATCTCGCCGTCGCCAGACACCTCAATGTCGACTAGATTACCATCATCGTCGTGGACAGGTTTGGGCGTCCCCTGAAGGTGAGCCAGCACGTGCGGCATACTGGTCCCCTCTTGGATCCAGTTATCAAAACCGACGGCAGCGTCTTCATCCTTGAAGTAGGCGTGGAGAAAAGTATATACCCAATCCTCGCCATGCACCCGGGTAGTCATGGACAGATCCGGAGGTTCGATGCCAAACCAGTTCTCCCCATCCTCCGGATCCATGGGCGAGATCATTTGCTCATGATGTTCGATATCGTCATCAAAAATGAGCTTATCCTCGATCCACTCCTCACCCATGCCGGTGTCTTCCGCCACCCGATCGTAGCGCAGATACTCCACCGAGTGGCAGCCCATGCAGTACTGGGCAAAGTGCTCAGCGCCCTTGCGGATTGACTCCCGGTCATGGTGACTGACGTTGGGATCCTTCATGTCGATCCCGCCCTGGGCAGCACCAGCGAACGAGACCGCCAAGAGCGCACCAAGAACGTACCCTAGTCTCTTTTTGCTCATTTGTCGGTCACCCTTTCCGGAACGGGTTT
CGTTTGCTCAAAGCCGAAGTAGGTATAAACCCACAGGAATACGAAGAACCCGAAATAGACCACCGTAAAGATACGCGAAATAAGCATCGGCAACCCGCTTGGTTCCTGCATTCCGGCCCATGTGAGGATCAAGAAAGAGGCTACAAAGACAGCGAGTGCGATCTTGAAACCACGCCCCCGATAACGGATCGAACGCACTTCACTGCGATCAATCCAGGGCAGGAAGAGCAGTATCATCGAACCAGCGAACATCACTACGACGCCCAATGCCTCATTCGGGACGCACTGCAGCATCGCGTACCACGCAGTGAAGTACCAGATCGGATGGATCGTCTCCGGCGTCTGCAGCGGATTGGCCTCTTCCAGGTTCGGGGCCTCAATGAAATAGCCGAAGAACTCTGGAGCAAAGAAGACAATCGCACAGAATATGATCAGGAATATACCCATCCCGACCATGTCTTTAACCGTCACATAAGGATGGAATGCTATCCCATCGACCGGCTTGCCGTTTTTATCTTTGTTCTCTTTGATCTCGACGCCATCCGGGCTACTCGACCCAACATGGTGCAACGCCACTATATGGAGGAAGACCAGCAAGATCAGCACCAGAGGCAAGCCGATAACGTGCAGCGAGAATAGCCTACCGAGGGTTGCCTCGTTAACCAGGAAACCGCCTTGCAGCCAAACTACCAAGTCGTCACCGATAAACGGCACCACGCCGAACAGCGCCACAATGACCTGGGTTGCCCAGTACGATAACTGACCCCAAGGCAGTGCGTAGCCCATGAAGGCCTCGGTCATCATCACCAGATAGATCAGCACCCCGATAATCCAGAGCAGCTCCCGAGGCTTTTTATAGGAGCCGTACATCAGGCAGCGGAACATGTGGAGATAAACGACCACAAAGAACGCCGAGGCACCTACAGTGTGCAGGTAGCGCATCAGCCACCCCCACTCGACGTCACGCATAATGAATTCGACCGACGCAAAGGCGCGATCGACCGACGGCTGATAGTACATCGTCAGCCAGATGCCGGTGATGATCTGAATGACCAGCACCAGGATAGACAGCGATCCGAACAAGTACCAGAAGTTGATATTCTTCGGTACATAGTACTCGGTCATGTGGTAACGCCAAGTACTGGTAAGCGGAAAGCGCTTATCAACCCAGGCTTGTAGTTCGCTTAGTCGGCTAGCGCTTTGCTCACTCATGTCGCCTCGACCTCCTCTTCAGGATCCTCGGGATCCTCACCAATGACCAGCTTATCGTCATCAGTAAAGCGGTGGGGCGGGATCTCCAAGTTTTTCGGGGCCGGATTGCCTGAATAGACACGCCCGGACATGTCGTACATTGCTCCGTGACAGGCGCAGAAGAATCCACCTCGCCAATCGTCATCGAAAGGCTTAGCTTCAACCTCCGGGTGATAGACGACTATACAGCCGAGGTGCGTGCAAACCGGGCTGACAACCAGCACCTCTGGGGAGACGCTGCGGTATAGATTACGAGCGTATTCCGGCTGCTGCTCCGCTTCAGAGTCAGGATCGCGCAGCCTACCCCGCAGGATGTCTTCGTCCTCAAGGTTATCAAGATGTTCCTGAGTCCTGGAGATAACCCACACCGGGCTCCCCTTCCACTCGAACTCAAGACGTTGTCCGTCAGCAAGCTTGCTGACATCTACCTCTATAGGAGCACCTACTGCTTGTGCTTCTACGTTTGGTTTGAGGAAGCCGAGGAAGGGCACGGTTGCAAATACCGCTCCCACCCCGCCAACCACTGTTGCGGCACCGGTCAAAAAACGGCGCCGGCTAGGATCTGGGGCTTCGTTACTCATGACTGTTCCCTACGTGTCGTACCCAAGGGTGTACAAACAGCGCGTAAGCATCTTACATACTGGCTCGGGCGTCAAGACAGTAGCTGTCCATTCCGCCCTAACTCATCGCTGGTGCGTAGCATAAGATATTGACGAGATAACCGAAAGGGGGTTTTCACTACTAAGCAG
GATTAGCTATCTCAATGTATTGATGATCAATATCAAATTTAGCCGCAATATGCTCACCTAAACGTTGAACTCCACCCCTTTCGGTAGCGTGGTGACCGGCCGCAAAGTAGTGAATACCGGCCTCTCGAGCGATATGGGGAATGGGCTCAGATATTTCACCACTAATGAATGCATCAGCACCATAGGCACAGGCCTGCTCAATTAGCCGTTCGGCACCACCACTGCACCATGCGATGCGGCGAATCAGCCCAGGTCCAGAACCGACGTGTAGTGGTGCCTGCCCAAGCACAGATTCAAGGCGCTGACTTAGTTGATTAGCACTAAGTTCTTGCCCCGGCACGCCAAACCACAGCAGGTCAGGGATGCCCTTTATCGAATATCTGGATTCGTTAGTTAGCTCCAGCATGTCCCCGAGGGCAGCATTGTTGCCAACCTCAAGATGAACATCCAGCGGCAGATGATAGGCAAACAGATTCATGCCTGAGCACAACAGCCGGCGAAGTCGTCTTGCCTTATAGCCAACTATTCTGTTTTCTTCGCCTTTCCAAAAGTAACCGTGATGAACCAGGATCGCATCGGCTTTCGCCGCCTCAGCCTTTTCGAGCAAAGAAGCACAAGCGCTTACGCCTGTGACCAGGCGCCCAATCTCCTCACGGCCTTCAACTTGCAGGCCATTTGGCGCATAGTCGGATATACTCTCAACCGATAAGTAGTCATTTAGGTATGCTTCGAGCTGATCTCTATGTAACATTGCTCTGTCCATCCATTATGAGGTATCTGCAAAAATTTTTGGCATCCCCTGTTGCCTTGTTCTTGTTATGGAGGAGACCATGACCCATCCCCTAAAGAAGGCTTGGCGATATTCCAGGCACACAGAACGCCATGGATTAGTTACACGAGGCGCCACAAGGGGGGGATTAGAAGACGCCTCCAGGGCGGAATGCCCGACTCGCACCGCTGCGGTTCTGCAGGGCGCCAAACAGATTTGTGGCGACTCCATAGTGCGGTTATTTGCATGACTCTTCCTGGATCCAAGCAGAAACCTATTCAACCATGGCCGTACACTCACTACGGCGAGTGGATAAGGTTTGTTTTAGGTTATGCGACCCTCGGCATAGCTATAGCCATTGCTCTGGTCTGGCTTAATCCTGGCTGGCTTTACTCAGTAATTCCGCAAGACGATTCCGCCAGGCACGAAGGCTTTAATCGAGACATGCCGGTCGAAACTGACCGAGATATTGCGGCGCGCCCTGCTCAGTTAGGCCAGCCCGTTTCCTATGCCGAATCAGTAGCTAGGGCCGCTCCCGCAGTGGTTAACATCTACTCTGCCCCGAGTGAAACAGAGCAGTTCACGCCACCTGGCTACCGCCACCCGCTGTTGGAGAGATTCTTCGAGCAACCCGGGCATCCCCCAAGGCTTCCTAGGCACCAGGCCAACCTTGGCTCCGGTATTATTATCAGCGAAAACGGCTATATAGTCACCAACCATCATGTGATTAAACAGGCAGAAAGCATCAAAGTTGTTCTTCCTGACAAACGCGAAGCCCGAGCAACAGTCATTGGCGAAGACCCGGAAACTGATCTAGCCCTGCTTAGCATAGACCTCCAAGAACTGCCGGTAATCTCCTTCGGCGATGAGAGTGATGTCCGCGTCGGGGATGTGGTTCTTGCTATCGGCAACCCATTTGGTGTTGGCCAAACAGTAACCAAGGGGATAATCAGCGCGACAGGCCGTGATCAGCTTGGTCTATCTACATTTGAGAGCTTCCTACAGACCGATGCGGCCATAAACGTAGGCAACTCCGGAGGCGCCCTGATAGATGCCCACGGACGCCTGATAGGCATAAATACCGCGTTGTTCGACCGAGGTGGCGGCGGCTCTCACGGCATAGGATTCGCTATCCCTGCAAGCATGGTTCAGTCGGTTATAAGCGACTTCTTCGAACATGGACAGGTCGTGCGCGGCTGGCTGGGAGTCAAGACACAACGCCTAACTCCCC
CTCTAGCTCGTTCCTTCGATTTAGATGAGAGCAAAGGCGTCGTAGTAACCGAAATATCGCCTGGCGGACCGGTTGAAGGTGGCACACTCAAAACAGGCGATGTGCTGACAAAGATCGACGGCACCCAGATTGAGAGTGTGCAAGATTTTCTTAGGGCCACCGGGAGAAGCCCGCCAGGAACCAGAGTTGAAGTCAGTGGCTATCGTGACGGTGACCCATTTAATAAAAAAATCATTTTAGGCAAGAATCCTAAATCTGACGCACGATAATAGAACCCCTCTAGAAAAATCCCCTGATCCATTATCTGCCCGATGTGGAGGACGCCGTGAATCCATCTCTGGAGGCTTCATGGCGCCATCCCTGGCGCCATGAAACCTCCAAATCGGGTAGATAAGGCTCAGGGGGAATCTTTTAGAGGCGTAGAGGCGCTCAATAGGAGGATTTCGGCTCTTGCAGATCCCGTCAATAATCCCCTGCAGACAAACCTGAGAGAGCGCCCAAGAAGAGATCATTCTCTTGCGGTGTGCCGACGGTTACCCGGAGGCAATTACTAAGCCGCGGATGGCTTCCATCAAGTGATTTGACCAACACGCCCTCGGCCAACAAATGCTTATAGGCCCTTGACGCTGGAATCGCTTTAAGCCGAAAGGTTATAAAATTGGTCTCGCTCGGCAGCACCCGCTCTACCACGGGCATATGCCGCAGCTCTTCAGCCAACCTAGATCGCTCGCCAAGCACTCGTTTAATCGCCTCATCGAGCTGCTGACGATGCTCCAGGGCAAACTTTGCGCTTACCTGGGCTAGCACTCCAACATTGTAAGGAAGACGGCACTTTTCTAGCTGCTCTACCCACTCGGGGTGGGCGATCAGAACCCCTACCCGCAGCCCAGCCAAGCCGACCTTGGAGAGCGTACGCAGCAGCAGGACGTTGGGGTGGTCAAGCAAGCGCGGCAAGAAACTCGAATCAGCGTAGGCGTAGTAAGCCTCATCAACGACCACCACTCCCTCCGTTGATAAGGCCAAGGAGGATATCTCATCCAGATCCAAACCATTACCAGTAGGGTTATTAGGGTGGGCAACATAGGTAACAGCCGGTTGATGCTTGCCTACAGCGGCCTGCATAGCCGCTAAATCAAGGCCGTAATCATCTGCCAGATCAATCTCCACAAACTCAGATCCACTCAATGTGGCGATTACCCGATACATGGCAAAGCTAGGTGCTGGCGCCATAACTGTTCGGCCATGCCCGGCAATAGCTAGATTGATCAGCTGAATCAGCTCATCAGAACCATTGCCTAGCAGCAGCTCTGCACAGGCAGGAACTTCAAAGACCTCGCGGACTACTTTTTTAAGATCGCCGCAATTAGGGTCCGGATATCTGTTTAGCGCAACTTCGCTAAGTCGATTTAGCCACTCATCGCGCAGATCTCCGGGCCAAGGCCAGGGGCTCTCCATAGCGTCCAGCTTTATCGCATCGCCAGGATCGGCGACCTTATAGCCGACAAGATCCTGTATTTCGGGACGCACCCACCGCCTTACCTGCGCAGATAACTCACTCAATTGCCCTCTCCGTGACCGGCTTTTCTAAACTCTGCTGCCTTAGCATGTGCCCCTAACCCTTCACCATGGGCAATCACCGCGGCACTCTCGGCCAACTTAGCGGCCCCCTCAGGCGAACAAAAAAGTGTCGTTGACCGCTTCTGGAAATCATAGACGCCAAGAGGTGAAGCAAAGCGGGCAGTTCGTGAGGTCGGTAGAGTATGGTTAGGACCAGCGCAATAATCCCCTAGTGACTCTGCAGTGTGATGGCCGAGAAAGATTGCTCCAGCGTGGCGAATGCCACCAACTAGCCGCTGAGGTTCGGCAACCGAAAGTTCCAGATGTTCTGGCGCGACCCGGTTGGCTACATCTTGCGCCTCGGCCAAATCGCGGACACAGATCAGGGCACCGCGTTTAGCAAGTGACTCACGGACAATATCGGAACGTTCTAGCTCCGGCAG
CATGCGCTCCATAGCCGCTTGGACCTGGTCCAGGTAGACAAAATCAGGACAGACCAGCAAGGCCTGAGCATGCTCATCATGTTCGGCCTGCGAAAACAGGTCCATAGCGATCCATTCGGGGTCAGCCTGTCCATCGCTAATAATCAGAACCTCGGATGGACCTGCAATCATATCGATCCCGACCACCCCGTAGACTCTCCGTTTTGCTTCTGCTACGTAGGCATTCCCCGGGCCCACGATCTTATCCACTGACGGGACCGTTTGGGTGCCATAAGCCAATGCCGCAACCGCCTGTGCCCCACCTAAGGTAAATATACGATCCACCCCGGCAATCTTAGCCGCTGCCAATACCAGGGATGACAACTCGCCATTCGGGGCCGGCACAGTCATAACGATTTCTTCGACCCCGGCGACACGGGCCGGTACAGCATTCATAAGCACTGAGGATGGGTATGCAGCTTTACCGCCCGGAACATATACTCCCACTCTGTCTATGGCAGTAACCTGCTGACCGAGCAGATTGCCGTCTTCATCCTCATACTCCCAGGAATCAATGCGCTGACGAGTAGCATAGGCGCGAATCCGCGTTGCGGCGTGTTCAAGGGCACTCCGCGACTCAGGGCAAATTTCTTGAGCAGCCGCCTCTATCCGCTCGCGGGAGACTTCCAGCTCAGCAACAGAGCCAGCATCAAGGCCATCAAAGGTACGTGTATACTCGAGTAGCGCCTTGTCGCCTCCTGTGCGCACCCCTTCAACCACCTCTGCCACCCGCTGCTCGAGCTCTGCCCCAGGGTGACTCTCCCAGCCGACCAGTTGCTCCAGGCTCTCCCAGAAATCCGGCTGGGTTGTACTAAGGCGTTTAATATCTACCATTGTTGCGTTGCCCCCTGCAGCCCATCATTTGTTATCAGCTGCTGCATCAGCAGCAGTGGCCAAATCTTCGATCAAGGCCCCGATTCGCTGATGCTCCATCTTCAATGAAGCCTTATTAACCACTAACCGTGAGCTTATCTCAGTGATAGTTTCCAAGGGTTCAAGACCGTTAGCGCGGAGAGTATTACCTGTATCGACCAGATCTACGATGAGATCTGCCATGCCCACTAGCGGCGCCAACTCCATAGAACCGTAAAGCTTAATGATCTCGGCCTGCCTGCCCTGAGCAGCAAAATGCTGCCGCGCCAATTCCACAAATTTTGTCGCAACTCGTGGTCGAGGTATATCCAAGCTTGACCCTGTTGGCCCGGCAACCATCAGACGACAACGTGCTATACCTAGGTCAACTGGGTCGTAAATGCTTTCGCCGCGGCTCTCAAGCAAAACATCTCGGCCAGCCACGCCTAAATCAGCCCCGCCATGCTCAACATAGGTTGGCACGTCAGAGGCGCGCACTATCAGCAGGCGCAGGTCATCTCGGTTGGTCTCGATAATTAGTTTTCGGCTCGAGTCCGGACTTTCCACCGGCTCTATCCCGCAGGTAGCGAGAAGCGGCAGGGTCTCATCCAGTATGCGTCCCTTGGACAGCGCCAAGGTCAGGGTTTGTCTCATGAACTCTCCGTTGTTCTAATCAGGCACTCGTTGAATATCTGCGCCCAGCTGAGCAAGTTTCTCTTCGATGCACTCATAACCACGATCAATATGATATATACGATCAACGGGCGTCTCACCCTCTGCCACCAACCCAGCCAGAACCAGGCTAGCCGATGCCCGCAGATCAGTGGCCATTACCGGGGCTGCCTTAAGCCTTTCAACCCCGGTGATTGTCGCACTAGAACCATCGATTCGTATATTGGCACCCATGCGCTGCATCTCCAGGCAGTGCATGAAGCGATTTTCAAACACTGTCTCTATTACTCGGCCTGAACCGGTGGAAATGGCATTAAGAGCGCACAGCTGCGCCTGCATATCGGTCGGAAAGCCTGGGAAAGGGGCTGTCTCAATGTCCACGGCCGAGGGTCGTTGACTCATAACTAGCTCTATCCAGTCTTCACCAACCTCAATTT
CCGCCCCGCTTTGACGCAATTTTGCGATAACCGCTTCAAGAAGACTAGGATCGGTATCTTTGAGTTTAACCCGCCCTTGGGTCATCGCTACAGCGGTTAGGTATGTTCCAGTCTCTATTCGGTCAGGTAAAACCCTGTGTGATACTCCGGTTAGTGAGTCGACGCCCTCGATAACTAGCGTGGTGGTGCCCGCACCCCTGATACAGCCGCCCATGGCATTGATGCAATTGGCCAGATCTACCACCTCTGGCTCGCGAGCAGCATTCTCCAAGATAGTAGTACCCCGTGCCAGGGCTGCTGCCATCATCAGGTTCTCGGTACCAGTCACGGTACAGATATCCATATGGATGTGAGCACCACGCAGGCGTTGGGCCGTACCCTTGATGTAGCCTTCTTCAATCTCAAGGTCGGCACCCATCGCCCGTAGTCCGCTGATGTGGACATCCACGGGGCGGGAACCAATTGCACAGCCTCCCGGCAGGGCAACTTCCGCGTGACCGAATCTAGCCAATATCGGGCCTAGCACTAATATCGACGCCCGCATAGTCTTGACCAACTCATACGGGGCCTTGAATTCCTTAATAGAGGAGGTATCCACCTCTACTTCCATACCTTCATGCACGGTCAGCTCGACGCCCATCCTGCCGAGCAGCTCCATGGTCGTAGTGACATCATGAAGATGCGGGATATTGCCCACTTTCATGGGACCTTCAGCAAGCAGAGTAGCAGCCATCACCGGCAGGGCTGCGTTCTTAGCACCAGATATACGTATCTCTCCGTCCAGCTGATGCCCACCTCTTATCAGCAAACGCTCCATGAGCTAGTCCATTCTCTCCTCGAACCCTTCGCCCCCTCGTGCCACCATTCTCAGCACATCCTCTACCCCGCTTGCCCGTATGATGGCACGCATCTGTTGTGGTGCATCCCTAAACGCCAGGCTTACCCCAGCTTCGCGAGCCGCCCGTGACCACTCGACTAGCAGCGCAACCCCAGCGCTATCGGTGCGCTCCACCGCCGAGAGGTCGATCAGTCCAATCTCGCCCGAACGGACGAGTTCCTCACCTTGATGCCAAACATTAGCCACGGAGTCCAAATCAAGCGCCCCGTAAAGCTTAATACAACCGCTCTGGGTATCGCACTCAAGCTGCTCGCAAACCATTAAAAACCAATCTCCTGGTTACGCGCCCTCATGTCCTCTACAACTTCCGGCAAATCCTTATCGCGCAGACGAGAACGAAAATCCTCGCGGTAATTCTGTACTACACTAACCCCTTCAAAGGTCACGTCCTCCAGCTTCCAGGCATCTGAGCTATCAATGAAACGAAACAAAACCCGCGAGCCGCCCTCTACTTCGACCCCAACCTGAATCCCGGCATCGCGCTGTCTACTACCCAGGATACGAAAATCGAGGTCCTTATATTCTTCCATGCTCGAGGCATAGGTGCGCACCAGACTACGCTTAAACTCGTTGATAAATCTCTCCCGGGTCTCTGCGTCGGCATTACGCCAGTGAGGTCCGAGCATACGGGCCCCGATCCGCTCGAAATCAACATGCGGTTCAATGATCGGGGCAAACTTCTCATAGGTTCTTACCGGGTCATCTTTAAGTTCGTCGCCATATTCGTCCAACAGGGCAACAACCTTACTGCTGACCTCATCGACGATCTCGCGTGGGTGCTGCTTAGCATCCGCCGCTGCGCTCCACACAAAGGCCAGCGTAAAAAGAGCGAGGACCACAAGTAGACCAACTCTCGTTCGCAAGGATTTACTTATCATAGGCATACCAATTGCTCTTAGGAGCCACGCCCACCAATTGCGGCAGATTTGCGTGACTCGTAGTGGCAGTAGATGAAACTCCCTTAAGTCTTGAGGTTAGCTTAACTCTGCCCAACGGTAAACTTGCGCCGCTTCAATCCCCCATTGAGCTTATAAACTGACCTATTAACCTCTCCAGGACCACCGCGGATTGAGTCTCGGTGAATTGATCGCCCGGCTGGAGA
GTCTCGTCTTCCCAACCCGGATCTATTCCTATGTATTGTTCACCCAACAACCCAGATGTATAGATTGCAGCCCTACTGTCAGCGGGTATGGAGTCGTACCTTTGGTCTATATTGAGCACCACTCGTGCCTGATGGGTCTGCGGATCGAGACGAATGGTTTCAACCCGGCCAATCTTTACCCCGGCCATCTGCACCTGCGCCCTCTCGCGCAATCCCGCCACGTTGTCAAACGTCGCGCTAACCTGGTAACCGGGGGATTTGCGAATCTCCACTAGGCCGCTAACTTGCAGCGAAAGGAACAGCATCGCAGCCAGCGCCACCACCACGAAAAGCCCAACCCATATCTCTACAAGCCGCTGTTGCATAAGTTTTCTCTCTAAAATCAGCCGAACATCAGGGCCGTTAACAGAAAGTCCAGGCCGAGGACTATAAGAGACGCATGAACCACACTCAACGTAGTCGCCCGACTGACTCCTTCGGAAGTCGGCACCGCCATATACCCCTTATGAAGCGCAACCCAGCCCACTACCACAGCAAAAACTACGCTTTTTATAAGCCCATTGAGTATATCATCCCGGACGCTAACAGCATTCTGCATGTTAGACCAAAAGGCACCGCCGTCGATTCCCACCATGCCCACCGATACCAAATAACCGCCAATTATGCCTACCGCGCTAAATATGGTTGCCAATAGCGGGACGGCGATGAGAACCCCAAGGAAACGCGGCACCAGGATACGCCTTTCCGGATCAATCGCCATCATCTCCATGCTCGACAACTGCTCTGTAGCACGCATCAGGCCAACCTCTGCGGTAAGGGCTGAACCAGCCCGACCGGCAAAGAGCAGTGCCGTGACCACCGGCCCTAGCTCCCGGACCAGAGTTAGTGCAACCATCACGCCTACCTGCTCGGCGGCACCGAAATCCGAGAGGGTGTAATAGCCCTGCAAGCCGAGCACCATGCCAATGAAGAGCGCCGATACGACAATTATGACTAGCGATAGAACGCCAAGCTTGTAGACTTGATCAATAACCAGCAGGGGGCGGCGCACCCCAATCGACAAGGAGCGCATTAGCCGCAAAAAGAGGAAGACGCCCTCACCCACCCCCCTCAAACCAGTTATTACCTTGTTGCCAAGTGCCTGTAACAGGCTAATCACGCTTGCAGCTCCTTATGTATCTCTTCGGCAAAACTCGGTCCGGGGTAGTGAAAAGGCACAGGGCCATCGGCACGGGCATCGATGAATTGACGCACCGCTGGATATTCGGAACGCCGCAGCTGTTCAGGCGTGCCCTCCGCGATCAGCCTACCCTCGGCCAGAAGGTAGACGTAATCAGCTATGGCCAGGGTCTCTTCGACATCATGGGAAACCAGAACGGAAGTTAGCCCAAGGATGTCATTGAGCCTCCTTATCAAGTCAACTAGCACACCCATGGTAATGGGATCTTGACCGGCAAAGGGTTCATCGTACATAACCAGGACCGGATCCAAGGCAATCGCCCGTGCCATAGCAACTCGCCTAGCCATACCTCCAGAAAGTTCGCTAGGCATTAGGTTCTGCGCCCCGCGCAGGCCGACTGCCTCCAGCTTTAAAAGCACCAAAGTACGCAGCAAAGGCTCCGGTAGCCGGGTGTGTTCGCGCACGGGAAAGGCGACATTCTCGAACACATTCAGGTGGGTGAACAGGGCCCCGCTCTGGAAGAGAACCCCCATACGTCGACGTAACGCGTATAGCTCGCGGCGCTTTAGCTTAGGCACCTCAACGCCTGCAACCTCCACCAGGCCCTCAGTAGGAGCGAGTTGCCCACCCAAAACCCGCAACATAGTAGTCTTGCCACTACCACTGGGGCCCATTATTGCAGTCACTTTGCCGCTGCGTATATCAACGTCAACGCCGTCAAAAATCCACCGGGAGCCACGTCGCACCTTTAGACCACGCACTTTGGCAAAGATATCATTTGCTTCATCGACGGCCATGGTTATTGTGCCTGTATTATAAT
CTGAACAATCTATCCGGATAGGGCTAAAAACCTCTTCGAGGACCAAGTATCCACTCAACTGACCATGCGGTCAAACTAGCGCAGGCCGGAGAAATACTGATAGCCTCGCAGGGATGACTCACGGATCCGCCAAAGCCGCAGTTGCGTGGCTTTACAGGTTGCTAGAAGATCTCCAAGAGCAAGCACTAAGGTGTCAAGAGAATTGAACAACACCCTCTACCAAAGCAAGTCAGAGCACAACGACCAGGAACTTGCTGAGCACCAAGAGACAAATAAGCGTCTGCAGAGACTTGGCCAGGAGGTACTTAAGCTGGAAGCAAAGGCTGTCAACTCTTTGGCTGAACGCATCGGTGATAACTTCAGCAGAGCCTGCCGCCATATACTATCCTGCACTGGCAGGGTAATAGTCACCGGCATGGGTAAATCGGGGCATATAGGCTCTAAAATAGCAGCCACCCTAGCCAGCACCGGCACCCCGGCGTTTTTCGTCCACCCCGGCGAAGCTAGCCATGGCGATCTAGGCATGATAACCGGCGCCGATGTGGTGCTGGCTCTTTCCAACTCTGGCGAAACCCATGAACTCAACGCCATATTGCCCCGCTTAAGGCGTCTCGGCGTACCGCTTATCTCCCTTACCGGCAATCCGGATTCCACCCTGGCCCGCGAAGCCACAGTGCACATCGACATACAGGTGGAAAAGGAGGCATGCCCCTTGGGCTTAGCACCAACCTCTAGCACAACCGCGAGTCTAGCTATGGGCGATGCGCTAGCGATAGCCCTGCTTGATGCCCGCGGCTTTACTGCCGAGGACTTTGCCCGCTCCCACCCTGGCGGTCGCCTGGGTCGGCGACTTCTGCTGCACATTGAGGACGTCATGCAGACCGGCGACAAGGTTCCGCAAGTTACTCCCGGAACATTGCTGCGCGAGGCCTTGCTGGAGATAAGCCACAAAGGCTTAGGCATGACGGCTGTAGTGGACTCTACTCAACAAGTGCTCGGCATATTTACAGACGGTGACCTACGCCGCGCTTTAGATCAAGGCATTGATGTGCACACCACCGAGATTGCGACAGTAATGACCCAAGCTCCGCAAACGGCCCCGCCACACCTGCTCGCGGCCGAAGCCGCGGAGCGCATGGAGCGCCATCGTATCAACGCCCTGTTGATAACCGCTGAAGATGGCAGGCTGATAGGCGCATTGAACATGCATGACCTACTCCAGGCCGGTGTTGTATGAGTAGAATAAGCACCACAAAGCTTGCTTATTGCACTCCCCCTAGTGAAGCCATTCTAAGTCTGGCGGCAAACGTCAAAACTGCGATATTTGATGTGGATGGCGTTCTAACTGATGGCACGGTTTATGTCGGCGAGGACGCCAAGCAGATGCTCGCGTTCCATATCCATGATGGCAAGGGACTGCGCATGCTTATAGAGGCAGGGATAAACGTTGCCTGGGCGACTGCGCGCCGCGGCGACGCCGTCCTAGCCCGGGCGCACGAACTCGGGGTTGAACTGGTTATGGATGGCTGCCGGAATAAGGCTCAAGCGGTGCACCAGGTAGCAGCTCAGTTCGGCCATGGGCCGAGTGCATGCTCTTATCTCGGTGACGATATTATCGACATCGCGGCAATAGAGGTTGTCGGTTTAGGGGCTGCGGTAGCTGATGCCCACCCTCAAGTAATCTCCTGTGCTGCTTGGACTACCCAGCACTCGGGAGGACGGGGCGCAGCCCGCGAATTGGCAGAACTGCTACTCCTTGCTCAGAACAAATTAAACTATTAAACGTCCAGATCGCCTGAGGTGGCAGATAAGTGGCGCAGAGAATACTCTTCCCCATTGCGGTAGTTGTTTTGACTATCTTGGTTGGCATCTTCCTAACCCATGATGCCCCCGAAGAAGAGTTGCCCGCTGAACCCGAGGAAGTAGAGCGAGCTGACTACTTTCTGGCCGAATTCGCAATTCATCATCATGATGAACAAGGTAAAATGCGCTCCATATTG
AGAGGTAAACAAGGCGAACATTTTCCCGTAAGTCAGACCATTCAGATTGACCAGCCGCATTGGCAGGCGGAGTCAACCGATGGCTCGATTTGGTTCGCAAACTCGCCGTTTGGCTCATTCAAACGCGACACCCAAGTCATAATTCTGCATGAAGACGTAGACATACGCCGCCAGGCAGATGAGCAGCGCCCACCCCTGGATATTGTCACCCGTGAACTGTTTATCAATATCGATAACCATCAAGCAATGACTGAAGAACGGGTAGTCGCCACCGATCCATGGGGCTCGGCAGAAGGCATTGGCATGACTCTGGACTACTATCGCGACAGGCTGCAGTTGCACAACCAGGCGAAAGGACGCTATGAGACTAAACAAGGCGAGTAAATATTTATATACAGCACTGATTCTCTGTCATTGCGCTCTGTCGCTGGCCAGTGATTCGCAGCCCCGCCCACCGATAGAGGTTGAAGCTGACCGAGTCGACATGGATGCCAAGAGCGGAATTAGTATATACCAAGGCAACGCTGTAATGACCCATGGCGAGATGAGGATAAGCGGTGATCGCATAGAGATACACACCACCGAAGAGGGTGAACTAAAGCACCTAATCGTTATCGGCCAACCAGCCCGATACCGCGATCTACCCGAGGGAGAAGAACAGGAAGTGCGCGCTAAAGCGAGTCGTATGGAATATTATGTATCTGGCCCGGAACGGGCGTTATTTTTCGGTGATGCCCTTTTTTGGCAAGGGCAAGACTCGCTAAGCGGCGACAGCGTATTCGTTAATCTCGAAACCCAGGCTGTTAAAGCCCAAGGCAATGATGACCAACGAGCCCGAGCAATCATCTACCCTGGCCAACGTGAGGAGCGCTGAGCTGTGGCGACCTTGAAGGCGGAGGGACTATACAAGAGCTATCGGGGCCGGGCTATCGTTAGTGATGCCAACCTAGAGGTCGCTAAGGGTGAGATAGTCGGCCTACTCGGCCCTAACGGGGCCGGCAAAACAACTTGCTTTTACATGGTGGTTGGCCTGGTTAAGGCTGACGCTGGTAGCATAACCCTTAACCAGGCAGATTTAACCGAGCTACCCATTCATGCTAGGGCCAAAGCTGGTATCGGCTATCTCCCGCAGGAGCCATCAGTATTTCGTAAACTAAGCGTCGCAGACAACCTCAACGCCATTTTGCAGCTGCGCTCAGATCTCAGCCGAGCCGAGCGCAAAAGTGCAGCAGAAGAGCTGCTAGAAGACTTTGGCGTAAATCACGTTCGCGATTCTATGGGAATTTCTCTGTCTGGAGGGGAAAGGCGTCGGGTTGAGATTGCCAGAGCCCTTGCCGCTAATCCGCAATTTATTTTATTGGATGAACCCTTCGCCGGAGTCGACCCAATATCGGTTGGCGACATTCAACGGATAGTTCGCCGGTTAGCCGATCGCGGCATCGGTGTTCTCATTACCGACCACAACGTACGCGAAACCCTGGGCATTGTTCAGAGGGCATATATCCTTAGCGACGGCCGGGTCTTAGCTGCAGGAAACGCCCAAGAAATCCTCGCTGACCCTAAAGTAAGGGAGGTCTATCTGGGTGAGGATTTTAGCTTATGATAAGCTTTAGTGTAGTACTGTTTGAAAATTTAACCACCACTGTAAAGAATTTGATATGAAGCAAACGCTTCAGCTAAGAGTGGGCCAACAGCTGGCCATGACGCCTCAGTTACAGCAGGCCATTCGGCTGCTGCAGCTGTCTAGTCTCGAATTGCATAACGAAATCCAGCAGGCGCTGGAATCCAATCTAATGCTGGAGACTGATGAGGGCGAAGAGACAAGCGTTGCAGACAAGTCAAATCAGGATCAAAACGCGGAAGAGCAACCTAGCGGTGAGGCAATACCCGACGAACTACCCTTGGATACCAGCTGGGAAGATATTTATGACGCCAGCTACGCCTGCGCATCCCCAGGAGCTGATTCAGCCACAGACATCTTAGAGAATCGCAGCTCCAAC
CCCCAGGGACTGATAGAGCATTTGCTCTGGCAAGTGGAGATGGAGGACCTTACCGAGAGTGAACGTTTAGTTGCCGAGATCATTGTTGATTCTATTGATGAGGACGGCTACCTCCAGTCAAGTGTTGAAGAGATACACGGCCAGCTTCCGACAAGCATCCATTTCAGCCCTAGCGATGTGGAGTCTATCCTTGCCCGTATCCAGCGCTTGGATCCGCATGGCATAGCCGCTCGATCACCTGCAGAGGCACTTTTTATTCAGCTTGAGCAACTCCCCGAAACCACACCTTGGCGAGGGGAAGCGATGGAAATAGTCCTCAACCACTTGGAACGAGTCGCCGAGCAGCGCTATGACGAGATACAAGCCGAGCTAGGCTTGAGTGGCTCGGCTGTTGACGAAACACTCGAACTGATCAGGAGCCTCGATCCTCGGCCGGGTCAACAACTGAGCAACTCTGCACCAGAGTACATCATTCCTGACGTTACCGTATTCCGCCAGGACGGCGCTTGGCATGTTGAACTCAATCGCGAGATGACGCCTACTCTACGGATCAACCCCTACTACGCCAGTCTCGTTAAACGCGGCGATACTAGTGCCGATAATCACTGTTTACGGACCCACCTCCAGGAGGCTCGCTGGTTGTTAAAGAGCTTGCAAAGCCGCAATGAGACGCTGCTTAAGGTGGCCAGCGCCATCGTCGATCGGCAAAGAGAGTTCATGGACCACGGCGAGGAGGCCATGCAGCCTCTGGTATTGCGCGAGATAGCCGAGACAATTGACATGCATGAGTCTACAATATCGCGGATAACAACCAACAAATACATGTACACACCACGTGGGACGTATGAGTTCAAGTACTTTTTTTCCAGTCACGTAAGCACCACAGATGGAGGTGAGGCATCTGCTACTGCCATCCGGGCACGAATAAAACGCTTGATAAGCGCTGAAGAGACCAGCAAACCGCTTAGTGACAGCGCCATCGCTGAATCTCTCAGGGATGAGGGCATCAAGGTAGCACGTAGGACCGTAGCCAAGTATCGTGAATCTTTGGGCATTGCCTCGTCTTCGCAACGCAAAGTCAAACAAAGAGGCTACGGCAATAAGACGCAAGCTGCAGAACTTCGCAAATGAAGGATCCAAGCCATGCAAATCAAACTCAGCGGTCACCATATAGAGATTACCGACTCCTTACGCGACTATGTAAATGACAAGATGAGCAGGCTACAGCGTCACTTTGACAATTTGATTGACACTGAAGTCATACTATCTGTTGAAAAATTGCAACAGAAAGCGGAGGCTAATATACAGCTCGGCAGTGGTGGAGGCAGGGTATTTGCAAATGCAGTCTCTAATGATATGTACGCAGCAATCGATGCGCTGATCGACAAACTGGATCGGCAGATAAAGAAACAGAAGGATAAAGCGGTAGACAAGAAACGTCAGGGCACGTCCTTAAAGTCGAGCCAGCACGAAGACTCGCTCGGGACGTAGTTAAGCTAACTAGCTTACGGACGACATATGGACATCGAGCAACTGATTTCCCCAGAACGGGTCCGCTGCGTTAGTGAGGCCAAGGACAAAGAGCAGGTCCTGTCTTATGTTGGCCAGCTTATAGGCGAAGCCGAAGATCGTCTAACTGGTGATGAGATCTGTAAACGCCTAATGGCCCGTGAACGCCTTGGCAGTACCGGGCTGGGGCATGGCGTCGCCCTGCCTCACGCCCGCCTTGAGGGTATTGATAAGGCCATTGGCGCCTTCATACGGCTCGACCAAGGGATAGACTTCCACGCCTTTGATAGAGCCCCGGTGGACATCCTCTTTGCTCTAGTAGTTCCGGAGCATTTCACTGACGAACACCTGCAGATACTGGCCGCCTTGGCGGAAATGTTTAGTGATCCAGAGCTATGCGAAAAGCTGCGCAGCCCCAGTAGCGATGATAGCTTGCATGAAATACTCCGGGGATGGCGTCCGTCGTCAGATTCAACATGAGTCTTTCAAAA
TCAAACGATGGCACTATAATCCCCGGTGTGCTGATGCATATCAATGGCCGGGGAGTTCTGCTCCGGGGGCCTAGTGGGGTTGGCAAGAGCGACTGCGCCCTGGCAATGATTCAACGCGGCCATCTTCTTGTAGCAGATGATGCCGTGCTGATTAAACAACAGGACAGTCAGCTGATAGGATCATGCCCGCAGATCGGGTTTGGGCTAATACACCTGCGCGACTTAGGGATAGTAGACATAAGAGAAATCTACGGCGCACAGTCTCTATCCCGCAGCAGCAGGATAGATCTGCAAGTCACACTAACTAGGAATGCAGGCAGTAATTGCCCAATTTCACTAATCAAGGGGCGCCGTCATAGCGCTAATTACTGCGGCACTATTATCCCCGCACTGAAGCTGACCGCTGTAGAGCAACGGCCATTAGCGGAACTCATTGAAATCGCAGTAGCACAGCTAAACTCAACCAGCGAAAAACTATCACACCAAGACATTCCAAACCGTATTACTTACCAAGAGAGTCTTGTTAATACGGACTCTCCCCCATTTGACGGTGCTGGTGAAGATCAAAATACGGCTTATACCCCGATAAACAACCGTACTTCTCGGCTATCACGCGCAGCAGAGGAGGTGGCGTGGCCGAAATCAGGTTAATCGTCGTCAGCGGCCTGTCTGGATCCGGCAAGAGCGTTGCCCTAAACACCCTTGAGGATGCTGGATTTTACTGCATCGATAATTTGCCAGTAAATTTAATCGCTGACTTCGCCCTCTTTGCCAAAAAAAACCATCAAAGGATTGGTAGCAAATTTGCGGTGGGCATAGATGCCCGCAACCCTAAAGAAGATCTTGTCAACCTGCCGCAAACTATAGAATACCTGCAGCAAACTGGCATAACCACGGAAGCCATCTTTCTTTATGCCGATGATTATATTTTGATGCGTCGCTACAGTGAAACACGCCGGCGTCACCCGTTGGCTCAAGATGGCAGACCGCTCAGTGACTCGTTAAAAGACGAAAAGGCGGTATTAGAACCATTGCGCGAACGCGCTGATTGGACTATAGACACCTCAAGGACTAGTGTTCACGATCTAAGATCACTGGTTTATGAAAGGCTCGGAGGCGACCGCCCTACACTCGCGATATTGGTTCAGTCTTTCGGTTTTAAACATGGAATCCCGACCGATGCCGACTATGTCTTTGACGCGCGCTGCCTACCTAATCCGCACTGGGTACCGGAGCTAAGGGCTAGTACCGGGCAGCACGTCGGGGTTTGTGACTTTTTAGAAAACCACCCGGAGACCGAGACTCTATACAATCAGATATTGGGTATAATAGCTTACTGGTTACCAACTTATAAGGAATCAGGAAGGAGCTACTTGACCGTGGCAATAGGTTGTACTGGCGGTCAACACAGATCCGTATATTTAGCAGAAAGGCTTGCCAACGACTTGGCTGAACAGTGCAGTCGGCTAACCTTACGCCACAGGGAACTTCCATGAGCGTAGGATTGCTCCTGATAACTCACCAAAGGATAGGCCAGGAGCTATTACAGATTGCTTCCCAAACTCTAGGTGTGTGCCCGCTACAAACCAAAGCTCTGGGCGTTTATTTTGAAGATGAGCCACATAAGATGGCCCACAAAGCCGAAGAAATGATAAAAGATCTCGACAGCGGCTCGGGGGTGCTTATTTTATCCGATGCCTACGGCTCAACCCCGGCCAACATAGCCGTATCAGCCGCTAAGAACAAGACCACAAGGTTGGTTACAGGAGTAAACCTGCCGATGCTCTTGCGCATCTTCAACTACCACGAAAGCTCTCTTGATGAACTTACCGAAGCTGCTTTTAAAGGTGGGCGCGACGGCATCGTATCACCAACCACCTAGGAGCAGCATGGATGACTTGTAAGGAAGCTGTTATACAAAACCGTCTGGGACTTCACGCCCGTGCTGCGGCACGTTTCGTCAGTGTTGCTTCCGGCTTTAAGTGCGACATCCATGTCTG
CTCTGGAGATAAAAAGGTAAACGGCAAGAGCATAATGGGGCTGATGATGCTAGCAGCTGGGCTCGGCACGCAAATCACCATCGAAGCCAACGGCCAAGATGAGGAACTAGCTGCCAGGGATCTAGCCGAACTAGTGGAGAACGGATTCGGCGAAGATCCCAACCAGGAGGATTAAGTTATTCCCCTGCGACCATCATACCATCCACTAGGATTGAACCGGTTCGAATGTTTCCGCGGCAATCCATATCGTTACCTACCGCCCTTATACCGGCGAAGATATCACGCAGATTAGCTGCTATAGTGACCTCCTGAACAGGCCGGCGTATAGCCCCATTTTCTACCCAAAATCCAGCAGCGCCTCGCGAATAATCCCCGGTAACTAGGTTTACGCCCTGGCCCATTAGCTCTGTCACGATCAAACCACGCTCAAGGTTCCTAACCAAAGCGCCAAAGTCATCACGTCCAGAGACGACCTCGAGGTTGTGCACCCCGCCAGCATTTGCTGTCGTCTCTAAACCCAAGCGCCGAGCAGAGTATGTATCAAGTACATAGCGCTGGAGGGTGCCATCACTGATAAGGGGTGAATCGGTCGTCGCAACACCTTCATCATCAAAGGCAGCGCTACCCAGAGCTCGCGGTATATGAGGACGCTCGACCATCTGCATCCATTCGGGGAAAATCTGCTCGCCCTGGCAATCGACAAGGAAAGAAGCTCGCCGATAAAGGGCAGATCCTCGCAAAGCGCCAACCAAATGGCCTACCAGTCCGCGCGCTACCGGGGCTTCAAATAGAACCGGAACCCGCTCGGTAGAGACCGGCTCTGCCCCCAGGCGCTTTACCGCGTGCTCCGCAGCCTTTAGACCAACCTGTTCCGCAGCTTCGAGGTCAGCGGGGTTGCGCGCTACCGTATACCAATAGTCTCGCTGCATGCCACTGCCTTCACCTGCCACGGCAACACAGTTGATGCCGTGGCGCGACTCAGGCACAGATGCCATAAACCCATTACTATTGGCGTAGGTATGCACCCCCTGGTGGGCGCTGACCCCAGCGCCCTCGGTGTTGCTGATCCGCGGATCAGAATTGAGGGCAGCTGCCTCGCAACTACGTGCCAGTTCAATGGCATCATCGGCGCTAATCGACCAGGGGTGATAGAGATCTAAATCGGGTATATCACGAGCCATGAGTTCGCTTGGAGCTAAACCATTAGCTGGATCTTCTGAGGTAAAACGAGCTATATCACAGGCCGCTGCAACTGACTCGGTGACGGCCCGATCGCTTAGTTCGCCCGCGGAGGCACTACCCTTACGACCCCCGAAATAGACCGAAATGCCCAGACTACGGTCTCGATGGTGCTCGAGGGTATCTATTTCACCCTTACGCACGTTAACCGAAAGCCCCAAACCGGCCCCGAGGCTCACCTCTGCTGCATCTGCACCGGACTCATTGGCACGCTCCAAAGCCATATGTGCCAATCTCTCAATATCCGCGACCGCAGGCAAGCCACCATTATTGCGTCCTTTATCCGTCAAGCCTGTGTGCCCCCTACAGTAATACCGTCGACTTTGAGTGTAGGCTGTCCCACGCCAACCGGGACGCTTTGCCCTTCTTTACCACACACTCCTATGCCTCCGTCCAACTCAAGATCATTGCCTACCATGCTCACACGGTTGAGGACTTCCGGCCCACTACCGATTAGTGTGGCACCTTTGACCGGATGGCTTATCTTGCCATTCTCAATCATGTAGGCCTCACTAGCCGAGAAGACGAATTTTCCGCTAGTTATATCCACCTGACCACCACCGAAATGTACCGCATATATACCCCTATCCACCGAGGAGATAATCTCCTCAGGCGATTCTTGTCCAGGTAGCATGTAGGTGTTGGTCATCCGCGGCATAGGCAGATGTGCATAAGACTCGCGCCGGGCATTGCCGGTTCGTTGAGAGTTCATAAGCCCAGCATTCAGCCGATCCTGCATATAGCCAACCAATACCCCATTC
TCAATCAGTGTATTGTACTGCGTTGGATGGCCTTCGTCGTCAATAGTTAACGAACCGCGCCGGCCAGGCAAGGTTCCATCATCGACTACTGTACATAGAGAAGAGGCTACACTCTCACCAATACGCCCACTAAAGGCAGATGTCCCCTTGCGGTTGAAGTCACCTTCCAGGCCATGGCCTACCGCCTCATGGAGCAGAACCCCTGGCCACCCAGGCCCAAGCACCACCGGCAGCGTCCCGGCTGGTGCATCCGCAGCTTCTAGATTGATCAGTGCCTGGCGGGCTGCCTCGCGGGCATACGACTCGACTTTATCAGGGGTAACGAACATGTCATAAGTCTGACGTCCGCCGCCACCACTGACGCCGCTCTCGCGGCGTTCACCGGACGCTGCCAGGACAGATACATTGATCCTCACCATGGGCCTCACATCGGCAGCCCAACCACCTGCAGAATCAACCACCAGGATTGTCTCATGGCTACCAGCAAGACTGGCCACCACATGAACTATTCGCGGATCAACAGAGCGCGCTGCATTCTCTGCCCGATGCAGCAAAGCCACCTTATCATCGGCACTGAGGCTGGCCAGCGGGTCATCGCAGTTATAGAGAGGGTTCACTTGACGTGGGCTGAAAGCCATCGGCGTCGAGTTATTGCCACTATGAGCTACAGCACCAGCTGCCTCCGAAGATTCCAGGAGGGCATTTAAGTCAATCTCATCGCAATAAGAGAAGCCTGTCTTCTCCCCGCTTATAGCTCTCACCCCGACCCCACGATCCAGGCTACGAGTACCCTCTCGGACTATCCCATCCTCGAGCACCCAGGCTTCACGCCGGGCTAGCTGAAAGTAGAGATCCCCGAAATCAACCCCCGGGCGCATCAAACGCGCTACGACTCGTTCCAAGTGTGACTGGTCTAGCTCAGCCGGATCGAGGAGTTGGCGCTGAGCTTGCTGCAAAAGATCGGTACCCATATGTGAGTCATAGACCTCGGTAATTGGACAAAAATAAGGCTACAGCTTGCGATGGCCTAGTGCTGGGAAACGACTGCGAATCTCATGCAAGCGCTGCAGATCAACCTCTCCAAGAGCAATGCCAGAGCCGCTAGGCAGGCTGTCTATAACTCCGCCCCACGGATCTACCACCATACTCTCGCCGTGAGTTTCCCGGCCATTGACATGGTAACCACCCTGATCAGGGGCAACCACGTAGCAGAGGTTCTCCACAGCTCTTGTCCTGACTAAAATGTTCCAATGGGCTTCACCGGTAACCGCAGTGAACGCAGAGGGCAGCACCAAAAGTTCCATGCCCTGCTTGGCAAGCTCGCGAAAGAGTTCAGGGAAACGCAGATCATAACACACTCCCAGGCCTATCTTGCCTAACGGAGTGTCGAGCAACAGCGGCTCATGTCCACCCTGCTGTACTTCTGATTCCCGGTAAGCCTCTCCCGGAGCCACTTCAACGTCAAAGAGGTGAATCTTATCGTAACATCCCCACCGCTTGCCATCAGGACCGTATACCGGAACCGCTGGCCTGATCTTGGTGCCGCTGGCCGTGCTCAAGGGCACCGTACCACCCACTAGGAAGATGCCGTGGCTACTGGCCTGCGCTGACAAGAAATCCTGGATAACACCAACTCCGTCGGGCTCGGCAACCTTCAATTTATCGCTCTCTGTATAGCCCATGAAAGCAAAATTTTCCGGCAGAGCAACCAGGCTTGCTCCTGCTCTGCAGGCCTTGCTGATCAGACGCCCTGCCTCCTGCAGATTGGCATCTACATGAGGGCCAGAGGCCATCTGGACGGCGGCAACAGAAAACCTTTGTCTAGATGTCATCACATTCCTTTTACAGTTGTTATCCTGATGAGACTTCCCTCAGGGTTCTGGCAAGGAACCTCTTAAAACTCCCCACCGAGGCTACCTGCCTCGGTGGGGAGGTCTTGGCGCCAGGGATGGCCCCATGAGACCTCCAGGGATGGATTCACGGCGTCCTCCACACCGAAG
GCAGCTAGCCTCGGTGGGGAGTTTTGAGAGGCCTCCGGCAGCCCCCCTGAACCCCTCAGCATCAAGGCCGGACTCGCTCTACTTGTGGCTCATTCCAAGGACCGGTTACACGATAGACGAAAGAGGCTGCCCGATCCATCGGACCACCGATGATCCAGTCGGCCATGACTCCTAAAGCACCTCCTGCCCCGCCACCAAATACCAAACCAAATATGGGTAACGTAGCAGCAATTCGCGGCATCACGGCAACCTGCGCATCATAATAGCGGGCAACGTAATCGACAGCGCCTGACATGTCGACTCTAGCAGCTGATGAGTCTATATGCATTTTATCAATTCTCGCCATACCATCACCCAAGAGACTGAACTCCCCCTCCACGCTTTCGAAGGCAAGGCCATCTCCGAACAAGTCGCCGAAGTCCAGCAGAACTCTGCGCGGTAACATCCGCAGCCCAAATAGCCCCACTAATCGCCCTGCCCCGGGGTTGATCTCGGTTATCCGTCCATCGCTCATCTGCAAGCTTAAGTCGCCACTAAGTGAGGCCAAATCGAAATCGAGCGGCGAACCTGGCCACTGCCCTTTGGCGTCCAACTCCACCCCAGCGCGCACTACAGCCGCCGGGGCCCCGAATAGGCCAAGAAGCCTAGCCACATCGGCAGAGAGCATAACTGCTTCGAAGTCTGTACACTGAGGTAATTGGGCTTCGTTACAGTCCTCTTGCCAGTGCGACTGCAGCTGCAAATCCAGCGCCTTCCCGCGGAGTGTAACTTTGCCTTGAGCAAGATAGTCGGTACCATTCCCTGGCTGCAAACTGAGCTTAGCCAGCCCGGCATGCCGGCCGTCAAGCCGCAAGCTCTCAATCCGGCCCGCGAAGATAGGCAATGCTGCGGCATCTTCCGGCGTAAGGCGGGGAAAATCTCTGACCTGAGGCTCAACTTCTTTAGGCGCTGGATCGGGTACTCTGGTGCGACCCTGCAAGGGTACATCTAAATGATCAAGATTAACTCGGATATGTGCTCCACTGTCACGCCAAAAGGCCTCACCAGTGGCTAGATCACCTTCTAAATCGAACCATCCGCGTCCATCTCTTGCGGCCCTAGCGGCAATTCGCTGAACCCCGTAGTTACGTCCACTAAAGAGCAAATTGCCCAGGGTAAGGTTGAGGTTAACCGGCAATAGGTCATGCCAGTCAACTTCCGCCGTTTGCCCATCAGCATCGCCCCCAACATCCGCTAACCCACCACCGACATTATCTGCGAGCCATATCCTCCAGGTATCTACATCGAGCACCGGCAATCTGCCATCGATCGTTATCCCGGCATTAGGCAACTCGGCAGCACCACCTCCTAATCTAATCCCTAGGGCTTGAGGCACGGTTCCTTCAGGGGCAAAGGCCAAAAGATCCAAAACATCAGCGTAGTGCAGGGCATAAGACTCTAGCCCCACATCTGTAAAAGTCGCATCTAGCTGCAATTCGCGGGCCTCTTGCTTACTCAGCCCGAAAGGCTGGGGCATATCTATGGCTAAGCCCTGCAACGAAGAGTTCAGCGAGAGTTCCAGCTGGGGATCGATAGAGGCTGGCTCGAAAGTAGGGGCGATGAACCGCAAGTTCCAGGCCGACTTACCCTCTAACCAGGGCAAGTGATCAGCGTCAATTAACCCGGTTAACGAGTATGGGCTACCATGGGCGGTTACTTCGCCGACCGTCCTTTTCCGCCCATCGTGAGTCTTTAAACCTGCTGAGGCGCTAAACTCCTCTCCACCCCAGATACCATCCAATTCAAGGTCACGCAGACCCTGATTATCTACTTCCACCCCCCCTTTTACATCGCCTAACTGCAAATACTGCTCGAAGTCAAAATCGACGCCGTGAAAATCTATGCCAGTGCGGTATGTCGTGCCCCCTTCTACCGCATCTGTTAATGGCAGCCACAGATCAAGATCCATTTTGAGGGGGCCCGCAACTGCAAAAGGTATAGGTGGGTCATCGAGAAGCTCC
CTACCAATCGGTGACTCGGTCAAAAAACAGATGCCGTCTTGGGCATCTCCGCGCGCTCTGCCATAAAGATCAATAACTGTTTTGCTTAAGTCGTCAATTTTTAGCTCGACATCATAAAGCTCTGCCCCACAAGTCACTCCTCGGTGCCCATCAACCGTTAATGAAAGGTTATCCAGGTTAACGTCTACATCCATTTCACTTATTAAAGGCCAATCTTCTTGATAAAGAAGATCAGCATCACGGAACCTGCTTTTAGCTAGAAAAACGCCGTCATTATCAGTAAACGGGTAATCGGCCGGATTTCCCTGAAAAAGAACCTCAAAACCGTCCAACCTGCCGCCTAAGAAAGCCATTTCTATCCAATTATAGACTGGCTCAGGAATAACCTTTGGGGGGATGTTTTCGACCAGCTCTGTTACTTGCAGAGGCCTTATTAAGGAGGTATACAACAACAACTCAGGCCCTTGCCCTGGAAAATAATCCAAGCGACCACGAACATTTTTACTTATAACTTCGCTATCAAAAACACCCCTCTCCAACGATACCCACCATGCCCCATCAAAAAAAAGCCCGCCTCTAAACCGCGCTCGTGCTTGGTTAACCGGCACGGGATCAGCAAATATCTCAGGGAGATATATGGACTCGTCGAGCAACTGGGCTTGTGCGACAAAGCGTCGACCCTGAAAACTAGCTGCAGCGGATACCGCGCCTACCCCAGGCAACCCATTACGTGGTTCGATTCCGGAGCTCGCTACAGAAAGGTTGCCATAAACCCCCCTAATACGTTGAGATAAGCCGACATCGTCGTCAAACTCAATTACCGGCCATAACCCCCCATTAAGGTTGCCAGAGATACCTCCAACGTCGAATATCGCCAAAAAAGGCTCAACATCTACTGAATCTAGGCCAACAAAAACTTCTGGCGCCGTCGACTTGTCAAGCGAGGCGCGCAGCGCCAAACTTATCCCCTTAGCCAAGTTGGCTTGTTCGCCAGCAACCCGCGCTCCTGCTTTGAGCTGCCTACCTTGTTGGACGAGGCGCACGTCTATGCCAGACACTGCGAGCGGCGACTGCTGAGCGGGGCTTACCTGCACCTCGGCAGCTGATATATATATATTAGCGGGTAAGCGAATGGGTTCTTCACCCTCCGCTAACTCCGGAAGCCCCCTTAATTGCCAACCTTTCTCCTCATCTTCAAAAACAATAACTCTTGCATCGCTAATATATATATCAAGACCATACGGCCCTTGCTCTGCCCAGTACCAGGGGGGTTTAGCCACTTCTATACCGAGACTACCAACTTCGAGCCGATCATCCCCCTCGCCCAAGCGCAGGCCATACAGATCCAGCGCAGGCCATACGCCATCCCAGCTTAAGCGGGCAGAATCAATAGTAACCGGGTGCTCTAAAGCCTTAGTCAACAAGTGCTCTAACGGCTCCTCCAAGGGCAGCTGCAGCCAGGCAACCGAGCGCAAGACAACCCCTACGGCGAGGACAGTTGCAGTAAGCACTAAAAGTAAGGCACTCAACCCCTTACCAACTAGGCTTAGTACGGCAATCACTTTATGCTGAGCAGACTCAGGCATGCTACACCGGTATCACATCAAACTGATCCGGGGCATAAAGTGTTTCCACCTGCAGCTCAATGGGTCGCTCGATGAACTCCTCAAGGTGAGCCAGGCCAGTCGATTCCTCATCAAGCAGTGTTTCTACTACTGAAGGGGCGGCAAGAACCATAATCCGCTGTGACTCAAACTGACGTACTTCTCGAAGAATTTCTCGGAATATCTCGTAGGAGACGGTCTCCGCCGTCTTGACAGTGCCCCTGCCGTCGCAATTCGGACATGCCTGGCAGGTGAGCTGTTCCAGGCTCTCGCGCGTTCGCTTGCGGGTCATCTCCACCAAACCGAGTGCGGAAACCTCACCGATCTGGGTTTTGGCATGATCCCTGTCCAGAGACTTTTCCAGTGCCCGCAACACCTGCCGCTGGTGCTCGGGAT
CGTTCATGTCGATGAAGTCGACGATAATAATCCCACCGATATTACGTAAACGCAGCTGCCTTGCTATAGCTTGAGCCGCTTCAAGGTTAGTCTTAAAAATCGTCTCATCAACATTGCGATGACCGACAAACCCACCTGTGTTGACATCAACAATGGTCATCGCCTCGGTCTGATCTATTACCAAGTGGCCGCCTGATTTAAGGTCGACGCGTCTCGCTAAAGCCCTCTCTATCTCATCCTCTATCCCATAAAGGTCAAAAATCGGTCGCTCACCAGAGTATAGCTCTATCCGCTCTAGCGTGTGAGGCAATAGATCGGAGCAGAATTCGAGCAGGCGCTGATAGCCTTCGTGAGAATCTACCCGTACCCGTTCCAAATCGCGATTCGCCAAATCGCGCATAGCGCGCTGATAAAGCGGTAGATCCTCGTGCACCAGAGTGCCCGGTTGAGCACTGGCGGCACGCTTGCCTATCGAGTCCCACAAGCGGAGCAGGAATTCCCGATCAGAGCTTATCTCCTCTGCACAGGCGCCCTCTGCCGCTGTGCGAAAAATCAGCCCAACCTGTCCAATACCGCCCTCACTCTGCTCATTGATAGCCTGACCAATCGAACGTAACCGCGCACGTTCACTTTCATCCTCAATGCGCGCGGATATTCCCATACCGCAAGAACTTGGGGTGAGGACCAGATAACGTGAAGGCACGGTTATCTGAGTGGTAACGCGTGCCCCTTTATCCCCGATTGGGTCTTTAATCACCTGGACAAGAAGGCTCTGCTGTTCGCGAATCAGTTGATCTATGGGGGGGGCTGGATCAGAGGCATCACGCTGCAAATCAGAGACGTGAAGAAATGCAGTCCTCTCAAGCCCAGCATCAATGAAGGCAGCCTGCATGCCAGGCAAAACGCGGCGTACACTGCCTAGGTAGATGTTGCCTACCAAGCCGCGCCGTTGGGCCCTCTCCAGGTGGATTTCCTGGAGTACACCATTTTCTACCAGGGCCACCCGCGTTTCACGCGGAGTCAAATTTATCAAAATCTCCTGGCTCAACCGGACCAGCCCTCTCTCCAGGGTAAGGCGAATTCGTCGAACAGGAGTGTTGTCTCATACAACGGCAGCCCCATAACCCCAGTATAACTACCTTCTAAGTGCTCGACGAACACCCCCCCTAGACCCTGTATAGCATACGCTCCCGCCTTATCAGCAGGCTCACCGCTCGCCCAATAGCGTTCGACTTCTTCTTCCGTGATTGTACGTAGAGTCACATGGCTGACCGACAAACGAGTAGCTTCGCGATCGTCTGCTAGAGCTACCCCAGTCACCACTCGATGCGTATGTCCACTCAGGCTTAGCAACATATCGCGAGCATGTGTTTTATCCTTCGGTTTGCCGAGTATCTGGTCGTCCAGAACCACCGCAGTATCAGCCCCTAATGCCGGAGCACCCCCTTCTGCTACCGCATAACCGCGCCGCGCTTTTTCCAACGCCATTCGCAAGACATACATTTCAGCCCCTTCATTTGGATCAGGGGTCTCGTCAACCACCACCATTATGGAGTGGTATTTAATCCCTATCAGCTCCAGCAATTCCTTACGTCGCGGGGAACGTGATGCGAGGGTTATCAACTGGTCCATGGCTGACTTTCCTTTCGGTAGTTAGTGCGTGGGGAATCGCTAATTGTGGATGCTACGTTTGCTGTCTATCCCCGATGATAAGGGTGGCCACTGAGAAGGGTCCAGGCCCGGTATACTTGCTCGGCAACCAGAACCCGCACCAACATATGGGGCAGGGTCAGCGGTGATAACGACCAACGCCTATCAGCCCGCTGCAGGACAGCCGGAGCTAAGCCACCGGCTCCACCGATGAGCAAGGCAGCTCTGCCGCCGTGATGACTCCAGATATCAAAGTGCTCGGCCCACTGTCTGGTGCTGAGCGCCTCACCCCGCTCATCAAATGCAATACATGGGACACCGTCTTTTACTGCGCGCAACAACCCGTCTGC
CTCCCGTTCAGCAACCTGGGCAGCGTTATTGCTACGAGCAGCAGCAACCTCGGTTAGACTCAGTCGCCAGCCACGACCGAGACGTTCAGCATACTCGTTGTAGCCTTCGCGCACCCACTTCGGTGGCTTAGTTCCTACTGCTATGAGATCTAGGCGCATAATTCAGATTAGTAGCGTATGCCTATAATAGAGCCCTGCTTGCCCCAGTATGGAGGACGCCGTGAATCCATCACTGAAGGCTCATGGCGCCATCCCTGACGCCAAGATCTCCACACTGGGGGAGCAAGGAGGCTCAGGTGGGAGATTTATAGGCGCCTGCCTAACCGCTCCCAACTTGCATCTCATCATCCATTTCCCACATCCGCTCAAGACGGTAGAAATCCCGGCTCTCAGCAGTCATGACGTGCAATACCACATCTCCTAGGTCGATCAAGACCCACTCGGAGCCAGCCTGATCGCCTTCGATCGCCAAGTTAGATAGTCCCTGTTTTTTGGCCGCATCAACCACGTGCCGAGCAATCGACTGGATATGCCGACCCGAATTCCCGGTAGTAACCACCACTAAATCAGTAATCGGAGTCCTTTCACGGACATCAATTACTACCGTCTGCTCTGCCTTGATATCTTCAAGGGCCTGACGGACTATATTTTCAAGCTCTTGTAAGTGCATAGGGCGCTCTCTAGTTAACCATCACAACTGTCTATACCCATACAGAGACTCCGCAACGATGCGCTCCCTTACCCTTTCGGGCAGCAGATAGCGGACCGACTTGCAATTGGCCAGGGCTCTACGAATGCCAGTAGCAGATATTGGCAGGGGGGTTACCGGCTGCACGTATATGCCACCAAATGCCTGATTATGCAGCAGTGCAGGATCACTGATGTGGCGCTCAGCGAGTACAGCGGCCAGTTCGGCTGGCATCTGGCCTGTTACACCAGGGCGCTCAGCGACGACAATATGGGCCAAATCGAAGAGTTGTCGCCAACGCGACCACCCCGGCAAGCCGAGAAAAGTATCATAGCCGAGAATCAGGCAAATCGATGTCTGGGCACCCTTCTCGCGCTGCTCCGCCCATATCTGCACCAGCGTATCGACCGTATACGATGGGCCTTCTCTGTGCAGCTCACGGGTATCTACTACGGCAGCCGGATTGTCGGCGATTGCCAATTCGACCAACTCGGCTCTTACTTGAGGGCTGAGTCTAGGTCGTGCTCGGTGCGGTGGTATGCGCGCCGGCATAAACCGCAGCTCGCTGAGCTGTAACGCTTCCCGAACCTCCTCCGCCGGGCGCAAGTGGCCATAGTGGATGGGATCAAAAGTACCTCCGAGGATACCTATTGAATGGTGGGCTGCCTGCAAACCGATTGTCACCCCTTGCTAACCACCGCGGATATGGCCATCGCCGATTACGATAAATTTCTCAGTAGTCAGCCCTTCCAGCCCGACTGGTCCACGTGCGTGGAGCTTATCGGTACTTATCCCTATCTCAGCACCGAGGCCATATTCATAGCCATCGGCGAAACGTGTCGAGGCGTTAACCATAACTGAACTGGAATCTACCTCGCGCAAAAACCGCTGTGCCATAGGGTAGCTCTCGGTAACTATAGACTCGGTATGTCCAGAGCCGTAGCGCTGTATATGGTCAAGCGCAGCATCAAAGTCATCGACGATAGTGATGCTGAGGATAGGCCCGAGGTACTCGGTTCCCCAATCTTCTTCACTAGCCGCTATAGCCTCAGGCACCATGGCGCAAACCCGTGGGCATCCGCGCAATTCAATATCCGCTTGGCGCAACTGCGCGGCTATCTGGGGGAGGACGCGCTCGGCAACATCAGCCGCTACTAGCAGTGTCTCCATGGTGTTACAAGTGCCAAGGCGCTGGACTTTGGAATTGACCGCAATGGCAACCGCTTTATCAATGTCAGCAGCGCTGTCAATATAAACATGGCAGACGCCATCGAGATGTTTGATTACCGGGATGCGCGCCTCATTGCTAATC
CTCTCGACCAGCGATTTACCCCCTCTAGGCACAATTACATCGACATACTCGGGCATCTGGATCAGTGCGCCAACCGCATCTCTATCCGTAGTGGCAATGACCTGCGCACCGTCTTCAGGCAAGCCAGCCTGTACCAATGCCGAGCGAATACACTGACCAATGGCTTGATTCGACTTGATTGCCTCAGATCCGCCACGCAGAATCGCAGCGTTGCCCGATTTAATGCACAGGCCGGCGGCATCCGCGGTCACATTAGGCCGTGACTCATAGATTATGCCGATCACTCCCAACGGCACCCGCATCCTGCCGACTTGAATACCCGAAGGCCGTGAGGCCACCTCGCGCACCGCCCCTACCGGGTCAGGCAGGGCTGCTATCTCGCGCAACCCATCTGCCATCGTCTGGATGCGTGCAGAGGTTAGTTCCATACGATCAATGAGGGCATCATCCAGCCCTGCCTGGCGGGCTGCGCCGAGGTCTTCTTTATTGGCCTCCGCTATGGCCGCGGCGTTGGCCTCGATTTGCTCGGCCATGCCCCGCAGTGCTGCATTACGGGCAGCAGTGGTAGAACGCGCAACCTCACGCCCCGCAGCACGGGCTCGTTTGCCGATCGCCTCGACTTGTTTAGCTATATTGGCCGTTGAGTCGGTAGTCATTGTTCTCCGCTATATAGATGATCATCACAACAAGTCTCCCAAGTATCCCTGAGGGCACTTATAAAAGGCGCCACCGTTTACCAGCCACTACCAATCACCTAGATTCTAGCGCCGCGAAGCCGCTCTAGCCAAGCGCGATACTAGTGCCCTCAAGGCCTCTTGCGGATTACTTCCGGGAACTCCCTTGATGGCGCAATCGACCTCGTGGCAACGGCGCAATAGCCGATGCCACGTATGAGTGGGCTGGCGCCTAGCTGTGGCCTTCAGCAAGCCGCTGCGTCGTTTAAATATCTTCTCCTGATAGAGGATCTTATCATCCGCACCCGCAGATAAGCGGGCGGCAGCTCGTATGTCACGAGCCAAGGCCCATAGAATCAGCGGTTGGCCAGTCCCTTCCTGCTCCAGGATCTCAAGTATGCGGTGGGCCCTGGGCAGATCTCCATTAAGGGCAGCATCGGCTAAATCATCGACGCTATATCTAGCGCTGTCGGCTAATGAGAGCGCAGCGTTATCGGCGCCGACCGGACCTTTGCCGTTAAGCAGCAGGAGCCTTTCGACCGCTTGATCGACGGCAAGAAGATTGCCCTCTGCCCGCTCCACAATGAGCTCAACCGCCTCGCTATCAGGCTTTAGGCCGGCTCTCTCCAGGCGCTGCCGCACCCACTGCGACATCTTGTCAAGCGGCAATGGCCAACAATAAACAAAAACCCCCGCCTGCTCAATGGCTTTGACCCAAGCAGATTCGCGCGCACCTTTATCAAGTCGGTTGCTGGTGACCAGCAATAGCGCATCATCCGGTGGATCGCTGCAATAAACCTGTATAGCACGAGAGCCGGCATTACCAGGCTTACCGTTAGGCATACGCAGCTCAAGCAGGCGCCGATCACCGAAGATCGACATGCTCGCAGCCGCCTCATTCAGCCGCCCCCAATCGAAGTTTGAGTCGACATCAAAAACCTCGCGCTCGGAAAAGCCAGACTGACGCGCCGCCTCACGCCAAGCATCGCTTGCCTCGCGTTGGAGCAGCGGCTCTTCACCGGCGATTAGACAGACCGGCGGCAGGGCCTGCTGTCGCTCGAGCCGGCGCAGAAATGCCTCGGGGGTTTCAGGCATGGTTAGATAACCTGCAGACGCAGCATTATCATCCGCGCAACCTCGTCACGCAGTTCCTCCTCTATATCATCCTCGCGGCGGCGACGTTGATCGGCCCCACCCTCAACCTCGGCAAAGGTGCGCATTGCATTTGCCGCCTGGCGGTCGATGAGCACATCTCCCGCGCTATCCTCAACGCTATAGCTCACTCTATATTCCAAGCGATACTCCTCGGCATCACCACCGACTCCC
ACCGCTGCGGTCTCCCGGTCGCTGCTTCGCGTATGCAGAATCACAACTAGGTCGGCCTCGTTACGGCTCTCTACCTGCCTACCCCCGGCGCCTTCAATGCCCCTGCGCACAGCCCTACGCAAATCCGAGCTACCGATATCGTCGATCACATAAAGGCCCTGTCCAGCAAGAGATATACCCCCAGGCGATCCCCGCAACTGCCAACCACAGGCGCTGACGACCAAGCCGAGCAAAACCACGGCTAGTAGCGCCCCCCTGAAATAGACTCTGAAGTGGCCGATCTTCATCTCACTTAGCAACAACCACGTTGAGCAGCTTGCCTGGCACGAAGATGACTTTTTTCACCTCTTTGCCCTCCACAAAGCGTTGTACATTAGGTTCTTGCAAAGCAGCCTGCCGAGCTTCATCTTCGCTGGCATCAGCCGGCAGGGTGACGTGACCACGCAACTTGCCATTAACCTGGACAACCAACTCCAACTCATCGCGCTCCAAGGCGCTAGCATCGGCCTGCGGCCATCTGGCATCAACAACGGGCTGCTCATAACCTAGCTGCCACCACAGGTAGTGAGACAAATGCGGGGTTATCGGGGCAAGGATCAATACCGCAGTCTCCAGGCCCTCCTGCATCACGGCCCGACCGGCATCGCTTTGATCTTCAACTGCCTTGCCCAGGGCATTGGTCAGCTCCATGGTCGCAGCTATTGCGGTGTTGAATGTGAACCGCTTGCCTATGTCATCTGAGGCCTTAACAATTGTCTCATGCACCTTGCGGCGCAGATCTTTCTGCTCCTTTGTCAATGACTCATGCAAGCTTTGCGCATCCAAATCGGGCGGCGCCCCGGCACTAACGTGATCGCGTACCAACCCATAGAGACGGCGTAGGAAGCGATAGGATCCTTCGACTCCGGAGTCCGACCACTCCAATGATTGATCCGGTGGCGCTGCAAACATGGTGAAAAGGCGAACGGTATCAGCCCCAAAACGCTCCACCATCTCCTGCGGATCAACGGTATTGCCCTTTGACTTAGACATCTTGGCGCCGTCTTTTAAGACCATCCCTTGGGTCAACAGACGGGTAAATGGCTCATCGGAACTGACCAGCCCCTCATCGCGAAGCACCTTATGAAAGAAGCGGGCATACAGCAGATGAAGGACCGCGTGCTCGATACCACCAATGTATTGATCTACCGGCAGCCACTGGTCGGCGCGCTCATCGAGCATGCTCTGATCCTGATCGGCACAGGCAAAGCGAGCAAAATACCACGACGACTCCATGAAGGTATCGAAGGTGTCGGTCTCGCGCTCAGCAGACCCACCACACTCAGGGCAGGTGGTAGAGTAGAATGATGGCATTGACTTGAGCGGGGAGCCGGCACCGGTAATCTCTACCTCTTCCGGCAAGGTGACCGGCAGCTGCTCATCGGGGACCGGCACAGCACCACAACTCGGGCAGTGGATTATCGGAATCGGCGCCCCCCAGTAGCGCTGTCGCGAAATCCCCCAGTCCCGCAGCCGATATTGGGTCTGGCGCTGACCAAGGCCCTCAGCTTCGAGGCGCTCGGCTATGGCCACAAAGGCGTTCTGCGAGGTCATCCCCGAAAACGGCCCAGATTCGACGAGCACTCCATAGTCACTGAAAGCACCTTCAGAGATATCGAGATCACTTCCATCACCAGGATGGATGACTGGCTTTATGGGGATGTTGTACTGAGTAGCAAACTCCCAATCGCGCTGGTCGTGAGCAGGCACTGCCATCACTGCCCCTGAGCCGTACTCCATAAGGACAAAATTGGCTACCCAGACCGGTATTCGCTGCCCAGTTAGCGGGTGAATTGCCTCCAGACCGGTATCTACTCCGCGCTTCTCGCGGGTTGCCATGTCGGCCTCGGCCGTACCTCCGCTGCGAGCCTCGTCGACCAGATCGGCGATCTGCTGATTACTTGCTGCCAACTCCAGGGCTAGCGGGTGTTCGGGGGCCAAACCCATATAAGTAACGCCAT
AAAGGGTATCAGGACGAGTAGTAAAAACCGTCAAAACCTCATCGCGGCCGTCGAGAGCAAAGCTCAGCTCGACACCCTCAGAGCGTCCGATCCAATTGCGCTGCATATTAAGCACCTGGTCTGGCCAACCCTTAAGGTTATCCAAACCATCAAGCAGCTCGCCAACATAATCGGTAATCCGCAGAAACCATTGCGGTATCTCGCGGCGCTCGACGAGGGCTCCAGAACGCCAACCACGACCATCAATGACCTGCTCGTTGGCGAGCACCGTCTGGTCAACCGGATCCCAGTTGACCGCAGCAGTATCACGATACACCAGACCCTTACGGTAGAGGCGAGTAAACAACCACTGCTCCCAGCGATAGTACTGCGGGTCGCAGGTCGCCAGTTCGCGCTGCCAGTCATAGCCAAAGCCGAGGCGTTGGAGCTGCTCGCGCATGGCCTCGATATTTTCCCTTGTCCATGCCGCAGGGGGAACGCCACGCTCCATTGCGGCATTCTCTGCCGGCAGACCGAAGGCGTCCCAGCCCATGGGCTGGAGCACGTTATACCCTTGCATGCGTTTGTAGCGGCTAACCACGTCGCCGATGGTATAGTTGCGGACATGGCCCATGTGCAGCCGGCCGCTTGGGTATGGGAACATGGACAAACAGTAGTACTTGGGGCGTGAATCGCCCTCAGTGGCACGGAAGGATTCTTTCTCCTGCCAATACTGCTGGGCTTGAGTCTCTATTTTCTGGGGTTGGTACTGCTCTTCCATTGCGCTCGCATTATTGATGTGGATTGATAGCATGACTAATTAGCCCGTCAGAGCCCCTCAAAAAGTCGCTGTGAACCACCCAGCCCTCCAGAGAGAGCCCGATAATCCCTTTAAGGATAACTCAGGGTGAGTTGCAACCGAAGAGCTGAACATAGAAACTGTTCATAGAAGGGGCACGATTAGATCAAGGAGAGGCCAATGAGCAACCCTGAAGCGGAGAATCAAGATCGCCTAAGCCAAGGTTATGAGCTACTACTAGATAGGGTCAGAAAGATATTCGCAGAAGCTGAGGAGCACGTCCCCTCCCTCAGCGAGGCGTTAAGCCATGCCAAACAAGAGTGCATTAACAATGGTGAGTTAACTCACGAAGAGGCTAACCAGGTAGAGCACTATCTGCGCCGAGATGCTGAAGAAGCCGGTGCTTGGCTCGCTCAAAGCAGTGATGATGATCCACATCTAGGTGACTGGCTGCGCATGGATCTGCAAATGCTTGAGAGCTGGCTATGGGAGGCCTTCTCATCCATCGCTGATCGCACCAAACTGGAGTTGCAAGGGTTCACTACCACCGGCGAACCAAGTGTATATCACAGTGGTGAGGTTGCTGGGCCAGGCACCTTGCAGTGCTTCTCCTGCGGCCGCGAACTGAGCTTTTCCCGCCCTGAGGCTATCCCGAAGTGCCCTGGCTGCGATAATGAGGAATTTATTCGCACCGGCTACGGCGATTTTGAAGATTAACCCAAAGTACTTACTTGGCCCTGCCATTATTAGAGCATCTGCATTACAGGGAACCTCTAAAACTTCGCCAGCGCCACTCATCCGCCCCGGTGTGGAGGTCTTGGCGCCAGGGATGGCGCCATGAAGCCTCCAAGGATGGATTCACGGCGTCCTCCACACCGGAGCGGATGAGTGGCGCTGGCGAAGTTTTAGAGGCACCCTACAGGGGCATTTGTTAGCTGCTGTATTCGACATCGCCGGTAAGCGAGATATTTTGATCAGCTGCTATGGTTATGCTGTTGCCCCAAATTTGCACAGAACCCTGGGCAGACACAGCTACTCCGGCTGAATTGGCGCCATTGGCAAAGCTCACCATCCCATTCAAGCTCGAGACATCCAAACTCTCTTGGGCCTGGAGCGAAAATCTTCCTTGATCCACCTGAACTTCAATCCCTTTGCCAGCTGAGCCCAACTCCAGATCATACCCTGCAGTTATAGCCATACCGGCCTTAGCATTAAATTCAAC
ATTGCCACCAGCTGAACAATTTAAGCTCTCGCTTGAGCCTACCACTGCGGATCCCCCAGCTCGAACACTGATATTGGCAGCAGAGTCGAAGCTTGCGCACCCATTGGCCACTTCTAGACGATCATTAGCTGCAACCGCCTCAGACCTCTGGCCTGCAGCGATCTCCTGTGCCTGCGCTTGGCAATCCAGCTCTATAGGCCCTTCGGCACAGGCGAGGCGGACCATATCGGCAGAGCGCGACTGGCCCAGCTTGATATCCAAAAGGCTCGCTAGGCCAGGACTGTGCAGAGATATCCGCGAACCATCAGCTGCACAACTGCCGGCTCCCTCGACCAGTAAACCTTGACCGGTGCGAGTCTGCACACCACCGTCATGACATTTGAAAGGTGGGGCCAACGGCTTTTCCCTTGCCTGCGGCAACACTCCGAGTATTACCGGCAAATCAGGATCGTGATCAAAGCAGGAGACCAAAACGCGGTTTCCCCCGCTCAGCGGTGTATACCACCCGGCACCATGTACTGCCGGGGAGGCAAAAGGCTGAAGGCGCTGCACTGGATCCGATATCGGCGCAAAGCCCTCTTCGGCGACCACTGCCTCCACTCTAGCCATTCCGGCTGCCGCAGTCTGAGCACGCCCTGCGCCGTGCCCAGAGTTGGCTACCTGAGCGCTCATCACCAGAGGGCGCACCTGCGTGGCAGGCAGTTCGGGACGATAACGCAGCAACTTGCTGCCGAGCAGTTCCGGCATGCTCTCTATTTCAACCCAGAACTCCTGACCATTAGAGCCCCGGCGCAGTCGATGACGGGCGGAGACTATCAAATGCCGACCTTCATAACGGCTCACAAAGTCATCTCCGCCGGCAGCTAGATCACTAACGTTGACACAAACACCAGCTCGTAAAGCTGAGCAATATCCCCAAGCAAAAAACCGATACTGCTGCCCAGTGCTACGCTCGGAATAGATGCGCTGCAAGCGTTCTATGTCAAAACCGGCATCTGTGTATCTATGTGCTTTGGCAGCCGAGACAAAGGGTTGTTCTACTAACTTTTCCCGGATGCGGTATCCGGATAAGCCATAACCGTGTTGATGACCAAGCGTATTGCCTGGCGCCACCTTTAACTGGTTATCGCAGGAATTTGGGCGCAATTGCAAGCCAGGTAATGCGCTAGAGTCATCGACAAATATAGGCCTCTCTGTAGGGCCATCTATCCAAGTGTAATAGATACCTTCTCGCGCCAGGATGCGTTGCAAGAGCGCATAATCGCTCTCTGCCGCCTGCAGAATAAAGCCCCGCTGACCAGTCGACGACCCTACACGCCAGTCAAAATCATCCTCGTCGTAACCTACCTTGGCTAGCAATTGAGCAACTAGTTCTCGCTTTTCTATCCGGGAATAGAGTCTTGTCGAGTGGTTCCAGTTGAGTTGATTAATACATGGCTCGATCCGACCTGTAAATCGCCTCTCGCCACTCACCGGACAGTCCCCTGACGACCAGCTGGTAACTACACCAGACACTTCGCGAGGAGCCCAACCGCCATTAGCCCAGCTTAATTTCGCAACATTCCCAACTATGAGATTGGATTGCCATAGGCGTGAGCACTCTCCATCAAGGGGGCCGGAAAAATCAAAACGAAACGGAGCAGAAAGCTTCTCTTCACCATCTATGGCAGTAATTTGCAACACGATACCTGCAACCTCGACCCGTGCCCATATTTTGCTCTGTTCTAAATTATCGCTCATCAGCTTATCCTTAGGAAAAATCAAGGGAATATCCAACGCAGTCGCTTTTGCGACGTGGAGCCTCTAAATACTCCCCAGTAACCATTTTGCTGCCCGGTGTGGACGCCGCGGATCCATCCCTGGAGGCTTCATGGCGCCATCTCTGACACCATGATCACCACATCTGGTGGCCTCCAAGACCCACCCTGGAGGCACCCAACTGCAACTCGATATTCCCGCTTAATTTTCGGTTATCTCGCTAGCCCGCAAGGAGATTTTTT
TGGCTTCCAGGGAGATGTTTTCCCCCTCAATCACCACATCACCATCGCCATCAATTCTTATGCCAAAATCGCTGCCTATTGCTATTTTGCCACCCCCATTACTGATGATTTCGATATCCCCATGCGAGCCAACGGCTAAGCATCCCGCCTCAGCAACCACCGTAGCATCACCATGACGAGTTTTAAGCAGCATCCCTTGACCGCTGCTCAAGTTCAGCGACTCTCCTGCATCAATCTGCGCGTTTGCGTTCTCAGCGCTACACCATAAATCGCCCTGTTCGGCACTTAAGCTTAGATTAGCCCCAGCGCTCCATCTTATCTCCCCTTCGTCAACCGCTATAGTCCAATCACCACCCACCTGGGTGGAACAATCGGCTGCAACCTCAACTAGCCGATTGCCGCCACAGGCGATGTTAAAGTTACCGCCACTGTTTATTTCTAGAGCCCCTTGCGCCGAACTTAAGCTTACTCGAGCATCACAATCTTGCCGACCAGCCTCCATGCGCAGCTCACCGGACTTTTGAGCAGCAGTTAAGGAGAGCGACTCAGCACCATCACGATCATCGAATACCAACTCATGACCTGCCGGTGTGCGCAGCACATGTTGGTCTGGATTGGCGGAATTGACCGGCAAAGGTTGATGTTGATTGCCCAGTGCACCGAGCATTATCGGCCTCTCAAGATCGCCATCAAGCCCCGCCACCGCGACCTTTGTATGGGGCAACATTGGCCAATGCCAACCGTAGCTTCTACCACCATAAGGCTGGAGCAAGCGCAGTGGTGGCGTTGCCTGGCCAGGCGGAGAGTCCCCCTGGTCCATAGCCAGACGTGCCCGGTAAGATCCTTCCTCATCAATATAAGCTTGACTAGGGTTAGAGCCTTCAAGCTCGGCTACCATTAGGCCGCTGGCAGCTAATCTGTTCTTGGGCAGATCGGCGTACTCAAGGGCGACAGATAACAGCCTCGCCCGGCAAATATAGCTGGCTCTTTCTTTTCCCGGACCCCCTCGAATAGCTGCCGCCTGATCCCCTTGGTGGTTGGCGCTGATAACAAAATAGGCACTATTTAAAGCGTGGTTAGGATGGTTGTTAATACGAACGATTTGGCCAGGCGTTAACCTGTTCTCGGTATCTATTTCTATGCCGCCGCGTTGGGCATCAAAGGCACGCTGATAGACCTGCGCAAGTTTCTCGTGGGACTCTCTGCCACCGCCACACTCACCAAAGCGCTCCACCTCACCCCATCTACTAGCCGGCTGCGACTGCACAAGGGGGTCAGTTAATGCGCTACTAGCCACGGCCGCCTGCGATTGGGCTGGAAATTGCGGGTCAAAGCCAGCTATTGCAACTTGCCTAGGCTTCAGTTCGGTCCAGCTGCGGAGGCCGAAAATAGTTCCAGGAGACCGCACCTGCCCGCTTTGCGGAACGTAATCGAGCTCCAAACTTGAACTGCTACTTCCCTGTAGAGAGTCACTGATCACCACTTCGGTAAGGCTTTGCCGATGTCTGAGCAACATAAATAGCCCAGCGCGCGAGAGTATCCGCTGGATAAACTCCAAGTCACTCTCCTGGCCCTGGAGCAGAATCGCCTGTTTCTCGTGCGAGCCCTGAACCTCGAGCGACACCGAGGTGTTTGCTGGCAGCAATTCAGTGAGAACCCCGTATGCTGCGTCCGCAGCGCTGATATTGCGCCATACCCGTTGATGGCGAGATAGCTCCAAAAGTGCCAACGGGGAGCGGATTTTTATGACAAGAGAGGTTAAAGCTGTTGGATGGGCAACATCTTCGACTGCGAATATAATACCATGAATGACATGCGGGTTAGGCTCTAGCGACAAAGCCAAGCTGACTGTAGCAGCCATCCAGTTTGAACGATCGAGCAAGTGATCGCTTGAAATGCGCACAGTGTAGCTATAGTCATCTCCTACCTGCTCCACACCCTCAACCGTCTCTACCTCAATGATTTCACTGTCAATTCCAGGCAAACCAATTCTGAACAAG
CGATTTGGGTGGCGTTTGGTTAGCAAAGCAAATAATGACGGCTGCAGACTGGCAACCTGCTGTTGAGCCGAAGGGTTAGCTGAAGTAATCCGCTCCATGGATTCCATTGTCCAATTCCTATTCAGTGGTTATTGAAAAGAGCAAAGAGCTTTCTCGCTCCGCGTTGCCCGCAAGCCAGGTTGAGCATCCTAGACGCCACTTATCTGCGCCCATTACCCATGTGTCGCGTTCACCAGGGGCGATTAAAAGCTCGATTTGGGCCTGATTACGCTGTCCTAGATAAAGTCCAACTAATCTTTTTAATGCTTCACCTCTTGCACTGCCCGGGCTTAACTCAACAGCATCAAGGTGTTGTAACGGCCCTATGACTATCTTCGTCATCCCTCCCGCGACAACCGTCTGCCCGCCTAGGCAAAAATCCTCGCCCACTTTAAGGCCAGTGCTACCTCCCAAGCACCCCTGCTCTGTGGGTTCGGCGCTGTCTACTCCGACCCATGTATGAACTCTATCTGATACTGCCACTGCCATTGTTCCACTGACCCTTTGCACGAGGCTCTGCAGGGCATATTTGCCAGACCGGAGTCGGCAATAAAGATCGGGAACAAGCCCCAGTCCATCATGCGACCGTGAAGTAACCGTTAAACTGCGCCAGGAATCTACCCAACCATGAGGATGCGCTATCTGCTCAACATGCCGCCAACAATCGACTAGAACTGAATATACAGAACCGTTGATTAAGTTAAGCAGCGCCCGCAAACTCTCGCCACTGCGATCTTCAGCTGCAGCCCTTTCAATCCAGTAGTGCGGTAAAGGCGAACCACCGCCGGCTAGCGCTAAGACATTGGCCTCAATTATCCAAAGCCCGTCCTCATCTATCCAACAGCGCCTAATCTCACCTGCGGGAAAACTCAGTTCACTGCTAGGCCGGATAACATAGCGCAGCGAATCTTCATCGTTGAAGCTCTCCAGCAGCATAACTGCTTGCCGTAACGAGAAATCATAACCTCTAGAAACGAGTCGCTGCAGGGCTAAAGGCAATTTGTCCATTGCTCTTCCTCACCGCTAGGCTCAAATACCAAAACCAGCTCAACAAATCGATCTAGCGGTGCCCATTCAAGCAAGACGCCGTGAATAAGCTCAGCCAACAGGCAGGCCTCCCCTCTATCTGGAATATGTCTAGTATCGATAGGCATCCTTACCTTGATGCCCTGCTCAATGACACCCTTGCGCCAGCGAGTAGTGCTTTGCCAATCGATATCACCTATTGCCCGCACCGCAACCGGTACTTGGCTGTGAGCAGCGTTGGCAACAGCAAACAGCAAGGTGCGCATAAACTGTTTATCGGCAAGCCTCTCTATGTCGCAGCGCAACAGGCAATTTAAGAGCTGAGTCTGCGCCGTCTCTACCGGGGCTGGCCTGAACGCACGCGGCCTATGCAAATTATTTACCGTAACGCCTTCTGGTACCTCACCTGAGCAGACATCAATGTTGCCTTTCGACAGATGTAAGCGCGGCAGGTCGCCGTTTGAGTAGAGTACATCGATAGATAGAGTCTGCTCTTGTTTGGCGCTGGCCGGATCGAGACGTATTTCTGGATCGCCCCGGCCGGCGCTCCCCTTATCCGGCCTTACAACTCGGTATAACCCTCCTGTGATGCCATCAACTCCCCATGTTGAAAGGGCCGAGTAGGATTGGCTTTCTCCACTATGGGTACTCCTCCCCTCTACCTGCAAAACCCGATGTACAACCCGACTGTCTGGCCCGTCTGTTTGACTTCTAACAAGACACTCATCTGCACCGGCCTGCAAACGCACAGGATCGGCCCAGCCCCTATTTAGATTAATCGCAGGCACACAGCCGTAGCGCAACACGCTCCGGGATGATTGTTCAACTCCGGGGGCAGGAAACGCCAGGTGGAATATAATATCTAGGGAGTTTAGTTGATCCGGCAGGTCTAATTGATCCAACCCACACAAATTAACAAAAAGCAGTTGCTCGCG
GGCCAAAAAATATGCCTGTAACCTTTCAAGGGCTTGACTGCCGGGCAAAGAGACCCCGAACAAATCGCTGCATGACAATGGGAAAGCCGACTCAAAATAAATACTATCTTTTAATCGCAACACGCTGGCGGATAACTCTGCACCTGATAAGCGCACTTCAACATCGTCCACTCTCCTGAGCAAGGCTTCACGCATGGCATAGCCCTGCCCTGACTCATCATCAATCCACAGCAACAACGGTGAAAGGGCATGCTTGTCCCTGAAAGCAGCGGCGCTACCGGTTAGTTTGAACCGCAGCTGACTGCCACCGCCGTAGTGAGAGACCCATTGGAAATCCTCAAGCTTGAATGCCCGCAAAGCAATAGAACGCTGCGCGGCAAATGGCAGGCTTAGCGACTCATTGCCAACTGGCTGGGAATAAAAATGGGTCCCCGCAGCTATATCAGCTCCCTCAATTCTATTTTCATCCAGACCCAATTCAAGTGTTACCGTACTAGGATAGGGGTCTATAAGCTGCGGACAGACAGACTCCAGGAGTTGCTCATGTAGTCCTGCAATAGAGTGCTTAAGCTCTTGGCGCACCCCAGCAGTAAGATAAGCAACCCCCTCAAGCAGTCTTTCTACATAAGGGTCGCGATCGTTTACTTGGGATAAACTGAGCATTTTACCCTGCTCAGGGAAGCAACGCGCAAACTCCTCACCTCCTTCATTTAAGCGCCTCATCTCACCTTGAAAAGCGCGGAGCATGGCCATGCTGTTCTACCTCCATAAAGAAATAGTCCCATCACCTTGAACTGCAAAGCACATTTCTAAATCAAGGGCAGCCTCGCCTAAACCTTTCAACTTAGCATTTACTATTAATGCCAATACGTGGCTACCCTGCTGACTGGGAGGCTGCACTTTAATATTTGCTGAAATTACTCTGGGCTCAAAATCCAAAATATATTTTTTTATATCGGCAGCTAAGCGAGCTGATTGATACTCAACATCAGAGCCAAATGCAGATAAGTCCTTAACGCCGTAGCCTGGAATGTGAGACAGGCACCCTTTGCGCGCAGAAAGAAGCAATCTAATATGTTCTGCAACGCTCAAACGAAGCGATTCACACGATTTAAAATTATGCCAATTAATTTGCTGTCCAGAACATCCCTGTAAAGAGACTCTTCCTTGAACGATATCGAGCAATCCTGCCCCAGGCAT
Protein sequences of DBSCAN-SWA_4 >NZ_AP017372|2011382:2058504|2014512_2015136_-|WP_096409880.1|DBSCAN-SWA MSNEAPDPSRRRFLTGAATVVGGVGAVFATVPFLGFLKPNVEAQAVGAPIEVDVSKLADGQRLEFEWKGSPVWVISRTQEHLDNLEDEDILRGRLRDPDSEAEQQPEYARNLYRSVSPEVLVVSPVCTHLGCIVVYHPEVEAKPFDDDWRGGFFCACHGAMYDMSGRVYSGNPAPKNLEIPPHRFTDDDKLVIGEDPEDPEEEVEAT >NZ_AP017372|2011382:2058504|2012543_2013275_-|WP_096409878.1|DBSCAN-SWA MSKKRLGYVLGALLAVSFAGAAQGGIDMKDPNVSHHDRESIRKGAEHFAQYCMGCHSVEYLRYDRVAEDTGMGEEWIEDKLIFDDDIEHHEQMISPMDPEDGENWFGIEPPDLSMTTRVHGEDWVYTFLHAYFKDEDAAVGFDNWIQEGTSMPHVLAHLQGTPKPVHDDDGNLVDIEVSGDGEMSRREYEAMTAELTAFLAYAAEPIRADRERMGIWVILFLLVMTTVFYLLYKEYWRELKKQ >NZ_AP017372|2011382:2058504|2058063_2058504_-|WP_096409915.1|plate|DBSCAN-SWA MPGAGLLDIVQGRVSLQGCSGQQINWHNFKSCESLRLSVAEHIRLLLSARKGCLSHIPGYGVKDLSAFGSDVEYQSARLAADIKKYILDFEPRVISANIKVQPPSQQGSHVLALIVNAKLKGLGEAALDLEMCFAVQGDGTISLWR >NZ_AP017372|2011382:2058504|2051013_2053044_-|WP_096409911.1|DBSCAN-SWA MSDNLEQSKIWARVEVAGIVLQITAIDGEEKLSAPFRFDFSGPLDGECSRLWQSNLIVGNVAKLSWANGGWAPREVSGVVTSWSSGDCPVSGERRFTGRIEPCINQLNWNHSTRLYSRIEKRELVAQLLAKVGYDEDDFDWRVGSSTGQRGFILQAAESDYALLQRILAREGIYYTWIDGPTERPIFVDDSSALPGLQLRPNSCDNQLKVAPGNTLGHQHGYGLSGYRIREKLVEQPFVSAAKAHRYTDAGFDIERLQRIYSERSTGQQYRFFAWGYCSALRAGVCVNVSDLAAGGDDFVSRYEGRHLIVSARHRLRRGSNGQEFWVEIESMPELLGSKLLRYRPELPATQVRPLVMSAQVANSGHGAGRAQTAAAGMARVEAVVAEEGFAPISDPVQRLQPFASPAVHGAGWYTPLSGGNRVLVSCFDHDPDLPVILGVLPQAREKPLAPPFKCHDGGVQTRTGQGLLVEGAGSCAADGSRISLHSPGLASLLDIKLGQSRSADMVRLACAEGPIELDCQAQAQEIAAGQRSEAVAANDRLEVANGCASFDSAANISVRAGGSAVVGSSESLNCSAGGNVEFNAKAGMAITAGYDLELGSAGKGIEVQVDQGRFSLQAQESLDVSSLNGMVSFANGANSAGVAVSAQGSVQIWGNSITIAADQNISLTGDVEYSS >NZ_AP017372|2011382:2058504|2027665_2028178_+|WP_096409892.1|DBSCAN-SWA MRLNKASKYLYTALILCHCALSLASDSQPRPPIEVEADRVDMDAKSGISIYQGNAVMTHGEMRISGDRIEIHTTEEGELKHLIVIGQPARYRDLPEGEEQEVRAKASRMEYYVSGPERALFFGDALFWQGQDSLSGDSVFVNLETQAVKAQGNDDQRARAIIYPGQREER >NZ_AP017372|2011382:2058504|2024459_2025281_-|WP_096410401.1|DBSCAN-SWA 
MAVDEANDIFAKVRGLKVRRGSRWIFDGVDVDIRSGKVTAIMGPSGSGKTTMLRVLGGQLAPTEGLVEVAGVEVPKLKRRELYALRRRMGVLFQSGALFTHLNVFENVAFPVREHTRLPEPLLRTLVLLKLEAVGLRGAQNLMPSELSGGMARRVAMARAIALDPVLVMYDEPFAGQDPITMGVLVDLIRRLNDILGLTSVLVSHDVEETLAIADYVYLLAEGRLIAEGTPEQLRRSEYPAVRQFIDARADGPVPFHYPGPSFAEEIHKELQA >NZ_AP017372|2011382:2058504|2032796_2033186_+|WP_096409898.1|DBSCAN-SWA MSVGLLLITHQRIGQELLQIASQTLGVCPLQTKALGVYFEDEPHKMAHKAEEMIKDLDSGSGVLILSDAYGSTPANIAVSAAKNKTTRLVTGVNLPMLLRIFNYHESSLDELTEAAFKGGRDGIVSPTT >NZ_AP017372|2011382:2058504|2031945_2032800_+|WP_096410403.1|DBSCAN-SWA MRLIVVSGLSGSGKSVALNTLEDAGFYCIDNLPVNLIADFALFAKKNHQRIGSKFAVGIDARNPKEDLVNLPQTIEYLQQTGITTEAIFLYADDYILMRRYSETRRRHPLAQDGRPLSDSLKDEKAVLEPLRERADWTIDTSRTSVHDLRSLVYERLGGDRPTLAILVQSFGFKHGIPTDADYVFDARCLPNPHWVPELRASTGQHVGVCDFLENHPETETLYNQILGIIAYWLPTYKESGRSYLTVAIGCTGGQHRSVYLAERLANDLAEQCSRLTLRHRELP >NZ_AP017372|2011382:2058504|2022124_2022463_-|WP_096409886.1|DBSCAN-SWA MVCEQLECDTQSGCIKLYGALDLDSVANVWHQGEELVRSGEIGLIDLSAVERTDSAGVALLVEWSRAAREAGVSLAFRDAPQQMRAIIRASGVEDVLRMVARGGEGFEERMD >NZ_AP017372|2011382:2058504|2020201_2020849_-|WP_096409884.1|DBSCAN-SWA MRQTLTLALSKGRILDETLPLLATCGIEPVESPDSSRKLIIETNRDDLRLLIVRASDVPTYVEHGGADLGVAGRDVLLESRGESIYDPVDLGIARCRLMVAGPTGSSLDIPRPRVATKFVELARQHFAAQGRQAEIIKLYGSMELAPLVGMADLIVDLVDTGNTLRANGLEPLETITEISSRLVVNKASLKMEHQRIGALIEDLATAADAAADNK >NZ_AP017372|2011382:2058504|2023213_2023669_-|WP_096409888.1|DBSCAN-SWA MQQRLVEIWVGLFVVVALAAMLFLSLQVSGLVEIRKSPGYQVSATFDNVAGLRERAQVQMAGVKIGRVETIRLDPQTHQARVVLNIDQRYDSIPADSRAAIYTSGLLGEQYIGIDPGWEDETLQPGDQFTETQSAVVLERLIGQFISSMGD >NZ_AP017372|2011382:2058504|2055421_2056351_-|WP_096409913.1|plate|DBSCAN-SWA MDKLPLALQRLVSRGYDFSLRQAVMLLESFNDEDSLRYVIRPSSELSFPAGEIRRCWIDEDGLWIIEANVLALAGGGSPLPHYWIERAAAEDRSGESLRALLNLINGSVYSVLVDCWRHVEQIAHPHGWVDSWRSLTVTSRSHDGLGLVPDLYCRLRSGKYALQSLVQRVSGTMAVAVSDRVHTWVGVDSAEPTEQGCLGGSTGLKVGEDFCLGGQTVVAGGMTKIVIGPLQHLDAVELSPGSARGEALKRLVGLYLGQRNQAQIELLIAPGERDTWVMGADKWRLGCSTWLAGNAERESSLLFSITTE >NZ_AP017372|2011382:2058504|2043995_2044643_-|WP_170113012.1|DBSCAN-SWA 
MGILGGTFDPIHYGHLRPAEEVREALQLSELRFMPARIPPHRARPRLSPQVRAELVELAIADNPAAVVDTRELHREGPSYTVDTLVQIWAEQREKGAQTSICLILGYDTFLGLPGWSRWRQLFDLAHIVVAERPGVTGQMPAELAAVLAERHISDPALLHNQAFGGIYVQPVTPLPISATGIRRALANCKSVRYLLPERVRERIVAESLYGYRQL >NZ_AP017372|2011382:2058504|2034829_2036275_-|WP_096409900.1|protease|DBSCAN-SWA MGTDLLQQAQRQLLDPAELDQSHLERVVARLMRPGVDFGDLYFQLARREAWVLEDGIVREGTRSLDRGVGVRAISGEKTGFSYCDEIDLNALLESSEAAGAVAHSGNNSTPMAFSPRQVNPLYNCDDPLASLSADDKVALLHRAENAARSVDPRIVHVVASLAGSHETILVVDSAGGWAADVRPMVRINVSVLAASGERRESGVSGGGGRQTYDMFVTPDKVESYAREAARQALINLEAADAPAGTLPVVLGPGWPGVLLHEAVGHGLEGDFNRKGTSAFSGRIGESVASSLCTVVDDGTLPGRRGSLTIDDEGHPTQYNTLIENGVLVGYMQDRLNAGLMNSQRTGNARRESYAHLPMPRMTNTYMLPGQESPEEIISSVDRGIYAVHFGGGQVDITSGKFVFSASEAYMIENGKISHPVKGATLIGSGPEVLNRVSMVGNDLELDGGIGVCGKEGQSVPVGVGQPTLKVDGITVGGTQA >NZ_AP017372|2011382:2058504|2044682_2045960_-|WP_096409907.1|DBSCAN-SWA MTTDSTANIAKQVEAIGKRARAAGREVARSTTAARNAALRGMAEQIEANAAAIAEANKEDLGAARQAGLDDALIDRMELTSARIQTMADGLREIAALPDPVGAVREVASRPSGIQVGRMRVPLGVIGIIYESRPNVTADAAGLCIKSGNAAILRGGSEAIKSNQAIGQCIRSALVQAGLPEDGAQVIATTDRDAVGALIQMPEYVDVIVPRGGKSLVERISNEARIPVIKHLDGVCHVYIDSAADIDKAVAIAVNSKVQRLGTCNTMETLLVAADVAERVLPQIAAQLRQADIELRGCPRVCAMVPEAIAASEEDWGTEYLGPILSITIVDDFDAALDHIQRYGSGHTESIVTESYPMAQRFLREVDSSSVMVNASTRFADGYEYGLGAEIGISTDKLHARGPVGLEGLTTEKFIVIGDGHIRGG >NZ_AP017372|2011382:2058504|2025506_2026541_+|WP_096409890.1|DBSCAN-SWA MNNTLYQSKSEHNDQELAEHQETNKRLQRLGQEVLKLEAKAVNSLAERIGDNFSRACRHILSCTGRVIVTGMGKSGHIGSKIAATLASTGTPAFFVHPGEASHGDLGMITGADVVLALSNSGETHELNAILPRLRRLGVPLISLTGNPDSTLAREATVHIDIQVEKEACPLGLAPTSSTTASLAMGDALAIALLDARGFTAEDFARSHPGGRLGRRLLLHIEDVMQTGDKVPQVTPGTLLREALLEISHKGLGMTAVVDSTQQVLGIFTDGDLRRALDQGIDVHTTEIATVMTQAPQTAPPHLLAAEAAERMERHRINALLITAEDGRLIGALNMHDLLQAGVV >NZ_AP017372|2011382:2058504|2037365_2040890_-|WP_096409901.1|DBSCAN-SWA 
MPESAQHKVIAVLSLVGKGLSALLLVLTATVLAVGVVLRSVAWLQLPLEEPLEHLLTKALEHPVTIDSARLSWDGVWPALDLYGLRLGEGDDRLEVGSLGIEVAKPPWYWAEQGPYGLDIYISDARVIVFEDEEKGWQLRGLPELAEGEEPIRLPANIYISAAEVQVSPAQQSPLAVSGIDVRLVQQGRQLKAGARVAGEQANLAKGISLALRASLDKSTAPEVFVGLDSVDVEPFLAIFDVGGISGNLNGGLWPVIEFDDDVGLSQRIRGVYGNLSVASSGIEPRNGLPGVGAVSAAASFQGRRFVAQAQLLDESIYLPEIFADPVPVNQARARFRGGLFFDGAWWVSLERGVFDSEVISKNVRGRLDYFPGQGPELLLYTSLIRPLQVTELVENIPPKVIPEPVYNWIEMAFLGGRLDGFEVLFQGNPADYPFTDNDGVFLAKSRFRDADLLYQEDWPLISEMDVDVNLDNLSLTVDGHRGVTCGAELYDVELKIDDLSKTVIDLYGRARGDAQDGICFLTESPIGRELLDDPPIPFAVAGPLKMDLDLWLPLTDAVEGGTTYRTGIDFHGVDFDFEQYLQLGDVKGGVEVDNQGLRDLELDGIWGGEEFSASAGLKTHDGRKRTVGEVTAHGSPYSLTGLIDADHLPWLEGKSAWNLRFIAPTFEPASIDPQLELSLNSSLQGLAIDMPQPFGLSKQEARELQLDATFTDVGLESYALHYADVLDLLAFAPEGTVPQALGIRLGGGAAELPNAGITIDGRLPVLDVDTWRIWLADNVGGGLADVGGDADGQTAEVDWHDLLPVNLNLTLGNLLFSGRNYGVQRIAARAARDGRGWFDLEGDLATGEAFWRDSGAHIRVNLDHLDVPLQGRTRVPDPAPKEVEPQVRDFPRLTPEDAAALPIFAGRIESLRLDGRHAGLAKLSLQPGNGTDYLAQGKVTLRGKALDLQLQSHWQEDCNEAQLPQCTDFEAVMLSADVARLLGLFGAPAAVVRAGVELDAKGQWPGSPLDFDLASLSGDLSLQMSDGRITEINPGAGRLVGLFGLRMLPRRVLLDFGDLFGDGLAFESVEGEFSLLGDGMARIDKMHIDSSAARVDMSGAVDYVARYYDAQVAVMPRIAATLPIFGLVFGGGAGGALGVMADWIIGGPMDRAASFVYRVTGPWNEPQVERVRP >NZ_AP017372|2011382:2058504|2030447_2030795_+|WP_096409895.1|DBSCAN-SWA MQIKLSGHHIEITDSLRDYVNDKMSRLQRHFDNLIDTEVILSVEKLQQKAEANIQLGSGGGRVFANAVSNDMYAAIDALIDKLDRQIKKQKDKAVDKKRQGTSLKSSQHEDSLGT >NZ_AP017372|2011382:2058504|2036314_2037097_-|WP_096410405.1|DBSCAN-SWA MASGPHVDANLQEAGRLISKACRAGASLVALPENFAFMGYTESDKLKVAEPDGVGVIQDFLSAQASSHGIFLVGGTVPLSTASGTKIRPAVPVYGPDGKRWGCYDKIHLFDVEVAPGEAYRESEVQQGGHEPLLLDTPLGKIGLGVCYDLRFPELFRELAKQGMELLVLPSAFTAVTGEAHWNILVRTRAVENLCYVVAPDQGGYHVNGRETHGESMVVDPWGGVIDSLPSGSGIALGEVDLQRLHEIRSRFPALGHRKL >NZ_AP017372|2011382:2058504|2015296_2016055_-|WP_096409881.1|DBSCAN-SWA 
MLHRDQLEAYLNDYLSVESISDYAPNGLQVEGREEIGRLVTGVSACASLLEKAEAAKADAILVHHGYFWKGEENRIVGYKARRLRRLLCSGMNLFAYHLPLDVHLEVGNNAALGDMLELTNESRYSIKGIPDLLWFGVPGQELSANQLSQRLESVLGQAPLHVGSGPGLIRRIAWCSGGAERLIEQACAYGADAFISGEISEPIPHIAREAGIHYFAAGHHATERGGVQRLGEHIAAKFDIDHQYIEIANPA >NZ_AP017372|2011382:2058504|2016319_2017573_+|WP_096410400.1|DBSCAN-SWA MTLPGSKQKPIQPWPYTHYGEWIRFVLGYATLGIAIAIALVWLNPGWLYSVIPQDDSARHEGFNRDMPVETDRDIAARPAQLGQPVSYAESVARAAPAVVNIYSAPSETEQFTPPGYRHPLLERFFEQPGHPPRLPRHQANLGSGIIISENGYIVTNHHVIKQAESIKVVLPDKREARATVIGEDPETDLALLSIDLQELPVISFGDESDVRVGDVVLAIGNPFGVGQTVTKGIISATGRDQLGLSTFESFLQTDAAINVGNSGGALIDAHGRLIGINTALFDRGGGGSHGIGFAIPASMVQSVISDFFEHGQVVRGWLGVKTQRLTPPLARSFDLDESKGVVVTEISPGGPVEGGTLKTGDVLTKIDGTQIESVQDFLRATGRSPPGTRVEVSGYRDGDPFNKKIILGKNPKSDAR >NZ_AP017372|2011382:2058504|2026546_2027086_+|WP_096410402.1|DBSCAN-SWA MSTTKLAYCTPPSEAILSLAANVKTAIFDVDGVLTDGTVYVGEDAKQMLAFHIHDGKGLRMLIEAGINVAWATARRGDAVLARAHELGVELVMDGCRNKAQAVHQVAAQFGHGPSACSYLGDDIIDIAAIEVVGLGAAVADAHPQVISCAAWTTQHSGGRGAARELAELLLLAQNKLNY >NZ_AP017372|2011382:2058504|2056332_2058057_-|WP_096409914.1|plate|DBSCAN-SWA MAMLRAFQGEMRRLNEGGEEFARCFPEQGKMLSLSQVNDRDPYVERLLEGVAYLTAGVRQELKHSIAGLHEQLLESVCPQLIDPYPSTVTLELGLDENRIEGADIAAGTHFYSQPVGNESLSLPFAAQRSIALRAFKLEDFQWVSHYGGGSQLRFKLTGSAAAFRDKHALSPLLLWIDDESGQGYAMREALLRRVDDVEVRLSGAELSASVLRLKDSIYFESAFPLSCSDLFGVSLPGSQALERLQAYFLAREQLLFVNLCGLDQLDLPDQLNSLDIIFHLAFPAPGVEQSSRSVLRYGCVPAINLNRGWADPVRLQAGADECLVRSQTDGPDSRVVHRVLQVEGRSTHSGESQSYSALSTWGVDGITGGLYRVVRPDKGSAGRGDPEIRLDPASAKQEQTLSIDVLYSNGDLPRLHLSKGNIDVCSGEVPEGVTVNNLHRPRAFRPAPVETAQTQLLNCLLRCDIERLADKQFMRTLLFAVANAAHSQVPVAVRAIGDIDWQSTTRWRKGVIEQGIKVRMPIDTRHIPDRGEACLLAELIHGVLLEWAPLDRFVELVLVFEPSGEEEQWTNCL >NZ_AP017372|2011382:2058504|2043001_2043463_-|WP_096409904.1|DBSCAN-SWA MRLDLIAVGTKPPKWVREGYNEYAERLGRGWRLSLTEVAAARSNNAAQVAEREADGLLRAVKDGVPCIAFDERGEALSTRQWAEHFDIWSHHGGRAALLIGGAGGLAPAVLQRADRRWSLSPLTLPHMLVRVLVAEQVYRAWTLLSGHPYHRG >NZ_AP017372|2011382:2058504|2042348_2042936_-|WP_096409903.1|DBSCAN-SWA 
MDQLITLASRSPRRKELLELIGIKYHSIMVVVDETPDPNEGAEMYVLRMALEKARRGYAVAEGGAPALGADTAVVLDDQILGKPKDKTHARDMLLSLSGHTHRVVTGVALADDREATRLSVSHVTLRTITEEEVERYWASGEPADKAGAYAIQGLGGVFVEHLEGSYTGVMGLPLYETTLLFDEFALPWREGWSG >NZ_AP017372|2011382:2058504|2020864_2022121_-|WP_096409885.1|DBSCAN-SWA MERLLIRGGHQLDGEIRISGAKNAALPVMAATLLAEGPMKVGNIPHLHDVTTTMELLGRMGVELTVHEGMEVEVDTSSIKEFKAPYELVKTMRASILVLGPILARFGHAEVALPGGCAIGSRPVDVHISGLRAMGADLEIEEGYIKGTAQRLRGAHIHMDICTVTGTENLMMAAALARGTTILENAAREPEVVDLANCINAMGGCIRGAGTTTLVIEGVDSLTGVSHRVLPDRIETGTYLTAVAMTQGRVKLKDTDPSLLEAVIAKLRQSGAEIEVGEDWIELVMSQRPSAVDIETAPFPGFPTDMQAQLCALNAISTGSGRVIETVFENRFMHCLEMQRMGANIRIDGSSATITGVERLKAAPVMATDLRASASLVLAGLVAEGETPVDRIYHIDRGYECIEEKLAQLGADIQRVPD >NZ_AP017372|2011382:2058504|2022462_2023086_-|WP_096409887.1|DBSCAN-SWA MPMISKSLRTRVGLLVVLALFTLAFVWSAAADAKQHPREIVDEVSSKVVALLDEYGDELKDDPVRTYEKFAPIIEPHVDFERIGARMLGPHWRNADAETRERFINEFKRSLVRTYASSMEEYKDLDFRILGSRQRDAGIQVGVEVEGGSRVLFRFIDSSDAWKLEDVTFEGVSVVQNYREDFRSRLRDKDLPEVVEDMRARNQEIGF >NZ_AP017372|2011382:2058504|2031289_2031955_+|WP_162549471.1|DBSCAN-SWA MSLSKSNDGTIIPGVLMHINGRGVLLRGPSGVGKSDCALAMIQRGHLLVADDAVLIKQQDSQLIGSCPQIGFGLIHLRDLGIVDIREIYGAQSLSRSSRIDLQVTLTRNAGSNCPISLIKGRRHSANYCGTIIPALKLTAVEQRPLAELIEIAVAQLNSTSEKLSHQDIPNRITYQESLVNTDSPPFDGAGEDQNTAYTPINNRTSRLSRAAEEVAWPKSG >NZ_AP017372|2011382:2058504|2023686_2024463_-|WP_096409889.1|DBSCAN-SWA MISLLQALGNKVITGLRGVGEGVFLFLRLMRSLSIGVRRPLLVIDQVYKLGVLSLVIIVVSALFIGMVLGLQGYYTLSDFGAAEQVGVMVALTLVRELGPVVTALLFAGRAGSALTAEVGLMRATEQLSSMEMMAIDPERRILVPRFLGVLIAVPLLATIFSAVGIIGGYLVSVGMVGIDGGAFWSNMQNAVSVRDDILNGLIKSVVFAVVVGWVALHKGYMAVPTSEGVSRATTLSVVHASLIVLGLDFLLTALMFG >NZ_AP017372|2011382:2058504|2027115_2027688_+|WP_096409891.1|DBSCAN-SWA MAQRILFPIAVVVLTILVGIFLTHDAPEEELPAEPEEVERADYFLAEFAIHHHDEQGKMRSILRGKQGEHFPVSQTIQIDQPHWQAESTDGSIWFANSPFGSFKRDTQVIILHEDVDIRRQADEQRPPLDIVTRELFINIDNHQAMTEERVVATDPWGSAEGIGMTLDYYRDRLQLHNQAKGRYETKQGE >NZ_AP017372|2011382:2058504|2017766_2018864_-|WP_096409882.1|DBSCAN-SWA 
MSELSAQVRRWVRPEIQDLVGYKVADPGDAIKLDAMESPWPWPGDLRDEWLNRLSEVALNRYPDPNCGDLKKVVREVFEVPACAELLLGNGSDELIQLINLAIAGHGRTVMAPAPSFAMYRVIATLSGSEFVEIDLADDYGLDLAAMQAAVGKHQPAVTYVAHPNNPTGNGLDLDEISSLALSTEGVVVVDEAYYAYADSSFLPRLLDHPNVLLLRTLSKVGLAGLRVGVLIAHPEWVEQLEKCRLPYNVGVLAQVSAKFALEHRQQLDEAIKRVLGERSRLAEELRHMPVVERVLPSETNFITFRLKAIPASRAYKHLLAEGVLVKSLDGSHPRLSNCLRVTVGTPQENDLFLGALSGLSAGDY >NZ_AP017372|2011382:2058504|2050262_2050799_+|WP_096409910.1|DBSCAN-SWA MSNPEAENQDRLSQGYELLLDRVRKIFAEAEEHVPSLSEALSHAKQECINNGELTHEEANQVEHYLRRDAEEAGAWLAQSSDDDPHLGDWLRMDLQMLESWLWEAFSSIADRTKLELQGFTTTGEPSVYHSGEVAGPGTLQCFSCGRELSFSRPEAIPKCPGCDNEEFIRTGYGDFED >NZ_AP017372|2011382:2058504|2043623_2043974_-|WP_096409905.1|DBSCAN-SWA MHLQELENIVRQALEDIKAEQTVVIDVRERTPITDLVVVTTGNSGRHIQSIARHVVDAAKKQGLSNLAIEGDQAGSEWVLIDLGDVVLHVMTAESRDFYRLERMWEMDDEMQVGSG >NZ_AP017372|2011382:2058504|2033480_2034767_-|WP_096410404.1|protease|DBSCAN-SWA MALERANESGADAAEVSLGAGLGLSVNVRKGEIDTLEHHRDRSLGISVYFGGRKGSASAGELSDRAVTESVAAACDIARFTSEDPANGLAPSELMARDIPDLDLYHPWSISADDAIELARSCEAAALNSDPRISNTEGAGVSAHQGVHTYANSNGFMASVPESRHGINCVAVAGEGSGMQRDYWYTVARNPADLEAAEQVGLKAAEHAVKRLGAEPVSTERVPVLFEAPVARGLVGHLVGALRGSALYRRASFLVDCQGEQIFPEWMQMVERPHIPRALGSAAFDDEGVATTDSPLISDGTLQRYVLDTYSARRLGLETTANAGGVHNLEVVSGRDDFGALVRNLERGLIVTELMGQGVNLVTGDYSRGAAGFWVENGAIRRPVQEVTIAANLRDIFAGIRAVGNDMDCRGNIRTGSILVDGMMVAGE >NZ_AP017372|2011382:2058504|2018860_2020177_-|WP_096409883.1|DBSCAN-SWA MVDIKRLSTTQPDFWESLEQLVGWESHPGAELEQRVAEVVEGVRTGGDKALLEYTRTFDGLDAGSVAELEVSRERIEAAAQEICPESRSALEHAATRIRAYATRQRIDSWEYEDEDGNLLGQQVTAIDRVGVYVPGGKAAYPSSVLMNAVPARVAGVEEIVMTVPAPNGELSSLVLAAAKIAGVDRIFTLGGAQAVAALAYGTQTVPSVDKIVGPGNAYVAEAKRRVYGVVGIDMIAGPSEVLIISDGQADPEWIAMDLFSQAEHDEHAQALLVCPDFVYLDQVQAAMERMLPELERSDIVRESLAKRGALICVRDLAEAQDVANRVAPEHLELSVAEPQRLVGGIRHAGAIFLGHHTAESLGDYCAGPNHTLPTSRTARFASPLGVYDFQKRSTTLFCSPEGAAKLAESAAVIAHGEGLGAHAKAAEFRKAGHGEGN >NZ_AP017372|2011382:2058504|2013271_2014516_-|WP_096409879.1|DBSCAN-SWA 
MSEQSASRLSELQAWVDKRFPLTSTWRYHMTEYYVPKNINFWYLFGSLSILVLVIQIITGIWLTMYYQPSVDRAFASVEFIMRDVEWGWLMRYLHTVGASAFFVVVYLHMFRCLMYGSYKKPRELLWIIGVLIYLVMMTEAFMGYALPWGQLSYWATQVIVALFGVVPFIGDDLVVWLQGGFLVNEATLGRLFSLHVIGLPLVLILLVFLHIVALHHVGSSSPDGVEIKENKDKNGKPVDGIAFHPYVTVKDMVGMGIFLIIFCAIVFFAPEFFGYFIEAPNLEEANPLQTPETIHPIWYFTAWYAMLQCVPNEALGVVVMFAGSMILLFLPWIDRSEVRSIRYRGRGFKIALAVFVASFLILTWAGMQEPSGLPMLISRIFTVVYFGFFVFLWVYTYFGFEQTKPVPERVTDK >NZ_AP017372|2011382:2058504|2046065_2047073_-|WP_096409908.1|DBSCAN-SWA MPETPEAFLRRLERQQALPPVCLIAGEEPLLQREASDAWREAARQSGFSEREVFDVDSNFDWGRLNEAAASMSIFGDRRLLELRMPNGKPGNAGSRAIQVYCSDPPDDALLLVTSNRLDKGARESAWVKAIEQAGVFVYCWPLPLDKMSQWVRQRLERAGLKPDSEAVELIVERAEGNLLAVDQAVERLLLLNGKGPVGADNAALSLADSARYSVDDLADAALNGDLPRAHRILEILEQEGTGQPLILWALARDIRAAARLSAGADDKILYQEKIFKRRSGLLKATARRQPTHTWHRLLRRCHEVDCAIKGVPGSNPQEALRALVSRLARAASRR >NZ_AP017372|2011382:2058504|2028962_2030435_+|WP_096409894.1|DBSCAN-SWA MKQTLQLRVGQQLAMTPQLQQAIRLLQLSSLELHNEIQQALESNLMLETDEGEETSVADKSNQDQNAEEQPSGEAIPDELPLDTSWEDIYDASYACASPGADSATDILENRSSNPQGLIEHLLWQVEMEDLTESERLVAEIIVDSIDEDGYLQSSVEEIHGQLPTSIHFSPSDVESILARIQRLDPHGIAARSPAEALFIQLEQLPETTPWRGEAMEIVLNHLERVAEQRYDEIQAELGLSGSAVDETLELIRSLDPRPGQQLSNSAPEYIIPDVTVFRQDGAWHVELNREMTPTLRINPYYASLVKRGDTSADNHCLRTHLQEARWLLKSLQSRNETLLKVASAIVDRQREFMDHGEEAMQPLVLREIAETIDMHESTISRITTNKYMYTPRGTYEFKYFFSSHVSTTDGGEASATAIRARIKRLISAEETSKPLSDSAIAESLRDEGIKVARRTVAKYRESLGIASSSQRKVKQRGYGNKTQAAELRK >NZ_AP017372|2011382:2058504|2030822_2031293_+|WP_096409896.1|DBSCAN-SWA MDIEQLISPERVRCVSEAKDKEQVLSYVGQLIGEAEDRLTGDEICKRLMARERLGSTGLGHGVALPHARLEGIDKAIGAFIRLDQGIDFHAFDRAPVDILFALVVPEHFTDEHLQILAALAEMFSDPELCEKLRSPSSDDSLHEILRGWRPSSDST >NZ_AP017372|2011382:2058504|2011382_2011838_-|WP_096409876.1|protease|DBSCAN-SWA MNSSRPYLVRAIYEWIADNDKTPYLLVDASRSDIDAPTEYAEDGRLVLNVSPRAVQGLNMGNDVIRFSARFGGVARGVTIPVGAVMAVYARENGQGMMFGAEDEMEEETNQSGEATHEVNAGQDPDGGDDAPGSSSSNSGGGKRPNLRVVK >NZ_AP017372|2011382:2058504|2033197_2033479_+|WP_096409899.1|DBSCAN-SWA 
MTCKEAVIQNRLGLHARAAARFVSVASGFKCDIHVCSGDKKVNGKSIMGLMMLAAGLGTQITIEANGQDEELAARDLAELVENGFGEDPNQED >NZ_AP017372|2011382:2058504|2040891_2042352_-|WP_096409902.1|DBSCAN-SWA MSQEILINLTPRETRVALVENGVLQEIHLERAQRRGLVGNIYLGSVRRVLPGMQAAFIDAGLERTAFLHVSDLQRDASDPAPPIDQLIREQQSLLVQVIKDPIGDKGARVTTQITVPSRYLVLTPSSCGMGISARIEDESERARLRSIGQAINEQSEGGIGQVGLIFRTAAEGACAEEISSDREFLLRLWDSIGKRAASAQPGTLVHEDLPLYQRAMRDLANRDLERVRVDSHEGYQRLLEFCSDLLPHTLERIELYSGERPIFDLYGIEDEIERALARRVDLKSGGHLVIDQTEAMTIVDVNTGGFVGHRNVDETIFKTNLEAAQAIARQLRLRNIGGIIIVDFIDMNDPEHQRQVLRALEKSLDRDHAKTQIGEVSALGLVEMTRKRTRESLEQLTCQACPNCDGRGTVKTAETVSYEIFREILREVRQFESQRIMVLAAPSVVETLLDEESTGLAHLEEFIERPIELQVETLYAPDQFDVIPV >NZ_AP017372|2011382:2058504|2028181_2028907_+|WP_096409893.1|DBSCAN-SWA MATLKAEGLYKSYRGRAIVSDANLEVAKGEIVGLLGPNGAGKTTCFYMVVGLVKADAGSITLNQADLTELPIHARAKAGIGYLPQEPSVFRKLSVADNLNAILQLRSDLSRAERKSAAEELLEDFGVNHVRDSMGISLSGGERRRVEIARALAANPQFILLDEPFAGVDPISVGDIQRIVRRLADRGIGVLITDHNVRETLGIVQRAYILSDGRVLAAGNAQEILADPKVREVYLGEDFSL >NZ_AP017372|2011382:2058504|2011859_2012459_-|WP_096409877.1|DBSCAN-SWA MIRLYTGELCIHCHRVRLALAVKAVDAERIKVDPDCPAVELLEQNPYGDVPTLVDRDVTLYEPDIIIEYLDERYPHPPLMPVDPISRAKSRLVVYRMQRDWYADYDRILTSSKSKAQSARRSLRESLTAAAELFEDQQYFLSHELTVMDITILPLLWRLPIAGVELPDEAGAVKKYAEEQFLTNWFKASLTTDELNMAR >NZ_AP017372|2011382:2058504|2053263_2055411_-|WP_096409912.1|DBSCAN-SWA 
MESMERITSANPSAQQQVASLQPSLFALLTKRHPNRLFRIGLPGIDSEIIEVETVEGVEQVGDDYSYTVRISSDHLLDRSNWMAATVSLALSLEPNPHVIHGIIFAVEDVAHPTALTSLVIKIRSPLALLELSRHQRVWRNISAADAAYGVLTELLPANTSVSLEVQGSHEKQAILLQGQESDLEFIQRILSRAGLFMLLRHRQSLTEVVISDSLQGSSSSSLELDYVPQSGQVRSPGTIFGLRSWTELKPRQVAIAGFDPQFPAQSQAAVASSALTDPLVQSQPASRWGEVERFGECGGGRESHEKLAQVYQRAFDAQRGGIEIDTENRLTPGQIVRINNHPNHALNSAYFVISANHQGDQAAAIRGGPGKERASYICRARLLSVALEYADLPKNRLAASGLMVAELEGSNPSQAYIDEEGSYRARLAMDQGDSPPGQATPPLRLLQPYGGRSYGWHWPMLPHTKVAVAGLDGDLERPIMLGALGNQHQPLPVNSANPDQHVLRTPAGHELVFDDRDGAESLSLTAAQKSGELRMEAGRQDCDARVSLSSAQGALEINSGGNFNIACGGNRLVEVAADCSTQVGGDWTIAVDEGEIRWSAGANLSLSAEQGDLWCSAENANAQIDAGESLNLSSGQGMLLKTRHGDATVVAEAGCLAVGSHGDIEIISNGGGKIAIGSDFGIRIDGDGDVVIEGENISLEAKKISLRASEITEN >NZ_AP017372|2011382:2058504|2047592_2050064_-|WP_096410406.1|tRNA|DBSCAN-SWA MEEQYQPQKIETQAQQYWQEKESFRATEGDSRPKYYCLSMFPYPSGRLHMGHVRNYTIGDVVSRYKRMQGYNVLQPMGWDAFGLPAENAAMERGVPPAAWTRENIEAMREQLQRLGFGYDWQRELATCDPQYYRWEQWLFTRLYRKGLVYRDTAAVNWDPVDQTVLANEQVIDGRGWRSGALVERREIPQWFLRITDYVGELLDGLDNLKGWPDQVLNMQRNWIGRSEGVELSFALDGRDEVLTVFTTRPDTLYGVTYMGLAPEHPLALELAASNQQIADLVDEARSGGTAEADMATREKRGVDTGLEAIHPLTGQRIPVWVANFVLMEYGSGAVMAVPAHDQRDWEFATQYNIPIKPVIHPGDGSDLDISEGAFSDYGVLVESGPFSGMTSQNAFVAIAERLEAEGLGQRQTQYRLRDWGISRQRYWGAPIPIIHCPSCGAVPVPDEQLPVTLPEEVEITGAGSPLKSMPSFYSTTCPECGGSAERETDTFDTFMESSWYFARFACADQDQSMLDERADQWLPVDQYIGGIEHAVLHLLYARFFHKVLRDEGLVSSDEPFTRLLTQGMVLKDGAKMSKSKGNTVDPQEMVERFGADTVRLFTMFAAPPDQSLEWSDSGVEGSYRFLRRLYGLVRDHVSAGAPPDLDAQSLHESLTKEQKDLRRKVHETIVKASDDIGKRFTFNTAIAATMELTNALGKAVEDQSDAGRAVMQEGLETAVLILAPITPHLSHYLWWQLGYEQPVVDARWPQADASALERDELELVVQVNGKLRGHVTLPADASEDEARQAALQEPNVQRFVEGKEVKKVIFVPGKLLNVVVAK >NZ_AP017372|2011382:2058504|2047075_2047606_-|WP_162549472.1|DBSCAN-SWA MLLSEMKIGHFRVYFRGALLAVVLLGLVVSACGWQLRGSPGGISLAGQGLYVIDDIGSSDLRRAVRRGIEGAGGRQVESRNEADLVVILHTRSSDRETAAVGVGGDAEEYRLEYRVSYSVEDSAGDVLIDRQAANAMRTFAEVEGGADQRRRREDDIEEELRDEVARMIMLRLQVI |
47 | Staphylococcus_phage(22.22%) | tRNA,plate,protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
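The protein records in the prophage region use pipe-delimited FASTA headers such as `>NZ_AP017372|2011382:2058504|2058063_2058504_-|WP_096409915.1|plate|DBSCAN-SWA`. The sketch below shows one way such a header might be split into fields. The field meanings (contig, prophage region, gene coordinates and strand, protein accession, optional phage keyword, tool) are inferred from the records on this page, not from any published format specification, and the names `ProphageProtein` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProphageProtein(NamedTuple):
    """Fields of one header; the layout is an assumption inferred from this page."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. 'plate', 'protease', 'tRNA'; None if absent
    tool: str


def parse_header(header: str) -> ProphageProtein:
    """Split a header like '>contig|start:end|gstart_gend_strand|acc[|keyword]|tool'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")

    contig, region, gene, accession = fields[:4]
    tool = fields[-1]
    # The keyword field is only present for proteins matching a phage keyword.
    keyword = fields[4] if len(fields) == 6 else None

    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")

    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        accession, keyword, tool,
    )


if __name__ == "__main__":
    example = ">NZ_AP017372|2011382:2058504|2058063_2058504_-|WP_096409915.1|plate|DBSCAN-SWA"
    print(parse_header(example))
```

Collecting the non-empty `keyword` values over all headers of a region should reproduce the keyword column of the summary row (here `tRNA,plate,protease`), assuming the inferred layout holds.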
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|