Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP040506 | Hungatella hathewayi WAL-18680 chromosome, complete genome | 7 crisprs | DinG,DEDDh,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,RT,csa3,WYL | 0 | 9 | 6 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_1 | 687022-687117 | Orphan |
NA
Consensus repeat of NZ_CP040506_1
|
1 spacers
spacers of NZ_CP040506_1
>1.1|687049|42|NZ_CP040506|CRISPRCasFinder CGATAAGGCTTTTTCATATAATAAGTTTTCAGGTACGCTCTC |
CRISPR arrays and Neighbor proteins around NZ_CP040506_1
The CRISPR arrays of NZ_CP040506_1 >merge|NZ_CP040506|1|687022-687117|CRISPRCasFinder TCTCCCGAATATACTGGTAAGCATTGGCGATAAGGCTTTTTCATATAATAAGTTTTCAGGTACGCTCTCTCTCCCGAATACACTGGTAAGCATTGG >NZ_CP040506|1|1|687022-687117|CRISPRCasFinder TCTCCCGAATATACTGGTAAGCATTGG CGATAAGGCTTTTTCATATAATAAGTTTTCAGGTACGCTCTC TCTCCCGAATACACTGGTAAGCATTGG
>NZ_CP040506.1|WP_006780529.1|682238_685247_+|leucine-rich-repeat-protein MKRNHKRVISTLLIMLLISTQPAVMAWADSVVPADHLLGDDENTGKKTGYATPTDADQKDEDAPKPDDAPKGDEIPSTEIPSNKNLTDGKGPENENSGNTEKVVIRWEFVDDDNLIEGELSLIGVSPENRADFDTVVSMLPEQVRVELEETGEVTLPIIGWSCPEYQQDEDEEWPFTGEYEFIAELPEGYVCEPPISVLVTLGGAMVNTINDRFTIDGLNYKELGPDTVQLIGYDGAPVGTLVIPDKVRKPSNGREYQVASIGHYAFLDCSGLTGDLVIPDTVTEIGDMAFSGCHFTGELTLSDSLVTIGEYTFFECGFTGQLVLPQTLTRIGERAFTYTTFSGQLILPEKLNYIGELAFYHCNFTGDLIIPDGVTIINYDTFSGNSFTGTLTLPNKLKEIGNESFFECGFTGELVLPDGLTSIGISAFKDCSKLTGRLSIPDGITSIEQHAFYNTGFTSFDTTKQEIADLLYASGGVQASKIMVEGQPYQLHIQPAPEPDFPVGNMTYRKIGSDTVELTGYEGNSDMDIIIPDTVTDQLSGMTYSVTRIGSSAFYGKAITGSLHLPNTLVSIGDRAFKKNRFTGDLTIPVSVSHMGTGAFDSAGFTGDLTIEGKLTKLEDYVFFKCGFTGALSLPDTLTYVGYAVFKDCGFTGSLQLPAGITYIDAASFYNCSSFTGTLQLPVGVNYIGYNSFFDCSGFTGALQLPKPITEIGEMAFYGCDGLDSAHLGPNVQKLGAQVFPESLPLSTDSPRVQLLINTYLNQNAIADTSWDGNEDVPDGAVATVKQDMTIAGDRRIGTEAVITVPSGVILTVDGNLVVNGTISVEGTMIINGSISGPGTLIIGVNGRVVGDTSGIRVVYVRRGSSGTSGSSAVNPDILIGNWERTEDGIWKFRQTRGTYAANRWGIVDGLWYYFDQEGRMLTGWQYINNQWYYLCREEDIKTKTNLKEGAMATGWHFDLVYQAWFYLDTNGAMAVGEKVIDGKQYYFNPESDGTRGAMQQ >NZ_CP040506.1|WP_138669417.1|679305_682164_+|leucine-rich-repeat-protein MKRNRKRVISTLLIALLISTQPAVMAWADSMAPADHLLEDDENTGTKTGYATPADADQKDEDAPKPDDAPKGDEIPSTEIPSNENLTDGKGPENENSGNTEKVVIRWEFVDDDNLSGGELSLIGVSPENRADFDTVVSMLPEQVRAESEEAGEVTLPITRWNCQDYHQDEDGEWPFTGEYEFIAELPEGYVCEPSISVLVTLGGAMVNTINDRFTVDGLKYKELGPDTVQLMGYDGAKPEGTLIIPDKVRKPSNGREYQIVSIFSNAFRDCSGLTGDLVIPDMVTEIGVSAFEGCHFTGELTLPDSLVTIEENAFLKNEFTGQLVLPAKLNYISKNAFYQCNFTGDLIIPDGVTIIEIGAFGYNNFTGTLTLPKKLKGIGRVSFYKSGFTGELNIPDTVTYINNDAFSGCGFTGELDLPDGLTTIGGHAFENCSKLTGRLSIPDEITIIENSAFNNTGFDGFDTTKQEIANLLYASGVDENKIKVGNQPYQPSQTPSAPGFQVGDMEYQIIGSDTVKLTGYHGNSDTDISIPDTVRVSGTTYSVTQIGSYVFYDKAITGSLHLPNTLVSIEGKKAFVGNAFTGDLTIPAGVTHIELGSFEDAGFTGNLTIEGRLTRLEIVTFGRCGFTGTLSLPDTLTYIGEDAFRGCGFTGHLQLPKQVTEIGKRAFYGCDSLDSAHLGPNLQKLWPQAFPEELPLSTDSPRVQLLINAYLNQDAIADTSWDGNEDVPDGALAAVKQDTTVTGDRQIGTEAVITVPSGVILTVDGNLVVNGTISVEGTLIINGSLTGSGTLIVGVNGRVVGDTSGIRVVYVSRGSSGNSGSSAVNPDILIGNWERTEDGIWKFRQARGTYAANRWGIVDGLWYYFDREGRMLTGWQFINNQWYYLCREEDIKTKTNLKEGAMATGWHFDPIYQAWFYLDTSGAMAVGQKVIDGKQYYFNPEPDGTRGAMQQ >NZ_CP040506.1|WP_006780526.1|676267_679231_+|leucine-rich-repeat-protein MKRNHKRVISTLLIMLLISTQPAVMAWADSVVPADHLLEGDENTGKKTGYATPTDADQKDEDALKPDDTPKGDEIPSTEIPSTEIPSHENLTDEKGPENENSGKTEKVVIRWEFVDDDNLSGGELSLIGVSPENRADFDTVVSMLPEQVRAEIEEAGKVTLPITDWSCPEYQQDKDGEWPFTGKYEFIAELPEGYVCESPISVLVTLGGAMVNTINDRFTVDGLKYKELGPDTVQLMGYDGAKPVGTLIIPDKVRKPSNGREYQVINISNGAFQDCSGLTGDLVIPDTVTKIGNRAFSKCGFTGQLVLPQTLVRIEHDTFAGTAFSGQLILPEKLNYIGVYAFLDCNFTGDLIIPDEVTDVGYGAFEGNNFTGTLILPKKLKTIDREGFTLCGFTGELNIPDTVTDIGMFAFYKCGFTGDLILPDGLTSIGTSAFEGCSEFTGRLSIPDGITSIGKDAFKNTSFDGFDTTNQEIANLLYASGVDKDKIKVGDQPYQPSQPPKAPGFQVGDMDYQIIGSDTVALTGYHGNSDTDIIIPDMVTDIVSGRTYPVTHIGSDAFWKKAITGSLHLPNTLISIEEGAFAENKFTGSLLLPESLVSIGVGAFYDSGFTGDLTIPANVSYIGPSSFEKAGFTGDLTIEGKLTKLEGYEFIGCGFTGALVLPDTLTSIGDLTFQDCGFTGSLQLPKLVTEIGEKAFYGCDSLDSVYLGPNLQKLGAQAFPESLPLSTDSPRVQLLINTYLNQDAIADTSWDGKEDVPDGAVVTIKQDTTVTGDRRIGTEAVITIPSGVILTVDGNLTVDGNLVVDGTISVEGTLSINGSLSGSSTLIVRVNGRIVGDTSGIRVVYVSHGSSGNSSSSTVNPDILIGTWERTEDGIWKFHQARGTYAVNRWGIVDGLWYYFDKEGRMLTGWQYINNQWYYLCREEDSKTNTGLKEGAMATGWHFDPVYQAWFYLDTSGAMAVGEKVIDGKQYYFNPESDGTRGAMQQ >NZ_CP040506.1|WP_138669415.1|673106_676193_+|leucine-rich-repeat-protein MKRKYKQVISTLMAVLLISTQPAVMAWADSVVPADHLLEGDENTGKKTRYATPADADQKDEDALKPDDTPKGDEIPSVEIPSTEIPSHKNLTDGKGPENENSGNTEKVVIRWEFVDDDNLSGGELSLIGVSPENRADFDTVVSMLPDKLRVEIEETGEVTLPIIGWTCQEYHQDEDEEWPFTGEYEFIAELPEGYVCEPPISVLVTLGGAMVNTINDRFTVDGLRYKELGPDTVQLIGYDGAKPTGLLVIPEHVRKPSNDREYQVISIGFEAFLDCSGLTGDLVIPDTVTEIGNNAFKGCHFTGELTLSDSLVTIGEYAFNDCGFTGQLDLPQTLTRIGLSAFAETTFSGQLILPEKLNYIGIYAFADCNFTGDLIIPEGMTNTGYGAFEGNSFTGTLTLPKKLKEINRESFFLCGFTGELNIPDTVTDIGSYAFSECGFTGGLVLPDGLTSIGSYAFKDCSELTGRLSIPDEITSIGDNPFTGTGFEGFDTTKQEIADLLYASGVDKNKIKVGNQPYQPASSPQEFSEGDMDFQVIGNNTVKVTDYRGNSNTDIVIPDTVTDRVSGKTYTVTHIGSYAFGSKNITGSLYLPNTLVSIEDSAFMLNRFTGILSLPESLNTIGGAAFYDNNFTGDLTIPENVSHIGASAFESAGFTGNLIIKCKLTYLKDQAFSNCGFTGTLSLPDTLTAIGGYTFKNCGFTGSLQLPAGITSIGESSFFGCNSFTGELYLPKPVTEIGEKAFYGCSSLNSAHLGSNLQKLGIQAFPESLPLSTDSPRVQLLINTYLNQNAIADTSWNGKEDVPDGAVATVKQDTTITGDRRIGTEAVITVPSGGILTVDGNLVVDGMISVEGTLVINGSLSGSGTLIIGVNGRVVGDTSGIRVVYVSRGSSGNNSGSSSTVNSNILLGTWERTEDGIWKFRQTRGTYAANRWGIVDGLWYYFDREGRMLTGWQFINNQWYYLCREEDIKTKTNLKEGAMATGWHFDPVYQAWFYLDTSGAMAVGQKMIDGKQYYFNPEPDGTRGAMQQ >NZ_CP040506.1|WP_006780523.1|670940_672932_+|hypothetical-protein MKRFLVIFFLILALCTGMFFSMSVSSVSEGPEAVNGVLDFRGTDFTSSVYHLNGQWEFYYDCLYTPEDFRQGVPTGGEFLTLPNSWNVNGYPALGHATFRLLIQAEPGEHYLLFIPEIISSAVIWSNGTELYRAGVVGDSAANTVTGVRNELLAVSPEDGVIELVVQTANYHLTGSGLFYPMMFGRDTVMLHHFVWQRTAAAAAMGGILLIGVYHLFLYLFRRLERLYLIFSVTCLVTVLRLVMETNSMVQYFFRDGLTFLLNRVYLLLFAFHSICICLFMLEAFSLQLSRRLRRVVMACFLLPVLGVFLLPNTAAVACLFLALIPNGLAAVLALRSGKIGRDPYRLLYLFSLILFIVYAPLTKTVLEAKLYIPGVVSNLFLILSQCVMLSRSYADAHEQVERVNENLERLVEERTAQLNNTNRQLAASQDALREMIGNISHDLKTPLTVLNNYLELLGDDSIASNEQERAEYIGIAYHKNLDLQRLIHNLFEVTRMESGTVMYHPEWVQGSHLMEEVERKYANLICDRELSFSVHVDDTVDLKIDRHKIWSVLDNLIYNALRHTPKGGSISLCLRGNGEQAVLTVSDTGEGISAEHLPHIFERFYKVSPDRGEKDGSSGLGLYIVKTTMEAMGGTVEVESTLGEGTVFTLTLPARIQSSDEK >NZ_CP040506.1|WP_006780522.1|670263_670944_+|response-regulator-transcription-factor MDDSYRLLAVDDEPDILRTNRRYLEARGYRVDTAVCAADALELLKNQKYDAILLDVLLPDMNGFALCEAVRALTSAPILFLSCMDGEEDKIKGLMAGGEDYITKPYSLKELAARVYAQVRRGSMKRFVIDHQNRLLQIDNQIIPLSQKEFELFLFLMDHSGQILPAAELYQEVWRTGKPDSANTVAVHITRLRHKLEDAGSVIGRIETVRGEGYRFIPKLEARATI >NZ_CP040506.1|WP_006780521.1|669390_670119_+|hypothetical-protein MITPMTKTLETELLWSEEELTKKSKMKNEGAGLLLLGIGILAAGVCNHLLLQIIYESRIIWSAVTICCVLLGIVLAWFGIKLINKVGASVAEETAKDSGYTAKEILECYQESRQPSTLLLSLSSSPSKEKDFMEVGFLTKNWLKLPKNIFCGIMRISDVAAIWYEETALPGYDPGIFVVKSDGKLRYVKCKSDAGREIVDAITARNSKSITIRKFMFDGNEYDAFQSPQKTADIYRITQYER >NZ_CP040506.1|WP_034858377.1|668755_669274_+|hypothetical-protein MFKHWFKTWLKKEEGNAMIMGAFGIILLLMFMGIMVDMGLYFTSYRRLSAVTKYSSEEIQQMLPYYSFANDYESAFRTEFNKNLYEYGYTLDNVDRSTITRINTSRLGNPIISVEMDVALHDTYQCIFLPIIGISELPVNASRKTAQSYGIEKRYTAGMPVELWTGGVELDD >NZ_CP040506.1|WP_080568828.1|668021_668747_+|pilus-assembly-protein MMLISHKKGDGNMSGRWKKLKRESGQAMVEFALVLPILLLAIIGCMEVAWYMTAKYNLNQYAEAVGRNVKGPYMLIWYHDVHPNDWVVESTGRKPSWLSPEEQALWSFDEYDGWFAFADPGPGETIDPWYYSYAFDSEILFKKRLQGLVTMIDPDKVNYTIRGGWYINAEVLHVPGKKASWAAPRDGEKIEYYSADVRVDMTYRYEPLTVVGQWMFCHGTDYLTMKVDGRYVYNLPPGINT >NZ_CP040506.1|WP_006780518.1|667522_668041_+|pilus-assembly-protein MNKFIKRWRRLLSRREEGQSLVEFAFVFPVLLIFFSGIADTGWMIYNYISLADMTDTAVHANIKSNPSDAEDFISLYIEKSFPEFNGSAIQLSADTQVTRYDYYDYVYKSNKNKHWKVPMYYKVLKTTLDINYQVDYLTPMGKLIFGDTDNHMDLSAHSSAVKVLENDAYKP >NZ_CP040506.1|WP_138669419.1|688509_691428_+|MBL-fold-metallo-hydrolase MKKMKSLTAGKPSQLTIAQNRYEKIRIGVDPKYDPEYDPEYDPEYDTEYELALKGKCLCGCGDIAISDVWNQEAYRFLKQDYPDNFDHPNERFAAVHPSLWSNGRNNQINGIFEVIKDSIYQVRGYDMANISFVRTRNGWLVLDTLMSEECTYAALELAEDYFSKLGAAFKLQGNIKGIIISHSHVDHFGGVKAVCSYNLAGSTDLNGSYSYEELTKNCPIYAPSGFTEASVSENAYAGNAMGRRASYQYGSFVKPDKEHPDAEENWRRSISIGIGQGQSTGKVGFLKPTNIIDENTPPITIDGLEIDCQLTPGTEAPSEMNHYFPRYKALWMAENCAGTLHNLYTLRGAQVRDGNAWAKYLVETAERYGDKAEVIFQAHNWPHWKNETGTLSLKDFLLETASIYKFINDQTLLYLNQGFKMEEAAQKLRLPYALEHNWNLKPYYGTPSHDAKAVYQKYLGWYDANPIHLNPLSPEERAKAMAGYLTRSLNGESLKDSLEHDLDEGKYRTVADFAYQMYLAGGAGDCNAGHAKDLCAEALRQLAYTSESGPWRNCYLAGAWELEQGKERIHASMGTDLISNMEPYMLLDYIGILYDGDKSVEYDELGHHRNDMEFIMDITEGNKMTRFHIYIRNGAILYYQYKPEELSKPLPGEICHFSLGKEELIQLLAPPSIGQKTLDERIQALKVENSGKSFLNLIFYNLVNLKNDRFQTFDIVTPHDREFLTESEKKVDLREETKACIRMLEGHLKSIADFGDYDLLAFDEQGMNEWLETDGFHSILVKEAQVVEDTNFFAPAPVTKSKDWQNNLGIGPDGFFCKYEYIQVLESCYRFLAEPFLMGADHVHKDDRFTEKTMYLKKAILLLEPYLNRYRQNFRYDVIIENDQMRLQGNDAKAWDELKGKIFSDFDSRFFHEIPQLPKDGIVYGRQLAYTLYLLYQELYCQYVDGGVPVPEKERTAVIKEYRKPHYKKEE >NZ_CP040506.1|WP_006780532.1|691432_694498_+|leucine-rich-repeat-protein MKRNHKRAISAMLIVLLISTQPAVMAWADSVAPTDHFLEDDGNTGTKTGYATPADADQKDEDAFKPDSTPKGDEMPSHEIHSKENLTDGKQPENEISEKTVLQWEFVDDDYLDGGELSLIGVSPENRADFDTVISMLPEQVRVEIEETGEVTLPIIDWSCQEYQKDEDGEWPFTGEYEFIAELPEGYVCEPPISVLVTLGGAMVNTINDRFTVDGLKYKELGPDTVQLMGYDGVTPVGTLVIPDKVRKPSNGREYQVASIGHNAFPNCSGLTGDLVIPDTVTEIGDSAFRGCHFTGELTLSDSLVTIGEDAFYECGFTGQLVLPQTLTRIGDYAFENTTFSGQLILPENLNYIGTAAFYLCNFTGDLIIPDGVTIIDYGAFYGNSFTGTLTLPKKLKGISSESFCRSGFTGELNIPDTVTDIGESAFAGCGFTGELILPDGLTNIGPYAFMDCSKLTGRLSIPDEITSIGDDAFDNTGFDGFDTAKQEIADLLYASGVDMNKIKVGNQSYQPAFSPQKFTEGGMEYQVIGSDTVALTGYNGNSDTDISIPDKVTNRLSGTTYFVTHIGSEAFYNKAITGSLHLPNTLVSIGDSAFYKNRFTGDLTIPANVSHMGSGAFEFAGFTGDLTIEGKLTKLEDYEFFECGFTGALSLPDTLTYIGVAVFRDCGFTGSLQLPAGITYIDASSFFNCNSFTGTLQLPAGVNYIGDYGFFNCSGFTGVLKLPKPITEIGELAFFGCDGLDSAHLGPNVQKLGAEAFPESLPLSTDTPRVQLLINTYLNQDVIADTSWDGMEDVPDGAVATVKQDTTVTGDRRIGTEAVITVPSGVILTVDGNLVVDGTIFVDGTLIINGSLSGSGTLIIGANGRVVGDTSGIHVVFLNRGSSGNNSGSSSTVNPNLLIGTWERTEDGIWKFRQARGTYAANRWGIVDGLWYYFDKDGRMLTGWQFINNQWYYLCREEDIKTKTGLKEGAMATGWHFDPVYQAWFYLDTSGAMAVGQKMIDGKQYYFNPEPDGTRGALQQ >NZ_CP040506.1|WP_006780533.1|694510_697234_+|leucine-rich-repeat-protein MQKKTTNQIISIVFIVFLMSTQPAVMAWADSVAPADLFLEDDGNTGKKTGYATPTDADQKDEDVLEPDDMSKDKIFEKVVVQWEFVDDDNLSGGELSLIGVSPENRADFDTVVSMLPEQVRAEIEMAGEVTLPITDWSCPKYQKDEDGEWPFTGEYEFIAELPEGYVCEPPISVLVTLGGAMVNTINDRFTVDGLNYKELGPDTVQLIGYDGAKPTGTLVIPDHVRKPSNGREYQIVSIGSEAFLGCSGLTGELQIPDAVTSIGNFAFFNTSFTGTLILPDQLVLIGNSVFSNCSFTGDLTIPEGVASIGSRAFYNAGFTWKLTLPEGLTKIESGTFTNCGFTGELKIPDTVTFIDKQAFENCGFTGQLLFPDGVTGISDRAFYGCGSFTGRLSLSDKVSVIGNDAFYGTNFEGFDTTALLTANLLYDSGIPENMIQLVNSPYQYKGVKFLDGNMEYYDLTNSKCRLTSYYGNIRDDISIPALAKNPLESRFLVSEIGSNVFKGKNITGTLQLSSGLESIEDGAFSGNSFAGNLTIPESVSHIGSAAFEHAGFTGTLTLPNTLTSIEEQTFYGCGFKSLDLPEGLTSIGTASFGHCTSVSGVLYLPESVTEIGDSAFYGCDSLEAVHLGRNVKKLGRKAFPESTPLYTDSPQVQLLINTYLNRNTIADTSWNGGEDVPDGAIASLKQDVVVTGDKRIGTEAVITVPDGRNLTVDGNLTLDGTLVIHGSISGTGTIYVGKNGKITGDTSGVHVVYPSLPPGGDNDNNNSSSNSSGSSSSAVNPNLLIGTWERTEDGIWKFRQARGTYAANRWGIVDGLWYYFDKEGRMLTGWQFINNQWYYLCREEDIKTKTGLKEGAMATGWHFDPVYQAWFYLDTSGAMAVGQKMIDGKQYYFNPESDGTRGALQP >NZ_CP040506.1|WP_006780534.1|697498_699364_+|TIGR03960-family-B12-binding-radical-SAM-protein MRKLALPDEILLSIQQPARYIGGEVNTVNKDLSQVEIRFAMCFPDVYEIGMSHLGIQILYDMFNRREDIWCERVYSPWTDLDKIMREEKIPLFALESQDPVKDFDFLGITLQYEMSYTNILQILDLSQIPLHASGRSESDPIVIGGGPCAYNPEPLAEFFDIFYIGEGETSYYELMDRYKENKKQGGSRLSFLEMAAEIPGIYVPAFYDVTYKEDGTIESFLPNNPHAKPVIEKVVVKEMDTVYYIEKPIVPFIKVTQDRVVLEIQRGCIRGCRFCQAGNVYRPLREHGLDYLKDYAYKMLKSTGHEEISLSSLSSSDYTQLEGLVNFLIDEFKGKGVNISLPSLRIDAFSLDVMSKVQDVKKSSLTFAPEAGSQRLRDVINKGLTEEVILQGAAEAFKGGWNRVKLYFMLGLPTETVEDMEGIALLSEKVAEEYYEIPKDQRNGRVQVVASSSFFVPKPFTPFQWARMCTKEEFLERAYIVKDKFREMKNFKSLKYNYHEADLTVLEGVLARGDRRTGALIEETYRQGALFESWSENFNNQLWMDAFETCGIDPDFYTVRERSLDEIFPWDFIDAGVTKEFLKREWLQAIDEKVTPNCRQRCSACGARKYEGGVCYEGKN >NZ_CP040506.1|WP_006780535.1|699347_700058_+|radical-SAM-protein MKVRIKFTKHGAMKFIGHLDIMRYFQKAMRRADVDIKYSEGFSPHQVMSFAAPLGVGLTSNGEYMDIEVNSMKDSKTMVHQLNEVMVEGIEVLSCRRLEDTAKNAMSMVAAADYTVRFRDRARPDDMDAFFEELISFYGRESIVITKKTKRGEREVDLKPLIYDLHREGDAIFLQLSTGSSDNIKPELVLEAFYSGKGQTFSELDIQIQREEVYGNTGDEEHRVLTPLEDFGEDIE >NZ_CP040506.1|WP_006780536.1|700050_701250_+|ribonuclease-E/G MNKFIITRWEGRVLTALINEEGVFQLGLEDDGEKSLLNNIYIGKVKNVVKNIGAAFVELGNGQMAYYSLTENTRHHYTKPHGNGPLHAGDEIIVQVSKDAVKTKDPVISSNLNFTGRYSVLTAGKDVLGFSAKIADQEWKQEMKARIAPELEDGCGIIVRTNAYGADAGEILAEIRELKTCYKTVMAAGTYRTCYSLLYEAAPSYVGSLRDARNGSIDEIITDDDEICQTLSVYLGKEQPEDLGKLTLYQDSMVSLLKLYSLEKALEEASGRRVWLKSGGYLVIEPTEALTVVDVNTGKYTGKKNPRETILKINLEAARETARQMRLRNLSGIIIIDFIDMTEEEDRKLLMDSLTQWCQKDPVKTTVVDITKLNLVEVTRKKQRRPLHEIMGDNKRRLM >NZ_CP040506.1|WP_138669421.1|701251_702607_+|Trk-system-potassium-transporter-TrkA MKIIIVGCGKVGSSLAEQLYMEGHEITLIDRDADVLGAITNSIDVMGMVGNGAVYKVQMEAGIEDTDLLIATTNSDELNMLCCLIAKKAGNCQTIARIRNPEYAEEIRYIREELNLSMAINPELAAAREMSRLLRFPSAIKIDTFAKGRVEILKFIIPEHSVLHNMQVYEVTPKLRCNVLICAVERGEDVIIPNGNFQMMGGDKVYFVAPPVESMKFFKEVGIVNNSIKTAMFVGGGRITYYLAKMLQDTPIQIKIIEQDFERCKVLSEELPNVMVIHGDGSNQQVLLEEGIRQTEAFASLTGFDEENIMLSLYAASQSKAKLITKVNRIAFENVIESMNLGSIIYPKLITADSILQYVRAMQNSLGSNVETLYKIVANRAEALEFRVEKNAPMIGVPLEKLSLKDNLLVACINRNGKIITPRGKDTIEEHDTVIIVTTNTGLNDLKDILK >NZ_CP040506.1|WP_006780539.1|702620_704066_+|TrkH-family-potassium-uptake-protein MNIGIVRYFLGWVLNIEAFLMLLPCATAVVYQDHTGIYFLIVMIMCWFLGWLAVHRKPKNTVFYAREGFVTVALSWVLLSFFGALPFWISGEIPSLADAVFETISGFTTTGASILNNVEGLSQSMLMWRSFTHWVGGMGVLVFLLAILPLAGGGYSMHIMRAESPGPSVGKLVPKVKATAKLLYLIYFSMTVIEVILLLAGKMPLFDALTTGFGTAGTGGFGIKNNSIAFYDSYYLQGVITIFMILFGINFNVYYLFLFKRPKEALKSEEARTYLGIILVSTLLIAWNVRSFFPTLFDAFHHAAFQVASIITTTGFSTVDFDVWPQFSKTILIWLMFIGACAGSTGGGMKVSRFIIWIKETLKELASLIHPRSVKVMKLEGKPIEHNIVRSANAYFIVYILIFASSVLLVSLDEFDFNTTFTAVAATFNNIGPGMGGVGPASNFSEFSVMSKLVLMFDMLAGRLEIFPMLLLFSPGTWRKQ >NZ_CP040506.1|WP_138670084.1|704162_705308_-|MFS-transporter MNYPVFLRCCYGYAVSGMSVLVVGAILPSLIREAGLSYALAGGLLSMMAIGNLFASLFFPAMVSAIGKRMAITIMASIVPCSYLVLTFLPGIPVMYLIMALVGVARGSITIINNATVNEISNNSNKMVNLLHCSFAVGAFLAPFLTALLSYAGFTWKSIMYVIIALCVTSTLSYATMEYPSDGREKKHQNLSEKHAFLKSFDFYCIGFVLFFYLGVENCINGWFVTYLQSTGIMTETFATTMVSFTWLVIMAGRLVCASLSKHYSKSAIVLMNAIGSGICFFILISSSWLPVITVALLGFGFFLAGIYPGCIANAGPIIGGSTMGMSVLTAISAMGGIITPQLVGSAADRIGLVAAIGILSVNVIVVIVLSAINFRRLRQR >NZ_CP040506.1|WP_006780541.1|705464_705710_+|iron-only-hydrogenase-system-regulator METRIAVIGIIIEDKESVALVNEILHQYGSYIIGRMGLPYEKKQVNIISVVVDAPGDIISALSGKLGNIRGVSAKALHSKA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_2 | 3206890-3208870 | TypeI-B |
NA
Consensus repeat of NZ_CP040506_2
|
30 spacers
spacers of NZ_CP040506_2
>2.1|3206920|33|NZ_CP040506|CRISPRCasFinder,CRT CCATGTCGACAACGCAGACCCATACGGCATCTA >2.2|3206983|34|NZ_CP040506|CRISPRCasFinder,CRT TTGCTCCTTTCACAATGCCTCCAAAAACCTTATC >2.3|3207047|34|NZ_CP040506|CRISPRCasFinder,CRT TTATTGCTGCCAAGTCCAGCATACATCTCAAATG >2.4|3207111|36|NZ_CP040506|CRISPRCasFinder,CRT CAATATCTCGCTTTTTGTTCTCATTTTGCTACCTCC >2.5|3207177|35|NZ_CP040506|CRISPRCasFinder,CRT CCATAAGGTCCTGGAAACCATTTTTCTGTATGCAA >2.6|3207242|35|NZ_CP040506|CRISPRCasFinder,CRT AAAAATGGGGACGGCTTGGTGGTTGGCAATTATAC >2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT AGGATATGAAATACAAAAATAAAGAGGGGTATTA >2.8|3207371|34|NZ_CP040506|CRISPRCasFinder,CRT CATGGTCAAATGGCTCGGTAGAGGCTGCACCGTT >2.9|3207435|34|NZ_CP040506|CRISPRCasFinder,CRT CTTTGAAAACCGCTGTTTCCTCAAGTGACTGTTG >2.10|3207499|35|NZ_CP040506|CRISPRCasFinder,CRT GAACAGACGGAGACCGGCGCCGATGAGGAGACTGG >2.11|3207564|36|NZ_CP040506|CRISPRCasFinder,CRT ACATCCTCGACGAAGGAATACAAAACTTATCAAAGA >2.12|3207630|35|NZ_CP040506|CRISPRCasFinder,CRT GTTCCAAAGTTTCCACACATCCATCTTGCGTAGTT >2.13|3207695|36|NZ_CP040506|CRISPRCasFinder,CRT TGATGACCCATCTTATATCATGGGAAAAAAATTACA >2.14|3207761|34|NZ_CP040506|CRISPRCasFinder,CRT TTATACTCAAATTGCGATATCGCAACAGAAAGGA >2.15|3207825|35|NZ_CP040506|CRISPRCasFinder,CRT TTTTCATTTACTCCTGCAACAACAAACACTCTTTC >2.16|3207890|36|NZ_CP040506|CRISPRCasFinder,CRT ATGCTGAAATCTGGTTGATTCCAGACGCAATATTGC >2.17|3207956|36|NZ_CP040506|CRISPRCasFinder,CRT AATTAAGAAACTTCTTTACCTGCACTTTACCATGGA >2.18|3208022|36|NZ_CP040506|CRISPRCasFinder,CRT GATAAAGATAACCGACAGCGTTGCAATACCGTTCGT >2.19|3208088|36|NZ_CP040506|CRISPRCasFinder,CRT AGGACCTGGGTCAGAACAAAATCATCATCCGGTACC >2.20|3208154|35|NZ_CP040506|CRISPRCasFinder,CRT ATCTTTTCCACCGGACCAACTTACCATATAGTGCA >2.21|3208219|36|NZ_CP040506|CRISPRCasFinder,CRT CGGGTACCAGTACCCCTATACCGGTCATATCCTTAA >2.22|3208285|36|NZ_CP040506|CRISPRCasFinder,CRT GAAGAAGGGAGGGGATGCCGATGGAGAAGGAGATTC >2.23|3208351|36|NZ_CP040506|CRISPRCasFinder,CRT GTCTCAAGGTAAAATAACTGGCCGAAACAGGAAATT >2.24|3208417|35|NZ_CP040506|CRISPRCasFinder,CRT AGGGAAATCAGCTGCGGGTATGAGTGCAGCTATGT >2.25|3208482|36|NZ_CP040506|CRISPRCasFinder,CRT GCATTCTCCATGGCGGTTAGAAGGGTGGCTTCCGTG >2.26|3208548|34|NZ_CP040506|CRISPRCasFinder,CRT ACTGCATGTCTGTCTTCTTAATCAATCCGATAGC >2.27|3208612|34|NZ_CP040506|CRISPRCasFinder,CRT TGTATACCGGTGTCAATCAGGAAGGACAGACCTA >2.28|3208676|35|NZ_CP040506|CRISPRCasFinder,CRT AGTTTGTCCATCTTGGTCTTTAACTCTAAAAGAGT >2.29|3208741|36|NZ_CP040506|CRISPRCasFinder,CRT GCTATAACCGGATTCTTTATACGCATATGATATTGA >2.30|3208807|34|NZ_CP040506|CRISPRCasFinder,CRT GATATTATGGCGACAAACAGGGAGCTGCCGGACA >2.31|3206922|33|NZ_CP040506|PILER-CR CCATGTCGACAACGCAGACCCATACGGCATCTA >2.32|3206985|34|NZ_CP040506|PILER-CR TTGCTCCTTTCACAATGCCTCCAAAAACCTTATC >2.33|3207049|34|NZ_CP040506|PILER-CR TTATTGCTGCCAAGTCCAGCATACATCTCAAATG >2.34|3207113|36|NZ_CP040506|PILER-CR CAATATCTCGCTTTTTGTTCTCATTTTGCTACCTCC >2.35|3207179|35|NZ_CP040506|PILER-CR CCATAAGGTCCTGGAAACCATTTTTCTGTATGCAA >2.36|3207244|35|NZ_CP040506|PILER-CR AAAAATGGGGACGGCTTGGTGGTTGGCAATTATAC >2.37|3207309|34|NZ_CP040506|PILER-CR AGGATATGAAATACAAAAATAAAGAGGGGTATTA >2.38|3207373|34|NZ_CP040506|PILER-CR CATGGTCAAATGGCTCGGTAGAGGCTGCACCGTT >2.39|3207437|34|NZ_CP040506|PILER-CR CTTTGAAAACCGCTGTTTCCTCAAGTGACTGTTG >2.40|3207501|35|NZ_CP040506|PILER-CR GAACAGACGGAGACCGGCGCCGATGAGGAGACTGG >2.41|3207566|36|NZ_CP040506|PILER-CR ACATCCTCGACGAAGGAATACAAAACTTATCAAAGA >2.42|3207632|35|NZ_CP040506|PILER-CR GTTCCAAAGTTTCCACACATCCATCTTGCGTAGTT >2.43|3207697|36|NZ_CP040506|PILER-CR TGATGACCCATCTTATATCATGGGAAAAAAATTACA >2.44|3207763|34|NZ_CP040506|PILER-CR TTATACTCAAATTGCGATATCGCAACAGAAAGGA >2.45|3207827|35|NZ_CP040506|PILER-CR TTTTCATTTACTCCTGCAACAACAAACACTCTTTC >2.46|3207892|36|NZ_CP040506|PILER-CR ATGCTGAAATCTGGTTGATTCCAGACGCAATATTGC >2.47|3207958|36|NZ_CP040506|PILER-CR AATTAAGAAACTTCTTTACCTGCACTTTACCATGGA >2.48|3208024|36|NZ_CP040506|PILER-CR GATAAAGATAACCGACAGCGTTGCAATACCGTTCGT >2.49|3208090|36|NZ_CP040506|PILER-CR AGGACCTGGGTCAGAACAAAATCATCATCCGGTACC >2.50|3208156|35|NZ_CP040506|PILER-CR ATCTTTTCCACCGGACCAACTTACCATATAGTGCA >2.51|3208221|36|NZ_CP040506|PILER-CR CGGGTACCAGTACCCCTATACCGGTCATATCCTTAA >2.52|3208287|39|NZ_CP040506|PILER-CR TACGAAGAAGGGAGGGGATGCCGATGGAGAAGGAGATTC >2.53|3208356|36|NZ_CP040506|PILER-CR GTCTCAAGGTAAAATAACTGGCCGAAACAGGAAATT >2.54|3208422|35|NZ_CP040506|PILER-CR AGGGAAATCAGCTGCGGGTATGAGTGCAGCTATGT >2.55|3208487|36|NZ_CP040506|PILER-CR GCATTCTCCATGGCGGTTAGAAGGGTGGCTTCCGTG >2.56|3208553|34|NZ_CP040506|PILER-CR ACTGCATGTCTGTCTTCTTAATCAATCCGATAGC >2.57|3208617|34|NZ_CP040506|PILER-CR TGTATACCGGTGTCAATCAGGAAGGACAGACCTA >2.58|3208681|35|NZ_CP040506|PILER-CR AGTTTGTCCATCTTGGTCTTTAACTCTAAAAGAGT >2.59|3208746|36|NZ_CP040506|PILER-CR GCTATAACCGGATTCTTTATACGCATATGATATTGA >2.60|3208812|34|NZ_CP040506|PILER-CR GATATTATGGCGACAAACAGGGAGCTGCCGGACA |
cas2,cas1,cas3,cas5,cas7b,cas8b1,cas6 |
CRISPR arrays and Neighbor proteins around NZ_CP040506_2
The CRISPR arrays of NZ_CP040506_2 >merge|NZ_CP040506|2|3206890-3208870|CRISPRCasFinder,CRT,PILER-CR TATTTAATACAGCTACTGTTCTTCTTCAACCCATGTCGACAACGCAGACCCATACGGCATCTAATTTAAATACAGCTACTGTTCTTCTTCAACTTGCTCCTTTCACAATGCCTCCAAAAACCTTATCATTTAAATACAGCTACTGTTCTTCTTCAACTTATTGCTGCCAAGTCCAGCATACATCTCAAATGATTTAAATACAGCTACTGTTCTTCTTCAACCAATATCTCGCTTTTTGTTCTCATTTTGCTACCTCCATTTAAATACAGCTACTGTTCTTCTTCAACCCATAAGGTCCTGGAAACCATTTTTCTGTATGCAAATTTAAATACAGCTACTGTTCTTCTTCAACAAAAATGGGGACGGCTTGGTGGTTGGCAATTATACATTTAAATACAGCTACTGTTCTTCTTCAACAGGATATGAAATACAAAAATAAAGAGGGGTATTAATTTAAATACAGCTACTGTTCTTCTTCAACCATGGTCAAATGGCTCGGTAGAGGCTGCACCGTTATTTAAATACAGCTACTGTTCTTCTTCAACCTTTGAAAACCGCTGTTTCCTCAAGTGACTGTTGATTTAAATACAGCTACTGTTCTTCTTCAACGAACAGACGGAGACCGGCGCCGATGAGGAGACTGGATTTAAATACAGCTACTGTTCTTCTTCAACACATCCTCGACGAAGGAATACAAAACTTATCAAAGAATTTAAATACAGCTACTGTTCTTCTTCAACGTTCCAAAGTTTCCACACATCCATCTTGCGTAGTTATTTAAATACAGCTACTGTTCTTCTTCAACTGATGACCCATCTTATATCATGGGAAAAAAATTACAATTTAAATACAGCTACTGTTCTTCTTCAACTTATACTCAAATTGCGATATCGCAACAGAAAGGAATTTAAATACAGCTACTGTTCTTCTTCAACTTTTCATTTACTCCTGCAACAACAAACACTCTTTCATTTAAATACAGCTACTGTTCTTCTTCAACATGCTGAAATCTGGTTGATTCCAGACGCAATATTGCATTTAAATACAGCTACTGTTCTTCTTCAACAATTAAGAAACTTCTTTACCTGCACTTTACCATGGAATTTAAATACAGCTACTGTTCTTCTTCAACGATAAAGATAACCGACAGCGTTGCAATACCGTTCGTATTTAAATACAGCTACTGTTCTTCTTCAACAGGACCTGGGTCAGAACAAAATCATCATCCGGTACCATTTAAATACAGCTACTGTTCTTCTTCAACATCTTTTCCACCGGACCAACTTACCATATAGTGCAATTTAAATACAGCTACTGTTCTTCTTCAACCGGGTACCAGTACCCCTATACCGGTCATATCCTTAAATTTAAATACAGCTACTGTTCTTCTTCTACGAAGAAGGGAGGGGATGCCGATGGAGAAGGAGATTCATTTAAATACAGCTACTGTTCTTCTTCAACGTCTCAAGGTAAAATAACTGGCCGAAACAGGAAATTATTTAAATACAGCTACTGTTCTTCTTCAACAGGGAAATCAGCTGCGGGTATGAGTGCAGCTATGTATTTAAATACAGCTACTGTTCTTCTTCAACGCATTCTCCATGGCGGTTAGAAGGGTGGCTTCCGTGATTTAAATACAGCTACTGTTCTTCTTCAACACTGCATGTCTGTCTTCTTAATCAATCCGATAGCATTTAAATACAGCTACTGTTCTTCTTCAACTGTATACCGGTGTCAATCAGGAAGGACAGACCTAATTTAAATACAGCTACTGTTCTTCTTCAACAGTTTGTCCATCTTGGTCTTTAACTCTAAAAGAGTATTTAAATACAGCTACTGTTCTTCTTCAACGCTATAACCGGATTCTTTATACGCATATGATATTGAATTTAAATACAGCTACTGTTCTTCTTCAACGATATTATGGCGACAAACAGGGAGCTGCCGGACAATTTAAATACAGCTACTGTTCTTCTTCAAC >NZ_CP040506|2|2|3206890-3208870|CRISPRCasFinder TATTTAATACAGCTACTGTTCTTCTTCAAC CCATGTCGACAACGCAGACCCATACGGCATCTA ATTTAAATACAGCTACTGTTCTTCTTCAAC TTGCTCCTTTCACAATGCCTCCAAAAACCTTATC ATTTAAATACAGCTACTGTTCTTCTTCAAC TTATTGCTGCCAAGTCCAGCATACATCTCAAATG ATTTAAATACAGCTACTGTTCTTCTTCAAC CAATATCTCGCTTTTTGTTCTCATTTTGCTACCTCC ATTTAAATACAGCTACTGTTCTTCTTCAAC CCATAAGGTCCTGGAAACCATTTTTCTGTATGCAA ATTTAAATACAGCTACTGTTCTTCTTCAAC AAAAATGGGGACGGCTTGGTGGTTGGCAATTATAC ATTTAAATACAGCTACTGTTCTTCTTCAAC AGGATATGAAATACAAAAATAAAGAGGGGTATTA ATTTAAATACAGCTACTGTTCTTCTTCAAC CATGGTCAAATGGCTCGGTAGAGGCTGCACCGTT ATTTAAATACAGCTACTGTTCTTCTTCAAC CTTTGAAAACCGCTGTTTCCTCAAGTGACTGTTG ATTTAAATACAGCTACTGTTCTTCTTCAAC GAACAGACGGAGACCGGCGCCGATGAGGAGACTGG ATTTAAATACAGCTACTGTTCTTCTTCAAC ACATCCTCGACGAAGGAATACAAAACTTATCAAAGA ATTTAAATACAGCTACTGTTCTTCTTCAAC GTTCCAAAGTTTCCACACATCCATCTTGCGTAGTT ATTTAAATACAGCTACTGTTCTTCTTCAAC TGATGACCCATCTTATATCATGGGAAAAAAATTACA ATTTAAATACAGCTACTGTTCTTCTTCAAC TTATACTCAAATTGCGATATCGCAACAGAAAGGA ATTTAAATACAGCTACTGTTCTTCTTCAAC TTTTCATTTACTCCTGCAACAACAAACACTCTTTC ATTTAAATACAGCTACTGTTCTTCTTCAAC ATGCTGAAATCTGGTTGATTCCAGACGCAATATTGC ATTTAAATACAGCTACTGTTCTTCTTCAAC AATTAAGAAACTTCTTTACCTGCACTTTACCATGGA ATTTAAATACAGCTACTGTTCTTCTTCAAC GATAAAGATAACCGACAGCGTTGCAATACCGTTCGT ATTTAAATACAGCTACTGTTCTTCTTCAAC AGGACCTGGGTCAGAACAAAATCATCATCCGGTACC ATTTAAATACAGCTACTGTTCTTCTTCAAC ATCTTTTCCACCGGACCAACTTACCATATAGTGCA ATTTAAATACAGCTACTGTTCTTCTTCAAC CGGGTACCAGTACCCCTATACCGGTCATATCCTTAA ATTTAAATACAGCTACTGTTCTTCTTCTAC GAAGAAGGGAGGGGATGCCGATGGAGAAGGAGATTC ATTTAAATACAGCTACTGTTCTTCTTCAAC GTCTCAAGGTAAAATAACTGGCCGAAACAGGAAATT ATTTAAATACAGCTACTGTTCTTCTTCAAC AGGGAAATCAGCTGCGGGTATGAGTGCAGCTATGT ATTTAAATACAGCTACTGTTCTTCTTCAAC GCATTCTCCATGGCGGTTAGAAGGGTGGCTTCCGTG ATTTAAATACAGCTACTGTTCTTCTTCAAC ACTGCATGTCTGTCTTCTTAATCAATCCGATAGC ATTTAAATACAGCTACTGTTCTTCTTCAAC TGTATACCGGTGTCAATCAGGAAGGACAGACCTA ATTTAAATACAGCTACTGTTCTTCTTCAAC AGTTTGTCCATCTTGGTCTTTAACTCTAAAAGAGT ATTTAAATACAGCTACTGTTCTTCTTCAAC GCTATAACCGGATTCTTTATACGCATATGATATTGA ATTTAAATACAGCTACTGTTCTTCTTCAAC GATATTATGGCGACAAACAGGGAGCTGCCGGACA ATTTAAATACAGCTACTGTTCTTCTTCAAC >NZ_CP040506|2|1|3206890-3208870|CRT TATTTAATACAGCTACTGTTCTTCTTCAAC CCATGTCGACAACGCAGACCCATACGGCATCTA ATTTAAATACAGCTACTGTTCTTCTTCAAC TTGCTCCTTTCACAATGCCTCCAAAAACCTTATC ATTTAAATACAGCTACTGTTCTTCTTCAAC TTATTGCTGCCAAGTCCAGCATACATCTCAAATG ATTTAAATACAGCTACTGTTCTTCTTCAAC CAATATCTCGCTTTTTGTTCTCATTTTGCTACCTCC ATTTAAATACAGCTACTGTTCTTCTTCAAC CCATAAGGTCCTGGAAACCATTTTTCTGTATGCAA ATTTAAATACAGCTACTGTTCTTCTTCAAC AAAAATGGGGACGGCTTGGTGGTTGGCAATTATAC ATTTAAATACAGCTACTGTTCTTCTTCAAC AGGATATGAAATACAAAAATAAAGAGGGGTATTA ATTTAAATACAGCTACTGTTCTTCTTCAAC CATGGTCAAATGGCTCGGTAGAGGCTGCACCGTT ATTTAAATACAGCTACTGTTCTTCTTCAAC CTTTGAAAACCGCTGTTTCCTCAAGTGACTGTTG ATTTAAATACAGCTACTGTTCTTCTTCAAC GAACAGACGGAGACCGGCGCCGATGAGGAGACTGG ATTTAAATACAGCTACTGTTCTTCTTCAAC ACATCCTCGACGAAGGAATACAAAACTTATCAAAGA ATTTAAATACAGCTACTGTTCTTCTTCAAC GTTCCAAAGTTTCCACACATCCATCTTGCGTAGTT ATTTAAATACAGCTACTGTTCTTCTTCAAC TGATGACCCATCTTATATCATGGGAAAAAAATTACA ATTTAAATACAGCTACTGTTCTTCTTCAAC TTATACTCAAATTGCGATATCGCAACAGAAAGGA ATTTAAATACAGCTACTGTTCTTCTTCAAC TTTTCATTTACTCCTGCAACAACAAACACTCTTTC ATTTAAATACAGCTACTGTTCTTCTTCAAC ATGCTGAAATCTGGTTGATTCCAGACGCAATATTGC ATTTAAATACAGCTACTGTTCTTCTTCAAC AATTAAGAAACTTCTTTACCTGCACTTTACCATGGA ATTTAAATACAGCTACTGTTCTTCTTCAAC GATAAAGATAACCGACAGCGTTGCAATACCGTTCGT ATTTAAATACAGCTACTGTTCTTCTTCAAC AGGACCTGGGTCAGAACAAAATCATCATCCGGTACC ATTTAAATACAGCTACTGTTCTTCTTCAAC ATCTTTTCCACCGGACCAACTTACCATATAGTGCA ATTTAAATACAGCTACTGTTCTTCTTCAAC CGGGTACCAGTACCCCTATACCGGTCATATCCTTAA ATTTAAATACAGCTACTGTTCTTCTTCTAC GAAGAAGGGAGGGGATGCCGATGGAGAAGGAGATTC ATTTAAATACAGCTACTGTTCTTCTTCAAC GTCTCAAGGTAAAATAACTGGCCGAAACAGGAAATT ATTTAAATACAGCTACTGTTCTTCTTCAAC AGGGAAATCAGCTGCGGGTATGAGTGCAGCTATGT ATTTAAATACAGCTACTGTTCTTCTTCAAC GCATTCTCCATGGCGGTTAGAAGGGTGGCTTCCGTG ATTTAAATACAGCTACTGTTCTTCTTCAAC ACTGCATGTCTGTCTTCTTAATCAATCCGATAGC ATTTAAATACAGCTACTGTTCTTCTTCAAC TGTATACCGGTGTCAATCAGGAAGGACAGACCTA ATTTAAATACAGCTACTGTTCTTCTTCAAC AGTTTGTCCATCTTGGTCTTTAACTCTAAAAGAGT ATTTAAATACAGCTACTGTTCTTCTTCAAC GCTATAACCGGATTCTTTATACGCATATGATATTGA ATTTAAATACAGCTACTGTTCTTCTTCAAC GATATTATGGCGACAAACAGGGAGCTGCCGGACA ATTTAAATACAGCTACTGTTCTTCTTCAAC >NZ_CP040506|2|1|3206892-3208870|PILER-CR TTTAATACAGCTACTGTTCTTCTTCAACCC ATGTCGACAACGCAGACCCATACGGCATCTAAT TTAAATACAGCTACTGTTCTTCTTCAACTT GCTCCTTTCACAATGCCTCCAAAAACCTTATCAT TTAAATACAGCTACTGTTCTTCTTCAACTT ATTGCTGCCAAGTCCAGCATACATCTCAAATGAT TTAAATACAGCTACTGTTCTTCTTCAACCA ATATCTCGCTTTTTGTTCTCATTTTGCTACCTCCAT TTAAATACAGCTACTGTTCTTCTTCAACCC ATAAGGTCCTGGAAACCATTTTTCTGTATGCAAAT TTAAATACAGCTACTGTTCTTCTTCAACAA AAATGGGGACGGCTTGGTGGTTGGCAATTATACAT TTAAATACAGCTACTGTTCTTCTTCAACAG GATATGAAATACAAAAATAAAGAGGGGTATTAAT TTAAATACAGCTACTGTTCTTCTTCAACCA TGGTCAAATGGCTCGGTAGAGGCTGCACCGTTAT TTAAATACAGCTACTGTTCTTCTTCAACCT TTGAAAACCGCTGTTTCCTCAAGTGACTGTTGAT TTAAATACAGCTACTGTTCTTCTTCAACGA ACAGACGGAGACCGGCGCCGATGAGGAGACTGGAT TTAAATACAGCTACTGTTCTTCTTCAACAC ATCCTCGACGAAGGAATACAAAACTTATCAAAGAAT TTAAATACAGCTACTGTTCTTCTTCAACGT TCCAAAGTTTCCACACATCCATCTTGCGTAGTTAT TTAAATACAGCTACTGTTCTTCTTCAACTG ATGACCCATCTTATATCATGGGAAAAAAATTACAAT TTAAATACAGCTACTGTTCTTCTTCAACTT ATACTCAAATTGCGATATCGCAACAGAAAGGAAT TTAAATACAGCTACTGTTCTTCTTCAACTT TTCATTTACTCCTGCAACAACAAACACTCTTTCAT TTAAATACAGCTACTGTTCTTCTTCAACAT GCTGAAATCTGGTTGATTCCAGACGCAATATTGCAT TTAAATACAGCTACTGTTCTTCTTCAACAA TTAAGAAACTTCTTTACCTGCACTTTACCATGGAAT TTAAATACAGCTACTGTTCTTCTTCAACGA TAAAGATAACCGACAGCGTTGCAATACCGTTCGTAT TTAAATACAGCTACTGTTCTTCTTCAACAG GACCTGGGTCAGAACAAAATCATCATCCGGTACCAT TTAAATACAGCTACTGTTCTTCTTCAACAT CTTTTCCACCGGACCAACTTACCATATAGTGCAAT TTAAATACAGCTACTGTTCTTCTTCAACCG GGTACCAGTACCCCTATACCGGTCATATCCTTAAAT TTAAATACAGCTACTGTTCTTCTTCTACGA AGAAGGGAGGGGATGCCGATGGAGAAGGAGATTCATTTA AATACAGCTACTGTTCTTCTTCAACGTCTC AAGGTAAAATAACTGGCCGAAACAGGAAATTATTTA AATACAGCTACTGTTCTTCTTCAACAGGGA AATCAGCTGCGGGTATGAGTGCAGCTATGTATTTA AATACAGCTACTGTTCTTCTTCAACGCATT CTCCATGGCGGTTAGAAGGGTGGCTTCCGTGATTTA AATACAGCTACTGTTCTTCTTCAACACTGC ATGTCTGTCTTCTTAATCAATCCGATAGCATTTA AATACAGCTACTGTTCTTCTTCAACTGTAT ACCGGTGTCAATCAGGAAGGACAGACCTAATTTA AATACAGCTACTGTTCTTCTTCAACAGTTT GTCCATCTTGGTCTTTAACTCTAAAAGAGTATTTA AATACAGCTACTGTTCTTCTTCAACGCTAT AACCGGATTCTTTATACGCATATGATATTGAATTTA AATACAGCTACTGTTCTTCTTCAACGATAT TATGGCGACAAACAGGGAGCTGCCGGACAATTTA AATACAGCTACTGTTCTTCTTCAAC
>NZ_CP040506.1|WP_006782613.1|3206536_3206749_+|helix-turn-helix-transcriptional-regulator MIKYDPLWETMKKRNISQYKLIKDYGIDKAQLQRLRKNEVVKTIILNKLCEILDCRIEEILVYEPDITEE >NZ_CP040506.1|WP_006782612.1|3206083_3206425_-|hypothetical-protein MDKVRVESKNTDSYRFKERVYRLMNGTYDLDIYSVGEMNTVETEFSDGKYCEELYKDIFEANCRICERLGEEEDKDVEIIIHNYNLMTEYLCMKMFDYGVLFCKREIAKTVVV >NZ_CP040506.1|WP_034859973.1|3204670_3205987_+|hypothetical-protein MEKEMKNLRKIVSLMAAVCMLLSIWQPMTAKAAEGKLVVEGNVTLSGGNGNMEDVLITIKHGYMTDGPVVGTGHPDANGHYAIETTVSSMGFLALIVTPSLPGYDNYSPSSNIYPGQNTADLLLVANGSPTTYGVSGTITMNGAALPSNLCPIVDFEVPATTTKKELQACGGNYFCNGLAGNKVIITPHLDGYTFTPESITIEKIDRVYDDANFVMTPNGTAETPVAPETPDTPDAPALPENTESTEAAKNTVTLYFMTGSSPADPGEVFEQFEVAKNSRGNTTTLAKTIKSRTPVKDGYKFNYWQEGKLSDPTVLGNRINTVIWTNDTDQYIYAQYTKIEEPPTSDNTGNTVTLYFMTGSSPSDPGEVFEQFEVAKNSRGNTTTLAKTIKSRTPQKDGLKFNYWQEAKLSDPTVLGNRLYTVIWTNDTDQYIYAQYK >NZ_CP040506.1|WP_006782609.1|3204047_3204260_+|helix-turn-helix-transcriptional-regulator MITYDKLWETMDKKGITKYKLVNDYGISKSMINRLNHNMGINTNTINNLCSILNCNVEDILTFCPDEEKA >NZ_CP040506.1|WP_034860053.1|3203581_3203920_+|helix-turn-helix-transcriptional-regulator MDNERKTLGKRINQTRKDRGITADKLSELCNINATYLRQIEGSGKTPSLPVFISICNSLKVSANYLLQDELDVSEISDIEELETLWETAEPSQYELVVSMLKAAIAHIKGEE >NZ_CP040506.1|WP_006782607.1|3202562_3203438_-|hypothetical-protein MKKLWDNIQKGYGNFWRDDRCDWNQSHLSQADKRSLWYGAVLLCALTVLFTELLYSYHFGKIERQNEKNIEMALNRITAYAASATDDEYEAIARTIRQDLIYSDFSRDKENYIRYIPNTAQICRLYPQTFPNQVYLLCNNTGMLYGLDIFEDDAAVSGATQASGDTQVSADTQFSGGTKVSGGYDDISEATLLITKMPGNKTGHARLDRTRGILSIQKMKSLFCDDCIRDIMAALDEYGTMNEFVIMDGKEKKFYPIKEGVLDIGDYHLELTYKDNGYDIAIQYSPVQQSR >NZ_CP040506.1|WP_006782606.1|3201967_3202345_-|Hpt-domain-containing-protein MTLKEAYEKLGGDYADTTCRIGEDMLLRLIGILLKDSNYTDICTSLKQQDYEAAFRAAHTLKGVTLNLGLSSLADKTAKLVETLRSVQDTNDIHLAFTDFDSAYRDMDTVFSELLASLALGGVKQ >NZ_CP040506.1|WP_006782603.1|3197959_3199903_-|extracellular-solute-binding-protein MSRGLKKCVRLILYTTMMSCILTGCGKQAPKEKIVVEILYNNHFKQVEKLVESTYDDIDLRIEISPYSSEELRRLERGVGPELVIAAQPDSDMVQKYLLDLSDTRASSAYDGTIMSDLKQDGKTYLIPLPGVYSGYVVNETMFEQAGISMPTSNTELVEALAKLKEKGLGVGEDHTNFSMRSDYNAEVGMFYVGCMIPDFLGTVEGVQWLADFKEKKAMFTGVWEESFVLPDELVNAGIMDPAAIARQRNSILCEQRLSNGTLAAAFGDSSLYYACVEQNQKEVLKGTAEAYSYRMLPLLGSEGNHPWFMFAPSALMGVNNAISEEKQEACKRIVDLLSTPEGQAALIQDMGPGISCLLEYQQQEDWIPAGVEEYIESGYIYNVLFPSKTIEYLGGCVRDVMAGKCTVEEALQDIDNYYYEGTGKSEYDFTVIGEMAHDLLMENFNTRREETEIGNFVADCVAEVSGAPIAVVNGGGIRASFYQGVVYGGDTAAVCPFDNRIIVVEMDGQTVWDMLENGLSTCTEEFPGGQFLQISGLHYTFDSSKPAGSRLVSVTWPDGTVLERSERFQVAVNDYMAGINSYAEGNGDGYTMLNCYDEETPKGSVSLVNEMEYYYRDAMALYFEEHRDEAVDVQLEGRIRDLAKEQ >NZ_CP040506.1|WP_034859970.1|3195342_3197946_-|hybrid-sensor-histidine-kinase/response-regulator MRRRQSLNIKQKEQREFITLRFASALILMTAILGVFAFVVYQNEAEKTVTNISSVYLEEMTTQISSHFQTNLDSQFSQIRTIAGAITEADLEQEASLQDFLEQAQEDNGFAHIAMISAKGIAYSPEGTTPVMSKISVLDKLLSGTEELVSVNETIWESNMILLGVPIPPVSFQGEKLTAVIIGIPTAEIGAKLGMESEKETNSYTNIVTRDGDFVIKSTFSNDGLYGSNLFSIYEKQAVFDKGYDMESFHADIQAGKCGMTLLTVGTHHEYLYYVPISGTNWYMVTSMAYETVNDKILYLSRFMVLVGLGIFSVVLLIIILFFLALRRIETRNQELLLVEKERAEAANRAKSDFLSQMSHEIRTPLNGIIGMTEVGRQHIGEPDRISHCFDKIILSSQHLLALINDILDMAKIESGKIELHLEKFDLGQLLQSLTTVFYVQAKHKKIDYEIYLRGELEEFLVGDALRLNQILTNLLSNAMKFTPEKGRVSLMIEELRRDEETIWLRFEVSDTGRGITPENLERVFETFTQENSGIARQYGGTGLGLPITKNFVEMMGGTITVTSEAGSGSIFRVDLPFGRIQEGEEEAFGYHQSVLVVNKDVELETHLANVLKRAGFTVYTVETEGGEPDMIPEKVKGNAPYDLCFLEWGCCDDIKRLAGVIRQESQNEALHIIITGYDQDELDDTASLCGADGTLCQPAFLVDIVQLMKRLEGETQTPVETENSAILRDAKVLVVEDNEINLYIAVELLQHTGAEVSTAKNGQEAVEKFAASPEGYYDLILMDVQMPVMDGYRATNTIRQLSRKDAGSVIIIAMTANSFYEDIRKCMDSGMNAHIAKPFVMEDVISTYTDVLTAEGKEGYDSTKNE >NZ_CP040506.1|WP_034859967.1|3193857_3195330_-|GntR-family-transcriptional-regulator MILKYDGMMYERVFQILKYKIESGLLPAGTSLPSRSDLCQELGTSEKTVRRALTMLEEAGLIETRQRKRPVVCAGRDEVHLTTRLALEKIDADITSDVLKTGVLLCYPIIKNGIALCEPEDLYIPRKIVEHMNIEDGEEFWKLSKRLWRFFVARNENDLSLQVVESLGLSDLKPLQDDRTVRARFYEQLKEFMRALEHGEAPESVHFDDMSGIYGLAEGERPAFRAAPDSAVLLGRKQLEKLLAGAEVRYSAVYMDILGLIAAERYRPGDKLPSHKELQTIYGVSVDTTIKAIQILQDWGVVRTVRGNGIFVEMDREELEKIQVPAHLIAYHVRRYLDSLELLALTIEGAAACAAPRITEQAIQEAKAEIIRQWEEEYLYERTPAILLKLITEHVGIDALNAIYMLLQRNFRIGRSIPGLLNTSKTPVNCEIHEKCVDVIELLSAGNQEAFSEKASLLFEDIYRLVIEECKRLGFYEAAVEIYDGSALWK >NZ_CP040506.1|WP_006782614.1|3209051_3209342_-|CRISPR-associated-endonuclease-Cas2 MGKSMNYNYAFVFYDVGEKRVQKVFKICKKYLSHFQYSVFRGEMTPSKLISLRSDLKKVIDTKEDFVCIIKLMNDNVFGEEILGEANGLTGEELIL >NZ_CP040506.1|WP_034860055.1|3209342_3210341_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYIMSMGELSRKDNSLCFRKDGKNVYIPIENTKEIYCLSEVSFNTKLLDFLAKNHVVVHFFNYYEGYSGSFYPRDQYNSGKLVIKQAETFRNSRMQVAKAIVLGIGQNMDEVLHHYYKHEKKEVKETIDWLRKEFKERVQKAEQVNELMSIEGEAWMRFYGDFKYFLPEDFVMNKRVKRPPDNPINAMISFGNTLLYVKTISSIYRTHLDQRISFLHEPSEGRFSLSLDMSEVFKPVIVYRTIFDLVNNRKIQVEKHFDKKVNYCLLNEEGRKIFIEAFEGRMESVFVHAGLKRKVSYRTAIKLDCYKLIKMILEGREFVPFSLKEGK >NZ_CP040506.1|WP_006782617.1|3210484_3213031_-|CRISPR-associated-helicase/endonuclease-Cas3 MQLNDVLNFEEPIYAHICEGKNAETLQQHTKLCQKYYKKLMDTKMLKLILGRFLQRYMENCTKEAELFFWEMLEGSIIFHDTGKINPAFQRERMKKPFKYQKDFQILEGSKHSLLSSVIYLDYCYYLLAQMTMSIDEKRKLKSLIYVNAYIISRHHDDLGAMREYGEKFLEGGQIYEMISRLPNEKQTLYKGPFHFNQENIGTVCKAFPGGGKKRESREDDRKGGMELYIYARLLYSLLTSADYYATTEYMNGFAINQFGEVNQVDELRRVYEACGVLKSIREYEKTNVGTKFVAENEINALRCQLFLEAEAEWKVHKEENVFFLEAPTGSGKSNTAMNLSFQMLKAGQTKLCYVYPFNTLVEQNLDSIKRIFGGNEDIMSMVTVVNSVTPIKIDEDKKKAMSENNSEFYQSALLDRQFLNYPFILTTHVGLFETLFSNKREALFGFLQMAGSVIVLDEIQSYKNTLWSEIIIFLKAFAEFMNMKVLIMSATLPDLEYLTEESGQVVRLMKRRDQYFLNPVFRERVQLSYEMLKEKTDFEQLHHHICDHVQQEKKVLVEFIKKKSAYEFYEYACEYGILGMELRLLTGDDNRLDREKILNEIRCSDKGVILIATQVVEAGVDIDMDIGYKDISKLDSEEQFLGRINRSCKKGGVTYFFDLDNAGDIYKDDFRINRELTLENEEMREALKNKQFAEYYLSVINLLKESRNKSASEEGLEHFFKEVKHGNFKEIAAHMHLIEENSWTMSVYLSRMIELPDGTQLDGEVCWEEYKKLLLNQELPYAKKQVLLSKVRSQMNYFIYEIKKNSNLVYSDRIGELYMIQNGDRYFENGKLNKQALEEAGGMFIEL >NZ_CP040506.1|WP_006782618.1|3213106_3213832_-|type-I-B-CRISPR-associated-protein-Cas5 MEILKFTLKGKNAFFKMPEVNTYYYFTYGNIHKVALLGIFGAILGYNGYAQMTEEDQYPEFYERLKDISISIVPQKGSKGYIPKKVQSFNNSVGYASQEQGGNLIVKQQWLENPCWEVYVKIDSSEAEAIKKAIMNHTCVYVPYLGSNDHPADICDAEVLTGEFINDEEIAYIDSLFPAAKVELDYEDDDVTPYKYSEYLPIALDEHSLMYCMEKFYVTNIPVLCHECDVCRVGGKNIVFY >NZ_CP040506.1|WP_034860056.1|3213833_3214796_-|type-I-CRISPR-associated-protein-Cas7 MNKRVYGVLGISSIMANWNADFSGYPKTTSDGQTYGSDKALKYPMKKMWENEGKPVIYIKSMCFEEGKKGEVNLIPRTLKERYEQVFGVELKKGGDVREVLKKLFQAVDIKNFGATFAEAGNNISITGAVQIGQGFNKYDGTEPQEQPILSPFRDPKAKEKSKKSEGDEGEEAKNSTLGTKIVSNEAHYFYPFSINPLAYKGYMELGVTEGYLESDYEMFKKAALTSATSFATNSKAGCENEFALFVETKEDFYLPTLTEYIEFEKGDVNTITITCADLFEQVKDKILSVEIYYNPYTTKIDPEKINGAKYYNILTQKEV >NZ_CP040506.1|WP_006782620.1|3214788_3216528_-|hypothetical-protein MIQDCLEIFKYKLDKYDDERLVLDNYVPKDGTYILIEMSEPQWNVKDTVAIRFNKKDGKLEGKTSSNYRLISTLDYYSKLIAMNKPVDPQKVIHSNNYLSFAVKKESIATGKLSSEVLDLYYEILKNPIQKYSKPNVRKLYEETEKMCGEVDRILAEQIHQWVRENLSKLEIDTSKKDYLKLYFIFPDEAKTKELYRKEGSRYTIPNIYNNNDFNCLINDEIYGLPNDNMGMNSKKVFLANKSKRVQVPYLLNREQVMLQAKFYDFLYGQASKGNLNIYFDENRKEIIPLKNGESPTADMSGYFIRLKKGMEAEIHNVDAVPCYNPHLQTTFFYQQYLETDQSDNYGMITDRKRLELLIDDVLFGKSLISNYFTDVGDITIKDGTLVQNLIMSRELLFSWFYKNDGVNPWPVLQKCSKTMIYNSINKGYWKKTRHQINLLWSLKDYFKKEEIMYPVVESLRKHINEKDDWMFDNDEEYYFAIGQMVSYFINKSKAAKKPLSFINPFLNAKDDDMIKSHLEVLFKKYDYDIMYMDLRVKRLFSNVMIHKPVEKIDTTMIAAGVAANNLIFEKKEAERNDE >NZ_CP040506.1|WP_006782621.1|3216540_3217239_-|CRISPR-associated-endoribonuclease-Cas6 MHYVFEIRIKIFTLQSISKEDSYAAVTDFIDGVLIENEVWEQMHNENCYKQYCFNGLYPIEKEGIYKREQVYQFIVRSTNKDLIEYLSYNLPKHENNLMKGLTCENRMISKKHITSLYSITPVIVKGKNNGYWRDDMTFEDFEQRLKVNLIKKYNELEHTKLDENFELHTLLEFKNYGPIPVPYKNVKLLADKIELKIADNETAQALAYMALGTGICEMNSRGMGFVNCHYV >NZ_CP040506.1|WP_006782622.1|3217601_3219017_-|ATPase-AAA MIEKLIAEATECDFKVALETRRPKSWLKSVSAFANGIGGTLFFGIDNEGKITGIEDIQSDAEAISRFIKERITPLPQFVLTPVREGDKDILLLSIAAGRTTPYYYKADGIMEAYIRVGNESVVAPDYVVNELILKGSNRSFDTLLTDARKEDYSFTLLEATYRERTGVRLETSDYFSFGLTNREGVLTNAGKLLADQYIVYNSRVFCTRWNGLEKGSIFDDALDDKEYEGNLIYLLQSSCDFVRNNSKVRFVKEARYRIDKPDYADRAVMEALVNALIHRDYIVAGSEIHVDMYDDRLEIQSPGGMFEGRPIQECDIDSIGSVRRNPVIADLFHRMKYMERRGSGLKKILSETRKLPGYTEQLKPEFFSTPSDFRVVLKNINYNMEEDTIQDTIQDTIQDTIQDKSKRMKEIIAYCKEARTREEIQSYIGIVNRAHFRRAYLKPLLKTGMLEMTLPEKPSSRNQKYISSHK >NZ_CP040506.1|WP_006782623.1|3220075_3220450_-|DUF3783-domain-containing-protein MREMVLYYNTVQNPNVAKLKGVLVRMGVRIKNITPEQVTQTVGYLAGIEGYPESEIPEVLPVIEEEMLVMRGFTSRRMDELLMNLRKAGVPKIALKAVVTESNCGWSFYHLYEEIREEHKKMSL >NZ_CP040506.1|WP_138670220.1|3220490_3221687_-|hypothetical-protein MAGAGNDEMLTEDKRAGAGSTGAVGMLAVAAAAEESLRAELEKRSNPRYDLMGNEYGAKLQKAIEKKDEEKQKLTDLRGEYLRVYLNRSFSLSSDDNSEYEKLLEKLSCDRLEEYRKSAAEQARSAVEHFKDDFMYKIRSAIREALIRKDELNRVISGLDFGKDKYQFYIGKNKGPDGQYYDMFMADSLEINPAQLDVSMDNQLDFFTMEHENHYGQMVNDLINVFIPPDNATPEELEEAKRNMDKYADYRTYLSFDMQQLVQNEDETIKIRLSKMIKKNSGGEGQNPLYVALLASFAQAYRINLKPKVQRNPTIRLVVLDEAFSKMDAEKVASCIQLIRGLGFQALISATNDKIQNYVETVDKIFVFANPNKKCISIQEFEREEFGELKADLVDGEG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_3 | 3219152-3219837 | TypeI-B |
NA
Consensus repeat of NZ_CP040506_3
|
10 spacers
spacers of NZ_CP040506_3
>3.1|3219180|37|NZ_CP040506|PILER-CR,CRT CTAATGTCCATGCCCTTGTATATCTTGCAAAAATTTA >3.2|3219245|38|NZ_CP040506|PILER-CR,CRT CCCTCCGGATGGCCGCCCTGTCTCCTCCAGGGCCTATA >3.3|3219311|39|NZ_CP040506|PILER-CR,CRT ACGTACATCCACTGTCAACGGGCATCTACGGGCCATGAA >3.4|3219378|37|NZ_CP040506|PILER-CR,CRT ACCGCGACGCCCATGGGACTGTCCGGGTCCATGTTGT >3.5|3219443|37|NZ_CP040506|PILER-CR,CRT ACCATACAACCTATTTCCCAGGCATCTCCACAGCAGA >3.6|3219508|37|NZ_CP040506|PILER-CR,CRT ACCCATCAGAGTGATACTCAATCTGCACCTTACCAGC >3.7|3219573|36|NZ_CP040506|PILER-CR,CRT ACGGATTGATTGTTCTGGTGGTGCTTTTAGCATTTC >3.8|3219637|39|NZ_CP040506|PILER-CR,CRT ACTGCAAGTTGTTTCGCTTTGTAATCATCAATCAATATC >3.9|3219704|38|NZ_CP040506|PILER-CR,CRT CCGGAAGGCGCTCACCTGTATTGCGGTATACAACCTCC >3.10|3219770|38|NZ_CP040506|PILER-CR,CRT CCTAGCCGACTATTTTAATGTTACTGTCGATTTCTTAA >3.11|3219182|35|NZ_CP040506|CRISPRCasFinder AATGTCCATGCCCTTGTATATCTTGCAAAAATTTA >3.12|3219247|36|NZ_CP040506|CRISPRCasFinder CTCCGGATGGCCGCCCTGTCTCCTCCAGGGCCTATA >3.13|3219313|37|NZ_CP040506|CRISPRCasFinder GTACATCCACTGTCAACGGGCATCTACGGGCCATGAA >3.14|3219380|35|NZ_CP040506|CRISPRCasFinder CGCGACGCCCATGGGACTGTCCGGGTCCATGTTGT >3.15|3219445|35|NZ_CP040506|CRISPRCasFinder CATACAACCTATTTCCCAGGCATCTCCACAGCAGA >3.16|3219510|35|NZ_CP040506|CRISPRCasFinder CCATCAGAGTGATACTCAATCTGCACCTTACCAGC >3.17|3219575|34|NZ_CP040506|CRISPRCasFinder GGATTGATTGTTCTGGTGGTGCTTTTAGCATTTC >3.18|3219639|37|NZ_CP040506|CRISPRCasFinder TGCAAGTTGTTTCGCTTTGTAATCATCAATCAATATC >3.19|3219706|36|NZ_CP040506|CRISPRCasFinder GGAAGGCGCTCACCTGTATTGCGGTATACAACCTCC >3.20|3219772|36|NZ_CP040506|CRISPRCasFinder TAGCCGACTATTTTAATGTTACTGTCGATTTCTTAA |
cas6,cas8b1,cas7b,cas5,cas3,cas1,cas2 |
CRISPR arrays and Neighbor proteins around NZ_CP040506_3
The CRISPR arrays of NZ_CP040506_3 >merge|NZ_CP040506|3|3219152-3219837|PILER-CR,CRISPRCasFinder,CRT ATTTAAATACAGCTACTGTTCTTCTTCACTAATGTCCATGCCCTTGTATATCTTGCAAAAATTTAATTTAAATACAGCTTCTGTTCTTCTTCACCCTCCGGATGGCCGCCCTGTCTCCTCCAGGGCCTATAATTTAAATACAGCTTCTGTTCTTCTTCAACGTACATCCACTGTCAACGGGCATCTACGGGCCATGAAATTTAAATACAGCTACTGTTCTTCTTCAACCGCGACGCCCATGGGACTGTCCGGGTCCATGTTGTATTTAAATACAGCTACTATTCTTCTTCAACCATACAACCTATTTCCCAGGCATCTCCACAGCAGAATTTAAATACAGCTACTGTTCTTCTTCAACCCATCAGAGTGATACTCAATCTGCACCTTACCAGCATTTAAATACAGCTACTGTTCTTCTTCAACGGATTGATTGTTCTGGTGGTGCTTTTAGCATTTCATTTAAATACTGCTACTGTTCTTCTTCAACTGCAAGTTGTTTCGCTTTGTAATCATCAATCAATATCATTTAAATACAGCTACTGTTCTTCTTCACCGGAAGGCGCTCACCTGTATTGCGGTATACAACCTCCATTTAAATACAGCTTCTGTTCTTCTTCACCTAGCCGACTATTTTAATGTTACTGTCGATTTCTTAAATTTAAATACAGCTACTGTTCTTCTTCACC >NZ_CP040506|3|2|3219152-3219835|PILER-CR ATTTAAATACAGCTACTGTTCTTCTTCA CTAATGTCCATGCCCTTGTATATCTTGCAAAAATTTA ATTTAAATACAGCTTCTGTTCTTCTTCA CCCTCCGGATGGCCGCCCTGTCTCCTCCAGGGCCTATA ATTTAAATACAGCTTCTGTTCTTCTTCA ACGTACATCCACTGTCAACGGGCATCTACGGGCCATGAA ATTTAAATACAGCTACTGTTCTTCTTCA ACCGCGACGCCCATGGGACTGTCCGGGTCCATGTTGT ATTTAAATACAGCTACTATTCTTCTTCA ACCATACAACCTATTTCCCAGGCATCTCCACAGCAGA ATTTAAATACAGCTACTGTTCTTCTTCA ACCCATCAGAGTGATACTCAATCTGCACCTTACCAGC ATTTAAATACAGCTACTGTTCTTCTTCA ACGGATTGATTGTTCTGGTGGTGCTTTTAGCATTTC ATTTAAATACTGCTACTGTTCTTCTTCA ACTGCAAGTTGTTTCGCTTTGTAATCATCAATCAATATC ATTTAAATACAGCTACTGTTCTTCTTCA CCGGAAGGCGCTCACCTGTATTGCGGTATACAACCTCC ATTTAAATACAGCTTCTGTTCTTCTTCA CCTAGCCGACTATTTTAATGTTACTGTCGATTTCTTAA ATTTAAATACAGCTACTGTTCTTCTTCA >NZ_CP040506|3|3|3219152-3219837|CRISPRCasFinder ATTTAAATACAGCTACTGTTCTTCTTCACT AATGTCCATGCCCTTGTATATCTTGCAAAAATTTA ATTTAAATACAGCTTCTGTTCTTCTTCACC CTCCGGATGGCCGCCCTGTCTCCTCCAGGGCCTATA ATTTAAATACAGCTTCTGTTCTTCTTCAAC GTACATCCACTGTCAACGGGCATCTACGGGCCATGAA ATTTAAATACAGCTACTGTTCTTCTTCAAC CGCGACGCCCATGGGACTGTCCGGGTCCATGTTGT ATTTAAATACAGCTACTATTCTTCTTCAAC CATACAACCTATTTCCCAGGCATCTCCACAGCAGA ATTTAAATACAGCTACTGTTCTTCTTCAAC CCATCAGAGTGATACTCAATCTGCACCTTACCAGC ATTTAAATACAGCTACTGTTCTTCTTCAAC GGATTGATTGTTCTGGTGGTGCTTTTAGCATTTC ATTTAAATACTGCTACTGTTCTTCTTCAAC TGCAAGTTGTTTCGCTTTGTAATCATCAATCAATATC ATTTAAATACAGCTACTGTTCTTCTTCACC GGAAGGCGCTCACCTGTATTGCGGTATACAACCTCC ATTTAAATACAGCTTCTGTTCTTCTTCACC TAGCCGACTATTTTAATGTTACTGTCGATTTCTTAA ATTTAAATACAGCTACTGTTCTTCTTCACC >NZ_CP040506|3|2|3219152-3219835|CRT ATTTAAATACAGCTACTGTTCTTCTTCA CTAATGTCCATGCCCTTGTATATCTTGCAAAAATTTA ATTTAAATACAGCTTCTGTTCTTCTTCA CCCTCCGGATGGCCGCCCTGTCTCCTCCAGGGCCTATA ATTTAAATACAGCTTCTGTTCTTCTTCA ACGTACATCCACTGTCAACGGGCATCTACGGGCCATGAA ATTTAAATACAGCTACTGTTCTTCTTCA ACCGCGACGCCCATGGGACTGTCCGGGTCCATGTTGT ATTTAAATACAGCTACTATTCTTCTTCA ACCATACAACCTATTTCCCAGGCATCTCCACAGCAGA ATTTAAATACAGCTACTGTTCTTCTTCA ACCCATCAGAGTGATACTCAATCTGCACCTTACCAGC ATTTAAATACAGCTACTGTTCTTCTTCA ACGGATTGATTGTTCTGGTGGTGCTTTTAGCATTTC ATTTAAATACTGCTACTGTTCTTCTTCA ACTGCAAGTTGTTTCGCTTTGTAATCATCAATCAATATC ATTTAAATACAGCTACTGTTCTTCTTCA CCGGAAGGCGCTCACCTGTATTGCGGTATACAACCTCC ATTTAAATACAGCTTCTGTTCTTCTTCA CCTAGCCGACTATTTTAATGTTACTGTCGATTTCTTAA ATTTAAATACAGCTACTGTTCTTCTTCA
>NZ_CP040506.1|WP_006782622.1|3217601_3219017_-|ATPase-AAA MIEKLIAEATECDFKVALETRRPKSWLKSVSAFANGIGGTLFFGIDNEGKITGIEDIQSDAEAISRFIKERITPLPQFVLTPVREGDKDILLLSIAAGRTTPYYYKADGIMEAYIRVGNESVVAPDYVVNELILKGSNRSFDTLLTDARKEDYSFTLLEATYRERTGVRLETSDYFSFGLTNREGVLTNAGKLLADQYIVYNSRVFCTRWNGLEKGSIFDDALDDKEYEGNLIYLLQSSCDFVRNNSKVRFVKEARYRIDKPDYADRAVMEALVNALIHRDYIVAGSEIHVDMYDDRLEIQSPGGMFEGRPIQECDIDSIGSVRRNPVIADLFHRMKYMERRGSGLKKILSETRKLPGYTEQLKPEFFSTPSDFRVVLKNINYNMEEDTIQDTIQDTIQDTIQDKSKRMKEIIAYCKEARTREEIQSYIGIVNRAHFRRAYLKPLLKTGMLEMTLPEKPSSRNQKYISSHK >NZ_CP040506.1|WP_006782621.1|3216540_3217239_-|CRISPR-associated-endoribonuclease-Cas6 MHYVFEIRIKIFTLQSISKEDSYAAVTDFIDGVLIENEVWEQMHNENCYKQYCFNGLYPIEKEGIYKREQVYQFIVRSTNKDLIEYLSYNLPKHENNLMKGLTCENRMISKKHITSLYSITPVIVKGKNNGYWRDDMTFEDFEQRLKVNLIKKYNELEHTKLDENFELHTLLEFKNYGPIPVPYKNVKLLADKIELKIADNETAQALAYMALGTGICEMNSRGMGFVNCHYV >NZ_CP040506.1|WP_006782620.1|3214788_3216528_-|hypothetical-protein MIQDCLEIFKYKLDKYDDERLVLDNYVPKDGTYILIEMSEPQWNVKDTVAIRFNKKDGKLEGKTSSNYRLISTLDYYSKLIAMNKPVDPQKVIHSNNYLSFAVKKESIATGKLSSEVLDLYYEILKNPIQKYSKPNVRKLYEETEKMCGEVDRILAEQIHQWVRENLSKLEIDTSKKDYLKLYFIFPDEAKTKELYRKEGSRYTIPNIYNNNDFNCLINDEIYGLPNDNMGMNSKKVFLANKSKRVQVPYLLNREQVMLQAKFYDFLYGQASKGNLNIYFDENRKEIIPLKNGESPTADMSGYFIRLKKGMEAEIHNVDAVPCYNPHLQTTFFYQQYLETDQSDNYGMITDRKRLELLIDDVLFGKSLISNYFTDVGDITIKDGTLVQNLIMSRELLFSWFYKNDGVNPWPVLQKCSKTMIYNSINKGYWKKTRHQINLLWSLKDYFKKEEIMYPVVESLRKHINEKDDWMFDNDEEYYFAIGQMVSYFINKSKAAKKPLSFINPFLNAKDDDMIKSHLEVLFKKYDYDIMYMDLRVKRLFSNVMIHKPVEKIDTTMIAAGVAANNLIFEKKEAERNDE >NZ_CP040506.1|WP_034860056.1|3213833_3214796_-|type-I-CRISPR-associated-protein-Cas7 MNKRVYGVLGISSIMANWNADFSGYPKTTSDGQTYGSDKALKYPMKKMWENEGKPVIYIKSMCFEEGKKGEVNLIPRTLKERYEQVFGVELKKGGDVREVLKKLFQAVDIKNFGATFAEAGNNISITGAVQIGQGFNKYDGTEPQEQPILSPFRDPKAKEKSKKSEGDEGEEAKNSTLGTKIVSNEAHYFYPFSINPLAYKGYMELGVTEGYLESDYEMFKKAALTSATSFATNSKAGCENEFALFVETKEDFYLPTLTEYIEFEKGDVNTITITCADLFEQVKDKILSVEIYYNPYTTKIDPEKINGAKYYNILTQKEV >NZ_CP040506.1|WP_006782618.1|3213106_3213832_-|type-I-B-CRISPR-associated-protein-Cas5 MEILKFTLKGKNAFFKMPEVNTYYYFTYGNIHKVALLGIFGAILGYNGYAQMTEEDQYPEFYERLKDISISIVPQKGSKGYIPKKVQSFNNSVGYASQEQGGNLIVKQQWLENPCWEVYVKIDSSEAEAIKKAIMNHTCVYVPYLGSNDHPADICDAEVLTGEFINDEEIAYIDSLFPAAKVELDYEDDDVTPYKYSEYLPIALDEHSLMYCMEKFYVTNIPVLCHECDVCRVGGKNIVFY >NZ_CP040506.1|WP_006782617.1|3210484_3213031_-|CRISPR-associated-helicase/endonuclease-Cas3 MQLNDVLNFEEPIYAHICEGKNAETLQQHTKLCQKYYKKLMDTKMLKLILGRFLQRYMENCTKEAELFFWEMLEGSIIFHDTGKINPAFQRERMKKPFKYQKDFQILEGSKHSLLSSVIYLDYCYYLLAQMTMSIDEKRKLKSLIYVNAYIISRHHDDLGAMREYGEKFLEGGQIYEMISRLPNEKQTLYKGPFHFNQENIGTVCKAFPGGGKKRESREDDRKGGMELYIYARLLYSLLTSADYYATTEYMNGFAINQFGEVNQVDELRRVYEACGVLKSIREYEKTNVGTKFVAENEINALRCQLFLEAEAEWKVHKEENVFFLEAPTGSGKSNTAMNLSFQMLKAGQTKLCYVYPFNTLVEQNLDSIKRIFGGNEDIMSMVTVVNSVTPIKIDEDKKKAMSENNSEFYQSALLDRQFLNYPFILTTHVGLFETLFSNKREALFGFLQMAGSVIVLDEIQSYKNTLWSEIIIFLKAFAEFMNMKVLIMSATLPDLEYLTEESGQVVRLMKRRDQYFLNPVFRERVQLSYEMLKEKTDFEQLHHHICDHVQQEKKVLVEFIKKKSAYEFYEYACEYGILGMELRLLTGDDNRLDREKILNEIRCSDKGVILIATQVVEAGVDIDMDIGYKDISKLDSEEQFLGRINRSCKKGGVTYFFDLDNAGDIYKDDFRINRELTLENEEMREALKNKQFAEYYLSVINLLKESRNKSASEEGLEHFFKEVKHGNFKEIAAHMHLIEENSWTMSVYLSRMIELPDGTQLDGEVCWEEYKKLLLNQELPYAKKQVLLSKVRSQMNYFIYEIKKNSNLVYSDRIGELYMIQNGDRYFENGKLNKQALEEAGGMFIEL >NZ_CP040506.1|WP_034860055.1|3209342_3210341_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYIMSMGELSRKDNSLCFRKDGKNVYIPIENTKEIYCLSEVSFNTKLLDFLAKNHVVVHFFNYYEGYSGSFYPRDQYNSGKLVIKQAETFRNSRMQVAKAIVLGIGQNMDEVLHHYYKHEKKEVKETIDWLRKEFKERVQKAEQVNELMSIEGEAWMRFYGDFKYFLPEDFVMNKRVKRPPDNPINAMISFGNTLLYVKTISSIYRTHLDQRISFLHEPSEGRFSLSLDMSEVFKPVIVYRTIFDLVNNRKIQVEKHFDKKVNYCLLNEEGRKIFIEAFEGRMESVFVHAGLKRKVSYRTAIKLDCYKLIKMILEGREFVPFSLKEGK >NZ_CP040506.1|WP_006782614.1|3209051_3209342_-|CRISPR-associated-endonuclease-Cas2 MGKSMNYNYAFVFYDVGEKRVQKVFKICKKYLSHFQYSVFRGEMTPSKLISLRSDLKKVIDTKEDFVCIIKLMNDNVFGEEILGEANGLTGEELIL >NZ_CP040506.1|WP_006782613.1|3206536_3206749_+|helix-turn-helix-transcriptional-regulator MIKYDPLWETMKKRNISQYKLIKDYGIDKAQLQRLRKNEVVKTIILNKLCEILDCRIEEILVYEPDITEE >NZ_CP040506.1|WP_006782612.1|3206083_3206425_-|hypothetical-protein MDKVRVESKNTDSYRFKERVYRLMNGTYDLDIYSVGEMNTVETEFSDGKYCEELYKDIFEANCRICERLGEEEDKDVEIIIHNYNLMTEYLCMKMFDYGVLFCKREIAKTVVV >NZ_CP040506.1|WP_006782623.1|3220075_3220450_-|DUF3783-domain-containing-protein MREMVLYYNTVQNPNVAKLKGVLVRMGVRIKNITPEQVTQTVGYLAGIEGYPESEIPEVLPVIEEEMLVMRGFTSRRMDELLMNLRKAGVPKIALKAVVTESNCGWSFYHLYEEIREEHKKMSL >NZ_CP040506.1|WP_138670220.1|3220490_3221687_-|hypothetical-protein MAGAGNDEMLTEDKRAGAGSTGAVGMLAVAAAAEESLRAELEKRSNPRYDLMGNEYGAKLQKAIEKKDEEKQKLTDLRGEYLRVYLNRSFSLSSDDNSEYEKLLEKLSCDRLEEYRKSAAEQARSAVEHFKDDFMYKIRSAIREALIRKDELNRVISGLDFGKDKYQFYIGKNKGPDGQYYDMFMADSLEINPAQLDVSMDNQLDFFTMEHENHYGQMVNDLINVFIPPDNATPEELEEAKRNMDKYADYRTYLSFDMQQLVQNEDETIKIRLSKMIKKNSGGEGQNPLYVALLASFAQAYRINLKPKVQRNPTIRLVVLDEAFSKMDAEKVASCIQLIRGLGFQALISATNDKIQNYVETVDKIFVFANPNKKCISIQEFEREEFGELKADLVDGEG >NZ_CP040506.1|WP_006782625.1|3224013_3224637_-|DUF4194-domain-containing-protein MINYYEELSPEEQLKVTQSIQLLYKQTFLLERKYDKKTGRFTGNRDFYVCNKHLEFIREYFRVMGIEVMENSQLGVIYVRGEAVVGDKLPKLATLYLLILKLIYDEQMASVSSSVNVYTTLSDMHERLGNYRLFKKQPSATDIRRAISLLKKYQIIEPLEMMDELEGHSRIIIYPCINVVLFGDDVRGLLESYGEGEDEDDSDETEI >NZ_CP040506.1|WP_006782626.1|3224633_3226013_-|hypothetical-protein MKQLLNEIPDNFWSLFRSKNRPIYIEALLQINEEYQYSNYFLSREICIQTLSDYFSKQKIFLEQDEMEDDFDLLEPMATRILNWLLRAGWLRKVDDYYSMTVNIVIPDYAAVFVDAFTQLCSDEGDATQVYIQNIYAILFSFKNDARANLSLLKTALVNTRKLNKTLQDMLHNMDKFFASLLEKGFYGDLLKEHLDGYVEEIVRRKYHILKTSDNFYLYKTDIKMWLNEMRQNPEWLSEVCERNRRMRGKSVEVRSVLEQIDLIERGFDDIEHRIANMDKEHSKYIRATVTRLNYLLNEEDNMKGLVIQLLNHLSLSDRQDEEIGEIGGMMNLSQFTILSDKSLYRPRRPRQDFTEHLSADEEPEELSKDEILKLNKIRNRYSRKQIEEFVFSHMTDGRMEVTPGTVSSDEDFEKLVLAYDYSTRKDSPYRVREQETEAIDNGRYRYPKLVFEKKRKNG >NZ_CP040506.1|WP_006782627.1|3226169_3226424_-|TfoX/Sxy-family-protein MGEIAKMVNLGEVIEKQLGEVGITTAEQLRETGSKQAWLKIKAIDDSACIHRLLAMEGAIRGVKKTALPEDVKEDLREFYRAAK >NZ_CP040506.1|WP_006782628.1|3226429_3227065_-|endonuclease-III MTKEELALEVVERLKKEYPEAGCTLDYNQAWKLLVSVRLAAQCTDARVNVVVQDLYAKYPDVESLAEADVDDIERIVKPCGLGHSKARDISGCMKMLRDEFGGKVPDDFDALMKLPGVGRKSANLIMGDVFGKPAIVTDTHCIRLVNRIGLVDGVKEPKKVEMALWKLIPPEEGSDFCHRLVFHGRDVCTARTKPFCDRCCLKDICGKIGV >NZ_CP040506.1|WP_006782629.1|3227143_3228052_-|FtsX-like-permease-family-protein MNFRTWRYLFKLGWKNLWYHKVYTAASALTMSACIFLFGLLFLAVLNVDSVLQRTEEDVYVAVFFDEDVAPERIDEVGNLIRNRAEVLRTVYTTADEAWDEFRADFFEETELMEGIFEDDNPLSASSHFQVYIKGIEQQESFVAYASSLEGVRKVTHSADTVRALVKMKDVISRVAMGSAGLLVLLSVLLIHNTLSVVIEAQKDKMHVMRLMGAREEFIKVPFCVQAFVMALLGLCVPLLLLFGCYRWGVGLVSSGLRLADGGVTLLPWEAVFPQLIVACVLLGVVTGVVGALSVLGKLKKR >NZ_CP040506.1|WP_006782630.1|3228041_3228737_-|ATP-binding-cassette-domain-containing-protein MDNRMIVLDHVTKVYGSQKALDNVSLEIKAGEFVFLTGNSGAGKTTMLELILKETEPTKGNLIVNGIQLSQLKERQIYRYRRFIGMVFQDFKLFPDFTVYENVAFAQRVIGAEPRDMKVSVRDALFKVGLEKKAGCYPGQLSGGEKQRTALARAMVNRPVLLLADEPTGNLDQRNAEDIMRLLEKINDQGTTILTVSHNQDLVKSMKKREISVRYGKVIRDSGKGGLSYEF >NZ_CP040506.1|WP_006782631.1|3228765_3229830_-|flagellar-biosynthesis-protein-FlhB MAAEEKTEKATPKRRQDERKKGNVFQSNDVAAVASILVLFNSLGALAPGIYKNLKSSVELFFSYAADRNFHLTDMNVQETMGRAMIYFASAALPLLLIGVLTAVIVTFFQTRMAFSWEVMKFKLERISPMKGFKRMFSMRALVELLKAVVKITCLIVAIYLFVKSRMHEFARLMDGSVAGAVAYTGKTAIALVNTVGIAFIFVAGFDFLYQWWEYEKNLRMSKQEIKDEYKQMEGDPQIKGRIRERQRQIASRRMMQNVPKADVIIRNPTHFAVALGYDSNAHRAPVVLAKGADRVALKIVEIGEENGVYIMENPPLARGLFAAVEVDMEIPEEYYQAVAQVLAFVYKLKKKKV >NZ_CP040506.1|WP_006782632.1|3229909_3230689_-|flagellar-biosynthetic-protein-FliR MSQDVLQNFDIFLLVLARMAGMVLVNPVFGRKGLPMMVRMGLVLSLSLFVLPAAELQAVAVSGLTTFGMAEAIIKEVMMGLAIGYVFQLFFSMLYVAGDVLDTLFGFSMGKVMDPISGIQSSVFAQFINVFFFLYFFATGSHLLMVKIFAYTYEVVPVGVTGFVSNALLSYLINLFGSVFGMVIRLTLPFAAAEFVLEVTMGVLMKLIPQIQVFVINIQAKILLGLLLMMLFAYPVGAFLDTYISSMMTEVQTVMMSFR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_4 | 3549713-3549844 | Orphan |
NA
Consensus repeat of NZ_CP040506_4
|
1 spacers
spacers of NZ_CP040506_4
>4.1|3549753|52|NZ_CP040506|CRISPRCasFinder AGAACTTAGAGCGCTCGCTTTTTGGGGATTCGCCCCACAAAATTTAATACGT |
CRISPR arrays and Neighbor proteins around NZ_CP040506_4
The CRISPR arrays of NZ_CP040506_4 >merge|NZ_CP040506|4|3549713-3549844|CRISPRCasFinder GTTTCTGTAGCTCAGCAGGATAGAGCGTCCGCCTCCTAAGAGAACTTAGAGCGCTCGCTTTTTGGGGATTCGCCCCACAAAATTTAATACGTGTTTCTGTAGCTCAGCAGGATAGAGCGTCCGCCTCCTAAG >NZ_CP040506|4|4|3549713-3549844|CRISPRCasFinder GTTTCTGTAGCTCAGCAGGATAGAGCGTCCGCCTCCTAAG AGAACTTAGAGCGCTCGCTTTTTGGGGATTCGCCCCACAAAATTTAATACGT GTTTCTGTAGCTCAGCAGGATAGAGCGTCCGCCTCCTAAG
>NZ_CP040506.1|WP_006778088.1|3548786_3549665_+|DUF3881-family-protein MHKFLRTVGFSMYQKKRDIDKLIQGLAEDRDKMRILQLDSEESLCELRVETAPGMGIAIVGGLDERDRFDVEYYYPYFVSHERSSIADCSIQRHTEKETYAGLLDDYRVGISLIYYVENMMEYRTRELAHESVDVDYVSLSGLCVNGKVLLPIQKTQKQIEMAKVASKDRNNLLEAAKNGDEDAMETLTIEDIDLYSQVSKRMIKEDIYSIIDTCFLPCGIECDQYSVIGDILHIDVFKNRITEEEVYDFTLDCNDIIFHTAINKKDLIGEPKVGRRFKGQIWMQGTAKFKS >NZ_CP040506.1|WP_006778087.1|3547304_3548639_+|NADP-specific-glutamate-dehydrogenase MSYVDEIYARVVEQNPGENEFHQAVKEVLDSLKLVIDANEEKYRKVALLERLVEPERVISFRVPWVDDNGQVQVNKAYRVQFNSAIGPYKGGLRFHPSVNQGILKFLGFEQTFKNSLTGLPIGGGKGGSNFDPKGKSDREVMAFCQSLMTELYKYIGKDQDVPAGDIGVGAREIGYLYGEYKRITGLYEGVLTGKGLTYGGSLIRTQATGYGLVYILDEMLKNNGKELSGKTVLVSGSGNVAIYAVEKVHELGGKVVAMSDSNGYIYDKDGIKLDIVKDIKEVRRGRIKEYVDAVPTAVYTEGKGIWTIPCDIALPCATQNELNLDDAKALFENGCFAVAEGANMPSTREATDFFVEKKMLFMPGKAANAGGVATSALEQSQNSQRLSWTAEEVDAKLKGIMVNIFAKADDAAKRYGVAGNYVAGANIAGFEKVVEAMMAQGVV >NZ_CP040506.1|WP_006778086.1|3546563_3547064_+|DUF4446-family-protein MENSMLSSWSIDPAFIILGLGVVTLILLVITIVCVVQIRKLYRRYDIFMRGKDAETLEDTIFGLIDELKEMKAEDKANKEAIRVLTRNVRGTYQKFGMVKYNAFKGMGGNLSFAFALLDLNNTGFVLNSVHSREGCYLYIKIVEKGETEVLLGSEEKEALEQALGY >NZ_CP040506.1|WP_006778085.1|3545590_3546541_+|ParB/RepB/Spo0J-family-partition-protein MAKRTGLGKGLGAIFGEDVMDSAQADQLKEEKGEYKTGREKTVKAGSKEEEETGKEITLKLSQIEPNTGQPRKDFNPEMIQELAGSIRQYGVLQPLLVQKKGDHYEIIAGERRWRAAKEAGLKEIPVVIREYTKQQTMEIALIENVQREDLNPIEEAQAYQQLMQEFDLTQEEIAARVSKNRATITNSMRLLKLDKRVQEMLTQGMISSGHARALLALEDGEQQYQVALKIASERLSVRDVEKLVKQLSKPKKAKKVEEEERDLSFIFKDLEERMKQIMGTKVNINKKDRNKGRIEIEYYSEAELERLVELIESIR >NZ_CP040506.1|WP_006778084.1|3544820_3545591_+|ParA-family-protein MGRIIAIANQKGGVGKTTTTINLSACLAEAGQKVLLVDFDPQGNATSGVGLEKGYIDKTVYELLVDECQIEECLVKEVQENLDVLPSDVNLAGAEIELLDLEDKELLLKQQLDKIKDDYDYILIDCPPALSLLTINALTAANTVLIPIQCEYYALEGLNQVLKTVGLVHKKLNPNLETEGVVFTMYDARTNLSLEVVESVKSTLNQNIYKTIIPRNVRLAEAPSHGIPINLYDSRSTGAESYRLLAAEVMSRGEDI >NZ_CP040506.1|WP_006778083.1|3543637_3544588_-|alpha/beta-fold-hydrolase MTNKNKLLTMLILSSSAVAATALINKCIKISATSKNILEEPESFCYRWRFGNIHYTKSGSGKPLLLIHDLDAASSGYEWNQVVSSLSKEYTVYTMDLLGCGRSEKPCLTYTNYLYVQLIADFVKSEIGHRTDVISTGHSSALAIMACNNNPELFDKLLLINPDSILTCSQIPGKYAKLYKGFLDLPVIGTLLYHIATSKQAIRESFITQYFYNPYSVRESYVNSYYEAAHLNLSPKSVYASVHCNYTKANIVNAIKKIDNSIYIIGGAGMDNIKDLLNEYTIYNPAIEYTLLPDTKYLPQLEKPAEFVSTVKMFFS >NZ_CP040506.1|WP_006778082.1|3542793_3543519_+|16S-rRNA-(guanine(527)-N(7))-methyltransferase-RsmG MFDKFTELMREELSEFSIELSEHQLHQFYQYFELLVEWNKVMNLTAITELEDVVTKHFVDSLSLVKAVSDLSDEKILDMGTGAGFPGIPLKIAFPELKITLLDSLNKRINFLNEVIGQLQLGEIQAVHGRAEDYGRDKLYREQYDYCVSRAVANLSTLSEYCMPYVKIGGAFIPYKSGKIEEELNQAKGAVKLLGGKIEEVVTFVLPKTDVERSFVIVRKTEGTSKKYPRKAGLPSKEPLK >NZ_CP040506.1|WP_006778081.1|3540881_3542804_+|tRNA-uridine-5-carboxymethylaminomethyl(34)-synthesis-enzyme-MnmG MPNLEETYDIVVVGAGHAGCEAALACARLGLETIMFTVSVDSIALMPCNPNIGGSSKGHLVRELDALGGEMGKNIDKTFIQSKMLNESKGPAVHSLRAQADKQEYSRNMRQVLENTDHLTVRQAEVSEILVEDGRIQGVRTYSGAVYHSKAVILATGTYLKARCIYGDVSNATGPNGLQAANHLTDSLKAHGVEMFRFKTGTPARVDRRSIDFSKMEEQFGDERVVPFSFSTDPESIQKEQVSCWLTYTNSNTHEIIRANLDRSPLFSGAIEGTGPRYCPSIEDKVVKFPDKDRHQVFVEPEGLYTNEMYLGGMSSSLPEDVQYAMYRTVPGLEQVKIVRNAYAIEYDCINALQLKPTLEFKKIEGLFSGGQFNGSSGYEEAAVQGFMAGVNASMKILGREPYVLDRSQAYIGVLIDDLVTKENHEPYRMMTSRAEYRLLLRQDNADLRLRKIGYEIGLVSREDYEKLVEKEKNIEREVDRLEHTNIGANKQVQEFLESHGSTALKTGATLAELVRRPELNYFMLTEIDSERPDLSADTAEQVNINIKYEGYIKRQQQQVSQFKKLERKKLDEKFDYNSVKGLRREAIQKLNAHKPVSIGQASRISGVSPADISVLLVYLEQQRHQHQESCTELEEENVR >NZ_CP040506.1|WP_006778080.1|3539483_3540857_+|tRNA-uridine-5-carboxymethylaminomethyl(34)-synthesis-GTPase-MnmE MKTDTIAAIATAMSSSGIGIIRISGEQAFSVLQEIFRTKQGKKLDKIVSHRVHYGHIYDGNEMIDEVLVLVMRGPHSYTAEDTVEIDCHGGVLMMKKILETVIKYGARPAEPGEFTKRAFLNGRIDLSQAEAVIGVINAKNQYALKSSVSQLAGSVSDRIKRLREQIIYEIAFIESALDDPEHISLDGYGEKLLGNLEPMIQEMEKLVSSADNGRVMTEGVRTVILGKPNAGKSSLMNVLVGEERAIVTDVAGTTRDTLEEHIRLQGISLNIIDTAGIRETEDVVEKIGVLKARNMADEADLIIFVVDASIPLDENDEEIIELIRNKKAVVLLNKTDLEMTVTKEYLEEKTGHVVIPVSAKEETGIELLEQEIKSMFYQGEIDFNDEVTITNVRHKTALVEALASLRMVRQSVCDGMPEDFYSIDLMNAYEVLGSVIGEAVEEDLVNEIFSKFCTGK >NZ_CP040506.1|WP_006778079.1|3538486_3539404_+|protein-jag MNTITVSAKTLDEAITKALIELGTTSDNLDYTVIDEGSAGFLGIIGAKPVKISAKKKRELDTLDDFLDKDQEAKKQQEAAKREAKAAQKAAKPVEKKPAKPVRENKPVKEEKVYKEEKVVKEEKAAPVENQEKPVVSSKKSVDGTVYEETAKKFLVQMFAAMNMEVEITASYHEGDKELYVDMSGADMGILIGKRGQTLDSLQYLVSLVVNKDCDGYVRVKLDTENYRARRKDTLETLAKNIAYKVKRTRRSVSLEPMNPYERRIIHSALQNDKFVITRSEGEEPFRHVVVSLKRENRENRDKNN >NZ_CP040506.1|WP_006778089.1|3550136_3551549_-|tyrosine-type-recombinase/integrase MASIVKRGKTYSVVYYEGTGDKRQQKWESGLTYSAAKSMKAKIEHEQAQQTTGDESKNRLKEMTISEFLYEFIEKYGYKKWAASTYDGNVGLLENYVHPHIGDKKLLSLTTKMIDDYYDFLEKEAEPATNMGKPTREHITASTIHDIHKILRCAFNLAVRWDYRKKNPFLNATLPEHKEQERVVLEPNQILKVLKYTCRPDNYDYYLIHCAVLIAIGCTIRGGEIGGLQWDRVHYEKMIFHIDRAIDRISKKNLKLPKVRILFKFPNLIPGAKTCIVLKQPKTDNSARDVDVPQMVLNSLQILRQMQEKLKAELGSDGYIDYNLVICQANGRPMMTEHLNKRFKEILVEMNDPEIKAEEIVFHSLRHTSATAKLFVSQGDFNSVMQAGGWANLEMLTRRYGKHSFQDNREKLAHKMDDFLGNGLEEASGNDGGTVIAQPGAIEQALQTLFQANPDLLIQVIQSVQSANKE >NZ_CP040506.1|WP_006776437.1|3551563_3551770_-|helix-turn-helix-domain-containing-protein MAVGEFNHEKQAVSEKRTYSVQEIADILQISRSMAYNLCKQSLFKTVKVGKYVRVSKPSFDEWLDTRK >NZ_CP040506.1|WP_006776439.1|3552376_3552733_+|winged-helix-turn-helix-transcriptional-regulator MEKRLFDSELKVMETLWENGELSAKQIAELLRQQIGWSKTTTYTVLKKCIDKEIIKRSDPNYICSACISKEDVRQYETHELINKMYDGAPDKLVASIIGNEKMDKDMIRHLKELIQNL >NZ_CP040506.1|WP_006776440.1|3552746_3553967_+|M56-family-metallopeptidase MEGIIAAHVSGSIMICFILLLRKLFVFHFVGSAWAIFWKILTLRLLCPFTIRLPGIEHFFITEKKNSHIRDTAERIVQYENNIPNELAIMIPIIWGIGGLICMGKFVIPHIKNRKVYQMALPLENESVAIWIKRQSLRRKIYVKVSDRIITPLTYSIWKPVILLPRMDGEIDELHLEQILEHELIHIKRFDVLFKWLLAFICAVYWVNPFIWIMYSFANRDIELACDEAVLKSRSKDYKKSYILTLIYLEEKRVRGDFLCNFFSRYPMEERVQIMIRNGDKKALKNMILPAIAALLIALFSISSMAGEYDGEWSPKNERRDNRNLTATTTDIQMQLPIFQKRLPDNFGSLSGGDFDIPTIIIRKSKENYSACAVDKNGTVIYEESGTTKNVEATLEHIYNKLFQRN >NZ_CP040506.1|WP_006776441.1|3553978_3554425_+|hypothetical-protein MRIGKHFWVLIIAVLISLSAISSVMAEEINSNVTDIPTLEITRDETGCYSVLAKDHEGSIIFTEKGLTGTIEDIIEHSYSNIFRAATKSCTHIPCNHEIVTGGINHVIDWDTDICTMITNDFYRCACCDQILGIVPGSTTVVGTHPAH >NZ_CP040506.1|WP_138670240.1|3554511_3554898_+|hypothetical-protein MSSSFPAYAVSTDSTVVGGWDEDTGYFVNADAYNKAMEKRGLLRSDPVHEGERQRKDQSGNTYFRAHGWTSWPGVYHYTRARMETYGGSILTDSGRKWGTSETQATSPWHKFDPDVSDRARTYYGSEE >NZ_CP040506.1|WP_006776443.1|3555031_3556261_+|hypothetical-protein MIKSSRIYNFFFKNSFSVLCIMLLTVFIFIVGITLTSLTISKPTSENKEHVSIGEKRYTLIDNFLDADSFREFRHNDKKVNMLGDFYNKLTGMDNAKLLSMFNQSVVIDDFQGDEKFYYHTKEFRDKFPDAELAIKSMQMNQLAFEHNKLKVKKGNMPIWNKISFNDNTFPILLGSSYEGIYNIGDIVKGSFYTKNINFQVIGILEDNTQVYYKTDPAYMLDEYIIIPYPAMAWTVNPNDFVFEGILYFALVNSDIVIDSDEKNFLTGIRAIANSTGFVDFSLVGIDDQIIKNQELIFMISEHQRLIGCILVVMYIILTAVLYCQLKVHLKKNDISGQPVFNGPSDRKKFFRKYSMFYVISFILSLVLQLRLIPRIFLGVFAAELLILGSVYLIVSLAYYKMFLKENMK >NZ_CP040506.1|WP_006778091.1|3556385_3556928_+|sigma-70-family-RNA-polymerase-sigma-factor MERDKLCSKIRDYKNGNRIALNEIISQMTPLVKKFARKCFFMEYDDAFQEFSMVLIEAVSKIRTYENDGQCIVYINTCFKNKYCLLCKNYYIYKEIEELYENKDIPSQIECFSDIIFIIDISRYINQIECDSHKKIAELYLVEGKSDREISETLCISRQYVNRVRRRLLNDLRTEYFMQK >NZ_CP040506.1|WP_006776966.1|3556966_3559540_-|CHAP-domain-containing-protein MLLKNGDTGIQVKYLQQGLKIMCCNPGSIDSAFGPGTQAAVEKFQEEWGLTVDGIVGNDTWNCLLAEIKPIQQALKNKGFYTGAITGIAKDSTYNAVIRFQSSRDLTADGMVGAATRARLFNEDQGGGDESMLPLSIGDRGDYVLYLQYGLRILCCSPGALDGVFGSGTAEAVKKFQAKYGITDNGIADTTTWNTLKGQITDIQSRLLERNYSIAIVDGLATSALVETIKKYQEANWLTADGQVGPATYELLFSDVEDGATDALPLKTGSRGPRVLYFQYALRISCINPNGTDGVYGPGTKSAVDRYKTRKGLTADGMVDTVTWEKMRDEIRPLQTALVNRGYDVGFVDGIATEKVYNSVLQFQTDHNLVADGMIGNATKALLLGGTAGGGTVSSTLKLGSNGSLTRYLQRLFNELGYQIPIDGIFSQETHNAALSFQTTHGLEADGIVGGGTWRKLFEVYRVDVPGTGVEKLLNVVKHELAWGFAEDNANNITPYGQWYEMNRSPWCAMFVSYCAYQAGVLDTLVPKFAWCPSGMTWYKNRQKYHKRNSGYIPKKGDVIFFYNDELGRVAHTGIVVDGDENYVTTIEGNTTIDAVEQRTYNRNHSTIDGYGDNGGEAIELPAPPTEEEINEILVDHYREFLDACYIILPSEQITLNYEATIPMPPNGKALVEASADTTIFDNSINNPNAVTFDVEGGIAMSQEIALSEALTLTFEESGLEDAQSLADIVFDINMSLDTGASVVASGIRTEADGTWFYISYAVKKEVQIADGYPPVNFVFKYTLCLKSDDSAGARFFELVEEFVTEYRKEINVVVGVAAVIGLAIAFKALLLAGGISGLIAATKAVLGAAAKVAIVA >NZ_CP040506.1|WP_006776967.1|3559676_3559910_-|hypothetical-protein MEELIVNLVQSQGIWAVLFVFLLLYTIKKNDKLDELQEARERKYQELLTQLTVKLSIVNTVNEKLDTIQAVLKEKSD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_5 | 4011840-4011944 | Orphan |
NA
Consensus repeat of NZ_CP040506_5
|
1 spacers
spacers of NZ_CP040506_5
>5.1|4011867|51|NZ_CP040506|CRISPRCasFinder TGCGGGCAAGTTCTGCTCGGAGTGTGGCAGCCCCAAACCGGCGCCGGCTTC |
WYL |
CRISPR arrays and Neighbor proteins around NZ_CP040506_5
The CRISPR arrays of NZ_CP040506_5 >merge|NZ_CP040506|5|4011840-4011944|CRISPRCasFinder CTGGACCTGTCAGTGCGGAACGGTGAATGCGGGCAAGTTCTGCTCGGAGTGTGGCAGCCCCAAACCGGCGCCGGCTTCCTGGACCTGCCAGTGCGGAACGGTGAA >NZ_CP040506|5|5|4011840-4011944|CRISPRCasFinder CTGGACCTGTCAGTGCGGAACGGTGAA TGCGGGCAAGTTCTGCTCGGAGTGTGGCAGCCCCAAACCGGCGCCGGCTTC CTGGACCTGCCAGTGCGGAACGGTGAA
>NZ_CP040506.1|WP_006778451.1|4009829_4010663_+|deoxyribonuclease-IV MLTVGCHLSSSKGYLSMGKEAVKIDANTFQFFTRNPRGGKAKDLDVQDVESYLEFAREHGIERILAHAPYTLNACSADEGLREFARNTMEDDLRRLEYTPGNCYNFHPGSHVKQGVEVGITYIAQMLNEILKPEQTTTVLLETMSGKGSEVGRNFEELREILDRVELDSHMGVCLDTCHVWDGGYDIVNHLDEVITEFDRIIGLDRLKAIHLNDSMNPLGAHKDRHAVIGGGHIGEEALVRVINHPALKHLPFYLETPNDLDGYAREIALLRKLWYD >NZ_CP040506.1|WP_006778450.1|4008790_4009684_+|ABC-transporter-permease MKINPVYKRETMVSSRSFRMSLIVLVFNSVLAVVALLSMYSVIARVKVTAEIRYSSFLELYTFVATMEFIMLMFIIPAITAGSISGERERQTLELMLTTKMTPAEIVLGKLFSSLSTVAMLIISSFPVLALVFIYGGVRIPDVGMLLLCYVTTAFLAGCLGICFSSIFKRSTLATVVSYCVIILLVAGTYAANRFALSLSQATVDTYLVNVESMAQQANSGGLLYLMLLNPAVTFYVTINGQVGNDQVVNNITRWFGERPANVVTENWNLFSIGVQLALAVLFLWIAIRKVNPRKKK >NZ_CP040506.1|WP_006778449.1|4007850_4008801_+|ABC-transporter-ATP-binding-protein MLKIENLKKTYGKVSALDGLNMNIGESSLYGFVGPNGAGKTTTIKIITGLLLPSSGTVTVNGVDAVREPEKLKESIGYVPDFFGVYDNLKVSEYMEFFASCYGLDGLKARKRYMELLGQVGLDEKVDFYVDGLSRGMKQKLCLARALIHNPSLLIMDEPTSGLDPRTRYEFKEILKELREQGKTVLISSHILSELSEICTDIGIIEQGKIVLEGNMEEILSRINTSNPLIISVFGGRETAMTILKSHPLVETITIREEDIVVGFTGDKQDEANLLAQLVDADVLVYGFVRERGNLESVFMQITDHEEDEVVLIHEN >NZ_CP040506.1|WP_034857837.1|4005397_4007851_+|hypothetical-protein MKKTKRWAAGLMLLALLVFHVFPAWAVEESSTQTPLIESKITMDVNYGYDNTAKGGRYIPVEVALHNTEEEAFDGQLQVLTMESDYNIYRYDYPVYIEGGASVDKMMDIPLGNRIDQLFINLVDGAGNQVIHKRVKLNVSSEVPELFIGILSDTPEKLQYINGVGVDYSMLRTRTFVMDEENFPEDEIGLNLVDVLLISNYRIRDLSEMQSQALVEWVRSGGIMILGTGARVDDTLGRFAPELLDESYDAPELVQVDMAQDFEAEGPGNAVLEMVCADFSLSGANVIFSDDQLALLATVAYGKGTVAVAAYDFVDIAEFCQRNPSYIDALFTNVLGEDKINRLAESAYSGNSNQYWSANNMINTGNVDRLPDIPLYTMEIIIYIFLVGPGIYIFLRQRELNRYYRSAIVLLSLTFTAIIYLMGSRTRFQDTFYTYARFLDTSEDTVNETTYLNIQTPYNNPYTMKLDPRYSIKPITRSYYDNMSSIPKFTGNEDYKVAIRYEADATTVSAQNVIAFEPKYFQLDKMEANVKGIGFTGTIVMFEDEVTGSVTNSFKEPVEDAALLFYDKMILLGDMEPGETKKLDDLELLQVPLAHNNQIAEKITGKDQYEKPDINSRDYMDALTRTNLLICYLDNSVTSYTTNARVVGIINQPEDDPLHLDTYEVEGITVVSSSIPVYQDEDGVVYRSALMRKPTVISGSYYNMSNTLYGIDPLTIEYSLGNDIEVEKLYIRYVSESFTETASAGSLTPFTGSIYFYNHNTGNYDKMNERQLMYTREQLDDYLSPGNTIMVKYLYSNVSEYSWDILLPMLDIVGREY >NZ_CP040506.1|WP_138669841.1|4004266_4005391_+|HAMP-domain-containing-protein MSRRFRTRVITNIMYSTVITCLVEVFLVTNLSMLGNYALKAGRDTSFLAMFANAGSLVTIVYVLIGIVMFAITFLLMQEKSIRYIDRISAAMQNISEGDLNTTVEVIGDDEFSGMAANLNKMVEDIRELMDKERESERTKNELITNVAHDLRTPLTSIIGYLELLSGPAQMSPEMQKKYIDITYTKAKRLEKLIEDLFGFTKLNYGKISMKISKVDIIKLLSQLLEEFYPNFEEKNLSYELQSNVPAKVISADGNLLARLFENLVGNAIKYGADGKRILVRVHATEQIVTVSVTNYGYVIPKDELPMIFDKFYRVEQSRSTNTGGTGLGLAIAKNIVDMHGGTIGVTSDLNGTVFTVRLQVNFDINKENFGKLG >NZ_CP040506.1|WP_006778445.1|4003560_4004265_+|response-regulator-transcription-factor MSQINILVVDDEKEIAELVEIYLVSDGYKVFKANNAQEGLDILEKEDIHMVLLDIMMPGMDGLEMCKKIRETNNIPIIMLSARSTDLDKILGLGTGADDYVVKPFNPLELTARVKSQLRRYTQLNPNSGSQETEKNEIAIKGLVINKDNHKVLVYDEEIKLTPIEFDILYLLASNPGRVFSTDEIFEKVWNEKVYEANNTVMVHIRRLRGKMKEDSRQNKIITTVWGVGYKIEK >NZ_CP040506.1|WP_006778444.1|4001232_4003371_+|RNA-degradosome-polyphosphate-kinase MAELNAYYTKTENYVNRELSWLEFNYRVLSEARDKNLPLFERLKFLSITASNLDEFFMVRVASLKDMVHAGYTKPDLAGLRASEQLVKIGEKTHEFVNMQYSTYNRSLVPTLRQNGLRIVEHHEELTEAEAGYVDEYFEENIYPVLTPMAVDSSRPFPLIYNKSLNIAALLQKKDGEGDLDFATVQVPKGLPRIVEIPSSGKERVVILLEEIIERNIHSLFLNYNIISAHPYRIMRNADLTIDEEEAEDLLVEIQKQLKKRQWGEAIRLEIEEKTDKRLLKRLKKELELGSDDIYEISGPLDLTFLMKMYGLSGFEELKTPKYMPQQNPAFMNDDDIFANIRKGDILLHHPYESFQPVVEFIQKAAKDPEVLAIKQTLYRVSGNSPIIAALAEAADNGKQVSVLVELKARFDEENNIIWAKMLEKAGCHVIYGLLGLKTHSKITLIVRREEDGIRRYVHLGTGNYNDSTAKLYTDCGLFTCHPQIGEDATAVFNMLSGYSEPLHWNQLIVAPIWLRKRFTRMIRREAENARAGKTARIIAKVNSLCDRDIIGTLYEASCAGVQIDLIVRGICSLKAGVPGLSENIRVRSIVGNFLEHSRIFYFENDGAPEIYMGSADWMPRNLDRRVEITFPVLDEELKQKVLHILQVQLDDNVKAHILMPDGTYEKIDKRGKALVNAQDTFCEEAVQAVKDELDRRDPVSNRVFVPIESHN >NZ_CP040506.1|WP_006778443.1|3999684_4001229_+|HD-domain-containing-protein MATHIFAAIDVGSFELELGIYEISTKNGIRQIDHLRHVIALGKDTYNTGKISYELVDEMCQILAGFKSVMDTYRVEAYRAYATSAMREAKNNQIILDQILVRTGIEVEIISNSEQRLLSYKAIAVKETEFSKIIQKGTAIVDVSFGSVQISLFDKDALVSTQNMKVGVLRLRELLNRIQAETRVQYSLVEELVDNELITFKKIYLKDREIKNIVGIGESILYLFRAAGGSEVQKVEKIGIAEFKKFCERLVTLPVSQIEDEFGVNADYATLLVPSALIYKQILEMTGAEMLWIPGIRLCDGIAAEYAEKIRLVKFGHGFEDDILAASRSMSKRYRCHTSHIQNIEGFAVKIFDSMKRFHGLGERERLLLRIATILHDCGKFVSMSNPSQCAYNIIMATEIIGLSHREREIIANVVRYNTAEFDYNQVHVENGDAEGATILVAKLTAMLRLANAMDRSHKEKMENCKLAVKENQLVISTSYEGDLSLEMIAITQKADFFEEIFGIRPVLKQKRRV >NZ_CP040506.1|WP_006778442.1|3998241_3999540_+|ATP-binding-protein MNTKQLIIYRDFQYQRLFDDMTLLLGRDENACDGTMPDSFSCASQLIELAAVYGFEGNLWHCFLALCVANHENAYSTACEIRGAVDGTLNNLALQDFRILKQIFDYDITTLNRFTDGSELWNYLAAYKAADGGVGKVFNKRIRDRIIELSLSLAKAESAEEFQDTTTEFYKEFGVGKFGLNKAFRIVEEKGKACIEPIVNVEHVYLDDIIGYELQKQKLIANTESFIQGKAANNVLLFGDAGTGKSSSIKAILNEYYNQGLRIIEVYKHQFHALSSVLEQVQDRNYRFIIYMDDLSFEESELEYKYLKAIIEGGLGRKPKNVLIYATSNRRHLIREKFSDKRELDDELHVNDTVQEKLSLVARFGVTIYFGAPDKKEFQNIVKLLAEKYHVEMPVEELYAEANKWELNHGGLSGRTAAQFITHLLGLPENYG >NZ_CP040506.1|WP_080568845.1|3997172_3998228_+|prephenate-dehydratase MALCGEVAHHKIETNKPVYDKAREAEKISAARAQVDTEFEKQAVEEIFTQLMAISRRYQYQLLEQNGKSIQTGFRPVPSLPMTGIKVVYQGVEGAYSHAATLQYFGDNVDAFHVKTWEDAMKAVEDGQADYAVIPIENSSAGAVSDNYDQLIKHSNVIVAEIQISVSHALLGLPGAAESDIQSVYSHPQALMQCSEFLNSHREWRQISVENTAVAAKKIIEDNDITQAAVASETAGRLYGLTTLHPSINHNKDNTTRFIILAKEHIYRQDAGKLSICFELPHKSGSLYNMLGNFIYNGVNMVMIESRPIQGRNWEYRFFVDIEGNLSDASVQNALKSISEEASNMWILGNY >NZ_CP040506.1|WP_006778453.1|4012005_4013037_+|hypothetical-protein MATVSYKCPNCGGGLVYEPESGQYQCEYCLSEFTQQKLEEMTPQMDSSQSGETAAAMLYHCPSCGAEIVTDETTAATFCFYCHNPVVLSGRLEGQYHPDYVLPFAVDREKAVEIFTDWVQKKRYVPKSFFSKEQIEKMTGVYFPYWLYSCKVDGTMEAEGVKLRTWIAGNLQYTETQKYEIRRDGHMNINRVPRNALKKADRQLVEGVLPYDMKELRPFSMGYLSGFMAEKRDMEREAFVSELSREVTDFAVTGLQNSVSGYEKVSVRNRQADIRDEKWQYALMPVWTLTYRDNSGKICYFACNGQTGKVCGQLPVDMGRLMILFAEVFLPLLAVLLVVGYLL >NZ_CP040506.1|WP_138670257.1|4013081_4013978_+|TPM-domain-containing-protein MVSVFVCIVAAMAFCLMWSGGAWADTTDVSGAAGRVDASDGRRVYDMAGLLTEDEIAGFEQTIGEYRDRMKLDIVVVTTEDSEGKSAMEYADDFFDYGGFGYGRLKNGVLFLIDMDNRELYVSTSGDVIRLLTDSRIESILDDVYVGAGRSDFADSVDAFLKDMDQYYRMGIESGQYNYDTETGRISIHRSIRWYELLLALAVSGFVAGSVCMGVVNRYGMKKERRQAANYLMAYRADCRFEYQNQTDNLVNKFVTTAIIPRQQNHSGGGSSGGSHSGRSSTHSSSSGRSHGGGGRKF >NZ_CP040506.1|WP_006778455.1|4014037_4015867_-|ferrous-iron-transporter-B MEEHQHVIALAGNPNVGKSTIFNGLTGMHQHTGNWPGKTVASARGEFQVGEETYELVDLPGTYSLAAHSEEEEIARDFICSGEAQLTIVVCDATCLERGLHLLKQILALEYVKDNGVPVILCVNLCDEAGKKGIEIDFELLQDVLQLPVVSCCARCSKELTVLKDAIHETYGHALNYSCLDFSPKRLAEEVVRYTKVNYRKREDTIDRIVTGRITGGLVMILMLLAVFWLTMAGANYPADLLWDGLFWLESRIANGLAYIGAPQMMIDVLVYGIYRVLAWVVAVMLPPMAIFFPLFTLLEDLGYLPRVAFNMDRSFKRCKACGKQCLTMAMGFGCNAAGVIGCRIIDSPRERMIAILTNAMVPCNGRFPTLFTMITLFFLAGVHGSVTGSILSALILTGVILLGVAATLGASWLLSHTLLKGVPSSFTLELPPYRRPQIGKVVVRSIFDRTLFVLGRAVAIAAPAGLIIWILANINVGGQSILLYLTSFFDPFGRLMGLDGVILVAFILGFPANEIVIPIILMAYLQTGHLVEMNDSSALLQLLVSQGWTWKTAVSMLIFCLFHWPCSTTCLTIRKETGSWRWTAVAFLMPTILGIGLCIAVTAILNLF >NZ_CP040506.1|WP_006778456.1|4016281_4016584_+|hypothetical-protein MKQYWTWRGKYIGVRQGDYLVTYGGNVLGKFYGQELYNQEGHYIGEIGRNERMFRDVTKNGFRRPIFSYGVKGSISPCYRDCSAYPLLAGQEDFVFTEDK >NZ_CP040506.1|WP_006778457.1|4016814_4018092_+|serine-dehydratase-subunit-alpha-family-protein MEKTNERYNAYIQILKEELVPAMGCTEPIALAYAAAKAREVLGEMPDRVLVEASGSIIKNVKSVIVPNTNHLKGIPAAATAGIIAGKAERELEVIAEVTPEEINQMKEFLETVPIDVKHIDQGITFDIVVTLYKGGSYAKVRIANYHTNIVLVEKDHRILSQKPVEGESEEGLTDRSLLDMEHIWDFINTVDVADVKEVLDRQIAYNTAISEEGLRGNYGANIGQVLLDTYGDDIRTRAKAKAAAGSDARMNGCELPVVINSGSGNQGITTSVPVIEYAKELNVGEEKLYRALALSNLTTIHQKTLIGRLSAYCGAVSAGAGAGAGIAYLCGGDYKDVVHTVVNALAIVSGIVCDGAKASCAAKIASAVDAGILGYNMYKRGQQFYGGDGIVTKGVEATIKNVGRLGKEGMKETNEEIIKIMIGE >NZ_CP040506.1|WP_006778458.1|4018095_4019289_+|dicarboxylate/amino-acid:cation-symporter MKKLSQSLPFRLVLGVVIGIIIGQIANTPVMNVVVTVKYILNQMIVFCVPLIIIGFIAPSITKLGNNASKMLGVAVTIAYVSSLGAALFSMIAGVILIPHLSIVTEVEGLKDLPPIVFQLDIPQIMPVMSALVFSLLLGLAATWTKAKVITTVLDEFQKIVLDIVTKVVIPILPIFIAFTFCALSYEGTITKQLPVFIQVVIIVMVGHYIWLALLYFIGGAYSGKNPMNVVKNYGPAYITAVGTMSSAATLAVALRCAKKSEPTLRSDMVDFGIPLFANIHLCGSVLTEVFFVMTVSKILYGSVPSIGTMVLFCALLGVFAIGAPGVPGGTVMASLGLITGILGFDEMGTALMLTIFALQDSFGTACNVTGDGALTMILTGFAEKHNIKKQEIKIDL >NZ_CP040506.1|WP_006778459.1|4019432_4019723_-|helix-turn-helix-transcriptional-regulator MIADRIRILRQRNNWSQTDLANKLGITRSSVNAWELGISVPATKTVVELAGIFHVSADYILGISTDSDTINLEGYTDREKAIIYNLLNYFLEEHGR >NZ_CP040506.1|WP_006778460.1|4019917_4020172_+|hypothetical-protein MDTDVKNGLIQLKDKLENEAGQANELYRCLILAEYGIEYMDGDSRETEAAALQAFSRLAGILAGQLKENVEFVTLLIKQVEKTV >NZ_CP040506.1|WP_034857842.1|4020382_4021315_+|WYL-domain-containing-transcriptional-regulator MLKGAKSDLKCDRILSMYTRLLRGEIIYKKELADEFHVNARSVQRDLDELRNFFSEQRLKDGNDQDLIYDQKHKGYRLVQAGEETLNNSEIFAICKILLESRSLVKKELFPIMDKLLALCSTEKERKKLFDLVANEKWHYIELQHGKKLLKNIWEISNAIYEKYCMEIKYRRQGEAETVQRTVKPVGIMFSEYYFYLTAFIKEEKHAEDVYPTIYRIDRIEEFKILSEHFNLPYASRFEEGEFRKRIQFMYGGKLRKVKFRYNGPSIEAVLDRLPTAQYVEEGGGEYTVSAEVYGDGIEMWLRSQGTFIT >NZ_CP040506.1|WP_006778464.1|4022880_4023234_+|hypothetical-protein MKEELIASLKQQIFFKEEDSIYFVYDIKNMTFLQLRDRLMGNGKILSEDFEHHIYIVQVMSGMANMNPAYLAIKLDDKKVYFIGYAKEGIIKQHIAQKAIDKILSLLITSGDDVFCM |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_6 | 4525984-4526121 | Orphan |
NA
Consensus repeat of NZ_CP040506_6
|
3 spacers
spacers of NZ_CP040506_6
>6.1|4526002|18|NZ_CP040506|CRT ATCAGAACAACAGCCAGA >6.2|4526038|30|NZ_CP040506|CRT GCAGAAACAACAGCCAGAACAGCGGCCAGA >6.3|4526086|18|NZ_CP040506|CRT GCAGAAACAACAACCAGA |
CRISPR arrays and Neighbor proteins around NZ_CP040506_6
The CRISPR arrays of NZ_CP040506_6 >merge|NZ_CP040506|6|4525984-4526121|CRT AGAGATGTCAGAACAGCAATCAGAACAACAGCCAGAACAGCAGCCAGAACAGCAGCAGAAACAACAGCCAGAACAGCGGCCAGAACAACAGCCAGAACAGCAGCAGAAACAACAACCAGAACAGCAGCCAGAACAGCA >NZ_CP040506|6|3|4525984-4526121|CRT AGAGATGTCAGAACAGCA ATCAGAACAACAGCCAGA ACAGCAGCCAGAACAGCA GCAGAAACAACAGCCAGAACAGCGGCCAGA ACAACAGCCAGAACAGCA GCAGAAACAACAACCAGA ACAGCAGCCAGAACAGCA
>NZ_CP040506.1|WP_006778893.1|4524735_4525722_+|hypothetical-protein MKRNIEGFKGRGNWYKGNLHSHTVNSDGKLTPAESVKLFQDNGYHFLCLSEHDLYTDYRKEFDSPEFIILPGLEASAVLFEKEDGIHRKKVHHIHGILGTELMQQKAVKPLFRHMERLEVPVYYGEWDGAAVAQQLADELAARGCITTYNHPVWSRVEEREFVDTDGIFGLEIFNYNTVNESGTGYDTAHWDVMLRKGRRIHGFASDDNHNEGLFDDACGGYVWVKADGLTHDNIISALVEGNYYSSSGPEIYDWGIREGVVYVDCSPVNRVNVIAGGYVNGGRTVMCGSLQETMTRAEYPLNGDETYVRVECVDASGRTAWSNAIFL >NZ_CP040506.1|WP_006778892.1|4523703_4524717_+|ABC-transporter-ATP-binding-protein MAYIEFRNITKMFGDNRVLDEITMEVQKGDLVTLLGPSGCGKSTLLRCLSGLESVTEGQIFLDGEDITETPPSQRNVGMVFQQYSLFPNMTVEQNIAFGLKMKKAAPELIDEKVRGAIRMVELEGKEKSYPANLSGGQQQRVALARSIVMEPKVLLLDEPLSAIDAKLRKSLQSSIRQIHKELGLTTIFVTHDQDEAMVMSDVIQLFHAGKIEQSGSPIAMYTEPKTKFAAGFIGNYNILTASEFIRVTGKPYEASEDVAIRPETISVSRTVKDVANAYHFEGIIKNNTPRGNVLRYDIDVNGVMLKADVLFRSFQLYENGSRVQLAVENHNCLALK >NZ_CP040506.1|WP_006778891.1|4522906_4523701_+|ABC-transporter-permease-subunit MKKSKRLPQLLIILISIYLLIPFVVTFIYSLSTEWVGIIPSGFTVKNYVELFQDMDFWLSVGRTLVICVVSVSISIALLLGVMFVVTMYAPWLGKYIQFICMIPYALQGVILSISIVSLFSGTGTFLSNRMMMLFGAYSIMVLPYIYQGIRNNLNAINSKMLVDAAQMLGAGRLYAFFRVVIPNIMPGVIVSSLLAVSIVFGDFVLANNIAGNNYQNIQVYLYVNMTKSSSKASAIVVLIFVVVFGITGTVLWLQNKGKKVAGR >NZ_CP040506.1|WP_006778890.1|4522059_4522905_+|ABC-transporter-permease-subunit MNIKKQTWKNCLVLLPFAIVVCLYELLPLLQLALNSFHDENTGAWSLSNYGKIFSTPLYQASIVNSIRISLISALVGICVAFIAAKSYHDAGEKFQNFFTMVLNMTSNFSGVPLTFGFMILLGNTGVLTLVAQKLGFLQDFNLYSGNGLTLIYIYFQIPLATLLLIPAFLGIKKEWREAAILLHCGSLRYWFLIGIPNLLTSLLGTLSVLFSNALAAYATAYALLLSNYALLPLQISSKFKGDVRINKELGGALSVVMICLMVAATLVNNYLTKKHAKGAA >NZ_CP040506.1|WP_006778889.1|4520898_4521999_+|ABC-transporter-substrate-binding-protein MKKSCVSVLLAMSMAAALLSGCAKSAAAENVDYNSKGWDAIVADAKKEGKVNSVGMPDTWANWIGTWQGINDEYGIAHEDLDMSSSEEIALFKEEGKDGTKDIGDVGQQWGPVAESEGVTLKYKTSYWDDIPSWAKDDDGDWIVCYVGTISIITNNALVDKAPQSFQDILEGDYKVTIGDVSAASQAQHAILATAYAMGGDMDNLQPAYDFWSTLAKEGRIDTGDTSTARIESGEIAVGLFWDYNALNYRDNAVSNNPNASFTVCVPSDGSVQSGYASIINVNAPNPNAACLAREYILSDQGQINLAIGYATPIRSNVVIPAEVQAKRIDQSQYASAHAIEDFDKWTNVCQDIITYWEENIIPAIK >NZ_CP040506.1|WP_138670285.1|4519965_4520661_+|C40-family-peptidase MFVFASPMEAKASTALAGPGMEGGSYVATVNASTVNINASQNSDVVIAQATQGMSFNVLEDMGDGWLKIKVGSAEGFIPFDESISLQEEMEASDDEASLTVNMNGLEVTTEQRQNLVNYALQFVGGRYKYGGSDPHSGVDCSGFTRYVMANGAGVAMNRSSTAQSTQGVSIALEQIRPGDLIFYGNGSRINHVAMYIGNGQIVHASTYKTGIKVSDWLYRSPVKVVNVLGD >NZ_CP040506.1|WP_006778887.1|4518982_4519693_+|C40-family-peptidase MLKTILKAMAAFCVCGFFLGAAPDTSYGAVKESTCVGVETSSSYLVKIDAPSARIYTGKSTSAAVADTVQRGQTYDVISYQNGWVKINTGKSEGYLKTAGQATVVETAREKVDEAAAVRAQVVDFALQFVGNPYVYGGTDPNTGADCSGFTSYVLRHAAGVSLSHSSVAQAGEGRVVSEEEMKQGDLVFYSNGFRINHVAIYAGNGQVVHASTNKTGIKTSPWNYRTPVKIVRVLP >NZ_CP040506.1|WP_006778886.1|4518310_4518811_+|DUF1700-domain-containing-protein MNKEEFLRRLRQALAGDVPPGVIEENIRYYDSYISGEVRKGQSEEEVIAAIGDPRLIAKTIEETTEGAGEGSYTDADDRSGYGSYERNPYEKNTYERNPYETNRSFHMIDLNKWYWKLLAVVLVFSIISLIITVVGGIFTLLAPLIGPLFLIWMVVWIFRMFNNRR >NZ_CP040506.1|WP_006778885.1|4516238_4518176_+|fructose-bisphosphatase-class-III MRELAYLKLLSREYPTIKAASSEIINLTAIRGLPKGTEYFFSDLHGEHEAFIHLLRSSSGIIREKIKETFGYIIPEEEQVELANLIYYPDQVLNQIGASGKDTDDWKRINIYRLVQICKEVSSKYTRSKVRKKLPPEFAYIIDELIHVDYNADNKRVYYSEIIRSIIDIDVADKFIIALCELIQNLTVDNLHIIGDIFDRGPRADLIMNELMHFHDVDVQWGNHDISWMGAATGNLACICNVLRIAISYNSFDVLEDGYGINLRPLSMFAASTYRDDECARFVPHILDQNIYDAVDPGLAAKMHKAIAVIQLKVEGQIIKRHPEYRMDDRLLLEQVDGKKGTVCIGGKEYPMLDMKFPTIDWEDPLKLSEDEVELLHTLSLSFRHSDLLHKHVKFLYSHGALYKSYNKNLLYHGCIPMKKDGSFDTMVFNGVSYSGKSLMDFVDRMIQNAYFLKGESKEKEDARDFMWYLWCGEKSPVYGKDKMTTFEHYFVADAATHKETMNPYYQLSVKEEYCDKILEEFGLPTKGAHIINGHVPVKIKDGETPVKAGGKLYIIDGGLSKAYQSKTGIAGYTLIYNSNHLALAEHKPFTPGKENTPKVTIVEKMKNRVMVGDTDLGKELAGRIEDLKELVAAYREGVIKEKMV >NZ_CP040506.1|WP_006778884.1|4515332_4516139_-|MBL-fold-metallo-hydrolase MEQLYVFGTGNAIVTRCYNTCFAIKNSDGEYFMVDTGGGNGILRILEDMNVDMKRIHHIFLTHEHTDHLLGIVWLVRMISVLMKKELYDGNLYIYCHEDLVETVTTVCRLTLQPKFFKAIGDTIHLVAVKDGETRQILNWPVTFFDIHSTKAKQFGFTMTLEQGRRLTCAGDEPYNPLCEKFVAGSDWLLHEAFCLYGDRERFNPYEKHHSTVKDACQLAEELHIPNLVLWHTEDKSLDTRKETYMAEGTQYYHGNLFIPYDGEILEL >NZ_CP040506.1|WP_034858771.1|4526226_4526535_+|rhodanese-like-domain-containing-protein MYQTITMKQLEQMLDCHEDIFLLDVRNRASYEMCHMEGAVNIPCEELDEKMESLPKDKTIVCYCARGGQSMLACNHLSAMGYSVVNTANGLSSYRGKYLVKG >NZ_CP040506.1|WP_006778896.1|4526649_4527192_+|phosphodiesterase MKYMFASDIHGSACYCRKMLEIYRQSGAGRLILLGDILYHGPRNDLPEEYAPKLVTEMLNQYKDQIYAVRGNCDAEVDQMVLEFPIMADYALLELNGKTFYATHGHIYNQDCLPPMQAGDVLIHGHIHLPVAEKMGDKFLLNPGSTSLPKEGNPNSYAMLDGEIFTIYDFDGNKVKEIAL >NZ_CP040506.1|WP_050810052.1|4527247_4528375_+|class-I-SAM-dependent-RNA-methyltransferase MEAVLKREIIDLGYEISLVEDGRVTFVGDDEAICRANIFLRTAERVLLKVGSFRAESFEELFQGTKAIAWEEYIPQDGKFWVAKASSIKSKLFSPSDIQSIMKKAMVERMKKAYGLERFPETGSSYPLRVFLYKDMVTVGIDTSGDSLHKRGYRTLTSKAPITETLAAALILLTPWNKDRILVDPFCGSGTFPIEAAMMAANMAPGMKRTFLSEDWKNLIPRKCWYEAMDEANEMVDDTVEVDIQGYDIDGEIVKAARANAEAAGVGHMIHFQQRPLSALSHPKKYGFLITNPPYGERIEEKENLPALYREIGERFRALDSWSAYMITAYEDAEKYMGRKADKNRKIYNGMMKTYFYQFLGPKPPRRKAGDEIET >NZ_CP040506.1|WP_080568850.1|4528397_4529717_+|peptidase-C1 MLVILLSTASAWSMGKTGDPAVVAVDGTRVYYPVYALDKTRVAPLLPSSYDYRKEGRAPKVKDQGNYGTCWAFASLTALESALMPGEKMDLSEDHMSLQNGFNLTQDDGGEYTMSMAYLLGWQGPVYEKDDPYGDGVSPQGLKPVKHVQEIQILPQKDYQKIKAAVYFRGGVQSSLYTSIKNYKSRSVYYNENTFSYCYIGDEKPNHDAVIVGWDDNFPKENFNMELPGDGAFLCASSWGTAFGDGGYFYVSYYDSNIGMHNILYTGVESVDNYDRIYQTDLCGWVGQLGYGKESAYGANVYQAGERENLEAVGFYATDVNTEYEVYVSRHVPETPDFAERELAASGKFENAGFYTVKLDTPVELDAGERFGVMIKITTPGSVHPVAIEYQADNTLSLVDISDGEGYISFRGTSWESMEEKYGCNLCLKAYTSVRDGAS >NZ_CP040506.1|WP_006778899.1|4529811_4530423_+|RdgB/HAM1-family-non-canonical-purine-NTP-pyrophosphatase MGHKIVFATGNEGKMKEIRLILADLGLEILSMKEAGVDLDIVEDGKTFEENAAIKARAVWEKTGGIVLADDSGLVIDYLDGEPGIYSARYLGEDTSYECKNRVILERMEKAQGEERSARFVSAIAAVLPDGRELGTLGIVEGVIAGEPAGDGGFGYDPIFYLPEFGMTSAEIPIELKNEISHRGKALVAMKDKIRKVFEEEHR >NZ_CP040506.1|WP_006778900.1|4530419_4530905_+|metallophosphoesterase MKILVVSDTHRKDDGLKMVIEKEKPLDMLIHLGDAEGSERYIAEWVNPECRLEMVLGNNDFFSMLDKEREIKIGKYRALLTHGHYYGVSMGAEGLAEEARNRGCGMALFGHTHRPYYGKIGGVVVINPGSLSYPRQEGKKGSYGIMTVTDEGEVEYSQNFL >NZ_CP040506.1|WP_006778901.1|4531868_4532249_+|hypothetical-protein MRTVGRIDRSIYSCVADDIVTDEVIITEERIAHIAERHPGDYERFCQCLKEVVERPDFIVETQKPNTALLLKELAELDGKKFKTILRLMTSREKSDYKNSIITFMKIDEKEWNRLIRNKKIIYKRE >NZ_CP040506.1|WP_138669894.1|4532663_4532960_+|hypothetical-protein MQRGRELDVFRQFSVLRDMKEQDLMYQLLCLKECLSQEEEERAEAVRVWRGSILAKRMTREMQNILIALAELEADCESLEEFPTVEEIAARAKTIEVW >NZ_CP040506.1|WP_006778903.1|4533411_4534578_+|GGDEF-domain-containing-protein MSRALIHANIYLLPVIVLFIISQDVKKSLPRNLNTHFFIVLVWQTIGMMVLETCSWVPDGEMWEGARMLVWVCNILYAMLYAGFAFSWFVYIYSRIPGVENLLENRKKLRLLSIPVLISCLVLIMTPWTHWVFWVNENNSYERGPYYIAPYLFISGYMLTAIVLSFLQRRRVTRAGEKQECVRLAVYAMIPIAGLVLQLLDYKFWSAWPFTALAILTIYVSMQNGQITTDGLTGLNNRRQLEKYLLSRCDMNDGKLWCLIILDVDDFKSINDVYGHIVGDKVLCRVAKVLKAAYGNTDSFLARFGGDEFVVVASCDGAEKAKDVLQLFYDKLEESNRQAAKPLRVTLSAGYACYDGVRVNDRHSLMKAADEAMYREKQRKKAGDCMPA >NZ_CP040506.1|WP_006778904.1|4534613_4535756_+|GGDEF-domain-containing-protein MGRFGYIQINMFAALMLLILYINSKSKFPYSRNSKRFRKIITLMILTLVTDTAIRVFDGQGAAWVSTAMWVCVWLYYAAVDMLAYGWFLFTYANLYEDRDLIEHKWLILFTSAPLLVLVLMMTGWPGLVFGVDGQNHYVRGAAFLLQCFVWAAYILAAGGMAFFLRRKANMREKREEYVYLAYFPVLPLAGGLLQLVVKDMAAIWPFTVASMVMVYVKMQRTQISLDPMTGLNNRSRFNQFIQSKIDGGRNQNPWYLLLIDVDKFKQINDSFGHMAGDAALIKVASVLKRTFGKMNAFIARYGGDEFVVVLECRKEKDILNAMQQLDTMLEHENRHENTPYQLCCSAGYVRFDGEIMKTKEQLIAAADKEMYLQKKSRNA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040506_7 | 5395628-5395861 | Orphan |
NA
Consensus repeat of NZ_CP040506_7
|
3 spacers
spacers of NZ_CP040506_7
>7.1|5395661|33|NZ_CP040506|CRISPRCasFinder GCTTAAAGGGAACCGGGTCTCATAGGGTGTCAG >7.2|5395727|35|NZ_CP040506|CRISPRCasFinder TCAGTTTTTAATAATTCTTTCATTCCGTCCTGAAA >7.3|5395795|34|NZ_CP040506|CRISPRCasFinder GTTATTAAGTGTACTATCCTTATTTTCATCTTGT |
CRISPR arrays and Neighbor proteins around NZ_CP040506_7
The CRISPR arrays of NZ_CP040506_7 >merge|NZ_CP040506|7|5395628-5395861|CRISPRCasFinder GTCACAGCTTGCGAAAGCTGTGTGGATTGAAACGCTTAAAGGGAACCGGGTCTCATAGGGTGTCAGGTCACAGCTTGCGAAAGCTGTGTGGATTGAAACTCAGTTTTTAATAATTCTTTCATTCCGTCCTGAAAGTCACAGCTTGTGAAAGCTGCGTGGATTGAAACGTTATTAAGTGTACTATCCTTATTTTCATCTTGTGTTGCAGTTTGCGAAGGTCTAGTGGGTTGAAAG >NZ_CP040506|7|6|5395628-5395861|CRISPRCasFinder GTCACAGCTTGCGAAAGCTGTGTGGATTGAAAC GCTTAAAGGGAACCGGGTCTCATAGGGTGTCAG GTCACAGCTTGCGAAAGCTGTGTGGATTGAAAC TCAGTTTTTAATAATTCTTTCATTCCGTCCTGAAA GTCACAGCTTGTGAAAGCTGCGTGGATTGAAAC GTTATTAAGTGTACTATCCTTATTTTCATCTTGT GTTGCAGTTTGCGAAGGTCTAGTGGGTTGAAAG
>NZ_CP040506.1|WP_006779655.1|5394488_5395421_+|hypothetical-protein MFREYTMLDPKKLAPFFGFADYEIEKLCENGNGVGMEQLKEWYDGYYMPEIGDVYNPRSVVEALEENLCRDYWNKTGGFSELEEYITMNFDGLGEDVTALVAGQEITVNVLGFSNDLDSFQDKDEVLTALIHLGYLTYKDGTVRIPNKEIREEFVNSIKKLSWGTVSLLLKQSRELMDALLLRDVALVGQLLESVHDDMQEFKEYNNEHTLKCVIHLAFYAASDDYTLQFESAAGKGYADCCMIPKKPGLPGIILELKYNGNLGKAIEQIKEKNYMKIFEQQVKSIYLVTINYDKKSKKHKCCIEVVDNK >NZ_CP040506.1|WP_138670010.1|5394083_5394461_+|hypothetical-protein MKIKNTVCYYKHLNTHNVICINFNDYFENLSVTEGIAKISERLIHDLKQAFPNILGGEDDLILCLDMITQVSGEKFIFLIDEWDCVFRFHKGEGQEQQQFLSFMKLLLKDKSYVELAYITGILPI >NZ_CP040506.1|WP_006779653.1|5393159_5393789_+|GntR-family-transcriptional-regulator MQKVALKDQVYKSILKEILDGKFSMDSIINEKVLSEQFEVSKTPVREALVRLCSEGILENLPRYGYRLIPVTQNEIQEIIEYRKVMEIEALRLSFDYIGPADIQKLKQLDESAQEAVISRDVHLAWERNENFHWELGDLCPNRYFRSSIKSALTVGNRYANQYFSSIWREDKPLDRSHTKIIEALEEKDLAKAQEILTFDIELMKAILL >NZ_CP040506.1|WP_006779652.1|5392378_5392999_+|dihydroxyacetone-kinase-subunit-L MEIQAVKRAVSAVYEKMAEQKDYLIQLDQQNGDGDLGLSMCGGFGALCEALDATEETDFGKVFLMASKTFNEAAPSSLGTILSFGMMGMAKKLKGKTEVSQEEMAEAMQAGVDNIMEKAGSKVGEKTILDALVPAIEELRRCGGEMAAGDVWAAAAAAAGQGSESTRQMKSVHGRAAYYAEKSIGILDGGSVVGKLIFEGIAESVR >NZ_CP040506.1|WP_006779651.1|5391349_5392357_+|dihydroxyacetone-kinase-subunit-DhaK MKKMINAPADFVQETVEGIIAAYGDRLTLLNGDFRMVMSNRPGREGKVGIVTGGGSGHLPLFLGYVGDGMVDGCAVGNVFASPSAGKMSELIKACDFGSGVLCLYGNYGGDNMNFKMACDEAEFEDIETRIVTAADDVASAPAELAQKRRGVAGLIYAYKIAGAAADERRSLDEVADAAKKALGNIRSMGVALSPCIVPEVGEPTFSIPDDEIEIGMGIHGEKGIEVCKMLTADETAAVILKKIVADMQLEAGDEVSVMINGLGATPLEEQMILYRAVHRTLDEMGVSVFMPHIGEFATSMEMAGLSVTIFKLDEELKRLLRAPASTPFYTNANK >NZ_CP040506.1|WP_006779650.1|5390465_5391302_+|alpha/beta-hydrolase MEYAEHYCYVEPDIRLHYIDEGSGRTIVFVTGFSGSAQGFEHQIEYFKQSFRVIAVDPRNHGKSSWSPRGNTYAQQGRDLGVLMETLGLEHVILAGWSFGAYAVLNYLEQFGTKRVDAFVTIDNPVCAISEDEREFRAGNLDMLRDFHFRYFQSEEGFRQFVVENFIDGIFFLNPPQDEEGRNRVLNTCLRLPLEVGDQLIVDGHLSDKRDVMKTVDESIPCLFYVADYRKEAGLRCIPRDYPNSEVVTLGNHMMFYEFPEVFNHIMEDFLQRHHLVE >NZ_CP040506.1|WP_006779649.1|5389448_5390411_+|ABC-transporter-permease MSGKIDGKKIFKQYGITLVLLALCILFTILNPVFFTLRNIMNVMRQMSMIGIASVGGMFVIIQGGIDLSEGAVVSFVNVVCAWLMMSAGMSPELAILISLIVSAAIGYLNGVLVTMAKMPPLIVTLAVQGGLYGISYIITNSHSIAGFPDSFRFIGQGYIGFLPVPVVLMVLVLAIGWFVLNKTYFGRYIYAIGGNDDVARLSGIRVNRIRRLVYMLGGLFAGVSGVIFLSRLMSGQANTGAGFEMDVLTALVLGGVSINGGSGKIFNAVMGVAIIGVLNNGLVLVNVNQHVQEVIKGVVLIAAVAFDCLSKSKSSGNEA >NZ_CP040506.1|WP_006779648.1|5387937_5389452_+|sugar-ABC-transporter-ATP-binding-protein MENNIALELKNISKQYPGVLALDSMSITFRKGEIHALLGENGAGKSTLIKVCTGAIRPSSGTIEIGGQQFSHMTPQLSEQNGVAVVYQELNLVEELSVAENIYLGQKAGGRHLFNGAAVAKKAQELLDRLEMNLPATAKIKELSPGYQQLVEIAKALSLDARILILDEPSAALTDSEVQKLFKTILKMQEMGTTVIYISHRLDEIFQIADRVTVLRDGCKIQTLDVKDTDKDRLISLMVGREMTEVYPKYEGREGEPEDVILDISHVSGNGLKDISFQVRKGEILGLGGLVGAGRTELAQILFGVVKKDAGVIRIHGQEVEFHSPTEAIAHGIALVPEDRKQQGLILNMSIEKNISLASLKRMSKGLVINNRTEKITAQDYAKALKLKAASLEYDADTLSGGNQQKIVLAKWMATEPDIIILDEPTRGVDVGAKYEIYLLMHEMIRAGKTLIMISSEMEELINMSDRIVVLSEGRQAGELKKEEFNQETILKYASGADAKEVCS >NZ_CP040506.1|WP_006779647.1|5386750_5387845_+|sugar-ABC-transporter-substrate-binding-protein MKKQFLAAGLSVVLGSMMVLAGCSNGDGGTTTAASATAQEEKKTEMTTTAGAAGDTSAAANGAVTKDNAKWKVGVTITDLTVPVWDDYAQAIKKYGEPEGMYVNIVSPEGNAAEQISQMENFVTDGYDVIVVSAADNESMGQEAKKVTEEGVIVFSQGYEFDNYSAAMLEEKQVFGHHTAEMASRWINEKYPDGKCKVIVAGNQTIPLMMERTEGIYNGLKEFAPNAEVVATVYGSNEEEFLPMMENAFTANPDANMVISYCAGGALAAREAAKGMGLASDDFGIFCTDCDDGVADAIYNDDLIRGGLSMGGGDYMAKAVVETLVKMLNGEEYDKVINFPEIEVNKDNVLEQADALGYKVQSAK >NZ_CP040506.1|WP_006779646.1|5386207_5386708_+|L-2-amino-thiazoline-4-carboxylic-acid-hydrolase MAIKNNENPVMETVAVNRSQIEHRATWMGLIYDEMKKEGLDAEGIIRRAIKRTGCIHGEGFRKQCADPADGSQFCQVFLGTEDNVGPQTFGMDHICSDRDNVSVEFHYCALVSAWKKLGFDDETCALLCDIAMDGDRGIAEAMGMTLDLTDTIAKGCETCKLHFYK >NZ_CP040506.1|WP_006779656.1|5396150_5396879_+|YebC/PmpR-family-DNA-binding-transcriptional-regulator MSGHSKFANIKHKKERNDAAKGKVFTVIGREIAVAVKEGGADPANNSKLRDVIAKAKANNMPNDTIDRGIKKAAGDANSVNYEVLTYEGYGPNGVAIIVDTLTDNKNRTAANVRSAFTKGSGNVGTPGSVSYMFDKKGQIIIDKEECEMDPDELMMAALDAGAEDFAEEEDSFEILTAPDDFSAVREALEAAGIPMMEADVTMIPQTWVELDDEDSIKKMNKILDLLDEDDDVQAVYHNWDE >NZ_CP040506.1|WP_006779657.1|5397109_5399104_+|M28-family-peptidase MDSKIEDWNEDVEYAFRLAKRMEEFRSNPALGYRTAGSKAEFETGEMLLAEMRQLGFSNVRKEQIRVDAWEFERAVLRCRIEDTGRYREFQLGAYQTNFHTAGFQEYSIIYAGKGTARDYEGLDVSGKLVLVEINQREEWWINFPVYQAHVKGAAAVIAVQERGFGEVDSTALNAQDIAGPAGAPAFSISQADAAALRELMGDGREMKALFDAETSVRTDCRSYNIVGEIPGEEEEMILLTAHYDSYFSGFQDDNAAVAMMFGIGKHLLERGYRPRKTLVFCAVSAEEWGVSNSKYDWSVGAWRQVFEVHPEWQGKVMADLNFELPAHAHDRKDGIRCVYEYEDFLRHFLGTIKVDETAYPGGIEVHSPIQTMSDDFSMAIAGIPSMVNDFTSGSFMETHYHSQFDNEEFYEEAVYRFHHYLYGELVQAFDRTVLPPLDFGRLFEAMVESIDLEFSKEAYESGIRLKQLALQAVEEGRRVYRWITQINHMAGMTASGGYERERRILMQVFRKAQDSFVRLNWHDEVLFPQELVRKNLSHIRRAECCLDGGDIRGALEEIYEIDNNRYAFLFDREVFDYFTDYVFGRPKEELLWGGGRIVHHENLYGLVSSLRKKYETHSTDVTQELEVLKRVEANQMEYYLADIDYMIRETELIINNLKKIEGI >NZ_CP040506.1|WP_006779658.1|5399108_5399744_+|GNAT-family-N-acetyltransferase MESQKLYRVERNDMGRLEELLAECFMRDPLYCRLIPDEETRVRLMPELMHCDLEEMFATCEIFADSPDIHGVLVVSDESEPYNIFQYYLTEAYASLKTEECLIREDPSLKTFWNFFLGRDYLNSRWTDQLHQEERLHIIYLAVEPAMQHHGISTLLMDEAIAYAREHQLMISLETHNEKNVAMYQHYGFKIYGVVEKHFDLKQYCLVREVQ >NZ_CP040506.1|WP_006779659.1|5399830_5400793_+|Gfo/Idh/MocA-family-oxidoreductase MKIGVVGNGMIVKRFLEDLKQVEGASAEAICVRSQSREKGEQLAAAYEIGKVYTDYPECLRDGSLDAVYIGIINSEHYEYVKLALEAGKHVICEKPFTVEAWEARELAKLAREKGLFLWEAFKIAYSPVFQSVKEHLTEIGAVKLVQCNYSRVSSRYADYLEGRVLPAFDPELSGGCMYDINLYNLHFTVGLFGRPNALHYYANKGYNGIDTSGVVVMEYDGFQAVLTGSKDSSSPCGCVIQGEQGYIRTEGPASAASSAEINLGNGPVPIAQDEENGTLAGETRAFVAQYENGDYESCYQMLEHSVLVMELLEAAVKDR >NZ_CP040506.1|WP_006779662.1|5402224_5402422_+|helix-turn-helix-transcriptional-regulator MATRIPCTPFGKRMKIAMVEQDIPQHELAKRLGLASSTVSDVIYGRNCCERTKERIAETLGIRVN >NZ_CP040506.1|WP_138670012.1|5402536_5404123_+|transposase MRLNKNTNDNYTVRQLKLPLEIEKLIDISDPVYTFCEVMDHIDLSKYFVAKGYKTGRPRCDEHKLLKVILFAFMEHGISSLRDIEKLCRNDIRYLYLLDGMKAPSFATIGSFIRKELTDSIEQIFLDVNTYIFQKDHVDLEHVYLDGTKIEANANRYTWVWKKSCTRNRGKVFEKISMLLDAMNQEVLGYLNLKLEKREEYAIGYVSELLELYRKGTGLDESMFVSGCGHRKSIYQKQYQELQGYLERLKTYAHHIEICGEERNSYSKTDHSATFMRLKRDYMGNDQLLPAYNLQTAVCDEYIAVVEVKPYASDMECFVPLMEKFHKTYGRYPKYPVADAGYGSYNNYLYCEEHGMEKYMKFTMFQKETKDKKYHENPYRAVNFQRDESGNLLCPGGRKFRFKCRRPVYKNQYGRTEELYECESCEGCEYKSECSPKASGNRTIRMNEELTAIHQEVLSNLESIHGALLRMNRSIQAEGTFGVLKWDRSYKRLFRRGEKNVILELTLISCGFNIYKYHNKKQRKEAAA >NZ_CP040506.1|WP_006779664.1|5404389_5405310_+|aldo/keto-reductase MKHIKLGRSGLTVPAIAVGCMRINEMGSAQVAEWIDGALEMGANFFDHADIYGRGACEELFGQAMAEAGVKREDVILQSKCGIIPGKMYDCSKEHILESVEKSLKRLGTEYLDVLLLHRPDALIEPEEVAEAFDELERSGKVRHFGVSNQNSMQMELLRRYVKQELAADQLQLSVTNSNMIRSGLEVNMQTEGAVNRDGSVLDYCRLHDITIQVWSPFQYGFFEGVFLGSLEYPELNQVIDEIAKGYGVSATAIATAWIMRHPAEMQMIAGTTKLGRLRDICESSEIVLSREEWYRIYLAAGHMLP >NZ_CP040506.1|WP_006779665.1|5405425_5406448_-|PTS-sugar-transporter-subunit-IIC MNQTGVKAFLARKNVSITVKTYLIDALGAMAFGLFASLLIGTIFATLGEKTNIALFVTIADYAKGATGAALGVSIAYALKAPQLVLFSAATVGIAGNALGGPVGALVATIVGTELGKIVSKETRVDILVTPGVTIISGVLVAQFAGPGVSAFMTAFGNLVKNATEMQPFFMGILVSALIGIALTLPISSAAICIMLSLDGLAGGAATAGCCAQMIGFAVLSFRENGIGGLLAQGLGTSMLQMGNIVKNPRIWIPPTLASMITGPIATMVFKLQNIPAGSGMGTCGLVGPIGVYTAMGGGTSMWIGILLVCFVLPAVLTYGFGIVLRRMGWIKDGDLKLDL >NZ_CP040506.1|WP_034858139.1|5406766_5407438_+|helix-turn-helix-domain-containing-protein MMFSEKLQIIRKNRGLTQEELAEKLSVSRQAVAKWEAGHTYPDITNLIGISNFFNVTVDYLVKEQECSLNITDAQDKDIERLILFRLEANVNTYAAYMNETSPTRLNSHDFTYTNAPYLYHDTYVGGEKFAGEEVIWHEGNVQYAMNYCGQVLGQQFSGDFLKEALRKADMKMPYRGPEYYQSGEYTYKCNVVGDFTWFQGYEEIYCNTEKVYECYFHGGTTN >NZ_CP040506.1|WP_006779667.1|5407559_5408966_-|glucuronate-isomerase MKQFMDKDFLLSTESARMLYHDFAEKMPVLDYHCHINPQEIAEDRKFDNITQVWLGGDHYKWRQMRSNGVEEKYITGDASDREKFQKWAETLPKLIGNPLYHWSHLELQKYFGYTGYLNGDTAEEVWNLCNAKLQEDSMSVRNIIRQSNVTLICTTDDPVDSLEWHKKIAADTTFDVQVLPAWRPDKAMNVEKPTFAAYMAQLSEVSGVKVTDFASLKEALKNRMAYFAENGCCVSDHALEYVMYVPATDAEVDAVMAKGLAGQPVSKEEELQYKTAFMLFVAREYNRMGWIMQLHYGCKRDNNAFMFEKLGADTGFDCINNYAPSAQMADFLNALSAGNEIPKTIIYSLNPNDNASIGTIIGCFQDTAAAGKIQQGSAWWFNDHKVGMTEQMTSLANLGCLGNFIGMLTDSRSFLSYTRHEYFRRIMCELIGGWVENGEYPADMKALKEIVEGISYNNAVKYFGFNL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP040506_6 | 6.2|4526038|30|NZ_CP040506|CRT | 4526038-4526067 | 30 | NC_029005 | Streptomyces phage phiSAJS1, complete genome | 25000-25029 | 6 | 0.8 |
NZ_CP040506_2 | 2.27|3208612|34|NZ_CP040506|CRISPRCasFinder,CRT | 3208612-3208645 | 34 | NZ_CP014068 | Enterococcus gallinarum strain FDAARGOS_163 plasmid unnamed, complete sequence | 15850-15883 | 7 | 0.794 |
NZ_CP040506_2 | 2.57|3208617|34|NZ_CP040506|PILER-CR | 3208617-3208650 | 34 | NZ_CP014068 | Enterococcus gallinarum strain FDAARGOS_163 plasmid unnamed, complete sequence | 15850-15883 | 7 | 0.794 |
NZ_CP040506_6 | 6.2|4526038|30|NZ_CP040506|CRT | 4526038-4526067 | 30 | NZ_CP019297 | Vibrio campbellii strain LMB29 plasmid pLMB99, complete sequence | 52948-52977 | 7 | 0.767 |
NZ_CP040506_6 | 6.2|4526038|30|NZ_CP040506|CRT | 4526038-4526067 | 30 | NZ_CP020081 | Vibrio campbellii strain 20130629003S01 plasmid pVCGX4, complete sequence | 18078-18107 | 7 | 0.767 |
NZ_CP040506_6 | 6.2|4526038|30|NZ_CP040506|CRT | 4526038-4526067 | 30 | MW084976 | Bacillus phage Kirov, complete genome | 50075-50104 | 7 | 0.767 |
NZ_CP040506_6 | 6.2|4526038|30|NZ_CP040506|CRT | 4526038-4526067 | 30 | NZ_CP022991 | Paraburkholderia aromaticivorans strain BN5 plasmid pBN1, complete sequence | 283095-283124 | 8 | 0.733 |
NZ_CP040506_6 | 6.2|4526038|30|NZ_CP040506|CRT | 4526038-4526067 | 30 | MN693358 | Marine virus AFVG_25M395, complete genome | 15246-15275 | 8 | 0.733 |
NZ_CP040506_7 | 7.2|5395727|35|NZ_CP040506|CRISPRCasFinder | 5395727-5395761 | 35 | CP046512 | Bacillus cereus strain JHU plasmid p1, complete sequence | 48925-48959 | 8 | 0.771 |
NZ_CP040506_2 | 2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT | 3207307-3207340 | 34 | NC_013940 | Deferribacter desulfuricans SSM1 megaplasmid pDF308, complete sequence | 256871-256904 | 9 | 0.735 |
NZ_CP040506_2 | 2.37|3207309|34|NZ_CP040506|PILER-CR | 3207309-3207342 | 34 | NC_013940 | Deferribacter desulfuricans SSM1 megaplasmid pDF308, complete sequence | 256871-256904 | 9 | 0.735 |
NZ_CP040506_3 | 3.17|3219575|34|NZ_CP040506|CRISPRCasFinder | 3219575-3219608 | 34 | MN033296 | Leviviridae sp. isolate H4_Rhizo_Litter_20_scaffold_389 RNA-dependent RNA polymerase (H4RhizoLitter20389_000001), hypothetical protein (H4RhizoLitter20389_000002), and hypothetical protein (H4RhizoLitter20389_000003) genes, complete cds | 3462-3495 | 9 | 0.735 |
NZ_CP040506_2 | 2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT | 3207307-3207340 | 34 | NZ_CP014607 | Endosymbiont 'TC1' of Trimyema compressum strain not applicalbe isolate TC1 plasmid pTC1, complete sequence | 19518-19551 | 10 | 0.706 |
NZ_CP040506_2 | 2.30|3208807|34|NZ_CP040506|CRISPRCasFinder,CRT | 3208807-3208840 | 34 | MN284895 | Mycobacterium phage Marshawn, complete genome | 35327-35360 | 10 | 0.706 |
NZ_CP040506_2 | 2.37|3207309|34|NZ_CP040506|PILER-CR | 3207309-3207342 | 34 | NZ_CP014607 | Endosymbiont 'TC1' of Trimyema compressum strain not applicalbe isolate TC1 plasmid pTC1, complete sequence | 19518-19551 | 10 | 0.706 |
NZ_CP040506_2 | 2.60|3208812|34|NZ_CP040506|PILER-CR | 3208812-3208845 | 34 | MN284895 | Mycobacterium phage Marshawn, complete genome | 35327-35360 | 10 | 0.706 |
NZ_CP040506_2 | 2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT | 3207307-3207340 | 34 | NZ_LN906635 | Lactobacillus reuteri plasmid p53608_1, complete genome, strain ATCC 53608 | 131546-131579 | 11 | 0.676 |
NZ_CP040506_2 | 2.37|3207309|34|NZ_CP040506|PILER-CR | 3207309-3207342 | 34 | NZ_LN906635 | Lactobacillus reuteri plasmid p53608_1, complete genome, strain ATCC 53608 | 131546-131579 | 11 | 0.676 |
1. spacer 6.2|4526038|30|NZ_CP040506|CRT matches to NC_029005 (Streptomyces phage phiSAJS1, complete genome) position: , mismatch: 6, identity: 0.8
gcagaaacaacagccagaacagcggccaga CRISPR spacer gccggagttacagccagaacatcggccaga Protospacer ** *.*.. ************ ********
2. spacer 2.27|3208612|34|NZ_CP040506|CRISPRCasFinder,CRT matches to NZ_CP014068 (Enterococcus gallinarum strain FDAARGOS_163 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.794
tgtataccggtgtcaatcaggaaggacagaccta CRISPR spacer tgtatgccggtgtcaatcaggaagaacgaacaac Protospacer *****.******************.**..**
3. spacer 2.57|3208617|34|NZ_CP040506|PILER-CR matches to NZ_CP014068 (Enterococcus gallinarum strain FDAARGOS_163 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.794
tgtataccggtgtcaatcaggaaggacagaccta CRISPR spacer tgtatgccggtgtcaatcaggaagaacgaacaac Protospacer *****.******************.**..**
4. spacer 6.2|4526038|30|NZ_CP040506|CRT matches to NZ_CP019297 (Vibrio campbellii strain LMB29 plasmid pLMB99, complete sequence) position: , mismatch: 7, identity: 0.767
gcagaaacaacagccagaacagcggccaga CRISPR spacer gcagaaacagcagctagaacagcgatagca Protospacer *********.****.*********.. . *
5. spacer 6.2|4526038|30|NZ_CP040506|CRT matches to NZ_CP020081 (Vibrio campbellii strain 20130629003S01 plasmid pVCGX4, complete sequence) position: , mismatch: 7, identity: 0.767
gcagaaacaacagccagaacagcggccaga CRISPR spacer gcagaaacagcagctagaacagcgatagca Protospacer *********.****.*********.. . *
6. spacer 6.2|4526038|30|NZ_CP040506|CRT matches to MW084976 (Bacillus phage Kirov, complete genome) position: , mismatch: 7, identity: 0.767
gcagaaacaacagccagaacagcggccaga CRISPR spacer gcagaaacaacagacggaacagcttataca Protospacer ************* *.******* .* *
7. spacer 6.2|4526038|30|NZ_CP040506|CRT matches to NZ_CP022991 (Paraburkholderia aromaticivorans strain BN5 plasmid pBN1, complete sequence) position: , mismatch: 8, identity: 0.733
gcagaaacaacagccagaacagcggccaga CRISPR spacer tcacgaacatcagccagaacagcggcgtct Protospacer ** .**** ****************
8. spacer 6.2|4526038|30|NZ_CP040506|CRT matches to MN693358 (Marine virus AFVG_25M395, complete genome) position: , mismatch: 8, identity: 0.733
gcagaaacaacagccagaacagcggccaga CRISPR spacer aatgatacaacagccagaacaacggcgggt Protospacer . ** ***************.**** .*
9. spacer 7.2|5395727|35|NZ_CP040506|CRISPRCasFinder matches to CP046512 (Bacillus cereus strain JHU plasmid p1, complete sequence) position: , mismatch: 8, identity: 0.771
tcagtttttaataattctttca-----ttccgtcctgaaa CRISPR spacer tcaattttcaataattctttcattcatttccctcc----- Protospacer ***.****.************* **** ***
10. spacer 2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT matches to NC_013940 (Deferribacter desulfuricans SSM1 megaplasmid pDF308, complete sequence) position: , mismatch: 9, identity: 0.735
aggatatgaaatacaaaaataaagaggggtatta CRISPR spacer tagaaatgaaatacaaaaagaaagaggcaccttg Protospacer .** ************** ******* .. **.
11. spacer 2.37|3207309|34|NZ_CP040506|PILER-CR matches to NC_013940 (Deferribacter desulfuricans SSM1 megaplasmid pDF308, complete sequence) position: , mismatch: 9, identity: 0.735
aggatatgaaatacaaaaataaagaggggtatta CRISPR spacer tagaaatgaaatacaaaaagaaagaggcaccttg Protospacer .** ************** ******* .. **.
12. spacer 3.17|3219575|34|NZ_CP040506|CRISPRCasFinder matches to MN033296 (Leviviridae sp. isolate H4_Rhizo_Litter_20_scaffold_389 RNA-dependent RNA polymerase (H4RhizoLitter20389_000001), hypothetical protein (H4RhizoLitter20389_000002), and hypothetical protein (H4RhizoLitter20389_000003) genes, complete cds) position: , mismatch: 9, identity: 0.735
ggattgattgttctggtggtgcttttagcatttc CRISPR spacer gcccttatcgttctggtggtgctttcagcacgac Protospacer * .* **.****************.****. *
13. spacer 2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT matches to NZ_CP014607 (Endosymbiont 'TC1' of Trimyema compressum strain not applicalbe isolate TC1 plasmid pTC1, complete sequence) position: , mismatch: 10, identity: 0.706
aggatatgaaatacaaaaataaagaggggtatta CRISPR spacer ataatatgaaaaacaaaaataaacaggtagctag Protospacer * .******** *********** *** . * .
14. spacer 2.30|3208807|34|NZ_CP040506|CRISPRCasFinder,CRT matches to MN284895 (Mycobacterium phage Marshawn, complete genome) position: , mismatch: 10, identity: 0.706
gatattatggcgacaaacagggagctgccggaca CRISPR spacer gcctggatggcgacaaacagggtgatgccgcgct Protospacer * . **************** * ***** .*
15. spacer 2.37|3207309|34|NZ_CP040506|PILER-CR matches to NZ_CP014607 (Endosymbiont 'TC1' of Trimyema compressum strain not applicalbe isolate TC1 plasmid pTC1, complete sequence) position: , mismatch: 10, identity: 0.706
aggatatgaaatacaaaaataaagaggggtatta CRISPR spacer ataatatgaaaaacaaaaataaacaggtagctag Protospacer * .******** *********** *** . * .
16. spacer 2.60|3208812|34|NZ_CP040506|PILER-CR matches to MN284895 (Mycobacterium phage Marshawn, complete genome) position: , mismatch: 10, identity: 0.706
gatattatggcgacaaacagggagctgccggaca CRISPR spacer gcctggatggcgacaaacagggtgatgccgcgct Protospacer * . **************** * ***** .*
17. spacer 2.7|3207307|34|NZ_CP040506|CRISPRCasFinder,CRT matches to NZ_LN906635 (Lactobacillus reuteri plasmid p53608_1, complete genome, strain ATCC 53608) position: , mismatch: 11, identity: 0.676
aggatatgaaatacaaaaataaagaggggtatta CRISPR spacer ttctttccttatataaaaataaaaaggggtatta Protospacer * . ***.*********.**********
18. spacer 2.37|3207309|34|NZ_CP040506|PILER-CR matches to NZ_LN906635 (Lactobacillus reuteri plasmid p53608_1, complete genome, strain ATCC 53608) position: , mismatch: 11, identity: 0.676
aggatatgaaatacaaaaataaagaggggtatta CRISPR spacer ttctttccttatataaaaataaaaaggggtatta Protospacer * . ***.*********.**********
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1137171 : 1161820
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP040506|1137171:1161820|DBSCAN-SWA GCTATTCTACGGTAACGAATGTCGGATACGAGTCGAGTCCTTCTTTGCTCTGAGAGCGAATGAATTCCGTTACCCGAGCTTTCCCCTCAATTCCGTATTCATTCACAATCTGTACCATATCGCCTAAGAAGAAATCCTCTCCATATCGGTACATCCTCGTTGATTCAACCTTTCCCTCAAAGGATTTGGTTGCGATGTTCTCAGCCAGATTCTCTAAACCTCTTTGAGAAAGCTGTGCATTATACTCAGTGTCCGTCAAGGTTTCATTATCCACGGTCGAAGAAACATCCCTAGCATCCGTATAAAGCTCCCTTCGATTCAAACCTGTTCCGGCACCAGATGCACAAGCCACGGTTGTAGTCCTCCGATCAGCTCCTTCCCCCTCTCCGGCAACCAAAGTAACTGTTTTTAAAGTCTTCTTTGATTCCAGATAATTGGTATTGATTACATTCTCAAATTTTGGAGAAAAGATGACATATGGATTCGTAAACTGGTCGTAAGAACGATCTGCGCCTGCATAGAGCTTAAAGACGAACTTGTTATCATCGGACAGCTTGATTCGGAAACCGACATTCTTGGAATCGCACAGCTTTTTAATGGCATCATACAGATTGTCTCCGGTAAACTGTGCATCTACCGTCAGTCCGGTAATCGCCGGGTCCGTGGATGCCTCGAATATCAGTCCTTCTACCTTTCGGGAAGCATCGGAAGGATTGATGATATTCTCATCCAGCAGCTTTTTGATTCCATTTTGAAAGTTTCCGCTCAGAATCGTTTGCTTCCAAATAATGCGGCGCTCCAGAATGGATTCCAATGACCTTCCAGTGACCGTAAAGTGGTTTCCGTTTTCGGCATCAGACTCAATCTTTCTGTCCTCGACAATCATGGTCTGGTCGGATTCTTTCAGCCAGAGATAGTAGTCGTCTTTCAGGATTTCAAGAACAGAATCGTTAATGCTTGTATATACCTCGAAATCTCCATAGGCAGAATACCGCTCCGTCCATATCAGAGACTCAAAGGTATCAAGCACAGAAAGCATTTTCAGAAAAGTGTCCAGAACAATCAATTCCATAACTATACCCCCTCAAACGCTGTTCTGTTTTCAATCTTAAACTGCACATTGGTCGTTCCTTCTTCCACCACATAAGCGAAAATATTATCGCCTTTGGATAGCTGAAACCAGTCAGAATCTTTATCAAGGCAGTTTAAAATATTGGTGTAGATACCGTTTCGAAGAAGCGTAATTGATTTATCCCCTTTAATGGTGGAGATAATGATTTCATCGCCGGCAACCATTCCAGAACCGGTTAGCTGCTCCAATTTATCTGTATCAATACGCATTACCTCTCTCGTCCCGGTATTGTAAATCGTGATATTTCTCACATTTCCGATGGCATGAATGGTAATCACAACCCCGATCTCGGCATCACCGGAGTAATATACCGTCTGCTCGGTTTCATTCTTAATCTCGCCAAATTCAATCAAGGACTCGGTTAAAGATTCATTCGAAAAAGCGAACTCAAACAGAGGTTCCACTCCATAGAAGATAGTGGTATTAGTTCCATCCGGACCAGCAGAATAAAAATAAGGATCAGGACACACGATGGAAATCTGCGTCGTCTCATCGCTGCTGAAAATATCTGGTTCATTTGATTCCACATAACCATAAGTCTCACAAATACGATTATCTGTCTCTATGAGAAGCGTTACTTTCTTCTTTATCGGAAAGTATTTGTAGGAGTCATGTCTTGTGTCTTCAATCTGAGGATTAAACATCAGTTTCAGAGACATAACAATATTTCTGGAATTTACTCTTGCCGAGTTATACAGCGATCCGTCATTCGTAGAGATTTCTGTCGTGTTAATATCTGCTTTGCTCGGTCCCAATCCGCTGATAGATTGAACGGCGAACCCGGATTCCTCCGGGAACGCTAATTCAAATCTTTTTGATTCGCCCAAATAATTAGTTACAGTTACTGCTCTAATCATGTGTTACCCACCAGCCCTTTCATCGCCGAAAATTGATTCTTTGTCTGCCGATAAATATCAATTCTCGACAGAGCCTTAGGCGAATAATTGTTTTGCGTGAATTGATAGGTATTTCCTGTAGGAGAACTTTCTCCATTTTGAACTTCTATCTCGGAAACTCGGTCATTCATCCCAGTGCTGACAGATAATGCCTGATTTCTGCTAAACAAAGTATTCAACCTTCCGCTCCCTGCTTCTACGGCGGATAGGTCAAGAACCGGTCGAATGGTAGGCTGAACATCCATGTCTGCATCTACATAGTCTGCAATCCTGGAAATGATATCATTCAATCCATCAATAGAAGATCTGGCAATTTCCCGTCCAGCCTTTCCAGCCTTGGATACATTGTCAATCAACGCATTTATGAAACCGACTCCTGCAAAGTTACCGATTCCGTAAAAGCGTTTAGAAGGAGAATGCTCGTCCAATTCGTCTTCCGCTGCTTCAGCGGCTGCGGCTGCCATGGCTCTTGCTTTTGCTTCCGCTTTCCAAGTATTTTCGCTGATACCATCACAGAAACCATCGACCAAGTATGAACCGGCAGATTTGAACTGACTATAATAGTCTTTGATGGCGGTTACAGAGCCACTCAGCGTAGTTGTAAAAGCTGTTCGGAGTTCACTATCTTTGCTTCTCACACCGGCGATAAATTTAACCATGCACTCTCTACCAGTCGAGGTAAACTCCGCATATTTATTTTTAATCACCGTAAGACAAGCGCTGATAATGTTTGTAAACGCCAGCCTCGCACTACTGTCCTGCGATCTGACACCGGCAATAAGCTTCACCATCGTCTGGGTTCCGGTTGACGTAAATTCCCCGTACTTATTTCGTATTACAGTCAAACAACCGCTAACGATATTGGTAAAGGTTGTTCTGGAAGAACTATCCTGAGACCGTACACCGGCGATAAATTTAACCATAAGCGTGGAACCGCTGGTTTGGAACTCGCCTTGTTTTCCGTTGATCGCCGTCAAGACAGTCTGAACCAGCGTAGTGAATGTCGTTGTCAGTTCGGATTTCTTCGCATTTGCACCATTGATGAAGGACGACAACATACTTGAAGCCGCGGCTGTTACTTTCGATTCTGCATTATTGAATGCATTGATAAATCCGGTTACACCGGTTTCACCAAGCGTTGTCAACGCAGAGCTGAAGGAAGTCATACCGCTTGTATCCAGACCAACCATCCCATTTGCCATACCGACAAGCCGGTTTGTCTGGGTGATCACTCCGGACAACAACGTCGTATCAATACCACTGATGCTGTTGTAATAATTACTGAAATGGGAACCGAACGATGCCATATCACTACCAAAACTGGCAAGTGTCATATCATCGGAGAACCATCCGCCTTCTTTTGGAAGACTTTTCTGAAGCTCAACAATGGATGTAGCAGCATTGGTTGTAGTGGTAACGATGTTCGCATCCACATCTTTCATATAGTCGGAATATTGTGCGAAGCTCTTACCAAAGGAAACCAAACTTGTGCCAAAGGCGGCAATATCATTGTCTCCGGTAAACCAGCTTACCAATCCACCCGTATTCGGTAACGTATTCGCCAACTCAACTACTGCTTTGCCAGCCGTTGCGGAATTTGTAACGGCCTCCACATCAATACCTGCAATTGCGTCAGAGTAGGATTTCATCGCTCTGCCAAATGGCACCAACTTCTCCCCGAACGTGTCCATGTCGTTCTCTCCCGTAAAGAAGCCCACGACACCGCCACTGTTGGGAACAGTATTTGCTAATTCGATTAAAGCCTTTCCCGCAGTAGCAGATTCCACGATTACATTCGCATCCAAACCTCTTACCGCCTGAGAGAACAGCATCATCGCCTCGCCAAACGGTACAAGCTGCTCGCCAAACGCATCCATATCATTTTCGCCAACAAAGAAACCTACCACGCCTCCAGAATTCGGAATTGTGGTTGCCATTTCAGCCATAGCCTTTCCTGCGGTAGCAGCATTCGTTACAGTATCTACATCCAGTCCTCTTACGGCATTTGCAAACCCCATCATTGCTTCGCCAAATGGAATAAGCTGGGCGCCGAAAGCACTCATATCGTTCTCTCCCGTAAAGAAGCCGATGACTCCTCCGGAATTCGGAAGGGTTGCCGCCATCTCCGCAAGTGTCCTTCCCGCCGTAGCCGCATTTGCCACCAATTCCCCGTCCATACCAGCAATGGCGATAGAGAAATCTCGCATCGCTTCACCGAATGGAACGAGTTGAGTAGCAAAGTCGCTCAGAGAAGATCCTCCTGTAATCCAGGAAGTCAATCCGTTCAAAATATCGGCAGCGGTCAGAATAAGAATTGTTTCTGCCAATGCTTTTACGCCGTCCAGCATAGAGGGATCAAGCTGTGTAGCACCCTCGATAAATGGCTGCACATTCGTCATAAACGCAGAAAGATCTGAACCAATTTGAGGGAATTGACTTGAAACTCCAGACATGAAACCGCCGACAATTCCGCCAACAAATTGACCGATAGCCGTACCAATTCCCTGAAGAAGATTTCCGCCCTCACCGATAAGCCATTCCAACCCAGGAATCTGAGCCAGAGCGCCGACAGCCGCCAGAACCAATGCCAACTCCGCAATGACTGCACCCATTCCGAGAACACCAACCATAGCCCCAGGTACCAAGGAAGCAACAGCACTGAGAGCAAGCATAATTGCTGAGAGCAAACCAATTCCAGCGATTCCTTTGATGAGTACATTCACATCAATGCCACTCAAGGCGTCGATTACCCCGTCAAAGAAAGCCATCAGTAACTCTACGCCAGCTTTAATCAATTCCGGTAGTTTCGTTGTGATAGCCTGAATAATCCCAATCAGAATATCGAATAACTGCTCCACGATGGTCGGTGTGTGTTCGACCAGAGCCGAAAGGACACTGTCGATCAGGACAAATAGCCCGTCCACAACTGCTGGTACAGCCGTAACCAAAGCATCGACTGCGGCAAGAACCAATGCTGTAAATGCCTCGGCAATAGCTGGCCCGCCATTTGCGATTACTCCAGCAAGAGAAAGGATTCCTTCCCCGATAGATTCGAACAGCAACGGAATCAAACTGAGAATACTGGATACTGCCACTACTAGAGATGCTGCTCCCGCCGCTCCAGATACTGCCAAAGCAGAAAGTCCAGTGGAAAATGCGAGAATGCCAGCACCTGCGGCCAGACATCCTACTCCCAACACAGCAATGGCGGCCGAAAGTCCTAAAATAGCTGGGGTCAATGGCCCTAATGCCACTCCTGCGACACCGAGAACCGTGAAAGAACCTGCCAGTGCCACCAACCCTTTGGCGATGCTCTCCCAAGACATATTCCCCAATGACTTGAGAACCGGGGTAAATATCGCCAATGCAGCGGACACGGTAAGAACCGCTGCCGCACCCGGAAGTGCAGTTTTCATTGCGTTGAGTGCCACAACAAGAATGGTCATGGAACCTGCAAGGGTTACCAGTCCTCTGGCGATTTCATCCCAGGACATTCCGCCCATATTTCGGACTGCTTCGCCGATAATGAGTAATGCTGCACCGACCTCTACCATTCCAGTCGCTTTCGACACCATTCCTTTTGGAAGTAGATTCATCGCAACTGTCACGGCCGCCAGAGAACCGGCCATCGTGGTAAGACCTCGTCCAATCTCTCCCCAAGTCAGGTTCCCCATCTTTTTCACTGCTTCTCCAAACACGAGCATGGCTGCTCCAAGAATCGTCATCGCTGTAGCGGTGGAAACTACATGCTTCGCGTTAGCCGTAACTTTGGTGAATACCGCCAGTTCGGTAAGAACCACGGCAACCGCAGATAGTCCTTGAATCAGGTTTGAAGTGTCCAGATCTCCAAATGCCTTAACCGCATCCGCCAGAATATTGATGGACGCTGCAAGAAGAACCAATCCGGTTCCTTTCAGAACACCCATTCCATCCAAATCTGTAGCCTTCAGGAACAACGCCAGTTCTGTGCAAAGAACGCCGACTCCGATTAGACCTTTAGCCAAAGAGCCCACATCCAAAGCTCCTAAATCTTCAACTGCTCCTACAAGAACTCGAATCGCTGCCGCAAATACTACCAAACCGGCAGAACCTTTTATCAGCCCCTTCGATGTTTTGGAAAGCGCTGTTGCAGACGCTACCAGAATAGCAGATAACCCAGCAACACCGACCAATCCTTTCAGAAGCTCGTCCCAATCCAAACCAGATAGTTTCTGAACTGCGCCTGCAAGAATAAGAACGGCAGTAGACATCCCAATCATCGCAATGGTCAACTGTCCCATTCCTTTGATTGCTGCTCCGTTCATTATCTTTTCAAAGATGGCCATTGAACCAAGCAGTTCAACAAACAGAACACTCAAAGCCCCCAAGGACGCATTTAGCTTCTCGGAATCAACCAGAGACAATGCCACAATCGCTGCGGTCAGGATTGCCATAGCGCCGGCAATTTTCAGAAGAGTCCCGGCCTTTAGACTCGACTGCCATGCTTCGAGGCTCCCCTTAACTCCATCCAAAATATCTTTGAACGAACCAAGAATTCCGCCGCCATTTTCTGTGATCTCCGATAAAGAGTCAATGAATTTTTTCACTCCAATCAGAATTGCAGAAAACAATCCCGTATTGATTAAGTCCAAAATCGGATCGAAACTCGCTGTATCAAATGCTGTAAGAATTGCTTCTCCAAGGTTTCCAAACGCATTTGCGACAATGGAACCCAGCTTCGATAAAACAGGCGCTACCTTCTCGACAATCCCAATAATTCCTTCAAATGCCTTCTTTACCAACTCTCCTAATTTTACAAACGGTTCAAACCGAGTCTGTACCTTATCCGCAAAATTATCGAGACCACTGGTATCAACATTCGCAAACTCGCTGAACGCATCGGCAACTGTTTTTACAAAAGTCTTTATTCCATCCGCAATTGGTTTCAGGAAATTCCCGATTCCTTCGATAGCTTTGTTAAAGGCATCAGACGATTTAATAGCTTCATCAATACCAACAATGAAATCTCCAACGCTGGCTGTAAACCCAAGAATTCCATCTCCGGCCGGAGCCACATAACCAATCAAATCTGCAAATCCACCAACAAGCGCTTTGACACCTTGAAGTCCAATATCAAATAAAGCGAATACCCCTTTGAATGTTCTCTTCAGGTTATTCGCTGTTTCTTCGCCCATTTTGAATTTTTCTGTTAGTTCCTGTAATCCTACAGTGAGATTGTAAAGCTGTTCTCCAGTCATCGGCGGAAAGACTTCTCTAAACGCTTCTTTAACTGGCTTTATAATGCTTAAAACACCCTCGAAAGCATTCCTTACTGCTTCAATCAACGCAGTTCGTCCGCCAAGATCTTTCCAATCCTGCAACATCTTATTTCTCGCTTCGGCAGAAGCATTTACCATGTTGCCAAGGGCGTTGCTAACCTCAGTTAAAAGTTCTTTTGCCTCTTCGAAATCACCAATGATGATTTCCCAACTCTGAGTCCAACCAGACTGAACAAATTCTTTCAAGGTGTCCCATAACTGCGTGAATGTCTTTACCTTGGTAGCTGCATCCAATGCTGTCTGAGCCAGTTTTGTAATCTCTTTAGCTTGTTCTTCCGTATATCCCTGAGCGATAAGATCTGCTTCCGAGTAAGCTCCAGACAACTGTGTCAGAGTTTCGGTCAACACCTCTGTTGTCAGCCATCCACCTTCGGTCAGAGAAGCTCTGAATGAACCATACTGTTCAATCATCGCGTCCATGTTGGTTCCGAAATGTTCTGCGGTTCGAGTTAAGGCATCTTGAAATAGCTGACCGCCCATTCCAGCATTTACAACGGAATTCCAATCTTGCAAACTAACCTTGCCCGCTGCAATCGCCTGCGAAAGCTGATACATGGCGGTACTGGCCTGATAAGCATTAGAACCCGAAGCTGCTGCTAAGTTGGCAATACCTTTGATTGACGTTACTGATTTATCCAAGTCAACACCGGCAGCCGTGAAAGTACCAATGTTACGGGTCATTTCCGTAAAATTGTAAATCGTCTGATCAGCGTATTTGTTCAACTCATCAAGAGCAGCATTTACCTGGTCAATGGTTGTCCCCTTGCTCTGTGTATTGGCAAGAATAGTCTGAACTGCATTGATCTGTGTTTCATACTCCTGAAATCCAGTCTTAATCGGATCGATTGTCAGTGCAGAAACAATATTTTTACCAGCATTTAACGCTGAATTTGTGATGTTCGCCAGAGCCGTAACTGCCATGACTTCCAACGCCGAAAATCGCATCTTTACTGTTTCGACCGCATTGGAAAGCGGAGTCATATTGCAATTTTTTGCTGCGGCATTAACGTCATCCAATCCTTTGGAGGCACCTTTTAAGTTTAAGCTTTTTTCGAGCTTTTCAATTGAAGATATACTGGTCTGAACATTCTGCTCAAACTGTTTGTTATCGAATCGCATTTCAACGACTCTTTCATCAATCGTCTTACTCATAGCTTAGTAACCTCCTTCCATGCGTTATTTGCAATTTCGTCAAAAATAGGCTGGATAGCAGGATTGATGTAATCTCGCCCCTGTACCCAGCCGCCGTTTCGAGTTCCGTGTCCATACTGCAAAATAACAGCGATTGGAACTCCATTTTGAACATTTGAATTATGAAATGAAATTGTAACCGAACCTTTCCGATTCTCAATTTCGTAATACCAGGAATTCGCCGTCTCCCCAGAATCCACTGGTGTTGCAGACGCAAGGGCGGCTACTCCCTCTTTACCAAACTTGTCCAGGTCTCCGATGTGAACCGCTTCTTTTGCTCTCTCCAGAAAGCGTGTCAACTTGGAGAAGTCACCCTTTTGTCTGAAACTTATCATCGTGTATCCTCTTTAAATTCGAGTTGCGTAATCCAGTGAGATCCATCCTGCTCCAGATTTCAACTTACCCCATCCAGCATCAGAACCAGCGCCAGTTTTAACCTCGACGATGGTGTAGACTCCTTTCGGACAGAAACCATTGGTTCCGTAATTCGTTCCGGGACCTTTACGGATATACAGATCGGAAATGTCCACCTGAACCAGAAAATTTCCAGAAGGCTTCTCTACTGATCCACCAGAATCAGAAGAAGCTGCGCCTTTATAGGTACAGTAAGCTTCATGAACGCTGATCCATCCTGCACCGGATTTCAACCTGCCCCAGTAACCGTTCTGAATTTCAGTAATCGTGTAAGTACCTCGGTCAGTAATCATCCCATTGGTTCCGTAATTCGTCCCAGGGCCTTTGCGAATATTCAAATCACCGATGTTGACTTTGTAAAGTCCAGTTTTGTAGGTTTTCTGGACGCTGTCAGTCGTACTTCCGCCAAGCTGAGCAGTTACCCGATTCGCAAGATTCCCCAGCCTGGAATACAGCCAATCCCCAGGACAAGATTTGTTGGCAAACCACCGATGAACCGTAAGGATCATCTCGTTTGATTTGGGACTGTAGTTCAGAGATTTGTCCTTGTCGCCAAACCAAAGGAGTTTTGTCTTACCATTTCTCCGGCAGATGTCAACACACAGAGCAACCAGCTTTTCATATACTGCATCAGTCATGGCATACGGATGAGTCATATCACTGGCACATTCAATCGTCACAGCACGCTGATCATTCGCATTGCTGGAAGAACACCAGCTTCTGTTTGCTTCATCCACGCAAAGAACCACACGTCCGTCACTACCAATTCCATAGTTGCAAGAAGCCTGTACGTTACTACTGTCAAAGCAGCCCCCAATAGATTCCGCTGAAAGCTGACCGACCACACAATGCGGAGTGATCCGGTCGATTGAATGCGTTCTGGCTCCACTATGGTTTGGACTTTTTACCGTACAATTCACCAAGCTGCTGTTACTCATAGTAATCACCCTTTCGTATTCCATTTCTTTTTACGGGCCGCATTTAGTGCCGCATTCCGTTTCATAATTTCTCTGCGGCTATGTTTCTTCGGCGGCCTGCTCTTCACATCGCATACTCTTATCAAAGTAAACAGTTTATTGAGATGCCATTTCTGACATTCAAATGGAATATTTAAAGCTATCATCCAGTAATAAACGAGTTCTGCCGTAATCTGCTCTCTGCTTCCCTGCGTTTTTTTCTCCTCGAAAAACCGGGTGGCAGTCATAGGAAGAGCAATATACCGATTGACCTCATTGATATTGCTGTTTGTCAGATAGTTATAAACTTCCGGGTTCACGTTCTGTGTAAGAGTCATACATTTTACATAATCGATGGTTTCTTCCAAAGTTTTTTCCTGCTTCGTCAGAAACGGCTTATTCCATCTCGATTCCCATTTTGAAAGAGAAACAAGAGAATGCTCCAATTGCAAGGTCTGAGCCTTTGTGTAAACAAACTCTTGCTTCACCTCATCCCAGAATTCCGTGGATGGTATTGTGATTCGGAGCATCTCTCACCTCTCCATTAATTCTGAGTGTTTGCTGCAATTGCAGGAGCTGTGGCAGAATTGCCGATGTTCATCACCGCATTCACAAAGTCTGCTGCTGTCTTGTCATTCGTAACCAGTTCCTCAAAGAGAATCTCATAAGCAGGAGATTCCATAAAGGATCTGGAAATCTCCTCAGACTTCATAAAGCGACGGCCATCTTCACTCTTGACACCGTAAGCTTTCTTAATGAGATCTTCAAAGAATTCCATAATCTGGCCGCCATCAGCGCCGGCACCAATACTTTTGAGCTGTACATCATAGCCGCCCTTCACGCTTGTCTGCATCTTGACAATTTCCGGCTTTGACAGATGGAAATAGAAATCCTCTTTTCTTTCGACACCGTTCAGATCGATATAGGGAATGGTTTTCTTCAGCATAATTTTTTCTCCTTTCAAATAAAAAGAAGCCCCGCACATTGAATACGAGGCTTCCTATCGTTATTCTGTTTCCAAGGTAAGCCCAGAAAGACCATAAATCTTCGTGATGCTTTCCCCGTTGTGCGTGGCAGTCACCTTAATGCTCTGAGTATCCTTATTCTTGATAAGGAGTACAATGTTCATGTCGTCATCTAGCGCAACCGGTCCCTTAGTACCGCCTACGAGCTCAACAACCGTCTCTGCTTCAGCCGGCTCAGCCTCAATCTTGAGAGCCAGGTAATTTCCCGACTGCTCAGAAGTATTACTGCTGAAATCAACATAACCATTGACATACTTCAGAGTGCCTGTCACTTCATCATCAGCAACAACCATATCACTCTGCAATTCATTCACTGCTTTTCCAAATAAAACAGCCTCTCCGTCTTCAGGCTTAACAGAAAGGCTCATTAAGGGAGGTCTTCAGCAGCCAGAAGCTCAATCACTTCATCAGGAAACGGCAGTCTGGGATCAACGCCATCATTACCCTCTGGAGTAGTCGGGTCTTTACCATACAGGATTTCTTCCAGAGCTGCCAGCTTCTTCGCATCAATCTTAGTAGAATCCAGTGTAAGGATTGCGGTAGGTTTCAGCTTCTTACCATCGATTATCTTCGCAATCTCCGCCGGAGTCGTGCTGAATTCCCAGGACAGAGCGATTGCCTCCGGGCTGTCATTCACAGTGGTATAGCCTTTCTCAGAAACAGAGGCAAGGCAATTATAAACAAGATGCAGCTTATAACCATAGTCATTGGAATCTACATCATTGCCAAGAATAGTGCGATAAGAAAGTCCAAACTGCTTCCTGCTCTGCTGGCCTGCAAAGACACCAGGAGCGACTTCAACGGAACCGTCACACTCCGCAAATTCATTCGGAGAAGTATAAGCCTCAATCGTTCCGCCGAAATCCTCTGCGGACATCAGGTTCAGATACTTGATGTTATCCGCATAGATCGGGGAGGGCTCCGCCCCGGAAGGGCTCTCTGTCACCGCGCTCAGACCGTTCCAAGCAACACCTTTGTTATACTGTCCACCAGTCTGAATCGGGTAGAGAACACCATGATCAACACCAGTTTCGTAGAGGCGTTCCCCAACTTTATCCCAAATAAGCTTACTCATTGAATTATTCCTCCAATCTTAGAAATACACATTAAAAATGTAGTGATTCAGGTTATCTTTTTTGAAATGCCGGTCAAACCGGCTCATCGGTAAATTCGTTACTTTCTGCACCAAGGATGTATCCGGATCTTTATCAATAACGGTAACGGCATATCTTCGATTAGATAAATATACCCCGTCGTTTGCATATGTCTTGTCTACATCATCAAGGCTATATACAATAGCGGGGTAATTCATCTTAATAGATTCCGGAGGCTGAAAATAACATCGGCACTGTTCACCTTCTATCGGGCAAGATAAAATCTCGCACAACAGTTTATGAAACAGGATTCGTCGATCAGTCATTATACACACCTCCTACCGTCAGAATCAGACGCGGATACTGTACTTCAACACTGGAAATCTTCCACTTCGCTCCCATGAACTCGACATACCGCATTGCGTGAAAATTCTGATAAGCAAAAGGATCGGCCACGATGCTAATCTCATTGGAAATGTTGATATCGTCATTGAGTTTATCGGAAGTCTGATACCGACTGGTATTCCGAATCAAATCGCCGAAATACTCTCGCTCAGTAATTTCCCCGTCCCAAACACCCGGACGAACGTCCTTTGATATTGCATAGCCGATTTTCCCAAAAAACTTTGCCATTTTGAATTTTCTCCTTTACTCTGTCTCCAAAGTCAATCCAATAAGCCCATAAGTCTTCGTAGTGGAATTTTCTCCATTGTCTACCGTCACCTTAATGCTCTGAGTATCCTTATTCTTAATAAGGAGTACAATGTTCATGTCGTCATCAAGCGTAACCGGTCCTTTGGTACCACCTACGAGTTCAACGGTCGCAACTGCATCTTCGGAATCAGCATCAACTTTCAAAGCAAGATAGTTTCCTTCCTGCTCAGAAGTATTACTGCTAAATCCCGTATATCCAGTAACATGCTTCAATGTACCGGTAATCTCAGACTCTCTGATAGCAATATTCTCCTGTAACGAATTTACCGTTTTCCCGAACAGATTGGCTTCTCCATCTTCGGGACTAACGGAGAAGCTGATTAAGGGTTTACCGTTACGTCCTCTTCAATCGCAATGGCGGAGTACACTCTGGTCAGAGCGCCGGAGCATCTGGTTTCCAGAAGGGACTTCTCCTGGTTAAAGTCGATATCAAACTGCGTGAAGTGAGTAACTTCGCCGCCTTTCGTAGCACCCAGAGAGTAGTCATTCAGGTTCGTGATGATGGCAAGCAGCTTCTTGGTCTTGCTGTCGTCCGTCTTACGAGTCTTGCCTTCGAACTGCTCGGCCGTATGGATTTCCCCGACATTCAGAGCAGAAGCCAGCTCTGATTTAGAGCTATAAATGCGGCGGCCGTTCATATCACGAGCCAGGAGCATTACATTGAGCATGTGAGGAGTGATGTACATATCGGGAGTGCCTGTACCCTTATAATCCTCTCTTGCATACAAAACCGCATTGATCATTGCTTCAGCGTAAATATAGTTCTCGCCAAAGTTCACGCCGGTATTAGTACCCTGAAGTTCATTTTTTGCGGCGGCAACATCCAAATCTGCATGAATGGTATACAGGTCATCATCGGTCCAGATAGGTCTGATCTTATCCGGATCGATCTTACCCTCATCACCATCATCACGACCGTCCCCCAGCATCATCGCAATAGCCAGCTCTTCATCGAGCATCAGGCGGTCGATGTCATACAGATACTTTACATAATCGAAATCTGTGATATCGACAATGTCGTCGCGATGCAGAGCATTCTTCACATAAACAGTCTGCGGATCGGTGGTTCTGCGTACCAGCTTGAAATTTCCGGCCTGCTTCTTCTCTTTTCCTTTCTTGTAGCCCCTGGCGCGAAGTGTATCAATACCGCGAATATCGGTCTGGCTGGTTCTGATTCTGGAAATCGGGCTCTTATGTACTTTCTTCATCACATTGGTAATCCAGCCCTGGTCATTGGTAATGAGTTCGGGAGCCCCCGGACGCACTTCCTGATATTCCGGGAACAGACTCGTCACATTTCCGTCGCCGGTCTGAACAAATCCGCCGCTGACAGCATCATGCTGAAGACCGTTCTGCTCCGCATAAAGCTGAAGCGCTGTCTGGAAAGTACCAACCTGGCTGGTCTTCGCCGCCTTGATGATGTCTTCCTGTGCGGAATGCGTCAGAAAGCCACCGGTTTCGTTTTTCTTATCGTTGTCAAACACATTATGCTTCATCTCGGTATTTCCTCCTTTAGAATCGTCATCATTTTTATCTTCAGGCTTATCGGTTTCCCCCATAGCCTGTCCGATCATTGCATAAACCACATTTTTCTGCTTTTCATTGAGGGTATTAAATACCTGCTCAATTGTCTCTTCAACTTCCTCAGGTTTTTCTTCAGAAGTCTTATCTTCCTTAGATTTGGATTTCTCCTCCGCCTTCTTTTCCTCGGATTTGTCATCCTCCTCGGCGGAATGATAAATCATAATGTTCTCGTCATATCCAATAATGGTACGGTCTTCTGAAGTCTCACCGTGAGCCATAACAGAATCAATGAAAGCTCCCGGATTAGCTCCAGCCAGAACAAGGCTCAGTTCATAGATAACGCCATGCATCACACTCGCTCCGGCCTGTTTAAGCTGACCGGCACAAATAGAAAGTGAACGAACATCTCCGTGCTTAACTAGCTTCTTCGCTGCCTGTCCGGATTCACTGTCATTGAAACTACAGTAGGCATAAACGCCCTCATCACGATTTTCCAGTACCCCATGACCGAGTACACGATTGGGATCGGAATGATTATGTCCCCAAATTAGTGGAACAGTTTGTCCATTCTGGTTCTTAAACGCATCCCTTTTGATGGTACGGCCATCGGTGCAAAGAAGATCGTTTCTAGTGGCCCAACCACTAAAATCGTATTTCTCCATTTTGAAAATCACTCCTTCTATCAGTATTGTGCGAAAGCCATTGCTACTTCCTCCGTTTCCTCTTCTTTACCGCTTTATATTCGGAAGCTATCTTGTCAAACTCTTGCTGATAAAGATCTTCATAAGTGGCATCAAGATTTTCTTTTGCCGCTTTATAAGCTTCCCTTGCGGCGGTAACGGCAGCCTTTAACTCTGTACTAACTTTTTCTCTTTCCGATTTAGCATTCGCAGAATTATCAGCCCTTTCTTCTTTGGTGTCCTCGGTAATTCGCTTCTTCTTAAGACTTGCGGAAGTTCTCACCTCTTCCTTTTCAGCTTTCGCCTGCTCACTCACCTTAGATTTATCCTCGCTGGCATCATCACGAAGCTTTGTAATTTTCTCATTTCGCTCCGCTACTCGCTTTGCCCTTTCCTCTTTGGATAACCCGGATGGAATTTCTATTGCCATTAAGCGTTCGATCTCAGTATTCTTCTTTTCATCGATACGCTCTTTCTGGTCTTCTACTTCTTCTCCAATATCCTCCAAATCAGATTTTTTACGGGAGTCAACCCTACTCCTTCTCGACGAAGATTCCTCGGTAAGCTGAGTATTCAGTTCCTTTAATTTAGCCGAGATCTGCTCTCGGGTCGCCTTGGCCTTTGCTCTCAGCTCAGCAATTTTTTGTTTCCGTTTTTCCTGTTCTTCTTTTACCTTTTCCTTCTTCTCACCAGAAATCTCATTTTTTGTATAAGCCCAGACTTTCTTTCCCTCATCGTTAAGCTTCATTGTGGAACGACGCCCTTTGAGTTCTCTGGTTCTCATATAATATTCATGAGCTTTCACCGGGTCGTAATAAGGAGATGCATAGTGTTGAAGAGGCTCGTTAATATCCATTAGGACTCCTCCTCATCATCCGAAACATAGCTTCCTATAATTTCATCAATCTCCTTTTCAAGACCGTCAAGCAGCTCGTTTACGATACTGTCATAATCGGCTCCGGCATCACTTTCATTGGATTCGACATCAGAACTACCGTTTGAAGGATCAGATTTGGCCTCACTGATATTGCTGTTCTTCAGCGCATCAGCTTTTGGATCATCAGACGGTTTCATACCAATAATCTGGCGAATTTCGTTTGATGTCATAATCTCATTTCTTGTGAATTTGTCAGCAATTTCTGACAGATCAGCTACTGGTACAAGTTTGAAGGGGTCACGGAAGAACAGAATCGATTGCTTTTGAGACCTGGCTGTTTTAGTAAGGAACTTACGTTTCATTTCGTCAACGATTGCTGAAATGATCGGCTCAATAGTACGGTTGTAGTAATTCAGCATGGTCTTCTCGTCTGCGGAACCATCCAATATACTCTGAGTGATACCTAACTGGCTGTATAGCATACTCGTTAGATATTCAATCTGCTTCATCAGATTATTTTCCACAGAACGATTCAACTGTGTGATTCGCTCCGTACCATCAGTATACGCAATACCATATTTAGAACCGGCCAACTGACGCTCAATCTCGACACGCCTCTTCTCAGCCTGTTGACGCCTTGCTTCTGTTTTTATCACATAGGGAAGCTGGATAATTAAATCGAGTTTTCCTGAACTGCTCTGCTCATCAACAACGTCCAATAAATTCAGTTTTCTTATCAAACGCTGCATCGTTGAGTTTGGCTCATTCATCACCGCATAAAGCGGATTTTCAATAATGGCGACCGTATCTTTTGGAACTACAATGTCTTCCTTTAATCCAGTCCGCTCATTATAAACTCTTACTTTGATATGATTCGGAAACCATTCCAGAATCTTCCCGGTTCGCATTGACTCGATTTTATAGGAGCCTGTAGTGTCAGGGTCATCATCCGTATCCACCGGGATAATCGCCACACATCCCTCATCGAGCATTGACAAAACCACATCCTGAAGGAAAGCCCGCCCAGTCTGGTCAATGTTGGCTGATAAATTCAGACAATCATTTAGCCCCGAAGAAATTTTTTCAAGAAATCTTTCGGAGTCGTCCAGACGGACATGTTGAATGTTAATTGAAGCGCAATCCAATGCGATTCGATTATATACAGAGGTAACGATAGATCTTTCATTTCCTCTTGTGAGTCTTGGACGGTCGGGCCTGTACGAATATCCAACCCCTATGTCCCGATAGAAACCTGTTGGGTCTCTATTTAAAAAAGCGTTCCAGGCATGTTTAATCCTGGAACCGATTGAAACTTCCATTTTGAAATCGTCACCTCCTATTCGAAAGCATCTCGGTTGAGCTTGAAAGCGACAAACGCATCCATCATAGCTGCCACGGCATCAATCTTTGCGTCATAACGCTTTTTCAGCAATTTACGGTTTCCATTCGTATCTTCCATAACGATGCAGTTCCCCATCGCAAAGGTCATAAGTTCTTCATCAAACAAAAGCATCCGCTCCTCAGAAAGTTTCTTTAACTCTCCTAAAGGAACGGATTCCGTCTTAGCACCCTGTATTACCTTTTCGATTCCAAACGGACCATTTTCAGAAGACCATCTCTCAATGAACTCCTTTGCGTTGTATGGGTCATACCCCAAGCAACGAACGTCATAGCCAAATTCTGTGATATGGTTATCCAAATCTTCATAGACTTCCATCATATCCAAAACGGTTCCCTCTAGGACAATCAGGCTTCCTTCGTCCATGAATTGGTCGTATTTGATTCTCATTGCTGCCGGAAGTTTCATCAGAGTCGATGAAGAAATGTAGTTCCTGGTTTTAACTCCAAAGGAACCATTCGATAACGGGAAAAGGAACGTAAAAGCACAGAAATCATCCCCCTGCGACAAATCAATTCCCAAAGAGCAGGGCATCTGCCAATAGCTTCTCTTCTTATGAGGAAGAGTTTCTTCATATGTGAAGTAATAGGTGTAGCCCTCCATCGGCAATCCAAATCTCTTAGCCAAGATATCGTTTCTGGCCGCCGGAGACTTCTCCGCTCTTTCCACATCGAGTTGATAGGTCTCATAGCTTACCGTTTTACCAATATTGGGATTTGCCTTCAGCCACATATCTGGATTGCCCACTTCATCGATAGAATCGAGTTTGTACCACCAGATGGAAACATGAGGATTGATATATTCTCCTTTGAGAATGTCCATCAACTCCATTTTGATTGTGTCGCCTGCTCCATTTCTCACTGTTCCCTCAGAACTCGTGGCGACAATGATGTAATCATCCAATTTAGACGCACCCTGCTCCAAAGCGCCAACCACATCTTCTCTAGTATCACCGGACAGCCACTCATCCACTGTAGAAATCTTAGGTCGTAATCCCTGAAGCTTTGCGATGGACATTGGCCGCACTTCCAAAAGCGAACCTGTGAGAAAATTCTCAATACCCTTTTTGGTGGAGGCCAATTTCATTCTCTTTGCTTTAGAACCAGTCGTATTCTGCAAAGAGCCCTCTGTCAGAAACCGGAACAATGGACCTCTCGACCTTGTAATCGCAGTGCGAAAAGGTGACATCACCTCATCAGCCTGTTTCATGGTAGGGGCTGTCGTGACTTGATGAGTCGTGGATGTGTCGATATTCAATCCATAGGAATGAACACAGGTGTCATACAAAGATTTAGCAGCTCCTCGCCCAACGATAAGATATTGTTTCTTCGTCAGACGCTGCTTGATTCTTTTATTCACATATCGGCCACCATGTCCGTCAGAACTTGGCTCCCACACACTTCGTTCGACGAAGTAGTACCATCCATAAAGCTGCTCGCCCCACAATTTAAACGAGTCCAGCAAATTCAAATCAGAACCGTCCGTCAATGTTAGTTCTGATTCGCAATAGGCAATCCATCCTTCGACGGCCTGGTCATCATAGTAAATACCAGGATTGGCTATTAGGTCGTCAATTCGGTTCATCTCCATAGAGATTTCTTTACAAACCGGTATCTCTCCCCTGATTACGGCATCCCGAAACATGCCGTAGTATTTGGGAACGGCAGTGTTTGATAATGCCATAATTGAATCACCTACTTGCTTGTTGCTTTCTTGATGACCGCGTCAATTCCCTTCGTCATGTACTTCGATGCATAATTGGTGGCGGTCTGCTTTGCGGCATTGGTCAGCACATCCTGTACAAACTTTCTACCGACAGAAATTTCTGAACTGGTAAGCTGTTTATACTGCTTTTCCATTTGAAGACGGTTGATCTTTGAGCGGAGTTCCGAATCAGACATCTTCTTCACCTCATCATCGGAACTCGTCTTCTTTCCACTTGCTCTTGCAAGTTGTTCGGGAGTTCTTCGGACGCCCCATTTCATCCCAAGAATCCCGTGATGCTGTAGTAATGCTTCATTACTCATTTTGAATCTCCCTCCTTTGCGATATATGATGTCACTCCGTTTGCTGCGTTCCCAGTCTCGTAATACGGAACTTCTGTTACCACAATATTTCGATCCAGAACTTTGTTCTCAGTATCCAGCATTTGAGATTGGAATGCTTTCGGCGTTACCCTATACTCGCCATCGTAGGACTCGTGTTCTTCGGACTTATCTTCGTCAGTTTCCGCTGCAACATTCAGTCTCCACTCCGCCTCAGCAATCATCTTTTCCATAGACGCCATTACAGCGGAACTCAAAGGCGGATCGAACAGGAGCTTTACCTTCATCTGCATATACGACTTTACCAATTGCAACTTTGTCTCGTCAGAAATGAATTCTTTCCATGTAGCACTTTTATCCTGAACAGAGAATCCAGATGGTGGACCAACACCAAGTTGCGTCAAGATCATAAATACTGAATTGATATGTATGATAAGATCTGAATCAAAGTGCTCATACTCTTCTGTAATACCCAGCATCTTTTTAATTGATGTCAGTATGCTTTCCATAATCGCTATAACCTCCTCTCCATCAATGTTTCCAGGGACATGTATCGTTTCTGCTTCGAACAATAGGTTCTGTGACAAGAAGACTTTCATCTCCATAGTGAATGGCATTATGTGTTGTAAGAATTGTTGAGATGAGAAATTCTGGATTTAAAAGAAAATCGCTTCTCTTTAAAATATCCTCCACGGAAATCGGATTCATATGGTGAATCAATATCTTCCCACATATCTCACGACCTTCTATTCCGAGGTCACATCCGTTATCTCTCACAATCACAAAATCACGAACTGACTTCCACTCCATAGACCGATAGAAAATCTGATTCAGATATCGGTCAAACCCAAACGTGTCTGCCCCGATGACTCCGCCCAAACGAAGATACTCGTATCGTTCTTTAAAAGTCTTCAATTTCGATAATTCCGAATATGTCCTAATCATCATCGTTACCCTGTCCACTGTATATACGAAACGCATTGATGGCGTCCTTATAGAGATCTTTAATCTCATCTGTGGAGTCAATAGCTCTTACTTTTGCCCGCAACAGATTGTTCTCTTCCTCCAGTCTCTCCCTCTCAAGCTTCTCTCTGGAAGAGCCCAGTTTTAAATAGTGAGTAATGACCTGAGAAGAAGCAGTCCCTTCCAGCAATTGTCTTTCAGCCAGGTCAACAGCCAGAGAAATCATCTGAAGTTCCCTTGCTTCCGGAGTCAAAGCAGGACGAATCTTTTTGGAAGAACCTGTCGATTCAGAACTCTTTACTTTTCTAGCCATTTACTGCCTCCTTCCCGTCTGTTCTTCAATAGTTTCATAAAAGTTTTCCGGCAGTATTTAAAAGAACCCACAAGGCTGACTGTAACTTTTTTACCGAAAGGAGAAAAAAGAGTAAAAAGAACCACAGCTTATTACTTAGCCAACCTTATGAGCTCTGTTAAATACTGCCGGGAGGTAAAAACATTCTCCGAAAAATACCCCCGGGGAATTTTTAAAGACCGCCGCGATGACGGAGGGGGGTGCGATTTTTGCTACCCCCCCCTATACCATCTGATACCTAGACAGCCACCGCGTCTCGCGTAACTTTTTTGTAAATGTTTCGGAAATCGTATTTTACGATCTCATCAATTGCTCGTTCAACTTCCAAGTCATTCTCTTCATCGGAGAGTTGGTCCGAGGTTCTGGCAATTCTACCAAGATACGAACATGAATGATAACCTTTTTCCTCATCAAACAGTAACCATGAAGTGAACTGTTCAAATGGATCGAAAGGATTGTCAATGGTTGTAAGCATACACTTCTTCGCCATTTACTTTGTTCACTCCTTTCCATTCAGATACTTCGACACAGTAGAACTGGAAACACCCAAAGCCGCTGCTATCTCAGCAGTGCTATAGCCAGAAGCATTCAGCGCCGCAATACGATTTACTTTGGCAGAACTAAGGGTTGTTGTTGCACGCGGAGTAGCTCTCTGTCTGACTGTATCTATGTTTGTATTGTTGAGAATCTGGGTAAGCTTGTTCTCGCTGATAGCGCCGGCTTGAATCGCTTCCCACTCACGATCTGTAATCTCGACAGGGGTTCTCTTGGCCTCAACAGCAGTACGGGCCGCAGTAAGGGCCTGTTGATTAGCCTTCTTGATTTCGGCTTTTGTCATATCGGGGTTATCTTTTTTCTTGGCGGACACAATAGAGTTTGCCATGGTCTGTGCCTGACGCTCGCGGGGGGCGTTCTTTAAAGCCACATTAAGCTTGGCCATAAGAGAATCAACCTCTGTCTGGTAGGTCTGTTTTGCTGAAGCAGAGTAGGCTATCTTTCCGCTATTAACCATCTCCCTACGGGCCTGATTAGCCAGGGACTTCATGGTATTTGCATAGTCCGCATACGCCTCTTCCTGTGGTGTCCCAGAAGATAAACTACGGGCGTCTCTGGTTTCTGCCATCTTGGTACTCTTTTGAGTCCGCACCTGGGTCTTTCCGTTCTTATCTATGTACTCCTCCCTGACGCTCTTCCAACTTTGCTCTCCGGTTTCCTTGTCAATGATCGGGCTTCCTTTTCTTTTCAACACAGAAGTCTCAGACTTGGCACGAGAAATCAGCGTAGAAGCTCCGCCATAACCATCATCGTCATCATGAGCCTGGTACTTCTTCTTCAAAGCCGTAATACCATTGTCCTGCTCACTCTTCTTGTAGTCTAGCTTATGCTTCTCTGCGTCGATGACGACCATACTGTGACGTACAGCTCTTGCAAGCTCATCCTGAGTAGCACCTTTTAAAGTCATATCAGTAATCAAGTTTGAAATCTTACCCATTTCTGTCTGGGTGTTTTTCATAATCTTAATCTTCTGACCGCCGCTGTTACAATAATCGTCACCTTTCTTAACGGTCCCATAAGACATCTTAGGATCGAATCCTTCCAACCCCTTTAATTGTGGAGTAGAAGTAATCTTCACCCTACTATTAGAAGAATTACAAGGAATCACCATGACAGTATCGCCATCGAAGTCAGCTCCGGAAAGACGGTCAGCAACCTTTTTGTTAATGCCAATAGCGTCTGCTGGTGTATTCCCAAGAACTCTTCTTCCTTCTGGCTGCTTATTATTAACAGTTAAGATTGGAATCTCAAAAGTTCCACCATGTGGATACCGCACAAGAGCTACTGTTTCTCCATTCTTGTAGTTCGGAGCATAGACCTCATTGTCCTTGATAGATGTCAATGGAAGAATAACTTGATACTTCTGTCTGGGAAGAGCGGCTGCCTGTAAATGAACAGCGGCTGCATCGCAGTCATCAGCAAAAGATTTCAAAAGAACCTTTTTCACTGTTGGATTCGTAAGAGAACAAATCTCATCAAATTCAGACTGCTTGTCTGCGGCCGCTAAATTCAACTGCTTCTTTATCAAAGTTCTACTCTGTTTCGACAGAAACTGAGATGGAAGCTTGTCTGCCCATTCGCCCCAATCCCCTTCTTCTGCTCGCTTGTTGATAAGGGAAAGCTGTTTCTTCCCATTTTTATCATAGTAGTAACTCTGACCTCCTCTTTCAGTAGTGGGGTTATCCGGGTCATTAACTCCCTCTTTAATTAAAGAACCGAACGGGTTATCCGGGTCATCTTTGATTGGTTTCAGAACATCCATTTTAGGTGTCCCTTTTTTCTTATTGGTGTTGAACATAACATCCACGCCATCAGGAAGGTCATCTGAATAAACCGCCATCCCTTTGATGTAGTGTGTCTTGTCAACTAAGATGCGAACCTGAGCATAATGGGATTCTCCAAGAGACAGGTCGTCAACACCTCTTCGAATTTCCACAACACCATCCTTCAATTCGCCGCCATCTTCCGAATACCGAATCTGCAACCGCTTTGAATCCATGCTTTTGGGATAAACAAATTTGGGATCAAAAGATTCCCCATCATCATGAGAGACATAATCTTTCAGAGAATTGATATTCTCAAAATCGTAAATCTCTTTATGCTCTGTTCCAGGAGGACAAATCACACGAAGCGTTGTTTTCTTCCCAGGATTTGTTACCTGATCCACTCGTCCGCCGTAAACAGGATAACCCTCCATCTCCAGCATATAGAGTGCTTCATTCAGTTTCTCCTTGGAAATTCCAAGTTCACGTTCCACACCGGCACCAACATCAATCATGCCTTTTTCGTCAATTTGCTTTTTGATAATTTCAGCAGTTGTCTTAGCTTGGTTCATACGAACCTCAGAATTTTCATTCAAAAGCGAACGAACAGAAGAGTCGTTTGCGAAGCCCATTTCTTTCGCAATTTCATTTAAGCTCAAACCATCTTCTCGAAGAGATTTTGCTCTCGCCACATCCAGCGCACGACGTTCATCTTTTGCTAACGATTTCTGCGTACGGTATTGAGTGGTGGTTAAACCCATGGCTTTTGCAATTTCGGTATCACTCATACCCTGACTTTTCAGTTCATCCACTCGACTCAGAAAATCTCCACTATGCTGATACGGATTATCACCAGATCCCCACGGATAACGACCGGAACGGCGGGGCATTCCATAATGCATCAAAATTTCTTCTGCAATCGGATTCATAAGTTAGCCCTCCTGTTCTTTGATTTTGTTGATTACTTTGTCAAATGTGATGATCTTGTCCATGATGGGAACAATGGTTTCAGCCGTTGGATTCTCATAAAGAATCTGGTTGTTCTGATAAATCCGAAGTTCCATTTCAATGTCGGCTGGCTTGATTTTGTATTCCAAACAAAAAAGAGCAGCATATATTTCAAGCTGCTCCATGTGCGCGGGAATGACGCCGGTTTTTAAATCATGAATGCGAAGCATCCGATTTCGAAACGTAATCGCATCTGTTGTTCCAAAACAATTTTCCGAATAGAAAAGAGGTTGTTCGGGAACCATCTTGAAGCCAATGGCATCATTTACATACATATTCAATGTTTTCTGAGACTTTGGAAGTTTCTGCCCAAGTGAGATACACTTTGCAGCAAAATCGTGAAGTTCTGTTCCCTTTTGAGTTGCCAGAAATTTTGAATACGATTCTGCAACTTTGGATTCATCATAGTTGATCCAATGGTATTTACTTGCGCCAAGAAAGGCGTGTTGCCCTTCAAGAGCAGAATGCTTGTTGAAGATCATGTAACACTTCCTCCTTATTTTCAGGACAAATAAATCTTGAGAACGACATCTCGTTCATTCGTCCAACATAATATTCTTGGTTCGGTTGTTTCTTGGCGCGAACGCTTTTTTTACATTCTAAAGTGGCCCACTTATCGTTATAAAGAATCAGCAAATCAGGAATTCCCTGGATATGACTGGAATCCAGTTTTGTTACGATACAGCCTTTGAACATTCTTTTCAGTTCTTGAATCAATTTGTTCTGAAATTCGCTTTCCAACAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP040506|1137171:1161820|1161000_1161558_-|WP_006781007.1|DBSCAN-SWA MIFNKHSALEGQHAFLGASKYHWINYDESKVAESYSKFLATQKGTELHDFAAKCISLGQKLPKSQKTLNMYVNDAIGFKMVPEQPLFYSENCFGTTDAITFRNRMLRIHDLKTGVIPAHMEQLEIYAALFCLEYKIKPADIEMELRIYQNNQILYENPTAETIVPIMDKIITFDKVINKIKEQEG >NZ_CP040506|1137171:1161820|1150230_1152096_-|WP_006780997.1|head,protease|DBSCAN-SWA MEKYDFSGWATRNDLLCTDGRTIKRDAFKNQNGQTVPLIWGHNHSDPNRVLGHGVLENRDEGVYAYCSFNDSESGQAAKKLVKHGDVRSLSICAGQLKQAGASVMHGVIYELSLVLAGANPGAFIDSVMAHGETSEDRTIIGYDENIMIYHSAEEDDKSEEKKAEEKSKSKEDKTSEEKPEEVEETIEQVFNTLNEKQKNVVYAMIGQAMGETDKPEDKNDDDSKGGNTEMKHNVFDNDKKNETGGFLTHSAQEDIIKAAKTSQVGTFQTALQLYAEQNGLQHDAVSGGFVQTGDGNVTSLFPEYQEVRPGAPELITNDQGWITNVMKKVHKSPISRIRTSQTDIRGIDTLRARGYKKGKEKKQAGNFKLVRRTTDPQTVYVKNALHRDDIVDITDFDYVKYLYDIDRLMLDEELAIAMMLGDGRDDGDEGKIDPDKIRPIWTDDDLYTIHADLDVAAAKNELQGTNTGVNFGENYIYAEAMINAVLYAREDYKGTGTPDMYITPHMLNVMLLARDMNGRRIYSSKSELASALNVGEIHTAEQFEGKTRKTDDSKTKKLLAIITNLNDYSLGATKGGEVTHFTQFDIDFNQEKSLLETRCSGALTRVYSAIAIEEDVTVNP >NZ_CP040506|1137171:1161820|1146054_1147059_-|WP_006780989.1|DBSCAN-SWA MSNSSLVNCTVKSPNHSGARTHSIDRITPHCVVGQLSAESIGGCFDSSNVQASCNYGIGSDGRVVLCVDEANRSWCSSSNANDQRAVTIECASDMTHPYAMTDAVYEKLVALCVDICRRNGKTKLLWFGDKDKSLNYSPKSNEMILTVHRWFANKSCPGDWLYSRLGNLANRVTAQLGGSTTDSVQKTYKTGLYKVNIGDLNIRKGPGTNYGTNGMITDRGTYTITEIQNGYWGRLKSGAGWISVHEAYCTYKGAASSDSGGSVEKPSGNFLVQVDISDLYIRKGPGTNYGTNGFCPKGVYTIVEVKTGAGSDAGWGKLKSGAGWISLDYATRI >NZ_CP040506|1137171:1161820|1148083_1148299_-|WP_034859381.1|DBSCAN-SWA MALKIEAEPAEAETVVELVGGTKGPVALDDDMNIVLLIKNKDTQSIKVTATHNGESITKIYGLSGLTLETE >NZ_CP040506|1137171:1161820|1138244_1139159_-|WP_006780986.1|tail|DBSCAN-SWA MIRAVTVTNYLGESKRFELAFPEESGFAVQSISGLGPSKADINTTEISTNDGSLYNSARVNSRNIVMSLKLMFNPQIEDTRHDSYKYFPIKKKVTLLIETDNRICETYGYVESNEPDIFSSDETTQISIVCPDPYFYSAGPDGTNTTIFYGVEPLFEFAFSNESLTESLIEFGEIKNETEQTVYYSGDAEIGVVITIHAIGNVRNITIYNTGTREVMRIDTDKLEQLTGSGMVAGDEIIISTIKGDKSITLLRNGIYTNILNCLDKDSDWFQLSKGDNIFAYVVEEGTTNVQFKIENRTAFEGV >NZ_CP040506|1137171:1161820|1152969_1154310_-|WP_006780999.1|portal|DBSCAN-SWA MEVSIGSRIKHAWNAFLNRDPTGFYRDIGVGYSYRPDRPRLTRGNERSIVTSVYNRIALDCASINIQHVRLDDSERFLEKISSGLNDCLNLSANIDQTGRAFLQDVVLSMLDEGCVAIIPVDTDDDPDTTGSYKIESMRTGKILEWFPNHIKVRVYNERTGLKEDIVVPKDTVAIIENPLYAVMNEPNSTMQRLIRKLNLLDVVDEQSSSGKLDLIIQLPYVIKTEARRQQAEKRRVEIERQLAGSKYGIAYTDGTERITQLNRSVENNLMKQIEYLTSMLYSQLGITQSILDGSADEKTMLNYYNRTIEPIISAIVDEMKRKFLTKTARSQKQSILFFRDPFKLVPVADLSEIADKFTRNEIMTSNEIRQIIGMKPSDDPKADALKNSNISEAKSDPSNGSSDVESNESDAGADYDSIVNELLDGLEKEIDEIIGSYVSDDEEES >NZ_CP040506|1137171:1161820|1139155_1145668_-|WP_006780987.1|DBSCAN-SWA MSKTIDERVVEMRFDNKQFEQNVQTSISSIEKLEKSLNLKGASKGLDDVNAAAKNCNMTPLSNAVETVKMRFSALEVMAVTALANITNSALNAGKNIVSALTIDPIKTGFQEYETQINAVQTILANTQSKGTTIDQVNAALDELNKYADQTIYNFTEMTRNIGTFTAAGVDLDKSVTSIKGIANLAAASGSNAYQASTAMYQLSQAIAAGKVSLQDWNSVVNAGMGGQLFQDALTRTAEHFGTNMDAMIEQYGSFRASLTEGGWLTTEVLTETLTQLSGAYSEADLIAQGYTEEQAKEITKLAQTALDAATKVKTFTQLWDTLKEFVQSGWTQSWEIIIGDFEEAKELLTEVSNALGNMVNASAEARNKMLQDWKDLGGRTALIEAVRNAFEGVLSIIKPVKEAFREVFPPMTGEQLYNLTVGLQELTEKFKMGEETANNLKRTFKGVFALFDIGLQGVKALVGGFADLIGYVAPAGDGILGFTASVGDFIVGIDEAIKSSDAFNKAIEGIGNFLKPIADGIKTFVKTVADAFSEFANVDTSGLDNFADKVQTRFEPFVKLGELVKKAFEGIIGIVEKVAPVLSKLGSIVANAFGNLGEAILTAFDTASFDPILDLINTGLFSAILIGVKKFIDSLSEITENGGGILGSFKDILDGVKGSLEAWQSSLKAGTLLKIAGAMAILTAAIVALSLVDSEKLNASLGALSVLFVELLGSMAIFEKIMNGAAIKGMGQLTIAMIGMSTAVLILAGAVQKLSGLDWDELLKGLVGVAGLSAILVASATALSKTSKGLIKGSAGLVVFAAAIRVLVGAVEDLGALDVGSLAKGLIGVGVLCTELALFLKATDLDGMGVLKGTGLVLLAASINILADAVKAFGDLDTSNLIQGLSAVAVVLTELAVFTKVTANAKHVVSTATAMTILGAAMLVFGEAVKKMGNLTWGEIGRGLTTMAGSLAAVTVAMNLLPKGMVSKATGMVEVGAALLIIGEAVRNMGGMSWDEIARGLVTLAGSMTILVVALNAMKTALPGAAAVLTVSAALAIFTPVLKSLGNMSWESIAKGLVALAGSFTVLGVAGVALGPLTPAILGLSAAIAVLGVGCLAAGAGILAFSTGLSALAVSGAAGAASLVVAVSSILSLIPLLFESIGEGILSLAGVIANGGPAIAEAFTALVLAAVDALVTAVPAVVDGLFVLIDSVLSALVEHTPTIVEQLFDILIGIIQAITTKLPELIKAGVELLMAFFDGVIDALSGIDVNVLIKGIAGIGLLSAIMLALSAVASLVPGAMVGVLGMGAVIAELALVLAAVGALAQIPGLEWLIGEGGNLLQGIGTAIGQFVGGIVGGFMSGVSSQFPQIGSDLSAFMTNVQPFIEGATQLDPSMLDGVKALAETILILTAADILNGLTSWITGGSSLSDFATQLVPFGEAMRDFSIAIAGMDGELVANAATAGRTLAEMAATLPNSGGVIGFFTGENDMSAFGAQLIPFGEAMMGFANAVRGLDVDTVTNAATAGKAMAEMATTIPNSGGVVGFFVGENDMDAFGEQLVPFGEAMMLFSQAVRGLDANVIVESATAGKALIELANTVPNSGGVVGFFTGENDMDTFGEKLVPFGRAMKSYSDAIAGIDVEAVTNSATAGKAVVELANTLPNTGGLVSWFTGDNDIAAFGTSLVSFGKSFAQYSDYMKDVDANIVTTTTNAATSIVELQKSLPKEGGWFSDDMTLASFGSDMASFGSHFSNYYNSISGIDTTLLSGVITQTNRLVGMANGMVGLDTSGMTSFSSALTTLGETGVTGFINAFNNAESKVTAAASSMLSSFINGANAKKSELTTTFTTLVQTVLTAINGKQGEFQTSGSTLMVKFIAGVRSQDSSSRTTFTNIVSGCLTVIRNKYGEFTSTGTQTMVKLIAGVRSQDSSARLAFTNIISACLTVIKNKYAEFTSTGRECMVKFIAGVRSKDSELRTAFTTTLSGSVTAIKDYYSQFKSAGSYLVDGFCDGISENTWKAEAKARAMAAAAAEAAEDELDEHSPSKRFYGIGNFAGVGFINALIDNVSKAGKAGREIARSSIDGLNDIISRIADYVDADMDVQPTIRPVLDLSAVEAGSGRLNTLFSRNQALSVSTGMNDRVSEIEVQNGESSPTGNTYQFTQNNYSPKALSRIDIYRQTKNQFSAMKGLVGNT >NZ_CP040506|1137171:1161820|1137171_1138242_-|WP_100932830.1|DBSCAN-SWA MELIVLDTFLKMLSVLDTFESLIWTERYSAYGDFEVYTSINDSVLEILKDDYYLWLKESDQTMIVEDRKIESDAENGNHFTVTGRSLESILERRIIWKQTILSGNFQNGIKKLLDENIINPSDASRKVEGLIFEASTDPAITGLTVDAQFTGDNLYDAIKKLCDSKNVGFRIKLSDDNKFVFKLYAGADRSYDQFTNPYVIFSPKFENVINTNYLESKKTLKTVTLVAGEGEGADRRTTTVACASGAGTGLNRRELYTDARDVSSTVDNETLTDTEYNAQLSQRGLENLAENIATKSFEGKVESTRMYRYGEDFFLGDMVQIVNEYGIEGKARVTEFIRSQSKEGLDSYPTFVTVE >NZ_CP040506|1137171:1161820|1156966_1157380_-|WP_034859382.1|DBSCAN-SWA MIRTYSELSKLKTFKERYEYLRLGGVIGADTFGFDRYLNQIFYRSMEWKSVRDFVIVRDNGCDLGIEGREICGKILIHHMNPISVEDILKRSDFLLNPEFLISTILTTHNAIHYGDESLLVTEPIVRSRNDTCPWKH >NZ_CP040506|1137171:1161820|1161529_1161820_-|WP_006781008.1|DBSCAN-SWA MLESEFQNKLIQELKRMFKGCIVTKLDSSHIQGIPDLLILYNDKWATLECKKSVRAKKQPNQEYYVGRMNEMSFSRFICPENKEEVLHDLQQAFCS >NZ_CP040506|1137171:1161820|1149514_1149829_-|WP_006780995.1|DBSCAN-SWA MAKFFGKIGYAISKDVRPGVWDGEITEREYFGDLIRNTSRYQTSDKLNDDINISNEISIVADPFAYQNFHAMRYVEFMGAKWKISSVEVQYPRLILTVGGVYND >NZ_CP040506|1137171:1161820|1154327_1156073_-|WP_006781000.1|terminase|DBSCAN-SWA MALSNTAVPKYYGMFRDAVIRGEIPVCKEISMEMNRIDDLIANPGIYYDDQAVEGWIAYCESELTLTDGSDLNLLDSFKLWGEQLYGWYYFVERSVWEPSSDGHGGRYVNKRIKQRLTKKQYLIVGRGAAKSLYDTCVHSYGLNIDTSTTHQVTTAPTMKQADEVMSPFRTAITRSRGPLFRFLTEGSLQNTTGSKAKRMKLASTKKGIENFLTGSLLEVRPMSIAKLQGLRPKISTVDEWLSGDTREDVVGALEQGASKLDDYIIVATSSEGTVRNGAGDTIKMELMDILKGEYINPHVSIWWYKLDSIDEVGNPDMWLKANPNIGKTVSYETYQLDVERAEKSPAARNDILAKRFGLPMEGYTYYFTYEETLPHKKRSYWQMPCSLGIDLSQGDDFCAFTFLFPLSNGSFGVKTRNYISSSTLMKLPAAMRIKYDQFMDEGSLIVLEGTVLDMMEVYEDLDNHITEFGYDVRCLGYDPYNAKEFIERWSSENGPFGIEKVIQGAKTESVPLGELKKLSEERMLLFDEELMTFAMGNCIVMEDTNGNRKLLKKRYDAKIDAVAAMMDAFVAFKLNRDAFE >NZ_CP040506|1137171:1161820|1156413_1156944_-|WP_006781002.1|DBSCAN-SWA MESILTSIKKMLGITEEYEHFDSDLIIHINSVFMILTQLGVGPPSGFSVQDKSATWKEFISDETKLQLVKSYMQMKVKLLFDPPLSSAVMASMEKMIAEAEWRLNVAAETDEDKSEEHESYDGEYRVTPKAFQSQMLDTENKVLDRNIVVTEVPYYETGNAANGVTSYIAKEGDSK >NZ_CP040506|1137171:1161820|1158249_1160997_-|WP_006781006.1|DBSCAN-SWA MNPIAEEILMHYGMPRRSGRYPWGSGDNPYQHSGDFLSRVDELKSQGMSDTEIAKAMGLTTTQYRTQKSLAKDERRALDVARAKSLREDGLSLNEIAKEMGFANDSSVRSLLNENSEVRMNQAKTTAEIIKKQIDEKGMIDVGAGVERELGISKEKLNEALYMLEMEGYPVYGGRVDQVTNPGKKTTLRVICPPGTEHKEIYDFENINSLKDYVSHDDGESFDPKFVYPKSMDSKRLQIRYSEDGGELKDGVVEIRRGVDDLSLGESHYAQVRILVDKTHYIKGMAVYSDDLPDGVDVMFNTNKKKGTPKMDVLKPIKDDPDNPFGSLIKEGVNDPDNPTTERGGQSYYYDKNGKKQLSLINKRAEEGDWGEWADKLPSQFLSKQSRTLIKKQLNLAAADKQSEFDEICSLTNPTVKKVLLKSFADDCDAAAVHLQAAALPRQKYQVILPLTSIKDNEVYAPNYKNGETVALVRYPHGGTFEIPILTVNNKQPEGRRVLGNTPADAIGINKKVADRLSGADFDGDTVMVIPCNSSNSRVKITSTPQLKGLEGFDPKMSYGTVKKGDDYCNSGGQKIKIMKNTQTEMGKISNLITDMTLKGATQDELARAVRHSMVVIDAEKHKLDYKKSEQDNGITALKKKYQAHDDDDGYGGASTLISRAKSETSVLKRKGSPIIDKETGEQSWKSVREEYIDKNGKTQVRTQKSTKMAETRDARSLSSGTPQEEAYADYANTMKSLANQARREMVNSGKIAYSASAKQTYQTEVDSLMAKLNVALKNAPRERQAQTMANSIVSAKKKDNPDMTKAEIKKANQQALTAARTAVEAKRTPVEITDREWEAIQAGAISENKLTQILNNTNIDTVRQRATPRATTTLSSAKVNRIAALNASGYSTAEIAAALGVSSSTVSKYLNGKE >NZ_CP040506|1137171:1161820|1157988_1158240_-|WP_006781005.1|DBSCAN-SWA MAKKCMLTTIDNPFDPFEQFTSWLLFDEEKGYHSCSYLGRIARTSDQLSDEENDLEVERAIDEIVKYDFRNIYKKVTRDAVAV >NZ_CP040506|1137171:1161820|1156084_1156417_-|WP_006781001.1|DBSCAN-SWA MSNEALLQHHGILGMKWGVRRTPEQLARASGKKTSSDDEVKKMSDSELRSKINRLQMEKQYKQLTSSEISVGRKFVQDVLTNAAKQTATNYASKYMTKGIDAVIKKATSK >NZ_CP040506|1137171:1161820|1148469_1149177_-|WP_006780993.1|DBSCAN-SWA MSKLIWDKVGERLYETGVDHGVLYPIQTGGQYNKGVAWNGLSAVTESPSGAEPSPIYADNIKYLNLMSAEDFGGTIEAYTSPNEFAECDGSVEVAPGVFAGQQSRKQFGLSYRTILGNDVDSNDYGYKLHLVYNCLASVSEKGYTTVNDSPEAIALSWEFSTTPAEIAKIIDGKKLKPTAILTLDSTKIDAKKLAALEEILYGKDPTTPEGNDGVDPRLPFPDEVIELLAAEDLP >NZ_CP040506|1137171:1161820|1149844_1150054_-|WP_006780996.1|DBSCAN-SWA MKVDADSEDAVATVELVGGTKGPVTLDDDMNIVLLIKNKDTQSIKVTVDNGENSTTKTYGLIGLTLETE >NZ_CP040506|1137171:1161820|1145664_1146042_-|WP_006780988.1|DBSCAN-SWA MISFRQKGDFSKLTRFLERAKEAVHIGDLDKFGKEGVAALASATPVDSGETANSWYYEIENRKGSVTISFHNSNVQNGVPIAVILQYGHGTRNGGWVQGRDYINPAIQPIFDEIANNAWKEVTKL >NZ_CP040506|1137171:1161820|1147621_1148023_-|WP_006780991.1|DBSCAN-SWA MLKKTIPYIDLNGVERKEDFYFHLSKPEIVKMQTSVKGGYDVQLKSIGAGADGGQIMEFFEDLIKKAYGVKSEDGRRFMKSEEISRSFMESPAYEILFEELVTNDKTAADFVNAVMNIGNSATAPAIAANTQN >NZ_CP040506|1137171:1161820|1149195_1149522_-|WP_006780994.1|DBSCAN-SWA MTDRRILFHKLLCEILSCPIEGEQCRCYFQPPESIKMNYPAIVYSLDDVDKTYANDGVYLSNRRYAVTVIDKDPDTSLVQKVTNLPMSRFDRHFKKDNLNHYIFNVYF >NZ_CP040506|1137171:1161820|1157372_1157711_-|WP_006781004.1|DBSCAN-SWA MARKVKSSESTGSSKKIRPALTPEARELQMISLAVDLAERQLLEGTASSQVITHYLKLGSSREKLERERLEEENNLLRAKVRAIDSTDEIKDLYKDAINAFRIYSGQGNDDD >NZ_CP040506|1137171:1161820|1152139_1152970_-|WP_006780998.1|DBSCAN-SWA MDINEPLQHYASPYYDPVKAHEYYMRTRELKGRRSTMKLNDEGKKVWAYTKNEISGEKKEKVKEEQEKRKQKIAELRAKAKATREQISAKLKELNTQLTEESSSRRSRVDSRKKSDLEDIGEEVEDQKERIDEKKNTEIERLMAIEIPSGLSKEERAKRVAERNEKITKLRDDASEDKSKVSEQAKAEKEEVRTSASLKKKRITEDTKEERADNSANAKSEREKVSTELKAAVTAAREAYKAAKENLDATYEDLYQQEFDKIASEYKAVKKRKRRK >NZ_CP040506|1137171:1161820|1147064_1147607_-|WP_006780990.1|DBSCAN-SWA MLRITIPSTEFWDEVKQEFVYTKAQTLQLEHSLVSLSKWESRWNKPFLTKQEKTLEETIDYVKCMTLTQNVNPEVYNYLTNSNINEVNRYIALPMTATRFFEEKKTQGSREQITAELVYYWMIALNIPFECQKWHLNKLFTLIRVCDVKSRPPKKHSRREIMKRNAALNAARKKKWNTKG |
24 | Arthrobacter_phage(27.27%) | portal,terminase,tail,head,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1166677 : 1185253
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP040506|1166677:1185253|DBSCAN-SWA CCTATTCCGATTCATACTGAAGCTCCTCCTCGCAACGCAGACAATCATTATTAGCGGCTCCGAAACAGCCATGACAATGCTTCCTCATATCATACTCAGCCTTGGCTATTTCAACAAATTCGTCATTTTCTTCTTTAAAGAAGCGATTAAGCTTCACTGTACAGTCTTCTGGTGTAATTGTATAGAAAATTCCAACTGTGTCGTAATCTCCATTTTCCGGATCAACTAAAAAATTTTCGCAATAGACCTTAAATGGTTTACTTGCCGGAAAATATGGCATGGTAATAGGGAACTTCTCTTCCATTACCCGATCGATCAATCCACTATGATAGGAATCATATGGATTGTTCATATTGATTCCACAGTATCTATCTACATCGTGATACTTTACGGTTCCATCAGAATATACATATTTAAAAAGAGAACTCATTCGCTTACACTGATAGTTTACAATTTCACCGCGATATCCCGAAAGGTCCGTAATATCACTCCAAACATCCTCTGTATCCTCGATCGATGTAAGTGGCTTTCCGTCAATTAGGCGATTCAGAATATATTTAGTCATACCGATACTGAAACCGCTATGGCCGTCCTCGGACAGACTGTGATAGGCTTTCAGAGCACTTTCGTAGCAGGCGCAACCATAATCCCACTCCCCTTCTTTTGTTCCGGAGGCTTTTCTTTCGTTCTCACAAGCAATCTCTACCTCTTTTTCAGCCCATGTCTGCGTATTGGATTTTTCCCGGCAGGATGACAAAGGAATATTCCGGTCATCTATATACTCATTTGCAAAAATCTTTCTGGTATCAGAACCAAAGTTCTCGATGATTTCCGGAAGATTCTCATTGACCGCATCAAACACCAGATTTCTCTCCTTACACCACTCAACAGCTTTTTGAAGCATGTCTTCCACGCGGCAGGTCCACAGAATCAACTTGTCTCCATCCTTTTTCCGATTACGAAGATACTCTATCAGTTCTTCGTTCGGCGCTCCAATCCCTGGCCAGTTGTTCTCACATAAAGTTCCATCGAAATCTACTGCAATAATTTTCACTGATTTAAGATTCATACGTTTTTTCTCCTTTCTTATAGCGGACTCATCCCTTATGCAACTACTATATCCACCTTGTGGTCGCCCTCATTATTGCTCGACCCTCTCCCAAATTCTCCAATTACTCAAGCCATTTGTTGTCGATATAATAGAAACTGTAGACACAGACTCCAATTAAGACCATCCAAATAATCCAGAATATCCACATTGCAAAATCGGTTTCTAAGTATTCTACGGTTTCATCAATCTTCATATTTTCATAAAATGGTGAGTTATCGGCTATTGTTTTATCAGCCAGTTCTGTAAATATGGTTCCGGTAAAGTTTAAACCAACTCCATAATACTTATAACGAATGTGGCTGGATTCTTTAACAGTGTCTATATACTCCTTTCCCGGCAGGTCGATTTTACTAACTGAAAAAACGTAATTTAAAAATGATATTTCATCACATATCTGTTCCTCACTACCAGCATAATCCCATGTCCAATAGGTTTCCGTTGTATAGTAAGTATGGGTTTTTCCATTCGTAGTTGTTGTATGTGCAACCTGGCGAGTATGCATTGTATATCGTTCTTTCACTTTTTCAATATAAATGTATTCTCCACCAATTCCAGGATATGTAACTGTATCGACCGCTTTCAAATCGCCATATACAAAAGCGTTCCCGGCATTAGTCCTCATTCCGTACTCAAACAGTTCCTGAGATTCTATTTTTATCGCCTTGTTGTATTTTTCATTTTTATCCAATTGATAATCCGAAATTTTTCCAGCAATAAGGACTCCAATCAGAAGCATAATAGCGACAATCGATACACTTACCAGTATTTCCCGTTTCGTAATCTCGAAATTCCCGAAATCAAAACTTCTACGCTTGCGTCCTCTCATAGCTTTTACTCTCCAAACAGATTCTGCGGAGCATCGGCAGGAGCACTATAGTCAAGATAGGTATATCCCCGAACCTCATATCCGAGAATATTGAGAAAAAATCTTGTCGGGAATTTGCGTATATATCGGTTGTATTCTTTTACCTGCTTATTGTAGTTACTCCGATATTCTGCAATTAAATTTTCTGTAATTGAAAGTTCATTTATTAACTCTTTATAATTTTCATTAGATTTTAATTCTGGATACGCTTCTGACACCGCCGTGATAGCTGTAGTAACGTTTTCGATATCACCGGTAGAGCCTCTATTTTCAACAATCGCCGTCAATGTTTCCGCTTCATGCTTATCGTACTGCTTAACACAATTAGCAAGGTTGTAAACCAAATCTACACGTCGTTTTTCCTGAACTTTGATGTCAGAATCAGCAGTATTAACTTGCTCTTCCAAGGTAAATGCTCTGTTCTGGGAACTCTGTATACAAAACACACCTAACAAAATAATAGCAATAATACCGACGGCTACAATTAAAGCTACTTTCCAGTTGTTTTTAATTTTTCTCATGATTTTTTCTCCTTCTAACTCTTCTGAATTTATCTACACTCTTTATCACACCCGTATTCTTATTGATAATGCGATAATAGAACTCTGTCTCGTCAACCAACATCCAATCTTTACAGTTTAAATAATGAGCAAATAAGCATTCTTTTTGCTCTCTGGTCAATTTTTTCGGTTGCTTCATGTGGTTTTCTCCTCGCTGTAATTGATTTTCTTAGATTTGTGGAATACCAGCTTTTGATAAATTTCTTTGGCTTCTTCCCCTTGATAGGCATTGATAATTTCCACCCTTCCTTTTTGCTGCCTTCCAACAATCAGAACGCCAGCGTCTTTCCCATGAGAAAAATCCCAACTCACAATAACACTATCTGTTGATTTCATTCACTATCACCTCCCCAAGCTTATTTTTGATACGTCCTAAGATATCCTCCACCAATCTTCTCGTATTCGGATGCAGTTTTATATAGCTTTTCCGCTCCTCATACCAAGAGAAGATCTCCAGCAGGTTCCCTTTAAACCAGCTAAAGGACCACCAATCACAAATCATCTCCAGAACATAACAGTAAGGCATTTCCAAAATAATTTCTCCTTCTTCGGGAGCGTCATTGATAAGCACCCAATACTGCCAATGGTGAGGGTTTCGATGAATATGTAACAGCCAAGCTTTTCTGAAATCCTCAACTACTCCATAAGAGCGATTGCCTCCATAAAAATAAATATCATAGGGGCCATACTCATCGGGCTCCGTTTTGGATTGGTCATGTGCAAATACGATATTGTGTTCCGCACCACTGCCCTCTGTAATTTCAGGAAGATTTTTCTGCAACCAACGAAATCCGGCTTCAACGTTGGATTTATGTTGCGCCAAATATCGGTCATATTGATAGCTCATTTTTTCTTTTCCTCCCACTTCATAGGTTTCTGGGAATTGAGATTGTATCCATAATCCAAGCATTCATTACACGGGTCGAATTTTTCTCCCAATTCCTTGTGTTTACAGGTTTTACAATACTTTTTAAAATCCACTTCCAAATACTCTTCATTCATGGTTAATCTCCTTTCACCACTTCACAAACCTCGATTCATTAAAATCTCTCTTATCTTTCAGTGCCTTACTAATTGCCAAATCAATCCCACTACGAGATTTCAAGTGATAGTAATACAAATCTTTGAATGGCGTATTCAATCTGTCTATTCGCCCCGCAGATTGCTGCATGATTTTGTAAGAATAGTTCTGTGAGTAGAATATAATGGTATCTGTCTTGATGCAGTTCCATCCTTCCGCTCCAGCATTGTACTGAACAAGATACACCCAGTTTTTCGACTCTGGGATTGGCTGGTGTTTGTGACCATTCCACTCTGCAATTTCAAAGACTCCATCGTCTTCATAAATTCGAAACAGCCCGTTCAAAAGCTCCAGCTCATAATCAAAGTTGTAAAATATAATGGCTCTGGGATGCTTCTCTACAATTTCCATCAAGGCGATCTGTCGTGACTCGTCCGTATTCACAATTTTCCGCCATACATAGCAAAGACCGGCGGCATTCGCAATCGGCTCATTCTTATACGGGTCCCATCGAGTTCGTCCAACATCTTTGTATCGCTCTATGCTGTATCGAATAAATATATCTTCGTGATGAGAAACCGTCTGACGCTTGAAATCCATATTCACCAAAATACGATTTCTGAGTCGAATCAATCTCCCAATATTCAAATATCGGTCAATCTTTGGGTATTTACTGAATCGACTATAAACCACATGTTCTCGGATGAATTCTGTCCGGTTTTTGTAAAACCCATTTGCGATGAATACCGGAATATAATCCTGCCAGGTATCTCCCGGAGTTGCAGACAATAGAATCCACTGATTTGATTTGGTGATTTTCAAAAACGCCTTCACCCAAGCTCCGGAGCCTATCACTCTCTGCTCGTCAAATATAAAGAAAGCATCCTTTACATCCTCGTACTTCTTGATATTATTCCAGGAATCCACGACAATCTGATTTGAATATAGATTGACATCCTCGTGAACCGAAAGAAGGAAAGGCGAAAGATCACCCTCCCATTCCATCGTGTCCCGCTTTCTGGCTGTTGTGATGATGTATAAGTCCTTTGGGGGATCGTCCATCGCAACATAATCCTCAACCCCCATCAAGCAATCCGGATTTCCGCCATTCTGGAGATAGTAATAAGCTAATGCTGTTCTTGATTTTCCGCTTCCAACGCCGCCACACAGAATACAACCATTTTTCATTTTTTTTACTGCTGCTATTTGATAGTCATACAATTCAACGGCCATTCCGTTGCTCCTCAAGAACCGCTGTTATGCTGTTTTTCAGGTTCGCCATATCCGAGTACATCTCGTTTTCATTCTTCGTGCAGTCATCTTCTATGGGAGCCATATTTAACAGACTATCCAGCTCTTTTTCTAATGCTTTCAATCGTTCATTCACGCTTATCCCTCCTCATAAACTTTCATATACTGTGAAATAATCTTCTCATAGTCCACACATCTGAAAAATATATAGGTGTAAACCAGTAATTCTTCAAATCATCAGCCATAGTCATTGGTTTGATCAAAGAATTTCCAACTTTAAAATATCCAGCAATTCCTAGAAGAGAAAGTTGAATATAGCACATCAAAGCTGGAACTTCTTCAATGTCTTGTCCGGAAACTAATAAATGGTTTTGATAATTATAATTTTCTTTTTCTAATTGCTTTCGTGCTTCGTGAATAGCAGCAATCAAGTTGGCTCCCGCTCCGCAACATGGATCGTTGATGGTAATATAGCCATCTTTCCTTACTTTCTCAGTTACATCCGTAATGGTGATTTTTGCCAGAAGCCGACACATATCATAAGGAGTAAATATTTGTTTCAACTCATCGTAGCCAAGATTTAAGCTCATATACATCTTTCCTAAAAAATCCTGCTCCGGATTCATTTCTAAAGACATGACCAGATTTGCAAACAACTGTGGAAAAAGTTCCTGTTTTGACTTACTGTAATGGCGAATGATATCCAAGTATCGTTTCTCTCGCTCATCAAATTGAGATTTGTCCACAGAATTTGATATAGAACAAGCCGACATAACAATAAAGTCTTTCCAAATATCCCACGGTCTGTTTTTTTCTGAAACAAGTTGACGAAAGATGTTCATAAATTCTTTCTCATATCTGTCGGGTTTGAAAGTGGGAAGATTTTTCATTTTCGGCTTCGGTGATAAGCTTACTTGATGTTCCTCTTTATGAACTACAAAGTCGTGCTTCTCTGATATTTTCTTTGGTTTTTCGACCTTGGAAATAGAAGCATTGATCTTCGGTTTCGCTGTGGTTCGCTTCTTTTTCCGCTTCCAAAATGCCATAGCTTTTCTCCTTTCGCAAAATAAAGGGCTGTTTCCTCTAGCCTTAGGACATTTACCTTGCTGGCAATATCAGGCACCCTATTTGTCGCTTGTTAGTGGAATGGAACTTCTTCCGGGCCTTCTTCCTCCGCATACTTTTCAGCAAACTCATCCTCTTCGATGGTGACATACATCGTCTTCAGATAAGCCTTAATTCCGGTCTTACCATTCACTTCCCAAGAATACGGTCGAATCGTCAAATCAACATTCCGAATCTCCGCATAGTCCAAAGTGGAAATGGATTCATCATCCAACGGTGTTTTTGTCTTTCTGGTAATCATATACACCTTAGGCGGGATATTCTCGAAGCTGACTGCCACCTGAATATAATGTCTCGGCTCTTCATCCTCGTCTCTCGGAGCCAGCACTCTTACATTCCATCCATCATTGGAGAGTTTCTCCGCCTGTTCCGGATCTTCGATGATGACACAGAAGTTCCGGTTTCCAGCCCGATTGTATTTAGACTCTTCGCCTCGGAAGTTTCGAAAAATAATTCTTGCGTTTTCAATAATGATATTGGGTACATTTTTGTAAGCCATAATATACTTCTCCTCTTCTTTAATTAAATGGTATTTCCTCATCAGCGTCTTCTGGAATGTTCATAAAGTCCTCGAGTTTAGGTTTTGGAATATAAGGATCATCGGATATGAACCATTCGAAGTCACCGTATTTGGATATGGTTTCCACTGCGTCGTCTACAAGCTTGTCATAATAGGAACGATCAATAGAATCCTCTTTGGAGAGTTCTTTAACCATTTCGGACTCTAACCACCGATAGCCCTTTGACCCAGTGGCAGCATAATATCGTCCGTCCTTTTCTCTCATGAGCAATCCGCCACCAGCCCCCGATTTAATCGGACAGAACTGACCAACTCGTCCAATGAAAATATAATTGTGAGTCGCCTCGTCTTTTTTACAGAGCTCGCTGTATCTTGCTCCTATTTCTTCGTATGTATATCCGTATTTAGATGCGACATCTTCAAGAGATTTCCCTCCAGCCTGACTATTCCAAGCCTTGTCGATGGCATTTAATTCCCTTTCTTCCTCGGTTGTAAGCTGCGGAAGTTTCTCGTTCATGTCTAAATATAAAGCGCTGCTTACCGACTTGGTTTCACACATATCTTTAAAGACAATTTCTTCGCCGCTGAAAAGTTTTTTGAAGACATAGGGAATCTGGAACTGAGTTCCTGTGGCTGTCCATTTTCCGTCTTTATACTTGGCAATATAGACAGCGTCATTTACCAGGCACATCCGGTCATATGTAGCCTCGTGCTCAAAGGTATAACCATATCGCTTTCCGTAATCCATAACAAACTTGATAATCTCCGGCGTCGCATCGGGAATCTTGATAGAATCTGTCTTAATGTGAGCAACAGTAAAGCCCCGTTCCTGTACCTCATGCTTGAGGTTAATCATGAACAGAGCTCCTCGTTTGGCTACAATATTATCTTTGTTTCTCGGATCACGGAACGGATTCTCGAAATTAGCAGAAGTTAGACCATATACCGAGTTGATTGCCGTCTTCAAAGCATTCGCTAAATCCTTTGCTGTCATCTCTCCGTCAATGACCTTCTGGATATATGGTGTCAACTTCCCATCCAGCATATGATTGACTTCGTCCCAAGCTTCGTGTTTGATGCTGACTCGTCCTTCCACAATGTCACGGAAGGCTCTCGTAAATTTCACACCGAACAGAACTTCTGCAATTGCACTATGAGGATGCATAGAGGAAATATCCAGCAATGCCACATTTCCGTACATACCAGGTTCTGCATAGACATAACCGCCTTCTCCAACTTCTTCTCCTCGATATGTCGATTTTCCATTCTCATACTTGTATCTAGGAAAATATGGTAAGAGACTTCCTTCATCGCCATGCGTTTGCGCCATCATTTCAGGACACGCTTCGGCCAAGAAGGAATAGGTTTCTTCATCAAGGTGATGCACTGGCTCTGCCAGATTTCGGTAATTGAACTGATCCTGTGGTTTCCGCTCATTTCCAAATATAATTTTCTGTGTAAGCGTATTGGTCGTATCGTTCACCGTCATTCCAGCCAAATCTGCCAGAATCTGTCGTGCTGTCCAGTCAGCTTTCAGATAATGGAATGCCGCTTCGGTTGCGATTACATCGTTATCACAATACTCGGCAACTTTCGTCCACATTTCTTCCGGAACCGGTTGATCCCACGGAAGTCCCAACTCCTGATGGTGGATACCCATCTCGATTTCCAATTTCTTCAGGCTCTTTTTATTTCCAGCAGATGCAAAGTCATACACATCTGTATAAGAAACATTGTAGGCTTCTCCAAAGAAACAATTGGGGCTGCCGCTGATGATTTTTTGCGAGAGGTTATAAAGTTGCTCGTTTGTATAACCCATGAGTCTTGCATACAGAATATGATTATCATATCGCCGACAGTTAAACCCAACCAACCGGAATCGCATCAATTCCTCAATCTCCGTTGGAGTCGGGTTAATCATACGCACCACGGGCTTTCCTTCGCCCTCGATTTTCCAGTTTACAAGGAACATATTTGGAAATACCTCAATATCGTAAAATACCAACTTTGCTTCTTCGTTTCTCCCCGCTGTGGAAGGGTCTGCTGATTTAAACTGCATCTTATTAACCAGCTTGATACAGTATTCGGCCTGATGCGTACTATTCGCAGCAAATGCTAAAACCGCATTCCGCATGTCCGTCACGTCATAACTCAAATCGCTGGAATATGCGTCCTCCAGTATTTTGTAGATAAAGTCGATACTGGGCTTAGTACCCGGATGGATTTCTTTATTCAGATTTCTCTTGATTAGTGTTCTAAGCCCTTTCTCGCTTTTAATCGCTTCAAAATTTACCATTTTATCTTCTCCTTTCATCGGTAAACCAGAGCTAATCGTTGCGATGGGCAAATCATTACACTTTGTAAGTTTTCTTCGTAATGAGCTTTTACCCGTGAACACTTTCACTTCAATATGGTCGTCATAAATACGACTCAGTTTTTTCACATCTCCCGTATAAATATAATGAAGATGAACCCCCTTTCCGCTTTTACTTAGCTCTGCATAAGTCGCCGGCCACTTGCTTGCTTCTTCTACATTCCGTTCGAAGGATTTGTTCCCTTCCTTATCTGGAATATCAAAATCAATTACAATGTGGTTTTCCGGGACTTTAACATAGTGGATTTTAGAAGTATCCAAATCAGACAGCTTCGTTTTTACCTTGTCCCATTTCATAGAAGGTGTTTCTTTGTCCGTTGCATATTGTGCTGGACAATCGGAACACACTTTATCAAAAACGGACTTGACCGTATCAAATTGTAATGATGACGGTTTTTCCTCTTGCTTTTCCACAATCGTTTCCTCTTCGAATTTTTCAGTCCGGAATCCGATATAATAGCTTCTCACTCTTGAACCGTCTTCCATATTGAATCGCTCTTTGTAGTCATGGAAATAGTTCTTCAGTTCTTCTTTAAAAATTCTCTGAGAAAACGGATAGCCTACTTTTGCTTCATCACAATAGGTCTTATACATTTCCCAGGCAGCCTTTAAGGTTGTACCGTTTTCCCGTTTGAATACATGGTAGGAATCGATAATGAAATTATAGAAATCATTAGATGCTCCCAGCATTGCGATGGGGATATAGTCATCATATAGACCAGGATTGTTTAGATAGATTTCCTGACAGTGATATGCAATCGCTCCCAGTTCGAATTCAATCTGCTTCATAATCGCTTTGTATTCTCTCGGATTCAGTTTATTTCCCGAAGGAGATACGTCAATCAGCCGTCGAATCAAGCCAGATTTTGCATCTGTGATCTTAACTGGCTTGTTGGTACCCATGAACAAAAAGCATTTGAATCGGTTGGAGTAAGTCGATTTGAACTTCTCATTTACGGTCATCAGCTCATGGGACACCAAGCTGTTTAACCTGGTATTATCTTCAATTCTTGACAAATCACCATCATGCTGGATCGCAACAAGCGGATTACTCTTGAATGCCTCCAATGCAAAAGAATTGCTGGACGAACCCAAAGCTTTTGCGTCAAAAACAGAATAATACCCCTCAAAGAGCTGCTGAATAATATTGAGAACTGTGGATTTACCTGTTCCGGCAGCTCCATACAGAACCATAAATTTTTGCAGTTTTTTCGATTCTCCACATACCACAGAACCAATAGCCCATTCAATTTTCCGTCGCTCAGTCTCAGAGTACAGAGTAGACATCAATTTGTTATAAGCAGACAAATCGCCAGCTTCAAGCGGATATTTCAGCTTTTTACTGGCGTAATCTTTTTTATCGGTTTTTGTATTGGAGAATATCAATTTGTCATCCAACATGTGGAAAGAATCCCGCATTTGCTTCTGACAATATTTATGCCAGGAATCAATCATCCCGGATTCCGCATCCCACATGTGAAGAACTTTAATTTCAGAGTCAAAGCGCTGGCGGCTTTCTTCTGCGTATCTATCCAGTTCACGGTCAATGAGTTGCAAAGCATCTTGTTCGTCCGTAGACCATAAACCTCGTTCCTCAATCCAGATAGCGTAGAAGTCACCACCTCGAATCATCAGATCGGAGCTTTTCTTAATAATGAACTTCGGATAGATTTCAATTACACCACGCTTTGTACTACGTGTGGAAATCATCAAAAAGTCGATCATCTCATTTTTTACTCTCCTTTATCGCGCTTCATTTCCTCTATTGTCGATTCCAGTTTCTCAATCCTCTTTTTCTGCTCCACACGATCCAGCTCCAGAAGAACCAGATTAACCGTTATGATAAGAGCAATCGTGCTTAATTTCCGGTTATAATGGGCCTGTTTATTCAGAGATTTCCGAATGGACCGGATTGCCGCCTCCGAATTGCTGAGACTTCCGAAAATATAATTCATAACCTCACACATCTTACTTTTTTCCTCCCTTCATTCCATTCAGAAAACTGGTAATCGTCTCAAATTTCCAATCTTTCTGATTATGATAAGTGAATATAAATTCCTGACCATTTTTCTGGCGGATACGGATGCTGTTTCTTCCATTTGGAAACCACATATCAATCCGATCTCCAGAATAATCGGGAAAATAACTTTCAAACCACTTCATTACTTCGCTGTGGCTCATGGTAATCTCTCCTTCTTCTAAGTATTTTCGTCCAAGTACCAGCACATCTGATACCAGATTTCAACGGATCTCAAATCGTATCGGCTGTGATTTACAGTAAACAGTCCTCCATCGCCATTTCGGCTATACTTCCGATCCAGAAACCTCTGGACAATATCCTCAACATAATCCCTGTCGAACTTGGAATCGTTCATAGAACCCAATCCTAGATTGACAATCATATTCCAAAACCACTGTCCAGTTCGGTTTCCAATGTCTGGATCATCCATAATATGTTCTTCACACCGAATTGCTAGTGCAATCATCATTTCCAGCACGCTGCACATCCGATTGTCCAAATATACGGAAATCATAGAGCTGCTGTATCCGTTTTCATAACCAAACCGATACCTTAAATCCACTCCATCTTCCGCTCGATTTCCATCCATCGGAATACTGTAAGTAAATTCGATTCGATGCAGCTCTCTTAAAAGCTTTCGATACGATAATCTCTTTGAATATCTTCCATCAAATACAAGCTGATACATCCAGTTAAAATATGCATCATTAAGCTCGTTCTTTGTCATTGCTCCTCCATTCGATGTGGCATCGTTTTTACAACATCAGAATAGTTTCTCTGGTCAAGCAGGATTTCATAATCGCACTTTAACCGGTCATTTCGAACAAACACAGAGTCGTCCTCATATTCTCCGAAATGGTTCAGAGATTCCTTACCAACAATTTCATCCACATCGTCTACCTCTTCATCATTCTCATCAGCCAGAACTTCGTCTGCATAGTAAGTGAGGCTGATTTTTTCATACTCTTCAAATTCGCCAAATTCCTCTGGCGAGATGACATAAGGTTTTTCCACAAACGCCGCTCCTTTCTTTTCCTCAACCGTTTTGGAATAATCCGTATAGCCCTCTTTCTGAATGATAGATGCGTACTTTTTGAAATCCATGTCATCTTCATCCTTCGAAGTTCTATCTTCTGCTACTTTAAGGCCATCTCGAAAACCTTCCACAAAACTCTTTCCCGCTTCCTCTACGATTTTCCTTTCGGCATAAGCTGCTTTTACAGAGTCGATTTCTTCTTGAGCAATCAGCTCATATTTTCGTTTCAGCAGTTGCCATGTACATACAGAACCAATTCCCGCTCCAGCAATAAAAGCAAGGAAAGTCAATCCTTTACTGCTCATCCTCTTCCTCCTCATTCCTGATTGTCATTACGGTTATTGCCAAACCGCCAAAAAGGAAGGAGACACTCAACAGAATGCCTCCCATAATATGTCTTTTTCTCTTAGTGTCCAGAACGTAGTCCAGTACCGATATCACATTTTCCAGTCCGTCCATATCAGTGCTCCTTTCCCGTTGACAGAATTGCAATCCCACCAACGAAACAGATACCCGACATAGCCGCTAATGTATAAGACACAAATGCTAAAAGATTACGCATAATGATTCTCCTTTCTTCACTCATACTTTGAAAAATAATGATTTTCAACTTGAAACATTGGAACGCCATATGCGCTGTATTCTCCCGCTGTAAAGAATACAACATCATAATTTCTCCTTGACTCCAGTTCCTCGTAGACAAGGTCACAAATATCCTCGCGTACTTCACACCTGTCTACTCGGCCATTCCACATAGATGAAAATTGATTCGGCTGGTAAATCACCTCATATACTGTATCAGGAAAGTGTTCAGAATCTACCCGATTCAATATGGTATCGATAACAAGCCGCTTCCCTTCTTCGCATTCACCTTCTGCTTCAGCCATAGTAACCAGGGCGATTAACTCTACATCTTCTCTCGACATTTTTGGTATAGCTTCTTCTGATCTCCCCATTTTCTCAGTGGCAATTGGAATGGATTCCTCTTGCGAAACCGTAATAATCGGTTCTGTCTTTTCGACAATGCTTGCCCTAGGTATTGCTACAACGTCTTCCCCTTCTGAGCGGAATTCAGATACAAAGAATGAAGATGCTATCATGATACCGCACAATATCGGAACCGTTATTACTTTGATTAACCTGCGCATAAATTCCTCCCAAATAAAAAGCTATCCCTAAGAATTACAGTAACTCCTAGGGATAGTTATATTTTTTCACATCAAATCCCAGATGTTTCCATCGACATTGAAATCCAGAAGGATTGCCTGATCAAATCCATTGACATAATCCGAATAGCTCAGATTATCAGAATACAGGCCGAAGTCAATGTAATTATCGCCCTTGGAATTTTCCGGATCGTAAACCCATCCCACAATCTGACCAGCTTTTGTTCTCGGAAGTCCGAGCATCTCATAAACCTCATTCAGAAATACACGCTTCTTCGCTTTCAGCAGATCGTTCGCATAACGCTCCTGAGCTTTGATGAACATCAGATTATATTCATTATTGCTTTCCCAGTGAGGATTCAGAATGGAATTCCCATCTTCATCCTGCGTGTACTTTTCAAAGAATCTGGCATAACCGCTGATATCCGCCGGACTTACCACAAATCCGTTCTTCTTAACTTTCTTCTCTTTTCCAGTCTCCTCATCGATAATCGTTTCGTCAAACTTTTTGGCTTTGAGATTATATTTCAATTCACGGTCAACCTCTTCACCAAACCTCTCGATAACACGACTGCGATACTCTTTGAATCCCTTATCGATAGCCGCATAAGCTGCTCCCAGAGCCACATTTCTCTTGCGAAGAATGTTGTTAGATGCCAAAATACTGGTAATCGACAATGCGCCAAGTACAACCGAGGGTCCATACAGTTTAGCAAATTTTACTCCGGTCTGGACATAAATAAGAGTGAGATCCTTTTTTGAATCTTCAACGGAGTAAGACTCCCCGGCTTCGGTAACACCTGTCGCTGTCGCTGTATGTACCTTGTCGATATCGTTCTTAGTGTTTTCCACAATCTTGTCCACTTTTGTTGTCGCTTTACAAGCCATAACAGCACTTGTAACCACGCCAATAACGCCGGCCACGACGAGAATCTCCGGACTATGTTTTTTTAACTGGAAACCGGTCTTGCTAAGAAAACCATTCATGCTCTTTACAATCTCTGTTTTTTTCATGGTTATTTGTTCTCCTCTTCTACTTTTTCTGTTTTCTTTAAATGGTCAATCAGATGCTGCGTGTACCAAAGAATTTTCTCCAAATCCTGGATTCCGTTTTTCTTCTTCCAACGGCAGGCATATTTGATGATATTCGCGGTATCGGCAGCCTCAATTCCTTTTAAATCAAAGGTGAAAGCTTCAATCACATCAATAACTTCCATACCCGTTTCTGAAATATAATGATCCGGATGAGATACCATCCTGTCTTCTGACTCATACATCTTGAATCCCTCCTTTACAACGGCATTGGTTTAGGCAATTTTAAAATATAACCATCCCTTACTCGAACCGCCCTGCATCCAGCAATATCAGTCCATCCGTATTTATTGGCAGCATAATTGTCATTGGATACGTTTGCCAAATCATAAAGATCTGCAACACTAACTACCTCATACTGTGCAATAATTTCGTTCATGGCGTCTAATACCGATTCCGCATCTCCACGAGTTTCGAATAAAAGCTCATCATATTCGTAGCTCGTCCGGCTCTTCGGTGCTGTATAATCTTTCTTTCCGCTGTCGTAATACTTCTGATAGGATACCTTGGACGCTGTAGAGTTCTTTTTTGACTTCCCAGTTTCTCCATAGAGGATCATATCAATACCATTGGTAACTATATCGGAAATTGCCTTTTTTATTGCCGGCACCAGAACATCCATCACAATATAAGATTTTACGTTATTGACATCTTCAGAAATGAATACGTCTGCAAACTTCTGCATTTCTGATTTTTTCTTTGATTTTACCGTCCCAGAAATCACTTTCTCTACACGTTTTTCGGGAACAAGATTTTTCTGCTCCTCCTTTGATTTGTGGGAATTCGGCTTATATTCCTCCATTAAGTTGTCTCCTTTCCGCTCACCAAACTGATCTTTCCAGGCAATATAATCTTTGTACCCGGAAGTCGGTTGTTTTTCTTTTTAAACTGATAAGTAAGGTTTGACCTTGCTTTCTTTTCTGAAACTGCCCGTGTAGAAGCAATCCAGCGATTTGCAACACAATTGTCAAATTCCATAACTGGGCCATCATACGAATACATGTTCATAAATTTCACCTCCGGATAAAAGAAAAAAAGGGAAAGCACCTTGTTACAGGTACTCTCCCTCGTGTTGAAGCACATTTTTTCGTTTAGGCTTCTTCGGAATCCTCTTTTTCATTCTCAATGATTGGTTCTTCGGGTTCATCCCACTCAGCATCGATAATCTGCTGCTCCTTCTGAGCTTTGATTTTGGCAATCATCGGCTTACCCACATACCTGTAGATTACAACACCTGCAAGTACGGCTAAACCGATACCGGCCGCAACCTTAAACCCCTTACCAGAACTCGCTTTAACGATTTCCTCTGTGGCTGTCTCCATAACCTCTTCGTTGTTCATGATTTCATTGGTTTCCATGTTTATTCTCCTTTCAATTTTTGAAAATGTGTGGTTCTTCTTCCATTAAAGCCACTGTTTTTTTCGCGCGTCACATCAGCTCACTGAAATTGTATCTCGGAGCAATACTGTAATCAATCACCAGGCAGGGAGTTCCATCGCTGGCCAATTGAGAACTAAATGATAGATCAATATACCCATTATCAATATTCCATCCCAACTCATCGCCGATTTTGATATTATCCAGACCGACTTCGTAGTAGAAATCATTTAAGGATACATACATCTCATCGCGCATCTGACGATTTAATTCACATTCCGCTTTCTTAATTTTGTCGATATCGCCTTTAAAATATCTGCCAGAAATCGCGTCATAGCAGAGCGTATTTCCCTTTTCTGTAATGATTACCTCTCTTGTTACCACTGGATTTTTCTCGATTTTATCCTTTGCAACGGCATCTTTCACAGTTTCGTGCTTCTTCTCTCCGAACATCTTAACGACTTTTCCCTGATAATCCTTGAGAGCGGATTCCGATAAGGTATATGCCGTTGCAAGTGCTGCATTCCGTTTAGCGTTTACTGAGCTGGCTCCAATTAAACATGCAATGGAGAGGGTTCCCGTAATCGCTGCCGGAATATAACAGAACCAAGTTGTTTTTACCACATCTGCGACTTCAAGCTCCTCGGCTCCAATTTCCTCTTTTCTTTCCTCAATGAGAATCAGTGCTTTCGGCGTTGCCCGGACCGCCATGACAGTCGTTGTGATCATACCGGCAATACCAATTCCTGTAAGAATTTCAGGGCTGTGCTTTGCCATGGATGTTCGAACTGACGACAAAACTTTTGATATGTTCGATTTCTTCATTATCTCATTCCTCCAAATATCACACTGAATATCGTTTTAACCATGTCAAGAGCCTGATTCTCAGTAAAGCCGGCACGAACAAAGCTGTCCATAATTACTTTGGTTTCCGCTGTTGCTTTATCGTACTGCTTATATTTTTCTAACTTCTCAATTTCCTTTCTCAGAGTATCAATCTCATTTTCTTTATTCCAAATTTCCATCTTTAACGCTGCTTCTTTTGTAACCATTACATCATTCTGACAATAATGCAGAATGTCTTCTGAACCAGCATCAAAGCCCCAAGGATAACGGCCAGATCTACGGGGCACAGGGCCCCTGGATTCCGGCTTAACCAACCAGAATTCCGGACGAACCCCATGAGAGTCCGAAGCGGGGCCGCAGGCCGCATTGCCATCGTAGCTCACACCAGCGAAATGAGCCGAAGAAAACCCCTCTTTTGTAGCATTTCTCAGCCATCCCCACGTAAGCTGATCTTCAAAACAAGCAATTCGATTCTTGCACTCTTTCATCAAAGGCAGCTGTTCATCGCTATCTGGTTCCAGATTCTTGTTATCCCATTCGTCTTCGTGACCAACAATCTGTCCAACGGTTGGAATAGTAAGTCCGTAAATCTTATCACGCAACTCTTCTGGAAACGCCATAAACAGAACTGTATCCATCCACTTCTTCAGATCGGACTTTTCAAACCCGCCTTTGTTTGTAGAGCAGTTATTCATCGGCCGGCAGGTAACATATTCGTCAAATATAAACATGACGCTCTCGTCCGTAACCTTGTGAGCGGTCGCGGTAAACTCTCCAAGCTCTGCCAGCGGAATGACCATCTGATCTCCTACCTGAATGTTTGCTGTTTCGATTTTCTGCTTTCTTAATGCCTTCATAATGTTTCTCCTTTCGAAAATAAAATTTGTGGTTATAAAATAAGACCGAGAAGTGTCTTGGCCGTATTTTCTGCAACTTGAAATATGTGGTTATTTGCTGGATTTTCCGACAGGTAAAGAAAAAGCTCCATTTTCAATATAAACCCTTCTATCACTAAATCGGCTTCTGTCATCGGATGATCCATAATGGCCAGTAGAATCTCGTCAACCGCCCACCGTTCATACGAACGTTCCATAATGGCTGATTTAGGCCAATTTTCTCCCGGTTCAAACAGATGCATATTCGTATAATTCATGATTTTTTGAATAGCCTCATCATTCATCAGCACTTGCTCCAAACTAAAAAGAAAGAGCCCTTGTTAGGACTCCTCTTCGTTTTCATCATCTTTTTTGGCAAGTGCCTCGTTTACCTTTTCTTCAATTTTCTCATCCATTTTCTTTTCGTTTACCCAATCGGTAAGAATACTTACTCCAAATCCGATCACCGTAACTGCAATCCCAATGGCTTTGATAAAATTTTTGTTTTTCATAAAGCATTAGCCTCCTTTTCATAATACGGTCTGTAATTTTTGCGAATCATTCAAATTTGTTGACCGCCATCGTGTCTATAATGATGCACTCCAAGCCATCTTCCAACGTTGATTTGTAATTATCAAAATCTAACCAATAGCAATCCATTTCTTCTATCATATAGCTGATATCCCATCCGAGTTCATCGCCTCCGTCTATGCCTTCGACACCTAAAAATGACAAGTATTCATTTAATGAACAATCGCCTTTAATGGAAAGATTCCGATTTACATGATATTGGGCGTTTAACACCGCTGCCATTGTGGTTCTAAAATACTTCTTTGAGGCAAGATCATAGAAAAGCAACCGTTCACTTTCAGAATCCATGTCCATGTTGTAAACCTGATAACCCCAGTCGTAGGAAGACACCATGGCATCTTTCGCCATTTCCGCATGGATTTTATCATCCGCATCCTCTCCGTAAACTGTCTTGGCTGACTTCCGATATTGCTTATAGGATTCATTGAGCATAACGTATGCACTCATCAAAGAAGCCTGTTTCTTTTGATTTAATGCATTCGCTCCAAAGATACAAGCAATAGTTGAAACTCCCAGCAGTACAGAAGGAATATAAGTCAGCCCCGCTACTCGGACGATTTCTAATTTAGTTAGATTTTCGCCCTTCTCCAGCTCTGCCTCTTTCAACAATTTTATTGCTTTAGGGGTTGCCTGAACAGCCGTAATGGTTGTTGCAATGACTCCAATAGAAGCTACTACCGTTAAAATTGTCGGAGATGAGCGATATAATTGACGCCCGACTCTTTTCAAGATTTTAACTTTTTGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP040506|1166677:1185253|1181016_1181619_-|WP_006781041.1|DBSCAN-SWA MEEYKPNSHKSKEEQKNLVPEKRVEKVISGTVKSKKKSEMQKFADVFISEDVNNVKSYIVMDVLVPAIKKAISDIVTNGIDMILYGETGKSKKNSTASKVSYQKYYDSGKKDYTAPKSRTSYEYDELLFETRGDAESVLDAMNEIIAQYEVVSVADLYDLANVSNDNYAANKYGWTDIAGCRAVRVRDGYILKLPKPMPL >NZ_CP040506|1166677:1185253|1169347_1169548_-|WP_006781024.1|DBSCAN-SWA MKSTDSVIVSWDFSHGKDAGVLIVGRQQKGRVEIINAYQGEEAKEIYQKLVFHKSKKINYSEEKTT >NZ_CP040506|1166677:1185253|1172622_1173108_-|WP_006781030.1|DBSCAN-SWA MAYKNVPNIIIENARIIFRNFRGEESKYNRAGNRNFCVIIEDPEQAEKLSNDGWNVRVLAPRDEDEEPRHYIQVAVSFENIPPKVYMITRKTKTPLDDESISTLDYAEIRNVDLTIRPYSWEVNGKTGIKAYLKTMYVTIEEDEFAEKYAEEEGPEEVPFH >NZ_CP040506|1166677:1185253|1167848_1168613_-|WP_006781021.1|DBSCAN-SWA MRGRKRRSFDFGNFEITKREILVSVSIVAIMLLIGVLIAGKISDYQLDKNEKYNKAIKIESQELFEYGMRTNAGNAFVYGDLKAVDTVTYPGIGGEYIYIEKVKERYTMHTRQVAHTTTTNGKTHTYYTTETYWTWDYAGSEEQICDEISFLNYVFSVSKIDLPGKEYIDTVKESSHIRYKYYGVGLNFTGTIFTELADKTIADNSPFYENMKIDETVEYLETDFAMWIFWIIWMVLIGVCVYSFYYIDNKWLE >NZ_CP040506|1166677:1185253|1177236_1177470_-|WP_006781032.1|DBSCAN-SWA MCEVMNYIFGSLSNSEAAIRSIRKSLNKQAHYNRKLSTIALIITVNLVLLELDRVEQKKRIEKLESTIEEMKRDKGE >NZ_CP040506|1166677:1185253|1170227_1171466_-|WP_006781027.1|DBSCAN-SWA MAVELYDYQIAAVKKMKNGCILCGGVGSGKSRTALAYYYLQNGGNPDCLMGVEDYVAMDDPPKDLYIITTARKRDTMEWEGDLSPFLLSVHEDVNLYSNQIVVDSWNNIKKYEDVKDAFFIFDEQRVIGSGAWVKAFLKITKSNQWILLSATPGDTWQDYIPVFIANGFYKNRTEFIREHVVYSRFSKYPKIDRYLNIGRLIRLRNRILVNMDFKRQTVSHHEDIFIRYSIERYKDVGRTRWDPYKNEPIANAAGLCYVWRKIVNTDESRQIALMEIVEKHPRAIIFYNFDYELELLNGLFRIYEDDGVFEIAEWNGHKHQPIPESKNWVYLVQYNAGAEGWNCIKTDTIIFYSQNYSYKIMQQSAGRIDRLNTPFKDLYYYHLKSRSGIDLAISKALKDKRDFNESRFVKW >NZ_CP040506|1166677:1185253|1179137_1179707_-|WP_006781038.1|DBSCAN-SWA MRRLIKVITVPILCGIMIASSFFVSEFRSEGEDVVAIPRASIVEKTEPIITVSQEESIPIATEKMGRSEEAIPKMSREDVELIALVTMAEAEGECEEGKRLVIDTILNRVDSEHFPDTVYEVIYQPNQFSSMWNGRVDRCEVREDICDLVYEELESRRNYDVVFFTAGEYSAYGVPMFQVENHYFSKYE >NZ_CP040506|1166677:1185253|1180741_1181002_-|WP_006781040.1|DBSCAN-SWA MYESEDRMVSHPDHYISETGMEVIDVIEAFTFDLKGIEAADTANIIKYACRWKKKNGIQDLEKILWYTQHLIDHLKKTEKVEEENK >NZ_CP040506|1166677:1185253|1177471_1177684_-|WP_006781033.1|DBSCAN-SWA MSHSEVMKWFESYFPDYSGDRIDMWFPNGRNSIRIRQKNGQEFIFTYHNQKDWKFETITSFLNGMKGGKK >NZ_CP040506|1166677:1185253|1177701_1178250_-|WP_006781034.1|DBSCAN-SWA MTKNELNDAYFNWMYQLVFDGRYSKRLSYRKLLRELHRIEFTYSIPMDGNRAEDGVDLRYRFGYENGYSSSMISVYLDNRMCSVLEMMIALAIRCEEHIMDDPDIGNRTGQWFWNMIVNLGLGSMNDSKFDRDYVEDIVQRFLDRKYSRNGDGGLFTVNHSRYDLRSVEIWYQMCWYLDENT >NZ_CP040506|1166677:1185253|1173127_1177228_-|WP_006781031.1|DBSCAN-SWA MIDFLMISTRSTKRGVIEIYPKFIIKKSSDLMIRGGDFYAIWIEERGLWSTDEQDALQLIDRELDRYAEESRQRFDSEIKVLHMWDAESGMIDSWHKYCQKQMRDSFHMLDDKLIFSNTKTDKKDYASKKLKYPLEAGDLSAYNKLMSTLYSETERRKIEWAIGSVVCGESKKLQKFMVLYGAAGTGKSTVLNIIQQLFEGYYSVFDAKALGSSSNSFALEAFKSNPLVAIQHDGDLSRIEDNTRLNSLVSHELMTVNEKFKSTYSNRFKCFLFMGTNKPVKITDAKSGLIRRLIDVSPSGNKLNPREYKAIMKQIEFELGAIAYHCQEIYLNNPGLYDDYIPIAMLGASNDFYNFIIDSYHVFKRENGTTLKAAWEMYKTYCDEAKVGYPFSQRIFKEELKNYFHDYKERFNMEDGSRVRSYYIGFRTEKFEEETIVEKQEEKPSSLQFDTVKSVFDKVCSDCPAQYATDKETPSMKWDKVKTKLSDLDTSKIHYVKVPENHIVIDFDIPDKEGNKSFERNVEEASKWPATYAELSKSGKGVHLHYIYTGDVKKLSRIYDDHIEVKVFTGKSSLRRKLTKCNDLPIATISSGLPMKGEDKMVNFEAIKSEKGLRTLIKRNLNKEIHPGTKPSIDFIYKILEDAYSSDLSYDVTDMRNAVLAFAANSTHQAEYCIKLVNKMQFKSADPSTAGRNEEAKLVFYDIEVFPNMFLVNWKIEGEGKPVVRMINPTPTEIEELMRFRLVGFNCRRYDNHILYARLMGYTNEQLYNLSQKIISGSPNCFFGEAYNVSYTDVYDFASAGNKKSLKKLEIEMGIHHQELGLPWDQPVPEEMWTKVAEYCDNDVIATEAAFHYLKADWTARQILADLAGMTVNDTTNTLTQKIIFGNERKPQDQFNYRNLAEPVHHLDEETYSFLAEACPEMMAQTHGDEGSLLPYFPRYKYENGKSTYRGEEVGEGGYVYAEPGMYGNVALLDISSMHPHSAIAEVLFGVKFTRAFRDIVEGRVSIKHEAWDEVNHMLDGKLTPYIQKVIDGEMTAKDLANALKTAINSVYGLTSANFENPFRDPRNKDNIVAKRGALFMINLKHEVQERGFTVAHIKTDSIKIPDATPEIIKFVMDYGKRYGYTFEHEATYDRMCLVNDAVYIAKYKDGKWTATGTQFQIPYVFKKLFSGEEIVFKDMCETKSVSSALYLDMNEKLPQLTTEEERELNAIDKAWNSQAGGKSLEDVASKYGYTYEEIGARYSELCKKDEATHNYIFIGRVGQFCPIKSGAGGGLLMREKDGRYYAATGSKGYRWLESEMVKELSKEDSIDRSYYDKLVDDAVETISKYGDFEWFISDDPYIPKPKLEDFMNIPEDADEEIPFN >NZ_CP040506|1166677:1185253|1181618_1181825_-|WP_007868844.1|DBSCAN-SWA MNMYSYDGPVMEFDNCVANRWIASTRAVSEKKARSNLTYQFKKKNNRLPGTKIILPGKISLVSGKETT >NZ_CP040506|1166677:1185253|1179773_1180739_-|WP_100932823.1|DBSCAN-SWA MKKTEIVKSMNGFLSKTGFQLKKHSPEILVVAGVIGVVTSAVMACKATTKVDKIVENTKNDIDKVHTATATGVTEAGESYSVEDSKKDLTLIYVQTGVKFAKLYGPSVVLGALSITSILASNNILRKRNVALGAAYAAIDKGFKEYRSRVIERFGEEVDRELKYNLKAKKFDETIIDEETGKEKKVKKNGFVVSPADISGYARFFEKYTQDEDGNSILNPHWESNNEYNLMFIKAQERYANDLLKAKKRVFLNEVYEMLGLPRTKAGQIVGWVYDPENSKGDNYIDFGLYSDNLSYSDYVNGFDQAILLDFNVDGNIWDLM >NZ_CP040506|1166677:1185253|1183929_1184220_-|WP_006781046.1|DBSCAN-SWA MNDEAIQKIMNYTNMHLFEPGENWPKSAIMERSYERWAVDEILLAIMDHPMTEADLVIEGFILKMELFLYLSENPANNHIFQVAENTAKTLLGLIL >NZ_CP040506|1166677:1185253|1171642_1172530_-|WP_006781029.1|DBSCAN-SWA MAFWKRKKKRTTAKPKINASISKVEKPKKISEKHDFVVHKEEHQVSLSPKPKMKNLPTFKPDRYEKEFMNIFRQLVSEKNRPWDIWKDFIVMSACSISNSVDKSQFDEREKRYLDIIRHYSKSKQELFPQLFANLVMSLEMNPEQDFLGKMYMSLNLGYDELKQIFTPYDMCRLLAKITITDVTEKVRKDGYITINDPCCGAGANLIAAIHEARKQLEKENYNYQNHLLVSGQDIEEVPALMCYIQLSLLGIAGYFKVGNSLIKPMTMADDLKNYWFTPIYFSDVWTMRRLFHSI >NZ_CP040506|1166677:1185253|1184473_1185253_-|WP_006781048.1|DBSCAN-SWA MQKVKILKRVGRQLYRSSPTILTVVASIGVIATTITAVQATPKAIKLLKEAELEKGENLTKLEIVRVAGLTYIPSVLLGVSTIACIFGANALNQKKQASLMSAYVMLNESYKQYRKSAKTVYGEDADDKIHAEMAKDAMVSSYDWGYQVYNMDMDSESERLLFYDLASKKYFRTTMAAVLNAQYHVNRNLSIKGDCSLNEYLSFLGVEGIDGGDELGWDISYMIEEMDCYWLDFDNYKSTLEDGLECIIIDTMAVNKFE >NZ_CP040506|1166677:1185253|1181908_1182175_-|WP_006781043.1|DBSCAN-SWA METNEIMNNEEVMETATEEIVKASSGKGFKVAAGIGLAVLAGVVIYRYVGKPMIAKIKAQKEQQIIDAEWDEPEEPIIENEKEDSEEA >NZ_CP040506|1166677:1185253|1182245_1183019_-|WP_006781044.1|DBSCAN-SWA MKKSNISKVLSSVRTSMAKHSPEILTGIGIAGMITTTVMAVRATPKALILIEERKEEIGAEELEVADVVKTTWFCYIPAAITGTLSIACLIGASSVNAKRNAALATAYTLSESALKDYQGKVVKMFGEKKHETVKDAVAKDKIEKNPVVTREVIITEKGNTLCYDAISGRYFKGDIDKIKKAECELNRQMRDEMYVSLNDFYYEVGLDNIKIGDELGWNIDNGYIDLSFSSQLASDGTPCLVIDYSIAPRYNFSELM >NZ_CP040506|1166677:1185253|1183018_1183897_-|WP_006781045.1|DBSCAN-SWA MKALRKQKIETANIQVGDQMVIPLAELGEFTATAHKVTDESVMFIFDEYVTCRPMNNCSTNKGGFEKSDLKKWMDTVLFMAFPEELRDKIYGLTIPTVGQIVGHEDEWDNKNLEPDSDEQLPLMKECKNRIACFEDQLTWGWLRNATKEGFSSAHFAGVSYDGNAACGPASDSHGVRPEFWLVKPESRGPVPRRSGRYPWGFDAGSEDILHYCQNDVMVTKEAALKMEIWNKENEIDTLRKEIEKLEKYKQYDKATAETKVIMDSFVRAGFTENQALDMVKTIFSVIFGGMR >NZ_CP040506|1166677:1185253|1178246_1178864_-|WP_006781035.1|DBSCAN-SWA MSSKGLTFLAFIAGAGIGSVCTWQLLKRKYELIAQEEIDSVKAAYAERKIVEEAGKSFVEGFRDGLKVAEDRTSKDEDDMDFKKYASIIQKEGYTDYSKTVEEKKGAAFVEKPYVISPEEFGEFEEYEKISLTYYADEVLADENDEEVDDVDEIVGKESLNHFGEYEDDSVFVRNDRLKCDYEILLDQRNYSDVVKTMPHRMEEQ >NZ_CP040506|1166677:1185253|1169531_1170059_-|WP_006781025.1|DBSCAN-SWA MSYQYDRYLAQHKSNVEAGFRWLQKNLPEITEGSGAEHNIVFAHDQSKTEPDEYGPYDIYFYGGNRSYGVVEDFRKAWLLHIHRNPHHWQYWVLINDAPEEGEIILEMPYCYVLEMICDWWSFSWFKGNLLEIFSWYEERKSYIKLHPNTRRLVEDILGRIKNKLGEVIVNEINR >NZ_CP040506|1166677:1185253|1168618_1169173_-|WP_006781022.1|DBSCAN-SWA MRKIKNNWKVALIVAVGIIAIILLGVFCIQSSQNRAFTLEEQVNTADSDIKVQEKRRVDLVYNLANCVKQYDKHEAETLTAIVENRGSTGDIENVTTAITAVSEAYPELKSNENYKELINELSITENLIAEYRSNYNKQVKEYNRYIRKFPTRFFLNILGYEVRGYTYLDYSAPADAPQNLFGE >NZ_CP040506|1166677:1185253|1166677_1167745_-|WP_006781020.1|DBSCAN-SWA MNLKSVKIIAVDFDGTLCENNWPGIGAPNEELIEYLRNRKKDGDKLILWTCRVEDMLQKAVEWCKERNLVFDAVNENLPEIIENFGSDTRKIFANEYIDDRNIPLSSCREKSNTQTWAEKEVEIACENERKASGTKEGEWDYGCACYESALKAYHSLSEDGHSGFSIGMTKYILNRLIDGKPLTSIEDTEDVWSDITDLSGYRGEIVNYQCKRMSSLFKYVYSDGTVKYHDVDRYCGINMNNPYDSYHSGLIDRVMEEKFPITMPYFPASKPFKVYCENFLVDPENGDYDTVGIFYTITPEDCTVKLNRFFKEENDEFVEIAKAEYDMRKHCHGCFGAANNDCLRCEEELQYESE >NZ_CP040506|1166677:1185253|1169159_1169351_-|WP_006781023.1|DBSCAN-SWA MKQPKKLTREQKECLFAHYLNCKDWMLVDETEFYYRIINKNTGVIKSVDKFRRVRRRKNHEKN >NZ_CP040506|1166677:1185253|1178853_1179018_-|WP_006781036.1|DBSCAN-SWA MDGLENVISVLDYVLDTKRKRHIMGGILLSVSFLFGGLAITVMTIRNEEEEDEQ |
25 | Faecalibacterium_phage(40.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1548181 : 1554082
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP040506|1548181:1554082|DBSCAN-SWA ATTAAAACTTGAAGGTGTCTTTAAATCCCTGCCACTTCTGATCCTTCTCGCTGATAATCAGCTTCATGCCGTCCTCAATCGGCCACTCAATGCCAATCTCCGCATCATCCCAGGCCATGCCGCCCTCGTCGCCGGGACGGTAGAAATCCGTGCATTTATAACAGAATTCCGCTTCATCGGACAGGACTAAAAATCCGTGGGCAAATCCCTCCGGAATATAGAACTGTTTCTTATTCTCGGCTGTCAGCTCCACGCCAAACCATTTTCCATAAGTCCTGGAGCCGCTTCGCAGGTCCACCGCCACGTCGAACACCCGGCCACGCACCACGCGGACCAGCTTGCCCTGGGGATACTGCTTCTGAAAATGCAGACCGCGCAGCACTCCCTTCACCGACATGGACTGGTTGTCCTGGACAAATACCATGTCGAGACCTGCCTCCTTAAAATCATTCTGGTTGTAGGTCTCCATAAAATATCCCCGCTCGTCACCAAATACAGACGGCTCAATGATATACAGGCCCTCAATGTCACAAGGGGTCACTTTTATCTTTCCCATGATGTACTTCCTCCTGCTGCTAATTTCTATTTTATCTAATATTCTATCTCCTGTAAATACCGTGACAGCGCATCCTGCCAGGACGGCAGACGCTCAAACCCATTTTCCTCCAGTTTGTCCTTGCTCATGCGGCTGTTCATGGGCCGTTTGGCTCTGGACGGATAGCTGGCGGAATCCACCGGCGATACCGCCACCTTCATGCCGGCCTGTTTGAATATCTCGCAGGCAAACTCATACCAGCTGCACAGCCCCTCGTTGGTGGCGTGATACCTGCCGTAGCGCTCGGTCTCAATCATATCCACCAGAAGTCTGGCCAAATCGTAGGTGTAGGTAGGCGAGCCAACCTGGTCATCCACCACGGTAATGCTGTCGTGGGTCTTGCCCAAATTCAGCATGGTCTTGATAAAGTTCTTCCCGTTGACGCCGAACACCCAGGCGATGCGGACGATGAAATACTTGCTCAGATGACGCTCCACCGCCAGCTCTCCCTCGCATTTGGTCAGGCCGTAGACATTGAGCGGCTCCCGCTCATCATCCGGCTCCCAGGCTCTGGTGCCCTGGCCGTTGAACACGTAGTCCGTGCTGATGTACATCATCTTGATGTCCAGTTTCTCACAGATTCTGGCCACGTTCTCCGTGCCGTCGGCATTGACTCTCCGGCACACATCCACGTTGTCCTCAGCGGCATCCACCGCCGTGTAGGCGGCACAGTGAATCACCGCCTCCACGTCCGCCTCCGTAATGACCCTCTCCACGCTGGCCGCGTCCGTGATATCCATCTCCTCGATGTCCACGCCCACCGCCACATGGCCCCGCTTCTCCAGTTCATTTACCACATCATGGCCCAACTGGCCTTTGACACCTGTCACCAATACTCTCATTATAATCTCTCTCCATACATCTTGTCAAAGTAGTTTGCATATTCTCCGGAAATAATGTTCTGCCACCACTCCTGATTGTTCAAATACCACTCGATGGTCTGGGCGATTCCGGTATCAAAATTGTACTGCGGCTTCCAGCCAAGTTCTGTCTCCAGCTTCGTCGGGTCGATAGCGTAACGCATGTCATGGCCAGGTCGGTCGGTGACAAATTTAATCAGACTCTCCGGTTTATTCAAAGCCTTTAAGATGGTTTTCACCACTTCTAAGTTGGTTCTCTCATTGTGTCCGCCTACGTTGTACACTTCCCCCTCTTTGCCCTTGTGGATAATCAAGTCGATGGCGGTGCAGTGGTCAGCCACATGAAGCCAGTCGCGGACATTTTCCCCTTTTCCGTATACCGGAAGCTCCTCGTCTGCCAGCGCGCGGCTGATAATCAGCGGAATCAGCTTCTCCGGGAAATGATACGGCCCGTAGTTGTTGGAGCACCGGGAGATGGTCACCGGCAGGCCGAAGGTTCTGTGGTAAGCCAGCACGAACAGATCGGCGCTGGCCTTGGAGCTGCTGTACGGGCTGGAGGTGTGAATCGGGGTCTCCTCCGTGAAGAATAAGTCCGGCCGGTCAAGCGGCAGGTCGCCGTACACCTCATCGGTGGACACCTGGTGAAAACGTTTCACGCCGTACTTCTTGGAAGCGTCCAAAAGCACTCTGGTTCCCTGCACGTTGGTCTGCACAAAGATTCCCGGGTCCGTGATGGAGCGGTCCACATGGCTCTCCGCCGCGAAATTCACCACGATATCAAATTTCTCTTTCTCAAACAAGTCCATGATAAACGGCTCATCCGCGATGTCGCCTTTCACAAACTTGTAGTTTGGCTTGTCCTCCACCGGTTTTAAGGTCTCTAAATTGCCTGCGTAGGTGAGTAAATCCAGGTTCACAATCTGGTAATCCGGGTACTTATTCACCATATGATGCACGAAATTGCCGCCGATAAAACCGGCGCCGCCTGTCACTAAAATCTTCATATTATCAATTCCTCCTTGGTTTTCCTCTTCTTTATTTAAATGATTTTAATACTCTTACTTATCAATATACTTTCCCGCCAGCACATCCATCAGGTACTGCCCGTACTGGTTTTTCTTTAACGGCTCAATATTGGTGAGCAGCTGGTCTTTTGTGATCCAGTGGTTTAAGTAGGCAATCTCCTCCAGGCAGGCAATCTTGCGGTGCTGGTGTGTCTCCATGGTGCGGACAAAGTTGGTGGCATCCACCAGCGACTCATGGGTTCCCGTGTCCAGCCAGGTGAATCCCTGTCCCAAAAGCTCCACATCCAGCTTCCCGTCCTCCAGATAAATCCGGTTCAAATCCGTGATCTCCAGCTCGCCTCTGGCCGACGGCTTCAGATTCTTCGCATACTCCACCACCTTATTATCATAGAAGTAAAGGCCGGTGACACAGTAGTTGGATTTGGGCTGCTGCGGTTTCTCCTCGATAGAAACGGCCTTGCCGTCGGAGTCAAACTCCACAATCCCGAACCGCTCCGGGTCATCCACATAGTAACCGAACACCGTGGCGCCCTCCTGCTTGTTGGCCGCGTGGAGCAGGCGCTTCTTCAACCCATGTCCCGCGAAAATGTTGTCGCCCAGAATCATGGCTACATGGTCATCCCCAATAAAATCCGCTCCGATGATAAATGCCTGCGCCAGTCCATCCGGTGACGGCTGCACCGCGTAGGACAATGCAATCCCAAACTGATGCCCGTCGCCAAGCAGCGATTCGAATCTCGGCGTATCCTCCGGCGTAGAGATAATCAGGATTTCCCGGATACCCGCATTCATCAGAACGGACAGCGGATAGTAAATCATCGGCTTGTCGTAGATGGGAAGCAGCTGTTTGGATGTCACCATCGTCAATGGATACAGTCTTGTACCGGAACCTCCGGCTAAAATAATACCCTTCATCATACACCTCCGTGTGGTTTGTTATTTGCATTATACAGGGTGACGTTACGTCACTGTCAAGGGTTAATTCGAAAGTTGAGATTAAGATTCTCTTAAAATGATGTCCAAATTTTTGAAACAAATTTTATTTTCTGAAAAAACATAACTATTTCAAAATTTGTCTTGACAAACAAATCCTCCCATGATATAGTTAGGAAGTATTTTTGATAAGATTTGATTTCACATGTTTTTGTAAGTCATGAATCATTATTGATAACTTACAAAAGCATGTGTTTTTTTATTTTTTGGAGAAAATATAATATTATAATGGAGGTACCTATCAATGAATAACGGTACAGTAAAATGGTTTAACAGCACTAAGGGATTTGGATTTATCACAAACGACGAGACAGGCGAGGAAGTGTTTGTACACTTTTCCGGTATTGCCACAGATGGCTATAAATCTTTAGAGGATGGTCAGAAAGTAACCTTCGATACCACTCAGGGTAATCGTGGTCTTCAGGCTGTCAATGTGTGCGCAGCATAAGAATTGACATTCTTACAGGCGTCCAGGCGGTTCGCAATGAGCGAATCTGCCTGGACGCCTGTTTTTTTGCTTTCTACAACAGGAATGAGTCCGGTGTTACTGGCTCTGGGCACATTTCCGGCAGATGAAAATGCCGGCCATTTTGGTGATGTGGATACAGCCATGCTCTCTCATCTTCTTCTGGAGGAAGGATTTGAAATCCAGCTGGCGCTGGCTTAGGATCTCGCTCTGGTTGCCGTGGCAGGATAAAATGTAATCCAGAAGCGGTTCCGCTTCGTCCACAGTCAGGTAATCGTCGTACAGCACTTTTTCCACGGAATCGAAATGCTCCTCCAACATTTCCTGGCCGTTTTCCAGGCCGAACAGCTCGTAGAGGGCCACCTCCGAGAGCGTGATTCTGGAATCGAATTCCTGTACCAGGGAGGTGATTTCCTTCATGTGCTCTCTGCCGTAGGCGCTGCAGTAGAAAACACCGTCCTCCCTCATGACCCGGTTAATCTCCTCCAGGGCCTTATCCAGATTCTTGACGTAGAAGAGCACATGGTTGGCGATGACCCGGTCGAAGCTGGCGTCACCGTAGGGCAGCTTCTGGCAGTCCGCCACGGCGTAGCGGAATACCTCCTGCCTGGCATCCACATTTTTCAAAAGCTCGGACGCGTCTTCCACCATCCCGGCGGAGATGTCGGTCAGGAAGATGTTTTTTCCACGAAGGCTGTCAAGTTCCGCCTCCTGCCACAGCTCCCCGTTGCCGCAGCCCACCTCCAGGATAGAGTCCGCCGTTTCCAGTTCCATCTGGGACAACAGCCACGGGAACCAGGACACCGGGTTGTGGGAAAACCGGCGGTGGAGCTCGATTCTGACATTTAAGTTGACCGCTGTCTTGTACTGGTCCACCAGCGCATGCTCCATGTTGGTGATATGAATCAGATTCAGAATGCTGTTCCAGTCCACATGCCTGGACTCCCCCAGCATCCTGGTGGTATCGTCCAAAGTCTGGGCCATCATCTGGAGATGGCGGATTCTCTTCTCAATCAGTTCCTTCTGCAGATCCAGCGAATGGGCGATATCCTCCACCTCGTCGTTAATCGTCATCGCCACAATCTCCTCCAGCGAAAACCCCAGGTATTTCAGAGAAAGAATCTTCTGCAGCCGCCCAAAGTCCTCATCCGTATACAACCGGTACCCCGCCTCACTCACCTGGCTCGGCTTCAAGATCCCCTGCTTATCATAATACCGAATCGTCCGTATCGTCACATTCGCCTTCCGCGCAAACTCCCCAGATGTGTAATACCCCTTGTGCCCCATCATGTCCACCCCCTTCATTCTCTAACCTTTTTATCTTCTTGCTACACTGTCTTTCCTTGTGGACTGCATTCTCTCACTCAGAGCCTGCTTTAAGACAGCAGAGCAGCTGATATGATTCTCCTCCACAAAAGTATTCAGCCATGCAGGGATGGTAAGTGTTTTCTTTACGGAGCGATTGCCATATTTTGCCGCATAGGAATCCATATCCAGAGCGACCATATTTACAAATTGTCCCGGTTGCACGGGAATTTCACTTAATTCTGATGCCTTCGGCGCCTCTCTCCCATCCTCCAATTCCGTAAGCACCCAGCCGCTGGCCGCGTCCTCCGACATAAAAATAGCTTCAGCCATCGTATCACCGCCGGTGACACACCCCGGTAAATCGGGAAATTCAACCACATAACCACCACTTCCATCTTCATATGGGGTAAATATTGCCGGGTATACTAATTTCATTGTTATTGCTCCTTTCTATCTGGGCTGCAACAGCACGTTAAAGCCCTGCCTGTTTCAGTATTGATTTCACCAACATAGGGTTTATATCTTTCCCTTTATGCTCCGGAACTGTTATCTTTCCGGGTTTTACAAAATGCTTGTATTGATGATGAGAACCGACTTGATTCACTTCAAACCAGCCATCCTTCAAAAGCAGTTTTTCTACTTCCCTAAACCGCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP040506|1548181:1554082|1553899_1554082_-|WP_006781432.1|DBSCAN-SWA MRFREVEKLLLKDGWFEVNQVGSHHQYKHFVKPGKITVPEHKGKDINPMLVKSILKQAGL >NZ_CP040506|1548181:1554082|1549616_1550639_-|WP_006781437.1|DBSCAN-SWA MKILVTGGAGFIGGNFVHHMVNKYPDYQIVNLDLLTYAGNLETLKPVEDKPNYKFVKGDIADEPFIMDLFEKEKFDIVVNFAAESHVDRSITDPGIFVQTNVQGTRVLLDASKKYGVKRFHQVSTDEVYGDLPLDRPDLFFTEETPIHTSSPYSSSKASADLFVLAYHRTFGLPVTISRCSNNYGPYHFPEKLIPLIISRALADEELPVYGKGENVRDWLHVADHCTAIDLIIHKGKEGEVYNVGGHNERTNLEVVKTILKALNKPESLIKFVTDRPGHDMRYAIDPTKLETELGWKPQYNFDTGIAQTIEWYLNNQEWWQNIISGEYANYFDKMYGERL >NZ_CP040506|1548181:1554082|1553436_1553862_-|WP_006781433.1|DBSCAN-SWA MKLVYPAIFTPYEDGSGGYVVEFPDLPGCVTGGDTMAEAIFMSEDAASGWVLTELEDGREAPKASELSEIPVQPGQFVNMVALDMDSYAAKYGNRSVKKTLTIPAWLNTFVEENHISCSAVLKQALSERMQSTRKDSVARR >NZ_CP040506|1548181:1554082|1552197_1553406_-|WP_034859739.1|DBSCAN-SWA MGHKGYYTSGEFARKANVTIRTIRYYDKQGILKPSQVSEAGYRLYTDEDFGRLQKILSLKYLGFSLEEIVAMTINDEVEDIAHSLDLQKELIEKRIRHLQMMAQTLDDTTRMLGESRHVDWNSILNLIHITNMEHALVDQYKTAVNLNVRIELHRRFSHNPVSWFPWLLSQMELETADSILEVGCGNGELWQEAELDSLRGKNIFLTDISAGMVEDASELLKNVDARQEVFRYAVADCQKLPYGDASFDRVIANHVLFYVKNLDKALEEINRVMREDGVFYCSAYGREHMKEITSLVQEFDSRITLSEVALYELFGLENGQEMLEEHFDSVEKVLYDDYLTVDEAEPLLDYILSCHGNQSEILSQRQLDFKSFLQKKMREHGCIHITKMAGIFICRKCAQSQ >NZ_CP040506|1548181:1554082|1548771_1549617_-|WP_006781438.1|DBSCAN-SWA MRVLVTGVKGQLGHDVVNELEKRGHVAVGVDIEEMDITDAASVERVITEADVEAVIHCAAYTAVDAAEDNVDVCRRVNADGTENVARICEKLDIKMMYISTDYVFNGQGTRAWEPDDEREPLNVYGLTKCEGELAVERHLSKYFIVRIAWVFGVNGKNFIKTMLNLGKTHDSITVVDDQVGSPTYTYDLARLLVDMIETERYGRYHATNEGLCSWYEFACEIFKQAGMKVAVSPVDSASYPSRAKRPMNSRMSKDKLEENGFERLPSWQDALSRYLQEIEY >NZ_CP040506|1548181:1554082|1550693_1551575_-|WP_006781436.1|DBSCAN-SWA MKGIILAGGSGTRLYPLTMVTSKQLLPIYDKPMIYYPLSVLMNAGIREILIISTPEDTPRFESLLGDGHQFGIALSYAVQPSPDGLAQAFIIGADFIGDDHVAMILGDNIFAGHGLKKRLLHAANKQEGATVFGYYVDDPERFGIVEFDSDGKAVSIEEKPQQPKSNYCVTGLYFYDNKVVEYAKNLKPSARGELEITDLNRIYLEDGKLDVELLGQGFTWLDTGTHESLVDATNFVRTMETHQHRKIACLEEIAYLNHWITKDQLLTNIEPLKKNQYGQYLMDVLAGKYIDK >NZ_CP040506|1548181:1554082|1548181_1548736_-|WP_006781439.1|DBSCAN-SWA MGKIKVTPCDIEGLYIIEPSVFGDERGYFMETYNQNDFKEAGLDMVFVQDNQSMSVKGVLRGLHFQKQYPQGKLVRVVRGRVFDVAVDLRSGSRTYGKWFGVELTAENKKQFYIPEGFAHGFLVLSDEAEFCYKCTDFYRPGDEGGMAWDDAEIGIEWPIEDGMKLIISEKDQKWQGFKDTFKF >NZ_CP040506|1548181:1554082|1551897_1552101_+|WP_006781435.1|DBSCAN-SWA MNNGTVKWFNSTKGFGFITNDETGEEVFVHFSGIATDGYKSLEDGQKVTFDTTQGNRGLQAVNVCAA |
8 | Enterobacteria_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1982462 : 1993200
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP040506|1982462:1993200|DBSCAN-SWA ATCATGCCAGCACCTCCTTTCCCACTTTCCACTGAGCTGATGTCTGATAATCCTTCCACATGGCCGCATACACTCCTTTCCGTTCCAACAGTTCCCGGTGGGTGCCACTCTCTGCCACCACGCCGCTTTCCAACACCAGAATCCGGTCCGCCCCCTGGACGGTGGACAGCCGGTGGGCAATCATCAGAACCGTCTTTCCTCTGGTCAATGTTTCAAATGCCTTCTGTATCAGCATTTCATTTTCCGGGTCCGCAAATGCTGTCGCCTCGTCCAGTACCACAATCGGCGCATCCTTTAAAATCGCCCGGGCCAGCGCAATCCGCTGCTGTTCCCCGCCCGATAAGTAAATTCCCTTCGTCCCTACGACCGTATGGATTCCCTGGGGCATTTTCGCCAGAATATCATCGCACTGGGCGGCATGGGCCGCCTCAAGCGCCTCCTTCTCTGTGGCATCCGGCCTGGCCGCCCGGATATTTTCCAGAAGACTTGTTTTAAACAGATGAGTATCCTGGAACACAAAGGATATCAGGTTCATCAGCTCTCTGGAGGGAATGTTCCTCACATCCACGCCGCCGATAAGCACCTGCCCCGCATCTACATCCCAGAACCTTGGAATCAGACTGGCCGCCGTCGTCTTTCCCCCGCCGGAGGGGCCCACAAGCGCCGCTGTCCGGCCGGCCGTTACCTGGAAGCTGACATGGGAGAGCGCCTGGGAAACCGCCCCAGGATAGGTAAAGGATACATCACGAAACTCGATAGATGTCTCACTCGGAACACTCCCGTCCCCTCCGGCCGGTTCCTTCAAAAGCTCCGTTCCCATAATTGGAGCCAGACGGCGCACCGCCTCATCCGCCTCCATAACGGCCGTGCTGGCAAACAGAATCCGGTTCATCATCATGGTGCACAGCGGCGTAAACAGAATATAAAATAACAAATCCACCAGCACGGCCCAGCCATCCGATGCCCTGGCGCACAACAGCATCCCCATCGGAATCAGAAGCAGAAACGTACTGTTGAGCACCGTGGTAAACGCCGTCATCGGCCGGCCGCAGCCCATGGCATAGTCCGTGGCCAGCTCTTTGTATTTGATGATGGAAGTGTAAAAGCTCTTGAAGGAATATACCGTCTGCTGAAATACCTTCACCACCGGAATTCCCCTGACATATTCAGTCGCCTCCGCGCTCAGCTCCTCGATGGACTTTTGGTAGCGCTCGAAAAAGTCTGCATTCTTTCCGCCCATCATCCGCGTCAGCAGATAGACGGAGACTGCCAGCGGCAGCAGACAGACAAGCCCCATCCGCCAGTCAAAGACAAACAAAAGAACAATAGCCGCCACCGGCGTCACCACAGCCCCCACCAAATCCGGCAGCTCATGAGCCAGCATTGTCTCCGTCATCCCCGCATTGTCATCAATCTGCTTGCGCAGACGTCCTGACTGACTGGCTGTGAAATAGCCAAGAGGGACATCCACCAGATGGGAAGCTGCCGCCTTTCTCATATTTTTGGCCGTCCGGAATGCCGCCAGATGGGTACACATCAGCGCCGCAAAATACACCGCCGCGCTGGCCAGCGCAAACCACAGCGCCATCCACCCGTAACGGACAGCCCCTCCCGCATTTTCATAATCCGGCATCACCCGGATAACCTCGCGGATGACATACCAGATACAGATATAAGGCACCGTAGCCAGCACTGCCGAAACACCCGATAGAATACACCCCGTTATCGTCAGCCTCCTGTGGCCGCCGGCGTAGTCAAGAAGCACTGCAACACTGTTTCGTTTTTTCTCTTTCACGCTTGCACCTCCTTATGTTTTTGTAGTTAGTCATTGCTAACTATTGTTTTAAAAAGAAGCCCGTTTTCAGACGGACTTCTTTGTGCCGATTGCCAGTTCTTCCATAATTTTCCTGACAACATTAAGTATAACCATTTCATTTTCGATTGTAAAGCCAGGTAAATGAGAGACTTTCGCAATAATGCCTGATTCCTGATTTTTCAGGCACCAGTTTTCTCAATTATCTCCACCACTTCCTCCAGACTCTTCTTTCCAAAATACACATCCTCATCATTCACCACCATGCAGGGCACACTCATAATCCGGTACTTCTTTTTTAATTCCGGATAATGCATTAAATCCACCATCTCCGCCGTGATATGAGGGGAAATGGAGGCCGCCTTCTGGGCCGCCATCACCACCTCCGGGCACATGGTGCAGGAGAGGGATATCATGACTTTCACATTCCTGTCCCTGGTGATGCTCTTTAATCTGTTCACAATATCGGTATCCGTCTGCTGTCCCGGCCCCGCTACGTTGTAGAGGGCAATAATAAACGAATTGAATTCATGGCCGCCCGGCACACCGTGAAACAGTATCCCACTGCCGGTACCGTCCTCGTAACACAGCTCCATAGCCGGAAGGAGACGGTCCTGGGAGGAATCCCCTTCTTCCCAGATGATTTTGCCGCTTATCTCCGTCAATTCAGACAAAAAGCCTTTGATTTCACCGGACAATGAGGTGTCATCCAGCCAGGTACGGATGCGTACGCGGTTCTCAAATTTAGGGAACAGGCTCTCCAGCTGGGCACGGATTTCCGTGTTCAGGAAACGGTTGTCGTTATCCTGGGATGAACTTGCAGAGATGTCAGAAGACGTTTGGCCTGACGGTCTATCCGTCTCACTCTTGTCTGCCATCTCCTCCACCCATTCCAGTCTGCTTAAATCCGCCTTTTTTCTCACGAGCTCCGGTATTTCCAGTCTGTCATGCAACTCCGCCACATATTTCTCCATGGAAACCGCCGCCACGGCGCCGTCGGAAACCGCGGTCACTACCTGACGGAGATTCTTTACGCACAAATCTCCGGCGGCATAGACGCCATCCAGATTGGTTTTCTGGCTGGCGTCCGTAATAACATACCCCTGGCCGTCCAACTGGATCTCTTCGCCCAGCCACCCGGTATTTGGCACATAGCCGGCAAACACGAACACGCCAAAACCCTCTTCCGATTCATAAGTCCATTCCCCGCCCGTCTCATTATTCCTGAATCTGGCCGACGTCACCATGGATGCTCCTTTGACTTCCATGATTTCCGTCTGAAAATGAACGGCAATCTTGTCACTCCTCTTCAACTGGTCTGAGACCGTCTTTGCGCAAGTGAAATCTTCTTCCCGGACAATGAGAGTCACCTTTTTGGCATATTGGGTCAGAAACAGACCTTCTTCCACCGCCGCAAATCCTCCGCCGATGACAAAGACCTCCATCCCGGTAAAGAATTCCCCGTCACAGGTCGCACAGTAGGCAACGCCGCGGCCCTGGAACTCTTTTTCCCCTTTAAAGCCCAGTTTTCTCGGATTGGCGCCGGTGGCAATGATCACCCCCAAAGCGCTAAAATCGCCTTTTGTGGTGTGGAGGATTTTCATCTCTTCCCCCAGCTCCATACTCACAACTTCTCCGATAGCAAACTCCGCGCCAAAGGCTTCCGCCTGCAGGCGCATGGATTCGGTCAACTCTTTGCCGCTGGTCTTCTTGACCCCCGGATAGTTGACTATCTCCGAGGTTATGGTTATCTGACCACCGATTTTTTCTTTTTCCATTACCAGCACTCGGTATCTGGCTCTTGCCATGTAAATGGCGGCAGATAAACCGGCCGGCCCGCCGCCTGCGATGATAACGTCATATAAATTTTTTGTGATTTCTTCCTGCATGTGCACCTCCTGTTATTTGAAAGAAAACCCCGTCTCTTTTTTCTTGAGACGGGGAATTGGAAATTACCGCATTACAACAGTCCTACTAAATCAAGACTCGGTTTTAATGTCTCTGCTCCCGGCTGCCATTTGGCCGGGCAGACCTGGTCGCCATGCTCGGCCACAAACTGGGATGCCTGCACACGGCGGAACAATTCGTCAGCATTTCTTCCTACGTTTCCGGCAATCACCTCATAAGCTACAATTTTGCCTTCCGGATTTACAATAAAGCTGCCGCGCTCGGCCAGGCCATCGGATTCAATCATCACCTCGAAGTCCCGCGCAAGGGCGCCGGTTGGGTCGGCCAGCATGGGATATTGGATTTTCTGGATCGTCTTGGACACGTCGTGCCATGCTTTGTGGACAAAGTGTGTATCGCAGGACACACTGTAAATTTCACAGTTGATGGCCTGGAACTCACTGTATTTGTTGGCCAAATCTTCTAACTCCGTGGGACATACAAATGTAAAGTCGGCCGGATAGAAGAAAAATACGGACCATTTCCCTAAGATATCGTTCTTGGACACTGGTTTGAATTCTCCATTTGTGTAAGCCTGTACCGTAAAGTCGCTGATTTCTTTTCCAATTAATGACATTTGCGTATTCTCCTTTTCTTGTTGATTTCTGATTTTGTTGGTTCCTGATTTTGTTAGTACATGATTTTGTTCACTAGTATTTGCCCTAATCAGGCAGTTAATACCGGAATTATGGTTTTATTGCAGTTAGTAATTGCTAACTTTTGTTTATGATAACATTCGTCCTTAAATTTGTCAATAAATAGTATTGATAATAATTATCGATATTTGCATGCGGCCCCTAAAACCTGAAAATTCCCATATAAAAATAACCGGAGCCTGCCTTGTTTCTGGCACACTCCGGTTATCACTGCCTATTCACTATAAACTCAGTGGGAAAACTGGCTCTGGTACAGCTGGTAGTAAGCGCCCTTCCGCTCCATCAGCATCTCATGGGTCCCCGCCTCCATCACATTTCCATTGTCAATGACGAAAATGTGGTCCGCTCTGCGGATGGTGGAGAGCCGGTGGGCAATCACGAACGAAGTTCTCCCCTCCAGCATCGCCGCAATCCCTCTCTGCACCAGCAGCTCCGTATGGGTATCAATGCTGGACGTCGCCTCATCCAGTATCAGGATTCTTGGATTCGACACCATCGTCCTGGCAAATGCCAGCAGCTGCCTCTGTCCAATGGACAGCCGCGCCCCTCTCTCGCTTATCTGGGTATCATACCCATGCTCCAGATTCATGATAAACTCGTGGGCATCCACCGCCTTCGCCGCCGCTATCATTTCCTCATCCGTGGCGTCCAGCCGCCCATACTTGATATTCTCCCTTATGGTCCCGGAGAACAGGAAATTATCCTGGGTCATAATGCCCATCTGCCGCCGCAGACTCTTGATTGTCACATCCTGCACATCATACCCGTCAATCGTAACCCGCCCGGCCGTCACGTCATAGAACCGGCTGATCAGATTCACAATGGTCGTCTTCCCCGCTCCGGTGGGTCCCACCAGTGCGATGGTCTCTCCGGGCCGGATTCTGAAATTCACATCCTCCAGAATCATCCGCTCCGGCTCATCCTGATAGGCAAAGGACACATGCTCAAACGCCACCTCTCCCTGAATATCCGGAAGCTCCGACGCCCCATCCCGGTCCACAATATCCGCTTTTGTATCCATAATATCAAAAATCCGTTCCGCCCCGGAAATGTTGGTCACCAGCTTATTATAGAAGTTGGCCAGATTCCGGATCGGGCTCCAGAACATGGCAATGTAGGTGGAAAATGCCAGAAACGTGCCGATGCCGATTTCCTCCACGCCCACAATCTCGATTCCAATATAGTATAACAGAAACCCGCCAAGCCCCCAGGTAATCTCCACCACCGGCCCAAATCCGTCCGCCAGACGCACCGCGTCCAGAAACGCCTGCCGGTGTTCCTTCGTCAGCTGGGCAAACTCCTGTCTGGTCTCCGGCTCCGCGGCGAAGCTCTGGACAATCTTTATCCCCGACAAATCCTCGTGAACATAGGCATTTAAATTGGAAGTCTTCTTCCGGTATACCTGCCAGCGCCGGTGCGCCCGCACCTCGATAAAGAGCATGCCAAGCAGCAACAGCGGCAGAGTCAGAAGCGCCGCCATGGCCAGCTTGTAATTCTTCACCACCATGATAACCGCCACACAGAACACCGTCAGCATATCCGGTATCAGCTGGGTCACACTGTCCGACAGCACATCCTTCAGCGAGTTCACATCCCCGATGATGCGGGCCAGAATCTTCCCCGTCGGCCGGCTGTCAAAGAAATGAAAACTCAGCGTCTGGATATGCTCATACAGTTCCTCCCGAATCGTCACCAGCACCCGGTTGGAGACGTCGGCCATCATGTACATCCTGGCCCTGGTTCCAATCAGGAAGATGACAAACAGCACCATCGCCCCGGCGCCCAGTTTCAAAAGCCCCGGCACATCCCCGCCGGCCACGCACACGTTGATGGCATACTCCATCAACAGCGGCGCCGACAGACTGATGGCGATGGTAACCGCCATAATTCCCAAAACAATGACAATCTCTTTCTTATATGCAAACAAATATTTGTACAGACGAAGCAGCGTCTCTTTCTTCAACACTTCCTTCTGCTCTTCGTCCATTCTGCTGGAATTTACTGACATACTGCTCCCTCCTTTCCACCGTTCCAGGTCTGTCCATGCAGGCGCGGGCTTTCCCCGTACTGTACCTGGTAAGTCTTGTAATACTGCCCCTTCAGCGCCATCAGCTCCTCATGGGTTCCTCTCTCGATGATTCGCCCGCCATCCAGAATCAGAATTTCATCCGCGTGGCGCACCGCCGAGATTCTGTGGGCGATGATAATCTTGGAACAGTCCTTCATCTCGGACAAATGTCCCTCAATCACCTTCTCCGTCTCCATGTCGAGAGCCGACGTGGAGTCATCCAATATCAGGATGGGCGCTCCCTTGGCCATGGCCCTGGCAATGCTGATTCTCTGTTTCTGTCCGCCGGACAGTCCCACGCCCCGCTCGCCAATCACCGTGTCATACTGCTCCGTCAGCTTGTCGATAAACTCGCTGGCCCCGGAGCGCAGGGCCGCCTCCTTCACACTCTCCCACGGCACCACTTCCTTGCCGCCGGTCTTGATATTCTCCGAAATGCTGTCCGAGAACAGGAACACATCCTGCATCACCACCGCCGTACTGCTCCGCAGCCGTTTTAACGGCAGTCTGCGGATATCCGTCCCGTCCAGCAGAATCCGTCCCCTGGACACATCGTAGAACCGCTGTATCAGATTCACCACCGAGGTCTTCCCGGAGCCGGTCACGCCCATGATTCCCAGCGTTCCGCCCGGTTCCAGGGAGAAGGTGATATCCGACAGAATCCGATTCCCGTACAGGTCAAAATCCACATGGTCGAAGGACAGTCTGCCCTCAATCTTCTCCGGCGCCACCGGCATTTCCGGCTCCCGGATATCGGGCTTCTCCGCCATAATTTTCTTTATCTTCCGGTTGGAAGCCATGGCCGCCGCAAAGTCGTTGGACAGCCAGCCCACCATCTCCATAGGCCAGATGATATTGTTGGCATACTCGGAAAATGCTCCCAGCTGCCCAATGGTAATCTGCCCCCGGATGACCAGAATTCCGCCAAAGACAATCACCGCCAGCAAAAGCACCTTGGACAGAAACGAAATCCCCGGCTGGTATTTGGTGATAAACCGGGCCTGGTCCATGTTCAGCTTGTAGAACCGGTGGTTGTGCTTCTTAAACTTTTCAATCTCATACTCTTCCCTGGCAAACGCCTTCACCGTCCGCACGCCCGCCAGATTTTCCTGGGCCACCGTGTTTAACTCGGCCGTCTCCTCGCTGATTTTGTCGTAGACAGCCCCAAGCCCATTCTCCATCCTGATGGCGCTCCAGGCAATCAACGGCATCACCACCACTGGCAGAAGCGTCAGTATGGGGCTGATGCGGAACATGCACACCATCACAATCACCGTGTGGAATGTACATTCCAGAATCAGCATCCCCACATACCCGCAGGCGTTCCAGATTCTGTCCACGTCGTCCTTCACACGGGCCATCAGTTCCCCCGTGTTGTGGCTGTCAAAATATCCCATGGACAAGGTCTGGATATGGTCAAACAAATCCTTTCGCAGCTTGCTGCCAATCCCCACCGCCGCGTAATCAAAGATAAATTCCTTCACATACTGGAACACCGCCCGTCCCAGCCCAATTCCCACCAGGCCAAGCAGCAGCTGGAGCAGAAGCTGCCGCTTCCCGCCCACAATCACATCGTCGATGATATGCCTGGTAATCTGCGGCGCCAGCGCGTCCAGCAGGATACTGACAACCATGGCAATGATGGCGACCAGATAAAGCGGACAGTATTTTTTAATATAGTTCCAAAGATTCTTCATAGCCGTAATTCCCCCTGATATTTTTATGTAACTCACACGCAAAAACAGCGCATAAAAAGTCCCTTCTACGGTCTTTTCACGCGCTGTTTCCTTCACAAATGCATAAAAAATGACCGTGGCCCACAACAGCCACGGTCTCTATGGATTCTGATCTCCTGCGATTGATGGAGTTTATATTCCTCCATCAAAGACAGTTCCCATCATCTACGAGGCAGACTGAGCGGGTAGTATATTCTCTCTCATTCTGTTTATGACTGCTGCTTCCCAATCTGTTGTTGTTGATTCCATTGCTGTTAGAGTTAACCATCCGTTTCATTCTTCTGCCTCCTTTCCTCTTCTTATTTTGAATCTGACATTTTGTATCTGACAATTTTTGTATTTTCTGGCAATTCGAATTCCTTGTTTTAGGATACCGCTAGTTTTCCGGTCTGTCAAGCACTTTTCAGTTTTCTTCCAAACCTTTTCCTCCAAACACTTCTCCCTACACCCGCAGCATCCGGTAGCCGATTCCCACATGGGTCTGGATGTATTGGGGCGAGTTTTCGTCCTTTTCCAGCTTTTTGCGCAGGGTCGCCATAAACACCCGCAGGGACGCCACGTCATTATCCCAGGAGCTGCCCCAGATTTCCTTCGTGATATAGGAATGGGTCAACACCTTGCCCACATTTTTGGCAAGCAGGCACAACAGCTTGTATTCCATGGGCGTTAAGTGGAGCTCCTCGTCGTTTAAGTACACACAGCCCGCCGAGTAGTCAATCTTCAGCCCCCCATTTTCAAACACGGAGGTCTCCGCCCCCGCATTGCTGGCATAGTAGCTCAGCCGCCGGAACGTCACCCGCAGTCTGGCCAGAAGTTCCTCCACGGAAAATGGCTTCGTCAGGTAGTCGTCCGCCCCCGCGTCCAGCGCCTCGATTTTATCCGTCTCCTCACTCCGGGCGCTGATGACGATAATGGGCATACTGGACCAGGAACGGATTTTTTTGATAATCTCCACCCCGTCCATATCCGGCAGTCCCAAGTCTAACAGCACCACCTCCGGATTGTGGGAAACGGCCTCCATCAGCGCCATCTCCCCGTTTCCCGCCGTGCGGTGTTTATAATTGTGTGTCTCCAATGTGGTGGAAATGAGATTGCGGATAGCCGCATCATCCTCCACCACCAGTACCAGTGGCTTATTCACGAAGCATTACCTCCTCTATGGGCAGTGTGAAGCGGAAAATGCTTCCTGCCGGTTGATTATCCATTACCATAATCTCGCCGCCGTGGGCCGAGATAATGGATTTGCACAGCGCTAACCCCAGTCCCAGACTGCGGCGGCTGTCCACCACGCCTTTATTCAGCGTGTAAAACATATCGAAAATATTGGTCTTCTCCTCATCCGGAATCCCCGGGCCGTTGTCCGCGATTTCCACCACCGCCATGCCATCCTGCTTGTCCGTCGTAATCACAATCTCGGAGCCGGCCGGCGTATACTTGACGGCATTGTTGACAATGTTGATGATGACCTGGACGACCAACTGGGCGTCGGCCCTCACCAGCATAAATTCCTCCTTCTGACGGACGGTTATCTTGTGCTCCACACTCTTCCGGTTGACATGGCGCAGCGCCTCGGAAACCACGTCGTCCAGAAGCTCCGCCGACAGCCGCAGCTTCATGGTGCCGTCCTCAATCCTGGTGACGGAAAGCAGATTCTCCACCAGATTAATCAGCCACAGGGAATCGTCGTACATATCCTGATACAGCTTCCCCCGCTGTTCCCTGTCCATGTCGTCCGCGTTGGCCAGCAGGATGCCGGCATTTCCCGATATGGAGGTGAGCGGCGTCCGCAGATCGTGGGAGATGGAGCGCAGCAGGTTGGCCCGCAGCTGCTCGTTCTTCGCCACCAGCATGCTCTGCTCCCGCTCCCGGAGCGCAATCTCATTTTCCAGCGCCAGGGCGCACTCTCCGAGAATGGAGCGCACCACGCTGCTCTCGAACCCTTCCAGAGGGGCCTCCCCCATGGAAATGCCCACCACCGCGTAGCTTTTATCCTTCGCACAGACCGGCAGGTACAGGTACCCCGCCTCCTGGTATTTCTCCGTAAATGCCCCGGCCCGCGTCCGGTTCTCGAATACCCACTCGGCCGCCTTCCGCTCTGACGGTCCCCGAAATGCTTCCGAACTGGTCCTGTCCGGCGGTTCCGGAACTCCTTCTGTCCTTCCCCTGTCCGTCGTCTCCGTCCCTGGCATTTTCCTTCCATTCGCAGTCTCCGCTCCTGGTGCTTTCCTTCCATCCGTTTTCGAAGTCCCGGACTCGGCTGTCCCCACGTCCGCCGCCTGAAACAGCATCGGCTCCGCCAGCTTCCCGTCCTCCACCAGGTAAACCACCACGTCTCTCTTCAGCAGTTTCACCAACTGGTCGGCCACAACGCTTCCAATCAGTTCCACGCCCTTCCCCTTCTGAATCAGCTGGTTGGTCTCCAGCAGAATCTTCGTCCGGTAAGCCGCCTCCGAGGCATCCTTCGTCTGCTGCTTCATCCTGACCGCCAGGGTACTGGCGATGAAAGCCGACAGGAACAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP040506|1982462:1993200|1982462_1984256_-|WP_138669611.1|DBSCAN-SWA MKEKKRNSVAVLLDYAGGHRRLTITGCILSGVSAVLATVPYICIWYVIREVIRVMPDYENAGGAVRYGWMALWFALASAAVYFAALMCTHLAAFRTAKNMRKAAASHLVDVPLGYFTASQSGRLRKQIDDNAGMTETMLAHELPDLVGAVVTPVAAIVLLFVFDWRMGLVCLLPLAVSVYLLTRMMGGKNADFFERYQKSIEELSAEATEYVRGIPVVKVFQQTVYSFKSFYTSIIKYKELATDYAMGCGRPMTAFTTVLNSTFLLLIPMGMLLCARASDGWAVLVDLLFYILFTPLCTMMMNRILFASTAVMEADEAVRRLAPIMGTELLKEPAGGDGSVPSETSIEFRDVSFTYPGAVSQALSHVSFQVTAGRTAALVGPSGGGKTTAASLIPRFWDVDAGQVLIGGVDVRNIPSRELMNLISFVFQDTHLFKTSLLENIRAARPDATEKEALEAAHAAQCDDILAKMPQGIHTVVGTKGIYLSGGEQQRIALARAILKDAPIVVLDEATAFADPENEMLIQKAFETLTRGKTVLMIAHRLSTVQGADRILVLESGVVAESGTHRELLERKGVYAAMWKDYQTSAQWKVGKEVLA >NZ_CP040506|1982462:1993200|1991123_1991822_-|WP_006781785.1|DBSCAN-SWA MNKPLVLVVEDDAAIRNLISTTLETHNYKHRTAGNGEMALMEAVSHNPEVVLLDLGLPDMDGVEIIKKIRSWSSMPIIVISARSEETDKIEALDAGADDYLTKPFSVEELLARLRVTFRRLSYYASNAGAETSVFENGGLKIDYSAGCVYLNDEELHLTPMEYKLLCLLAKNVGKVLTHSYITKEIWGSSWDNDVASLRVFMATLRKKLEKDENSPQYIQTHVGIGYRMLRV >NZ_CP040506|1982462:1993200|1988875_1990642_-|WP_006781783.1|DBSCAN-SWA MKNLWNYIKKYCPLYLVAIIAMVVSILLDALAPQITRHIIDDVIVGGKRQLLLQLLLGLVGIGLGRAVFQYVKEFIFDYAAVGIGSKLRKDLFDHIQTLSMGYFDSHNTGELMARVKDDVDRIWNACGYVGMLILECTFHTVIVMVCMFRISPILTLLPVVVMPLIAWSAIRMENGLGAVYDKISEETAELNTVAQENLAGVRTVKAFAREEYEIEKFKKHNHRFYKLNMDQARFITKYQPGISFLSKVLLLAVIVFGGILVIRGQITIGQLGAFSEYANNIIWPMEMVGWLSNDFAAAMASNRKIKKIMAEKPDIREPEMPVAPEKIEGRLSFDHVDFDLYGNRILSDITFSLEPGGTLGIMGVTGSGKTSVVNLIQRFYDVSRGRILLDGTDIRRLPLKRLRSSTAVVMQDVFLFSDSISENIKTGGKEVVPWESVKEAALRSGASEFIDKLTEQYDTVIGERGVGLSGGQKQRISIARAMAKGAPILILDDSTSALDMETEKVIEGHLSEMKDCSKIIIAHRISAVRHADEILILDGGRIIERGTHEELMALKGQYYKTYQVQYGESPRLHGQTWNGGKEGAVCQ >NZ_CP040506|1982462:1993200|1984456_1986163_-|WP_006781779.1|DBSCAN-SWA MQEEITKNLYDVIIAGGGPAGLSAAIYMARARYRVLVMEKEKIGGQITITSEIVNYPGVKKTSGKELTESMRLQAEAFGAEFAIGEVVSMELGEEMKILHTTKGDFSALGVIIATGANPRKLGFKGEKEFQGRGVAYCATCDGEFFTGMEVFVIGGGFAAVEEGLFLTQYAKKVTLIVREEDFTCAKTVSDQLKRSDKIAVHFQTEIMEVKGASMVTSARFRNNETGGEWTYESEEGFGVFVFAGYVPNTGWLGEEIQLDGQGYVITDASQKTNLDGVYAAGDLCVKNLRQVVTAVSDGAVAAVSMEKYVAELHDRLEIPELVRKKADLSRLEWVEEMADKSETDRPSGQTSSDISASSSQDNDNRFLNTEIRAQLESLFPKFENRVRIRTWLDDTSLSGEIKGFLSELTEISGKIIWEEGDSSQDRLLPAMELCYEDGTGSGILFHGVPGGHEFNSFIIALYNVAGPGQQTDTDIVNRLKSITRDRNVKVMISLSCTMCPEVVMAAQKAASISPHITAEMVDLMHYPELKKKYRIMSVPCMVVNDEDVYFGKKSLEEVVEIIEKTGA >NZ_CP040506|1982462:1993200|1986234_1986798_-|WP_138669613.1|DBSCAN-SWA MSLIGKEISDFTVQAYTNGEFKPVSKNDILGKWSVFFFYPADFTFVCPTELEDLANKYSEFQAINCEIYSVSCDTHFVHKAWHDVSKTIQKIQYPMLADPTGALARDFEVMIESDGLAERGSFIVNPEGKIVAYEVIAGNVGRNADELFRRVQASQFVAEHGDQVCPAKWQPGAETLKPSLDLVGLL >NZ_CP040506|1982462:1993200|1991814_1993200_-|WP_138670155.1|DBSCAN-SWA MFLSAFIASTLAVRMKQQTKDASEAAYRTKILLETNQLIQKGKGVELIGSVVADQLVKLLKRDVVVYLVEDGKLAEPMLFQAADVGTAESGTSKTDGRKAPGAETANGRKMPGTETTDRGRTEGVPEPPDRTSSEAFRGPSERKAAEWVFENRTRAGAFTEKYQEAGYLYLPVCAKDKSYAVVGISMGEAPLEGFESSVVRSILGECALALENEIALREREQSMLVAKNEQLRANLLRSISHDLRTPLTSISGNAGILLANADDMDREQRGKLYQDMYDDSLWLINLVENLLSVTRIEDGTMKLRLSAELLDDVVSEALRHVNRKSVEHKITVRQKEEFMLVRADAQLVVQVIINIVNNAVKYTPAGSEIVITTDKQDGMAVVEIADNGPGIPDEEKTNIFDMFYTLNKGVVDSRRSLGLGLALCKSIISAHGGEIMVMDNQPAGSIFRFTLPIEEVMLRE >NZ_CP040506|1982462:1993200|1987106_1988885_-|WP_006781782.1|DBSCAN-SWA MSVNSSRMDEEQKEVLKKETLLRLYKYLFAYKKEIVIVLGIMAVTIAISLSAPLLMEYAINVCVAGGDVPGLLKLGAGAMVLFVIFLIGTRARMYMMADVSNRVLVTIREELYEHIQTLSFHFFDSRPTGKILARIIGDVNSLKDVLSDSVTQLIPDMLTVFCVAVIMVVKNYKLAMAALLTLPLLLLGMLFIEVRAHRRWQVYRKKTSNLNAYVHEDLSGIKIVQSFAAEPETRQEFAQLTKEHRQAFLDAVRLADGFGPVVEITWGLGGFLLYYIGIEIVGVEEIGIGTFLAFSTYIAMFWSPIRNLANFYNKLVTNISGAERIFDIMDTKADIVDRDGASELPDIQGEVAFEHVSFAYQDEPERMILEDVNFRIRPGETIALVGPTGAGKTTIVNLISRFYDVTAGRVTIDGYDVQDVTIKSLRRQMGIMTQDNFLFSGTIRENIKYGRLDATDEEMIAAAKAVDAHEFIMNLEHGYDTQISERGARLSIGQRQLLAFARTMVSNPRILILDEATSSIDTHTELLVQRGIAAMLEGRTSFVIAHRLSTIRRADHIFVIDNGNVMEAGTHEMLMERKGAYYQLYQSQFSH |
7 | Bacillus_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
3861366 : 3868349
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP040506|3861366:3868349|DBSCAN-SWA ATTACATCAAATAACATCCCGAACGAAGCTTATCAAGCTCCAGTTTTTTATTCACCGAGCGCATCAGCGTCTCCCCGATGAGAACCGCGTTGACGCCCGCCTCATAGAGCACATGGATGTCTGCCGCCGTCCGGATACCGCTCTCCGCCACAAACAGAATTTCCGGCGGAACCAGGGAGCGCAGCTGGATGGAGTTGTTGAAATCCACCTCAAAGGTTTTCAGATTCCGGTTATTGATTCCGATAATCCGCGCGCCCGCCTCACGGGCCGATAAAACCTCCTCCTTGGTGTGTGCCTCGACCAGTGCACTGAGGCCAAGCCTGTCGCAGACATTTATATAAGAAGAAAGTGTTTCCGTATCTAGCAGAGAGCAGATTAACAGCACCGCGTCGGCGCCGATGACCTTCGCCTCATAAATCTGATACTCATCAACAGTAAAATCCTTACGGAGTAGGGGGATTTGTACCGTGTTTTTTATCTCCGTCAGGTATTGATTGCAGCCCATAAAGTAATCCGGCTCCGTCAGCACCGAGATGGCGGAAGCGCCTGCCATCTCGTAATCTCTGGCAATCTCCGTGTAGGGAAAGTTGGGGGCGATGACTCCTTTGGAGGGAGACGCCTTCTTCACCTCACAGATAAAAGCCATGTCAGGGCGATTGATGGCCTGCTCAAAGGGGAACTCTCTTTTCGGCAGAGCCAGAGCCTTTTCTTTCATTTCTTCCAGCGGACATCGTTTCTTTGCTTCCAGTACCCGCAGCCTTGCGGTATCTGCTATGGCGTCCAATATCATATCTTCACCTCGTATCATTTGCAGCGTTTAACAGGTTTTCGTGGCATTTACAAATTCATTCAGTTTCGCGTAGGCCATACCCGAGTCAATCAGCTCAGCCGCCCTTCGCACACAGTCTTTGATGGTGCAGTCATCAATGCCCAGATACAAACTCATGGCGGAATTCAAAATCACGATATCCCGCTTCGCCCCCTTCAGTTCTCCTGATAAAATGTCTCTGGTTATCTGCGCGTTCTCCTCCGGCGTCCCTCCGATGATATCGGACAGCTCATAGCGGGAAAGTCCCACCTCCTCAGGAGTGATATCATAAGCCTTTAATTTCCCAAACCGGATTTCACACACGTGTGTGGGCCCGGTCACCGTGGCCTCATCCAGTCCGTCCGCACCGCAGACCACCAGCCCGCGGGTAACGCCCAGATTGCTCAGCACTTTGGCCAGCGGCTCCACCAGCGATTTATCATACACGCCCAACAGCTGCATGGTAGCCCCCGCCGGGTTGGACAGCGGCCCCAGGATATTGAAGATGGTCCGTATGGCCAGTTCTTTTCTCACCGGCGCCGCATATTTCATGGAGGAGTGGTATGCCTGTGCAAACATGAAGCACATCCCCGTTTCCTCCAGCAGCTTCTCGGACTGCTCCGCCGTCAGCGCCAGATTCACGCCAAGCTGTTCCAGCACGTCGGCGGCCCCGCTCTTGCTGGACACGCTGCGGTTGCCGTGTTTGGCCACTGGGACACCGCCCGCGGCTACCACGAAAGCGCTTGTGGTGGAAATGTTGAAGGTACCCACCTCGTCGCCGCCGGTGCCCACAATATCAATCACATCGAAGGACGGTGTCAGTTTCAATGCCTTTTCCCGCATCACCGCCGCGCAGGCCGTGATTTCCTCGATGCTTTCTCCCTTCATCCGAAGCCCTGTCAGAAATGACGCCATCTGCGCCTGTGTGGCTTCCCCGTTCATAATTTCTCCCATTACCTGTTTTGCGGTATCAAAGTCAAGATTTTTCCGCTCCGTTACGTCGTGAATTGCCTGCTGTATCATAAGATTACCTCCAAAAAATTTTTCAATATTACCATACCGTCCGGCGTCATAATCGATTCCGGATGGAACTGAAGTCCATAGATAGGAAACTTTCGGTGCTGTACCGCCATGATTTCCCCCATATCGTCCTGAGCCGCAATCAGAAGCTCGTCGGGAAGGGTCTCCCTGGAAGCGATCAAGGAATGGTATCTCGCCGCCGGAATCTGATCCGGCAGTCCCGCAAACAGGGGACAGCCTGGGTTCAGGGAAATGATACTCTGCTTCCCGTGCATCAACTTTTTCGCGTGGACGATGGCGGCGCCAAACACTTCGCAGATGGCTTGATGCCCCAGGCAGACGCCCAGGATGGGGACGATACCCTTCATCTCCCGTATCACATCCTCGCATACTCCCGCGTTTTTCGGATAACCTGGTCCGGGAGAGAGAATCACGTGGCTGGGATGCAGCTCTTTGATTTCATCAACGGTCATCTCGTCATTCCGGATCACCCGGATATCCGGCGTCAGACTTCCGAAGAGCTGAACCAGATTGTAGGAAAAACTGTCATAATTATCAATCAGCAGTATCATCAGTCATTCACCTCCCTGGCTTTTAAGATGGCATTGATAACGGCTCCCGCCTTGTTGGCGGATTCTTCGTATTCCAATTCCGGAACACTGTCTGCCACAATGCCGCCTCCCGCCTGAACATACACCTTGTGGTTTCTCTTCACCGCCATGCGGATGGCGATGCAGGTATCCATGTTGCCGGTAAAGTCCAGATATCCCAGCGCCCCGCCGTAGATGCCCCGCGGCACCTGCTCCAGTTCGTCAATAATTTCACAAGCCCTGATTTTAGGCGCTCCCGACAGCGTTCCCGCCGGCAGCACGGCTTCGATGGCGGAGCAGCCGTCACAGTCCGGCCGGATATCACCCTCCACCTGGGAGCAGATGTGCATAATCCGGGAGTATTTGTGGATCATTTTGTATCCGGTGACTTCCACGGTGGAAAACTTTGAGATTCTTCCCAAATCGTTTCTTCCCAAATCCACCAGCATGTTGTGCTCTGCCAGCTCCTTCTCGTCGGCCAGCAGTTCGGCTTCCAGTCTGGTGTCCTCCTCCGGTGTGGCGCCTCTTGGTCTGGAACCGGCCACCGGGAAAGTAGTCAGCCGTCCCTGCTTCAGCCGTACCAGGGTCTCCGGCGAGGTGCTCATGATTTCGTCGTCGTCCAGCTTCATGTACACCATGTAGGGAGACGGATTGGTGGTCCGCAGCACCCGGTAGGCATTTAACAGGCTGCCCTCATAGGGACTGGAAAACTGTCGGGAGATAACCGCCTGGAAAATGTCTCCGTCCACGATATACTCCTTCGTCCTCTCCACCATCCGGCAGTAGTCTTCTTTTGAGACATTGCAAGTGAATTCCGGCTTGTCGGTGGTGACGGATTTGGCCAGCGGCGCATTACTCTGTATCAGCCGCGCGATGGATTCCAGCTCCCCGCAGGCCCTGCCGTACTGCTCCATCACGTTGTCAGTTTTCATGTTGACCACGATGGATATCTTTTGCTTCAAATGGTCGTAGGCGATGACCTTGTCAAACAGCATCAAATCATAATCGCTGCCGTCATCTTGTTTCAGTTTCAGCGTCGGCTCCGCGTACTGGATCATGGAGTAGGAAAAATATCCCACGAAGCCTCCGGTAAACGGCGGCAGTCCCTCGATGACCGGCGCCTTGTAATCTTTCAGGATATCCCGAAGGACATCCAGCGGCTTGCTGGTTTTGACGGAGCGCCTTGTATGCTCCTCCCCGGAGGAATCCTCCACGTCCACGGTGCCGTTCTTGCAGGTCAATCTCATAATCGGGTTAAATCCCAGGAAAGAATAGCGTCCCCAGGTCTCACCGCCCTCAATGCTTTCCAGCAGATAGAACCGGGTGCTCTTTTCCGCGATGCGGCGAAGAAGGGTGATTGGCGTCACCACATCCGCGTAAACTTCCCGGCAGAGTGGAATTCGTTTGTAATACTTTGCGAGTTTTGTAATTTCCTCACAGTCTGGTGTGATAATCATAGTGCGTTTCTCCTTTCCTGATGATATTTAAAATAATTGAAATAATAAAAAAAGACTTCTGCCCCTCGAAAGGGACAAAAGTCTTGAATTAACTTCTGCGGTACCACCCAAATTGGCGTAAACTACGCCCACTCAAAAATGCACTAACATGCATCTCCCCTTGGTAACGGATAGGAACCCCGTCGCCACCTACTCTCCATGAATACTGATAAATATGGATTTCAAAGCGCCCTCGCAAGTCCATTCGGATGAATTATTAAGACCTTCTTCCACCATATGAAGGCTCTCTGTGCAAAATAGAAACATCTTACTTACTCTTGCTCACCGGTTTGTTTGGTGATATTTAATTGTGCTTTAGTATATTACAGCGAAAAAGAAATGTCAACTAAATTTTGGAAAAAATTTTAAACCTTATGTAAAAGAAGTGTATAAATTCCTCAATTTTACAGATACTACTTCCAGAAACGGATAGTACAGTAAATGGAGGAATAATTTTATGAAAGCTACGGGAATCGTAAGAAGAATTGATGATTTGGGGAGGGTAGTAATACCAAAGGAGATTCGTAGAACACTTCGTCTGCGGGAGGGAACGCCACTGGAGATTTTCACGGACAGAGAGGGAGAAATCATTTTAAAGAAGTATTCTCCTATGGTGGAACTGACTGCATTTTCCGGTCAATATGCGGAAGCCATGGCCCAGTCGACCGGGATGATTGTATGTATTACGGACAGAGACCAGGTGATTGCCGTGTCCGGCGGCTCCAAGAAGGACATGATACAGAAGACTATCAGCAAGCAGCTGGAGCATATCATCGGAGAGCGCAACGTGGTGATGGCATCCAAGGATGACAAGGCGTTTATCTCCCTGACCTCCGATGAGATGGAGGGCATCACAGCCCAGGTGATTGCGCCAATTATCTGCGAGGGCGATGCCATCGGCTCAGTGGCCTTACTTAGCAGGGATGCCAAAGCAAAATTCGGAGATATGGAAATGAAGCTGGTCTTGACAGCCGCCGCCTTCCTGGGACGCCAGATGGAGGCATAAGACAAATAGACAGAGAAATGAAAAAAGAGGGGATTGCCTGTCACGCAATGCCCTCTTTTTCCAATGCCAGCATAATGCTGACGGTAAAGGTATGTTCCTCATGGCTGATTTTCATGTCGCCCTCGTATTTTTCCACCACGGTGGAGACGTTGGAGAGGCCGATGCCGCAGAAGGTTTCTCCGGTTTTGGTGCTCTCGTAGCGGGCGCCGCTCATGTGGTATTCTCCGTTGAAGGAGTTGGTGATGCTGATGACAAGAAATGCCTTGCGGAACAGGATTTCCACATTGATAAATTTGCGGATGGAGTCGTCGGTGATTCGCTCGCAGGCTTCCACCGCGTTGTCCAGAAGGTTCCCCAGGATGATGCACATGTCCACATTTTCGATGGGAAGGGTAGGCGGAATCATGATGTTGGTGGTGAAGGCGATGTGGGCTTCCTCCGCCTGGAGCTTGGTGCGGCTTAACATGATGTTGACCATCCGGTTCCCGCTGCTGACGGGGACCGTTAAATGGTTGGTGGAATCGTAGATTCCGTTGATATATTCCAGGGCCTGGCCGTACTGCTCCATCTGAAGCAGGGAGACAAGGCTGTGTAAGTGGCGTTTTAAATCGTGGCGCAGGGCCGTGATTTCCGTCTGGGCTTCCTCCACCTGTTTGTAGTAGTCGGTCTGCATGGACAGCAGCTGGTCGGCCATAATCCGCTTCTGGCGCACCTCGTTGATGATAAAGGTCTTGTCCACCAGGTAGAACAGAAAGAAGGTGGTGCACATGAGGATTCCGGCAATATCGGAATACAGAAGAAAGGTGTTGTCCCGCACGGAAAACAGGTGGATGGATATGAGCAGGATGCCAAAGGGATAGATGTAGCTGATAATGGCATACAGCGTCAGCTTGTTAGAGTCGTGCATCTTGCGGAACTGGACAAACAGCAGGGTGAACAGGAAAAATAAAATGCAGGCCAGCATCTGGGACGCGGCGCCCTGAACGAAATTGCTGGGCAGCGCCGGTATCTTCTGATGCACCAGTCCCATAATCAGGACGGAAGCTATCAGCTTGCAGGAATAGTTCAAAAACACGAACAGGCAGGCCACTAAAACCTTAATCTGATTGGAGTCCTGGTAGAACAGACAGGCAATCAGAAGCGAGAACAGACAGTAGAAAAACCAGGCGCTGAACAGCGGAAAATCCTGCACATAGGTCACCATCTGAAACAGAAAGTAGACCAGAAAGGCCCCCCGGATAAGATACCGCGGCAGCGCCTTGGGGTGCAGGAAGGTCTGGCAGAGCCAGAAGAATATCCAGGTCTCAGCCATAGACAAAAGCAGGCTGTTAATCAGGTGGTAGACGGACGGCATCATAGGTGGGCCTCCATATAGCTGAGAAATTCATGGCTGACATCCTTGCGCTTGTGCTTGCTCATAGGGATGACGTCCTGGTTGTCCATAGTAACCGATTCCTTGAAAATGTTTCGGACCCGGCTGATATTTACCAGGAAGCTCTTATGGATCCGGATAAAGCCGTGGGGCCTTAATTCCTCCTCCAGTGCGCTGATGGTTCCTGTAAAGGAGAAACGCCCCTCCTCAGCAACGACCTGAATGGTGCGGATATTGGATTCCAGGTAGTAAATATGCTCCGTCATCAGCTTGGTAGTGCCGGTTTTGGACGGAAAGGTGAAAAAGGAATTCCGCTGCTGAATCAGGTGGTCCAGCGTGTTGCGGACGGCATTCAGAAAATCCTCCTTCACCACCGGTTTTACCAGATAGCGGCAGGCGTTCACCAGATACCCATCCACCGCGTAGTCCACGGTGGCCGTCACAATCAGGATAGGAAGACTCTCATCCACTTTCCGAATCTCCTTGGCCACCTCAATGCCGTTCATCTCCTCCATCCGCATATCCAGTATCAGCAGGTCGTAAAATTGCTTCTTCTGATAGTCCTTAATCAAATCAATTCCGTTGCTATAAGTATAAATCTGAATGGGATAATGCTCCTGCAATTCAGACAAATATTCCGTGAGCTGTAATGTATCCTCTGCGCTGTCGTCGCAGCATACGATTCTGAACAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP040506|3861366:3868349|3867638_3868349_-|WP_006778316.1|DBSCAN-SWA MFRIVCCDDSAEDTLQLTEYLSELQEHYPIQIYTYSNGIDLIKDYQKKQFYDLLILDMRMEEMNGIEVAKEIRKVDESLPILIVTATVDYAVDGYLVNACRYLVKPVVKEDFLNAVRNTLDHLIQQRNSFFTFPSKTGTTKLMTEHIYYLESNIRTIQVVAEEGRFSFTGTISALEEELRPHGFIRIHKSFLVNISRVRNIFKESVTMDNQDVIPMSKHKRKDVSHEFLSYMEAHL >NZ_CP040506|3861366:3868349|3863192_3863765_-|WP_006778312.1|DBSCAN-SWA MILLIDNYDSFSYNLVQLFGSLTPDIRVIRNDEMTVDEIKELHPSHVILSPGPGYPKNAGVCEDVIREMKGIVPILGVCLGHQAICEVFGAAIVHAKKLMHGKQSIISLNPGCPLFAGLPDQIPAARYHSLIASRETLPDELLIAAQDDMGEIMAVQHRKFPIYGLQFHPESIMTPDGMVILKNFLEVIL >NZ_CP040506|3861366:3868349|3865736_3866285_+|WP_006778314.1|DBSCAN-SWA MKATGIVRRIDDLGRVVIPKEIRRTLRLREGTPLEIFTDREGEIILKKYSPMVELTAFSGQYAEAMAQSTGMIVCITDRDQVIAVSGGSKKDMIQKTISKQLEHIIGERNVVMASKDDKAFISLTSDEMEGITAQVIAPIICEGDAIGSVALLSRDAKAKFGDMEMKLVLTAAAFLGRQMEA >NZ_CP040506|3861366:3868349|3866325_3867642_-|WP_006778315.1|DBSCAN-SWA MMPSVYHLINSLLLSMAETWIFFWLCQTFLHPKALPRYLIRGAFLVYFLFQMVTYVQDFPLFSAWFFYCLFSLLIACLFYQDSNQIKVLVACLFVFLNYSCKLIASVLIMGLVHQKIPALPSNFVQGAASQMLACILFFLFTLLFVQFRKMHDSNKLTLYAIISYIYPFGILLISIHLFSVRDNTFLLYSDIAGILMCTTFFLFYLVDKTFIINEVRQKRIMADQLLSMQTDYYKQVEEAQTEITALRHDLKRHLHSLVSLLQMEQYGQALEYINGIYDSTNHLTVPVSSGNRMVNIMLSRTKLQAEEAHIAFTTNIMIPPTLPIENVDMCIILGNLLDNAVEACERITDDSIRKFINVEILFRKAFLVISITNSFNGEYHMSGARYESTKTGETFCGIGLSNVSTVVEKYEGDMKISHEEHTFTVSIMLALEKEGIA >NZ_CP040506|3861366:3868349|3862182_3863196_-|WP_006778311.1|DBSCAN-SWA MIQQAIHDVTERKNLDFDTAKQVMGEIMNGEATQAQMASFLTGLRMKGESIEEITACAAVMREKALKLTPSFDVIDIVGTGGDEVGTFNISTTSAFVVAAGGVPVAKHGNRSVSSKSGAADVLEQLGVNLALTAEQSEKLLEETGMCFMFAQAYHSSMKYAAPVRKELAIRTIFNILGPLSNPAGATMQLLGVYDKSLVEPLAKVLSNLGVTRGLVVCGADGLDEATVTGPTHVCEIRFGKLKAYDITPEEVGLSRYELSDIIGGTPEENAQITRDILSGELKGAKRDIVILNSAMSLYLGIDDCTIKDCVRRAAELIDSGMAYAKLNEFVNATKTC >NZ_CP040506|3861366:3868349|3863764_3865237_-|WP_080568843.1|DBSCAN-SWA MITPDCEEITKLAKYYKRIPLCREVYADVVTPITLLRRIAEKSTRFYLLESIEGGETWGRYSFLGFNPIMRLTCKNGTVDVEDSSGEEHTRRSVKTSKPLDVLRDILKDYKAPVIEGLPPFTGGFVGYFSYSMIQYAEPTLKLKQDDGSDYDLMLFDKVIAYDHLKQKISIVVNMKTDNVMEQYGRACGELESIARLIQSNAPLAKSVTTDKPEFTCNVSKEDYCRMVERTKEYIVDGDIFQAVISRQFSSPYEGSLLNAYRVLRTTNPSPYMVYMKLDDDEIMSTSPETLVRLKQGRLTTFPVAGSRPRGATPEEDTRLEAELLADEKELAEHNMLVDLGRNDLGRISKFSTVEVTGYKMIHKYSRIMHICSQVEGDIRPDCDGCSAIEAVLPAGTLSGAPKIRACEIIDELEQVPRGIYGGALGYLDFTGNMDTCIAIRMAVKRNHKVYVQAGGGIVADSVPELEYEESANKAGAVINAILKAREVND >NZ_CP040506|3861366:3868349|3861366_3862155_-|WP_006778310.1|DBSCAN-SWA MILDAIADTARLRVLEAKKRCPLEEMKEKALALPKREFPFEQAINRPDMAFICEVKKASPSKGVIAPNFPYTEIARDYEMAGASAISVLTEPDYFMGCNQYLTEIKNTVQIPLLRKDFTVDEYQIYEAKVIGADAVLLICSLLDTETLSSYINVCDRLGLSALVEAHTKEEVLSAREAGARIIGINNRNLKTFEVDFNNSIQLRSLVPPEILFVAESGIRTAADIHVLYEAGVNAVLIGETLMRSVNKKLELDKLRSGCYLM |
7 | Acinetobacter_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
4329922 : 4334207
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP040506|4329922:4334207|DBSCAN-SWA CTTACAAATAATTATATACATTTTTCATATAAGCCTCATGAAATCGTTCAAGTTGCATTTCTCTTGTATTCTGGTGAGCAGTAATAAGTGCGTTACGACATAATTTTGACCGTAGTTCTTCATTATCATATAAACTTTTTATTGCGTCAGCTAATTCAGCTACATTTCCAGGAGTAAATAAGAGTCCATTTAATCCTGGATTAATCACATCTGGTATTCCGCCAACTCTAGAACCGATTACAGTTATGCCATTACTCATAGCCTCTAAAAGAACCATTCCCAGCCCTTCATTGTACGAGGGCAAAATCAAAATATCGGATGTACGCATTTCGCTAAACAATTCTTGTCCCCATTTCTTTTGACCAAGCAATATAATCTTATCCTCAAGATTATATTCATTTATTTTTCGCCTAATATTTTCTTGTTCGACTCCTTGTCCGATAATTTTTAGCTTGATTCTACTTGTAGAGAGCATATTGACAGCATCTATTAAATCATTTATTCCTTTTTCGGGAGATAACCTTCCGACAAAAATCAATTGCAATTCGGGCTTGGTTCGCGGCTTTATTTCTGTCACAATCTGTCTGTCGTATATAAGTCCGTCATGAAAGATAACTGAAGGAACATAAGGGTTCCGGTATTTTTTCTCTAATTCGCTCGAAACCCAAGTTTGGAGTTTTGATTTTGCAAGGTAGTTAACTTGCAGCCTTTTAACAAAACAAACAATGACTTTCCAAATAGCTTTGGACTTTTTTATATCGGGAATGCCATCTGGATCTCCAATTACCCTGCTGATAAAAATCAAATCACGTTTTCTAAATAAGAGCCACATAATCCATTGTGAAACAGAAATAGGAGAACATATCACCAAAGCAGGTTTGCAAAATAGTTTATTCAATTTTTTTATTCTTTCATACATTGTAATTATGTAGCCTTTTAATCCGGATGGTTGATTAAAAATTCCACAAAAAGTCACTTGCTTGCTAAGCGTTGAAGTATCATACATGGAGTATTTCGGAATATCTGTTTCTGAAATATCATATATACGGCTCCATATAATGACTTCATCAATTCCATCAAAGCATTCATAAAAATCTAATTGGGAACTTCCACTTACATAGTAGTTGTTGCCATATCGGTATACGATTCCATCGCCTATAACAATTAGACGATTATGCTTTTTGTTCATCTTAGTTTTCCTCAAAATGCAATAGCTTTATTGCAGACTGGTAATTTTAAAAATAGTTCTAAATCTTCTGGTGTTCCTAATCCATACATTCCATCCATAACAGAACCAATATTATAAGTTGTTATATGATAACCTTTTTCATTCAAAAGAATATTATAAGCAGGCGCCACATAAAATTCATTATTTACTCTAAAATTTTCTTCAATCATTTTGTCGGCAGCTTTTACAAAGTCATGTCCTTTTTTAAAATTATAGATTCCAACCGTGGCCTCGTCAGAAATGACTTCTTTTTCAACGACTTTATTTATATGACCTATTTCATCATACCCCACATATGACCATTTAGAATCCCCTGCTTTCATAGTCATAATTAAGCCGTCTGCGTTATCTGTGAGCATTTTGTTTAAATAATCATCAATATTAATATCAATCCATTGATCACAATTTGCTATCATCAAAGAATCATCGTTATCAATGTATTTTCTTGATAATAATACGGTACATGCCGGGCCTTCTGTCACTTCATTTACCGGAATAACAATGCTTCCTGGAGCCATTTCCAACAGATGACTTTTCAAATCATAGTTGTCTAAGTGCTCCTGCAAAACTAAAAAGATGAATCTATGTTCTTGTGTTGGCTTAATGTTATGTATAACAATATCTATCATAGATTTTCCGTTTATCTCTATAAGTGGTTTTGGATTTTTATAGCCAACTGATGAAAAACGTGAGCCTCTTCCTGCCATTGGTAATACAATATTTATCATATCTTAAACCTCCATATATTTATCGTCGTTTGCTCCTGGTACTTTTACCACAACATTTATAGTATCCGTAATTGCGAGAAAATCAGTTTTATCAAATGGTTCCATTACAATCATATCATTTTCTTTATAGCGAATGCCATTCATCTCAACTTCACCGGAAATAATCACTGTGATTTCCGTTGCAATTTTATGGTAGTGAATTTCTTCATAATCACCGGCATTATATTTCTTAACGGCAACCTCCACATCATTTGTATTAAACAGTGATGGTTCAAAATTACCCACAAACCATCCTCGAATCATATCATCTAAATGTGCTGTCTTCATATTACTGTATTCCTTTCTGGAGCAATTGCTTTTTGATAAGATTCTATCCCTTTAGGAGTTGCCAAAGAAAAGTAGCATTTACTCTCTATCTCAAAAATTCGCAATTTTTTTTGCTTTAATATCATTTCATTAAACACTGGACAAATGTAAAACTTTCCATCTGTTTGCGCATTTTTTTTAATTAATTCCTTTGATGCACTCACAAAATCATCGCCTCTTTTATAATAATAAATACCTGCGGTAGCCATACGGCTAATTGGCCGTTTTTCGGCAGCTTCAATGATGATTCCTGTCTCATCACATTTGACATAAGACCATCGAGGATGTACTGATTGAAAAGTTATGGCACCTCCATCTGCATCACTATTTTTAAAAGAAGCGATAATTGGTTCCAACGGCCTTTGAATAATAATATCTCCATTTACAATAAGAAGCGGCTCATCATTATCAATATATTCTACCGCCAGCAAAGCAGTACATACAGCTCCCGCAGTGGCATTAGGTACAGAAATAATCGTTGCTTTTGGATAAAGTAGTTTCGCAACCATAGAAGTATGATATCGGTCAATTTCTTCTTTTTTTATAGCAAATATCAGTTTATTATTCTTTAAAATATTGGAATCCAAACTTTCAAGGACATTTTGAAGCAATGGTTTTCCATTAATTTCAATTAGATTTTTGGGATATTCATAACCAGATTCCCGAAAAGATGTGTCAGAACCCGAAAATAGTAATAATATGTTCATCATAAATCCCCCCTTTCAATAGTTTTAATCATAGTCATTATATTATCAAAAGTTACATCTTGGACAGTCCCTACTGTCAAAACATAAGCACCAGCGGCTTTTGCGGCATTAATACCATTTATATTATCTTCTACTACGAGACATTGCTCTGGCTTAAGTTTTAATGTATTTATGGTTTTTGTGTAAATTTCAGGATTTGGCTTTCCTTCCGTTACATCTTCATTGCTCATAATAATGTCCAGATATTGTGACAGAGCGCTTTTTTTCATCATAACTTCAATCGAATTTTTTATGGAGTTTGAAGCAACTCCTATTTTATAGCCGTTTGCTTTTAGCTGTGATAATGCATATTCATGTTGAAATACAGGATTGCATTTGGCATAGACAATCTCCATTGTGTATAATTGCTTCATTTCGTTTATAAATGAATGCAACTGTACTGGTAAACCCTGCTCCATAGAAAGCATTTCTAATTTTTTTCTTGTGGGTAACCCATCATATGTTACCAAGTGCTCATATCGAGAAATTTGATATCCAAATAATGCTAACGCTTTATTCAGTGCTTCGTAATGCCATTCCTTTGCATCAATCAAGACGCCATCCATATCAAAAATAACAGCTTCAATTTTTTTCATGATAAAAATACTCCTTTGCTTCTTCAGGATAATCTGTACAAAATATTAGTCTATTTGAATCAATACACGAAAACTGCTTATATTCTGCCCATTTTTTTGTGTGATGTCTTTGATGAAGTTCAGGGGACACGATGCAGACCGCTTTTCCGGCATTTAAATGTCGAGATATTTTTTCTTCCGTTATCCAATCATCAGATTCAAAACCATCTAACCATACACCAGCAGCTTTTTGATAAAAAGCTGGAAATGGTTCATACTCACTTTCTCTCGTAAAGAATTTGAGTTCTTTTTCTATGTATCCCAATGTATCTGGTATCGCCATATCAAAACAAAAATAATTATTGATTTGATATTTGTCCAATAATTGTTTCAGTATGGTTTGCAATCCATCTGCTTTAATATTAATAGCCAGAGTATAGGGGGCTGCTTGGTATAACTCAAGAAATTTTTCGACAGCTACAGCCTTTTCTGTTGGCATATCATGGGAAATGACTAATTTTCTATTAAAATCTCTAAAATCCGTTTCTGTTCCCATATTCGTTAACGCAGCTCTTTCCAACGCCTCCCATTGATTGCGTTCTTCGAAAGTTTTCCACAAACCACGATGTGCAATAATCTCCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP040506|4329922:4334207|4329922_4331110_-|WP_006778723.1|DBSCAN-SWA MNKKHNRLIVIGDGIVYRYGNNYYVSGSSQLDFYECFDGIDEVIIWSRIYDISETDIPKYSMYDTSTLSKQVTFCGIFNQPSGLKGYIITMYERIKKLNKLFCKPALVICSPISVSQWIMWLLFRKRDLIFISRVIGDPDGIPDIKKSKAIWKVIVCFVKRLQVNYLAKSKLQTWVSSELEKKYRNPYVPSVIFHDGLIYDRQIVTEIKPRTKPELQLIFVGRLSPEKGINDLIDAVNMLSTSRIKLKIIGQGVEQENIRRKINEYNLEDKIILLGQKKWGQELFSEMRTSDILILPSYNEGLGMVLLEAMSNGITVIGSRVGGIPDVINPGLNGLLFTPGNVAELADAIKSLYDNEELRSKLCRNALITAHQNTREMQLERFHEAYMKNVYNYL >NZ_CP040506|4329922:4334207|4332949_4333585_-|WP_006778727.1|DBSCAN-SWA MKKIEAVIFDMDGVLIDAKEWHYEALNKALALFGYQISRYEHLVTYDGLPTRKKLEMLSMEQGLPVQLHSFINEMKQLYTMEIVYAKCNPVFQHEYALSQLKANGYKIGVASNSIKNSIEVMMKKSALSQYLDIIMSNEDVTEGKPNPEIYTKTINTLKLKPEQCLVVEDNINGINAAKAAGAYVLTVGTVQDVTFDNIMTMIKTIERGDL >NZ_CP040506|4329922:4334207|4331880_4332204_-|WP_006778725.1|DBSCAN-SWA MKTAHLDDMIRGWFVGNFEPSLFNTNDVEVAVKKYNAGDYEEIHYHKIATEITVIISGEVEMNGIRYKENDMIVMEPFDKTDFLAITDTINVVVKVPGANDDKYMEV >NZ_CP040506|4329922:4334207|4332200_4332950_-|WP_138670281.1|DBSCAN-SWA MNILLLFSGSDTSFRESGYEYPKNLIEINGKPLLQNVLESLDSNILKNNKLIFAIKKEEIDRYHTSMVAKLLYPKATIISVPNATAGAVCTALLAVEYIDNDEPLLIVNGDIIIQRPLEPIIASFKNSDADGGAITFQSVHPRWSYVKCDETGIIIEAAEKRPISRMATAGIYYYKRGDDFVSASKELIKKNAQTDGKFYICPVFNEMILKQKKLRIFEIESKCYFSLATPKGIESYQKAIAPERNTVI >NZ_CP040506|4329922:4334207|4331121_4331877_-|WP_006778724.1|DBSCAN-SWA MINIVLPMAGRGSRFSSVGYKNPKPLIEINGKSMIDIVIHNIKPTQEHRFIFLVLQEHLDNYDLKSHLLEMAPGSIVIPVNEVTEGPACTVLLSRKYIDNDDSLMIANCDQWIDINIDDYLNKMLTDNADGLIMTMKAGDSKWSYVGYDEIGHINKVVEKEVISDEATVGIYNFKKGHDFVKAADKMIEENFRVNNEFYVAPAYNILLNEKGYHITTYNIGSVMDGMYGLGTPEDLELFLKLPVCNKAIAF >NZ_CP040506|4329922:4334207|4333571_4334207_-|WP_006778728.1|DBSCAN-SWA MEIIAHRGLWKTFEERNQWEALERAALTNMGTETDFRDFNRKLVISHDMPTEKAVAVEKFLELYQAAPYTLAINIKADGLQTILKQLLDKYQINNYFCFDMAIPDTLGYIEKELKFFTRESEYEPFPAFYQKAAGVWLDGFESDDWITEEKISRHLNAGKAVCIVSPELHQRHHTKKWAEYKQFSCIDSNRLIFCTDYPEEAKEYFYHEKN |
6 | Synechococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|