Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP029123 | Escherichia coli strain AR434 plasmid unnamed1, complete sequence | 0 crisprs | DEDDh | 0 | 0 | 1 | 0 |
NZ_CP029122 | Escherichia coli strain AR434 chromosome, complete genome | 11 crisprs | RT,csa3,PD-DExK,cas5,cas6e,cas1,cas2,cas3,DEDDh,c2c9_V-U4,DinG | 0 | 24 | 9 | 0 |

CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_1 | 892263-892402 | Orphan |
NA
Consensus repeat of NZ_CP029122_1
|
1 spacer
spacers of NZ_CP029122_1
>1.1|892312|42|NZ_CP029122|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around NZ_CP029122_1
The CRISPR arrays of NZ_CP029122_1 >merge|NZ_CP029122|1|892263-892402|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >NZ_CP029122|1|1|892263-892402|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>NZ_CP029122.1|WP_001375265.1|891170_892211_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >NZ_CP029122.1|WP_001295551.1|890462_891098_+|NAD(P)H-binding-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >NZ_CP029122.1|WP_000037608.1|889816_890335_-|protein/nucleic-acid-deglycase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >NZ_CP029122.1|WP_000449030.1|889393_889837_+|YhbP-family-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >NZ_CP029122.1|WP_000189314.1|889040_889343_-|DNA-damage-response-exodeoxyribonuclease-YhbQ MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >NZ_CP029122.1|WP_000908554.1|888550_889054_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >NZ_CP029122.1|WP_001375267.1|888032_888557_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >NZ_CP029122.1|WP_000421305.1|886828_887824_-|U32-family-peptidase 
MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >NZ_CP029122.1|WP_001301318.1|885941_886820_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >NZ_CP029122.1|WP_000130392.1|884728_885736_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >NZ_CP029122.1|WP_000646033.1|892415_892991_-|divisome-associated-lipoprotein-YraP MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >NZ_CP029122.1|WP_001158034.1|893000_893591_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >NZ_CP029122.1|WP_000246837.1|893610_894006_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >NZ_CP029122.1|WP_000249160.1|893963_896000_-|penicillin-binding-protein-activator-LpoA 
MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >NZ_CP029122.1|WP_000809262.1|896064_896925_+|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >NZ_CP029122.1|WP_000816988.1|896967_898059_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >NZ_CP029122.1|WP_000044770.1|901392_902088_-|molecular-chaperone MSKRTFAVIITLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGTSRAPFIITPPLFRLEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDNVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRSDHSLNIYNPTEYYVVFAGLAVDKTDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ >NZ_CP029122.1|WP_001045434.1|902167_902752_-|type-1-fimbrial-protein 
MNKVTKTAIAGLLALFAGNAAATDGEIVFDGEILKSACEINDSDKKIEVALGHYNAEQFRSVGDRSPKIPFTIPLVNCPVTGWEHDNGNVEASFRLWLETRDNGTVPNFPNLAKVGSFAGTAATGVGIRIDDAESGNLMPLNAMGNDNTVYQIPADSAGIVNVDLIAYYVSTVEASEITPGEADAVVNVTLDYR >NZ_CP029122.1|WP_001323952.1|903152_903908_-|galactosamine-6-phosphate-isomerase MERGTASGGASLLKEFHPVQTLQQVENYTALSERASEYLLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLREDQLISFRSEEINETECERVTNLIARKGGLDLCVLGLGKNGHLGLNEPGESLQPACHISQLDARTQQHEMLKTAGRPVTRGITLGLKDILNAREVLLLVTGEGKQDATERFLTAKVSTAIPASFLWLHSNFICLINT >NZ_CP029122.1|WP_000534351.1|903908_904700_-|PTS-N-acetylgalactosamine-transporter-subunit-IID MGSEISKKDITRLGFRSSLLQASFNYERMQAGGFTWAMLPILKKIYKDDKPGLSAAMKDNLEFINTHPNLVGFLMGLLISMEEKGENRDTIKGLKVALFGPIAGIGDAIFWFTLLPIMAGICSSFASQGNLLGPILFFAVYLLIFFLRVGWTHVGYSVGVKAIDKVRENSQMIARSATILGITVIGGLIASYVHINVVTSFAIDSTHSVALQQDFFDKVFPNILPMAYTLLMYYFLRVKKAHPVLLIGVTFVLSIVCSAFGIL |
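
The neighbor-protein headers listed for NZ_CP029122_1 appear to follow the pattern `contig|protein accession|start_end_strand|product`. As a minimal sketch, assuming that layout and inclusive coordinates (the page states neither), the Python below parses one header and measures its distance to the array at 892263-892402. The names `Neighbor`, `parse_neighbor`, and `gap_to_array` are hypothetical and introduced only for illustration.

```python
from typing import NamedTuple


class Neighbor(NamedTuple):
    contig: str
    protein: str
    start: int
    end: int
    strand: str
    product: str


def parse_neighbor(header: str) -> Neighbor:
    """Split a header such as 'NZ_CP029122.1|WP_...|891170_892211_+|permease'."""
    # maxsplit=3 keeps the product text whole even if it contained a '|'.
    contig, protein, location, product = header.lstrip(">").split("|", 3)
    start, end, strand = location.split("_")
    return Neighbor(contig, protein, int(start), int(end), strand, product)


def gap_to_array(neighbor: Neighbor, array_start: int, array_end: int) -> int:
    """Bases strictly between the gene and the array (0 if they overlap).

    Assumes both coordinate pairs are inclusive.
    """
    if neighbor.end < array_start:
        return array_start - neighbor.end - 1
    if neighbor.start > array_end:
        return neighbor.start - array_end - 1
    return 0


# Array coordinates and headers copied from the NZ_CP029122_1 entry.
ARRAY_START, ARRAY_END = 892263, 892402
upstream = parse_neighbor(">NZ_CP029122.1|WP_001375265.1|891170_892211_+|permease")
downstream = parse_neighbor(
    ">NZ_CP029122.1|WP_000646033.1|892415_892991_-|divisome-associated-lipoprotein-YraP"
)
print(upstream.product, gap_to_array(upstream, ARRAY_START, ARRAY_END))
print(downstream.product, gap_to_array(downstream, ARRAY_START, ARRAY_END))
```

Under these assumptions the two flanking genes sit within a few dozen bases of the array, which is in line with the page listing them as neighbors.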

CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_2 | 937163-937280 | Orphan |
NA
Consensus repeat of NZ_CP029122_2
|
1 spacer
spacers of NZ_CP029122_2
>2.1|937203|38|NZ_CP029122|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around NZ_CP029122_2
The CRISPR arrays of NZ_CP029122_2 >merge|NZ_CP029122|2|937163-937280|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >NZ_CP029122|2|2|937163-937280|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>NZ_CP029122.1|WP_000460519.1|935831_937142_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >NZ_CP029122.1|WP_000401598.1|934472_935804_+|HAAAP-family-serine/threonine-permease MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >NZ_CP029122.1|WP_000622115.1|932833_934198_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >NZ_CP029122.1|WP_001375219.1|932372_932762_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >NZ_CP029122.1|WP_000861734.1|930064_932359_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase 
MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >NZ_CP029122.1|WP_001297162.1|928822_930031_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >NZ_CP029122.1|WP_000107720.1|927465_928797_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >NZ_CP029122.1|WP_000548347.1|926454_927444_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB 
MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >NZ_CP029122.1|WP_000104211.1|925417_926356_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >NZ_CP029122.1|WP_000145820.1|924884_925229_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >NZ_CP029122.1|WP_001295544.1|937353_937518_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >NZ_CP029122.1|WP_000633577.1|937540_938242_-|pirin-family-protein MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >NZ_CP029122.1|WP_001041010.1|938346_939243_+|DNA-binding-transcriptional-regulator-YhaJ MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >NZ_CP029122.1|WP_001198780.1|939293_939650_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN 
>NZ_CP029122.1|WP_000384145.1|939891_940257_-|DUF805-domain-containing-protein MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >NZ_CP029122.1|WP_000531204.1|940549_941536_-|glutathionyl-hydroquinone-reductase-YqjG MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >NZ_CP029122.1|WP_000603618.1|941605_942088_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >NZ_CP029122.1|WP_000096086.1|942183_942483_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >NZ_CP029122.1|WP_000785722.1|942472_942877_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >NZ_CP029122.1|WP_000031415.1|942879_943185_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |

CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_3 | 1310603-1311119 | Orphan |
I-E
Consensus repeat of NZ_CP029122_3
|
8 spacers
spacers of NZ_CP029122_3
>3.1|1310632|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >3.2|1310693|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT GCTGATGGTCTGGGAGTGTCCATCGGGCAACT >3.3|1310754|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >3.4|1310815|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >3.5|1310876|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >3.6|1310937|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >3.8|1311059|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around NZ_CP029122_3
The CRISPR arrays of NZ_CP029122_3 >merge|NZ_CP029122|3|1310603-1311119|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP029122|3|1|1310603-1311119|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP029122|3|3|1310603-1311119|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP029122|3|1|1310603-1311119|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG 
GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>NZ_CP029122.1|WP_001199979.1|1309591_1310263_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP029122.1|WP_001288227.1|1309312_1309453_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >NZ_CP029122.1|WP_001679366.1|1308426_1309299_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >NZ_CP029122.1|WP_000036723.1|1307068_1308367_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP029122.1|WP_000210878.1|1305343_1306981_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK 
>NZ_CP029122.1|WP_001071648.1|1304324_1305116_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >NZ_CP029122.1|WP_000254738.1|1303918_1304254_+|endoribonuclease-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >NZ_CP029122.1|WP_000581937.1|1303670_1303919_+|type-II-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >NZ_CP029122.1|WP_000226815.1|1301358_1303593_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >NZ_CP029122.1|WP_000046812.1|1300009_1301311_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD 
MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >NZ_CP029122.1|WP_000039683.1|1311756_1313235_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >NZ_CP029122.1|WP_001164578.1|1313261_1314539_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >NZ_CP029122.1|WP_000021330.1|1314857_1315643_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >NZ_CP029122.1|WP_000059312.1|1315712_1317167_+|FAD-binding-oxidoreductase 
MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >NZ_CP029122.1|WP_000147666.1|1317188_1318598_+|MFS-transporter MTGRCLFGFSGEKPFLLPDNEGVKMNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >NZ_CP029122.1|WP_001324445.1|1318575_1319355_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >NZ_CP029122.1|WP_001324446.1|1319351_1320212_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >NZ_CP029122.1|WP_001130266.1|1320359_1320935_-|glycerol-3-phosphate-responsive-antiterminator 
MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >NZ_CP029122.1|WP_000109532.1|1320951_1321212_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >NZ_CP029122.1|WP_001295150.1|1321202_1322474_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
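
The spacer headers for NZ_CP029122_3 appear to encode `spacer id|start|length|contig|detection tools`. As a rough sketch, assuming that reading, inclusive coordinates, and a fixed repeat length, the Python below checks that the listed spacer positions and the array span 1310603-1311119 agree with nine 29-nt repeats alternating with eight 32-nt spacers. The helper names `parse_spacer_header` and `expected_starts` are hypothetical, not part of any tool named on the page.

```python
def parse_spacer_header(header: str) -> dict:
    """Split a header such as '>3.1|1310632|32|NZ_CP029122|PILER-CR,CRT'."""
    spacer_id, start, length, contig, tools = header.lstrip(">").split("|")
    return {
        "id": spacer_id,
        "start": int(start),
        "length": int(length),
        "contig": contig,
        "tools": tools.split(","),
    }


def expected_starts(array_start: int, repeat_len: int,
                    spacer_len: int, n_spacers: int) -> list:
    """Spacer start positions if repeats and spacers strictly alternate."""
    unit = repeat_len + spacer_len
    return [array_start + repeat_len + i * unit for i in range(n_spacers)]


# Values copied from the NZ_CP029122_3 entry.
REPEAT = "GAGTTCCCCGCGCCAGCGGGGATAAACCG"
ARRAY_START, ARRAY_END = 1310603, 1311119
N_SPACERS = 8

first = parse_spacer_header(
    ">3.1|1310632|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT"
)
starts = expected_starts(ARRAY_START, len(REPEAT), first["length"], N_SPACERS)

# An array with n spacers holds n + 1 repeats.
total_len = (N_SPACERS + 1) * len(REPEAT) + N_SPACERS * first["length"]

print(starts[0], starts[-1])                       # first and last spacer start
print(total_len, ARRAY_END - ARRAY_START + 1)      # computed vs. listed span
```

With these inputs the computed starts match the listed positions of spacers 3.1 and 3.8, and the computed length equals the listed span, so the three tools' shared coordinates are internally consistent for this array.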

CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_4 | 1333504-1334082 | Unclear |
I-E
Consensus repeat of NZ_CP029122_4
|
9 spacers
spacers of NZ_CP029122_4
>4.1|1333534|31|NZ_CP029122|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >4.2|1333595|31|NZ_CP029122|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >4.3|1333656|31|NZ_CP029122|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >4.4|1333717|31|NZ_CP029122|CRISPRCasFinder CCCGTCACCGACGCGCAGTGGCGCTACCGTG >4.5|1333778|31|NZ_CP029122|CRISPRCasFinder GGATCTAACGCGCTGTAAAAATTCCGTGCTT >4.6|1333839|31|NZ_CP029122|CRISPRCasFinder TGCGGATTACCGGCAAAACATGGGAGCAAAC >4.7|1333900|31|NZ_CP029122|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >4.8|1333961|31|NZ_CP029122|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >4.9|1334022|31|NZ_CP029122|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >4.10|1333533|33|NZ_CP029122|PILER-CR GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT >4.11|1333594|33|NZ_CP029122|PILER-CR GACGGACAAAATATATATTGATTTGCGAATTAT >4.12|1333655|33|NZ_CP029122|PILER-CR GGTAAAGAAACTGCCGACAAATCCCTGTTCGTT >4.13|1333716|33|NZ_CP029122|PILER-CR GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA >4.14|1333777|33|NZ_CP029122|PILER-CR GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT >4.15|1333838|33|NZ_CP029122|PILER-CR ATGCGGATTACCGGCAAAACATGGGAGCAAACC >4.16|1333899|33|NZ_CP029122|PILER-CR GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >4.17|1333960|33|NZ_CP029122|PILER-CR GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >4.18|1334021|33|NZ_CP029122|PILER-CR GGGATGACCTGTCGCTAAAACTCGCCGCGTACA >4.19|1333534|32|NZ_CP029122|CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >4.20|1333595|32|NZ_CP029122|CRT ACGGACAAAATATATATTGATTTGCGAATTAT >4.21|1333656|32|NZ_CP029122|CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >4.22|1333717|32|NZ_CP029122|CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >4.23|1333778|32|NZ_CP029122|CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >4.24|1333839|32|NZ_CP029122|CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >4.25|1333900|32|NZ_CP029122|CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >4.26|1333961|32|NZ_CP029122|CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >4.27|1334022|32|NZ_CP029122|CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_CP029122_4
The CRISPR arrays of NZ_CP029122_4 >merge|NZ_CP029122|4|1333504-1334082|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP029122|4|4|1333504-1334082|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTG AGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTT TGTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAAC CGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP029122|4|2|1333505-1334081|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACC GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACC GACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACC GGTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACC GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACC ATGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACC GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACC GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACC 
>NZ_CP029122|4|2|1333505-1334082|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>NZ_CP029122.1|WP_000063176.1|1333114_1333408_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >NZ_CP029122.1|WP_000144861.1|1332194_1333118_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >NZ_CP029122.1|WP_000281446.1|1331547_1332198_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >NZ_CP029122.1|WP_000085051.1|1330819_1331566_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >NZ_CP029122.1|WP_000956458.1|1327816_1327969_+|type-I-toxin-antitoxin-system-Hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >NZ_CP029122.1|WP_000039842.1|1326817_1327552_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >NZ_CP029122.1|WP_001290706.1|1325031_1326744_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit 
MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >NZ_CP029122.1|WP_000211954.1|1323232_1325032_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >NZ_CP029122.1|WP_000987944.1|1322551_1322917_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >NZ_CP029122.1|WP_001295150.1|1321202_1322474_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL 
>NZ_CP029122.1|WP_000490426.1|1334163_1335201_-|alkaline-phosphatase-isozyme-conversion-aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >NZ_CP029122.1|WP_000372108.1|1335452_1336361_+|sulfate-adenylyltransferase-subunit-CysD MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >NZ_CP029122.1|WP_001090386.1|1336362_1337790_+|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >NZ_CP029122.1|WP_001173673.1|1337789_1338395_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >NZ_CP029122.1|WP_001246104.1|1338444_1338768_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >NZ_CP029122.1|WP_000517476.1|1338961_1339273_+|cell-division-protein-FtsB 
MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >NZ_CP029122.1|WP_000246138.1|1339291_1340002_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >NZ_CP029122.1|WP_001374730.1|1340001_1340481_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >NZ_CP029122.1|WP_000568943.1|1340477_1341527_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >NZ_CP029122.1|WP_001374723.1|1341507_1342269_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
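Each entry in the spacer cell of the NZ_CP029122_4 record above appears to follow the pattern `>ID|start|length|contig|tool SEQUENCE`. The field meanings are inferred from the data itself (the third field matches the sequence length), not from a documented format. The sketch below, in Python, shows one way such a cell could be split into records; `Spacer`, `parse_spacers`, and `EXAMPLE_CELL` are hypothetical names introduced only for illustration, and the sample uses the first spacer as reported by each of the three tools.

```python
from typing import List, NamedTuple


class Spacer(NamedTuple):
    """One spacer record; field meanings are inferred from the table, not documented."""
    spacer_id: str   # e.g. "4.1" (array number, then running index)
    start: int       # assumed to be the start coordinate on the contig
    length: int      # assumed to be the spacer length in nucleotides
    contig: str      # e.g. "NZ_CP029122"
    tool: str        # predictor that reported the spacer
    sequence: str


def parse_spacers(cell: str) -> List[Spacer]:
    """Split a flattened spacer cell into Spacer records."""
    spacers = []
    for record in cell.split(">"):
        # Drop surrounding whitespace and the trailing table-cell pipe, if any.
        record = record.strip().rstrip("|").strip()
        if not record:
            continue
        header, sequence = record.split(None, 1)
        spacer_id, start, length, contig, tool = header.split("|")
        spacers.append(
            Spacer(spacer_id, int(start), int(length), contig, tool, sequence.strip())
        )
    return spacers


# First spacer of array 4 as reported by each tool, copied from the record above.
EXAMPLE_CELL = (
    ">4.1|1333534|31|NZ_CP029122|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT "
    ">4.10|1333533|33|NZ_CP029122|PILER-CR GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT "
    ">4.19|1333534|32|NZ_CP029122|CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT |"
)

spacers = parse_spacers(EXAMPLE_CELL)
for s in spacers:
    # Flag any record whose declared length disagrees with its sequence.
    status = "ok" if len(s.sequence) == s.length else "length mismatch"
    print(s.spacer_id, s.tool, s.start, s.length, status)
```

Grouping the parsed records by `tool` makes it easy to see that the three predictors report the same spacer with slightly different boundaries (31, 33, and 32 nt here), which is presumably why the record lists 27 entries for an array with 9 spacers.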
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_5 | 1836729-1836846 | Orphan |
NA
Consensus repeat of NZ_CP029122_5
|
1 spacer
spacers of NZ_CP029122_5
>5.1|1836760|56|NZ_CP029122|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around NZ_CP029122_5
The CRISPR arrays of NZ_CP029122_5 >merge|NZ_CP029122|5|1836729-1836846|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >NZ_CP029122|5|5|1836729-1836846|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>NZ_CP029122.1|WP_000332037.1|1835501_1836632_-|ribonucleotide-diphosphate-reductase-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >NZ_CP029122.1|WP_000135040.1|1835247_1835502_-|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >NZ_CP029122.1|WP_000301049.1|1834543_1835194_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >NZ_CP029122.1|WP_072163405.1|1831809_1832124_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >NZ_CP029122.1|WP_000768974.1|1830514_1831591_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >NZ_CP029122.1|WP_000948732.1|1829151_1830510_+|glycerol-3-phosphate-transporter 
MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >NZ_CP029122.1|WP_000857251.1|1827250_1828879_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >NZ_CP029122.1|WP_001209908.1|1826001_1827261_-|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >NZ_CP029122.1|WP_001000370.1|1824814_1826005_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C 
MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTAKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >NZ_CP029122.1|WP_001374259.1|1823722_1824622_-|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVKGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAKRSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >NZ_CP029122.1|WP_001075164.1|1836865_1839151_-|ribonucleoside-diphosphate-reductase-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >NZ_CP029122.1|WP_001220074.1|1839846_1843599_+|AIDA-I-family-autotransporter-adhesin-YfaL/EhaC 
MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >NZ_CP029122.1|WP_000990756.1|1843726_1844449_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >NZ_CP029122.1|WP_001281225.1|1844595_1847223_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A 
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >NZ_CP029122.1|WP_000012305.1|1847371_1849060_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >NZ_CP029122.1|WP_001295211.1|1849056_1849680_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >NZ_CP029122.1|WP_122633159.1|1849823_1854218_+|alpha-2-macroglobulin-family-protein 
MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >NZ_CP029122.1|WP_001104488.1|1854218_1855868_+|DUF2300-domain-containing-protein 
MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >NZ_CP029122.1|WP_001567753.1|1855872_1856649_+|YfaP-family-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >NZ_CP029122.1|WP_000786548.1|1856722_1857907_-|acetyl-CoA-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_6 | 2457166-2457289 | Orphan |
NA
Consensus repeat of NZ_CP029122_6
|
1 spacer
spacers of NZ_CP029122_6
>6.1|2457209|38|NZ_CP029122|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around NZ_CP029122_6
The CRISPR arrays of NZ_CP029122_6 >merge|NZ_CP029122|6|2457166-2457289|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >NZ_CP029122|6|6|2457166-2457289|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>NZ_CP029122.1|WP_000212657.1|2456725_2457031_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >NZ_CP029122.1|WP_000716929.1|2454995_2456600_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >NZ_CP029122.1|WP_000587555.1|2454171_2454984_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >NZ_CP029122.1|WP_001069997.1|2453382_2454168_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >NZ_CP029122.1|WP_001310861.1|2452717_2453386_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >NZ_CP029122.1|WP_001297805.1|2452015_2452654_+|YdhW-family-putative-oxidoreductase-system-protein 
MNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >NZ_CP029122.1|WP_001678907.1|2449900_2452003_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >NZ_CP029122.1|WP_001070230.1|2449253_2449880_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >NZ_CP029122.1|WP_000528342.1|2448588_2448798_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >NZ_CP029122.1|WP_001295403.1|2446620_2448033_-|pyruvate-kinase-PykF MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL 
>NZ_CP029122.1|WP_000534291.1|2457603_2458860_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >NZ_CP029122.1|WP_001174942.1|2458900_2460274_-|multidrug-efflux-MATE-transporter-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >NZ_CP029122.1|WP_001373655.1|2460488_2461130_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >NZ_CP029122.1|WP_000098911.1|2461169_2462318_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >NZ_CP029122.1|WP_001182363.1|2462608_2463820_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter 
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >NZ_CP029122.1|WP_000269501.1|2463932_2464865_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >NZ_CP029122.1|WP_000190982.1|2464861_2465887_-|HTH-type-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >NZ_CP029122.1|WP_000102278.1|2466185_2466275_+|stress-response-protein-YnhF MSTDLKFSLVTTIIVLGLIVAVGLTAALH >NZ_CP029122.1|WP_000701040.1|2466440_2467610_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >NZ_CP029122.1|WP_000007283.1|2467755_2468337_-|superoxide-dismutase-[Fe] 
MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_7 | 3110425-3110516 | Orphan |
NA
Consensus repeat of NZ_CP029122_7
|
1 spacers
spacers of NZ_CP029122_7
>7.1|3110451|40|NZ_CP029122|CRISPRCasFinder GCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGT |
CRISPR arrays and Neighbor proteins around NZ_CP029122_7
The CRISPR arrays of NZ_CP029122_7 >merge|NZ_CP029122|7|3110425-3110516|CRISPRCasFinder CCACCTTTTTTACCTGCTTCAGATGCGCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGTCCACCTTTTTTACCTGCTTCTGATGC >NZ_CP029122|7|7|3110425-3110516|CRISPRCasFinder CCACCTTTTTTACCTGCTTCAGATGC GCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGT CCACCTTTTTTACCTGCTTCTGATGC
>NZ_CP029122.1|WP_001347171.1|3108982_3110311_+|pyrimidine-utilization-transport-protein-G MAMFGFPHWQLKSTSTESGVVAPDERLPFAQTAIMGVQHAVAMFGATVLMPILMGLDPNLSILMSGVGTLLFFFITGGRVPSYLGSSAAFVGVVIAATGFNGQGINPNISIALGGIIACGLVYTVIGLVVMKIGTRWIERLMPPVVTGAVVMAIGLNLAPIAVKSVSASAFDSWMAVMTVLCIGLVAVFTRGMIQRLLILVGLIVACLLYGVMTNLLGLGKAVDFTLVSHAAWFGLPHFSTPAFNSQAMMLIAPVAVILVAENLGHLKAVAGMTGRNMDPYMGRAFVGDGLATMLSGSVGGSGVTTYAENIGVMAVTKVYSTLVFVAAAVIAMLLGFSPKFGALIHTIPAAVIGGASIVVFGLIAVAGARIWVQNRVDLSQNGNLIMVAVTLVLGAGDFALTLGGFTLGGIGTATFGAILLNALLSRKLVDVPPPEVVHQEP >NZ_CP029122.1|WP_001028095.1|3108467_3108962_+|pyrimidine-utilization-flavin-reductase-protein-F MNIVDQQTFRDAMSCMGAAVNIITTDGPAGRAGFTASAVCSVTDTPPTLLVCLNRGASVWPVFNENRTLCVNTLSAGQEPLSNLFGGKTPMEHRFAAARWQTGVTGCPQLEEALVSFDCRISQVVSVGTHDILFCAIEAIHRHATPYGLVWFDRSYHALMRPAC >NZ_CP029122.1|WP_001001184.1|3107866_3108457_+|malonic-semialdehyde-reductase MNEAVSPGALSTLFTDARTHNGWRETPVSDETLRELYALMKWGPTSANCSPARIVFIRTAEGKERLRPALSSGNLQKTLTAPVTAIVAWDSEFYERLPLLFPHGDARSWFTSSPQLAEETAFRNSSMQAAYLIVACRALGLDTGPMSGFDRQYVDDAFFAGSTLKSNLLINIGYGDNSKLYARLPRLSFEEACGLL >NZ_CP029122.1|WP_001323674.1|3107056_3107857_+|pyrimidine-utilization-protein-D MKLSLSPPPYADAPVVVLISGLGGSGSYWLPQLAVLEQEYQVVCYDQRGTGNNPDTLAEDYSIAQMAAELHQALVAAGIEHYAVVGHALGALVGMQLALDYPASVTVLVCVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYPADWMAARAPRLEAEDALALAHFQGKNNLLRRLNALKRADFSHHAVRIRCPVQIICASDDLLVPSACSSELHAALPDSQKMVMRYGGHACNVTDPETFNALLLNGLASLLHHREAAL >NZ_CP029122.1|WP_001126787.1|3106662_3107049_+|pyrimidine-utilization-protein-C MPKSVIIPAGSSAPLAPFVPGTLADGVVYVSGTLAFDQHNNVLFADDPKAQTRHVLETIRTVIETAGGTMADVTFNSIFITDWKNYAAINEIYAEFFPGDKPARFCIQCGLVKPDALVEIATIAHIAK >NZ_CP029122.1|WP_001345643.1|3105958_3106651_+|peroxyureidoacrylate/ureidoacrylate-amidohydrolase-RutB MTTLTARPEAITFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQNGWDEQYVEAGGPGSPNFHKSNALKTMRKQPQLQGKLLAKGSWDYQLVDELVPQPGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDATHQAGPEFVQKAALFNIETFFGWVSDVETFCDALSPTSFARIA 
>NZ_CP029122.1|WP_001345642.1|3104867_3105959_+|pyrimidine-utilization-protein-A MKIGVFVPIGNNGWLISTHAPQYMPTFELNKAIVQKAEHYHFDFALSMIKLRGFGGKTEFWDHNLESFTLMAGLAAVTSRIQIYATAATLTLPPAIVARMAATIDSISGGRFGVNLVTGWQKPEYEQMGIWPGDDYFSRRYDYLTEYVQVLRDLWGSGKSDFKGDFFTMNDCRVSPQPSVPMKVICAGQSDAGMAFSAQYADFNFCFGKGVNTPTAFAPTAARMKQAAEQTGRDVGSYVLFMVIADETDDAARAKWEHYKAGADEEALSWLTEQSQKDTRSGTDTNVRQMADPTSAVNINMGTLVGSYASVARMLDEVASVPGAEGVLLTFDDFLSGIETFGERIQPLMQCRAHLPALTQEVA >NZ_CP029122.1|WP_001295606.1|3103941_3104580_-|HTH-type-transcriptional-regulator-RutR MTQGAVKTTGKRSRTVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQILDIWLAPLKAFREDFAPLAAIKEYIRLKLEVSRDYPQASRLFCMEMLAGAPLLMDELTGDLKALIDEKSALIAGWVKSGKLAPIDPQHLIFMIWASTQHYADFAPQVEAVTGATLRDEVFFNQTVENVQRIIIEGIRPR >NZ_CP029122.1|WP_001299828.1|3099939_3103902_+|trifunctional-transcriptional-regulator/proline-dehydrogenase/L-glutamate-gamma-semialdehyde-dehydrogenase MGTTTMGVKLDDATRERIKSAATRIDRTPHWLIKQAIFSYLEQLENSDTLPELPALLSGAANESDEAPTPAEEPHQPFLDFAEQILPQSVSRAAITAAYRRPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAGMVQGLLQEFSLSSQEGVALMCLAEALLRIPDKATRDALIRDKISNGNWQSHIGRSPSLFVNAATWGLLFTGKLVSTHNEASLSRSLNRIIGKSGEPLIRKGVDMAMRLMGEQFVTGETIAEALANARKLEEKGFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYSRAQYDRVMEELYPRLKSLTLLARQYDIGINIDAEEADRLEISLDLLEKLCFEPELAGWNGIGFVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGLEGYPVYTRKVYTDVSYLACAKKLLAVPNLIYPQFATHNAHTLAAIYQLAGQNYYPGQYEFQCLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLENGANTSFVNRIADTSLPLDELVADPVTAVEKLAQQEGQTGLPHPKIPLPRDLYGHGRDNSAGLDLANEHRLASLSSALLNSALQKWQALPMLEQPVAAGEMSPVINPAEPKDIVGFVREATPREVEQALESAVNNAPIWFATPPVERAAILHRAAVLMESQMQQLIGILVREAGKTFSNAIAEVREAVDFLHYYAGQVRDDFANETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQGIAILLEAGVPPGVVQLLPGQGETVGAQLTGDDRVRGVMFTGSTEVATLLQRNIASRLDAQGRPIPLIAETGGMNAMIVDSSALTEQVVVDVLASAFDSAGQRCSALRVLCLQDEIADHTLKMLRGAMAECRMGNPGRLTTDIGPVIDSEAKANIERHIQTMRSKGRPVFQAVRENSEDAREWQSGTFVAPTLIELDDFAELQKEVFGPVLHVVRYNRNQLPELIEQINASGYGLTLGVHTRIDETIAQVTGSAHVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPK
AGGPLYLYRLLANRPESALAVTLARQDAEYPVDAQLKAALTQPLNALREWAANRPELQALCTQYGELAQAGTQRLLPGPTGERNTWTLLPRERVLCIADDEQDALTQLAAVLAVGSQVLWPDDALHRQLVKALPSAVSERIQLAKAENITAQPFDAVIFHGDSDQLRALCEAVAARDGAIVSVQGFARGESNILLERLYIERSLSVNTAAAGGNASLMTIG >NZ_CP029122.1|WP_001678465.1|3098009_3099518_-|sodium/proline-symporter-PutP MAISTPMLVTFCVYIFGMILIGFIAWRSTKNFDDYILGGRSLGPFVTALSAGASDMSGWLLMGLPGAVFLSGISESWIAIGLTLGAWINWKLVAGRLRVHTEYNNNALTLPDYFTGRFEDKSRILRIISALVILLFFTIYCASGIVAGARLFESTFGMSYETALWAGAAATILYTFIGGFLAVSWTDTVQASLMIFALILTPVIVIISVGGFGDSLEVIKQKSIENVDMLKGLNFVAIISLMGWGLGYFGQPHILARFMAADSHHSIVHARRISMTWMILCLAGAVAVGFFGIAYFNEHPAVAGAVNQNAERVFIELAQILFNPWIAGILLSAILAAVMSTLSCQLLVCSSAITEDLYKAFLRKHASQKELVWVGRVMVLVVALVAIALAANPENRVLGLVSYAWAGFGAAFGPVVLFSVMWSRMTRNGALAGMIIGALTVIVWKQFGWLGVYEIIPGFIFGSIGIVVFSLLGKAPSAAMQKRFAEADAHYHSAPPSRLQES >NZ_CP029122.1|WP_001151437.1|3110939_3111536_+|NAD(P)H:quinone-oxidoreductase MAKVLVLYYSMYGHIETMARAVAEGASKVDGAEVVVKRVPETMPPQLFEKAGGKTQTAPVATPQELADYDAIIFGTPTRFGNMSGQMRTFLDQTGGLWASGALYGKLASVFSSTGTGGGQEQTITSTWTTLAHHGMVIVPIGYAAQELFDVSQVRGGTPYGATTIAGGDGSRQPSQEELSIARYQGEYVAGLAVKLNG >NZ_CP029122.1|WP_001143120.1|3111556_3111784_+|hypothetical-protein MPTQEAKAHHVGEWASLRNTSPEIAEAIFEVAGYDEKMAEKIWEEGSDEVLVKAFAKTDKDSLFWGEQTIERKNV >NZ_CP029122.1|WP_001044313.1|3111821_3113063_-|bifunctional-glucose-1-phosphatase/inositol-phosphatase MNKTLIAATVAGIVLLASNAQAQTVPEGYQLQQVLMMSRHNLRAPLANNGSVLEQSTPNKWPEWDVPGGQLTTKGGVLEVYMGHYMREWLAQQGMVKSGECPPPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAFSEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQQCSLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYYEGFPMDQVAWGEIKSDQQWKVLSKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRTSAPKITVLVGHDSNIASLLTALDFKPYQLHDQNERTPIGGKIVFQRWHDSKANRDLMKIEYVYQSAEQLRNADALTLQAPAQRVTLELSGCPIDANGFCPMDKFDSVLNEAVK >NZ_CP029122.1|WP_000097602.1|3113354_3114614_-|YccE-family-protein 
MSSNIHGISCTANNYLKQAWNNIKNEHEKNQKYSITLFENTLVCFMRLYKEIRRQKAEDYIPCLECDSLEKEFEEMQNDNDLSLFLRTLRTNDTETYSGVSEGITYTIQYVRDIDIVRVSLPGRGSESITDFKGYYWYGFMEYIENINACDDVFSEYCLDDENMSIQPEWINTPGISDLDTGIDLSGISFIQSEINKTYGLKYAPVDGDGYCLLRAILVLKEHEYSWALGSHKTQKQVYEEFIKIVDKQTIEALVDTAFNDLREDVKTLFGVNLQSDNKIQGQGGFLSWSFLSFKKEFIDSCLNDKKCILHLPEFIFNDNKARLVLDTDPEQKVNEVKNFLTALSDSICSLFIVNSNVASISLGNESFSTDDDLEYGYLINTGNHYDVYLPPELFAQAYELNNKERNAQIDFLTRYAIY >NZ_CP029122.1|WP_000420629.1|3114873_3115794_+|curved-DNA-binding-protein MELKDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQMWQHRNDPQFNRQFHHSDGQSFNAEDFDDIFSSIFGQHARQSRQRPATRGHDIEIEVAVFLEETLTEHKRTISYNLPVYNAFGMIEQEIPKTLNVKIPAGVGNGQRIRLKGQGTPGENGGPNGDLWLVIHIAPHPLFDIVGHDLEIVVPVSPWEAALGAKVTVPTLKESILLTIPPGSQAGQRLRVKGKGLVSKKQTGDLYAVLKIVMPPKPDENTAALWQQLADAQSSFDPRKDWGKA >NZ_CP029122.1|WP_000024560.1|3115793_3116099_+|chaperone-modulator-CbpM MANVTVTFTITEFCLHTGISEEELNEIVGLGVVEPREIQETTWVFDDHAAIVVQRAVRLRHELALDWPGIAVALTLMDDIAHLKQENRLLRQRLSRFVAHP >NZ_CP029122.1|WP_000209869.1|3116191_3116791_-|molecular-chaperone-TorD MTTLTAQQIACVYAWLAQLFSRELDDEQLTQIASAQMAEWFSLLKSEPPLAAAVNELENCIATLTVRDDARLELAADFCGLFLMTDKQAALPYASAYKQDEQEIKRLLVEAGMETSGNFNEPADHLAIYLELLSHLHFSLGEGTVPARRIDSLRQKTLTALWQWLPEFVVRCRQYDSFGFYAALSQLLLVLVESDHQNR >NZ_CP029122.1|WP_001062101.1|3116787_3119334_-|trimethylamine-N-oxide-reductase-TorA 
MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTPRRATAAQAATDAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELDKYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGWQSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANWWCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAYTLYSENLYDKNFLANYCVGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGLPGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMCIFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEARNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPDLEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQKYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGKEPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWYDPDKGGEPGALCKYGNPNVLTIDIGTSQLAQATSAHTTLVEIEKYNGAVEQVTAFNGPVEMVAQCEYVPASQVKS >NZ_CP029122.1|WP_001323677.1|3119333_3120506_-|pentaheme-c-type-cytochrome-TorC MRKLWNALRRPSARWSVLALVAIGIVIGIALIVLPHVGIKVTSTTEFCVSCHSMQPVYEEYKQSVHFQNASGVRAECHDCHIPPDMPGMVKRKLEASNDIYQTFIAHSIDTPEKFEAKRAELAEREWARMKENNSATCRSCHNYDAMDHAKQHPEAARQMKVAAKDNQSCIDCHKGIAHQLPDMSSGFRKQFDELRASANDSGDTLYSIDIKPIYAAKGDKEASGSLLPASEVKVLKRDGDWLQIEITGWTESAGRQRVLTQFPGKRIFVASIRGDVQQQVKTLEKTTVADTNTEWSKLQATAWMKKGDMVNDIKPIWAYADSLYNGTCNQCHGAPEIAHFDANGWIGTLNGMIGFTSLDKREERTLLKYLQMNASDTAGKAHGDKKEEK >NZ_CP029122.1|WP_001120112.1|3120635_3121328_+|two-component-system-response-regulator-TorR MPHHIVIVEDEPVTQARLQSYFTQEGYTVSVTASGAGLREIMQNQPVDLILLDINLPDENGLMLTRALRERSTVGIILVTGRSDRIDRIVGLEMGADDYVTKPLELRELVVRVKNLLWRIDLARQAQPHTQDNCYRFAGYCLNVSRHTLERDGEPIKLTRAEYEMLVAFVTNPGEILSRERLLRMLSARRVENPDLRTVDVLIRRLRHKLSADLLVTQHGEGYFLAADVC |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_8 | 3414320-3414464 | Orphan |
NA
Consensus repeat of NZ_CP029122_8
|
1 spacers
spacers of NZ_CP029122_8
>8.1|3414372|41|NZ_CP029122|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around NZ_CP029122_8
The CRISPR arrays of NZ_CP029122_8 >merge|NZ_CP029122|8|3414320-3414464|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >NZ_CP029122|8|8|3414320-3414464|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>NZ_CP029122.1|WP_001091569.1|3412970_3414254_+|putative-acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >NZ_CP029122.1|WP_000533646.1|3411765_3412836_+|tyrosine-type-recombinase/integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >NZ_CP029122.1|WP_001303849.1|3411569_3411788_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >NZ_CP029122.1|WP_000545745.1|3411362_3411530_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >NZ_CP029122.1|WP_000120065.1|3410517_3411120_-|hypothetical-protein MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >NZ_CP029122.1|WP_000763365.1|3410085_3410307_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >NZ_CP029122.1|WP_001395510.1|3409705_3409987_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >NZ_CP029122.1|WP_023148020.1|3409503_3409695_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV 
>NZ_CP029122.1|WP_072126246.1|3409348_3409531_+|DUF1317-domain-containing-protein MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >NZ_CP029122.1|WP_001372450.1|3408671_3409352_+|YqaJ-viral-recombinase-family-protein MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP029122.1|WP_001372426.1|3414487_3416749_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >NZ_CP029122.1|WP_001036475.1|3416931_3418365_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >NZ_CP029122.1|WP_001372427.1|3418440_3419493_-|4-oxalomesaconate-tautomerase 
MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >NZ_CP029122.1|WP_000679972.1|3419676_3420630_+|LysR-family-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >NZ_CP029122.1|WP_000815449.1|3420670_3421666_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >NZ_CP029122.1|WP_000213425.1|3421820_3422639_+|bifunctional-pyridoxal-phosphate/fructose-1,6-bisphosphate-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >NZ_CP029122.1|WP_000891692.1|3422639_3423698_-|molybdenum-ABC-transporter-ATP-binding-protein-ModC 
MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >NZ_CP029122.1|WP_000604034.1|3423700_3424390_-|molybdate-ABC-transporter-permease-subunit MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >NZ_CP029122.1|WP_000101993.1|3424389_3425163_-|molybdate-ABC-transporter-substrate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >NZ_CP029122.1|WP_000891515.1|3425329_3425479_-|multidrug-efflux-pump-accessory-protein-AcrZ MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_9 | 3910465-3910618 | Orphan |
NA
Consensus repeat of NZ_CP029122_9
|
1 spacers
spacers of NZ_CP029122_9
>9.1|3910518|48|NZ_CP029122|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around NZ_CP029122_9
The CRISPR arrays of NZ_CP029122_9 >merge|NZ_CP029122|9|3910465-3910618|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >NZ_CP029122|9|9|3910465-3910618|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>NZ_CP029122.1|WP_000952760.1|3908590_3910330_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >NZ_CP029122.1|WP_032283079.1|3907860_3908646_-|putative-lateral-flagellar-export/assembly-protein-LafU MTTIKLIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >NZ_CP029122.1|WP_001226155.1|3906734_3907790_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >NZ_CP029122.1|WP_001059874.1|3906285_3906738_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >NZ_CP029122.1|WP_001295202.1|3905712_3905979_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >NZ_CP029122.1|WP_001293003.1|3903898_3905356_+|cytosol-nonspecific-dipeptidase 
MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >NZ_CP029122.1|WP_001291992.1|3903179_3903638_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >NZ_CP029122.1|WP_000189539.1|3901843_3903088_-|esterase-FrsA MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >NZ_CP029122.1|WP_000174677.1|3901384_3901786_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >NZ_CP029122.1|WP_000749881.1|3900290_3901346_+|phosphoporin-PhoE MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >NZ_CP029122.1|WP_000006256.1|3910647_3911145_-|REP-associated-tyrosine-transposase-RayT 
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >NZ_CP029122.1|WP_000009291.1|3911320_3912079_-|C40-family-peptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >NZ_CP029122.1|WP_001225679.1|3912370_3913111_+|murein-L,D-transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >NZ_CP029122.1|WP_000333380.1|3913081_3913849_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >NZ_CP029122.1|WP_000284050.1|3914054_3914633_-|D-sedoheptulose-7-phosphate-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >NZ_CP029122.1|WP_000973093.1|3914872_3917317_+|acyl-CoA-dehydrogenase-FadE 
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >NZ_CP029122.1|WP_000532698.1|3917359_3917833_-|C-lysozyme-inhibitor MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >NZ_CP029122.1|WP_001118055.1|3917986_3918757_+|2-oxoglutaramate-amidase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >NZ_CP029122.1|WP_000978828.1|3920195_3920645_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >NZ_CP029122.1|WP_001087742.1|3925557_3926910_+|membrane-protein 
MNSNVLTQTIVTGSDPRGLPEFSAIREEINKASHPSQPELNWKLVESLALAIFKANGVDLHTATYYTLARTRTQGLAGFCEGAELLAAMVSHDWDKFWPQGGPARTEMLDWFNSRTGNILRQQISFAESDLPLIYRTERALQLICDKLQQVELKRVPRVENLLYFMQNTRKRLEPQLKSNTENAAQTTVRTLIYAPETQASSTPEAVVPPLPGLPEMKVEVRSLTENPPQASVIKQGSTVRGFIAGIACSVAVASALWWWQVYPVQQQLLQVNDTAQGAATVWMASPELENYERRLQQLLDTSPVQPLETGMQMMRVADSRWPESLQQQQASTQWNEALKTRAQSSPQLRGWLQTRQDLHAFADLVMQREKEGLTLSYIKNVIWQAERGLGQETPVESLLTQYHDARAQKQNTDTLEKQINERLEGVLSRWLLLKNNVMPEAATGTTAEK |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_10 | 4100634-4100749 | Orphan |
NA
Consensus repeat of NZ_CP029122_10
|
1 spacers
spacers of NZ_CP029122_10
>10.1|4100665|54|NZ_CP029122|CRISPRCasFinder TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC |
CRISPR arrays and Neighbor proteins around NZ_CP029122_10
The CRISPR arrays of NZ_CP029122_10 >merge|NZ_CP029122|10|4100634-4100749|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATCTGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTCAACGCCTGATGCGACGCTGGCGCGTCTTATC >NZ_CP029122|10|10|4100634-4100749|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATC TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC AACGCCTGATGCGACGCTGGCGCGTCTTATC
>NZ_CP029122.1|WP_000151734.1|4099111_4100614_+|L-arabinose-isomerase MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLGDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR >NZ_CP029122.1|WP_001371424.1|4097400_4099101_+|ribulokinase MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCDAPNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDTTGSTPAPIDADGNVLALRPEFAENPNAMFVLWKDHTAVEEAEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKILHVTRQDSAVAQSAASWIELCDWVPALLSGTTGPQDIRRGRCSAGHKSLWHESWGGLPPASFFDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAGQSAFGDIYAWFGRVLGWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFNGRRTPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASAVEKTLQPCSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVPTL >NZ_CP029122.1|WP_001300811.1|4096183_4097062_-|arabinose-operon-transcriptional-regulator-AraC MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGISVLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCEEKVNDVAVKLS >NZ_CP029122.1|WP_001148402.1|4095333_4096098_-|DedA-family-protein MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPLMPVYIDILRKVVGG 
>NZ_CP029122.1|WP_000916291.1|4094521_4095220_+|thiamine-ABC-transporter-ATP-binding-protein-ThiQ MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLTPASGSLTIDGVDHTTTPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKLNAAQQEKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDEPFSALDPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGKTNELLSGKASASALLGITG >NZ_CP029122.1|WP_000235700.1|4092927_4094538_+|thiamine/thiamine-pyrophosphate-ABC-transporter-permease-ThiP MATRRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGNWVAVWQDSYLWHVVRFSFWQAFLSALLSVVPAIFLARALYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLASLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQALENIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDPARAAMLALIQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQALEMSGMLILAMPGIVLATGFFLLLNNTIGLPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFLLFTVIEKLPGRNVKTD >NZ_CP029122.1|WP_001371422.1|4091968_4092952_+|thiamine-ABC-transporter-substrate-binding-subunit MLKKCLPLLLLCTAPVFAKPVLIVYTYDSFAADWGPGPKIKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEQLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR >NZ_CP029122.1|WP_001297366.1|4090149_4091805_+|DNA-binding-transcriptional-regulator-SgrR 
MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLNTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKATVRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEENGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVSPTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNTTNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEGCYYLLFDSRTHRGANQQVRDWVSYVLSPTNLVYFAEEQYQQLWFPAYGLLPRWHHARTIKSEKPAGLESLTLTFYQDHSEHRVIAGIMQQILASHQVTLEIKEISYDQWHEGEIESDIWLNSANFTLPLDFSLFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLIHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP >NZ_CP029122.1|WP_001248770.1|4089929_4090061_-|glucose-uptake-inhibitor-SgrT MRQFYQHYFTATAKLCWLRWLSVPQRLTMLEGLMQWDDRNSES >NZ_CP029122.1|WP_000637846.1|4088649_4089828_-|sugar-efflux-transporter-SetA MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWIGLFYTVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRHYLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFVASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKRRMMVIAVAAGVLFYTGLIFFHSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMPGRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV >NZ_CP029122.1|WP_000888642.1|4100813_4101509_+|L-ribulose-5-phosphate-4-epimerase MLEDLKRLVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSVMTADDMVVVSIATGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRKHGAKAYYGQ >NZ_CP029122.1|WP_000035637.1|4101583_4103935_+|DNA-polymerase-II 
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFITSPVWVEGDIRNGAIVNARLKPHPDYRPPLKWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNEPYQEYIRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >NZ_CP029122.1|WP_001117011.1|4104099_4107006_+|RNA-polymerase-associated-protein-RapA MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSARQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ >NZ_CP029122.1|WP_000525176.1|4107017_4107677_+|bifunctional-tRNA-pseudouridine(32)-synthase/23S-rRNA-pseudouridine(746)-synthase-RluA 
MGMENYNPPQEPWLVILYQDDHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDYPQAESVHRLDMATSGVIVVALTKAAERELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQKVCYETGKPAQTEYEVVEYAADNTARVVLKPITGRSHQLRVHMLALGHPILGDRFYASPEARAMAPRLLLHAEMLTITHPAYGNSMTFKAPADF >NZ_CP029122.1|WP_001200579.1|4107793_4108609_-|co-chaperone-DjlA MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFRVGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEELGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK >NZ_CP029122.1|WP_000746150.1|4108863_4111218_+|LPS-assembly-protein-LptD MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHYQQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNSL >NZ_CP029122.1|WP_000800453.1|4111270_4112557_+|peptidylprolyl-isomerase-SurA MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESLAQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAKEFSQDPGSANQGGDLGWATADIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN 
>NZ_CP029122.1|WP_000241271.1|4112556_4113546_+|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTDRAAMLGLPLTLRTYSPNSPAQPQTAGTLTLLPVALRESVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRVALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPLLDELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGEADVGSFITALNLAIKMIVNTQ >NZ_CP029122.1|WP_001065381.1|4113542_4114364_+|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTGMGIDPAMRAENISVAQYCQMANYLAENAPLQES >NZ_CP029122.1|WP_000610901.1|4114366_4114744_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP029122_11 | 4123662-4123794 | Orphan |
NA
Consensus repeat of NZ_CP029122_11
|
2 spacers
spacers of NZ_CP029122_11
>11.1|4123679|42|NZ_CP029122|PILER-CR
TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC
>11.2|4123738|40|NZ_CP029122|PILER-CR
CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG
|
CRISPR arrays and Neighbor proteins around NZ_CP029122_11
The CRISPR arrays of NZ_CP029122_11
>merge|NZ_CP029122|11|4123662-4123794|PILER-CR
ATCACCAATATTGAAAATGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTCCTCACCAATATTGAAAACATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGGATCACCAATATTGAAAG
>NZ_CP029122|11|3|4123662-4123794|PILER-CR
ATCACCAATATTGAAAA TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC CTCACCAATATTGAAAA CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG ATCACCAATATTGAAAG
>NZ_CP029122.1|WP_000692204.1|4122800_4123571_-|electron-transfer-flavoprotein-FixA MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI >NZ_CP029122.1|WP_001091499.1|4121844_4122786_-|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGADGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDASRTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEHERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR >NZ_CP029122.1|WP_001287715.1|4120507_4121794_-|FAD-dependent-oxidoreductase MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNVTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHVVPEAGINMLPELVGDGVLIAGDAAGMCMNLGFTIRGMDLAIAAGEAAAKTVLSAMKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL >NZ_CP029122.1|WP_000203747.1|4120223_4120511_-|ferredoxin-like-protein-FixX MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYG >NZ_CP029122.1|WP_001183198.1|4118834_4120166_-|MFS-transporter MQPSRNFDDLKFSSIHRRILLWGSGGPFLDGYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDIIAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAMWYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECEEMMIKLFGEPVAFDEEQPQQTRFRDLFNRRHFPFVLFVAAIWTCQVIPMFAIYTFGPQIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVSTWALPIFINNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG 
>NZ_CP029122.1|WP_000600725.1|4118196_4118727_-|glutathione-regulated-potassium-efflux-system-oxidoreductase-KefF MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDKVFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHYKQRLLEWQEAHHG >NZ_CP029122.1|WP_000377129.1|4116341_4118204_-|glutathione-regulated-potassium-efflux-system-protein-KefC MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGGGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS >NZ_CP029122.1|WP_000624375.1|4115670_4116150_-|type-3-dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR >NZ_CP029122.1|WP_000257192.1|4114750_4115593_+|bis(5'-nucleosyl)-tetraphosphatase-(symmetrical) MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS >NZ_CP029122.1|WP_000610901.1|4114366_4114744_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH >NZ_CP029122.1|WP_000787103.1|4124044_4125559_+|L-carnitine/gamma-butyrobetaine-antiport-BCCT-transporter 
MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD >NZ_CP029122.1|WP_000347117.1|4125589_4126732_+|crotonobetainyl-CoA-dehydrogenase MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSAYTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR >NZ_CP029122.1|WP_000349926.1|4126860_4128078_+|L-carnitine-CoA-transferase MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKARETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLSTPEIPEGTQLIHRIECPYGPLVEEKLDAWLAAHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGLAKVED >NZ_CP029122.1|WP_000351348.1|4128151_4129705_+|crotonobetaine/carnitine-CoA-ligase 
MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLREESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQPATLCYAPPLLTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATITECIPMMIRTLMVQPPSANDRQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGDKRRWPSIGRAGFCYEAEIRDDHNRPLPAGEIGEICIKGVPGKTIFKEYFLNPKATAKVLEADGWLHTGDTGYCDEEGFFYFVDRRCNMIKRGGENVSCVELENIIATHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK >NZ_CP029122.1|WP_000004404.1|4129813_4130599_+|crotonobetainyl-CoA-hydratase MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGTEEALRWGIVNRVVSQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAVEGPLAFAEKRDPVWKGR >NZ_CP029122.1|WP_000122876.1|4130604_4131195_+|carnitine-operon-protein-CaiE MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFVKAGFHGEKRQLLMGTPARAVRSVSDDELHWKRLNTKEYQDLVGRCHASLHETQPLRQMEENRPRLQGTTDVTPKR >NZ_CP029122.1|WP_000333120.1|4131280_4131676_-|carnitine-metabolism-transcriptional-regulator-CaiF MCEGYVEKPLYLLIAEWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVAEISCEVKMIPNKLEGRGCQCQRLVKVVDIDEQIYARLRNNSRDKLVGVRKTPRIPAVPLTELNREQKWQMMLSKSMRR >NZ_CP029122.1|WP_001126376.1|4131936_4135158_-|carbamoyl-phosphate-synthase-large-subunit 
MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNAEFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK >NZ_CP029122.1|WP_000597260.1|4135175_4136324_-|glutamine-hydrolyzing-carbamoyl-phosphate-synthase-small-subunit MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK >NZ_CP029122.1|WP_000543597.1|4136779_4137601_-|4-hydroxy-tetrahydrodipicolinate-reductase MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLLGSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHHRHKVDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERLEITHKASSRMTFANGAVRSALWLSGKEGGLFDMRDVLDLNSL |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP029122_7 | 7.1|3110451|40|NZ_CP029122|CRISPRCasFinder | 3110451-3110490 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
NZ_CP029122_11 | 11.1|4123679|42|NZ_CP029122|PILER-CR | 4123679-4123720 | 42 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141085-141126 | 0 | 1.0 |
NZ_CP029122_11 | 11.2|4123738|40|NZ_CP029122|PILER-CR | 4123738-4123777 | 40 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141028-141067 | 1 | 0.975 |
NZ_CP029122_6 | 6.1|2457209|38|NZ_CP029122|CRISPRCasFinder | 2457209-2457246 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
NZ_CP029122_9 | 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
NZ_CP029122_9 | 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP029122_9 | 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP029122_9 | 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP029122_1 | 1.1|892312|42|NZ_CP029122|CRISPRCasFinder | 892312-892353 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
NZ_CP029122_3 | 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
NZ_CP029122_4 | 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
NZ_CP029122_4 | 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
NZ_CP029122_4 | 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
NZ_CP029122_4 | 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18007 | 7 | 0.774 |
NZ_CP029122_4 | 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
NZ_CP029122_1 | 1.1|892312|42|NZ_CP029122|CRISPRCasFinder | 892312-892353 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
NZ_CP029122_3 | 3.6|1310937|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310937-1310968 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
NZ_CP029122_4 | 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97498-97528 | 8 | 0.742 |
NZ_CP029122_4 | 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
NZ_CP029122_4 | 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
NZ_CP029122_4 | 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
NZ_CP029122_4 | 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
NZ_CP029122_4 | 4.10|1333533|33|NZ_CP029122|PILER-CR | 1333533-1333565 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62681-62713 | 8 | 0.758 |
NZ_CP029122_4 | 4.16|1333899|33|NZ_CP029122|PILER-CR | 1333899-1333931 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530672 | 8 | 0.758 |
NZ_CP029122_4 | 4.19|1333534|32|NZ_CP029122|CRT | 1333534-1333565 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
NZ_CP029122_4 | 4.19|1333534|32|NZ_CP029122|CRT | 1333534-1333565 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
NZ_CP029122_4 | 4.19|1333534|32|NZ_CP029122|CRT | 1333534-1333565 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
NZ_CP029122_4 | 4.19|1333534|32|NZ_CP029122|CRT | 1333534-1333565 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
NZ_CP029122_4 | 4.22|1333717|32|NZ_CP029122|CRT | 1333717-1333748 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
NZ_CP029122_4 | 4.22|1333717|32|NZ_CP029122|CRT | 1333717-1333748 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
NZ_CP029122_4 | 4.25|1333900|32|NZ_CP029122|CRT | 1333900-1333931 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
NZ_CP029122_4 | 4.25|1333900|32|NZ_CP029122|CRT | 1333900-1333931 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
NZ_CP029122_4 | 4.26|1333961|32|NZ_CP029122|CRT | 1333961-1333992 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
NZ_CP029122_1 | 1.1|892312|42|NZ_CP029122|CRISPRCasFinder | 892312-892353 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
NZ_CP029122_4 | 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
NZ_CP029122_4 | 4.2|1333595|31|NZ_CP029122|CRISPRCasFinder | 1333595-1333625 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
NZ_CP029122_4 | 4.2|1333595|31|NZ_CP029122|CRISPRCasFinder | 1333595-1333625 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
NZ_CP029122_4 | 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405905 | 9 | 0.71 |
NZ_CP029122_4 | 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248363-2248393 | 9 | 0.71 |
NZ_CP029122_4 | 4.8|1333961|31|NZ_CP029122|CRISPRCasFinder | 1333961-1333991 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
NZ_CP029122_4 | 4.10|1333533|33|NZ_CP029122|PILER-CR | 1333533-1333565 | 33 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86213 | 9 | 0.727 |
NZ_CP029122_4 | 4.13|1333716|33|NZ_CP029122|PILER-CR | 1333716-1333748 | 33 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17976-18008 | 9 | 0.727 |
NZ_CP029122_4 | 4.22|1333717|32|NZ_CP029122|CRT | 1333717-1333748 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
NZ_CP029122_4 | 4.25|1333900|32|NZ_CP029122|CRT | 1333900-1333931 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
NZ_CP029122_4 | 4.25|1333900|32|NZ_CP029122|CRT | 1333900-1333931 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
NZ_CP029122_4 | 4.25|1333900|32|NZ_CP029122|CRT | 1333900-1333931 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
NZ_CP029122_4 | 4.26|1333961|32|NZ_CP029122|CRT | 1333961-1333992 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
NZ_CP029122_3 | 3.1|1310632|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310632-1310663 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
NZ_CP029122_4 | 4.11|1333594|33|NZ_CP029122|PILER-CR | 1333594-1333626 | 33 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78535-78567 | 10 | 0.697 |
NZ_CP029122_4 | 4.16|1333899|33|NZ_CP029122|PILER-CR | 1333899-1333931 | 33 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14952-14984 | 10 | 0.697 |
NZ_CP029122_4 | 4.16|1333899|33|NZ_CP029122|PILER-CR | 1333899-1333931 | 33 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14982-15014 | 10 | 0.697 |
NZ_CP029122_4 | 4.17|1333960|33|NZ_CP029122|PILER-CR | 1333960-1333992 | 33 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35739-35771 | 10 | 0.697 |
NZ_CP029122_4 | 4.19|1333534|32|NZ_CP029122|CRT | 1333534-1333565 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
NZ_CP029122_4 | 4.20|1333595|32|NZ_CP029122|CRT | 1333595-1333626 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
NZ_CP029122_4 | 4.20|1333595|32|NZ_CP029122|CRT | 1333595-1333626 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
NZ_CP029122_4 | 4.22|1333717|32|NZ_CP029122|CRT | 1333717-1333748 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
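
The last column of the rows above appears to equal 1 - mismatch / spacer length (for example, 9 mismatches over a 33 nt spacer gives 0.727). The Python sketch below parses one row and checks that relation. It assumes columns are separated by ` | ` while the pipes inside the spacer ID have no surrounding spaces. The field names are hypothetical, since the table's header row is not shown in this section.

```python
# Hypothetical field names: the header row for this table is not shown here.
FIELDS = [
    "crispr_id", "spacer_id", "spacer_loc", "spacer_len",
    "target_acc", "target_def", "target_loc", "mismatch", "identity",
]

def parse_hit_row(row: str) -> dict:
    """Split one ' | '-delimited hit row into named fields."""
    text = row.strip().rstrip("|").strip()          # drop the trailing column pipe
    cells = [cell.strip() for cell in text.split(" | ")]
    if len(cells) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} cells, got {len(cells)}")
    hit = dict(zip(FIELDS, cells))
    hit["spacer_len"] = int(hit["spacer_len"])
    hit["mismatch"] = int(hit["mismatch"])
    hit["identity"] = float(hit["identity"])
    return hit

def span_length(loc: str) -> int:
    """Length of an inclusive 'start-end' coordinate range."""
    start, end = loc.split("-")
    return int(end) - int(start) + 1

def identity_consistent(hit: dict, tol: float = 1e-3) -> bool:
    """True if the reported identity matches 1 - mismatch/length within tol."""
    expected = 1 - hit["mismatch"] / hit["spacer_len"]
    return abs(expected - hit["identity"]) < tol

SAMPLE_ROW = (
    "NZ_CP029122_4 | 4.13|1333716|33|NZ_CP029122|PILER-CR | 1333716-1333748 | 33 | "
    "NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, "
    "complete sequence | 17976-18008 | 9 | 0.727 |"
)

if __name__ == "__main__":
    hit = parse_hit_row(SAMPLE_ROW)
    print(hit["spacer_id"], span_length(hit["spacer_loc"]), identity_consistent(hit))
```

Comparing with a tolerance rather than an exact rounded value avoids depending on how the report rounds ties such as 0.6875.
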
1. spacer 7.1|3110451|40|NZ_CP029122|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 11.1|4123679|42|NZ_CP029122|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 0, identity: 1.0
tgtcacacgcagataaatccaactttcaatattgttaagttc CRISPR spacer tgtcacacgcagataaatccaactttcaatattgttaagttc Protospacer ******************************************
3. spacer 11.2|4123738|40|NZ_CP029122|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 1, identity: 0.975
catggcgtagcaaaaagaaattttcaatattgctttatgg CRISPR spacer catggcgtagaaaaaagaaattttcaatattgctttatgg Protospacer ********** *****************************
4. spacer 6.1|2457209|38|NZ_CP029122|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
5. spacer 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
6. spacer 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
7. spacer 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
8. spacer 9.1|3910518|48|NZ_CP029122|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
9. spacer 1.1|892312|42|NZ_CP029122|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
10. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
11. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
12. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
13. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
14. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
15. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
16. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
17. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
18. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
19. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
26. spacer 3.7|1310998|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
27. spacer 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
28. spacer 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
29. spacer 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
30. spacer 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer agcgtcaccgacgcgcagggccgctaccaac Protospacer **************** * *******.
31. spacer 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
32. spacer 1.1|892312|42|NZ_CP029122|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
33. spacer 3.6|1310937|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
34. spacer 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttca Protospacer ***************** ***** *. ..
35. spacer 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
36. spacer 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
37. spacer 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
38. spacer 4.7|1333900|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
39. spacer 4.10|1333533|33|NZ_CP029122|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gtccctatcgcaatgccggcagcatccgcaatc Protospacer **. *. ****** **** ************.
40. spacer 4.16|1333899|33|NZ_CP029122|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gccgaacaggtggcgaagcaggtgatgggccag Protospacer *******.* **************.. *** .
41. spacer 4.19|1333534|32|NZ_CP029122|CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
42. spacer 4.19|1333534|32|NZ_CP029122|CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
43. spacer 4.19|1333534|32|NZ_CP029122|CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
44. spacer 4.19|1333534|32|NZ_CP029122|CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
45. spacer 4.22|1333717|32|NZ_CP029122|CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
46. spacer 4.22|1333717|32|NZ_CP029122|CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
47. spacer 4.25|1333900|32|NZ_CP029122|CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
48. spacer 4.25|1333900|32|NZ_CP029122|CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
49. spacer 4.26|1333961|32|NZ_CP029122|CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
50. spacer 1.1|892312|42|NZ_CP029122|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
51. spacer 4.1|1333534|31|NZ_CP029122|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
52. spacer 4.2|1333595|31|NZ_CP029122|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
53. spacer 4.2|1333595|31|NZ_CP029122|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
54. spacer 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttca Protospacer ******.********** ***** *. ..
55. spacer 4.4|1333717|31|NZ_CP029122|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacatcaccgacgcccagtggcgcgacgtcc Protospacer *.********** ********* ** .
56. spacer 4.8|1333961|31|NZ_CP029122|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
57. spacer 4.10|1333533|33|NZ_CP029122|PILER-CR matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.727
-gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer cgcta-ccgcgcaattcgaggagcatccgctggg Protospacer *.*. *********** .*********** .
58. spacer 4.13|1333716|33|NZ_CP029122|PILER-CR matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
gcccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer cagcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
59. spacer 4.22|1333717|32|NZ_CP029122|CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
60. spacer 4.25|1333900|32|NZ_CP029122|CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
61. spacer 4.25|1333900|32|NZ_CP029122|CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
62. spacer 4.25|1333900|32|NZ_CP029122|CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
63. spacer 4.26|1333961|32|NZ_CP029122|CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
64. spacer 3.1|1310632|32|NZ_CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
65. spacer 4.11|1333594|33|NZ_CP029122|PILER-CR matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.697
gacggacaaaatatatattgatttgcgaattat CRISPR spacer gacggaaaaattatatattgattttacttctgg Protospacer ****** *** ************* .*.
66. spacer 4.16|1333899|33|NZ_CP029122|PILER-CR matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
67. spacer 4.16|1333899|33|NZ_CP029122|PILER-CR matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
68. spacer 4.17|1333960|33|NZ_CP029122|PILER-CR matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.697
ggtttaccgccccgcagaggcgctggcagatcc CRISPR spacer ccgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
69. spacer 4.19|1333534|32|NZ_CP029122|CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
70. spacer 4.20|1333595|32|NZ_CP029122|CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
71. spacer 4.20|1333595|32|NZ_CP029122|CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
72. spacer 4.22|1333717|32|NZ_CP029122|CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
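
Each alignment record above lists the spacer, the protospacer, and a match line on a single flattened row. The Python sketch below pulls the two sequences out of such a row and recounts the mismatches position by position. It assumes the literal labels `CRISPR spacer` and `Protospacer` separate the fields, and it handles only ungapped records: in the gapped records (those containing `-`) the reported mismatch count does not equal a simple column-by-column count, so they are rejected here. The function names are illustrative.

```python
import re

# Assumed layout: "<spacer> CRISPR spacer <protospacer> Protospacer <match line>"
RECORD_RE = re.compile(r"^(\S+)\s+CRISPR spacer\s+(\S+)\s+Protospacer")

def parse_alignment(record: str) -> tuple[str, str]:
    """Return (spacer, protospacer) from one flattened alignment row."""
    match = RECORD_RE.match(record.strip())
    if match is None:
        raise ValueError("record does not follow the assumed layout")
    return match.group(1), match.group(2)

def count_mismatches(spacer: str, protospacer: str) -> int:
    """Hamming distance between two ungapped, equal-length sequences."""
    if "-" in spacer or "-" in protospacer:
        raise ValueError("gapped alignments are not handled by this sketch")
    if len(spacer) != len(protospacer):
        raise ValueError("sequences differ in length")
    return sum(a != b for a, b in zip(spacer, protospacer))

# Record 3 from the list above (spacer 11.2 against NZ_AP023206).
SAMPLE_RECORD = (
    "catggcgtagcaaaaagaaattttcaatattgctttatgg CRISPR spacer "
    "catggcgtagaaaaaagaaattttcaatattgctttatgg Protospacer "
    "********** *****************************"
)

if __name__ == "__main__":
    spacer, proto = parse_alignment(SAMPLE_RECORD)
    mismatches = count_mismatches(spacer, proto)
    print(mismatches, round(1 - mismatches / len(spacer), 3))
```

For record 3 this recount agrees with the reported values (1 mismatch over 40 nt, identity 0.975).
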
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1347549 : 1354689
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP029122|1347549:1354689|DBSCAN-SWA ATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATG
CCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAAT
ATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAG
CAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP029122|1347549:1354689|1352127_1354689_-|WP_001272924.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >NZ_CP029122|1347549:1354689|1350517_1351315_+|WP_001272549.1|DBSCAN-SWA MSAGQRKEQKLIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >NZ_CP029122|1347549:1354689|1348184_1349447_-|WP_000590392.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >NZ_CP029122|1347549:1354689|1349443_1350352_-|WP_000847985.1|DBSCAN-SWA 
MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >NZ_CP029122|1347549:1354689|1351365_1352022_-|WP_001141340.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >NZ_CP029122|1347549:1354689|1347549_1348188_-|WP_001278994.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS |
6 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1965040 : 1974482
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP029122|1965040:1974482|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTTAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCTTGAAAATGATTAACCGCCTGGTGGAACATGACAGCGGAGAGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTACAAAAATGGTCGCGGGCACGGATTGACGATCGTATCGACGAATTAATGACGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGGGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGATGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAAATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCG
AATAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCACTGAAGCCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAATG
CGCTTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGGATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGATCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCTTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGATTGCGTCAGGAGCGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAATCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAACGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCACGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCGA
TGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCCGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGAGTGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCAGCGCCAGTTGCAGGCCTCTGCCGCCTGCAATGCGCGGTTGAATGCTGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCAAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGTTTGGCCTTCTTCGCTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGGTTAAATTCTGCTTGCTGCGCTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTACGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGCACAGAACAACTCAATGGTGATGCCTTT
AGCGACCAGCGCCTGCGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACTCTTCTTCTTTTTCCGTGGGCTGGACGGTGGCACAAAGTCGTTGATAACTTAACACCAGCATCACGCGATGACGGCACATGCCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCTGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCCGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGTGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGCGTCATGGCGGCTATTTGTGCGCCGACATTAACCAGCGCCTGGGCCGTATCGCGGTCATAGCAAGGCGTTGCGGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGCAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTTGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACCACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTCGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGTCGCTCAATCACCTGTTGCGGAAACAGAGTGTGAATACTGTTGATCCACTCAGGGGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGGTCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP029122|1965040:1974482|1966683_1966791_-|WP_001216963.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >NZ_CP029122|1965040:1974482|1966850_1967582_-|WP_001240401.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >NZ_CP029122|1965040:1974482|1969485_1970205_+|WP_000598641.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP029122|1965040:1974482|1965040_1965967_+|WP_000569325.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGEIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMTLLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVMNTQGQPCGTLHFQDLLEEA >NZ_CP029122|1965040:1974482|1967803_1969489_+|WP_001295431.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >NZ_CP029122|1965040:1974482|1970762_1971224_-|WP_024176190.1|DBSCAN-SWA 
MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYVQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >NZ_CP029122|1965040:1974482|1971348_1973349_-|WP_001317947.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENGALIATFSDGVRTQLANSQALKEAQCSCGASGMCRHRVMLVLSYQRLCATVQPTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCAPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKAQQAEFNHLIWQMRSEHVTSSDDPFASEEGQTCRQYVQQLSQALWLGGISQPLIHYEAAFNRALQAAEACNWRWVSESLRQLRASVDAFHTRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >NZ_CP029122|1965040:1974482|1970251_1970722_+|WP_001295430.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >NZ_CP029122|1965040:1974482|1973345_1974482_-|WP_001292774.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >NZ_CP029122|1965040:1974482|1965971_1966703_+|WP_000783120.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2543045 : 2571648
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP029122|2543045:2571648|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGAAGTGGGAAGGTTCTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTAGAGGCATAAACTGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGTGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCAGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTACGTTCGAATTTTTCGCTGGTGACTGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCTTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGCGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCTACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTGTTATCATCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCATCTTCACGACCAGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCGCCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCGGTGCTGTAGGAGCCATACTGATTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGG
TCATATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCACCGCGTGTACCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGATCACTTTTTCGCTGGCCTGTTGTACCGTTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTTGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGGTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAATGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATGTGTCGTATCCGGCGTTCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCACGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGC
TTTTTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTGTTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCG
ATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGC
ATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAG
GTGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATT
TAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTC
AAGGGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGACGAGATTAAAGTGCATTAGTTCAGATGCAAATTGACATTTTGTGGCACAGGGTAGAGCTAGCGTGGTTGTCCGCTTTGTGCCAAGAGCGGACTTTGCAAAATGGGGGTTATTTCAATCAAAACGTAACGTCACAACCAGCCGACGCTCTCTCGCCATTTATAATTAGTAACTTTATCATTTTCGCTTATTTTTTTAGATATAGAGCGCGGCTCTCTTCCTAGATACTCAGATATTTCTATCGGGGACAAATCAAAATCAACAAGCATGACTCTAAGTTTTTCCATTTCCTTTAAAGTCCAAGGCTTGCCATAATTTTCATAAAGAGATACCTTATGCTCCCTGATAGTTCGCTTTCTCTGAGTTTCACTTTCCCTCTGAAAAATCTCGCTTTTAAATTGTGAACGGAACTCTTTACAGAAGTTGTTATCAGTTTTTTTGTTATATAGATTGCTAAAAATAAGCTGGGATGCCGGGTCTAAATCAGGTGTGTTAATAATGAAAACTTTGACCTTTTCCATATAGGGATATTCAATTTTACCCAATATGCTTAATTGCTTTATATTAAAAAAACCTCTCAAATCAAAAGATTTAATGAGCTTTGATTGGATTACACTTTCAATCCTGCTTGCATTTAAATAACAGTACTTTGCCATTGGAAGGGCCCATACTACCAGTTGAGGAATATATTTTTCTAATGCAATGCTCTGCCAATCAGTATCAAAGGTTTGGCTTTTTGCTAGCATGTTTTTCGGCAAATCAGAATACATTGTGGTAGATGCCCATATCTTATAATCATTCGCCAAGGCTTGATAGTATTTTGTGTGGTTATGAATTTTATAAGCTGACATGAAGCGATAAACGTCTTCGTCGTGACCAGCGTCGTAAATTGTTCGATTTCCTCTAAGATAACCGTCGTAATGTTCGGTTATCCTTCTACCTACATTACAACTTACCCCAACGTAAACCACACGACTGAAAAGTCCTTTATGGACAATAAGATAAACTCCGCTACAGCCAGATTTCCTGGCCTCTGATAGAGAACCTAAAAATCTCCATTCCATAATTAAATCCATAATTATTGCTACTGTTTTTGTTTATCATTATTTTCGTGAAACTTCAACAATTTTATCCAAAAGCTAAGGGCAAAGACTTATATAATTATACTTGTCATCGTTAGCGATTATATAGAGTAGTGGCGCTGACCTGCTCCCTGGTGATTCACACAGAATGCTGTTAGTAATGTCCGTTCCTCGCTCTCAGCGGACCTTCAGCTCAGTGATATCGTCCGCTCTGTGCAAAGAGCGGACGTTGGTATGCAAGAGCCCTCCAAAAGTTGATGGTTGGTTTGCAGGGGGGCTTAAAGAAACTGCACTTATCAAGTTGAAGTTCTGTATTCAGCGAAATCGTAGCACTCTGACGATAAGTAACTCCGGTACTCCGCTCTTCGATGAAGAACCAGAGTAATCCCCCCGAAAAACCAGCGCATCAAAATTGGATCTTCAGCGGTAGCTTATCGGCTATCGGAAGTACAGGTGTGGATTCGTGGTGAATTGCTTTGATAATAAACGATTAATACGGAAAAACGCATTAATCATTTATTAGCTTTTAGTAAACCACAATTTATTCCGTTTTACATATCATAGTAGTCGATTGGAGAATATAGTTTCTGGGAATGTACTCTTCAAAGTGTTCGTCCTTTTTAAATACATGAACTACATTTGGGAATAATTGATAGTCAACAGGGTGTATAGCGTTTGGATTATGGTACATGTACATGGCTGTACACCATGGTTCTTGATAGTTAGGGTCACTTACATC
GGCTGAAAATGGATGTGGGGCTGCATCCTGATCAGTTTTAACACCACTGACGTACACTTTGAATCCACTCGCCTCTACACCTGCAAGAATTCCCATCCGGTTAAACTTAGGTATGGTTGCTTGAGTAGTGAGTAAAACGGCAGAAACATAATTATTTTGTTCTGAGCCAAAAAAGTTCGACTTGATACTTCTATTTTCATCTGTATGTCTTTCAATAGAAATGCCTGACTCAATATCAATCCCGTACAAATAGCTATGCAAGGCTTCGCTTGAGAAGGCCATGGACATTCTTTTTGAATAATCCTGCATTGCTATGACAAATGGTTTGTTCTTTGTATGGTTGAGTTCCCAGTAATGAACTTTCTCTGGCTCAGGGCAATGCCGGACTTTTTTTAATAAACTTCTTGCAAACTTAAAAGGCATGACATTTAGAACATGTTTTCTTAATTCATCCATCTGTTCATCGTTAATGACTTTTCTTTCAAGAGGGGCTTCTGCTTCAGCAATGCTTACAGCCTCTACAGCAATTTCCACTCCAAATTTAGATAGCAGAAAATCTGGTTGATTGTATTCTCTATTCATTTCAAAGTCGAGTTCATAAAATACAGCGTTCAAATATAATTCAAATAACCTTGAATTAAATGCATCACTTTGAAAATCCCTTATAAATATTCCATCAGGATCTTTGAACCAGTATGCAAGTTCCTCAAGAACAATATATGCAGGGAAATGAAGAGGGTCTTCGAGGAGCATTTTTATATAAACATTCCTTTTTTTCGCTGGGACCTTACTCAAGAATAATGAAAAAGGTTTGGTTGATTCATCGCCTTGCATGAATGTACCATTTTGGTGCTGCGCCAGCATCTTTGGTATGTCATCGTTCAAATTATTAAGCAAGACATCCATTGAATCAAATGAAGCCAAGACGTTTATTGCTCTGAATTTTTTATCTAAATCCCGACCTAAGACTATTGCGTTAAAATCTTTATCAATATTGCATATGATTATTGTGGATAACAATGTTATCCCATTCCCCTCATATTTAAACCAGCGTATCTCCTCAGAAAATGTCTTAAGGTAAGGTGAGCGACCGTAAAAATAAATATCAAATTGTTCTTTGCTGATCTCACTGAAGTGTAATCCTGCGTTCATACCAATTCCTTTTCAATGAATAATTGGCCTTTAGGAGTGATTCCCTTTGTCTTTAATTCAGTTCTAACTAGTTCTTTAATCCAATAGCCTAAGCTCATCATGCAGTTGGATCATAAGACAACGCCCTATAGTGCTCGTGATACTATAGGGCATCTGACCACACTGTTAACTGGAGTAACGACTATGGCAGGAATACAGCATAACCAAACTCACCCCAAACTTACATAGCGCTTTCTGGCCGTGAGCATAACAAGGTCCACTCCTCGCTCATAAGGGACAACCATACTCAAATCTCCCACATTGCAGGAGATTTGAGTATGAACACGTCACCGTGGAACAAAGACCGTATCATAGGCCAAAAAAGACCACTTCAGATATCTCATATCTGGGGTATCCGAATCCGACTTGAACTGGAAGGTAAAACTCGCGATTTAGCTCTGTTCAACATGGCCCTGGATAGTAAGCTTCGAGGCTGTGATCTGGTCAAACTCAAAGTATCTGATGTTGCATATGGTGGCTCTGTTTCAAGCAGAGCAACGGTGTTGCAACAGAAAACCGGTAGCCCTGTTCAATTTGAGATAACCAAAGGGACAAGAGAAGCTGTTGCTGCATTGATACAGCTTAGCAATTTGCACAGTAAAGACTTCTTGTTTCGGTCTAGGGTCGGAACTAACCAGCACATTTCAACCCGGCAATACAACCGAATCTTTCATGGGGGGGTAGAAAAGCTTGGTCTCGAAGATTCGCTTTACAGCACACATTCCATGAGAAGAACAAAACCTTACCTGATCTACAAGAAAACCAAGAATCTCCGGGTGATCCAACTTCTGTTGGGTCATAAGAAACTG
GAAAGCACAGTCCGTTATCTGGGCATTGAAGTCGATGATGCGTTAGAGATTTCTGAATCGATTGAAGTCTAAGGTTGTCAGGGCTGCAACAGCAGCCCTGTGCCATAAGCGGAAGTATTTAACAACTATCAGTGTTGTTCAACAGATAAAGGGGCACTTGATTTTTTCTGTTCTCAGGAAATGATAAAAGCGCGTCGGTTCAAGCCTGCTTAACGGGAGTTTGTTAATCCTGTTGCCGTGACGTTTTGACACCATTATGATGGGGAGACACTTAATGTATGAAGGTTCCGCCACTTATACCTGTCCAACAACTGCCTCGGATGTTTCTTTGTATGAATAAGTGGTAATGAGTAGTGAATCGCTAACAGTCACCCGAACAATCGGTGCCTGCAATTAATTCTATATTCTAAACGAGGGGGAGATTATTACACATGAAATTTAAGGACAAGAACCTTAAGGCTCTCGCGGAATGTATCATAGGAGATAATAAGGCATTTCTGTATCGTTCAAGCAGTCACATCACTGAATTTTTCCAGGACTGCGGCATGGATGTTACTCATGACGGATCCACTCGGTGGAAATGGACGGCCCAGAGGCTTGAAGAACTTCTTTATGAGCCACAGTCAAAGCCACATACTTTGCCGGAAAGGTTTGTTCATGTGCTCAGAACTTTAATGTTAAAAGAAGATGCAATGGATGACGATCCAGGAAGATTAAAGGCGCTTGAAGAACTGAACAAGCCTTTGATGCGGGAAGGCTATGAGGCATTCTATGGTGACGATCGCCTTTTGTATATACGCCATACCGATACCAAAACGGTTTCAGTCAGTAATAACCCTCATCGGCCCTTAACGCCTCACGAAGTAGAATGCAGAAGGTTACTGACCGCGTTTCTTGATACCTGCTCAGAAGATGAGTTAATAGAAGATATTCTCCTTCCTTTATTCCGGCAACTTGGTTTTCACCGGATAACAGCAGTGGGACATAAAGATAAAGCGCTGGAATACGGGAAAGACATCTGGATGAAGTTCACACTGCCAACTCAGCATGTTCTTTATTTCGGCATTCAGGCAAAAAAAGGTAAGTTGGATGCGTCCGGTGCCAGCAAATCTACGAATTCAAACGTGGCAGAAATCTTCAACCAGGTACTGATGATGCTTGGCCATGAAATATTTGACCCAGAAACAAATAGAAAGGTGCTGGTAGATCATGCCTTTATCGTTGCTGGCGGAGAAATTACTAAACAGGCGAGGAACTGGCTGGGCGGGAAACTTGATGCCAGCAAAAGAAGCCAGATAATATTTATGGACCGGGAAGACATTCTTAATTTATATACTGTAAGTAATGTACCTCTGCCAACAGGTGCTCTCATCTCTGATGATGCCGTTAAGAACGATGATATTCCTTTCTAATCAGAAGTACGTCTTTTTCTGAAAGAATACGTGATAGGTAGCCACACCACACCTTTAGTGACCCCTTAATCTGGTAATATAACAGCCCGTATGAATGTCCGCGGCATCGCGGGCTGAAATTTATTAAAAATACTTATTCATCAAGCTGGAGTAGTTTGCCGAGTAACTGTAAACGCCCAACTTAACCGGACCATTCACTTTTAGATTGCTACCAGCAAACCAACTTCCGTTTCTCGCTCAAAGCGGACTAGAAGGTTAGCTTGCGTCGGACTTGGCGTATTTAAAGAAGTGCTGGTGGTAACTGGTTGTTGTGTTCCATTTCTACAAAACAAAATCACAGAAACTATACCCAATAGTTATATTGAATCAATGATGAGACAGCCTCATATTTATCAGAACTGGTGTACGTCCAATACAGGAGGTTGTCGTGCTGGTTCTCAAATATGCGCTAGCTATTGCGGCTGTAATGGCAATTTATTGTCTTGCTATTGTTCTTACGGATCGCCTTTCTGATTGATTTTATATTGGCGAGGTGACGGGAGTTAAGTAGAATTGCTGCGGGTGCTTGAGGCTATCTGCCTCAGGCAT
GAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGTTTAACATTAATCTGAGGCTCAATCTATGAACGGCAAATCTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAATGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCAGGCGGGGGAGAATCCCTCGCCACCTCTGATGTGTCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATTGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGAGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGAAGAGCAGGCTGAATGGATAATTCAGTGTTACCGCAGGCGCGGATACGAGGTTAAGAAAGCTCTTAGTCTCGACTACCGTCACTGGATAATCTCAGTCAGATTGCCTTACTCCGAACGCCCACCGCGTCCGTCCCGTACATTCCAGCAACGCATCTGGAGGTAACGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTACTGGTGGAGCCGGAACCGAAAAGCATGCGTAATCTGCCGTCCGGGGTCGTTCCTGCCGTTCGCCAGCCGCTGGTGGAAGACAAAACATTGCTGCCGTTTTTCAGTAACGCACGGGTAATTCGTGCTGCTGGTGGTGCTGGTGCATTGTCTGACTGGCTGTTGCGCCATATTAAATCCTGCCAGTGGCCACACGGCGATTATCATCACAGCGAAACCGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAGCTGCGTGACCAGACATCCGAATCACTCGAGCAACTTGCTCATCAAAACCTGTCAGCATGGATGATTGACGTCATCGGTCACGCAATAAGCGGTACGCAGGAGCGTGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCCGCAATCAGGTGGCGGACGCGCTACCGGAAGCGGTATTACGTGGTTCGCTGGGGTTGCGTGCGGAAAAAATCCGCTCAATGTACCGTGAAAGCGACATCGTACCGGGAGAGCAGACCGCCAACAGCATACTGAAACAGCGCACAAAAAATCTTGCGCCGCTGCCTCACGCCCACCAGCAACAGAACCCACCACAGGAAAAGACGGTGGTCAGCATTGCCGTTGATCCTGAGTCTCCGGAATCTTTCATGAAACGACCTAAACGTCGCCGCTGGGTTAACGAGAAATACACACGCTGGGTGAAGACACAGCCGTGTGCGTGTTGTGGTAAGCCAGCCGACGATCCCCATCACCTGATTGGTCATGGTCAGGGCGGAATG
GGGACAAAATCTCACGATATTTTCACGCTACCGCTGTGTCGGGAGCATCACAACGAGCTTCATGCGGATCCTCTGGCGTTCGAAGAAAAGCATGGTTCTCAGGTTGATTTAATTTTTCGTTTTCTTGATCACGCCTTTGCAACTGGCGTGCTTGGGTAAAAGAGGTGACTGATGCTCATAGATTTGGTTTTACCTTACCCGCCGACGGTGAACACTTACTGGCGACGCCGTGGCAGCACATATTTTATCTCGGAGGAGGGAAAGCGTTATCGCCGGGCTGTGGCGCTTATTGTTCGCCAGCAGCGGCTGAAATTAAGCCTGTCCGGAAGGCTGGCGATAAAGGTGATTGCAGAGCCACCGGATAAGCGTCGTCGCGACCTGGACAATATCCTGAAAGCACCGCTGGATGCGCTGACGCATGCGGGAGTGTTAATGGACGATGAGCAGTTTGATGAAATCAATATCGTTCGTGGTCAGCCAGTATCTGGTGGACGTCTGGGGGTGAAGATTTACCCCATAATGCATGAAGAGCAGGTCAAAAAATGAAACTGGAAGATTTACCGAAATACTACTCCCCAAAATCCCCTGGCCTGACCGATGCATCGGCCTCAACGTCAAAAGATGCGCTGAGTATCACTGATGTGATGGCCGCGCAGGGCATGACACAGAATCGGGCTGAGATGGGTTTTTCTGCGTTCCTGGGGAAAATGGGCATCAGTATGAATGACAGGGCGCGGGCAACAGAATTACTGGCAGATTATGCACTCAGTCGGTGCGATCGTGTGGCGGCGTTGAGAAAGCTTCCGGCAGAAATAAAACCGGTAGTGATGCGCATTATGGCTTCGTACGCTTTTGAGGATTATGCCCGCAGCGCAGCGAGTAAAAAGCAGTGCCCTTGTTGCTATGGGGAAAAATTTATTGAAAGCGTAGTTTTTACAAACAAGGTCCAGTATCCGGATGGTAAGCCGCCGGTATGGGCAAAGTGTACGAAAGGTGTGTATCCGTCTTACTGGGAAGAATGGAAAAAAGTCAGGGAGGTGGTAAAAGTTGCCTGTCCGGAGTGTGGCGGAAAGGGTGAGGTTTCCACCGCCTGTAAGGATTGCCGTGGGCGTGGTGTCGCCATTCATCGTGAAGAGTCGGTAAAACGTGGTATGCCTGTTATCAGAGACTGCCAGCGTTGTGGTGGTCGTGGCTATGAAAGACTACCATCAACGGAGGCATTTAATGCTATATGCGAGGTGACAAACCAGATAACACGCGCGTCATGGGAAAAAACAGTTAAGAAATTCTATGATGCGCTGGTGACCCGGTTTGATATTGAAGAAGCATGGGCTGAGCGGCAGTTAAAAAAGGTAACTAGGTAACAAGGTTGATTTTTCCGGAATCTGTGGTAAATTCGTCATAACGATGGGCGTTTTATGCCTGACGTTAGAAGAGTTTCTACAACCCGCCGCCGAGCGGGTTTTTTATTGCGGAATTAATTATGGACCGTTATTATTCTGCTCCCGGCCCTTTAGCTCAGTGGTGAGAGCGAGCGACTCATAATCGACAGGTCGCTGGGTCAAATCCAGCAAGGGCCACCAACCGTCACCAGTTCATCAGGAAAGAGCGTCAACCCTTTAAGTTGAGTGTGCGAGGTTCGAGTCCCCGGTGGCGGTCCAGTGCCGACTTCGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATATGCGGGCATCGCATAATGGCTATTACCTCAGCCTTCCAAGCTGATGATGCGGGTTCGATTCCCGCTGCCCGCTCCAGTTAGAGTCTTTCAGTCTGCGATGATGGGAAATCCCGGAGTGACTGAAAGACGTTTAAGTTATGAATGATCGCTTTTTTTTGCAAAATTGCTGTGCAGAAATACTAACCTTCGGGCAGGCGATCATTCATAAGCACTCTGCTTTTATTCCGATTAACTGTGGGTGGTTTGTTGGATAGAGTGC
TTTCCTTACTGTATATATTGTTTCGCCCGCTTTTGCGGGCTTTTCTTTTCAAATCCCTTTCATTTCTCAGTGTAAAACTACGCCATCCGTTATTTGCGGAGGTGAGGCTATGAAATCCATGGACAAAATTTCAACGGGCATTGCCTATGGCACCTCCGCAGGCAGTGCTGGCTACTGGTTTTTACAGCTGCTCGATAAAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTATCCGCACCGGCGGGCGCGTGCTGGCGGTAAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCTTCCGGCACCACGCTGATAAGCCTGGTTGACGGGCAGGGGAGTCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTGAAAGTGAGCCGTGTTCCTGACGGCGTTGCTGAATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTAAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGACACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTTCGCCTGACCGTGGCAGCGGATGACGGCCGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGCGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAACGGATTGCGGATATCAGGCAGGTTGAAACCAGCGCGCGTTATCTTGGTACGGCACTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTATTTTTACGTTCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTCGGTCAGCCGAGTGATGATGCATCAGGCTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGACTGAAATCAGGACGTCCATAACGGATGTCAGCAATGAAATAACACAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCAGCGATCCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCAGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATTGCGGGTATCGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCAGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAATGAAGTGTTCCTGAAACGCCTGACGGCCCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACATCAGACGGGAGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGAGATTTCGTTAAAGCTGTATCAAAAGCCTTCCCGAAAAAAGTCGGTACGTGGGGTAACACGGAAACACCAAACGGTACGGTTACAGTAACCATCAGCGATGATCATAACTTTGACCGCCAGATTATTATTCCGCCCATTATTTTTAACGGTATAGCGT
ATGACGATCCGGGGAGCGGAAATAACCCAGGAGGCACGCGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGCGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGTTACAGTGCAGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGATTTTCCAGAAAGGCAATCAGGGGGCAGGCAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCGGCTTCCGGCATCAGTATTCGTTGAAATATTTATAACCCCAATAAAGGGCGTCAGGAATGACGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCGTCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCCGGACCAACGTTCCCGGCAGTGATGATCTGAACGGGATTAACGTGAAATATCGTTATGAGTTTACGGATACGCTGGGGCTGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCCGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCGTCTGTGCGCGTGAATGAATGGTTCAGCGCGTATGCGATGGTACTGGTGGAAGAACTATCGAGCAAGCACGTGCGAACTTGCGGGTAATGTATGAGCAAAAAGCTGGCCTTGCTAATACTGACCTAAACACCCTTACCGGTGAATATTCTGGTTTCTATCAACAACCAACGAGCGCTTACGCAACAGAAGAGTTAAATTACCCAATCGGTCTGGCGGGCGCTTTAATAGTGCTCCAAACGAGAGCCAACACTGCTTCTTCCTGCGTTCAGGTGTACCACCCTTATAATAATCCGGGAATTACTTATAGACGAATATATGAAGGAGGTAGCGGTACCTGGTCTGAATGGAAGAGAGATGTATCAACAGAAAGGGTTGAAGAGGGAAAAGAAACAACTTACGTATATTCTACGTATTCTTCAGGCGCACCACGCTTACAGGTTTCCAAATCTGGTTTGTGGGGTTGTCATAATGGCACTGGCTGGTTGCCATTAGCTGTTGGGCAAGGAGGTACAGGTGCGACAACAGTAGAAGATGCGCGAAACAACTTAAGTCTTGGCGAAAGTAGCGCAGTTAAATTTAAAAACCTTACTTTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAA
AAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCTATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCCGGCAGTTAACAGGAAGCGAACATTGATATCCTTCAAGCCCTGAAGGAGGCTGTTCCTGGTACGTTCAGACTGTTGTTGAGCTGGAAATCGCAACGGAGGAAGAAACTTCGTTGCTGGAAGTCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTAATACAACAACTGCACCGGATATTGAATGGCCAGTAGCACCTATAGGGTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP029122|2543045:2571648|2547391_2547733_-|WP_000705211.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >NZ_CP029122|2543045:2571648|2564076_2565126_+|WP_001373319.1|DBSCAN-SWA MRVLLRPVLVPELGLVVVKPGRESMPVFHNTRVLVEPEPKSMRNLPSGVVPAVRQPLVEDKTLLPFFSNARVIRAAGGAGALSDWLLRHIKSCQWPHGDYHHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIGHAISGTQERELSLAELSWWAVRNQVADALPEAVLRGSLGLRAEKIRSMYRESDIVPGEQTANSILKQRTKNLAPLPHAHQQQNPPQEKTVVSIAVDPESPESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKSHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG >NZ_CP029122|2543045:2571648|2563796_2564075_+|WP_023147795.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR >NZ_CP029122|2543045:2571648|2547867_2548194_+|WP_000598292.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >NZ_CP029122|2543045:2571648|2550702_2550813_+|WP_001360138.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC >NZ_CP029122|2543045:2571648|2567076_2569239_+|WP_001373320.1|DBSCAN-SWA MKSMDKISTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGIRTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGQGSPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGRERLVSTARTTETTYRFTQLALGNYRLTVRAVNARGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPAFSLTSDGRLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKIFQKGNQGAGNITDCTVIVTKKAASGISIR >NZ_CP029122|2543045:2571648|2555719_2556685_+|WP_000054501.1|DBSCAN-SWA 
MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >NZ_CP029122|2543045:2571648|2558769_2560119_-|WP_001678529.1|DBSCAN-SWA MNAGLHFSEISKEQFDIYFYGRSPYLKTFSEEIRWFKYEGNGITLLSTIIICNIDKDFNAIVLGRDLDKKFRAINVLASFDSMDVLLNNLNDDIPKMLAQHQNGTFMQGDESTKPFSLFLSKVPAKKRNVYIKMLLEDPLHFPAYIVLEELAYWFKDPDGIFIRDFQSDAFNSRLFELYLNAVFYELDFEMNREYNQPDFLLSKFGVEIAVEAVSIAEAEAPLERKVINDEQMDELRKHVLNVMPFKFARSLLKKVRHCPEPEKVHYWELNHTKNKPFVIAMQDYSKRMSMAFSSEALHSYLYGIDIESGISIERHTDENRSIKSNFFGSEQNNYVSAVLLTTQATIPKFNRMGILAGVEASGFKVYVSGVKTDQDAAPHPFSADVSDPNYQEPWCTAMYMYHNPNAIHPVDYQLFPNVVHVFKKDEHFEEYIPRNYILQSTTMICKTE >NZ_CP029122|2543045:2571648|2549625_2550645_+|WP_000836058.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >NZ_CP029122|2543045:2571648|2555496_2555739_+|WP_072163420.1|DBSCAN-SWA MRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >NZ_CP029122|2543045:2571648|2546083_2546794_+|WP_001321287.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >NZ_CP029122|2543045:2571648|2555224_2555413_-|WP_000854559.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >NZ_CP029122|2543045:2571648|2565138_2565513_+|WP_000904112.1|DBSCAN-SWA 
MLIDLVLPYPPTVNTYWRRRGSTYFISEEGKRYRRAVALIVRQQRLKLSLSGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVRGQPVSGGRLGVKIYPIMHEEQVKK >NZ_CP029122|2543045:2571648|2548399_2549614_+|WP_001295394.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >NZ_CP029122|2543045:2571648|2565509_2566331_+|WP_000762889.1|DBSCAN-SWA MKLEDLPKYYSPKSPGLTDASASTSKDALSITDVMAAQGMTQNRAEMGFSAFLGKMGISMNDRARATELLADYALSRCDRVAALRKLPAEIKPVVMRIMASYAFEDYARSAASKKQCPCCYGEKFIESVVFTNKVQYPDGKPPVWAKCTKGVYPSYWEEWKKVREVVKVACPECGGKGEVSTACKDCRGRGVAIHREESVKRGMPVIRDCQRCGGRGYERLPSTEAFNAICEVTNQITRASWEKTVKKFYDALVTRFDIEEAWAERQLKKVTR >NZ_CP029122|2543045:2571648|2552471_2554943_-|WP_001372999.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >NZ_CP029122|2543045:2571648|2545670_2545976_-|WP_001307224.1|DBSCAN-SWA 
MKLSTCCAALLLALASPVVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >NZ_CP029122|2543045:2571648|2560436_2561039_+|WP_023147793.1|integrase|DBSCAN-SWA MNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDSKLRGCDLVKLKVSDVAYGGSVSSRATVLQQKTGSPVQFEITKGTREAVAALIQLSNLHSKDFLFRSRVGTNQHISTRQYNRIFHGGVEKLGLEDSLYSTHSMRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESIEV >NZ_CP029122|2543045:2571648|2555036_2555228_-|WP_001083281.1|lysis|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL >NZ_CP029122|2543045:2571648|2550832_2552113_-|WP_000877001.1|integrase|DBSCAN-SWA MKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >NZ_CP029122|2543045:2571648|2571522_2571648_+|WP_072163404.1|tail|DBSCAN-SWA MEIATEEETSLLEVWKKYRVLLNRVNTTTAPDIEWPVAPIG >NZ_CP029122|2543045:2571648|2570070_2571468_+|WP_032181053.1|DBSCAN-SWA MWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNTQHTDNINKFIPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDLAVRSLTTSNPVKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLVWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQTLTINKNEVNSTVDLTLTKQSGTGNRFVLQNSGNAELPFSVRVWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLKENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYSAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRISTLENQVSELVALVRQLTGSEH >NZ_CP029122|2543045:2571648|2543045_2545472_-|WP_000041556.1|DBSCAN-SWA 
MSKNDRMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAATVQQASEKVIWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGTRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDNTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >NZ_CP029122|2543045:2571648|2561398_2562379_+|WP_023147794.1|DBSCAN-SWA MKFKDKNLKALAECIIGDNKAFLYRSSSHITEFFQDCGMDVTHDGSTRWKWTAQRLEELLYEPQSKPHTLPERFVHVLRTLMLKEDAMDDDPGRLKALEELNKPLMREGYEAFYGDDRLLYIRHTDTKTVSVSNNPHRPLTPHEVECRRLLTAFLDTCSEDELIEDILLPLFRQLGFHRITAVGHKDKALEYGKDIWMKFTLPTQHVLYFGIQAKKGKLDASGASKSTNSNVAEIFNQVLMMLGHEIFDPETNRKVLVDHAFIVAGGEITKQARNWLGGKLDASKRSQIIFMDREDILNLYTVSNVPLPTGALISDDAVKNDDIPF >NZ_CP029122|2543045:2571648|2563478_2563730_+|WP_000980999.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >NZ_CP029122|2543045:2571648|2563050_2563263_+|WP_001013632.1|DBSCAN-SWA MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIMTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP029122|2543045:2571648|2546796_2547357_-|WP_001138581.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >NZ_CP029122|2543045:2571648|2562898_2563006_-|WP_122083109.1|DBSCAN-SWA MLTGAFLYLPLVFMPEADSLKHPQQFYLTPVTSPI >NZ_CP029122|2543045:2571648|2557277_2558222_-|WP_001678528.1|DBSCAN-SWA 
MDLIMEWRFLGSLSEARKSGCSGVYLIVHKGLFSRVVYVGVSCNVGRRITEHYDGYLRGNRTIYDAGHDEDVYRFMSAYKIHNHTKYYQALANDYKIWASTTMYSDLPKNMLAKSQTFDTDWQSIALEKYIPQLVVWALPMAKYCYLNASRIESVIQSKLIKSFDLRGFFNIKQLSILGKIEYPYMEKVKVFIINTPDLDPASQLIFSNLYNKKTDNNFCKEFRSQFKSEIFQRESETQRKRTIREHKVSLYENYGKPWTLKEMEKLRVMLVDFDLSPIEISEYLGREPRSISKKISENDKVTNYKWRESVGWL >NZ_CP029122|2543045:2571648|2556725_2557148_+|WP_001373616.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK >NZ_CP029122|2543045:2571648|2552147_2552384_-|WP_001296941.1|DBSCAN-SWA MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT |
31 | Escherichia_phage(25.0%) | lysis,integrase,tail | attL 2544110:2544124|attR 2567888:2567902 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
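The keyword rule described above can be sketched roughly as follows. This is only an illustration, not DBSCAN-SWA's own code. It assumes the pipe-delimited header layout seen in the protein records on this page (contig, region, start_end_strand, protein ID, an optional keyword, then the tool name). The names `parse_header` and `phage_keywords` are hypothetical, and the keyword list is copied from the sentence above, which gives it only as examples ("such as"), so the tool's real list may differ.

```python
# Keyword list copied from the description above (the tool's real list may be longer).
PHAGE_KEYWORDS = ("capsid", "head", "integrase", "plate", "tail", "fiber",
                  "coat", "transposase", "portal", "terminase", "protease",
                  "lysin", "tRNA")


def parse_header(header):
    """Split one pipe-delimited protein header into named fields (assumed layout)."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    start, end, strand = location.split("_")
    # Anything between the protein ID and the trailing tool name is treated as annotation.
    annotation = " ".join(fields[4:-1])
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "annotation": annotation,
        "tool": fields[-1],
    }


def phage_keywords(annotation):
    """Return the keywords found in the annotation (case-insensitive substring match)."""
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    # Headers taken from the protein records listed on this page.
    examples = [
        ">NZ_CP029122|2543045:2571648|2560436_2561039_+|WP_023147793.1|integrase|DBSCAN-SWA",
        ">NZ_CP029122|2543045:2571648|2571522_2571648_+|WP_072163404.1|tail|DBSCAN-SWA",
        ">NZ_CP029122|2543045:2571648|2547391_2547733_-|WP_000705211.1|DBSCAN-SWA",
    ]
    for header in examples:
        record = parse_header(header)
        hits = phage_keywords(record["annotation"])
        print(record["protein_id"], record["annotation"] or "-", hits)
```

A plain substring match is used here for simplicity; a record without a keyword field simply yields an empty annotation and no hits.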
DBSCAN-SWA_4 |
2973073 : 2983851
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP029122|2973073:2983851|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCC
AAGCAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATT
CCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCG
GGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTC
AATACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCA
GGAATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP029122|2973073:2983851|2981118_2981355_+|WP_000088653.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >NZ_CP029122|2973073:2983851|2977393_2977933_-|WP_001753331.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >NZ_CP029122|2973073:2983851|2979255_2980320_+|WP_001678641.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >NZ_CP029122|2973073:2983851|2978115_2978427_+|WP_072163463.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >NZ_CP029122|2973073:2983851|2973073_2975029_-|WP_000379042.1|DBSCAN-SWA MDTAEHDGKFAWASFYEAFANALLTWRNRRDELVKGIHLIASGVEGMSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK >NZ_CP029122|2973073:2983851|2982600_2983851_-|WP_000444487.1|DBSCAN-SWA 
MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >NZ_CP029122|2973073:2983851|2980473_2980692_+|WP_001678640.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL >NZ_CP029122|2973073:2983851|2981344_2982487_+|WP_000741339.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA >NZ_CP029122|2973073:2983851|2978423_2979104_+|WP_001372461.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP029122|2973073:2983851|2979100_2979259_+|WP_000149533.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >NZ_CP029122|2973073:2983851|2980739_2980979_+|WP_000488406.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK |
11 | Enterobacteria_phage(40.0%) | integrase | attL 2971046:2971069|attR 2982554:2982577 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those matching specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
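The keyword rule described for the homologous phage analysis can be illustrated with a minimal sketch. It assumes that a protein is flagged when its annotation text contains one of the listed keywords as a case-insensitive substring; the report does not say how the matching is actually done, and the function name `matched_keywords` is hypothetical.

```python
from typing import List

# Keyword list copied from the report text; the matching rule below
# (case-insensitive substring search) is an assumption, not the tool's
# documented behaviour.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> List[str]:
    """Return the phage-related keywords found in an annotation string."""
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    for label in ("integrase", "Phage tail fiber protein", "hypothetical protein"):
        print(label, "->", matched_keywords(label))
```

Substring matching is deliberately simple and can over-match (for example, 'head' inside an unrelated word), so a word-boundary check may be preferable for real annotations.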
DBSCAN-SWA_5 |
3298227 : 3306997
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP029122|3298227:3306997|DBSCAN-SWA TTTAACTGGAAACTTCCGTTGACCGGCTGTATTCATGGCTGCGAATTTTCGCCATCAGCTCGTCTGTCAGTTCGGACACCCACTGGATAGCCAGCCGCTTTTCTTCGTCGCTGCACTCACTAGCCGCTACAAGCTTGATAAAAAAATCAATACGCTGAAGCTTCAATGACTCCAAAAGATAGTCCTGCATCTTCCCTCCTATCATTACACGGATACACAAAAACTGTATATACACCCACTGTTTATATAAACAGTATAATAGGAACAGAAAAATGTAAAACTGTTTTTTGTCAGTTAATTGGATGTACTGATGTCGGTCAATAAAGCACAAAATGTTAAACAGCAGCCTTAGTACCATTGACGCCATTTGTCATCTTCCTGCAGCCTCTGGTTACGGTAAAAAATACGTAAACCGGCACCGGATGGAATGCTGCCGCCACGCAGAAGCAAATCAATCTCAGATGCACTACCTTCAAACCCCCTGGTTGTCAGTTCTGCCTCAAGCTGCAGGCGCTGCTGCTCCAAAATACTCTGTATGTATGCTTTTTTCCGCTTCGGTTTTACCAGTCTCAACCTGGCTGTCAGCTCCCGCCGTTCCTTCTGGCCCATGTTGTGGAGATATTCCTGCAGCTCCTTCTCATCCATGGTTTTAATATCGGGTAAATCACCCCCTGATTTGTTCAGATTTTCAACAGGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCTTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTGCCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCATGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATGAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCAGCAGCCAGAATGGCATCAAGACGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAAGCGAAGTTCACGGTAAGCCTGACGCCCCGGAATACCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTCACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGACCGTCAATGTTCTTACTGATATATTTCGCGATGTAGCTTGTCGGCGTTCCTTTGCGCGGGTTAATCAACTCAGACTTAAAGCGCGGCCCAGTGTTATTGCCCAGCTCCTCGCGGTCTTCACGGATGGCAAACTTACGCAGTAATGCAGTGATGGCACGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCATCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAACCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATACCGACCAGATAATCGCTGCTTTGTCTTACCGTCGCGTTTGTCCAGGTCGGGTTTGGTCTGCCGTTATTTAGCGTGGAATGGAAACGCGACGGACAGGTGATAGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGACGCTGCTCACGTTTATGCATCACGGCTTCATAGCTGACATAAGGAGATGCTTTTTTGCTGACCAGGCAAACAGCGCGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGATACCACCA
GTCGGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGACGGAGTTGCTCAAACGCAGGTGGGATGACATCCAGACGCAGGGTTTCCGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACGTCATCGGTGGCATACAAATCACCACAAGCTGCATCAAGGCACATACTCATATGCGCAGCTACCAGGGTGGAAAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAGGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGATGCAGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGTAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTGCTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTGACGTCCGCCAGAATGACCATGTCTGAATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGTGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAACTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGTAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATGGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCAGCCTCGTTAAAGCCGCAGCAGTTGTTCATATCGGCATGACTCATGCACGTACTCCGTACACGGCAGAACTATCCACGCCACGCGAATAATCAAATCCCATCCAGCAGCGCGGCCCGGAAACAGCAATGATTTCTGTTGCTGATTTACCCTCGCCAGCTGCCACACCGATGCTGCGTTTTACCTTGATATAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATATGGCGGATCGCAATACACCACATCTCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTCGCTCGCTGGGCTTTTTCTGCAAATGTGCGAAGTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGCAGTTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCGGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACCAGATAGCTGGGATAATCCGTTTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGGTATTTTTTCAGTTCTGGCATAATGGCGGTTTTATTTCCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGTTCTGCGATTTCCTGACAGGTAATGCAAAGCTGCACTCCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGCGCTTCACATTCAATGCAAAGTACGCGTGACACGCCCGGTGATTTGGCACGGGCTGCACGGATATGGCGCTGGCGTTCTTCTTCAACGCGCTGCTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACGCGCAGCAGTTCTGCCGCTTCCACGTGGTTTAGCTGGCGGGATGAGATATGACACGCCA
GGCTATCAAGGCGAGCAGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGTTTTAGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTGGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAGCTCCCGCTTTTCGTCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATCACGCATATCTTTAATAAACCGCTCAAGCTCTGACTCAATATTCAGGCCAAAAACTTTCGCCCTTAACTCCGCAATGTGATTAAGTCCATTCAGGCGTTCACCGGGGCTTAATGGAACAGTTGCTGCAGCGCCATTAATTGCCATACTTCATATCCCCCAAACGCAGCTATCGTTCTTTGTTCTTACGGTAACGCTCAAGAGGAGATACATTTTTTCGTATCGTCTCTTTAACCTGCTCTCCCCGTAAAAACGTCCCATCCTTTAACGTGAAAAAGTAACTGCCATCGCCCGACAATGACGGATAGCAACAGAGCAAATCATCTTCAGGTACTGAATAACTCTCCCCTCTGTAACGAAACTGATAAACCACTTCACTTTCTGCCGCATACATTTGGACTTTCTCCGTTTCCTCGTGGTCAATTCAGACAGCAATTCATCTTGTGAATGACATGGATGCCAGCGTTTTCCATCCTCACCCGTGATCCAGCCGTGACCGTAGTGCATTGCCGGGCTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAACACCTCACAGCAAACCGAATGAAGCACCAAGGCCAGTCATGGTATCAACTGCACTCGCCATCGCAGGATTAGCCTGTAAACGGGCTTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGCGGTAGTTTCTCTTTTGCCACCTCATTAATCGGAACACATGGCAGACAATGAATCTGTGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAAATTTCTGGTGCGGTTAATAAATGAGGTTGAGCTGGGTTCAGTTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGATTGTGGCGCAGTGCGAATGCACGACAGGCTTCATCAAAATGTGGATGTTTGGAAACTTGGTAATCAAACATGGTCGACACCTCTGATGTATCCCAAAATGGAACTAGTTGAATACAACATTGCAATCAGTAAGTGCATCAACGGTAAGAGCAGCAAGGTTGATCATTACCTTTTCTCTTTTCTTGTCTTTCCGAAGGCGATGCCGAGGGATGCGACCGTCAGCCAGCATATCGTTAATTGTGTCGATTGAAAGACCAGTAAGTTCGCTATAACGCTCAATTGTGACATGTGGCGTATTCAGGGTTATTGAAATGTTAGGGGTCATGATGCAACATCTCCTATTGGCTTGTGGTGAGTCAGTTTTAATCGTGGCTTTAACTTCACATTTCGGAGAATAGGATCACAAATCGGTTATGTCAACACATGAAATCACATTTCGCCATGTGGACGAAAAAAAGAAATCCTTAATCATGCAGAATCGCGGAGGGCAATCGGTTATAGATCGGATACTGAAAGCCTATGGTTTTTCTTCCCGACAAGCATTCTGTAATCACCTAGGTATATCGCAAAGTACAATGGCGAACAGGTATGCCCGTGACAC
TTTCCCTGCTGATTGGGTTGTTATCTGTAGCATGGAGACTGGAGTGCCGGTCGAGTGGTTGGCATTTGGCACTGATACCGAGAAGGGAAGCATTACAAATAATGCAGAAAAAAGTCACAACAATTGTGACAGCAAGCATCAACATCTCAATAGAGAACAAGACATCCAAAATGAGAACTCTTTTACTATTAACCAAGGTGGAAAAGCAGCAATAGAGCGAATCGTTTTGGCTTATGGATTTAAGACAAGACAAGCTTTAGCTGATCATATTGGTGTATCAAAAAGTACATTAGCCAATCGTTACATGAGAGATACCTTTCCTGCTGACTGGATTATTCAATGCTCACTGGAAACCGGTGCTTCATTAACATGGCTAACCACTGGTAACGGGGCAATGTTTGAAAAGCCTCGAAACGATACTATCACTATCCCATATCATAAAATAATTGATGGATCTCTTGCTCAAGAAACCTTCTTGACTTTTGACTCTAAGTTGTTAGAAGGAACCTTTCTGCAACCTTTAGCAGTATTCATTGATGAGGAAATATATATTGTAGAATCAAAATTTAATGAAGTTACTGATGGCAAGTGGCTTGTGAATATTGAAGGGAAAATAAGTATCAAAGATTTGACTCGCATACCCGTTGGTATGGTTAAAGTTGTAGGCACTAACGCAAGTTTTGAATGCTTACTTACTGACATTATCGTTTTGGCAAAATGTAAAAGAGTTTTTACTAAAAATGTATAAAGAGAAACATCATGACTGAACCAACCAATAAAGATAGCGAAATAAAAAAACACCTATTAGAATTTCTTGATTCACAGTCTGAAAATATAGCAAAACACTTCTACTCTCATATAAAAGACTTAATAGAAGCAGGAGAGCTTTCTGAAGCTCATAATAACCTAGCGCTAATTGAAAAATACATAACTAGGCCACCGATGGATGAAGAACCCAATATAAATGAAAATAAAGCCAATAAAAGAAAAAATGTAAAATCACTTGAACCTAATAATTATGTAGAACATATAATACAATTAGAAGAACGAAACAGCATATTAACTCTACAGTTAGAGCATTATACTCAGGATCTTAATAGAAAAAACGCAATAATCGAAAACAACGTAAAACAAATTAATTCATTGATTAGTGAAAATAAGGAACTCCGTAGCCAAGTACAGCAACAAAGAATCGATGATAAAATCCCCACCTATGTTAACGATGTTAAATCAGATCTTGGTAGTGATGACAAACATTTTATATTGATGTCTATTATCTGGTCTATTGCAGGGGTATTTTTTGGCTTCCTTGCAGTAGTATCTGCTTTTTTTACATTATACATGAACTTAGATTTAAAAAATCTCACTAACCTTCAGTTAATATATATCTTCACGCGAGGATTAGTTGGAATCGCCATTCTTTCATGGCTATCATATATCTGCCTTAGTAACTCAAAAAAGTACACACATGAATCGATCAGGCGAAAAGATCGTCGACATGCTTTGATGTTTGGTCAAGTTTTTTTGCAGATATACGGTTCTACAGCAACTAAAGAGGATGCAATAGAAGTCTTTAAGGATTGGAATATTTCAGGTGACTCTGCATTTTCAGGTCAGACAGAGCAACCACCGAGTTTTGCGTCATTTTTGAATACAATCAAAGACAAAGTTAAAGTAACTGGAAGTGATAAAGAAACAGATTAATCATGAACATGTATGCTACTAAGTAAAAAATACATTGAATACTGTTGTTATATACAGTTAAATTTAGCCCTCTGATATGAGGGCATTTTTTATGGCAGTACGAAAACTCACCACAGGAAAATGGCTTTGCGAATGTTACCCCGCCGGACGTAGCGGACGCCGTGTGCGTAAACAATTCGCCACCAAAGGCGAAGCACTGGCCTTCGAGCGATACACCATGGGGGAAATAGAAGCAAAACCCTGGCTGGGCGAATCAGTGGATCGTCGGACACTGAAAGATATGGTTGAGCTATGGT
TCAAATTACATGGCAAATCTCTTACTGCCGGACAGCATGTCTACAACAAGCTGCTGTTGATGGTTGACGCCTTGGGAAATCCCCTTGCAACTGATCTCACCTCAAAAATGTTTGCTCACTATCGAGATAAACGCCTGACAGGCGAGATCTACTTCAGCGAGAAATGGAAGAAAGGAGCAAGCCCGGTCACCATTAACCTGGAGCAAAGCTATCTAAGTAGTGTTTTTAGCGAACTATCCCGTCTGGGCGAATGGTCGTATCCGAACCCACTGGAGAACATGCGAAAATTCACCATCGCAGAAAAAGAGATGGCATGGCTTACCCATGAGCAGATTGTTGAATTGCTGGCTGATTGCAAACGTCAGGACCCAATTCTGGCACTGGTAGTTAAGATATGCTTAAGCACAGGCGCACGCTGGCGTGAAGCCGTAAATCTTACCCGCTCACAGGTGACCAAATACCGAATTACCTTTGTCAGAACGAAGGGGAAGAAAAACAGAAGCATCCCTATCAGTAAAGAGCTTTACGAAGAGATCATGGCGCTCGATGGGTTCAATTTCTTCACAGACTGCTATTTTCAATTTTTATCCGTGATGGAAAAAACGTCTATCGTGCTCCCTCGCGGTCAACTCACACACGTTCTGCGCCATACGTTTGCAGCGCACTTCATGATGTCGGGTGGAAACATTCTGGCCTTACAAAAAATTCTCGGACACCACGATATAAAAATGACTATGCGTTACGCACATCTGGCACCGGATCATCTGGAAACGGCGCTCCGTTTCAATCCTCTGGCAACGCTGCCAAGTGGCGACAAAGTGGCGGCAGCGGTTGGCATTACCCCGTAA
Protein sequences of DBSCAN-SWA_5 >NZ_CP029122|3298227:3306997|3300964_3301822_-|WP_001544405.1|DBSCAN-SWA MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNCSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKELRTFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKSATEIIAVSGPRCWMGFDYSRGVDSSAVYGVRA >NZ_CP029122|3298227:3306997|3304018_3304897_+|WP_001680871.1|DBSCAN-SWA MQNRGGQSVIDRILKAYGFSSRQAFCNHLGISQSTMANRYARDTFPADWVVICSMETGVPVEWLAFGTDTEKGSITNNAEKSHNNCDSKHQHLNREQDIQNENSFTINQGGKAAIERIVLAYGFKTRQALADHIGVSKSTLANRYMRDTFPADWIIQCSLETGASLTWLTTGNGAMFEKPRNDTITIPYHKIIDGSLAQETFLTFDSKLLEGTFLQPLAVFIDEEIYIVESKFNEVTDGKWLVNIEGKISIKDLTRIPVGMVKVVGTNASFECLLTDIIVLAKCKRVFTKNV >NZ_CP029122|3298227:3306997|3302346_3302688_-|WP_000996717.1|DBSCAN-SWA MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >NZ_CP029122|3298227:3306997|3305944_3306997_+|WP_001372563.1|integrase|DBSCAN-SWA MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERYTMGEIEAKPWLGESVDRRTLKDMVELWFKLHGKSLTAGQHVYNKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAVNLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPSGDKVAAAVGITP >NZ_CP029122|3298227:3306997|3303109_3303619_-|WP_000460892.1|DBSCAN-SWA MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTMTGLGASFGLL >NZ_CP029122|3298227:3306997|3301818_3302046_-|WP_000752610.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKSPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV >NZ_CP029122|3298227:3306997|3298227_3298416_-|WP_001376441.1|DBSCAN-SWA MQDYLLESLKLQRIDFFIKLVAASECSDEEKRLAIQWVSELTDELMAKIRSHEYSRSTEVSS >NZ_CP029122|3298227:3306997|3298574_3300968_-|WP_001376443.1|DBSCAN-SWA 
MSHADMNNCCGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSTLSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNSDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSASFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEAVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPPVENLNKSGGDLPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYIQSILEQQRLQLEAELTTRGFEGSASEIDLLLRGGSIPSGAGLRIFYRNQRLQEDDKWRQWY >NZ_CP029122|3298227:3306997|3304908_3305853_+|WP_001678408.1|DBSCAN-SWA MTEPTNKDSEIKKHLLEFLDSQSENIAKHFYSHIKDLIEAGELSEAHNNLALIEKYITRPPMDEEPNINENKANKRKNVKSLEPNNYVEHIIQLEERNSILTLQLEHYTQDLNRKNAIIENNVKQINSLISENKELRSQVQQQRIDDKIPTYVNDVKSDLGSDDKHFILMSIIWSIAGVFFGFLAVVSAFFTLYMNLDLKNLTNLQLIYIFTRGLVGIAILSWLSYICLSNSKKYTHESIRRKDRRHALMFGQVFLQIYGSTATKEDAIEVFKDWNISGDSAFSGQTEQPPSFASFLNTIKDKVKVTGSDKETD >NZ_CP029122|3298227:3306997|3302805_3303102_-|WP_000956192.1|DBSCAN-SWA MLTKEPSFASLLVKQSPAMHYGHGWITGEDGKRWHPCHSQDELLSELTTRKRRKSKCMRQKVKWFISFVTEGRVIQYLKMICSVAIRHCRAMAVTFSR >NZ_CP029122|3298227:3306997|3302045_3302279_-|WP_001244224.1|DBSCAN-SWA MRNIETLTTKTGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHISSRQLNHVEAAELLRVTAEAIQNEAQEIH >NZ_CP029122|3298227:3306997|3303651_3303873_-|WP_000188448.1|DBSCAN-SWA MTPNISITLNTPHVTIERYSELTGLSIDTINDMLADGRIPRHRLRKDKKREKVMINLAALTVDALTDCNVVFN |
12 | Salmonella_phage(90.0%) | integrase | attL 3297897:3297910|attR 3307039:3307052 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those matching specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
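The protein records listed for each DBSCAN-SWA region carry pipe-delimited headers such as `>NZ_CP029122|3298227:3306997|3305944_3306997_+|WP_001372563.1|integrase|DBSCAN-SWA`. The sketch below shows one way to split such a header into fields. The field meanings (contig, region range, gene range with strand, protein accession, optional annotation, tool name) are inferred from the records shown here and are assumptions; `ProteinHeader` and `parse_header` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    annotation: Optional[str]  # e.g. "integrase"; None when the field is absent
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Parse one protein header line (field layout is an assumption)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, accession = fields[:4]
    # Six fields means an annotation sits between the accession and the tool name.
    annotation = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        accession, annotation, fields[-1],
    )


if __name__ == "__main__":
    h = parse_header(
        ">NZ_CP029122|3298227:3306997|3305944_3306997_+|WP_001372563.1|integrase|DBSCAN-SWA"
    )
    print(h.accession, h.strand, h.annotation)
```

Only the gene field is split on underscores, so contig identifiers that themselves contain an underscore (such as `NZ_CP029122`) are left intact.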
DBSCAN-SWA_6 |
3386580 : 3412836
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP029122|3386580:3412836|DBSCAN-SWA TATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTAT
AAAAAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGC
CCCAATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTT
CATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTT
AATCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCAT
TAACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTT
TTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTT
TACCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTT
CCTTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACACGAAACATCGACACAATACCTTCACTAATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCC
AAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTTACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCA
TGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGGCCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGATATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGA
TTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCGATAAATTAACTAAAACTTGC
CCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATT
GCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP029122|3386580:3412836|3410085_3410307_+|WP_000763365.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >NZ_CP029122|3386580:3412836|3397027_3397987_-|WP_000592543.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >NZ_CP029122|3386580:3412836|3398859_3399237_-|WP_001204777.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >NZ_CP029122|3386580:3412836|3393679_3394090_+|WP_000079508.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >NZ_CP029122|3386580:3412836|3404345_3405101_+|WP_000259990.1|DBSCAN-SWA MVVFSQQPFSFDGIKPWLYIWTMKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >NZ_CP029122|3386580:3412836|3394441_3394594_-|WP_001139678.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >NZ_CP029122|3386580:3412836|3392440_3393001_-|WP_001372490.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >NZ_CP029122|3386580:3412836|3403467_3404007_-|WP_001182899.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI 
>NZ_CP029122|3386580:3412836|3410517_3411120_-|WP_000120065.1|DBSCAN-SWA MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >NZ_CP029122|3386580:3412836|3399322_3399463_-|WP_000971068.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >NZ_CP029122|3386580:3412836|3409705_3409987_+|WP_001395510.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >NZ_CP029122|3386580:3412836|3398179_3398704_+|WP_000780581.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >NZ_CP029122|3386580:3412836|3404076_3404307_-|WP_001067458.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >NZ_CP029122|3386580:3412836|3407305_3407512_+|WP_000233576.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >NZ_CP029122|3386580:3412836|3387928_3388405_+|WP_000767389.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >NZ_CP029122|3386580:3412836|3409348_3409531_+|WP_072126246.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >NZ_CP029122|3386580:3412836|3405969_3406797_+|WP_000210934.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >NZ_CP029122|3386580:3412836|3390555_3390732_-|WP_072163407.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE 
>NZ_CP029122|3386580:3412836|3400101_3400272_-|WP_000224914.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >NZ_CP029122|3386580:3412836|3399818_3400109_-|WP_001372487.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA >NZ_CP029122|3386580:3412836|3389150_3390482_+|WP_001753290.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >NZ_CP029122|3386580:3412836|3386580_3387870_+|WP_001356070.1|DBSCAN-SWA MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >NZ_CP029122|3386580:3412836|3401366_3401927_-|WP_000720581.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >NZ_CP029122|3386580:3412836|3411765_3412836_+|WP_000533646.1|integrase|DBSCAN-SWA 
MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >NZ_CP029122|3386580:3412836|3411569_3411788_+|WP_001303849.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >NZ_CP029122|3386580:3412836|3405223_3405973_+|WP_000389051.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >NZ_CP029122|3386580:3412836|3400723_3400825_-|WP_072157016.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >NZ_CP029122|3386580:3412836|3395542_3395758_-|WP_000839582.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP029122|3386580:3412836|3402411_3402705_-|WP_000145917.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >NZ_CP029122|3386580:3412836|3400271_3400727_-|WP_001372486.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >NZ_CP029122|3386580:3412836|3393389_3393623_+|WP_000105084.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >NZ_CP029122|3386580:3412836|3409503_3409695_+|WP_023148020.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >NZ_CP029122|3386580:3412836|3399459_3399822_-|WP_001372483.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE 
>NZ_CP029122|3386580:3412836|3390881_3391550_+|WP_000239881.1|DBSCAN-SWA MVKLMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >NZ_CP029122|3386580:3412836|3400917_3401370_-|WP_000825400.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH >NZ_CP029122|3386580:3412836|3411362_3411530_+|WP_000545745.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >NZ_CP029122|3386580:3412836|3407587_3407884_+|WP_000995439.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >NZ_CP029122|3386580:3412836|3395045_3395543_-|WP_001372488.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >NZ_CP029122|3386580:3412836|3394622_3394829_-|WP_001228702.1|DBSCAN-SWA MRKLKMMLFGASLIMVVGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQLLDETFSVSGTELSATKQH >NZ_CP029122|3386580:3412836|3408671_3409352_+|WP_001372450.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP029122|3386580:3412836|3407889_3408675_+|WP_000100847.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA |
41 | Enterobacteria_phage(46.88%) | lysis,integrase,tail | attL 3388496:3388510|attR 3412910:3412924 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
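The keyword legend above and the pipe-delimited protein headers in this record can be read programmatically. The following is a minimal sketch, not the tool's actual implementation: the function names are hypothetical, the case-insensitive substring match is an assumption about how keywords are applied, and the header layout (contig, region, coordinates with strand, protein accession, optional label, source) is inferred only from the headers shown in this record.

```python
# Keyword list copied from the legend on this page.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation):
    """Return the legend keywords found in an annotation string.

    Assumption: matching is a case-insensitive substring test.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def parse_protein_header(header):
    """Split a header such as
    '>NZ_CP029122|3386580:3412836|3411765_3412836_+|WP_000533646.1|integrase|DBSCAN-SWA'
    into its fields. The label field is optional (absent in most headers).
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, protein_id = fields[:4]
    label = fields[4] if len(fields) == 6 else None
    start, end, strand = coords.split("_")
    region_start, region_end = region.split(":")
    return {
        "contig": contig,
        "region": (int(region_start), int(region_end)),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "label": label,
        "source": fields[-1],
    }
```

For example, `parse_protein_header` applied to the integrase header of DBSCAN-SWA_6 yields the label `integrase`, which `matched_keywords` would then flag as phage-related under the assumed matching rule.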
DBSCAN-SWA_7 |
4197341 : 4218730
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP029122|4197341:4218730|DBSCAN-SWA ATTAATGCTCGCGGGTCTGGTGGAACTGAACGTCCGGATAACGTTCCTGTGCCAGGCGCAGGTTAACCATGCTGGTAGCGATGTAAGCGAGATTATCGCCGCCATCAAGCGCCAGTTGGCTTTCGTTCTTACGCTTGAACTCTTCAAATTTCTTCGCGTCCGCACATTCTACCCACCGGGCGGTGGCAACGTTGACTGATTCATATACTGCTTCAACGTTGTATTCGCTCTTCAGGCGCGATACCACTACATCAAACTGCAGCACACCAACCGCACCAACGATCAGGTCGTTGTTGGAGATCGGACGGAACACCTGCACCGCGCCCTCTTCGGAAAGCTGTACCAGCCCTTTGAGCAGCTGTTTTTGCTTCAGCGGATCTTTCAGGCGGATACGACGGAACAGTTCTGGTGCGAAGTTCGGAATACCGGTGAACTTCATCATCTCACCCTGGGTAAAGGTGTCGCCGATCTGAATGGTGCCGTGGTTGTGCAGGCCGAGGATATCGCCAGGATACGCTTCTTCAACGTGCGAACGGTCACCCGCCATAAAGGTCAGCGCGTCAGAGATCACCACGTCTTTCGCGGTGCGCACCTGGCGCAGCTTCATGCCTTTTTCATATTTACCGGATACCACGCGCATAAACGCCACGCGGTCGCGGTGTTTCGGGTCCATGTTGGCCTGAATTTTAAATACGAAGCCGGTAAATTTGTCTTCGCTCGCTTGTACGGTACGGGTATCAGTCTGACGCGGCATCGGCGCAGGTGCCCACTCCACCAGGCCATCCAACATATGATCGACGCCGAAGTTACCCAGCGCAGTACCGAAGAATACCGGAGTGATTTCGCCCGCAAGGAACAGCTCTTTGTCGAACTCGTTAGACGCGCCTTTAACCAGTTCCAGTTCGTCACGCAGCTGCTGTGCCAGATCTTCACCAACCGCAGCATCGAGATCCGGGTTATTCAGCCCTTTAACAATGCGGACTTCCTGAATGGTGTGCCCTTTACCGCTCTGATAGAGATAGGTTTCATCTTTATAAAGGTGGTAAACGCCTTTAAACAGCTTGCCGCAGCCAATTGGCCAGGTGATCGGTGCGCAGCCAATTTTCAGCTCGTTCTCAACTTCATCGAGCAATTCCATCGGGTCGCGGATATCACGGTCAAGTTTGTTCATAAAGGTGAGGATCGGCGTGTCGCGCAGACGGGTAACTTCCATCAGCTTACGGGTACGATCTTCAACACCTTTTGCGGCGTCGATAACCATCAGGCAGCAGTCCACCGCCGTCAGGGTACGATAGGTATCTTCCGAGAAGTCTTCATGCCCCGGGGTGTCGAGCAGGTTAACCAGGCAATCGTGATACGGAAACTGCATCACAGACGTAGTAATGGAGATCCCACGCTGCTTTTCCATCTCCATCCAGTCCGACTTAGCGTGCTGGTTGGAACCACGGCCTTTTACTGTACCGGCGGTCTGAATGGCCTGTCCGAACAGCAGCACCTTCTCGGTGATGGTAGTCTTACCGGCGTCCGGGTGAGAAATAATGGCAAAGGTTCTTCTTTTCGCTACCTCTGCGGCCAAAAGGCTTGTCATAATTGCTGTCTCTTTGATTATGTACAATCATATGTACAAAGAAAAATTCATGCCGCTGATTGTACATTTTGGAGGTATCAGAGCATAGCAGAACATGATCAGTAGGTGTATTTGTAGCCAAAGAACATTTGTACATGCAGCAATCTTTGTTCTATTGACACTGTTGATTGGGCGGTGTACAACACAAACAAAAACAGGATGTTAGAGGTCTCAGCAGGACACCGACCAGACGGTGAAGTGACAAAAAGATACGCAAGGGAGCCGCGGCTCCCTACTGAAATATTATGACTTTAAGTGAATTTTTACTTCTTATTGCTATACTAGTTGACATAAAA
TTATACCGCCTTTCAATATAATCGCTCAAAGCAGTTGAAAATTTGTTCTTTAAGCCCCATATTCATTAAACACCAAATACTGTGGATAAAAATTTTCCAATAAGCTAGGATTTGTCCCAATTTATGGGTACCAGCTGGAAACGAACAAACAATGAACTCAAACGTTGCTAGTATTTATAGTGCTGCTAATGTTAACAGTAATGATTTAGCTTTAGAGTTGTACTGGAAAATCCAAGAAGTCTCTGCATGGTTTGTGAAACATGTAAACGCAAAATCGGTTGAGCAACTACGCGACTTTAACCCGTCATTCGCTGAAATTGCCGATCTTTCTGACGCCACTGCAGATATCATTACGAAGTTGCTTCAAGTCGGTGTTTGGGACGATGAAAGAGTTATGGCAAATGCCCGCCAAGCAGTGCTTTTAATGCGCCAAGTCGCAGAAGCGATCGAGCGTGGCGATAACGACAGTATTCAAGACGGAGCCAACCGCTTATCAGCAATGGCATTCGTTTAACTTAGTCAACTTAAAAAGTGAGTTTTACCTAGCTAGGTAAGTCAGGAGCCAATATGATGAATAAAATTGAAGCACGCCGCATTGCGCTGTTGCGAGAAGCCATCAAAAACGTTGATAAAATCAAAGAAATTCAAACGTTTATCGATCAAGAGCTAAAGGCTATGAATCGAAAAGCTGCATAAAGTAAAAAAACCCGGCAAAGCCGGGTTTTTTATTACCATCCTTTTACACTTTCGGCAGAACAGTTGACCACCATTCTGAAAGAACGTATTTCCCTTAGGGCGAGGATTAATGAGTTCCGCGCTTTTAGCATCTCCGGATACCTCACGTACGCCTTAACCCTCTGCCGCAGAGTCCGTATTCTGAGAGGCTCGCCAACGGTAAGCAAAGGCAAAAAGCGTCAGATCCAGCTCGAAGGTCTCGTGCATTCCATCCACGGACACGGAGGTACCCAGTAATCCCTTCCGAACCAGGGACTCCACACCAGTTTGCGGATCATCAAGCCGGTTCATTTGTCCCCATGTCATCACAGCACCGCCGAATGCTACAAAGTGCTGTACCACGTTTTTTTCACTGTCCGTAAGTGAATCATAAAGTGCGGCCAACTGCTGCCTTGAAGCAGCTTTTCCTTTCTGGGCAAGAAGACGCTGATGCCATCCTCCGGCAATGATCTTGACGATCGGGAATGGAATAATAAATGCGAACAGCGTCAGGACATAGTCGTGAAGGATCAGATAACAGCTCATTCCGGCAAGTCCCGCGACAAATATGGCGACATTAATGCCGAAATCCTTCTCCGTATAAACCCTGTCGAACAGTTTTCCAAGAAATTCAGTCATTGATTTACTCCCTACATGCTATGTCGCACCGCCCCAAATCAAAAGAGCGAATGGTTCAGTGTCGGCAGTATTCCCAATGATTAGCCAACATGCGCCCGGCATCATACCAGTTTTCAGTCAACACCAAGCCGGCAATAAGAGAGAACTCTTGATCCATCATGGCAATCAGATCTCCAAACAACACTGACAGAAATTCCTGGATGAAGTTGCTTTTCGATTTACCGGCTACGCTGACCTAAGTGGTGTTGCCGTATTTATCAGTTTATTAACAGCCCTCATAGCCCAGTACCTGTGAACATTGCTCAGTCCCTAGACAAACCGCTGTGTGGTGACGGTCTTCCGGCCATTCGGTTCCCACTGTATTGAAGCATGCCAGGCTATTTCAATATCGCTATGCCGTGGCATCATTTAACCCCTTGTAATTCATCGTCATAACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCAGGCTTCCACCACCAGCACGACAAGATGCCGCATACA
GTGACCCAGTCAGTCCAGTTTCCAGACAACCAGTGCGTCACCTTTTTGAAGGCGCTTTAAAGCACGTTTTAATCCAGGTCGGCCTGTCCTTATTCCGCTTAATTTATCTTCAAATATTTGTTCACATCCTGCACAAACAAGAGCGTTTCGTTGCAGGTCTGTATTCTGGTCATTTGTTGATACCCTTACATAGCCAATCAGCACGCTGAATCTCCCGTCCAAAAGCACAAATCATGCCATGCAGGCCAGAAACCGCCATTATCTAAAACCTCGGTTTACAGGAAACGGTAAACAGGGCCAGGAACGCCGTGCAAAAGAATGGCGATACCTTGTCCGGTGGGCTTACTTTTGAAAACGACTCAATCCTTGCCTGGATTAGAAATACTGACTGGGCAAAGATTGGTTTTAAAAATAATGCCGACAGCGACACTGATTCATACATGTGGTTTGAAACAGGCGACAACGGCAATGAATATTTCAAATGGAGAAGCCGCCAGAGCACCACAACAAAAGACCTGATGACTCTTAAATGGGATGCTTTGTCTGTCCTTGTTAAAGCCCTTTTCAGCAGTGAAGTAAAAATATCGACAGTCAATGCACTGAGGATATTTAATTCATCTTTTGGTGCTATTTTTCGTCGTTCTGAAGAATGCCTGCATATCATCCCTACACGAGAGAATGAGGGAGAAAATGGTGATATAGGGCCACTACGCCCCTTTACGCTTAATCTCAGAACTGGTCGGATAAGCATGGGGCATGGTCTTGATGTTACAGGGGATATATTTGCAAACCGTTTTGCAATTAACAGTAGTACCGGCATGTGGATTCATATGCGTGACCAGAATGTTATTTTGGGACGCAATGCGGTATCCACCGATGGTGCGCAGGCATTACTTCGTCAGGACCACGCTGATCGCAAATTTATGATTGGTGGACTGGGGAATAAGCAATTTGGCATCTACATGATTAATAACTCAAGGACAGCCAATGGCACCGATGGTCAGGCGTACATGGACAACAATGGCAACTGGCTTTGCGGTGCGCAAGTTATTCCCGGCAATTATGGTAATTTTGACTCACGTTATGTGAGAGATGTCCGACTTGGTACACGTGTTGTTCAGACTATGCAAAAAGGCGTGATGTATGAGAAATCAGGTCATGCAATTACGGGGCTTGGCATTATCGGTGCAGTTGATGGCGATGATCCGGCAGTATTCAGACCAATACAAAAATACATCAATGGCACATGGTATAACGTCGTACAGGTGTAATTTATGCAGCATTTAAAAAATATTAAGTCTGGAAATCCAAAAACAAAAGAACAATATCAGCTAACAAAGAATTTTGATGTTATCTGGTTATGGTCCGAAGACGGAAAAAACTGGTATGAGGAAGTGAAAAACTTTCAGCCAGACACAATAAAGATTGTTTACGATGCAAATAATATTATTGTCGCCATCACCAAAGATGCCTCCACGCTTAACCCTGAAGGTTATAGCGTCGTTGAGGTTCCAGATATTACAGCCAACCGCCGCGCTGATGATTCCGGTAAGTGGATGTTTAGGGACGGAGCTGTGGTTAAACGGATTTATACGGCAGACGAGCAACAACAACAGGCCGAATCACAAAAGGCCGCATTGCTTTCCGAAGCTGAATCAGTCATCCAACCGCTGGAACGCGCTGTCAGGCTGAATATGGCAACAGACGAGGAACGCACACGACTGGAAGCATGGGAACGCTACAGTGTTCTGGTCAGCCGTGTGGATACGGCAAATCCTGAATGGCCACAAAAACCAGAGTAAAAATTAAGGCCCGATAGCGGGCCTTCTCTCATTCTGGTTGTTCGGGAAACGTTACTGGCAGGCCGGAAGTGTCTGTAGATTCGACTTTCTGCGCATAGAGCATCCACTCGGTTAATTTTTGTTTATTCTCGTCGGAAATGATGCCCAGCCGTAGCTGTGAGTCCCATAGCTGGGTTTTATCCCTGACAAGTTGCAACA
GGCTTTGCTTTTCATTTTCCGCTTGTTGCCTCTGCTCTTCCTCGGTATAAGTTCGCTTTATCACTACGCCATCTTTGAACATCCATTTCCCCGAAATATCAGCCCGGCGATTTGCTGTAATATCAGGTAATTCAACGACGCTTGCACCCTCTGGATTAATTGCTGAAACATCCTTTTCAATACAAATAATAACGCCGTTATGGTCATAGACCATTTTCAAAGTGTCTGGCTGGAAATTCTTTTGTTCCTCATACCAGTTTTTTCCATCATCTGAATAAAGCCATTTGATGTTAAATTGCTTTGTTAGCTGGTATTGCTCTTTTGTTTTAGGGTTGCCAGCAGTAATGTTTTTTAAGTGCATCATCGTTAAATACTCCCCGCGTTATACCACGTCCCATTAATGCAATACTGAATTGGCCTTGCCTGAGTTGTATCAATTAATTCATCACGGTTTCCGTTAACTGAACCCGTAACGACATAACCTGACCTGTCAGACCAGCCGGGGCCTTTCCATGTCTGAACAGATGACAGACCGCCAAGGCGAATACCTGTAATAAACCTTGAGTTACATTCTGCCTGCGTATATGCACCAACATCCCCCGCAGAGGGTTTGCGTGTTGTGGTGTAAAACTCTGACCAGTTAGCTTCAAAGCCATAACCATCACGCGCTGAACGATAAAAAATACCGCCGTTCCTGTAATTCACGCGGAACTGTACAGCAGGGCAACTCCCCGCATTCATATTGAAGTGGAGGATTAATGTCGATGCACCACTGATATCTGCATCATAAACACCGCTATTCCAGTTCCAGCCAACAGCTTTACCCGCCGTTATCCTTCGCCAGACCCGCCAGAACTAACTGAGTCAGTATTAACTGGCACCGGGCTTCGCTTACTCCGGTAGTTCTCGTCATCATGCGTGGCGTTACCCACTTGTCAGCAGGTAAGAAATGAAGGACTGCGGCGGCGGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTCCCTTCTGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACTGTACTGCACCAAGTTTACAGGTACAAAAAACCCGCTCAGTGGCGGGTTGCTATCACAGCTATATATTTACTTATTATGCCGTTACTAACATTTATCTTCGACATATAATCGAAAACAAGGTTTACTTAAAACTCTGCTTTCATTTTATCCGGGAATTTTTTATTTGCAGCATAATAACTACCAAGTACATAAGCGTTCATTTGCTGCTCTACATCAACCCGACATGCCGCACTAGAACAAGCTCCACTGATAAGCCCAAAAGAACTCCCTTTAGCAGAGAGATCAGCTTTGATTTCCTCTACAGTGTTTTTCCCCATAGCAACTACACACCCTGTCACAATATATCTAGCCTTCACATCATCCATGCTAAGGATAGTAGTTTTCGCAATTTTGCTGTATCCATCATTTTTATAAACATCCATGGCAAACGCACGGCAATCTGTATAATACGGACTTGCTTTAACTTGCGAATACTCAGGTAATTTCATACCTGCACAACCAACTAAACAAAAACCTATCGCTGCTATTAATACCTTTTTCATTACAGTCATAACCTAGAAGCATCATTGAAACTAATTTATTAAATAATCATCGAGTTTCTGGAATACAGACGTTAACCATCTTTCCAAAATCTAAAAGATAATAAGAAAAAATGTTTAACGCACCAATCCATTTCATAGTTTCATGAGACATCTGGCACAAAAAAACCCGCTCAGTGGCGGGTTCTTAAATCTTATCAACGGTAGACATACAAAGCCCATCGTTGGGAAAATCTTATCCATATTTTTTGAAAAATGCAAGCATCATGTCGTCATCTTCGGCGAAAACCATTTATCTTGTCACATTTCTCAATTGTATCTCTGCATATGCTTCTTCCTGCCAGCACTTTGTAACCAGTTTATCAA
TGACATCTGCATATCCTTTGTACCACTGATAATCCGTCAGGTCTGGTACCAGCTTCTGGACATGATGCCGCGCCAGTGTGGTTGGTAAACGGCTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGTGCCATGAAGCCGGGTCCTTTTTTCATCCAGGACAATACCTTTACCCTTACACCCTCTGCACGCTGTGCTGACTTCTCCCTTACCATGACAATGCTGACATAGTTCCTTCACCCACTCTTCCTTGATAACAGATTCCCCGCTTCTGGAGTGTTTCACCACTTCGCGCAATACATTATGAAATCCAGTACCAGCACAATGCTCACAGCGAGCCTTACTTGCCGCAGACCTGGAATAATCAGCAAAGGCAAAATTCACAAGGTAAGGGATGATCTGTAACCGGGTTTCTTCACTCAATTTGTTCAATGTCGGGTTATCCAGTGCCATCGCGTAATTGAGCAGACCTTCAATCGCAAATTGAGGATCCTGAACACCAACTTTTGCCAGGAATAAGGCAAACCCAAGCGGTGCTTTCGACTGCACCATCCCCTGCGCAGCCATCACATCCGTAATCGTTAAACCACCTGAGCCTGTCGCCGGTGCGTCATCGCTCAATTTTGGAGATTTTGGGGAGTAATATTTTGGTAAGGCTTCAAGGTTCATGCTCGTTCTCCACTTACGCCAATACGCCAATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGAGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAGAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTTGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCTTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTGACAGACGGCACAACAGGCTTTGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGGTCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCGCCCGGCGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGTTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAGCCTTTCGCTGTCAGCGACCAGAATTTTTTGATGTTGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGGAATTTGCGCCCCGCCGTGCTTATCAGGCTTTTACCAGCAAACTCCCCTTTGTTAGGGTGTC
GCCAGTACGTATTCACGCTGGGCGGGAAAGGCAGGATCAGTTTCATACTTTCAGGCCTCTCTCATGTAGCCAGTGGGTTGCACGCAGCCTTGCGTTTTCCTCACCGGCAAGCAGTGAGCGGATAATCCCGACAGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTTATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACCGCTGCATTCACCGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGGAAGCCACGTTCTTTCTGGTAAGAAATCAGCATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCTTCACCTGATGCAGCCCGCTTACTGCTTTTCCGTAAACACCGTTCACGACGCGCCAGAAAATTGTTTCGTTCTGGCTGGGAGTAGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTGGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGCTCCTGCGCTCTCTCAGTCAGGCGTGGGAAATGTCTGCGTGTATCTCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGAATTTCTGAACTTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCCTCACGTCTAGCTGCAGATACCGCAGGAACTTCCCAGGATTCTTCGAAATGACGATCCGGACCAAAGAACGTGACAGCCTGTTTCACAAATTGTGTGCCGCTGTTACCCATCGCAGATACCCAGCCCGCGTAGCGTTTCACACCTTCCAGCATGGTTTCGGGGTTTACCCCCTCATTCAAACGGGCTTTCCAGGCTTTGAAGGCTGCAGATTTTGAATTGCCACCAGCACGTTTGGGATATGCCAGCCATGCCTGCTCAAACTCCGGAGAGTATTCCGGTCGGTTTGAACGAACTCGCACGGACTCATCAACTGATGCACCAACAGCTATTGGTTCATTGACTGGTTCTTTGACTGGTTCAAAAGAGTGACTGGTTCTGGGTGAATCTCCTGCACTACCCCCTGGTGCAACTCCTGCACTACCTAGTGAATTTGCTGCACCAGATAGTGAATTATTTGCACTACCCCCTAGTGAATCTCCTGCACCATCCAGATGAAGGAGATAGATATTACTTGAGTTACCTTTTTCACCTTTCCGGGTGACTTTTTTTACCAGCCCGGACTCACAAAGGGCCGCAATATGATTCATCACAGAACGTTTGCTAATCTCACACTGGTCAGCAATATGCTGGTAGCTGGGCCAGCACTCACCCTGATCGCTGGCATTATCAGCCAGCTTGATCAGAACCAGTTTTCGCAATGGATTACCCACTCGAATTTTCATCGCTTTAACCATCAGCTCCATACTCATGCTGCACCTCCGAGATGCTTCATGTTTTTTCCGGAGCGAAAGGCTATAAGCGGCATACTGACGCGGTAATTACGGCCCAGCGGTTCACAAATCACCTTCTGGCATTCACGGTCAACCAGGCTAACACGTAGAACATGCCCTGCAGGTGTGGTGTACCACTGCCCAACTGTAGGAATTGATGTTTTTTTACGCTGAAGCAAACGGCAAATATTGAGGATCAACGGATTAAGCATGACGATGCCCTCCGCTGATATTCAGGAGACGGTGAATATGAAAATTAGCCTTATCCGCCAGACGAATACGTTCAGCCTGCAAGTTAAGA
AGGGTTTCTACCAAAACCTGATGCGCCTGCGGATCCGAAAGAGTTACCTTGCGCAGAGCACGTAGTGCAGTTGTTACATAACTGAGTTTATGTAAGTCTTCATCATTCAGACGAGAGAGGGCTGGGACAGTAGCCATGATGGCAGCCTCCGTATGCAATGGATAACTTCCACCACCGGAAACGCCAATTTCGCTGGTGGTGAACTGAGCAGGGTTGGCGTAACCGGCGCATACGGAAACCGGCGCACCTTTCGGTGCCCCCACCCAGCCCACCATAATTTGGGTATAGCTGAGTTGTAGCAACAAAAAAGACGCTAACGCGCCAATTGTCGCCGTATGCAATTCCAGGACGCCAATCCCGACACCCGCTTTATAAGGTGCCTGAACAGTGTAACGTCCCGGAATGGCAGAATCAATGTGCTGGTGGTCCTTCACACTCAACAAAATCACGCCTGAATTTCCACAAAGGACTAAAGCACTCATGCGGGTAGTCTTTGCGAAGATAGATAACGCGCTGTGTTTCTGGCTCCCAACGAATAACATGGACATAAAGCCCTCTTCCGTCACGAAACCAGCGGTTAAGTTCTTGCACAACTCGCCCCCCACAGTCAGGTAAAGTTCTCTGTGGTTACTTACAGCCAGGTGATTTGGTAATCTGCATTCATGCCGTAACAACAGGTGTTCAGCGACGCTGACCACCAGCTGTTGCGACAAACGGTTATTTGCCGTTAAACTGTTCATGCGTTAGTTTCTCCACAGACACAAAACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCACTCTTTTCTGGAGCGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGGAGCTTCGAAATTGACAGATACAACTCATCATTAATTGCTGTCTGCTCGTGTGGCTCCACGACCCCATCTTCGATTGCCGAACGAATCTGCTTTGAGTAATTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACGAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGTAGTGAGTGCCAGCCGCGCGCTGTAAAACCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCAGTCAGCTGCTTCAGCGTAACCACCCGGCAACGCTGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATACTTACCCACGGTTAGCCTCATCGTTCTGTGGTTAAAAATTGAAGGTGTTCTGTTAATCTTTCGGATAGATATCCGGTCTTAAGTCAGATTTCGTAATTGCACCTGACGTGCATTGCTCAAGTTTTTTCGCCAGCACAAAACTGGCTTTTTTATAACCATTGAAAACCAGCCGTAAGTAGCCAGGTGTTGAGCCAACTTTTCCGGCCAACTCGCCCTGCTGTTCTTTGGTTAAAGAGTCCCAATACGCTTTCATACAATATGTACCTCCGATATACATATTACATGATTGAAATGAACCTTCAAGATACTTGTACCTTATCGGTACAAAGGTTTTAATTTCGTTATGAAAACAATCCATGACATCCGGCGGTCTAACGCCAGAAAACTGAGAGATGGTGTTGGCGGGAATTCTTCCTTTGCCACCATGATTGATCGCGAGCCAACCCAGACCAGCAGGTTTATGGGAGATGGTGCTACTAAAAATATCGGTGACAGCATGGCGCGGCACATCGAAAAATGTTTCGACCTGCCTGTCGGATGGCTTGATCAAGAACACCAGACCACGAACATCACAAAAAAACCTGATGTTTCAATCACTAACAAACAAATAACGTTAGTCCCTGTCATATCATGGGTACAGGCCGGAGCATGGAAAGAAGTTGGCTATTCTGAGGTTGATTTGAGCACAGCAGAAACTTACCCCTGCCCTGTACCCTGTGGC
GAAATGACTTATATCTTGCGGGTGATTGGTGATTCAATGATTGATGAGTACCGCCCTGGAGACATGATTTTTGTTGATCCCGAAGTCCCTGCCTGCCACGGTGACGACGTTATTGCATTGATGCACGATACAGGCGAAACCACCTTCAAGCGATTGATAGAAGATGGAACACAGCGTTATCTCAAAGCATTAAACCCAAACTGGCCTGAGCCTTACATTAAGATTAACGGTAATTGCTCTATAATTGGTACAGTGATTTTCTCGGGAAAACCAAGAAGATACAAAATAAAGGCCTAATCAATATTTATAACCTGCTTCGGCAGGTTTTTTTATACTTGACAATGTACCCTTGAGATACATAATGTATCTAAAAGAAACATAACACAGGCAAGATTAAACTAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATCCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGGGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCTGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCGTTTGATGAGAACCGTGTAAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTTTTAATAAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACACAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCTGCGGCGGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGACAATGACTTCAGCGGCAAACGCTCTCTGATGGAGTCTGTCGAAGCGAAAACCAAAGACATTATGCCAGTA
GCATTTGAGTTTAAATGCGTTCCGTTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGTGATCGCCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAAAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCGCCTGATGGCGAGGGTTTTCTTTAACCAAAATTCAGCGCGGTGCAGCGCATATAAAGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAAGATAAATAAAGACGACATCGTTATTAACGATATCGCGGTTTCCCTTTCAAATATCTGCCGCTTTGCCGGTCATCTTTCTCACTTCTACAGCGTCGCCCAACATGCGGTGCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCTTTAATGCATGATGCAACAGAAGCATATTGCCAGGACATCCCCGCACCACTGAAACGACTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCTCCTGTTATGAGCACGCCAGTGAAATATGCCGATCTCATTATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTGCTGGAAGGTATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGGCTCCAGGCCATGCCTACGGGATGTTTATGGAACGCTTTAACGAATTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGTGGGAGATGTGTACCGCGAGATCTGAAAGTGAATGAAACGGATGCTGAATACCTAGTGCGTAAATTCGATGCGCTTGAAGCTAAATGTGCAGCACAGGAAAACAAAGTAATACCAGTGTCAACTGAACTGCCACCAGCAAATGAAAGTGTTTTGTTATTTGATGCTAATGGAGAAGGCTGGCTGATTGGCTGGCGTTCTCTCTGGTATACATGGGGGCAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACATTTACCGATAAAGAACTGATTAAAGAAATCAAAGAACGAATCAGCAGCCTAGAGGTTCGAGACGATATTGAGCGCCGTGCTTATGAAATTGCTCTGGCATCGCTAGAAGAGGAGCCGGTGGCATGGCTGCATTCAGAAAATGGCTTAGGTATTCCGGCAATAACGAGGAGTAAAAACATTGCTGACAGTTGGTTATCAAAGGGCTGGTATGTTCAGCCGCTATATATAGCCAAGCCAGTGCCGGTGGTGCCAGATGCTCGTCCGTCTTTAAATAATGGCATAGTCGGTTTTGATGAAGGCTGGAACGCCTGCCGCGCCGCCATGCTTCATGGTGCCGAACCTGTAAGCCAGACTTACAAGTTGAACAAGCTGTCGGGCAACTCTCCAGTAACTCCGGATGGTTGGATAAGCTGTAGTGAGCGAATGCCGAACGATAAACAGTATGTTTGGTGTTGGGGGAAGCCTTACGGCTGGACTGAGTGCGATACCTTCGAAGGGTATTACGATTGGTCGAGAAACAAATGGTGGGCAGTTACTGACGATAGGGAAGAACCGGCATCGAAAGTAACCCACTGGATGCCGCTACCGGAGCCGCCGCAGGAGGTGAAGTAATGAACAACTTAATGACAACTAAACAAGTCGCCGACTTCTGTGGCGTTTCAGTATCGACTGTTCTTCGCTGGAACAGCGTAAACAGGAGAACTGGCCAGAAATACAGGCCTGACTTTCCAGATCCTGATATTAAATCCTGCCCAAAT
AAATGGGCATCACGCAAGATATACAGATTTGCTGGAGTTATTGAGTAACGTGTATTATCTCAGATGGGAGCTGACATATCTATGGCACAGACCAAACTAATCTGACAGTCAAGTCTGTGCCAAGAGCAAACGTTGCTAATTTAATGTTAAGTTGTTTCTCTTGAAGACGATCACGCTATGATACAACCTATAAAATTATCGATATCTGAGAGAACCAGTAAAAAGTTTAAGTCAAGGGTTCTATAAGTGTGTTAATTTATTGGGAAGCAATAAGCTTCCCTTCTCTTTATTTAGGTACGCTATCAACACTTTTGAGAATCGTTGATGTTAAAGGCTGCGACTTTAGTAAGTCTCGAGAAATATCCGGATCTGGTATAGCTATTTTTACTAAACTCTCAAGTTTTTCAAACCACTCCCCATCATTCATTGAGAATAAGTTATGAAGAGCTTCATTTTTATTTTTGAACCCCCAAGCCTTATAATTATTACCACAGTATTTTGACGCAGCAGTTAGAATATGCCAGCGTAATTTTGAATACTTCCCATCAAACCTCTTATTTGAGATAAGAGCCTTTAGCCTATACAAGCAATAGCAAGATATATAATAATCATCCTCGAGGGCATCGGAAGAGAAAACCTCATTAAGTAAGTCACCAGTTAACCTATTTGGATAACGACTGGAATAATCTGGTCTCATCATAACAATAGCAGCATATGCTCGCGCAACTTCTCTGATATCAAATATGCGTACTGGAGCTATACTTTCTGAACTATACTGCCCCTTTCTTCTCTCAAAATAAATTTTATTTCCTTCAATAGCACCTTTAGCATTAAAATAATGTTCTAATTCCCTTAATTTTTTTAGCGTTGAAATGAACTGTGCATCTTCAACTTTAGATTGTCTATTCGTAGCTCGTACAATGTCATCAAGGATCGCTGGCTCATCGGTTTCAATTAATTTAATCATTAAACTAACTGATTCATCTACTTGAATGTCTTTTGATATAAGAACATTAGACGTTTGACATCCATTAACAATTTGAAAATCTCGTATAAAAATTTCTTGCCCTGCAGGCCTCACGCTTGATGCAACTATTGTCACTCCATTATTCATTAAACCAAATCTTGCTTTTTTTCCATCAGTATCTAGTGTTCCTGCAATTTCTGAATTTACGTCACCATCAATTCCCAGAAAATCTCTAACATTTTCTTCGAATAATTTTTTTCGAGGGTTTCCATTTTTGTCTTTAAGTATAGAATCAATAAAACTACGAGCTTTGACTGTAGCTACATAGGCATTATTAATATTAGGGGCCGCTGGGAATGGAGCATAGCCTATTGTAGGAAGTTTAGCTTCTATTGGACCTTCCGCAGCAAGCCAAAGCTCATGAATCATGTCTTTGTGAGCCATAATGAAGGATGTTTCATGTGAAAACCCGAGTGACTTTAAGCTTTTTTCACCCGATGCAAATGCAGCTTTTATTTCTCTGGCCTCTGTATTTTGTGCAGCACTAAAAAAATATGCGTATAAGTCTGGAAGGCCATTTTTGACTCTGCCGATATTAGCAAATATCAAGTTGAACATTTTTTTAAAGTCCGCTAGATATTCACTGTGGGGTTGTTGTGGCGCTGGTGAAAGATAATCTCTTATTGACGCAATGTAAGAGTCTATTTCCTGTTTGCTCCACTTCTCCGATGACTTAGCTTGCGTGAATACTAATGAAACTTGAAACTCGCGACGTGAGTTTTGGAAAATCTCCTGAAGTTCTTCAGTTGAAAAAATGGCTCTGTCGTCTAAAAATAAAAATGCTCCATCAATACCAGGGTCCGGGCCTTCATACACAAGATCACTGACCTCAACTTTGTCGCCTGAGTATTTTGAAAAAGCACAATAGTTCACAAACGCCTCAAAATTTTTGGTCTCCTCATAGGGCGCAGCGAATGCTTTGCAAAAGGCATCGAAATAAGATTTCGTGACTAAATGCATAAAAACTCCTTTC
TGAAATAAGATAAGTTTATTAGAAACAACAAGATAAAGATAAAGATAATAGTAATACAAGTTAAAGACATACTGCATCACATAAGTTGGTCAAGATCAATAAATCAATAAGGATCCCCTATGTTTTATGTATTGTTGATAATTCTATCACCTCTTACTTCTGGTCTAAGCGTAACCTACTCAAGAATGCCATTGAAAATGTCCACTGTTCGCTCAAAACAGACTGTCAGGGCTACGCAAGCTGTCAGGTAGATTCTGGGCCAGTACAAGTAACGATCGATTCAACTCTCTCCCACCATGCCTGATATGCTTTACGCTGTTCTTCTAGATAATCGCTCTTGTCATAAACCTGCCAAACCCCTGGCAGTTTATGGCCGAGCATTATTTCTGCAATATGAGGAGCAGTAAGATCAGAAAAGTTTGTTCGTGCTGTTCGCCTCAAATCATGAAGAGACCAATGAGGGAATTGATGCCCCAAACGCCTCCATGCGTACTGCATTAAATTGTAAGGCAGCGACTGCAATGATGTTCGACCAACGGGTTCCCTGCTTCCTTCCTTAGTAAAAAGCATATCGGAACCATTGTTCATAGAGATAGCGTTCTTTATAAGCTCCTCCACCGGTTCAATAATGGGCCTCTTTAGCGGTTCGCCTGTTATCTCTCCAGTCTTATGTCGTTCTGGTGGTACAGTCCATACCTTATTAATGAAATCAAAATCGCCCACCCTGGCAGTAATTAGCTCTGAACTACGGCAGCCAAAATGCAGCAATAGCTTAATGAAGGCCCGGTATTTAGGAACCATTCGAGAACCATCGATCGCAGCATAAAGGATTTTAATTTCATCATGTGTCAGAAACCGTTTCTTCTGACCTTTACGGATATCCATATCTTTACCCGTGATATCCGACAGCGGGCGAGTTTCAATGAGCTTTCTCTTATACGCCCAGACATGGGCCTGCTTTGCGTTAATTAGCAATCGGTCTGCTATTGCTGGAGTCTTAGTGCTAAGAGGCTCCAGGACTTCTAACCAATCATGCAATGTAGCTGCATCGTGAGGGATATTCCCGATTTTAGAGAACAGGTGCAGCTCAAACGAGCGGAGTATCTGTTCAGAACCTTTTTTATTTTTTACACAATATGCTTCATACCAGGCACGGATCACAGACTCTACCGTCATGGCTTCAGTAGCTTTTCGTTTTTCAGCCTGCTTGACCAATCGTGGATTACGGTTTGACTCAAGTTCACCACGGAGACGGATAACTTCTTCTCTGGCCTCTTTTAATCCAGTTGCCGGGTAAGTTCCGATATCAAGACGCTCACCTTTCCCTGCCCATTGATAACGATATTGGAACACTACGCGACCTTTCGGTGATACTCTGACAGACAGACCATCACGATCGGATTTAACCAAAACCTTATCACGTTCCTTTCCAACGACTGAACGCAACCACGCATCAGACAACGCCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP029122|4197341:4218730|4209166_4210003_-|WP_001446923.1|DBSCAN-SWA MNSLTANNRLSQQLVVSVAEHLLLRHECRLPNHLAVSNHRELYLTVGGELCKNLTAGFVTEEGFMSMLFVGSQKHSALSIFAKTTRMSALVLCGNSGVILLSVKDHQHIDSAIPGRYTVQAPYKAGVGIGVLELHTATIGALASFLLLQLSYTQIMVGWVGAPKGAPVSVCAGYANPAQFTTSEIGVSGGGSYPLHTEAAIMATVPALSRLNDEDLHKLSYVTTALRALRKVTLSDPQAHQVLVETLLNLQAERIRLADKANFHIHRLLNISGGHRHA >NZ_CP029122|4197341:4218730|4199480_4199777_+|WP_001378647.1|DBSCAN-SWA MYWKIQEVSAWFVKHVNAKSVEQLRDFNPSFAEIADLSDATADIITKLLQVGVWDDERVMANARQAVLLMRQVAEAIERGDNDSIQDGANRLSAMAFV >NZ_CP029122|4197341:4218730|4212226_4212589_+|WP_000135682.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >NZ_CP029122|4197341:4218730|4215549_4217250_-|WP_001419254.1|DBSCAN-SWA MHLVTKSYFDAFCKAFAAPYEETKNFEAFVNYCAFSKYSGDKVEVSDLVYEGPDPGIDGAFLFLDDRAIFSTEELQEIFQNSRREFQVSLVFTQAKSSEKWSKQEIDSYIASIRDYLSPAPQQPHSEYLADFKKMFNLIFANIGRVKNGLPDLYAYFFSAAQNTEAREIKAAFASGEKSLKSLGFSHETSFIMAHKDMIHELWLAAEGPIEAKLPTIGYAPFPAAPNINNAYVATVKARSFIDSILKDKNGNPRKKLFEENVRDFLGIDGDVNSEIAGTLDTDGKKARFGLMNNGVTIVASSVRPAGQEIFIRDFQIVNGCQTSNVLISKDIQVDESVSLMIKLIETDEPAILDDIVRATNRQSKVEDAQFISTLKKLRELEHYFNAKGAIEGNKIYFERRKGQYSSESIAPVRIFDIREVARAYAAIVMMRPDYSSRYPNRLTGDLLNEVFSSDALEDDYYISCYCLYRLKALISNKRFDGKYSKLRWHILTAASKYCGNNYKAWGFKNKNEALHNLFSMNDGEWFEKLESLVKIAIPDPDISRDLLKSQPLTSTILKSVDSVPK >NZ_CP029122|4197341:4218730|4208122_4208941_-|WP_001677149.1|DBSCAN-SWA MSMELMVKAMKIRVGNPLRKLVLIKLADNASDQGECWPSYQHIADQCEISKRSVMNHIAALCESGLVKKVTRKGEKGNSSNIYLLHLDGAGDSLGGSANNSLSGAANSLGSAGVAPGGSAGDSPRTSHSFEPVKEPVNEPIAVGASVDESVRVRSNRPEYSPEFEQAWLAYPKRAGGNSKSAAFKAWKARLNEGVNPETMLEGVKRYAGWVSAMGNSGTQFVKQAVTFFGPDRHFEESWEVPAVSAARREDPYFKASYDNVDYSQIPAGFRG >NZ_CP029122|4197341:4218730|4214134_4214497_+|WP_001242749.1|DBSCAN-SWA MRMNVFEMEGFLRGRCVPRDLKVNETDAEYLVRKFDALEAKCAAQENKVIPVSTELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA 
>NZ_CP029122|4197341:4218730|4204483_4204909_-|WP_000217632.1|DBSCAN-SWA MTVMKKVLIAAIGFCLVGCAGMKLPEYSQVKASPYYTDCRAFAMDVYKNDGYSKIAKTTILSMDDVKARYIVTGCVVAMGKNTVEEIKADLSAKGSSFGLISGACSSAACRVDVEQQMNAYVLGSYYAANKKFPDKMKAEF >NZ_CP029122|4197341:4218730|4213607_4214144_+|WP_001401560.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNELSELRKCA >NZ_CP029122|4197341:4218730|4208937_4209162_-|WP_001446924.1|DBSCAN-SWA MILNICRLLQRKKTSIPTVGQWYTTPAGHVLRVSLVDRECQKVICEPLGRNYRVSMPLIAFRSGKNMKHLGGAA >NZ_CP029122|4197341:4218730|4206952_4207309_-|WP_108711101.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGEFAGKSLISTAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >NZ_CP029122|4197341:4218730|4207631_4208126_-|WP_001373594.1|DBSCAN-SWA MIMSLLNEVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESYSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA >NZ_CP029122|4197341:4218730|4200112_4200616_-|WP_001378643.1|DBSCAN-SWA MTEFLGKLFDRVYTEKDFGINVAIFVAGLAGMSCYLILHDYVLTLFAFIIPFPIVKIIAGGWHQRLLAQKGKAASRQQLAALYDSLTDSEKNVVQHFVAFGGAVMTWGQMNRLDDPQTGVESLVRKGLLGTSVSVDGMHETFELDLTLFAFAYRWRASQNTDSAAEG >NZ_CP029122|4197341:4218730|4203093_4203627_-|WP_000972143.1|tail|DBSCAN-SWA MMHLKNITAGNPKTKEQYQLTKQFNIKWLYSDDGKNWYEEQKNFQPDTLKMVYDHNGVIICIEKDVSAINPEGASVVELPDITANRRADISGKWMFKDGVVIKRTYTEEEQRQQAENEKQSLLQLVRDKTQLWDSQLRLGIISDENKQKLTEWMLYAQKVESTDTSGLPVTFPEQPE >NZ_CP029122|4197341:4218730|4197341_4198928_-|WP_000202566.1|DBSCAN-SWA 
MTSLLAAEVAKRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSNQHAKSDWMEMEKQRGISITTSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTRLRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETYLYQSGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVKGASNEFDKELFLAGEITPVFFGTALGNFGVDHMLDGLVEWAPAPMPRQTDTRTVQASEDKFTGFVFKIQANMDPKHRDRVAFMRVVSGKYEKGMKLRQVRTAKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHGTIQIGDTFTQGEMMKFTGIPNFAPELFRRIRLKDPLKQKQLLKGLVQLSEEGAVQVFRPISNNDLIVGAVGVLQFDVVVSRLKSEYNVEAVYESVNVATARWVECADAKKFEEFKRKNESQLALDGGDNLAYIATSMVNLRLAQERYPDVQFHQTREH >NZ_CP029122|4197341:4218730|4212654_4213479_+|WP_001763729.1|DBSCAN-SWA MSQNLDTTAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >NZ_CP029122|4197341:4218730|4217506_4218730_-|WP_001680166.1|integrase|DBSCAN-SWA MALSDAWLRSVVGKERDKVLVKSDRDGLSVRVSPKGRVVFQYRYQWAGKGERLDIGTYPATGLKEAREEVIRLRGELESNRNPRLVKQAEKRKATEAMTVESVIRAWYEAYCVKNKKGSEQILRSFELHLFSKIGNIPHDAATLHDWLEVLEPLSTKTPAIADRLLINAKQAHVWAYKRKLIETRPLSDITGKDMDIRKGQKKRFLTHDEIKILYAAIDGSRMVPKYRAFIKLLLHFGCRSSELITARVGDFDFINKVWTVPPERHKTGEITGEPLKRPIIEPVEELIKNAISMNNGSDMLFTKEGSREPVGRTSLQSLPYNLMQYAWRRLGHQFPHWSLHDLRRTARTNFSDLTAPHIAEIMLGHKLPGVWQVYDKSDYLEEQRKAYQAWWERVESIVTCTGPEST >NZ_CP029122|4197341:4218730|4210594_4210795_-|WP_000649477.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQCTSGAITKSDLRPDIYPKD >NZ_CP029122|4197341:4218730|4209999_4210551_-|WP_000521508.1|DBSCAN-SWA MGKYHWKVEKQPEWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMVLQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGNYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >NZ_CP029122|4197341:4218730|4202537_4203065_+|WP_001681074.1|tail|DBSCAN-SWA 
MQHLKNIKSGNPKTKEQYQLTKNFDVIWLWSEDGKNWYEEVKNFQPDTIKIVYDANNIIVAITKDASTLNPEGYSVVEVPDITANRRADDSGKWMFRDGAVVKRIYTADEQQQQAESQKAALLSEAESVIQPLERAVRLNMATDEERTRLEAWERYSVLVSRVDTANPEWPQKPE >NZ_CP029122|4197341:4218730|4201571_4202534_+|WP_001171282.1|DBSCAN-SWA MQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNNADSDTDSYMWFETGDNGNEYFKWRSRQSTTTKDLMTLKWDALSVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEECLHIIPTRENEGENGDIGPLRPFTLNLRTGRISMGHGLDVTGDIFANRFAINSSTGMWIHMRDQNVILGRNAVSTDGAQALLRQDHADRKFMIGGLGNKQFGIYMINNSRTANGTDGQAYMDNNGNWLCGAQVIPGNYGNFDSRYVRDVRLGTRVVQTMQKGVMYEKSGHAITGLGIIGAVDGDDPAVFRPIQKYINGTWYNVVQV >NZ_CP029122|4197341:4218730|4205955_4206945_-|WP_001360050.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP029122|4197341:4218730|4207305_4207632_-|WP_000210170.1|DBSCAN-SWA MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRATHWLHERGLKV >NZ_CP029122|4197341:4218730|4210885_4211560_+|WP_000848748.1|DBSCAN-SWA MKTIHDIRRSNARKLRDGVGGNSSFATMIDREPTQTSRFMGDGATKNIGDSMARHIEKCFDLPVGWLDQEHQTTNITKKPDVSITNKQITLVPVISWVQAGAWKEVGYSEVDLSTAETYPCPVPCGEMTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >NZ_CP029122|4197341:4218730|4205189_4205942_-|WP_001047105.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGSGGLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQIIPYLVNFAFADYSRSAASKARCEHCAGTGFHNVLREVVKHSRSGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRLHGTPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEIQLRNVTR >NZ_CP029122|4197341:4218730|4214496_4215117_+|WP_001377405.1|DBSCAN-SWA 
MTTFTDKELIKEIKERISSLEVRDDIERRAYEIALASLEEEPVAWLHSENGLGIPAITRSKNIADSWLSKGWYVQPLYIAKPVPVVPDARPSLNNGIVGFDEGWNACRAAMLHGAEPVSQTYKLNKLSGNSPVTPDGWISCSERMPNDKQYVWCWGKPYGWTECDTFEGYYDWSRNKWWAVTDDREEPASKVTHWMPLPEPPQEVK |
25 | Escherichia_phage(56.0%) | integrase,tail | attL 4198867:4198886|attR 4218961:4218980 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_8 |
4307036 : 4313595
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP029122|4307036:4313595|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCA
GTCGTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATA
ACCGAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTT
CCTCTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP029122|4307036:4313595|4311698_4312049_-|WP_000747102.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV >NZ_CP029122|4307036:4313595|4307993_4308761_+|WP_000175457.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >NZ_CP029122|4307036:4313595|4312770_4313595_-|WP_000594911.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >NZ_CP029122|4307036:4313595|4312149_4312722_+|WP_000227281.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >NZ_CP029122|4307036:4313595|4307036_4307993_+|WP_000684856.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >NZ_CP029122|4307036:4313595|4310627_4311779_+|WP_001254876.1|transposase|DBSCAN-SWA 
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >NZ_CP029122|4307036:4313595|4309318_4309576_-|WP_000177060.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_9 |
4557435 : 4580852
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP029122|4557435:4580852|DBSCAN-SWA AATGGCGTACTATAACATAGAGAAACGACTAAAATCCGATGGCACACCACGCTATCGCTGTAATGTGATTATCAAAGAAAAAGGTGTTATCACTTACAGGGAAAGCAAAACATTCCCTAAACATGCTCATGCCAAAACATGGGGCGCACAGAAAGTGATGGAATTAGATCTATATGGCATTCCATCATCAAATGCTGTTGACGGACTTACAGTCCGTGACTTACTACACAAATATTTAAATGACCCAAATGCCGGAGGTAAAGCAGGCCGTACTAAAAGATATGTGCTGGAACTGCTTATGGATAGTGACATCTCCGCGATCAAACTATCTGAACTGACAGAAAATGACGTAATTGAACATTGCAGGCTAAGAAACAACGCTGGTGCAGGCCCAGCAACAGTCAGCCACGATGTTAGTTATCTTGGCAGTGTTCTGGATGCGGCAAAACCTGTATACGGAATCAATTACACATCAAACCCGGCGAAAAGCGCTCGTCCATATCTACTTAAACTCGGTTTGATTGGTAAATCAAACCGTCGTAATCGTAGACCAGCATCTGATGAACTGAACATGCTCATTGAAGGCCTTCAACAACGATCTACTCATAAATGCTCAAAAATTCCGTTCGTTGATATCCTCAAATTTTCTGTGTGGTCCTGTATGCGAATCGGAGAAGTATGCCGGTTACGATGGGAAGATCTCGACCAGGAACAAAAATCTATACTAGTAAGAGACAGGAAAGATCCACGTAAAAAGGAAGGCAACCATATGAAAGTTGCCTTGCTTGGGGAAGCCTGGGATATCGTCCAGCGACAACCCAAAAAATCAGAATTCATTTTTCCATATAACAGCACTTCTGTTACCGCAGGATTTCAGAGGGTAAGAAGCAAATTAGGTATTAAAGATCTGAGATACCATGATTTGCGTAGAGAAGGGGCAAGTCGCTTATTTGAGGCTGGTTTTAGTATTGAGGAAGTCGCTCAGGTTACAGGGCATCGTTCATTAAACGTGCTATGGCAGGTATATACCGAACTGTATCCGAAATCTTTACATAATCGTTTTGAAGAACTCCAAAGGAGCAGAAATAAGACCTCTTGACACTGTTTATCCATACAGTTAAAAATAATACTGTATACAAATACAGTGTAGGGGACTTTTATGCGTATTGAAATCTGCATAGCCAAAGAAAAAATGACTAAAATGCCAACCGGTGCTGTGGATGCGTTAAAGGAAGAATTAACCCGACGCATCAGTAAACGTTATGACGATGTAGAAGTGATCGTAAAAGCCACCAGCAACGATGGCCTTTCTGTTACGCGCACCGCTGATAAAGATTCAGCTAAAACTTTTGTTCAGGAGACTCTGAAAGATACCTGGGAGTCTGCTGACGAGTGGTTTGTTCATTAAGTTCATCATCTCTGGACACCATTCACTTGTATTGACTTTAAGATCATTAAATTGTAATAATTCAAAGGGATGGGGCAGGTTTTTCCTCTTGCGCATGCAAGGGCGGCTCTGATAATGAACAGTAATATTTTACTGTTGAGCTTATGCTCACTGATCCTGCCGGGCAGGATGTTCAACGGTTCCAATATATCCTGCCCCTTCCGCTTTTAAAAAGGATCTTTTATGCATGACAATATTTGGTTTACATATAAAGCACGTATCCAAGCGCACCATCGACTAGAATGGCTTGAAAAACACTCTCAATTTATCCTCGTTTGGTATGCTATATTGAGTGCGGTACTTTCAATTGTAACGTTGCGATTTCCAAAGGTTCTAGGAGATAATACAGATGTCGTTGCGGCGATACTTTCAGTGGCTCTACTGGGTATTTCTCTGATCGTATCTAACCTAGATTTTCGTGGTCGAGCAATAGCCATGAGAAGGAATTATATTGCACTACAGCGACTCTATTTTGA
CATTACCACCAGTCAACAGTTATCTCTTGAACAGAAAGAAAAATATTTTAATTTGCTCAATGAGGTTGAGAATCACCGTGACATAGATGATAAAATTTCAAGGGTAACTCAAGTTGGACTTAAGACGAGGATCCCCACACAAAAAGAAAAAATAATTGTTATTTTATGGATATTACTTCGAATATTTATTACTGCCGCACTTTATATACTCCCATTAATATATCTTTGGATTGACTATGACTGCAAGCAGAATTTTTAAAAAGTCATTCTCGAAAAAAAATCTTCTAAAAGTATACTCTGAAAAAATCAAAGAATCAGGAGCGATTGGCATAGATCGGATTCGCCCATCAAAACTTGATTTGACAATAAAAAATGAGATCACTTTCATTTTTGAAAAGGTTAATTCTGGCAATTACAAATTTACAGCATATAAAGAAAAATTAATATCTAAAGGCGCTAACTCTACACCCAGACAGATTTCCATACCAACTGCTAGGGACAGAATTACTCTTAGAGCTCTCTGTGAATGCCTTACGGAAATATATCCTAAGTCCAGATTAAAACTACCACATACAGTAATTGACTCATTGAAAGAAGCATTAAACAACAGTCTATATGCTGAATATGCAAAAATAGATCTTAAAAGTTTCTATCCTTCAATTGAACATAAATTGATAATTAATGCAATAAAAAATAAAATTAGAAAAAAAGAAATTAGACAGTTAATAACATCATCATTAATCGTGCCTACTGTAAGTGGAACCACAGGAAGCAAAGGTATCCCTAATAATACCAGAGGAGTACCTCAGGGATTAGCGATATCAAACATTTTAGCTGAAATATCACTATCTAATTTCGATGATGAAATCAATAAAATGCATGACATATGGTACATGCGATACGTTGATGACATTCTTATTTTAACACCAAAATATCAAGCAACAAAAATAGCTTCTCATATCATTGATAAGCTTCAATCATTAAATTTAAACCCACATCCATTAAATGAAGAGAACTCAAAATCCAAAGTAGGCAGTTTGGATGAAAGTTTTAACTTTTTGGGATACCACATAGAAAATCGAGAATTATTGATAAAACATGAGAGCATTCTTAGATTTGAGTCATCCTTAGCAAAAATTTTTACTGCATATAGGCACGCTCTACTACAAGCTAAAAGTAAGCGTGATAAAGAACGAGCTGTTGCATATTGTCAGTGGAAACTAAATCTCAGAATTACGGGATGTGTGTTTGAAGGTAAACGATTGGGATGGGTATCGTACTTCTCACAAATAACCTCAACAGCTCAACTTCGCTCTGTTAATCATACTATCAATAATCTTATCCGCCGATTCGGCCTTTCATCAGAAATAAAACCAAAATCTTTGATTAAAACTTTCTATGAACTCCGCAGAGGTAGAGCGGAGACTTTTAAATACATACCTAACTTTGACAATCTACATATATCTCAGAAACGAGAACTTGTTTCTATGTGGATAGGTAAAGAGAAGGAAAAAAAACTTAGCAATAGTGAAATAGAGAGGAAGTTTAAATTTAAAATTGCGAAATCAGTAAAAGAGCTTGAAGAGGATATTTCAGGAATATCATAGATATGTAATCCATTAAGTCATTAAAATATATCGCAATAAACACACTATTTAAACTCACAAACCAGCCGCAGTATCCTGCCATGGCAAGTTGCTGCGGCTTTTTATGTTCAACGGATCAACAGCCAGATCAGAAGACACGCTACCATCGGCACAGCAAAATCCATCAGGCTTGCCACATCCCATGCGCGTGGATCAAAACCGCCCCACCACGGCATGTTAATCCGCTTGCCATGCCCGAACATTTCAATCCAGCGATATTATGCTTGGGTGTGTTCGCCCGCAATGAAGAACGTACAACTGGCTATCGCTCCGTAAACCCAGTTTCCGGTAAAAAGCCCCACCAGTATCTGCGCAGCCACAGCACAAAGTGCATGAAGGAAAGGTGTTATACCC
ATTTTCATCCTACCCAATAAAACGGGGCGCACAGCCCCTTAATATTATTTAGAGGCAAGCGCCGCCTCAATTGCAGATAATCTTTGTTTTAATTCTGCGTTTTCTTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAAGCCCGTCACGGCGGCGTAGTCAACATTCAGATAACGTGTTTCTTCACGTAATTCATTGCCGTCAACCGTTGGTCCCTGCAACTCTTTACCGTAATGAGTGAATGAGCCTACCGCTTCCGGTATTGCCTCCATTGCTTCCTGTGCAATAACACCAGCGTAAGGCAGTCCGTTTTCCTTAAGTGTGTAGGTGTACCCGTTCATTTTACGGATAGCTTCGGTTGCGTCACTGATAATCTGAATATTGTCTTTCAGCTCGCGGTCTGATAACTGATTCAGTGTTGTACAGTTAATAGCACCGTTTACATCAAACAGCTGACCTGACGATGTTTTTTGAGCATAAAACAGATACGCGGCAGACGTTCCAACCTCAAAAACGTTTTGTCGATCACCTGAACCCCACACCTTGACAGCAAATGGTAGTTCTGTATTACCTAAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGTTTGTTTTGTAAGAGTTAAATCAACAGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTAGCACCGTTAACAGCACCTGTTTTGAGTTGAACCGCGCCGTCATTACCATTTAGCAGTATCTCAGCTCCGCTAAAGAAATTTTTTAGCGACAGCATCTTACTTACGCCGACTGATGAACCCAACGCCCACGCGAGAGAATTACCGGTGCTATCAAACCCACGTACAAAGCAATCCATTTTGCTATAGTCTGACGTGCTTCCAAGGACATCAATCCGCCCTCCGCCAGATTTTATCGGGTTAGATGTGGTTAATGACCTGACAGCAAGAGCGGTAGATGAATTGAGATCGTCTACTGTTAGTAATTTCTTCCATTCCTGCGTCGTTCCATTTTCAATTGTTCTTCCCCAAAAACCGGAATTGCGGCCTCCGAACTGCACAGCATAATTTTTACTAAATTGAACATGAATGCCGCCAAGAACCATAGAGCCTGCCGGGCCGTTTGTACTGCCTGCAATTGGTCTAAATTTGTCGACGTTATCAGTATGCTGAGCGTTCCAGTCTCGTCCTGTTTTTGTAAGCAGTCCTAATGTCGTGTCGAGCGCTTCGGTTAAAGTAAGGTTTTTAAATTTAACTGCGCTACTTTCGCCAAGACTTAAGTTGTTTCGCGCATCTTCTACTGTTGTCGCACCTGTACCTCCTTGCCCAACAGCTAATGGCAACCAGCCAGTGCCATTATGACAACCCCACAAACCAGATTTGGAAACCTGTAAGCGTGGTGCGCCTGAAGAATACGTAGAATATACGTAAGTTGTTTCTTTTCCCTCTTCAACCCTTTCTGTTGATACATCTCTCTTCCATTCAGACCAGGTACCGCTACCTCCTTCATATATTCGTCTATAAGTAATTCCCGGATTATTATAAGGGTGGTACACCTGAACGCAGGAAGAAGCAGTGTTGGCTCTCGTTTGGAGCACTATTAAAGCGCCCGCCAGACCGATTGGGTAATTTAATTCTTCTGTTGCGTAAGCGCTCGTTGGTTGTTGATAGAAACCAGAATATTCACCGGTAAGGGTGTTTAGGTCAGTATTAGCAAGGCCAGCTTTTTGCTCATACATTACCCGCAAGTTCGCACGTGCTTGCTCGATAGTTCTTCCACCAGTACCACCCTGGGCTAAACCTAACGGAGACCATTGTCCAGAAGAAGAATCATAAACCCCCCAACTCCCGTTAGTATGGATTTCAAAGAAACGATCACCAGTTGGACCATACATACGAGTTGTTGATTCATTTTGACCAAAGCGGCTTAATTCATCTCTGCGGGTATGACGTATCCATTGCGGGCCAGAACTTTGGCTGTAAATGTA
CGAATACAATGATTTTGTATCTGAACCAACAAATAGACCGGAATATCTTTTTGTTGACACTTCTGATCTAACTATATAACCAGAAATAAGATTATCCCCGGCACTAGACGCTACTGATGGATAACCTTTCGCTGAGATGTAAATCCTGATAAATCCAATATAACCTGATGGCTCAACAGAGATATCAGTGCAGGTCATAGCAACTTCACCAAGACCAATATTGTTACCAACAACGGATTGTGCCTGGTTGCGATAATTAAGTGCGTCCGCTGCAGATTTCGCCGCGTTTGTTTCACTGGATTTAGCATTAGTTTCGCTGCTCTTTGCTGCTGCCTCGCTATTTTTCGCGTTGGTTTCTGATTTTTTGGCTGCTGTCGCGGAGTTTGCCGATGCAGTTTGTGAGGCCGCTGCCGCCTGTGCGCTGTTATCCGCATTCGTCTCAGACGTTTTTGCGGCCTTCGCGGAATTTCCTGCCGCCGTTGCCGAAGAGGCTGCACTACTGGCGCTCGAGGCTGCGCTCGTTTCTGATGATTTCGCTGCCTCTTTTGAGGCCGCCGCATCCCGGGCTGAGGTGGCAGCTTCTGACGCTTTCGTGGTCGCGGTGGATGCAGAAGTGGCTGCAGATTTTTGTGACGCTGCAGCATTCGTTTCTGACGTTTTCGCCGCACCGGCACTGGTAGCCGCCGCGCTTTTTGAGGACTCTGCAGCGGCAGCACTTTTTGATGCTTCAGTAGCCTTTGTTGATGCCGTTCCTGCGCTGGAAGACGCTGACTGAGCCGACGTCGCGGCCTGCCCGGCTGATGTGCTGGCTGCACGTGCTGAGCCTGCAGCATCAGTCGCATGGGTTCCCAGAGCACCACGCTGGCCGACTGCTCCGCACGGGTGCATTCATTCAGTGTTTCCTGCCGGATACCCAGTCTGATTCTGTCGCCGCTCCACCACGAATCTGGATTTTCCAGGCCTCAGCCTCCTTAACAGGGTCAATCCACGGCATCACTGGTCCGGAATACACCGCGGTATACAGTGAAGAACGGTCAAGATCGCGGGGTAACCTGATAACACCGGATGCCACAGCCTGTTTCAGCCATGCACGATACATCGGGCGGGTGACGGCACCAATAAACCAGTCCTGCAGGATCAGGTAGCCATCAGTAGACTCAACCAGCTCCTGACGCTGGGCGCTGTAAGTGCCGTTATAGTTGCGCGCTGTACTGGAAAAACTCAGACGACTACCAGCCGCCACTGCACGCAACTGACCATTACGAAAAGTTTCAAGATTAGGATTGGGGCGATCCGACTTCACCATTCCGATTTCTTCGCCGGGTTTCAGATCGTCGTAAATAATGCCTGGCTGAATGGTAAGCTCGCGTTCATTCTCCTTGCTGCCATTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACTGCTTTTCGTAATGATGGAGGCATTATTCACCTCTTGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAACCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGATGCGGGTGCTGTGTAATGGAAATAAAAAGGCCACCTGACGTGGCCACCAGATTATTTCCCCACCAGCTCGTTTATCTCTTTCACTGTCTGGTTAAACCGCTCTGACTCAAGCTCAACACCTAAGGCCCGACGCCCCAGCGCCATTGCTGCTTTTATTGTGGAACCGGATCCCATAAAAAAATCAGCAACCAGATCACCAGGTCGACTACTGGCATTGATTATTTGCCTGAGCATATCCGCCGGTTTCTCACACGGATGTTTACCCGGGTAGAACTGAACGGGTTTATGCATCCAGACATCGGTATAAGGCACGGAGACTG
ATACGGAGAAATAGCGCCGGAGAGATTTAAACTCATCCAGCAATTCAGAATATTTGCGATTCAGTGAATCATAAGATGCCACCAGCTGGTGGTGTGGTTGTTCCAGTTGTTGTTCCTGAAACTTCTCTGCCGCTATACGGGAAAACAGTGCCTGTAACTTCCGATAGTCAGCCTCATTCGGCAACTGCCACTGACTGGCACCAAACCAGTGGGAAACCATATTTTTCTTACCTGTGGCTTCGGCAATTTGTTTTGCCGTTATACCCAGTTCGGCACGAGCATCCCTGAAATACGATATCAGCGGTGCCATTATGTGCTGTTTGAGTTCCCTTTCTTTTGCCGCATAGCCGTCACTTTTGCCGCGATATGGCCCCTGGTAATGTTCAGCAAACAGAACGCGCTCTGTGGCAGGAAAATATGCGCGCAGACTTTCTTTATTACACCCATTCCAACGTCCGGACGGCTTCGCCCAGATGATATGGTTAAGCACGTTGAAACGTTCACGCATCATGATCTCAATATCAGATGCCAGGCGATGCCCACAGAACAGGTAAAGGCTTCCGGCAGGTTTTAACACCCGCCAGAACTGGGCCAGACAGTGGTCCAGCCACTTAAGGTAATCTTCGTCCCCTTTCCACTGATTGTCCCAGCCGTTGGGTTTCACCTTGAAGTACGGCGGATCGGTAACAATCAGGTCAATGGAATCATCAGGCAGGGACTGAATAAAATGCAGGCAATCAGCGTTGATTAAATCAACACTGTTTATTTTTACAGTATTTTTCATGGATCAGTAAGCGTAACTCTGGTAGGCTCACTCTGCTTTTGCGCTAAAGCAGTGGGCCGTGGTTCGCTTGTGACCAGTAAGCATGAGCGAATGGCTGGCAGGTGCTACCAACACCCACCAGCCGCCCATTTTCACAAATTAAAAGTCCTTCATTGCTGAAGGCGTCTGTAACAGCCGAACTGGTAATCTGCCAGCCCCGCCATAACCAACTGGGTCAGTATTAACTGACAGCGTTCGCGTGAAAGATATGTGTTTTGTGCAATCTCCCCGACTGTTGCCGGTTCGATGCTTAATTCATTAAAAACAACTTTCGCCGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTTCCTTCTGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACCGTACTGCACCAGGTTTACAGGTACAAAAAAACCCGCTCTACGGCGGGTTTAAGCTGTGTGGCGAAGTAACCACTCTTAACAGATTATGATAGTTTTTGCGTACGCGTTAGTAATTTATTATGCTCTTTACAATTGTTAGCTTAACCGCATGGAATTGACCAAAAAAATGAGCAAAGCAAGCATACGCCACGTAATCGTTCATGAGCTCTTAAAAGAATCTAATAAAGACTTCGATCACTCCAAACCATACAATCTTCGTGATACAGAACTAGATAAAACAAATGATATAGTAAAAAAATTAGTAGACGGTGTTATTGATTTGTATGGTTCAAAAGGGAACTCAGCGCATTATGGTGTTTTTATTAAAAATAAAACAAAGCAAGGCCCTATACCAGAACTATTTCATAAATACTCTTTAGTTCAACAATCTGTTTCAAGTGATTTCATTGAATTATCGAAGGAAGTTATGAAACAAATGTATAAATCTGCTCAAGAGCAGATTTGGGCTTCTGGAGGATATGTTGTTTTTACTGATTATATTTTATCTGGTTTCCGTTATCTATTGGTTACAATGATCAAAAAAACTAATGGCGTAACTATTAGTGAAAATTTAGAGCCAGAGGAAATGATTCACTTAGAACTTGGTAATATTAACCAAGCAGCAAAAATAAATTTCAGATATTATGAAGAATACCAAAAAGCAGATGACTTAAAAAAAACAGACTTAAGTTATCTAAGCTTTATAAGCAAAACTACGGGACAGTCAGCGGCAGC
ATATTTTATAGCAGCATTAGGATGTGACAAGGGGATTGCTTCAGCAGGTGCAACCCGTAAGTTACCAGATGAAATAAGGCGTTTCTTTAAGAAAGAACCTCTTTTAAAAAATCAAGCAGAGTCATTTAGAAATGATGTTATCAAATACTTAGAAAAGCAATTTGACAACGAGCACTCTGCAAGGCTTTCTGATATCGAATCGCTTGCTTCAGGCCATATGTCCTATTTAAAAGAGGAAGAAAAAACAGAACTTGTTGATAAATTAATGAAACACCTCAATAGTGAGGAAGTCAGAATCCCATCAGAGTTCGTAATCAATAAAAACTCCTTAGATAAAATCAGCAATGTGATATATAAAACCCCATCATTGAGCTTTCACTTCGACAAGGATTTACTCGGTGTCACAACTGATGCTAAAATATATTATGATGACGAAAACCAAAGCCTAACATTTAATAATTTGCCTGTTGAAGCATTAACTAAGATAAGAAGAGCGTTGAAAGAAACTGATAACCCAAGTAATGAAGAAGATAAAGAATGAATGATTTTAGTATAATAGTTAATCTGTATAGATTATCAAGCTATCCTCATTTTGACGGGGCTAAGTTTTCTGCGCGTATAGCTTATAATGCAGACGTAAAATCATTGTTCAAAAGAATTTTGAACCCTACTTTTCAAGCTGGTACAGCTGACGAAATAGAGGTGGATGGTCATTTAATTTATGATTATGAAGACTTTCCTGAAAAGGGAAATTTTCTTACATACTCGTTTAAAATTTCACAAGGAAGTGCGAATCGTTTTTATAAAAATAAAAACGAGTTTGTAAAAATAAACACGCTCAAGAAAGGCATAATGCCAGAGTATTTCTATATTATAGAGGATGATTTCTATTCATTAGAAACACCAAAACCTTCTTATATCCAAAAAATTGAGGACATTTGTGAGCTAATCAATGCTCTTTCCATGCTTGCTCATTTCCATGATATAAAAAAAGATAGCAAAGGTACATTTTATCGTTTAGTCTTTATTTTAAACTCAGAGTCTAAATCTTCTTCTGCTGTAATTGAAACAAATATTACAGAAGAAATTTTTAATGATAAAACAGTAAATACTCAGTTAGTTAAAACATTAGTAAGTAGTGAAGCTACTACTGATGCCCATCACATTGAAAAGATTAACACTTTCAGAAACACAGTTATTGAGTATGTTAATAAAAATGGAAATTCCTTTGTCGAGTTAATTAACAAGTGGGATTTCATATGCGAACTTTATACGAACAACTTAGCTGCTTATATGTCGGCATTTTCTTTTCATAAAGCAAGGAAAGAAGTTGTTGATGCTGAACTCGATTACTCAGAAAAACTGTCAAAAATAATTTCAGAAATTTCTAACAAAGCTCTTGCAATACCTATTTCACTAGCTGGTTCAATTGCTATTTTCAAATTAACAACAAAAGCTGATTGGATTATTGCTTTAATTGGATTGATTATCACAGCAATAATAACATCTGCAATGATTGTGTCACAAAAAAAACAACTTGCTCGTATTTCACACTCTAAAGAAATACTTTTTGGACAATTAAGATATAGAATAAAGGATGACACCAGCGATCTTAAAGAGAGCTTAGAAGAGGCTATTAAAAAATTAAATGACAATGAGGATTTTTGTCATAAGGTGCTTGACAGTTTATTATCACTAGCATGGATGCCTACATTCATAGGCATCATCGGTATTTTATTTAAATTAATGCCAAATATTACTTGAGCATGTACATAACCCCATTAATGAAACCTAACGCAGTCTGCAATTGCTTCCGGATTGTTCCATCTGAGCATCTGCGTCTCTTCGCAATAGAACGCAGTGAAATACCTATAACGAAGTGAGCAATGATCAGCTCATATTCGTCTGGTTTATACTTACACAGGCGGGCAACACAGCCGTCTATCATGATGCCTTCATCATCATCACACTGAAGGCGTATTTTTTTACCATGAGGTAAA
AGCCCCTTAAAGCCTGCTGCTATCGGCTGCCAGTCCACACCACTGTTATCTGCTGCAGCCCACGCTCCCCAGCGGTCTAAAACTTCATACATATCACGCATCAACTTTCTCCACAAAATCAGGCCAGCACGCCAATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAAAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTGGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCCTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTAACAGACGGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGATCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCACCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGCTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAACTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAACCTTTCGCTGTCAGCGACCAGAATTTTTTGATGTTGTTAATCGCGGTACGACTGTATCGTTCGCGCTGCTCGACGATCCCCAGCTTCACCATCTGGTGATATGCCTGATTAGCTGTCAGGCGGATACCATACTGCTTCAGCAGTGCACTCAGTGACAGCGTGGGGCGGCTTGAGCCATCAGGCGCGTCAGCAGGAGCATCAATGGCATAGCGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCACCAAGCACTGAAGAGTTAGACAGGTTTAATTCCCGGCGCATAAAGTCCAGCAGAATCACACCAGCCTGCATCTTGTCAGCAGCCTGTCCGGATAATTTTTCCGGTGCGCTGGTTACCATGTCGAAAGTACGGATCACCTTAAGATGGAATGACGGGCTTATCCACATTGCATAGGCATACACCAGTTCTTTGCAGACATACGTCCCCTGGTTATTTCCGCCACGAATAACGTTAACTGGCTCTATATTGACCGAGTTGCAAATCTGCAACTCGCTTATTAAACGTTCAGTTTGCTCATTGCGGAGCCAGAATGCAGGCTTATGCTTATCCAGAGAACCGGCAGCCCTGTGCAGATCGTTCAGGCTGTAACGCCCATAAGCATCACGACGAACTTCAATACCATCAATAACCATCAGATTATTCATACTTCGTTTCTCCTCTTAATCAGGCGGCTGCACCCGCCGGTTTCTCATACTTACTGATAGTGATCTCGACCTTCCCTTTCGGGATAA
CCGGTCCCCACTCCACCAGCATTCTTTTCACCTGTCTGTCGTCTTCCCACACACCCGCGTGGGTCAACGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCGGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGAAATTTTCGCCCCGCCTCGCTTATCAGGCTCTTACCAGCAAATGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACACTGGGCGGGAAAGGAAGGATCAACTTCATACTTTCAGGCCCCTCTCATGTAACCAGTGGGCTGCACGCAACCTGGCGTTCTCCTCACCGGCAAGCAGTGCGCGGATGATACCGACCGCCTCGCTGTCGTCGTCCTTCACTGCGGTATGAAGCGTGATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACGGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCAGAAAGCCACGCTCTTGCTGGTAAGAAATCAGAATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTAACCAGTTTTTCCGCCTGCTGGCGAACCTGCGCCAGAAACGCCTCACCACATGCCTCAAGTTCATCGCGCCCGATGTAGCTGATTGCCGGTCCCTTCCAGGTCTTATCGAAAACAGCAATAGCACCAGCGAAGAAAGCGCCTGTCGGCACCTGCTTCTCATCCTTCGGGATAAACCAGGCAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGGTCCGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCACCCTTATCACGCATGGCACTGGCATGTTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGGGGATTACCAAAGGCAGCCCCTTTAAGCTCCGCAAGACGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCAGCACATTTGGCGTTATCACCGTCAGTGAACAGATCCAGAACAAACGGGCCAAACAGGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCCACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCATCACCCGGTACAGCGCGCTTGCTGCTTTTCCGTAAACACCGCTCACGACGCGCAAGAAAATTGTTTCGCTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTCGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGCGGTTCCGCCTCCTGCGCTCTCTCAGTCAGGCGCGGGAAATGTCTGCGTGTATCCCCTTCACAACGGTGAGCCACACGACCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGTATTTCTGAACTTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCCTCGCGTCTGGCTGCAGATACCGCAGGAACTTCCCAGGATTCTTCGAAATGACGATCCGGACCAAAGAACGTGACAGCCTGTTTCACAAATTGTGTGCCGCTGTTACCCATCGCAGATACCCAGCCCGCGTAGCGTTTCACACCTT
CCAGCATGGTTTCGGGGTTTACCCCCTCATTCAAACGGGCTTTCCAGGCTTTGAAGGCTGCAGATTTTGAATTGCCACCAGCACGTTTGGGATATACCAGCCATGCCTGCTCAAACTCCGGAGAGTATTCCGGTCGGTTTGAACGAACTCGCACGGACTCATCAACTGATGCACCAACAGCTATTGGTTCATTGACTGGTTCTTTGACTGGTTCAAAAGAGTGACTGGTTCTGGGTGAATCTCCTGCACTACCCCCTGGTGCAACTCCTGCACTACCTGGTGAATTTGCTGCACCAGATAGTGAATTATTTGCACTACCCCCTAGTGAATCTCCTGCACCATCCAGATGAAGGAGATAGATATTACTTGAGTTACCTTTTTCACCTTTCCGGGTGACTTTTTTTACCAGCCCGGACTCACAAAGGGCCGCAATATGATTCATCACAGAACGTTTGCTAATCTCGCACTGGTCAGCAATATGCTGGTAGCTGGGCCAGCACTCACCCTGATCGCTGGCATTATCAGCCAGCTTGATCAGAACCAGTTTTCGCAATGGATTACCCACTCGAATTTTCATCGCTTTAACCATCAGCTCCATACTCATGCTGCACCTCCGAGATGCTTCATGTTTTTTCCGGAGCGAAAGGCTATAAGCGGCATACTGACGCGGTAATTACGGCCCAGCGGTTCACAAATCACCTTCTGGCATTCACGGTCAACCAGGCTAACACGTAGAACATGCCCTGCAGGTGTGGTGTACCACTGCCCAACTGTAGGAATTGATGTTTTTTTACGCTGAAGCAAACGGCAAATATTGAGGATCAACGGATTAAGCATGACGATGCCCTCCGCTGATATTCAGGAGACGGTGAATATGAAAATTAGCCTTATCCGCCAGACGAATACGTTCAGCCTGCAAGTTAAGAAGGGTTTCTACCAGAACTTGATGCGCCTGCGGATCCGAAAGAGTTACCTTGCGCAGAGCACGTAGTGCAGTTGTTACATAACTGAGTTTATGTAAGTCTTCATCATTCAGACGAGTGAGGGCTGGGACAGTAGCCATGATGGCAGCCTCCGATAACAGTGAATTACCTTCACCACCGGAAACGCCAATTTCGCTGGTGGTGAACTGAACGGGGTTGGCGTAACCGGCGTTATCGGAAACCGGCGCACCTTTCGGTGCCCCCGTCCAGCCCACCATAATTTGGGTGTGCACAGACGCAGACGATAAAAAAGACGCTGGCGCGTCATATATCGCCGATAACATTTCCAGGACGCCAATCCCGGCACCCGCTTTATAAGGTGCCTGAACAGTGTAACGTCCCGGAATGGCAGAATCAATGTGCTGGTGGTCCTTCACACTCAACAAAATCACGCCTGAATTTCCACAAAGGACTAAAGCACTCATGCGGGTAGTCTTTGCGAAGATAGATAACGCGCTGTGTTTCTGGCTCCCAACGAATAACATGGACATAAAGCCCTCTTCCGTCACGAAACCAGCGGTTAAGTTCCTGCACAACTCGCCCCCCACAGTCAGGTAAAGTTCTCTGTGGTTACTTACAGCCAGGTGATTTGGTAATCTGCATTCATGCCGTAACAACAGGTGTTCAGCGACACTGACCACCAGCTGTTGCGACAAACGGTTATTTGCCGTTAAACTGTTCATGCGTTAGTTTCTCCACAGACACAAAACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCACTCTTTTCTGGAGCGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGGAGCTTCGAAATTGACAGATACAACTCATCATTAATTGCTGTCTGCTCGTGTGGCTCCACTACCCCATCTTCGATTGCCGAACGAATCTGCTTTGAGTAACTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACAAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGT
AGTGAGTGCCAGCCGCGCGCTGTAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCTGCACGAAGGCGGTTGAATAAAGCGTTCTCTGTTACATCCAGCCACTCAGCAGCTTCAGCGTAACCCCCCGGCAACGCCGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATGCTTACCCACGGTTAGCCTCATCGTTCTGTGGTTAAAAATTGAAGGTGTTCTGTTAATCTTTCGGATAGATATCCGGTCTTAAGTCAGATTTCGTAATTGCACCTGACGTGCATTGCTCAAGTTTTTTAGCCAGCACAAAACTGGCTTTTTTATAACCATTGAAAACCAGCCGTAAGTAGCCTGGTGTTGAGCCAACTTTTCCGGCCAACTCACCCTGCTGTTCTTTGGTTAAAGAGTCCCAATACGCTTTCATACAATATGTACCTCCGGTATACATATTACATGATTGAGATGAACCTTCAAGATACTTGTACCTTATCGGTACAAAGGTTTTAATTTCTTTATGAAAACAGTCCATGACATCCGGCGGTCTAACGCCAGAAAACTGAGAGATGGTGTTGGCGGGAATTCTTCCTTTGCCACCATGATTGATCGCGAGCCAACCCAGACCAGCAGGTTTATGGGAGATGGTGCAACTAAAAATATCGGTGACAGCATGGCACGGCACATCGAAAAATGTTTCGACCTGCCTGTCGGATGGCTTGATCAAGAACACCAGACAACGAACATCACAAAAAAACCTGACGTTTCAATCACTAACAAACAAATAACGTTAGTCCCTGTCATATCATGGGTACAGGCCGGAGCATGGAAAGAAGTTGGCTATTCTGAGGTTGATTTGAGCACAGCAGAAACTTATCCCTGCCCTGTACCCTGTGGCGAAATGACTTATATCTTGCGGGTGATTGGTGATTCAATGATTGATGAGTACCGCCCGGGAGACATGATTTTTGTTGATCCTGAAGTCCCTGCCTGCCACGGTGACGACGTTATTGCATTGATGCACGATACAGGCGAAACCACCTTCAAGCGGTTGATAGAAGATGGAACACAGCGTTACCTCAAAGCATTAAACCCAAACTGGCCTGAACCTTACATTAAGATCAACGGTAATTGCTCTATAATTGGTACAGTGATTTTCTCAGGAAAACCAAGAAGATACAAAATAAAGGCCTAATCAATATTTATAACCTGCTTCGGCAGGTTTTTTTATACTTGACAATGTACCCTTAAGATACATAATGTATCTATAGGATACATAACACAGGCAAGATTAAACTAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATCCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATG
AACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAGGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTCTTAATAAACTGATTATTTATCTCATCACTGAATATTTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGTTCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCCGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACCAAAGCGGCTGCGGCGATCCGTAAAATCACAATTGAAGCGAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAGTCTGTCGAAGCGAAGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGTCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCGGTACAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCGTTTACGGAAGCGATAATTTTAACTATTGCCGCCCCTATAAAGAACCATTAAACAATAACGTGACGAAGCTATTGATAGTAAATGAAGCACTTGCAAAATAATTGCATTGCGTATATATACACATGTGTTGTATTACAGCGATAATGGTAAAGCAAATGATTAACTCTGAAGCAATTGAGCAACTAATGTGGCTATGGTCCTTATTTGACATTAAATTCTTATCTATTCTTGCCGCTGCCTTCACTATATATTTTGGCGTGCAAAAAATATCAAAAAAGGTGACAGTGTCGTATTCAGCAAATGCAAGTAGAATATATGACATGCATATATCAACCATAATCCTGAATAATAAAAGAGATAATGCAATTGCTATATCTTCAATCAATATGGAGGTTGAAGGTAAAGGGATACTACAAGTTATTAAATTTGACTCCCCTCTTCTTTTAAAGAACTATGATTCTTTAAAAGTTGAACCACCAAAATTTAGCAGCCTTTATAATAATGATGGCGTAGTTAAGTTAGATATTTATGATAAGTTTCATTTTTATATAATCACGACATCTGGAGATGAAATTAAATGTATTTCTGAAAATAAATATGTGGCACCAAACATGGAAAACAAAATAGCTACAGACATAAGAAAATTTAATGGCATTGTCTTAACAAACAGAATGTCTTATATTTTTTTCTATGCAAATGACAACAGAGAGAAATACTGCATAATAGATGTTTCATTGTTCATAAATGGTGACAACCCATTTCATTTTAATTTTTTAAAAGAAGATGAATTAAGAGATTTTTCTAGCATCCTTATTAGTTACGGATAT
CACCAACAGTTTAAAAGTTATGCATTGTTTAAAATAGACAACCATCTTGCTCCTTCTTTGGTTTTAAATAAATCAATGATAGAAAATAATATTATTGAAATGAATAAGTAACTCACCGGGTGCAGCCGGTTATGATGGAGAAATGATATGAATACCTTGTTTTTACTGATGGCTGAATTCAATACCCCAAACATTGAACTCTCAGCAGTTAGTCAAAAATACTTTGGTATGAGTCCAGCCACAGCAGAAGCAAAAGCAAACGCTTGTAAGTTGCCTGTACCTACATATCGCATCGGTACATCACAAAAAGCAAAACGCTGCATCAACATTCAGGATCTTGCGGAATATATTGACAAAAGACGGGAAGAAGGGCGAGCTGAGTGGGAAAAAGTCAGAACGGAAAAAAAATATAACTAAACTAAAACTATGGATAACCCGTATATGTACGGGTTATTTTTCTTTATCACTATCTTTTCTTGATTTGAACAATCCACTAACAACGAAACCAACCAAACCAACAATACTAATTGTACTTGTACCTAGTAATGCAACGATCGCTTCAACTGGAGCCTTTCCTTCATGTGCAATAAGAAACGATGTAAACATTGCGACAACGAATAAGCACCAACACGACATAAACCAAACCGTGAATGATGCCATTTTTGTCCGGAGCTCATTGTCTATTTCTTTACCAGTTGCGTCAGCTATCTTATCCCGTACTTGTGATTTGAGCATATCAAGCTGAGCTTGAAGACTGTCCATTCTGTTCTGCTGCATAAACTCATGCAATGCACCAGTATTAGAACCAAACTCTTCTTCCTCCAGAATAGCCTTATTTTCTGAAGAAGAATCATCATCGCGTTCATGATTAGACGGCTCAAATGCGGATTCAAAAGCCTGCTCTTGACTGTCAGTAGAGTTTAAGGATGCTTCAGAGCGACCATTTTCAACACCTGCGGCCGCTCCGATCAGTTTATAGATATCTGAATTATGAGACATGTCATCCCTGAATATTACTTTTTCAGAGGCCCTGACATTGCTGTCGGTTATTCAATAAATCATGATAATAAGCCTTGATCGCATCATTTGAGATGATCGACGAGCCAATACCATTATAAGCTTGTGACCAAGGCGTACCTGGCATATGAGTTAGAGTTGATAACTCAATTCCATTTTTCGAGCCGTAAAACTTATAAACAGCCCCGATAATGCTCTCTGCTTGCGGATCCATAGTAACGATGCCACCAAAAGGAGCTACTGCTACATTCGTAACAGGTTTATTCCCATAGTCTTTGAAAGCATCGTACATTCCAGGAATAACTGGACCGTACTTCCACGCGGAGACACATTCATTGAGCAAAGGCTTACCTGTTAATGCTAAATAGTAACCATGGGCAATATAAGTAAGCTTCTGCAGTTGCATGTGGGTCAGAGGATTATGATGTTGGTTTCCCAACGTTATGAATTTATTGGCTATTTGTACCGGACTGTACAT
Protein sequences of DBSCAN-SWA_9 >NZ_CP029122|4557435:4580852|4569113_4569458_-|WP_016159280.1|DBSCAN-SWA MRDMYEVLDRWGAWAAADNSGVDWQPIAAGFKGLLPHGKKIRLQCDDDEGIMIDGCVARLCKYKPDEYELIIAHFVIGISLRSIAKRRRCSDGTIRKQLQTALGFINGVMYMLK >NZ_CP029122|4557435:4580852|4564805_4565021_-|WP_000839596.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP029122|4557435:4580852|4577246_4577609_+|WP_000135682.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >NZ_CP029122|4557435:4580852|4575905_4576580_+|WP_000859462.1|DBSCAN-SWA MKTVHDIRRSNARKLRDGVGGNSSFATMIDREPTQTSRFMGDGATKNIGDSMARHIEKCFDLPVGWLDQEHQTTNITKKPDVSITNKQITLVPVISWVQAGAWKEVGYSEVDLSTAETYPCPVPCGEMTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >NZ_CP029122|4557435:4580852|4573142_4573961_-|WP_021527492.1|DBSCAN-SWA MSMELMVKAMKIRVGNPLRKLVLIKLADNASDQGECWPSYQHIADQCEISKRSVMNHIAALCESGLVKKVTRKGEKGNSSNIYLLHLDGAGDSLGGSANNSLSGAANSPGSAGVAPGGSAGDSPRTSHSFEPVKEPVNEPIAVGASVDESVRVRSNRPEYSPEFEQAWLVYPKRAGGNSKSAAFKAWKARLNEGVNPETMLEGVKRYAGWVSAMGNSGTQFVKQAVTFFGPDRHFEESWEVPAVSAARREDPYFKASYDNVDYSQIPAGFRG >NZ_CP029122|4557435:4580852|4573957_4574182_-|WP_001446924.1|DBSCAN-SWA MILNICRLLQRKKTSIPTVGQWYTTPAGHVLRVSLVDRECQKVICEPLGRNYRVSMPLIAFRSGKNMKHLGGAA >NZ_CP029122|4557435:4580852|4578685_4579468_+|WP_000610754.1|DBSCAN-SWA MINSEAIEQLMWLWSLFDIKFLSILAAAFTIYFGVQKISKKVTVSYSANASRIYDMHISTIILNNKRDNAIAISSINMEVEGKGILQVIKFDSPLLLKNYDSLKVEPPKFSSLYNNDGVVKLDIYDKFHFYIITTSGDEIKCISENKYVAPNMENKIATDIRKFNGIVLTNRMSYIFFYANDNREKYCIIDVSLFINGDNPFHFNFLKEDELRDFSSILISYGYHQQFKSYALFKIDNHLAPSLVLNKSMIENNIIEMNK >NZ_CP029122|4557435:4580852|4570472_4571270_-|WP_001061404.1|DBSCAN-SWA 
MNNLMVIDGIEVRRDAYGRYSLNDLHRAAGSLDKHKPAFWLRNEQTERLISELQICNSVNIEPVNVIRGGNNQGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAPDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >NZ_CP029122|4557435:4580852|4558593_4558842_+|WP_001217553.1|DBSCAN-SWA MRIEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGLSVTRTADKDSAKTFVQETLKDTWESADEWFVH >NZ_CP029122|4557435:4580852|4571998_4572652_-|WP_000066917.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA >NZ_CP029122|4557435:4580852|4575614_4575815_-|WP_000649477.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQCTSGAITKSDLRPDIYPKD >NZ_CP029122|4557435:4580852|4579807_4580356_-|WP_000019186.1|DBSCAN-SWA MSHNSDIYKLIGAAAGVENGRSEASLNSTDSQEQAFESAFEPSNHERDDDSSSENKAILEEEEFGSNTGALHEFMQQNRMDSLQAQLDMLKSQVRDKIADATGKEIDNELRTKMASFTVWFMSCWCLFVVAMFTSFLIAHEGKAPVEAIVALLGTSTISIVGLVGFVVSGLFKSRKDSDKEK >NZ_CP029122|4557435:4580852|4559593_4560964_+|WP_001678535.1|DBSCAN-SWA MTASRIFKKSFSKKNLLKVYSEKIKESGAIGIDRIRPSKLDLTIKNEITFIFEKVNSGNYKFTAYKEKLISKGANSTPRQISIPTARDRITLRALCECLTEIYPKSRLKLPHTVIDSLKEALNNSLYAEYAKIDLKSFYPSIEHKLIINAIKNKIRKKEIRQLITSSLIVPTVSGTTGSKGIPNNTRGVPQGLAISNILAEISLSNFDDEINKMHDIWYMRYVDDILILTPKYQATKIASHIIDKLQSLNLNPHPLNEENSKSKVGSLDESFNFLGYHIENRELLIKHESILRFESSLAKIFTAYRHALLQAKSKRDKERAVAYCQWKLNLRITGCVFEGKRLGWVSYFSQITSTAQLRSVNHTINNLIRRFGLSSEIKPKSLIKTFYELRRGRAETFKYIPNFDNLHISQKRELVSMWIGKEKEKKLSNSEIERKFKFKIAKSVKELEEDISGIS >NZ_CP029122|4557435:4580852|4580378_4580852_-|WP_000287252.1|DBSCAN-SWA MYSPVQIANKFITLGNQHHNPLTHMQLQKLTYIAHGYYLALTGKPLLNECVSAWKYGPVIPGMYDAFKDYGNKPVTNVAVAPFGGIVTMDPQAESIIGAVYKFYGSKNGIELSTLTHMPGTPWSQAYNGIGSSIISNDAIKAYYHDLLNNRQQCQGL >NZ_CP029122|4557435:4580852|4567894_4569121_+|WP_046657265.1|DBSCAN-SWA 
MNDFSIIVNLYRLSSYPHFDGAKFSARIAYNADVKSLFKRILNPTFQAGTADEIEVDGHLIYDYEDFPEKGNFLTYSFKISQGSANRFYKNKNEFVKINTLKKGIMPEYFYIIEDDFYSLETPKPSYIQKIEDICELINALSMLAHFHDIKKDSKGTFYRLVFILNSESKSSSAVIETNITEEIFNDKTVNTQLVKTLVSSEATTDAHHIEKINTFRNTVIEYVNKNGNSFVELINKWDFICELYTNNLAAYMSAFSFHKARKEVVDAELDYSEKLSKIISEISNKALAIPISLAGSIAIFKLTTKADWIIALIGLIITAIITSAMIVSQKKQLARISHSKEILFGQLRYRIKDDTSDLKESLEEAIKKLNDNEDFCHKVLDSLLSLAWMPTFIGIIGILFKLMPNIT >NZ_CP029122|4557435:4580852|4571675_4572002_-|WP_032235543.1|DBSCAN-SWA MTTLTQCQQQVLDILISYQQERGFLPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRALLAGEENARLRAAHWLHERGLKV >NZ_CP029122|4557435:4580852|4571289_4571679_-|WP_000767133.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISEAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDRQVKRMLVEWGPVIPKGKVEITISKYEKPAGAAA >NZ_CP029122|4557435:4580852|4566731_4567898_+|WP_046657263.1|DBSCAN-SWA MELTKKMSKASIRHVIVHELLKESNKDFDHSKPYNLRDTELDKTNDIVKKLVDGVIDLYGSKGNSAHYGVFIKNKTKQGPIPELFHKYSLVQQSVSSDFIELSKEVMKQMYKSAQEQIWASGGYVVFTDYILSGFRYLLVTMIKKTNGVTISENLEPEEMIHLELGNINQAAKINFRYYEEYQKADDLKKTDLSYLSFISKTTGQSAAAYFIAALGCDKGIASAGATRKLPDEIRRFFKKEPLLKNQAESFRNDVIKYLEKQFDNEHSARLSDIESLASGHMSYLKEEEKTELVDKLMKHLNSEEVRIPSEFVINKNSLDKISNVIYKTPSLSFHFDKDLLGVTTDAKIYYDDENQSLTFNNLPVEALTKIRRALKETDNPSNEEDKE >NZ_CP029122|4557435:4580852|4579504_4579774_+|WP_001093912.1|DBSCAN-SWA MNTLFLLMAEFNTPNIELSAVSQKYFGMSPATAEAKANACKLPVPTYRIGTSQKAKRCINIQDLAEYIDKRREEGRAEWEKVRTEKKYN >NZ_CP029122|4557435:4580852|4566290_4566485_-|WP_001355891.1|DBSCAN-SWA MLKQQDMTETAKVVFNELSIEPATVGEIAQNTYLSRERCQLILTQLVMAGLADYQFGCYRRLQQ >NZ_CP029122|4557435:4580852|4575019_4575571_-|WP_000515860.1|DBSCAN-SWA MGKHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >NZ_CP029122|4557435:4580852|4559064_4559616_+|WP_000543834.1|DBSCAN-SWA 
MHDNIWFTYKARIQAHHRLEWLEKHSQFILVWYAILSAVLSIVTLRFPKVLGDNTDVVAAILSVALLGISLIVSNLDFRGRAIAMRRNYIALQRLYFDITTSQQLSLEQKEKYFNLLNEVENHRDIDDKISRVTQVGLKTRIPTQKEKIIVILWILLRIFITAALYILPLIYLWIDYDCKQNF >NZ_CP029122|4557435:4580852|4565088_4566141_-|WP_000799656.1|DBSCAN-SWA MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGITAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFSRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKSLRRYFSVSVSVPYTDVWMHKPVQFYPGKHPCEKPADMLRQIINASSRPGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEINELVGK >NZ_CP029122|4557435:4580852|4561401_4563555_-|WP_001753753.1|DBSCAN-SWA MTCTDISVEPSGYIGFIRIYISAKGYPSVASSAGDNLISGYIVRSEVSTKRYSGLFVGSDTKSLYSYIYSQSSGPQWIRHTRRDELSRFGQNESTTRMYGPTGDRFFEIHTNGSWGVYDSSSGQWSPLGLAQGGTGGRTIEQARANLRVMYEQKAGLANTDLNTLTGEYSGFYQQPTSAYATEELNYPIGLAGALIVLQTRANTASSCVQVYHPYNNPGITYRRIYEGGSGTWSEWKRDVSTERVEEGKETTYVYSTYSSGAPRLQVSKSGLWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNAQHTDNVDKFRPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTALAVRSLTTSNPIKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLAWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGAKAQALTINKDEVNSTVDLTLTKQTGTGNRFVLQNLGNTELPFAVKVWGSGDRQNVFEVGTSAAYLFYAQKTSSGQLFDVNGAINCTTLNQLSDRELKDNIQIISDATEAIRKMNGYTYTLKENGLPYAGVIAQEAMEAIPEAVGSFTHYGKELQGPTVDGNELREETRYLNVDYAAVTGLLVQVARETDDRVTALEEENAELKQRLSAIEAALASK >NZ_CP029122|4557435:4580852|4572651_4573146_-|WP_072165319.1|DBSCAN-SWA MIMSLLNEVQKYIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEAEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAVPGDEWYLSGNYVGA >NZ_CP029122|4557435:4580852|4569475_4570465_-|WP_001360050.1|DBSCAN-SWA 
MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP029122|4557435:4580852|4574186_4575023_-|WP_032181493.1|DBSCAN-SWA MNSLTANNRLSQQLVVSVAEHLLLRHECRLPNHLAVSNHRELYLTVGGELCRNLTAGFVTEEGFMSMLFVGSQKHSALSIFAKTTRMSALVLCGNSGVILLSVKDHQHIDSAIPGRYTVQAPYKAGAGIGVLEMLSAIYDAPASFLSSASVHTQIMVGWTGAPKGAPVSDNAGYANPVQFTTSEIGVSGGEGNSLLSEAAIMATVPALTRLNDEDLHKLSYVTTALRALRKVTLSDPQAHQVLVETLLNLQAERIRLADKANFHIHRLLNISGGHRHA >NZ_CP029122|4557435:4580852|4577674_4578499_+|WP_001753751.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKVLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >NZ_CP029122|4557435:4580852|4557435_4558533_+|WP_000332259.1|integrase|DBSCAN-SWA MAYYNIEKRLKSDGTPRYRCNVIIKEKGVITYRESKTFPKHAHAKTWGAQKVMELDLYGIPSSNAVDGLTVRDLLHKYLNDPNAGGKAGRTKRYVLELLMDSDISAIKLSELTENDVIEHCRLRNNAGAGPATVSHDVSYLGSVLDAAKPVYGINYTSNPAKSARPYLLKLGLIGKSNRRNRRPASDELNMLIEGLQQRSTHKCSKIPFVDILKFSVWSCMRIGEVCRLRWEDLDQEQKSILVRDRKDPRKKEGNHMKVALLGEAWDIVQRQPKKSEFIFPYNSTSVTAGFQRVRSKLGIKDLRYHDLRREGASRLFEAGFSIEEVAQVTGHRSLNVLWQVYTELYPKSLHNRFEELQRSRNKTS |
29 | Shigella_phage(36.0%) | lysis,integrase | attL 4548700:4548713|attR 4567418:4567431 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
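The keyword flagging described in the prophage annotation note can be sketched roughly as follows. This is only an illustration: the keyword list is taken from the note above, but the case-insensitive substring matching and the function name `phage_keyword_hits` are assumptions, not the documented behavior of DBSCAN-SWA.

```python
# Keywords listed in the prophage annotation note (lower-cased for matching).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keyword_hits(annotation):
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a plain case-insensitive substring test; the real tool
    may tokenize or match differently.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


if __name__ == "__main__":
    for label in ("integrase", "Tail fiber protein", "hypothetical protein"):
        print(label, "->", phage_keyword_hits(label))
```

Under this simple scheme, a protein would be colored whenever the returned list is non-empty. Note that a label such as `lysis` does not contain the keyword `lysin`, so it would need its own entry in a stricter implementation.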
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2205 : 49895
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP029123|2205:49895|DBSCAN-SWA CTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCAATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCGTGTACATCGAAATACGGCTTATCAGGCGTTAAAAGATGCTTGCGATGACTTGTTTGCAAGACAATTCAGTTATCAGAGTCTTAGTGAAAAAGGTAACACTATTAATCACAAATCAAGATGGGTGAGCGAGGTGGCTTATATTGATAATGAAGCTGTCGTTAGACTTATTTTTGCCCCTGCTATTGTGCCTTTAATTACTAGGTTAGAAGAACAATTTACAAAGTATGAAATACAACAAATAAGTAATTTAACAAGTGCTTATGCTGTTCGTTTATATGAAATATTGATTGCATGGCGTAGTACTGGAAAAACGCCTCTCATAACTATGTATGATTTTAGACAAAAAATAGGTGTACTCGAGACTGAATACAAACGAATGTATGATTTTAAAAAATATGTTTTAGACATTGCATTAAAACAAGTAAATGAACATACCGATATTATTGTCAAAGTTGAACAGCATAAAACAGGTAGATCTATTACTGGTTTTTCATTTAGCTTTAAACAGAAAAAATCAGCCACGCATTCAGTCGAATCTAAAAGAGATCCGAATACATTAGACCTCTTTTCAAAAATAACAGATAAACAACGCCATCTATTCGCCAACAAGCTCTCAGAGCTTCCTGAGATGAGTAAATATTCACAAGGCACAGAAAGCTATCAGCAGTTTGCTGTACGTATAGCTGCCATGCTGCAAGATGCAGAGAAAGCAGGTCTGTGGTTGGATCTTTCCGCTTTAATCGGAACCGTCCATACATTTCAGGAATACGGCTAAAGGCAACCCCCATTTCTTTAATCAGTGATGTCTTTGCTTTTTCAAACTTAGGCAAATCGTGTAAACAAAAACGGTCGTACGCTGCTGCTTCAACTTCTTGTTCAGGCGTTAGTGGTGCGGGCTCAACAAAAGTAAGTTTTGGATCGCAGTGTACAGGATACGCATTTAAAATGTAGTAATGTGGGTTGATAAAGAAACGACTATACAACTCTGAGAACCCAGGGGATTGTAAACGGAGGTTACGCACCTCAACGGTTTCTAAATCTTTGATACCGTTGGGAACTTTAAACACCGTCTGATAGTTTTTATTGACAGACACGTCAATAATCTTTTGCTCCAACTCTCGTGGGTAACAGAAACCTAATACTTCTACGTCTTTTTCA
CCCAAGAGAGAAACGATTTCAGCACGGAAAATAAAGACGGTTAAAATGGTATTCTTAACATAAATGTCATATGAATACAATTCTGTTTCCAATGTTTCAACATCTTGTCTCCCGACAATAAGGTCGTTGAAAATAGTGTTAAACACTTCAGCGAGTTTGGAATTCTTTAAATGAAATTGAGGTTCACGCAAGTACATAATGTTTCCTTATAATAAACGACTAAGCGGACATTACATCCGCTTAGTCGTTTTTATTTACTATGAATAGATTTTAACAGAGTAACGACGTTTATGCTCTTCTGTTGATTGAGCGATAGACCATTCCGTCATGGACAGTTTTTTACGGTTCACACGAGACTTAAGGTCATCAGTAAACCCTTTACTGATAGTTTCTAATTCTCGACTGGGGTTATTAATAGTAACCAGCACGACTTGATCTAAGTGAGTTTTCACTCTGTCAAGTTGAGAAGCAGAAATAAAAAGCACAACATCGCTTGTTTTTGCTTTGTAGTAATGTTTAATTAACAACTCTGCTGACTTAGGTAAAATGTTTTCAACCCCAACAATGGGTTTATTTTGTTCACCTACTACGGCAAATAATTCACGGGCGCCGTCTGAAAGCGCACGACGTAAATCAAACTGACGTTCGTCAATAAATACAACGGCGTCTTTTGCATACTTAGTCAACATCTCTTCTACTTCTTTAACACGTAGAAGTGGGTTTTGCGTGGTTGTTAATTTGTTTTCGTGGTTCATTACCACAATCGCGGTTACTGACATAGTAAACTCCAATGAATAGTTAGAAGGTAGCCCTCTTAAAATTCATTGGAGAATAAATTAGTCGGATGATGAAGAACTTGAATCAGAGTCACTGGAGTCTTCTCTGGATTCGGTATCGATTTGTGATAGCGCAGCCGAAGTAGTCGCAGCTATAACAAAAGCTTCACCTTGTCTTAACCCGCCATTTTCTAACGGAGTGCTAAGATTATTCTGAGTCTGTTCTTTCTCCATCTCATGTTGCTGAGCCTGAAGTTCACGAAGTTTATGTTCTAAATCCTCGTGTCTCACTAACTGGGTGATTTCATAAGGCTTATCGTTGTCTTCATCCACATCGACAGACTTTAATGTTGCCAAATGAGTAAACCCGCCAAAGAGTTTAGGTTTTAGCGCTACATCACCTACTTCGTCATCATCCACAAATGTACAAATGATTTCGGATGCAAATGGAGTGGCTGCCTCATATACCTGCGCACCACCAATGACCCAGATTTCATCTTCGCTGTCACGATGAAAAGGAAGAAACTGATTGATAAAGCTATCAGGAGTGATGTAGACGACATGACCTTCTTTGGTAAAGGTTTCAGAACGTTTCATGTCATGGGTTGTGATGGTCTGAATGTCTTCAAACGTTTCCGCTTTTGATGTCACCACCACGTTTAGTCGATTAGGTAACCCGCGATTATTCAACGACTCAAATGTTTTACGCCCCATCACCACGATGTTGTTTTGTGTCATCTTTTTGAAGTTCAACATATCGTCTTTCAAACGATACATCAGTGTGTTGTTTTTACCGATAAAGCATTGGTTGTTAATTGCAAGAATCATACGAATCATGATGTTTCCTTAATGGAGTTTAGGGTTGGATGAATTAATGCCGGCTTCATTAAGCGCGTGTTCAGTTAAACGAATAAATAAATCCAACTGAAAACAAGGGTATAGGAAGTATAAACGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCG
GTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCGATTCACCGGTGAACCGGATGAGCTTCAGCTGGCACGATATTTTCACCTTGATGAAGCAGACAAGGAATTTATCGGAAAAAGCAGAGGTGATCACAACCGTCTGGGCATTGCCCTGCAAATTGGATGTGTCCGTTTTCTGGGCACCTTCCTCACCGATATGAATCATATTCCTTCCGGCGTCCGGCATTTTACCGCCAGACAGCTCGGGATTCGTGATATCACCGTTCTTGCAGAATACGGTCAGAGGGAAAATACCCGCCGTGAGCATGCAGCGCTGATACGTCAGCACTATCAGTATCGTGAATTTGCCTGGCCCTGGACATTTCGCCTTACCCGTCTTTTATATACCCGGAGCTGGATAAGCAACGAACGTCCTGGCCTGCTTTTCGATCTGGCGACAGGGTGGCTTATGCAACATCGTATTATTCTCCCCGGAGCCACTACGCTGACCCGGTTGATTTCAGAGGTAAGGGAAAAGGCGACGTTGCGCCTGTGGAACAAACTGGCACTGATACCGTCAGCCGAACAGCGTTCACAGCTGGAGATGCTGCTGGGGCCAACTGATTGCAGCCGCCTGTCTTTACTGGAATCACTGAAAAAGGGCCCTGTGACCATCAGTGGTCCGGCGTTTAATGAAGCAATTGAACGCTGGAAAACTCTGAACGATTTTGGCCTGCATGCTGAAAACCTGAGTACACTCCCGGCTGTGCGCCTGAAAAATCTCGCACGTTATGCTGGTATGACTTCGGTGTTCAATATTGCCAGGATGTCACCGCAGAAAAGGATGGCGGTTCTGGTTGCCTTTGTCCTTGCATGGGAAACGCTGGCGCTGGATGATGCATTGGACGTTCTGGACGCCATGCTGGCCGTTATCATCCGTGACGCCAGAAAGATTGGGCAGAAAAAACGGCTCCGCTCGCTGAAGGATCTGGATAAATCTGCATTGGCGCTCGCCAGCGCATGTTCGTACCTGCTGAAAGAAGAAACACCGGACGAATCGATTCGTGCTGAGGTGTTCAGCTACATCCCAAGGCAAAAGCTGGCTGAAATCATCACGCTTGTCCGTGAAATTGCCCGGCCCTCAGACGATAATTTTCATGAAGAAATGGTGGAGCAGTACGGGCGCGTTCGTCGTTTCCTGCCCCATCTGCTGAATACCGTTAAATTTTCATCCGCACCTGCCGGGGTTACCACTCTGAATGCCTGTGACTACCTCAGCCGGGAGTTCAGCTCACGGCGGCAGTTTTTTGACGACGCACCAACGGAAATTATCAGTCGGTCATGGAAACGGCTGGTGATTAACAAGGAAAAACATATCACCCGCAGGGGATACACGCTCTGCTTTCTCAGTAAACTGCAGGATAGTCTGAGGCGGAGGGATGTCTACGTTACCGGCAGTAACCGGTGGGGAGATCCTCGTGCAAGATTACTACAGGGTGCTGACTGGCAG
GCAAACCGGATTAAGGTTTATCGTTCTTTGGGGCACCCGACAGACCCGCAGGAAGCAATAAAATCTCTGGGTCATCAGCTTGATAGTCGTTACAGACAGGTTGCTGCACGTCTTTGCGAAAATGAGGCTGTCGAACTCGATGTTTCTGGCCCGAAGCCCCGGTTGACAATTTCTCCCCTCGCCAGTCTTGATGAGCCGGACAGTCTGAAACGACTGAGCAAAATGATCAGTGATCTACTCCCTCCGGTGGATTTAACGGAGTTGCTGCTCGAAATTAACGCCCATACCGGATTTGCTGATGAGTTTTTCCATGCTAGTGAAGCCAGTGCCAGAGTTGATGATCTGCCCGTCAGCATCAGCGCCGTGCTGATGGCTGAAGCCTGCAATATCGGTCTGGAACCACTGATCAGATCAAATGTTCCTGCACTGACCCGACACCGGCTGAACTGGACAAAAGCGAACTATCTGCGGGCTGAAACTATCACCAGCGCTAATGCCAGACTGGTTGATTTTCAGGCAACGCTGCCACTGGCACAGATATGGGGTGGAGGAGAAGTGGCATCTGCAGATGGAATGCGCTTTGTTACGCCAGTCAGAACAATCAATGCCGGACCGAACCGCAAATACTTTGGTAATAACAGAGGGATCACCTGGTACAACTTTGTGTCCGATCAGTATTCCGGCTTTCATGGCATCGTTATACCGGGGACGCTGAGGGACTCTATCTTTGTGCTGGAAGGTCTTCTGGAACAGGAGACCGGGCTGAATCCAACCGAAATTATGACCGATACAGCAGGTGCCAGCGAACTTGTCTTTGGCCTTTTCTGGCTGCTGGGATACCAGTTTTCTCCACGCCTGGCTGATGCCGGTGCTTCGGTTTTCTGGCGAATGGACCATGATGCCGACTATGGCGTGCTGAATGATATTGCCAGAGGGCAATCAGATCCCCGAAAAATAGTCCTTCAGTGGGACGAAATGATCCGGACCGCTGGCTCCCTGAAGCTGGGCAAAGTACAGGTTTCAGTGCTGGTCCGTTCATTGCTGAAAAGTGAACGTCCTTCCGGACTGACTCAGGCAATCATTGAAGTGGGGCGCATCAACAAAACGCTGTATCTGCTTAATTATATTGATGATGAAGATTACCGCCGGCGCATTCTGACCCAGCTTAATCGGGGAGAAAGTCGCCATGCCGTTGCCAGAGCCATCTGTCACGGTCAAAAAGGTGAGATAAGAAAACGATATACCGACGGTCAGGAAGATCAACTGGGCACACTGGGGCTGGTCACTAACGCCGTCGTGTTATGGAACACTATTTATATGCAGGCAGCCCTGGATCATCTCCGGGCGCAGGGTGAAACACTGAATGATGAAGATATCGCACGCCTCTCCCCGCTTTGCCACGGACATATCAATATGCTCGGCCATTATTCCTTCACGCTGGCAGAACTGGTGACCAAAGGACATCTGAGACCATTAAAAGAGGCGTCAGAGGCAGAAAACGTTGCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCTACTAGCGGCCCCAGGCAGCAGGTCGATGCAAGAATGGCGGCCAGCCCGCCGGCGAAGAGCGCACCGCGCCCGTTTTGTGGTTCAGACATACGTTGGCCCTTTTGAATTTGGATTGGATAGCGTAACCTTACTTCCGTACTCATGTACGGAGTCAAGCGATATGGAAAATAATTTGGAAAACCTGACCATTGGCGTTTTTGCCAAGGCGGCCGGGGTCAACGTGGAGACAATCCGCTTCTATCAGCGCAAGGGCCTGTTGCGGGAACCGGACAAGCCTTACGGCAGCATCCGCCGCTATGGGGAGGCGGACGTGGTTCGGGTGAAATTCGTGAAATCGGCACAGCGGCTGGGGTTCAGTCTGGACGAGATTGCCGAGCTGTTGCGGCTCGACGATGGCACCCACTGCGAGGAGGCCAGCAGCCTGGCCGAACACAAGCTCAAGGACGTGCGCGAGAAG
ATGGCCGACTTGGCGCGCATGGAAACCGTGCTGTCTGAACTCGTGTGCGCCTGCCATGCACGAAAGGGGAATGTTTCCTGCCCGTTGATCGCGTCACTACAGGGCGAAGCAGGCCTGGCAAGGTCAGCTATGCCTTAGCGTGCTTTATTTAATGAGATGGTCACTCCCTCCTTCCCGGTACTATGCTGAGGACAGGCTTTCATTCGGAGAACTATCATGGAAAACATTGCGCTCATTGGTATCGATCTGGGTAAAAACTCTTTCCATATTCATTGCCAGGATCGTCGCGGGAAGGCTGTTTACCGTAAAAAATTTACCCGGCCAAAGTTGATCGAATTTTTGGCGACATGCCCCGCTACAACCATCGCAATGGAAGCCTGTGGCGGTTCTCACTTTATGGCACGCAAGTTGGAAGAGTTGGGGCATTCCCCAAAGCTGATATCACCACAATTTGTCCGCCCGTTCGTTAAAAGCAATAAAAACGACTTTGTCGACGCCGAAGCTATTTGTGAAGCTGCATCGCGTCCGTCTATGCGTTTTGTGCAGCCCAGAACGGAATCTCAGCAGGCAATGCGGGCTCTGCATCGTGTCCGTGAATCCCTGGTTCAGGATAAGGTGAAAACAACCAATCAAATGCATGCTTTTCTGCTGGAATTTGGCATTAGCGTTCCCCGAGGAGCTGCCGTTATTAGCCGACTGAGTACCATTCTTGAGGATAATAGTTTGCCTCTTTACCTCAGCCAGTTATTGCTGAAATTACAACAGCATTATCACTATCTTGTTGAGCAGATTAAAGATCTGGAATCCCAGTTGAAACGAAAGTTGGACGAAGATGAGGTTGGACAGCGCTTGCTGAGCATTCCCTGCGTCGGAACACTGACAGCGAGTACTATTTCAACTGAGATTGGCGACGGGAAGCAGTACGCCAGCAGCCGTGACTTTGCGGCGGCAACAGGGCTTGTACCTCGGCAGTACAGCACGGGAGGTAGGACGACATTGCTGGGAATTAGTAAGCGAGGTAATAAAAAGATCCGAACTTTGTTGGTTCAGTGTGCCAGGGTATTCATACAAAAACTGGAACACCAGTCTGGCAAATTGGCCGATTGGGTCAGGGATTTACTGTGCCGGAAAAGCAACTTTGTCGTCACTTGTGCTCTGGCAAACAAGCTGGCCAGAATAGCCTGGGCCCTAACGGCACGACAGCAAACTTATGTAGCATAACGGCAGAAATACACCGGTTTAAAGAATTACTGATCTGGTTTTGCGAATACTGATATTGATGATACTAACGGCCCACCGGCCTGTTGAGGAACCTGTAAAACGGAAAGGCTCATTGAAGCCGTATATTTTCTGGAGGTTCATCAGGCGCGGAACTCATCAAGGCGCGGGAATAAAATCCCATTCAGACGCCGGATAGATTCAAGCAAGCCAACTTGTCGTCAAAATCGGTGTTGCAAAAACGGGAGTGACCATAGATTCCGTTTTCTGAGACGACCCCTTGTAGGATTGGCTGTATCTGGGGACACTATAACCGTCAAAGAAGCCGGTTTGGTGTTGGTCATTGGGGTTATCCTGTGGATCTATGGTATAATCTTAACCAAGGTCAGCAATTCCTAAGGGGGTCAAATGGACGCTTTCACATTAGGCATGTTGGGGTTGCTCATTTTTTTCACTGTCGTCACTGGCGGCAGTCTGTATCTCTACCATGAGAAACAGAAGGAAAAAAAGCATCACAACGCCTAAAGAGTAACTGCGACAATGAACGTAAAGGCCAGCAATAGCTGGCCTTTTTTTATACTTGCAGTTTGAGGTGTTACATGTCACGATAAGGTAAGTGTGACATGTAACGGAAAAGGTGATTTTTTGAAAGTATCCAATGAAGATGCTCAGGCTACGGCGATCTATCTCCTCAGAGCTGCTTCGCGCCCAGCTTTCTGGCGTGACGTCCCATTCGATAAGAAACTTGAAGCCGTGGACAGCCTGAACAGCATAGGGCGATC
ACCATCAGAACTCACTGAATGGATTAATAAATACCTGACAGCAGAGCAAATCAATAAACTCGGGACATCAATTAGGCAACGTCGCAGAAGGGGATATGGTGTTGGTAAAAGCATAACTATAAGCGATAAAGCCCACAGGATTTTGAAGCGGTTGTCAGAAGTCGATGGTTGCAGTTTGTCAGAAGTGATCGAGAAACGCCTAGCCCGAGCTTATAAAAATACATGGGACCATAAATAGAGTGACACTAGGGGTTGCATTTTTAAAACCCGTGATAGCATCTAAATGAACCCAAACGAATAGGGGGGCTGGCGGCGATGCCGACCATTATCCCGACAATGGAGAAATAGACCATGATGACATTGACTACCGTCTCGAAGAAAACTTCAAATAATTCAGCCCTTGTATTCTGGCGCGTTGGTACAAAACGGAAAGGCATCCTTGATGTCCATATTGATTTTGACCACGAAGAAGCGGATCTTTTGGCTGAGCTCGTAGCCATTCGCTATCTGGCGCTGGACAAACAGGTTTTTTGCAGAGAGCCAGGTGCTGGTGCTGGTTACAAGCTGGTGGTATCCAAAGGTGCAATCAAGAAGCTGGCGCTGGGCAAATCCACCAAGGCTTTTGCTTTCAAGTTTGCGGCTTGCCTCACTGGGCGTTTAAAGGGCGCTACTATTGAAGTCTCGCAGAGCATGGAGTTTATGGATGAGCCGGGAGAGGGCAACATTGAGCTCCTCGATGTGGACAAGCAAGCCTATACCCAGACCCATGACGAAATATCTACACCTGCCATTGGCCCCGTCCTGGTTACTCAACATGCCATCGATCAGTATCAGGCCCGGATAACCTCTGGAGACCCTAAAAAACCGTGGGCCTCACTTGTTGGCCGCCTCCAGCATCCAGAGTTACAGGTTCAGCCCTTTGACGAGAAAGTGGCTCGCCATAAAGCCAGAAAGTATGGCCGCGTAGATAACGTGGAAGTCTGGGGCCATAGAGATTCCAAGTTCAAGTACCTGATGGTGATCAACGACGACAACCAAAAACGTGTTCTTGTCACAGTGTTTGAGCGAAATGAGTAACCTGCTTCTCATTTGTTCTGAATACACAACCTCTCCAAATCACGTATTCATACCCTAAAAGGGCAATAACGCAGTCGTTGATAACGACTCTCCCACCTGAAAATAGCTCCAGCATAAAAACTGGAGCTGTAAATGAGATCGAGACCTTCCCTTTTACTGATGCGAAACCTGCGGTCATTGGCCGTTGTGGTGCTCGCAACTTTGCCATGCGTTGCTTTTGCACAATGGCGGGTTGTAGCCGTGAGCACCGAAGTAGACAAAATGCGCTTCAACACCATCATAGACGCGCATTCCTTCATGAAGAATTATCGCTCTGGTGAGGAGCAAGGGAAGGGAACGCCAGTTGGGGATGCGCTTTATCCGGTTGCCAGTTATGGTGATGGCCGTTATGTCTCCAGGATCTGTTTTAAGTATCTAGGTGCTACTGGCGATTATGACCCCACTACCTGCACTGGCGACCCAGCTACGGTCTACTGGCGCTCAACCTATGTTCTGCCGGGTGAGATGGATAAAACACCGTTCCTGGACAGGGATCTCGGCTTACCAACAACGACCATGTGTGTAGGGAACCCTATCCATCTTGGCACCGGCAACAAGTTCCAGGCTGAACTGGATTATCAAAGCGGAGGCTCCGACCCTTTCACATTCACCAGATACTACAATAGCCACCTTCCTGATGAAGAGCTCGGCGGCTGGCGGCATACCTACTCGCGGAGCGTCGAGGTCAACGCCTCGAAGTACGGTGAGAACATGGTTGTCCTGCACCGGCCAGAAGGGCAACAACTCGCGTTCTACAACTCGTCCTCTGTCTGGGTGCCAACATGGAAAACCGATGATACCTTGACGAAGGATGCTACCGGCTGGCGCTACACTCAATCTGACGGGGTAGTCGAGGCCTATGATGAAACCGGACGACTGACCGG
CATCGAGAAGCCCAACGGCAATCACATCACCCTGAGCTATCTGAACGGAGAGCTATCCTCGATCACTGACGGCTTTGGCCGCACCATCCAGTTCCAGTATCAAGATGGCCGCATGGTCAGTGTCACCGATCCTGCTGGTGGCAGTATTCAGTACCAGTACAACAGCGCTGGCAAGCTGGCGGAAGTGATCTACCAGGACAATACGAGCCGCAGCTATCTCTACGATGACCCGAATGCACCGGGTTTGCTCTCTGGGCTGGTGGACGAGAACGGCAACCGTTTCGCCACATGGGGATATGATACTCAGGGGAGAGCGGTTCTAAGCGAACACGCCGGGGGCGCAGAGAAAACTCAGGTAAGCTACAACGCTGACGGAAGCGTCTCTGTCACCAATGCCCTCGGCCACGTTCAGCGATACACCTACAGTCGCCACAATGGGATGCTCAAGCCTGATGTGGTTGAGGGTGCGCCGTGTACCGGCTTTGTGGGCGGCAAGGAAACCTACGTCTACGACAGCAAAGGCCTCGTCTCCAGCATTACTGACCGCGCTGGGCAGAAGCGCACGTTCACCCATAACGACCGGGGATTGGAAACCACCCAGATAGACCAGGACGGGGGTAAGGTTACGACCGACTGGCTTCCTTCCAAGTCGCTCCCGGCAAAAATCACAGAACCAACCAGGATCACTGAGCTCACCTACGACACTCATCTCCGGGTGATAAGCCGCAAGGTCACTGATCGCAGCTCGGGCGCTTCCCGGACATGGACCTACACCTATGCCCCTGTTGGTACAGGAAAGCCGAGCCTGTTGGCCTCGGTTGATGGTCCGCGCACCGACGTCAGCGATGTTACGACATTTGACTACGATGACCAGGGCAACCTGATCCGGACAACGAACGCGCTGGGGCAGGTGACGCAGTTTGGTGACTACGACGCGAACGGTCGCGCCGGGACCATTCAGGGTGTCAATGGTGTAACCCAAACCCTCACCTATGACGCCAGAGGAAGACTTGTCAGCTCCACTGGGCCAGAAGGAACCACGGTCTACAACTATGATGCTGTGGGCCTCCTGAGTTCGCTGACCAAGCCAAATGGTGCAACCGTCAGCTATGAGTACGACGCTGCACATCGGCTGGTGGCGGAAACAGATGCACAGGGCAACCGGCGCGAGCTTGAGCTCAATGACCTCGGGAACCCAGTAGAAGAGCGACTGCTCGATGCGCTGGGCCAGACCCGTTGGATAGAGCGCCGGATCTTCAACGAAATCGGCTGGCTCTCCAGTGTCTCCGACGCCTATAGCAATCAGTCATCGTTTTCCTACGATGTGGTGGCAAACCTGATACAGGAGACCAGTCCCTCTGGTAACACACACTCCTACAAGTACGACGGCTTCCATCACCGGACACAAACGACCGATCCCCTCGGGAAGGTCACGCAGGTGCTCTACAAGGATACCGGCGATGTTTACCGTGTCTCCGACCCTCGTTCGCGCCTGACCTACTACAGCTACAACGGCTTTGGCGAAGTGACCCAGGTCCGGAGCCCGGACACCGGCACCACCGACATTACCTATGACGAAGCCGGTAACGTGGCAACGCGCAAAACGGCCAAGGGGCAAACCACAAGCTACAGCTATGACGCGCTGAACCGGATCATCGAAACCTCCAGCGATGTCGCTGGCGAATCGCCAATTCTGTACGGGTACGACGAAGCAACCTCACCATACGGCATAGGCCGCTTGACCTCAGTCGATGATGGCAACGGTGTCCGGAGATTTGGCTACACCCCCGAAGGATGGCTGGCTTACGAAACCTGGGAAACCCACGGGCAGAGCCTGACTACCCAGTACCAATACGATGGTGCAGGCCTCGTCACGAAGATCACGTATCCCAGTGGCCGTGAGGTCTCCTACACCCGTGACTCAGCCGGTGACGTCATCGAGGTGACAACGACACAAGCAGGCACCACAACAAACCTGGCAAGCCAGATTGAGCGAGCGC
CCTTTGGCCCCGTCACCAGTATGGTCAGAGGGAACGGCATTTCAGAAAGCCGCACTCTGGATCTCGATTACCGTGTCACCGGCATCGACGCTGCTAGGGTGCATTCGCTGGTCTATCGGTACACGCCAGACTCGTTGATTTCAGCCATAGACGACAATCTCAGCTCATCAGTCAATCAGTCACTCGGTTATGACGCGGTTGGCCGCATCACCTCTGCTGAGGGGATCTATGGCGTTTTGGGCTATGGCTATGACGCCACCGGCAACAGGACCTCGATCACGACCGATGGCCTGAGCCAAAGCTACACCATCAACTACATGAACAACTGGTTGGTGAAGGCCGGGCAAACCTCCAGAAGCTATGACGCCAATGGCAACCTGACGAAGCAGGGGGCGGATACCTTCACCTATGACAGCCAGAACCGGCTGGTGGCCGCAACGGTCGCGGGAGTGACTGTAAGCTACACATACAACCATCTGGATCAGCGTGTAACCAAGACCCTAAACGGGCATACCCGGCTGCTGGTTTACGACCTGGCAGGAAACCTCATCGAGGAGCTGGACGCGGCCACTGGAGACGTGCTGGCGGAGTACATCTGGCTCGATGGGACACCCTTGGGCTTTGTTCAGTCAGGACAGACCTACCAAGTCCACGTCGATCACCTGGGCACCCCGAAGGCACTGACCGACGTCAGCGGCCAAGTCGTTTGGAAGGCGAGCTACAGCCCGTTCGGTAAGGCCAGCATCATCATCCAGGGGCCAACCTTCAACCTGCGATTCCCAGGACAGTATTACGACGCGGAGACCGGGTTCCACTACAACTGGCGGCGTTACTACGACCCAGCGACCGGGCGGTACATTACCAGCGACCCTCTTGGCCTGATCGATGGAGTAAACACCTACGGGTATGTGCATGGAAACCCTATGTCCAATACCGACCCGACGGGTGAATTTGCGTTTGTTGGTGCAGGTATTGGAGCTGGGTTGGAGCTACTTAGCCAACTAATCGAAAATAATGGGAGTTGGAAATGTGTTAGTTGGTCAAAAGTTGGAATCGCCGGAGCGATTGGGGCTATAGGTGGCGGCTGGGCGTCAGGAGTTTTCAGACATGCCAGCTCCGGTAAATCGTGGTTCAAATTAAGCCAAAAATGGAGCAATGTCTCACCCAGAGTAAGGAAAGTTCAAGGGGTTCCACGAGGTAATGAGCTTCACCATTGGGCTATTCAGAGAAATGGCAAGTTTGGCAAATATGTTCCTGACTCAATAAAAAACCATCCTTGGAACTTGAAGTCCATTCCAAGAGATATTCACCAAAACATTCACGGCAATGGACCTACCCCATATAGCGCATTTGGTCGTTGGTGGCACGGGACGCCTGAATGGGCAAAAGTAGCTCAAGCCTCTCCTGTTAGTGGTGGTTTAGCTGATTCAATAAATGATGAGGGATGCGGTTGTGCAAATTGAATTTCCAGCGCTGCTTGTGAGCAGCAAGAAAAGGTCGCTCTTCGTAGTGGCATCAGAATCCGAGTTCGGGAAATGTACTATTCAGTCTTTGAGGAACGGTTATTTTGAGCTGATGGACATTTATGATTCGGAAGGTCGTCACTACAAAATAGACGAGGTTGCGAGCTACAAGCCGCTAAGTCCATTCTGGTACTGGCCTGTAGAAATTGTGATGTATGGTTCTCGACTGTTTAAGGCTAATTTCAATGCTGTTCTTATCTCTAATCTGGATTGCAAGGAGTTAAAATCTGAACTTTGTGATTTGGCTAAAAAGTACAGAAGTAATTTGGATTCCGGCGTCGGAATTGAAAAGATTATGGAGGAAATGGAGTCTGCCAGAACAATTAAAGAATTGATAAAGGTTTTTGGCTAGTTATTCATTGAATACGCTGGTGTTAGTCTAGCTAACCGACGCCAGCGGCCAAGTGTTTGGAAGGCGAGCCATCATAGAGTCAGTAGAGAGAGATACTTTTGGAAATCAATAGTATTATGATTGGT
ACAAATGAGTTGTAGCTCCATATTTTAAAGGGGGATTTCAGAGAAATGAAGAATAAGAAGGTAATAACTTTGATAATGGCCGCAGCCGCATCATGCTCGTCAGTCTACGCCGCGACATTACCGACCAGTGAGGTAGACGCATACATACTTGCGATGAACACCATGTCACCTATCACTGCAAAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATTGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCGCCGATGCGCTCAAGCCGTGGATTGCGCGGCGTGAACGCTGGCCGTCCTTTCTGATCCGGCGCGATCCGCGCGACATCAGCCGTATCTGGGTCCTGGAACCGGAGGGACAGCATTACCTGGAAATTCCCTACCGTACCTTGTCGCATCCGGCTGTCACCCTCTGGGAACAACGGCAGGCGCTGGCGAAACTGCGGCAGCAAGGGCGCGAACAGGTGGATGAGTCGGCGCTGTTCCGCATGATCGGCCAGATGCGTGAGATTGTGACCAGCGCGCAGAAGGCCACACGCAAGGCGCGGCGTGACGCGGATCGCCGCCAGCACCTCAAGACATCAGCTCGGCCGGACAAGCCCGTTCCGCCGGATACGGATATTGCCGACCCGCAGGCAGACAACTTGCCACCCGCCAAACCGTTCGACCAGATTGAGGAGTGGTAGCCGTGGACGAATATCCCATCATCGACCTGTCCCACCTGCTGCCGGCGGCCCAGGGCTTGGCCCGTCTTCCGGCGGACGAGCGCATCCAGCGCCTTCGCGCCGACCGCTGGATCGGCTATCCGCGCGCAGTCGAGGCGCTGAACCGGCTGGAAGCCCTTTATGCGTGGCCAAACAAGCAACGCATGCCCAACCTGCTGCTGGTTGGCCCGACCAACAATGGCAAGTCGATGATCGTCGAGAAGTTCCGCCGCACCCACCCGGCCAGCTCCGACGCCGACCAGGAGCACATCCCGGTGTTGGTCGTGCAGATGCCGTCCGAGCCGTCCGTGATCCGCTTCTACGTCGCGCTGCTCGCCGCGATGGGCGCGCCGCTGCGCCCACGCCCACGGTTGCTGTGGGAGGAAAATAAAGTGTCATCCTAACTACAGGACCCTGCAACGTAGCAATTCCTGCAAAAATCCATTTAAAACAATGACTTATTTCAATTCTGCCATCTGCAAGCGTTTTGTGGGGATGGAGGCCGGAATCATTAAAGTATCATCCTAATTTTCGCTCAACCTT
GGAAAATGCTGAGAATCGTAAAAATAAAGTTTCATCCTTTCCGCTACTAGTTCAAGGCTTTCTTCCGGCCGCCGGCAACAACGCTGCTTGGCTGAGGCGATCCACTTGCGCTACCGCTCACTGATCAGATCGAACTTGAACTCGACGAGCAGCCAACCATCAGGCCGAGCGTGATCGCCAAAAAAAACGGATCATGCGAAATCCTTCGTTGAGTTTGGCGCGGGTTTTAGCAAATCAGCTGGCCCGCGATCCGGTTGAGAGTGATCCGAAGCGGTCCTTGACGACCGGCACAAATCGGCCGATTCTGTTGAAAAAGTAGCTTTAGCGGCAGCCTGCCGATCAGGTGTGCCTGCTGTCGAAGTGGCTGCAAGCCACTTCAAGTTGCCTTCCGGCGTTTCACTGAGCGTCCTTGCTCAGGTTTAAGGGTTAATTTGAGGGTTTCTGCTCGTAGCAGGCATACCTATCCCGTCAGCGGTGGCCCTTGAGGCAAAAGCTTGGCCATGCGGCGCAGGTTCTGCACCATCGCAGCCAAGGTGAATTCGTCAGTGGCACCCGTTAGGCCACGCAGTCGTAAACGGTCGAGTTTCATGATCCGTTTGAGGTGGGCGAAAAGCATCTCCACCTTCTTTCGTTCGCAGCGAGAGACGAGGTACTCCGGTGTCTTGGCGATGCGTCGAGCCACGTCGCGGGCAGCCTCATGGATGCTGCGGACGATCTTCCGATTCGGCGTGTTGGGGCAGCATTTCGCTTTCAACGGGCAGGTGGCGCAGTCGGTTTGGCTGGAGCGGTAAATGACGGTTTTGGCCTTAGTTACCCGCGACCTTTGCTGGGTGAAGGCGCGCCATTCACTGCGTAGCGGTTTGCCGGCTGGGCAGCGATATTCATTGGCGTCCTGACTCCAGTGAAAGTCGTTACTGGAGAGGCTGTCGTCCTTGCGCTCGGTCTTGTCCCACACCGGCACATGCGGTTCGATGTCCTTTTCTTCGACCATCCAGGCCAGCATCGGGGCGGTGCCATAAGCGGTATCGCCGATAAGGCGTTCCGGTGTGAGATCGAACTGCGCCTCGACACGCTCGACCATCGTCCTAGTCGAATCGACTTCGGCGGTACGGTGCGCCGGGGTAGCTTCCACGTCCATGATCACACCGTGCTCAGTGTCGATCAGGTAATTCGTGGAGTAGGCAAAAAAGGCCGGGCCACCTGGCGCTGCTGTCCAACGGGACTGAGGATCAGTGAGCGAAATTTTCTTGGGAAGAGCCTCAGCCAGCGCCTCTTCATCAAGGGCTTCGAGGTACTCGCGCACTGCGCGGCTGCTGAGCTTTGGATCGTTCCAATCGACCTCATCTCCCGCCACCCCACGTTGCCGGCTGGCATCCGCCTTAATGATGCTGGCGTCGACGGCGAAACCTTCACCCTTGACTAGGCCGGCTGCCATGCAGCGCCGCAGCACCTCATTGAATAACCAGCGGAATAGATCGCTGTCACGAAAACGCCCATGGCGATTCTTCGAGAAGGTCGAGTGATTGGGGACTTCGTCTTCCAGACCCAACCGGCAGAACCAGCGATAGGCCAGGTTCAGGTGCACCTCTTCGCACAATCGCCGCTCGGAACGAATGCCATAGCAAGTAGCCGACGACCAGCATGCGCACCATCAACTCCGGGTCAATCGAGGGACGCCCGATGGGGCTATAGAAATCTGCCAGGTAGGCACGTAGATCACTGAGATCCAAGCACTGGTCGATGCTGCGCAGGAGATGTTGGGCCGGGACGTGATCTTCCAGATTGAACGAGTAGAACAGGCGCTGCTGTCCTCCCGGTAACTGTCCCATCATGCTGTTCGCCCCCACGCTCGCTGACAAAGCAATTTTGCCAACGGCATGGGGAGGCCGCTACTTTTTCAACAGAATCGGCCGCACACAGCCATTCGGCTCAACAGAACCCTGCCTTTACCCCACCAAACTCCGGAAACTCGCCCGGTAGTCCGTCGGCTTGAGATGCAC
ATGTTGGCTGAGGTCGCCGGGTAGCGTGAACAGGCGCGGACCGTCCCGAAACTGGAACACGCGATACAGATGGAATTGATCGCCCGCCTCCTTGGAGAATTCGAGTTCGTTGTGGCTGACCAAGAAAGACGAGCCTACCCCGCCATTGGTGGTTTTCACCTCGATGAAGCGCTCATGGGCGTCCTCTTCGAACGACAGGATGTCGAACCCCGCACCGTCTCCCTGGGTGTCGGACACCCAATCCAGCCGCTGAAAAAGCTCTGGGTGGCCGAGCTCGGTCAGGCGTTGCTGTTCGTAGCCAATCACCCACTGCTCCCCTGCCCGGCCCAGCTTGCGGTTGGCTTCATCGCGAGCGGCATAATCGAACTTTCGCGGTAGGCGTTGCCGTAGAGATGCCGGGGTACGCACAAGCACTTCACGGGCGGGTGGTTCTACCAAAGCCGCTCGGTAGGTTTTGTCACCCGGAAGTTTTACCTCCTCCAGGGCATCGACAAGAGCGCCGACCGTCTGCTGATGTTCCAGAACGTAGGCGTGTACGGATTTACGCAGCAGCAGTTGGCTGTTGCCGCGTGGCTTGTAGCCGTTGATATAGGGCAGGCCCAGGGCATCGAGTACGGCGCTAATGTTCTGGTGCTTGAGCTCGACTGAAGACTTGCTGCGACCGTTCAGCAGTTGGCGCAGTGCCTGGTTGTGCTCGGACTTGTTGTACGGCTCCCCAGCCGCCTCGGCACGCAGCATGTCGAAATAGTCTTCGACCGTGGCCAGGACCTCTTCTTCGGACCAGTCTTCGCCGATGCGAATGATGCGAAACCCGAGCCGCGTCAGCGCCGGAACGACGGTCGCCTCGCCACCGGAGAAGCTGTCAGCAGTGAGCGGGCCCTGCTCGGGAAATTGCTTGCCGAAGGCCACACCGGCGATGGCCTTGGAATCGCAATCGGTGCCGGTCTTCGGATCACGTACCAGGAAGTCGCGGGACTTGCCGTAGCCGTGGCGCGCCAGGAATTTCGTGCGGCCCAGTTGCACGAACTCATCGATGGCAGCCTGCACGGCGGCGGGGCTTCGAAGCTGGGAGAGTTGAGACACAGGGTCCTTCCTTACTGTCATGGTGTGCCGGGAACCGCCGAGCCACGAGATTATGAGTAGCCCCTGAACAGAAACGTCACGATAAAAGCCGTGAACGCCACCAGGCCCATTAAATCCCTTGCGTATTTGCAGCCCGTGCTGTCCAAACCTGTACCAGGTCCGATCAACACGCTCCAACCATTGAGGTACGAAAACACCGCCTGACCGAACAAAAAATGCGTGATAGCCGCCGCAGCCATGACGCCGGAATCGTCAGACAGGCTGTAGCTGTTGACCATTGCCCTGTACGCCTGCATCTGAAATGGCTATTGCCCTGATGGGAGCCGGCTTTTCAGCCACCGACACCAGCGATGCCGTCAATATTCTTTACCCGATAACCATGACCGTACAAGCTAACAAGGCCTGGCAAGCCAGCGGACTTAAAAAGTCTTTTTTTCTACCATCCCACAAAAAAGTTCGTGGTGGAGAAAAGATAAGCTATGCAAGGCTTTAGGAGACGTGGTTTTTCAGGATGACGAAGAACGATTCGGCGCTAGGTGCAACATAGGTGCATCGCACGAGCGCTAGGAACGGCGAAAAAAGGCGGACGTGGCGAAATCGGTAGACGCAGCAGACTTTAAAATTGGAGTGCCCGCGGGGAAATCCGCGGAGTAGAACCGCTCAAAGTCGGGGAACGCTAACGGGCAATACCCTAAGCCAATCCCGAGCCAAGCCCCTTCGGGGGAAGGTGTAGAGACTGGACGGGCGGCGCCTAAAGCCTTCGGGCAATGGCGAAGGGACAGTCCAGACCACGAACGTCATCAGACGGCGGCGAAAGTCGAGGTGGTACGAAAATCTGCTTCTCTGTGAGAGTACGGGTTCGAGTCCCGTCGTCCGCACCACAAAGCCAAACATCCCTGCGATGATCGACCTCTGGG
CGTTTGGTCGTGAGCCCGCCACCCTCGCGCTACGCTTTGCGCAGGCGTCGAAGGCGATCAGGTGCGCCCATCGATTCCGTCGAGCACCATCGCTGCGAGTTCATCGCTGGTGTGTGCACCACTATCGAGAACACGGGTTGTGCATGGAAGCCGTTCCCTTGCGGCCAAGCATCGGGCGACATTCGCTAATCGCCACTCTCGAATCTCCGCATTTCGATTCGGGTCAGGATGCATGGTCTGGTTCGCGATCCGGTGACGCAATAGGTCCTCGTTGAGCGTCAGAAAGATGTGCAGCAGCTGATCGTCGATCCGCCTTACCCCGTCGAGTATCTCAGTCAGATAGTCCGGGTGCACGAGCGTCATTGGGATGATGATGTCCTGCGAGTAATTCCTTCGAATCTCCCTGACCGCCGCGATCGTAAGTCCCCTCCACAAGGGGAGATCCTGATAGTCTCCGCTCGCTGGCATGGGGACCGTTTCTTTCACCACGAACCCGATTTCCTCGGGGTCAAAGATCAGCGATTTGGAACGCCGATCGCGCAGCCGCTTAGCGAGCGTCGTCTTTCCGGCGCCGAAAGGTCCGTTGATCCAGATTATCATTGTCGACGGCCTCTAACCTGAAGGCTCGCAAGAGCGCTCGACGGCCTCGTGCGGAGGCACGATCGGAGTGGTTCCGAAATGCTTCTCAAGATAGGTGACGCCGAACGTCACGATGTCCTGCGCGTCGAACAGGTAGCACTGAGCAAAGCCCACGACACCTTCTCGATGGCGACCGAGCTTCACGTAAGCATTTGCTATAGTTTCAACCGCATCCGGCTTTCCTTCGATAGCAAAGCAATCGAGAATGCCGTTTGAATCGTAATCCGATGCCGTTTTCCAGGCGACTTCACCGTCTCTTCCAAGCATCGGCATCTCATACGTCACCCACCGTTTGTTGGGGATATCGGCAACCGCCTCGGCGTAGTGCAATGCGGTAACGGAGTTTAGCGGCGCACCCAACAGCAGGGCCTTCCCGCCAAGGCGAACGAACCGCTCGACGGGCGATCCTTCCCCCAAGGCGTGACCGAGTTCGTGAGGCTCCGTCAGCGTTTCAGCCAGCGGACCAACCGCGACCATCGATGCATCGGGGTGCGCGCTGCGCCGCGCGCCGGGGGCTTGAACCAGAAATTGATTCAGCAGGCCGAACCCACGGTAAGTCCCGGCTGTTGCGGGATCGAACGGCAGCCAGGTACGGCGGGCTTCGTCATCCAGCCGAGCGCCATTCAGAGTCTCCTCGTAGGGTGATCGGTCCCACGACGCGTATCCCATCACAGTGCCAGTCGGCCCAACCGCGGAGCGTAACGCGGCAACGACCGTCTCCGCTCCTCCTTCGACCGGACCAATCGCTTTAAGTGAGGCATGCACCATCAAGAGGTCACCGGTTTGGACTCCGAGTTTTTGAAGCGCCTCCGTTATTGCCTTCCGCGTATGCATCGCGATATCTCCTCTAAACTGCAAAACACTATACCTATCGAGATATCACTCTACTATACCTATCGAGATATAGAGGTGGTCCCACTTGTTTGAACAACTAAAAGCGTATTTATAAGTGATATTCCGCTCTAGTTAAGCCACCTTGTTTTGTTGGGGTAGCTGATCATAGTAAAACTCATTTGGTGTCATTTTGTCTAGACTCGAATGAGGTCGTTTCAAATTATAAAACTCAAAATATGCACTTAATTGCTTTTTCGCATCTGTGACACTGCTATAAGCTTTGAGATACACCTCTTCATATTTAACGCTCCGCCATAATCGTTCAACCATCACATTATCTACCCATCGACCTTTACCATCCATACTGATTTGAATGCCATTTGATTTCAATACATCAATAAATGCATCACTGGTAAACTGGCTGCCTTGGTCTGTATTAAATATTTCAGGTCGACCATATTTTTCAATCGCTTCATTTAAAGCCGAAATACAAAAATCCACCTCCATACTAATCGATACCCTATGCG
CAAGTACCTTGCGGCTATGCCAATCAATCACAGCACATAAATAAACAAAGCCTTTTGCCATAGGGATATACGTTATATCCGTAGACCACACTTGATTACTGCGCTGAATAGCCAACCCTTTGAGCAGATATGGATATTTACGGTGAGCTTGATTAGCCTGGCTTAAATTTGGTTTGCAATATAACGCCTGAATACCCATTTTCTTCATTAAAGTACGTGTATGACGTCGTCCTATATGATGTCCTTGACGATTCAACAAATCACGCATCATACGACTGCCTGCAAAAGGATATTGCATATGTAATTCATCAATACATCGCATCAGCTTCAGATCTGATGCACTCACAGGTTTTGGGCGATAGTAATAACAACCACGGGAGACTTTCAGCAGCTTAGCTTGCTTAGATACTGAAATCTGAAGTGAGTCGTCGATTAACTTTTGTGGTTGAAGCGGCCCAGTTTCTTCAACACACCTTCTAAAAAATCAATTTCTAATGCCTGCTCACCGATTTTTGCATGTAGTTTTTTTAGATCGATGGGTGGTTCTGTTGGAGCTTTTGATTGATCGAAAGCTTGCGAGGAAGCTGAGATCAATTGATTTTTCCAGTCAATAATTTGGTTTTGATGAACATCAAACTCAGCACTCAATTCAGCAAGTGTTTTTTCTGCTTTAATCGCAGCAAGTGCTACCTTAGCTTTAAAATCATTTGAATGATTTCTTCTTGGTCTACGTGCCATAAAATACTCCATATATTGATGTTTATAACATCATTTGAGGAGCAGAATATCACTTATAGGAGTTGTTCAAATTTACGGATCCATCTCTGTGCCTCGTTGAGGTTCTGAACTTCATCGATGATCAGCACTCCGAGTGCGTGCAGGTTGGCCACCTGGCACATGGATGCCATCAAACGTTTGGTACCCAGCTTCTTGCGTCCGTGGCTACGGCTGTAGTGGGTGCCCAGGATCTTGTCTACCTCGTTGAAGAAGCTGAGGCACAGCTCATCCAGGTCACCGTCAATGGGGCAGTCGACCTTCAGATAGACCAATTGAGTGATGTTGTAATCGGGGTGATGGAGAGCCTGGGGGTACATACCCAGAATCCGCTCCAGCGTGCGCGTCTTGCCGCAACCGGAGCAGCCAAACAGCGACAAACTGTTGGCTGTGGAGGTAACACTCTGGTACACCGCCGCATCAAGATCCTCCTCCTCCACCCGGCGATAGCCATTCTGCAGGTGTGCATACCAGGCACCGCTGGCCGGGTTCCGACCGATGTAGCCCTGACGGATCATCAGGCTGATCTTGCTCTCCAGCTCCAAGTGGTGGCTGAGTGGCTGAAAGAAGCCGTGCAACAGACGGGCAATGGCATGAGCCCGGAGCCGACCATCCAGAAGCGCCTCCTGCGGCTCGAAACTGGGAAGTTGCTGCATCAGGCCGACCACTTCCTGCAGATCAGGAATCGGAGGTAATGCACTGATGAGTGGATTGTCCTGATACTCGGGAAGCTGCTGCTCATGATAACGCGCCAGGGGAATCACTCCGCGCTGTAATTCTTCAGTCATCTTCTTCCTCCTTGAATATCAGGTCACTGAGATCCGGAAAGGCATAATCTTCCTGCTTCTCGCCCCGCAGTGGGATCACCTCCGCGGGTTTCGCCCGCTGTGCTTTCTCAGGTTTAAATGCTGTTTTCAGGCGCTCCTGGCGCTTTTCCTGCTGCTTGTTCTCGCGGATCTGGGTGCCCAGATCTTTCTTGCTAATGCCAGTTTTGAGCGGACTGGCGTTCTCTGCCTGGGCAACAATCGACTCGATCTGCTCCAGAAGCTTGCCTCTTTCCGCCAAGGCTTTTGACGCCGCATTGACATCGCTCCGCCGTTCCTCTCTGGACAGTATCCAGACATCCCAGAAGGTCATCCCCCTGAAACGTCGACTACGGTCGGCAAGATCGCAAACCCAGTAATCCTTGAGACTATTGGACGGCCGCAAGTAGATATGGTCCGCAC
TACGCGGGTCGTATGCGACCGTTACCCCTGTCGGCCGACGCCCCTGGCCTCGGTGAAACCAACCCTCCCTGATTGCTTCCGGGCAACTGTAGAAGCAGCCGAATAGCCTGATTCCCAGCTCCGAAACCGTGGCCGACTCATGGGACAAGAGATTGATCCACACCAGTTCCTCCGGCGCAGTGCGCAACCGACCGGTCAAGCTGGCCAAGCCCCAGTTCCACAGCATGACTGGAATCGCCGGCAGATCGCCCGGCATTCCCGCAGCCCTGTCGTATTTGCTCAGGGTGTGGAAATTGTTGTGGTGGAGGATGCCGGCAATGATGATTTTCGTGAATTCGGGCAAGGTCAGACTGGCATCAAGCCTGTAGTCGTGGCCACCTCGCTTCCGGCTGGTAGTGTCCTCGACCACACCGCTGGCGTAAGGCTTGAAACGCTCCTGCACCGTTCGAAAATAGCGCTCGACGATGCCCTTGGCATCACCTCGCCTGGCCGGTGCGTTCTCGATGCGGACGCCGAAGGCTTGGGAAAATGCCTCAACCTTGGTGCCGTTCAGTTCGCCCTTATCGGCCAGTATCACATCCGGCAAACCCTTGACCGGCCAATCGCTGGCATCGATCTCCAAACCATACTGGCGACAATACTCGACCTTGTCGGCGACCGTGTTGGCCAATGCCACCATGGCACTGACCCAGGAGGGCCCCTCGAATCCGACGTACATACCAACGACCATGCGGCTGAACACGTCGAGCACCATGTAAACGACAGGCCTGCCGACGATAAGGCTGCGGTCATGCTCTGAAACCAGATAAATATCTGCGATGGTGGCATCTATTTGGTAACGGTATCCTGGACCCAGGGTCTCAGTCGTCGATGTGCTGTTGAGCGGCCGGAAGTCCTTGGCAAAGTCCACTGCGGACATCCTGCGCGGCAAGGTGTCGGTGAAATGATATTCGCGCCCATAGAAGTACCGGAACTGCCCCAATGTAGGCAGTTCGGAGGTCGGCAGCTCCGGCTGAACAGCACGGAGCAGGTTCAGCCCGGAGGCGTAGGCATCAGGAATGGACGGATGTTTTTCCTTGAGCAAGCGTTCCTCGATGACCCGGCGAAAAATGCGCTCGATATCCGGCGTGACATTACGCCCTTTCCCCGCCATCACCACTCTTGGCCTGCCCAGCTTGGCTCTGTTCGGCTTTCGGCGCTTGCCACGGGCACCCGAGTTGACATAGTCCGGCAAGAGGGCGTTCCTACACATGCCTCGTTGCCAGTAGCGCCTCAGTAATCTGTACACCGTCTGTTTGGTGACACCATGGCGTTGCATGATGCTTCCGACAATGAGCCCTCTGGGGCGCCGCACGAAAAGCTGGGGATCGTGCATATAATCCGCCAGCATCGCCCACGCCTCATCACGTTTCAGCTGGTCAGGGGAGCCTGCCTCCACCTCCCGAAGGACCGTCTCCTCGAAGGGATCGCCAATGCTCTCCAGCTCACCCTCAATAATCAGACGCTCCAGCTCTGCAACCGATATCGATTCGGGCAGCGCCGTGTCCAAATCGATATCAATCCAGACAGCCTGCTCGGTACCGGACCAGAGCAGGCGCTTTCGGAGCTCTCCCATACGGAACACCTGATTAACGGACAACACTGATCAGCTCCTGATGAATCTGATGAGGGTTGGCAGCTAGATCCTTCGGCTTCAGCACGCGGTAAGGGGTGTCAAGATCGAAGAGAAACCAGTGCCGAGCCAGCAGCTGGCGTAGCCAATAGAGTGCTTGACCAGCCTCCAGTTGGCCGGAAGTATCCATTTCCTGGGCAATGGCGGTCAGCTTCCGGTCTGGGTGACGCTGAAACTCGAGCAAGAACAGCTGCTGATAGTGTGCGAGATCATCCTGTGCAATGTCGTCTTCAGCATGGGCCGGATAGAGCCATTGGATATTGGCAAATGCCTCTCTGGATACCTCGCGCTCCGTGATGATGAACCAGGGAATCCCTTTTTCCTGCCA
GTAGCGCCTTTCAAGTTCAAGTCTCTCGATGACTTCCGGCTTCTGCAGATCGGCACTGTACTTGGCCTGGATCGCGACGGATGGGCGCTGGGGGTCATCAAAATCCACCAGGAAATCGCTGGTCAGAACCTGAGGGATTCCTTTGTAGCGACCGTGAGCAAGGCCAAGTTCCTCGGCGATGCGCACCGTATCTTCGGCCCGCATCGGGAACTGCTCACGGATATCAGTGACCTGGGGAGACCGATCCAAGGTCAGGAAAATGGCCAGTTCAAGATCGGATAAAAGATGGTGCAGACGCCGGGTCTTGCTGCCAGGCAGGCGGTGGGAGCGCCCCAACGAGGACACATCCCTGGTGTAGATGAACGGCTTGTAGTCCCGTCCCTGGCCTTGGCCACGCCCCTCCTTGAGCCTTCTATCAATCTGTGCCTGGGTCAGGCCCTTGAATGTTCCAGACATGAAAAAGCCTGCATCCACCGTGTCGATGACCTCAACATAGCGGATACAGGCTTAGGATGAAACTTTATTTGTAAGGATGATACTTTATTTGTACGGATGAAACTTTATTTGCATCCCACAGTTGCCGGAAATGGAGCAACTGGCTCTGGCACTGCTGCGCAAGGTCGGCGTGCGCATGCTGGTGATCGACGAGCTGCACAACGTGCTGGCCGGCAACAGCGTCAACCGCCGGGAATTCCTCAACCTGCTGCGCTTCCTCGGCAACGAACTGCGCATCCCGTTGGTTGGGGTAGGCACGCGCGACGCCTACCTAGCCATCCGCTCCGATGACCAGTTGGAAAATCGCTTCGAGCCGATGATGCTGCCGGTATGGGAGGCCAACGACGATTGCTGCTCACTGCTGGCCAGCTTCGCCGCTTCGCTCCCGCTGCGCCGGCCTTCCCCAATTGCCACGCTGGACATGGCTCGCTACCTGCTCACACGCAGCGAGGGCACCATAGGGGAACTGGCGCACTTGCTGATGGCGGCGGCCATCGTCGCCGTGGAGAGCGGCGAGGAAGCGATCAACCATCGCACACTCAGCATGGCCTGTTGAGTTGCATCTAAAATTGACCCACTGGGGGTGCGGACGATTTCTTGGACGGTTTATACGGACATCAATCCGACCGCATGACGATACTCGATGGGACTACGCCCGCCAAGCGACACTTTGATGCGGCGCTCGTTGTACCAGTGGATATAGGCATCGATTCGCGTCATGAGGTCTTTCAGCGTCACGTGCTGCCAATTCCTCGGGTAGATTAGTTCGGTCTTCAATCGTCCGAAAAAGCCCTCGCATGCAGCATTGTCTGGCGAGCAGCCCTTTTTGGACATCGACCGCGTTAATTGGGCATTTTCAGTGCGGCGGATCCACGCAGGCCAGCGATAATGCGAGCCCCTGTCCGAATGGATAACCGGATGCTCACCGGGTCGCAGTGTCCGTACCGCGTGATCCAGCATGGTATTGACCAGGTTCGCATCCGGGCTGGTGCCGATATTCCAGGCCACCACCAGCCCATCGAAGCAATCGACGATCGGCGAGACGTAGACCTTCCCTGCCGGAATGTGTATTTCCGTCAGATCGGTCAACCATTTCGTATTCGGCGCCGACGCGTGAAAGTCGCGATTCAGCAGATTCGGGACCGCTGGTGTCGGGTCGCCAGCATACGCCGAGAAGCGCCGGCGGCGCGGTGTTCTCACGACCAGACGCTCTTGCGCCATCAAGCGACGCACGACCTTCTCGGACACACGCATGCCACCAAGGCGCAAGGCACTATCAATGCGTCGATAGCCATAGCAGCGGTAGTTGTCCTCGAAGATAGTCCGAATGACCTCACGCACCTGCGTGTACTTGTCGGGCCGCGTCTGCCGCAGGCGTTGATAGAAGTATGTGCTGCGCGCCAGCTTCAGGCCGCACAACAGATTGGCTAATGGAAACGTGACTCTGAGGGCATCAACCACCTTCGTTTTTTCTCGGCTTGTCAGTTCGAGGGGGTTGATGCCCA
TGTCTTTTTTTATCAATTCACTCGCCTTCTCCAGAATTGCATTCTCCATGCGAAGCCGCTGGTTCTGGCTCTCCAGTTCGGCCAGTTCCCTGAGTAGTGCCTCATGCCGCTGCTCGAGCGAGGTGTCACCTTTCTTCTTTGTCATGGGTTTTAGGGGCACTTTGCCAAGTAATCGATGCTGCCAGTTATACAACGTTGGTCGCGATACACCGACAGTGTCGGCCACATCCTTTGCCGAACCTACGCGCAGGTTCAGTGCAATGACGGCTTGCTGCTTCTCGAGGCGAGAGCGGGCGACTGTGGGAGCGCTGCTGCCGACGACCGTCCTAGCGAATTCAGGGCGTAAATCACGGATCCAGGCACGCAAGGCCTCGCGGCTTGGGTAGCCCAGGCTTCGGATTGTGTGACTCAGGCAGTAGCCTTGTTCGATATAGTGATCTACTGCCCGTTGCTTTTGCTCATCGGTGTACTGCCGTTTTATCCGTTGATAGCCTCGGCGAAGATCCTGATTCCGTTCGAATTCTGCCAACCAGGCCTTCAGCGAGTTCTTGGTGGGGTATCCCAGCTGCCGTAGTGTGGCGCTCATCCGGCGCCCAAGCTTCAGGTACAACCTCACGGCTCGAAGGCGATCTTCATACGAATACATGAACTACTCCTAAAGTAGTCCAAGATTTTGTCCGCACCCCAACTTAGGGTAAAGATTTGCGTCGAAATTTGACCCACGTATGACACTGTTTCCCGTCTGGATATGGCGGGAGAAATCAAGGAGTGATAAACGTGGCGATATTGAGCGCAATTCGACGCTGGCATTTTCGCGATGGTGCGTCGATTCGGGAAATAGCCCGACGAAGCGGCCTGTCCAGGAACACCGTTCGCAAGTATTTGCAAAGCAAGGTGGTTGAACCGCAGTACCCAGCGCGAGACAGCGTTGGCAAGTTAAGTCCTTTTGAGCCCAAGTTAAGGCAGTGGCTCTCCACCGAGCACAAAAAGACAAAGAAGCTGCGCAGAAACCTGCGCAGCATGTACCGGGATTTGGTCGCTTTGGGCTTTACCGGGTCTTATGACCGAGTGTGTGCCTTTGCCCGACAGTGGAAAGATTCCGAACAGTTCAAGGCGCAAACCTCGGGCAAGGGTTGTTTCATCCCCTTGCGCTTTGCTTGTGGCGAAGCCTTCCAATTCGATTGGAGTGAGGACTTTGCCCGCATAGCGGGCAAACAGGTCAAACTTCAGATTGCCCAGTTTAAGTTGGCCCACAGCCGGGCCTTTGTGCTTCGGGCTTACTACCAGCAAAAACATGAAATGCTGTTTGATGCCCACTGGCATGCCTTTCAAATCTTCGGTGGCATTCCCAAGCGCGGCATCTACGACAACATGAAGACCGCTGTGGATTCGGTGGGGCGTGGCAAAGAGCGCAGGGTCAATCAGCGGTTCACTGCCATGGTCAGCCACTACCTGTTTGATGCGCAGTTCTGTAATCCAGCATCGGGTTGGGAGAAAGGCCAGATTGAGAAGAACGTGCAGGATTCCCGCCAACGCCTGTGGCAAGGGGCACCAGACTTTCAAAGCCTTGCTGATTTGAATGTGTGGCTTGAGCATCGCTGCAAAGCGCTGTGGTCTGAGCTGCGCCACCCCGAATTGGACCAAACCGTGCAAGAGGCCTTTGCCGATGAACAAGGCGAGTTGATGGCGCTACCCAATGCCTTTGATGCATTCGTGGAGCAAACCAAGCGAGTCACTTCAACCTGCCTTGTTCACCACGAGGGCAATCGCTACAGCGTTCCTGCCAGTTACGCCAACAGGGCCATCAGCCTTCGGATTTATGCAGACAAGCTGGTGATGGCTGCCGAAGGCCAACACATTGCCGAGCATCCAAGATTGTTTGGCAGTGGCCACGCTCGGCGTGGCCACACACAATACGACTGGCACCATTACTTGTCTGTGCTTCAGAAGAAACCTGGGGCGTTGCGCAATGGTGCGCCATTTGCTGAATTGCCACCCGC
GTTCAAGAAGCTTCAATCCATCTTGCTGCAACGCCCCGGCGGTGACCGTGACATGGTGGAAATTCTGGCCCTTGTATTGCACCACGATGAAGGTGCGGTACTCAGTGCTGTGGAATTGGCATTGGAGTGTGGCAAGCCATCGAAGGAGCATGTGCTTAATCTGTTGGGACGTTTGACCGAAGAACCTCCACCCAAACCGATTCCAATTCCCAAGGGGTTAAGGCTGACATTGGAACCACAGGCCAACGTGAACCGCTATGACAGTTTAAGGAGAGCCCATGATGCAGCATGAAGGCCATGTGAGAATCCTCAAATCCTTGAAACTCTTTGGCATGGCACACGCCATTGAGGAGTTGGGCAATCAGAATTCACCAGCATTTAATCAAGCCTTGCCCATGCTGGACAGCTTGATTAAAGCTGAAGTGGCAGAGCGTGAAGTACGTTCGGTGAACTATCAATTGCGGGTGGCCAAGTTCCCCGTGTATCGGGACTTGGTGGGCTTTGACTTCAGTCAAAGCCTGGTTAATGAGGCCACGGTCAAACAATTGCACCGGTGCGACTTCATGGAACAAGCCCAGAACGTGGTGCTGATTGGTGGGCCAGGCACAGGCAAGACTCACCTGGCCACAGCCATTGGTACACAAGCAGTGATGCACTTGAACCGACGGGTGCGTTTCTTCTCCACCGTGGATTTGGTCAATGCACTGGAGCAAGAGAAATCATCTGGGCGTCAGGGACAAATCGCAAACCGTCTGTTGTATGCCGATTTGGTGATTCTGGATGAGCTGGGATATTTGCCTTTTAGCCAAACCGGTGGGGCACTGCTGTTTCACCTGCTCTCAAAGCTGTACGAAAAAACCAGCGTGATACTGACCACCAACTTGAGCTTCTCGGAATGGAGCCGAGTGTTTGGCGATGAAAAGATGACAACAGCGTTGTTGGACCGACTAACCCACCACTGCCACATCCTGGAAACCGGCAATGAAAGTTACCGCTTCAAACACAGTTCAACTCAGAATAAGCAGGAGGAAAAACAGACCCGCAAACTGAAAATCGAGACATAATTCTGACAACAAGGGGTGGGTCAAAATTCAATGCAAATCCCGGGTCAAATTTGGGTGCAAATCAACAGCGTGGGTTTGATCACGTGTCGAAACCCTGGCGTACCCCACCAACATGCTGTTCTCCTCTCAAAACACTTAGAAACGTAGATCGGTGAGAGATCGGAAGCGAGAGGCAGTTTTGAGAGACCTTCAACCTTCGGCAGACTGTCGGCTTTGGTATCTCTCATAAACGGATGTTTTTGAGAGAACTATCTTCGGCCTTCACACGCACGAAAGGCGGCGAAGCTCCGCCGTTAATCCGTCCGCCGGAGATCTCGCCCAGGCAGGCTGAAGGCCGAGCAAGCCTGACAGGCCCGAAAAGCCCGGCACGGGCGTCGGCGGCGATGACGGCGGCGGCATTATCCAGGGTTGATGATGGAAGTGGAGGATATCGACAACCTCTCGCGCAACCAAGACATCGCGGTCGGACTGCAAGTGATCTTGAAGCCACGGGCCCGTCCCACCCCGACATGGACCTCGATGCCCGAACGGACGTTAGATTTCGAGTTCTAGGCGTTCTGCGATGAAGGTTGGATCCCAGCCGGGATTGAAAGTGTCGACGTGGGTGAATCCGAGCCGCTCGTATAGGCCACGCAGGTTCGGGTGGCAGTCGAGCCGCAGCTTGGCGCACCCCTGCGTTCGCGCGGCATGGCGGCAAGCCTCGATCAGCGCGGAGCTGACACCCCGGCCCGCATGTGTCCGTCGCACCGCGAGCTTGTGCAGATATGCGGCCTCCCCCTTGAGGGCGTCGGGCCAGAACTCGGGATCCTCGGCCGACAAGGTGCAACAGCCGACGATGCCGTCGCTGCAACTCGCGACTAGGAGCTCGGATCTCAGGACGAAGGTCTCCGCGAATGTCCGGTCGATCCGCGCGACGTCCCAGGCGGGCGTTCCCTT
GGCGGACATCCACGCCGCAGCGTCGTGCATCAGCCGCACAACCTCGTCGATATCACCCGAGCAGGCGACCCGAACGTTCGGAGGCTCCTCGCTGTCCATTCGCTCCCCTGGCGCGGTATGAACCGCCGCCTCATAGTGCAGTTTGATCCTGACGAGCCCAGCATGTCTGCGCCCACCTTCGCGGAACCTGACCAGGGTCCGCTAGCGGGCGGCCGGAAGGTGAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCGACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCGGATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAAATTTAAGGGGAAAACGTGTCCCGGGTCAACTGCAGGGCGAGATCGCGAACGTTTTTTAAACAATAAACGATCAGTTTGACGATCTGATCTAGTTGTGATCATATACTCCTTTTTTGGGAGCATGCCATGTACCTTGAACTCTTTACTACACACTTTGACACCATCATCGACAACCGTCAGTCTGCAAAAGTTACCTATCCTTTATCTGATGTTTTATTCGTGACCTTATGTGGCGTGATTGCAGGCGCAGAGGGATGGTCGGAGATCCATGATTATGCTAAAGGTCATCACGAGTGGTTTCAGAAACAAGGTTTTTTAAGTGATGGCGTCCCGGTAGATGACACGATAGCGCGTATTATTTCCAAAATTGCTCCGGAACAATTCAGGCAATGTTTTATCAACTGGATGCAAGCGGTGCACAAGCTAACGCAGGGAGAAGTGATTGCCATAGATGGCAAAACGCTACGTAGCTCTTACCATCCAGAAGACAGAAAATCGACCATCCA
TATGGTGAACGCCTTTGCTTGTGCGAACAAAGTGGTGTTAGGGCAGCTGAAGACCGTGGAAAAATCGAATGAAATCACAGCCATCCCAGAGCTGATTCGATTGCTGGATATTGAAGGCGCCCTGGTTTCAATAGATGCGATGGGATGTCAAACGGCGATAGCAGAGCAAGTGATAGAAGGGAATGGGGATTATCTGTTGGCGCTTAAGGCAAACCAGGGAACGTTATACAACGCAGTAGAAGCGTTATTTGCCGGGCAACGAAGTCGCCCTCTTGATGGGATTGTCATAGAAAAGAACCGAGGCCGAATAGAAGCAAGAAGTTACCATGTAAAAGACGCCAGCGAACTGAAGGGAAACTTTAGTAAATGGGTTGGTCTGCAAACCGTTGGAATGAATCTAAGTTACCGGGAAGTAAAAGGAAAAAATCGGAACTCACTTATCGTTACTACATCAGTTCAGCCAAGCTGAACGAAGTACAGTTAGCCGAGGCGGTTAGAGCTCATTGGGCCGTTGAAAATAGCCTACATTGGGTGCTGGATGTCAGCATGAAAGAAGATGCTTGCCAGATTTATCAGAATCACGCGGCGGAAAACTGGTCAATACTACGGCAATGGTCTTTAAATATGCTAAGAGCAGAGCCATCGAAAGGCAGCATCCCCGCAAAACAAAAACGTGCCTGGATGAAAACGGATTATTTGGAAGATGTCTTAAAAGCTGGTTTCAGCAGCAGAGTGTTTGAAAATTAAACACTCATGCGGGAGCCCTGGGGTCAACTGCTTTTCAAGAGTAAGCAACTTCTGATAAGATACTTTATGTCAGCAGGGTCTGTTTTCGGGTTGCGGTTTTGCTGAATGCGGGGCGTAGTTTCCTAAATCGATAATTTAACCAGATAGGAGTACAGACATATGAAAATCGTAAAAAGGATATTATTAGTATTGTTAAGTTTATTTTTTACAGTTGAGTATTCAAATGCTCAAACTGACAACTTAACTTTGAAAATTGAGAATGTTTTAAAGGCAAAAAATGCCAGAATAGGAGTAGCAATATTCAACAGCAATGAGAAGGATACTTTGAAGATTAATAACGACTTCCATTTCCCGATGCAAAGCGTTATGAAATTTCCGATTGCTTTAGCCGTTTTGTCTGAGATAGATAAAGGGAATCTTTCTTTTGAACAAAAAATAGAGATTACCCCTCAAGACCTTTTGCCTAAAATGTGGAGTCCGATTAAAGAGGAATTCCCTAATGGAACAACTTTGACGATTGAACAAATACTAAATTATACAGTATCAGAGAGCGACAATATTGGTTGTGATATTTTGCTAAAATTAATCGGAGGAACTGATTCTGTTCAAAAATTCTTGAATGCTAATCATTTCACTGATATTTCAATCAAAGCAAACGAAGAACAAATGCACAAGGATTGGAATACCCAATATCAAAATTGGGCAACCCCAACAGCGATGAACAAACTGTTAATAGATACTTATAATAATAAGAACCAATTACTTTCTAAAAAAAGTTATGATTTTATTTGGAAAATTATGAGAGAAACAACAACAGGAAGTAACCGATTAAAAGGACAATTACCAAAGAATACAATTGTTGCTCATAAAACAGGGACTTCCGGAATAAATAATGGAATTGCAGCAGCCACTAATGATGTTGGGGTAATTACTTTACCGAATGGACAATTAATTTTTATAAGCGTATTTGTTGCAGAGTCCAAAGAAACTTCGGAAATTAATGAAAAGATTATTTCAGACATTGCAAAAATAACGTGGAATTACTATTTGAATAAATAAAAAACTACCGCTAACACTGGCTCATAGGCAATGGCGGGTTGAAGTGCAATTTGCAAAGTCGGTAGCCCGCCCGAGCGTTTTCTCGGTTTGACAGGAAAGGCTCACGCAAACCGCCACTGCCCATAGCCCAAACCGTTATGGTTCAGTGGTGAAAAAAAGCGAAATTGGTATTTTTAGGTTTTTGTGCTTTTGGA
GACTGGAAAAGAAAATTGAGTGTTTGTTGCGGAACTGAAAAATGGAAAGTTTCTTGAAAGGCTACCGTTGAGAGGAACAAAAAAGAAAAAAAGAAAACTTGGACGCAGATTGTTCCGGTCTATGCTCCTTCGGAGCGAATGGTTCTTCAGCGGAATCTGCATGGATCTTAAGTCTGCAAGGGTTTGCGTTGTGGTTGCGACCAACGAGAAACGGCTCCGAGAGTTTGGCAGTTCAAGCTCGCAACGTAAAGAAAAATTGAAGTTTTTTGTTTTTTAAGTGAAAATGAATACTTTTGGGAAACGGTAGCAAATTCTGAAAAATAAAGAATTGACCGCAACAAAATAGAGAAAAGAGAAGCCCCGGTTGAAATTTCTTTTTAGGAAACTACGCTCCGTACATCAAAACAAACCCCGGAAAAGTCCCCAGCCTCCTGACATAAAACACCTTATCAGAAGCTGCATCCCCGCTTAAAACAGTTGACCCCGTGCGCGAAATCCCCTTAAATTTAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCCCGGCTCCATGCACCGCGACGCAGCGCCGGCAGGCAGAGCAAGTAGAGGGCAGCGCCTGCAATCCATGCCCACCCGTTCCACGTTGTTATAGAAGCCGCATAGATCGCCGTGAAGAGGAGGGGTCCGACGATCGAGGTCAGGCTGGTGAGCGCCGCCAGTGAGCCTTGCAGCTGCCCCTGACGTTCCTCATCCACCTGCCTGGACAACATTGCTTGCAGCGCCGGCATTCCGATGCCACCCGAAGCAAGCAGGACCATGATCGGGAACGCCATCCATCCCCGTGTCGCGAAGGCAAGCAGGATGTAGCCTGTGCCGTCGGCAATCATTCCGAGCATGAGTGCCCGCCTTTCGCCGAGCCGGGCGGCTACAGGGCCGGTGATCATTGCCTGGGCGAGTGAATGCAGAATGCCAAATGCGGCAAGCGAAATGCCGATCGTGGTCGCGTCCCAGTGAAAGCGATCCTCGCCGAAAATGACCCAAAGCGCGGCCGGCACCTGTCCGACAAGTTGCATGATGAAGAAGACCCCCATCAGGGCGGCGACGACGGTCATGCCCCGGGCCCACCGGAACGAAGCGAGCGGGTTGAGAGCCTCCCGGCGTAACGGCCGGCGTTCGCCTTTGTGCGACTCCGGCAAAAGGAAACAGCCCGTCAGGAAATTGAGGCCGTTCAAGGCTGCCGCGGCGAAGAACGGAGCGTGGGGGGAGAAACCGCCCATCAGCCCACCGAGCACAGGTCCCGCGACCATCCCGAACCCGAAACAGGCGCTCATGAAGCCGAAGTGCCGCGCGCGCTCATCGCCATCAGTGATATCGGCAATATAAGCGCCGGCTACCGCCCCAGTCGCCCCGGTGATGCCGGCCACGATCCGCCCGATATAGAGAACCCAAAGGAAAGGCGCCGTCGCCATGATGGCGTAGTCGACAGCAGCGCCGGCCAGCGAGACGAGCAAGACCGGCCGCCGCCCGAAACGATCCGACAGCGCGCCCAGCACAGGTGCGCAGGCAAATTGCATCAACGCATACAGCGCCAGCAGAATGCCATAGTGGGCGGTGACGTCGTTAAATTTAAGGGGAAAACGTGTCCCGGGTCAACTGCTTTTCAAGAGTAAGCAACTTCTGATAAGATACTTTATGTCAGTCGGGTCTGTTTTCGGGTTGCGGTTTTGCTGAATGCGGGGCGTAGTTTCCTAAATCGAAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCG
ACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCGGATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAATTATGTGCTTAGTGCATCTAACGTTTGACATGAGGGGCGGCCAAGGGCGCCAGCCCTTGGACGTCCCCCTCGATGGAAGGGTTAGGCATCACTGCGTGTTCGCTCGAATGCCTGGCGTGTTTGAACCATGTACACGGCTGGACCATCTGGGGTGGTTACGGTACCTTGCCTCTCAAACCCCGCTTTCTCGTAGCATCGGATCGCTCGCAAGTTGCTCGGCGACGGGTCCGTTTGGATCTTGGTGACCTCGGGATCATTGAACAGCAACTCAACCAGAGCTCGAACCAGCTTGGTTCCCAAGCCTTTGCCCAGTTGTGATGCATTCGCCAGTAACTGGTCTATTCCGCGTACTCCTGGATCGGTTTCTTCTTCCCACCATCCGTCCCCGCTTCCAAGAGCAACGTACGACTGGGCATACCCAATCGGCTCTCCATTCAGCATTGCAATGTATGGAGTGACGGACTCTTGCGCTAAAACGCTTGGCAAGTACTGTTCCTGTACGTCAGCAAGTGTCGGGCGTGCTTCTTCTCCGCCCCACCACTCGACGATATGAGATCGATTTAGCCACTCATAGAGCATCGCAAGGTCATGCTCAGTCATGAGGCGCAGTGTGACGGAATCGTTGCTGTTGGTCACGATGTTGTTCAATGGAGGTTCCTTCAGTTTTCTGATGAAGCGCGGAGGTGGCTCAACCTGCGAAAAGAAACGAGTTGCTACGTAAGTCCGAGAACATGCTTTCCATGGTCTCTGAGCTCGCCTTGATGCCCGAGGCATAGACTGTACAAAAAAACAGTCATAACAAGCCATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTACTTGCATTACAGCTTACCAACCGAACAGGCTTATGTCAACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCCGGCAACCTTGGGCA
GCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGACCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATTTCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGCTGTCGCGTGCACGGGCATGGTGGCTGAAGGACCAGGCCGAGGGCCGCAGCGGCGTTGCGCTTCCCGACGCCCTTGAGCGGAAGTATCCGCGCGCCGGGCATTCCTGGCCGTGGTTCTGGGTTTTTGCGCAGCACACGCATTCGACCGATCCACGGAGCGGTGTCGTGCGTCGCCATCACATGTATGACCAGACCTTTCAGCGCGCCTTCAAACGTGCCGTAGAACAAGCAGGCATCACGAAGCCCGCCACACCGCACACCCTCCGCCACTCGTTCGCGACGGCCTTGCTCCGCAGCGGTTACGACATTCGAACCGTGCAGGATCTGCTCGGCCATTCCGACGTCTCTACGACGATGATTTACACGCATGTGCTGAAAGTTGGCGGTGCCGGAGTGCGCTCACCGCTTGATGCGCTGCCGCCCCTCACTAGTGAGAGGTAGGGCAGCGCAAGTCAATCCTGGCGGATTCACTACCCCTGCGCGAAGGCCATCGGTGCCGCATCGAACGGCCGGTTGCGGAAAGTCCTCCCTGCGTCCGCTGATGGCCGGCAGCAGCCCGTCGTTGCCTGATGGATCCAACCCCTCCGCTGCTATAGTGCAGTCGGCTTCTGACGTTCAGTGCAGCCGTCTTCTGAAAACGACACCATGTGCAAACGATGTCAGAATAGAGTTAAATTTCCTATTGATTGACATATTCCGTCAAAGGTAATAGATTTCATCCTGACACTTTTGCCTTTGGAGGCATCTTGCAAGGTCAACGCATCGGCTATGTCCGCGTCAGCAGCTTCGACCAGAACCCGGAACGGCAATTGGAGGGTGTTCAGGTGGCGCGGGTGTTCACCGACAAGGCTTCTGGCAAGGACACCCAGCGTCCCGAGCTGGAAAGGCTGCTGGCCTTCGTCCGCGAGGGCGACACCGTGGTGGTGCATAGCATGGACAGGCTGGCACGCAACCTTGATGACCTGCGCCGCATCGTCCAAGGGCTGACACAACGGGGCGTGCGCATGGAGTTCGTCAAAGAAGGGCTGAAGTTCACCGGCGAGGACTCACCGATGGCCAATCTGATGCTGTCGGTCATGGGAGCCTTCGCTGAGTTCGAGCGCGCCCTGATCCGCGAACGTCAGCGCGAGGGAATCGTGCTGGCCAAGCAGCGCGGTGCCTACCGGGGACGAAAGAAATCGCTGAACAGCGAACAAATTGCCGAGTTGAAACGGCGAGTTGCGGCAGGCGACCAAAAAACCTTGGTGGCCCGTGACTTCGGCATCAGCCGCGAAACCTTGTACCAGTACCTGCGGGAAGACTGACCATGCCACGCCGCTCAATCCTGTCCGCCACCGAGCGCGAAAGCCTGCTGGCACTGCCAGATGCCAAAGACGAACTGATACGGCACTACACGTTCAACGAAACCGACCTGTCGGTGATCCGTCAGCGTCGCGGCGCCGCGAATCGATTGGGCTTCGCTGTGCAGCTTTGCTACTTGCGATTCCCTGGCACCTTTTTGGGCGTCGATGAGCCTCCGTTTCCGCCCCTGTTGCGCATGGTGGCCGCGCAACTCAAGATGCCAGTGGAAAGTTGGAGCGAGTACGGCCAGCGCGAACA
GACACGGCGGGAGCACTTGGTCGAGCTGCAAACGGTTTTTGGGTTCAAGCCCTTCACCATGAGCCACTATCGGCAAGCCGTGCATACATTGACCGAGCTGGCCTTGCAGACCGACAAAGGCATCGTGCTGGCGAGCGCACTTGTCGAGAATCTGCGGCGGCAGAGCATTATCCTGCCCGCCATGAATGCCATCGAGCGCGCAAGCGCCGAGGCCATCACCCGTGCCAACCGACGCATTTACGCGGCGCTGACCGATTCTTTGTTATCACCCCACCGTCAGCGCCTGGACGAACTTCTCAAGCGCAAGGACGGCAGTAAAGTGACGTGGCTGGCATGGCTGCGCCAGTCGCCTGCCAAACCGAACTCTCGCCACATGCTCGAACATATTGAGCGCCTGAAATCCTGGCAAGCACTTGATCTGCCCGCAGGCATCGAGCGGCAGGTTCACCAGAACCGCCTGCTCAAAATCGCTCGTGAAGGTGGCCAGATGACGCCTGCTGATCTGGCAAAGTTCGAGGTGCAACGACGCTATGCCACGCTGGTAGCGCTGGCCATCGAAGGCATGGCCACCGTCACCGATGAAATCATCGACCTTCACGATCGCATCATCGGCAAGCTGTTCAACGCGGCCAAGAACAAGCATCAGCAGCAGTTCCAGGCTTCCGGCAAGGCGATCAACGACAAGGTGCGGATGTATGGGCGCATCGGTCAAGCGTTGATTGAGGCCAAGCAAAGCGGCAGCGATCCGTTCGCCGCCATCGAGGCCGTTATGCCCTGGGACACCTTCGCCGCCAGCGTCACCGAAGCGCAAACATTGGCGCGGCCTGCCGACTTTGATTTCCTGCACCACATCGGTGAAAGCTATGCCACGCTACGCCGCTACGCGCCGCAGTTCCTGGGCGTGCTCAAATTGCGGGCTGCGCCCGCCGCCAAGGGTGTGCTCGATGCCATCGACATGCTGCGCGGCATGAACAGCGACAGCGCGCGCAAGGTGCCCGCCGATGCGCCAACCGCATTCATCAAGCCGCGCTGGGCAAAGCTGGTTCTGACCGACGACGGCATCGACCGGCGTTACTACGAGTTATGCGCCCTGTCGGAGCTGAAGAACGCGCTGCGCTCCGGTGATGTCTGGGTGCAGGGTTCTCGCCAGTTCAAGGACTTCGACGAATACCTGGTGCCGGTCGAGAAGTTCGCCACTTTGAAGCTGGCCAGCGAATTGCCGCTGGCAGTGGCCACCGACTGCGACCAATACCTGCATGACCGGTTGGAATTGTTGGAGGCGCAACTCGCCACAGTCAACCGCATGGCTGCGGCCAACGACTTACCGGATGCCATCATCACCACCGCGTCAGGCCTGAAGATCACGCCGCTGGACGCGGCAGTACCAGACGCCGCGCAAGCCATGATCGACCAGACAGCTATGCTGCTGCCGCACCTCAAAATCACCGAGTTGCTGATGGAGGTCGATGAATGGACGGGCTTCACCCGCCACTTCACACACCTGAAGACCAGCGACACGGCCAAGGACAAAACCTTGCTGTTGACGACGATCCTGGCCGACGCGATCAACCTGGGTCTGACCAAAATGGCCGAGTCCTGCCCTGGCACCACCTACGCCAAGCTGTCTTGGCTGCAAGCCTGGCACATCCGCGATGAAACCTATTCGACGGCGCTGGCCGAGCTGGTGAATGCGCAGTTTCGGCAACCCTTCGCCGGCAACTGGGGTGACGGCACCACGTCATCGTCGGACGGCCAGAACTTCAGAACCGGCAGCAAAGCAGAAAGCACTGGTCATATCAACCCGAAGTATGGAAGCAGTCCAGGACGGACTTTCTACACCCATATCTCCGACCAGTACGCGCCCTTCAGTGCCAAGGTGGTCAACGTGGGCATTCGTGATTCAACTTACGTGCTTGATGGCCTGCTGTACCACGAGTCGGACTTGCGCATCGAGGAACACTACACCGACACGGCAGGCTTCACCGATCACGTGTTTGGCT
TGATGCATTTGCTGGGATTTCGCTTCGCGCCGCGTATCCGTGACTTGGGCGAAACCAAGCTATTCATCCCCAAGGGCGATGCCGCCTATGACGCGCTCAAGCCGATGATTAGCAGCGACAGGCTGAACATCAAGCAAATACGCGCCCATTGGGATGAAATTCTGCGGCTGGCCACCTCCATCAAGCAAGGCACGGTAACGGCTTCGCTGATGCTGCGCAAACTCGGCAGCTACCCGCGCCAGAACGGCTTGGCCGTGGCGTTGCGCGAGCTGGGGCGCATCGAGCGCACGCTGTTCATTTTGGATTGGCTGCAAAGCGTGGAGCTGCGCCGCCGCGTCCATGCGGGGCTGAATAAGGGCGAGGCGCGCAACGCGCTGGCCAGGGCGGTCTTCTTCTACCGATTGGGTGAAATCCGCGACCGCAGTTTTGAGCAGCAGCGCTACCGGGCCAGCGGCCTCAATCTGGTGACGGCGGCCATCGTGTTGTGGAACACGGTATATCTGGAGCGTGCCACCAGTGCTTTGCGTGGCAACGGCACGGCGCTGGACGACACATTGTTGCAATATCTGTCGCCGCTGGGGTGGGAGCACATCAACCTGACCGGCGATTACCTATGGCGCAGCAGCGCCAAGGTCGGTGCGGGGAAGTTTAGGCCATTGCGACCGCTGCCACCGGCTTAGCGTGCTTTATTTAATGAGATGGTCACTCCCTCCTTCCCGGTACTATGCTGAGGATAGGCTTTCATTCGGAGAACTATCATGGAAAACATTGCGCTCATTGGTATCGATCTGGGTAAAAACTCTTTCCATATTCATTGCCAAGATCGTCGCGGCAAGGCTGTTTACCGTAAAAAATTTACACGGCCAAAGTTAATCGAATTTTTGGCGACATGCCCCGCTACAACCATCGCAATGGAAGCCTGTGGTGGCTCTCACTTTATGGCACGCAAGTTGGAAGAGTTGGGGCATTTTCCTAAGCTGATATCACCACAATTTGTCCGTCCATTCGTTAAAAGTAACAAAAACGACTTTGTCGACGCCGAAGCTATTTGTGAAGCTGCATCGCGTCCGTCTATGCGTTTTGTACAGCCCAGAACTGAATCTCAGCAGGCAATGCGTGCGCTGCATCGTGTCCGTGAATCCCTGGTTCAGGATAAGGTAAAAACAACCAATCAGATGCATGCTTTTCTGCTGGAATTTGGCATCAGCGTTCCACGAGGAGCTGCCGTTATTAGCCGACTGAGTACCCTTCTTGAGGACAATAGTTTGCCTCTATACCTCAGCCAGTTATTGCTGAAATTACAACAGCATTATCACTATCTTGTTGAGCAGATTAAAGATTTGGAATCCCAGTTGAAACGAAAGTTGGACGAAGATGAGATTGGACAGCGCTTGCTGAGCATTCCCTGCGTCGGAACACTGACAGCGAGTACTATTTCAACTGAGATTGGCGACGGGAAGCAGTACGCCAGCAGTCGTGACTTTGCGGCGGCAACAGGGCTAGTGCCTCGACAGTACAGCACGGGAGGTCGGACGACATTGCTGGGAATTAGTAAGCGAGGTAACAAAAAGATCCGAACTTTGTTGGTTCAATGTGCCAGGGTATTCATACAAAAACTGGAACACCAGTCTGGCAAATTGGCCGATTGGGTCAGGGATCTACTGTGTAGGAAAAGCAACTTTGTCGTCACTTGTGCTCTGGCAAACAAGCTGGCCAGAATAGCCTGGGCCCTAACGGCACGACAGCAAACTTATGTAGCATAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP029123|2205:49895|5913_6618_-|WP_001067855.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP029123|2205:49895|12483_13209_+|WP_000988731.1|DBSCAN-SWA MMTLTTVSKKTSNNSALVFWRVGTKRKGILDVHIDFDHEEADLLAELVAIRYLALDKQVFCREPGAGAGYKLVVSKGAIKKLALGKSTKAFAFKFAACLTGRLKGATIEVSQSMEFMDEPGEGNIELLDVDKQAYTQTHDEISTPAIGPVLVTQHAIDQYQARITSGDPKKPWASLVGRLQHPELQVQPFDEKVARHKARKYGRVDNVEVWGHRDSKFKYLMVINDDNQKRVLVTVFERNE >NZ_CP029123|2205:49895|42720_43068_-|WP_000679427.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >NZ_CP029123|2205:49895|41887_42727_-|WP_000259031.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >NZ_CP029123|2205:49895|45839_48812_+|WP_001138073.1|transposase|DBSCAN-SWA 
MPRRSILSATERESLLALPDAKDELIRHYTFNETDLSVIRQRRGAANRLGFAVQLCYLRFPGTFLGVDEPPFPPLLRMVAAQLKMPVESWSEYGQREQTRREHLVELQTVFGFKPFTMSHYRQAVHTLTELALQTDKGIVLASALVENLRRQSIILPAMNAIERASAEAITRANRRIYAALTDSLLSPHRQRLDELLKRKDGSKVTWLAWLRQSPAKPNSRHMLEHIERLKSWQALDLPAGIERQVHQNRLLKIAREGGQMTPADLAKFEVQRRYATLVALAIEGMATVTDEIIDLHDRIIGKLFNAAKNKHQQQFQASGKAINDKVRMYGRIGQALIEAKQSGSDPFAAIEAVMPWDTFAASVTEAQTLARPADFDFLHHIGESYATLRRYAPQFLGVLKLRAAPAAKGVLDAIDMLRGMNSDSARKVPADAPTAFIKPRWAKLVLTDDGIDRRYYELCALSELKNALRSGDVWVQGSRQFKDFDEYLVPVEKFATLKLASELPLAVATDCDQYLHDRLELLEAQLATVNRMAAANDLPDAIITTASGLKITPLDAAVPDAAQAMIDQTAMLLPHLKITELLMEVDEWTGFTRHFTHLKTSDTAKDKTLLLTTILADAINLGLTKMAESCPGTTYAKLSWLQAWHIRDETYSTALAELVNAQFRQPFAGNWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFSAKVVNVGIRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFGLMHLLGFRFAPRIRDLGETKLFIPKGDAAYDALKPMISSDRLNIKQIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFYRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERATSALRGNGTALDDTLLQYLSPLGWEHINLTGDYLWRSSAKVGAGKFRPLRPLPPA >NZ_CP029123|2205:49895|24189_24732_-|WP_000587837.1|DBSCAN-SWA MIIWINGPFGAGKTTLAKRLRDRRSKSLIFDPEEIGFVVKETVPMPASGDYQDLPLWRGLTIAAVREIRRNYSQDIIIPMTLVHPDYLTEILDGVRRIDDQLLHIFLTLNEDLLRHRIANQTMHPDPNRNAEIREWRLANVARCLAARERLPCTTRVLDSGAHTSDELAAMVLDGIDGRT >NZ_CP029123|2205:49895|24744_25605_-|WP_000557454.1|DBSCAN-SWA MHTRKAITEALQKLGVQTGDLLMVHASLKAIGPVEGGAETVVAALRSAVGPTGTVMGYASWDRSPYEETLNGARLDDEARRTWLPFDPATAGTYRGFGLLNQFLVQAPGARRSAHPDASMVAVGPLAETLTEPHELGHALGEGSPVERFVRLGGKALLLGAPLNSVTALHYAEAVADIPNKRWVTYEMPMLGRDGEVAWKTASDYDSNGILDCFAIEGKPDAVETIANAYVKLGRHREGVVGFAQCYLFDAQDIVTFGVTYLEKHFGTTPIVPPHEAVERSCEPSG >NZ_CP029123|2205:49895|36358_37198_-|WP_000259031.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA 
>NZ_CP029123|2205:49895|2205_2910_-|WP_009364894.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEIEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP029123|2205:49895|23265_23493_-|WP_000248278.1|DBSCAN-SWA MVNSYSLSDDSGVMAAAAITHFLFGQAVFSYLNGWSVLIGPGTGLDSTGCKYARDLMGLVAFTAFIVTFLFRGYS >NZ_CP029123|2205:49895|11758_11875_+|WP_000338626.1|DBSCAN-SWA MDAFTLGMLGLLIFFTVVTGGSLYLYHEKQKEKKHHNA >NZ_CP029123|2205:49895|43960_44974_+|WP_002075255.1|integrase|DBSCAN-SWA MKTATAPLPPLRSVKVLDQLRERIRYLHYSLPTEQAYVNWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDLLGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER >NZ_CP029123|2205:49895|13341_17595_+|WP_001257735.1|DBSCAN-SWA 
MRSRPSLLLMRNLRSLAVVVLATLPCVAFAQWRVVAVSTEVDKMRFNTIIDAHSFMKNYRSGEEQGKGTPVGDALYPVASYGDGRYVSRICFKYLGATGDYDPTTCTGDPATVYWRSTYVLPGEMDKTPFLDRDLGLPTTTMCVGNPIHLGTGNKFQAELDYQSGGSDPFTFTRYYNSHLPDEELGGWRHTYSRSVEVNASKYGENMVVLHRPEGQQLAFYNSSSVWVPTWKTDDTLTKDATGWRYTQSDGVVEAYDETGRLTGIEKPNGNHITLSYLNGELSSITDGFGRTIQFQYQDGRMVSVTDPAGGSIQYQYNSAGKLAEVIYQDNTSRSYLYDDPNAPGLLSGLVDENGNRFATWGYDTQGRAVLSEHAGGAEKTQVSYNADGSVSVTNALGHVQRYTYSRHNGMLKPDVVEGAPCTGFVGGKETYVYDSKGLVSSITDRAGQKRTFTHNDRGLETTQIDQDGGKVTTDWLPSKSLPAKITEPTRITELTYDTHLRVISRKVTDRSSGASRTWTYTYAPVGTGKPSLLASVDGPRTDVSDVTTFDYDDQGNLIRTTNALGQVTQFGDYDANGRAGTIQGVNGVTQTLTYDARGRLVSSTGPEGTTVYNYDAVGLLSSLTKPNGATVSYEYDAAHRLVAETDAQGNRRELELNDLGNPVEERLLDALGQTRWIERRIFNEIGWLSSVSDAYSNQSSFSYDVVANLIQETSPSGNTHSYKYDGFHHRTQTTDPLGKVTQVLYKDTGDVYRVSDPRSRLTYYSYNGFGEVTQVRSPDTGTTDITYDEAGNVATRKTAKGQTTSYSYDALNRIIETSSDVAGESPILYGYDEATSPYGIGRLTSVDDGNGVRRFGYTPEGWLAYETWETHGQSLTTQYQYDGAGLVTKITYPSGREVSYTRDSAGDVIEVTTTQAGTTTNLASQIERAPFGPVTSMVRGNGISESRTLDLDYRVTGIDAARVHSLVYRYTPDSLISAIDDNLSSSVNQSLGYDAVGRITSAEGIYGVLGYGYDATGNRTSITTDGLSQSYTINYMNNWLVKAGQTSRSYDANGNLTKQGADTFTYDSQNRLVAATVAGVTVSYTYNHLDQRVTKTLNGHTRLLVYDLAGNLIEELDAATGDVLAEYIWLDGTPLGFVQSGQTYQVHVDHLGTPKALTDVSGQVVWKASYSPFGKASIIIQGPTFNLRFPGQYYDAETGFHYNWRRYYDPATGRYITSDPLGLIDGVNTYGYVHGNPMSNTDPTGEFAFVGAGIGAGLELLSQLIENNGSWKCVSWSKVGIAGAIGAIGGGWASGVFRHASSGKSWFKLSQKWSNVSPRVRKVQGVPRGNELHHWAIQRNGKFGKYVPDSIKNHPWNLKSIPRDIHQNIHGNGPTPYSAFGRWWHGTPEWAKVAQASPVSGGLADSINDEGCGCAN >NZ_CP029123|2205:49895|9835_10270_+|WP_000429836.1|DBSCAN-SWA MENNLENLTIGVFAKAAGVNVETIRFYQRKGLLREPDKPYGSIRRYGEADVVRVKFVKSAQRLGFSLDEIAELLRLDDGTHCEEASSLAEHKLKDVREKMADLARMETVLSELVCACHARKGNVSCPLIASLQGEAGLARSAMP >NZ_CP029123|2205:49895|13183_13387_-|WP_032410269.1|DBSCAN-SWA MTAGFASVKGKVSISFTAPVFMLELFSGGRVVINDCVIALLGYEYVIWRGCVFRTNEKQVTHFAQTL >NZ_CP029123|2205:49895|29760_30588_-|WP_017781026.1|transposase|DBSCAN-SWA 
MSGTFKGLTQAQIDRRLKEGRGQGQGRDYKPFIYTRDVSSLGRSHRLPGSKTRRLHHLLSDLELAIFLTLDRSPQVTDIREQFPMRAEDTVRIAEELGLAHGRYKGIPQVLTSDFLVDFDDPQRPSVAIQAKYSADLQKPEVIERLELERRYWQEKGIPWFIITEREVSREAFANIQWLYPAHAEDDIAQDDLAHYQQLFLLEFQRHPDRKLTAIAQEMDTSGQLEAGQALYWLRQLLARHWFLFDLDTPYRVLKPKDLAANPHQIHQELISVVR >NZ_CP029123|2205:49895|17566_18007_+|WP_001326394.1|DBSCAN-SWA MMRDAVVQIEFPALLVSSKKRSLFVVASESEFGKCTIQSLRNGYFELMDIYDSEGRHYKIDEVASYKPLSPFWYWPVEIVMYGSRLFKANFNAVLISNLDCKELKSELCDLAKKYRSNLDSGVGIEKIMEEMESARTIKELIKVFG >NZ_CP029123|2205:49895|40671_41724_-|WP_088498802.1|DBSCAN-SWA MLLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMGVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRRGAWSRATST >NZ_CP029123|2205:49895|35730_36231_-|WP_000376623.1|DBSCAN-SWA MDSEEPPNVRVACSGDIDEVVRLMHDAAAWMSAKGTPAWDVARIDRTFAETFVLRSELLVASCSDGIVGCCTLSAEDPEFWPDALKGEAAYLHKLAVRRTHAGRGVSSALIEACRHAARTQGCAKLRLDCHPNLRGLYERLGFTHVDTFNPGWDPTFIAERLELEI >NZ_CP029123|2205:49895|19573_19993_+|WP_072199448.1|DBSCAN-SWA MDEYPIIDLSHLLPAAQGLARLPADERIQRLRADRWIGYPRAVEALNRLEALYAWPNKQRMPNLLLVGPTNNGKSMIVEKFRRTHPASSDADQEHIPVLVVQMPSEPSVIRFYVALLAAMGAPLRPRPRLLWEENKVSS >NZ_CP029123|2205:49895|43236_43791_-|WP_032488579.1|DBSCAN-SWA MTNSNDSVTLRLMTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPYIAMLNGEPIGYAQSYVALGSGDGWWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNLRAIRCYEKAGFERQGTVTTPDGPAVYMVQTRQAFERTRSDA >NZ_CP029123|2205:49895|32900_34424_+|WP_001324342.1|transposase|DBSCAN-SWA 
MINVAILSAIRRWHFRDGASIREIARRSGLSRNTVRKYLQSKVVEPQYPARDSVGKLSPFEPKLRQWLSTEHKKTKKLRRNLRSMYRDLVALGFTGSYDRVCAFARQWKDSEQFKAQTSGKGCFIPLRFACGEAFQFDWSEDFARIAGKQVKLQIAQFKLAHSRAFVLRAYYQQKHEMLFDAHWHAFQIFGGIPKRGIYDNMKTAVDSVGRGKERRVNQRFTAMVSHYLFDAQFCNPASGWEKGQIEKNVQDSRQRLWQGAPDFQSLADLNVWLEHRCKALWSELRHPELDQTVQEAFADEQGELMALPNAFDAFVEQTKRVTSTCLVHHEGNRYSVPASYANRAISLRIYADKLVMAAEGQHIAEHPRLFGSGHARRGHTQYDWHHYLSVLQKKPGALRNGAPFAELPPAFKKLQSILLQRPGGDRDMVEILALVLHHDEGAVLSAVELALECGKPSKEHVLNLLGRLTEEPPPKPIPIPKGLRLTLEPQANVNRYDSLRRAHDAA >NZ_CP029123|2205:49895|25737_26882_-|WP_085940656.1|transposase|DBSCAN-SWA MEYFMARRPRRNHSNDFKAKVALAAIKAEKTLAELSAEFDVHQNQIIDWKNQLISASSQAFDQSKAPTEPPIDLKKLHAKIGEQALEIGFFRRCVEETGPLQPQKLIDDSLQISVSKQAKLLKVSRGCYYYRPKPVSASDLKLMRCIDELHMQYPFAGSRMMRDLLNRQGHHIGRRHTRTLMKKMGIQALYCKPNLSQANQAHRKYPYLLKGLAIQRSNQVWSTDITYIPMAKGFVYLCAVIDWHSRKVLAHRVSISMEVDFCISALNEAIEKYGRPEIFNTDQGSQFTSDAFIDVLKSNGIQISMDGKGRWVDNVMVERLWRSVKYEEVYLKAYSSVTDAKKQLSAYFEFYNLKRPHSSLDKMTPNEFYYDQLPQQNKVA >NZ_CP029123|2205:49895|27686_29747_-|WP_108711104.1|integrase,transposase|DBSCAN-SWA MGELRKRLLWSGTEQAVWIDIDLDTALPESISVAELERLIIEGELESIGDPFEETVLREVEAGSPDQLKRDEAWAMLADYMHDPQLFVRRPRGLIVGSIMQRHGVTKQTVYRLLRRYWQRGMCRNALLPDYVNSGARGKRRKPNRAKLGRPRVVMAGKGRNVTPDIERIFRRVIEERLLKEKHPSIPDAYASGLNLLRAVQPELPTSELPTLGQFRYFYGREYHFTDTLPRRMSAVDFAKDFRPLNSTSTTETLGPGYRYQIDATIADIYLVSEHDRSLIVGRPVVYMVLDVFSRMVVGMYVGFEGPSWVSAMVALANTVADKVEYCRQYGLEIDASDWPVKGLPDVILADKGELNGTKVEAFSQAFGVRIENAPARRGDAKGIVERYFRTVQERFKPYASGVVEDTTSRKRGGHDYRLDASLTLPEFTKIIIAGILHHNNFHTLSKYDRAAGMPGDLPAIPVMLWNWGLASLTGRLRTAPEELVWINLLSHESATVSELGIRLFGCFYSCPEAIREGWFHRGQGRRPTGVTVAYDPRSADHIYLRPSNSLKDYWVCDLADRSRRFRGMTFWDVWILSREERRSDVNAASKALAERGKLLEQIESIVAQAENASPLKTGISKKDLGTQIRENKQQEKRQERLKTAFKPEKAQRAKPAEVIPLRGEKQEDYAFPDLSDLIFKEEEDD >NZ_CP029123|2205:49895|45279_45837_+|WP_001162012.1|DBSCAN-SWA MQGQRIGYVRVSSFDQNPERQLEGVQVARVFTDKASGKDTQRPELERLLAFVREGDTVVVHSMDRLARNLDDLRRIVQGLTQRGVRMEFVKEGLKFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIVLAKQRGAYRGRKKSLNSEQIAELKRRVAAGDQKTLVARDFGISRETLYQYLRED 
>NZ_CP029123|2205:49895|2900_3749_+|WP_011977797.1|DBSCAN-SWA MGSCAAPSAKGDDKFITTDYLQQCRVHRNTAYQALKDACDDLFARQFSYQSLSEKGNTINHKSRWVSEVAYIDNEAVVRLIFAPAIVPLITRLEEQFTKYEIQQISNLTSAYAVRLYEILIAWRSTGKTPLITMYDFRQKIGVLETEYKRMYDFKKYVLDIALKQVNEHTDIIVKVEQHKTGRSITGFSFSFKQKKSATHSVESKRDPNTLDLFSKITDKQRHLFANKLSELPEMSKYSQGTESYQQFAVRIAAMLQDAEKAGLWLDLSALIGTVHTFQEYG >NZ_CP029123|2205:49895|23516_23708_+|WP_000951934.1|DBSCAN-SWA MAIALMGAGFSATDTSDAVNILYPITMTVQANKAWQASGLKKSFFLPSHKKVRGGEKISYARL >NZ_CP029123|2205:49895|6608_9641_+|WP_050576375.1|transposase|DBSCAN-SWA MGSCAAPSAKGDDKFITTDYLQQCRFTGEPDELQLARYFHLDEADKEFIGKSRGDHNRLGIALQIGCVRFLGTFLTDMNHIPSGVRHFTARQLGIRDITVLAEYGQRENTRREHAALIRQHYQYREFAWPWTFRLTRLLYTRSWISNERPGLLFDLATGWLMQHRIILPGATTLTRLISEVREKATLRLWNKLALIPSAEQRSQLEMLLGPTDCSRLSLLESLKKGPVTISGPAFNEAIERWKTLNDFGLHAENLSTLPAVRLKNLARYAGMTSVFNIARMSPQKRMAVLVAFVLAWETLALDDALDVLDAMLAVIIRDARKIGQKKRLRSLKDLDKSALALASACSYLLKEETPDESIRAEVFSYIPRQKLAEIITLVREIARPSDDNFHEEMVEQYGRVRRFLPHLLNTVKFSSAPAGVTTLNACDYLSREFSSRRQFFDDAPTEIISRSWKRLVINKEKHITRRGYTLCFLSKLQDSLRRRDVYVTGSNRWGDPRARLLQGADWQANRIKVYRSLGHPTDPQEAIKSLGHQLDSRYRQVAARLCENEAVELDVSGPKPRLTISPLASLDEPDSLKRLSKMISDLLPPVDLTELLLEINAHTGFADEFFHASEASARVDDLPVSISAVLMAEACNIGLEPLIRSNVPALTRHRLNWTKANYLRAETITSANARLVDFQATLPLAQIWGGGEVASADGMRFVTPVRTINAGPNRKYFGNNRGITWYNFVSDQYSGFHGIVIPGTLRDSIFVLEGLLEQETGLNPTEIMTDTAGASELVFGLFWLLGYQFSPRLADAGASVFWRMDHDADYGVLNDIARGQSDPRKIVLQWDEMIRTAGSLKLGKVQVSVLVRSLLKSERPSGLTQAIIEVGRINKTLYLLNYIDDEDYRRRILTQLNRGESRHAVARAICHGQKGEIRKRYTDGQEDQLGTLGLVTNAVVLWNTIYMQAALDHLRAQGETLNDEDIARLSPLCHGHINMLGHYSFTLAELVTKGHLRPLKEASEAENVA >NZ_CP029123|2205:49895|34413_35196_+|WP_001163403.1|DBSCAN-SWA MQHEGHVRILKSLKLFGMAHAIEELGNQNSPAFNQALPMLDSLIKAEVAEREVRSVNYQLRVAKFPVYRDLVGFDFSQSLVNEATVKQLHRCDFMEQAQNVVLIGGPGTGKTHLATAIGTQAVMHLNRRVRFFSTVDLVNALEQEKSSGRQGQIANRLLYADLVILDELGYLPFSQTGGALLFHLLSKLYEKTSVILTTNLSFSEWSRVFGDEKMTTALLDRLTHHCHILETGNESYRFKHSSTQNKQEEKQTRKLKIET >NZ_CP029123|2205:49895|48890_49895_+|WP_000427620.1|transposase|DBSCAN-SWA 
MENIALIGIDLGKNSFHIHCQDRRGKAVYRKKFTRPKLIEFLATCPATTIAMEACGGSHFMARKLEELGHFPKLISPQFVRPFVKSNKNDFVDAEAICEAASRPSMRFVQPRTESQQAMRALHRVRESLVQDKVKTTNQMHAFLLEFGISVPRGAAVISRLSTLLEDNSLPLYLSQLLLKLQQHYHYLVEQIKDLESQLKRKLDEDEIGQRLLSIPCVGTLTASTISTEIGDGKQYASSRDFAAATGLVPRQYSTGGRTTLLGISKRGNKKIRTLLVQCARVFIQKLEHQSGKLADWVRDLLCRKSNFVVTCALANKLARIAWALTARQQTYVA >NZ_CP029123|2205:49895|20325_20451_+|WP_014342213.1|DBSCAN-SWA MRNPSLSLARVLANQLARDPVESDPKRSLTTGTNRPILLKK >NZ_CP029123|2205:49895|10348_11353_+|WP_000427623.1|transposase|DBSCAN-SWA MENIALIGIDLGKNSFHIHCQDRRGKAVYRKKFTRPKLIEFLATCPATTIAMEACGGSHFMARKLEELGHSPKLISPQFVRPFVKSNKNDFVDAEAICEAASRPSMRFVQPRTESQQAMRALHRVRESLVQDKVKTTNQMHAFLLEFGISVPRGAAVISRLSTILEDNSLPLYLSQLLLKLQQHYHYLVEQIKDLESQLKRKLDEDEVGQRLLSIPCVGTLTASTISTEIGDGKQYASSRDFAAATGLVPRQYSTGGRTTLLGISKRGNKKIRTLLVQCARVFIQKLEHQSGKLADWVRDLLCRKSNFVVTCALANKLARIAWALTARQQTYVA >NZ_CP029123|2205:49895|11995_12370_+|WP_000868820.1|DBSCAN-SWA MKVSNEDAQATAIYLLRAASRPAFWRDVPFDKKLEAVDSLNSIGRSPSELTEWINKYLTAEQINKLGTSIRQRRRRGYGVGKSITISDKAHRILKRLSEVDGCSLSEVIEKRLARAYKNTWDHK >NZ_CP029123|2205:49895|21962_22112_+|WP_014342212.1|DBSCAN-SWA MLFAPTLADKAILPTAWGGRYFFNRIGRTQPFGSTEPCLYPTKLRKLAR >NZ_CP029123|2205:49895|4968_5745_-|WP_108711103.1|DBSCAN-SWA MIRMILAINNQCFIGKNNTLMYRLKDDMLNFKKMTQNNIVVMGRKTFESLNNRGLPNRLNVVVTSKAETFEDIQTITTHDMKRSETFTKEGHVVYITPDSFINQFLPFHRDSEDEIWVIGGAQVYEAATPFASEIICTFVDDDEVGDVALKPKLFGGFTHLATLKSVDVDEDNDKPYEITQLVRHEDLEHKLRELQAQQHEMEKEQTQNNLSTPLENGGLRQGEAFVIAATTSAALSQIDTESREDSSDSDSSSSSSD >NZ_CP029123|2205:49895|39038_39938_+|WP_063865160.1|DBSCAN-SWA MKIVKRILLVLLSLFFTVEYSNAQTDNLTLKIENVLKAKNARIGVAIFNSNEKDTLKINNDFHFPMQSVMKFPIALAVLSEIDKGNLSFEQKIEITPQDLLPKMWSPIKEEFPNGTTLTIEQILNYTVSESDNIGCDILLKLIGGTDSVQKFLNANHFTDISIKANEEQMHKDWNTQYQNWATPTAMNKLLIDTYNNKNQLLSKKSYDFIWKIMRETTTGSNRLKGQLPKNTIVAHKTGTSGINNGIAAATNDVGVITLPNGQLIFISVFVAESKETSEINEKIISDIAKITWNYYLNK >NZ_CP029123|2205:49895|4389_4911_-|WP_011977798.1|DBSCAN-SWA 
MSVTAIVVMNHENKLTTTQNPLLRVKEVEEMLTKYAKDAVVFIDERQFDLRRALSDGARELFAVVGEQNKPIVGVENILPKSAELLIKHYYKAKTSDVVLFISASQLDRVKTHLDQVVLVTINNPSRELETISKGFTDDLKSRVNRKKLSMTEWSIAQSTEEHKRRYSVKIYS >NZ_CP029123|2205:49895|18378_19083_+|WP_009364894.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEIEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP029123|2205:49895|37191_37539_-|WP_000679427.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >NZ_CP029123|2205:49895|22078_23215_-|WP_000080860.1|DBSCAN-SWA MSQLSQLRSPAAVQAAIDEFVQLGRTKFLARHGYGKSRDFLVRDPKTGTDCDSKAIAGVAFGKQFPEQGPLTADSFSGGEATVVPALTRLGFRIIRIGEDWSEEEVLATVEDYFDMLRAEAAGEPYNKSEHNQALRQLLNGRSKSSVELKHQNISAVLDALGLPYINGYKPRGNSQLLLRKSVHAYVLEHQQTVGALVDALEEVKLPGDKTYRAALVEPPAREVLVRTPASLRQRLPRKFDYAARDEANRKLGRAGEQWVIGYEQQRLTELGHPELFQRLDWVSDTQGDGAGFDILSFEEDAHERFIEVKTTNGGVGSSFLVSHNELEFSKEAGDQFHLYRVFQFRDGPRLFTLPGDLSQHVHLKPTDYRASFRSLVG |
40 | Escherichia_phage(33.33%) | transposase,integrase | attL 19131:19146|attR 47124:47139 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
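The keyword-based flagging described above can be sketched in a few lines of Python. This is only an illustration, not the database's actual implementation: the function names `parse_header` and `phage_keywords` are hypothetical, the header layout is inferred from the protein records in this prophage region (five pipe-separated fields when unannotated, six when an annotation such as `transposase` is present), and the case-insensitive substring match is an assumption about how keywords are compared.

```python
# Keyword list copied from the note on colored proteins.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split a header such as
    '>NZ_CP029123|2205:49895|5913_6618_-|WP_001067855.1|transposase|DBSCAN-SWA'
    into named fields. The layout is inferred from the records in this page."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    # Assumption: the annotation field is present only in six-field headers.
    annotation = fields[4] if len(fields) == 6 else ""
    return {
        "contig": contig,
        "region": region,
        "location": location,
        "protein_id": protein_id,
        "annotation": annotation,
    }


def phage_keywords(annotation):
    """Return the phage-related keywords found in an annotation string
    (case-insensitive substring match, which is an assumed rule)."""
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    record = parse_header(
        ">NZ_CP029123|2205:49895|27686_29747_-|WP_108711104.1|"
        "integrase,transposase|DBSCAN-SWA"
    )
    print(record["protein_id"], phage_keywords(record["annotation"]))
```

A plain substring match is the simplest choice but can over-match (for example, 'lysin' also matches 'lysine'), so a stricter word-boundary match may be preferable in practice.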
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|