Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP019581 | Lactobacillus helveticus strain LH5, complete genome | 3 crisprs | DinG,WYL,cas14k,cas14j,DEDDh,Cas14u_CAS-V,cas3,RT,csa3,cas2,cas5,cas8c,cas7,cas4,cas1 | 0 | 5 | 2 | 0 |
CP019582 | Lactobacillus helveticus strain LH5 plasmid pCBTLH5_1, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP019583 | Lactobacillus helveticus strain LH5 plasmid pCBTLH5_2, complete sequence | 1 crisprs | NA | 0 | 1 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP019581_1 | 962207-962386 | TypeV |
NA
Consensus repeat of CP019581_1
|
2 spacers
spacers of CP019581_1
>1.1|962228|58|CP019581|PILER-CR ATGCTGGATCCACAAACCAGTGTGTTAACCCCTTCACCAATTCCGCCATAAACGGTCC >1.2|962307|59|CP019581|PILER-CR TCTCCTCCGTGACAGGGAGGCGAGATAAACCACTGCTCCAATGGACCAATCAGAGGAGG |
cas14j |
CRISPR arrays and Neighbor proteins around CP019581_1
The CRISPR arrays of CP019581_1 >merge|CP019581|1|962207-962386|PILER-CR ATGTGGGATTCGAACCCACGCATGCTGGATCCACAAACCAGTGTGTTAACCCCTTCACCAATTCCGCCATAAACGGTCCATGTGGGATTTGAACCCACGATCTCCTCCGTGACAGGGAGGCGAGATAAACCACTGCTCCAATGGACCAATCAGAGGAGGATGTGGGATTTGAACCCACGC >CP019581|1|1|962207-962386|PILER-CR ATGTGGGATTCGAACCCACGC ATGCTGGATCCACAAACCAGTGTGTTAACCCCTTCACCAATTCCGCCATAAACGGTCC ATGTGGGATTTGAACCCACGA TCTCCTCCGTGACAGGGAGGCGAGATAAACCACTGCTCCAATGGACCAATCAGAGGAGG ATGTGGGATTTGAACCCACGC
>CP019581.1|AZK91236.1|960592_962059_-|Multidrug-resistance-protein-3 MNKKQITMVTIALMLGNVMSGLDGTIINTAIPTIVAALHGIQFMGWIVAIFLLGMSISIPIWTKIGEKITNKKAFEISLVLFVLGAALQGMAPNIIFFLCSRFIMGIGAGGMGSLPYIIAGYVFKNIKTRTKVLGYLTASWNGAAILGPLVGGWLIDAFSWHWVFYINIPIGLIALIICFIYYKPVTPEKTPVFDIPGAGLLVIGLLLFLMGVQLVGLTANWIVISLIIISLVFIVLFFIRENHAANPIIPVSLFKNKDLDGDFLLFAFTWGAFIAVNTYMPMWAQALLGLSALLGGMTLIPNSIVEIIASQSVAAIQEHMSTFKLALIGIIAMMISSAGLFLADIHTPVQMLTFIGAFSGIGVGFIFVAMQLKVQLDAGLKNMATATSTSYLIRILAQTVMAAVYGVIMNLNLASGVHTHKGITMAMMNKLSDAKSAKLLPQGLVPTMRTIFHAGIKEIMLVSLILLIIAFVLNFYFNFGKKRKKLQ >CP019581.1|AZK91235.1|960072_960585_+|Bacterial-regulatory-protein,-tetR-family MNMKSLHTQQHIEKALFSLLQKKPYAEISIAEITRKAHVSRTSFYRNYSQKNDVLMLFLANQYKKFIVDINEHKLKTLTKQLVAYLTFFKGNPKVMKILLDAGFEGSLLNFQTRYLKKLLSVYHPDLNLPDYAIAYQSGGIYMLLIWWVKQDYATPLEDLINYAEKHIML >CP019581.1|AZK91234.1|958190_959963_-|Oleate-hydratase MHYSNGNYEAFVKAEKPKDVDQKSAYIVGSGLAALASAVFLIRDGQMKGNRIHIFEELSLPGGSMDGIYSKEKESYIIRGGREMEPHFECLWDLFRSIPSTEHEGESILDEFYRLNRKDPSYAKTRVIINRGEALPTDGQLLLTPKAVKEIVDLCLTPEKDLQNKKINEVFTKEFFQSNFWLYWSTMFAFEPWASAMEMRRYLMRFVQHVATLKNLSSLRFTKYNQYESLILPMVKYLKSHGVQFHYDTVVDNIFVNRSNGEKVAKQIILTEKGERKTIDLTENDLVFVTNGSITESTTYGDNFHPASEEHELGASWQLWKNLAAQDSDFGHPDVFCKDIPKANWRMSATITFKNDDIVPFIEAVNKKDPHSGSIVTSGPTTIKDSNWLLGYSISRQPHFKAQKPNELIVWLYGLFSDTKGNYVEKTMPDCNGIELCEEWLYHMGVPEERIPEMAAAATTIPAHMPYITSYFMPRALGDRPKVVPDHSKNLAFIGNFAETPRDTVFTTEYSVRTAMEAVYTLLDIDRGVPEVFASAFDVRMLMNALYYLNDQKKLEDLDLPMGEKLAIKGMLKKVKGTYIEELMKEYKLI >CP019581.1|AZK91233.1|957252_957690_-|Putative-acetyltransferase-YjbC MFSLLAMSLHRNVKFHAIYNEDQFCGITYYAENDKTVYLTYLAINEELRGQGYGSKILTMLEDRFLDKQIVIDIEPVTSKAKNYKQRVSRLKFYKRNGFHRTDQKLKDPDGEFEALTTGKKLDKESFIDTLRQMSFGFYQAKVEK >CP019581.1|AZK91232.1|956330_957188_+|Helix-turn-helix-domain-protein 
MIKNIYGAKFRELRKQQNITLTKAAKGITSKSTLSLWENGKDNLSFNQVLELLKHIHTQPIEFIENIISSDLLSLSEKIHLAYVASDTVTLHRYVIKKRELSKKHPQNNDIFLEYCFTCMFYQDLSSDNIFTKYDKIRLTNILTNISEWNYKNIFYFGNTLELLDPENINRLCSSLITYSINEKLYHQRWYDEVLAAILNSISILVRRNYLLAEKLLDRFDQMKVSDGYACEKMHAQLYRAFITYIKTKDNRRIYEIINACKALNLKELEDGFITGFKQIKQIYG >CP019581.1|AZK91231.1|955816_955945_-|hypothetical-protein MSDIYDAYVDPNYYTLNSPDHNVLRRVVVQKGNVASRYIYAE >CP019581.1|AZK91230.1|954976_955729_-|hypothetical-protein MDGQNGATAGGHTQTWEYANRTNEWFVGTKPKNKWTTQIARVHISSSTSRYTSNTQLPRLSYLNRAGSQQGINYAGADLKRVEAAVSPDYQYFMIATIDRYNTGYFSIYYLDDINTALDNAGVNDVNIQTLTSVKAFIIPSFVDNIGSIQGYDIDNGANYIYVSSQHSPGYEDISRKIVKIPWGSQNPSEWDFVRLDSNSTINSFSGNYQTEFESVQVIDNNNVWLTVAYHDMDTSTNLTVMNRIYKISW >CP019581.1|AZK91229.1|953543_954602_-|S-layer-protein-precursor MISLAAAALLAVAPVVSPAVVHAADTTSTTTTLNSNAENPVITYNGKKYDSNQDITAAIANSSFSRVPLKGSSTFIQDVKNAFSATESSTDNSKVNIVVYTGDLYTNIAGKYPVRVLATNKAGKSTALTFQVIVGNQGANATYAVAKPQVRGNVTLYTIRDGKVIHNSYGSYVLDGGTTVATFGTVEINGISYTRLNGPDSDLFIETKSVDGTYPESATNEDGQAKTVTKTLMHTAIAYNSDGHSTGKKYYAYRQLTLSAVKKNIKGSMYYNVQGTGDYIKVGNIDGTKRTLTRNAYIYATSKRRADRTLLRKGYTITTYGGSYKFKNGKRYYRIEGATSTNKRYVKVVNFK >CP019581.1|AZK91228.1|952019_953204_-|Putative-transposase-DNA-binding-domain-protein MLKGIKLRLYPNRTQQNQLEQMFGNDRFVWNQMLAMMNERYQNNKDLPFLGKFKLNYLLKPLKKEYPFLKNSDSSSLQVVNEFLTQSWKNFFQDKTGQIGKPRFHSRKYLKKSYTGKSIIKTAGKRYLKIPKLGYIKTSKIGVLQDVKIKRYTVVLEPTGKYYLSLQVEISEPEKYSLTGKQVGIDVGVADLAILSNGLKYPSFDSSYFEKKAKVWQRKCARRRHLAKLLVLQDRNKKVLCPRSLESFTNWQKAQKSIARYQAKIANQRRDYLHKLTTYLVKQYDVIAIEDLKTKNLQKNHHLAKSIANASWRMFRQMLEYKCEWYGKKLIAVDPKNTSRICSKCGYNSGAKPLEIREWTCSKCQTKHDRDINAAVNILHKATPTGQGLAMVTS >CP019581.1|AZK91227.1|951149_951965_-|putative-oxidoreductase MFPETRHFSAGVVHQINDLAQTQQVVEDGLEVGYRLVDTAQVYGNEQAVGDAIRHSNIPREDIFVTSKIWVDDYGYDATLKAFDETMKKLQLDYLDLYLIHKPYNDYYGTWRAMERLYKEGRIRAIGVSSFWNERLADLITFNDVKPAVNQIETNVWNQEWKSQKYMEKEGVQPEAWAPFAEGADHIFTNPVLEEIAEKHHKTTAQVMLRWFLQRNYVVIPKSVHKERLAQNFDIFDFELDKTDMEKIKTLDQGRSILEDEMDPEIAESFR 
>CP019581.1|AZK91237.1|962554_963055_-|putative-acetyltransferase MIIKPLISEDEAKETSRLFQKCWQNTYKDILPDVFLDNIPENAWVKRLNESGRHNLIFIDDDNKIRAAVSYGRPRDTRMLGCGELMALYVEPDFQGFNIGKTLLNAAENELKKMGYGKIYLWCIDGDENARQFFEHFGWVNNATEKFVEIAGKEYKYLLYQKNLHD >CP019581.1|AZK91238.1|964054_964216_-|hypothetical-protein MQKHIKVIIMTVVILALMVGGQVAPLAVADQLNLSKNAAIITMTITCRKLRTL >CP019581.1|AZK91239.1|964396_964825_-|hypothetical-protein MINNWVSSSKELQLLVDDYLLTVNYRSVIENDLVNYTQGIESYFRNERLTLRDKINKFIEELPESYRELLSEHVGNTDDWIGKLVSTRVFLTHGDRENMVVSNPYKLVQMTKIFDFMVRIFILQKLGITIDKPKILNKDQNV >CP019581.1|AZK91240.1|965039_965237_+|Cold-shock-protein-2 MQGTVKWFNADKGFGFITGSDGKDAFVHFSSIKTDGFKSLEEGQKVSYDVEQGDRGPQATNVVPQ >CP019581.1|AZK91241.1|965725_966034_-|hypothetical-protein MKNKISDKNMFQNFEVKAYWFLNDNQNSGSYGFLKYNAGQDSVFEISPAFCDKTEQFNSPSPYDICGISEYGEIIRGIGYRVGSSFNHPGLSIEKIQFFDLK >CP019581.1|AZK91242.1|966434_967787_-|23S-rRNA-(uracil-C(5))-methyltransferase-RlmCD MEKNQIIDLEITDLSYEAMGVAHYNGMTVFVTNALPGEVVSAKILKVKKNFAFAKIEKIKKESPDRVKVKLNHGVQSGLASLAHIKYDKQLDFKRNQVVNLLKKAHLENIEVGETLASPEEVGYRNKAQVPVREVNGQLEIGFFRRHSHDLMPLTHFFTTDPEIDRVLVAVRDILRKYKVPAYDEINNKGEVRYLEVRRSKSTGEMMVILVCLHKDFMQLPNVAAEVSQIPGVSGLILNHNPKKTNVILGPKDYLVFGNDQITDQIGDLKFRISPQSFFQINSLQTPRLYNLAIKQADLKPDDVVIDAYSGIGTIGLSVAKHVKAVRGIEVVRDAIKDAKDNAKLNDITNAKYYLGKAEEIMPRWAKSGLKTDVVFVDPPRKGLTPEFINAAVKTGPKKIVYISCNPATLVRDLQLFQEKSYEFNRIDPVDMFPQTPHVESVTVLERTEK >CP019581.1|AZK91243.1|968149_969526_+|Na(+)/H(+)-antiporter-NhaC MKKEKVSFTESIIILIALLAILGISVIKFGLSPEVPVLFTVLLLTFWARFRGFTWKDVQDGIKEGIGVAIIPIFIFILIGALIGLWIKAGIIPSIMVLGFHLISGSFFVPSVFIVCAIVGVAIGSGFTTISTVGIALFGIGASMNANPALVAGAIISGAVFGDKMSPLSDSTNLSSAVTESELFDHIKNMMWSTIPSFVVSLILFWILGNSGHMDPTKIERTSHVLQINFSISWWAVVPIVLMLLCAWRKVPAIPTLFVNIAATVIMIFVQNPHESVQSLNNLIMNGFVAKTSDASVNALLTRGGISSMMATVALIISTLSLGGMLMKFNVVQSAMEPLVKHLRKPGRLITVTILSGICINLFVGEQYLSVILPGRAFKPAFDKIKLSPLALSRVLEDGGSVINYLIPWGVAGSFAAATLGVPVLQFLPFAFFSLLSPVFSIISGFTGIGLKWAKDKK >CP019581.1|AZK91244.1|969562_969877_-|hypothetical-protein 
MRGERSSRPEQVEKQALTLSFTKNAWDKYRQLTGSQKTFIDSELDDLKFNQNRQKSKQVNAELDQALVFEKNDAEIVITDIGYEPYRESQEHKKAQIRMEDMNN >CP019581.1|AZK91245.1|970084_970789_+|Inner-membrane-protein-YbhL MDNFSNPGHREVHDVSEVNGFLSKMYGYMGLSVLVSALAAFLTMTVFRSAVMQMPPAMMWIILFVPIGLSLGINFKATRNPVAGFVMLMILAVIYGFEFALLAGFYTQAQIGTAFVSSAAVFGAMAVFGTFTKKNLNNMGSYLSAALIGLLVAMVVNIFLRNSVASFVFSIIGVVIFTGLTAYDAQKMKAIYNNYGSQVSTNGLAVLGALQLYLDFVNIFLFLLQIFGMGGDRD >CP019581.1|AZK91246.1|970833_971526_-|hypothetical-protein MFKLHKVFDSRHYRRLSSVQKEELKEARDKVHQAQQKEPERLREHLETFNDAVIAIIITIIVLQIQPAFKASQYLEFLGNIVTFIIAFFIIADFWYELHLAFSYFIFKPDKITAICDFCLLATLSLLPVMTKWIMMHDSAFAVTNFGIVYFIAQILKVFVQYFGAKPLMRSSQVMNIMMVKTSVHRIILVFLLTIFLILLSLVVPKVAMVLYILIPFISFFKPNNSRGFR |
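
The spacer headers in the Spacer_info column (for example `>1.1|962228|58|CP019581|PILER-CR`) appear to follow a fixed `|`-separated layout. The sketch below assumes the five fields are spacer ID, start coordinate, spacer length, contig accession, and a comma-separated list of prediction tools. That reading is inferred from the records on this page, not taken from documentation, and `SpacerHeader` and `parse_spacer_header` are hypothetical names used only for illustration.

```python
from typing import List, NamedTuple


class SpacerHeader(NamedTuple):
    spacer_id: str    # e.g. "1.1" (assumed: array index, then spacer index)
    start: int        # assumed: spacer start coordinate on the contig
    length: int       # assumed: spacer length in bp
    contig: str       # contig accession, e.g. "CP019581"
    tools: List[str]  # prediction tools listed in the header


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a header such as '>1.1|962228|58|CP019581|PILER-CR'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 '|'-separated fields: {header!r}")
    spacer_id, start, length, contig, tools = fields
    return SpacerHeader(spacer_id, int(start), int(length), contig, tools.split(","))


first = parse_spacer_header(">1.1|962228|58|CP019581|PILER-CR")
second = parse_spacer_header(">1.2|962307|59|CP019581|PILER-CR")

# Distance between the end of spacer 1.1 and the start of spacer 1.2.
# Under the assumed field meanings this is the length of the repeat between them.
repeat_gap = second.start - (first.start + first.length)
print(first, repeat_gap)
```

For CP019581_1 the computed gap equals the length of the 21-nt repeat `ATGTGGGATTTGAACCCACGA` that sits between the two spacers in the array listing, which is consistent with the assumed field order. Whether the coordinates are 0-based or 1-based is not stated on this page.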
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP019581_2 | 1304353-1304505 | TypeV |
NA
Consensus repeat of CP019581_2
|
1 spacers
spacers of CP019581_2
>2.1|1304402|55|CP019581|CRISPRCasFinder CTGGTTAACGGCATTGTATACGGGTTGCGAGGACCCGTAAAGATTTTTTTAAATA |
CRISPR arrays and Neighbor proteins around CP019581_2
The CRISPR arrays of CP019581_2 >merge|CP019581|2|1304353-1304505|CRISPRCasFinder TTTGAACAAAACACATCATATACCATATTAGTTCGAATATCAAACTATTCTGGTTAACGGCATTGTATACGGGTTGCGAGGACCCGTAAAGATTTTTTTAAATATTTGAACAAAACACATCATATACCATATTAGTTTGAATATCAAACTATT >CP019581|2|1|1304353-1304505|CRISPRCasFinder TTTGAACAAAACACATCATATACCATATTAGTTCGAATATCAAACTATT CTGGTTAACGGCATTGTATACGGGTTGCGAGGACCCGTAAAGATTTTTTTAAATA TTTGAACAAAACACATCATATACCATATTAGTTTGAATATCAAACTATT
>CP019581.1|AZK91587.1|1303797_1304244_-|3-hydroxyacyl-[acyl-carrier-protein]-dehydratase-FabZ MSVLDAAEIMDLIPNRYPILFMDKVDELNPGESIICTKNVTINEEFFQGHFPGNPVMPGVLIIESLAQAASILILKTEKYQGKTAYLGAIDSAKFRKVVRPGDVLKLHVTMEKQRDNMGKVKCEAKVEDKVACSAELTFIVPDPKKKI >CP019581.1|AZK91586.1|1303300_1303768_-|DNA-binding-transcriptional-repressor-MarR MNAISDEIKEDYNFISDSLVDIYDQIMRIEESEIKKSRFKDITAKELHLVHTIGLHDHKTTSEVARILRLSKGTLTANLNNLERKGYIFRIRNQRDRRIINLVLTSKGRLLYRAHYAFHRKLVEQCLKGFDGSDIKKMKQALMNVEDFIGEVSGR >CP019581.1|AZK91585.1|1302320_1303304_-|3-oxoacyl-[acyl-carrier-protein]-synthase-3 MKFEDFKIMATASSAPDHVVTNDELATMMDTSDEWITQRTGIKRRRIATEETTSSMCTDVATQLIAQSDLTAKDIDLIAVATMSPDYLTPSVSAMVQGNIGADHAIAFDIDAACSGFVYGLHLVKQMLIANQQKNAILIGGEILSKLLDWSDRSTAVLFGDGAGGVLIRNTAVDKGSFISEDLRTLGNLGQYLTAGQTGNPSPFATDQQPFSPFFKMNGRRVYSFAVKNVPESINDALKQANLTADEVDCFVLHQANQRIVERIADELAVSMAKFPINIDEYGNTAAASEPILLDQLVKQKIIKRGDVIALSGFGGGLTVGTMIMKY >CP019581.1|AZK91584.1|1302019_1302262_-|Acyl-carrier-protein MTKEEVFNKIKDIIVDQLDVDADKVKENTNFKNDLDLDSLDIFEVIDKIEDLYDIEIDTDEGMETVGELVDYVLKQKTDK >CP019581.1|AZK91583.1|1301092_1302010_-|Malonyl-CoA-acyl-carrier-protein-transacylase MKLGYLFSGQGKQFDEMGQDLYQQEPVYRQTIDQASEALNMDMSDATVFDNPVNTQVAIVAMSTGIERIINQDFGDPVGATGLSLGEYSAIVAAKGLDFSDALQLVRDRSHYMDQAGQDHPGKMAAVLKTTADMVDQAVKVGSKKGEIYAANYNTDSQIVIGGSIEGLQAATDYLHEHGVKRVVPLKMTVASHTPFMQEASDLLAKRIQDVSFNQLAFPVISNTTSQPFEVNTIKQTLIDQLINPTHFYNCIQQLTQLGVDTVVELGPGDTLMKFAKNVVANDHTFHIDSVKTLNDFRSKAKLVK >CP019581.1|AZK91582.1|1300361_1301093_-|3-oxoacyl-[acyl-carrier-protein]-reductase-FabG MSDTKQVALVTGAAKGIGLAIAKRLSSDGMTVVINSHHTLTDEEKQSFSDAGFSFDNLVGDVANEADAEKIVGEVVEKYGQIDVLVNNAGITKDKLLSRLKLADFKAVIDTNLVGAFNMTKFAMKFMQKSRSGAIVNLSSISGLHGNLGQANYSASKAGLVGLTKTAAREGALRNIRCNAVAPGMVATDMTGKMSERRQKEFTDQIPLKRFAEPDEIADAVAFLIHNQYITGQVVTVDGGLTI >CP019581.1|AZK91581.1|1299110_1300340_-|3-oxoacyl-[acyl-carrier-protein]-synthase-2 
MSRVVITGMGIVSPIGNDVESFLKNLFASKVGINPITKFDAEPTGITVAGEVKDFDPLKRVDKKFAKRNDLFCTYALYSAKEAMEMAGLTEDNIDPEDLGVIYGSGIGGLTTIQEQVIKMHDKSPKRVSPLFVPDSIINMAAGNISIAFNALNTSQGIVTACSSGTNAIGNAFEYIKQGKAKAIIAGGTEASVNEIGISGFAALTALSKETDPKKASIPFDKDRNGFVLGEGSGTVILEDYDHAKARHANILAEIVGYGTTSDAYHMTAPDPEGKGAIRAMQQAVDEAGIDETEVDYINAHGTSTHANDSAESKAIKQVFAKNDHVKVSSTKGMTGHALGAAGAIEAVATIGAIQHNQMPVNVGVVNQDEACDIELVNDDNKKAPVNYAISNSFGFGGHNAVIVFKGCD >CP019581.1|AZK91580.1|1298636_1299107_-|Biotin-carboxyl-carrier-protein-of-acetyl-CoA-carboxylase MNEKEIERLLEKFDQSSLKDFELTQDDFKLKLSKREQNDQVVVQQPTGSKTPVSEVPKSTSANSQPAGEPQQSVKDNVAEIKAPFVGVVYFAPSPDKPVYKKQGDHVEKGEVVCVIEAMKMINEVKSDVTGTISNILVEDGSMVEYDQPIFQVTKG >CP019581.1|AZK91579.1|1298215_1298632_-|3-hydroxyacyl-[acyl-carrier-protein]-dehydratase-FabZ MKTAVNDVIPQRYPFEMIDKFIDVQPGVSASAIKLISINEWFFANQTSSRLAVPRPIMIEAMAQTGVAAILSIPENKGKNVFFGGIKNATFQDDFRPGDKLEFEVVMKKLKRNIGLGHGTIHRDGQSICEADLIFAVE >CP019581.1|AZK91578.1|1296822_1298202_-|Biotin-carboxylase MFKKVLVANRGEIAVQIIRALHDMGITAVAVYSSADKDSLFVHLADEAICIGGPQPSESYLNMAQIISAANLTGCEAIHPGYGFLSENAEFAELCETCHIKFVGPSHELISLMGDKSNAREAMEKAGVPVIPGSQGVVKTVSQAETVAEKIGFPVLLKAAAGGGGKGIREVDRPEDLHSAFEQTQQEARVSFNNDDIYVEKLIRNAKHVEMQVIADEFNHVVYLPERDCSLQRNHQKVIEESPCVQISPTERKKLGEIVANATLKLGYTNTGTYEFLMTEDHHFYFMEMNTRLQVEHTITEEVTGIELVKAQLKVADGQELPFTQADVAVKGHALECRLNAEDPSHNFAPRPGRINHLFFPAGSLGVRIDSGVAQGSFISPFYDSMIAKVVVHLNDRNTVIAKMNRILEELKINGVVTNQTFLKYLINTAEFNSGQYSTNFIENQVLTNKEGFHVAESV >CP019581.1|AZK91588.1|1304728_1304914_-|Pyrimidine-nucleoside-phosphorylase MDYSVGIVLNKKIGDKVESGEPLLTIYSNREEVDDIKKLLYDNIEVADTAKVPELIYTTIE >CP019581.1|AZK91589.1|1305021_1305771_-|Pyrimidine-nucleoside-phosphorylase MGDKTSIPLAAVVAALGIPVPMISGRGLGHTGGTLDKLEAIPGYQVEISEQDFIKQVKKDHLAIIGATGNIAPADKKIYALRDVTDTVDSIPLIAGSIMSKKIASGTDALVIDVKTGAGAFMKTLEDSKALARALVDIGKGVGMQFMALITDMNQPLGNAIGNSLEIEESIDLLKGNGPADLEKLIVTIGGYMAVMGDKAKTTAEGQKMCEEVIHNGQALASFEAMVRDQGGDPNVVNDPNGVLPQAKY >CP019581.1|AZK91590.1|1305947_1307135_+|Putative-niacin/nicotinamide-transporter-NaiP 
MDNTQQRPTFIFLIIGTAWLFDAMDVGLLSFIMPIVHQQWALSNSQTGLISSVSTIGMVCGGFYFGHLADRIGRKNTLIATLLTFSIGNLILAISPGFYTFLGIRFFVGMGLGGELPVAATYIADIYRGTKRSQMLILADSFWALGWLVASFLSFLLTPVLGWRGILVVTAIAGVFAIVLRKHIHETAPKSTGTQHWLVSLKTTFKPWTLMLWLAWFMVMFSYYGMFMWLPSIMVDKGYGIVNSFGYTTIIVVAQLPGYLCASWLAKRIRVKYVFAIYMLGTAFGAIMFGQSASALLIVISGCVLSFFNLGAYGAIIALTPELYAHNIRGTMTGMAQGIGRIGAIFGPLLIGVLMDHQISISIIFVIFMVSLLIGSIAVLALPSADQQPNGEVNQ >CP019581.1|AZK91591.1|1307131_1307971_+|putative-nicotinate-nucleotide-pyrophosphorylase-[carboxylating] MNPIVLKEKISEFLKEDLGFGDLSVAFLPGGTPLSGSFIAKQSGIICGQEIPQATYDLLGHATYKPRIPDGAPVKAGDIIGTVSGTAQTLLSGERVTLNLIQRMSGIATQTTHFVKLLDDATIRITDTRKTAPGLRLFDKYAVSVGGGFNHRFDLTGGIMLKDNHIALAGGVTQALAAVKRHVGPLTPVEVEVETEEELRQAVAGGANVIMFDNQNPETIKQWRQLVPKTIKVEASGGITAESISTFKGCGADFISIGNLTNDVTPLDISFLVAGAVKS >CP019581.1|AZK91592.1|1308274_1309513_-|Transposase MSQLDNTLKLLGITDTNIQVFGTREEFHGRGSGRKKYLVIQAELTYTLRRCPSCGYNMLPPSGHKLTHVHIAGPMDRPVILELNKQRWRCSNCHSTCTATTPVVSTNHAIGHGLATHVLKLASKSLPAKTIASLTGISTNSVQRILTANIHPHASRRLPINLCFDEFRSTHGSMSFICIDADTHKSVKVLSDRLNRTIKQFFLSQYSTAERAAVQRVIMDMNASYQAFVHELFPNAELIIDRFHIIQLMGRTMDTIRTQCFKQLDKHSRKYKVLKSLWRLFHKANPDIQKSRYLFGLNEYSTEQNAIDIGTDTFPAFKTAYETYIDLHDALMGRHADELKNIITNYQPNGTPLDTAMHTLRKNLNGVINAAKSSYSNGPIEGINRMIRELKRACYGFSNQANMFTRVYQLIA >CP019581.1|AZK91593.1|1309716_1311984_+|UvrABC-system-protein-A 
MTDLFADGTISIHGAQENNLKDVSLDIPKHKTTVFAGLSGSGKSSLVFDTLAAVSRRELNETFPSFTQQYLPKYGQPEVNRIDNLPVAIVVEQKPIGRNSRSTLATYTGIYSVLRLMFSRIGQPWVGYSEWFSFNLPQGMCPKCQGLGFVDDIDERQLIDPNKSLNEGAMTFAGFQPGTWRWKEYGNSGLFDLDKKIKDYTDEEYDLFMHAPQQKLKNPPANWGRTALYEGLVPRMLRSVIHSASGRHHEAALSKIVTRKPCPVCHGTRLNKKALTGKIAGKNIAEVSDMDLVSVLKFLDNISDPKAKTMVRELRSKIQALVDIGLGYLSLGRGTDTLSGGEAQRIKIAKYLTSSLSDLVYVLDEPSVGLHPHDIKLITQSLKKLKEHGNTIILVDHNPAIISTADYVVEMGPQAGKNGGQVTSTGTYDELLRSDTITGKMLREKITFPKPREPQSWLNVNHVTSHNLKDVSTRIPQGVMTVISGPAGSGKSTLVQAFKQQVSDQDYIDLSQDSVGLNIRSTPATYLNILNPLRKLFSKANNGVSTQLFSYNGKGACPRCKGKGVMITEMAFMDPIVQECELCHGKRYSQEALQYTYHGKDISEVLNLSINDTLEFFKDVPDIYKKVSLLHQVGLGYLNLSQSMTTLSGGEVQRVKLAMELNHTGRIYFLDEPTTGLHLQDTQQLIDLFEGLVDKGNTLILIEHNLKLISRADWLVDMGPDAGKFGGQVCFEGHPKDSLNDKNSRTGAALAAIIS >CP019581.1|AZK91594.1|1312084_1312663_-|Integrase-core-domain-protein MDELNVQVSLYNRHRNGRYSSYKGTVGKVARNVLHQHFNETVPFKVLHTDVTQVRLADTKWAYVSAITDEASKEVLAFQVSNSPNSKLIMDTLDELTENIPEGIKPIIHSDQGWHYQLNYYTDKLSEKNLYKACLVRETVLIMRQLKASFIFLKQNVLMDFHSVKILENSRKSQRITSIGLTIDEYHRKQKA >CP019581.1|AZK91595.1|1312941_1313475_-|Transposase MVKYSSELKAEVVSEYLQGDISISLLSKKRNLPRIQVGRWIQNFRLSGADALKRRRVKRSFSVEFKVDVINYYQTHDETLAEVSAKFDVNSCQISLWRTAFNEYGIEALKPHPKGRKTKVKHNKKKLRKLVNKNEIDQLREELTKKNQELYDAKLENEILKKSMTLFGTSKDERKHK >CP019581.1|AZK91596.1|1313684_1314578_+|Type-I-restriction-modification-DNA-specificity-domain-protein MLELAFEKEVVKTLTTGSNQWVERKDLYGATPDQLWANFRDKLNNNNYAKLQGHPLTDTEFNQVKRAIEFPTPYEAAKLLAAENGSKFPQLRFAGFADAWEQRKLSDFSKTTYGGGTPKTAVTEYWDGNIPWIQSSNLTVDDVQEVNLDKFITDNAIKNSAAKLIPANSIAIVTRVGVGKLTLMKQEFATSQDFLSLSELHVDEQFGLYSIYKLLQKELNNIQGTSIKGMTKADLLTKDIMIPVEKDEQIKIGSFFKQLDHLITLHQRKLEKLQELKKRVSTKDVLLILNLIKFRIL >CP019581.1|AZK91597.1|1314574_1314733_+|hypothetical-protein MKPHDLIFIVLGQHLIHLYNGLSREIKNQPRAIKPLLTAKESYNTSKVSKQG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP019581_3 | 2064058-2066355 | TypeI |
I-C
Consensus repeat of CP019581_3
|
34 spacers
spacers of CP019581_3
>3.1|2064090|34|CP019581|PILER-CR,CRISPRCasFinder,CRT AATGACTGATGAACAAAAGACAGCATTGAAGAAT >3.2|2064156|36|CP019581|PILER-CR,CRISPRCasFinder,CRT TGTTCAGCTCTATCTTGATGTCAAAGATCCTAACGG >3.3|2064224|36|CP019581|PILER-CR,CRISPRCasFinder,CRT ATTCTGTAAGGACTGAAACACAAACTCTAAAATATA >3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT ATCGTTATGCAGGTATTACTACTGCTACTAATAC >3.5|2064358|35|CP019581|PILER-CR,CRISPRCasFinder,CRT TCAGGAGATACTGCCGTTGGCTTTGAAGATAAATA >3.6|2064425|35|CP019581|PILER-CR,CRISPRCasFinder,CRT AATTCATTACTGGCATTACTCCAGTCACTGCGCCA >3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT GAAAGGAAATATATATGTCAGTCAAAGTTAATGGT >3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT GATTTTGTTAGAAAATTAACCGATGATTTTTTACA >3.9|2064626|34|CP019581|PILER-CR,CRISPRCasFinder,CRT TACCCACCTAGCTTAGCACCTACATGGAATGACA >3.10|2064692|36|CP019581|PILER-CR,CRISPRCasFinder,CRT CATTGTAGTAAAAGTTGCAGGGTCTTGAGTAAGAAC >3.11|2064760|34|CP019581|PILER-CR,CRISPRCasFinder,CRT AATTAAGTCTAATCATAGGGGCAATAATTGCTTA >3.12|2064826|35|CP019581|PILER-CR,CRISPRCasFinder,CRT TTTTTCGTCTTCTGCTCCTAGTGACTTAGTGAGCC >3.13|2064893|34|CP019581|PILER-CR,CRISPRCasFinder,CRT AAGTGAATCGACAGGTATTGCAGATGATGATTTA >3.14|2064959|35|CP019581|PILER-CR,CRISPRCasFinder,CRT AAGATGTGGAAGAGTGCAAGCACACACAGTCCCAA >3.15|2065026|34|CP019581|PILER-CR,CRISPRCasFinder,CRT TTAGAATACTTATGATCCTTAGATAGATGAACCG >3.16|2065092|35|CP019581|PILER-CR,CRISPRCasFinder,CRT CTCTGATCTGCTCAGCACGTTGCTCTTGCTTTGAA >3.17|2065159|35|CP019581|PILER-CR,CRISPRCasFinder,CRT GATTATTCGGAACAACAGATGATCCTAAATGCTCA >3.18|2065226|35|CP019581|PILER-CR,CRISPRCasFinder,CRT CTAAAGCATACTTAGTAGCATATGAACTGGCTGAC >3.19|2065293|34|CP019581|PILER-CR,CRISPRCasFinder,CRT TACAAACACTTGCCAACTTATAATCCACAAATGC >3.20|2065359|34|CP019581|PILER-CR,CRISPRCasFinder,CRT AAAGGTGTGTGGATACCTGCTGAATATTGGTTAG >3.21|2065425|35|CP019581|PILER-CR,CRISPRCasFinder,CRT AGCATCAACGGCTCTAATTATGTCATTGCCGGTTT >3.22|2065492|34|CP019581|PILER-CR,CRISPRCasFinder,CRT ACCTGATCAAACGTTAGGCAATAGATTAACCGAA 
>3.23|2065558|34|CP019581|PILER-CR,CRISPRCasFinder,CRT TTTACGAATGTCTTGCCAATTAGTATATTTACTT >3.24|2065624|34|CP019581|PILER-CR,CRISPRCasFinder,CRT AGAATCAAGACCACCTGTTAGCGTTTTACCACCG >3.25|2065690|35|CP019581|PILER-CR,CRISPRCasFinder,CRT AAAGAACATAGTTTGCAGAACTGTTTCCAAGTAAA >3.26|2065757|36|CP019581|PILER-CR,CRISPRCasFinder,CRT GGCTTTAGATGCGTCAAATGCCGCCTGTGGGCTATC >3.27|2065825|34|CP019581|PILER-CR,CRISPRCasFinder,CRT AGAAGATGGATACAATTTTTAGAAATTCCAGTCT >3.28|2065891|35|CP019581|PILER-CR,CRISPRCasFinder,CRT ACACCATATGTGGTCAACGCTTTAATTGGATTACG >3.29|2065958|34|CP019581|PILER-CR,CRISPRCasFinder,CRT ACTAACTGGAACATCCAAGTACTGTGCGGAAACC >3.30|2066024|35|CP019581|PILER-CR,CRISPRCasFinder,CRT ATCGGCTAATTGCTTTTCAAGTGAATCAGCCTTTG >3.31|2066091|34|CP019581|PILER-CR,CRISPRCasFinder,CRT ACTGAGACGCTAGACGCTATTAGATCACAGTCAC >3.32|2066157|35|CP019581|PILER-CR,CRISPRCasFinder,CRT ATCAAGCTTGCCAACGGGTCGATTTTATTCTGTGA >3.33|2066224|34|CP019581|PILER-CR,CRISPRCasFinder,CRT TTTTGAATAGATGGCATGTAAGTCTGATATAGTT >3.34|2066290|34|CP019581|CRT TTTAGGTCCATTGCACTCTTGCCAGTGAAGTCGT |
cas2,cas1,cas4,cas7,cas8c,cas5,cas3 |
CRISPR arrays and Neighbor proteins around CP019581_3
The CRISPR arrays of CP019581_3 >merge|CP019581|3|2064058-2066355|PILER-CR,CRISPRCasFinder,CRT GTCGCACTCCTTGTGAGTGCGTGGATTGAAATAATGACTGATGAACAAAAGACAGCATTGAAGAATGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTGTTCAGCTCTATCTTGATGTCAAAGATCCTAACGGGTCGCACTCCTTGTGAGTGCGTGGATTGAAATATTCTGTAAGGACTGAAACACAAACTCTAAAATATAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATATCGTTATGCAGGTATTACTACTGCTACTAATACGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTCAGGAGATACTGCCGTTGGCTTTGAAGATAAATAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAATTCATTACTGGCATTACTCCAGTCACTGCGCCAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATGAAAGGAAATATATATGTCAGTCAAAGTTAATGGTGTCGCACTCCTTGTGAGTGCGTGGATTGAAATGATTTTGTTAGAAAATTAACCGATGATTTTTTACAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTACCCACCTAGCTTAGCACCTACATGGAATGACAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATCATTGTAGTAAAAGTTGCAGGGTCTTGAGTAAGAACGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAATTAAGTCTAATCATAGGGGCAATAATTGCTTAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTTTTTCGTCTTCTGCTCCTAGTGACTTAGTGAGCCGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAAGTGAATCGACAGGTATTGCAGATGATGATTTAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAAGATGTGGAAGAGTGCAAGCACACACAGTCCCAAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTTAGAATACTTATGATCCTTAGATAGATGAACCGGTCGCACTCCTTGTGAGTGCGTGGATTGAAATCTCTGATCTGCTCAGCACGTTGCTCTTGCTTTGAAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATGATTATTCGGAACAACAGATGATCCTAAATGCTCAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATCTAAAGCATACTTAGTAGCATATGAACTGGCTGACGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTACAAACACTTGCCAACTTATAATCCACAAATGCGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAAAGGTGTGTGGATACCTGCTGAATATTGGTTAGGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAGCATCAACGGCTCTAATTATGTCATTGCCGGTTTGTCGCACTCCTTGTGAGTGCGTGGATTGAAATACCTGATCAAACGTTAGGCAATAGATTAACCGAAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTTTACGAATGTCTTGCCAATTAGTATATTTACTTGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAGAATCAAGACCACCTGTTAGCGTTTTACCACCGGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAAAGAACATAGTTTGCAGAACTGTTTCCAAGTAAAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATGGCTTTAGATGCGTCAAATGCCGCCTGTGGGCTATCGTCGCACTCCTTGTGAGTGCGTGGATTGAAATAGAAGATGGATACAATTTTTAGAAATTCCAGTCTGTCGCACTCCTTGTGAGTGCGTGGATTGAAATACACCATATGTGGTCAACGCTTTAATTGGATTACGGTCGCACTCCTTGTGAGTGCGTGGATTGAAATACTAA
CTGGAACATCCAAGTACTGTGCGGAAACCGTCGCACTCCTTGTGAGTGCGTGGATTGAAATATCGGCTAATTGCTTTTCAAGTGAATCAGCCTTTGGTCGCACTCCTTGTGAGTGCGTGGATTGAAATACTGAGACGCTAGACGCTATTAGATCACAGTCACGTCGCACTCCTTGTGAGTGCGTGGATTGAAATATCAAGCTTGCCAACGGGTCGATTTTATTCTGTGAGTCGCACTCCTTGTGAGTGCGTGGATTGAAATTTTTGAATAGATGGCATGTAAGTCTGATATAGTTGTCGCACTCCTCGTGAGTGCGTGGATTGAAATTTTAGGTCCATTGCACTCTTGCCAGTGAAGTCGTGTCGCACTCCTTGTGATGCACTTGTGGGTGTA >CP019581|3|2|2064058-2066289|PILER-CR GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATGACTGATGAACAAAAGACAGCATTGAAGAAT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TGTTCAGCTCTATCTTGATGTCAAAGATCCTAACGG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATTCTGTAAGGACTGAAACACAAACTCTAAAATATA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCGTTATGCAGGTATTACTACTGCTACTAATAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TCAGGAGATACTGCCGTTGGCTTTGAAGATAAATA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATTCATTACTGGCATTACTCCAGTCACTGCGCCA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GAAAGGAAATATATATGTCAGTCAAAGTTAATGGT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GATTTTGTTAGAAAATTAACCGATGATTTTTTACA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TACCCACCTAGCTTAGCACCTACATGGAATGACA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CATTGTAGTAAAAGTTGCAGGGTCTTGAGTAAGAAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATTAAGTCTAATCATAGGGGCAATAATTGCTTA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTTTCGTCTTCTGCTCCTAGTGACTTAGTGAGCC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAGTGAATCGACAGGTATTGCAGATGATGATTTA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAGATGTGGAAGAGTGCAAGCACACACAGTCCCAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTAGAATACTTATGATCCTTAGATAGATGAACCG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CTCTGATCTGCTCAGCACGTTGCTCTTGCTTTGAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GATTATTCGGAACAACAGATGATCCTAAATGCTCA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CTAAAGCATACTTAGTAGCATATGAACTGGCTGAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TACAAACACTTGCCAACTTATAATCCACAAATGC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAAGGTGTGTGGATACCTGCTGAATATTGGTTAG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGCATCAACGGCTCTAATTATGTCATTGCCGGTTT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACCTGATCAAACGTTAGGCAATAGATTAACCGAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT 
TTTACGAATGTCTTGCCAATTAGTATATTTACTT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGAATCAAGACCACCTGTTAGCGTTTTACCACCG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAAGAACATAGTTTGCAGAACTGTTTCCAAGTAAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GGCTTTAGATGCGTCAAATGCCGCCTGTGGGCTATC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGAAGATGGATACAATTTTTAGAAATTCCAGTCT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACACCATATGTGGTCAACGCTTTAATTGGATTACG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACTAACTGGAACATCCAAGTACTGTGCGGAAACC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCGGCTAATTGCTTTTCAAGTGAATCAGCCTTTG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACTGAGACGCTAGACGCTATTAGATCACAGTCAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCAAGCTTGCCAACGGGTCGATTTTATTCTGTGA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTTGAATAGATGGCATGTAAGTCTGATATAGTT GTCGCACTCCTCGTGAGTGCGTGGATTGAAAT >CP019581|3|2|2064058-2066289|CRISPRCasFinder GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATGACTGATGAACAAAAGACAGCATTGAAGAAT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TGTTCAGCTCTATCTTGATGTCAAAGATCCTAACGG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATTCTGTAAGGACTGAAACACAAACTCTAAAATATA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCGTTATGCAGGTATTACTACTGCTACTAATAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TCAGGAGATACTGCCGTTGGCTTTGAAGATAAATA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATTCATTACTGGCATTACTCCAGTCACTGCGCCA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GAAAGGAAATATATATGTCAGTCAAAGTTAATGGT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GATTTTGTTAGAAAATTAACCGATGATTTTTTACA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TACCCACCTAGCTTAGCACCTACATGGAATGACA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CATTGTAGTAAAAGTTGCAGGGTCTTGAGTAAGAAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATTAAGTCTAATCATAGGGGCAATAATTGCTTA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTTTCGTCTTCTGCTCCTAGTGACTTAGTGAGCC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAGTGAATCGACAGGTATTGCAGATGATGATTTA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAGATGTGGAAGAGTGCAAGCACACACAGTCCCAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTAGAATACTTATGATCCTTAGATAGATGAACCG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CTCTGATCTGCTCAGCACGTTGCTCTTGCTTTGAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GATTATTCGGAACAACAGATGATCCTAAATGCTCA 
GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CTAAAGCATACTTAGTAGCATATGAACTGGCTGAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TACAAACACTTGCCAACTTATAATCCACAAATGC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAAGGTGTGTGGATACCTGCTGAATATTGGTTAG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGCATCAACGGCTCTAATTATGTCATTGCCGGTTT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACCTGATCAAACGTTAGGCAATAGATTAACCGAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTACGAATGTCTTGCCAATTAGTATATTTACTT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGAATCAAGACCACCTGTTAGCGTTTTACCACCG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAAGAACATAGTTTGCAGAACTGTTTCCAAGTAAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GGCTTTAGATGCGTCAAATGCCGCCTGTGGGCTATC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGAAGATGGATACAATTTTTAGAAATTCCAGTCT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACACCATATGTGGTCAACGCTTTAATTGGATTACG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACTAACTGGAACATCCAAGTACTGTGCGGAAACC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCGGCTAATTGCTTTTCAAGTGAATCAGCCTTTG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACTGAGACGCTAGACGCTATTAGATCACAGTCAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCAAGCTTGCCAACGGGTCGATTTTATTCTGTGA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTTGAATAGATGGCATGTAAGTCTGATATAGTT GTCGCACTCCTCGTGAGTGCGTGGATTGAAAT >CP019581|3|1|2064058-2066355|CRT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATGACTGATGAACAAAAGACAGCATTGAAGAAT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TGTTCAGCTCTATCTTGATGTCAAAGATCCTAACGG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATTCTGTAAGGACTGAAACACAAACTCTAAAATATA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCGTTATGCAGGTATTACTACTGCTACTAATAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TCAGGAGATACTGCCGTTGGCTTTGAAGATAAATA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATTCATTACTGGCATTACTCCAGTCACTGCGCCA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GAAAGGAAATATATATGTCAGTCAAAGTTAATGGT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GATTTTGTTAGAAAATTAACCGATGATTTTTTACA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TACCCACCTAGCTTAGCACCTACATGGAATGACA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CATTGTAGTAAAAGTTGCAGGGTCTTGAGTAAGAAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AATTAAGTCTAATCATAGGGGCAATAATTGCTTA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTTTCGTCTTCTGCTCCTAGTGACTTAGTGAGCC 
GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAGTGAATCGACAGGTATTGCAGATGATGATTTA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAGATGTGGAAGAGTGCAAGCACACACAGTCCCAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTAGAATACTTATGATCCTTAGATAGATGAACCG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CTCTGATCTGCTCAGCACGTTGCTCTTGCTTTGAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GATTATTCGGAACAACAGATGATCCTAAATGCTCA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT CTAAAGCATACTTAGTAGCATATGAACTGGCTGAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TACAAACACTTGCCAACTTATAATCCACAAATGC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAAGGTGTGTGGATACCTGCTGAATATTGGTTAG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGCATCAACGGCTCTAATTATGTCATTGCCGGTTT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACCTGATCAAACGTTAGGCAATAGATTAACCGAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTACGAATGTCTTGCCAATTAGTATATTTACTT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGAATCAAGACCACCTGTTAGCGTTTTACCACCG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AAAGAACATAGTTTGCAGAACTGTTTCCAAGTAAA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT GGCTTTAGATGCGTCAAATGCCGCCTGTGGGCTATC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT AGAAGATGGATACAATTTTTAGAAATTCCAGTCT GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACACCATATGTGGTCAACGCTTTAATTGGATTACG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACTAACTGGAACATCCAAGTACTGTGCGGAAACC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCGGCTAATTGCTTTTCAAGTGAATCAGCCTTTG GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ACTGAGACGCTAGACGCTATTAGATCACAGTCAC GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT ATCAAGCTTGCCAACGGGTCGATTTTATTCTGTGA GTCGCACTCCTTGTGAGTGCGTGGATTGAAAT TTTTGAATAGATGGCATGTAAGTCTGATATAGTT GTCGCACTCCTCGTGAGTGCGTGGATTGAAAT TTTAGGTCCATTGCACTCTTGCCAGTGAAGTCGT GTCGCACTCCTTGTGATGCACTTGTGGGTGTA
>CP019581.1|AZK92343.1|2063605_2063896_+|CRISPR-associated-endonuclease-Cas2 MMVVVSYDINTESKSGQRRLRHVAKICLDYGQRVQNSVFECKVNSMQLELMKERLLDEIDDSQDSLYFFNLGKNYKNRIKSYGIKEVINLESPVIF >CP019581.1|AZK92342.1|2062564_2063596_+|CRISPR-associated-protein-Cas4/endonuclease-Cas1-fusion MRQLLNTLYINTPDSYLSSDGNNVVVKIKNNAVGRLPLQNFEAIVTFGYSGVSPSLMQKCLEQDISISFLSRTGRLKGRVVGEPTGNVYLRKTQFFNSENDAASLLIAKNMIIGKVYNHRWIIERFIRDHGMQIDRDKFKSISENLKNGLKDLQQVDTIDSLRRLEGSLANGYFSVFDDMIINQKDDFFYHGRSRRPPLDRLNALLSFSYSLLANECADALTTNGLDPYEGFMHVDRPGRKSLALDLMEELRGVIADRFVLRLVNKKEIHASDFVCKADGAYLLTDDARKSFLAKWQDNKISELEHPFLKEKIEWGLVPFAQAQLLARYLRGDLDEYPPFMWK >CP019581.1|AZK92341.1|2061911_2062568_+|PD-(D/E)XK-nuclease-superfamily-protein MSYDEKDYLMISGIQHFVFCKRQWALDHIENQWADNYLTVSGNRLHEKADDPYISETRGSKFVVRAMPIHSQEYGLTGIFDVVEFQKDTKGVQVFGKKEKYLPIPVEYKHGKSKIDDSDRLQVLAEAVCLEEMLFCHLDYGYLYYGRTRHREKVEFSEELRTELKKVISEMHYYWEKKYTPKVKVTKKCKSCSLRDICLPELLKRESVSNYISRKLNQ >CP019581.1|AZK92340.1|2061057_2061909_+|hypothetical-protein MVLSNKIDFKVYVAVHGANPNGDPLNGNRPRQTNDGFGEISDVAIKRKIRNRLQDMGHKIFVQSADRTDDGFKSLKDRADSIDAIKKAAKAKDADKYAEVACKNWMDVRSFGQVFAFKGDKLSVGVRGPVSIQTARSISPIFINEMQITKSVNSTTGDKKSSDTMGMKYSIDFAVYEFGGSINVQLAEKTGFSDDDAESIHKALETLFENDESSARPAGSMEVLRVYWWKHNSKLGQHSPAEMNRAVEIKNNNEPNAPRSLDDIEFIDKTPEDMKENEQIYEG >CP019581.1|AZK92339.1|2059081_2061055_+|CRISPR-associated-protein-(Cas_Csd1) MNWISDLAKVYDDNESIAGKAETVSVGKEKTKTVTLVPISHIAVNIPIQINLDKDGGFKGADVIDEKDNQRTIIPATLKSASRSSGSAPMPIDDTLKYIAKDYYPEISNKDKDSHYYSDYINQLKGFVEYVNNKNSSDRVRQQVNAIYTYVSQNDIFADLLFKAHLFGDEIKQVSAIPMKWTGKEEKPAVYKAITGDLDRSFVRFNVRGLGLDRSFEDPDLYEAWGKYYLTTLRNDTGVDYVDCNNDAILTDNHPKGIIPSASNAKLISANDTTNYTFKGRLLNSDEVATIGYLNSQKAHHALRWLIDKQGFSIGGRYYLAWGRKQQNYMIDNQTSPMFKILTSTYNTDQAESYTNERLAKSYYNSLVKGIELNGDNLEDLVYLMQIDTSTPGRADIVSYQALDLNQYIRKLSSWYGKISLFIQNKAGEFVDPPYSLRTIANMIHGSKANDDLKKNTISELISVILGSQIVPRGIIMPLYNKAIRPLSFNPKDPEARFIGWQPIVRLTSKLLKIWYENEGIKAMLNDEINDRSYLYGRLLAVADVLEGDALKNKEVDRPTNAQRYMSAFAQRPADTWKTIYMNAQPYFKQSKGNRRGQILIDHIFDKLQINEENINRLNDPLDGKFLIGYSQQKVDWFRKIREYSEKKQKEQEKGDK 
>CP019581.1|AZK92338.1|2058338_2059082_+|CRISPR-associated-protein-Cas5 MLNEIIKSPSFSYKVFGDWALFTDPIMRLGGEKFTYSIPTYQALKGIAESVYWKPTIVIHIDKVRVMKPILVEAKDMRLFKYNNSRSSDLARYTYLKDVEYQVLAHFEFNMNRLDLAKDRNIKKHLAIMKRCIKRGGRRDVFLGTRECQGYVEPCKFGEGEGYYDNSDDRLFGTMFHGFDYPDETGNSMLKARLWNVVMKKGVVDFIRPEDCPVHKPLHELQVKKFKKGENLESVDQLYSQMLEGDS >CP019581.1|AZK92337.1|2055789_2058324_+|helicase-Cas3 MVGNFIGHTKKLKNNTIETQSLRDHLLNTQKYAEKYASDLSLEHVAGLAGILHDLGKYQSKFQEYIIESTRKGDQSKKGSIDHSSFGAIFLRDFISENFSEKENYYDFLDFGGILENAIFSHHNYLGLKDYINPDLMSPFLNRIEKFKDDEEKKRQLKKCKELFYKDVITEEKFTKYFQLAFDEYEAFISKIRNKVTLKTEQILNENAKNNKQVMLELQAKYFLSEYVYSCLLDADRTDATAFKLSKNPNFSDNTELFEKYYSKLVGKLHKLNKNDNSKINQLRAEISEECDQAAERPSGIYTLSASTGSGKTLTSLRFGLKHAKLYHKKHIIYVLPYITIIEQNSEVIRKFLNDNKDDSQNILEFHSNVSQKVADKSEETTNALDLTEDSWDSPIIVTTMVQFLDSIFASGTKHRRRFHNLCDSIVIFDEVQKVPIKCLDMFNEAVNFLKNFGNTNVLLCTATQPALEEVKQKLDLNIDHEIIPNLIEHEQQFKRVEFIDKTQNDDGIDLVLNSIQAAELIFKKSQNFKSILGIFNTIDVTKKIYSNLKNKFDSISDQIKLEYLSTNMCPADRKERIKNVLNLVKEGKRVICISTPLIEAGVDASFECVFRSLSGLDSLVQAAGRCNRNNELKLGKVYLLNMDPSEEHIAKLNEVKTGKDQVLELLSEGIKADDFLNANVIKKYFEMFYSKLASTMSYPTNGINLENYIDGIKNVHELAYQSKRKDASKFEKLTQFSGSETIAKYFQVIKNNTKSVLAPYGDKGEKLIADLNGNQDINSLIMLVKDAQPYIVNLYDNKFNQLFEEGDIYTLCQVGNEVIYAFRPYAYNKLVLGDRKQIEKSIF >CP019581.1|AZK92336.1|2054028_2055288_+|Phosphoribosylamine--glycine-ligase MKDDLVLLVVGSGGREFAVAKKLQESPHVKTVYCAPGNVGMQTIGVETVPIEETDLDGLLDFAKSKHVDWTFVGPENVLCAGITDKFEKAGQKIFGPNQRAAQLEGSKDYALRFMNKYDVPTARHETFTSAETCIAGLKDFDFPVVIKEDGLAGGKGVTIAKNQDVAEETIREMFAGGQTAVVLEECLVGPEYSMFVVVSEDQFTILPMAQDHKRVGDGDKGPNTGGMGSYSPLPQLKKEDRQKMIDEIVKPTMNGLVQGNYHYCGVLYIGLMLTEDGPKVIEYNVRLGDPETQVVLPRVKNDFAELIDAAVNHEKLPEIEENDQSVLGVVVCSKGYPTHPAPNVKIGKLPEGTNTYIDYANVKGDLDNLTGDGGRLFMVISEADNLVQAQDNVYSYLSKLDLPDCFYRHDIGNRALRD >CP019581.1|AZK92335.1|2052474_2054016_+|Bifunctional-purine-biosynthesis-protein-PurH 
MKRALISVSDKTNLVDFAKGLVRNGYEIISTGGTKKTLDEAGIKTISVEEVTNFPEILDGRVKTLNPYIHGGLLAKRDDPEHMATLKKLNIQPIDLVCVNLYPFKQTIEKADVTREEAIENIDIGGPSLLRAASKNYQDVTVVTDKADYDLVLKEIEEKGNTTLETRAKLAAKVFRATAAYDAIIANYLTKQVGLEDPEKLTLTYDLKEKMRYGENSHQKAWLYEDAIPKSFSILQAHQLHGKKLSYNNIKDADEALRCIREFQDEPTVVAMKHMNPCGIGRGDTLEEAWDRAYEADSVSIFGGVIALNRKVDLATAEKMHKIFLEIIIAPGFDDDALAVLEKKKNVRLLELDFSKENEKTRPEVVSVMGGILQQEQDTLIENTDDWKVVTKAEPTAAQLKTMMFALKAVKHTKSNAIVVANDERTLGVGAGQPNRIDSAKIAINHAGDAIDDRAVLASDAFFPFNDCVEYAAKHGIKAIVQPGGSIRDKDSIEMADKYGVAMVFTGYRHFRH >CP019581.1|AZK92334.1|2051876_2052473_+|Phosphoribosylglycinamide-formyltransferase MRVAILASGNGTNFEALTKKFQAGEIPGTEALMFCNHPNAPVVKRAERLGIPHEAFSVKECGGKTAYEKRLLKVLQDYQIDFIVLSGYLRVVGPTILNEYPNAIINLHPALLPSYPGLNSIERAFEDYKQGKIKETGVTVHFIDAHLDHGPIIAQQAVPIYPDDTVETLEARVHETEHQLFPATLKKVLSQRMEKEEN >CP019581.1|AZK92344.1|2066590_2066749_+|hypothetical-protein MVAGVLGMFAFGYASWRGRQNVTLVIEDYEKKIAEIKKLDQNASGTEKIRFK >CP019581.1|AZK92345.1|2066953_2069617_+|DNA-polymerase-I MADKKLLLIDGNSVAFRAFYALYRQLESFKSPDGLHTNAIYAFKNMLDVLLKDVDPTHVLVAFDAGKVTFRTKMYGEYKGGRAKTPEELLEQMPYIQEMLHDLGIKTYELKNYEADDIIGTFAAKGEKAGFTTTIVTGDRDLTQLASDKTTVEVTKSGVSQLEAYTPEHMKEVNGVTPTEFIDMKALMGDNSDNYPGVEGIGPKTASNLIQEYGSVENLYDHIDEMKKSKRKERLIRDKDKAFLAKKLATIDRDSPVTIDIDDVKREPVDYEKLWQFYEKMNFRKFLAELNASGAGQDGAEVEKVEYTVLNDDNVKDVKATEKDTVEFYLEMLGANYHLADFVGFSLKINDKIYVSRDVDLLEEDNIKHILEDEKIKKNVFDLKRSMVGAHRLGIHTHGLDYDMLLASYLVNNENNSNDLGEIAHLYGDYSVKTDLEVYGKGKSEHIPDDDDELFNHLASKVNAIESLKKTLLEKLKDHEQDDLFDTIEIPTARVLAKMEINGMKVEASTLIQLQNEFAVKLQDLEKKIYQQAGEEFNLNSPKQLGHILFEKLGLPVLKKTKTGYSTSVEVLDQLKTQSPIVKEILDYRQIAKIQSTYVKGLLDVIQPDGRVHTRYLQTLTATGRLSSVDPNLQNIPTRTEEGKQIRKAFVPSDPDGYIFSCDYSQVELRVLAHVSGDQNMQEAFKTGYDIHSHTAMKIFHLESPDEVTPLMRRHAKAVNFGIVYGISDYGLSKNLGISRKRAQEFIDNYFEQYPQIKDYMNKAVQEARDKGYAETIMHRRRYLPDIHAKKYTVRAFAERTAINSPIQGSAADIIKIAMINMQKKLDELHLKTKMVVQVHDELIFDVPKDELETIKKIVPEVMQSAVKLDVPLIADSGWGHNWYDAK >CP019581.1|AZK92346.1|2069625_2070456_+|Formamidopyrimidine-DNA-glycosylase 
MPEMPEVETVRRTLIPLIKGKTIEKVILWYPKIVATDHEKFLSELPGKKIIDIDRYAKYLLIRLSDNLTIVSHLRMEGKYHLTTSDAPKDKHDHVEFIFTDGTALRYNDVRKFGRMQLILTGTERQTTGIGKLGYEPNSSEFTSEYLVNGLKRKKKNIKNTLLDQSVVAGLGNIYVDEVLWRTKIHPLSQANKIPAEKVIELHDQINQIITEAIKLQGTTVHSFLNANGQVGGFQSKLQVYGHVGEPCSVCGTKFEKIKVNGRGTTFCPHCQVIYK >CP019581.1|AZK92347.1|2070452_2071055_+|Dephospho-CoA-kinase MSYVLALTGGIATGKSTADDFFRKKNIPIIDCDQIAHELMEPGNASWQAIKDHFGMEYLNSDQTINRKKLGQLVFSNKQALSELNQVTHPLIFDKTVAKIKEYRDFALVILDVPVYFEAGLDKKHVANGVLVITLPEQLQIERLKKRNNLTDQEAINRINSQMPLVEKEKMADFVVANTGKIKELENKLEQILIKIREEE >CP019581.1|AZK92348.1|2071057_2071525_+|Transcriptional-repressor-NrdR MECPNCHQNASRVIDSRPSDENRAIRRRRECENCGFRFTTFERIETAPLLVIKNDGTREPFNRKKILHGVMAAGQKRPISSDQFEQLVDHVENKVRKQGISEISSKKIGQYVMDELADLDDVAYIRFASIYREFKDMSSFMKTMEDMMAKKGKGN >CP019581.1|AZK92349.1|2071527_2072859_+|Replication-initiation-and-membrane-attachment-protein MFETADPKHLYYVANRVRLFPEDEKVLIKLYQPLVGAVAVALYQTLIQNYDPYGIISDSKGIYSLQEQLDCSLKQLFNSLHKLEAVGLVQTFLSDNVFNNVLVFKLLQVPAADKFFATPLLASLLKEKVGVPTFHDLSHAFAQDAKLKEKPIKNAKDVSANFFDVFRLPGDEAITPSSDVVQAAQENKVHEVETAKVNDHDSIDWDFIKQEYSRYQIPASEIDLNKEQIRGLIQTYGLSEKEFVDESLPCLHGSYSLNMRDISNTLAENYKRTNTRENVQSQLNEGRKKALAAIKDMDDNDKKLLKAANESSPAEFLYKLKTQKGGITSANEKQIINNLHTQYGLPEDLINILTYTCLTYDTVVSSNLAYKIANDWLQHGVATAVQALQYVKKRRNSFGKKRPVRTYQKRVEKGTDWSKKKADRDAGISTEQLKNLFKDLNNK >CP019581.1|AZK92350.1|2072890_2073799_+|Primosomal-protein-DnaI MEPIDKVIKKIVKERNLGDEQSLISQALHDPDVQAFLTANANKIDQKMVQNSMSNLYEYYSQKHTANKVMAGYAPQLFLNGKVIDIRYVPTKAKLAQDRKQAAERRLQLIDVPTRLHDVSLSEIDVNDDRKQVLTLIYDFLRKYKQDPHVQGLYLSGDYGVGKTYILAGLANYVVTNMNKNVVFLHVPTFIAGLASHFDDNSLQSEIRRLSECDLLILDDIGAESLSQWSRDDVLGVILQARMDNVLPTFFSSNLDMEALESHFEETRNATDPVKARRLMQRVRFLAKEVVVSGPDRRNSLH >CP019581.1|AZK92351.1|2074089_2076024_+|Threonine--tRNA-ligase 
MSFSITLPDGSKKDFEESLTIADLAHNIATSLGKAAVAGKVNGELKPLDYKLDSDSEVAIITNKDEEGLDVLRATAAFVFEAVAKREYPELRLGEHVADEGGFYVDTDKDDQIKVGELPKLEKAMQKVIKNGEKIEHVQIAKSELEDLYKNDKFKSEVLAKVEGDTVDAYKLGDFVDFGFDALLPNTGKIKQFKLLSVAGAYWLGKSSNPMLQRIFGTAFFKEADLKADLKRRQEIKERDHRTIGRDLDLFFVDPKVGAGLPYWMPKGATIRRVVERYIIDREVADGYQHVYTPVLMNLDAYKTSGHWAHYRDDMFPPMDMGDGEMLELRPMNCPSHIQIFKHHIRSYRDLPLRVAELGMMHRYEKSGALSGLQRVREMTLNDGHTFVALDQVQTEFAKILKLIMDVYKDFDITDYYFRLSYRDPKNTDKYFANDEMWEKSQKMLKGAMDDLGLDYVEAEGEAAFYGPKLDIQTKTALGNDETMSTIQLDFMLPDRFGLTYVGKDGEEHRPVMIHRGIVGTMERFIAYLTEIYKGAFPTWLAPVQAEIIPVNNEAHGEYAEKVRAELAKRGFRVEVDDRNEKMGYKIRESQTQKVPYTLVLGDEEMKSGKVNVRCYGTDEEISKSLDDFINEIDADVKSYSREN >CP019581.1|AZK92352.1|2076115_2076487_+|hypothetical-protein MKLTIKKLPYQLTVCQLSDIKNLNLKNDFYFFAKTDEELSLVCENKNAPSKTINREDGWRAFKIEGQLDFSLIGILAKIAQLLANNGISIFAVSTFNTDYILVKDNNFDSAIKILSENNYEIK >CP019581.1|AZK92353.1|2076493_2077744_+|hypothetical-protein MSRKNYLSPQFLTKISEDGTLNIESLIILTCAIFIASIGLNVNSTATIIGAMLISPLMGPLLAIGTGLALYNTNILRKGAISLLAEIVISLVASTIYFHFSPLTYASQEIIARTSPTIWDVMIAFFGGSAGIIGARKKGANNIVPGVAIATALMPPLCTVGYSIAAGNLKYFLGSGYLFLINCVFITLTAFLGVKIMKWLSHSAGQPGLSFFRKPTLKETGIVLVVIILIIPNVLSAGHMVNKTLVDQNVQNLVAHELGDVDLIKENVDSQEKTINLTVSGKKINAKKIQAAKANLAEYNLKGYSLNIVQVAQVNPNAENQLDRQVNNILNQRQCEQEQANEERQQEQEKHNQEIEKLSPAISSVTAVSDNKNKQITLIELKKNISAKKKKALVKQIKEKYPNINLVEFVQESEKE |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP019581_3 | 3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064292-2064325 | 34 | NC_013940 | Deferribacter desulfuricans SSM1 megaplasmid pDF308, complete sequence | 185122-185155 | 7 | 0.794 |
CP019581_3 | 3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064292-2064325 | 34 | KY984068 | Erwinia phage vB_EamM_Y3, complete genome | 54331-54364 | 7 | 0.794 |
CP019581_3 | 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064492-2064526 | 35 | MH791414 | UNVERIFIED: Aeromonas phage Aswh_1, complete genome | 66400-66434 | 7 | 0.8 |
CP019581_3 | 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064492-2064526 | 35 | NZ_CP014003 | Synechococcus sp. PCC 73109 plasmid unnamed5, complete sequence | 162722-162756 | 8 | 0.771 |
CP019581_3 | 3.13|2064893|34|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064893-2064926 | 34 | AP014404 | Uncultured Mediterranean phage uvMED isolate uvMED-CGF-C28-MedDCM-OCT-S28-C155, *** SEQUENCING IN PROGRESS ***, 3 ordered pieces | 1499-1532 | 8 | 0.765 |
CP019581_3 | 3.6|2064425|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064425-2064459 | 35 | CP022016 | Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence | 347087-347121 | 9 | 0.743 |
CP019581_3 | 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064492-2064526 | 35 | KC330683 | Bacillus phage Finn, complete genome | 48058-48092 | 9 | 0.743 |
CP019581_3 | 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064492-2064526 | 35 | NC_022765 | Bacillus phage Riggi, complete genome | 47777-47811 | 9 | 0.743 |
CP019581_3 | 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064492-2064526 | 35 | MT422786 | Bacillus phage Novomoskovsk, complete genome | 47051-47085 | 9 | 0.743 |
CP019581_3 | 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064559-2064593 | 35 | NZ_CP023678 | Zymomonas mobilis subsp. mobilis strain ZM4 substr. 2032 plasmid pZM32, complete sequence | 8876-8910 | 11 | 0.686 |
CP019581_3 | 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064559-2064593 | 35 | CP036466 | Zymomonas mobilis strain ER79ag plasmid pER79ag32, complete sequence | 8876-8910 | 11 | 0.686 |
CP019581_3 | 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064559-2064593 | 35 | CP036462 | Zymomonas mobilis strain ZM4* plasmid pZM32o, complete sequence | 8878-8912 | 11 | 0.686 |
CP019581_3 | 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT | 2064559-2064593 | 35 | CP036458 | Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 strain ZM4 plasmid pER79ap32, complete sequence | 8876-8910 | 11 | 0.686 |
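The Spacer_Info field packs several values into one pipe-delimited string, and the Identity column appears to follow from the Spacer_length and Mismatch columns. The following is a minimal Python sketch, not the database's own code. It assumes the field order is array.spacer index, start coordinate, length, contig ID, and detection tools (inferred from the rows above), and that identity is (length - mismatches) / length rounded to three decimals. The names `SpacerInfo`, `parse_spacer_info`, and `identity` are hypothetical.

```python
from typing import NamedTuple


class SpacerInfo(NamedTuple):
    """Fields of one Spacer_Info string (field order is an assumption)."""
    array_index: int
    spacer_index: int
    start: int
    length: int
    contig: str
    tools: tuple

    @property
    def region(self) -> str:
        # Inclusive end coordinate, matching the Spacer_region column.
        return f"{self.start}-{self.start + self.length - 1}"


def parse_spacer_info(field: str) -> SpacerInfo:
    """Split e.g. '3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT'."""
    ident, start, length, contig, tools = field.split("|")
    array_index, spacer_index = ident.split(".")
    return SpacerInfo(
        int(array_index),
        int(spacer_index),
        int(start),
        int(length),
        contig,
        tuple(tools.split(",")),
    )


def identity(length: int, mismatch: int) -> float:
    """Assumed formula: fraction of matching positions, three decimals."""
    return round((length - mismatch) / length, 3)


info = parse_spacer_info("3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT")
print(info.region)               # 2064292-2064325
print(identity(info.length, 7))  # 0.794
```

Under these assumptions the sketch reproduces the Spacer_region and Identity values of the first row; the same formula also gives 0.8 for 7 mismatches over 35 bases and 0.686 for 11 over 35, in line with the other rows.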
1. spacer 3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT matches NC_013940 (Deferribacter desulfuricans SSM1 megaplasmid pDF308, complete sequence), position: 185122-185155, mismatch: 7, identity: 0.794
atcgttatgcaggtattactactgctactaatac CRISPR spacer taacttatccagatattactactgctactaataa Protospacer **** ***.********************
2. spacer 3.4|2064292|34|CP019581|PILER-CR,CRISPRCasFinder,CRT matches KY984068 (Erwinia phage vB_EamM_Y3, complete genome), position: 54331-54364, mismatch: 7, identity: 0.794
atcgttatgcaggtattactactgct--actaatac CRISPR spacer atcgttatgcaggtgttcctactgatgaactagc-- Protospacer **************.** ****** * ****..
3. spacer 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches MH791414 (UNVERIFIED: Aeromonas phage Aswh_1, complete genome), position: 66400-66434, mismatch: 7, identity: 0.8
gaaaggaaatatatatgtcagtcaaa--gttaatggt CRISPR spacer aaaaggaaaaatatatgtcaatcaaaagggttatg-- Protospacer .******** **********.***** * * ***
4. spacer 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP014003 (Synechococcus sp. PCC 73109 plasmid unnamed5, complete sequence), position: 162722-162756, mismatch: 8, identity: 0.771
gaaaggaaatatatatgtcagtcaaagttaatggt CRISPR spacer gaaacgaaatttatatgtcagtcaaggggaaaagc Protospacer **** ***** **************.* ** .*.
5. spacer 3.13|2064893|34|CP019581|PILER-CR,CRISPRCasFinder,CRT matches AP014404 (Uncultured Mediterranean phage uvMED isolate uvMED-CGF-C28-MedDCM-OCT-S28-C155, *** SEQUENCING IN PROGRESS ***, 3 ordered pieces), position: 1499-1532, mismatch: 8, identity: 0.765
aagtgaatcgacaggtattgcagatgatgattta CRISPR spacer aactgctgaaacatctattgcagatgatgattta Protospacer ** ** .*** *******************
6. spacer 3.6|2064425|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches CP022016 (Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence), position: 347087-347121, mismatch: 9, identity: 0.743
aattcattactggcattactccagtcactgcgcca CRISPR spacer cgcgcaatactggcaatactccagtcactatggca Protospacer .. ** ******** *************..* **
7. spacer 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches KC330683 (Bacillus phage Finn, complete genome), position: 48058-48092, mismatch: 9, identity: 0.743
gaaaggaaatatatatgtcagtcaaagttaatggt----- CRISPR spacer aaaaggaaaaatatatgtcactcaaaa-----ggtaccca Protospacer .******** ********** *****. ***
8. spacer 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches NC_022765 (Bacillus phage Riggi, complete genome), position: 47777-47811, mismatch: 9, identity: 0.743
gaaaggaaatatatatgtcagtcaaagttaatggt----- CRISPR spacer aaaaggaaaaatatatgtcactcaaaa-----ggtaccca Protospacer .******** ********** *****. ***
9. spacer 3.7|2064492|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches MT422786 (Bacillus phage Novomoskovsk, complete genome), position: 47051-47085, mismatch: 9, identity: 0.743
gaaaggaaatatatatgtcagtcaaagttaatggt----- CRISPR spacer aaaaggaaaaatatatgtcactcaaaa-----ggtaccca Protospacer .******** ********** *****. ***
10. spacer 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP023678 (Zymomonas mobilis subsp. mobilis strain ZM4 substr. 2032 plasmid pZM32, complete sequence), position: 8876-8910, mismatch: 11, identity: 0.686
gattttgttagaaaattaaccgatgattttttaca CRISPR spacer ccccatattagaaaagtaatcgatgattttttcgc Protospacer .. *.******** ***.************
11. spacer 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches CP036466 (Zymomonas mobilis strain ER79ag plasmid pER79ag32, complete sequence), position: 8876-8910, mismatch: 11, identity: 0.686
gattttgttagaaaattaaccgatgattttttaca CRISPR spacer ccccatattagaaaagtaatcgatgattttttcgc Protospacer .. *.******** ***.************
12. spacer 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches CP036462 (Zymomonas mobilis strain ZM4* plasmid pZM32o, complete sequence), position: 8878-8912, mismatch: 11, identity: 0.686
gattttgttagaaaattaaccgatgattttttaca CRISPR spacer ccccatattagaaaagtaatcgatgattttttcgc Protospacer .. *.******** ***.************
13. spacer 3.8|2064559|35|CP019581|PILER-CR,CRISPRCasFinder,CRT matches CP036458 (Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 strain ZM4 plasmid pER79ap32, complete sequence), position: 8876-8910, mismatch: 11, identity: 0.686
gattttgttagaaaattaaccgatgattttttaca CRISPR spacer ccccatattagaaaagtaatcgatgattttttcgc Protospacer .. *.******** ***.************
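For the alignments above that contain no gap characters (hits 1 and 4, for example), the reported mismatch value can be reproduced by a position-by-position comparison of spacer and protospacer. The following is a minimal Python sketch; `count_mismatches` is a hypothetical helper, and it deliberately rejects gapped alignments because how gaps are scored is not stated on this page.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count differing positions in an ungapped, equal-length alignment."""
    if "-" in spacer or "-" in protospacer:
        # Gap scoring is not documented here, so this sketch refuses to guess.
        raise ValueError("gapped alignment: scoring not defined in this sketch")
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length")
    # Compare case-insensitively, one position at a time.
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


# Hit 1: spacer 3.4 against NC_013940.
hit1 = count_mismatches(
    "atcgttatgcaggtattactactgctactaatac",
    "taacttatccagatattactactgctactaataa",
)
print(hit1)  # 7
```

The result agrees with the mismatch value listed for hit 1. Gapped hits such as 2, 3, and 7 to 9 would need the scoring rule used by the original search, which this sketch does not assume.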
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1183080 : 1222176
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP019581|1183080:1222176|DBSCAN-SWA CTTAACTGGCAATTCTTTGATCAAGAATTGCTTTTACTTTTTTATATGGCTTATAAAGCGTAATTTCTTTATCTGTGGCTCTTGCTAGAATATTCTTTGATGCGGCGATATTACAGTCCATTTCTCCACAATTAGGACATAAGGTCTTTTCATTGTGCTTGCCAAATCTTACGATAAAGTGCCGTCCGCAATTTGGACAATATTGGGAAGTGTAGGCTGGGTTAACGTCTTTAAAGTCAATGCCATATTGCTGGCAATAATACTCTAAACGTTCATTTAATATTCCTTTAGTCCATGATGAAAGATTGCGTCTAACTTGGCGTTCATATTTACTGCCTTTCTTAGCTATTTTTTCCTTAGTAAAGGTCAAATCTTCTTTAACAATCAAACTTGGTGTTTCCAGCTTAATCATTGTTTTAATTGCATGGTTAATTTTACTTTCAAGACAAGCATGATAAGATGCGTGGCGCTTATCTTTGCGCTTATGACCAATATTATTAGCGTTCAACTTGTTTAATTGCCTAGTTAATTGCTTTTTCTTGATAATTTGTTTAGGTATATTGGCGTTTTTTAGCTGATTAAGCTTTTTCTTTAATTGATAGCGATAACCGTAATAAGGATTGCGTCTAGCTAAGTATTGGCTTTCCTGTTCAATTTCAGGTCTAGTAAACTGGCTAAAACCTTTGCCATATTCACGACCACTACTGCAAGAAACCAGCGTGGCAAGCCCTTTATCAATACCGAGTTTAGTGCCTGCTTGATATTCTTTCTTTTGTCTAACTTTAATTACCTTATGAATTTCAAGCCGCTTTTTATCGCGATCTAAGATAATTTGTAAATTGCCATTTAAACGATAATGCCATGGACTTGTCAGCTTTAAACTAAATCTCTTTCTTGATACAGGCGATGAAATGGCAAGCTCTTCTAGCTGATTGCCGGTAAACTTGTACATGCTTTCATCATAAGTCATAGATTTATTTAAGCTATGATGCTTTCGAGGAGTATATTTATAGCGTCTAGTTAATCTGCTTAAATATGAATAAGCATGTTTTCTTTGTTTAGGCGTAACAGCTTCTTCTAATTCTGCTAGGTGCTTTTGCAATCCGTGAGACAAATCTAAAAGTACTGCTTGATTGTTGGTTAAGGCATAATACCACAACTCTCTAATGGAAAGCAGATAGTAGATCAAGTGTCTTTCATCTTTATCTATTGCCTCATTTGCTTGCACGACTCTACGAATTTGGTTAGCTAAATTACTCCACATCGAATTGATGTTGGCACAAGCATCGCTTAATGCGTAACACCAATGTTTATCTAAGAAATTGTATTTGGCTACATACTTTTTGTTTTGCTTAGAGCCTTTAATTTGATCACGTAATTTGCGATACTGCTTAACATCAAGCATATGACTGATGCCACAATATTGATTTAGAAACATATTGCGACATTTGCCATAATTTATAAAAAGCTGATCTAATTTAGTATAAGTATCTTCATCAAGCTTAAGCGAGTATGCTCTAACGGTCTTTGAATAGCTCTTGATTGGTTTTGATGATCTTTTCATTATGATGGCTTCTCTTTCCATAAAGCTTAGCAGAATAAACAGTAATAACTGAAAGCACATCGTCTACTAGCTCTTCATTAGAGCTTTCGGCTTTGGTTTGATTAATAACTGTTATTTCAACATCGTTTTCTTGGCATATGTCCTCAATTAATTCAAAACCAAATCTAACTAAACGATCCTGATAATTGACTACTACTTGACTGCATTGCTGGGTGCAGATCAGATGAATCAGCTCTTTTAAGCCTTTTTTGTGATAATGAAGCCCTGAACCAATATCAGAAATAATTTTAAACGGCTTGCCTTGCATTTCACAGAAATTAGTAACTACTGCTATTTGACGCTTTAAATCGTTTTTTTGTCCG
GCAGAAGAGACACGACAATAGCCAATTATCAATTTATCTTTGGATACTGGCTTGGCCGGCTTTTTCCCTGCTAGTGCATCATCCAGCATTTGCGGGGTATATCTTCTTTGATTTCCAGCAGTTCTGATTGGCTTAATAATTCCTTTTCTTTCCCACAGCCTTAGAGTAGGAATTGCGATATTTAAATATTTAGAAGCTTGTCCGATAGATAAATACTTGGGTAATTGTTTATTGGATGCCATAAAAAAACTCCTCAACTTTCGTTAAGGATATTATATCAGTTATTGGCATCAATTAACAGATAATATCACTTTTTAGTATTTATTTTGTTAACTGCTGATAACCCTCTTTCGATCAAAGTTAATCGCATTGGAATGAAACGATACCAAAAACGGTACATGCCATCTTTGATTTTATAAACGGACTTTTTGCTTTTATTAGGATCCTCTGTAATTGGGACAACTCTTTCAACAATCTCTAATTCAATCAAATTATTTAAAAAGCTTCTTAAAGAAGTAGCCTTAATCTTTGTCGCTGTCGCAATTTCATTAAGACGAGAATGATCTTGAGCCACTGCAGCGATGACCGAATTATAATTTGCCGGATCTCTCAATTCCTGTTTGATCAAATTATTTGGTTCCTCAAAAAGACGGCCATCTTTTTGTAAAAAATTATCTACAATAGCCTCTTTAATGCTTTTGCTTTTGAAAAATAGCTCAAATATTGCGGAATACCACCTGAAATAGCATATAGTGCAAAAGCCGATTTCTTATCTATGCTGGGTAACATTTCTTGTGCTTCATGAAAATTAAATGGTTCTATCTTCAATTGCATTGTTCTTCTGCCATATAACGGACTTTGATAACCCAATACTTGCTTTTCCATGAACTACACCGACACTGAAGTGCCGGGTTTCTGGGAACACTGAGTAGTGTTTAGGATCTTACATTGAACTAGTCAAATACCTAACTCACGGAACTCAGCTTGTTACCATGGTCAGTCCCTGACCATTTAGCTTTGTACTTCCTTTATGCAGGATATTAACTGCCGCATTAATATCTCGGTCATGCTTGGTTTGACACTTAGAACAGGTCCATTCACGAATTTCTAATGGCTTAGCGCCACTATTATAGCCACATTTTGAACAAATTCTGGAAGTATTCTTAGGATCAACTGCAATTAGCTTTTTTCCATACCATTCGCATTTGTATTCTAGCATTTGTCTGAATATCCGCCATGAAGCATTGGCAATTGACTTAGCTAAGTGATGATTTTTCTGAAGATTCTTGGTTTTCAAATCTTCAATGATAATTACATCATATTGTTTAACTAAATGCGTAGTTAATTTATGCAGGTAATCTCTGCGCTGATTAGCTACTTTAGCTTGGCACTTAGCTTTTGACTTTTGCGCTTTTTGCCAATTAGTAAAGCTTTCTAAGCTTCTAGGACAAAGCACCTTTTTATTCCTGTCTTGCAAAACTAATAATTTAGCTAAATGTCTGCGACGAGCATATTTTCTTTGCCAGACTTTGGCTTTCTTTTCAAAATAAGAACTGTCAAAACTAGGATATTTTAGCCCATTAGACAGAATTGCTAAATCAGCTACGCCAACATCAATCCCAACCTGTTTGCCAGTTAAGCTGTACTTCTCTGGTTCAGATATTTCTACCTGTAGAGACAAATAATATTTTCCTGTTGGTTCAAGCAGAACAGTGTAGCGCTTAATCTTAACATCTTGCAAAACTCCAACATAGCCTAATTTAGGAATCTTCAAGTACCTTTTACCTGCAGTTTTTATGATTGACTTGCCCGTATAAGACTTTTTCAGATACTTGCGTGAATGAAAACGAGGCTTGCCTACTTGGCCAGTTTTATCTTGAAAGAAGTTCTTCCAAGACTGAGTTAGAAATTCATTAACTACCTGCAAGCTTGAAGAATCGCTGGTTTTCAAAAAAGGATATTCTTTTTTAAGCGGCTTAAGCAGATAATTCAGCTTGAAC
TTGCCTAAAAAAGGCAAAGCTTTATTATTCTGATAGCGCTCATTCATCATAGCCAGCATTTGATTCCAAACAAAGCGATCATTGCCAAACATTTGCTCAAGCTGGTTTTGTTGAATTTTATTAGGATAGAGTCTTAATTTGATCCCTTTTAGCACATTTGTTCCCTCCTTTCTTTTTTAAGATTATATCAAACATATGTTTAAGAAAGAAGGGGGAAACAAGCAAAAAAACTAAAAAAGGCCGGATTCATCACGGGATTGAAATCCCGTGATTTCTTCGGCAATTAATCAATAAATGACATAGACGAACCACACAGAACTAGAAACAATTTGCTATGTAGAAAATTACGATCAATGTATTTTTGCAATAATGAGCTAAAACCAGGATAAGACTCTGCTAAATAAGGATATTCATCAATTACAAAAACGACTTGCTGCTTTTGCGCAATTTTGCCAATTTCATCTAGTAAATCGTCAAAACTAGCAAAGCTAGCACTGCCTGAAAAATCAGGATTTTCGAATTTAAGCACTACCTCAATAATCTTCGTAAGCATAATTTTTAATATTTTTAGGCACAAAAGAGCTAGGTGAAATTCACCTAGCTCTTTTTGCTAATAATTTTTAGAAAGACGTAATTTTAAATGTGATTAGTCGATAACACGCTTAGCTGATGATGGGAAGAATGATGCTAAGGTTTGTGTCTTAACATTTTGACCAGGAGCTGGAGCGTGAACGAACTTGCCACTGCCAATGTAGATACCTACGTGGTAGTTACCCCAGAATAACAAGTCACCCTTCTTAAGCTTCTTAGTTGAAACTGAAACAGTCTTACCAAGAGTGATTTGACCGTAAGTAGTTCTTGGCAAAGTCTTGTTAGCAGCGTTCTTGTAAACGTATGAAATTAAACCTGAGCAGTCAAATGCTGAAGGACCTGAAGCACCGTAAACGTAAGGTTTACCAATTTCCTTCTTAGCAAGCTTAACGATTGCGTTACGCTTCTTAGTAACAGCTGAAGTCTTCTTAACAGTAGTTGCTGCTGGAGTTGAAGTTTCGATGTCTGACATTGAGTTGTCAGTATTTACAGTGCTAGTTGAAACTTCGTCAGCGTGAACAGTAGCTGGAACGTTAATAGCTGAGATACCTGTAAAGAAAATTGATAAAGCTGCAGTGTATTTAAATAAAGTACGTTTAAAATTCAAAATTAATAGTCTCCCTTAAATTATAGATTTCTTTTATTTAATATTTGTTTCGTTTTTGACTGATTTCTTAAATCAGCTGACGGTTAATATAATACCTTCTGAATGTTACATAATTGTTTCAAAAGAACTACCATCACGTTAAGAAACATTGCATATTTAAAAAAAATCTTGTAAATTTAAGGGCTTTCAAATCACTCTTCAAGCCTTGATAAGATTGAATCCAGCTTACTCTTAGTTTCTGATAGATTATCATTAACTAATATGTGAGCGTATTCTTTCAAGTCTTCCGGCAATAAGTTTAATTCACTGCCGCTGAGTCGCTCTTTGATTTTTTCGGGATCGTCTCCCCGCTTTAAAAGGCGCTCTTTTAGCTCTTCCTTCGTCGAAGTTGTCACATAAAGGAAATAAACTTTATTACCTAATTGTTTTAGATAAGTGTAAACACCTTTAATATCGACAATTAAAGAAACGAGATCACTTTTTTTCCAAGCGAGGTTTAAGGCTTCACGACTTGAACCATATTGATAAGACCCATATCTAATATGTTCAAAAAAGTGCAGTGTCTTGAAGCTTTCATCCGTTTCAAAATGATAGGAGAAATTCTGCTTTTCCCCTGGTCTCATTGGACGCGTCGTATGTGTCAAGACACGCGGTATACCATATTTTTCATTTAAATAATCAGAAATCGTTGTCTTTCCGGCACCGCTCGGGCCCGCGATTAAAATTATTTTCTTCAAAAGAAAGTCCCCCACTTGAAAATGTTGCTCATCTATATTAACATTCCAATGTTACAAAAGACCAA
GATTTTTTATCAAGAGAGGAGATAACTCTATGAAAAAAGCAGATGTAAAAGTTGGCGCAATCGTTGGTGCCAAGTCAGAAGAAGAACTTAAGAAGCCATTCCTAGGTAAGGTAGAAAAAATTTATGAAAATTCTGCTCTCTTAGCCATCACTTCATACGATCCAGTTGATGCCAGTGCAATCAGTGACTTGAACAACAAGATCGTAGTCAACTTCAAGAACTTAAATGCTGCTCGTGCCGCTAAAAATATTAAGACTGCTTCAACTAACGAAGTTAAAGTTGAAAAGGTTGCTAAAACTAAAAAGGCCGACAAGGACAGCAAAAAGTAAAACAAATTCAAAAAGATCCTGGGTAATATTTACGCAGGATCTTTTTTTGCCTAAAATTAATACAATATCGATTTAATTTAGGGAGATTTTTTATGAAAGAGAAAACTCGAACAGTTTTATTTTGGTTCATCCTTATTCAGCCATTTTTGGACCTTTATTGGTTTTATCACGGCAAATTGGCTGACGTTTTGCCATTTACTTTGCCTACGATCATCAGAATTTTGGCTGTCTTTGTCATCTTTTGCATGTTTTTCAGCCAAAAACAGAATTGGCAAAAATTGGGTAAAAACAAGTGGCTGCTATTTTACCTAGCTTTATTAATTATTTACTCAGCATTGCACTTGTTACACGTGAAATATTTCAATAGTGTGAATCCAAATGATTACAACTATTCTACTGTCAGTGAGATCTTCTACCTCATTAGAATGATTTTGCCGCTGCTGGTAATTTTCTTTACTACCGAATTGGATTTCACCCGTGACCAATTCTGGCATGTAATTGAAGGAATCAGCGGACTATTTTCATTCACTATTGTAATCAGTAATCTCTTCGTGATTTCACTTAGAAGTTACGAAACGGGGCCAATCAGTGCCAACATTTTTGAATGGTTCTTCAATCCTAATATCGGCTATTCTCACATGGCATCCAAGGGATTCTTCAACTTCGCTAATATGGTTTCGGCCGTCCTATTCATGTTAGTACCGCTAATGCTCTACTTCATGTTTAGCCATTTTAATTGGAAAACCGTTACTTTGAATATTGTGCAAGCTCTTGCCATGATTGAGCTTGGAACCAAAGTTGCCCTGATCGGTTTAATCGGCGGCATTATCATCGGCATTTTGCTTTACGTTTTTCACTTGTTCATCGTCAAAGATGTCACCAAGAATGTCAAGGCAATCTTAGTCGCATTGCTAATCGAAATCGGCGCGATGGCGATCATCCCCTTTGGCCCAGCAATCCAGCGTTACAATTACGAAAAGTTTTTAGCACAACAATCAGATAATAGTTTAACTGTCGCTAAAAAAGAATTGGCCGCTGGTCTCAAAAAATACCCAAGTGGCAAGAAGCGCAAGCAATTTTTAACTGAATTTATCGAAGAGCATTATCAAGATTATGCCCTAAACAAAAAGTTCGTCACTAAGAGCTACCCTTATAAATATGATCCGAAGTTTTGGCTTAAGATCATGAATGAGTCGGGAACTGCGCGAATGCAAAACCGTCACGTTGAAAAGGCTATGCTAGATCAAGTGGTTAAAACTAACAATAATAAGTTAGACAAGTTCTTAGGTATCTCATACACACGTGAAACCAACATTTTTAACCTAGAACGTGACTTCACCAGCCAAATTTATTCATTAGGCTGGATCGGGATGCTGCTGTTTGTTGGACCATATGTAGCAATCATGCTGTACGCTTTTGTTAAATGGCTAATGAATAAGAAGAAGCACACCTACCTGATCAGTTCGATGCTGCTTTCAATTGCCTTTATGCTGTTTGCAGCATTTTCATCAGGCAATGTCATGGACTTCCTAACTGCTAGCTTCATCTTGGCCTTCGTTGAAGGTGGATTATTAGTTGAAATTAAAGATGAACTACACCGGCACTAAAGTGCCGGGTTTCTGGGAACACTGAGTAGTGTTTAGGATCTTACATTGAACTAGTCAAATACCT
AACTCACAGAACTCAGCTTGTTACCATGGCCAGTCCCTGACCACTTAGCTTTTTACTTTCTTTATGCAAGATATTAACTGCCGCATTAATATCTCGGTCATGCTTGGTTTGACACTTAGAACAAATCCATTCACGAATCTCTAATGGCTTAGCGCCACTATTATAGCCACATTTTGAACAAATTCTGGAAGTGTTCTTGGGATCAACTGCAATTAGTTTCTTGCCATACCATTCGCATTTGTATTCTAGCATTTGTCTAAACATTCGCCATGAAGCATTGGCAATCGATTTAGCTAAGTGATGATTTTTCTGAAGATTCTTGGTTTTCAAATCTTCAATCGCAATTACATCATATTGCTTAACTAAATGCGTAGTTAGTTTATGCAGATAATCTCTGCGCTGATTAGCTGCTTTAGCTTGGTATTTGGCTTTTGACTTTTGCGCTTTTTGCCAATTAGTGAAGCTTTCTAAGCTTCTAGGACAAAGCACCTTTTTATTCCTGTCTTGCAAAACTAATAATTTAGCTAAATGTCTACGTCGAGAATACTTTTTCTGCCAAATTTTAGCTTTCTTTTCAAAATAAGAGCTATTAAAACTAGGATATTTTAGCCCATTAGACAGAATCGCTAAATCAGCCACGCCAACATCAATCCCAACTCGTTTGCCAGTTAAGCTGTACTTCTCTGGTTCAGATATTTCTACCTGTAGAGACAAATAATATTTTCCTGTTGGTTCAAGCAGAACAGTGTAGCGCTTAATCTTAGTGTTTTGCAAAACTCCATTCTTGCTGGTTTTGACATAGCCCAATTTGGTAATCTTCAAGTACCTTTTACCTGCAGTTTTTATGATTGACTTGCCTGTATAAGACTTTTTCAGATACTTGCGTGAATGAAAACGAGGCTTGCCAATTTGACCAGTTTTATCTTGAAAGAAGTTCTTCCAAGATTGAGTTAGGAATTCATTAACTACCTGCAAGCTTGAAGAATCGCTGTTTTTCAAAAAAGGATATTCTTTTTTAAGCGGCTTAAGCAGATAATTCAGCTTGAACTTGCCTAAAAAAGGCAAATCTTTATTATTCTGATAGCGCTCATTCATCATGGCCAGCATTTGATTCCAAACAAAACGATCATTGCCAAACATCTGCTCAAGCTGGTTTTGTTGTGTTCTATTAGGATAGAGTCTTAATTTGATCCCTTTTAGCACATTTGTTCCCTCCTTTCTTTTTAAGATTATATCAAACATACGTTTAAGAAAGAAGGGAGAATAAGCAAAAAAACAAAAGAAAAAAGCCGGATTCATCACGGGATTAAAATCCCGTGTTTTCTCTGTCAATTAATCAATAAAAAAATCCTCATTCGTCTTTGAATGGGGATTTTCTTTGCTTATTTTCCTTGCTTGATAAAACTTTGATCTGGCACCCGGATCAAGTAGTAACGCGTTGCCAAATGCCCCGCGTGCATTCCAATACCGTTTTGCCAGTTCTTCTTATATGTGGCCGTATAAATGGCTACATAAGCACGTTGATATCTCTTATCCATCTTCTTGAGCGTCTTTTTAACATATGGAATCTTTTCCGCATTAACATCTAATGCCGTATCACCATATGCAATCGATGCATAATCGCTAGACGCTGTAATTTTGCTTCCTGCCTTATAAAATGTATATGATTGAGAACCGTATCTCTTCTTAGCTGAATTAATCGTGACATAGCTTGAATTGCCAACCCCTGTCTTCATTATCAGCGGTTCATATTTAACGCTTGAAGTAATTCGACTCGCATCTTCCAAATCTGGATTATCCATAAAGGTCTGATTAAACATCCAGCCAGCTGCCACTAACAAAACGACAACTTCAACTACAGTCAAGATAAAGTTAGGCCAACTAAAGTGCTTATGCTGTTTGATAATCATTTTAATTCGTCTGCTACGAATATTTTGGATAATAAATACTAAATAAAGTATGATGGCAACCCAGATTAAGATACCAATAATATTCCAGCT
AAACATGTCTCGTCCTCCTTGCTCTACATTATATTAGTAGTTTTGCTTTTCTTGGTTTCTTTCTTTTTAGCCGTTGCCTTTTCTGGAACCTGGTATTCGCGCACAACTTTCAATTTTTCAATAATTGCCTCTACTAGTTGATCTACAACCTTATCAGTATCTTTTTCAACTTCTTCTTCAGTTGCCAATTGATCAATTCTAAAGTGCGAAACATTGACATAATCAGACCGCGTTTCAAAATAAACATTAAGCGGATATTCTTGTCCTTCCTCAAAAAAGTTGGCAATCTTCTTATAATTGCTGTAAGGCAAGTTGATAATGTTAGTCTCTAAAGCAAAGGTAACTGGTCGCTGGCCTGTTACCGCTAATTGAGTTTGCACTAATGAATCGTGCTTAAACGCAGCAGCAACCTCCTTAACCATCTTCTCTGCATAGGGCTTAATTTTGCTCTTAGACTTGCTGATATCGCCATATTTTACTGTTTCAAGATCTTGAATGTAATCTTGATTTTCAATTTTATCATTTAAATCTTCTAAATTAATCTTCAAAATAAACTTTCCTTTTTGTTAATTCTACTCACTAACATACTACAAAAAATTTTTTTGTCATCCATTTTTAGAATTCATAATAAGATTTTGCTAAAATAGTTTCTATCAACTACCCCGAAATAAATTCCGAGGTTTGTAGTTCTCATTCTTTTCAGAATTGTGGTAATAGCGTCTTGCGACTCCCACCTAAAGAGACTACATCATCGGGCAGTTGACGATGCCCGTTTATCCATCAAGTACTACTCGACAGATGCTTGTTTGTTAAATTGAGGTGCTTCTTCACCACTGCGGTAAAGCGTGCCTAAAAATTGGATATTCATGGCTGCTAAACGATCATCATTGGACTTGTAGCCACATTTATCGCAAGTATAGAGATGCAGTTCATAATCTCGGTTATCTTTGTGGATACGTCCACATTTAGGACAGCGCTGACTGGTATACTTAGCTGGCACTTTTACTACTGCAGAAGAGTTCAAATTAGCTTTATAAGTCAAGAATTGCTCAAATTGGTAAAAAGCCCATGACAACATTTCATAGCGCTGATCCTTTGGCGACTTTTCCGTGGCAAAACGCACGTCAGTTAAATCTTCCAAGGCATAGATCGTGTTAGAGCCATAATGGTCAATGAGTGTCTTAGTTAAACGATGATTGACATCGCTCATCCAACGGTTCTCTCGTTGACCAATCTTCTTTAATCTACGCTTAGCAGATTTAGTTCCTTTAGCTTGTAACTCCGCTCTTAGCTTCTTATATTTGCGCCTTTTGCGCAATATCTTTTGACCACTAAAAAGGATTGATTTGCCCTTTTCATCATAACAAGCGGCTAAAAACCGCAAGCCGCGATCAATCCCCACAACATGCACTGCCTGTTCAGCTTCAAACTCCGGTATAGCTGTGGTAGCGCTAAGGTGAATATACCATTTACCTCTTAATTGAAGCATTTTAAGTGAACCAAACTTCCATTTAGTCTGATCAAGATACTGGTCAAAGCCTTTGCACACAAAATCTACTTTCCTTCTGCCATTCAAGGTATTTAATGATAGCTGGTTAGTAGTAGATAGGTAAGACCAATCTCGACCACGCTGTAAATCAAGCTGAGGGCGTTTGAAATTAATCGGTTGCCAAAGCCAAGTTAAATCACGATAGATCTGTGCCCAAATACCACGACCTTTGCCATCTTTCTTGCCAGTATCGTAACGAAAAGGATGTTGAAAGAGCTGTGTCTTAACTGTCTTATAGCGAGCTATAACAGTTTTAATGGCTGACTGTGCCATTTGACTCTTAAGTTCAAACTTATCACGCAAATCATGATACAGAGCTTTGTTTAACTTGGACTGTTTTAATTCAAAAGCATGTTCAAAAATATACTGCGAGATAAAGTTGCAGGCATCACGATATTTGAACATGGTATTTGAAAAAGCTTGGGCTGTTTCATCATCAATATTGAGTAAT
TGAGCTTTAATAACAACTGATTGATCCAAATAATTTCACCTCCTTAGCTGTATTCATTAAATATTATATCATAACCTAAATAGTGAATTATAAATAGAAAGAAAAAGGGACTGAAAGCGCAATTCCTCCCCGCATTAAAATGGGGGGTATCCTTGCTTAATCAGTCCCAAAGTTTATCAATGAAAAATAAAAAAATTAATTTAAAGTACTTATTTTTAATTATCCCAATTGGATTAATTTTTGTGCTTAGTCTAACCTACTTGACCAATAAAAATCAAATTGATGATGAACTTAATTGGATGACGATGAAAGAGAAACAAAAGTTAAATGTTCCTCTAGATAATCAATTACCTGATTTGCCCAACGGCTGTGAGGTAACTAGCTTAGCTATGCTCATGAACTATTATGGCATTAAAGTATCAAAAAATGAATTAGCTCAAAACATTCAACACGTTGATTCTTTTACGGATAATGGAAAATATCGCGGTAACCCTAATCAAGGCTTCGTTGGTCATATGACCGTAGCTAATGCCGGCTGGTGCGTTTATAACGGGCCGCTTTATAACGTTGCGCGTAAATACACCAATCATATCGTCAATGCCTCTGACAGCAACTTCCTGAAGGTATTAAAACTGGTCTCTGATGGCCATCCCGTTTTAATTATTACCACTACAACCTTTAGCCGCGTCAACAACATGCAGACGTGGGAAACCAACGCTGGTAAGGTGAACGTTACCCCGTCATCGCATGCTTGCGTAATTACAGGTTACAACAAGAAAAAGAAGATTGTCTACCTCAACAATCCTTATGGTTTTAAAAATCAAGCAGTCAACTGGCACAAACTAGAACAAAGCTACGACCAACAAGGTCGGCAGGCTCTTTATATGAACTACACCGGCACTGAAGTGCCGGGTTTCTGGGAACACTGAGTAGTGTTTAGGATCTTACATTGAACTAGTCAAATACCTAACTCACAGAACTCAGCTTGTTACCATGGCCAGTCCCTGACCACTTAGCTTTGTACTTCCTTTATGCAGGATATTAACTGCCGCATTAATATCTCGGTCATGCTTGGTTTGACACTTAGAACAAGTCCATTCACGAATCTCTAATGGCTTAGCGCCACTGTTATAGCCACATTTTGAACAAATTCTGGAAGTGTTCTTGGGATCAACTGCAATTAGTTTCTTGCCATACCATTCGCATTTGTATTCTAGCATTTGTCTAAACATTCGCCATGAAGCATTGGCAATTGACTTAGCCAAATGATGATTTTTCTGAAGATTCTTGGTTTTCAAATCTTCAATCGCAATTACATCATATTGCTTAACTAAATGCGTAGTTAGTTTATGCAGATAATCTCTGCGCTGATTAGCTGCTTTAGCTTGGTATTTGGCTTTTGACTTTTGCGCTTTTTGCCAATTAGTGAAGCTTTCTAAGCTTCTAGGACAAAGCACCTTTTTATTCCTGTCTTGCAAAACTAATAATTTAGCTAAATGTCTACGTCGAGAATACTTTTTCTGCCAAATTTTAGCTTTCTTTTCAAAATAAGAGCTATTAAAACTAGGATATTTTAGCCCATTAGACAGAATCGCTAAATCAGCCACGCCAACATCAATCCCAACTCGTTTGCCAGTTAAGCTGTACTTCTCTGGTTCAGATATTTCTACCTGTAGAGACAAATAATATTTTCCTGTTGGTTCAAGCAGAACAGTGTAGCGCTTAATCTTAGTGTTTTGCAAAACTCCATTCTTGCTGGTTTTGACATAGCCCAATTTGGTAATCTTCAAGTACCTTTTACCTGCAGTTTTTATGATTGACTTGCCTGTATAAGACTTTTTCAGATACTTGCGTGAATGAAAACGAGGCTTGCCTACTTGGCCAGTTTTATCTTGAAAGAAGTTCTTCCAAGATTGAGTTAGGAATTCATTAACTACCTGCAAGCTTGAAGAATCGCTGTTTTTCAAAAAAGGATATTCTTTTTTAAGCGGCTTAAGCAG
ATAATTCAGCTTGAACTTGCCTAAAAAAGGCAAATCTTTATTATTCTGATAGCGCTCATTCATCATGGCCAGCATTTGATTCCAAACAAAACGATCATTGCCAAACATCTGCTCAAGCTGGTTTTGTTGTGTTCTATTAGGATAGAGTCTTAATTTGATCCCTTTTAGCACATTTGTTCCCTCCTTTCTTTTTAAGATTATATCAAACATATGTTTAAGAAAGAAGGGGGGAACAAGCAAAAAAACTAAAAAAGGCCGGATTCATCACGGGATTGAAATCCCGTGTTTTCTCCGGCAATTAATCAATAAATTAAGCAAAACAAAAAGGACAAGCTCATGTAAAGCTTGTCCTTTTTAACTCAAGGCACATGCATAAGCATATGTGGACATGCAAACTGTTCACATCAGTGTGTCCAACATCAGCACTTCCCGTGCTTGCCACTTAAGGCGGATGTCCTCTATCTCATAAATATACGCATCTCGCGTTAACTAATCGACTCGTCAGCTACGTATTGTACTATCTTTTACGTTTCATGTACAGCATTATCTTCTTTAAGTGCCGATATTTTATCCTATAGAATGCTTCAATGCACATTGTAGAAAATAGAAATCCTAACCCAATTGCGACCGTGACGATCGCCACCCGCAAAATGAGTTCTTGACCCAGCGTATAATTATCGGCTACAAGGGCACGAACGGCCTGATATGCTGGCATCCCCGGCACCAGCGGCACCAATGCAGGAATGTTAAAGACCGTTGCAGGACACTTTTTTCTCCTAGCAAAAAACAGTCCCAAAATTCCGATCAAAAAGGCCCCAATCAGGTTAGAAACCATTCTACCCATGCCTAAATGAAAACAAAACCAATAAGCCATCCAGCCAACGGCACCACTAATCCCCAATGAATTAAGGGCACGATGCGGAACATTAATCGTGATAGCAAAACCCACAGAAGCAATATAAGAAAAAGCTACATTTATTACTATCTCTAACCAAAAAGGCATACCTAGACCCCCATGAACTTCAAGACAATGGCGACGCCGCCACCTAAAGCTAGTGCACTTAAAACGGCCTCACACATTCTTACGATGCCAGACAGCAGATCCCCCATGAACAAATCGCGCAAAGCATTTGTCAAAGCCAGTCCTGGGACCAAAGTCATCAGCGCTCCAACCAAAATGTTGCCCACAATCATCTGTGAATCCAGCCAATTCAGTCCTATGGCTAAAAAGCCCATCACCATTGCCGCAATTAGTTCTGATAAAAAGCGGACCTTAGTATACTTTTTAAACTCGCAATATGCCCAAAAACCGAGGCCCCCAACTATGGCTGCACCTGGGAAATCAACCCAGTCATAATTGTCCATAAACAAAACCATCAAGGTTGCACTAAGGACTGCCGCACCGATGATTTGCATCCACATTGGAAAGAACGGCGCATCGGCTACTTTGGCTATCTTTTCTTTCAACTGAGGCAAATCAATTTGCTTAACTGCAAATTGACGTGACAATGCATTAACACGATCTACTAACTCCAAATCAATGTTACGATCGCGCACTTGCTTCATTTGAGAGAGATTGCCATGATTCAAGCTCATAAACACACAGGTAGGCGTGGCAAAAACACGAGGATCTTCTGCCCCTGCATTGCGAGCAATTCTTAGCATTGTATCCTCAACCCGATACATTTCACTGCCGCCTTCAATCATCAAACGTCCAGCGGTTAAACAAATATCCAATATTTCCTGATAAAATTCACCATCGTGCTTATCGGACATAGTTTTATCCTCAATTCATTGTTTCAATATCTATTGTTTTATCTACCCAATCACCACGGAAGAAGTCGCGTTCGTGACTAACAATAATTACTCCACCTGGGAAATTGACAATGGCTTTTCTTAACGCATCCTTAGTATCGTTATCTAGGTGGTTAGTCGGTTCATCAAGGAAAAGCAAATTAGCAGGTTCAAACTGCATTTTGGCTAATTTCACCTTTTCCTGT
TCCCCACCAGATAATTCTTTTAATGGACTCATGGCTTGCTGTGCAGTCAAGCCCATCCGTGCGAGCGCTTGTCGTAATTCCTTTGGCTTTTTGCGTTCAAATTCTTCCTGCAAATATTGTAGCGGCGTCATATTGTTGTTAGGCCAAGCTAAGTCCTGCTTAAAGTAAGCCAGTTTTGCCGTTACAGATAATTCATATGAGCCATAAATTGGCTTCAATTGGCCCAATAAAGTCTTAAGCAACGTGGTCTTACCAATTCCGTTAAAACCAGTGATCGCTACTTTTTCATCCCCACCAACGGAGAAATTAAAGGCGGACTTAACTAACGCATGATCATAACCAATAACCAAGTCTTGCGCTTGCAAAAGCAAGTTAGAAGCCGTAGCTACATATGGGAATTCAAACTTAGCCCGCCGGTTGTTCTTCGGCGGTGTTAAAACGTCCATTCTTGCTAATTGTTTTTCACGAGATTTAGCACTCTTAGCACGAGTACCCGCCTTATTCTTGCGGATATATGCCTCAGTCTTAGCAATCTTGCGTTGCTGGTTAGCATAAGCCTTAAGGTAGGTTTCACGGTTAGCCACTTTTTGACGTAACGCTTGCTTCAGTGTACCGGTATAACGCGTAATCGTACCAAAATCGATGTCAATAATGCAATTGGTAATACGACCTAAAAAGTCGTAGTCGTGACTAACTACGATAAAGGCGCCAGAGAAGTTGTTCAGATAATCAACCAGCCAGTCAATATGCGATACATCCAGGTAGTTAGTTGGTTCGTCTAGTACTAAAACATCAGGGCTTTGCAAAAGAAGCTTAGCTAAAATGATCTTAGAACGTTGCCCACCTGAAAGCTTAGAAACATCGTGATCATAACCTAGATCTGCTAGCCCCAATCCGGCTGCGACTTGCTCGATCTCGGTATCAATGTCATAAAAGTGATTTTCTTCCAAGTATGTTTGAATCTTGCCAGCTTTTTCTAGCAGCTTATCGTCGCTATTTTCAGCATACTTGGTATACAGCTCGTTTAATTCTTGTTCTTTCTTAAAAAGATCATCAAAAGCGGTCCGCAAAAAGCCGCGGATAGTTACACCAGGTGCAAGCTTGGCATACTGGTCTAAATAGCCAACATCAATCTTGTTTTGCCACTTAACTTGACCTTCATCTGGGATAATCTCACCCGTCAAGATCTTAATCAAGGTAGATTTACCTACCCCATTTTGACCAGTAACCCCCATGTGGTCTTCCTTATTCAACACAAAATTGGCATCTTCGTACAAAGTCTTGTCGATAAAACTTTGACTTAAGTCTTTAACAGTTAATAAACTCATTTTCCCCTCTAACTATTATTTGTAGATTCTTGTTCTTTTTCATACTTTTGCAGCGCATCAAGCACGCGAGGCCAGTAACGCTTAGGCAGAATCAGTTGCAAGCGATCATGATAAATTGGAAAGTCGCTTGGCTTCATCCCCGTAATATTAGCCACTTCTTGATCCGTTAACTTATATTTATAAACGATGCGCTTGATTTCAACTGAAGCATGACCTCTAAAATAAGACAAACCAAAAAAGAAAATTGCAAAAATACCTGCCAGTATAATCGCTAATTGCAAATTCAATCTAAAAGTGTGCAGGCAGATAAAAAATACGGCAATAAAAGCAATTATTGCGGAAATTATGCCATAAATGACTGGAAATATTGATTCTTTTTTTTGCATAATTTTCTCCTTTCAACGCAAAAGCAGTAACTTTTACGAACTTAGCAAAATTTAGCCAACTTTACAATGACTTTTGAATATGAATTAAGTATACTTGCAATAACTCAATTTGTATTTAGAAAGGGACGATAAAATGAAAAAAAATGTCCATTACTATTTTTCTTTAATCCACTTCCAAATCGATGACTTAGGCATCCATTTAGTGTTGCCGCAAACTTGGCAGGCAATTGATCTTTATCAAGCGATCGCGCATGACCGAGAAAGCATGGGTAAATGGCTGCCTTGG
GCTTATAACATGCAATCTTCAGCTGACGAAGCCAAATTTATCAAAACCATCCAAGCCGACATGGTCAAAGAGCGCATGATCGTATTAACCATTCTTGTTAATGGTGAACCATGCGGCATGATCGATTTGCATAATTTAATCACCAACCAAAAAGGCGAAATCGGCTATTGGCTGTCTAGCAAATACCAAGGCCGCGGTATTATGACTAAAAGCGTTCTTGAAGTATGCAAGTATGCCTTCAGCGAGTTAAATCTTAAATACGTTGATTTAATCGTCGCGGTTGAAAATGGTAAAAGTGAACGGGTTGCCAAAAATGCCGACTTTAAATTAATGGGAATAAAACAACATTTGATCCATCATCACACTATGGCTGGCAAAATTTTTAGAAAAATCAATTCCAATTAAATAGACCTGTTACATCTTTAATAATGAAAAAGATGATTGGTATAAGTAGTGATGGTAAAAAGTTCAGCGTTTTAACATCACGAATTTTTAACAGGGCTAACCCTGAACAGGCAATTAAGAAGCCGCCCACAATGGAAATCTCTGTTACTAGTGGACAGGTAAAAAAGCTTGCCGATAAAAATTTGGCAACGAGATAAATGCTGCCTTGCCAGCAGAATAGAACGGGAGCAACAACCAGCATCCCCCAGCCGAAGCTTGCACCAAAAACCATCGACGTTACAAAATCCAGCGTAGCATTAGTGAACAGCATGGTGTTGTCGCCTTTTACAGCCGCAATAACAGGGCCTACAATAGAGAGCGCGCCCCGATACAGCAAAGCAAGATTTCAGTTGCAAGGCCTTTACCTAAATTGGATTTTCCCACTTTTGCAACCAACCGATCAAAACGCCCAGAAATGTCCAGAGCGGTCCCACATACGCCGCCTAGAGCTAGGCTGACAATGAATAAAACGGGATAGTGGCTCTTAGGCATGGTGTTGGTCACGTTTTCCCAGCCAATACCTAGAGCGGCTAGCCCCATTGCGATAAACAAAACGTCTTCGTAACGCTTGTTAATCCCCTTTTTCACTAAACAACCAATGGACGTTCCTACTAATAACGCTAATGTATTTACAATTGTGCCAATCATATTCCTCACCTCTCAGTTTATTTTACATCAGCAAAAAAAACAGCATCCCATATCAGGATGCTGTTAATTTTTACCATTGAAACTTCCAGCCGCCACTACTTGATGATGAACTAGAACTACTACTGCTTGACGACGAAGCATCAGGATTCTTGCCATAACCAGAGCTAGAGCCACTTGCAGGCTTATCTGAATTCTGATCATAGATCTGATAATTAGTAAATGCTTGCGGATCGTCCCACTTGATATGACTATTTTGGTCATTCAAATCATTTTGCCGTGTCTCTTCATTATCTAAAGTTTCAGTTTCTAAACCTAAATTCTTGCGAATTCTATTTGAAATCTTTTGTAATTCTTTAGTTGAAGCAACCTGAATCGAAGAGCCATCAATCCAAGCATCATGCCCTTGAATATAACCACTCTTCAAGTTCTTTGCACAGCCACGATAATTCAAAACCAGGCTAAGCATATCGTTAAAGGTTAAATTAGTCTTCACATATTTATTGAAAATATCAATAAGCTTACGGTAATTGCTGATAGTATTAACTGACATAGCCTTGGTCATAATTGCTTCAATTACCTGACGCTGGCGAAGCTGACGGCCGTAGTCACCACGTGGGTCTTCTTTTCTCATACGGACATAAGCCACGGCATGACGACCATTCAAGTGCTGCTTTCCTTTATGGAAATCACACCAGTCATATGAAAAGTCAAATGGCACATTAACATCTACGCCGCCTACCGCATTAACCAAACTACGCAGCGCCTTCATATTTACTTCTAGGTAATAATCAATTGGTACATTTAACATGCTCGAAACGGTCTTCATGCTTGATTGACTGCCACCAATTTCATAGGCAGAATTCACCCTAAACATGAAGTACTTGTCTCCTGGATCGCCTTTAATATCA
GCCAAGGTATCACGCGGAATCGAGGTCATCGTTGCAGAATTTTTTTGCGGATTAGCTGTAGCTAAAATCATAGTATCTGAATTACCACGATCATGACGCCCTTCAATACCTTGATCCACACCTAAAACTAAAACAGAAATCGGTTGTCTATTGGCAATCTTAGCTGAAGCTGGCACATTGCTATCGCCTTTACCATCAACTGCACTATGAATGGTGAAATACATGTGCGCAGCCCAAGCTACCCCAAAGCAAATGACCACTAATGCGACTACGCCAATAAAGCGTGCAAACATATTGCCACTTTTATATACGGTACTTTCTGCAGCAAAAGCACGATTGCGGTGGAGATTAAGCGAGGATTGTCTTGCTCGGTATTCTTCTCTTCGTTTTGGATTTTGGTCCATTATCTGTATCTTTTCCCTTCAAAATTAATACATGTTCATTTGTATCACTTTTAATCGCTAGATGAAATAAAATTACTTATAATTAATAATAACTTATTTTTTGTGTTTCATTGTACATCGAATTATGATGTATACAAAATGTAATTAATGGAGGAAAATTTATGGCGATTCCAACAAGAAGCGAAGTCCCAGAAGATTTGAAGTGGGATTTAACCCGCATCTTTAAAACAGACCAAGACTGGGAAAATGCCTTTGACAAGGCAAAAGATGAGGTGGCTAAGCTAAGTGAATTAAAGGGAAGCTTGGCTAAATCAGGCAAAGATTTGTATGAAGGCTTGACCAAGATTTTGGCAGTTAAACGTGATGTAGAAAATATTTACGTTTATGCCACTATGTCTAGCGATGTTGATACTTCTAACTCACATTATTTGGGCTACGTTAGCCGCGTGCAAAGCTTGTCCAATCAATTTGAAGCAGCAACCAGTTTTATTAATCCTGAAATTTTGAGTATCCCTGCCGAAAAGTTTGAGCAATTCAAAAAAGACGAGCCAAGATTAGCTGATTACGCCCACTATTTGGAAATGATCACTAACAAGCGTCCTCACACTTTGCCAGCAGAAGAAGAAAAAATTATCGCTGACGCAGGGGATGCCATGAGCGTGTCAGAGAATACCTTTAACGTTTTAACCAACTCTGACATGGAATATGGCTATATGCAAGATGAAGACGGCAACATGGAGCAATTATCCAATGGCTTGTATTCATTATTGATTCAGTCCCAAAATCGTGACGTGCGTAAAGGTGCTTTTAATACTCTCTATGCCAGCTATGGTCAATTCCAAAACTCGCTTGCCTCTACTCTCTCCGGCGTTGTGAAAAAACATAACTACAACGCACGCATGCACAAGTATGATTCAGCTCGTGAAGCCGCATTAGCTGATAACGGCGTACCTGTTGAAGTTTACGACACATTAATTAAAGAAGTTGATTCACACCTTGACTTGCTTCACCGTTATGTCGCATTGCGCAAGAAAATTTTAGGTCTTAAAGACTTACAAATGTGGGACATGTACGTGCCGCTAACTGGTAAGCCTGCTTTGTCTTACAACTTTGAAGAGGCTAAAAAGGTAGCTAAAGGAGCCTTGAAGCCACTTGGCGAAGACTACTTAAAGCATGTTGATTATATTTTTAACAACCGTGTGATTGACCCTGTTGAATCTAAGGGCAAGGTTACTGGTGCTTACTCTGGTGGTGCTTACGATACCGATCCATATGAACTTTTGAACTGGGAAGACAATATCGATTCACTCTATACTTTAGTTCATGAAACTGGACACTCAGTTCACTCTTGGTACACCCGCCACAGTCAGCCTTATATCTACGGTGATTACCCAATCTTCGTGGCTGAAATTGCTTCAACCACTAATGAAAATATTTTGACTGAATATTTCTTGGACCATATTACTGATCCTAAGACGCGCGCATTCATCTTGAACTACTACCTTGATTCATTCAAGGGTACATTGTTCCGCCAAACTCAATTTGCGGTATTTGAACAATTTATCCACGAAGCAGATGCTAAGGGCGAACCATTGA
CTGCCGATATTTTGGATGATGTTTATGGTCAAATTAACCAGCATTACTACGGCGACAGTGTTGAACCTGGTGGCGATATTGCGCTTGAATGGTCACGAATTCCGCACTTCTACTACAACTTCTACGTTTATCAATATGCGACTGGATTTGCGGCTGCAACGGCTTTGGCTAACAAGGTTGTGCATGGTAGTCAGGCTGATAGGGATGCATACCTGGGCTACCTTAAGTCAGGTTCTAGTGACTATCCTACTGAGATCATGAAGCGTGCCGGCGTTGACATGACTAAGCCCGATTATTTGGAAGATGCTTTCAAGACTTTTGAAAAGAGATTGAACGAATTCGAGAGTTTGATTGGTAAGTAATTCATGCAGGCTTATTCTCCAGATTATATTAGGGATGCTTTAGTCCGTTTTGCCTATCACTCAAATGCAATTGAAGGTAATTCATTATCGCTCGGGCAAACGGAAGCCATTGTTTTATACGACAGAGTAACTTTTGTGAATAATAAAGGCGTTAAGCTACGTGATATTTACGAAGCCAGTAATCAAAAAGATGCATTCTATCTAATGCTCAATATGACTAACAATAATGCGGAATTAACCATTGATAATATTTTGAAATTACAATAATTTGAATTAACTAAAAATACCATTGGTACTGCAGGTAAATTCAAACAAAATGAAAATATGATCTTAGGTGCTGATTTCCAAACTTCTTCCGTAGCTGAGACTCCTATTGCTGTCAAACAATGGGCAGAAAATACCAATTATCGTCTAGAAAACAGTCAAACTAAAGATGAATTCCTCCAAAATCTGATGGCTGCACATATTAATTTTGAACGAATTCATCCCTTTGAAGACGGAAATGGCATAACTGGTCGAGAGCTAATTAACTTAGAACTTGCTAAAAATGAAATGCCATCTTTAATCATCGAAAAAGACGATAAAGCGCAATACATTACTTACGAGGCTGATCAAGATGTTGTTAGTCTTACCGAATATGCCAAAAAGAAACTAGCCACTGAACAAAAACGTTACGAAATATTTGAGCGTAATTTTAAAGCTCAGGAAGAATTTGATAAGAATCATCAAGAATAAATAAAACTTTTTCCAAACTCTCATTTTTTTGGTATCATTTCAGCGTATTTGATATCGATAAGTTTTTAATGACACTGACTTGAAAAAGATACCCATACAAAAAAGCTGACAACTCACTTTCACGTGAATTATCAGCTTTTTGCTTTTAAATGGAACCTGTCAAGCCAGCTCCCGTCCACTTATTTGGATTTTCATCTAGCATAATTTTTAATTTAGTTAACGTTGATGGTAACTTGAAATCATGCATCAATGAATAATTATCAATTTCCTGATTTAACTTAAAAATAACCTTATTTAGATCAATATCTTGTTCTATTTCACTTAAAGCATGATCAATTATTGTCATTGCTTCAACATCACCAGTAATATCTGGCTGTGTTTTCATACTAAGCAGAACAGATTTTAATTTTTCATCCATTAGATCATCCCTGCAACTAAACCAACTACATACAAGAAGACGCAACAAATCACGAGATACAGACTGCCCCACTTGCCAGTTTCCTTCAAGATCTCGCCTTCATCACCTTCACGATTAATCGCACCGGTTGCTACCGCAATATTCTGTGGTGACAGCATCCCTGCCGTTGCACCCGTCATGTTCGACGCTACAACCCAGTACTTGCTTACACCTAAGTCCGTTGCAGCTGACAGTTGCAAGTTACCAAAGAGCACGTTAGCTGAAGTAGCGGAGCCTGTTAAGAATGTTTCCAAGAGCTCCAATTAACGGTGCAAACAATGGATAAACAGGTCCAAGCAAGCTTACCAAGGCAATAGCTAATGCATCAGTCATACCAGCGTAGACCATCACCTTAGCAAGTCCTACAATGGCGCAAACCGTAATCGTGGTCTTCCACACGCTCTTAATGCATTGTCTTAAAATCTGCATCATTTTCTTAAAGCTAA
CGCCTTGAATGGTTCCCCAATTAATGTGGAAATCAAAAGCAATGTTCCTGGAGAATTTAACCAATTAATAGCCAAGGTATTAGGATTCTTGCCAGTGTAAATAACCAAGTTAGTTGTAGCCTTGCTGAGCAAATTATTCACTGGCGCTACCAATGATGATGCAAAAATGACGAAGATAAAGACCAAGATAAATGGTGAACATGCCTTAAGCATTTCCTTCGTACTTGGACGCTCCACATCAGGTCCTTCTTCATCTTCATGCCATCTGGTAAATAAAACCGTAATAGTGATTGTGCAAAGTGAACCTACAATTGCAGGTGATTCTGCACCAACAAATTTAGCGGCTATTACCTGGGGAATAGCCATGCCAAGTCCGACATCAAGGTAATCAAGCCAACGCCTTTAACAACCTTAATGCTGCCTTCCGTCAAAATTACCAAAATAAATGGCACTAGCAATACCAAAACGAATAATTGCAAAGTTACGATGAAGGATAAATTTTCTTGCTTCAAGTTAGTTACTTCAGCCAATGTTAAAATTGGCAATCCAATTGCACCAAATGCTGTTGCAGTGGTATTGGCAATCAGACTAATTACTGACGCTCTAATTGGATCTAACCCAAAGGCAATCAAAATACCTGCCGCAATTGCGACCGCTGTACCAAAACCGGCAATTGATTCAAGAAAGCCACCAAAGCCCCAAGCAATAATTAGCACGATAATGCGCTTATCAGTTGAAATCGTAGCCAGCATGTCCTGAATGATTTTCATTCCGCCCGATTCAGTAGTTACATTGTACGCAAATAACGCCATCACGATTACGAACATGATTGGCCAAATACCCATGATGATCCCTTCTAGGGCTCCTGTCATAGTATCTAAAACCGGCTATTTAAAACTAACGATAGCAAGTACAATCGTTAATACTAATCCAATCACACAAGCACGTGCAGCTGGCATTCGCATCACACCAAGCGAAATAATTAACCAAATGATTGGCAACAATGCTATTGCAATCGAATCCACAAAGGTTTTGGTCAGGTTCAGGATACCTTAGAATCCTATTTTGACTAAATAAAAGATTAAAAAATCGATCTACGAGAAAAATCTATTTACACAAACTATTTGACAGTGTCTAAAAGTTAAAATCAAATACGTAAACTTTATCAATTATTACAAATTTCAAAATTTAATATTAAATATTCAGTAAAAGACATAAACAATCTAGGACTTTTTAGTCATGAAACTTAGATCTATTATTCATAATCCATAATTTGAGCAGTACTATTTAATTTTTTAAAATCTAATTCACCTAATCTTTTGGCAGTAATTACCCCTGCAAGCATTGAATCATTAACATTTAATGCTGTTCTAGCCATATCTACAATTGGATCAATCGCAATTAAAATTCCTAGCACTGAAACGGGTAAATTTAAGGCACCTAAAACCATTAAAGTAGTAAAAGTTGATCCTCCACCAACACCGGCAACGCCAAAACTTGAAATCACATCAATTAATACCAAGGTCAATATGAATTGCCATGAAAAAATATTTACACCTACAACAGACGCACTGATTGCGGCAACCATTGATGGATAAATTCCAGCACATCCATTTTGACCTATGGATAAACCAAAGCTAGCACTAAAATCAGCAATTGTATCACTCACACCCAACTGATTTTTTTGTGTTCTAACATTTAAGGGAAGACTTCCTCCACTAGTTCTAGAAGTAAAAGCAAAAACTAATACTGGCCAAACGTTTTTAAAATAAGTAATCGGATTAATTCCATTAATCATCAATAAAACTGCGTGTACTAGGAACATTACCCCAATTGCAAAGTATGCAGCAATAATAAATATTAATAATTTACCCATTGTCGTAAAACTATTAGTAGCGGCGGCACGAGCAATCAAAGCAAAAATTCCATAAGGCGTTAACTCTAAGACCATTTTTACTATTTTCATAATAATAGCTCTCAAAGCTTGAATACCACGTGCA
AAGGTCTCTGCTAATTTTGTTTTTTCTTTCTTAAGACTTAAGAATGCTACCCCGACAAATAATGAAAAAATTACCACTGCAATAGTACTTGCTGCTCGCATACCTGCAAAATCTGCAAATATATTTTTGGGAAAGAATGAAACGATTTGTAGTGCTAAAGTTACATTCTTAAGCGATTCTTGATGGTCTCTAATTGAAGTTAACGCCTGCGAAGTAGCAGTTAACCCACGTGCAAAATTGGCACCTTGTAAATGAAATAAAACCACACTAATGAAACCGATAAATGATGCTACAGCTGTTGTTCCTAATAAAACTACTAAGACGTTTCCTGCAATCTTATGAATTTTTTCTGTAACTTTTAATTGAGTAAAAGCACCTACTAAAGAAACAAAAATCAATGGAATAACTAACATTTGAAGAAGCGAAATATAACCATCCCCAACTATAGAAAGCCAATCCATACTTTGACTAATTACTAAACTTTTTGCTCCAAATATCAGTTGCAAACCAATACCATATACAATTCCAACAACTAATGATAAAAATACTAATTTAGTAAATCCCCACTTTTTATGATGAAGATAGATCATTCCAGCTAATAAAGCCATCGCTACTATAAGCGCTACAAATGTTTGCATAAAATCCCCCTTGCATTACTTTGAGCAAACAAAAGGAGTCATTGACACATACTGTCAATGACTCCTTGGATAATCTTTGTAACAGATGAAAAGTTCCTATTTATGATAGAAAACAGAAAATCCTTGCGCGAGAAAATGACTGTACGCCAAATAAATTGTCAATTTTTTGACAAATGCAACAACAAGAATTTTCTTGACGTACAAACATTTAGCTCACTCCTTTCTTACTTTTCACAAGTAATAATATAGAATTACTTACTTAAAGTCAATAAAAAAATTAAAATTTTTGCTTGCAACTTTGCCCAACAAACTATATAACAATTAATGGAAAGTAAAATTTCTCACATCATTTAAGAACTACTCCTCGGAATGGAGTAGTTCTTTTTCTTTTCCAATGAAAAAACAGGTTGAGATTCAACCTGCTTCTTATTTTTATTAGCGATGTTTAACTACATGGATGCCATAAACAACTAGGTAAATCACTACCTCAATCGTTGCAATAAAGAATGTGACAGGCCAATTAGTAATAAAGCCTAAATATAGACCTGCCCAGACACCTACTAGAGCGAAGAAGACTGACCAGGCAATCATTGCTGGAATCGTTCGGCCGATATGACGGGCAGTTGAAGATGGCAATGTTAGCAAGATAAATACTAGCAATGAACCAACAATTTGTGCACCAATAGCAACTGACATAGCTAAAGAAATGAGGAAAACAATTCCTACGAAGCCAGTATGCACGCCGTGCGCAATAGCACCGATATGGTCAAATGAATCGAAATTCAAGTTTCTTTGAACCAGCAAGATCATCAGCAAAACAATGACTGATAAAACAACCAGTTGGATCACGCCTTGTTTGTCTACACCGATGATACTACCGAACAAAATATTCGTTGCATAGCGGCTATTAGACCCTGAGATGGCAAGAAATAAAACACCTAGTCCAATGAACAATGCAGAAATTGCACTAATTGAAGACTCTTTTTGATCACTGTGCAAGGACAATTCACCAACACCAATTGAACCTAATAGCGTAAAGAGCAACATCCCCCATAACGGTGCAATACCGATGAACACGGCAAAGGCTGCTCCCGCAAAACCGATTTCTGACAAGGTATGTGCTAAAAAGGCATAGCTGCGTGCTACAACGTAGACTCCGACAATCCCACAGGTAATCGCAATAAATGTACTTGCTAAGAAGGCATAACGCATAAATTCATATTGAAACATTATTCGAAATCCTCCTTAAACTCACTCATCTTTCCCTGTCTGATTGAACCCTTATTAAGATAGAGATAATCAGTCATATATTTTTTAGCTAATGGAATATCATGAGTGACGAACAAAACCGTAATCTTGTGT
TCCTGATTAAGGTGTTTGATCAGCCCCATCAATTCTTCCTTAGCTGCCGGATCAAGGCTCGCTGTGGCTTCATCCAGAATAATCATATTAGGATTATCCAGCAATGCTTGTGCCAAATATGCACGCTGCTTTTGCCCTCCTGAAGCTTCACCCATTCGAGTATCCTCAATTGGATCTAAGTGAGTTTCAACCAATTGATGATCGATCTGATCCTTCACCTTTTTGGTTTTAAAAAGTGGTGCATTCAAACCAACAAACGCACGAATTGAAAGTGGATATTCTGAATCAATATTTCTAAATTGTGGCACATATCCTAGCTTAACATCTGATGAAAAATTATAGCTACCATCAGTTGGTGTAAGCATCCCCATCAAAATCTTAATTAAGGTTGTCTTTCCTGTCCCATTTGGCCCCAGTAGAGCAGTCATCGATCCTTTTTCAAGATGAAAATTTATATCTTGAAAAACGGTCTTGGTATCAAATCTCATCGAAAGATCATTCACATCTAAAATCAATTTAGCATCCCCTTTACTTGCTAAATTCTATCTATTTTTTGCTAATATTAGCTAAATTTTGGTAATTTTCTCTCATCCAATCTAAGTAAGTCATATGATTTGGAATTGTTTCGCACACATTGAGCACTGGTACACCGTTTTCCTTAGCAAGCTTAACGAAAGTCTTCACAGTTGAACTACTTGCTTGAGTGTTATTAACAAAGAATGCGATCTTCTTGTCCTTAATTTCATTAGTCATCTTGTTGATGGTCTTAGGACTTGGATCAGTACCATTTTCGATAGCTTCTTCAAATTCCTTGTCACCAATCTTGTAACCTGCCTCTTCTAAACCATAATCAAAAACAGGTTCACTAACAAAGACGGGCTTATTGTTCTTCTTATCAGCTGACTTGGCAATTTGCTTAACCTTAGCAATCTTAGCTAAGTACTTGTCACCATTTTCCTTAAAGTAAGCTGCGTGCTTTTTGTCTAACTTAGATAAACGCTTAACTAAGTAGTTAACATACTTGGTTGGCATATCTAAATCATACCAGATATGAGGGTTGTCGACCTTCTTGAGCCCCATTAAGTCTTCACCTACTAAGACAGGCTTCTTGCTTACTGAGCTAGCTAACTTGTTCATCCAGCTATCGTAGCCTAAGCCATTGGCAACAATAATATTGGCATTAGTTAACTTCTTAGCATCGGCAGTAGTTGGTTCAAAATCGTGCGGATCGGTTGCACTATTTTTAATAATTGCTTGAACACTGCCGTACTTACCTGCAACGTTCTTGGCAATATCTGCATAAACATTAGTCGTAGTGACAATAGAAACTTTATCTGACTTAGTTTTAGCTTGGTCTTTATTTGAGCACGCAGAAGCTAGAAGCATAATCATGCCCATTAATCCCATAATAAAAAGTGACTTAGTAATTCTAGATTTAATTTTTTTCAATATATGTTTCTCCATACAAAAAAAGCATTTAAATCAGAAAATTATTCAGCAATTCTTTTCTGACTTAAAGTTAGAATAAGCTAGTTTGATAATCGTGTCAATATTTTTAAGCGTACACAAATAGCACCAGAGTCGCTTAGACCTGATGCTATCAATAAAAACTTATAAATTATAAGAAATACTTAAAATGTAAAATTTGCTGAATTTTATTTAAATGCTTTTGTTCTAGAATGACATCATTATATAAACATATTTTTATATGCTTGTCAAACTCAAACTATTCTTGTAAGTGTCAATATTTTTGCGGTAGGCTATTAGCCATTAGTCTGACCTTAATTTCTTAAATATTTATTTGATTTATTTTTCTGCTAAAATCGTTTAAATAGCTATGTCCTGCTGTTAAGCTTTCAATGGCGCCGTGATGAATGCCCAGATGAGCTTCATGCGTCTTTCTTAGGCTGGCACGAACTGCATCTTTCAATTCTTCCTGATCACCAATATCGAGCTGTCCACCTGCAAATTCACTGCTAAGCCAGTATTCCAGCTCTTTG
TTTTTAATGCAAGTCAGTACACGCAGAATTCCTTCGGCACCATCTTCTGACCAGTATTTTCCTTGCCCTTTGACGCGATAAGTATATTTACGGTGATTGCTTTCACAGCAGCCAATTGCCTTAATTACGGACAAATGACGCATTTTGAATGGCTTAATATCCACCCAGCGACGATGAAGATAGCTGCGCAGGCGCCTTAAATTCTCATGATTTTCTGGCGTATTCAGCTCATCAATCAGATTGCTTTCAATCGTGTCTAAAATCACGTCAGTCAAGTGACGGTCATATTTAAACTCAACTGCACTGATCAATTTGCCTTGCATGGCCGGCATGAAGCTTAGACGATCTTTAATCTTTTTGTTCAAGTGAAAGACATCCAAGAAGTGTTCGTGCTGCCGGCAATAGCCAACTATTTCATCAAAGTCTTTCTTGGCATAGCCAGCACCGCCATCGGCATTACTGATGATTAAAGTATCGCTTAAGTCATAGGTATTGGCTAAATAAGCTTTAATTTCTTGCAAAGCATCCAAGCGGCTTAAAGAAACAAACTCTTTGGCCTGTACTCTCTTTCTGCGCGTCTTGCTTAAATTAATAATATCCTCGCAGACTTGATAGCGATGAAATTCCAGTCGCTTCTTGGTGCCCTTGATCACAACGCCATCGCCTTCAAGATAAAGCACTGGCACCTTTTTCTTTTGAGCAATACCATCGTATCTTTCCTCAGACTGCTGCTGTGCTTTAATCTGTTTGCCTGCTTGCACAATCAGCTTGCCAACTTGACTATGGCTAATATTGAAACAGGAGAGAGTATTAACAGCTAAAGCAGTATTGCGGTAAGTTGCCACGGCGCTAACCTGCAAAATATCCTTCACTGCCAAAACACTGTAACGCTTATATTTGTCATAGCCCATCAGCTTATCCAGTGGATATTTAGCCTTTTTGCCAGGACAAGCATATCTTCTGCGCCAATAGCTGATTTCACCAAAGGCAGTCACAACTGAACGCTTATTTTTCTTCTCGATCTGATAGCCTTGCTTTTTAGTCTCTTCAACTAGACCAGCATCAACTTCTTCTAGTGCTGCTGTCATCAATTCTCTAATCAGATCAAAGAAATAGCACATTAATGCCTTTTCACGGGCAATAACATTATTTTCAGACTTAATAATTTTTACGATATCAGCTATAATAGATTCCATAAAAGACCAGCTTCTAATGTGTTTTTTTCGTTAACATCAGACTACTGGTCTTTTTGTTTGAGCTCAAGTCCAATTTCAAAGGATTTGAGCTTTTTTAATTAATTATTCAACTTACCGCAAAAAATTATACACTTAGCTATTCTTTGACGTGCTGACGTAAAACATCCAAGATTTTAATAATATGGTTATCAGCTAAACTATAGTATTTTTCTTGCTTAACCCGTTCACTTTTAACCAGCTTAGCTTCCTCTAAGATTCTAAGATGACGTGAAATAGCTGGCTGGCTAACATCGAATTGTTTAACTAAATTATTTACGCTGCTTTTTTGATGATCATTTAAATAATAAAGAATTTGAATACGGATTGGATGATTGAGAGCCCTAGTGATCCGGATAATTTGATCATTCAACTTTTTCACCTACTTACTAGTTTTAGCTTATTTTATCCCTTTAATTTTCTCACAAAAAAAAAGCGCTACCCGAAGGTAGCGCAAAACATAACGGCTCTATAATTTTTCCTGTTGATGACTTATCTATATGATAAGCTGTTAACAGGATTTTTTATTGCATAAAAAAGGAGGACATAGTCCCCTGTAATAAAAAATCTATCCTACTAAACATTTTATAAATAAAGGTGTGTTTCTATGTCCTCTTTTAATAATTGTATTAAATTTGATCTTGATATTAAAGAAAAAAATATTGTTTTCAAAGACTATTTCTACAAAATGGTCCAGATGAAAAAGCACAAAATCTATGAAGCTGAATTGCTTCAACCTGCTTGTCCTTTCTGCGGCTCCCTAGATCT
CTTACACAATGGCCATTTGATTACCAACATTCACTATTTAACAGCTAATGCAAGTTTACCGGTCATTATCAGATTGGCTAAGCAACGAGTTAAATGTCGCGCTTGTGAGCGCTGGTCTATGGCTCAGTCTGAATTGGTAAATAAATACTGTTCTATTTCTAATGCCTCTAAGCTAAAAATTTTGTCTGCGTTAACTGAAGATCGTTCGATGACCAGCATTGTCAGCGAAAACAATGTTTCTGTCAACACTGTTCAAAGAGTTTTAGGCAGCTGCTCTCACCGTTTCCTTGACAGCTATGAGTATCTGCCAGCTCATTTGGCTTTCGATGAATTCAAAGGCGTTGATCGTCAGCTGCACTTTATCTGCTTAGACGGCGATAGTCATCAAGTCGTGCAGATTCTTAGAACTCGCTACAAGAAGACGCTGCTTAAGTATTTTGGTCGTTTCTCGCCCCAAGCTAGAGCTAACGTTAAGACGGTCACGATGGATCTTAATTTTTACTATCAAGATATCGTCAGAGCTTGTTTTCCTAACTCTCAGATTGTCATCGATCGTTTTCATATGATTCAAATGCTTACGCGTTCTTTCAACTCACTAAGAGTCCAAGTCATGAAGACCTTTGATAAAAGATCCAGACAGTATCAGCTGCTCAAATCTCCCTGGAAATTATACTTGAAGAAGTTTGATGAGCTTGAAAAAGTTCATCCTCGCTACAACTGGCATTACAAAGACTGCTTAACTCAAGCACAAGTAGTTACAGAAGGCATTAACACCAGCACTGCTTTAGAGAATTCATATAACCTGATGCAAAGCTTTATTCAAGCCGTTGAAACTGGCAATACGCATGAGCTCAAATCGTTAATCAACTGCCAAGATCAAATTGGAACTTTAATGCACAAAACACTATTAACCTTTAAGCATAATCTAACAGCTGTGCTTAACGGAGCAGCCCTGCCTTATTCTAACGGCTGCCTTGAAGGGTTCAACCGCAAGATCAAACAAATTGAACGAACAGCTTTTGGCTATTCCAATTTTACCAATCTCTTAACCAGAATTCGTTTGGAAGAAGATCTGTACAAAGAAAAAGAACCAAACAGCCTTTTAATGGTTGCCTGATTCTATTCATTTACTCATTATCAACAGGATTTGACAAAGAGCCAACATAACAGAGTATCTTGATTTAAGTAGAGATCATCAAGATATTTATATTATAATACGTTTTTTACATAATGGAGCATTTTTTGATTAATAACTTAAATTATTTTACTCAATTACGCTTTTTTATTCGTTAGGATCGTCTTTAACGTTCTTCTTCCAGAAACTCATAACCAAACCAGCTACAACAGAACCGATAATAGTTGCAAAGAAGTAGAACAAAATGTGACTTGGTCCACCAATATTACCAGCAAGTGGTGATACCCAAAGACCACCGTGAGGAGCTGGAACACCAACGTGCCATAAACCAACTAACGCACCACCGACAGCTGAACCAATGATACATGAAGGAATTACACGACCTGGGTCAGCAGCTGCAAATGGAATAGCACCTTCAGTAATGAATGAGAATCCTAATACCCAGTTTGAAACACCTGCACGACGTTCTTCCAAAGTGTACTTATTCTTGAAGAATGTAGTACCGATAGCTGTAGCAAATGGAGGAACCATACCACCAACCATAACGGCAGCCATCAAAATAGCAGCAGTAGCTGAGTGTGGATCATTAGCAAAGGCACCTGAAGCAAATACGTAAGCAGCCTTGTTGAAAGGACCACCCATATCGATTGACATCATACCTGCCAAGATAGTGGTAAGAAGTACCAAGTTACCAGTACCCATACCGTTCAAGAAGTGGGTAATAGCAAAGTTAATACCACTAAAGATTGGGTTAATGATGTAGAACATAATAGCTGCAATTAACAGCAAGCCCAAGATTGGGTAGAACAGCATTGGCTTCATACCTTCAACTGAGTGAGGTAATTTAGCAAATAGCTTCTTCAAG
CCAACCATTAAGTAACCTGCAATGAAACCAGCAGCGATACCACCTAAGAATCCAGCTGGTGAAGTTGCGTGTGCTTGAACGTTAACTTGGAATTGACCGTTAACAATCGCAGCCATGTAACCACCAACGAAACCTGGCATAAGGGCTGGTAAATCACCGATTGATTCGGCAATGTAAGCAGCAAGAACTGGAACCATGAAGGCAAAGGCCAAGTTACCGGCATTGTTTAAGAATACGAATGCTGGTGTCTTAGGACCACCCATGTAGTTTTCTACGATGAATGATACGGCCATTAAGATACCACCACCAATAACGAATGGAAGCATGTGGCTGATACCGTTCATCAAGTTCTTGTAAATGCCTGCCCATAAGCCTGGCTTTTCAGTGCTTGATGATTCAGCTGATTTATCAGCTGCACTTGCGTGGTAAACATCTGCCTTGTCATCCATAATGTCATCGATTAATTCGCCTGGCTTGTTGATACCATCAACAACTGGACGGTTAACTAAGTGCTTGCCGTCAAAACGTGGCATATCAACCTTCTTGTCAGAAGCGATAATAACCCCCTTAGCGCGCTTAATTTCATCAGGGGTTAACTTATCCTTAACACCTTCAGAACCGTTAGTTTCGATTCTAACTTCGATACCGCGCTTCTTACCTTCCTTGATCAAGGCTTCTTGAGCCATGTAAGTGTGGGCAATACCGTTGATACATGCAGTAACACCAACAATTAAAGGCTTGTTGTCATCATCTGAACTACCTTGAGCCTTAGCAGCTTCTTTAGCCTTTTTAGCTTCAGCAGCTTTCTTGTCTTCTTCGTCCTTCTTAGTTTCAGCCTTTTCAAAAAGGTCGATAACTTCTTCTGGAGTCTTAGCAGCCTTAAGTTTTTCAACTAATTCTGGATCAATTAATAAACCAGAAAGCTTAGCTAAAGCTTGAAGGTGAGTATTATCAGCACCAGCTGGTGCTGTAATCATGAAGAATAAGTGAACTGGTTGACCATCAAGTGAGTTGTAGTCAATGCCTTCCTTACTCTTAGCAAACAATACACGAGCCTTGTTAATTGACTTGTTACGTGCGTGAGGCATTGCGATTCCTCCACCAATACCGGTAGTAGATTCATTTTCACGGTCCCAGATAAACTTGATAAATGCATCTTCATCATTAACGATGCCGGTTGCAACTTCAAGGTCAGCCATTTCCTTAATGGCATCTTCTTTATTCTTAGCCTTGAGTTCCATAATCATGGACTCAGGACTTAATATGTCCTTAATTCTCATATATCTTTATCTTTCTCCCTTATAAATTAAGAAATTTGTTCAACCTTAATTTGTGGTAATACTGCATCGATTTGACTCTTTACAGCAATATCCTTGGTAAATGCAGTAGCAGCACCACAAGCCATACCAATTCTGAAGCCTTCCTTAGGATCTTGGGTCTTGTAGTAAGTACCAACGAAACCAGCGATCATTGAGTCACCAGCACCAACTGAGTTAACAGCAGTACCAACAGCACCCTTAGCGTGGTATACGTGATCCTTGGTCAAAAGGAAACCACCTTCACCAGCCATTGAAACCATTACGTTTTGTGCACCCATATCAAGCAACTTTTGTGCATATTCAAGCATAACGTCGCTAGAATCAAAGGTTACGCCAAAGAGGTCAGCCAATTCATGGTGGTTAGGCTTAATTACTAATGGCTTGTATTCCAAAGTATCAAGAAGAGCTTGACCAGTAGTATCAACTACGAATTCTGCACCAGCTTCCTTAATAGTTGGAAGAAGTGATTGATAGAAATCAGTTGGCAAGCTTGGAGTAAGTGAACCACTCATGACAACAATGTCGCCCTTCTTCAAGTCGTTCAAACGATCCTTGAAAGCAGTGATTTCTTGGTCAGTAATATTAGGACCAGCTGCGTTAATTTCAGTTTCCTTTTCTGCGTGAATCTTAACGTTCACACGAGTGTTGTCTAAAATAGTAACAAAGTCGCTTTCAATTCTC
TTTTGGTTCAATTGACGAACCAATTCCTTGCCAGTGAAGCCACCAACAAAGCCCCAAGCAGTGTTATCAACACTTAATTGGTTAAGAATTTGAGAAACGTTAATTCCTTTCCCACCTGCTAAAAATGCATCATTATTAGTTCTGTTAACTTCCCCGCGGTTAACTTTTTCTAATTGAAGAACATAGTCAAGCGCTGGGTTAACAGTTACTGTATAGATCATTATTTTGCCTCCTCTAGGTTGATATCGGCAGGTAATGCCGCTTTTTGACTGGTTTTAAGTTGATCTGTTACCACGTAAACGTCCTTAGTGTTGGCGAAAACCGCAAAGTTGCGTTCACCAATCTTGGATGAATCAGCTAAAACATAAGCATGTCTAGCCTGTTTGATTTCGGCAGCCTTAATTGCAGCTTCTTCGGGATCTGGCGTTGTAAGATTACCATTTTTTTCGATTCCGTTGGCTCCAACAAAGCTAACTGAAAAATTCATCCCTCTAATTTGAGCAAGTGCGGTTTGACCAATCACCGCATGAGTGTCGTCTTTGACCTCGCCACCAATTAAAATGGTCTTGATGCCGTGATTCAAGCTGCATAACGCCGTTTCAACGCCATTAGTAATGATCTTCACCTTAGGAATATCGGCCAAAAATGGAACTAATTCATATGTGGTCGTTCCCGCATCGATAAAGACATTATCACCAGGTTGAACGAAGTTGCGAGCTGCATATTTAGCAATCGCGATCTTAACGTCGTGATTCAAATTAAAGCGGACGTTTTGGGAAACATCACGAGAGAAATTCTTTACTGATTGAGCACCGCCATGCACGCGCTTAAGCATACCGTTCTTCTCCATCTGAATAAGATCTCGACGGATAGTAGATTCGGATGTATCAGTTAAGTCGCATAACTCACTAACGCGGCACACTTCGTGCCGATTAACGTAGTTTTCAATTAAGCTTTGGCGTTCCTGTGTCAGCATTCTTTTTTTCACCTCGATCATTATTGTATCCGCTTTTTCTTTCAAAATCAATCATATTTATTCAAGAATAATCATTTTTAGTAATTTTTCTTCAATAAAATTCATTTTTATGCAGTTTTTACGCAAAAAACAGATCATGCGTGTTTACATGATCTGTTTTCTTAGTCACTCATAATTATTCTTAGTCATTTTACCCTATTTGACGAAGAATCCGCAAAAATGACGAAGAAAAACCAACAAGATTATTTCTTGCTGGTTTGATGAATATTTGAAAACTTATTCCTTTGGAATCCAATATAAAGTAACACAAAAGTATGACGTATGTGATTAAGAAATCTATTATTTTTGACATTAATTCTTCCCTTTACCAATTTTAAACGGTAACTGCCCAGTTAAAACAAATTGGCTATACAACATTTTAACCGCATCATCTATGGCTAAATTATTTTCTTTTAGAATTGCTTTTGCCGTATTAACGATGTCCTTGTCCAATTGAATATTAATCGTTTCTTTCTTCATTCAGCCACTCTTTCAACTGTTGTAAATTCTTTAACTCAGGAATATTTCCCCAGCTCGTTGTTAATGCACTTAATTTAGCAGCCAATATTGCTTTTTCCTCTTCTTCATTAGTAAGCTTGCTAAAAGGTAATTTTTTATTATTAGCTATATTAGTCAAAAGACCTATTAAAAAATCACTTTGATCTAATCCTAGCATTTCTAATTTTCTTGGACATCCTTAGCTAAATCTTCATCAACGGAAATGCTTAGTTGTGTTTTTTGCATCGAAGTTTGATCTTTGTAATCCATAATCTATCACCCATTAATTAAGCAAACCGCGCTTAAAGTTTTGACATGGACCTAAATCATCGGTTTGTAAAATCAATTTTGGATAGATACGTCTGATCCACATGAACTACACCGGCACTGAAGTGCCGGGTTTCTGGGAACACTGAGTAGTGTTTAGGATCTTACATTGAACTAGTCAAATACCTAACTCACAGAACTCAGCTTGTTACCATGGCCA
GTCCCTGACCACTTAGCTTTTTACTTTCTTTATGCAAGATATTAACTGCCGCATTAATATCTCGGTCATGCTTGGTTTGACACTTAGAACAAGTCCATTCACGAATCTCTAATGGCTTAGCGCCACTGTTATAGCCACATTTTGAACAAATTCTGGAAGTGTTCTTGGGATCAACTGCAATTAGTTTCTTGCCATACCATTCGCATTTGTATTCTAGCATTTGTCTAAACATTCGCCATGAAGCATTGGCAATTGACTTAGCAAAATGATGATTTTTCTGAAGATTCTTGGTTTTCAAATCTTCAATCGCAATTACATCATATTGCTTAACTAAATGCGTAGTTAGTTTATGCAGATAATCTCTGCGCTGATTAGCTATTTTTGCTTGGTATCTGGCTTTTGACTTTTGCGCTTTTTGCCAATTAGCAAAGCTTTCTAAGCTTCTAGGACAAAGCACCTTTTTATTACTGTCTTGCAAAACTAATAATTTAGCTAAATGTCTGCGACGAGCATATTTTCTTTGCCAGACTTTGGCTTTCTTTTCAAAATAAGAACTGTCAAAACTAGGATATTTTAGCCCATTAGACAGAATTGCTAAATCAGCCACGCCACCATCAATCCCAACCTGTTTGCCAGTTAAGCTGTACTTCTCTGGTTCAGGTATTTCAGCTTGTAGAGACAAATAATACTTGCCCGTTGGTTCAAGTACGACCGTATAACGCTTAATCTTAGTGTTTTGCAAAACTCCATTCTTGCTGGTTTTGACATAGCCTAATTTAGGAATCTTCAAGTACCTTTTACCTGCAGTTTTTATGATTGACTTGCCTGTATAAGACTTTTTCAGATACTTGCGTGAATGAAAACGAGGCTTGCCAATTTGACCAGTTTTATCTTGAAAGAAGTTCTTCCAAGATTGAGTTAGAAATTCATTAACTACCTGCAAGCTTGAAGAATCGCTGGTTTTCAAAAAAGGATATTCTTTTTTTAGCGGCTTAAGCAGATAATTCAGCTTGAACTTGCCTAAAAAAGGCAAAGCTTTATTATTCTGATAGCGCTCATTCATCATGGCCAGCATTTGATTCCAAACAAAACGATCATTGCCAAACATCTGCTCAAACTGGTTTTGTTGTGTTCTATTAGGATAGAGTCTTAATTTGATCCCTTTTAGCAC
Protein sequences of DBSCAN-SWA_1 >CP019581|1183080:1222176|1200341_1200719_-|AZK91473.1|DBSCAN-SWA MQKKESIFPVIYGIISAIIAFIAVFFICLHTFRLNLQLAIILAGIFAIFFFGLSYFRGHASVEIKRIVYKYKLTDQEVANITGMKPSDFPIYHDRLQLILPKRYWPRVLDALQKYEKEQESTNNS >CP019581|1183080:1222176|1187651_1188203_-|AZK91460.1|DBSCAN-SWA MNFKRTLFKYTAALSIFFTGISAINVPATVHADEVSTSTVNTDNSMSDIETSTPAATTVKKTSAVTKKRNAIVKLAKKEIGKPYVYGASGPSAFDCSGLISYVYKNAANKTLPRTTYGQITLGKTVSVSTKKLKKGDLLFWGNYHVGIYIGSGKFVHAPAPGQNVKTQTLASFFPSSAKRVID >CP019581|1183080:1222176|1191017_1192208_-|AZK91464.1|transposase|DBSCAN-SWA MLKGIKLRLYPNRTQQNQLEQMFGNDRFVWNQMLAMMNERYQNNKDLPFLGKFKLNYLLKPLKKEYPFLKNSDSSSLQVVNEFLTQSWKNFFQDKTGQIGKPRFHSRKYLKKSYTGKSIIKTAGKRYLKITKLGYVKTSKNGVLQNTKIKRYTVLLEPTGKYYLSLQVEISEPEKYSLTGKRVGIDVGVADLAILSNGLKYPSFNSSYFEKKAKIWQKKYSRRRHLAKLLVLQDRNKKVLCPRSLESFTNWQKAQKSKAKYQAKAANQRRDYLHKLTTHLVKQYDVIAIEDLKTKNLQKNHHLAKSIANASWRMFRQMLEYKCEWYGKKLIAVDPKNTSRICSKCGYNSGAKPLEIREWICSKCQTKHDRDINAAVNILHKESKKLSGQGLAMVTS >CP019581|1183080:1222176|1195985_1197176_-|AZK91469.1|transposase|DBSCAN-SWA MLKGIKLRLYPNRTQQNQLEQMFGNDRFVWNQMLAMMNERYQNNKDLPFLGKFKLNYLLKPLKKEYPFLKNSDSSSLQVVNEFLTQSWKNFFQDKTGQVGKPRFHSRKYLKKSYTGKSIIKTAGKRYLKITKLGYVKTSKNGVLQNTKIKRYTVLLEPTGKYYLSLQVEISEPEKYSLTGKRVGIDVGVADLAILSNGLKYPSFNSSYFEKKAKIWQKKYSRRRHLAKLLVLQDRNKKVLCPRSLESFTNWQKAQKSKAKYQAKAANQRRDYLHKLTTHLVKQYDVIAIEDLKTKNLQKNHHLAKSIANASWRMFRQMLEYKCEWYGKKLIAVDPKNTSRICSKCGYNSGAKPLEIREWTCSKCQTKHDRDINAAVNILHKGSTKLSGQGLAMVTS >CP019581|1183080:1222176|1193028_1193553_-|AZK91466.1|DBSCAN-SWA MKINLEDLNDKIENQDYIQDLETVKYGDISKSKSKIKPYAEKMVKEVAAAFKHDSLVQTQLAVTGQRPVTFALETNIINLPYSNYKKIANFFEEGQEYPLNVYFETRSDYVNVSHFRIDQLATEEEVEKDTDKVVDQLVEAIIEKLKVVREYQVPEKATAKKKETKKSKTTNIM >CP019581|1183080:1222176|1185984_1187160_-|AZK91458.1|transposase|DBSCAN-SWA 
MLKGIKLRLYPNKIQQNQLEQMFGNDRFVWNQMLAMMNERYQNNKALPFLGKFKLNYLLKPLKKEYPFLKTSDSSSLQVVNEFLTQSWKNFFQDKTGQVGKPRFHSRKYLKKSYTGKSIIKTAGKRYLKIPKLGYVGVLQDVKIKRYTVLLEPTGKYYLSLQVEISEPEKYSLTGKQVGIDVGVADLAILSNGLKYPSFDSSYFEKKAKVWQRKYARRRHLAKLLVLQDRNKKVLCPRSLESFTNWQKAQKSKAKCQAKVANQRRDYLHKLTTHLVKQYDVIIIEDLKTKNLQKNHHLAKSIANASWRIFRQMLEYKCEWYGKKLIAVDPKNTSRICSKCGYNSGAKPLEIREWTCSKCQTKHDRDINAAVNILHKGSTKLNGQGLTMVTS >CP019581|1183080:1222176|1214337_1214610_-|AZK91490.1|DBSCAN-SWA MNDQIIRITRALNHPIRIQILYYLNDHQKSSVNNLVKQFDVSQPAISRHLRILEEAKLVKSERVKQEKYYSLADNHIIKILDVLRQHVKE >CP019581|1183080:1222176|1206987_1207371_-|AZK91483.1|DBSCAN-SWA MAIPQVIAAKFVGAESPAIVGSLCTITITVLFTRWHEDEEGPDVERPSTKEMLKACSPFILVFIFVIFASSLVAPVNNLLSKATTNLVIYTGKNPNTLAINWLNSPGTLLLISTLIGEPFKALALRK >CP019581|1183080:1222176|1193792_1195034_-|AZK91467.1|transposase|DBSCAN-SWA MDQSVVIKAQLLNIDDETAQAFSNTMFKYRDACNFISQYIFEHAFELKQSKLNKALYHDLRDKFELKSQMAQSAIKTVIARYKTVKTQLFQHPFRYDTGKKDGKGRGIWAQIYRDLTWLWQPINFKRPQLDLQRGRDWSYLSTTNQLSLNTLNGRRKVDFVCKGFDQYLDQTKWKFGSLKMLQLRGKWYIHLSATTAIPEFEAEQAVHVVGIDRGLRFLAACYDEKGKSILFSGQKILRKRRKYKKLRAELQAKGTKSAKRRLKKIGQRENRWMSDVNHRLTKTLIDHYGSNTIYALEDLTDVRFATEKSPKDQRYEMLSWAFYQFEQFLTYKANLNSSAVVKVPAKYTSQRCPKCGRIHKDNRDYELHLYTCDKCGYKSNDDRLAAMNIQFLGTLYRSGEEAPQFNKQASVE >CP019581|1183080:1222176|1189034_1189334_+|AZK91462.1|DBSCAN-SWA MKKADVKVGAIVGAKSEEELKKPFLGKVEKIYENSALLAITSYDPVDASAISDLNNKIVVNFKNLNAARAAKNIKTASTNEVKVEKVAKTKKADKDSKK >CP019581|1183080:1222176|1192387_1193011_-|AZK91465.1|DBSCAN-SWA MFSWNIIGILIWVAIILYLVFIIQNIRSRRIKMIIKQHKHFSWPNFILTVVEVVVLLVAAGWMFNQTFMDNPDLEDASRITSSVKYEPLIMKTGVGNSSYVTINSAKKRYGSQSYTFYKAGSKITASSDYASIAYGDTALDVNAEKIPYVKKTLKKMDKRYQRAYVAIYTATYKKNWQNGIGMHAGHLATRYYLIRVPDQSFIKQGK >CP019581|1183080:1222176|1218311_1219226_-|AZK91493.1|DBSCAN-SWA 
MIYTVTVNPALDYVLQLEKVNRGEVNRTNNDAFLAGGKGINVSQILNQLSVDNTAWGFVGGFTGKELVRQLNQKRIESDFVTILDNTRVNVKIHAEKETEINAAGPNITDQEITAFKDRLNDLKKGDIVVMSGSLTPSLPTDFYQSLLPTIKEAGAEFVVDTTGQALLDTLEYKPLVIKPNHHELADLFGVTFDSSDVMLEYAQKLLDMGAQNVMVSMAGEGGFLLTKDHVYHAKGAVGTAVNSVGAGDSMIAGFVGTYYKTQDPKEGFRIGMACGAATAFTKDIAVKSQIDAVLPQIKVEQIS >CP019581|1183080:1222176|1212794_1214201_-|AZK91489.1|DBSCAN-SWA MESIIADIVKIIKSENNVIAREKALMCYFFDLIRELMTAALEEVDAGLVEETKKQGYQIEKKNKRSVVTAFGEISYWRRRYACPGKKAKYPLDKLMGYDKYKRYSVLAVKDILQVSAVATYRNTALAVNTLSCFNISHSQVGKLIVQAGKQIKAQQQSEERYDGIAQKKKVPVLYLEGDGVVIKGTKKRLEFHRYQVCEDIINLSKTRRKRVQAKEFVSLSRLDALQEIKAYLANTYDLSDTLIISNADGGAGYAKKDFDEIVGYCRQHEHFLDVFHLNKKIKDRLSFMPAMQGKLISAVEFKYDRHLTDVILDTIESNLIDELNTPENHENLRRLRSYLHRRWVDIKPFKMRHLSVIKAIGCCESNHRKYTYRVKGQGKYWSEDGAEGILRVLTCIKNKELEYWLSSEFAGGQLDIGDQEELKDAVRASLRKTHEAHLGIHHGAIESLTAGHSYLNDFSRKINQINI >CP019581|1183080:1222176|1207352_1207850_-|AZK91484.1|DBSCAN-SWA MGIWPIMFVIVMALFAYNVTTESGGMKIIQDMLATISTDKRIIVLIIAWGFGGFLESIAGFGTAVAIAAGILIAFGLDPIRASVISLIANTTATAFGAIGLPILTLAEVTNLKQENLSFIVTLQLFVLVLLVPFILVILTEGSIKVVKGVGLITLMSDLAWLFPR >CP019581|1183080:1222176|1205370_1205634_+|AZK91478.1|DBSCAN-SWA MQAYSPDYIRDALVRFAYHSNAIEGNSLSLGQTEAIVLYDRVTFVNNKGVKLRDIYEASNQKDAFYLMLNMTNNNAELTIDNILKLQ >CP019581|1183080:1222176|1201755_1202097_-|AZK91475.1|DBSCAN-SWA MIGTIVNTLALLVGTSIGCLVKKGINKRYEDVLFIAMGLAALGIGWENVTNTMPKSHYPVLFIVSLALGGVCGTALDISGRFDRLVAKVGKSNLGKGLATEILLCCIGARSLL >CP019581|1183080:1222176|1220332_1220500_-|AZK91495.1|DBSCAN-SWA MKKETINIQLDKDIVNTAKAILKENNLAIDDAVKMLYSQFVLTGQLPFKIGKGKN >CP019581|1183080:1222176|1220985_1222176_-|AZK91497.1|transposase|DBSCAN-SWA 
MLKGIKLRLYPNRTQQNQFEQMFGNDRFVWNQMLAMMNERYQNNKALPFLGKFKLNYLLKPLKKEYPFLKTSDSSSLQVVNEFLTQSWKNFFQDKTGQIGKPRFHSRKYLKKSYTGKSIIKTAGKRYLKIPKLGYVKTSKNGVLQNTKIKRYTVVLEPTGKYYLSLQAEIPEPEKYSLTGKQVGIDGGVADLAILSNGLKYPSFDSSYFEKKAKVWQRKYARRRHLAKLLVLQDSNKKVLCPRSLESFANWQKAQKSKARYQAKIANQRRDYLHKLTTHLVKQYDVIAIEDLKTKNLQKNHHFAKSIANASWRMFRQMLEYKCEWYGKKLIAVDPKNTSRICSKCGYNSGAKPLEIREWTCSKCQTKHDRDINAAVNILHKESKKLSGQGLAMVTS >CP019581|1183080:1222176|1198010_1198781_-|AZK91471.1|DBSCAN-SWA MSDKHDGEFYQEILDICLTAGRLMIEGGSEMYRVEDTMLRIARNAGAEDPRVFATPTCVFMSLNHGNLSQMKQVRDRNIDLELVDRVNALSRQFAVKQIDLPQLKEKIAKVADAPFFPMWMQIIGAAVLSATLMVLFMDNYDWVDFPGAAIVGGLGFWAYCEFKKYTKVRFLSELIAAMVMGFLAIGLNWLDSQMIVGNILVGALMTLVPGLALTNALRDLFMGDLLSGIVRMCEAVLSALALGGGVAIVLKFMGV >CP019581|1183080:1222176|1188394_1188940_-|AZK91461.1|DBSCAN-SWA MKKIILIAGPSGAGKTTISDYLNEKYGIPRVLTHTTRPMRPGEKQNFSYHFETDESFKTLHFFEHIRYGSYQYGSSREALNLAWKKSDLVSLIVDIKGVYTYLKQLGNKVYFLYVTTSTKEELKERLLKRGDDPEKIKERLSGSELNLLPEDLKEYAHILVNDNLSETKSKLDSILSRLEE >CP019581|1183080:1222176|1205691_1206102_+|AZK91479.1|DBSCAN-SWA MILGADFQTSSVAETPIAVKQWAENTNYRLENSQTKDEFLQNLMAAHINFERIHPFEDGNGITGRELINLELAKNEMPSLIIEKDDKAQYITYEADQDVVSLTEYAKKKLATEQKRYEIFERNFKAQEEFDKNHQE >CP019581|1183080:1222176|1185312_1185633_-|AZK91457.1|DBSCAN-SWA MIKQELRDPANYNSVIAAVAQDHSRLNEIATATKIKATSLRSFLNNLIELEIVERVVPITEDPNKSKKSVYKIKDGMYRFWYRFIPMRLTLIERGLSAVNKINTKK >CP019581|1183080:1222176|1206519_1206822_-|AZK91481.1|DBSCAN-SWA MELLETFLTGSATSANVLFGNLQLSAATDLGVSKYWVVASNMTGATAGMLSPQNIAVATGAINREGDEGEILKETGKWGSLYLVICCVFLYVVGLVAGMI >CP019581|1183080:1222176|1214844_1216122_+|AZK91491.1|DBSCAN-SWA 
MSSFNNCIKFDLDIKEKNIVFKDYFYKMVQMKKHKIYEAELLQPACPFCGSLDLLHNGHLITNIHYLTANASLPVIIRLAKQRVKCRACERWSMAQSELVNKYCSISNASKLKILSALTEDRSMTSIVSENNVSVNTVQRVLGSCSHRFLDSYEYLPAHLAFDEFKGVDRQLHFICLDGDSHQVVQILRTRYKKTLLKYFGRFSPQARANVKTVTMDLNFYYQDIVRACFPNSQIVIDRFHMIQMLTRSFNSLRVQVMKTFDKRSRQYQLLKSPWKLYLKKFDELEKVHPRYNWHYKDCLTQAQVVTEGINTSTALENSYNLMQSFIQAVETGNTHELKSLINCQDQIGTLMHKTLLTFKHNLTAVLNGAALPYSNGCLEGFNRKIKQIERTAFGYSNFTNLLTRIRLEEDLYKEKEPNSLLMVA >CP019581|1183080:1222176|1216287_1218285_-|AZK91492.1|DBSCAN-SWA MRIKDILSPESMIMELKAKNKEDAIKEMADLEVATGIVNDEDAFIKFIWDRENESTTGIGGGIAMPHARNKSINKARVLFAKSKEGIDYNSLDGQPVHLFFMITAPAGADNTHLQALAKLSGLLIDPELVEKLKAAKTPEEVIDLFEKAETKKDEEDKKAAEAKKAKEAAKAQGSSDDDNKPLIVGVTACINGIAHTYMAQEALIKEGKKRGIEVRIETNGSEGVKDKLTPDEIKRAKGVIIASDKKVDMPRFDGKHLVNRPVVDGINKPGELIDDIMDDKADVYHASAADKSAESSSTEKPGLWAGIYKNLMNGISHMLPFVIGGGILMAVSFIVENYMGGPKTPAFVFLNNAGNLAFAFMVPVLAAYIAESIGDLPALMPGFVGGYMAAIVNGQFQVNVQAHATSPAGFLGGIAAGFIAGYLMVGLKKLFAKLPHSVEGMKPMLFYPILGLLLIAAIMFYIINPIFSGINFAITHFLNGMGTGNLVLLTTILAGMMSIDMGGPFNKAAYVFASGAFANDPHSATAAILMAAVMVGGMVPPFATAIGTTFFKNKYTLEERRAGVSNWVLGFSFITEGAIPFAAADPGRVIPSCIIGSAVGGALVGLWHVGVPAPHGGLWVSPLAGNIGGPSHILFYFFATIIGSVVAGLVMSFWKKNVKDDPNE >CP019581|1183080:1222176|1206247_1206520_-|AZK91480.1|DBSCAN-SWA MDEKLKSVLLSMKTQPDITGDVEAMTIIDHALSEIEQDIDLNKVIFKLNQEIDNYSLMHDFKLPSTLTKLKIMLDENPNKWTGAGLTGSI >CP019581|1183080:1222176|1210078_1210873_-|AZK91486.1|DBSCAN-SWA MFQYEFMRYAFLASTFIAITCGIVGVYVVARSYAFLAHTLSEIGFAGAAFAVFIGIAPLWGMLLFTLLGSIGVGELSLHSDQKESSISAISALFIGLGVLFLAISGSNSRYATNILFGSIIGVDKQGVIQLVVLSVIVLLMILLVQRNLNFDSFDHIGAIAHGVHTGFVGIVFLISLAMSVAIGAQIVGSLLVFILLTLPSSTARHIGRTIPAMIAWSVFFALVGVWAGLYLGFITNWPVTFFIATIEVVIYLVVYGIHVVKHR >CP019581|1183080:1222176|1206775_1206991_-|AZK91482.1|DBSCAN-SWA MMQILRQCIKSVWKTTITVCAIVGLAKVMVYAGMTDALAIALVSLLGPVYPLFAPLIGALGNILNRLRYFS >CP019581|1183080:1222176|1219225_1220002_-|AZK91494.1|DBSCAN-SWA 
MIEVKKRMLTQERQSLIENYVNRHEVCRVSELCDLTDTSESTIRRDLIQMEKNGMLKRVHGGAQSVKNFSRDVSQNVRFNLNHDVKIAIAKYAARNFVQPGDNVFIDAGTTTYELVPFLADIPKVKIITNGVETALCSLNHGIKTILIGGEVKDDTHAVIGQTALAQIRGMNFSVSFVGANGIEKNGNLTTPDPEEAAIKAAEIKQARHAYVLADSSKIGERNFAVFANTKDVYVVTDQLKTSQKAALPADINLEEAK >CP019581|1183080:1222176|1198791_1200333_-|AZK91472.1|DBSCAN-SWA MSLLTVKDLSQSFIDKTLYEDANFVLNKEDHMGVTGQNGVGKSTLIKILTGEIIPDEGQVKWQNKIDVGYLDQYAKLAPGVTIRGFLRTAFDDLFKKEQELNELYTKYAENSDDKLLEKAGKIQTYLEENHFYDIDTEIEQVAAGLGLADLGYDHDVSKLSGGQRSKIILAKLLLQSPDVLVLDEPTNYLDVSHIDWLVDYLNNFSGAFIVVSHDYDFLGRITNCIIDIDFGTITRYTGTLKQALRQKVANRETYLKAYANQQRKIAKTEAYIRKNKAGTRAKSAKSREKQLARMDVLTPPKNNRRAKFEFPYVATASNLLLQAQDLVIGYDHALVKSAFNFSVGGDEKVAITGFNGIGKTTLLKTLLGQLKPIYGSYELSVTAKLAYFKQDLAWPNNNMTPLQYLQEEFERKKPKELRQALARMGLTAQQAMSPLKELSGGEQEKVKLAKMQFEPANLLFLDEPTNHLDNDTKDALRKAIVNFPGGVIIVSHERDFFRGDWVDKTIDIETMN >CP019581|1183080:1222176|1184596_1185247_-|AZK91456.1|DBSCAN-SWA MASNKQLPKYLSIGQASKYLNIAIPTLRLWERKGIIKPIRTAGNQRRYTPQMLDDALAGKKPAKPVSKDKLIIGYCRVSSAGQKNDLKRQIAVVTNFCEMQGKPFKIISDIGSGLHYHKKGLKELIHLICTQQCSQVVVNYQDRLVRFGFELIEDICQENDVEITVINQTKAESSNEELVDDVLSVITVYSAKLYGKRSHHNEKIIKTNQELFKDR >CP019581|1183080:1222176|1183080_1184643_-|AZK91455.1|transposase|DBSCAN-SWA MKRSSKPIKSYSKTVRAYSLKLDEDTYTKLDQLFINYGKCRNMFLNQYCGISHMLDVKQYRKLRDQIKGSKQNKKYVAKYNFLDKHWCYALSDACANINSMWSNLANQIRRVVQANEAIDKDERHLIYYLLSIRELWYYALTNNQAVLLDLSHGLQKHLAELEEAVTPKQRKHAYSYLSRLTRRYKYTPRKHHSLNKSMTYDESMYKFTGNQLEELAISSPVSRKRFSLKLTSPWHYRLNGNLQIILDRDKKRLEIHKVIKVRQKKEYQAGTKLGIDKGLATLVSCSSGREYGKGFSQFTRPEIEQESQYLARRNPYYGYRYQLKKKLNQLKNANIPKQIIKKKQLTRQLNKLNANNIGHKRKDKRHASYHACLESKINHAIKTMIKLETPSLIVKEDLTFTKEKIAKKGSKYERQVRRNLSSWTKGILNERLEYYCQQYGIDFKDVNPAYTSQYCPNCGRHFIVRFGKHNEKTLCPNCGEMDCNIAASKNILARATDKEITLYKPYKKVKAILDQRIAS >CP019581|1183080:1222176|1187288_1187558_-|AZK91459.1|DBSCAN-SWA MLTKIIEVVLKFENPDFSGSASFASFDDLLDEIGKIAQKQQVVFVIDEYPYLAESYPGFSSLLQKYIDRNFLHSKLFLVLCGSSMSFID >CP019581|1183080:1222176|1200852_1201410_+|AZK91474.1|DBSCAN-SWA 
MKKNVHYYFSLIHFQIDDLGIHLVLPQTWQAIDLYQAIAHDRESMGKWLPWAYNMQSSADEAKFIKTIQADMVKERMIVLTILVNGEPCGMIDLHNLITNQKGEIGYWLSSKYQGRGIMTKSVLEVCKYAFSELNLKYVDLIVAVENGKSERVAKNADFKLMGIKQHLIHHHTMAGKIFRKINSN >CP019581|1183080:1222176|1211551_1212454_-|AZK91488.1|DBSCAN-SWA MKKIKSRITKSLFIMGLMGMIMLLASACSNKDQAKTKSDKVSIVTTTNVYADIAKNVAGKYGSVQAIIKNSATDPHDFEPTTADAKKLTNANIIVANGLGYDSWMNKLASSVSKKPVLVGEDLMGLKKVDNPHIWYDLDMPTKYVNYLVKRLSKLDKKHAAYFKENGDKYLAKIAKVKQIAKSADKKNNKPVFVSEPVFDYGLEEAGYKIGDKEFEEAIENGTDPSPKTINKMTNEIKDKKIAFFVNNTQASSSTVKTFVKLAKENGVPVLNVCETIPNHMTYLDWMRENYQNLANISKK >CP019581|1183080:1222176|1202167_1203409_-|AZK91476.1|DBSCAN-SWA MDQNPKRREEYRARQSSLNLHRNRAFAAESTVYKSGNMFARFIGVVALVVICFGVAWAAHMYFTIHSAVDGKGDSNVPASAKIANRQPISVLVLGVDQGIEGRHDRGNSDTMILATANPQKNSATMTSIPRDTLADIKGDPGDKYFMFRVNSAYEIGGSQSSMKTVSSMLNVPIDYYLEVNMKALRSLVNAVGGVDVNVPFDFSYDWCDFHKGKQHLNGRHAVAYVRMRKEDPRGDYGRQLRQRQVIEAIMTKAMSVNTISNYRKLIDIFNKYVKTNLTFNDMLSLVLNYRGCAKNLKSGYIQGHDAWIDGSSIQVASTKELQKISNRIRKNLGLETETLDNEETRQNDLNDQNSHIKWDDPQAFTNYQIYDQNSDKPASGSSSGYGKNPDASSSSSSSSSSSSSSGGWKFQW >CP019581|1183080:1222176|1203570_1205367_+|AZK91477.1|DBSCAN-SWA MAIPTRSEVPEDLKWDLTRIFKTDQDWENAFDKAKDEVAKLSELKGSLAKSGKDLYEGLTKILAVKRDVENIYVYATMSSDVDTSNSHYLGYVSRVQSLSNQFEAATSFINPEILSIPAEKFEQFKKDEPRLADYAHYLEMITNKRPHTLPAEEEKIIADAGDAMSVSENTFNVLTNSDMEYGYMQDEDGNMEQLSNGLYSLLIQSQNRDVRKGAFNTLYASYGQFQNSLASTLSGVVKKHNYNARMHKYDSAREAALADNGVPVEVYDTLIKEVDSHLDLLHRYVALRKKILGLKDLQMWDMYVPLTGKPALSYNFEEAKKVAKGALKPLGEDYLKHVDYIFNNRVIDPVESKGKVTGAYSGGAYDTDPYELLNWEDNIDSLYTLVHETGHSVHSWYTRHSQPYIYGDYPIFVAEIASTTNENILTEYFLDHITDPKTRAFILNYYLDSFKGTLFRQTQFAVFEQFIHEADAKGEPLTADILDDVYGQINQHYYGDSVEPGGDIALEWSRIPHFYYNFYVYQYATGFAAATALANKVVHGSQADRDAYLGYLKSGSSDYPTEIMKRAGVDMTKPDYLEDAFKTFEKRLNEFESLIGK >CP019581|1183080:1222176|1208260_1209643_-|AZK91485.1|DBSCAN-SWA 
MQTFVALIVAMALLAGMIYLHHKKWGFTKLVFLSLVVGIVYGIGLQLIFGAKSLVISQSMDWLSIVGDGYISLLQMLVIPLIFVSLVGAFTQLKVTEKIHKIAGNVLVVLLGTTAVASFIGFISVVLFHLQGANFARGLTATSQALTSIRDHQESLKNVTLALQIVSFFPKNIFADFAGMRAASTIAVVIFSLFVGVAFLSLKKEKTKLAETFARGIQALRAIIMKIVKMVLELTPYGIFALIARAAATNSFTTMGKLLIFIIAAYFAIGVMFLVHAVLLMINGINPITYFKNVWPVLVFAFTSRTSGGSLPLNVRTQKNQLGVSDTIADFSASFGLSIGQNGCAGIYPSMVAAISASVVGVNIFSWQFILTLVLIDVISSFGVAGVGGGSTFTTLMVLGALNLPVSVLGILIAIDPIVDMARTALNVNDSMLAGVITAKRLGELDFKKLNSTAQIMDYE >CP019581|1183080:1222176|1197522_1198008_-|AZK91470.1|DBSCAN-SWA MPFWLEIVINVAFSYIASVGFAITINVPHRALNSLGISGAVGWMAYWFCFHLGMGRMVSNLIGAFLIGILGLFFARRKKCPATVFNIPALVPLVPGMPAYQAVRALVADNYTLGQELILRVAIVTVAIGLGFLFSTMCIEAFYRIKYRHLKKIMLYMKRKR >CP019581|1183080:1222176|1210872_1211520_-|AZK91487.1|DBSCAN-SWA MILDVNDLSMRFDTKTVFQDINFHLEKGSMTALLGPNGTGKTTLIKILMGMLTPTDGSYNFSSDVKLGYVPQFRNIDSEYPLSIRAFVGLNAPLFKTKKVKDQIDHQLVETHLDPIEDTRMGEASGGQKQRAYLAQALLDNPNMIILDEATASLDPAAKEELMGLIKHLNQEHKITVLFVTHDIPLAKKYMTDYLYLNKGSIRQGKMSEFKEDFE >CP019581|1183080:1222176|1195184_1195934_+|AZK91468.1|DBSCAN-SWA MKNKKINLKYLFLIIPIGLIFVLSLTYLTNKNQIDDELNWMTMKEKQKLNVPLDNQLPDLPNGCEVTSLAMLMNYYGIKVSKNELAQNIQHVDSFTDNGKYRGNPNQGFVGHMTVANAGWCVYNGPLYNVARKYTNHIVNASDSNFLKVLKLVSDGHPVLIITTTTFSRVNNMQTWETNAGKVNVTPSSHACVITGYNKKKKIVYLNNPYGFKNQAVNWHKLEQSYDQQGRQALYMNYTGTEVPGFWEH >CP019581|1183080:1222176|1189426_1190941_+|AZK91463.1|DBSCAN-SWA MKEKTRTVLFWFILIQPFLDLYWFYHGKLADVLPFTLPTIIRILAVFVIFCMFFSQKQNWQKLGKNKWLLFYLALLIIYSALHLLHVKYFNSVNPNDYNYSTVSEIFYLIRMILPLLVIFFTTELDFTRDQFWHVIEGISGLFSFTIVISNLFVISLRSYETGPISANIFEWFFNPNIGYSHMASKGFFNFANMVSAVLFMLVPLMLYFMFSHFNWKTVTLNIVQALAMIELGTKVALIGLIGGIIIGILLYVFHLFIVKDVTKNVKAILVALLIEIGAMAIIPFGPAIQRYNYEKFLAQQSDNSLTVAKKELAAGLKKYPSGKKRKQFLTEFIEEHYQDYALNKKFVTKSYPYKYDPKFWLKIMNESGTARMQNRHVEKAMLDQVVKTNNNKLDKFLGISYTRETNIFNLERDFTSQIYSLGWIGMLLFVGPYVAIMLYAFVKWLMNKKKHTYLISSMLLSIAFMLFAAFSSGNVMDFLTASFILAFVEGGLLVEIKDELHRH >CP019581|1183080:1222176|1220480_1220696_-|AZK91496.1|DBSCAN-SWA MLGLDQSDFLIGLLTNIANNKKLPFSKLTNEEEEKAILAAKLSALTTSWGNIPELKNLQQLKEWLNEERND |
43 | Lactobacillus_virus(40.0%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
2043706 : 2052473
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP019581|2043706:2052473|DBSCAN-SWA CATGTCTGAAATAAGCGTAATTATGGGTAGTAAGTCAGATTGGCCAACGATGAAGCATGCGTGTGAAATCTTGGATCAGTTCAATGTTAGCTATGATAAACATGTTATCTCAGCCCATAGAATGCCAAAGGAAATGTACGATTTTGCCCAAGCCGCTGAAGAAAAAGGCACCAAGGTGATCATTGCTGGTGCAGGTATGGCAGCGCATTTGCCTGGGATGACAGCCGCTAATACCGTGATTCCAGTAATTGGTGTTCCTGGACAAACTAAAGCATTGGGAGGGATGGATTCTCTTTTATCGATTGTTCAAATGCCAACCGGTATCCCTGTAGCTACCACTGCCATTGGTAATGCTGGTGCAAGCAACGCGGCACTTCTTGCACTAGAAATTTTGGGAATTAGTGACGAAAAAATAAGACAGCAATTGAAAGACTATCGCCAAAAGATGCATGATGAAGCAAAAGAAAGCAGTGCGGAACTTGATTGATCCAGATTTTATTGAACAAGGAAAGACTATTGGCATCATTGGAGGCGGACAACTAGGTCAAATGATGGCTTTGTCAGCCAAGTATGGTGGAATGAAAGTCATCACCTTGGATCCAACGCCAGACTGTCCTTGTGGTCAGGTTGCTGACAAGCAGATTGTGGCCGAATATTCAGACATTAATGCAATTGAAGAGCTGGCTAAAGAGAGCGACGTATTAACGTATGAATTTGAAAATGTTGACCTTCAAGCTTTAAAGGATGTTGCTGACAAAGTAAAAATTCCTCAAGGCACCAATCTGCTCTATATTACTAAGCATAGACTGCGTGAAAAGAACTTCTTGCGTAGTGCTGGTTGTATGACCGCACCTTATCTTCCTGTTAAAAACATGGCTGATTTAAAAGAAGCGATTAAAAAGATTGGCTATAACTGTGTCCTTAAAACCTGTGAAGGTGGCTACGATGGCCACGGCCAAGAAGTTTTAAAGAGCGATGCCGATCTGAAAAATAATGCGCATATCAAAGAAATTTTAGCTACAGGAGACTGCATTCTTGAAGGCTGGGTGCCATTCAAAGTTGAAGCTTCTGTCATGGTCGCAAGAAATGCAAAAGGCGAAGTGACAGTTTTCCCAGTTAGTGAAAACTACAATGCCGATGAAATTTTGCATATTAGTATTGTGCCAGCACGTGTTTCAAAAGAAATTTATGCTAAAGCACAGGCTATTGCTAAGCAAATTGCAGAAGCAATTCATTTGTGCGGTATTTTGGGTGTGGAGCTATTTATTCTGGATAACGGTGATATTTATGTCAATGAACTAGCACCACGTCCGCATAATTCAGGTCACTATTCGATTGAAGCCTGCAATTTTGACCAGTTTGATATCCATGATCGTGCTATTTGCAATTGGCCGTTGCCTAAGATTAAGCTGTTGTCCAAAGTGGTAATGGTGAATGTTTTGGGTCAACATGTAGCCGGCGTAAGAAAAGTCATTCCTGAAAAGAGCAACTGGCATTTTCACTACTATGGCAAGGCCGAAATCCGTCATAATCGTAAAATGGGCCACGTTACAATTCTTACTGACGATATTGAAAAAACGTTGCAAGAAATTAATGACACCCATATTTGGAAAACAAAAGTAAATTAATCTTGTTTAAGCAGAGCTATTAAAGCTCTGTTTTTTATTTATCATTTTCACTAAAACACGAACAATACTTCGTGATAGTATTTTTAATGCTCTATAATATATTGACATTATAAGCAATAGTTGATAATATTGTCAGCATAAAGTGAACAAATTAGTATTTGATCTTTAATAATATTCGTGTTTTTACCAAGGAGAAAAATAAAATGGCAAAGTTGTTGTATACCGGTAAAGTTAAGGAAATGTGGTCTACAGATGACCCAGAAGTTTTACGTGTGGTTTATACAGACCAGGCAACTGC
CGGTAACGGCGAGAAGAAGGATGACTTTAAGAACAAAGCCTATTTAAACAACGAAATTTCAACTTTGATTTTTGAATACTTGTCCAAAAACGGAGTTCCAACTCATTTCATCAAGAAAATCTCAGACACTGAAGAATTAGTTAAAAAGTGCGACATGTTCCCACTTGAAGTTGTTACTAGAAATATTGCAGCTGGCCATTTCTCAAGTCGTTATGGCTTGGGCGAAGGTGAAAAATTCGATACGCCAGTCGAAGAACTTTTTTATAAGAGTGATGAGCTAGACGACCCAATCATGAACGAATCTGATGCGATTGCATTACATATTGCGACCGCTGAAGAGCAAAAGAAGATCTGGGAATTGTCTCGTCAAGTTAACAAGCTTTTGATTCCGCTTTTTGATAAGGCTGGAATGGAATTAGTCGATTTCAAACTTGAATTTGGTAAAGATGCAGATGGAAATATCATCCTTGCAGATGAATTTTCACCAGATAACTGCCGTCTTTGGGATAAAAAGACCAAGAAGCACATGGATAAAGATGTTTACCGTAGAGACATCGGTGATTTAACCACTGTTTATGAACAAGATTTAGCCCGCATCAAGCAAGCACTTAAGGAGATCTAAATGACTACTGTCCGTATTTACGTAACCTACAAGCCTTCAGTTTTTGATCCACAAGGAGCCGCAATCACCGATTCCGTTAATTCCCTTGGTCATGACGAAGTTAAGAAAATCGTTGTTGGTAAGTTCTTCGATGTAACGCTAGATGAAGATGACTTAGATAAGGTAAAGGCCGATGTTAAAGATATCGCCGAATCTTTACTGGTTAACTTCAATATGGAAACTTACAAGATTAAAGTTTTGGAGGAAAATGCATGAAAATTGCAGTAGTAGTTTTTCCAGGCTCTAATTGCGATATCGACATGTACGAAGCCTTTCACACTGTTTGCAAAGCAGATGTCGAATACGTTTCTTACAAAGAAAAAAGCTTAGATGGCTTTGATGCAGTTGTTTTACCAGGTGGCTTTTCTTACGGCGACTACTTAAGAACTGGTGCAATCGCTCGTTTTTCAAATATCATCCCAGCCGTTAAAAAGATGGCAGATGAAGGAAAATTAGTATTAGGTGTCTGCAATGGTTTCCAAATTTTAACAGAAATGGGACTTCTACCTGGAGCACTTAAGAAAAACGATAGTCTTCAATTCGTGTGCAAAACCGTAACGCTAGAAGTTGAAAATACACATACGCCATTTACTACTGAGTATGAAGATAAAGAACTTATCCGTATTCCTATTGCTCATGGCGACGGCAGCTATTACGCAGACGAAGATGTGCTTGAAGAATTAGAAAATAATCATCAAGTTGTCTTTAGATATCATGGTGAAAATCCTAACGGTAGTCTTCATGATATTGCTGGTATCTGTAACAAAGAAGGCAATGTCTTGGGCATGATGCCTCACCCTGAAAGAGCCGTTGAAGAAATTCTCGGCGGAACAGATGGATTGCCTTTATTTAAGTCACTTTTAAAAGCAGGAGTTCAAGCATGAAACAAGCGATGACACCAGAAGAAATCAAAGAAAAGAAACCTTACCTTGATTGGAGTTTATCAGAACGTGAATATGACTATATTTGCGATCATTTGCTTCATCGCTTGCCCAACTACACAGAAATCGGCTTGTTCTCCGCAATGTGGAGTGAACACTGTTCTTACAAAAAATCTAAGCCAGTTTTGAAGCTTTTCCCAACCAAGGGCAAGCGTGTTGTTCAAGGTCCAGGTGAAGGTGCTGGTGTAGTTGATATTGATGATGGTCAAGCAGTTGTATTTAAGGCTGAAAGTCACAACCACCCAACTACAGTTGAACCTTACCAAGGTGCAGCCACTGGTGTTGGCGGTATTTTAAGAGACGTTTTCAGTATGGGCGCTCGTCCAGTCGCAATTCTTGACTCACTTCACTTTGGTGAATTAAAAAATAACTCAACAATGCGCTACAAGATGGAAGAA
ACCATCAAAGGCGTCGGCGACTACGGCAATTGCATGGGAATTCCTAACTTAGGTGGCGAAACTACATTTGATCCTTGTTACAACGGCAACATTTTATTAAATGCGATGAATGTCGGCATTATGGATATCAAAGATATGGAGCATGGGGATGCTACAGGTGTTGGCAATGCGGTAATGTACGTCGGTGCTAAAACCGGGCGTGATGGTATTCACGGCGCTACCTTTGCTTCAGCCGACTTTTCTGAAGAACATGCGACCCAACGTTCAGCGGTTCAAGTTGGTGATCCATTTATGGAAAAATTGTTGATGGAAGCTTGCTTGGAGCTGATCTTGAATCATCGTGAATGGCTTGTTGGTATTCAAGACATGGGAGCAGCCGGCATCGTTTCTTCAAGTGCTGAAATGGCCACTGAAGGTAAATCCGGGATGGATCTCAATCTTAACTTGGTACCTCAGCGTGAACCTAACATGTCAGCTTACGAGATTATGCTTAGCGAATCGCAAGAAAGAATGCTCCTTTGTGTAAAGAAGGGCCATGAAGAGGACGTTAAAAAGATCTTCGATGAATTTAATCTTGACGCTGTAACTATTGGTCGCATTACTGATGATGGTAGATACGTGCTTCATCACGATGATCAAGTGGTATGTGATATTCCAGTGGTAACTTTAACTGAAAAAGTCCTTGAAGAAAAAAGTGCAGAAAAGAAACCACAAAGAATCATTGATGCTGAGCAAAGTGAAAATTGGCAACCAAATATTGAAAGTGCCGGTCAAACTTTAAAAGATCTTTTGAACCAGCCAACCATTGCTAACGATCAATTTGTAACGCAACAATATGATTCACAGGTTCGAACGGACACGATTGTAGGTCCTGGTTCAGATAGCGGAGTCTTGCGTGTGCGCCATACTAAGAAGGCAATTGCGATGACTACTGATACTAATGGACGCTTTGTCTATCTTGATCCAAAAGTTGGTGGTCAAAGAACTGTCCTTGAAAGTGCAACAAATATCGTGGCTAGTGGTGCCCAACCTTTAGCAATTACTGACTGTCTTAACTATGGTGACCCTAATGATCCAGAAATTTTCTGGGAATTGCATCAATCTTGTCAAGGGATTGCCGATGCATGTGAAATCTTAGAAACACCGGTTGTTTCAGGAAATGTCTCTCTTTATAACGAAAATAATGGCAAGGCAATTTATCCATCACCAATGATTGGGATGGTTGGTTTAATCAAGGATTATGATCACGTCATTCCAATGCACATGCAAAAAGCAGGCGACAAGATTTATCTAGTGGGTAAGACAGATGATGATTTTGCCGGCTCAGAATTGCAAAAGATGATCACTGGTGAAATTAGCGGATTGCCACATGCGCCAAACTTGCCAGAGATAAAGCAATACCTTTACAAATTGCAAGCATTAATGGCTAACGGCTTAGTCAAAAGTGCACACGATCTAAGCGAAGGTGGCTTAGGCGTTTCCTTAGCTGAAACACTTTTTGATACAGACTTTGGTGCAAAGGTAAAACTTGATTTTGATAAAAATCTGCTTTTCAGTGAGACTCCAGGTCGCTTGATCGTTTCAGTTGATCCTGCAAATGCGACAAAATTTGAACAAGAAATGGGCGATTCCGTTAGTGAAATTGGTCAAGTTACTGCTGATCAGCAATTGAACATTTCGCTGGCCAACGATCAAGTAAATGAAGATGTAGCTGAATTGCAAAAGATTTGGAAGGAGTGCATCCCTTGTTTAATGAAATCAAAAGCCTAAATGAGGAATGTGGCGTCTTCGGTGTTTTCGGCGCTCCTGATGCTAGTCAATTAACATATCTGGGTTTGCATAACCTGCAGCACCGCGGACAAGAAGGGGCCGGCATCGTCTCAAGCGATGGTGAGCATTTGTATCAACACCGCGATCGTGGCCTTTTATCCGATGCTTTTGCTGATCCGAATGATTTAAAAAAATTGGTCGGCGACAGTGCCATTGGTCACGTTCGCTAC
AGTACAACGGGGCGTAATTCGATTCAAAACGTACAACCATTTCTCTTTCATTTTCTAGACGGTGATGTTGCTTTAGCCCACAACGGCAATTTAGTTAATGCCGTGTCTTTACGAAACAAGCTCGAAAAACAAGGTGCGATTTTTCAATCCTCATCTGATACCGAAATCTTGATTCACTTGATTCGTAATCATATCAAAGATGGCTTTATTTCGGCTTTAAAGCAAAGCCTGAATGAAGTTCACGGTGGTTTTGCCTTCCTGCTTTTGCAAAAAGATCGCATGATTGCGGCACTTGATCCTAATGGGATTCGTCCCTTATGTATTGGTAAATTGGACAATGGCGCCTACGTTGTTTCTAGCGAAACTTGTTCACTTGACATCATCGGTGCCAAGTTCGTTCGTGATGTGCAGCCTGGTGAATTAATTATCATTGATCGAGATGGGATGAAAATTGATCATTTTACTAAAAATACGCATTTAGCAATCTGCTCAATGGAATACATCTATTTTGCTCGTCCTGATTCAATCATACATGGTGTGACTGTTCATAATGCTAGAAAAAGAATGGGGCGGCTGCTAGCTCGTGAGGCTCCAGCCGATGTTGACATGGTAATTGGCGTACCAAACTCATCTTTGTCAGCTGCTTCTGGTTACGCCGAAGAATTGGGGTTGCCTTATGAAATGGGTCTGGTTAAAAACCAATACGTTGCCAGAACCTTCATTCAGCCAACTCAAGCCTTGCGTGAAAAGAGTGTTAAGTTAAAACTGTCAGCTGTTCGTGGTGTCGTTGCTGGTAAAAAGATCGCGGTCATCGACGATTCGATCGTAAGGGGAACCACTTCTAAGCAAATTGTCAAAATGCTTAAAGATGCTGGTGCTAAAGAAGTACACCTTAGAATCGCTAGTCCTCCATTTAGATTTCCATGTTTTTACGGCATTGATATTTCAACTAGAGCCGAATTAATGGCAGCTCATTATTCAGTAGAAGAGATGCGGAAAATTATCGGTGCTGATAGCCTTGGCTTTTTAAGCGTTGATAGCTTGATCAAAGCAATCGATGTGCCTGATCGAGGAGACTCATCGGGTCTGACAGTTGCTTACTTTAATGGCAAATATCCAACAAAATTGGATGACTACGAAGCTGGCTATTTAGCTTCTCTCAATGCTCAAGAACGGCGACAAAGAAAGGATTTATAAAATGAATCGTTATAAAGAAGCAGGTGTTGACGTCACTGCTGGTTACGATTTAGTCAATCGAATTAAGCCGATGGTAGCTGCAACTAAGCGTAAAGGCGTGATGGGCGGCATTGGCAGTTTTGGCGGGATGTTCGACTTAGAAGAATTAGATTATAAGCATCCCGTTCTTGTTTCAGGCACGGATGGTGTAGGTACCAAATTAATGATTGCGCAAAAGATGCACCAAAACGACACGGTCGGCATCGATTGCGTGGCCATGTGCGTTAACGATGTGCTAGCCCAAGGCGCTGAACCTTTATTTTTCTTAGATTACATTGCATGCGGACATAACGATCCAAAGAAATTGGCTGACGTAGTTAAGGGCGTGGCTGAGGGATGCAAGCAAAGCGGCTCTGCCTTAATTGGTGGTGAAACCGCGGAAATGCCCGATATGTATGAACCCCACGAATATGATTTGGCTGGTTTTTCAACTGGGATTGCGGAAAAAGAGGACTTATTATCTAGTTCTTTAGCCAAAGCAGGCGATCATTTAATTGCTTTGCCATCGAGTGGCGTCCATTCTAATGGTTTCAGTTTAATTAGAAAAATTTTGTTCAAAGATCACGACGTTTCATTAAGCGACAAACCAGCAGAATTAGCGGGCAAAACCGTAGGCGAAACACTCCTTACGCCAACCAGAATCTATGTCAAAGCAGTTTTGCCACTAGTAAAAAAGCATTTAGTTCACGGCATTGCCCATATTACTGGTGGCGGCTTTATCGAAAACCTACCTCGAATTTATGGCAATGACTTGCAAGCAGTGG
TTAATAAAGGTTCGTGGCCTGTTTTACCAATTTTTGATTATCTGAAAAAACTAGGCGATTTAGATGAACGAGATTGCTACAACACTTTCAATATGGGCGTTGGTTTAGTTTTAGCCGTTCCAGTAGAAAACGTTGCTGCAGTCAAAGAACAACTAAATCGCGAGAATGAAAAATTCTACGAAATTGGTCAATTAGTTGCACGCCCCGAAGGGGAAGAAAAGATCGTAATCAAATAGAGGAGAAAAATGAGAGTTGCTATTTTAGCTTCTGGGAATGGAACCAACTTTGAAGCTTTAACTAAAAAGTTTCAAGCTGGCGAAATTCCCGGAACCGAGGCTTTAATGTTTTGCAATCATCCAAATGCCCCGGTAGTGAAAAGAGCCGAGAGATTAGGCATCCCGCATGAAGCATTTTCCGTTAAAGAATGCGGCGGCAAGACGGCTTATGAAAAGCGCCTGCTAAAAGTTTTGCAGGATTATCAAATTGATTTCATTGTTTTATCCGGATATCTTAGGGTGGTTGGACCCACGATTTTAAACGAATATCCCAATGCAATTATCAACTTGCATCCCGCCTTATTGCCAAGTTATCCCGGCCTCAATTCAATCGAAAGAGCTTTTGAAGATTACAAGCAAGGTAAGATAAAAGAAACCGGCGTGACGGTGCATTTTATCGATGCCCATTTGGATCATGGGCCTATCATTGCTCAACAGGCTGTCCCAATTTATCCAGATGATACGGTTGAGACGCTTGAGGCAAGAGTCCATGAAACAGAACATCAACTATTTCCAGCTACGTTAAAAAAAGTATTAAGTCAAAGAATGGAAAAGGAAGAAAACTAA
Protein sequences of DBSCAN-SWA_2 >CP019581|2043706:2052473|2045536_2046253_+|AZK92328.1|DBSCAN-SWA MAKLLYTGKVKEMWSTDDPEVLRVVYTDQATAGNGEKKDDFKNKAYLNNEISTLIFEYLSKNGVPTHFIKKISDTEELVKKCDMFPLEVVTRNIAAGHFSSRYGLGEGEKFDTPVEELFYKSDELDDPIMNESDAIALHIATAEEQKKIWELSRQVNKLLIPLFDKAGMELVDFKLEFGKDADGNIILADEFSPDNCRLWDKKTKKHMDKDVYRRDIGDLTTVYEQDLARIKQALKEI >CP019581|2043706:2052473|2046504_2047176_+|AZK92330.1|DBSCAN-SWA MKIAVVVFPGSNCDIDMYEAFHTVCKADVEYVSYKEKSLDGFDAVVLPGGFSYGDYLRTGAIARFSNIIPAVKKMADEGKLVLGVCNGFQILTEMGLLPGALKKNDSLQFVCKTVTLEVENTHTPFTTEYEDKELIRIPIAHGDGSYYADEDVLEELENNHQVVFRYHGENPNGSLHDIAGICNKEGNVLGMMPHPERAVEEILGGTDGLPLFKSLLKAGVQA >CP019581|2043706:2052473|2051876_2052473_+|AZK92334.1|DBSCAN-SWA MRVAILASGNGTNFEALTKKFQAGEIPGTEALMFCNHPNAPVVKRAERLGIPHEAFSVKECGGKTAYEKRLLKVLQDYQIDFIVLSGYLRVVGPTILNEYPNAIINLHPALLPSYPGLNSIERAFEDYKQGKIKETGVTVHFIDAHLDHGPIIAQQAVPIYPDDTVETLEARVHETEHQLFPATLKKVLSQRMEKEEN >CP019581|2043706:2052473|2043706_2044192_+|AZK92326.1|DBSCAN-SWA MSEISVIMGSKSDWPTMKHACEILDQFNVSYDKHVISAHRMPKEMYDFAQAAEEKGTKVIIAGAGMAAHLPGMTAANTVIPVIGVPGQTKALGGMDSLLSIVQMPTGIPVATTAIGNAGASNAALLALEILGISDEKIRQQLKDYRQKMHDEAKESSAELD >CP019581|2043706:2052473|2046253_2046508_+|AZK92329.1|DBSCAN-SWA MTTVRIYVTYKPSVFDPQGAAITDSVNSLGHDEVKKIVVGKFFDVTLDEDDLDKVKADVKDIAESLLVNFNMETYKIKVLEENA >CP019581|2043706:2052473|2044154_2045333_+|AZK92327.1|DBSCAN-SWA MMKQKKAVRNLIDPDFIEQGKTIGIIGGGQLGQMMALSAKYGGMKVITLDPTPDCPCGQVADKQIVAEYSDINAIEELAKESDVLTYEFENVDLQALKDVADKVKIPQGTNLLYITKHRLREKNFLRSAGCMTAPYLPVKNMADLKEAIKKIGYNCVLKTCEGGYDGHGQEVLKSDADLKNNAHIKEILATGDCILEGWVPFKVEASVMVARNAKGEVTVFPVSENYNADEILHISIVPARVSKEIYAKAQAIAKQIAEAIHLCGILGVELFILDNGDIYVNELAPRPHNSGHYSIEACNFDQFDIHDRAICNWPLPKIKLLSKVVMVNVLGQHVAGVRKVIPEKSNWHFHYYGKAEIRHNRKMGHVTILTDDIEKTLQEINDTHIWKTKVN >CP019581|2043706:2052473|2050829_2051867_+|AZK92333.1|DBSCAN-SWA 
MNRYKEAGVDVTAGYDLVNRIKPMVAATKRKGVMGGIGSFGGMFDLEELDYKHPVLVSGTDGVGTKLMIAQKMHQNDTVGIDCVAMCVNDVLAQGAEPLFFLDYIACGHNDPKKLADVVKGVAEGCKQSGSALIGGETAEMPDMYEPHEYDLAGFSTGIAEKEDLLSSSLAKAGDHLIALPSSGVHSNGFSLIRKILFKDHDVSLSDKPAELAGKTVGETLLTPTRIYVKAVLPLVKKHLVHGIAHITGGGFIENLPRIYGNDLQAVVNKGSWPVLPIFDYLKKLGDLDERDCYNTFNMGVGLVLAVPVENVAAVKEQLNRENEKFYEIGQLVARPEGEEKIVIK >CP019581|2043706:2052473|2049376_2050828_+|AZK92332.1|DBSCAN-SWA MFNEIKSLNEECGVFGVFGAPDASQLTYLGLHNLQHRGQEGAGIVSSDGEHLYQHRDRGLLSDAFADPNDLKKLVGDSAIGHVRYSTTGRNSIQNVQPFLFHFLDGDVALAHNGNLVNAVSLRNKLEKQGAIFQSSSDTEILIHLIRNHIKDGFISALKQSLNEVHGGFAFLLLQKDRMIAALDPNGIRPLCIGKLDNGAYVVSSETCSLDIIGAKFVRDVQPGELIIIDRDGMKIDHFTKNTHLAICSMEYIYFARPDSIIHGVTVHNARKRMGRLLAREAPADVDMVIGVPNSSLSAASGYAEELGLPYEMGLVKNQYVARTFIQPTQALREKSVKLKLSAVRGVVAGKKIAVIDDSIVRGTTSKQIVKMLKDAGAKEVHLRIASPPFRFPCFYGIDISTRAELMAAHYSVEEMRKIIGADSLGFLSVDSLIKAIDVPDRGDSSGLTVAYFNGKYPTKLDDYEAGYLASLNAQERRQRKDL >CP019581|2043706:2052473|2047172_2049401_+|AZK92331.1|DBSCAN-SWA MKQAMTPEEIKEKKPYLDWSLSEREYDYICDHLLHRLPNYTEIGLFSAMWSEHCSYKKSKPVLKLFPTKGKRVVQGPGEGAGVVDIDDGQAVVFKAESHNHPTTVEPYQGAATGVGGILRDVFSMGARPVAILDSLHFGELKNNSTMRYKMEETIKGVGDYGNCMGIPNLGGETTFDPCYNGNILLNAMNVGIMDIKDMEHGDATGVGNAVMYVGAKTGRDGIHGATFASADFSEEHATQRSAVQVGDPFMEKLLMEACLELILNHREWLVGIQDMGAAGIVSSSAEMATEGKSGMDLNLNLVPQREPNMSAYEIMLSESQERMLLCVKKGHEEDVKKIFDEFNLDAVTIGRITDDGRYVLHHDDQVVCDIPVVTLTEKVLEEKSAEKKPQRIIDAEQSENWQPNIESAGQTLKDLLNQPTIANDQFVTQQYDSQVRTDTIVGPGSDSGVLRVRHTKKAIAMTTDTNGRFVYLDPKVGGQRTVLESATNIVASGAQPLAITDCLNYGDPNDPEIFWELHQSCQGIADACEILETPVVSGNVSLYNENNGKAIYPSPMIGMVGLIKDYDHVIPMHMQKAGDKIYLVGKTDDDFAGSELQKMITGEISGLPHAPNLPEIKQYLYKLQALMANGLVKSAHDLSEGGLGVSLAETLFDTDFGAKVKLDFDKNLLFSETPGRLIVSVDPANATKFEQEMGDSVSEIGQVTADQQLNISLANDQVNEDVAELQKIWKECIPCLMKSKA |
9 | Synechococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
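The protein records in the prophage regions above carry their coordinates in the FASTA header. The following is a minimal sketch of how such a header could be split into fields and checked against the keyword list from the legend. The field meanings (contig, region `start:end`, gene `start_end_strand`, accession, optional annotation, tool name) are inferred from the headers shown here, and the case-insensitive substring match is an assumption, not the documented behavior of the annotation pipeline. `ProphageProtein`, `parse_header` and `has_phage_keyword` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional

# Keywords copied from the legend; lower-cased for matching (assumption).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


@dataclass
class ProphageProtein:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    annotation: Optional[str]


def parse_header(header: str) -> ProphageProtein:
    """Split a header such as
    '>CP019581|2043706:2052473|2045536_2046253_+|AZK92328.1|DBSCAN-SWA'."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, gene, accession = fields[:4]
    # The last field is the tool name; anything between the accession and
    # the tool name is treated as a free-text annotation (may be absent).
    annotation = "|".join(fields[4:-1]) or None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand, accession, annotation,
    )


def has_phage_keyword(protein: ProphageProtein) -> bool:
    """True if the annotation contains any keyword (assumed substring match)."""
    text = (protein.annotation or "").lower()
    return any(keyword in text for keyword in PHAGE_KEYWORDS)
```

Applied to the two header shapes that occur in this report, the record with the `transposase` annotation is flagged and the unannotated record is not.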
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP019583_1 | 4548-4703 | Orphan |
NA
Consensus repeat of CP019583_1
|
1 spacers
spacers of CP019583_1
>1.1|4584|84|CP019583|CRISPRCasFinder CTTCGCGGGAGACCGTTGAAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGACCTTCGCGAAAGTCCGTTGATGACAACCCGC |
CRISPR arrays and Neighbor proteins around CP019583_1
The CRISPR arrays of CP019583_1 >merge|CP019583|1|4548-4703|CRISPRCasFinder AAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGACCTTCGCGGGAGACCGTTGAAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGACCTTCGCGAAAGTCCGTTGATGACAACCCGCAAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGAC >CP019583|1|1|4548-4703|CRISPRCasFinder AAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGAC CTTCGCGGGAGACCGTTGAAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGACCTTCGCGAAAGTCCGTTGATGACAACCCGC AAACCCTTGCTACGAGCGAAAGTCTAAAAAATAGAC
>CP019583.1|AZK92453.1|2969_3533_-|hypothetical-protein MAKTIKQLADELKVSKQTIQYHYQRLPAKNQQKNSQGTNLISTTAERIIRSKVAKPLLANKQQIGSKEPTKTSKENNDLIITLRREVEDLKSQRDKQLAAKDQQISSKDRQIDHLTKLIDQQQQLQLAIVAENRQLKEHVQKLSGLLEPSSTTQQQQSNDKDDALSNSEKQKRMHKNKPNKNWWHFW >CP019583.1|AZK92452.1|2613_2841_-|hypothetical-protein MLNQQLAIKDSQIKEKDEQLNSMQKLLDQSQQLQLMTEKKVEELETTTSIKIKYDNLNLKQKSNAWWKFWLKNKT >CP019583.1|AZK92451.1|1386_2145_-|hypothetical-protein MDNKSNKDALSLIKKNQAETGYEGSIQEYEQLYSLFVNHGYKQAQVAFARLSDISNYLPLLKSGPLSLYILYVLKANNDRGSSFWSIDALAKKLQTTTKSITNWNTKLIDLGLIRRLKGLGKSTTTVLLPTSPIIINKKTQESIKLLEKINYKLQAYVIFKKDNKYITYRFFESNIKYKINNPIIILSVEEDMISDISDSQTAYTDIDAVDGSKIDEITQLLLETEISGSESTKTKEKIKLLVDLYQTYLRK >CP019583.1|AZK92450.1|959_1262_+|mRNA-interferase-YafQ MQIKQTKSFERELKKLVKKHFPITVLKPCLEAIVEQDVLVLKQIKDHALKGNWRGYREFHPARYGNYGKNYDNWIVIYQLDHDELILLLVATGSHEILNQ >CP019583.1|AZK92449.1|691_970_+|hypothetical-protein MSNTIIKNKTISTRVTPDISERAKANLAKQGLTVSEYIRLSLVKAANNEVRLVSFLDSPEALAAKKEAETGQVKNIGSLTDFEDWIDKLDAN >CP019583.1|AZK92448.1|0_588_-|site-specific-tyrosine-recombinase-XerC MQQVVLPIKDSNVLKEVQDTLLNNFKAGRRNYTIFQVGKATLLRVSDVMGLKQADIFNPDGSIKQNAFIHDRKTGKPNTLYLKPVQTELLLYRQWLLDHKLDSEWLFPSIQHPERHITEKQFYKIMSKVGDLLGINYLGTHTMRKTGAYRVYTQSNYNIGLVMHLLNHSSEAMTLAYLGLDQASTENMLNQIDFG >CP019583.1|AZK92454.1|5434_5629_+|hypothetical-protein MTKALRRYFNALRSNEKHIKNVENYLYGTMTNLFGIYWNKLAGAKYRAQHPEEFKNQEALSDWL >CP019583.1|AZK92455.1|5912_6065_+|hypothetical-protein MSKKIWMIIFRYLLAIGLFLAYSAIFDNQDHDLIIKYCFCYMSWPCYYSI |
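The array coordinates and the spacer header together determine where each repeat and spacer sits on the contig. The sketch below works this out for CP019583_1, assuming that the second and third fields of the spacer header (`4584` and `84`) are the 1-based start and the length of the spacer, and that all coordinates are inclusive. `array_layout` is a hypothetical helper, not part of CRISPRCasFinder.

```python
from typing import List, Tuple


def array_layout(
    array_start: int,
    array_end: int,
    spacers: List[Tuple[int, int]],
) -> List[Tuple[str, int, int]]:
    """Return (kind, start, end) segments for a CRISPR array.

    spacers is a list of (start, length) pairs in 1-based inclusive
    coordinates; everything between spacers is labelled as a repeat.
    """
    layout = []
    cursor = array_start
    for start, length in sorted(spacers):
        if start > cursor:
            layout.append(("repeat", cursor, start - 1))
        end = start + length - 1
        layout.append(("spacer", start, end))
        cursor = end + 1
    if cursor <= array_end:
        layout.append(("repeat", cursor, array_end))
    return layout


# CP019583_1: array 4548-4703 with one spacer starting at 4584, length 84.
for kind, start, end in array_layout(4548, 4703, [(4584, 84)]):
    print(kind, start, end, end - start + 1)
```

Under these assumptions the array splits into a 36 bp repeat (4548-4583), the 84 bp spacer (4584-4667) and a second 36 bp repeat (4668-4703), which matches the repeat-spacer-repeat listing above.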
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP019583 | Lactobacillus helveticus strain LH5 plasmid pCBTLH5_2, complete sequence | 4584-4667 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NC_014386 | Lactobacillus helveticus R0052 plasmid pIR52-1, complete sequence | 1748-1831 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP017358 | Lactobacillus plantarum strain TMW 1.25 plasmid pL125-4, complete sequence | 4776-4859 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP014934 | Pediococcus claussenii strain TMW 2.53 plasmid pL253-1, complete sequence | 5879-5962 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP014937 | Pediococcus claussenii strain TMW 2.54 plasmid pL254-1, complete sequence | 5880-5963 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP014913 | Lactobacillus paracollinoides strain TMW 1.1979 plasmid pL11979-1, complete sequence | 22508-22591 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP017368 | Lactobacillus plantarum strain TMW 1.277 plasmid pL1277-5, complete sequence | 5770-5853 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP017265 | Lactobacillus paracasei strain FAM18149 plasmid pFAM18149.24, complete sequence | 26504-26587 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP017960 | Lactobacillus plantarum strain C410L1 plasmid unnamed6, complete sequence | 6891-6974 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NZ_CP017960 | Lactobacillus plantarum strain C410L1 plasmid unnamed6, complete sequence | 20910-20993 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | NC_006529 | Lactobacillus salivarius UCC118 plasmid pSF118-20, complete sequence | 4742-4825 | 24 | 0.714 |
CP019583_1 | 1.1|4584|84|CP019583|CRISPRCasFinder | 4584-4667 | 84 | MK994179 | Lactobacillus plantarum strain PC518 plasmid plp75TA, complete sequence | 4831-4914 | 25 | 0.702 |
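The Identity column in the table above is consistent with the fraction of spacer positions that are not mismatches, rounded to three decimals. The short sketch below reproduces the two values that appear in the table; the formula is inferred from the numbers shown, and `identity` is a hypothetical helper rather than a function of the tool that produced the report.

```python
def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions that match, rounded to 3 decimals."""
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    return round((spacer_length - mismatches) / spacer_length, 3)


print(identity(84, 24))  # 0.714
print(identity(84, 25))  # 0.702
```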
1. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP019583 (Lactobacillus helveticus strain LH5 plasmid pCBTLH5_2, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
2. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NC_014386 (Lactobacillus helveticus R0052 plasmid pIR52-1, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
3. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP017358 (Lactobacillus plantarum strain TMW 1.25 plasmid pL125-4, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
4. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP014934 (Pediococcus claussenii strain TMW 2.53 plasmid pL253-1, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
5. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP014937 (Pediococcus claussenii strain TMW 2.54 plasmid pL254-1, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
6. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP014913 (Lactobacillus paracollinoides strain TMW 1.1979 plasmid pL11979-1, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
7. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP017368 (Lactobacillus plantarum strain TMW 1.277 plasmid pL1277-5, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
8. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP017265 (Lactobacillus paracasei strain FAM18149 plasmid pFAM18149.24, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
9. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP017960 (Lactobacillus plantarum strain C410L1 plasmid unnamed6, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
10. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NZ_CP017960 (Lactobacillus plantarum strain C410L1 plasmid unnamed6, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
11. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to NC_006529 (Lactobacillus salivarius UCC118 plasmid pSF118-20, complete sequence) position: , mismatch: 24, identity: 0.714
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc Protospacer
************************************************************
12. spacer 1.1|4584|84|CP019583|CRISPRCasFinder matches to MK994179 (Lactobacillus plantarum strain PC518 plasmid plp75TA, complete sequence) position: , mismatch: 25, identity: 0.702
cttcgcgggagaccgttgaaacccttgctacgagcgaaagtctaaaaaatagaccttcgc CRISPR spacer
cttcgcgggagaccgttgaaacccttgctacaagcgaaagtctaaaaaatagaccttcgc Protospacer
*******************************.****************************
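Each alignment above displays only 60 columns of the 84 nt spacer, yet reports 24 or 25 mismatches. One reading that fits these numbers is that the 24 undisplayed positions are counted as mismatches, plus any `.` in the match line. The sketch below tallies a match line on that assumption. This is an interpretation of the figures in this report, not documented behavior of the tool, and `summarize_match_line` is a hypothetical helper.

```python
from typing import Dict


def summarize_match_line(match_line: str, spacer_length: int) -> Dict[str, int]:
    """Count matched ('*') and mismatched columns in an alignment match line.

    Positions of the spacer that are not displayed are reported separately
    and, under the assumption described above, added to the mismatch total.
    """
    shown = len(match_line)
    matched = match_line.count("*")
    mismatched_shown = shown - matched
    not_shown = spacer_length - shown
    return {
        "shown": shown,
        "matched": matched,
        "mismatched_shown": mismatched_shown,
        "not_shown": not_shown,
        "total_mismatch": mismatched_shown + not_shown,
    }


# Hits 1-11: 60 matching columns shown.
print(summarize_match_line("*" * 60, 84)["total_mismatch"])  # 24
# Hit 12: one '.' at column 32 of the 60 shown.
print(summarize_match_line("*" * 31 + "." + "*" * 28, 84)["total_mismatch"])  # 25
```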
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|