Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP013049 | Mycobacteroides abscessus strain NOV0213 chromosome, complete genome | 3 crisprs | csa3,cas3,DEDDh,cas4,WYL | 0 | 4 | 7 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP013049_1 | 4360772-4360869 | Orphan |
NA
Consensus repeat of NZ_CP013049_1
|
1 spacers
spacers of NZ_CP013049_1
>1.1|4360795|52|NZ_CP013049|CRISPRCasFinder GCCACCGCCGACAGCCGTGCTGCCGCCAACAGAACCCGAACCACCCAGGGCA |
CRISPR arrays and Neighbor proteins around NZ_CP013049_1
The CRISPR arrays of NZ_CP013049_1 >merge|NZ_CP013049|1|4360772-4360869|CRISPRCasFinder TCGTTGATGGCACTGGTGTGGGCGCCACCGCCGACAGCCGTGCTGCCGCCAACAGAACCCGAACCACCCAGGGCATCGTTGATGGCACTGGTGTGAGC >NZ_CP013049|1|1|4360772-4360869|CRISPRCasFinder TCGTTGATGGCACTGGTGTGGGC GCCACCGCCGACAGCCGTGCTGCCGCCAACAGAACCCGAACCACCCAGGGCA TCGTTGATGGCACTGGTGTGAGC
>NZ_CP013049.1|WP_005085675.1|4358590_4360474_-|dynamin-like-GTPase-family-protein MSDTAQDGQQQLALINELLGHVRKIAGENDRGDLIDRLDRADQLLVDRPLRVVVAGQLKQGKSQLVNSLLNMPVARVGDDETTSVTTVIGYGEQPSAALVVAPAEQYDGGGLAGEPEIIPIPVADIGKDLKRAPQAQGREVLRVEVKVASPILKSGLCLVDTPGVGGFGQPHLSSTLGLLPDADVMLMVSDLSSEFTEPELVFIQQALDLCPVAAIVNTKTDLYPHWRAVVAANSAHLQRAKIAVPAIAVSSALRSHALQLNDKELNTESNFPALVTFLSNAIADQQASTRQQAIAEIGSASEHLTLTLEAELSALQDPQSRDELTSDLERRKREAEEALAHTALWQQVLGDGITDVSTDVEHDLQSRFRRILQRTEEVIDRTDPTKNWAEVGAKLEQAVANSVGNNFVWAHQRAMHLAAQVAETFAIDGLESIKMPRLRASEMGADLSDLKSLSKLEAKQIKLGHKAITGLRGSYGGVIMFGMLTSVAGLGMFNLISLGAGAMLGKKTYNEDMENRMLRIRGEAKTNVRRFLDDVSFVVLKESRDRLRLVQRQLRDHFREIANQTTRSLNESLQAAIASARLEAEDRDARTNEVERQLHILRQVNSHVEGLQPADEQVGRRATRDV >NZ_CP013049.1|WP_005085677.1|4357100_4358594_-|isoniazid-inducible-protein-iniC MSSVVQARVIISDAMRAYQNDSRYLRVAEPHDELRRIAARLDEPIRVAIAGTLNAGKSTLVNALVGEDIAPTDATEATRIVAWFRHGAAPRVTANLFSGVRQDIPIRREGGLSFELDRLDPASVADLDVNWPAPELNEITLIDTPGTSSLSTDVSERSLALLVPEDGVPRVDAVIFLLRSLNAADIGLLTQIGKLVGGERGGAVGVVGVVSRADEIGVGRLDAMLSAREVAARFAGEMERTGICQAVVPVAGLLALTARTLRQAEFVALQRLAELDPQVLAKALLSVDRFVREDDSLPVDAQTRAALLHRFGMFGIRISVAVMRVGVTDATGLAEELLERSGLVELHNIIANQFGQRAGILKSHTALLSARRVLTTYPVHGGRRVIDDIDPLLADTHAFDELRLLAELSSRSTTLTEHETMLIRRLLGSFGTDAAARLGLDETRSDPRRAALDAVGRWRARAEHPLNDPFTSRLCLAAVRSAEGILTQLSAHDRPAY >NZ_CP013049.1|WP_005086456.1|4356439_4357096_-|hypothetical-protein MMMRLFVATLALVAGFSTATAAAEPTPSPAGPASKPVPAGVWISPAEIPMNSDYHWSAVSPTATGRAAFLSMRLCGSPSADVLPPASAIATQAASGIAASVVQAAGQWPEGNTEGASAFQSGIRSQLNYCPGVGDVISVDLRPAPSWYGFAATLLVGKERDNPTSEVHIYNVVPPESGTVSELAVTVPRTGREPSPWKPVDDATVLRALAQPLCKGSC >NZ_CP013049.1|WP_005086454.1|4355766_4356420_-|hypothetical-protein MRGWLRTISALVVIAGLGAAVPAAAEPKSLPDSVWINPRDIPLDRASHWAPLSRGAAPVDRPAFWSANLCFSLGESLPQSPESASATVSSDESGWTAVEVVAHWPGEAAVTDQYASTVYRSLRGRLDHCFNAIGAQVSVTELTNGHAATVTLPAQGGKQPQYHLYVVQPPGTGTVAELTVTNAITGAAGASWVEADDHQVLRDLAAPICRTAKSTAC >NZ_CP013049.1|WP_005112267.1|4355302_4355689_-|hypothetical-protein 
MHGRGVLSGVCGIATIIAMAVLLTPVSIQVGTGETAQTVTCGMPVGPRLSRTAELDKISSAQNQSTNYHDQCAAKLDTRRMWAIPIGVLSFVIALCAAGHLWKENTPSSSPAQTHHGLSGPPGMRMAH >NZ_CP013049.1|WP_005086452.1|4354832_4355309_+|hypothetical-protein MFNQQYHDQEPDQPNPWYKQPAVLIALAGAGVAVVAVIAALIITRSDDKPAPAGNTSSVSSTTSSTPADSGGGGHEGHGGHSGGGNTTVTETQTVPESPTSQDTTTTEPTTTTTTTTEPTTTTTQPTTTTTQPTTSTRPSVVVPIPGGGNIQVPVGGQ >NZ_CP013049.1|WP_005112263.1|4354307_4354775_+|hypothetical-protein MSQTPKPPEDESYFSEMIEESEKPAPWYRTGPAIVGTIAAIIAIVAVLMTIVLSTDKRVVRDNTPLTSTPSVSSTAPTVQLTSPSESSTAVPTATEAPSEPAAPVQTSEPTYYETPTSTIYRYPIPAIPSPPPIPPIPAIPQIPQIPQIPQIPGL >NZ_CP013049.1|WP_005085687.1|4353345_4354281_+|LLM-class-F420-dependent-oxidoreductase MDFRVFVEPQQGATYSDQLRVAQAAEELGFSAFFRSDHYLAMGGADGLPGPTDAWVTLGAIARETTAIRLGTLVTSATFRHPGPLAVTVAQVDEMSSGRVDFGLGAGWFSEEHEAYAIPFPSLGERFDRLEEQLEILTGLWTTPTGDTYSYAGKHYTVTDSPALPKPVQEPHPPIVIGGLGVKRTPALAARFATEFNLPFQPVDVITDQYARVAHAVEAAGRPADSMTYSAAFALCIGDTDADIARRAANIGREVEEITENSPLAGTPDAVAEKLSIYVDAGVQRVYVQLLDIRDLDHLEFFASTVIPQFS >NZ_CP013049.1|WP_005112262.1|4352724_4353204_-|thiol:disulfide-interchange-protein MAVLALALSVGLSGCSGTEKGETSSAPSSRLLNFTAQTIDGADFAGSSLAGKKAVLWFWAPWCPTCQKEAPDLQKAATAHSDVTFVGVAAQDQVPAMRDFVTKYGLTFIQLADTDAKVWALYDVTHQPAFAFLGAEGKAEVVKSPLSGPELDKKIGQLH >NZ_CP013049.1|WP_005112261.1|4351857_4352709_-|cytochrome-c-biogenesis-protein-CcdA MIDTAALTFALGAGLVAALNPCGFAFLPGYLGLVIAGSDGTASRMTAIARAATATVAMAGGFLTVFGVFGLVVSPVVASAGRYMPFATVVIGVVLVALSIWLLCGRELTLVLPRVSGGAPTASLLSMFGYGLTYAVASLSCTVGPFLAVISTTFKQGSIVSGVLAFIAYGAGMAVTVGVAALAVALLGNSVQATMRKVLPYVGRIAGVIVLLTGLYVAYYGYYEIRLNFGDGSADDPVINAAGTVQAWLVGVVDATGVWPLTGGIALIVAVAAASSYAVRKGR >NZ_CP013049.1|WP_005080038.1|4362686_4363376_+|thioredoxin-domain-containing-protein MRRSENGKRDIWIAGILGVVIVALATYLLVDHRSQSTASTDSPTVTGHSSSLARLRPLDPLALGPVDAPVVLIIYSDYRCPFCAKFSRDTEPQLIERYVNTGKLRIEWRDLPIFGTQSVQAAKAGRAAAEQGRFWEFNRAVYRHAPDRGHAELTDKILLDRAREAEVPDLARFQTAVESDRLLPAVQQDIQEAVAIGAASTPVFLINDQPVVGAQPLDVFISVIEQAQR >NZ_CP013049.1|WP_005112272.1|4363372_4364254_+|cytochrome-c-biogenesis-protein-CcdA 
MIDVGVLGALLGGVLTLVSPCSALLLPSFFAYSFDRTGLLLRSTGLFYLGMLTVLAPLGAGVGAIGALLTEYRSQVTTTGGLLMTTLGVAIIAGIGFRVGPAARLSARLDLSSGLSVLLLGTVYALAGFCSGPILGSVLTVAAIGSSPIYGALLMSLYSLGMAAPLFVLAFAWGKLGLSQRRWIRGRELRIGRFRTHSTNLVSGGMFIAIGVFFLATDGTAALGGLTGSDTQYDVQARLQSLTAGLPNAAVALGIACAAFVVVLMRLLRARITAAGDARKRSDSTREDADDRR >NZ_CP013049.1|WP_005080036.1|4364271_4364634_+|penicillinase-repressor MRDMFGLGEREATIMELLWSAAEPVTVRDVLDRLERPLAYTTVMTVLDNLHNKGHVTREKVGRAFQYRAAETREAAAARMVREVLAASGDAEGVLLHFAGATSPEESTVLRRILRRGGLA >NZ_CP013049.1|WP_005095144.1|4364630_4365554_+|M56-family-metallopeptidase MTVALVLMAGAALIGVAGPACLGPTVRPSLLPAAALATWLGALLAFLVLTTLSATLLLFPHVLDTSGVRTLVGGCLSGTVPHSSFGTLVQLGISLLPLTILARVAVVAIRSLRSARRRRARHFWMLRAASCRSGEVHWIDDPRPIAYSLGGARGAVVATHGVRHLGERQCAAVIEHEMAHIRGRHHAAVLCADIAAAALPMLPLMRRAPAMVRLMVELAADDAAARVHGPPTVHAALLAMSQSAVAVPGVLHMSSESVAVRLLWLRTQGATGRTPMGRTRLMLGFALLPTTLAGSAVVALFRAYCTL >NZ_CP013049.1|WP_005112273.1|4365634_4366018_+|DoxX-family-protein MIPTGARLVFGVIFLLEGTQKVFGWWSGSPTGSGHPEPFLNWPYWWAGVFELVLGSLLTVGLRTRLAAVLAAGMMAYAYFFEHLQVHWEPMQNGGAFAATFCWGLLLLAVTANDTRLSLDRVLARRA >NZ_CP013049.1|WP_005095148.1|4366030_4366627_-|hypothetical-protein MGNNLLDFVMALVRDQDMAAQYAANPERVIADAHLDGVQPSDVSALIPVVSESMPSISQLAAHPVASGLAHATIPADNIWASGAAVSAFDSPFTAPTHDPSLDAGLHAATDAVSDLSPRQDAITVPDQSTAPTDSAVAAIDQFQAPPQSTDGGFEPQQFGADAGVTGSADPGLHDPFFDQGDLGHHPGIGHDPGIDHF >NZ_CP013049.1|WP_005112275.1|4366808_4368626_+|Hsp70-family-protein 
MANALGLSIGATQLVATPDDPQSQPVVRSSILTLHEDGPPEVGVPSQPGLTLTGFVGRVGDPVGIVAADGSVHRAEWALTEAMRVLIADAVSSANFTAPPVLSASIPAHWSPQTVSTLRDAIDRVPAFAPGGRQLKLIPDARAALEALAAGPGTPDRGVVVVCDLGGSGSSITLADAARGFQQIGQTIRFADFSGQQIDQMLLTQVLTDLNQDPEGTTSVGALTRLRDQCRLAKERLSGDTATSVPVQLPGITTDVRLTRAELEGHLRGPLSNLINAIQDTVERNGIHPANITAVASVGGGASIPLVTQQLSERMRVPVITAPQPQLAAARGVALLATRPDLPPQDATAMRPVEQPPGDATMMRPVPTGTGTFAAPVAAAANEEPELAWSQDDSAPDLAPLQETGYQSGYTVDLDDEYQGPTEATTARPELRFSHGPYADEDDYDDYPPLPWYRRPLVWFIAAAAVAGIAFTGMMVSLTSSESPAPATTTPSVNVVPSTGDAPAEPPPSPEPPPPPQTHTVTQSVQPPPPSPEPPPPPPPTTTTTTTPPTTTTTTTTTTTTTTTTTTTPTTTTTQPTTTTTQPTSTSRPAITIPGLPPIPIGPRN >NZ_CP013049.1|WP_057138206.1|4368718_4371088_+|LuxR-family-transcriptional-regulator MKVLVNGTAGSGKTFVLSRIREALGSAGTPAVNHVPEAAPGPNTGPFVIDDADSLTDRQLETLRXVVENDPAAIIVVAAAPRRNPALRSLFQALERESPAITLGPVNATDIGRMSPLAAPFAEQIAAASGGVLALTAAAVERLGDPDPLQSALRAIDTRVDELLRRLSPMAQAAVLLMSLNPGIGASDVAAALALPDAADLVDEALASGLIAGPGHVAFAARVHGCAARLLGAARHLDLERSLLRTQLEMGSVSADLALALVGHGLRDLEVARLLSNWATTESDPLSAADLYRAALTAGAEPDPIRVPLAESLARAGDLAAAAAQADEVLASTDPAQRTAGVRVAAAIACHNGDSGQAAALYSWLGADMDGLTALSASPVFVGTGQLAKGRKSLEDNVGAPPTTSATAQRNAALGVIASVEGRGQEALSLLGQAVTQQPSTSSFIPDSTVALAALTMLHSGDTARARSILTGAIRADRRHDVFGVRHRLLAAWIAMLSGDLSGAVQWLPGAQVELSRRDRLFAESLRTGIARRHGDTGPLRQHWQASMDALSAYSIDLYTLLPVGELWVAAARLRTLDVMTHHLDRAFAVLAHLGDPIAWSAPLRWAGVHAAILTNSPETLAPHGQALAAGAGSSTYARILATAGRTWLRVLANHVDGEEVDSAARMLADTGLGWDATRLASQAALHANDPKVAASMLALARDLRSPSAAADGESTPSSAVHSPTSTRLSEREREVAELLLRGLPYRDIGAQLFISAKTVEHHVARIRRRLGAESRSDMFSMLRAILTP >NZ_CP013049.1|WP_005112277.1|4371246_4374177_+|4Fe-4S-dicluster-domain-containing-protein 
MSTATITLGIIGVIFSLVAWGSFFGGVVKMVRVILSGQPDRTRFRPFLPRLKQLIVEVVAHTRMNKFRTVGWAHWLVMVGFLGGFPLYFESYGQTFNPEFHWPIIGDTFLWHLWDELLGIGTVIGIVTLIIIRQLNHPRKPERLSRFGGSNFFAAYTIEMIVLVEGLGMVLVKSGKIATFHNSHPSSDFFTMNVAQLLPESPIMVSVFAFIKMMSGGLFLLLVGRKLVWGVAWHRFAAFFNIYFKRNADGSVALGAAKPMMSGGKVLEMESADPDVDAFGAGKVEDFSWKDLLDVTTCTECGRCQSQCPAWNTGKPLSPKLLITSLRDHTYAKAPYLLAGEDKSKLSEAEIAEGERQLVGGEGAVIDHEVLWSCTTCGACVEQCPVDIEHVDHIIDMRRYQVLIESEFPGELAGLFKNLENKGNPWGQNAKDRLNWIEEVDFDVPVFGQDVDSFADFEYLFWVGCAGAYEDRAKKTTKAVAELLALAGTKFLVLGADETCTGDSARRAGNEFLFQQLAMQNIELLNSVFEGVEQKQRKIVVTCAHCFNALGNEYPQVGGDYQVVHHTQLLNRLVRDKKLVPVAPVSQDVTYHDPCYLGRHNKVYTAPRELIGASGAAITEMPRHGERSMCCGAGGARMWMEEQLGKRINIDRVDEALSTPASKIATGCPFCRVMLTDGVTARDDSAAVEVVDVAQLLLESVGRTEDIRKALPAKGTAAAAAAERAATKTAEPEPVVAEEEAPATTKAEAAAPAAEAKPVSGLGMATGAKRPGAKKTAAENSTAEASTATPTAEAAPAAPVKGLGMAAGAKRPGAKKATAETSAATPPAASASPPASQAPAAAAAPAPPVKGLGMAAGAKRPGAKKTAPAAPAAPAPEAAAVPSESEAPTATETSAVAEPPVKGLGIAAGARRPGAKKAAPATESAPEPAQPATAPEPTEPATSAPKDEAPETSSTPEPPVKGLGIAPGARRPGRRN >NZ_CP013049.1|WP_030142800.1|4374184_4375495_+|pyridoxal-phosphate-dependent-aminotransferase MDSNATIETVSTDNLPIDVPPRVPGLSPRQHQRVFAQSTKLQDVLYEIRGPVHAHAARLEAEGHRILKLNIGNPAPFGFEAPDVIMRDIIQALPYAQGYSDSKGILPARRAVVTRYELVDGFPYLDVDDVFLGNGVSELITMTTQALLDNGDQVLIPAPDYPLWTAATSLAGGTAVHYLCDETNGWMPDIEDLESKITERTKALVIINPNNPTGAVYSREILTKMVELARKHQLLLLADEIYDKILYDDAEHISVASLAPDLLCFTFNGLSKAYRVAGYRSAWLAITGPKDHAASLLEGVNLLANMRLCPNVPAQHAIQVALGGHQSIEDLVLPGGRLLEQRDVAWTKLNEIPGVSCVKPQGALYAFPRLDPEVHHIHDDDQLVLDLLLNEKILLTQGTGFNWPEPDHLRIVTLPWARDLAVAIERLGNFLASYRQ |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP013049_2 | 5105873-5106003 | Orphan |
NA
Consensus repeat of NZ_CP013049_2
|
2 spacers
spacers of NZ_CP013049_2
>2.1|5105899|28|NZ_CP013049|CRISPRCasFinder CCAGCAGCGCCTCGCACCGCAGAAGCCT >2.2|5105953|25|NZ_CP013049|CRISPRCasFinder GTTGTTCCCCACAGCGCCGAGCCGC |
CRISPR arrays and Neighbor proteins around NZ_CP013049_2
The CRISPR arrays of NZ_CP013049_2 >merge|NZ_CP013049|2|5105873-5106003|CRISPRCasFinder CTGCCGCTCCGCACACCGCGGAACCGCCAGCAGCGCCTCGCACCGCAGAAGCCTCTCCCGTCCCGCACAGCGCCGAGCCGGTTGTTCCCCACAGCGCCGAGCCGCCTCCCGTCCCGCACAGCGCCGAGCCG >NZ_CP013049|2|2|5105873-5106003|CRISPRCasFinder CTGCCGCTCCGCACACCGCGGAACCG CCAGCAGCGCCTCGCACCGCAGAAGCCT CTCCCGTCCCGCACAGCGCCGAGCCG GTTGTTCCCCACAGCGCCGAGCCGC CTCCCGTCCCGCACAGCGCCGAGCCG
>NZ_CP013049.1|WP_057138138.1|5103813_5104098_+|helix-turn-helix-transcriptional-regulator MGEEEQQWTPKRVTNADAAVGAVLLRLREQAGISVREVEQRTGISRSVLSRIERGERPARVLELEPLAQVYEMTIEDLITTITADPDVRAASSE >NZ_CP013049.1|WP_057138137.1|5102907_5103762_+|hypothetical-protein MGPDDFFEETESINTWTGEPTTTSKLRTGFLNELRAGPVAGIDDLDAAVALTHLVWDDLIAFGTGGGNTLDDKQLTLAQRALTETLSRIGITLTFPWRDFTAFKSHWVRNGCSGSWQARRDLLENLFAPVQAELDRQEDAQFRAVNAEAVSPHTKTGWPTVDEELTELRRRFRTATTTQDYRDVGNRAVAVLEALSRTVYDPVAHLREGETEPPVDKTKQRIGRYVEDSLAGKDNEAIRGVVNKVLELAHSVKHSTQPTRREAGIIADSVIMLANILRRVDQDF >NZ_CP013049.1|WP_057138136.1|5099493_5102349_+|hypothetical-protein MSTPFSPEELQRVAANRAALADAHAARESEIDMWLERLSNAEAVLMPLTVAGKRPVESAWQQTAATDWDAIEEHIRRGGNIGAHLGASRWVAWDGDNAAATAAMLDGGFDLFTVSAGSRNPAHTHAGGAHCLWRLPGWVPLVRLTGPTKAVMLDGGASIDVLAGNHQIAVPPSVVVIKEPTHYVGTYATASEAGYCEPDRDGWLRVADSDDGVLPELPLWALSDELLAYAPEGTVVGEPPAGWEPMAGTVTVWREAEARVREPGDSDDELTAAVDGLDLLSMLEAAGIAGERVGFDSCGPCETWLRAGSNAEKSITVHNCGVHGARVQVWTTAFADLPQGGHSRLDAYCGLTGRERGAVMKELGLVPEKRLIAVDSDSLEAAAAELEAAAAEGVTTVAVPTTGTPMPDGSHRGEIVQVEVGADGLLRRAQRLRGAAAELRAGQHPVEPRDGAVLIGADSVVGAPTVGANALQPMPEPELTPEEQAAAEAEALADKRDPYRHLPDPCTEELEELVFGDPRLPQVGQIRTAARAAAMSPWSTLGLSIGRGLLRVRASVLVPDLVGGMPAPLNYQIGIVGASGRGKGAAHGLVIYDAGVLIHTEESIDKKFMPPSGAALAGLFVARQKDDDGKVEVVPIREAAWCDWSEVDTLTAQSGRGGNDLASELRAAITGAELGTDPKKDGADPLKVEPLSYRILVSFSTQYGKPAMALMAERDGGTLQRTNWFSTSDPRSLSKRPRGPKPRAEIDIMRGLPTSHVVKHPDGDRELLTVDDAIHDEVWEAQANKIRFDRDVDELAGHQNLNRLRTAAWAALIQHQAHIGAFEWEWAGHVMEHSRRVRDRLEASVKGLKKEQAQENGSLDFDRKNAAELAKVAHTEALLTSLAKWGRDVREGRNKFGKSGPTFTLRDIQASAKNAKSERYTRAAELATELCMRQLWVCEGERMRSVEPDPE >NZ_CP013049.1|WP_126690960.1|5099272_5099497_+|hypothetical-protein MTVPMTVAEASAAHDTPTIEVVARSLPELHDLLTRPRAEGEPLHVRLAGPAAAELFGVGVAVRVAATGGEAVAR >NZ_CP013049.1|WP_057138134.1|5098978_5099203_+|hypothetical-protein MSTEGVRANAEAAFRAAHPDDRSGSLIAGCRAAARVADPDHAVGFTIGALADEERARASEIDVWLDCNGARIAS >NZ_CP013049.1|WP_057138133.1|5098667_5098982_+|helix-turn-helix-domain-containing-protein 
MRRMGYKEAAEIAGVSLRKMRYLVESGEILARRCGSRVLIAERDLEEWLNSLPLVVDTEADTEADTETATYDEGEGVSDRARTQGAARVRAASATEVRQEVAAA >NZ_CP013049.1|WP_057138132.1|5097422_5098484_-|hypothetical-protein MALTSAFVDCGTVLERCTLVNMINRPDQRPQGLSVTSPARVGLLTCGFTLATLAVATLNTPVAHAEPVVRCRTLEGGGDYVCVTETPTAPATTGSTGTSSGAGLGDWFSEHAGAFVFVIAVAAVVAIVAMVKSGTMKDKREASEAELARGRALAEQHHAAAVQRAHEEAAAQVPPREQWDPHNVGLAPPPMPAPKLPPAPPTSPTDLARYAAFGDVVPWIEGSALAAVTAPNGDRSRAEAAFAEAVRTAGLGTTDEDGTFIPDATLASVRAYLDGSGDVEFVVRTRDITIGEKQLATITHFLLLTARVASAGPWEREVATGRYTIRLSMNAKAAQEQVQEPKETGPRIDPRWA >NZ_CP013049.1|WP_126690959.1|5095431_5097423_-|DUF87-domain-containing-protein MTGGFDELFGDTSTATAAPTKKKAKTAAKKPAAADRPQRSQADHDGEVRAAADRYMALLPEIRAARPQGPRVEEAAPQLTGPAWVIDAWETALFGGDHAAFGLDARFGAEVFTRGDTLYVQVTVPAELAGEGRKAVAETMLAETARRQNLGEYEAIPGDDKRKLFLVRTNAFDTTNGWREHPKTGAFYRSAMSGGKYHQELFDLVGLRVVDPKTGEIKYPQREFGSDDRGGTVTLTLPPGMLPRDVIAAEPALRAALAMPELTVTAGDGLRPTIHLNSKPLVRAFPKSNPLSAGRLWLPRTEAERYAVSNEVRLFLGVTETGEVVAPRLKDRSHAAIFGMTNSGKSTTMVILARSLAAQGAEVWLADAKGTPEFRSLYKEGVPGITHLSVATPALMHATVHKLRELYKFRAAVANELADRGLGVDDILWPRVVLIFDEMGEFLNTALSDGGDRVAKAQAQATVSMLGEIGAKARAYGVHLIVAGQHVYVAALPNKVKEHLSIRAVLGKASDTHIQRLFDEQDRDAAKAAREGILPGQKGRGIVMADDAEVVQFQSFYNDAEQGAKFARDTAGTPRLKRWGHQFPIEDDAPGAHGAWQSWGAWEGDKSFEPDGTVEDLGTVVLDRPNPVTGELEPDPGAAAWDMTSPAYNPGSAPLSAAFQNVN >NZ_CP013049.1|WP_057138130.1|5094782_5095412_-|hypothetical-protein MITNTNATLTVPVGVTSDGNPVAVDLRRNSAVLVAGIAGSGKSEMLCRMRYALERQLDTDHLFHAQGIRAGADAVILAARDERRRRMRERRIDDSPAVLLVDDFGVMCIGRDRSDELVRAIEEIAVKGRAAGVHLVLATQGTEGLGRALLMNLSTRIVVGRLVHPVTTNVFSDGERELIADAGITHIGRGGGVLIGTDGRPTPFAALPL >NZ_CP013049.1|WP_057138129.1|5093898_5094762_-|hypothetical-protein MTTTTDRIIAAANAVRTTHLAALRPLAEVAGRTVAAPPTVEAALAAALHLIAARPALDHTVTELLRELVAAGITSTKLARLLSIRSSTLTARLDAPAPSAAPESELHYGMFRRKDKLSVRAARESLITAAAELGRTYAAALRPISTVSQGSVPEAATVDAALEAVLHLHRSRQHLDAALDPILAALVLGGVRRMSLAEALAVHPNTLQRRLMSEPLAHARWADLIENGDGTWSVAPAEVGRYKPTEELDEALVEAAVAEAITAVAENGPSARAREYSATVAESRVAF >NZ_CP013049.1|WP_057138140.1|5106563_5107043_+|hypothetical-protein 
MTDTVLGAIAYSEPGGWEGTYTILFLGREVTVRMALGGWDEADPVEPVQRDAVEQFTARKAELCAQADDALYAEYLQRRPELREQFGDDADRLMPIIDGKEGLSNLVAPDFFQVPLPRRGSTDRVVALMYNCSWDVELGLAVKFVNESIAEVGPQNIVL >NZ_CP013049.1|WP_057138141.1|5107043_5107748_+|hypothetical-protein MHEPVWARDAFVNSVWVATGETPEWIADRTDALLAQLGEALGITHWDTTRDDRWEGPADALAAIVRRNPVREALPEGEEGEVIPSEGYSMVLNGAGPSVSVKVAVHAGSIAVGRRVPSHRLHIDLREATSAGITSEVGDAVCAAVASAWRPATLTLTDSATNRLARRGGWKIGAGYRTWISSEVGAVSRVADGLTATELAGGTLISAPDDWSAEQVVAAMTETLAANGLDEVPH >NZ_CP013049.1|WP_079752580.1|5107758_5108124_+|DUF2185-domain-containing-protein MTRHIEFIPHAGACLATTNVMERRGKVRWLVRMPSTGGADNGWQIMSHVDTTDYLNDKSNWRIVAFNDVCNIEPALIGIYDFPVGSDLQIVRDDRGISIYDTPTGRQIPVEEFYVPPQFRD >NZ_CP013049.1|WP_162259894.1|5108145_5108283_+|hypothetical-protein MAALAGQLLFFAMVAALIGGVVWAVAVMRKPRPPLEEADDEDEDW >NZ_CP013049.1|WP_057138142.1|5108350_5108668_+|hypothetical-protein MVEYVLAEGGGASWVGPAVRAVLFTAAGVALLVVGVRRRVARARWNREDDRRLLHPDGSAASDVHRTSNPHSGGIWPIAVGAVILILGLLHVLDLVATLHVAGAI >NZ_CP013049.1|WP_057138212.1|5108718_5109498_+|DUF2185-domain-containing-protein MANKKFHLPAEDIVPGLAPVDGCLATDRILVDGSPVGYMYRDGSGWVFTAGDETPEYLADPGHLNLVSLNVIANYDRATVPYLQAPPGAAFIRHGDAFVEDPEGAPREPDEPAPPALNPQFPVVNGDHALTADWVLTTPEPMNFRIDRDAGSSSAVFWRPGLTAMLTPWGNPRNDTPARRLQQVRGHVSPHGFDHAEWSDDGVHYLTYRLAEGTGDGRLPALYGFVVSSAGHVQLAVYVDSGDDLMSAQALVRSITNRR >NZ_CP013049.1|WP_057138143.1|5109544_5110015_-|hypothetical-protein MDEVLAYQLFGDWSNAHQARGVSINGDFAPEEEAEDWAEELIGGMVAAMAHVGAVVERGPIRVHDGKVFVELDGDAFMVRDIDGEGSRASASLERVLSRFATIAARRGCTQRWFYWYTGDPVGMAYFVAPEELVTSSGVDVRELGTGEQWYEAQPD >NZ_CP013049.1|WP_057138213.1|5110007_5110517_-|hypothetical-protein MGVPTHVMEFSSGWATTMGNGSGPLQFLGPLRQLNGTERWFITFYALPPGKSYEEIRHQGTEEYIQAGGSAEAMMLDIRKPGGQQWGAASVRYFIGHPHEGNLPLDVPIELPRGPEFVSAAEVFDAEEAADIFISYYKTGDIPPGYVLRPVEGYTGGGDLIDLRGVTSG >NZ_CP013049.1|WP_057138144.1|5110652_5111546_-|Abi-family-protein 
MKPSLSWEAQVALLVERGLTIPNTDECAAFLAAHNYYRFSGYMRYFQQAPHEGNNLFQPGTTFEEIRDVYDADEELRLALIPRLARAEVLLRTHTAYVIANDHGPRGKYLEEDFYTDIGDAEPTVESCLRDIERSKERHILRYKSTDTGGVNFAELPVWSAVEAWSFGTLSKCIERGARGALAGAVATSIGVAKAGFAYRVKALVYLRNRCAHHSRLWHHSVIDAGPTPNNVRAKAKRLGGQFESRSVLDVIASLDNILDRGRTGNAILPELVQRHERTSIFWQGLSRPQNPRDNRE >NZ_CP013049.1|WP_079752582.1|5111884_5113975_+|site-specific-DNA-methyltransferase MADETNILDQLISRVPDESLRSYLAREVDLLRGSRHFGLVFDRHLPESVRLVDYPIRKGVRVALRDESSTETWLVTGFVDQERKVAVLSGDGGERPVDDLAVVREFGEPIYPGLRSVERIPNGPEDAPWHVVINGENYHALQALRSTHRGKVDLIYIDPPYNTGNDGWIYNDRYVDQADRAKSSKWLSFLERRLQIARDLLKPSGVICISIGYAEAHRLQLLAEQTFRAHTVQLVTLQTSEAVNPKAGFNFVHEHLVCITPADFVPQQMTFTGNKPGAAWHAMNLAAFDKTQRPNQAFPIFVDVKTGAVHSTGPTLADLEKSGEYTGDRADYPYEMPAPEGTVAIWPITGKGEHCVWRLVPESFMANWDKGYIKVLRKKGKAAAIAPFGIQYLSEGEIAKVESGETEVLGREPGVPTVILGANMVSGDKIPSMWNELTHRTTEGNSHLKKILGGKRFPYPKPVELIMDIVMGFSQGNRDAVILDFFGGSGTTLEAVMELNTFDGGNRQAILVTNNELSATEAKKLRKAGLHPGDAEWESKGVFEYVCRPRVSTVVRGKRPDNSEYSEGLPANVEMFDLVYLDPSSVRRGREFPALAPLLWLQGGARRDRIEADSGCGWALTGTYGVLFDIDTLTAFADAVTAAATTGSPPQVLFIVTDSLAEYQEAVDRLPVGIDTVQLYEDYLLNYTDNFSGGAR |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP013049_3 | 5119191-5119644 | Orphan |
I-E
Consensus repeat of NZ_CP013049_3
|
7 spacers
spacers of NZ_CP013049_3
>3.1|5119216|36|NZ_CP013049|CRISPRCasFinder,CRT ACCGGTTTGGTCGCTCGGCCCGGCCGTGCGCGATGC >3.2|5119277|36|NZ_CP013049|CRISPRCasFinder,CRT ACCGCATCTCGGTGATCTGCACCGTCGACGCCGTGA >3.3|5119338|36|NZ_CP013049|CRISPRCasFinder,CRT ACCTCCGGTCCAGGTACCGCGTCCAGAAGATGCGGT >3.4|5119399|37|NZ_CP013049|CRISPRCasFinder,CRT ACCGGAGCACCTGGCGCTTCACGTTCACGAGGACGAG >3.5|5119461|37|NZ_CP013049|CRISPRCasFinder,CRT CCCTGTTGTTCCAACTTTTGCGGAACCCTCTGCAGCG >3.6|5119523|38|NZ_CP013049|CRISPRCasFinder,CRT CCCCCGTCGGTGTGGAACCGGCAGAGCCACAATGCCAC >3.7|5119586|34|NZ_CP013049|CRT CCCCGGCCGCTGGCTCCGGCACCCGGAAGCCGAG >3.8|5119464|34|NZ_CP013049|PILER-CR TGTTGTTCCAACTTTTGCGGAACCCTCTGCAGCG >3.9|5119526|35|NZ_CP013049|PILER-CR CCGTCGGTGTGGAACCGGCAGAGCCACAATGCCAC |
CRISPR arrays and Neighbor proteins around NZ_CP013049_3
The CRISPR arrays of NZ_CP013049_3 >merge|NZ_CP013049|3|5119191-5119644|CRISPRCasFinder,CRT,PILER-CR CTGCTCCCCGCGCAAGCGGGGATGAACCGGTTTGGTCGCTCGGCCCGGCCGTGCGCGATGCCTGCTCCCCGCGCAAGTGGGGATGAACCGCATCTCGGTGATCTGCACCGTCGACGCCGTGACTGCTCCCCGCGCCTGCGGGGATGAACCTCCGGTCCAGGTACCGCGTCCAGAAGATGCGGTCTGCTCCCCGCGTAAGCGGGGATGAACCGGAGCACCTGGCGCTTCACGTTCACGAGGACGAGCTGCTCCCCGCGCAAGCGGGGATGACCCTGTTGTTCCAACTTTTGCGGAACCCTCTGCAGCGCTGCTCCCCGCGCAAGCGGGGATGACCCCCGTCGGTGTGGAACCGGCAGAGCCACAATGCCACCTGCTCCCCGCGCAAGCGGGGATGCCCCCGGCCGCTGGCTCCGGCACCCGGAAGCCGAGCCGCTGCCCCCGCAGCGGGGATCGA >NZ_CP013049|3|3|5119191-5119585|CRISPRCasFinder CTGCTCCCCGCGCAAGCGGGGATGA ACCGGTTTGGTCGCTCGGCCCGGCCGTGCGCGATGC CTGCTCCCCGCGCAAGTGGGGATGA ACCGCATCTCGGTGATCTGCACCGTCGACGCCGTGA CTGCTCCCCGCGCCTGCGGGGATGA ACCTCCGGTCCAGGTACCGCGTCCAGAAGATGCGGT CTGCTCCCCGCGTAAGCGGGGATGA ACCGGAGCACCTGGCGCTTCACGTTCACGAGGACGAG CTGCTCCCCGCGCAAGCGGGGATGA CCCTGTTGTTCCAACTTTTGCGGAACCCTCTGCAGCG CTGCTCCCCGCGCAAGCGGGGATGA CCCCCGTCGGTGTGGAACCGGCAGAGCCACAATGCCAC CTGCTCCCCGCGCAAGCGGGGATGC >NZ_CP013049|3|1|5119191-5119644|CRT CTGCTCCCCGCGCAAGCGGGGATGA ACCGGTTTGGTCGCTCGGCCCGGCCGTGCGCGATGC CTGCTCCCCGCGCAAGTGGGGATGA ACCGCATCTCGGTGATCTGCACCGTCGACGCCGTGA CTGCTCCCCGCGCCTGCGGGGATGA ACCTCCGGTCCAGGTACCGCGTCCAGAAGATGCGGT CTGCTCCCCGCGTAAGCGGGGATGA ACCGGAGCACCTGGCGCTTCACGTTCACGAGGACGAG CTGCTCCCCGCGCAAGCGGGGATGA CCCTGTTGTTCCAACTTTTGCGGAACCCTCTGCAGCG CTGCTCCCCGCGCAAGCGGGGATGA CCCCCGTCGGTGTGGAACCGGCAGAGCCACAATGCCAC CTGCTCCCCGCGCAAGCGGGGATGC CCCCGGCCGCTGGCTCCGGCACCCGGAAGCCGAG CCGCTGCCCCCGCAGCGGGGATCGA >NZ_CP013049|3|1|5119436-5119588|PILER-CR CTGCTCCCCGCGCAAGCGGGGATGACCC TGTTGTTCCAACTTTTGCGGAACCCTCTGCAGCG CTGCTCCCCGCGCAAGCGGGGATGACCC CCGTCGGTGTGGAACCGGCAGAGCCACAATGCCAC CTGCTCCCCGCGCAAGCGGGGATGCCCC
>NZ_CP013049.1|WP_057138147.1|5118927_5119107_-|hypothetical-protein MGTDYRHSITNLHRGVHCDVNSARTRLHSVIELMEQTNLTAAVADLRAVEQRLADVASR >NZ_CP013049.1|WP_057138146.1|5118040_5118853_-|hypothetical-protein MIERMSRKAGFAIAAAAAATVALSGCSTGVAGQPVAVPATSSATVAGAEAAVESTWLTIPAEQAAAAPKRIELTQTASGARCTAGPAIAAASTPTRRGFVAAGHCDEVPGSTVTVGGESIEPYTDTRRGDRGVTVAWGTANAAPTVAGGFKVAGVLKQPAVQRLEYGTPVCMDGAISGVRCGTVTENDASGIYVKMPIERGDSGAPLFLVSDRGTATLIGIVEQRSNGFTFGAYLDPLLDEVGAKVLTDRTAAIDPSTDPRYSSATSTIQ >NZ_CP013049.1|WP_057138145.1|5113971_5116512_+|DEAD/DEAH-box-helicase-family-protein MKFTLEPYQSTAVEALLSSLTKARADYLDDGDLTAVGLTAPTAAGKTVIATAVLEGLYLGTPTREPRPNTTVLWITDDRALNAQTINKITTASGGRIDVNRIRLLGEEDSRTLEPGLIYFVHIQALQKNSTLHAVRADGTRNDKRTHGVWDMIANTVRERGEDFVVIWDEAHRGSGTTTTERKTIAGTIVGGGPTNIGTVQPPAPVVLGISATPDRFEAAMKAANRVRRLHAVKAGEVRESGLLKDRILLRSLGETQAANNTMLALAVEDLKTSDEAWRTHHENTGDRLVEPLLVVQVEPKVTEARLVEILSVISSTWPELTDYAIAHAFGDPHGPLKVGDKTVRYLAPEAIAGDDRARVVLFKSALTTGWDCPRAEVMISFQNKDDYTEIAQLIGRLVRTPLAKRVEGDDRLNEVAAYLPGFRAEHVARVVNALTEDETVEVDVVVAPVVCERSSKVPSEVFGLLDTLPSYTRQRTTFPSRTAQLMRLASALTEHKFVEQGSAKARVWIVDQMRAADGQRSDEIDAKAKDILSLTINTTVVEYGELIMLSEGKSDTPTNERDLEGYFRRARRVLPDGSANWYFNALCDAGMDEVDAMARLTAMAEMGFKEIVETQAAALISTWRDQHQSEVSRRPRHIRDQIEPLWHVGTTPMLPTTVEARDVYAAATEKVRGGTTEPITTYPSHLYVIPDGKPNAGEFPVDTSRSSWEAAVLEAELKAKTLVGWYRNPSSGKHALAVPYEFGDKYQLMHPDFLFWHDEGDGKYVMDIVDPHNHSLADTHAKWAALSKYAQDHPDRVRRCLAVAEIDGSMRALDLTKDGIDERIAVATNKNLIEALFAAEGMAYT >NZ_CP013049.1|WP_079752582.1|5111884_5113975_+|site-specific-DNA-methyltransferase 
MADETNILDQLISRVPDESLRSYLAREVDLLRGSRHFGLVFDRHLPESVRLVDYPIRKGVRVALRDESSTETWLVTGFVDQERKVAVLSGDGGERPVDDLAVVREFGEPIYPGLRSVERIPNGPEDAPWHVVINGENYHALQALRSTHRGKVDLIYIDPPYNTGNDGWIYNDRYVDQADRAKSSKWLSFLERRLQIARDLLKPSGVICISIGYAEAHRLQLLAEQTFRAHTVQLVTLQTSEAVNPKAGFNFVHEHLVCITPADFVPQQMTFTGNKPGAAWHAMNLAAFDKTQRPNQAFPIFVDVKTGAVHSTGPTLADLEKSGEYTGDRADYPYEMPAPEGTVAIWPITGKGEHCVWRLVPESFMANWDKGYIKVLRKKGKAAAIAPFGIQYLSEGEIAKVESGETEVLGREPGVPTVILGANMVSGDKIPSMWNELTHRTTEGNSHLKKILGGKRFPYPKPVELIMDIVMGFSQGNRDAVILDFFGGSGTTLEAVMELNTFDGGNRQAILVTNNELSATEAKKLRKAGLHPGDAEWESKGVFEYVCRPRVSTVVRGKRPDNSEYSEGLPANVEMFDLVYLDPSSVRRGREFPALAPLLWLQGGARRDRIEADSGCGWALTGTYGVLFDIDTLTAFADAVTAAATTGSPPQVLFIVTDSLAEYQEAVDRLPVGIDTVQLYEDYLLNYTDNFSGGAR >NZ_CP013049.1|WP_057138144.1|5110652_5111546_-|Abi-family-protein MKPSLSWEAQVALLVERGLTIPNTDECAAFLAAHNYYRFSGYMRYFQQAPHEGNNLFQPGTTFEEIRDVYDADEELRLALIPRLARAEVLLRTHTAYVIANDHGPRGKYLEEDFYTDIGDAEPTVESCLRDIERSKERHILRYKSTDTGGVNFAELPVWSAVEAWSFGTLSKCIERGARGALAGAVATSIGVAKAGFAYRVKALVYLRNRCAHHSRLWHHSVIDAGPTPNNVRAKAKRLGGQFESRSVLDVIASLDNILDRGRTGNAILPELVQRHERTSIFWQGLSRPQNPRDNRE >NZ_CP013049.1|WP_057138213.1|5110007_5110517_-|hypothetical-protein MGVPTHVMEFSSGWATTMGNGSGPLQFLGPLRQLNGTERWFITFYALPPGKSYEEIRHQGTEEYIQAGGSAEAMMLDIRKPGGQQWGAASVRYFIGHPHEGNLPLDVPIELPRGPEFVSAAEVFDAEEAADIFISYYKTGDIPPGYVLRPVEGYTGGGDLIDLRGVTSG >NZ_CP013049.1|WP_057138143.1|5109544_5110015_-|hypothetical-protein MDEVLAYQLFGDWSNAHQARGVSINGDFAPEEEAEDWAEELIGGMVAAMAHVGAVVERGPIRVHDGKVFVELDGDAFMVRDIDGEGSRASASLERVLSRFATIAARRGCTQRWFYWYTGDPVGMAYFVAPEELVTSSGVDVRELGTGEQWYEAQPD >NZ_CP013049.1|WP_057138212.1|5108718_5109498_+|DUF2185-domain-containing-protein MANKKFHLPAEDIVPGLAPVDGCLATDRILVDGSPVGYMYRDGSGWVFTAGDETPEYLADPGHLNLVSLNVIANYDRATVPYLQAPPGAAFIRHGDAFVEDPEGAPREPDEPAPPALNPQFPVVNGDHALTADWVLTTPEPMNFRIDRDAGSSSAVFWRPGLTAMLTPWGNPRNDTPARRLQQVRGHVSPHGFDHAEWSDDGVHYLTYRLAEGTGDGRLPALYGFVVSSAGHVQLAVYVDSGDDLMSAQALVRSITNRR >NZ_CP013049.1|WP_057138142.1|5108350_5108668_+|hypothetical-protein 
MVEYVLAEGGGASWVGPAVRAVLFTAAGVALLVVGVRRRVARARWNREDDRRLLHPDGSAASDVHRTSNPHSGGIWPIAVGAVILILGLLHVLDLVATLHVAGAI >NZ_CP013049.1|WP_162259894.1|5108145_5108283_+|hypothetical-protein MAALAGQLLFFAMVAALIGGVVWAVAVMRKPRPPLEEADDEDEDW >NZ_CP013049.1|WP_057138148.1|5119895_5120108_+|hypothetical-protein MTTTTITTPLIVDGVPVLEVPPRTAAVMKSVSERSILRWIAEGKLPARRLPGGRQLRIAVADLTALGEPV >NZ_CP013049.1|WP_126690961.1|5120183_5122406_+|hypothetical-protein MTVPMTAVDASLTHDTPIAPATLQQWRPYIPEHIDPTASPAPGGHCECPTVAHIVLHVDGCVQGVSGRAQPYDPDLCGDYDLPLNKLVDAAELGAALRGITVGGLADVMRWEAEDRALDVHAAKSAYEAGVAEGESEAEALFETLRKRLPVEQYDAVLDGLDMNDPGQVLERLRAEVAALPAPFTVPPAPGAGQALTAAPDVADSSAAPEVEDDEEQEEVEPDRTIRDFDGTPPAADQIMKYPMPPIPAHVPPVEGAKTQPVEVLPPLANQWGHQNVAEEWIFSATPGLSHVAAAADARGVTRWGLLGTLLTRVAATIPPTVRLIPASRKVPTEAGATPAGTSINLYSIAVSPPSTGKTDTISAAAALIPGVRTVPPGTGEGMLKEFPRLSVDEDDSEGDGDGTAPKIETIETDAYPGSVLLESDEIDVFVGEMMRQGSKTTGWYRSMWMGGEIGNTVSDRDRRSFIAAHTYRFGIMLGAQPDAVAPMFNETSKGTPQRFMWLPAQQTIARGDYPSRLGIAPVYWFDESPSMIPSTGGQRPPVWIYPPKAATDALDRARWRAATANPMASPAAADRAAAIADRHAVLQQLKVSAVLAVLDGLDQPQDAHWYAAGAIMTVRRRVIHELVAESDRIKAEGKEAEGALNGVYRASSDAARDAERERHVSRCAARMMAGLVKASVAGQPPMTYGAAIRLLSGKKGAAGRSDRELYGRAALAVVRASPGVADTGTHVQWVGTAAI >NZ_CP013049.1|WP_057138214.1|5123052_5123337_+|hypothetical-protein MTLTAEKLTPADAAVGSLVHALAEHLDPDALTTAVQQALLAAEIHLGDAVYLSELVPDVIDGQVWHGEAVMPAAEALRVGMGYVAAALVAGATL >NZ_CP013049.1|WP_065209208.1|5123463_5123709_+|hypothetical-protein MLARTHAALDPARLSGAVTAALATAEITYGSVTLRAGDDDLLAVPSIGEVWVADDAEIPADEALRVGIAYVAAALAAGARL >NZ_CP013049.1|WP_153995297.1|5123727_5123886_+|hypothetical-protein MHATLFADEQVAAALRDARPADEAAVRRAQARAATALDAAFAWTRTVTPWER >NZ_CP013049.1|WP_057138215.1|5124185_5124797_+|hypothetical-protein MLNPATAGRTVVGLDLSLTGAGIAAFDLMTGGLDTAVHRSPAPMVGTLGAHVQRHRALVDGIVQQTVACDPALVVVEGLRFSVSAQDTSLSRRGFLWWAVVEGLVNAGAPVLEVTPSQIKMLATGNGGASKDMMVAEYALAWPDATRDKNTQDRADAAFAAALGAAYLRSPLLPFTVTGKRRKALAKLPAPSAPPRIAAAAVA >NZ_CP013049.1|WP_057138151.1|5124823_5125093_+|glutaredoxin-family-protein 
MTNATLFTQPGCGGCIFAAKDLTKAGVPYTERNVRTDEAAAEMVQELYREHREPGKVPETPVIVIDGEPYFGAVELHAYLRELRREAAA >NZ_CP013049.1|WP_057138152.1|5125089_5125473_+|helix-turn-helix-domain-containing-protein MSGTIRRPAPGGGTFITRPAAAELRGVTRRRIDSLIQGGELPAYQVSGRVMVLEADVLALELPTVPPDGWISRHAAAKVLGCTHRAVGLMVARGDLPAETINGRVYLREADVRKAATPRRVMPPRRR >NZ_CP013049.1|WP_057138153.1|5128102_5128402_-|hypothetical-protein MNETTWTQTAAEDVPPELLHEVRALLAEFEALDPSPRVQPGRDGAAGGTRRHGRRGGRIRSRLHRGAAGRGGGMTGPVYDDEDIRDDGEQWPDEERDRD >NZ_CP013049.1|WP_057138154.1|5128459_5128813_-|glyoxalase/bleomycin-resistance/dioxygenase-family-protein MKLSLIVLYVPQPTLDLTARFYGALLDAEPVAEKHGDGPQHWSVTGADGLVMELYPMGSRPHTSTRLEFQGADMDAAVQRLIDRAYALPERTRDGAGWWVSDPIGNTVVLLPEPTPS |
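The spacer records in the tables above use pipe-delimited FASTA headers such as `>3.1|5119216|36|NZ_CP013049|CRISPRCasFinder,CRT`. The following is a minimal sketch of how such a header could be split into fields. The field names (`array_index`, `spacer_index`, `start`, `length`, `contig`, `tools`) and the `parse_spacer_header` helper are hypothetical labels chosen for illustration, and the inclusive end coordinate (`start + length - 1`) is an assumption about the coordinate convention rather than something the page states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpacerHeader:
    """Fields of one spacer header; names are illustrative, not official."""
    array_index: int        # e.g. 3 in "3.1"
    spacer_index: int       # e.g. 1 in "3.1"
    start: int              # start coordinate on the contig
    length: int             # spacer length in nucleotides
    contig: str             # contig accession, e.g. "NZ_CP013049"
    tools: tuple[str, ...]  # detection tools that reported the spacer

    @property
    def end(self) -> int:
        # Assumption: coordinates are inclusive, so end = start + length - 1.
        return self.start + self.length - 1


def parse_spacer_header(header: str) -> SpacerHeader:
    """Parse a header like '>3.1|5119216|36|NZ_CP013049|CRISPRCasFinder,CRT'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-delimited fields, got {len(fields)}")
    spacer_id, start, length, contig, tools = fields
    array_index, spacer_index = spacer_id.split(".")
    return SpacerHeader(
        array_index=int(array_index),
        spacer_index=int(spacer_index),
        start=int(start),
        length=int(length),
        contig=contig,
        tools=tuple(tools.split(",")),
    )


if __name__ == "__main__":
    h = parse_spacer_header(">3.1|5119216|36|NZ_CP013049|CRISPRCasFinder,CRT")
    print(h.contig, h.start, h.end, h.tools)
```

Keeping the tool list as a tuple makes it easy to see which spacers are supported by more than one predictor, for example spacer 3.7 (CRT only) versus spacer 3.1 (CRISPRCasFinder and CRT).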
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP013049_2 | 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder | 5105899-5105926 | 28 | NZ_CP035424 | Leisingera sp. NJS204 plasmid unnamed7, complete sequence | 38441-38468 | 5 | 0.821 |
NZ_CP013049_2 | 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder | 5105899-5105926 | 28 | NZ_CP038236 | Leisingera sp. NJS201 plasmid unnamed2, complete sequence | 9923-9950 | 5 | 0.821 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | CP030179 | Xanthomonas citri pv. punicae strain LMG 859 plasmid unnamed, complete sequence | 20822-20846 | 5 | 0.8 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | CP030179 | Xanthomonas citri pv. punicae strain LMG 859 plasmid unnamed, complete sequence | 143079-143103 | 5 | 0.8 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | CP030160 | Xanthomonas citri pv. punicae strain BD0022 plasmid unnamed, complete sequence | 72056-72080 | 5 | 0.8 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | CP030167 | Xanthomonas citri pv. punicae strain LMG7504 plasmid unnamed1, complete sequence | 71484-71508 | 5 | 0.8 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | CP030170 | Xanthomonas citri pv. punicae strain BD0025 plasmid unnamed, complete sequence | 3602-3626 | 5 | 0.8 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | CP030162 | Xanthomonas citri pv. punicae strain BD0023 plasmid unnamed, complete sequence | 52587-52611 | 5 | 0.8 |
NZ_CP013049_2 | 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder | 5105953-5105977 | 25 | NZ_CP030165 | Xanthomonas citri pv. punicae strain LMG7439 plasmid unnamed, complete sequence | 75057-75081 | 5 | 0.8 |
NZ_CP013049_2 | 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder | 5105899-5105926 | 28 | JQ809663 | Stenotrophomonas phage Smp131, complete genome | 10531-10558 | 6 | 0.786 |
NZ_CP013049_2 | 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder | 5105899-5105926 | 28 | NZ_CP021083 | Deinococcus ficus strain CC-FR2-10 plasmid pDFI2, complete sequence | 41688-41715 | 6 | 0.786 |
NZ_CP013049_3 | 3.2|5119277|36|NZ_CP013049|CRISPRCasFinder,CRT | 5119277-5119312 | 36 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 3785897-3785932 | 7 | 0.806 |
NZ_CP013049_3 | 3.7|5119586|34|NZ_CP013049|CRT | 5119586-5119619 | 34 | NZ_CP051182 | Thalassobius gelatinovorus strain NEB572 plasmid pAge77, complete sequence | 30119-30152 | 8 | 0.765 |
NZ_CP013049_3 | 3.7|5119586|34|NZ_CP013049|CRT | 5119586-5119619 | 34 | NZ_CP044550 | Hydrogenophaga sp. BPS33 plasmid pBPS33-1, complete sequence | 247075-247108 | 9 | 0.735 |
NZ_CP013049_3 | 3.7|5119586|34|NZ_CP013049|CRT | 5119586-5119619 | 34 | NZ_CP014942 | Rhodococcus sp. BH4 plasmid, complete sequence | 538608-538641 | 10 | 0.706 |
NZ_CP013049_3 | 3.7|5119586|34|NZ_CP013049|CRT | 5119586-5119619 | 34 | NZ_CP021767 | Ralstonia solanacearum strain RS 489 plasmid unnamed, complete sequence | 1500317-1500350 | 10 | 0.706 |
NZ_CP013049_3 | 3.7|5119586|34|NZ_CP013049|CRT | 5119586-5119619 | 34 | NZ_CP042916 | Rhodococcus qingshengii strain RL1 plasmid unnamed1, complete sequence | 130398-130431 | 11 | 0.676 |
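The Identity column in the table above appears to equal (spacer length − mismatches) / spacer length, rounded to three decimals. This relationship is inferred from the listed rows and is not a formula documented on the page. The sketch below checks it against the (length, mismatch, identity) combinations in the table; the function name `spacer_identity` is a hypothetical helper.

```python
def spacer_identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to 3 decimals.

    Assumption: identity = (length - mismatches) / length, which is
    inferred from the table rows rather than documented by the source.
    """
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    if not 0 <= mismatches <= spacer_length:
        raise ValueError("mismatches must be between 0 and spacer_length")
    return round((spacer_length - mismatches) / spacer_length, 3)


# (spacer length, mismatches, identity as reported in the table)
REPORTED_ROWS = [
    (28, 5, 0.821),
    (25, 5, 0.8),
    (28, 6, 0.786),
    (36, 7, 0.806),
    (34, 8, 0.765),
    (34, 9, 0.735),
    (34, 10, 0.706),
    (34, 11, 0.676),
]

if __name__ == "__main__":
    for length, mismatches, reported in REPORTED_ROWS:
        computed = spacer_identity(length, mismatches)
        status = "ok" if computed == reported else "differs"
        print(f"length={length} mismatches={mismatches} "
              f"computed={computed} reported={reported} {status}")
```

A check like this is useful when filtering hits by a mismatch threshold, since low-identity matches such as 11 mismatches over 34 nt are weak evidence for a true protospacer.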
1. spacer 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder matches to NZ_CP035424 (Leisingera sp. NJS204 plasmid unnamed7, complete sequence) position: , mismatch: 5, identity: 0.821
ccagca-gcgcctcgcaccgcagaagcct CRISPR spacer -catcgtgcgcctcgcaccgcaggagccc Protospacer ** *. ****************.****.
2. spacer 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder matches to NZ_CP038236 (Leisingera sp. NJS201 plasmid unnamed2, complete sequence) position: , mismatch: 5, identity: 0.821
ccagca-gcgcctcgcaccgcagaagcct CRISPR spacer -catcgtgcgcctcgcaccgcaggagccc Protospacer ** *. ****************.****.
3. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to CP030179 (Xanthomonas citri pv. punicae strain LMG 859 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
4. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to CP030179 (Xanthomonas citri pv. punicae strain LMG 859 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
5. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to CP030160 (Xanthomonas citri pv. punicae strain BD0022 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
6. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to CP030167 (Xanthomonas citri pv. punicae strain LMG7504 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
7. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to CP030170 (Xanthomonas citri pv. punicae strain BD0025 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
8. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to CP030162 (Xanthomonas citri pv. punicae strain BD0023 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
9. spacer 2.2|5105953|25|NZ_CP013049|CRISPRCasFinder matches to NZ_CP030165 (Xanthomonas citri pv. punicae strain LMG7439 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.8
gttgttccccacagcgccgagccgc CRISPR spacer accgttccccaccgcgccgagccgt Protospacer ...********* ***********.
10. spacer 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder matches to JQ809663 (Stenotrophomonas phage Smp131, complete genome) position: , mismatch: 6, identity: 0.786
ccagcagcgcctcgcaccgcagaagcct CRISPR spacer acagcagcgcatcgcaacgcagaagaac Protospacer ********* ***** ******** .
11. spacer 2.1|5105899|28|NZ_CP013049|CRISPRCasFinder matches to NZ_CP021083 (Deinococcus ficus strain CC-FR2-10 plasmid pDFI2, complete sequence) position: , mismatch: 6, identity: 0.786
ccagcagcgcctcgcaccgcagaagcct CRISPR spacer ccagcagcgcctcgcagcgcagcttctc Protospacer **************** ***** *..
12. spacer 3.2|5119277|36|NZ_CP013049|CRISPRCasFinder,CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 7, identity: 0.806
accgcatctcggtgatctgcaccgtcgacgccgtga CRISPR spacer gctgcatctcggtgatctgcatggtcgacgccggcg Protospacer .*.******************. ********** .
13. spacer 3.7|5119586|34|NZ_CP013049|CRT matches to NZ_CP051182 (Thalassobius gelatinovorus strain NEB572 plasmid pAge77, complete sequence) position: , mismatch: 8, identity: 0.765
ccccggccgctggctccggcacccgg---aagccgag CRISPR spacer tcctggccggtggctccggcacccggctttatcc--- Protospacer .**.***** **************** * **
14. spacer 3.7|5119586|34|NZ_CP013049|CRT matches to NZ_CP044550 (Hydrogenophaga sp. BPS33 plasmid pBPS33-1, complete sequence) position: , mismatch: 9, identity: 0.735
ccccggccgctggctccggcacccggaagccgag CRISPR spacer cgcccgccgctggctccagcacccggccaccctc Protospacer * ** ************.******** .**
15. spacer 3.7|5119586|34|NZ_CP013049|CRT matches to NZ_CP014942 (Rhodococcus sp. BH4 plasmid, complete sequence) position: , mismatch: 10, identity: 0.706
ccccggccgctggctccggcacccggaagccgag CRISPR spacer tcagcgccgccggcaccggcacccggaaggagcc Protospacer .* *****.*** ************** *
16. spacer 3.7|5119586|34|NZ_CP013049|CRT matches to NZ_CP021767 (Ralstonia solanacearum strain RS 489 plasmid unnamed, complete sequence) position: , mismatch: 10, identity: 0.706
ccccggccgctggctccggcacccggaagccgag CRISPR spacer tcacggccgatggctccggcacctggacctacac Protospacer .* ****** *************.*** . *
17. spacer 3.7|5119586|34|NZ_CP013049|CRT matches to NZ_CP042916 (Rhodococcus qingshengii strain RL1 plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.676
ccccggccgctggctccggcacccggaagccgag CRISPR spacer tcagcgccgccggcaccggcacccggaaggaacc Protospacer .* *****.*** ************** .
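For the ungapped alignments listed above, the reported mismatch count and identity can be reproduced by comparing the spacer and protospacer position by position. The following is a minimal sketch under that assumption; the function names are hypothetical, and it deliberately rejects gapped alignments (those containing `-`), because how the originating tool scores gap columns is not stated here.

```python
# Minimal sketch: count mismatches in an ungapped spacer/protospacer pair.
# Gap handling is intentionally left out; the tool's gap scoring is unknown.

def count_mismatches(spacer: str, protospacer: str) -> int:
    """Number of positions where two equal-length, ungapped sequences differ."""
    if "-" in spacer or "-" in protospacer:
        raise ValueError("gapped alignments are not handled by this sketch")
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str) -> float:
    """Fraction of identical positions, rounded to three decimals."""
    n = len(spacer)
    return round((n - count_mismatches(spacer, protospacer)) / n, 3)


if __name__ == "__main__":
    # Pair taken from entry 10 (spacer 2.1 vs. Stenotrophomonas phage Smp131).
    spacer = "ccagcagcgcctcgcaccgcagaagcct"
    proto = "acagcagcgcatcgcaacgcagaagaac"
    print(count_mismatches(spacer, proto), identity(spacer, proto))  # 6 0.786
```

Applied to the spacer 2.2 pair from entry 3 (`gttgttccccacagcgccgagccgc` against `accgttccccaccgcgccgagccgt`), the same functions give 5 mismatches and an identity of 0.8, matching the listed values.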
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2733914 : 2742452
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP013049|2733914:2742452|DBSCAN-SWA CTCACTGGATTCCCTCGAGGAATGGGTTGGCGCGGCGCTCGTCGCCGATGCTGGTGGCATTGCCATGTCCCGGCAGCACAACGCTGGAGTCATCGCGCGTAAGCAGCTTGGCGGCGATCGAATCGAGCAGCTGCTGGTGGTTTCCGCCGGGTAGATCGGTGCGCCCAATCGACATCTGGAACAGGGTGTCACCACTGAAAAGCACGTCGACGGGACCGTTCTCGCTATCAACCTCGATACCGAACACGACAGAACCGCGGGTGTGCCCCGGGGTGTGGTCGACGGTGAGCGTGATCCCGGCGAGCTCCAATTTGTCGCCGTCCCCGATCTCGACCAACTCCTTGGGCTCGGCGAACACCTGGCCCTGAATGAACTGGCCCAGCCCCGGACCGATCCCGGCCAGCGGATCGGCGAGCATCACGCGGTCATCGGGATGGATGTACACGGGAATCCCATATTCATCGGCAAGAGGCTGCGCTGTCCAGGTGTGATCCAGATGCCCGTGCGTCAGCAACACCGCTTCCGGGGTGAGATCACGCTCGGCGAGAATCTCCTTGACGCCCCCGATGGCGTCCTGCCCGGGGTCGACGATGACGGCCGGGCCGCCCTCGTGCGGCGCCACGATATAGCAGTTCGTCGCGAACATCCCCGCGGGAAAACCGGTGATCAGCACCCGACCAGCTTAGGCGGCGGGTCCCGTGTCTTCTCTCAGGTGTGGCTGGCACACTCTCTGGCGACCTACATTCTTCGATGATTCTGCAATTGACGCCCTAGAAGGAGAGCCCGGTGCCGACCAACGAGCAGCGACGTCAGACTGCCAAGCGCAAGCTGGAGCGACAGCTGGAGAACCGTGCGGAGCGCGCCCGCAAGCAGCGGATCTGGGCGATCAGCGGTGTCGCTGTGCTCGCTGTCGTCGTGGCGGTAGCCATCACCGCGTGGTTCGCCATCAGCAAGGGCGATAAGACCGACACGGCTCACGACACCGCGTCCACCACGCCGGTATCGACGGTTGACGCACCTCCGACGTCGTCATTGCCGACGCCCGACGAACCACCTCTCGCCGGCCCGGCGCTGCCCAAGTTCGTACCGTCCGCCGAGGTGGGCGCCAACTGTCAGTACCGACCGTCGGGCAAGGCTTCCAAGACCATCGAGAAGCAGATACCCCGGACAGGGAAGATCCCCACCGATCCCGCGACGCTCACCGCCAGCATGAAGACCAGCCAGGGCAACATCGGGCTGGTCTTGGATAACGCCAAAGCGCCCTGCACGGTGAACAGCTTCGCGAGCCTGGCGCAGGGCGGGTACTTCGACAACACCGTCTGCCACCGGATGACCAGCGGCGGCCTGTCGGTACTGCAATGCGGTGACCCGACGGCCACCGGCTCGGGCGGCCCGGGATACGAGTTCGACAATGAATACCCGACCACCCAGTACAAGGCGGATGATCCGGCGGCGCAGCAGGCCGTCGTCTATCCCGCGGGCACGCTGGCCATGGCCAACGCGGGCCCCGGAACCAACGGCAGCCAATTCTTCATGGTCTACAAGGATTCGAAGCTACCGCCGCAGTACACCGTCTTCGGGACGATCGACAAGACCGGGATGGACACCCTGGCCAAGATCGCCAAGAACGGCATCAAGCCCGGCGCACGTGGCGCGGGCGACGGAGAGCCCGCACAAGAAGTGAAGATCATCGACCTGCAGCTCGACTGACATGATCTCTCGTCGCGCACTTCTTGTCGGTGGGGTATCGGCGGTCGGTGTGGTGGCTGCCGGTTGTTCGCGCAACGGCAACCGTCGGCCCGCGCCGGACGAACTCGCCTCGCTGGAAAAGGATTTCGGCGGCCGCATCGGTGTCTACGCGTTGGACACCGGGTCGGGTGACACGGTCGGCCACCGCGCCGATGAACGCTTTCTGATGTGCTC
AACGGTCAAGACCTTCATCGTCTCGGCCATCCTGCGCCGGAGACTGAGCGAACCGGGCCTGTTGGACCAGCGAATTCAGTACACGCAATCCGACGTTCTGGAATGGGCGCCGATCACCTCGCAACACGTGTCCACCGGAATGACCGTCTCGGAACTGTGTGATGCGACGCTCCGCTATAGCGACAACACCGGTGCCAACCTGCTGATCACCCAACTCGGCGGCCCGAAGGAGACAGAGAAGTTCGTCCGAAGCCTGGGCGACAACGTCACTCGCATGGACCGCACGGAGGTACAGCTGAACATCCCCGACGGCGATCTGGATACCTCGACCCCGCAGCAGCTGGTGGCCAATCTGCGCCGACTGGTCCTCGACGAAGGGCTGGATTCACGGGGACGGGATCTGCTGACCGATTGGTTGAAGCGAAATACCACCGGCGACCAGTCCATTCGGGCAGCAGTTCCCGCCGGGTGGACGGTCGCAGACAAGACCGGCGGCGGCTTCAAGGGTGAAACCAACGACATCGCGGTGATCTGGCCCCCGGGCCGTGCACCCATCGTGATGGCGGTACTCACCGTCCCGGAGGACCCCACATCCACCAAAGGTAAGCCGACGATTGCCGCGGCCACCCGAATCGTGCTGCGGGCCTTCGGCGCTTGATCGTCTGACGCGGCTCGCGAGATCTACGTTCTGCAGACGCGTACTCGCACTTTCGCGTCACAATGTCAATCTCGGCGTCTGGCGGCGCCGGTAGACCGCCGCTAGGCGGCCGAGGTCACCCGGTAGACGTCATAGACGCCCTCGACATTGCGCACCACGTTGAGGACATGCCCCAGGTGCTTCGGATCGCCCATCTCAAAGGTGAATCGACTGATGGCAACTCGATCGTTGGAGGTGGTCACCGAGGCCGACAAGATGTTCACCCGCTCATCGGCCAGTGCCCGGGTCACGTCGGACAGTAGCCGGTGCCGGTCCAGGGCCTCGACCTGGATGGCCACCAAGAAAACCGAAGACGCCGACGGCGCCCAGTGCACCTCGATGATCCGTTCCGACTGCTGCTGAAGGGATTCCGCGTTGGTGCAGTCGGTCCGGTGCACGCTGACTCCCCCGCCCCGGGTTACGAAACCGAGGATCTGGTCACCGGGTACCGGTGTGCAGCACTTGGCGAGCTTGGTCAGCACCCCGGGCGCCCCTGGCACCGCCACCCCGACATCGTCGGAGCGCCGCTGCCGGATGGGCATGGTCGACGGGGTGGACCGTTCGGCCAACTCTTCTTCGGCGCTGTCCACACCGCCGAGCTGTGCGACCAATCGCTGCACCACGTGCCGGGCCGAAACGTGCCCCTCGCCCACCGCGGTGTAGAGCTGTGAGGTGTCCTGGTAGCGCAGCTCACGGGCCAGCGAGGCCATACCTTCGCCGTTGACCAAACGCTGCAGCGGAAGACCACCGCGGCGCACCTCACGCGCGATGGCGTCCTTGCCGGCCTCGAGGGCTTCCTCGCGCCGTTCCTTGGCGAACCACTGCCTGATCTTGGCCTTGGCCCGCGGCGACACCACGAAGGTCTGCCAGTCGCGCGAAGGTCCCGCATTGGCCGCCTTGGAGGTGAAAACCTCCACCACTTCCCCGTTTTCGAGCTTGCGTTCCAGCGCCACCAGCCGGCCGTTGACACGGGCGCCGATACAGCGATGCCCCACCTCGGTGTGCACCGCGTATGCGAAATCCACCGGCGTGGAACCGGCGGGAAGTGTGATGACGTCGCCCTTGGGCGTGAAGACGAAGATCTCCTGGACCGCGAGGTCGTAGCGCAACGATTCCAGGAATTCCCCGGGGTCGGCGGCCTCACGCTGCCAGTCGAGCAGCTGGCGCATCCACGCCATGTCATCGATCTCGGCGGCGGCATGGGTGGGCGGAACCCCGTTGCGGCCCTTGGCTTCCTTGTACCGCCAGTGCGCGGCGATGCCGTACTCGGCGGTGCGGTGCATGTCCCGGGTGCGAATCTGCA
CCTCCAGCGGCTTGCCTTCCGGACCGACCACGGTGGTGTGCAAGGACTGGTAGACACCGAAACGCGGTTGGGCGATGTAGTCCTTGAATCGTCCCGCCATAGGCTGCCACAGGGAGTGCACCACGCCCACCGCGGCGTAACAGTCACGAATCTCGTCGCACAAGATACGCACACCCACCAAGTCGTGAATGTCATCGAAGTCGCGGCCCTTGACGATCATCTTCTGGTAGATCGACCAGTAATGCTTGGGGCGCCCCTCCACGGTGGCGCTGATTCGCGAACTATTAAGAGTCGCAACGATTTCGGCGCGTACCTTGGCCAGATAGGTATCACGTGACGGAGCGCGATCCGCGACCAGCCGCACGATCTCCTCGTACTTCTTGGGATGCAGGATCGCGAACGATAGATCTTCGAGCTCCCATTTGACCGTGGCCATACCCAGCCGGTGCGCCAGCGGCGCAATCACTTCCAGCGTCTCACGGGCCTTGCGGGCCTGCTTCTCCGGCGGCAGGAAGCGCATGGTGCGCATGTTGTGCAACCGGTCGGCCACCTTGATCACCAGCACACGCGGGTCACGCGCCATCGCGATGATCATCTTGCGGATGGTTTCGCCTTCGGCGGCGTTGCCCAGCACCACCTTGTCGAGCTTGGTGACGCCGTCAACGAGGTGAGCTACCTCGGTGCCGAATTCGGCGGTCAGCTGCTCCAGGCTGTATCCGGTGTCCTCGACGGTGTCATGCAGGATCGCGGCCACCAGCGTGGTGGTATCCATACCCAGCTCGGCCAGGATGTTGGCCACCGCGACCGGGTGGGTGATGTAGGGGTCCCCCGACTTGCGGAACTGCTCGGCATGCCGGCTTTCCGCTACCTCGTATGCCCGCTGCAGCACCGACCGGTCGGCTTTGGGGTAGAACTCGCGGTGAACAGCGATGAGCGGTTCCAGGACCGGGCTGACCGCGCCGCGCTGGGCTGTCATCCGCCGCGCCAGCCGCGCCCGCACCCGGCGCGAAGCGCTCATGGTGCCGCCGCTCACCCGCAGAGACTCGGTTACCGGGCCGGGCTCACTCAACGCAGGCAGGGATTCGACCGCGACCTGATTCTCGCTGGGCACGTGCTCGTCAGCCACGGTTCACCTCCTACCGCAATAGATTATCTCTCAGATCGCGCGCAGCGCAGTTAATGGCAGCGGCGCCACGGCCGCGCGGCCGCCCAGATCCGAGAGTTCCAGGACAACGCCCGCACTGATCACGTCGGCGCCCGCGGTGGTCAGCAGCCTCGCGGTTGCCGCCAGCGTCCCGCCGGTGGCCAGCACATCGTCGACGATCGCCACGCGGCGGCCCGCAAGTTCCACCCCGTCTGCGGGGATCTCCAGCGCCGCGGTGCCGTATTCGAGCTGGTAGGTCTGCGCATGCACCGGTGGTGGCAACTTCCCTGCCTTGCGCACCGCCAACACGCCGACCCCGAGCCGGATCGCCACCGCGGCCCCGAGCAGAAAGCCGCGGGCATCGATCCCCGCGATCAGCGTGGCCCCCGATGCCGCCTGTGCCAGCGCGTCGGTGACGCGTGACAATCCGTCGGCATCCGCGAACAGGGGCGTCAGGTCCTTGAACTGCACCCCGGGCTCGGGAAAGTCGGCGACCTCACGGGTCAGTGACGCGACGAGTTCGGCTATCTCATCGGGGCCCGTCACGTGGGCAGCACCCAACGATCCATGTTCCAGCCGGCACCCCACCGGGTGACGCTCGGGGTGGCGGCGTACATCGACTTCGGGGCGATCAGCACACGCGGCTGCCGATACAGCGGCAGTGTCGGCAGATCGTTCCATAGGATCGCCGAGGCGTCGCCGAGCAGCCGACTCTGATCCCGGGCGTTGGTTGTCACCGCCAGGGCGGCGATGATGCCGTCGATCTGGCCGTTGGCGTAGTTCGCCGGGTTCTTTCCTTCGCGTGAACGCAGCGCGTACGCATCCATCAACCACGAACCGGTCGAGCCGCTGCCGG
GCACGCCGCCGATCGAGCTGAGCAACGCGTCAACCTGGTTGTCCCGGAGCGAATGCGGTCCGGTTTCGGGCGAACTGACATCCTGCACCGTGATACCCGCGGGCTCGCAGGCCTGCGCCATCGCGGACACGATCGCCGCGCGACGGGGATTCGGCGCCTGATATCCCACCCGCAGAGTCAACGGACGGTTGGCGGCGGCCGCCCGTGCCGCTATCGGATCGGAGCGCATGAACCGGCCCGCTTCTGCGGCAGCCTCCACCGAGGCGAAGGAGTCTCCGATGCTGGTGTCGAGCCGTGCGTTCATCACCGGCACTCCGGCGATTCCCGCGATGGCATCCCGCGGCGTGCACAACGCGACCGCACGGCGCAGATCGGGAGCTCCGACCGATCCCCCGGCACCGAAGATCAGTTGCTCCACACCCGACCCGGGCTCCTCGGTGCGCACAAATCCGTCTGGGGCGGTGAGAGTCCCGGCCGATCCCGCTGCGACGTCGACCACCTCAAGATCGCCATCGGACAGCCGCTTTTGCACGTCAATCCCGCTGGGCCACACCGTGACCCGCTTGGTCACCGGCGGGTTGCCCCACCACTTGTCATTAGCCACCAACACCACGGAGCCGTCGTCGTTGTACCCATCGAGCTTGTACGGACCTGATGACGGGAATTTCGACAGGTCCAGATCGGTATCGAGGTTCCACTGGTTGTTCCAGAAATCGGCGATCTTCTGCAGTGCCGGAACATCATTGGAGGCGATCGCGCCCACCAGATCGATGTTGCCGCCCAGCTTGTCGGCGACCACGTGCCACGGCAACATCGTGGTGGCGGTGAACAACTGGGTCCAGTCGGTAAAGGCACGATCGGGCGCGAAGGTCACCCGCGCGGTCTTTTGGCCCGGCTTGCAGTCGACCGTGGAGATGTCGGCGTATCCCGCCGTGCTGGCGGTATCGAAATCGGGCATCCGGCCGGATTGCGCGAACCAGGACAGCACCATGTCTTCGCAGGTCACCGGCTTGCCGTCGGAGTAAACGGCCTTTTCGTTGATGGTGTAGTCAAGTACCAGCGGAATGCGGCCCACTACCTCGACGGTGCCGAAATCGTGATCGGCGATCACCTGACCATCGGGTCCGTGGAAGCCGAAACCGGTGAGCACACGGTTGAACGCCTGTGCTCCGGCCGAGGCGTTACCCGCGACGCTGTTGACGTTGTATGTCGTGAGGCGACCGTCGACGGCGTAATCCAGCGTTTTCGCCGCCGAGGTCCAACACGAGGTCAACAGCCCCGCCGCGAGCAGCACCGTGGTCACAATGGCGCCGATTCGCGCGATTCGTCTTGCCCCGGTCCGTGCTGGACTTCGTGAATCCATCAGTGCCGCCGCGTGTTTCGCTTACCTTGAGGGCGGGCGCCGGGCTTGGGTTTGGCGGGGCGCACTCCGGTCGCCGCGGCATCGGATGTCCCCGATGTCGCGACCTTGGCCGTCTTGCTTCCCACACCCGCGTCTGCCAGCGGTGCGGCATCGGGCTTGCGACGGCTCAGCACCTTCTTGGTGTGGCTGGCAACCAGATCCGTGCGCTCACGCAACGCCACCAACAGCGGGGTGGCGAAGAAGATCGACGAATAGGTGCCCACCAAGATGCCGACCAGCTGCACCAGCGCCAGATCCTTCAGGGTGCCCACGCCCAATAGCCAGATGGCGACCACGATGAGCGAGAGCACCGGGATGGCCGAGATGAGGCTGGTGTTGATGGAACGCATGAACGTCTGGTTCACGGCCAAGTTGGCATGTTCGGCATACGTGCGCCGCGAGGTGTGCTGGAAGCCATGGGTGTTCTCTTCCACCTTGTCGAACACGATCACCGTGTCGTACAAGGAGAATCCGAGAATCGTGAGCAGACCGATCACGGTCGCCGGGGTGACCTCGAAGCCCACCAGGGAGTACACACCCGCGGTGACCAGCAGGTCGAAAACCAGCGCGGCCAGCGCGGCCAGCGACATGTAGCGCT
CGTAACGCACGGCGATATAGATGCCGGCCAGCACCAGGAACACGCCCAGCGCCCAGAGCGCCTTGTTGGTGATCTGGCTGCCCCAGGTTTCGGAGACATCCGAGTCGCTGATGGCGTCGATGGACGGCTTGCCGTCCGGGCCCTTCGGCGCGAATCTGTCGAAGAGCGCCTTGTGCAGCTTCGAGGATTGCTCGTTGTCCAGCTTCTCGGTGCGGATCTGAATGGAGGCCGAGTCGCCGGTGCCGGTCACCACGATCGATTCGGCGTCGTGTCCCAGGGTCTGGCTGAAGACATCCTCGGTCTGCTGGACGGTGGCGTCCCCCTTGGGGAACCACACCTTGGTACCGCCCTCGAAGTCGATGCCGAAGACGAAACCCTTGATCGCGATCGACAAAATCGCGACCACCATGAACGCACCGCTGATCCCGAACCACAGCGTGCGTTTCCCGACGACGTCGACAGCGCCAGTACCGGTGTACAGGCGGTAGAGGAATCCGTGTTCCGGTGCGGCGCTAACGGTCTCAACCGCCGTGGATTTGGGCTCCTGCGCGGTGCTTGCGGTGGCGTCCGTGGCGTCTTCCGCGACGTCCTTGTCGGTGCCGTTGCCGTTGCTCAC
Protein sequences of DBSCAN-SWA_1 >NZ_CP013049|2733914:2742452|2738992_2739526_-|WP_005082386.1|DBSCAN-SWA MTGPDEIAELVASLTREVADFPEPGVQFKDLTPLFADADGLSRVTDALAQAASGATLIAGIDARGFLLGAAVAIRLGVGVLAVRKAGKLPPPVHAQTYQLEYGTAALEIPADGVELAGRRVAIVDDVLATGGTLAATARLLTTAGADVISAGVVLELSDLGGRAAVAPLPLTALRAI >NZ_CP013049|2733914:2742452|2733914_2734586_-|WP_005068898.1|DBSCAN-SWA MLITGFPAGMFATNCYIVAPHEGGPAVIVDPGQDAIGGVKEILAERDLTPEAVLLTHGHLDHTWTAQPLADEYGIPVYIHPDDRVMLADPLAGIGPGLGQFIQGQVFAEPKELVEIGDGDKLELAGITLTVDHTPGHTRGSVVFGIEVDSENGPVDVLFSGDTLFQMSIGRTDLPGGNHQQLLDSIAAKLLTRDDSSVVLPGHGNATSIGDERRANPFLEGIQ >NZ_CP013049|2733914:2742452|2735624_2736494_+|WP_005091054.1|DBSCAN-SWA MISRRALLVGGVSAVGVVAAGCSRNGNRRPAPDELASLEKDFGGRIGVYALDTGSGDTVGHRADERFLMCSTVKTFIVSAILRRRLSEPGLLDQRIQYTQSDVLEWAPITSQHVSTGMTVSELCDATLRYSDNTGANLLITQLGGPKETEKFVRSLGDNVTRMDRTEVQLNIPDGDLDTSTPQQLVANLRRLVLDEGLDSRGRDLLTDWLKRNTTGDQSIRAAVPAGWTVADKTGGGFKGETNDIAVIWPPGRAPIVMAVLTVPEDPTSTKGKPTIAAATRIVLRAFGA >NZ_CP013049|2733914:2742452|2741198_2742452_-|WP_005111391.1|DBSCAN-SWA MSNGNGTDKDVAEDATDATASTAQEPKSTAVETVSAAPEHGFLYRLYTGTGAVDVVGKRTLWFGISGAFMVVAILSIAIKGFVFGIDFEGGTKVWFPKGDATVQQTEDVFSQTLGHDAESIVVTGTGDSASIQIRTEKLDNEQSSKLHKALFDRFAPKGPDGKPSIDAISDSDVSETWGSQITNKALWALGVFLVLAGIYIAVRYERYMSLAALAALVFDLLVTAGVYSLVGFEVTPATVIGLLTILGFSLYDTVIVFDKVEENTHGFQHTSRRTYAEHANLAVNQTFMRSINTSLISAIPVLSLIVVAIWLLGVGTLKDLALVQLVGILVGTYSSIFFATPLLVALRERTDLVASHTKKVLSRRKPDAAPLADAGVGSKTAKVATSGTSDAAATGVRPAKPKPGARPQGKRNTRRH >NZ_CP013049|2733914:2742452|2734699_2735623_+|WP_005093596.1|DBSCAN-SWA MPTNEQRRQTAKRKLERQLENRAERARKQRIWAISGVAVLAVVVAVAITAWFAISKGDKTDTAHDTASTTPVSTVDAPPTSSLPTPDEPPLAGPALPKFVPSAEVGANCQYRPSGKASKTIEKQIPRTGKIPTDPATLTASMKTSQGNIGLVLDNAKAPCTVNSFASLAQGGYFDNTVCHRMTSGGLSVLQCGDPTATGSGGPGYEFDNEYPTTQYKADDPAAQQAVVYPAGTLAMANAGPGTNGSQFFMVYKDSKLPPQYTVFGTIDKTGMDTLAKIAKNGIKPGARGAGDGEPAQEVKIIDLQLD >NZ_CP013049|2733914:2742452|2736595_2738962_-|WP_005068903.1|DBSCAN-SWA 
MADEHVPSENQVAVESLPALSEPGPVTESLRVSGGTMSASRRVRARLARRMTAQRGAVSPVLEPLIAVHREFYPKADRSVLQRAYEVAESRHAEQFRKSGDPYITHPVAVANILAELGMDTTTLVAAILHDTVEDTGYSLEQLTAEFGTEVAHLVDGVTKLDKVVLGNAAEGETIRKMIIAMARDPRVLVIKVADRLHNMRTMRFLPPEKQARKARETLEVIAPLAHRLGMATVKWELEDLSFAILHPKKYEEIVRLVADRAPSRDTYLAKVRAEIVATLNSSRISATVEGRPKHYWSIYQKMIVKGRDFDDIHDLVGVRILCDEIRDCYAAVGVVHSLWQPMAGRFKDYIAQPRFGVYQSLHTTVVGPEGKPLEVQIRTRDMHRTAEYGIAAHWRYKEAKGRNGVPPTHAAAEIDDMAWMRQLLDWQREAADPGEFLESLRYDLAVQEIFVFTPKGDVITLPAGSTPVDFAYAVHTEVGHRCIGARVNGRLVALERKLENGEVVEVFTSKAANAGPSRDWQTFVVSPRAKAKIRQWFAKERREEALEAGKDAIAREVRRGGLPLQRLVNGEGMASLARELRYQDTSQLYTAVGEGHVSARHVVQRLVAQLGGVDSAEEELAERSTPSTMPIRQRRSDDVGVAVPGAPGVLTKLAKCCTPVPGDQILGFVTRGGGVSVHRTDCTNAESLQQQSERIIEVHWAPSASSVFLVAIQVEALDRHRLLSDVTRALADERVNILSASVTTSNDRVAISRFTFEMGDPKHLGHVLNVVRNVEGVYDVYRVTSAA >NZ_CP013049|2733914:2742452|2739522_2741199_-|WP_005093597.1|DBSCAN-SWA MDSRSPARTGARRIARIGAIVTTVLLAAGLLTSCWTSAAKTLDYAVDGRLTTYNVNSVAGNASAGAQAFNRVLTGFGFHGPDGQVIADHDFGTVEVVGRIPLVLDYTINEKAVYSDGKPVTCEDMVLSWFAQSGRMPDFDTASTAGYADISTVDCKPGQKTARVTFAPDRAFTDWTQLFTATTMLPWHVVADKLGGNIDLVGAIASNDVPALQKIADFWNNQWNLDTDLDLSKFPSSGPYKLDGYNDDGSVVLVANDKWWGNPPVTKRVTVWPSGIDVQKRLSDGDLEVVDVAAGSAGTLTAPDGFVRTEEPGSGVEQLIFGAGGSVGAPDLRRAVALCTPRDAIAGIAGVPVMNARLDTSIGDSFASVEAAAEAGRFMRSDPIAARAAAANRPLTLRVGYQAPNPRRAAIVSAMAQACEPAGITVQDVSSPETGPHSLRDNQVDALLSSIGGVPGSGSTGSWLMDAYALRSREGKNPANYANGQIDGIIAALAVTTNARDQSRLLGDASAILWNDLPTLPLYRQPRVLIAPKSMYAATPSVTRWGAGWNMDRWVLPT |
7 | Streptococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
3332189 : 3349694
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP013049|3332189:3349694|DBSCAN-SWA GTCACGGCATGTCCTCCTTAGTGTCGTCCAGGTCTTCGGGTTGTTCGTCGTCGGTGTCGGCCAGGTCGGCGGTGTATTCGTCGCGGTCCTCGGCCGGAACCTCGGCGCGGCCACGGTCGTAGACGCGTTGGCCTTTGCGTTTGACCATGTCGATCAGTGCGTCGCGCACGTCGGGATCCAGCTCGTCGATTTCCTTGGCGCGTGCACGTGCCTCGCGGCGCGCATCCCGGCTACGGGCGCGCGGCGTGTCTTCCTTCTTCACGACCCACTCGACAGCGTCAACCAGGCGCCCGCTCTGATCCGGCAGGCGCCGCGCGCGAATGAATGCCTTCTCGTCGTCGACGTCCGCGCCGACCAGCGCGCCGTGGAACATCAGCAGCTGCAAATGGTCCTCGGGTAGTCCGAGTCCCCAGCCGTTCGGGCCGACAGCGTCGCGGAACCCCTCGCACAGCCGGTCCTGCCGCGCGAAGATCTCGTCAAGTTCAGCCTTCGTGAACTTGCGGTCGTATGGGAACTTCGGGAAGACCTTGCCTTGCTTCTTGTTTCGGCTGCTCATCAGAACATGTCCCCGCTTCCTGCCAGGAGGGCCGCGAAGTTCGCGACGTTGCCGATAGTGCGGAACCCGCGCGCGACCGGGTCCTCGTCGCGGGAGTCGTCACCGAACGACACCGTGGGCCGGCCTGCCGTGGTGCGGTCACCTTCGCCCTTGATCGCCATGATCTGGTCGGTGTAGACCACACCCCGGATTTCCGCTGACACGCGGTCGCCGAGCCAGAAGTCCTCGCCGAGGATGTAGGGCTGCCCGTCGCCGACGTCGAACTTCATAGACCGGTACGGCTTCATTTCGTAATCGCCGGCCGCCAGCTCTTGGATGGCGTTGATTACGTATGCGGTGCCGCCCGGATTTTTGAAATACTCACGGAACGCGTAGCTTCCGGTTTTCGCCGACCGAATCGGGTTGACGTACCGCATGAACGCCAGGAACACGTCGTCAAGCTGGCCTTGGTACAGGTTGTCCAGACCTTCGACGCCAGCGGCCTCGACGCCCATGATCACTTGGGCCAGTTGGGAAATGCCGTAGCGGATCGCGAACGTGATTGCCTGGTTGACCCATTGGGGCGATTTGCCGCCGACGATGATGTCTGTGGCGCGGCTCTTGTATATCCGCAGTTTGCGGCGCCGGATGTTTCCGTATCCGACGTCTCGGTACACGAACGGCGGCGGCTTCGGGGAAACCAGCAGCAGCTTCCGGAAGAACGGGTCGACCTCGCCGTCGTGGTCGGCGTCGATCGGGATCAGCGTCTCGGTGATCAGGTCATCGAGGGTGGCGGCGAACAGGTTGATCGCGCCGTCGAGCATGGTGCCCGTCGGGCCCGTCACACCGGACTTGTCTTCGAAACTCAGGATCACGCATGCGCGTGTGGGCTTGAGGATTTCGGCGAGTTCGGGGCCGAACATCGTGTACGGGGCCGGATCGCCGGGAAGCCAGGTGTAGGCGCGGCAGATGACACCGGCGTCCTTCATCACCGGCGACAACACGGTATGGGCGTCCTTCCAGCGCGAACCTATTGTGCACCAACGTGATTGGTCGAAGAGACCCGCCACAGGCATGATCTGCACCGGCCAATTCAGCGGCGAGAGGTTCTGCAGCCACGACTCGGGCGCGAAAATGTTGCGCGGTACCGGGAAGAATCCGTTGAGCGTGAACAGCCGAATGCAGTTGAAGAACATGGCCGTTGCGCAGGTGGTGATCGTGGGGCCACCCCACAGGAACATCTTCGGCAGCTGCACCTCCATCGGGAAGATGGGGTTGGCTGCAAGGTAGATTCCCTTGAGGTGGCGTCGATTTGATATGCATTTGAGGGTGGTGACGGCAGCCTTTCCGGCTTCTTCGTCGTCCTCGATGACCTGGACCTTGCCGCC
CCAGCGGGTCCGGAAGTCGTGCGGCTTGTCCGGGTCGGGGTCAATGGTGATGTGAATGTCTTCGTCGTCACCGATCTGGTAGGTGATGATTTCCCGGAGCCAGTCGTTGGCCTTACCGGAGAATGTGATGTTGGCTTCGCCGTCCTCGGTGGCCAACTCTTCCCAGTCCCACTTGTCGAGGTTCTCGACGCGGGCGATGAACCGGAATTCTTTGTCCCAGATACGAACAAGTGGCGCTTTGGTGCGCCGGTTCATGTACGCCCAGCGGCGCTCCAGCAGATGCATCCGCAGCTCGGGGGTGAACTCGCCGGCGAGTGTCTTGGGGTCCAGAGTGAGGGTGGGCGCGGTCATGCGAGCGCGCTTTCGAAGCGTTGCGGCAGCTGGCACCAGATCTTGCCGCCAGCCTGGTTATGCGAGACCGGGATAGTGGCAACGGTTTTCGGCGGGATCGGCACCGAGAAGCCCTGGCCGCGGAACCGCTGCAGCAACGGCAACCCCGAATCGCCATAGTTACTCAGGATCCAGTTCAACAATTCGCTGTTGCGGATGAACTTTTTCACCAGGTTGTCGGGCGGATCTTGCGCGGTGATCGCGATCCGGTGCGTCGGGTCGGTGTCGATGACGCAGTGCTCCCCGGGGTTCAGTTCGGGGACGTCGATCATGTTGGCGTCGCGGGCGCGGGTGAACGTGCCGAGGATTTCGTCGACGAACGGGATCCCGAACAACCGAGACAATTTCGGCCAGTCATCGAACGGGTTCTCTTCGCTCGACACGATGGCGTTCGGTCCGTCGCCGAGACGCACCTTCGACGGTGCAGTCTTGGATGCCTGCACAAAGAAGATCGGCCACGCCGGTTCGGTGGAGCGGTTCGCGATCCGAATGAACCCGAGGCTGGGCCCGCCGGGTGGGCGGATAAACGGGGGCGGCGCGGTGTCGGGACGGTGCCAGCGCGGTTCACCGTCTGCGGCCAGGATGATTTCGTGCAGAGCGACGCGCTGCAGCGCGGGGTCGTCGGGAAGCGCGCATTTAGGCGCTTCCAGCAGTTGCATCGGGATCCACAGCTGCCCGTGCCGACGGGTAGTGACCGTGAAGTATCCGGTCGCGTCCTTGCGGCAGCCGCGCCAGAACCGGGCCTCGGTGTCGTACCAGCCCAGCGATGAATCGGACATCAGGCCCAACGTGAAGGAGATTTCGCGTCGGCCGTCGACGGTGCGTTCGAAGCGGGGCGGCCCGTAGGCGGGGGTGGTCCACACGCCTTCGAACGGGACGTGCACCATGCCGTCGATCGGGCCGGTGATGAATGCGCCTTCGCTGCCCGCGAGTTGCCCGGTGAGCGGCCAATACCGGCCGTCCGACCCGATCCATGCGCACGAGACCGCTTCGCCGCGTGCAGCGGCGGACAGCTTGGACCACGGCACGTTACGGCGGGGCCCGATCAGTTGCGCGCTCGTCATCAGCGGCCACCGCCGACGGGTTCGTGCGTCATCTGGCGCGGTGTGTTCAGCAGGACGCGACGGGTGCGGTCGGCGATCGAGCGTTCGTCACCCTGCGGGTTATTGATGGTGAGATTCACGGACTGATCGACGGGGCCGTTCCCGACGCCCGGAGGCATACCCGAACCCGGATGCGCCGCAGCGCCCGTCAAAGACGGCAGCATCGACGACACTTCGGGCACCATCCCGAACGGCAGGCTCGACGTGGCCCCGCCACCGTCGAAACTGGCTCCCGGGAACCCACCGCCGCCGCCGGCACCGCCCATGCCCCCGGCACCGCCCGCGCCGCCGAGCAGTCCGCCCGCGAACCCGGTGCCCTGCGGGGTGTACTTAATGCCCATGATGGCCTTGGCCAACTTCACGATGCCCAGGTCTTCGATGTTCGGCAGGAACGAACCGTCGAGGCCGAACGTCTCCATGAGTCCGCCGCCGAAGATCTTGCCCAGCTCGCCGACGTCGTCACCGCCGCCCTTCCCGCCGACCGCGTTCTTGGCTTCCTTTGCAGCG
GTGAACTTTCCGCGCCTGGCGTCCTCTAGGTCGGCGCGTGCGTCCCCGGCTTCGCGTTTGGCCTTATCCAAAGCGCCCTGCGCGGCGAGCTTCTCGGAATCCTTGGCGTCGGCTTTGAGTTCACGCAGCCGCGCCTCGGCCTCCTTGACGCGTTGATCGGAGTCCTGAACGCGTTCCTCTGATTCGCGGACCTTGCGCGGATCGGCCTCGTAGTAGCCGGGTTCGCCGCCCGGCCCATAGCCGGGGGTGCTTCCGGGCGGGCCCGCCTGGTACCCGCCGCCGAACGCCCCTGCTCCCCCGCCGAATCCGCCGCCGCCACCGGCGAGCATTCCGCCGCCGCCCATGCCGCCGGGGCCCTCCCCGTACAGCTGCGGCAGATACATGTGCTGGTCGAACTGCGGGCTGGTCGCGCCGGCCGCACCCGCACCGACTAGGAAGTTGCCGTGCGAGCCGCCCGATTCGGCATTCGAGCCGTCCGACAAGGTCATCGCCATGTGGCCGTCGTTGGGGTTGGGTCCGTGGTCGTACCAGCCGACCGTGATCGTTCCGGGGCCACCGATTCCGCGCCGGAATCCGCGCGCCGTGAGCCACTGCTCGGCGTTCTTCGTGTTCATCAGCGCGCCGTCGGGCAGGCCCATCGCCCGGTTGATGACCCGGGACACCATTCCCGAGCAGTCGTTGCGGTTGGCCTGGCTGTACGGGGTTCCGACAAATGACTGCGCAGCCATCACGTCCGGGCCGCGTAGACCGCCGACCTCAAACCCTTGGATAGCTCCGAGACGGCGACCGGTTTCAAGCCAGATGTCCAGGGATCGTTTCCCGCCGTCCAACGGGATAAATGCTTCGCCGCCGGTGGATGGTTCGGCCCACTGCACAAGACCCGCCCCGGGCACGGCGGGCTGAATCACCGCGCGGCTGGGCAGGGTGCCGCCGTTGGCGAACGATGCCACCGAATCCCACACGTCGTAGATTCCGCCGGTCGCCCGCGCCGGGATGAACGAGGCAACTGAACCGGGCGCGGGTGCACTGGCGGGGCCGACAGCCACAGGCAGCTGCGGAGCTTGCGCTGCGTATCCGCGGAACATCGCTTCGATGTCCTGTTTGGCCTTCGAGGTATCCGCGCCGACCGGCACTTCGGCGGTCTGGTTGCCAACGGACTTGCGCCAGGCATCCAGGATCTTCTGCCCCTCAGCTGTATTCGCGGTGACCGTGACGGTTTTGTCGGGCAGCTCCTGCACTTGGATGCCGAGCGCGGCGAGCTTGGCCTGAACCTCGGGTGTGTTCGCCGAAATCTTGATGGTCTTGCCATCAGGCATGGCCGCGGTGACATCGCCGAGCGCGGAAGTAAGACGCGCGGCCGTGGCCATGTCCTCGCCGGTGGCCTGCACCCGATGTCGCAAGTTGAACAGACCGTCCGCGGCGCCGTCGAGCTTGTCGGCCAGTCCGCGCATGTTCTCGCCCCACGAGAACGCGCTCTCGGACAAGTCGTGCATCCGTTGCGCGCCAGCCTTGTCGCCGGTCAGGGACGCGAACGCCGAAGCTGCCTTGAGCGTGAACCCGACCGTGTTGCCCAGCCCACCGACCAACAGCGACAAAGCATCCAGCGCGTCCGAGGACATCCGCAGGATGGTTTCGCCGAACAAGATGACGCCCTGCGTCGCGGTGCTGAAGAATCCGATGATCTCGGGTTGGTGCGCTTGTACCCAATCGGTAAGCGAGCCAAGGCCATCGTTGACGAACGTGAACAGACTGGACGCCAAAGGTTCCAGCGCGGTCGCGGTGTTGTTCTTGAAAATCTCCCACTTCTGTTCGAAGTCGTCGGTTTCGGCCGCGGTGTCGTTGATCGACGCCCCGGTCGATTCCAGCGCCGATTGCAGCGTCTGCAGGTCCAACGCCCCGGACGTGATCGCATCGAAGAACGACGCGCCGCCCTTGTTCCCGAAGAACTTGTTGGCCAGGTTCTGCGCGCCAGCCTTATCGCCGGCCTCGTTGAGGCGCCG
AATCTCTGCGATGGTCTGCTGCAGCGCCTCGGCGCCGCTCTTACCGCCCTTGGCCAGCGTCGCAAGGCTTTTGGTCAGACCGCCCTTGAGCATCGTGTCCGCGTCGAGCCCGGCCTCCTCCAGAGACGTGACCAGCGCGGCGCTTTGACCGAACGACAGCCCGAGCGCGCGCAGCGGCGGGCCCGCCTTCGTGACGGTAGCCAACAGCTCATTGACGGGGATACCGGTGCGCTGCCAGGCCCCGAACAATGAATCGAGGGTTGCGACCTGGTCTTTGCCTTCGACCCCGAAGGATCGGAACGCCCGGCCCAGCCCGCGCACGTCGACCGCCTCACCGGTGAGCCTGCCCAGATTGGCGACCGATTTCGACACCGCGTCGAGCGTGGGGCCGGTCAGGTGCAGGTCGCGGTTGACCTCGCCGACAACCTTGCCGAGTTCGGCGAACGGCAGCGGCACCGAACGTCCAAGGTTCTTCACCGACACCTCGAGCGCGTCGAGCGCACCGCCGCTGGCGCCGGTGGTGATCTGCAGCTGGTCGAAGGTTTCGTCGAACTGCGCGCCCAGGTCGTACAACTCGCGACCGAGTTTGATCGCGCCGGCGGCTGCCGCAGCCATCCCCGCGCCGACCGCTGTGCCCAAGGCCGCGGCAGCCAGTGAAGCCTTGCCCGCCAACCCTTCATAGCTAAGACCCAAGTTGTCGAGGGCGTCGGCTCCGCTGTTGTGCCGTTTCAGTGACCGCTCGTAGTCGTCTTGGGCGGCGGTGGCCTCTTTGACCGCGCGAATTTCGTCGCGGCGGGCCTTGTTTCGGGCCTCGGTGGCCGCGACGATCCGGTCGTTGGATGCGCCGATCTTCTGTAGCCGCTGGAGTTTGGCTTCGGCGGTGGCCAGCTTGCCGGTCGCGTCAGCGGCCTTGTCTTGAATTTTGATGACGTTGTCGTAGGCGCGGGTCAACTGCGCTTCGGCGGACACGACACCGTCTGCGAGGTTCTTTCCGAAGGTCTTACCGACCTTGGTTCCGAGTGCGCCCACGGAGCGGTTCAGCGCCGTCTGCGACTTCGATTCGATGTCGACAAACGACGGGATGACGGGGAGGGTGTAGTACCCGAAATCCATGCCTTGCGCCACGGTGTGGCTTCCGCCTTACCTCAGGTGCAGCGCGTGCCGCACAATCCGCCGTAGCCACCTCGGCCACGACTCGACATACAGAGGTGTTTCGATTTCGGTGATGTATTGGAATTGGTTGTCCCACAGACGGATGCGTGGGCGGCTTCTGATTCGGCGCCACGGATACCGCTTGGCGCCGGGCTTCACAGCGTGCACTCCTTCTTGACGACCTCGCGCACGTAGTCGACGAACTTGTCGAATTCGTCGCCGGTCGGGCATTTCTCGACTAGTTGCTGCCATTGGTCCTCGCCGCCGATCAGGGTCACCACGGCTTCGTAGTTGCGGCCGCGCGCGAAATCACGTAGGGCACTGACGGGCCAGCGGCCACGGCGCTTCGGGATGGTGAACTTCAGTCCTTCCCAGATCAGATCGACGGTCGCCTTGTCGGGGTCGGTCTGGTCGTCGGCGGTTTCTGCGGACTTGTTGCGGGTGTTGTTGGCCATGGTGTTCAGCGTTTTCCTAGTTCGTGTCGCCGGTTCGCTACTGCCTGTTCCATAGCCGAGACCGCTGGCGCTGCGGGCGACGATTTCGCGGCGTACGCGGCTTGGCGTTCCTTCAAGTTAGCGACATGATCAGCTTTCGCTTGCATAGCTTCGAGAGCTTTTGCTACCTCATGAGGCCGCAGCGGGCGTCCGGGATAGATTTCCCCTGTAAGTGCCTGATATACGGACGCAAGAATGAACATCTGCTCTGTCCATTGTTCTTTGCCACCGTTCTGGGCTCGCACGATCGACGACGCCGGATCGAGCCGGCGAATGTAGGTCCATATCCGGCGCAACGACAAAGTGCCCGTGAACCGTTCGGCATATTCCACGCCCCAGAAACGCTGCAGATCG
CTGGCTAGATCGTCCTCGTATCGGTCCAAGATGTTGACCAATGTCGGTATCCCGCCGAACCATTGATCCGGCGCGGCCGGTGTTTCCGGCAGCCGTGACACCCCCACGGCGTCGGCCATGGCGTCGGACAGCTCGCGGTAGTCATCGACTGTCGCGTCGTCGTAGAGTCCGCATTCCTGCCCGTTCAGCAGGTAATCGACAGCGTTGAAAGGGTGTTCGCGTACAAGATTGAGCGGCCACACCTCCAGGTTCAGGGGAACCCGGATGGTGTGGCCGCGGAACAGTGCTTCGGCTTCGGTGGCGCCCAGCGCTTCTAGCCGTGCAGCCTCAGACGTTTTCGCCGCCACCTTCACCGTCGACCTCACCGTCGCCGGTATCGGCTTCGGTGTTCGGCATATCCGCTTCGGCCACGTTCACCGACTGCGACGTGCCTCGGCTGCGACGCCCCCGCTTAACCTTCGGCTGTGCGGCGTCGGTGTCTTCGCCGCCGTCCTCTGTCGGCTCGTAGGGCTTGGCCTCTTCCTTCCCGATCAGCGCCGCGGCCGACACGTCGTCGACCGTGATGATCGAGCCCTTAAGGAATTCGGGCTTGTCGACAAGCAACTCGATTGTCTTCACTTCGGGTGCCCCTTTCTATGCGGCGGTCTGCTGCGCGGCGAACAGCTTGCGGGCAGAGTCCGGGAAGATCCGGCACTCGATTTCGCGGGGAGTCGCATCGCCTTCAGCGTCCTTGATGTTCGGCGTCCAGAACCGCGCCGGCCGCTTCGAAATCTCGCGCCGGATTTCGCCGCTGGCGGTGCGCTTCTCGAACGCCACGTACTCATACAGCGGATTCGGCACAACGATTTCGGTTTCGGTCGAGCCGTTCCACAGGATCCGCTGCATCGCCGGGTTGTCCTCCAGCGCGGACACCTTGCGGGTGAGCTTGAAGTCCTTCGAGGCGACGATGATCGTGCCGTAACCCCATGCCGGGATGTCCTTCTCGTCCCATTCGCGCTGAGTATCGATACCCGCATCGCCCACCAGCAGGCCGAGGAACGCCCACTTGCCGGTGGTCGTCACGAACGGATCGGTGATGGCTGCGGGTAGATCGGCTGTAGTCGGTGCCGCCGAACCCATCCACAGCAGTACGTCGGCCTCGGTATAGAGCTTCACATTGTCGGGATTGCCAGCCATTGGTGCTGTCTCCTTGTCTCTTAGACGGTTTCGATGGTGCGCACCGCCGCTGTGACGGTGAACGACGCCATGTCGGCGCCGGTGTCGGTGTCCCGTGCGGTGACGAACACGCTTCCGCCGGTGCGGAAAATGTGCGCGACCCCTGGCGGGCGGTTGTCGTGCAGATAGCCGTGCACGCGGCGCGCGACCCTGTCGGCCAGATCAGCGCCACGGGCACGCACAGTGATCCGGATCGTCGGGTCGCGTTTGATGGGCCACTGTTCGGGGCCACCGTCGTCATGCACGGTGATCAGCGGCGGGCCGATGCGCAGGCTCCAGTCTTGGGGCACTTCCTCAACCGATATCCGGCACACGCCGCCGAACAAGCCGACGTTCGGCGGCAGCGCCACGAAAGCCTCCAGCGCGTCGGCGAACGCGTTGCGGACGTCTGCGTGCTCTCTCATCGCAGCCTCAACCCCAAGCGCCCGAACGCTTTCGACACCGCACCGTGCTTGGCCTGGTCTTCGTTCTTGCCGTAGATACCGCGGACTTGCCGGTCGGTGACGTACTCATCGACCGTGGCGCCGGCGTTCTCCGCTGCCGGGTTCGCGACCGCGTCCAGCGCCGCGCCGAGGCCCTTGTCGTACTTCGCGATATGGGCGACCGTTTTCTTGTTCAACCGGAACTTGCCTTGCGGTGCCATCAGTTGATGCCGCCCTTCGCCGATTGCGCCAGCACCACCAGCTGATTGCGGTCGGCCCACTGGGACAGCTGCACACCGACCAGCGCGCGGCACTCGCGGCCGCGAATGATCAACCAGTCGTTGTCTCGTATCTGCACATCGA
GGTCCAGGACGACGGTGAATGCGGCGGCGGTGATTTCGCCGGTGTCCCCGAACCGCTGCCGCTGATTACCCGCCAGGACAGCTCGGGCGATGACCGTGACCGGATCGCCGTCGGGTTCAACGGTGCCGCGGTCGCCGCGTCGGCCGGCAGCGATGATGGTGACCTGTTCGCCGAGGCTGACGTCTTCACAGAACCGGGCCTCTGCCCGCACGTGACTAGGGTTACCGTCCAGGGCGTCGCGGCGCACACCCGGCCCCTGCATTTCGAACAGCCGGTCACCGAACTCGACGGCGTCACGAGCTGTGAGCGTTTCGGTGTCTGCGTCCACGATCAGGTGGCCGACCGCGTTGATGACCGCGACCTGTGTGCCGGCGATGTCTTCGAACGCCGGGTGCTCTGCCCAGCTGCACCGGCCCTTGGGGATACGGCGCTCGGTGTAGGTGGGCCGGCCCCACGCGTCAACGACCGGTGCGCCAGCGTCGGTTACCGGGTCGCGCTTGACCAGCGTGACGGTGTCGGGTCCAGGATCAAACATGCTCACCATGGCTCGGCCTGCGCGTATCCGGTGAATGTGGCTTGCGGCGCCGCGGTCAGTGATAGGCCAAGCATCTGTAGGTGGCGTTCGGTGAAGTCCAGCATTTCGGCGGCTTGCGCCAATTTGACCGTGAGAGTGCGGTCATCGGTGGTGCGGGTCAGTTCCGTCACCCGAGAATCGGTGACGCCTTCGGGGCCGAACATGGCCTTGACGACGTCGTAGGTGACGAGCTTGCCGCGCTCGTCGGACGCCGGCAGGTCGGGCAGCCGCGACGGATCACGGATCCATGCGGCGGCTGCCCGAACCAATATCTCGGCGAGTGCCTGTTCCGTCGCCGAAAGAGGGCGGAACATGCCCTCGAATTCGGTGACGGTCAGGAACGGCGTTACCTCTGCCACGGCCAGTTACCTTGTCGCGGCCTTGATCTGAGTCTTGCTCATCTTCGCGGCCTGATCGGCGGGTATACCGCGTGCGATGGCGTACTGCCGCCACACCTCAGTGGAGTGCGCCGACTTGGGCCGCTCGATGCCGTCGCCCGGTGCCAACACCACCGAATCCCAACCGCCCTCGCCGCCACCGCCGCCGCCGGCTGTTCCGTCACCGCCGTAATCGGTGGCAAGCGCTTCGGCTGCCGCTTGGGCTGCGGCCTCGGCTTCGGCCTGTGCCCGTTCGTACGACGCGCGGTCGACGATGGCGCCCGCCGCGGTGAGGCGGCGGACGTCGTCGTCATCGAGGTCAGCGAGGACGGCGCCGCGCTTAAGCTCGCGCCACACTCCCTTGTCGTCCAGGCGGCGCAAGAAATCCGCTGTCAGAACGTATTCGGCTGTCACGGGGTCACCAACCCGGTCAGCCAGATACCCGCCTTCGGCTGATCCAACGCGTAAGCGGTCTTACGGGTCGCATCGCAACGGAACGACTCGGTCGGTCCACCGTTGGGGCCGTTACCCTCCGGGTACAGGCCGGTGACTTGGAACGGGCGGGTATCCGAGTAAAACCCTGTGACGCCCTTCTGCCCGATCCAGATCCGGTCAGTGGGATATCCGCGTGCGCCCAGGATGTCCAGGTCGTACACCTTGCCGGGCAGCTTGCCGGTGTAGGCGATGTGCTCATTGGCGACGTTGCCCTGGTAGACCTTGTTGAACTTGTCGTTGTCCAACAGGATCGGCAGCAGGCCCGGATTCATCACCATTGTGTCGGGCTGGAATCCGTACCATTCCTCGGCGCTTCCGCCTTCGGCAATGGAGGGAGCCGCGTTGATGACCTTTTCGATCGCACGCGCGATGTCGCCACGCGGATTGCCGTTGGCGGTATCCCACGCCGCAGAAACCGCCATAGTCGGTACCGCGCTGGACTGAGTCAACGCACGGAACACACGATCATCGGAACGCTTGAACGTGTTGACCAACTGCGTGATCTGCAGGTTGACGCTGTCGATGTCATCCTCGTCCCGCATCTCCTTCGAGAC
GCGGACGCCCAGGCCCTTCTTGTTCGCCACGGCGAACAGTGCGGTGCCCTTGCGACCGGCCGCGACCGGGATGTCGCCGAACTCGGCGACGTCCTCAGGCTCACCGTCCAGGAAGATCGGGTCGCCCTGCCGGTACGCAACCAGGCCGTTCTTATTGCCGCCGCCGTTACGGAACAACGTTTGCGTGATGAAAACGTTGGTCAGCAGCTCCTTGATCTTCGTCGGAATGAACAGCGGATTTCCGACCATTTCCGACACTGTGAGCCGGGGCCCGTCGCTGATGCTGACGATAGGGGTTGTAGGCATTGTTTCTGTCTCCCTTGCTATTTCAGTCCGGCTCAGACGGTCCGGATGAGGCCGACGGCCTTGGTTGCTACGACGACGCCACCCGGTTCGGTGCAGCGCCCGACGATGGTTCGGGCGTCCGGTGTCGCACCGGCGGGCGTTACGGTGCCGTTCGCAGCTGCGATCAGCAGCTCTCCGAACGCCGCGTCCGCCGCATAGGTGACAGGCACTTCGGCCCCGCCATATGCGCACGCCACCTTGGTCGGCAGCACCGCCGTGTTCAGCACAGGGCGCCCGTCGCTGCCAGTGGTCGGCGCCAGCACCAGGTCTTCGGGCGCGATCGCGTCCGTCAATGCGACGCCGACAACCTTGAACGAACCCGCTGCGGCGGGCTGAATCCGGCCACCGGTGACACCTTCTACGAGCTGTCCGCCCTTGATCGAAACGCCAGCCTTTGGGGTGTATGTCCGCGGTCCGGTCTTGGTGACCTGAGGAATTCCGGGCATGTCAGAATCTCCAATTCTTGAATCGAGGGTCGTTGCGGATTTCGTCTTCGGCGCTGGCCGATGCCTGCGCTTCGGTACCGTGACCGACTTCGGTCAGCGGTACTGCGGTCTCAGCCGGAATCGAATCCAGCAGGGCTGTAGTGCCTTCCGGGTCCGCCTGCATCAGCGCCAGGAAATGGTCGCGGCGAGGCGCCGTGATCTTGCCCTTGGCGACCGCGGCGTCGACGACCTTGGCGTGCGCCTCCGCTACCTGCGTTTGGCATGCCAGTGCGCCTGCGGCAGCATCGGATTCGAGCTTGGCTACCCGGGCCGGATCCATCAGCACCAAACCGGCCTTGGCTGCCGCGGCAACCAGTTCCTTACCGTCGACGGCGGTTCCGGCCCCGGCCTCGGCGGAGTCGTCCTCGGCGGTGGTGGTGGTGGTGTCTTCGTCGGTGTCTTCCTGTTCGGTACTGGCCAGCTTGTCGAGTGCGGCTAGCACCTCTTCGTCGGTGGCGTCCGGTCCTAGACCGAGCCGCTTGGCGACGTCCTCCTTGATGTCAGGCACGGGGCCCTCCTTCGAGATATTCGCCGAAGCACACGCCCCAGCGTTGTCGGCCCGCGCCACCGGCGCGGGTGCGGCCGGACGGCCGAAGTACTTGAACTTGTAAGACGAGCAGGCCATCGCCGACGCGACTGCCTTCTCGGTATCGGCGCGCGCACCCGAATCGTCGATACGGCTGGCAAGTCCAGCGGCGACCATTTCTTCGGCGGTGTACCATGTTTCGTCCGCCATCGCCTGCGCCCAGTCCTCGACGGTCCCGCCTGCGCGGTCGGCGTAGATGTTGGCGTAACTGGCCGATAGCTTCTCCAGGTGATCGGCGGCGCTGCGCATATCCGCGGCGGTGCCCGACCACAGCATGCGTGCGTCATGAATCATGGCCTGCCCATACTTGGACACCACCACCTCATCGGCGGCAACCGCGATCACACTGGCAGCCGACGCAGCAAGCCCATCGACACGGGTGACGGTCTTGCCGGGGTGACGCATGATGGCGTTCGCGATAGCGATGCCGTCGAACGCGTTGCCTCCCGGTGAGTTGATCCGCACGGTCAGTTCCGTTTGAGAGTCCAGTGCGGCAATGTCAAGCACCAGCTGTTCGGCGTTGACACCGAACCACGAGTCGATTTCGTCGTAGATGTGCAGCGTGGCCGTCGGTTTATCCTCAG
CCGATGCGGCTTTCGCGACGGTGAACTTGTACCACTCACGGTATTCGCGGGCCATCAGAACAACCTCCCTTGCCCGGACTTGCTGAATGGATGTAGGAATTCGCGCATCGCCTCAACGTCGGCCACTGCCTGCGCGACAGTGCTTGGATCCACGGCGTCATCCGGGACCGCTGGCGCGTCAGGTTCTTTCGGGGGCGCGGCGTCCGGGTCGTTCACGTCGGGTTTGGCCGGCAGGTCCAGCGATTGGCGCAAGGCGCGTTCGATCCGCAGATCCGGCGCCAATAGCCCAGCCTCGACGAACATCTTCAGCGCCGCCGCGGTCGCGTCCTGCTGCGATCCGATCTTGTCGAACACCAGCCGCGGGGTGCGGGCCTCGACGCCGAAGTTGATGTCGACCAGATCCTCGATGATGTGCGCCTGACCGATGTCGCGATACGACTTAGCGGCGGCATTCTCGGCCTGCACGAACGGTCGTTCCTGCACCGCGGCCAGCGCGTAGCTTCCGCCGGTGTCCAGGTTCATGTATTGCGCCAGCCCGGCCAGCGCGATTGCCTTGTCGTGGTACACGATTGCCGCCCGGATGTCGGGCAGGTTGCCCTGTACACCCAGCAGCGCCAGCGATTGTCCCTGGGCGAGTCCGACACCGGATCCCATGCCGCCCTGAAACTCCGAGGCAACCTTCTGCATCGCCTTGACCTCGGCGTCGTCGTTCGGCTTGGATGCGGTGCCGACCGGCACACCCATACCGTTGCGGCGCGCGGCCACAACCTCGATGCGCAGCAGCTCGTTTTTCAGCAGCCAATGCTTGTAGCTCGATCGCAGGATCGAACGGCCCTGCCAGTAGCCCGGTCGCTTGTTGCGCGTGTACACCACAAGCCGATTGATCGGAATATCCAGCGGCGTCGGCCCGTACATGGTGCGCCCCGTCGACGCCGGCGCCAGCTGCGTGATCGAATCCAGCCCGCCATCCATCGCGACGTTGAACTTCTGAATCGTCCACTGCGGGCGCGGCCCCAGCTTGCGCAGCACGAACCGCCCGTCCGCCTCGCGCCGATACACCTGCTCGAATACGGCGTGCCCGAACTGCGCTGTCGGCGAGGCGACCTCGCGCAGATGGTCAATCCACGAAAACCGGCCCCGCGAGCGGCCGGGGTCGTCGACCTCATCGAACCCGACAACCGGAAGATTCATGTTCCGCGAAATGAACTGCACAACCTCGGCGTCGGCGCCGTTCGGGTCGATACGCCAACCGGTTTCGACGATCGGCAAGCTGATCGCCTCCAGCAGCGAGGACACACGCGAGTCGTTGTTGTCCATCTCCAGGAACACCGCCACCGACGCCGGGTGCTGCAGATCCGGAACCTTTTCGTACGGATCCCAATTGACCCAGCCGTCGACGAACGGGGTCACGTAGCCCGATTCGCCGACCGGCATAGCCGTCTTGACCCGCTTGGTCACCGGCCTCCCTCAAATCCGAACATGCTGCGACACAACGCCGCACCTGCGGATTTAGAATGCGGCTCCCAGGACGTCCAGGTGGCTACTTGTTGTTTCGGGATTTCCGGAGCCCATCGAGGGCAGCGCGGCCGGCGAGTCTTCCTCAGCGAATTCCAGAACACCCCAGTGCGCCATCGTTGCGGCGATGACCTGCGCAATTGAGCCTTCGCGGTCGTCCCAAACCTTGTCTCCCCGAGGCAGTTCGCGGGTCATAGCGACCTCCAACCCTTCGGTCAGAATCGGCTGGTTGGTGTGTCCCAGGTCACCGGACATTGCGGCGTCTACGAACCCCTGGAACGCCACCGCGATCTGTCCCGTCGTCGTCAACATGACATCGACATCCAGCTTTTTCAGATACGGCGCCAGCGGTTTCGCCGGGTCATGATCGTCGATCACAATCGTGGCCGGATCCCACAACTCCACCAGCCGCACCACATACGCGGCGACCTGCCCGATCGTGGCTTTCTGGTAGTAGCCGATTTCGATCTGCACCCGGCCTTCGA
TCGTGCGCTGCCCCACCGCGATAGCCCACCGCGCCAGATCACGTGTACGGGACACCGCCAAAACCTTCTGCCCGACAAGTTCGGGCGCATAGTCAGTCAACGGCTCCCACACCTCCTTGATTGGGATGACCGGGTCAATGAAGCGCGCGTCCGCCGGCCACTCGCCCCACCCGAGAAAGTCGGCTTCCCACAGCGCCACCTTCGATGCCTCGCGACTCTCGACCGCTGTCTTGTAAAACCGCTCGATATCGCGCTCTTTCGAAATCACCCCATATGACGGCTCGGCCAGACGCCAGTTCTCGCGGTCCTCACGCGCCACCTTGCGCTCGACTGAATCCTTAGGTGGATCCGGTGCCGCGAACTCGATGAACAACAGATCTGTGTCCTTGTTCAGCCCGCGGCGCCGCATACCCGCGAACACGTGGCAATTCGGATGCTCCGATTCCACGGGCGCCGTGGACGTGTAGATCGTTTGGGAGTTCGCCGACGCAACCTGTGCACCCTGCAACGCCGAAACTTCGTCCTGCGTCAGGTTGTATGCCTCATCGAAAATCGCCAGGTCAATCCGGTCAAGACCACGGCCCTTGTCGCCAGATCGAACACCGAACTGCACCGTGACTTCAGTGCCGAGCTTCGATCGGACGACGATTTCACCGAAGCCCTGTTTTCCGCCGGTCATTGAGACCACACGGTTTTTCAGCGACGGCCGCGAATTGATGATCGCCTTGACACGTTTGAACACTGCGTCGGAGGTATTGCCGCGCTGCGCGGTGTAGGCGATGTTCTCGCCGAGGATGAACAGCCCGAACAGGATGCGCAGCACCAGAATCAGGGTCTTGCCCTGCTGGCGGGTGCAGACCAGCACCACGTCCGGATGCGTCCATAGATAGTCCGGGCGGCGACTCAAGATCTTGCGCATCGTGCGCCATTGCCACGGCAGCGACCGCTGTTTGGTGATGCGGTGCCCGAACCGCGCGCAACGGTCACCGTCGGTTTCATCACCGGTGAACGAATGCTCGAGCTTGGGTTCCTGCCGCCCGGTCAGTCGCGGCCACTCACCGACCCATGGCGGCGTGTCCTGTTTGTTCGGCGGGGCACCGGCTTTCGGTCGCCGCCCGGTCGCGCGTTTGGCCGGCGACCTAGATGTCGTCGAGGTCGTCGTCTTCCGCGCCGCCGCCACCATTCGGTCCCATCGATCCCGCGCGCTGTTTGTGAATCTCAGCGAGCAAATGCCGTAACAACGTGGTCTGTTGTCTCTCTTCGGCGATCGTGGAATCGATCTTCGCTTCGACAATCACGGTCACCGGGTTCTTGCCATTGTCGTCAGCAGCCGCGACGATGCCGCGCAGATCCAAACGCACCCACGACGATTCGACACCCGAATTGATACCGGCCAACATCTCCAGCCGATCAGCCACGCGCGCGGCCTGCCGTATCAGCGCGGTCAAACCCGGCCCATCCTCCTTACCTTCCAGCTGCGCCCGCAGCCGCTCCCCCGCACCCGGTTCGGTACTTACCGTGACCCGCCCGGTGCCCGTGGCCGTCCGTTTGGTCCCCGCCGCTGTCCGCTTCGTAGCCTTCAC
Protein sequences of DBSCAN-SWA_2 >NZ_CP013049|3332189:3349694|3346165_3347569_-|WP_057138001.1|DBSCAN-SWA MTKRVKTAMPVGESGYVTPFVDGWVNWDPYEKVPDLQHPASVAVFLEMDNNDSRVSSLLEAISLPIVETGWRIDPNGADAEVVQFISRNMNLPVVGFDEVDDPGRSRGRFSWIDHLREVASPTAQFGHAVFEQVYRREADGRFVLRKLGPRPQWTIQKFNVAMDGGLDSITQLAPASTGRTMYGPTPLDIPINRLVVYTRNKRPGYWQGRSILRSSYKHWLLKNELLRIEVVAARRNGMGVPVGTASKPNDDAEVKAMQKVASEFQGGMGSGVGLAQGQSLALLGVQGNLPDIRAAIVYHDKAIALAGLAQYMNLDTGGSYALAAVQERPFVQAENAAAKSYRDIGQAHIIEDLVDINFGVEARTPRLVFDKIGSQQDATAAALKMFVEAGLLAPDLRIERALRQSLDLPAKPDVNDPDAAPPKEPDAPAVPDDAVDPSTVAQAVADVEAMREFLHPFSKSGQGRLF >NZ_CP013049|3332189:3349694|3344903_3346166_-|WP_057138000.1|DBSCAN-SWA MAREYREWYKFTVAKAASAEDKPTATLHIYDEIDSWFGVNAEQLVLDIAALDSQTELTVRINSPGGNAFDGIAIANAIMRHPGKTVTRVDGLAASAASVIAVAADEVVVSKYGQAMIHDARMLWSGTAADMRSAADHLEKLSASYANIYADRAGGTVEDWAQAMADETWYTAEEMVAAGLASRIDDSGARADTEKAVASAMACSSYKFKYFGRPAAPAPVARADNAGACASANISKEGPVPDIKEDVAKRLGLGPDATDEEVLAALDKLASTEQEDTDEDTTTTTAEDDSAEAGAGTAVDGKELVAAAAKAGLVLMDPARVAKLESDAAAGALACQTQVAEAHAKVVDAAVAKGKITAPRRDHFLALMQADPEGTTALLDSIPAETAVPLTEVGHGTEAQASASAEDEIRNDPRFKNWRF >NZ_CP013049|3332189:3349694|3332189_3332744_-|WP_057137988.1|DBSCAN-SWA MSSRNKKQGKVFPKFPYDRKFTKAELDEIFARQDRLCEGFRDAVGPNGWGLGLPEDHLQLLMFHGALVGADVDDEKAFIRARRLPDQSGRLVDAVEWVVKKEDTPRARSRDARREARARAKEIDELDPDVRDALIDMVKRKGQRVYDRGRAEVPAEDRDEYTADLADTDDEQPEDLDDTKEDMP >NZ_CP013049|3332189:3349694|3343472_3344417_-|WP_016895910.1|DBSCAN-SWA MPTTPIVSISDGPRLTVSEMVGNPLFIPTKIKELLTNVFITQTLFRNGGGNKNGLVAYRQGDPIFLDGEPEDVAEFGDIPVAAGRKGTALFAVANKKGLGVRVSKEMRDEDDIDSVNLQITQLVNTFKRSDDRVFRALTQSSAVPTMAVSAAWDTANGNPRGDIARAIEKVINAAPSIAEGGSAEEWYGFQPDTMVMNPGLLPILLDNDKFNKVYQGNVANEHIAYTGKLPGKVYDLDILGARGYPTDRIWIGQKGVTGFYSDTRPFQVTGLYPEGNGPNGGPTESFRCDATRKTAYALDQPKAGIWLTGLVTP >NZ_CP013049|3332189:3349694|3347620_3349294_-|WP_057138002.1|DBSCAN-SWA 
MVAAARKTTTSTTSRSPAKRATGRRPKAGAPPNKQDTPPWVGEWPRLTGRQEPKLEHSFTGDETDGDRCARFGHRITKQRSLPWQWRTMRKILSRRPDYLWTHPDVVLVCTRQQGKTLILVLRILFGLFILGENIAYTAQRGNTSDAVFKRVKAIINSRPSLKNRVVSMTGGKQGFGEIVVRSKLGTEVTVQFGVRSGDKGRGLDRIDLAIFDEAYNLTQDEVSALQGAQVASANSQTIYTSTAPVESEHPNCHVFAGMRRRGLNKDTDLLFIEFAAPDPPKDSVERKVAREDRENWRLAEPSYGVISKERDIERFYKTAVESREASKVALWEADFLGWGEWPADARFIDPVIPIKEVWEPLTDYAPELVGQKVLAVSRTRDLARWAIAVGQRTIEGRVQIEIGYYQKATIGQVAAYVVRLVELWDPATIVIDDHDPAKPLAPYLKKLDVDVMLTTTGQIAVAFQGFVDAAMSGDLGHTNQPILTEGLEVAMTRELPRGDKVWDDREGSIAQVIAATMAHWGVLEFAEEDSPAALPSMGSGNPETTSSHLDVLGAAF >NZ_CP013049|3332189:3349694|3340438_3340729_-|WP_057137995.1|DBSCAN-SWA MKTIELLVDKPEFLKGSIITVDDVSAAALIGKEEAKPYEPTEDGGEDTDAAQPKVKRGRRSRGTSQSVNVAEADMPNTEADTGDGEVDGEGGGENV >NZ_CP013049|3332189:3349694|3334458_3335565_-|WP_057137990.1|DBSCAN-SWA MTSAQLIGPRRNVPWSKLSAAARGEAVSCAWIGSDGRYWPLTGQLAGSEGAFITGPIDGMVHVPFEGVWTTPAYGPPRFERTVDGRREISFTLGLMSDSSLGWYDTEARFWRGCRKDATGYFTVTTRRHGQLWIPMQLLEAPKCALPDDPALQRVALHEIILAADGEPRWHRPDTAPPPFIRPPGGPSLGFIRIANRSTEPAWPIFFVQASKTAPSKVRLGDGPNAIVSSEENPFDDWPKLSRLFGIPFVDEILGTFTRARDANMIDVPELNPGEHCVIDTDPTHRIAITAQDPPDNLVKKFIRNSELLNWILSNYGDSGLPLLQRFRGQGFSVPIPPKTVATIPVSHNQAGGKIWCQLPQRFESALA >NZ_CP013049|3332189:3349694|3344449_3344902_-|WP_016892119.1|DBSCAN-SWA MPGIPQVTKTGPRTYTPKAGVSIKGGQLVEGVTGGRIQPAAAGSFKVVGVALTDAIAPEDLVLAPTTGSDGRPVLNTAVLPTKVACAYGGAEVPVTYAADAAFGELLIAAANGTVTPAGATPDARTIVGRCTEPGGVVVATKAVGLIRTV >NZ_CP013049|3332189:3349694|3341726_3341969_-|WP_057137997.1|DBSCAN-SWA MAPQGKFRLNKKTVAHIAKYDKGLGAALDAVANPAAENAGATVDEYVTDRQVRGIYGKNEDQAKHGAVSKAFGRLGLRLR >NZ_CP013049|3332189:3349694|3341307_3341730_-|WP_057137996.1|DBSCAN-SWA MREHADVRNAFADALEAFVALPPNVGLFGGVCRISVEEVPQDWSLRIGPPLITVHDDGGPEQWPIKRDPTIRITVRARGADLADRVARRVHGYLHDNRPPGVAHIFRTGGSVFVTARDTDTGADMASFTVTAAVRTIETV >NZ_CP013049|3332189:3349694|3342648_3343044_-|WP_016895912.1|DBSCAN-SWA MAEVTPFLTVTEFEGMFRPLSATEQALAEILVRAAAAWIRDPSRLPDLPASDERGKLVTYDVVKAMFGPEGVTDSRVTELTRTTDDRTLTVKLAQAAEMLDFTERHLQMLGLSLTAAPQATFTGYAQAEPW 
>NZ_CP013049|3332189:3349694|3343050_3343476_-|WP_057137999.1|DBSCAN-SWA MTAEYVLTADFLRRLDDKGVWRELKRGAVLADLDDDDVRRLTAAGAIVDRASYERAQAEAEAAAQAAAEALATDYGGDGTAGGGGGGEGGWDSVVLAPGDGIERPKSAHSTEVWRQYAIARGIPADQAAKMSKTQIKAATR >NZ_CP013049|3332189:3349694|3335564_3339224_-|WP_057137991.1|DBSCAN-SWA MDFGYYTLPVIPSFVDIESKSQTALNRSVGALGTKVGKTFGKNLADGVVSAEAQLTRAYDNVIKIQDKAADATGKLATAEAKLQRLQKIGASNDRIVAATEARNKARRDEIRAVKEATAAQDDYERSLKRHNSGADALDNLGLSYEGLAGKASLAAAALGTAVGAGMAAAAAGAIKLGRELYDLGAQFDETFDQLQITTGASGGALDALEVSVKNLGRSVPLPFAELGKVVGEVNRDLHLTGPTLDAVSKSVANLGRLTGEAVDVRGLGRAFRSFGVEGKDQVATLDSLFGAWQRTGIPVNELLATVTKAGPPLRALGLSFGQSAALVTSLEEAGLDADTMLKGGLTKSLATLAKGGKSGAEALQQTIAEIRRLNEAGDKAGAQNLANKFFGNKGGASFFDAITSGALDLQTLQSALESTGASINDTAAETDDFEQKWEIFKNNTATALEPLASSLFTFVNDGLGSLTDWVQAHQPEIIGFFSTATQGVILFGETILRMSSDALDALSLLVGGLGNTVGFTLKAASAFASLTGDKAGAQRMHDLSESAFSWGENMRGLADKLDGAADGLFNLRHRVQATGEDMATAARLTSALGDVTAAMPDGKTIKISANTPEVQAKLAALGIQVQELPDKTVTVTANTAEGQKILDAWRKSVGNQTAEVPVGADTSKAKQDIEAMFRGYAAQAPQLPVAVGPASAPAPGSVASFIPARATGGIYDVWDSVASFANGGTLPSRAVIQPAVPGAGLVQWAEPSTGGEAFIPLDGGKRSLDIWLETGRRLGAIQGFEVGGLRGPDVMAAQSFVGTPYSQANRNDCSGMVSRVINRAMGLPDGALMNTKNAEQWLTARGFRRGIGGPGTITVGWYDHGPNPNDGHMAMTLSDGSNAESGGSHGNFLVGAGAAGATSPQFDQHMYLPQLYGEGPGGMGGGGMLAGGGGGFGGGAGAFGGGYQAGPPGSTPGYGPGGEPGYYEADPRKVRESEERVQDSDQRVKEAEARLRELKADAKDSEKLAAQGALDKAKREAGDARADLEDARRGKFTAAKEAKNAVGGKGGGDDVGELGKIFGGGLMETFGLDGSFLPNIEDLGIVKLAKAIMGIKYTPQGTGFAGGLLGGAGGAGGMGGAGGGGGFPGASFDGGGATSSLPFGMVPEVSSMLPSLTGAAAHPGSGMPPGVGNGPVDQSVNLTINNPQGDERSIADRTRRVLLNTPRQMTHEPVGGGR >NZ_CP013049|3332189:3349694|3332743_3334462_-|WP_057137989.1|DBSCAN-SWA 
MTAPTLTLDPKTLAGEFTPELRMHLLERRWAYMNRRTKAPLVRIWDKEFRFIARVENLDKWDWEELATEDGEANITFSGKANDWLREIITYQIGDDEDIHITIDPDPDKPHDFRTRWGGKVQVIEDDEEAGKAAVTTLKCISNRRHLKGIYLAANPIFPMEVQLPKMFLWGGPTITTCATAMFFNCIRLFTLNGFFPVPRNIFAPESWLQNLSPLNWPVQIMPVAGLFDQSRWCTIGSRWKDAHTVLSPVMKDAGVICRAYTWLPGDPAPYTMFGPELAEILKPTRACVILSFEDKSGVTGPTGTMLDGAINLFAATLDDLITETLIPIDADHDGEVDPFFRKLLLVSPKPPPFVYRDVGYGNIRRRKLRIYKSRATDIIVGGKSPQWVNQAITFAIRYGISQLAQVIMGVEAAGVEGLDNLYQGQLDDVFLAFMRYVNPIRSAKTGSYAFREYFKNPGGTAYVINAIQELAAGDYEMKPYRSMKFDVGDGQPYILGEDFWLGDRVSAEIRGVVYTDQIMAIKGEGDRTTAGRPTVSFGDDSRDEDPVARGFRTIGNVANFAALLAGSGDMF >NZ_CP013049|3332189:3349694|3341968_3342655_-|WP_057137998.1|DBSCAN-SWA MVSMFDPGPDTVTLVKRDPVTDAGAPVVDAWGRPTYTERRIPKGRCSWAEHPAFEDIAGTQVAVINAVGHLIVDADTETLTARDAVEFGDRLFEMQGPGVRRDALDGNPSHVRAEARFCEDVSLGEQVTIIAAGRRGDRGTVEPDGDPVTVIARAVLAGNQRQRFGDTGEITAAAFTVVLDLDVQIRDNDWLIIRGRECRALVGVQLSQWADRNQLVVLAQSAKGGIN >NZ_CP013049|3332189:3349694|3349250_3349694_-|WP_052572199.1|DBSCAN-SWA MKATKRTAAGTKRTATGTGRVTVSTEPGAGERLRAQLEGKEDGPGLTALIRQAARVADRLEMLAGINSGVESSWVRLDLRGIVAAADDNGKNPVTVIVEAKIDSTIAEERQQTTLLRHLLAEIHKQRAGSMGPNGGGGAEDDDLDDI >NZ_CP013049|3332189:3349694|3339418_3339718_-|WP_057137993.1|DBSCAN-SWA MANNTRNKSAETADDQTDPDKATVDLIWEGLKFTIPKRRGRWPVSALRDFARGRNYEAVVTLIGGEDQWQQLVEKCPTGDEFDKFVDYVREVVKKECTL >NZ_CP013049|3332189:3349694|3339723_3340464_-|WP_057137994.1|DBSCAN-SWA MKVAAKTSEAARLEALGATEAEALFRGHTIRVPLNLEVWPLNLVREHPFNAVDYLLNGQECGLYDDATVDDYRELSDAMADAVGVSRLPETPAAPDQWFGGIPTLVNILDRYEDDLASDLQRFWGVEYAERFTGTLSLRRIWTYIRRLDPASSIVRAQNGGKEQWTEQMFILASVYQALTGEIYPGRPLRPHEVAKALEAMQAKADHVANLKERQAAYAAKSSPAAPAVSAMEQAVANRRHELGKR >NZ_CP013049|3332189:3349694|3340744_3341287_-|WP_016895916.1|DBSCAN-SWA MAGNPDNVKLYTEADVLLWMGSAAPTTADLPAAITDPFVTTTGKWAFLGLLVGDAGIDTQREWDEKDIPAWGYGTIIVASKDFKLTRKVSALEDNPAMQRILWNGSTETEIVVPNPLYEYVAFEKRTASGEIRREISKRPARFWTPNIKDAEGDATPREIECRIFPDSARKLFAAQQTAA |
19 | Mycobacterium_phage(81.25%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
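The keyword rule described above can be sketched as follows. This is a minimal illustration, not the database's actual implementation: the page does not say how matching is done, so the case-insensitive substring match and the names `PHAGE_KEYWORDS` and `phage_keywords_in` are assumptions made for this example.

```python
from typing import List

# Keyword list copied from the page's legend, lower-cased for matching.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keywords_in(annotation: str) -> List[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a keyword counts as present when it occurs anywhere in the
    lower-cased annotation text (plain substring match).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_phage_related(annotation: str) -> bool:
    """True when at least one keyword matches, i.e. the protein would be colored."""
    return bool(phage_keywords_in(annotation))
```

Substring matching is the simplest choice but can over-match (for example, 'head' inside an unrelated word); a word-boundary regular expression would be stricter if that matters.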
DBSCAN-SWA_3 |
3354262 : 3367380
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP013049|3354262:3367380|DBSCAN-SWA CCTATTCGATGGGGGCGCCGGCGTCGCGGAGGATGTCGGCGGCGGCTTCGACGGCGTCCCAGGCTGTTCGGGCGCCGCCGTGCCGGTCGGGGTGGGTGTTGGCGCGCGCTTTCCGATATGTGGTCCGGGCGGTCTCGGGGTCGTGCAGGATGCGATGTGCCCAATCGGAGACGCTGTCGTCGTTGCCTTGCGCGGCTTTCGCGAGATAGACGGCGGCACCTGCCGGGGTCTGCGCGATCGGCGTGGCCTTGGCTTCAATGGCCTGCCATCCGCGGTACTGCTGGCCGGTTTGGGTGATTCCGTAGCGTTCGACCTTGCGTAGGGCCTCGAGGCCGAGCGCGATGGCACGCAGGTTGTCTTGCCAGCGGGTGAACGTGTCACACGGGTACGACAGCGGTCCGTGGCGGGATTCGATGTTCAGGATGACGCCGGGGTGCTGTGCTGTGGCGTTGGCGCGCGGCATGCCATCAGTGATTCGGAAGTCCTGCTCGCGCATCGCGATTTGCAGTACCGCGGCGGCATACTGCTGGTCTTTCCCGAGGTACCAGAGTTCGCGGTCGAGCCGGGTGAGGGTGTCGCTCCACTGCGCCGAGAAATTCGAGCGGCGGCGGTCGCGGGTGAGGCTGTGGGGCCATGTTTCGATGGGTCGAAGTGTCATGTTGGGTGGGTAGTCGGGCATTTCGGTGAGTCCTTCGGTTGGTCGAGTTGCGGTTGTGGTGGTTCAGAATTCGTCGCGTTTACGTCCGCGTTCGCGGCCGGTTTGTCCTCCCCAGATGCCGTGGTAGTCGCCGATCTGGTCGGCGTAGTCGCCGCATTCGGCGCGGACTGGGCAGTGCCCGCACACCCGTTTCGCTTCGGCTATGACCAGGCGTCGCGCCAGTTCCGATTTCGACCGTTTGATCGTGGCTTCGGACCGTGCGGGCGGCGGGAAGAAAATGTCTGGCCGTGGGTGCCCTCGGCACGCGGCGCGTAGTTTCCACGTTTCGTCCAGTTCGCCGCCGATTTCGGCGAGCCGGCGTTCGATCCGGCCCTTGTTCGGGAAGATCGCGCCGCCGGACATTTACGAGGCCAGCCATTCGGTTTCCCAGTCGACGAGGGCAATGTTTTCGACGTGGCGGCGTCGGGTGTCGACTGGTAGGCGCCAGCCGAGGTCCAGGGCGCCCATGTGCGCGAGGCCGAGAGCGTCGGCTTGGTCGTGGTTCTTGATGCGGTGTTGGTCGTCGAACCATGTGGCTTGGGACTCGGCGAGTACCAGTTTCTTGTGTTCGCCGGGTTTCATGCCGTTGGGGGCGCGTCCGGTGATGAATTTGCCTCGGGTGGTGGGGTTTACGACGGTGATGGGTATTCGTTTCGCGTCGAGGATTGAGAACAGTGCCCACCAGAGGCCGTCGCGGTCGAACTTGCTGGGCAGGTTCGATGCCCATGCGGGGCCTTCGATGACGGCGCGGGCGATGGGTGTTGTGGTGTGGACTTCACCGATGATGGAGGCGATTTCGCGTGCTTCGGTGATGATGCGTCGGCTGCGCCGCCACCACGGAACGCCTTCGCGTAGCGAGTATCCGACGTGCGTGATCACGCCCGGACGGGCGATGCTGCCGGGGTTGTCGCGGACGATCGCGGCGATTCCAGCGCGCGCGAGTGATGGGTCTAGGCCGAGTACGGCTTCGGGGCCGGTCATAGCTTAATCTCCTTGCGTGGCATGGCTAGTCGTTTGACGGCGACGGCGAACAGGAAGGCGACGTCTTCGGGGTCCTGGCCTTCGGCGCCGACGGCGAGCGCACCGATTGGGCATAGCGGGTCACCGTCGTGTTCGGCGAGGTGCGCTTGAAATTCGGCGATGAACTGGTCGAGCACCTTGTCGGCGGCGGCGACCAGCTGGTCTGCTGTGGCCGCAATGGCGGGCGGTATGTGGCTCATGCGTGG
GGATCCTGAATCATCCGCAGAGTGCCGCCGCGTCGGCGCGGGTACGGCGCCACCACCATGGGTGCGCGCAATGCGGGGTGCTGGCCGAAAGCCTTGTGTTCCAGCCGGTTCACACGGGCGAGTAGTTCGTCGATGCTGGCAAGGCTGCGTGCGACGGTCATCGCGGCGGACTGGTTTCGGGTGGTGAATATCTCGGCTCCGCGCGGCGATTTCACCACGAAACGCGTGTGATCAGCCAGGTAGGCTTCGACCACGGTCGGTACCCGCGCAGACCATTGGTCGGGGTCTGGTTTGGGTTTGATGCGGACTTCCCAGCGTCCGGTGCCGTTGGCAGCTTGTGCGGGTGTGGTGGTTTTCATGCGGTGTTCTCCAATCGGGGGTGGGTGCAGCGGATGAGCGGCGACTCGGGGTCTTCCGGGTCGGAGTAGCGCAGGCCGTTGGGGTCGCAGTCGGGGCATGCGCGGATCGCGGCGAGCGTGGCTGCGCGCTGGTCGGCGGCAGCTTGCAACTGTTGGGCGTCCCAGGTGTTTGCGTTGCGCCGCGCGTTAAAGCAGGCGCCGCAGGGCTGTTCGGTTCCGCCGGGGTGTTGGGGGCAGTGCTCGGCCGGGCGGCTTCCTACCGGACTATCCCCACTAGAAGTAACTACTAAGGGGTGGGGTGGGGTGGGACCACGGGACTCCCCAGGGGACACGTTTTCGCCGGGTTCTGCGGTGTCCCCGTCGTTGTCCCCTGGGGACATTTGGTCATCTATGCCGGTCACAAAGTTTCGGCCGCGCCGTCTTTGGTTCCGTTTCTTCTCGGCTTCGCGTTTGCGGTTGGCCTCGTTTTCTGCCTTAGTTCGCTGCCATTTTGCCCAATTTCTGACCAAAATTCCGTCACTTTCTAGGCAGCACAGCGGGGCGTCGAGGGGTCCGGGGGCCGTGAGGGCGGCGACGACCGACCGCGGTACGAGAAGCGATTTCAGCTTCGCGTGCGGGATGAACCCGTCGAGTTCTTCCTTTGCTGACCACGACCCGCCGAGAACCCACGCGCCGGCGACCGCGATCCGCATTGGGACACGGACAGGGGTCGTGGGCAGGTTCATGATTGGCTTGGAATCGCTGAAGCCGTCATCGACGTAGAACCACGGCACTACTCGGACACCTCCCCCGCCTGGCTGGTGTTGTTGGCGTGGGCCAGGAACTCGGCGAGCTTGGCGTCGTATTCGTCGATGAGTCGTCGTACTTCCCTCGTGCTGTTGTAACCCCTAGCGAGCTGACGGATAGCCGATTCGATGCCGCCAAGGGCCAACGCGGCCTTGCCGATACGTCCCAGCAGCTCCGGGTCGACAGTGCGGTCAGGCGACCAGACGTGGTCGCTCCAATCGATGGGGGCGCCGATGGCGTCCTCGATGAGCTTGAGGCGTCTTGCCGCCTCTCCGGTTTCGGGCATGGTCTTCATACGCAGCTCGACCGCCTCGTTCACATCCTTTTGGTACTTCCGGCGCCTTGCTTGGTCGCGCGACTCAACGGCTGCGGCAACCTCGTTGGCGCGCAGGGTGTCTATGCGGGCCATGACGGACCTGACGGCGTCCCATGAAGGGGTGTGGTCGGGCTTGACGGCTGCGGACGTGTGGATGTCCATGCGGGTGCGGCTTGGCCCCGGTGACATCAGTCCCCAACCGGCCGGTAGCTCGCCGGTTTTCACGATCGTCGGGTCGTTGACGACGAGATACCAGGCGTGGCATTGGTCGGCCCACTGATCGGCCTTGCCGGGCTTGTTCAGCTCGTTCAGCCAGTCGGCGCGGCTGATTTTCAGTTCGTGCCCGATGAGGATCCTGCCGCTGCTGGTGGTGAACCCGACATAGATGGCGTCGGCGCGCGCACTGGCACCCCAAGATCCGTTACCGCCAACCTCCGGTACGAACACACCACCCGGTAGGTCCAAGCCAGGTTTGATGTAGTGGCGCCGCAGCAACGCCAGCAGCTCAGATGTGTTGGTGACGTTAGGCATTAGCAATC
TCCAGCAACACGTCGGCATGGCACGGCTGATCGAGCTTGCACCAGCACACGAGGTCATGCCCGGCCAGGAGGTAGCGAATCACGCCGACTTGCGATGTAAGCCACTTGTCGCCGTTGATGTTCGGGCCACCGCCGCGAACCATGTCGGCATAGCAGGCGACGGTCTGCTCAACGGTCAGAACGGTTCCCTCTGGCAGCCATGGCATTCGTGGTCGACCCGGACCCGCAACGTACGGGTTGCCCCACTTGGTCGGCCGTCCGACGTAGATGGACCCTTCGGGCATCCGCCAGCCCGCGGTGCGCTTGCGCTGGATGCGTTCAGGCATGTTCGTTCGCTTTCTTGACGTTGCATTTGTGTCCGGTGGACGGAATCGGTTCCCAGCAGCCGTCGACCTTGCAGAGCGGCGAGCACGACAGGTGACGGTCACCCTGGCCGCAGGTCACGGGCTCGCCGCACACCTTGCAGCGAGGGAACCGGCGCCGCCTGTAACTCATGCGACTACCTCCCCGCTGCGGCAGCCGTACGGTGAGCAGCCGTCCGGGTCGCCGTCCTCGAATAGGTCGAGCTGCATGTCGGCGTACTCGGCACGTGTCACGCGGTCGATTGGCGCCAGGTCCAACGGAACTCGTGAGCGGTGCAGGAACGCCTCGCCGTCGAGTGGGTTGGCCGACGCACCACCCTTGCGGATACGGCGGTCGAAATCGACCGCGTCATCCCAGAGGGAGTGGAACCGCTTACACATGCACAGATCGGCGGGCGCTTCGGGCTGGTCCCGGTTGTACAGATGCGCGCATGCCTTGGGTTCGTCGAACCCGCGCCAGTGGTCGTCACGGGGGTGGTTGCACGTCGCGCAGATGTCGCGCCGCTCGTACATGTACCGCCACTGGGCGTTGCCGTGGAACGGGCAGCCGATGCACGCACTCTTGGCGGTGTGGCCCCACCCGGCGCGCTCTAGCCAGCGCTGGCAGTCCTTGCGGGACATGCCCAGCTCCAGCAGCGGGTACCGCGGCCGGGAGTAGTTCACGTCCAGCCGGTCGCGTACCCGGTGGATCTCATCAGTGGAGAAGCCGATCCACTGCTCGGCGAATACATCTCGCGGTACCGGTGTCGGGTGTGGGTAGCCCAGCAGCTCGCGCACCTTGACCTTGATCGGCTTGAGCTTGTACTCGCTGGTGCACTGGCGACGGCCCATGCCGTGCCGTTCAGTGGCAGTGGCTAGCCTGGTGCCCACGATCGACCCACGGCCGTCGCCACCGCACACCGAACATGAATCAGGCTCGTCAGATGGTCCACGGCCGGAGCCGCCGCAGGGGGCGCATACGCCATAAACAGGCACCTCGGTACCGGCGGGAGCGAGTGTGAAGTACGGCACCGAGACGAACCGGTGTGCCGGGTCGAGGGTGTCGGCGCGCAGGTTCCCCGACGAAACCCGGTACAACGGGATATCCACCCGGGCAAGCTCGGCGGCGAGCCGGTCCACCTGCTCATAGACCGCAGGTGGCTCCCAGCCGGTATCGGCGAACACCGCCGCGTCCAGACCAGGCAGCGTGCCGTCGCACGCCATGAGTGCCAGCACCGTCGACTGGACACCAGCGCCGAGGGACAGCACCCGGATAGCAGGCGCAGGCATCACGCCACCACCTCGGGGTACTGGTCCCAGGTGCGCCCGTCCAGCTCGCGCCCGGCCCGCTTCTTGCCCACGCGGTCGACGAACTCACATGGCGTCCAAGGCGGCGCTATGGAACCGTCGGGTAGCAGTGCCCGCCGCTGATACTTGAGCCGCTTGCCATTGGCGACAGGCTCATTCAGGCTCAGGTCTGGCGACCACTCGCCCCACTGCTTGAACAGGAACGGCACGCCAGCGGCTACGCACTGATCGCGCATCGAGCGCGCCCAGTCGGGATGCATCGGCCTTGCGCCCGGACCGGATTCGCCACCGACGATCACCCAGTCCAGATGCCCGATCCAGAAAACCGAGTCTTTCCCGATCGGGTCACCA
TGTAGGTCGATCGGCCCGAGAAGCGGCTCGGCACTGACGAACCGCACAGCGGCCGGGGTGTCCAGCAGTGCCGGGATGCGGAGGTCGGCGCGCTTCTGATCCTCGGCGCTCACGCCCAGCCAAACGTTCGGCAGCGGCCACCAGTCCCCGGTAGCGGACTCCAGATGACCGGTTTTCAGCGAGAGGTCTTCGACTGCCCAGCCAACGAAGGTTTGCGTGACCTCCGACCGGAAATCATCACTGGAAAGCAGCGACCGCATCCGACCGTGCCGCTTAGTAAGCACCTGGAATGTGTGTTGTGGAGCCAAGGCCATTACAGCGAACACGCGGGCGATGTACTCGTCAGGCACGTCGTCGTGGAACAGGTCGGCTTGTGAGCAGACGAAGATCTTCTTTCCGTCACGCTTTCGGAGTGGCCAGTCCAGACGGTTCGGGTGGAGTTGTACCGCCAGACTCGACCCGATGCCTTCACCGTCAAACTTTCGGCCTGCGATACGAATTGGTGTCGATCGTTCGATGTAGCAGTTCAGACATCCATTACTGACACGGGTGCAACCGGTCACTGGTGACCATGTGGCGTCAGTCCATTCGATGCGTGTCTTGTCACCCATGGGACGCTCCGACGGCGTTTGACTCGATGTATGCGATGACTGCCAGGATCGCGGCGGCGAGCTGGAGCGCCCTATCTGTGCTGCGAAGTATGATTGGGCTGCCGGCGAACTGGCTGATCGCTACGTCATTTCCTGCTGCTGTCACGAATCCCCCGACACCCACGGCGAACCGGCCCAGCCTGTTTTGAATCGGTTTGGGCAGCTCGACGACTGCATAGCCGCGTGATTCCAGTAGCTTCGCCGCCGCGTAGAGCGGGTCTTGGCCGGGGCGGGTGACTTCGGGTGCTTCACCTAACGGTTCTGTGCCGTCATCTGTTTGGTTCACGGGTGTGCCTTTCATGAGCCGCTGGCCTCGCCGGTTCCGGGCCAGAGTGTTGGGTAGATGGCTGCGGTTTCGGCCCAGCAGTCGCGGCATTCGCGGGCAATGAACCGGGCGCGGAAGTCCTGGCCGCAGCGCTGGCATTGGAATGTGAACCACGGACGCTGTTCCAGATTCGGCTTTGGTGCGTGCGTACGGTCGTAGCGCGGGGTGGGGAGGGATGTCATTTCGCCTCCAAGCCGGTTATTGACACCGACGCTCGCGGTATATACCGTGGTATATACCAATCGAAGAAACGGGGGTTGAAATGGCCCAGACCACCGACACATTCCGCAAGTACGGCGAACGGTCCGAGTACCTGGTAGTCGACACACGACCCGTTAGTTACTGCCAGCTGCTCGGCCGCAACGTGCCGAACGACGTGACCCGTATCGCCCGCCTGTCCCGCGCAACCGCGCGTGAACTCGGAACCGCGCTCATCGGATGGACAGCCAGCCCGCTCACCTCGGCGAGTCACACCTTTGAGTCCGACCGCAACGCCCCACTGCCCGACGGCGCTACCGGATGCTGGTCTGTCAATGTCGATATCGAGGACGGAACTAACGCCGTGAACGTCGGCGGTATCGGAACCAACCTCACCCAAGAGCAAGCCCAGGATGTCGGGCAGCGTCTCATCGCATGGGCTGGCGTGCAGGTGCCCGTCGATGCCTAGAGACCCGCTGCGCGCTTTCCGATGCAACGATGAATTGTGGTCATCGGCCCAGGACAAGGCTGACGCAGAAGGACGTCCCCTGTCCGAGGTGATTCGCGAACTGCTGTCCAAGTGGGTCACGCGGCCACCTCGCAAAGGCTGAGCAGCCAGTACAGCGCCGCAATCGCCTGCTGGGGTACGACGCCGTTTCCGATGATGCGCAGCATGGCGCTGCGTGAGATCGACACGGCGGTAACCCATCCGAGCGGCCAGCCCTGCATCCACTCAGAGAACCGAGCCGCAAGACGCGGGTTACCCTTCGCGCCGGGCTCTGTCGGCGAGGGGGCCTCGCGGGTGATGGCCTCCCACCGCCGGATCG
CAGGCGCGTACTTGCCCCACTGCGGTGTGCCGTTCAACAGTGCGTAGTCGATCAGTTGGCGCGAATGGCCCTCTCGGCTGTCGGGATGAGCGCCGCCGCCTGTCGCATCGCTGGCGCAGGGTGTCGGCAACAATCCATCAATCGCTCGTAACCCCGGGTCGTTTCGATTTAGGTCGGCAGGCGACGAACTATGCGCATCTGTCGCCTTCGGCGTCGGCAGCAACGCGCCCTCGGCGTATGCCTTGGCGACACCGGGCAAGAGCAGCTCGTCACCGCGTACACCCGAGCGGGTCAGGTGACCACCCGTGCCGTCAGCGACCGATGGGGTCGGCAGCAGGTCTGTGATCGAGTCGAGGCTCGGTCGCACGGATGCCCCTGCGCTCGGCGACCGATTGCTTGGATAGCGGCTGGCTGCGGGTGTAGGCAGTAGTTCTACTCGGTGAGCCGCATTACCGCCGAGGGCAGCATCAGATCCCCCGATGAACCCCGCTGATTCGGGCCGCCCTTGGTCCCATCGGTCGCCGTCGGGGTCGGCAGCAGTGCGTCGCGGGTCTCGCTGGTCGAGGCTCCACGGCTTGCCCGCGGGGTAGGCAACAATGAAGACGCGTTCGCGGCGGTGTGGTGCTCCGATGTCGGAAGCGGCAACAGTTGTCCATCGAGCGTCGTACCCGAGGTCGGCCAGGTCTCCGAGTACGGCTCCCGCTGCTCGCAGAATAGGTCGGCTTGATCGGTCTCCCAGATCATCCGGACCGGGTTCCATTGCGCGATGGGCGTAGCCACTGAGTAGCCCCCTGACGTTCTCGATGACCACCAGTCGTGGTCGCAGTTGATTGATTGCCTCGGCGTACTCCAGCCAGAGCCCCGACCTTGTGCCCGATGCGATACCGGCGCGACGGCCCGCCGCCGACACGTCCTGGCAGGGGAACCCACCGCACAGCACGTCTACGGGCTCGACTGCCGACCAATCAACGGCGGTGATGTCGCCCAGGTTCGGCACGCCCGGCCAATGGGCGGCAAGTACTTTCGCCGCGTCCGGGTCGGCCTCGCAATGCCACACCGTGCGGCCACCGGTGACATGCTCGACGGCCAGGTCCAGTCCGCCGGCACCGGAGAACAACGAACCAATGCGGGCCGTCATCCTGCACCGCCGAGTTTTTCGGCCCACTGCACCGCCACCGCCGCCACCTGCACAAGCTCTTCGACGACTTCGCGACGCAGCGAGTCTGCATCTTCAAGCCTGAACACCTCACGGTTCATCTGATAGATGTCAGCCGCCTCGGCAGCCTCCGCAAGTTCCTCGATCAGGATGTCCATCCATGTGACCTCGCCACGCCGAGCGCGTTCGTCTGTGCGTGCCTTGGCCTCTGCTGCTGTCGGAATGCCATAGAACCTCGCCGATTCCCCGCGTATCGCGTCGGCGTAGGTACCTGAAGAAACGATGTTGGGGTGGTTCTGTTCGCCCCATTTGTCCTGTTGCCGTTGACGTTCGGCCGCGACGAGTTGGAGGACGAATTCGGTGCTGGTCATCAGCGCGCACCCCGGTTCCAGAGTCCGTTGCGGGCGTGGAATGTGCCGGTGAATGGTTCGCCGCATCCGGCGGCGCCCCAAGCGGTGTGCGGGATGAACCGGTCGACGTCGAGGGTGCTGGTTTCGCCCCGTTCGGTGAAGATGAATCGGCCCCCGGGGCACCAGAACCCCCAGTCACGGGTTCGGGGTCCGGTGACGACGATGGTCCAGCAGCAGTCTTCGCGTTCCAGCAGCGACATCGGGTCATTCGACTTGGGCAGCTCGACGCGGTGCCGGGTTTTCGCTGCCCGGTAGGCGATGGACGCCCAGTGCCGTTTGATGCGTCGGCCGTCGGCGCGGTGCTCGTAGTAGTGGCCAGCCAGGACGAATGACCAGAACCACCATGGGCGGTCGTGTAGAGCGCGGTCGTCGTCGGATCGAATGAACTGGTGCAGGTAGATGTTCAGCCGCTTGTTACGTGGGATCAGATACCA
GCGCAGCAGGTAGGGGTCGTCTTCGCCGCCGATCGCTAGATGGTGCTGGCCTCGGATGAGCTTGCCGAACCACTGGCGCAAGGTCAGGTCTGGTGTTTCGGTCATTGTGGGGTTACCTCGTCGGTGAGGGGTTGGACACTGTCAGGGGGTTCGATGTTGGGGTCGCGGCCACCGAAGTAGCGCAGCCGCAACCCTTCTGGCGCATCTGGGGCCAGTGGTTCGCCTTCGACCATGGTTCGGTCCGTGTCCGGGTCGTATGTGACGGATACGACCCGGACCGGACGGCCTTCGAGGGTTTCGCCGAACTGCTGGCCGATGCTGAACTCGTCGGTGCGGTCGCCGACGAACTTCGCGCACCTCACGTGCCGCTGCCCGGTTCGGCCACGACCAGCGGTTCGGCCGTGGGCTGCGGTGGCAGCTCGTCGGGCAGCGCGGGCGCCGATAGCGGTGTCAGGGCGAACATTTCGCCGTGTTCACGCGCTAAATTGCTGAGCCATTGCAGGATTTCCGTGTCGACGGCGAGCGTGCCGAATGCCGGTGTCTTGCCGGGGTCGATCGCTTCGGCCAGCACCGGATGCTGTTCGCGCACCGCGGTTCTGCATGCTTCGAACGCTGGCGCCATCTCGGACACGAGCGGCACGTAGCCGAGCATGTGCCCCAACACGTCGTACACCTGTGACAGCAGGCAGAAGATGCGGTCGGGGTTCCCGATGGCGAGGGTGACGACGACACCGATCGGGAACTCGCGTGGCTCGACGCTGGTCATGAGATCCACCAGCCGGGCGAGGCGAGCATCGTGACGGCGCCGATGAATGCCGAGGTCAGCATGAACAGCGCGAAGGCTGCAGCGTTGCCGGCGGCTGCCTGGTGGCGGCGCCGCATCCGCCGTACCTTGCTGCGGCGCTGCAGTCGGCGAGCACGTTTCGAGTCCCGGTTCCGTAGACGTCGTCGCCGGATATCGTCGACCGGAACGCTGCCCTGGTACACGCTCTTGTGCTGCAGCGCACCGAGGATCGCCAGTTCGTAGGCGTTGGGTTCCCGCACGGGTTCGGTGTCGGCGTGGTGGTGCGTGGTGGGGTGGACTACTTCAGCCCCATCCTCGGCAGAGTCGTCCACCTGGTCGCATACATCCTGCTCACCGTGTGTGTCCGAGTGGGCTAGTTCGTTGAGGACTTCGAGGTCGATGTCGTGCAGGTAGTCGCGGATCTTGTCGACGCCGTGCAGCTGCTCGGCTTGGCGGTGCCGCTGATAGTCGGTCATCCGTGACGGCTGCCGGACCGCACGCGCGGCCCGGCGCCGAGCACGGCTCGGATTACCCCTCATCGCTGAACTCCGGGCCCGTCTCCGCATCGTCTTCGTCGTCGATGACCTCGGCGTCGACGACGCTCTCGTCGCCCGTGGCCTCGGCGGACGGGACCGGGTTGCCGTCAACCACGTCCAGCATCGAGCCCTGGTTCTCGTCCGCACCTGCCGGCGCCTTCTGCCCTGGCAGTCCGACCCATTCGACCTTGAGGGTGCGTTTGTCGCGCATTTCGCCGTCTGCGGCCTCTACGACCTTCGACTCGATGCACCGGTACTTCACGAGCAGGGTGCCGCCCTCTTTGAGGCAGGGCGGGTTGTTGATCTTGATGTTGGTGGCGCGGAACCCGATGTACGCGTGCGGTTCTCCGTCGTCGATGTCGTCGAGCGCGTTGGTGGACTTCAGATCCTTGGGTTTGGTGGTGGCTCCCATGGCGGTGTTATCTCCTTGCGGTGGTGGTGGTTTCCGGCCCGGTCGGGTCGGTTGCGTCCGACGAGTTGTCGGCCGGGTCTAGGTGGGGATCGGTGACGGTCACGATGACGTCGCCGTCTTGGTCAGGCTTGGAAACCCAAACCGTCGCGTCAGGTCCGTATCCGGCCTGAATGGCGTTGGCGGCGAACATGTGCACGGCACCGAGCGTCAACCCGCCTTTGGCGGTGAACTTTCCGGTGCGCGACATCAGTTCTGGTCGCTTTCGAGG
CCGAGTTCGCCTTGCTCACCACCATCGGTGTCGGTGTTGGCGACGGCGTCGAGTTCGTCGGCTTCCCGTAGCGACCAAGCATTGACGTACTCGTTGATCTGCTGATCGAGCCTGCCCGCTTCCTTGAAGCCGTTGAGGGCGTTGACGATGGTGCGTAGTTCCTGGTCGGTCATGTCGGCGCGGTGCTCGAATAGCTCGGTTCGTCCGAGGATTCCGGCGATGACGATGAGCTGTTCGTCGCGGTCGGTGCATTCGGCTTTGTTCAGTGCGGCGAACATCGCCTTAAGCCACTTCTCGCGGGCATCGGCACTGAGTTCCCCCGCGGTCTCTGCCGCTTGCGCACTCTGCTGCGCGGCAGCTTCGGCTGCGCGCTCACGGAGTCGACTCGCTCCCTTCGCGCGCTGCGGCGGGGCTTGGCGTTCCTCGACGACCTCGCCGTCGATCACCGTCGGCTGCGCAGCATCCGTGAGGATCAGGCCGGCGAACTCGTTCGGGTAGGCGCGCCTGCATGCCGCCGCCTCGGCGCACTTGCCGATTTGGTTGCGCGGCATCTTCGCCCACATGCTGTTGGGTTCCTGCCCGACGACCTTGCGGCCCTGGCCGGTGCCCTCGTAGACGTTGTTCGTCTGCACGAACTCGTCGAAATGCGCTACCGCAGTGAAGGGCTCACCGTTTCGGATAACGGTAAACTTCGCGGCGACCGGCGGGGTCTTGCCGGGCCATACCTCTTTCCACTCCCCGTCGTCGCCGCAGTAGAACGGGCCTTCAACGGCCAGCGTGTCTCCGTTGTGGTGCGCGTACTCGCGGACCTTGCGCCGGAACCCGTCGATACCGGTTTGAATGGTGTACTTGGTGACGTACCGCTCAACCTTGCGTCGGCCGCCTTCCCCGTTGTCTAGCCACTCGGTGAGCTTGGTGTTACGGCCGATCATGTAGATCTCTTTGCGGAACGGATCGAGACCGGTGGTTTGGCAGACGTGAAAGAACACGTCCAGATCGCCGTCTGTGGCGTCCTCGATACCGAGCTGCCGCAATGCTGCGCGTTGGGCTTCGCTGAATCGGGCTTGCCCTGTGTGGATCGCGAGTTCCGTTCCGAGCGGCGCGGACACCGCTATCTCGCCGGCAGCCTGCGCCAACGGATCGGGCCAGATCTGGGCGGCGTGTTCATTGGCAACCTTCGTGGTGGTTTCGGTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP013049|3354262:3367380|3358675_3359818_-|WP_057138013.1|DBSCAN-SWA MMPAPAIRVLSLGAGVQSTVLALMACDGTLPGLDAAVFADTGWEPPAVYEQVDRLAAELARVDIPLYRVSSGNLRADTLDPAHRFVSVPYFTLAPAGTEVPVYGVCAPCGGSGRGPSDEPDSCSVCGGDGRGSIVGTRLATATERHGMGRRQCTSEYKLKPIKVKVRELLGYPHPTPVPRDVFAEQWIGFSTDEIHRVRDRLDVNYSRPRYPLLELGMSRKDCQRWLERAGWGHTAKSACIGCPFHGNAQWRYMYERRDICATCNHPRDDHWRGFDEPKACAHLYNRDQPEAPADLCMCKRFHSLWDDAVDFDRRIRKGGASANPLDGEAFLHRSRVPLDLAPIDRVTRAEYADMQLDLFEDGDPDGCSPYGCRSGEVVA >NZ_CP013049|3354262:3367380|3361942_3363322_-|WP_057138017.1|DBSCAN-SWA MTARIGSLFSGAGGLDLAVEHVTGGRTVWHCEADPDAAKVLAAHWPGVPNLGDITAVDWSAVEPVDVLCGGFPCQDVSAAGRRAGIASGTRSGLWLEYAEAINQLRPRLVVIENVRGLLSGYAHRAMEPGPDDLGDRSSRPILRAAGAVLGDLADLGYDARWTTVAASDIGAPHRRERVFIVAYPAGKPWSLDQRDPRRTAADPDGDRWDQGRPESAGFIGGSDAALGGNAAHRVELLPTPAASRYPSNRSPSAGASVRPSLDSITDLLPTPSVADGTGGHLTRSGVRGDELLLPGVAKAYAEGALLPTPKATDAHSSSPADLNRNDPGLRAIDGLLPTPCASDATGGGAHPDSREGHSRQLIDYALLNGTPQWGKYAPAIRRWEAITREAPSPTEPGAKGNPRLAARFSEWMQGWPLGWVTAVSISRSAMLRIIGNGVVPQQAIAALYWLLSLCEVAA >NZ_CP013049|3354262:3367380|3358168_3358510_-|WP_065209205.1|DBSCAN-SWA MPERIQRKRTAGWRMPEGSIYVGRPTKWGNPYVAGPGRPRMPWLPEGTVLTVEQTVACYADMVRGGGPNINGDKWLTSQVGVIRYLLAGHDLVCWCKLDQPCHADVLLEIANA >NZ_CP013049|3354262:3367380|3363318_3363711_-|WP_057138018.1|DBSCAN-SWA MTSTEFVLQLVAAERQRQQDKWGEQNHPNIVSSGTYADAIRGESARFYGIPTAAEAKARTDERARRGEVTWMDILIEELAEAAEAADIYQMNREVFRLEDADSLRREVVEELVQVAAVAVQWAEKLGGAG >NZ_CP013049|3354262:3367380|3357315_3358176_-|WP_057138012.1|DBSCAN-SWA MPNVTNTSELLALLRRHYIKPGLDLPGGVFVPEVGGNGSWGASARADAIYVGFTTSSGRILIGHELKISRADWLNELNKPGKADQWADQCHAWYLVVNDPTIVKTGELPAGWGLMSPGPSRTRMDIHTSAAVKPDHTPSWDAVRSVMARIDTLRANEVAAAVESRDQARRRKYQKDVNEAVELRMKTMPETGEAARRLKLIEDAIGAPIDWSDHVWSPDRTVDPELLGRIGKAALALGGIESAIRQLARGYNSTREVRRLIDEYDAKLAEFLAHANNTSQAGEVSE >NZ_CP013049|3354262:3367380|3365923_3366163_-|WP_016892150.1|DBSCAN-SWA MSRTGKFTAKGGLTLGAVHMFAANAIQAGYGPDATVWVSKPDQDGDVIVTVTDPHLDPADNSSDATDPTGPETTTTARR >NZ_CP013049|3354262:3367380|3364946_3365444_-|WP_057138021.1|DBSCAN-SWA 
MTDYQRHRQAEQLHGVDKIRDYLHDIDLEVLNELAHSDTHGEQDVCDQVDDSAEDGAEVVHPTTHHHADTEPVREPNAYELAILGALQHKSVYQGSVPVDDIRRRRLRNRDSKRARRLQRRSKVRRMRRRHQAAAGNAAAFALFMLTSAFIGAVTMLASPGWWIS >NZ_CP013049|3354262:3367380|3354982_3355321_-|WP_016892133.1|DBSCAN-SWA MSGGAIFPNKGRIERRLAEIGGELDETWKLRAACRGHPRPDIFFPPPARSEATIKRSKSELARRLVIAEAKRVCGHCPVRAECGDYADQIGDYHGIWGGQTGRERGRKRDEF >NZ_CP013049|3354262:3367380|3356174_3356543_-|WP_057138010.1|DBSCAN-SWA MKTTTPAQAANGTGRWEVRIKPKPDPDQWSARVPTVVEAYLADHTRFVVKSPRGAEIFTTRNQSAAMTVARSLASIDELLARVNRLEHKAFGQHPALRAPMVVAPYPRRRGGTLRMIQDPHA >NZ_CP013049|3354262:3367380|3355935_3356178_-|WP_052572192.1|DBSCAN-SWA MSHIPPAIAATADQLVAAADKVLDQFIAEFQAHLAEHDGDPLCPIGALAVGAEGQDPEDVAFLFAVAVKRLAMPRKEIKL >NZ_CP013049|3354262:3367380|3363710_3364289_-|WP_057138019.1|DBSCAN-SWA MTETPDLTLRQWFGKLIRGQHHLAIGGEDDPYLLRWYLIPRNKRLNIYLHQFIRSDDDRALHDRPWWFWSFVLAGHYYEHRADGRRIKRHWASIAYRAAKTRHRVELPKSNDPMSLLEREDCCWTIVVTGPRTRDWGFWCPGGRFIFTERGETSTLDVDRFIPHTAWGAAGCGEPFTGTFHARNGLWNRGAR >NZ_CP013049|3354262:3367380|3366162_3367380_-|WP_057138023.1|DBSCAN-SWA MTETTTKVANEHAAQIWPDPLAQAAGEIAVSAPLGTELAIHTGQARFSEAQRAALRQLGIEDATDGDLDVFFHVCQTTGLDPFRKEIYMIGRNTKLTEWLDNGEGGRRKVERYVTKYTIQTGIDGFRRKVREYAHHNGDTLAVEGPFYCGDDGEWKEVWPGKTPPVAAKFTVIRNGEPFTAVAHFDEFVQTNNVYEGTGQGRKVVGQEPNSMWAKMPRNQIGKCAEAAACRRAYPNEFAGLILTDAAQPTVIDGEVVEERQAPPQRAKGASRLRERAAEAAAQQSAQAAETAGELSADAREKWLKAMFAALNKAECTDRDEQLIVIAGILGRTELFEHRADMTDQELRTIVNALNGFKEAGRLDQQINEYVNAWSLREADELDAVANTDTDGGEQGELGLESDQN >NZ_CP013049|3354262:3367380|3364285_3364546_-|WP_016895884.1|DBSCAN-SWA MRCAKFVGDRTDEFSIGQQFGETLEGRPVRVVSVTYDPDTDRTMVEGEPLAPDAPEGLRLRYFGGRDPNIEPPDSVQPLTDEVTPQ >NZ_CP013049|3354262:3367380|3361131_3361341_-|WP_074342987.1|DBSCAN-SWA MTSLPTPRYDRTHAPKPNLEQRPWFTFQCQRCGQDFRARFIARECRDCWAETAAIYPTLWPGTGEASGS >NZ_CP013049|3354262:3367380|3364542_3364959_-|WP_057138020.1|DBSCAN-SWA MDLMTSVEPREFPIGVVVTLAIGNPDRIFCLLSQVYDVLGHMLGYVPLVSEMAPAFEACRTAVREQHPVLAEAIDPGKTPAFGTLAVDTEILQWLSNLAREHGEMFALTPLSAPALPDELPPQPTAEPLVVAEPGSGT 
>NZ_CP013049|3354262:3367380|3359814_3360795_-|WP_057138014.1|DBSCAN-SWA MGDKTRIEWTDATWSPVTGCTRVSNGCLNCYIERSTPIRIAGRKFDGEGIGSSLAVQLHPNRLDWPLRKRDGKKIFVCSQADLFHDDVPDEYIARVFAVMALAPQHTFQVLTKRHGRMRSLLSSDDFRSEVTQTFVGWAVEDLSLKTGHLESATGDWWPLPNVWLGVSAEDQKRADLRIPALLDTPAAVRFVSAEPLLGPIDLHGDPIGKDSVFWIGHLDWVIVGGESGPGARPMHPDWARSMRDQCVAAGVPFLFKQWGEWSPDLSLNEPVANGKRLKYQRRALLPDGSIAPPWTPCEFVDRVGKKRAGRELDGRTWDQYPEVVA >NZ_CP013049|3354262:3367380|3360787_3361120_-|WP_131807238.1|DBSCAN-SWA MNQTDDGTEPLGEAPEVTRPGQDPLYAAAKLLESRGYAVVELPKPIQNRLGRFAVGVGGFVTAAGNDVAISQFAGSPIILRSTDRALQLAAAILAVIAYIESNAVGASHG >NZ_CP013049|3354262:3367380|3355321_3355939_-|WP_057138009.1|DBSCAN-SWA MTGPEAVLGLDPSLARAGIAAIVRDNPGSIARPGVITHVGYSLREGVPWWRRSRRIITEAREIASIIGEVHTTTPIARAVIEGPAWASNLPSKFDRDGLWWALFSILDAKRIPITVVNPTTRGKFITGRAPNGMKPGEHKKLVLAESQATWFDDQHRIKNHDQADALGLAHMGALDLGWRLPVDTRRRHVENIALVDWETEWLAS >NZ_CP013049|3354262:3367380|3361421_3361826_+|WP_057138016.1|DBSCAN-SWA MAQTTDTFRKYGERSEYLVVDTRPVSYCQLLGRNVPNDVTRIARLSRATARELGTALIGWTASPLTSASHTFESDRNAPLPDGATGCWSVNVDIEDGTNAVNVGGIGTNLTQEQAQDVGQRLIAWAGVQVPVDA >NZ_CP013049|3354262:3367380|3354262_3354940_-|WP_057138008.1|DBSCAN-SWA MPDYPPNMTLRPIETWPHSLTRDRRRSNFSAQWSDTLTRLDRELWYLGKDQQYAAAVLQIAMREQDFRITDGMPRANATAQHPGVILNIESRHGPLSYPCDTFTRWQDNLRAIALGLEALRKVERYGITQTGQQYRGWQAIEAKATPIAQTPAGAAVYLAKAAQGNDDSVSDWAHRILHDPETARTTYRKARANTHPDRHGGARTAWDAVEAAADILRDAGAPIE |
20 | Mycobacterium_phage(45.45%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_4 |
3733428 : 3744619
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP013049|3733428:3744619|DBSCAN-SWA CGTGAGCGCATTGATGACGGCATCCCCGTTTGACGCCATCCGACACCTGACCGACGGAGGCCGCGAGTACTGGTCGGCACGCGATCTCATGCCGCTGCTCGGATACGAGAAGTGGGAGCGGTTCGCCGACGCCATCAACCGCGCCAAGAGCGCTGCACGCAACGCCGGGTACGACCCTGCGACGCAATTTCCCGGCGCCGGGAAATTGGTCTCCACCGGCAATGGAGCGCAGCGAGCGGTCGAGGACTACCACCTCTCCCGGTACGCCTGCTATCTCGTCGCACTCAATGGCGATCCACGCAAGCCTGAAATCGCAGCCGCGCAGACATACTTCGTCATCAAGACCCGTGAGGCCGAGACCGCCACGGCCGCGCCCGCGCTCACAGGCACCGACCTACTCGCCGCCGCGGTGCTCGAAGCTCAGCGGATGATCGAGGCGAAGGACGCTCGGATCGCAGAGCTTTCGCCCAAGGCCGACCTTGCGGACACCTACCTCACTGCACAAGGCGGGTCCCGGCTGATCCGGGAGGCGGGCAAGCTGCTCGGCATGCGCGAGCGCGAGTTTCGCCAGTGGCTCTTGGATGAGCGGCTGATCTTCGCCAAACACGCTCCGTGTGGCGCGGTGCAGTACGACCACTACGCGCAATTCGCGCACTACTTCCAAGCGCACGAGCACGTCGTCGCGCACTCATGGGGCAGCTGTGCCCACTACACCTTGCGCATTCTGCCGCGAGGGATGGAACTCATCACCGCACGCTTGGCCCGAATCTCCAAGTAATCGCAAGTCTCACAACTGAATAAGTAAAGACGCTGGCGGTCCCGTCGCCAAACAGAAACCGCCAGCGTCCCCTACCAACCAATCCTACTGAGAGGACTTGGCATGCCCCAACATATCCGCAGGCGGTCGCACGGGCGCCGCCGACCCCGGCTGAGCAGCTACGACGCGATCACCGTCGTGCTGGCTGCTATCGCGGTGCTCGCCGCGATGCTGCTGGCCTCCCCGGACTCGCACGCCGACCCGGTGACCGATGACTTCGTGACGACGAGCGGCTGGCGCGTGTGCAACGAGCTGGACGCGCAGCCCAATTTCGACGGCATCCGGTACTCATACCGGGCACTGTCGGCGCGCGGCTACAGCCTCGATCAGTCGGCCCAGATCATCGTGGGCTCAGTGAAGGTGTGGTGCAAACGCCATGCGCCACTGCTCAAGTCATACGCCGACACCTATGCTTCAGAGCCGCAGCAGAGCCAGGGGCGCGCGGCATGACCATCACCTTCGACCCCAACCCGACGTTCGACGAGCTCATGGCCGCGTTCGACAAGGCCGAACGAGCATGCTCCCCCACCGTCGTCAACATCGTCCTTGATCTCGAAATCGCCGACCTGTTCGAGAGATTGGGCAGTCACGGCATCGCCGTCCTGGTCGCCAACCAGAAAGCGTGGCGCGAGTCCGTCAAGGAGTCCGGTACAGACCCTCGATCCGCCTGGACCGCTGACGGCGGCGCCGAGCGCGCACTCGTCGAGTTTTTCACCGACCGCGACAGCCGGGACAAGGCCAGCGCGGCAATTCGCGCATCGGAGGCGGGCTGGTGACGACGACTCACTACCTCGAAATCGAAAGCCATTGGCACGAAAGGCTCTACGACGGCATCAAGACATATGAGGTGCGCCGCGCTGATCGCGATTACCAGAAGGGCGATCGCATCCTGTTCAAGATCGGGCCGTCCAAGGCTCTGTCGTACGCGTCGTGGATCATCACGCACGTCATGTACCAGGCCCCGTGGGTGGCCGATGGCTACGTGATTCTCTCGCTGGAGCATCCACACAAGACGCGACGCGAGAAGGAGTACGAAGCGCGCGGTCGAAGCATCGAGGAATATCGCCGCTCCAATGCCGCACTGCGCGGAGTTATTACGCGTCTGC
GCAATCAGCTGAGTGATGCGAATGCCAGACGGCAGGGGACGACCGAATGAGCGAGCCCACGCGCGACCCGCGCGAAGAGAAGCTACCCCAGTGGGCGCGAAAGCTGTTGGCCGATGAGCGCTACCGCGCCAGCCGTGCCGAGCACAGGCTCGCCGAGCACGTCGCCAAAATCGCGAAGTCGCGAATCCGATACGGGGGCTACGACAATCCGATCTACATCCCCGACGACAACGGGTATCAGACAGTGTACTTCTACCCCAATGGGGGCGACAGCACGTTCCAGCAAATCGCCGTCACGATCCGCGACGGCGCTATCGAGATTCAGGGCGGCGACACGCTGACGATCGAACTGCAAGCGGGCAACACCTTTCGCGCTCGCCTCCGGGGTGACTCATGACCGTTATGACGATCGACGTTGACGAGAGCTACGAAACGAACATGCGCGTCCTCAAGGGCGTGCTGTACCGCCTCGTCGAGGCTGTCCGCGACACAGACCCCCATCAGGTGCATCGCGAGCTGGTCTCGATGTGGTTGCGCCACCCCGTGAAAGCGGCGCAACTGATGATGGCGCTCGCCATCGGATTCGACCCGGACACGGTGACAACCAAGATGCTCGACCAGCGCGCCGAGGAAATCGCGGGCATTACAACGCTCCCCCACAAAGGAATTGAGGTCCAACCATGCAGAGCATGAAGACACATCCAGAGGCCGCCTTGGGCGATTGCCCCGCACGGTTCGACAACTACGTGTGCACCCGCGACGCGGGCCACGACGGCAGTCACATGGCCAACGCGTTCGTTGAAGTGGTTGCGATCTGGGACAACGAACTAGCTTGGCGTGCAGACGATGCCCAGGGCTGTTGGGCCCAGCGCAAGGGCAGCGAGTGGGTCGAGGCTGACGCATGAGCGAATGCATCATCCAGGCCGAAATCCCCACCGCTGACGGCCTATACGCCGGTATTCCTGATGAGGTCTACCACGCCGACCGCACCAGCTTGTCGTCGTCAGGTGCTCGTGCACTGTTGGCGCCGTCCTCGCCCGAGATCTTCCACTACCAGCAACGGCAGCCGCCAGAACCCAAGCCGCAATACGACTTCGGGCACGTTGCCCACAAGTTCGTGCTGGGCGAAGGCGCCGATATCTGCGAGCTAGATCCGGCCGTTCACGGGCTGAACAAGGATGGCTCCCCCGCCAAGTCGCCCACCGCCACCGCGATGTGGCAAGCAGCAGCCGAGGAAGCGCGCAAGGCCGGTCAGATCCCGATGCACATCGCCGAGGTGGCCAAGGCCAAAGCGATGGCAGCCAGGGTGCACGAGCACCCGCTCGCCGGGCCGCTACTAGCCGACGGGACACCGGAGCTGTCCGGGTACTGGCACGACCGGGAGACGGGCGTGCGCCTGCGGTTCCGGCCCGACTGGCTGCCCAACCCCGGCCGGGGACGGCTGATCGTCGTCGACTACAAGACCAGCTCCAGCGCCTACCCGGGCCACTTCGCCAAGTCCGCAGCCGAATACGGCTACCACCAGCAGGCGCCGTGGTATCTGGACGGCCTGGCCGCGTGCGAGATCGCCGACGACGCCGCGTTCCTGTTCGTCGTGCAGTCCAAGACGGCGCCCTACCCGATCACCGTGGTCGAGCTCAAGCCCGAAGACATCGACCTCGGGCGGCGCCGCAACCGCAAGGCCATCGACCTGTACGCCCAATGCGTCGCCGATGACCACTGGCCCGGCTACGGCGACCACGTGCACTCGGTATCGCTCCCCAGTTACGCCACCTACCAGCAAGAAGGAGAACTCGATCAGTGACCGTCACCCCCTACCAGCCCATCTCACCCGCACCGCGTACGGCAGTCAGCCAGGCCACCTCAGTCGAACAGTCCCGCGCCGTCGCCGAGGTCCAATCCGCCGTCATCGTGGCCCAGCAGATCCCGCGTGACATGCAGCGGGCCGAAGCGGAGATGCGCGATACGTGCAATCGATCCGCGATGGCGAAACAGGCCTT
CTATCAGGTGCCGAACCGGGGCAACGGCGCATCGGTGCACCTCATGCGCGAACTCGCGCGAGTCTGGGGCAACGTGCAGTACGGCGTCAACGAGTTGCACCGCGACGACTCCCGGGGCGAGTCCGAGGTTCAGGCGTGGGCGTGGGATGTGCAGACCAACACCCGCTCTACGCGCACCTTCATCGTCCCCCATGCCCGCATGTCAAAGGGGCGCCGCCAAGAACTCACCGACCTTGGTGACATCACGAACAACAACAACAATGCGGGCGCTCGCGCTGTCCGTGAGTGCATCAACGCCATCTTGCCCAAGTGGTTCACCGAAGCGGCACAGGACATCTGCAAGGCCACGCTGGAGAACGGCGAGGGCGTGCCCTTACCCAAGCGCATCGAGGACATGATCGCCGGATTCCGCGCCATCGGCGTCTCACAGGCGCAATTGGAGACCAAGATCGGCAAGAAGCGCGGCGCCTGGGATGCGGGCGATGTCGCGCAGATGGGCATCACCTACACCTCGATCACCCGCGACGGCTACGACAAAGCCGAGATGTTCCCGCCGGTCGCGGGAGTGACAACCGACGAGATCAAGGCCAAGGCCCCGGACAAACCGAAGGCCGAAGCGACACCAGCTCCCGAGAAGGCACCAAGCCCGGAGAAGGTCGAGGAAGCACCTGAGGCCAACCCCGCTGAATACAACTCGCGCGGTGAGTTTCTGGCCACCAAAAAGACCATCGGCACCATCCGCGGCCTGCTCGGCAACGCGGGCTATTCCCTGCGCGGCGATGCGGCCACCGTCAAAACGCTCACCTATCTGGCCACTGTCGTCGGCCGCGAAATCGCCGATATCAACGACCTATCCGAAGCCGAGGCAGAGGTAGTGACCGACGTTCTGAACCAACCCACCACAACAGAAGGGAATGAATAACCATGTCCGACAACGACACCGAGAAGAAAGAGGAAGGCACCGAACTCGCGCCCGGCGACATCACCGAGTTCATCGTCGTCTTCACCCAACTCAACAAGGGCCGCACGCAGGTCGAAGCAACCAAGGCGCTGCATGAATGCGTCGAGGCCGCGATGGCCACAGGCAAGAAGACCGGCACCGTCACGATCAAGATCAAGGTCGAGCCGCTGGAGTCCGGCGCAGTCAGCCTCGTACCCGATGTCGCCAGCAACCCCGCCAAGGACCCGGCCGGGACGATCTTCTTCGCCGACGGCGAGGGCGGCCTATCCCGCGACAACGCCAGCATGCACTACGGCCTCAGGTAACCCAACCCACCCGAAGGAGTAACACCCATGTCCGACAACACCATTGCGCTACCAAAGCACGACGCCGATCTGATCGACGAGCCCGACGCCGACGCCCCGCTGTACCTCGACCAAGTAGAGGAGACCTGATGTCCCGCAACCTCATCGTCGTAGACCTGGAAACAACCGGCCTCGGCCCGCAGTGCGCGCCGATCGAGGTTGCGGCCATCAACGTCGACACCGGAGAAACACTCGAATTCGTGCCGTACGTCGACCTGTCCAAGGTCTCGATCGAGCCCCAGGCTTTCGCCATCAACCGCTATTTCGAACGCGGTGTGTATGACGCAATGCTCAATCCCGACGACACCATCACAGCGTGGAATGACCTCGCCGACATCCTGAGCGGCAACACCTTTGCCGGATCGAACCCGACATTCGACGCAGCCATGGTCGCACGCAAGGTCGGCACGCACTGGCACTACCGCCTGGCCGACCTCGCCGCCTACGCGGCCCCTGCACTCGGGCGCGACCCGTCCGAGCTGCCGGGACTGGCCGACGTGCTCAACGCCCTCAAGATCGAGAACCGTTGCCCACATTCGGCATTCGGTGACGCCGAGGCAACCGCCAAAGCATTCGTGAAGCTGCGCGACTTCTACGCGGATGTGACCCTATGACCGCCCCGTCCATCTCCCGTCGCTACATCGACGCCACCCCCGTGCGCGAGCACCTGGAGAAGCTGCAGTCGATCGGCTGGA
CCATCAACGCCATCGCGGCCGCCAATGGCCACCCGGGAAAGCTCGTCACTACTCTGCGCCAGATCCTTCGCGGCCAACAAACCTGTGCCCCATCCACCCGCGACTACGTGATGTGGATGGACCCCGAACTGCCTCCCGAGACCGGAAAACCGTTCGTACTCAAATGGTCCGAATACGTGTACATCGGCGTACCCGACCATGCGGCTGCGCGGGAAATGGGCATCACCTACAACTCCATGTCGGAGCAGCTACGGCGTAACGGTTTTCAGCCATCAGCGCTGCTGTATGAGCTGGCCCGCGAGGAACGCGAGAAAGCCAAGGCACCTGCATGATGCTCACCGAAGATCAACGCTGGCTATTGCGGATGGTCGGCGGGTGGGAAATGCGCGACTGCCTCATCGGTCCCGCAGGTGTCACCCATTTGATGCAATCCTGCTACGGCGGCACCCGCCTGCCTACCGACGGATATCCGTCTCACCTCAAGGGATTTGAGTGCGGACACGGCAAGATCGTGTCGAGGGGCATCCCCGTCGTCACCGTGACCACCGCGCAGCTGAACAAGTTCGCGCGCTCCCTGCCAGTCGAGCTTGTCGCCGAGATGCGCGGGTGCGCCACCGCCGCGCAGCGCAATAACTTACTTCGCCACCAGTTCTGCCACTGTGGGAGCGAACCGTGCGGGTACGCGTACATGGGCGACCGCATTTGCCCGCCGACCGAGCAGCAGGAAGCCGACGCCAAGGCCGAGTTCTGGCGCTGCCAGGACTGGACCGACGACTTACTCGACCGCGCACTCGGGTTCACCGCCGAGGACGAGCCGGTCGGACAACTGGAGCTGTTCGGAGTCAGCGCATGATCACGCCCTACTACCAAGACGAATCGGTCAGCCTGCACCACGGCGACGCCCTCGACGTGGCCAAGGCACTGCCCGCCGGCGGGGCCGATTGCATCGTCACCAGCCCGCCCTACTTCGGCCTTCGCGACTACGGCGAGCCCGGCCAGTATGGGCTGGAGGACTCGCCAGCCCAGTACGTCGAGAATATGCGCGCGCTGTTCGCCGAGCTGCGCCGCGTGCTCGCCGACGACGGAACACTCTGGCTCAACCTTGGTGACAGCTACTACAGCGGCCGGGGCAACCCGGGCCCGAACGCCGACGACCGAAAGAACATCGCAAGGCGCGGCTGGGTCCGGCCCGTAGACCGCCCCGGGCGCGAGTGGGCCAAACCCAAGGATCTGCTGGGCATTCCGTGGCGCGTCGCGTTCGCGCTGCAAGACGACGGCTGGACGCTGCGCAACGACATCATCTGGCACAAGCCGAACGCCATGCCTGAGAGCGTCGTCGACAGGCTGGCCGGGCGCCATGAACACGTGTTCATGCTGGCCAAGTCGAAGCGCTACTGGTTTGACCTCGACCCGATCAGAGAGCAGTACGACGGCGATCGGGAGGCCTCGCGGCGGTCACGGTCCGGGCTGGTCAACAAGGCCAACAGCGTAAAGACACCTTGGGTACCGCCAGAGCCTCGGCCGACAGCTTGGAATGACCAATCGAACATGGGCGCCACCGGGCGCCAACACACATGGACCGACAAGGGCGGCCGCAACCCTGGCGACGTGTGGGAGATCCCCACACAGCCATTCCCGGGGGCCCACTTCGCGGTCATGGCCTCCAAGCTCGCGCAGCGCTGCATCGCCGCCGGATGCAGGCCCGGTGGCACGGTGCTTGACCCCTTCAGCGGTTCCGGCACAACCGGAATGGCGGCACAGCGCCTCGGCCGCAAGTACATCGGCATCGAACTCAATCGCGACTACCTAGACCTGTCGCTACGTACCCGGTTACATGCTGCCCCGCTCGATTTCGAGGCGGGCGCATGACCATCACCCGCACATGGTTCCGGTTCATCTGCATGCGCTGCGCGCACGAATTCCAGACCGACCGGATCGTCCACGAGTGCTTCAAATGCCGGACGGCCCAGCGAGACGCGTTCCCCCACGCCCCAATCGAG
GTCATCGAAATGAGCGGCACATGAGCGACAACCCGAACACGAACGGAGACAACATGAAACCCCGCATTGAGCCCCTGGTCGTGATCGATCTAGTGGAGCAGTACATCGACAAGGAGCCGCACAGCGCCGGTCGGCCACGGTGCGAGAAGTGCCATACGAAATTCAGGAGGGGTGAGTGATGGCCGATCCCACAATCCGCGTGCTGTCCCTCGGCGCTGGTGTCCAGTCGACGGTGCTAGCGCTCATGGCGTGCGACGGCACGCTGCCCGGTCTGGACGCTGCGGTTTTCGCCGATACCGGCTGGGAACCGCCCGCAGTCTATGAGCAGGTGGATCGGCTCGCCGCCGAGCTTGCCCGCGTTGATATCCCGCTGCATCGGGTTTCGTCGGGGAACCTGCGCGCCGACACCCTCGACCCGGAAGCGCGATTCGTTTCGGTGCCATGGTTCACCTTGGCGCCCAAGGCTACCGAGGTGCCTGTTTATGGCGTATGCGCACCCTGCGGCGGCTCCGGCCGGGGACCATCTGACGAGCCTGACTCATGTTCGGTGTGCGGTGGCGACGGCCGTGGGTCGATCGTGGGCACCAGGCTAGCCACTGCCACTGAACGGCACGGCATGGGCCGTCGCCAGTGCACCAGCGAGTACAAGCTCAAGCCGATCAAGGTCAAGGTGCGCGAGCTGCTGGGCTACCCACACCCGACACCTGTACCGCGTGATGTGTTCGCTGAGCAGTGGATCGGATTCAGTACTGACGAGATTCACCGGGTGCGTGACCGGCTGGATGTGAACTACTCCCGGCCGCGTTACCCACTGCTGGATCTGGGCATGTCCCGTAAGGACTGCCAACGCTGGCTGGAGCGCGCCGGGTGGGGCCACACCGCCAAGAGCGCCTGCATCGGCTGCCCGTTCCACGGCAATGCCCAGTGGCGGTACATGTACGAGCGGCGCGACATCTGCGCGACGTGCGGCCATTCCCGCGACGACCATTGGCGCGGGTTCGACGAACCGAAGGCATGCGCACACCTGTACAACCGGGACCAGCCCGAAGAGATCGCCGATCTGTGCATGTGTAAGCGGTTCCACTCCCTCTGGGACGACGCGGTCGATTTCGACCGCCGTATCCGCAAGGGCGGCGCCTCGGCCAACCCACTCGACGGCGAGGCGTTCCTGCACCGCTCGCGAGTTCCGTTGGACCTGGCACCAATCGACCGCGTGACACGTGCCGAGTACGCCGACATGCAGCTCGACCTATTCGAGGACGGCGACCCGGATGGCTGCTCACCGTACGGCTGCCGCAGCGGGGAGGTGGCGTGATGCCCATCCGCCCCGAGAACCGCGACCGCTACCCCAAGGACTGGCCCGAGATCTCGCGGCGCATCCGGTTCGAGCGCGCCCAGGGCCGCTGCGAGTGCGAGGGTGAGTGCCTACGGGGCACACACCTCGACCGCTGCACGAACGTCAACGGACAGCCCGCATACGGCACCGGCAGCCGCGTCTTGCTGACCGTGGCGCACCTGAACCACACACCCGAGGACTGCCGCGATGAGAACCTGCGCGCCATGTGCCAGGGCTGCCACCTGCACTACGACCTAGAGCACCACGCGCAGACGCGCCAGCGGGCCCGCACGGCGGCTCTTGAGGCACAGATGGACCCGATGTTCGGCCCCGAGATTTTGGGGTGAGAAGGAGTGCCGAACGTGCCGCAGTCTGAATACATGCACGCGAATCAGAGAAAGGAACACCGTGGCTAACTCGGCCGGAATGCTCAAGGAATCAATCTGGCGCGACGGCCATTTCCGAGCGCTCACACGCACCGCGCAATGCACCTACGCGCAGCTGCTCAGTCAAAAGGATCTCGACCGCGCCGGGATGCAACCACTTCAAATCACCAAGTGGGCCAAGGGGTGCAACGAGATGTCCATTCATGACCTACAGGCCGACCTCGACGAGCTGGAGCGTGAACGGTTCGTGTTCTACGACGAGGACACTGACGAACTGT
TCGTGCGCGCCTACATGCGTACCACCGAGGTCACGCGGTATCCGCAGTACTTCAAGAGCGCCTTGAAATGCGCCGTCATGGTGGCCTCGCCCAAGCTGCGCCATGAGCTGGCGGTCGAGCTACGTCGCCTGCGCAAGCCCGAGGCGACCAAGGTCGCCGATGAGATTGACCCGTCTGACCCTGACCCCGATGACACCGTGACGGAACCGTGCGAGAACCCTGACGGCACCGTGCCCGAAGGGTGCGAGAACCCTGCCGGAACCGTGAACCCTGACGGCACCCTGCCCGAACCCTCTAGGGAAAGGGTAAGGGTAGGGGTAAGGGAACTTACGTTGGTAAGTACTCAAGTTGGGGAGCGCTGCGCGCCGCCCCCCGAGTTCTGCCCCAAGCATCCTGGCGGCACCACGGACCCGTGCCGCGCCTGCCAGCGCTACCGGGTGCAGTACTCCCAGTGGGCCGCAGACGACGCGGCTCTCGCCGCCGCCGAGCAGCGCGCACAACACCGGGGCGAGCGAGATGCCAAGCGCCAGGCCATCGCCGCGTGCCGCCTGTGCGACCAGGACGGCTACAACGGCCTCTCCGTCTGCGATCACGTCGACCGCTCGGCCACCGCCAGAGCCGGACTCGCCAGAGCCCGCGCAGCGCTCGAAAATCCCCCCGCCGCGACCGGATAGTCCCGAACGGCCCGAAAACCCGCCAGCGACGACCACAGCCCCAGGAATCGATATGCGAACGGAGACACGATGACCCAGAAAACGGACCCCGAGCGGTTTACCTGCCCCGGGCTGGAAGAGGGCGGGCGCGTAGCCATCCAGCTCACCGATGGCACGCTGGCCGAGGGCTACTGGTACGACGACGCGGTACACGACGAGCCCGTTCGAGCCGGGGGTGCTCCGATGAGGCGCTCATTGGCCGGCCAATGGTGAAGGCCTCGCGCGGGGTCTCTTGGCGCACACGCCAGCTGTGCTCGGAGTCCGACGAGCATCACGAGCGTGTGTGGTTCTGCATGACCTGCAAGGCACTAGAACAGCGACTCGCCCCGGTATCCGAAACGCTCGCCGAACTGCTCCAAAGCGCCGACCTGTCCGTGACGATCACCAGGTGGCCTCGATGAGCCCAATGCGGCACGGCGACGCTGAGCGGATAGCCGAGCTCTGCGCCGAGGCTGGCAAGCCGCTGCAGCCCTGGCAGCACTGCCTACTCCAGCAGATCGAACAGCGTGATATCGATGTCCAATTCGCCAAGATGGTAAGGGGATTCAACCGTTGA
Protein sequences of DBSCAN-SWA_4 >NZ_CP013049|3733428:3744619|3742664_3743033_+|WP_057138069.1|DBSCAN-SWA MPIRPENRDRYPKDWPEISRRIRFERAQGRCECEGECLRGTHLDRCTNVNGQPAYGTGSRVLLTVAHLNHTPEDCRDENLRAMCQGCHLHYDLEHHAQTRQRARTAALEAQMDPMFGPEILG >NZ_CP013049|3733428:3744619|3743112_3744024_+|WP_057138070.1|DBSCAN-SWA MLKESIWRDGHFRALTRTAQCTYAQLLSQKDLDRAGMQPLQITKWAKGCNEMSIHDLQADLDELERERFVFYDEDTDELFVRAYMRTTEVTRYPQYFKSALKCAVMVASPKLRHELAVELRRLRKPEATKVADEIDPSDPDPDDTVTEPCENPDGTVPEGCENPAGTVNPDGTLPEPSRERVRVGVRELTLVSTQVGERCAPPPEFCPKHPGGTTDPCRACQRYRVQYSQWAADDAALAAAEQRAQHRGERDAKRQAIAACRLCDQDGYNGLSVCDHVDRSATARAGLARARAALENPPAATG >NZ_CP013049|3733428:3744619|3735396_3735747_+|WP_052541731.1|DBSCAN-SWA MSEPTRDPREEKLPQWARKLLADERYRASRAEHRLAEHVAKIAKSRIRYGGYDNPIYIPDDNGYQTVYFYPNGGDSTFQQIAVTIRDGAIEIQGGDTLTIELQAGNTFRARLRGDS >NZ_CP013049|3733428:3744619|3736251_3737154_+|WP_049233928.1|DBSCAN-SWA MSECIIQAEIPTADGLYAGIPDEVYHADRTSLSSSGARALLAPSSPEIFHYQQRQPPEPKPQYDFGHVAHKFVLGEGADICELDPAVHGLNKDGSPAKSPTATAMWQAAAEEARKAGQIPMHIAEVAKAKAMAARVHEHPLAGPLLADGTPELSGYWHDRETGVRLRFRPDWLPNPGRGRLIVVDYKTSSSAYPGHFAKSAAEYGYHQQAPWYLDGLAACEIADDAAFLFVVQSKTAPYPITVVELKPEDIDLGRRRNRKAIDLYAQCVADDHWPGYGDHVHSVSLPSYATYQQEGELDQ >NZ_CP013049|3733428:3744619|3739678_3740203_+|WP_057138063.1|DBSCAN-SWA MMLTEDQRWLLRMVGGWEMRDCLIGPAGVTHLMQSCYGGTRLPTDGYPSHLKGFECGHGKIVSRGIPVVTVTTAQLNKFARSLPVELVAEMRGCATAAQRNNLLRHQFCHCGSEPCGYAYMGDRICPPTEQQEADAKAEFWRCQDWTDDLLDRALGFTAEDEPVGQLELFGVSA >NZ_CP013049|3733428:3744619|3735016_3735400_+|WP_057138056.1|DBSCAN-SWA MTTTHYLEIESHWHERLYDGIKTYEVRRADRDYQKGDRILFKIGPSKALSYASWIITHVMYQAPWVADGYVILSLEHPHKTRREKEYEARGRSIEEYRRSNAALRGVITRLRNQLSDANARRQGTTE >NZ_CP013049|3733428:3744619|3734690_3735020_+|WP_052541730.1|DBSCAN-SWA MTITFDPNPTFDELMAAFDKAERACSPTVVNIVLDLEIADLFERLGSHGIAVLVANQKAWRESVKESGTDPRSAWTADGGAERALVEFFTDRDSRDKASAAIRASEAGW >NZ_CP013049|3733428:3744619|3736030_3736255_+|WP_049233925.1|DBSCAN-SWA MQSMKTHPEAALGDCPARFDNYVCTRDAGHDGSHMANAFVEVVAIWDNELAWRADDAQGCWAQRKGSEWVEADA 
>NZ_CP013049|3733428:3744619|3738744_3739269_+|WP_057138062.1|DBSCAN-SWA MSRNLIVVDLETTGLGPQCAPIEVAAINVDTGETLEFVPYVDLSKVSIEPQAFAINRYFERGVYDAMLNPDDTITAWNDLADILSGNTFAGSNPTFDAAMVARKVGTHWHYRLADLAAYAAPALGRDPSELPGLADVLNALKIENRCPHSAFGDAEATAKAFVKLRDFYADVTL >NZ_CP013049|3733428:3744619|3744093_3744276_+|WP_057138071.1|DBSCAN-SWA MTQKTDPERFTCPGLEEGGRVAIQLTDGTLAEGYWYDDAVHDEPVRAGGAPMRRSLAGQW >NZ_CP013049|3733428:3744619|3738274_3738616_+|WP_057138060.1|DBSCAN-SWA MSDNDTEKKEEGTELAPGDITEFIVVFTQLNKGRTQVEATKALHECVEAAMATGKKTGTVTIKIKVEPLESGAVSLVPDVASNPAKDPAGTIFFADGEGGLSRDNASMHYGLR >NZ_CP013049|3733428:3744619|3740199_3741219_+|WP_057138065.1|DBSCAN-SWA MITPYYQDESVSLHHGDALDVAKALPAGGADCIVTSPPYFGLRDYGEPGQYGLEDSPAQYVENMRALFAELRRVLADDGTLWLNLGDSYYSGRGNPGPNADDRKNIARRGWVRPVDRPGREWAKPKDLLGIPWRVAFALQDDGWTLRNDIIWHKPNAMPESVVDRLAGRHEHVFMLAKSKRYWFDLDPIREQYDGDREASRRSRSGLVNKANSVKTPWVPPEPRPTAWNDQSNMGATGRQHTWTDKGGRNPGDVWEIPTQPFPGAHFAVMASKLAQRCIAAGCRPGGTVLDPFSGSGTTGMAAQRLGRKYIGIELNRDYLDLSLRTRLHAAPLDFEAGA >NZ_CP013049|3733428:3744619|3744460_3744619_+|WP_153995300.1|DBSCAN-SWA MSPMRHGDAERIAELCAEAGKPLQPWQHCLLQQIEQRDIDVQFAKMVRGFNR >NZ_CP013049|3733428:3744619|3733428_3734205_+|WP_052541729.1|DBSCAN-SWA MSALMTASPFDAIRHLTDGGREYWSARDLMPLLGYEKWERFADAINRAKSAARNAGYDPATQFPGAGKLVSTGNGAQRAVEDYHLSRYACYLVALNGDPRKPEIAAAQTYFVIKTREAETATAAPALTGTDLLAAAVLEAQRMIEAKDARIAELSPKADLADTYLTAQGGSRLIREAGKLLGMREREFRQWLLDERLIFAKHAPCGAVQYDHYAQFAHYFQAHEHVVAHSWGSCAHYTLRILPRGMELITARLARISK >NZ_CP013049|3733428:3744619|3737150_3738272_+|WP_057138058.1|DBSCAN-SWA MTVTPYQPISPAPRTAVSQATSVEQSRAVAEVQSAVIVAQQIPRDMQRAEAEMRDTCNRSAMAKQAFYQVPNRGNGASVHLMRELARVWGNVQYGVNELHRDDSRGESEVQAWAWDVQTNTRSTRTFIVPHARMSKGRRQELTDLGDITNNNNNAGARAVRECINAILPKWFTEAAQDICKATLENGEGVPLPKRIEDMIAGFRAIGVSQAQLETKIGKKRGAWDAGDVAQMGITYTSITRDGYDKAEMFPPVAGVTTDEIKAKAPDKPKAEATPAPEKAPSPEKVEEAPEANPAEYNSRGEFLATKKTIGTIRGLLGNAGYSLRGDAATVKTLTYLATVVGREIADINDLSEAEAEVVTDVLNQPTTTEGNE >NZ_CP013049|3733428:3744619|3741525_3742665_+|WP_057138067.1|DBSCAN-SWA 
MMADPTIRVLSLGAGVQSTVLALMACDGTLPGLDAAVFADTGWEPPAVYEQVDRLAAELARVDIPLHRVSSGNLRADTLDPEARFVSVPWFTLAPKATEVPVYGVCAPCGGSGRGPSDEPDSCSVCGGDGRGSIVGTRLATATERHGMGRRQCTSEYKLKPIKVKVRELLGYPHPTPVPRDVFAEQWIGFSTDEIHRVRDRLDVNYSRPRYPLLDLGMSRKDCQRWLERAGWGHTAKSACIGCPFHGNAQWRYMYERRDICATCGHSRDDHWRGFDEPKACAHLYNRDQPEEIADLCMCKRFHSLWDDAVDFDRRIRKGGASANPLDGEAFLHRSRVPLDLAPIDRVTRAEYADMQLDLFEDGDPDGCSPYGCRSGEVA >NZ_CP013049|3733428:3744619|3739265_3739682_+|WP_017555356.1|DBSCAN-SWA MTAPSISRRYIDATPVREHLEKLQSIGWTINAIAAANGHPGKLVTTLRQILRGQQTCAPSTRDYVMWMDPELPPETGKPFVLKWSEYVYIGVPDHAAAREMGITYNSMSEQLRRNGFQPSALLYELAREEREKAKAPA >NZ_CP013049|3733428:3744619|3734307_3734694_+|WP_016341885.1|DBSCAN-SWA MPQHIRRRSHGRRRPRLSSYDAITVVLAAIAVLAAMLLASPDSHADPVTDDFVTTSGWRVCNELDAQPNFDGIRYSYRALSARGYSLDQSAQIIVGSVKVWCKRHAPLLKSYADTYASEPQQSQGRAA >NZ_CP013049|3733428:3744619|3735743_3736043_+|WP_052541732.1|DBSCAN-SWA MTVMTIDVDESYETNMRVLKGVLYRLVEAVRDTDPHQVHRELVSMWLRHPVKAAQLMMALAIGFDPDTVTTKMLDQRAEEIAGITTLPHKGIEVQPCRA |
19 | Mycobacterium_phage(64.29%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
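As a rough illustration of the keyword rule described above, the sketch below flags a protein annotation when it contains one of the listed keywords. The function name `is_phage_related` and the case-insensitive substring match are assumptions made for illustration; the page lists only the keywords, not the matching procedure it uses.

```python
# Keywords copied from the list above. The matching rule (case-insensitive
# substring search) is an assumption, not the documented behaviour.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def is_phage_related(annotation: str) -> bool:
    """Return True if the annotation contains any phage-related keyword."""
    text = annotation.lower()
    return any(keyword in text for keyword in PHAGE_KEYWORDS)
```

Under these assumptions, `is_phage_related("phage tail fiber protein")` is true, while `is_phage_related("hypothetical protein")` is false.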
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
3855244 : 3860476
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP013049|3855244:3860476|DBSCAN-SWA CATGGGCAAGTGGGTCTGGATGCCACGGGTCACCGTCGCGGACTCGGCAGTCGGCGCCGATCTGGCCCATCTGCTGCGGGTCGCCCACACGGGCATCGATCAGGGTGTTAGCGCAGACCTCGCCGTCGCAGGAGTGGGCCTAGGCGCCAGTGATGCCAGCCGCGGCGCCGACCTGGCGCGGGCGAAGCTGCGCGTGGCTGCACGAGACGCCGGGATAGCTGCCGACACGGCCCGCCCCGGTGTGCGCGCCACCGATTCGGCCGTGGCCGCCGAGATGGCGCAGATGCTCCCCCGCCTGGCCGCCGTCGGTGCCGCCACAGCCGCCGATATCGCGGTGCTGTCGCGGGTTCGGCTTCCCTCCAGCGCCAGTCAAGCCATCGGCGCCGATACCGCCACCGCCCGGTTCAGTCCGCAACCGGCAGCGCTGACCGCGATCACCGCAGTCGGCACGACCGTGGTCCCGATCCCGGTGTGGTGCCGCTATCTCGATCTGGCGCTGGTCGGCGCTGGCGGCGGCGGTGCGAGCTCGGGCACGTTCTACCTACTCGGAGGCTTCCCCGGCAGCCCGGGAACCTGGGCCACCACCACTTTGGAGCGCGGCATCCACATTCCCTGGACCACAACAACCCTGACATTCGCCATCGGCGCAGGCGGCGCAAAGGGTAGCGGCGGTTTCGCCGGAACCGCGGGCGGCCCAGGTGCGGCAACCACCGCTATCGGCGACGGATGGGCGGGCCTGTCCGCTGCTGGCGGCGCTGGTGGCCCGCAGCACCCCACCGGCATCAACGCCAACGACGGCCCCGGCCCGGGCGACAAGACCTACAACGGCGTGACCTACCCGGGTGGTGCCACGCAAACCTCCGATGGCGGAACGGGCTACGCGCCCGGCGGTGCCGGTGCGGGCGGTGCCAACTTCGGCGGCCCCGGCGGCATCGGAGGCGCAGGCGGCGCCTGGTGCCGCGCATACCAGTAACCGCAGGAGGGACCACCCAAACATGGCCAACCCCAACGACATCGACAACTACTCATTCCGAATCCACTTCTACAGCAGACGCGAAACCTCCTATTTCGACATCTACATGAACGACGGCGCAATCGGACTGATCAACGGAAACTACTACCTCGACGCGGCCCCACACGACCCCAACGTCGGCGAATTCCTCCTGCAATACGTCCCCAAGCTCAACACCACCATCTGGGACTTTGACGACGGCAGCCTTCCGGCCAACACCGAGGGCTACCTCTGGTACCAGGTCAACGAAACCTACGTCATCACAGGCGATTACCAGCCCTTCGGTGGCCTGATGATCGAGGGCCAACTCGGATGCGCCTACCTGAAATCCGTCATCGCCCCCTACAGAGACCACCAATGGACGACCGAATCACCCCGCAACGTCGCGCTGGGATACACCCCGCGCATCAGCGGATGGACCACCTGGGAAACCCCGTAACCAACAGAAAGGCCCCCGCATGTCCGAATACCAAGCACCGCACCGACGCGCCTGCTGCGCCGCAATCACCGCACTCGGCAACCGAATCGGGCTATTCGCCGGTACCACCCGCGTAGGCACCGCCTACGCCGACACCACCTGGGCCACCCCAGTCGATGTCACCGAATCCGGCATCGACAAGGCATCATCCACCGGCTCGCTGGTAACCATCTCGGTACCTGGCGGCACCGTGGCCAACGGCACGGTGATCAACCGGTACGGCGTGTTCAACGGCGCGACCCTGCTGCGCACCGAGGCACTACCGGTCTCCCTGACCGTCAACGACGGATCGCAGCCGTTACAAGTCGATGTCACACCAACATTCAAGTTCTGGGGGGTGTAGTCATGGCCCGCCAGCTTCTCAAGCACTCGGCCTTCTACGCCGCACTTACCGCCATCTCATTCCGGCTCGGCTGGTGGGCGGCCGACCGGC
TCTCGTCCTACGCCCAGGAAATAGATCCACGCATCGAAAGGGAGTACACCCGATGAGTTTCCGTACCGTGAACGGCAACACCCACACAGAGGACGGCTGGCGGTGCTGCAATCGGGACGAATGCGACATCGTGCGCATACCCGAGCTGTACCTCGTCGACACCGCGCCCCTGCGCAAGGGCGCCCCGCTGATCATCCTCGGCGCATGGCTGTACTGGTATGACCGCAACGTCGAGGAGATCACCTCCCCGGTATGGGGCTGGTCACTCGACAATGACGTGCTCGGCAACCCCGGGCGCAACGACGGATCAAATCACCTGTCGGCCACAGCCGTTGACGTGATGGCGCCCAAGTACCCCTGGCAGCGGTACACGATGGACGCCGCCACGCAGGCCAAGGTCCGCAAGGGCCTGGCCCTGTTCGAGGGCTCGGTGTTCTGGGGCCGCGACTGGTCGCGCCCCGACGAGATGCACTACCAGATGGCCTGGCCCGAGGGCGACAAACGCAATGACGCGTTCGCCGACAAGCTGCGCGCCGGATACCTCGGCATCTACGCTCCCGCGCAGCCCCCGGCGGTCGATCCTATTGCGCTACACCAGAAATTCGTCAAAGAAGCTCCCGACCGCAAGCTACTGGAATACATCGCCGAACAACTCGGCCCGGGTCATCCCGACTGGGCATCAAAAGGTATGACGCTGCGTGACAAGGTGTGGTCCAAATGATCCGCATCGGAGACCGCAGTGAAACGGTCCGTCGGTGGCGGGTCGTGATGAACGACTGGTTCGGACCGCTGTACACCCGGCTGTTGGGGCCGCTGCCGCAAGATACCGACGAGTTCGGGCCGCGCGCTGCCCTGTGGGCCGCCGAGTATCAGCGCCGCACTGGCCAGATCCCCACCGGGCAGGTGTCCGATGATGACCTGCGCGCGCTGGGTATTGCGCCCCCGGCCCCGCCCGCCAATCGCCACCTGGGCCTAATGTTCCGGGGCACCGGAGGCATCATCGGCCAGGACTACGTATCTCGCGTCATGCAGGCTGTGGCCAACCTCGTCGAGGAAGTGCACCCCGAATTCGCCGCAACCATGGGCGGACTCCCGGTCGGCGCCGCGGGCAGCCCCGGTGACATCTCAATGGCCAAGGCCGTCGAGATCGCGGTCGCCGACGCACAACGCATCTTCCTGGAGCGCTACCGCGCCAACCCCAAGACCAAGGTCGTCATCGGCGGATACTCGGCCGGCGCCGTCGCGGCGGCCAGATTCCGCGCGTGGCTGGCCGAGCACTACCCGAACAACTACCTGTGTTCATTCAGCATCGGTGACCCCACCCGACCCCACGGTGGCAGCTACTACGGCGGCCCGGTCCTTGCGGGGCAAGGCATCTCGTCATGGCGCTACGGCGACACAGGCGACTACCGGCACTGCTGGCTCACCGACCCCGGCGACATGTACGGCAACATCCCTCTGGGCGTGGTCGGGGACATCATGGACGACTGTTTCGACATGGTCACCGCATTCCAGATCACCGACCCACTCGGGGCCGCAGGCGCCATCCTGCCCAAAATCCCCGAAATCGCCGCCAAGGCATTGGGCATCGAGCTGCCCGCCATATTCGGCGCCCTCTCTGGCGGCCCCAACGGTATCGCCGCACTCGGCCTACCCGTGGTCCTCGGCGGTCTACAGGGACTACTCGGCTGGGGCGATATCAACAAGCTCACCGGGCCCGCGGCCGCCGCCCAAGCCGCCCTGATCGCGCTGCGTTTCGTCACCACCAGCCCACCGACCGCCGCGCACATTCAATACGAGTACCGCGAGGTCTGGCCCGGCCAAACCTATCTCGGGCTCGCCATCCAGCACGTGCGCGACTGGGCCAGCCGCACCCCCGCCATCGCCGCGTAGATCAGTCCGCCCCCGCGCGAGGAGAGCGCGCAGGGACTCCCCACACCGTAGCGCTCCCTATCCATGGCGCCATCGAAAAAACTCCCCCTGAACTGCCCAAACGCAGT
TATCCACAACCCAACCGCCGAGAGGACCCGTCATGCACATCACCATCCCGCCCTGGCTCAAAGACGCCGCCGTTGACGCTGCCGAGCGCGCCATCAAGACGTTCGCGGGTGGCTTCATCGTCGGCGCCAACCTGGCCGACGCCGCAGTGAACGCAGCCCTGACCGAGATCGATTGGCAGAGCGGTATCAATGTCGGCGCCGGGACGCTGGCGGTATCGCTCATCTTCTCTGCGGCATCGATCAAGCTGGGCCGATCCGGTACCGCCTCGGCCACCAAGGCGGTCGTACCGTCCAGCCTGTTCAAGCTCGTGGCGGGCAGCGGCCGGTGAGCCTGCTGACCGAAATGGTGGATGTCAGCAGCATCGATACCCCAAAGCAGTTCGCGGCACTCACGATGGCGCTACTCGCCCCGATCTGCGGATCGGTCGCGGTGGCCTGGGCCACGGCCACGTTCGCGCACCGCAAGAAGCTGGGCGCTATCGCCGCCGATACCGGAGCCATCCGTGAGCAGACCGAGAACGACCATGAGACCAACATGCGGGTTGATCTCGACGAAATTCTCAAGGGGATCAAGCGAATCGAGGAACAACAGGGCCAGCAGGCCCGCGACATCGGAGGCCTGCGCGAGGAAATGCGTACCGAACGCACCGAGCGCGGGAGGGCAGACCAGCACATCCGAGAGCTGATCGAGCGCTGGCCGCATTGATTGCAGCCCAACGAAATAGCGCCCCTCACCCCGACCCGGTGAGGGGCGCTATTCGTCGTTAGGGCTAACTTCCTTGGTTGGCTGCGCGTTCAAGTCGGCCGACGCATTCGCCGACCACTGCACTGAAATCGGACCAGTCGGTCGCGATGTTGCGACCGTCCCTCAGCATCGAGTGACGCCGGGCATCAGAGGTGAGCGAATACACCACACCGTTGTGAAGTACCCACGTGTCCTGGCTGGAGACCTTTAGATCTCCCCGAAAGATGTTGCCGCCCACGAAAGTCTCGGCGTGCGTTCCGTCGACCGACTGAGCGTGCGCAAGATGCTCACCGTTGGTGAACTGCGCATCGATGATGTCAACGATTCGCGCAGGGGCCTCTTTGCATACCGCGCCCTGCGCGGGGATGTGCGTTGGGGTCGCAGCTGTGCCGGACGGCCGGCTTGGCATCACCACCCGAGAAGTCGGGTTATCGGGGATTCCCGAGGGCTCACTCGACTTGCCCGCGACGCCCACCGTGCATGCGATCACGAGCGCGAATACTCCGATGAATCCGCCCAGGAATGCCAACAGTGCTTTCCATCGGGCTCCCCCCTTGCTTGCCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP013049|3855244:3860476|3856238_3856694_+|WP_057138077.1|DBSCAN-SWA MANPNDIDNYSFRIHFYSRRETSYFDIYMNDGAIGLINGNYYLDAAPHDPNVGEFLLQYVPKLNTTIWDFDDGSLPANTEGYLWYQVNETYVITGDYQPFGGLMIEGQLGCAYLKSVIAPYRDHQWTTESPRNVALGYTPRISGWTTWETP >NZ_CP013049|3855244:3860476|3855244_3856216_+|WP_057138076.1|DBSCAN-SWA MGKWVWMPRVTVADSAVGADLAHLLRVAHTGIDQGVSADLAVAGVGLGASDASRGADLARAKLRVAARDAGIAADTARPGVRATDSAVAAEMAQMLPRLAAVGAATAADIAVLSRVRLPSSASQAIGADTATARFSPQPAALTAITAVGTTVVPIPVWCRYLDLALVGAGGGGASSGTFYLLGGFPGSPGTWATTTLERGIHIPWTTTTLTFAIGAGGAKGSGGFAGTAGGPGAATTAIGDGWAGLSAAGGAGGPQHPTGINANDGPGPGDKTYNGVTYPGGATQTSDGGTGYAPGGAGAGGANFGGPGGIGGAGGAWCRAYQ >NZ_CP013049|3855244:3860476|3859936_3860476_-|WP_005061765.1|DBSCAN-SWA MASKGGARWKALLAFLGGFIGVFALVIACTVGVAGKSSEPSGIPDNPTSRVVMPSRPSGTAATPTHIPAQGAVCKEAPARIVDIIDAQFTNGEHLAHAQSVDGTHAETFVGGNIFRGDLKVSSQDTWVLHNGVVYSLTSDARRHSMLRDGRNIATDWSDFSAVVGECVGRLERAANQGS >NZ_CP013049|3855244:3860476|3859491_3859872_+|WP_005061763.1|DBSCAN-SWA MSLLTEMVDVSSIDTPKQFAALTMALLAPICGSVAVAWATATFAHRKKLGAIAADTGAIREQTENDHETNMRVDLDEILKGIKRIEEQQGQQARDIGGLREEMRTERTERGRADQHIRELIERWPH >NZ_CP013049|3855244:3860476|3859198_3859495_+|WP_005061762.1|DBSCAN-SWA MHITIPPWLKDAAVDAAERAIKTFAGGFIVGANLADAAVNAALTEIDWQSGINVGAGTLAVSLIFSAASIKLGRSGTASATKAVVPSSLFKLVAGSGR >NZ_CP013049|3855244:3860476|3856713_3857076_+|WP_057138078.1|DBSCAN-SWA MSEYQAPHRRACCAAITALGNRIGLFAGTTRVGTAYADTTWATPVDVTESGIDKASSTGSLVTISVPGGTVANGTVINRYGVFNGATLLRTEALPVSLTVNDGSQPLQVDVTPTFKFWGV >NZ_CP013049|3855244:3860476|3857218_3857887_+|WP_057138079.1|DBSCAN-SWA MSFRTVNGNTHTEDGWRCCNRDECDIVRIPELYLVDTAPLRKGAPLIILGAWLYWYDRNVEEITSPVWGWSLDNDVLGNPGRNDGSNHLSATAVDVMAPKYPWQRYTMDAATQAKVRKGLALFEGSVFWGRDWSRPDEMHYQMAWPEGDKRNDAFADKLRAGYLGIYAPAQPPAVDPIALHQKFVKEAPDRKLLEYIAEQLGPGHPDWASKGMTLRDKVWSK >NZ_CP013049|3855244:3860476|3857883_3859059_+|WP_057138080.1|DBSCAN-SWA 
MIRIGDRSETVRRWRVVMNDWFGPLYTRLLGPLPQDTDEFGPRAALWAAEYQRRTGQIPTGQVSDDDLRALGIAPPAPPANRHLGLMFRGTGGIIGQDYVSRVMQAVANLVEEVHPEFAATMGGLPVGAAGSPGDISMAKAVEIAVADAQRIFLERYRANPKTKVVIGGYSAGAVAAARFRAWLAEHYPNNYLCSFSIGDPTRPHGGSYYGGPVLAGQGISSWRYGDTGDYRHCWLTDPGDMYGNIPLGVVGDIMDDCFDMVTAFQITDPLGAAGAILPKIPEIAAKALGIELPAIFGALSGGPNGIAALGLPVVLGGLQGLLGWGDINKLTGPAAAAQAALIALRFVTTSPPTAAHIQYEYREVWPGQTYLGLAIQHVRDWASRTPAIAA |
8 | Mycobacterium_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
4170664 : 4180156
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP013049|4170664:4180156|DBSCAN-SWA ATCACGCAGCATCGCGATCAAGAGGAATCGGTGGCCGCTCGGCCACCGTGACTGCCTGCTTCCGGGTAAGCCGGGACTTCAGTGCGATAGCGCGTTTTTCCACGACAAACCAACTGAAAGCCGCCAGCGGTACGGTGGCAACTGTCGCAATGACAAAGAACAGCAACAGATTCAGACTGGCCAGACCCGAGACGACCAACAGCTGCTGTATGGGATAGGCGTAGATGTACGTACCGTATGACAGATCGTTACGTAGATTTAGCCGCGGTTTGTGGATAAGCGAGCCCGAGACGATGACCGCATAAGCCAGGGGCAACGCGGCCACATCTCGATAGTTCTCAAGCAGACCGGATACCAACACGATGGCAACGCTGAGCGCGACAAGCGACCAGCGCGCCGGCAGAATATCTTTGTACTGATTTAGCAGAGCGCCCGCCGCGAACATCACGGCAAATCTGGCCACCATCTGCGGGATCGACGTCGCCGCAAAGACTGGGTAGGAGAGATAGGCCGTCGCAGCGAGAAAGAACACGAATGCTGCCGGGACAACCCATCGGCGCTTCAGTAGGCCGGCGACGCCCAGCACTGCCACCGCGATGTAACAGATCATCTCGAATACGAGCGTCCAGATCGAGCCGTTCCAAACCCCCGGCCAGGGGACACCGCGCGGCGTGCCGTCGACGCCCGCGTAGTACACGTTGAGCAGGCCGTTGTTGATGACATATTCGATCGGCTTGTGTGAGGCGAACAGATCCGAGACCGAGCCGCCCTGGATGAGGACACTGGTGGGGGCGATAACGAACGCGATGATCGCCAGACACACCCATAACCCGGGAAAAATGCGCAACCCGCGAGCGATCGCAAAGTCCCGAAGCTTCGGTTTCCGTAGCCAACTCGCAGTGATAAGAAATCCCGAGACGGCGAAAAAGCCGTCCACTCCCACTTGTTCGATGAGCTGCTCGAACGGACGGTACGTGATGGTTCGACCGGTGAGTGGCCACGAATGCCAGAAAATGACGCTCACCGCCAGCGCGAGCCGCCATGCGTTCAGCGCGTTGTTTCGCGGGTCGAAGACCTGGCCGAGTGTCAAACCCTTAGCCTGCCCGGTCGCACCCGCCTTGGTCATGCCGAACACCTCGCCTGCGCGAACCGCTCCATGGCATCCGCGGCAGCGACAGCACTTTCACCGGGTTTGGTCATACCGGCCGCGAGCTCCCGGGCCCGATCGGCATAGTCCGACGCCAGGATTCGTCGCAAGTCGGCTACCAGCGACTCGCGGGTGGTCGTCGAAAAGCGGCGAGTAGTGCCTACTTTCAACTTCTTGACCTGACCACCCCACAGCGTTTGATTGGCGTCCATCGACAAGATCAACGTGGGAACTCCAGCCCGCAGACTGGCCGCGGTGGTGCCCGACCCCCCGTGGTGCACGACTGCCCGGCACGCGGGAAAAACAGTCGCATAGTTCACCGCGCCCACGACCTTGACGTGTTCGGGGAGCTGCGCGCCGCCAAAGTCGCTCCACCCGGCACCGATAAGCGCACGCTCGCCGAGCTCGGCACACGCCGAACCGATCATTTCCAATGCGGCGGAAGGCGATTCAACCGGCATGCTGCCGAAACCAAAAAAGATCGGTGGCGTTCCCCCGGCGATCCACGATGCGACCTCCTCATCGGCTTCCTGTGTCATCGCCATGGTCAGCGAGCCCACGAATGGCCGCAGTCCATTCCACTTCGCCCATTCGGTAGCCAGACCCGGGAAACAGACCGAGTCATACCCCTGAATCTCAAGCGAGCGACGCTCGGCGATCCGCTGCGGCGAGGGACTCGAAGCTTTCGGCAAGCGAAGCTCGCGCCGCTGGGCATCCTCCACCTTCTTGTTCAGCCGCCAGGACACCCAATCGAAGGCTCTCATCGCCGCGCGGCC
CAGACGGGAGGGCAGGATTGTGACCAGCTGGCCGTTGGGACGCATCGGAATATGGTGCAGCGTGACCAATGGAATGTCATGGTACTCAGCGACATTTGCGGCCGGCTCCTGGTAACTCTGCCCGGCGAACAGTACATCGGCCCCCTGCGCCGCAGACATCAGCGTGGTGTTCATCTGCGCCCAGCACTCATCACTAAGCTCCCACATCTGACGCCACATCGTTCGAATCTCCCGCACCTTCCAGAAAGTGTGAAAGAAGAACGTCCAGAAATTGCGATACACGTCCAGCCACGTCTTGGTGTCCAACCCGTAGGAAACCGTCTCCAGCCCGGCCGACTCCGTAAAGCTGATCAGGTCGGGCGGCACCGCCATGACGACGTCGTGTCCCCTGCGCTGTAGTTCACGAGCCACGACGACAGAGGGCTCGATATCGCCCCGCGTTCCGTAACTCGCCAGCACAAATTTCATCGCTTCCAACGCCTTTCGGGTATCGCGGTTCAGTGGAGTGAGTCCGACGTCATTCCTTACGCCAAAACACCCCTGTACCGTCAATTTCCATGATCTCAGCAGTTATCCCATGCTTTGCCCGATAATCCGTGACGGCCTGTTTGCATGAGTCAAGCGCGTGGTAATCGTCGATGATGCAAAAACCTCCCGGGGACAAACGCGCGTAGAGCCCGTCAAGCGCCTGGATTGTCGACTCATAGAGGTCCCCGTCGAGCCGCAATACCGCGATTTGTTCAATCGGGGCGTCATGCAGCGTGTCTTTGAACCAGCCCGGCACAAAACGGACCTGCTGATCCAACAACCCATAGCGCTCGAAATTCGCCCGGACATTCTTTTCGGTCACCGCCAGGATGGGCGCCGCGAGATGTAGCCGGTCTCCCTTGTCCGCCTTGTAATTGGCAGTGTCCGGAGGTGGCACGCCCTCGAACGAGTCGCAGAGCCAGACGCTTCGTGTCGTATCCCCGTACGCGGCCAGGACCGCACGCATCAGGATGGACGCACCACCACGCCAGACGCCACATTCGACCAGGTCTCCCGGCACGTCATCGGCAAGTACCGTCTCGACGCACTGCTGCAGGCTGGTAAGTCGCTGCATTCCGATCATCGTCAGTGCATCTGCCGGCCAGTCCAGTCCCAAGTCACGCTTGTGCGCATCGAACGCGCGCTTGCGCACGAGCACGTAGTCGCCAAATTTGAAAAAGGGACGATGCAGCCAGTTCCAACCCACTGGCACAAGTTCATCAACTCCGTATCTGGTGAGGTCCTGCCGAAGCAGGTCCAGATAAGCGGATCGAGTGTCGTAATCGGTCACGGTCAAGAACTGCCCCTATCTCGAAATGCTGGTGTCCCGTCCCAGCGGTAAGAATAGCCGCCCGCACGACTCCGTCGACACGTTTCGAGCTCGGCTACTCCGTTGTGGGGACTGCATTTTCGGTGGAATCTGTCAAAAAATCTTATGAACCGATGTCGCCAGATATTTCCGGCTCACGCGGGCGAAGTTTCCACGGTCCTTCATTAATATTCAGGCTCATCGGAACACCGCACGGGGCAGGCTCTTGGGCTTCTCCGCCCGGTGAATCACGACCACCGAGTCGAACACCTCGATCCGGTCGATGATGGTGGTCGCGAGGGGCACCTGAACTCCCTCAAGCCTCCTGGGATGACCCTCCATGATCTGGTACACAGAGTTCATCTCCAGATAGTGGGCATGCATCGCGTCCATCAGCCACTTCGTGAAGTCGATGAACGATTCACGCTGATCGCGATAGATCGTCCAGTAGTTGGCGCACACATCCTCGACGATGTAGACCCCGCCGGGCCGTAGCCGCGGAAACAGGTATCTGAACGAGCCGATCATCTGCTTTGGGGTGTGCCCGCCGTCATCGAGAATCGTGTCGAACGGGCCCAACTCGTCGACCACCGCCCCGAGGAATCGAACATCGGTCTGGTCACCGATACGCACGTGGACGCGATGCTCGGGATCATCGTGCCGC
GCGGACTCGGGATTGATATCCAGACCCACGATGGTCGCGTCCGGCAAATACTCTCGCCACATTTGCAGCGAGCCACCCCGGTCCACCCCGATCTCCAGCATCCGCTCGGTCCGCGCGAGGGCATGCTGATAAATGGGCAGGTAATGACGCAACTTGTGCACCTTCTCGGCGCCCTCGAAGATCCGGCCTATCGGAGTATCCATGGCCGGAGTGTCGTCAGCGGACATCAAGACTTTCCCTGCTAAGTCCGGCCCGAGGGCATCGAGCATACGACGACGGCCCGCGTAGCGATAGCGCCGTGCGATAAACGGCGGCTTAGGCTGGAGGGTTAGCCGAAAAAGCGGCATACTCACGGAACCCTTTGCGGTTCTTTACGGTTCGAAGAACGCGATTCGTTCCGAGACTTCCGAAGCGCCATACACATCAGCCTACCGACACGTTCCCGGTGCTATCGCCAAGCTAACTGTCCTCATGCGGCGACAGTGCCCGTTCCCGCAGCGCCACGCTATCCGCGACCGCCGGCTCCACACCGAGAGTACCGGCCGAGGTGGCGCGCTTCCGTTTAAGCCGAGACTTCAAAGCCAGCGCGCGTTTCTCGATCAGAAACCAGCTGAATGCCGCGGGTATGAGCGTTGCTACTGCTGAAACACCAAAGAATACAAACGGATTCAAGGCGTACAGACCCGCGACGGCCAGGAGCTGCTGAGTGGGCGAGGCGTAAATGTAGACACCGTAGGACAGATCAGTATGAATATTCATGCGCTTGTTCTTGATCAGCACACCCGAGACCACCACGGCATAGGCCAGTGGCAACGCCCCGAGCACTCGATAGTCGGGCAACATGCCCGAAAACAGCACGATCACGACACTCACCGCCACTAATGACCAGCGGGCCGGTATGCGATCGCGCCACTGATACAACAGCGCACCGGCGGAGAACATTATCGCGCTACGCACCGCCAGCTGTGGGATGGTCCAGAGGCCCGGGAATGTCAGCGGCGGCAGCATGATCGCCGCAATTGTCGCCAGAATCAGTATCACGGGTGCGGTCCAGCGGCGACTGGCCAGGCCGAGCAAACCCAGACCGGCCACCGCGATATAGCACATCACCTCAAAAACGAGTGACCACAAAGACGCGTTCCAGATTCCCGGGTACGGGATATCGTGCGGCGTACCGCCCACGTCCAACTGGAGTTGCACCAAGGCACTGTTCTTCAACACGTACTCGACGGGTGCGCTGGAGAACAGCAGCCTCGCCGCGGAGCCGCCCTGGATCGCCACGCTCAGCGGTGCCACGACAAACGCTGTTATCGCCAGGACGATGTAGTAACCCGGCAGGATACGAAGAGCTCGCGCTACGAAATACTCACGCACATCGGGATTGCGAAGCCAACTCGCAGTGATCAGAAATCCTGACAGGGCAAAGAATCCGTCGACTCCCACCGAGAAGAGCAACTGAAGGATCGCTTTCGACTCCACTACCCGCCCGGTGAGAGGAAACGAGTGAAATAGGATCACCTCAGTCGCCAAAGCGAGCCGACATGCGTTCAGCGCGTTGTTTCGTGAATCGAATACAGAACCGAGCTTCATGCGCCTCCGCCACTCTTCGCTGGGAGCGTCATCGTATCGCCATCGCAGGGACTGATCACGCTTTCCGGCATTTCTGCGTCAGGTTGGAAAAACATTGCCGCAGCCCAGGATTCACAGCAATCATAGGATGCCCGCGGTTATCGATCGATGCCGAGGTACCGGCTTGCTCAAAAACGATTCGGGCAGTCTTTGTCGATCCATTCGAGCAGCGCATGTAGGCCGTCTTCCAGTGTCCACTTCGGACTCCACCCGAGGCCCTCGACTGCCGGCGCGATATCGCAGCTCGCCGCGCGCACGTCCCCATCGCGAAACTTCCCGACAACGACGGGTTCCGGCGCCGCGCAGATATCGGCCACCTTGCGCGCCAACTCGTGGATCGTGGTCGCGGTGCCCGAACCGATATCCAGGGT
GCGGCGCTCGGTGGCAGGTGTGTGCACCGCGGCGAACAGCGCATCGACAACGTCATCGATGAATACGAAATCGCGGACTATCCGCCCGTCTTCGTAAACCTCCAAGGCTTGTTTCTCACGTGCCAACCGCGCGAACAGGGTGACAATTCCGGTGTATGAATTGGTGAGCGATTGCCCGGGACCATACACATTCTGCAGCCGCAAGACACTCAGATTGGTGTCGTGCGCTGCGGCCCAGGACGCCAAGAGGTGTTCCTGCGCAAGTTTTGTACCCGCGTAGATGTTGGTCGGTCTGGGTTCCGTGCGATCAGCACAACTGGAAAGCGGCGTCGCGGATGCGCCCGCCGGGCCTTGTGGATCCCATTCACCGGCAAGCAGCTGCGCGTGGCTGCGCGGCCGCGGGTAAAAGACCTCCGCGCCGGACTGCCACGCGCCTTCGCCATAAACGGCTCGTGACGATGCGAGGACCAACTGGTCCGGTACGAGTGCCGAACGGCTGAGGGCATCCAGAAGTTGAGTTGTACCTACTACATTGACGGAGCCGTGGCGGGTCGCCTCCGAAAGCGACTGTGCTGTGCCCGTCTCCGCGGCCAGGTGGACGACCTGGGACGGCTCAAACAAGCGCAACACAGCATCCCAGTCCGGTGCGTGGGTGACATCGCCGGTGAACAAGCGCACCGACGCCGGCAGGTCGATCGCCCGCCCCCTGGCGTGGACCTGTGGGTGTAGGACGTCCATTACCGCCACGTCGTACCCCGCCTGGATGAGGCGGTTGGACAGCGCGGATCCAATGAATCCCGCGCCACCTGAGATGAGCACAGATTTCGACGCACTGGTCTTTGACAAGCCGAGCCTCCCGTTCTTTCAGAGCTGTTAGGTGACCGGCACGCACCGGTAATGCGTTCCGGGTGAATGGTATCGATGGTGGTTCCGCTACGCGGGGTTGGTCTCTCACACTCTGCCGCGGGTAGCCATCCAGGCCTATGGGTGGGAAGCCTTTGCCTCCAGCAGGTCGGCTGCCTTGCTGACGCTATCGTGCGGCTTCGACATATGAGGGGCGATCGCGCGCGCCCGGGCCGCACATTCCGGGGTGAGAATCGTGCGTAGGTCCGAGACCAGAGAGTCGCGGGTTGTGGTCGAAAACCGCCGGGAGGCACCCACTCCCATACGCCTTAGCTGGTTCCCCCAGAACGGCTGATCACCAACGGTCCACAGAATCAGCGTGGGCACTCCTGCACGCAGACTGGCCGCCGTGGTGCCCGATCCGCCGTGATGAACGATCGCGCGACTCACCGGAAAGACCGCCGCGTAGTTCACCACTCCGACCAGTTTGACGTGTGGCGAGGTTTGGACGCCGCTGAAGTCGGTCGCCCCGGCACATACCAACGCGCGCTCTCCCAATTCTGCGCACGCTGAACCGATCATCTCGACTGTCTCGGCCGGAGATTCAACGGGGATGCTCCCCGAGGCAAAGCAGATCGGCGGGGGCCCCGAGGCTATCCACGACATCACCTCGTCATCAGCCGAGGTCGTCAGCTCCATGGTGAGAGCACCCACGAAAGGCCGTCGGCCTCCGTATTTCGCCCACTCTTCGGCCAGCCCCCGGAAGCACACGGCGTCGTAGGCCTGAATTTCCAGCGAATTGCGCTGACCTATACGCTGCGGCGACGGGCTGGTCGCCTGTGGCAATCCGAGATGTCGTCGTTGGGCATCTTCGGCCCCCTTGGTGACCCGCCAGGTCAGCCAGTCGTAGGCGGTCATCCCCGTCCGGGTCAGCATCGGCGGTAGCACCGGGAAAAGCTGACCATTGGGACGCCACGGCATCGTGTGTAGTGCGACCAACGGGATCCCATAGAACTCAGCAACATTGGCTGCCGGCTCCTCATAGCCCACACTCGTGGACAGCAGGTCGGCTCCCTCTGCCACCGACACCAGGGTTTCACTCATTTCCGACCATTGTTCGGTCACCAGCTTCAGCGCCTGCCGACACAGTGCCACCA
AGTCCTGCAGCCGCCAGAAGTGGCGGGTCCACGACGTCCACAAGTCGCGGTACTCATCGAGCTGCGGGCCCACGTGGATTCCGTAGGGCACCGCGGACAGCCCGACCGAGCCGGCAAAACCGATCAAGTCGGGAGGGACGGCCATGCGCACTTCATGACCCCGGCGCTGCAACTCTCGCCCAACAGCAACGGCAGGTTCGATATCGCCGCGAGTGCCATAACTTGCCATGGCAAATTTCATGACGGTATAGACCTCCCCGATATCTGAATGTGCCACAGAATGGACGTCAATATTCTCGGCCATTATTACCGATTTCGCCCGCCTCGGATAGCCTCAATGCAATAGGCATACACCCGTCTCCAATTCCACCGCGGGCCAGTAGCTAAGGCTCCGAAAAATGTATCGACCTGCCTTGCCGCATATAAACGACGAACTCGAATATGACGGCAATGAGAATGCGGCCGAACACACGTTCCAGGACCGCCAGTACGCTCCGTCGACGCGCTCGCCCCCTGCAGCCGGGCCCAGCGAGCGCCGATCACCAATACCTGCGCAAGCATCTCGGGGGCTCGGTGCCGGGCACCGATTCCGTCGGCATTCCAGCTTCGTACGCGATAGGGTGACTAGGGAGTTGCGCGGGGCATGTCAGAAAAAGTCGCCACCGCAACCGGACTCGCATTTGTCATCGAGTAGGAGCACACGTTGCGCGGCATCATTCTCGCAGGCGGCTCAGGCACTCGCCTACACCCGATAACGACAGGCGTCAGTAAGCAGTTACTGCCGGTCTACGACAAACCGCTGGTCTACTACCCCCTGTCCACCTTGATCATGGCGGGCGTGCGCGACATCCTGGTAATCACCACACCCGCCGATGCCCCGGCTTTTGAGCGCCTACTCGGCGACGGTTCAGCGTTCGGGATCAATCTGAGTTATGCCGTGCAACCCCAGCCAGAGGGGCTGGCACAGGCTTTCGTCATCGGGGCGCAACACATTGGCACCGACACCGCGATGCTCGCGTTGGGGGACAACGTGTTCTACGGGCCCGGCCTGGGCACGAGCCTGCGCAGGTTCGAGAACATCGACGGTGGGGCAATATTCGCCTACTGGGTAGCCAATCCGTCGGCATACGGCGTGGTCGAATTCGATGCCGCAGGAGTTCCCCTGTCCCTGGAGGAGAAGCCCGCGACACCGAAATCTCATTACGCCGTACCGGGGCTGTACTTCTACGACAACGATGTCATCGAGATCGCCCGGTCGCTTCAGAAGTCGGCACGCGGTGAATACGAAATAACGGAGATCAACCAGATCTACCTGGATCAAGGTCGGCTCTCGGTCGAAGTCCTGCCGCGCGGGACCGCTTGGCTGGACACCGGAACGTTCGATTCGCTACTGGATGCGAGCGATTACGTCCGCACCATCGAACGCCGTCAAGGTCTGAAAATTGGCGTTCCGGAAGAAATTGCGTGGCGTGCCGGTTTCATCAATGACGATCAACTTGCCGCCCGTGCCCAAAAGCTACTCAAATCTGGGTATGGAAGTTACCTACTTCAGCTCTTGCAGCGGAAATAG
Protein sequences of DBSCAN-SWA_6 >NZ_CP013049|4170664:4180156|4171785_4173054_-|WP_005085927.1|DBSCAN-SWA MKFVLASYGTRGDIEPSVVVARELQRRGHDVVMAVPPDLISFTESAGLETVSYGLDTKTWLDVYRNFWTFFFHTFWKVREIRTMWRQMWELSDECWAQMNTTLMSAAQGADVLFAGQSYQEPAANVAEYHDIPLVTLHHIPMRPNGQLVTILPSRLGRAAMRAFDWVSWRLNKKVEDAQRRELRLPKASSPSPQRIAERRSLEIQGYDSVCFPGLATEWAKWNGLRPFVGSLTMAMTQEADEEVASWIAGGTPPIFFGFGSMPVESPSAALEMIGSACAELGERALIGAGWSDFGGAQLPEHVKVVGAVNYATVFPACRAVVHHGGSGTTAASLRAGVPTLILSMDANQTLWGGQVKKLKVGTTRRFSTTTRESLVADLRRILASDYADRARELAAGMTKPGESAVAAADAMERFAQARCSA >NZ_CP013049|4170664:4180156|4174120_4174855_-|WP_017206146.1|DBSCAN-SWA MLDALGPDLAGKVLMSADDTPAMDTPIGRIFEGAEKVHKLRHYLPIYQHALARTERMLEIGVDRGGSLQMWREYLPDATIVGLDINPESARHDDPEHRVHVRIGDQTDVRFLGAVVDELGPFDTILDDGGHTPKQMIGSFRYLFPRLRPGGVYIVEDVCANYWTIYRDQRESFIDFTKWLMDAMHAHYLEMNSVYQIMEGHPRRLEGVQVPLATTIIDRIEVFDSVVVIHRAEKPKSLPRAVFR >NZ_CP013049|4170664:4180156|4170664_4171789_-|WP_005112210.1|DBSCAN-SWA MTKAGATGQAKGLTLGQVFDPRNNALNAWRLALAVSVIFWHSWPLTGRTITYRPFEQLIEQVGVDGFFAVSGFLITASWLRKPKLRDFAIARGLRIFPGLWVCLAIIAFVIAPTSVLIQGGSVSDLFASHKPIEYVINNGLLNVYYAGVDGTPRGVPWPGVWNGSIWTLVFEMICYIAVAVLGVAGLLKRRWVVPAAFVFFLAATAYLSYPVFAATSIPQMVARFAVMFAAGALLNQYKDILPARWSLVALSVAIVLVSGLLENYRDVAALPLAYAVIVSGSLIHKPRLNLRNDLSYGTYIYAYPIQQLLVVSGLASLNLLLFFVIATVATVPLAAFSWFVVEKRAIALKSRLTRKQAVTVAERPPIPLDRDAA >NZ_CP013049|4170664:4180156|4179289_4180156_+|WP_005062166.1|DBSCAN-SWA MRGIILAGGSGTRLHPITTGVSKQLLPVYDKPLVYYPLSTLIMAGVRDILVITTPADAPAFERLLGDGSAFGINLSYAVQPQPEGLAQAFVIGAQHIGTDTAMLALGDNVFYGPGLGTSLRRFENIDGGAIFAYWVANPSAYGVVEFDAAGVPLSLEEKPATPKSHYAVPGLYFYDNDVIEIARSLQKSARGEYEITEINQIYLDQGRLSVEVLPRGTAWLDTGTFDSLLDASDYVRTIERRQGLKIGVPEEIAWRAGFINDDQLAARAQKLLKSGYGSYLLQLLQRK >NZ_CP013049|4170664:4180156|4177567_4178827_-|WP_005094865.1|DBSCAN-SWA 
MKFAMASYGTRGDIEPAVAVGRELQRRGHEVRMAVPPDLIGFAGSVGLSAVPYGIHVGPQLDEYRDLWTSWTRHFWRLQDLVALCRQALKLVTEQWSEMSETLVSVAEGADLLSTSVGYEEPAANVAEFYGIPLVALHTMPWRPNGQLFPVLPPMLTRTGMTAYDWLTWRVTKGAEDAQRRHLGLPQATSPSPQRIGQRNSLEIQAYDAVCFRGLAEEWAKYGGRRPFVGALTMELTTSADDEVMSWIASGPPPICFASGSIPVESPAETVEMIGSACAELGERALVCAGATDFSGVQTSPHVKLVGVVNYAAVFPVSRAIVHHGGSGTTAASLRAGVPTLILWTVGDQPFWGNQLRRMGVGASRRFSTTTRDSLVSDLRTILTPECAARARAIAPHMSKPHDSVSKAADLLEAKASHP >NZ_CP013049|4170664:4180156|4173103_4173904_-|WP_005062157.1|DBSCAN-SWA MTDYDTRSAYLDLLRQDLTRYGVDELVPVGWNWLHRPFFKFGDYVLVRKRAFDAHKRDLGLDWPADALTMIGMQRLTSLQQCVETVLADDVPGDLVECGVWRGGASILMRAVLAAYGDTTRSVWLCDSFEGVPPPDTANYKADKGDRLHLAAPILAVTEKNVRANFERYGLLDQQVRFVPGWFKDTLHDAPIEQIAVLRLDGDLYESTIQALDGLYARLSPGGFCIIDDYHALDSCKQAVTDYRAKHGITAEIMEIDGTGVFWRKE >NZ_CP013049|4170664:4180156|4176343_4177405_-|WP_005085923.1|DBSCAN-SWA MLISGGAGFIGSALSNRLIQAGYDVAVMDVLHPQVHARGRAIDLPASVRLFTGDVTHAPDWDAVLRLFEPSQVVHLAAETGTAQSLSEATRHGSVNVVGTTQLLDALSRSALVPDQLVLASSRAVYGEGAWQSGAEVFYPRPRSHAQLLAGEWDPQGPAGASATPLSSCADRTEPRPTNIYAGTKLAQEHLLASWAAAHDTNLSVLRLQNVYGPGQSLTNSYTGIVTLFARLAREKQALEVYEDGRIVRDFVFIDDVVDALFAAVHTPATERRTLDIGSGTATTIHELARKVADICAAPEPVVVGKFRDGDVRAASCDIAPAVEGLGWSPKWTLEDGLHALLEWIDKDCPNRF >NZ_CP013049|4170664:4180156|4175045_4176176_-|WP_005112211.1|DBSCAN-SWA MKLGSVFDSRNNALNACRLALATEVILFHSFPLTGRVVESKAILQLLFSVGVDGFFALSGFLITASWLRNPDVREYFVARALRILPGYYIVLAITAFVVAPLSVAIQGGSAARLLFSSAPVEYVLKNSALVQLQLDVGGTPHDIPYPGIWNASLWSLVFEVMCYIAVAGLGLLGLASRRWTAPVILILATIAAIMLPPLTFPGLWTIPQLAVRSAIMFSAGALLYQWRDRIPARWSLVAVSVVIVLFSGMLPDYRVLGALPLAYAVVVSGVLIKNKRMNIHTDLSYGVYIYASPTQQLLAVAGLYALNPFVFFGVSAVATLIPAAFSWFLIEKRALALKSRLKRKRATSAGTLGVEPAVADSVALRERALSPHEDS |
8 | Burkholderia_phage(25.0%) | NA | NA |
Homologous phage analysis in the prophage region. A colored bacterial protein is one whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_7 |
5139108 : 5154019
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP013049|5139108:5154019|DBSCAN-SWA GTCACATCCCCGGCCCGGAGTTGCGGGCTTGCTGATGGAATGCGATATCGCGGCCGGTGCCGTCCTCGGTGGCGCGGTTGTTGGTGACGTGGATGTTTGTGTCGCCCGCCTTGACTGGGCCGCCTTGGGCGTTCGGGTCGCCCTGATTGGGGTTTGGTGGCGCGGTCGCCTTGCCGGCCACGTTCGGGATCGCCGGGGCAGCACCAGCGACACCACCGAGGATCTTGGTCAGCCAGCTCTTGTTGGCCAGCTCCGAGCCCGCGGTCGGCAGCACCGTATCCATCAAGCCCTGCACCCCGATACCTGCGGCCTGTGCACCGAACTGAATCGCCCTGTTGGCCAGCTTGATTCCGGTCTGCGCTGCCTGCCCGGCACCTGGGGCGAAGATGTCGGCCGCCGAGGCGGCCATCCCGATCGCGGTATCGATGGTGCCGCCGGGAGTGATACCGACCCCGCCTGCACCCGAACCGGTCGCCGGTTCCACACCACCAATGCGCGTCGATGACGGGCTCCACGCCTGCGCAGGCCCGGTAGCCCCACCCCACCCGCCGCCAGCGGCCGGAATACCCGCTGTCAGGGCAGGATTGGTCAACGTCGGATCGCTCATCACCGGATCGGTGACCGCTAAGCCAGGACCGGCCGTCTTGGGGTAGAGCGCCCGATAATCGACCGTGGGCCCGATCGGCTGCGGCGACGGTGCGCTCGACGTGCCCGAACCAAGGGGCATGTAGTACTGCTTGGGGAACTGCTTATCGAGGGCACCGGCCGCCGAGCCTCCCAGCATCGGGCCGTGTCCTCCACCAGATTCGAAATTCATGCCGTTGGGCAGCGTCGCGGCCATGTGGCCCTGCTGCCCCGGCAGGGGATTCACACCGACATTGAAGGCCCCCGGCTGATATCCGGGCAGGAAACCGAGCTTGGCAGCGCTGGCATCGGTGGCGAACGCAGTGGTATCGAACAGCCGTGCCGGTGAGGACTTCCCGTCGCGCAGCACCTCCACCAAATCCGAGACGGCACCCGAGCAGTCGGCCAGCCCGTTCTGCAGATCAGATGCCGGAGCGTACTTTCCGCCACGCGCGGCCAATGCATACATCGCGGCGAGGTTGGGATTTACACCCTGTTGCAGCGCCATCGGCCCGATGCCCGCCATGGCAACGTCCTGGGCAACACCTGTGTACTGCGGCCCAAACACGCCCTGGGCGGCCAGGATGCCCATAGCGCCGTATCCGCCCTTGGACGGGTTGAGTTGGCTGACCGCGCCGAGCTGGCCAAGGATCGGGGCCGCCGCCATATTGGCCAGGAACTTGGTCAGATTCTCGGCCAGCCCCGGCAGGCCCTTGGAGATCCCGAAATCCTTGTCTAGTGCGGCACCGATCTGGCCCATGCCGTCGGCGAGGCCCTGCGTAGAGCTCTCCAGCTTCTTCCACGTACCTTGCTGCGCCTCAGCCAGTTTCATCTGCGCCGAAACGTACGAGCGTTCGGCGTCGGCAACCTGGTTGCGCGCTCGCAGTAGTGCGTCCTGATCGGCGTTACCCTGCTGCTCCAGCCGGATCAACGCAATGCGGTCTTGCTCCAGAGAGTTCTTGGCCCGGATCGCCGACGACTCAGCGTCATACACCCGCATGGGGTCGACCTCGTAGCGACCGAGACCGGGCCCGCCCTTGGGAGACGAAACGAGCACCCCGGGCGCTGCGGTGGGCGCCGTGGCCAATCCTGGCGGCATGGCAACGGGCTTTGACTCCACCGACCAAAGACTCGGATCGATCGGGGCCTTGGTCTTGTCGTCGTCCCCGGCCGGCGCGATCGGCTTCCTGTCGCCTGCCTGCGGACCGTTATCGACAGCATTGCCGCGCTGGGCATCCGGCGGGGGCAGGGCGGTCCCGGGGGCGAGCGCGCTGCCGAGCAGTGTCCGTGCTGAGTTGT
CGCCGGGCGCCGGAGGCAGGACGGTTGAGCCCGCGCCCGGCGCACCGGGAAGGGTGTTGGCCAGGATGTCGGTACCGGGATGCGTACCGCCGAGCGGTGCAGCGTATTGCGGCGGTGGCGGCGAGGAACTGAACAGATCCTTGATCATCGTCGGGATGTCCCTGATGACAGGCAGATCCACAAACCAATCCGAGATACTGGTCTTCAGGTCGGTGAACCACTGATCGACCGTCTTGGTTGCGCTTTCCCATTCGGACTTGAACGTCTCCGTCGCGGTCTTAGTCGATCGCTGCGAGGTGTCTTGCAGATCCTTGAACTGGTTTTTAGCCGGGTCGAGGTCGAGTTTGTTGACAGCATCGCCCATGTCCTCCCACTGCGTGCCGAAAAGGCGTTGCCACACAAGGGCTTGCTGAACCGGGTCATCTAGATTGCGTAGCCCGGTGAGCACCGCTGCAAATGCTTGGTGTGCTTGCTCGCCGCCTGCGGAGAAGCGCCGTCCCATCTCGTCGGCGTTGAACCCCAGCGCCTCGAAACCTTCCTTGGTCGACTTGCTGCCGTCGACCGCGCGGATGCTGAATTCCTTGAGGGAGTCGGCCACCTTGTCGGTGTCGCGGGCACCGCCCTCGATGCCTTGCTTGAGCAGCGTCATTGTCTCGCTGCCGGTCAGGCCGAGCTTGCGGAATTGCGTGGAGTACTCGCCGATAGAGTCGAGCCAGTCGCCGGTTACGTCCAGGCCCTTCTGTGAGCCCGCGGTGATGATGTCGAGCGCTTCGGTGACGCTATTGGCAAGGCCGGTCCGCATGAGTTGGGTCGCGGAGTGCGCGAGCTCTTGCGGGGTCTTCTCGACGACCTGCGCCACACCTTGGAGCTGCTGAATCGTGTACTGAATTTCGTCATCGGGCGAGTTGGGCTTGATCAGGTTGTTGCGCAGGGCCGCTTGAGCGACGCTGAGGTTGTCCGCTACAGAGGCGCCGAAGTTGTTGGCGTAGGACTGACCGGCAGCCTTGGCGTAATTGCCCATCGAGGTGTCATCCAGACCCATGCGGCCCTGGAACAACTTGGTGGTGGCCGTGGTGGCCATACCTTCGGCAATGGCGTTGGAGAGCCGACTTCCGACGAGGATGCCTACGGCGGTCAAACCCAACAGGGCCGCGCCGATTGGCCCGCCAGCGGTGCCGAGTCGGGCGATCGAGGCCGCGCTGCTCACCCCGTGGGTGAATCCGCCTGAGAACCCATTGCCCATGTCGCGGCCGAGCTGGGCGGCCTGGCCAGCCTGGGCGCGCATGCCGTCAACAAGGTTGGTGTTGTTGCGTCGGCTCGCCTCGTCGGCAGCTTCTTGATACTCGCGGTATGCCCGCGTTGCGTCCCGGACAGCACGAGCCTCGGCGCGCCGCGCGTCGTTGACTTTCTCGGTCTGGCGGATGATCCGTGCGCCGTCGGCGTCGCGGTCGCGTAGCCGCTGTAGTTCGGATTCCTCAGACTTGAGTTTCCCGACGGCCGATGCTGCCTTGTCGTAGGCATCAGAAGCCCTGTCGCCCATGCGCTTAAGGGACTTCTCGACATCCTTGGAGCTACCCGCCAGCGCGTTGGCGAAATCGCGGCCGGCATCCTTACCCGCGTTGCCGAACGTGCGGGTGGCGTCATCGGCGACCCGCTTCCACGACCGATGATCAGCGGCGGCACCGATGGGTATCTGCACGGACATGGTTCACCTCCTGATCATTGGTCGCCAAAAACGTCATCTAGCAACTCTTCTCGCGCCGACTCGATGAATTCGTTTTCAGCGGAGTCAAGTTCGTGCTGTCTGCGAGAATCCAGCGGCGATGAGTACTTGGTGTACATGTATTCGTGCGGGGTGCCCGCGTACTGGCTGGCCCGGTATGCCGCGAGCTCGTTGTGTGTCTCAGCGGCAATCTTCTGCATGACCGTCCAGTCGCCGTCGCGCCCAAACGGCGGCGGCGCATGGGTTTTGAACTCTGAGTCTTCGGGCAGCTGGTGGA
TCAGCGACAGTAGTTGGCGGCTGGAAAGCACCAGGGCGCCGCGCTCATCGCGGGTGCCCTGGTGCCAATCAGCGATGCGCACACCGCGAAAACGAAGATCAGCCTCGATCGCATTGGGCCAACGGCACCACAGCGCTACTGCCTCAATTACTTTTGGAGTCGATCTTTGTCCGCTCCTCCAGCTGGCGCTGCATCACCTTCCAGTGCGTATCGATCTGGCCGGGAACACCGCCCGCGGCGAGGAACTTGGCGTAGATGTCCTCACCCATGAGTGCGATGCACAGCTGCTCGTCAGGGTCGTAATCCTTGCCGTCCTTGAGATACGGATAGACGTTCTGCTCGATGGTCTTGCCGTCGATGAAAGGATGATCGACGGTTTCCTTGTCGAGGGCTTTCATGTCCCGCTGGTAGTCGCGGTACCGCTTGCGCTGCTCGGTATCGAGAAACGCCGGGTTGGGAAGCTCCCACATCTCGCCGTCGCCGAGATCAAAGGGCACACCTGCCATGAATCCGAGGTGATCGGCGGCCTGCTCGCGTGCCTTTCTGGGGTCGACGGGGTGTAGAACGTCCTTGGTGTCTTCGGAGCTCATGATTGTTCCTTTCGGGCTGGTGGGCTTGGGGTTTCGGGCTGGAATGGGGGTGGGGCTCACCTGGCGGGCGCAGCCCGACGCCCGCCAGGTGAGGGTTCATCAGGCGATGGTCGCGGCGGCAGACTTCGGGGTGTAGACCGAAGCGCCGTTGGTGCCGGTCACCTTCACGCGGAACTTGGTTGCACCGGCTGCCACCGTCTTGACCTTGACCGTGGTGTTGCCACCGGACGAGACCGCGGGCCCATCGAGCTCGGCGGGCAGCCAGGTGGTCCCGTCATCGACGGTGCTTTCGGCGGCGAAGGTGAACGGATCGCCAGCGCCCGTGGGGTCGGCGAATACGATCGAGGCCTTACCGGCGGCACCGGGGGTGACCGTCGGCGGGGTGTTCGACACCTTGGGGGCGCCCTGAATCGTGGTCCAGCCCTTGCCGCCGACCCATTCGCCGTCCAGGCCGGGAATCAGGATGCCCGGGTTGCGCGGATCGGGGATCAGGAAGAACGGGTCAGGTTCGAGCGAGAACTCCAGCTCGTTGGCGTCGGCGTCTTCCGTGTCCATCTTGGCCGCGCCGATCTTGGTCAGCTTGCACAGGGGGATGGGTTCGACGGTGTACAGCTTGCCGCCGGCCCGGGACCGTGCGCGCACCAAAAGCAGCTGGCGGGGAACGAAATCGGCTTCCAGCGGGGTGCCCACGAAATAGTCGCCTTGGCCCGGTTCTGCCACGAGCAGGTTGCCGTCCTCATCCTGCAGCGGAACGTTATTGCGCAGGGCCTTGACGACGGGGTTCAAGGTCTCGATCGGGGTGAACTTCACCGTCTTTTCGATCTTGGTGATGTCCTTCTCGATCGGGTAATTCGACTGCAAGATCTCCAGCGGGCTGACATCAATGTTCGGCTCACGCTCGGGGCCGCCAGTCTTGGTGTTGGCGCCGATGAACAGCCACCCCTGGTTGGGCTCGGGGTTGTTGACCCAGTACCCGCCGACCTTGCGGCGGGCGAACAGGTCCGCGCGCAGCTTGCCATCCTTGGCCAGCGGGTTGAAGACATGCGGGCTGATATCAGTGGCCGCGCCGCGATAGTCGCGCGCTAATACGGCAACGAGCGGGCCTCGGATAGCGAAACGGCTATCGGTGTCGGTGAATCCGCCGACGCTCCAGTCAGCGCCGGTTTCGGGTTGCGTCATGTGACGCTCCTTCCATGGGTGATGAACCGGAAAGGGCTCCGGCGATTGAGGTGCGGCGGACGCCGCGACGCGATCAGGGGACCGCGACGATCAGTTGAACGACAGGCCGAGCTCGCAAATCGCCTTGAGGCGAAAGGCGTTGTCGGCCTTGTATTCGCGCAGCGTGGAGAGCTGTTGAAAGTCGATGTAGTCGACGTTGGCGACCGTGCCATCGGGCATGGGCACATCCAC
GATGTCTTTGCCGAGCAGCATGATCCGCCGATCGGTCTTGATGCCCTCACGCTGCGCCTCGGTGATCGTCTTGCCGAAGGTGTGGATCGACAGAACAGCGGTGCAGTAGAACAGATTCGCGTCGTAGGTGCCGTCAATCATGTTGACTTGGCGGAACGGCAGCGGATCGTCGGGCTTGCGTTCGATGTCGCAGGGGCCCAGCGGTGCCAGGTGGGCGAGCATCATCACGATCGCGTTGGGGGGCATCTGCTCATGCAGCGCTACGGTCATCAGTCGGGCCTGTTGATGACATCGGCGGCGGTGCCGCCGAACGCGATGGCAGTCTTGGCCGCGACGGCGAACTCTGGTGTGGGGCTGGTGCCCCCGGTGCCGTCCTCGATCCAGTGGGCTTTGAAGTTGTCGTTGACGACCTTGGTGTCATCGTCACCGCCCTTGCCCTGCTGCACTTTCCACGCCGCGCCGTAGTCGCCGTGATCGACCGGCGAGATGGACTTGGCGTATGCGGCCATCTCCTTGCCGACGCGCGCCTTCTCGGCCTTGGCTTGCGCTGAGGTGTGGATCGCCTTGTCGATCTCGGACTGCGGCACACCCAACGCGACCAGCGGGTTGGGTCTGCGATCTGCGGCCATCAGCCGACCCTGCGCTGACAGATGCAGAACACATGGTCTTCGCGGCCGTCGAGGTCGAATTCGAGTACCGCGTCACCGACCATGCTGTGATCGCGGCCCAGGTGGCGAATCCGGTGCGCCGATCGGATGTCGGCGACCGGGACCGGCGCGGCGGCACCGGTGCCGTCGACGGCGGGGATATGGCCATCGATGACCGGTAGGAACGCCCACGATTGCTCAGTGGTTGTGGTGGTGATGGCCTGGTTGTCCTCGGCCGTCGACTGCACTTCGAACAGGCAGTTATCGACCCACACAACGCGTTCGGTGACTTGCGGCTTGCGGTACTCGTCCAGGATCGGGTCGCCCTGCCCGTCGAGCACCGGGACATCCCACACGATCGCGAGCCGCTGCCCGCCCAGGGTGTCCATCAGTAGTCACCCCTGGGGAAGTGGCCGCGCGCCTTGGCCTGTAGCGCCAGGCCGAGCATGCGGTAGTGGCGGCGTGCGATGAACTTCTCGACGGCTTCGCGATCGATCGCAGCCTGTTTGGTGCGATGACCCACCGTCTTGGTGAACGATGAGACCGGGCCAAACTCGCCATACATCAGCGCGTCCCGGGTGACCTCGAATGTGACCACCTTGGCCGCCGGGTCATCGTCGGCAATGGCCGGTTTCTTGTCGCGTATCCAATCGGAGACGACCGTCAGTAGAGGCGCCGCCACCAGTTTCTCAGCTGCCGACAGCGGCCGGAGCATGGCGGCGAACGCCTCTACGTCAAGGAAGTCGGTCACGAAACTAGTCCGTAGCCTCGATCAGCGCCCACAGGTCGTCCTTCTCCTGTGCCTCCAGCTCGTCACGGTCATACGTGCCGTTGGCCATCAGCCAGTCGACCAGGACGGCCTTGGTCGCGGCCTTGAGCGGCTTCTTACGGGGCGCATCACCCTCGGTGCCGGTGGCCGGGCTCGCGTTACCGGAATCGCCATCCCCACCGTCGCCGCTGTCGCCCTCATCGGTGACTTCCGCCTCGGCCGAGTCGCTTTCGGGATCGGTCGTTTCGGCCGGCAGCTGGGCGCCGAGCGCACCGACGGCGAGGCCGCGCTCGACCTCATCATCGGTGAGCGTGACGAGCTCGCCGAAAAACGCGCGCCGCCGAGTGCCCGCGGGCGTGAGGTATTCCCATGTCGCCGCAGTCACCCGACGTTCTGTGACCTCGGGCATTACGGGGCGCCCTTCAATCCGGTCACCTTCTTGACCGCGTACGGGTCGGTGACGCCCATGATGGGCAGCACCGAAGACTGGACCCAGTTCTGCTTGGTCTTGGGCTCGCGCCAGGTCTCGGTCGAGAGCATCTGCTCGTAGTCCAGGAACCCGACACCGCCGCGCACACCCGCGA
AGGCGCTGCCGTTGGCGACGCGGTTGGACCGGAACATCGAGATATCGGCGTCGGCCAGGATCTGCGGTAAGTCCGGTCCGTAGGCGATGCGCAGGTCCGCGTACTGCACGGGGTTGACGACCCACACGTTGTAGACGTAGCCCAATTCCTCGACATCGGCGGCCAGCTGTGCGGCGATGATGTCGGCGAACGGGCGCGCATTGTTGGGCGTGGGGTTGTTGCCGGTCAGGGTGACGTTGCCCCAGTCGTGTCCGGGGATGACACCCGCGCCGCCGAGACTGGCGATAACGGCCTCCAGCACGGCCACGGTGCGCTGATTGATCTTGCGCACCAGCGTGTTCGCCAGCTGTGTGGTCAGGCGGTCCATCTGGGCGCGGTCGTTGCGCCGGATCGCCTCATCGGACATCCAGAACTTGCCACCCCAGTCCTCGGACTTGGCGACCTCGGGCTGCGTGCGCTCACCCTGCACGATCGTGTACTCATCGGACGGGCCGCGCTGTTCCACATCGTTCTTGGTGTACAGCTCGTTGATGCGGATCACGTCGTAGATGATCGCCCCGGCGGTGGTGCTCGCCCCCGAGGACGAAAACAGTTCCGGGGCAATGAACTTCTGCAGCGTCAGGTCCGAGAGCCGCTTGGTGATCCGGCCGGGCTGCTTATATGCCAGGTCGACCGAGATCTTGTTGTCATTGATGACCGGCGCACCCAGCGGGTACGCGACGGGAGATGTTGTCATGGTGGGTAGCCCTTTCCTAGTAGAGGCTGATCTCGGCGTCGGCGCCATCGGTGGCCGCGGACAGTGCGTAGCCAACGGCGACGCCGCTGGCGAACTTCTTGGCCTTGCCGGCCGTGCCGACCTCGACCTCATCGAATGCGGCGAGCGCGCCGTCGGCGGTCACGTAGGTGACACGCGAATTGCCCCGCGCCACACCAACAATGTCGCCGCTGGCCGCGTCGTACTTGGAGACGCCGCATACCCGGCCCGCCGCATCAGCGGGGGCCACGGCGATGTTGCCGGTGGCGGTGCGGTTGCCGCTGATCTTGAGGAACCGCTTACCGGTGATGGCAGCTGTGGCGCGGCCGGTGATGTCGCGGCCGGGCTCATAGACGCCCACGTTCTCGTTGGTCATGATCTATTCCTTCCCTTCCGAACTCGGCGCGGTGGGCGCGGAGTCAAACCAGCTCAGGTCATTGGGCACCGGACCGTCTGCGGGCTGCGTGGAGTGCCCCGTCTCGGCGAGAGGGACCACCCCGGGTGCCAGCGCGGCCAGCACGGCGGTGTGGCCCTCGCGGTCGGCGGCGAGCGCCTGCAAGTGGTGCTCGCGACGCGCCGGGGCGACCTTGCCGTCAGCGATGGCCTGATCGACCACACGCTCGTCACCCTCGCGCAACTGCTGTGCGCGCGCCTCGGCGCCCGCCTGCGCGGCCGCGACGGTGGCCTCGTACTGGGCCCGCTCGACCACCGTCAGGCCCGCCTTCGCGACCAGCGCGGTCGCCTGCTCCAAAGTCGGTGCAGCGGGCGGGGTTTCGTCACTTTCCTGGCCGTCGTCAGCACGCTCTTCGAGCGCTTCGGCGGCAGCAGACAAAATGGTCTCGTCGTCGGCGTCGGCATCGATACCGAGCAGCTTGGCGAGGCCCTCATTCAGGGTTGCCACAATGGGCTCCTTTCCTCTGTTGACCTCGCCCTTCTCGGGCCGAGGGGTCTTGTTGTGCACCAGCGGAATTCGTGGCGCAGGCGCGGACTGGCGTCCGGCATAGCGGAACGCCGACAGATCGAACACCGATGCACGCGCGGCAGCGGACTTGGAGTCAGGCTCGGGCAACTCGAGGACGCGATCGGCCAAACCGGCCTCGACCGCTTCGTCGGCGAGCAGCCAGGTTTCCTCAGCCATCACGTCGAGCCAGTCCTCGACGGTGCCCCCTGCCCGGTCGGCGTAGATCTGCGCAATGTTGCTGTTGTGCTGGGCCAGTCGCGCCGCGCTCTTCTCCATGGCGCGGGC
ATCTCCGACGCACACCGCCCAGGCGTTGTGCACCATCATCTGGCTGTTGCGGTTCATCACGATCTCATCGCCGGCCATCGCGATCACCGAGGCGATCGAGGCGGCGAGGCTGTCGACCACGACGGTCACCGTGGCGGGGTGATCACGTAGCGCGTTGAGAATGGCGATGCCGTCGAACACCGAGCCGCCGGGGCTGTTGATGCGCACCGTGATGGCATCGTTGTCGATCGCGCTCAGGTCGCGGGCGAACTGTTCGGCGGAAATGCCGTNGAAATTGAGGTCGACCAGATCCTCGACGATGTGCGCCTGTGCGGTGTTGCGGATGTCTTCGGCGACCGTCTGGACCGACTGCACGAACGTGTCGGCTTGCACACTGGCCAGCGCGTACGAGCCGCCCTTGCCATCCAGATTCAGGAAGTGCGCCAACGCAACCAGGGCCATCTGGTGGTCGTGGTACTCGATCGCACGGCGCGGGTCCATCGGGGTGCCCGATGGCGACATGATCCCGGCCTCTTGGCCCTCGGCAAGGGCCAGGCCGGACGACTCGCCACCGCTGTACTTAGAGGCGACATCGAGCAGCGCGTCCATGCGCTCTTCGTCCTGAGAGTCGTTCTCGTTGCCCTTGATCCACGGGACGCCGATGCCATGGCGGCGTGCTGCGGCGGCCTCGATGCGCATCAGCTCGTCTTTGAGCTTCCAGTGCTTGTAGGCAGGCCGCAGCAGGCTGTTGCCGATCCACACACCCGGATCGGGCTCGTACGCATACACGACCAGCCGGTTGATGGGAATGGTCGAATCCAGCGGCCCGCCAGCGGGTATCGCCACTCCGCTCGATGTCATGGTGAACCCGCTGGAGGGGTGTTGCTCGATCGAGATCAGACCGCCGTCGCGGTCGACGTTCCACTTGGCGATGGTCACCTGGGGACGCGGGGCGAGCTTGCGCAGCACGGCGCGTACGTTGGCGCCCTCGCCTTCGAGACGGTAGACCTGCTCAAATACCGAGTGCCCGTACCGCAATGCCATAAGGGCCTGCTGCAAGTGTTTGTCCCAGGAGAACCGGCCACGGGACCGCGCCTGGGGTTCGTTCTCGTCGGCGGCACCCTCGATGGGCAGACCCAGATTGCGGGCGATGAATTCGGTGACCTCATCGCTGGCGCCGTTCTGCCGGATACGCCACGCGGTGCGGCGAATGGGCAGCCCAATTGCCCGCAGCACCGACGAGATTCGGGCGTCCTCGCGGACCATGCGCGTGTAGGTCCACACCGACAGCGGCCAAATCAGGTCGGCAGTCTGCTCGAACTGATCGATAGGTCCACCCCAGCCGGTCGCGCCGGCCGAGCTGAGCACGTACCCCTGTTCGGTACGCGGGGCGGCGGTCTTCTTCGGTGCCTGCTGATCGGCCATGCTCGCCCCCTTTCTCAGAATGCGGCGCTCATCGCGTCGAAATCGGCGCTATGCCGGTGTGTTTGGTGCTCTCGTGCGGCCCCGGTGCGGGCGCTGACGGTCTTGGCGGGCGCCTTGGTTCCGTACTTGCGAAGGGCCCAGTGCGCCATGGACACGTTCATCAGCGGCATGCCTGCGCCGTTGGGTTCCTCGGCCCAGACGAAATCGCCGCCCGGCAGCTCGCGCATGCTGGCGGTGGCCACCTCGTCGTTGAGCACTGTTTGATCGCTGTGCGACAACTTGACGGCATCGGCGTCTGCCAGGAAACCGCTACAGGACTGCGCGATCTCGGACGTGCCGATCATCAGCGGCTCGATACCGGCGGCGATGAGCAGCGGTTCAAGTACCTGCGCGGTGTTCTTACGGTCGATCACCAGCGCCACCGGATTCCACGCGGTGACCTTGGCGACCAGATACTCGGCGATCTCGGAGTGCGTACCGGTGCGCAGCGGTGCCACCTCGACATGGATGTTGCCGTCTTCGGCCATCTGCGCGGCGCTGATTGACCACACCTGACGGTTCCAGGATCGCCGCACCGCGATGGTGCGGGCTCCCGT
GAGCTTCGCGTCGGCGTTCGCCATGTCGCTCCAGTTCGGGATCGGCGAGCCAACCTCGTCCTCGTCGGGCGGGTAGTCGCCGATCCCGAGGTAGTCGGCGGTGAAGATCGCCCGCTGTTCGGCGGTGCGGGCCTTGCGCCGTTTGGCTTCGAGCTCGTGCTCATCGCCGACGACACCCAGGGAGGGGTGCGCCAGGCGGTATGCGTCGATATCGCCGAGCTCGGTGCCCTCGGGTACCGCATATAGGGCGTAGTACAGATCCGGGGACCGCTTGTGGCCCAGGTTGTGCATCCCGGTGAGGATCTGGCAGTTGGGGTGTACCGAGGCCACCGGAGGTGTTGAGACGTACCAGATCTGCGGCCCGGTCGCCTTGGTCGAGGCGCGGGTCGCGCCGGTCAAGCTCGCTTCGGCTTGCGCGGTGAGGTCGTAGGCCTCGTCGAGTATCAGCAGATCCACTTCGGTAAGACCGCGACCGAACTTGGCGGTGCGCGGCCCGAACTTGGCCTCGCCGTTGCCGAGCTTGATCAGCCCGCGGTTGCCCGCCGAAGTTGGTTCGGAGCGTAGGCGTTTCTTGAGAGACGGGATGCGGTCGATGACATCGACGCAGCGGCCGAATACGTCCTTGGCCGTTTCCCATTCCTGGGCGGTGTAGGCGATTTTCTCCCCGAGCACCAGCATCCCGAAGATGATGCGTAGCACCACGATCAGGGTCTTGCCGTTCTGACGTGGGCACTCGATGCACACGTCGCGGTGCGTCCAGACGCGATCGCCCCACTCGTTGGGCTCCTGTAGCGAGAGCACCGCGCGTAAGGTGAGCCACTGCCAAGGCATGCAGCGCACGCCAATTCGCGATCCCAAGCGCGCCGCCCGGTCGCCCCATGATTCATCGCCGGGGTGTCGGGACTCGAATCGTGGTGTCTGACTGCCCTTCAGGCGTGGCCACAAGCCGATGAACTCTGGCCACTCACGCGGTGCCAGGTCAGATACCGGCGAGCACGTCGTCGTCATCGGGATCATCCGGCAGTGCGGCGCGCTGGCGATAGACCTCGGTGATCAGCTTGCGCATCTGCTCGGCCTGCTGGCGCTGCTGCACCAGCACGTTGTTCACCACCACTTCGACGGTCTTGGCGCCGATCTTGAGCTGTAGCCAGCAGTCGCGGTCGCCGTCCAGTAGAGCGTTGAGCCGGGCGAGGTAGTCGGCGGCGTATCCGGCCTGCTCGATGATGAGCCGCAAGGGGTAGGGGTCGTCGGGTTGTGACAACTCTTCGATGAGTCGCTGGCCGACTGTCTTCTCGGAGGCCGGTTGCCGACGCGTCGCCCGTTTAGCTGGAGTCTTTGCTGAGTTAGCGGTGGCCTTTGCCGGTTTCGTGGCTGCTGTCATCGTTCGCCGCGTTCAAAAAAAACCTGACGGGAGCCTCCGGGGGTCAGGAAGGCCCCCCACCTGGATAATTTCAGGGGGAGGGGCTTTGACCTGCGGTTATGGCACTTTCGGGTGTGTGCATCGGTGCTGGTCAGGGGCTTTTCGGTCCATCGGCTGGCGATCACCACGACATCACACCTCCGTCGTGTTTGCTGGCAGGTTCGGGATGTTTGCTGTGTGACTGGTCGGCGTACCACCGCTTTGCTGCCTGCGCCATGCGCCACGGTCGCTCGGCTTTGCATCGAGCCAGGACCACGCTCTGACCAGGATCGATCGTGATGACCTGCGCGCCAGCGGATCGGTAGCGCGCGAGCAGGCCCTCGCCGGGCATGGAGTGGATCAGGTACACATCGCACTGGCCCGCGAACGTCAGCGCCGTATCGATCGCGGCCAGCCGAGCGGCCTTGGTGACCGAGCGGACGTGCTGCGGCGGGTCGTGCGGGTCTCCACCCGCGGGCGTGAGTACCGAGGCGATGGCGTCGTAGTCGATGGTGATGTCGCCATGCTTGGCGTGCTGTCGTACCCATGTGGACTTGCCGGCCGCAGGCGGGCCGGTCACCAGGTAGAGCGTCACCAGTCC
ATCGCCAGGTTGTCGGCAGTGATGACGGGCGCGGCGGTGATGCCCAGTGTCGCAAGGGCTGTCGACCACTCGGATGGCTGAACGCCGAGCACCACGGGCCGGTGGGCGTCATGCCTGCCGTCCTGGCGCTGACTGTTGCAGATGCCGTGCAGTAGGCGATCGGCGCGCTGTCCGCCGAATGCCCGAGCCTGACTATGGTCTGCGGCCAGCTGCTTGCGGTCCCAGTTGCGCTCCAGCAAGGGCGCTTTGAACATCGGTAGGCCACACCACCAGCACAGTGTGCCGTCGACGTGACGGCGCAACAGGCTCTCGGCTTGCTGTTGGTGTTTCCAGCCCAGACCGCGATCGGTGGTGCTGGCCTTACGGCCGGGCCTCGGCATGTGCGGTGTCCGGCTTGGCCTCGGCGCGCGCTGGCGGTGCCTTAGGTGCGGGTGCGACCTTGACGGGTGCGACGGATGGCTCGCTGCCGTCCTGCTCCACATCCAGCGTCCAGCCGTTGGCGCGGGTAGTGATGGTCATCGTGGTGTTCCCAATGGGCTGGCCCAGCTCGGCCAGCGTGCCCGCCTGTGCGAGAGTGACCATCACGGCCAGACCCCAACCCTGCCCGCCGGAGTGGCGCTTAAGGTCGGGGATATCCGGCGGCGTGGAACGCCACTTACCTGGGTCGGTGTCCATCAGGACCTTGCCGTCGACAGTGATCTTGATATTGCTCATTGGGCTAGGAACTTTCGTAGTTGGCGGGCATCGATCGTCACGTCGTCGGTCTTGCCGACCGTCAGCACCAACAACGGCGTGGCGCGCTGGTGGTCGGTGCGGTCATACAGCGTGACGATTCGGGTGCCGTCCGGCGCTTCTGCGGCATCCTGGCGCAGCTGTGCCGCATCGGCTTTCGTGAGTACGTCGAATTCGCCATCGATGACCGACCCAAGGGCCTCGGCCCAGAGCTTTGCGGCCTGGCCGATCATTTCCTGCGCTTGGTCTTCAGGCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP013049|5139108:5154019|5145299_5145659_-|WP_005061728.1|DBSCAN-SWA MAADRRPNPLVALGVPQSEIDKAIHTSAQAKAEKARVGKEMAAYAKSISPVDHGDYGAAWKVQQGKGGDDDTKVVNDNFKAHWIEDGTGGTSPTPEFAVAAKTAIAFGGTAADVINRPD >NZ_CP013049|5139108:5154019|5145658_5146066_-|WP_017207407.1|DBSCAN-SWA MDTLGGQRLAIVWDVPVLDGQGDPILDEYRKPQVTERVVWVDNCLFEVQSTAEDNQAITTTTTEQSWAFLPVIDGHIPAVDGTGAAAPVPVADIRSAHRIRHLGRDHSMVGDAVLEFDLDGREDHVFCICQRRVG >NZ_CP013049|5139108:5154019|5152009_5152411_-|WP_057138161.1|DBSCAN-SWA MTAATKPAKATANSAKTPAKRATRRQPASEKTVGQRLIEELSQPDDPYPLRLIIEQAGYAADYLARLNALLDGDRDCWLQLKIGAKTVEVVVNNVLVQQRQQAEQMRKLITEVYRQRAALPDDPDDDDVLAGI >NZ_CP013049|5139108:5154019|5143173_5143620_-|WP_005061733.1|DBSCAN-SWA MSSEDTKDVLHPVDPRKAREQAADHLGFMAGVPFDLGDGEMWELPNPAFLDTEQRKRYRDYQRDMKALDKETVDHPFIDGKTIEQNVYPYLKDGKDYDPDEQLCIALMGEDIYAKFLAAGGVPGQIDTHWKVMQRQLEERTKIDSKSN >NZ_CP013049|5139108:5154019|5153740_5154019_-|WP_162259902.1|DBSCAN-SWA MPEDQAQEMIGQAAKLWAEALGSVIDGEFDVLTKADAAQLRQDAAEAPDGTRIVTLYDRTDHQRATPLLVLTVGKTDDVTIDARQLRKFLAQ >NZ_CP013049|5139108:5154019|5139108_5142735_-|WP_005070937.1|DBSCAN-SWA 
MSVQIPIGAAADHRSWKRVADDATRTFGNAGKDAGRDFANALAGSSKDVEKSLKRMGDRASDAYDKAASAVGKLKSEESELQRLRDRDADGARIIRQTEKVNDARRAEARAVRDATRAYREYQEAADEASRRNNTNLVDGMRAQAGQAAQLGRDMGNGFSGGFTHGVSSAASIARLGTAGGPIGAALLGLTAVGILVGSRLSNAIAEGMATTATTKLFQGRMGLDDTSMGNYAKAAGQSYANNFGASVADNLSVAQAALRNNLIKPNSPDDEIQYTIQQLQGVAQVVEKTPQELAHSATQLMRTGLANSVTEALDIITAGSQKGLDVTGDWLDSIGEYSTQFRKLGLTGSETMTLLKQGIEGGARDTDKVADSLKEFSIRAVDGSKSTKEGFEALGFNADEMGRRFSAGGEQAHQAFAAVLTGLRNLDDPVQQALVWQRLFGTQWEDMGDAVNKLDLDPAKNQFKDLQDTSQRSTKTATETFKSEWESATKTVDQWFTDLKTSISDWFVDLPVIRDIPTMIKDLFSSSPPPPQYAAPLGGTHPGTDILANTLPGAPGAGSTVLPPAPGDNSARTLLGSALAPGTALPPPDAQRGNAVDNGPQAGDRKPIAPAGDDDKTKAPIDPSLWSVESKPVAMPPGLATAPTAAPGVLVSSPKGGPGLGRYEVDPMRVYDAESSAIRAKNSLEQDRIALIRLEQQGNADQDALLRARNQVADAERSYVSAQMKLAEAQQGTWKKLESSTQGLADGMGQIGAALDKDFGISKGLPGLAENLTKFLANMAAAPILGQLGAVSQLNPSKGGYGAMGILAAQGVFGPQYTGVAQDVAMAGIGPMALQQGVNPNLAAMYALAARGGKYAPASDLQNGLADCSGAVSDLVEVLRDGKSSPARLFDTTAFATDASAAKLGFLPGYQPGAFNVGVNPLPGQQGHMAATLPNGMNFESGGGHGPMLGGSAAGALDKQFPKQYYMPLGSGTSSAPSPQPIGPTVDYRALYPKTAGPGLAVTDPVMSDPTLTNPALTAGIPAAGGGWGGATGPAQAWSPSSTRIGGVEPATGSGAGGVGITPGGTIDTAIGMAASAADIFAPGAGQAAQTGIKLANRAIQFGAQAAGIGVQGLMDTVLPTAGSELANKSWLTKILGGVAGAAPAIPNVAGKATAPPNPNQGDPNAQGGPVKAGDTNIHVTNNRATEDGTGRDIAFHQQARNSGPGM >NZ_CP013049|5139108:5154019|5153020_5153284_-|WP_057138219.1|DBSCAN-SWA MFKAPLLERNWDRKQLAADHSQARAFGGQRADRLLHGICNSQRQDGRHDAHRPVVLGVQPSEWSTALATLGITAAPVITADNLAMDW >NZ_CP013049|5139108:5154019|5147782_5148160_-|WP_005061716.1|DBSCAN-SWA MTNENVGVYEPGRDITGRATAAITGKRFLKISGNRTATGNIAVAPADAAGRVCGVSKYDAASGDIVGVARGNSRVTYVTADGALAAFDEVEVGTAGKAKKFASGVAVGYALSAATDGADAEISLY >NZ_CP013049|5139108:5154019|5146065_5146428_-|WP_005061723.1|DBSCAN-SWA MTDFLDVEAFAAMLRPLSAAEKLVAAPLLTVVSDWIRDKKPAIADDDPAAKVVTFEVTRDALMYGEFGPVSSFTKTVGHRTKQAAIDREAVEKFIARRHYRMLGLALQAKARGHFPRGDY >NZ_CP013049|5139108:5154019|5143719_5144799_-|WP_057138160.1|DBSCAN-SWA 
MTQPETGADWSVGGFTDTDSRFAIRGPLVAVLARDYRGAATDISPHVFNPLAKDGKLRADLFARRKVGGYWVNNPEPNQGWLFIGANTKTGGPEREPNIDVSPLEILQSNYPIEKDITKIEKTVKFTPIETLNPVVKALRNNVPLQDEDGNLLVAEPGQGDYFVGTPLEADFVPRQLLLVRARSRAGGKLYTVEPIPLCKLTKIGAAKMDTEDADANELEFSLEPDPFFLIPDPRNPGILIPGLDGEWVGGKGWTTIQGAPKVSNTPPTVTPGAAGKASIVFADPTGAGDPFTFAAESTVDDGTTWLPAELDGPAVSSGGNTTVKVKTVAAGATKFRVKVTGTNGASVYTPKSAAATIA >NZ_CP013049|5139108:5154019|5153390_5153744_-|WP_057138163.1|DBSCAN-SWA MSNIKITVDGKVLMDTDPGKWRSTPPDIPDLKRHSGGQGWGLAVMVTLAQAGTLAELGQPIGNTTMTITTRANGWTLDVEQDGSEPSVAPVKVAPAPKAPPARAEAKPDTAHAEARP >NZ_CP013049|5139108:5154019|5150448_5151861_-|WP_005123185.1|terminase|DBSCAN-SWA MPWQWLTLRAVLSLQEPNEWGDRVWTHRDVCIECPRQNGKTLIVVLRIIFGMLVLGEKIAYTAQEWETAKDVFGRCVDVIDRIPSLKKRLRSEPTSAGNRGLIKLGNGEAKFGPRTAKFGRGLTEVDLLILDEAYDLTAQAEASLTGATRASTKATGPQIWYVSTPPVASVHPNCQILTGMHNLGHKRSPDLYYALYAVPEGTELGDIDAYRLAHPSLGVVGDEHELEAKRRKARTAEQRAIFTADYLGIGDYPPDEDEVGSPIPNWSDMANADAKLTGARTIAVRRSWNRQVWSISAAQMAEDGNIHVEVAPLRTGTHSEIAEYLVAKVTAWNPVALVIDRKNTAQVLEPLLIAAGIEPLMIGTSEIAQSCSGFLADADAVKLSHSDQTVLNDEVATASMRELPGGDFVWAEEPNGAGMPLMNVSMAHWALRKYGTKAPAKTVSARTGAAREHQTHRHSADFDAMSAAF >NZ_CP013049|5139108:5154019|5144889_5145300_-|WP_005061729.1|DBSCAN-SWA MTVALHEQMPPNAIVMMLAHLAPLGPCDIERKPDDPLPFRQVNMIDGTYDANLFYCTAVLSIHTFGKTITEAQREGIKTDRRIMLLGKDIVDVPMPDGTVANVDYIDFQQLSTLREYKADNAFRLKAICELGLSFN >NZ_CP013049|5139108:5154019|5152571_5153024_-|WP_057138162.1|DBSCAN-SWA MTLYLVTGPPAAGKSTWVRQHAKHGDITIDYDAIASVLTPAGGDPHDPPQHVRSVTKAARLAAIDTALTFAGQCDVYLIHSMPGEGLLARYRSAGAQVITIDPGQSVVLARCKAERPWRMAQAAKRWYADQSHSKHPEPASKHDGGVMSW >NZ_CP013049|5139108:5154019|5142749_5143112_-|WP_005070934.1|DBSCAN-SWA MRIADWHQGTRDERGALVLSSRQLLSLIHQLPEDSEFKTHAPPPFGRDGDWTVMQKIAAETHNELAAYRASQYAGTPHEYMYTKYSSPLDSRRQHELDSAENEFIESAREELLDDVFGDQ >NZ_CP013049|5139108:5154019|5146854_5147766_-|WP_005061719.1|DBSCAN-SWA 
MTTSPVAYPLGAPVINDNKISVDLAYKQPGRITKRLSDLTLQKFIAPELFSSSGASTTAGAIIYDVIRINELYTKNDVEQRGPSDEYTIVQGERTQPEVAKSEDWGGKFWMSDEAIRRNDRAQMDRLTTQLANTLVRKINQRTVAVLEAVIASLGGAGVIPGHDWGNVTLTGNNPTPNNARPFADIIAAQLAADVEELGYVYNVWVVNPVQYADLRIAYGPDLPQILADADISMFRSNRVANGSAFAGVRGGVGFLDYEQMLSTETWREPKTKQNWVQSSVLPIMGVTDPYAVKKVTGLKGAP >NZ_CP013049|5139108:5154019|5146432_5146855_-|WP_005061721.1|DBSCAN-SWA MPEVTERRVTAATWEYLTPAGTRRRAFFGELVTLTDDEVERGLAVGALGAQLPAETTDPESDSAEAEVTDEGDSGDGGDGDSGNASPATGTEGDAPRKKPLKAATKAVLVDWLMANGTYDRDELEAQEKDDLWALIEATD |
17 | Mycobacterium_phage(92.31%) | terminase | NA |
Homologous phage analysis in the prophage region. A colored bacterial protein is one whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
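The keyword legend above can be applied to the protein headers in the prophage records with a short script. The sketch below is illustrative only: the function name `parse_header` is hypothetical, and the header layout (`>contig|region_start:region_end|start_end_strand|protein_id|[keyword|]tool`) is an assumption inferred from the records on this page, not a documented format. The keyword list is copied from the legend.

```python
# Keywords copied from the legend; compared case-insensitively.
PHAGE_KEYWORDS = {"capsid", "head", "integrase", "plate", "tail", "fiber",
                  "coat", "transposase", "portal", "terminase", "protease",
                  "lysin", "trna"}


def parse_header(header):
    """Parse one protein header line (assumed layout, see lead-in)."""
    fields = header.strip().lstrip(">").split("|")
    contig, region, location, protein_id = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    # Location looks like '5150448_5151861_-': start, end, strand.
    start, end, strand = location.split("_")
    # Anything between the protein ID and the trailing tool name is
    # treated as an optional keyword label (an assumption).
    labels = fields[4:-1]
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "keywords": [lab for lab in labels if lab.lower() in PHAGE_KEYWORDS],
        "tool": fields[-1],
    }


if __name__ == "__main__":
    example = (">NZ_CP013049|5139108:5154019|5150448_5151861_-"
               "|WP_005123185.1|terminase|DBSCAN-SWA")
    record = parse_header(example)
    print(record["protein_id"], record["keywords"])
```

A record with a non-empty `keywords` list would correspond to a colored protein in the legend's sense; headers without a keyword field simply yield an empty list.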
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|