Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP041667 | Lachnospiraceae bacterium KGMB03038 chromosome, complete genome | 6 crisprs | csa3,RT,WYL,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh | 2 | 4 | 7 | 1 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667_1 | 234998-235097 | Orphan |
NA
Consensus repeat of NZ_CP041667_1
|
1 spacers
spacers of NZ_CP041667_1
>1.1|235024|48|NZ_CP041667|CRISPRCasFinder AGATGGAGCGAAGCGGAATCAAGCTGGGGCATGGCTGTGAAAAGATTC |
CRISPR arrays and Neighbor proteins around NZ_CP041667_1
The CRISPR arrays of NZ_CP041667_1 >merge|NZ_CP041667|1|234998-235097|CRISPRCasFinder ATGCATAGGGCGGACGCCCGCCAGCGAGATGGAGCGAAGCGGAATCAAGCTGGGGCATGGCTGTGAAAAGATTCATGCATAGGGCGGACGCCCGCCAGCG >NZ_CP041667|1|1|234998-235097|CRISPRCasFinder ATGCATAGGGCGGACGCCCGCCAGCG AGATGGAGCGAAGCGGAATCAAGCTGGGGCATGGCTGTGAAAAGATTC ATGCATAGGGCGGACGCCCGCCAGCG
>NZ_CP041667.1|WP_143928670.1|233560_234934_+|V-type-ATP-synthase-subunit-B MPKEYRTIQEVAGPLMMVKGVEGVAYDELGEIELANGEKRRCKVLEVDGDNVVVQLYENAAGINLSNSKVRFLGRSMELGVSGDMLGRVFDGLGRPIDDGPEILPEERRDINGLPMNPAARVYPSEFIQTGVSAIDGLNTLVRGQKLPIFSCSGLPHSQLAAQIARQAKVRGKDENFAVVFAAMGITFEESNFFVESFKETGAIDRTVLFVNLANDPAVERISTPRMALTAAEYLAFEKDMHVLVILTDITNYADALREISAAKKEVPGRRGYPGYMYTDLAQMYERAGRQRGKIGSITMIPILTMPEDDKTHPIPDLTGYITEGQVILSRDLYRKGLQPPIDVLPSLSRLKDKGIGEGKTRADHSNTMNQLFAAYSRGKDAKELMTILGEAALTEIDLIYAQFADAFEKEYVNQGYNTDRSIEETLDIGWKLLSILPRSELKRIDDKFLDEYYGKQ >NZ_CP041667.1|WP_143928669.1|231791_233561_+|V-type-ATP-synthase-subunit-A MSSVGTIKKVAGPLVIASGMRDANMSDVVRVSSQRLIGEIIEMHGDEASIQVYEETSGLGPGEPVESTGAPMSVELGPGLIASIYDGIQRPLDDIMKISGNNLQRGVEVAALKRDKKWEFVPVAKVGDAVEAGDVLGTVQETEIVQQKIMVPYGVEGIVKEIKAGEFTVEEVVAVLETKDGDHELTMMQKWPVRRGRPYVKKLPLDVPLVTGQRVVDTFFPIAKGGVAAVPGPFGSGKTVIQHQLAKWAEADIVVYIGCGERGNEMTDVLNEFPELKDPKTGRSLMERTVLIANTSDMPFAAREASIYTGITIAEYFRDMGYSVALMADSTSRWAEALREMSGRLEEMPGEEGYPAYLGSRLAEFYERAGHVISLGKEGREGSLSVIGAVSPPGGDTSEPVSQATLRIVKVFWGLDSNLAYKRHFPAINWLTSYSLYLDNVSGWFNSTVAPDWMEDRQKMMSLLQDEAELEEIVQMVGMDALSPADRLKMEAARSIREDFLHQNSFHEVDTYTPLRKQYLMMKLVLAFYEKSLEALNKGASMRALLGMDVRERIGRYKYTAVDQIETEYEKIMSELDAEIAGAFGKEDF >NZ_CP041667.1|WP_143928668.1|231449_231767_+|V-type-ATP-synthase-subunit-F MYKIAVLGDYDSIYGFAALGLDTFPVTAQEEAGERLHQLAAQGYGIIYITEALAAELKHEIVRYQDQILPAIIQIPGISGNTGDGVLGVKKSVEQAVGSDILFSN >NZ_CP041667.1|WP_143928667.1|230488_231457_+|V-type-ATPase-subunit MSERYTYAVARIRALEVSLFSNAAIDQLIACQDYEQACQFLAERGWGDTDTVTDAEAMLTREEEKIWEVVKELHIDMENFAVLSYPKLFHNLKAAVKEAAVSDGNRHIYYDDVSIPGDVMREIVKEKDFYKLPANMQHAAQEAYEALLHTGDGQLCDVIIDRAALEAIYQAGKEAKADIIRAYAESTVGVADIKIAVRSQKTAKSVEFMKRAMAECDSINVDQLSKAALAGMEAIRDYLMGTAYAGGAEALAQSPSAFERWCDNRIIETISPQKYNAFTIGPVIAYVIARQNEIKTVRIILSGKLNDLPEDSIRERVREMYV >NZ_CP041667.1|WP_143928666.1|229863_230457_+|hypothetical-protein MTGLEKMKSQILDEAKAAAESKVSEARAQAAKIVCEAKDEAEKSGKSILQKSQAEVKGYQERIASSIDLQRRTKILEAKQKLIREVLEKALESMESMEREEYFSKMLGVLEKYALPQEGKLFFSEKDLADLPAGFEAEVERIASEKGGKLTVSKEGRKIQNGFILAYGGIEENCTLSAMFDAKKDELSDKIQHILFS >NZ_CP041667.1|WP_143928665.1|229316_229802_+|V-type-ATP-synthase-subunit-K MSILGEMGIVYALLGAAAAVFLAGAGSALGVGIAGQAASGVVTEDPSKFAKVLIIQLLPGTQGIYGLLVGFITLSKIGLLGGGAADLSVITGLQILAACLPVGIVGLISGKSQGETAAAAIGIVAKKPDQFGKAMLFPAMVETYAILALLISILAVSAIQV >NZ_CP041667.1|WP_143928664.1|227304_229320_+|V-type-ATP-synthase-subunit-I MAVLPMQRVSICALKKDRKAILEKIQSMGIMEMNQVAEGEEGFGTMDTVSARLSFDKKAQTAEQALAVLETYAPEKQSIFASLAGKDLVEKEKFDGTVADREEILETASLLLTDHKKIAEDKASIQKLENQIETLTPWLNLDVPMNYAGTKKAAMLLGTMPKETTLESIYARFADQELEAVDVETVYSDRDAVYLAVFCMRESESKVEEVLRAEGFARPTQAVEEIPRRQKEILEAEIQKLNKRIEETEEEIRQQEKSREPLKMISDYYRMRAEKYAVLGTLPQSQRTFVMSGYVPARFVPAVQKAIGEKFDCVLDIEEVKEEEDSPTVLKNNSFSASMEGVVASYGLPNKKEVDPTTIMSFFYVFFFGMMLSDAAYGAIIAIVCFVLLKKFPRMSSGMHKSLKMFMYCGISTVVWGILFGGYFGDVIPVVSETFFGTRINVDALWFVPLDDPMKLLIYSMLFGLIHLFVGHGIKGYMCLKDGNIKDFICDVVLWYVFLIGLILMLIPSDIFASVAQTKIVFPPVLNTLAKALAIIGAVGLLLMSGRDNKNPALRLALGAYDIYNVTGWLSDVLSYSRLLALGLATGVIASVINEMGSMFGNGILGAIGFIIVFIIGHTMNMGINILGAYVHTNRLQFVEFFGKFYDGSGRPFNPFESNTKYVDVKEETKS >NZ_CP041667.1|WP_143928663.1|226958_227267_+|hypothetical-protein MVEETIKTIREAEQEAEELVKKADETCTEILEQAEAEAKTIKETAKENAAKQAEADLNEAKRQGEEALEKALETVETEILSLYETARQKEAEAISAVIGELV >NZ_CP041667.1|WP_143928662.1|226324_226816_+|tRNA-(cytidine(34)-2'-O)-methyltransferase MLNIVLHEPEIPANTGNIGRTCVATGTRLHLIEPLGFRLNEKNLKRAGMDYWNDLDVRTYINYEEFLEKNPNARIYMATTKAQKAYTEVSYEPDCYIMFGKESAGIPEEILVRHKNDCVRIPMAGGIRSLNLANSAAIILYEALRQNDFLGMERTGHLHHLKW >NZ_CP041667.1|WP_143928661.1|225345_226206_+|hypothetical-protein MADENRLSEIAVVYGGMEKGGLYAVTAAANAVAAKCGKLLAAQVRIEYPIKMEKARIYGMLKPIRAFCRERGIELAEEGISPSPVVSRVMTVAVVNGEKSGKTGSESKSVRPGQEILVTKWIGLGGTARLLWECGERLKNHFPLRFLKQTEEMESLIFAGEDAMIANQAGAELMIPVAEGGILAALWQLAKTAGQGFSVDMKALPIRQETVEICEYLELNPYQLTGAGSLLIVAKEGQNMAEYLRKRGIPAVCAGCLTEGQGKILHNGEEIRYLDRPAPDEILKIF >NZ_CP041667.1|WP_143928671.1|235234_235894_+|V-type-ATP-synthase-subunit-D MASTQITPTRMELTKTKKKLATARRGHKLLKDKRDELMRQFLELAKENMALREKVEAGILSANKNFVIAKAGMDAATLNTALMAPKQEVSLGVGKKNVMSVNIPAFETKTRTADANDIYSYGFAFTSSDLDGAVKSLADILPDMLKLAETEKACQLMAAEIEKTRRRVNALEHVIIPEAQETIKYITMKLDENERSSQIRLMKVKDMMLEEAHHYSERA >NZ_CP041667.1|WP_143928672.1|236353_236722_+|hypothetical-protein MKERLPFYMVYQTPFLGDDEERTARRDYDYMKSIYPATAKRMMPFVEEECDRLMYDGSMIYDEYPDQLQMRLMCARIYDRAKDGEENPGKWLRDLTQVMVCQELCQRRREYRKYKKTFYMGK >NZ_CP041667.1|WP_143928673.1|236854_237355_+|shikimate-kinase MDNIILIGMPAAGKSTIGVIIAKRLGYRFIDVDLLIQESEGKLLKEIIAEKGIKGFLEVEERINAGLKTEHTVVSPGGSVVYCEKAMRHYQQIGTIVYLKASFETINKRLKNARSRGVVLEDGQTLRDLYEERCGLFEKYADITVCEDGLRLDETIEKVIEILQKS >NZ_CP041667.1|WP_143928674.1|237487_238780_+|UDP-N-acetylglucosamine-1-carboxyvinyltransferase MEQYIIKGGHPLVGEVEIGGAKNAALAILAAAIMTDETVRIENLPDVNDINVLLDAIAGIGASVHRVDRHTVTINGRGVTDFNIEYDYIKKIRASYYLLGALLGKYRRAEVALPGGCNIGSRPIDQHLKGFRALGAEVEIEHGKIIAEAERLKGEHIYFDVVSVGATINVMMTATMAEGITILENVAKEPHVVDVANFLNSMGANIRGAGTDVIRIRGVQSLHRTEYSVIPDQIEAGTFMFAAAATKGDVTVMNVIPKHLEATIAKLVEMGCEVEEFDDAVRVVSKGDLKSTHVKTLPYPGFPTDMQPQIGVTLALSKGTSTITESIFENRFKYLDELARMGANVKIEGNSATIEGVKALSGARVSAPDLRAGAALCIAGLATEGITIVDDIVYIQRGYERFEEKLRGIGALIEKVSTEKEIQKFKLKVG >NZ_CP041667.1|WP_143928675.1|238866_240036_+|MFS-transporter MSRRREQNRYWILFVASVVNFVHGNPYIWTVFQPYVKEEFHLSDAASSQPFTIIIGIFALGNMAGGWLQQKIGAKKTILAGSLFMCAGFLLAGIAPYNMPWLVSLGYGAIGGFGSGCAFSMLTAVPLAWFPEKRGLVSGITVGVVGISGIVMNPFCDFLLASFGYRFAMLATTAIYAVLCLGGFWIEENPQNIKETAGDGGIERTISIKQYTTREMIKTKTYYTISLTMALAVPAYVLVNPLMKSLGMERGLTNTEALAGVTIASVANIIGRFAMPWLSDKVGRKAVIRGMYVAAAAAVIGLMGAEGGIFVLLISVVCLVYGGVVSVFPVVVSDHFGLKYQGINYGAVMIGYGLVSILCPYVLDNLGLEMSFLAAGIACAAGLLGTRHF >NZ_CP041667.1|WP_143928676.1|240369_241557_+|methionine-adenosyltransferase MERRLFTSESVTEGHPDKMCDQISDAILDALIEQDPMSRVACETCTTTGMVLVMGEITTQAYVDIQKIVRDTVREIGYTRGKFGFDAETCGVIVAIDEQSPDIALGVDKALEAKEKKMSEEEIDAIGAGDQGMMFGYASDETPEFMPYPIALAQKLARKLAEVRKNGTLPYLRPDGKTQVTIEYDENGAPARVDTVVLSTQHDPGVSQEQIHQDIKKYVFDPVIPADMTDEKTRYFINPTGRFVIGGPHGDSGLTGRKLIVDTYGGMARHGGGAFSGKDCTKVDRSAAYAARYVAKNIVAAGLARKCEIQLSYAIGVAHPTSIMADTFGTGRISDEKLVEIIRENFDLRPAGIIKMLDLRRPIYKQTAAYGHFGRTDLDLPWEKTDKAELLKKYL >NZ_CP041667.1|WP_143928677.1|241635_243132_+|HAMP-domain-containing-protein MKLRTRLFIAFLTVILLPICLTLLMFFAFSRYQMGAIEKTYGIENTTVETLSNSMQVLSRLTERSYHNLEAAIEKNPDDLEDATYLEDFNDNLEKKNSYLLVRKGNTLIYIGTDREKADPVICQLPEYQDADTSSENGIYLGGEAHSLVKQIDFLYSDNEEGSAFIVSDVSDVIPEVEELFLDTLLGVVLILVLTASVLMFWIYRSVKLPLKRMQVAAKNIKEGNLDFELKAQGDDELGQLCRDMEDMRRRLKDSAEEKVVFDRENKELISNISHDLKTPVTTIKGYAEGIMDGVADTPEKMEKYIRTIYNKASEMDLLINELTLYSKIDTNRIPYNFNILSVNEYFNDCAEDLSIELESKNVEFGYFNYVTPDVRVIADAEQMKRVIHNIVNNSLKYMDKEKAKINLRIKDVGDFIQVELEDNGKGIAAKDLPNIFDRFYRTDASRNSSKGGSGIGLSIVKKIIEEHGGKIWATSRENTGTTMYFVLRKYQEVPIHE >NZ_CP041667.1|WP_143928678.1|243124_243814_+|response-regulator MSRILIVEDEESIADLEKDYLELSGFQVEVANDGETGLRKALEEEYDLYILDLMLPGIDGFDICRQIRDVKNTPIILVSAKKDDIDKIRGLGLGADDYMTKPFSPSELVARVKAHMARYDRLTESAVPKNKVIEIRGLKIDTTARRVWVNSQEKTFTTKEFDLLTFLASHPNHVYTKEELFREIWDMESIGDIATVTVHIKKIREKIEVDTSNPQYIETIWGVGYRFKV >NZ_CP041667.1|WP_143928679.1|243865_245245_+|potassium-transporter-KtrB MKYRRKSVRWNTMRILAAGFLGVILLGGVLLWLPISNQQPIAFIDALFTSTSAVCVTGLVTITPQVQFTLFGKIVLLLLIQVGGLGVVACIASFFFLLRKRITVKERIVIQEAYNMDRLGGIVGMLRRVIVGVFLVEGAGALFYAFQFVPEYGWIKGLGYSIFHAVSAFCNAGIDVLGSTSLSVYSANPLVNLTTMALIVLGGLGFVVWFDVLDNFRRLRKRRKTVGAGIAGLKLHSKLVILMTAVLLVVGTVVIFLMEYRNPQTMGGMSTGEKLLASAFQSVTTRTAGFFTMPQGELYDETKLFCSILMFIGGSPAGTAGGIKTTTIAMLLLSCLAVVRGGKDIECFGRKITFENFRTGFAVTVLAFGIFIGGTMLIAVFEADSVALVDIIYETTSAIGTVGLTADLTPHLERASQVVLMLLMYTGRIGPITLALVFAGKTDPKARLRELPGERVMVG >NZ_CP041667.1|WP_143928680.1|245258_245936_+|TrkA-family-potassium-uptake-protein MKKQYAVFGLGSFGESVAITLQGLGCEVIAVDNHMERIEEISKYVSYAMKADIEDPEVIRSLGARNLDGVVVAIADNMEASIMATLVSKDEGVPYVLAKAKNDLHATVLRKIGADSVIFPEKEAGSRVARSMVSANFADWIALSPEYSVMEVALPDAWIGKSLEALDVRKNHGVNVIGIRENDDVEVNPEPKKPLKRDMILILVGANSDLEKFAKYERNKLDHDN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667_2 | 1820057-1820140 | Unclear |
NA
Consensus repeat of NZ_CP041667_2
|
1 spacers
spacers of NZ_CP041667_2
>2.1|1820081|36|NZ_CP041667|CRISPRCasFinder CTGGGGCTGATCTCCCTGGCCCTGATTTTGAGTCTG |
RT |
CRISPR arrays and Neighbor proteins around NZ_CP041667_2
The CRISPR arrays of NZ_CP041667_2 >merge|NZ_CP041667|2|1820057-1820140|CRISPRCasFinder ATTCTGCTGATAAGTATACTGATACTGGGGCTGATCTCCCTGGCCCTGATTTTGAGTCTGATTCTGCTGATAGGTATACTGATA >NZ_CP041667|2|2|1820057-1820140|CRISPRCasFinder ATTCTGCTGATAAGTATACTGATA CTGGGGCTGATCTCCCTGGCCCTGATTTTGAGTCTG ATTCTGCTGATAGGTATACTGATA
>NZ_CP041667.1|WP_143931250.1|1819633_1820038_-|hypothetical-protein MNYQQNPGQSQNTNYQQYQDNYGNYNNYNQNYTNGYNNYNAANPQGYGQNYTQQPQMDTSPMTMGEWLLTILALLIPCAGIIIYFVWAFGKKGNINRRNYCRAMLIIYGVLLVIYIIFFVIFGAMIGASTTYYY >NZ_CP041667.1|WP_143929856.1|1819254_1819407_-|arginase MRVSLLYKTKPDSYTESCTVNAADRWRERNVWYPGRSVRNALKRVTIIER >NZ_CP041667.1|WP_143929855.1|1817754_1819146_-|group-II-intron-reverse-transcriptase/maturase MGTENGESCSQRDSAERKGYVRAHRSFNRIWKERDSAESDILGKILAKDNLNRAYKRVKANKGAPGVDGMTIEAALPWLKEHNYELTERIRKGKYTPSPVRRVEIPKADGGVRKLGIPTVIDRIIQQAMLQQLMPIYEPLFSDDSFGYRPGHGAKDAIHRIKEYMERGYTRAVVLDLSKYFDTLNHTILLNLLRKQVKDERVIQMVKRYLKSGVMENGVVTETKEGSPQGGNLSPLLANVYLNEFDWEFQRRGVPCIRYADDIVLLAKSERAAERLLESSTKYLEGTLKLKVNREKSRTVSVFSIGNFKFLGFCFGKNGKGIYVRVHGKSWKKAKDKLRKLTSRSRCGSIIRTMEKIKVYMRGWLNYYGIADMKKNIESLNGWLYRRIRMCIWKQWKRPKTRRRKLMGLGLPEWAACEGAYSRKSYWRMSNTGVVKRALTKERLINWGFYDLTTAYQSLHVNY >NZ_CP041667.1|WP_143929907.1|1816687_1817677_+|FAD:protein-FMN-transferase MRYKKIAALLTAAILLLPGCSGLTEKRNLVYTDTLYDTVISVKILDPAGDDILKGCEKLCRKYDTMFSYTNEDSDIYKINHAGGAAVEVSEETIDLIKRGIYYGDLSDGAFDISIGAVSSLWDFSSEEPAVPSSAALAEARTHVNYKNIILKDNTVMLRDPKAAIDVGAIAKGYIADRVKEYLEDQGVKHAVINLGGNVQTIGTKPDGTDYNIAIQKPFAKSGDAITSVKVANQSVVSTGIYQRYFEADDGTLYHHILDPSTGAPCKNNLYSVTIITDSSLTADALSTTCFLLGYEEGMRLIDQLDNVDAVFITDDQKLHYSSNFQKKQ >NZ_CP041667.1|WP_143929906.1|1816192_1816564_-|NusG-domain-II-containing-protein MKIKLKKKDWALIVIILCVAALAFLLHEVIGGSGAGKVVVKVAGEIEGTYDLSEDQEIEINGGTNILQIKGGKADMIEADCPDQLCVHQKDISRSHESIICLPNKVTVEIESAESSEYDAVVQ >NZ_CP041667.1|WP_143929905.1|1815688_1816183_-|Gx-transporter-family-protein MKNKVAYFGVFTALALIFSYVESLIPFQFGIPGVKLGLANLIIVIALYKMRLFEVFLLSIVRILLSGFIFGNYFSILYSLAGGLLSLAVMALLKKLGGFSVIGISVAGGVFHNVGQLLTAMVVVETFSVIYYVPVLLVAGVITGFLIGIAAGEMLKRLVNINFY >NZ_CP041667.1|WP_143929904.1|1815065_1815680_-|Holliday-junction-branch-migration-protein-RuvA MIEFIKGELAAVEADKAVIDVGGVGFALFMSGQALGKMPPAGHQVKIYTYLNVKEDAMQLYGFLTKDDLEVFRLLIGVSGIGPKGALGILSGLTPDELRFAVMSNDVKAISAAPGIGKKTAEKLILELKDKLKIEDVLEHAAQEGSGQGETNSGGGEVAGEAVQALIALGYGNTEALQAVKKAYVSEDMEVEEVLKAALKFVAF >NZ_CP041667.1|WP_143929903.1|1814040_1815048_-|Holliday-junction-branch-migration-DNA-helicase-RuvB MSRRIITTENLEEDIKIENHLRPQLLEDYIGQEKAKETLKVYIQAAKERGEALDHVLFYGPPGLGKTTLAGIIANEMDVNIKITSGPAIEKPGEMAAILNNLQEGDVLFVDEIHRLNRQVEEVLYPAMEDYAIDIMIGKGATARSIRLDLPKFTLVGATTRAGMLTAPLRDRFGVVHRLEFYTIDELKDIILRSAKVLEVGIDEAGAYALARRSRGTPRLANRLLKRVRDFAQVRYDGYITAQVADSALDLLDVDKSGLDQTDRELLETIICKFQGGPVGLDTLAAAIGEDAGTIEDVYEPYLLKNGYLQRTPRGRVATAKACGHLGISLENEKK >NZ_CP041667.1|WP_143929902.1|1813563_1813974_-|cell-division-protein-ZapA MASSKNFTEVLIGGKVFTLSGFESEEYLQKVSTYLNHKIEECSSSEGYRKQNSETRSVLLALNIADDYFKARKQGASLETDIEAKDKDMYDLKHELISVQIKLENAEKAMDRLKEENKELQMKIVELETEMKNKKK >NZ_CP041667.1|WP_143929901.1|1809907_1811380_-|FtsW/RodA/SpoVE-family-cell-cycle-protein MVNIIIELSKYIIIIMITMYTFMCFSIFGFQDPDRKKGMLRNQNILMFMIHIIAFLIMYLETDDIRLLAFYLMQVVLFGATILLYTFIYPKVSRLVINNMCMLLCIGMIMLTRLDYSSAVKQFMIAAGAIAISLVVPVIIRKFKRFSEWRNFYAIVGIISLAAVIVVGQVSGGAMLGFTVAGINVQPSELVKIVFVFFVASSFKISLEFKNIVATTALAAFHVLILVASRDLGAALIIFVVYLVMLYVATRQPLYILAGLGAGSAASVIAYYLFDHVRVRVLVWKDPFATYDSGGYQVAQSLFAIGTGSWFGMGLFQGEPDTIPVVVSDFIFSAIAEELGLIFALCMLLICVSCYVMFLNIAMQLHTMFYKLIALGLGTCYIFQVFLNVGGVTKFIPSTGVTLPLVSYGGSSLLSTIIMFGIIQGLYILREDEEENLERKKKERLRAAGSRKTGQKRTAKAGNVPQRPQRTKAPGKAAGKEKARPKQRIR >NZ_CP041667.1|WP_143929908.1|1820358_1820826_-|DUF2752-domain-containing-protein MKRTVKQTVHDGWKILKEDICQARWAIVALALYFLFFKYILHSMCPMVLATGYPCPGCGMTRAAFCVLRLDFAGAWETHPFIFPIIVLAAVFCWNRYVSGKKRQPVLRKCVTVLAVAMILFYFWRMWRFFPGQPPMSYYSGNLLSRIQGVLLHLR >NZ_CP041667.1|WP_143929909.1|1820900_1821692_-|RnfABCDGE-type-electron-transport-complex-subunit-B MSITGIILAAVIVGGTGLFIGIFLGLADKKFAVEVDEKEEQVLGVLPGNNCGGCGYPGCSGLAAAIAAGEAPVNACPVGGAPVAAKIGEIMGVDAGEQIHEVAFVKCAGTCEAAKTSYDYNGLHDCVMINMMQNGGPKACVYGCIGEGTCVKACPFDAIHIVDGVAVVDKEACKACGKCVAACPRHLIELVPYEQKHLVQCSSKDKGKDVMKACSVGCIGCKMCEKVCEAEAVKVVDNVAYIDTSKCTNCGACAEKCPKKIIL >NZ_CP041667.1|WP_143929910.1|1821705_1822281_-|RnfABCDGE-type-electron-transport-complex-subunit-A MADLILIAVGSALVSNVVLSQFLGICSFLGVSKKTETAVGMGGAVIFVITLASFVASLLYEFILKPLGFDYLNTIVFILVIAALVQIVEMFLKKFVPSLYNALGVYLPLITTNCAVLGVAINNVQDEYNLLESVVNGFATGVGYLIAIVLLAGIREKMEYNDIPESFKGMPIVLLTSTLMAIAFYGFSGLI >NZ_CP041667.1|WP_143929911.1|1822293_1823010_-|RnfABCDGE-type-electron-transport-complex-subunit-E MNSCGERLKNGLIDENPIFVLMLGMCPTLAVTTSAMNGLGMGVSTTAVLVMSNMLISMLRKVIPDSVRMPAFIVVVASFVTIVDFLMEGFTPSLYDALGLYIPLIVVNCIILGRAESYASKNPVLPSIFDGLGMGLGFTVALIAIGAVREIIGAGQIFGYQLLPIADEAAGTAGYVPVAIFVQAPGAFLVLAVLAAIQNKVKINMEKKGKDASKIQSGCGADCATCGGCASAAENGKE >NZ_CP041667.1|WP_143929912.1|1823022_1823607_-|RnfABCDGE-type-electron-transport-complex-subunit-G MNKIIKNTIILTVITLVSGLLLGLVYEITKEPIANAQEQAKREAWQAVFPDASQDAFEQIDVDGDVAAQVIKDLGMSGSIDEVCAVDGGDTGYVITVTDGEGYGGDIQITVGITADGTVSGVSFLSISETAGLGMKATESSFYEQYVGVQTEKFYVSKDGGEGEPIDAISGATITSRAVTSAVNAALGYFQNAF >NZ_CP041667.1|WP_143929913.1|1823606_1824563_-|RnfABCDGE-type-electron-transport-complex-subunit-D MSENKLKVSSSPHIRDRVTSGNIMLMVVIALLPASAFGVYNFGLSALIMLISTTVSSVLTEFIYEKLMHKKVTINDFSAVVTGLLLGLNMPPTAPWWMGVLGGIFAILVVKQLFGGLGQNFMNPALGARCFLMISFAGQMTTFVYDGVTGPTPLSYVKEGALDQVNTMDMLIGTIPGTIGETSVIAIIIGAIFLILMGVIDLRIPGTYIVTFVIFVGIFGHVAHPEIGFFDPQYITAHLCGGGLMLGAWFMATDYVTSPITKKGQIVYGIILGLLTGLFRIFGGSAEGVSYAIIISNLLVPLIEKVTLPKPFGKGGEK >NZ_CP041667.1|WP_143929914.1|1824595_1825915_-|electron-transport-complex-subunit-RsxC MALLTFKGGIHPDDGKGLAKGSEIVELKPKGSLVYPVSQHIGAPAAPVVKVGDVVLKGQKIAEAGGFVSAPIYSSVSGTVKAIEPHLNPTGGMVNSIVIENDGEYREVEYPEVKPLEEMSKEEILNAIGEAGVVGMGGAGFPTKVKLSPKEPDKIDYVIANCAECEPYITADYRAILEMPEKLVGGMKAVLRLFDNAKGIFGVEDNKPDCIAKLKELTKDEPRMEVLALKTKYPQGGERQLIYATTGRAINSAMLPADAGCVVDNVATMISIYQAVVEGRPSMERIVTVSGDAVNEPGNFKVPFGINQAELVEAAGGFKEDPQKLISGGPMMGFAMFTLDVPVTKTSSAILGFTKDEASKFEPTACINCGRCVEACPSRLIPSRLADYAENHNEEAFTKHEGLECMECGSCSYVCPAKRPLKQAIGSMRKIALANRKKK >NZ_CP041667.1|WP_143929915.1|1826095_1827193_-|30S-ribosomal-protein-S1 MSEKTFEQMLEESFKTIRNGEVVDGTVIDVKPDEIILNIGYKADGIIMRNEYTNEPNVDLTTVVSVGDKMTVKVLKVNDGEGQVLLTYKRLAAEKGNERLREAYENHEVLKAPVTQILGGGLSVVIDEARVFIPASLVSDTYEKDLSKYQDQEIEFVISEFNPRRNRIIGDRRQLLVAERAERQKELFAKLQVGDTVEGTVKNVTDFGAFIDLGGIDGLLHISEMSWGRVENPKKVFQVGETIKVLVKDIHDTKIALSLKFPETNPWANAAEDYAVGTVIEGKVARMTDFGAFVELTPGVDALLHVSQISRAHVDKPSDVLSVGQMITAKIVDLNVEEKKISLSMKALETAAETESVDDGGSEEE >NZ_CP041667.1|WP_143929916.1|1827173_1828031_-|4-hydroxy-3-methylbut-2-enyl-diphosphate-reductase MKIELAKTAGFCFGVRRAVDTVYQQVGQAGGKPIYTYGPIIHNDEVVKDLEQKGVKVLDSKEELAAVEEGIVIIRSHGVSKEICGLMERKGIRCVDATCPFVKKIHKIVEEESGKGSHIVIVGNPEHPEVEGISGWAGGPVTIIQTKEEAESFTIKDPLQKVCIVSQTTFNYNKFKELVEIIAEKGYDIIVLNTICSATKERQEEARDIAKRVGAMIVIGDKKSSNTRKLFEICSNACADTYYIQTLDDLDMNQLRSVETVGITAGASTPNKIIEEVQNNVRKNF >NZ_CP041667.1|WP_143929917.1|1828042_1828702_-|(d)CMP-kinase MGYQVAIDGPAGAGKSTIAKRVAKEKGFIYVDTGAMYRALALYFLEQGIRADETDRMTEAVSGAEVGIQYENGIQQVYLNGRNVTGRLREEAVGNMASKSSAIPEVRQKLLELQRELAKTEDVVMDGRDIGTCVLPDADVKIFLTASVETRARRRYEELKEKGIPCSLDEIAKDIQERDERDMTRKTAPLKQAEDAVLVDSSNLTVEEVTARIIELCRG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667_3 | 1896563-1896682 | Orphan |
NA
Consensus repeat of NZ_CP041667_3
|
1 spacers
spacers of NZ_CP041667_3
>3.1|1896608|30|NZ_CP041667|CRISPRCasFinder GCGGAAACGGCTTTGGCGGAAACGGAAATG |
CRISPR arrays and Neighbor proteins around NZ_CP041667_3
The CRISPR arrays of NZ_CP041667_3 >merge|NZ_CP041667|3|1896563-1896682|CRISPRCasFinder ACAGCTGCCTGTGGATCATCCTTCTCCTGTGCTGCTGCGGCGGCTGCGGAAACGGCTTTGGCGGAAACGGAAATGACAGCTGCCTGTGGATCATCCTTCTCCTGTGCTGCTGCGGCGGCT >NZ_CP041667|3|3|1896563-1896682|CRISPRCasFinder ACAGCTGCCTGTGGATCATCCTTCTCCTGTGCTGCTGCGGCGGCT GCGGAAACGGCTTTGGCGGAAACGGAAATG ACAGCTGCCTGTGGATCATCCTTCTCCTGTGCTGCTGCGGCGGCT
>NZ_CP041667.1|WP_143929964.1|1895776_1896412_+|DnaJ-domain-containing-protein MEGKTYYEILGVSSKATLEEITAAKNRLAKQYHPDVNMKNGIDTTALMQEVLEAYRVLSDTAARADYDRKTAGRPAVMQTYDLHSASASENQEDPAFVTYWKASNALYDIVTESDALFHQKDESSRIAQLAMQALKHIITLREGQIPERYWHPDIMNWLLFAWYKNRNFTTTYLLTLYDEHLKTEIPVVDKLKLKNKSLRYQHSVKRLIKY >NZ_CP041667.1|WP_143929963.1|1894779_1895670_-|S8-family-serine-peptidase MDRVKQAVRYWCADTRNGRCMGRNVTAAVLDSGIAPHPDFKGRILAFKDFVGGRRELYDDNGHGTHVAGILAGDGRMSGGILAGMAPEAKLVILKVLDQKGEGNMRQILEGICWLMRNGNRFGIKVVNISVGAREGLNEEKENWLVEAVERLWDAGIVVVVSAGNYGPEQGTIAIPGNSRKVITVGAYAKTARGQECSGRGPTKACVVKPDLVAPGYQIISCNRVSARNRKPYIVKSGTSMATPVVAGAGAMLLSKYPDMSNVEIKLKLRESCRRTGSGDGWGLLDVERLLGPGDF >NZ_CP041667.1|WP_143929962.1|1894344_1894689_-|DNA-binding-protein MNEILKQALLFDFYGELLTDHQKEIYGRFILDDLSSAEIAKEAGISRQGVHDLIRRCTQTLDGYEGKLHLVERFLAVKNKVKQIDELLDRYEAGKDRKALEEIRRISGEIIEEL >NZ_CP041667.1|WP_143929961.1|1892945_1894307_-|signal-recognition-particle-protein MAFDSLTEKLQNIFKSLRGKGRLTEDDVKEAMKEIKRALLAADVNFKVVKDFIKNVQERAVGQDVMNGLNPGQMVIKIVNEELIKLMGSETTEIQLQPGSAATVILMAGLQGAGKTTTAAKLAGKFKLKGKKPLLVACDVYRPAAIKQLQVNGEKQGVEVFSMGENHKPANIAKAALEHAQKNGNNIIILDTAGRLHIDEDMMKELQEIKEAVTVHQTILVIDAMTGQDAVNVAKEFNEKAGVDGVIITKLDGDTRGGAALSVKAVTGKPILYAGMGEKLSDLEQFHPDRMASRILGMGDVLTLIEKAEAEIDEEKAKEMSKKLKKAQFDFEDYLESMKQMKKMGGLGSIMSMMPGLGGMGGMGGLGKKGITEEQTAQAEKSMARMEAMIYSMTIEERRNPDLLNPSRKHRIARGAGVDISEVNRMVKQFAEMKKMMKMLGKGGRKNRFRMPF >NZ_CP041667.1|WP_143929960.1|1892630_1892876_-|30S-ribosomal-protein-S16 MAVKIRLRRMGQKKAPFYRIVVADSRSPRDGRFIDEIGIYDPNCEPSAISVDEEVAKKWLNTGAQPTETVSKIFKAAGIEK >NZ_CP041667.1|WP_143929959.1|1892376_1892607_-|KH-domain-containing-protein MKELVEVIAKSLVDDPDSVVVTEREEKKTTILEVRVADSDMGKVIGKQGRIAKAIRAVVKAAAAKEDKKVIVDIMD >NZ_CP041667.1|WP_143929958.1|1891793_1892303_-|16S-rRNA-processing-protein-RimM MEEIFQVGVITSTHGIRGEVKVFPTTDDPARFQDLTEVLLDTGKEKILLEVQNVRFFKQFVILKFKGLDNINDVERYRRCPLLIERKDAVPLEEDEYFIADMLGMEVAEEDGRRFGTLKDVIQTGANDVYVIDSLEHGEVLVPAIKECILEVDIPKGRMKIRVMDGLIG >NZ_CP041667.1|WP_143929957.1|1891044_1891779_-|tRNA-(guanosine(37)-N1)-methyltransferase-TrmD MRFHIMTLFPDMVMNGLNTSITGRAIQKGLLSVEAVNIRDYAFNKHNSVDDYPYGGGAGMLMQAEPVYQCYHALEEKIGKRPRVVYLSPQGQTFHQKMAEEFAQEEELVFLCGHYEGIDERVLEEIVTDYVSIGDYILTGGELPAMIMVDAISRLVPGVLHNDVSAEFESFQDHLLEYPQYSRPEIWHGKQVPEVLLSGHHANIEKWRRRQSVLRTARNRPDLLEEAELTEEEQELVKEILAFL >NZ_CP041667.1|WP_143929956.1|1890598_1890946_-|50S-ribosomal-protein-L19 MNEIIKNLEAEQLKENAPEFCVGDTVKVYGKIKEGNRERIQVFEGTVLKKQGGGARTTFTVRKNSNGIGVEKTWPLHSPNVEKVEVVRRGKVRRAKLNYLRNRVGKRAKVKELVK >NZ_CP041667.1|WP_143929955.1|1889936_1890536_-|signal-peptidase-I MKRRNQLGFSRRRRRRRPRRFHLNLEYLPQIGAWAFKIGVVCLFAFVAVWYFGQRVSTVGDSMKPILENGDVVLVNRIVYNATTPKRGDIIVFRPKGNENSHYYIKRIIGLPGETVEIIENRIYINGEKIEEDYKTSDIDDVGILSEPMTLGSDEYFVLGDDRENSEDSRNADVGNVKRSHIYGKAWFVISPWKNFGPI >NZ_CP041667.1|WP_143929966.1|1897007_1897502_+|hypothetical-protein MEARPPKLMTPFDSLVISDPLYTLKLLLPYTPPSAQRMLAVYIKFQEFRYTLEYFWGFPRPGSPDHLLQDLKPYMTPQERETMEQMEGMLNMMELMQEFPVLSGQEEDGAGSGGFPSSIDLIKHMLTPEQQELFHTYSTFFDDALASSQGEDAKGNKKGGSEHE >NZ_CP041667.1|WP_143929967.1|1897494_1897770_+|hypothetical-protein MNEWMNHPAMENIDPIKLELIKTAAKQTQGKSGNSLAPVMMALITSANKKGIRFQPEEISLIMDSLKEGKTKEEQEQIDRMMQMVKAYLKR >NZ_CP041667.1|WP_143929968.1|1897860_1898337_-|acyl-CoA-thioesterase MSVTSRKVSESVVQTVHIVRPNHLNAAGRLFGGILMQWIDEVAGVVAKRHAHQNVITASVDNLRFLRGAYQRDLIVIIGKVTYVGTTSMEVKVDTYVEDLDGVKTPINHAFFTMVALDQNDKPTPVPKLELETQEEREEWEKARKRRDVRMMRREEGF >NZ_CP041667.1|WP_143929969.1|1898333_1898840_-|hypothetical-protein MAEPTRRKPVAKEALLSAGCLYADTVQEAFQKYQSAVLSAQRKEDLRDFLRAVYQRNPGAMYADFYYPVLPRQEQERFRAGLSGEQRERLETLDTESGRIFYPVTEKELDFLYEITAAEWLFSSFYAGNQKALIWGNYGLNFPIFCEEEETLEFYLGLAQTYKVRWNK >NZ_CP041667.1|WP_143929970.1|1899031_1899769_+|tRNA-pseudouridine(38-40)-synthase-TruA MRNIKLTIEYDGSRYQGWSRLGKNESNNTISNKIQEVLRKMTGEFLIELSCGCRTEVGVHAYAQVVNFKTESDIETQEIKHYLNRYLPMDIAITDVEEVPERFHAQLNAVSQTYVYRMTIADVPSVFDRRHTFHCFKVPDKKAMQQAAMLLIGTHDFKNFSASKSSKSTVREILDIDIYGDEEEMQILICADNFLHNMARMIIGTLLDIGFGSRRKEEIEEIFEGASASSAPCDPKGLYLQEVRY >NZ_CP041667.1|WP_143929971.1|1899815_1900838_-|GNAT-family-N-acetyltransferase MEYEGEERKDCKERISEAQFLTFMHEAERLKCIPRHAWTSSDRRESVAEHCWRLCLAVWLLKEELPAVDIERLMELGLLHDLGEAMTGDIPAFEKHKSHEEKEQEAVKRLAALLPEAKGRELQDRLLEFEKAKTLEGKTAKALDKIEAVIQHNESKIGTWLPLEYDLQLTYGTEEAKKIPYLARLREAVREETMRKIEKETAKEKPRKGYYVSKDPKKLSLERAAALLRQSYWAKDRPKEMIRKAMEHSLCYGVYDEADYMVGYARVITDHATTFYLMDVIIDEPYRHQGLGTMLMDRIMEDMKGLHGVLHTTDAKEFYHRYGFVRNQEKLDSVMEKPRD >NZ_CP041667.1|WP_143929972.1|1900846_1901251_-|YjbQ-family-protein MNLFEHKISTGSPQQMTKVTGMIREDIAKSGVKNGIVVVYSPHTTAGFTINENADPDVVHDMLCGLEESFPTRRSYYQHMEGNSHAHLKTTCVGPSQTLILEEGKLVLGIWQDVYFCEFDGPRNRRFFVKILEG >NZ_CP041667.1|WP_143931252.1|1901266_1902124_-|patatin-family-protein MVTGTMVLEGGATRGVFTSGVLDYLMEKDLYLSHVIGVSAGSCNGVDYVSKQPGRTRDCMIQKDKEYNYYHGLRDFIKEKSVLDMDMVFDRYPNEIFPFDFDTYFASEMECEIVITNCVTGRAEYRTEDHDRDMLMKLCRASSSMPLLAPMVNIDGTPYLDGGLADSIPVERAMEIGNDKIVLILTRNPGYRKKPTSKGLANLYRRAYRKYPNLVSVTIQRNYIYNRQMNLIEKLEDEGKIFVLRPLIPTVSRLEKNYDALMHFYEHGYRLMKKQYNELLKYLEA >NZ_CP041667.1|WP_143929973.1|1902272_1903907_+|ATP-binding-cassette-domain-containing-protein MISTSNITLRVGKKALFEDVNIKFTEGNCYGLIGANGAGKSTFLKILSGQLEPTKGEVSITPGERLSFLQQDHFQYDGYPVLDTVMMGNARLYEIMKEKEVIYAKEDFTDEDGIKASELEAEFASMNGWEAESDAASLLNGLGIGTELHYEMMKNLDGPQKVKVLLAQALFGNPDILLLDEPTNHLDLDAIAWLEEFLINFDNTVIVVSHDRYFLNKVCTQIADIDYGKIKLYAGNYDFWYESSQLLIRQMKEANKKKEEKIKELQEFISRFSANASKSRQATSRKRALEKIQLDEIQPSSRKYPYIDFRPNREIGNEVLTVEGLSKTINGEKVLDNLSFTLNREDKVAFVGGNELAKTTLFQILSGEMEPDEGTYKWGITTSQAYFPLDPGDEFDNDYTIVEWLTQYSEEKDVTYVRGFLGRMLFSGEDGVKKVKVLSGGEKVRCLLSKMMISGANVLILDEPTNHLDMESITALNNGLIKFPGVILFSSRDHQIVQTTANRIMEIVPGGKLIDKITTYDEYLESDEMARKRQTYSVNQEEDD >NZ_CP041667.1|WP_143929974.1|1903909_1904497_-|thiamine-phosphate-synthase MCRREDWGRVTAVTNRRLTSRPYEEQMKRICRLRPAAVIVREKDLPEEVYADLAGRVKTICESYGVPCIYHTYLEAARQAGVRRIHLPLALLRSLEGEPSLREDFDQIGTSIHSLEEALEAVRLGADCLTAGHVYVTDCKKGLAPRGISFLREICQAVPIPVYGIGGIHPGTGQVEEVCSCGAAGACIMSGMMEI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667_4 | 1993071-1993171 | Orphan |
NA
Consensus repeat of NZ_CP041667_4
|
1 spacers
spacers of NZ_CP041667_4
>4.1|1993097|49|NZ_CP041667|CRISPRCasFinder GCCCCTGCTCTGGCCTCTTTCGCCAAAGTTTCGATTCTGGTTTCTGTCG |
CRISPR arrays and Neighbor proteins around NZ_CP041667_4
The CRISPR arrays of NZ_CP041667_4 >merge|NZ_CP041667|4|1993071-1993171|CRISPRCasFinder CCGGAACGGTTCTGGCCTCTCTCACCGCCCCTGCTCTGGCCTCTTTCGCCAAAGTTTCGATTCTGGTTTCTGTCGCCGCCTCTGCTCTGGCCTCTTTCGCC >NZ_CP041667|4|4|1993071-1993171|CRISPRCasFinder CCGGAACGGTTCTGGCCTCTCTCACC GCCCCTGCTCTGGCCTCTTTCGCCAAAGTTTCGATTCTGGTTTCTGTCG CCGCCTCTGCTCTGGCCTCTTTCGCC
>NZ_CP041667.1|WP_143930053.1|1990639_1991023_-|30S-ribosome-binding-factor-RbfA MRKRSIKNTRVNVEVQRELSSIIRGGLKDPRVAPWTSVVAAEVAPDLKTCKAYISVLGDEKAQKETIQGLESAEGYIRRELARTLNMRNTPEIHFILDQSIEYGVNMSKKIEEVTGQMETEGEKDAQ >NZ_CP041667.1|WP_143931256.1|1989696_1990650_-|bifunctional-oligoribonuclease/PAP-phosphatase-NrnA MLNRMIAQANTIAIGGHERPDGDCTGSCMGLYLYILENYSGKTVDIYLEQIPRSYQFLKRSGEIRHEAPKDQKYDLFICLDCGDKERLGFSQTLFEQAAHTLCVDHHISNKGFGEENYVVPDASSTSELIYDLIDKEKITLPVAEALYLGIVHDTGVFQYSCASPATLRAAADLMERGVDAPALIRKTYVEKSYAQNQVLGRALMESILFMDGRCIASYIRRREMEFYGVEPKALDGIVSHLRDTKGVEVAVFMYELKPSVYKVSLRSKEQVDVSRIAQYLGGGGHKRAAGFTMAGTPHDVLNNLSGLIEDQLGEPV >NZ_CP041667.1|WP_143930052.1|1988794_1989700_-|tRNA-pseudouridine(55)-synthase-TruB MINGILNVYKEKGFTSHDVVAKLRGITGQRKIGHTGTLDPEAEGVLPICLGRATKVCELLTDQDKTYEAVLLLGVRTDTQDTAGTVLEEGDASAVTEERLKETVKNYIGDYDQIPPMYSALKVNGKKLYELAREGKTVERKARRVRIHEIKILDIHFPRIRMRVNCSKGTYIRTLCDDIGREIGCFGCMESLVRTKAGPFEKSGAHTLEEIEQRQKQGTLGEILRSIDSLFQEYPAVYVEARWEKLALNGNSLRRGMIRTEGRAEDGERVRLYNEKGDFLALYGYQKKTQEYKAVKMFCGS >NZ_CP041667.1|WP_143930051.1|1987880_1988783_-|bifunctional-riboflavin-kinase/FAD-synthetase MEYLKGLEHYKDTRRSAVTFGKFDGLHRGHQTLIRKVAQLRQTEQVRAAVCAFDMKRTGILMTKEERAAHLEDSMDYLVECPFSEELRKMSGEEFIKEIICGVFHAKYVVVGTDFHFGYGQGGDADMLKEYAGAYGYEAIVLEKERYQGRIISSTYIKELMAEGNIQLADKLLGYPYSIKGVVEHGRRLGRTLGFPTFNVEWPQAKIVPPRGVYFSRTWLDGKCYAGITNVGVKPTVSTEDKVLAESFLFDYEGSAYDKEVRVELLDFRRPERKFADIQEMKAAIDRDIESGREYFIKNA >NZ_CP041667.1|WP_143930050.1|1987492_1987759_-|30S-ribosomal-protein-S15 MISKEQKEQIIAEYGRSEGDTGSPEVQVAILTARINDLTEHFKNNPKDHHSRRGLLKMVGQRRGLLAYLKKIDIERYRSLIERLGLRK >NZ_CP041667.1|WP_143930049.1|1985203_1987291_-|polyribonucleotide-nucleotidyltransferase MYKKYEMELAGRTLRVDVDRVAKQANGAVLMHYGDTTVLCTATASEKPREGIDFFPLSVEYNERLYAVGKIPGGFNKREGKASENAILTCRVIDRPMRPLFPKDYRNDVTLENLVLSVDQDCSPELTAMLGAAIATTISDIPFDGPISSTQVGLVDGELVFNPTAAQREVSKLSLTVASTKEKVIMIEAGAEEVPEQQMIDAIFAAHELNQKVIAFIETIVAECGKPKHSYESCAVPEELFEAIKEIVPPAQMEEAVFTDEKQVREENIRQITEKLEEAFAEKEDWLEVLGEAVYQYQKKTVRKMILKDHKRPDGRAIDQIRPLAAEVDLIPRVHGSAMFTRGQTQICTITTLAPLAEAQRLDGLDEAETSKRYMHHYNFPSYSVGETRPSRGPGRREIGHGALAERALLPVLPDETEFPYAIRTVSETFESNGSTSQASVCASSMSLMAAGVPIKSAVAGISAGLVTGETDDDYLVLTDIQGLEDFFGDMDFKVAGTHEGITAIQMDIKIHGLTRPIIEEAIAATRKARLYILDEVMAKTIAEPRPEVGPYAPKIIQMQIDPQKIGDVVGQRGKTINAIIEQTGVKIDIEDDGSVSICGTEAASMEEARKLIHIIVTDFEAGQVLEGKVVSIKEFGAFLEFAPGKEGMVHISKLAKERVNHVEDVLTLGDVVKVVCLGKDKMGRFSFSMKDVAE >NZ_CP041667.1|WP_143930048.1|1984554_1985085_-|methylated-DNA--[protein]-cysteine-S-methyltransferase MQYVSHYQSPIGRILLAADETCLTGLWFEGQKYFGLHLDKEREEKEIPLFQAAKQWLDIYFSGEEPRIPLPLHLRGTDFQKEVWEILRTIPYGQTMTYGEIAGRLAGKRGGKKVSARAVGGAVGHNQISIIVPCHRVVGANGSLTGYAGGIEKKVKLLELEKADNQFLFPGGRARM >NZ_CP041667.1|WP_143930047.1|1983685_1984582_-|LysR-family-transcriptional-regulator MSRREGADVSSIWKYKYFVDVIENRSFTKAGNINYVSQTAISQNISSLEKMAGGKLINRGKGEVVPTELGQIVYRRAKEMLEIEARMTREIEQFRNREVTYIGIDSAINKKMWMTYEQVYDPNFLRVGEKMECYNLDNMIGARMMKNHELDVFIGYENQALEDEPGVAGENLTGSRIGVYVGKNTTIPYGPLRLDDLRGHRCYLASSYSCSVQEEARGRLEDGCRFIEVKNVETMKIKVEFNDAFAFVDSRYFWRGDGEIRQLADYEETCAIRLYYFRASEKKNVSKFIKILKDKMEE >NZ_CP041667.1|WP_143930046.1|1982190_1983585_-|MFS-transporter MGESTTAVKKKHPWGFYVCNLTFTFERLAYYGAKPILLLFLIKAVGEGGLGIDNAQAAVIAANLTAYTYLAPIIGGYISDRWLGARYAIPLGSVIMAIGYLIGWKAANAAMVNLMVIVISIGTGFFKGNLSAIQGRMYDDKSMLDSAFSIQYSFVNIGSFVGSVATGYLYLNTFKNGDVLGFRQCFFLSAVFCLIAAVWFVANWKSLQGQGKKPFKYLTDTQGNIIGEDSKKDKKKEKSAEPLTRAEKRRVWAIILVSAFSIIFWLFYYQNELALTIYMTEYVDMHLAGIEIAPGWINTSLNGLLCVALGGVMAAVWRKLSERPQGDLNMFQKIGLSFLFLGCAFGVVVLAEFTRGVGSPAGNKVSVLWMVGFVFLLTIGEMCFSPLRNAFVSKYAPKKYLSLLMGVITVATFCASKLSPYVQVIIENMNIFPVLVAIFILLLLCALFMVATNKKLNKLVEDED >NZ_CP041667.1|WP_143930045.1|1980882_1982109_-|peptidase-T MKAYERFLRYVNVWTTSDETSETVPSTQRQFELGRLLAQELKDIGVEKVELNDMCYLYAEIPATPGYEDKPAIGLIAHMDTVMDFPGKDIHPQFTKNYNGEDVKLGESGKVLSVKEFPHLKNMKGRTLITTDGTTLLGADDKAGIAEIITAADEILKENIPHGKICIGFTPDEEIARGAKNFDVERFGADFAYTLDGDIEGEIQFENFNASTAYFTIHGVNVHTGSAKGILVNSQLIGMEIQSRLPDERPETTEGYEGFFHLMAFNGNTEKTQMRYLVRDHSGEKFKQRHETLRRIQKEMNEKYGEGTVELEIAESYYNMREKIEPCMHLVDIAKKAITDAGLTPDISPVRGGTDGARLSFKGLPCPNLGTGGSAFHGPFEHITVEGMDLAVGIVKDILKSYAAYEEK >NZ_CP041667.1|WP_143930055.1|1993899_1994223_-|50S-ribosomal-protein-L7ae MSQNKALSLIGLAVKAGKVASGEFCTEKEVKSGRAALVIVAGDASGNTKKKFQNMCSFYRVPIYFYKDKDTLGHAMGKEFRASLAVTDEGFAKGIRKHLDTEENTIA >NZ_CP041667.1|WP_143930056.1|1994209_1994488_-|DUF448-domain-containing-protein MSKNKKIPLRKCVGCQEMRSKKEMIRVIRTSEQEFLLDATGKKNGRGAYLCPNRECLEKARKCKGLERSFGQAIPPEVYESLEKEMECLESE >NZ_CP041667.1|WP_143930057.1|1994497_1995742_-|transcription-termination/antitermination-protein-NusA MNTELLEALNLLEKEKDISKETLLDAIENSLLNACKNHFGKADNIKLIMDRETCDYQLYAEKEVVEEVEDKLEQISLEEAKEIDSTYEIGDIVRIPIESKSFGRIATQNAKNLILQKIREEERKVIYDQYFEKEKDIVTGIVQRYVGRNISINLGKADAMLTENEQVKGEVFKPTERIKLYVVEVKNTTKGPKILVSRTHPELVKRLFESEVTEVRDGIVEIKSIAREAGSRTKIAVWSNDPDVDPVGACVGMNGARVNAIVNELRGEKIDIINWSDNPAILIENALSPAKVISVMADPDEKTASVIVPDYQLSLAIGKEGQNARLAARLTGYKIDIKSETQAIESGELPENYMEMSEGVFEEEMYEEDYDESYEDESYEEGAYEEGYDESYEEAPEDAVYDGEADGEQESEEE >NZ_CP041667.1|WP_143930058.1|1995760_1996228_-|ribosome-maturation-factor-RimP MSKREVYEQKTEQLLQPIVDEYGFELVDVEYVKEGSTWYLRSYIDKTGGISIDDCEKVSRRLSDLLDQEDFIEDAYIMEVSSPGLGRPLKKEKDFKRSLGEEVEVKTYRMIDKQKEFTGILKDYDEDTVTITLADETEKTFDKGDIALIRLAFDF >NZ_CP041667.1|WP_143930059.1|1996395_2000457_-|hypothetical-protein MSIKWKKVIALIVAIILSAGSIMQTVAATPTEVENEVEVTDTPENGEKENITEDNQELGKDDSNVLDNQEEDKPFEDNSELSNNAKSQKQKNRLSVENESPFSGGNGTEESPYLISSDEELLAFAEVVNNQEGEYQTAHFALAKNIYLNDITDYDEWDKTAPKNEWPGVENFFGTFDGKQHTIYGLYMNSDSDRVGLFNSIEVYNWIDRKLVIKNLQLKNIYIKGQSYVGGLVGYAYCKVEITNCNVEGEVIGENSDIGGIGGFFSSGGNKGLTLTSVTNRAKVSGRTRTGGIWGGASIGNNLGSDTSGGGSEKIILKNCNNYGDILGTVSYTGGIIGQLRINLNCNGFDIERTQNFGTVTGKESTGGICGCIDVVAADSISASLEEICNEGNVFGEDCVGGICGEAALDRGDGYFRNVYNIGKIQGSSCVGGIIGVSTRFDVNSSYNVGIVTAETKVASLVGWANLYLGDIETVKVNNCYYAKGTAEVDTDGSASGVLELSEEEFKLESNFSGFDFVDIWKMGEKYPIFIWQDGNGSGNQETSKTDQFIIEKVKEYTSDAVYAQWEEISNSEISEETKFQRYTELFNQYGFLDAKEGIQYLSKTTNERYAYLTLTTDETYCAYNFYDWLHNTGKGAVARGLLIADGLIFNNELSDWTDLSTYIEGDYPGVVKYQDMLYDFMETASFDVEKMTYISDVSKLSGTVTEAGKVYADQLIKKLNECKNRNELRKALRSNEAIGVYSDVKTQGSDLNSFKLSFTLDETSGFGQFYKAMGYATKTLDIVDMSIQHVEDIIQLDSKLQVYEQYRTFLNEVELALDLPLEMRLAAKKILRDMKEGTWGELTDFAMDIVDKTSVKAKISDAILKAAIGQTGAATLNEFLLVLKIEAYFVNRIMDVGKLVQGVAYVEGYSTLSSHYREKLEESKKNFLTDQSAENAWDFYDNYSLLYMLRVKGEEAYLNMCKVEGLINIFTDCGYSEKEEVVKETIQTLEERCFFNLEEGIEIPESVQFALKSVISCPVDVSVYAPDGTLVAKLRDGEENNISNEYGQFSVVYRPYSGEYAKVLCFYKDADYKLEIAGVDKGLVSFELAEEKEGEITTYSFENQGIEAGNILRTSIEQIRGDGTVELDIDGNGVTENTIYLNKNPEETYRPVESLELKETFLILETGGSALMEVKISPDEATRQQVFWLTGDENIARVEEGKVSAISKGTTNIYCVSQDNRDQMAVCKVIVRSKGACTHPLEKVDEKKPDSEHTGNIEYWYCPECDRYYEDENGMIETSLEKVTLSKLSSQEGMGTNPDEQDNSADSEGMREQQQTPDTGDKGVEIWLLVCALSGIIIAGIGTGKMWRRHEK >NZ_CP041667.1|WP_143930060.1|2000596_2000827_+|helix-turn-helix-domain-containing-protein MISFEPFRKLLKEKGISTYYLRHKCGLYNLDSKTIQRLMADESVSTNTLNALCNIFDCDISEIVQFIPDNNISQNE >NZ_CP041667.1|WP_143930061.1|2000871_2001765_-|NAD(+)-diphosphatase MIQDIQPYQYDNVYHPALPQAQDFLLCYKGNRTLVKQNGEKIVFPTFQDAEKGWEKERESLYQGAVYLFSIRRPDESGGKETALKEIRFYLHPEVEGRRLEADGFAWEDNWLFRTAKPKYLRFAGITGWQLFRWYESHRFCGRCGVPMVRDEKERMMRCPECGLMEFPKICPAVIIGVTHGNKILMSKYAGRDFKEYALLAGFCEVGETIEETVKREVMEEVGLKVKNITYFKSQPWSFSDTLLMGFFCELDGDGTISIDQEELSMAEWFEREDMPVKEEDLSLTNAMMMAFKEGKI >NZ_CP041667.1|WP_143930062.1|2001769_2004007_-|DNA-topoisomerase-4-subunit-A MQNSQIIRTEYSDVMKKSYIDYAMSVIVSRALPDVRDGLKPVQRRTLYDMHELGLKADRPYRKCARIVGDTMGKYHPHGDSSIYEALVVMAQDFKKGTVLVDGHGNFGSIEGDGAAAMRYTEARLAKITQEAYLQDLDKDIVNFVPNFDETEKEPEVLPVRVPNLLVNGAEGIAVGMATSIPTHNLGEVIDGVKAYMKNNEISTRQLMKYIKGPDFPTGGIVVNKDDLLQIYETGTGKIKLRGKVEVEELKGGRSQIVISEIPYTMIGTGIGKFLNDVYGLVESKKTSDITDISNQSSKEGIRIVIELKRGADAENLINMLYKKTRLEDTFGVNMLAVADGRPETMGLKKIIEHHVDFQFELATRKYQTLLAKEKDKKEIQEGLIKACDVIDLIIEILRGSQSVKDAKACLTNGVTENIKFKSSISKKMAAMLRFTERQATAILEMRLYRLIGLEIEALMKEHEETLKNIERYEDILNNYESMADVIIQDLDQLKKEFSWKRRTQIENAQEAVFEEKKIEEQEVIFLMDRFGYAKTVDRATYERNREAADSENKYVISCLNTGKLCIFTADGKMHQVKVLDLPFGKFRDKGQPIDNVSNYDSTQEEIIYICDAEQMRFAQLLFATRMGMIKKVSGTEFQVSKRTIAATKLQAEDAVVSVQVVSDSQQVVLRTKEGYFLRFGAEEVSEKKKGAVGVRGIRLKKKDELEEVYLFEEGTETKVQFGDREITLNRLKAAKRDGTGTKAR >NZ_CP041667.1|WP_143930063.1|2004022_2005945_-|DNA-gyrase-subunit-B MAKKNTYDADSIAILEGLEAVRKRPGMYIGSVSTKGLNHLIYEIVDNSVDEHLAGYCSEIRVTLERDGSATVADNGRGVPVDLHAKGVSAERVVYTTLHAGGKFDDSVYKTSGGLHGVGSSVVNALSAYMDVEISRDGYVHHDRYERGVPVIELEDGLLPKTGKTRKTGTKINFLPDDTIFEKTRFRAEEVKSRMHETAYLNPELTIIFEDLRGETKEHIVYHEPEGILGFIRDLNSKKETVHEPVYFKGEAEGIQVEVVFQYVNEFHENVLGFCNNIYNGEGGTHLTGFKTTFTTVINQYARELGILKEKDANFTGADVRNGMTAIVSIKHPDPRFEGQTKTKLDNPDASRAVSKVAGDEIVRYFDRNLENLKKVIGCAEKAAKIRKTEEKAKTNLLTKQKYSFDSNGKLANCESRDPSKCEIFIVEGDSAGGSAKTARDRMYQAILPIRGKILNVEKASIDKILANAEIKTMINAFGCGFSEGYGNDFDISKLRYDKIIIMADADVDGAHISTLLLTLFYRFMPELIYEGHVYIAMPPLYKAMPKKGEEEYLYDDKALEHYRKTHDGPFTLQRYKGLGEMDAQQLWETTLNPESRMLKLVEIEDARMASGVTEMLMGTEVPPRRTFIYENALEAELDV >NZ_CP041667.1|WP_143931257.1|2006021_2007149_-|stage-II-sporulation-protein-P MLILYAGFHINIHLPEEARMELNRFLGSKAEDTYLTGFAYCKDGSGEAPSRWAAEAAMELVPLGSYVEGKTMTDTAIEDQETYEMILAQQANDENAVDENGNLIGEEEKADTAQAASSAIDTSLEKLKNFDYLIGNFYTVDGSTAVDPEQLNAEILLSKNMKIDTKSDGPKVLVYHTHSQEAFADSKKGDESASIVGMGDYLTELLNDTYHISTIHHKGVYDLIDGQLDRSRAYELAEPKIQKILEENPSIEVVIDLHRDGVGESTHLVTEINGKPTAQIMFFNGMSRTKANGSIDYLKNPYIEDNLAFSLQMQLAAAEKYPGFTRRIYLKSYRYNMHLKPKTLLVEAGAQTNTVEEMRNAMEVLAETLDTVLTP |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667_5 | 2242633-2243738 | TypeI |
NA
Consensus repeat of NZ_CP041667_5
|
16 spacers
spacers of NZ_CP041667_5
>5.1|2242666|36|NZ_CP041667|CRISPRCasFinder,CRT GAAACAGATTCTGCAGCTTTCGGAAACGGAGGTGAT >5.2|2242735|33|NZ_CP041667|CRISPRCasFinder,CRT CGCAACCACGGTCCAGGACCGCGTGTTCTATGC >5.3|2242801|32|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR CATATAGTATATACCGCCTATAACAAGTACCT >5.4|2242866|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR TCCTCTGGGAGAAGATCCTGAATCTCTTCTGTCA >5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR ACTATAAAAATACTAGCGCAAGTATATTTTTTCT >5.6|2243000|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR AGTTGTGGAAAGAAAAGGGAAGAACAGTTCCCT >5.7|2243066|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR GGGGCTCACTACCCCAAAACAGATTCGATTTTTA >5.8|2243133|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT >5.9|2243201|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR TGGAGGGGGCCGGCGGGAGCACAGACGGAAAGGCA >5.10|2243269|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR GCGATCTGGACGGAAATGTGCGGGGCTATTAAC >5.11|2243335|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR AGATCGGTATCACGGCAAAAGGCGCCGTAAAGGTA >5.12|2243403|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR GGAGTATGAATATGGGAAATATTATCTAGAAGGA >5.13|2243470|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR CGAGTGCCAAAAATATTTGAGGCCGAGGCTGAG >5.14|2243536|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT >5.15|2243604|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR ACACTGCCGCTACCGTATATGCAACAGAAAACG >5.16|2243670|36|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR GAGCAGCCGCCATCACTTGAGACGGCGGCACTGTAT |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around NZ_CP041667_5
The CRISPR arrays of NZ_CP041667_5 >merge|NZ_CP041667|5|2242633-2243738|CRISPRCasFinder,CRT,PILER-CR ATTTCAATCCACACTTCCCATGCGGGGAGTGACGAAACAGATTCTGCAGCTTTCGGAAACGGAGGTGATATTTCAATCCACACTACTCATACGAGGAGTGACCGCAACCACGGTCCAGGACCGCGTGTTCTATGCATTTCAATCCACACTCCCCATGCGGGGAGTGACCATATAGTATATACCGCCTATAACAAGTACCTATTTCAATCCACACTCCCCATGCGGGGAGTGACTCCTCTGGGAGAAGATCCTGAATCTCTTCTGTCAATTTCAATCCACACTCCCCATGCGGGGAGTGACACTATAAAAATACTAGCGCAAGTATATTTTTTCTATTTCAATCCACACTCCCCATGCGGGGAGTGACAGTTGTGGAAAGAAAAGGGAAGAACAGTTCCCTATTTCAATCCACACTCCCCATGCGGGGAGTGACGGGGCTCACTACCCCAAAACAGATTCGATTTTTAATTTCAATCCACACTCCCCATGCGGGGAGTGACATTGTACAGCTAGCAAAAGACCATGGATGGGAGCTATTTCAATCCACACTCCCCATGCGGGGAGTGACTGGAGGGGGCCGGCGGGAGCACAGACGGAAAGGCAATTTCAATCCACACTCCCCATGCGGGGAGTGACGCGATCTGGACGGAAATGTGCGGGGCTATTAACATTTCAATCCACACTCCCCATGCGGGGAGTGACAGATCGGTATCACGGCAAAAGGCGCCGTAAAGGTAATTTCAATCCACACTCCCCATGCGGGGAGTGACGGAGTATGAATATGGGAAATATTATCTAGAAGGAATTTCAATCCACACTCCCCATGCGGGGAGTGACCGAGTGCCAAAAATATTTGAGGCCGAGGCTGAGATTTCAATCCACACTCCCCATGCGGGGAGTGACATTGTACAGCTAGCAAAAGACCATGGATGGGAGCTATTTCAATCCACACTCCCCATGCGGGGAGTGACACACTGCCGCTACCGTATATGCAACAGAAAACGATTTCAATCCACACTCCCCATGCGGGGAGTGACGAGCAGCCGCCATCACTTGAGACGGCGGCACTGTATATTTCAATCCACACTCCCCATGCGGGGAGTGAC >NZ_CP041667|5|5|2242633-2243738|CRISPRCasFinder ATTTCAATCCACACTTCCCATGCGGGGAGTGAC GAAACAGATTCTGCAGCTTTCGGAAACGGAGGTGAT ATTTCAATCCACACTACTCATACGAGGAGTGAC CGCAACCACGGTCCAGGACCGCGTGTTCTATGC ATTTCAATCCACACTCCCCATGCGGGGAGTGAC CATATAGTATATACCGCCTATAACAAGTACCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC TCCTCTGGGAGAAGATCCTGAATCTCTTCTGTCA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ACTATAAAAATACTAGCGCAAGTATATTTTTTCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC AGTTGTGGAAAGAAAAGGGAAGAACAGTTCCCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GGGGCTCACTACCCCAAAACAGATTCGATTTTTA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC TGGAGGGGGCCGGCGGGAGCACAGACGGAAAGGCA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GCGATCTGGACGGAAATGTGCGGGGCTATTAAC ATTTCAATCCACACTCCCCATGCGGGGAGTGAC AGATCGGTATCACGGCAAAAGGCGCCGTAAAGGTA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GGAGTATGAATATGGGAAATATTATCTAGAAGGA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC CGAGTGCCAAAAATATTTGAGGCCGAGGCTGAG ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ACACTGCCGCTACCGTATATGCAACAGAAAACG ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GAGCAGCCGCCATCACTTGAGACGGCGGCACTGTAT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC >NZ_CP041667|5|1|2242633-2243738|CRT ATTTCAATCCACACTTCCCATGCGGGGAGTGAC GAAACAGATTCTGCAGCTTTCGGAAACGGAGGTGAT ATTTCAATCCACACTACTCATACGAGGAGTGAC CGCAACCACGGTCCAGGACCGCGTGTTCTATGC ATTTCAATCCACACTCCCCATGCGGGGAGTGAC CATATAGTATATACCGCCTATAACAAGTACCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC TCCTCTGGGAGAAGATCCTGAATCTCTTCTGTCA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ACTATAAAAATACTAGCGCAAGTATATTTTTTCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC AGTTGTGGAAAGAAAAGGGAAGAACAGTTCCCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GGGGCTCACTACCCCAAAACAGATTCGATTTTTA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC TGGAGGGGGCCGGCGGGAGCACAGACGGAAAGGCA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GCGATCTGGACGGAAATGTGCGGGGCTATTAAC ATTTCAATCCACACTCCCCATGCGGGGAGTGAC AGATCGGTATCACGGCAAAAGGCGCCGTAAAGGTA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GGAGTATGAATATGGGAAATATTATCTAGAAGGA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC CGAGTGCCAAAAATATTTGAGGCCGAGGCTGAG ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ACACTGCCGCTACCGTATATGCAACAGAAAACG ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GAGCAGCCGCCATCACTTGAGACGGCGGCACTGTAT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC >NZ_CP041667|5|1|2242768-2243738|PILER-CR ATTTCAATCCACACTCCCCATGCGGGGAGTGAC CATATAGTATATACCGCCTATAACAAGTACCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC TCCTCTGGGAGAAGATCCTGAATCTCTTCTGTCA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ACTATAAAAATACTAGCGCAAGTATATTTTTTCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC AGTTGTGGAAAGAAAAGGGAAGAACAGTTCCCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GGGGCTCACTACCCCAAAACAGATTCGATTTTTA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC TGGAGGGGGCCGGCGGGAGCACAGACGGAAAGGCA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GCGATCTGGACGGAAATGTGCGGGGCTATTAAC ATTTCAATCCACACTCCCCATGCGGGGAGTGAC AGATCGGTATCACGGCAAAAGGCGCCGTAAAGGTA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GGAGTATGAATATGGGAAATATTATCTAGAAGGA ATTTCAATCCACACTCCCCATGCGGGGAGTGAC CGAGTGCCAAAAATATTTGAGGCCGAGGCTGAG ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ATTGTACAGCTAGCAAAAGACCATGGATGGGAGCT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC ACACTGCCGCTACCGTATATGCAACAGAAAACG ATTTCAATCCACACTCCCCATGCGGGGAGTGAC GAGCAGCCGCCATCACTTGAGACGGCGGCACTGTAT ATTTCAATCCACACTCCCCATGCGGGGAGTGAC
>NZ_CP041667.1|WP_143930289.1|2241696_2242470_-|alpha/beta-fold-hydrolase MPYFTYQSKQIYYSEAGAGKPLLFLHGNTASSKMFEPLLPLYTDNFKVILIDFLGNGRSDRVEKFPASLWQEWAKQTVALLEHLYGEKIVEEKVSIAGSSGGAWAAINAGLIRPDLVDRIVADSFDGRTLGEGFVENLLKERESAKKDEQAAGFYQWCQGEDWEQVVDKDTEALVQCGKQKLPLFVRPLSELQVPLMLIGSLGDEMTRENLQKEYEEIAVETRGQVCLFPEGGHPAMYSNAEEAAEAICRFLKEDCL >NZ_CP041667.1|WP_143930288.1|2240348_2241521_-|hypothetical-protein MKKITEWMRTHKLYATTAAVLIMVLLSASAAAASGVFSPADQETGREKAGSSTSDTVDLSLCIMADKNWDENSTPAVAHIAGADESNTDVDFYHAILPGSEEGEGTSSVSLEAGAYKVEFLSPVNQDGSVYKTGSAQNIAVSTDTEDVPAIDCTLTLIPADQVTDEMLQDIIDQTKAAIENGDETLKGDGGQAVLDKLEENVAKNPNVSDQTKQDAAAADTKVNVYDDPKTATSADNKAASSKDTSQNTSSGTDSGRSGSAGSKGSGSNSSSDQTAKPSHTHKWKSHTAKRWVSNWVTVVDTPARTVYGARLYTENSDGTWTANGETYWFENGFTSSDLAAIIKDKMKNEGYIGNYQNVSKTVPAVTHQEDQGYYEEYVDYQYCGCGATR >NZ_CP041667.1|WP_143930287.1|2239387_2240251_-|glycosyltransferase-family-8-protein MEGQTEKGGLQLNLLVTLDQNYFPQLEVLLTSLELNDPGEKFTLYLLQNHLPEELLTRTANWCHARGYEFCPVQMNGEMFRKAPANDRYPVEMYYRLLAGQMLPEHVDKILYLDPDILVINPLRELWEMELEENLFAAAAHTGKTEFANDMNRIRLGVEHDYYNSGVLLMNLKLARKEILPEKLFQYVETHRKELVLPDQDLLNALYGERILPLDDAVWNYDARNYNNYMMRSGAKYNSKWVMEHTSILHFCGKAKPWKPGYFYRFGMLYLHYQRLAARSWEASQEE >NZ_CP041667.1|WP_143930286.1|2238630_2239278_-|TetR-family-transcriptional-regulator MARNKYPEETEERILDTAQRLFLEKGFENTTIQDIVDELGGLTKGAVYHHFKSKDEIMDAVGDRMFFANNPFEAVQGREDLNGLEKLREMIRLNQADTARVDLNIQAIPITRNPRVLMEMIKSNRRILTPYFQKLIEEGNQDGSIHTEYTREISELLPLLTSLWMIPSIFPATKEEMKHKFRFLGDMLEKLGIPVMNEELYQLVDDFFEKMPQED >NZ_CP041667.1|WP_143930285.1|2237318_2238536_-|MFS-transporter MKQRLFTRNFALLVLGQISSLVGNYSLKFALSMYVLEQTGEASVFAGMLAAAMVPAIILSPFGGILADRANRRNIMVELDTISGGAVLITCAVFPLGDKIFLVGLLLFVLSVLGAFESPTVQACVPQMLSGDNLLKGNAVVSQVQAVAGLVTPFLGSVFYTAFGIRTVFYGAAVCFFITACFECFIRLEGQKPEKKMGLWEIVKADFRESMGFLRGERTGVLRLLALAAAVSMFVVGVVSVGFPYLIRTTLGLSPEHYGAAESLAGVAAVLGSLAAGLLGQKIGLRHFPFLIEGTGLLFLPAGIGFLLPVGILGKYGILVAAVCGCQFVCSMFSIYALSAIQEGTPERLTGKVMACVYTISMCAQPLGQVLYGILFDVFSAQVYWVLIPTGLLICVIGLSAGRMK >NZ_CP041667.1|WP_143930284.1|2235892_2237266_-|MATE-family-efflux-transporter MKKERDRTLKRNKYEIDMCNGSIMDKLISFSLPLMLSSILQLLFNAVDIIVVGRFSGSQALAAVGSTTALINIFTNLFIGVSLGANVLAARYYAAGQDKEMSEAVHTAITLALVSGILMAFVGVGAARPALSLMNTPQDVIGQAAVYMRIYFLGMPFFMLYNYGAAILRAVGDTKRPLLFLLAAGMANVGLDLLLVVVIPLGVAGVAIGTVTSQMISCILVLWCLHRTESSYRLRFSKLKIRSVYLKRIFQVGIPAGIQSTVINFSNAMLQSSVNSFGATAMAGYTAANNILSFLYVAVNAVTQACMSFTSQNYSVGKQKRMDRVLLDCGILSVGVSLVLGVGAYLFGSQVLGIYTSDTDVVQCGLEILSITTVPYFMCGIMDLLPGALRGMGHSAVPMVLSVIGTVGTRIVWIYGFFPQHRSLHFLFISYPASWILTIIMQAVCFWFVRKKIVKSL >NZ_CP041667.1|WP_143930283.1|2235129_2235804_-|response-regulator MTVKIGIIEDDEALREGLKLTFELEGLEVASAGNVKEGWKLLEEEGCGLVILDCNLPDGSGFDLVRRLREISDLPVLMLTARDSEMDEVKGLELGVDDFMSKPFSLAVLQARVRKILKKQEAPRRLSSGGITVDLGSGEVLRMGEKVALSTTEQRLLVLFMEHKGQILAKDQILSRIWDESGNYVDENTLAVSIRRLRVKIEEDPGKPRRIKTVHGMGYVWQEN >NZ_CP041667.1|WP_143930282.1|2234127_2235075_-|HAMP-domain-containing-histidine-kinase MEAVIGIGIINIAAALAGLLYFRRRYFLLYQTVQKLQKAVLEGEKIGKESDRDLPEDVLRDGFWRIQKKFQMEAEGARMEKQAVQGLISDLSHQLKTPLANIRLYQELLKNPGLDLKKRQGLQERLDEQTDKLDWLLSALFQMVDLERGVAALAAEEAPVLPALRQAVEAVLPKAERKEIQFQVEESRQRELAETELLFDPKWTEEVFINILENGVKYSPKGSVIHLSMECYETYGAVQIRDEGPGISKDEYSKIFQKFYRGSKTKEQEGWGIGLYLSRLVLEREQGYIKVESEEGKGSTFSVFLPIARQGGKKK >NZ_CP041667.1|WP_143930281.1|2233355_2234033_-|ATP-binding-cassette-domain-containing-protein MKMEAVKVTDVEKSFTDGMGREKVLKNVTFSLEEGTFTALTGISGAGKTTLLHLLAGFQKPDKGEILLAGRQIQKMKEKEAAPFRRRHIAMVFQEGALLPGLTVEENMILPVVMDTGRLLDEEKLNQILETLSLTGQRNRYPAELSGGERQRAVFGRALFSEAELILADEPAARLDTRQSLELMGTLKRCAKVWHRTILMATHDLDLAQICDETLELRDGCIWSK >NZ_CP041667.1|WP_143930280.1|2230954_2233309_-|FtsX-like-permease-family-protein MKIIWKVIKTEIRQKPGRFAALVTGMTAAVFLITVVTAFSNSCLHSMIEQEKKENGPYEAVFHNLTEEQAERLSESSQIKQTWKIKDCREEDTEEGRACYGAAFRRISLSIFERSQDVGVEIGMDGLPLEEQPILFSRPNWSVTSSFDITFNERLLGYYGINAFQTTAGSLVAIILMDVVIVLFAAALLYYVVLSGLEEKLKTLGLLDGIGISDRQKRLYIYGENLLAGLLAVPLGTLLGMGGLAVSIRYLNQWVLPTQKVEMHVSPIWLLAVFFGCLALVVLSGTGLYARARKERILNLISGYDEEEEVNRTAVLLKAKRHFFKVETLLAVKNVIMNHKNYAVSATLLVIALSVFLNGIMYIRGMTAVTDEVPRYPPISMWIQGNGVDEADFEKLAEELRGFAEVERVSLVKEAKEYTALEGLTKTEIQEYLKLFQAESALDGYEMLRDYEWENLAEEEGLKNIVRVIGVDDAVFDEYLGGAKRAGQVPEDGAILFRADLEGQEEAAEFPVMVDQELHMLPVACINPEVEEEVLLPEYLLTAEEAKKGIGTYKTMEAVEVFVRQDTFDRLLGDDKDPSVYLEISLSRDQGKRVSLNEILYPDQVTERMKEDDEIRQKIQKAGEALGITHLQVFSFAEEYHRSFFQGGKGMEILLVTAVVSASWVAAVLVILQKDAACLRRRKKEFALLLTIGMTRGKIFKMVFVEHLLYAFVGIAAGIPLSLFFLSGIYGDGGARQMASAWDVPADLVVWQVVLTLCVVLIPFFYTVRELRNIDVISVIRKEE >NZ_CP041667.1|WP_143930290.1|2243949_2244240_-|CRISPR-associated-endonuclease-Cas2 MLVLITYDVNTETAAGKKRLRKVAKQCTNYGRRVQNSVFECIVDHAQCVALKAILTEIIDEDVDSLRFYYLGNKYKTKVEHVGVDRGIAVDDTLIL >NZ_CP041667.1|WP_143930291.1|2244279_2245311_-|type-I-C-CRISPR-associated-endonuclease-Cas1 MRKLLNTLYVASENSYLSLDGENVVVLEEQKEVGRVPLHNLEGIVSFGYRGTSPALMGVCAERNISLCYLTPQGKFLARISGRVKGNVILREQQYASKNDEKISLEIAKNCILGKVYNARWVLERAVRDHALQMDVDKVKTASAFMKNSLEQIQNSESKEQLRGYEGEAASIYFGVFDQLILQQKKDFSFHGRNKRPPVDKVNALLSFVYTLLTNNITSALESVGLDPYVGYLHTDRPGRVSLSLDLIEELRAVLADRFVITLINKKIVSGKNFSTKENGAVLMDEELRKRVLIEWQAKKKEIITHPYLKEKVEWGMVPYVQAMLLARYLRGDLDGYPVFLWK >NZ_CP041667.1|WP_143930292.1|2245307_2245973_-|CRISPR-associated-protein-Cas4 MEYSEDDYLMISGIQHFKFCRRQWALIHIEQQWEENVHTVVGELMHKKVHDPLLKEKRKDTITARALPVSSRELGISGECDLVEFHKCEDGIKLYGHRGLYSVYPVEYKKGKPKLSEEDILQLTAQAMCLEEMFSAQVPEGAIFYGETRRRERIEITEELREEVRSMFQEMHNYYARKYTPKVKYSKSCNACSLKDICLPKLGKAVSVKTYIDQMLKEEEI >NZ_CP041667.1|WP_143930293.1|2245972_2246884_-|type-I-C-CRISPR-associated-protein-Cas7/Csd2 MSEVIKNRYEFVVLFDVENGNPNGDPDAGNMPRIDPESGLGLVTDVCLKRKIRNYVETVKEDEKGYQIYIKEDVPLNRSDREACKDLGIEETDDKKVTEALKKLKKSDPDTDVKLRDYMCSNFYDIRTFGAVMTTFVKASLNCGQVRGPVQIGFARSIDPVISQEATITRVAITTEKDAENKSTEMGRKSIVPYGLYRAEGYISANLARKVTGFSEDDLTLLWEAIINMFENDHSAARGKMAVRELIVFKHSKELGDCPAYKLFDAVEVKKNEDVEYPRRYQDYTVEIHEDRIPEMVEVKRMI >NZ_CP041667.1|WP_143930294.1|2246884_2248621_-|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MILQALAEHYDRLAEQGKVSREGWCMAKVSYGINLSKEGEVTGIIPLKTEEERGKKKVWAPQVLTVPEMVTRSSGVSANFLCDNSKYLLGIDAEGTNQRLTDCFEAAKEKHLTLLKDVNSEMAQAVCRFFESWDPRRAEENNEIQDHWDDITDGGNLIFCREINYAQDDPAIQEAWERARDSSDESGQMGICLVTGKQAEISRIHKTIKGVPGAQSSGAALVSFNAPAFESYGKEQSYNAPVGRYAEFAYTTALNYLLGQREYTFQLGDSMILFWAEDAKEEYQAAFFSCADPKRDNQKEIKGIFDNLKQQRQVQFDDVVLNLEQKFYILSLAPNAARLSVRFFYQDSFGNILGNLAKHYERMSIVKPSWAGEEYLGIRSMLSETVNQNSKDKTPVPNMAALVLQAILSGARYPASLYTDVLIRIRAEQGNITWGRAGIIKAYLIRNMGWKEGENYMGLNEESQDRAYVLGRLFSVLESIQLDTNPGIKATIRDRYFNAACATPASVFPILVKLKNSHMKKLEREKGSAKIYYEKLLTEIMGKIEGEFPARLSLEEQGKFILGYYHQVQKKYEKREEN >NZ_CP041667.1|WP_143930295.1|2248617_2249280_-|type-I-C-CRISPR-associated-protein-Cas5 MGMGVKVRVWGQYALFSRPEMKVERCSYDVMTPSAARGILEAVYWHPGMKWVIDKIHVVNPIRFTSVRRNEVKSKILASNVLQVYNGADKPLYISAKSDIVQRASLLLRDVEYVIEAHFEMTDKANETDNPGKFKDIVMRRLRRGECFHMPYFGCREFPANFALCEEDDIKTAYDVVEEKDLGFMLFDLDYSDQNDIKPMFFRAVMRKGILDLRDCEVVR >NZ_CP041667.1|WP_143930296.1|2249307_2249748_-|hypothetical-protein MGRKNNNALFFTCSLIEYIGRSTKRKRGKVTDFLGKERIERIYEYADVFHCDPIEKTAAEFIEEAHITEGTFDNVKDCRYTVPGYWDIGEVYERLIEDVYEEENIVNGIWEVYHSWIDAQISDYNTDFYYQPRDYIAECYKEGVIL >NZ_CP041667.1|WP_143930297.1|2249731_2250199_-|DUF3990-domain-containing-protein MILYHGSREIVEYPEIRKARFNKDFYFGFYCTQYEEQAKRWASRYGEIGYLNKYEYVPNHSLTYLVFEKMTEEWLDFVISCRSGISHTYDIVEGPMADDTIYNYIQNYMDGKISRAAFWELAKFKYPTHQISFHSISALDTLKFIGSEVVYGKKK >NZ_CP041667.1|WP_143930298.1|2250338_2252399_-|hypothetical-protein MRQKQEAVKNTDIQKQKNEWRKKGWTIPDLRGGKQVWYDLVKRLVELVEIGEAYDLDMIPDTGNLVNGQTWRVYTPFLKGIGLVYNHAGKLVLSDKGQEFVEKPTKKKLADMIQERVRLFGEMLDVINVTPITVEEADQKICKKYGLNWKNLSNTRKRMDWLEVLGLIEAIGNRKWKVTESGKEALKKWCIINPEVLEHEDSDIDDIEIADPPIEIEGLLQKLRDHPEMHKKRCTYNIWAPSPNRIENLRIIIQVASNKISKKELFEFIEKEFELKLSSVESMLPFLKASGLLEEVGRNVYLATAAAQAWVETGNELDFIRILHIHMRFVGEILLYAQNDIVRNDIYHIGKNYGMNKEKTRWMTGFLLEAGLLDEPRYLHLKTTKMGKTFAETLPLEKTVQYDITEESAEEKVSISDSESNKLDIISERLRVAAVNPAAEGKLSGVAFEEAISDLFSYMGFRAEHIGGSGNTDVIVKWKQNDDMVVAILDGKSKSSGQVSHGDISDIALDTHKEKNNAEYVAVIGPGFSGETIKNFAIKKEYALITDKQLIEVANASQELGLSLEEIALMFQVPNGFSRLEEIISFKRREMDIISEIIRQFCNEQELLESLSPRDLFLLLRNSSVSPLLEELIGGFDILSGNSIGILKRVDKSGSPENVRYILHNENTVINKIRALANAIEQGMQL >NZ_CP041667.1|WP_007045947.1|2255205_2255409_+|hypothetical-protein MWENLNFWIFLQCQLRRNPSHSLYLSVYIEFVFSKMQTTPRIGFWLDNSNQTPQQTAENILNARKPV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667_6 | 2317193-2317496 | Orphan |
NA
Consensus repeat of NZ_CP041667_6
|
4 spacers
spacers of NZ_CP041667_6
>6.1|2317218|41|NZ_CP041667|CRISPRCasFinder TGGCGATGACTCCAGGCAATCCCGTGGAACTGGTGGAGCCC >6.2|2317284|45|NZ_CP041667|CRISPRCasFinder CGGCGATGTCGCTGCTCCGGAGATGATACAATGATTTTGGTTTTA >6.3|2317354|42|NZ_CP041667|CRISPRCasFinder CGGCGATGACGGAAATACTCTTGTTGCCTCACTGTATTGCAT >6.4|2317228|44|NZ_CP041667|CRT TCCAGGCAATCCCGTGGAACTGGTGGAGCCCATATTTCAATCCA >6.5|2317294|48|NZ_CP041667|CRT GCTGCTCCGGAGATGATACAATGATTTTGGTTTTAGGATTTCAATCCG >6.6|2317364|45|NZ_CP041667|CRT GGAAATACTCTTGTTGCCTCACTGTATTGCATGGATTTCAATCCA >6.7|2317431|44|NZ_CP041667|CRT ATAGGGTTGATTCCTAACTTTTCACGATACTTGTATTTCAATCC >6.8|2317294|37|NZ_CP041667|PILER-CR GCTGCTCCGGAGATGATACAATGATTTTGGTTTTAGG >6.9|2317364|34|NZ_CP041667|PILER-CR GGAAATACTCTTGTTGCCTCACTGTATTGCATGG >6.10|2317431|34|NZ_CP041667|PILER-CR ATAGGGTTGATTCCTAACTTTTCACGATACTTGT |
CRISPR arrays and Neighbor proteins around NZ_CP041667_6
The CRISPR arrays of NZ_CP041667_6 >merge|NZ_CP041667|6|2317193-2317496|CRISPRCasFinder,CRT,PILER-CR ATATTTCAACCCACATCGCCAATGATGGCGATGACTCCAGGCAATCCCGTGGAACTGGTGGAGCCCATATTTCAATCCACATCGCCAATGACGGCGATGTCGCTGCTCCGGAGATGATACAATGATTTTGGTTTTAGGATTTCAATCCGCATTGCCAATGACGGCGATGACGGAAATACTCTTGTTGCCTCACTGTATTGCATGGATTTCAATCCACATCGCCAATGACGGCGATGACATAGGGTTGATTCCTAACTTTTCACGATACTTGTATTTCAATCCACACGCCAATGACGGCGATGAC >NZ_CP041667|6|6|2317193-2317420|CRISPRCasFinder ATATTTCAACCCACATCGCCAATGA TGGCGATGACTCCAGGCAATCCCGTGGAACTGGTGGAGCCC ATATTTCAATCCACATCGCCAATGA CGGCGATGTCGCTGCTCCGGAGATGATACAATGATTTTGGTTTTA GGATTTCAATCCGCATTGCCAATGA CGGCGATGACGGAAATACTCTTGTTGCCTCACTGTATTGCAT GGATTTCAATCCACATCGCCAATGA >NZ_CP041667|6|2|2317206-2317496|CRT CATCGCCAATGATGGCGATGAC TCCAGGCAATCCCGTGGAACTGGTGGAGCCCATATTTCAATCCA CATCGCCAATGACGGCGATGTC GCTGCTCCGGAGATGATACAATGATTTTGGTTTTAGGATTTCAATCCG CATTGCCAATGACGGCGATGAC GGAAATACTCTTGTTGCCTCACTGTATTGCATGGATTTCAATCCA CATCGCCAATGACGGCGATGAC ATAGGGTTGATTCCTAACTTTTCACGATACTTGTATTTCAATCC ACACGCCAATGACGGCGATGAC >NZ_CP041667|6|2|2317261-2317496|PILER-CR ATTTCAATCCACATCGCCAATGACGGCGATGTC GCTGCTCCGGAGATGATACAATGATTTTGGTTTTAGG ATTTCAATCCGCATTGCCAATGACGGCGATGAC GGAAATACTCTTGTTGCCTCACTGTATTGCATGG ATTTCAATCCACATCGCCAATGACGGCGATGAC ATAGGGTTGATTCCTAACTTTTCACGATACTTGT ATTTCAATCCACACGCCAATGACGGCGATGAC
>NZ_CP041667.1|WP_002604486.1|2315258_2315477_-|hypothetical-protein MYFTKGKQRAFELLMQQKPGFDRYQSGCAGDDEDCGTCRFYRPGWKYEFCVFKECPYCPGKRTRKTHASMDK >NZ_CP041667.1|WP_002604485.1|2314185_2315130_-|helix-turn-helix-domain-containing-protein MAVFRVERNTGYTVMSNHHLRNKELSLKAKGLLSQMLSLPEDWDYTLAGLSHINREKIDAIREAVKELEKAGYIVRSRERDEKGRLRGADYVIYEQPQPREPEAATSGGQPPILDLPTLENPTLDNPTLEKPTQEKPTLENPTQLNKDILSKEQSITDLSSTDSIPFHSLNPLPFAHGEAATPPERKRTEAKSNSAVEIYREIIKDNIEYDHLIQNCKIDKDRLDEIVDLMLETVCTARKTIRIAGDDYPAELVKSKFLKLNSSHIEFVLDCMRENTTKVRNIKQYLKAVLFNAPSTIDSYYTALVNHDLYGGE >NZ_CP041667.1|WP_002604484.1|2313675_2314149_-|PcfB-family-protein MQEEVTQKTIALYVKVGKGAARLTEQALQKAIQKFLEQKSKPAHGKQTMRQLMKQNAGVSNIEITDSNIKAFESTAKKYNIDFSLKKVKGEQTRYLVFFKGRDADVMTAAFQEFSAKKLNREKKPSIRKALAAAKDKAKQLNAARDKVKKMDRGREI >NZ_CP041667.1|WP_002604483.1|2311897_2313679_-|type-IV-secretory-system-conjugative-DNA-transfer-family-protein MKQINYKKLILPNIPYVFFVYLFDKVGQAVRLAPGADISAKILNITQGFSAAFENALPSVYPLDLLVGIVGAVIIRLIVYVKGKNAKKYRKGAEYGSARWGNAEDIKPYIDPDFQNNIILTQTERLTMNSRPKQPKYARNKNVVVIGGSGSGKTRFFVKPNLMQLHSSYVLTDPKGTVLIECGKLLQRAGYRIKVLNTINFKKSMHYNPFVYIRSEKDILKLVNTLIANTKGEGEKSAEDFWVKAERLLYCALVGYIWYEAPAEEMNFITLLELINASEAREDDEEYQSPVDLLFADLEERDPDHFAVKQYRKYKLAAGKTAKSILISCGARLAPFDIKELRDLMSYDELELDTLGDRKTALFLIMSDTDSTFNFVIAMLQSQLFNLLCDKADDEYGGKLPVHVRCLLDEFANIGQIPQFEKLIATIRSREISASIILQSQSQLKAIYKDAAEIILDNADSTLFLGGRGKNAKDISENLGRETIDSFNTSENRGTQVSHGLNYQKLGKELMTQDEIAVMDGGKCILQLRGVRPFFSDKYDITQHPNYKYLSDFDKKNAFDVERYMSTRPAIVKPDEPFDIYEIDLSDEDAAAE >NZ_CP041667.1|WP_087272998.1|2311689_2311830_-|hypothetical-protein MDFFNSAVDVLQTLVIALGAGLGIWGGINLMEGYGNDNPGANAHVS >NZ_CP041667.1|WP_070087585.1|2309394_2311314_-|TetM/TetW/TetO/TetS-family-tetracycline-resistance-ribosomal-protection-protein MKIINIGILAHVDAGKTTLTESLLYTSGAVPELGSVDKGTTRTDTMLLERQRGITIQTAVTSFGWKNYKINIVDTPGHMDFLAEVYRSLAVLDGAILVVSAKDGVQAQTRVLFHALQKMKIPTIIFVNKIDQEGIDLQSVYQNIREKLSDDVMVMQDVSLTPEVSLTDIEDIEKWDSIIAGNDELLEKYIAGEPLKIQDLQREKCRRMQNGSLFPIYHGSAKNNIGTEKLIEVIAETFTSGADNDQSELCGSVFKIEYTDQKKRLVYLRLYSGTLHLRDTILLPQNQKLKITEMRIPSNGEIIPADTACCGEIVILTNDTLKLNDTLGNVELLPRKAWEKNPIPLLRTTVEPQNQEQRDLLLNALTEIADTDPLLHYYVDTITHEIIISFLGKVQLEVVCSLLVERYHVNINVKEPTVIYLERPLKTASYTIHIEVPPNPFWASIGLTVTPLPAGSGTRFKSKVSLGYLNQSFQNAVMEGVRYGMEQGLYGWEVTDCEICFDYGVYYSPVSTPADFRSLAPIVLEQALKRAGTQLLEPYLSFTLFAPQEYISRAYNDAPKYCAVIESTLLKNDEVIFTGEIPARCIGEYRNDLNFYTNGRSVCLTELKGYQEISGEPVLQPRRPNSRLDKVRHMFQKIT >NZ_CP041667.1|WP_070087584.1|2308337_2309204_-|AraC-family-transcriptional-regulator MLKKLNDAMDYIEAHLEDEFLLEKISEHINVSDYHFRKIFFALTNMTLNEYVKNRRLSEANKELLQGAQVTDVAYQYGYQSVDGFTRAFKKWSGILPSQVAKLKQCKSCQKLQFVVTMKGGTLMEYKIVEKPAFTFAGVSKRVPLQYEGVNNAILELAQSITQEQKEEMHRLQNIEPYETVNVSYESDTNFLEEAGELTHLIGVLTTKNDISSNLDTFPVKAHTWAVFPNEGIFPFTLQDTMARIYSEWFMTADYELAEPFSFSFTKMDDKKPNYAYSEIWIPVTKKE >NZ_CP041667.1|WP_070087583.1|2306939_2307758_-|RNA-methyltransferase MKVVSILNKNNDYQRFEVLKHNRNKRYKYNQFIVEGVRSLNEAVKNNWKIISFIYDKNNLSGWAKHMIETVKTEVNYTLTAQLLKELSGKEETSELLAIIEMREDRLENVALSSNPFIVLFDRPSNKGNLGTMIRSCDALGVDMLIITGHAVDLYEPDVIVSAMGSFFNLPVIRIIHNEDLYKFVESLRIKYPGFKIIGTTAHHEKPIYHEDLKTPVMLMMGNETMGLNKAFKEYCDVLCTIPMAEDSYASSFNVSCAASIMMYEIVRQRMN >NZ_CP041667.1|WP_083262785.1|2306657_2306888_+|hypothetical-protein MHFVGFIFGHGNSPSLNNSAGRQHSLLSPLDCLSAKAGGAVNGGAYAPFILTVDWLGWLCHLSVIAYSFFICRLLR >NZ_CP041667.1|WP_070087582.1|2306264_2306687_-|sigma-70-family-RNA-polymerase-sigma-factor MTEDEAYKVHIQYTFNAFCKIVIRHAAIDIILKLRRRWEREVSLDYLMNEKFVQLAEPEQLEEYLFTACGQTAVLYHAELAAALALLPEQTQEEIFRYYFLRQPQRVIGVHIGRTRSTAGRHIQLALQRLRRLMEGKDHE >NZ_CP041667.1|WP_143930304.1|2317629_2318916_-|MFS-transporter MKNRIYELKDFYILWSTQGLSQLGSTMTNFALTLWLYQATGSALQTALLSVCSYTPYVLMSIFAGAFSDKWDKKKTMLVCDTLAACCTLLVFGLLKAGLLCPWHLYLLNGINGLMNTVQSPASDVAVTLITPRKYYQKTSGLRSFSNSLITILHPMLATSFYAFGGMDFVIIVDLCTFFAAFLALLFGVKIPEADRKCEEKESLLESVKAGLYYLNENRLVLVLILFLAGVNLVASAFDAVLPAFILPRENGGEKILGIVTSFGGIAMLAGSLLVSVLPAPKDRIRLIVVTMMISLTTDNFLMSLTRTPFTWCVGQILGYTPVPFMNASLDVIVRSTIPMEMQGRVYACRNTLQFFTIPIGFLLGGWMVDQICEPFMARAGADSIAAMLFGQGKGSGAAMMIFLLGLTGMTVCLVFRKVLKKYRYQER >NZ_CP041667.1|WP_143930305.1|2320026_2321442_-|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MKKGQIYEGIIETVEFPNKGKIYLPGEEKPVIVKNGIPGQKVRFSVNKIRGGRAEGRILEILESSPKEIPSPCPHFGLCGGCTYQTLPYEDQLQMKETQLKAMMDEAADTEYLWEGIKESPRRQEYRNKMEFSFGDEYKDGPLALGMHKRGSFHDIVNVPDCQIVDEDYRKILSCTLEYARSTGLPYYHRMRHEGFFRHLLVRKAVRTGEILIDQVTTTQSCSSKSAANGTDGFDGQSWVEALLALDLEGKIAGILHTRNDSVADVVKDEGTEILYGQDYFYEELLGLKFKITPFSFFQTNSLGAEVLYETVREYVGETKDKVIFDLYSGTGTIAQMLAPVAAKVVGVEIVEEAVEAAKENAGLNGLENCDFWAGDVLKVIDELGEKPDLIVLDPPRDGVNPKALDKIIRFGVERIVYIACKPTSLARDLEMLQGRGYQVERIAGVDLFPGTYHVETVALLSHKNLTAQSA >NZ_CP041667.1|WP_143930306.1|2321555_2322479_-|NYN-domain-containing-protein MDGQYYALLIDADNVSAKYIKPILTELSKYGNITYKRIYGDWTSTQHSSWKDELLKNSIMPIQQFSYTQGKNATDSAMIIDAMDILYTNAIDGFCIVSSDSDFTRLVSRLRESGKTVIGMGENKTPEPFRKACDKFTILENLLNEQEPGTEQESHDALSKEKIEDAVIKMIIENQDSNRITGLGEVGSRLVSLYPDFDVRSYGYNMLSKFLEEFSRIQMVKKGNIVSVILKEDEGRKADIDRYVRRMVKDAGRSGIELSTLGNRVYEKYKDFKINDYGYSQFNQYVKSLPNIRVEKDGTILNAVYRE >NZ_CP041667.1|WP_143930307.1|2322613_2322856_-|DUF4366-domain-containing-protein MSKVEDIIAATKLSELVNKKDEDKQSKTVLWILAIVGAVAAIAAIAYAVYCFFTPDYLEDFEEDFEDDFDDDFFNDEDQV >NZ_CP041667.1|WP_143930308.1|2322943_2325184_-|DNA-helicase-PcrA MSIYDTLNDRQKEAVLHTEGPLLILAGAGSGKTRVLTHRIAYLIEEKGVNPWNILAITFTNKAAGEMRERVDRLVGFGSESIWVSTFHSMCVRILRRHIELLGYDTNFTIYDTDDQKTLMKDVCRLLNVDTKIFRERTLLSSISKAKNELITPEEFQLQAGGDFGLKKVAEIYTEYEKQMRANNALDFDDLLLKTVQLFQTHKEVLDYYQERFRYIMVDEYQDTNTVQFELVRLLASKYRNLCVVGDDDQSIYKFRGANIKNILNFEQFFQDAAVIKLEQNYRSTETILNAANGVISHNKGRKEKTLWTENGKGEAIQFRQFDTAYDEAEYIVDDIRRQVEQGEGAYQDYAILYRTNAQSRMFEEKFVTANIPYKIVGGVNFYARREIKDLLAYLKTIDNGKDDLAVRRIINVPKRGIGLTSINRVQEYAARREIGFYDALLGADLIPDIGRGLSKLESFAALMERFKRDAAEMTISDLLQEILDETGYIESLQAEGEEEAEARIENIDELRNKIAAYEEACQEQDEPASLSGFLEEVALVADIDDLDENSDYVVLMTLHSAKGLEFPHVYLAGMEDGLFPSYMTITSDDPEEVEEERRLCYVGITRAQEKLTLTCARRRMVRGETQYNKMSRFLKEIPLELLSTGAVFKREETEEKEKPRFQSAYQQARQAFHTKAFAGVKQGKQFGSPAGHLDYQEGDRVRHVKFGDGTVTAIVEGGRDYEVTVDFDGPGTKKMFAAFAKLVKI >NZ_CP041667.1|WP_143930309.1|2325186_2325579_-|DUF3783-domain-containing-protein MKSVVLCYNLKGTAKGKKIGMIFGFLGFKVRAVDKEQYLWPIGALTGMEEPELEPEVYDGDGFPEEMLVIQAETEDMLDKAIFLMQKDRVQVGLKAVVTASNQKWTSLALHDEIKKEHEIMKRREAERKG >NZ_CP041667.1|WP_143930310.1|2325652_2327557_-|ATP-binding-cassette-domain-containing-protein MPGPMKRAPRGVKSQVEHPGRLFARVMKYVFKDYGIHCAAVVVLILVGVLANVQGTMFMRDLIDVYITPFLMSDTPDFAPLAQAILRVAAFYGVGIISTYTYNRLMVTVTQGTLRDMRDDLFSHMQKLPIKYFDTHAHGDIMSVYTNDIDTLRQMISQSMPQLLNSAITIVSVLVSMVILSIPLTIVTLLMVGIMVFSSKIMAGKSGRYFLEQQVNLGTVNGYIEEMMSGQKVVKVFCHEEENIERFRELNDKLYVSADKANTYANLLGPINAQLGNVSYVVCAIAGGVLALGNVGGFTLGGLASFLTFNRNFSMPINQISMQLNAVIMAMAGAERIFRLLDEEEEKDEGYVTLVNIKEENGVIEETKERTGRWAWKHIHRADGSVDYVEVKGDVVFNGVDFGYTDEKIVLHDIKLYATPGQKIAFVGSTGAGKTTITNLINRFYDIQDGKIRYDGININKIKKADLRRSLGIVLQDTHLFTGTVKDNIRFGKLDASDEEIVAAAKLANADGFIRRLPNGYDTMITGDGANLSQGQRQLLAIARAAVADPPVLILDEATSSIDTRTERIVQEGMDKLMHGRTTFVIAHRLSTVRNSDCIMVLEQGRIIERGTHDQLIEEKGRYYQLYTGNLALQ >NZ_CP041667.1|WP_143930311.1|2327560_2329309_-|ATP-binding-cassette-domain-containing-protein MIRVLLREVKQYKGASIATPLFMVLEVLMEMAIPFLMASIIDQGVNEGDIGHIYRVGAVMIAAALIGLLAGVGGGRFGAKASAGLARNLREAMFNNIQTFSFSNIDKYSTAGLVTRLTTDVTNVQNAYQMTLRMFTRAPASLVCAMLMAFAINARLAVIYLIAVIVLGILLFLIMSHATRYFQQAFPKYDDLNASVQENVSAIRVVKAYVREEQETSKFKRASGNLCRIFTKAECNLVYNAPLMQITVYSCILLISWLGAKMIVSSQLTEGELMSLLAYCMNILMSLMMLSMVFVMVSMSMASVRRISEVLRETSDLQNPERPVMDVPDGSIEFRHVDFAYRRDSEEPVLKDIDLTIRSGETIGIIGGTGSAKTSLVNLVSRLYDVTKGEVLVGGRNVKEYDMEVLRNQVSVVLQNNVLFSGSILENLRWGNKEASEEECRQACRLACADEFIQRMPAGYQTHIEQGGTNVSGGQKQRLCIARALLKKPKILILDDSTSAVDTATDAKIRKSFREAIPDTTKLIIAQRISSVQDADRIIVMEDGCVNGFGTHEELLETNEIYREVYESQTQGGGDFDEKAGE >NZ_CP041667.1|WP_143930312.1|2329505_2330117_+|DUF1836-domain-containing-protein MTINTKDMLNSILSSIARIDYVKPESIPNIDLYMDQVTTFMNEQLRSTKRYEDDKILTKTMINNYAKNNLLPPPVKKKYSREHVLVMIFIYYFKNILSIKDIEAMLTPITENYFNNESFNMTEIYEEICQLEKSRIDGLAKDVARAYQSAEETFGDRPQDEQEYLRFFSFICNLSFDVYVKKLLIEKLIDELPDPESSEKKKK >NZ_CP041667.1|WP_143930313.1|2330126_2330855_-|response-regulator MEIRDRVLIVEDDKNIRSFLQTVLEANNYEVLLAQNGLSAYSLITSQCPDIVILDLGLPDMDGIKILENVREWSQMPIIVVSARTHEKDKVSALDKGADDYITKPFGTSELLARVRTAIRHARGSGTGQNGGQSMSFLSGRLVIDHDKHRVFVDGTDAGLTQNEFKLVSLLGKYAGKVLTYDYMMKEIWGPNMKNDNRILRVNMANIRRKIEKNPAQPEFIFTEIGVGYRFVEADVQAKEKK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP041667_5 | 5.3|2242801|32|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2242801-2242832 | 32 | NZ_CP041667.1 | 2134404-2134435 | 1 | 0.969 |
NZ_CP041667_5 | 5.9|2243201|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2243201-2243235 | 35 | NZ_CP041667.1 | 2101206-2101240 | 2 | 0.943 |
1. spacer 5.3|2242801|32|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to position: 2134404-2134435, mismatch: 1, identity: 0.969
catatagtatataccgcctataacaagtacct CRISPR spacer catatagtatataccgcctataacaagtatct Protospacer *****************************.**
2. spacer 5.9|2243201|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to position: 2101206-2101240, mismatch: 2, identity: 0.943
tggagggggccggcgggagcacagacggaaaggca CRISPR spacer tggagggggccggcgggagcacaggcagaaaggca Protospacer ************************.*.********
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP041667_6 | 6.10|2317431|34|NZ_CP041667|PILER-CR | 2317431-2317464 | 34 | MK448877 | Streptococcus phage Javan220, complete genome | 39467-39500 | 7 | 0.794 |
NZ_CP041667_5 | 5.6|2243000|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2243000-2243032 | 33 | NC_010632 | Nostoc punctiforme PCC 73102 plasmid pNPUN02, complete sequence | 252803-252835 | 8 | 0.758 |
NZ_CP041667_5 | 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2242933-2242966 | 34 | MN693995 | Marine virus AFVG_250M873, complete genome | 42747-42780 | 9 | 0.735 |
NZ_CP041667_5 | 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2242933-2242966 | 34 | KX349293 | Cyanophage S-RIM44 isolate Np_20_0711, complete genome | 110204-110237 | 10 | 0.706 |
NZ_CP041667_5 | 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2242933-2242966 | 34 | KX349296 | Cyanophage S-RIM44 isolate Sn_13_0910, complete genome | 110246-110279 | 10 | 0.706 |
NZ_CP041667_5 | 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2242933-2242966 | 34 | NC_047734 | Cyanophage S-RIM44 isolate Np_42_0711, complete genome | 110201-110234 | 10 | 0.706 |
NZ_CP041667_5 | 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2242933-2242966 | 34 | KX349291 | Cyanophage S-RIM44 isolate ES_42_0910, complete genome | 110246-110279 | 10 | 0.706 |
NZ_CP041667_5 | 5.6|2243000|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2243000-2243032 | 33 | NZ_CP045289 | Curtobacterium flaccumfaciens pv. flaccumfaciens strain P990 plasmid pCff2, complete sequence | 10672-10704 | 10 | 0.697 |
NZ_CP041667_5 | 5.7|2243066|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR | 2243066-2243099 | 34 | MK448815 | Streptococcus phage Javan585, complete genome | 229-262 | 10 | 0.706 |
1. spacer 6.10|2317431|34|NZ_CP041667|PILER-CR matches to MK448877 (Streptococcus phage Javan220, complete genome) position: , mismatch: 7, identity: 0.794
atagggttgattcctaacttttcacgatacttgt CRISPR spacer aaaactttgattcctaacttttcaaaatacttct Protospacer * *. ****************** .****** *
2. spacer 5.6|2243000|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to NC_010632 (Nostoc punctiforme PCC 73102 plasmid pNPUN02, complete sequence) position: , mismatch: 8, identity: 0.758
agttgtggaaagaaaagggaagaacagttccct--- CRISPR spacer agttgtggaaagaaactggaagaa---tctactagc Protospacer *************** ******* *.. **
3. spacer 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to MN693995 (Marine virus AFVG_250M873, complete genome) position: , mismatch: 9, identity: 0.735
actataaaaatactagcgcaagtatattttttct CRISPR spacer gtaagaaaaatactagcgccagtatctttatgtt Protospacer .. * ************** ***** *** * .*
4. spacer 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to KX349293 (Cyanophage S-RIM44 isolate Np_20_0711, complete genome) position: , mismatch: 10, identity: 0.706
actataaaaatactagcgcaagtatattttttct CRISPR spacer aggcagcaaataatagcggaagtatattttttga Protospacer * . ***** ***** *************
5. spacer 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to KX349296 (Cyanophage S-RIM44 isolate Sn_13_0910, complete genome) position: , mismatch: 10, identity: 0.706
actataaaaatactagcgcaagtatattttttct CRISPR spacer aggcagcaaataatagcggaagtatattttttga Protospacer * . ***** ***** *************
6. spacer 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to NC_047734 (Cyanophage S-RIM44 isolate Np_42_0711, complete genome) position: , mismatch: 10, identity: 0.706
actataaaaatactagcgcaagtatattttttct CRISPR spacer aggcagcaaataatagcggaagtatattttttga Protospacer * . ***** ***** *************
7. spacer 5.5|2242933|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to KX349291 (Cyanophage S-RIM44 isolate ES_42_0910, complete genome) position: , mismatch: 10, identity: 0.706
actataaaaatactagcgcaagtatattttttct CRISPR spacer aggcagcaaataatagcggaagtatattttttga Protospacer * . ***** ***** *************
8. spacer 5.6|2243000|33|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP045289 (Curtobacterium flaccumfaciens pv. flaccumfaciens strain P990 plasmid pCff2, complete sequence) position: , mismatch: 10, identity: 0.697
agttgtggaaagaaaagggaagaacagttccct CRISPR spacer gcttgcggaaagaaaagggaagaaaagaaaagg Protospacer . ***.****************** **
9. spacer 5.7|2243066|34|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to MK448815 (Streptococcus phage Javan585, complete genome) position: , mismatch: 10, identity: 0.706
ggggctcactaccccaaaacagattcgattttta CRISPR spacer agaattagataccccaaaacatattctatttttg Protospacer .*...* . ************ **** ******.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
84111 : 104032
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP041667|84111:104032|DBSCAN-SWA TTTAAGAAGCAAGAGCCAGTCTTTCAATTTCTGCTTTTGCTGAATTGAAAGTACCGTGTGCGTAGTACCCTAATGTCATGGTGATGTTTGAATGTCCCATGATGTATTGCAGGGTGTTCGGGTTCATTCCTTTGTTTGCCATGTTCGTGCAGTAGGTATGTCTGAATGAGTGCGGTGTGATGTTTGGTAGCTGGTCTTTATGTGTCTTGTTGTATTTCTTCACAAGCCCACGCAACATACTTTCGTAGTTGCCAGCCACTTTAGGGAAGCCCTTTTGGTTTAGGAACAGGAAATTGCTGTAACCATTGATAACGATAGGCTCTGCCTTTCCTCTGTTCTGTAACACTCGCTTAAAGGCTTGATAGGCTCTTTCTGTCATTGGAAGCTGGCGTTCTCCTTTTTTGGTCTTAGGTGTTTCAATGTAGTAGCCGATTTCAGTATCTTTCAATAGCTGGTGGTCTACATTGATGATACGGTTTGCCATATCTATATGGACTGTCAGACCGCAAAATTCAGAGATACGAAGCCCCGTTTCCAGAAGAATAACCACTTCATCATAATACTTGCTGTACGTCTTGTCCTGTTCCATAAAGGCAAGCATTTTTTCTTCCTGCTCTGGCGTGAGAATAACTTTAGGCTCTGAATTATCTTCCAGAACATCACTCAATTTGAAGCTGAAAGGATTTTTTCTGATACAGTCGTCCTCAATCGCCATATAGAATGACGCTTTCAGTGAACGCTTGTAATTGCTAATCGTCTTGTAGGAAAAGCCATTCTCATACATTCGCATAGCCCATTCCTTACCGTCCGATAGCTTCACCGTATCAATGCTCCTTGCACCCAGAGGGTCGTTTTCCAGAATTTTCATAAGATACTGGCGACCTTTTTCTGTGTTGCGTTTCACATTCTTTCTCTGGCTGTTCTTCTTTGCGTAGAGCTGGCAGACTGTCATTTTCCCGCCTATGGTATTGATACCGTCGGCAAGGTCTTTCTTAATCTGCTTTTCTTTTTCTCTCAACGACAAATCTTCATGTTTCCCAGCGGGTGTCTTGTCTGTCGGAACAAGTTTCCAAGCATACTTAAACTGTGGTTTCCCGTATATATCGGTATATTTGTAAACGTATCTTCCGTCTTTTCTCTGGCTCTCTCCAAGTCGTAAATTGCGTCCCTTGCTGTCTTTTCTCTTTACATTAGACATAGTGCGAAGCTCCTTTCCGTCAAGGAATGAGCCGTTGATACGCTATACTTTATTATACCATACCTACGGCTCGCTGACATTAGATTTCGTCCAATGTATCAATGATTTTTTCAAACTGTTTTCGCTTAATCTGTACACGGTTTCCGTTGACAATCACCCAGCCAGCGTCGAGATTTTCCTCTGCCAGCTTACGCAATTTCTTTTCCCCGATACGGAAGTATTTTGACGCTTCCTCAATCGTCAGCGTGTACTTCTCCCAGATAGGCACATCAGTATTGTTCATAAGTATGCACCCCCTTTACTGTGTGTCTTATGGAAGCAGACAGACGGCTTTGGAAAGCTCCACTGGCATTTCAGCCATTCCCATTCCTGTGGGCAGAACCGCACCATGCGGGTGTATCATTATCCTGTGAATACAGGTCGTGGCAAAACGACTTTTCCAACGGTCGCTCTCGGATCACTGCATGGGTCGCTCGCTTCCCCTTATGGAAAAGGTCGTGGCGTACAGTCCCCGTGGCTCGCTGTATCACAAACGTATCTGTCTGCTTTATTCACTTGTCAAAGAACAGTGGCGAAAGCCATAGGCTCTCATGTGTAAAAGCAGAGGGCAGGACAGGAAAGAGGGGATTTCAACAAATCATGCCCTGCCATAATTGCTTTATTCTTCTGTTGGAACAGTACCGATATCGAAGTCTAAAATCATTTTGATAATAGCTTCTCTGATACGCCCTTTAAGCTCCATATCTACTACGATATAAACATTACCGTATTCATCATATAGAGGACGCAAACAGCATTTTGATATGTACGGGTCATAAAATGCCAGTATTCTTTCAATAGACTGTTCATTTCCGTCCACAGCAGATGAAATAAGTCGATAGGGTGGAAGTTTGTACCCTTTTTTCATGTTTTATTCACGCTCCTTTGTCAGTGTTTTCTTCAAATTTTGTAAAGCTCTGTAACGGTTTTTAAAAACTCCTGCTCTGGAAATTTTCTGTGCTTTTGCAATTTCTGTATCGCTCATTTCCAGAAAGTAGTACATCAGAAGATTTTCACGGTTGCGTTCCGATAATTGCTTTAACGCTTCTGAAAGTTCTTCATCAAGCACTCGGATTTCAACGCCGCATACTTCAAACACGGTATAGTCACTCATGTACTCGTCCCATACAGCCAGCTTTTCAACAACGATTTCTGGAAGTTCACAGAAAGGAATTTCTTTATCTTTACGACGTTTTAATTCCTTTTTATAATTCCAGATAACGCCCTTAACAGTACGTTTGAGCTTACAGTCAAACTGACACTGGATTGTCTTTTGGAAGTCAGACGGTTTCATCATCTCACCCCCTCTCTGTCCAGACGTAAAAGGTATCTATCCCTCTTTCCGCCCCACATCGGTACAGGAGTGGGTGATTTGCTAACCTGTATAAGAAAGAAATTCAAAAAAATATTGAACTTCTCATAAATAACAAAAACAGTCAATCAGCACATGAATGTACCAACAGACTGTTTTTGCTATGTTTGCGATATGAATTTTTAACTCATGTTTAGGGTGTGCTGACTGGATAACCATTGTAAGGCGGTTATTCAGTTGTGAATACTGGCGTGTTCGCTGATACGCAATCATAGCACACCCCCCTTATCCAGAGCTGTGCCGTGAGTGCAGAAGAAAAGGACAGCTCTGGTTATCCAAAACCGTCCTAATCAGATTGAAAGACATGGAAAAAGATATTACCTTAGAACGGTTTTTTTTATTTACCGTCAAGGTAATATTGATACGAATTACTAACGCATAATATATGCGCTTTTAACACATTCTAATCATGTGCCATGAAGTATAAACTCATGACAGGTGATTTGAAATGAGGAAAAAGAAAGACACGCACAGCTTTGATTTCCGTCCGTTAGGACTGGCAATCAGAGAAGCGAGAGAAAAAGCTGGGCTTTCCCGTAATGATTTAGGCGATAAGGTCTTTTACGGAGAACGTCATATCGCAGATATTGAGAATATAGGGAAACACCCCAGCTTCCAGTTATTTCATGATTTAGTGACAATGTTCCATATCTCTGTTGATGAATACTTCTATCCGCAGGAGAAAGCGGAAAAATCTACAATCCGCAGGCAGATAGACGTATCGCTTGATGAACTGACAGATGAAGAACTCAAAATCATTCAAGGAACAATCGACGGTATCATGAAATACAAGGGGACGGGAAGCAAATAGGCTTCTCGTTTTCTTTTTCCTCGCATAAGCTATCCTGCTGATCTGCTTTGCAGGAAGCAGGATAAGCACCTAAATCTTAGTTTTCTTTGTAATTGTGTAAATGTATTCTTCTGGTGTAGGTATCTTATTACCAAAATACTTTTTAAATCGTTTCTGTACCACTGGGTCTTGACTATTCATAGAGTTGTCAATATTCTCCTGTATTTCAGCCCTAAACTCTGGTAGCGACAATCCTCTGGCTTTTGCCCCAGCTTTTAAAATAGTTCTCATATCTCGTTTTTTAAGACCAATCATTTTGTCACCTCATTACCGATTAGATTAGAAAAGAATTTATAATCGCTATAACAGATTTATCAAATACGCTAACTTTATGCCTTGTTACTAAATTGATAGTATCGTTAATTGAAGTCAAGTTACCATTTTCCAAAAAAGCTAGGCAATTACATTCTAATGCAATGATAACTTTGTCTAATGTCCAGCAAGAGGATAAAGCTATCCTATGAAAGATTGCACATTCTTGTAGTTCACTTAAAATAATATCTTTATCAGCGTCCATATCTAAAATTGCAATATTCTCTAAAGAGCCGTTATTTATCGGGAGTAAATAATTCTGTATCTGTTGAAAAATCGCTAACCCTTGATAATCTGTAATATATTCAACGTTTAAATTTGTAAGAGAGAATACCTTAATCATTATTAGCCTTTCATTCTGCGTTTAAAGATAATAACAGACGGAATGATAAGTACCAGAGCGTATCCTAAAACCCATACAAGCTGTAATGGTATATCTGAAAAATTACCGTTTATTGCATAATTTACCGCATTTACTGCATGACTGAATGGTAGTAAATCGCAGACCGTTTTGAATGTACCACCAATCATATCCAATGATAAGACAGCACCACTTAAAAACACTGTTGCATTCACTAAGATTGTTCCAAATCCATTGACCGCATTATTTGTTGAAAGTACACTACCAATAAATAAGCCAAAACTGATAAATAATATCGCCACTGGAATTAAGGTCAAAATAGCTATAAAAATATTTATATTTAATTCTAATCCGAAGAACAGAGCTACACCAAAACAGCAAACAGCCTGTAAAATTGCAAGCGGAATTATAGGTATAGAATATCCTAAGATGTAATCAATTCCCTTCATAGGTGACGCAAATAATCGGGTTAGGAAAGAACTATTCCTATCGGTGGACATCAACGTACCTACAAAGGTACTTAGAAACGCCATTCCTAAAATCGTCATACTCGGTGCAAAATTTTGGACTTCAAATACATCAGCCATTCCACCGATACTTTCTTTCATCAAAGCTAACGCAATAATCAGCAAAACAGGAAATCCGAGACACATGAGAATACTCATGGGATCTCGCACAATTTCTTTAAAATTACGTCCTGTAAAAATAGTAAATCTCATAATTCAGTACCCCTTTCTGTATACATCAAAAATGCTTCTTCAAAATTTCTAGCACCAGATACCTTGATAATTTCATCAGCCGTACCAAGAACCTGCACAACACCCTTATCCATAATACAGATACGGTCAGCTAATTCCTCTGCTTCTTCAAGATAATGTGTTGTTAAAATCAAAGTTAATTTTCCTTTCAATTTTTCCATTGTTCCCCATAATTCTCGTCTTGCTCTTACGTCAAGCCCGATTGTCGGTTCATCAAGGAAAAGAATTTTAGGATTACTTATCAAAGCCATTGCGATACTTAATTTTCTCTGCCAGCCACCCGATAACTGTTTTGCTCTGTCATTTTTTCGGTCAGTTAGTTCAAAAGTTTCCATGATTTTGTCGGCTTTTTCCCTTGCTTCCTGTTTAGAACTACCATGTAAGCGGGCAATCATCATCAGATTTTCTTTAACTGTCAGCTTTGGTGCAACGGCTGTTTCCTGTGGAGAAAGGTTGATAATATCTTTTACTTTATCTGCCTGTTTTCTCAAACTGTTTCCTAACATAAGGGCATCACCACTTGTTGGCTCTGTCAGACAGCTCAACATTTTAATAGTTGTTGTTTTTCCTGCACCATTTGAACCAAGCATTGCGAAAAATTCTCCTTGTTTGATATTAAGCGAAAGAGAATTAACTGCTGTTAAATTTCCATATCGCTTTGTTAAATCTCTAATTTCTATTGCATTCATTCTTTGTCCTCCTTTTGAAATGCCATTTGAAACGCTTCGTTACAACCGTCCATTTTCTGTAAGGCAATTCCTTTGTCTTTATCAGCAAGTAACACTAATAGCTCGCCGATATGAATACCAAAGAAATCCCTTACCTCTTTTGGAATAAGAATTTGTCCTTTTGTTCCCACTTTTAAGGTGCCTAAATATCGTCCATTGTGGGAATAGTGTTCTGTAACATCTCTATTCACTATATCTCCTCCTTTTGGTATAACAAGTATTACTTGTTATACCAATATTAACACCAGCAGAGTAAAACGTCAATAGAAAAGTATTACTTTTCCTACTTTTTGCTAAAAGTTATAAAAAAGACGGTGCCATAAAAGCACCGTCTTAATAATTTAATTTATTCTGACTTCGTAGGTGATTGCCCAGAGTTAAACACTTGTTCAAAAAAATCTTTGTAAACCTCATATTTTTGAACAGCGATACCTTGTTTTGCGTCTGCTAATAAGACCAAAGTATCACCTGTACTGATATTTAGCATTTCTCTTGCTTCTTTGGGTATAACAATCTGTCCCTTTGGACCTACTTTCACAGAGCCGACAAATTTATCATTCTTTGTTTTTTTGTTTTCATCAGACATGAACTCGCTCCTCTCTAAAATCGTATAACATTTCCTACTTATTATACAACTCGAACAATAAAAAAACAAGTACTGGTTTTACTCTATTATCTTCCAGTTACTATCCTTATGAAGCGTCAACTCATACTGTGATACCTGTGTTGCCTTTGTCTGGTTATCAATGAACTTCACAGATACCTTGACATTCACATTATCTCCGTCCTTTGTAAATACTGGGTTTACTAGCTCGGAAAAGAGATAGTCCCCGTTTATCGGTTCAAGAGCGTTTCCCTCTACATAGTAGGCAAGCTCCTTATCCGTTGCGGTCGGGTACAGCTTAAAGAATGTTTCCAGAAACGCCGTAGCGTCATTTACCGTATCAGCGTCCACGCTGTTGTCTGCTTCCTGTGCCTTTGGTTCATAGCTGGATTTTTCCATAGCGGGGGCAAGGGTAGGGTTCTGTATGATAACCATATCCCCGTCGGCGTCCACATGAACGGTCACGGTATAGGTTTCTGATACGTTCCTTGTCTGGTCGCCCTCTTTTATCTGCTGATCTACTTCGTAGGTAACAGAAAAAGCGTTCTCCCCAAACTGTTCCACGTCCCAGATAAGTACATCCGTGACCGTGGAGCTGGTCGGTATATCCGTCCTTACCGTATCAACATTCAAATCCTGCAATTCCTGTGTCAGATAGGCACTGATAGCCTGTGTCCTTGCTTCGATTGCTTCCTTGCTGTTGCACCATGTGTAGTAGGATTTACAGAAATTCTTCACGAAATTTTCTATCCCGTTGGTGTCATTCAAGCGGAGCTGTATGGTTTCTATCTCATGCGTGGTGTGCTGGTCGATAGCGGTAAAGTTCTTATACACACCAAAGCTCACGCTTGCGATAAGCACCACCCAGAGGGCAATGACGGATTTCTTATGCGTCCCCACTTTCATAGTGCGGACTTTCTTTTCCTTTATCGGTTTTTCTTTCTTCTTGAACATGATATTTCCATTCCTTTCTTTCTATTGCTTAATTCTGCCAGCCCCGATAAGGTGCTGTTGCCAGTAGGAGCTTGTAAGGTCGGTGTAGCCTATCGGGTCGCCAGCGTGGTACATCTGATTGTTTCCTACATAGATACCGACGTGCGTTACATACGTTCCCGCATTGTAGGTACTGTGGAAGAACACCAAATCCCCAGCCTGTGCCTGTGAAAGCGGGATATGCTGGGTAGCGTCATACTGTGCCTGTGCCGTTCTCGGCAGACTGATACCAGCCTTTCCATAGCACCATTGAACCAGTCCCGAACAGTCAAAAGAAGTGTTAGGATTGCTCCCGCCGTACACATAGTCCCAGCCTTGATATTTCAGTGCTTCGTCCATGATTTTTTGAACGGTCGCATTATCGAATTTTGTAATCGTTAAATATTGGTTGACTATCTCTACATAGAACATATTCCCGTAGGCATATCGCCAGCCACCGTTGCGGGCGACAGCAATAGGGTTCGTGTAGGTGACTTTCTTCCCGCCAGACTTGTCACGGGCGAAATTCTCTGCCAGCGTGAAGCTGTGCTTTTTCCCATTGGAAGCCACATAGCCGATATAGCCACCGCCATAGTTATAGGACTGTATCACCACGTTTATATCCTCTATGCCTTGATTTTCTGCTGAAGAAAGCAGGGACGCAAAATATTTGCACCCCTGCTTGATTGAACTTTCCGTATCAAGGGAATTAGGCGGTAAGCCCATGCTCTCGGAACTCTGCATAACGTCCTCGGCTGTCCCGCCACTTTCCACCTGTATGATTGCCAGCAGATAGGGGACGTACTCGGATATGCCGTACTCTTTAGCGTATTTCTCTACCATAGGCTGATGTTTCAAGACCTCTGCGGACAGGTTCATGCCCGTTACCCATGAGGAAGAACCACCGCCCCCGTCGTCGTCCTCGGCACTGATAAGCACACCAAAGAAAAGCACTATCGCAAGAAGAAAGGGAAACAGACTGCCGATAAGGGCGATATGTCTTAGCTTCATTTCTTATGTCCTTTCTTTCCTGTCATGTTTAGGGTCGCTTTCTTGATGGTTCGCTCTTTTCTGGTGGTCTGCTGGTTCAGCTCTACACCATTGAGCCTTGTTCCGTGTTTCTGCAATACAGGTTCAGCAGATTTTGAAGCAAGCGGACGCTCCTTGACAGGTTGTGCTTTCGTCGGCTCGGTATGTACCTTGTCCGTAGTGACAGGACGCTTGATTTCCTGTGCTTTTCCTGTATCTGTCTTTGACGCAGGAGAAGCAGTCACAGGTCTGTCATGGGGACGGGTAGCTCCTGTTGTCGCTGATCCGTCCGTCGCTTTGCCAGACTGCTTTGCTTCCTGTGCCTTTTGAAGTTCTATACGCTTGTCAGCGATATTCTGTCTGTGTTGCTCACGCTTATCGGCACGCTGGCTCTGTCTAGTTTCCTGTCCCTGCACAATGCCACGCTTGAAGTCGGACACATTTTCCTTTGCCTTTTCCTTAGCGGAGTGCAGGGCATAGGCGGTCTGTGTCGGCATATCCTTGATATTCTCCTTGACAGCCGTTGCCTTGTCCTTGACCTTGTTTTTTGTATCAAGCACCGCACCTATGGCTGAACCCGCTCGGCTTCCAAAAGAAGTATTTTCTCGGCGTTCTTTTCTGCCCGTACCAGCTCTAGTAGATTTCCCAGAACCGTCATTCATGGTTGTTCCTGCTACTGTGCCAGCCACCGCACCAGCAACGCTTCCAGCACCGACAGCCCTTGCGATACGGTGTTCCATACGCCTAGCCCTGTGTCGCATGAACATGTACGGGCGACGGAATATCCTGCGTCCCATGCTCTGGCTGTCCCCAGCGTTTAAGCTGAACATACTCATAAGGTCGCCCAGCTTCATGTAGATACCTGCAAAGCACACTATCTGTAAGAACGCAATCATAAAGAACGGGTAATCCGTGGAGATATTGTAGAACATACTGGAAATACTGAAAGCCACCGTGACAATGAGCGTTATCCCAGCCCTTGTCATAATCGTATTGAACACTCGGACGATTGCCTGTTTTGCCATGTTCTCATAGCTGGGTATCATGGAGAGCAGGAAGCTAATCGGCAGGAACATAGCGAAGATGATAAATAGTATCTGGGAAAATATCATCATGCCCGTAAGCAGGAATACGAATATCGTTATCCCCAGATTGAAGATAAGCAGGAAGAACACCATTCCCAGACGGTTGATGACCTGTGGTATCGTCAGATTGTCGTTGTCGTTATCCTCGATTTCCGTTTTCACCACGTCCTCACGGGTCGCCCCGTCGTCGGCAGAGGGACTAGCCGATACCAGAGCTTCCACACGGTCAGCCCCGATTTCCTCTATATCGCTGTTTCCGAACTGCAAAAGTAGCCACGGCTGTTGCACCTGTATGGAGAACAGGCTGTCCCTTATCAAGTCCACGCTGTCTTTTCCCGCACTGTCGCTGTCGGGAAGCATTATCTTTGTCCCCAAATTCAGCGAAGCCGTGGAAATGTCCGAAGAAAAGTCATTGATTTTTTTGATGTAGTCGGGAGCGTAGGCGATAAAGGAAGCGGACAGCACGAACACCACAACGAAGTTGATAACGGCGTGTAATGCCTTGCTGGTTTCTCGCTTTATCAGTCCCGTGTAGGCTACATAGATACCCACGATAAGGATAATGAGAAGCAGGAAGCCCACATAGAAGCCCGTACTTCCAAAGCCGTTTTCCGTCACGCCCGCTAGGGTCTGTATGCTCTTTCCGATACTGTCTGCCATGTCGTTGATGAAGTCCAGCTTATATGCCTGCTGGACGACGTACCCTGTGGCGTTACTTAGATACAGGCTGATAGTCCAGATAAAGTTGGTGATACAGTAAAGCCCGTATTGTACGGACTTCCCGATACCGTCCAGCCAGTTCCACGGAAGCCAGCCCCAGCTATTGTCCACATAGAAGTCAAGCTGGTAGTTTTCCAGCGGGTACTGTGAATACAGGTTGTCCGCATTGACCGTATCATCAACCAGCCCCGTGGCATGAGCCACCGTCCCGAACAGGGAGAGGAACACAACAGAGGTAAAAATGACTATCAGAGCAATCTTGAAAAAGCGAAAGAGCTTCTTTTTTGTCAGAAGCCCCTTTATCCTATCTATCATCACTCCACCTCGCTTTTCACGGGCGGTCTGGTATCAAAGGCGTGCAACAATTCTTCAAAGACAGGGTGTATCTGCACCACGCCCACACGCCCGTACAAATCTTGAAGCAGGCATTGTCCGTTCTCCAAGTCACGAAGTCGTTTCTGGTTGTTCTCGTCGTCCTTGTCGATACCGAAAAATTCAAGGGTCTGCTTTATCTCGTTGATGTCGGTACTTCTAAAGGCAAACTTCAAACCGATATTGTTTTTCAAGCTCTCCTTTGACACGTCGCCAGAGGATTGTGTGACGAAGTACACGCCCGCCTGCATAGCACGTCCAGCTCTTACCAGCTTGTTTGAGAGGGTTTCGCCCTGTGCCACGTTGAGGAACGCCCACGCTTCATCTAAATCCACGATTTTAAAAATGCTTCTGTCACTGTGGATAAAATCAAGGGCAAAGGTGGAAATGACAATCAGCATAGCCACGGACAGCAGTTCAATGGTGGTGTATTCCTCAAAGGTCGTGTCCTTATCGGGGAGTACCAAGTCTGCCACCTGTATGATGTTGAGCTGGTTGTCCAAGCTGATAGCGTTCTTAACCGTTCCGTCGGAGAACAGCAGATGTGCAAAGTCATAGTCCGTAAAACTGTCGATATGGTCGGCAATATTGCGGGCAATCGCCGTTTCCTCTCTGCGTAGCTCGTCTATCACATGGAGAAGTCCTCGCTGATCGCTCTGGGTCACGGCTCGCACCGCTTTTCTAAGGACAGGGAATTTCTCGCCGTCCCTAGAGGAAATCCCCGTAAGGAATGTGAGAATGTCTATCGCCAGACTTTCAGCATCCTTTACATTCTTCATAATCACGAACGGGTCAAGAAGCCCAGCGTTTTCCTTATCGCTGGTAAGGTTTACGATATTGATTTCATGGGCTATCTCTAGGAGCGTTTCTTTCCAGTTGCCACGCTCGCTCTTAGGGTCGAGAATGACCGCCTGTCCGCCAAAGAGGACGGAATAATAGACAATCAGATTGTTGCAGAATGATTTTCCACCGCCAAGACTTCCCACAAAGGCAGAAGCCAGAGCATTGGTGACTGTCCCCTTGATACCCTGTGAAGCAAGGGACGGCTGTAAATAGACGTTTCTTCCCGTGTCCACGGAATAGCCGATATAGATACCTGTATTCTCGCCAAGCTGTTGTGTCGCCCCAAAGCCAAGCCCAGCCAAGAAGTCCGATTTCACATACTGCACATAGTCATTGATATAACGCTTGCTGGCGGGGAGAAATTCAGAGTGAAGCCCCAGCATATCCCCAGCAGGTCGGACTAACTTTACATTGAGGTCGTCGTAGAAGTCCTTGACCTCATCACAGCGTCGTTTCAGCTCGTCGAGGTCGGGAGCAGACACACGGATAACGTAAGAAAGTTTATACATACTTTCCTTGCTCTGGTCGAGGTCGGTTTCCAATTCGTCCACGCTGTCTAATGCGTCCACCACATTTGAGCTTGTTTCACTCCCAGACTGATAGGCGTGGTTGTCCAAATCTTTCAGCTCTTTCTTCTTGTTGCGGACAGTCGATAAGGCTTTTCTGTTCCCGACGATTTCCACATTCATGCTCGTATCAACAGGGAAAGTGAACTGCTGTTGCTGGAAATAGAAGATTTCCGACGACGGAAAATCCAGCTCGCCCACAATCGCATTGACGGTAAAGTAGGACACGAAGCTCTCGCTGTCCTCATGTTCCAGCCTTATGTATCGCTGGCTTTCCTCTATCAGACAGCGGGTCGGACGGATAAGGTCATACTGCTTAATGAGCGTTGCCTTTTTCAGCTTCTTCTTTGGAAGCGAATACTCGTAGTCCTCATACGCCACGCCGTCCCTGCCGTAGATATGCTCGATAAGATAGCCGAAGTCGTTCTTGTCCAAGCGTCGGAACTTGAAACGGCGGGAGATTTTATTTTCCAGCAGTTTTTCCATTTTCATGTAGCGGTTGATTTCATCATCTGGCATGGAGATAAAGTCGTTCATCAGCGTATGGTTGACCTCATTCAAGAACTCTTTGAACGTCATAAACGCCGATTTTTTCATGCTTTCAAGGCTGACCTTTTCCTCTGTCACAATGAGCTTGAAGCCGATAAAAAAGCGGTAGTCCACTTGATTGTCCCCAATCATGGAAACAAGGGCTTCTGTCTGTTCGTCTATCTTCTGTATCGCCACATCACGAAGTCGCCCCGTTACCAGCTTCTTTGACTGTTCCTGTATGCTTCTTACGGAGCTTTCCGTTGCTATCTGCAAGGCGTGTATCTTTCCCTCACGGGACTGTGCGATAAGCTGACGGAAGCTGTCATGGACGATAAATTTCTGTTCCGCAGAGAGGAAACTGTAATTATACGGTATCAGCTCATAGTAGGCGAACACCTCATTGTCCTTGTTCCAGACAAGGTTGTTGTCGATATACTTAATCGGGAACATACATCACACTCCTAACCGCCGTGACAGTTTCGCTGAACACTCCCTTTTCCAGCTTCACGGCTTTTCCTGCATAGGTCACTTTTGGTCGCAGGGCAAACGTAATCTGCGATTTCAAGAAGCTGTAAGGCTTCTTGCCGTCAAAGGTCTTTTGGCTCATGAACCATGTGAGGGCAACGGGAATACCGAAGTATTTTAGAAACGCTCCCTCGATAAGCGATAACGGCGGTAAGTCCCCTAAGAGAATGACGGCAAATTCCGTAATGACAAACCACGTTATCTGGGTAAAGGTAACGGGAAACGGAAGAGTAAAGTCATTGATTGCGTATAAGACTTTCTCCACGTTCCAGATACCCGTGTAGCTCTTAATCTTCTTCAATTCGTTTCAGCTCCTTTCGTTGTAATGGAAAAGGACAGCCATTTTTCAGACTGTCCTTGATAGAAAAAGCTGACAGTAGCGACTTAATCTGTCGCTGATCTGCCAGCTCTAATATGGTATCTCGCATATTCCTCGGCTTGTTTCAATGAATGTCCCCTCTATGGATAAGTCCCTGCCGTATGCTTCATAGTCGATATAGTTCTGCAACGGTAGCGGTATCTCGCCCAGCACTTGTAACTCGTCAATGAAGTAATAGGCAATGTCGGTCATGTCCTCACAGTCTGGATAGAAATAAATGTCGTCCTTGTGTTCGTAGACTTCTTCCAGACTTCCATAGTGGGAAACAAACTCGTCCAGAGCGTCCGTGATATAGTCGGGAAGCTCACAAATCATGTCGTACATCTCGTTGAGTTCTTCAATGGAGATATACTCCCCGATTTCAATAGGGAAGTTGTCGGTATCGTGGACAGCGTATTCCTCATAATAGCTGTTCAGCCCGATACGCTCCGCAACATCTTCCTCGTCGATAGGGAAAGAGAACCAGTCCCCGACAAGTTCACCCTCGTTGTACTTGCCAAGATTAGCGATATACACCCTCATGTCGTCCACCACAGGCACACACCTCTTTTCTATGGAAGCTCATAGATACCGTGGTCGGTTTCCACAAAGCGTCCGTTGTCGTCAAGGTACTGCCCGTAGGCTTCATAATCGAAATAACGGATAACATTCTCACTTAGGGAAGTAAAGTCTGGGTCATTGTTCAGAATGTGGCGGGCAACGTCCGTCATGTTCTTACACCATGAATAGTGGATAATATCGTTTCTGTATCGGTGCAGTTCGTCAAGGTCGGTGAAATAGCACATCAGCTCCCCGTAGTCCTCTTTCAAGTCAGAGGGTAGGCTTTCATAGGTATGGTACAGGTCGTCGAGTTTTTCAATGGTGGTATCTTCCTGCACTTCATCAGCAAAGGGAAGCTCCTTGCCCGTGATAGAGTAATAGTCCGTATCTACATCAATCCCCAGCAGTTCTTCCACCTGTTCTGTGTCGATAGGCAGGGTGAACCAAAAGGCTCTGATTTCTCCGTCTGTTGGTTCTCTTGCTTCAATCTGCACACGCATTTCTTCCATGTTTCATCACGTCCTTTCTTTGTAACGCTTCCTGTATGACGGTATAAGGCACACCGAATACATAGCGTGAAAAATCTTCTAACTGTTTCTTAGGGTAGTCAGACAGGCAAAGGCGTTTCATCTCAAACCAGACCGCACGCTTGTAGTAGGGCAGGTCGGAGAGATACGGACAGCCGATTTTTGCCTGCATATCGGTAAAAATATCTTTCAATCAATCACTCCAATCTGAAATTTGAGTATAGAAAAAAGCGGTTGATCTCCAAAGGAGTAACCGCTTTGTGTCGGGATATGAAATTGTTAAATTAGGATTTATACCCTCGTTTTTTTCTTACTTTTTGCAATATAGATAATTGTGCTGATTAGAGTTAGTAAGATGAGAATATGTAGTATTGTATGATAAACATCTGGTATGTATTTTCCCTCAATCCTTATCCATTGAAGTGTTCCTGAAAAGTATTCTATTTTTCCTTCTATACTGCCCATAATGCCACTATTAAAAATACTATACCATTCATGACATAAAAATTGAATTATAAACCATAAAGAAGTCCAAACAGCAACAAACCACTTTCCAATTTGAACTTTCACAATAAACAATATGACTGTAACAAGATAAATCAGAAAAAATACTCCATCCTCTTTATATGACTGCGTAACTAAACTCTTGTTCCCGAAGGAAACTCCTATCATATCAAGGAAAAACCACATCAAAAGTAACATCTGAAATATAATACAAGTATTTTTCATCTTATTTGCTTTTTCCAAATTACCACCTTCAAAATTCCAATTTGTTGATTTCTATACTCACATTATATCAGTTGAAATTCAACCATTCAATGAAATTCATAGCGTGACTATCTTTTATCTGAATACCATTCCGTCACCCCCTTTCATTCCCCCTATGCCCGAACTCCCATAGGGGGATAGGTAAGTTTGCTTGACCGCAAACTGCTGATTGTTACGCTCCAATGATACGGTTGAACAGCTCTAACAGCACGTCCTTGACACCGCCTGCATTGAATACCAGCCCGACAGCGATTAAGGCAATCACAAGGAAGCCGATTAACTTAGAAAACTCTCTCTTAAATCCTAAGTAGATACCGATTACCACGATTGCCATTAACACAAGGCTCTGTGCGTTGCTTAAAAACCAGTTGTACAAATTCTGTCCGAAATTCATTCTTTATCAATCCTTTCTTTCTTGATTTCTTCCATTTCCTCATCAGCCATTTTACTGACCTGTGCGATACACATAAGGACGACACCGATACCCATTCCCAGCGATACCAGCACCAAGTCCTTGACAAGTTCTGCCATAATCTCAATTCTCCTTTATTCTGTAATAAGCTCCTCTGTGTTCGCTGTCTGCTGTTTGATTATCTGCAAGTGCTTTTCCGTCAGCTTTGCGTTGTCCTCAATCTCTTTCAGATAGCTTGTGCTGTTGCCAGCGTCTATCTTTTTCAGCATTTTCAGCGTAGGGGCGACCTGTCGGCTTATCCAGCCCAGCGTCCTTTCCAATGTGTAAGGCTCTGGGTCTGTCGTCAGCTTGACAGGCGGTCGGTCTTTTCCGATAAACCACGCCCAGCGGTCATTGAGCTTCCAGTCTGTCTTTCGCTTCTCCGGTTCTCTGTCCACGAAGCGGATATAGTGATTGATGATAGAAAATGCCGTCTGTTCTGCGTCATAATGGGTCAGCAGTTCACGAACGGCATAATAGGCTCTTTCATCTTTCAATCGTATCTCGAAGCGGTTCTTTATCGGTGCTTCCTCAATCGGTATTCCCAGCTTTGCGTACTGTTCGTAGTTTTTCTCATAACAGCAGAAGTACACCTCGCTTTTCAGAGAGCCGATATACAGCGTGTGTCCCATGCCCGCCTTGTCCTGCTCGTTGTGCTTCACCAGCTCGCCAGAAGCATAGGACTTGAAAGAGCGGAACAGGGAAACACATTCCTCACGGTTGCATTTGGCGGTCAGATCTGGAATATCCAGTATGCCCGCCCTGTCGTTAATCGCAAGGTCAAGGCGTTTCATCACACCGCCCTCAATGAGTGCGTCTATGAGGAAGTCATACCAGCTACGCCCCTGTGCCAGCAGATAGCTTTCAAACTGGCGACAGCCTTTTCCTTTCAGTTCCAGAAGCGTGCCTTTTTCCTCGTCCTGCGAGGTGTAGACGAACACATCACCCAGATAGTAATGCTCTGTATATTTGTAGTGTCCGTAGTCCTCATGGAGCATATAGTCAACATTGAGCTTCAATATATCTTTGATAACGTGCTGTATATCCAGCGTGGGGAAACGGATACGCACATAGTCGAACAGCAGGAACATGGGAGCGTCGGGATTGAATTTCTCGACCGCTTCCATGAGCTTGTCCTGCATATCGTCCGTGAGCGTTACCTTTCCCGCTTCGATACGGTTCAAATGTTCACGGCTGATACCAGCCATGATTGCAAGCCTTGTCTGGGACACGCCATAAGATTGACGTTTCTCTTTCAAGGCGGTTATAAAGGTGCTTTCATTCAGCCTGCATACCCCCAATCGTGTATTTTTAGAAATCTGTGATATTTTCCGCCGATTGTCACAGTCAGCCATGAGTGAGCGAAAACCCTTATGCCAAGCGGGATTTCCAGAGTTTTTGACCGTAGTTTCCAGCGGAATTGTGCCCCTCTGTTAGATACCGAGGGGCTTCTTTCGCTGGCGTGGCTGACGCCACACCAGCAAGGCTAGTCCGTTCCGTCGCCTTTCGCTTCGCACGTCGCCGTCCCGTCCTGCCTTGTGAGGACGAGTGAGCCGATAGTCTGTAAAAAATCATGCCCCTTTGGTACAAGGGGAGTGTAAAACTCTGATATGACGCTCGTACCCACGTCACAATAGCCACGCCCTTTGATACGCTTCTGGAAGAACTGCTTCTTTACGTCGCTTCCGAAAAGCATACCGTAGCCCAGCTCACTGATACGTCCCAAGCCCACACGAAAGTTGAAGTTATCTCTGATACCGTCTGAAAAATACTTTGCGTCGGGACGCTGGCAGGCGACAATGAGGAAATAGCCCGCCTGTCTGCCAAGCATGACGATTTTCTTTAGCTGGCTAAGTAGGCTCATGCTTTCCTTTGTCCCCAGCATTTCCAAGAACGCCACATACTCGTCAAAGATAAGGAAGCAGGGCGGTAAGCCCAGATAGGCATAATTCTCGCCCGTCTTATAGTCTGGGTGGTGCTTCATTTCCTCGCTTCGCTGTACCATGCCCTCATAGAACGCATTGACGCACTCTATCATATCTTCTTTGGTATGGTACACGTTATCCATGACCGTACCCAAGTCCGCAAGGTCGGCGTTCTTTGGGTCTAAGATGTAAAGCTGGGCGTTGGTCTGCAAGAGGGCTTCAATGATTGTCAGCAGGAAATAGGTCTTTCCACCGCCAGTCCCACCACAGATAAGAGCGTGAGGGAGTGCGTCGTATTCCCATGTGAGGTTCTTCATCAGACGCAAACCCCCGTTTTCAGCCTTTACTTCGTCAATCGTGATACGGTTCGCTATCATGTCATAGAGCAGGGTGTATTCGATATAGCCGTCATGGAGCGTCTTGTCGGTCAGCTCACAGTAAAGCCCGCTTTCCAGCTTATCTTCCAATCTAAGGAGTTTATCTTGATACTTTCCCAAAGAGATTTCACAGCGGATATGGAGCAAGCCCGTTTCCATTTGATAATAGATTTTAGGAAACCAGATGATTTTTTCCCTTGATCTGCTCTGCAGGTCAGTGAAAAAACCGCTGTCCTGTACGGTCTGGGCTTCATACCACTTGTTATCCAGTATCATGCGAGAGAGCTTTTGACGGTGTAATAGCTTCTTGAAGCTGTCATAGCGAAAGTGGTAATAAAGGAAAGCGACCAATGCACAAACACCAGTCGCTATCCCTACGGTTATGAAGTTGTATATAGAAAGCGTCACGCCATTATCAAGCAGGCTGAAATGCTCCCAATCGGTACGCATAAGCTGTCCCAAGTTCAGTAGCAGAAGAACCGCCACGAACAGGAGCAGGAGCGTACCCCCACAGAAACGGTACACAAGGTACTTGTCGCTGGCTCTGATACGATGTCCTTTATATTTCAAAATGTTCATATATACCTCTTTTCTGAATATAAGAAAAGCAGTCAATCATATAGATTAACTGCTTCTAAAACTTAAAAAAACCATGTCCAATTATATAACAGGAAATTACAGATACAGGCTAAAGATAAATAAATAACAGGCTCACATTTTCCCAATGAAGTAGTGCGTTGTTCTTCTTTCTTATTGAAAAAATATTTTCCTATCATTTTCCATACTACATAGCCTAAAATTGCACCTAATGTATTCATCATTAAATCATCAATATCCACTAACCTGTTAGTTGGTAACTGTGCAAATTCAATAAAGATAGAAAATGAAAAACCTGTTAAAGCTACTTTCAAAGGATTTCTAAAGTTCTTCCAAATATAAGGAAGTAAGAAACCTAAAGGCATAAGCATGATTACGTTCATACAATATGTGAACATACCCTCTGATTGAAAAGGTATCAAACTAATATTAGCCTGTTGAAAAGTAGCTATCAACCCACCTTTTGAAATCATATCCCATATAGAGCCTATTCCAGTAACAGAAATAACAAGCCATATATAAATCATCATAATATATGTCCATACATAATGTGTAGATGTAGACTTTTTCTTACCTAGTAACATGACTGTTTGATATACCATTACTGGGGCTACGGTTAAAATAATTGGAATAATTAACATTAAAAGCATTTCCCCCACTCCTTTCTTGCATTGTCAAGCTAAGTGTACCACATATATTTGAAAGAATTTCTAAAAAACATGAAATTCTTTCTTAAATTTTCCTAAGAAGCAGGGGAATAAAATTATTTCTTCGCCTGTCCCTGCGGAGCATGGTTCTGTGGATTGCCTGCGGGCTGACCGCCTTTATTTTTCAGTACAATGTCGTCTGCCTTGATGTACCATTCCACGTCTGCCCCTCGGTAGTTGGCGTTTGCCACCGTATCGGCAACAGGATTGATAAGCTCCACCTCGGCATTGTACGGGTAGTCCTTGACAGGGATTTCCGCTGGGATAGATACCTGTATCGTGCGTTCCTGCTGACTGCACTTCAAATCATAGGTTCTACGCTTGATTTCCTCGCTGGTCGTCCCGTCCTCATTCTCCACTCGCACTTCATGTCTAAGTGCAGAGAATTTCAGCACTCCAAACGTCTTTTCCTTATCTAACACAATTCCATTCGCTAATCTCATAATCTGTTATCCCCCTTTACTTACTTTCCTCGACTGGAAGCATATCATCAGCCGTTAAGATATAGTTTGTGTAGCCACGACCTCGCACGTTCTTTCCCTCGGCGGTAATCTTAGGATTGACGAGCTTGACTTTCTGCTCATACTTGAAACGCTTCTCCCCAGTCTTTGCGGGAAGCACCACGATAATGTCGTCGGCTCTCTGAATGTCGGAATACAGGTTATAACTTCTCGTTACGGGGACAAAGCGACCATTGACACGCTGTTGCTCCACGTCATTTTCTCCTGCAAACTCCAAGTTACCAAAAGTCTGTGCCATGTTAGGCACGATATATTTCAATTTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP041667|84111:104032|96886_97276_-|WP_087303491.1|DBSCAN-SWA MKKIKSYTGIWNVEKVLYAINDFTLPFPVTFTQITWFVITEFAVILLGDLPPLSLIEGAFLKYFGIPVALTWFMSQKTFDGKKPYSFLKSQITFALRPKVTYAGKAVKLEKGVFSETVTAVRSVMYVPD >NZ_CP041667|84111:104032|88226_88961_-|WP_074141166.1|DBSCAN-SWA MRFTIFTGRNFKEIVRDPMSILMCLGFPVLLIIALALMKESIGGMADVFEVQNFAPSMTILGMAFLSTFVGTLMSTDRNSSFLTRLFASPMKGIDYILGYSIPIIPLAILQAVCCFGVALFFGLELNINIFIAILTLIPVAILFISFGLFIGSVLSTNNAVNGFGTILVNATVFLSGAVLSLDMIGGTFKTVCDLLPFSHAVNAVNYAINGNFSDIPLQLVWVLGYALVLIIPSVIIFKRRMKG >NZ_CP041667|84111:104032|85966_86215_-|WP_021421703.1|DBSCAN-SWA MKKGYKLPPYRLISSAVDGNEQSIERILAFYDPYISKCCLRPLYDEYGNVYIVVDMELKGRIREAIIKMILDFDIGTVPTEE >NZ_CP041667|84111:104032|87599_87824_-|WP_074141164.1|DBSCAN-SWA MIGLKKRDMRTILKAGAKARGLSLPEFRAEIQENIDNSMNSQDPVVQKRFKKYFGNKIPTPEEYIYTITKKTKI >NZ_CP041667|84111:104032|98712_99150_-|WP_087303305.1|DBSCAN-SWA MKNTCIIFQMLLLMWFFLDMIGVSFGNKSLVTQSYKEDGVFFLIYLVTVILFIVKVQIGKWFVAVWTSLWFIIQFLCHEWYSIFNSGIMGSIEGKIEYFSGTLQWIRIEGKYIPDVYHTILHILILLTLISTIIYIAKSKKKTRV >NZ_CP041667|84111:104032|97905_98403_-|WP_143928566.1|DBSCAN-SWA MEEMRVQIEAREPTDGEIRAFWFTLPIDTEQVEELLGIDVDTDYYSITGKELPFADEVQEDTTIEKLDDLYHTYESLPSDLKEDYGELMCYFTDLDELHRYRNDIIHYSWCKNMTDVARHILNNDPDFTSLSENVIRYFDYEAYGQYLDDNGRFVETDHGIYELP >NZ_CP041667|84111:104032|103302_103689_-|WP_074143641.1|DBSCAN-SWA MRLANGIVLDKEKTFGVLKFSALRHEVRVENEDGTTSEEIKRRTYDLKCSQQERTIQVSIPAEIPVKDYPYNAEVELINPVADTVANANYRGADVEWYIKADDIVLKNKGGQPAGNPQNHAPQGQAKK >NZ_CP041667|84111:104032|98377_98614_-|WP_060815474.1|DBSCAN-SWA MKDIFTDMQAKIGCPYLSDLPYYKRAVWFEMKRLCLSDYPKKQLEDFSRYVFGVPYTVIQEALQRKDVMKHGRNACAD >NZ_CP041667|84111:104032|91307_92315_-|WP_075877379.1|DBSCAN-SWA MKLRHIALIGSLFPFLLAIVLFFGVLISAEDDDGGGGSSSWVTGMNLSAEVLKHQPMVEKYAKEYGISEYVPYLLAIIQVESGGTAEDVMQSSESMGLPPNSLDTESSIKQGCKYFASLLSSAENQGIEDINVVIQSYNYGGGYIGYVASNGKKHSFTLAENFARDKSGGKKVTYTNPIAVARNGGWRYAYGNMFYVEIVNQYLTITKFDNATVQKIMDEALKYQGWDYVYGGSNPNTSFDCSGLVQWCYGKAGISLPRTAQAQYDATQHIPLSQAQAGDLVFFHSTYNAGTYVTHVGIYVGNNQMYHAGDPIGYTDLTSSYWQQHLIGAGRIKQ >NZ_CP041667|84111:104032|84111_85308_-|WP_087303309.1|integrase|DBSCAN-SWA MSNVKRKDSKGRNLRLGESQRKDGRYVYKYTDIYGKPQFKYAWKLVPTDKTPAGKHEDLSLREKEKQIKKDLADGINTIGGKMTVCQLYAKKNSQRKNVKRNTEKGRQYLMKILENDPLGARSIDTVKLSDGKEWAMRMYENGFSYKTISNYKRSLKASFYMAIEDDCIRKNPFSFKLSDVLEDNSEPKVILTPEQEEKMLAFMEQDKTYSKYYDEVVILLETGLRISEFCGLTVHIDMANRIINVDHQLLKDTEIGYYIETPKTKKGERQLPMTERAYQAFKRVLQNRGKAEPIVINGYSNFLFLNQKGFPKVAGNYESMLRGLVKKYNKTHKDQLPNITPHSFRHTYCTNMANKGMNPNTLQYIMGHSNITMTLGYYAHGTFNSAKAEIERLALAS >NZ_CP041667|84111:104032|86218_86641_-|WP_021421704.1|DBSCAN-SWA MKPSDFQKTIQCQFDCKLKRTVKGVIWNYKKELKRRKDKEIPFCELPEIVVEKLAVWDEYMSDYTVFEVCGVEIRVLDEELSEALKQLSERNRENLLMYYFLEMSDTEIAKAQKISRAGVFKNRYRALQNLKKTLTKERE >NZ_CP041667|84111:104032|99579_99720_-|WP_010817889.1|DBSCAN-SWA MAELVKDLVLVSLGMGIGVVLMCIAQVSKMADEEMEEIKKERIDKE >NZ_CP041667|84111:104032|90392_91286_-|WP_087303311.1|DBSCAN-SWA MFKKKEKPIKEKKVRTMKVGTHKKSVIALWVVLIASVSFGVYKNFTAIDQHTTHEIETIQLRLNDTNGIENFVKNFCKSYYTWCNSKEAIEARTQAISAYLTQELQDLNVDTVRTDIPTSSTVTDVLIWDVEQFGENAFSVTYEVDQQIKEGDQTRNVSETYTVTVHVDADGDMVIIQNPTLAPAMEKSSYEPKAQEADNSVDADTVNDATAFLETFFKLYPTATDKELAYYVEGNALEPINGDYLFSELVNPVFTKDGDNVNVKVSVKFIDNQTKATQVSQYELTLHKDSNWKIIE >NZ_CP041667|84111:104032|90074_90314_-|WP_074141169.1|DBSCAN-SWA MSDENKKTKNDKFVGSVKVGPKGQIVIPKEAREMLNISTGDTLVLLADAKQGIAVQKYEVYKDFFEQVFNSGQSPTKSE >NZ_CP041667|84111:104032|88957_89689_-|WP_074141167.1|DBSCAN-SWA MNAIEIRDLTKRYGNLTAVNSLSLNIKQGEFFAMLGSNGAGKTTTIKMLSCLTEPTSGDALMLGNSLRKQADKVKDIINLSPQETAVAPKLTVKENLMMIARLHGSSKQEAREKADKIMETFELTDRKNDRAKQLSGGWQRKLSIAMALISNPKILFLDEPTIGLDVRARRELWGTMEKLKGKLTLILTTHYLEEAEELADRICIMDKGVVQVLGTADEIIKVSGARNFEEAFLMYTERGTEL >NZ_CP041667|84111:104032|85387_85591_-|WP_021421702.1|DBSCAN-SWA MNNTDVPIWEKYTLTIEEASKYFRIGEKKLRKLAEENLDAGWVIVNGNRVQIKRKQFEKIIDTLDEI >NZ_CP041667|84111:104032|102583_103189_-|WP_074143640.1|DBSCAN-SWA MLLMLIIPIILTVAPVMVYQTVMLLGKKKSTSTHYVWTYIMMIYIWLVISVTGIGSIWDMISKGGLIATFQQANISLIPFQSEGMFTYCMNVIMLMPLGFLLPYIWKNFRNPLKVALTGFSFSIFIEFAQLPTNRLVDIDDLMMNTLGAILGYVVWKMIGKYFFNKKEEQRTTSLGKCEPVIYLSLACICNFLLYNWTWFF >NZ_CP041667|84111:104032|85696_85906_+|WP_116632638.1|DBSCAN-SWA MRVYHYPVNTGRGKTTFPTVALGSLHGSLASPYGKGRGVQSPWLAVSQTYLSALFTCQRTVAKAIGSHV >NZ_CP041667|84111:104032|92311_94453_-|WP_087390348.1|DBSCAN-SWA MIDRIKGLLTKKKLFRFFKIALIVIFTSVVFLSLFGTVAHATGLVDDTVNADNLYSQYPLENYQLDFYVDNSWGWLPWNWLDGIGKSVQYGLYCITNFIWTISLYLSNATGYVVQQAYKLDFINDMADSIGKSIQTLAGVTENGFGSTGFYVGFLLLIILIVGIYVAYTGLIKRETSKALHAVINFVVVFVLSASFIAYAPDYIKKINDFSSDISTASLNLGTKIMLPDSDSAGKDSVDLIRDSLFSIQVQQPWLLLQFGNSDIEEIGADRVEALVSASPSADDGATREDVVKTEIEDNDNDNLTIPQVINRLGMVFFLLIFNLGITIFVFLLTGMMIFSQILFIIFAMFLPISFLLSMIPSYENMAKQAIVRVFNTIMTRAGITLIVTVAFSISSMFYNISTDYPFFMIAFLQIVCFAGIYMKLGDLMSMFSLNAGDSQSMGRRIFRRPYMFMRHRARRMEHRIARAVGAGSVAGAVAGTVAGTTMNDGSGKSTRAGTGRKERRENTSFGSRAGSAIGAVLDTKNKVKDKATAVKENIKDMPTQTAYALHSAKEKAKENVSDFKRGIVQGQETRQSQRADKREQHRQNIADKRIELQKAQEAKQSGKATDGSATTGATRPHDRPVTASPASKTDTGKAQEIKRPVTTDKVHTEPTKAQPVKERPLASKSAEPVLQKHGTRLNGVELNQQTTRKERTIKKATLNMTGKKGHKK >NZ_CP041667|84111:104032|99735_100926_-|WP_087303303.1|DBSCAN-SWA MNESTFITALKEKRQSYGVSQTRLAIMAGISREHLNRIEAGKVTLTDDMQDKLMEAVEKFNPDAPMFLLFDYVRIRFPTLDIQHVIKDILKLNVDYMLHEDYGHYKYTEHYYLGDVFVYTSQDEEKGTLLELKGKGCRQFESYLLAQGRSWYDFLIDALIEGGVMKRLDLAINDRAGILDIPDLTAKCNREECVSLFRSFKSYASGELVKHNEQDKAGMGHTLYIGSLKSEVYFCCYEKNYEQYAKLGIPIEEAPIKNRFEIRLKDERAYYAVRELLTHYDAEQTAFSIINHYIRFVDREPEKRKTDWKLNDRWAWFIGKDRPPVKLTTDPEPYTLERTLGWISRQVAPTLKMLKKIDAGNSTSYLKEIEDNAKLTEKHLQIIKQQTANTEELITE >NZ_CP041667|84111:104032|89685_89949_-|WP_087173369.1|DBSCAN-SWA MLVIPKGGDIVNRDVTEHYSHNGRYLGTLKVGTKGQILIPKEVRDFFGIHIGELLVLLADKDKGIALQKMDGCNEAFQMAFQKEDKE >NZ_CP041667|84111:104032|87843_88224_-|WP_074141165.1|DBSCAN-SWA MIKVFSLTNLNVEYITDYQGLAIFQQIQNYLLPINNGSLENIAILDMDADKDIILSELQECAIFHRIALSSCWTLDKVIIALECNCLAFLENGNLTSINDTINLVTRHKVSVFDKSVIAIINSFLI >NZ_CP041667|84111:104032|87167_87530_+|WP_032509185.1|DBSCAN-SWA MRKKKDTHSFDFRPLGLAIREAREKAGLSRNDLGDKVFYGERHIADIENIGKHPSFQLFHDLVTMFHISVDEYFYPQEKAEKSTIRRQIDVSLDELTDEELKIIQGTIDGIMKYKGTGSK >NZ_CP041667|84111:104032|101126_102521_-|WP_143928567.1|DBSCAN-SWA MNILKYKGHRIRASDKYLVYRFCGGTLLLLFVAVLLLLNLGQLMRTDWEHFSLLDNGVTLSIYNFITVGIATGVCALVAFLYYHFRYDSFKKLLHRQKLSRMILDNKWYEAQTVQDSGFFTDLQSRSREKIIWFPKIYYQMETGLLHIRCEISLGKYQDKLLRLEDKLESGLYCELTDKTLHDGYIEYTLLYDMIANRITIDEVKAENGGLRLMKNLTWEYDALPHALICGGTGGGKTYFLLTIIEALLQTNAQLYILDPKNADLADLGTVMDNVYHTKEDMIECVNAFYEGMVQRSEEMKHHPDYKTGENYAYLGLPPCFLIFDEYVAFLEMLGTKESMSLLSQLKKIVMLGRQAGYFLIVACQRPDAKYFSDGIRDNFNFRVGLGRISELGYGMLFGSDVKKQFFQKRIKGRGYCDVGTSVISEFYTPLVPKGHDFLQTIGSLVLTRQDGTATCEAKGDGTD >NZ_CP041667|84111:104032|103705_104032_-|WP_076779936.1|DBSCAN-SWA MKLKYIVPNMAQTFGNLEFAGENDVEQQRVNGRFVPVTRSYNLYSDIQRADDIIVVLPAKTGEKRFKYEQKVKLVNPKITAEGKNVRGRGYTNYILTADDMLPVEESK >NZ_CP041667|84111:104032|94452_96900_-|WP_087303493.1|DBSCAN-SWA MFPIKYIDNNLVWNKDNEVFAYYELIPYNYSFLSAEQKFIVHDSFRQLIAQSREGKIHALQIATESSVRSIQEQSKKLVTGRLRDVAIQKIDEQTEALVSMIGDNQVDYRFFIGFKLIVTEEKVSLESMKKSAFMTFKEFLNEVNHTLMNDFISMPDDEINRYMKMEKLLENKISRRFKFRRLDKNDFGYLIEHIYGRDGVAYEDYEYSLPKKKLKKATLIKQYDLIRPTRCLIEESQRYIRLEHEDSESFVSYFTVNAIVGELDFPSSEIFYFQQQQFTFPVDTSMNVEIVGNRKALSTVRNKKKELKDLDNHAYQSGSETSSNVVDALDSVDELETDLDQSKESMYKLSYVIRVSAPDLDELKRRCDEVKDFYDDLNVKLVRPAGDMLGLHSEFLPASKRYINDYVQYVKSDFLAGLGFGATQQLGENTGIYIGYSVDTGRNVYLQPSLASQGIKGTVTNALASAFVGSLGGGKSFCNNLIVYYSVLFGGQAVILDPKSERGNWKETLLEIAHEINIVNLTSDKENAGLLDPFVIMKNVKDAESLAIDILTFLTGISSRDGEKFPVLRKAVRAVTQSDQRGLLHVIDELRREETAIARNIADHIDSFTDYDFAHLLFSDGTVKNAISLDNQLNIIQVADLVLPDKDTTFEEYTTIELLSVAMLIVISTFALDFIHSDRSIFKIVDLDEAWAFLNVAQGETLSNKLVRAGRAMQAGVYFVTQSSGDVSKESLKNNIGLKFAFRSTDINEIKQTLEFFGIDKDDENNQKRLRDLENGQCLLQDLYGRVGVVQIHPVFEELLHAFDTRPPVKSEVE >NZ_CP041667|84111:104032|97384_97888_-|WP_143928565.1|DBSCAN-SWA MVDDMRVYIANLGKYNEGELVGDWFSFPIDEEDVAERIGLNSYYEEYAVHDTDNFPIEIGEYISIEELNEMYDMICELPDYITDALDEFVSHYGSLEEVYEHKDDIYFYPDCEDMTDIAYYFIDELQVLGEIPLPLQNYIDYEAYGRDLSIEGTFIETSRGICEIPY >NZ_CP041667|84111:104032|99361_99583_-|WP_010817890.1|DBSCAN-SWA MNFGQNLYNWFLSNAQSLVLMAIVVIGIYLGFKREFSKLIGFLVIALIAVGLVFNAGGVKDVLLELFNRIIGA |
28 | Streptococcus_phage(93.75%) | integrase | attL 72541:72556|attR 116186:116201 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
571879 : 594244
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP041667|571879:594244|DBSCAN-SWA CTTAAAAATTGATCGCGTCATTGACTGCCTCTGCCGCATTCTCCCGCTCCAGTATAATATGATTATATACCTCCAGGACCATCCTCTCTGTGTCTCCCATAAGCTGGGCGATTCTTCCGATCGATATCTTGGGAATCTGATAGCAGAGTTGCGTACAGTAATTGTGCCGGAACATGTGCCCGGTCAGGCCGGTGATCGGTTCGTCTGCGGCCTCATTCATCCGGCGCAGGATCCTGGCCCACATCTTGTCATAGCTGCTCTTGGTCACAGGCTGGCCGCTCGTTATCGTGAAGAGATAGGTCCTGTTTTTAGATTTCAGGCTTTGGACATATTCTTTCACGGCCGGAAAGACCAGATCCGGGATAGGTATGGTCCGGTACCCGTTTTCTGATTTTGGAATCTTAGTGATCGGGGTCCCCCGGACAAACTTGTGAGCCTTTGTTATGCTTACCTCCTGGGTGTCTATATGTATATCGAAAATTGTCAGCGCCAGAGCCTCCTCGCGCCTCAATCCGCAGCCATAGATCAGATAGACATAGATTTTATCCTGATCGTCAAAATCGGCCTTAAAAACGGCCCTGCGCTCGTTGTCAGTTAACGGCCGCTTTTCCGGGGCTGTATATTTCACTGGAGGAATAGACTGGAAGATGTCGGCCATAATGTTAGCGGGGTACAGGTGATCCGTCACGGCGGATTTCAGGATCTGTTTGAAGGTCATATAGATTTGCTGCTGGGTCCGTGTTTTTCCGTCTGCGTTGTTTAGGACCATCTGCAGATGGATCCGCTCAATGGACTGAAGCTGTATGCCTTCCAGGATAATCAGGTGCTTTTCGATCACGTTCTCATACATGGCCTGCGTGTTGGTTTCCGCCTGGTGCTTGTAGACCCGCTTCCATTCTCTCGCATAGTCCAAAAATGTTACATCCATCTTTCTGATCTGCCGACGCTCCCGGACTTGCTGCTCAAATTCCTGAACCTTCCGCTCCAGGTCTCGGCTGGATTTTTTCGATCGGATTGGTTTCCGGTGCTTTTTTCCAAGGTCGTCATAAGTCCCATCCCATACTCTGGCTGTAAAATATCCGTCTGAGCCTCGTGTGTACTTTGCTGTTGCCATGTCATCATCTCCCTTTTTTTGAAAATTGCATAATTGCATTTTTAGTTGCAATTTATGCGGAAAAGTTGCAATTTTGGGTATAAAAATAACAGCCAGCAGGAACGTGAGTTCCGCTTGCGCTGTCTCCCGGAAAATGATACAATACATATGACTCGTTGAAAAGCATTTCCTGATGTCCGGGAGGTGCACAGCCGCTCTGGTGTTGGTAGCGCCAGGGCGGTTTTTTAATAGACGGAAAAGCTGGTCACTTCTGGACTATCAGAAGTTCCAGATACTTTCGCTTCACAAGTTGCATCTGATTCAGCACCGTATTCATTGGTAATGGTGCATTCAGCTTTCAAAAACCAGGTTGTTTCATTTTCTGGTTGCTCTGCCAATTTCCCCATAATGTAGTGAAGCTCAAATCCATATGGATATTGAGACTTTCCATATTCATCTGTAGCCTGCCAAGCAGATGAGGCACTTAACGTTTCCTCCAATGTTTTTTTAATGGCGGAAGAGGCTTCATTTGAGGCAATATTTTCACTTGTATTCACAATTACTTCTACTGTCTTAGATTCAATATTGTGTGATTGCAATCCGGTCACAATCCATTTTCCCAGCTCGCTATCGCTATATACGGATAAATCTTCTGTTATTTCCAAACCGGAATCCTGATGAAAATATGTGGCTGTATACCCAAGATCATCCAATTTTGCTTTTGCCTCTGGAACAGTCAAATTTAAAACTTCATTTAACTGATCTGATTCAAGTTCTTCTGTTTTTTTGCATGACAAAGTGATTTCTGTTTCCGCTGTTTCTTCACTACCGGCTTCTGGTGTTTGCGTTACAACAGTCCAATTCGAAGAATCTAAAATTATAGAGCTTTCACCGGAAGAATCTGCAGATAGCGTGACAGTTGTAAACCCGGATTCTTTTAAAAGGTCGGAAGCTTCTTGAGCATTCATTCCCGTAACATCAGCTACCTTTATTTTCTCTTCTTCGCATGCTAATAGTCCTACGGATAATAGACATATTAAGACTGCTATAATTAAGGCTTTTTTCATAACTTTCCCTCCTCATTTGTACCTTACATTGAGAAATATATAATCCGCCTTGGCGGTTATACCTGCAATTTTCCTATCACCTTTCCCAGGCAAACGACATTGTCATCTTCCATGACAGGGATGTTAGGGTATTCTGGATTTCTGGAAATCAACTCTGTCTCACCTAATTCTTTTATAAAAGCCTCTCCATTTTTCACGAAAATCCCAACATCTCCAATGTTCATTTCCTTTCTCTGAGAAACTAGAGCGGCATCCCCATCATTAAAGTCTGGAGACATCGAATCCCCATTTACTCTAACTGCGTAATCCACATCATCATATTCCGGCCCGATTGGTATTGCCAACTGGTCTGCGGCTTCGTTACCAAGGATATAAATACCGCTTCCAGCGGAAGCGCTGTGTTGATAGGGGATGAACCTATTCAGCGAATTACTGGTCGAGGTGTATTCTATAATGTTCCGTGATTCTTCCCTATCATGAAGACGATTTAGGAGAGTAGTAACGGTCTCCTGGTCGTAGGGGGATAGACTGCGGTATTTTTTAATAATTTTTAATTCCTCGTATGTAATGCCAGAACTATTCTCTTGTGAAAATTCTTTCAGCATATCACATTCATAATATTCACATAGCTTTAGTAGCATGTCGCCGGACGGTTCGGACTTTCCTACTTCCCACGAACTGAATGTTGATTGTGGTATATCAAAGTGTTCATATACTTGTTTTTGAGAAAAACCGGCACGAAGGCGCAATTCTTTTAATTTTTTTGAAAATTCTTTGCTTGCCATATAAGCACCTCCTTTTAATTTCTATTATAACGAAACATTTTAACAAGTCAATAAAAATATTGAAAAATTCGTTAAGACATATTGACAAACGAATAAATCGATATTAATATATGATTATAACGAAATATTCGTTGTACGGGTCGCATTGGAGCGATGGTTTACGGTTACAGGAGAATTACTTCTCCAAAATGTAAGACGGAACAGGGTTGCTTACAAAATACGCGATTGAAACAATTCTGTATCCTTCTTTGAGAAGTTTTTTCACTTCTTCATATGAATCAGTGTGCTTCTTCGTGAGTATCACCTCCCTTCGAAGAGGTGATTTGCGACCCGTAAAAATATTATATCAGAAAGGAGTGATAAACGATATGTCCTTAGAGGATAACCTTGCCAAATATATAAAAGGAAAAGCAATCAATCTGTCGGACATGTCAAGACAGACAGGAATTTCTTATATGTCTTTGTACAACAGCCTGTTTAATGAAAATAGTGAAAGACAACTAAAAGCACGAGAACTTGTGGCAGTATGTATTTTCCTCGAAGTTGATCCGAGGGATTTTGTCGAGAAAACAGATGAGAAGGAGGTGAAAGAGAAGAGATGAAGTCAGAAAAATTCAACTCTATCCATATTGATTTGGAAAAAGGGATTTATAAAATTAACGGTGAGCCAGCGGACTTTATAACAGAACTGAAACTAGAGTGGAATCCAGAAGATGGATGGGCGCTCAACATATCTAAGGATAAGACATATTTGGCGCCCATTCAAAGGATTAAGGAATGAATTTAGAAATAAAATCAGTGACTTCTATTAATCCATTTTTAAAGCGGTTTTCCATGTAGATGATGCAATGTTCTGTGATTATGAAAGTTCCATCCAGATATGTTTTAACAAAGTTGGCTCTTTTTAATTCACCTATTGTATATAGATAGTCATCATGAAGCCAACCTGATAATTTAGATGTATTTTCCCTAAAATCTGGATGAAACGTATTTGCTTGACGCTTCGGAACGCCCCTCTTTCTACGATCGAGGTATTCTTTATAAGCGCAGCAAACGAGTAAATCCGCATCTTTTGTTAATTGCATAAATTCACCTCCCTGTAATGGGATTATACCACAGAAAGACTAACTATTAAAAGTCAAGTCTAAAATGAATATTTTAATGAAATAGGAGAAGGAGGTGGTGTGACGGTGAAATGTCCCAAATGCGGAGCAGAATCAAGAATCAAAGAAGACGATGTGTTTTGCTGTAAGTGCGGATATCCGTTAACGGCAAAAGCAAATGATGGCGACGATGCGGAATTAAAATCATTTTTCCTGGATGTGAGCTCCGGGGTAATGCTGATAAATGGTAAAGAAATAAATAATGTGACCGCATTCTCCTTGAATTTTAATGGAGGAAAATACGGTCTGTGTGTTACTCATGATGATCCATATAAAGCTATTGCCCCGTTAAGTATTGAAAAAGAAGTTTCATAGCGAGTTCAGGAAGCCAGTGATGCTTGGCCAATATATCACTGAACTTTGAGAGGAATCCCTTTTTTAAAGGAGCCTCATTTTCAATTAGTGCTTCGATGTATGTGACCATTTTTTCTGCTTCTGCTTTGTCCTCCGCCGAGATGTCTTGTGAGCGAATAAAAGACAAGGCTTCATTAACAGAGATTTCGTTGTTGATGGTTACGGCGCCGGAATTCCCAATAGCCGATCCAGTGACGGGAGCGTTGAAAATGTTAGTTTGAGCAGTGGGAGCTGTGACAGATTCAAGTTCTTTATAATGCATACCTTGGTGTGTGACATCCTGCACGAAACCGCCGCCTTTATAGCGTTTGATTCTCACATATCCTAAAGATTCCAGGTATTGCAAGGCTGAATCTACAGCGGAAAGTGATAATCCGGTATTTTCTGATATCACAATAAAATTCATCATGGATGATTCATAGTTTGGAAAAGCAGAAATTAAATAATCGAGAATGGTTTTAGATTCATTTGTAAGCAAGAAGTCACGTCCTTTCGTCATTTGATAGGAAAATTATACCAGATTTTTAGAGGAGGTGAAACGATGAAATATCCAAAACCAGTAATGCATAAGACTGAACTGCTTGACATGGGCTTTACCAGGACGTACCTGGATCGGGCCATTGCGGCACCAGGTCAGACATTCGCCTGGCCGATCAATCCGGCCAATAAGTCCAGTCCTGTGCTGTTTGATACAGAGGGGTTTGAACGCTGGAGACTGAAGGAGGTTGCCATGCGGGAGAGGGCGAGAAGGCAACAGAGAGGGGTGATGTAATGAAGAATAACCTAAAATTCGTAGAACCCATGAAGTTAAAACCCGGAACTGACTGGGAAAAACGACGGGAAATCCGGAGGAAAGAGCGGGAACTTGACCGCTGGTGGTGCAGGCTCTACCGAGCGTCACTGGCGCTGGAGATGGTCATAGCGGTCTTAGTGGTCTTGTGTATCGCACATTGCACGGGGCTGCTGCAGATGATGTGAGGAGTGGGAAAGATGAAGAGGTTAACAGTAAGCAGGATTGACGAGATCATCCGCACTCTCGAATCTACAGAGCATATAGATGACCAGACGCAATACCATAAAAATATGGCGATATCATACCTGAAGAATTATGCGGATCTGCTGGATGGGAAGGGGATTAAAAGTGTGAAAGTGAAAGAAGATGAGAAGAGTGGAAGAGGATAAGCAGAAGATCCTGGACTTGTTACTCCCAGCATTGCGGGAGACAAGAAACCTGCACGATCTGGATAACCTGGAATACTTTCCCCAGAACAGCCTGGTGTACGCCACATTTAAAAACGGATACATCAAGGTGGTTGATGTGGAGTGTGATTCTGGGACGGCTATGATCCAGGACGTGATCAGGGGGATCGTATAGGACGAGCGCCCTTACGAGGCGGCAACCTCGGGCGCTCTGGGATTTAATCAATATTATTGTAATCGAAAAAAGGAGAAATGTAAAGATGGCATACATAGGGATCGGGCCAGAGGAAGGGACCATCGCGGAAGAAGACCGGGCTTATGAGTATGCTCTGGAGAGATGTCTTCACGGCACGCCACAGGATCAGCAGGATTTCAAGGAACTGCTGATCGAGTGGTTCTATTCCGGGAACTGGATCAGGAAGGAGGACCATGGATAGGAAGGAGATCCTCTACATCATCCGTTGCTCTGACGGGACTATGAGATGTATCTATGGAACCTATGAAGCGGCGGTAGAAATGGCAGAGGAATATGTCAGGGGTACAGGCCTGACGTATGTGATCAGTTAAGGAGGCGCTATGGAGGTATATAAGATCTATGATTTCCCGGATGAATCTGCATGGCTGAAAGGCCGGATGAACGGGATCGGCGGGAGTGACGCCAGCGCTGTGATCGGCATGAATCCATACAAGAGTAACATCGATCTCTTCGAGGAGAAGATCGGCAGGCGGATCCCGGAGGACATCTCGGACAAGCCCTGCGTAGTCTATGGAAAGCTGGCAGAGGAGCCTATCCGAGCCCTGTTCCGGCTGGACTATCCAGAATACCAGGTAGAACATCATGAACATCGGATCCTGCAGAGTATCAAGTATCCGTTTATGCAGGCGTCCCTGGATGGGGAACTGGTGGACATGGATGGCCGCCGCGGGATCCTGGAGATCAAGACCACAAATATCCTGCAGTCCATGCAGCGGGAGAAGTGGAAGGACAGGATCCCGGACAACTATTATATCCAAGTGCTGCACTACCTGCTTGTGACAGGATATGAATTTGTGATACTCCGGGCGCATTTGAATACAGACTGGAACAGCGAGAAGCGGACCACAGTGAAACATTATTTCATTGAGCGGTCTGAGGTGCGGGAAGATCTGGATATGCTGCTCCGGGAGGAACAGAAGTTTTGGGAGTATGTGGAGAGCGGCAGGAAGCCGCCTCTCGTACTACCAGAAATATAAAAAAGAGAAGGAGGATTCTATGGAATTAAGGATCATCAGTCCCCAGGAAGGGGGATTCATAAAGGAAATCCAGTGGAACAAAGAAGAATTGAAAGCCTGGGTTGCCGAGAAAGTGAAGGATTATCGAGGATTGGTATTTACAGAGGCAACGATCCCGGACGCAAAGAAGGACCGGGCAGACCTCAACAAGCTGAAGAAGGCATTTGAGGACGAGCGGAAGCGGATCAAAAAGCTGTGTCTGGAGCCATACGAAGAGTTCGAGCGGCAGGAAAAGGAACTGATCGCCATGATCGATGAACCGCTCCAACTGATCGATTGTCAGATAAAAGAGGTGGAGGAGAAGCGAAAGGAAGAGAAACGCTTGCGAATCAAGGATCTGTTTTCAACGATCGGTTTTCAGGCATTTGTCTCTTTGGATATGATCTGGGATGAGAAATGGTTAAATGCAACGGTGTCTATGCCCAAGATCGAGGAACAGATGAAGAGCCGGATGTATCAGATTGGGGAAGATGTTTTCACAATCCGCAACCTTCCGGAATTCAGCTTTGAGGCTATGGAGGTCTATAAGAAGTCGTTGGATCTTACTAAGGCAATCCAGGAGGGACAGCGACTATCCGATATCCAGAAGCGAAAGGCAGCCTATGAGGAAGAGAGGCGCCGGAAAGAGGAAGCGGCGAAGCAGGGATCCGTTCACTCGGCGCAGGAACAGGAGGCGGTTCCCGCGGCACAGGAAGCCCAGTCGGAAGCGAAAGAAGCAGAGGATGATGTGGTATGTCTGGATTTCCGTGTATGGGGAACCAGGGACCAGATCATGGGTTTGAGACAGTACATGATTGGCAATAAGATTCGATTTGGAAAGGTGGAATAAGAAGATGGCAGTACAGAACAGTTTGGCAAAGAGTCAGCAAAGAATGGGACTTACGGCGTATCTGACGCAGGACGCTGTGAAGCGGCAGATCAACAGTGTAGTGGGTGGAAAGAATGGAACCCGGTTCATTTCTTCTATTGTGTCGGCTGTACAGACTACCCCAGCGCTTCAGGAGTGTACAAATCAGAGTATCTTATCAGCGGCGCTTCTGGGTGAGGCGCTGAATTTGTCACCATCCCCGCAACTGGGACAGTTCTATATGGTGCCTTTTGACAATAAAAAGAAAGGCGCCAAAGAAGCGCAGTTCCAGCTTGGGTATAAGGGGTATGTTCAGCTTGCAGAACGGTCTGGATATTATAAGAAACTGAACGTACTTGCAATCAGAGAAGGTGAATTGATCCGGTACGATCCTCTGAATGAAGAGATTGAGGTGGAATTGATCGAGGATGATGTTGTCCGGGAAGAAACCCCGGCCATGGGGTATTATGCCATGTTTGAGTACGAAAATGGATTCCGTAAGACTCTGTACTGGTCAAAAAAGAAAATGCTGGCCCATGCAGAGAAGTATTCCCAGGCGTTTAAGCGGAATGGCGGGGCAAAGTCCCTGGAACTATTGGAACAGGGGAAAATTCCAGAGAAAGAGATGTGGAAGTATTCCTCCTTCTGGTTCAAGGATTTCGATGGTATGGCCATGAAGACCATGCTCCGGCAGTTGATCAGTAAATGGGGAATCATGAGCATTGATCTTCAGACTGCGGTGGATAAGGATATGGCGGTGATCCACGAGGATGGGACAGTGGATTATGTAGAAAATGAATCGGTAGAGGATGATGTGGTGTCTGATCAGGACTTGAATGAGGTTCCTGACAATCATAATAATCCGCAGACAGAGCCGATACCACAGAAAGCGGAAAAACCGGACATTGAGTCAGAGTTTTTCAATAACTAATCTATTAGAGATAGAGCATTTCCTTTTTGGTTGATGTATCACATGTAACTTATCAATAACATTTAGCCCGCGCTGGATCAACTACAGTGCGGGCGGAAAGGAGAAACCTATGCGGTCAGTGACCTTTCAAGTCCCGGGAAAGCCCCAGGGAAAAGCCAGGGCCAGGACCTTCTACAGCCCGTCATCCGGCCGGCACATGTCTCACACGCCAGACCGGACCGTCCTGTATGAAAATCTGATCAAGGATCAATTCCTGAATCATGCGGACGGGTTCTATCTTGAGCGGGAGATCCCGGTGTCGCTGCGGATTGTAGCCAGGTTCCTGCCGCCAAAGAGCGCGGCAAAGAAAAAGCAGCAGCAGATGCTGGGAGGTGAGATCCTCCCGCTGAAGAAGCCGGACATGGACAATATCGTGAAGGTGGTGGCGGACGCTTTGAACGGCGTGGCATATCACGATGACACCCAGATCGTTCTGGTGTCCGCGAAGAAAGCCTATTCCGCGATGGAAGGGCTGGATATCACGGTTGAGGAATATTTAGGCGGGAGGTAGGTGAGGACATTGGCCAGACCAAAGAAGAGCGGCCTGTCATACTTCCCTCTGGACGTGGATTTCTTTGAAGATCCAAAGATAAAGATACTGAGAGCCCGATATGGGCGGGATGGTATCGTACTATACATCTACCTTCTCTGCAGAATCTATAAGCAGGGCTACTATATACAGGTGGATGATGATTTTGAGTATATCATATCTGATGACCTGAAAATGGATCAGAATAAGGCGAAGCAGGTCTTGAACTTCTTGCTGTCACGGTCACTGTTTGATAACACACTTTTTCAGTCGGACAAGGTCTTGACCTCTGCCGGAATACAGAGGAGATTTCAGCTGGCGGTAAAAGAGCGGGCAAAGAAAAATCCGATCGAAGTCGGGCGGTTCTGGCTTTTGAAAAAAGAAGAAACGGAACCTTTTATTAAATGCACCGTTTTTGGTGGTTTTTCCGGGAAAAACGAGAGTAATTCCGGGAAAAACGAGAGTAATTCACCGGAACAATCCCTAAAGAAAAGTAAAGTAAATAATATATATAATACTGCGTTTCCGCCAGAACTTGAGAAAGCCTTCCAGATGTACCTCCTTGTGCGTAAAAGTAATTACGGGGAGATCCTGCCGGAGCAGATCCAGGCATTGAGGGAAGATCTCCTGCAGTTGTCCACAGACGAAAAAGAACAGCTTGCTATCGTAAAGAAAGCCACAGCAGGAGGATGGAAGAGTTTTTACAAGTTGAAAAAGGAGCAAGCAAAGGACAAGCAGAATGGGAAGAAGAAAGGATTTGATAATTTCCAAGGTCGGGACTATGACATGGCAGATCTGGAGAGAAGATTAATAAAAAGGGGTGATATGTCATGATATCGACAGATCAGATTGAGATTTATGGTTTTGAATCCCTGGAGGATTTTGAACGATTCCGGCTGCGATGGATAATGGTCTGTTCAAAGATCAACCGCAACGGAAAAAATGCTGGCCGTTCGGAAGTAGCAGAGCGGCGGGCAAAGATATTAAAGATTATTTTGTAAAAACGGAAAGGAGGCGGGAGCCCCGGCCGGGAAAAGATATCTGGCTTCCTTTCGACATGGATTATTTAGAGTTTTTAAAAACAAAGATAGAGATTGCGCCGGAGAGTGGATTTGATGTAGATCCGGATGTGATCAATCCAGCGTTAAAGCCACACCAGCGGGACGCTGTGGCCTGGGCGTTAAAGGGTGGAAGGCGGGCTCTGTTTGAATCTTTTGGACTTGGCAAAACGATCCAGGAACTAGAGTTTTGCCGATTGGCCGCAGGACATGAAAAGAGACGAGCATTGATCGTTCTTCCTTTGGGTGTGAAACAGGAATTTTTGAGGGACGCCGCACAGCTTTTGAAGTATAGAGAAACATGGCCAGGATATAGGGATCCAGAATATTGTCGGACCATGGACGAGGTGAGGAAGTCGAAAAGCCAGATTGTGTTGACAAATTATGAACGGGTCCGGGATGGAGATATCGATCCGTCATATTTTGGGGCAACGTCCTTAGATGAAGCAAGTGTACTGAGATCCTTTGGGAGTAAGACATATCAGACATTTCTGGACAAGTTCAAGGGGGTGCCCTATAAGCTGGTGGCGACGGCAACACCGTCTCCAAACCGATATAAGGAGCTGATCCATTACGCAGGATATCTGGAGATCATGGACACCGGACAGGCGCTGACCAGGTTTTTCCAAAGGGACAGCACAAAAGCTAATAACCTGACACTCTACCCCAATATGGAAGATGAGTTCTGGCTGTGGGTGAGCTCCTGGGCGTTGTTCATCACAAAGCCGTCAGACCTGGATCCGGCTTATTCAGACGATGGATACGTGTTGCCGCCTTTGGATGTGAGATGGCACGAGATTCCAATCCACTATGGAGACGCGACAGAAAAAGATGGTCAGATGACGCTCTTTACAGACGCTGCCGCTGGACTAAAACAGGCGGCGGAAGTGAAGCGGAATACGATCGATATCCGGGTACAGAAGTTGAAAGAAATCGTGGAGGCGTATCCTGAAGAGCATTTTGTCTTATGGCATGACCAGGAAGCGGAACGCCACGCAATCAAGAAGGCGCTTCCTGAGGTCGTGGATATTTATGGATCTCAGGATTATGACATCCGTGAACAACGGGTGATCGATTTCTCAGAAGGAAGGACCAGGCTGTTTGCAACGAAGAAATCCTTGTCTGGATCTGGCTGCAACTTTCAACGGTATTGCCATCGGGAGATTTTCGTTGGGATTGACTATGAGTTCAATGATTTTATCCAGGCTGTGCACCGGTGCTACCGCTTCCTACAGGATAAACCGGTGATCATCGACATTATCTATATGGAGAATGAGCGGAAGATCAAGGAAGAACTGATCCGGAAATGGAAGGACCACGATCACATGATTGAGAAGATGATTGAGATTGTGAAGAAATATGGTCTTTCTTCCGCCGGGAAAGAGCAGGGACTGAAACGAAAGATGGGAGTGAAAACGGTGAGAGTAGAAGGCAAACACTATATGGCGGTACATGATGACTGTGTGGAAGAAGCAAGGAGGATGAAAGGCAATTCTGTTGGGCTGATCCATACTTCAATCCCGTTTGGGAACCATTATGAGTATAGTGCCAATTACAATGACTTTGGTCATAACCAGGATACGGAACGATTTTTTGAGCAGATGGATTTCCTGACACCGGAACTCTTTCGAGTACTGGAGCCCGGAAGGGTGGCGGCAATCCATGTGAAGGACCGGGTGCTGTTTGGAAATACCACGGGGACGGGAATGCCGACTGTGGAACCATTCCATGCGATGTGCATTAAGCACTATATGAAACATGGATTTCAATATTTTGGAATGATCACCGTGGTGACTGATGTGGTCCGGGAAAACAACCAGACCTATCGCCTTGGCTGGACGGAGCAGTGCAAGGATGGTTCCAAGATGGGAGTTGGGTGTCCGGAATATATTCTTCTCTTCCGGAAGCTGCCTTCAGATCGGTCAAACGCCTATGCGGATGATCCGGTGAAAAAGACTAAAGACGAGTATACACGGGCACAGTGGCAGATTGACGCGCATGGGTACTGGAGAAGCTCAGGTGATCGCCTTGTCAGTAAAGAGGAACTGAAAGAGATTTCTGTGGATAACCTGCAGGCAGTCTATCGGAAATACAGCCGCGGAACTGTCTATGATTATGATGAACATGTGAGACTGGCAGAAGAATTAGATCAGAACGGGAAACTTCCAGCCACTTTCATGGTTGTGGCGCCGGGATCCTGGAATCAGATGGAAGTTTGGGACGATATCAACCGTATGCGGACGTTGAACACAACGCAGAGCCGGAGAAGACAGCAGATGCACGTTTGCCCGTTGCAGCTTGATATTGTAGAACGTATCATCAATAGGTATTCCAATGAGGGGGATGTGGTACTGGATCCGTTTGGCGGGCTTATGACAGTACCGATGACCGCGGTGAAAATGAACCGGTATGGATATGGGATCGAACTGAATCCAGACTATTTCCGGGATGGAGTTGGATATTTAAAAGAGGCTGAGGATGAACGGGAAACGCCGACGTTGTTTGATTTTATGTAGAGTGAATGACAAATAAAAATCATGGGGTGAGTATACAGAAGCAAAGAAATAAGCGGTAGACCGTCCGCCAAGACTAATATCTACCGCTTCCACCTGAGGACATTATACCATAGCGGTCCTTGGGAGACAAGGAGGGAAAAGCTATGTATACACAGAAAGAGATATTACGGGACACTATATTGACCGGAATGAAACCATACCTTGATGCCGCAGTGCTGGATATCCTAAACCAGGTAGTCGTGCAGGCATTGTTTGATGTGGAGGTTACAGAGAGCCAAACGCTTCCGGCTACTAGGAATAATACCAATGAGCATATCCTGCAGTTATATATGACCAAAAAGGCTCCAAAGCTGAGCCCTAAAACCGTTAATTATTATCTACTGACTATCCGGCGATTTTCCGACTATGTGGATAAGTCTCTGCTGGACGTGACTGATATGGATGTGGAGTGTTACCTACAGGCCTATGCTCGCAAGGGAAATCAGGCCAGCACGGTCAATAATGAGAGACGTAACTTGTCCGCATTTTACACCTGGGCACGCAAAAGCCATCTGGTCAATGATAACCCAGTGGACGGAGTAGAGCCTTACAAAACGATAGACAAACCTATTGAATATATGGAGGACTGGGAGATGGAGGCTATGCGGGACGCCTGTAAGCTGGAAAAATCCAGTAAGGTTACTGAGATTAAGGAGTATAGGGAATGCTTGCGAGACCGGGCGCTGATTGAGTTTTTACGAAGCACGGCCGTTAGGGTATCAGAGTGTGCGTCTGTGGATATCCGGGATATCGATTGGCAGACTGGAGAGTTAGTGGTGTACGGTCAAAAGACCAGGAGCTATCGGACCGTCTGCTTGGACGATGCAGCCCGGTATCACGTCCGCAAGTACATAGATAGCAGGAGAGACAACGATCCGGCGCTATTTGCGTCACTTAAGGCCCCACATAATCGCCTGACTAAATCAGGGATTGAGAGTGCGATCAAGGCAGTGGCAATCCGGGCAGGCCTCAAACGCCGAGTGTACCCGCACCTATTTCGTAAGACCACAGCGACCAATATGACGCGGCGGGGGTGCCCTAGGGAGCTGATCGCACTGTATCTTGGACATAAAAACGGCAACACGCGGACGCTTAACGCTCATTACGCTGCTACTGACCAGGGACAAGTATTGGCTGCATTCCGACAGTATGGAGCTGCGGCTTAGGAAGGAGAAAATATGTATTAGAAGGAGGACAATCATGAAAAAGAAATCACCAGAACAGGAACTAGAAGGATTGCGCGCCTATATCCGTAAGGAAATCGAATGCTGGGAATATATCAACAAGAATGGGTGCAATGATCCTTTCTGGCCGGATGGAACCAATATGAATTTGACGCGGAACCATATTATATACGCAAAGAACCAAATATTAGAAATCTGCGAGGTGAACAAATTACCAATACCGGAAGAGATGTATCTACCAACCCCGCCGGAAGTAAATGATTACTACATGGCAAACCTGAAGCAGCGGGAACGGGTGAAACGGATTGACAATCCAGAAAAAATCACAACGAAGCGGACGAAATACGATCGAGAACAAATGTCATTATTTTAGGAGTGCGGAGTATTGCAACTAATAGCGACTAATTGCAAATGATTGCAACTAATTGCAACTAAATTTTGATTTAACGGAGGGACAAGGTTGATGTGAGAAAAAAACTAAAAGTGTGTTGGCTATCAGCGGGGGTGTCATCGTTTGTGGCAGGGTATTTGGCAAAAGAAGTGGATGAATATATCTACATTGACATTGATAATCAACACCCGGATAGCATGAGGTTTATTAAGGACTGTGAAAAAGCTTTGGAAAGACCGATAAAAATACTCAGATCTCCATATCGAAGTGTAGAGAATGTGATTAAACAATTTCGATATATTAACGGTGCCTACGGCGCGAAATGCACACAAATCCTGAAAAAGCGTGTGCGAAAAGAATGGGAAGATGCACATAAGGATTATGAGATTATATATGTCTGGGGATTTGATGTCGGAGAAAAGGGCCGTGCAGAACGCCTGAATGAATCTATGTCAGAGTTTGAGCATGAATACCCGTTGATAGAGAGAAATCTGACCAAACAAGACGCACATGCGATTCTGGACAGGCTGGGAATCAAGAGGCCGGTTATGTACGATCTTGGGTACCAGAACAATAATTGTATTGGATGCGTCAAGGGTGGAATGGGTTACTGGAACAAGATCAGGCAAGACTTCCCGGAAGTGTTTGACCGGATGGCGAAATTGGAAAGAGAAGTAGGGCACTCATGCATTAGAGGTGTCTTCCTGGATGAACTTTCCCCGGAACGAGGGCGAATAGAAGATGAGATTATGCAGGAGTGCGGAATTATGTGCTATTTAGCATTAAACTGATATTTAGGGCGTGACTGAAGGTGTGGTCACAGTAACTAGCGGTGTGGAGAGACCACTGCATAAGCGGGGTACGGGGCGATGTTGCAGAACAAACGAAGGGATGAATGACGATGAAAGAATTAACAGTAAAAGAATTAAGGGAAGCGCTGAAGGGCGTACCAGATGATCTGCCGGTACGCTTAAGTAGTGACACAGGGGTAGATCAGGGAGAGGGAGATATCATTATTGAGTCGGCGAACCGAGTAAAATGCGGTGATATGGAGTACTTTGATATCTATGCAAATGACATTGTTGACGATGAAGAAGACTGCGAGAGCCTTCCAGATGATGAGTTTGATAGGCTTATGGGACTTCTGGGAGAGCGGGGAGAATGACATGCAGGAATTAACATCAAAAGAAATAAATGAAATGAAGCACTGTATAGGGTTGGATTATAAAAAACCATACAAAAGACATGGAAAGGAATTTTACAAACCGTACAGAAACCGCTATGCAACATATGTCTACGATGAGGTTTGGAATGGATTAGTTGGAAAGGGGTTTTCAAAGCATGAAAGTGTAGATGAAAAACAAAGGACAGTGTTTTATTTGACCAGAAAAGGACTTGACGCTCTGGGAGACGTGATTGGTGTACATATCTATGATGAGGAGGACTGACATGCAGGAATTAGAGAAGATTCTGGAAGAGATAGATCAAAGGATTGAAAAGGCTGAGAAAATCATTGTTGATCATCCGAGTGACAAATTGGACGAAATAGCAAATGATACGGCTGAAGCGTTTATTGAGGCATATAAAGAGTGTCAGGATATCATCCACAAGCACCTATCTCGTGAAAACGACTCTGAAATCACGAGATTGTCACGAGATAATGACGGATGGATTCCGGTGGAGGAACGGTTGCCGGAAACGAATGAGGATGGACTGAGCGAAGATGTGCTGGTTAGTTTTTCTGATTCGCATAGTAGATGTATAGGATTTTATGATTTTAATAAAAAGCGCTGGTATGTACATGAATCATGCCCGTATATTGGCAGACTGATCGCATGGCGCCCTCTTCCAGAGCCATACCGCCCGGAAAGGAATCGGCCATGACGGTCCGTGATCTGATTGACCGCCTGCTGGTTGAGATTCAATCCGGCGAGCTGAAAGGTTCTGACCCGGTGCCGGCGGACCTCCTGGCGGATAGTGTGCTGGAGAGATACGGAGTGGCTGAGTACATAGCGGAGAAATACGGGATAGGGGAGGTTGGTAAGCATGGGGACTGATTGGAGCCCAGAGGGATATGCCTTATGTGCGGCCAAGGTGGTAGAGCAGGCGGCCCAGGACTACCGGAGGGCGCTGCGGCGGCTGTACCATCATCCAGAAGACGTAGCGGCGCAATCCGGGAAAGAAGAGTGCGAGAGGTTTTTCCGGCGGGATATAGGGATGTACTCCAATCTAGATGGGGAATCTATCATCCGTGAGATCCAAAAGCGGGTGAAAGGAGAGATGAAGCGATGAATAGAGCGATATTGAACAGGTACAAGCCAAACAAAAAAGAGTTGGCATTACTTGATCGGCAACTGAATCGACTGTACGGACGACTGGAGGAAGTCGAGGAAGTATCCGGGAAGGTATCAAAATCCTCGGATGACTGGCCGTACATAGAGGAGTATGTTACTGTCAGGATGGCAGAGCCGAAGGCCGCCGCGAAGATTAAGGACAAGATCAGGGTCAAGGAATCCAGGAGGCAGACGGTTATGTGTGAGGTTTTGGAGGTAGAGGGCTTCATCGAGAGCCTGCCGGAGGGGATTGAGAGGCAGATCCTAGAGATGGTCTACCTGGAAGGGATGAGCCAGGGAGAAGTTGGAGAGATATTGAACCTTGAAAGAAGCGGAATATCCAAAAAGATAAGTGGCTGCCTAAAAGATTCACAGCATTCACACTTTTAGTGTGCTATACTTATACTGAGAAAAGTGGATGACGTAGAGAACCCTCCTTTTTTCTCTCTAATGACATATAGACATACACGGTAAAAGACGCCTACCAGTTAGGGCGTCTTTTTCGTTGTATAAATCAGCCCAGAATGAAGGTGGTGACGTGCCAAGAGCAAGAAGTCCGAAACGGGATGAGGCTATGCGGATATGGCTTTCATCAAACGGACAAAAGAAACTGAAGGACATAGCTGCGGAGATGGGGGTGTCAGAAACCCAGATCCGCAAGTGGAAAAACCAAGATAAATGGAATGGTAACGTTACCAAACTGGCGAAAGGTAACGTTACCAAACGGAAACGCGGCGGGCAGCCAGGTAACAAGAACGCTGTCGGCCACGGCGCACCGCAGGGGAATCAGAACGCCACGAAGCACGGGCTATTCTCCCGGTATCTTCCAGAAGATACCAGGGAGCTTTTTTATTCCCTGGATTCCGCTGATCCGCTTGATCTTCTGTGGGATCAGATCAAGCTTGCTTATACGGCCATTATCCGGGCACAGCAGATCATGCACGTCCGGGACCGGAAGGACAAGACTATTGAGAAGGTCGAAGAAAAAGACGGTAATGTGATCGGGGAGCGATGGGAGGTACAGCAGGCTTGGGACAAGCAGGCTTCCTTCCTGCAGGCGCAGGCCAAGGCCCAGAGAGAGTTGACCAGCATGATCCGGCAGTACGAAGAGTTGCTGCATAAGAACTGGAAATTGGCGACAAAAGAGCAGAAAGCCAGAATCGCACAGTTAAAAGCCCAAACACAGAAAATCACGGGAGAGGGGCTTGAAGTAGAAGATATGTCGGAAATCGAGGCGGAGATCTATGAAGCAGATCAGGAAAAAGAAAACGCTTAATTATGTGTTTTCTGAAAAGCATAAAGATTATATCCGGGCCTGTCGTGATTGCGAATTCAATGTGGCGGAAGGAGCTGTCCGGGCTGGGAAGACGGTAGACAATGTTTTTGCATTTGCTCAAGAATTGAAAACGACACCTGACAGAATCCACCTTGCGACAGGATCCACAGTTGGAAATGCCAAGCTAAACATTGGTGTCTGTAATGGACTTGGTCTGGAGAATATATTCAGGGGACAATGCCACTGGGGGAAATTTAAGGATAATGAGGCTCTGTTTGTGAGAGGGCCTGATACCGGATGGCGACAGAGGATTATCATATTTGCGGGCGGGGCAAAAGAAGACAGCTATAAACGTATCCGAGGAAACTCTTATGGTATGTGGATCGCTACTGAAATAAGCCTCCATCATGATAAAACGATCAAGGAGGCCAATAATCGTCTGCTGGCGGCACGGCGATTAAAGATATTTTGGGACCTAAATCCTGATAACCCGAAACATCCAATATACACAGATTACATAGACAAGTACAGGGATATGGCCGCCGCGGGCAACTTCCCTGGGGGCTACAACTATATGCACTGTACGATCTATGATAATAAAACTATCACTGCTGAGCGCCTGAAGCGGATAGAGAGCAGATATGACAAAAACAGTATCTGGTATATGCGGGATATCAAGGGTATGCGTGTTGTGGCGAGTGGCCTTATCTACCGTAGATTTGCGGATGACGTTAGCACCCAGCAATATACATTCCGTCTGGCGGATAAGCCCAAAGATATCATGGAGATTATCCTTGGGATAGACTTTGGCGGCTCGGCTTCCGGCCATTCTTTTTCCGCGACCGCTATTACCAGAGGGTATAGATTTGTTGTTGCTTTAGCGACTGAGATGATCCCCTGTAAGGATAAAAGCGGTAATCAGATAGAGATTGATCCAGAACTGTTGGGTAATATGTTTTGTGATTTTGCGCAGCGGATATTAAGCATATACGGCTTTGTTACAACCGTATACGCGGATTCTGCGGAGCAGACGTTGATCGCCGGGATCCGAAGCAGTCTTAGAAAGCACGGATTGGGATGGATCAGGGTGGAAAATGCGCTCAAGACGGAGATCAATGACCGGATAAATGCACTTGCTATCTTAATGGCGCAGAGAAGATTTTTCTATGTGGAAAATCAATGCAACAGTCTGGAAACAGCGCTGTGCACGGCTGTATGGGATCCAGATGAGATTACGAAAAATGTCAGGTTGGATGATGGAACGAGCGATATAGACAGCCTGGACAGTTTTGAGTATACATTTGAGCGGCGGATCAGCCAGCTCATCAAATATGGATAGGTGAGGCAAATGAGATATACGAAAATGTATGAGGCGATCGCAAAGATCCTAAAAACTGAAGAGCAGATATCTTTTGTGATGTCAGATATCATGGCTAAGCGTATCGAGCTATGGTCTATGCTTTTCCAAGATAAGGAACCTTGGCTCAGTGATACGGTAAAAAGTTTGAATCTCCCCGCCTCTATTGCTGGAGAGATTGCAAGGCTTGTGACCCTTGAGATGGACACTAAAGTGGAGGGAAGCGCAAGAGCGGCGTATATAGATAAAGTCTACCAAAAGACAGTTAAAAAATTGAGGATCTATGTAGAATTTGCCTGTGCAAAAGGAGGTCTTATCCTAAAGCCATATGTGACAGACTCTGGGATGTGTACAGAATTTATACAGGCAGACAGTTTTTTTCCTCTTTCCTTTGATGACGATGACAATATAACACGCTGCGTCTTTCTGGACCAGTTTCGAGATGGGAATTCCATCTATACAAGGATTGAATATCATAAGCTTGATGGAGGATTTCTGACGATCAGGAATAGGGCTTTTGTATCTATGACAGATGGAGTTTTAGGATCGGAGATACCACTTCATATCGTACCCAAATGGGGCGGGCTTGAGAGCCAGATGATATTTGAAAATGTTCAGAAACTGCCGGTTGGATATTTTAAGGTCCCGATGGCGAATCAGGAAGACAATACCAGCCCCCTGGGGGTGTCCTGCTATAGTCGAGGGGTTGGTCTCATTAAAGAGGCCGATGAAAGATATTCTCAGATTAGCTGGGAATACAAAGGGAAAGAGGTCGCGGTACACATAGCGGAGAGCCTGCTATCCAGAAATCCCCAAACCCAAGAACTTGAGTATCCAGATGGAAAGAAACGGTTATATCGTGAGCTGGATTATAACTCTGGAGCTGTAGATAAACCGCTGTTAGATGTGTTTTCTCCAGATATTCGCGATCAGTCCCTGTTTAATGGACTGAACCAGCAGTTGAGGCGAGTAGAGTTTGCGTGTAATCTGGCTTATGGGACATTGTCAGATCCAAACAATACGGACAAAACAGCGGAAGAGATACGGGCCAGCAAGCAAAGATCCTATTCCTTCGTGGAATCCTGTCAGAGTGCGTTACAGGACGCTCTGGACGATTATATCACAGCCATTGATTTCTGGGCTACAATATATAACTTGGCGCCATTCGGGAACTATCACACATCCTACAAATGGGATGATTCTCTTGTGGTGGATACAGAGAAAGAACGCCAAACAGATCGAGCGGATGTTGCAATGGGGGCAATGCAGTTATGGGAATACCGAGCAAAGCACTATGATGAAGACGATGAGACTGCCAAACGAATGGTGGCGCAACCGGCCGATCTGATTGAAGAGTAGGTGATATCGAATGATGAGCCAAGGAGAGATCGAGGCGCTGACCATTACAATGGAGCAGGCTGCGGCAAAGTTGGAGCGAGACATCATGCTGGATATCGTCCGCAGGATCAAAGCCAATGCCGGAATGACATCTACGGCAGAGTATCAGATCAACCGCCTGCGGCAACTTGATCTATCAGATGCCTATATACAGGAGCAGATCCAGACATATCTCAAAGTGTCCAACGATGAAATGGAGCGGATTTTTAATAATGTCGTCGAAAATCAATATGAGAAGTTTAATGGCATCTATGAATCCACTGGAATATCTCAAAAACCCTTTGGAGATCACGTGGCAATTCAGACGGTGATTTCCTCTGCGCTTGCCCAGACGGATGAAACATTCAAGAATATCTCTCAGAGCATGGGTTTTTCTGTCAAGAAGGGCGGGAAAACGGAGTTTCTTCCGGTGGCTGAATATTACCAGCAGACATTGGACAATGCCATTCTTGGGATAGCAACAGGGGCTTTTGATTACAACTCTGCACTGAACAGAGTGGTTAAGCAGATGACTATGAGCGGCCTTAGGACTATTGATTATGCCTCTGGGAGATCCTATCGGATAGAATCCGCTTCCAGGACGGCCCTCATGACAGGATTTTCCCAGATTACTGGCTATATGAATGAGCAGGTAGCCGCAGAGTTGGGGACAGATAATTATGAGGTGTCCTACCACATGGGCGCCAGGCCAACCCATCAATGGTGGCAGGGAAGGGTGTACAGCTACCAAGAGCTGATCTATACTTGTGGACTTGGCACTGTCACAGGATTGTGCGGGATCAACTGCTATCACTGGTATGAGCCCTTCATCCGTGGCGTATCGGCCAGAAATTATACAGATAAAGAACTGGACGCAATGATCCGGGAAGAGAATCGCCCAAAATCTTACCAGGGAAAGGAATATACCACTTACACGGCCCTTCAGAAGCAGAGAAAGATGGAGCTCCTAATGCGAAAGCAACGCCAGGACATTATGCTGTTGAAAGAAGGAGGTGGAAGCTCCCTTGATATTCTGGCGGCACAGGCCAGATATAGGTACACGATGAATGACTATGTATCATTCTCCAAGGCCATGAACCTACCACAGCAGATGGAAAGGGTGTATATGGATGGGCTTGGAAAGGTAGGAAGGAGCGGAACGATTCCAAGGCCAAAACGCAGAGAGAAAAATACGGGGGCTTTTTCTGTACTTGAAGTTCCTATGCAAAAAAGAACCGTTGACCGTATTGCTAAGAAATATGGAATAACTATATCCGATTTGCAGATAAAAATACAAAGAGACGAGGAACACTTGCGGCGTCCGATGATGGGATGTGCTGATTATGATAATATCGGAAGGATTGACTTGTTTCCCAATTCTTTCAAAAATGAGGAAGCGCTGATCAGGACACTGATACATGAGCGATGTCATGTGCTGCAGCTTAGAAAATATGGCAAGAAATATACGCAGGAGCATATCAATAAGATGGAGGAAGTTGCATACAAGTTCGAAGATTTCTGGTATAATATAGTCAGAAAGAGGGTGCAGCCATGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP041667|571879:594244|589923_591294_+|WP_143928932.1|terminase|DBSCAN-SWA MKQIRKKKTLNYVFSEKHKDYIRACRDCEFNVAEGAVRAGKTVDNVFAFAQELKTTPDRIHLATGSTVGNAKLNIGVCNGLGLENIFRGQCHWGKFKDNEALFVRGPDTGWRQRIIIFAGGAKEDSYKRIRGNSYGMWIATEISLHHDKTIKEANNRLLAARRLKIFWDLNPDNPKHPIYTDYIDKYRDMAAAGNFPGGYNYMHCTIYDNKTITAERLKRIESRYDKNSIWYMRDIKGMRVVASGLIYRRFADDVSTQQYTFRLADKPKDIMEIILGIDFGGSASGHSFSATAITRGYRFVVALATEMIPCKDKSGNQIEIDPELLGNMFCDFAQRILSIYGFVTTVYADSAEQTLIAGIRSSLRKHGLGWIRVENALKTEINDRINALAILMAQRRFFYVENQCNSLETALCTAVWDPDEITKNVRLDDGTSDIDSLDSFEYTFERRISQLIKYG >NZ_CP041667|571879:594244|575191_575425_+|WP_143928910.1|DBSCAN-SWA MSLEDNLAKYIKGKAINLSDMSRQTGISYMSLYNSLFNENSERQLKARELVAVCIFLEVDPRDFVEKTDEKEVKEKR >NZ_CP041667|571879:594244|577111_577318_+|WP_143928915.1|DBSCAN-SWA MKNNLKFVEPMKLKPGTDWEKRREIRRKERELDRWWCRLYRASLALEMVIAVLVVLCIAHCTGLLQMM >NZ_CP041667|571879:594244|575593_575908_-|WP_143928912.1|DBSCAN-SWA MQLTKDADLLVCCAYKEYLDRRKRGVPKRQANTFHPDFRENTSKLSGWLHDDYLYTIGELKRANFVKTYLDGTFIITEHCIIYMENRFKNGLIEVTDFISKFIP >NZ_CP041667|571879:594244|588634_589069_+|WP_143928930.1|DBSCAN-SWA MNRAILNRYKPNKKELALLDRQLNRLYGRLEEVEEVSGKVSKSSDDWPYIEEYVTVRMAEPKAAAKIKDKIRVKESRRQTVMCEVLEVEGFIESLPEGIERQILEMVYLEGMSQGEVGEILNLERSGISKKISGCLKDSQHSHF >NZ_CP041667|571879:594244|576263_576818_-|WP_143928913.1|DBSCAN-SWA MLTNESKTILDYLISAFPNYESSMMNFIVISENTGLSLSAVDSALQYLESLGYVRIKRYKGGGFVQDVTHQGMHYKELESVTAPTAQTNIFNAPVTGSAIGNSGAVTINNEISVNEALSFIRSQDISAEDKAEAEKMVTYIEALIENEAPLKKGFLSKFSDILAKHHWLPELAMKLLFQYLTGQ >NZ_CP041667|571879:594244|582215_584732_+|WP_143928923.1|DBSCAN-SWA MDYLEFLKTKIEIAPESGFDVDPDVINPALKPHQRDAVAWALKGGRRALFESFGLGKTIQELEFCRLAAGHEKRRALIVLPLGVKQEFLRDAAQLLKYRETWPGYRDPEYCRTMDEVRKSKSQIVLTNYERVRDGDIDPSYFGATSLDEASVLRSFGSKTYQTFLDKFKGVPYKLVATATPSPNRYKELIHYAGYLEIMDTGQALTRFFQRDSTKANNLTLYPNMEDEFWLWVSSWALFITKPSDLDPAYSDDGYVLPPLDVRWHEIPIHYGDATEKDGQMTLFTDAAAGLKQAAEVKRNTIDIRVQKLKEIVEAYPEEHFVLWHDQEAERHAIKKALPEVVDIYGSQDYDIREQRVIDFSEGRTRLFATKKSLSGSGCNFQRYCHREIFVGIDYEFNDFIQAVHRCYRFLQDKPVIIDIIYMENERKIKEELIRKWKDHDHMIEKMIEIVKKYGLSSAGKEQGLKRKMGVKTVRVEGKHYMAVHDDCVEEARRMKGNSVGLIHTSIPFGNHYEYSANYNDFGHNQDTERFFEQMDFLTPELFRVLEPGRVAAIHVKDRVLFGNTTGTGMPTVEPFHAMCIKHYMKHGFQYFGMITVVTDVVRENNQTYRLGWTEQCKDGSKMGVGCPEYILLFRKLPSDRSNAYADDPVKKTKDEYTRAQWQIDAHGYWRSSGDRLVSKEELKEISVDNLQAVYRKYSRGTVYDYDEHVRLAEELDQNGKLPATFMVVAPGSWNQMEVWDDINRMRTLNTTQSRRRQQMHVCPLQLDIVERIINRYSNEGDVVLDPFGGLMTVPMTAVKMNRYGYGIELNPDYFRDGVGYLKEAEDERETPTLFDFM >NZ_CP041667|571879:594244|573216_574038_-|WP_143928909.1|DBSCAN-SWA MKKALIIAVLICLLSVGLLACEEEKIKVADVTGMNAQEASDLLKESGFTTVTLSADSSGESSIILDSSNWTVVTQTPEAGSEETAETEITLSCKKTEELESDQLNEVLNLTVPEAKAKLDDLGYTATYFHQDSGLEITEDLSVYSDSELGKWIVTGLQSHNIESKTVEVIVNTSENIASNEASSAIKKTLEETLSASSAWQATDEYGKSQYPYGFELHYIMGKLAEQPENETTWFLKAECTITNEYGAESDATCEAKVSGTSDSPEVTSFSVY >NZ_CP041667|571879:594244|575421_575604_+|WP_143928911.1|DBSCAN-SWA MKSEKFNSIHIDLEKGIYKINGEPADFITELKLEWNPEDGWALNISKDKTYLAPIQRIKE >NZ_CP041667|571879:594244|588392_588638_+|WP_143928929.1|DBSCAN-SWA MGTDWSPEGYALCAAKVVEQAAQDYRRALRRLYHHPEDVAAQSGKEECERFFRRDIGMYSNLDGESIIREIQKRVKGEMKR >NZ_CP041667|571879:594244|576013_576301_+|WP_143931176.1|DBSCAN-SWA MKCPKCGAESRIKEDDVFCCKCGYPLTAKANDGDDAELKSFFLDVSSGVMLINGKEINNVTAFSLNFNGGKYGLCVTHDDPYKAIAPLSIEKEVS >NZ_CP041667|571879:594244|591303_592671_+|WP_143928933.1|portal|DBSCAN-SWA MRYTKMYEAIAKILKTEEQISFVMSDIMAKRIELWSMLFQDKEPWLSDTVKSLNLPASIAGEIARLVTLEMDTKVEGSARAAYIDKVYQKTVKKLRIYVEFACAKGGLILKPYVTDSGMCTEFIQADSFFPLSFDDDDNITRCVFLDQFRDGNSIYTRIEYHKLDGGFLTIRNRAFVSMTDGVLGSEIPLHIVPKWGGLESQMIFENVQKLPVGYFKVPMANQEDNTSPLGVSCYSRGVGLIKEADERYSQISWEYKGKEVAVHIAESLLSRNPQTQELEYPDGKKRLYRELDYNSGAVDKPLLDVFSPDIRDQSLFNGLNQQLRRVEFACNLAYGTLSDPNNTDKTAEEIRASKQRSYSFVESCQSALQDALDDYITAIDFWATIYNLAPFGNYHTSYKWDDSLVVDTEKERQTDRADVAMGAMQLWEYRAKHYDEDDETAKRMVAQPADLIEE >NZ_CP041667|571879:594244|589217_589955_+|WP_143928931.1|terminase|DBSCAN-SWA MPRARSPKRDEAMRIWLSSNGQKKLKDIAAEMGVSETQIRKWKNQDKWNGNVTKLAKGNVTKRKRGGQPGNKNAVGHGAPQGNQNATKHGLFSRYLPEDTRELFYSLDSADPLDLLWDQIKLAYTAIIRAQQIMHVRDRKDKTIEKVEEKDGNVIGERWEVQQAWDKQASFLQAQAKAQRELTSMIRQYEELLHKNWKLATKEQKARIAQLKAQTQKITGEGLEVEDMSEIEAEIYEADQEKENA >NZ_CP041667|571879:594244|578726_579641_+|WP_143928919.1|DBSCAN-SWA MWRAAGSRLSYYQKYKKEKEDSMELRIISPQEGGFIKEIQWNKEELKAWVAEKVKDYRGLVFTEATIPDAKKDRADLNKLKKAFEDERKRIKKLCLEPYEEFERQEKELIAMIDEPLQLIDCQIKEVEEKRKEEKRLRIKDLFSTIGFQAFVSLDMIWDEKWLNATVSMPKIEEQMKSRMYQIGEDVFTIRNLPEFSFEAMEVYKKSLDLTKAIQEGQRLSDIQKRKAAYEEERRRKEEAAKQGSVHSAQEQEAVPAAQEAQSEAKEAEDDVVCLDFRVWGTRDQIMGLRQYMIGNKIRFGKVE >NZ_CP041667|571879:594244|577499_577715_+|WP_143928917.1|DBSCAN-SWA MRRVEEDKQKILDLLLPALRETRNLHDLDNLEYFPQNSLVYATFKNGYIKVVDVECDSGTAMIQDVIRGIV >NZ_CP041667|571879:594244|587777_588230_+|WP_143928928.1|DBSCAN-SWA MMRRTDMQELEKILEEIDQRIEKAEKIIVDHPSDKLDEIANDTAEAFIEAYKECQDIIHKHLSRENDSEITRLSRDNDGWIPVEERLPETNEDGLSEDVLVSFSDSHSRCIGFYDFNKKRWYVHESCPYIGRLIAWRPLPEPYRPERNRP >NZ_CP041667|571879:594244|585971_586328_+|WP_143928925.1|DBSCAN-SWA MKKKSPEQELEGLRAYIRKEIECWEYINKNGCNDPFWPDGTNMNLTRNHIIYAKNQILEICEVNKLPIPEEMYLPTPPEVNDYYMANLKQRERVKRIDNPEKITTKRTKYDREQMSLF >NZ_CP041667|571879:594244|592681_594244_+|WP_143928934.1|capsid|DBSCAN-SWA MMSQGEIEALTITMEQAAAKLERDIMLDIVRRIKANAGMTSTAEYQINRLRQLDLSDAYIQEQIQTYLKVSNDEMERIFNNVVENQYEKFNGIYESTGISQKPFGDHVAIQTVISSALAQTDETFKNISQSMGFSVKKGGKTEFLPVAEYYQQTLDNAILGIATGAFDYNSALNRVVKQMTMSGLRTIDYASGRSYRIESASRTALMTGFSQITGYMNEQVAAELGTDNYEVSYHMGARPTHQWWQGRVYSYQELIYTCGLGTVTGLCGINCYHWYEPFIRGVSARNYTDKELDAMIREENRPKSYQGKEYTTYTALQKQRKMELLMRKQRQDIMLLKEGGGSSLDILAAQARYRYTMNDYVSFSKAMNLPQQMERVYMDGLGKVGRSGTIPRPKRREKNTGAFSVLEVPMQKRTVDRIAKKYGITISDLQIKIQRDEEHLRRPMMGCADYDNIGRIDLFPNSFKNEEALIRTLIHERCHVLQLRKYGKKYTQEHINKMEEVAYKFEDFWYNIVRKRVQP >NZ_CP041667|571879:594244|581149_581992_+|WP_143928922.1|DBSCAN-SWA MARPKKSGLSYFPLDVDFFEDPKIKILRARYGRDGIVLYIYLLCRIYKQGYYIQVDDDFEYIISDDLKMDQNKAKQVLNFLLSRSLFDNTLFQSDKVLTSAGIQRRFQLAVKERAKKNPIEVGRFWLLKKEETEPFIKCTVFGGFSGKNESNSGKNESNSPEQSLKKSKVNNIYNTAFPPELEKAFQMYLLVRKSNYGEILPEQIQALREDLLQLSTDEKEQLAIVKKATAGGWKSFYKLKKEQAKDKQNGKKKGFDNFQGRDYDMADLERRLIKRGDMS >NZ_CP041667|571879:594244|574094_574679_-|WP_143931175.1|DBSCAN-SWA MLLKLCEYYECDMLKEFSQENSSGITYEELKIIKKYRSLSPYDQETVTTLLNRLHDREESRNIIEYTSTSNSLNRFIPYQHSASAGSGIYILGNEAADQLAIPIGPEYDDVDYAVRVNGDSMSPDFNDGDAALVSQRKEMNIGDVGIFVKNGEAFIKELGETELISRNPEYPNIPVMEDDNVVCLGKVIGKLQV >NZ_CP041667|571879:594244|580699_581140_+|WP_143928921.1|DBSCAN-SWA MRSVTFQVPGKPQGKARARTFYSPSSGRHMSHTPDRTVLYENLIKDQFLNHADGFYLEREIPVSLRIVARFLPPKSAAKKKQQQMLGGEILPLKKPDMDNIVKVVADALNGVAYHDDTQIVLVSAKKAYSAMEGLDITVEEYLGGR >NZ_CP041667|571879:594244|586420_587137_+|WP_143928926.1|DBSCAN-SWA MRKKLKVCWLSAGVSSFVAGYLAKEVDEYIYIDIDNQHPDSMRFIKDCEKALERPIKILRSPYRSVENVIKQFRYINGAYGAKCTQILKKRVRKEWEDAHKDYEIIYVWGFDVGEKGRAERLNESMSEFEHEYPLIERNLTKQDAHAILDRLGIKRPVMYDLGYQNNNCIGCVKGGMGYWNKIRQDFPEVFDRMAKLEREVGHSCIRGVFLDELSPERGRIEDEIMQECGIMCYLALN >NZ_CP041667|571879:594244|571879_572992_-|WP_143928908.1|integrase|DBSCAN-SWA MATAKYTRGSDGYFTARVWDGTYDDLGKKHRKPIRSKKSSRDLERKVQEFEQQVRERRQIRKMDVTFLDYAREWKRVYKHQAETNTQAMYENVIEKHLIILEGIQLQSIERIHLQMVLNNADGKTRTQQQIYMTFKQILKSAVTDHLYPANIMADIFQSIPPVKYTAPEKRPLTDNERRAVFKADFDDQDKIYVYLIYGCGLRREEALALTIFDIHIDTQEVSITKAHKFVRGTPITKIPKSENGYRTIPIPDLVFPAVKEYVQSLKSKNRTYLFTITSGQPVTKSSYDKMWARILRRMNEAADEPITGLTGHMFRHNYCTQLCYQIPKISIGRIAQLMGDTERMVLEVYNHIILERENAAEAVNDAINF >NZ_CP041667|571879:594244|579645_580590_+|WP_143928920.1|DBSCAN-SWA MAVQNSLAKSQQRMGLTAYLTQDAVKRQINSVVGGKNGTRFISSIVSAVQTTPALQECTNQSILSAALLGEALNLSPSPQLGQFYMVPFDNKKKGAKEAQFQLGYKGYVQLAERSGYYKKLNVLAIREGELIRYDPLNEEIEVELIEDDVVREETPAMGYYAMFEYENGFRKTLYWSKKKMLAHAEKYSQAFKRNGGAKSLELLEQGKIPEKEMWKYSSFWFKDFDGMAMKTMLRQLISKWGIMSIDLQTAVDKDMAVIHEDGTVDYVENESVEDDVVSDQDLNEVPDNHNNPQTEPIPQKAEKPDIESEFFNN >NZ_CP041667|571879:594244|584875_585937_+|WP_143928924.1|integrase|DBSCAN-SWA MYTQKEILRDTILTGMKPYLDAAVLDILNQVVVQALFDVEVTESQTLPATRNNTNEHILQLYMTKKAPKLSPKTVNYYLLTIRRFSDYVDKSLLDVTDMDVECYLQAYARKGNQASTVNNERRNLSAFYTWARKSHLVNDNPVDGVEPYKTIDKPIEYMEDWEMEAMRDACKLEKSSKVTEIKEYRECLRDRALIEFLRSTAVRVSECASVDIRDIDWQTGELVVYGQKTRSYRTVCLDDAARYHVRKYIDSRRDNDPALFASLKAPHNRLTKSGIESAIKAVAIRAGLKRRVYPHLFRKTTATNMTRRGCPRELIALYLGHKNGNTRTLNAHYAATDQGQVLAAFRQYGAAA >NZ_CP041667|571879:594244|587512_587794_+|WP_143928927.1|DBSCAN-SWA MQELTSKEINEMKHCIGLDYKKPYKRHGKEFYKPYRNRYATYVYDEVWNGLVGKGFSKHESVDEKQRTVFYLTRKGLDALGDVIGVHIYDEED >NZ_CP041667|571879:594244|576881_577112_+|WP_143928914.1|DBSCAN-SWA MKYPKPVMHKTELLDMGFTRTYLDRAIAAPGQTFAWPINPANKSSPVLFDTEGFERWRLKEVAMRERARRQQRGVM >NZ_CP041667|571879:594244|577330_577522_+|WP_143928916.1|DBSCAN-SWA MKRLTVSRIDEIIRTLESTEHIDDQTQYHKNMAISYLKNYADLLDGKGIKSVKVKEDEKSGRG >NZ_CP041667|571879:594244|578116_578773_+|WP_143928918.1|DBSCAN-SWA MEVYKIYDFPDESAWLKGRMNGIGGSDASAVIGMNPYKSNIDLFEEKIGRRIPEDISDKPCVVYGKLAEEPIRALFRLDYPEYQVEHHEHRILQSIKYPFMQASLDGELVDMDGRRGILEIKTTNILQSMQREKWKDRIPDNYYIQVLHYLLVTGYEFVILRAHLNTDWNSEKRTTVKHYFIERSEVREDLDMLLREEQKFWEYVESGRKPPLVLPEI |
29 | Clostridium_phage(30.77%) | integrase,capsid,terminase,portal | attL 578454:578467|attR 590491:590504 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
926735 : 936637
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP041667|926735:936637|DBSCAN-SWA TATGGGAAATACGTCTGAAGATAGAGTGGATAAAAATGGACAGAGCAGATCAGGACATCTCTGTATGGGTATCCTCGCCCATGTGGACTCGGGCAAGACCACCCTTTCCGAGGGGATTCTCTATCTGTGCGGGAAGATTCGGAAGCTGGGGAGAGTGGACCATAAAGACGCGTTTCTGGATACGGAAGAACTGGAGAAAGACAGAGGGATCACCATCTTCTCCAAACAGGCGGTGTTTGAGTTGGGAGATAAGCGGGTAACACTTCTGGATACGCCGGGACATGTGGATTTTGCCGCGGAGATGGAGCGAACTCTTCAGGTACTGGATTACGCCGTGCTGGTGATCAGCGGCGCGGACGGCATACAGGGACATACCCGCACGCTGTGGAAGCTGCTGGAACGGTATCAGATCCCGGTCTTTTTGTTTATCAATAAGATGGATCAGGCGGGAACTCAAAGAGAGGCGGTCCTGGAAGAAATACAGAATGGATTAAGCGAAAGCTGTATCGCTTTTGACAGGGCGCAGACGCCGGATTTCCTGGAAAGTCTGGCCATGTGTGAGGATCAGGCCCTGGAAGAATATCTGCGGACAGAAAGTCTGAAGAAAGAGACGGTCTCACGGATGATCCGGCACAGAAAGGTCTTTCCCTGCTATTTTGGATCGGCTCTGAAGCTGGAAGGGGTACAGGAGCTTTTAGACGGACTGGCCCAGTACGCGCAGAGCTTTTCTGAGAGAAAAGAGAAGACAGGAGACGGACGCTTTGGGGCCAGGGTATACAAAATTTCCAGAGACGGCCAGGGGAACCGTCTGACTCATATGAAGATTACAGGTGGCATCCTGAAAGTGAAGGATACACTGGGCGCGGAAGGAGAAAAGGTGAACCAGATCCGGATTTATTCCGGAGGCAAATATGAAATGGTTCAGGAAGCGCAGGCGGGAATGATCTGCGCTGTCACGGGGCTGAGTCAGACTTATCCTGGAGAAGGGCTTGGGAAAGAGCCGCCGGCCCATCCGCCGGTCCTTGAACCGGTGCTGAGTTACCAGATCTATCTGCCGGAAGGCTGCGAAGTATCGAAAGCCCTCTCTGACCTGCGGCAGCTGGAGGAAGAGGACCCGCTGTTTCGAATTGTGTGGATGGAAGAATTGGGAGAACTGCACGCTCAGGTTATGGGAGAAGTGCAGATTGAGATCCTGAAAAGGCTGATCCAGGATCGATTTGGACTGGAAGTGGAGTTTGGAGCCGGAAGCATTGTCTATAAAGAGACAATTCAGAATCGGGTGGAAGGAGTGGGACATTTTGAACCTCTCCGGCATTACGCGGAAGTCCATCTTATCCTGGAGCCGGGCGAACCGGGAAGCGGCATGGTCTATGTTTCCGAATGCGGTGAGGATGTGCTGGACCGTTCCTGGCAGAGACTGATTTTGACTCACTTAGAGGAAAAAAGGCATCTGGGGGTGCTGACCGGAGCGGAATTGACAGACGTGAAAATTACGCTGAAGGCCGGGAGAGCCCATTTAAAACATACAGAAGGCGGGGATTTCCGGCAGGCGACTTACCGAGCGGTCCGCCAGGGACTGATGCAGGCACAGAGTGTACTTTTGGAACCTTATTATGAGTTCCGGCTGGAAGTCCCCTCGGAGAATCTTGGGAGAGCCATGACGGATATCCAGAGGATGAGCGGAGAGTTCCTGCCGCCTAAGACGGAAGGTGAAATGGCGGTCCTGACAGGAACGGTGCCGGCGGCTGGTCTGAATGGATACCAGATCCAGGTAGCAGGCTATACCGGGGGGAGAGGCCATTTATCCTATGTGTTCCGGGGATATGGACGCTGTCAGAATGAAGAAGCGGTTGTGGAGGCTGCTGGTTATGATCCTTTAAGGGATATGGACAACCCTTCAGATTCTATTTTCTGCAGCCATGGAGCAGGATTCAGCGTGCCATGGGATCAAGTGCCGGAATATATGCATATAGAGAGCGTTTTAAAGAAAGAACAAAAGAAAGAAGAAGGACGGAACATTTCTAAGGAAAAGAAAAACGGAGGCAGAAGTTCTGCCGGATACGCCCAGAGCAAAGAAGCGGAAAAGGAGCTGGAAGAGATCTTTATCCGCACCTACGGGAAGATCGAGCGGAAAAAAGCGCCGGAGACCAGCCGGGTAACTGCCAGAGGAGAGGCGAAACGCCGCAAAAAGGAAGATAGATTGAGGGAATATCTGCTGGTGGACGGATATAATGTGATCTTTGCCTGGGAAGAGCTGAAAGATCTGGCCAGAGATAATATTGACGGGGCCAGAAACCGGCTGATGGATATTCTCTGCAACTATCAGGGTTATAAAAAATGCACAGTGATTCTGGTTTTCGATGCTTACAGGGTAGAGGGAGACGTGCTGGAAATCCAGAAATATCACAACATTCATGTGGTTTATACCAAGGAGGCGGAGACGGCGGATCAGTATATTGAGAAAGTGGTCCATGAGATCGGCCGGAAATATCACGTGACAGTAGTGACTTCAGACGGTGTTGAGCAGGTGGTGACACTGGGGCAGGGGGGAACTCTGATCTCCGCCAGAGAATTCCGGGAGGAAGTAGAGATCGTCCGCCAAGAGATCCGCAGGGAATGGGAGGCAAGAAGAGAGAAGAAAAAGCAGTATTTATTTGACTCCATGGAAGATGAATTGGCGGAGCAGATGGAGAAAGTGCGTCTGGGAAAACAAAAACTGGAATAAAAGAAGGAAGAGAAAGAGGTATATTTTGCCGGAATGGACGACAGACTAATAGGAAAAGCGGAAAGGATGGCAATAGACATGGAAAAACAAACGCTGGCACTATTAAAGGAATGTAGTTCTGGATGCAAAATGGCTGTTGACAGTATGGATCAGATCAAGGAATATGTTCTGGATTCAAACCTGTCTCAAGTGATCGACGAGGCAAAAAAACGGCATCTGCAATTGGAAAAGGAAACCGATGAACTGCTCAGAGAGCAGGGGGAAAGAGGCCGGGAACCAGAGAAGCTGGCGCAGGCGTTCTCCTGGATCACAACGGAGATGAAACTTCTGATCAAAGATGATAACACTCAGATCGCTAAAATTTTGATGGACGGGTGCGGCATGGGGATCAAGAGCTTGAGCGAACAGATCAATAAATACCCGGAAGCATCTAGGGAGAGCAGAACTGTGGCTGGCAAGATCGTAAAATGCGAGGAAAAACTGATGGGAGAATTAAAGAAGTTTCTGTAAGAAACGGGGCAGAGATGAACTGCGGCCGCATGGGGAATGGAAGAGGACCAAGGAGGAAACAGATATGATAAGTTTTGAAAATGTAAAGAAAAACTTTGGCTTTGGATGTATGCGTCTTCCCATGAAGGGAGATGAGGTAGACACAGCAGAATTTTCTAAAATGGTGGATATATTTCTGGAAGCCGGATTCAATTATTTTGACACAGCAAGAGGATATCTGGAGGGTAAGAGCGAGACCGCTCTGCGGGAATGTCTCACCAGCAGATATCCCCGCGGCCGCTATATCCTGACGGATAAACTGACGGAAACTTTTTTCAAAACAGAAGCGGATATCCGGCCGTTTTTTGAAAGCCAGCTGGAAGCCTGCGGCGTAGACTATTTTGATTTCTATTTGATGCACTCTCAAAATGCGGAATATTTTAAAAAATTCAAAAAATGCCGCGCCTATGAGACGGCGTTTGCTCTGAAAGAAGAGGGAAAGATCCGTCATGTGGGGATCTCATTTCATGACAAAGCGGCGGTGCTGGAACAGATCCTGGAAGAGTATCCGCAGATCGAAGTTGTCCAGATCCAATTTAATTATGTAGACTATGAGGACGCGGCGGTAGAAGGACGGAAGTGTTATGAGGTCTGTGTGCGCCATAAGAAGCCGGTTATCGTCATGGAGCCGGTAAAAGGAGGCAATCTGGTGAATCTGCCAAAAGAGGCGAAGGAAGTTCTGGATGAGTTAAAAGGCGGTTCTCCAGCCAGCTATGCGATCCGCTTTGCCGCTGGGTTTGAAGGGAACGAGATGGTAATCTCCGGAATGAGCAATATGGAACAGATGGAGGATAATTTGAGTTATATGAAAGACTTCCATCCCCTTGACCAACGGGAAATGGCTGCGGTAGAAAAGGTGGCGAAGATCTTCCGGTCTATGGATCTGATCTCTTGTACCGCCTGCCGTTACTGTGTTGCCGGATGTCCCCAAAATATCCTTATTCCGGAACTGTTCGCGGCGATGAACGCAAAGCACACATACCAGGACTGGAACAGCGGCTGGTATTATATGATACATACGCTGAATAAAGGGAAGGCTTCTGACTGCATCAAATGCGGAAAATGCGAGAAGGCATGTCCGCAGCATTTGAAGATCCGTGAACTGCTGGAGGCAGTGGCAAAGGAATTTGAGAAAAAGGAAGCCTGACGGCGTCAGCCGCGCAAAAGCCCTGAAAGATTCTGTCGTTACCACGCCAAGAAAATCAAATGTCTGCTGCGCAGCCGCTTGATTTTCTTTTGAGCGTGGTATTTTAATGTTTTTTTCATGTTTTTGTACTATAATTGAGAAAATGGACGGAAATAGGAGGGATTTCTATGCGCCTTTTGGTGGTGGAAGATGAGATCTACCTTTTGGATGTTTTAAAGAAAAGACTGACGAAGGAGCACTATAGTGTGGATGCCTGTGAGGACGGCCTGGAAGCGTGGGATTTTATCAAGCTGACTCCTTATGATGGGATTATTCTGGATATTATGCTTCCGGGAATGGATGGCATCGAGATATTGAAGCGGATGCGAAAGGAAGGAAACCATACGCCGGTGCTTCTTCTGACGGCCAGGGACAGTATTGAGGATCGGGTAACGGGATTGGATATCGGAGCGGATGATTATCTGGTCAAGCCTTTTGCTTTTGAAGAACTTCTGGCCAGAATCCGGGTCATGCTGCGCCGGAAGAATACGGTGCAGCCCCAGGAGGAGGTCTATACTTTGGCGGATCTGTCAGTAGACTGCAAGAGCCATGAGGTGGCCAGGGCGGGAAAGCAGATTGAACTTTCCGCAAAAGAATTTGCGCTGCTGGAATATTTGATCCGCAATCAGGGAGTCGTGCTTTCCAGAGAACAGATTGAAGAACATATCTGGAATTACGATTATATGGGGAGTTCTAATATGGTGGACGTTTATATCCGGTATCTGCGCAAGAAAATTGACGACGGACATGAGAAAAAGCTGATCCAGACGGTGCGGGGCGCGGGGTATGTCCTGCGGGAAACATCATAAAGGAGAGGGAGTATGAAAAAACTTTCGATCAAAATGAAGGTGACCATATGGTATACGGTTTTTGTGGCTATTGTGGCGCTGTTTGCCCTTGGAGTCGTCGCGCTTTACGCGGGCCGGATGATCTGGTCAGAGCAGGAAAAAGAACTGCGGGAAGAGGTGGCGGAGTTCGCCGAGGAGCTGGAAGTGTCTGGAAGCGGCTATGTGACAGAAGAAGGACGGTTCTACGATGACGATATTGTGTTCAGTCTGTACAATGAAAGCGGAGCGCTGCTGGACGGAAACGTCCCGGCCTCTTTTCCAGAGAACACCACGTTAAAAAATGGTGTGGTCCAGACCATTTCGTCAGGAGAGCAGGAGTGGCTGACCTATGATATTGCTGTAGACTATGGCGGGGATCATATGGTGTGGGTGCGGGGAATCACATATATGGGCCTGGCAATGACTATGTCGGGAGGACTTCTCATTGTTTCCTGTATCTTGATCCCGGTTCTGGTAGTCTTGGCCGCTGCCGGCGGATTCATCATTACAAAGCGGGCTTTTAAGCCGGTAGAAGATATCCAAAGAACAGCGGCTGAAATCGCGGGAAGCAAAGACCTGGCCCGTCGGATGCCCACAGAAGAGGCCAGCGGGGAGATCCGGGAACTGGCGGAAACTTTTAACGGTATGCTGGGGACGCTGGAATCGACCTTGGAGGACGAGAAGCGGTTTACGGCGGACGTATCCCATGAACTTCGGACGCCGGTATCGGTGATCATGGCTCAGGGAGAATACGCCATGCTGGAAGATTCCACGGAAGAAGAGCGGAAAGAGGCGCTGGAGATCATTGTAGGACAGGCCAAGAAAATGTCTGCCATGATCGCCCAGCTTCTGGAAATGGCAAGAAGAGAAAAGAGCGCCGGACCGGCGGGAAGAGAGAAGGTAGACCTGGGAGAGATCGTGAATCTTGCGGCGGAAGAACTAAAAGATCTGTCCGGAGAGAAGCGGATCACTATCACAAGCGACAGCGAGCAGGAATTGTATGTCTGGGGAGAGCAGACGGCATTTACCCGGATCTTCATGAACCTGGTAACAAACGCGGTCCAGTACGGGAAAGAAGGGGGCCATGTGTGGCTTAAAGCGTGGAAAGAAGGAGAAGAAGTCCTGTGCAGCGTGAGAGATGACGGGATCGGCATCGGGCCGGAGGAACTGCCCCACATTTTCCGCAGGTTTTACCGGGAGGATAAGTCGCGGACCGGCCGGGCGGAGGCGCACGCGGGATTGGGATTGTCTATGGTAAAGCATCTGACTGAGAATTTCGGTGGAACTATCCAGGTATACAGCCAGAAGGAGAAAGGCACTACCTTTGTACTGCATTTTCCCGCTTTACGAAATCCAGACAATCCGTTACAATAGATAATAAAGTTTTAGAGATACAAAAAGGAAGGTATGAATTATGGCAAATCCAATTGTTACAATTGAAATGGAAAATGGAGATATCATGAAGGCAGAGCTTTATCCGGATATTGCTCCAAATACAGTGAATAATTTTATCTCATTGGTGAAGAAAGGGTATTATGACGGTTTGATCTTCCATCGGGTGATCAATGGATTTATGATCCAGGGAGGCTGCCCGGAAGGAAGAGGAACCGGCGGGCCGGGCTATCACATCAAGGGAGAGTTCTCTCAAAATGGGGTGGAGAATCCGCTGGCTCACACAGAGGGGGTCCTTTCTATGGCAAGGGCTATGCATCCGAATTCAGCAGGGTCCCAGTTCTTTATCATGCATAAGACGTCACCCCATCTGGACGGAGCTTACGCGGCTTTCGGAAAGATCATTGAGGGCATGGATGTGGTCAATAAGATCGCGGAAACTCAGACCGACTATCAGGATCGTCCGTTAACTGAGCAGAAGATGAAGAAGGTTACGGTAGACACCATGGGTGTGGACTATCCGGAGCCAGAGAAAGAATAGAGGAACCAGCAAACAGATGGAACGATTTTATCAAAATACGGCAGTCAGGGTCATCCTGGCATTGGTGTGCTGCGCCTTATGGGGCAGCGCATTTCCGTGTGTAAAAATCGGATATGAAATGTTTCAGGTGACGGGAGCGGGCAGTCAGATTTTATTTGCGGGCTGCCGCTTTTTCCTGGCGGGAGTTCTTACATTCTTAATGGCCTGTCTGCTGGAGCGGCGGGTCGTGAGGATCAAACGATCTTCCGTGCCTTATGTATTTGGACAGGGAGTCCTTCAGACCACGATCCAGTATGTCTGCTTTTATATTGGGCTTTCAAATACCACTGGATCCAAGGGATCGGTGATCAATGCCTCGAACGCCTTTTTTTCTATTATTATGGCGCATTTTCTGCTGAAATCAGAGAAGATCACTTGGAGGAAGGCCCTTGGCTGTGTGGTCGGTTTCGCGGGAGTGATCGTGATCAATCTGGCGCCGGGCGCTTGGGGAAGCGGCTTCCATCTGGCGGGAGAGGGGCTGATCCTGCTCTGCTCCTTTGCCTACGGGACCAGTTCTGTTACCTTGAAAATGATTTCTGATAAGGAAAGTCCAGCCGCGATCACGGCGTATCAGCTATTGTTTGGAGGCGCGCTTTTGATCCTGATCGGAGTGATGACGGGAGGAAGAGTAGGGCATTTTACACTTAAGTCCGCGGCGCTCCTATTTTATCTGGCCTTATTATCCACGATCGCCTTTACCTTGTGGGCAGAACTTTTGAAATATAATCCGGTAGGCCGGGTGGCGGTATTCGGCTTTAGTATTCCAGTATTTGGAGTAGCGCTGTCAGCGCTGCTGCTGGGGGAAGATATTTTCCGGTTTCAAAATCTGGCGGCCCTTGTATTGGTAAGTCTGGGGATTATTGCGGTGAATCTGCCGCCTTGCAAAAAGACCGGGGATAACGTACAATAGGAACAACAACCACTGGGAGGCGTTGAGATTGATGGATAGACAGACGGATATGGTGCAGGCAGAACAACTGATATTCGAGTATGAGAAACGGGATGAAGAAGGCAACGTGATCGGCGCTTCCCGTGCCATTGACAAGGTGGATCTGGATGTGAAAGAAGGACAGTTTATTGCCATCTTAGGCCACAATGGTTCTGGAAAATCTACGCTGGCAAAGCATATAAACGCCATTCTTGTGCCAACAGAGGGAACAATCTGGGTAGATGGAAAGGATACGAAAGATCCGGAGGAATTGTGGAATGTGCGCCAAAGCGCCGGTATGGTCTTTCAAAATCCAGATAACCAGATCATTGGTACTGTAGTGGAAGAGGATGTGGGGTTTGGACCGGAAAATCTGGGAGTCCCTACGGATGAGATCTGGCAGAGAGTGGAGGAAAGTCTAAAAGCGGTGGGTATGCTTTCTTACCGCCATCATTCTCCTAACAAATTGTCAGGAGGGCAGAAACAGCGGGTGGCCATTGCCGGGGTTGTAGCTATGGAACCAAAGTGTATTGTTTTGGACGAGCCTACCGCAATGCTGGATCCTATGGGACGCAAAGAAGTCCTTAAAACCGTACAGAAACTGCGGGAGCAAAAACAGGTAACAGTGATTCTGATCACTCACTATATGGAAGAAGTGGTGGATGCGGATAGAATTTATGTAATGGATCACGGGCATGTTGTCATGGAGGGAACGCCAAGAGAGATATTCTCAAGAGTAGAGGAACTAAAAAACTACCGGCTGGACGTGCCTCAGGTGACGATCCTGGCGGATGAACTTAAGAAACGGGGACTGGATCTCCCGGCAGGGATCTTGAAAAAGGAAGAATTGGTGGAAGCATTATGTCAATTAAAATAGAGCATTTGAATTATATATACAGTCCTGGGACGGCGTATGAAAGACAGGCGCTCAAAGACATCTGCCTGGAGCTTCCCCATGGAGAATTTGTTGGGATCATCGGCCATACGGGTTCCGGGAAATCTACATTGATCCAGCATTTGAACGGGCTGATCAAGGCCACCAGCGGGAAGATCTATTATAATGGAGAGGACATTTACCAGGAAGGGTACGATATGAAGGCTTTGAGAAGCCAGGTGGGCCTGGTCTTCCAGTATCCAGAGCATCAGTTGTTTGAAGTGGACGTGATGACGGATGTGTGCTTTGGCCCTAAGAATCTGGGACTTGGCCCGGAAGAATGTAAGGAACGGGCGCTGGAGGCCCTGCGGCTGGTGGGCCTGAAGGAGAAATATTATAAATCTTCGCCTTTTGAATTATCGGGAGGCCAGAAACGGCGGGCCGCCATAGCGGGAGTCCTGGCTATGCATCCTAAGGTGCTGGTCCTGGATGAACCGACAGCCGGACTGGACCCTAAGGGAAGAGACGATATCCTGGATCAGATCGCATATCTCCATCAGGAGACGGATATGACGGTGATCCTGGTTTCCCACAGCATGGAAGATATCGCCAAATACGCCGACCGTATCGTGGTGATGAACCAGGGAGAGGTTCTTTATAATGATACGCCCCGGAAAGTATTTGCCCACTATCAGGAACTGGAACAGGTGGGGCTTGCGGCGCCCCAGGTAACTTATATCATGCATGATCTAAAGGAAAAAGGATTTCCGGTCAGCGTTCATGTAACTACGGTACAGGAAGCGGCAGATGAGATCATGAAAGCACTGGAGAGGGAAGCATGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP041667|926735:936637|935782_936637_+|WP_143929191.1|DBSCAN-SWA MSIKIEHLNYIYSPGTAYERQALKDICLELPHGEFVGIIGHTGSGKSTLIQHLNGLIKATSGKIYYNGEDIYQEGYDMKALRSQVGLVFQYPEHQLFEVDVMTDVCFGPKNLGLGPEECKERALEALRLVGLKEKYYKSSPFELSGGQKRRAAIAGVLAMHPKVLVLDEPTAGLDPKGRDDILDQIAYLHQETDMTVILVSHSMEDIAKYADRIVVMNQGEVLYNDTPRKVFAHYQELEQVGLAAPQVTYIMHDLKEKGFPVSVHVTTVQEAADEIMKALEREA >NZ_CP041667|926735:936637|931322_932003_+|WP_143929187.1|DBSCAN-SWA MRLLVVEDEIYLLDVLKKRLTKEHYSVDACEDGLEAWDFIKLTPYDGIILDIMLPGMDGIEILKRMRKEGNHTPVLLLTARDSIEDRVTGLDIGADDYLVKPFAFEELLARIRVMLRRKNTVQPQEEVYTLADLSVDCKSHEVARAGKQIELSAKEFALLEYLIRNQGVVLSREQIEEHIWNYDYMGSSNMVDVYIRYLRKKIDDGHEKKLIQTVRGAGYVLRETS >NZ_CP041667|926735:936637|933435_933954_+|WP_143929189.1|DBSCAN-SWA MANPIVTIEMENGDIMKAELYPDIAPNTVNNFISLVKKGYYDGLIFHRVINGFMIQGGCPEGRGTGGPGYHIKGEFSQNGVENPLAHTEGVLSMARAMHPNSAGSQFFIMHKTSPHLDGAYAAFGKIIEGMDVVNKIAETQTDYQDRPLTEQKMKKVTVDTMGVDYPEPEKE >NZ_CP041667|926735:936637|933970_934903_+|WP_143929190.1|DBSCAN-SWA MERFYQNTAVRVILALVCCALWGSAFPCVKIGYEMFQVTGAGSQILFAGCRFFLAGVLTFLMACLLERRVVRIKRSSVPYVFGQGVLQTTIQYVCFYIGLSNTTGSKGSVINASNAFFSIIMAHFLLKSEKITWRKALGCVVGFAGVIVINLAPGAWGSGFHLAGEGLILLCSFAYGTSSVTLKMISDKESPAAITAYQLLFGGALLILIGVMTGGRVGHFTLKSAALLFYLALLSTIAFTLWAELLKYNPVGRVAVFGFSIPVFGVALSALLLGEDIFRFQNLAALVLVSLGIIAVNLPPCKKTGDNVQ >NZ_CP041667|926735:936637|934934_935798_+|WP_143931189.1|DBSCAN-SWA MDRQTDMVQAEQLIFEYEKRDEEGNVIGASRAIDKVDLDVKEGQFIAILGHNGSGKSTLAKHINAILVPTEGTIWVDGKDTKDPEELWNVRQSAGMVFQNPDNQIIGTVVEEDVGFGPENLGVPTDEIWQRVEESLKAVGMLSYRHHSPNKLSGGQKQRVAIAGVVAMEPKCIVLDEPTAMLDPMGRKEVLKTVQKLREQKQVTVILITHYMEEVVDADRIYVMDHGHVVMEGTPREIFSRVEELKNYRLDVPQVTILADELKKRGLDLPAGILKKEELVEALCQLK >NZ_CP041667|926735:936637|926735_929459_+|WP_143929184.1|DBSCAN-SWA MGNTSEDRVDKNGQSRSGHLCMGILAHVDSGKTTLSEGILYLCGKIRKLGRVDHKDAFLDTEELEKDRGITIFSKQAVFELGDKRVTLLDTPGHVDFAAEMERTLQVLDYAVLVISGADGIQGHTRTLWKLLERYQIPVFLFINKMDQAGTQREAVLEEIQNGLSESCIAFDRAQTPDFLESLAMCEDQALEEYLRTESLKKETVSRMIRHRKVFPCYFGSALKLEGVQELLDGLAQYAQSFSERKEKTGDGRFGARVYKISRDGQGNRLTHMKITGGILKVKDTLGAEGEKVNQIRIYSGGKYEMVQEAQAGMICAVTGLSQTYPGEGLGKEPPAHPPVLEPVLSYQIYLPEGCEVSKALSDLRQLEEEDPLFRIVWMEELGELHAQVMGEVQIEILKRLIQDRFGLEVEFGAGSIVYKETIQNRVEGVGHFEPLRHYAEVHLILEPGEPGSGMVYVSECGEDVLDRSWQRLILTHLEEKRHLGVLTGAELTDVKITLKAGRAHLKHTEGGDFRQATYRAVRQGLMQAQSVLLEPYYEFRLEVPSENLGRAMTDIQRMSGEFLPPKTEGEMAVLTGTVPAAGLNGYQIQVAGYTGGRGHLSYVFRGYGRCQNEEAVVEAAGYDPLRDMDNPSDSIFCSHGAGFSVPWDQVPEYMHIESVLKKEQKKEEGRNISKEKKNGGRSSAGYAQSKEAEKELEEIFIRTYGKIERKKAPETSRVTARGEAKRRKKEDRLREYLLVDGYNVIFAWEELKDLARDNIDGARNRLMDILCNYQGYKKCTVILVFDAYRVEGDVLEIQKYHNIHVVYTKEAETADQYIEKVVHEIGRKYHVTVVTSDGVEQVVTLGQGGTLISAREFREEVEIVRQEIRREWEARREKKKQYLFDSMEDELAEQMEKVRLGKQKLE >NZ_CP041667|926735:936637|932015_933395_+|WP_143929188.1|DBSCAN-SWA MKKLSIKMKVTIWYTVFVAIVALFALGVVALYAGRMIWSEQEKELREEVAEFAEELEVSGSGYVTEEGRFYDDDIVFSLYNESGALLDGNVPASFPENTTLKNGVVQTISSGEQEWLTYDIAVDYGGDHMVWVRGITYMGLAMTMSGGLLIVSCILIPVLVVLAAAGGFIITKRAFKPVEDIQRTAAEIAGSKDLARRMPTEEASGEIRELAETFNGMLGTLESTLEDEKRFTADVSHELRTPVSVIMAQGEYAMLEDSTEEERKEALEIIVGQAKKMSAMIAQLLEMARREKSAGPAGREKVDLGEIVNLAAEELKDLSGEKRITITSDSEQELYVWGEQTAFTRIFMNLVTNAVQYGKEGGHVWLKAWKEGEEVLCSVRDDGIGIGPEELPHIFRRFYREDKSRTGRAEAHAGLGLSMVKHLTENFGGTIQVYSQKEKGTTFVLHFPALRNPDNPLQ >NZ_CP041667|926735:936637|929525_929969_+|WP_143929185.1|DBSCAN-SWA MAIDMEKQTLALLKECSSGCKMAVDSMDQIKEYVLDSNLSQVIDEAKKRHLQLEKETDELLREQGERGREPEKLAQAFSWITTEMKLLIKDDNTQIAKILMDGCGMGIKSLSEQINKYPEASRESRTVAGKIVKCEEKLMGELKKFL >NZ_CP041667|926735:936637|930033_931155_+|WP_143929186.1|DBSCAN-SWA MISFENVKKNFGFGCMRLPMKGDEVDTAEFSKMVDIFLEAGFNYFDTARGYLEGKSETALRECLTSRYPRGRYILTDKLTETFFKTEADIRPFFESQLEACGVDYFDFYLMHSQNAEYFKKFKKCRAYETAFALKEEGKIRHVGISFHDKAAVLEQILEEYPQIEVVQIQFNYVDYEDAAVEGRKCYEVCVRHKKPVIVMEPVKGGNLVNLPKEAKEVLDELKGGSPASYAIRFAAGFEGNEMVISGMSNMEQMEDNLSYMKDFHPLDQREMAAVEKVAKIFRSMDLISCTACRYCVAGCPQNILIPELFAAMNAKHTYQDWNSGWYYMIHTLNKGKASDCIKCGKCEKACPQHLKIRELLEAVAKEFEKKEA |
9 | Streptococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1453613 : 1464153
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP041667|1453613:1464153|DBSCAN-SWA CATGAAGAAAAGCAGAGGCATCATAAGCCTTTTGGTGACGGCGGTGCTCATTGTGCTGCTGGGTTACACTGCCATTTGGGGGTTTGGAGAAGGCGGAACCGGTTCGGCGAAGAACATCAAGCTGGGTCTGGATCTGGCGGGAGGCGTGAGTATCACCTACCAGGTCAAAGATGACAATCCATCCAGCGAAGACATGGCGGATACGATCTACAAGCTTCAAAGACGTGTGGAAGAGTACAGTACAGAGGCGGTAGTATACCAGGAGGGAGACGACCGGATCAATATCGAGATCCCCGGAGTATCTGACGCCAATCAGATTCTGGAGGAACTTGGTCAGCCGGGTTCTCTTTATTTCATTGCCCAGACTGGAAGCGATGGATCAGAAAACTATTCGATGATCAACAATACTGGTGATCCGGCCCAGGACTACCAGCTGAATAAAACTATTGAAGAACTGGAAGCAGACGGATCCATCGTCGTTACCGGTGATGAAGTTCAGGACGCGCGGGCCGGCGTGGTGGAAAATCAGACGACACAGCGGGATGAAAACGTTGTATCTTTGACCTTTACGGATGAAGGAACGGAAAAATTTGCCGCGGCCACGGAAAAAGCCTATGAAAATGGGGAGAGCATCGCGATCTACTACGATGGGACCTTTGTGAGCGTACCCAACGTGAACAACGCTATCGAAAATGGAGAGGCTCAGATTACGGGAAATATGACTTATGACGAAGCGGACACGCTGGCTTCCACTATCCGGATCGGAGGATTGCAGCTTGAGCTGGAAGAACTGCGGTCAAATGTAGTCGGCGCCCAGCTTGGAGAGGAAGCCATCAGTACCAGCCTGATGGCCGGAGTGATCGGGCTTTTGATCATTTTCATTTTTATGATCTTCGTTTATTATCTGCCGGGGTTGGCTTCCAGCCTTGCGCTGATCATCTATACAGAGATCGTTCTGCTGATCTTAAATGCTTTTGATGTGACGCTGACTCTGCAGGGAATCGCGGGTATCATTCTGAGTATTGGAATGGCGGTAGATGCCAACGTGATCATTTTTGCCAGAGTAAAAGAAGAGATGTCCAAAGGAAAAAGCGTGCGCAATTCCCTGAAAGCGGGATTTGATAAAGCTCTTTCCGCGATCGTAGACGGAAACGTGACGACCCTTATTGCGGCGGCAGTCCTGTGGTTCCTTGGTTCTGGAACGGTAAAGGGATTCGCTCAGACGCTGGCGATCGGTATTGTGGTTTCGATGTTTACGGCGCTTGTGATCACCCGGATGATCGTGTTCGCTTTCTATGAAGTGGGAATCCGCAATCCTAAGGTTTATTACCGCCCTAAGAAAGAGCGTGAACCTATCAATTTCCTGGGTAAAAGGAAATGGTTCTTTTCCGCTTCCATTGGAGTGATCCTGCTTGGATTCGTGATCATGGGAGTCAATGGGGGCAGAGGCGCGGGAGCTTTCTCCTACAGCCTGGATTTCCAGGGAGGAACTTCTACCAATGTTACCTTTAATGAAGATTATACCATCAATGAAATTGATCAGGAGATCGTGCCGGTGGTAGAAGAGGTGACCGGAGACGCTAATGTACAGACCCAGAAAGTAGATGGCACCAACCAGGTGATCATCAAGACGGTCACGCTTGATCTGGATCAGAGAGAGGCTCTTAACCAGGCTTTGGTGGATAACTTTGGCGTGGATGAGAGCAAGATCACAGCAGAGAATATCAGCTCTACGGTAAGTAATGAAATGCGTCAGGATGCGGTGATTGCGGTGATCGTGGCAAGTATCTTCATGCTGCTGTATATTTGGTTCCGGTTTAAAGATATCCGGTTCGCTACCAGCGCGGTTGCGGCTCTGCTCCATGACGTGCTGGTGGTGCTGGGATTCTACGCGGTTTCCAGGATCGCGGTGGGCAGTACCTTTATCGCGTGTATGCTGACCCTTGTGGGTTATTCTATTAACGCCACCATTGTGATCTTCGACCGGATCAGGGAAGAACTGCATTACCGGACCAAGACTACGGACCTTGCGTATATTGTCAATAAGAGTATCACCCAGACGCTGACCAGAAGTATCTACACTTCTCTTACTACCTTTATTTCGATCGCTGTGCTCTATGTCCTGGGCGTCAGCTCGATCAAAGAATTCGCCCTGCCGCTGATGGTGGGAATCGTAGTCGGCGCATACTCATCTGTCTGCATTACGGGAGCTTTGTGGTATGTGATGCGGACACGGGCGGATAAGAGGAAACAGAAATAATCACAGGCTAAGAAGAAAGCAGCCACAGCCATTTTCCGGCCATGGCTGTTTTCTGATGCAAGAATGGAAAAACAGTGTAGGAGCAGGGAAAAGTCTGATGGAGGAAGAAAATGGAACGATGGGTTCTGCTTAGAAAAGGCGGGGATTTCCAGGCCGCGAGTGAAAAATTTCAGATCAGTCCCAGACTGGCCTGTCTGATCCGGAACCGGGAAGTACAGGGAGAGGATCAGATAGAACAGTATCTGAACGGATCTCTCCTAGATCTGCATGATGGGATGCTGATGCGGGATATGGACCGGGCGGTAGATATCCTGCGGGAGAAGATAGAAGAAGGAAAGAAGATCCGGATCATAGGAGATTATGATATTGATGGGGTGAACGCCGCGTATATCCTTCTGGAAGGCCTGGAAAGAATGGGAGCGGACGTGGACAGCGACATTCCGGACCGGATCAGAGACGGCTACGGATTAAATATAGAACTGGTGGAGCGGGCTTATCAGGAAGGGGTGGATACGGTCTTGACCTGTGATAACGGCATCGCTGCCGCGGAAGAGATCGCTTATGGAAAACGGCTTGGCATGACAATGCTGGTAACCGACCATCATGAGGTCCCTTATGAGGAAATAGACGGGGAGAAGCAGTACCGTCTTCCTCCGGGAGACGCGGTCATTGACCCGAAACGGGAAGACTGCGGATATCCCTTTCAGGGACTTTGCGGCGCGGCAGTGGCCTATAAGCTGATGGAAGCCTTATACGAGGCTTCCGGCAGGGATTCGGAAGACCTGGACGACCTGATCGTCAATGTGGCCATCGCTACCGTGGGAGATGTGATGGATCTGACAGATGAGAACCGGATCTTTGTCAAAGAAGGACTAGCCATGCTTGGCAGGACTCAGAATCCGGGATTGAATGCTCTGATCGAATGTACAGGACTGGATAAGAAGCATATCAGCGCCTATCATATTGGATTTGTGCTGGGTCCCTGCCTGAACGCAGGAGGCCGGCTGGATACGGCCAAACGGTCTTTGGCGCTTCTTAGAGCCAAAACCCGCAAAGAGGCCGCGATCTTGGCGGGAGACCTGAAGGCTCTTAATGACAGCCGGAAAGAGATGACGGAAAAGGCGGTTGAGGAAGCCTGCCGTCAGATTGAAGATACGGACCTTTTGAAAGATAAAGTGCTGGTGGTCTATCTGCCGGACTGCCATGAGAGCCTGGCGGGAATCGTGGCGGGAAGACTGCGGGAGAGGTATTGGCGTCCGGTGTTTGTGCTGACAGACGGGGAAGAAGGAGTAAAAGGATCGGGACGTTCCATCGAAGCGTATTCCATGTATGAAGAGCTGAACCGCTGCGGGGATCTTTTGGAACGGTACGGAGGCCACAGACTGGCGGCGGGCCTGTCCCTGAAAAAGGAAAACGCAGGTCGGCTCCGCCAGGCGCTGAATAGTAATTGTACTCTGGAAGAGGCGGATCTGATGGAAAAAGTTGTTATCGATATGGAACTTCCCCTTTCCTGTGTGACGGAGAAGTTTATTGAAGAATTGAAAATTCTGGAACCCTTTGGGAAAGGGAATACAAAACCAGTGTTTGCCATAAGAGAGGCGGCGCTTTCCAAGCTGAGGATACTGGGGAAAAACAGGAATGTGCTGAAAATGCAGGCGGAAGATGGCGGGGGCGTAAAAATAGACGCGCTATATTTCGGCGGTATTGAGAAATTTAAAACATGCCTGGAGGAAAAATACGGGCATGGCTTTGCTGAGAGATTCCTGGATGGAAAAGAACCGGAAGCCAGGATGATGATCGCTTATTATCCGGGAATCAATGAATATATGGGGAGGAAAAGCCTGCAGATCGTGATCACCCATTGTCAGTAAGCGCTATAAATGTTATACTAAAAGGTATCATGTGGGTAAAATAGCTTTTTTGCCCCGGTATTTAGGATTATGGAGAGAAAATTGGAGAGGATACCATGAAATCAATCGAGGAATATGTAAGAAGTATCCCGGATTTCCCGGAGCCGGGGATTATTTTCCGGGATGTTACCAGTGTGCTCCAGGATGCGGACGGACTGCGCCTGGCCATCGACCTGATGCAGAAAAAGCTGGAAGGAGTAGATTTTGATTTGATCGTAGGGCCGGAGTCCAGAGGATTTATCTTTGGCGTGCCTATCGCCTATAATCTGCATAAACCGTTTATTCCGATCCGCAAGAAGGGCAAACTGCCGTGCGAGACCGTATCTATCCAGTATGATCTGGAGTATGGGACAGCAGAGCTGGAGATGCACAGAGACTCTGTGAAAGAAGGTCAGAAAGTAGTGATCATTGATGATCTTATCGCTACAGGCGGTACCAATGAGGCAATGATCAAACTGATCGAGGGACTGGGAGGAAAAGTAGTAAAGACCGTGTTCCTGATGGAACTTGCGGGGTTGGAAGGAAGAAAGAGACTGCAGGGATATGACGTGGACTCTGTGATCATTTATCCTGGGAAATAGTCTCCTGTCAATTCCGGGAAAGTGTTGTTGTGCCGGAAGCGGCAGGGAAAGTCATGAAAGAAGAAAGGAGGCATTAATATGTCAGGGAGAATCACGACTTATGAATCTCTGGATGACCGCATCGCGCCGGAATCCGTGATGAAAGACATGTCAGAAGGACTTGAGATCGTGGACGGACACGCGGTAAAAGCGCCAGGCGATTATGAAGATCCAGACCAGCTTTATGACATGTTAATCGCCCGTATTCGGAAATACCATCCGTCCACTGACGTCAGCATGATCAAAAAGGCGTATGAGACGGCAAAGAAGGCCCACGGGGATCAGTGCAGGAAATCGGGAGAGCCATATATCATCCATCCTTTATGGGTAGCGATCATCCTGGCTGATCTGGAGATGGATAAAGAGACCATTGCCGCGGGGATGCTCCACGATGTGGTGGAAGATACCCAGGTAAGTGAAGAGGAGATCAAAGAAGTATTCGGAGAAGAAGTGGCGCTTTTGGTAGATGGCGTTACAAAACTGGGCAGGCTGTCCTATTCCTCTGATAAGCTGGAAGTGCAGGCGGAGAATCTAAGGAAGATGTTCCTGGCCATGGCCAAAGATATCCGGGTCATCATCATCAAACTGGCAGACCGGCTCCATAATATGCGGACTTTGCAGTTTATGACGCCGGAGAAGCAGAAAGAAAAAGCGAAAGAGACAATGGATATCTATGCGCCTATCGCCCAGAGGCTTGGTATCTCTAAAATCAAAACGGAACTGGACGATCTGGCTTTGAAATACTCCCAGCCAGAGGTGTTCTTTGATCTGGTACGGCAGATCAACGCGCGGAAGACGGAGAGAGAGGAGTTTGTCGAACAGATCGTCAAAGAAGTATCCGATCATATGAAAAACGCCAATATTAAGGCGGAAGTCAACGGACGGGTCAAACACTTTTTCAGCATCTATCGGAAGATGGTGAATCAGGATAAAACGGTAGATCAGATCTATGATCTGTTTGCGGTGCGGATCATTGTGGATTCGGTAAAGGACTGCTACGCGGCGTTGGGCGTTATCCATGAGCTCTATACCCCGATCCCGGGGCGGTTCAAGGATTATATCGCTATGCCGAAACCCAATATGTACCAGTCGCTGCACACCACATTGATGAGCTCCGTGGGACAGCCTTTTGAAATCCAGATCCGGACGCAGGAGATGCATAAAACCGCGGAATATGGTATCGCGGCCCACTGGAAATATAAGGAATCCAATGACGGCAAGAAGAGCGTGGAGGCTCAGGAAGAAGAAAAACTAAGCTGGCTCAGGCAGATTCTGGAGTGGCAGAGAGACATGTCGGACAACCGGGAATTTCTTAACCTGATCAAAGGAGATTTAGATCTTTTCGCAGAAGACGTGTATTGCTTTACCCCGCAGGGCGATGTGAAGAACCTGCCCAACGGCTCTACCCCGATCGACTTTGCCTATGCCATCCACAGCGCGGTGGGAAATAAAATGGTGGGAGCCAGAGTCAATGGGAAACTGGTCAATATCGACTACAAGATCCAGAACGGAGACCGCATTGAGATCCTGACGTCCCAGAATTCCAGAGGACCAAGCAGGGACTGGCTTGGGATCGTGAAAAGTACCCAGGCAAAAAACAAGATCAACCAGTGGTTTAAAAAGGAATTTAAAGAAAGTAATATTGTAAAAGGAAAAGAACTGATCGGCGCTTATTGCAAAGCCAAAGGAATCAACGCCTCGGATATCCTGCAGACGAGATATCAGGAGATCGTCCAGAAGAAATATGGTTTTCGGGACTGGGATTCTGTCCTTGCCGCGGTCGGCCACGGCGGATTGAAGGAAGGCCAGGTGGTCAACCGTCTGGTGGAAGAGTACGGCAAGGAACATAAACAGGAGATCACCGATGAGGTAGTATTGGAGCGAGTAGCGGAAGCTTCCAAGCACAAGGTCCACATTGCCAAGTCCAAGAGCGGTATCGTGGTCAAGGGCATCGATGATATGGCGGTACGCTTCTCACGGTGCTGCAACCCGGTACCGGGAGATGAGATCGTAGGTTTTGTCACCAGAGGCCGGGGACTGTCTATCCATCGGACGGATTGTGTGAACATGCTGCACCTGACAGAGGCGGAAAGGGCCAGGCTGATCGATGCGGAGTGGGAGAGCGAAGTGGCGGAAGAATCAGGAGGCCAGTACCTGGCGGAGATCAAAATGTACGCCAATGACCGGCAGGGGCTTCTGATGGAAATGTCCCGTATCTTTACAGAGGCGGATGTAGACGTGAAGTCTATGAATGTACGCACCAGTAAGCAGGGGACGGCCACGATCGAGACCGGTTTCATCGTCCATAACAGAGAAGAACTGGGCCGGGTAGTGAAAAAACTCCGGCAGCTGGAAGGCGTGATCGATATTACCCGGACAACGGGATAAAGGAGAGATTATGGGAAAGACAATGAAGATCGAGAAATTTGTGACAGGAATCATCAGCACCAACTGTTACCTGGTGAGCAATGTAGAAACCCATCAAGCTGTGATCGTAGATCCGGCCGCGGTCCCAAAGGCGCTGACAGAAGCGGTGGAAAGAGACGGCCTTACCGTGGAGGCAGTGCTCCTTACCCATGGGCATTTCGACCATACGATGGGGCTGGATGCGCTGTTAAAGCTGTGGGATGTTCCGGTATATGTGGAGGAAGAAGACCAGGAGATCCTGACAGATCCGAAATTGAATCTGTCTTCTGCATATACGGCGGGCTTTACTTTCTCAGACGCTCAGAGTGTAGAAGACGGACAGATCCTGTCTTTGGCCGGCTTTCAGTTTCAGGTGCTCCATACTCCGGGTCACACCAGAGGCGGATGCTGCTATTACGCGGCGTCCGAACAGGTGTTGTTCAGCGGAGACACCCTGTTCCAGGCTTCTGTGGGAAGGACGGACTTCCCAAACAGCAGTACGCTGGATCTGCTCCGTTCCATCCGGGAGAAATTGCTGCCGCTTCCCGATGAGACGGTGGTATATCCAGGGCATATGGGAGAGACTACCATTGGCTATGAGCGGGATCATAATCCATATCTGTAGGGGGAGCCTATGTTTAAGATCCTTTGTCGGAAAGAGACTTATGTTTATAACGCGTATCATATAGGGAAGGCTTTTTATCCATCGGAGACGGTCGAAGCTTCGGTAGAGGAAAAAGCCTCTCATTATGTTATCCTTTTTCTGCCGTCGGGGAAAAGGCTGGAACTGGATCAGGAAGAACAGGAGCGGTTCTTGGATCGAAAAACCCAGAAACACCTGATGGACCAGCGGCTTTACAGAGTATTATCGGCGGAGACGGGAAGGACGCTGGCCTGGGGCATATTGACGGGAGTGCGGCCTACAAAGCTGGCGATGGGAAAACTGGAGGAAGGCATGGAACGCCAGGATTTTCTGTCCTGGTTCACAGGGGAATACCTGGTGAGCAGAAAGAAGGCGGAAGTATCCTGGGATATTGTCCGGCGGGAAAAGGAACTGCTGGACCGGCTGGATTATCAGGACGGGTACAGTCTGTATGTGGGGATTCCATTTTGTCCCACCGTTTGTACTTACTGCTCTTTTAGTTCCGGTTCTCTGGATCAGTGGGGAAACTTTGTGGAACCTTATCTGGAAGCGCTCCGCAGGGAATTGGAATTTATTGGGAGCGCTTCAGATGGAAAGAAGCTGAATACCATTTATTTTGGGGGAGGCACGCCCACCAGCCTGAATGAAGATCAGTTGGAGCGGCTGCTGTCCTGGATCGATGAAATCTTTCCCAGGGACCATCTGCTGGAATATACGGTAGAGGCGGGACGGCCGGACAGCATTACAAAAGAGAAACTCAGAGTGATACGGAGCCATGGAGTGACCCGGATTTCGATCAATCCTCAGAGCATGCAGCAAAAGACTCTGGATCGGATCGGAAGGCGCCATCAGGTGGAGGAGATCCTTTCTGCATATCATATGGCGCGGGAAGAAGGATTTGACAATATCAACATGGACGTGATCGCCGGACTGCCTGGAGAGAGGCTGGCGGATATGGAGGATACTCTGGCGAAGCTGGAAGCGCTTGGCCCGGACAGCCTGACGGTCCATTCTCTCGCGGTGAAGCGGGCGGCCAAGATGGGGCAGGACGGTTACGTTCCCGGCAGGGAAAGCCAGGACGCGGGACCGGCGGAGATCTCAGCTATGCTGGAAGCGGCAGAAAAAAGCGCCGGCAGGATGGGGATGACGCCCTATTACCTCTACCGGCAGAAGAATATTGCCGGGAATTTTGAGAACACGGGCTATGCAAAGGTTGACAAAGCGGGAATATACAATATACTTATTATGGAGGAAAAGCAATCCATCATTGCCGCCGGAGCAGGGGCTTCTACGAAACTGGTATTCAAGGAGCCTGTAGTAAATCCGGAGGGAAAGAAGCAAAAGAAGACGAACCTGATCCGCCTGGAGAATGTGAAGGCCATTGATGCCTATATCCGGCGGGTGAGGGAGATGATAGAACGAAAAGGAGAATGGTTATGGCGCTGAAAAAGAAGCCGGTCACAGGGATGAAGGATATTATGCCGGAAGAAATGGAAATCCGGGATTACGTGATCGGACTGATCAAAGATACTTACAAAACTTACGGGTTTCAATCTATGGAGACTCCCTGTGTGGAGCATATCGAGAATCTGTGCAGCAAGCAGGGGGGAGACAATGAAAAGCTGATCTTTAAGATCATGAAGCGGGGAGAAAAGCTGAAGATCCAGGAAGCCAAAGAGGAGAACGATCTGGCGGACTCTGGCCTCCGGTATGATCTGACGGTTCCTCTGGCCCGATACTACGCGGGCCATGCCAATGAGCTGCCGGCTCCTTTTAAAGCGATGCAGATCGGCAGTGTGTGGCGGGCAGACCGTCCCCAGAGGGGAAGGTTCCGCCAGTTTACCCAGTGTGATATCGACATTTTGGGAGAACCTGGGGAACTGGCAGAGATTGAACTGATCCTGGCCACTACCGCTATGCTGGGCAAACTGGACTTCAAGAATTTTACCGTGTGTATCAACGACCGGGGGATCTTGAGAGCCATGGCGGCTTACAGCGGCTTCCAGGAGGAAGATTATGATGAGGTCTTTGTATGTCTGGACAAGATGGACAAGATTGGCAAAGACGGCGTGGCGGCAGAAATGCAGGAATTGGGATATACGGCGGAACAGGTAGATACTTATCTGGGATTGTTCGACCAGGTGGCGGAAGATGTAAACGGCGTGAAGAGCCTCAAGGAGATCCTGGGAGACTGCCTGCCGGATGAGGTGGCGAATAGCCTGGAGCGCATTATGTCCTGTGTGGAGGCGGCCAAGGAGTGTGAGTTCCGGCTGAAATTTACGCCTACCCTGGTGCGGGGCCAGTCCTATTATACGGGGACGATCTTTGAAGTGGTGATGGATGATTTTGGGGGTTCTGTAGCAGGAGGCGGGCGCTACGATAAGATGATCGGAAAGTTTACCGGACAGGATACGCCGGCCTGCGGTTTCTCTATCGGATTTGAGCGGATTGTCATGCTTTTGCTGGAACAAGGCTACCAGGTTCCAAAGAAACGGCCTAAGAAAGCCTATCTGCTGGATAAAAACCTGCCGTCAGAGGGCCTGCTGAAGGTGCTGGCGAAGGCCAAAGAAGAGCGGGAGCAGGGATATCAGGTGCTGATCGCCAAGATGAAGAAGAATAAGAAATTCCAGAAGGAGCAGCTGCTGGAGGAAGGCTATGAGGAGATCACAGACTGTTACAGCGATTCTGTGGACAGAATATAG
Protein sequences of DBSCAN-SWA_4 >NZ_CP041667|1453613:1464153|1455982_1457743_+|WP_143929596.1|DBSCAN-SWA MERWVLLRKGGDFQAASEKFQISPRLACLIRNREVQGEDQIEQYLNGSLLDLHDGMLMRDMDRAVDILREKIEEGKKIRIIGDYDIDGVNAAYILLEGLERMGADVDSDIPDRIRDGYGLNIELVERAYQEGVDTVLTCDNGIAAAEEIAYGKRLGMTMLVTDHHEVPYEEIDGEKQYRLPPGDAVIDPKREDCGYPFQGLCGAAVAYKLMEALYEASGRDSEDLDDLIVNVAIATVGDVMDLTDENRIFVKEGLAMLGRTQNPGLNALIECTGLDKKHISAYHIGFVLGPCLNAGGRLDTAKRSLALLRAKTRKEAAILAGDLKALNDSRKEMTEKAVEEACRQIEDTDLLKDKVLVVYLPDCHESLAGIVAGRLRERYWRPVFVLTDGEEGVKGSGRSIEAYSMYEELNRCGDLLERYGGHRLAAGLSLKKENAGRLRQALNSNCTLEEADLMEKVVIDMELPLSCVTEKFIEELKILEPFGKGNTKPVFAIREAALSKLRILGKNRNVLKMQAEDGGGVKIDALYFGGIEKFKTCLEEKYGHGFAERFLDGKEPEARMMIAYYPGINEYMGRKSLQIVITHCQ >NZ_CP041667|1453613:1464153|1462881_1464153_+|WP_143929600.1|tRNA|DBSCAN-SWA MVMALKKKPVTGMKDIMPEEMEIRDYVIGLIKDTYKTYGFQSMETPCVEHIENLCSKQGGDNEKLIFKIMKRGEKLKIQEAKEENDLADSGLRYDLTVPLARYYAGHANELPAPFKAMQIGSVWRADRPQRGRFRQFTQCDIDILGEPGELAEIELILATTAMLGKLDFKNFTVCINDRGILRAMAAYSGFQEEDYDEVFVCLDKMDKIGKDGVAAEMQELGYTAEQVDTYLGLFDQVAEDVNGVKSLKEILGDCLPDEVANSLERIMSCVEAAKECEFRLKFTPTLVRGQSYYTGTIFEVVMDDFGGSVAGGGRYDKMIGKFTGQDTPACGFSIGFERIVMLLLEQGYQVPKKRPKKAYLLDKNLPSEGLLKVLAKAKEEREQGYQVLIAKMKKNKKFQKEQLLEEGYEEITDCYSDSVDRI >NZ_CP041667|1453613:1464153|1461442_1462897_+|WP_143929599.1|DBSCAN-SWA MFKILCRKETYVYNAYHIGKAFYPSETVEASVEEKASHYVILFLPSGKRLELDQEEQERFLDRKTQKHLMDQRLYRVLSAETGRTLAWGILTGVRPTKLAMGKLEEGMERQDFLSWFTGEYLVSRKKAEVSWDIVRREKELLDRLDYQDGYSLYVGIPFCPTVCTYCSFSSGSLDQWGNFVEPYLEALRRELEFIGSASDGKKLNTIYFGGGTPTSLNEDQLERLLSWIDEIFPRDHLLEYTVEAGRPDSITKEKLRVIRSHGVTRISINPQSMQQKTLDRIGRRHQVEEILSAYHMAREEGFDNINMDVIAGLPGERLADMEDTLAKLEALGPDSLTVHSLAVKRAAKMGQDGYVPGRESQDAGPAEISAMLEAAEKSAGRMGMTPYYLYRQKNIAGNFENTGYAKVDKAGIYNILIMEEKQSIIAAGAGASTKLVFKEPVVNPEGKKQKKTNLIRLENVKAIDAYIRRVREMIERKGEWLWR >NZ_CP041667|1453613:1464153|1460812_1461433_+|WP_143931223.1|DBSCAN-SWA MKIEKFVTGIISTNCYLVSNVETHQAVIVDPAAVPKALTEAVERDGLTVEAVLLTHGHFDHTMGLDALLKLWDVPVYVEEEDQEILTDPKLNLSSAYTAGFTFSDAQSVEDGQILSLAGFQFQVLHTPGHTRGGCCYYAASEQVLFSGDTLFQASVGRTDFPNSSTLDLLRSIREKLLPLPDETVVYPGHMGETTIGYERDHNPYL >NZ_CP041667|1453613:1464153|1457838_1458363_+|WP_143929597.1|DBSCAN-SWA MKSIEEYVRSIPDFPEPGIIFRDVTSVLQDADGLRLAIDLMQKKLEGVDFDLIVGPESRGFIFGVPIAYNLHKPFIPIRKKGKLPCETVSIQYDLEYGTAELEMHRDSVKEGQKVVIIDDLIATGGTNEAMIKLIEGLGGKVVKTVFLMELAGLEGRKRLQGYDVDSVIIYPGK >NZ_CP041667|1453613:1464153|1453613_1455872_+|WP_143929595.1|DBSCAN-SWA MKKSRGIISLLVTAVLIVLLGYTAIWGFGEGGTGSAKNIKLGLDLAGGVSITYQVKDDNPSSEDMADTIYKLQRRVEEYSTEAVVYQEGDDRINIEIPGVSDANQILEELGQPGSLYFIAQTGSDGSENYSMINNTGDPAQDYQLNKTIEELEADGSIVVTGDEVQDARAGVVENQTTQRDENVVSLTFTDEGTEKFAAATEKAYENGESIAIYYDGTFVSVPNVNNAIENGEAQITGNMTYDEADTLASTIRIGGLQLELEELRSNVVGAQLGEEAISTSLMAGVIGLLIIFIFMIFVYYLPGLASSLALIIYTEIVLLILNAFDVTLTLQGIAGIILSIGMAVDANVIIFARVKEEMSKGKSVRNSLKAGFDKALSAIVDGNVTTLIAAAVLWFLGSGTVKGFAQTLAIGIVVSMFTALVITRMIVFAFYEVGIRNPKVYYRPKKEREPINFLGKRKWFFSASIGVILLGFVIMGVNGGRGAGAFSYSLDFQGGTSTNVTFNEDYTINEIDQEIVPVVEEVTGDANVQTQKVDGTNQVIIKTVTLDLDQREALNQALVDNFGVDESKITAENISSTVSNEMRQDAVIAVIVASIFMLLYIWFRFKDIRFATSAVAALLHDVLVVLGFYAVSRIAVGSTFIACMLTLVGYSINATIVIFDRIREELHYRTKTTDLAYIVNKSITQTLTRSIYTSLTTFISIAVLYVLGVSSIKEFALPLMVGIVVGAYSSVCITGALWYVMRTRADKRKQK >NZ_CP041667|1453613:1464153|1458441_1460790_+|WP_143929598.1|DBSCAN-SWA MSGRITTYESLDDRIAPESVMKDMSEGLEIVDGHAVKAPGDYEDPDQLYDMLIARIRKYHPSTDVSMIKKAYETAKKAHGDQCRKSGEPYIIHPLWVAIILADLEMDKETIAAGMLHDVVEDTQVSEEEIKEVFGEEVALLVDGVTKLGRLSYSSDKLEVQAENLRKMFLAMAKDIRVIIIKLADRLHNMRTLQFMTPEKQKEKAKETMDIYAPIAQRLGISKIKTELDDLALKYSQPEVFFDLVRQINARKTEREEFVEQIVKEVSDHMKNANIKAEVNGRVKHFFSIYRKMVNQDKTVDQIYDLFAVRIIVDSVKDCYAALGVIHELYTPIPGRFKDYIAMPKPNMYQSLHTTLMSSVGQPFEIQIRTQEMHKTAEYGIAAHWKYKESNDGKKSVEAQEEEKLSWLRQILEWQRDMSDNREFLNLIKGDLDLFAEDVYCFTPQGDVKNLPNGSTPIDFAYAIHSAVGNKMVGARVNGKLVNIDYKIQNGDRIEILTSQNSRGPSRDWLGIVKSTQAKNKINQWFKKEFKESNIVKGKELIGAYCKAKGINASDILQTRYQEIVQKKYGFRDWDSVLAAVGHGGLKEGQVVNRLVEEYGKEHKQEITDEVVLERVAEASKHKVHIAKSKSGIVVKGIDDMAVRFSRCCNPVPGDEIVGFVTRGRGLSIHRTDCVNMLHLTEAERARLIDAEWESEVAEESGGQYLAEIKMYANDRQGLLMEMSRIFTEADVDVKSMNVRTSKQGTATIETGFIVHNREELGRVVKKLRQLEGVIDITRTTG |
7 | uncultured_Mediterranean_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1615698 : 1632890
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP041667|1615698:1632890|DBSCAN-SWA TCTAACCTACTGCCTTTTCAATTTCCGCTGCCGTCTGTGACGGGAACACGGATAATATGGCATTGGCAGATGAAACCTTGCTGCTGTAGTCTAAATGCGTATAGATATTAGATGTTGTAGAAATGTCACTGTGTCCGAGCCATTCCTGTATCTGCTTCAAGCTCACGCCATTTGCATATAGAAGACTGGCACAGCTATGCCTCAAATCGTGAAAGCGGATTTTCCGCAGATTATGTTTTTTCAGCAACAAGGGAAAATGTTGTGTCAGATAACCGGGTTTTATTCTTTCCCCGATTTCATTAACATAGATAAAATCCAGATAATCCGTGCAATAACAGTCACCACACAATCGCCGGTTCTGTTTCTGTTCCCGATACATCTTGATAAGTAAATCTTCAAACGGCTTAACAAGCGGCAGCGTGCGGCGGCTGGTCTTTGACTTTGTCCGCTCTTTCTCAATAAGCACATTGCCACGCCCGTCATTCAGATTGACTTCTGTTACAGTGTATTTGATTGTGATAGTTTTACGCTCAAAATCAATCGCACTCCATTTCAGACCTATTACTTCACTCCGGCGCAGACCATAGAAAGCAGCGAGGATAATCGCCAGTTCCATAGGGTCGCCTTTAGAAACTTTGAACAAAGTGTCAAGTTCTTCTTTATTATAGTAGGAAGCTACAAATTTGTTCTTCTTTGGTCTTTCCACACGGTCTGCCGGGTTTGATTTGATGAGTCCAATCTGAAAAGCATATTGCAGAGCCTTTCTGATATTGGCATGGCGGTGAATAACAGTATTAGCAGACAGCCCCTTATCCAATTCTGACTGATAGTAGTCTTGAATGTATTTAGGGTTTTCCTCAATCTCTGTCAAAGTATATCGTTTCTCTAAAAAGTAGGGAACAATACGCTTTTTCACAGAGCATGTGTAAGAAGCCATTGTTGTTTCTTCTACACACTGTTTCATCATTTCCAGCCATTCCAGCATAAAATCCGTAAACAAGATAGATACCTCGTTTGTCGCCGGATTGATACTGTCTGCCGGATTGTCCTCGATCCAGCCAAGCGTGACCGCATAATCAAGAGAAATTTTGATAATCTCATGTAGCTGTAAGATTACCACCTTTGAAGCGTGATGTTCTGTCCTCTCATAATTGTAATAGTTCTCAATGTCCGTAGTCCTCAAATCACAGATACGAATTGTCTTTTTTTGAAAATACGGTGTGAGATACATTCTCACATTGTAAGAGTACAGGGCATATGTTTCTGCTTCTAAGGTCTTAAGGTTGAGATGGTCTTTCAGCCATTTGTCGAGGGACTTGTCAAACGGGAGTTCTTTATCCGTCATAGCGTCCTCTGGTGTGAAATTCTTTCTCGTTTCCAAAAGCATAGCTTCCGCACGCTTCTGATTTCCCTTTACGGGAAGTCCTGTACTGATTGATTTTGTACGCCTTTTTCCAGTGCTGTCCTTATAGCTTAAAATCATCTGGAAATATCCGTTTTGTTTTCGTAAATGTCCTGCTACCATGTTTGTCCTCCTAATAGCAGCAGCCATTCACCTATCACTGACCATGTCAAGTATAACAGGTTCTATGGCTGCTGACAACAAGCGACGCAGTAAACCTCTACAGCATTTCCATGTACCGCAGGATATTGAGCTTTGGTATCTTATAGGTTCTTCCGACCCTAAAATGTAGGATTTTATTTTCCCTCAATAATATGTAGGCTGTTTTCATACTGATACCGAGCATTTTGCTCATTTGCTCTACGGTCACTACGTCTGGATAATCCTTGAACATTACACGGTAGGTATCATTTGTCATATATTCTTTATCCATTACAACACCTCCGGGGCAAGATACAGACAGATAATCAGACAGCCTAGCGGCTCACGGGAGTCTCACCCCTGCATACACGGTGCAGCCATACCCTTTGCGTGCGACGCTTCTAACGCTCAAACTAAGGCTATACGGGAGTATCATTGTAGGATATATGAGTCATGGCACACGACCATTCCACAAGTCGCTTGCAGATCACAGATATGAACCGCTCGCTCATGTGTATGAGGTCATGGCGTATCTGTCTACAGGCTCGTCATATATCTAAAGTATCTGATTACCTTATTCACTTTTCAAAGAGCAAGATTAGGAGCTGGAATGAGGTGGTACACGCTCCTGTAGGCGGTTATTTCACTTTGACTTCAAAACCAAGAATTGCTTCAATCAGCGCAGCCCTTATTCTGCCTTTCAGTTCCATATCGACCACAATATACATGTTGCCGTGTTCGTCATAGAACGGGCGCAGGCTTGCTTTTGATATGTAAGCGTCATAGTGGTTCAAAATCTTTTCAATCGCCGTTTCGTCGCCGTCTGCCGCAGAGCAGATGGTTGAGAACAACGGACACTTTTTTACAGACTTCATATGTTTTAATCCTCCTCGTACATATCCTTAATGAGCTTCAATGTGCGTAGTCTGTTGCGGCAAACGGAGTAACGCTCCATTTCCATGACCGCTGCTATTTCAGCGTCGGTCATCTCCAGAAAGTACGACATAAGCAGAATGTTACGCCGTCTTTCTGCTAATTTTTTGATTGCTTCACATAAGCGTTCATCATAGATACGGACTTCATTGCCGAATACGTCAAATGCTGTAAATTCAACCGAGTATTCGTCTACAGCGCCGATATGGTTTATTTCCATCTCGGACAGCTCGCAGAACGGAGTTTCACGTTTTGCACGTCTGCCAAGCTCTCTGTTGTAGTCTTTTACTGTAGTGCCTACAACTTTACGAGCCAGACAGTCAAATTGAAGTCTGATAGCACTCTCGAAAGAAGATGGTTTCATAATCTCACCTCCTTTCTGTCGGAAGTTGCTAAAGCAAAAAGGCTTTTACCCCTTTCCGCACTAACACTCGCCAGAGAAGGTGGAATTTGATACCCGACTTCAAAAACTTTTTCAAAAAGTTTTTGGCAACAAAAAAGCACAAGGACAATACAAGCGTACTGATCTTGTGCTTTGCAGATTATTCCTGCTATATGATGTGAAAAACCACATTCTAGGGTGCTTTGAGCCGCAGGAAGAAGTGAGATTTTTTTGACGGTCTACCAAAGACCAGTGGCGGTCTGATACGAATTTCCATGCGGCTACCTCCTAAAGCCGCTAAAACAAAAACCTTGACCGTCTGGAAAAAAGAAAACGGCGGCTGTAAAACCGTCGTTTAAAAAGGCGCAAAAAAGCCTTGCTGCCAACGGCGGTTTTTTTATTTGCCGCCATGCAGCATATACAGTTGAAAAATGCACCTAATAGGAATTGTAAAGTCCTACAACACGTTTTTCTATACCTGTTGCTAATTCAACAGGAACAGTAAAATGTGGTGAGGTGATTACATATGCGTAAAAAAGAGGATAGATACGATTTCAAAGCAGTAGGACAGTCAATTAAAGAAGCACGCAAAAGACAGGGATTGACCCGTGAACAGGTAGGCGCAGCGATAGAGATTGACCCACGGTATTTAACAAACATTGAAAACAAAGGGGCGCATCCTAGCCTACAGGTGCTATATGACCTTGTCTCCTTGCTTGACGTATCTCTCGACGAACATTTTCTGTCAGCCGGTGAACGCCGGATTAAAAGTACCCGGCGCAGAGCTGTGGAAGCCGGACTTGACGAGCTTACCGATCAAGAGCTTATCATTGTAGAGAGCGTCATTGATGGTATCGTCAAGTCAAAGAAAGTGGAGGAAAATTAAATCCTCCACTTTGAAAAAATAATTTTATTTATTGATAGAATCCCACAATTCTAACGCTCTATCAATAATCGGAAGAATTTCCTCTTTCCGATCTGTCAATAAATTGTTCAAGTGTTCCTTTTGCACATCTGTTAAGTTTTCAGATGTATTGATTACATTACCCTGTTCGTCTATTTCAATATCTCCCGGATATAATGCTAAATATAGTTTTCTGTTCTTATCAATATTTATAGTAATATCCGTATAGCCTTTACGGGCAGTAAGTACAAAATTATCTGTGTCTAATTCATATTTTCCAATGCTTTTAGGTTCTACCAGATAATCAACACCATTTTCTGTTAAGGCATAATATCCTTGATATCTTTCAATAAAATCCTCTGTTATTTCTTTGGTAAATCCGGCATTCTCAACTTCGTCATACATATGGTTATGTTGAACTTTACATACGCCTATATATATACCGGCTATTACCCCAACAGCTAAAAAACACGTTATCAAACCTATAAAGACTTTTTTGTAAATCTTATAGCTGTCAATTTTTTTCACCATGAGCATATCCTCCTTTAGCATTGTGTCAAGTGAAACATTAAATTCGTTGCTGATTTGAACAAGTGTTTGTAAATCGGGATAGCTTTTCTCATTTTCCCAATTAGAAACAGTTTGTCTGGTTACATGGAAGATTTTTGCAAAATCTTCCTGTGTCATATTCCTTTGTTTTCTTATTTCGATAATCTTTGTACCGATATTCATTTGGCGACCTCCTTCTGGTTACATTATTGATTTCACAGCTAAAAAAAGCTAGCAACTTATCTTTACATACCTTGAAAAAGTGACGCAAATAATCTTTGACAAGCGAAAATTAAGGCTTTTTCAGGGATTGTAGTAGAAGCGATAATAAATCAGCTTTTATTATATGGTACAATCATCGAAACATCTATACTAATCATTAGCAAATTATTTTCCAGTTGCCATCAGCCTTAGCCAGTGTCAAATCAAACTGTGATACCTGTGTCGCCTTTGTAGTCTGGTCTAGGTATGCCACGGATACAGATACTTTGACGTTATCCCCGTCCATACCAAATACCGGGTTGATAAGCTCCGAGAACAGATAATTGCCGCCCACGGGTTCAAGGGCATTGCCGGACACATAGTAGGCAAGCTCTTTATCTGTGGCGGTGGGATACAGCTTGAAGAAAGTTTCCAAAAACTCTGTCACTTCCGTTGTGGTGGCAGCGTCTACCGAGCCGTCCGCTTCCTGTGCTTTTGGCGTATAGCCGGATTTCCCCGGCAGGCTGCTGATAGTCGGGTTTTTGATGATAACCATATCTCCGTTGCTGTCCGCATGGACAGTTACCGTATAAGCGGCGGTGCTTTCTGTCACCTGTTCGCCCTCCGTCACCGTCTGATTTACCGAGTAGAGGACGGTGTACTCGTCCTCGCCGGTCTGCCCGATATTCCATATCTTGACTTCTCCCACCGCAGAGCTGGTAGGAATATCGCTTCTGACAGTGTCCACGTTCAAATCCTGCAAGCTCTCTGTGAGGTAGCCGCTGATGGCGGCTGTCCGTGCGTCAATCGCTTCTTTGGTATTGCTCCATGTATAATAGGCTTTTGCAAAGTCCTTGACGAAATTTTCTATTTTATTGGTGTCCACGATACGCTGTTCTATGATTTCCTTTTCATGGACGGTGTGCATATCTATCGCCGTGAAATTCTTATATACCCCGAAGCTGACACTTGCAATCAGCACCAGCCACAGGGCAATCACGGATTTCTTGTGCGTGCCGACTTTCATAGTACGCACTTTTTTCACCTTATTAGGTTTAGTCTCTTTTTCTTTTTTCTTCCATTTCATAAAAATGTTGTCCTTTCTGTCTTAGTTGGATTTTACACGTCCGGCGCAGATGAAGTGTTGCTGCCAGTACGAGGAATTGAGGTCTGCATATCCAATTGGGTCACCGGCATGGTACATACGGTTATTTCCGGCATATATCCCGACGTGGGTTACATACGTTCCGGCACTGTAGGTACTGTGGAAAAATACCAAGTCGCCGACTTTTGACTGTGACAGAGGGATATGCTGCACAGCGTCATACTGTGCCTGTGCCGTCCTCGGAAGTGAGATACCGGCTTTTCCATAGCACCACTGAATCAGACCGCTGCAATCAAAGGAAGTATTAGGATTGCTGCCGCCATAGACATAGTTCCAGCCTTGATATTTTAAGGCTTCATTGAATATTGCCTGTGCGGTTGCGTCGCTGAAATGAGGTACAGCCAGATATTGTGAGACGAGCTGCACATAGAACATATTCCCGTATTTATACCGCCAGCCGCCGTTCTGCTTGATGGAAATTTCATTCTTGTAGTCCACCTTGACGCCGCCGGATTTCTCCCTTGCGAAACTCTCTGCCAGTGTAAAGCTGTATTTCTTTCCACGGGCAGCCACATAGTCGATAAAACCGCCGCCATAATTGTAACTCTGGACGACGGTATTTATATCGCAGCCTTTCGCTTCTGCTGAAGAAAGCAGGGAAGCAAAATACTTGCACCCCTGCTCAATAGACTGCTCCGTATTAAGGGAATTGGGCGGCAGACCAAGACTTTCACTGGACTGCATAACGTCCTCGCCTGTGCCGCCGGACTCCACCTGCATGATGGCGAGCAGATAGTTCAGATAGTCCTCAATGCCGTACTTTCGTGCGTATTTTTCAACAGTAGGCTGATGTTTCAAAACCTCCGCTGACAGGTTCAACCCCGAATAGTCAAACTGTGCGTCGCTGCTTTCCTCATCGTCAGAAGTGACGATAAAAAGGAACAAAAGCAAGCTGATAATTCCCGTGAAAATCCAGCCAAATATCAGCAGATGGCGCAGCCTCATTGTTTCCGCCCTTTCTTTGTGGTAGTTTTCTTCACGGTGTTACGCTCGATATTTTTCTGGACAGACCGTTTCGTGGCAGTCTGCCGGATTTCCTTATTCCCGGACAAAGATGAGTTCTTTCGTTCCGGGGAAGTAGGATTTTTTACCCCCGGTGCATGAGACACCGGCTCATTCTTCACCTGTGTACGGATTTCCTGTGTCTTTGTCACCGGCTCTTTGGGTGTAGGATATTTTGATATATTCTCACGGGATACCGCCGGACGCTCTTTCTTCATAGGTTCACTGTGCTGCACCTTAGAAGCTGCCGCCGGGGAAGTAGGTTTTTCCAATACTGGCTGTGTTGCTGGACGCTCATGGACAGGGGCAGCCTTTGCCGCTGATCCTGCCTGTTTCGCTTTCTCCATTTCCATGCGCCTATGAGCGATAGACGGACGGCGGTTCTCCTGCCGGTCACTCCGGTCATGCCGCCTTGCATGGCGGTTCTCTATAACGTCACGCTGAACGCCGGATACTTTTGTTGTCTGTTCTGTGGAAGCCACATAGCCAGACTGAGACGGATTTTTCCTGTTCTGTTCCTTTGCGGTTTGTGACTTCCGACTGCTGTCACCGGTACGCTGTTTTTCTCCCGTAGCACCTACAGCAGAAGCGGCACGTTTCCCGGAGGAAGTGCCCGAACTTTGCCCGTTGTCATGGTTAGGACGGGAAGCGGCTCTTTTAGTATCTGACTTAGAGCCGGAAGCCGCTGCAGCTCCTACCACGCCACCGGCAGTACCAGCCGTCAGTGTCCTGCCGATACGACGCTCCAAACGCCTTGCTCCACGGTTGAGGAACAGATACGGGCGGCGCAGGATACGTCTGCCGACCTGTTGGGTGTCATTGGACTGTAGGCTGAACATGGACATTAAATCGCCCATCTTGAAATAGATACCGGCAAAGGTGACAATCTGCAAAAAGGCTATCATAAAGAACGGATACCCGGCTGAAATAGAGTAAAACATTGTGCTGACGCTAAAGGCAGCGGTGATGATAAGGGTAATACCGGCACGCAGCATAATAGTGTTGAACAGCTTTGTGACCGCCCGTTTCCCCATGCCCTCATACGTCGGTATCATTGACAGGATAAAGCTCACCGGCAGGAACATGGCATAAATGATAAACAGCACTTGCGAGAAAATCATCATGCCGGTAAGCAGAAACACAAAGACAGAGATACCGATATTGAATATAAACAGAAAGACCACTGTACCTAACCGGCTCATGGTTTTAGTAATGCTCATGTTGTCGTTATCCCTGTCCTCGATTTCCTCAATCACGATATTCTCTCTTTCGTCGGTGTCTGGACTGGTAGACAGCAGGCTCTCCACCCGATCAGCACCGAGACTGTCTACATCTGTCGTATCATACTGCAATAAGAGCCACGGCTGCTTTACCTGTATGGAAAACAGGCTGTCACGGATAAGGTCAACGCTGTCCTTTCCTTGACTGCTGGAGTCCGGCAGCGTGATTTTTGTACCCAGCGTCAGAGCCGCACTACTGATGTCCGAGGAAAAATCATTGATTTTTGCAATGTAGTTAGGGGCATAGGCGATAAAGGACGCTGACAAGAGAAAGACCACAAGGAAGTTAAGGACGGCGTGAACCGCCTTTGTCGTTTCCCGTTTTAGCAATCCCACATAGGCTACATAGATACCCACGACCAGAATGAGGATAAGCAGGAAGCCTACATAAAATCCCTGTGTGGAAAAGCCGTTCTCCGTCACTCCGGCAAGAGTCTGGATATTCTTCCCGATTGCGTCTGCCGTATCTGAAATAAAGTCCAGCTTGTAGGCTTCCTGCACCACATAGCCAGTGGCGTTGGAAATATACATACTCACTGTCCAGACAAAGTTGGTAATCGCATAGAGTCCATACTGGATACTCTTGCCAATGCCGTCCAGCCAGTTCCACGGAAGCCAATCCCACGAACTGTCTACATAGAAGTCAAGCTGATAGTTTTCCAGCGGATATTTTGAGTATTCATTGGCAGCGTCCACCGTATCGTCCACCAGACCGGCAGCATGAGCCACCGTTCCTGTGATGGAGAGGAACGCCAGAATACCAAACACCACCAACAGGGTGATACCGAGATAGCGCAGGATGTTTCGTTTCCTCATAGGCTCTCCCTCGTTTTTTCACGCTGTTTTGGCGGTCTGGTATCAAATGCGGCAAATAAATCCGCAAAGACCGGGTGTATCTGGACAACGCCCACACGCCCGTAGAGGTCTTGAAACAGGCATTGTCCGTTCTCCAAATCCCGGAGCCGCTTCTGGTTGTTCTCGTCATCGCCATCCACACCGAAAAATTCCAGCGTATCCTTGATTTCCTTGCTGTCTGTGGAGCGGAACGCAAATTTCAGACCGATATTGTTCTTCATTTTCTCGTCGTCCACGTCGCTGGAATTTTGCGTCACGAAATAGACAGCGGCGTTCATGGAGCGTCCGGCACGGATAAGTTTGTTAGAGAGGGATTTCCCCTGTGCCACCTGTAAGAACGTCCACGCTTCGTCTAAGTCGACGATTTTAAAAATACTCCTGTCAGAATGGATAAAATCTAAGGCAAAGGTACTGATGTCCATAAGGATAGCAACCGAGAGCAGCTCCGTGGAGGTGTAGTCCTCCAGTTTCGTGTCACAATCCGGCAGCACAAGGTCAGCCACCTGTATGATGTTGAGCTGTTTGTCCAGACTGATCGAATGTTCCACCGTTCCGTCGGAAAAGAGCAGATGTGCAAAGTCATAGTCCGCAATACTCTGGATATGGCTGGCGATACTGTTTGCCGTGGGCGTGCCGATTTTCCGTAGCTCGTCCACCACGCAGAGAAGTCCGTGTTTTTCTCCCTTTGTCACTTCCTGTAGTGCCGCCATGAGGACAGGAAACTTTTCGCCGTCACGGTATCGGATACCTGTCAGAAAGGTCAGAATATCCACGGCAAGGCTTTCCCCGTCATGCACGTCTTTCATAATTACATACGGGTCAAGCATACCCCGGTTTTCTTCGCTGTTGGTAAGGTTTACAATATTGATTTCCTCCGCAATCTCCGGGAGTGTCTCTTTCCATCTGCCACGCTCTGATTTTGGGTCAATAATCACCGCATGACCGCCGAACAGCACAGAATAATAGATAATGAGGTTATTGCAGAATGACTTCCCACCGCCCAGAGAACCGAGAAAGGCAGCCGCCAGCGCATTTGTGACGGAGCCTTCGACTCCTTGTGCCGCAAGGCTGGGTTTTAGGTACACGTTCCTGCCGGTATCGAGGTTATAGCCGATATAGATACCGTCCTTTTCCCCTAGCTGTTGTGTCGCACCGAAACCCAGACCAGCCAGAAAATCACTTGTCACATACTGGATATAGTCGTTTATGTACCGCTTGCTGGCAGGAATGAACTCGCTGTGGAGTCCGAGCATATCACCAAAGGGGCGCACCAGCTTTACGTTCAAGTCGTCGTAAAAATCCTTGACCTCGTCGCAGCGGCGTTTCAGCTCGTCCAGACTGTCCGCAGACACACGCACCACATAGGAGAGCTTATACATGGACTCCTTGCTCTGGTCTAGGGTGGTTTCCAGCTCATTGACCGAGTCCAGAGCTTCCACCACACTGTTGCCGGTTTCGTTTCCAGCTTCCCATGCGTGGTTATCCAAATCTTTCAGTTCTTTCTTTTTATTGCGTACCGTCGTCAGCGCTTTTTTGTTGGTGACGATTTCCACGTTCATAGAGGTGGACACCGGGAATGTAAACTGCTGTTGTTGGTAGTAGAAGATTTCCGAAGATGGAAAATCCAGCTCCCCTACGATATTGTTAATCGTGAAGTAGGCGGCATACGTCGTCTGGTCTTCACTTTCCAGACGGAGGTATCGCTGATTTTCTTCCACTAAGCAGCGTGTCGGACGGATAAGGTCATACCGTTTTACCAGCGTTTCCCGTTTCAGCTTTTTTACAGGAAAGTCATAGAAATAGTCCTCATAGGCAACGCCGGTCTTGCCGTAGATATGCTCCAAAAGGTAGCCGAAGTCCTTTTTCTCCAGACGGCGCACCTTGAACCGGCGAGCGATTTTATTCTCCAAAAGTTTCTCCATCTTCATAAAGCGGTTGATTTCATCATTGCTCATGGAGACAAAGTCACCCATCAGCTTGTGGTTGACTTCGTAGAGGAAGTCAGCAAAGGTCATGGACAGGGATTTCCTTGCGCCTTTCAGACTGATTTCTTCCTCGTTCACCAGCAGCTTAAAGCCAAGAAAAAAACGGTAATCAATCTGATTTTCACCAATCATATCCACTAAAGCGTCCGTCTGCTCGTCTATCTTCTCACAGGCGATTTCACGCAGCCGCCCGGTCACTTCCTCTTTGGGGCGTTCCTGTATCTGCCGGATACTGGACTCTGTGCTTATCTGCAAGGCGTGTATCTTCCCGTCACGGTTCTGTGCGATAAGCTGACGGAAGCTATCGTGCAGCCGGTATTTTTCTTCCGGCGATAGGAACGAGTAATTATAGGGGAGCAGCTCGTAATAGGCGAAGCACTCCCCGTCGTGGTTGAACACCAGATTGTTTTCGATATATTTAATCGGGAACATACAAATCACTCCTAACTGCTGTAATGGCGGTATCGAATTTTTCTTTCCCCAGCTTCACCGGCTTCCCGGCATAGGTCACTTTAGGGCGCACCAGATAACTGACACTGGATTTCAAAAATCCGAAAGGCTTCTTGCCGTCAAAAGTTTTCTGACTCATAAACCATGTAAGGGCGACAGGGATACCAAAATATTTAAGAAACGCCCCGTCAATGAGGGAAAAGGGCGGCAGCTCCCCGAACAGGATAACCACAAACAGGGAGACAACAAACCATGCCATCTGTGTAAAGGTAATCGGGAACGGGAGCTGAAAATCATTGATAGCGTAAATGACCTTTTCCACGTTCCAGATACTTGTGTAGCTTCTAATTTTTTTCAAAGTCTCGTACCATTCCTTTCTTCTGAAATGAATACAGCAGCCTGTATTTACAGACTGCTGCAATAAAAAAACTGCCAGAAGAACACTTGCGTTAGATCTGACAGCTAGTAGGGGATTTCAAATACCCCGTATCTTGTTTCCACGAAGCACCCCTCTATATCAAGGTCACGGGCAAATGCTTCATAGTCAATGTAATTTCTCAAACGGTCTGGTATCTCGCCCAGAGAGCCGCACTCGTCAATGAGGTAATAAGCAACGTCTACCATGTTCTCGCAGCCGGGATACTGGATAATGTCATCCTTATGTTCGTAGAGTTCCTCCACACCCGTAAAATGACAGAGTAAATCGTCCACCACTTCACTGATAGGATAATCCAGTTCCTCAATCATGGCGCACAGTCGGTTGACTTCCTCAATCGGCGTATATTCATCCACAGGGAACGGCAGCTCATAGTCATGGATGGCGTATTCTTCATAATTGCCATTCAAGCCGATACGCTCTGCCATCTCGTCGTAGTCCACCGGCGGTGTAAACCATGCGCCTACCAGCTCGCCCTCATTGTATTTGCCTAAATTGGCGATATAGACACGCATTTCCTCCATAGGTACACCGCCTTATTGATAGCGGAACACACCGCTAGAAGTGAACAGGTATCTGTCGCTTTGTTCCAGCTCACGCCCATAGGAAGCATAGTCGATATGTGCCGTAAGCTCCGGCGGCAGCTCTCCGAAACAGTGTTCCTCCAAAATGAGATACTCCGATAAAGCGGTAGGGTCTGCCACGTCATAGTGGTGTATCTCGTCCTTGTGGTTGAGAAAATCTTCGATATTCTGAAACCATTTATAGACAATCGCCTGTAATTCCTTTCCGACAGGCGTACCCTCGATTTCCTGTATCAGACGGCAGATACCGTTGATTTCCCACAGGTTCATGCTGGGCGTAAGGGCAAAGGGCAGCTCATAGTCGGCAATCTCGATTTCCTCCACCTCTTTGACACCGAGCTTTTCTTTTACTTCCTCAAAATCCACCGGGCAGTCAAACCATTCCCCGGCGTACCCGTCCTCGTCCTCCGAGTAGGGTTTTGTGATGTTGAGGATATAAAGTCTCATTTCATACATTGCTTTTTCATCACGTCCTTTCTAAACGTGCCATCACTTCTGTTAAGATAGCATACTTCACGCCGAATACATAACGTGAAAAGTCTTCTAGCTGCTGTTTTGGATAATCGGCAAGGGGAAGCCGCTTCATCTCGAACCAGACTGCCCGTTTGTAGTAGGCCAAGTCCGAGATGTAAGAGCAGCCGATCCTTGCCTGTATGTGATAACGTATAATTAATTTCGCAAAATCAATTTCATAAAAATAGACATGGTCTATAGCCGTTTGGTAAAATCAAGTCACCACAACTAAATCTATCAAAGGAGGCTATAGAACCATGTCTGATAAGATTATACAGCTAAATGAGGACTTAATAAAGCATGATTTAAAGGATCTTGTCCGTAACAGTGTCGAAGAAACATTAAACGCCCTGCTCGATAAGGAAGCCGACGAATTAGTCAACGCTGAAAAGTATGAGCGCTCCTCTGACCGCCAGGGATATCGTTCCGGCCATTATAAGCGAAATCTCCACACTACCGCAGGGGAAGTCGAACTGAAGGTTCCTAAACTGAAAGGGGTTCCTTTCGAGACAGCCATTATCGAAAGATATCGTCGCAGAGAATCTTCCGTGGAAGAAGCTCTTATTGAGATGTATCTGGCCGGTGTTTCTGTCCGACGCGTGGAAGATATCACCGAAGCCTTATGGGGAACGAAAGTATCCCCTGGAACCATCAGTAACCTGAATAAAAAGGCTTATGAGCATATTGAAACCTGGCGTACCCGTCCGCTTTCCGGGAACTATCCTTATGTTTACGTAGATGGTGTTTACCTGAAACGTAGCTGGGGCGGCGAGATCCAGAACGTTTCTGTTCTCGTTGCCATTGGCGTCAGTCAGGATGGCTGCCGGGAAATCCTTGGTGCTGCGGAAGGGATGAAAGAGGATCGTGAAAGCTGGCGTTCTTTCTTCGTATGGTTGAAAGAACGCGGGCTTACTGGTGTACGTTTGATCATCGGCGATAAGAATCTCGGTATGCTTGAAACCATTCCGGAAGTCTTTCCGGATGCCAGATATCAGCGCTGTACGGTTCATTTTTACAGAAATATATTTTCTGTTACCCCCCGTAACAAGATGAAAACGGTAGCGCTTATGCTCAAGGCGATCCATGCGCAGGAAAGCAAGGAAGCCGCCCGTGAAAAAGCAATCCAGGTAGCGGAGAAACTCCGTGCCATGAAGCTTGCTAAAGCTGCCAAGAAGGTAGAGGACGGGATTGAAGAAACCTTGACCTATATGGATTTTCCTACGCAGCATTGGACCCGGATCCGAACCAATAACGCTATTGAGCGCCTCAACCGTGAGATTAAACGGCGTACAAAAGCGATCGGTGCTTTTCCTGACGGGCAGAGTGCCTTGATGCTCGTATGTGCCAGACTGCGTCACGTAGCAGCAACCAGTTGGGGAGCCAGACGCTATATGAATATGGATCATCTTTTCAAAACAGAAGAGGATCTGCTGTCTGATATCATAGCCGGCTGACTACCGGCTGCTAAACGGCTAGGTTTTGATTTTGCGAAAAACTATTGACACTATCCTGTATGTCGGTGAAAATATCTTTGTCCACAATCAACCTCCCATCTGCTTTAGGGTAAGAAAAAAGCAGCAAATCTTTAGGATTTACTGCTTACACAAGAACTTTTAAATTTTTTTAAAAGGGATAACGCCTTTTCAGCGGTAGGTATAATTTCTTCATGGCTTATATGGTCTTGATATTTTTTGTGCTGCTTTTTCCACTGGGAATTTTCCCCTTCTTCAATTTCTGTCCATCTAAAAGTAATAATTTCTGAAGCAAAAGGAAAATGCATAGGAAATGAAAAACCACCTTTACGTTCATATAATTGTATTACATTTACATCATGTTTCAATTTATTATTTGCATACCGTAAGCCGCTGAAAAAAGCCTTTTCATTTTGAGATAATGGATTTTCTCCTATGCTTTGTAAATATTCATATGTATCAAGAAGCCAATGAAGAAATGTTCCAACTCTATAATTTACATCTTTTTTCAGTTCGTCACTATTCAAATTATGTGTTGCTGCCTTAATAGCCAAGTCCATTTCCATAAAACTTTTATTTAAAGCATATATCAGCATTTACAAACTCTCCTTTATGCTAATAAGTCCATTACAATTATTAGCTTTTATCTTTGAATAATTCTTTAATTTTATCATCAGATATATTTTCTAAATCAGGTTCAATTCCAATCGCTAATAATTCATCAGATGTTATTTTTTGATTTACTATTTTGCAATGTTTTGTAAAACACTCCACAATTATATACGTTGTTCCCATAAGAATAGCAGTACATATGATAATAAAAAATGCAATTAAAAACTTATTGTTTAAGCCGATATCTCCAAAAATACCACTCAAATAACCTAATCCTATAAATGCAAAAGCACATTTGTTAAGATAAGCATTTCTATATGTTTCTCTAAAAACCGTTTCGTTATAACTTAATTTATCGGTATTGTTATCCCGATAAATTAATCCCTTACCAGCAAATTGTTGAATTATTTTTTCTCGTTTAGTAGAAAGAGCAAAAATCATTAATAATAATGCTCCTGCCAGTTGTAATGAAATGGATAATATATATATTATTGTATACATAGATTATATCCTCCTTAAACTGTATTTATACTGCACTATATCATAATAATAACGGCATATCTGTAATCAACCTCCAATCTAGTCTAGTGTAAGAACAAGCAGTAAATCTTTAAGATTTACTGCTTGTTCTTAAATGTGACGGTAACATTTGATTTAGTTCTTCATTTGAAAAAAATTGCTCTTCATTATTCTTTAATGCTTTTTGTGCTTTTGAATAGCTCCTAGAATTGGTTTTTGGCGATTTCTTATAAATACTGTAAGTTTCGTTAAGTAAACTGTCTCTCATGGATACTATAGCATTATCGTCTAATACATCAAAGTCAGTCATTAGAGAAACATATTTTTCCCGAATAAGCCATAGTTCATCTGCCGCAAAAATATGTTGTTTCATTTCACTTTCGAGATTAAAATTTTTGAAGAACAGATTAAACGCCAGCAAAACCGTCGAAAAGAAACCGGCAATAATTGTATAAACATTTTCGTTTGTTACTACTGCGCCTAAGAAGCCGCCGGTAGATAAAGCAGATAATAGTATTTGTGCATATTTTATTAATTTTAATCTATTCTCTAAGGTGTTGACCTGTTTTAAATGGCTTGTGTATGTATAGACTACTCTACCATACGCTTCTTTTAATTGTATTAACAATCCGGCTCTATGTTGGGAATTTTGTACCATATATTTCTCTCCATTTACTATGCCATGTATAATAATATCCGTTGCTGTACGCAGAAATTGCTTCTAAGGACAAATCATATGCTTGTTTGGCTTCTCTTTTAAAAGAATATTTTTTTGTGACGTGTTTTCCATCACCAAATTTTATCCAGTATTCTTTGTCAGCATTATCATAAAGATATTTAAAAAAGTCTCTGGACAGCCAATCATAGTAAGAATATGATTTATCAGCGTATTTATAAGTTTGAATAAATCGGTAAGCAATGGTATCAATCAAAACTCCAGTCATACATACACCCATATTGTTGTTCCATGCTCTTGCCATACGACATAAACGCTTTAGATTTTTATTTGAAAGACTATTACGTCCATTAAAACAGTTCATTTCCTGTTTGGGATCCATGTATTTCCAACTGCCCCCATTATGAGTGTCTGGATACACATAAGAACCATCCTCTAACTTAAAGGACGGTACTATTTCAAATTTAATGCCATTGTAAAAGTCAATAACTACTACCTGTCCATCACCACCTATTTGAGAAGAAGAATACGTTTTTTGAAGCGACAATTTAACCGCCTGTAATAAAGCAGATTGTCCGTTTCCAGTATAATTGTTATATTTTGTATATTCAGACCATGGCAATTCTATAACGATATCAATATCACTGGTATATATAGCGGTTCCTCTTCCATAAGACCCTACATAAAAACTATGATTTGTCTCTGACTCTGTAGACCAAAAATCTTTATTTATCCTTTTAGTAATTGCATGATAACGAGTTTGTACATCATTTACCACAGAAGAACTCATTCTAAGATTTTTACAAAATGTAGAAAAATCTTGACTTACAGAAATATCCGACAAATAAATCCCTCCTCTAAACATCTTACAGAATATTATACCAATTTTTGGTCAAAGCTACAATACAAATCATGCTCCGATAATACGGTTGAACAGCTCCAGCAGCACATCTTTTACGCCACCGGCATTGAATACCAGACCTACCGCAATGAGCGCAATTACCAGAAAGCCGATAAGTTTGGAAAACTCACGCTTAAAGCCTAAGTAGATACCGATTACAACAATCGCCATCAGCACCAAGCTCTGTGCATTGCTTAAAAACCAGTTGTATAAGTTCTGACCGAAATTCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP041667|1615698:1632890|1617319_1617532_-|WP_070504122.1|DBSCAN-SWA MDKEYMTNDTYRVMFKDYPDVVTVEQMSKMLGISMKTAYILLRENKILHFRVGRTYKIPKLNILRYMEML >NZ_CP041667|1615698:1632890|1624468_1626928_-|WP_143929737.1|DBSCAN-SWA MFPIKYIENNLVFNHDGECFAYYELLPYNYSFLSPEEKYRLHDSFRQLIAQNRDGKIHALQISTESSIRQIQERPKEEVTGRLREIACEKIDEQTDALVDMIGENQIDYRFFLGFKLLVNEEEISLKGARKSLSMTFADFLYEVNHKLMGDFVSMSNDEINRFMKMEKLLENKIARRFKVRRLEKKDFGYLLEHIYGKTGVAYEDYFYDFPVKKLKRETLVKRYDLIRPTRCLVEENQRYLRLESEDQTTYAAYFTINNIVGELDFPSSEIFYYQQQQFTFPVSTSMNVEIVTNKKALTTVRNKKKELKDLDNHAWEAGNETGNSVVEALDSVNELETTLDQSKESMYKLSYVVRVSADSLDELKRRCDEVKDFYDDLNVKLVRPFGDMLGLHSEFIPASKRYINDYIQYVTSDFLAGLGFGATQQLGEKDGIYIGYNLDTGRNVYLKPSLAAQGVEGSVTNALAAAFLGSLGGGKSFCNNLIIYYSVLFGGHAVIIDPKSERGRWKETLPEIAEEINIVNLTNSEENRGMLDPYVIMKDVHDGESLAVDILTFLTGIRYRDGEKFPVLMAALQEVTKGEKHGLLCVVDELRKIGTPTANSIASHIQSIADYDFAHLLFSDGTVEHSISLDKQLNIIQVADLVLPDCDTKLEDYTSTELLSVAILMDISTFALDFIHSDRSIFKIVDLDEAWTFLQVAQGKSLSNKLIRAGRSMNAAVYFVTQNSSDVDDEKMKNNIGLKFAFRSTDSKEIKDTLEFFGVDGDDENNQKRLRDLENGQCLFQDLYGRVGVVQIHPVFADLFAAFDTRPPKQREKTRESL >NZ_CP041667|1615698:1632890|1627408_1627897_-|WP_143931233.1|DBSCAN-SWA MRVYIANLGKYNEGELVGAWFTPPVDYDEMAERIGLNGNYEEYAIHDYELPFPVDEYTPIEEVNRLCAMIEELDYPISEVVDDLLCHFTGVEELYEHKDDIIQYPGCENMVDVAYYLIDECGSLGEIPDRLRNYIDYEAFARDLDIEGCFVETRYGVFEIPY >NZ_CP041667|1615698:1632890|1628736_1629933_+|WP_143928506.1|transposase|DBSCAN-SWA MSDKIIQLNEDLIKHDLKDLVRNSVEETLNALLDKEADELVNAEKYERSSDRQGYRSGHYKRNLHTTAGEVELKVPKLKGVPFETAIIERYRRRESSVEEALIEMYLAGVSVRRVEDITEALWGTKVSPGTISNLNKKAYEHIETWRTRPLSGNYPYVYVDGVYLKRSWGGEIQNVSVLVAIGVSQDGCREILGAAEGMKEDRESWRSFFVWLKERGLTGVRLIIGDKNLGMLETIPEVFPDARYQRCTVHFYRNIFSVTPRNKMKTVALMLKAIHAQESKEAAREKAIQVAEKLRAMKLAKAAKKVEDGIEETLTYMDFPTQHWTRIRTNNAIERLNREIKRRTKAIGAFPDGQSALMLVCARLRHVAATSWGARRYMNMDHLFKTEEDLLSDIIAG >NZ_CP041667|1615698:1632890|1631173_1631740_-|WP_143929742.1|DBSCAN-SWA MVQNSQHRAGLLIQLKEAYGRVVYTYTSHLKQVNTLENRLKLIKYAQILLSALSTGGFLGAVVTNENVYTIIAGFFSTVLLAFNLFFKNFNLESEMKQHIFAADELWLIREKYVSLMTDFDVLDDNAIVSMRDSLLNETYSIYKKSPKTNSRSYSKAQKALKNNEEQFFSNEELNQMLPSHLRTSSKS >NZ_CP041667|1615698:1632890|1615698_1617222_-|WP_143929731.1|integrase|DBSCAN-SWA MVAGHLRKQNGYFQMILSYKDSTGKRRTKSISTGLPVKGNQKRAEAMLLETRKNFTPEDAMTDKELPFDKSLDKWLKDHLNLKTLEAETYALYSYNVRMYLTPYFQKKTIRICDLRTTDIENYYNYERTEHHASKVVILQLHEIIKISLDYAVTLGWIEDNPADSINPATNEVSILFTDFMLEWLEMMKQCVEETTMASYTCSVKKRIVPYFLEKRYTLTEIEENPKYIQDYYQSELDKGLSANTVIHRHANIRKALQYAFQIGLIKSNPADRVERPKKNKFVASYYNKEELDTLFKVSKGDPMELAIILAAFYGLRRSEVIGLKWSAIDFERKTITIKYTVTEVNLNDGRGNVLIEKERTKSKTSRRTLPLVKPFEDLLIKMYREQKQNRRLCGDCYCTDYLDFIYVNEIGERIKPGYLTQHFPLLLKKHNLRKIRFHDLRHSCASLLYANGVSLKQIQEWLGHSDISTTSNIYTHLDYSSKVSSANAILSVFPSQTAAEIEKAVG >NZ_CP041667|1615698:1632890|1630064_1630547_-|WP_143929740.1|DBSCAN-SWA MLIYALNKSFMEMDLAIKAATHNLNSDELKKDVNYRVGTFLHWLLDTYEYLQSIGENPLSQNEKAFFSGLRYANNKLKHDVNVIQLYERKGGFSFPMHFPFASEIITFRWTEIEEGENSQWKKQHKKYQDHISHEEIIPTAEKALSLLKKFKSSCVSSKS >NZ_CP041667|1615698:1632890|1626914_1627304_-|WP_143929738.1|DBSCAN-SWA MKKIRSYTSIWNVEKVIYAINDFQLPFPITFTQMAWFVVSLFVVILFGELPPFSLIDGAFLKYFGIPVALTWFMSQKTFDGKKPFGFLKSSVSYLVRPKVTYAGKPVKLGKEKFDTAITAVRSDLYVPD >NZ_CP041667|1615698:1632890|1630587_1631064_-|WP_143929741.1|DBSCAN-SWA MYTIIYILSISLQLAGALLLMIFALSTKREKIIQQFAGKGLIYRDNNTDKLSYNETVFRETYRNAYLNKCAFAFIGLGYLSGIFGDIGLNNKFLIAFFIIICTAILMGTTYIIVECFTKHCKIVNQKITSDELLAIGIEPDLENISDDKIKELFKDKS >NZ_CP041667|1615698:1632890|1631717_1632593_-|WP_143931235.1|DBSCAN-SWA MSVSQDFSTFCKNLRMSSSVVNDVQTRYHAITKRINKDFWSTESETNHSFYVGSYGRGTAIYTSDIDIVIELPWSEYTKYNNYTGNGQSALLQAVKLSLQKTYSSSQIGGDGQVVVIDFYNGIKFEIVPSFKLEDGSYVYPDTHNGGSWKYMDPKQEMNCFNGRNSLSNKNLKRLCRMARAWNNNMGVCMTGVLIDTIAYRFIQTYKYADKSYSYYDWLSRDFFKYLYDNADKEYWIKFGDGKHVTKKYSFKREAKQAYDLSLEAISAYSNGYYYTWHSKWREIYGTKFPT >NZ_CP041667|1615698:1632890|1628432_1628621_-|WP_143931234.1|DBSCAN-SWA MQARIGCSYISDLAYYKRAVWFEMKRLPLADYPKQQLEDFSRYVFGVKYAILTEVMARLERT >NZ_CP041667|1615698:1632890|1618120_1618537_-|WP_117579490.1|DBSCAN-SWA MKPSSFESAIRLQFDCLARKVVGTTVKDYNRELGRRAKRETPFCELSEMEINHIGAVDEYSVEFTAFDVFGNEVRIYDERLCEAIKKLAERRRNILLMSYFLEMTDAEIAAVMEMERYSVCRNRLRTLKLIKDMYEED >NZ_CP041667|1615698:1632890|1619081_1619441_+|WP_143929733.1|DBSCAN-SWA MRKKEDRYDFKAVGQSIKEARKRQGLTREQVGAAIEIDPRYLTNIENKGAHPSLQVLYDLVSLLDVSLDEHFLSAGERRIKSTRRRAVEAGLDELTDQELIIVESVIDGIVKSKKVEEN >NZ_CP041667|1615698:1632890|1622315_1624472_-|WP_143929736.1|DBSCAN-SWA MRKRNILRYLGITLLVVFGILAFLSITGTVAHAAGLVDDTVDAANEYSKYPLENYQLDFYVDSSWDWLPWNWLDGIGKSIQYGLYAITNFVWTVSMYISNATGYVVQEAYKLDFISDTADAIGKNIQTLAGVTENGFSTQGFYVGFLLILILVVGIYVAYVGLLKRETTKAVHAVLNFLVVFLLSASFIAYAPNYIAKINDFSSDISSAALTLGTKITLPDSSSQGKDSVDLIRDSLFSIQVKQPWLLLQYDTTDVDSLGADRVESLLSTSPDTDERENIVIEEIEDRDNDNMSITKTMSRLGTVVFLFIFNIGISVFVFLLTGMMIFSQVLFIIYAMFLPVSFILSMIPTYEGMGKRAVTKLFNTIMLRAGITLIITAAFSVSTMFYSISAGYPFFMIAFLQIVTFAGIYFKMGDLMSMFSLQSNDTQQVGRRILRRPYLFLNRGARRLERRIGRTLTAGTAGGVVGAAAASGSKSDTKRAASRPNHDNGQSSGTSSGKRAASAVGATGEKQRTGDSSRKSQTAKEQNRKNPSQSGYVASTEQTTKVSGVQRDVIENRHARRHDRSDRQENRRPSIAHRRMEMEKAKQAGSAAKAAPVHERPATQPVLEKPTSPAAASKVQHSEPMKKERPAVSRENISKYPTPKEPVTKTQEIRTQVKNEPVSHAPGVKNPTSPERKNSSLSGNKEIRQTATKRSVQKNIERNTVKKTTTKKGRKQ >NZ_CP041667|1615698:1632890|1627918_1628422_-|WP_143929739.1|DBSCAN-SWA MYEMRLYILNITKPYSEDEDGYAGEWFDCPVDFEEVKEKLGVKEVEEIEIADYELPFALTPSMNLWEINGICRLIQEIEGTPVGKELQAIVYKWFQNIEDFLNHKDEIHHYDVADPTALSEYLILEEHCFGELPPELTAHIDYASYGRELEQSDRYLFTSSGVFRYQ >NZ_CP041667|1615698:1632890|1632668_1632890_-|WP_010817890.1|DBSCAN-SWA MNFGQNLYNWFLSNAQSLVLMAIVVIGIYLGFKREFSKLIGFLVIALIAVGLVFNAGGVKDVLLELFNRIIGA >NZ_CP041667|1615698:1632890|1620387_1621296_-|WP_143929734.1|DBSCAN-SWA MKWKKKEKETKPNKVKKVRTMKVGTHKKSVIALWLVLIASVSFGVYKNFTAIDMHTVHEKEIIEQRIVDTNKIENFVKDFAKAYYTWSNTKEAIDARTAAISGYLTESLQDLNVDTVRSDIPTSSAVGEVKIWNIGQTGEDEYTVLYSVNQTVTEGEQVTESTAAYTVTVHADSNGDMVIIKNPTISSLPGKSGYTPKAQEADGSVDAATTTEVTEFLETFFKLYPTATDKELAYYVSGNALEPVGGNYLFSELINPVFGMDGDNVKVSVSVAYLDQTTKATQVSQFDLTLAKADGNWKIIC >NZ_CP041667|1615698:1632890|1617878_1618115_-|WP_143929732.1|DBSCAN-SWA MKSVKKCPLFSTICSAADGDETAIEKILNHYDAYISKASLRPFYDEHGNMYIVVDMELKGRIRAALIEAILGFEVKVK >NZ_CP041667|1615698:1632890|1621317_1622319_-|WP_143929735.1|DBSCAN-SWA MRLRHLLIFGWIFTGIISLLLFLFIVTSDDEESSDAQFDYSGLNLSAEVLKHQPTVEKYARKYGIEDYLNYLLAIMQVESGGTGEDVMQSSESLGLPPNSLNTEQSIEQGCKYFASLLSSAEAKGCDINTVVQSYNYGGGFIDYVAARGKKYSFTLAESFAREKSGGVKVDYKNEISIKQNGGWRYKYGNMFYVQLVSQYLAVPHFSDATAQAIFNEALKYQGWNYVYGGSNPNTSFDCSGLIQWCYGKAGISLPRTAQAQYDAVQHIPLSQSKVGDLVFFHSTYSAGTYVTHVGIYAGNNRMYHAGDPIGYADLNSSYWQQHFICAGRVKSN |
19 | Streptococcus_phage(75.0%) | integrase,transposase | attL 1610537:1610550|attR 1632046:1632059 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2078196 : 2119060
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP041667|2078196:2119060|DBSCAN-SWA TTTATTCATTTCCCGCCGTCATTCCACTGAGCTTTTTCTTCTTGTATTCCTGATAATTCCCAGCTTCAATGGCCCCTCGGATTTCCTTCATCATAGTATTGTAGAAATACAGATTGTGAAGCACACAAAGCCGCATCCCCAGCATTTCTTTTGCTTTCAGCAGATGCCGGATATAAGCTCTGGAATATCTGCGGCACACCGGACACTGGCACCCTTCCTCAATGGGGCGCGCATCCAGCTCGTATTTTGCGTTAAATAGATTTAATTTTCCTTGGTTGGTATAGACATGGCCATGCCGGCCGTTTCTGCTTGGGTAGACACAGTCAAAGAAGTCCACGCCCCGGTCCACCGCTTCCAAAATATTGGCAGGCGTACCCACTCCCATCAGATAAGTAGGCTTATTTACCGGCAGATGAGGGACTACCGCCTCCAGGATCCGGTACATATCCTCATGGGACTCGCCTACCGCCAGGCCGCCCACCGCATAACCGTCAAGATCCAGCTCCGCAATCTCCTGAGCATGACGGATACGGATATCTTCACAGATCCCGCCTTGGTTGATCCCAAACAGAAGCTGACGCGGATTGATGGTGTCCGGCAGACTGTTCAGCCGTTCCATCTCCTTTTTGCAGCGCCTCAGCCACCGGGCGGTCCTTGCCACAGACCGCTCAATATAATCCCGGTCCGCCACACTGGATGGACATTCATCAAAAGCCATGGCTATGGTGGAGGCAAGGTTGGACTGGATCTGCATACTCTCTTCCGGCCCCATGAATATCCTTCGTCCATCAATATGGGAATTGAAATAAACGCCTTCTTCCTGGATCTTGCGCAGGCCCGCCAAGGAGAACACCTGGAACCCGCCTGAATCCGTCAGGATTGGCTTATCCCATACCATAAACTTGTGAAGTCCCCCAAGCTTTTTCACCACCTGGTCCCCGGGGCGCACATGGAGATGGTATGTATTAGAAAGCTCTACCTGGGTTCCGATCTCTTCCAGATCTTCAGTAGAAACCGCTCCCTTGATCGCCGCCGCTGTTCCTACATTCATAAATACAGGGGTCTCAACGACCCCATGTACAGTGTGAAGCCTTCCTCGTTTAGCAAGGCCGTCACGCTTTATCAATTCATACATTGCATTCTCATCCTTTGATTTAGTAATGGCATACCACCGGCGTACCGTCAGAGATCAGATCATACATCTTTGCCGCCTGATCCGTTGGCATGTTGATACACCCATGGGATCCATTCGTCTTGTAGCGTTCGCCGCCAAAAGAAGACTGCCAGGAGGCATCGTGGAATCCGATTCCGCCGTTAAATGGCATCCAGTATTTTACCGGTGACTCATAGGTATAAGACCCGTCAGGAAGCCGGTCTCCCCGAAGCACCGCGTTTTTTGTCTTATATGACAAAGAATAAGTTCCTTGCGGAGTCCCATTTCCTTTATTAGGATTTCCCGTTACAACCGGAGAATCCAGGACCACCTGACCATCCTTAATGAAATACGCATGCTGGTTGGTCAAGTCCACTTCCGCATAGGTGGTTCCAATGTCAATAGAACCATGGCTTGCCGCCCGGCTGGAGTATTCCGGTTCCCTTGTTACTACTTCGCCATTCTGGATATTTGCGACCAACGCATTGTATTCCGCATCCTGGTCGATCTTCCACCCATAACTTCCGCCTTCTACTGTCACCGTATTCCCTGTCGCTGTGGTAAACTGTCTGGATTTTCCTTTTGTATCATATTTGTCCGCCAGCGACTGGATATATTGCTTTACCGCATCTGTATTAAAGGTAACTTCCATCTTATCATTGACCGTTACCCACTGAGAGATCACAGCCGCGTTTACCACCTCTGTATTGGGGTTGAAATCATAGGTGATCTCCGCTTTTAAGTAGCTGTTCATGGCATTATTAGCCGCGATCACTTCTTCAGAGTCTGACACATATGAAGGCTTGATATAGCACTCTTCTTCTGTCAGATTCAACGTGTGCTGGAATCCTTCAATATAACTGCGCACTTTCTCATTGAACACTTCTTCATTGATCTGCGTCCCCACAACTTCCGGCACGATCTCAAATTGGGTTTCCTTAAATTCCGGGTGCGCGTTTACGGACGCCACCTGATTCTCAGGATTCATGCATGCCAGAGCCGCGATCTGCTCCGCCAGCGCAGTCTCGTCATATTTGACTCCTACTTCCGCATCCAGCTCCGGGTGGTCCCACAATGTCGTCAGCCACAGGAAATTATCCTGATCCTTCACAAGCTGAGCCAACTGCTTGCCAGGCACATACTCCAGAGAAATATTTTTGCCTGCGATCTCTTCCGTTCCGCCGTCAGATTCTTCCAGCGTCAGTGTGTAATCTTCTACCTGCTGGCGCATATAATCTTCAACCTGGCTGACACTCTTTAAAGACACGTCTGTTCCATTGATCGTAGTAAAGAACATGAAATGACTATTAAAGTAAACCGCAAATCCAATATAGACTACAGCGATCACACCAATGATACTGCCGATGATGATCGCCAGTTTCTTTCTAAATTTATGCTTCCGATAGGCCTTTTCCGCATCCGTCTCTTCTTCCTCCTCGCTCTCCTCTTCCGGATTCAGAATATCCGTCTGGTCTTCCGCGCTATCTTTGTCCAATTCCTCTTCTAATGCCTCTTCTGCCTCTTCCTGAAAATCCTCTTCCTTTTCTAAGTCCTCTTCATCCAGGAAATCTTCCTCTAAATCTTCTTCGCCATCTAGGTCTTCTTCATCTAAAAATTCTTCGCTGTCGAAACCCTCATCTTCATCCTCTTCGTCTAAGAATTCTTCTTCATCTAAATCTTTTTCCTGGAGATCCTTGTCGTCTAAATCCTCTTCCTCAGACACAGGGATATCTTCCTCTTCTTTCTTTTCAAAGACTTTATCTCTATCCAGATCATCATAGATCTCTTTCATTGTTTCTTCAATGATCTCATCTGCATTTTTATCAGTATTTCCCATATAACCATACTCCTCTTAATATTTACAGACTCACACGCTAACATTATAATATGCTTCCATGCAAAAGTACAGCATAAAATCTTTAAAATTTTTTGAAAAAAAGAAGGTTTTTTTGTGAAGGATTGCAGGTTTTCTCCTCATCCTATATAATAGAACGGTACGACAGAAATCATACCAGCAAAAGGAGATTATTTATGGCAGGTTCAACTTATGGAACTTTATTTACGATCACTACCTGGGGCGAATCCCACGGTCCGGGAGTCGGCGTTGTCATTGATGGCTGCCCCGCGGGTCTGCCTCTTTCTTCCGAAGATATACAGAAGTACCTAGACCGGCGCAGGCCCGGCCAAAGCCGTTATACCACTGCCCGCAATGAAGCAGATGAGGCGGAGATCCTCTCCGGCGTTTTTGAAGGGCGGACCACCGGAACTCCGATCTCAATCCTGATCCGCAATCAGGACCAGCGTTCCAGGGATTATGGGAATATCAAGGACTGTTACCGACCCGGCCACGCGGACTATCCTTTTGACGCCAAATACGGTTTCCGGGATTACCGGGGCGGCGGACGTTCTTCCGGCCGGGAGACCATTGGCCGGGTAGCCGCTGGCGCTGTAGCCGCTAAGCTGTTGGAAAGACTTGGCGTCCGTCTGCTGACCTATACCAAATCCATAGGTCCCGTCTCCATTCCTTCCGAGGAATATGATTACAGCCAGATCAGCTGCAATCCTTTATACATGCCAAATGAAGAATATGCCCGGAAGGCACAGGATTATCTGCAGGAGTGTATCCACTCCCTGGATTCCTCCGGCGGTATCATCGAATGTCAGGCCAAAGGACTTCCCGCCGGTATTGGAGAACCTGTTTTCCAAAAATTGGACGCCTGTCTGGCCAAGGCCATTATGTCCATTGGAGCCGTAAAAGGCGTAGAAATCGGCGACGGTTTCGCCGCCGCCAAGTCCAAAGGAAGTCTGAACAACGATCCCTTTATTTGTCAGGATGGAAAAATTTCCAAGATGACTAATCATTCCGGCGGCACACTGGGCGGGTTCAGCGATGGTTCTGCATTGATCCTGCGGGCTGCGGTAAAGCCCACTTCCTCTATCGCGCGGGAGCAGAAAACCGTTACTTCCAGCCTGGAGAACACCACTTTGACCGTCAAAGGACGGCACGACCCGGTGATTGTTCCAAGAGCTGTAGTCGTAGTAGAAGCCATGACTGCCCTGACTCTCATCGATCTTATGATGCAGAACATGACAGCCCGGTTAGAGTGGATGGAAAAATTTTATTTAGGGCCGGGGGATTTTTAAAAATCCCCCGGCCCCAATTAAAAATGCCCTGTCCCTTTTTACACGCGAAAGCATCCGCCTCCAATGAACTCCCGCAGCAGGGAGTTCATTGGTACCATCTCGCTGCCGATCCCCACGAAGTAATCCATAAATTTCAGCAGTCGTTTGGCTTCCAGATATTCCGGACAGTCCCGGTCAATCCCGAAGAGCAGCCTCTCTACGTAGGGTTTTAAGACCGCCAGCTCGTAGATCAGGGCAGAATTGAACATCACATCCGCCTCTTCTTGATAAGGGAAGATGTTGCGCTCTTCTCCCCGGCGCACAGAAGACCACATACGGATGGTCTTTTGGGCAGAAGCGCCTCTGGTCCTGGCGTCCCGCACGATACGGCGGATCAGCCGTCCATCTGTGGTAGGGATCCGATTATGTTCGTCTATATTCAGCTGTGTCAATGCGCTGATATAAATCTTGAATTTATTCTCATTGGAGAGGCTTTCCGTCAGTTTGGGATTCAGGCAGTGGATCCCCTCGATGACCAGCACATCATTTGCGCCCAGTCTTTTTGGTTTATTTCCGTATTCTTTGTGCCCTGTCACAAAATTAAAGGTTGGGATCACCACTTCCTGGCCGGCCAGCAGTTCTTTGAGCTGCTGGTTAAAAAGCTCCACATCCACCGCTTCCAGACATTCAAAATCATAAGCTCCCGTCTCATCTCTTGGATTCTGCTCCCGCTCTACGAAATAATTATCCACCGCGATCGGGTGAGGAACTAATCCGTTTGCTCTAAGCTGGACCGAAAGTCTGTGGGAAAATGTAGTCTTTCCGGAAGAAGAGGGTCCTGCGATCAAAACAAACTTCACCTGAGGGCTGGCAGCGATCTCCCCGGCGATCTCCGCGATCTTCTTCTCCTGCAGAGCTTCCTGGACCAGCACTACATCTGTGGCTTCTCCCCCTGTGATCTTGTCATTTAAGGCTCCCACCGTCTCGATCCCCTGCATATCGCCCCATTTCGTAGATTCCTTCAGAACCTGGAACAGCTTATTCTGGGGCTGAAAATCCGGAACCTTAGCGGGATCTGACATATCCGGCATCTGGATCACAAATCCTTCATCATAAGGATAGAGTTTAAAGTACTTCAGATATCCGGCGCTTGGCACCATATATCCATAATTGTAATCTTCAAATTCATTCAGACTGTAGATATTAACTTTGGAAACTCTCCGATATTCAAACAGCCGCTCTTTATCGTACATCCCATGTTTGTGGAACAGCTCGATCGCTTCATCTGTATGTATACTCCGTTTCTGGATCGGCAGGTCTTCCTCCACCATCTGGCGCATCCGCTCTTCCACCAGGCCAAGAAACGCTTCATCGGGAGACACCTTCCCCTCGATCGTACAGTAATATCCTTTGCTCACCGAATAATGGATCCGAACCTTCTCAATTGCTTCCCTGTCAGCCACATCATAGATCGCCTTGACCAGCAGAAGGCTCATGCTCCGTTTATAGGTTTTGTGCCCCACCGATCCGCCGATGGTCTCGAAGCGGATCTCGCAGTCGAATCTCAAGGGCTTATGAAGTTCCTGCAGCCTGCCGTCTACAAACGCCAGGACAATATCATATTTATAATTCTTCTGGAAATCCTTGGCGATCGTTCGATAGGAGGTTCCCTCCTGGTATATTCTCGTTTCCTCCCCCACGGTCACCCGATAGCTTCTCTTCTCCGCCATTTTGTCTCCTTTCGTCACACCGTCCAAAATGGATGCCGGACTGGTTCTGTCTTCACCTGGACTCCTGGGAAATGTTCTTCCACATATCTCCTCATATCGTCTACAAAGATATGCTCAATCCCGTAATGGCCCCCATCGATCACCGCCAATCCCTGGGCGGCCCCATCAATGCCATCGTGATGGCCTATATCCCCCGTGATCAAGACCTGCGCCTCCTGCTCCAGCGCCGCACCAATCCCGCTCTTTCCAGACCCCGGCAGAATCGCGGCCCGGCGGATCTTCATCTTCGGATCTCCGAACACCCGCACGTCAGGAAGGCCAAAAGCTTCTTTCACCTGCCGGCCAAGCTCCTCCAAGGTCGTCTCTTCCTCCAGATCGGCGATCTGTCCGATCCCCGGTCCGCCTTCTTCCTCGCTGACAGGCTCTAACGCCCGGCCCCTTTTCCATCCCAACCGTTTTGACGCCAGCTCTCCCATCCGGGCAGCGTCATAGTTGGTATGCATGGCATAATAGGATATGTCTCCCTGAAGCAGTTTTAGAACCCGGTTCCCGATAAAATCCTGGTCCGTCACCCTTTTTAAGCCCTGAAAGATCAGCGGATGATGGGTAAGCAGAAGATCTGCCCTCTCCCGCAGCGCTTCCTCGATCACCTGATCTGTAGCATCCAGCCCTATGTAAATACATTTGATCTCTTTTTCCATTCTTCCAGCCAAAAGCCCTACGTTATCCCATTCCATAGCGTAAGACTTGGGGTAATCCTTTTCAATCCGCTCTATAATATCCCGGCAGATCATATGCCACCTCCTGTTTATCCTGATTTCCCTTTTTATCAGTTATCCCCTTTGGACTCGGCTCTCTCCATCCGGTCCAGAGCCTTTTGGGTCAATTCTCTCGCTTCGCATATCTCTTTCGCCCGCTCTTGGGCGCTCACACTTTCTTTCCTTCTGGCAAGCTGTTCCAGGATACTTTCCTGGATCCGCAGCTCCCTGAGAAGAAACTCCCCAAGGATCGGATGCCGCTCTTCCAGCAGCCGCTTTCCATAGAGATATTCCCAGGCTTCATACGGTTCCGGGGCCCCGTGGACCACCCGCATGACCGGATAATATTTTCCATCTTCAAACACCATATCTTCCCGCTGGATCCGGTATCCTTGTTCCACAAGGAACCTGCGCACTCTGGCGATCTCCGACTGGGGCTGGAGGATCAGGGCTTCCAGTTTGTCCGTCACCGCTTCTCCGTCTTTCAGGATCCGGATCGTCAAGGGACCGCCCATTCCGGAAACGATGATCGTATCCACCTCGCCCGGCCGGATCTCTCTCAGTCCATCGGAAAGCCGGGTCTCGATCTTATCTCCAAGGCCATGGCCCACAATATGCATCCTGGCCCGTTCCAGAGGGCCTCTATTTACATCCAGGGCAAGCGCCTTCCTGGCAATCCCTTCTTCCACAAGATAGATGGGAACATAGCCGTGATCCGTTCCGATATCCGCGACAGAGGCGCCCTCTGTCACAAGACCTGCCACCGCGTACAGTCTCTTTGATAGTTCCATAGGCGCCTCTTCCTTAATCCAGATAATCTCTCAGTTTTCTGCTGCGGCTGGGATGACGGAGTTTGCGCAGCGCTTTCGCCTCAATCTGACGGATCCTCTCTCTGGTAACGTCAAACTCTTTCCCAACTTCTTCCAGCGTCCTGGCTCGTCCATCATTCATGCCAAACCTGAGCCTTAACACCTTCTGCTCCCGCTCCGTCAGCGTATCCAGCACTTCATCCAACTGTTCTTTCAAAAGCGTCTGGGCCGCCGCCTCCGCTGGAACCGGCACATTGTCATCCTGGATAAAATCTCCCAGATGACTGTCTTCTTCTTCTCCGATCGGCGTCTCAAGCGACACCGGCTCCTGGGAGATCTTCAAGATCTCTCTTACCCGTTCCACAGGCATGTCCAGTTCTTCCGCGATCTCTTCCGGCGTGGGTTCTCTCCCCAGCTCCTGCAGAAGCTGTCTGGATACCCGGATCAGCTTATTGATGGTCTCCACCATGTGGACCGGAATCCGGATGGTTCTGGCCTGATCCGCGATCGCTCTCGTAATGGCCTGGCGGATCCACCAGGTAGCATAAGTACTGAATTTAAATCCTTTATGATAATCAAATTTCTCTACCGCTTTGATCAGGCCCAGATTTCCTTCCTGGATCAGATCCAGGAACAACATTCCACGGCCCACGTAACGTTTCGCGATGCTCACCACCAGACGCAGATTAGCCTCCGCCAGCCTTTTTTTCGCCCACTCGTCTCCGTCTGCCATTCGTTTGGCCAGCTCTACTTCTTCGTCCGCTGTCAAAAGCGGGACTTTTCCAATCTCTTTCAGATACATACGGACCGGATCTTCAATGCTGATCCCATCCGGCACAGAAAGATCGATCTTCTCCATATCTACTTCATCTTCATCGGTGATCACGATATCCGCATCGTCAATATCATCATCATCGTCATTTCCAATGCGAAGCACGTCGATATTATTCGCTTCCAGATAGTCGTATACCCGCTCCATCTGTTCCGCGTCAAGCTCCATGTCTGAGAAGAAATCATTGATTTCCTGCAGTTCCAGGATGCTTTTCTTTTTCTTTCCCAGGGCTACCAGTTCTTTTAATTTCTCCTCAAATTTTGCCATGCTTTCTTCCATGTAATTTCCATCCTCTCATTGCTTGATTATATTTTATCCCTAATTAATAGAAATATGCAGTTTCTCAAGGCCCTGCAGCTGCCTTTTCGCCTCCATCAGCTTCTGCAGTCCCTGAATGTCTGTAGGCTCCAGATGAGCGGCCTTCTCATCGATGCTGTGGTTTTTCACACGGATGATCGTCTCCTTTAGCGCTTTCTCCTGCTCATCCGCGGTAGTCAGCTCCCGGATCCTGGTATGGAACAGCGAAGCCGCCTCCCGGTGCTCTTCTTCCTCGGTAAAATAATTCATGATCTGGGCCGGGTTGACTTTCTGTTCTTCATATTGATCATACAGTAAATGGGCCACCTTCCGGTACAGACCCTCAGTAAAATCTTCCGGTGTAATATAAGCGCTGATCTGCCGGAAGATCTTCTCATCCTCGATCATCCAGGTAAGCAGGATTTTCTGGGACTTTAAGATCCCGTCTTCTTTCTCCGGCTTTCCCGCCTTCCTTGGCCGCGAGACAGGCTTCGCAAGCCCGGTCTGGACTGCTGTATGGACCACGAGCTGCTTCAATTCTTCGTATCCCACGCGGTACTTTTGGGCAACCGCTTCTATGTAATTATTCCGCTCGATCTCTTCCCGGAATTCATTCAGCCGCCGGGCGGTCTCCTTCATGAAATCCGTCTTGCCCTCCGGGGAATTCAGATCATAGTCCCGTTCCAGCACTTCCAGGCCAAACAAGAAACCATTCCTGGCTTCCCGGATCCTCTCCTCAAAAGCCTCTTTGCCCAGATTCTTGATAAACTCATCCGGGTCCTTATACGGTTCCATACGGATGACCTTCGCGCTGATCCCGGCTTCCCTTAAAATGGGTATCGCCCTCAGGGCCGCCTTCGTCCCTGCCTCATCGCTGTCGTAAGTTAAATATACTTCATTGACATAGCGCTTGATCAAAGAAGCGTGTCCCTGAGTCAGAGCCGTCCCAAGAGAGGCCACCGCGCTGGTAAATCCCGCCTGATGCAGCGAAATCACATCCATATATCCCTCGCATAGAAGGAAATATGGTTTTCTGGAGCTTCTGGCCCGGTTCAGGCCATACAGGTTCCGGCTCTTATCAAAGATCATCGTCTCCGGGGAATTGAGATATTTGGGTTTTCCGTCTCCCATGACCCGGCCTCCAAAGCCGATCACCCGGCTGTTGGCATCCATGATCGGGAACATGACACGGTTCCAAAACTTATCGTGGGCGCCGTGCTTCTCGTCCACACTGATCAGACCCGCTTTGGCGATCATATCGTCTTTATAGCCTTGATCTTTCAAATAGCGGTACAGATCGTCACTGTACTTATTGGCATAGCCCAGTCCAAACGCCCGTATGGTATCATCTCCCAACTGGCGTCCTTTCAGATATGCCAGAGCTTTCTCGCCCTGCCGTCCTTTCAATTGTATGTAGAAATACCGGGCGGCCGCTTTATTGATCTCCAATAGGACCGCTTTCGTATCTGCCCTCTGTCTGGCTTCTTTCGACAGATCCTGCTGGGGCAGTTCCACCCCGGCCCGGTCAGCCAGATACTTCAAAGCCTCCAAAAACGTATAGTTTTCGTACTCCATCAGAAATGTAAACACATTGCCTCCGGCTCCGCAGCCAAAACAGTAGTACATCTGCTTCTGCCTGCTTACAGAAAAGGAAGGCGACTTTTCATTGTGGAACGGACACAGTCCAAAATAAGAACTGCCTTTTTTCTGCAGCCGGACATAACTGGAGATTACGTCTACGATATCATTTTTTGTCCTAACTTCTTCAATCAGTTCATCGGAGTAATACATATCCCGCTTCCTTATCTTTAAAAATAGCCTTCATATATAATATTCGACAAAACGATCCGATTTCCTTTAAATTTTCCAAGATTCTGGAATAAAATATTCTTCAAACTTTTTCACCGCGTATGTATCCGTCATCCCCGCGATATAATCACACACGATCTGCTCTTTTTTCGCGCCCTTTTCTTCCATCATCTGAAGGAACTGCTCTGGAAGCAGTTCCAAATGTTCTATATAAAACTGATACAGATTGGTGATCATATTGATGGCTTTCTTTTCTTCTCCCTTTGCCGTAGGATTCTTATACACATTCTCAAACAGGAATCCCCGAAGTCCCATGGTAGCCTCATAAACTTCCGGTGACATCCGGATCTGAGGCTGATCCATACTGTTTATGATCACATTGTGGATCATCGTGTCCAGGCGGATCTTGGTAGACGTTCCAAGGACATCCGTATATTTGCGCGGAATATCCTCTTCTTCCAAGATCCCGCCGCGGATAGCGTCGTCAATATCATGGTTGATATAAGCAATCTTATCTGACAGCCGGACGATCTGGCCTTCCAGAGTATGAGGCGTGCCGGAGGATTTGTGGTTTTGGATGCCGTCCAGCACCTCCCAGGTCAGATTTAGTCCTTCCCCCTGCTTTTCCAGGCACTCCACCACCCGCACACTCTGCGTATTATGCTGGAATCCCGCCGGATTCAGACGGTTCAGGGCCGTCTCTCCCGCATGTCCAAAAGGCGTATGCCCCAGATCATGGCCTAAGGCCACCGCTTCCACCAGATCTTCATTTAAGCGCAGCGCTTTCGCGATGGTCCTGGCATTCTGGGACACCTCCAGCGTGTGGGTCAGTCTGGTACGGTAATGATCTCCCTTTGGCAGCAGAAAGACCTGCGTTTTTTGCTTCAAGCGCCGGAATGCTTTGCTGTGCAGGATCCGGTCCCGGTCCCTTTGGAAAACCGGTCGGATATCACACTCCTTCTCTGAACGCTTTCTGCCTTTTGACTTGCTGCTTAAGGCCGCGTAAGGACTTAAAAACTCCAGTTCTCGTTTCTCAAGCTGTTCTCTGATCGTCATGCCGCGCCCTCCTCTATCTCTTTTCTTCCTGCCGGATTCTTTCTTCAATCGCCTTTACCGCCGTCTCTAAGACTTTGATCCTGGCGTAATATTTGCAGTTTCCTTCTACAACGATCCATGGCGCTTCCGGGGTGGAGGTCCGAAGCAGCATCTCATTTACCGCTTCTTCGTACTGATCCCATTTCGCACGGTTGCGCCAGTCTTCTTCCGTGATCTTCCACTGCTTTTGGGGATCCTCTTCTCTGGCTTTAAATCTCCGTTCCTGCTCCTCTTTATCTATCTGCATCCAGAACTTCAGCACAATGGCGCCCCCATCTGTCAGATCCTTCTCCATATCATTGATTTCTTTATAAGCCCTCTGCCACTCCTGCCGGGTGCAAAAGCCTTCCACCCGCTCTACCAGGACCCGGCCGTACCAAGTCCTGTCAAATATAGTAATGTGCCCTGCCTTTGGCATATCTGTCCAGAACCGCCACAGATAATGATGCGCCTTCTCAATATCATTGGGGGAGGCGGTGGGATGGACCACATAGCCTCTGGGATCCATTCTCTGGGTCAGCCGCTTGATCGCTCCTCCTTTTCCTCCCGCATCCCATCCTTCGAATCCCAGCACTACCGGAATCCTTCTGCGGTACAGTTCTCCGTGGAGCTTCTCGATCTTAGCCTGCAGTTTTTCCAGCCGTTCTTTATACTCTTCCTTAGAATAGGAAAGGGTTAGGTCCGCTTTCGAGAGGATCGTCTCCCCCAGTCCGCGTTCTGATCCCTGCTCTTCTGCCGGTTCCGGCTCTGCTATAACCCTTCTTGCCTTCTTTTCTTTCTCAATTTTCTCCGACAAGATCCGCTCCACGATCGTATAGATCTTAACCGCGGCAAACCGCCGGTCTGACGCCTCTACAATATTCCAGGGAGCGTACTCGGTATCCGTCCGGGCCAGCATCTCTTCATTCATAGCCTCATATCTTGGAAACTCCTTATTGCGCTTCAAGTCTCCTTTCCCCACCCGCCACGCGGTCTCTTTGGATTCCAGAAGCTTAGCAAACCGTTTCTTTTGTTCTTTCTGGTCGATAGCCAGAAAGAGCTTGATGATCACCATTCCATCTGCCGTCAGTTGTTCTTCGAAAGACAGGATCGACCGATAGGCGTTCTCTACTTCCTTTTTGCGGATCTTTTTATCAAACCGGTCGATCAGGACTTTGCGGTACCAGCTGCTGTCATAAATGGCGATCCGGCCTTTCGGCGGCATTTTCATCCAAAATCTCCAAAGGAACGGATGCATCCGCTCTTCCTTCGTCTCCTTTTTCACCGCATGGACTTCAAATCCTCTGGGATCTAAAGCCCTTATCAGCTCTGCGATCTGAACGCCTTTCCCCGCCGCGTCATATCCTTCAAAAGCGATCATCACCGGAATCCCCAGTTCCCTGCACTCCCGCTGCAGCTTTCCAAGCTTTGGTTCCAGTTCCATCATCTTCTCTTTATATTCTGATTTACTGACCGTCTTTGTCAGATCCAGTTTCTCCAGCATGAACATCTCTCCTTTCCCGGATAAAACACTCTATAAACAGGCGTTCTCTCCCTGATTCTTAGGGTTTCTGATTCTTATCCTCATTATACACAAAATACTGTTCCAGAAACACAACTTTCGCCAGATTTTGCGAAAAAGACCGGCGGAACTTTCGTCCCGCCGGTCTTATATGTATTTTCCCTTATTTCTGTTTTGCCTTAATGTAGTCTGCCACCTTTTTTACATCCAACGTGTTTTCTTTCATGCCGATCTTCGTAGCGATCCGAACGAGGGCCGGCTGCTTCAGCATCGCCTGTTCCATAATGTCTTTCTGAAAGAAGACATCTTCCGTTCTTCCGTCCGCCAATACCTTTTTCTTGCACATAACGATAACTCTTTCATAGTTATCGGCCACAAATTCCATGTCGTGGGTTATGGTCACGATCGCCTTCCCGCGCCCTGTCAGGATCTTGTTCAGCTCAGAAAGCCGATCCAAGCCGTCCAGATCCTGTCCCGCGGTAGGCTCATCAAAGATCATCACATCACAGTTGCTGGCAATGACAGAAGCGATGGTCACGAATTTCCGGATGGAAAGAGGCAGATCATAAGGATTCGACTCCAGATACTTGTCCATGCCAGTCAAGGCCGCCGCATCTTTGATCCTTCGCTCACTCTCTTCCATATCCACATTCATTTTTTTAAGCCCGTACTCGATCTCGCTGTAAACGGTGGAATTAAAGATCTGATCATCCGGATTCTGGAATACATAACCGACCCGTCTCGCCATTTGAGCCGTAGTATAATTTTTTGTCGAGTCTCCGTCGATCAGCACATCCCCGTCGCAGGGTTTCGTCAGACCATTCAGCATCTTGACCGTGGTGGTCTTTCCTGCTCCGTTCTGCCCTACGATCGCGATATTCTCTCCCGCTTCAATGGAAAAACTCACATCATCGACCGCCAGATAGCCATTTGGATATTTGTATGAAATGTTTTTTACTTCTACTAATGCCATCTTCTCTACGCCCTTTCTCTTATTCTTCCATTGCTGTCCGAATCATCCCGACCGTCTCTTCAACCGTAACGGGAATCTCTGAAAGATTTACGCCCTCTTTCTTAAGTTCCAGCGCGATCCGGGTGCTCTGAGGCAGCCTTGTATGGTACTTCAGACAGTCCGGATCAGAAAACACCTCTTTGGGCGTTCCTTCCATCACAACTTGTCCGCCATCCAGCACAATGATCTGATCCGCATATTCCGCGATCTGTTCCATCTTGTGCTCCACCAGGACGATCGTTTTTCCCATACTCTTCATCAATTTGATGATCTCAAATACATTATCGGTACTCTGCGGATCCAGCTGCGAGGTAGGTTCATCGATCACCAGCACATCCTGGTCCATGACCAGAATCGCCGCCAATGCTACTCTTTGCTGCTGACCGCCGGATAACTGATAGGGGTCTCTGTCCCGGAATTCTTGAATCTTAGTCAATTCCATGATCCGCTCGATCCTCTGACGGATCTCTTCCGGCTCCACTCCCATGTTTTCCAAGCCATACGCCAACTCTTCATATACCGTGCTGGCGATCCCGCTGATCTGGGTAAAGGGGTTCTGGAAAACGAATCCGATCTCCAGCGCATTTTTCCCATCCGGATCTTCTTTTACATCTCTTCCGTTTACCAAAACCTCACCGGAGATCTCCCCTTTGTAAAACTTTGGCACAAATCCGCGAATGGCGTTGCAAAGCGTAGTCTTCCCGCTTCCGTTCGCGCCCACGATGGCGCACAGCTCGCCTTTTTTCACGGTAAAGGAAACATCTTTCAGCACATCTGCCTCGGAGGTCGGATACCGGTATGTTAAATTCTTTATCTGAATACTATCCATAAAACAATCTTCCCTCCAATCGCCAATACTACCGCCGCGATCATCAGTCCGCGGATAATCCGGTCTGTCTGTGTATCTTCTACTACCTTCAACGTGGTCTTCTTACAGGGTACGGAAAAGGCTCTCGCCTCCAGTGTGATCGCCCTCTCTTCCGCGCTGACCAGTGAATTCAGGATCAGCGGGCCTACTGATGGGAAGAAGGCTTTCGCCCTGACGATAATGTTGCTGTCCGTCTCAATTCCTCTGGACTTCTGCGCCTCCAGGATCGCGTTCATCTTCTTAGACATCTGCGGGATGATATTGGTAGTGGAAACCAGCACATAGGTTGCGGAAGGGGACATTCCCCTTTTTTCCAACATCACCATCAGCTTCTTGATATCGATCAGCTTGCCTCCCAGAAGGATCGCGGAACCGATCCCCATGATCCGAGAGCACAGTACGATGGCCTTCTGCACGCTTTCCATCTTGATCTGCAGCACCCAGAACTCCCAGAGCACCTCTTCGCCCGGAATAAATAAAGACTGGAGCACGAAGCAGATCACACAGATCAGCACGATGGATTTCAGCCAAAGTTTGATAAAAGACACTCCCTCGCCCGCACTTGCCGCGATCACAATCATGACTACGACTGTGGCCAGGGTCACGCGATAGTCAAAGATGGCAGAAACCACGGCCAGCGCAATGACCAGATATAGGATCGTCAATGGATTCAGGGCCTTGACCCCTTTCTGCCTTTCACTCATTTTCATTTCTCCTCTGTCGTTTCTCTTGATTTCTGATTATTTTCCTGCTTCTTTTTTCTCTTTTCTGGCATTGATGAATACAGAGCCGTTAGAGAATTTGTACAGATACCTGTCGGACATTCCCCGGATTACCAGGTATGGCACCAAAAGGGAGATAAACTTGTCCGGAACCTCGGAGATAAAGTTTCCGATCACCTGAGCCGGCCAGAACGGAATTCCCGCTGACATCATAGCGCCGATCATGATCGAGTTTCCGCTGGAATCAAACCCTCCAAAGAACGCATAGGAAATGATCACACTCATGATCACTGCCAAAACTGCTACCAGGATCGCGGAGATCACGAATTTTCCAAAACTTGTGAACATTTTCTTTCTTGCAAGGATACCAGCCAGGACTGCTGTCGCCGCGGCAGAAAGAGCAAACGGGAACAGAGATGGGTCCGCGATAGAGTTGATCGCGATACTTAAGAGTCCGGTCAATCCTCCTACCCACGGCCCGGCCAGGATTGCTACCAGGAAGGTTCCAATGCTGTCCAGATAAAGAGGCAGCTTCAAAGTCAGTACCAGGTTCCCTACAATAAAGTTGATAGCGATCGCGATCGGGATGATCAAGATCGTGATCAGGTTAAAGTCTGCTTTTAATCCTTTTTTTTCGTTGCTCATGTTTTTCATTCCTCTCTTTCGTTATACTTGGTTTCTCATTTCCTTCTATATCAGTTCCCTTGCCACATCCATATGCTCTGGGAACAGATAACCAAAAAAGATCTCAAAAAATCGCTTTGTGTCAGCTTTTACCGCCACTTCGCAGTTTTTCTTCTTGTCCGGATAATATTTGGTCCGATAGACTACAGAAGCGCCGTCACACAGCCCGCTGGTCTCTACATCTACAAAGGCATCTACCTTTTCTACCACATTCCGGTCCAAGAGATAAGCGATGGTCAAAACGTCACACAGCTCGCAGCCCAGTTTCTTCTCGATCTCCCAGTGTCCGTCTGCGTAAACCCGGGTGATCTTATGGATAAATCTGGATACCGGCGTATTGATAAGATACAAAAGTTCTCTCAGGGTCGGGCTTAAGAACACATAGTTAGTGGCATCCAGCCCTACCATAGTCACGCGCTCGAATCCCGCCTGAAAGACGATCTTCGCCGCTTCAGGATCCGTCCAGAAGTTAAATTCCGCATGAGGCGACATATTGCCGCCATGCTCAGCGCCTCCCATGATCACTACCTCTTTGACGTTCTTAGCAAATTCCGGATCCTTCTGCACCGCTTGAGCAATGTTTGTAAGAGGACCAATGGGAACCAGTGTGATCTCGCCTTTTTCCTCCCGCACTTTCCGGATCAGAAAATCTACAGCGTGTTCTTCTTTTACAGTTCCCGGCACTTCTTTAAAGCCTAAATTCCCAAGCCCGTCATCGCCATGGAATTCCTCACAGTTATCGTTCTCCCGCATCAGAGGCTTTCCATTTCCTTTGTAAACTGGAATATCGCTTCTCCCGGCTACGTCCAGGATACGGAACACATTTTTCGTCACCTGGTCTACGCCTACATTTCCGTTGACCGTGGTGATCGCCTCAACCTCCAGTTCGCTTTTAGCTGACAAAGCCAGTAAAAGGGCCAGCGCGTCGTCAATTCCCGGGTCTGTGTCAATAATCAGTTTCTGCATGTTCTTCCTCCTTGTCTCAGCAAGACTCCACTTCCTCCATAGTCGGAATCGAAGTCTGCGCTCCTTTTCTTGTCACCGCTATAGCGGAAGCCTTCTGTCCAAAAGCAATAGCTTCCTCGCAAGTCTTCCCTTGGCTCAGGGCCAGGGCAAATCCCGCGGTAAAACAGTCGCCGGCCGCCGTAGTGTCCACCGGTTTTACCTTGTGCGCCGGAAAGAATCTAGCTTCCTTTTTGTTTACCAAAAGGCATCCTTCGCCGCCCATGCTCACCAGAACATTCTTAACGCCTTTGTCCAGCATCTGTCCCGCGCCTTCTTTAAGATCTTCCAGGCCATTCCTTTCCTGTCCAGTCAGGATCGCAAGTTCTGTCTCATTCGGTTTGATGAAGTCGATCCCTTTCCAGAAATGATCCGGGATCCCTTCTACCGCCGGCGCCGGATCCACGATCACCAGCTTCCCAAGCTCTACGGCCAGTTCCTTGACATATTCCACCACCTCCAGCGGAATTTCCAGCTGCATGATCACGATATCGCTCTCTTCGATCAGATGGCGGTTTTCCTCGATCATTTCCTTTGATACAAGGCCATTGGTACCCGCAATGATGATAATGGAATTTTCTCCCTGATCATCCACGGTGATAAACGCTTGTCCTGTGCTCTCTCCTTCCAGAACCGCAAGACCTTCCGTGCGGACTCCTACACATTCCAGATTTTCCTTGAGCGCGGTTCCGCAGCTATCTGTCCCCACAGCTCCGATCATCTGGACATCGCCGCCCAGCTTAGCGATAGCATATGCCTGATTCGCGCCTTTTCCTCCAGGCACCTGGGCCACGCTTCTTCCCGCTATCGTCTCTCCAGCCTTTGGCATTGCCGGAGTCTCCACGACACAATCCATGTTCAGGCTTCCTACAACTACGATTTTCTTCATTCCATGTTCCCTCCTCTCTTCTTCTGATTTTCTATCCTAAATTTGATATCTTGAGTAAATCTCATCTTCATAGATCTCCACCGCCTTCCGATCCGGGAGCGCCGGCTGCACCCCATATCTGGTCACGCTGATTCCCGATGCGTACACCGCAAAACCAATTGCGTGGATCAAACTCCTTCCCTCGCTGAGATAAACGGCCAGGGCGCTGATAAAGGAATCCGCCCCTCCTGTCGTATCCACCGCCTCAAATCCGGTTCCATCAAAATACATGGAGTAATCTTGATTCTTCAGGTAACATCCTTTTTCTCCCAGGGTTACGATTACATTTTCAATTCCTTTCTCAAACAGAAACTGGGCTTTTTCTTCCAGCGACATCCGTCCCGGCACAAACCTGTGCAGCTCATTTTCATTTGGCACAAAATAAGCGATGTCTTTCAGCAGTTCCTCTTTGATCTTGTCGGTGGCAGAAGGTTTTAAAATCACCTTCGTATCATTCCTGCGGCAAAACTTAATCGTATATTCCGCGATCACTTCCGGGATCTCCAGAGACAGCAGACAGTATTTGGCGCTTTGAAACAGATATCTGCACCGGTTGATCTGTTCTATACTTAAACTCCGGTTCGCTCCCTGATATACCACGATGGTACTCTCTCCATTTTGGTCTACGTTGATATAAGCTTTCCCGCTTGGCACAGAGGCGTTCACCTGGACGCCCTCCATATGGACATGGTTTTCTGTCAGGCTACTGTAAAGCTGTTTGCCATCCAAGTCGTTTCCCACACAGCCGATCATGTAAACCCGGCCGCCCAGCTTTCCAGCTCCCACGGCCTGATTGCCTCCTTTTCCTCCAGGAAACGTGTAGACCTTTTCCGCGATCTGCGTCTCCCCTTTCAGCGGAATCCTTGAGACTTCGATGGTAATATCCATATTCATGCTTCCCACCACTACTATCCGCTCACCCTGTTTCTCTCTGGGAGATCCCATGATGCTGTTTCTTTCCACAATAGAAGGGGAGAATTTCCGCATGACTTCCACTTGCTTTTTCCCCTGGATCATCTCTACCAGATAATCCACCGCTTCCCGGCTCATTTCTTCGGCAGGAAGGCGAACGGCTGTAATCCCGTCTGCCAGGATCTCCATCCATCTTCCGTCTCCGACGGCGATGATAGACAAAGAGTCCGGCATAGCGATCTGCAGTCTCTCTACCGCTTTATAGAAGCAGCCGGCAATCTCCTGGGAGCCGCAGATTACTGCTGTCACATTTTCTTCCAGGCACTGCTGGATTCCATAATCCTCTATTTCTTCCAAAGATCGGCCCTCGTAAACCCACAAAGGCTGAATTGCCAGGTGCGCTTCCCTCATGGCGATCTGATAACCGTCCTGGATCGTCCGTTCTCCTGTCATGGTGACACAGGCGATCCGTTCATGGCCTTCCTGCATCAAACGCTGCGCCGCCAGTCTGCCCGCCTCTGAAAGCCGATAATAGAAAGTAGCCTTCTGTCGGTCATCAAACTCCTTTGTCTGATTTAAATAAACGGTAACATCCTCCAGTTTCCGGGTGTTGAGCAGTTTCTCAGAATCTACAAGAAGCCCCGCGACCCCCTTTTTTATCATATCGTCAATACATTCCTGTATTTCCTCCAAGTTATCCGCGAAGCGGACCAAAAGTCCATACCCCTTCTCCGCCGCTTCTTTCTGAGCGCTTCTAACGATCTGGGCGCCTTCTCGGTTATACTTCTTCATCACCAGGCCGATCAAGTGGCTTTTAAGCCCTTCCTTTTCCCGGAACTTCAAATAAGGAACATACTGCTCCTGCTCGATCACTTCCAGAACCTTCTGCTTTGTCTTGTCACTAATATCTTTATCTTTTCCGTTCATGACCTTGGATACAGTGGAGGCTGATACACCGGCAAGTTTTGCAATGTCGTGTATGTTCATCTACTCATTCCCTTTACTTATCTTCTTCCGATTCGTTTTCCTGTTTCAATTTTTCTAAATCTTAGCATTTTGTGCATTTTATTTCAATATTTCAACGAAACGAAACTATTTTCCGATACATCCTACAATTTTATTGTTTTACTAAACTTGTTTCGTAATTTTATTTCTTACATAAATACCGAAGAAAAAGCCTTCCAAAAAGGGAGGCCAATCGCACAGCAAACGAAATCATCAAAACCGTCACGCTATTTTGCGGGAATCTGCCAGAAAATCGGCGGAGGGAGAAAATAAAAAAACCCTTGAAAATGGAATCCGACTCCTTTTCAAGGGAAATAAAAAATGCGGAAGATGGGACTGCTTTTATTATCATCTCTAGCCGGAAAATCCCTATTTTCCGGCTTTTCTGCATTTTTCTTTTGATTAATATATCGTATTATTTTATATTGTTTGCCCCTTTTCTGCCCCTCATTTTTGTGATATTATTTGGATGTGGGGAGCGGTGGCAAGCCCGCCCTCCCTGTGTTGTCTTATTTTTGGTCTTTCGCTGTTGCTATCTTATCCAGCAGTTCTACAATTTCGTCTGTGCTATATTCTTTTTTATCTCCCTGTGTAAAAATCAATCTAAGTTCATACAGCGTAGACATCTTAACTGTCTGCATTTCCTTTTCTGTCATTTCTCTTACTACCTCCATGTTAGTTCTCCTTTCTGTTAAAGCCTTGCCATCTCTAACTGTCTTTATTATATACTAACGCTAGTATATTGTCAATAGGTTTCTAGGCTTTTTTCTATTAATTTTACAACATATTCATTTAAGGACATCCCTTGATCTTCAGCCTTCTTTTTTATTATATCCTTCTTCCCTTTTCCTATTGTGATATATAACCTATCATAGTTATTCTCGTTATACTTTCTGCTTGCCCGTTTTTGAGCCTCTGAATATGCCATCCCAAGCACCTCCTATCTATGTTTATATTAACACTATAAAAATACTAGCGCAAGTATATTTTTCTGAAAAATCTATTGACAATATACTTGCGTTAGTATATAATAAAATCATCAAAGGAACGGAGGAAACAAAAATGTTAAAAGGATCAGAAAAACAAATCGCATGGGCTGAAGATATTATCAAGGAAGCAAGAGAAACCGTCAAAGACAATATCGATTTTATCAAAAAACTCCAGGAAGAGCATGGCTTAAAAGTTCGCCAGGACGAGCTGGAAGCATATGAATTGTGCGGAAAACAAATGGAAGAAATGTTAGCAAATATCGACAGCGCTTCCAAGATTATTGATATGAGAGATCAGTTATCTAGCGCAAACATTAATAAAATGGTATCTGAATACTGCATGAGAAACAAAAATAGGAAATAATCCCCCCGTCCCGGAGGTTACGAGGGCAGAAAGGAAAAGAATATGAAGATTAGCAGGAAAGATCTTGCAAGACAGGTAAGGAAGGAATATAAAGAAGGCCGTGAGTGGTCTGACTTGAGGCGAGATCATTGTTACACAATGATGATCGATGTAAGTGATGGGTCAATCTGGGCGGACTGTCTGGACAGAAATCGGTGGAAAGAGTACGAATCCGATACGGTTGTAAGGCTAAACCTGTATGAAGCAGAATCTTATGAGCCTGTGGAGCGGGTGGAAGAATTGTATATCGAGTTAGCTATTACTAAGTTGAAAGAAGCTGGTCATGAGATAATATCCTAAATTTAAAATCTTCCATTTTTCACTTGACTATTGGGTACGAATATGTTATATTATAATCACAAACAGAAAGAAGCAGAAAATGGAATCAGCGAGAGGTTTACGGGAGGTAAAATTATGACACTGAGAAATTGGATGAATGATCACTTTTTTGACGATACGTATCCCTATCAAATCATCGAAAATAACCAAAAATTAAATATCGGCTGGGATGAATATATAAATTACAAAGTCTTAAAAGTCGAAACTATCGACCGAGAAGTTGACAAATTAAAACTCATATATGTAGAAAGGATTTAATATGGAGTCCAGAAAAGCAAAAATTATTTTTACACGCCCAGGCGGGACGGCCGGAAAAAACTCTCAATCAAGCAGGATCACCCTCCCAATAACTTGGGTAAGGGAAATGGGGATTTCGGAAGATAACAGGGAAGTGCAGGTATGTTTTGACGGAAAAACGATTATGATTAAAAAGGAGACAAATTAAATGGGGATCGACAAAGAAATATTGGACGCTTATAAGTTAAAAAAGTCTATCAAAGCTACCGTAGCCCATACAGGATATAGCTGGAATAGAGTTGTAAAATCGCTATCTAGCAACGGGATCATCCTGAATGACACGCACCAAAAAATTATTGATCTCTATAATCACAAAACACCTGTTTGCGATATAGCCTCTCAACTCAAAATATCAACAAAAACCGTGGAATCCTACATCCCTCGCACGCGTCCGGTATACAACGAAAACCCATCACCAAACGCAATGAGAATCAGAAAATGTAGAGATGGTAAAAAATCCGATAGAAAAGACGCCCCGGAGCCGTAACTCCGGGGTTATTTTTTACTGCTTTTTAAGGTACTCGGCAGAGGCAAATCCTGTTACTCCGTCATACACCACGTACAGCCACTTAACACTGCCGACCTTTGTGTAGTATCCATAACACTGTACCTTTGCCCCTTTAGGCATACTGGTCAGTCTCTTTTTGCTTGTCCCAGCTCCAGATCTCAGTGTTAGCGGATCTGTCTTAGTATTTACTGTGTATGTCCCGGCCAGAGACTTGTCAAAACTTTTTGCCGCTTCTGCCTTTCTGCCTGTGCTCCCGCCGGCCCCCTCCAACTTTGCCTTGCTGAGCGGCCCGTACTCGCCGTCTACATCCAGCCCGTGATCTGCCTGGAAGCTTTTAAGCGCCGCCTCGGTATTGGCGCCAAAATCTCCGTCTGCCCCATCCGGTCCGCAGCTATATCCCTTTGCGATAAGTCCATCCTGCATGTCTTTGACCGCCGCCCCGGTGTCTCCCCTCCGTAGGATTCCATCGCCAGACGATGATCCGTCGCCGGAAGACGCCCCTGCTTTAGCAAGCGTATCGCTGGCCTTTGAGCCGTTGGACAGCCCACAGACCGTATGAGAGCCAGCTTTGACGTAGATTGCGCCCCGCACGCAATAGGCGGAACTGGTTAGGTAAGTGCTTGCTGTGATGATCTTGTACCCGGCCGCCTTGAGCGCCGCCAGCATACTGGAGGTTGTCCAGCCGTTGCTACCATAAGACACGCCAGGCGCGCCGGAGGCCACCGCCGCCACGTTCTGGAGGCTGGAGCAGTCGCAATTGCACTTAGTCTTTATCTTAGACAGGACATACCCAACTGCCTTGGCCTGCGTGTTTAATGTGTTCCGGTCATGCCATCCATAGCCGATATTGTTATTGGCGCAAGCCTGCTCTATAGCCCTGGCGTGTCGCTCCCGGACGTTTGCGTCTGGATGAATGGCCATGTAGCCCCAGGGCTTGCTATACCAGTTCCGGGTGCAGACCTCTTTCCCGGTGCTGTCTCCCTTGGCTCCCTCGGTGGTTCCTTTTTCCGATATACTTGCGTGTCCGATTAATATTGCCATGATATTCTCCTTTCTCCGGCTTGCGCCGGCAAAACAAAAAAGGGGCAGTCTCCCGCCCCTTGCTAAGTATTATTTATCTACATTGCTATCCCCAACGCCAGGCGTGGATGGGTCTACCACGATCCCTGCAATCACCAGGACCGAAAAGATCGCATTAACTACGTCAAGCAGCTTATTTCCCAGGTCGCCCAGATCTAGAGTGTATCCAAACACTGCGGCCACCGTCTGGATCACCAGCAACACCGCCGGGATGATGGCCAGCCAAAACATCTTGTTTTTAAATCTTTCTTTCCAATCAAACATAACATTACCTCCTCCTTACAAGCCTATCATTGTAAATACTGCTCCAGCCACCGCCCCGATGGCCAGCGTTAAAATGTACCATTTTACTTTTCTCCAGGTTTCGCCATCCCGGTTTTCCAGAGCTTCCAAACGCTCCTGGTGTTCCTTTTGTTCTTTAGTCATTGTCTCGATTGACGTCGCCAGGCGCTCTACCGATAGGGTCAGATCATAGATCTTATTGACCATTTTTTCACTGTCCGAGATTCGATAATGGATTCTTTTATGCTCGTCCTGCATACGGCGGACAAACTCTTCATGTTCTACCCTGGTCACATATTCATCCATTGGTCTATTCCTCCTGTACTTCTATCCAGCCATATACCCCTGGCTCCCAGACATTATTATCCACAGTGGACTGCCAGGTCTTTCCGTTATGTATCACACGGTCGCCTGTCATATATGGGTTTGTACTCTCCGGCTGTTCCCACGGCAAAACTTCTCCGCTTGGATCCGTCAGCACCTTAGTCCACAGGCTCGGCGCGGCCTCCGGTGTCCAGTCTGCCTGTGATGTATGCTCTGTCAGGCACTTGTACAATATGCCATTGTACAAGACCCTGTCTCCGACCTTATAAGGATATCCGATCACCCAGGCCGGGAAGAAGCTAGGTACTTCCAATGCCTGCGTATCAGTCAATGTCGTAGTCTGCATTTCCAGCAGCCGCCGGAACTGTTTCGCCTGTTCCTTCGTTATCATTCTCTCCACCTCCTTCTCCCGTCACGATAATCTGCAATGCTTGCTCTACGTTTGTCACACGGCTGTCCATCCCATCCAGGCGTTCTTCCGGCGTTTCTCCCACTCGGTACATGGCCACGCCCAGGATGGCGCCGGTGTATTCCTCAATCCGGTACAGTTCTGTGTACCGCTCATACGTGGCAATGATCTGTCCGCGCTCCTCGATCTGGATCTTCCTGCAGGCAAATTCGTCCGAAAACTTTGTACGCAGTTCTTCTCGTGTCGCTGACACGGTTTTGATCTGCAATAGATTTCCAACTATGGCCGCCGCCTGGATGGTTAATTCCGTGGCGTCGTTAAAAACAGCTTTCATAATGATCGCTCTCCTTTCTTTTTTATTATTAAAAATTTTTAGCACTAAATGACAAATTGCCGGAGGAAGGAAACGATTACGTTAAGCTTCCTAATGGAACATTAATCCAATGGGGACAGGCAAGTTTTCCAAGTTCTGGTTCCGGCGGAAATGGATATGCCGTGGAAGATTTTACAATCCCATTTGTCGACACGCCGATCGTAACAGCAACATCTATGTATGCAAGCAGTGTTATAACTTTTGATTTGTCTGTCCAGCCAAGAACGTACAATGTTACGATATACGCTCGGACAAACGATGGGAACCCTGTTACCGGCGCAAGCGCTTGCTGGACGGCTATGGGCCGATGGAAATAATCACTCCACCATTAATGTCAAATCCACATAAAACGTTTCTGGAGTGACAGAATCATGTGCAGAAAATGGTATAACAATCCGGTTATTATTGCTGTCGTACGCTCCTCCACCGCACAATGTTGTCGCGTGAAGTGCGTATGCAGATATACCTATTATTGTATATCCATCAGGGATATTTACATTGATAGCAGTGTATGAGTTTTGCCCATTTGATACATTAATTCTGTTCGAATATCTTTGGACAATAATCTTTTCGTCCAATTTGTCATTTAGTTCCTCAAGCGTGGGCGCCGTCGGGAGTAGCGTGGTGACGCCTGTAACGTTAATCCCGTCCAGCTCGACAGCAAAAACCGGTACCTGTACAGGGCTGTCTCCCTGCTGGATATCCCCGTCCGTATACGCTGGTTGCTGTGGATCCGTCTCTGCCGGGGTGCCCTGGATTACAACCCACTCTGCAGATTCTACGTCCTGCTCGGCGTCGTAGGTGTACTGCCAGCAGATCAGATCGATTCGCTTCATCCCCTGCGTGCCATTCTGGATCGTCACAGGGTCATAACTGTTAACCTTCACGGTGCTCAAAGCCCCCTGGATCATCATTGCTCCGTCCCGAATCCTGATTTCATTTGATGACTGCACCTGCGGCTCCAGCATTTGACCAGTCTGTAAGATATATCCCCCACTTCCCCAGGTCCCCTGGTGCTTCATTCTGCCCTGCTGGCTTGTGATATGCGGTGACCCTTTTCTGCCTGTTGCTAATTCCCTTGTTCATCATCTCCTTCTATGCTATATTCGATGGATTCTTTTCCATCTGTTATTTTGTAAATCTTTTCTGTTATCGGTTTTGCCGCATACAGTCCCGTTATATAGTCTCTTCCTCCGATAATATCTCCGATGTCCACGTTAATTCCGATAGAGGCGACGTCCATCTGGAAATCAGTGGAATTCATCAATTCTCTTAGTTTTTCTACTCCGTTTTCTCTCAGTTCTTCCGTTTCCGCAGACGTATTTTCGTATACTTCCGCAATTTCTTCTACGCCTTTGTAGTATTGCGTATTTCCGATCGTTCCATCTTCCTGCACATACAGATGGATCACTTGCCTTTCCTGCAGCTCTCCCTTTCCCAGACAGATCAAATGATTGACTCCGTTTCGCTTATTTTTAAACGTAAAATTTATTTCGCTGTCCTGCGACAGTTCGATTTCATTAGAATAGTCCACAATCGGAACTGCTGCGACCTCCACATATCCCGGCTGCCCCCTCTCTTGCTGTACGTATTTTATCTGTATCCTGTGCCCTACGGATTTCAGCATTTTTTCAATTCCATACAATAGCGTGCAGTATCTCTCGAATTGATAATCATTCACGTTTACTCCTGTATCCTCAGACGATACTACAAAATAATTCCCGAATTGCTCTTCGATCAGTCCCGACAGCACGGTATTCAGCTCTCCGCTAACTTTTTTGTAGTCTTGTCCAGATGGCGGCTGTATAACTTTTTTATCCAGACTCCCTCTCCAGGTTCTTCCGAGCATAGACACGGTATTTAATTCCGTGTTGGTGTTTACCTCTCCGATAATTCCTCCATACTCTGTCCCTGGTATATAGATTACATTTCCAAAATCCAGGTCGTCCTTCCATGCTTCATACGGTATTTTTATCTCATAATCGTTATCTCCTCCAATGTCTATATCTACATCCGCATCTGCGATCTGTCTGATGTCTCTGAGCGTCTTATCTGCAAGGATCAGTTCCATCTCGGCTCGCTCCTTTCTAGAAAAAGAGTAATGTCGAATCCGAACGCTCCACTCCATACAATATCAAGATTCCCAGCCGGTATTTTTTCGAATACACTTTGCTTTTTCGCTCTCAGGTCGTAAATATTTTGGATCGTTCCGTTTGTAAGGTATTTGAGTACCGTGTTCTCTCTGCTGTCGATTACCATGTACTCTCCTGCCTCCAGCGTGTCAAACACTTCATATGGATATCCATTTATCAGTATTCTCGGGTTTACACACGGCCCATAGATCGTCATTTTAAATTTGCTTTTTGCAAAATGGTCGATATACCAAGAGGCCGTTCCTGCTTTTTGCGCTGTGTAATCGTATTCGTAATCGTATGGATAGTCCAGCCCATCTTCCGGCTGTATTTCGTTTTGTTTAAAAAACTGTTTTGTTTCTTCCTTGATCCAAAACGAGTATGGGCACAAAAATTTCACGTTCTTTTCCGTCCACAACGCCCCTTCCGCCGGCGCCGTGCTGGATGATAAAATATAGCTTCTTATGTAGTGCTCTCCACAGATCAGTTTTCCAGGCGTTTCATTAAGGATATCCCTTTCCCCGGCCTCAAAAAAACGGTTCAGGTTCTCTTTTCTTTGGCTTATAGGCCCCCGGAACACAATAACTCCTTCATATTCCGCGGGGCCCTTCTTAAATCCAGTTATTTTCACGCCATATTTCTGCTCTATCCCGTCATATTCCCACGAATAATTATGGAAATTTATATTTTTTACCCTTGTCGGGTATGAGAGAAGGTCGATTTCTTCCCCAGCAGAATTAATATATTTTGCTACCCTTGAAATACAACCCCCATTCCTTTTAGCTGTCTCTTATATGCTCTCTCTCCGATATACAAGACTGTGCTGCTGTCGGATGCTCCGCTCCTTACAGAGTTGTAGAGCATATTATAGTCAATCGTCTGTGCCGGCGCATCTTTCATAATTTCATAGTTTATTGTTTCCGCCGCTGCTTGTGCTGCCTTTTGTATGGTCGGTATGTAATCGTAGATTCCTTTTGCCAGTCCCTTCATCATATCCGGCATCCACTCTTCATAAAAGTGTAGCGGTCCTTCGTCCGGCCTTGAAAAGTGCAAGAATGAGCTGATCTTGTCAGCCACGTCTTTTACAGCTCCGGTCACTTTCCCGATCGCCCCCTTAATTCCTCGCACAATCCCGTCGATGAAGTCGCTTCCCCACTTCACCGCTTTTGACGGGAGTGAGGTCAGGAAACTTATCGCGCTTTCGAACGCAGACTGTACGGCCGAGCCGATTTTCGAACCAATTCCGCGGATGACCGCAACGATATTATTGAACGTAGACGTTACCAAAGATCGAATCAACGAAAGCGCTGACTGTACGATGGCCTTTGCCGCGTTCCAGGCTCCTTTCCAGTCCCCCTTCAGGACCGAAGTCACCAACTTTATCAAATTTTGCACGATTTTCAATCCGCTGTTCACGACCGTGGTAATGGTTGAGATCGTATTTTGTACGACCTGCATGATCGTGTCGCCCCATTGCGTCCAAATTACGTTCACCAGCTCTACAAATGCCGCGAAAATCGCTTGTATTGTCGAAATCGCTGTTTGTATCGCAGCCTGAATCTGCGACCATGCGGACGATACGGATGCCCGAAAATTTTCGTTTGTCTTCCACAGTGTTGCAAAGATGGCGATTAGAGACGTCACCACTCCTATCACAATTAAAATCGGCGCGGCAACTCCCGAAATTGCTGCTCCTACTGCCGCAAATCCTGCCTTGATACTTCCCAACGCCGCCGTAAGTCCTCCAGATCCCAATAGCGTACTTAGCTTCCCTATTATCCCTAGTACTGGCGATAATGCTGCCACAATTCCTAACAATACCAGCACAAAATTTTGTACTGGCCCTGGAAGGTTGTTAAACGCATTTATTACCTTCGACGCAAATTCTGTTATTTTTGTAATGATCGGGGTCATTACCTCCGCCAGGCCGGCCATAGCCTCCTGAAACTCCTGATTTGCCTTTTTGTTCTCGACCAAAGCCTTGTTGTTTTCTTCCCAACCATTATAGGTATCCATTAGCCCAGCATCCGCCAGCGTCTGCAATGCATAGTTTTGCTTTTCCGCATCCGACGTACACTCCTGCAGTCCGGCTGAAAAGTTATCAGCTCCGATTCCCAGTCGATCTAGCAGTTCTCCGAATTGTCCTGTTGCTTTCTCGGTTGCAAGCGTCTCCTGCAAAGAGTCTGCAAGAGACTCTATTTTTAACGTGTCCGGGAATCTAAGGTATGCTCCAGATAATCCTTCTACTGCTTTTTGCAGGTTCGATTCAGTAAATCCAGCTTGCAAAAGGTTCGAGGTTGCCTCCACCGATGAGTCCGTTTCATCCGACACAACATTAAATTTATCAAATGCATCTCTTGTTGCATCGATCCCAACTCCAGCATTTCTGGCGTTATTATCCAGCTTCGAAAGATCAGATCTAAGTTCAGATGTTGCCGGAACAGTTGCCGCCATTGCCCCCAGCAACCCGCCCGCTGCCGTTGATAATCCTTTTGTCTTTTCTGACGCAGACGATGCCGCATCTCCAAATGCCGTTAGATTCGCATTTAAAGTAGAGGTCTTTTTCGCTTTTTCCTCCAGGTTATCCAATTTCTGTTCTGTCTCAATTATTTCTCTCTGCAGCGCATCGAATTGTTTTGGACTTATCGGATTGTCAAAAGTTTTGTTTACCTTCTCGGCCTCTTGTCGTAACTCTCTCAGTTTCACCGTGGAGTTCGCCAGCTCTTTTTCCAGGTTTTTGTACTCTTCTGTATCGACCTCCCCGGCGTCCTTCATTTCAGACATTTTTGCCTTTAGTTTTGTGATCTTATCTCCCGTTTCTTCAATTTCTTTTTGGATTGGATCATAGGCTTTTTTCCAAGCATCGTACTTTTTTACAGATTCCGCCGCCTGCTCGCTTGCTTTCCGAAGTGTTTTTAGCTTTGTCTCTGTTTCCTCAATCGATTCCGCCAAAAGTTTTTGCTTTTGTTCCAGAAGCGTTGTGTTCTTTGGGTCTAGCTTCAGCAATTTATTTACATCGCTCAGGCTTCTTTGCGTAGATTTCAGATTTTTCTCTACTCCATCCAGCGCCTTATTTAATCCCGTTGCGTCTCCATCCAGCTTTATCGTGATTCCGGTTATTCTTCCCGACCTCTTATCACCTCCCTATAGCCTGTCTATATCTTCCTGGGTTGCCATCACTGGATAGTCGTAGCTATCGTTTTTTGATTCGATAAACATGTCGTTAATCATCCCAATGCTCAGGAGTTCCAATTCTGATATGGATATCCCCAGTTGTACACACCTGAGCATAAATAGCGGCGTATTGATCTCTCTGTCTATTTCACGTTCTTTTTTTTTGGGGTTGACGATGTTTTGTTTTCCTTATTCCACATCTCCAAAATTTCCGGGAATATCTTGTAAATGTCAAACGTCTCAAACTGTTCCAGCCATTCATCTATTTCTTGCGGCTGGCTTGGGTCCCCATGCTTATGCATCATGTATGCAATGTTTTCAAAAACTTCCAGAGATTCGATCGGGAGTTCGCTTTCAAATTCTTTCTCGTCAAATTCTTCTCCCGCTTCTTCTGCTTTTCTTTTTAATTCTTCCTTCAACTGTTCTTGCTTCTCTACCTGTTTTCTAAGCCAGTTTAGATCCACAAAAATATCCCGTCCAAATTTAATCCTATACATTCGCGGGATTGCCGCCGAGCTTTTAAACTCGCACGGCAACCCGTTGATCACTATCGTTTTTCTCATTATGCTGCTTCCGCCTTTTCATATACTTTCGTAAACCAAGTATCTTTTACTGCTGCGTAAGATTCGTCTGTGGTTTTTGCCCTTACCGTTCCATCATCCGCAGCCCCGCAAGATATGGTTAATGTGTCCGTGTTTGGCTCTTTTGTATCTTCCGTTGTAGTCGCATCCACGCTTGGGCGCGTAGCGGTACAGTTATAAAACCAAAAGAGCGTTGGGTTTTTGTCTCCGTCCACCTGGAATCCCAACGCAAATTCTACAGTCTGTGCATTTGCGTCCTCAAACAGCACCCCGTTTTCATCCTCTACTTCCCCCAATACATCTTTCCTGAAGGAATCCGGGATCAGTGCGACTTCCAAATCTCCCTCGTATCCCCCGTTTGAAGCCGAAGTGTAGTATTTCACTCCGTCTGCGTAAAAAGGAGTTAGCTCCCCCTGCTGCTCCAGCGAAATAGATACCGCTCCCGGAATGGCTACCGGTGTCGCATACGTTGTTCCTGCCTGCTCTCCTGTCGTATTTTTTACTGCGTAATGCACATTTTTAATATTAAATTTTACTTTGCCCCTTTTTGCTTCCTCCTTAAATTTCCGTCTCAAAAATCACTTGTATCATTTTTTCTGATTCAATGTAGTATTCCTGTTTTTCGTAATAGAGTCCGTTTTCCGTCAGCCAATCCGAAATCTTTTCCTCGCCTTCCAGGTCTTTTTTCTCCGAATAGAATTCAACGTCCAATTCGATGATCTCCTGGTATACAATCCCATCCGCGGAAAAGTTTTCCGTCCCCGGCGAGTAATATGTTATATATGGCAAATCCGGGACGTCTCCTTCCGCGAAATGGCTGTATGCTACTGGATACCCAAGCTTTGTCAACCCTTCGTGAAATTCTTTTATGGTCCTTGGTTCAACCTCTCTTTCACGCGTCGCTCAAATTCTTCATTGCACCATTCTTCTACCTGCTTGATATGTGGGATTGCCCTGGCACGTCCTCCCTGGCGGAGCTGGTGCCCTTTTTCCAAGAGATGGGTTAGGCCTGGTTTGTTCTTGTTATGGACCACAAACGCAAACTTTCCATTCATTCCCCGGATGTATTTTACGGACCATCCGTCTGCATAATGGCCTTTTGGCCCTCCATTTCCCTTTCCCCGCGGAGAATTTGTTTTTAGTTTCTTCGCTCCTTCTCTCGCCACCGCTCGCGCGATTTTTTCCAGATCGTCCTCCGTCACGTCTCTATACTCTTTTAATTCTTCCAAGACCGCATTGGCTAAGTCGTCAATGCTAATTGTCTGCGCCATTTTTTGTCACCGCCTTGATTTTTACCGTTTTATTCCGAAATTGGATATTGTCAATCAGGTTGATATTGTAGATCGCCCCTCTAAACAATATCCGGTACTGCTTTGTGTTCATCTCTTCAAAAAACGGATGCCATCGAAAAACAAATTCTACCGTGTTTTCTTGGTGTACGTTTGCCGCTTCCCAGTATTCCGCCCCCGACAGACCATTTACATAGGCGTATGCTTTTCTCTCTTCCTTCCATTCTCCAGATGGATTCCCGATTTCGTCATATGCGCTTATGTATTTTTGGAATGTCACCAAGCCCCTGTATGCTCCAGAATCCATCAGCTCGCCTCCCCGTCTGGCGTTGGCAGGAGGTTTATGCAATGCATTCCAAGGATTGTATCTACTACCCGGTTGACGTTCGACCGATCGATCGACATAGACCGGTTATCCCACATATCCGATATCAACGTCAGGACCGCGATCGTAATATCCTCGTTTTCATCCAGCTCTTCTTTTGTCAGTCCCGTCTGCCCTACGCAGTAGCTTACTGCCGCGGCCTGCATCGCCTGGATTAGTGTCAGATCTTCTTCAGTTAGATTTTCTTTCACTTCCCGCAGATGATCCAGGATTATTTCTTGCGTTATTTCGCTTACTTTCCTTTTTCTTCACCGCCTTCACCGCTTCCAGGTATCCGCATTTTACAAGAGGGGCCGCCAGGGAGCTAGAAAGCTCCCTTTCTTCCCCTTTTCGCATAGTTACCTGCCCCGCGAATGATATGATCGCTCTATACTTCATAGTCTCACCTAGCTTTTGATCGCGGTTTTGATGTCCGTCCAGTTGTCCTTAATCCATGCGATCACGTCCGAGACCGTTTCGCCCGGAATATCCGCTGCCGTCCCGCTTCCTTTCATGGTTGCCGCCAGTTCTTTCAATTCTTCAACGATTGTCATAGGCTCCTCCTTATGATTCCGCCATGGTCAAAACGGAGATCTTCTGCTCGTTTTCCACTTTCGCATCCAGTTCTGTCCATCCAACCACGCCTATCGCGTGCTGGGTTGCGTAATGCTCACGCAATACCTGTACCTCAAATTCTTCTGTGATTTTTGTAGCCAGGCCGGACATATCTCCGTAATAGATCGTCTTTGCGGACGCGGCCATTTCTGGCATGTTATCAGATACGTATACCGGCTTACCAAGGAGCGTTTTCCCGAATGGCGCCGTGATATCGTCCTGCATGAGATATCTCCCTACGTCATCTTTCAGTTTCCGAATCGCTGTCCTGGTCTTCGAAGACATAATCCAAATCGAATTCTGCTGGAACGCGTCCTTTACGGAATCCTGTAGATCGATCAATTCGTCCGCCGTAATCGCTGTGGCTGCCGCCGCCGTTACTCCCTGCGTTACGCTTGACAGTCCTTCAATCTTGCTTGTCGTCCCCTTCAGCAGTTCTCCTTCAATCCATCTGGCTGCCTGGTACGCCATATGGTCCACCACAAAGCCCGTTACATCAAACTGGCTGTTATTAATCAGAGATCTTGAGATCTTCACCAGCGCTCCGGCAAGGAATCCATTCAACGTGATGCTTCCGAACTTTCCGGTGTCCGCTGTTAAATCCGTAAATTCCGTTGCGTATGCCATTTCTACGTCATTGGAATCCGCCGGATAAAATGGGATGGTCAGGTTTCCTTTCACGTTGTATTTTGTAGATCTCTCCAGAATCGGAGAAATATCGTAGATTTTCTCAATGATTTTGTTCGCAATTGTTGTCGGGATGATCGCTCCGTTATCCGTCAGCGTCATGTTTGTATCTGCCCGGACTTCTCCGGCCACTACTCCACGCAGATAGTCCGCAAACGCTTTCTCTTCCTCCTGCGCTCTCTGCTCTGCCCCTTCTTCTTTTACGCCGTCTACAACTTCCCGTTTCCCTTCGTTTGTTTTTGCCAGGCGGCTTTTTACGTTTTCCAGGATCTCAATGGTTTTATCGATCTGTTCGATTTCCCCTTCGATCGCCCCGATCTGCTTCTCTTCTTCTTCCGTGATCGCTCTCTGCTCCGCTTCCAGCTTTTCATACAGCATTTCCAGTTCTTCTACTTTTTCAGCTCTCGTTTCTCTGAGCGCTTTGATGTTGTTTTTCTTTTCAGCAAAAAACTGGATGTTGTACTTCATTGCTAATTTTTTGTTTTTCCTTAGACTTTCCCTCCATATTTTTCAATTATTTTTTTCAGTTTGCTGTTGTCTGGTTTCTCTTTTTTATCTTCAAACCCGATATAATCGGGCTCGAATTCTTCCGCTCGAATCTCGATGGTCTCTTCTCCTTCTTCTCCAGCCCTCGTTTCCACGGTTGTAGATTCATACCAGGGCCTCATACGATCCGAGATCAGCGACACTTCGCTGATGTCAAAATCAGTGATGGTTCTAAGCGGCAGCCCTCCCGGAGTTTCTGCCCTGGTTTCCACTGGCCGTTTGATATCAAAGGACCAGCCACGCAAGCGTTTTTCTCTCGCCAGTTTTACTACCTCTGGATCGGACGTTTCCAGGTGCGCTCTTAACCCCACCACATCCTCACGCAGATTCAGGTTTTCTCCTGTCCGTCCGATGGTCTTGTCCCATTTATGGTCTAGCAGCGCTGTAACTTTCTGCGCCCGCGTAACCGCCCGGCCGAATGCTCCTTTCTGGATGCGCTCGATAAAATATCCGCCCTTGCGGTCTGGTATGGGCCGGCTGTCTCTTCCGGTAACATTTACATATCCGTCGATGATTACGGTTTCTTTCCCTGGTTCTCCCCGAAGCTCTATTCTTGCCCTTCTTCTACACCTCCTTCTCCGATTTTTACGGTCTTATTTGTGTTTGGGATATATACAGTTTGTGTTTTTGGATCATACAGGACGTCTTGCAATCCTAATTTGATCATATTCATTCCCAGCGGCTCCATGTTTTCCTTTTTACGGATCTCATCAATCTGCAGCCAGCCTGTTTCCGCCGCCGTCTTGTAAGCGCTGTACCGCTTATCCGCATCTCCTTTTGTCAGTTCGTACATGTCTGGGGCAAAATAGTAGGTCCCTTTTTCTGATTCCAAGAGCAGCGCCCGGTTCAGCGCCGTCATGAATTCATCCATAAACGTGCTTACACAGTATTTTATATACGCCTTATCCCCGGCTTCCGTTCCTAAATCATCTGGGATTCCCAGAATCCTCCGGAACTCTTCTCCGTTGGTCTTTTTGTTTTCGTTCAATTGCATCTCTACAGATGTGTTTGACGCCTCCTGGAACTCTAACCCGTCATTCAAAATCACCACGTTTTCCGTGTTGTTCTGATACAGTTTCCTCCAGGCTTCTTTTAGCGCGTCAATTGCTGTCTGTGTCAACCGCTTTGCTGATTTCACAAATCCTTTCTTGTTTCCGCCCGTCTTTACCAGCCCTTCTTCGTAGGACAAGGAATTATAAGCCACACTAAGCGCCTTTTTGTTTTGTTCTACAATTCCATTCCCAAACATGCCATCTCTGGTATTCCTTAAAATTCGAACAAATTCATCCGGGAAGTAAGATCTTCCCTGGACCAGGATGACATAATCCTTGAAAATCACATCTGCGTTTGGCGCAATTGACACATATTCGCTATAGACATAACGCAGAGAGTTCACTTCCAGGCCCGTCCAATCGACATACACAAACCCGCCGCGGCCCAGGTAATAATCCCGAACCAGCGCTTGTTTCATCTGATTCGCGTCCAGCGTGTCTCCAGTTTCTTCGTTCAGCAGGTAAACCCGTCGGTCTCCTTCCATCTCTATGACTTTCCCGTTTTCTTTTTTATACAGACGAATCGGAATACTTGCTACAGTACGGGCAATCTCACAAATCGCTCCGGCTACCGACGGGACCTGCATCGCTTTTTCTCTTGTTATATTCTCTTCCGAAAACCACGCCCGGAGAAGCTGGTCGCTCACCTGGCTTTCGTCCAACACCTTTTCCTCTTTCTCCGGCTCTGCTCTCACTCGTTTAAAAAACGCCCTTTTATTCGACCTCCCGTTTCGTCCATCTTTCTAATGTTTTTTTATTCGCCACACTTGTATGGATTGTAATCTCCGGCCAGCCTGTTACATCTGTCATTGTCCCCAGATGTATGTCAATCGAGATGTCTGTTATTTTATCAAGGCCGTTTCCTACCATGCTTTCCGCTCTTTCTATTAATTCTTGTCCAGCCTCTTTTATTTGCTGTACAATCTCATCTCTATATGGTTTTATCACTTTCTTTCCTCCTTATGCCGTCTGTACAACAAAGCCATCATCGTATAATACATATTGCTGCAATAGATACATGGCATTAATCAGCGCCACTACCATGTCCACCTTTCCATTTGATTTCTTTTTGTTGACATATTTGTTTTTATTGGTGTCTTCTGTGCACCTTGCGTTCTGGAAATTGATCTCCAACATTCGATTATCGCAATATCGGAATTTCTTATCCAGTATCAACTCCTTAAGCCACTTCGTCGGCATGTGTAGGACGGAGCTGTGCTGTCGGATTTCTACACATTCATACCCATCCGCTTCCAATCCTTGCACAGTTGCCAGCGCGTTCCACTTGTCATATCCGATCTGCATGATCTCTACCCCGTATTCTTCTTCGATTTCTTCCGCTTTCGCCTTGACATATAGATAATCAATCACTTCGCTTCCGCACGAAAAGCAATCCCCGCGCTCGATCAGCCTTTTGTAGTCAACATGTTCTTTTTTGCTTTTGAACTCGACTTTTTCCTCTGGCACAAATCCAAATACTTTCGCATATACAACCCCTTCTTCCTCCGTTACCATTGCCAGCGCTGTGTTGTCGTCCGTCTGGGATAGGTCCAGCCCCATCCAGACTCGGCGCCCCTTCCAGAACTCCTTGTCGTTTTCTTCCCTGCACACCTTTACTTTCTGTATGTCAATATAGCCTTCCACTCCCAGGCCCTTATAAAGAATGTTATTATGCTTACATAAATAGTTCTCTCTCTTATTTTCATACAATACCGCAATCGCGCGCTTTTTCTTGATCTCTTCAAAGATATAGGAGTGTGACACAGCCACCGGATTGCTCTGATAGATGCAACGATCATCCTTCTGCCATGCCTCCCCGGTCTTCAGGTCTTCGTCCGGCTCGTATAGCAACGCAAATACCCGCTGATCTTCCAGCAATCCATCAATCGCTTTTTTTGCGATGTCGATTTCGTCGATCATAACGTTGTCGTCGTTAGGGTACTGGGTGCTGATGATGATCCCCAGCTTGTTGAATAACGTGATCTGGGAGGACCGCATGGCCTCCACCGGGTACTCGTCCAACGCTCCCGCTTCATCTGCCAGAAAAGCATTGGCCATCTTCCCGTCCATGTTGTCCTGGCTATATGCCAGCGGCGTGTACTCGTTGTCATTGATCAAGCATCGGATCTGGCTTCTCATGATCTTAAATGCCGGGTCATCTTCGTCGTAGAGCGCCGGGCTGACCTTGATGATCTTCCGGATCGCCAGCTTCAATTCTGACGATAGCGATAGGTCTGGCGCCACAGAAAAGAACCGGGAAAAATCCGGCTCGATCAGCATCAGCAGGATGAATATGATCGCGCTCACAAATGTCTTGAAATTCTTTCTTGCGATCTCCAGGACTCCGGTAACGTAATATCGAATGTCCAAATCACTTCCCTGTACCTTCGTGCACATAACCGCCACGATAAACAGCCATGCGTAATCTTCTATGCTGTCGTACATTGGCTTTCTCAGATCTGGGTGGATCATGATCTTTAGCAGGTTACAGACTTTCCTGTATGCTTCCTTATCGACGTAGGCCTCCTGGTTCCTGCCTTCTACGATGTCCAGCCAGAGTGCGGCCTGTCGTTTGACATAGATCGGTACATATCGATCGGGTTCTGTTGTGCACCACTTCGCGTATGTGTACGCCTTTCCCTCTTTAAGCATTCAGCACCTCCCGCAGGGCGTTCTTTTTCGTCTCTGGCTTTTTAGGGATCGACCGCAGGGCCGCCGCTATGGTCATCACATTCTCTTTCTCTATATCAAACAGCATTTTTCGCTTGGTTTGTATCTGCCGATCGTAGCTGATCGCCATCTTGGACAAGTCGTTTTGCAAGGTGATGTACTCTGTAAACTCCATTTCTCCGGCTTTTTCTTCTAGCTCCTGCTGCCGGCAGAAGATCTTCTCTCGCTGCTCCTCAAACTCTGCGCACTCTGCAAAGATCAGGCAGTATCTATTGATCGCTCCGGAGTAAAGATCGTCGTCTTTTCCAATTTCCTTCAGTAGGTTTTTTAGCCGCAGGAACTCCCGATGTGCGATTTCATTGTTTTTGACCTCTGCCTTTTCCTTTAGCTTTTTCCCCGTCAGGAGGTTCTCTTCTGCCTGTTTGCGCTGGCGCAATTCTTTTTTTGTCCGATGGGACTTACCTTCCAGGCGGATCACATCCGCTGGCTTCACTGGTGTTGGCCTATTGATTCCTCCTTTCAAAATCTGATCTGGGAATATTTTATGAATAAGGGGTGGGCGTCGGTGGTATAAATCGTCAGAATTTTCTCTCCGAACCTCCGGGGAGGATCATCCGCAAGCCGGACATCCGTCCCTTTCCTCCTGTTCCTCTGCTATTCTTGATAATTCTTCTCGTTTAACTTCCCCTTTTTCTGCCATCTCGTGGTGCCTCGGACACAATGTTATCAGATTGTAATCATCCAACCTTCTATTCCAATCCTCTGCTACCGGTACGATATGATGCACGGACAGATTTTCTGTTTCAAAGATCCGCTCCGGATTGTGCAACCCCCTCACTCACAACTGGCAGACCTGCCTGTCTCTAACTCTGATAGATTCTCTTTTATCTTTCCACTGATTCGTATATCGGAATCTCGTTTCCGGTGTGTCCTTTTTCTTTTGCCTTATTTTTTGTGCTTCCTTTTTCTGCGGGCAGATATTCTTGCTGTCATGTATCCTCCCGCAGTAGTGGCAGGACTTTAACATCTTTCTCTCCACATCTCCTCTATCAGCTCGGACATTCCTTTTTGATCCATTTCCCAGGTATCTTCCCCAACCTCTACCATTTGCGTATTTGGTTCCTTTTTTCTGTTTTTCGCATTTGATGTATGCCTACATTCTCCTCCGTTCTTATAGCAGCTTTTTTTGTTGCAGTCCTTTACTTTCCCGTCACATAAATAGAAAAACATTTTATCCCTCCCTGAATTGCGCCCCCTGGACTCGAACCAGGATTGCCGGCTTAGGAGGCCGGTGAATTACCATTACTCTAAGGCGCAGATCTGTATCAGAAAAGGCAGCCTCCGTTTCCGGTTGCTGCCTCGTCTGAGATATCTTATCTCAATATCTCATGCTAGCATTATAGCACGGAATTCTGTCCCCTGAGTGACGCACTTTTGCAAATTTACAAATATTTATGTCTCACTTCCTTCAGTTCGCTTTTCTTGTCCCATATCCTCCCAACCACTTGATCCATGGCTGTCATGATCCCTTTTATCCTTTGGTCTGTGTTTGGTCTGCTGTATATTTCCATCAACGCCTCCATGGCTGTCAGGTACTTTCTCATCTCTTCCAAAATCGTCATTGCCGTTTTATTTCCCCGGATATCTACCAGCCTTTCTCCCGTCCTGTGTTTTCTGTCGGCTATTTCTCTGACTTGCGCCAGTTCTGTCCTTCCTGCTGCCGCTGGTGCCTCCTGCTGCCGGAAGTAATAATCTACCAGGTAGTCGTATACTTCCCAGGCTTTGTCTGTGTTCAGGCTCTTTGCGTGGAGAAGAGCGCCTTTCTCTGTCCAGAGGTATAAGATTTTTGCGTTTTTCTCAACAAGGCGTGAATTTCCACCCTCGTTATTAACAAGGTGAACATTTTTCACCTTGTTCTTGAAAGACTTCAGTTCTTCCCCTTGCAATAGGAGATAGTGCTTTCCTTCCGCAAATCTCTTCCTATTGTTCGAGTAGTTCTTTTTGATAATATCTACTGTTGCCTCATAGCACTCTGCTAACTGCTTGCTTGTAAGCACTCTGATCCCTTTTACTTCTATTGTTTGTGGTAACTGCATATATGCCCCTCCTTTTTACGCCTATTTTTGGCGTATCTTTTTGATATATATTACGCCATTATTAGGCGTAAGTCAATAATTATTTTTATAGGAGGTTCATATGTTTAAGGATAGGCTTCGTGCCACTCGTATATTTCGCGGCTTTACTCAGCAAAAGACAGCAGATTCTATCGATGTTCCTCTTAGACATTATCAAAAATACGAGGGCGGAAACATTGAGCCTGATCTTTCCACTTTGGTCAAGCTCTCTGACCTGCTCAACGTCCCCTCGGATTTTCTTCTTGGGCGCGACGATTATCTGAAATCTCTCGGAGTTTCCGTTGATGTATCTCTAGAATGTCCTCCAAGGCGTCCCAAATCTCAAAGGAACCTCCAATCTCTGCATATTCAATCTTCTGATAGTGCCTCAAAGTAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP041667|2078196:2119060|2091465_2092275_-|WP_143930133.1|DBSCAN-SWA MALVEVKNISYKYPNGYLAVDDVSFSIEAGENIAIVGQNGAGKTTTVKMLNGLTKPCDGDVLIDGDSTKNYTTAQMARRVGYVFQNPDDQIFNSTVYSEIEYGLKKMNVDMEESERRIKDAAALTGMDKYLESNPYDLPLSIRKFVTIASVIASNCDVMIFDEPTAGQDLDGLDRLSELNKILTGRGKAIVTITHDMEFVADNYERVIVMCKKKVLADGRTEDVFFQKDIMEQAMLKQPALVRIATKIGMKENTLDVKKVADYIKAKQK >NZ_CP041667|2078196:2119060|2089775_2091284_-|WP_143930132.1|DBSCAN-SWA MLEKLDLTKTVSKSEYKEKMMELEPKLGKLQRECRELGIPVMIAFEGYDAAGKGVQIAELIRALDPRGFEVHAVKKETKEERMHPFLWRFWMKMPPKGRIAIYDSSWYRKVLIDRFDKKIRKKEVENAYRSILSFEEQLTADGMVIIKLFLAIDQKEQKKRFAKLLESKETAWRVGKGDLKRNKEFPRYEAMNEEMLARTDTEYAPWNIVEASDRRFAAVKIYTIVERILSEKIEKEKKARRVIAEPEPAEEQGSERGLGETILSKADLTLSYSKEEYKERLEKLQAKIEKLHGELYRRRIPVVLGFEGWDAGGKGGAIKRLTQRMDPRGYVVHPTASPNDIEKAHHYLWRFWTDMPKAGHITIFDRTWYGRVLVERVEGFCTRQEWQRAYKEINDMEKDLTDGGAIVLKFWMQIDKEEQERRFKAREEDPQKQWKITEEDWRNRAKWDQYEEAVNEMLLRTSTPEAPWIVVEGNCKYYARIKVLETAVKAIEERIRQEEKR >NZ_CP041667|2078196:2119060|2099830_2100127_+|WP_143930142.1|DBSCAN-SWA MKISRKDLARQVRKEYKEGREWSDLRRDHCYTMMIDVSDGSIWADCLDRNRWKEYESDTVVRLNLYEAESYEPVERVEELYIELAITKLKEAGHEIIS >NZ_CP041667|2078196:2119060|2110192_2110597_-|WP_143930156.1|DBSCAN-SWA MAQTISIDDLANAVLEELKEYRDVTEDDLEKIARAVAREGAKKLKTNSPRGKGNGGPKGHYADGWSVKYIRGMNGKFAFVVHNKNKPGLTHLLEKGHQLRQGGRARAIPHIKQVEEWCNEEFERRVKERLNQGP >NZ_CP041667|2078196:2119060|2093124_2093892_-|WP_143930135.1|DBSCAN-SWA MKMSERQKGVKALNPLTILYLVIALAVVSAIFDYRVTLATVVVMIVIAASAGEGVSFIKLWLKSIVLICVICFVLQSLFIPGEEVLWEFWVLQIKMESVQKAIVLCSRIMGIGSAILLGGKLIDIKKLMVMLEKRGMSPSATYVLVSTTNIIPQMSKKMNAILEAQKSRGIETDSNIIVRAKAFFPSVGPLILNSLVSAEERAITLEARAFSVPCKKTTLKVVEDTQTDRIIRGLMIAAVVLAIGGKIVLWIVFR >NZ_CP041667|2078196:2119060|2100425_2100611_+|WP_143930144.1|DBSCAN-SWA MESRKAKIIFTRPGGTAGKNSQSSRITLPITWVREMGISEDNREVQVCFDGKTIMIKKETN >NZ_CP041667|2078196:2119060|2100611_2100950_+|WP_143930145.1|DBSCAN-SWA MGIDKEILDAYKLKKSIKATVAHTGYSWNRVVKSLSSNGIILNDTHQKIIDLYNHKTPVCDIASQLKISTKTVESYIPRTRPVYNENPSPNAMRIRKCRDGKKSDRKDAPEP >NZ_CP041667|2078196:2119060|2105500_2106202_-|WP_143930152.1|DBSCAN-SWA MKITGFKKGPAEYEGVIVFRGPISQRKENLNRFFEAGERDILNETPGKLICGEHYIRSYILSSSTAPAEGALWTEKNVKFLCPYSFWIKEETKQFFKQNEIQPEDGLDYPYDYEYDYTAQKAGTASWYIDHFAKSKFKMTIYGPCVNPRILINGYPYEVFDTLEAGEYMVIDSRENTVLKYLTNGTIQNIYDLRAKKQSVFEKIPAGNLDIVWSGAFGFDITLFLERSEPRWN >NZ_CP041667|2078196:2119060|2106321_2108577_-|WP_143930153.1|DBSCAN-SWA MLKLDPKNTTLLEQKQKLLAESIEETETKLKTLRKASEQAAESVKKYDAWKKAYDPIQKEIEETGDKITKLKAKMSEMKDAGEVDTEEYKNLEKELANSTVKLRELRQEAEKVNKTFDNPISPKQFDALQREIIETEQKLDNLEEKAKKTSTLNANLTAFGDAASSASEKTKGLSTAAGGLLGAMAATVPATSELRSDLSKLDNNARNAGVGIDATRDAFDKFNVVSDETDSSVEATSNLLQAGFTESNLQKAVEGLSGAYLRFPDTLKIESLADSLQETLATEKATGQFGELLDRLGIGADNFSAGLQECTSDAEKQNYALQTLADAGLMDTYNGWEENNKALVENKKANQEFQEAMAGLAEVMTPIITKITEFASKVINAFNNLPGPVQNFVLVLLGIVAALSPVLGIIGKLSTLLGSGGLTAALGSIKAGFAAVGAAISGVAAPILIVIGVVTSLIAIFATLWKTNENFRASVSSAWSQIQAAIQTAISTIQAIFAAFVELVNVIWTQWGDTIMQVVQNTISTITTVVNSGLKIVQNLIKLVTSVLKGDWKGAWNAAKAIVQSALSLIRSLVTSTFNNIVAVIRGIGSKIGSAVQSAFESAISFLTSLPSKAVKWGSDFIDGIVRGIKGAIGKVTGAVKDVADKISSFLHFSRPDEGPLHFYEEWMPDMMKGLAKGIYDYIPTIQKAAQAAAETINYEIMKDAPAQTIDYNMLYNSVRSGASDSSTVLYIGERAYKRQLKGMGVVFQG >NZ_CP041667|2078196:2119060|2094594_2095554_-|WP_143930137.1|DBSCAN-SWA MQKLIIDTDPGIDDALALLLALSAKSELEVEAITTVNGNVGVDQVTKNVFRILDVAGRSDIPVYKGNGKPLMRENDNCEEFHGDDGLGNLGFKEVPGTVKEEHAVDFLIRKVREEKGEITLVPIGPLTNIAQAVQKDPEFAKNVKEVVIMGGAEHGGNMSPHAEFNFWTDPEAAKIVFQAGFERVTMVGLDATNYVFLSPTLRELLYLINTPVSRFIHKITRVYADGHWEIEKKLGCELCDVLTIAYLLDRNVVEKVDAFVDVETSGLCDGASVVYRTKYYPDKKKNCEVAVKADTKRFFEIFFGYLFPEHMDVARELI >NZ_CP041667|2078196:2119060|2113374_2114466_-|WP_143931264.1|portal|DBSCAN-SWA MQVPSVAGAICEIARTVASIPIRLYKKENGKVIEMEGDRRVYLLNEETGDTLDANQMKQALVRDYYLGRGGFVYVDWTGLEVNSLRYVYSEYVSIAPNADVIFKDYVILVQGRSYFPDEFVRILRNTRDGMFGNGIVEQNKKALSVAYNSLSYEEGLVKTGGNKKGFVKSAKRLTQTAIDALKEAWRKLYQNNTENVVILNDGLEFQEASNTSVEMQLNENKKTNGEEFRRILGIPDDLGTEAGDKAYIKYCVSTFMDEFMTALNRALLLESEKGTYYFAPDMYELTKGDADKRYSAYKTAAETGWLQIDEIRKKENMEPLGMNMIKLGLQDVLYDPKTQTVYIPNTNKTVKIGEGGVEEGQE >NZ_CP041667|2078196:2119060|2114839_2116531_-|WP_143930163.1|terminase|DBSCAN-SWA MLKEGKAYTYAKWCTTEPDRYVPIYVKRQAALWLDIVEGRNQEAYVDKEAYRKVCNLLKIMIHPDLRKPMYDSIEDYAWLFIVAVMCTKVQGSDLDIRYYVTGVLEIARKNFKTFVSAIIFILLMLIEPDFSRFFSVAPDLSLSSELKLAIRKIIKVSPALYDEDDPAFKIMRSQIRCLINDNEYTPLAYSQDNMDGKMANAFLADEAGALDEYPVEAMRSSQITLFNKLGIIISTQYPNDDNVMIDEIDIAKKAIDGLLEDQRVFALLYEPDEDLKTGEAWQKDDRCIYQSNPVAVSHSYIFEEIKKKRAIAVLYENKRENYLCKHNNILYKGLGVEGYIDIQKVKVCREENDKEFWKGRRVWMGLDLSQTDDNTALAMVTEEEGVVYAKVFGFVPEEKVEFKSKKEHVDYKRLIERGDCFSCGSEVIDYLYVKAKAEEIEEEYGVEIMQIGYDKWNALATVQGLEADGYECVEIRQHSSVLHMPTKWLKELILDKKFRYCDNRMLEINFQNARCTEDTNKNKYVNKKKSNGKVDMVVALINAMYLLQQYVLYDDGFVVQTA >NZ_CP041667|2078196:2119060|2102643_2103048_-|WP_143930148.1|DBSCAN-SWA MITKEQAKQFRRLLEMQTTTLTDTQALEVPSFFPAWVIGYPYKVGDRVLYNGILYKCLTEHTSQADWTPEAAPSLWTKVLTDPSGEVLPWEQPESTNPYMTGDRVIHNGKTWQSTVDNNVWEPGVYGWIEVQEE >NZ_CP041667|2078196:2119060|2108888_2109308_-|WP_143930154.1|DBSCAN-SWA MRKTIVINGLPCEFKSSAAIPRMYRIKFGRDIFVDLNWLRKQVEKQEQLKEELKRKAEEAGEEFDEKEFESELPIESLEVFENIAYMMHKHGDPSQPQEIDEWLEQFETFDIYKIFPEILEMWNKENKTSSTPKKKNVK >NZ_CP041667|2078196:2119060|2096515_2098417_-|WP_143930139.1|DBSCAN-SWA MNIHDIAKLAGVSASTVSKVMNGKDKDISDKTKQKVLEVIEQEQYVPYLKFREKEGLKSHLIGLVMKKYNREGAQIVRSAQKEAAEKGYGLLVRFADNLEEIQECIDDMIKKGVAGLLVDSEKLLNTRKLEDVTVYLNQTKEFDDRQKATFYYRLSEAGRLAAQRLMQEGHERIACVTMTGERTIQDGYQIAMREAHLAIQPLWVYEGRSLEEIEDYGIQQCLEENVTAVICGSQEIAGCFYKAVERLQIAMPDSLSIIAVGDGRWMEILADGITAVRLPAEEMSREAVDYLVEMIQGKKQVEVMRKFSPSIVERNSIMGSPREKQGERIVVVGSMNMDITIEVSRIPLKGETQIAEKVYTFPGGKGGNQAVGAGKLGGRVYMIGCVGNDLDGKQLYSSLTENHVHMEGVQVNASVPSGKAYINVDQNGESTIVVYQGANRSLSIEQINRCRYLFQSAKYCLLSLEIPEVIAEYTIKFCRRNDTKVILKPSATDKIKEELLKDIAYFVPNENELHRFVPGRMSLEEKAQFLFEKGIENVIVTLGEKGCYLKNQDYSMYFDGTGFEAVDTTGGADSFISALAVYLSEGRSLIHAIGFAVYASGISVTRYGVQPALPDRKAVEIYEDEIYSRYQI >NZ_CP041667|2078196:2119060|2118745_2119060_+|WP_143930165.1|DBSCAN-SWA MFKDRLRATRIFRGFTQQKTADSIDVPLRHYQKYEGGNIEPDLSTLVKLSDLLNVPSDFLLGRDDYLKSLGVSVDVSLECPPRRPKSQRNLQSLHIQSSDSASK >NZ_CP041667|2078196:2119060|2100241_2100424_+|WP_143930143.1|DBSCAN-SWA MTLRNWMNDHFFDDTYPYQIIENNQKLNIGWDEYINYKVLKVETIDREVDKLKLIYVERI >NZ_CP041667|2078196:2119060|2112769_2113378_-|WP_143930161.1|head,protease|DBSCAN-SWA MELRGEPGKETVIIDGYVNVTGRDSRPIPDRKGGYFIERIQKGAFGRAVTRAQKVTALLDHKWDKTIGRTGENLNLREDVVGLRAHLETSDPEVVKLAREKRLRGWSFDIKRPVETRAETPGGLPLRTITDFDISEVSLISDRMRPWYESTTVETRAGEEGEETIEIRAEEFEPDYIGFEDKKEKPDNSKLKKIIEKYGGKV >NZ_CP041667|2078196:2119060|2081379_2082492_+|WP_143930125.1|DBSCAN-SWA MAGSTYGTLFTITTWGESHGPGVGVVIDGCPAGLPLSSEDIQKYLDRRRPGQSRYTTARNEADEAEILSGVFEGRTTGTPISILIRNQDQRSRDYGNIKDCYRPGHADYPFDAKYGFRDYRGGGRSSGRETIGRVAAGAVAAKLLERLGVRLLTYTKSIGPVSIPSEEYDYSQISCNPLYMPNEEYARKAQDYLQECIHSLDSSGGIIECQAKGLPAGIGEPVFQKLDACLAKAIMSIGAVKGVEIGDGFAAAKSKGSLNNDPFICQDGKISKMTNHSGGTLGGFSDGSALILRAAVKPTSSIAREQKTVTSSLENTTLTVKGRHDPVIVPRAVVVVEAMTALTLIDLMMQNMTARLEWMEKFYLGPGDF >NZ_CP041667|2078196:2119060|2116523_2117060_-|WP_143931265.1|DBSCAN-SWA MNRPTPVKPADVIRLEGKSHRTKKELRQRKQAEENLLTGKKLKEKAEVKNNEIAHREFLRLKNLLKEIGKDDDLYSGAINRYCLIFAECAEFEEQREKIFCRQQELEEKAGEMEFTEYITLQNDLSKMAISYDRQIQTKRKMLFDIEKENVMTIAAALRSIPKKPETKKNALREVLNA >NZ_CP041667|2078196:2119060|2079349_2081185_-|WP_143930124.1|DBSCAN-SWA MGNTDKNADEIIEETMKEIYDDLDRDKVFEKKEEEDIPVSEEEDLDDKDLQEKDLDEEEFLDEEDEDEGFDSEEFLDEEDLDGEEDLEEDFLDEEDLEKEEDFQEEAEEALEEELDKDSAEDQTDILNPEEESEEEEETDAEKAYRKHKFRKKLAIIIGSIIGVIAVVYIGFAVYFNSHFMFFTTINGTDVSLKSVSQVEDYMRQQVEDYTLTLEESDGGTEEIAGKNISLEYVPGKQLAQLVKDQDNFLWLTTLWDHPELDAEVGVKYDETALAEQIAALACMNPENQVASVNAHPEFKETQFEIVPEVVGTQINEEVFNEKVRSYIEGFQHTLNLTEEECYIKPSYVSDSEEVIAANNAMNSYLKAEITYDFNPNTEVVNAAVISQWVTVNDKMEVTFNTDAVKQYIQSLADKYDTKGKSRQFTTATGNTVTVEGGSYGWKIDQDAEYNALVANIQNGEVVTREPEYSSRAASHGSIDIGTTYAEVDLTNQHAYFIKDGQVVLDSPVVTGNPNKGNGTPQGTYSLSYKTKNAVLRGDRLPDGSYTYESPVKYWMPFNGGIGFHDASWQSSFGGERYKTNGSHGCINMPTDQAAKMYDLISDGTPVVCHY >NZ_CP041667|2078196:2119060|2109884_2110196_-|WP_143930155.1|DBSCAN-SWA MKEFHEGLTKLGYPVAYSHFAEGDVPDLPYITYYSPGTENFSADGIVYQEIIELDVEFYSEKKDLEGEEKISDWLTENGLYYEKQEYYIESEKMIQVIFETEI >NZ_CP041667|2078196:2119060|2111171_2111378_-|WP_143930159.1|DBSCAN-SWA MKYRAIISFAGQVTMRKGEERELSSSLAAPLVKCGYLEAVKAVKKKESKRNNARNNPGSSAGSERKSN >NZ_CP041667|2078196:2119060|2086918_2088688_-|WP_143930130.1|DBSCAN-SWA MYYSDELIEEVRTKNDIVDVISSYVRLQKKGSSYFGLCPFHNEKSPSFSVSRQKQMYYCFGCGAGGNVFTFLMEYENYTFLEALKYLADRAGVELPQQDLSKEARQRADTKAVLLEINKAAARYFYIQLKGRQGEKALAYLKGRQLGDDTIRAFGLGYANKYSDDLYRYLKDQGYKDDMIAKAGLISVDEKHGAHDKFWNRVMFPIMDANSRVIGFGGRVMGDGKPKYLNSPETMIFDKSRNLYGLNRARSSRKPYFLLCEGYMDVISLHQAGFTSAVASLGTALTQGHASLIKRYVNEVYLTYDSDEAGTKAALRAIPILREAGISAKVIRMEPYKDPDEFIKNLGKEAFEERIREARNGFLFGLEVLERDYDLNSPEGKTDFMKETARRLNEFREEIERNNYIEAVAQKYRVGYEELKQLVVHTAVQTGLAKPVSRPRKAGKPEKEDGILKSQKILLTWMIEDEKIFRQISAYITPEDFTEGLYRKVAHLLYDQYEEQKVNPAQIMNYFTEEEEHREAASLFHTRIRELTTADEQEKALKETIIRVKNHSIDEKAAHLEPTDIQGLQKLMEAKRQLQGLEKLHISIN >NZ_CP041667|2078196:2119060|2117991_2118645_-|WP_143930164.1|DBSCAN-SWA MQLPQTIEVKGIRVLTSKQLAECYEATVDIIKKNYSNNRKRFAEGKHYLLLQGEELKSFKNKVKNVHLVNNEGGNSRLVEKNAKILYLWTEKGALLHAKSLNTDKAWEVYDYLVDYYFRQQEAPAAAGRTELAQVREIADRKHRTGERLVDIRGNKTAMTILEEMRKYLTAMEALMEIYSRPNTDQRIKGIMTAMDQVVGRIWDKKSELKEVRHKYL >NZ_CP041667|2078196:2119060|2085030_2085750_-|WP_143930128.1|DBSCAN-SWA MELSKRLYAVAGLVTEGASVADIGTDHGYVPIYLVEEGIARKALALDVNRGPLERARMHIVGHGLGDKIETRLSDGLREIRPGEVDTIIVSGMGGPLTIRILKDGEAVTDKLEALILQPQSEIARVRRFLVEQGYRIQREDMVFEDGKYYPVMRVVHGAPEPYEAWEYLYGKRLLEERHPILGEFLLRELRIQESILEQLARRKESVSAQERAKEICEARELTQKALDRMERAESKGDN >NZ_CP041667|2078196:2119060|2110921_2111227_-|WP_143930158.1|head,tail|DBSCAN-SWA MTQEIILDHLREVKENLTEEDLTLIQAMQAAAVSYCVGQTGLTKEELDENEDITIAVLTLISDMWDNRSMSIDRSNVNRVVDTILGMHCINLLPTPDGEAS >NZ_CP041667|2078196:2119060|2088754_2089762_-|WP_143930131.1|DBSCAN-SWA MTIREQLEKRELEFLSPYAALSSKSKGRKRSEKECDIRPVFQRDRDRILHSKAFRRLKQKTQVFLLPKGDHYRTRLTHTLEVSQNARTIAKALRLNEDLVEAVALGHDLGHTPFGHAGETALNRLNPAGFQHNTQSVRVVECLEKQGEGLNLTWEVLDGIQNHKSSGTPHTLEGQIVRLSDKIAYINHDIDDAIRGGILEEEDIPRKYTDVLGTSTKIRLDTMIHNVIINSMDQPQIRMSPEVYEATMGLRGFLFENVYKNPTAKGEEKKAINMITNLYQFYIEHLELLPEQFLQMMEEKGAKKEQIVCDYIAGMTDTYAVKKFEEYFIPESWKI >NZ_CP041667|2078196:2119060|2109307_2109850_-|WP_143931263.1|tail|DBSCAN-SWA MKNVHYAVKNTTGEQAGTTYATPVAIPGAVSISLEQQGELTPFYADGVKYYTSASNGGYEGDLEVALIPDSFRKDVLGEVEDENGVLFEDANAQTVEFALGFQVDGDKNPTLFWFYNCTATRPSVDATTTEDTKEPNTDTLTISCGAADDGTVRAKTTDESYAAVKDTWFTKVYEKAEAA >NZ_CP041667|2078196:2119060|2093922_2094549_-|WP_143930136.1|DBSCAN-SWA MSNEKKGLKADFNLITILIIPIAIAINFIVGNLVLTLKLPLYLDSIGTFLVAILAGPWVGGLTGLLSIAINSIADPSLFPFALSAAATAVLAGILARKKMFTSFGKFVISAILVAVLAVIMSVIISYAFFGGFDSSGNSIMIGAMMSAGIPFWPAQVIGNFISEVPDKFISLLVPYLVIRGMSDRYLYKFSNGSVFINARKEKKEAGK >NZ_CP041667|2078196:2119060|2085763_2086879_-|WP_143930129.1|DBSCAN-SWA MEESMAKFEEKLKELVALGKKKKSILELQEINDFFSDMELDAEQMERVYDYLEANNIDVLRIGNDDDDDIDDADIVITDEDEVDMEKIDLSVPDGISIEDPVRMYLKEIGKVPLLTADEEVELAKRMADGDEWAKKRLAEANLRLVVSIAKRYVGRGMLFLDLIQEGNLGLIKAVEKFDYHKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLIRVSRQLLQELGREPTPEEIAEELDMPVERVREILKISQEPVSLETPIGEEEDSHLGDFIQDDNVPVPAEAAAQTLLKEQLDEVLDTLTEREQKVLRLRFGMNDGRARTLEEVGKEFDVTRERIRQIEAKALRKLRHPSRSRKLRDYLD >NZ_CP041667|2078196:2119060|2082530_2084201_-|WP_143930126.1|DBSCAN-SWA MAEKRSYRVTVGEETRIYQEGTSYRTIAKDFQKNYKYDIVLAFVDGRLQELHKPLRFDCEIRFETIGGSVGHKTYKRSMSLLLVKAIYDVADREAIEKVRIHYSVSKGYYCTIEGKVSPDEAFLGLVEERMRQMVEEDLPIQKRSIHTDEAIELFHKHGMYDKERLFEYRRVSKVNIYSLNEFEDYNYGYMVPSAGYLKYFKLYPYDEGFVIQMPDMSDPAKVPDFQPQNKLFQVLKESTKWGDMQGIETVGALNDKITGGEATDVVLVQEALQEKKIAEIAGEIAASPQVKFVLIAGPSSSGKTTFSHRLSVQLRANGLVPHPIAVDNYFVEREQNPRDETGAYDFECLEAVDVELFNQQLKELLAGQEVVIPTFNFVTGHKEYGNKPKRLGANDVLVIEGIHCLNPKLTESLSNENKFKIYISALTQLNIDEHNRIPTTDGRLIRRIVRDARTRGASAQKTIRMWSSVRRGEERNIFPYQEEADVMFNSALIYELAVLKPYVERLLFGIDRDCPEYLEAKRLLKFMDYFVGIGSEMVPMNSLLREFIGGGCFRV >NZ_CP041667|2078196:2119060|2100965_2101283_-|WP_143931262.1|DBSCAN-SWA MDVDGEYGPLSKAKLEGAGGSTGRKAEAAKSFDKSLAGTYTVNTKTDPLTLRSGAGTSKKRLTSMPKGAKVQCYGYYTKVGSVKWLYVVYDGVTGFASAEYLKKQ >NZ_CP041667|2078196:2119060|2102081_2102315_-|WP_143930146.1|holin|DBSCAN-SWA MFDWKERFKNKMFWLAIIPAVLLVIQTVAAVFGYTLDLGDLGNKLLDVVNAIFSVLVIAGIVVDPSTPGVGDSNVDK >NZ_CP041667|2078196:2119060|2099180_2099363_-|WP_143930140.1|DBSCAN-SWA MAYSEAQKRASRKYNENNYDRLYITIGKGKKDIIKKKAEDQGMSLNEYVVKLIEKSLETY >NZ_CP041667|2078196:2119060|2104508_2105567_-|WP_143930151.1|DBSCAN-SWA MEWSVRIRHYSFSRKERAEMELILADKTLRDIRQIADADVDIDIGGDNDYEIKIPYEAWKDDLDFGNVIYIPGTEYGGIIGEVNTNTELNTVSMLGRTWRGSLDKKVIQPPSGQDYKKVSGELNTVLSGLIEEQFGNYFVVSSEDTGVNVNDYQFERYCTLLYGIEKMLKSVGHRIQIKYVQQERGQPGYVEVAAVPIVDYSNEIELSQDSEINFTFKNKRNGVNHLICLGKGELQERQVIHLYVQEDGTIGNTQYYKGVEEIAEVYENTSAETEELRENGVEKLRELMNSTDFQMDVASIGINVDIGDIIGGRDYITGLYAAKPITEKIYKITDGKESIEYSIEGDDEQGN >NZ_CP041667|2078196:2119060|2103756_2104461_-|WP_143930150.1|DBSCAN-SWA MKHQGTWGSGGYILQTGQMLEPQVQSSNEIRIRDGAMMIQGALSTVKVNSYDPVTIQNGTQGMKRIDLICWQYTYDAEQDVESAEWVVIQGTPAETDPQQPAYTDGDIQQGDSPVQVPVFAVELDGINVTGVTTLLPTAPTLEELNDKLDEKIIVQRYSNRINVSNGQNSYTAINVNIPDGYTIIGISAYALHATTLCGGGAYDSNNNRIVIPFSAHDSVTPETFYVDLTLMVE >NZ_CP041667|2078196:2119060|2114593_2114827_-|WP_143930162.1|DBSCAN-SWA MIKPYRDEIVQQIKEAGQELIERAESMVGNGLDKITDISIDIHLGTMTDVTGWPEITIHTSVANKKTLERWTKREVE >NZ_CP041667|2078196:2119060|2084215_2084995_-|WP_143930127.1|DBSCAN-SWA MICRDIIERIEKDYPKSYAMEWDNVGLLAGRMEKEIKCIYIGLDATDQVIEEALRERADLLLTHHPLIFQGLKRVTDQDFIGNRVLKLLQGDISYYAMHTNYDAARMGELASKRLGWKRGRALEPVSEEEGGPGIGQIADLEEETTLEELGRQVKEAFGLPDVRVFGDPKMKIRRAAILPGSGKSGIGAALEQEAQVLITGDIGHHDGIDGAAQGLAVIDGGHYGIEHIFVDDMRRYVEEHFPGVQVKTEPVRHPFWTV >NZ_CP041667|2078196:2119060|2111543_2112749_-|WP_143930160.1|capsid|DBSCAN-SWA MKYNIQFFAEKKNNIKALRETRAEKVEELEMLYEKLEAEQRAITEEEEKQIGAIEGEIEQIDKTIEILENVKSRLAKTNEGKREVVDGVKEEGAEQRAQEEEKAFADYLRGVVAGEVRADTNMTLTDNGAIIPTTIANKIIEKIYDISPILERSTKYNVKGNLTIPFYPADSNDVEMAYATEFTDLTADTGKFGSITLNGFLAGALVKISRSLINNSQFDVTGFVVDHMAYQAARWIEGELLKGTTSKIEGLSSVTQGVTAAAATAITADELIDLQDSVKDAFQQNSIWIMSSKTRTAIRKLKDDVGRYLMQDDITAPFGKTLLGKPVYVSDNMPEMAASAKTIYYGDMSGLATKITEEFEVQVLREHYATQHAIGVVGWTELDAKVENEQKISVLTMAES >NZ_CP041667|2078196:2119060|2092294_2093143_-|WP_143930134.1|DBSCAN-SWA MDSIQIKNLTYRYPTSEADVLKDVSFTVKKGELCAIVGANGSGKTTLCNAIRGFVPKFYKGEISGEVLVNGRDVKEDPDGKNALEIGFVFQNPFTQISGIASTVYEELAYGLENMGVEPEEIRQRIERIMELTKIQEFRDRDPYQLSGGQQQRVALAAILVMDQDVLVIDEPTSQLDPQSTDNVFEIIKLMKSMGKTIVLVEHKMEQIAEYADQIIVLDGGQVVMEGTPKEVFSDPDCLKYHTRLPQSTRIALELKKEGVNLSEIPVTVEETVGMIRTAMEE >NZ_CP041667|2078196:2119060|2102330_2102639_-|WP_143930147.1|DBSCAN-SWA MDEYVTRVEHEEFVRRMQDEHKRIHYRISDSEKMVNKIYDLTLSVERLATSIETMTKEQKEHQERLEALENRDGETWRKVKWYILTLAIGAVAGAVFTMIGL >NZ_CP041667|2078196:2119060|2099356_2099788_+|WP_143930141.1|DBSCAN-SWA MPSQAPPIYVYINTIKILAQVYFSEKSIDNILALVYNKIIKGTEETKMLKGSEKQIAWAEDIIKEARETVKDNIDFIKKLQEEHGLKVRQDELEAYELCGKQMEEMLANIDSASKIIDMRDQLSSANINKMVSEYCMRNKNRK >NZ_CP041667|2078196:2119060|2095570_2096479_-|WP_143930138.1|DBSCAN-SWA MKKIVVVGSLNMDCVVETPAMPKAGETIAGRSVAQVPGGKGANQAYAIAKLGGDVQMIGAVGTDSCGTALKENLECVGVRTEGLAVLEGESTGQAFITVDDQGENSIIIIAGTNGLVSKEMIEENRHLIEESDIVIMQLEIPLEVVEYVKELAVELGKLVIVDPAPAVEGIPDHFWKGIDFIKPNETELAILTGQERNGLEDLKEGAGQMLDKGVKNVLVSMGGEGCLLVNKKEARFFPAHKVKPVDTTAAGDCFTAGFALALSQGKTCEEAIAFGQKASAIAVTRKGAQTSIPTMEEVESC >NZ_CP041667|2078196:2119060|2102980_2103400_-|WP_143930149.1|DBSCAN-SWA MKAVFNDATELTIQAAAIVGNLLQIKTVSATREELRTKFSDEFACRKIQIEERGQIIATYERYTELYRIEEYTGAILGVAMYRVGETPEERLDGMDSRVTNVEQALQIIVTGEGGGENDNEGTGETVPAAAGNADYDID >NZ_CP041667|2078196:2119060|2078196_2079330_-|WP_143930123.1|tRNA|DBSCAN-SWA MYELIKRDGLAKRGRLHTVHGVVETPVFMNVGTAAAIKGAVSTEDLEEIGTQVELSNTYHLHVRPGDQVVKKLGGLHKFMVWDKPILTDSGGFQVFSLAGLRKIQEEGVYFNSHIDGRRIFMGPEESMQIQSNLASTIAMAFDECPSSVADRDYIERSVARTARWLRRCKKEMERLNSLPDTINPRQLLFGINQGGICEDIRIRHAQEIAELDLDGYAVGGLAVGESHEDMYRILEAVVPHLPVNKPTYLMGVGTPANILEAVDRGVDFFDCVYPSRNGRHGHVYTNQGKLNLFNAKYELDARPIEEGCQCPVCRRYSRAYIRHLLKAKEMLGMRLCVLHNLYFYNTMMKEIRGAIEAGNYQEYKKKKLSGMTAGNE >NZ_CP041667|2078196:2119060|2110580_2110922_-|WP_143930157.1|head|DBSCAN-SWA MDSGAYRGLVTFQKYISAYDEIGNPSGEWKEERKAYAYVNGLSGAEYWEAANVHQENTVEFVFRWHPFFEEMNTKQYRILFRGAIYNINLIDNIQFRNKTVKIKAVTKNGADN |
47 | Lactobacillus_phage(21.05%) | holin,portal,protease,head,terminase,capsid,tRNA,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2839226 : 2847563
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP041667|2839226:2847563|DBSCAN-SWA ACTAAAACAATTCATCCAGTAAAATATAATATTTTATTTTCTCCCAATCAGGCTTGATCCCCAGTAAGTCAAAAAATAGCTCGACATACTGTTCTTCCCCGATATCCTCCCTGATCGACCGGACGCAGAAGGCAATGTCATACCACTTGTCCGCCCTGCCGCTTCTCCCAAGATCAATAAAGCCACTTACTTTGCCATCTTTCACAAAGATGTTGCTGTCTCCCAGGTCGCCGTGGGAAAAGACAAGTTCCTCTTCGGGCTTTTCCGTCTTTAAAAAATCATACAGCTCGCGCGGATCTTTAAATGGAGTGTCTTCTTCCCAGTTTTCGCAATCCACATCGGCCAGATCGTTATTCAGTAAGTAATCCAATTCGGCTAAGCGGCTGTCTAAGCTATTCGTATAGGGACAATCCGATATGTCGATGGAGTGAAAGAGCCTGATGCACTCCGCATACAGCTCGATAATCTTTTCAGGGCTTTGTTCATCTTCATACTCTTCCGAGCAAAGGACGCCATCGGCCTCACTCATGAGCAGATTGCTCCAGCCATCATGCCGTTCAAAGTGCAGGACCTTTGGAACAGGCAGCTTTCCTTCCAGCCATAGCATCATGTCCTTTTCCCGTTCCACATCATAGGTGGTCCCTTTATACCGGCTGTCCGTCATTTTTAAATATAGGTTTTCATTTTCTCCCACCAGCTTATATACCTTAGCAGGAGACATTCCTTCCGTATCTTTTACGCAGCGGTATTTTTCGATCAGTTTTTTCAATTCCGGTGATATTCTCATTTTAGCCATTTATTATTTCCTTCCTCTTTTCTACAGTATTTAAAGATACCCCAAGAAGCTAATTATAACAAGACGAACTCCAATTCACTGTTCCTTGCATTCTAAAACCTTAAATACCAGAAAACAGCTTTTTCAAAGTTGTTTTCAAAGTTGGCGTATAACATAGTATCGACGGAGCCGATTTTGAAACCACAATTATGATAGAATTTACAAGCTATAAGGTTATTGTCCTGGGTTTCAAGCATTAGTCCATGCAAGTTTTTATGCTTTGCCCATTCTATAGATATATTGATAAGCGCGCTGCCTATGCCTTGCCCCCTGAAATCCTTACATACGGCGATATCTTCTATATAAGCGTACCGGTTCCAATTTTTTCGCAGTTTAACTTTTCCGACGCATTTATCGTCTTGGTAGTAAAGATATATTATCTTATCAGTATTGTCAATATATTCAAGGCAATCTGCCTCCTCATCCTCTTCATCCTCTTCGTCTTGGTAGCTTTTTAAATATGGCGCTTCATAGAGTAATTCTGTAAAGGTCCAATTCTCGTTTTCATACCTCGGTATAATCTTACCTATCACCTCAAATGGTTCGCTGGGTTTATCGATATCTTTCAGGTGCCCTGCTTTCATTTCTGTAATCACTGTTCCCACCTCTCTTCTATATCAGCGGCATATGTGCTATCCAGGCAGCCGGTTTTACCAGTGTATTTTTTATACATGTCCCTGGTATATTTTGTTATATTCCTATCATACTCCGGATAGGCATAATGAAGCCTTTCCGCCACCTCACCGGATACCGCCCTGAACAATTGATGGCATAGAAATAATGCTTCCCATATGTTTTCATAGGAATCCATCCGGTAGGTGGACAAAAGTTTCTCCCACAAATCCTCGGATATATACCTTTCAATAAACTTATAGTTCTTGCCTACACTTAATTTAAAGCCTGTTTCGATGCCGACCTTCCATGATATCATTCTCAGCAGCTCATGGCGAACAATCTGATTAAAATGATCAATAGCAAATAAAATTTCCTTACGGCACAGTCCCTTAACCACATAGGTAGTGGTGTTCCAGAATTCATTGCAGCAGTCATCGTATTCCCGGGCACTGGGTTTTCTTATATGATAATCCATGTCGGTCGGAACTACAGCTTGCTGAATCCTTCCATCCTTATCCAGAATAATCTTTATCAATTTATCGTCATTCAGGTAGTTTCCCAACTCTTCCAGGGGCAATAAGGTAAGGTCTATTTTATTATAATCATCAAAAAGTATTATATAGGAGTAGCCTTTCTCTTCAGCCGGGAATAGTTCCATATCCTCCGGTTTTTGCATCATTATAATATTCCCGAAGCTTTTAAGCCATTCATCCTTTAAAGTAAAGGATTCCACATCTGTTACGAAGTATGTGACATCATAATCCTGGAATTCGTCCTTAGGTATATTAATATTTGCGCGTGACCCCTCAAGGGTCACAATTCGAATACGTTCATCCTGTTCTGCTAAAGAAAGTACTAAATCCATCATTTCTTTTTCTGATCTCATTTACATACTCCTCGTTTATTTTTTCTATATTATTACATCTTCTTTTTTGCCGATACAATCAGCATCATTGGGCGTCGCATTTCATCCGCCATCCCCGGAATATCCATCATGTTCTCTGGCGGCTGTGGCTCCACAATCTGATTTATTATAAAACTATTTGAAAGCAGTGTATTTAGATATGTGGTCAGTGTTCTATGATATTTTGTAACCTTTTCTTCCAAAAACATAGCTGTCCGTTTGCCCTCATAATAATAATTGTCCACCGGGAAATGCAGTATTTCTCCTTTTTCGTTATAATACCAGTCTTGTGTTCCATGAGCAGTAAAAACAGGATGTTCAACTGTAAAAACTAAATTGCCACCAGCCTTCAGCATCCTATATATCTTTTTTATTAAATTCTCATAGTCTGCTACATAATGAAACGCAAGCGAACTTAGTATTACATCAAAGCTCTCCTCTGGGAAATCCACATCTTCTATGGCACAGCATTCATATTCAATCTGTGGAAAATGGGTTTTTCCTTTTGCTACTTCGAGCATTTTATGAGAAATATCAACACCTACTACAGAGGAAGCACCGTTTTCCATCGCATATATACAGTGCCATCCATAGCCGCATCCTAAATCAAGCACACGCTTACCCTTAAAATCAGGTAGCATCTTTTTCAAAGTCTCCCATTCTCCCGCACCAGCCAGTCCTTTCTGCGAGCGACTCATTTGACTGTATTTTTGAAAAAATATATTATCATCATATTTGTTTTCTTTCATCTGAACTCTCCTCGTTTAAAAAGTTATTTATCTCCGATACAATTTCATTCACTTCATTATAAAGCTTCTCGGTCATGTCGTAGCATTCAAAAAGTGAGATACCGAGTACTTCAAAAATATGATTTACCTTCTCGGTATATTTTTCAGGTTTATGTTCAAAAGTTTCAAGCAGTTTTATAGCTTTCTTTTCGTTGATACAATAAGCATTATTACATGCAAATAACACTTGATTTAAACATGAAACTATACGAAAAACATGACCCGCAATATAATATTTATCGTCTGTTCCCGAATTTGCTTTTACAAACATTAAAGAGAACCCTGCTTCAAACATAAAAAAGTTAACTAAACTTTTCTGCAAAGCATTGGGATAAGTTTCTGCCTGTTTTTTTAATTCGCATAAGCTTTCATTCTTAGCATATAGTATTTTGCTAATCGCTAATTCTCCTCGATACATTGCACTAATATAACCATGGGGATGCCCAGTCTGATAATTGGCAGTAACAATTCCGTGCTCTGTATCTTTCATTATTTGTTCCACACGTTTAATATCACGTAAAATTAAATCCACATGATACCCGTTTATGACTAACCATCCGCCGCCATTAATCCAATCACCCCATGCTCCGGGAGGTACAACAAGGTTATTTCTATGCTCATCATCCAGCTTTGTAGCGAATTGATTAATAGTATTTATGTCAAATGATTCTGAATTGTAATAGATGCCGATATCTATATCCGAATCCTCTGTATGGGTGCCCCTTGCACGTGAACCACCTAAAACAATACCTTCTATATAAGACAGAGAGGATAATTTCTCTGCTACTGATTTAATAATATTATCTACCATCAAGCATGCTCCTTTTATCGATTCACTGCTGTAAAAATTCCAAGCCAAATCTTTTGCTTCTACTCAACATATTTATAATTTTTTTAATTTCACAAAAGATAGCTAATAAGCACATTATAAGAATGATTCTGATTTTTTTCTTCACTTTTATTTTTCTCCATTTCAGCACATTGTTTTATTTTTGGGGTTTTTCGATTTCATCATTTTCTAATTCAATCACATCCAGAATGCCACAATTCAAGGATTCGCAGATACGAACCAGCGTTTCCATGCTGATATTTTTCCCTTTTCCCATATTAGCAATCATATTTGTGGTAAGACCGGCGGCAATCCGCAAGTCCTCTTTTCTCATATTGCGCTCAACCAGCGTATGCCAGAGGGGTTTATAGCTTATGTGCATTTATCCTCCTGCCTTTCTCCCGGTTCCGGGGTTCCTGCTCCTATTATATCAAAGTTCTTGCTATTCCACAAACATTTCTTGATACAACCGCCGGTTCCGTCTGGATTGTGCTTTTCCTTTCTTCTCTTGATTACATCTACTTAGCCATGAGCTGCTTAATGCCTTGTGATTTTGCACCCGGATTGTCGTTTCCATATCCCTCCAAGAGATTGATAACGCCCCAAATGCCTAAGCCTGCGCCCAGTGCGATAACAAGGGTCTGTAATACGTCGATAGCAGAGTTAAAAAATTCCATAAAGCTCCTTTCTGCCGCAAATGCGGCTTGCCCTATAAAAGGGCATAAAAAACGGCGGTCAGTTTTCAAACTGCCGCGGTCATAAGTCCCCGCGCTGCCTAAAGTTCTGCCGCGCGGCGGTATTCAGTTGTCAGTGTGGGCGATAGGAATTTTTGTAGGGGCGCGTATCATGCAAGTTGTCTTTTTCTTTGTTACTGCGTCAATTCCTCCTTTCTTTTCAGCCGGGAACACACATTTTTCGTCATTCCAAAAGATAGCTCCTACGCTGCATATACTCGGTAACGTAAAGGGTCTGTTCCCCTCCGCGTTCCTGCAAATACTCCCGCAGAGCGTTTTCTGCATTTTCTATCATCTTTGTAAAATAATGTTCACGACGTATATCATCAGTTTTTGTTTTCTGTCTGCTCATAGTATAACCTCCCGTAAAATCAAAGTTCCCGCCGCACCCGTTTCTGCTCATACAGGGCTTCACACCGTTTCACAAACTGCTGGATAGCTGCCTGCCGTTGCTGTTCCCGTCGCTCGGCAATGAAAACAAGCCCGTTTATGGCTTCTGTGATTTTCTTATCTTTTTTCTTCTGCATACGTCTATTCATGGTCGGTGTCCTCCTGCAAATCTGCTTCGCTGATTTCGTAATAGTCAAAGGGTTCGTCCGGCTTTACAAGGGCGGGTCTGCGCCTTAAATGCTTTTCCATATCAAAGGTATTTTTCTTGTCATAGTCGGAAAGATATTTATAGTTGGGGTGCTTTGTAATGTCATACTTATCAGAGAAGAACGGGCGTACCCCTCGTAGTTGTAAGATACATTTCCCGCCGTCCATGACTGCAATTTCATCTTCGGTCATAAGCTGCTTGCCTAACTTCTGATAGTTCAGCCCATGTGATACCTCACGCCCCCGGTTCTCGGAAGTGTTGAAGCTGTCAATGGTTTCTTTCCCCAAGATTTCCGACATTTCTTTGAGGGTGGTTTTCTCCTTGCCGCCCAAGAAAAGGGTGGTGTCGCAGTTGCCGGCTATGGTATCGGCGTTATCTTTGTATATCGCCTTTAGCTGCGACTGGCTTTGCAGAATGATTGAAGCAGAGATTTCCCGGCTTCGGATAGTGGCTATGAGCTTTTCAAACTTCGGTATCTGCCCGATATTTGCAAACTCGTCTAACAGACAGCGTACATGGACGGGCAGCCTGCCGCCGTATTCATCATCTGCCTTGTCGCAAAGAAGATTGAAAAGCTGCGTGTAAAGAATACTCACAACAAAGTTAAAAGTGTCGTCGGTGTCGCTGATAATGACAAACAGGGCGGTCTTACGGTCGCCTATGGTATCAAGTTCCATTTCGTCGGTTTCCATAAGGTCGCGCAGCTCTTTAATGTCAAAAGGGGCAAGCCGCGCCCCGCAGGAAATGAGGATAGAGCTTCTTGTCTTTCCCGCAGATAACAGGAATTTCTTATACTGCCGGACAGCAAAGTGTTCCGGGTCTTTTTCCTCCAACCGTTCAAACATAAGGTCAACGGGGGACTGAAATTCCGGGTCGTCCTCGCGGGCTTCACTGGCATTTATCATTTCAAGCAGCGTCGTAAAATTCTTTTCTTCCTCCGGGGCTTCGTACCAAATGTAGCCGATAAGGGCGCAGTAAAAGAGCCGTTCCGATTATAGTGATAGGTAATCCTTTGAAGTTATCGCCGGATTTTCCCCCACTTCACACCGGACGTGCGACTTTCACCGCATCCGGCGTTCCATCGATTGTTTTCACAAGCATTTAATCAATCTCCAAATGAAATACTATTATTCTATTGCGGACAAACCAGCCCGCTCACGCAGACTGTTCACTTTTAACATCAGTTCCTTTGTTGTTTTGAGAGCATAAGCTATCGCTTTCAGCTCATTCATCGGGCGTCTTTGCAGAAGCAGAAGGAATCTCCTGTGGAACAAAATCATGTTTTGATAGCGTTCCCTGCCGCCAAATTCTGCCGGAGTTTTCAGCCAGCATACAATGTCAGCCGCATTTTCAAAGAGTTCTCCACTTAGCGCACATTTTCCACGTTGAGCAGAGAATAATGAAATCCTGCTGTCTGTGAGTTCTGTGCTTCGACCCATAGACGGTTGATTTCTCAAACCCGCCAGAACAAATGAGTTCAGGCTTAGGTTTGTGTGTATCAATGTCCGTCCGTCGGCCGTATAACTGCATACCGCTGTTTTCTTTGCCATCGGTATCTTGTTCTTGATATATGCGATAGGGTAGATAGGTTGGTCGATTCCTGATACATACCGCACCATTGCCGACTTACCGAACCGCTCCTTTTCGCTCTCGGTCATTGCACCGCCTTCTCGAACAAGTTTGCACCCGGTTTCTGTATTGAGCCGGTTGGTTAAAACTGTCATGACCTGCCTGTGGATTTTTCTGCAATCAATACTGATACAGGTAGCGAGCTGGAAATAATTCTGTATTCCCAGAACCATTGAATTGAACAGCCTTATCTCGTCTAAAGGTCTTTTTCCTTTGGACGGTCTGGCAATCCGCTTCGCTTGTTCCACCAGCTTTTGCCTTTCCAGTTCCAACTTTTTGTCACAGATATGAGATTGCACAGCGTACCTACCGCTTTTTGGTCTGACCCGTATCTTGAATCCGAGAAACTCTGAATAGCGTTTTCTGACATTGACTATCCTTGTTTTCTCTGGCGATACTTCAAGCCTTAGCCTCTCAGTTATCCACGCTGTTACAGCTTCTTTTGTTCTCAAAGCGTCCTCTTTATTCCGACAGAAGATTCTGAAATCATCTGCATATCTCACGATGTACATTTCTTTCAGTCCCGTTTTCCTCATTTTGAGAAATGCCTTACTCCGGTCAAAAGTAATCGAGTTTCTGATTTTTCTCTCATACCCGTATTCCTTTACAAGAGGGTGGTTCTGCCATTGGCTCTCTACCCAGTTATCCAGTTCATTCAGCACGATATTCGCAAGTAGCGGTGAAATGATACCACCTTGCGGTGTGCCTTTGTCTGGAATGAGGGTAGTGCCGTCTGGCATTCTAATCGGTGCTTTCAGAATCCGCTTGATAATAAAAATCAACTGTTTGTCGTGTATTCCCAAAGCCCATATCTGCCGCACCAGCTTACTGTGATTTACATTATCGAAAAAGCCCTTTATATCAAACTCAATCACATAATGCAGATTCATTCGCTGGAGCATGGTATAGGTTCTCTGCATGGCGTGTTCTACGGAGCGGTTCGGGCGAAAGCCGTAGCTATTGTCGCTGAATTTCGCTTCGCAGATAGGCTCCATGACCTGCTTGATACATTGCTGTATCAGCCTGTCCCAGATACAAGGAATACCCAACGGGCGGGTCTTGCCGTTTGGCTTCGGAATGTCTTTACGCCTGACCGGTTTTGGTCGGTATCCGTGCTGACTTCCTGTGACAATAAACCTCACTTTCTCCACAACTGTTTCGGGCGGTAAACACCCGATGTCACTGATGTTTCTGTTGTCTGTTCCTGCTGTGTAGCTTCCTCCGTTTGCTTTGATGTTCCGATATGCCAGAAGAATGTTATCCCGGCTTAGGATTATATCCATAAGGTCGGTAAAGTTGTCTCCGTCCAGACTTCGGCGATAGAGTTCGTCAAAAACTGGCTGCATACCGTAATACTCGGCGTGCCGTAAATCGTCTACACATAGATTTTTGCTTTTCTTTGGCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP041667|2839226:2847563|2840113_2840656_-|WP_000627290.1|DBSCAN-SWA MITEMKAGHLKDIDKPSEPFEVIGKIIPRYENENWTFTELLYEAPYLKSYQDEEDEEDEEADCLEYIDNTDKIIYLYYQDDKCVGKVKLRKNWNRYAYIEDIAVCKDFRGQGIGSALINISIEWAKHKNLHGLMLETQDNNLIACKFYHNCGFKIGSVDTMLYANFENNFEKAVFWYLRF >NZ_CP041667|2839226:2847563|2845655_2847563_-|WP_143929064.1|DBSCAN-SWA MPKKSKNLCVDDLRHAEYYGMQPVFDELYRRSLDGDNFTDLMDIILSRDNILLAYRNIKANGGSYTAGTDNRNISDIGCLPPETVVEKVRFIVTGSQHGYRPKPVRRKDIPKPNGKTRPLGIPCIWDRLIQQCIKQVMEPICEAKFSDNSYGFRPNRSVEHAMQRTYTMLQRMNLHYVIEFDIKGFFDNVNHSKLVRQIWALGIHDKQLIFIIKRILKAPIRMPDGTTLIPDKGTPQGGIISPLLANIVLNELDNWVESQWQNHPLVKEYGYERKIRNSITFDRSKAFLKMRKTGLKEMYIVRYADDFRIFCRNKEDALRTKEAVTAWITERLRLEVSPEKTRIVNVRKRYSEFLGFKIRVRPKSGRYAVQSHICDKKLELERQKLVEQAKRIARPSKGKRPLDEIRLFNSMVLGIQNYFQLATCISIDCRKIHRQVMTVLTNRLNTETGCKLVREGGAMTESEKERFGKSAMVRYVSGIDQPIYPIAYIKNKIPMAKKTAVCSYTADGRTLIHTNLSLNSFVLAGLRNQPSMGRSTELTDSRISLFSAQRGKCALSGELFENAADIVCWLKTPAEFGGRERYQNMILFHRRFLLLLQRRPMNELKAIAYALKTTKELMLKVNSLRERAGLSAIE >NZ_CP041667|2839226:2847563|2841593_2842328_-|WP_000662263.1|DBSCAN-SWA MKENKYDDNIFFQKYSQMSRSQKGLAGAGEWETLKKMLPDFKGKRVLDLGCGYGWHCIYAMENGASSVVGVDISHKMLEVAKGKTHFPQIEYECCAIEDVDFPEESFDVILSSLAFHYVADYENLIKKIYRMLKAGGNLVFTVEHPVFTAHGTQDWYYNEKGEILHFPVDNYYYEGKRTAMFLEEKVTKYHRTLTTYLNTLLSNSFIINQIVEPQPPENMMDIPGMADEMRRPMMLIVSAKKKM >NZ_CP041667|2839226:2847563|2843714_2843873_-|WP_025579719.1|DBSCAN-SWA MEFFNSAIDVLQTLVIALGAGLGIWGVINLLEGYGNDNPGAKSQGIKQLMAK >NZ_CP041667|2839226:2847563|2840652_2841561_-|WP_143930740.1|DBSCAN-SWA MRSEKEMMDLVLSLAEQDERIRIVTLEGSRANINIPKDEFQDYDVTYFVTDVESFTLKDEWLKSFGNIIMMQKPEDMELFPAEEKGYSYIILFDDYNKIDLTLLPLEELGNYLNDDKLIKIILDKDGRIQQAVVPTDMDYHIRKPSAREYDDCCNEFWNTTTYVVKGLCRKEILFAIDHFNQIVRHELLRMISWKVGIETGFKLSVGKNYKFIERYISEDLWEKLLSTYRMDSYENIWEALFLCHQLFRAVSGEVAERLHYAYPEYDRNITKYTRDMYKKYTGKTGCLDSTYAADIEERWEQ >NZ_CP041667|2839226:2847563|2842308_2843178_-|WP_000228166.1|DBSCAN-SWA MVDNIIKSVAEKLSSLSYIEGIVLGGSRARGTHTEDSDIDIGIYYNSESFDINTINQFATKLDDEHRNNLVVPPGAWGDWINGGGWLVINGYHVDLILRDIKRVEQIMKDTEHGIVTANYQTGHPHGYISAMYRGELAISKILYAKNESLCELKKQAETYPNALQKSLVNFFMFEAGFSLMFVKANSGTDDKYYIAGHVFRIVSCLNQVLFACNNAYCINEKKAIKLLETFEHKPEKYTEKVNHIFEVLGISLFECYDMTEKLYNEVNEIVSEINNFLNEESSDERKQI >NZ_CP041667|2839226:2847563|2843353_2843578_-|WP_033126275.1|DBSCAN-SWA MHISYKPLWHTLVERNMRKEDLRIAAGLTTNMIANMGKGKNISMETLVRICESLNCGILDVIELENDEIEKPQK >NZ_CP041667|2839226:2847563|2839226_2840021_-|WP_001096887.1|DBSCAN-SWA MAKMRISPELKKLIEKYRCVKDTEGMSPAKVYKLVGENENLYLKMTDSRYKGTTYDVEREKDMMLWLEGKLPVPKVLHFERHDGWSNLLMSEADGVLCSEEYEDEQSPEKIIELYAECIRLFHSIDISDCPYTNSLDSRLAELDYLLNNDLADVDCENWEEDTPFKDPRELYDFLKTEKPEEELVFSHGDLGDSNIFVKDGKVSGFIDLGRSGRADKWYDIAFCVRSIREDIGEEQYVELFFDLLGIKPDWEKIKYYILLDELF |
8 | Streptococcus_phage(71.43%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP041667.1|WP_143930140.1|2099180_2099363_-|antitoxin |
2099180_2099363_-
Protein sequences of NZ_CP041667.1|WP_143930140.1|2099180_2099363_-|antitoxin>NZ_CP041667.1|WP_143930140.1|2099180_2099363_-|antitoxin MAYSEAQKRASRKYNENNYDRLYITIGKGKKDIIKKKAEDQGMSLNEYVVKLIEKSLETY |
60 aa aa | NA | NA | NA | 2078196-2119060 |
yes
Self-targetings in the prophage
1. spacer 5.9|2243201|35|NZ_CP041667|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP041667 position: 2101240-2101206, mismatch: 2 tggagggggccggcgggagcacagacggaaaggca CRISPR spacer tggagggggccggcgggagcacaggcagaaaggca Protospacer ************************.*.******** |