Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP050153 | Brevibacterium sp. YB235 chromosome, complete genome | 3 crisprs | WYL,csa3,cas3,DEDDh,cas4,DinG | 0 | 2 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP050153_1 | 1679004-1679075 | Orphan |
NA
Consensus repeat of NZ_CP050153_1
|
1 spacer
spacers of NZ_CP050153_1
>1.1|1679027|26|NZ_CP050153|CRISPRCasFinder
CTTGAACGAGGTAGCCCGGGTGGGCG |
CRISPR arrays and Neighbor proteins around NZ_CP050153_1
The CRISPR arrays of NZ_CP050153_1
>merge|NZ_CP050153|1|1679004-1679075|CRISPRCasFinder
GAGGATCGATGAACGCTGGCCGGCTTGAACGAGGTAGCCCGGGTGGGCGGAGGATCGATGAACGCTGGCCGA
>NZ_CP050153|1|1|1679004-1679075|CRISPRCasFinder
GAGGATCGATGAACGCTGGCCGG CTTGAACGAGGTAGCCCGGGTGGGCG GAGGATCGATGAACGCTGGCCGA
>NZ_CP050153.1|WP_167195776.1|1677900_1678905_+|hypothetical-protein MLEGMYVCLSTAALLAAGWDRHPIKAAERCCLRRLERGRYVVTVECSDPSHNFVSAIATAPSTTLPKDSTGLRRRLEDLRILVRSYVDRLPPDAVFSHRSALIVHGLPVPYIDPGDVFAESVSPHSGVRLANMLVRRRSRDFAAQEIIEGLPVTTVVQTLLDIARDYPLAFSVAVLDSAVRSSAVTVDELRSYSVSHPVRTGTRKIVNALENVDARRESVAESICAVRFVEYSVPGFEPQIEVRDENGIHLGRTDFANERAKVIAEFDGAGKYHLEGSDPQETFERERRREYALRNEGWLVFRIRWSDLFSADLFLRIGEAVRRRLIMDDRSRS >NZ_CP050153.1|WP_167195773.1|1676179_1677493_+|UDP-N-acetylglucosamine-1-carboxyvinyltransferase MDVFRLTGPAQLAGTIDVRGAKNSVLKLMAVSLLAVGRTTITNVPAILDVRIMVELLVRLGCEVDYDATEGIVSIDVPAEVGIQADYELVRAMRASISVLGPLTARMRAAHVALPGGDAIGSRGLDMHQAGLEALGAVVHLDHGYFVAEAPDGLRGTEIVLEFPSVGATENLVMAATLAHGTTTIANAAREPEIVDICTMLVEMGAQIEGIGTSDLTITGVESLQPVTHRTVGDRIVAGTFAFGAALSAGEVTVRGVGLDIMPNIGTKLRDSGATVEDLGEISLGDGTRGKGFRVIGAARPHAIRVATMPFPGFPTDLQPFVIALNSVSDGIGLLSENLFEARWRFVQEIARLGAKVRIDGNHALVTGSESLSGAEVEASDIRAGAGLVMAALRAGGVTEVSGIDHIERGYENFVENLRSLGVDIERVEKRDVLSFD >NZ_CP050153.1|WP_025776910.1|1675712_1676180_+|DUF2550-family-protein MNPSVLLIVLLSLVGLALALIVVVTIRRRSISKLSGAFDCSINVGEEYSSRPRWRLGVAVFSVTSLDWYPVFALTRRAAFRLPRADLDILVRRKPTSGEQYSVLPEAVVVDCSYGKADGRPRSVSLAMDTESLSTMASWLESSPPGFNPTMGRFT >NZ_CP050153.1|WP_025776911.1|1675429_1675690_+|F0F1-ATP-synthase-subunit-epsilon MATLEVNVVAADREVWAGEAKRVIARTLDGEIGILPGHEPVLGVVADGEARILTPGEDTIRVKADGGFLSVENNRVIIAADQAELL >NZ_CP050153.1|WP_098730992.1|1673017_1673950_+|F0F1-ATP-synthase-subunit-gamma MGAQQRVFKQKIRSTQSLRKIFKAMELIAASRIQKAIARSQAASPYANALTRAVSAVASESNVDHVLTTESDNVKRAAVLVIGPDRGFAGAYSANLLREAEELVRLLKGEGKQVELFTVGGKAKNYYTFRDRKIEKSWTGISENPTAEVAREIGEALLENFDPEAENSGVDEIYIVFTKFVSSVTHDPEYRRLLPLEVVDADEATTGGQSAGSTDASAFPLYEFEPSAEAVLDALLPRYIDSRILSALLSASASEQASRQAAMKTATDNADDLIKTYTRLANTARQAEITQELTEIVGGADALAASAAGD >NZ_CP050153.1|WP_167195770.1|1671351_1672986_+|F0F1-ATP-synthase-subunit-alpha 
MAELTIRPEEIRDALGKFVDSYNPASSEKTEVGKVVTAGDGIAHVSGLPGTMANELLRFEDGTLGLAQNLDEREIGVVILGEFSGIAEEQNVYRTGEVLSIPVGDGYLGRVVDPLGRPVDGLGDIETVGRRELELQAAGVMDRQEVREPLQTGYKSIDAMIPVGRGQRQLVIGDRKTGKTALAIDTIINQKANWETGDPKKQVRCIYVAVGQKGSTIAGVRRSLEEAGALEYTTIVSSPASDPAGFKYLAPYSGSAIGQHWMYDGKHVLIVFDDLSKQAEAYRAVSLLLRRPPGREAYPGDVFYLHSRLLERCAKLSDELGGGSMTGLPIIETKANDVGAFIPTNVISITDGQIFLQSDLFNAGQRPAVDVGVSVSRVGGAAQTKALKGVSGTLKISLAQYRSLEAFAMFASDLDDATKRDLARGARLTELLKQGQYAPMPFEKQTVSIFAGTNGYLDEIPVDDVLRFETELHDHIERKTGIFTTIRETLKLDDDTTEELKSVLAEFTQNFASSDQSGSKAGSEDTAAASSDEVEQEQIVRQKR >NZ_CP050153.1|WP_167195767.1|1670463_1671270_+|F0F1-ATP-synthase-subunit-delta MLQSSRLSLQAVLETANSEISGGDPRQIGEETLAVVGILVENVRLRKALADSSESAERKQQLLRTLFSTRITDAVLRISDNAVSRRWARTQDLVTSLEVAGVTAVAAAAQADGQLGQVEEEIFRFARLLESNHELSRALDSQATDESKRALVSDLLGGKAQPDTIKLVEQAALHPRGLRVAKALDQYSDILAARQQRSVADVTVARPLNEAQTERLQAALSASYGRELVLNVQVDPEVLGGVRVQVGDEMMNSTVADRLADVQRKLAG >NZ_CP050153.1|WP_039210678.1|1669912_1670464_+|F0F1-ATP-synthase-subunit-B MTPVNIVASAENPLLPALYDIVWSAVCLLIVFLVVWKYVLPAFNKTLDERAERIQGGIEKAEKVQAEADQALAEYQKQLADGRAEAARLRAEAQEEGAQIIADMKAQAHSEADRIIAQAQTQIDAERQSAMVQLRSEVGTLATDLASRIVGESLTDDQRSANVVDRFISDLESNSSAQPVKGA >NZ_CP050153.1|WP_039210675.1|1669668_1669893_+|F0F1-ATP-synthase-subunit-C MDMLAAVEGSVSTIGYGLAAIGPGVGVGIVIGKTIEGTARQPEMAGALRGNMFLGIALIEALALIGIATPFFLP >NZ_CP050153.1|WP_062242329.1|1668840_1669587_+|F0F1-ATP-synthase-subunit-A MAEFFPASFLFEGTPFEMNRVMLIRIIATVAVVVLLAVWAKRMKLIPTRFQSSMELAMEFVTVGIAEDTMGKEKAKKFMPLIVAIFFGILFWNVTKLIPFLNMPGTGVIGMPIVLTLVVYVTYHWAGIAEKGLGRYLKDSLILPGVPPAMHILLIPIEFITKFVTQPFTLAIRLFANMMVGHLLLVLCFSATSFFLFDAANGFQFFGIVTFAGGMFVFILEMLIVVLQAYIFALLSCVYINAAISDEH >NZ_CP050153.1|WP_167195779.1|1679122_1679818_-|endonuclease-NucS MRLVIAQCSVDYAGRLTAHLPMATRLIMLKSDGSVLIHSDGGSYKPLNWMTPPCTVRHIEPDAERAEAGLTELWEVSQTKTGDRLVISIAEVLSDDTYEFGVDPGLVKDGVEAHLQELLAEHIETLGEGYSLVRREYMTAIGPVDILARDDGQKSVAVEIKRRGDIDGVEQLTRYLELMNRDPLLSPVEGVFAAQEIKPQARTLAEDRGIRCVVLDYDALRGMDDPETRLF >NZ_CP050153.1|WP_167200788.1|1679926_1680145_+|hypothetical-protein 
MRTSRSLPDGTWSVQTVKGNETGKVYVCPGCGRDVTAASSHVVAWRQDAPHGIEIGVESRRHWHQRCFDRFR >NZ_CP050153.1|WP_167195782.1|1680285_1681929_-|cell-division-protein MTPTEEFRTAMRGYEKSEVDSRLQQLRTEVESVRKALADARSQVINADRAKLQIAGELSEAKAQLKKAANDNAEAAGPPGSRIDHLLKIAESQARETLAQANSDAETIRNKARAEAASARARMHTESNDTLSNARSEADAIISSAELRAEETIKAAEKRAAELSATTERETNQAKEANAASAKEARESLDLELSELRATAEKEAADLRAEAKTEAEETIAAAQAQADELLKSAKARDEASKKAGNDFDVELANKRKDAESERKKRYEEAQAENKKLVEEAQARAAKADTEAKEAAERAEQTRTDAVKKADEIIADGKSRAQTLISEARATAEATIEESAAEAKRNVASAQSQVDLLTKQRKTITAQLDQLRSLFAMPGVMGGDSVDPAKAESASHATEQIADGQELEDLLADDASDAAKADDSAKSSDDAKSTDGSASTGSEDAATKGGSAESTGAKSTGPAKKDASGAQDSGSAKTSAQSADGSTSTPGSKTDSGSNTGSDDDAENTAGEDLPNGATDEDTISINAQVSGSKQNNSKTSRSRNSLR >NZ_CP050153.1|WP_167195784.1|1682160_1683351_+|acetyl-CoA-C-acetyltransferase MADAVIVAGARTPFGRLQGELSKLSAVELGAEAISGALDRAGIQGSDVEYVIMGQVLQAGNSQGPGRQAAAKAGIPMSVPAVSVNKLCLSGINTITQAAQLIRAGEYEVVVAGGQESMSQAPHMLMKSRSGYKYGDVVAKDHMDYDGLWDAFTDQAMGGLTEEANAGDREFSRAEQDAFSARSHQRAAAAQEGGAFENEIVPVTISSRKGDVTVSADEGVRPDTTAESLSKLRPSFRKDGTITAGNASQISDGACAVVVMSREKAEELGAPILAEIRSHAWTAGPDSTLQHQPSQAIKAAAEREGVAADSFDLYEINEAFAAVGLASAKDLDIDEDKVNVNGGAVALGHPIGASGARVVLTLALELQRRGGGTGIAALCGGGGQGDSLIVSVPAQA >NZ_CP050153.1|WP_167195787.1|1683840_1684047_+|hypothetical-protein MNGAKLFSVATALTLVALLGFMLAGFFPIIVDFAFGIEAVAVILAMAGVVAVSFVTVRKSMENAARHY >NZ_CP050153.1|WP_167195790.1|1684365_1686063_+|hypothetical-protein MTESKSAVTPPVGQALNDIVKRVSETRFTLRSEDFADARTAHSTLTAELNDYVLPRINRSRTPFLIAVGGSTGAGKSTLVNSLVGRSVSPAGVRRPTTGNPVVIFNPVDAKFFESEHYLPDLPRSSDPQSSMPGVVLIADENVEAGTAILDCPDIDSISETNRALSRRVLLSADLWLFVTTANRYADAAPWALLKTAAARSTSVAIVLDRVPPEANREVRHHLSSLLSETGLANSPIFSVAELELEDGLLPHAAIYPIRSWISQVGTEGTSLERIRNRTLTGAISALPARVRELADFAEKQEQAHITLADSLEKSFRSAQSGLAEVFSDGRVLHGEVNARWQDFVGTGQLFRGLEPTMARMRDRISAAVTGKHDAATPLHVAILRSAAVSLREQAIDVVDEVNAEWRNTAAGAALIEDQPELRTVGGGLEDAVKSAVSTWSDEVNALVRDIGQGKKSKARILSFGVAGVCAVVEYAAFWDPRRTRGAGQSTQQGAGVALNLAETIFGADEAAGLISSVRQRFLDAAAGIVADCRTPFDNALRLSAVPARQAGALRASGERLEVAL 
>NZ_CP050153.1|WP_167195793.1|1686059_1687640_+|ABC-transporter MNSSAQTEIRSLAEGIRHALSLSEDKLASDVRTDAQNLLDRAEDRLGLGEDFTVVAFAGSTGSGKSSLFNAVAGLEIARVGVRRPTTSRPTACVWGEGGNDVLDWLHVPERSRTWRESALDGDDQRRLHGLILLDLPDHDSTAVEHRIESDRLVGLVDVVFWVVDPQKYADFSLHSEYLTKLAENSANMVVVLNQIDKLSPEEQKAAADHLRQLLNEDGLSETNVRISSAVTREGIPEIRSILADTVDSNDAAAERLLADMQAMAKRIRRELGEPVSSPDELAGASRLAETMSEAAGVEAVAQTVHDDYIRRAYRKTGYPVLAWAQRNAPDPLGAKHGQDRDELVRASVPATTKAQSSHVRLMAHELIAESVSTMPQAWQNEAAEAEKKSTDELSDNLDSAVTAVEITRQSPGWWSLAHTLQIVFFVASIVGLLGIIASALVAAIGSGTLPTWCWIVSFGLFVIGVIGSFVTSLVAKSARAKGAKEAAAEVDGKLRDAVGRVAQSSYLNPVKTVIGEHRQAYEMLG >NZ_CP050153.1|WP_167195796.1|1687831_1688512_+|single-stranded-DNA-binding-protein MSAIPITLTGTIATEPTARTLPSGRACASFRLAVNHWRVDKSTGEFVTDGTSWFGVDCYGELASNSAMSLSTGAAVIVSGSLRNREWATEERSGISPTVVAEHIGPDLRYGTAHYKRAKATDRSQSSSGQTNTESSGPSEAIWGALAPMETPSGDDVGAGEAAGLTATGPDSAAHTDSDDDRATDETGTAGRDTGVESGGSAGTDIDASGEEDPIARDTAKAAAPF >NZ_CP050153.1|WP_167195799.1|1688611_1689355_+|hypothetical-protein MRLVPVLGAAALTLALSGCSLFGFGADDSQPQPKQTQAKPVAEVDKVLKALKPLTGDKESVPSTKKFFSTMLDAGYEPEQLEATIDESPLGNEVPSKMFGVKTDKGCVVGEIRKGKATADLVEPTESTGSCLFGEVERPEGVKAPKGEKRDEDGDSNGAGHMPGEDINGKDGETESPAPSENGSESTSTGGSEGTAGSEGSESADSSEGTAGTSGEGTSGEASSEGGSGETSSEGDTSGEAPSLGGG >NZ_CP050153.1|WP_167195802.1|1689463_1691146_+|energy-dependent-translational-throttle-protein-EttA MAEFIYTMHKARKAHGDKVILDDVSMSFYPGAKIGMVGPNGAGKSTILKIMAGIEQPSNGEARLSPGYSVGILMQEPPLNEEKTVLGNVEEGVGEIKAKLDRFNAISEEMANPDADFDALMDEMGKLQEAIDAADAWDLDSQLEQAMDALRCPPPDAEVSVLSGGERRRVALCKLLLEKPDLLLLDEPTNHLDAESVLWLEQHLQSYPGAVIAITHDRYFLDHVAEWIAEVDRGHLYPYEGNYSTYLEKKQERLQVQGKKDAKLAKRLKDELEWVRSNPKAKQTKSKARLARYEEMAAEAEKTQKLDFEEIQIPAGPRLGDVVIEADKIEKGFDGRKLIDGLSFSLPRNGIVGVIGPNGVGKSTLFKTIVGMEELDGGNLKVGDTVKISYVDQSRGGIDPDKNLWEVVSDGLDFIQVGKVEMPSRAYVSAFGFKGPDQQKKAGVLSGGERNRLNLALTLKQGGNLLLLDEPTNDLDVETLGSLENALLEFPGCAVVVSHDRWFLDRVATHILAWEGTEENPANWYWFEGNFEAYEKNKVERLDEDAARPHRVTHRRLTRD |
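The spacer headers in this record follow a pipe-delimited pattern that appears to be `>spacer_id|start|length|contig|tool`. The field meanings are an inference from this page (the start and length reproduce the Spacer_region reported in the hit table), not a documented specification. A minimal Python sketch under that assumption, with the hypothetical names `SpacerHeader` and `parse_spacer_header`:

```python
from typing import NamedTuple


class SpacerHeader(NamedTuple):
    """Fields of a spacer FASTA header (field meanings are assumed)."""
    spacer_id: str   # e.g. "1.1" (array number, spacer number)
    start: int       # 1-based start coordinate on the contig
    length: int      # spacer length in bp
    contig: str      # contig accession
    tool: str        # tool that called the array

    @property
    def end(self) -> int:
        # Inclusive end coordinate derived from start and length.
        return self.start + self.length - 1


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a header such as '>1.1|1679027|26|NZ_CP050153|CRISPRCasFinder'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-delimited fields, got {len(fields)}")
    spacer_id, start, length, contig, tool = fields
    return SpacerHeader(spacer_id, int(start), int(length), contig, tool)


header = parse_spacer_header(">1.1|1679027|26|NZ_CP050153|CRISPRCasFinder")
sequence = "CTTGAACGAGGTAGCCCGGGTGGGCG"
print(header.contig, header.start, header.end, header.length == len(sequence))
```

The derived end coordinate can be compared against the Spacer_region column of the hit table as a consistency check.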
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP050153_2 | 2079021-2079115 | Orphan |
NA
Consensus repeat of NZ_CP050153_2
|
1 spacer
spacers of NZ_CP050153_2
>2.1|2079044|49|NZ_CP050153|CRISPRCasFinder
CTGGGGAGCCTGCTGTCCGTATTGCTGCGAGCCGTCGTTGCCGCCCTGG |
CRISPR arrays and Neighbor proteins around NZ_CP050153_2
The CRISPR arrays of NZ_CP050153_2
>merge|NZ_CP050153|2|2079021-2079115|CRISPRCasFinder
CCGTAGTTGGGCTGCGAACCGTACTGGGGAGCCTGCTGTCCGTATTGCTGCGAGCCGTCGTTGCCGCCCTGGCCGTAGTTCGGCTGCGAACCGTA
>NZ_CP050153|2|2|2079021-2079115|CRISPRCasFinder
CCGTAGTTGGGCTGCGAACCGTA CTGGGGAGCCTGCTGTCCGTATTGCTGCGAGCCGTCGTTGCCGCCCTGG CCGTAGTTCGGCTGCGAACCGTA
>NZ_CP050153.1|WP_167196658.1|2075790_2077797_-|threonine--tRNA-ligase MADSIDCAGESIPWKEGLTGTEIFSTDRTVVAMWLNGEPADLSRQLQSGDRIAPITIDSDAGLDILRHSTGHVTAQAVQELFPGTKLGIGPYITDGYYFDFDVAEPFTPEDLKAIQKKAAQIVKSGQTFNRVVVTEDEARARMANEPYKLELIGDKGKGTDEEASVEVGGGELTVYENVDRKGEVVWQDLCRGPHLPNTKLIGNGFAITRSSAAYWRGDQANASLQRVYGTAWASKDDLKAYQDRIAEAERRDHRKLGAEMDLFSFPEELGSGLPVFHPKGGVIKREMEDYVRARHIDEGFEYVGTPHISKETLYYTSGHLPYYGENMFPAMSVDEVRNESGEVVKEGTPYRLKAMNCPMHNLIYRSRGRSYRDLPLRLFEFGTVYRDEASGVIHGLTRVRALTQDDSHSYVAQEDAAAEIRHLLNFVLSLLRDFGLDDFYLELSTRDEDGKKADKFIGSDEQWAEATSVLEEVAQETGLELVPDPGGAAFYGPKISVQARDAIGRTWQMSTVQLDFNQPERFGLEYVAADGSRKQPVMIHSAKFGSIERFLGVLTEHYAGAFPVWLAPVQVTCIPVADEFNDYLAEVADQLRKAGVRVEIDDSDDRFPKKIRNASKSKVPFTLIAGGEDRDAGAVSFRFRSGEQENGVPVAEAVRRILDSIETKAQV >NZ_CP050153.1|WP_167196655.1|2075188_2075794_-|HIT-domain-containing-protein MSGGDDDRSAQENPDGRLGEGIPHPEAAAGFPGEPDGFQRLWTPHRMVYIDGQDKPKGDQPEECPFCAALSKSDEDGLIVARGQAVYAVLNLYPYNPGHLLICPYRHVADYTDLTGEETVELAEFTQKAMRVIRAVSGPHGFNLGMNQGPVAGAGIAAHLHQHVVPRWGGDANFLPVIAQTKALPQVHADVQARLKKEWNR >NZ_CP050153.1|WP_167196652.1|2073149_2075192_-|S9-family-peptidase MNPEDLGTLAEYSSPVLRGHDTVITIRRPDLESNSYLSQLFSLTEDTSRRLTHSWSDSTPQCGPNWSGYLSAEKKAAPQLYVGDSLETAHQITDNHLGVAEFALDDSRSRALYVARVAEPGRYGLDESIPATEEAPRRITTASYLANGLGYTNDRPARAFLVDLAEPGLGTVGLRGASEVPLSTLLTTPDSDVHDPQFSPDGHWASVIAAVEPDRGRPDLRSTVWLLGREESRPLDLPPMSVSLHVWIDADRVLLLGNALTRDELDFVGQMPGLFIHTVSTGSTRRLTDPETVALAPIPPQIRGGAVVAAVDTDGATRIVRIGLDAAEVGIDDLEFLTDDTTVVNGFDADNETLVFTGSTPHSPAVLGRIALGGPTASGAATEMGSSVIVKEHPAPANSVLPQVLRVPGDSGTITGWLAKPHGEGPFPVILNIHGGPFAQYTHSWFDETQVLTSAGYAVVFSNPRGSGGRTRSWGTAVQGDMAKPAMADVLAVLDHALESDPSLDRSRLGIQGGSYGGYLTAMTIAADHRFRAAIVERGYLDPDSFVGTSDIGRFFTEEYTSRSREAITRQSPLAHAPQVGTPTLVMHSELDLRCPLEQAQQYYAALQRVGVDTEMLIFPGENHELSRAGQPRHRRQRFEAILDWWDRRLSGGNRTPPAEERHIPEAADPEAASASDTAS >NZ_CP050153.1|WP_167196649.1|2072539_2073019_-|NUDIX-domain-containing-protein 
MKPRKASRVVLLNERDEVLLIRAQDLLTPSHQWWMTCGGGSELGESAAQTAARELAEETGIECEPHELIGPLATRDEVFEFTEKSLRQVETYFAFRTSEDIELEDAVWTDIEKRSLLEFRWWTREELLTTTETIYPKNLLGLIDLATAGSVPEVPLVID >NZ_CP050153.1|WP_139467356.1|2071673_2072426_-|YebC/PmpR-family-DNA-binding-transcriptional-regulator MSGHSKWATTKHKKAAIDAKRGKLFAKLIKNIEVAARNGGPDPDGNPTLFDAIQKAKKNSVPADNITRAVKRGGGLDGSGVNYETIMYEGYAAGGVALLIECLTDNRNRAASEVRVAVTRNGGSMADPGSVTYNFNRKGVITVGAEETDEEAILLATMDAGAEEVKEVGEKFEIICEATDLVAVRTALVDAGIDYDSAEASFVPELEVSLDAETASKVFSLIDALEDSDEVQNVYSNADVSDEVLAELDA >NZ_CP050153.1|WP_039207386.1|2071077_2071674_-|crossover-junction-endodeoxyribonuclease-RuvC MRILGVDPGLTRCGLGVIDTLPARKAKMVAVDVLRTPSADSVDLRLGAIAEAFDTWLDTYRPDVVAIERVFARNDVSTIMGTAQASGLTMGLAARRGLPVAMHTPSEVKAAITGSGRADKKQVTSMVTRILGLDAPPKPADAADALAIAICHSWRGALSAQSTPGKNKDLTERKAGGRQGSGLTKAQQQWAEAMRRAR >NZ_CP050153.1|WP_167196647.1|2070415_2071012_-|Holliday-junction-branch-migration-protein-RuvA MISFLSGTVHRIAADHLVVLTYGVGRKVHVTPDTLAGTRHGAEIELVTSLVVREDSMTLYGFGTEDENHTFEVLLSISGIGPRLAMAILSVMGPDELAAAITNQDANALTRVPGIGKKGASRIILELENKLPKLTAAAPGPTLSFGGGNQQVVDALVGLGWKEAQAEDVVAEVVKETGADAGTSVVLKAALKVLGAKK >NZ_CP050153.1|WP_167200868.1|2069321_2070344_-|Holliday-junction-branch-migration-DNA-helicase-RuvB MTGAERERLVSGRAETAERDDEAALRPKGLADFIGQPKVREQLSLVLDAAKARQKAPDHVLLSGPPGLGKTTLAMIVAHEMNSSLRVTSGPAVQHAGDLAAILSSLEEGEVLFIDEIHRMARAAEEMLYVAMEDFRVDVIVGKGPGATAIPLDLPQFTLVGATTRSGLLPAPLRDRFGFTALLDFYSSADLLTVLQRSARMLGIDSELAGLEEISTRSRGTPRIANRLLRRVRDWAQVRGSGIIDEEAATNALRVYEVDELGLDRLDRSVLQVLCKRFGGGPVGLGTLAVSVGEEADTVETVSEPYLVREGLISRTPRGRVATTAAWDHLKMQIPANYEF >NZ_CP050153.1|WP_167200866.1|2068700_2069234_-|preprotein-translocase-subunit-YajC MLIPLALAALLIFFLFNSRRKQKARAEEIKSGLVPGAKVMTTFGVFGTVLSIDEESNQVTIESGPGTVLRVHRQAIGQIENNQAAAPVDAPGAAAPAADADADAADDEKPAITDAELDAMNERKRAEKDTTDEDTAEDVVADESAAKTEDADAAAETDADSAAADEDSTDSDTDKKN >NZ_CP050153.1|WP_167200864.1|2066517_2068629_-|protein-translocase-subunit-SecD 
MRFLWLTIITLVLAAIIAGGVIWSNATTTPKLALDLEGGTSIILEPQVSEGTDISKEQLDQAVAIIRQRVDSTGVSEAEITTQGDRNIVVNLPGNPDEETRNLVRSSAQLVFRRVALVGDPRSQEQIQKEQEKSGGEESGGDDGLSDEERKRLEDLTGADSQGDDQGEEQAPSGGGDVVKAGGTAEKKTDESSEATKSSESERKDAEKSGEGSGSSEGSKVTDSTPRPLFDPEKDAAEWQTDKIIKQYSELDCTNKKNRTGGQQKPSDEPVVSCSEDGQAKYILGPVELSGDHLADANAGYAAGANGVQTNNPAVNLSFDATGREIFKQITSDITGKQQPYNQFAIELDGLVLSAPSSNAVITDGNAQITGDFSLDEAQTLANQLKNGSLPLSFQVQSEDQISPTLGSNYLKIGLLTGLVGLLLVVVYSLLQYRVLGLVTVSSLVVAGVLTYLLLLLASWRYGYRLSLAGVAGIIIGIGMAADSFIVYFERVRDELRSGRNLLSAVEVGWDRAKRTIYASKAVNMLAAVILYILAVGSVRGFAFTLGLTVIIDVLIVFLFTHPMLQLLSRTKFFGEGHPMSGMDPRLLGVKPAAYRGALNLSIDDKDKTPEAKRREKARMRKAGMTPEDGSETPNAATTGSESEETSTTKNTKAAKSKSAKAAKTATAAGGMTIAERKAAARRAEDDDADDDSATDDGKEADK >NZ_CP050153.1|WP_167200870.1|2080249_2080612_+|TIGR02611-family-protein MLANAHTAAIYRSIVGGLGTIIVLVGLALVPLPGPGWLIVIIGLFIISSEFRWAQRLLHFVRVNVERWTQWIMAQPLWVRWTVGAVTAAFVGIIVWLTLRLTGLPDWVPDLRVFDLIGLR >NZ_CP050153.1|WP_167196664.1|2080765_2081776_-|cell-division-protein-ZapE MNAEQTLVALSDRSPQVAPEELIAGLVPPPQFEDVSFDSYRTDPAEPSQEEARNKLREFTERSTSQGFFGKLFSKGKSGGAKGVYLDGGYGVGKTHLLASAWHANEKPATFGTFVEYTNLVGALGFARARDDLSKMKLVCVDEFELDDPGDTVLMSRLMRELTDAGVKIIATSNTLPGSLGEGRFAAQDFLREIQALADQFEVYRIDGKDYRARELTAPADPLPESELDSAASQLDGVVARDDFSQLLSHLSTVHPSRYGRMVDGIDAAVWENVRTIDNESVALRFVALVDRLYDRNVHIINSGAALDKVFTEESLAGGYKKKYMRCLSRLTALSS >NZ_CP050153.1|WP_167196667.1|2081940_2082879_-|GTPase-Era MEFRTDYPEDYRAGFACFVGRPNTGKSTLTNALVGEKVAITSAKPQTTRHTIRGIVHKDDHQLILIDTPGLHKPRTLLGSRLNDLVASTLGEVDVIGFCLPADEPIGPGDRYIASQLALLDGRTPIVALVTKVDRVPKDKIAEALLAVGELADFADVVPVSAVEDFQVDTVDSVLAAHLPKSPPLYPDGDLTDEPEEKMIAELVREAALEGVRDELPHSLAVQVEEMYPREGRSEENPLWNVHVNLYVERPSQKAIIIGKGGSRLKAIGSESRQGIERLLGTKVYLDLHVKVAKDWQRDPKQLGRLGFDFNN >NZ_CP050153.1|WP_167196670.1|2082880_2084194_-|HlyC/CorC-family-transporter 
MFMFFLGAALCLIIAATLSAVDAALLNVSHHAVEEAKEDGKRSAVRVERILADLPTNINVIIFVRNFLEALATVFIALAYDSYYSVGPLMVFLTVITASVSVFIIAGVSPRTIGKRRSLAVSLNLSWVVRIVLVALKPLTRILVVLGNLLTPDKVYKDGPFVTSEQLRDLVERASESDVIEDGEREMIQSVFNLSDTSANEVMVPRTDLITVDADVSLQKTMNLFFRSGFSRIPVCGEDLDDVRGVAYLKDVARRLHLHPEEAERPVGNLARTVLFVPETKPADDLLRQMQLDSTHLAILVDEYGGTAGLVTIEDIVEEIVGEIEDEYDNGDDELVAADDGSFIISTRMSISDFAEYFDVRIDEDDVNSVGGLLSKLIDRVPIDGSHAEIEGLEIEAMEGQGRRHRITHVRVTRTHEDSRDDAQTAAAGGSGTKEED >NZ_CP050153.1|WP_167196673.1|2084196_2084685_-|rRNA-maturation-RNase-YbeY MNTEILNETDAEVDLDEVVALTEYLGDALHMHPGAELAVTMVDSAAMSELHVTWMDLEGPTDVMSFPMDQLHRGEPDKPTEGQMGDIIICPEVAEAQARAAGHSAMDEILLLTVHGFLHLLGYDHGEPEAREEMFALQRHLLLTFFAARYDGRTDIPTPTEV >NZ_CP050153.1|WP_167196676.1|2084681_2085716_-|PhoH-family-protein MNPTDSDTAAPGDGAADKDAVTDTRGAERDTVRLVIPDSIDLVAFFGPGEKNLRALEKTFDDLDIHVRANQVQVTGDPKRVEAFVSVIGELKKLHTAGHRINEETIDRVTTFSSEGAAASAVLGTNILSTRGKSIRPKTMGQHDYVKAIRNHTITFGIGPAGTGKTYLAMAMAVNALQHKEVSRIILTRPAVEAGESLGYLPGTLNDKIDPYVRPLYDALHDMVDPESIPLLIETGTIEVAPLAYMRGRTLNDAFIILDEAQNTTAEQMKMFLTRLGFGSRMVVTGDISQIDLPGKTRSGLKVVRDILDGIDDLQFCELGSKDVVRHSLVTKIVEAYDLWGNAE >NZ_CP050153.1|WP_167196679.1|2085816_2086578_-|16S-rRNA-(uracil(1498)-N(3))-methyltransferase MSLPVFRSATAAEAVVGSALTLGEDVAGHAVRVRRIGPGEVIDIVDGEGTRVRGTVTAASASEVTIDVTAVTNEDSTGPRLVLVQALAKGDRDLQAIETATEIGVDEVIPWAAERSIADWPAKKREKMAAKWENLLNAASLQARRSRFPVLRELVRGASLAKSLDETDAVFVLHETAERRLSEALAALTADESSPLPERIVFVVGPEGGISDRELDALSACGATPVLLGPTILRSSSAGPAGLVLAQNSLGRW >NZ_CP050153.1|WP_167196682.1|2086574_2087690_-|molecular-chaperone-DnaJ MADHYETLGVSKDASAAEIKSSYRKLARKYHPDVNPGHEDEFKAISLAYDVLSDPEKRRNYDMGGGENGQGFPAGGGFGGFGDIFETFFGGGGGQAGGPIPRTQRGKDALVGVNIDLKTAAFGGTVDLDVTTAVVCDTCSGAGTQEGTKIETCSLCHGAGSVQRMTRTLLGQMVTNQTCNSCHGFGTVIPNPCLNCQGDGRVRKQRTMKIRIPAGVSDGTRIQLSSQGEVGPGGGPAGDLFVEVMVTRHEVFQRDGDNLRAAVSVPMTAATLGATIPFETFDGTQDLTIAAGIQSGTVVKLPGLGATRLRSETRGDMLITVDVLTPDKLDDEQRELLEKLAELRGEETPRAQISTENRGMFSRMRERFAGR >NZ_CP050153.1|WP_167196685.1|2087817_2088855_-|heat-inducible-transcriptional-repressor-HrcA 
MNDSRRAQVLRAIVEDFVATNEPVGSKAIVQRHTLGVSPATIRNDMAQLEQEGYIAQPHTSAGRIPTDLGYRMFVDRIDEFKPLTTAERRAIFQLIDGDVDLDEMLDRTVRVLSGLTRQVALIQYPTVSRARIKHIEIVGLGPGRILVVLITDAGQVEQKSVIAPSPLDEDAVRGLRDQINAEFAGRTLAQVFGSSPAEAAAPEPTEPSSRDDSGLTQVRAAVVDLVAATREERIIMAGTANLARSGSEFGEKMAPILEAFEEQVVLLKLLTSMAEDHEGISVRIGRENTHESFSSTSVVAAEYGHDAGSSARLAVLGPTRMDYPTTISAVRAVAKYVSSILDRG >NZ_CP050153.1|WP_139908139.1|2089016_2089880_+|DUF3097-family-protein MPVNAFDRYGPDVLSGSSPSSHRPKKSRQVELGLGMVLEDAMSGYVGAVVGAEKTTAGVVVKLEDRVGKVRAFPLGPGFLLEGQPVDVQLPKKKAQQPGRTASGSRAVVGAKARVARGSRIWVEGKHDAELVEKIWGDDLRIEGVVVEPLGGLDDVADKLEAFGPDRDHRVGVLADHLVSGTKESKIAEAVRADPRYRDVVHIIGHPYVDIWQAVKPHVVGIREWPVVPRGEDWKTGILRRIGWPHADHRDVARGWVRILGKVSTIADVEPTLSGRVEELIDFVTVG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP050153_3 | 3087122-3087235 | Orphan |
NA
Consensus repeat of NZ_CP050153_3
|
2 spacers
spacers of NZ_CP050153_3
>3.1|3087146|27|NZ_CP050153|CRISPRCasFinder
CTCCGCGTCGCCTGCCTCGGCCGGTTC
>3.2|3087197|15|NZ_CP050153|CRISPRCasFinder
TTCCTCCTCATACTC |
CRISPR arrays and Neighbor proteins around NZ_CP050153_3
The CRISPR arrays of NZ_CP050153_3
>merge|NZ_CP050153|3|3087122-3087235|CRISPRCasFinder
CTCCTCCTCGTAGTCCTCTGCGGGCTCCGCGTCGCCTGCCTCGGCCGGTTCCTCCTCCTCGTAGTCCTCTACGGGTTCCTCCTCATACTCCTCTTCGGCTTCGCCCTCCGCGGG
>NZ_CP050153|3|3|3087122-3087235|CRISPRCasFinder
CTCCTCCTCGTAGTCCTCTGCGGG CTCCGCGTCGCCTGCCTCGGCCGGTTC CTCCTCCTCGTAGTCCTCTACGGG TTCCTCCTCATACTC CTCTTCGGCTTCGCCCTCCGCGGG
>NZ_CP050153.1|WP_167198792.1|3086586_3087021_-|gas-vesicle-structural-protein-GvpA MSTSTVERSRGSYVDRPSSSSLADVIEIILDKGLVIDAYVRVSLVGIEVLTIDARIVIASVDTYLRFAEATNRLDLTQQGGRDLPEMMGGMMENGSKGKTQGAVEGIKDALTSDDSDDDSGESSSQEKSRRRTKRPARNSSESE >NZ_CP050153.1|WP_167198789.1|3085837_3086590_-|GvpL/GvpF-family-gas-vesicle-protein MSEEGAPLADRYLYGIVRAGAELPTGPDGVQGNALALVESGAVAAVVTELADSGMLGTPEALQNHSVVLDELAEKQPVLPLAFGTVVPGGADIAEQVLAPQADVFAEALDQLAGCTQFTLRISFDRDAILREIVSGNPEVAELRERISGTSEDETRNERIRLGEIVVTTMESWRRTEAPPILEQIRSATVETAMREVGQAEDVAEVAVLVRRDAIDEFDSVIEELAEANRERMRFRLIGPQAPYDFVPEM >NZ_CP050153.1|WP_039206887.1|3085577_3085829_-|gas-vesicle-protein MGLLSAVFGAPLAPLKGTVWVAEQVRGEAEKRYFDPGAIRRQLEEVAGARERGSISDDEADALERELVGRLLEGRRRRTEEDR >NZ_CP050153.1|WP_167198786.1|3085086_3085581_-|gas-vesicle-protein MSEESTANEESRAEGSSADDGGTTTKRARKPVEKERTSGGTSRRSSSSASSSREKATSSSSEKAKSSSRSESRSHSATGQRISAVSAVKRAIEQFSTLTGRPPESVVGTRWKDDRWSVRLEVVESRRIPDSADLLAEYEVELDADGELMAYDRKDRYVRGRPSE >NZ_CP050153.1|WP_167198783.1|3084587_3085064_-|gas-vesicle-protein MNQPNDAMQPQRSQEGTLLHVVETLLDKGLVLNADIMVSVAGVELLGIRIRAALASFETAARYGLDFPAGTDRETVAWKEAVEQKDTCPECGKRSALAQLMNDYCPWCGWQSARSKRIEAGEPAQLNSADDADTSAEQAAGSSADAGTPAAGSPGDDR >NZ_CP050153.1|WP_167201066.1|3083581_3084562_-|gas-vesicle-protein MQPTRDPRATLPDLIEVLLNKGVHLNLDLIISVSDIPLIGINLRATIAGIETMIEYGMMQQWDRDTREWVQRAVRTHLPLAADEEILAKMAGGHYQDNFYRTWRPGSAYLTTQRLIIHRRDPAETLWQTRLDAIASVSALREPSIGGEERTRILVGLNDGTEAILSALEPDRLISLVQARLDRTDGAPSSTPEATTAEDRPLREGRMWFLETLSSGSTWRGGQAQLSNTELTWRSPMDGRARVRIPPEQLLDIRREERSNPTDERRVLILETADSTITLAADDAGAWFAGLDEWRTGPGDRRSAPLEATMSPGRNPDERAEEGAAS >NZ_CP050153.1|WP_167198780.1|3083261_3083585_-|gas-vesicle-protein-K MTLNVNEESLKHGVLTLVVTLVEVIQEALETQAVRRMEGGDLTEDEQNRLGEALLELDEAMDQIKDQHGITGSVDDLHRGLDDVVDEVVDKLINPARWAEENGKGVE >NZ_CP050153.1|WP_167198777.1|3082416_3083265_-|hypothetical-protein 
MNAEGDMLYVYAIVAGDDYAPAVTGIDGSALHMVGRDTGPRAVVHRHTRGPFDGPDDSVRRWVLEHSEVIDDAWQNSPALLPVSFNVIVRSDPETEATATQQLEHWLDDSAVMLSRRLEELCDTSELRVEIFLDGGLLEEVDAEVGEMRTEMESRPAGVRRLLEKRLEKTEKEIVDRAADRIYPEIRARIAAHCLDIEEHRSTSRESGLTPVIMASCLVRSTDATALGAELTALKKAQPALSIRFLGPWPPYSFADVSISEERNPSSSAPDSPTPNPQGETT >NZ_CP050153.1|WP_167198774.1|3080782_3082420_-|NAD-dependent-epimerase/dehydratase-family-protein MRVAVIGATGNVGTAVLDVLGRTPEITSVLGISRRMPDTEAEPYSGCEWRSIDIAAASSEGTAHRDLTEALTGADAVIHLAWLIQPNSDRDLLRRVNVDGTARVAAAVAAAGVPHLVVASSVGAYSPDDSMDKRDEEWPTEGIRSSHYSVDKAAQERVLDDFCADHPEITVTRLRPALIFGAPAASEIQRYFLGTWMPVQLLRAGRLPFLPLPAGLRGVQAVHSTDVARAYVASVLRRRSGAFNICADDVLHPKDLAELLDHGRHIPVPNGAVRAALGMGHSSSLVAADAGWLDMGLHVPLMDNGRARRELGWEPEYSAMDAARELLKGMADGEGAASVPLRPRDVEHTRLRATDDTSRGHDAPGADEHVDMDLLGLYLSDHLTGATAGAERIERMAADFIDTPVFAALSELAAEIRGEHLYVRHLIGELGFRRRPLAEAVSWVGERVGRLKSNGSLLKRSPMTLVLEAELMRSAVIGKLGMWQTLEGNAEALGLDAEQFRGFAQKAEHQREVLDTVHSYARSRAFRRYRAVYDQASGVSPVRGD >NZ_CP050153.1|WP_167198771.1|3078508_3080662_+|catalase MSTEDRPIIPGKPGSRTPDFEEPTTPREPLPPKPDQSGPKPTSPTGAPSRDEQEDQAQQGSWLTTAQGARLYDTDHSLKAGSRGPTLLQDHHLREKITHFDHERIPERVVHARGSAAHGTFISYGNAATITKAAFLAPDVETEVFTRFSSVVGSRGSADAVRDTRGFATKFYTRDGVFDLVGNNIPVFFIQDAIKFPDIVHAAKPHPDREIPQAQSAHDTFWDFVSLHTEATHHTMWQMSDRGIPRSFRMMEGFGIHTFRTENSAGETSLVKFHWKPKAGVHSLIWEEAQMVNGVDPDFHRRDLADAIEAGACPQWELGVQVFPDDPHETFEGIDLLDPTKIVPEELAPVQPLGMLTLNRNPSNFFDETEQVAFHPGHVVPGIDITDDPLLQGRLFSYLDTQLTRLGGPNFSQLPINRPHCPVNDMFRDGMHQTADHRGTAPYKPNSLDGGNPFPAEQTDEHTFVEIAHEIPASKKERRSPESFDDHYSQPRMFWLSLTPVEQQHLADAFTFELGKCYEETIREREVAVLACIDSELARMVAEGLGLEAPAAQTPPRSDIEPSPALSQVGKRWPVDGRKIGIVTGSGTDPEQVIRAYDRIAEAGMVPITIAPVGGRITSGERSVAVERTYLTVASSELDAFFFADGAELTTEIELLITEAWRHLKFIAASGDSCTMMEKYGITADDPGVYCKDDLETALSQLQEGLSEHRAWARVEA >NZ_CP050153.1|WP_167198798.1|3088145_3088826_-|hypothetical-protein MNNKMKITGLILAGYMLGRTKKLGLALTVASAVAGTTAAKNRDQLLGGLKDFADSSPELKSLQEKITGRLAESGKSAVKAVAAKGVDQLSVKLQDQTEKMKSTLDDAADSIDPNVDDSEGADDEEDAPESEDESAPENSEEPEADEAEPTDDEQEADDEPQAEEEAPKKPAAKRSSTAKRSAKGTRSSKSSTSSRGSRTKPGSKRSTTAKKTSSRAAAKPEEAEDE 
>NZ_CP050153.1|WP_167198801.1|3089098_3090238_-|YbdK-family-carboxylate-amine-ligase MSEFGIEEEFLLVDQHSLLPARSKSSLQEIEDEVRPSRGAACAEWLPGQIEFATPVLTTAEEAFESLHSFRRGLSAAAQARGLLAVGLGTAPQIPAAPPGVSDGSRYREFAQLAPAIAADQYVNGMHVHVDIPDREAGLRAVNGLRRWIPVLTALSANSPLWRGADSGFASWRSIHYRRWVVFGIPPHFHDLDDYDAQIDAALRSDVVLDEATLGWLVRLSPKHRTVEVRTSDVQLDTATTVTLALLTRALADVAMDDTGPEAVPANLLNIAHWQAARFGLTGMLMDPDTHTSVPAAEVVRKVFHRARPALMRSQDMGRVHRGLRRLLSQGTGSEEQRRVAERQGVGGLLEHAAHRLTASSQDQFEQPADSSRGVSDPP >NZ_CP050153.1|WP_152347519.1|3090557_3090866_+|hypothetical-protein MDDQNPKRIDPRSGIAFSDERAIRRRRKLKEAAEFWNMSVAGVWLHYVSLGGDLTEYELDAYLHEAYFLTPYQHDILAEAVNELIDMLPPPPRAPLTDETGL >NZ_CP050153.1|WP_167198805.1|3090860_3092702_-|MFS-transporter MSENVELGTIKTDVPARLDRLPWARFHWMVVVGLGSVWILDGLEVTMVGNVAARMTEEGSGIDMTAGQIGTAGAIYVLGACVGAIVFGQLTDRFGRRKLFLITLVLYLVATVATAFSFSAWYFFLVRFLTGAGIGGEYAAVNSAIDELIPARVRGRIDLIINGSYWLGAAGGAATTLFFLNTDILPKMIGWRLAFAVGMLLAIFVFVVRKNVPESPRWLFIHGRNDEAERIVGEIEDGIETETSQTLPPPKKTITVRQRKTISFVEIMKVAFTIYPKRAILCLVLFVGQAFLYNGITFNLGTIFNGFYGVAAATVPIFIILWSLSNFAGPVILGRFFDTIGRKPMISFSYLGSAVVAVVLALVFNTDVGGEWLFLVILIVCFFLASSGASAAYLTVSEIFPMETRALAIAFFYAIGTAAGGIAGPLLFGGMIESGDRSQVAWAFCIGAAVMALGGVAELIFGVKAEGADLEDIARPLTAEDAESAEGAAESSAEPTERGEWADSAEAAEPSASPEARSRGDRLRPGPGSVGVYSPWPSVSSRDVPPEVSANEVNGIIDFVRDMEPVGEVELYRAIGARRWGPGRFRAAVREAIRQGAVHRNRRGRLEYRGDRS >NZ_CP050153.1|WP_167198808.1|3092902_3094084_-|NAD(P)/FAD-dependent-oxidoreductase MAEFRYVIIGGGMAADSAAQGIREIDEEGSIAIISDDVDEPYTRPALSKRLWTDESFDESDNYLDTAEATGARISLRTGATAVDVEAKSVRTTHGDFTYDKLLFVTGGRPKGIDLDEGERVICFREFNDYRRLRDLSGRNLSIAVIGGGFIGTELAAALVQNDTRTTLIFDDDTLGGSIFPPDLAKQFHELYRSHGVTLVPGTKASGGHVDGDRVVLDCDGEPHEFDAVVVGLGIEPATQLAEDAGLDTDDGIIVDESLRTSKPDVFAAGDVARYPDRILGRQRVEHVDNATQMGKAVGRIMAGADESYTHTPYFYTNVFDFGYQAVGELDPTLRTVEDWKKPHTDGVVYYLGEDGRVRGVLMVNMEDRLDAAREILGEDWDHTPGDLVTRIS >NZ_CP050153.1|WP_167198811.1|3094150_3094951_-|SDR-family-oxidoreductase 
MDLNIRDRKALVTGASSGIGLETARQLLAEGAVVVMTGPEPDELKAAVDELAEFKERIYAHDADIADDESVDELAASVAAEVGDLDILVNVAGIHGAGGLFHEINQEGWDRTIDVDLMGPVRVTRAFLPGLRRGGWGRIVFVSSEDAVQPYDDELPYCAAKAGMLSLAKGLSRTYASEGLLVNTVSPAFIATPMTDEMMNERAQQKGTSFDEAIASFLREERPFMELGRRGRPEEVAKVIAFLCSDAASFVNGSNYRVDAGSVATI >NZ_CP050153.1|WP_167198814.1|3094993_3096367_-|NAD-dependent-succinate-semialdehyde-dehydrogenase MTAKFNTTNPATGEVLKEFPTASDEEISAVIDASDAAFQTWRTTDVRERSAPLARAADLMEERRWDLAELLILEMGKLRAEAEAEVELAARILRYYSEEGPLLLSDEVLAPSSGGSAVMKYEPIGPILGVMPWNFPYYQVVRLAAPNLVAGNTIILKHASNCPQSALAFEQVMNDAQLPSDCYRNVFADSDQIQTIIADERVRGASLTGSEGAGAAVAGAAGHNLKKSILELGGSDPFIVLDSADLDATVTAAVKGRMTNSGQSCIASKRLIVLEEHYREFVDLLTAKMSQFVAGDPRDSATTMAPLSSEQAARDLMEQVQDAIDHGAHVHTGGHRVDRPGAFVEPTVLTGVTEQMRAYSEELFGPVAVVYSVADEDEAVALANDSSFGLGGTVVSDDIEKAQRVADRIDTGMVWINQATWTEPDLPFGGTKRSGVGRELGAEGIREFVNKKLIRTP >NZ_CP050153.1|WP_167198817.1|3096487_3096949_-|hypothetical-protein MSLSGLTESAVPTRLCNAMLSGMGSDDDLNSQKSDVTDHKPIVKIFHDDIGVLNQLVKLASEGVPKTALHVFAHDAESTDEVVGADGTLKGLGDLVDERYNERGDELRNRFQRYGFDSDEIEKFESDLDNGAILLVIDDPDLRADYKGNRRAP >NZ_CP050153.1|WP_167201068.1|3096945_3097419_-|hypothetical-protein MSPRLDPTTPRHRLGLPVIAIVGLALLAAPRVVLHDLNIIEEGTSVNALFVFLPPLVWIVTVLITRVPRPLITLLAVGLFYGVFLALGHQILWNHSLGDNPPQLGGNLAGLDPTAQSLIFRSFAVASSLLIGVVVGAISGAIAWGLKVLLRRPRNGG >NZ_CP050153.1|WP_167198820.1|3097755_3098115_-|hypothetical-protein MSECDPPAWMIRYRNFKTLCSYVCGEFIRFYLTTGCDQIRYTHSQITEGLPNYSCRLTSVDGSVLLLPLDDWVDRLDEVVPMVRTWLGEHSDLKGCKPEKSHYQGDRYWFTRWQEANPW |
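Each array above is listed as space-separated segments that alternate repeat, spacer, repeat. Assuming 1-based inclusive coordinates (an inference from the reported locations, not something the page states), the spacer positions can be recovered from the array start and the segment lengths. A minimal Python sketch using array NZ_CP050153_3, with `spacer_coordinates` as a hypothetical helper name:

```python
def spacer_coordinates(array_start, segments):
    """Return (start, end) for each spacer in a repeat/spacer/.../repeat list.

    Coordinates are assumed to be 1-based and inclusive.
    """
    if len(segments) % 2 == 0:
        raise ValueError("expected an odd number of segments: repeat, spacer, ..., repeat")
    coords = []
    position = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:  # odd indices are spacers, even indices are repeats
            coords.append((position, position + len(segment) - 1))
        position += len(segment)
    return coords


ARRAY_START = 3087122  # from the CRISPR_location 3087122-3087235
SEGMENTS = (
    "CTCCTCCTCGTAGTCCTCTGCGGG CTCCGCGTCGCCTGCCTCGGCCGGTTC "
    "CTCCTCCTCGTAGTCCTCTACGGG TTCCTCCTCATACTC CTCTTCGGCTTCGCCCTCCGCGGG"
).split()

spacers = spacer_coordinates(ARRAY_START, SEGMENTS)
array_end = ARRAY_START + sum(len(s) for s in SEGMENTS) - 1
print(spacers, array_end)
```

The computed spacer starts can be compared with the start field of the `>3.1` and `>3.2` headers, and the computed array end with the reported CRISPR_location.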
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NZ_CP007796 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence | 29698-29724 | 3 | 0.889 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NZ_CP023068 | Ensifer sojae CCBAU 05684 plasmid pSJ05684b, complete sequence | 201127-201153 | 4 | 0.852 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | MT657336 | Microbacterium phage ClearAsMud, complete genome | 13374-13400 | 4 | 0.852 |
NZ_CP050153_1 | 1.1|1679027|26|NZ_CP050153|CRISPRCasFinder | 1679027-1679052 | 26 | NZ_CP043441 | Cupriavidus campinensis strain MJ1 plasmid unnamed1, complete sequence | 1430349-1430374 | 5 | 0.808 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NZ_CP022418 | Sulfitobacter pseudonitzschiae strain SMR1 plasmid pSMR1-3, complete sequence | 155984-156010 | 5 | 0.815 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NZ_CP035092 | Paracoccus denitrificans strain ATCC 19367 plasmid unnamed1, complete sequence | 26841-26867 | 6 | 0.778 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NC_008688 | Paracoccus denitrificans PD1222 plasmid 1, complete sequence | 274188-274214 | 6 | 0.778 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NC_009717 | Xanthobacter autotrophicus Py2 plasmid pXAUT01, complete sequence | 178368-178394 | 6 | 0.778 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NC_007486 | Rhodococcus erythropolis PR4 plasmid pREC1, complete sequence | 40415-40441 | 6 | 0.778 |
NZ_CP050153_3 | 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder | 3087146-3087172 | 27 | NZ_CP020698 | Sulfitobacter sp. D7 plasmid p4SUD7, complete sequence | 82808-82834 | 7 | 0.741 |
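The Identity column in the table above is consistent with (Spacer_length - Mismatch) / Spacer_length, rounded to three decimals. This formula is inferred from the listed values rather than taken from the tool's documentation. A short Python sketch that checks it against every row, where `identity` is a hypothetical helper name:

```python
def identity(spacer_length, mismatches):
    """Fraction of spacer positions that match, rounded to three decimals."""
    return round((spacer_length - mismatches) / spacer_length, 3)


# (Spacer_length, Mismatch, reported Identity) for the ten hits listed above.
ROWS = [
    (27, 3, 0.889), (27, 4, 0.852), (27, 4, 0.852), (26, 5, 0.808),
    (27, 5, 0.815), (27, 6, 0.778), (27, 6, 0.778), (27, 6, 0.778),
    (27, 6, 0.778), (27, 7, 0.741),
]

all_consistent = all(identity(length, mm) == reported for length, mm, reported in ROWS)
print(all_consistent)
```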
1. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches NZ_CP007796 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence) position: 29698-29724, mismatch: 3, identity: 0.889
ctccgcgtcgcctgcctcggccggttc CRISPR spacer caccgcgtcggctgccccggccggttc Protospacer * ******** *****.**********
2. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches NZ_CP023068 (Ensifer sojae CCBAU 05684 plasmid pSJ05684b, complete sequence) position: 201127-201153, mismatch: 4, identity: 0.852
ctccgcgtcgcctgcctcggccggttc CRISPR spacer ctccgcgtcgcctgcctcggcgcaatc Protospacer ********************* . **
3. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches MT657336 (Microbacterium phage ClearAsMud, complete genome) position: 13374-13400, mismatch: 4, identity: 0.852
ctccgcgtcgcctgcctcggc--cggttc CRISPR spacer ctcctcgtcgcctgcctcggcgtcggg-- Protospacer **** **************** ***
4. spacer 1.1|1679027|26|NZ_CP050153|CRISPRCasFinder matches NZ_CP043441 (Cupriavidus campinensis strain MJ1 plasmid unnamed1, complete sequence) position: 1430349-1430374, mismatch: 5, identity: 0.808
cttgaacgaggtagcccgggtgggcg CRISPR spacer ggcgaaggaggtggcccgggtgggcg Protospacer .*** *****.*************
5. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches NZ_CP022418 (Sulfitobacter pseudonitzschiae strain SMR1 plasmid pSMR1-3, complete sequence) position: 155984-156010, mismatch: 5, identity: 0.815
ctccgcgtcgcctgcctcggccggttc CRISPR spacer aatcgcggcgcctgcctcggccgcttc Protospacer .**** *************** ***
6. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches to NZ_CP035092 (Paracoccus denitrificans strain ATCC 19367 plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.778
ctccgcgtcgcctgcctcggccggttc CRISPR spacer atcctcgccgcctgcctcggccgggcg Protospacer *** **.**************** .
7. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches to NC_008688 (Paracoccus denitrificans PD1222 plasmid 1, complete sequence) position: , mismatch: 6, identity: 0.778
ctccgcgtcgcctgcctcggccggttc CRISPR spacer atcctcgccgcctgcctcggccgggcg Protospacer *** **.**************** .
8. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches to NC_009717 (Xanthobacter autotrophicus Py2 plasmid pXAUT01, complete sequence) position: , mismatch: 6, identity: 0.778
ctccgcgtcgcctgcctcggccggttc CRISPR spacer ctccgcgtcgcctgccttggcgaaatg Protospacer *****************.*** .. *
9. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches to NC_007486 (Rhodococcus erythropolis PR4 plasmid pREC1, complete sequence) position: , mismatch: 6, identity: 0.778
ctccgcgtcgcctgcctcggccggttc CRISPR spacer acaggcgttgcctgcctctgccggttc Protospacer . ****.********* ********
10. spacer 3.1|3087146|27|NZ_CP050153|CRISPRCasFinder matches to NZ_CP020698 (Sulfitobacter sp. D7 plasmid p4SUD7, complete sequence) position: , mismatch: 7, identity: 0.741
ctccgcgtcgcctgcctcggccggttc CRISPR spacer gaatgcgtcgcctgcctcggcctgtgt Protospacer .****************** ** .
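The mismatch and identity values listed for each spacer match above appear to follow identity = 1 - mismatches / spacer length (for example, 3 mismatches over 27 nt gives 0.889). The following is a minimal sketch of that arithmetic for ungapped pairs only; the function name `compare_spacer` is hypothetical and is not part of CRISPRCasFinder or of the tool that produced this report, and the formula is inferred from the listed values rather than documented here.

```python
def compare_spacer(spacer: str, protospacer: str) -> tuple[int, float]:
    """Return (mismatch count, identity) for two equal-length, ungapped sequences.

    Assumption: identity = 1 - mismatches / length, rounded to 3 decimals,
    which reproduces the values shown in the match list above.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length (gapped alignments are not handled)")
    # Compare position by position, ignoring letter case.
    mismatches = sum(1 for a, b in zip(spacer.lower(), protospacer.lower()) if a != b)
    identity = round(1 - mismatches / len(spacer), 3)
    return mismatches, identity


# Match 1: spacer 3.1 against the NZ_CP007796 protospacer
print(compare_spacer("ctccgcgtcgcctgcctcggccggttc", "caccgcgtcggctgccccggccggttc"))
# Match 4: spacer 1.1 against the NZ_CP043441 protospacer
print(compare_spacer("cttgaacgaggtagcccgggtgggcg", "ggcgaaggaggtggcccgggtgggcg"))
```

Match 3 (MT657336) is shown with alignment gaps, so it falls outside this simple position-by-position comparison.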
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2710856 : 2733531
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP050153|2710856:2733531|DBSCAN-SWA CTTATGTCTTGAATTTGATTCGAAGATTGCCGCTGCCTTCGGTAGCAATCATGTAATTCGACTTCCCCGTCCCCGCACCGATCCCGACCCCTCGGGAGGCACCGGATGCCAGCCGTGCTGCTTGTGAGGCGGGGATGGTGATCCATTTCGAGTCGCCCATGCCGAGGCCGGGCCCGTCCCACCGGTTCGTGAGCTTCGGGGTCTTCGACGGTGGGGATGTTTCATCGTGCAGGCCGAGGCGTGGCCGGACCTTGCCTGACACACCGTGGTATTTCGAGGTGCGAGCGACCCGGATTTGCATGCTATCCACGGTTTTGCCCTCGCAAGCTGCAGCGATCCGACTGCCGTAGAGGAACACCGAAGAATACGGATGCGGATAACTCGGCCACGCGCCCTGAGCAGGTCTCGACCCATCGCGTCGACCATCCCGGTATGCCGCCGTTGACGCCGGATCCAAGGTGACGGGTTTCGGTTTCGACTTCCGAGGCGGTTCACCCGGGCCCGGCGCCGGGCCTTCACCAATGAGCCCGTAGATCGCGCCGTCCTTCATAAACACCGAATTCACCTGCACGAAGTCACCAGCGGGGAGTCCGTCGCCGAAGGTGAGGTCGACGGGTTCGGGAATGTCGACTTCGACGGGCCCGCCGATCTTGTCCATGACCCGCCATCCGGCTGAGTGTTGAAGGACGAGGACCACATCTCCCTCGGCGCGAGGGTTGTACGCCTGGTTGGCGGCGACGTCGTTGATGATCGTCTCGCCCCATTGCAGATTGACCTTCCCGTCCTCGCGGACCGAGCTCACTTGCATGATCTGGAACGGGTTCATCTGCCGTTCGACCTCAGCCCGGACCAGGGCGGCGAGGTCGCTCTCAGCGTCACTCAAAATTGCCCTCCCCGTAGTCGTCGTTGTCGCCTTCGATGGTGATGCGCCCACCAGCGGGTGAGGTGGTGGAGCGGGTTTGCATGGACATGGTTCCTCCGGTCAGGGGGAAGGTGATGGAGTCGATGATGTGGTTCTCCACGGTTTCGTCAGGCATCGTCACGGAGATGACGTCGTCGGGTTCGAGCGCATAGTTCAACAGCGACGACACGGACACCTTCTGTTTCAGACCTAGGCGTGGGGCCAGGAGGCCGAGTGCGGTGGTTTGGCACTGTTGCAGGCTGGTGATGAGTTTCGACGTGTAGAACAGCGGCACCGCCCCGAACGGCCCCTTGTAATAGGTCGGCGAGACGGGGTCGTCGTCGGAGGCGACGGACGGGCCGATGGGTGGTTGCCCGTTCTCGGACGCTCCGGAGGCGACGATCTGGTTGTACACCCCGTCGCGGGACAGTGTTTGTTGCGGTTCGACGAGGGCACCACCCGGGCCGGCGGCCAGTTCCCACACTGTAGGGTCCTCGAGTGTGGGGACGGGAGCTGCGACGAAAGATCCGCGGGAGTCGCAGAACACTCGGCCGCCGAGGGACTTGGCAATGGAGGGGTCGCGGGAGCGGCCGTCGATGAGTCCCCACCGGTCTCGTTCTTCGACGAGCTGCGGAATGTTGGTGTCGCCGAGCCGCCAGGACATGCCGACCTCGGGTAGCACTTCCTTGATGAGGGTGGTGGCCCAGTATTGACCGGTGCCGATCGCCAGGGTGCGGGGTTGGTGGAACCGTTCATCCTGCACCTGCGCCTCAAGCGACAGTCCATCCACCTGTGCGCGTCCGGGTCGGAGACCAGCTTGGGAGACTTCTTCGACCCGGTACACACCCAACGGAACGGTCTCGATGGTGTGACGGTCGTAGTGCATGCGCATGAACACGCGAATCCGGGCACCGTATGGGGAGAGTGCTCGGCGGCCTATGTCTGCGCCGGAGATGATGAGGCCGCGGGCGGTCCATCGGACTTGCGAGGTGGAGGCGGCGGTAATGGACCCGTCGTGAAGG
GTCAGATCTGACCAGGTGGTTCCGCGGTCGGGTGACCATTGCACGACGGGTTCATACCGAGCCAGGGAAAGTGCTTCGAGCCACCGATCGGAGACTGGGAACACGGGGGCCTCCTTATGGTTCGAGCAGCTTGTTGTATGACTCAACTGTGGCCGCGAGCTGGTCGTAGGTGGTCCAATCGCGAGCCACAGTGTCGTACGACAGTCCCGGGATCCGCAGTGGTGAGTCGGTGGTGTCGGGGCGGTCGATCTGGATGGCGTCCCATTGGTATCGGCGCCAGGGCCATCCGTCCCAGTGGACGTGTTCGAGGTCGAGTTTCCCGCGGATGAAGTAGAAGTCCGGCAGGCCTGCACAGGCGGGGAATTGTGCGAGGACTGGGCCTTCGTCCAACAGGGCCATGAGTTGTTCTTCGACGTCCTTCTCGCGCACGAGGGACCGCAACTGGGTGGTGAACCCTTCGAGGATGTCCCAGGTCGCGGCGGGGCGGGGTGCGCCGGCGATGCTGGTTGTGTCGACGCGGCCGTCGTAGGACAGGTCTGGTGGTTCGACTTTCAGCAGCATGGCTTTGTCGGGGTCGGTGACGGGTTTCAACCACGCCCACCCATCAGACAGGCCCGGGATGACTGCGCCCACGTATTCGGATGGTTCGCCTTCGGTGCCGTCGAAGCGGATGGGGACGGCTCGCCAGAGTGTGCCGCCGCCGAGTGTGGCTTCGTGGTCGTAGGCGTGCGCGTACCCTCCGGGCGCCCAGGCTGGGCTGCCGGAGCGGACGAGACTGCCGTCCTGCCGTTCGAACCGGACCTTGGCTGTGGTTGCTTCCTGCGCCGGTATTGCTGGAACCTCAGGAGTGAATTCTTCCGACGATGAACGTTGCGGTTGCCCAGTCCAACGGAGCTTCCTCCCACTCGCGGGAGAACTATCGCCGTCGAAATACGCGCCCGAAGTAGTGCCTTCTTCGATGGAGAGGCAATCCCACCACACCGGCTGCGTGGTTGAACCATTCGTGAGCCGGATCGCCCACCCCCGGCAATCTGCCGGCAGCGTGAACGTCACCTTCAGCCGAGTGGTTCCTGGTTCATTCGGTGCGGAAACTGAATCGACGAAATGTGCGTTCAGGCTTCCGTTTGCCCCGAAAGTGGTGATGTTGATCTGCCGGGCGTTGTTCCACAGCGCTCCCGTCTGCTTCGCGTCGAGGTGTATATCAGCGGCGACGGTGAACGTCTTGCCCGCATCACTCTGCCGTTCCCCGAGCTCTTCACGAGTGACAGTGGCCAGGGTGTAGTTCGCCGTTGCTGAGGCCACGCCGATCTTCAGAGCGGTTTGCCCTCGCGAGGACCAGTCCCTCGACACGGAAACGTCAGCACCGGCGATCCCGGTGATCTGGCCGGAATCAAACATTCGGTCCATGTAGGAGAATGAAGGATTCCTGGCCAGATTCCGGCGATGCAACACTTCCGGCTTGCCGGGAACGCTGGGCTTCGAATCATCAAAATCAGCCCGCAACAGCACCCCAGCATGGGCAGGGTCGATTATCGCCGTGATCCGACCATCTGGTGACACCACAGGGTCCACCTCCGGCGGCCAAGCAAACCCCGGATCAACAACAATCGGCATCAGTGAGCCCTCCCAGCACCAACAGCAACCTGGCCTATCTGCTTCTTCATATACGCATCCATCTGAGTGCCGTCCCGCATCACGAGCTGCAGCGGCACACCATCAAACGCGGTCGCGATCTGCTCCGGAGTCACCTGCATCCCCTTCACCGCAGACGCAATCGACGACGACAAAGTTGCCGCATCGAATGCAGGTGCAGTGTTCGGAATGTAAGTGCGGTTCGCGGCCATGAACTTCATCGAATCCGCGGTCGACATGACCTGTTCGCCGCCCTGGAAGTTCACGAGTTCAGGGCCTTCCTCACCGACGACAGCCAGACCGCGCTTGGCGCGTTCCGTGCCGCCCTCATACCAGTTGGAACGCTGGTGATGAGCACGAGCCGCAGC
TGGGTTCCCGTATGTGGATTTGATGTAGTTCAGACCCCACCGGGCCTGACCCTCCACCGTGTCCTCAACCGGGCCATGCAGAGAAGTCATCTTCTGGAACAGGCCTCGGGCCGACGATGACGGGTTGGCTGCGTTCGGATTCCACGAGGACTCCTTGTTCACCAACCAGTCAATGTCGCCCCAGTACTTGCCCCAACCCATCTGCTCCGCGATCTTGCGGACAGCAGCCTTCACTGCGGGGTCGCCGGCAGTGGTTCCGGAGATGGCAGTGGTTTTGCCGTACTCCTTCGCCTTACCGACCAGTCCCTCGGAAACCTGGGACATGATGCCCTTGCCGAGTTGCCCAGCAATCCCATCCGGGAGTTGCTTGGCGTGCTTGGCGTAGATCTGCTGGGTCAGCTTCTCGGCCGCCTTGGCGTAGGAAGCCTGCAAGTCGGAACCGGCAGTCACGCCAGCACGGTCCAAGAACGGGTGCGACATTTCCCCACCCATGCCCGGCAGACCGCCGGCCATAGCGTCCGACAGGGCTTGCCGTGCGGCGATGTGGACGTGGTTGAAGTGCATGCCGCGGACGGCACCGGTGTAGTAGTGGTTCGAACCGTTCTTGATCTGCTTCCCGTTCGCGGGCGAATAGATCAGTTCGGCAAGGTTCGCACCGTACTTATTGCGCCAGAAGTTGAAAATGTCCATCGACGGGGACACGTCGATCGCCATACCCCTGGAGTGGTACGAAGCGTTGCCCGACACTGTGCGGGAACCACCACGGTAAGCGGACGTGAGGCGAGCATTCGGGAACTGGTCCCTCGTGATCGCCCAAAGGTTCCGCCACACACCAGCATCCGCGAACGCATTCACGCCCGGAACGCGACCGAACACGCCACCAGTAGCGAACGCCTGCGACTGAGCATGCAGAAGGTCATCCAAACGACCCTGACGTGCAGCATCGTTTTGAGCCTTCACCCCAGCCTCGCCGCCCATGGCGCGAGTCCACTCGGGGCGCATGATTGCTTCCCCACCCGACAAATGCAGATTCCCTGCAGTCGGCGAGTAGAACTGGTGAACGTCCCGACCGGGCGTATACCCGGGCATGACACCACCAGAAGCGAACTTCAATGCGGCCGGGGCGGAGAGCTTCACGCCCACATCCCCGAACGCCTTATTCACACTGGCAGTGAACGAGGACAGGACACCGATGAGAGCATTCAGCCGGCCACCCAGTTCGGGTTTCGCCGCCGCCATCTCACCGTCAATGCCCTCACGGATGGCTTCCATGTTCTTGTTCGACGTGGACAGCAACGACTCGAACCCGGTCTTGTTCAACGACTTCAGGTTCGCGAGACGCGTCGAGTAGTCGCCGTGCAGTCCCGACATGGTGGTGTCAGTGTTGGTGCGCATGGTCGAAAACTTCGAACCGGTCGTGTTCTTCAACGCTTCGAAATCAGTCTCGGACCTGGTTTTCATCTGCGACTGCTTGTCGATGAGCATGAGCCGCATGAGTTCCTGCTGCTCAGTCGACACCGACAGTTGCCCAGCCTGCTTGGCCGCAACGGTGGCCCGCATAGCTTCCTGCTGAGTCTGCGTGTCCAACAGCATCGCTGCGTTCTGCGTCTGCGTGTTCGTGAGCATGCCCAACTGGTTTTCCTGCGTGGACAGGAGCATCCCCGCCTGCGCAGTCTGAATGTTCGCCAGCATTCCAGCGAACCCCTCAGACACGGACAGGTTCATCTGCTCCATCGCATCCAACGTGGTCGCAGACATGTCCTGCATTGCAAGACCCGCCGCACCTTCATCGTCAGCAACAACAGGAGTCACACCCACATCCATGGACATGGCACCCACAGACGGGACGGCAGCAGCACCCAGCATGGCGGCCGCGGACTGAACATCCGAAACACCGCCAAGCATGCCCTGCACGAGCCCCTCAGCAGTGAACTCACCAATCTCACCAGTCACACGCGACGGCGAATGGATACCCAGAACCTGCTTGAACGTCGACTCCA
TCGCCTTCGCAAGATTCGCGATCGCGGCTTCAACGTTCTTCTGCTTGCCCTGCAGACCCTTCACCACACCGTCAGCGGCATTGACTCCACCCTTGTAGAAGCCCTCCGTGACGAACTGGCCAGCCTGGTTCGCGTACTTGTCGAACTCAGCCCACGACGACACCAACTGGTTCCGCTCCCCCAAAGGAGCGTTCAGCAGAGCATCCGCGGCACTATTGCCGCCCTCGATACCCTGCGACGCAACCTCCTGCAGGAGAGCACCAGGGATGCCCATCTTCGCAAGCTTGCCCAGCTTCCCGACAAGCTCCTTCAGACGAGCGGTCTTCCCAGTCATCGACGACAGAGACGACACCTGTTTGAAGTTCCCGTCGCCAGTCGAAACCACCGAAGTGGACACGTCGAGGTCACGGTCGTTCAACAGGCCAGACGCGACCGAGTCCTTGATGCCCTTCAGCTCGGTCGCCTTGTCCTGCGCCTTCTTCAGCCGGTTGTCGATCCGCTCAGCCTGCGCGTACAGGCTGCGGAGGTTCGCTTCGAACTTCCGAGCCGACGAATTCGCACGTGACCGCGAACCACGAGACAAGTCCTGATTCGAACCCAAGTCGTACAGGCGGTCGACGGCCGAGTAGCCGCCAGACAGTCCACCGGTCACCTGGTCGCGAATGTTGCCGCGGCGAAGATCCACACGCAGATCCGTGCGGAGGTCGTTCACACGGGCCTTGCGTTCACGCTCCTCCTTGGCGCGGTCGGCTGCGATCTTTGCGGCCTTCTCGTCGGCCTTCTGCTTCTCCTTGGCCGCCTTGAGAGATTTCTTCGCAGATTCGAGTTCATCCTCGGCAATGCGGATGCGGCGATCAGCCTGGGCTTTCGCCTTCTTCGACTTCGCGTCCTGCTTTTCCCGACGGGCTGCACGGAGGCGATCCTGTGCCGCATCGACCGACTTTTCAGCCGAGGCAATACCACCCTTCGCCATTGGCATGGCGCCGGGCATGCGGGAGATCGCTTCCATCATGATCTTCCACGACCGCCGGGAACCGTCGAGTGGGATGAACGCCTCGTCGACGTCCATGCGGTCACCGACGATGCGCCAGGTGTTGGGCTTGACCATCTCGGCGATGTTGTTATGGACTCCACCTGCGGCCATCGGCATCAGATCGATGCCGCCGAAGTACTTGAACTTCTGGACCGACTTGTTCTTGGGGTCAGACGAGTAAACGGTCTCGTCGCCCTCTTTGCGGATATGAACCTCATTGACCCGGACCCAAACGGTCTTGTCTTTGATCGAGTCCATCCACTGCTTCGTCTTCTTCAGCTTCGCAGTGGCGTCGCCCGTGTTGACCTTGACATCGGTTTCGATCTCGTCCGGGGTCTTAAGAAGCTGGTCAACGTACGCCTTGGCCTTCTTCTCCGAACCGAAGAATGCGGTGGCAACACGCACCATTTCGTCGCGTTGCTTCTTGTGCTTCTTCGTGACCTCTTCGACCGACTGGCCCTCTTCGATGTTTGATGCCGTGGAGTCGTTGATGTTGCGAATCTGGGCATCCAGCATCTCGTTGTTCTTGCGACCGGACTCGGTGGTGTCATCGAGAGAGGTCTTCCCCGACTTGATTGCCTCGTTGAGGTCATCCCAGCCGGCCTCGACGGCACGCGTGGCCGCTCGTGCATCAAGCGTTTCTGACCCGAACCCGCGGATCTCATCGGCGAGCGAGGCGATCTTCTCCTCGGTCGTCTGAGCGCTCACGCCCAGTGCTTTGAGGTCTTCTTCGTTGCCGTTGGCCGCGGCTCCGGCCTCGATCATCGCCGGGGTGATTTCACCGTAGGCAATCTTCAGAAGGACGGCTTCCTTCTCCTGCTCGGACATGGAGTTAGTGAGGAAGCCGGACGATTCCGCAATGGTCTCAAGTTGCTTCCTGTATTCCGGGAACTCGGAGAAAGTCTCTTGGAGCGAGACGCCAGCAGCGCGACCGTCATACATGACTGCTTTGAATCCAGCCTGAGCC
GCTTCGAGTGACCCTCCGGATGCAAGATCGGTGACGGCGGTGTCCAGTTCCCTGAACGCTGCTTCCGCCTTGTCCGCACCCGACTCGACGCCAGGAATCCATCCGAGAGCGGTGTTGATCTTGTCAGAGAAAGTCGAGTTGTAAAGCCGGTTGAACGCGCCAGTCAGGTCGTCAACGGCAGTGAACGTCTTGCCGGTCTTCTCGCCGACCTTTGAAACGTCTTCCATTTCGCGGTTGACGCCCTTGAGGATCTCCTCAGCGTTGCCGCCCGACATGACGATCGCTTCCATGAAGTTCGACGAAGACGTGGTGCGCTCGGTCGCCTTCAGTGCCTCGGACGCCGCCACAGCCGCCGTAGCGACAGCGGCAATGACGCCTGCGGCAATGCCGGCAGATTTTCCGAACTTCCCCATCTTCGTGGTCAGGTCGGGGAACTGTGTGCGCAGGGTGTTGAATCCTTCGATGATGTCGAACACCCGTGGCGCGAGCAGCAGGAAGCCTCCCACGGCGAGTGATGCCGCGCCGCCGATAGCGCCGAGGCCGGCGCCCACTTCGAGCACCGGTTTCGGGAGCTTGCCAATCCACTCAGCAAATTCAGCGACAGCCTTGGATCCTTCAGCAACAACAGGCAGAAAGGTGCCACCGATGGTGATGGCGGAATCCTTGATCGAGTTTCCGGCCATACGGATCTTCGACTCGGCTGTGTCGTACCGTTTGTTCGCCTCGATCATGAGAGCGATGCCGTTGGCGTACTCTTCGTTGCCGGATGCCAGTGCATCCTTGAGGACATCAGAAGCGTTCGATAGGCGCAGCAGTGCATCAGACTCACGAATGCCGGTGATGCCGAGGTCCTTGAGGGCGCCATTGACGTTCTGACCAGCCGCCTGAGCGTCACCCAGACCAGCAATGAAGGTGGACAGTGCGCCCGCGGCGTCGTCCTTCCATGCCGCAGAGAACTCCTTAGAGGACATGCCTGCGACCTCGGCGAACCCGGCCATAGCGTCGCCGCCCTCGGAGACGGAGTTCCCCATCTTCTTCATGACCATGGAGATGGCCGTGCCACCAGCTTCAGCGTCAATGCCCACCGAAGACAGAGCGGCGGCGAGACCGAGAACGTCGCCCTCGGACATGTTCGCCTGCTTGCCTGCGCCGGCGATCCTCATCGCCATTTCCATGATCTCGGACTCGGTGGTAGCGAAGTTGTTGCCGAGACCGACGATCGCGGCACCAAGGCGGCCAATGTCCTTCTGATTGGTTCCCATGATGTTCGAGAACCGGGCCAGTGAGGTTGCGGCTTCTTCGGCGGAAAGATTCGTCGACTCGCCCATGTCGATCATGGTCTTGGTGAACGACACGACATTGCCGGTCTCGATGCCGAGCTGACCGGCAGCTTCCGCAACAGCCGCGATTTCTTCGTGAGTAGCGGGGAGTTCTCGGGCGAGTCCTCGCAGACCTGCCTCAACCTCAGCCAACTGTTTCGGAGATCCATCGACCGTCTTGAGCACGCCAGTCCAAGCCGACTGCCAGGACATGGCCGCCTTCGTGGCGAGGCCCATTGCGCCAACGACAATGCCACCGAAGGTGGCCATCGCGCCGCCGATCTCACTAATCTCGCTTCGGTACTGGCCGATCTTCTGTATGGCCTTGCCTGCGCCAGTTTGAGACTTATCCCAGGCGGTCTCGGTCTTGTTTGCGGCATCTACTGCCGCTTTGCCCGCAGCCGCCATGTCGGCCTTGAATTGATTGACCTTCACCGAAAGGAGGGCTGACACATTACGAACTTCTGCCATGACAGTGGCCTCCTATATACTTCTGGTACTACAGAGGGGATAAGAAAATGACGACCGCTATTGCAGGCAAAAGCAGCGACGCGACATATCGGAAGCTCAAGAAGGCATCCGCAATTCTTCTGCTCATTGGTGTGGTCGCCATGGCTGTGGGTGTCGTCGCCTACGTCGTCCAGGTCGGCGCAGTGCAGTCAGGAATGGGCGGAGAGGGTG
CGATGCTGTTCGCCACCTACGGGATGCTCTTCGGCCTATTCGGTGGCTTTCTAGTGCTGTTCGCGTCACTCGTTGTGCGAATCGTCGCCGCTATCAAGCGTTAGTCACTCATCCACGACGACATAGGTCTTTGTGCCCGGCTCAAGGTTCTTCGACTTCTCTGACTGCTGGAACTCTTCGAGTTCGGCGCACGCCATGCATCTCGTGTCGCGGATGTCGTAATCGTGCGGGTGCTCGCCAGACGACAGGGACATCGGTTGCCCGCATCCATTGCAGAGATCCGCCTCATAGAGGGTGAGAGCAAGGGCGAATGCGCGGTCCCTCATGGTCCACTTCTTCCCGGGCTTCCGGGCTCCGCGCAGCACAGACACGGGCTCCCCCCATGCCCGTGCCGTGCGCGTCTCCGTGAGACTGCTTAGTCCTCGGCCCTCGGCCCAGGCTTCAGCGATTTTGGGACGAACGAACCCGGATCGGTGGAAGTCAGATCGTAGAACGCGTCGCGCAGGTTCTTTACCTGCGGTTCGCCGATCTTGTTCGACACTTTCCGCATCTGATCTGGGGTGAACACCTGCGCACCGTCTTCGTTCACGATCGCGGCAGAGAGCACGTAATGCCCCGTCTCGTCTTGGATGATCGCGCTGGACGCATTTGAGGCGGCGCGCCGGACCGCTTCGCGAATGTCGTTCTTGTCGCCGATCTCGGCACGACGGCACTCTTCTCGGGCATATCCAGCGGCCTCCTTGGCTGCCTGGTCTGCCTTCTCTGCACACGCCTTCCGGGCCGCTTTCGCCAGCTCTTCGACCTCTTCGGGGTCTATGGCGCGGACGATGAACTCGACCGCCGAATTCTCGAGTTCGGCCTGCAGTTCGGCCTGCTTCTCAAGCAGCTCCGCCTTCTCTGCTCCGGCCGTGAGATCCTGCTCTTCACCGTCATCGGTGAACAGTGCCAGGTCTGCCTCGACCGCCTGCAATTCGCCGGCGAGATCGGCTCGCTGGAAGATGGTCACTTTGGTCTGGGCCGTGCGGATGCCAGCGATGAATGCATCGGGATCAAACTGAGTTGTCATTTCTTCCTCCATGTGAAAGTTGTGGGGTCTCCATGAGGGGCGGTGGGCCCGTGGGCAGGCCATGGAGGAACCTGCCCACGGGGGTATTTCGGGCACAAAAAATGCACCGGCAATGCGGTGCAATCTGTGCGTCAGGATGGATCAGGGCGTGGTTGCAGCGACCTCACAGTTGGGCCACGCATCCTGGTAGGCGCCAACGATTCGCTTCTTGATGTAGCCGGTGCGGTCGACCTGCTGCGGGTTGTCTGTGAGGAACTCGAAGCCCTCGCACTCATCTTCAGCTGCCCACGGGTCAGTGGACTTCTTCGAGGTCTCGCGCTGGTAGATCCAGAGGCGGGTTCCCTTGACCTTGGTGGCCTGGAAAACGGCGTCTCCGACTTCGGCCTCTTCGGTCGGTTCGGGCTGGCCCATGTCGGTGAAGTACCGGAACGGGGTGAACTCGATGGTTGCGTTCGATGGCCCGAAGGCGACCGCTGCGCCTTCGACGCACAGTTCCTTCTCGTCGACCGTCTCGGACGCAGCGAACGCCACGTTGTAGTCGGACGAGCTGATGCGGCACGACGCATCGATGCCGGCGTTGAGCTCGGCGAGGGTAGGAGCCTTCGGATCGGCGGGCTTGGTGGACAGAAGAGCGACCTTGGTGTGGCCGTCGGCGAGGGACTTAGGCATCTTCAGCCTTCCTTTCAGTGTTCTCGGATGAGATGAAGTCGGGAAGTTTGGGCTTCTCGACGGGCTTGGGCCGGTCGGCCTTCGCCTGGGGTGTGACAGCCAGGTGCGGGAAGATGGTCAGGTGCGCACGGGGAACCCGGCCCACCTTCTTCCCGGTTCGGGTGTCGTAGGCGGTGACGAACTGGTCTTCGTCATAGATGTGAGCCACGTGGGCCTCCTCTTGTGTGCGGACATGACAAAGGCCCCGGTTTCCCGAGGCCTACGG
ATAGAATCCCCTCAACAGGAGTAGTTGGGAGGTGAAATGAAAATCGAGTGGAATGACGATGGGATGGACAAGCTCAGCAAGGAAGTTGAGAAGCAGTTCAAAACCGCTGTCGAGGGATTCAACCGATCAGGCTCGCAGTCGGCCAGTGACCTGCAAAAGCACATGAAGAAGAACGGCTTCGACATAACGCTGGCAAAAGCGAGAAGCCTGCTGAAGTAGTCACAAACTCCGGCCAGATAATTGCTATCCGCCTGGCCGGAGTCCGCTACTCATGGTTGCTGGAACGAAAACGACAACGGCAGGTAGTACCTGACGGGACTCACTTGACCGTCGTGCTGGATGGGTATGTGTGGTTCGGCATGCCGGTACTCTCCCCCGCCGGGCGCCGGTATCCGATTAAGTGTGCGGATGACCTGTTGGGCGAGGTTGCGGACCGTGCCGACTGTCGCGCCAGCGACGGTGACGGTGAGCGCCCCAGCGGACTCCTGTGCGGAGTAGTCGAGGGGCTGCTCGTAGTGCTCCCGGAGGGGTTGCGTCCACACGGCCAGATAGGGCTTGATGATGCCGCCTGCACTGGTGGGCACGGATTCGGGAACCATTCCGTCATACACGCCCACGCCGACAGTTTTGAGTCGGTCGAGCGTCCACGTGGTGATGGTGATGATGTCCAGGGCACTCACCCCTTCCATGTACCGTCGAGCATGTTGCCGAGAGCTTTCTCAAACCCAGGTATCCGGCGGTCGAACGCCGGGCCAAGGTAAGGCTGCGGGGCCATCCGGCTCGTGCCCCATTCGACGAACCCTGCGTAGCTCGTGGTGGGTCCGATTTCGGCACCGGTCTCCCCGCTCGTGTCGAACAGGTCGTGTCCGATGGAGCTGCGGAGGTTTCCGGTGTCGACAGGGGCGAATTTCTTCGCGTCCGCGGTGATATCGGCTGCGGTCTTCGCGACTACTTCCCGAGCACCACGCTGGGCGTTCATCCCCGCCTTGGTGAGGTCGTGGCCAAGCGATCGGAGTTCGCTTGCGTCGAACGCTTTGCCGGTCACGCTGCGGCCTTTCTGCTTGCGTATCTAGCAACGCGCTCACGAATGCAAGTGCGACAATCTCGCTGACCGCCGTGTTTTCTCGCAGGCCGAATGTAGGTGTTCTCCGGCGTATACTCGTGCCCGTTCTTGCAGTGAGTCTTGATCGCACTCCGCTGGTTGGCCGAGTTATCGGGCAAGGATAAGAGGCGCAAGTGGCCAGGGTTGACGCAGGCCGGGTTGAAACAGAGATGATCTACCGTCATGTCATCGGGGATTTCCCCGACATGCTGTTCATATGACCAGCGGTGGGCGTAGTACCCACGCTTCCCTGCGAATGTCAGTCGACCGTAGCCGCTTCGATCCAAATAAAGGATCCACTCCCAGCAGCCCGTCTCTGCATTGACGATCCATTTCGGGGTGGTAATCATCAAGCAGCCCTCACTTCCTGTCCGTTGACGTCCTGGCACAGGTAGTCGCGTGTCCAGTTCGTGGTCCCTGATTCAGCCACGAGCACTTGGAATTCCCGCCCGTTCAGGTCCGGGTCGTCCGGGTTGTCGAGCACTTTGATCGTGGTTCCGTAGTGGAGCCATTTCGCGGCGATCGGTACCGAGATTCGGTGGGTGGCAATCACTTCGAGACCATCGGCGACCAGGACTGGTCGGGACGATTGGGTGAGGAACTGCACCGAGCACGGCGCGGACTCCCAGACAGGCATGACTTCCGGTTCCTGTGGTGCGGGCCAGTCAGGAATACCGCCGAGCGTGTACGCGGTGCATTCGCCGGTGAGGAACCCGTACGCAGTGGGACGGTGATGCTCAGACCACCCCGGAGGGATGACCTTGAGGCGGCCGCGGGCGATGCGAGTCATAACCGCACCTCCTCACCCTCTTGCGCGGATCCGCCGTCGAGGTTGAAGGCCACGAAGAACGGTTCATCGTCCTCAGCGTCTTCCTCTTCCTTGGCCCGGGCCCGGAGT
GCTACCGCGTGTTTGCGGAGCGCATCCGCGACAGCCGGCCCGTCCGATGAGCGGTCCTGAGTGGTGATCTTCTTCGACAGGAGTGCTTCGTCCGTGGCGATCGCATCCAGGGCGTCCGCAGCCGCCCGCAACACTGAGTCACGGGCAATGGTGAGGAAACCTTTGATCTGATCGTTGTCGAAGTCGCGGTCATCCGCATTGGTGTCGCCGATGATGAGCCGCACCTGACTCACTGGATCGTTGTAATCAACCATTACAACTCCCTTGGTTGTGCCGTTCCCCGCCCCGCTGCATGGTGGTCACAGTGGGGCGGGGCCCGTGCTCAGATGGGGTCAGGAACCGGTCGATGCGTAGGCGACGAGCGCCTTACCATCAGCGTCCACGAGGCCCTTCGCGGCACCGTCGACGGAACGACCGCGGAAGTCGATCGTGTCATCGAGGTAGGAGCCTTCCTCGAACGGGATCTCACCACCGCCGACCCGGTTGCCCTGGTCGCGCTTGGTGCGAATGTCCACGTTCTCGTGACCGATGAGCTGCGAGTGGATGACTGCCGGGTGGTCGCTCGACTTGTTCGCCAGCAGGTACCAGGTCTTGGCACGCTTGTCGGACTTGTCGATCTTGACCAGCCACTTCGCGACCTGCACAGTGACGAGACCCCGGAACGGGTTCTGCATCTCCGTGGTCGTTTCCTTGTTGCCCTCGGTGACCTTCAGCTTGAGAACGTCCGCGTTGATGATGCGGTTCGCTTCGATCTGCATGGACGGGGGCACGAGAAGCACGAGACCGGTGGTGTCGACGAGGTCGCCACGATGGTCTTCCTTCAGCGCGAACGACTCGACGGCGGCCTGCAGGTTCTCCGGCGACAGGGGCTTGTTGTCCACTGCGGCGAAGAAGTCGGCACGCGGTCCGGTCTCGGACACGAAGGTTTCGAACACGTTCCGGTTGGAACGCTCCGTGGCACCGTTGCCGAGGCGGCGAGGGAAGTCAGCCATGTCGGTGAAGTCACCGGACAGTGCGAGCTCCCACGTGTACCCGAAGCGGCGACCGAACTTGCCGACCTTGTACTCGACTTCGAGTTCGTCGAGGGTGTCGGCCTTGTACTCTTCGCCTTCGCCGACGGGATCGAATTCGGTGTTGCCGAACAGGTCGCGCAGCTTCTTCGGTCGGAAGTCGGGAACCTTCTTCTCGATCGCGAACGCTTCGTACTCCTTGACGGCATCCTTCTGAGCCTGGATGGCTTCGACTTCGAACCCCTTCGACAGGAGGATCGGGAAGTCGCTGGTGGACATGGCCTCACGGAACTGCGCGAGGGCCAGGTTGGTGCCCTTCTGGGCTTCGTTGAACAGCTTGGCGGCGCTGAGCACCTTCTGCTTCATGTTGGGGGCTGTGCGGAATTCCGCCTCAGCCAGCAGATCAAGTTCCATTGTTATCTCCTAAGAGTGTGGTGGGGTCAGGCGCCAGCAGGGGCGGGCGGGACGGCACCGAACGGTGCGACTTCGGCGGGGCCGGTGCCGGTGGCCTTGGCCTGGTTCGACACACCCCATGGGGTGTCGTTCGCGGTTGCGGTGAGACCGCTGGAGCCGAGGTAGACGACCTGCCCTTCGGTGAGGGCGCCGGTGACGTCGATCTCCCAGGAACCGTCGAGCCACACGGTGACTCGGTCGCCCTGTTCGGCGTCGATGAGGGCGACACCGCGGTAGGCGCCGATCGCGACAGCCTGACCGGACTTGTAGGCCTTGTCGGCGGTGAGGGCGATGTGCTTGTTCTCGGGGTACTTCTGGTTCTTCGCCATGTCAGGCTCCCTTCAGGGCTGTGAGTTCGTCGTACGACGGCAGCTCGGATTCGGTGGCAGTGGTGTCGGTGGTGATGGGTCGGGACTCTCCCAGACCACGAACCTGACCGGTACCGGCTGTCTTGGTTTTGTAGTCCTCGGCGAACGTCCGGGCCTCCTTGATCAGGTCGTCCCCGGCGGCGTCGGATTCGGCGAGTCGGTCGATGAG
CAGCTCCTTGACCATGCCGGGGTCGACTCCGTCGAATGCTTCCTCGACGACGTTCTCGACTTCGGCTCGGCGGGCTTCCTTCGCCTTGGCGGCCTTGAGTCGGGCGATTTCAGCCTTGAGTTCGGCGTTCTCCTTGCGGAGTCGTTCGAGTTCACTGCCGCCGCCGTTGCCACCCCCGCTGTTGGGTGGGGTGGGCTTCTTCGAGTCGGCTTCGTCGACCCGGTTGGTGTTCTCGGTGTCAGCCATGCTGGCTGCCTCCTTCGGATTGGTTTCCTGATTTCCATTCCCAACCGGGGCTGGGGTGGTCTTGGGGACGTACACGGTGGTCGGCACGACTTCAGCGGGTTCGCCTTCGAGTGCGATCGTGATGCCGTTGCGCGTGTACTTCTGCTCGAATGTGGAGTTCCCGCCGCGATCGGTTTCGACGTTGAACCACACGGTGGAGTCGGTGTGGTCGGTGACGTGGCACCACCGGTCGCCATTGGCGGCGCGCACAGCCTGGTGGAGGAGCTGACGAGTCTCGTCATTGACGCCTTCGTGCAATCGCGCAGACTCGAGCACCTCGATGACTCGACCGCCGCGTCCGGCCTTGGTTACGAAGTCGACGGACCGTGCCTCGGTGAGCTTCGTGACAACACGCTCACCGCCGGCCTCGCTGACTTCCCCGGATGCGCGGATGCTCACGCCTATGTCTTCGGCCATGTCGGCGATGATGGGCTTCCAGTGGGAGAAGATCTTCGCCTCGGCGACGAGCCCGCCGGATTCGTTGTCCCAGTGGGCGTCTTCGGTGAGAATGCCGACGAGGTTGTGCAGGTCGCCTTCGGGGCGGGACCAGTCCTCGGCTTCGGTGGCGTGGTTGAAGAACATCTGCGTTCCTTTGGGGAACACCTTGTCTCGCCCGGCGGCCTCGATGGTTTCTTTCGGGTACACGCCGCTCGATCCTGCACCGGGTGTGATGATCCCGATCTTGGCGCGCCCACCCGCCTTCGGGGTGACGGTGGCCTTTTCTCGGAGTATTGCGATGGTCATGGGTTCGCCTTCCTGCGGAGGTCTGCGAGTGGGGTTGCCTGATACGAGGGACGCCAATCGGGGTTGTCGACACGGCGGGCCATGTCGGCAAGGCTGATCTGCCCGTCTTGGAGCATCCGGTACCGGTCGGGACCGAGGGCGGCAACAGCGTCGTGAGGGTTGTTCGCTATCCAGTCCTCTGCCGAGGGGATGATGTCGGGTGGTTCGTCGATGTCGAATCCGAGGTCGCGCCACGACTTTGTTTTGGGGATGAACGTGCATCGGCCGTTCTGGTGATCGAACGGGCCTTCGGTGTCGGGTGGATATTCGGTTCCGTGCTTCGCAATGCACGACGGGCACGTGCGAGACGAGAGTTCTGCATGCCACACGACCGCGGTCACAGTCGGGTTGGCGATGTTCTGTGCCCGGTTCGCAGCCCGATGAGCATCCAACTGTTCGGTGCGGGCGATTCTGAGCGCCCGTGTGAGTCCCCCGTTGAACCCTGAGCGTGCCCGTTTGAGCATCTCCCGTGCTGCCCTGTCCGGATGCCAGCCACTGGGGACTGCGCGAATCAGTGCACTCTTCATCGCGTCGACAGCTTCATCCGACAGCGGCAGTGATGAGGCGTGGATCTGACCGAGGGACCGTTCGATGATCTGTTCAAGTGCCGACTGGTCGACACGCGTGAACGACGGGAGCAGCTTCGAATCGACGGGCGGCATCTGGGACGCCGCAACAGCGGTGGTGAGATCCGCGGTGCGTTCGACGACTTCGCGCACCGAATCCTCGAGTAGTGGCCCGAGCTCGGCTGCGAGTTCACGCAGTTTCGCCGCGGTCATCGCCAACGCTCGTTGAGTGCGCGCCAGGTTCGTGATCTGCGCCGGCGTGAGCGGTTCGCCCGCAGCCTTCTTCGCGAGGATGACGGCGATCGTGTCCTGCCACTCCTGGGCAACTTCCTGCCAGGCTTGCGCCCAGCGGGCAGTGAGTGCCCG
GGTGGAGGCGTCAACATAGTCGTCGACCATGGCTCGCAATTCGGCGGCCCGACGTAGCGTCAACTCTGTGATGGCCACGCCGAACCTCCTCTACTGGTTGAGCACTGCTGCGGGATCCTCGCCCCGGTTGTACGCATCCACGGCCGCATCCCCGGCACTGATATTTGGGTCGATGAAGTTGCCGGCGTCGTCGGTGAACTGGTCGAGGATCTCGTCGACATCCCTGACTCCGAGTGCACGGAGTAGGAGCCGCATCGTCTCCAACGGTGGGAGCTTCTGTGTTGCGTCGGCATCGACAATGGCTTTCACCATGACGTCAATGGGAGTGTCGTCGAGTTCCGGCCAGGTGAACGTGATCGTGCGGTCGTCTTCGTTGCCGAGGACAACTTCCTGCCGGCCCGTGTACGGGTCTGCTTTCACCCCGCCCTTGAGAGGTCCGCGGGGCGAGAGGACGGCTTGATCGATGACGTGGTTGAGGATGGCCCGGTAGGTTTCCGTCCACACGTCACGGCGTCCCTGCATCTGCAACCGTGTTGGCTGGTTCAGGGTTTCCGCGACAGCACGTGCGCCGGTCTGCCCGGGGTCGGACATGAGCGTCGTGACAGGGATCCCGAACGCAGCCGCGACCATCGTGGCCAGCGGCCGGCCCGACTCGGCATCGATAGTGGCGCCAGTCTTCGGGACGGCCTCGAGTTGTGTGCCCGCGTCCCCGCCGACGATGCCACCAGCCTCAGTGAGGCTGTTGAGCTGGGAGCGCATCTGCTGGGCGGTGTTCTTCTTGCCCGACATCTTCCAAGCGATCCGGGACAAGGCTTTGGTGAGCTTTGCCCAGTCGTCAAGGAAACCCGTGTACCCGCGAGCCCAGGGTAGGGCGGCGTAGGCGTCTCCGATGCCGTACTTGCCGACCTTGTTGACGGCTGTGTGGTGCACGCTGGCGGACCAGTCGACGGGGTTGCCGTCAATGACTGGGAACTTCCGAGTCGGGTTATACCCGAGCGCCGGATACCAGGTGGTCTTCTGACGAGTGGTGACTTTCGCGGTGCGTTCACCGATCGTTGTTTCAACGTAGTCGCGACGGTAGAACCACGGTTCGGATGCATCCTCGGGGTTAGTGAGGATGTCCGTGACTTCGAGCGGGTCGAGAGTGCGAACGCGAACGAACCCAGTATCGGGCGCGGTGAACAGGATCGCGTACATGTTACCGTCCGTCGACAACTGGCGCTCCAGCTGCTCGTGTGCCTGGTGGCCAGTGAACGCGGCCCGGTTACCCGGGTCGTCAAGGAAAGCTTGGATGACGGTGTTCACGTCCTGCGTGCCCGTCTCTTTGCCATCGGCGGCAACGACACCAATGCCGCCGTTGCCGTGGATATACCCGGTGCGGACGGCAATCCCTCGGGCGATCAAGGGGTTCGCGATGGCCATGACCCTGCAGGCCTGCGCGGCCTGCTTGCGACCCTCAGGTGAGAGGTCGCGTTCCGTCTGGTCACCGAGCTTCGACCACCCGGCGTCCTCGGCGAACATGCGGGATAGCTGCTCGTACGATTCCTCGAGACGGAACGACAGGGTATCGACTTCACCCTTGAGCCGGTTCACTTCCGGGTCCGTGGACTCTTGGAGGCCGAGCCAATTGCGAATACCCATGAGGCCTCCGTTCGGTTAGAAGTAGCTGATAGGGATAGCGTCGTCGGCGAACTCCTCAGCGACGAGCTGTTCCTGATCGATGAGCGGATGAAGGAGCAGGTTATTGATTGCCTGCGAGAGGGCATCCACACAATCGTCATGAACTCCGTCTGGAAAGGCTCTCGCCTCTTCGATCAGGTCATCCACGCCAGGGAATAGCTGTGACGACGGGAGAATGATGTTCTTCGAGAACGCGAGCGGGGACACCGCTGATGCTCTCGACATCTTCGAGCCTTGCGGTTCGACCGGGACGAGGCCGGGTATTGAGTGTTGCAATGCGTTGATGACTGCAGGGCCATTGGCCTTGTTTTCAACAAGCT
TCATTGTGGCTTGGGGCCAGCGTGCCGACATGGACTTGATCGCGTCCACTGTCGTCTTGAAGTTCATGCGTTCGCGCACCTGGTCGACAAGGTAGGCGTTGATACCCACGCGCAGCCATGTCTGCCCGACCACGAAGTCTGAACCGTCAGTGTCGGAGAAAGTGAGATCCCAGGACTGGGCGACCTCGTAGCCATTCTCTGCGAGACCGGGGACGATTCGTACCCCATCGGGCCTCTCGACCCACATCGGCTGCTCGTAGCGCTCCCACTCCTTCGGGAACACGCCACCATCGTCGACCGTGGGGTTTCCTTGATAGAGACTCTGGAACACCTTCGGCCCGACAGCGGTCTTGATCTGCTCCCACTGCTCGACAGTGCGGCGGCGAGCCGATTGCAACCACTCCCCCGGCTTGCGGCCAAGGGGGTCGTTCTCTTCTGCCTGGGCGGGAATGTTGATGACTCGCCAGCGGTCGGCATCCTCGGCCTTCAGGAGCCGCCCTGCGAGGTCGTCTTCGTGCCATCGGGTGAGCACCAGAATTACGGGCGCGCCGGGTGCGAGACGCGTGTTCGCAACGGAGGTCCAGAAGTTCCATGCACGCTCTCGGTAGGTTTTGGAGAACGCTTGTTCGAGGTTTGAGATCGGGTCATCGATGAATAGCGCGTCCACTGCTTTACCGGTGAGTCCACCGGATAGTCCGACTGCACGGACGCCACCTTGACGGCCGGCGATGCGGAAGCGACGGATTGCGTTATTGTCCGCGGCGATGCGCAGCCCGAGGTCGAGTGAGCCTTCGTCGCCTTGGTTGTCCTCGATGTGGTTGCGAATGTTTCGTCCGAACTCTTCGGCGAGGTCTTGGGCGTAGGACACGACTGCGACACGACGGTCAGGGTTGCGGGTGAGGAACCAGAGTGGCCCGAACGTGGTCACGCGGGTGCTCTTCCCCTCCTGGGGTGGCAGGTTCAGGATGAGCCGGTCGTTGCGACCTGCTTCGACATCTACGAGTGCTTGGTCGATGACATCGAGCGCAGGCGTCTGCACGGTATTCGGTTCAATCGCTCGGGCTAGTTCGCCAGGGGTTTCCCAGCGGTGCAAGTCCTGGCCCTGCGACTCGAGCTTATCGGCCAGTGCTGTCATCCAGTCGACCGACATGCTCACCTCCATGAAGTTGACCCCGCTTTAAACGAGTACGGGAAACTCGAACCACGGCGGCAAGGTAGCGAGCCCGGTGCTGTGGTTGGTAAGGGTGAGACCCCCGGAGCCAGCAGCAGCGCGGCTTCTACGGGGGTCTCAGGTGGTGGGTAGAAAAGTGACAAACACCCCACCACGATTAGGGGGCGGATGCGGTCAGTCTGGGGTTTCCGTCCGATACTTGACTGGCATCCGGTTTCTTCGTTCTCGTTCGTCCTGCCGGACCTTGCGTGCACATTTGGGGCAGGCACCGTCTTCGTCAGGGTTGAACGCGAGTGGATATTGGACTTCGACGAGAGTCCCGCAGGCAGCCTTCACTCGATCGTCGACGTTGATCTTCGTGAGGGGTTTGTATATTCCACCGACCGTGCCCATGCGTACGGCGTGGAATTCGTCTACCACTTCGGGCAGGAATTCTTCTTCCGGGTAGCCCACTCGTTTCGGCATGGGTTTCGCGAAGCGGGCCGCTGCATAGAATCGGGGAGTTGGTTTCCAGTCCAAGTCAGTTCCTCACTGAACTACACGGATGTAATTTACCCGCTGACGCTAGTCTAACATTTGATTGTCCCATCGGGAAGAAGCGACTCACCGTATCACCGGGTTCTTTTTCCGCGCCTGTAGTTCCGCTTTCGCAATGTCTTCGACGCACATCATGAGTGGTAGCCGGCGGGAAAGTCGGGCTTCCCGAATTACGTACTGTTCGCGCCACCTGTTGACAGTTGAGCGGGACACTTCGAATAGTTCGGTTGCTTCCTGCATTGTCACCCATTTTTTCGTCATGCTGCTCGCCCCTTCGCTGCCTGCTCAGCTC
GGTGTTGCAATCGCTGTTCGGCGATTTCGGTTTCCGTGTGCCACCTGCCGCAGGACTCGCAGACGAACGTTTCGTCGAATCCGAACGATCGGGGTGGGTGCATGTACACGAGCATCACTTCGCAGGCGATGCATCTGACTGGAAGGTGCTGGCGGGGCGCTTCTGCAGGGTATTTCTTGGTTGCCTTCAACCATCGTTTGATGATGTCGCGTTGGATACTTTCAGTGAGGTCATTATCGCGAACCATGGGGAGTTGGTGTTCGATGAAGCGTGCGAGGGTACGGGATTGTGCGAATGCGGTTTCTGGTGGGGTGTTGGCTGGTAGCCCGTTTTCTCGGTTTTTGGCCCAGGCGATTGTGGTGGGTGGTTTCGTTCCGAGCTCGTCTGCGATGCAGGCGGCGTGCTCGCGGAGTGTGGCGTAGAGGTCGTCTGCGTCGTCGACTGGTCCGAGTTGGATGGGCGCGGTGAGTTTCGCTCCGGGTGCTGCCCGTCCGGTGGTGCCTGATCGTCCTGGTTCGAGGTAGCCGCGGAGTTCCCCGATAAGGCCGGGGATTTCTGTGAGCGCGGCGTGAAAGCCTGGGTGTGGGTAGCTCATTCGTGTGCCTCTCTGCCGTCGAGGGATCGGTGGGTGATGATCCAGAAGTCTCCGTGCTCGTCGGGCGCGAATTCGACGTGTGGACCGCAGACGCAGTTCTCGCCGTCTGTGTGTTCGATGTGATCGTCTGCGGGGATGATCTCGGCGTAGTCAGGCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP050153|2710856:2733531|2730434_2731913_-|WP_167200985.1|terminase|DBSCAN-SWA MTALADKLESQGQDLHRWETPGELARAIEPNTVQTPALDVIDQALVDVEAGRNDRLILNLPPQEGKSTRVTTFGPLWFLTRNPDRRVAVVSYAQDLAEEFGRNIRNHIEDNQGDEGSLDLGLRIAADNNAIRRFRIAGRQGGVRAVGLSGGLTGKAVDALFIDDPISNLEQAFSKTYRERAWNFWTSVANTRLAPGAPVILVLTRWHEDDLAGRLLKAEDADRWRVINIPAQAEENDPLGRKPGEWLQSARRRTVEQWEQIKTAVGPKVFQSLYQGNPTVDDGGVFPKEWERYEQPMWVERPDGVRIVPGLAENGYEVAQSWDLTFSDTDGSDFVVGQTWLRVGINAYLVDQVRERMNFKTTVDAIKSMSARWPQATMKLVENKANGPAVINALQHSIPGLVPVEPQGSKMSRASAVSPLAFSKNIILPSSQLFPGVDDLIEEARAFPDGVHDDCVDALSQAINNLLLHPLIDQEQLVAEEFADDAIPISYF >NZ_CP050153|2710856:2733531|2733369_2733531_-|WP_167197963.1|DBSCAN-SWA MPDYAEIIPADDHIEHTDGENCVCGPHVEFAPDEHGDFWIITHRSLDGREAHE >NZ_CP050153|2710856:2733531|2724229_2724670_-|WP_167197948.1|DBSCAN-SWA MTRIARGRLKVIPPGWSEHHRPTAYGFLTGECTAYTLGGIPDWPAPQEPEVMPVWESAPCSVQFLTQSSRPVLVADGLEVIATHRISVPIAAKWLHYGTTIKVLDNPDDPDLNGREFQVLVAESGTTNWTRDYLCQDVNGQEVRAA >NZ_CP050153|2710856:2733531|2732743_2733373_-|WP_167197961.1|DBSCAN-SWA MSYPHPGFHAALTEIPGLIGELRGYLEPGRSGTTGRAAPGAKLTAPIQLGPVDDADDLYATLREHAACIADELGTKPPTTIAWAKNRENGLPANTPPETAFAQSRTLARFIEHQLPMVRDNDLTESIQRDIIKRWLKATKKYPAEAPRQHLPVRCIACEVMLVYMHPPRSFGFDETFVCESCGRWHTETEIAEQRLQHRAEQAAKGRAA >NZ_CP050153|2710856:2733531|2722814_2722997_+|WP_167197939.1|DBSCAN-SWA MKIEWNDDGMDKLSKEVEKQFKTAVEGFNRSGSQSASDLQKHMKKNGFDITLAKARSLLK >NZ_CP050153|2710856:2733531|2725110_2726202_-|WP_167197952.1|DBSCAN-SWA MELDLLAEAEFRTAPNMKQKVLSAAKLFNEAQKGTNLALAQFREAMSTSDFPILLSKGFEVEAIQAQKDAVKEYEAFAIEKKVPDFRPKKLRDLFGNTEFDPVGEGEEYKADTLDELEVEYKVGKFGRRFGYTWELALSGDFTDMADFPRRLGNGATERSNRNVFETFVSETGPRADFFAAVDNKPLSPENLQAAVESFALKEDHRGDLVDTTGLVLLVPPSMQIEANRIINADVLKLKVTEGNKETTTEMQNPFRGLVTVQVAKWLVKIDKSDKRAKTWYLLANKSSDHPAVIHSQLIGHENVDIRTKRDQGNRVGGGEIPFEEGSYLDDTIDFRGRSVDGAAKGLVDADGKALVAYASTGS >NZ_CP050153|2710856:2733531|2723454_2723757_-|WP_167200982.1|DBSCAN-SWA MNAQRGAREVVAKTAADITADAKKFAPVDTGNLRSSIGHDLFDTSGETGAEIGPTTSYAGFVEWGTSRMAPQPYLGPAFDRRIPGFEKALGNMLDGTWKG 
>NZ_CP050153|2710856:2733531|2712892_2714080_-|WP_167197916.1|DBSCAN-SWA MASATANYTLATVTREELGERQSDAGKTFTVAADIHLDAKQTGALWNNARQINITTFGANGSLNAHFVDSVSAPNEPGTTRLKVTFTLPADCRGWAIRLTNGSTTQPVWWDCLSIEEGTTSGAYFDGDSSPASGRKLRWTGQPQRSSSEEFTPEVPAIPAQEATTAKVRFERQDGSLVRSGSPAWAPGGYAHAYDHEATLGGGTLWRAVPIRFDGTEGEPSEYVGAVIPGLSDGWAWLKPVTDPDKAMLLKVEPPDLSYDGRVDTTSIAGAPRPAATWDILEGFTTQLRSLVREKDVEEQLMALLDEGPVLAQFPACAGLPDFYFIRGKLDLEHVHWDGWPWRRYQWDAIQIDRPDTTDSPLRIPGLSYDTVARDWTTYDQLAATVESYNKLLEP >NZ_CP050153|2710856:2733531|2727803_2728856_-|WP_167197956.1|head|DBSCAN-SWA MAITELTLRRAAELRAMVDDYVDASTRALTARWAQAWQEVAQEWQDTIAVILAKKAAGEPLTPAQITNLARTQRALAMTAAKLRELAAELGPLLEDSVREVVERTADLTTAVAASQMPPVDSKLLPSFTRVDQSALEQIIERSLGQIHASSLPLSDEAVDAMKSALIRAVPSGWHPDRAAREMLKRARSGFNGGLTRALRIARTEQLDAHRAANRAQNIANPTVTAVVWHAELSSRTCPSCIAKHGTEYPPDTEGPFDHQNGRCTFIPKTKSWRDLGFDIDEPPDIIPSAEDWIANNPHDAVAALGPDRYRMLQDGQISLADMARRVDNPDWRPSYQATPLADLRRKANP >NZ_CP050153|2710856:2733531|2724666_2725032_-|WP_167197950.1|DBSCAN-SWA MVDYNDPVSQVRLIIGDTNADDRDFDNDQIKGFLTIARDSVLRAAADALDAIATDEALLSKKITTQDRSSDGPAVADALRKHAVALRARAKEEEDAEDDEPFFVAFNLDGGSAQEGEEVRL >NZ_CP050153|2710856:2733531|2711730_2712882_-|WP_167197913.1|DBSCAN-SWA MFPVSDRWLEALSLARYEPVVQWSPDRGTTWSDLTLHDGSITAASTSQVRWTARGLIISGADIGRRALSPYGARIRVFMRMHYDRHTIETVPLGVYRVEEVSQAGLRPGRAQVDGLSLEAQVQDERFHQPRTLAIGTGQYWATTLIKEVLPEVGMSWRLGDTNIPQLVEERDRWGLIDGRSRDPSIAKSLGGRVFCDSRGSFVAAPVPTLEDPTVWELAAGPGGALVEPQQTLSRDGVYNQIVASGASENGQPPIGPSVASDDDPVSPTYYKGPFGAVPLFYTSKLITSLQQCQTTALGLLAPRLGLKQKVSVSSLLNYALEPDDVISVTMPDETVENHIIDSITFPLTGGTMSMQTRSTTSPAGGRITIEGDNDDYGEGNFE >NZ_CP050153|2710856:2733531|2723047_2723458_-|WP_167197942.1|DBSCAN-SWA MSALDIITITTWTLDRLKTVGVGVYDGMVPESVPTSAGGIIKPYLAVWTQPLREHYEQPLDYSAQESAGALTVTVAGATVGTVRNLAQQVIRTLNRIPAPGGGEYRHAEPHIPIQHDGQVSPVRYYLPLSFSFQQP >NZ_CP050153|2710856:2733531|2723819_2724230_-|WP_167197946.1|DBSCAN-SWA MITTPKWIVNAETGCWEWILYLDRSGYGRLTFAGKRGYYAHRWSYEQHVGEIPDDMTVDHLCFNPACVNPGHLRLLSLPDNSANQRSAIKTHCKNGHEYTPENTYIRPARKHGGQRDCRTCIRERVARYASRKAAA 
>NZ_CP050153|2710856:2733531|2710856_2711738_-|WP_167197910.1|DBSCAN-SWA MSDAESDLAALVRAEVERQMNPFQIMQVSSVREDGKVNLQWGETIINDVAANQAYNPRAEGDVVLVLQHSAGWRVMDKIGGPVEVDIPEPVDLTFGDGLPAGDFVQVNSVFMKDGAIYGLIGEGPAPGPGEPPRKSKPKPVTLDPASTAAYRDGRRDGSRPAQGAWPSYPHPYSSVFLYGSRIAAACEGKTVDSMQIRVARTSKYHGVSGKVRPRLGLHDETSPPSKTPKLTNRWDGPGLGMGDSKWITIPASQAARLASGASRGVGIGAGTGKSNYMIATEGSGNLRIKFKT >NZ_CP050153|2710856:2733531|2714391_2720568_-|WP_167197919.1|tail|DBSCAN-SWA MAEVRNVSALLSVKVNQFKADMAAAGKAAVDAANKTETAWDKSQTGAGKAIQKIGQYRSEISEIGGAMATFGGIVVGAMGLATKAAMSWQSAWTGVLKTVDGSPKQLAEVEAGLRGLARELPATHEEIAAVAEAAGQLGIETGNVVSFTKTMIDMGESTNLSAEEAATSLARFSNIMGTNQKDIGRLGAAIVGLGNNFATTESEIMEMAMRIAGAGKQANMSEGDVLGLAAALSSVGIDAEAGGTAISMVMKKMGNSVSEGGDAMAGFAEVAGMSSKEFSAAWKDDAAGALSTFIAGLGDAQAAGQNVNGALKDLGITGIRESDALLRLSNASDVLKDALASGNEEYANGIALMIEANKRYDTAESKIRMAGNSIKDSAITIGGTFLPVVAEGSKAVAEFAEWIGKLPKPVLEVGAGLGAIGGAASLAVGGFLLLAPRVFDIIEGFNTLRTQFPDLTTKMGKFGKSAGIAAGVIAAVATAAVAASEALKATERTTSSSNFMEAIVMSGGNAEEILKGVNREMEDVSKVGEKTGKTFTAVDDLTGAFNRLYNSTFSDKINTALGWIPGVESGADKAEAAFRELDTAVTDLASGGSLEAAQAGFKAVMYDGRAAGVSLQETFSEFPEYRKQLETIAESSGFLTNSMSEQEKEAVLLKIAYGEITPAMIEAGAAANGNEEDLKALGVSAQTTEEKIASLADEIRGFGSETLDARAATRAVEAGWDDLNEAIKSGKTSLDDTTESGRKNNEMLDAQIRNINDSTASNIEEGQSVEEVTKKHKKQRDEMVRVATAFFGSEKKAKAYVDQLLKTPDEIETDVKVNTGDATAKLKKTKQWMDSIKDKTVWVRVNEVHIRKEGDETVYSSDPKNKSVQKFKYFGGIDLMPMAAGGVHNNIAEMVKPNTWRIVGDRMDVDEAFIPLDGSRRSWKIMMEAISRMPGAMPMAKGGIASAEKSVDAAQDRLRAARREKQDAKSKKAKAQADRRIRIAEDELESAKKSLKAAKEKQKADEKAAKIAADRAKEERERKARVNDLRTDLRVDLRRGNIRDQVTGGLSGGYSAVDRLYDLGSNQDLSRGSRSRANSSARKFEANLRSLYAQAERIDNRLKKAQDKATELKGIKDSVASGLLNDRDLDVSTSVVSTGDGNFKQVSSLSSMTGKTARLKELVGKLGKLAKMGIPGALLQEVASQGIEGGNSAADALLNAPLGERNQLVSSWAEFDKYANQAGQFVTEGFYKGGVNAADGVVKGLQGKQKNVEAAIANLAKAMESTFKQVLGIHSPSRVTGEIGEFTAEGLVQGMLGGVSDVQSAAAMLGAAAVPSVGAMSMDVGVTPVVADDEGAAGLAMQDMSATTLDAMEQMNLSVSEGFAGMLANIQTAQAGMLLSTQENQLGMLTNTQTQNAAMLLDTQTQQEAMRATVAAKQAGQLSVSTEQQELMRLMLIDKQSQMKTRSETDFEALKNTTGSKFSTMRTNTDTTMSGLHGDYSTRLANLKSLNKTGFESLLSTSNKNMEAIREGIDGEMAAAKPELGGRLNALIGVLSSFTASVNKAFGDVGVKLS
APAALKFASGGVMPGYTPGRDVHQFYSPTAGNLHLSGGEAIMRPEWTRAMGGEAGVKAQNDAARQGRLDDLLHAQSQAFATGGVFGRVPGVNAFADAGVWRNLWAITRDQFPNARLTSAYRGGSRTVSGNASYHSRGMAIDVSPSMDIFNFWRNKYGANLAELIYSPANGKQIKNGSNHYYTGAVRGMHFNHVHIAARQALSDAMAGGLPGMGGEMSHPFLDRAGVTAGSDLQASYAKAAEKLTQQIYAKHAKQLPDGIAGQLGKGIMSQVSEGLVGKAKEYGKTTAISGTTAGDPAVKAAVRKIAEQMGWGKYWGDIDWLVNKESSWNPNAANPSSSARGLFQKMTSLHGPVEDTVEGQARWGLNYIKSTYGNPAAARAHHQRSNWYEGGTERAKRGLAVVGEEGPELVNFQGGEQVMSTADSMKFMAANRTYIPNTAPAFDAATLSSSIASAVKGMQVTPEQIATAFDGVPLQLVMRDGTQMDAYMKKQIGQVAVGAGRAH >NZ_CP050153|2710856:2733531|2720615_2720882_+|WP_167197922.1|DBSCAN-SWA MTTAIAGKSSDATYRKLKKASAILLLIGVVAMAVGVVAYVVQVGAVQSGMGGEGAMLFATYGMLFGLFGGFLVLFASLVVRIVAAIKR >NZ_CP050153|2710856:2733531|2721193_2721856_-|WP_167197930.1|DBSCAN-SWA MEEEMTTQFDPDAFIAGIRTAQTKVTIFQRADLAGELQAVEADLALFTDDGEEQDLTAGAEKAELLEKQAELQAELENSAVEFIVRAIDPEEVEELAKAARKACAEKADQAAKEAAGYAREECRRAEIGDKNDIREAVRRAASNASSAIIQDETGHYVLSAAIVNEDGAQVFTPDQMRKVSNKIGEPQVKNLRDAFYDLTSTDPGSFVPKSLKPGPRAED >NZ_CP050153|2710856:2733531|2721985_2722513_-|WP_167197933.1|DBSCAN-SWA MPKSLADGHTKVALLSTKPADPKAPTLAELNAGIDASCRISSSDYNVAFAASETVDEKELCVEGAAVAFGPSNATIEFTPFRYFTDMGQPEPTEEAEVGDAVFQATKVKGTRLWIYQRETSKKSTDPWAAEDECEGFEFLTDNPQQVDRTGYIKKRIVGAYQDAWPNCEVAATTP >NZ_CP050153|2710856:2733531|2722505_2722721_-|WP_167197936.1|DBSCAN-SWA MAHIYDEDQFVTAYDTRTGKKVGRVPRAHLTIFPHLAVTPQAKADRPKPVEKPKLPDFISSENTERKAEDA >NZ_CP050153|2710856:2733531|2720882_2721104_-|WP_167197925.1|DBSCAN-SWA MRDRAFALALTLYEADLCNGCGQPMSLSSGEHPHDYDIRDTRCMACAELEEFQQSEKSKNLEPGTKTYVVVDE >NZ_CP050153|2710856:2733531|2726571_2727807_-|WP_167197954.1|DBSCAN-SWA MTIAILREKATVTPKAGGRAKIGIITPGAGSSGVYPKETIEAAGRDKVFPKGTQMFFNHATEAEDWSRPEGDLHNLVGILTEDAHWDNESGGLVAEAKIFSHWKPIIADMAEDIGVSIRASGEVSEAGGERVVTKLTEARSVDFVTKAGRGGRVIEVLESARLHEGVNDETRQLLHQAVRAANGDRWCHVTDHTDSTVWFNVETDRGGNSTFEQKYTRNGITIALEGEPAEVVPTTVYVPKTTPAPVGNGNQETNPKEAASMADTENTNRVDEADSKKPTPPNSGGGNGGGSELERLRKENAELKAEIARLKAAKAKEARRAEVENVVEEAFDGVDPGMVKELLIDRLAESDAAGDDLIKEARTFAEDYKTKTAGTGQVRGLGESRPITTDTTATESELPSYDELTALKGA 
>NZ_CP050153|2710856:2733531|2728868_2730419_-|WP_167197959.1|DBSCAN-SWA MGIRNWLGLQESTDPEVNRLKGEVDTLSFRLEESYEQLSRMFAEDAGWSKLGDQTERDLSPEGRKQAAQACRVMAIANPLIARGIAVRTGYIHGNGGIGVVAADGKETGTQDVNTVIQAFLDDPGNRAAFTGHQAHEQLERQLSTDGNMYAILFTAPDTGFVRVRTLDPLEVTDILTNPEDASEPWFYRRDYVETTIGERTAKVTTRQKTTWYPALGYNPTRKFPVIDGNPVDWSASVHHTAVNKVGKYGIGDAYAALPWARGYTGFLDDWAKLTKALSRIAWKMSGKKNTAQQMRSQLNSLTEAGGIVGGDAGTQLEAVPKTGATIDAESGRPLATMVAAAFGIPVTTLMSDPGQTGARAVAETLNQPTRLQMQGRRDVWTETYRAILNHVIDQAVLSPRGPLKGGVKADPYTGRQEVVLGNEDDRTITFTWPELDDTPIDVMVKAIVDADATQKLPPLETMRLLLRALGVRDVDEILDQFTDDAGNFIDPNISAGDAAVDAYNRGEDPAAVLNQ >NZ_CP050153|2710856:2733531|2726228_2726570_-|WP_143924201.1|DBSCAN-SWA MAKNQKYPENKHIALTADKAYKSGQAVAIGAYRGVALIDAEQGDRVTVWLDGSWEIDVTGALTEGQVVYLGSSGLTATANDTPWGVSNQAKATGTGPAEVAPFGAVPPAPAGA |
23 | Brevibacterium_phage(77.78%) | terminase,tail,head | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
2737669 : 2755216
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP050153|2737669:2755216|DBSCAN-SWA ATCATGCTGCCTCGATCGCTGGTCGCCAGTGCCGGTCGGACACGGACGCGAAGTGGCCCTCCCACTGCAACGTGATGGCTTCTCGCATGCCGTGCCGGTTCTTCGCCACGAGCATGTTGAGTTCGTCCGGGCTCTCTTCGTCCCGGTGGAGCAACAGCACCACGTCAGCGTCCTGTTCGATCGCACCTGATTCGCGCAGGTCCGCAAGTGTGGGCTTCCCACCGCGGGCGCTTTCACGGTTGAGCTGGGACAGGACTAGGACGGGCACGTCGAACGTCTTCGCCATGACTTTGAGCGCTCTCGTCCAGCGTGCGATTTCCTCGTGGCGCTTCTTGTCACTCTTGCCCGTCGATTCCATCAGCTGCAGGTAGTCCACGACGATCATCGACAGCGGGTGCCGGCGTTGGATGGTCCGTGCCTGCCTTCGGATGCCTTCAACGGTCTGGGCAGGGTTGTCGTCCACGTAGAACCGATCACGGTTCGTGGCCTTGCGATTCGTCAGCCGCTCCCACTCGTTCGGAGTCATGTTCGAGTCGAGGATGTTGCCCAGCGGAATGTGGAGCAGCTGGGAGAACACGCGCTTGTGGATTTCTGACTGCGACATTTCGAGGGTGCACATGAGCACCGATCCGCGGTGGAGCATGTTCACGGCCGCCTGCAGGCCCACGAGCGTCTTACCGCTGCCCGGGCGTGCTCCGATGACGTAAAGCGCTCCTGGTCGCCATCCTCCGATGAGGTGGTTGAGATCCTGCCATGGGGTGACCGTGAACCTCGGTGGTGCGTTGAGTTCCGCGATGGTTTCAGCCTCAACTTCTGCCATGGATCGGATTTCGGTGGTGCTGACGTTCCCAGCATCGTCGATTGCTTTGCGGGCGACTTCGATGACCTGGTTGATGTCATCGACTTCGTTAACGGCTTGACTGATCGCGGTAGTTGCGTTGATGAGGCGGCGCCGGATGGAGTGTTCGCGGGTGATGGATTGGTATTGGCCGATGAGCGCACTGGTGGGGGTGTTGTGCATGGCTTTGGCCAGGTAGGCGGGTTCGAGGAGTCGCTTCGTCTCAGCGTTCGATGTGGCGATAGCTGATCCGGTGGAGACGAGATCAACGGCCTGACCCTTGGCCCAGAGCGCCTGCATGAGCTCGAAGGCTCGGGCGTTGCGGGGGCTGGCGAAGTCGTCGGCAGTGAGGGTGATTTCTTCGAGGACTTTGCCGCCGGTCATGAGGATGGAGCCGAGGACGGAAAGTTCGGCTTCGTCGGTGCTGGTGGGTTCGTTCATCAGAATGGTCCTTCCTTTGACCAGAGGTCAGTCTGCGGTGGTCGTGGTTGTTGTTCGGCTGTACGGATGAGCCAGTTGGTGAAGGCTGCGTTCCAGTTGGCTTTGCGTTCGTCTTTCGACTCAGCCCAATTTCGGAACTTCTCAACCTCGGCAGGGAGGTTGATTCGGTGTTCTGAGGCTTTGGCCCGATGAGTGTCTGTGGGCACCCAGTCGGGCGAGAGTGGTCGTGCTGGTCGCTTGCGACCGGTCGCACTCTCTATAGGGGTCGGGACGGGTCGGGTCGGGGGGACCGTGACATCACCCGTGACTAACGGTTGAACTTCGCCGTGAGTCACGGTGTTAGTCACGCCGTGACGTTTGCGTGACGATGAAGCCTTGTCCCGTGCCCTTTGTTGCCTCTGACGTGCGCTTTCACGTTCTGCGAGCACCTGTTGACGTGATGGTTGATAGTCCAGCCAGTCGTGAAATTGGTAGCCGTTTGGGGTCTCGTCCCATAGGCCCGCTTCAACTAAGCTACGTGCGTCCTTCGTCGACGCTCCGAGCATGAAGCATGTCTGCTTCGTCACCGCTCCGTCCGTCAGTTGCGCCATGCACCATGAGAGTGCAGACACCCAGAGGGCTTTCCCGCCCTTGCTGACGGTCAT
CCATTTGGGTGACGAGTAGAGCTTGTCGTCGATCTTGCCCCACGGCATTTATGCCGCCTCCTCGAGTCCGACGAGTTCGTCGATCTGTGCCCCCTCGAGGGCTGCAATGGCGAGGTCGACGGTGCGAGCGTTGGGCAGGCGGTGTCCGAGGATGGACTGGGCGAGGGAGATTCGTTTGGTTGCTTCGATGGCGGTGAGGTTTTCCATTGTCATTCCCTCCCTGGTCGTGGTGGGAGCAGTCCGAGGGTTTCGAGGACGTCTCGTGCTTCGTCTGCGGGTAGCCATGATTGGACGACTGCGGTGGCTTGTCGGCGTTCGTGTGTGGTGTTGGTGAGTTTCTCGAGCGAGTAGGTGCTGAGGCTGGCGCCGGCGGGGGTTCGCGTACCCGATGCTGTTCGTCATGCCGCTGCTCCTGTCTTCACGGCTTCGGCATAGGCGTTGAGCTTGTCCGGTTGGTATCCACCCCAGTCTTCGACGGTTCCGTCTGGGTAGGTGACCGCGACGACAGGGGCCTGCATGTAGCCGAGGGCTTTGACTGCCCGGTATGCGTCGAGGTCTTCTGTGAGGTCGATGACGTCGTGCTTGATGCCGAGCTGGTCGAGTTTTCTGGCGGTGGCTGCACTGGACGCAGTTGGGTTTGGAGTACAGGGTGAGCAAGGTGTGGTCCTTTCAGAAGGGGACGTGTTGCGGAATGAACTTCAGGTGGATGCGGTACCGGTCTTTGTCCGGTGCTTTGGGTCCGCGGCGGTAGTCGGGGCCGAGGACGTGTCGGCTGTCGTCGTCGGGCCATGCGCCGGCGTCGGTGAGACCGTCCAAACATGCTTTGACCGCAAGTGATGCGTTACCTGGGTCGGCTCGGCCGCCTGTGCGGAATCCGATCTCGGCGATGACGATCACGGGTGTCGGGAGGACGAGTTTCTGCTGCTTGGCTGTGTAGCGTGCGAGTTGCCGAATGTGCTTTGTGCGTTTCGCTTTGTCTGCCCAGTGCAGTCGGTCGTTGTCCGAGATCCAGTACGCCTTGTGCACGTCGATGGCTAGCTCGGTCATGCGGCCTTCCTTGTCCGATAGGCGGCTGGCCGCCAGCATCGTTCAATGAACTCCAGCCCCTCCGTGGTCTTCGCGAACTCAGCGACCTCGGGCCACGTGATCGGATCCTCAGCACGCTCCTGCACCGGTTCCGGTTCGGGGCCCATCCCAGCCTCGAACAGGATCGCTTCCGCGAACGTGCGACTGATCTTCACTGACTCGCTGCTCAACGCTCGCCATATGGTGCGCGGGGTGACTCTGAGACGGCGTGCCAGTTCCTCCTGCGAATGCTTCGCGCTCAGTTCTTCGAGCGCTGCCCTGTACGGGGCGGCGGGAACGTAACGGGAGTTCGGTTTCATGCTTCACTTCCTTGGTGTGACCGTCTTGAGGACGGCTTTGATCTGTGTGTCTGTGCATCCGTGCTGCTTCGCACGGAGGATCGCGCCGAGTAGGTCGGTGGTGTATATCGGCGGGTTCTCTGTAAAGCCCATGTGCTGCTGGTTTCCGAGCTGGAAGAGGCTGATTGCCTGGTTCTCGCACTTTCCGGCGATTGACCCGCCATAGGGCCAGTGCCCTGACCCCTTGAGGTGGTCGGCGAGAGTCGGTAGCCCGGACTTCGACAGCGTCCACCTGTGATGGCAATGGGGGCAGGTGGTTGATGCGGCGGGCTCACGCTCGTCATCGAGTTCATCGAAGATAGACAGCTGCCCGGCGATCATGATGCGATCTGGCGTGCCTGGTCGTTGCGCTTCAGGCGGTCGACATGTTCGAGTTCCCCTGCTCGCTGCAGTGCCCTCCAGGCGGATGCTCGCGATGACCATCCAGCGCGAACGTAGGTGTCCTCGTTGTCGCCGTACCCATCTCGCGCCAGCAGCTCGAGACGGTCAAGCTTCTCCAGCGTGCTCGCGGCACGGGAGTCGGTGACTTCGTTGCCGATGTCCGCCTTCGGCACCTCGGTGGGATCATCAATGTCA
TCCCATGCCGCGGCAGTCACCCAGCCTTTCGCGGCGGCTTCACGTTTCGCTCGAGTGATCCCAGCCTGCTCGAACCTCGTCGCGGCGACCGGTGGGGTACTCCAGTTCTCGTTGAAGAATTCTCCGATCTGCTTCGCCGTTGACACGCGAATGCCGCCACGTTCACCGTGGATGACCTTCCCGAAGTTCCCTGGAAACATGTCGAATTCCCGGGCGAGGCGATGCTGTGACCATCCGATGGCGGCGAGCGCCTGTAGACGTCGGACACTGCCGAGGTTGTCGACGATGGACCCGTCTGCCATGTCTGGGGTGACTGCCAGCAGCTTCGCCTCCAGGGCTTTGCTGATCTGCTTGCGGATCGGTCGGGGGTCTGTCCCGCCACGTCCGTAGAGCAGCGGGTACAAGGTTGACGAAGTGATCCCAGCGGCTTTCGCTACCCGCTTCCATCCCATGCCGGCCTCCATAAGGCTTCGGACGTGCTGTCGTGCTGGTTCTGCATCGGTGAGGTGGTAGCGGCCGTACAGCTGGTCTTTGCGTCGCTGGCGTGCGTGCTGCATGGTCACTTTGCGGCAGTGGTCACATCGACAGCGGTGCACGACGTACATGGTGTAGGTGCCGTGCTCGTGGCGTGCCTGTTTGCAGTCGCACACCGGGTTCATTGCCCCTCCATGAGGTCCAGGAACTCCTGCGGGAGTTCGGGTTGATAGCCGTCGAATGCTTCGATCGTTTCGGCGTACTCGAGCAGCTGGGAGTAGTCGACGATCTCGGTTGGGTGGAGCTGGATTGTCAGCGACGATCGATCTGCGTCGTGAGGTTGCCGGGCGACTCTCGCTCTTGTAATGCGACGGTCGAATGCTTCGCTGGCCGTGACGTTCGCCTGCTGGGTGATTCCGACTACCTCACGCCCGCCGGGATGTCGTGTCGTGATCTTGATGAAGCTCAAGCCGCGTCCTTTCGTCGTTCGCGTTCGTATTCGCGACGGTGTTTGAGGCGGTGGATTTCGCGGTCGTCGTCGCAGTGGGGTGATTTCGTGATGACGGGTGAACCGCAGTCGGGGCAGATCTTCCCGGGATCGAGTTGGGCACGTTGTGTGGCGGTGAGGCCGCCGCGGATCCCGAACCGTTCGGTTGCGGGTTTGCCGCGCTCTTCCTCGAGTGACCTGCTGGCACAGATTGCGCGGAGTGGGCACGTCCAACAAGTGGCGACGGCCTGGGCCGCCCGGTCGGGGGTGGAGGGGAAGAACAATTCGTCTGCCACCTCCGCCCCCAACTGGGCACAGACCGCTCCCTCCCAGTCGTCAGAATGGCGGAAGGTCTGCATCGGCTGCTCCCGGGTTACTCCCCCACGCTGGTGATTGTTGCGCCTGCTGCTGGTTGCCCCAACCGCCTCCCTGCTGGGACTGCTGTCCACCGAAGTTCCCGCCGGACTGCTGACCGCTGGAACCTCCGCCGCCTTGAGTCTTCGTGACAACGGCAGTCGCACGGCGAAGGGATGGTCCCACCTCGTCGACGTCGAGTTCCATGACGGTGCGCTTCTCGCCTTCCTTCGTCTCGAACGAACGGGACTTGAGGCGGCCCTGCACGATGACGCGAGTTCCGCGCTGTAGTGATTCCGCACTGTTCTCGCCGAGCTCTTTCCATGCCGAGCAGCGGAGGAATAGGGTCTCGCCGTCGACGAACTCGTTGCGTTGCCGGTCGAAGATCCGCGGCGTGGATGCGACAGTGAAGTTCGAGACCGCGGCTCCGTTCGGAGTGAAGCGCAGTTCCGGATCGCTCGTGAGGTTGCCGACCACTGTGATGACTGTTTCGCCTGCCATGCTCAGGCCCCCTGTCCCTGGATCACGCCGTCAGCGTCAACGAGTTCACCTTCGACGTATTCGTCATCGTTCGGGAACGGTTCATCCTGCACCTGCTGGGGTGCGGGAGATTGCGGGGGCGCTACCGGAGGCTGCGGGGGGACGGGTTCTTCTGCCGCCACGTCACGTTCTGCCCGCAACTGTT
CTCGCACGTATTCGGAAGAGGTGGGCACCCACTTGGCAAGGTCGTGCGCGGCAGTCTTGGCCCACATGGCTGCAGGGTCGTTCATCCACGGTGAGTACTTCGAATCCGCGCCCTGTGAGGATGCTTTCGCCCGTTTGATTCGTTCCTGATTGACGATGACAACCTTGGACACGGCGCCGTCTTTCATCAGTGCGTAGGCGTATGCGAGTCTCAGTTCTCCGCGGTCGCTGGCCATCCAGTCGATTTCATGGATCGGTTTCGCGTCTCTGCCCGGCACGTAGTCGAACTTGTCGTTCGCATGCACAGTCTCAACGATCACGGAGGACACGGCACCGGCCCGGTACATGAGCTCGATCTCACCCTGGTAGCCGGTGATGCCAAGGATCTCGTTGCGGCCCTTGTTCTTCCTCGGGGTGAGGTAGTACTCCTCGGTTCCCGGCTGGAGTCCCTTCTGCGCCGCCTCGGACAGTGCGTTGATGAGTGTGTGCGGCGACTCAGCGGCGGCCTGCAACAGTTGCGGGTTGCGGCGGATTGAAGCGAGCGCTGACGACATCCAGCCCGCGCCCTTCTCCTGCAGGTGGCTGGGGAGGGTGGACACAAGTTCACCCTTGTATGCCTTCATGGTCTGCTGGACTGCCTGCACCATGCTGGTGCCGGGATTCTGCTGGTTCATGCTGCTGCACTTCCTTCTGTGATAGCGGTGGATTCGGTGAAGTCGATCGATTTGAGGCTGGGCGGGAACACGACATACGGTGTGCCGTCTCCGCGTGCTTGCCTGCGGGCGATCTTGTGTCCCTCGAACAGGGCGTCTTTCGCGCCCCCCATGAGGTCAGTGAGTTGGATCTTCAGGAGGTTCGTGCGACGCTCCAGTGACTCCTTGTCGAGCACCGCGTCCTGCACGGCGAGCGCGAGCCTGAACGGCACTTCCACCTCTTCGAGTTCGATATCCGGGTGGAGGTAGCGAATAGCGACATAGGTCTCCATGTGCCCATCGAGAGCATCAATGGACGGCTTCTCACCAGCCCACAGGGACTCCATGAACACACGCCCCTCCCCCACCAGACGGGCAAAATAGTCGGCGTTGAACTCGACCCGATACGCCTTGAACTGCAGGCCAGAGAACAGCGCCGCGACGTGGCAGACCTCGAGCCCGAAAACGCCGAGCTGCCAGATCACCTGGTCGTAGTAGTACGGCGGGATCTCGTCCGTGCCATCTTCGCCCCACTCCCACGAGTTGTTCGCAGTCTTGGCTTCGAGGAGTTCGGGTTCGCGTCCGTCTCCGTAGCGGACGATCCGGTCAGGTGTTGCGGCCACCCATTCCCACAGTGGGTGCCGGTACATGGGCGAGCGGTAGAGTTTCGCGTCCGGGTGCATTGATGCCCACCATTCGCAGACTGCCGGTTCAAGAAGGGCTCCTCGCTGCAGTTCGTCGTTCTCGACCTGCGCGGGAACGGTACCGTTCATGCGGTGCCACAGCGAGAACCGCGATTCGTACGGTGAGTGACCCACGACGGCTGCGATCTTGCTGGCAGTCATGTACTGCATCCACTGGTCGGAACCAGGGTCGATGTAGCCGACGCGTTTGCCGGTGATGCCTTTCATGATTCGTCCTTTTCTTCGTAGTACCCACAGTGCGAGCATCGAATGTGGTCGGGTATGTCGTAGGGGTCTGTGATGCGGTTGTCGACCGCTAGGCACGCCTGGCATTCGTAGACCGCGTTCATGCCGTCGTCTTCGTTCCGGCTAACGGGTCGTGCTCTCAGGAGGTCCATGGTGTTGACTCCTTCGTGATGGCGACGAGGGCTGCGAATGCGGTGAGCATGGTGAGCAGGCCGAGGCCGTTGAGGGTGATGGTTTGGGTGTAGCTGTAGAAGTACGCGGCCATGGAGGTCACGAGGATCGGCGTGTAGATCCAGGCGATGCGCCGGCGGCGATGGACGGGTAGGCGACTCATGCGGCACGTCCCGGATCCCACGGGCCTGCCGCACCACGGTCGTGTG
CGTGCTGTTGTTCGAGCACTGCGTCCTCGACGTCGTTGAGGAACTGTTCCCCGGCACCGTCGGCGAGGACACCGGTGACCTGTTTGCCTTTGACGGTGAGGAACTCGACGTGGAGTTCGCCGTCAATGCAGTTGACGGTCGCGGTCTTCGCGTTGAAGTGCATCATGCTGCGGCTCCTTTCGTTTCGTGCCGGTCGAGCCATGCGTCGACAGCGGACTGACGTGTACCTCTGGCGCGAGGTCTCCATGAGGTGCCGACTTCGATCCATGCGGGTACTCGATCTCCGCCGTCTCCGCGGTCGAGGGCCCGGTCTGCTTGCAGTTGGTAGACGCTGTAACCGATCTGTTCGGCGACTTCTTTGTCAGTGAGGAGGCGGTTCATTGCTTGTCCTCCGGGAGTAGGGACCAGGGGAAGTTGGTCACGTGGTCGTTTGTGACGAATGTGCCGCGGTGGTTGTTCCGGTCCGTCTGGTACTGCATGGCCCCGATCAGGGTCAGGGATGCGAGTCCGAGGACCACGACGGCGGTGGTGATCAGGAGGAATCCTGTTTGCGGTTTCATTTGTGGGTCTCCTTATGGAGGGCGATGGCATCACCGCTAATTTCCAGTGGCCAGTCCTTGTCGGGGTGGCGGTTAAGGTCGCGGACGCTGACGTCGGACATGACTGAGAAGTCTCGTCTGGCGTCGTCGGCGTAGGTGGCGAGGAACTGCAGGCGTTGCCGTTCGTCGCAGGTCATGGTGTGGAGGTCGATGACGTCCTGCACGGTCTGGCCTGGCTCGAGTGGGATGCAGGCTCCGCAGCAGTCGAACGACACAGCAGTCACGACCAGTCACCGGCCTTATAGCGGGCGACCTTCTCGGCGCTTTTTTCGGCCTGGCGCGTCTCGTAGTCGAGGACCCTCTCGCGCTGCTTTTCGACGAGCGCACGCCAGTGTCGTCGGGGGTCCTCGTTCCATTCGTCGAGAACGAAGTCCATGGCTTCTCGCGCTTCTCTCGCGTCGATGTTGTCGTCGACCTTGACCCCTGCCAGGTCGACTCGCGTCCCGTCGATGTTCACTTCGAGGTTGACGGTTCCGCCGTTCGCGAAGACGTTCGACCAGGTGGCCAATCGGGCGACCGGCTCATCCTCCGGCGAAGGGTTCTTGCTCGCGATCGTGAACTGATTGGACGACTTGATGACGTCGCCGACTGAGGTGGGGATTTTGCGGTAGTCGATGCTCATGCCGCAACCTCCGTGTTCTTGACGAGGCGGGCGATCGCTGACGCTCCAGCCGGGGTGATCTTGAGCGTCCACATGACCTCGCCACGGAAGCGTGGGGCATCGTTCTCAAGTCGGCGATAGAAGTAGAGCTTCTTGTGCGAGTACTCCGAGTACCGCGTGATCTTGACCTTCTCTTGCCGCTTCTCCGACCAGCGGGTGGTCTCTTCGGCATAGATCCAGCCTTTGGCGATCAGCAGCTGTCGCAACGCCTTCTCGGTCATGTCGAGGCTGGATGCGACAGTGCGAAAGAGGATCTTGTCGTCATCGGCGGCGAACGTGTCCACGTAGTCGACCTTCGGCGCATCCTCGGCGACCTTCGCTTCGAGAGCCTTCACCTTGCGGTTGGTGATCTGCAGAGCCTGCAGTACGATTTCGTCCTCGGTGAGGGCGGGAGTCTTGTTCTGTTCCATGAGCATCTGTCGCATGGCGTAGAAGTCCTGGACCAGCTTGAGTTTGAGCTCTCGGACGCTCGGGGTGTTCTTGAGGTATGTGATGAGCAGAGTTCCCTGCTGCTCATTGAGCTGGGCGACCTCTCGGCGCTGGACTCCGCCGGCCGTCTGGAGGGGTGCAATTTCAAATGCGACCCTTCCGAAGCTCTCAAGGTCCGCGAGATTCGCTCTCACGAGCTGGATGACTCCGCGGTGCCCGTTCCCGGTCTTATCTGCGATCGTGGTGGTTGAGACTCGCGGCTCCCCGTCAATGACCTCGACGAGGCTGGTATCATTGGCGTTAGGCATTTG
TTTCACCTTCCTGGTGTCGTGGCCCTCGCCCTGTCCGGCGGGGGTCTTTTCTTTGTCCTGGTTCATGTCGGCTCCCTATGCGGCCATATCGCTAGATCTATCGTCTAGTCTGTCGGGCAAAAAACACACTTCCATCGGGATCTTCCACGCGTTCTCGATCTTTCGTGCGGTCGAGAGGTTCACGTGCGTCTTGGGGATCCTGCGCCCGTCTTTGGTCGTCTCCCGCCCGAGGCGGAGTGACTGAATGAGCGTGTGTGAAACGCCTGCCCGCATTGCTGCTTTGCGCTCAGATCCGAGCTCTTGGACTACCTCTGCCAGGTACTGCTTGAAGACTCTGCGGTCTACCTTGTATCTCATCTTCACCTCCCTAGTTGGTATATCTAGCGTATAGCGCCTTGCCAGGATTTGGCAAGCCCGAATCTGAAAAATCTTTATATCCCCCTAGGGCATAGGAAATTACGGGCCTGTAGTTCCGTGCCAGTAGTGGCAAGGGTAGATTTGTGTCGACTGCACCCTCTGCCCGAAAGTGATTGTTGTGAGCGATCTTTCCGACCGTCTAGCGGATGCAAAGCGGAAGCTGGACATCTCTACTCAGAAGATGTCTGACCTTGCGCAGGCCGCCGGCTACCAGTTGTCGAACTACTCGGCGACCGTGTACACCAACGGCAAGCACCCCGCCAAGGCATCGGCGGCAACACTCGAGGCACTGTCCTATGTGCTCCGTGTACCGCTGGCTGAGCTTCGGGAACTGGCCGGACTACCCCAGCACCACGGGAAGTTCGAGCCGCTGCCCGAGGCCGACACGCTCACTGCTCCACAGCGTGCCGCTGTGAACGAAGTCATCCGCCAGTTGGCGGAAGCGAACGCGAAAGCAGGTGATGGCAGTGGAAACGCCACCCCCACCAAACGCGCCGGAGTGAGTCCGGCACCAGAGGACGACGGGCTCGGCTCATTCGGAGGCCGTGCCCGTGGAGACCTCGACCACGAATCCGTGAACGACGGCGCCGGCGACAACGTGCACGAACTGTTCACTCCCCCACCCGGGCGCGAGGACACTGCCGCCTACGATGCACCCAACCGGGGCAAGCAGCAGAAGGAAGAGTCGGAGCAGCGTGGGGAGGAATCGCAAGATCCGGAGGACTGGGAGTGAGTAGACCCAAGCGGCTACACGCATTTGTTGACGAATCCGGCCAGCGCTCTCCGTCAGAGAAGTCGAGCGATTACTTCATCATGTCGGCGGTTCTGGTTTTTGACTACTGCCTTGACCTCGCTCGAGAACACCTCGTCGAGCTAAAGACAGCGACCAACCGAAAGCCGGAGCATCTGCTGCACTGGAGCAAGCTGAAGTCGCACCATCGATCAAAAGTCTCAGAAATGATGGGATCGGCAAGCGGCTTCATGGGCTATGTCTCTGTCGTGGCCTGCAAGCGAGTCCTTGCCGATCAGGTCACGATGGAGATGAACTTCGGAGGTGAGAAGTACAAATTCTCAGCGTTCACTCCACTGTCCGAGGACGAGGCATATCTCAAGACTTATCAGTATCTTCTCGAACGAATCTCGTGGGTAGCTTCGCGAAGCCAAACCTCTGCAGATATCACGATTGAACACACTATTCGATTCAAGCGAAAGACTCTCGAGGAATTCGAATCGAGAATCAAGAAAGACGCCAGTTGCTCAGCCCATTGGGGCAGTCTCCCTGATGGCGCGAAACTGCGATCCAAAAAGGACGAAGATCTCCTTCAGCTTGCAGATCTCGTAGCGAGTGGAATTGGCGCGGCCTTCAATGGGCATCCGAGAAATGGAGTCGACACTTCGCATGTAAGCAACATGGCGCAGGCAATGTGGAGGGGCCCGAAGAAGAATAAGCTCACGACATACGGGCTGAAGATGCACCCATGGAATGAGGTAACGAGATCTCTTCATCCATGGCTGCTTGAGATATAACGAAGCGACCGCCTTACGCTGCATGTACAGGTGCTGGGTTGAACCCCCGGCAGGCG
ATCGCTCCGTGGCTTCAGTGTCACACGGCGGGACATTGGGAAACAAGGGGAAACACCGAACTATGCGCAACTTCACGCAACTATCGGTAACCACGTCCGTCGTGTCAGCCCCTCCCCATATGGTCTTGAGTCATGCACCATCCATGGAGAGAACTACGTGAACGCGGGGACGGCGTTGTCCTGCACTTCACCAGGTTCAACGACACTCGAGTAGCCGCCACAAACGGCAACAACGCCATCTGGCTCGACCAAGATCTACTGCAGGTCGAGCGCCGCTGCGCCATCCAGCACGAGCAAGCGCACATTGACCTCGGGCACACGAACTGCGACGACCCGAGAGAAGAACAAGCAGCACGCCGACTCACAGCCCAGAAGCTCATTCACTGGGATGCCCTCGTCGACGTCTTCAAATGGGCCCACACAGCCTCTGAGGCGGCCGACGAACTATGGGTGACACCGGAAGTCCTTGAAGACCGTCTCCGGTTCCTCCGCCCCCACGAGAAGCACCTGCTCCGACTCATCGGGGCAGCACGACAAGGATGACACCATGGCCAGCATTGAACCGTACGAGACTGCGAAGGGGCGCCGGTACAGGGTTCGGTACACCAAGCCCGACCGGAAACCCACAGATCGCCGTGGGTTCCGCACGAAACGCGAGGCCCAGATCTTCCTCAGCACAGTCGAAGTGTCGAAACTTGAAGGCACCTACGTGGACCCGACGAGGGGAAAGATCACTGTCGAAGAGCTGGGCACGCACTGGCTGAAAACCGTCAGCGCCAAGGAATCATCGAAAAGGGTGTACGAGACCGCGTTGCGTATTCACATCTACCCGCAGTTCGGCGCACTACCCGTGAACGGGGTATCCCGATCGAACGTCAGAGAGTGGGTTTCGAAGCTGTCGGAGAAGCGAGCAGCCAGAACCGTACGCAGGGCACACTACGTCCTCCAAGCAGTTCTTCAGATGGCTGTCGACGATCACCTGATCCCCCGCAACCCCGCCAGCGGTGTCAAGAACCTACCGTCGCCGACTCACCGGAAGAACGTATACCTCACCTATGAGCAGGTAGAGAAGGTGGCGCAGGCGGCCGATGAGCACCACTTGAGAGCTCAGCGCGGCCGGTACGGGTTCGTCATTTACATTGCCGCATACATGGGATTGCGGTGGAGCGAGATCGCGACCCTGACGCCGGAGGACGTCGACTTGGAGGACCGGCGCGTGCGGGTTCGCGCCGAGGTGTCGAAGAACAGCAGGGAACGGTCAGTCGGGTATCCGAAGTTCATGCACGATCGGTTCCGAGCTCTCACGCTGAAAGCTGTTCCCGGTGGTCTGCTCATTCCCGATCTGGCTGAGCCGAAGAATCAGGAGTCGTGGTTCCTGTCTGTGAGGAAGAAGGCCAAGCTTGACGAGGGGTTCGTGTTTCACGATCTGCGCCACTCCGCGGCATCGTTTGCGGTATCCGCTGGCGCTTCGGTGAAGGTCGTGCAGAACATGCTCGGTCACTCCTCCGCAGCCATGACCTTGGACGTGTACTCGGATCTGTTCGACACCGATGTGGATGATGTTGCCGAGCGAATCGACGAGGCGCGAAAACGAAACATGGGCAAAACGTGGGCAGACGGCGATTCGATCCCCGCTGGGGAGTGAATGCAGAAAAGCGGCCCACCCAAACATTTCTGCTTGGATGGGCCGCTGACCTAGTGTTTCAGAAGTCCCAGTCGTCGTCTTCGGTGTTCTCGGCCTTGCCGATGACGTAGGACGAACCCGAACCGGAGAAGAAGTCGTGGTTCTCATCGCTGCCGGGCGAGAGCGCCGCGAGGATCGCGGGATTGACCTTCGTGACCTCCTTGGGGAACATCGCTTCGTAGCCGAGGTTCATCAGCGCCTTGTTGGCGTTGTAGTGGAGGAACATCTTGCAGTCCTCTGACAGCCCCACCGGATCGTAGATGTCGTGGGTGTAGGCGACCTCATTCTCGTAGAGCTCGAACATGAGGTTCATCGT
GTAGTCCTTGAGCTCCTGCTTGCGCTCCTCGGACTGCGACTCCAGACCCCTCTGGTACTTGTAGCCGATGTAGTAGCCGTGCACGGCCTCATCGCGGATGATCAGACGAATGAGGTCAGCCGTGTTCGTCAGCTTCGCATGGGCCGACCAGTACATCGGCAGGTAGAAGCCGGAATAGAAGAGGAAGGACTCGAGCAGCGTCGAGGCGACCTTGCGCTTGAGCGGATCGTCACCCCGGTAGTAGCTGAGGATGATATCGGCCTTGGACTGCAGGTACTTGTTCTCACGCGACCAGCGGAAGGCCTCGTCGATCTCCTTCGTCGAGCACAGGGTCGAGAAGATCGAGGAGTACGACTTCGCGTGCACGCTCTCCATGAACGCGATGTTCGTCAGCACTGCCTCTTCGTGCGGGGTCACCGCGTCCGGGATGAGCGAGATCGCACCGACGGTGCCCTGGATCGTGTCGAGCAGGGTCAGACCGGTGAAGACGCGCATCGTCAGCGTCTTCTCATCGTCGGTCAGAGTCGCCCAGGACTGGATGTCGTTGCTCAGCGGGACCTTCTCCGGCAGCCAGAAGTTGCCGGTCAGGCGGTCCCAGACCTCATCATCGATCGGATCGACGACGCGGTTCCAGTTGATGGCATCGACGTGCGATGCCAGAGTCAGATTCTCCAACGCTTTCTCTCTTCCCAAAGTGCTGTTCGTCGTGAGTACGATTTGTCTGCGCGAGTGTCTGCACGGCAGACGAAAGTCAAATGCGCGCCCGGCCGGTGGCGTGGGGCCCGCATGAAGCCGGCGGACCGATTCACATCGATCCGCCGATTCCATTGATCAGAGCATGCAGGAGACGCAGCCTTCGACCTCCGTGCCCTCCAGAGCGAGCTGCCGGAGCCGGATGTAGTAGATGGTCTTGATTCCCTTGCGCCAGGCGTAGATCTGCGCCTTGTTGATGTCACGTGTGGTGGCGGTGTCCATGAAGAACAGCGTGAGCGACAGGCCCTGGTCCACGTGCTTCGTGGCCTCGGCGTACGTGTCGATGATCTTCTCGTAGCCGATCTCGTAGGCATCCTGGTAGTACTTCAGGTTGTCGTTGTCCATGAAGGGGGCCGGGTAGTACACGCGACCGACCTTGCCCTCCTTGCGGATTTCGATCTTCGAAGCCACCGGGTGGATCGACGAGGTCGAGTTGTTGATGTACGAGATCGAACCGGTCGGCGGCACCGCCTGGAGGTTCTGGTTGTAGATGCCGTGCTCGGCAACAGCGGCCTTGAGCTCGCGCCAGTCATCCTGAGTCGGGATGTGCACGTTGGCTTCCGAGAACAGCTCGGCCACCCGAGCCGTGCGCGGCTGCCAGACCTCGTCGGTGTACTTGTCGAAGTATTCGCCGGTGGCGTACTTCGAACGCTCGAAGCCGGCGAAGGTCTCACCGCGCTCCTTCGCGATCTCCATCGACGCCCGCACGCAGTGGTAGGCGACGGTGTAGAAATACATGTTCGTGAAGTCCAGACCCTCATCGGAACCGTAGTAGATGTGCTCCCGAGCCAGGTAGCCGTGGAGATTCATCTGTCCCAGACCGATGGCGTGCGACATGTCGTTGCCGCGGGCGATCGAGGGCACGGACTGGATATCCGAGGTCTCGGCGACGGCAGTGAGCCCGCGGATCGCGGTCTCGATGGTCTGACCGAAGTTGCTCGAGTCCATCGTCAGAGCGATGTTGAGCGAACCGAGGTTGCAGGAGATGTCCTTGCCGACGACGTCGTAGCCGAGGTCATCGTCGTACTTGCTGGGTTCGGAGACCTGGAGGATCTCCGAACACAGGTTGGACATGATGATCTTGCCGTCGATCGGGTTCGCCTTGTTGACGGTGTCCTCGAACATCACGTACGGGTAGCCGGACTCGAACTGGATCTCGGCCAGAGTCTGGAAGAACTCGCGGGCGTTGATCTTCTTCTTCTTGATCGCCGCGTTGTCGACCATCTCGGTGTACTTCTCGGTGACGT
TGATGTCGGAGAAGGGCACACCGTAGACGCGCTCGACGTCGTAGGGGCTGAAGAGGTACATATCCTCGTTGCGCTTGGCCAGTTCGAAGGTGATGTCCGGGATCACAACGCCCAGGGACAGGGTCTTGATCCGGATCTTCTCGTCGGCGTTCTCGCGCTTGGTGTCGAGGAAGTTGTAGATGTCGGGGTGGTGGGCGTGCAGGTACACGGCACCAGCACCCTGACGGGCTCCGAGCTGATTGGCGTAGGAGAACGAGTCCTCGAGCAGCTTCATGACGGGGATGACGCCGGAGGACTGGTTCTCGATCTTCTTGATCGGGGCACCGGACTCACGGATGTTCGTCAGCGCGAAGGCCACACCGCCGCCGCGCTTGGACAGCTGCAGAGCGGAGTTGATGGAGCGGCCGATCGACTCCATATTGTCTTCGATGCGCAGCAGGAAGCAGGAGACGAGCTCACCGCGCTGCCTCTTGCCGGCGTTGAGGAACGTCGGGGTGGCCGGCTGGAAGCGACCGGAGATGATCTCGTCGACGAGCTGGGTGGCCAGCGCCTCGTCTCCGCGCGCCAGGAAGAGAGCGACCATGACGACGCGGTCCTCGAAGCGTTCGAGGTAGCGCTTTCCGTCGAAGGTCTTCAGCGTGTAGGAGGTGTAGAACTTGAAGGCGCCGAGGAAGGTCTGGAAGCGGAACTTCTGATCGTAGGCGCGCGTCGACAGCTGCTCGATGAACTCGAAGGAATACTGCTCGAGGACCTCGGGCTCGTAATAGTCTTTCTCGACGAGGTAGTCGAGCTTCTCCTTGAGGTTGTGGAAGAAGACGGTGTTGTTGTTCACGTGCTGCAGGAAGTACTGCCGGGCAGCCTGCTTGTCACGCTCGAACTGGATCCTGCCGTCCTCGTCGTACAGGTTCAGCATCGCGTTGAGCGCGTGGTAATCCAGATCCGTCTCTTCGCGGATGATGTCCTCTACATTGCGGTCTTCAGCGAGCTCGCGCACAGTTTGTCCAATCCTTGGTTTACAGCGTCGACATCCTCCTGAGTGCCCAGGAGTTCGACTCGATACATGAGGGGTACTTGGCACTTTGCCGCGACCTTGACGGCCGCTCGGCAGAAGTCCTCGCCGAAGTTGGTGTTTCCGGCGCCGATGACGCCGAGCAGATGGCGCCGGTTCTCTTCGACATTGAGGAATTTGATGACTTGCTTGGGAACGGCCCCGCGATTGCGACCGGCCCCGTAGGTGGGGGTGACGAGAACGAATGGCTCATCCACTCGCAGCGTCGGCTCCTTGGTCAACGTCGGCAGCCGCACTGCCGGGTGATGCAGCTTCGCAACGAAGCGGTGCGTGTACTCGGAGTTGGCGGAGAAGTAGACCAGAAGCATTGTCAGGCCACGACGGACGCAGACTGTTCGCCGGTCAGGGTGGCGATCTTGTCCGGACGGAAGCCCGACCAGTGGTCGTTGTCGGTCACGACGACGGGAGCCTGCATGTAGCCCATCGCCTTGACGACATCGAGGGCGTTCTCGTCGACGCTGAGATCGACCGACTTGTACTTCAGGCCCTTGGCGTCGAGAGCACGGTAGGTGGCGTTGCACTGGACGCAAGCTGGCTTGGTGTACACGGTGATCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP050153|2737669:2755216|2749723_2750134_+|WP_167198066.1|DBSCAN-SWA MHHPWRELRERGDGVVLHFTRFNDTRVAATNGNNAIWLDQDLLQVERRCAIQHEQAHIDLGHTNCDDPREEQAARRLTAQKLIHWDALVDVFKWAHTASEAADELWVTPEVLEDRLRFLRPHEKHLLRLIGAARQG >NZ_CP050153|2737669:2755216|2742530_2742848_-|WP_167198025.1|DBSCAN-SWA MADELFFPSTPDRAAQAVATCWTCPLRAICASRSLEEERGKPATERFGIRGGLTATQRAQLDPGKICPDCGSPVITKSPHCDDDREIHRLKHRREYERERRKDAA >NZ_CP050153|2737669:2755216|2738946_2739639_-|WP_167198004.1|DBSCAN-SWA MPWGKIDDKLYSSPKWMTVSKGGKALWVSALSWCMAQLTDGAVTKQTCFMLGASTKDARSLVEAGLWDETPNGYQFHDWLDYQPSRQQVLAERESARQRQQRARDKASSSRKRHGVTNTVTHGEVQPLVTGDVTVPPTRPVPTPIESATGRKRPARPLSPDWVPTDTHRAKASEHRINLPAEVEKFRNWAESKDERKANWNAAFTNWLIRTAEQQPRPPQTDLWSKEGPF >NZ_CP050153|2737669:2755216|2754554_2754968_-|WP_167198078.1|DBSCAN-SWA MLLVYFSANSEYTHRFVAKLHHPAVRLPTLTKEPTLRVDEPFVLVTPTYGAGRNRGAVPKQVIKFLNVEENRRHLLGVIGAGNTNFGEDFCRAAVKVAAKCQVPLMYRVELLGTQEDVDAVNQGLDKLCASSLKTAM >NZ_CP050153|2737669:2755216|2748737_2749535_+|WP_167198062.1|DBSCAN-SWA MSRPKRLHAFVDESGQRSPSEKSSDYFIMSAVLVFDYCLDLAREHLVELKTATNRKPEHLLHWSKLKSHHRSKVSEMMGSASGFMGYVSVVACKRVLADQVTMEMNFGGEKYKFSAFTPLSEDEAYLKTYQYLLERISWVASRSQTSADITIEHTIRFKRKTLEEFESRIKKDASCSAHWGSLPDGAKLRSKKDEDLLQLADLVASGIGAAFNGHPRNGVDTSHVSNMAQAMWRGPKKNKLTTYGLKMHPWNEVTRSLHPWLLEI >NZ_CP050153|2737669:2755216|2745968_2746151_-|WP_167198047.1|DBSCAN-SWA MKPQTGFLLITTAVVVLGLASLTLIGAMQYQTDRNNHRGTFVTNDHVTNFPWSLLPEDKQ >NZ_CP050153|2737669:2755216|2741332_2742043_-|WP_167198019.1|DBSCAN-SWA MGWKRVAKAAGITSSTLYPLLYGRGGTDPRPIRKQISKALEAKLLAVTPDMADGSIVDNLGSVRRLQALAAIGWSQHRLAREFDMFPGNFGKVIHGERGGIRVSTAKQIGEFFNENWSTPPVAATRFEQAGITRAKREAAAKGWVTAAAWDDIDDPTEVPKADIGNEVTDSRAASTLEKLDRLELLARDGYGDNEDTYVRAGWSSRASAWRALQRAGELEHVDRLKRNDQARQIAS >NZ_CP050153|2737669:2755216|2742888_2743407_-|WP_167198029.1|DBSCAN-SWA MAGETVITVVGNLTSDPELRFTPNGAAVSNFTVASTPRIFDRQRNEFVDGETLFLRCSAWKELGENSAESLQRGTRVIVQGRLKSRSFETKEGEKRTVMELDVDEVGPSLRRATAVVTKTQGGGGSSGQQSGGNFGGQQSQQGGGWGNQQQAQQSPAWGSNPGAADADLPPF 
>NZ_CP050153|2737669:2755216|2746410_2746812_-|WP_167198053.1|DBSCAN-SWA MSIDYRKIPTSVGDVIKSSNQFTIASKNPSPEDEPVARLATWSNVFANGGTVNLEVNIDGTRVDLAGVKVDDNIDAREAREAMDFVLDEWNEDPRRHWRALVEKQRERVLDYETRQAEKSAEKVARYKAGDWS >NZ_CP050153|2737669:2755216|2740634_2740976_-|WP_167198013.1|DBSCAN-SWA MKPNSRYVPAAPYRAALEELSAKHSQEELARRLRVTPRTIWRALSSESVKISRTFAEAILFEAGMGPEPEPVQERAEDPITWPEVAEFAKTTEGLEFIERCWRPAAYRTRKAA >NZ_CP050153|2737669:2755216|2751294_2752266_-|WP_167198072.1|DBSCAN-SWA MENLTLASHVDAINWNRVVDPIDDEVWDRLTGNFWLPEKVPLSNDIQSWATLTDDEKTLTMRVFTGLTLLDTIQGTVGAISLIPDAVTPHEEAVLTNIAFMESVHAKSYSSIFSTLCSTKEIDEAFRWSRENKYLQSKADIILSYYRGDDPLKRKVASTLLESFLFYSGFYLPMYWSAHAKLTNTADLIRLIIRDEAVHGYYIGYKYQRGLESQSEERKQELKDYTMNLMFELYENEVAYTHDIYDPVGLSEDCKMFLHYNANKALMNLGYEAMFPKEVTKVNPAILAALSPGSDENHDFFSGSGSSYVIGKAENTEDDDWDF >NZ_CP050153|2737669:2755216|2752422_2754585_-|WP_167198075.1|DBSCAN-SWA MRELAEDRNVEDIIREETDLDYHALNAMLNLYDEDGRIQFERDKQAARQYFLQHVNNNTVFFHNLKEKLDYLVEKDYYEPEVLEQYSFEFIEQLSTRAYDQKFRFQTFLGAFKFYTSYTLKTFDGKRYLERFEDRVVMVALFLARGDEALATQLVDEIISGRFQPATPTFLNAGKRQRGELVSCFLLRIEDNMESIGRSINSALQLSKRGGGVAFALTNIRESGAPIKKIENQSSGVIPVMKLLEDSFSYANQLGARQGAGAVYLHAHHPDIYNFLDTKRENADEKIRIKTLSLGVVIPDITFELAKRNEDMYLFSPYDVERVYGVPFSDINVTEKYTEMVDNAAIKKKKINAREFFQTLAEIQFESGYPYVMFEDTVNKANPIDGKIIMSNLCSEILQVSEPSKYDDDLGYDVVGKDISCNLGSLNIALTMDSSNFGQTIETAIRGLTAVAETSDIQSVPSIARGNDMSHAIGLGQMNLHGYLAREHIYYGSDEGLDFTNMYFYTVAYHCVRASMEIAKERGETFAGFERSKYATGEYFDKYTDEVWQPRTARVAELFSEANVHIPTQDDWRELKAAVAEHGIYNQNLQAVPPTGSISYINNSTSSIHPVASKIEIRKEGKVGRVYYPAPFMDNDNLKYYQDAYEIGYEKIIDTYAEATKHVDQGLSLTLFFMDTATTRDINKAQIYAWRKGIKTIYYIRLRQLALEGTEVEGCVSCML >NZ_CP050153|2737669:2755216|2745541_2745757_-|WP_167198041.1|DBSCAN-SWA MMHFNAKTATVNCIDGELHVEFLTVKGKQVTGVLADGAGEQFLNDVEDAVLEQQHAHDRGAAGPWDPGRAA >NZ_CP050153|2737669:2755216|2746147_2746414_-|WP_167198050.1|DBSCAN-SWA MTAVSFDCCGACIPLEPGQTVQDVIDLHTMTCDERQRLQFLATYADDARRDFSVMSDVSVRDLNRHPDKDWPLEISGDAIALHKETHK >NZ_CP050153|2737669:2755216|2739639_2739804_-|WP_167198007.1|DBSCAN-SWA 
MTMENLTAIEATKRISLAQSILGHRLPNARTVDLAIAALEGAQIDELVGLEEAA >NZ_CP050153|2737669:2755216|2746808_2747597_-|WP_167200987.1|DBSCAN-SWA MKQMPNANDTSLVEVIDGEPRVSTTTIADKTGNGHRGVIQLVRANLADLESFGRVAFEIAPLQTAGGVQRREVAQLNEQQGTLLITYLKNTPSVRELKLKLVQDFYAMRQMLMEQNKTPALTEDEIVLQALQITNRKVKALEAKVAEDAPKVDYVDTFAADDDKILFRTVASSLDMTEKALRQLLIAKGWIYAEETTRWSEKRQEKVKITRYSEYSHKKLYFYRRLENDAPRFRGEVMWTLKITPAGASAIARLVKNTEVAA >NZ_CP050153|2737669:2755216|2745753_2745972_-|WP_167198044.1|DBSCAN-SWA MNRLLTDKEVAEQIGYSVYQLQADRALDRGDGGDRVPAWIEVGTSWRPRARGTRQSAVDAWLDRHETKGAAA >NZ_CP050153|2737669:2755216|2737669_2738947_-|WP_167198001.1|DBSCAN-SWA MNEPTSTDEAELSVLGSILMTGGKVLEEITLTADDFASPRNARAFELMQALWAKGQAVDLVSTGSAIATSNAETKRLLEPAYLAKAMHNTPTSALIGQYQSITREHSIRRRLINATTAISQAVNEVDDINQVIEVARKAIDDAGNVSTTEIRSMAEVEAETIAELNAPPRFTVTPWQDLNHLIGGWRPGALYVIGARPGSGKTLVGLQAAVNMLHRGSVLMCTLEMSQSEIHKRVFSQLLHIPLGNILDSNMTPNEWERLTNRKATNRDRFYVDDNPAQTVEGIRRQARTIQRRHPLSMIVVDYLQLMESTGKSDKKRHEEIARWTRALKVMAKTFDVPVLVLSQLNRESARGGKPTLADLRESGAIEQDADVVLLLHRDEESPDELNMLVAKNRHGMREAITLQWEGHFASVSDRHWRPAIEAA >NZ_CP050153|2737669:2755216|2744260_2745193_-|WP_167198035.1|DBSCAN-SWA MKGITGKRVGYIDPGSDQWMQYMTASKIAAVVGHSPYESRFSLWHRMNGTVPAQVENDELQRGALLEPAVCEWWASMHPDAKLYRSPMYRHPLWEWVAATPDRIVRYGDGREPELLEAKTANNSWEWGEDGTDEIPPYYYDQVIWQLGVFGLEVCHVAALFSGLQFKAYRVEFNADYFARLVGEGRVFMESLWAGEKPSIDALDGHMETYVAIRYLHPDIELEEVEVPFRLALAVQDAVLDKESLERRTNLLKIQLTDLMGGAKDALFEGHKIARRQARGDGTPYVVFPPSLKSIDFTESTAITEGSAAA >NZ_CP050153|2737669:2755216|2740260_2740638_-|WP_167198010.1|DBSCAN-SWA MTELAIDVHKAYWISDNDRLHWADKAKRTKHIRQLARYTAKQQKLVLPTPVIVIAEIGFRTGGRADPGNASLAVKACLDGLTDAGAWPDDDSRHVLGPDYRRGPKAPDKDRYRIHLKFIPQHVPF >NZ_CP050153|2737669:2755216|2740979_2741336_-|WP_167198016.1|DBSCAN-SWA MIAGQLSIFDELDDEREPAASTTCPHCHHRWTLSKSGLPTLADHLKGSGHWPYGGSIAGKCENQAISLFQLGNQQHMGFTENPPIYTTDLLGAILRAKQHGCTDTQIKAVLKTVTPRK >NZ_CP050153|2737669:2755216|2750138_2751236_+|WP_167198069.1|integrase|DBSCAN-SWA 
MASIEPYETAKGRRYRVRYTKPDRKPTDRRGFRTKREAQIFLSTVEVSKLEGTYVDPTRGKITVEELGTHWLKTVSAKESSKRVYETALRIHIYPQFGALPVNGVSRSNVREWVSKLSEKRAARTVRRAHYVLQAVLQMAVDDHLIPRNPASGVKNLPSPTHRKNVYLTYEQVEKVAQAADEHHLRAQRGRYGFVIYIAAYMGLRWSEIATLTPEDVDLEDRRVRVRAEVSKNSRERSVGYPKFMHDRFRALTLKAVPGGLLIPDLAEPKNQESWFLSVRKKAKLDEGFVFHDLRHSAASFAVSAGASVKVVQNMLGHSSAAMTLDVYSDLFDTDVDDVAERIDEARKRNMGKTWADGDSIPAGE >NZ_CP050153|2737669:2755216|2748189_2748741_+|WP_167198059.1|DBSCAN-SWA MSDLAQAAGYQLSNYSATVYTNGKHPAKASAATLEALSYVLRVPLAELRELAGLPQHHGKFEPLPEADTLTAPQRAAVNEVIRQLAEANAKAGDGSGNATPTKRAGVSPAPEDDGLGSFGGRARGDLDHESVNDGAGDNVHELFTPPPGREDTAAYDAPNRGKQQKEESEQRGEESQDPEDWE >NZ_CP050153|2737669:2755216|2754970_2755216_-|WP_025778731.1|DBSCAN-SWA MITVYTKPACVQCNATYRALDAKGLKYKSVDLSVDENALDVVKAMGYMQAPVVVTDNDHWSGFRPDKIATLTGEQSASVVA >NZ_CP050153|2737669:2755216|2747666_2747948_-|WP_167198056.1|DBSCAN-SWA MRYKVDRRVFKQYLAEVVQELGSERKAAMRAGVSHTLIQSLRLGRETTKDGRRIPKTHVNLSTARKIENAWKIPMEVCFLPDRLDDRSSDMAA >NZ_CP050153|2737669:2755216|2743409_2744264_-|WP_167198032.1|DBSCAN-SWA MNQQNPGTSMVQAVQQTMKAYKGELVSTLPSHLQEKGAGWMSSALASIRRNPQLLQAAAESPHTLINALSEAAQKGLQPGTEEYYLTPRKNKGRNEILGITGYQGEIELMYRAGAVSSVIVETVHANDKFDYVPGRDAKPIHEIDWMASDRGELRLAYAYALMKDGAVSKVVIVNQERIKRAKASSQGADSKYSPWMNDPAAMWAKTAAHDLAKWVPTSSEYVREQLRAERDVAAEEPVPPQPPVAPPQSPAPQQVQDEPFPNDDEYVEGELVDADGVIQGQGA >NZ_CP050153|2737669:2755216|2745350_2745545_-|WP_167198038.1|DBSCAN-SWA MSRLPVHRRRRIAWIYTPILVTSMAAYFYSYTQTITLNGLGLLTMLTAFAALVAITKESTPWTS >NZ_CP050153|2737669:2755216|2742246_2742534_-|WP_167198022.1|DBSCAN-SWA MSFIKITTRHPGGREVVGITQQANVTASEAFDRRITRARVARQPHDADRSSLTIQLHPTEIVDYSQLLEYAETIEAFDGYQPELPQEFLDLMEGQ |
28 | Gordonia_phage(35.71%) | integrase | attL 2735242:2735257|attR 2753331:2753346 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
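
The keyword note above can be illustrated with a minimal Python sketch that reads one prophage protein header and reports which phage-related keywords it carries. The field layout (`contig|region|start_end_strand|protein ID|optional keyword|tool`) is inferred from the headers on this page and is an assumption, not a documented DBSCAN-SWA format. The function name `parse_header` and the returned dictionary keys are hypothetical.

```python
# Keyword list copied from the note on this page.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


def parse_header(header):
    """Split one protein header into named fields (assumed layout)."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    # Fields between the protein ID and the trailing tool name are
    # treated as keyword labels, e.g. 'tail' or 'integrase'.
    labels = [f for f in fields[4:-1] if f in PHAGE_KEYWORDS]
    # The location looks like '2714391_2720568_-': start, end, strand.
    start, end, strand = location.rsplit("_", 2)
    return {
        "contig": contig,
        "region": region,
        "protein_id": protein_id,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "labels": labels,
    }


if __name__ == "__main__":
    examples = [
        ">NZ_CP050153|2710856:2733531|2714391_2720568_-|WP_167197919.1|tail|DBSCAN-SWA",
        ">NZ_CP050153|2737669:2755216|2750138_2751236_+|WP_167198069.1|integrase|DBSCAN-SWA",
        ">NZ_CP050153|2737669:2755216|2749723_2750134_+|WP_167198066.1|DBSCAN-SWA",
    ]
    for line in examples:
        record = parse_header(line)
        print(record["protein_id"], record["strand"], record["labels"])
```

Headers without a keyword field yield an empty label list, which corresponds to the uncolored proteins in the analysis view.
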
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|