Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP020871 | Xylella fastidiosa subsp. pauca strain De Donno plasmid pXF-De_Donno, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP020870 | Xylella fastidiosa subsp. pauca strain De Donno chromosome, complete genome | 2 crisprs | DEDDh,Cas9_archaeal,WYL,csa3,DinG | 4 | 4 | 9 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020870_1 | 2076836-2076984 | Orphan |
NA
Consensus repeat of NZ_CP020870_1
|
2 spacers
spacers of NZ_CP020870_1
>1.1|2076862|37|NZ_CP020870|CRISPRCasFinder GGAGGGTGCCGCTGCTGCTGATGTGGCCGGTAGCGGT >1.2|2076925|34|NZ_CP020870|CRISPRCasFinder GGGTGGCGGTCTGGGAGTCGATGGTGCCGCCGCG |
DinG |
CRISPR arrays and Neighbor proteins around NZ_CP020870_1
The CRISPR arrays of NZ_CP020870_1 >merge|NZ_CP020870|1|2076836-2076984|CRISPRCasFinder GTTGGTGAGGGTGGTGCCGTCAATCTGGAGGGTGCCGCTGCTGCTGATGTGGCCGGTAGCGGTGTTGTCGAGGGTGGTGGTGTGCAGGTGGGTGGCGGTCTGGGAGTCGATGGTGCCGCCGCGGTTGTCCAGGGTGGTGGTGTGCAGGT >NZ_CP020870|1|1|2076836-2076984|CRISPRCasFinder GTTGGTGAGGGTGGTGCCGTCAATCT GGAGGGTGCCGCTGCTGCTGATGTGGCCGGTAGCGGT GTTGTCGAGGGTGGTGGTGTGCAGGT GGGTGGCGGTCTGGGAGTCGATGGTGCCGCCGCG GTTGTCCAGGGTGGTGGTGTGCAGGT
>NZ_CP020870.1|WP_046418678.1|2068878_2069727_-|DUF769-domain-containing-protein MQRRPLLGVSLLAASLLLAGCSSGPPIDSRTGKPMMAGPWENRDLSLEYFQLDFFGQYTVKHAFINGINIKRCYRGSPPDHVQVVMMTTPSGFTTPGIVVTGGGRPPRPADESDAFIGSQYSNTSNKYDPDSRTVLRNPDGTPQKKVVKGWEFNALCSAEFAGGNYIGFGIRSAASQSIEEKIQSALSRISAPDRDYKSNRFIDAPRTETRWGNPWTWYRAYMPTPVGDGVEIWMTPIGDSGYYITVYLNFIEAARQQNTEDYQRARKLMDGILQSVVIQKQ >NZ_CP020870.1|WP_046418680.1|2068003_2068882_-|hypothetical-protein MTGWVIPWRPLITHVMNRLYIQDSGETYRNTTYLAYTKPTHPLGELVTAGIEKLLEITKIASPASRLQAAAAKELMYNAEDKKYTNPIYIEGHSRGTMMLSNALRVLAADHVFSDFLEIRAYNPAAEGGRLTEAAALVTSNSVDIWSPRKDFVANKIGGYAGDATFHDLREIFQTNYSVHSSGGTAALGSDSNHVDEHKLFSYDGLNINDMNAKRQGRTNGLLQQWQKTPSPENPVATQLTQLQRLLWQSGQWQQQLDTTPGLLTRPTPTTPDAPSARQQQLQQLRQSLTPY >NZ_CP020870.1|WP_046418682.1|2066336_2067164_-|hypothetical-protein MTGWVIPWRPLITHVMNRLYIQDSGETYRNTTYLAYTKPTHLLGELVTAGIEKLLEITKIASPASRLQAAAAKEVMYNASKDNYSNPVYLEGHSRGTMTLSNALRVLGGLDLGDTKLEVLAYNPAAEGGRLTEAAALVTGKPVKTWAPPKDFVANKIGGYDGDATFHDLWEIFQTNYSVHSSGGTAAQFSSHTLNLTAAGNLDAHSAQSTQEQTSSQRHRSASLGTKIGVTGGGPSVSADVARGRGSAGRDSAEGQCYLRGAEVGNQTLHGCRLS >NZ_CP020870.1|WP_010893418.1|2065736_2066090_+|hypothetical-protein MKNTYVIALLAVTTATLLSGCATKRYGRLQPLTSYESRNYNCTQIDLELAKIDAFEQQVTEQAKFSGMSVASFLGDFGIGNTLERNQAIKTAKERRTQLITARSTKGCDNHPQPKQP >NZ_CP020870.1|WP_046418685.1|2065541_2065733_+|hypothetical-protein MKNTYVIALLAVTTATLLSGCATHRHEKTKNDVTERHTEDCAKQQHHKCGTKSKKTHQHQAEE >NZ_CP020870.1|WP_046418687.1|2064137_2065151_+|magnesium-and-cobalt-transport-protein-CorA MTTPQPPQHPSPENPTCVITCAYYDPQNQRHDIHLDRLSEELQCDNSFIWVGLYEPDATVLRKLQQEFGLHDLAIEDALKAHQRPKVENYGNTLFIVVNTAQLIGERICYGETHAFLSTRFLIIVRHGASLSYAPVRTRIERDPTMLGRGPAFCLYGVLDFIVDNYLPIMNECRDTLEQLEQDIFSENYQRQTVIRLYELKRELNKMRLVVAPLQDVLAHLTRNAETLLPQDIGIYIRDVLDHAIRINDSIDTLREMLSTALSVNLSMVTLAQGETVKRLGAWAALLAAPTLITSWYGMNFTNMPELHIPYAYPLLAAGVTLFCILLYRLFKRVHWL >NZ_CP020870.1|WP_046418690.1|2063040_2064141_+|DUF4105-domain-containing-protein MQPGQLFFEHFGHDAIIVADPISGQSLSYNFGYFDPLEPDFLRRLIRGEMLYYLISLPLEKDLDEYRTTGRSITIQWLDLPPEQATALAQTLAIRSRPEHARYRYDYFTANCATMVRDTLNQAMGGTLKTQLTQHTQNNTYRNETIRLASPTPWMSLTFDLSLGPYADKPLTPWEESFIPMHLAQHLTQVHNSAGRPLVQQTQIIPPHHPITPSPHPLPRPWTWWLLGLGIATLTLLLAPHHPRRLTLLALPFWLICTLGGLILIYLWGFTTHQAAWANRNLLLLNPLSLLLIVGGITWLCGRTPGRWFDLLLWSNAAASLAALLTYWLPAYTQHNLSWIGLLLPNHLALTWTLHRRLTTATHHPK >NZ_CP020870.1|WP_046418692.1|2062039_2062921_+|HlyC/CorC-family-transporter MSEDDDHRTITESQEKRRSWLERLSAAFSGDPHTLDDLLVILRSAEHHGIIATDTLRMMEGAIAIADLTVGDVMVPRSQMVSLPVESDLQTITKQMIESGHSRFPVHGEDKDEVLGILLAKDLLRGVSANHTITNVHELLRPVGMIPESKKLNVLLKEFRLSHNHMAIVVDEYGGVAGLVTIEDVLEQIVGDIDDEHDETEDQTKMIAIQADGCYIVDALTPIEDFNERFNAEFPDDDYDTIGGLVTEAIGHLPETGDELTLDRFAFRVAKANARRIHILHVTVLPPNEQNAA >NZ_CP020870.1|WP_053014121.1|2061416_2061941_+|hypothetical-protein MRASLTALLTTLPTLICLAAEHPHLTPPNLEALIECRQRTIDYAQLVPLLEDPPKATTLGWHPLPATNPFMTEYTLHTPIHVFGHQTKHIALSGGSIIAILDLPDPRPLARTLQLHTAIDTPKKTIFGRELTSANTTNPKTGQPAIESIVLNVSNVTSHPGKTLVGCTYSLDEP >NZ_CP020870.1|WP_046418695.1|2060931_2061417_+|rRNA-maturation-RNase-YbeY MTRGPTFLNVGISYGLPRTKLPAAVSFRKWVAATLQGRIRKADLAIRIVDEKEGRALNYHYRNKDYATNVLSFPAQLPEPLPKALKIPLLGDIVMCAPVIAREAAEQGKSLSAHYAHLTVHGTLHLLGWNHEDHQEADAMEQLEREILANLSISDPYLEEY >NZ_CP020870.1|WP_046418664.1|2079970_2082553_-|glycosyltransferase-family-4-protein MKLVIDLLGAQTRSHLRGIGRYTRELTKALLRQAGEAHAIHLVLHAPLEQASDALIAEFGALLPRARIHLLRLPRHTSEHLTGNTWRHHAASRLSRYGLTCLDADVVWHSSVFEGYDEDGVLPDAPLFHTNRVATLYDLIPLHDPEVFLPGSLPKAWYERRSAFLSSCDLLFCLSEWTAQEAVQRLHLDPERLVVIGGGVDSNFRPLVFDAQQRGNLLARFSITRPYVLYNGGLDQRKNVPALLRAFALLPMAIRQRHQLVVLGDDREVHRSMIALCRQLGLDQQEVVFTGRVNDSDLVALYGLCALFVFPSRLEGFGLPVLEAMACGAPTLCSDAASLPEVAGRQDILFPPEDHVALSARMSQVLDDPQWLQSLRDYGLRRATAFTWDRVAQRALDALTQLQVRSSGMKSRPMSGASPATHTDTAAPEMLPITEIQLVTDLAALPGDASRDDLAQAAFSICSMRVAPQAPQWLVDVSQIAKTDIGTGTSRVTRSILREWLSTPPPGVCIVPVYLREGQYHYARSFTAQVLGLVSETPQEGIVMVYPGDVFVGLDWAPEAINAARARLQDWRRVGVATCFVVHDLLPMTLPDCFHPYSRNLFEQWLRTVAHLADAIACISATTAEVLSRWLQDTTVDYQFGIPPQVKHFPLGVSFPEVSTAQEAIREELRTALVMRPTLLIVGTLEPRKGHVHALRICEQLWEQGEDVNLVVVGQRGWSEQALVMRLTRHPEVGRRLFWLSDAADAELSALYVHATALLALSEGEGYGLPLLEAAHFRLPILARPLPVFLEIMADYPHYLDGTAPDTWSATVAQWLHASDRPISRPVPIASWKESAQQLADIVNDCRFLLDSKKLSAPLK >NZ_CP020870.1|WP_010893406.1|2082549_2082780_-|hypothetical-protein MSLSRITALIARHDRLKRLMVRLLSCVPAVDVWLRHHVSARKYRRSLLDVGEADLPEVAVSVYQQLLIATGKDGRL >NZ_CP020870.1|WP_023906937.1|2082801_2084133_-|glycosyltransferase MSVDSVSRKHDRIPFIRFLGVYLLFQWRRQCARLHRARIALYSRRFGQWIRLRAAVSADQSAPPSVAAVVSASATQTPTNTRCILLIDTVPPRPDRDSGSLRCHHLMHLMVCMGYKVVLHCQERMPSAAEVMALRAIGVTTTAVAGGFPSWLLTNPERYCAVVVCRYHLGLSWLPLLRAFVPDSLCILDTVDLHHLREQREAELRNLSGLRAAAAITRRHELHAISCADLVWVVSPVERDYLARLLPQARVEVVPNMHDMVETIPPFVSRHGFLFVGGSRHPPNVDAVRWLLSEIFPRIRERLPDAQLHLVGAGLAEAMQSQQMICPGVSFHGHVPEMAPLLHACRVSLAPLRFGAGVKGKISEALAYGLPVVTTPEGAEGMYLRSGMDALISGDAEDLARQAVCAHEDLEVWQRLSDNGQQIIKQHFSLHSTRAALAVMLPH >NZ_CP020870.1|WP_031337046.1|2084227_2086606_+|penicillin-binding-protein-1B MSGHSPWWGWLIACGLAGLALAMGFLIPYTLYLNQQVTARFGELRWQIPTRVYARPLMLQPGMALDARTLKIELDAASYREDNHGELPGTYQQQNGRFTVSSRGYVDVDGAIPASRLIITLNNNQVSALRNADTRKPIRRGRLDPARIATLYGQKQEERRLVRLEEVPQLLVTGLQVVEDRDFKHHHGIDITSILRAAWVMVRSGGGVRQGASTLTQQLARSGLLGIGKEQTLQRKCNEILYALILEARYSKRVILESYLNQVYLGQRGSQAVHGVATGAEFWYGRELGSLTTEQVALLIGLVKGPSYYDPRRFPDRALSRRNFVLGKLHENKLIDDAEYMRAQASSLGVPKEPGLVAANRFVAYLDLVRRQLAHDYPENVLRGAGMTVLTGMSPSAQSYAERGVQTTLSALEKVKGQKRPPLQAGLVLTDVHNGDVLAVVGSRDATQSGFNRAIEAQRQVGSLLKPFVYLLALASPDRWSLSSWVEDTPVSIPLAHGRIWSPSNSDNISHGTVRLIDALAHSYNQATVRVGMQVGPERVAQLINVLAGLKADANPSLILGATDQSPYAMTQLYQFLASGGEIQPLHAVRGVLDPQNKLLKRYDNTPAPAQPGDSVAANVISVALQEVVNSGTARQLIIDGLGRLSPAGKTGTSNDGRDSWYAGYTGNHLAVIWVGNDQNEQTGLYGATGAMRVWSSIFSQLPSTRLQVNGKGLDWQSVEPMGTGTTDANCPGARRFPFVAGFAPPYAACSGAGGAPEAGHNIGWRNWFGLDKQKGDQASPPTTPPPDQSTQ >NZ_CP020870.1|WP_046418663.1|2086602_2087175_+|hypothetical-protein MKRLPVPFRTFTVTVMMPVALTACISPPQPATPAPAPAPPPPATPPPVDTLSPAQRLAAVTNIASADDTELLVQPLHDLQIDDLRQAAKAKRQAGNLAAAAAALDHALELVTNDPAVLQERAEIALLQSDWTGAEHYAKRAVELGSRTGPLCRHHWATIEQARLARGEKENASSAHTQIGRCTVPGIKRY >NZ_CP020870.1|WP_046418661.1|2087294_2089322_+|ATP-dependent-DNA-helicase MSLLSRASIEALSDGGLLARRLDNFVPRPGQQRLTAAIGEIFEQRNILLAEAGTGTGKTFAYLVPTLLSGLKTIISTGTRALQEQLYHRDLPRVRAALGVDLRSALLKGRTNYVCKYRLEQTCAAPHLGNAEQTTQLQRILTWSGHTQFGDMAELDTLPDDSPLLPLVTSTLDNCLGKDCPFWNECFVVQARQRAQAADVVVVNHHLLLADLALKQEGFGEILPGAQAFIIDEAHQLPALAANFFGESFGMRPLQELGRDCMIEARSVAGAQAALQHSVWQLEQALRTLRAAMEGLPSRGTQSRLLAKPEVCEGFETLITALTSMHSLLVPLRSAAAGLDACAARAQEALSRLLRWLGETAETSGFEGNLPPNNDVLWYELTQRSFRCQRTPLDVSAQLQAHRETSMAAWVFTSATLAVDGTFEHIARRLGLNHPMTLLQPSPFDWVRQALCYLPPGLPNPATHGFGVALIEALIPVLEASSGRAFLLFASHRALREAATALRGGPWPLFVQGDAPRATLLDRFRVSGNGLLLGSASFREGVDVAGEALSVVVIDKLPFTAPDDPVFEARLEAIRQEGGNPFFDEQLPQAVIALKQGVGRLIRSESDRGVLVLCDPRLLSKSYGRIFLDSLPSFPQTRDIAEVRAFFNGSNTAPTESPHDSPTSHREASGVHRIG >NZ_CP020870.1|WP_053014119.1|2089555_2090164_-|D-alanyl-D-alanine-carboxypeptidase-family-protein MNVYSSNEDTPNAPDMLSWRVHTQKNDPLPLPELKARLLALGIDADRYARTTGLFIELEPTTLADAGYDRYQRPLLLTVDAARAWHAMRQAAVRDGIVLDAISGYRSYAYQFGIFENKLAQGKTLQEILTVNAAPGFSEHHSGEALDIGMPGEPPVKESFEGSSAFAWLQGHAGRFGFHLSYPRKNPYGIVYEPWHWRWRAR >NZ_CP020870.1|WP_046418660.1|2090192_2090855_-|carbonate-dehydratase MHSLEHLLQNNRNWCERINQEDPEFFARLSKQQSPEYLWIGCSDSRVPANQIIDMAPGEVFVHRNIANVVVHTDLNCLSVIQFAIDVLKVKHILVVGHYGCGGVLASLTRARLGLVDNWIRHVTDVAEKHNSYLETIVALPDQHARLCELNVLEQVLNVCRTSIVRDAWSRTQPLTVHGWVYSLSNGLVHDLGIDVDRFEMLPSCYADALARIQADQVVH >NZ_CP020870.1|WP_046418658.1|2090989_2092102_-|glycosyltransferase-family-4-protein MKVLFVGTSRGGGGAESHFVGLVRAMAETNHQSTVLVHPDGLIARQLQQMAIRLFTATFRNVMDLRGYLVIARALRLVDPDVLVGDFGKEYWPLLLMGRLYRLPVVLFRHRLPPMNRFSTYWVPRLADRFFAVSAYARRHYLAEGMPPERVQVLYNPVDTDALRPDPRLRRAMLHELGWDEDVLVVGCFGRIHEGKGVFVLAEAMEQAMQEEPRLCCLWMGTGLHVQRLAATVAGSRFASRQRVLGWVTDPARYFQALAMLAMPSLLPETFGRVSAEAQASGVPVLVSDVGGAAETLQDGTTGMLLPAGDVPAWRNAILAFCDPQPRAAMAAAAPSFVEARFSQQVIAAEFISELERVISDRHHMIAPMR >NZ_CP020870.1|WP_046418867.1|2092098_2092818_-|polysaccharide-deacetylase-family-protein MAIPILMYHNIAKVPKQVRHLRGLYVTPAAFARQMWLLHRLGYCCLSMSAAMPYLRGERSGKVMVVTLDDGYLDNLQAALPVLQAHGFSATCYVVSGSLARFNTWDAERLKVCKPLMSPAQVRQWHDAGMEVGAHTRSHPHLSGCTAAQLHEEIAGCRDDLEQCIGAPVTQFCYPYGDVTPPVIDVVCDAGYAAATTTRRGRVFPGQHLWTLPRVPVSYRHILPQFALRTLTAYEDRRI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020870_2 | 2497993-2498141 | Orphan |
NA
Consensus repeat of NZ_CP020870_2
|
2 spacers
spacers of NZ_CP020870_2
>2.1|2498019|37|NZ_CP020870|CRISPRCasFinder GGAGGGTGCCGCTGCTGCTGATGTGGCCGGTAGCGGT >2.2|2498082|34|NZ_CP020870|CRISPRCasFinder GGGTGGCGGTCTGGGAGTCGATGGTGCCGCCGCG |
CRISPR arrays and Neighbor proteins around NZ_CP020870_2
The CRISPR arrays of NZ_CP020870_2 >merge|NZ_CP020870|2|2497993-2498141|CRISPRCasFinder GTTGGTGAGGGTGGTGCCGTCAATCTGGAGGGTGCCGCTGCTGCTGATGTGGCCGGTAGCGGTGTTGTCGAGGGTGGTGGTGTGCAGGTGGGTGGCGGTCTGGGAGTCGATGGTGCCGCCGCGGTTGTCCAGGGTGGTGGTGTGCAGGT >NZ_CP020870|2|2|2497993-2498141|CRISPRCasFinder GTTGGTGAGGGTGGTGCCGTCAATCT GGAGGGTGCCGCTGCTGCTGATGTGGCCGGTAGCGGT GTTGTCGAGGGTGGTGGTGTGCAGGT GGGTGGCGGTCTGGGAGTCGATGGTGCCGCCGCG GTTGTCCAGGGTGGTGGTGTGCAGGT
>NZ_CP020870.1|WP_046420982.1|2490336_2490714_-|DUF596-domain-containing-protein MLTQEQIDDFCEDLGGALDGLWSYIRRAHGIPPHQEDLGSFEERKNDFLFMIGKLLDEGKLKLAENGEFITGTTEELVEMFRKSFPTSDEEMEFGVWFFSDDCPAGAVWVYKGEGENGEDECEWT >NZ_CP020870.1|WP_046420980.1|2489446_2489824_-|DUF596-domain-containing-protein MLTQEQIDVICECLVSALDSIWFKIGREYGVSIQQVDPVSFEERKKDFLFIIGKLLDEGRLKLAKKGEFITGTTEELVEMFRKSFPTSDEEMEFGVWFFSDECPAGAVWVYKGEGENGEDYYEWT >NZ_CP020870.1|WP_046420959.1|2487906_2489010_-|hypothetical-protein MTALTLAAGQQVTGPATQMLQNAAVNYIQSLGAREIKDLADTLGSDTARSALQGLLACAGAAGGAAAVVINSLLDRANGAEAASLSPAEKQHRTDLVTSLVAGITTAAGGDAAVSSAAARLETENNAAFIPVILGAVWLADKGITAYQAWQDIKAIRSGEKTLEQVALERGQDYVTSILIGNLAKYGLKAAMIGGRWISGTAKEIANAEKEALKQIRNNPKGPDLTQKPPGQYIVMQQQKRLDDVKSVVGRRSQKNELVVDGIKFEYVPYDPLVKGGSNKAGNVRVFKSEALTDKQIMNYAQQLAGDVPLKNVSPGVYLAQLSDGTKVTLRSVSSSDQVTKARWTIDIANNPSLREITKEKVELKFR >NZ_CP020870.1|WP_046420955.1|2487504_2487897_-|DUF596-domain-containing-protein MLTQKQIDDIYESLGCPLHFLWSYIGSAHGLSHDQVDPDSFEERKNDFLFLIGKLLDDELLKLGNRKAELIMEGTTEELVEMFRSCFPASDEEIDQEVGGLWFFTDCPFVAAWVYKGSGENGEDEYDWCF >NZ_CP020870.1|WP_120279371.1|2486944_2487337_-|hemagglutinin MQQQKRLDEVSALVDKKNPRNELVIAGIEVKATPRGSVGGSNQSGTTKVFDSKALTDVQIKDYAQQLTGGVPLEKVKDGVYAAKLSDGTIVNLRSVSKSNDVTQARWTIDIRNNPSFMEAGNKKVELKFR >NZ_CP020870.1|WP_046420952.1|2485975_2486245_+|hypothetical-protein MMRQIPVDSSPYQTQSFQMAGDALRLILRWNPVPCCWSMDLYTTTLDQPVAHGHPLANGGGALLLDVVDDHIFGKVAAGSADVAAGTGG >NZ_CP020870.1|WP_046420949.1|2485205_2485976_+|hypothetical-protein MITLTHRHVGTVTLDAVMEETHQAELRITENPVESGAMIGDHAVLMPQTVTIAGIVVDYQPQRSPAPAAEEHGAEPLRVLTDRVPFPTDLLPFTAQALRVAQRELSSVIRHATAPQSDGQHAVRPLADWLPDDQPITSGDDATTTGRIAQVYTALRNLQRSGQTLEVHTDVQTYQDMLILSIAARQTQDGSIELVLTVRELFIVKTTSISGVSLPAPKRGRASAQGAAQRHSGQTHPTPVDTEKNRSLLRQMSGLF >NZ_CP020870.1|WP_046420946.1|2484822_2485122_-|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MTYTVKRLEEFSDWLKGLKDGLARQRLIKRLRKVQLGNFGDVQPVGGCVFEMREHFGPGWRMYYVQRGNFLIVMLGGGDKSTQQSDIRRAIELAKSLED >NZ_CP020870.1|WP_010895177.1|2484510_2484819_-|putative-addiction-module-antidote-protein MTITKKINVSELPEFDAAEYLSSEEEVAAYLTAVLEENDPALLAAALGDIARSRGMSQIAKDSGITREGLYKALRPGSEPRFDTISRVCTALGIRLVAQPMR >NZ_CP020870.1|WP_046420943.1|2481438_2482200_-|Bax-inhibitor-1/YccA-family-protein MRSGNPALKESTFLDLGTGSVVVRDGNAMTLNGTVNKTGALLLLALVTSVFAWNQSLGVDGVPLLAARGYMIGGAIGGLILALITAFKKEWSPITAPMYALVEGFFLGAVSAVYEAKFGGIVFQAVLLTFGTLMAMLLAYRSGLIKATENFKLGVMAATGGIALLYLISFVLSFFGIHIPMIHEGGTFGIVFSLFVVVIAALNLVLDFDLIEHGVEQGAPKYMEWYGAFGLMVTLVWLYLECLRLLSKIQSRD >NZ_CP020870.1|WP_140190082.1|2501472_2501682_-|hypothetical-protein MTITRPGIHILDNTTYPPQRHWWWQYRRAAALEYLPHIVTEPPRFPDLIPDGIHQHVKHPPNASAKYHR >NZ_CP020870.1|WP_046417560.1|2501752_2503108_-|tRNA-uridine-5-carboxymethylaminomethyl(34)-synthesis-GTPase-MnmE MSQRSTKMGDTIAAIATASGAAGIGIIRISGSLIKTIATGLGMTTLRPRYAHYTRFLDVDDQVIDDGLALWFPAPHSFTGEDVLELQGHGSPLLLRQLLTRCLDLGARQAHPGEFSERAFLNGKLDLIQAEAIADMIGAADLRAARAARRSLDGVFSRRCEALAQQLIRLRIHVEATIDFAEESLDTLDRAQIRTFLQTLNVELTQLLRDAEHGKRLCDGLYTVLVGPPNVGKSSLLNALIGSDRAIVTDVPGTTRDTLRESVHFHGLEFVLVDTAGLREEGDAIEREGMRRTLNELQRADLALVVLDACDPQIGNLALADALTSVPRVLWIHNKLDLLTEPPSALDTDVIPVSAMTGAGLETLKTRLRTLLLGETVETIEGEFSARLRHVQALQRTAAHVTDANAQFAYEHLELTAEELRLAYKALGEINGSMSPDELLGRIFSNFCIGK >NZ_CP020870.1|WP_046417562.1|2503104_2505816_-|polysaccharide-deacetylase-family-protein MFNKVTPLSSFLLNLCTLLALVACKHQSTPVESASPVTTVSVDVKHSPDPQQAQPLLASLHQQLESYRRIMVLLTDDEVQSPQERVTSSQVGQILFNEGLNQRTAFSKRIDALLASGAPSRFDTLTVILDYIESSPDLYDADRLAFREALNDLDARINHDSALPAIKLRQRIHEDIDALNQIERNYNQEITRLSIPFDRNRGIFIKREKWDDYVAHLRRSYTRKMILHDYGILEAYPIPMEESEHEIFGNELPPKTLVLTFDDGPHRVYTNEIKEILQHYAVPAVFFEVGRNIGRFDRAGKPQLGPLSKITRELIEQGYAVANHSMTHDLLSKLSGNALRKEVINADIILRAVDERRAPLFRFPYGARSAEGLRLLSEIGLQSVRWNIDSLDWADPVPNSVVRRVLLQVAQRQRGIILFHDIHDRAVKALPQILEKLIAEGYQFAGWDGHAFSVNNTHPKSAKTDGEGIETTASRSWATVIGVDDYTKWPKLKYAANDAQAIANTLIQSFGFPSSHVILLKNREATRDKILSVFNDLANGRIQKNDRLFVFFAGHGATRQLSLRRDVGYIIPVDSDPAHFADDAISMTKIQNIAKTFEAKHVLLVMDACYSALGLMPKETQTSPRLATSINRMSRQMLTSGGPDQPVTDKGPNGHSVFTWALLQALSGRADFNDDGLITGTELASYVASAVSTVSAQTPAFGPLLGSQGGEFVFEIPRRKEILTTQNTKRPVKAINLNGPLESTNTTRQAKQLHQNTTAVKLITVKNLHHDEARLLLPPEVKLGKEKLAQPANDLGLQLYKDKSTKELTDQFSDTLELNPHFLRTSKNLGFFYPPQSEYAQHVPWLQNTLKIDHSPVTTQLNGGDIYLQLNNKDKARKTYTTDVDLQPEDSGTKQLGMKLTKL >NZ_CP020870.1|WP_046417565.1|2505906_2507604_-|membrane-protein-insertase-YidC MNQTRVLLIFSWLTVATLLWMDWGKNKNETLEISASQNLGVDSNLELEHAVPQINAGAVPVQKDSQLIAVAPKVPVINVKTDVLQLKLDGFSVLAADLLRFPQSKDRGAKPIKLLTDDPNYPYSATTGWVSQSNSPVPNLSTFLPEQPGVSYKLANDQDRLVVPFVWTAANGVSIRRTFTFERGRYAILIRDEIRNGGETPWNAYVFRKLSRVPIPNILNRAMTNPDSFSFNGAVWYSEKGGYERRAFKDYMNDGGLNREIGGGWIALLQHHFFTAWIPQKDQASLYLLAQNGSRDIAELRGPAFTVAPGQSTTTEARLWVGPKLVEQITKEHVKGLDRVVDYSRFQLMALIGQGLFWILSHLNSLLHNWGWAIVGLVVLLRIAMYPLSAAQYKSAAKMRKFQPRLQQLKERYGEDRQKFQQAMMELYKKEKINPMGGCFPILIQMPIFFALYWVLVESVELRQAPWLGWIQDLTTRDPYFILPLLNIVIMWATQKLTPTPAGMDPIAGKMMQVMPLIFGVMMAFVPSGLALYWVINGGLNLLIQWWMIRQHADFSRKRSRENIK >NZ_CP020870.1|WP_075584703.1|2507648_2508056_-|ribonuclease-P-protein-component MNSCKRFPRSARICLRSEYYVAFEQGRRYSSVLLRLHHLPTSGPVRLGLVVSRRVDIRAVNRNRIKRALREVMRQIAYKLVPGDYVVVVRQTAKDVSNAELSVALLSLLRRIGALPLAPIDNAMLPFFERNCSRK >NZ_CP020870.1|WP_004085093.1|2508114_2508255_-|50S-ribosomal-protein-L34 MATKRTYQPSNLKRKRDHGFRARMSTADGRKILARRRAKGRKRLSA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP020870_1 | 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder | 2076862-2076898 | 37 | NZ_CP020870.1 | 865668-865704 | 0 | 1.0 |
NZ_CP020870_1 | 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder | 2076925-2076958 | 34 | NZ_CP020870.1 | 865608-865641 | 0 | 1.0 |
NZ_CP020870_2 | 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder | 2498019-2498055 | 37 | NZ_CP020870.1 | 865668-865704 | 0 | 1.0 |
NZ_CP020870_2 | 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder | 2498082-2498115 | 34 | NZ_CP020870.1 | 865608-865641 | 0 | 1.0 |
1. spacer 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder matches to position: 865668-865704, mismatch: 0, identity: 1.0
ggagggtgccgctgctgctgatgtggccggtagcggt CRISPR spacer ggagggtgccgctgctgctgatgtggccggtagcggt Protospacer *************************************
2. spacer 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder matches to position: 865608-865641, mismatch: 0, identity: 1.0
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer gggtggcggtctgggagtcgatggtgccgccgcg Protospacer **********************************
3. spacer 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder matches to position: 865668-865704, mismatch: 0, identity: 1.0
ggagggtgccgctgctgctgatgtggccggtagcggt CRISPR spacer ggagggtgccgctgctgctgatgtggccggtagcggt Protospacer *************************************
4. spacer 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder matches to position: 865608-865641, mismatch: 0, identity: 1.0
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer gggtggcggtctgggagtcgatggtgccgccgcg Protospacer **********************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP020870_1 | 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder | 2076862-2076898 | 37 | NZ_CP035092 | Paracoccus denitrificans strain ATCC 19367 plasmid unnamed1, complete sequence | 602553-602589 | 7 | 0.811 |
NZ_CP020870_1 | 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder | 2076862-2076898 | 37 | NC_008688 | Paracoccus denitrificans PD1222 plasmid 1, complete sequence | 196086-196122 | 7 | 0.811 |
NZ_CP020870_2 | 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder | 2498019-2498055 | 37 | NZ_CP035092 | Paracoccus denitrificans strain ATCC 19367 plasmid unnamed1, complete sequence | 602553-602589 | 7 | 0.811 |
NZ_CP020870_2 | 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder | 2498019-2498055 | 37 | NC_008688 | Paracoccus denitrificans PD1222 plasmid 1, complete sequence | 196086-196122 | 7 | 0.811 |
NZ_CP020870_1 | 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder | 2076862-2076898 | 37 | NZ_CP048110 | Klebsiella michiganensis strain BD177 plasmid unnamed2 | 28603-28639 | 9 | 0.757 |
NZ_CP020870_2 | 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder | 2498019-2498055 | 37 | NZ_CP048110 | Klebsiella michiganensis strain BD177 plasmid unnamed2 | 28603-28639 | 9 | 0.757 |
NZ_CP020870_1 | 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder | 2076925-2076958 | 34 | NZ_CP020810 | Mycobacterium dioxanotrophicus strain PH-06 plasmid unnamed1, complete sequence | 61106-61139 | 10 | 0.706 |
NZ_CP020870_1 | 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder | 2076925-2076958 | 34 | MN692199 | Pectobacterium phage MA12, complete genome | 54468-54501 | 10 | 0.706 |
NZ_CP020870_1 | 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder | 2076925-2076958 | 34 | MN518139 | Pectobacterium phage MA11, partial genome | 29039-29072 | 10 | 0.706 |
NZ_CP020870_2 | 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder | 2498082-2498115 | 34 | NZ_CP020810 | Mycobacterium dioxanotrophicus strain PH-06 plasmid unnamed1, complete sequence | 61106-61139 | 10 | 0.706 |
NZ_CP020870_2 | 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder | 2498082-2498115 | 34 | MN692199 | Pectobacterium phage MA12, complete genome | 54468-54501 | 10 | 0.706 |
NZ_CP020870_2 | 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder | 2498082-2498115 | 34 | MN518139 | Pectobacterium phage MA11, partial genome | 29039-29072 | 10 | 0.706 |
NZ_CP020870_1 | 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder | 2076925-2076958 | 34 | NZ_CP016083 | Streptomyces sp. SAT1 plasmid unnamed3, complete sequence | 340551-340584 | 11 | 0.676 |
NZ_CP020870_2 | 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder | 2498082-2498115 | 34 | NZ_CP016083 | Streptomyces sp. SAT1 plasmid unnamed3, complete sequence | 340551-340584 | 11 | 0.676 |
1. spacer 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder matches to NZ_CP035092 (Paracoccus denitrificans strain ATCC 19367 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.811
ggagggtgccgctgctgctgatgtggccggtagcggt CRISPR spacer tggcggcgccgctgctgctgatgtggccggttgggat Protospacer *. **.************************ * *.*
2. spacer 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder matches to NC_008688 (Paracoccus denitrificans PD1222 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.811
ggagggtgccgctgctgctgatgtggccggtagcggt CRISPR spacer tggcggcgccgctgctgctgatgtggccggttgggat Protospacer *. **.************************ * *.*
3. spacer 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder matches to NZ_CP035092 (Paracoccus denitrificans strain ATCC 19367 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.811
ggagggtgccgctgctgctgatgtggccggtagcggt CRISPR spacer tggcggcgccgctgctgctgatgtggccggttgggat Protospacer *. **.************************ * *.*
4. spacer 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder matches to NC_008688 (Paracoccus denitrificans PD1222 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.811
ggagggtgccgctgctgctgatgtggccggtagcggt CRISPR spacer tggcggcgccgctgctgctgatgtggccggttgggat Protospacer *. **.************************ * *.*
5. spacer 1.1|2076862|37|NZ_CP020870|CRISPRCasFinder matches to NZ_CP048110 (Klebsiella michiganensis strain BD177 plasmid unnamed2) position: , mismatch: 9, identity: 0.757
ggagggtgccgctgctgctgatgtggccggt-agcggt CRISPR spacer tggcggtggcgcttctgctgatgtggccggtcaacaa- Protospacer *. **** **** ***************** *.*..
6. spacer 2.1|2498019|37|NZ_CP020870|CRISPRCasFinder matches to NZ_CP048110 (Klebsiella michiganensis strain BD177 plasmid unnamed2) position: , mismatch: 9, identity: 0.757
ggagggtgccgctgctgctgatgtggccggt-agcggt CRISPR spacer tggcggtggcgcttctgctgatgtggccggtcaacaa- Protospacer *. **** **** ***************** *.*..
7. spacer 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder matches to NZ_CP020810 (Mycobacterium dioxanotrophicus strain PH-06 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.706
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer acaccgtggtctgggagtcgacgttgccgccggc Protospacer . .. *.**************.* ********
8. spacer 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder matches to MN692199 (Pectobacterium phage MA12, complete genome) position: , mismatch: 10, identity: 0.706
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer tggcggcggtcagggagtcgatggtcttgactaa Protospacer **.******* ************* ..* * .
9. spacer 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder matches to MN518139 (Pectobacterium phage MA11, partial genome) position: , mismatch: 10, identity: 0.706
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer tggcggcggtcagggagtcgatggtcttgactaa Protospacer **.******* ************* ..* * .
10. spacer 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder matches to NZ_CP020810 (Mycobacterium dioxanotrophicus strain PH-06 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.706
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer acaccgtggtctgggagtcgacgttgccgccggc Protospacer . .. *.**************.* ********
11. spacer 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder matches to MN692199 (Pectobacterium phage MA12, complete genome) position: , mismatch: 10, identity: 0.706
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer tggcggcggtcagggagtcgatggtcttgactaa Protospacer **.******* ************* ..* * .
12. spacer 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder matches to MN518139 (Pectobacterium phage MA11, partial genome) position: , mismatch: 10, identity: 0.706
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer tggcggcggtcagggagtcgatggtcttgactaa Protospacer **.******* ************* ..* * .
13. spacer 1.2|2076925|34|NZ_CP020870|CRISPRCasFinder matches to NZ_CP016083 (Streptomyces sp. SAT1 plasmid unnamed3, complete sequence) position: , mismatch: 11, identity: 0.676
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer tccacacggtctcggagtggatggtgccgctgtc Protospacer .****** ***** ***********.*.
14. spacer 2.2|2498082|34|NZ_CP020870|CRISPRCasFinder matches to NZ_CP016083 (Streptomyces sp. SAT1 plasmid unnamed3, complete sequence) position: , mismatch: 11, identity: 0.676
gggtggcggtctgggagtcgatggtgccgccgcg CRISPR spacer tccacacggtctcggagtggatggtgccgctgtc Protospacer .****** ***** ***********.*.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
767313 : 779496
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP020870|767313:779496|DBSCAN-SWA ACTATGAAGCGTCCTCCCGTAACGCCGCTTTCAGCTCCGCCGCCGCGTGCTGTTCCGCCGCGCGTAGCTTGTCTAATAGCCATTCGTACACACCGCGCCACTTTTTGCAGTACGTAGACACATCACGGCCAAGCGCCGCCGCGCGCTTACGATCACTGATAGGAACCGTGCCACTGCCGCCGCACGCTGTGCAGGTGCGCATCAACACGCCGTTACGTACCTGCGCCCACCAGTGGCCGAAGAACGCCAGAATGTTCCCGCCGTCTTGCAGATTTAAGCCATGACCGGCACTGGCCGGATGGGCGAATAGCACGGGAATGTTCCCTGCATTCCAATCGCGGATCGTGTCGGGGTGTTTGTCCAAAGCACGCCCCTTTGGGAAGGCACGCTGCAACCGTGCGACATCACTTTTAAAGTGATACGCCACCAACACCGGCATTCCGGCGGCTTCTTCGATAATGTCGTGCAGCGCCTCTAGTTTTGCGTCGTGGACGACTTCCCAGGCCTGACGTGTGTCATCGGTGTACAGCGCACCATTGGCCAGTTGCAGGCATTTTATGGTTTTACTGGCGGCGTTAAAGGCTTCTACTTCGGCACCGCATTCCAAGGCAATGAACATGTCTTGTTCCATGGCCTTGTACAGACGTTGCGCATGTTCTGGCAACGCAACGCGAATCGTATTGACAATCGGCTGGCGTAAATCAAAGTATTCCTGTGGATCAAGGGATAAACACAGGTCGCGTATCGTGTCTTGCATCTCTTTGGATGCATTCGGCGTGGGCACAAGGCGCACCGCATGCGGATCACTGCCGATCTGCACCGCACGGAACCAGCGATCGATAAACGCTTTAAAATGCGTCCCAAGCCGGGCCCCACGATCCACCATCCACATCAGCGCCCACAGGTCCTGTAGCCCATTTGGCGCGGGCGTGCCGGTCAACCCAATGTAGCGCTCCACCTTGGTATGCACATGCTTGGCCAGTGCGCGGGCGCGCCGTGTTCCTTGCCGCAGCCGGAACCCTTTCAGCTTGGAGCACTCATCGGCGACCACCATCCGGAACGGCCAACGGTCCTTGTAAAACTCCGCCAGCCATTTTAGATTGTCGTAATTAATGCAGTAGATATCCGCGTCCTGCTCCAAAGCGCGGCGACGTGCCGCCGCACTGCCCACGGCCACGGACACCCGCAGATGGCGCAAATGGGGGAACTTAGCCACCTCATCCGGCCATGTCGTGGCCGCAACGCGCAGCGGAGCAATGACCAACATAGGTGCAACGTCTTCCACCAAAAGAAGCACATCTAACGCCGTCAGTGTGGCCACTGTTTTCCCTAAGCCCATCGGCACAAACAGATTGCAGCGTGGGTGGGTCAGGATGAAATCAACCATGGTGTGTTGGTAGGGGCGCAGGTTCATGTCGCATACCTGCGCAGCAGGTGGGGGCTGTTAGTGGCAGCTTGTGTGGGCGTGGGTGCGGGCCTGTGTATGGGAGTCAGCGACAGGACGCTCCCTGCGGCCTCTTTAAGTCCGTTTTTTCCGTCTGCAATTAAGTAAGTGATTGCCTCCAGGGTGTACACATCCTCTTGGCCGTTGTACTCGCGCTTGCGATCGCGCCCCGCATCCTCGCTCACACGTCTGGCGAATTCGTAGGCGATGTTTAAACGGGATGACACATACTGCAATGAGCGGTAGATATCTCTGCGTAGCTGATCGATATTGGCAGGTGTAGCGACATGTGGTGTGGCCGGTACGACAACAGAGGCGAGAAACACACGAATGACTTTCAGCTGAAACGCTGGAGCAATCCATGAGGCGTAAGCAATTGCCAATTCGCGGCAAGCAAAGGTGCTTCCAGCGCGACCTTTCTCTTTTCTCACGAAAAGGTGGGGTGCCCCTCCTTTTATTGAACTCACAAACTCTCCAGAGTTTGTGAGTTCAACTACCAATGCCTTGGTTTTATCGAGCCTTAAAAACTCAGAAGGACGATGCCGCACGGCTCCGCCGCTGGCTTTATGAAAATCGTTGAGCGAGTACAAGCCGTCTTGCTCACGAATGGGGAGAGAATCGATAGCCAGTTGCTGTGTAGTCATGGAGTAGTCCTGGTGGTGGTGATGTTTTGAATTGGTTTCAGCAGGTGGATACACGGCGCGCTGCTATACTCGCCATCACATGGGGCTAAGACAGGGGGCTGTGTGACATCCGTAGCGTTCGATACACTTAAATTTGCGAACCGGCTGAAAACGGCAGGCGTTCCTGCGGCGCACGCGGAGGCGGAAGCCGAAGCCTTGGCGGAAGTGCTGGAAACGAATTTGCAAGACCTTGCGACAAAGCGAGATTTGCGGGAGCTGGAATTAAAGCTCGAATCGAAAATAGATAAAGGGTTCGCTGATGTGCATAAAGGGTTCGCTGAAATAAAAGGCGAAATGCTGCTACTGAAATGGATGTTTGGGGCCCTCGTAGGCGGTGTGACTGCACTGATCATCAAAGCGTTTTTCTGAGTCACGCCAGCACCTCATCCACGCCTTTTAAGGAATCGACCACGACCACGCGCTGGCCCATGCCGCGCATGCGCGCATGCTCACGGACTTGATGCGGTGTGCACTGCTGGCCGGGGGCTTTGAGTTCCACCCACAGGGTGCGCCCCTCGGGCAGCATGGCAATGCGATCCGGCGCACCGTGGCGGCCACCCCATTTCACCTTGCGGATTTCACCGCCCTTGGCCCTGACCTGGGCCACTAAATAGCGTTCGATCGTTCGCTCACGGGGAATGTTCATCATTGCTTCCTATACCGGTGGGTGTCGAAGCCTTCCGCCGCTAAGGGCAACCCTTGTGCCCAGGGTGGCGGTGTGGCCATGAGTGCGGCCAAGTGCGCGGCATTGAAGGAGGGGCACTCATCGGCTTCGGTAATGATCTCGTCATGCACGGTCAGCACGATGCTGTATCCGGCGGCTTCAATCGCAGGCATACACGCGGCCAACACGTCGCGGCTGACGGCTTGGGTGATGTTCTCGACCAGCTTGCCGCCGTAGGTGGTGAGGCGCGCCCATTTTCGCGTCACCGGATGCGTGCCCATGTAGGACAGCGCGCCGTGCTCATCGACTCTGGGAGCGGCGTAGTAAAGCACCCGCCCCGACGGCAGACGCAGACGCAGCCACGCACGGCTGTACTGCACTGTGATCCCGCAGCAGGTGTGCGTCGTTTCGGGGTGGTGGATCGCATCCGTCACCGCGAACTGCAACGCCTGCCAAAACGCCGCAATGGCCGGATGCGCGTTGCGCCATGCGCGCTTAAACACATCGCAGGCCAACCATGCGCGATCGGAGAGGCCGAAGGTGGGACGGTGGTTCGCCTTCGTCCACTGGAGCGCTTCCATGGCCTGCTGAAGCAGCAGGGGTGGTAAGGCGGCCTGCTCGGCCATCGCCTCCAAATCAATGTGATACATGGCCGCAAAGGCGGCAAAGGCCCCGACACCGCCGCCATACCCCAATGCCAATTCCTGCACTTTGCCAATCTGGCGTTGCTGCTTGGTCACGGCCTGGGGCGCTATCCCGAATGAATGGGCGTAGGCGCGCTTGTAAATGTCGTCCCCTTTGCGAATAGGCTCGTGCTCGGCATTCCATTGCAAGGTGATCGGCGCGCCGCGCAGTGCGCCGTGAGTGATGGCCTCGCCGCTGTGCCATGTACCGTCCACCCCTTGGCAGGTATCAAAATCACGAAACGCGTGCAGCTTGGGGGTTTCACCGGCCAGCCACGCCAACACCCGGCCTTCAATGTTAGACAGATCGGCCACGACCAGCTTTTTATTTTTTGGCGCAATCAGACAACTGCGCAGCGCGCTGCTGGTCAGCGCCATAACATCGTCACATACTAAATCCACACAACCGGCTTTCATGGCATCAATGCCGACCGCGATCACTTCTTGGCTGAGCGTGGGGCGCGGCAGGTTGTGTGGCTGGAACAGCCGCCCCGCCCAGCGCCCGGTGCGGCTGGCCCCTTTAAATTGCAGTGTGCCGCGCAGGCGACCGTCAGGGCTGGTGCAGTGCAGTAGCGCCTGGTACTTAGCGGTGCTGGTGGTGCTGGCTTGTCGGCGAATGGACAGTAGTTCCCGCGCCGTCTCTGGGAGTGCCGGATCGTCAATGCACCGTTCTACCGTGTGTTGCTGCATATCCGGCAACGCCACGCCGTGCGCGGTGCTCAGGTGGTGCAACAGTGCATCGCGCTGGGTCGCCGCCTGCACCGCGCCGCCGGTCAGCGCCTCGGTACGGTTGGCTAATGTGCACTTGGCGCGTTCCACAGCGCCGATGGCGGCCTGCGCCAAGTCCGTATCGACCAGAACGCCTCGGTCATTTTCTGCCCCCGTGTAGTTGTGGGACGGCAAGCGCTTCACCACGTCCCGCATGGCCGCCACATCGCGCTTGGCGTAGTCCACAAACTGTGCCCAGTCCACGGGAATCTAGAGCACTACCAACAAATCATCGAACGCATTGGCCCCACACGTCAGGCGCAAGCCGGACATGATCGCCCCGTCTTCATTCACCACATCGTGGCGGCGGGCACGGTGGATGAGCTAGTGATGGCCCGCCGTGAATCCAAACGCGAAGTACAAGATTTACTGCTAGAGGCAGTGAAACGCAGAGAAACAGGCAAACCACTCACATCACAAGGAGCCATGAGGCCATGAGCGACACCTGTAACTACCGCTTTGTTGATACCACGCAATCGGACGAAACGGCCCTGAGAGAATCAGAATTTTTAACCGCTAATGAAGTGTGCGAGCTAACCGGTTACAAGATATGTGCTGGCCAACGTAAATGGTTAGACAGCGAAGGCTGGTGCTATGTGACCAACGCATCTGGACGCCCGATTGTGGGGCGTTGGTATGCACGTTTGCGCGTCGCCGGTGAGTCGCTGCACATGGCAGGCGCTGGTATACGCCCAGTAAAGCTTCCTAATTTTGCTGCACTCGATGAATAGGGCACAGCACCATGCGCCCAAAATCAAGCCACAAAGATTTGCCGCCCAAGATGCTACGCCGTACCCGTGTATTAAAAAGCGGAAAAGTGTGGGAGTCGTTTTACTACAATGGGCGCACTACCGAAGGGCGGCGCGTTGAAATTCCGTTGGGGGGTGATTTAAACGAGGCAAAGCGTCAATGGGCGGAGTTGGAGTGCTGCAAAGCCCCCGTAGAAACAGAAGTGCTTGGCTTTATATTTGATCGCTATTTGCGCGAAGTGGCACCCACTAAAGCGCGTGCAACCCGGTACCAAATTAAAAGCTGCATGACCACGCTACGCAAAGTATTCGGCGACGTGAATATTCATACAGTGACCCCGCAGCAGCTTGCACAGTACCGCGACAAACGCGCACGCACTGCGCCGGTACTCGCTAACCGGGAACTCAGTGTGTTTTCCAGCGTGTGGACTATGGCGCGTGAATGGGGCTACACCAACAAAGAAAACCAAGTGAAAGGGATTCGCAAAATCAAAGAAAAGCCCCGTGATTTCTACGCCGATGCTGCGGTATGGAACGCTGTCTATGCCAAGGCGTGTGAAGAACTCAAAGATGCAATGGATTTAGCCTATTTAACCGGCCAACGGCCTGCTGATGTTCTCAAGATGCGCTTTACGGATATTAGAGATGGCTCGCTTGAGGTGCAACAAAACAAAACAAAAAAGAAGCTGCGAATCCTTTTGGAGAGTGACGGCAGACGCACAGAATTAGGGAAAGTGATTGATCGTATTAAGGCACGTAAACGTAAGGTTGTCGGCTTTTCTCTCGTCTCCACATCGAAAGGTGTTGGGCTAGGCAGCAAGCCCTTGCGAGTCCGATTTCAACGTGCGCGAGCTGCTGCTGCGAAGGCCGCTTCTGACTCTGGCGAAGTAGATTTAGTAAAAAGAATTCTTATTTTTCAGTTCCGCGATATCCGCCCCAAGGCTGCGAGTGAGCTACCGTTGGAGCATGCCAGCAAGCTGTTAGGCCACACTCAGCAGCAAATCACTCAGCGGGTGTATCGGCGCGTAGGCGAAATAGTGAAGCCAACCAAGTGAGCACCAACGGGGGCAGTCCGCAAGTTGCGGAAACTATTCCCGTAAGTTGCGGAAACTTTGATTGGGAAATCTTTTTAAATCAATAGCTAATATGGCGTTTTGCGATGTTGTCAGAGATCAATACAGGGTTGCTGATTGTTGTACATTCCCTGCTTTGCCAGCCAGGCCGAAGTGGCCAAGAAGCGGCCTGCTGGGTTGCTGACCAGGATCAGCAGGTGATGCCTTGCCATGTGGCGTTAACGAGCAAAGTAGTGAGATTACTGCCCGACATTGAACTCGAATGGGATTTCAATGCTTCCCGGTACCGGCTGATCGTTGCTGTTGCGCGCTGGTTTGAAGCGCCAGTTGCGTACCGTTTGCAGTGCAGCATTATCCAGGTCACGCGAGCCGCTACTTTTAGCCATTTCCACACCACTAGGAGCACCTTCGACATCCACCAGTATTCGCACCGTGACGGTACCGTGATCGCCACGTGCCGCCGCTTCGGCTGGATAGTCAGGTGAGGGCATTTTCCCAGGGATAGGAGTGGGTTGGCTGCTGGCAGCCGTTGTTGCTGCATTCTGGGGTGATGCTGTTGCCGTCGGCATCTGTGCTTGCGCCGACATATTAGGTGGTGCTGTGCCGTTGGGAGCTGACGGGATGGGTTCGGGAAGCGTGTCGTTGATTGCTGCGTTGTTTGTGTCCAGGTCGTTCTCTGGGAGTGGATCTGATGGTGCCTTGGGCATATTGGCAGCAACATTGCCTTTTGCTGGCAACGGTGCTGGTAACGGAGGCAGCGTCTGCGTACTTTCGGCCACCGGTGCTACCCGAGTATCGGCCGCAGTGCCTTGCCTGCGTCCTGTCATCCAAAAAAAACCAAATAAGAGAATACCGATACCAAATGCGATGCCGGTAATTTTGAATGTATGCGCGGGGAGGGTGGGAGGGGAGGAATAAATACGGCGGTCGTACATGGGGCATAGGCCTGGACGAATCTGCCGATTATTTCACAGCGCAGATGAATGTACGCCAGAGGCATTGATGCCCCTTAACGGGTCATAATCATCACCCGATCCATTGCACCGTGCTGCTTCTATGCTCGATCCGACTCTACTCCGTCAACAGCTTGCGAAACTCGCTGAATGCTTGTCGACTGTACGCGGTTTCACTCTTGATGTTGCAGCCCTTGAAGCATTGGAAAGCGAACGTAAACGTATCCAGGTGCATACCCAAGAGTTGCAATCCCTGCGTAACAGTAAATCTAAGGCGATTGGCCAAGCCAGGAGCAAGGGTGAAGATGTGTCTTCGTTAATGGCTGAGGTTGCCGCTTTTAGTGATGATTTAAAAGCCTCGGAGATAGCGTTGGAGGAGATCCGTACTGAATTAGAGAAAGTGGCCTTGGGTATTCCCAATTTGCCACAGCAAGATGTGCCGCTGGGTGCCGATGAACGTGACAACGTAGAGCAGGCCCGTTGGGGAGTGCCACGTACGTTTGATTTTTCCATCAAGGATCATGTTGAATTGGGTGCGTGTCATGGTTGGTTAGATGCCGAGAGTGCTGCCAAGCTATCCGGTGCGCGTTTTACTGTATTGCGTGGCCCCATTGCACGACTGCATCGCGCACTGGCGCAGTGCATGCTTGACTTGCATGTTCGCCAGCATGGATACGAAGAAGTCAATGTCCCAATCATTGTCAATGCCGACAGCTTGTATGGGACTGGTCAACTACCGAAATTTGAAGAGGACATGTTCAGTACTCAGCTCGGTGAGCATCGGCGCTATTTGATTTCGACCTCTGAAATTTCGCTGACTAATTTAGTACGTAACGAGATTATTGAGGCGGATCGCCTGCCGTTGCGTATGGCTGCACACTCATTGTGTTTCCGTTCCGAGGCAGGCAGCGGTGGTCGTGATACGCGTGGCATGATTCGTCAGCATCAGTTTGAGAAGGTCGAATTGGTCAGCGTATGTAAGCCGCAAGAGAGTGAAGGTGAGCACCAGCGTATGACTCGCTGTGCAGAGACAGTGTTGGAAATGCTGGGACTGCCCTACCGCAAGATCCTACTGTGTACCGGTGATATGGGATTTGCCGCGACCAAAACCTACGACTTGGAAGTTTGGTTGCCGTCGCAGGGAATGTATCGCGAGATTTCATCGTGTTCCAACTGCGGTGATTTCCAAGCCAGACGCATGCAGGCCCGTTGGCGCAATTCTGTCACCGGCAAGCCGGAATTGGTACACACACTCAATGGTTCTGGTGTCGCTGTTGGCCGTGCGATGATTGCGGTGATGGAGAACTATCAGAATGCTGATGGTTCGATTACGGTGCCGGAGGTGCTGCGTCCTTATATGGATGGTTTGAGTCGGATTGGTTGAGTTCTAAGTGGCGGCAATGTCACAAAGGTGCTGTGATCTGTCAGTGATCTGCCACGGTACACTGTAACGATGCAGCAGAGTATCGCTCCCTGTTCAATGAGCGCTATTGAATTGCCAGCAAACCTGCAACGACATGATGTTGATGAGAAACGGTAATGACTGTCTGTCTGTTGAAGCTGTCAATAGCAGCATGGCTCAATGATGAGAGATTAACGTAAAACTTCATCAACATCTGATGATGACATTGAAGTACATACTGATGTCATGGAGCCAAACAGCATCGGCTGTCGTTTATCCGTTCTATGAAATACCATTGGCGCTTCTTGTATTCCATGAGTGGGCTGCTTTATTAAACTCCCTGCAAGTTATTTTTCTGCCTGGTATGTTTGATATCAGATGTGGTTTTATTCATTTGTTTGATTGTCAATCGTCTCTCAGCATCATCCCTGGCCTGAAAACAAGATCAGGTCATACAACGATATAAGACAAACATTGGACGCAATCCGATGGTGATGCGTTCTGGTGTCAGTGACCGGAATAAAGACTGGTTATTGGGTATATGTGTACTGCAATAAAGCGTTGCCCTTACTCGTAAGGTCAGCAAAGTGGCGCCTGCCTGTTTGCACCCTGGTGTAAGCACGGAAACAGCATCTGGACGGAAAAATATGCCAACTGTGCAATGATATAGTGACAACAGCACACGCTCATGATTTCGTTAATCAGGACGAAAGTATTGAATCATCATCAATAAAGAGCGTGCATTTTTGTTTTATTTCAAAAAAATTGAAAAGTAGGGGAGCTTGAAATGGAAGCATTAAATAAAATTCGATATGCGTGTTTAGTGTGTACGCTGATGGCCCTGGTGACATTAGCTGGATGCAACTTGGCAGATCGTTCTGCTGAACCGCCCCAGTCGTCCCAGCAAGGGTCGGAACAAGATGGTGCCAAAAAGATTCCTTTGAATGTTGAAGTGTTTAACCTGGGCGAGCGTTCGATGTTTGCAGTCTCCTCTGTATTGGTGACAGGGACTGCGGATGCTATTTTGATTGATACGCAGTTTTCTGCGGCTGATGCGCGTCAGCTTGTAGAGAAAATAAAAGCATCTGGCAAGCGTCTGAAGGCGATCTATATCAGTCATGGTGACCCTGATTATTACTTTGGCCTGGATAGTGTGCACGCTGCCTTCCCTGAGGCGAAGGTGCTCGCTACTCCGCAAACTATCGCGCATATCCAGGCCAGTAAAGATGCGAAACTGAAGCTCTGGGCGCCGCAGCTGGGACAGGATGCACCGAAACAAATCTTGATTCCAGAACCAATGCAAGGAGACCAGCTGGTACTGGAAGGTCAGGAATTGAGGATTATTGGTATGGATGGTCCGACGCCTGACCGTACTTTCGTGTGGATTCCCTCAAGCAGAACAGTTGCTGGCGGTATCTCGGTGATGGGGGCTCAGCATGTATGGATGGCTGACACACAGACCCCACAGTCCCACATAGAGTGGTTGGCTGTGCTGGATCGGATCAAATCAATGGATCCGAAGGTTGTGGTTCCAGGACATTTTTTAACTGGCGCGTCGCTGGACGTGCGCTCGGTGACGTTTACTGCCGATTATATTCGTGCCTTTGACGAGGAAACCGCCAAGGCCAAGGATTCGGCTGCGTTGATCGTGGCGATGAAGCAACGTTATCCCGGACTTGGTGGAGAGGGATCGCTCGAACTCAGTGCCAAGGTGGCCAAGGGCGAGATGGAATGGAAGTGATGTGATGGGCGGCTATGCTCACGTTTCCATCCTTATTCCTACACGGCGTGGCGCTTCTTTGTGATAAATGCATCTATTTTTTATTGCAGAGCGTCGCTCTTTTCTGGACATGTTCTTTGGTACATAAATGTGTTTCAGGTTAGGCGATGTCTGGTGCTCACTGGACCTGACATTTGGAGCATTCTTTCTCTACAGCTGAAATGCGCTGTTGTCTGGTAGAAGCGGTATCAGAATTTTCCTGATTGACCTCGTAGACCGCATAATGCTCCGATGACCCCTTTCATCAAAACTCCTTCTATGGCCAGGATCATCGCCATTGCCAATCAAAAAGGTGGTGTCGGTAAGACCACTACAGCGGTTAATCTGGCGGCAGGCTTGGTGCGTGCATCCGAGCGAGTCCTGTTGGTGGATCTTGATTCGCAGGGCAATGCGACGATGGGCAGCGGTGTGGATAAGAACGGGTTGATCTCGTCTACCTGTGAGGTTCTGTTGGGTGAGAGGAGTGTTGCTGAGAGTCGGGCCAGGGCACCGGAAGGTTTCGACTTGCTGCCAGGCAACATTGATTTGACTGCAGCGGCCATCCAGTTAATGGAGCAAAGCGAGCGCGAGCAGCGATTGAAGCGTGCGCTGTCACCGATTCGCCATGAGTACGATTTCATCTTGATTGACTGTCCGCCGGCGCTGTCGCTGCTGACCGTGAATGCCCTCACCGCAGCGGATTCGGTGATTGTGCCGATGCAGTGCGAGTATTACGCGCTGGAGGGGTTGAGTGCGTTGCTTGAAACGATTGAGGCGCTGCGTGTCAATCTGAATCCGCGACTGGAGATTGAGGGTGTACTGCGTACGATGTTTGATATCCGCAATAATCTGGCAAATGCAGTGTCGACGGAGTTAACAGAGCACTTCGGTGACAAAGTGTTCCGGACGATTGTGCCGCGTAATGTGCGTTTGGCTGAAGCTCCAAGTTACGGTAAGAGCATCGTTGGTTATGACGGCGCCTCGCGTGGTAGCGTGGCTTATTTGGGCTTGGCTAATGAAGTGATTCTCCGTCAAAAAAACCGCAAAAAAGCCAACGTTGTGGAGATTAATTAATGAATAAACTCAGCCCTCCCCTCAAGAAGCGCGGCCTGGGTCGTGGTCTTGAAGCCTTGCTTGGGTCTAAGGGTGGATCTTCTGTCCCACCAACGGTTACCGAGGAGCAGTTACCTGGCGAAGTACTGCGTACCCTGCAGATCACGCAATTACAACCTAGTAAGTATCAGCCGCGGCGAGAGATGAGTGAACCCAAGCTGGCAGAGCTTGCTGATTCGATCAAGGCACAGGGGGTGATTCAGCCAATCATCGTGCGCGAGTTGGATGTGGATATGTTCGAGATTGTTGCTGGGGAGCGCCGTTGGCGGGCCTCGCAATTGGCTGGGCTGACGGAAGTGCCGGTGCTCGTTCGTGAGTTAGATGACCGTACTGTTGTTGCGATGGCGCTGATTGAAAACATCCAGCGTGAGGATCTTAATCCACTTGAGGAAGCGCAAGCGCTACAGCGACTGATTGATGAGTTTTCGTTAACGCATGCTGAGGCGGCTGAGGCTGTTGGTCGTTCACGTGCGGCGGTGTCTAATCTTTTACGTTTACTTGAACTTCCGTTGGGGATCCGTACGCTGCTGCAGTTACGTCGCTTGGAGATGGGGCACGCCCGTGCGTTGTTGACTCTGGCTCCCGAGCTGGCGGACAAACTCGCCAAGGAAGCGGCTGACCAAGGGTGGTCTGTCCGGGAGGTCGAGCACCGTGCTCAGCAATTTGCTGCTGGAAAAGTGCCAGACATCCGTGACAAGAAGTCCAAGTCGCCCGCGAGTGCTCCTGCGCAACCCGACATCGCCTCCTTGGAAACGGAGTTGTCTGAGCATTTGGGTACCAAAGTGGCGATCAATCATGGCCGTGCTGGTAAAGGTAAATTGGTGATCCACTACACCGACTTGGATGTTCTTGATGGTGTTTTGGAGAGATTGCGTGCTCGTGCGGCTGATTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP020870|767313:779496|777769_778564_+|WP_010894730.1|DBSCAN-SWA MARIIAIANQKGGVGKTTTAVNLAAGLVRASERVLLVDLDSQGNATMGSGVDKNGLISSTCEVLLGERSVAESRARAPEGFDLLPGNIDLTAAAIQLMEQSEREQRLKRALSPIRHEYDFILIDCPPALSLLTVNALTAADSVIVPMQCEYYALEGLSALLETIEALRVNLNPRLEIEGVLRTMFDIRNNLANAVSTELTEHFGDKVFRTIVPRNVRLAEAPSYGKSIVGYDGASRGSVAYLGLANEVILRQKNRKKANVVEIN >NZ_CP020870|767313:779496|769502_769808_+|WP_046420589.1|DBSCAN-SWA MTSVAFDTLKFANRLKTAGVPAAHAEAEAEALAEVLETNLQDLATKRDLRELELKLESKIDKGFADVHKGFAEIKGEMLLLKWMFGALVGGVTALIIKAFF >NZ_CP020870|767313:779496|768722_769400_-|WP_060870121.1|DBSCAN-SWA MTTQQLAIDSLPIREQDGLYSLNDFHKASGGAVRHRPSEFLRLDKTKALVVELTNSGEFVSSIKGGAPHLFVRKEKGRAGSTFACRELAIAYASWIAPAFQLKVIRVFLASVVVPATPHVATPANIDQLRRDIYRSLQYVSSRLNIAYEFARRVSEDAGRDRKREYNGQEDVYTLEAITYLIADGKNGLKEAAGSVLSLTPIHRPAPTPTQAATNSPHLLRRYAT >NZ_CP020870|767313:779496|773615_774311_-|WP_046420580.1|DBSCAN-SWA MYDRRIYSSPPTLPAHTFKITGIAFGIGILLFGFFWMTGRRQGTAADTRVAPVAESTQTLPPLPAPLPAKGNVAANMPKAPSDPLPENDLDTNNAAINDTLPEPIPSAPNGTAPPNMSAQAQMPTATASPQNAATTAASSQPTPIPGKMPSPDYPAEAAARGDHGTVTVRILVDVEGAPSGVEMAKSSGSRDLDNAALQTVRNWRFKPARNSNDQPVPGSIEIPFEFNVGQ >NZ_CP020870|767313:779496|778563_779496_+|WP_046420575.1|DBSCAN-SWA MNKLSPPLKKRGLGRGLEALLGSKGGSSVPPTVTEEQLPGEVLRTLQITQLQPSKYQPRREMSEPKLAELADSIKAQGVIQPIIVRELDVDMFEIVAGERRWRASQLAGLTEVPVLVRELDDRTVVAMALIENIQREDLNPLEEAQALQRLIDEFSLTHAEAAEAVGRSRAAVSNLLRLLELPLGIRTLLQLRRLEMGHARALLTLAPELADKLAKEAADQGWSVREVEHRAQQFAAGKVPDIRDKKSKSPASAPAQPDIASLETELSEHLGTKVAINHGRAGKGKLVIHYTDLDVLDGVLERLRARAAD >NZ_CP020870|767313:779496|771988_772285_+|WP_046420585.1|DBSCAN-SWA MSDTCNYRFVDTTQSDETALRESEFLTANEVCELTGYKICAGQRKWLDSEGWCYVTNASGRPIVGRWYARLRVAGESLHMAGAGIRPVKLPNFAALDE >NZ_CP020870|767313:779496|774432_775713_+|WP_010894734.1|tRNA|DBSCAN-SWA MLDPTLLRQQLAKLAECLSTVRGFTLDVAALEALESERKRIQVHTQELQSLRNSKSKAIGQARSKGEDVSSLMAEVAAFSDDLKASEIALEEIRTELEKVALGIPNLPQQDVPLGADERDNVEQARWGVPRTFDFSIKDHVELGACHGWLDAESAAKLSGARFTVLRGPIARLHRALAQCMLDLHVRQHGYEEVNVPIIVNADSLYGTGQLPKFEEDMFSTQLGEHRRYLISTSEISLTNLVRNEIIEADRLPLRMAAHSLCFRSEAGSGGRDTRGMIRQHQFEKVELVSVCKPQESEGEHQRMTRCAETVLEMLGLPYRKILLCTGDMGFAATKTYDLEVWLPSQGMYREISSCSNCGDFQARRMQARWRNSVTGKPELVHTLNGSGVAVGRAMIAVMENYQNADGSITVPEVLRPYMDGLSRIG >NZ_CP020870|767313:779496|767313_768726_-|WP_046420592.1|DBSCAN-SWA MNLRPYQHTMVDFILTHPRCNLFVPMGLGKTVATLTALDVLLLVEDVAPMLVIAPLRVAATTWPDEVAKFPHLRHLRVSVAVGSAAARRRALEQDADIYCINYDNLKWLAEFYKDRWPFRMVVADECSKLKGFRLRQGTRRARALAKHVHTKVERYIGLTGTPAPNGLQDLWALMWMVDRGARLGTHFKAFIDRWFRAVQIGSDPHAVRLVPTPNASKEMQDTIRDLCLSLDPQEYFDLRQPIVNTIRVALPEHAQRLYKAMEQDMFIALECGAEVEAFNAASKTIKCLQLANGALYTDDTRQAWEVVHDAKLEALHDIIEEAAGMPVLVAYHFKSDVARLQRAFPKGRALDKHPDTIRDWNAGNIPVLFAHPASAGHGLNLQDGGNILAFFGHWWAQVRNGVLMRTCTACGGSGTVPISDRKRAAALGRDVSTYCKKWRGVYEWLLDKLRAAEQHAAAELKAALREDAS >NZ_CP020870|767313:779496|769809_770088_-|WP_046420587.1|DBSCAN-SWA MMNIPRERTIERYLVAQVRAKGGEIRKVKWGGRHGAPDRIAMLPEGRTLWVELKAPGQQCTPHQVREHARMRGMGQRVVVVDSLKGVDEVLA >NZ_CP020870|767313:779496|776518_777472_+|WP_046420578.1|DBSCAN-SWA MEALNKIRYACLVCTLMALVTLAGCNLADRSAEPPQSSQQGSEQDGAKKIPLNVEVFNLGERSMFAVSSVLVTGTADAILIDTQFSAADARQLVEKIKASGKRLKAIYISHGDPDYYFGLDSVHAAFPEAKVLATPQTIAHIQASKDAKLKLWAPQLGQDAPKQILIPEPMQGDQLVLEGQELRIIGMDGPTPDRTFVWIPSSRTVAGGISVMGAQHVWMADTQTPQSHIEWLAVLDRIKSMDPKVVVPGHFLTGASLDVRSVTFTADYIRAFDEETAKAKDSAALIVAMKQRYPGLGGEGSLELSAKVAKGEMEWK >NZ_CP020870|767313:779496|772296_773358_+|WP_046420583.1|integrase|DBSCAN-SWA MRPKSSHKDLPPKMLRRTRVLKSGKVWESFYYNGRTTEGRRVEIPLGGDLNEAKRQWAELECCKAPVETEVLGFIFDRYLREVAPTKARATRYQIKSCMTTLRKVFGDVNIHTVTPQQLAQYRDKRARTAPVLANRELSVFSSVWTMAREWGYTNKENQVKGIRKIKEKPRDFYADAAVWNAVYAKACEELKDAMDLAYLTGQRPADVLKMRFTDIRDGSLEVQQNKTKKKLRILLESDGRRTELGKVIDRIKARKRKVVGFSLVSTSKGVGLGSKPLRVRFQRARAAAAKAASDSGEVDLVKRILIFQFRDIRPKAASELPLEHASKLLGHTQQQITQRVYRRVGEIVKPTK >NZ_CP020870|767313:779496|771677_771992_+|WP_060871515.1|DBSCAN-SWA MGRQALHHVPHGRHIALGVVHKLCPVHGNLEHYQQIIERIGPTRQAQAGHDRPVFIHHIVAAGTVDELVMARRESKREVQDLLLEAVKRRETGKPLTSQGAMRP |
12 | Xylella_phage(33.33%) | integrase,tRNA | attL 761696:761710|attR 784680:784694 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1138487 : 1151643
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP020870|1138487:1151643|DBSCAN-SWA TATGTCTCGTCGCTCTTCTTCTTGTGATTCTGCGGTCCATGTTGATTTTTTTGAACGTGAGCGCCGTGCTGCGCTTAAGAAGGCTGCTTATTTATATGAGACGCGTCGCGATCGTGTGAATCCGTCTTATGCGTTCCCCGCTGCGTCCGGTGAAAAGGTGTTAGGCCCGAACAGTAATACGGGCCAAAAGGGTGTTGTTGGGTCCTATCCTGTTTCTATTGATTATTTGACGGTTGTTTTTAGTTATGCCCGTTTGGCAGAGGCGGGTTATTTTGATGAGCCTCGTTTTCTTCTTTACCTGTTATTTGGTTTGAGCGTTGATGATGTCATCGTGGGTTCTCATACGTCTGTGCGTTGGCATTTTTATAATTCAAGTGCTTCTATTATAGATTCTAATGGTGATCTAGTTGGCAAGATTGGTTGGGATGGGAATGCGGATTCGTATTGTATTAGTTTGACGGGTTCGGCTTGCCGTTATATTCATGATTGGTCAAAGGTGAAACGTTCATTGGCTTCTTTGGATGCGCGTATTACTCGTTGTGATGTGGCTTATGATGATTATGACGGTAAGTTGGGGACGGTTCGTCATCATGAGGCGCTTGCGCGTGAGCATTTAGCTCCTGCCGGTGGCTGTCTGTTGTTTTCTTCTGGTGGGACTCCTCCGCGTACGCGTTTTTTAGATGATCATGGGGGCGGGTCTGGGTGTACGTTGTATGTGGGGCAACGTGGGCATAAGCAATTGTGTATTTATGAGAAAGGCAAGCAGCTTGGTGTGGCTGAGTCTCCTTGGGTACGTTATGAGGTGCGTTTATATGCCAAGCATGCGGTGATTCCTTTTGATTTATTGGTGGAACCCATGCGTTATTTGCGTGGTTCTTATGATTATTTGTGCCAGTTATTTTCTTCTGTTGTTGCTTCTCCTGTGAGTCGTATTCAGACGGTGGTAAAGCATGTGGAGGCGACAGGTGAGGCGTTGGTTCGCTGGCTGCGTCGTCAGGTCGGGCCTGCGTTAGGGGTGTTGCGTCAAGCGTTGGGGTGTGGTTTTTCTGATTTTATTGTTGATCGTGTGGAGCGTCAGGGCTTGCCTTCTCGTTTTCGGCGTATTTGTAGGGGAGGGGATTTGCCTGCGTATTTGCGGGAGACGTTAGCGGATTGTCCTGTGGGCGTGTGTGTCTAGGTCGTTGTAGATTTCTTTATATTTTATTGAATGATTAGGTGATTTATGTCTATTGTGAGAGTGAAAGATAATATTGTTATTGAACGTTCCGTAAATACTAAGACAGCAGGTTTGCAGATTTTTCGGGAACAGCGTGCTGCTGTGGTGATGGGCGGTGCGTATGAAACTGTATTTAGTTTGAAGTTGGGTTCAGGTTCTGTGTATCCCCCGGGTGAATATTTAATTCATCCTGATTCTTATGGAACGGATGATTATGGGAACTTGCTGTTAAGGCGTCTTAAGTTGATTTCTTTATCTTCTGCATTAAAGGAGTTTGCTAGTAAGGAGCCTGTTTCTGTTGTTTCTTCTAAGGTCACTTGATTAGTGCCGTGTTATCGTTTTTTGTTTTAGTTATTTTTTTGTTTTTGTGTTGGTGTGTAGAGTTTTTTTTGTTGTTTTTTGTTGATTATGTTTTTTTTCTTGTTTTGGGAATGTTTGTTAAGTCATGGCCCTCTGCGTCTCTTTAAAAGCTGATGGTACCTTGGTAACTACAGGGCAAGGTGTGTCTGACTGTACGGGGTATGTATTGGTGAGTGGGTCAGAGTATGGGGTTTATCAGTCTGTGCAACGTGTGTTTGAGGTGCCTGACATGAAAACTGTTATTCACACCTCTACGACTGTGGCTTTTACTGTGATTGGTTGGTATGTCGCGGCGCGTATTATCGGCACCGTTGCGACATTTTTTGATAGCCGATAATCAATAAACGAGGTGATGTATGGCTGATATTTTATCTGGACTTGATGTTAAATCGGCGGCGGCGGCTCTCATTGGTGCTGCTGCTTTAATTGCATTGGTTGGTTTTACTAAATGGGGTGCGAAAAAAGTCGCGGGTTTCTTCGGTTGAAATATCTCGAGCGACACTTTTAGTGTCGCTCTTCACCGGTGGGTGGGGAGGGCTGCACTGTGATGATTACTTTATTCTCTTGTTTTCTTGGGGCTTTGTGTGGCTGGGCCGCTGTTAAGGGGCTGGATGCTTCATGAGTATTTTTCGCATACTCGTTTTTTTTGTGATTCTTTTTATTTCTCGTTTTAGTTTTGCTTGCGAGATCGGTGAGCCGCATTGGGATCCTAATCAGTGTTTAGATAGGGGGGAGGCTTATGCAATTGCTAGTGCCAGTTATCAGCTGTGGCGTTCTAATGAATTGAAGAATAGTAATATTCCTGGTTTGCAAGTGGTTGATTGTCCTATGACTGATAATGGTCATGTGATTGGTTTCGGTGGTTATAGTACTGCGCCTGGTCATCCTTCTTCTCAAAGTTGTGATAGTAGTTTGGTCTATTTTCAGAGGGTCTATCCTGAGGGGAAAACTTGTCTTACGCGTTCTCCTAAATCACTTCTTGGTTTAACGCTTCCTTCTGGTGTACGTGTTCCGTCTACGGCTTGTTATGATGGTTGTTCTTATGATTTGGATCGTTCTCACGGAATTATTGGTGTAGGACAGGATGACGGCAAGGTTAGGTATGTTTTACCTGGTATGACGCCTAATGGGAATTTGTGTAGTGTTTCTCCGTCCGGTGGTTCTTCTTCTGAGTCTAGTCAGGAGCCGCCTCCTGTTCAGGATGTTGTTAAGGATGAATGTACACGGATGGGGACATTAACGCAGTGTGTGAGGCAGGACGGTAAATATTGTGCTACTTCATCGACGGGGCATCAGTATTGTTGGAAGCCAGGTGAGGTGGGCACTCAGATTGCTTCTGATGGTAATCATGCAGCAACATTGAATAAGATTGATGTTCCTGTGATTGCTCCTGTGGACGCGCCTAAGGATAAGGGGGATTGGCGTGTAGATGGTAAGGGCACATCTACTCAGATTATTAATAATACTTATAACAATTATAATACAACTACATTTGCTTCTACTGGTTCTTCTGGCGGTTCTAGTGGTGGTTCTAGTGGTGGTTCTAGTGGTTCGGGCAGTGGCGGCGGTGGTGATAAAACAGGTGGTGATAAGGATACCCCCGGTAGCGGTTCGCCTTCGGGGACCGGTGATCTTTATAAACGTAATGGTAAGACATTAGACGCTGTCGTATCAGGATATCAGGCTAAAGTTAATGACCTTCCTTTTATTTCTGGTATTTCTTCCTTTTTAACTATTTCTGCATCTGGGGAATGCCCTGTATTCACGTTGTCTGCTTCTGCGTATTGGCCAGAAATGACATTTGATTATCATTGCAGCGGTGTTTTTTTGAGTTTTTTGCGGTCTGCTGGTTATATTATTTTTGCGATTGCTTCTTATTATGCCGTTCGTATTGCGACCTTGCGTTAGGTGGGAGGAATCATGTTTATATTAAGGGTTGGGTGGTTAACTGATCTGACGCAGTGGATTTGGGATTTAATCACTAAGATGTTTCTTGCTTTAGCTGATTTTGTCTCTGATGTTTTTGTGTCGTTTTGTGATTTTTGTTTTTCATTGATTTTATTTGTTGTGGGTGTATTGCCTTGGCCCGATTTTCTTAAGCAGCAAACGATAGGGGACATGTTAGGTAAGGCTGGTGGTACGGTCGTGTGGTTTGCTGATGTATTTCAGTTATCTAATTCTATGCGCGTGATTAGTGCGGCAATTGTTTTTTCTATATTTAGGCGGTTGTTGACTTTGGGAATTTGGTGATGTTGCTCTTTAATGAGGGTGTGCCTCGTTCTGGAAAAAGTTACGACGCAGTGAAGCATCATATTCTTCCTGCTCTTCGTGAGGGGCGTCGTGTTTATGCACGTTTGAATGGGTTGCGTTATGAGTTGATTGCTCAGTATTTGGGGGTGTCAGAGACTCGTATTCGTGAACTTCTTTTTGTTGTTAATACGGATGATGTATTAAATACATTCGTTTGCTATCGTGATGAAGTTGACGGCAAATGGTGTATTGAAGATCGTTTTAAGGATGTGTTAATCGTTATTGATGAGGTGCATGAGTTTTATGTTGAATCTCGTGCACCGTTAGCGCCTCAGATAGAGAATTTCTGGGCGTTATTGGGTCAGAATGGCGGTGATGCAGTGCTGATGACGCAGTGGATCAAACGCATGCATCCTGCCATCCGTGCACGTATTGAACGTAAGCATAGTTTTCAGAAATTGACGGTTGTTGGTCTCAAGAACCGTTATCGTGTGACGTATTACCATACTGTGGCGGCAGGTAAGTTTGAAAAGGTAGGAAGTCAGACGTTTAAGTATGATGCGTCTATTTTTCCTCTTTATGATGGGTATGCTCCAGGGGCACGCAATACGGAGGTGTATTCTCAGGGTAAGAGAACAGTCTGGGCAGTGATGTTAATAAAGGCTATTTTTTTCTTGGCAGTTGGTGTTGTGGGGTTTCATTTTTATTCTCGTTATTTTGGTAGTGCTGGTCTTTCTGCCCATGTTGTTTCTAGTGCTTCCTCTTCAGGTGTAGGGCAGGTGTTTAAGCCTGGTCAGGTTGTTTCTGGTTCTGTTCATCAAGACGTTTCTGCTTCTGCTCCTGTGATGCCTCCTGTTGATCCTTTGTCAGATTTGGGGCCTGAGCAACGTTATATATTTGATCTTAATGCCAAGGGTCGGTTACGTTTGGCTGCCCTTGCGCAAGTGGGGCATGAGTATCGTGCTTGGGTGCAATGGATTAATACGGAAAATTTGGTGATTGAGCAGCTTGATTTAGAGCAGTTGCGTGCTTTGGGCTTTGATGTCTCTGTGCACTCTTATGGCGTGCGTATTTCTGTCCTTAGTCATGTTTTGGTGGCGACAGCATGGCCTTGGCGGGAGCCAGTTCGTGAGACGGACCCGCGTTTATATAATTTATCTCGTGATCAGCAGGGCGCAGGGAGCATTGCGAGCCCCGCGAGTGATGCGAGCGGAGCCCCTGCCGTTCAGGGAGGTATGATAGAGAAGGGAGAGCGTGCTATGGGAACGTTTCCGGAATCACCTGGCTATGAGCATCATGATGAGTCGGGGCGTGGTTCTTCGTTTTTTCGTTAGTTTTTTTTTGGTCTTTGTGAAGAGTTTTAATTCACATTTTTCGTTCCAGTTATGATTCATTCATAGTTAATTTATTTTATGTTCATAATTGAACGTGTGGGGTAGGGCAGGCTGATTTTTTATTGATACATTTCTTAGATTAGTGAAATAGTTTGATAAATAAGGTTGCTGCTGTCGCTCCTGCTGCGATAAGACCGCTGCCCACTACTACTGGGTACCATTTTGATTCTTTTGAAACTTTTAGCGTTTCCTCTACCAGTTTTTGGGTGCGTGCGTTTATTTCATTTATATGTGCGTTCATCTCTTGGGTGTGTGCCGTCACTTCGTGTATTTCAGCTTGTATCTTTGATGTTTCTAGAATTAATTTTGCAATTTGTATTTCCTGCTCTTTGCTGCGTTCATCTTGCGTCATTGTCATTTTTCCTGTTGTTTTCTTTGCATAGTGTAGAGCAGTAACTGGGGGTGTAGGGGGCTAGCCCCCTACGGAGACGCTTTACGCTTTTGTTGGCGTTGTATCAGCACTTGCCTTAATTGGATGATCGACGCGGTCCCATCGGCTGTTTGTCTTCTTTTACGCAAAACCGTTTTTTGAGGTGCGTTAGTGATGCGCTGCGGATCGGACGCGGCAGAGCCGCCTTGTGGAGCTGCCGCTCGCGCTTCTTCCATCATTTTTTGCCATTCTTGCGCTAGTGAGCAGGTCAGTGATAGCCATCTGAGCTGCCATTCTTCAATGGCTCTTCTTTCTGGTGTGACTAGCTTTCCATGGATAAAAGCGAATCCTGTCCAGTTTCCTGTCAGTATTTGATTTGGAATTTCATCGTTACTCATTATGAGCGGACTAGTTTTTTATTGATACATTGAGATTCTTCTATGTTGCCTGTAAGCATTTGATTTGCATCATTATTGTTACTCATTGTTCGTGGACTAGTTTTTTATTGATCAGTGCGAGTGGCTTGTTTATTAGTTCCTTTTCTAGTCTATACAGCAGTTCTCGCACCCACCTTCTCCACGTGGACATAATAGGCATTATGTGAAATTACATTGCTGAAATTTTTATTTAGTGCGTGTGTTTGAGGTGTGGGAGGGGAGGTTATACCTACTCCTACTAGCAACGTTGCCGCTGTTGCGGTTAGCCGTTCTAGCATTGATTTCCAGATTCGTTGTTCTGTTGTTGTGTCTGCTTTTTCTTCTCGTACTTTTGCGAACATCGTTGCGTCTGCATTTGCTATTTCAATAAGTAATGCCAGATGTTCGTCAGTAATCCTTAGTTCGTTTTTTCGCCATTTCGATATTGTTTGTCGCGATACCTTTAGTGTTTTTGCTACTTTTGCATCGCTATTGGCCGAGCATTTTTCTCTTGCTTTGTCAAGTAATTTATTTAGCGTATTCATGTCGCTTGCTTGTTGACAACTATGGTGCTTGTTAGTTTACACTCCTTATTGTCGCTTGATGAGTGACATCGCCTCTCCATTCATCCTAACCTGGATGGGGAGGTGTTCTAAGACAGGGTAGGGCACGGTAGGAGTGTTGAGATGATAAATGTAGAAACGCGTAATCCACATGACGGTATGTCTTTTAAGTCTTTGGGTGATCAAAGCGAATCATCTGCGGCTTCTGAGGTAGCTGTGATTGATCAAGGTCATTCTTGTATATCTTCTCAGAGGGTCGGTGTTTTGCTTGTGGGTATTGAAGCGGCAAGATCGGAATTGCTTTCGGCTTTTAAACGTTATCAAAAATTAGGACTTGCGGGTTTTGGTATATCAACGTTTGAAGAGAGACGTGAAATTTCTGATTGTATTCAAGATGCTTATAATAGATTGGCGCTTTTTGTTAAGCGTGCGGAACGTATAAGCCATGCGCAGGTGTGACATGTCTAAACTATCTGTTTTTGGTGTTTATCCTTCTTTTACTTCTCAGTATCTTATTTGTTTAACTTCATTTATTTCTGATCTTCAGCATTATATTGATTCTATTGATTGGTCTTTATCGAAGATTTTTACTGATGCTTCTGATGCTTCGGATGAAATTACATTGGAAACAGTTGAATCTATATTTCAGTCTTTAGGTGAGATTCTTTTTGAATTGCGTTTTTTAGAAGAACGTTTATCTCATCTTTCTCGGGCGTAGGGTATGCGTATTACCGATACTGAACGAGGCGCGCGCATGGCATTAAATATTGCTGAGATGTACGTTCATCAATTGGATTTGTGGCCAGATAGTTTACCTCAGCAGTTTGATTTTTGGTTGTCTGTGCGTGCTGCGGCATTGGATCAACTGGACGAGTGTTATTTATTAAGGCGATCATTATTATGATTGATGTCTTTGTTCTTTTCTTAGTAGTTATTGCCGGTATTCTTAATCTCCATGCTACACGCTTTGTGACTAGAGAACGTGAAGTGCATGTTTTTGTTTTTATCTCTGCACTTTATTCACTTATTCTAGCTTCTATTCTTTTGTGTTTTAAGCTTTTTGGAATTTCTCGGTTTTAATAATGATTGATTTATTTGATTTATGGAACTCCCCAAAGCTTAGTTTTGAATGTCTCTTTTTAGTTAGGTTGGCTATTGCTGTCTTTATTACTATTCTTTCTGGGGTTCTTTCATGCTTTTTCATGTTTCTAATTGTTTGGTTTTGCTTTTCTGCCTTTGGTTGGGTCTTAATTCTGATGTTGTTCTTTGGATTCATTCACTCCGCTATGAGATACATTCATGACGAGAGGTATTTTTGAGTTTTTATTGTGAGGCTTGTGTGTTTTCTTGATCTGTTATTGCTGTCGTTCTTAAGTCGCAGTCATCGCTGCGGTTGATGCTGTCTCTAGTTTTTATGTGGAGTTATTCTATGTCTCGTCGCTCTTCTTCTTGTGATTCTGCGGTCCATGTTGATTTTTTTGAACGTGAGCGCCGTGCTGCGCTTAAGAAGGCTGCTTATTTATATGAGACGCGTCGCGATCGTGTGAATCCGTCTTATGCGTTCCCCGCTGCGTCCGGTGAAAAGGTGTTAGGCCCGAACAGTAATACGGGCCAAAAGGGTGTTGTTGGGTCCTATCCTGTTTCTATTGATTATTTGACGGTTGTTTTTAGTTATGCCCGTTTGGCAGAGGCGGGTTATTTTGATGAGCCTCGTTTTCTTCTTTACCTGTTATTTGGTTTGAGCGTTGATGATGTCATCGTGGGTTCTCATACGTCTGTGCGTTGGCATTTTTATAATTCAAGTGCTTCTATTATAGATTCTAATGGTGATCTAGTTGGCAAGATTGGTTGGGATGGGAATGCGGATTCGTATTGTATTAGTTTGACGGGTTCGGCTTGCCGTTATATTCATGATTGGTCAAAGGTGAAACGTTCATTGGCTTCTTTGGATGCGCGTATTACTCGTTGTGATGTGGCTTATGATGATTATGACGGTAAGTTGGGGACGGTTCGTCATCATGAGGCGCTTGCGCGTGAGCATTTAGCTCCTGCCGGTGGCTGTCTGTTGTTTTCTTCTGGTGGGACTCCTCCGCGTACGCGTTTTTTAGATGATCATGGGGGCGGGTCTGGGTGTACGTTGTATGTGGGGCAACGTGGGCATAAGCAATTGTGTATTTATGAGAAAGGCAAGCAGCTTGGTGTGGCTGAGTCTCCTTGGGTACGTTATGAGGTGCGTTTATATGCCAAGCATGCGGTGATTCCTTTTGATTTATTGGTGGAACCCATGCGTTATTTGCGTGGTTCTTATGATTATTTGTGCCAGTTATTTTCTTCTGTTGTTGCTTCTCCTGTGAGTCGTATTCAGACGGTGGTAAAGCATGTGGAGGCGACAGGTGAGGCGTTGGTTCGCTGGCTGCGTCGTCAGGTCGGGCCTGCGTTAGGGGTGTTGCGTCAAGCGTTGGGGTGTGGTTTTTCTGATTTTATTGTTGATCGTGTGGAGCGTCAGGGCTTGCCTTCTCGTTTTCGGCGTATTTGTAGGGGAGGGGATTTGCCTGCGTATTTGCGGGAGACGTTAGCGGATTGTCCTGTGGGCGTGTGTGTCTAGGTCGTTGTAGATTTCTTTATATTTTATTGAATGATTAGGTGATTTATGTCTATTGTGAGAGTGAAAGATAATATTGTTATTGAACGTTCCGTAAATACTAAGACAGCAGGTTTGCAGATTTTTCGGGAACAGCGTGCTGCTGTGGTGATGGGCGGTGCGTATGAAACTGTATTTAGTTTGAAGTTGGGTTCAGGTTCTGTGTATCCCCCGGGTGAATATTTAATTCATCCTGATTCTTATGGAACGGATGATTATGGGAACTTGCTGTTAAGGCGTCTTAAGTTGATTTCTTTATCTTCTGCATTAAAGGAGTTTGCTAGTAAGGAGCCTGTTTCTGTTGTTTCTTCTAAGGTCACTTGATTAGTGCCGTGTTATCGTTTTTTGTTTTAGTTATTTTTTTGTTTTTGTGTTGGTGTGTAGAGTTTTTTTTGTTGTTTTTTGTTGATTATGTTTTTTTTCTTGTTTTGGGAATGTTTGTTAAGTCATGGCCCTCTGCGTCTCTTTAAAAGCTGATGGTACCTTGGTAACTACAGGGCAAGGTGTGTCTGACTGTACGGGGTATGTATTGGTGAGTGGGTCAGAGTATGGGGTTTATCAGTCTGTGCAACGTGTGTTTGAGGTGCCTGACATGAAAACTGTTATTCACACCTCTACGACTGTGGCTTTTACTGTGATTGGTTGGTATGTCGCGGCGCGTATTATCGGCACCGTTGCGACATTTTTTGATAGCCGATAATCAATAAACGAGGTGATGTATGGCTGATATTTTATCTGGACTTGATGTTAAATCGGCGGCGGCGGCTCTCATTGGTGCTGCTGCTTTAATTGCATTGGTTGGTTTTACTAAATGGGGTGCGAAAAAAGTCGCGGGTTTCTTCGGTTGAAATATCTCGAGCGACACTTTTAGTGTCGCTCTTCACCGGTGGGTGGGGAGGGCTGCACTGTGATGATTACTTTATTCTCTTGTTTTCTTGGGGCTTTGTGTGGCTGGGCCGCTGTTAAGGGGCTGGATGCTTCATGAGTATTTTTCGCATACTCGTTTTTTTTGTGATTCTTTTTATTTCTCGTTTTAGTTTTGCTTGCGAGATCGGTGAGCCGCATTGGGATCCTAATCAGTGTTTAGATAGGGGGGAGGCTTATGCAATTGCTAGTGCCAGTTATCAGCTGTGGCGTTCTAATGAATTGAAGAATAGTAATATTCCTGGTTTGCAAGTGGTTGATTGTCCTATGACTGATAATGGTCATGTGATTGGTTTCGGTGGTTATAGTACTGCGCCTGGTCATCCTTCTTCTCAAAGTTGTGATAGTAGTTTGGTCTATTTTCAGAGGGTCTATCCTGAGGGGAAAACTTGTCTTACGCGTTCTCCTAAATCACTTCTTGGTTTAACGCTTCCTTCTGGTGTACGTGTTCCGTCTACGGCTTGTTATGATGGTTGTTCTTATGATTTGGATCGTTCTCACGGAATTATTGGTGTAGGACAGGATGACGGCAAGGTTAGGTATGTTTTACCTGGTATGACGCCTAATGGGAATTTGTGTAGTGTTTCTCCGTCCGGTGGTTCTTCTTCTGAGTCTAGTCAGGAGCCGCCTCCTGTTCAGGATGTTGTTAAGGATGAATGTACACGGATGGGGACATTAACGCAGTGTGTGAGGCAGGACGGTAAATATTGTGCTACTTCATCGACGGGGCATCAGTATTGTTGGAAGCCAGGTGAGGTGGGCACTCAGATTGCTTCTGATGGTAATCATGCAGCAACATTGAATAAGATTGATGTTCCTGTGATTGCTCCTGTGGACGCGCCTAAGGATAAGGGGGATTGGCGTGTAGATGGTAAGGGCACATCTACTCAGATTATTAATAATACTTATAACAATTATAATACAACTACATTTGCTTCTACTGGTTCTTCTGGCGGTTCTAGTGGTGGTTCTAGTGGTGGTTCTAGTGGTTCGGGCAGTGGCGGCGGTGGTGATAAAACAGGTGGTGATAAGGATACCCCCGGTAGCGGTTCGCCTTCGGGGACCGGTGATCTTTATAAACGTAATGGTAAGACATTAGACGCTGTCGTATCAGGATATCAGGCTAAAGTTAATGACCTTCCTTTTATTTCTGGTATTTCTTCCTTTTTAACTATTTCTGCATCTGGGGAATGCCCTGTATTCACGTTGTCTGCTTCTGCGTATTGGCCAGAAATGACATTTGATTATCATTGCAGCGGTGTTTTTTTGAGTTTTTTGCGGTCTGCTGGTTATATTATTTTTGCGATTGCTTCTTATTATGCCGTTCGTATTGCGACCTTGCGTTAGGTGGGAGGAATCATGTTTATATTAAGGGTTGGGTGGTTAACTGATCTGACGCAGTGGATTTGGGATTTAATCACTAAGATGTTTCTTGCTTTAGCTGATTTTGTCTCTGATGTTTTTGTGTCGTTTTGTGATTTTTGTTTTTCATTGATTTTATTTGTTGTGGGTGTATTGCCTTGGCCCGATTTTCTTAAGCAGCAAACGATAGGGGACATGTTAGGTAAGGCTGGTGGTACGGTCGTGTGGTTTGCTGATGTATTTCAGTTATCTAATTCTATGCGCGTGATTAGTGCGGCAATTGTTTTTTCTATATTTAGGCGGTTGTTGACTTTGGGAATTTGGTGATGTTGCTCTTTAATGAGGGTGTGCCTCGTTCTGGAAAAAGTTACGACGCAGTGAAGCATCATATTCTTCCTGCTCTTCGTGAGGGGCGTCGTGTTTATGCACGTTTGAATGGGTTGCGTTATGAGTTGATTGCTCAGTATTTGGGGGTGTCAGAGACTCGTATTCGTGAACTTCTTTTTGTTGTTAATACGGATGATGTATTAAATACATTCGTTTGCTATCGTGATGAAGTTGACGGCAAATGGTGTATTGAAGATCGTTTTAAGGATGTGTTAATCGTTATTGATGAGGTGCATGAGTTTTATGTTGAATCTCGTGCACCGTTAGCGCCTCAGATAGAGAATTTCTGGGCGTTATTGGGTCAGAATGGCGGTGATGCAGTGCTGATGACGCAGTGGATCAAACGCATGCATCCTGCCATCCGTGCACGTATTGAACGTAAGCATAGTTTTCAGAAATTGACGGTTGTTGGTCTCAAGAACCGTTATCGTGTGACGTATTACCATACTGTGGCGGCAGGTAAGTTTGAAAAGGTAGGAAGTCAGACGTTTAAGTATGATGCGTCTATTTTTCCTCTTTATGATGGGTATGCTCCAGGGGCACGCAATACGGAGGTGTATTCTCAGGGTAAGAGAACAGTCTGGGCAGTGATGTTAATAAAGGCTATTTTTTTCTTGGCAGTTGGTGTTGTGGGGTTTCATTTTTATTCTCGTTATTTTGGTAGTGCTGGTCTTTCTGCCCATGTTGTTTCTAGTGCTTCCTCTTCAGGTGTAGGGCAGGTGTTTAAGCCTGGTCAGGTTGTTTCTGGTTCTGTTCATCAAGACGTTTCTGCTTCTGCTCCTGTGATGCCTCCTGTTGATCCTTTGTCAGATTTGGGGCCTGAGCAACGTTATATATTTGATCTTAATGCCAAGGGTCGGTTACGTTTGGCTGCCCTTGCGCAAGTGGGGCATGAGTATCGTGCTTGGGTGCAATGGATTAATACGGAAAATTTGGTGATTGAGCAGCTTGATTTAGAGCAGTTGCGTGCTTTGGGCTTTGATGTCTCTGTGCACTCTTATGGCGTGCGTATTTCTGTCCTTAGTCATGTTTTGGTGGCGACAGCATGGCCTTGGCGGGAGCCAGTTCGTGAGACGGACCCGCGTTTATATAATTTATCTCGTGATCAGCAGGGCGCAGGGAGCATTGCGAGCCCCGCGAGTGATGCGAGCGGAGCCCCTGCCGTTCAGGGAGGTATGATAGAGAAGGGAGAGCGTGCTATGGGAACGTTTCCGGAATCACCTGGCTATGAGCATCGTGATGATTTAGGGCGTGGTTCTTCGTTTTTTCGTTAG
Protein sequences of DBSCAN-SWA_2 >NZ_CP020870|1138487:1151643|1145776_1145959_+|WP_046419283.1|DBSCAN-SWA MRITDTERGARMALNIAEMYVHQLDLWPDSLPQQFDFWLSVRAAALDQLDECYLLRRSLL >NZ_CP020870|1138487:1151643|1145515_1145773_+|WP_081089932.1|DBSCAN-SWA MSKLSVFGVYPSFTSQYLICLTSFISDLQHYIDSIDWSLSKIFTDASDASDEITLETVESIFQSLGEILFELRFLEERLSHLSRA >NZ_CP020870|1138487:1151643|1140680_1141970_+|WP_085808104.1|DBSCAN-SWA MSIFRILVFFVILFISRFSFACEIGEPHWDPNQCLDRGEAYAIASASYQLWRSNELKNSNIPGLQVVDCPMTDNGHVIGFGGYSTAPGHPSSQSCDSSLVYFQRVYPEGKTCLTRSPKSLLGLTLPSGVRVPSTACYDGCSYDLDRSHGIIGVGQDDGKVRYVLPGMTPNGNLCSVSPSGGSSSESSQEPPPVQDVVKDECTRMGTLTQCVRQDGKYCATSSTGHQYCWKPGEVGTQIASDGNHAATLNKIDVPVIAPVDAPKDKGDWRVDGKGTSTQIINNTYNNYNTTTFASTGSSGGSSGGSSGGSSGSGSGGGGDKTGGDKDTPGSGSPSGTGDLYKRNGKTLDAVVSGYQAKVNDLPFISGISSFLTISASGECPVFTLSASAYWPEMTFDYHCSGVFLSFLRSAGYIIFAIASYYAVRIATLR >NZ_CP020870|1138487:1151643|1149979_1150309_+|WP_085808098.1|coat|DBSCAN-SWA MFILRVGWLTDLTQWIWDLITKMFLALADFVSDVFVSFCDFCFSLILFVVGVLPWPDFLKQQTIGDMLGKAGGTVVWFADVFQLSNSMRVISAAIVFSIFRRLLTLGIW >NZ_CP020870|1138487:1151643|1140418_1140547_+|WP_011097859.1|DBSCAN-SWA MADILSGLDVKSAAAALIGAAALIALVGFTKWGAKKVAGFFG >NZ_CP020870|1138487:1151643|1141982_1142312_+|WP_085808098.1|coat|DBSCAN-SWA MFILRVGWLTDLTQWIWDLITKMFLALADFVSDVFVSFCDFCFSLILFVVGVLPWPDFLKQQTIGDMLGKAGGTVVWFADVFQLSNSMRVISAAIVFSIFRRLLTLGIW >NZ_CP020870|1138487:1151643|1138487_1139663_+|WP_046419278.1|DBSCAN-SWA MSRRSSSCDSAVHVDFFERERRAALKKAAYLYETRRDRVNPSYAFPAASGEKVLGPNSNTGQKGVVGSYPVSIDYLTVVFSYARLAEAGYFDEPRFLLYLLFGLSVDDVIVGSHTSVRWHFYNSSASIIDSNGDLVGKIGWDGNADSYCISLTGSACRYIHDWSKVKRSLASLDARITRCDVAYDDYDGKLGTVRHHEALAREHLAPAGGCLLFSSGGTPPRTRFLDDHGGGSGCTLYVGQRGHKQLCIYEKGKQLGVAESPWVRYEVRLYAKHAVIPFDLLVEPMRYLRGSYDYLCQLFSSVVASPVSRIQTVVKHVEATGEALVRWLRRQVGPALGVLRQALGCGFSDFIVDRVERQGLPSRFRRICRGGDLPAYLRETLADCPVGVCV >NZ_CP020870|1138487:1151643|1139708_1140023_+|WP_046419276.1|DBSCAN-SWA MSIVRVKDNIVIERSVNTKTAGLQIFREQRAAVVMGGAYETVFSLKLGSGSVYPPGEYLIHPDSYGTDDYGNLLLRRLKLISLSSALKEFASKEPVSVVSSKVT >NZ_CP020870|1138487:1151643|1142311_1143646_+|WP_085808099.1|DBSCAN-SWA MLLFNEGVPRSGKSYDAVKHHILPALREGRRVYARLNGLRYELIAQYLGVSETRIRELLFVVNTDDVLNTFVCYRDEVDGKWCIEDRFKDVLIVIDEVHEFYVESRAPLAPQIENFWALLGQNGGDAVLMTQWIKRMHPAIRARIERKHSFQKLTVVGLKNRYRVTYYHTVAAGKFEKVGSQTFKYDASIFPLYDGYAPGARNTEVYSQGKRTVWAVMLIKAIFFLAVGVVGFHFYSRYFGSAGLSAHVVSSASSSGVGQVFKPGQVVSGSVHQDVSASAPVMPPVDPLSDLGPEQRYIFDLNAKGRLRLAALAQVGHEYRAWVQWINTENLVIEQLDLEQLRALGFDVSVHSYGVRISVLSHVLVATAWPWREPVRETDPRLYNLSRDQQGAGSIASPASDASGAPAVQGGMIEKGERAMGTFPESPGYEHHDESGRGSSFFR >NZ_CP020870|1138487:1151643|1145178_1145514_+|WP_046420105.1|DBSCAN-SWA MINVETRNPHDGMSFKSLGDQSESSAASEVAVIDQGHSCISSQRVGVLLVGIEAARSELLSAFKRYQKLGLAGFGISTFEERREISDCIQDAYNRLALFVKRAERISHAQV >NZ_CP020870|1138487:1151643|1150308_1151643_+|WP_085808105.1|DBSCAN-SWA MLLFNEGVPRSGKSYDAVKHHILPALREGRRVYARLNGLRYELIAQYLGVSETRIRELLFVVNTDDVLNTFVCYRDEVDGKWCIEDRFKDVLIVIDEVHEFYVESRAPLAPQIENFWALLGQNGGDAVLMTQWIKRMHPAIRARIERKHSFQKLTVVGLKNRYRVTYYHTVAAGKFEKVGSQTFKYDASIFPLYDGYAPGARNTEVYSQGKRTVWAVMLIKAIFFLAVGVVGFHFYSRYFGSAGLSAHVVSSASSSGVGQVFKPGQVVSGSVHQDVSASAPVMPPVDPLSDLGPEQRYIFDLNAKGRLRLAALAQVGHEYRAWVQWINTENLVIEQLDLEQLRALGFDVSVHSYGVRISVLSHVLVATAWPWREPVRETDPRLYNLSRDQQGAGSIASPASDASGAPAVQGGMIEKGERAMGTFPESPGYEHRDDLGRGSSFFR >NZ_CP020870|1138487:1151643|1148144_1148396_+|WP_046419274.1|DBSCAN-SWA MALCVSLKADGTLVTTGQGVSDCTGYVLVSGSEYGVYQSVQRVFEVPDMKTVIHTSTTVAFTVIGWYVAARIIGTVATFFDSR >NZ_CP020870|1138487:1151643|1146484_1147660_+|WP_046419278.1|DBSCAN-SWA MSRRSSSCDSAVHVDFFERERRAALKKAAYLYETRRDRVNPSYAFPAASGEKVLGPNSNTGQKGVVGSYPVSIDYLTVVFSYARLAEAGYFDEPRFLLYLLFGLSVDDVIVGSHTSVRWHFYNSSASIIDSNGDLVGKIGWDGNADSYCISLTGSACRYIHDWSKVKRSLASLDARITRCDVAYDDYDGKLGTVRHHEALAREHLAPAGGCLLFSSGGTPPRTRFLDDHGGGSGCTLYVGQRGHKQLCIYEKGKQLGVAESPWVRYEVRLYAKHAVIPFDLLVEPMRYLRGSYDYLCQLFSSVVASPVSRIQTVVKHVEATGEALVRWLRRQVGPALGVLRQALGCGFSDFIVDRVERQGLPSRFRRICRGGDLPAYLRETLADCPVGVCV >NZ_CP020870|1138487:1151643|1144623_1145037_-|WP_060870210.1|DBSCAN-SWA MNTLNKLLDKAREKCSANSDAKVAKTLKVSRQTISKWRKNELRITDEHLALLIEIANADATMFAKVREEKADTTTEQRIWKSMLERLTATAATLLVGVGITSPPTPQTHALNKNFSNVISHNAYYVHVEKVGARTAV >NZ_CP020870|1138487:1151643|1147705_1148020_+|WP_046419276.1|DBSCAN-SWA MSIVRVKDNIVIERSVNTKTAGLQIFREQRAAVVMGGAYETVFSLKLGSGSVYPPGEYLIHPDSYGTDDYGNLLLRRLKLISLSSALKEFASKEPVSVVSSKVT >NZ_CP020870|1138487:1151643|1140147_1140399_+|WP_046419274.1|DBSCAN-SWA MALCVSLKADGTLVTTGQGVSDCTGYVLVSGSEYGVYQSVQRVFEVPDMKTVIHTSTTVAFTVIGWYVAARIIGTVATFFDSR >NZ_CP020870|1138487:1151643|1148677_1149967_+|WP_085808104.1|DBSCAN-SWA MSIFRILVFFVILFISRFSFACEIGEPHWDPNQCLDRGEAYAIASASYQLWRSNELKNSNIPGLQVVDCPMTDNGHVIGFGGYSTAPGHPSSQSCDSSLVYFQRVYPEGKTCLTRSPKSLLGLTLPSGVRVPSTACYDGCSYDLDRSHGIIGVGQDDGKVRYVLPGMTPNGNLCSVSPSGGSSSESSQEPPPVQDVVKDECTRMGTLTQCVRQDGKYCATSSTGHQYCWKPGEVGTQIASDGNHAATLNKIDVPVIAPVDAPKDKGDWRVDGKGTSTQIINNTYNNYNTTTFASTGSSGGSSGGSSGGSSGSGSGGGGDKTGGDKDTPGSGSPSGTGDLYKRNGKTLDAVVSGYQAKVNDLPFISGISSFLTISASGECPVFTLSASAYWPEMTFDYHCSGVFLSFLRSAGYIIFAIASYYAVRIATLR >NZ_CP020870|1138487:1151643|1144126_1144474_-|WP_060871500.1|DBSCAN-SWA MSNDEIPNQILTGNWTGFAFIHGKLVTPERRAIEEWQLRWLSLTCSLAQEWQKMMEEARAAAPQGGSAASDPQRITNAPQKTVLRKRRQTADGTASIIQLRQVLIQRQQKRKASP >NZ_CP020870|1138487:1151643|1143785_1144064_-|WP_057683733.1|DBSCAN-SWA MTMTQDERSKEQEIQIAKLILETSKIQAEIHEVTAHTQEMNAHINEINARTQKLVEETLKVSKESKWYPVVVGSGLIAAGATAATLFIKLFH >NZ_CP020870|1138487:1151643|1148415_1148544_+|WP_011097859.1|DBSCAN-SWA MADILSGLDVKSAAAALIGAAALIALVGFTKWGAKKVAGFFG |
20 | Stenotrophomonas_phage(62.5%) | coat | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1186421 : 1207194
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP020870|1186421:1207194|DBSCAN-SWA ATTACGACATTCGGACATAATCGGGAGCACCGGAAAGTATGCATGGCGCTTCGCCGTCCACTAAGTTAGGCACGGTGGTTTGCCCGTCGGTTCCGAACCGCCATCCCGACACCAAACCATTGATAGACGTTAAATCCTTGTGCGCGAAGAAATCAGTGATCTCGATATTAGAAAGTGCCCTGTTAAATACACCAAGGCTATGCAGATAGCCATTGAAATAACGATCGGGCGACCCCGCGTTATATCGCCGCGCAATGGTAAACGGTGGCACAGGGGTTAACGTGTTTGATGGCGTATCGCTCGCCGCACCAGAGTCCGTCCACGGTGTGGTGTGCGTAGCGGCATAGCAACGCCCCGCATCGTATTTGCCTGCGTGAGCAGTCCAACGCCCCGCCGTGGCAGAACCTGCGCCCCCGACCCCTCCGTATGGGTTGGACCCATCCGCATGTGACTCATAATAGGTGAATGTGGTTTCGTCGGAGACAGCGCCGATACGTAGTTGCGGCGTTCCTAATGCGCCGCCGCTGAATACCGCTTGACGTGTGCCGGTAGAGCCAAAGCCGCCGAGTGGGTTGTAAAGCGCCAGAATAGTGTGCGGCGTACCCCCGATGACCGATGAGGGAAGCGATGCGATATCACCCCATTGGTTTTTACCGTTAAACGCAACGCAAATAGCGGTCGTGTAGCTGGCTTGTCCGGTGATGCTTGCTGTGATGACGTCCGCTTCGCTGCTGCTCAGGAATACAAGGGCGGCGGCTTCGCCCTGTTGCAGCACAAACGCGCCTGCCGACCCGTTAAGAGCCATGCCCGCCCCTGCTGAGAGAGTGAGGGGGCCGCTGGATTGATTGGAAAGCACGATAGCCATCCCAGACGATACCAGCAGCCCAGCGGGGACAGTCACCGTCTTGGCTCCGCTATACGTGAATCTCAAATAGCTCCCTACTTGGGATTGTGTCAGGGTATTTGCGGCTGATGCGATGTTTTCAATGCGCATACCGCCTGTGCTGCCGCCCCCACCACCGGAGGGGATATCGGCGTACACATGTTTATTCGCGGAGGCATCCCACACCAATGCTTTTTTGTCTGCTATCCCTGAGGCGTCCACATCGGCAAGCGTGCCCACCTTCAGCAAGACGCTCCCTGTGTTGCCATTCACTGAAACCACCGGAACGGGGTGCCAGCCTTTTACGCCTGTGGCGTCGGTGCCGTAGTAGGACGTGCTTCCCGGGGAGCGTGCATCCCCCTCAAGGTTCAGACTCACGAAGCCGCTGGAAAGGGTGCCCGCCACATTGATTGAGCCTGTTGCAAGCAGTTGCGCATTGGGCGCGCTGCTACTGCCATTGGGGGGTGCGGTATGGAGAGCCGCCTGCAACGCGTCTATCTGCTGCTGTAACTGCGTTGCGGCATCGCTTTTGGCTAACGAGGACAGGTAGGTGCGCCAGCTTCTTGTCACCAGTCCGGAGGCGTCGAGGAAAGGCTCGTTATTGTGAGGAATGGTGGTCATGCGACTCCCTGCCCGAAGGCGATCCAGTCAAAAGGGATGGGGGAGATGATCGGATCGGTGTTTTTCTCTTTACTGTCTGCGGTCACGAAGTTAAACGTTGCGTTGGTCGTTGATGTACCGCTCACGCTTTCTGCAACGAGGCTGCTGGCTGTTGCAAGTGCTGCGGTGACGCTGGCTTTTACGAAATAGGGAGTATTCGCAAATGGCTTTGGGAACGTCACCAGTTTTAAGGTGGCCGCTTTGCCGCTCGCAGGGGCGGTATCGCGGCCCCACTGGATCATAAAATTCCCAATCTTCAACATATCCGTGCTGACGGTATACTGCGAGTCTGGCGGGCGTGGGAGGCTTTGCCAAAGCAGATTCTCGCCATCAGTGCCAAGTACCTTACCGCCCATCCCGACAGGGTCAGGGACTTCCCGAATTGTTGACCACAGCAATATCGCCCCGTTATTGGTCAGGAATTTGCTTGAATCCAATGCGGGGATGGTTAGGCCTCCGCCGCCAGGGATGGATACCCCGTCAGCCTCACCCTGTTTTGCACCGAGGCTGTCAAAGAGTTCGACGAAATAGGACCCTTTCCCCCAGATATCCACGTCGGGGCGACCTGAGCTATCTAGCCGAACCTCAACACCGTTGTTCACGGCCAATCCACTATCGCCGTACACAGGGCGCGGGGTTGTCGTGCCTGCGTCATAGAACTTAAGCTGCCCTGCGGCCAGTAATTGCCCTGTCATTCCATAAAAGGTGTTCAGGCGTGAGAATAAGCGGAAGGCGGTCATGGGATTACGTCCGAAAACAATTTAAGGGGACTGGAATGGCGTCATTAGTGAGTTTTTTAGCGGGCTTTTTATTGGGGCGTGCTGGGGTGTTAGGGTACGTTCTATGGGCGGGCCTACTTCTCTTGGTGTTGTGCCTGCTTTTATGGGCGGTGATCTATGCGGCGGGCGCGCTTTTATGGGGGGCTTGCTTGCTTGTCGGGCACGTTTGCTCGCTTTTCGAGCCGGTGGTCGAGTTCATCAAACGGATAACTCCACCTTGGTTGGCACGTGTGTTGTGGGGGCAATCTGTGGCGGGCGTACTTTTATGGGGGCTTTGCTTGCTTTTCAAGCCTCTTGCCGCGTTCATCAACCGGATAACACCATCTTCGTTGGCACGTGTGTTGTGGGGGGCGGGGAAAGAAAACGACAAGATTGATAGACCCGTGGCCTCCTAAGGCATCTCCTCTTGCAGAAGCTGCTTGCCGCCCGCCATAGTGGTGATCTGCCCCGCACGGCCTAGCCCTTTACGCAGTGACCCCCACCCTCCTTGAGCCGCCAGCAGATTGGCAAAGATATCTGGGTTGCCTGCATTACGCGCAAGCCAGTTCCCTATGAACGGCGAGTTCAATGCGCGCCCTGCCGTCGCACCTGTTGCCAATCCGGTAAGGACAGGCACAATGCCGCCCACCAGAGCGCCCCCGCCGCCGCCAAAAAGAGCGCCGACACCGCCACCGCCAAGAGCACTCGCCGAAATCATACGCCCTGTCGTCCCTGAATCTGGGATGGGGTCCTTTAGTAGGTTTTGACCCACCTTGGCCAACTCGCGGAATTCCTGCGTTGCCTTCGGGCCGCGAGCGTTGACCGCACTCCATAGCGCCGCTGGGGCGATATCGGCCTTTGCGCCGGGCTGGCGAGTGAGCAGCTTCTCCAATGTTTTGAAGTTGTTATATTTCTGGTTCGTTTGTTGCAGTAACGCCATATCAGAGGGGTTGATGGAGCTTTCAGCATCGCCGATCAGTTCCTTACGTAGCCGCCCTACATAGTTTCCGACCGATGTTCCAGGCTGCACCTGGGCTAGGGTACGCGCGAGCCGCTGGTAGTTCCTGCCGCTGATCGTACCTTCTTGTCCTGCTTGCGCAATGTCATCCAGAATGCGCCCGAACTGGTTTTCGACTACTTTCCCGCCATCGGTGCCGAGATCGCGGTAGGCGTTCTGCACAATCTGATGCATTCGTGTGGCTGTTTCGGGGGAGATCGTGACATCGTTACGGTTCCAGATCGTGTCGTAGGCGTCGCCTAATGCCTGTTTCTGTCTGGCGATCCAGCTGTCATCCAGCCGATCCGTCGCATCGCCTACGTGGCGTGTGAGTGCGCGATTCCAGGCGTTCTGCTGGTTACGCGCGGCGGCATCGGCTCCGCTGAACGGGAGGTACTTTGCCATGCTGGCCATCGTGCGTGCCGGTGTGGACTGGGCCACCTGAGAAATATGCAGCGGGATGCCTTCACGTTGAGCGATGGTAATATCTGCTTGGAGGAGCGGATCGCCACGACGTGCTAAACCACCCGCCGCTACTCTGGCAGCGCCCAGTGCCCCACGCGCTACAACCCCGCCCAGCGCGCCATAGCCTGCATTGGCCAGCCGGTTTTCCCCTGTACGGGTTTCCCCTAATGCGCCATAGCCCGCCCCTTCCGCCGCCGCCACACCGGCCTTGGCAGCAAGCTGTGCCGCCTTGCCCGCTTGCCCAAGCCGCCCCAGCACCGCCGCTTCCGGTCCGCCCACGATGGATGTAGCCACATACGGCAATACGCGGCCTATAAATCCAGAGACACCGTTAACCCCTTCCTGCATCGGCGCATCGGCATCAATACGCTGTTGCAGGGCATGCCCCCTAGCTGAGTCTTTAGGGGTGACCAACTGCTGAAGGCCGCGCCCGATGCGGTTCATTTCCGCCCCAGCGGCGATGAAGGGGCGTTGATACCAGGGAGAGTCGTCATATACCTGACGCGCAATCGCCGCTTGATCGACGGGGGGTGCGGGGGCAGGTGCCTTTTTCTTGGGCGTAGCGACTTCGTCTGCGTAATCATTCCACGGCCCGTCGGGCGCGGCGGTTTCGCCTGCAAAATCATTCCAGGGTCCGTTAGGGTCATCGGGCGTCTGATTTTTCTTGCGAGGAGCGCTCATTCCGCTTTCTCCCAGCTATCAGGTTTATTGCGGAGGCCCCCCAGGTAACGGTAACCGCGTATCACATCTCCAATTTTAAGTGCAGGCATTTTTGCGGACGCTCCCATGTTTGCGGGTGTTTTTGCAGGAGCGGGCACCCCTGTTTGCGGAGGGGGTATGGAGGCGGCCTTGTCCGCCATTTTTTTAATGGTACTCAGGCTTTCGTTGATCTCGTCGTAAGAAATCCCAGGTTTCAGCGTGGCAATTGAGCTTTTAAGCAATTCCAGTTCTTTCTCGCTCAGCGCCCCGAAACCTGAGGCACCCTGAGGAGAGAGTGCCTTTAAGCGCGCCATCGTGGTCAACGCCACCTGTCCGGCGATGTTATCCAGTTGCGCTTGCGCGTCAGTGGCACGCGTATATGGAAGTTTCGACAGGCCCCCACCTATGAGCGTACCCAGTTGCTTGTAGCCTTTTGACGTCTGCAACTTGTCGATTGCATCGCTCAGGCTTTGCGCGCTCCCCACGGCTTCGTTGTAGCGCGCCATCGCCGCTTGGTGCACCTGATTGTTTTTCATATCCAGCGCCGCTTGCTTCGTGGCGGCAACATCCCTGCGATTTTCACTGTCAAGCGCTAGGCGTTCTCGGCTCAGCCCTAGTTGCGCCTGCTGGTACGGCGTGATCATCTGGGAGGTGTCGGGCTTGGGTTCGATCCCTGGAACCGGCGTGAATACCGGCGTGCCATCGGGAGCCATTTTGACGAACCCTGCCCGTGCCGCGTCGTATTGCGGTTTTTCCTGTAAGGGCGCTTCGGCCACCAGCGCGTTATTACTGTCATACCGACGTGCCCCAGGAGCAAGCGTGTACGGCTGCGCGCTTGGGCCTCTTTGCAGCATGGCCAGATGCGCTTTAGCGGTTTGATCCAGTGAGACAGGGTCGTACTGGTCCGGCAGCTGGAGCCCAGCGGCCCGGGACTGGGGGGCGATAAAGGTATCGTAAAAGTGTTGCCGCTGCTCGGGGGGCGCGTTGGCCGTGGACAGCCACGCACTGGCCAACCCTGCGGCGGCTTTCTGTTGCCGGTCCTGCGTGGCTTGTGCGTCCTGCTGCAACTGCTGCTGCACTTTATAGCCCGCCATCGGATCAACGCCATTTAAGGCGCGCATGTAATCATCGCGCTGGGCGCTATCGGAGGCGGTCAGGGTTTGACCGGCAAGGCGGTTAATGGCACGCAATCGGCCTTCGTCGAAACCGCTTTTAATGCCCTGGTAAACTTCTAGTGGATTCGGCATGGCTTAACCTCCCGTATTCCATTGGTGCTGTCTGTAGGGATCGCTGATGTAAGGGTTGCGTGTATAGGGGTCCTCCGTATAGGCGTTCCCAAAAGACGCCCCATTGGCGATGTGACGTTGCCTCCCTTCGTACAGGTTCCCAAGAGCGTTACCAACGCCTATCAGCATGTTCGTGGTGTTGTCGGCGCTTCTGTTAGCGGCCCAGCCGCGCGCCTGAGCCGCATTGTTGAGCTGGTTGCCCATGTTGGCAGCGTAGTTCTGCCCGAAACCGGCCAGATGACTAGAGGCGGCTTGTCCTGCATTAGACATCCCTGCTAATCGGTTCCAGTAGTTGTTAAGGTTTTGCATCGCCAAACCCTGAGCGTAGTTCACAAGGTCCGCCTGATGTCCGCCGGAGTACAGCGAGCCACGTGCGGCGGCGCTGCGATCCACGCCCTGCAACCCCTGCTGCAACGCGTACGCGTAATCAGGGGAGTTCTGGAAGCCGGAATAGTCGCCATGTAGTACGGCCTGCTGTCCGGCCAGCGCATTTTGCCCCGCCGTCAACCAAGGCATCTGGTCCTGCCGTGTCTGGTTGTATTGGCGTTGCTGTTCGGCGATCGCCGCCTGACTGGCCTGTGTTTGTGCATCCGCCGCCCGATGGGCGGAACGGTTGGAGATAATACTACCGAGGATAGAACCCGCCGCAGGAATAAGAGAAGCCCAAGGCATTACGATTGCTCCAGAAAGAAGGATGAGGGGCTAGGCAGGCCCTGCTGAGGCTTGCAGCGACATTGCCAGCAGATGAGCGACTACAGGATCAGTGATACGGATGTCAAACACCCACTGCCGCCCTTGCCCAAGGCGGTAACGTCTGAGGCGCTTCTGAAACGCGCCCACGTCGCCAAGATCGCGTGCTACCCAGGCCGACCAGTTGTGGCCGCCGTCTTTGCTGTAGCGGAGCATCACTTTTCGGCTCATGTTGGGCAGTCCAGGTGGAATTGCCACGCCGTCCCCGCCAGCGGTGAGTACACCAGAACCTCGGCAATCGTATCGGCGGTGGTCTTTTGGAACGTGGCTGTTTCGTAGTTTTTGTTGGCAAACTGGCCTTCTGGGGTGGTGTCGCCCCCCGGCCTCAGGGTAATGGTCTCTGGGGGTAGCCCCCTGTTTTTAAGGTCCGCATCCAGTTGTGCCTGATAACTGGCGTCGCCGTGGTAGCCCGTATCAATGACCTTCTGCCCGCCGATCCACACCTGGAACTTGTCAGGGTTGGAGGCGGTGGCGTAGGCCAGCGTCACAAGCCCCGTTTGGCTTCCAAGCTGCACATGCACGCTATTCGGGAACGCTTGCCCTCCGCTATAGCTGGTGGAGGTACCGCAAAGCACCTCGGCTGTCGTGATTGTCATCGTGTCGCTCACGCAGGCCCACAGCCCGTTCGCGTCGGTCACGCGAATCGTAAAGGGGTAAGCGCGCTTGGCTCCGGCAACCAGCCCCGCCACCTCCACGGTGCCTGACAACAGGCCTTTAGAATCAAGGCTCAACCCTTCGGGGAGCGCACCACTCACTACACGTACTGCCACAATGGGCGTTGCACCAGGTTTCATGGTGTAGGCGTAGCGGTACGCCTGTGTGTTCACCGCATCTGGTGCGTTGCCGCTGAGCGTAGGCCCTGCCGGTTGTGGAGGGTTGAGATCGCCAGGGCCTCGATACCCGCCCAATGCATCTGTACCGAACACAAGTTCTACCGCATCCACGGTGAGCCGGTTCTGATGGTCGTGCAGAACGCCATTCACACGGCGGCGCTCAATCACTTGGCCGTGTTCCCACGGCATCCCCCAATCAAGGGTGTATAGCTTGCCGTTTGCAAAATCGCCAGCCACCCAGTGGGCGGCCCACCGTACGCAGGCGTTCATCCTCCAACGGCTGATCCCGAAGGACTCGCGGCGATGCCATTCACGGGTTGTTATATCAAACCCCCAGGTCATCCCATCAGGGAAAGTCAGGTAGTACACCTGATGCCCACGGTCATCAAAGGTGAACGCAAATGCTTCCTCATGGTTGCACGCGGTGATGGCCTGCTCCAGCGGCGGTGTGCTGATTCGTACCGGTTGATAGCCGTCTAGCCGATACACACGGCCATCATGCCCCAGCCAGAACACCGTATTGCCCATCTGTTGGATGGTGTGCTGGGAGGCGCAGCCTGTTTGCATTTCAGTGCCTGCATGGCGTTGAAAAGTACCTTCCGATGCGCCGCTGTTGTAGAAGAACTCCCCGGAGCGTTTGCCCAGCACGAAAACAGTGCGATGGATCACGATCAATCCGACAATGCGGTCCGGCTGGCTTTCGGCTTCGTAGCGGTCCAGGGCACTGTAGCTGGTGGCGTCGGCCAGGGCGGAATGAAACCAGTACCTGCCCGAAGGCTCAACACCCACAATGTAGCTATCCACGTAATCGCACGCTTTAAGCCCTAGGAACCCTTCACCGGTGATCTGTTCGGCGAGTAATTCTGTATAGGTGTTGTAGACGTAGCCGGAGGTTCCGTTACCGATCACTAACTGGTTCCCTCCGGCAATCTGGTTGTGCGCCATGCAGACACGCTCAACGCCTGGGATGGTCCCGCGAGGGATGGCCACCCCCGCTGTGGTGATTTGCCATAGTGTCGTACCGATCACTGCGAACAGCTTGTCTTCCACATCATGCAATCCGCGCACCGGAGCAGGGTTAGCGGCTTCAGGGACACAGAACACCGCCGCCCCAGGAGCGCAGCGCAACAGGGAGGACGAACGTCCTCCACCACGTTCGGCGGCTTCTGGAATCCAGTTCACCGTATCTTGCACTGTCCAGGCGCGTGTTTCGTCACTATAAGCGCCTCCGGTCACAGGGGCTTCACGCCATCGGGCGCTCATCCGTCGTACCCGTCACGACCATAGCGCCGTTGGCTTTCGGCAGCAGGCAGGGCGTATTGCATGCGAACGCCGCTGGCATGCACCGTATCGCTGAGTAGCATCGCCCTGCCCCGTTCGGCTGCGTTGAGCACGTCTTGCTCCAGCACAACACCGTAACCGGCACGCAAGCGCACCGCCAGGTTATAGCCAATGGCCTCCTCCGCTTCGGCGGGCGCTGGCAGGAGGTCGTCGGGGCTGGCGACTTCTGACCAACCTAGCGCAATCCCGTTCGCCTCCCAACGGCGCATCATCAGGTTGAGCGTACGTATCGCGCGGGTTGCATCTTCAGCTTCTACCGCTTCGTTGGCATCCAGTACGCGCAAATGCCCGAACGCATCGCGGATGATCTCTGCCACCGTGGTCATGGGCGTTCCTGTAAAGGGGATTACTGCGTCACGCGGCACGCATGGTCGGGACGGACAGGGGCAGGCGTGCCAAACAACACATCTACGCGGGTATGTTCCATATCGTTTTTACCATCGCCAAAGGTCATCACACGGACGCTGATGTTTTTTATGCTTGCCGTATAACCTTCACACGAGGCCAACACCGGAAGGGGGGCGAACGCGGTGGCGAATGCATCTCGGTGGAAGACAAGGTTCTGAACGGCGGGCACCAACGGTTGACCGAAAAAGGTCATCGCTGCGCGATCTCTTGGGGCTTTGTCTACGGTACCAATCCCCTCTGAGGTTGTAGGGGTGATTGCTGGGTAGATGGAAAGCGTGCTGCCGCCTGCGATGTAACCCTCGGTGACTAGAAACTGTCTCAACTTGCCCGTAGTCGCGCCGGTGATGGGATGCACCTCAAAGACATCATCAAAGGTGATGATGGACCCCTTGGTGATATCGCCTTTTCCGGCTTTGCCATCCATTGAAATAGACGAGCCAATCTGGCCTGCGCCGCTGACCACATAGCCCGCTCCTAACCCGTTGGTATGCGTGGGCAGTGATAACTGTTTGTAAAACGTAATGCCCGCAAATTTGCCGACCGCATTTTTACTGAACTCACCGCGCAGTTCTTCAGAGGTATGGAACAGCGACACATTCGCCTCGGCCAGCGCGTCATTGGCATCGGTGGAGAAGTGCGCGCAGCGGTCCTCTTCCGGGGCCAGATGGCGATCCAGGGTCGATGCCGCAGAACGCCACGGGGTGCGCGCATCGGAGACCGTGCCCAACGTGCCAACTATGTTTGGCGTTTGTTGGTACATGGACGCCAGCAGGACCGCATTCACTTTGCTGGAGAGTGAGGTCATGGCCGGACGTAAAAAGCGCTTGCTGAAGTCCGTCAGGTCCAGCTTCTTTTCTTTCGCCGTAAAAGTCAGCGGGACATGGTATTGCTGATCCAGTTTCAAGGTAACGTAACTCTCGCTGATGGGCGGTGCCGATAACGGAGCGGATGTAGCCCCAGAAGCGGCAAAGACGGAGCCGCTATAGGTCACCGGAACCGGAGGGACCATAATTTTTACGGTATCTCCTTTTTTATAGCCGTTGGTCTCCTCCCCAAATTCCTTGGAGCGTTCGGTATTGATGTTAGTAACAACGTTGTTCTGCTCAACAAGCATCTTGGCCGCTTCACGGGCGATCATCTGATGGGTGAGTGCCTGAGTACCCATAGGGTGTGCTCCTAAAATGGAAAGTGTGGGGTTGGTGTGCCGTTTTGCTTAGCGTTTGCGCCGCTTTTCCACGTCGCGCTTGTACCAGTCGTCATCGGTAAGCTTCTCTGGAGGAATCTCCGTGGGGGAGCGACCTGATACCGACGGGGGCGGGGGCGGAGCGTTGCTGATCGGTTTGCCCGGTGAGGGGGCTGAAGTGAAGGGATCGTGTGTCTGAGGGGTGGCCATGCGTGCCGCCAGCCGTTCCACAGCAGCAGGCAGTAAGTCTTCACGTACCGAAGCCAGGGACCATAACGCATCGTCCTGAGTGGCCAGGTGGTAGGCGATCTCAGGGCCTTTCTCATGCTGGATCACAGCGGCCTGCACGGCAGGACTCAGCAGGGAAAGATCCATTGAGCCGACCGTTTCGTAAAAATCAGGGTGCGCATTCACAAACTCTGCGGCACGTGCTTCATAACGCGCCTGGGCACTGTGCTGCTGGCGGGCGGTTTCGGCCTGCTGTTGCTCCTGCTGCCACTGCTGGAACAGATGGCTAAAACGCGCATCGACCCAGCCGTTCAGGTTGTAGGAGTAGTCTTCAAGCTCAGGCGCACCTTCCTGCCCTGTCTGCGGCGTGTGGCTGGAACGCGTAGGGCCGCTTTGCTGCTGCCGCTCCAGCGCCTCAAGCCGACGGCGTAGCGCGTTGTTTTCACCGTTGATCCGCTGAATGTATTCGCGGGTACGGTTGCCCTGTTTCTTTTTCTCTTCGGCCTGCGTGTCCGCGTCGCCCATGTCGCCGGGCGGTTCACCGGTGGGGTCCGGTGTGGGGTGTTGCTGCTGTTCTTGCAGCGCCGCTTGCGCGTCGTTTGTGGGCGGTGTTACCGCCTCAGAACTCTCCGTAGCGGTGTTGGTATCGTCGCTCATCTCATCCTCTCGGGGTCGGCCTGACCGGGCCAATGCGGACGCGGCTGTGAGCAGCCGGTATCACGCCGTGAGGCGCAGGGGCTTCAGGCATGAAAAAACCGCCTTGCGGCGGTGTGTTGTTGGGGGCGTTAGGAAGCCACCCAGGATCGACGGCTGGCGCGCTTAAGGCATGCGCAGCCGCCAATGTCTCAATATGCTGTTGTGCGGCGTCGGCCCGTTGATGTTGCGCTCGGGCATCGGACAATTCCGCATCCGCCGATAGTTTCTTCACGCGGGCTAGCTGTACGGGGTCCGGAGCGGGCGGTTCTGGAGGCGGCTCGCCATCTTTGGGTGGGAGCAAGCCTTGGGCGGCAAGTATCTTATGAAAAGCAGCTACGACTTCCTCCCCGCCGACCAGGTCCATATTCCGCATTCCGGCGTAGGCGGCTACGGTGGCAATTTGCGGCGCAACACCGCCCATCTGGGCCGCTAATTGCATCATGGCATCAGCCGCTTCCATGCGTTGCGTGGCGTAGCTTGGGCCCACCGTGACCACCACATCATATTTGCCCTGACGGATATCATTCAGGGTGACCGTGTGGCCGGTCGTCGGGTCGGTCACCTGTTGGTACAACTGTTTCCACTTCTCGCCGCCATCCTCGCCCAGTACACGCACCGCACGCGGCGTATCGTAGATACGAGGAATCATGTCTACGAGAATTTCATAGGTGTAGCGTACCGCGTAAGCCAGATTATCAATGTAGTTAAACGTGGCCACCGCGCCCTGCATTTTGCGGCTGTTGATCGCAATCCCGCTGGTTTCATTACTGCGTGCGCCTAGGCTCGCGTCGTAGATCCCCGTGGCGGCTTTCACATCGTCGTTATCCATGCCCGCCAGTCGAATTAAAGCGTCAGGGACCTGGGCCTGCTCGACACGCACAGGGATACGCCCGTTATCAACGATATTCGCCAACAGATAGGGGAAGTCTTCAGAATGTGAGTCATTCCACATCTGCACATGACCCTCTATCATTTTGGGATCAACGATGAAAGGCGCTTTAGGGGATTTGGCGACCGCTTCAACAAGCGCTGTTCGATGCACGTTATGTAGGCGCTGCTGGTCCTTACCAAAACGCACCATGCCCGACCAGTAATCACTGCCATCAATATTCTCGATATTCCCCCATACCGGAACGATGGGGATAAATTGGCAAGGGAATTCGTAAGGTTCTGTCAGCCAGGTGTGCCCATTGGTCAGCCGCATCAGCACGCGGTGACCCTCAATGGTGCGTGTACGTACGATCTGCACACCCGCTGCCTCTAAAAACGTTTTCGCCTCCTCCACACCCAACCCAGCCTGCGCGGCAATCTCGTCGGCAAACACCACGCGGCCATCCGACAAGGCCAGCAATTCCCGTTTTCTAGGGTCTTTCCACCAGTATTCGGCAATGCGCACCTGCCCGGCATCACGCCACGCACCGCACTGTGTATCTGCGTCAAAGTCAGACACATCGGCGTCCGGAAAGCGACGCTCAAAATCGGTTCTCGGGATCAACTCCTCGACAAACGCAAAGTTTGCGTCACGCCGATCAATCTCAACGGCGGCAGGGTCAAATTTCACCGCAAACGGATTGCGTACCGCCTTGATGCGAATATCCTGCTCAAAATCATCCTCATTGAGATAATCCGTCATCACGCGCAGCACGCCAAAACCACCCTTGACCGCTTTCTCGTACGCAATGTCATAGGCGTGATCGGCATTGGATACGCTTTCAATATTGCGGCAAATCCCCTGCATGATTTCAGCCAGCCCACGGTCCGATTCTTCCACGCCTCGCACTTTGCAAGAGGGGCGTTGTTGGCGCATCTCGTTGATCACCTGCTGGGTATGCATACGCAGCTTAGGGAATTCATACGTCTGACGGTGTCTGCGGCGTTTCTTCAGCGACTCATCCCACTGGTTCCCAGGGACTGTGACAAACTTAATATCATCACGCGCCTGGTCGTAAAGATCGCGGCAGCAATCACTGGCGAGCTGGTAGCGCGAGCGCATCTGGGCCAGCTCATCCGTGGTGTTTTTCTGTGTGTGGGGCATCTTCAGTAGTCCACCGAGTAATCGTAAACATTAAATGTAGCTACGTGCGGTGGCTTCGCGTACCGCCGCATCATCATCGCGTACCGCGTCGCGCTGAGCAGATCATCGTGGTGTTTGACGATCCGCCCGTCTTCACGGTGGTAGAGCCTAAATTCTTCAAACCATTCCGTCAGGTGGCTAAACACCTTCAAACGTCCGGTGTGCATACGGTCGAGCATCTCCGTGACGCCTGCTTCAAGGCCATTGGTACCGTCTGGGAAGGTCGCCCGTCCCCCAAGCATGGAGAGCCCCTGCTGCCGGTATTGTTCGGCCAACTGTTCACCGCTGCCCTTGTCGTGTTGCAACCCATCGTGCGGCCACGCCCACGGCAAGCGTGCACCCCAGGGCCTTAATGCCGCCGTGTGAATGACAGGGGTCGCTTCACGCTGACGGTATGCACACATCACGTAAATCACATCCGCTTCACGATCCCAGGCCATTTTGACCGCAGCGAACGGGTGGTCATATCCAAAGTCCATCCCGCCAATCAACGCCCATTCTTCAGGAATCGCGAACGGCGCGATGGCAATCGAGTCCTCCGCGATAGGAAACACGCGGCCACTACCCAGTGAAGGGGTGCCCTTGGTGCGCGCCTCGCGCTCATGGGCTGGGTAGCTGGCAATGATGCGCGCCCGATCCTCAGGGCTGTAATGTTCGGCGTCATCAATCGTCATCTGCACCAGCCCCCTGTCAGGGGCTTCTTCCAGCAGAAACCGCCGTACCACGCTGGACATGCCCTTCAGCGGTGTGAACGTCATAAACACTGGGCCAAAGGTCCGATTGGTCCGGGTGATCCCCTCAAAATACACATCTTCGGGTGGCTCCTCATCAAACCACACCCAATCGACCGTATCGGCTTGCCATTTCTCACGGCCCTGATCAAAGGATTTTAGCGAGATCGAACTGCGCTCCCCAGACACATGCCGCACGTACACCGTATCGACCAGTTCAGGCACGCCACGCGCCCAGGTCACGCCCTCAATACATTCTCCAGGAATCGCGCCTGTCCCCATCTCCGTTTTAGGATCGCGTCCTAGCAAAATGCGCTGTACGCCACGGCGTGTTAGTTCTCCAGTTTCTGAACCAGCCAGTCCGTGATTGGACCTTTCAAAACGTTTACCCTCCCACCACTGCGGGTAGCGGCCTGTGAGATGCATCGCCACCTCATGCCCAGCGCACAGTGTCTTTCCTGACTGGTTGGCCGCCGCCAACAACCGTTCACGCGTGCCTGCACCCATCGCATGAAAAGCGCGTTGCTTGGGATAGGGGCTGTATTCGGCCAGACGGTTAGTGCGGCGGCGCCGCGCTTTCTCCTCTAACAACAAGGCTAATACCTGCTTGGGTGGCATATGCTGTAATGGCGGCATCCAATTCATCATCGCCCGCCTCTTTCAACTCCAGTTCACCACTGACGCGGGCCTCCGTGGGGATCATCCGCGCCGCTAGTTTGTAGAAATCTGTTTTGTTATCCCGCGCCCAGGCGACCAAGGCAGGCACACCGCCGAGCTGGTCAAACGCGTCCAGAAAGGCTTGTTTAATCGCCGCCGTGTTCCTGTTCCTGCTGCCCATCGCGCGGCCCTTGGGGTTGCCAGACTGGCCTTTCGCCCACGCCATTACTGCACCCTCACCCAAATACGATGGACACGGCAAACGCCATCACTGAGCGTTGTACGTGCAAAAATCACACCTTCACCGCAACGCTGGCCGGTCATCCAGACATCAAAACGGCGCGCACCTACATCCCCCGCAACTGCATCGGCGGCACACTGGCGGCTGGTATCCAACGTCATTGAGGTGATAGAAACACTTTTCGGCATCGCACCCCCCATCTCTACTGTGCATTTCATCCGTTCGCCCTCCACCATGCGCAATACATGCGTGCGCACTGTGTCGTAAGCGCTTGCATAAAAAACCCCGTTGCGGCTCATCAGTGCGCCTCTACCGCGCATCCAGCAGCAATCACCGCATCACGATCGGCTTTCCACGCGTTCCATAAGCTTGTCATCACAGCGGCGTCCGTGGCGGTGGCTGCAAAAAAACGTCCTGCGCCGTCTGTACGCCTTCGGGCGTCGGCAGCGGCGGCATGGGCGGCAGCACTACCGGAGGGGGCGGGCTGCACACAACCGCTCCATTCCTGACGCAACCGAACATGCCCAGCACGCAGGGCAGCAGCAAGCTCACTACGAAGCTGCACCGCATCGGCTTTGGCTTGCTTGAGCGCCTCATCTACGTTCTCTCTGTTAACTTTTAGCTGCTGGCTTGCGGCATCGGTTTTAGCCTTCATGGCCCGCTCCGCCGTTGTGATCTGATTGCGCAACGTCTGGTACTGGCTGTCAGCCTCGGCGAACTTCGCCTTCCACTCAGAAGCGCCTGCGCGATAACCCATCCCATACGGCACACGTAGCAACAGGAAAAGGCACAACAAGCACGCCAGCGCGCCAAGCACCTTAAGGGGTTGCAACACGATCAGCTCCACAATATGCGGCCTTCACATAGCGCCTGCTCATCGTCACGGCGCAACGTCAACCCGCGTATCTCACGTCCGCCAGCGTGCTTCCAACGCGCAAGCTCGGCACAGGCCCCCGGCCAGTCGTTCGCCAGCGCCTTACGTTGCAACGTGCTCCCACACACCACCTTGGGGCCAATGTTGAACGTGGCCGACACAAGCGCCGCCTCTACGTGAGGAAACATCGGAACCGTAATGCAACGGCGCACGTAGCCGCTGGCCTGGAGCATGTCCCTTTGCAACAACGCCTCGCACTCGGCCTTGGAGTAAGTTTTCCCGTGAACCACGTCTGCTCCAGTGTGCCCGTAGCACACCGTCCATACGCCCACGATATCCTTGTAAGGGCGGTATTCAACGCCCTCCCACTTCGCAATCATAGGGGCCGCTAACCCAAGCACGACCGCCAGCCCAAACGCCATTCGACGCCCGTTAGAAGCACTATTCGGAGCCGTCGGCATCGCCTTGCTCATCGCATTTATCGGTTTTAGCGTCCCGCTTCCAGCGCCATATCAGATAGGCCGCTTGCAAAAGGACGTAGAACAGCGTCGCCAGCGTCACCAGCTTATCGGCTGTCAAAAATGCAACGCCCGCAGGCGGAGCGCTTTTGACAACAGCAAAGCCTGCATCATGAAAAAAATTACTTCTCAAAACACTGTCCCTCAGGGATCGCTCCACAACAAAAAAGCCCTGCTGGGTAGGCAGGGCGTGGGGAAGATAGCAATGAGGGCATCGCACCACTCAGGCGCGTACAGTAGGGGTGAAAGTGCGGAAGCATCAACTCCGCACTACGCAGCTTCTCTCTGCAACGCCTCTTGAAGCTCCCTAGCCGCTTGGCGTTCAGCACTACGCATCTTGTTAAGTAACCACTCGTACACGCCACACCACACCCTGCGGTAGGAAGCCTCATCACGGCCAATCGCAGCCGCCCGACGGCGATCACTCACCGGCAGCATGCCACTGCCACCACAGGCCGCGCATACCTTCACCAACGCCCCTACACGCCGTTCCCCCCGACCATGACAGCAGGGGCATAACTGCGGCGTGGACAGCTCACCCACCACCGCCGCAACCAGTGCCGGTAACATCTCCAACGTTGCCTGTGGCCATAGCTGCGCCTTGGCCTCCTCCAGCCGCTCCTCCGCACGCCTGAGCGCCGCCTGCTGTGCGCTTGTCGTCGCTCGGGTCCACCCCATGCACGCCTTGGCTATGCCCACGTCTGTACGCGCTTCCAGCAAGCGCTGCTGCTGCCGTCGAATCTCCGGCGTCACCAAGGCCACCGCCGCATCGCGCAAGGGGCTACGGCGCAACGCTGCGCCATCCGGCCACCAGCACGCCTCCAGCACCTCACGCCCCAAGCCCGCTGGGGTGAGCGCTAAGGCATGCGCAATGTCCTGCGCCGTCAGCTCAGGCACACCACCAGGCAGCGTGTCATAGCGCACGGTGCTCGGGTTCAAACGCGCCAGTAAACGACGCGGATTGTGACGCTGTAGATGATTTGTTATTTGCATTTGGAAATAATCCAGCACAAAAAAGCATGAGGACTAATATAACCAATTGAACTAAAAGAAGATTGTTGGGTGATATGGCGGCGCGCTCCTCCCGTCGGTGTGGCAGTGGCTAGATGTACGGTGCGGGAGGCTGTCGCCCCACCTATCCACCCACCTAAAAGCCCTTTTTAAACTCTTTTATATTTCTTTTTTTTTACATGTAAAGGTAAAAAAGGGGGTTTTTAGGTGTGTGTAGGTGGGGAGGCCATCCCACCCCACACCTACCCCACCTCAAAACTCAGCATTTACAAGCGACACACCGATAATCACCGTAAACTTGTGGCGCTTCCCTTCAACGATCCGGTCTACGTACTTCTTCTTAAACCCAGGTACACACCGTTTCAATCCGTCAAGGAAACGGGTTTTGGATAATGTGTAAACGCCGCTGGCTTTGCACCATTGCGTATAGGCTGGGTACAATCCATTACCCATTGGCGTACTGAGCTCCTCCTCATACTCGGCTCCTAGCTCACATTCTTCATCAATGAACTGGCCGACACGGTCCTGCTCCGACTGGTATTCCTCCGAAGCGGCCAACACGATATCAGGCGGCCTCAGCCCGTTCTTGTACCACTCCACGGCACCGGCCACGAGCCAGGCCAATACACCTTCTCGTTCAGCGGCAAGCTTCTCAGCGATCCTCATATCCCGAGGGTACTTGCCGTTACCGATCTCTTCCCCTTCAGCGGCATCAAATTTCGCTTTAAAGGGGATGAGCATAATGCGCCTCCAGATGCCGCTGTCCTGCCCCTTAATGACAGGCTTATGGTTAGTGAGCAGTTGCAGCTTGTGCGTGGGCTGGAACTCGAAGAGTTCACCGTACATATAGCGCGCCTTGAGCGCATCGCCGCCAGTGGCTCGTTTCACGAAGTCTTCCCGCAATGCTTCCCCGTCCCCTGATTCGTGGGTAGTCACCATGCGCCGCCCGAAAAGGTCGGCAACGGCGGTAGGGTGCTGTTGGCTTTTATTGCCTATGAGCAGCCCAGGGGCGGCCACGCCCGCATAGCCACCGAGAACCCCCATGATTAGGTCCAGTAACGTGCTTTTGCCGTTAGCGCCATCCCCGTACAGCACAGCGAACTTTTGTTCACGCACCGAGCCGGTGGTGCAGTAGCCGAACCAGCGTTGTAGGAAGTCACTGAGTGGCTTGCCAGCCTGCCCCTCTTCGCAGGTAATGCGTTCCAGTGTCTTTTTAAAGACAGGCGCAGGGGCGTTTGGGTTGTAGCTAAGGGGAACGACCCGCGTAATGCAATCTTCGCGGCGGTGTGGGAACAGCTCCCCCGTGCGCAAGTCCACGGTGCCATTGGCGCAGTTCAATAACCAAGGGTTGCTATCTAGCCGCTGTGGTCCTACGGTGAGGTGGCTTGAAGCCCATTCCACTGCTGCGTTCCGCCTGGAAGTCGATTCGGACTGTTTAACCCATTTGTACAATAGGGCAGCTTTTTTATCGTCTTTCTCTTGCGCTGCTTGATCTGCTTCGTAGCGGATGGCGTTCGGGAAGTCGTCTATTACTTGGAACGCTTCGGCCTTGCCCTTCTTCCAATATTCCCCGTTCCAGGTGTACCAATCACCGGCAACAATCATTAGCTGTTTTCCGTAATGCTTGACGATGCGAGCACCGTTCGCTTTGTCCGTTGTCAGCTCCTTTGTTGTGGCGAGTCTGCTAATGATCATGTCGTCCGTTGGGGTGTGTTCTGTCGTGGGAACGCCGAACATGTTCACATCCATTTCGTCTATTCCAGGCGGTGGTAATTCTTCCTCGGTATAACCCACCCCTTGCCGGAACTCTATTTGCGTCCTATTCCGGCAGTGGGCGTGCTGGCATACGAACGCGCCATTGGCATAGCCTCCGGTGTGCGCGGGGTAATACACCGTGGACGTGGGGCTGGAGGGCTGCGTATGGTGTGCTTCGAATGGGCAGGTAATGAACAGCTGCCCTTCTTTGCCTGTTGATAGGACCTTCCAGAAGTGCGAAAGATGCACAGCCACAGGGTCGTTGGCAGCAGCGGCCATGAGCTTGTGTTGGCGGCTGGGTTTGCCTTCAGCCTTGGCGCTTTCCAGTATCGCCGCATCAATCGCCAACATTACAGAATCCTCAAGAATGCCTTTCACAAAGCCGCTACGCACTGGAACTGGATCGGCCACACCGGCTTCAAACACAGGGGCGGCGGTGTAGTGAATTTGCACCGTATTAAATACAGAAGCATCCAGCCCTGGAGCGCAGACAGCGGCCCAGGCTTTGAGCTGTGCGCTGGTGTACGGCTTATGTAGCCAAAACCACACATGGGCTTTTAGCTTCCCTGCACACTCAGGACGCCCCGCACTACTGGATAACTGCCAATGGTAATCTGCGCCGTAGAAGCCAAGAGGCATTTGGTCGCTGAGGAACTCGCCGATGCTCCCCACCGGATCGGTCACCGGATCGCGGCGCACCGGATCGAAATTGTCAATCTCAACGAGCATCCAGTGATGCGGGATATCATCGTACAGCTCGGCAATGCGCCGTGCTTTCCCTTTCTGGAACTCAGTATCAAGCGCAGCGGCTTTGGCATCACCCACATACGCCCCACGAATCACGCAGGCATGCGGGTTCTGCTCCAGCTCCGTGAGAAGTGCAGACAGCTCGCGGCTATTGTTGAGCGCTCGTTGCTCCACCTGGAAGAACTTGGCGTTGTCGTAGGCTTTGAGCGTGCCATCAGCGCGCCATGTTTTGGCGAGGGTATTTACGGGGTGTTTTAGGACTGTGATAGAATCACTCATAGTTGAGTTCACCTTTGTTTTGGGCGAGTTGAGGCCCCCCCCTCAGCCCGCCCTTTTTAACTGCGGGGGTGGTTTAAAGATTCGAGAGTCACTCCTTACTAATTAAGGAGTACGGCCAGACAGACAGCCGGATGGTTTAGCGCAAACGGTAGAAGGCATTGCGGCAACGCCCAGCGAGGTTCTCCATGCGCTTAGCACGCGGCCCCACCTCCCGCCACAGATCACTCACCCCACCGGCTAACGGTGAGCGCATCAGATGTAAGGCTTGTGCAATCGGCTCAATGCGTTCCCGTGACAGATGCCCCGCGACCATCCTCAGCAGTGCGTACAGGTTGAATGCCTCTTCTTCTGTCAAAGTGACAACAGGAGGGTGCGAAGCTGAGTAGCTCCCCGTCTTGCGGATCGAGGGCAGGACTTCCGAGGTTACCCACTTGGCGAACTTACGTGCCTCTGGCTTTCTGGAGCGGAGTATCAAAGCGTATAGGCCAGACTCGGAGATGATGAGGAATGGCTTTCCAGCTAAACCTAACGATTGGTTAGATTTCTCATCATCGTCCAAATGATCTGCGATTGCTTTGCTTGGATTTCGGTACCCCAATGCATTGCACACATCGCCAGCAATGAACCACGGATTGCCATCGCGCATCACAACGCGCACAGAGTGAGAATGAAAATTGAACGGAATAATGGACCGCGTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP020870|1186421:1207194|1196713_1197568_-|WP_046419211.1|DBSCAN-SWA MSDDTNTATESSEAVTPPTNDAQAALQEQQQHPTPDPTGEPPGDMGDADTQAEEKKKQGNRTREYIQRINGENNALRRRLEALERQQQSGPTRSSHTPQTGQEGAPELEDYSYNLNGWVDARFSHLFQQWQQEQQQAETARQQHSAQARYEARAAEFVNAHPDFYETVGSMDLSLLSPAVQAAVIQHEKGPEIAYHLATQDDALWSLASVREDLLPAAVERLAARMATPQTHDPFTSAPSPGKPISNAPPPPPSVSGRSPTEIPPEKLTDDDWYKRDVEKRRKR >NZ_CP020870|1186421:1207194|1201608_1202151_-|WP_046419794.1|DBSCAN-SWA MIVLQPLKVLGALACLLCLFLLLRVPYGMGYRAGASEWKAKFAEADSQYQTLRNQITTAERAMKAKTDAASQQLKVNRENVDEALKQAKADAVQLRSELAAALRAGHVRLRQEWSGCVQPAPSGSAAAHAAAADARRRTDGAGRFFAATATDAAVMTSLWNAWKADRDAVIAAGCAVEAH >NZ_CP020870|1186421:1207194|1192132_1192840_-|WP_046419220.1|DBSCAN-SWA MPWASLIPAAGSILGSIISNRSAHRAADAQTQASQAAIAEQQRQYNQTRQDQMPWLTAGQNALAGQQAVLHGDYSGFQNSPDYAYALQQGLQGVDRSAAARGSLYSGGHQADLVNYAQGLAMQNLNNYWNRLAGMSNAGQAASSHLAGFGQNYAANMGNQLNNAAQARGWAANRSADNTTNMLIGVGNALGNLYEGRQRHIANGASFGNAYTEDPYTRNPYISDPYRQHQWNTGG >NZ_CP020870|1186421:1207194|1197569_1199642_-|WP_046419208.1|DBSCAN-SWA MPHTQKNTTDELAQMRSRYQLASDCCRDLYDQARDDIKFVTVPGNQWDESLKKRRRHRQTYEFPKLRMHTQQVINEMRQQRPSCKVRGVEESDRGLAEIMQGICRNIESVSNADHAYDIAYEKAVKGGFGVLRVMTDYLNEDDFEQDIRIKAVRNPFAVKFDPAAVEIDRRDANFAFVEELIPRTDFERRFPDADVSDFDADTQCGAWRDAGQVRIAEYWWKDPRKRELLALSDGRVVFADEIAAQAGLGVEEAKTFLEAAGVQIVRTRTIEGHRVLMRLTNGHTWLTEPYEFPCQFIPIVPVWGNIENIDGSDYWSGMVRFGKDQQRLHNVHRTALVEAVAKSPKAPFIVDPKMIEGHVQMWNDSHSEDFPYLLANIVDNGRIPVRVEQAQVPDALIRLAGMDNDDVKAATGIYDASLGARSNETSGIAINSRKMQGAVATFNYIDNLAYAVRYTYEILVDMIPRIYDTPRAVRVLGEDGGEKWKQLYQQVTDPTTGHTVTLNDIRQGKYDVVVTVGPSYATQRMEAADAMMQLAAQMGGVAPQIATVAAYAGMRNMDLVGGEEVVAAFHKILAAQGLLPPKDGEPPPEPPAPDPVQLARVKKLSADAELSDARAQHQRADAAQQHIETLAAAHALSAPAVDPGWLPNAPNNTPPQGGFFMPEAPAPHGVIPAAHSRVRIGPVRPTPRG >NZ_CP020870|1186421:1207194|1193085_1195014_-|WP_046419217.1|DBSCAN-SWA MSARWREAPVTGGAYSDETRAWTVQDTVNWIPEAAERGGGRSSSLLRCAPGAAVFCVPEAANPAPVRGLHDVEDKLFAVIGTTLWQITTAGVAIPRGTIPGVERVCMAHNQIAGGNQLVIGNGTSGYVYNTYTELLAEQITGEGFLGLKACDYVDSYIVGVEPSGRYWFHSALADATSYSALDRYEAESQPDRIVGLIVIHRTVFVLGKRSGEFFYNSGASEGTFQRHAGTEMQTGCASQHTIQQMGNTVFWLGHDGRVYRLDGYQPVRISTPPLEQAITACNHEEAFAFTFDDRGHQVYYLTFPDGMTWGFDITTREWHRRESFGISRWRMNACVRWAAHWVAGDFANGKLYTLDWGMPWEHGQVIERRRVNGVLHDHQNRLTVDAVELVFGTDALGGYRGPGDLNPPQPAGPTLSGNAPDAVNTQAYRYAYTMKPGATPIVAVRVVSGALPEGLSLDSKGLLSGTVEVAGLVAGAKRAYPFTIRVTDANGLWACVSDTMTITTAEVLCGTSTSYSGGQAFPNSVHVQLGSQTGLVTLAYATASNPDKFQVWIGGQKVIDTGYHGDASYQAQLDADLKNRGLPPETITLRPGGDTTPEGQFANKNYETATFQKTTADTIAEVLVYSPLAGTAWQFHLDCPT >NZ_CP020870|1186421:1207194|1186421_1187924_-|WP_046419225.1|DBSCAN-SWA MTTIPHNNEPFLDASGLVTRSWRTYLSSLAKSDAATQLQQQIDALQAALHTAPPNGSSSAPNAQLLATGSINVAGTLSSGFVSLNLEGDARSPGSTSYYGTDATGVKGWHPVPVVSVNGNTGSVLLKVGTLADVDASGIADKKALVWDASANKHVYADIPSGGGGGSTGGMRIENIASAANTLTQSQVGSYLRFTYSGAKTVTVPAGLLVSSGMAIVLSNQSSGPLTLSAGAGMALNGSAGAFVLQQGEAAALVFLSSSEADVITASITGQASYTTAICVAFNGKNQWGDIASLPSSVIGGTPHTILALYNPLGGFGSTGTRQAVFSGGALGTPQLRIGAVSDETTFTYYESHADGSNPYGGVGGAGSATAGRWTAHAGKYDAGRCYAATHTTPWTDSGAASDTPSNTLTPVPPFTIARRYNAGSPDRYFNGYLHSLGVFNRALSNIEITDFFAHKDLTSINGLVSGWRFGTDGQTTVPNLVDGEAPCILSGAPDYVRMS >NZ_CP020870|1186421:1207194|1195010_1195418_-|WP_046419214.1|DBSCAN-SWA MTTVAEIIRDAFGHLRVLDANEAVEAEDATRAIRTLNLMMRRWEANGIALGWSEVASPDDLLPAPAEAEEAIGYNLAVRLRAGYGVVLEQDVLNAAERGRAMLLSDTVHASGVRMQYALPAAESQRRYGRDGYDG >NZ_CP020870|1186421:1207194|1189133_1190438_-|WP_046419799.1|DBSCAN-SWA MGRLGQAGKAAQLAAKAGVAAAEGAGYGALGETRTGENRLANAGYGALGGVVARGALGAARVAAGGLARRGDPLLQADITIAQREGIPLHISQVAQSTPARTMASMAKYLPFSGADAAARNQQNAWNRALTRHVGDATDRLDDSWIARQKQALGDAYDTIWNRNDVTISPETATRMHQIVQNAYRDLGTDGGKVVENQFGRILDDIAQAGQEGTISGRNYQRLARTLAQVQPGTSVGNYVGRLRKELIGDAESSINPSDMALLQQTNQKYNNFKTLEKLLTRQPGAKADIAPAALWSAVNARGPKATQEFRELAKVGQNLLKDPIPDSGTTGRMISASALGGGGVGALFGGGGGALVGGIVPVLTGLATGATAGRALNSPFIGNWLARNAGNPDIFANLLAAQGGWGSLRKGLGRAGQITTMAGGKQLLQEEMP >NZ_CP020870|1186421:1207194|1201294_1201609_-|WP_046419200.1|DBSCAN-SWA MSRNGVFYASAYDTVRTHVLRMVEGERMKCTVEMGGAMPKSVSITSMTLDTSRQCAADAVAGDVGARRFDVWMTGQRCGEGVIFARTTLSDGVCRVHRIWVRVQ >NZ_CP020870|1186421:1207194|1203965_1206494_-|WP_046419194.1|DBSCAN-SWA MSDSITVLKHPVNTLAKTWRADGTLKAYDNAKFFQVEQRALNNSRELSALLTELEQNPHACVIRGAYVGDAKAAALDTEFQKGKARRIAELYDDIPHHWMLVEIDNFDPVRRDPVTDPVGSIGEFLSDQMPLGFYGADYHWQLSSSAGRPECAGKLKAHVWFWLHKPYTSAQLKAWAAVCAPGLDASVFNTVQIHYTAAPVFEAGVADPVPVRSGFVKGILEDSVMLAIDAAILESAKAEGKPSRQHKLMAAAANDPVAVHLSHFWKVLSTGKEGQLFITCPFEAHHTQPSSPTSTVYYPAHTGGYANGAFVCQHAHCRNRTQIEFRQGVGYTEEELPPPGIDEMDVNMFGVPTTEHTPTDDMIISRLATTKELTTDKANGARIVKHYGKQLMIVAGDWYTWNGEYWKKGKAEAFQVIDDFPNAIRYEADQAAQEKDDKKAALLYKWVKQSESTSRRNAAVEWASSHLTVGPQRLDSNPWLLNCANGTVDLRTGELFPHRREDCITRVVPLSYNPNAPAPVFKKTLERITCEEGQAGKPLSDFLQRWFGYCTTGSVREQKFAVLYGDGANGKSTLLDLIMGVLGGYAGVAAPGLLIGNKSQQHPTAVADLFGRRMVTTHESGDGEALREDFVKRATGGDALKARYMYGELFEFQPTHKLQLLTNHKPVIKGQDSGIWRRIMLIPFKAKFDAAEGEEIGNGKYPRDMRIAEKLAAEREGVLAWLVAGAVEWYKNGLRPPDIVLAASEEYQSEQDRVGQFIDEECELGAEYEEELSTPMGNGLYPAYTQWCKASGVYTLSKTRFLDGLKRCVPGFKKKYVDRIVEGKRHKFTVIIGVSLVNAEF >NZ_CP020870|1186421:1207194|1202147_1202645_-|WP_053014138.1|DBSCAN-SWA MPTAPNSASNGRRMAFGLAVVLGLAAPMIAKWEGVEYRPYKDIVGVWTVCYGHTGADVVHGKTYSKAECEALLQRDMLQASGYVRRCITVPMFPHVEAALVSATFNIGPKVVCGSTLQRKALANDWPGACAELARWKHAGGREIRGLTLRRDDEQALCEGRILWS >NZ_CP020870|1186421:1207194|1192870_1193116_-|WP_046419219.1|DBSCAN-SWA MAIPPGLPNMSRKVMLRYSKDGGHNWSAWVARDLGDVGAFQKRLRRYRLGQGRQWVFDIRITDPVVAHLLAMSLQASAGPA >NZ_CP020870|1186421:1207194|1195438_1196665_-|WP_046419213.1|DBSCAN-SWA MGTQALTHQMIAREAAKMLVEQNNVVTNINTERSKEFGEETNGYKKGDTVKIMVPPVPVTYSGSVFAASGATSAPLSAPPISESYVTLKLDQQYHVPLTFTAKEKKLDLTDFSKRFLRPAMTSLSSKVNAVLLASMYQQTPNIVGTLGTVSDARTPWRSAASTLDRHLAPEEDRCAHFSTDANDALAEANVSLFHTSEELRGEFSKNAVGKFAGITFYKQLSLPTHTNGLGAGYVVSGAGQIGSSISMDGKAGKGDITKGSIITFDDVFEVHPITGATTGKLRQFLVTEGYIAGGSTLSIYPAITPTTSEGIGTVDKAPRDRAAMTFFGQPLVPAVQNLVFHRDAFATAFAPLPVLASCEGYTASIKNISVRVMTFGDGKNDMEHTRVDVLFGTPAPVRPDHACRVTQ >NZ_CP020870|1186421:1207194|1187920_1188703_-|WP_046419223.1|DBSCAN-SWA MTAFRLFSRLNTFYGMTGQLLAAGQLKFYDAGTTTPRPVYGDSGLAVNNGVEVRLDSSGRPDVDIWGKGSYFVELFDSLGAKQGEADGVSIPGGGGLTIPALDSSKFLTNNGAILLWSTIREVPDPVGMGGKVLGTDGENLLWQSLPRPPDSQYTVSTDMLKIGNFMIQWGRDTAPASGKAATLKLVTFPKPFANTPYFVKASVTAALATASSLVAESVSGTSTTNATFNFVTADSKEKNTDPIISPIPFDWIAFGQGVA >NZ_CP020870|1186421:1207194|1202972_1203695_-|WP_046419197.1|DBSCAN-SWA MQITNHLQRHNPRRLLARLNPSTVRYDTLPGGVPELTAQDIAHALALTPAGLGREVLEACWWPDGAALRRSPLRDAAVALVTPEIRRQQQRLLEARTDVGIAKACMGWTRATTSAQQAALRRAEERLEEAKAQLWPQATLEMLPALVAAVVGELSTPQLCPCCHGRGERRVGALVKVCAACGGSGMLPVSDRRRAAAIGRDEASYRRVWCGVYEWLLNKMRSAERQAARELQEALQREAA >NZ_CP020870|1186421:1207194|1200968_1201295_-|WP_046419203.1|DBSCAN-SWA MAWAKGQSGNPKGRAMGSRNRNTAAIKQAFLDAFDQLGGVPALVAWARDNKTDFYKLAARMIPTEARVSGELELKEAGDDELDAAITAYATQAGISLVVRGESAAPPH >NZ_CP020870|1186421:1207194|1188738_1189137_+|WP_060872308.1|DBSCAN-SWA MASLVSFLAGFLLGRAGVLGYVLWAGLLLLVLCLLLWAVIYAAGALLWGACLLVGHVCSLFEPVVEFIKRITPPWLARVLWGQSVAGVLLWGLCLLFKPLAAFINRITPSSLARVLWGAGKENDKIDRPVAS >NZ_CP020870|1186421:1207194|1202625_1202850_-|WP_046419792.1|DBSCAN-SWA MRDSVLRSNFFHDAGFAVVKSAPPAGVAFLTADKLVTLATLFYVLLQAAYLIWRWKRDAKTDKCDEQGDADGSE >NZ_CP020870|1186421:1207194|1206630_1207194_-|WP_046419191.1|DBSCAN-SWA MTRSIIPFNFHSHSVRVVMRDGNPWFIAGDVCNALGYRNPSKAIADHLDDDEKSNQSLGLAGKPFLIISESGLYALILRSRKPEARKFAKWVTSEVLPSIRKTGSYSASHPPVVTLTEEEAFNLYALLRMVAGHLSRERIEPIAQALHLMRSPLAGGVSDLWREVGPRAKRMENLAGRCRNAFYRLR >NZ_CP020870|1186421:1207194|1199644_1201060_-|WP_080939574.1|DBSCAN-SWA MMNWMPPLQHMPPKQVLALLLEEKARRRRTNRLAEYSPYPKQRAFHAMGAGTRERLLAAANQSGKTLCAGHEVAMHLTGRYPQWWEGKRFERSNHGLAGSETGELTRRGVQRILLGRDPKTEMGTGAIPGECIEGVTWARGVPELVDTVYVRHVSGERSSISLKSFDQGREKWQADTVDWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSSVVRRFLLEEAPDRGLVQMTIDDAEHYSPEDRARIIASYPAHEREARTKGTPSLGSGRVFPIAEDSIAIAPFAIPEEWALIGGMDFGYDHPFAAVKMAWDREADVIYVMCAYRQREATPVIHTAALRPWGARLPWAWPHDGLQHDKGSGEQLAEQYRQQGLSMLGGRATFPDGTNGLEAGVTEMLDRMHTGRLKVFSHLTEWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYAKPPHVATFNVYDYSVDY |
20 | Xylella_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1211003 : 1222382
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP020870|1211003:1222382|DBSCAN-SWA AATGACATTAAACGTTAGCGAGCAAGCCCAGCCCCAGGCTAGCAAGGCCTTATCAGGGACTCAAAAAGCGCCGCTGTTCTACTGGAACGGCATTCGGGACGAGAAAGGCGGGAAGTTGCAGCATGCCGATTACTCCTACGAAAAGCCGCACGACAGGGATAACTCGGAAATTAGGGTAATCGCCACGCGTTACACACGCTTTAGCCCTTTAGTGCATGCCTATTTTAAGGTGTACAACAGCACCGACGTGATGACAGATTATTTTGACGACGATAAGTTCACTGTTACCACAACGCATCCATTGTATCAGCAAGCGAAAGCTGCCCTTGAGGCGGTGCGTAATCGATCTGCGGCGCAGCTCGCTGCATCGGAGAAAAAGCGACAAGAAAAGCGTAATGCTGCTCGTGCTGCGCTCGAATCCGCTAAAGACGAAGAAGCGCGTGATGCTGCTTTTGCTGCTGCTCGTGCTGCTTTAGCTACATACGAAGCCGCTGAAAGCATGGGGGTGGCCGCATGAGCACCGCGACCAAAGAAGTACAGGCCGACGCGCCCGTATTAGCGCCTAAAAAAGCGCCGTTGTTTTATTGGAACGGCATTCGGGATGAGGAAGGCGCGGAGCTACAGCAAGCTTTTTACGCAAAGTGCCTGGACGGGATAGCCATCTTCGGACGTTGCGCCTGCAAATTCAGCCCGCTCGTGCGCTCATGGTTCGAACGCTCCCCCTACAGCGGTGTTAGAGGGATTCTTTATCAGAGAAGAATCGTTAGCTACCGCATCGACTTAAGACAAGCGCACCCGCTATATCCGCAGGTGAAGGCGGCTTATGGAACGCAGGAAGCCTACAGCAATTCCATAGTGGAAGATGCGGAAGAATGGAAACATGACACTGGTAAAGAGGTGGCCGCATGAGCACCGCGACCCTCGAAAACGTCCCCACCCCAGGGGGCACTAGGTTCACCAAAATATACGACACCGATGGCTACCACGCCATCACGCGTGATGAGCATAAGGGGCTTGAGTGGCTAACTGGGTATGTCGGGGGTCCAAGCTCGAATGCATACGGCCCTGATTGTGATGCGGCGGAGGAATGTCGCCAATTAGCCATCTGGGGTTATGACAATTGGCGCTTGCCTACGCAGGAAGAGGCGCTATCCGCTAATGGCGCAGCTGCCTATCTGCGTTGGGAGGAACCATCCTATTGGATATGGACCTGTACACCAGATGACGATCACCCGCGCGAGGCGGCTTGGGTCGTTAAATTCGGCACCAATCACACCGCTGCGGTATATCGAACAGCGAAGTATTTAGTTTGCGCCGTGCGCGGGCAGATGCGTACTGACGCCACTGCTACACCAAAGGCAGGTGAGTCATGACTGGCATTGGCTACAGCAGTTATAGCGACCCACGCCTGCAACCACCGCAAGACGATGCTAAAGAGTATTTCGCAGAACGGGTTAATGCTCGCGTTCAAGACTATTTGAGCGACCCAGAAAAAATCGAAGAAGCCGATGAATGGGTGGACGGTACGTTATCGGAGGCCCATTACAAAGAGATGGAAATCGTTCTAGCGGATTTACATGCCCTGCCTTCGGATCAGTTAAGCGACAGCGATGTATTAGCCCGTCTTTACGCACTGGCCGAAGTGCAAGGACTCGCACGCATGGAACAACTGCGCATCCTTGCTGAGCAGGACGTCAAGGAAGAAATGCGCCACGAATCAGAATGCTTTCACGCGATGTGGGGTGATGTCATGCAGGAGGAATACGCATGAATCGTTATCGCAAAGCGGAGGAGCGGAGTATCGCATTGCGCCGTTATTACAGCTCGCGTCCTTGGCTACCCAATTGCATTTTGCGCGATACCCGAGCAATGCCCAATGCACTACGCATCGCCAATTACCGCGTTCGTCGCGCGCACTACGAAGCCGCCAAAACGCTTTCATCGTTATTGATTAAACACTAATCCCGAACGTTTTTCTGGAAGGAGTTTTTATATGTCGCCTCCCATGCCGACAGAAATGTTCATGCGTTCGATGAGTAATAGCGAACTTGTGAATGACCTAGCAGCCGCGCAAGCCGTGGAGCGTTTTACGCCTGTGGAAAGTGAGTTGTTATGGCGTTTTACCGCCTTACTAGACGCTTACGAGCACGTATGTGAGGCACTGGAGCAAGACGGCGATGCGCAGTGATGACAACGGACAACGGACAACGCACCAGCACCAGATAACACACACACACGAGAGAAAAAACCCATGTTCCCTATGACCGTCACGATTACCGACCAGGCCCAATTAAACGCTGTTTTAGCGGTGCTTCATCCGCCCAGCAGTAGTAGCACCGTGCAGCCTGTGCTTGACGATACACCGCGCAGCGATAGCGCCCCTAGCCATCCACCAGTTGCACCTCCTCCCTCTGCGGGTAAGACGCAGCGCGCTGCCAAACCGGCTGCTGCTCAGAGCACAACGGACGTTATGCGCACGCCTGGATATGCTGAGGTGGCGAATGCGTTAACGGCCCTGTCTAAGGAGCTCGGCGACAAGTACGCCTACGACGTTTTGGCGTCATTCGGCGTCAAAGTGTTGTCCAAGATACCGGAGGAGTTATTTCCGGAGGTGCTGAAGCGCATCGAAGGATTTTACATCGCCCACGAACTCGATTTGCCCTTAGAGGCTGAGACATGAGCCAGCACGCCATGCTATCCCCGAGCAGTGCGCATCGCTGGTTGCACTGTCCGGCAAGCGTTCCGTTATCTCGTACATGCGAAGACGACGCTAGCCCATTTGCCGATGAAGGCACCGTGGCCCATACGGTGGCTGCCGACGCATTACGCACCGGCTGCGATGCCAGCGCGTACGTGGGGGCATGCCATGAGGTAAACGGCCACCGCTGGGAAGTCACCGCAGAGATGGCGGCGTACGTGCAGGAGTATGTGGACTATGTACGCGCCATTGCGGGGGTGCGCCTGGTCGAGCAGCCCCTACGCATTGCCTCCATTACTGGGGAGCAAGGCGCTAAAGGCACCGCTGATGTGGTGATTTTGGCGGGGGATGCGTTGACCATCGTTGATCTCAAATACGGCAGAGGTGTCAAAGTCTTTGCCGAAGGCAATGAGCAATTGCAGCTGTATGCGCTGGCAGCGCTGCAAGAATTTGCAGGGGTGGAAGCCTTCCAGCACGTGCGGCTAGTGATCGTGCAACCACGGCTCGGACATGCCGATGAGTGGGTGCGCACCCTCCCCGAGATGGAGGATTTTAGGCGGAAGGTTGCGCAAGGCGCGGCGCGGTGTAGGGCTGCGATTGGGCACTACGACAACGTAGGCGAATTACCTTCAGAGTATTTCGGCCCCGCAGAAAAACCATGCCGGTTTTGCAGTACCAAGGCGACCTGCCCTGCGTTGGCCACGCATGTACTGAACACGGTGGCGGATGATTTTGTCGACCTCACAAAGCCCATTGTCCCGCAGCTCAGCTACGCGCAGCTGCGCACGTTTGACAACACCACACTGGCCTGCCTGTTTGGCGCAACAGAGTTAATCGAATCCTGGTGCAAATCTATTCGCGACAGGGCAGCAGCGCAATTGCTTTCAGGCCAGCCCGTGCCTGGATACAAGGTAGTTCAAGGCCGACAGGGGCCGCGCCGTTGGGTGGATGAGACAGCCGCCGAGGACGCGCTCATCCAGATGCGAATTGGATTATCTCACCTGCACGATGTGTCCCTCATCAGTCCTGCGAGCGCCGAGAAACTCCACAAGGCAGGAGTACTCGACCTACAGCAATGGGTGCAGCTCCAACCGCTCATTCATAGGTCAACAGGGGCGCCCATTGTTGTTCCCACATCGGATAAACGCCCCGCGCTCGCCCTTCAGGACGCGACGGATTTTGAGGACTTGAGCGATATGCCCATCCCCCCACTCCCAGACACAGCACCCTCACTTCAATCTCAGGAGACACCGTAATGAAACTCACCCTAAAAAACGTGCGCTTAGCCTTCCCCGTGTTGTTTGAACCCAAGAAAGTCAATGGCGAAGGCGAGGCCGCCTTCTCGGCCTGCTTCCTCATCGACCCTGCTGACCCGCAAGTCAAAGCCCTTAACCAGGCCATTGACAAGATGGCCAATGACAAATGGGGTGTTAAGGCGGCGGCCCAGCTTAAACAGATGCGTATGGGCGACAGAGTCGCCTTGCATGATGGCGACCTGAAAGCCAGCTATGACGGGTTTGCAGGGCACCTATACGTCTCTGCGCGTAACAAGGCACGGCCACTGGTGATCGACCGTGACCGGACCCCGCTCGCCGCGCAGGACGGCAGGCCGTATGCCGGATGCTACGTCAACGCCAACATAGAACTCTGGGCGCAGGACAACAACTACGGCAAGCGGATTAACGCCTCGCTGGGGGGCGTGCAGTTCTTGCGTGATGGTGAGGCGTTCGCTGGAGGCGGTGTGGCCAGCGTGGAGGACTTCGAGGACCTGAGCAACGTCGCCGAGCTGGCGGATGTTGAAGGAGCCATGCCGTGGGGGTGAACCCCGATGGGGAGGGTGCTTCGCCCTCCCACGCCCCAGCTCCCCTTATCCCTGTATGGACAACCTCATGAATACACCCTCTGAGTTCACTCTCCAGTTTGAATCCCACGCCGTGCGTGTCCAGCTTGATGAACACGAGCGGCGATGGTTCAACGCCAATGACATTTGCGCGGCGTTGGAGTTGTTAAATCCGTGCGCCGCACTTGCTCATCATGTGGATGCCGAGAATGTATCGAAACGCGCCACCTTTACGGCAGGTGGCCCCCAACGCGCCAACTATCTCAATGAGCCAGGAATGTACGCCCTGCTCATCGGCAGCACCAAAGACACCGCCAAACGCTTTCAACGATGGCTCACCAGTGAAGCGCTGCCCGCAGCCGCAGCTCAAAAAGCGGGCCAGAACATCATCCCGCTGCATCACGCGCCTTCCATCCCAAGCCCCTTCCAACCCACAGAGGACCACACAATGCATGCAATCACTCCATTCCAATTTGAATCGCACGCCGTGCGTACCGTGGTCGATGATCACGGTGAAGTGTGGTTTGTCGGCAAAGACGTTGCCGATGTACTCGGTTACACCAACCATAACAAAGCTTTGGGCGATCATTGCAGGGGGGTGCCGAAGCGTTACCCCCTTCAGACGTCAGGCGGAGTTCAAGAAATCCGGATCATCTCCGAGCCTGACATGCTCCGCTTGATTGTGAGCAGCAAACTCCCTGCCGCAGAACGGTTCGAGCGTTGGGTGTTTGAGGAAGTTCTGCCCGCCCTGCGCAAGACAGGCACCTACTCCACACCAGGAGCACTGCCCACCTTGCCTGGGCCGACACAGGATCGCATTGCCGCACTCCTGTTAATCGGCCAATACATCTCCACAGTGCCAGGGGTGAAGCCAGGGATTGCCGCAGCGGCAACGCTGGCCTGCATCAAAAGCAACACGAATTTAACAACCGAAGAGATACGCCGTGCGTTGCCTGCACTGCGGGACCCGCTTTGCATGCTCAACGCCACACAACTAGGCAAACAGCTGCATTGCTCGGCCAAGGCGGTGAACCAATTATTAGCCTCCAGTGGCCTGCAATTTCGTAATGAACGCGACGCGTGGGAGTTAACCGAGGCCGGTCGCGTGTGGGGTGAAGCCATTCCGTACTCACGCAACGGGCACAGCAGCTACCAAATTCTTTGGAACCCAACGGTGCTTGACTCGCTGAAGGTCGCCGCCTGAGATGGCCACCGTGCAGGCACCTACACCCATCCTATGGGGGGACCTAGAGACGTACTCCCCCGTACCGATTGCCCATGGCGTGCATGCGTATGCCGAGCAAGCACAGCTGTTGCTGTTTGCTTACGCCTTGGGTGATGGGCCGGTGCAGGTGTGGGACTGCACGGCCACGGCAACAATGCCTGACGACCTGTCCGCTGCGCTGCACAACCCCGCGGTGCTGCTGTACTTCCACAACTCCCATTTTGACCGAACCGTACTACGCCATTGCGGCATCAGCATCCCCTTGGAGCGCTGGCGCGATTCAATGGCCCAGGCCTTGGCCCATGCGCTGCCTGGGGCGCTGGGCACGTTATGCGAGCTGCTGCGCGTTCCTGTGGAGCAAGCCAAGGCCAAGGACGGCAAACGGCTCGTTGCACTGTTTTGCAAACCACGCCCGACCCACTGCACGTTGCGCCGTGCCACACGTGACACCCATCCGACCGAGTGGGCACAGTTTGTGGACTACGCCAAGCGCGATGTGGCGGCCATGCGGGACGTGGTGAAGCGCTTGCCGTCCCACAACTACACGGGGGCAGAAAATGACCGAGGCGTTCTGGTCGATACGGACTTGGCGCAGGCCGCCATCGGCGCTGTGGAACGCGCCAAGTGCACATTAGCCAACCGTACCGAGGCGCTGACCGGCGGCGCGGTGCAGGCGGCGACCCAGCGCGATGCACTGTTGCACCACCTGAGCACCGCGCACGGCGTGGCGTTGCCGGATATGCAGCAACACACGGTAGAACGGTGCATTGACGATCCGGCACTCCCAGAGACGGCGCGGGAACTACTGTCCATTCGCCGACAAGCCAGCACCACCAGCACCGCTAAGTACCAGGCGCTACTGCACTGCACCAGCCCTGACGGTCGCCTGCGCGGCACACTGCAATTTAAAGGGGCCAGCCGCACCGGGCGCTGGGCGGGGCGGCTGTTCCAGCCACACAACCTGCCGCGCCCCACGCTCAGCCAAGAAGTGATCGCGGTCGGCATTGATGCCATGAAAGCCGGTTGTGTGGATTTAGTATGTGACGATGTTATGGCGCTGACCAGCAGCGCGCTGCGCAGTTGTCTGATTGCGCCAAAAAATAAAAAGCTGGTCGTGGCCGATCTGTCTAACATTGAAGGCCGGGTGTTGGCGTGGCTGGCCGGTGAAACCCCCAAGCTGCACGCGTTTCGTGATTTTGATACCTGCCAAGGGGTGGACGGTACATGGCACAGCGGCGAGGCCATCACTCACGGCGCACTGCGCGGCGCGCCGATCACCTTGCAATGGAATGCCGAGCACGAGCCTATTCGCAAAGGGGACGACATTTACAAGCGCGCCTACGCCCATTCATTCGGGATAGCGCCCCAGGCCGTGACCAAGCAGCAACGCCAGATTGGCAAAGTGCAGGAATTGGCATTGGGGTATGGCGGCGGTGTCGGGGCCTTTGCCGCCTTTGCGGCCATGTATCACATTGATTTGGAGGCGATGGCCGAGCAGGCCGCCTTACCACCCCTGCTGCTTCAGCAGGCCATGGAAGCGCTCCAGTGGACGAAGGCGAACCACCGTCCCACCTTCGGCCTCTCCGATCGCGCATGGTTGGCCTGCGATGTGTTTAAGCGCGCATGGCGCAACGCGCATCCGGCCATTGCGGCGTTTTGGCAGGCGTTGCAGTTCGCGGTGACGGATGCGATCCACCACCCCGAAACGACGCACACCTGCTGCGGGATCACAGTGCAGTACAGCCGTGCGTGGCTGCGTCTGCGTCTGCCGTCGGGGCGGGTGCTTTACTACGCCGCTCCCAGAGTCGATGAGCACGGCGCGCTGTCCTACATGGGCACGCATCCGGTGACGCGAAAATGGGCGCGCCTCACCACCTACGGCGGCAAGCTGGTCGAGAACATCACCCAAGCCGTCAGCCGCGACGTGTTGGCCGCGTGTATGCCTGCGATTGAAGCCGCCGGATACAGCATCGTGCTGACCGTGCATGACGAGATCATTACCGAAGCCGATGAGTGCCCCTCCTTCAATGCCGCGCACTTGGCCGCACTCATGGCCACACCGCCACCCTGGGCACAAGGGTTGCCCTTAGCGGCGGAAGGCTTCGACACCCACCGGTATAGGAAGCAATGATGAACATTCCCCGTGAGCGAACGATCGAACGCTATTTAGTGGCCCAGGTCAGGGCCAAGGGCGGTGAAATCCGCAAGGTGAAATGGGGTGGCCGCCACGGTGCGCCGGATCGCATTGCCATGCTGCCCGAGGGGCGCACCCTGTGGGTGGAACTCAAAGCCCCCGGCCAGCAGTGCACACCGCATCAAGTCCGTGAGCATGCGCGCATGCGCGGCATGGGCCAGCGCGTGGTCGTGGTCGATTCCTTAAAAGGCGTGGATGAGGTGCTGGCATGACTCAGAAAAACGCTTTGATGATCAGTGCAATGAGGCTAGTGACGATCACACCGAACATCCACTTGAGTAGCAGCATTTCGCCTTTTATTTCAACGAAGCGTTGGTCTACTTGGGCGAAGCGCTGGTCCATGTTTTTATCCAGCTGTGCAAAATCTTTAGCGATTTGTTCAAAGCGCTGGTCAACCTGCGCGAAGCGTTGGTCTATTTTCTCAAAGCGCTGGTCTATTTTCTCAAAGCGCTGGTCAACCTGGGCAAAGCCTTCCTTCATATCGGCTTCAAGACGCGCTAATGCCTTGCCGTTTTTAGATTCAGACTCAGCAAGGCCTTGTAAATTTATTTCCAGCACTTCGGCCAAGGCTTCGGCTTCGGCCTCTGCGTGCGCCGCAGGAACCCCTGCCGTTTTCAGCCGGTTCGCAAATTTAAGCGTATCGAACGCTACGGATGTCACACATAACCCCGCTTTAGCCCCATGTGGTGGCGAGTATAGCAGCGCGCCTCCCGCCCATCCTGAAGTGCTGCACGGAGCATTGGCATGAACCTGCGCCCCTACCAACACACCATCGTTGATTTCATCCTGACGCACCCGCGCTGCAATCTGTTTGTGCCAATGGGTTTGGGGAAGACAGTAGCCACGCTGACGGCGTTAGATGTGCTCCTGGTGGTGGAAGACATTGCGCCTATTTTGGTGATTGCTCCGCTGCGCGTTGCTGCCACGACATGGCCGGATGAGGTGGCCAAGTTCCCCCATTTGCGCCATCTGCGGGTGTCCGTGGTCGTGGGTAGTGCGGCAGCACGTCGCCACGCCTTGGAGCAGGAGGCAGATATCTACTGCATTAATTACGACAATCTGAAATGGTTAGTGGAGTTTTACAAGGACCGTTGGCCGTTCCGTATGGTGGTCGCCGATGAGTGCTCCAAGCTGAAAGGGTTCCGGTTGCGGCAAGGAACACGGCGCGCCCGCGCACTGGCCAAGCATGTGCATACCAAGGTGGAGCGCTACGTTGGGTTGACCGGCACGCCCGCGCCAAATGGGCTACAGGACCTGTGGGCGCTGATGTGGATGGTGGATCGTGGGGCACGGCTTGGGACGCATTTTAAAGCGTTTATCGATCGCTGGTTCCGTGCGATGCAGATCGGCAGTGATCCGCATGCGGTGCGCTTTGTGCCCACGCCACATGCATCTCAAGAGATTCAAGACAAGATACGCGATATCTGTTTGTCACTTGATCCACATGCGTACTTCGATTTACGCCAGCCGATTGTCAATACGATTCGCGTTGCGTTGCCAGCACATGCGCAACGTCTGTACAAGGCGATGGAACAAGATATGTTCATCGCCTTGGAATGCGGTGCTGAAGTAGAAGCCTTTAACGCTGCCAGTAACACCATAAAATGCCTGCAACTGGCCAATGGTGCGCTGTACACCGATGACACACGTCAGGCCTGGGAAGTCGTGCACGATGCAAAATTAGAGGCGCTGCACGGCATTATCGAAGAAGCCGCCGGTATGCCGGTGTTGGTGGCGTATCACTTTAAAAGTGATGTCGCACGGTTGCAGCGTGCCTTCCCCAAGGGGCGTGCTTTGGACAAACACCCCGACACGATCCGCGATTGGAATGCGGGGAACATTCCCGTGCTATTTGCCCATCCGGCCAGTGCCGGTCATGGCTTGAATCTGCAAGACGGCGGAAATATTTTGGCCTTCTTCGGCCACTGGTGGGACCTGGAGCAGTACCAGCAGATCATCGAACGCATTGGGCCGACACGTCAGGCGCAAGCCGGACATACGCGGCCTGTATTTATTCACCACATCGTGGCGGCGGGCACGGTGGATGAATTGGTGATGGCCCGCCGTGAATCTAAACGCGAAGTTCAAGATTTACTGCTAGAGGCAGTGAAACGCAGAGAAACAGGCAAACCACTCACATCACAAGGAGCCATGAGGCGATGAACGCCCCATCAAAGCAGACATGTGGCCTGCTCCCAGCCGCTGGTTTATGTCTTTCTAGAAGTGAGGTTGCCGAGTTATGCGGTACTCCGCAACGCGCTCGCCAAGCCGCTTTTCTTAGGAAGAACGGCATTCGGCATTATCTGGATGCACATGATTGGCCAGTGGTTCTGCGTTCTTCGATTGAAGAGATACCGACGACTCCCATCGTTGCGCCTGTTTGGAAGTCTAATAAGGTCGCTTATGGGACGTAAGCCAATCAAAGCAGGTGCGATTCCGAGGTTTCGCGTGCGCCCTCAGAAGTCCGGCGTGGTGTATTACTACTATGATCATGGCGGCAAACCACGCAAAGAGACGCCACTAGGACGCGACTACGGTTTAGCCATCAAGCGGTGGGCTGAGCTGGAGCATGCGCAGATCACTTCTGCCATTGCGGTGACGTTTCGCCATGTGGCCGAGCGTTACCGCGCTGAGGTGACCCCGACAAAGGCGTATAACACCCAGCGCGTGGAGCATCGTTGTTTGGCTCCACTCCTGAAGTTTTTTGATGACCCACCCGCGCCGTTTGAGGCCATTAAACCGATGAATATCCGCCAGTACCTGGATTGGCGCACTTCTAAGGTGATCGCCAATCGTGAGGTGTCCGTGTTTTCGCATCTTTGGAATTGGGCGCGGAGTAAGGGAATCACTGATCTTCCTAACCCTTGCGGGGGTATCCGTCGTAATAAGGCGACAGGCCGCGATGTATATGTAGACGATACGACGTACCGCGCTGTGTACCAGGCAGCGGACCAAACGCTCAGGGATGCGATGGACCTTGCCTATCTGACGGGGCAGCGTGTGAGTGATGTTGTGTCTATGGATGAGCGCCATATTGTTAATGGCGCTTTGGAGATTTGCCAAGCTAAAACGGGTGCAAAGTTGGCGATTACGGTCACTGGCGAATTGGCAGTTTTAATAAAGCGTATTTTTGATCGCAAGCGGGGGATGAAGCTGCGTAGCACGCGTTTGATTGTGGATGCGGAAGGCTTGGAGCTAAGTCGTATAGGATTGCGTTACAGGTTTGATAAGGCACGTGCTGCCGCAGGGGTCGCCAAGGAGGTATTTCAATTCCGCGATCTACGCGCCAAAGCGGCGACCGATAAGGCAGATTTGGCGGGCGATATACGCCAAGCGCAAGCGCAATTAGGGCATGCGTCGGTGACGATGACGGAGCACTATGTGCGCAAGCGCAGGGGGGCGAAGGTGACGCCAACGCGGTGA
Protein sequences of DBSCAN-SWA_4 >NZ_CP020870|1211003:1222382|1219172_1219619_-|WP_046420063.1|DBSCAN-SWA MTSVAFDTLKFANRLKTAGVPAAHAEAEAEALAEVLEINLQGLAESESKNGKALARLEADMKEGFAQVDQRFEKIDQRFEKIDQRFAQVDQRFEQIAKDFAQLDKNMDQRFAQVDQRFVEIKGEMLLLKWMFGVIVTSLIALIIKAFF >NZ_CP020870|1211003:1222382|1212769_1212964_+|WP_046419163.1|DBSCAN-SWA MNRYRKAEERSIALRRYYSSRPWLPNCILRDTRAMPNALRIANYRVRRAHYEAAKTLSSLLIKH >NZ_CP020870|1211003:1222382|1212371_1212773_+|WP_004088054.1|DBSCAN-SWA MTGIGYSSYSDPRLQPPQDDAKEYFAERVNARVQDYLSDPEKIEEADEWVDGTLSEAHYKEMEIVLADLHALPSDQLSDSDVLARLYALAEVQGLARMEQLRILAEQDVKEEMRHESECFHAMWGDVMQEEYA >NZ_CP020870|1211003:1222382|1211515_1211911_+|WP_046419169.1|DBSCAN-SWA MSTATKEVQADAPVLAPKKAPLFYWNGIRDEEGAELQQAFYAKCLDGIAIFGRCACKFSPLVRSWFERSPYSGVRGILYQRRIVSYRIDLRQAHPLYPQVKAAYGTQEAYSNSIVEDAEEWKHDTGKEVAA >NZ_CP020870|1211003:1222382|1221118_1221373_+|WP_080939599.1|DBSCAN-SWA MNAPSKQTCGLLPAAGLCLSRSEVAELCGTPQRARQAAFLRKNGIRHYLDAHDWPVVLRSSIEEIPTTPIVAPVWKSNKVAYGT >NZ_CP020870|1211003:1222382|1212995_1213190_+|WP_046419160.1|DBSCAN-SWA MSPPMPTEMFMRSMSNSELVNDLAAAQAVERFTPVESELLWRFTALLDAYEHVCEALEQDGDAQ >NZ_CP020870|1211003:1222382|1214955_1215522_+|WP_020851622.1|DBSCAN-SWA MKLTLKNVRLAFPVLFEPKKVNGEGEAAFSACFLIDPADPQVKALNQAIDKMANDKWGVKAAAQLKQMRMGDRVALHDGDLKASYDGFAGHLYVSARNKARPLVIDRDRTPLAAQDGRPYAGCYVNANIELWAQDNNYGKRINASLGGVQFLRDGEAFAGGGVASVEDFEDLSNVAELADVEGAMPWG >NZ_CP020870|1211003:1222382|1215589_1216744_+|WP_080939573.1|DBSCAN-SWA MNTPSEFTLQFESHAVRVQLDEHERRWFNANDICAALELLNPCAALAHHVDAENVSKRATFTAGGPQRANYLNEPGMYALLIGSTKDTAKRFQRWLTSEALPAAAAQKAGQNIIPLHHAPSIPSPFQPTEDHTMHAITPFQFESHAVRTVVDDHGEVWFVGKDVADVLGYTNHNKALGDHCRGVPKRYPLQTSGGVQEIRIISEPDMLRLIVSSKLPAAERFERWVFEEVLPALRKTGTYSTPGALPTLPGPTQDRIAALLLIGQYISTVPGVKPGIAAAATLACIKSNTNLTTEEIRRALPALRDPLCMLNATQLGKQLHCSAKAVNQLLASSGLQFRNERDAWELTEAGRVWGEAIPYSRNGHSSYQILWNPTVLDSLKVAA >NZ_CP020870|1211003:1222382|1218892_1219171_+|WP_046420587.1|DBSCAN-SWA MMNIPRERTIERYLVAQVRAKGGEIRKVKWGGRHGAPDRIAMLPEGRTLWVELKAPGQQCTPHQVREHARMRGMGQRVVVVDSLKGVDEVLA >NZ_CP020870|1211003:1222382|1219703_1221122_+|WP_046420062.1|DBSCAN-SWA MNLRPYQHTIVDFILTHPRCNLFVPMGLGKTVATLTALDVLLVVEDIAPILVIAPLRVAATTWPDEVAKFPHLRHLRVSVVVGSAAARRHALEQEADIYCINYDNLKWLVEFYKDRWPFRMVVADECSKLKGFRLRQGTRRARALAKHVHTKVERYVGLTGTPAPNGLQDLWALMWMVDRGARLGTHFKAFIDRWFRAMQIGSDPHAVRFVPTPHASQEIQDKIRDICLSLDPHAYFDLRQPIVNTIRVALPAHAQRLYKAMEQDMFIALECGAEVEAFNAASNTIKCLQLANGALYTDDTRQAWEVVHDAKLEALHGIIEEAAGMPVLVAYHFKSDVARLQRAFPKGRALDKHPDTIRDWNAGNIPVLFAHPASAGHGLNLQDGGNILAFFGHWWDLEQYQQIIERIGPTRQAQAGHTRPVFIHHIVAAGTVDELVMARRESKREVQDLLLEAVKRRETGKPLTSQGAMRR >NZ_CP020870|1211003:1222382|1211907_1212375_+|WP_046419166.1|DBSCAN-SWA MSTATLENVPTPGGTRFTKIYDTDGYHAITRDEHKGLEWLTGYVGGPSSNAYGPDCDAAEECRQLAIWGYDNWRLPTQEEALSANGAAAYLRWEEPSYWIWTCTPDDDHPREAAWVVKFGTNHTAAVYRTAKYLVCAVRGQMRTDATATPKAGES >NZ_CP020870|1211003:1222382|1221362_1222382_+|WP_046420060.1|integrase|DBSCAN-SWA MGRKPIKAGAIPRFRVRPQKSGVVYYYYDHGGKPRKETPLGRDYGLAIKRWAELEHAQITSAIAVTFRHVAERYRAEVTPTKAYNTQRVEHRCLAPLLKFFDDPPAPFEAIKPMNIRQYLDWRTSKVIANREVSVFSHLWNWARSKGITDLPNPCGGIRRNKATGRDVYVDDTTYRAVYQAADQTLRDAMDLAYLTGQRVSDVVSMDERHIVNGALEICQAKTGAKLAITVTGELAVLIKRIFDRKRGMKLRSTRLIVDAEGLELSRIGLRYRFDKARAAAGVAKEVFQFRDLRAKAATDKADLAGDIRQAQAQLGHASVTMTEHYVRKRRGAKVTPTR >NZ_CP020870|1211003:1222382|1216745_1218896_+|WP_085808107.1|DBSCAN-SWA MATVQAPTPILWGDLETYSPVPIAHGVHAYAEQAQLLLFAYALGDGPVQVWDCTATATMPDDLSAALHNPAVLLYFHNSHFDRTVLRHCGISIPLERWRDSMAQALAHALPGALGTLCELLRVPVEQAKAKDGKRLVALFCKPRPTHCTLRRATRDTHPTEWAQFVDYAKRDVAAMRDVVKRLPSHNYTGAENDRGVLVDTDLAQAAIGAVERAKCTLANRTEALTGGAVQAATQRDALLHHLSTAHGVALPDMQQHTVERCIDDPALPETARELLSIRRQASTTSTAKYQALLHCTSPDGRLRGTLQFKGASRTGRWAGRLFQPHNLPRPTLSQEVIAVGIDAMKAGCVDLVCDDVMALTSSALRSCLIAPKNKKLVVADLSNIEGRVLAWLAGETPKLHAFRDFDTCQGVDGTWHSGEAITHGALRGAPITLQWNAEHEPIRKGDDIYKRAYAHSFGIAPQAVTKQQRQIGKVQELALGYGGGVGAFAAFAAMYHIDLEAMAEQAALPPLLLQQAMEALQWTKANHRPTFGLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETTHTCCGITVQYSRAWLRLRLPSGRVLYYAAPRVDEHGALSYMGTHPVTRKWARLTTYGGKLVENITQAVSRDVLAACMPAIEAAGYSIVLTVHDEIITEADECPSFNAAHLAALMATPPPWAQGLPLAAEGFDTHRYRKQ >NZ_CP020870|1211003:1222382|1211003_1211519_+|WP_046419172.1|DBSCAN-SWA MTLNVSEQAQPQASKALSGTQKAPLFYWNGIRDEKGGKLQHADYSYEKPHDRDNSEIRVIATRYTRFSPLVHAYFKVYNSTDVMTDYFDDDKFTVTTTHPLYQQAKAALEAVRNRSAAQLAASEKKRQEKRNAARAALESAKDEEARDAAFAAARAALATYEAAESMGVAA >NZ_CP020870|1211003:1222382|1213678_1214956_+|WP_046419156.1|DBSCAN-SWA MSQHAMLSPSSAHRWLHCPASVPLSRTCEDDASPFADEGTVAHTVAADALRTGCDASAYVGACHEVNGHRWEVTAEMAAYVQEYVDYVRAIAGVRLVEQPLRIASITGEQGAKGTADVVILAGDALTIVDLKYGRGVKVFAEGNEQLQLYALAALQEFAGVEAFQHVRLVIVQPRLGHADEWVRTLPEMEDFRRKVAQGAARCRAAIGHYDNVGELPSEYFGPAEKPCRFCSTKATCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATELIESWCKSIRDRAAAQLLSGQPVPGYKVVQGRQGPRRWVDETAAEDALIQMRIGLSHLHDVSLISPASAEKLHKAGVLDLQQWVQLQPLIHRSTGAPIVVPTSDKRPALALQDATDFEDLSDMPIPPLPDTAPSLQSQETP |
15 | Xylella_phage(100.0%) | integrase | attL 1208710:1208723|attR 1224999:1225012 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1290073 : 1300244
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP020870|1290073:1300244|DBSCAN-SWA ATCACCTTTTGTAATGCGCGTTGATTAGAACGCGGATCGTCTTGTCTGCGGCGTTTGGAAGCGCCGCCCTCCCCTCTTCCCAGGCCGCAAGCGTCTGCGTCGTCGCGCCCAGACGAGCCGCGAACTCAGGCTGCGATAGCTCGAGTTCCTTTCTCATGAAGCGGATTTCGGCACCTGAAAGGGTGGCCGGTTTCAGCGCCAGCGCTAGGGAAAGGCTTCTATGGAGACTTGGTAAATCAGTAATAGCAATGCCGTCGCCATAGGGTGTCTTGTGCACCGTAAACCCGTTGCGTAACCACACATTACCTAGGCCTGATTCGGTGTAGTGGTACATGTTCAGTACGAAAAACCTTTTTTAGCGGTCAAATTAAGGTGCCGGTAGAGGCGCTGCTCCAGCGCCTCGCGGGTGGAGCGGCCTCAATGGCCCCGCGTCCGGCAAGGATTAAGATGCCGCATGAAGCGGGACGGTGGTATCAGAGAATGCCTTAATAGCAGGTAGGACTTTTTCGCGTGTGACCTAGGGATGTCGGTGCAATACTGAATGACCAGCATCCTCAACGTCCAAAGGCTTCCTCCCCGCTGGCTGGAAGGCACCGAGGACCAAGGCCTCAATTCTTACGCACCCCTAGCTTTTCGAACATCTTTCTGTTTCGACGCTTGGCATCTTTGAGCAATCGGTCATATGAAATGATCTCAACATATCCCTGCGCACCGGAAAGCGTTATAAAGTAGGATTGGCCATCAGCAGCTTTGTTGTAGGTCATTTCGATGTGGCTACGCATGGTTGGAGTGATATCCGCTATGAGATACCCAATGTAGCGCATGCTGGCCGAATTAATTGGGCGGCCTTCAATATCATTGACCGTGCCTGCCCTAATTTCATTGAGCAGCCGCGTAATCTGCCCGGTAGGGTCTGAACTGTAGTCATTTCTCCCAGGTCGTTTAAACTCAATGACGACTACTGCACTATCGCCAGGTTCTGCCACTCCAATAGGCGTATCGTAGAAGAAAGCAAAGATATCCGGCTCCTTCCCGGAGGTGCCCTCCAAACCACGAACGCTATTGAGTTTCTTATCCGATGTCAGAAGCGTGTGGAAGGAAAGACGCTCATCAATCATCCAAAGATTCTGCTGATCTAAGAAGATATCCTTTGATGTTGCGCCCATTGGGAAGATCATCTTGTGCAGAATTCGCTCAAGAGGATACTTATCATCTTCATGAGCACGCTTAAGGCTCCGGCCAAATAGATCAATAATGATCCGGCGGTGCGCGATGTAGGCAGCCAGATTGCTCTTGCCAACATCGTTGATGTCCTCCATGAGTTTCTTCATCTGCTGCTCGTAGGCCTCGAATGACTCTTTCTCCATACGTGCCGCAAGGTGACGCTCCTCCTTCCTTACACTATCTTCAATATCGCGCCGAAGATGAAGCAAAGCTTCGTCGAGCTTCTCATCGCTTAGCCCAGGTTGAATCTTCTTTTCGATCAAATGTCGGTAGCGTTCATGCATCAACATCCGATACTCAGGAGCTCCCTCAATGAACCGCTCGATCTGTAAGATCTTCTCTACGTTGGTTGTCTTGAGATCCCCTTTCAATACATCATGCAGCAATTCCGCCAGTGCCTTCGTCAGTAGATCTTTTGAGAGCAGACTGCGGTCAAGGTCAGGAGCGTCATCGGCTTCAAAGAGAATGTTCGTCCGCTCCTGGTTTGTATGCTCGTCCAGATATTTCCCAGTAACCAGCACGATAAGCGTATATGAGTCCTGCTGATCGTCAGTGAGCTTTTCCGGCAATTCAGGCATGATGTTACGCAGTCGAGCCGTCGTGACTTCTCTGCCGTTAGCGCAAAGGTAGTACTCATGTTTGGCACGGGGGGCCTTATTCCGTAATACCTGAACATCAAAGCTGTTCTCGCCGACAGGGAAGGAAAGTTCTTTAATATGCTTCAGAACCGTTTGGTCGTGGTAATCATTGACATTGATCGCCTCAAACCCGGTCGCTTCAAGAATGATCTGCGGGCAAAACCCAGAAGCAAATCGTATCAGAAAGTGCTCAGTGATCCGTTGGACAATTGTTTCTGGACCGCACGCCCATATAGATTGGTACTTCTGGTTTACCCCTTGCATAAAGATAGTTGTTCCAACTGGATCATCCGTGAGTGCCGTTTTATCGGAACCGACCATCTCATCATCAATGCTAAATCGGAATGCACGTTGGTAGCGTTTACCATCGCTATCAAAAATGCTCTTAATCTGCACGTGCTGGAAGACCTTGAGGTACATGAACCTTCCGACGCCTTTGCCACCCAGCTTTATCTTGCTTAGTGTGTATGCCTCACCAAAAGACTTGGTGTTACGCTGAGTGAAGCCAATGCCGTTATCTGATATCTCGAATCCATCGAACATAATTATCTGATCTCGCCCATCCGTCGCCAAGTCATGGCGCTGAATCAACCGGATGCGAATACAGCCATCGTAGGGAGGAATACCCCGTTCATTGATGGCATCAATAGAGTTGGAAATAGCCTCGAAAAGAGGGATCAAACCTCGTGATACGCTTAGATCAATGCGTTCTACTAATCCACGTAAGTCTGTCCTCATTCGATGCCTCCCCTGCATTAGCACGTTAGGAGACGATTTCATCACATTCACACATCTGGGGGCTACTTCTACACTGATACCGCACCGTTTCCTGTTCTGCTCATCAATCTATCCATCATGAAGCTGGAGAGGTGCTATCTGTAATTAAATATTCCCACTATCACACTTAGTGTAAAAGGCATCTATTAAAGTGATGGTGATGACATTTGTTTTCAGAAAATACTATGAGAAAAATATTGCAGAGTGATGGGCATGCCGTACTCGCCAGCGACCATCGCTAACTACTTCTTACAAAGAGCCTCTAAAGAAGGGCGTGCACTCACACCTATGCAGGTGCTGAAGCTCGTCTACATTGCTCATGGTTGGCACCTTGGATTCCGGAAGGAGCCTTTGATTGATGAAGTTGTCGAAGCATGGCGTCACGGCCCTGTTATTAGTTCTCTGTATAGGAAAATGAAGCAATATGGAAGTGGCGGAATTACTGAGTTACTGCCTGTCAATCCGTTTTCTTGGGCCACTGCCCGTGCTTCGAAGATCGACGAAAAATCGGAAGAAATTTTAGATAGTGTTTGGAATAGCTGTGGACACTTTGGCGGTATTCAGTTATCAGAGATGACTCACAAGGAAGGCACTCCGTGGTGGCAGGTACGGAATGGGCCTAAAAGAAAGGAAACTGGTCTTATGAACCTGACTATAAACAACGATCTCATTCAAGAATTCTATGAGCAGAAAATCAAAGCCCATAGCTATGGGAAGCTCTACGAAAAATCTCGATCAAATCAAATCCAGCTTAACAATGAAACAATAATTGAGTGTGTTTAATGATGAGTAAATTGGCTCACAGATTGAACGATTTCCTATCGGGATTCGGGCAGGCGTTTTCGCTGTGGCCTACAGAGTCGCTTGATCGTTACCTGGTACAGGAAAGCCCAGAATCCCGTATCTATGAACACTTCGCGCGTGTTGGCGAGCATATGGAGTCAGCCATGCAAAAGGTGATGGAGGAGCAAAGCCTCATCGAAAAAAATGCGATGGATCAATGTCGCTGATCAATACGCGATACACGCCATGACGCGCAGTCCAGCGGGCAGGTTGTGGTGTTCTCCAAAGCGATCAATGGACACAGTGTGGCCATGCCTGCCCGCCGATTCGGCTGTTGCCGCGTGGGCATGATCTGCCTCTGCTGTTACAGCAATATCGTGCACATGAAACCCTGCGGCCTGCATGCCGATGTTATGGGCATGCTTGCCGTTTGCATCGGTGGTGAATCTGTGTAGATGCCCGCCAGTGACATTGGTCGTGACTTTGCCTTGATTTTGCTCATAAAAACCCATGTCTCTTCCAGAGGCATAGATCGCTCTGAAAGAACCCAAGATATGCGCGTGGTCACCGTCCCAAGAGGTACTGCCTGTATGCTGGTGCAATCCTTGTTCGTCACTCCATGCCTGGTGTAGATGATTGCCCGCCGCTGCTGCGCTGGCAGGGTGTGTATGGCGGCCTGCGGGATGGACTGTGACAGGATGGAGATGCCTACCGCCCTCCTCGGCCGTCGCGGTGTGCGCATGGGAGATCACCTGCCCGCTGGTGAAGGTGCCGACCAAGGCAGGGTCGGCGGTGTGAACGCCGACGGTGCCTTCAAGAAAATTAGGAATATTGAAGGTGCTGACACCATCACCGGCACCATAACTGGTGTTGATTTCCTCAAACAGGCGTGGGTACATGGCACGCGATACAGCGCGGCCATCACACAGCAGCGTGCCGGGTAAGGCGCGTTTGCCTGCGGTGTAGACAATCTGTCCAGGCTCGTACCTGGAGAGTGCCGTCCAGCGGTTGGGTTCATTCTGTAGCGGTGTATCGGTGTTGTTGTCAGCCGTGGACAGATACAGCCCGTAACGGCTTGCGTGATCCGGGCGGTATCGGACGATCACCCCACGCATGTATGAAAATGCCGTGCCGTTATTCTGTTCGGCGGTGATGAATTCAGGGCTGCCGTATTCTTGGTAGCCTTTAAGCACCGTGGTGATGGCGTGCAGCACGGCATTCATGACAGTGCGTTCTACGGGTTTAGCCGTCGGCTCCTTGGTTAAATCTTTTTGATAGTCCGGCCCCCATCCCTGGGTGTAGCTCACAAAGCCGTGACTGTCTTTGGCTTCGGGCACGTGGATCATGTCCCCTTGCTGGGCAAAGGGGGTACGGAAGTAGTGTTCTGTCATGGGTTATCGCTCGGGTGGGTCGCCAAACGCGGCGTGTGTGTAGTTACGGTTGCTGCGCTCGTATCCGAAGGGCAGATGTGTCACAAGGTTGTAGCGAACCCGCACGCCTGCTGGACGTGGCAGGATGTCCAGCGCAGTGATGGCATAGCGGATGACGTCTGAAATCATTGCAGTACTGACAAACACGGTATAGCTCATGTCGTAGTGGTCCAGCACTGCGGCGCTGCCGGGAAAGATGAAGTCAAGCACTTGCTCCATATTGGGCGCGGTGCCGGTCATGTGATTTTTGGCAATCCGGCACTTGATGAGAAAGCGGTAGGCTGCATCGTCCAAGCTTAGATCGTGCCGCACGGGGGGGCGTTCACTGGATAAGACACGGGATTGACCGACATGCTGGCCGATGAGGTCAAGGTGTGTTCCGGTGGCGCGTTCGATATCCAATGTCTGGCGCAGATCGGCTAAGCCGTTCCAGGTGCTGCCGAAGGTATCGCTGATCAATGCAGCGGTGGCGGTGGCCCTGGGTTGGCCCTTGTACTGCCAGATCAACAGGTCCGCATAGCTCATCGCACGACAACCTGCAGATCGTTCATTGCAAAGCGCGCCATGCTTCGCACGTCGATAGGAATATTCTGCTCAGACAACGCTTGGCCTGCTTGACCGATCATCAGCGATGTCACCCAAAAGCCTGGGACGCGATTAATTTGGGTATACAGTCGGCTGCGGTGAACGTGCTCGCCAATCAGAAAGGAGCGCTCGGCCAATGCCTGTTTGATCGCATGGGTATCAATACCGGACGTGCTGCTATCGCGCTCTACTTCGATGCGGGCGGCGCAACGGACCATCGTTGGACGGTCAAAATAGATCTCTCTAGGTTGACCGTGTTTGTTTTTAATCTGTACCCGTACCTCACCACGCATGTTTGTCCCGAGTGTTTTATGGTGATAGATCACTTCAGCAATGGCCTCATCCCGGCCCCCCTCCACAATGACGTTAATGCCGTGGGCGGGGACTCCCGCAGCATCCACGGTATCGGTGAAGTTTTCTAAGCAGACGACGTGGCGCACGTCGGGCAGCCCCCAGAGTGTGGCCTGGATGCTGTCAGCATTGTTGGTGGATGTCTTAGCGCGACTTTTAAAGAAGCGGGCGCGCAGCGCCGCATCGGACTCTTCTTCTGCCCCTGCTTCGGCGTCCTCGGTCGTGAGGGCCGAGTCCCAGCCCAGGGCCACGGTTTCAATGGTCAGGGCGGTGTGTGCGGGGACGTCAACACGGCCTAAGGCGTCGCTGCGAAAGTCTGCATGGGCGTGGCCGGTGGCATCCAAGCGCACGGATGACACGAGCTGCCAGCGGCAGCGATTGGGATCGGAAACAACAGACCCAGCAGGGATCGGGGCATCAGGTGTGCCGCTCAAGGTGACATTGCGTAAGTAGCTGTAGCTGGCTCGCCTGCGGGTGAGGCCCGCATAGGCCACGCGTTGTTCTAGCCACGCGCCGCTGGCGTAATCCGGGTCCAGTTGCCGGTGGATGTCCGTGCCCAGTTCCTCCAGATCGGCTTTGATCTGTGCAATCAGGCCAATCAACTGGCCATCGGGGCTGTCCGGATCAACATTGATATCGTTGCCGTAAATCGAACGGAAGCCTTCTTGCAAGCGGGCAATGATCGTATCCAGCCGCTCGGCTTCGTATCCGCTGGTGGTGACTTTTCCCATGGTTCAAGCACTTGATAAATTAGAAATAAACAACGTCATCCTTAAATCAACAGGCAATTCAGACGTCCCCCATCGTAGTATTTTTTAGACTACAATTGTGACATGATTGAATTGAAGCAGACTGACACCTTCCGCAAGTGGCGGGAGAAACTCAAGGATGCGCGCGCCCGCTCGGCCATCGCCTCGCGCCTCGACCGCTTGGCGTTCGGCCATGTCGGCGACGCGGAGCCAGTAGGGAAAGGTGTCAGCGAGCTTCGCATCAACTACGGCCCCGGTTACCGGGTGTATTTCCAGCGGCGTGGCGACACGATCTACTTGCTGCTTTGCGGCGGTGACAAAGGATCACAAGCGCGCGACATCAAGACTGCGCTGCACCTGTCTGAACAATGGAGCGAATGACCATGACCGAGAAACTGACGAGCTACGATCCAGCCGAGGACTTGACCACTGACCAGGCCATCGCCGATTTCATGGCAGCTGCGTTCGGGACGAACGATCCTGCCTACGTTGCCCACGCGCTGGGCGTCGTTGCCCGCGCCAAGGGCATGACGCAGATCGCCAGCCAAACAGGGCTATCGCGCGAACAGCTCTACCGCTCGTTCAGTGCCGAGGGCAACCCAACGCTGCGCACGACATTGGCCGTGATGAAGGCGCTAGGGATCGAGCTGTCTGCGAAACCGTCGGGTGTTCACTGACCGGATGCGCGACAAGCCTTTAAAGAAATCCTATCTGTTTGTCATCAGCATCCCCTCACAGCGTGGTACTAACGGTCATCGCCTGCTGGTCCACATCCAGCAAGGTGACCTGGATGGTTAAGGTGCGGGTATCCGCGTCCAAGGCCATTGAGAAGGCGGTGAGGCGGCGCACCCCTTCGGTGGTGAGGATGCAGCGCTTGACCTCGCGCTCCAGGTGTACCAGGTCGGCAGGCCGCTCCATTAGCTGCAACCACGGCAGGCCGTGGTCCAGATCCAGGAACCAGTTGCCACGGAAGGAGCGTAGCCGTGTCTTCACCCGCTGTGCCACGGAATCGCTGGCGGCGGCATAGTTGCCGCGCCCGTTGCCGAAGGTCCAATCCCCTTGGCTGTCCACGCGCCGCACTCTCATTGGGCCGGGCCTGTCTGTCCTGGGCCGTTCTCCACGTTGTCGTGGGTGTGTGTCTCCAGGCCGATGCTGTTTGATACGACGTCGCCATGACCACGCAGTCCCTGGGTGAATTCCACGGGAAGATCAAGAACCAGCTTGGTTCCACGCAGCGTGATCACGCCTTGGGTATCCAGTTTGAATGAGGCGCGGCCATCCAGGGTGCGCAGTACCACGCCGTCCATTTCAAACGTTGGAATGACATTGGGCAAGGAGGCAATGCCAACATGCGCCACGGCATCCGACAGGTCATGCAGGCGATAGTCCACAGGCTCGGACGCACGGCCAGACTGGAACCAGGCATCCATGCAGCGATCTTGGAAGATGAGTTCGCATTCATCCCCAGCAGCCACGGGGAAGGTCATCACAAAGCCGCCGCCCCGCGGGAAGGACACCGGCACATCCTGGAGTACCGGTAAGGGCTGAAGGGAGCCATCGTTCCTCTTCTGCTGGATCAACGGCTGTACGGTCGCCGTTTGGGTGACTGGGTTAAAGCGGACGATCTGCCCAGGCAAGGCCACACGCAGGCGCTGTGCCAGCGCTTCGGTACTGCGTTGCAGTACGGCACTGAGGGAGGCGTTATTCCAGTCATCCAGACTCATACAGACGGCCTCACGTTCTGAAAATCACCGCCCACACAGGTCACCGTACTGAACCAGGCTTCGGCCATGACATCGCCCATGTCATGCAGTGAGGTGATTTTGTAGTCGCCGTTGTAGATAGGGATGATCGAGTCCACGCGCACCAGGCCGCCGATGCGCAAGGCCGGATTGAGCAAGGTGGTGATTTTTAATCCATCATCGGTCACTTCGGGGGAGCCCATCATGCCGCTGCTTTGGGACAGCAGCACGGCGTCACCGGCCAGGACGGTATCGGCAGGCAGTAACATCAGTGCGCCATCCTGGATGGACCAGTCCGCGCCATGATTGTTGGCCATTGCATCCAGCAGGTCGCGGGTATTGCCCGACAGGACTTTGCCGCGGGTCAAGCCACGTTGTCCCTGCATCTGGATAGGTCCCAGCTGGGTAGACGGCATGGAGGTACTCAGTGCCCGCAGTACCTGGGCATCGGTCGCCCCTGCGGCCAACGATAAGCAAACATGCCCATGGCGGTAGTCGTGATCGCCATCGCCGCATTCCAGTTCAATGACGTAATCCGTCCCATCACGCCGCACAGAAGGCTTGATGATGTCACCGACAAATAACAGGCGCAGTTCTGCGTAACCGGCGAGCAGCCGGACCCTGTTGTACTGTCGGCTGGTGAGCAAGCTCAGGTGATCGCGGTTGAGATTCCATACGGTGATCTTGGCGGGGTTGGGGGTGGAGTCGCTGGTTTTGCGGATGTCAAAGGCGATGCGCAGGGTGTCGATGGCAATCCCATCGTGGCTGGACCCCAGCTCCAGGCGATACTGGCGGCCAAACTGTTTCATGGGCGGACCTGCTCTTTTAATCCAACAAACAGCAAGCAGCGTTCGCCCAGGTCATCGTGGCGCATCGGGTCCATCTCTAAACCGCTTTCATCTGTCAGCCAAAAGAAGTAATCGACAGGACGCCGCCACAACAGGGGGACGCCCACCACCAGGGGGATGCCTTGCGCCACGGGCTGATCTAGGGTCGCGGTGTACAGGTCCATCGACCAGCAACACGGGACCGGATTCCATCGCAGGATCAAGCGTAGGGACTCGGCACCTACGCGAAACGACTGGGTCTGATACGCGCTGCTATCCACGGGAATCTGCCACATCAGAACAGTCCAGACATCTGACGCAGTAAGGAGCGGTTCTTCTCGGTGTCCACCGGCTTAGGGTGGGTCTGGCCGCTGTGGCGTTGCGCCGCGCCTTGGGAGGCGCTCCTGCCGCGTTTGGGGGCGGGCAATGAGACACCAGAAATCGATGTTGTCTTGACGATGAACAGTTCTCGCACTGTCAGCACGCATTCAATCGAACCATCCTGGGTTTGTCTGGCCGCGATGGAGAGAATCAACATGTCTTGATACGTCTGGACGCCGGTGTGTACCTCCAGGGTCTGTCCGCTGCGTTGTAGATTCCGTAGGGCGGTGTACACCTGGGCAATGCGGCCTGTGGTGCTGGAGTCATCACGGGGGGGGATGGGCTGGTCATCGGGAAGCCAGTCGGCCAGAGGGCGCACGGCGTGCTGGCCGTCGCTCTGGGGTGCAGTGGCTTGGCTGATCACCGAGGGCAGCTCACGTTGGGCCACACGCAGCGCCTGAGCGGTGAAGGGCAGCAGGTCCGTCGGAAATGGGACGCGATCGGTCAGGACACGCAATGGCTCGGCCCCGTGTTCCTCTGCGGCAGGGGCTGGGCTGCGCTGGGGTTGGTAGTCCACCACAATGCCAGCAATGGTGACGGTCTGCGGCATCAGGACGGCGTGATCGCCGATCATCGCGCCAGACTCTATCGGGTTTTCAGTGATGCGCAGCTCGGCTTGGTGGGTTTCTTCCATCACCGCATCCAGGGTGACGGTGCCGACGTGGCGGTGGGTCAGGGTGATCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP020870|1290073:1300244|1299156_1299474_-|WP_046419669.1|DBSCAN-SWA MWQIPVDSSAYQTQSFRVGAESLRLILRWNPVPCCWSMDLYTATLDQPVAQGIPLVVGVPLLWRRPVDYFFWLTDESGLEMDPMRHDDLGERCLLFVGLKEQVRP >NZ_CP020870|1290073:1300244|1297691_1298333_-|WP_046419962.1|DBSCAN-SWA MSLDDWNNASLSAVLQRSTEALAQRLRVALPGQIVRFNPVTQTATVQPLIQQKRNDGSLQPLPVLQDVPVSFPRGGGFVMTFPVAAGDECELIFQDRCMDAWFQSGRASEPVDYRLHDLSDAVAHVGIASLPNVIPTFEMDGVVLRTLDGRASFKLDTQGVITLRGTKLVLDLPVEFTQGLRGHGDVVSNSIGLETHTHDNVENGPGQTGPAQ >NZ_CP020870|1290073:1300244|1290073_1290406_-|WP_046419977.1|DBSCAN-SWA MYHYTESGLGNVWLRNGFTVHKTPYGDGIAITDLPSLHRSLSLALALKPATLSGAEIRFMRKELELSQPEFAARLGATTQTLAAWEEGRAALPNAADKTIRVLINAHYKR >NZ_CP020870|1290073:1300244|1296992_1297286_+|WP_004085592.1|DBSCAN-SWA MTEKLTSYDPAEDLTTDQAIADFMAAAFGTNDPAYVAHALGVVARAKGMTQIASQTGLSREQLYRSFSAEGNPTLRTTLAVMKALGIELSAKPSGVH >NZ_CP020870|1290073:1300244|1293721_1294885_-|WP_046419971.1|tail|DBSCAN-SWA MTEHYFRTPFAQQGDMIHVPEAKDSHGFVSYTQGWGPDYQKDLTKEPTAKPVERTVMNAVLHAITTVLKGYQEYGSPEFITAEQNNGTAFSYMRGVIVRYRPDHASRYGLYLSTADNNTDTPLQNEPNRWTALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIPNFLEGTVGVHTADPALVGTFTSGQVISHAHTATAEEGGRHLHPVTVHPAGRHTHPASAAAAGNHLHQAWSDEQGLHQHTGSTSWDGDHAHILGSFRAIYASGRDMGFYEQNQGKVTTNVTGGHLHRFTTDANGKHAHNIGMQAAGFHVHDIAVTAEADHAHAATAESAGRHGHTVSIDRFGEHHNLPAGLRVMACIAY >NZ_CP020870|1290073:1300244|1296693_1296990_+|WP_004085590.1|DBSCAN-SWA MIELKQTDTFRKWREKLKDARARSAIASRLDRLAFGHVGDAEPVGKGVSELRINYGPGYRVYFQRRGDTIYLLLCGGDKGSQARDIKTALHLSEQWSE >NZ_CP020870|1290073:1300244|1297341_1297695_-|WP_046419963.1|DBSCAN-SWA MRVRRVDSQGDWTFGNGRGNYAAASDSVAQRVKTRLRSFRGNWFLDLDHGLPWLQLMERPADLVHLEREVKRCILTTEGVRRLTAFSMALDADTRTLTIQVTLLDVDQQAMTVSTTL >NZ_CP020870|1290073:1300244|1299473_1300244_-|WP_046419667.1|DBSCAN-SWA MITLTHRHVGTVTLDAVMEETHQAELRITENPIESGAMIGDHAVLMPQTVTIAGIVVDYQPQRSPAPAAEEHGAEPLRVLTDRVPFPTDLLPFTAQALRVAQRELPSVISQATAPQSDGQHAVRPLADWLPDDQPIPPRDDSSTTGRIAQVYTALRNLQRSGQTLEVHTGVQTYQDMLILSIAARQTQDGSIECVLTVRELFIVKTTSISGVSLPAPKRGRSASQGAAQRHSGQTHPKPVDTEKNRSLLRQMSGLF >NZ_CP020870|1290073:1300244|1295445_1296591_-|WP_046419967.1|plate|DBSCAN-SWA MGKVTTSGYEAERLDTIIARLQEGFRSIYGNDINVDPDSPDGQLIGLIAQIKADLEELGTDIHRQLDPDYASGAWLEQRVAYAGLTRRRASYSYLRNVTLSGTPDAPIPAGSVVSDPNRCRWQLVSSVRLDATGHAHADFRSDALGRVDVPAHTALTIETVALGWDSALTTEDAEAGAEEESDAALRARFFKSRAKTSTNNADSIQATLWGLPDVRHVVCLENFTDTVDAAGVPAHGINVIVEGGRDEAIAEVIYHHKTLGTNMRGEVRVQIKNKHGQPREIYFDRPTMVRCAARIEVERDSSTSGIDTHAIKQALAERSFLIGEHVHRSRLYTQINRVPGFWVTSLMIGQAGQALSEQNIPIDVRSMARFAMNDLQVVVR >NZ_CP020870|1290073:1300244|1294888_1295449_-|WP_046419969.1|DBSCAN-SWA MSYADLLIWQYKGQPRATATAALISDTFGSTWNGLADLRQTLDIERATGTHLDLIGQHVGQSRVLSSERPPVRHDLSLDDAAYRFLIKCRIAKNHMTGTAPNMEQVLDFIFPGSAAVLDHYDMSYTVFVSTAMISDVIRYAITALDILPRPAGVRVRYNLVTHLPFGYERSNRNYTHAAFGDPPER >NZ_CP020870|1290073:1300244|1292924_1293494_+|WP_046420067.1|DBSCAN-SWA MPYSPATIANYFLQRASKEGRALTPMQVLKLVYIAHGWHLGFRKEPLIDEVVEAWRHGPVISSLYRKMKQYGSGGITELLPVNPFSWATARASKIDEKSEEILDSVWNSCGHFGGIQLSEMTHKEGTPWWQVRNGPKRKETGLMNLTINNDLIQEFYEQKIKAHSYGKLYEKSRSNQIQLNNETIIECV >NZ_CP020870|1290073:1300244|1293493_1293721_+|WP_046419974.1|DBSCAN-SWA MMSKLAHRLNDFLSGFGQAFSLWPTESLDRYLVQESPESRIYEHFARVGEHMESAMQKVMEEQSLIEKNAMDQCR >NZ_CP020870|1290073:1300244|1298329_1299160_-|WP_046419672.1|DBSCAN-SWA MKQFGRQYRLELGSSHDGIAIDTLRIAFDIRKTSDSTPNPAKITVWNLNRDHLSLLTSRQYNRVRLLAGYAELRLLFVGDIIKPSVRRDGTDYVIELECGDGDHDYRHGHVCLSLAAGATDAQVLRALSTSMPSTQLGPIQMQGQRGLTRGKVLSGNTRDLLDAMANNHGADWSIQDGALMLLPADTVLAGDAVLLSQSSGMMGSPEVTDDGLKITTLLNPALRIGGLVRVDSIIPIYNGDYKITSLHDMGDVMAEAWFSTVTCVGGDFQNVRPSV >NZ_CP020870|1290073:1300244|1290680_1292672_-|WP_046419976.1|DBSCAN-SWA MRTDLRGLVERIDLSVSRGLIPLFEAISNSIDAINERGIPPYDGCIRIRLIQRHDLATDGRDQIIMFDGFEISDNGIGFTQRNTKSFGEAYTLSKIKLGGKGVGRFMYLKVFQHVQIKSIFDSDGKRYQRAFRFSIDDEMVGSDKTALTDDPVGTTIFMQGVNQKYQSIWACGPETIVQRITEHFLIRFASGFCPQIILEATGFEAINVNDYHDQTVLKHIKELSFPVGENSFDVQVLRNKAPRAKHEYYLCANGREVTTARLRNIMPELPEKLTDDQQDSYTLIVLVTGKYLDEHTNQERTNILFEADDAPDLDRSLLSKDLLTKALAELLHDVLKGDLKTTNVEKILQIERFIEGAPEYRMLMHERYRHLIEKKIQPGLSDEKLDEALLHLRRDIEDSVRKEERHLAARMEKESFEAYEQQMKKLMEDINDVGKSNLAAYIAHRRIIIDLFGRSLKRAHEDDKYPLERILHKMIFPMGATSKDIFLDQQNLWMIDERLSFHTLLTSDKKLNSVRGLEGTSGKEPDIFAFFYDTPIGVAEPGDSAVVVIEFKRPGRNDYSSDPTGQITRLLNEIRAGTVNDIEGRPINSASMRYIGYLIADITPTMRSHIEMTYNKAADGQSYFITLSGAQGYVEIISYDRLLKDAKRRNRKMFEKLGVRKN |
14 | Salmonella_phage(25.0%) | plate,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
1303669 : 1323936
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP020870|1303669:1323936|DBSCAN-SWA TTCATGCGTTGCCAAAGCCTTTTTCTAGGGTGATGTCCATCACTTCAAACACCAGTGTCCAGGTTTCTGGATTGTGTCCGGCGCCCCGGGTAAATCCTGGGGGGGTGGTGAAATACCCTTGGGTGGCTGTCACCACGTCCTGATTCAACAGGTCACGGATATCCAGGGTGAAGGGGGTGAAGGACTGGATCGCGCCGCGTTGCTGCGCCAGTCGCCTGCTCAAAAAGGCGTTGTCGGCGCTGTGCTGTTTGATTTTCAACGTTAAGGTGCCGGACCGATCCGCGTTGGCGACAAACACGCCCGTGCCGCTGGCCCCGATGGTGTAGGCACCGGCATCGGCATTGTGTTTGGCGTCGATGACGTCCGTGCCATCGGCCCAGTCTTTGATCTGGGTTCCATTGAGCAGCACCGACACTTGTTTGGGGTCGAAGACGGACATGGGATTCCTTTATCGGTCGAAGTTGATGATGACGTCCACCGCATGGATGGCACCGGCCAGCTTCACGGCGATCTGAAGTGGCGGTGCCCGGCGCGCTTGGCGATCAGAGGTCGATAAGGTGTCCACTGAATCAGCCCAGACATAAAAACCAGCCTCCAGGTAATCGCCCGTGGCCAGCGCACCGAAGGCCTGCCCGTTCCAGAGGCCAGGGGCCAAGGCACCGTTACGGACCCCTTCTTGGCAAACTTTTTTGCAGGCCGCGATCAGCAGGTGGGTGCCTGCATCCGTCAGCGGCACCTTGGTTGGGCTGCGATGCAGGACGGCAAACACCTCCTTTTGCACCGCATCCACCAGCCAATCCAGCAGATGGACTTCATCAAAGAAGCGCCCGCCAAGACAGGTGCCTTCGGCCACCATCGCCACATCATCAACGTAGGCGTAATAGTTGATGCCTAAACGCACGCATTGGGCCACCTGGGTCTGTGTCAATTGATCTGCGGCCACCCCGGGCAGGTGCTTAAATTTCATGGTCAGGGCGGCGTTGTTGGCGCTGAAGTTCACCGATAACGCACGGGCCAACCACGAGATCACCGCGTAGGGGTCTGTGGTGTCGTACAGCACCACGGTACGGTCATGCCCCGATGCATTGAGTTGTCTGAACACATTGGTTTTTTTAAAGTCCAAATGCGCCGCCTCGCGGGTCGTCCATCCCATGATTTTTTTGTCTGCCGCCTGGATCCAGGCGGAGGCGGATCGGATCTGCGTGTCTGTCAATGTCTCATCGGCCACCGCAGCGGCATACCAGCCTGGGGTGAGTGCCTGCAAGGCCGCAAAGGCCTCCGGCAGTGTCTCGGCCTCCATGGTGTCAGCGTTGTTGCCGATGGTCAGGCGGGCCTGATCGGCTTCAAGCTTCAGCCAGTGCCCGACATAGGTGCCAGAGGGACTGCGCTGCTGTGCATAGCCAATGGCGTGATTTCCTCCGGCCACGGCAGCATAGAGTTCAAAGCAGCCATTTAAAAATCGGCAATTCACTCCAAACTCATCCAGTGCGTTATTCAACACACCTGCCACTTGGGAGAAGGAGGTGGCCGTGGTGAAATTCAGCTTGGATAAGGTGACATCCACACCATAGATGCGAATGGAAAAACAGCCGTCATCCACGCCCTTGTACCACGTATCCCCCTGGGCAATCGCTCCGGAGGTGAGTGTCGTTGGGGAGGCGGCAATGTGTTGTTTAAAGCGATTCCAGCGCGCCACCATCAGCTGTTTGGGGCGGGGGCTTTGTGCAAAAAAGCGACCGGTGGCGGCGGCGGTTTTGGAGTAACTTCCGAAGGCGCGTTCTACCTCGTTTTGAGTGCTGGCATCCATGAAGCGTGTTTTGGTATCGACAAACACGCTGCCCGCTTCGGGGGTGAACACGGCCAGCCTCCCAAAGTCACGACGGGGTGCTGACTGGGGCTGTCCATTGAGTTGCACATTGACAATGTTTGAAAGCGCTAGCGCCATTTACTGGGTCTCCGGTGCCGTCATGGTCACGCTGGCGATGTGACCGGTGCGGGTGTGAATATGGATGTCTGCGCTGTCCACAGCAGCCAGGGTGGTCACCACACGGTGGTGGTGGGTGATCTGTAATTCGATCCGGGCGCGGGCTTCATAGCCGCCGCCCACAATGGCCGAGAGGTCTTGGGCAGCCGTGACGGACACCAGGCCCGCACGTAAGGCGCGCAGCCCTGCCGTGCCCGCCTCGCAGGACAGTAACGCCTGTGCCTGCAACACCAGTTCATAGGCGCCCGTGCCGTAGGCATTCACACTGATATGGTGCAGATAGGCACAGGTGATGCTCTGCTGGCGGCCATCAAATACGCAGCACGCCGCTCCCAAGGGGGAAGAACGCAGGCGCTTCACCGTCACAAACGGTGCTGCTCCACAGGGGGCGGCTTGGTCGGCAGGACGGACGGTTCCTTCAGGTAAGGCCAACAGCCGCCGCAGTAGGGTGCGCAGTCCCGTCATGTCGAACGGCGATACCGTGGTAGTAGCCATACTCGGACCAGTTGGAAAGTTGCGCGATGCGCCAGGTGGTGTCCTGGTAGTGCACCAGATCACCAACGCACAGCGCGTCCTGACTCATGATTTTTTTGGAGGGCAAATAGCGCTGCCCTTCTGGAAGCAATTGCAGGTCATCGGGTTTGACCGGATGCAGGATCGCTCGCACAGGGTGCGCAACGCTGTCCTGGGTCCAGGTGCCATCGGCACGATAGTGCCCGTGGTCACGGTGCACCGTGACTGTTTGGGCAAAGCGTGGATTGCCAAACAGCGCGCTAATCTTCAGCATCGCGCACCTCATAGGTGATCGACTGGATCATCTGGCCGGTATCGATCAGGGGCGCGCTGGACCCTTTGCGCTGGATCGTTTGGGGCTTCAGGGGGGCAAGATCAGCGTGGCGGAGCGTCGCCTTGACATCGCCTGCGGCCACCGTCCCTAGTAGGTTCAGGGCAGTGTCTACGCTCATCGCATCACGCAGCACTGCGCGCAGGTGGTGGGTGTGCAGGGCCACATATTTTTCTTGATGCGCGCTGATGGAACGCCGCACCACCGAGCGTTCCGGAATGCCCCGCTCTGGCGCACCCAATTCATGCACCGCCAACAGTCCAGCCGAGCCGATCCCGTCTTCCGTCCGGGCGTTGTGCGCGGCAGGAATGCCCACCACCACAGCGCGCTCCCCCAGCGCCTGAAGCCGCTGCGCCAGGGCCTTCCACTTTTTGGGATCGGCGGGCCGAAGGATCGTGACCGCACTCATGGGGCAACCAAGGCCCCCAGGCCGATCATCCGACGCAGCGCCAGGTAACGTTGTCCATACACCGAGGTGGCTAGCCAAGCGTCACTGGCACTGCCAGAGGGCAGCGCCGCGTAGCTGATGTGCAGATCACCGGCCCGCTCGGACACTACCGCGCCTGTGGTGGCGGCGCTGTCGCCCAGCCCTGGGGTGGACCACACAAAATGGGCCGCCAGGCTCGCGATTCCTTGCGCATACGCCGCTCCCCATCGGGACGCGTCCAGCCAGGGATGGGCATCCTCCAGGGCCTGAGCCACGCGTTCCGGGGGCTGGGTGGCAAACTCCGGATAGCGCGCCAGGAACGTGTGAAGCGTCAGTGCCCCGGACATCATCCCTTCCTTCCGGGTTTGCCAGATTTGCCCGGCGTGCTGTCATGGCTGCCGGTCCCCGGAAGGACAGCCGCGCTCCCTTCACCAGAAACCTGCCCTTCACTGTTGGGTGGGCCCGCCCCCTCAGGGGGGCTGTGCGGCTGGAGCGGCTTCTCCTGCTGTTCCTCCCGCTGGACCGGCTCCTGCTGCTGCTCCACCAAATAGCCATTGTCAAACCACAGGCCAATGCCAGGGTGCTGCCGCAGCTGCTCCACGTGTGCGGCCTCCAGGGCCTGGGTGCGTCCGGCCTGAATCGTCACGCCATCCAGGGTGACATCACAGCTGCGGGTATTCTTGAGCATAATCGTGGTCATGGTGCTGCGTTCTCCCCAAAAAAAAAGCGCCTCAGGGCGCTGGTGTGGTCGGTGTTGAATACAACTCAAATGCCATCGGCATACAGGGCTGATTTTGGATAACGAAACTCCACGCCGCTGTATTTGTATTCACCTGGAATATCAAACGTCAGGCCCTTGGGTTGCGGGGGCAAAAACCGGATGGGCATGGGCAGATGCAGCACCAGCTTGGTGGGGTTCTTGGTATACAGCATGGCGCGGGTTGTGCCACCTTCCCCTGCCGTCTCTAAGCCGTAGCCGGTGCGGACGGTCAGATCAAGGCCACGCTCGGCTTTGGCAATGTTGTTTTCCAGCACGTAATGCAGAATGGTTTTATCGCTGTTGTCACTGCGCGGGGTGGAGACGAGATAGTTCATGACGCTACCGGGCAACAGGACGGTATCGATCATCTCCACATAGTGGGTGTTCATCCAAGCGCTGGAGATCAAGGTGTTGAACAAGGCCAGCACCTGGGCGGGCGACTGACCGATCCAAGGCCCCGCGGTGTTCAGCAGGACCGGCACGCCAGGATGGGTATACAGGCCGGTGAGTTCGTCCTCACCAAACAACGCCACATCGTTGATATGGCGCTCATAGGCATCCATCGCGGCATCGGCCCGCGCGGTATTGAGGGGTTTACGCAGAAAGGCCGACTGGCGCAGTTCCTCGGTGGTGTAATCGTAGCCAATGGTGCCCAGCACCACAGGCACGCTCTTTTGTGCGTAGGCCACATCGACCGTCGGAATATCTTCGCCCCGTCCAGAATGCCGCTTGCCGCGTCCGGAATAGTCATACATTTGATAGGTCACCGAGGTCGCGTACTCGCCCGCTTCGGTGCTGATGGGCACCAAATCCCGGTACTGGATGCCTTGGCGCTGGCGGGCGTAGATCGTCGATTCAACATGGGTCAGTTGCGACACCAAAAACGCCAGCGCTTGGGTGGCATCGGAGGTCTGATACCGTGCATCGGTCAGCAACATCGGGTTCAACGCATCGGCCATCTGACGGCGGCGTATGTCAATCATGTTCATGCAGGTGCCTTATTTAAGAATGCGGATCACGCCCAGCGCTCCGGGGGCGGTGGTGCTGTCCCAGCGGGCCTGGGGATAGGAAATCGTTTCTGAAGCGATGGCGGCGGATCGGGCCGCGCCCAAGGCCCCCGTTCCCGCAATGCGGATAGACACCGGATCATCCGGGCGGCAGCCATCCTCGCAGATCACCCAGATGCGACCGATCTCCAACACCGGCACCATCGCATGGGGCGCATACCGGACCTCTCCGGTCGCATCGGCGACCATCGTGACATGGCGGACACTGATCCCTAGGATGGCGGCGTCTGCCCCATCGGGGGCTTTGCAGGTGGCGTCTTTGGGGCCGCGTGCCACAAACAGGCCAAAATCAATGGGCGTCTGGCCCTCGTTCTTGTAGTTGCACAGGCGGCTGGTGTTCAAGTCGATGACTTGCCCCGCAACGCCAAGATCAAGTAAGCGTCCACCATAGGTGGATAGGTCAATTGCGGACATACGTGTTCCTTCGGGTGCTGAAGTGCTCTAGGTGGCATGGGTGAGCTGTTGGATATACGCCGCTCGCGGGTCCAAGTCGGCATCCGATGTCTTGACGACCTGACGCCGTAACGCCTCATTCACCGCCTCAGCAGCCAGCCCTGCCGAGGCCGTCACCGGCGCGGACGCCAGAACGTGAAACGCCAGGTCCACCGCCGTTTGTGCCGCATCGGCCACCCGGACCCCGCCCAGCAGGGTGTCAATCATGGCCGTATGCGTGGGGTGCAGACGGCTCACCACGTCACGGCGGATCGCGCTGCACGGCTTGCCGTCGGTCACCAGGCCCGGCACCAGCCGCTGGGCATCGCCGATCTGTCGTGACATGGCTGCAATCGCTTGATCCCGCTGGTGCGGGTCTTCGTCCGCAGCGCGGGCCGCTTCCAGCGCCGCCAATTGCTTGGACAGCTCCGCAATCTTGGCCACCAATTGCTCCTTGGTCAGCGCTTGGCCGCTATCCAGTTTGATGGGGGCCTGGGCGGCGTGCAGATCCTCTTCCAGGGCATCCACTTTCTCGGTGGCCGTCTTGAGCTTGGTCGCCAGGTGTTCAACCGCGCTGGCTTCCGTCTCTTCAAGCTCTAGGCTGATACCGTCCACACTAATACGGCGTTTGGTCATGGGGTGTTCTCCAAAGGATGGGGGTAAGGCTATGTCGCGATCGGCCACGCGGCACTGGGGTCCAGCACGGCCTGCGGCAACGGTGGCAATGTGGTTGCCACGAATCCGAATCTGTTTCACCTCGTACGCCTCGCCCTCCGGGGTCCAGCCCGGGGTCCAGTCGTACTCGGCGCTGTAGCCGCCGGACAGTTCTTGTTTGCCAGCTTCAATCTTTTCGATGGTCGCCTCATCGGTAATCGTGAGATCGGCCACCAGATACTCCCCTTCGCGCCGTGGATTGCGGGCAAAGCCCACCGCATGGGCGCGCCAGTTCTCGGCGGTCACCTCCTCATCCGGATGCTCATCGGTGATCGGGCGACCATCAAAGCTGGCGATGGCCTCGGCAGCAAACACTTCTTCAGGGGGGCGGTAGACGCGAATCACCCGCTGGGGATCGGCATCGCTCACCCCCAGTTCGTGGGCGGCATAGTGCTGGATGCCGGTGCGCGCAAATCGGGCAGGCACGATCAGATACCCTTCCGGCGTCTTGCGACGTTGGGTCAGTTGGACATCCAAGGTGATCATCAAGGCCCTTCCAGCGTCACATTCGGAATCGCCACACAGCGGCAGTTGTAGTCCTGTCCCGGATGCCCCGTCGCGGGGGGATCGCTCCAGCGGAACACGTTCCCATCATGGGCCGCATGATCCTCACGCACCCGTTCATCCCCGCTGGTCTGCCAGGTGTAGGTCGTTATGCCCAACCCCACTTGCCGGATTTCATTCAACGCCGCATTCATTTTTGATGTCTGATCCCGTGCAATAAACTTCGCTCGTGATGCCGTGGCATCGGTGATCTGTTCCATCTCCTTGGCCACGACGCTGGCGCGTTTGCCCTGCATGACGCCTTGCAACACAGCGGTACCGATCTTGTCGAAATACTGTCGCTGAATGGAGGTGATCAACTGGACATTGACGGCACGGGCCGCGTGTATCTGGTCCCGCACCTGCTGGGCCAGCATCCATGACGTGATGTCGATGCCGAAGGCGGTACGCACCGCGCTGCCAATGGTCTGTACGACCTGACGATCCACACGCTGCACCTGCTGCGCCGCGATCCGCTCAGCCCATTGAGGCAAGCCACCACAGCGCAACGCCGCCCGCCGCAAGGCCGCCTCAATGGCCTGCATAAACTGCGCTGCCAGATAGCCCTGTGGGGCGCTGCCGTCAGGCGCATCACGGGTCATGTGGGGCGGCGATGCGTGGAGCACCGGCAGCACCTCCTCCCGCACCGCCTGGTGCAGCACCCGCACCAAGGCCAGCAGTTCATTCCTATACGTGGCCTCAGCCTGGCGGCTGGGGCGCGGCGGGCGTAACTGCCGCTGCCTGATCCGGCGTCCCTGCAAGCGCAGTAGGTCCGGTAATGTCAACATCCTCAGTGAACCTAAATTCCGTGGGCCGACAAAGATGTCCTTGCGCTCATACAAAGTTTTGTATGATGGCGCATGGTAGGACCCAAACCGATTGAATTCAGAGGCAGCGCTCTTGACGACTTACGCACTTTTCCAGTGAGCGTAAGACGTGAGGCCGGGTACCAGCTTCACCAAGTGCAAAACGGACGCGACGCTGACGACTGGAAGCCCATGCCCACGGTAGGGCGTGGAGTCCGCGAGATTCGCATCCGTGACGCAGACGGCGCTTTCCGCGTTATCTACGTCGCCACGCTGCCCGAGGCTGTCTATGTGTTGCATTGCTTCCAAAAGAAAACTGAGAAAACCACCAAAGGCGATCTTGATGTAGCGGCTAAACGCTACCGTGATCTGTTTAATGAGGTAGGACAATGAGCAACGAGCGATTCACCAGTGTGTGGGATGCCATTGAGGACACTCCCGAAGCCGCCGAAAACATGAAGTTACGTTCCACACTCATGATGGCCCTGAAACAACACATCGAAACGGCTGCGCTGAGTCAGTCTCAAGCCGCTACGCTGTTCGGTGTCACGCAGCCTCGCGTGTCAGATTTAATGCGCGGCAAAATCAACCTGTTCGGCTTGGATGCACTGGTCAACATGGCTGCGGCGGCTGGCATGCATGTGGAAATGCGCGTGCTGAAAGCGGCGTGAGTGCTTCGCCGGTTTTTTATCTGACCACTGCATGATTCCAATCGCAGCCTATGTATTGGAAGCGGGTGCCGTCGTTTCTACCACTGGCAACACATCTGGAACCTCCATCGCCTGAGACAGTTCCGCCGCCAGCGTCACATCGCGTTCGGTGATCTTTGAATAAGTTTTTTGTTCCAGCAGCTCCGCACAGGGCACGTCTGGACCGATGACGCCATGCGTCAAGTAAATCTGATCACGCTCGGCGCGTAGCTTCTCAATGGTTGCCTGTTCTGTCTCGCTCATCTGCCATAGCGAATTGAACTGAATCTCTAAATCCTGCGGACACTCCCCCACAGACGCCCGAAACAGCACGGCGTACAACACCCGCAGCACAGGCCGCAGCTCGTCTTCCTGCTGCGCCTTGATGCGGTCGTAATAATTGCGAATATCACTGTCGCCGGTGGCGTTCATGCCTTTGGGGGACTGACCGAACAACCGGGTTGCCGGAATATCCGCCGCCCCTGAAATATCCATCATGAATTGCTCAATCACATCCTTCACACCCGCAAAGTGATTGGTTTTCTGGGTGTATTCATCCTTAGCATCCAGCAGCAGCATCCGATTGAATGATTTCATCATGGCCGCTAACTGAAAGCGCTTGTGTACCTCTTGCGTCCCTTGGTCCGAGGCGAGCGTGTCGCTGAGTCCAGAGATCCGCAATACATCCACCACCGCCTCAAAAAACATCGACGCCGTGCCCTGGGTCGCGGTGTCATAGCGGCTGAGCGCGTTATACATGGCCTGCAATACCGAATCATGCCAGTAGCCGTTGCCTCTGAATGCCTCCCAGGGCAGTTCCGCTCCAGAGAAGGCAATCATTCGGGAATGGTCCACCCGCTCCACCGATCCGGCAATCTGATAACAGCGCGGTTGCCCGTAGGTCTCACTCAAGGGGTCCTGGTCCATCTGACCACTGCCCAGCGCCACCCGCCAGCGATCCAACACCGTCAGCGATAGCCTGGTCCCCGGCATGACCGAGGCCGGATCAAACGGCAAGCACGGGTCTTGCCCATGCACGTTGATAAACAGCACCGCACCCCCGTACAACCGGGCCCAGGCCAAGGCATCGCGCACCTTGGCGCGTACGTTCAACGCCTGTTCCAGACGATGCATCGGCTCCAGCGCATCGGCGTGCAGCGCCGTATTCAACGTCACCCATTCCCGCGTCATGTCCGTCGCTGGAATATCCACCACCTTGCGCGCCAGCCAATTGGTCCGGTACATCGCCTCCAGTTCTACACGATCAATCACCCGGGGCAGCAGGTACCGCCCATAGCTCATCTTGTCGCGCTGATCGCCTAATCCGGCCACCAGGTTCTGCAAGGTGTCCACGACATGCTGAGGCGCCGCCCCCCGTGTGGCCCGCGTAGCGCGCTTGTTTCGGTTCTGCTGACTCACACCCAGCGACTCCAATCACTTGCAGGATTGGCCAGCAAATCGTTGATTGCATCCACCATCGGATCAATCTGATCATCATGGGCGTGCGTGCCATCAGCGGTGAACGCTTCACACTCGGCCACAAAATCCTTCACCCACGCCGCCTGCGCTGGAATCACCACCCACCCCGCATCAATGTAAGACACCACATCCATCACCCGCGTGAGCTTGTCGGTCACCCGTGCAATCCCAGTCACCGGAATACGCCCCTGACCAGCGCCACCTCTGGCGATGTCCTGAATTAAGCCCGTGCCGCTAGATTTGTCCTCAATCTTCATCTGACGGATCGGAGCCGATACCTTATGGTCGTAGGCGCGATGCGCATTCCAAAAATCAATCGCCCGCCGCTTGAGTTCCGGCGCTTCCCACTTGCCGCGAATCATGTCCAACAAATAAATACGCTTGTCCTCACCCAAGCCCCACAACTGGAACACGCTGTAATCGTTACGCTCAGCCGTCTTCTGCGCCGTATCGCCATACACCGTGCGCGAGAGAATGCGCGGCAGCACCGTATAGCGCCCAAATTGATCCCCTTTGATGATCCCACCGCCCAGCGGACTGGGGCGCTGCTGATATTGACCGCTGAACACATAGCGGTCCGTGGCTTCCAACGCCAGCAACTCGGCTAACGGTTCTTTGTACGGCCAGTAGCTGTAGCGTCCGTCCTGGTCCTGCACATCACGCACCACCTGCCCTTGGACGTGCTCCGGCAAACGGGACACGTAGGCATCATCAATCAATGCCGGAATCTCAATACACTCCCACGCCCCCGGGAATCCCCCAGACTGGATGAACCCCGTTGGATCGTCCTGCGCCAACCGTTGCATGATCACAATGATCGGCGTGTCCGGACTGGCTTTACGACTCTTCACCGTGGACACCAGCTTACGGTTGGCCTTACTGCGTCCGGTCTTGCTGTAGGCATCTTCTACCTTCAGCGGGTCATCAATAATGATCGCCCCCTGCCATCCCGGGGCCATGTGTCCGGCCCGAAACCCCGTCACCTGTCCGCCCAGACTCACCGCGTACACCCCACCGGCTTTCTTGCCATCCACCACCACATTCCAGCGCTTCTTGGACTTGGCATCGTCAGCAATCTCCAACGGCCACAACGCACGATATTCATCAGACTGCACAATCTCCCGCGCCGTCTCCGAATTCAGCAGCGCCAAATCATCCGAATAACTAATATGCAAAAACCGCGCATACGGATTCAGCGCCAACCCTCGCGCCATCAAATTAATCGCCACAAGCTCCGTTTTCGATGATCCAGGAGGCACGTTAATCACCACATCCTTGCGCCGCCCTGCAATCACATCATCCACCACACCAGCAATCACTTGATGGTGCCAATTCACCCTAAACCGCAGTTGCTGACGCTGTTTGAAAAAATACCGTGTGAAAAATAAATGATCTGCTTCGCACCTGGCCTTGATCACCGCTTGATCAATGGCCTGTTCAGTACTTAGCCTCAAGCCGCTTGAGGGCCGAGACGATTTGCTTTTCATCAACTAACGCCAATCCCACCTTCTGTTCAATCGCCTCCCCATCGCGACCGGACACCTCAACGCGTTGCGTGTCTTTCCAGCCAGCGCGTGTCCTCAGCCAAAAGATGATGGCTGTAATATTCGGATTGGTGCTATGCGTGGCTAACCGGAACAAACTTTTAGCCACCTTTGCATTGGCTTGGATATGCCCAGTATCTAACTCCACGCGGTAGTGCTTGCGCAGCGTCGGCGCACTGATTTGCATCACCAAGGCAATCTCCGCATGCGGTATGCCAAACGACGTCAATTGTTTTGCCAGCAGGCGATTCTTATCCGTTGGCACATGTGATTTTCTTCCAGTCTCTGCCATCAACGATCTCTTTAGGAAAAATAAAACATCGGCGTTTTTTTCGTTTCATCGCCCTTCTTTTAATAGACGAAAAAAACTAAAAAAACTGGCCTCATTCGCCCTCTTCCCCGCAGCCCTGCGGTGACCACGGCTGCCCGACGCTGGCCGCCTGTCCCCATGCCGCAACACAGCGCACTTGCGCCGCACAGTGTTCATACGCCAAACGCCATCCCAGCGTTTGGCTCAGTAAGTCGCGGACGGTCTCTACACGCGGCAATGGCCGCGCCTCACACGGCTGCAACAACCCCTGCGGCGGTGCGATGACTTCAACGCGGGTCTGTATGACGGTGACTGGCTTGACAGGAACCCCCGTCGAGCAAGCACCCAAGCACGTCAGGCACATCACTATCCAAAAAAGCTTTCGCCTCATCATGAAAGTGCTCCAAATGCGTAATGCGCTGCCGCAACACACGGTCGCGCACCGTGATCCGATTCAAATCCGTATGCAGCCCCGCAATCGCCTGCCTGTCGATCTCACGTAACGCACGCAGCCGCGCAATCGCCGCATCTTGATCGCGATTCATCGCAACCTGCGCATCCAACGTGCTTTCCACCGTCGCTAACTGGCCTTCCAGCTGCGCCGCACGTTGCGCTAATTGACTGCGCTCGGACCACGCCAGCACCGCATGTGCCACCAGCGCCACCAACGCACCAATCATCATGTACTCAACCAACAGCCGCACACTGGGCAAACCTCGCCCCACACGGCGCAGTGTATTAACGATCATCCGAACTCCTTCTCCCCGTGCCAAGCTTGGGCACAATCACGTTCTGAATCAGTTCTAAGGTCTGCGGTGTATCAATTAAACCGGAGGCAATCACCGCCGCCACCGTGAACGCTTGGCTCATCTCCAACCATTCACATACACACATCACGAATAAGCCGACAAACCCCGCAATCCCCGCCTCAATCAACACGCGGGAGACGGCCAGCCTCTCCTTAGCGTCCAGTGCACGCATTAAATAACTCAGCGTCCCCGTGGCCATCGCCAGGCACACATAAAACGCCTCCTTCCACCAGGCAGGAAGCGCCGCAATATCAATCACGTGATGCCTTCTTCACCGCCGCATGTTCGGCTGCTAATGCGGCCTGCCAATCTTGACCTTCAAATAGCCAACGTTCGGCTTTTCGACGCCGAACTAAGCCGGGAAGCACACGACCGCTTGAAAACTTCCACGCCCCAAACTGCTCCGCCGCACCAGCCACATCACCGGCATTGAGCTTGCGTAACAGCGTCGAGCGGTGAAACGCACCCGTACCTATGTTGAAGCTCAACGACACCAACGCATCGAACTGATGCTGCTTGAGTGGCACACGCACATAACGCCGCACCGCTGGCTCAAACTCCTTGGCCAATCGAGCACGTAACCGCGCATCGGCTTCCTGCTCATTGGCAAGACGCATATCAGGCCTAACATGCTTGCCCGTCTCGCCATAGCCAATGGTCAATACTCCCCCAGGACAGGTGTACGGGTTCAACTTGCAACCCTCAAAAAACTTGATCAGTGCAATGCCTTCTTCACCAATGGTCTGCATGGGGGGACTCCAGAACGCAAAAAACCGCCCGAAGGCGGTCGCTGGCTTTGAATAAAAAAAAAGCCCTGCTGAGGTGGGCAGGGCGCGAGTAAATCATTCGATGAGGGCATAGCACCACTCAGGCGCGTAGATTAGGGGGGAAAGTGTTGCAGTATCAATGCAACACTACGCGACGACTCAGGGGAGTTGCATCGCTACAGATACTTATTGGTTGAGCGGATGCTCGGCGATGAACTGACGCAAGGCCGCATTCACACGAGTTTGCCAACCCTTGCCTGTGGCCTTGAAGGCTTCCAGCAGATCAGCATCCAGGCGAATGGCAGTGAACACCTTGGTTTGGTCTGCCTTCGGGCGACCTCGGGGGCGCTTCATGGCGACCAAGGCAGTGTATATCTCAGGGGAGAATGCTTCACAGGCAAGCTTGGCCCCCTTATGCCATTGGCTATCGAGTTCACGCGCATCTACATCAGCGGCGATGCCCGCATTAATTACCTCTGTTTCGTCGTGAGTGGGGATCATCGTCCCCTGTTTAAGTGTCGGCATAGCTTTTCACCTCTCTTGAGTTGGCCTTACGCAGACTGATGACCCGTACAGCATCACCACGAAGGCAAAACACCATCACATGCAGACGGTTGCCGATATACCCCTTTGCTTCAAAACGGGGTTCTGCATACTGTTTACGTGTGTCTTCACGAACCACGGCGGTTTCCCACTCAAAACCATCGGCATCAGCGAGCGACAATCCATGCTTGTCAAGATTGCTTTCGCTTTTAGCAGAATCAAATTCGTAGTTCATTTAAATTATTGTATAAATAATAAATCGACGTTACCAAGCTTTTTATATAAACAGTTAAGTTAATTTAACTAACGCTGCACCTAACAACTCATCTCGGGAAAGCCCCTGACTCATACGTTCAATGGTTGCCTTTGCAACATCATACAAATGCCAGGATGGAAAGTGAGGGTAAGTACATCGTTAGCCATCACGCCACATTTCTGTGCAACGCCTCTTGCAACTGCGTCGCCGCCTGTCGTTCCGCAACACCCATCCGCTCCAACAGCCACTCGTACACGCCACACCATGCTCTGCGGTAGGTGGATTCATCCCGGCCAATCGCAGCGGCGCGCTTGCAATCACTGGCGGGAACCGCACCACTCCCCCCGCACGCCGTGCACACCTTCACTAACGCCCCTACACGTCGTTCCCCCCGGCCATGACAGCAGGGGCATAACTGTGGCTTAGACAGCTCACCCACCACCGCCGCAACCAGTGCCGGTAACATCTCCAACGTCGCCTGCGGCCACAGGTGGGCTTTGACCTTGTCCAGCTGTTCCTGCGCACGCCTCAGCGCAGCCTGCTGTGCGCTTGTCGTTGCTCGGGTCCACCCCATGCACGCTTTGACAATGCCCACATCGGTACGCGCTTCCAGCAAGCGCTGCTGCTGCCGTCGAATCTCCGGCACCACCAAGGCCACCGCCGCATCGCGCAAGGGGCTACGGCGCAACGCTGCGCCATCCGGCCACCAGCACGCTTCCAGTACCTCACGCCCCAACCCCGCAGGCACCAGCCCCAGGGCATGGGCAATGTCTTGCGCTGTCAACTCAGGCACTCCACCAGGCAGCGTGTCGTAGCGGATCGTGCTCGGGTTCAAACGAGCCAGTAAGCGGCGCGGATCAGTCATGGCGGATGTCCTGTGGGTTTTACGTCAAAAGCGGTGACCAGAAGCAGTCACCAGGCGCTTCATAGCGCTCAGGTAGGCATCACCTGAATAAACGTTGGTCACACGGCTATCACCACCGATGGGGAGGATCAGAATGCAGGATTGGCGTGCAGGCGTGACCCTCCGTTGCTTGCTTGCTGCGTGATGCCCGTGATGGTGATGACCACCTGTCCGGCGTGGCGTACCTCATCCTCAACCAACGGATGGGATACAAAACGCCGATCATCAATGCCCAGGGCATCGGCAATGCCATCCCGGTACGGCTTAAACCGCGCCAGCATGTTGTCATCGTCAGGCAGGCAGCGTGTGGGCGGATAGAAGCTAATCCATAGATCCAGGCGCCCCTCAACAGGGAGTGACAGGCCACCCCATCCGGCACGCCGTGCCATCACCTCGGCGTAGCCTCTGGCCTGTTTTACGGCTTTGCTGCGCCGTGTCCAATGCACCCGTGCGTTCGGTGACAGGTCCTTGGACGGCCACGGCAATGTTAAAGATTGCATCTGCTTCACCCCTGATCGTCTCTGTATTGACTCCAGCGCATGCGGGCTTTGCTGGGTGCAGATACCACGGAAACGCGGCCTTGAGGTGGTTCGCCTTCGTAATCGTCAATGTATCCGTAGGCCAGTCTGCTGCGTGCCCACACCCGGCCTGTCTGCCCTTCACGTTGTTTGGCAATGTTGATTTCCAGAAAACCGGGGTACTCGCTGGCACGATCTTCCTGTTCGGCGTAATAGTCGTCGCGGTACAAAAATACGATCAGGTCGGCGTCCTGTTCGATGTTTCCAGATTCACGTAGGTCCTTCATCACCGGTCGCTTGTTCTGCCGTGCTTCAACACCGCGATTCAACTGCGCCAGCAACACCACCGGACAGCCAAGCTCCTTACCTAATCCCTTAAGATCGCGGGTGATTTCGCCAATCTCCACCGTTTCACGCGTCTTTCCAGGCAGAGGCATCAGGTGCAGGTGGTCAATGATGATCAAGTCCACCGGCTGACGCAGATGCTCTCGGCGCGCACGCGCAATGATCTGCTCGCGATTGAGTCCAGGCGTGTCATCAATCATGAGTCCGGCATCACGCATCCGGCGCATCCCTTCAGTGACCTGACTCCAGAACATTTCACTGTCGGGGCAGTCGTCGTTAGGCTCACGAAGCCATTGCAATGGCACGTTCATGACCGAAGCTATACAGCGGTTAAAAATACTGACATCGGTCATTTCCAGGTTGAAAAACAGTACCCGCTTGCCATGCAATGCGTTTGCCGTTGCCACATTCACCGCCCAAGCGCTTTTCCCCATCCCCGGTCTGGCCGCAAGAATGATCAGCTGGCCGGGGGAGAGTCCGCCTGTCATCGCATTGAACTTCCCCCACGGGGTCGGCAGTCCGTACAACCGTCCCTTGTCACTGTAACGGCACTGCAAATCGTCGAACCAGCGCCGTGCGACTTCCTGCATCGTCTTGATGCCGCCGATGCGTGGACGATCCGCAAGCCGAGCAATTGCGTGTTCAGCCTCGGCGATCAGGTCGCGTGTCTCCCGACCTTCAGGCTGAAACCCGGCATCCTGAAGACGGGTGCCCACGTCGATCAACTCACGCAACCTCGCCTTGTCCACAACAATTTCAGCGTAGGCCACAATGTTAGCCGCCGAGGGCGTGGTACTCGCCAATTCGATCAGATACGCACCACCGTCCACCTCTGCACTCAGCCCTTGCGAGTGGAACCACTCCATCAAGGTCACGGCATCGCAAGGCTGACGCTTGCTGTCCAATTCCAGGATGGCGCGATAGATCAACTGGTGATCACGGCGGTAGAAGTTTTCTGGCGTGATCCAGTCAGCGATCTTGACCAGCATTTCCGGTGCCAGCATCAGCCCCCCCAACACCGCCTGCTCAGCCTCCAACGACCACGGCGGCACCCGTAGTTCCCCAGTACAACGTTCATCGAAAGCGTAAAGACGAGCGTTCACTGGCAGCGCTCCTCGCGATCTATCGCCGTTTCAAAAACCTTCGTCACCACCTCGTTTTTCATCAGGTAGTCAAAGTCTGGCCGCCAGTCTGGTGGCTTGTACGGACCAGTACCGTTGAGGAAAGGATTTTCCTGACATTCGTCCAGGTATGCCTGAAAGAACTCAAGGCTGCGTCGTTGTGGCGACGCCTGCCATGCCGACCGGATCAACGTGCGCCGCTTCTGCGTCAGTTCCCTGACTTTGGGTAACCCCTTCATCGTCACGTTGAAGGCATCCACGATCCCCTGGTACGGAATCCTGTCGCCCCAAGGCGGTTTTTTCTGATTTTGATGTTCCTCCACGATGTCAGGTGCTTCGCTGTCAAGCGATGCACTCAAAATACCGTCAGGTATTTTGTTGTTTTTAGTTTTTATATCTTCTCTTCTCTTCTCTAGTCCCTACTTTTGTCCCCGTTACATCGGGACATTTTTCAGGACACTTACTACGTTGCAATCGCTTCTTCTTGGCATCCGAAGCTCGCGTCTTCGCCGTGGTTCCGTTATGACGGCCAAGGTTTGGGAAGATGAGGCGGCCACCGGAACGGACTGATATCCAGCAGATTTCGGATGAAGCCAACATCTCAGCTAAACCAGGGAGGCCACACAAAGCACTCAGGAAGGGCAAGGGGTCACTACCAAGTACAACACTTGCATCGCCCGTTTCCGGGTCAATGCAGGATTCGGAAATGTTGTCGTCACACCATTCCCAAATTTTCATAATGCGTCCTGCAATCTCATATCTGTCCCGTTGCAATCGGGACGCAGCAAGAACGACTTCGCGTTTATCCGCCAAGCCCTTGCTCCATTTGATCCAGTCACCGGCCATCAGTCCGAAACATCCTGATTTTCAGAACCTGCGAGAAACACAGCAGGTTCTTCTTCGGTTTCAGCATCTGAGAGGTCAGCCGTCTTGTGCAGACTGGTGCAGTCCAACATTTGCAAATACGGCTTTCCATCTTCGTTGTAAAGAGCGATCAATCCGGCCTTCTCGCACGCAGCAATCCAACGGGAAATGTCGGCATGCCGAACTCGCTTCAACGGATACAGGGATGCTCTCAGCACCGACGGGCGACCGTCATGCAATCCGCAGTCATCTACCTTGGACATCAGACGCCGGTAAAATATTTCAGCAGGAAAATCCAACGCATTGACGCGCTCGCTGCTCAAGATTTCTTCGTGGATGACAATGTTGACCATCTCACTGCCTTCTCATCTCGAACGTGTCCGCCGCAATGCACGCGAACTGTTCCAGGTGATGCTTTGTAATCCACGTTTTCTGCGCTAAATGTTCGATCCAGCGCAGCGCCTCCCTGATGTCGGAGGGATGAATCTCATAAACAATGTTTCCATCTTGCACAAACCCGTAGAACCCATCCGGCAACTGGCGAACAACCTTCCCCACCACTGCTGCCTCCGAGTTACGTACATCATCGCCACGAGGGCGTGCGTAATAATTCATGCCGCCTCCCTCAGACACGCGATGACGTCTGGATTCCACAAGAGTTGATAACTGCTGTGCCCGTTGCGTGAGTACGGAATGGCTTCACACCACACGCGACCGGCTTCGGTTAATTCCCACTCGTCGCGTTCATTACGGAACTGAAAGCCTCTGGAGGCTAATAATTGGTTCACCGCCTTGGCCGAGCAATGCAGCCGCTTACCTAGTTGCGTGGCGTTGAGCAGGCAAAGCGGTTCCTGCAACGCAGGCAATGCGCGGCGTATTTCTTCTGTCGTTAAATTCGTATTGCTTTTGATACAGGCCAAGGTCGCGGCGGCAGCAATCCCTGGTTTCACGCCCGGCACTTTAGAAACGAATTGGCCGACTAACAGGAGCGCGGCAACGCAATCCTGCGTCGGCCCAGGCAAGGTGGGCAGCGCCCCCGGTGTGGAGTACGTGCCTGTTTTGCGCAGGGTGGGCAGCACTTCACTGGTGACCCATCGTTTAAAGCGCTTGGCCGCAGGTTTTGTACTGCCCAGGATCAAGGCATACAAACCGGATTCGTTGATGTGATTAGCGCGTTGGGTGCGCCCTAGGGTGTCGATGACCTCCAACTTCTGGAGGTCATCTACATCCACGTGGGATTCAATTGCTTGATGCGCATTGCCAAACTCCAGAACGGCACAAACATCATTGGCGTTAAACCAGGGCGCACTGGCCTCGTCGAGCTGAATACGCACATCTTTTGATTCGAATTGAAATGGAGTGATTGCGTTGGCGGAATGAGTGGAGTGCTCAAGCGCGGGACGGTTGCCTGTCTTGCGGATGGTTGGCAGCACTCCCTCAAACACCCAACGCTCGAACCGTTCTGCGGCAGGAAGTTTGCAGCCCGCAATCAATCGGAACATGTCCGGTTCGGAGATGATCCGGAAGTGCTGCAAGCGGCCAAGGCTATCTGGAAGTGGGTAGCGTTTTGCTACCCCCCTGCAATGATCGCCCATAGCTTTGTTATGGTTGGTGTAGCCGAGTACATCGGCAACGTCTTTGCCGACAAACCACACTTCACCGTGATCATCAACCACGGTTCGCACGGCTTGCGATTCAAATTGAAATGGGGTGATTGCATTCATTCCAACTCCCGGTTCATTACTGATGTCGTTCATTAGAGATCCCTGCTTGTATTGCCCATGACCTTTTGGGCGGCCTGTTCCATGTGCGGGCCGAGGGCTTTTTCTGAGAACGCTTTACTCATGCGCTCAATCGCGGCTTTTGCGATATCTGGTAGATGCTTGCTTGGAAACGTTGGGTGTCTGACCCAATCGGCTACTTCGTCGCTAGTAAGAAAATTGGTATGCAGTGTTACGGGTGTCAGCCGTAGCCTGAAGTCAGGACCGAAATCCAGTACCCATCGATGCCATCTCAATAGTTCGACGAGTGCCGATGTGGCAATGGCTTTACTATCTTCCAACCCTTTTGGAAATTCAGGCTCTTTTATTCCTGCGGACTCAAGTACCCGTCTGACATCGCTAATCCAACGGCTATGAGCGGTGTTTAACTCCTCCATTTGCTTACGGTCTTTGGAGTAGCTGCGCATCTGGTCGTATTCCAGATCGGGATTCACTGTGTACTCGCCCGTCTTGCGGATCGAGGGAAGGACTTCGCCAGCTAACCACTTCTGGAATGGCAGCGCCTTTGGTTTGTCGCTACGGCCAAGAAAGAAGTACAAACCAGGTTCAGAAATAACGATGACTTCTTGTTGCCCAGAGGGGGTGGAAACGGATTCCACCCCTCTCCACTCTGAGGGAACATGACTAATGCTTTTGCTAGCGTTCCAGCGATACTCCAATGCTTTGGCAACGTCTTTAGCAACAAACCACGGATTGCCATCGCGCATCACAACGCGCACAGCGTGGGAATGAAAATCGAACGGAATAATGGACTGCGACAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP020870|1303669:1323936|1307269_1307623_-|WP_046419651.1|DBSCAN-SWA MTTIMLKNTRSCDVTLDGVTIQAGRTQALEAAHVEQLRQHPGIGLWFDNGYLVEQQQEPVQREEQQEKPLQPHSPPEGAGPPNSEGQVSGEGSAAVLPGTGSHDSTPGKSGKPGRKG >NZ_CP020870|1303669:1323936|1306070_1306439_-|WP_046419655.1|DBSCAN-SWA MLKISALFGNPRFAQTVTVHRDHGHYRADGTWTQDSVAHPVRAILHPVKPDDLQLLPEGQRYLPSKKIMSQDALCVGDLVHYQDTTWRIAQLSNWSEYGYYHGIAVRHDGTAHPTAAAVGLT >NZ_CP020870|1303669:1323936|1305613_1306084_-|WP_080939589.1|DBSCAN-SWA MALPEGTVRPADQAAPCGAAPFVTVKRLRSSPLGAACCVFDGRQQSITCAYLHHISVNAYGTGAYELVLQAQALLSCEAGTAGLRALRAGLVSVTAAQDLSAIVGGGYEARARIELQITHHHRVVTTLAAVDSADIHIHTRTGHIASVTMTAPETQ >NZ_CP020870|1303669:1323936|1316005_1316335_-|WP_046419634.1|DBSCAN-SWA MIDIAALPAWWKEAFYVCLAMATGTLSYLMRALDAKERLAVSRVLIEAGIAGFVGLFVMCVCEWLEMSQAFTVAAVIASGLIDTPQTLELIQNVIVPKLGTGRRSSDDR >NZ_CP020870|1303669:1323936|1310381_1311227_-|WP_046419646.1|head|DBSCAN-SWA MLTLPDLLRLQGRRIRQRQLRPPRPSRQAEATYRNELLALVRVLHQAVREEVLPVLHASPPHMTRDAPDGSAPQGYLAAQFMQAIEAALRRAALRCGGLPQWAERIAAQQVQRVDRQVVQTIGSAVRTAFGIDITSWMLAQQVRDQIHAARAVNVQLITSIQRQYFDKIGTAVLQGVMQGKRASVVAKEMEQITDATASRAKFIARDQTSKMNAALNEIRQVGLGITTYTWQTSGDERVREDHAAHDGNVFRWSDPPATGHPGQDYNCRCVAIPNVTLEGP >NZ_CP020870|1303669:1323936|1317808_1318507_-|WP_046419632.1|DBSCAN-SWA MTDPRRLLARLNPSTIRYDTLPGGVPELTAQDIAHALGLVPAGLGREVLEACWWPDGAALRRSPLRDAAVALVVPEIRRQQQRLLEARTDVGIVKACMGWTRATTSAQQAALRRAQEQLDKVKAHLWPQATLEMLPALVAAVVGELSKPQLCPCCHGRGERRVGALVKVCTACGGSGAVPASDCKRAAAIGRDESTYRRAWCGVYEWLLERMGVAERQAATQLQEALHRNVA >NZ_CP020870|1303669:1323936|1315340_1315571_-|WP_046420065.1|DBSCAN-SWA MQTRVEVIAPPQGLLQPCEARPLPRVETVRDLLSQTLGWRLAYEHCAAQVRCVAAWGQAASVGQPWSPQGCGEEGE >NZ_CP020870|1303669:1323936|1303669_1304107_-|WP_004087249.1|DBSCAN-SWA MSVFDPKQVSVLLNGTQIKDWADGTDVIDAKHNADAGAYTIGASGTGVFVANADRSGTLTLKIKQHSADNAFLSRRLAQQRGAIQSFTPFTLDIRDLLNQDVVTATQGYFTTPPGFTRGAGHNPETWTLVFEVMDITLEKGFGNA >NZ_CP020870|1303669:1323936|1307688_1308672_-|WP_046419650.1|DBSCAN-SWA MNMIDIRRRQMADALNPMLLTDARYQTSDATQALAFLVSQLTHVESTIYARQRQGIQYRDLVPISTEAGEYATSVTYQMYDYSGRGKRHSGRGEDIPTVDVAYAQKSVPVVLGTIGYDYTTEELRQSAFLRKPLNTARADAAMDAYERHINDVALFGEDELTGLYTHPGVPVLLNTAGPWIGQSPAQVLALFNTLISSAWMNTHYVEMIDTVLLPGSVMNYLVSTPRSDNSDKTILHYVLENNIAKAERGLDLTVRTGYGLETAGEGGTTRAMLYTKNPTKLVLHLPMPIRFLPPQPKGLTFDIPGEYKYSGVEFRYPKSALYADGI >NZ_CP020870|1303669:1323936|1311634_1311916_+|WP_046419643.1|DBSCAN-SWA MSNERFTSVWDAIEDTPEAAENMKLRSTLMMALKQHIETAALSQSQAATLFGVTQPRVSDLMRGKINLFGLDALVNMAAAAGMHVEMRVLKAA >NZ_CP020870|1303669:1323936|1323150_1323936_-|WP_046419774.1|DBSCAN-SWA MSQSIIPFDFHSHAVRVVMRDGNPWFVAKDVAKALEYRWNASKSISHVPSEWRGVESVSTPSGQQEVIVISEPGLYFFLGRSDKPKALPFQKWLAGEVLPSIRKTGEYTVNPDLEYDQMRSYSKDRKQMEELNTAHSRWISDVRRVLESAGIKEPEFPKGLEDSKAIATSALVELLRWHRWVLDFGPDFRLRLTPVTLHTNFLTSDEVADWVRHPTFPSKHLPDIAKAAIERMSKAFSEKALGPHMEQAAQKVMGNTSRDL >NZ_CP020870|1303669:1323936|1317029_1317368_-|WP_004085696.1|DBSCAN-SWA MPTLKQGTMIPTHDETEVINAGIAADVDARELDSQWHKGAKLACEAFSPEIYTALVAMKRPRGRPKADQTKVFTAIRLDADLLEAFKATGKGWQTRVNAALRQFIAEHPLNQ >NZ_CP020870|1303669:1323936|1311964_1313269_-|WP_080939596.1|DBSCAN-SWA MVAGLGDQRDKMSYGRYLLPRVIDRVELEAMYRTNWLARKVVDIPATDMTREWVTLNTALHADALEPMHRLEQALNVRAKVRDALAWARLYGGAVLFINVHGQDPCLPFDPASVMPGTRLSLTVLDRWRVALGSGQMDQDPLSETYGQPRCYQIAGSVERVDHSRMIAFSGAELPWEAFRGNGYWHDSVLQAMYNALSRYDTATQGTASMFFEAVVDVLRISGLSDTLASDQGTQEVHKRFQLAAMMKSFNRMLLLDAKDEYTQKTNHFAGVKDVIEQFMMDISGAADIPATRLFGQSPKGMNATGDSDIRNYYDRIKAQQEDELRPVLRVLYAVLFRASVGECPQDLEIQFNSLWQMSETEQATIEKLRAERDQIYLTHGVIGPDVPCAELLEQKTYSKITERDVTLAAELSQAMEVPDVLPVVETTAPASNT >NZ_CP020870|1303669:1323936|1306425_1306905_-|WP_046419654.1|DBSCAN-SWA MSAVTILRPADPKKWKALAQRLQALGERAVVVGIPAAHNARTEDGIGSAGLLAVHELGAPERGIPERSVVRRSISAHQEKYVALHTHHLRAVLRDAMSVDTALNLLGTVAAGDVKATLRHADLAPLKPQTIQRKGSSAPLIDTGQMIQSITYEVRDAED >NZ_CP020870|1303669:1323936|1322005_1323118_-|WP_053014142.1|DBSCAN-SWA MNAITPFQFESQAVRTVVDDHGEVWFVGKDVADVLGYTNHNKAMGDHCRGVAKRYPLPDSLGRLQHFRIISEPDMFRLIAGCKLPAAERFERWVFEGVLPTIRKTGNRPALEHSTHSANAITPFQFESKDVRIQLDEASAPWFNANDVCAVLEFGNAHQAIESHVDVDDLQKLEVIDTLGRTQRANHINESGLYALILGSTKPAAKRFKRWVTSEVLPTLRKTGTYSTPGALPTLPGPTQDCVAALLLVGQFVSKVPGVKPGIAAAATLACIKSNTNLTTEEIRRALPALQEPLCLLNATQLGKRLHCSAKAVNQLLASRGFQFRNERDEWELTEAGRVWCEAIPYSRNGHSSYQLLWNPDVIACLREAA >NZ_CP020870|1303669:1323936|1318635_1319046_-|WP_046419893.1|DBSCAN-SWA MQSLTLPWPSKDLSPNARVHWTRRSKAVKQARGYAEVMARRAGWGGLSLPVEGRLDLWISFYPPTRCLPDDDNMLARFKPYRDGIADALGIDDRRFVSHPLVEDEVRHAGQVVITITGITQQASNGGSRLHANPAF >NZ_CP020870|1303669:1323936|1313349_1314900_-|WP_046419640.1|terminase|DBSCAN-SWA MKSKSSRPSSGLRLSTEQAIDQAVIKARCEADHLFFTRYFFKQRQQLRFRVNWHHQVIAGVVDDVIAGRRKDVVINVPPGSSKTELVAINLMARGLALNPYARFLHISYSDDLALLNSETAREIVQSDEYRALWPLEIADDAKSKKRWNVVVDGKKAGGVYAVSLGGQVTGFRAGHMAPGWQGAIIIDDPLKVEDAYSKTGRSKANRKLVSTVKSRKASPDTPIIVIMQRLAQDDPTGFIQSGGFPGAWECIEIPALIDDAYVSRLPEHVQGQVVRDVQDQDGRYSYWPYKEPLAELLALEATDRYVFSGQYQQRPSPLGGGIIKGDQFGRYTVLPRILSRTVYGDTAQKTAERNDYSVFQLWGLGEDKRIYLLDMIRGKWEAPELKRRAIDFWNAHRAYDHKVSAPIRQMKIEDKSSGTGLIQDIARGGAGQGRIPVTGIARVTDKLTRVMDVVSYIDAGWVVIPAQAAWVKDFVAECEAFTADGTHAHDDQIDPMVDAINDLLANPASDWSRWV >NZ_CP020870|1303669:1323936|1308681_1309164_-|WP_046419648.1|DBSCAN-SWA MSAIDLSTYGGRLLDLGVAGQVIDLNTSRLCNYKNEGQTPIDFGLFVARGPKDATCKAPDGADAAILGISVRHVTMVADATGEVRYAPHAMVPVLEIGRIWVICEDGCRPDDPVSIRIAGTGALGAARSAAIASETISYPQARWDSTTAPGALGVIRILK >NZ_CP020870|1303669:1323936|1306901_1307270_-|WP_046419652.1|DBSCAN-SWA MSGALTLHTFLARYPEFATQPPERVAQALEDAHPWLDASRWGAAYAQGIASLAAHFVWSTPGLGDSAATTGAVVSERAGDLHISYAALPSGSASDAWLATSVYGQRYLALRRMIGLGALVAP >NZ_CP020870|1303669:1323936|1317354_1317621_-|WP_010894061.1|DBSCAN-SWA MNYEFDSAKSESNLDKHGLSLADADGFEWETAVVREDTRKQYAEPRFEAKGYIGNRLHVMVFCLRGDAVRVISLRKANSREVKSYADT >NZ_CP020870|1303669:1323936|1304116_1305613_-|WP_046419658.1|DBSCAN-SWA MALALSNIVNVQLNGQPQSAPRRDFGRLAVFTPEAGSVFVDTKTRFMDASTQNEVERAFGSYSKTAAATGRFFAQSPRPKQLMVARWNRFKQHIAASPTTLTSGAIAQGDTWYKGVDDGCFSIRIYGVDVTLSKLNFTTATSFSQVAGVLNNALDEFGVNCRFLNGCFELYAAVAGGNHAIGYAQQRSPSGTYVGHWLKLEADQARLTIGNNADTMEAETLPEAFAALQALTPGWYAAAVADETLTDTQIRSASAWIQAADKKIMGWTTREAAHLDFKKTNVFRQLNASGHDRTVVLYDTTDPYAVISWLARALSVNFSANNAALTMKFKHLPGVAADQLTQTQVAQCVRLGINYYAYVDDVAMVAEGTCLGGRFFDEVHLLDWLVDAVQKEVFAVLHRSPTKVPLTDAGTHLLIAACKKVCQEGVRNGALAPGLWNGQAFGALATGDYLEAGFYVWADSVDTLSTSDRQARRAPPLQIAVKLAGAIHAVDVIINFDR >NZ_CP020870|1303669:1323936|1319051_1320512_-|WP_046419785.1|DBSCAN-SWA MNARLYAFDERCTGELRVPPWSLEAEQAVLGGLMLAPEMLVKIADWITPENFYRRDHQLIYRAILELDSKRQPCDAVTLMEWFHSQGLSAEVDGGAYLIELASTTPSAANIVAYAEIVVDKARLRELIDVGTRLQDAGFQPEGRETRDLIAEAEHAIARLADRPRIGGIKTMQEVARRWFDDLQCRYSDKGRLYGLPTPWGKFNAMTGGLSPGQLIILAARPGMGKSAWAVNVATANALHGKRVLFFNLEMTDVSIFNRCIASVMNVPLQWLREPNDDCPDSEMFWSQVTEGMRRMRDAGLMIDDTPGLNREQIIARARREHLRQPVDLIIIDHLHLMPLPGKTRETVEIGEITRDLKGLGKELGCPVVLLAQLNRGVEARQNKRPVMKDLRESGNIEQDADLIVFLYRDDYYAEQEDRASEYPGFLEINIAKQREGQTGRVWARSRLAYGYIDDYEGEPPQGRVSVVSAPSKARMRWSQYRDDQG >NZ_CP020870|1303669:1323936|1315554_1316016_-|WP_046419636.1|DBSCAN-SWA MIVNTLRRVGRGLPSVRLLVEYMMIGALVALVAHAVLAWSERSQLAQRAAQLEGQLATVESTLDAQVAMNRDQDAAIARLRALREIDRQAIAGLHTDLNRITVRDRVLRQRITHLEHFHDEAKAFLDSDVPDVLGCLLDGGSCQASHRHTDPR >NZ_CP020870|1303669:1323936|1309191_1310382_-|WP_046419647.1|DBSCAN-SWA MITLDVQLTQRRKTPEGYLIVPARFARTGIQHYAAHELGVSDADPQRVIRVYRPPEEVFAAEAIASFDGRPITDEHPDEEVTAENWRAHAVGFARNPRREGEYLVADLTITDEATIEKIEAGKQELSGGYSAEYDWTPGWTPEGEAYEVKQIRIRGNHIATVAAGRAGPQCRVADRDIALPPSFGEHPMTKRRISVDGISLELEETEASAVEHLATKLKTATEKVDALEEDLHAAQAPIKLDSGQALTKEQLVAKIAELSKQLAALEAARAADEDPHQRDQAIAAMSRQIGDAQRLVPGLVTDGKPCSAIRRDVVSRLHPTHTAMIDTLLGGVRVADAAQTAVDLAFHVLASAPVTASAGLAAEAVNEALRRQVVKTSDADLDPRAAYIQQLTHAT >NZ_CP020870|1303669:1323936|1311299_1311638_+|WP_046419644.1|DBSCAN-SWA MVGPKPIEFRGSALDDLRTFPVSVRREAGYQLHQVQNGRDADDWKPMPTVGRGVREIRIRDADGAFRVIYVATLPEAVYVLHCFQKKTEKTTKGDLDVAAKRYRDLFNEVGQ >NZ_CP020870|1303669:1323936|1314850_1315249_-|WP_046419638.1|DBSCAN-SWA MAETGRKSHVPTDKNRLLAKQLTSFGIPHAEIALVMQISAPTLRKHYRVELDTGHIQANAKVAKSLFRLATHSTNPNITAIIFWLRTRAGWKDTQRVEVSGRDGEAIEQKVGLALVDEKQIVSALKRLEAKY >NZ_CP020870|1303669:1323936|1320914_1321376_-|WP_046419782.1|DBSCAN-SWA MAGDWIKWSKGLADKREVVLAASRLQRDRYEIAGRIMKIWEWCDDNISESCIDPETGDASVVLGSDPLPFLSALCGLPGLAEMLASSEICWISVRSGGRLIFPNLGRHNGTTAKTRASDAKKKRLQRSKCPEKCPDVTGTKVGTREEKRRYKN >NZ_CP020870|1303669:1323936|1321748_1322027_-|WP_046419776.1|DBSCAN-SWA MSEGGGMNYYARPRGDDVRNSEAAVVGKVVRQLPDGFYGFVQDGNIVYEIHPSDIREALRWIEHLAQKTWITKHHLEQFACIAADTFEMRRQ >NZ_CP020870|1303669:1323936|1316327_1316825_-|WP_023907472.1|DBSCAN-SWA MQTIGEEGIALIKFFEGCKLNPYTCPGGVLTIGYGETGKHVRPDMRLANEQEADARLRARLAKEFEPAVRRYVRVPLKQHQFDALVSLSFNIGTGAFHRSTLLRKLNAGDVAGAAEQFGAWKFSSGRVLPGLVRRRKAERWLFEGQDWQAALAAEHAAVKKASRD |
29 | Haemophilus_phage(25.0%) | terminase,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
1329457 : 1337775
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP020870|1329457:1337775|DBSCAN-SWA CATGAGCGCCTTAGCCAAAGAAGTACAGGCCAACACGCCTGAAGAAGTGCCGCTGTTTTACTACGCTCACATCCAGGACGCTGAAGGCGCTGAGCAGCAATACGTTACTGATTACCTTAAAGAACCAGGTGTAAATTCTTCAGAAGGAACGATACGAGTTTTTGCAGCTCACTACGTTAAGTTCAGCCCTTTGGTGCGCTCATGGTTTGTAGGTAGGCCAGACGGCGATATACAGCGTACGTCTATGGGGTACGAATACATACGTGTAGAGCCAACACATCCACTGTATCCGCAGATAAAAGATGCGTGTGAAGAACTATACAGCTTTAATGAATCCGTCCTATCCCATATAGCACACAAAGCCGCCGATACGGCAAAGGCAGGTGCGTAATGGCTGGCATTGGCTACAGCAGTTATAGCGACCCACGCCTGCAACCACCGCAAGACGATGCTAAAGAGTATTTCGCAGAACGGGTTAATGCACGCATTCAAGACTATTTAAGCGACCCAGAAAAAATCGAAGAAGCCGATGAATGGGTGGCCGGTACGTTGTCGGAGGCCCATTACAAAGAGATGGAAATCGTTTTAGCGGATTTACATGACATGCCTGCGTATCAGTTAATCGGCAGCGATCTATTAAAACATCTTTATGCATTAGCCGAAGTGCAAGGGCTCGCCCGCCTGAGGCAATTGCGCATCCTTGCTGAGCAGGACGTACAAGAGGACATGCAAACAGCGCAAGAAGCGCAAGAAGCGCATTGGTATCACCGTAGCGGTATCGATGCCATGCAGGAGGACTACGCATGATCCCTTGCACCATCACCGCACGCACCAAGCAGGGTGTGTACACCTACACCGGCCTATTCCAAAAAACGGTTGAGGCCAAGTCTGACGCCTACCGGCGCTTTGGATTCCCAGCCGTTATTGCCGTCACAGCCATCCACCGCAAATCAATGCGAAACAACGCGCATCCCAGCCCGCCACGTGCGGCGTGATCTGACACAAAAACCCCCGAGATTTCTCGGTAAAGGAATTCCTATGTCACATCTGTATCGCAATCTCAATTTCTGGATGGGTTGCTGCAATCATCATGCCGTAGACATCTACGGCACACCTGCATATTGCAGGAAGTGTGCGCCTCATTTATTTCTCGCGTCTGCCGACGACGTGAACGGCCCCCTCCGTCACTATCTCGACTTGCGATATCGGCTCAGTCGTCCAGTAAGCCTCAGCGATCTGACGGAAGTGCTCAAGCGGATCAACTCCGGGAAGGATGTCGAAATCGTAACTCCCACGATGGGTCCGATGCCCTGACTATCAAGCAAGAAACCCACCCATGAACCGCTATCGCACACCGAAAGAACAACACACAGCGCGTTGCAATTACTACGCACACCCTGGGTTAGCCCACTGCATTTTGCGCGATACCCGAGCAATGCCCAATGCACTACGCATCGCCAATTACCGCGTTCGTCGCGCGCACTACGAAGCCGCTAAAACGCTTTCTTCAACCCTGATCAAGCACTAACACCACTGCTGGAGGACACCATGTCTATCACTACAGAAAGAACATCGTTAACAGACCTCGAACACCTATTTGAAGTCGCAGATGGCGGAGTGCTCGCCCAAAAATTTGCAAAGGCATTAAGCGATGTAGCGTTGGCCATCAGCTACACCCGCAAACAAGGTGAGGTGGCGCTCATCTTAAAAATCAAACCCATTGGCGAATCCTCGCAAGTCATGATCGATCACACCTTGAAATACGTTGAACCCAAGGAACGCGGAAAAGTCATAGAGGAAGACACCACGAGCACGCCAATGTATGTCGGGGCACGCGGCAAATTAACGCTCATCCCCGATACACAACAGCAATTGTTCGCTGAAAAAAACTAGTCATCACCCATTAAATAAACTGAAAAAAGGAGCTGATTCATGGACAAAACAGCCATAGAACATATTCAACAAACCGCCATTGATGCTAATGGCTTCCGTTTGCCTGCTGCATTACAAGATAGAGCGGTAGCGCTTCCTCTTGACTATGGAATCAAAGATCTTGAACCTCTCTTCGAGTTGCGCTCGCGCTTTCGTGGAGCAATGCAAACGCAATCCATCCCCGACTTCGTGCAGTACATCAAGACAAAAGGAAATGGCGAAGGCTTTATTAATGCAGAAAATCTCAGTGCCAAGATATTTTTCAATCTCGGTACAACGGAATTACCCGGTCATGGCGACTGGACAGCCACGCTCACCCTTAACCCTACAGCAGCTTACAAAGCACTGCTGGATATTGACGGTAAAACACTCAATCAGCGGAAGTTGATTGATTTCCTGGAAGATTGGGCACCGCTGCTGTCGGCCACATCACAAGAAAGCCTTGATGAACTCCCACTGACGACCGCGATTAATGCCATTCGCAAATTAACTATCAAGAAAACCTCAGAGTCAGAATCGACTCAGGGTGAATTCAATACATCACGCTCCAAGTTCGATGATATCCAAGCCAAATCAGCTATTGGCTTACCAATTGGTTTCACCTTCACGACCCAGCCGTATCTTGGATTACCGGAACGCAGATTCCTGCTACGACTGTCCGTATTCACCGACGAAGAAAAAGAAAAGCCACTCATCACACTGCGCCTCATACAGCGTGAAGCACAGCAGGAAGCGACCGCCCAGGATTTTAAAAGCGTGTTGTTTCGTGATCTTGAAGGACACGCCACGCTGACCATTGGCACATTCTCCCCATCCCTTTAACAGAGCGAAGCTGCCATGAACTCCCTGTCTGTCATCAAACCAAACAATGCCGTGCCTATGACAGCCGAATACCGGCAGTCGATCCGTACCGCATTGAAAACCAGCCTGTATCCAGGGGCGACCAATGAATCCGTAGAGATGGTGTTGGCCTATTGCCAGGCGGCTGATTTAGACCCGATGACAAAACCCGTTCACATCGTGCCGATGTGGATTCCAGAAAAGAAAGTCGATGGGCGCGTTGTGTCATCGGCAGGCATGCGTGATGTGATCATGCCCGGAATCGAACTGTATCGAACCAAGGCACACCGTACCGGCGAATACGCCGGACAAGATGAAGCCGTGTTTGGAGACACCCTCTGCGAAACACTGGGCGGCGTGCAGATCCGCTATCCGTCATGGTGCCGCATTGCGGTGTACCGCATGGTCGCTGGTGAACGCGTCCGGTTTGCCGCAACCGTGTATTGGCTGGAAGCCTATGCCACCGCAAGAAAAGACAGCCCCGCCCCTAATAGCATGTGGTGCAAACGCCCCTTTGGACAGTTGGAGAAATGCGCTGAAGCCTTGGCATTACGTAAGGCATTCCCAGAAGCCGTGGGCGCTCAACCGACGGCTGAAGAAATGGATGCCGGACGACACACGATTGAGGGCGAAACGATCCATGTTGCCCCCATCCCCGTTAACCAAGACATCAGCGGCCCTTACCCACAAGAAGAATTTGAAAAGAATTTTCCAAAATGGTGTGACCTGATTGAATCAGGAAAAACTACGGCAGATCGCATCATCACGATGCTGCGTAGCAAAGACAAAGGCCAATTAACCCCAGAGCAACTCACCGCCATTCGCGCCTGCGCCGAAGACACCAGTGCTGAGCCTGTGCCTACAGACAACACCCCACAGGAAAGCAACACCACTCACACGGAAGAAAACACATGAACATCATTGAACTGACCCAAGGCACCCCGCAATGGCACACCCACCGCGCCAAGTATCTCAATGCTAGCGACGCACCGGCCATGATGGGATGCAGCCCCTATAAAAGCCGTGCCGAACTGATACGGGAACGCGCCACGGGGATCACCCCAGACTATGACCAAGCCACGTTACAACGTTTTGCTGAGGGTCATCGCGTTGAAGAACTGGCGCGCCCCTTGGCAGAACGCATCATCGGTGATGACCTGTATCCCTGTGTGGGTGTCGATGGCATGTATTCGGCCAGCTTCGACGGCTTGACCTTGCTGGAAGACACCGTCTGGGAACACAAGCAACTCAATGACACGATTCGCGCCGCCATGACAGACGGCAGCACCGGCCAAGACCTGCCATTCCATTATCAAATACAGATGGAACACCAGAGCATGGTGTCCGGTGCCCAACGTGTCCTGTTCATGGCCTCGGCCTGGAACGGTGAACAGCTCATAGAAGAACGTCATTGCTGGTACACCCCAACCCCAGAATTACGTACCCGCATCATCAACGGATGGCAACAACTGCAAGCGGACATCGCCGCCTACCAGCCTGAGCCACCACCACCGCCTGTGGCCCTGGGCCGTTCACCAGAGCAACTACCGGCATTGCATATTGCAGTCACAGGAACCGTGACCACCTCCAACCTGCCGCACTTCAAAGCCTGCGCACTGGCCGTATTGCACAACATCAACCGTGACCTGCGCACCGATGAGGATTTTGCCAACGCAGAACAAACCGTGAAATGGTGCAAAGGCGTAGAAACTCGCCTAGACGCCACCAAACAACAGGTATTAGGACAAACCGCCGACATCGATGCCGTGTTCCGTACCCTGGACGAGATCGCCGAAGAAACCCGCCGCGTCCGTCTGGAGTTGGATCGCCTGGTGAACACCGAGAAGGCCCACCGCCGTACTGAGATTGTCCACACCGGCCTGAAGACCCTGCGTGACTACTACGCCAGCCTCAACGCGAGGTTAGAGGAGTATGCCCTACCGATTCCGGCTGATTTGCCTGCAAAGATCGGTGACACCATTAAAGGCAAGAAGTCGATGATCAGCATGCAGGAGGCTGTGAGCGCCGCCGTGGCTCACGAGAAGATCAACATCACTGCCCACGTGGAACGGGTGCGCGCCAATATCACCGTGATGGAAGCATGTGACGCGGCTTACCGCTCCCTGTTTCCAGACAGGGTGAGTCTATGCATCAGCAAAACACCCGATGACCTGCACAATCTGATCCTTGCCCGCATTGCCGATCATCAGCAGCAGGAACAGGCACGCATCCAAGCCCAACGCGAGCGCCAGAACACCCTGGAAACAACGCCATTACCCACCCCGCCAACAGCCCACTGCTCCACCGAATCCACTGCAACCACCCCAGCATCCACAGGCCACCTCATCAAGCTGGGAGATATCAACGCATTGATTGCCCCACTATCCATCAATGCCGATGGATTAGCAGCACTGGGTTTTGTCCCCATCGGCACCGAACGCGCCACCAAGTGGTACGCCGCTGCTGAATTTCCTGTGATCTGTCATGCCTTGAAACAGGTGTTGTCCGACGCTGCATCCCGCACCGCCATCAAGCAGGCCGCGTGATGACAAGGGCAGCCAATATCACGAGCCATGTCATGAACGGCACGTGTGATGATCGCTTTCTCGGCAATCTCGATTCCAGAGGCGGCCTGAGCTTGACCCACATTCACAGCTGCGGTACCGTCGCTCCTGCGCTGGCAAATTCTAGCGCCGGGTTTGGCAACCCGATGAACGGCGCTTCAAAAGCGCCCCGCATCCCGAGTGGCGCTTTTTTGTTGCCTCCCACGATTTACGGTGGGGCAGTCAGGAAGACCGCAAGGTCTGCCGGTTCATGCGCAAGCTCCGTTCCCGGTTTGCCAATCCTGCCTGTCCCATCACCATGGGGTTTGGCAACGTCTGGTGATGGGTTTTCCTCTCGAACGGAGACCTGCACCATGCATAACACCCTTCCTGCGTCCGTTGATTTTTCCGACGTATCCCTAACCATCATTGATCATGATGGCATTCCTTACCTGACGGCTGCCGATCTGGCCCAAGCTTTGGGCTACAAGGACACCAGCACTGTCTTGCGCATCTACTCGCGCCATACCGACGAATTCACCAGTGAAATGAGCTTGACGGTCAATTTGACCGTCAAGGGTTTCGGTTGCGGCAACTCAGAAAAACCTGTCCGTATCTTCAGCCCCCGTGGCTGTCACCTTGTGGCGATGTTTGCCCGTACTTCAGTCGCCGCCGCGTTTCGCCGCTGGGTGCTGGATGTGTTGGAAGTCCTTCCCTCGATCCGCAAGACGGGCGAGTACACAGTGAATCCCGATCTGGAATACGACCAGATGCGCAGCTACTCCAAAGACCGTAAGCAAATGGCGGCGTTAAACACCGCTCATAGCCGTTGGATTAGCGATGTCAGACGGCTACTTGAGTCCGCAGGAATAAAAGAGCCTGAATTTCCAAAAGGGTTGGAAGATAGTAAAGCCATTGCCACATCGGCACTCGTCGAACTATTGAGATGGCATCGCTGGATACTGGATTTCGGTCCTGACTTCAGGCTACGGCTGACACCCGTCACGCTGCATACCAATTTTCTTACTAGCGACGAAGTAGCGGATTGGGTCAGACACCCAACGTTTCCAAGCAAGCATCTACCAGATATCGCAAAAGCCGCGATTGAGCGCATGAGTAAAGCGTTCTCAGAAAAAGCCATCGGCCCACACATGGAGCAGGCCGCCCAAAAGGGCATGGGCAATGCAAGCAGGCATCTGTCATGAACACCATCAGCAATGACAGCATCTACCTGCCCCCTCCCCATCCATTGCCCTCTAGGAGCACTCACAATGGCACGCGGAATTAACAAGGTGATCCTAGTCGGAAACCTGGGAAACGAGCCGGATATCAAATACACCCAAAGCGGCATGACGATCACCAACATTAGCCTAGCAACCAGCAGCAAACGCAAGGACAGAGAGGGCAATACCCAGGAGCGGACCGAATGGCACCGCGTCAAGTTTTTCGGAAAGCTGGGCGAGATTGCCGCCGAATATCTGCATAAGGGATCGCAGTGCTACATAGAGGGTGCCATTCGCTACGACAAGTTCACCGGCCAGGACGGCCAGGAGCGTTATGTCACTGAGATTATTGCTGACCAAATGCACATGCTCGGCGGTCGTGATGAAGGCTCCAGCGGCATCACGCCACAGCGGCGACCGGCAAAGGTCCGTAACAACGATAAAGCCTATGCGTATGCAGGCGATGACTTCCACGATGACGACCCACCGTTTTAGCTGAGCAGGGATCAGACAATGGTTAATGTGCAAGCAAGACAACTACGAGACATGCCCCCAGCTGCTGGTTTATGTCTTTCCAGAACTGAGGTGGCTGAGTTATGCGGTACCCCGCAACGCGCTCGCCAAGCCGCCTTTCTTAGGAAGAACGGCATTCGGCATTATCTGGATGCGCATGACTGGCCGGTCGTGCTGCGTGCTGCGATTGACGCGATGCCGTCTTCTCCGATGGTCCCGCCTGTTTGGAAGTCTAATAAGGTCGCTTAATGGGACGTAAACCAACTAAAACAGGCGCGATTCCGAGGTTTCGCGTGCGCCCTCAAAAGTCCGGCGTCGTGCATTACTACTACGATCATGGCGGCAAACCACGCAAAGAGACGCCACTAGGATGCGACTACGGGTTAGCCATCAAGCGGTGGGCTGAGCTGGAGCATGCGCAGATCACTCCTGCAATTGCGGTGACGTTCCGCCATGTGGCCGAGCGTTACCGCGCTGAGGTGATCCCGACAAAGGCGTACAACACCCAACGCATGCACTACATATATTTGAACTATCTGTTGCAGTTTTTTGACGATCCCCCAGCGCCGTTTGAGTCAATTAAACCTGTGAATATCCGCCAGTATCTAGATTGGCGGCACTTTAAGGTAAGCGCTAATCGTGAAGTAGCATTATTTTCGCATCTTTGGAATTGGGCGCGGAGTAAGGGAATCACTGATCTTCCCAATCCTTGTGCGGGCATCCGTCGTAATAAGGAATCAGGCCGTGATGTATACATAGACGATACAACTTACCATGCTGTTTACCAAGCAGCGGATCAAACGCTCAGGGATGCAATGGACCTCGCCTACCTGACGGGGCAGCGTGTGAGCGATGTTCTATCTATGGATGAGCGCCATATTGTTAATGGCGCTTTGGAGATTTGCCAAGCTAAAACGGGGGTGAAATTGGCGATTGCGATTACAGGTGAATTGGCAGTTTTAATAAAGCGTATTTTTGATCGCAAGCGGGGGCTAAAGCTGCGTAGCACACGTTTGATTGTGGATGAAAAAGGCTTGGGGTTGAATTGGAAAAAGTTAGCTTACAGGTTTAGGAAAGTACGCGCTGCCGCAGGGATCGCCAAGGAGATATTCCAATTCCGCGATCTACGCGCCAAAGCGGCGACCGATAAGGCGGATTTGGCGGGCGATATGCGCCAAGCCCAAGCACAGCTGGGCCATGCCTCGGTGGTGATGACAGAGCACTATGTACGCAAGCGCAAAGGGGCGAAGGTCACTCCTACTCGGTGA
Protein sequences of DBSCAN-SWA_7 >NZ_CP020870|1329457:1337775|1329846_1330263_+|WP_046419749.1|DBSCAN-SWA MAGIGYSSYSDPRLQPPQDDAKEYFAERVNARIQDYLSDPEKIEEADEWVAGTLSEAHYKEMEIVLADLHDMPAYQLIGSDLLKHLYALAEVQGLARLRQLRILAEQDVQEDMQTAQEAQEAHWYHRSGIDAMQEDYA >NZ_CP020870|1329457:1337775|1333139_1334774_+|WP_046419737.1|DBSCAN-SWA MNIIELTQGTPQWHTHRAKYLNASDAPAMMGCSPYKSRAELIRERATGITPDYDQATLQRFAEGHRVEELARPLAERIIGDDLYPCVGVDGMYSASFDGLTLLEDTVWEHKQLNDTIRAAMTDGSTGQDLPFHYQIQMEHQSMVSGAQRVLFMASAWNGEQLIEERHCWYTPTPELRTRIINGWQQLQADIAAYQPEPPPPPVALGRSPEQLPALHIAVTGTVTTSNLPHFKACALAVLHNINRDLRTDEDFANAEQTVKWCKGVETRLDATKQQVLGQTADIDAVFRTLDEIAEETRRVRLELDRLVNTEKAHRRTEIVHTGLKTLRDYYASLNARLEEYALPIPADLPAKIGDTIKGKKSMISMQEAVSAAVAHEKINITAHVERVRANITVMEACDAAYRSLFPDRVSLCISKTPDDLHNLILARIADHQQQEQARIQAQRERQNTLETTPLPTPPTAHCSTESTATTPASTGHLIKLGDINALIAPLSINADGLAALGFVPIGTERATKWYAAAEFPVICHALKQVLSDAASRTAIKQAA >NZ_CP020870|1329457:1337775|1336507_1336756_+|WP_060870176.1|DBSCAN-SWA MVNVQARQLRDMPPAAGLCLSRTEVAELCGTPQRARQAAFLRKNGIRHYLDAHDWPVVLRAAIDAMPSSPMVPPVWKSNKVA >NZ_CP020870|1329457:1337775|1335145_1335976_+|WP_046419912.1|DBSCAN-SWA MHNTLPASVDFSDVSLTIIDHDGIPYLTAADLAQALGYKDTSTVLRIYSRHTDEFTSEMSLTVNLTVKGFGCGNSEKPVRIFSPRGCHLVAMFARTSVAAAFRRWVLDVLEVLPSIRKTGEYTVNPDLEYDQMRSYSKDRKQMAALNTAHSRWISDVRRLLESAGIKEPEFPKGLEDSKAIATSALVELLRWHRWILDFGPDFRLRLTPVTLHTNFLTSDEVADWVRHPTFPSKHLPDIAKAAIERMSKAFSEKAIGPHMEQAAQKGMGNASRHLS >NZ_CP020870|1329457:1337775|1331004_1331349_+|WP_046419745.1|DBSCAN-SWA MSITTERTSLTDLEHLFEVADGGVLAQKFAKALSDVALAISYTRKQGEVALILKIKPIGESSQVMIDHTLKYVEPKERGKVIEEDTTSTPMYVGARGKLTLIPDTQQQLFAEKN >NZ_CP020870|1329457:1337775|1331388_1332210_+|WP_046419741.1|DBSCAN-SWA MDKTAIEHIQQTAIDANGFRLPAALQDRAVALPLDYGIKDLEPLFELRSRFRGAMQTQSIPDFVQYIKTKGNGEGFINAENLSAKIFFNLGTTELPGHGDWTATLTLNPTAAYKALLDIDGKTLNQRKLIDFLEDWAPLLSATSQESLDELPLTTAINAIRKLTIKKTSESESTQGEFNTSRSKFDDIQAKSAIGLPIGFTFTTQPYLGLPERRFLLRLSVFTDEEKEKPLITLRLIQREAQQEATAQDFKSVLFRDLEGHATLTIGTFSPSL >NZ_CP020870|1329457:1337775|1330494_1330770_+|WP_080939592.1|DBSCAN-SWA MSHLYRNLNFWMGCCNHHAVDIYGTPAYCRKCAPHLFLASADDVNGPLRHYLDLRYRLSRPVSLSDLTEVLKRINSGKDVEIVTPTMGPMP >NZ_CP020870|1329457:1337775|1330792_1330984_+|WP_046419746.1|DBSCAN-SWA MNRYRTPKEQHTARCNYYAHPGLAHCILRDTRAMPNALRIANYRVRRAHYEAAKTLSSTLIKH >NZ_CP020870|1329457:1337775|1336042_1336489_+|WP_046419733.1|DBSCAN-SWA MARGINKVILVGNLGNEPDIKYTQSGMTITNISLATSSKRKDREGNTQERTEWHRVKFFGKLGEIAAEYLHKGSQCYIEGAIRYDKFTGQDGQERYVTEIIADQMHMLGGRDEGSSGITPQRRPAKVRNNDKAYAYAGDDFHDDDPPF >NZ_CP020870|1329457:1337775|1329457_1329847_+|WP_046419752.1|DBSCAN-SWA MSALAKEVQANTPEEVPLFYYAHIQDAEGAEQQYVTDYLKEPGVNSSEGTIRVFAAHYVKFSPLVRSWFVGRPDGDIQRTSMGYEYIRVEPTHPLYPQIKDACEELYSFNESVLSHIAHKAADTAKAGA >NZ_CP020870|1329457:1337775|1336755_1337775_+|WP_046419731.1|integrase|DBSCAN-SWA MGRKPTKTGAIPRFRVRPQKSGVVHYYYDHGGKPRKETPLGCDYGLAIKRWAELEHAQITPAIAVTFRHVAERYRAEVIPTKAYNTQRMHYIYLNYLLQFFDDPPAPFESIKPVNIRQYLDWRHFKVSANREVALFSHLWNWARSKGITDLPNPCAGIRRNKESGRDVYIDDTTYHAVYQAADQTLRDAMDLAYLTGQRVSDVLSMDERHIVNGALEICQAKTGVKLAIAITGELAVLIKRIFDRKRGLKLRSTRLIVDEKGLGLNWKKLAYRFRKVRAAAGIAKEIFQFRDLRAKAATDKADLAGDMRQAQAQLGHASVVMTEHYVRKRKGAKVTPTR >NZ_CP020870|1329457:1337775|1330259_1330451_+|WP_046419748.1|DBSCAN-SWA MIPCTITARTKQGVYTYTGLFQKTVEAKSDAYRRFGFPAVIAVTAIHRKSMRNNAHPSPPRAA >NZ_CP020870|1329457:1337775|1332225_1333143_+|WP_046419739.1|DBSCAN-SWA MNSLSVIKPNNAVPMTAEYRQSIRTALKTSLYPGATNESVEMVLAYCQAADLDPMTKPVHIVPMWIPEKKVDGRVVSSAGMRDVIMPGIELYRTKAHRTGEYAGQDEAVFGDTLCETLGGVQIRYPSWCRIAVYRMVAGERVRFAATVYWLEAYATARKDSPAPNSMWCKRPFGQLEKCAEALALRKAFPEAVGAQPTAEEMDAGRHTIEGETIHVAPIPVNQDISGPYPQEEFEKNFPKWCDLIESGKTTADRIITMLRSKDKGQLTPEQLTAIRACAEDTSAEPVPTDNTPQESNTTHTEENT |
13 | Xylella_phage(45.45%) | integrase | attL 1329997:1330009|attR 1338377:1338389 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
1383317 : 1393896
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP020870|1383317:1393896|DBSCAN-SWA CATGACCGAGAAACTGACGAGCTACGATCCGGCCGAGGACTTGACCACTGACCAGGCCATCGCCGATTTCATGGCAGCCGCGTTCGGGACGAACGATCCTGCCTACGTTGCCCACGCGCTGGGCGTCGTTGCCCGCGCCAAGGGCATGACGCAGATCGCCAGCCAGACAGGGCTATCGCGCGAACAGCTCTACCGCTCGTTCAGTGCCGAGGGCAACCCGACGCTGCGCACGACATTAGCCGTGATGAAGGCGCTAGGGATCGAGCTCTCTGCGAAACCGTCTGGTGTTCACTGACCGGATGCGCGACAAGCCTTTAAAGAAATCCTATCTGTTTGTCATCAGCATCCCCTCACAGCGTGGTGCTAACGGTCATCGCCTGCTGCTCCACATCCAGCAAGGTGACCTGGATAGTGAAGGTGCGGGTATCCGCGTCCAAGGCCATTGAGAAGGCGGTGAGGCGGCGCACGCCTTCGGTGGTGAGGATGCAGCGCTTGACCTCACGCTCCAGACGCACCAGGTCGGCAGGCCGCTCCATCAGGGGCAACCACGGCAGGCCGTGGTCCAGATCCAGGAACCAGTTGCCACGGAAGGAGCGCAGCCGTGTCTTCACCCGCTGTGCCACGGAATCGCTGGCGGCCGCATAGTTGCCGCGTCCGTTGCCGAAGGTCCAATCCCCTTGGCTGTCCACGCGGCGCACTCTCATTGGGCCGGGCCTGTCTGTCCTGGGCCGTTCTCCACGTTGTCGTGGGTGTGTGTCTCCAGGCGGATGCCGTTTGATACGACGTCGCCATGACCACGGATGCCTTGGGTAAATTCCACGGGAAGATCAAGAACCAGCTTCGTTCCACGCAGTGTGATCACGCCTTGGGTATCCAGTTTGAATGAGGCGCGGCCATCCAGGGTGCGCAATACCACGCCGTCCATTTCAAACGTCGGAATGACATTGGGCAAGGAGGCAATGCCAACATGCGCCACGGCATCCGACAGGTCATGCAGGCGATAGTCCACAGGCTCGGACGCACGGCCAGACTGGAACCAGGCATCGATGCAGCGATCTTGGAAGATGAGTTCACATTCATCATATTTTAGCATTAATAATCAATAACTTATCTTAATAATAAGCTCCGCAATGAGTAAAATTACTAATTGTTATTTTCAGGGGGTTTTTGATGGAAATGAGAAATTGCGGAGCAGATATTGGGTTTTCTGTCTTTGAACTTTAATCGTGAAACTCACGACCAATTCATTAATAAACGCAATATCAAGAGCGAGTTTCGTTTAACGTGATTTCTTTACAAAAATATTTCGTTAATCAAAGCGGATAAGGTACCGGTAGAGGCGCTGCTCCAGCGCCTCGCGGGTGGAGCGGCCTCAATGGCCCCGCGTCCGGCAAGGATTAAGATGCCGCATGAAGCGGGACGGTGGTATCAGAAAATACCGCAACGCGATGTAAGGGTTTCGGTATCTGGCTTAAGGATATGGTGAGGCACCCTGTGTCATAACGTTGAAATTGCCACTAACGACCTGAGACGCACTTCATGCAACTGTTCTCTCATCTCACCACGTCTATAAAAAATTAATATACTCGACTGTAATTTAAATTAAGAAAATAACCTGATGAACTACAGCGTTATTCCTCCATATATTTTTTCAAAAATCATTGATACTGGTTCCAGTGACCAGCGAAGCCGTGCACGCAGTACATTGGCGCATGTCCAGATGCTGATGACTGAGACTAAAACAAAACCTGAAGGTTTACACATCACCACTACTGGTAAGCTACAGCGTGAGATTTATGATGCTCAGCAAACCCAGAACCTTCCAGGCACACTAGTACGCATGGAGAAGTCTCGAGAGAACGATGATGTCACTGTGAATGAAGCCTACGATTATCTTGGAGTAACATATGATTTCTACAAGGAAGCTTATCAGCGTAATTCACTCGATAACAAAGGGCTACCACTGAAAGCTACTGTCCATTACGGCAAAGACTACCAGAACGCTTTCTGGAACGGCCATCAAATGGTCTTCGGTGATGGCGACGGAGAAATTTTTAATCGCTTTACATCTGCCATCGACGTAGTGGCTCATGAACTGTCACATGGCATGACAGAAAGTGAAGCCGATCTAATTTATTTTCAACAAGCTGGAGCGCTGAATGAATCTATTTCTGATGTATTTGGCTCACTCGTCAAACAATTTAATAAGAAACAAACGGCTGACCAGGCTGACTGGATTATTGGTGAAGGCTTACTGATGAAAGGCATAAACGGCAAAGGAATTCGCTCCATGTCTAAGCCTGGCAGCGCCTATGACGACCCCTTGCTGGGTAAAGACCCTCAGCCCGCACATATGAACGACTACGTGCAAACACGCGATGACAACGGCGGCGTACACCTCAATTCTGGAATTCCTAACCATGCTTTCTACCTTGCAGCTATCGCATTGGGTGGTTACGCTTGGAAGATAGCTGGTTATGCCTGGTACGATACAGTATGCGACAAACATCTACCACAGGATGCCGATTTCAAAACTTTTGCTGAGTTCACGATAAAACATGGCAAGACCCGTTTTAACAAATCGGTGGGACACACTATCGAAACTGCTTGGCAGCAGGTTGGTGTATTACCATGACAGAGCCATTGCTGAGCTGAACGACGATATAGTCCTTGATATCGCTTGTGAAGGCGGTTGTGCTTGGATTCCCAAACTGGCCAAGCCAAGGCGTGTTACGTTGCGTCAAATGAACCCGCTGCAACGGGAACGTATCTGTAACACGCTTCTTCAGGTATTGCCGCTTGGTAAAACGCCAGGAGCTCAATCTTCTGTCGGGCGAGGTGATCAACGTTATTACCGCATTGAAATCAGCAATATCATTCAAAATCAGAAACACGTCGGAGACGGAGACACCATCATCCTCTTGATTCCCGAAGATCAAGCTCCACCGGAACTGGTTCAACTCTGGAAAGAGGGCGAAATAACGATCAGTCGGTAACACTTCACAATGAATTAAAACCAATCAACAAAGCTACTTCCGCTGAAGTTTTCCCACCAGGCGAATTCCTCCGCGAGGAATTGGAAGCGCGTCATTGGACGCAAACCGAACTAGCTGGAATCATTGGTTGTCCAGTACGGATCATCAACAAGATCATCTTAGGCAACAAAAGCATTACACAGAAAATAGCAATCCAGTTAGGCGAATCGCTGGGGACTGGTCCGGAAGTCTGGATGAATCTCGAAAGCCAATTTCAATTTTCAAAGCATGCCGTACTCGCCAGCGACCATCGCTAACTATTTCTTACACCGCGCCTCTAAAGAAAGTCGTGCACTCACCACTATGCAGGTGCTGAAGCTCGTCTACATTGCTCATGGTTGGCACCTTGGATTCCGGAAGGAGCCTTTGATTGATGAACCTGTGGAAGCGTGGCGGCATGGTCCAGCCATTCGCTCGCTTTACAACAAGATTAAAAAGTATGGAAGCGGTGCCGTTACTGAGTTGCTGCCTGTCAATCGGTTTTCTTTGCCAACTCCCCCTTGGGCCAATGTTACGAACATCGACAAAAGTACAGCAGAAATTTTAGATAGCGTCTGGAATAGCTGTGGACACTTTGGCGGTATTCAGTTATCTGAGATGACCCACAAGGAAGGCACTCCGTGGTGGCAGGTATGGAACGGTCGTGAAAGTAAGGACACTGATGTGATAAGCAACGATCTCATTCAAGAATTCTATGAGCAGAAAATCAAAGCCCATAGCTATGGGAAGCTCTACGAAAAATCTCGATCAAATCAAATCCAGCTTAACAATGAAACAATAATTGAGTGTGTTTAATGATGAGTAAATTGGCTCACAGATTGAACGATTTCCTATCGGGATTCGGGCAGGCGTTTTCGCTGTGGCCTACAGAGTCGCTTGATCGTTACCTGGTACAGGAAAGCCCAGAATCCCGTATCTATGAACACTTCGCGCGTGTTGGCGAGCATATGGAGTCAGCCATGCAAAAGGTGATGGAGGAGCAAAGCCTCATCGAAAAAAATGCGATGGATCAATGTCGCTGATCAATACGCGATACACGCCATGACGCGCAGTCCAGCGGGCAGGTTGTGGTGTTCTCCAAAGCGATCAATGGACACAGTGTGGCCATGCCTGCCCGCCGATTCGGCTGTTGCCGCGTGGGCATGATCTGCCTCTGCTGTTACAGCAATATCGTGCACATGAAACCCTGCGGCCTGCATGCCGATGTTATGGGCATGCTTGCCGTTTGCATCGGTGGTGAATCTGTGTAGATGCCCGCCAGTGACATTGGTCGTGACTTTGCCTTGATTTTGCTCATAAAAACCCATGTCTCTTCCAGAGGCATAGATCGCTCTGAAAGAACCCAAGATATGCGCGTGGTCACCGTCCCAAGAGGTACTGCCTGTATGCTGGTGCAATCCTTGTTCGTCACTCCATGCCTGGTGTAGATGATTGCCCGCCGCTGCTGCGCTGGCAGGGTGTGTATGGCGGCCTGCGGGATGGACTGTGACAGGATGGAGATGCCTACCGCCCTCCTCGGCCGTCGCGGTGTGCGCATGGGAGATCACCTGCCCGCTGGTGAAGGTGCCGACCAAGGCAGGGTCGGCGGTGTGAACGCCGACGGTGCCTTCAAGAAAATTAGGAATATTGAAGGTGCTGACACCATCACCGGCACCATAACTGGTGTTGATTTCCTCAAACAGGCGTGGGTACATGGCACGCGATACAGCGCGGCCATCACACAGCAGCGTGCCGGGTAAGGCGCGTTTGCCTGCGGTGTAGACAATCTGTCCAGGCTCGTACCTGGAGAGTGCCGTCCAGCGGTTGGGTTCATTCTGTAGCGGTGTATCGGTGTTGTTGTCAGCCGTGGACAGATACAGCCCGTAACGGCTTGCGTGATCCGGGCGGTATCGGACGATCACCCCACGCATGTATGAAAATGCCGTGCCGTTATTCTGTTCGGCGGTGATGAATTCAGGGCTGCCGTATTCTTGGTAGCCTTTAAGCACCGTGGTGATGGCGTGCAGCACGGCATTCATGACAGTGCGTTCTACGGGTTTAGCCGTCGGCTCCTTGGTTAAATCTTTTTGATAGTCCGGCCCCCATCCCTGGGTGTAGCTCACAAAGCCGTGACTGTCTTTGGCTTCGGGCACGTGGATCATGTCCCCTTGCTGGGCAAAGGGGGTACGGAAGTAGTGTTCTGTCATGGGTTATCGCTCGGGTGGGTCGCCAAACGCGGCGTGTGTGTAGTTACGGTTGCTGCGCTCGTATCCGAAGGGCAGATGTGTCACAAGGTTGTAGCGAACCCGCACGCCTGCTGGACGTGGCAGGATGTCCAGCGCAGTGATGGCATAGCGGATGACGTCTGAAATCATTGCAGTACTGACAAACACGGTATAGCTCATGTCGTAGTGGTCCAGCACTGCGGCGCTGCCGGGAAAGATGAAGTCAAGCACTTGCTCCATATTGGGCGCGGTGCCGGTCATGTGATTTTTGGCAATCCGGCACTTGATGAGAAAGCGGTAGGCTGCATCGTCCAAGCTTAGATCGTGCCGCACGGGGGGGCGTTCACTGGATAAGACACGGGATTGACCGACATGCTGGCCGATGAGGTCAAGGTGTGTTCCGGTGGCGCGTTCGATATCCAATGTCTGGCGCAGATCGGCTAAGCCGTTCCAGGTGCTGCCGAAGGTATCGCTGATCAATGCAGCGGTGGCGGTGGCCCTGGGTTGGCCCTTGTACTGCCAGATCAACAGGTCCGCATAGCTCATCGCACGACAACCTGCAGATCGTTCATTGCAAAGCGCGCCATGCTTCGCACGTCGATAGGAATATTCTGCTCAGACAACGCTTGGCCTGCTTGACCGATCATCAGCGATGTCACCCAAAAGCCTGGGACGCGATTAATTTGGGTATACAGTCGGCTGCGGTGAACGTGCTCGCCAATCAGAAAGGAGCGCTCGGCCAATGCCTGTTTGATCGCATGGGTATCAATACCGGACGTGCTGCTATCGCGCTCTACTTCGATGCGGGCGGCGCAACGGACCATCGTTGGACGGTCAAAATAGATCTCTCTAGGTTGACCGTGTTTGTTTTTAATCTGTACCCGTACCTCACCACGCATGTTTGTCCCGAGTGTTTTATGGTGATAGATCACTTCAGCAATGGCCTCATCCCGGCCCCCCTCCACAATGACGTTAATGCCGTGGGCGGGGACTCCCGCAGCATCCACGGTATCGGTGAAGTTTTCTAAGCAGACGACGTGGCGCACGTCGGGCAGCCCCCAGAGTGTGGCCTGGATGCTGTCAGCATTGTTGGTGGATGTCTTAGCGCGACTTTTAAAGAAGCGGGCGCGCAGCGCCGCATCGGACTCTTCTTCTGCCCCTGCTTCGGCGTCCTCGGTCGTGAGGGCCGAGTCCCAGCCCAGGGCCACGGTTTCAATGGTCAGGGCGGTGTGTGCGGGGACGTCAACACGGCCTAAGGCGTCGCTGCGAAAGTCTGCATGGGCGTGGCCGGTGGCATCCAAGCGCACGGATGACACGAGCTGCCAGCGGCAGCGATTGGGATCGGAAACAACAGACCCAGCAGGGATCGGGGCATCAGGTGTGCCGCTCAAGGTGACATTGCGTAAGTAGCTGTAGCTGGCTCGCCTGCGGGTGAGGCCCGCATAGGCCACGCGTTGTTCTAGCCACGCGCCGCTGGCGTAATCCGGGTCCAGTTGCCGGTGGATGTCCGTGCCCAGTTCCTCCAGATCGGCTTTGATCTGTGCAATCAGGCCAATCAACTGGCCATCGGGGCTGTCCGGATCAACATTGATATCGTTGCCGTAAATCGAACGGAAGCCTTCTTGCAAGCGGGCAATGATCGTATCCAGCCGCTCGGCTTCGTATCCGCTGGTGGTGACTTTTCCCATGGTTCAAGCACTTGATAAATTAGAAATAAACAACGTCATCCTTAAATCAACAGGCAATTCAGACGTCCCCCATCGTAGTATTTTTTAGACTACAATTGTGACATGATTGAATTGAAGCAGACTGACACCTTCCGCAAGTGGCGGGAGAAACTCAAGGATGCGCGCGCCCGCTCGGCCATCGCCTCGCGCCTCGACCGCTTGGCGTTCGGCCATGTCGGCGACGCGGAGCCAGTAGGGAAAGGTGTCAGCGAGCTTCGCATCAACTACGGCCCCGGTTACCGGGTGTATTTCCAGCGGCGTGGCGACACGATCTACTTGCTGCTTTGCGGCGGTGACAAAGGATCACAAGCGCGCGACATCAAGACTGCGCTGCACCTGTCTGAACAATGGAGCGAATGACCATGACCGAGAAACTGACGAGCTACGATCCAGCCGAGGACTTGACCACTGACCAGGCCATCGCCGATTTCATGGCAGCTGCGTTCGGGACGAACGATCCTGCCTACGTTGCCCACGCGCTGGGCGTCGTTGCCCGCGCCAAGGGCATGACGCAGATCGCCAGCCAAACAGGGCTATCGCGCGAACAGCTCTACCGCTCGTTCAGTGCCGAGGGCAACCCAACGCTGCGCACGACATTGGCCGTGATGAAGGCGCTAGGGATCGAGCTGTCTGCGAAACCGTCGGGTGTTCACTGACCGGATGCGCGACAAGCCTTTAAAGAAATCCTATCTGTTTGTCATCAGCATCCCCTCACAGCGTGGTACTAACGGTCATCGCCTGCTGGTCCACATCCAGCAAGGTGACCTGGATGGTTAAGGTGCGGGTATCCGCGTCCAAGGCCATTGAGAAGGCGGTGAGGCGGCGCACCCCTTCGGTGGTGAGGATGCAGCGCTTGACCTCGCGCTCCAGGTGTACCAGGTCGGCAGGCCGCTCCATTAGCTGCAACCACGGCAGGCCGTGGTCCAGATCCAGGAACCAGTTGCCACGGAAGGAGCGTAGCCGTGTCTTCACCCGCTGTGCCACGGAATCGCTGGCGGCGGCATAGTTGCCGCGCCCGTTGCCGAAGGTCCAATCCCCTTGGCTGTCCACGCGCCGCACTCTCATTGGGCCGGGCCTGTCTGTCCTGGGCCGTTCTCCACGTTGTCGTGGGTGTGTGTCTCCAGGCCGATGCTGTTTGATACGACGTCGCCATGACCACGCAGTCCCTGGGTGAATTCCACGGGAAGATCAAGAACCAGCTTGGTTCCACGCAGCGTGATCACGCCTTGGGTATCCAGTTTGAATGAGGCGCGGCCATCCAGGGTGCGCAGTACCACGCCGTCCATTTCAAACGTTGGAATGACATTGGGCAAGGAGGCAATGCCAACATGCGCCACGGCATCCGACAGGTCATGCAGGCGATAGTCCACAGGCTCGGACGCACGGCCAGACTGGAACCAGGCATCCATGCAGCGATCTTGGAAGATGAGTTCGCATTCATCCCCAGCAGCCACGGGGAAGGTCATCACAAAGCCGCCGCCCCGCGGGAAGGACACCGGCACATCCTGGAGTACCGGTAAGGGCTGAAGGGAGCCATCGTTCCTCTTCTGCTGGATCAACGGCTGTACGGTCGCCGTTTGGGTGACTGGGTTAAAGCGGACGATCTGCCCAGGCAAGGCCACACGCAGGCGCTGTGCCAGCGCTTCGGTACTGCGTTGCAGTACGGCACTGAGGGAGGCGTTATTCCAGTCATCCAGACTCATACAGACGGCCTCACGTTCTGAAAATCACCGCCCACACAGGTCACCGTACTGAACCAGGCTTCGGCCATGACATCGCCCATGTCATGCAGTGAGGTGATTTTGTAGTCGCCGTTGTAGATAGGGATGATCGAGTCCACGCGCACCAGGCCGCCGATGCGCAAGGCCGGATTGAGCAAGGTGGTGATTTTTAATCCATCATCGGTCACTTCGGGGGAGCCCATCATGCCGCTGCTTTGGGACAGCAGCACGGCGTCACCGGCCAGGACGGTATCGGCAGGCAGTAACATCAGTGCGCCATCCTGGATGGACCAGTCCGCGCCATGATTGTTGGCCATTGCATCCAGCAGGTCGCGGGTATTGCCCGACAGGACTTTGCCGCGGGTCAAGCCACGTTGTCCCTGCATCTGGATAGGTCCCAGCTGGGTAGACGGCATGGAGGTACTCAGTGCCCGCAGTACCTGGGCATCGGTCGCCCCTGCGGCCAACGATAAGCAAACATGCCCATGGCGGTAGTCGTGATCGCCATCGCCGCATTCCAGTTCAATGACGTAATCCGTCCCATCACGCCGCACAGAAGGCTTGATGATGTCACCGACAAATAACAGGCGCAGTTCTGCGTAACCGGCGAGCAGCCGGACCCTGTTGTACTGTCGGCTGGTGAGCAAGCTCAGGTGATCGCGGTTGAGATTCCATACGGTGATCTTGGCGGGGTTGGGGGTGGAGTCGCTGGTTTTGCGGATGTCAAAGGCGATGCGCAGGGTGTCGATGGCAATCCCATCGTGGCTGGACCCCAGCTCCAGGCGATACTGGCGGCCAAACTGTTTCATGGGCGGACCTGCTCTTTTAATCCAACAAACAGCAAGCAGCGTTCGCCCAGGTCATCGTGGCGCATCGGGTCCATCTCTAAACCGCTTTCATCTGTCAGCCAAAAGAAGTAATCGACAGGACGCCGCCACAACAGGGGGACGCCCACCACCAGGGGGATGCCTTGCGCCACGGGCTGATCTAGGGTCGCGGTGTACAGGTCCATCGACCAGCAACACGGGACCGGATTCCATCGCAGGATCAAGCGTAGGGACTCGGCACCTACGCGAAACGACTGGGTCTGATACGCGCTGCTATCCACGGGAATCTGCCACATCAGAACAGTCCAGACATCTGACGCAGTAAGGAGCGGTTCTTCTCGGTGTCCACCGGCTTAGGGTGGGTCTGGCCGCTGTGGCGTTGCGCCGCGCCTTGGGAGGCGCTCCTGCCGCGTTTGGGGGCGGGCAATGAGACACCAGAAATCGATGTTGTCTTGACGATGAACAGTTCTCGCACTGTCAGCACGCATTCAATCGAACCATCCTGGGTTTGTCTGGCCGCGATGGAGAGAATCAACATGTCTTGATACGTCTGGACGCCGGTGTGTACCTCCAGGGTCTGTCCGCTGCGTTGTAGATTCCGTAGGGCGGTGTACACCTGGGCAATGCGGCCTGTGGTGCTGGAGTCATCACGGGGGGGGATGGGCTGGTCATCGGGAAGCCAGTCGGCCAGAGGGCGCACGGCGTGCTGGCCGTCGCTCTGGGGTGCAGTGGCTTGGCTGATCACCGAGGGCAGCTCACGTTGGGCCACACGCAGCGCCTGAGCGGTGAAGGGCAGCAGGTCCGTCGGAAATGGGACGCGATCGGTCAGGACACGCAATGGCTCGGCCCCGTGTTCCTCTGCGGCAGGGGCTGGGCTGCGCTGGGGTTGGTAGTCCACCACAATGCCAGCAATGGTGACGGTCTGCGGCATCAGGACGGCGTGATCGCCGATCATCGCGCCAGACTCTATCGGGTTTTCAGTGATGCGCAGCTCGGCTTGGTGGGTTTCTTCCATCACCGCATCCAGGGTGACGGTGCCGACGTGGCGGTGGGTCAGGGTGATCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP020870|1383317:1393896|1390345_1390642_+|WP_004085590.1|DBSCAN-SWA MIELKQTDTFRKWREKLKDARARSAIASRLDRLAFGHVGDAEPVGKGVSELRINYGPGYRVYFQRRGDTIYLLLCGGDKGSQARDIKTALHLSEQWSE >NZ_CP020870|1383317:1393896|1386335_1386608_+|WP_060871504.1|DBSCAN-SWA MNKATSAEVFPPGEFLREELEARHWTQTELAGIIGCPVRIINKIILGNKSITQKIAIQLGESLGTGPEVWMNLESQFQFSKHAVLASDHR >NZ_CP020870|1383317:1393896|1387373_1388537_-|WP_046419971.1|tail|DBSCAN-SWA MTEHYFRTPFAQQGDMIHVPEAKDSHGFVSYTQGWGPDYQKDLTKEPTAKPVERTVMNAVLHAITTVLKGYQEYGSPEFITAEQNNGTAFSYMRGVIVRYRPDHASRYGLYLSTADNNTDTPLQNEPNRWTALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIPNFLEGTVGVHTADPALVGTFTSGQVISHAHTATAEEGGRHLHPVTVHPAGRHTHPASAAAAGNHLHQAWSDEQGLHQHTGSTSWDGDHAHILGSFRAIYASGRDMGFYEQNQGKVTTNVTGGHLHRFTTDANGKHAHNIGMQAAGFHVHDIAVTAEADHAHAATAESAGRHGHTVSIDRFGEHHNLPAGLRVMACIAY >NZ_CP020870|1383317:1393896|1388540_1389101_-|WP_046419969.1|DBSCAN-SWA MSYADLLIWQYKGQPRATATAALISDTFGSTWNGLADLRQTLDIERATGTHLDLIGQHVGQSRVLSSERPPVRHDLSLDDAAYRFLIKCRIAKNHMTGTAPNMEQVLDFIFPGSAAVLDHYDMSYTVFVSTAMISDVIRYAITALDILPRPAGVRVRYNLVTHLPFGYERSNRNYTHAAFGDPPER >NZ_CP020870|1383317:1393896|1383666_1384020_-|WP_046419676.1|DBSCAN-SWA MRVRRVDSQGDWTFGNGRGNYAAASDSVAQRVKTRLRSFRGNWFLDLDHGLPWLPLMERPADLVRLEREVKRCILTTEGVRRLTAFSMALDADTRTFTIQVTLLDVEQQAMTVSTTL >NZ_CP020870|1383317:1393896|1387145_1387373_+|WP_046419974.1|DBSCAN-SWA MMSKLAHRLNDFLSGFGQAFSLWPTESLDRYLVQESPESRIYEHFARVGEHMESAMQKVMEEQSLIEKNAMDQCR >NZ_CP020870|1383317:1393896|1393125_1393896_-|WP_046419667.1|DBSCAN-SWA MITLTHRHVGTVTLDAVMEETHQAELRITENPIESGAMIGDHAVLMPQTVTIAGIVVDYQPQRSPAPAAEEHGAEPLRVLTDRVPFPTDLLPFTAQALRVAQRELPSVISQATAPQSDGQHAVRPLADWLPDDQPIPPRDDSSTTGRIAQVYTALRNLQRSGQTLEVHTGVQTYQDMLILSIAARQTQDGSIECVLTVRELFIVKTTSISGVSLPAPKRGRSASQGAAQRHSGQTHPKPVDTEKNRSLLRQMSGLF >NZ_CP020870|1383317:1393896|1384934_1385951_+|WP_060870179.1|DBSCAN-SWA MNYSVIPPYIFSKIIDTGSSDQRSRARSTLAHVQMLMTETKTKPEGLHITTTGKLQREIYDAQQTQNLPGTLVRMEKSRENDDVTVNEAYDYLGVTYDFYKEAYQRNSLDNKGLPLKATVHYGKDYQNAFWNGHQMVFGDGDGEIFNRFTSAIDVVAHELSHGMTESEADLIYFQQAGALNESISDVFGSLVKQFNKKQTADQADWIIGEGLLMKGINGKGIRSMSKPGSAYDDPLLGKDPQPAHMNDYVQTRDDNGGVHLNSGIPNHAFYLAAIALGGYAWKIAGYAWYDTVCDKHLPQDADFKTFAEFTIKHGKTRFNKSVGHTIETAWQQVGVLP >NZ_CP020870|1383317:1393896|1385958_1386312_+|WP_081089730.1|DBSCAN-SWA MAELNDDIVLDIACEGGCAWIPKLAKPRRVTLRQMNPLQRERICNTLLQVLPLGKTPGAQSSVGRGDQRYYRIEISNIIQNQKHVGDGDTIILLIPEDQAPPELVQLWKEGEITISR >NZ_CP020870|1383317:1393896|1392808_1393126_-|WP_046419669.1|DBSCAN-SWA MWQIPVDSSAYQTQSFRVGAESLRLILRWNPVPCCWSMDLYTATLDQPVAQGIPLVVGVPLLWRRPVDYFFWLTDESGLEMDPMRHDDLGERCLLFVGLKEQVRP >NZ_CP020870|1383317:1393896|1390644_1390938_+|WP_004085592.1|DBSCAN-SWA MTEKLTSYDPAEDLTTDQAIADFMAAAFGTNDPAYVAHALGVVARAKGMTQIASQTGLSREQLYRSFSAEGNPTLRTTLAVMKALGIELSAKPSGVH >NZ_CP020870|1383317:1393896|1383317_1383611_+|WP_004085592.1|DBSCAN-SWA MTEKLTSYDPAEDLTTDQAIADFMAAAFGTNDPAYVAHALGVVARAKGMTQIASQTGLSREQLYRSFSAEGNPTLRTTLAVMKALGIELSAKPSGVH >NZ_CP020870|1383317:1393896|1391981_1392812_-|WP_046419672.1|DBSCAN-SWA MKQFGRQYRLELGSSHDGIAIDTLRIAFDIRKTSDSTPNPAKITVWNLNRDHLSLLTSRQYNRVRLLAGYAELRLLFVGDIIKPSVRRDGTDYVIELECGDGDHDYRHGHVCLSLAAGATDAQVLRALSTSMPSTQLGPIQMQGQRGLTRGKVLSGNTRDLLDAMANNHGADWSIQDGALMLLPADTVLAGDAVLLSQSSGMMGSPEVTDDGLKITTLLNPALRIGGLVRVDSIIPIYNGDYKITSLHDMGDVMAEAWFSTVTCVGGDFQNVRPSV >NZ_CP020870|1383317:1393896|1391343_1391985_-|WP_046419962.1|DBSCAN-SWA MSLDDWNNASLSAVLQRSTEALAQRLRVALPGQIVRFNPVTQTATVQPLIQQKRNDGSLQPLPVLQDVPVSFPRGGGFVMTFPVAAGDECELIFQDRCMDAWFQSGRASEPVDYRLHDLSDAVAHVGIASLPNVIPTFEMDGVVLRTLDGRASFKLDTQGVITLRGTKLVLDLPVEFTQGLRGHGDVVSNSIGLETHTHDNVENGPGQTGPAQ >NZ_CP020870|1383317:1393896|1390993_1391347_-|WP_046419963.1|DBSCAN-SWA MRVRRVDSQGDWTFGNGRGNYAAASDSVAQRVKTRLRSFRGNWFLDLDHGLPWLQLMERPADLVHLEREVKRCILTTEGVRRLTAFSMALDADTRTLTIQVTLLDVDQQAMTVSTTL >NZ_CP020870|1383317:1393896|1386579_1387146_+|WP_081089935.1|DBSCAN-SWA MPYSPATIANYFLHRASKESRALTTMQVLKLVYIAHGWHLGFRKEPLIDEPVEAWRHGPAIRSLYNKIKKYGSGAVTELLPVNRFSLPTPPWANVTNIDKSTAEILDSVWNSCGHFGGIQLSEMTHKEGTPWWQVWNGRESKDTDVISNDLIQEFYEQKIKAHSYGKLYEKSRSNQIQLNNETIIECV >NZ_CP020870|1383317:1393896|1389097_1390243_-|WP_046419967.1|plate|DBSCAN-SWA MGKVTTSGYEAERLDTIIARLQEGFRSIYGNDINVDPDSPDGQLIGLIAQIKADLEELGTDIHRQLDPDYASGAWLEQRVAYAGLTRRRASYSYLRNVTLSGTPDAPIPAGSVVSDPNRCRWQLVSSVRLDATGHAHADFRSDALGRVDVPAHTALTIETVALGWDSALTTEDAEAGAEEESDAALRARFFKSRAKTSTNNADSIQATLWGLPDVRHVVCLENFTDTVDAAGVPAHGINVIVEGGRDEAIAEVIYHHKTLGTNMRGEVRVQIKNKHGQPREIYFDRPTMVRCAARIEVERDSSTSGIDTHAIKQALAERSFLIGEHVHRSRLYTQINRVPGFWVTSLMIGQAGQALSEQNIPIDVRSMARFAMNDLQVVVR |
17 | Haemophilus_phage(23.08%) | plate,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
1397321 : 1416107
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP020870|1397321:1416107|DBSCAN-SWA TTCATGCGTTGCCAAAGCCTTTTTCTAGGGTGATGTCCATCACTTCAAACACCAGTGTCCAGGTTTCTGGATTGTGTCCGGCGCCCCGGGTAAATCCTGGGGGGGTGGTGAAATACCCTTGGGTGGCTGTCACCACGTCCTGATTCAACAGGTCACGGATATCCAGGGTGAAGGGGGTGAAGGACTGGATCGCGCCGCGTTGCTGCGCCAGTCGCCTGCTCAAAAAGGCGTTGTCGGCGCTGTGCTGTTTGATTTTCAACGTTAAGGTGCCGGACCGATCCGCGTTGGCGACAAACACGCCCGTGCCGCTGGCCCCGATGGTGTAGGCACCGGCATCGGCATTGTGTTTGGCGTCGATGACGTCCGTGCCATCGGCCCAGTCTTTGATCTGGGTTCCATTGAGCAGCACCGACACTTGTTTGGGGTCGAAGACGGACATGGGATTCCTTTATCGGTCGAAGTTGATGATGACGTCCACCGCATGGATGGCACCGGCCAGCTTCACGGCGATCTGAAGTGGCGGTGCCCGGCGCGCTTGGCGATCAGAGGTCGATAAGGTGTCCACTGAATCAGCCCAGACATAAAAACCAGCCTCCAGGTAATCGCCCGTGGCCAGCGCACCGAAGGCCTGCCCGTTCCAGAGGCCAGGGGCCAAGGCACCGTTACGGACCCCTTCTTGGCAAACTTTTTTGCAGGCCGCGATCAGCAGGTGGGTGCCTGCATCCGTCAGCGGCACCTTGGTTGGGCTGCGATGCAGGACGGCAAACACCTCCTTTTGCACCGCATCCACCAGCCAATCCAGCAGATGGACTTCATCAAAGAAGCGCCCGCCAAGACAGGTGCCTTCGGCCACCATCGCCACATCATCAACGTAGGCGTAATAGTTGATGCCTAAACGCACGCATTGGGCCACCTGGGTCTGTGTCAATTGATCTGCGGCCACCCCGGGCAGGTGCTTAAATTTCATGGTCAGGGCGGCGTTGTTGGCGCTGAAGTTCACCGATAACGCACGGGCCAACCACGAGATCACCGCGTAGGGGTCTGTGGTGTCGTACAGCACCACGGTACGGTCATGCCCCGATGCATTGAGTTGTCTGAACACATTGGTTTTTTTAAAGTCCAAATGCGCCGCCTCGCGGGTCGTCCATCCCATGATTTTTTTGTCTGCCGCCTGGATCCAGGCGGAGGCGGATCGGATCTGCGTGTCTGTCAATGTCTCATCGGCCACCGCAGCGGCATACCAGCCTGGGGTGAGTGCCTGCAAGGCCGCAAAGGCCTCCGGCAGTGTCTCGGCCTCCATGGTGTCAGCGTTGTTGCCGATGGTCAGGCGGGCCTGATCGGCTTCAAGCTTCAGCCAGTGCCCGACATAGGTGCCAGAGGGACTGCGCTGCTGTGCATAGCCAATGGCGTGATTTCCTCCGGCCACGGCAGCATAGAGTTCAAAGCAGCCATTTAAAAATCGGCAATTCACTCCAAACTCATCCAGTGCGTTATTCAACACACCTGCCACTTGGGAGAAGGAGGTGGCCGTGGTGAAATTCAGCTTGGATAAGGTGACATCCACACCATAGATGCGAATGGAAAAACAGCCGTCATCCACGCCCTTGTACCACGTATCCCCCTGGGCAATCGCTCCGGAGGTGAGTGTCGTTGGGGAGGCGGCAATGTGTTGTTTAAAGCGATTCCAGCGCGCCACCATCAGCTGTTTGGGGCGGGGGCTTTGTGCAAAAAAGCGACCGGTGGCGGCGGCGGTTTTGGAGTAACTTCCGAAGGCGCGTTCTACCTCGTTTTGAGTGCTGGCATCCATGAAGCGTGTTTTGGTATCGACAAACACGCTGCCCGCTTCGGGGGTGAACACGGCCAGCCTCCCAAAGTCACGACGGGGTGCTGACTGGGGCTGTCCATTGAGTTGCACATTGACAATGTTTGAAAGCGCTAGCGCCATTTACTGGGTCTCCGGTGCCGTCATGGTCACGCTGGCGATGTGACCGGTGCGGGTGTGAATATGGATGTCTGCGCTGTCCACAGCAGCCAGGGTGGTCACCACACGGTGGTGGTGGGTGATCTGTAATTCGATCCGGGCGCGGGCTTCATAGCCGCCGCCCACAATGGCCGAGAGGTCTTGGGCAGCCGTGACGGACACCAGGCCCGCACGTAAGGCGCGCAGCCCTGCCGTGCCCGCCTCGCAGGACAGTAACGCCTGTGCCTGCAACACCAGTTCATAGGCGCCCGTGCCGTAGGCATTCACACTGATATGGTGCAGATAGGCACAGGTGATGCTCTGCTGGCGGCCATCAAATACGCAGCACGCCGCTCCCAAGGGGGAAGAACGCAGGCGCTTCACCGTCACAAACGGTGCTGCTCCACAGGGGGCGGCTTGGTCGGCAGGACGGACGGTTCCTTCAGGTAAGGCCAACAGCCGCCGCAGTAGGGTGCGCAGTCCCGTCATGTCGAACGGCGATACCGTGGTAGTAGCCATACTCGGACCAGTTGGAAAGTTGCGCGATGCGCCAGGTGGTGTCCTGGTAGTGCACCAGATCACCAACGCACAGCGCGTCCTGACTCATGATTTTTTTGGAGGGCAAATAGCGCTGCCCTTCTGGAAGCAATTGCAGGTCATCGGGTTTGACCGGATGCAGGATCGCTCGCACAGGGTGCGCAACGCTGTCCTGGGTCCAGGTGCCATCGGCACGATAGTGCCCGTGGTCACGGTGCACCGTGACTGTTTGGGCAAAGCGTGGATTGCCAAACAGCGCGCTAATCTTCAGCATCGCGCACCTCATAGGTGATCGACTGGATCATCTGGCCGGTATCGATCAGGGGCGCGCTGGACCCTTTGCGCTGGATCGTTTGGGGCTTCAGGGGGGCAAGATCAGCGTGGCGGAGCGTCGCCTTGACATCGCCTGCGGCCACCGTCCCTAGTAGGTTCAGGGCAGTGTCTACGCTCATCGCATCACGCAGCACTGCGCGCAGGTGGTGGGTGTGCAGGGCCACATATTTTTCTTGATGCGCGCTGATGGAACGCCGCACCACCGAGCGTTCCGGAATGCCCCGCTCTGGCGCACCCAATTCATGCACCGCCAACAGTCCAGCCGAGCCGATCCCGTCTTCCGTCCGGGCGTTGTGCGCGGCAGGAATGCCCACCACCACAGCGCGCTCCCCCAGCGCCTGAAGCCGCTGCGCCAGGGCCTTCCACTTTTTGGGATCGGCGGGCCGAAGGATCGTGACCGCACTCATGGGGCAACCAAGGCCCCCAGGCCGATCATCCGACGCAGCGCCAGGTAACGTTGTCCATACACCGAGGTGGCTAGCCAAGCGTCACTGGCACTGCCAGAGGGCAGCGCCGCGTAGCTGATGTGCAGATCACCGGCCCGCTCGGACACTACCGCGCCTGTGGTGGCGGCGCTGTCGCCCAGCCCTGGGGTGGACCACACAAAATGGGCCGCCAGGCTCGCGATTCCTTGCGCATACGCCGCTCCCCATCGGGACGCGTCCAGCCAGGGATGGGCATCCTCCAGGGCCTGAGCCACGCGTTCCGGGGGCTGGGTGGCAAACTCCGGATAGCGCGCCAGGAACGTGTGAAGCGTCAGTGCCCCGGACATCATCCCTTCCTTCCGGGTTTGCCAGATTTGCCCGGCGTGCTGTCATGGCTGCCGGTCCCCGGAAGGACAGCCGCGCTCCCTTCACCAGAAACCTGCCCTTCACTGTTGGGTGGGCCCGCCCCCTCAGGGGGGCTGTGCGGCTGGAGCGGCTTCTCCTGCTGTTCCTCCCGCTGGACCGGCTCCTGCTGCTGCTCCACCAAATAGCCATTGTCAAACCACAGGCCAATGCCAGGGTGCTGCCGCAGCTGCTCCACGTGTGCGGCCTCCAGGGCCTGGGTGCGTCCGGCCTGAATCGTCACGCCATCCAGGGTGACATCACAGCTGCGGGTATTCTTGAGCATAATCGTGGTCATGGTGCTGCGTTCTCCCCAAAAAAAAAGCGCCTCAGGGCGCTGGTGTGGTCGGTGTTGAATACAACTCAAATGCCATCGGCATACAGGGCTGATTTTGGATAACGAAACTCCACGCCGCTGTATTTGTATTCACCTGGAATATCAAACGTCAGGCCCTTGGGTTGCGGGGGCAAAAACCGGATGGGCATGGGCAGATGCAGCACCAGCTTGGTGGGGTTCTTGGTATACAGCATGGCGCGGGTTGTGCCACCTTCCCCTGCCGTCTCTAAGCCGTAGCCGGTGCGGACGGTCAGATCAAGGCCACGCTCGGCTTTGGCAATGTTGTTTTCCAGCACGTAATGCAGAATGGTTTTATCGCTGTTGTCACTGCGCGGGGTGGAGACGAGATAGTTCATGACGCTACCGGGCAACAGGACGGTATCGATCATCTCCACATAGTGGGTGTTCATCCAAGCGCTGGAGATCAAGGTGTTGAACAAGGCCAGCACCTGGGCGGGCGACTGACCGATCCAAGGCCCCGCGGTGTTCAGCAGGACCGGCACGCCAGGATGGGTATACAGGCCGGTGAGTTCGTCCTCACCAAACAACGCCACATCGTTGATATGGCGCTCATAGGCATCCATCGCGGCATCGGCCCGCGCGGTATTGAGGGGTTTACGCAGAAAGGCCGACTGGCGCAGTTCCTCGGTGGTGTAATCGTAGCCAATGGTGCCCAGCACCACAGGCACGCTCTTTTGTGCGTAGGCCACATCGACCGTCGGAATATCTTCGCCCCGTCCAGAATGCCGCTTGCCGCGTCCGGAATAGTCATACATTTGATAGGTCACCGAGGTCGCGTACTCGCCCGCTTCGGTGCTGATGGGCACCAAATCCCGGTACTGGATGCCTTGGCGCTGGCGGGCGTAGATCGTCGATTCAACATGGGTCAGTTGCGACACCAAAAACGCCAGCGCTTGGGTGGCATCGGAGGTCTGATACCGTGCATCGGTCAGCAACATCGGGTTCAACGCATCGGCCATCTGACGGCGGCGTATGTCAATCATGTTCATGCAGGTGCCTTATTTAAGAATGCGGATCACGCCCAGCGCTCCGGGGGCGGTGGTGCTGTCCCAGCGGGCCTGGGGATAGGAAATCGTTTCTGAAGCGATGGCGGCGGATCGGGCCGCGCCCAAGGCCCCCGTTCCCGCAATGCGGATAGACACCGGATCATCCGGGCGGCAGCCATCCTCGCAGATCACCCAGATGCGACCGATCTCCAACACCGGCACCATCGCATGGGGCGCATACCGGACCTCTCCGGTCGCATCGGCGACCATCGTGACATGGCGGACACTGATCCCTAGGATGGCGGCGTCTGCCCCATCGGGGGCTTTGCAGGTGGCGTCTTTGGGGCCGCGTGCCACAAACAGGCCAAAATCAATGGGCGTCTGGCCCTCGTTCTTGTAGTTGCACAGGCGGCTGGTGTTCAAGTCGATGACTTGCCCCGCAACGCCAAGATCAAGTAAGCGTCCACCATAGGTGGATAGGTCAATTGCGGACATACGTGTTCCTTCGGGTGCTGAAGTGCTCTAGGTGGCATGGGTGAGCTGTTGGATATACGCCGCTCGCGGGTCCAAGTCGGCATCCGATGTCTTGACGACCTGACGCCGTAACGCCTCATTCACCGCCTCAGCAGCCAGCCCTGCCGAGGCCGTCACCGGCGCGGACGCCAGAACGTGAAACGCCAGGTCCACCGCCGTTTGTGCCGCATCGGCCACCCGGACCCCGCCCAGCAGGGTGTCAATCATGGCCGTATGCGTGGGGTGCAGACGGCTCACCACGTCACGGCGGATCGCGCTGCACGGCTTGCCGTCGGTCACCAGGCCCGGCACCAGCCGCTGGGCATCGCCGATCTGTCGTGACATGGCTGCAATCGCTTGATCCCGCTGGTGCGGGTCTTCGTCCGCAGCGCGGGCCGCTTCCAGCGCCGCCAATTGCTTGGACAGCTCCGCAATCTTGGCCACCAATTGCTCCTTGGTCAGCGCTTGGCCGCTATCCAGTTTGATGGGGGCCTGGGCGGCGTGCAGATCCTCTTCCAGGGCATCCACTTTCTCGGTGGCCGTCTTGAGCTTGGTCGCCAGGTGTTCAACCGCGCTGGCTTCCGTCTCTTCAAGCTCTAGGCTGATACCGTCCACACTAATACGGCGTTTGGTCATGGGGTGTTCTCCAAAGGATGGGGGTAAGGCTATGTCGCGATCGGCCACGCGGCACTGGGGTCCAGCACGGCCTGCGGCAACGGTGGCAATGTGGTTGCCACGAATCCGAATCTGTTTCACCTCGTACGCCTCGCCCTCCGGGGTCCAGCCCGGGGTCCAGTCGTACTCGGCGCTGTAGCCGCCGGACAGTTCTTGTTTGCCAGCTTCAATCTTTTCGATGGTCGCCTCATCGGTAATCGTGAGATCGGCCACCAGATACTCCCCTTCGCGCCGTGGATTGCGGGCAAAGCCCACCGCATGGGCGCGCCAGTTCTCGGCGGTCACCTCCTCATCCGGATGCTCATCGGTGATCGGGCGACCATCAAAGCTGGCGATGGCCTCGGCAGCAAACACTTCTTCAGGGGGGCGGTAGACGCGAATCACCCGCTGGGGATCGGCATCGCTCACCCCCAGTTCGTGGGCGGCATAGTGCTGGATGCCGGTGCGCGCAAATCGGGCAGGCACGATCAGATACCCTTCCGGCGTCTTGCGACGTTGGGTCAGTTGGACATCCAAGGTGATCATCAAGGCCCTTCCAGCGTCACATTCGGAATCGCCACACAGCGGCAGTTGTAGTCCTGTCCCGGATGCCCCGTCGCGGGGGGATCGCTCCAGCGGAACACGTTCCCATCATGGGCCGCATGATCCTCACGCACCCGTTCATCCCCGCTGGTCTGCCAGGTGTAGGTCGTTATGCCCAACCCCACTTGCCGGATTTCATTCAACGCCGCATTCATTTTTGATGTCTGATCCCGTGCAATAAACTTCGCTCGTGATGCCGTGGCATCGGTGATCTGTTCCATCTCCTTGGCCACGACGCTGGCGCGTTTGCCCTGCATGACGCCTTGCAACACAGCGGTACCGATCTTGTCGAAATACTGTCGCTGAATGGAGGTGATCAACTGGACATTGACGGCACGGGCCGCGTGTATCTGGTCCCGCACCTGCTGGGCCAGCATCCATGACGTGATGTCGATGCCGAAGGCGGTACGCACCGCGCTGCCAATGGTCTGTACGACCTGACGATCCACACGCTGCACCTGCTGCGCCGCGATCCGCTCAGCCCATTGAGGCAAGCCACCACAGCGCAACGCCGCCCGCCGCAAGGCCGCCTCAATGGCCTGCATAAACTGCGCTGCCAGATAGCCCTGTGGGGCGCTGCCGTCAGGCGCATCACGGGTCATGTGGGGCGGCGATGCGTGGAGCACCGGCAGCACCTCCTCCCGCACCGCCTGGTGCAGCACCCGCACCAAGGCCAGCAGTTCATTCCTATACGTGGCCTCAGCCTGGCGGCTGGGGCGCGGCGGGCGTAACTGCCGCTGCCTGATCCGGCGTCCCTGCAAGCGCAGTAGGTCCGGTAATGTCAACATCCTCAGTGAACCTAAATTCCGTGGGCCGACAAAGATGTCCTTGCGCTCATACAAAGTTTTGTATGATGGCGCATGGTAGGACCCAAACCGATTGAATTCAGAGGCAGCGCTCTTGACGACTTACGCACTTTTCCAGTGAGCGTAAGACGTGAGGCCGGGTACCAGCTTCACCAAGTGCAAAACGGACGCGACGCTGACGACTGGAAGCCCATGCCCACGGTAGGGCGTGGAGTCCGCGAGATTCGCATCCGTGACGCAGACGGCGCTTTCCGCGTTATCTACGTCGCCACGCTGCCCGAGGCTGTCTATGTGTTGCATTGCTTCCAAAAGAAAACTGAGAAAACCACCAAAGGCGATCTTGATGTAGCGGCTAAACGCTACCGTGATCTGTTTAATGAGGTAGGACAATGAGCAACGAGCGATTCACCAGTGTGTGGGATGCCATTGAGGACACTCCCGAAGCCGCCGAAAACATGAAGTTACGTTCCACACTCATGATGGCCCTGAAACAACACATCGAAACGGCTGCGCTGAGTCAGTCTCAAGCCGCTACGCTGTTCGGTGTCACGCAGCCTCGCGTGTCAGATTTAATGCGCGGCAAAATCAACCTGTTCGGCTTGGATGCACTGGTCAACATGGCTGCGGCGGCTGGCATGCATGTGGAAATGCGCGTGCTGAAAGCGGCGTGAGTGCTTCGCCGGTTTTTTATCTGACCACTGCATGATTCCAATCGCAGCCTATGTATTGGAAGCGGGTGCCGTCGTTTCTACCACTGGCAACACATCTGGAACCTCCATCGCCTGAGACAGTTCCGCCGCCAGCGTCACATCGCGTTCGGTGATCTTTGAATAAGTTTTTTGTTCCAGCAGCTCCGCACAGGGCACGTCTGGACCGATGACGCCATGCGTCAAGTAAATCTGATCACGCTCGGCGCGTAGCTTCTCAATGGTTGCCTGTTCTGTCTCGCTCATCTGCCATAGCGAATTGAACTGAATCTCTAAATCCTGCGGACACTCCCCCACAGACGCCCGAAACAGCACGGCGTACAACACCCGCAGCACAGGCCGCAGCTCGTCTTCCTGCTGCGCCTTGATGCGGTCGTAATAATTGCGAATATCACTGTCGCCGGTGGCGTTCATGCCTTTGGGGGACTGACCGAACAACCGGGTTGCCGGAATATCCGCCGCCCCTGAAATATCCATCATGAATTGCTCAATCACATCCTTCACACCCGCAAAGTGATTGGTTTTCTGGGTGTATTCATCCTTAGCATCCAGCAGCAGCATCCGATTGAATGATTTCATCATGGCCGCTAACTGAAAGCGCTTGTGTACCTCTTGCGTCCCTTGGTCCGAGGCGAGCGTGTCGCTGAGTCCAGAGATCCGCAATACATCCACCACCGCCTCAAAAAACATCGACGCCGTGCCCTGGGTCGCGGTGTCATAGCGGCTGAGCGCGTTATACATGGCCTGCAATACCGAATCATGCCAGTAGCCGTTGCCTCTGAATGCCTCCCAGGGCAGTTCCGCTCCAGAGAAGGCAATCATTCGGGAATGGTCCACCCGCTCCACCGATCCGGCAATCTGATAACAGCGCGGTTGCCCGTAGGTCTCACTCAAGGGGTCCTGGTCCATCTGACCACTGCCCAGCGCCACCCGCCAGCGATCCAACACCGTCAGCGATAGCCTGGTCCCCGGCATGACCGAGGCCGGATCAAACGGCAAGCACGGGTCTTGCCCATGCACGTTGATAAACAGCACCGCACCCCCGTACAACCGGGCCCAGGCCAAGGCATCGCGCACCTTGGCGCGTACGTTCAACGCCTGTTCCAGACGATGCATCGGCTCCAGCGCATCGGCGTGCAGCGCCGTATTCAACGTCACCCATTCCCGCGTCATGTCCGTCGCTGGAATATCCACCACCTTGCGCGCCAGCCAATTGGTCCGGTACATCGCCTCCAGTTCTACACGATCAATCACCCGGGGCAGCAGGTACCGCCCATAGCTCATCTTGTCGCGCTGATCGCCTAATCCGGCCACCAGGTTCTGCAAGGTGTCCACGACATGCTGAGGCGCCGCCCCCCGTGTGGCCCGCGTAGCGCGCTTGTTTCGGTTCTGCTGACTCACACCCAGCGACTCCAATCACTTGCAGGATTGGCCAGCAAATCGTTGATTGCATCCACCATCGGATCAATCTGATCATCATGGGCGTGCGTGCCATCAGCGGTGAACGCTTCACACTCGGCCACAAAATCCTTCACCCACGCCGCCTGCGCTGGAATCACCACCCACCCCGCATCAATGTAAGACACCACATCCATCACCCGCGTGAGCTTGTCGGTCACCCGTGCAATCCCAGTCACCGGAATACGCCCCTGACCAGCGCCACCTCTGGCGATGTCCTGAATTAAGCCCGTGCCGCTAGATTTGTCCTCAATCTTCATCTGACGGATCGGAGCCGATACCTTATGGTCGTAGGCGCGATGCGCATTCCAAAAATCAATCGCCCGCCGCTTGAGTTCCGGCGCTTCCCACTTGCCGCGAATCATGTCCAACAAATAAATACGCTTGTCCTCACCCAAGCCCCACAACTGGAACACGCTGTAATCGTTACGCTCAGCCGTCTTCTGCGCCGTATCGCCATACACCGTGCGCGAGAGAATGCGCGGCAGCACCGTATAGCGCCCAAATTGATCCCCTTTGATGATCCCACCGCCCAGCGGACTGGGGCGCTGCTGATATTGACCGCTGAACACATAGCGGTCCGTGGCTTCCAACGCCAGCAACTCGGCTAACGGTTCTTTGTACGGCCAGTAGCTGTAGCGTCCGTCCTGGTCCTGCACATCACGCACCACCTGCCCTTGGACGTGCTCCGGCAAACGGGACACGTAGGCATCATCAATCAATGCCGGAATCTCAATACACTCCCACGCCCCCGGGAATCCCCCAGACTGGATGAACCCCGTTGGATCGTCCTGCGCCAACCGTTGCATGATCACAATGATCGGCGTGTCCGGACTGGCTTTACGACTCTTCACCGTGGACACCAGCTTACGGTTGGCCTTACTGCGTCCGGTCTTGCTGTAGGCATCTTCTACCTTCAGCGGGTCATCAATAATGATCGCCCCCTGCCATCCCGGGGCCATGTGTCCGGCCCGAAACCCCGTCACCTGTCCGCCCAGACTCACCGCGTACACCCCACCGGCTTTCTTGCCATCCACCACCACATTCCAGCGCTTCTTGGACTTGGCATCGTCAGCAATCTCCAACGGCCACAACGCACGATATTCATCAGACTGCACAATCTCCCGCGCCGTCTCCGAATTCAGCAGCGCCAAATCATCCGAATAACTAATATGCAAAAACCGCGCATACGGATTCAGCGCCAACCCTCGCGCCATCAAATTAATCGCCACAAGCTCCGTTTTCGATGATCCAGGAGGCACGTTAATCACCACATCCTTGCGCCGCCCTGCAATCACATCATCCACCACACCAGCAATCACTTGATGGTGCCAATTCACCCTAAACCGCAGTTGCTGACGCTGTTTGAAAAAATACCGTGTGAAAAATAAATGATCTGCTTCGCACCTGGCCTTGATCACCGCTTGATCAATGGCCTGTTCAGTACTTAGCCTCAAGCCGCTTGAGGGCCGAGACGATTTGCTTTTCATCAACTAACGCCAATCCCACCTTCTGTTCAATCGCCTCCCCATCGCGACCGGACACCTCAACGCGTTGCGTGTCTTTCCAGCCAGCGCGTGTCCTCAGCCAAAAGATGATGGCTGTAATATTCGGATTGGTGCTATGCGTGGCTAACCGGAACAAACTTTTAGCCACCTTTGCATTGGCTTGGATATGCCCAGTATCTAACTCCACGCGGTAGTGCTTGCGCAGCGTCGGCGCACTGATTTGCATCACCAAGGCAATCTCCGCATGCGGTATGCCAAACGACGTCAATTGTTTTGCCAGCAGGCGATTCTTATCCGTTGGCACATGTGATTTTCTTCCAGTCTCTGCCATCAACGATCTCTTTAGGAAAAATAAAACATCGGCGTTTTTTTCGTTTCATCGCCCTTCTTTTAATAGACGAAAAAAACTAAAAAAACTGGCCTCATTCGCCCTCTTCCCCGCAGCCCTGCGGTGACCACGGCTGCCCGACGCTGGCCGCCTGTCCCCATGCCGCAACACAGCGCACTTGCGCCGCACAGTGTTCATACGCCAAACGCCATCCCAGCGTTTGGCTCAGTAAGTCGCGGACGGTCTCTACACGCGGCAATGGCCGCGCCTCACACGGCTGCAACAACCCCTGCGGCGGTGCGATGACTTCAACGCGGGTCTGTATGACGGTGACTGGCTTGACAGGAACCCCCGTCGAGCAAGCACCCAAGCACGTCAGGCACATCACTATCCAAAAAAGCTTTCGCCTCATCATGAAAGTGCTCCAAATGCGTAATGCGCTGCCGCAACACACGGTCGCGCACCGTGATCCGATTCAAATCCGTATGCAGCCCCGCAATCGCCTGCCTGTCGATCTCACGTAACGCACGCAGCCGCGCAATCGCCGCATCTTGATCGCGATTCATCGCAACCTGCGCATCCAACGTGCTTTCCACCGTCGCTAACTGGCCTTCCAGCTGCGCCGCACGTTGCGCTAATTGACTGCGCTCGGACCACGCCAGCACCGCATGTGCCACCAGCGCCACCAACGCACCAATCATCATGTACTCAACCAACAGCCGCACACTGGGCAAACCTCGCCCCACACGGCGCAGTGTATTAACGATCATCCGAACTCCTTCTCCCCGTGCCAAGCTTGGGCACAATCACGTTCTGAATCAGTTCTAAGGTCTGCGGTGTATCAATTAAACCGGAGGCAATCACCGCCGCCACCGTGAACGCTTGGCTCATCTCCAACCATTCACATACACACATCACGAATAAGCCGACAAACCCCGCAATCCCCGCCTCAATCAACACGCGGGAGACGGCCAGCCTCTCCTTAGCGTCCAGTGCACGCATTAAATAACTCAGCGTCCCCGTGGCCATCGCCAGGCACACATAAAACGCCTCCTTCCACCAGGCAGGAAGCGCCGCAATATCAATCACGTGATGCCTTCTTCACCGCCGCATGTTCGGCTGCTAATGCGGCCTGCCAATCTTGACCTTCAAATAGCCAACGTTCGGCTTTTCGACGCCGAACTAAGCCGGGAAGCACACGACCGCTTGAAAACTTCCACGCCCCAAACTGCTCCGCCGCACCAGCCACATCACCGGCATTGAGCTTGCGTAACAGCGTCGAGCGGTGAAACGCACCCGTACCTATGTTGAAGCTCAACGACACCAACGCATCGAACTGATGCTGCTTGAGTGGCACACGCACATAACGCCGCACCGCTGGCTCAAACTCCTTGGCCAATCGAGCACGTAACCGCGCATCGGCTTCCTGCTCATTGGCAAGACGCATATCAGGCCTAACATGCTTGCCCGTCTCGCCATAGCCAATGGTCAATACTCCCCCAGGACAGGTGTACGGGTTCAACTTGCAACCCTCAAAAAACTTGATCAGTGCAATGCCTTCTTCACCAATGGTCTGCATGGGGGGACTCCAGAACGCAAAAAACCGCCCGAAGGCGGTCGCTGGCTTTGAATAAAAAAAAAGCCCTGCTGAGGTGGGCAGGGCGCGAGTAAATCATTCGATGAGGGCATAGCACCACTCAGGCGCGTAGATTAGGGGGGAAAGTGTTGCAGTATCAATGCAACACTACGCGACGACTCAGGGGAGTTGCATCGCTACAGATACTTATTGGTTGAGCGGATGCTCGGCGATGAACTGACGCAAGGCCGCATTCACACGAGTTTGCCAACCCTTGCCTGTGGCCTTGAAGGCTTCCAGCAGATCAGCATCCAGGCGAATGGCAGTGAACACCTTGGTTTGGTCTGCCTTCGGGCGACCTCGGGGGCGCTTCATGGCGACCAAGGCAGTGTATATCTCAGGGGAGAATGCTTCACAGGCAAGCTTGGCCCCCTTATGCCATTGGCTATCGAGTTCACGCGCATCTACATCAGCGGCGATGCCCGCATTAATTACCTCTGTTTCGTCGTGAGTGGGGATCATCGTCCCCTGTTTAAGTGTCGGCATAGCTTTTCACCTCTCTTGAGTTGGCCTTACGCAGACTGATGACCCGTACAGCATCACCACGAAGGCAAAACACCATCACATGCAGACGGTTGCCGATATACCCCTTTGCTTCAAAACGGGGTTCTGCATACTGTTTACGTGTGTCTTCACGAACCACGGCGGTTTCCCACTCAAAACCATCGGCATCAGCGAGCGACAATCCATGCTTGTCAAGATTGCTTTCGCTTTTAGCAGAATCAAATTCGTAGTTCATTTAAATTATTGTATAAATAATAAATCGACGTTACCAAGCTTTTTATATAAACAGTTAAGTTAATTTAACTAACGCTGCACCTAACAACTCATCTCGGGAAAGCCCCTGACTCATACGTTCAATGGTTGCCTTTGCAACATCATACAAATGCCAGGATGGAAAGTGAGGGTAAGTACATCGTTAGCCATCACGCCACATTTCTGTGCAACGCCTCTTGCAACTGCGTCGCCGCCTGTCGTTCCGCAACACCCATCCGCTCCAACAGCCACTCGTACACGCCACACCATGCTCTGCGGTAGGTGGATTCATCCCGGCCAATCGCAGCGGCGCGCTTGCAATCACTGGCGGGAACCGCACCACTCCCCCCGCACGCCGTGCACACCTTCACTAACGCCCCTACACGTCGTTCCCCCCGGCCATGACAGCAGGGGCATAACTGTGGCTTAGACAGCTCACCCACCACCGCCGCAACCAGTGCCGGTAACATCTCCAACGTCGCCTGCGGCCACAGGTGGGCTTTGACCTTGTCCAGCTGTTCCTGCGCACGCCTCAGCGCAGCCTGCTGTGCGCTTGTCGTTGCTCGGGTCCACCCCATGCACGCTTTGACAATGCCCACATCGGTACGCGCTTCCAGCAAGCGCTGCTGCTGCCGTCGAATCTCCGGCACCACCAAGGCCACCGCCGCATCGCGCAAGGGGCTACGGCGCAACGCTGCGCCATCCGGCCACCAGCACGCTTCCAGTACCTCACGCCCCAACCCCGCAGGCACCAGCCCCAGGGCATGGGCAATGTCTTGCGCTGTCAACTCAGGCACTCCACCAGGCAGCGTGTCGTAGCGGATCGTGCTCGGGTTCAAACGAGCCAGTAAGCGGCGCGGATCAGTCATGGCGGATGTCCTGTGGGTTTTACGTCAAAAGCGGTGACCAGAAGCAGTCACCAGGCGCTTCATAGCGCTCAGGTAGGCATCACCTGAATAAACGTTGGTCACACGGCTATCACCACCGATGGGGAGGATCAGAATGCAGGATTGGCGTGCAGGCGTGACCCTCCGTTGCTTGCTTGCTGCGTGATGCCCGTGATGGTGATGACCACCTGTCCGGCGTGGCGTACCTCATCCTCAACCAACGGATGGGATACAAAACGCCGATCATCAATGCCCAGGGCATCGGCAATGCCATCCCGGTACGGCTTAAACCGCGCCAGCATGTTGTCATCGTCAGGCAGGCAGCGTGTGGGCGGATAGAAGCTAATCCATAGATCCAGGCGCCCCTCAACAGGGAGTGACAGGCCACCCCATCCGGCACGCCGTGCCATCACCTCGGCGTAGCCTCTGGCCTGTTTTACGGCTTTGCTGCGCCGTGTCCAATGCACCCGTGCGTTCGGTGACAGGTCCTTGGACGGCCACGGCAATGTTAAAGATTGCATCTGCTTCACCCCTGCCATGCCGCAGGCAGCAATAGGCATCGCCTTGCTCATATCACCCCCGATTCAATCAGCTCATGCGCCTGATTCCGGCTGAGGTGGAACTCGGCCATGAGCTTCTCCTCACGGGCGGCCAGCTCAGGGTGATGAAAGATGCTGCTGATTTCCTTCAACGCTGCTTGAGCCACCTCAGGGGAGGCCGGTTGGGGCGCTGCGGGCTGTTGCGCCTCAATCAGGGCAACAGGCACCTCTGGCAGGACGCCGCCACGCATGACAAAGTCTCGCGCTTGCTCGTAGGCCTCACGGAGCATCCGATCCGATTCAGCACGGCTGGACTGGCAATAGTTCCATGGGTCGATGAACTCCCAGCATTTGGCTACAAACGGCGTATTGCGCCGCGTGCTGCCTGCGGTGAAGTGACTCCGTACCGCCGCCAGGGATGGCACGCCTAAACACATGCTACGGAATCGCGGCGCACTGGGCGGAAACTCGCCGCCTTCAGCAATGCACGCCGCCAACCCGTCCGCAAACTGAGACGCCTCCAGCCCAACCAGCACCTTCTGCCAGGTTTCACCGGCCACGGTCAGGGCGCCTTCTTCGTCTTGGGCCGACTGACCATGAACATTCACCCAGGCGTGGCCATAGAACGCGGTCATCCGCTCCCATAAGCGGCGCAGCCAGGGGAACGGGATCACCGTGGCTTGCGCAAGGCAGGCGATGTTACTGGAGCACTCCGTCACGCAGGTCAAATTGTCTATGGAAGGCGCGGGCTTGGTCTGCAAGGCCGCGTCGCATAGGTCTTGAATTGTATTCATGGCGTTGATATCCGTGCTTCATGGGGCTGGTTTTGTCTTCGGCAGCGCGGCGAATCCAGTTACGCCAGGTGGCCTCCCAATCCTGTTTGCGCCCCTTGGCTCCGGCCACGCTGTGCCAGTAATCGCGGAATTTCTCGGCTTCGTAGCGTCCATCCACACCCTGCTGGGTGGCGTACAACACATCAACCTCACTGGGTGCCCAGTCATCGGGCAGGCGTGAGCCGTGAGGCGAGCGCTTGGGTTTTTCCGTGCAGCCGGTGGCGTTGCCGTCCTCGGCGAATACCAAACTCTCTTCCGAACGCAGTGAGGAAGAGATAATGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGGGTCTGGTGGGCTTTTGATTGGGTTCTATCTGGGTTTCATTTGGGTTTGTGTTGGCTTTTGATTGGGTTTCATTTGGGTTATCTAAATCAAAGTCAGCGGGTTTCTGTTGGGTTTGTGTCTGGCTATCATGATGAGCAGAATTTCTAGGTCGCCCTCCCTTTTTGCCGTTCTCCTGCTGTGCTGCTGCCCTTGCATGAAAGCGAGCGATTTCCTCATCACATCGCTTGTTATGCCAGCCGTCCTCTTGCAATAAGAAGAATTCATCAAGCACCACATCAACAGCGCTTTTCTCTTTTTTGCTATGGGCACGCGCGATGCGGTGCGCCTTGTTAGCCGGAATCGGTTGTTCTGTGGCGTAGTAGCGATCTAACAGCAGGCAGTAAATGCCATGTTCGAGTACCGAGAGGTACCCCGTGTCACGGGCGTAATCGCCAATGTGGCGTTCGTAATAATTCATGCAGCCTCCCTCAGACACGCGATGACGTCTGGATTCCACAAAAGTTGATAACTGCTGTGCCCGTTGCGCGAGTACGGAATGGCTTCACACCACACGCGACCGGCTTCGGTTAATTCCCATTCGTCACGTTCATTACGGAACTGAAAGCCTCTGGAGGCTAATAACTGGTTCACCGCCTTGGCCGAGCAATGCAGCCGCTTGCCTAGTTGCGTGGCGTTGAGCATGCAAAGCGGTTCCTGTAACGCAGGCAATGCACGGCGTATCTCTTCGGTCGTTAAATTCGTATTGCTTTTGATGCAGGCCAAGGTCGCGGCGGCAGCAATTCCTGGTTTCACGCCAGGCACTGTAGAAATGTATTGGCCGATTAATAGGAGTGCGGCAATGCGATCCTGGCTACATAGAGGGCACCATCCGTTACGACAAGTTCACCGGCAATGATGGGCAGGAGCGTTATGTCACTGAGATTATTGCTGATCAAATGCAAATGATTGCTGGTTTATGTCTTTCCAGAACTGAGGTGGCCGAGTTATGCGGTACTCCGCAACGCGCTCGCCAAGCCGCCTTTCTTAGGAAGAACGGCATTCGGCATTATCTGGATGCGCATGATTGGCCGGTCGTCCTGCGTGCTGCAATTGACGCGCTGCCGCCTTCTGCGATGGTTCGGCCTGTGTGGAAATCTAATAAGGTCGCTTGATGGGACGTAAACCAACCAAAACAGGCGCGATTCCGAGGTTTCGCGTGCGCCCTCAGAAGTCCGGCGTCGTGCATTACTACTACGATCATGGCGGCAAACCACGCAAAGAGACGCCACTGGGACGCGACTATGGTTTAGCCATCAAGCGGTGGGCTGAGCTGGAGCATGCGCAGATCACCCCTGCGATTGCGGTGACGTTTCGCCAGGTGGCTGAACGTTACCGTGCTGAGGTGATCCCGACAAAGGCCCGTAGCACGCAACGTATAAACCGCACCCATTTGAGCTATTTGCTGCAGTTTTTTGACGACCCTCCCGCGCCGTTTGAGGCTATTAAACCAGTGAATATCCGTCAGTACCTGGATTGGCGCACCTCTAAGGTGATCGCCAATCGTGAGGTGTCCGTGTTTTCGCACCTTTGGAATTGGGCGCGGAGTAAGGGAATCACTGATCTTCCCAATCCTTGCGGGGGCATCCGTCGTAATAAGGAGCGGGGCCGTGATGTGTACATAGACGATGCAATGTATCGCTCTGTGTACCAAGCAGCGGACCGAACGCTCAAAGATGCGATGGATCTTGCCTACCTGACCGGGCAGCGTGTGGCTGATGTGATCGCGATGGATGAGCGCCATATTGTTAATGGCGCTTTGGAGATTTGCCAAGCTAAAACAGGTGCGAAATTAGCGATTGCGATTACGGGTGAATTGGCCGTTTTAATAAAGCGTATTTTTGCTCGCAAGCGAGGGATGAAGCTGCGTAGCACACGTTTGATTGTGGATGCGAAAGGTTTGGAGCTAAGTCGTACAGGATTGCGTTACAGGTTTGATAAGGCACGTGTTGCCGCAGGGATCGCCAAGGAGGTGTTTCAATTCCGTGATCTACGCGCCAAAGCGGCGACCGATAAGGCGGATTTGGCGGGCGATATGCGCCAAGCCCAAGCGCAATTGGGGCATGCGTCAGTGACGATGACGGAACACTATGTACGCAAGCGCAAAGGGGCGAAGGTCACGCCTACTCGGTGA
Protein sequences of DBSCAN-SWA_9 >NZ_CP020870|1397321:1416107|1404951_1405290_+|WP_046419644.1|DBSCAN-SWA MVGPKPIEFRGSALDDLRTFPVSVRREAGYQLHQVQNGRDADDWKPMPTVGRGVREIRIRDADGAFRVIYVATLPEAVYVLHCFQKKTEKTTKGDLDVAAKRYRDLFNEVGQ >NZ_CP020870|1397321:1416107|1412287_1412698_-|WP_046419893.1|DBSCAN-SWA MQSLTLPWPSKDLSPNARVHWTRRSKAVKQARGYAEVMARRAGWGGLSLPVEGRLDLWISFYPPTRCLPDDDNMLARFKPYRDGIADALGIDDRRFVSHPLVEDEVRHAGQVVITITGITQQASNGGSRLHANPAF >NZ_CP020870|1397321:1416107|1404033_1404879_-|WP_046419646.1|head|DBSCAN-SWA MLTLPDLLRLQGRRIRQRQLRPPRPSRQAEATYRNELLALVRVLHQAVREEVLPVLHASPPHMTRDAPDGSAPQGYLAAQFMQAIEAALRRAALRCGGLPQWAERIAAQQVQRVDRQVVQTIGSAVRTAFGIDITSWMLAQQVRDQIHAARAVNVQLITSIQRQYFDKIGTAVLQGVMQGKRASVVAKEMEQITDATASRAKFIARDQTSKMNAALNEIRQVGLGITTYTWQTSGDERVREDHAAHDGNVFRWSDPPATGHPGQDYNCRCVAIPNVTLEGP >NZ_CP020870|1397321:1416107|1400921_1401275_-|WP_046419651.1|DBSCAN-SWA MTTIMLKNTRSCDVTLDGVTIQAGRTQALEAAHVEQLRQHPGIGLWFDNGYLVEQQQEPVQREEQQEKPLQPHSPPEGAGPPNSEGQVSGEGSAAVLPGTGSHDSTPGKSGKPGRKG >NZ_CP020870|1397321:1416107|1414767_1415088_+|WP_080939588.1|DBSCAN-SWA MRQCDPGYIEGTIRYDKFTGNDGQERYVTEIIADQMQMIAGLCLSRTEVAELCGTPQRARQAAFLRKNGIRHYLDAHDWPVVLRAAIDALPPSAMVRPVWKSNKVA >NZ_CP020870|1397321:1416107|1410681_1411020_-|WP_004085696.1|DBSCAN-SWA MPTLKQGTMIPTHDETEVINAGIAADVDARELDSQWHKGAKLACEAFSPEIYTALVAMKRPRGRPKADQTKVFTAIRLDADLLEAFKATGKGWQTRVNAALRQFIAEHPLNQ >NZ_CP020870|1397321:1416107|1400077_1400557_-|WP_046419654.1|DBSCAN-SWA MSAVTILRPADPKKWKALAQRLQALGERAVVVGIPAAHNARTEDGIGSAGLLAVHELGAPERGIPERSVVRRSISAHQEKYVALHTHHLRAVLRDAMSVDTALNLLGTVAAGDVKATLRHADLAPLKPQTIQRKGSSAPLIDTGQMIQSITYEVRDAED >NZ_CP020870|1397321:1416107|1402333_1402816_-|WP_046419648.1|DBSCAN-SWA MSAIDLSTYGGRLLDLGVAGQVIDLNTSRLCNYKNEGQTPIDFGLFVARGPKDATCKAPDGADAAILGISVRHVTMVADATGEVRYAPHAMVPVLEIGRIWVICEDGCRPDDPVSIRIAGTGALGAARSAAIASETISYPQARWDSTTAPGALGVIRILK >NZ_CP020870|1397321:1416107|1411006_1411273_-|WP_010894061.1|DBSCAN-SWA MNYEFDSAKSESNLDKHGLSLADADGFEWETAVVREDTRKQYAEPRFEAKGYIGNRLHVMVFCLRGDAVRVISLRKANSREVKSYADT >NZ_CP020870|1397321:1416107|1408992_1409223_-|WP_046420065.1|DBSCAN-SWA MQTRVEVIAPPQGLLQPCEARPLPRVETVRDLLSQTLGWRLAYEHCAAQVRCVAAWGQAASVGQPWSPQGCGEEGE >NZ_CP020870|1397321:1416107|1399722_1400091_-|WP_046419655.1|DBSCAN-SWA MLKISALFGNPRFAQTVTVHRDHGHYRADGTWTQDSVAHPVRAILHPVKPDDLQLLPEGQRYLPSKKIMSQDALCVGDLVHYQDTTWRIAQLSNWSEYGYYHGIAVRHDGTAHPTAAAVGLT >NZ_CP020870|1397321:1416107|1415087_1416107_+|WP_046419629.1|integrase|DBSCAN-SWA MGRKPTKTGAIPRFRVRPQKSGVVHYYYDHGGKPRKETPLGRDYGLAIKRWAELEHAQITPAIAVTFRQVAERYRAEVIPTKARSTQRINRTHLSYLLQFFDDPPAPFEAIKPVNIRQYLDWRTSKVIANREVSVFSHLWNWARSKGITDLPNPCGGIRRNKERGRDVYIDDAMYRSVYQAADRTLKDAMDLAYLTGQRVADVIAMDERHIVNGALEICQAKTGAKLAIAITGELAVLIKRIFARKRGMKLRSTRLIVDAKGLELSRTGLRYRFDKARVAAGIAKEVFQFRDLRAKAATDKADLAGDMRQAQAQLGHASVTMTEHYVRKRKGAKVTPTR >NZ_CP020870|1397321:1416107|1400553_1400922_-|WP_046419652.1|DBSCAN-SWA MSGALTLHTFLARYPEFATQPPERVAQALEDAHPWLDASRWGAAYAQGIASLAAHFVWSTPGLGDSAATTGAVVSERAGDLHISYAALPSGSASDAWLATSVYGQRYLALRRMIGLGALVAP >NZ_CP020870|1397321:1416107|1411460_1412159_-|WP_046419632.1|DBSCAN-SWA MTDPRRLLARLNPSTIRYDTLPGGVPELTAQDIAHALGLVPAGLGREVLEACWWPDGAALRRSPLRDAAVALVVPEIRRQQQRLLEARTDVGIVKACMGWTRATTSAQQAALRRAQEQLDKVKAHLWPQATLEMLPALVAAVVGELSKPQLCPCCHGRGERRVGALVKVCTACGGSGAVPASDCKRAAAIGRDESTYRRAWCGVYEWLLERMGVAERQAATQLQEALHRNVA >NZ_CP020870|1397321:1416107|1397321_1397759_-|WP_004087249.1|DBSCAN-SWA MSVFDPKQVSVLLNGTQIKDWADGTDVIDAKHNADAGAYTIGASGTGVFVANADRSGTLTLKIKQHSADNAFLSRRLAQQRGAIQSFTPFTLDIRDLLNQDVVTATQGYFTTPPGFTRGAGHNPETWTLVFEVMDITLEKGFGNA >NZ_CP020870|1397321:1416107|1405286_1405568_+|WP_046419643.1|DBSCAN-SWA MSNERFTSVWDAIEDTPEAAENMKLRSTLMMALKQHIETAALSQSQAATLFGVTQPRVSDLMRGKINLFGLDALVNMAAAAGMHVEMRVLKAA >NZ_CP020870|1397321:1416107|1402843_1404034_-|WP_046419647.1|DBSCAN-SWA MITLDVQLTQRRKTPEGYLIVPARFARTGIQHYAAHELGVSDADPQRVIRVYRPPEEVFAAEAIASFDGRPITDEHPDEEVTAENWRAHAVGFARNPRREGEYLVADLTITDEATIEKIEAGKQELSGGYSAEYDWTPGWTPEGEAYEVKQIRIRGNHIATVAAGRAGPQCRVADRDIALPPSFGEHPMTKRRISVDGISLELEETEASAVEHLATKLKTATEKVDALEEDLHAAQAPIKLDSGQALTKEQLVAKIAELSKQLAALEAARAADEDPHQRDQAIAAMSRQIGDAQRLVPGLVTDGKPCSAIRRDVVSRLHPTHTAMIDTLLGGVRVADAAQTAVDLAFHVLASAPVTASAGLAAEAVNEALRRQVVKTSDADLDPRAAYIQQLTHAT >NZ_CP020870|1397321:1416107|1407001_1408552_-|WP_046419640.1|terminase|DBSCAN-SWA MKSKSSRPSSGLRLSTEQAIDQAVIKARCEADHLFFTRYFFKQRQQLRFRVNWHHQVIAGVVDDVIAGRRKDVVINVPPGSSKTELVAINLMARGLALNPYARFLHISYSDDLALLNSETAREIVQSDEYRALWPLEIADDAKSKKRWNVVVDGKKAGGVYAVSLGGQVTGFRAGHMAPGWQGAIIIDDPLKVEDAYSKTGRSKANRKLVSTVKSRKASPDTPIIVIMQRLAQDDPTGFIQSGGFPGAWECIEIPALIDDAYVSRLPEHVQGQVVRDVQDQDGRYSYWPYKEPLAELLALEATDRYVFSGQYQQRPSPLGGGIIKGDQFGRYTVLPRILSRTVYGDTAQKTAERNDYSVFQLWGLGEDKRIYLLDMIRGKWEAPELKRRAIDFWNAHRAYDHKVSAPIRQMKIEDKSSGTGLIQDIARGGAGQGRIPVTGIARVTDKLTRVMDVVSYIDAGWVVIPAQAAWVKDFVAECEAFTADGTHAHDDQIDPMVDAINDLLANPASDWSRWV >NZ_CP020870|1397321:1416107|1409979_1410477_-|WP_023907472.1|DBSCAN-SWA MQTIGEEGIALIKFFEGCKLNPYTCPGGVLTIGYGETGKHVRPDMRLANEQEADARLRARLAKEFEPAVRRYVRVPLKQHQFDALVSLSFNIGTGAFHRSTLLRKLNAGDVAGAAEQFGAWKFSSGRVLPGLVRRRKAERWLFEGQDWQAALAAEHAAVKKASRD >NZ_CP020870|1397321:1416107|1405616_1406921_-|WP_080939596.1|DBSCAN-SWA MVAGLGDQRDKMSYGRYLLPRVIDRVELEAMYRTNWLARKVVDIPATDMTREWVTLNTALHADALEPMHRLEQALNVRAKVRDALAWARLYGGAVLFINVHGQDPCLPFDPASVMPGTRLSLTVLDRWRVALGSGQMDQDPLSETYGQPRCYQIAGSVERVDHSRMIAFSGAELPWEAFRGNGYWHDSVLQAMYNALSRYDTATQGTASMFFEAVVDVLRISGLSDTLASDQGTQEVHKRFQLAAMMKSFNRMLLLDAKDEYTQKTNHFAGVKDVIEQFMMDISGAADIPATRLFGQSPKGMNATGDSDIRNYYDRIKAQQEDELRPVLRVLYAVLFRASVGECPQDLEIQFNSLWQMSETEQATIEKLRAERDQIYLTHGVIGPDVPCAELLEQKTYSKITERDVTLAAELSQAMEVPDVLPVVETTAPASNT >NZ_CP020870|1397321:1416107|1409657_1409987_-|WP_046419634.1|DBSCAN-SWA MIDIAALPAWWKEAFYVCLAMATGTLSYLMRALDAKERLAVSRVLIEAGIAGFVGLFVMCVCEWLEMSQAFTVAAVIASGLIDTPQTLELIQNVIVPKLGTGRRSSDDR >NZ_CP020870|1397321:1416107|1397768_1399265_-|WP_046419658.1|DBSCAN-SWA MALALSNIVNVQLNGQPQSAPRRDFGRLAVFTPEAGSVFVDTKTRFMDASTQNEVERAFGSYSKTAAATGRFFAQSPRPKQLMVARWNRFKQHIAASPTTLTSGAIAQGDTWYKGVDDGCFSIRIYGVDVTLSKLNFTTATSFSQVAGVLNNALDEFGVNCRFLNGCFELYAAVAGGNHAIGYAQQRSPSGTYVGHWLKLEADQARLTIGNNADTMEAETLPEAFAALQALTPGWYAAAVADETLTDTQIRSASAWIQAADKKIMGWTTREAAHLDFKKTNVFRQLNASGHDRTVVLYDTTDPYAVISWLARALSVNFSANNAALTMKFKHLPGVAADQLTQTQVAQCVRLGINYYAYVDDVAMVAEGTCLGGRFFDEVHLLDWLVDAVQKEVFAVLHRSPTKVPLTDAGTHLLIAACKKVCQEGVRNGALAPGLWNGQAFGALATGDYLEAGFYVWADSVDTLSTSDRQARRAPPLQIAVKLAGAIHAVDVIINFDR >NZ_CP020870|1397321:1416107|1412745_1413435_-|WP_046419631.1|DBSCAN-SWA MTECSSNIACLAQATVIPFPWLRRLWERMTAFYGHAWVNVHGQSAQDEEGALTVAGETWQKVLVGLEASQFADGLAACIAEGGEFPPSAPRFRSMCLGVPSLAAVRSHFTAGSTRRNTPFVAKCWEFIDPWNYCQSSRAESDRMLREAYEQARDFVMRGGVLPEVPVALIEAQQPAAPQPASPEVAQAALKEISSIFHHPELAAREEKLMAEFHLSRNQAHELIESGVI >NZ_CP020870|1397321:1416107|1409206_1409668_-|WP_046419636.1|DBSCAN-SWA MIVNTLRRVGRGLPSVRLLVEYMMIGALVALVAHAVLAWSERSQLAQRAAQLEGQLATVESTLDAQVAMNRDQDAAIARLRALREIDRQAIAGLHTDLNRITVRDRVLRQRITHLEHFHDEAKAFLDSDVPDVLGCLLDGGSCQASHRHTDPR >NZ_CP020870|1397321:1416107|1399265_1399736_-|WP_080939589.1|DBSCAN-SWA MALPEGTVRPADQAAPCGAAPFVTVKRLRSSPLGAACCVFDGRQQSITCAYLHHISVNAYGTGAYELVLQAQALLSCEAGTAGLRALRAGLVSVTAAQDLSAIVGGGYEARARIELQITHHHRVVTTLAAVDSADIHIHTRTGHIASVTMTAPETQ >NZ_CP020870|1397321:1416107|1408502_1408901_-|WP_046419638.1|DBSCAN-SWA MAETGRKSHVPTDKNRLLAKQLTSFGIPHAEIALVMQISAPTLRKHYRVELDTGHIQANAKVAKSLFRLATHSTNPNITAIIFWLRTRAGWKDTQRVEVSGRDGEAIEQKVGLALVDEKQIVSALKRLEAKY >NZ_CP020870|1397321:1416107|1401340_1402324_-|WP_046419650.1|DBSCAN-SWA MNMIDIRRRQMADALNPMLLTDARYQTSDATQALAFLVSQLTHVESTIYARQRQGIQYRDLVPISTEAGEYATSVTYQMYDYSGRGKRHSGRGEDIPTVDVAYAQKSVPVVLGTIGYDYTTEELRQSAFLRKPLNTARADAAMDAYERHINDVALFGEDELTGLYTHPGVPVLLNTAGPWIGQSPAQVLALFNTLISSAWMNTHYVEMIDTVLLPGSVMNYLVSTPRSDNSDKTILHYVLENNIAKAERGLDLTVRTGYGLETAGEGGTTRAMLYTKNPTKLVLHLPMPIRFLPPQPKGLTFDIPGEYKYSGVEFRYPKSALYADGI |
27 | Haemophilus_phage(27.78%) | terminase,integrase,head | attL 1414364:1414377|attR 1418798:1418811 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|