CRISPRimmunity

Please click to download your results

Overview of predicted results

Overview of the results

Contig_ID	Contig_def	CRISPR array number	Contig Signature genes	Self targeting spacer number	Target MGE spacer number	Prophage number	Anti-CRISPR protein number
NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	1 crisprs	NA	1	1	0	0
NZ_AP021859	Alteromonas sp. I4	5 crisprs	WYL,DEDDh,DinG,RT,csa3,Cas9_archaeal,cas3	1	0	2	0

Results visualization

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Crispr_ID: NZ_AP021860_1

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_AP021860_1

17-129

Orphan

Consensus_repeat	Method
GAATGTTGAATAAATTCAAGTGTCCAATATAGGATACATAG	CRISPRCasFinder

1 spacers

The CRISPR arrays of NZ_AP021860_1

>merge|NZ_AP021860|1|17-129|CRISPRCasFinder
GAATGTTGAATAAATTCAAGTGTCCAATATAGGATACATAGTTGCCCCATGTGGGACACTTCCGCCCACTGTGAATGTTGAATAAATTCAAGTGTCCAATATAGGACACATAG

>NZ_AP021860|1|1|17-129|CRISPRCasFinder
GAATGTTGAATAAATTCAAGTGTCCAATATAGGATACATAG	TTGCCCCATGTGGGACACTTCCGCCCACTGT
GAATGTTGAATAAATTCAAGTGTCCAATATAGGACACATAG

Protein	Signature genes	Signature genes Name	Protein_function
NZ_AP021860.1\|WP_162360026.1\|2671_3829_-\|putative-DNA-binding-domain-containing-protein	unknown	unknown	gnl\|CDD\|377306
NZ_AP021860.1\|WP_162360027.1\|5163_6498_+\|alpha/beta-hydrolase	unknown	unknown	gnl\|CDD\|224561
NZ_AP021860.1\|WP_155016734.1\|9124_10018_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021860.1\|WP_155016736.1\|12855_13092_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021860.1\|WP_155016732.1\|6551_7628_+\|MBL-fold-metallo-hydrolase	unknown	unknown	gnl\|CDD\|225130
NZ_AP021860.1\|WP_155016802.1\|344_1313_-\|tyrosine-type-recombinase/integrase	unknown	unknown	gnl\|CDD\|271180
NZ_AP021860.1\|WP_155016730.1\|3977_4733_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021860.1\|WP_155016728.1\|1553_2414_+\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021860.1\|WP_155016735.1\|10355_12710_+\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021860.1\|WP_155016733.1\|7724_8738_-\|hypothetical-protein	unknown	unknown	gnl\|CDD\|235090

>NZ_AP021860.1|WP_155016802.1|344_1313_-|tyrosine-type-recombinase/integrase
MLMLSLEDVPLAISPDDRQRVIDNLQEFKDEVWHLKSENTKRAYQSDFKQYLSFCMDNGMPALASDWRITRESCRSYLKYMMASKLKHHTIRRKIASIRYFIGVSELADPWKHSKLFTEFTNNTLKAKPSRQGQAKPLRVNLIDKFTSQLDLDNLLELRDAVIFNVAIDTIFRASNLLAIDISHIKFSQNKVFAPRSKTDRTGKGHYGYISQTSIELIKRWMEAGNISTGPLFRTLSPKHTVRDEGMQYHALISRYRTIARRIMIEDRFSCHSTRVGGVVTMFENGVSLDEIQKAGGWSSQAMPLHYAEEYDVAKTGMARLR
>NZ_AP021860.1|WP_155016728.1|1553_2414_+|hypothetical-protein
MNNDRPWQCFIEQQLDTFLSSLGNADWVSLYPTTLDKERLAESGESAALMAMRKVIKPGAMRRFERDFAEYRKEFEKSLWSVYLSKHLKKFLSAVPESDYHPDAAPLTDDKIESWFNLQPKDLYKTLRIWIPQRYVKQFQNRYRAYKHREVRNIKVFDISAKSKAILERYRDEIEANSLDEAIERCFSVNYRTRENDPESTVAKSAIANTIMFGNDVYFDDLMQRLSNNDRQKLALIIERSFKAGWNAAKANRVRKGDPKQQALDEFDLMQKLTAFLPAENVDSGQ
>NZ_AP021860.1|WP_162360026.1|2671_3829_-|putative-DNA-binding-domain-containing-protein
MQKLVEQLIKNKSEGYWWDFKLKHHSNLLELLHDVLCLANIIYEGERFIIFGVSDDFNVIGLNDDDIRHKQADILNFLRTKSFAYHKIPSVKIDTIQVDGKELDVLSIKDENYKPYFLTRDETKKGITIRAGTIYSRLGDSNTPKDSCANPYEVEAMWRQRFGLDKKASERFYDVLVDFKNWKYDGISKAFYDIDPDYTIEIGGNEGSGGKFWWEESLFEKPDRFYYHLRYKGVELYKLLVVRFNSENLQLPFPDVEYITYPEKNDGCETEVYCDIFFYLENSIEYSLFKHIRALEVSEITKKSFTTPIETQMKPRIIELPFLIFNSEYSLKTACQKLVENYNDFLRVKSESNEIKNSSDEMRKRYITERLFTEWAYSIVHENST
>NZ_AP021860.1|WP_155016730.1|3977_4733_-|hypothetical-protein
MKDQLDSVIPIFHEDFQTEKINQIGSGVLILFRAYYFILTAGHVIDEQKSGHLLIPGVTRHLTGIRGSFSHFNPIIGRKNDLVDVGYFKLETDFGLELSKVFEVVTEQDMFLAPEYAEDTIFSLCGYPYRKSKIENDQVNNEIFSYSALHAKAEEYEKHNCKQPYQIVMKFNRKKAVDSYSGKKEISPLPHGISGGGIFIWPKIFESLTPIDRKLTGICHTYKQSEHLFIGTNLLLIINFILHNNPELAKN
>NZ_AP021860.1|WP_162360027.1|5163_6498_+|alpha/beta-hydrolase
MVSTIRFLIVAAVMFSIISCGSVPYLEKSGHALIPPAEMDLSFDDYVAHSTAEIRTAMQGRQDPLVFQGSYSLDDAVSMRAPYSIPVDSTVCTGGTGGEDKGFLLIHGLTDSPYLMKGLANSMRKAYPCSTIRAIVLPGHSTIPGDSNHSSDWDGSNDSQLMTYKKWLKSTNFGIRSFDNKEHVKSLYVLTFSTGAPLLIQHLSKHKEEKLKGAVLISAAIKAKSKAAFLAPLAQYIVPWSTVYPEEDAVRYETFSTHAAAEFYWLTRELLEEEYRFKLPLFIAISADDNTVSAQAALRYFCAAETDSKQMLWYQYASSERPLNSYRLASGPCGEDIIAREIGKNGFELPSYYKSFSHTSLSVPESDPHYGKEGAYRQCKDYFKDDKLEKFEQCKDPDEANYVIGETTERLYEENKGKRVRRGVYNPDYPYMEQQILEFIESIR
>NZ_AP021860.1|WP_155016732.1|6551_7628_+|MBL-fold-metallo-hydrolase
MRKLILNVLTALTVTACAYKPNHYLPVENNEPGKGDLYGRFLGVTNLYFSDGHDAIMIDAYIGHRTLPGLFFFYDMETDPENVELILDRAEITDIDRVFIAHSHFDHALDVATIVKLHPGIVVSGSLNTRAILEDDKQITRIVDLAEGKSAGSKSEPVDSDETVALFDNVVGGNERYQGNFKVTVFESPHVKKEPHQRWVESAINYVSDGDIFKEPGTSYSYYIDHPQAKMLVVPSAGYPSSFNDIEADVVFLGIGLLSNWVGKPGLYREPPFKYVEQYWQKTVVDTCASVVVPIHWDSPFTALSVEPRAPLDIFDSITKSVAALERVAETMRGCDGKPVEIVFPRGFKRFKVPVNSY
>NZ_AP021860.1|WP_155016733.1|7724_8738_-|hypothetical-protein
MANDGVRLLQLIDELNDAFDVFEANLDLETTAYFADIAPIDKNMQKSYQEGRVCLLVEHYFGNEAIDKCVRALRQYKRPDETVSGRFAGQFPGILFAKNGETVQRDVAQINIIKSEIQACVQDQRRRKRGKRMETYHARNHQEKHEFLHRYLPNAISYQLYRHIDVISVASDDETLSLKTIGFYWGNKNTDKYLSLDQANHYIDKSREMSVSSSIRMEIKEKLANSNLSHRFCLRRTRNDTINISLYFGVGENGGPVTSRTVIPQPIIITDYDSIPKIGLVKHQEPGRRSEIRTGHQWELLDSKLKLYRRPKTPDEFKKDADRALLEVDGLTNTAKT
>NZ_AP021860.1|WP_155016734.1|9124_10018_-|hypothetical-protein
MPKLTPAQRRLRNALERGVCYKSLDFDYLCSRAPHRAGVIIEGDSWVDYPRKYIISGPSINLGHRFEQTTEYLDTVNVLRIGSNGDTAVGMTTGKQFALMKKILKKNGKHIQLMLFSAGGNDIVGESDLDPLIKEFDAALHHGWEDVIEKELFDEKLDEILEAYLRMIELYKSLAPAASIVTHTYDCVNPSPQGAAFFWNLIKTKSWVWPTMQKRNIPVEWRAPIIQYMLSLFSLRIQALQNHPSAVGRFYVVDTQGTLDPTSKADWLNEIHATPAGYRKIFNKMYPIFKHLLPVLP
>NZ_AP021860.1|WP_155016735.1|10355_12710_+|hypothetical-protein
MRQYPPRLQQLVAAITATGFTAIMLALGIILLTLVDQIKDILIQVDQSDHVRYRIGFILGGLSFSLACWWSARFVLDVMAASHSKNSPESCITGNYKVLAPSAGISPFLALWLPRFYAAVIPIVVLIAGACNELWILFLLSTFGVLVPALLFVICRRAFISWAFNVQTPALFRIFGDTKSWGLLASIMYLLVTVWALANPYSLGDFFGAYFVVFWGLSSILFCLIVSFYGLVPWLIKKVFGPLVKAQQTAFEIQQQAHPNVVLSRPLLLQLSESQLVQPPHPVSLPVFLLAVLISWLVDTDNHEVRRVYLKEQVSYAYADFSEAWKTFSINLSKSDLYLKPSTNETEQRVRKKPVFFVASQGGGLRAAYWSAVGMGYLEARIPGFSQHVFSLAGVSGGSVGNSFYAASLNQPVVQNQCLALTKDLHLACGLEHALGTDYLSPVLTSFLYNDLLYRFFPLSSLPFMVRDRAEVLETSWEKGFARVFSSNNMQAKLQSLYQSDDDNWLPLLMSMGAHQESGTRLVTAPFPIEQDIFINQYNTYDLMACDNGIGINCDMRLSTVALNAARFPFVTPAGTLAKKSENGSNIPWKEKDHIIDGGYVENYGLMATRFMISHLMANNQFSVVENGQSIELVPVVIIFANDMDLTTEVFNPSRKRPYRNGNSLALNEVTNPLQGLLTTRSGRSVQSLTELIDFQHRLGGKQIATQITFPDAGSNEAPVIMNNVVVFHLQNDASNVNVPLGWWLSDQSQTYMSDQYRTQGKRAHQAIESLSRVIKTSSNAESP
>NZ_AP021860.1|WP_155016736.1|12855_13092_-|hypothetical-protein
MTILRIDNYPELKFIFWDWSPNEIDEKIAFALIEKRWPYISHKNLNSEEVSLIQRLAKTYGNGLLLVGQAQTIVQSTE

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860.1	123606-123636	0	1.0
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860.1	123750-123780	0	1.0

1. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to position: 123606-123636, mismatch: 0, identity: 1.0

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ttgccccatgtgggacacttccgcccactgt	Protospacer
*******************************

2. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to position: 123750-123780, mismatch: 0, identity: 1.0

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ttgccccatgtgggacacttccgcccactgt	Protospacer
*******************************

MGE targeting detection<

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	58-88	0	1.0
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	123606-123636	0	1.0
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	123750-123780	0	1.0
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	123462-123492	2	0.935
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	123534-123564	2	0.935
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	123678-123708	3	0.903
NZ_AP021860_1	1.1\|58\|31\|NZ_AP021860\|CRISPRCasFinder	58-88	31	NZ_AP021860	Alteromonas sp. I4 plasmid pAltI4, complete sequence	130-160	4	0.871

1. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 0, identity: 1.0

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ttgccccatgtgggacacttccgcccactgt	Protospacer
*******************************

2. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 0, identity: 1.0

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ttgccccatgtgggacacttccgcccactgt	Protospacer
*******************************

3. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 0, identity: 1.0

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ttgccccatgtgggacacttccgcccactgt	Protospacer
*******************************

4. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 2, identity: 0.935

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ctgccccatgtgggacacttccgcccactgg	Protospacer
.*****************************

5. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 2, identity: 0.935

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ctgccccatgtgggacacttccgcccactgg	Protospacer
.*****************************

6. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 3, identity: 0.903

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ctgccccatgtggggcacttccgcccactgg	Protospacer
.*************.***************

7. spacer 1.1|58|31|NZ_AP021860|CRISPRCasFinder matches to NZ_AP021860 (Alteromonas sp. I4 plasmid pAltI4, complete sequence) position: , mismatch: 4, identity: 0.871

ttgccccatgtgggacacttccgcccactgt	CRISPR spacer
ctgccccatgtggggcacttccgcctactgg	Protospacer
.*************.**********.****

Prophage detection

Region	Region Position	Protein_number	Hit_taxonomy	Key_proteins	Att_site	Prophage annotation

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Crispr_ID: NZ_AP021859_1

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_AP021859_1

115001-115125

Orphan

Consensus_repeat	Method
AAAATAACCGCGAACGCATAAATTT	PILER-CR

2 spacers

The CRISPR arrays of NZ_AP021859_1

>merge|NZ_AP021859|1|115001-115125|PILER-CR
AAATAGCCGCGAACGCATAAATTTTGGGTTGTTTTGTCGGGTGAATATCAATTTAACCGCGAACGCATAAATAAGATTGAAATATCCTATCTGCTATCATAAAGTAACCGCGAACGCATAAATTT

>NZ_AP021859|1|1|115001-115125|PILER-CR
AAATAGCCGCGAACGCATAAATTTT	GGGTTGTTTTGTCGGGTGAATATCA
ATTTAACCGCGAACGCATAAATAAG	ATTGAAATATCCTATCTGCTATCATA
AAGTAACCGCGAACGCATAAATTT

Protein	Signature genes	Signature genes Name	Protein_function
NZ_AP021859.1\|WP_155013287.1\|109619_110942_+\|MFS-transporter	unknown	unknown	gnl\|CDD\|273327
NZ_AP021859.1\|WP_155013290.1\|115419_116688_+\|HipA-domain-containing-protein	unknown	unknown	gnl\|CDD\|341494
NZ_AP021859.1\|WP_073324062.1\|112417_113254_+\|p-hydroxycinnamoyl-CoA-hydratase/lyase	unknown	unknown	gnl\|CDD\|236383
NZ_AP021859.1\|WP_155013291.1\|116777_117206_+\|DUF3010-family-protein	unknown	unknown	gnl\|CDD\|378599
NZ_AP021859.1\|WP_155013286.1\|105369_107844_-\|TonB-dependent-receptor	unknown	unknown	gnl\|CDD\|224544
NZ_AP021859.1\|WP_073324060.1\|110983_112381_+\|aldehyde-dehydrogenase	unknown	unknown	gnl\|CDD\|143423
NZ_AP021859.1\|WP_155013283.1\|101386_102469_+\|Rieske-2Fe-2S-domain-containing-protein	unknown	unknown	gnl\|CDD\|226985
NZ_AP021859.1\|WP_155013297.1\|125097_125910_-\|TauD/TfdA-family-dioxygenase	unknown	unknown	gnl\|CDD\|181947
NZ_AP021859.1\|WP_155013292.1\|117299_117662_+\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155013284.1\|102456_103407_+\|2Fe-2S-iron-sulfur-cluster-binding-domain-containing-protein	unknown	unknown	gnl\|CDD\|99782
NZ_AP021859.1\|WP_155013289.1\|115138_115423_+\|transcriptional-regulator	unknown	unknown	gnl\|CDD\|213767
NZ_AP021859.1\|WP_155013295.1\|123152_124235_+\|electron-transporter-RnfD	unknown	unknown	gnl\|CDD\|380087
NZ_AP021859.1\|WP_155013282.1\|100416_101118_-\|FCD-domain-containing-protein	unknown	unknown	gnl\|CDD\|224715
NZ_AP021859.1\|WP_155013294.1\|120562_123034_+\|glycoside-hydrolase-family-9-protein	unknown	unknown	gnl\|CDD\|199881
NZ_AP021859.1\|WP_162359724.1\|118444_119167_+\|PEP-CTERM-sorting-domain-containing-protein	unknown	unknown	gnl\|CDD\|377877
NZ_AP021859.1\|WP_073324056.1\|108406_109432_+\|AraC-family-transcriptional-regulator	unknown	unknown	gnl\|CDD\|378890
NZ_AP021859.1\|WP_162359727.1\|124231_124978_-\|CPBP-family-intramembrane-metalloprotease	unknown	unknown	gnl\|CDD\|376806
NZ_AP021859.1\|WP_073324076.1\|119339_119885_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155013285.1\|103501_105223_-\|tannase/feruloyl-esterase-family-alpha/beta-hydrolase	unknown	unknown	gnl\|CDD\|284851
NZ_AP021859.1\|WP_155013288.1\|113264_114737_+\|AMP-binding-protein	unknown	unknown	gnl\|CDD\|181644

Protein	Function_ID	Function_description	E-value
NZ_AP021859.1\|WP_155013287.1\|109619_110942_+\|MFS-transporter	gnl\|CDD\|273327	TIGR00895, transport_protein, benzoate transport. [Transport and binding proteins, Carbohydrates, organic alcohols, and acids].	9.9276e-88
NZ_AP021859.1\|WP_155013290.1\|115419_116688_+\|HipA-domain-containing-protein	gnl\|CDD\|341494	cd17809, HipA_So_like, type II toxin-antitoxin sytem toxin HipA from Shewanella oneidensis and similar proteins. This family contains type II toxin-antitoxin (TA) system HipA family toxins similar to Shewanella oneidensis HipA, a serine/threonine-protein kinase that phosphorylates Glu-tRNA-ligase (GltX), preventing it from being charged, leading to an increase in uncharged tRNA(Glu). This induces amino acid starvation and the stringent response via RelA/SpoT and increased (p)ppGpp levels, which inhibits replication, transcription, translation and cell wall synthesis, reducing growth and leading to persistence and multidrug resistance. HipA is the toxin component of the HipA-HipB TA module that is a major factor in persistence and bioflim formation; its toxic effect is neutralized by its cognate antitoxin HipB. HipA, with HipB, acts as a a corepressor for transcription of the hipBA promoter. In the Shewanella oneidensis HipAB:DNA promoter complex, HipB forms a dimer that binds the duplex operator DNA, with each HipB monomer interacting with separate HipA monomers. The HipAB component of the complex is composed of two HipA and two HipB subunits.	1.0995e-119
NZ_AP021859.1\|WP_073324062.1\|112417_113254_+\|p-hydroxycinnamoyl-CoA-hydratase/lyase	gnl\|CDD\|236383	PRK09120, PRK09120, p-hydroxycinnamoyl CoA hydratase/lyase; Validated.	0
NZ_AP021859.1\|WP_155013291.1\|116777_117206_+\|DUF3010-family-protein	gnl\|CDD\|378599	pfam11215, DUF3010, Protein of unknown function (DUF3010). This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.	2.18043e-68
NZ_AP021859.1\|WP_155013286.1\|105369_107844_-\|TonB-dependent-receptor	gnl\|CDD\|224544	COG1629, CirA, Outer membrane receptor proteins, mostly Fe transport [Inorganic ion transport and metabolism].	6.10852e-42
NZ_AP021859.1\|WP_073324060.1\|110983_112381_+\|aldehyde-dehydrogenase	gnl\|CDD\|143423	cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like. Salicylaldehyde dehydrogenase (DoxF, SaliADH, EC=1.2.1.65) involved in the upper naphthalene catabolic pathway of Pseudomonas strain C18 and other similar sequences are present in this CD.	0
NZ_AP021859.1\|WP_155013283.1\|101386_102469_+\|Rieske-2Fe-2S-domain-containing-protein	gnl\|CDD\|226985	COG4638, HcaE, Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit [Inorganic ion transport and metabolism / General function prediction only].	3.45174e-68
NZ_AP021859.1\|WP_155013297.1\|125097_125910_-\|TauD/TfdA-family-dioxygenase	gnl\|CDD\|181947	PRK09553, tauD, taurine dioxygenase; Reviewed.	1.9665e-96
NZ_AP021859.1\|WP_155013284.1\|102456_103407_+\|2Fe-2S-iron-sulfur-cluster-binding-domain-containing-protein	gnl\|CDD\|99782	cd06185, PDR_like, Phthalate dioxygenase reductase (PDR) is an FMN-dependent reductase that mediates electron transfer from NADH to FMN to an iron sulfur cluster. PDR has an an N-terminal ferrredoxin reductase (FNR)-like NAD(H) binding domain and a C-terminal iron-sulfur [2Fe-2S] cluster domain. Although structurally homologous to FNR, PDR binds FMN rather than FAD in it's FNR-like domain. Electron transfer between pyrimidines and iron-sulfur clusters (Rieske center [2Fe-2S]) or heme groups is mediated by flavins in respiration, photosynthesis, and oxygenase systems. Type I dioxygenase systems, including the hydroxylate phthalate system, have 2 components, a monomeric reductase consisting of a flavin and a 2Fe-2S center and a multimeric oxygenase. In contrast to other Rieske dioxygenases the ferredoxin like domain is C-, not N-terminal.	7.03559e-102
NZ_AP021859.1\|WP_155013289.1\|115138_115423_+\|transcriptional-regulator	gnl\|CDD\|213767	TIGR03070, couple_hipB, transcriptional regulator, y4mF family. Members of this family belong to a clade of helix-turn-helix DNA-binding proteins, among the larger family pfam01381 (HTH_3; Helix-turn-helix). Members are similar in sequence to the HipB protein of E. coli. Genes for members of the seed alignment for this protein family were found to be closely linked to genes encoding proteins related to HipA. The HibBA operon appears to have some features in common with toxin-antitoxin post-segregational killing systems. [Regulatory functions, DNA interactions].	1.34275e-08
NZ_AP021859.1\|WP_155013295.1\|123152_124235_+\|electron-transporter-RnfD	gnl\|CDD\|380087	pfam17996, CE2_N, Carbohydrate esterase 2 N-terminal. This is the N-terminal beta-sheet domain with jelly roll topology found in CE2 acetyl-esterase from the bacterium Clostridium thermocellum. This enzyme displays dual activities, it catalyses the deacetylation of plant polysaccharides and also potentiates the activity of its appended cellulase catalytic module through its noncatalytic cellulose binding function. This N-terminal jelly-roll domain appears to extend the substrate/cellulose binding cleft of the catalytic domain in C.thermocellum.	1.76163e-44
NZ_AP021859.1\|WP_155013282.1\|100416_101118_-\|FCD-domain-containing-protein	gnl\|CDD\|224715	COG1802, GntR, Transcriptional regulators [Transcription].	2.50022e-39
NZ_AP021859.1\|WP_155013294.1\|120562_123034_+\|glycoside-hydrolase-family-9-protein	gnl\|CDD\|199881	cd02850, E_set_Cellulase_N, N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.	4.7333e-18
NZ_AP021859.1\|WP_162359724.1\|118444_119167_+\|PEP-CTERM-sorting-domain-containing-protein	gnl\|CDD\|377877	pfam07589, VPEP, PEP-CTERM motif. This motif has been identified in a wide range of bacteria at their C-terminus. It has been suggested that this is a protein sorting signal. Based on phylogenetic profiling it has been suggested that the EpsH family of proteins mediate this function.	0.000871202
NZ_AP021859.1\|WP_073324056.1\|108406_109432_+\|AraC-family-transcriptional-regulator	gnl\|CDD\|378890	pfam12625, Arabinose_bd, Arabinose-binding domain of AraC transcription regulator, N-term. AraC is a bacterial transcriptional regulatory protein with a DNA-binding domain at the C-terminus, HTH_AraC, pfam00165, and this dimerization domain which harbours the arabinose-binding pocket at the N-terminus. AraC positively and negatively regulates expression of the proteins required for the uptake and catabolism of the sugar L-arabinose 1,2,3].	3.34883e-30
NZ_AP021859.1\|WP_162359727.1\|124231_124978_-\|CPBP-family-intramembrane-metalloprotease	gnl\|CDD\|376806	pfam02517, Abi, CAAX protease self-immunity. Members of this family are probably proteases (after a isoprenyl group is attached to the Cys residue in the C-terminal CAAX motif of a protein to attach it to the membrane, the AAX tripeptide being removed by one of the CAAX prenyl proteases). The family contains the CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidine residues that may be involved in zinc binding. While they are involved in membrane anchoring of proteins in eukaryotes, little is known about their function in prokaryotes. In some known bacteriocin loci, Abi genes have been found downstream of bacteriocin structural genes where they are probably involved in self-immunity. Investigation of the bacteriocin-like loci in the Gram positive bacteria locus from Lactobacillus sakei 23K confirmed that the bacteriocin-like genes (sak23Kalphabeta) exhibited antimicrobial activity when expressed in a heterologous host and that the associated Abi gene (sak23Ki) conferred immunity against the cognate bacteriocin. Interestingly, the immunity genes from three similar systems conferred a high degree of cross-immunity against each other's bacteriocins, suggesting the recognition of a common receptor. Site-directed mutagenesis demonstrated that the conserved motifs constituting the putative proteolytic active site of the Abi proteins are essential for the immunity function of Sak23Ki - thus a new concept in self-immunity.	6.52075e-10
NZ_AP021859.1\|WP_155013285.1\|103501_105223_-\|tannase/feruloyl-esterase-family-alpha/beta-hydrolase	gnl\|CDD\|284851	pfam07519, Tannase, Tannase and feruloyl esterase. This family includes fungal tannase and feruloyl esterase. It also includes several bacterial homologs of unknown function.	5.32912e-121
NZ_AP021859.1\|WP_155013288.1\|113264_114737_+\|AMP-binding-protein	gnl\|CDD\|181644	PRK09088, PRK09088, acyl-CoA synthetase; Validated.	0

>NZ_AP021859.1|WP_155013288.1|113264_114737_+|AMP-binding-protein
MSFLSVQAKLRPFKQAVRDLTSGRSWTYLEWDTFVNKCSQWLSLQGLACGDRLVCIAKNCAELVALHFACEQSGVIFVPLNWRLSSDELHTLIADCTPQLIVGDEMANQLELAYFELKTLNNAVAELNGEIQPRNHRALPSLILYTSGTTGKPKGVMHSYDTIMETTLNMALLGQVDEYSTFLCETPMFHVIGLISCVRPALYQGGKILISDGFKPSRTLARLTDKNLLITHYFCVPQMANSLRQEPKFNPSSLYNLKALLTGGAPHPAVQIRQWLNDDIPIVDGYGMSEAGTVFGMPFDIATIDVKAGCVGIPTHRLEVRLADTEGQPVADGVAGEIQLKGLNLFIGIWRQPALFKACFTDDGWFKTGDVAIRDNDGFYFIVDRIKDMFISGGENVYPTEIESVVLKLDSVLECALVGVPDDRWGEVGCLFVVAKSKHSTIEQQEIMDELEQCLAKYKLPKYIQFVDSLPRNGGGKVMKHRLKAMFHSD
>NZ_AP021859.1|WP_073324062.1|112417_113254_+|p-hydroxycinnamoyl-CoA-hydratase/lyase
MSAEREEDTVAVKVENQIAWVSFNRPEKRNCMSPKLNRQMMRVLDDLEFREDVSVLVLTGEGSAWSAGMDLKEYFRETEAQGLGGTRQAQRESYGWWRRLRWYQKATIAMVNGWCFGGGYGPLFACDLAFAAEEAQFGLSEINWGILPGGGAAKVVAELMPLRKAMYHAMMGENIDGKTAEEWGLVNEAVPAEQLRARVTDVANVLLQKNQVALKATKDAVRRVKEMTYDNAEDYLVRAQEAANSFDNHGRKEGIKQFIDDKTYKPGLGAYDKSKQKS
>NZ_AP021859.1|WP_073324060.1|110983_112381_+|aldehyde-dehydrogenase
MNFERKNPLTGEVASQSIAMQAHEMAGIAERAQQGFEQWSAYGPNARRAILNKAAAALESRQNDFVEAMMTEVGATAGWAMFNLGLAVSMMREAASLTTQIGGETIPSDKPGCLAMAIRQPVGVVLGIAPWNAPIILGVRAISTALACGNAVILKASELCPRTHSLIIEALETAGFPDGTVNIVTNSPADAGEVVGALIDQPLVKRINFTGSTEVGRIIAQRAGANLKPVLLELGGKAPMLVLDDADLDEAVKAAAFGAFMNQGQICMSTERLVVDESVADSFAAKFAQKVSGMATGDPREGNTPLGAVVDQKTVSKVNALIDDAVAKGAKIIAGQKSDSVLMAATVIDHVTDDMKIYREESFGPVVAIIRAKDEADAIRIANDSEYGLSAAVFTKDSARGLRVARQIQSGICHVNGSTVHDEAQMPFGGVGASGYGRFGGKAGIDQFTELRWITIETEKGHFPI
>NZ_AP021859.1|WP_155013287.1|109619_110942_+|MFS-transporter
MSTNPKQQIDQSEIKGFQLFVILMCILLNALDGFDVLAISFASPGIASDWNVSRGALGIVLSMELVGMAIGSISLGNLADRLGRRPTILLCLSLMTIGMGACAFVNSLNYLLVFRFLTGLGVGGMLASTNALVAEFANAKYRNLAVILMATGYPIGAIVGGYISTELLALYNWKVIFVFGGAVTGSFLVICWLLLPESIDFLASRQPQNALQRINKILKRIGHNPIASLPKLENQQASSGFSTLITNNLRAVTGLLVIAYFAQIMTFYYILKWIPKIVVDMGYEPTSAGTVLVWANVGGAVGSLIFGVIASRLKLRPLLIAIMLCAFVMVSVFGLGPQTLLQLSVVSAATGFFTNSAVVGLYALMAQSFPAEVRASGTGVVIGIGRGGAALGPIVAGYLFQSGYGLFDVSVAMGFGAVVAAIAIFSLGPVLKKYQLSSQV
>NZ_AP021859.1|WP_073324056.1|108406_109432_+|AraC-family-transcriptional-regulator
MPGKYEVGTAANHYISQLYHEALKNKLDVTSMLNTLGLTEEVFDKPELRVKTEKLATFQNLIWQAMQDESMGLGASPVPAGSYFMMGRLTVNQPTLHKALNLAVRFYGMVTKAFTINLTVDGDTAFLGFKLHSPERDPQHMFAEILLLAMHRYASWLIADSLPLIECYFDYPTPAHISEYSYLFPGGHTFESDKLGFAFPARYLKRDVKQNDASLKLFMKRCPQEIFQRYEADYSLTTELQRLLWKNLKGGVPSIEAAAAMMNMTKRTMMRKLKSEGTSYQQLKDQVRLDKAVTLLTKYNLPINQISESVGFSEPAVFTRAFLNWTGDSPSHFREKNAVED
>NZ_AP021859.1|WP_155013286.1|105369_107844_-|TonB-dependent-receptor
MCLFTPYLFTKSAFQIFKDQAFNNTQGDRKSANFFKRSLVATSIGLSFLSSTAIAQQTDTAPEAESEVQLEVITVNARRRAESMQETPIAVSAFSVKELERRGIENTQDLDRVTPSLQFATSGQLSGNNSAAVVFIRGVGQLDPTSSVDPGVGIYVDDVYMGRSAGGAMDFKDIQSVEVLRGPQGTLFGRNTIGGAVLVKTAEPSDVFGGKARLRIGDDNLREAFVAVDLPITTDLLSRFSLGTRKRDGYVTRVYDGQDLGNDDTYSVNGTIQYTPSDTFKITLKGDFTKEDENGSPFVFAGVNESAPVAAIVSVAAGCPGATIPFAPLAPGDAGFGAPNVPNIDDERCANDFQHKGEFTNGGTAPVESTLKGWGLSAAMEWEYSKTITLKSISAFRSTEWTGIRDADNTPFDMLTTDVTSDSEQFSQEFQLIYDNDKVSGITGLYYFDETSDDKLSILLAFPPSPPVIGSLLNGGPGTRDYQVINLETESFAVFSEWAYELSNDWSISAGLRYTEDDKGFQGAIMNLFPATQPDPTTLPTKATSEGGPLFIFNTPFADTYSATTGSASVRYKVQENINTYLSYSSSFKSGGFNSRYNAPTPGNLPISFGEEEVSSWEIGVKADITNDFRVNAAAFMSEYSDIQLIFRQGVVPLLFNAGSASIDGVELEFTYIPTNSLLIEGGFSYLRDKIDSITEVNGAQATITPDNSLPLTPEWQGNLGASYSTELGNNYELTTRLDVSYTASQYFDSSNTDIVAQNDGVTYVTASVKLDDLVNYWDLTFGVNNLTDERYIEQGNASLATLGYAEVIYARPRNWFLSFSTEF
>NZ_AP021859.1|WP_155013285.1|103501_105223_-|tannase/feruloyl-esterase-family-alpha/beta-hydrolase
MTKHNIFHAFTLAALSSTLLACNSSNNDISPVEIPQLSPATAANLSGNCNDLAASMSALANTTITSSSEVASGELMVAGKDIPAHCLVTGSMFERVSDIDGNVYAIRFEMRLPLNWNGRFYHQGNGGIDGSVVTAVGDAGPGNLSNALYQGFAVLSSDAGHSGALGPAFGVDPIARLDYGYKAVEKLTPMAKELISIAYGKGPDRSYFGGCSNGGRHTFNTLARMPDEYDGYLAGAPGFRLPYAAIANIFGAQRYFSVATDPSDISTGFTAEERNMVATAALVKCDDLDGINDGLIGDVEACQSVFSLDDVPSCSSERDGTCLSSQQKEALSPIFSGAVTASGEAFYAPFPFDTGIASPDYNFWDFFAPLVLDSGGVGLIWGVPVADPATFNGPEFALTGSIDDMLTSIESTDDVYTEAASSFMIPPNNAEALSAVRDRGAKIMVYHGVSDAIFSALDTINWYNNLTANHNDDASDFARLYLMPGMGHCSGGAAVDQVDLLTPLVAWVESGIEPEGLVATARGAGNPGGENPAIPASWAADRTRPLCAYPTVARYNADAGNGDVESAESFSCQ
>NZ_AP021859.1|WP_155013284.1|102456_103407_+|2Fe-2S-iron-sulfur-cluster-binding-domain-containing-protein
MFEVIVTNKQPLTASVCRLQLAAVDDSALPAWQAGAHIDVHLPNGIIRQYSLCGGVDTKHYEIAILNEPNSRGGSKYIHDQLQQGDVLTISAPKNLFPLVQGTHKTLLIAAGIGITPMLAMAEQLHAEDTPFELHYCAREQQHAAYYDRISNSDYARNCHFHFSLGNSQNRLNPYRLLADYNQDTQLYICGPNLFIQDVISAAEQHGWPTANIHREFFAAEAIDHSQDQRFEVVINSTGQVLQVAEDVSILNVLEDNGMFIPVACEEGVCGTCLTGLLEGEADHKDVFLSADEKQKMNQITPCCSRAKSKRLVLDL
>NZ_AP021859.1|WP_155013283.1|101386_102469_+|Rieske-2Fe-2S-domain-containing-protein
MHPQSYPLNTWYVAATPDEISDKPFARQICSIKLVFFRNSQQKIVAVEDFCPHRGAPLSLGFVENGQLVCGYHGLRMGDDGKTQSMPNQRVAHFPCIKHFAVIERHGFVWIWPGDQTLADDSLIPELHWANNPDWGYGGGLYHIKCDYRLMIDNLMDLTHETYVHASSIGQKEIDESPVSTKMEGQTVVTSRFMDNVMAPPFWQAALRANDLADDVAVDRWQICRFSLPSHIMIEVGVAHAGKGGYDAPKSHKASSIVVDFITPETEHSIWYFWGMARDFKPEDQALTQTIQQGQGAIFAEDLEVLERQQRNLLDYPDRSLLKLDIDAGGVQARRMIERVIKQEQAASASTNAGEKQCSK
>NZ_AP021859.1|WP_155013282.1|100416_101118_-|FCD-domain-containing-protein
MPREGQAVISILRDKIVSGVFPAGERLAEIPTAELLGVSRTPVRIAFRALAQEGLLIKLPRRGYQVRKVTNDEILGAVEVRGVLEGLAARQAAEKGLTEDTRVQLAECLKNADAIFEKGYLTEEDIEQYNVINKQFHDLIINASGNPAIQSAMQLNEHLPFASVNALVFNPKQLDREFRRFNFANMQHHVVFDALLKRQGARAEAVMKEHAHATLSQVDLCESPDSRTCNPSK
>NZ_AP021859.1|WP_155013289.1|115138_115423_+|transcriptional-regulator
MIFSINNTKQLGKAAQLTRKVQGLDQFAAASMSENGITFLSEFENGKQTVELGRVLRVLSTLGIKVTIDIPVDEASLTPKQQQQLAKIINEANL
>NZ_AP021859.1|WP_155013290.1|115419_116688_+|HipA-domain-containing-protein
MKRALDVYIDKTQVGKLTDENNIWAFEYTTGWLTSQHRHPLSPHITLEAGKQIDGSSFRPVQWFFDNLLPEEKARELLARHVKVPVEDAFQLLKEAGAESAGAITLMPEGEEVAPGTVHKLTYEEVNQRILNLPQVPLNRAERKRMSVAGAQHKMLVIYRHGELLEPSGFFPSTHILKPQHSSPEVYYHTVRNEWFVMTLAGLCGLEVPPVDIRYLPEPVYLIERFDRAGEYPHQHRRHVLDGCQLLNLGPHMKYPNSNAGSLNKLAELTRMKARTKIDIFRWALFNALVGNGDAHLKNLSFFINKEDVVMTPHYDLLSTAIYEAPHKHMDHQLSQQMGDAQYLGQLTVPNILAFAEELQLPTKLAKRELDRLIGKIEQEAIPLMQQVQDAPAHPGKGGEIRMIKEIYYNCIKEMVTRLTKA
>NZ_AP021859.1|WP_155013291.1|116777_117206_+|DUF3010-family-protein
MRVCGVELAGNDANIALLNLENDLIQIPDCRTRKLSLQKAATAHELKYFQKSFAQLVQDYKIETIVIRQRPMKGKFAGGAIGFKLEAALELLNGVQVIVMPPTEIKAALKENHMFIEFGDTGLKGFQKSAFETALAYISKHL
>NZ_AP021859.1|WP_155013292.1|117299_117662_+|hypothetical-protein
MLKRTAILAPLFFSAHLSAQTWIEDAQGQRYLLGDQLVSTTTEAYNYCTSKGLTPANVAQMRRALRQGKVTVDFVSVPVSEDAKTPFIKDKHWIAKHEQGRIRVRNSSSFDSALPLCAGR
>NZ_AP021859.1|WP_162359724.1|118444_119167_+|PEP-CTERM-sorting-domain-containing-protein
MNNVVKGLGVLLTLASMTANAVLINDNSFAQAGFKDTSTGLVWMDFGINNGQSYNHVSSQLGAGGDYSGWRLPSAEEVYMLWDHVANLDEVEADFESPDYYGAGQLYAWDYNSRVVGGDDSVWDNIFNIMGFNSASGTDYMERSSAIGFFMGHHGLASVKFHDAIDKVGFPAFTHKDEVALRDDGAYSDFFLGLAHENYSTMLVRTATVPEPGAFVAFATAIVALSWRRRQGRFGRKTRL
>NZ_AP021859.1|WP_073324076.1|119339_119885_-|hypothetical-protein
MKYQKQLDRLNSGTMSRHELAVMKKNAKALVEKGDSDAVAILDAIDYSKPADDYILFMGFCPGADFSQRLDIEWKKHGICRFDYLESESQLNRWNTLCAGDLVILKKREKFGESMKLYGYGRIKRIAYDEENTRYFEMDWSAQEQEIEVPLMGCNSTVDVKSMLEVEKQMPDNFWQWLNKE
>NZ_AP021859.1|WP_155013294.1|120562_123034_+|glycoside-hydrolase-family-9-protein
MIKSVLYSAIASALVTAPIVTSAAVPALNDKAYFSQPSLDIVVFSNWYNGLFGDSKISGVELVHFGERIATNGDVRLSATPEQWDPIPTFVERKVDNNTNTISATLSYPEFDFTYTISASPIENGVEITLSSPRPVPAELVGKAGFNMEFLPANYMETSFLADGKPGTFPLYPTGVKEIIGQHEPAPLASAKQLVLAPESDTKRVMIESSAPLTLYDGRAKAQNGWYVVRGVLPENKQGELLTWRITASTDDAWLRDPMIAHSQVGYLPGQTKRAVIELDKHAPVNGMAELLKVNADGSKAVVKKAAPGKVEDYTRYQYATFDFTDVTEPGLYQLRYKGTTTASFPIAEHVLDAAWYPTLDHYFPVQMDHVLVNEAYRVWHGASHLDDALQAPVNHEHFDLYAQGPTTDTQYKPGEHIPGLNVGGWYDAGDYDIRTQTQYRTVRFLVQAFEEFGIDRDTTLVDYDRKYVDIHVPDGKPDLLQQIEHGTLALLAQFKAVGHAIPGIIVPDISQYTHLGDGLTMTDNLIYNASMADTESNGIESGVFDDRWAFTSKSTPLNYGSMAALAAASRTLQGYKPVLAKESLDTAIAAWASEADKQPDLFRVGNTTGGGLEEEKLKAAVELLVTTGDTQYKHAVTALLPHIEEHFGRSAVLAVRALPFMDNAYKKRIRAAAEAYKPKLEAITSKNPFGVVITEYGWAGNGTVLDMAVTQYYLHQAYPDLYSSDLIYRSLDYLYGTHPDSDISFVSNVGTVSKKVAYGMNRADYSFISGAIVPGVLILKPDLPENMENWPFLWGENEYVIDLGASYLFTVNAALKLAGRQP
>NZ_AP021859.1|WP_155013295.1|123152_124235_+|electron-transporter-RnfD
MLNRFITLCALVCLHVSVAAFAKVVPATDAGYVYTGRIDFANASAPYLTWPGSSVKARFSGESLSVTLKDDNGKNYYNVIVDGNDAFPFVIEAKQGEHTYWISNTLGAGEHTVEIYKRTEGEEGGTHFLGISIDDDAALLAPPGRPTRRIEIYGDSISSGMGNVAPYNGPDNLPRDKNHYLSYGAIAARTLGAELHTISQSGIGIMVSWFNFIMPQFYDQLSAVGNNDSQWDFSTWTPQVVVINLMQNDSWLVPDPKRISPTPTEPQIIAAYQAFVKSVRAEYPNAQIICALGSMDATKAGSPWPGYVEAAVANLTIEGDSRLSTVVFPFNGYGQHPRVNQHTSNAELLTQAIQQVTGWR
>NZ_AP021859.1|WP_162359727.1|124231_124978_-|CPBP-family-intramembrane-metalloprotease
MEQGKAVSSIIILSVFSAAIFATRFLQPQITAPYVKYAVPYLCWSLAILLAGVLLRRKGPLLKVVGITHQPLIGVAAALLFSLPMLIGFSVFFEFATPSMTTLLTKSLIPGFFEELFFRGFLVGSLIAIAGWRFLPAALIGAVIFGMGHWFQGATLVQAATAALFTAIGGLWFAWLFYRWGHNLWIVITLHTLMNAYWVLWQVDSTAIGGQAANLCRMGTIVLSIAGTEWWVRRSRKRSLTPADSPET
>NZ_AP021859.1|WP_155013297.1|125097_125910_-|TauD/TfdA-family-dioxygenase
MASDTITITPLTRNIGAEIGNIDLTKPITAEVEDQLKAAIAEHQVIFFRDQQITHEQHMAVGQIFGDLIVHPGAKGIDGYEKIVAIHADKDSKYIAGDNWHSDLSCNELPPMGSMLYIHTLPEVGGDTLFSSMYAAYDALSPAMQQYLEGLQAEHDANHVYHAIYGDYGTAYPCNVHPVVRTHPVTGKKAIFVNASYTTRILGVSKNESDGILAMLYELAKDPNFQVRFSWQPHSIAIWDNRCTQHFAVWDYFPDTRSGYRVTIGGDKPY

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Crispr_ID: NZ_AP021859_2

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_AP021859_2

1657863-1658090

Orphan

Consensus_repeat	Method
AGTGGTGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTA	PILER-CR

2 spacers

The CRISPR arrays of NZ_AP021859_2

>merge|NZ_AP021859|2|1657863-1658090|PILER-CR
AGTGGCGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTAGTTTAGAAATAATGAGAGGGGAAGTGGTGGACGTTTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTTAGTTTAGAAATAATAAGAGTGGGAGTGGTGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTCTTTTTTA

>NZ_AP021859|2|2|1657863-1658090|PILER-CR
AGTGGCGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTA	GTTTAGAAATAATGAGAGGGGA
AGTGGTGGACGTTTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTGTTTTTTT	AGTTTAGAAATAATAAGAGTGGG
AGTGGTGGACGTCTGCTGTGGCTCGCAACGAAGTTGTTGAACCAATGCCGTGTCTTTTTTA

Protein	Signature genes	Signature genes Name	Protein_function
NZ_AP021859.1\|WP_155014257.1\|1648706_1650491_-\|ATP-binding-cassette-domain-containing-protein	unknown	unknown	gnl\|CDD\|227590
NZ_AP021859.1\|WP_155014266.1\|1665355_1665772_-\|hemin-receptor	unknown	unknown	gnl\|CDD\|381269
NZ_AP021859.1\|WP_155014259.1\|1652216_1654556_-\|diguanylate-cyclase	unknown	unknown	gnl\|CDD\|143635
NZ_AP021859.1\|WP_073322593.1\|1667415_1668354_+\|tRNA-dihydrouridine(16)-synthase-DusC	unknown	unknown	gnl\|CDD\|236713
NZ_AP021859.1\|WP_155016569.1\|1651636_1652182_+\|GNAT-family-N-acetyltransferase	unknown	unknown	gnl\|CDD\|379112
NZ_AP021859.1\|WP_155014260.1\|1654770_1656405_-\|response-regulator	unknown	unknown	gnl\|CDD\|381123
NZ_AP021859.1\|WP_155016570.1\|1661311_1662256_+\|TRAP-transporter-substrate-binding-protein-DctP	unknown	unknown	gnl\|CDD\|270387
NZ_AP021859.1\|WP_155014262.1\|1658616_1660026_+\|glutamate--tRNA-ligase	unknown	unknown	gnl\|CDD\|234953
NZ_AP021859.1\|WP_155014255.1\|1645978_1646731_+\|ATP-binding-cassette-domain-containing-protein	unknown	unknown	gnl\|CDD\|224045
NZ_AP021859.1\|WP_155014256.1\|1648144_1648480_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155014267.1\|1666123_1667044_+\|DUF2817-domain-containing-protein	unknown	unknown	gnl\|CDD\|349450
NZ_AP021859.1\|WP_155014264.1\|1662797_1664069_+\|TRAP-transporter-large-permease-subunit	unknown	unknown	gnl\|CDD\|224509
NZ_AP021859.1\|WP_155014263.1\|1662255_1662801_+\|TRAP-transporter-small-permease-subunit	unknown	unknown	gnl\|CDD\|225632
NZ_AP021859.1\|WP_155014258.1\|1650662_1651628_+\|GTP-3',8-cyclase-MoaA	unknown	unknown	gnl\|CDD\|234672
NZ_AP021859.1\|WP_155014265.1\|1664083_1665292_+\|D-galactonate-dehydratase-family-protein	unknown	unknown	gnl\|CDD\|237901
NZ_AP021859.1\|WP_155523163.1\|1667129_1667303_+\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155014268.1\|1668521_1669940_+\|amidohydrolase-family-protein	unknown	unknown	gnl\|CDD\|238634
NZ_AP021859.1\|WP_155014261.1\|1656658_1657708_+\|mechanosensitive-ion-channel	unknown	unknown	gnl\|CDD\|366370
NZ_AP021859.1\|WP_073322635.1\|1646788_1648132_+\|3-deoxy-7-phosphoheptulonate-synthase-class-II	unknown	unknown	gnl\|CDD\|366662
NZ_AP021859.1\|WP_073322631.1\|1648491_1648707_-\|DUF3820-family-protein	unknown	unknown	gnl\|CDD\|378970

Protein	Function_ID	Function_description	E-value
NZ_AP021859.1\|WP_155014257.1\|1648706_1650491_-\|ATP-binding-cassette-domain-containing-protein	gnl\|CDD\|227590	COG5265, ATM1, ABC-type transport system involved in Fe-S cluster assembly, permease and ATPase components [Posttranslational modification, protein turnover, chaperones].	0
NZ_AP021859.1\|WP_155014266.1\|1665355_1665772_-\|hemin-receptor	gnl\|CDD\|381269	cd12131, HGbI-like, Hell's gate globin I (HGbI) from Methylacidophilum infernorum and related proteins. HGbI is a single-domain heme-containing protein isolated from Methylacidiphilum infernorum, an aerobic acidophilic and thermophilic methanotroph. M. infernorum grows optimally at pH 2.0 and 60C and its home is New Zealand's Hell's Gate geothermal park. The physiological role of HGbI has yet to be determined. It has an extremely strong resistance to auto-oxidation, and has fast oxygen-binding/slow release characteristics. Its CO on-rate is comparable to the O2 on-rate, and it is able to bind acetate with high affinity in the ferric state. The coordination of the heme iron changes in the ferrous form from pentacoordinate at low pH to predominantly hexacoordinate at high pH; in the ferric form, it is predominantly hexacoordinate at all pH.	1.15367e-69
NZ_AP021859.1\|WP_155016570.1\|1661311_1662256_+\|TRAP-transporter-substrate-binding-protein-DctP	gnl\|CDD\|270387	cd13669, PBP2_TRAP_TM0322_like, Periplasmic component of TRAP-type C4-dicarboxylate transport system TM0322 from Thermotoga maritima and similar proteins; the type 2 periplasmic binding protein fold. This subgroup includes the hyperthermophilic bacterium Thermotoga maritima TRAP-type C4-dicarboxylate transport system TM0322 and its closely related proteins. TRAP transporters are a large family of solute transporters ubiquitously found in bacteria and archaea. They are comprised of a periplasmic substrate-binding protein (SBP; often called the P subunit) and two unequally sized integral membrane components: a large transmembrane subunit involved in the translocation process (the M subunit) and a smaller membrane of unknown function (the Q subunit). The driving force of TRAP transporters is provided by electrochemical ion gradients (either protons or sodium ions) across the cytoplasmic membrane, rather than ATP hydrolysis. This substrate-binding domain belongs to the type 2 periplasmic binding fold protein superfamily (PBP2). The PBP2 proteins are typically comprised of two globular subdomains connected by a flexible hinge and bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap.	3.3657e-128
NZ_AP021859.1\|WP_073322593.1\|1667415_1668354_+\|tRNA-dihydrouridine(16)-synthase-DusC	gnl\|CDD\|236713	PRK10550, PRK10550, tRNA dihydrouridine(16) synthase DusC.	3.96957e-139
NZ_AP021859.1\|WP_155016569.1\|1651636_1652182_+\|GNAT-family-N-acetyltransferase	gnl\|CDD\|379112	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain. This domain catalyzes N-acetyltransferase reactions.	9.16881e-27
NZ_AP021859.1\|WP_155014260.1\|1654770_1656405_-\|response-regulator	gnl\|CDD\|381123	cd17589, REC_TPR, phosphoacceptor receiver (REC) domain of uncharacterized tetratricopeptide repeat (TPR)-containing response regulators. Response regulators share the common phosphoacceptor REC domain and different output domains. This subfamily contains uncharacterized response regulators with TPR repeats as the effector or output domain, which might contain between 3 to 16 TPR repeats (each about 34 amino acids). TPR-containing proteins occur in all domains of life and the abundance of TPR-containing proteins in a bacterial proteome is not indicative of virulence. REC domains function as phosphorylation-mediated switches within response regulators, but some also transfer phosphoryl groups in multistep phosphorelays. Some members in this subfamily may contain inactive REC domains lacking canonical metal-binding and active site residues.	1.46697e-37
NZ_AP021859.1\|WP_155014262.1\|1658616_1660026_+\|glutamate--tRNA-ligase	gnl\|CDD\|234953	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed.	0
NZ_AP021859.1\|WP_155014255.1\|1645978_1646731_+\|ATP-binding-cassette-domain-containing-protein	gnl\|CDD\|224045	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism].	1.12851e-47
NZ_AP021859.1\|WP_155014259.1\|1652216_1654556_-\|diguanylate-cyclase	gnl\|CDD\|143635	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain. Diguanylate-cyclase (DGC) or GGDEF domain: Originally named after a conserved residue pattern, and initially described as a domain of unknown function 1 (DUF1). This domain is widely present in bacteria, linked to a wide range of non-homologous domains in a variety of cell signaling proteins. The domain shows homology to the adenylyl cyclase catalytic domain. This correlates with the functional information available on two GGDEF-containing proteins, namely diguanylate cyclase and phosphodiesterase A of Acetobacter xylinum, both of which regulate the turnover of cyclic diguanosine monophosphate. Together with the EAL domain, GGDEF might be involved in regulating cell surface adhesion in bacteria.	7.30911e-51
NZ_AP021859.1\|WP_155014267.1\|1666123_1667044_+\|DUF2817-domain-containing-protein	gnl\|CDD\|349450	cd06231, M14_REP34-like, Peptidase M14-like domain similar to rapid encystment phenotype 34 (REP34). This family includes Francisella tularensis protein rapid encystment phenotype 34 (REP34) which is a zinc-containing monomeric protein demonstrating carboxypeptidase B-like activity. REP34 possesses a novel topology with its substrate binding pocket deviating from the canonical M14 peptidases with a possible catalytic role for a conserved tyrosine and distinct S1' recognition site. Thus, REP34, identified as an active carboxypeptidase and a potential key F. tularensis effector protein, may help elucidate a mechanistic understanding of F. tularensis infection of phagocytic cells. A functionally uncharacterized subgroup of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Two major subfamilies of the M14 family, defined based on sequence and structural homology, are the A/B and N/E subfamilies. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by an N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. The A forms have slightly different specificities, with Carboxypeptidase A1 (CPA1) preferring aliphatic and small aromatic residues, and CPA2 preferring the bulky aromatic side chains. Enzymes belonging to the N/E subfamily enzymes are not produced as inactive precursors and instead rely on their substrate specificity and subcellular compartmentalization to prevent inappropriate cleavages. They contain an extra C-terminal transthyretin-like domain, thought to be involved in folding or formation of oligomers. MCPs can also be classified based on their involvement in specific physiological processes; the pancreatic MCPs participate only in alimentary digestion and include carboxypeptidase A and B (A/B subfamily), while others, namely regulatory MCPs or the N/E subfamily, are involved in more selective reactions, mainly in non-digestive tissues and fluids, acting on blood coagulation/fibrinolysis, inflammation and local anaphylaxis, pro-hormone and neuropeptide processing, cellular response and others. Another MCP subfamily, is that of succinylglutamate desuccinylase /aspartoacylase, which hydrolyzes N-acetyl-L-aspartate (NAA), and deficiency in which is the established cause of Canavan disease. Another subfamily (referred to as subfamily C) includes an exceptional type of activity in the MCP family, that of dipeptidyl-peptidase activity of gamma-glutamyl-(L)-meso-diaminopimelate peptidase I which is involved in bacterial cell wall metabolism.	1.15966e-57
NZ_AP021859.1\|WP_155014264.1\|1662797_1664069_+\|TRAP-transporter-large-permease-subunit	gnl\|CDD\|224509	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism].	8.50239e-105
NZ_AP021859.1\|WP_155014263.1\|1662255_1662801_+\|TRAP-transporter-small-permease-subunit	gnl\|CDD\|225632	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism].	6.44027e-17
NZ_AP021859.1\|WP_155014258.1\|1650662_1651628_+\|GTP-3',8-cyclase-MoaA	gnl\|CDD\|234672	PRK00164, moaA, GTP 3',8-cyclase MoaA.	5.05895e-153
NZ_AP021859.1\|WP_155014265.1\|1664083_1665292_+\|D-galactonate-dehydratase-family-protein	gnl\|CDD\|237901	PRK15072, PRK15072, D-galactonate dehydratase family protein.	0
NZ_AP021859.1\|WP_155014268.1\|1668521_1669940_+\|amidohydrolase-family-protein	gnl\|CDD\|238634	cd01309, Met_dep_hydrolase_C, Metallo-dependent hydrolases, subgroup C is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site. The vast majority of the members have a conserved metal binding site, involving four histidines and one aspartic acid residue. In the common reaction mechanism, the metal ion (or ions) deprotonate a water molecule for a nucleophilic attack on the substrate. The function of this subgroup is unknown.	7.87523e-139
NZ_AP021859.1\|WP_155014261.1\|1656658_1657708_+\|mechanosensitive-ion-channel	gnl\|CDD\|366370	pfam00924, MS_channel, Mechanosensitive ion channel. Two members of this protein family of M. jannaschii have been functionally characterized. Both proteins form mechanosensitive (MS) ion channels upon reconstitution into liposomes and functional examination by the patch-clamp technique. Therefore this family are likely to also be MS channel proteins.	3.85582e-37
NZ_AP021859.1\|WP_073322635.1\|1646788_1648132_+\|3-deoxy-7-phosphoheptulonate-synthase-class-II	gnl\|CDD\|366662	pfam01474, DAHP_synth_2, Class-II DAHP synthetase family. Members of this family are aldolase enzymes that catalyze the first step of the shikimate pathway.	0
NZ_AP021859.1\|WP_073322631.1\|1648491_1648707_-\|DUF3820-family-protein	gnl\|CDD\|378970	pfam12843, QSregVF_b, Putative quorum-sensing-regulated virulence factor. QSregVF_b is a family of short Pseudomonas proteins that are potential virulence factors. The structure of UniProtKB:Q9HY15 a secreted protein has been solved and deposited as Structure 3npd, from pfam13652. It is predicted that these two adjacent proteins form a single transcriptional unit based on the prediction that together they interact with their adjacent protein PotD, which is the putrescine-binding periplasmic protein in the polyamine uptake system comprising PotABCD. These two adjacent proteins are predicted to be quroum-sensing-regulated virulence factors.	1.92065e-36

>NZ_AP021859.1|WP_155014261.1|1656658_1657708_+|mechanosensitive-ion-channel
MEASTNELVMALNTYWQAFIVKLPAIALGLVIMVIMVTLAGRVAAALIKPVGYLSASPLLRSVSQRAISLIIILLALYIFLKLSGLTEFAVAIMSGTGVLGLILGFAFRDIAENMIASLLLTVQRPFRINDVVQINNYTGVVQKVTTRATTLVDFDGNHIQIPNAVIYKGTIKNLTANPKMRGQFTIGIGYDADCQLAQKRALSLISEHPNVLVEPEPLVLVDGLGSSTINLKVYFWIDVESTSVVKMASVLMRDLVNLFTEKGISMPDDAREIVFPEGVPVLVQQGTEEQQYTDPDQSVRQNEQALTNRAEPNVESQSFLDDVSSENDDIRRQADEARAPEQGTNILS
>NZ_AP021859.1|WP_155014260.1|1654770_1656405_-|response-regulator
MNLKLARQTRVLVIDDQVLAKGYLKYSLEELGFQNIEYADRASNALSAIRRHHYDLIVCSYDLKNEQDGYFLYDQLKEHNELPPSTAFVFISADTTADIVHSIVELQPDDFLAKPFTVRELDRRLGRLLTRKKALKPVYQLIEQSTLDKALLELENFLTEPKNSEFFPLALKLKGELLLACGHYEEAREFYQAIINVQNFTWAQLGLIKSYISLDQDEEAEKLVIELALKQDSMLAAYDLLAALQIKHHEFEDALESVEVASAISPRNLHRHAKALDLSRLTHDYESQFEAAKKIVKIAKNSIHDKPEIYLNVARSGIDFAMTAEEEHTKRLIKQSTEYLKQLKSNFPKADIDDQLKVIDARMLYLEDEVDNARALLNQLNNDAWETESIEGLLDKAKAFHEVGIQDHALKILDMIERRCNNDPAQSDLFLQYVQQEKVEKTEIKLSPKALNNNAVIQYQRGDLQKALETFRQAFTIMPKNPSIALNLLQATAINLKESSSDAAKESLNSTLIHNCLKTIESGRLSEEQEQRYQRVKSVLKDLT
>NZ_AP021859.1|WP_155014259.1|1652216_1654556_-|diguanylate-cyclase
MNQALIEKYRLHRQEYETQLTQYVQSIAELFDVPVAFLGLSDNETLWIKAKCGTEATEAPLKQAICHLVIDSNDLIYIEDTQKDPRTQKMPAVAGSPFVRFYAGVPVKINNDLVGSFCIIDVKPRQLSSKELRALHNFGTHLGQHCELIFKHLSSEDEHELLNNSPAALIRWQVRPSLSVSYLSPNLTKLLGVALPEDASLFKLEDVIHHDDTDHFLFTVRNHQQGAEICECDFRVKVPGNRSVWYKLVSRAIFDNENLIAIQGLLLNNTEQKYLENRILDTNERMRLLLEASGLGTSDWDIENDTLRVNNRICHMLKLHPDSVDTQSMFWMQLVHPADKDRLISQIGNSLKEPNGVVDIEYRLRNAEGQYLWIETYGKVVTRNEMGRATRFAATHRNITEKKLAELHADKQRRLLGFINHAQNLFVAQKDLQLACEQIFPELIDLAESAYGFIGKMETEDGVPCLQIYAISDVSWSESSKAEYKKFKKGELRFTNLSNLFGHVVTSNAPVIANKPMMHNASKGTPAGHPVLKRFLGLPIQRENKVVGMIGLANKLEDYSQKDVEFLTPLTETLAYLFKAVEVEQARYEAEERLSYLAATDPLTGLMNRRAYFDKIQQFAAENNEPHCLAIVDIDNFKTLNDTYGHPVGDQVLRAVAKVMQHNIRNNDMVSRLGGEEFGIYIDTKDNAVCHQILQAILEEIKQLSFDTDQGQLNNVTVSIGATMFKKGPHAVDTTYFDIAMKQADQALYEVKRNGKASINWFSPSNALKKDGLITGLKS
>NZ_AP021859.1|WP_155016569.1|1651636_1652182_+|GNAT-family-N-acetyltransferase
MEQPESERLTYRLLDETDGEFLWELDQCEAVMRFINGGRKTSRKETNDIFIPRMLSYRNAPAGWGLWQASLKDGDKTALGWILVRPHGYFTEQRDDSVIELGWRFKQDWWGKGYATEAARAVMTYMQTLGARSFSAIALPENTASIHIMKKLGMTFSHTEHYQDQVFNDDIVVYHTDATKP
>NZ_AP021859.1|WP_155014258.1|1650662_1651628_+|GTP-3',8-cyclase-MoaA
MLQDTFGRRFYYLRLSITDVCNFRCQYCLPDGYEGQAQTFLSVNEIDTLVKGFAGMGTCKIRVTGGEPTLRKDVSDIVAACAATPGIQKVAMTTHGGKLASHAKSLADAGLSQVNISLDSLDPAQFALISGQDKLNAVLDGVNEALIQGMSVKVNTVLLRPFAESQLSTFLDWLKDMPVTVRFIELMETGQHKAFFKSHHTSAEPFLTRLKAAGWEVLQRGADAGPAVELQHPDYAGRFGFIMPYSSDFCSSCNRLRVTALGKLHLCLFSDNGLSLRDYLLRGDVSGLQGFITEKLADKKVSHYLQDGVTGITKNLSMLGG
>NZ_AP021859.1|WP_155014257.1|1648706_1650491_-|ATP-binding-cassette-domain-containing-protein
MRPTRLPVNPDTPMQWHVFARLWPYLLEFKQRVALALLCLVAAKIASIGLPFVLKHTVDNLNQEPIATLAVPIALVVAYGTLRLINVLLGEVRDTLFGRVTERAMRRIGLEVFEHLHRLDLTFHLSRQTGGLSRDIERGTSGISFLMRFMVFNIGPTLLEIALVVGVLLTQYGMSFAMIILCSVVAYVWFSMKATDWRTEFVRQANLADSTSNTRAIDSLLNYETVKYFNNEQYEARQYDTNLANWEQARRKNRLSLFALNGGQAFIIAASMTFMMLLAALEVSRDNMTIGDFVLINAFTMQIFMPLNFLGFVYREIRGSLANIENLFNLLGTVPTVADAPSAGKLEINQSRIHFDNVSFYYRAERRILNHVNFTIEPGTKVAIVGESGAGKSTLVKLLFRFYDPVSGAVRIDGQDISTVTQHSLRQHIGIVPQDTVLFNDTIGENIRYGRPDATEQDIQQAIRLAHLEQFIASLPDGLNTQVGERGLKLSGGEKQRVAIARAILKRPAIMVFDEATSSLDSQSEQAILSALREVAEGHTSMVIAHRLSTIIDADKILVMQQGQIVEQGTHSELLSQQGVYAGLWQAQQKQASD
>NZ_AP021859.1|WP_073322631.1|1648491_1648707_-|DUF3820-family-protein
MDPQQLKLTINQIMPFGKYAGRKLIHLPEPYLVWFAKQGFPEGKLGQQLALMYEIKLNGMESMLTPLIDND
>NZ_AP021859.1|WP_155014256.1|1648144_1648480_-|hypothetical-protein
MVYQRKRMQQQRIDQQSQQLHLCIAKKLLANPDMMEAVTARLHQRYQDKLMGYGSYLHWQAILAEFPHPEHFIAAITASDSTTTRLRRATIFTGVLNEKERSDCLAAPSQQ
>NZ_AP021859.1|WP_073322635.1|1646788_1648132_+|3-deoxy-7-phosphoheptulonate-synthase-class-II
MNNWQPDSWRKKPILQQPEYDDKAELAQVEKTLSSYPPLVFAAEARELRRQLGQVCEGKGFLLQGGDCAESFSEFNAPKIRDTFKVLLQMAIVLTFAGRCPVTKVARMAGQYAKPRSSDFETKDGITLPSYRGDIINSFEFSEAARRPDPQRLIEAYHRSSATLNLLRAFAQGGLADLHEVNRWNMAFVENNPLKEQYQDIARRIQDSLEFMDVIGLNASNTPTLHETSLFTSHEALLLNYEEALTRIDTLTGKPYDCSAHMVWIGERTRQLDHAHIEFFRGIHNPIGVKVGPTMEEDELIRLIDALNPNNEAGRLTLITRMGADKLEANLPRLLRRVKAEGRNVVWSSDPMHGNTFSASSGYKTRNFDAILSEIRQFFAAHDAEGTYAGGIHLEMTGQHVTECTGGAYQISDDDLAEAYKTQCDPRLNADQVLEMAFLVSDHLRIK
>NZ_AP021859.1|WP_155014255.1|1645978_1646731_+|ATP-binding-cassette-domain-containing-protein
MTRKLRAEHICLTNRFKRLSIVHASSGIVCLLGANGAGKSSLLEVLAGLTPATEGEVLWGGQPTAQRSLAELAVERGYLAQKPSIQFELTGRDCLQFFNDHTQQQIPGMLIEKLGLTTLLDKVYTHMSGGEQQRIFIARTLLQVWQPLMDGNALLILDEPLQSLDIRHQHALMCWLADLGIRGNQIVMSCHDVNIANTFADTVWLARSGELLASGPVEEVMTLDNLWRTFDCHFDFLEREPRGVFVPVSV
>NZ_AP021859.1|WP_155014262.1|1658616_1660026_+|glutamate--tRNA-ligase
MAVVTRFAPSPTGYLHVGGARTALYSWLYAKSQGGEFVLRIEDTDIERSTEEAKQAILDGMQWLGLTWDRGPYYQTERFDRYKAIIQTMLEEGKAYKCFMPADELDAIREAQKERGEKPRYPGTWRDRTEHPEGQPYVIRFKNPQEGSVVFDDHVRGRIEISNSELDDLIIQRSDGTPTYNFCVVVDDWDMGITHVVRGEDHINNTPRQINILKALNAPVPEYAHVSMILGDDGKKLSKRHGAVSVMQYRDDGYLPQAVKNYLVRLGWSHGDQEIFSEQEMIELFSLDAIGQSASAFNTEKLIWLNQHYIKTLPGSEVAEHAKWHFEQLNVDLSAGPALEDVIAIQADRVKTLKELAEISLYFYKDFEDFDANAAKKHLRPVAKEPLQVVQEKLEALADWTPETIHAAINGAAESLGVGMGKVGMPLRVAATGGGNSPSLDVTLHLLPKAKVVERINKALTFIANRENS
>NZ_AP021859.1|WP_155016570.1|1661311_1662256_+|TRAP-transporter-substrate-binding-protein-DctP
MPAQATTLNVVTALSQNDPIYQGLLRFKQAVEQGSDNQIKVRLFVGSQLGNDNDILEQAMAGAPVAVLVDAGRLSFYQPEIGVLSAPYLIDNVEQLNVLVQSPMFEQWANALATQSGIKVLGFNWWQGERHVLTNKPVFTPDDLDGVRLRTIGAPVWISTIRAMGATPTPLSWAEVYSGLQQRVIDGAEAQHAGTYGARLYEVIGYVNKTRHIHLISGLVASNHWFKRLSKAHQNLVQKSALEAGEFATSLVQARQSEIEQALAAAGVEIVEPDIDAFKHATQQVYTELGYENVYQRIQQYLAEQMGVDIKESH
>NZ_AP021859.1|WP_155014263.1|1662255_1662801_+|TRAP-transporter-small-permease-subunit
MWAQIERGIAVMLLAAIVLLVLLAAILRTAGYPIIWSVDIAQLLFAWLSVIAANQALRQGSHARLDILMNRLRLINRLRLTLALNLISMSCMLVVAVFGFQLVGINPARTLGSTAIPYAWVTAALPAGAVLMLVTLLQQSVRVFHCLRKPGDTLQNPPAFLASVLTPDTHHSDKLAKESLS
>NZ_AP021859.1|WP_155014264.1|1662797_1664069_+|TRAP-transporter-large-permease-subunit
MSGVVFLILLLLGLPLAFTLIASGMVYFAQNPELPSAVAVQRMVAASQSFPLLAVPFFILAGHVMNCSGITKRLIHVSNLLVAWISGGLAHVTIVLSALMGGVSGSAIADAAMQARILGEPMQASGLTKGFSAATITVSALITACIPPSIGLILYGYMGNVSIGKLFLAGLIPGLLLTLVLMLVVYLQARKKGFAPTQANPPSFKQILEAINQSKWALLFPVLLIVTIRFGIFTPSEAGAFAVLYACVVGRFAYQELTLSDITTSLSESVSDIGMIMLIILASGVVGYAIAYEQLPVSLTLAVTQVTEQPQLILLMSLVILLVVGVVMEGTITVLLLTPILVPLMQSVGVDPVHFGILMLIMVTLGGTTPPVGIAMYAVCNILRCSTTDYVKAAVPLFTAVLALVVVLALYPPLVLFIPELLF
>NZ_AP021859.1|WP_155014265.1|1664083_1665292_+|D-galactonate-dehydratase-family-protein
MKIRDVKVIVCSPGRNFVTLKIVTDEGIYGIGDATLNGREKSVVSYLEDYIAPALIGKDPHRIEDIWQFFYRGAYWRRGPVGMTAIAAVDTALWDIKAKVAGLPLYQLLGGRSRDKIMVYTHANGADIPATLDAVGKAIEDGYKAIRVQSGIPGVKSTYGVAKEGQKYEPADADLPTESVWSTEKYLNFAPKLFAAVREQYGDDIHLLHDVHHRLSPIEAARLGKSLEPYHLFWMEDPVAAENQQGFKLIREHTTTPIAVGEVFNSIHDCQALIQNQWIDYIRSTVAHAGGITQLRRIADLASLYHIRMGCHGATDLSPVCMGAALHFDYWVPNFGIQEHMPHSELMESVFSVSYKFDDGFFTPGETPGHGVDIDEELAKKYPYKRACLPVNRLEDGTLWHW
>NZ_AP021859.1|WP_155014266.1|1665355_1665772_-|hemin-receptor
MDAKTISLVQSTFQQVVPIAGTAASLFYTKLFELDPSLKPMFKSDITEQGKKLMQMIGVAVNGLNNLDALVPAVEQLGSRHVGYGVQDSHYDTVGTALLWTLNKGLAEDFTPEVEAAWTEVYTLLASVMKEASKTQVA
>NZ_AP021859.1|WP_155014267.1|1666123_1667044_+|DUF2817-domain-containing-protein
MTMYPIGTPGTPWDENEKRQWLELQSVKRSYAEEVLTKLTALPETLTRVQYGALPYDTERYPLYALLSKSPTDGAPWVLITGGVHGYETSGVQGAILFANEYSKAYNGKVNFVVVPCVSPWGYETINRWNPKAVDPNRSFKPESPAAESQLLMDFVNSLPFDITLHVDLHETTDTDNSEFRPALAARDAIEQKTWNIPDGFYLVADTQQPCIALQEAMIHEVKQVTHIAPADDSGRIIGETLLSEGVIGYNKKALFLCGGFTNAPMCSTTEVYPDSPSATDEICNLAQVAAIGGALRYLLSDANAQ
>NZ_AP021859.1|WP_155523163.1|1667129_1667303_+|hypothetical-protein
MQFVALLYDGISQRFVRVEAQDEKAFFSALDKQYPCYVCLWHSYEATEVNASVPQQV
>NZ_AP021859.1|WP_073322593.1|1667415_1668354_+|tRNA-dihydrouridine(16)-synthase-DusC
MRVYLAPMEGVVDHLMRDMLTRVGGFDLCVTEFVRVVDQKLPHKTFYRLCPELHNDCKTPSGVPVKIQLLGQHPEWLAENAMTAVELGSPGVDLNFGCPAKTVNKSKGGAVLLQYTQQLHDIVYAVRQAVPAHLPVTAKIRLGYEDKSLAIDNAVAIDEAGASELVVHARTKTEGYRPPAYWDWIKKIKAVTRLPVIANGEIWNHDDAVRCMQASGCDDLMIGRGALAMPNLARHIRGEEAPMAWQDLSQLLIDYSGYEIFGDKGRYYPNRIKQWCGYLKRQYPQAETLFSNIRRLQKADEIVNVLRQSAHL
>NZ_AP021859.1|WP_155014268.1|1668521_1669940_+|amidohydrolase-family-protein
MNNTHKRLSIMMLCCCAFWLTACSEDKAAKTEQAAKVSIDKNPFPSRYTPLAGEPTLITNVTILDGIGNKIDKGMVYFADGKIVEIGETLSVPNGVRTIDGQGKWVTPGIIDVHSHLGVYPNPSTHSHSDGNEIVKPVTANVWAEHSVWPQDPGFGRALAGGVTSLQILPGSANLFGGRSVVLKNVPHRTMQEMKFPDAPYGLKMACGENPKRVYGKRGGPSTRMGNVAGYRQAWSDAQDYQRKWDQYEADYEAGKNPKAPKRDLNLETLAGVLDGDIRVHMHCYRADEMAVMMDVMKEFNYQIYSFQHAVEAYKISDILAENNVCSAMWADWWGFKMEAYDGIRENVPMVHNAGACAIVHSDSDLGIQRLNQEAAKAWADGRRAGIDIPQEDAWIWLSANPAKSLGIFDKTGSLESGKNADLVMWTANPFSTYARAEKVYIDGGLAYDLNDPQSWPVADFELGQVGEGDSK

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Crispr_ID: NZ_AP021859_3

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_AP021859_3

3836907-3836999

Orphan

Consensus_repeat	Method
GGGCAAACCTTAGCTCCGAGGCTTTTTTA	CRISPRCasFinder

1 spacers

The CRISPR arrays of NZ_AP021859_3

>merge|NZ_AP021859|3|3836907-3836999|CRISPRCasFinder
GGGCAAACCTTAGCTCCGAGGCTTTTTTATTAGGTCACGGGGCAAGCCCGGGATGACGGGGAGTGGGCAAACCTTAGCTCCGAGGCCTTTTTA

>NZ_AP021859|3|1|3836907-3836999|CRISPRCasFinder
GGGCAAACCTTAGCTCCGAGGCTTTTTTA	TTAGGTCACGGGGCAAGCCCGGGATGACGGGGAGT
GGGCAAACCTTAGCTCCGAGGCCTTTTTA

Protein	Signature genes	Signature genes Name	Protein_function
NZ_AP021859.1\|WP_155015720.1\|3839371_3839974_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155015714.1\|3827740_3829144_-\|undecaprenyl-phosphate-glucose-phosphotransferase	unknown	unknown	gnl\|CDD\|274396
NZ_AP021859.1\|WP_155014205.1\|3841742_3842780_+\|IS110-family-transposase	unknown	unknown	gnl\|CDD\|226077
NZ_AP021859.1\|WP_155015719.1\|3837612_3839007_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155015722.1\|3844323_3846102_+\|lasso-peptide-isopeptide-bond-forming-cyclase	unknown	unknown	gnl\|CDD\|238949
NZ_AP021859.1\|WP_155015713.1\|3826803_3827751_-\|sulfotransferase	unknown	unknown	gnl\|CDD\|379204
NZ_AP021859.1\|WP_073320074.1\|3846585_3846855_-\|PqqD-family-protein	unknown	unknown	gnl\|CDD\|377508
NZ_AP021859.1\|WP_155015712.1\|3826024_3826807_-\|glycosyltransferase	unknown	unknown	gnl\|CDD\|133055
NZ_AP021859.1\|WP_155015721.1\|3840430_3841045_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155015718.1\|3837325_3837520_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_139241519.1\|3840254_3840416_-\|lasso-RiPP-family-leader-peptide-containing-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_139241518.1\|3839197_3839359_-\|lasso-RiPP-family-leader-peptide-containing-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155015715.1\|3830006_3833462_+\|hypothetical-protein	unknown	unknown	gnl\|CDD\|214495
NZ_AP021859.1\|WP_155015709.1\|3823997_3824897_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_162359943.1\|3825127_3825604_-\|glucuronosyltransferase	unknown	unknown	gnl\|CDD\|227350
NZ_AP021859.1\|WP_155015723.1\|3846131_3846593_-\|lasso-peptide-biosynthesis-B2-protein	unknown	unknown	gnl\|CDD\|379206
NZ_AP021859.1\|WP_084526282.1\|3833804_3835025_-\|HD-domain-containing-protein	unknown	unknown	gnl\|CDD\|225116
NZ_AP021859.1\|WP_155015711.1\|3825600_3826032_-\|hypothetical-protein	unknown	unknown	gnl\|CDD\|370045
NZ_AP021859.1\|WP_155015717.1\|3836345_3836498_-\|lasso-RiPP-family-leader-peptide-containing-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155015716.1\|3835231_3836215_-\|hypothetical-protein	unknown	unknown	gnl\|CDD\|275147

Protein	Function_ID	Function_description	E-value
NZ_AP021859.1\|WP_155015714.1\|3827740_3829144_-\|undecaprenyl-phosphate-glucose-phosphotransferase	gnl\|CDD\|274396	TIGR03023, Sugar_transferase., Undecaprenyl-phosphate glucose phosphotransferase. This family of proteins encompasses the E. coli WcaJ protein involved in colanic acid biosynthesis, the Methylobacillus EpsB protein involved in methanolan biosynthesis, as well as the GumD protein involved in the biosynthesis of xanthan. All of these are closely related to the well-characterized WbaP (formerly RfbP) protein, which is the first enzyme in O-antigen biosynthesis in Salmonella typhimurium. The enzyme transfers galactose from UDP-galactose (NOTE: not glucose) to a polyprenyl carrier (utilizing the highly conserved C-terminal sugar transferase domain, pfam02397) a reaction which takes place at the cytoplasmic face of the inner membrane. The N-terminal hydrophobic domain is then believed to facilitate the "flippase" function of transferring the liposaccharide unit from the cytoplasmic face to the periplasmic face of the inner membrane. Most of these genes are found within large operons dedicated to the production of complex exopolysaccharides such as the enterobacterial O-antigen. Colanic acid biosynthesis utilizes a glucose-undecaprenyl carrier, knockout of EpsB abolishes incorporation of UDP-glucose into the lipid phase, and the C-terminal portion of GumD has been shown to be responsible for the glucosyl-1-transferase activity.	0
NZ_AP021859.1\|WP_155014205.1\|3841742_3842780_+\|IS110-family-transposase	gnl\|CDD\|226077	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair].	5.01108e-30
NZ_AP021859.1\|WP_073320074.1\|3846585_3846855_-\|PqqD-family-protein	gnl\|CDD\|377508	pfam05402, PqqD, Coenzyme PQQ synthesis protein D (PqqD). This family contains several bacterial coenzyme PQQ synthesis protein D (PqqD) sequences. This protein is required for coenzyme pyrrolo-quinoline-quinone (PQQ) biosynthesis.	3.74631e-14
NZ_AP021859.1\|WP_155015722.1\|3844323_3846102_+\|lasso-peptide-isopeptide-bond-forming-cyclase	gnl\|CDD\|238949	cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase .	4.81103e-22
NZ_AP021859.1\|WP_155015713.1\|3826803_3827751_-\|sulfotransferase	gnl\|CDD\|379204	pfam13469, Sulfotransfer_3, Sulfotransferase family.	4.66051e-13
NZ_AP021859.1\|WP_155015715.1\|3830006_3833462_+\|hypothetical-protein	gnl\|CDD\|214495	smart00060, FN3, Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.	4.87876e-05
NZ_AP021859.1\|WP_162359943.1\|3825127_3825604_-\|glucuronosyltransferase	gnl\|CDD\|227350	COG5017, COG5017, Uncharacterized conserved protein [Function unknown].	3.64913e-18
NZ_AP021859.1\|WP_155015723.1\|3846131_3846593_-\|lasso-peptide-biosynthesis-B2-protein	gnl\|CDD\|379206	pfam13471, Transglut_core3, Transglutaminase-like superfamily. This family includes uncharacterized proteins that are related to the transglutaminase like domain pfam01841.	3.73635e-11
NZ_AP021859.1\|WP_155015712.1\|3826024_3826807_-\|glycosyltransferase	gnl\|CDD\|133055	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.	3.58486e-43
NZ_AP021859.1\|WP_155015711.1\|3825600_3826032_-\|hypothetical-protein	gnl\|CDD\|370045	pfam08660, Alg14, Oligosaccharide biosynthesis protein Alg14 like. Alg14 is involved dolichol-linked oligosaccharide biosynthesis and anchors the catalytic subunit Alg13 to the ER membrane.	2.02965e-13
NZ_AP021859.1\|WP_155015716.1\|3835231_3836215_-\|hypothetical-protein	gnl\|CDD\|275147	TIGR04352, hypothetical_protein_imdm_1353, HprK-related kinase A. A number of protein families resemble HPr kinase (see TIGR00679) but do not belong to that system. They include this family, which appears instead to be the marker for a different type of gene neighborhood, in which one of the conserved neighboring proteins resembles (but is distinct from) PqqD.	5.24536e-06
NZ_AP021859.1\|WP_084526282.1\|3833804_3835025_-\|HD-domain-containing-protein	gnl\|CDD\|225116	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms].	3.18884e-64

>NZ_AP021859.1|WP_155015717.1|3836345_3836498_-|lasso-RiPP-family-leader-peptide-containing-protein
MNKQDDRNNKLTYHAPQLQRAGSLQDLTQGFSGSVPDEQGGHTKRNPFQG
>NZ_AP021859.1|WP_155015716.1|3835231_3836215_-|hypothetical-protein
MSPNTQQELASLSSTPLHGVTLALNKTTNNTNTPFALDVSFNPVTPLSNPKTAWTLTDKGERYESNITLNREEDCFQLHIDCEGQGTFQLQDNQLTIDWQANGTGPAHYLQTLGLSLFLELQGHLCLHANTLVKNNRAQLFLAPSRTGKSTLTTLLTTLGYTLTTDDMAALYHTNEQYEVYPSWPKVRLWPDSAAMLENHLTTQPVQQKKVHERFAKQEISFAAQDTHTATPVTAMYYLNRVDEQPQSANPLTITPINPSAALIILMQNSMLGDAYRGLGIEQSRIIALAALLLRVPFYRVTYPSGLEQLPDIAKHLDEWLQSEQAK
>NZ_AP021859.1|WP_084526282.1|3833804_3835025_-|HD-domain-containing-protein
MNKRRYQKLVAIYLLASLIWIIGSDWLLAQFIGDFNDSYVLSAGKGVAFVTLMSGLLYVLLMRLNSAERQVASAATDSTEQLVLNKIPTFFQHIPMLTYAVNWDGKTLRTLWVSDNLYDLLGYTQEEALQPGWWEKCVHPDDRLRALEESKTILANGGGDHYYRVKHAKGHYVYFHDELRRVESVSATCFVGIWRDISTEESALEQVQEYSTKLEKTILGTITAISHMVELRDPYTAGHESRVGELASAIALEMGLDMDTQYGLRIAGLLHDIGKISIPAEYLTKPTRLTDAEFEIIKSHACNGYNILKNIAFPWPVAEVAYQHHERLDGTGYPRGLKGDEILLEARIITVADVIESMATNRPYRHALGIKKALQEIEQHAGKLYDPEVCNAALTLFRQKNYQLGD
>NZ_AP021859.1|WP_155015715.1|3830006_3833462_+|hypothetical-protein
MKTCLKSFAAVVKYTAFLPLLFVISACGGGGDTSTPSPNPNPPVTPKAAISLGSYTLQTEVRDTNGNVVSNCNCGSAVVLPSGEIWVAFEQYPLVEDYGYGVLLTTQAQSTSSFTADYTYYEASVSVAEGSLASPGTFNANQFSFSLTVKGQSVNVEAVKSTDATSIDLFSLSSGSGELEFSDDPYSRISVTANGNVNGEVLGCPISDGELADVSNSSHVFSLSLSFDSCSNDSLSNQSFFGAAFTLPGISNHGLQLLFADDNGYFRQAYAEKLADTGSADYTTESAPSISSLTVTGAETVTLHWYSEDQTSIPTTGTSYTVYASTQPDFQPLPSHLVHVTGNALSADIDSLASDTEYYFKVIAKTEDKQILISSEIAAKTYKSSPVQAQGTTIFQAEDNNLTLTSASSDTLVFTLSAGANVPAPGDFIFAKNADDELLFKQVTSVSSNNNEATVLVTQPSLSELIPSAELSDYTSLGTVSGFAPDSRKVVPTASGKYPANSSPTKAERWGNSISNTAYYKPYKKAHLTQNGESVNVTFDGDKITLDIDGVAGSISLDPEIDIAPSVYSDFAWAGPLEIARAESRLDATVDLSLSVDSSISVDLLDFEKKIDVPSLTFKNTKIYAIGPIPVYQEIIFTLHAVISSTSSAGIDVSTGLHRKYKLTAGASYNGLTEEWDGILNGPTLLSQSSTASAAVHIGANLELRIVPKIELRFYTVAAPYFSIEPYASAQIEVGNGVTFDSLSGNTQVTPTQFNTFNVELGLECFVGFEFGILVPNKFSLKSKNVCPFVEPNVLWAVPEFQFDVDTESTTSDFATISALVSNDLGYNELDASSATWDIYPNDFILIPHSDDPLKADVQLTSNANEEGEYTILFSATDKLGEIGRQYDSGKIRFGDCITEINEAYTSMDQLPVDESLRLSQYACYHTPAQNEFGFRAENALFARSYFQVNEEVYEGPLSPLEVLNAGEFRYKKHVGILDIPFFTVAESRYVVYGNYGREIDGIYTKYREVIFKALSDEPFERLRRTASGDDKPYFVPKQSVEAELTYNVGIPSYNLTYFNGLDDTDTRGAIYYIYKYRCAQRTKETEGYTIYGELVYESYIEMPENDDTCLSNAEAKRLNSIDPIRDNYFYLDYKHRYSDLLEIIFAIKDE
>NZ_AP021859.1|WP_155015714.1|3827740_3829144_-|undecaprenyl-phosphate-glucose-phosphotransferase
MNENGGFIRNNINQFAFVYRLIDILIIQLCLAVSAFAYIGKFDVRYFVLGLISNMAYLFFAELFVLYRSWRNGSFKEMLFYTVSSWLLTLFPLFLFLFFTKQTEYFSRVTLGLWIISTTTLLCLWRAIFRQFLISIRKKGYNTRSVGIIGLTKRGLDLANEIVNYPESGYKLTAVFDERAPERLDGKFLHKLEGGIADGVRLAQEGKIEILFVALPLTNKERIENILKELGDTTVDVHIVPDVFTFNLLHSRMGHVGEIQTISVYDSPMRGGYSIMKRMEDLFLAFSILTVIALPMLIIAAIIKITSPGPVLFKQDRYGMDGKKIKVWKFRSMRVMDNGEVVKQATKNDPRVTKFGAFLRRTSLDELPQFFNVIQGTMSVVGPRPHAVSHNEEYRKKVAYYMLRHKMKPGITGWAQVNGWRGETDTVEKMEMRIKYDLEYIRNWSIWMDFKIVIFTIFRGFVGKNVY
>NZ_AP021859.1|WP_155015713.1|3826803_3827751_-|sulfotransferase
MFTKRIFIFSLPRSGSTLLQRYIMSCENIETTSETWLLLSLLYSIEVDGIKAEYNQRKLTEAVEDFFRYKKIEKKQYYESIIDFYFKLYNTQYISEENCSKIIVEKTPRLSLVSDKLIECSKNSHFVFLWRNPVDIINSMCETWAKGNWNIFHYYIDLYKSQKNNVETYLKYKNRSNVHSIKYEDFVSDENLREKLLDDIGLEYSELNINKAPLFGKMGDKTGIQKYKSISSPKKENKIGSILRYFWIKRYIRYLKEIGFDDIYDTGEIIKSYKISWRIDVLIKDVFSFSYGYIYLATEPRVYLEKLKGRKGILG
>NZ_AP021859.1|WP_155015712.1|3826024_3826807_-|glycosyltransferase
MRLVIITITYNNLNGLMKTNESLEVQSDQDYIQIVIDGGSTDGTSNYVKNMRERASFYYASERDKGIYDAMNKGLLKYKSLASNNNDYILFLNSGDYLYSKESIKNIKEKLIDLDSDLLLFDVYEDIYGKLYYKKSRDKDWIPKGMPTSHQAMLFNSNIFKEYKFDDGMRFSGDYDLVCYCYINNKNIKKTNKAVSVFDKTGVSEVNRIQALKENYVVRRKTLKMSFLNSIFLYFIHYIHTKLKIYLPGFTRYLRGLTRE
>NZ_AP021859.1|WP_155015711.1|3825600_3826032_-|hypothetical-protein
MSKKILAYASRGGHWVQLKRIIDGSDFKLTTISTLGNDEADYIVPDFSRSTWYRFLGVLISISKIVFRNNPDIVITTGAAPGVIIALLYRVKGTKVIWIDSIANSKKLSLSARIVRPFVTVVLTQWEYLADESKNILYKGAVF
>NZ_AP021859.1|WP_162359943.1|3825127_3825604_-|glucuronosyltransferase
MILVTVGVQLPFKRLIKKTLEIAKINADVDFVIQCGDHQHAELQNVKFISSVSEDDFNELISKCEFVVSHAGMGTILKALTLRKKIILVPRLASLNEHRNNHQIDTLRAFKQKPGIFPCMNVEDLLTIYNECKFAAPNFQESNNEKLRELRAYLFGLI
>NZ_AP021859.1|WP_155015709.1|3823997_3824897_-|hypothetical-protein
MSIEKIQAGLTVEETFITRILLFFSFIFFVFSKKVDFVVLFLRYRVFFLSVFFILNILQLVKQSNIAFGALKHIYHEEIILVYVIALLYIYCNKKSSIFTMLLIALTPAFSFKNTGFFLSLLLLVYILLILNNEKRVKLSVSLTISSVLVIAFTGVGFLLYDYILPYLPSGSPEVRLETYSFRINTFLENVFFGDYAGSNMLLRIGYLFWGFDIPSHSDVLDILAFFGLFGAFVFYYPVFLSIFRAAYTNSHDLTYLFIIAGFVFLMAFNPIINQPKLIVVYYYILATAYEKFRFKIKL
>NZ_AP021859.1|WP_155015718.1|3837325_3837520_-|hypothetical-protein
MTLKASDSASPARRYILKNKEKRDEEESSFDALEKRYQALKKAHEGDAERASAMVERAYALVRE
>NZ_AP021859.1|WP_155015719.1|3837612_3839007_-|hypothetical-protein
MQLPQFFKQVLHISTGFYKSLSQSNLIFGVVFLFLSATNYCVYMSFFTLSWPSEFESIIRHRDIAVFNVVLGAALLLSLILRFSTQHKDQDSWRYLSIVTFSLVGSVVVALITAENRLQMPAIWALAISLFHVGYWSWRREHNERVNYETKMENMTTDLTNYVNTMPKEDAFRLLGETTKAKFNFLYTLDLMAKQVNSIEQQEKLAEYAKEEFKTGLQALCQIASLWTISKTRIYEGNVMVALPSSDAPNAPNAREAFENGKYFFHDMTTLDTVEHCCKQVLYIVPQLTTSISSNNSQPTPTEPFMLPVGLKSEIHPGSIKGAPECVEKNEVVSIDINEIENSLPDNYVGQRLNEIKEYYKLQTDWQSVLCIPLHIKGLKNTADDFTNGVNPPANGVINLYRSEKGSVKSPELFYELTAPLVQLLENLLYLYLAHTPDDGSYSPEFCTFPRAEKDEGIGDIPTG
>NZ_AP021859.1|WP_139241518.1|3839197_3839359_-|lasso-RiPP-family-leader-peptide-containing-protein
MKSQQAVVTEQSKKAYAQPKLDKAGSLAELTQGFSGSVPDEQGGHTKRNPFQG
>NZ_AP021859.1|WP_155015720.1|3839371_3839974_-|hypothetical-protein
MRKFIIAMAMTLFASFQANASYVQGVTGADMVGISVSVDFANGSSESAIWQALTATQGGAFGASDWALILDGDSFGDFDPVSNTFYGLFTFFNGPFDVVSITIDILSEGFVFDTAFFDASANGSGPGHEFVSSDPQATASYSNLVEDELYGTMTISAFIAAGSGLAFQTDTDAFGEVPAPAGLLLVAFGLLALRATRRSK
>NZ_AP021859.1|WP_139241519.1|3840254_3840416_-|lasso-RiPP-family-leader-peptide-containing-protein
MTSIEQRDAHAEKREYSKPELQKAGSLAELTQGFSGSVPDEQGGHTKRNPFQG
>NZ_AP021859.1|WP_155015721.1|3840430_3841045_-|hypothetical-protein
MKKLILAGILAFFACQANAGFVQGVTGEDMVGMEVTATFADGSSETATWGAVAPGAGGVFGLIPWGVLLDGDSFGDFDPVTGDLFGGFLMMNFYDFDMVSLSFNALAAGFVFDTAYFDASANGSGPGRELVSSNADVFAVYSDNYMDELFGTMTLLSTSQVVVASGEMQVFLTDTDQISVPAPAGFAMIALALMGMRIARKSSK
>NZ_AP021859.1|WP_155014205.1|3841742_3842780_+|IS110-family-transposase
MYKDNVIAIDIAKSVFQVCVFDKHNQIKSNQEIRRQKLMAWLAKQPASIVALEGCGSSHYWAKVAEKLGHTPLQIPTRFVKKFVEGQKTDKNDAIAIGIAARQPNLKPVAVKSDEQLALQACEKMRKHYQDVAISTSNMMRSILYEFGIVIPQGESALKSKLPDILEDAENGLPMMLRQPLHQQFQLWLSLKERINEATKYLRVQLRTHTICNELQKLDGIGPVNALNLYLALGTKGESFKNGREAAACIGLTPKQHSSGGKVVMLGISKHIAKKQLRANLIQGALAKIKVVAKRPPKNTREVWMKQLIERRGLRRAAVALANKMVRVAWAMVHHQQPYKSPQAI
>NZ_AP021859.1|WP_155015722.1|3844323_3846102_+|lasso-peptide-isopeptide-bond-forming-cyclase
MTALFAMVAADHHAIMDLFDEASRHATLALNWQNEHCLVGNYCYRGNARTQLLSQFSLPQGNIIANASAPLSNSELSEQFSTQPAAISNAITGPHTVFTWHNPHKLLFASRDPLNQHALYYGKVGGVTVISSEASFIAKLMPKQPSLNTTALSCWLAGQPNPALCLYNEINTLPLGTSLSVSPQGNVTEHTFWDIDPQNKLAPTSDGAYRETFLDLLKQCVSSHIHPSDSLVVSQMSGGMDSTSITALANELLTEPRSCRALSHLYSHSASCDESDNIKAMYQKLGLVDPIQITVDAGAHRDFMSLYPTDYDSPGTVLSPRYHQECEIIQAAGGHRLLTGNGGDEMCWGHASAYTERLFKGEFGVIAEVLKACKQTGMARWPVARSLFVKPMIPQWLLNSAYALKGYKPSDIPAWLTPEAAKLATDASKIPNPFNERKQPVGYARYQALKTTSTYNSVRSYQKVGWQYGIDVAHPFFDPRMAEFSFAVPGKQLIRGPYPKWLLRNAMQNHLPESVCWNVKKVTFDNHFGQLVKDNAKPLRELLSDTRLASLGLVDNDVLLNAFDAAVGGNGVSVHVDLLYAILTQRWIQQHH
>NZ_AP021859.1|WP_155015723.1|3846131_3846593_-|lasso-peptide-biosynthesis-B2-protein
MLKSFSKYRALAPDQRRWFRRCWWQFALWHIRIQYFPYHWWKARIFSELNSESGHSLPFSLSEAIRLSEMAARHHIFPINCLRRCVVQQQLLAQYGYDLALHFGVAKQDARLKAHCWLTHNGQLINDGLEVVNTYTELKLAAEQSQHILASLR
>NZ_AP021859.1|WP_073320074.1|3846585_3846855_-|PqqD-family-protein
MSAYQLKPELLLQKVADEMVLLEPESGEYFTLNNVGADMLEQLQQGKSAQQIAQYIADIYDVTAEQAEQDFQVLMHDLVQANLAEAGVA

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Crispr_ID: NZ_AP021859_4

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_AP021859_4

3854466-3854541

Orphan

Consensus_repeat	Method
CACATCCGTCGTCCCGGAATCGT	CRISPRCasFinder

1 spacers

The CRISPR arrays of NZ_AP021859_4

>merge|NZ_AP021859|4|3854466-3854541|CRISPRCasFinder
CACATCCGTCGTCCCGGAATCGTCCCAGCGATATTCGAGTTCTTAAAAAGCCACACATCCGTCGTCCCGAAATCGT

>NZ_AP021859|4|2|3854466-3854541|CRISPRCasFinder
CACATCCGTCGTCCCGGAATCGT	CCCAGCGATATTCGAGTTCTTAAAAAGCCA
CACATCCGTCGTCCCGAAATCGT

Protein	Signature genes	Signature genes Name	Protein_function
NZ_AP021859.1\|WP_073320080.1\|3849350_3850259_+\|hydrogen-peroxide-inducible-genes-activator	unknown	unknown	gnl\|CDD\|176103
NZ_AP021859.1\|WP_073320101.1\|3857445_3858252_-\|phosphate-ABC-transporter-ATP-binding-protein	unknown	unknown	gnl\|CDD\|184582
NZ_AP021859.1\|WP_155014205.1\|3852964_3854002_+\|IS110-family-transposase	unknown	unknown	gnl\|CDD\|226077
NZ_AP021859.1\|WP_155015728.1\|3858257_3859847_-\|phosphate-ABC-transporter-permease-PstA	unknown	unknown	gnl\|CDD\|223654
NZ_AP021859.1\|WP_155015725.1\|3850334_3850877_+\|sigma-70-family-RNA-polymerase-sigma-factor	unknown	unknown	gnl\|CDD\|274357
NZ_AP021859.1\|WP_155014205.1\|3841742_3842780_+\|IS110-family-transposase	unknown	unknown	gnl\|CDD\|226077
NZ_AP021859.1\|WP_155015731.1\|3866441_3867596_-\|diguanylate-cyclase	unknown	unknown	gnl\|CDD\|143635
NZ_AP021859.1\|WP_073320074.1\|3846585_3846855_-\|PqqD-family-protein	unknown	unknown	gnl\|CDD\|377508
NZ_AP021859.1\|WP_155015722.1\|3844323_3846102_+\|lasso-peptide-isopeptide-bond-forming-cyclase	unknown	unknown	gnl\|CDD\|238949
NZ_AP021859.1\|WP_073320090.1\|3851542_3852481_+\|hypothetical-protein	unknown	unknown	gnl\|CDD\|235175
NZ_AP021859.1\|WP_155015727.1\|3854690_3854987_-\|GIY-YIG-nuclease-family-protein	unknown	unknown	gnl\|CDD\|198395
NZ_AP021859.1\|WP_073320098.1\|3856714_3857431_-\|phosphate-signaling-complex-protein-PhoU	unknown	unknown	gnl\|CDD\|182974
NZ_AP021859.1\|WP_155015726.1\|3850869_3851520_+\|hypothetical-protein	unknown	unknown	gnl\|CDD\|274073
NZ_AP021859.1\|WP_073320095.1\|3855129_3856398_-\|inorganic-phosphate-transporter	unknown	unknown	gnl\|CDD\|376541
NZ_AP021859.1\|WP_073320107.1\|3862127_3862637_+\|glycine-cleavage-system-protein-R	unknown	unknown	gnl\|CDD\|225337
NZ_AP021859.1\|WP_155015729.1\|3859839_3861987_-\|ABC-transporter-permease-subunit	unknown	unknown	gnl\|CDD\|226956
NZ_AP021859.1\|WP_155015723.1\|3846131_3846593_-\|lasso-peptide-biosynthesis-B2-protein	unknown	unknown	gnl\|CDD\|379206
NZ_AP021859.1\|WP_162359946.1\|3846858_3848547_-\|ATP-binding-cassette-domain-containing-protein	unknown	unknown	gnl\|CDD\|224055
NZ_AP021859.1\|WP_155015730.1\|3862761_3864834_+\|polyphosphate-kinase-1	unknown	unknown	gnl\|CDD\|274734
NZ_AP021859.1\|WP_073320113.1\|3864830_3866363_+\|exopolyphosphatase	unknown	unknown	gnl\|CDD\|182781

Protein	Function_ID	Function_description	E-value
NZ_AP021859.1\|WP_073320080.1\|3849350_3850259_+\|hydrogen-peroxide-inducible-genes-activator	gnl\|CDD\|176103	cd08411, PBP2_OxyR, The C-terminal substrate-binding domain of the LysR-type transcriptional regulator OxyR, a member of the type 2 periplasmic binding fold protein superfamily. OxyR senses hydrogen peroxide and is activated through the formation of an intramolecular disulfide bond. The OxyR activation induces the transcription of genes necessary for the bacterial defense against oxidative stress. The OxyR of LysR-type transcriptional regulator family is composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repressor. The C-terminal domain also contains the redox-active cysteines that mediate the redox-dependent conformational switch. Thus, the interaction between the OxyR-tetramer and DNA is notably different between the oxidized and reduced forms. The structural topology of this substrate-binding domain is most similar to that of the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate membrane transport complex comprised of two integral membrane domains and two cytoplasmically located ATPase domains. This interaction triggers the ligand translocation across the cytoplasmic membrane energized by ATP hydrolysis.	8.73999e-63
NZ_AP021859.1\|WP_073320101.1\|3857445_3858252_-\|phosphate-ABC-transporter-ATP-binding-protein	gnl\|CDD\|184582	PRK14236, PRK14236, phosphate transporter ATP-binding protein; Provisional.	8.74981e-169
NZ_AP021859.1\|WP_155014205.1\|3852964_3854002_+\|IS110-family-transposase	gnl\|CDD\|226077	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair].	5.01108e-30
NZ_AP021859.1\|WP_155015728.1\|3858257_3859847_-\|phosphate-ABC-transporter-permease-PstA	gnl\|CDD\|223654	COG0581, PstA, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism].	1.07856e-83
NZ_AP021859.1\|WP_155015725.1\|3850334_3850877_+\|sigma-70-family-RNA-polymerase-sigma-factor	gnl\|CDD\|274357	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family. This model encompasses all varieties of the sigma-70 type sigma factors including the ECF subfamily. A number of sigma factors have names with a different number than 70 (i.e. sigma-38), but in fact, all except for the Sigma-54 family (TIGR02395) are included within this family. Several Pfam models hit segments of these sequences including Sigma-70 region 2 (pfam04542) and Sigma-70, region 4 (pfam04545), but not always above their respective trusted cutoffs.	9.81645e-42
NZ_AP021859.1\|WP_155014205.1\|3841742_3842780_+\|IS110-family-transposase	gnl\|CDD\|226077	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair].	5.01108e-30
NZ_AP021859.1\|WP_155015731.1\|3866441_3867596_-\|diguanylate-cyclase	gnl\|CDD\|143635	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain. Diguanylate-cyclase (DGC) or GGDEF domain: Originally named after a conserved residue pattern, and initially described as a domain of unknown function 1 (DUF1). This domain is widely present in bacteria, linked to a wide range of non-homologous domains in a variety of cell signaling proteins. The domain shows homology to the adenylyl cyclase catalytic domain. This correlates with the functional information available on two GGDEF-containing proteins, namely diguanylate cyclase and phosphodiesterase A of Acetobacter xylinum, both of which regulate the turnover of cyclic diguanosine monophosphate. Together with the EAL domain, GGDEF might be involved in regulating cell surface adhesion in bacteria.	8.10863e-55
NZ_AP021859.1\|WP_073320074.1\|3846585_3846855_-\|PqqD-family-protein	gnl\|CDD\|377508	pfam05402, PqqD, Coenzyme PQQ synthesis protein D (PqqD). This family contains several bacterial coenzyme PQQ synthesis protein D (PqqD) sequences. This protein is required for coenzyme pyrrolo-quinoline-quinone (PQQ) biosynthesis.	3.74631e-14
NZ_AP021859.1\|WP_155015722.1\|3844323_3846102_+\|lasso-peptide-isopeptide-bond-forming-cyclase	gnl\|CDD\|238949	cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase .	4.81103e-22
NZ_AP021859.1\|WP_073320090.1\|3851542_3852481_+\|hypothetical-protein	gnl\|CDD\|235175	PRK03918, PRK03918, DNA double-strand break repair ATPase Rad50.	1.35384e-05
NZ_AP021859.1\|WP_155015727.1\|3854690_3854987_-\|GIY-YIG-nuclease-family-protein	gnl\|CDD\|198395	cd10448, GIY-YIG_unchar_3, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria. The family includes a group of uncharacterized bacterial proteins with a GIY-YIG domain that shows statistically significant similarity to the N-terminal catalytic domains of GIY-YIG family of intron-encoded homing endonuclease I-TevI and catalytic GIY-YIG domain of nucleotide excision repair endonuclease UvrC.	1.52605e-46
NZ_AP021859.1\|WP_073320098.1\|3856714_3857431_-\|phosphate-signaling-complex-protein-PhoU	gnl\|CDD\|182974	PRK11115, PRK11115, phosphate signaling complex protein PhoU.	1.23297e-85
NZ_AP021859.1\|WP_155015726.1\|3850869_3851520_+\|hypothetical-protein	gnl\|CDD\|274073	TIGR02302, conserved_hypothetical_protein, TIGR02302 family protein. Members of this family are long (~850 residue) bacterial proteins from the alpha Proteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted by TMHMM to be outside the membrane. In Bradyrhizobium japonicum, two tandem reading frames are together homologous the single members found in other species; the cutoffs scores are set low enough that the longer scores above the trusted cutoff and the shorter above the noise cutoff for this model.	0.00389624
NZ_AP021859.1\|WP_073320095.1\|3855129_3856398_-\|inorganic-phosphate-transporter	gnl\|CDD\|376541	pfam01384, PHO4, Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukaemia virus receptor.	3.91828e-55
NZ_AP021859.1\|WP_073320107.1\|3862127_3862637_+\|glycine-cleavage-system-protein-R	gnl\|CDD\|225337	COG2716, GcvR, Glycine cleavage system regulatory protein [Amino acid transport and metabolism].	1.30384e-30
NZ_AP021859.1\|WP_155015729.1\|3859839_3861987_-\|ABC-transporter-permease-subunit	gnl\|CDD\|226956	COG4590, COG4590, ABC-type uncharacterized transport system, permease component [General function prediction only].	9.07175e-104
NZ_AP021859.1\|WP_155015723.1\|3846131_3846593_-\|lasso-peptide-biosynthesis-B2-protein	gnl\|CDD\|379206	pfam13471, Transglut_core3, Transglutaminase-like superfamily. This family includes uncharacterized proteins that are related to the transglutaminase like domain pfam01841.	3.73635e-11
NZ_AP021859.1\|WP_162359946.1\|3846858_3848547_-\|ATP-binding-cassette-domain-containing-protein	gnl\|CDD\|224055	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].	6.57923e-134
NZ_AP021859.1\|WP_155015730.1\|3862761_3864834_+\|polyphosphate-kinase-1	gnl\|CDD\|274734	TIGR03705, poly_P_kin, polyphosphate kinase 1. Members of this protein family are the enzyme polyphosphate kinase 1 (PPK1). This family is found in many prokaryotes and also in Dictyostelium. Sequences in the seed alignment were taken from prokaryotic consecutive two-gene pairs in which the other gene encodes an exopolyphosphatase. It synthesizes polyphosphate from the terminal phosphate of ATP but not GTP, in contrast to PPK2. [Central intermediary metabolism, Phosphorus compounds].	0
NZ_AP021859.1\|WP_073320113.1\|3864830_3866363_+\|exopolyphosphatase	gnl\|CDD\|182781	PRK10854, PRK10854, exopolyphosphatase; Provisional.	0

>NZ_AP021859.1|WP_155014205.1|3852964_3854002_+|IS110-family-transposase
MYKDNVIAIDIAKSVFQVCVFDKHNQIKSNQEIRRQKLMAWLAKQPASIVALEGCGSSHYWAKVAEKLGHTPLQIPTRFVKKFVEGQKTDKNDAIAIGIAARQPNLKPVAVKSDEQLALQACEKMRKHYQDVAISTSNMMRSILYEFGIVIPQGESALKSKLPDILEDAENGLPMMLRQPLHQQFQLWLSLKERINEATKYLRVQLRTHTICNELQKLDGIGPVNALNLYLALGTKGESFKNGREAAACIGLTPKQHSSGGKVVMLGISKHIAKKQLRANLIQGALAKIKVVAKRPPKNTREVWMKQLIERRGLRRAAVALANKMVRVAWAMVHHQQPYKSPQAI
>NZ_AP021859.1|WP_073320090.1|3851542_3852481_+|hypothetical-protein
MKALAIASLFGATLATSAVAQNLTELDKQLNIMSGVIDTALKQDTRKEGVRYRSIEATYLAKQGVIFTIHTGGRGMMFDFDFGDLMSVIPTPPSAPDAPTVTVVADGMHVESHGDYEFIVEQDWGETAERVVRQVEKIVRKTDEKLREFRSDRREIEWEIRELERRNRDLEFELRAADNERKREIEGEMKELKAELDRLQSRQQELQDYAQELAEEKKAELAKQREMQEKAYKTFLANFEGSVGDTLCSFGAGLRELPDDEHISFILKNFGKGEDGKAQDRLYIFNKKSVKSCVAEKITPSDLLAKAEVYVF
>NZ_AP021859.1|WP_155015726.1|3850869_3851520_+|hypothetical-protein
MANYESDYLFGQWLDNALTNEERDAFEALCLSDKAFAAQVETATQLSVAAEQFTPPPMPAWDKNATFVAPDKPKWWQWQGLPVMSMAMSALAIVMVVSGFSVQVSEGKLTMGFQQGPSDEQVAALVNQKLNDYQQANQAMFTQYVAALQQQQQENSTQLTQYLLTSSRQERREDFAELIKFINQQRDDDQRYYARQLTQLQREINSLDDGYPALTE
>NZ_AP021859.1|WP_155015725.1|3850334_3850877_+|sigma-70-family-RNA-polymerase-sigma-factor
MFEKRDVQLIEQALKGHKKAWFALIKRYESAIYQYGVRMTGNPHDAADLMQDIFIAVFRSLSNYRGEGSFKAWLFRIAHFRCIEFYRRKRPDSPLDEDDELSCERPCPEHNLMTDSTSQALTAAMQRLPLAQRAVIELKFFGQFTFDEIADQLGLSSNTVKSRLYSALSKLKLDLEVEHG
>NZ_AP021859.1|WP_073320080.1|3849350_3850259_+|hydrogen-peroxide-inducible-genes-activator
MKWPNLKHLHYLVTLHQEQHFHRAAQRCNVSQSTLSTAIQNLEEHFGSQLLEREHKTFVFTSLGLDVVERSKVILQEAGELVEYAQNAGNWQRGKLKLGVIPTIAPFLFEAMLGAFRTFLPEIQLELQEDTTANLTRQLTDGSLDLLVLALPMETPGCKQMVLGHDPFHLIAHKDLANDLPSPLDISSLPKKSIFLLQQEHCMTGHAVSACNLQHTDQISSLAASSLYTLVQLANSKLGYTFLPELALNQDLLKNTQLTSFPAEEKAFREIGLVWRAGTTRMRLFRRVGEIISPLLPVPTLK
>NZ_AP021859.1|WP_162359946.1|3846858_3848547_-|ATP-binding-cassette-domain-containing-protein
MFDSLRLHYGLKVFQSYKWSFIMVVALMVTETAVSLSVPYLIGQQSQSFLQEVTILNNNHLKYLYLWVALFAAQAALRFLSTYNVNLVGARLMAELSCRLYDHIQVLPIQYFQENKRGDVLSMLTNDLAVVSFFASSVLTNLIPNILVLIGAGILMYMIEPTIALLICFLVPLIYIILKLVSRGMQPISRALVQRQADSLAQASENIGAISLIKAFNKEEAESSKFKRRSNEILALRARQFRLQALLSPLIQFLASLCILVVVIMSVLKFNSGALGIPDLISLLLYGIVFTRPLGSLAGLYGQLQQVIGASERLLHVYHLESEPRDEHGTPMTIEHGDIVFDSVAFGFPQRGTILKDVSFHVKAGQNMLIYGQNGGGKTTMLHLLMRFYQPATGKILIDGQNIQQATTASLRHAIGLVSQDVLLLNGSIFDNLTYGLANPAADDVYKAASQAGLDHLIMRLPQGYDTQVGEGGVRLSGGQRQRIALARALLMKPKILLLDEPTSMLDEQARLSFKEEFHGLFAQFTVIMISHDPTLSDVADVVYQLENGTLQRQSIRSDYQE
>NZ_AP021859.1|WP_073320074.1|3846585_3846855_-|PqqD-family-protein
MSAYQLKPELLLQKVADEMVLLEPESGEYFTLNNVGADMLEQLQQGKSAQQIAQYIADIYDVTAEQAEQDFQVLMHDLVQANLAEAGVA
>NZ_AP021859.1|WP_155015723.1|3846131_3846593_-|lasso-peptide-biosynthesis-B2-protein
MLKSFSKYRALAPDQRRWFRRCWWQFALWHIRIQYFPYHWWKARIFSELNSESGHSLPFSLSEAIRLSEMAARHHIFPINCLRRCVVQQQLLAQYGYDLALHFGVAKQDARLKAHCWLTHNGQLINDGLEVVNTYTELKLAAEQSQHILASLR
>NZ_AP021859.1|WP_155015722.1|3844323_3846102_+|lasso-peptide-isopeptide-bond-forming-cyclase
MTALFAMVAADHHAIMDLFDEASRHATLALNWQNEHCLVGNYCYRGNARTQLLSQFSLPQGNIIANASAPLSNSELSEQFSTQPAAISNAITGPHTVFTWHNPHKLLFASRDPLNQHALYYGKVGGVTVISSEASFIAKLMPKQPSLNTTALSCWLAGQPNPALCLYNEINTLPLGTSLSVSPQGNVTEHTFWDIDPQNKLAPTSDGAYRETFLDLLKQCVSSHIHPSDSLVVSQMSGGMDSTSITALANELLTEPRSCRALSHLYSHSASCDESDNIKAMYQKLGLVDPIQITVDAGAHRDFMSLYPTDYDSPGTVLSPRYHQECEIIQAAGGHRLLTGNGGDEMCWGHASAYTERLFKGEFGVIAEVLKACKQTGMARWPVARSLFVKPMIPQWLLNSAYALKGYKPSDIPAWLTPEAAKLATDASKIPNPFNERKQPVGYARYQALKTTSTYNSVRSYQKVGWQYGIDVAHPFFDPRMAEFSFAVPGKQLIRGPYPKWLLRNAMQNHLPESVCWNVKKVTFDNHFGQLVKDNAKPLRELLSDTRLASLGLVDNDVLLNAFDAAVGGNGVSVHVDLLYAILTQRWIQQHH
>NZ_AP021859.1|WP_155014205.1|3841742_3842780_+|IS110-family-transposase
MYKDNVIAIDIAKSVFQVCVFDKHNQIKSNQEIRRQKLMAWLAKQPASIVALEGCGSSHYWAKVAEKLGHTPLQIPTRFVKKFVEGQKTDKNDAIAIGIAARQPNLKPVAVKSDEQLALQACEKMRKHYQDVAISTSNMMRSILYEFGIVIPQGESALKSKLPDILEDAENGLPMMLRQPLHQQFQLWLSLKERINEATKYLRVQLRTHTICNELQKLDGIGPVNALNLYLALGTKGESFKNGREAAACIGLTPKQHSSGGKVVMLGISKHIAKKQLRANLIQGALAKIKVVAKRPPKNTREVWMKQLIERRGLRRAAVALANKMVRVAWAMVHHQQPYKSPQAI
>NZ_AP021859.1|WP_155015727.1|3854690_3854987_-|GIY-YIG-nuclease-family-protein
MSERYPAVYILSNFTRTVLYVGVTSNLPQRVYQHKMSMASGFCSRYNVKDLVYYEMHEEMYAAITREKQLKRWRRSWKEKLITQKNPQWLDLYPLIVG
>NZ_AP021859.1|WP_073320095.1|3855129_3856398_-|inorganic-phosphate-transporter
MDFLQSYGMILIILAAAVGFVMAWGIGANDVANAMGTSVGSKALTIKQAIIIAMIFEFAGAYLAGGEVTSTIRKGIIDTAYFVDIPEYLVLGMISSLFAAGLWLAVASYLGWPVSTTHSIVGAIIGFTAVGVSMDAVEWSKVGGIVGSWIVTPAISGVIAYLIFMSAHKLIFETDKPFHYARKYVPFYMAFAGFVMSLVTIKKGLKHVGLDLSPTTGYVLSVVLAVIIAFIGKWLISRQAYSHSEDADLQRANVEKVFALLMVVTACCMAFAHGSNDVANAIGPLAAVVSVVSNGGEIGSSSSLAPWILPLGGLGIVAGLALFGHRVIATIGEGITHLTPSRGFAAEMAAACTVVIASGTGLPISTTQTLVGAVLGVGLARGVSALNLGIIRNIVISWVVTLPAGAILSILCFFTLKAIFGV
>NZ_AP021859.1|WP_073320098.1|3856714_3857431_-|phosphate-signaling-complex-protein-PhoU
MHQVALNTHISDRFNLELENLRNSVLTMGGEVEQQLIDTLKAISTNNPGLAEKVILNDLKVNSMEMQIDEECVRIIAKRHPTASDLRLIMTISKAITDIERMGDEIERIAKLVTKQKIPASESIKSSMLQIGQQVTAMMRGTFDAFARQDERAALHVYDQDNRIDSEYKKLLTFTTGEMSRSGEDMEDWLEILWALRSLERIGDRCKNICEYIVSLTSGKDVRHTPLESLQQKLDDLT
>NZ_AP021859.1|WP_073320101.1|3857445_3858252_-|phosphate-ABC-transporter-ATP-binding-protein
MLKLFERERLDLEGLSPEQTAIEVRDLNLRFGQKHVLHDINMRIPKHRITALIGQSGCGKSTLIACFNRMNDLILNSQTSGEIVIEGRNINHKKENLSLLRSQVGMVFQRPNPFPMSIYDNVCYGLRLQGIKQRRQLDDAVERALHEAALWEEVKDRLFDSAMTLSGGQQQRLVIARALALKPSILLLDEPTSALDPLTTLFIEELMGELKKRCTIVIVTHNMQQAARVSDYTAFLHQGELVEYSDSDTLFTMPDKKQTEDYITGRYG
>NZ_AP021859.1|WP_155015728.1|3858257_3859847_-|phosphate-ABC-transporter-permease-PstA
MGKWSVRQFLSNRQQQSFVISLGAFCASLLLVALVCVLALIALRGSDYFWPRPVHSLTYVDQAGKEHKVYGQVGVGHASSQYSAGTQRLWLIRYSDTRYPYGNQLILETPAIQNLAVAKDAADMLLADGTRVFAKPVSVDMPENQAQPLSSLAQAQARVDLLQQDVDQIRTQHLAPIHRRLAELDIRAVAEDAPARERLSAEFREWQTKVLEREAQIAEFRLNVQFSDGAPFSVALNELDQLTYTGQLTTWGKLGVAADGVWTFLSESPKQANTAGGVFPALFGTVLMIFIMTILVTPFGVMAAIYLNEYAPDNSMTAVIRICVSNMAGVPSIVYGVFGLGFFVYMVGGQIDELFFSDRLPAPTMGTPGVFWAALTMAILTLPVVIVATEEGLRRVPDRLKAGSYALGATKLETIWHTILPIASPGIMTGVILAIARAAGEVAPLMLVGAVKFAPNLPFDGEFPYLHLDRQFMHLGVLIYDGAFHSQTDMRSASMMFASCLLLLLVVFVLNILAVILRKRLRQRYLRGY
>NZ_AP021859.1|WP_155015729.1|3859839_3861987_-|ABC-transporter-permease-subunit
MNVAAEQGSVLKKRRRRDTAARVIISGFGGIVLLTMVILIWHLFSQAASIAMSPDADIQTEIPVLPSGRYLYVGDMDSGQAAIIDGPGCRLTLARLEADTLTARQSIRRPCSHTLTTLQVQGQPYAVDISTSGQVRLLPVPTVGTAQTLGSFGTPLASELSFAIPEAVWAEHTDWTLGVGEQWLIMVVNTAQSQLIQWVNRQDPANIMRHTLPSSHPVALLPDSKQVVQIRDRELRFYNEKHQQINQISLTKAVDKVFTFVKNRSLFVTHPDSTVSRYTVFNDKGTLRYQRTYVLALRKQEQPVAIYPHASVNGLAMVTNQQQLLLINRVTGEIVERRGLPIQPTGVSWFDNRAYVFSDAVMIKLRIQHLAGLSTADSLLTPQIYEGYKEADQLWQTTSATDYQETKMNLVPLLIGSFKASGLALLIAIPLALGAAVYTAYFARPKVRDNMKPAIEMLEAIPSVLIGFIAAIWLAPLAERFLFSFAVFLFTVPFSLLLIAFVQHKVARNLPSEVRNVAELILPVLGIVGLGYISIEWAPQLLFYLLEVNDFDFITDATGVPVGKTTILVAIALGFAISPSIYSLAEDAISGVPASLRQASYALGATRLQTLRRVVLRVAFPGIMAAIMLGFGRAFGETMIVLMVTGNTPIADWDLFAGLRALTANLAIELPEAELDSMHYKVLFLTACVLFTFTFVVNTLAELLRQRLRRNASYG
>NZ_AP021859.1|WP_073320107.1|3862127_3862637_+|glycine-cleavage-system-protein-R
MKPVIITVIGKDRPGLVDAVAKKVYQFGGNWQGSSFAHMAGQFAGFVEVLVPAEQHQALIDALNTLDGLQVQSQSVTDTLEQPDEMLRIEVMGNDRAGIVQELTNVLHGFNLNILHFASTCESAPNWGSQMFKAQLRVGVSADLDRDDLQEALEAVANDLVVDITTTLS
>NZ_AP021859.1|WP_155015730.1|3862761_3864834_+|polyphosphate-kinase-1
MESTDLYYPKELSWLAFNERVLQEAADKNNPAVERIRFLGIYSNNLDEFFRVRVSDVKRQIIIAQNDGNELEAQHQRKLLEQIQQKVMALSKKFDTIHKDVVKALARYNIYILQKHELTDYQREWVRNYFVNKVLRHIAPILIDKKTDLLSRLNGNAVYLYVALRREGRSPRFAAVQVPTGEVPRFFLIPPQRSRKNKHIILLDDMIQLSMEDIFRGFVKFDTLESYSFKMTRDAEYSINDEIDESYVEKMSESMKQRLIAEPVRVIHDQDMPEDMVEDLQKRLKVTKLDTLHSAGHYRNFKDFIGFPNPGREYLEHPPLPAIDTKDFSAYNTVFDAISDHDILLYYPYHRFLHFTEFVRQAAFDPSVKSIRINIYRVASHSRIISSLIDAVDNGKKVTVIVELRARFDEEANIEWSKRMTDAGIRVVLGVPSLKIHSKLCIVSREERGKLMHYAHFGTGNFNEKTAKIYTDYSLFTKNQELAEEGNAVFDLISNPYRRYKFQHLQISPLNARTKIQSLIRQEIQYLKEGHKAGITFKINNLVDNELIDDLYRASQAGVKIRGIVRGMCSLRPGIKGLSENIKIISVVDRFLEHPRVMIFNGGGNRKVYISSADWMMRNMDNRIEVGAPVYDDALQQRIVDIMEIQFRDTMKAREIDKEQVNQYVRRGNRKKLRSQEEIYDYLKQLEEKK
>NZ_AP021859.1|WP_073320113.1|3864830_3866363_+|exopolyphosphatase
MSPDPSSFASVESREASKVAALDIGSNSFHLVVARIVAGSVQILHRVKQKVRLAEGLNEDNVLQGEAKQRGLDTLKIIADSLKGFEPDSVRIVATHTLRRAVNAKEFIEEALQVLPYPIEVISGTEEARLIYSGVAHTNHDAGKRLVVDIGGGSTEFIIGEGLSPLLLRSLQMGCVSYTQRFFANGELKAKAFDKAITAAEQEMEPIEERYRRLGWQRCIGTSGTIKAIYNLVQQTQKEGEHDVPVTLKALKSLMKQFIEAGHIDKLNFPEMTEDRRPVIPAGLCVLIGLFKALKIEALEYSPAALREGVLYQMEDELHNSDIRSRTASSLATRYDVDIDQANLVLNTTLTLFNHCQKAWKLRHPDYRAILGWAALLHEVGFQINTRGVQRHSAYILQNVDMPGFNQAQQELLATLVRFHRKKIRIADIPSFTQYDKEDVYRLIVMLRLGVLLNIKRQESFLPEFEVDVEKDKLSVTFPAEWLPSKPIMTADLEREKGYQSAVGIELDVS
>NZ_AP021859.1|WP_155015731.1|3866441_3867596_-|diguanylate-cyclase
MMDYDYVVAQTLLLGIVALIGVVFTFSLPVPNHKTKASSGVFARLFLAACWLVEVFQTVRFSGYEDIGRVGFYVMSLSAAYMLMMTIVKRYGHSLNRQQITLVLLHLVGVALCSLLLQAGYLPQWTANTVILLSVAFPVWQAIRRVKYYLATNSLGDKVLYAVLSTVFYTLLAILPIYLIFFDASIVHHHSLTFAILLVFMLVFMLSFAVSVLHSLVNRLHTQVHTDPLTGAKNRHFFYEIAPKLSAHALRNNEILSVVACDIDHFKAINDKHGHVVGDIALKRFCKIIQDELRAEDTLIRMGGEEFLVLSPHCDRNQATELAERLRKVISETEIEAKGVNLMLTASFGVIEMTHNSEFFSSVKEADQALYNAKAAGRNQVITV

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Crispr_ID: NZ_AP021859_5

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_AP021859_5

4714383-4714471

Orphan

Consensus_repeat	Method
TTGAATAGATCCCGGGACAAGCCCGGGA	CRISPRCasFinder

1 spacers

The CRISPR arrays of NZ_AP021859_5

>merge|NZ_AP021859|5|4714383-4714471|CRISPRCasFinder
TTGAATAGATCCCGGGACAAGCCCGGGAAGACGGTGAGGTTGTTTTGAAATCGCGGCCTTGTTGAATAGATCCCGGGGCAAGCCCGGGA

>NZ_AP021859|5|3|4714383-4714471|CRISPRCasFinder
TTGAATAGATCCCGGGACAAGCCCGGGA	AGACGGTGAGGTTGTTTTGAAATCGCGGCCTTG
TTGAATAGATCCCGGGGCAAGCCCGGGA

Protein	Signature genes	Signature genes Name	Protein_function
NZ_AP021859.1\|WP_155016254.1\|4703988_4704921_-\|histone-deacetylase-family-protein	unknown	unknown	gnl\|CDD\|212541
NZ_AP021859.1\|WP_155016253.1\|4702534_4703062_+\|thioredoxin	unknown	unknown	unknown
NZ_AP021859.1\|WP_073317384.1\|4704917_4706774_-\|GNAT-family-N-acetyltransferase	unknown	unknown	gnl\|CDD\|223504
NZ_AP021859.1\|WP_155016266.1\|4721538_4721847_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_073317402.1\|4712044_4713178_-\|efflux-RND-transporter-periplasmic-adaptor-subunit	unknown	unknown	gnl\|CDD\|184990
NZ_AP021859.1\|WP_155016255.1\|4707451_4708603_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155016256.1\|4708921_4712029_-\|efflux-RND-transporter-permease-subunit	unknown	unknown	gnl\|CDD\|273335
NZ_AP021859.1\|WP_155016265.1\|4720919_4721483_-\|CBS-domain-containing-protein	unknown	unknown	gnl\|CDD\|341398
NZ_AP021859.1\|WP_155016259.1\|4715055_4715796_-\|hypothetical-protein	unknown	unknown	gnl\|CDD\|319992
NZ_AP021859.1\|WP_155016251.1\|4699829_4700801_+\|zinc-transporter-ZntB	unknown	unknown	gnl\|CDD\|213367
NZ_AP021859.1\|WP_155016263.1\|4719987_4720314_+\|TfoX/Sxy-family-protein	unknown	unknown	gnl\|CDD\|377438
NZ_AP021859.1\|WP_155016261.1\|4718043_4718940_-\|DUF560-domain-containing-protein	unknown	unknown	gnl\|CDD\|313331
NZ_AP021859.1\|WP_155016262.1\|4719020_4719842_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155016258.1\|4714512_4715037_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155016252.1\|4701056_4702535_+\|chemotaxis-protein	unknown	unknown	gnl\|CDD\|214599
NZ_AP021859.1\|WP_155016701.1\|4703388_4703955_+\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155016264.1\|4720331_4720784_-\|hypothetical-protein	unknown	unknown	unknown
NZ_AP021859.1\|WP_155016257.1\|4713304_4713913_+\|TetR-family-transcriptional-regulator	unknown	unknown	gnl\|CDD\|182632
NZ_AP021859.1\|WP_155016260.1\|4716616_4718026_+\|S8-family-serine-peptidase	unknown	unknown	gnl\|CDD\|173797
NZ_AP021859.1\|WP_073317410.1\|4715792_4716314_-\|RNA-polymerase-sigma-factor	unknown	unknown	gnl\|CDD\|224511

Protein	Function_ID	Function_description	E-value
NZ_AP021859.1\|WP_155016254.1\|4703988_4704921_-\|histone-deacetylase-family-protein	gnl\|CDD\|212541	cd11599, HDAC_classII_2, Histone deacetylases and histone-like deacetylases, classII. This subfamily includes eukaryotic as well as bacterial Class II histone deacetylase (HDAC) and related proteins. Deacetylases of class II are Zn-dependent enzymes that catalyze hydrolysis of N(6)-acetyl-lysine residues of histones (EC 3.5.1.98) and possibly other proteins to yield deacetylated histones/other proteins. In D. discoideum, where four homologs (HdaA, HdaB, HdaC, HdaD) have been identified, HDAC activity is important for regulating the timing of gene expression during development. Also, inhibition of HDAC activity by trichostatin A is shown to cause hyperacetylation of the histone and a delay in cell aggregation and differentiation.	2.68891e-165
NZ_AP021859.1\|WP_155016251.1\|4699829_4700801_+\|zinc-transporter-ZntB	gnl\|CDD\|213367	cd12833, ZntB-like_1, Salmonella typhimurium Zn2+ transporter ZntB-like subgroup. A bacterial subgroup belonging to the Escherichia coli CorA-Salmonella typhimurium ZntB_like family (EcCorA_ZntB-like) of the MIT superfamily of essential membrane proteins involved in transporting divalent cations (uptake or efflux) across membranes. This subgroup includes the Zn2+ transporter Salmonella typhimurium ZntB which mediates the efflux of Zn2+ (and Cd2+). Structures of the intracellular domain of Vibrio parahaemolyticus and Salmonella typhimurium ZntB form funnel-shaped homopentamers, the tip of the funnel is formed from two C-terminal transmembrane (TM) helices from each monomer, and the large opening of the funnel from the N-terminal cytoplasmic domains. The GMN signature motif of the MIT superfamily occurs just after TM1, mutation within this motif is known to abolish Mg2+ transport through Salmonella typhimurium CorA, and Mrs2p. Natural variants such as GVN and GIN, which occur in proteins belonging to this subfamily, may be associated with the transport of different divalent cations, such as zinc and cadmium. The functional diversity of MIT transporters may also be due to minor structural differences regulating gating, substrate selection, and transport.	5.59474e-120
NZ_AP021859.1\|WP_073317384.1\|4704917_4706774_-\|GNAT-family-N-acetyltransferase	gnl\|CDD\|223504	COG0427, ACH1, Acetyl-CoA hydrolase [Energy production and conversion].	1.30363e-104
NZ_AP021859.1\|WP_073317410.1\|4715792_4716314_-\|RNA-polymerase-sigma-factor	gnl\|CDD\|224511	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription].	3.14344e-37
NZ_AP021859.1\|WP_155016256.1\|4708921_4712029_-\|efflux-RND-transporter-permease-subunit	gnl\|CDD\|273335	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family. Proteins scoring above the trusted cutoff (1000) form a tight clade within the RND (Resistance-Nodulation-Cell Division) superfamily. Proteins scoring greater than the noise cutoff (100) appear to form a larger clade, cleanly separated from more distant homologs that include cadmium/zinc/cobalt resistance transporters. This family is one of several subfamilies within the scope of pfam00873. [Cellular processes, Toxin production and resistance, Transport and binding proteins, Unknown substrate].	0
NZ_AP021859.1\|WP_155016265.1\|4720919_4721483_-\|CBS-domain-containing-protein	gnl\|CDD\|341398	cd04640, CBS_pair_proteobact, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in proteobacteria. The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).	3.74648e-45
NZ_AP021859.1\|WP_155016259.1\|4715055_4715796_-\|hypothetical-protein	gnl\|CDD\|319992	cd16328, RseA_N, N-terminal domain of RseA. This family contains the cytoplasmic (N-terminal) domain of RseA, the transmembrane anti-sigma-E factor. RseA is degraded during sigma-E-dependent transcription caused by bacterial envelope stress such as heat shock. It is an inner membrane protein with an N-terminal cytoplasmic domain that binds sigma-E and blocks its transcriptional activity, and a C-terminal periplasmic domain that binds RseB, an auxiliary negative regulator. Under inducing conditions, RseA is rapidly degraded and sigma-E is released into the cytoplasm, where it can bind core RNAP and induce its regulon. It has been shown that just the N-terminal domain is sufficient to bind and inhibit sigma-E. The C-terminal domain may interact with other proteins that signal periplasmic stress.	0.00907926
NZ_AP021859.1\|WP_155016263.1\|4719987_4720314_+\|TfoX/Sxy-family-protein	gnl\|CDD\|377438	pfam04993, TfoX_N, TfoX N-terminal domain. TfoX may play a key role in the development of genetic competence by regulating the expression of late competence-specific genes. This family corresponds to the N-terminal presumed domain of TfoX. The domain is found as an isolated domain in some proteins suggesting this is an autonomous domain.	1.10676e-32
NZ_AP021859.1\|WP_155016261.1\|4718043_4718940_-\|DUF560-domain-containing-protein	gnl\|CDD\|313331	pfam10082, BBP2_2, Putative beta-barrel porin 2. This domain is a putative beta-barrel porin type 2.	2.10654e-05
NZ_AP021859.1\|WP_073317402.1\|4712044_4713178_-\|efflux-RND-transporter-periplasmic-adaptor-subunit	gnl\|CDD\|184990	PRK15030, PRK15030, multidrug efflux RND transporter periplasmic adaptor subunit AcrA.	1.92745e-84
NZ_AP021859.1\|WP_155016252.1\|4701056_4702535_+\|chemotaxis-protein	gnl\|CDD\|214599	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer). Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.	1.06344e-44
NZ_AP021859.1\|WP_155016257.1\|4713304_4713913_+\|TetR-family-transcriptional-regulator	gnl\|CDD\|182632	PRK10668, PRK10668, DNA-binding transcriptional repressor AcrR; Provisional.	1.23597e-34
NZ_AP021859.1\|WP_155016260.1\|4716616_4718026_+\|S8-family-serine-peptidase	gnl\|CDD\|173797	cd05561, Peptidases_S8_4, Peptidase S8 family domain, uncharacterized subfamily 4. This family is a member of the Peptidases S8 or Subtilases serine endo- and exo-peptidase clan. They have an Asp/His/Ser catalytic triad similar to that found in trypsin-like proteases, but do not share their three-dimensional structure and are not homologous to trypsin. The stability of subtilases may be enhanced by calcium, some members have been shown to bind up to 4 ions via binding sites with different affinity. Some members of this clan contain disulfide bonds. These enzymes can be intra- and extracellular, some function at extreme temperatures and pH values.	3.90602e-78

>NZ_AP021859.1|WP_155016257.1|4713304_4713913_+|TetR-family-transcriptional-regulator
MRRTKEEAEQTRAAILDAAVDVFSSQGVARATLEQIAKSANVTRGAVYWHFKNKTDIFMALYDELHKPFIQELVDGLEKSYDDPLRQLEKVCCDLTVRLEEDPHLQRVLSLFLLKCDYSGSLQICQEKTRLAKEEKQATLEKFFAKAQAQGTLSADLDPKTLTMALNCFFRGIVVEYLENPDEFSLKEMAPKLFGVFFGKWR
>NZ_AP021859.1|WP_073317402.1|4712044_4713178_-|efflux-RND-transporter-periplasmic-adaptor-subunit
MIKKLVLAALSVSVIAVTGCSQESGGQQAAGGQGAPSGVPVNVVTVEQQNVSTTLELPGRVSAFRQSHVRPQVTGVITQRLFEQGTVVEKGQQLYQIDDLQYKAALNSAKADVASANANVKTLKAKAARYKDLMKVNAISGQEYDDVVAQLDQAMAAVSVAEAQVALAEVNMDYTKVYAPISGRISRSFYTEGALVTANQTDPLATITQLDPVYVDVQVSSEQALGLQMALRDKGSLTVDLTIPGSHQSLEGLKGTVEFSEVIVNESTGSVTIRARFPNPDNILLPGLYVRATIHLSDTAALVVPQRATIRQPDGSLSVWVVNGDNPELRSIGVLQAFDGNWQVSNGLSAGEQIIVAGYHKLRPGAKVMPIPLSKDA
>NZ_AP021859.1|WP_155016256.1|4708921_4712029_-|efflux-RND-transporter-permease-subunit
MARFFIDRPVFAWVLAIITMLAGVMAITSLPIQQYPTVAPPSVTISASYPGASAQTVENAVTQVIEQRLTAIDNLRYFHSSSANGRMTITLTFEPEADPDIAQVQTQNKVQGAVSQLPSSVQQMGVTVTKSNNSFMMAVGFYSEDDSIDQYELGDILLSKFRDPISRVDGVGSVRAFGAQKAMRIWLDPQRLYSYNLTPQDVQSAIAVQNTDVSAGELGGLPAISGQEINATIQAQSRLQTVDDFERILLRVNADGSQVRLRDVARVELGSESYGVISRYKRHPAAGMALSLASGANALDTIDRVKARVEDLKSNLPAGVKVIYPIDNGPFIELSIKSVVQTLLEAIVLVFLVMLLFLQNWRATLIPTIAVPVVLLGTFAVLYAFGFSINVLTMFGLVLAIGLLVDDAIVVVENVERIIHEEGLSPKEATKKSMTQITSALVGIAAVLSTVFIPMAFFSGSAGAIYRQFSITIVSAMVLSVLVAIILSPSLCATFLRAADAEGEAKTGFFGWFNRTFNKGRDRYQGATRFMANRLARFVALYTLLVAGMVVIFMRLPGSFLPNEDQGFLMMMLNTPAGSSAERTLESVAKVEDHFLEKEGDLVDHMFTVTGFSFAGSAQSSALGFIRLKDWSERTEPGTSVEAVAGRAYPALAQVIDASAFAFFPPPIRELGNASGFDMQLLDVAGRGHEALMQARNQLLGAAAQNPKLVGVRPNGLNDVPQYKIDIDSEKATALGVSLSDINSTLQIAWGSSYVNDFIDDGRIKKVYLQADAPHRMMPDDLNKWYLRNSAGDMVPFSAFASSKWTYGSPQLERFDGVSSVNLQGSAAPGISSGEAMQEMEKLVAEMLPEGFEIAWSGLSYEERAAGSQAGFLYAISILVVFLCLAALYESWAVPFSVILIVPLGILGAVVAAYLFNLSNDVYLQVAFLTTIGLAAKNAILIVEFAKVLQEEEGKTVMEAVTMAAKQRFRPILMTSMAFILGVTPLAIANGPGAASQNAIGITVIGGMFAATFLAIFFVPMFYVLISKFSRPKQS
>NZ_AP021859.1|WP_155016255.1|4707451_4708603_-|hypothetical-protein
MLKLSFIIILCAWIWSPTKAIANEVVMSVICFDCNESYAKNLAKEHATPFIECDQNNDYFEFNSTQSCYSQPKRIIVFDGLYKTAYPYRLSHSNQGGPLNTLRLNIDDFQIESGTHSLLTDIANARLDYEERMETFVADTLSRIPNEEFNNLVLSSSVNDQCSDDPGVAAFKRAMSQSDTTALKTWFQIQQNVINDSVFGFLPFDISALSFSYQSPPTPYGSLGITGQFDVDPDASVITVVYTNISSTLRSTVVEGAQIESSAVVYKIGKQENFPSLVEVNVEPDLSRVDGVPLSDIAGGRVADVDQTKPVSKCVYDYIKDVYETEQRMAPVGGGLGGSTVGNRGGSGSGEMGPALTYEGSCVVDTFSGGEHTGTFKINCADL
>NZ_AP021859.1|WP_073317384.1|4704917_4706774_-|GNAT-family-N-acetyltransferase
MTVALPPRPDWSQLIKSGCRVFVGGNAGVPYALIDDLIANSKAYSDIELVHMLALGDNRWAKEEYRQLFKVNTFFIHGDEVRRAVDEGRADYTPVFLSEMSSLFSDGTLPLDTALVMVSPPDEFGYCSLGVSVDICMSAARHANKVVAQINPQMPRTAGHSYLHISEFAAVIEADQPLQEIEAPPIDSVTERIGQYVAMLVEDGATLQFGVGKIPSATLKYLERHKDLGIHSEMLSDSIMEIIASGAISNRKKTFHPGKVVTSFCIGSRKLYDFVNNNPHIEFYPSSYVNKPTNIAKNDNMIAINSALEVDLTGQVVADSLGFDFYSGIGGQVDFVSGASMSKGGKPIIALPSTAKNETVSRIVPYITEGSGVVTSRGNVHYIVTEYGIASLRGKSIRERALELIRVAHPKFRAKLLAEVRQNYWVPHYQQKYPTDIPELGAIQLHKMVVNGEKFYLRPLNPADERRLQEFFYSHTKETLRLRYNYDPKQMSREKSCNLVSVDQSSDAALCIVRQEGSRITIHAVGRFYYNEHDNTCEAAFVTRETQQGKGMASKLLTTLIDIAQKRNINKMLAFCRADNKPMIAIFEHHGFKRLFSGDPSEVELALPLQEASQEKSA
>NZ_AP021859.1|WP_155016254.1|4703988_4704921_-|histone-deacetylase-family-protein
MTIKIFRGKDCVHHDVSGEHPEHPDRLYAIDDQLLSSGLDMVCQHADAKPVKRENLALAHDPYYVDSIFQRAPKTGVIWLEQDTGMTPITLSAALYAAGAGCDAVDWVMDGENRQAFCAVRPPGHHAEYDNAMGFCLFNNIAVAARYAVKKYDLSRVAIVDFDVHHGNGTEHIIAGDQRIMMCSSFQHPFYPHSGSPVSASNILCAPLEAGANGEAFRKAVSYWFDALINYQPQLILISAGFDAHAEDHMGQLRLREDDYHWVSQQLRKVADKVCHGRIVSMLEGGYNLSALGRSVVAHIKGLHGDDTSH
>NZ_AP021859.1|WP_155016701.1|4703388_4703955_+|hypothetical-protein
MLVLPVCFLALPFAANANAGVPMLFLAMPALLMSLLPIIFIESVYCAQRLSLSFGQSLKTVSISNLASTLVGIPVTWLLLVGVQIATSGGRAYGIDSPVEKVLAVTWQAPWLIPYETDLHWMIPAAGLVLLVPFYFASWWSEFWIAKKLNTLLPTSDIKLTVRNANRITYCLLAGWPIASWLANVAMK
>NZ_AP021859.1|WP_155016253.1|4702534_4703062_+|thioredoxin
MSSRTTTILVGVWATAILVALLIANSNQMQDFDPDASLAQAASQQDFDSAFTGMLQEAGVSNGSIVHLSADSNCFCNDLSKGHQYDITQSLADKGYEFHTLSLSENPAISKLISHFPALAVIDNNGNLRYVGPYATGFGCFTGNDLVDDIARIATTEQYFGATVNTEARGCFCNV
>NZ_AP021859.1|WP_155016252.1|4701056_4702535_+|chemotaxis-protein
MFSWVKEGHQIFRVILIVQLVISVVIGLITGELMIAFWLGIPIIALPLYLSYANPESEISGHAVGIGVQLMTALHIHQAFGLIEIHFEIFVLLAMLAYFRNWRIIATSTATVAVHHILFFFMQAGGSGVFIFEENHITFSILLLHAAFALAEGLTLMYMTKRSHEDGVGGALLESAIADIIRDKESLNLAVKIDKSVPVMRTFDELLDAIRQLVSNAAKLADDVADTSAFMQNATRELSEHAQQSHQEIGSISAASEEIAVTMQDTSERTNAANDITQEAKANTSESRTSVESTKTTISSLRDRLNSAAQTNQELNERCASISDSMRSITAVAEQTNLLALNAAIESARAGEHGRGFAVVADEVRTLAIRSKESADEISTITEQLVASTASSVTQMNQCIELVDEAVSASDRAATHMQGIESKIQAASDNMMEVATSAVEQETASSSIAASTAKIYELATQEARTAAELEQKSQSLATLCQTLQTMVRRFVV
>NZ_AP021859.1|WP_155016251.1|4699829_4700801_+|zinc-transporter-ZntB
MSNADQAFLWAYDIGADGTIATVDQAAITTPVAPNTYRWVHLQSDESDAEQLLDTLALPSSVADSLMALQTRPRVLPIKEGALIFLRGINANPGADPDDMVSLRLWLTPNLMVTARRQNRRLMSVQDTREMIESGEAPATTAELLVTLLTRIADRIHDKIEDIDEQLAQYETADALNKQDRQQLAMLRRQTAIIRRHLAPQRDALDTLIRLPNLINDSLIFELRDQADRMTRYVEDLDLARERSLVLQDELRNQIADQQGIRMYVLSMITAIFLPLSFLTGVFGMNVAGLPGTEAPDAFTTLMMAMGGIAVVMLIAMLWKRWL
>NZ_AP021859.1|WP_155016258.1|4714512_4715037_-|hypothetical-protein
MKTLMKFATLVMAMTLAACASQPAYRAAENGGYGYSETKLTDTQYRVYFKGKGSDKTKAMDYAMLRAAEITLDQGYDWFVVANRETMVDREKVSMEPEIGFSKRYTRVTDCGLVTCRTSYYPESTLSTGIYVGGREKSVIESALDIQLGRGTRPDNSASFDARQVKENLSPKDE
>NZ_AP021859.1|WP_155016259.1|4715055_4715796_-|hypothetical-protein
MKITDEQLSAFLDNELDDEQMALVRDAIAADETLCDRMATLSMVDHVVKRAAEQATTGPVPEHIVARCDSASESNVVSFADRKAEQTAQQPTPNSDTRWLRGMAMAASVALVGLLGWQQLMGDQPGDAGQWQQIAAVLDSQTSGSRYSAGDVTVMPQLSFVHQDGALCRQFTVSGQSRNDAVIACKKDGSWQQRTLVPMTPTNGQAGEYQTATSAHELDKVLDTMIKGAPLNREQEQQAIQSNWQQ
>NZ_AP021859.1|WP_073317410.1|4715792_4716314_-|RNA-polymerase-sigma-factor
MTKTQSEQLKAMLPVLRRFAYSLTGSMADADDLLQNTVEKLLTKPVPDDVELLAWSYRICRNLWIDEYRANKVRQAAVHNPELQQAEVDATAQITSDITLKQVESAMATLPDDQREVLSLVAVQGLSYQDTANVLSVPSGTVMSRLARARSKLAQILFLEKGTKGPNGNEVTA
>NZ_AP021859.1|WP_155016260.1|4716616_4718026_+|S8-family-serine-peptidase
MKLSLSTKLLSRSLIAAAVVLSPISLVNAQVLPSVTRSVTQPIEDLTRRLPANRVTERLTRPEKPALPELLPALTSTLTADLSNALLPVKQAISVVDSLQQTVLREEITPQGELAIAREWVVYTSEADLAWFEQSPFSITKQRYVALLDSWLVNIQVPDAFNSLNRIKAALPAHLQSQLGRNHVYLTQSNAAEANEESAAVKESAAVKESAATTSSVETKPLCETPARIGMLDSAIEDTHPLLSTLQVREHTFIDASLPLSRAHGTAVAGILQQQLAHNSQVVNAAVFYARTQVSQGASLFDLVSGLDWLASQQVPVINMSLTGPDNPLLAKAITGLSSKGVTLIAAVGNAGPAAPPLFPAAYPDVIGVSAVDAQGNIYRWANQGEQVALSAPGVSVLTARVKGETGPETGTSIASPAVAGWIAQWRTCQSGSDATKTAIPPALRQQLQDKGEPGWDPVFGEGAWLPAK
>NZ_AP021859.1|WP_155016261.1|4718043_4718940_-|DUF560-domain-containing-protein
MKTTLIFAAITAVSAPAVATVTYKGELQAGAQYDSNVTVTELDRASNQSDYAGYLKGKLSADWQATDALSFNAGVNHQRTQYQDATDFNLAITTWNVGAGLKNRLGKWGVHSYLADAALDGNDFMRYQQSGVSWQNSITAKTYLHISADYLQKRFDTVPERDANGGQLSTQWFYMPDMEGQMINVGYTYQYEDADIDRLDFTGHAGQLSWTYPTQWWGTPTSLKAQYQFAYRDYREAEAFLSVTQRTDRQHTLGVSASYQFTKHVAFDIGAKYADYQSNLSIADYQENQANVGVTAKF
>NZ_AP021859.1|WP_155016262.1|4719020_4719842_-|hypothetical-protein
MKTFTFTSLILAMGLAHGAAQAQSVESNVTFNQSTAADAQTSASTDPTTASSVTLSFANQSNVQANSNANEQVADESAAQSEEATTEVAESAEGTAEQSTETVSESEETMAETGQQATASADTALEQSTDIVTDLPVAELSQSLQGSLQGSLNVVADAGGSISSTLNGSGDAINSAVNGTLQNTIDAATAAQVADAASVEQVVSSAVSATVAQNVSTSAVTQVSDTVNSTVTNSVTSAVDGAVEGAVEGAVEGAVSSTVNNAVAANISSILGN
>NZ_AP021859.1|WP_155016263.1|4719987_4720314_+|TfoX/Sxy-family-protein
MSANQFANSLHDVFSTFGPIHLKRMFGGHGVFSQGKMFALVVSDQLYIKVDAGMKQALEARGYSPFTYVRQGKTIALSYMEAPEEIYDDPDDACNWAHQAYTAALAGK
>NZ_AP021859.1|WP_155016264.1|4720331_4720784_-|hypothetical-protein
MATFYNKQHGEFSLIMNQDVVLTNAVGPWNLECIEQFGIDYATSVYSAKVSRWADIIMLQGESLLVPDAEKDLQVRIARAVETGLSHVAVVMCKSEVKTTAKLQMKRLYRNLPAELAFFETVQEAIGWITEHGYRCEAPVIEAFFDDAPK
>NZ_AP021859.1|WP_155016265.1|4720919_4721483_-|CBS-domain-containing-protein
MHHLTLCQTEAIDTLAHPEVYDHVELASSALSIFTDFHEHQPLVIDGNVKAIELERLMRQSHVKMKLVLDKHDQFVGIVTLADITEQKILQRVVQLGLPRSELLVVDMMQPKAALQAFDYHELKVASVKDVVDTLQDNGKMHCLVIDKQQHEIRGVISVSDIARILRVPLDIQSQPSFAALSHIIAA
>NZ_AP021859.1|WP_155016266.1|4721538_4721847_-|hypothetical-protein
MKAWLLILTLIALAGLPGSGAAVQSGLHGQWSSVDSNLTEAESAEVEQGIESDQQGDGPDVISSANGIRFAPQASTVPGQLQTQLSGSFYSGYAIRAPPVFS

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity
NZ_AP021859_2	2.1\|1657924\|22\|NZ_AP021859\|PILER-CR	1657924-1657945	22	NZ_AP021859.1	1658091-1658112	2	0.909

1. spacer 2.1|1657924|22|NZ_AP021859|PILER-CR matches to position: 1658091-1658112, mismatch: 2, identity: 0.909

gtttagaaataatgagagggga	CRISPR spacer
gtttagaaaaaataagagggga	Protospacer
********* ***.********

MGE targeting detection<

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity

Prophage detection

Region

Region Position

Protein_number

Hit_taxonomy

Key_proteins

Att_site

Prophage annotation

DBSCAN-SWA_1

888747 : 896718

Enterobacteria_phage(33.33%)

The bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_155013753.1\|888747_889707_+	NAD-dependent epimerase/dehydratase family protein	D1LW79	Prochlorococcus_phage	6.2e-90	52.4
WP_155013754.1\|889744_890140_+	hypothetical protein	NA	NA	NA	NA
WP_155013755.1\|890140_891376_+	hypothetical protein	NA	NA	NA	NA
WP_155013756.1\|891412_892150_+	sulfotransferase family 2 domain-containing protein	NA	NA	NA	NA
WP_155013757.1\|892283_893162_+	glucose-1-phosphate thymidylyltransferase RfbA	I7I009	Enterobacteria_phage	1.5e-98	59.6
WP_155013758.1\|893216_893762_+	dTDP-4-dehydrorhamnose 3,5-epimerase	I7HJC4	Enterobacteria_phage	1.1e-46	55.3
WP_155013759.1\|893785_894643_+	dTDP-4-dehydrorhamnose reductase	A0A291LA50	Escherichia_phage	2.5e-34	34.9
WP_155013760.1\|894660_895749_+	dTDP-glucose 4,6-dehydratase	A0A1D7XFE8	Escherichia_phage	7.7e-97	52.5
WP_155013761.1\|895806_896718_+	NAD-dependent epimerase/dehydratase family protein	L7RCI0	Acanthamoeba_polyphaga_moumouvirus	1.9e-08	22.8

DBSCAN-SWA_2

2866728 : 2892555

Acidithiobacillus_phage(40.0%)

protease,transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_155016623.1\|2866728_2867760_+\|transposase	IS630 family transposase	NA	NA	NA	NA
WP_155015095.1\|2868143_2868896_-	hypothetical protein	NA	NA	NA	NA
WP_073324816.1\|2869163_2869415_-	hypothetical protein	NA	NA	NA	NA
WP_155015096.1\|2869641_2870100_-	DUF4265 domain-containing protein	NA	NA	NA	NA
WP_073324818.1\|2870221_2870923_-\|protease	CPBP family intramembrane metalloprotease	NA	NA	NA	NA
WP_155015097.1\|2871501_2871894_-	hypothetical protein	NA	NA	NA	NA
WP_073324821.1\|2872027_2872363_-	hypothetical protein	NA	NA	NA	NA
WP_155015098.1\|2872470_2872848_-	hypothetical protein	NA	NA	NA	NA
WP_155015099.1\|2872953_2873241_-	hypothetical protein	NA	NA	NA	NA
WP_155015100.1\|2873468_2873918_-	hypothetical protein	NA	NA	NA	NA
WP_155015101.1\|2874031_2874547_-	hypothetical protein	NA	NA	NA	NA
WP_155016624.1\|2874678_2875884_+\|transposase	IS4 family transposase	A4KWT9	Enterobacteria_phage	1.9e-112	52.4
WP_155016625.1\|2876074_2876320_+	type II toxin-antitoxin system CcdA family antitoxin	NA	NA	NA	NA
WP_155015102.1\|2876319_2876637_+	CcdB family protein	NA	NA	NA	NA
WP_155015103.1\|2876751_2877063_-	hypothetical protein	NA	NA	NA	NA
WP_155015104.1\|2877174_2877594_-	hypothetical protein	NA	NA	NA	NA
WP_155015105.1\|2877707_2878163_-	hypothetical protein	NA	NA	NA	NA
WP_155015106.1\|2878285_2878558_-	hypothetical protein	NA	NA	NA	NA
WP_155015107.1\|2878582_2879533_-\|transposase	IS30 family transposase	W5R8L2	Staphylococcus_phage	6.9e-41	34.3
WP_155523170.1\|2879593_2879764_-	hypothetical protein	NA	NA	NA	NA
WP_155015108.1\|2879870_2880290_-	hypothetical protein	NA	NA	NA	NA
WP_155523171.1\|2880424_2880808_-	hypothetical protein	NA	NA	NA	NA
WP_155015110.1\|2881348_2881684_-	hypothetical protein	NA	NA	NA	NA
WP_155014623.1\|2882814_2884362_+\|transposase	IS21 family transposase	K4I413	Acidithiobacillus_phage	1.5e-125	46.1
WP_155014142.1\|2884376_2885132_+	ATP-binding protein	K4HZD4	Acidithiobacillus_phage	1.5e-54	49.6
WP_155015111.1\|2885548_2885872_-	hypothetical protein	NA	NA	NA	NA
WP_014978218.1\|2887015_2887258_+	type II toxin-antitoxin system ParD family antitoxin	NA	NA	NA	NA
WP_155015112.1\|2887254_2887557_+	type II toxin-antitoxin system RelE/ParE family toxin	NA	NA	NA	NA
WP_073325466.1\|2888011_2888263_-	hypothetical protein	NA	NA	NA	NA
WP_014978218.1\|2889250_2889493_+	type II toxin-antitoxin system ParD family antitoxin	NA	NA	NA	NA
WP_155015112.1\|2889489_2889792_+	type II toxin-antitoxin system RelE/ParE family toxin	NA	NA	NA	NA
WP_155015113.1\|2889897_2890275_-	hypothetical protein	NA	NA	NA	NA
WP_155015114.1\|2890391_2890757_-	hypothetical protein	NA	NA	NA	NA
WP_155014787.1\|2890872_2891268_-	hypothetical protein	NA	NA	NA	NA
WP_155015024.1\|2891604_2892555_+\|transposase	IS30 family transposase	Q9MBM9	Staphylococcus_prophage	3.1e-41	34.3

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage

Overview of predicted results

Overview of the results

Cas Category Instructions

Results visualization

1. NZ_AP021860

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Click the colored protein region to show detailed information

Self-targeting detection

MGE targeting detection<

Prophage detection

Anti-CRISPR protein detection

2. NZ_AP021859

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Click the colored protein region to show detailed information

Click the colored protein region to show detailed information

Click the colored protein region to show detailed information

Click the colored protein region to show detailed information

Click the colored protein region to show detailed information

Self-targeting detection

MGE targeting detection<

Prophage detection

Anti-CRISPR protein detection