assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	1	4566-4646	1	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	GCTCAGATCGCTGAGAAGTATCCG	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|120aa|up_3|NZ_CP036273.1_1037_1397_+,NA|271aa|up_2|NZ_CP036273.1_1409_2222_+,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|328aa|up_4|NZ_CP036273.1_0_984_+	COG0593, DnaA, ATPase involved in DNA replication initiation [DNA replication, recombination, and repair]	NA|120aa|up_3|NZ_CP036273.1_1037_1397_+	NA	NA|271aa|up_2|NZ_CP036273.1_1409_2222_+	NA	NA|375aa|up_1|NZ_CP036273.1_2600_3725_+	cd00140, beta_clamp, Beta clamp domain	NA|111aa|up_0|NZ_CP036273.1_3805_4138_+	pfam05258, DUF721, Protein of unknown function (DUF721)	NA|118aa|down_0|NZ_CP036273.1_4815_5169_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|843aa|down_1|NZ_CP036273.1_5187_7716_+	PRK14939, gyrB, DNA gyrase subunit B; Provisional	NA|222aa|down_2|NZ_CP036273.1_7773_8439_-	cd00241, DOMON_like, Domon-like ligand-binding domains	NA|141aa|down_3|NZ_CP036273.1_8460_8883_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|112aa|down_4|NZ_CP036273.1_8879_9215_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|325aa|down_5|NZ_CP036273.1_9217_10192_-	PRK09375, PRK09375, quinolinate synthase NadA	NA|133aa|down_6|NZ_CP036273.1_10273_10672_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|477aa|down_7|NZ_CP036273.1_10764_12195_-	PLN03159, PLN03159, cation/H(+) antiporter 15; Provisional	NA|417aa|down_8|NZ_CP036273.1_12209_13460_-	NF033113, halo_ClmS, chloramphenicol-biosynthetic FADH2-dependent halogenase CmlS	NA|158aa|down_9|NZ_CP036273.1_13456_13930_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	2	1842635-1842715	2	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	CCAGGCCGCCCCGCCGCGGCCGC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|119aa|up_0|NZ_CP036273.1_1841673_1842030_+,NA|147aa|down_3|NZ_CP036273.1_1849383_1849824_-	NA|120aa|up_9|NZ_CP036273.1_1825336_1825696_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|530aa|up_8|NZ_CP036273.1_1826011_1827601_+	pfam07585, BBP7, Putative beta barrel porin-7 (BBP7)	NA|130aa|up_7|NZ_CP036273.1_1827654_1828044_-	cd14263, DAGK_IM_like, Integral membrane diacylglycerol kinase and similar enzymes	NA|923aa|up_6|NZ_CP036273.1_1828061_1830830_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|255aa|up_5|NZ_CP036273.1_1831022_1831787_+	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|1269aa|up_4|NZ_CP036273.1_1831940_1835747_-	sd00042, LVIVD, LVIVD repeat	NA|433aa|up_3|NZ_CP036273.1_1836126_1837425_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|422aa|up_2|NZ_CP036273.1_1838017_1839283_+	pfam05150, Legionella_OMP, Legionella pneumophila major outer membrane protein precursor	NA|701aa|up_1|NZ_CP036273.1_1839344_1841447_-	PRK08633, PRK08633, 2-acyl-glycerophospho-ethanolamine acyltransferase; Validated	NA|119aa|up_0|NZ_CP036273.1_1841673_1842030_+	NA	NA|680aa|down_0|NZ_CP036273.1_1844728_1846768_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|754aa|down_1|NZ_CP036273.1_1846858_1849120_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|66aa|down_2|NZ_CP036273.1_1849193_1849391_-	pfam03884, YacG, DNA gyrase inhibitor YacG	NA|147aa|down_3|NZ_CP036273.1_1849383_1849824_-	NA	NA|111aa|down_4|NZ_CP036273.1_1849843_1850176_-	TIGR02605, CxxC_CxxC_SSSS, putative regulatory protein, FmdB family	NA|178aa|down_5|NZ_CP036273.1_1850178_1850712_-	cd00446, GrpE, nucleotide exchange factor GrpE	NA|379aa|down_6|NZ_CP036273.1_1850728_1851865_-	PRK10767, PRK10767, chaperone protein DnaJ; Provisional	NA|544aa|down_7|NZ_CP036273.1_1851999_1853631_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|104aa|down_8|NZ_CP036273.1_1853762_1854074_-	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|571aa|down_9|NZ_CP036273.1_1854158_1855871_-	PRK00013, groEL, chaperonin GroEL; Reviewed
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	3	1992036-1992140	3	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	GCAGCCGCGGCGGGTTCCTCGGCC	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|225aa|up_4|NZ_CP036273.1_1984582_1985257_+,NA|393aa|down_1|NZ_CP036273.1_1993867_1995046_-,NA|111aa|down_6|NZ_CP036273.1_2005695_2006028_-	NA|388aa|up_9|NZ_CP036273.1_1978972_1980136_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|114aa|up_8|NZ_CP036273.1_1980675_1981017_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|519aa|up_7|NZ_CP036273.1_1981430_1982987_-	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|249aa|up_6|NZ_CP036273.1_1983060_1983807_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|223aa|up_5|NZ_CP036273.1_1983848_1984517_+	pfam12836, HHH_3, Helix-hairpin-helix motif	NA|225aa|up_4|NZ_CP036273.1_1984582_1985257_+	NA	NA|334aa|up_3|NZ_CP036273.1_1985743_1986745_+	sd00039, 7WD40, WD40 repeats in seven bladed beta propellers	NA|465aa|up_2|NZ_CP036273.1_1986728_1988123_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|633aa|up_1|NZ_CP036273.1_1988253_1990152_-	smart00752, HTTM, Horizontally Transferred TransMembrane Domain	NA|426aa|up_0|NZ_CP036273.1_1990148_1991426_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|436aa|down_0|NZ_CP036273.1_1992413_1993721_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|393aa|down_1|NZ_CP036273.1_1993867_1995046_-	NA	NA|2631aa|down_2|NZ_CP036273.1_1995044_2002937_+	cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain in Subtilisin-like proteins	NA|319aa|down_3|NZ_CP036273.1_2003637_2004594_-	cd07984, LPLAT_LABLAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: LABLAT-like	NA|258aa|down_4|NZ_CP036273.1_2004649_2005423_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|98aa|down_5|NZ_CP036273.1_2005392_2005686_-	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|111aa|down_6|NZ_CP036273.1_2005695_2006028_-	NA	NA|340aa|down_7|NZ_CP036273.1_2006165_2007185_+	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|297aa|down_8|NZ_CP036273.1_2007186_2008077_-	pfam06283, ThuA, Trehalose utilisation	NA|356aa|down_9|NZ_CP036273.1_2008061_2009129_-	pfam05448, AXE1, Acetyl xylan esterase (AXE1)
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	4	3112441-3112550	4	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	CCGGTCGCCGGCCCGGTGGCACC	23	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|69aa|up_0|NZ_CP036273.1_3111839_3112046_-,NA|191aa|down_0|NZ_CP036273.1_3112608_3113181_+	NA|280aa|up_9|NZ_CP036273.1_3098415_3099255_-	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|759aa|up_8|NZ_CP036273.1_3099251_3101528_-	TIGR01701, Hypothetical_protein_Rv2900c/MT2968/Mb2924c	NA|274aa|up_7|NZ_CP036273.1_3101613_3102435_+	pfam02517, Abi, CAAX protease self-immunity	NA|769aa|up_6|NZ_CP036273.1_3102677_3104984_+	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|436aa|up_5|NZ_CP036273.1_3105013_3106321_+	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|564aa|up_4|NZ_CP036273.1_3106378_3108070_-	pfam13472, Lipase_GDSL_2, GDSL-like Lipase/Acylhydrolase family	NA|127aa|up_3|NZ_CP036273.1_3108315_3108696_+	cd11537, NTP-PPase_RS21-C6_like, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|757aa|up_2|NZ_CP036273.1_3108742_3111013_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|169aa|up_1|NZ_CP036273.1_3111299_3111806_+	TIGR01575, rimI, ribosomal-protein-alanine acetyltransferase	NA|69aa|up_0|NZ_CP036273.1_3111839_3112046_-	NA	NA|191aa|down_0|NZ_CP036273.1_3112608_3113181_+	NA	NA|422aa|down_1|NZ_CP036273.1_3113231_3114497_-	COG2379, GckA, Putative glycerate kinase [Carbohydrate transport and metabolism]	NA|299aa|down_2|NZ_CP036273.1_3114502_3115399_-	pfam06283, ThuA, Trehalose utilisation	NA|269aa|down_3|NZ_CP036273.1_3115472_3116279_-	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|438aa|down_4|NZ_CP036273.1_3116387_3117701_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|345aa|down_5|NZ_CP036273.1_3117945_3118980_-	TIGR02800, Protein_TolB, tol-pal system beta propeller repeat protein TolB	NA|101aa|down_6|NZ_CP036273.1_3119057_3119360_+	pfam01844, HNH, HNH endonuclease	NA|259aa|down_7|NZ_CP036273.1_3119362_3120139_-	pfam13197, DUF4013, Protein of unknown function (DUF4013)	NA|292aa|down_8|NZ_CP036273.1_3120169_3121045_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|456aa|down_9|NZ_CP036273.1_3121202_3122570_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	5	3824487-3824725	1	PILER-CR	no		DEDDh,csa3,RT,DinG,cas3	Orphan	TGCCGTACACGACGAAGCGGCCGGTGTACGAGCAGCACGTCCGCGAG	47	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|75aa|up_1|NZ_CP036273.1_3822732_3822957_+,NA|285aa|up_0|NZ_CP036273.1_3822960_3823815_-,NA|256aa|down_2|NZ_CP036273.1_3828071_3828839_+,NA|69aa|down_6|NZ_CP036273.1_3833141_3833348_-	NA|192aa|up_9|NZ_CP036273.1_3814911_3815487_+	COG0386, BtuE, Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones]	NA|190aa|up_8|NZ_CP036273.1_3815557_3816127_+	pfam13023, HD_3, HD domain	NA|294aa|up_7|NZ_CP036273.1_3816126_3817008_+	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|429aa|up_6|NZ_CP036273.1_3817026_3818313_+	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|455aa|up_5|NZ_CP036273.1_3818330_3819695_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|294aa|up_4|NZ_CP036273.1_3819695_3820577_-	cd08411, PBP2_OxyR, The C-terminal substrate-binding domain of the LysR-type transcriptional regulator OxyR, a member of the type 2 periplasmic binding fold protein superfamily	NA|461aa|up_3|NZ_CP036273.1_3820646_3822029_+	PRK04311, PRK04311, selenocysteine synthase; Provisional	NA|194aa|up_2|NZ_CP036273.1_3821974_3822556_-	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|75aa|up_1|NZ_CP036273.1_3822732_3822957_+	NA	NA|285aa|up_0|NZ_CP036273.1_3822960_3823815_-	NA	NA|273aa|down_0|NZ_CP036273.1_3826157_3826976_+	cd13634, PBP2_Sco4506, The conserved hypothetical protein SCO4506 exhibits the type 2 periplasmic-binidng protein fold	NA|373aa|down_1|NZ_CP036273.1_3826935_3828054_+	TIGR03699, menaquin_MqnC, dehypoxanthine futalosine cyclase	NA|256aa|down_2|NZ_CP036273.1_3828071_3828839_+	NA	NA|243aa|down_3|NZ_CP036273.1_3828835_3829564_+	cd03266, ABC_NatA_sodium_exporter, ATP-binding cassette domain of the Na+ transporter	NA|472aa|down_4|NZ_CP036273.1_3829567_3830983_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|712aa|down_5|NZ_CP036273.1_3830996_3833132_-	TIGR00376, DNA-binding_protein_SMUBP-2, DNA helicase, putative	NA|69aa|down_6|NZ_CP036273.1_3833141_3833348_-	NA	NA|367aa|down_7|NZ_CP036273.1_3833407_3834508_-	pfam01784, NIF3, NIF3 (NGG1p interacting factor 3)	NA|274aa|down_8|NZ_CP036273.1_3834598_3835420_-	PRK00216, ubiE, bifunctional demethylmenaquinone methyltransferase/2-methoxy-6-polyprenyl-1,4-benzoquinol methylase UbiE	NA|133aa|down_9|NZ_CP036273.1_3835523_3835922_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	6	4563259-4563599	5	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	GAGCGGCATCCCCTTCAGCGGCG	23	0	0	NA	NA	NA	5	5	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|78aa|up_8|NZ_CP036273.1_4545204_4545438_-,NA|165aa|up_6|NZ_CP036273.1_4547636_4548131_+,NA|96aa|up_5|NZ_CP036273.1_4548310_4548598_-,NA|135aa|up_3|NZ_CP036273.1_4549451_4549856_+,NA|93aa|down_8|NZ_CP036273.1_4585702_4585981_+,NA|91aa|down_9|NZ_CP036273.1_4587347_4587620_-	NA|468aa|up_9|NZ_CP036273.1_4543806_4545210_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|78aa|up_8|NZ_CP036273.1_4545204_4545438_-	NA	NA|355aa|up_7|NZ_CP036273.1_4546481_4547546_-	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family	NA|165aa|up_6|NZ_CP036273.1_4547636_4548131_+	NA	NA|96aa|up_5|NZ_CP036273.1_4548310_4548598_-	NA	NA|220aa|up_4|NZ_CP036273.1_4548829_4549489_+	cd09163, PLDc_CLS_unchar2_2, Putative catalytic domain, repeat 2, of uncharacterized proteins similar to bacterial cardiolipin synthase	NA|135aa|up_3|NZ_CP036273.1_4549451_4549856_+	NA	NA|95aa|up_2|NZ_CP036273.1_4550744_4551029_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|494aa|up_1|NZ_CP036273.1_4551122_4552604_+	PRK09441, PRK09441, cytoplasmic alpha-amylase; Reviewed	NA|3454aa|up_0|NZ_CP036273.1_4552749_4563111_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|192aa|down_0|NZ_CP036273.1_4568481_4569057_+	TIGR02999, Sig-70_X6, RNA polymerase sigma factor, TIGR02999 family	NA|914aa|down_1|NZ_CP036273.1_4569155_4571897_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|844aa|down_2|NZ_CP036273.1_4571926_4574458_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|1082aa|down_3|NZ_CP036273.1_4574462_4577708_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|465aa|down_4|NZ_CP036273.1_4578059_4579454_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|416aa|down_5|NZ_CP036273.1_4579835_4581083_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|773aa|down_6|NZ_CP036273.1_4581291_4583610_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|465aa|down_7|NZ_CP036273.1_4584072_4585467_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|93aa|down_8|NZ_CP036273.1_4585702_4585981_+	NA	NA|91aa|down_9|NZ_CP036273.1_4587347_4587620_-	NA
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	7	5348354-5348555	6	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	CGACGTGGGTCGAGAAGGAAGTCACCGTGAGCAAGTGCGTCCCGGT	46	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|244aa|up_9|NZ_CP036273.1_5336186_5336918_-,NA|230aa|up_8|NZ_CP036273.1_5337281_5337971_-,NA|100aa|up_4|NZ_CP036273.1_5340635_5340935_-,NA|493aa|down_1|NZ_CP036273.1_5349973_5351452_+	NA|244aa|up_9|NZ_CP036273.1_5336186_5336918_-	NA	NA|230aa|up_8|NZ_CP036273.1_5337281_5337971_-	NA	NA|352aa|up_7|NZ_CP036273.1_5338391_5339447_-	pfam04371, PAD_porph, Porphyromonas-type peptidyl-arginine deiminase	NA|155aa|up_6|NZ_CP036273.1_5339557_5340022_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|151aa|up_5|NZ_CP036273.1_5340115_5340568_+	pfam09424, YqeY, Yqey-like protein	NA|100aa|up_4|NZ_CP036273.1_5340635_5340935_-	NA	NA|624aa|up_3|NZ_CP036273.1_5341206_5343078_-	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|448aa|up_2|NZ_CP036273.1_5343240_5344584_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|584aa|up_1|NZ_CP036273.1_5344687_5346439_+	cd16146, ARS_like, uncharacterized arylsulfatase	NA|247aa|up_0|NZ_CP036273.1_5346491_5347232_-	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|395aa|down_0|NZ_CP036273.1_5348792_5349977_+	pfam09594, GT87, Glycosyltransferase family 87	NA|493aa|down_1|NZ_CP036273.1_5349973_5351452_+	NA	NA|283aa|down_2|NZ_CP036273.1_5351551_5352400_-	COG3118, COG3118, Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]	NA|709aa|down_3|NZ_CP036273.1_5352443_5354570_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|491aa|down_4|NZ_CP036273.1_5354726_5356199_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|54aa|down_5|NZ_CP036273.1_5356280_5356442_+	TIGR02574, hypothetical_protein, putative addiction module component, TIGR02574 family	NA|387aa|down_6|NZ_CP036273.1_5356956_5358117_+	cd02892, SQCY_1, Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold	NA|391aa|down_7|NZ_CP036273.1_5358207_5359380_+	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|235aa|down_8|NZ_CP036273.1_5359411_5360116_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|405aa|down_9|NZ_CP036273.1_5360274_5361489_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	8	6974723-6974838	7	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	TGCATGTTAAGTGTGCCGAACCTCTGAAA	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|144aa|up_8|NZ_CP036273.1_6966023_6966455_+,NA|403aa|up_7|NZ_CP036273.1_6966521_6967730_+,NA|355aa|up_6|NZ_CP036273.1_6967748_6968813_+,NA|107aa|up_5|NZ_CP036273.1_6969071_6969392_+,NA|180aa|up_4|NZ_CP036273.1_6969542_6970082_+,NA|100aa|up_3|NZ_CP036273.1_6970119_6970419_+,NA|247aa|up_2|NZ_CP036273.1_6970494_6971235_+,NA|67aa|down_1|NZ_CP036273.1_6976258_6976459_+,NA|133aa|down_2|NZ_CP036273.1_6976533_6976932_+,NA|93aa|down_3|NZ_CP036273.1_6976928_6977207_+,NA|156aa|down_4|NZ_CP036273.1_6977203_6977671_+,NA|72aa|down_5|NZ_CP036273.1_6977667_6977883_+,NA|93aa|down_6|NZ_CP036273.1_6977879_6978158_+,NA|194aa|down_7|NZ_CP036273.1_6978154_6978736_+,NA|424aa|down_8|NZ_CP036273.1_6978821_6980093_-	NA|530aa|up_9|NZ_CP036273.1_6964437_6966027_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|144aa|up_8|NZ_CP036273.1_6966023_6966455_+	NA	NA|403aa|up_7|NZ_CP036273.1_6966521_6967730_+	NA	NA|355aa|up_6|NZ_CP036273.1_6967748_6968813_+	NA	NA|107aa|up_5|NZ_CP036273.1_6969071_6969392_+	NA	NA|180aa|up_4|NZ_CP036273.1_6969542_6970082_+	NA	NA|100aa|up_3|NZ_CP036273.1_6970119_6970419_+	NA	NA|247aa|up_2|NZ_CP036273.1_6970494_6971235_+	NA	NA|687aa|up_1|NZ_CP036273.1_6971231_6973292_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|353aa|up_0|NZ_CP036273.1_6973319_6974378_-	COG3292, COG3292, Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]	NA|417aa|down_0|NZ_CP036273.1_6974983_6976234_+	cd16402, ParB_N_like_MT, ParB N-terminal-like domain, some attached to C-terminal S-adenosylmethionine-dependent methyltransferase domain	NA|67aa|down_1|NZ_CP036273.1_6976258_6976459_+	NA	NA|133aa|down_2|NZ_CP036273.1_6976533_6976932_+	NA	NA|93aa|down_3|NZ_CP036273.1_6976928_6977207_+	NA	NA|156aa|down_4|NZ_CP036273.1_6977203_6977671_+	NA	NA|72aa|down_5|NZ_CP036273.1_6977667_6977883_+	NA	NA|93aa|down_6|NZ_CP036273.1_6977879_6978158_+	NA	NA|194aa|down_7|NZ_CP036273.1_6978154_6978736_+	NA	NA|424aa|down_8|NZ_CP036273.1_6978821_6980093_-	NA	NA|669aa|down_9|NZ_CP036273.1_6980131_6982138_-	pfam13191, AAA_16, AAA ATPase domain
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	9	7231850-7231985	2	PILER-CR	no		DEDDh,csa3,RT,DinG,cas3	Orphan	CCGGCCGCGGCGCGGGCGGTCGTCGTAGTCGTCGTCGT	38	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|175aa|up_7|NZ_CP036273.1_7224366_7224891_-,NA|339aa|up_6|NZ_CP036273.1_7224983_7226000_-,NA|140aa|up_5|NZ_CP036273.1_7226032_7226452_-,NA|113aa|up_4|NZ_CP036273.1_7226448_7226787_-,NA|609aa|down_0|NZ_CP036273.1_7232035_7233862_-	NA|312aa|up_9|NZ_CP036273.1_7221962_7222898_+	pfam12710, HAD, haloacid dehalogenase-like hydrolase	NA|374aa|up_8|NZ_CP036273.1_7222934_7224056_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|175aa|up_7|NZ_CP036273.1_7224366_7224891_-	NA	NA|339aa|up_6|NZ_CP036273.1_7224983_7226000_-	NA	NA|140aa|up_5|NZ_CP036273.1_7226032_7226452_-	NA	NA|113aa|up_4|NZ_CP036273.1_7226448_7226787_-	NA	NA|383aa|up_3|NZ_CP036273.1_7226804_7227953_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|400aa|up_2|NZ_CP036273.1_7228470_7229670_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|422aa|up_1|NZ_CP036273.1_7229835_7231101_+	COG0402, SsnA, Cytosine deaminase and related metal-dependent hydrolases [Nucleotide transport and metabolism / General function prediction only]	NA|232aa|up_0|NZ_CP036273.1_7231152_7231848_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|609aa|down_0|NZ_CP036273.1_7232035_7233862_-	NA	NA|431aa|down_1|NZ_CP036273.1_7234701_7235994_+	TIGR03097, PEP_O_lig_1, probable O-glycosylation ligase, exosortase A-associated	NA|276aa|down_2|NZ_CP036273.1_7236000_7236828_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|222aa|down_3|NZ_CP036273.1_7236846_7237512_+	pfam13578, Methyltransf_24, Methyltransferase domain	NA|374aa|down_4|NZ_CP036273.1_7237508_7238630_+	cd03807, GT4_WbnK-like, Shigella dysenteriae WbnK and similar proteins	NA|216aa|down_5|NZ_CP036273.1_7238587_7239235_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|380aa|down_6|NZ_CP036273.1_7239227_7240367_+	cd04950, GT4_TuaH-like, teichuronic acid biosynthesis glycosyltransferase TuaH and similar proteins	NA|387aa|down_7|NZ_CP036273.1_7240371_7241532_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|217aa|down_8|NZ_CP036273.1_7241580_7242231_-	pfam03966, Trm112p, Trm112p-like protein	NA|898aa|down_9|NZ_CP036273.1_7242330_7245024_-	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	10	7240037-7240110	8	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	GGTGCCCTACGCCGACCTGCCGG	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|609aa|up_5|NZ_CP036273.1_7232035_7233862_-,NA|93aa|down_3|NZ_CP036273.1_7245121_7245400_-,NA|91aa|down_5|NZ_CP036273.1_7246886_7247159_-	NA|383aa|up_9|NZ_CP036273.1_7226804_7227953_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|400aa|up_8|NZ_CP036273.1_7228470_7229670_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|422aa|up_7|NZ_CP036273.1_7229835_7231101_+	COG0402, SsnA, Cytosine deaminase and related metal-dependent hydrolases [Nucleotide transport and metabolism / General function prediction only]	NA|232aa|up_6|NZ_CP036273.1_7231152_7231848_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|609aa|up_5|NZ_CP036273.1_7232035_7233862_-	NA	NA|431aa|up_4|NZ_CP036273.1_7234701_7235994_+	TIGR03097, PEP_O_lig_1, probable O-glycosylation ligase, exosortase A-associated	NA|276aa|up_3|NZ_CP036273.1_7236000_7236828_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|222aa|up_2|NZ_CP036273.1_7236846_7237512_+	pfam13578, Methyltransf_24, Methyltransferase domain	NA|374aa|up_1|NZ_CP036273.1_7237508_7238630_+	cd03807, GT4_WbnK-like, Shigella dysenteriae WbnK and similar proteins	NA|216aa|up_0|NZ_CP036273.1_7238587_7239235_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|387aa|down_0|NZ_CP036273.1_7240371_7241532_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|217aa|down_1|NZ_CP036273.1_7241580_7242231_-	pfam03966, Trm112p, Trm112p-like protein	NA|898aa|down_2|NZ_CP036273.1_7242330_7245024_-	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|93aa|down_3|NZ_CP036273.1_7245121_7245400_-	NA	NA|465aa|down_4|NZ_CP036273.1_7245468_7246863_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|91aa|down_5|NZ_CP036273.1_7246886_7247159_-	NA	NA|542aa|down_6|NZ_CP036273.1_7247186_7248812_-	PRK00179, pgi, glucose-6-phosphate isomerase; Reviewed	NA|299aa|down_7|NZ_CP036273.1_7248833_7249730_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|596aa|down_8|NZ_CP036273.1_7249762_7251550_-	cd16144, ARS_like, uncharacterized arylsulfatase subfamily	NA|253aa|down_9|NZ_CP036273.1_7251704_7252463_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated
GCF_007747215.1_ASM774721v1	NZ_CP036273	Planctomycetes bacterium ETA_A1 chromosome, complete genome	11	7673500-7673606	9	CRISPRCasFinder	no		DEDDh,csa3,RT,DinG,cas3	Orphan	GCAACGCCCGCTACAGGTCGCTGATGGTGCGGTT	34	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,RT,DinG,cas3	NA|111aa|up_1|NZ_CP036273.1_7672432_7672765_-,NA|303aa|down_2|NZ_CP036273.1_7675239_7676148_+,NA|320aa|down_4|NZ_CP036273.1_7677458_7678418_+	NA|374aa|up_9|NZ_CP036273.1_7605882_7607004_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|330aa|up_8|NZ_CP036273.1_7607057_7608047_+	cd02197, HypE, HypE (Hydrogenase expression/formation protein)	NA|134aa|up_7|NZ_CP036273.1_7608112_7608514_-	PRK00955, PRK00955, YgiQ family radical SAM protein	NA|416aa|up_6|NZ_CP036273.1_7608738_7609986_-	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|2282aa|up_5|NZ_CP036273.1_7610250_7617096_+	cd07498, Peptidases_S8_15, Peptidase S8 family domain, uncharacterized subfamily 15	NA|1980aa|up_4|NZ_CP036273.1_7618112_7624052_+	TIGR00864, PCC, polycystin cation channel protein	NA|15718aa|up_3|NZ_CP036273.1_7624606_7671760_+	PRK12688, PRK12688, flagellin; Reviewed	NA|154aa|up_2|NZ_CP036273.1_7671963_7672425_+	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|111aa|up_1|NZ_CP036273.1_7672432_7672765_-	NA	NA|119aa|up_0|NZ_CP036273.1_7672993_7673350_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|152aa|down_0|NZ_CP036273.1_7673656_7674112_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|213aa|down_1|NZ_CP036273.1_7674121_7674760_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|303aa|down_2|NZ_CP036273.1_7675239_7676148_+	NA	NA|399aa|down_3|NZ_CP036273.1_7676265_7677462_+	COG2852, COG2852, Very-short-patch-repair endonuclease [Replication, recombination,    and repair]	NA|320aa|down_4|NZ_CP036273.1_7677458_7678418_+	NA	NA|1048aa|down_5|NZ_CP036273.1_7678414_7681558_+	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|149aa|down_6|NZ_CP036273.1_7681587_7682034_-	PRK00955, PRK00955, YgiQ family radical SAM protein	NA|782aa|down_7|NZ_CP036273.1_7682055_7684401_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|126aa|down_8|NZ_CP036273.1_7684412_7684790_-	pfam07929, PRiA4_ORF3, Plasmid pRiA4b ORF-3-like protein	NA|128aa|down_9|NZ_CP036273.1_7684782_7685166_-	PRK00215, PRK00215, transcriptional repressor LexA
