assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007741515.1_ASM774151v1	NZ_CP036263	Planctomycetes bacterium HG15A2 chromosome, complete genome	1	924259-924401	1	CRISPRCasFinder	no		cas3,csa3,RT,DinG	Orphan	GGGTTCTCAGAAAAGTTTGGCTTGTCCCAATAAATCAGTGGGC	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,RT,DinG	NA|173aa|up_8|NZ_CP036263.1_914006_914525_-,NA|271aa|up_6|NZ_CP036263.1_915709_916522_+,NA	NA|143aa|up_9|NZ_CP036263.1_913429_913858_+	pfam04134, DUF393, Protein of unknown function, DUF393	NA|173aa|up_8|NZ_CP036263.1_914006_914525_-	NA	NA|278aa|up_7|NZ_CP036263.1_914741_915575_-	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed	NA|271aa|up_6|NZ_CP036263.1_915709_916522_+	NA	NA|138aa|up_5|NZ_CP036263.1_916591_917005_+	cd06154, YjgF_YER057c_UK114_like_6, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|283aa|up_4|NZ_CP036263.1_917062_917911_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|340aa|up_3|NZ_CP036263.1_918293_919313_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|171aa|up_2|NZ_CP036263.1_919441_919954_-	cd01055, Nonheme_Ferritin, nonheme-containing ferritins	NA|455aa|up_1|NZ_CP036263.1_920238_921603_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|648aa|up_0|NZ_CP036263.1_921953_923897_+	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|388aa|down_0|NZ_CP036263.1_924643_925807_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|446aa|down_1|NZ_CP036263.1_926201_927539_-	cd06114, EcCS_like, Escherichia coli (Ec) citrate synthase (CS) GltA_like	NA|586aa|down_2|NZ_CP036263.1_927873_929631_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|511aa|down_3|NZ_CP036263.1_929873_931406_+	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|359aa|down_4|NZ_CP036263.1_931520_932597_-	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|494aa|down_5|NZ_CP036263.1_932685_934167_-	pfam02233, PNTB, NAD(P) transhydrogenase beta subunit	NA|89aa|down_6|NZ_CP036263.1_934302_934569_-	pfam12769, PNTB_4TM, 4TM region of pyridine nucleotide transhydrogenase, mitoch	NA|410aa|down_7|NZ_CP036263.1_934565_935795_-	cd05304, Rubrum_tdh, Rubrum transdehydrogenase NAD-binding and catalytic domains	NA|733aa|down_8|NZ_CP036263.1_935888_938087_-	pfam06943, zf-LSD1, LSD1 zinc finger	NA|509aa|down_9|NZ_CP036263.1_938280_939807_-	cd06460, M32_Taq, Peptidase family M32, which includes thermostable carboxypeptidases TaqCP, PfuCP and FisCP
GCF_007741515.1_ASM774151v1	NZ_CP036263	Planctomycetes bacterium HG15A2 chromosome, complete genome	2	1332387-1332758	1	CRT	no		cas3,csa3,RT,DinG	Orphan	GGTTTCCCANACNGGCTT	18	0	0	NA	NA	NA	8	8	Orphan	cas3,csa3,RT,DinG	NA|253aa|up_6|NZ_CP036263.1_1320597_1321356_-,NA	NA|120aa|up_9|NZ_CP036263.1_1318385_1318745_+	pfam11950, DUF3467, Protein of unknown function (DUF3467)	NA|313aa|up_8|NZ_CP036263.1_1318876_1319815_+	PRK13386, fliH, flagellar assembly protein H; Provisional	NA|117aa|up_7|NZ_CP036263.1_1319947_1320298_-	sd00006, TPR, Tetratricopeptide repeat	NA|253aa|up_6|NZ_CP036263.1_1320597_1321356_-	NA	NA|284aa|up_5|NZ_CP036263.1_1321734_1322586_+	PRK05289, PRK05289, acyl-ACP--UDP-N-acetylglucosamine O-acyltransferase	NA|653aa|up_4|NZ_CP036263.1_1322824_1324783_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|896aa|up_3|NZ_CP036263.1_1325206_1327894_-	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	NA|191aa|up_2|NZ_CP036263.1_1328417_1328990_+	pfam02517, Abi, CAAX protease self-immunity	NA|129aa|up_1|NZ_CP036263.1_1329054_1329441_+	cd08349, BLMA_like, Bleomycin binding protein (BLMA) and similar proteins	NA|359aa|up_0|NZ_CP036263.1_1330190_1331267_+	pfam00459, Inositol_P, Inositol monophosphatase family	NA|236aa|down_0|NZ_CP036263.1_1333500_1334208_+	PRK05472, PRK05472, redox-sensing transcriptional repressor Rex; Provisional	NA|332aa|down_1|NZ_CP036263.1_1334446_1335442_+	TIGR01330, 3'2'5'-bisphosphate_nucleotidase, 3'(2'),5'-bisphosphate nucleotidase, HAL2 family	NA|272aa|down_2|NZ_CP036263.1_1335662_1336478_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|289aa|down_3|NZ_CP036263.1_1336599_1337466_+	PRK09377, tsf, elongation factor Ts; Provisional	NA|246aa|down_4|NZ_CP036263.1_1337631_1338369_+	cd04254, AAK_UMPK-PyrH-Ec, UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway)	NA|187aa|down_5|NZ_CP036263.1_1338650_1339211_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|72aa|down_6|NZ_CP036263.1_1339309_1339525_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|647aa|down_7|NZ_CP036263.1_1339885_1341826_+	pfam00639, Rotamase, PPIC-type PPIASE domain	NA|419aa|down_8|NZ_CP036263.1_1342094_1343351_+	PRK08175, PRK08175, aminotransferase; Validated	NA|145aa|down_9|NZ_CP036263.1_1343597_1344032_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]
GCF_007741515.1_ASM774151v1	NZ_CP036263	Planctomycetes bacterium HG15A2 chromosome, complete genome	3	1748453-1748888	2	CRISPRCasFinder	no		cas3,csa3,RT,DinG	Orphan	ACGCTCACCAACAGCACGGTCAGCGGAAACT	31	0	0	NA	NA	NA	5	5	Orphan	cas3,csa3,RT,DinG	NA,NA|117aa|down_1|NZ_CP036263.1_1753579_1753930_-,NA|118aa|down_2|NZ_CP036263.1_1753954_1754308_-,NA|604aa|down_7|NZ_CP036263.1_1761033_1762845_+,NA|555aa|down_8|NZ_CP036263.1_1762940_1764605_+	NA|306aa|up_9|NZ_CP036263.1_1733113_1734031_-	pfam08378, NERD, Nuclease-related domain	NA|585aa|up_8|NZ_CP036263.1_1734306_1736061_-	cd16146, ARS_like, uncharacterized arylsulfatase	NA|697aa|up_7|NZ_CP036263.1_1736402_1738493_-	cd16144, ARS_like, uncharacterized arylsulfatase subfamily	NA|235aa|up_6|NZ_CP036263.1_1738489_1739194_-	cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands	NA|354aa|up_5|NZ_CP036263.1_1739216_1740278_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|540aa|up_4|NZ_CP036263.1_1740529_1742149_-	pfam04773, FecR, FecR protein	NA|194aa|up_3|NZ_CP036263.1_1742162_1742744_-	TIGR02989, Sig-70_gvs1, RNA polymerase sigma-70 factor, Rhodopirellula/Verrucomicrobium family	NA|493aa|up_2|NZ_CP036263.1_1743311_1744790_+	cd14254, Dockerin_II, Type II dockerin repeat domain	NA|235aa|up_1|NZ_CP036263.1_1745043_1745748_+	TIGR02595, conserved_hypothetical_protein, PEP-CTERM protein-sorting domain	NA|614aa|up_0|NZ_CP036263.1_1745844_1747686_+	pfam03629, SASA, Carbohydrate esterase, sialic acid-specific acetylesterase	NA|632aa|down_0|NZ_CP036263.1_1750919_1752815_+	cd16027, SGSH, N-sulfoglucosamine sulfohydrolase (SGSH; sulfamidase)	NA|117aa|down_1|NZ_CP036263.1_1753579_1753930_-	NA	NA|118aa|down_2|NZ_CP036263.1_1753954_1754308_-	NA	NA|111aa|down_3|NZ_CP036263.1_1755138_1755471_-	cd08637, DNA_pol_A_pol_I_C, Polymerase I functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication	NA|565aa|down_4|NZ_CP036263.1_1755643_1757338_+	COG1785, PhoA, Alkaline phosphatase [Inorganic ion transport and metabolism]	NA|106aa|down_5|NZ_CP036263.1_1758496_1758814_-	pfam08681, DUF1778, Protein of unknown function (DUF1778)	NA|375aa|down_6|NZ_CP036263.1_1759788_1760913_+	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|604aa|down_7|NZ_CP036263.1_1761033_1762845_+	NA	NA|555aa|down_8|NZ_CP036263.1_1762940_1764605_+	NA	NA|314aa|down_9|NZ_CP036263.1_1764930_1765872_+	pfam09264, Sial-lect-inser, Vibrio cholerae sialidase, lectin insertion
GCF_007741515.1_ASM774151v1	NZ_CP036263	Planctomycetes bacterium HG15A2 chromosome, complete genome	4	1961713-1961813	3	CRISPRCasFinder	no		cas3,csa3,RT,DinG	Orphan	CCGCCGCCACCGAAACCACCGCC	23	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,RT,DinG	NA|169aa|up_0|NZ_CP036263.1_1961180_1961687_-,NA|423aa|down_2|NZ_CP036263.1_1965518_1966787_+,NA|318aa|down_6|NZ_CP036263.1_1968712_1969666_+,NA|171aa|down_8|NZ_CP036263.1_1970808_1971321_-	NA|726aa|up_9|NZ_CP036263.1_1947999_1950177_+	COG1344, FlgL, Flagellin and related hook-associated proteins [Cell motility and secretion]	NA|1039aa|up_8|NZ_CP036263.1_1950519_1953636_+	COG1345, FliD, Flagellar capping protein [Cell motility and secretion]	NA|166aa|up_7|NZ_CP036263.1_1953702_1954200_+	pfam02561, FliS, Flagellar protein FliS	NA|333aa|up_6|NZ_CP036263.1_1954306_1955305_+	cd13636, PBP2_Af1704, The conserved hypothetical protein Af1704 exhibits the type 2 periplasmic-binding protein fold	NA|407aa|up_5|NZ_CP036263.1_1955390_1956611_+	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|377aa|up_4|NZ_CP036263.1_1956665_1957796_+	PRK13517, PRK13517, glutamate--cysteine ligase	NA|193aa|up_3|NZ_CP036263.1_1957817_1958396_-	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|108aa|up_2|NZ_CP036263.1_1958751_1959075_+	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|582aa|up_1|NZ_CP036263.1_1959344_1961090_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|169aa|up_0|NZ_CP036263.1_1961180_1961687_-	NA	NA|728aa|down_0|NZ_CP036263.1_1962459_1964643_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|259aa|down_1|NZ_CP036263.1_1964745_1965522_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|423aa|down_2|NZ_CP036263.1_1965518_1966787_+	NA	NA|177aa|down_3|NZ_CP036263.1_1967031_1967562_+	COG4970, FimT, Tfp pilus assembly protein FimT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|143aa|down_4|NZ_CP036263.1_1967618_1968047_+	COG4795, PulJ, Type II secretory pathway, component PulJ [Intracellular trafficking and secretion]	NA|220aa|down_5|NZ_CP036263.1_1968036_1968696_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|318aa|down_6|NZ_CP036263.1_1968712_1969666_+	NA	NA|325aa|down_7|NZ_CP036263.1_1969809_1970784_+	pfam06283, ThuA, Trehalose utilisation	NA|171aa|down_8|NZ_CP036263.1_1970808_1971321_-	NA	NA|1059aa|down_9|NZ_CP036263.1_1971399_1974576_-	COG3696, COG3696, Putative silver efflux pump [Inorganic ion transport and metabolism]
GCF_007741515.1_ASM774151v1	NZ_CP036263	Planctomycetes bacterium HG15A2 chromosome, complete genome	5	5623400-5623500	4	CRISPRCasFinder	no		cas3,csa3,RT,DinG	Orphan	CCGTAGGGTACTGCCGAGCTCAGCGAGGCATACC	34	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,RT,DinG	NA|124aa|up_9|NZ_CP036263.1_5610286_5610658_-,NA|71aa|up_8|NZ_CP036263.1_5610800_5611013_-,NA|256aa|up_3|NZ_CP036263.1_5618740_5619508_+,NA	NA|124aa|up_9|NZ_CP036263.1_5610286_5610658_-	NA	NA|71aa|up_8|NZ_CP036263.1_5610800_5611013_-	NA	NA|187aa|up_7|NZ_CP036263.1_5611162_5611723_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|282aa|up_6|NZ_CP036263.1_5612105_5612951_-	COG3258, COG3258, Cytochrome c [Energy production and conversion]	NA|558aa|up_5|NZ_CP036263.1_5613187_5614861_-	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|1034aa|up_4|NZ_CP036263.1_5614980_5618082_-	PRK06556, PRK06556, vitamin B12-dependent ribonucleotide reductase; Validated	NA|256aa|up_3|NZ_CP036263.1_5618740_5619508_+	NA	NA|266aa|up_2|NZ_CP036263.1_5619917_5620715_+	pfam00072, Response_reg, Response regulator receiver domain	NA|528aa|up_1|NZ_CP036263.1_5620801_5622385_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|185aa|up_0|NZ_CP036263.1_5622798_5623353_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|1041aa|down_0|NZ_CP036263.1_5623705_5626828_-	cd07341, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|133aa|down_1|NZ_CP036263.1_5626824_5627223_-	pfam03965, Penicillinase_R, Penicillinase repressor	NA|938aa|down_2|NZ_CP036263.1_5627678_5630492_+	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|130aa|down_3|NZ_CP036263.1_5630613_5631003_+	pfam05973, Gp49, Phage derived protein Gp49-like (DUF891)	NA|105aa|down_4|NZ_CP036263.1_5630995_5631310_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|327aa|down_5|NZ_CP036263.1_5631382_5632363_-	TIGR02800, Protein_TolB, tol-pal system beta propeller repeat protein TolB	NA|396aa|down_6|NZ_CP036263.1_5632869_5634057_-	PRK05958, PRK05958, 8-amino-7-oxononanoate synthase; Reviewed	NA|236aa|down_7|NZ_CP036263.1_5634544_5635252_+	pfam16156, DUF4864, Domain of unknown function (DUF4864)	NA|215aa|down_8|NZ_CP036263.1_5635248_5635893_+	PRK12519, PRK12519, RNA polymerase sigma factor; Provisional	NA|309aa|down_9|NZ_CP036263.1_5635918_5636845_+	pfam13490, zf-HC2, Putative zinc-finger
