assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000218855.1_ASM21885v1	NC_015687	Clostridium acetobutylicum DSM 1731, complete sequence	1	2035553-2035640	1	CRISPRCasFinder	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	TTCTCTGCTTCTGCTTTTGTATTAT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|59aa|up_9|NC_015687.1_2022243_2022420_-,NA|345aa|up_8|NC_015687.1_2022434_2023469_-,NA|89aa|up_4|NC_015687.1_2025613_2025880_-,NA|125aa|up_2|NC_015687.1_2030152_2030527_-,NA|89aa|up_1|NC_015687.1_2030513_2030780_-,NA|403aa|up_0|NC_015687.1_2030798_2032007_-,NA|110aa|down_0|NC_015687.1_2037683_2038013_-,NA|108aa|down_2|NC_015687.1_2039031_2039355_-	NA|59aa|up_9|NC_015687.1_2022243_2022420_-	NA	NA|345aa|up_8|NC_015687.1_2022434_2023469_-	NA	NA|188aa|up_7|NC_015687.1_2023688_2024252_-	smart00338, BRLZ, basic region leucin zipper	NA|352aa|up_6|NC_015687.1_2024271_2025327_-	cd06525, GH25_Lyc-like, Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene	NA|86aa|up_5|NC_015687.1_2025344_2025602_-	pfam10779, XhlA, Haemolysin XhlA	NA|89aa|up_4|NC_015687.1_2025613_2025880_-	NA	NA|1404aa|up_3|NC_015687.1_2025909_2030121_-	TIGR01665, structural_protein, phage minor structural protein, N-terminal region	NA|125aa|up_2|NC_015687.1_2030152_2030527_-	NA	NA|89aa|up_1|NC_015687.1_2030513_2030780_-	NA	NA|403aa|up_0|NC_015687.1_2030798_2032007_-	NA	NA|110aa|down_0|NC_015687.1_2037683_2038013_-	NA	NA|307aa|down_1|NC_015687.1_2038066_2038987_-	TIGR01603, Uncharacterized_phage_related_protein, phage major tail protein, phi13 family	NA|108aa|down_2|NC_015687.1_2039031_2039355_-	NA	NA|144aa|down_3|NC_015687.1_2039351_2039783_-	TIGR01725, hypothetical_protein, phage protein, HK97 gp10 family	NA|116aa|down_4|NC_015687.1_2039783_2040131_-	COG5614, COG5614, Bacteriophage head-tail adaptor [General function prediction only]	NA|106aa|down_5|NC_015687.1_2040093_2040411_-	TIGR01560, phage-related_hypothetical_protein, uncharacterized phage protein (possible DNA packaging)	NA|397aa|down_6|NC_015687.1_2040698_2041889_-	TIGR01554, prophage_Lp3_protein_18, phage major capsid protein, HK97 family	NA|257aa|down_7|NC_015687.1_2041890_2042661_-	cd07016, S14_ClpP_1, Caseinolytic protease (ClpP) is an ATP-dependent, highly conserved serine protease	NA|405aa|down_8|NC_015687.1_2042623_2043838_-	COG4695, COG4695, Phage-related protein [Function unknown]	NA|597aa|down_9|NC_015687.1_2043879_2045670_-	COG4626, COG4626, Phage terminase-like protein, large subunit [General function prediction only]
GCF_000218855.1_ASM21885v1	NC_015687	Clostridium acetobutylicum DSM 1731, complete sequence	2	2081965-2082104	2	CRISPRCasFinder	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	TTATGGTTCAAGATTTATAATGGGCTCTTAGCCCAAACCCCAGG	44	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|55aa|up_9|NC_015687.1_2073470_2073635_-,NA|74aa|down_4|NC_015687.1_2088312_2088534_-,NA|166aa|down_5|NC_015687.1_2088549_2089047_-,NA|190aa|down_9|NC_015687.1_2091287_2091857_-	NA|55aa|up_9|NC_015687.1_2073470_2073635_-	NA	NA|59aa|up_8|NC_015687.1_2073662_2073839_-	COG5515, COG5515, Uncharacterized conserved small protein [Function unknown]	NA|532aa|up_7|NC_015687.1_2073954_2075550_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|113aa|up_6|NC_015687.1_2075509_2075848_+	COG1206, Gid, NAD(FAD)-utilizing enzyme possibly involved in translation [Translation, ribosomal structure and biogenesis]	NA|275aa|up_5|NC_015687.1_2075943_2076768_-	cd19157, AKR_AKR5G1-3, AKR5G family of aldo-keto reductase (AKR)	NA|178aa|up_4|NC_015687.1_2076867_2077401_-	pfam06866, DUF1256, Protein of unknown function (DUF1256)	NA|195aa|up_3|NC_015687.1_2077402_2077987_-	pfam06866, DUF1256, Protein of unknown function (DUF1256)	NA|325aa|up_2|NC_015687.1_2078155_2079130_-	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|243aa|up_1|NC_015687.1_2079209_2079938_-	COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|526aa|up_0|NC_015687.1_2080229_2081807_-	cd07410, MPP_CpdB_N, Escherichia coli CpdB and related proteins, N-terminal metallophosphatase domain	NA|159aa|down_0|NC_015687.1_2083106_2083583_-	TIGR01462, Transcription_elongation_factor_GreA, transcription elongation factor GreA	NA|445aa|down_1|NC_015687.1_2083765_2085100_-	pfam13229, Beta_helix, Right handed beta helix region	NA|271aa|down_2|NC_015687.1_2085640_2086453_+	pfam13229, Beta_helix, Right handed beta helix region	NA|592aa|down_3|NC_015687.1_2086516_2088292_-	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|74aa|down_4|NC_015687.1_2088312_2088534_-	NA	NA|166aa|down_5|NC_015687.1_2088549_2089047_-	NA	NA|200aa|down_6|NC_015687.1_2089106_2089706_-	pfam01478, Peptidase_A24, Type IV leader peptidase family	NA|215aa|down_7|NC_015687.1_2089774_2090419_-	PRK14949, PRK14949, DNA polymerase III subunits gamma and tau; Provisional	NA|218aa|down_8|NC_015687.1_2090618_2091272_-	pfam08666, SAF, SAF domain	NA|190aa|down_9|NC_015687.1_2091287_2091857_-	NA
GCF_000218855.1_ASM21885v1	NC_015687	Clostridium acetobutylicum DSM 1731, complete sequence	3	2141220-2141315	3	CRISPRCasFinder	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	CTGTAGCATCAACATAATTACCATCAATATC	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|211aa|up_0|NC_015687.1_2140530_2141163_-,NA|155aa|down_0|NC_015687.1_2141744_2142209_-,NA|81aa|down_1|NC_015687.1_2142374_2142617_-,NA|150aa|down_2|NC_015687.1_2142623_2143073_-,NA|198aa|down_3|NC_015687.1_2143201_2143795_-,NA|63aa|down_4|NC_015687.1_2143812_2144001_-,NA|157aa|down_5|NC_015687.1_2144035_2144506_-,NA|163aa|down_6|NC_015687.1_2144507_2144996_-,NA|220aa|down_7|NC_015687.1_2145451_2146111_-,NA|117aa|down_8|NC_015687.1_2146157_2146508_-,NA|459aa|down_9|NC_015687.1_2146523_2147900_-	NA|104aa|up_9|NC_015687.1_2130172_2130484_-	COG0236, AcpP, Acyl carrier protein [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|685aa|up_8|NC_015687.1_2130520_2132575_-	COG2414, COG2414, Aldehyde:ferredoxin oxidoreductase [Energy production and conversion]	NA|317aa|up_7|NC_015687.1_2132615_2133566_-	COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism]	NA|637aa|up_6|NC_015687.1_2133935_2135846_-	PRK14498, PRK14498, putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein; Provisional	NA|408aa|up_5|NC_015687.1_2135911_2137135_-	COG0303, MoeA, Molybdopterin biosynthesis enzyme [Coenzyme metabolism]	NA|166aa|up_4|NC_015687.1_2137150_2137648_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|181aa|up_3|NC_015687.1_2137688_2138231_-	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|141aa|up_2|NC_015687.1_2138310_2138733_-	cd08865, SRPBCC_10, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|181aa|up_1|NC_015687.1_2138832_2139375_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|211aa|up_0|NC_015687.1_2140530_2141163_-	NA	NA|155aa|down_0|NC_015687.1_2141744_2142209_-	NA	NA|81aa|down_1|NC_015687.1_2142374_2142617_-	NA	NA|150aa|down_2|NC_015687.1_2142623_2143073_-	NA	NA|198aa|down_3|NC_015687.1_2143201_2143795_-	NA	NA|63aa|down_4|NC_015687.1_2143812_2144001_-	NA	NA|157aa|down_5|NC_015687.1_2144035_2144506_-	NA	NA|163aa|down_6|NC_015687.1_2144507_2144996_-	NA	NA|220aa|down_7|NC_015687.1_2145451_2146111_-	NA	NA|117aa|down_8|NC_015687.1_2146157_2146508_-	NA	NA|459aa|down_9|NC_015687.1_2146523_2147900_-	NA
