assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000191905.1_ASM19190v1	NC_017295	Clostridium acetobutylicum EA 2018, complete sequence	1	2033366-2033453	1	CRISPRCasFinder	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	TTCTCTGCTTCTGCTTTTGTATTAT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|59aa|up_9|NC_017295.1_2020056_2020233_-,NA|345aa|up_8|NC_017295.1_2020247_2021282_-,NA|89aa|up_4|NC_017295.1_2023426_2023693_-,NA|125aa|up_2|NC_017295.1_2027965_2028340_-,NA|89aa|up_1|NC_017295.1_2028326_2028593_-,NA|403aa|up_0|NC_017295.1_2028611_2029820_-,NA|110aa|down_0|NC_017295.1_2035496_2035826_-,NA|108aa|down_2|NC_017295.1_2036844_2037168_-	NA|59aa|up_9|NC_017295.1_2020056_2020233_-	NA	NA|345aa|up_8|NC_017295.1_2020247_2021282_-	NA	NA|188aa|up_7|NC_017295.1_2021501_2022065_-	smart00338, BRLZ, basic region leucin zipper	NA|352aa|up_6|NC_017295.1_2022084_2023140_-	cd06525, GH25_Lyc-like, Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene	NA|86aa|up_5|NC_017295.1_2023157_2023415_-	pfam10779, XhlA, Haemolysin XhlA	NA|89aa|up_4|NC_017295.1_2023426_2023693_-	NA	NA|1404aa|up_3|NC_017295.1_2023722_2027934_-	TIGR01665, structural_protein, phage minor structural protein, N-terminal region	NA|125aa|up_2|NC_017295.1_2027965_2028340_-	NA	NA|89aa|up_1|NC_017295.1_2028326_2028593_-	NA	NA|403aa|up_0|NC_017295.1_2028611_2029820_-	NA	NA|110aa|down_0|NC_017295.1_2035496_2035826_-	NA	NA|307aa|down_1|NC_017295.1_2035879_2036800_-	TIGR01603, Uncharacterized_phage_related_protein, phage major tail protein, phi13 family	NA|108aa|down_2|NC_017295.1_2036844_2037168_-	NA	NA|144aa|down_3|NC_017295.1_2037164_2037596_-	TIGR01725, hypothetical_protein, phage protein, HK97 gp10 family	NA|116aa|down_4|NC_017295.1_2037596_2037944_-	COG5614, COG5614, Bacteriophage head-tail adaptor [General function prediction only]	NA|106aa|down_5|NC_017295.1_2037906_2038224_-	TIGR01560, phage-related_hypothetical_protein, uncharacterized phage protein (possible DNA packaging)	NA|397aa|down_6|NC_017295.1_2038511_2039702_-	TIGR01554, prophage_Lp3_protein_18, phage major capsid protein, HK97 family	NA|257aa|down_7|NC_017295.1_2039703_2040474_-	cd07016, S14_ClpP_1, Caseinolytic protease (ClpP) is an ATP-dependent, highly conserved serine protease	NA|420aa|down_8|NC_017295.1_2040436_2041696_-	COG4695, COG4695, Phage-related protein [Function unknown]	NA|597aa|down_9|NC_017295.1_2041692_2043483_-	COG4626, COG4626, Phage terminase-like protein, large subunit [General function prediction only]
GCF_000191905.1_ASM19190v1	NC_017295	Clostridium acetobutylicum EA 2018, complete sequence	2	2079778-2079917	2	CRISPRCasFinder	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	TTATGGTTCAAGATTTATAATGGGCTCTTAGCCCAAACCCCAGG	44	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|277aa|up_9|NC_017295.1_2070084_2070915_-,NA|55aa|up_8|NC_017295.1_2071283_2071448_-,NA|74aa|down_4|NC_017295.1_2086125_2086347_-,NA|166aa|down_5|NC_017295.1_2086362_2086860_-,NA|190aa|down_9|NC_017295.1_2089100_2089670_-	NA|277aa|up_9|NC_017295.1_2070084_2070915_-	NA	NA|55aa|up_8|NC_017295.1_2071283_2071448_-	NA	NA|59aa|up_7|NC_017295.1_2071475_2071652_-	COG5515, COG5515, Uncharacterized conserved small protein [Function unknown]	NA|532aa|up_6|NC_017295.1_2071767_2073363_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|275aa|up_5|NC_017295.1_2073756_2074581_-	cd19157, AKR_AKR5G1-3, AKR5G family of aldo-keto reductase (AKR)	NA|178aa|up_4|NC_017295.1_2074680_2075214_-	pfam06866, DUF1256, Protein of unknown function (DUF1256)	NA|195aa|up_3|NC_017295.1_2075215_2075800_-	pfam06866, DUF1256, Protein of unknown function (DUF1256)	NA|325aa|up_2|NC_017295.1_2075968_2076943_-	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|243aa|up_1|NC_017295.1_2077022_2077751_-	COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|526aa|up_0|NC_017295.1_2078042_2079620_-	cd07410, MPP_CpdB_N, Escherichia coli CpdB and related proteins, N-terminal metallophosphatase domain	NA|159aa|down_0|NC_017295.1_2080919_2081396_-	TIGR01462, Transcription_elongation_factor_GreA, transcription elongation factor GreA	NA|445aa|down_1|NC_017295.1_2081578_2082913_-	pfam13229, Beta_helix, Right handed beta helix region	NA|271aa|down_2|NC_017295.1_2083453_2084266_+	pfam13229, Beta_helix, Right handed beta helix region	NA|592aa|down_3|NC_017295.1_2084329_2086105_-	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|74aa|down_4|NC_017295.1_2086125_2086347_-	NA	NA|166aa|down_5|NC_017295.1_2086362_2086860_-	NA	NA|200aa|down_6|NC_017295.1_2086919_2087519_-	pfam01478, Peptidase_A24, Type IV leader peptidase family	NA|215aa|down_7|NC_017295.1_2087587_2088232_-	PRK14949, PRK14949, DNA polymerase III subunits gamma and tau; Provisional	NA|218aa|down_8|NC_017295.1_2088431_2089085_-	pfam08666, SAF, SAF domain	NA|190aa|down_9|NC_017295.1_2089100_2089670_-	NA
GCF_000191905.1_ASM19190v1	NC_017295	Clostridium acetobutylicum EA 2018, complete sequence	3	2139033-2139128	3	CRISPRCasFinder	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	CTGTAGCATCAACATAATTACCATCAATATC	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|211aa|up_0|NC_017295.1_2138343_2138976_-,NA|155aa|down_0|NC_017295.1_2139557_2140022_-,NA|81aa|down_1|NC_017295.1_2140187_2140430_-,NA|150aa|down_2|NC_017295.1_2140436_2140886_-,NA|198aa|down_3|NC_017295.1_2141014_2141608_-,NA|63aa|down_4|NC_017295.1_2141625_2141814_-,NA|157aa|down_5|NC_017295.1_2141848_2142319_-,NA|163aa|down_6|NC_017295.1_2142320_2142809_-,NA|220aa|down_7|NC_017295.1_2143264_2143924_-,NA|117aa|down_8|NC_017295.1_2143970_2144321_-,NA|459aa|down_9|NC_017295.1_2144336_2145713_-	NA|104aa|up_9|NC_017295.1_2127985_2128297_-	COG0236, AcpP, Acyl carrier protein [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|685aa|up_8|NC_017295.1_2128333_2130388_-	COG2414, COG2414, Aldehyde:ferredoxin oxidoreductase [Energy production and conversion]	NA|317aa|up_7|NC_017295.1_2130428_2131379_-	COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism]	NA|637aa|up_6|NC_017295.1_2131748_2133659_-	PRK14498, PRK14498, putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein; Provisional	NA|408aa|up_5|NC_017295.1_2133724_2134948_-	COG0303, MoeA, Molybdopterin biosynthesis enzyme [Coenzyme metabolism]	NA|166aa|up_4|NC_017295.1_2134963_2135461_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|181aa|up_3|NC_017295.1_2135501_2136044_-	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|141aa|up_2|NC_017295.1_2136123_2136546_-	cd08865, SRPBCC_10, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|181aa|up_1|NC_017295.1_2136645_2137188_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|211aa|up_0|NC_017295.1_2138343_2138976_-	NA	NA|155aa|down_0|NC_017295.1_2139557_2140022_-	NA	NA|81aa|down_1|NC_017295.1_2140187_2140430_-	NA	NA|150aa|down_2|NC_017295.1_2140436_2140886_-	NA	NA|198aa|down_3|NC_017295.1_2141014_2141608_-	NA	NA|63aa|down_4|NC_017295.1_2141625_2141814_-	NA	NA|157aa|down_5|NC_017295.1_2141848_2142319_-	NA	NA|163aa|down_6|NC_017295.1_2142320_2142809_-	NA	NA|220aa|down_7|NC_017295.1_2143264_2143924_-	NA	NA|117aa|down_8|NC_017295.1_2143970_2144321_-	NA	NA|459aa|down_9|NC_017295.1_2144336_2145713_-	NA
