assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004006395.1_ASM400639v1	CP029758	Clostridium sp. AWRP chromosome, complete genome	1	2028394-2029679	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no		csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	Orphan	ATTTAAATACATCTCATGTTGAGGTTCAAC,ATTTAAATACATCTCATGTTGAGGTTCAAC,ATTTAAATACATCTCATGTTGAGGTTCAAC	30,30,30	0	0	NA	NA	II-B:II-B:II-B	19,19,19	19	Orphan	csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	NA|365aa|up_9|CP029758.1_2018055_2019150_+,NA|230aa|up_3|CP029758.1_2024659_2025349_-,NA|109aa|up_2|CP029758.1_2026590_2026917_+,NA|121aa|up_1|CP029758.1_2026935_2027298_+,NA	NA|365aa|up_9|CP029758.1_2018055_2019150_+	NA	NA|158aa|up_8|CP029758.1_2019826_2020300_+	pfam06541, ABC_trans_CmpB, Putative ABC-transporter type IV	NA|252aa|up_7|CP029758.1_2020408_2021164_+	cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase)	NA|88aa|up_6|CP029758.1_2021241_2021505_+	pfam10779, XhlA, Haemolysin XhlA	NA|306aa|up_5|CP029758.1_2021521_2022439_+	cd06525, GH25_Lyc-like, Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene	NA|411aa|up_4|CP029758.1_2023238_2024471_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|230aa|up_3|CP029758.1_2024659_2025349_-	NA	NA|109aa|up_2|CP029758.1_2026590_2026917_+	NA	NA|121aa|up_1|CP029758.1_2026935_2027298_+	NA	NA|177aa|up_0|CP029758.1_2027570_2028101_+	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|200aa|down_0|CP029758.1_2030023_2030623_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|168aa|down_1|CP029758.1_2031196_2031700_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|545aa|down_2|CP029758.1_2032425_2034060_-	smart00857, Resolvase, Resolvase, N terminal domain	NA|241aa|down_3|CP029758.1_2034625_2035348_+	COG4712, COG4712, Uncharacterized protein conserved in bacteria [Function unknown]	NA|95aa|down_4|CP029758.1_2035440_2035725_-	pfam01134, GIDA, Glucose inhibited division protein A	NA|329aa|down_5|CP029758.1_2035737_2036724_-	PRK05802, PRK05802, sulfide/dihydroorotate dehydrogenase-like FAD/NAD-binding protein	NA|289aa|down_6|CP029758.1_2036867_2037734_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|458aa|down_7|CP029758.1_2038188_2039562_+	pfam00665, rve, Integrase core domain	NA|485aa|down_8|CP029758.1_2039876_2041331_+	pfam16800, Endopep_inhib, IseA DL-endopeptidase inhibitor	NA|623aa|down_9|CP029758.1_2041606_2043475_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase
GCA_004006395.1_ASM400639v1	CP029758	Clostridium sp. AWRP chromosome, complete genome	2	2858169-2858278	2	CRISPRCasFinder	no		csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	Orphan	AGTAAGTAGATTCACACCAAATCGAAGATTTGGGTTCT	38	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	NA|262aa|up_8|CP029758.1_2850504_2851290_-,NA|153aa|down_7|CP029758.1_2869668_2870127_-,NA|99aa|down_8|CP029758.1_2870495_2870792_+	NA|228aa|up_9|CP029758.1_2849566_2850250_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|262aa|up_8|CP029758.1_2850504_2851290_-	NA	NA|232aa|up_7|CP029758.1_2851289_2851985_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|123aa|up_6|CP029758.1_2851987_2852356_-	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|151aa|up_5|CP029758.1_2852543_2852996_-	cd06262, metallo-hydrolase-like_MBL-fold, mainly hydrolytic enzymes and related proteins which carry out various biological functions; MBL-fold metallohydrolase domain	NA|224aa|up_4|CP029758.1_2853177_2853849_+	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|125aa|up_3|CP029758.1_2854030_2854405_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|509aa|up_2|CP029758.1_2854416_2855943_+	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|366aa|up_1|CP029758.1_2856019_2857117_-	pfam12671, Amidase_6, Putative amidase domain	NA|302aa|up_0|CP029758.1_2857241_2858147_-	pfam13539, Peptidase_M15_4, D-alanyl-D-alanine carboxypeptidase	NA|379aa|down_0|CP029758.1_2858468_2859605_+	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|1069aa|down_1|CP029758.1_2860153_2863360_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|355aa|down_2|CP029758.1_2863455_2864520_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|334aa|down_3|CP029758.1_2864534_2865536_-	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|301aa|down_4|CP029758.1_2865828_2866731_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|208aa|down_5|CP029758.1_2866777_2867401_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|723aa|down_6|CP029758.1_2867449_2869618_-	TIGR01389, recQ, ATP-dependent DNA helicase RecQ	NA|153aa|down_7|CP029758.1_2869668_2870127_-	NA	NA|99aa|down_8|CP029758.1_2870495_2870792_+	NA	NA|409aa|down_9|CP029758.1_2871002_2872229_-	pfam00872, Transposase_mut, Transposase, Mutator family
GCA_004006395.1_ASM400639v1	CP029758	Clostridium sp. AWRP chromosome, complete genome	3	3756415-3756969	3,2,2	CRISPRCasFinder,CRT,PILER-CR	no	RT	csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	Unclear	ATTTAAATACATCTCATGTTAAGGTTCAAC,ATTTAAATACATCTCATGTTAAGGTTCAAC,ATTTAAATACATCTCATGTTAAGGTTCAAC	30,30,30	0	0	NA	NA	III-B:III-B:III-B	8,8,6	8	Orphan	csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	NA|130aa|up_5|CP029758.1_3752770_3753160_-,NA|57aa|up_0|CP029758.1_3756172_3756343_-,NA	NA|204aa|up_9|CP029758.1_3748852_3749464_-	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|188aa|up_8|CP029758.1_3749469_3750033_-	PRK06242, PRK06242, flavodoxin; Provisional	NA|243aa|up_7|CP029758.1_3751241_3751970_-	COG1664, CcmA, Integral membrane protein CcmA involved in cell shape determination [Cell envelope biogenesis, outer membrane]	NA|212aa|up_6|CP029758.1_3751989_3752625_-	pfam13171, DUF4004, Protein of unknown function (DUF4004)	NA|130aa|up_5|CP029758.1_3752770_3753160_-	NA	NA|251aa|up_4|CP029758.1_3753180_3753933_-	cd05333, BKR_SDR_c, beta-Keto acyl carrier protein reductase (BKR), involved in Type II FAS, classical (c) SDRs	NA|269aa|up_3|CP029758.1_3754074_3754881_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|206aa|up_2|CP029758.1_3755010_3755628_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|95aa|up_1|CP029758.1_3755760_3756045_-	pfam08765, Mor, Mor transcription activator family	NA|57aa|up_0|CP029758.1_3756172_3756343_-	NA	NA|411aa|down_0|CP029758.1_3757073_3758306_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|224aa|down_1|CP029758.1_3759754_3760426_-	PRK10402, PRK10402, DNA-binding transcriptional activator YeiL; Provisional	NA|211aa|down_2|CP029758.1_3760524_3761157_+	pfam13649, Methyltransf_25, Methyltransferase domain	RT|438aa|down_3|CP029758.1_3761967_3763281_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|133aa|down_4|CP029758.1_3763893_3764292_+	cd07824, SRPBCC_6, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|169aa|down_5|CP029758.1_3764308_3764815_-	pfam13238, AAA_18, AAA domain	NA|181aa|down_6|CP029758.1_3764842_3765385_-	cd02139, nitroreductase, nitroreductase family protein	NA|253aa|down_7|CP029758.1_3765518_3766277_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|195aa|down_8|CP029758.1_3766396_3766981_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|293aa|down_9|CP029758.1_3767022_3767901_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
GCA_004006395.1_ASM400639v1	CP029758	Clostridium sp. AWRP chromosome, complete genome	4	3758507-3759463	3,4,3	PILER-CR,CRISPRCasFinder,CRT	no	RT	csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	Unclear	ATTTAAATACATCTCATGTTAAGGTTCAAC,ATTTAAATACATCTCATGTTAAGGTTCAAC,ATTTAAATACATCTCATGTTAAGGTTCAAC	30,30,30	0	0	NA	NA	III-B:III-B:III-B	14,14,14	14	Orphan	csa3,cas3,RT,WYL,DEDDh,cas8b1,PD-DExK,cas3HD	NA|130aa|up_6|CP029758.1_3752770_3753160_-,NA|57aa|up_1|CP029758.1_3756172_3756343_-,NA	NA|188aa|up_9|CP029758.1_3749469_3750033_-	PRK06242, PRK06242, flavodoxin; Provisional	NA|243aa|up_8|CP029758.1_3751241_3751970_-	COG1664, CcmA, Integral membrane protein CcmA involved in cell shape determination [Cell envelope biogenesis, outer membrane]	NA|212aa|up_7|CP029758.1_3751989_3752625_-	pfam13171, DUF4004, Protein of unknown function (DUF4004)	NA|130aa|up_6|CP029758.1_3752770_3753160_-	NA	NA|251aa|up_5|CP029758.1_3753180_3753933_-	cd05333, BKR_SDR_c, beta-Keto acyl carrier protein reductase (BKR), involved in Type II FAS, classical (c) SDRs	NA|269aa|up_4|CP029758.1_3754074_3754881_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|206aa|up_3|CP029758.1_3755010_3755628_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|95aa|up_2|CP029758.1_3755760_3756045_-	pfam08765, Mor, Mor transcription activator family	NA|57aa|up_1|CP029758.1_3756172_3756343_-	NA	NA|411aa|up_0|CP029758.1_3757073_3758306_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|224aa|down_0|CP029758.1_3759754_3760426_-	PRK10402, PRK10402, DNA-binding transcriptional activator YeiL; Provisional	NA|211aa|down_1|CP029758.1_3760524_3761157_+	pfam13649, Methyltransf_25, Methyltransferase domain	RT|438aa|down_2|CP029758.1_3761967_3763281_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|133aa|down_3|CP029758.1_3763893_3764292_+	cd07824, SRPBCC_6, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|169aa|down_4|CP029758.1_3764308_3764815_-	pfam13238, AAA_18, AAA domain	NA|181aa|down_5|CP029758.1_3764842_3765385_-	cd02139, nitroreductase, nitroreductase family protein	NA|253aa|down_6|CP029758.1_3765518_3766277_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|195aa|down_7|CP029758.1_3766396_3766981_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|293aa|down_8|CP029758.1_3767022_3767901_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|485aa|down_9|CP029758.1_3768056_3769511_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily
