assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000237085.1_ASM23708v1	NC_016627	Hungateiclostridium clariflavum DSM 19732, complete sequence	1	2476251-2478645	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1	csa3,RT,WYL,DEDDh,cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1,cas8b2,DinG	Type III-B,Type III-D,Type I-B,Type III-C,Type III-A	GTTTCAATCCTTATTTTTATGGAACATCTACTTCAAC,GTTTCAATCCTTATTTTTATGGAACATCTACTTCAAC,GTTTCAATCCTTATTTTTATGGAACATCTACTTCAAC	37,37,37	0	0	NA	NA	I-B:I-B:I-B	32,32,31	32	TypeIII-B,TypeIII-D,TypeI-B,TypeIII-C,TypeIII-A	csa3,RT,WYL,DEDDh,cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1,cas8b2,DinG	NA|54aa|up_4|NC_016627.1_2472911_2473073_-,csx19|122aa|down_2|NC_016627.1_2483839_2484205_-	NA|499aa|up_9|NC_016627.1_2466936_2468433_-	PRK07107, PRK07107, IMP dehydrogenase	NA|252aa|up_8|NC_016627.1_2468619_2469375_-	PRK06924, PRK06924, (S)-benzoin forming benzil reductase	NA|203aa|up_7|NC_016627.1_2469888_2470497_-	cd02603, HAD_sEH-N_like, N-terminal lipase phosphatase domain of human soluble epoxide hydrolase, Escherichia coli YihX/HAD4 alpha-D-glucose 1-phosphate phosphatase, and related domains, may be inactive	NA|522aa|up_6|NC_016627.1_2470682_2472248_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|205aa|up_5|NC_016627.1_2472322_2472937_-	TIGR02692, putative_tRNA_nucleotidyltransferase, tRNA adenylyltransferase	NA|54aa|up_4|NC_016627.1_2472911_2473073_-	NA	NA|155aa|up_3|NC_016627.1_2473254_2473719_-	cd17906, CheX, chemotaxis phosphatase CheX	cas4|205aa|up_2|NC_016627.1_2474298_2474913_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas2|97aa|up_1|NC_016627.1_2474887_2475178_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|332aa|up_0|NC_016627.1_2475171_2476167_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx1|492aa|down_0|NC_016627.1_2478925_2480401_-	pfam09670, Cas_Cas02710, CRISPR-associated protein (Cas_Cas02710)	csx1|416aa|down_1|NC_016627.1_2480452_2481700_-	cd09732, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx19|122aa|down_2|NC_016627.1_2483839_2484205_-	NA	csm3gr7|444aa|down_3|NC_016627.1_2484211_2485543_-	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csx10gr5|534aa|down_4|NC_016627.1_2485539_2487141_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|225aa|down_5|NC_016627.1_2487133_2487808_-	pfam03787, RAMPs, RAMP superfamily	cas10|488aa|down_6|NC_016627.1_2487810_2489274_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|220aa|down_7|NC_016627.1_2489320_2489980_-	pfam17262, DUF5328, Family of unknown function (DUF5328)	cas3|799aa|down_8|NC_016627.1_2490007_2492404_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|238aa|down_9|NC_016627.1_2492417_2493131_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI
GCF_000237085.1_ASM23708v1	NC_016627	Hungateiclostridium clariflavum DSM 19732, complete sequence	2	2496169-2502078	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1	csa3,RT,WYL,DEDDh,cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1,cas8b2,DinG	Type III-B,Type III-D,Type I-B,Type III-C,Type III-A	GTTTCAATCCTTATTTTACTGGATGTTCTACTTCAAC,GTTTCAATCCTTATTTTACTGGATGTTCTACTTCAAC,GTTTCAATCCTTATTTTACTGGATGTTCTACTTCAAC	37,37,37	0	0	NA	NA	I-B:I-B:I-B	79,79,78	79	TypeIII-B,TypeIII-D,TypeI-B,TypeIII-C,TypeIII-A	csa3,RT,WYL,DEDDh,cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1,cas8b2,DinG	csx19|122aa|up_9|NC_016627.1_2483839_2484205_-,NA|105aa|down_6|NC_016627.1_2508352_2508667_+	csx19|122aa|up_9|NC_016627.1_2483839_2484205_-	NA	csm3gr7|444aa|up_8|NC_016627.1_2484211_2485543_-	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csx10gr5|534aa|up_7|NC_016627.1_2485539_2487141_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|225aa|up_6|NC_016627.1_2487133_2487808_-	pfam03787, RAMPs, RAMP superfamily	cas10|488aa|up_5|NC_016627.1_2487810_2489274_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|220aa|up_4|NC_016627.1_2489320_2489980_-	pfam17262, DUF5328, Family of unknown function (DUF5328)	cas3|799aa|up_3|NC_016627.1_2490007_2492404_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|238aa|up_2|NC_016627.1_2492417_2493131_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7|306aa|up_1|NC_016627.1_2493145_2494063_-	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI	cas8b1|615aa|up_0|NC_016627.1_2494059_2495904_-	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	NA|618aa|down_0|NC_016627.1_2502348_2504202_-	pfam01973, MAF_flag10, Protein of unknown function DUF115	NA|152aa|down_1|NC_016627.1_2504270_2504726_-	PRK00464, nrdR, transcriptional repressor NrdR	NA|90aa|down_2|NC_016627.1_2504952_2505222_-	TIGR02888, conserved_hypothetical_protein, sporulation protein, YlmC/YmxH family	NA|258aa|down_3|NC_016627.1_2505391_2506165_-	PRK08215, PRK08215, RNA polymerase sporulation sigma factor SigG	NA|245aa|down_4|NC_016627.1_2506333_2507068_-	PRK08301, PRK08301, RNA polymerase sporulation sigma factor SigE	NA|299aa|down_5|NC_016627.1_2507107_2508004_-	pfam03419, Peptidase_U4, Sporulation factor SpoIIGA	NA|105aa|down_6|NC_016627.1_2508352_2508667_+	NA	NA|365aa|down_7|NC_016627.1_2508783_2509878_-	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|412aa|down_8|NC_016627.1_2510088_2511324_-	COG0849, ftsA, Cell division ATPase FtsA [Cell division and chromosome partitioning]	NA|122aa|down_9|NC_016627.1_2511530_2511896_-	pfam06947, DUF1290, Protein of unknown function (DUF1290)
GCF_000237085.1_ASM23708v1	NC_016627	Hungateiclostridium clariflavum DSM 19732, complete sequence	3	3016571-3020204	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b2	csa3,RT,WYL,DEDDh,cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1,cas8b2,DinG	Unclear	ATTTACATCCCACATAGTTAAAGAACAAC,ATTTACATCCCACATAGTTAAAGAACAAC,ATTTACATCCCACATAGTTAAAGAACAAC	29,29,29	0	0	NA	NA	NA:NA:NA	55,55,55	55	Unclear	csa3,RT,WYL,DEDDh,cas4,cas2,cas1,csx1,csx19,csm3gr7,csx10gr5,cas10,cas6,cas3,cas5,cas7,cas8b1,cas8b2,DinG	NA|138aa|up_4|NC_016627.1_3009640_3010054_-,NA|470aa|up_0|NC_016627.1_3013647_3015057_+,NA|80aa|down_9|NC_016627.1_3031265_3031505_-	NA|406aa|up_9|NC_016627.1_3002015_3003233_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|389aa|up_8|NC_016627.1_3003484_3004651_-	COG1729, COG1729, Uncharacterized protein conserved in bacteria [Function unknown]	NA|426aa|up_7|NC_016627.1_3004689_3005967_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|914aa|up_6|NC_016627.1_3006134_3008876_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|195aa|up_5|NC_016627.1_3009007_3009592_-	pfam17248, DUF5317, Family of unknown function (DUF5317)	NA|138aa|up_4|NC_016627.1_3009640_3010054_-	NA	NA|101aa|up_3|NC_016627.1_3010143_3010446_-	TIGR02531, conserved_hypothetical_protein, TrpR-related protein YerC/YecD	NA|444aa|up_2|NC_016627.1_3011262_3012594_+	COG0617, PcnB, tRNA nucleotidyltransferase/poly(A) polymerase [Translation, ribosomal structure and biogenesis]	NA|308aa|up_1|NC_016627.1_3012590_3013514_+	COG2267, PldB, Lysophospholipase [Lipid metabolism]	NA|470aa|up_0|NC_016627.1_3013647_3015057_+	NA	cas2|88aa|down_0|NC_016627.1_3020389_3020653_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|466aa|down_1|NC_016627.1_3021028_3022426_+	pfam13546, DDE_5, DDE superfamily endonuclease	cas1|263aa|down_2|NC_016627.1_3022388_3023177_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|178aa|down_3|NC_016627.1_3023169_3023703_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|731aa|down_4|NC_016627.1_3023723_3025916_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|250aa|down_5|NC_016627.1_3025931_3026681_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|295aa|down_6|NC_016627.1_3026667_3027552_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas8b2|530aa|down_7|NC_016627.1_3027551_3029141_-	cd09754, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	NA|430aa|down_8|NC_016627.1_3029555_3030845_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|80aa|down_9|NC_016627.1_3031265_3031505_-	NA
