assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000244875.1_ASM24487v1	NC_016791	Clostridium sp. BNL1100, complete sequence	1	1450296-1451654	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	csa3,cas3,RT,cas5,cas8c,cas7,cas4,cas1,cas2,DEDDh,c2c10_CAS-V-U3,DinG,Cas14u_CAS-V,WYL	Type I-C,Type I-U, Type I-U?	GTCGCTCCTCTCGTAGGAGCGTGGATTGAAAT,GTCGCTCCTCTCGTAGGAGCGTGGATTGAAAT,GTCGCTCCTCTCGTAGGAGCGTGGATTGAAAT	32,32,32	0	0	NA	NA	I-C:I-C:I-C	19,20,20	20	TypeI-C,TypeI-U,TypeI-U?	csa3,cas3,RT,cas5,cas8c,cas7,cas4,cas1,cas2,DEDDh,c2c10_CAS-V-U3,DinG,Cas14u_CAS-V,WYL	NA,NA|99aa|down_3|NC_016791.1_1455179_1455476_-,NA|55aa|down_4|NC_016791.1_1455794_1455959_-,NA|228aa|down_7|NC_016791.1_1467984_1468668_+,NA|126aa|down_8|NC_016791.1_1469141_1469519_+,NA|71aa|down_9|NC_016791.1_1469529_1469742_+	NA|425aa|up_9|NC_016791.1_1438588_1439863_+	pfam02618, YceG, YceG-like family	NA|214aa|up_8|NC_016791.1_1439967_1440609_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|407aa|up_7|NC_016791.1_1440629_1441850_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	cas3|804aa|up_6|NC_016791.1_1442186_1444598_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|246aa|up_5|NC_016791.1_1444608_1445346_+	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|646aa|up_4|NC_016791.1_1445332_1447270_+	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|286aa|up_3|NC_016791.1_1447283_1448141_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|223aa|up_2|NC_016791.1_1448127_1448796_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|344aa|up_1|NC_016791.1_1448792_1449824_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NC_016791.1_1449833_1450124_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|152aa|down_0|NC_016791.1_1452188_1452644_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|552aa|down_1|NC_016791.1_1452737_1454393_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|235aa|down_2|NC_016791.1_1454482_1455187_-	cd14814, Peptidase_M15, Metalloproteases including zinc D-Ala-D-Ala carboxypeptidase, L-Ala-D-Glu peptidase, L,D-carboxypeptidase, bacteriophage endolysins, and related proteins	NA|99aa|down_3|NC_016791.1_1455179_1455476_-	NA	NA|55aa|down_4|NC_016791.1_1455794_1455959_-	NA	NA|117aa|down_5|NC_016791.1_1455961_1456312_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|2257aa|down_6|NC_016791.1_1456682_1463453_+	pfam00942, CBM_3, Cellulose binding domain	NA|228aa|down_7|NC_016791.1_1467984_1468668_+	NA	NA|126aa|down_8|NC_016791.1_1469141_1469519_+	NA	NA|71aa|down_9|NC_016791.1_1469529_1469742_+	NA
GCF_000244875.1_ASM24487v1	NC_016791	Clostridium sp. BNL1100, complete sequence	2	2063114-2063186	2	CRISPRCasFinder	no		csa3,cas3,RT,cas5,cas8c,cas7,cas4,cas1,cas2,DEDDh,c2c10_CAS-V-U3,DinG,Cas14u_CAS-V,WYL	Orphan	TGTAAAGTAAAAATACTTGACAAT	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,RT,cas5,cas8c,cas7,cas4,cas1,cas2,DEDDh,c2c10_CAS-V-U3,DinG,Cas14u_CAS-V,WYL	NA,NA	NA|944aa|up_9|NC_016791.1_2047638_2050470_+	pfam03699, UPF0182, Uncharacterized protein family (UPF0182)	NA|627aa|up_8|NC_016791.1_2050725_2052606_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|326aa|up_7|NC_016791.1_2052659_2053637_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|74aa|up_6|NC_016791.1_2053663_2053885_+	COG4443, COG4443, Uncharacterized protein conserved in bacteria [Function unknown]	NA|771aa|up_5|NC_016791.1_2054397_2056710_+	PRK07111, PRK07111, anaerobic ribonucleoside triphosphate reductase; Provisional	NA|170aa|up_4|NC_016791.1_2056778_2057288_+	TIGR02491, Anaerobic_ribonucleoside-triphosphate_reductase, anaerobic ribonucleoside-triphosphate reductase activating protein	NA|629aa|up_3|NC_016791.1_2057412_2059299_+	cd16015, LTA_synthase, Lipoteichoic acid synthase like	NA|282aa|up_2|NC_016791.1_2059372_2060218_-	PRK13317, PRK13317, pantothenate kinase; Provisional	NA|551aa|up_1|NC_016791.1_2060240_2061893_-	COG1297, COG1297, Predicted membrane protein [Function unknown]	NA|259aa|up_0|NC_016791.1_2062333_2063110_+	cd11297, PIN_LabA-like_N_1, uncharacterized subfamily of N-terminal LabA-like PIN domains	NA|119aa|down_0|NC_016791.1_2063225_2063582_+	PRK00118, PRK00118, putative DNA-binding protein; Validated	NA|449aa|down_1|NC_016791.1_2063617_2064964_+	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|82aa|down_2|NC_016791.1_2065006_2065252_+	PRK00040, rpsP, 30S ribosomal protein S16; Reviewed	NA|77aa|down_3|NC_016791.1_2065399_2065630_+	PRK00468, PRK00468, KH domain-containing protein	NA|170aa|down_4|NC_016791.1_2065698_2066208_+	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|230aa|down_5|NC_016791.1_2066207_2066897_+	PRK00026, trmD, tRNA (guanine-N(1)-)-methyltransferase; Reviewed	NA|57aa|down_6|NC_016791.1_2067081_2067252_+	pfam10055, DUF2292, Uncharacterized small protein (DUF2292)	NA|303aa|down_7|NC_016791.1_2067387_2068296_+	TIGR01290, FeMo_cofactor_biosynthesis_protein_NifB, nitrogenase cofactor biosynthesis protein NifB	NA|116aa|down_8|NC_016791.1_2068324_2068672_+	cd00852, NifB, NifB belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme as part of nitrogen fixation in bacteria	NA|293aa|down_9|NC_016791.1_2068688_2069567_+	cd02040, NifH, nitrogenase component II NifH
