assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001854185.1_ASM185418v1	NZ_CP017603	Clostridium formicaceticum strain ATCC 27076 chromosome, complete genome	1	219327-219389	1	CRISPRCasFinder	no	WYL,Cas14u_CAS-V	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	Unclear	ATCAAAACGTCCCCTTGGCTTATG	24	0	0	NA	NA	NA	1	1	Unclear	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	NA|313aa|up_8|NZ_CP017603.1_208133_209072_-,NA|91aa|up_4|NZ_CP017603.1_213955_214228_+,NA|502aa|up_3|NZ_CP017603.1_214297_215803_-,NA|245aa|down_2|NZ_CP017603.1_221262_221997_+,NA|407aa|down_9|NZ_CP017603.1_229234_230455_-	NA|425aa|up_9|NZ_CP017603.1_205813_207088_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|313aa|up_8|NZ_CP017603.1_208133_209072_-	NA	NA|84aa|up_7|NZ_CP017603.1_209237_209489_-	pfam12787, EcsC, EcsC protein family	NA|130aa|up_6|NZ_CP017603.1_209485_209875_-	pfam12787, EcsC, EcsC protein family	NA|872aa|up_5|NZ_CP017603.1_210561_213177_-	PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated	NA|91aa|up_4|NZ_CP017603.1_213955_214228_+	NA	NA|502aa|up_3|NZ_CP017603.1_214297_215803_-	NA	NA|236aa|up_2|NZ_CP017603.1_217190_217898_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|270aa|up_1|NZ_CP017603.1_218096_218906_+	cd10944, CE4_SmPgdA_like, Catalytic NodB homology domain of Streptococcus mutans polysaccharide deacetylase PgdA, Bacillus subtilis YheN, and similar proteins	NA|142aa|up_0|NZ_CP017603.1_218855_219281_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|75aa|down_0|NZ_CP017603.1_219414_219639_+	pfam03816, LytR_cpsA_psr, Cell envelope-related transcriptional attenuator domain	NA|459aa|down_1|NZ_CP017603.1_219656_221033_+	cd01596, Aspartase_like, aspartase (L-aspartate ammonia-lyase) and fumarase class II enzymes	NA|245aa|down_2|NZ_CP017603.1_221262_221997_+	NA	NA|410aa|down_3|NZ_CP017603.1_222640_223870_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|267aa|down_4|NZ_CP017603.1_223859_224660_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|336aa|down_5|NZ_CP017603.1_224660_225668_-	pfam01032, FecCD, FecCD transport family	NA|329aa|down_6|NZ_CP017603.1_225669_226656_-	pfam01032, FecCD, FecCD transport family	NA|318aa|down_7|NZ_CP017603.1_226642_227596_-	cd01138, FeuA, Periplasmic binding protein FeuA	NA|334aa|down_8|NZ_CP017603.1_227900_228902_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|407aa|down_9|NZ_CP017603.1_229234_230455_-	NA
GCF_001854185.1_ASM185418v1	NZ_CP017603	Clostridium formicaceticum strain ATCC 27076 chromosome, complete genome	2	663813-663904	2	CRISPRCasFinder	no	cas14k	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	Unclear	AAAATTACAATAAAGTTCACCGAA	24	0	0	NA	NA	NA	1	1	TypeV	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	NA|113aa|up_7|NZ_CP017603.1_651046_651385_-,NA|166aa|down_3|NZ_CP017603.1_669689_670187_-	NA|542aa|up_9|NZ_CP017603.1_649004_650630_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|117aa|up_8|NZ_CP017603.1_650702_651053_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|113aa|up_7|NZ_CP017603.1_651046_651385_-	NA	NA|601aa|up_6|NZ_CP017603.1_651465_653268_-	pfam00665, rve, Integrase core domain	NA|272aa|up_5|NZ_CP017603.1_653282_654098_-	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|313aa|up_4|NZ_CP017603.1_654328_655267_-	pfam15978, TnsD, Tn7-like transposition protein D	NA|874aa|up_3|NZ_CP017603.1_656356_658978_-	pfam04851, ResIII, Type III restriction enzyme, res subunit	NA|549aa|up_2|NZ_CP017603.1_658980_660627_-	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|475aa|up_1|NZ_CP017603.1_660888_662313_-	pfam18134, AGS_C, Adenylyl/Guanylyl and SMODS C-terminal sensor domain	NA|384aa|up_0|NZ_CP017603.1_662330_663482_-	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	NA|609aa|down_0|NZ_CP017603.1_664258_666085_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|449aa|down_1|NZ_CP017603.1_666498_667845_-	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|540aa|down_2|NZ_CP017603.1_668035_669655_-	pfam02554, CstA, Carbon starvation protein CstA	NA|166aa|down_3|NZ_CP017603.1_669689_670187_-	NA	cas14k|421aa|down_4|NZ_CP017603.1_670310_671573_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|182aa|down_5|NZ_CP017603.1_672512_673058_-	pfam01558, POR, Pyruvate ferredoxin/flavodoxin oxidoreductase	NA|249aa|down_6|NZ_CP017603.1_673059_673806_-	cd03375, TPP_OGFOR, Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor	NA|355aa|down_7|NZ_CP017603.1_673805_674870_-	PRK07119, PRK07119, 2-ketoisovalerate ferredoxin reductase; Validated	NA|75aa|down_8|NZ_CP017603.1_674897_675122_-	COG1143, NuoI, Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I) [Energy production and conversion]	NA|224aa|down_9|NZ_CP017603.1_675150_675822_-	cd02042, ParAB_family, partition proteins ParAB family
GCF_001854185.1_ASM185418v1	NZ_CP017603	Clostridium formicaceticum strain ATCC 27076 chromosome, complete genome	3	2541918-2543001	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas6,cas8b1,cas7b,cas5,cas3,cas4,cas1,cas2	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	Type I-B	ATTGAACCTCAACATAGGATGTATTTAAAT,ATTGAACCTCAACATAGGATGTATTTAAAT,ATTGAACCTCAACATAGGATGTATTTAAAT	30,30,30	0	0	NA	NA	II-B:II-B:II-B	16,16,13	16	TypeI-B	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	NA,NA	NA|1635aa|up_9|NZ_CP017603.1_2526695_2531600_-	COG1057, NadD, Nicotinic acid mononucleotide adenylyltransferase [Coenzyme metabolism]	NA|306aa|up_8|NZ_CP017603.1_2531762_2532680_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	cas6|232aa|up_7|NZ_CP017603.1_2533120_2533816_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b1|574aa|up_6|NZ_CP017603.1_2533827_2535549_+	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas7b|325aa|up_5|NZ_CP017603.1_2535551_2536526_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas5|251aa|up_4|NZ_CP017603.1_2536528_2537281_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|862aa|up_3|NZ_CP017603.1_2537328_2539914_+	cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3	cas4|164aa|up_2|NZ_CP017603.1_2539922_2540414_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|333aa|up_1|NZ_CP017603.1_2540423_2541422_+	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|97aa|up_0|NZ_CP017603.1_2541422_2541713_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|792aa|down_0|NZ_CP017603.1_2543185_2545561_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|157aa|down_1|NZ_CP017603.1_2545581_2546052_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|364aa|down_2|NZ_CP017603.1_2546122_2547214_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|865aa|down_3|NZ_CP017603.1_2547519_2550114_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|350aa|down_4|NZ_CP017603.1_2550106_2551156_+	cd17536, REC_YesN-like, phosphoacceptor receiver (REC) domain of YesN and related helix-turn-helix containing response regulators	NA|417aa|down_5|NZ_CP017603.1_2551413_2552664_+	TIGR03407, urea_ABC_UrtA, urea ABC transporter, urea binding protein	NA|303aa|down_6|NZ_CP017603.1_2552759_2553668_+	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|364aa|down_7|NZ_CP017603.1_2553682_2554774_+	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|252aa|down_8|NZ_CP017603.1_2554781_2555537_+	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|231aa|down_9|NZ_CP017603.1_2555539_2556232_+	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE
GCF_001854185.1_ASM185418v1	NZ_CP017603	Clostridium formicaceticum strain ATCC 27076 chromosome, complete genome	4	3724927-3726743	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b2,cas6,WYL	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	Unclear	ATTTACATTCTACTGTAGTTCTATTAAAGG,ATTTACATTCTACTGTAGTTCTATTAAAGG,ATTTACATTCTACTGTAGTTCTATTAAAGG	30,30,30	0	0	NA	NA	NA:NA:NA	27,27,26	27	Unclear	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	NA|104aa|up_8|NZ_CP017603.1_3711014_3711326_-,NA	NA|398aa|up_9|NZ_CP017603.1_3709780_3710974_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|104aa|up_8|NZ_CP017603.1_3711014_3711326_-	NA	NA|194aa|up_7|NZ_CP017603.1_3711360_3711942_-	pfam10080, DUF2318, Predicted membrane protein (DUF2318)	NA|405aa|up_6|NZ_CP017603.1_3712545_3713760_+	cd00548, NrfA-like, cytochrome c nitrite reductase and similar proteins	NA|430aa|up_5|NZ_CP017603.1_3713814_3715104_+	pfam05140, ResB, ResB-like family	NA|277aa|up_4|NZ_CP017603.1_3715121_3715952_+	TIGR03144, cytochrome_c_biogenesis_protein_chloroplast, cytochrome c-type biogenesis protein CcsB	NA|713aa|up_3|NZ_CP017603.1_3716076_3718215_-	PRK11249, katE, hydroperoxidase II; Provisional	NA|198aa|up_2|NZ_CP017603.1_3718593_3719187_+	COG5663, COG5663, Uncharacterized conserved protein [Function unknown]	NA|431aa|up_1|NZ_CP017603.1_3719348_3720641_+	COG2200, Rtn, c-di-GMP phosphodiesterase class I (EAL domain) [Signal    transduction mechanisms]	NA|1188aa|up_0|NZ_CP017603.1_3720889_3724453_-	NF033452, BREX_1_MTaseX, BREX-1 system adenine-specific DNA-methyltransferase PglX	cas2|93aa|down_0|NZ_CP017603.1_3726928_3727207_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|down_1|NZ_CP017603.1_3727211_3728204_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|164aa|down_2|NZ_CP017603.1_3728213_3728705_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|744aa|down_3|NZ_CP017603.1_3728735_3730967_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|252aa|down_4|NZ_CP017603.1_3730989_3731745_-	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|295aa|down_5|NZ_CP017603.1_3731731_3732616_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas8b2|557aa|down_6|NZ_CP017603.1_3732615_3734286_-	cd09754, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|243aa|down_7|NZ_CP017603.1_3734298_3735027_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	WYL|316aa|down_8|NZ_CP017603.1_3735127_3736075_-	pfam13280, WYL, WYL domain	NA|207aa|down_9|NZ_CP017603.1_3736254_3736875_-	COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism]
GCF_001854185.1_ASM185418v1	NZ_CP017603	Clostridium formicaceticum strain ATCC 27076 chromosome, complete genome	5	4200017-4200109	5	CRISPRCasFinder	no	RT	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	Unclear	AAACACATTATCATTTCCTCCTTTCTTGCGTG	32	0	0	NA	NA	NA	1	1	Orphan	DinG,WYL,Cas14u_CAS-V,RT,csa3,cas14k,c2c9_V-U4,PD-DExK,cas3,cas14j,cas6,cas8b1,cas7b,cas5,cas4,cas1,cas2,DEDDh,cas7,cas8b2	NA,NA	NA|194aa|up_9|NZ_CP017603.1_4191584_4192166_-	COG1853, COG1853, Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family [General function prediction only]	NA|264aa|up_8|NZ_CP017603.1_4192231_4193023_-	PRK07475, PRK07475, hypothetical protein; Provisional	NA|66aa|up_7|NZ_CP017603.1_4193140_4193338_-	cd00565, Ubl_ThiS, ubiquitin-like (Ubl) domain found in sulfur carrier protein ThiS	NA|376aa|up_6|NZ_CP017603.1_4193360_4194488_-	pfam01314, AFOR_C, Aldehyde ferredoxin oxidoreductase, domains 2 & 3	NA|309aa|up_5|NZ_CP017603.1_4194521_4195448_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|301aa|up_4|NZ_CP017603.1_4195585_4196488_-	cd00408, DHDPS-like, Dihydrodipicolinate synthase family	NA|398aa|up_3|NZ_CP017603.1_4196517_4197711_-	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|87aa|up_2|NZ_CP017603.1_4197707_4197968_-	cd19946, GlpA-like_Fer2_BFD-like, bacterioferritin-associated ferredoxin (BFD)-like [2Fe-2S]-binding domain of anaerobic glycerol 3-phosphate dehydrogenase subunit A, hydrogen cyanide synthase subunit B, and similar proteins	NA|165aa|up_1|NZ_CP017603.1_4198065_4198560_-	COG1245, COG1245, Predicted ATPase, RNase L inhibitor (RLI) homolog [General function prediction only]	NA|372aa|up_0|NZ_CP017603.1_4198589_4199705_-	TIGR01372, sarcosine_oxidase_alpha_subunit, sarcosine oxidase, alpha subunit family, heterotetrameric form	NA|568aa|down_0|NZ_CP017603.1_4200396_4202100_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|357aa|down_1|NZ_CP017603.1_4202728_4203799_-	cd02110, SO_family_Moco_dimer, Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain	NA|465aa|down_2|NZ_CP017603.1_4204172_4205567_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|294aa|down_3|NZ_CP017603.1_4205898_4206780_+	smart00257, LysM, Lysin motif	NA|599aa|down_4|NZ_CP017603.1_4207329_4209126_-	cd01454, vWA_norD_type, norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases	NA|308aa|down_5|NZ_CP017603.1_4209139_4210063_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|97aa|down_6|NZ_CP017603.1_4210439_4210730_-	pfam09308, LuxQ-periplasm, LuxQ, periplasmic	NA|230aa|down_7|NZ_CP017603.1_4210779_4211469_-	COG0731, COG0731, Fe-S oxidoreductases [Energy production and conversion]	NA|236aa|down_8|NZ_CP017603.1_4211487_4212195_-	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|443aa|down_9|NZ_CP017603.1_4212260_4213589_-	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA
