assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_006228565.1_ASM622856v1	NZ_CP031054	Moorella thermoacetica strain 39073-HH chromosome, complete genome	1	512606-514297	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,WYL	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	Type I-C,Type I-U, Type I-U?	GTTTCAACCCTCGCCCGGCATGGAAGCCGGGCGCGAC,GTTTCAACCCTCGCCCGGCATGGAAGCCGGGCGCGAC,GTTTCAACCCTCGCCCGGCATGGAAGCCGGGCGCGAC	37,37,37	0	0	NA	NA	I-C,III-B:I-C,III-B:I-C,III-B	23,23,23	23	TypeI-C,TypeI-U,TypeI-U?	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	NA|272aa|up_5|NZ_CP031054.1_504179_504995_+,NA|81aa|up_4|NZ_CP031054.1_505249_505492_+,NA|486aa|up_3|NZ_CP031054.1_505539_506997_+,NA	NA|1406aa|up_9|NZ_CP031054.1_498740_502958_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|74aa|up_8|NZ_CP031054.1_503150_503372_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General function prediction only]	NA|79aa|up_7|NZ_CP031054.1_503368_503605_+	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family [General function prediction only]	NA|87aa|up_6|NZ_CP031054.1_503699_503960_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|272aa|up_5|NZ_CP031054.1_504179_504995_+	NA	NA|81aa|up_4|NZ_CP031054.1_505249_505492_+	NA	NA|486aa|up_3|NZ_CP031054.1_505539_506997_+	NA	NA|204aa|up_2|NZ_CP031054.1_507089_507701_+	pfam13835, DUF4194, Domain of unknown function (DUF4194)	NA|1208aa|up_1|NZ_CP031054.1_507669_511293_+	pfam13558, SbcCD_C, Putative exonuclease SbcCD, C subunit	NA|426aa|up_0|NZ_CP031054.1_511279_512557_+	cd00223, TOPRIM_TopoIIB_SPO, TOPRIM_TopoIIB_SPO: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IIB family of DNA topoisomerases and Spo11	cas2|97aa|down_0|NZ_CP031054.1_514468_514759_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_CP031054.1_514791_515823_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|215aa|down_2|NZ_CP031054.1_515894_516539_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas7|298aa|down_3|NZ_CP031054.1_516622_517516_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|561aa|down_4|NZ_CP031054.1_517608_519291_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|234aa|down_5|NZ_CP031054.1_519300_520002_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|725aa|down_6|NZ_CP031054.1_520063_522238_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	WYL|70aa|down_7|NZ_CP031054.1_523055_523265_+	pfam13280, WYL, WYL domain	NA|348aa|down_8|NZ_CP031054.1_523825_524869_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|407aa|down_9|NZ_CP031054.1_524881_526102_-	pfam13240, zinc_ribbon_2, zinc-ribbon domain
GCF_006228565.1_ASM622856v1	NZ_CP031054	Moorella thermoacetica strain 39073-HH chromosome, complete genome	2	726455-726546	2	CRISPRCasFinder	no	WYL	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	Unclear	GGGGCGCATGCTCTATGGCCTCTCCGGC	28	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	NA|91aa|up_9|NZ_CP031054.1_717037_717310_+,NA|91aa|down_1|NZ_CP031054.1_727732_728005_-,NA|191aa|down_4|NZ_CP031054.1_730047_730620_+	NA|91aa|up_9|NZ_CP031054.1_717037_717310_+	NA	NA|159aa|up_8|NZ_CP031054.1_717281_717758_+	cd18738, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|305aa|up_7|NZ_CP031054.1_718377_719292_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|227aa|up_6|NZ_CP031054.1_719427_720108_+	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|154aa|up_5|NZ_CP031054.1_720439_720901_-	cd18738, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|83aa|up_4|NZ_CP031054.1_720888_721137_-	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|93aa|up_3|NZ_CP031054.1_721615_721894_+	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|133aa|up_2|NZ_CP031054.1_721881_722280_+	cd18680, PIN_MtVapC20-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC20 and related proteins	NA|837aa|up_1|NZ_CP031054.1_722528_725039_-	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|377aa|up_0|NZ_CP031054.1_725042_726173_-	COG3287, COG3287, Uncharacterized conserved protein [Function unknown]	NA|271aa|down_0|NZ_CP031054.1_726909_727722_+	pfam00072, Response_reg, Response regulator receiver domain	NA|91aa|down_1|NZ_CP031054.1_727732_728005_-	NA	NA|306aa|down_2|NZ_CP031054.1_728136_729054_+	pfam04294, VanW, VanW like protein	NA|183aa|down_3|NZ_CP031054.1_729251_729800_+	PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic subunit	NA|191aa|down_4|NZ_CP031054.1_730047_730620_+	NA	NA|529aa|down_5|NZ_CP031054.1_730669_732256_+	cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup 1 of unknown function	NA|267aa|down_6|NZ_CP031054.1_732541_733342_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|359aa|down_7|NZ_CP031054.1_733400_734477_+	cd01536, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of active transport systems that are members of the type 1 periplasmic binding protein (PBP1) superfamily	NA|493aa|down_8|NZ_CP031054.1_734537_736016_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|331aa|down_9|NZ_CP031054.1_736043_737036_+	cd06579, TM_PBP1_transp_AraH_like, Transmembrane subunit (TM) of Escherichia coli AraH and related proteins
GCF_006228565.1_ASM622856v1	NZ_CP031054	Moorella thermoacetica strain 39073-HH chromosome, complete genome	3	1789091-1792110	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7,cas6	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	Unclear	GTTCAAATTCCTCTATGGTCGATGGTCAC,GTTCAAATTCCTCTATGGTCGATGGTCAC,GTTCAAATTCCTCTATGGTCGATGGTCAC	29,29,29	0	0	NA	NA	NA:NA:NA	46,46,46	46	Unclear	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	NA|62aa|up_1|NZ_CP031054.1_1785472_1785658_-,NA	NA|153aa|up_9|NZ_CP031054.1_1774209_1774668_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|195aa|up_8|NZ_CP031054.1_1774778_1775363_-	cd04501, SGNH_hydrolase_like_4, Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases	NA|574aa|up_7|NZ_CP031054.1_1775532_1777254_-	TIGR02512, Periplasmic_hydrogenase_large_subunit, [FeFe] hydrogenase, group A	NA|158aa|up_6|NZ_CP031054.1_1779173_1779647_-	pfam01257, 2Fe-2S_thioredx, Thioredoxin-like [2Fe-2S] ferredoxin	NA|365aa|up_5|NZ_CP031054.1_1780122_1781217_-	pfam16864, dimerization2, dimerization domain	NA|363aa|up_4|NZ_CP031054.1_1781213_1782302_-	pfam02277, DBI_PRT, Phosphoribosyltransferase	NA|427aa|up_3|NZ_CP031054.1_1782324_1783605_-	PRK13352, PRK13352, phosphomethylpyrimidine synthase ThiC	NA|433aa|up_2|NZ_CP031054.1_1783617_1784916_-	PRK13352, PRK13352, phosphomethylpyrimidine synthase ThiC	NA|62aa|up_1|NZ_CP031054.1_1785472_1785658_-	NA	NA|403aa|up_0|NZ_CP031054.1_1786366_1787575_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	cas2|92aa|down_0|NZ_CP031054.1_1792191_1792467_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|322aa|down_1|NZ_CP031054.1_1792475_1793441_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|178aa|down_2|NZ_CP031054.1_1793457_1793991_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|918aa|down_3|NZ_CP031054.1_1793980_1796734_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|295aa|down_4|NZ_CP031054.1_1796733_1797618_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|354aa|down_5|NZ_CP031054.1_1797617_1798679_-	pfam01905, DevR, CRISPR-associated negative auto-regulator DevR/Csa2	NA|493aa|down_6|NZ_CP031054.1_1798708_1800187_-	TIGR01908, Uncharacterized_protein_aq_372, CRISPR-associated protein Cas8b1/Cst1, subtype I-B/TNEAP	cas6|227aa|down_7|NZ_CP031054.1_1800190_1800871_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|468aa|down_8|NZ_CP031054.1_1801169_1802573_-	PRK09613, thiH, thiamine biosynthesis protein ThiH; Reviewed	NA|351aa|down_9|NZ_CP031054.1_1802687_1803740_-	PRK07094, PRK07094, biotin synthase; Provisional
