assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000019065.1_ASM1906v1	NC_010320	Thermoanaerobacter sp. X514, complete genome	1	764967-765396	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	RT	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	Unclear	GTTTTTAGCTTACCTATAAGGGATTGAAA,GTTTTTAGCTTACCTATAAGGGATTGAAAC,GTTTTTAGCTTACCTATAAGGGATTGAAA	29,30,29	0	0	NA	NA	NA:NA:NA	5,5,6	6	Orphan	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	NA|129aa|up_7|NC_010320.1_756447_756834_+,NA|103aa|up_4|NC_010320.1_758861_759170_+,NA|184aa|down_2|NC_010320.1_767585_768137_+,NA|191aa|down_3|NC_010320.1_768878_769451_-,NA|64aa|down_6|NC_010320.1_771582_771774_+	NA|234aa|up_9|NC_010320.1_752388_753090_+	cd16269, GBP_C, Guanylate-binding protein, C-terminal domain	NA|1090aa|up_8|NC_010320.1_753181_756451_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|129aa|up_7|NC_010320.1_756447_756834_+	NA	NA|173aa|up_6|NC_010320.1_756863_757382_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|343aa|up_5|NC_010320.1_757374_758403_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|103aa|up_4|NC_010320.1_758861_759170_+	NA	NA|283aa|up_3|NC_010320.1_759389_760238_+	cd01335, Radical_SAM, Radical SAM superfamily	NA|362aa|up_2|NC_010320.1_760359_761445_+	TIGR04474, conserved_hypothetical_protein, three-Cys-motif partner protein	NA|208aa|up_1|NC_010320.1_761929_762553_+	pfam13814, Replic_Relax, Replication-relaxation	NA|558aa|up_0|NC_010320.1_762521_764195_+	TIGR03743, SXT_TraD, conjugative coupling factor TraD, SXT/TOL subfamily	NA|57aa|down_0|NC_010320.1_765471_765642_+	TIGR03769, P_ac_wall_RPT, actinobacterial surface-anchored protein domain	NA|478aa|down_1|NC_010320.1_765729_767163_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|184aa|down_2|NC_010320.1_767585_768137_+	NA	NA|191aa|down_3|NC_010320.1_768878_769451_-	NA	NA|100aa|down_4|NC_010320.1_770521_770821_+	pfam11213, DUF3006, Protein of unknown function (DUF3006)	NA|221aa|down_5|NC_010320.1_770906_771569_+	pfam13630, SdpI, SdpI/YfhL protein family	NA|64aa|down_6|NC_010320.1_771582_771774_+	NA	NA|79aa|down_7|NC_010320.1_771759_771996_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|241aa|down_8|NC_010320.1_772259_772982_-	cd01948, EAL, EAL domain	NA|149aa|down_9|NC_010320.1_773107_773554_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain
GCF_000019065.1_ASM1906v1	NC_010320	Thermoanaerobacter sp. X514, complete genome	2	2340404-2341103	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	Unclear	GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTTTAGCCTACCTATAAGGAATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	10,10,9	10	Unclear	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	NA,NA	NA|1777aa|up_9|NC_010320.1_2323502_2328833_-	cd07475, Peptidases_S8_C5a_Peptidase, Peptidase S8 family domain in Streptococcal C5a peptidases	NA|309aa|up_8|NC_010320.1_2329282_2330209_-	PRK01212, PRK01212, homoserine kinase; Provisional	NA|352aa|up_7|NC_010320.1_2330198_2331254_-	PRK07409, PRK07409, threonine synthase; Validated	NA|419aa|up_6|NC_010320.1_2331257_2332514_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|147aa|up_5|NC_010320.1_2332526_2332967_-	PRK04435, PRK04435, ACT domain-containing protein	NA|260aa|up_4|NC_010320.1_2333173_2333953_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain	cas2|88aa|up_3|NC_010320.1_2336259_2336523_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_2|NC_010320.1_2336539_2337532_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|166aa|up_1|NC_010320.1_2337528_2338026_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|243aa|up_0|NC_010320.1_2338231_2338960_-	cd03146, GAT1_Peptidase_E, Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E	NA|407aa|down_0|NC_010320.1_2341428_2342649_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|407aa|down_1|NC_010320.1_2347195_2348416_+	pfam01548, DEDD_Tnp_IS110, Transposase	cas3|779aa|down_2|NC_010320.1_2363262_2365599_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|237aa|down_3|NC_010320.1_2365613_2366324_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|294aa|down_4|NC_010320.1_2366338_2367220_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|661aa|down_5|NC_010320.1_2367206_2369189_-	cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|252aa|down_6|NC_010320.1_2369243_2369999_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|138aa|down_7|NC_010320.1_2370187_2370601_-	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|410aa|down_8|NC_010320.1_2370597_2371827_-	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|350aa|down_9|NC_010320.1_2371823_2372873_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD
GCF_000019065.1_ASM1906v1	NC_010320	Thermoanaerobacter sp. X514, complete genome	3	2342735-2346872	3,3,3	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas4,cas3,cas5	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	Unclear	GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTTTAGCCTACCTATAAGGAATTGAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	62,61,61	62	Unclear	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	NA,NA	NA|309aa|up_9|NC_010320.1_2329282_2330209_-	PRK01212, PRK01212, homoserine kinase; Provisional	NA|352aa|up_8|NC_010320.1_2330198_2331254_-	PRK07409, PRK07409, threonine synthase; Validated	NA|419aa|up_7|NC_010320.1_2331257_2332514_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|147aa|up_6|NC_010320.1_2332526_2332967_-	PRK04435, PRK04435, ACT domain-containing protein	NA|260aa|up_5|NC_010320.1_2333173_2333953_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain	cas2|88aa|up_4|NC_010320.1_2336259_2336523_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_3|NC_010320.1_2336539_2337532_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|166aa|up_2|NC_010320.1_2337528_2338026_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|243aa|up_1|NC_010320.1_2338231_2338960_-	cd03146, GAT1_Peptidase_E, Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E	NA|407aa|up_0|NC_010320.1_2341428_2342649_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|407aa|down_0|NC_010320.1_2347195_2348416_+	pfam01548, DEDD_Tnp_IS110, Transposase	cas3|779aa|down_1|NC_010320.1_2363262_2365599_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|237aa|down_2|NC_010320.1_2365613_2366324_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|294aa|down_3|NC_010320.1_2366338_2367220_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|661aa|down_4|NC_010320.1_2367206_2369189_-	cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|252aa|down_5|NC_010320.1_2369243_2369999_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|138aa|down_6|NC_010320.1_2370187_2370601_-	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|410aa|down_7|NC_010320.1_2370597_2371827_-	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|350aa|down_8|NC_010320.1_2371823_2372873_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD	NA|468aa|down_9|NC_010320.1_2372883_2374287_-	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB
GCF_000019065.1_ASM1906v1	NC_010320	Thermoanaerobacter sp. X514, complete genome	4	2348502-2363082	4,4,4	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	Type I-B	GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTTTAGCCTACCTATAAGGAATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	219,219,218	219	TypeI-B	RT,csa3,cas3,cas4,Cas14b_CAS-V-F,DEDDh,csm3gr7,csx19,cas10,csx1,csx20,cas14k,cas2,cas1,cas5,cas7b,cas8b1,cas6	NA,NA	NA|352aa|up_9|NC_010320.1_2330198_2331254_-	PRK07409, PRK07409, threonine synthase; Validated	NA|419aa|up_8|NC_010320.1_2331257_2332514_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|147aa|up_7|NC_010320.1_2332526_2332967_-	PRK04435, PRK04435, ACT domain-containing protein	NA|260aa|up_6|NC_010320.1_2333173_2333953_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain	cas2|88aa|up_5|NC_010320.1_2336259_2336523_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_4|NC_010320.1_2336539_2337532_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|166aa|up_3|NC_010320.1_2337528_2338026_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|243aa|up_2|NC_010320.1_2338231_2338960_-	cd03146, GAT1_Peptidase_E, Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E	NA|407aa|up_1|NC_010320.1_2341428_2342649_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|407aa|up_0|NC_010320.1_2347195_2348416_+	pfam01548, DEDD_Tnp_IS110, Transposase	cas3|779aa|down_0|NC_010320.1_2363262_2365599_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|237aa|down_1|NC_010320.1_2365613_2366324_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|294aa|down_2|NC_010320.1_2366338_2367220_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|661aa|down_3|NC_010320.1_2367206_2369189_-	cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|252aa|down_4|NC_010320.1_2369243_2369999_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|138aa|down_5|NC_010320.1_2370187_2370601_-	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|410aa|down_6|NC_010320.1_2370597_2371827_-	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|350aa|down_7|NC_010320.1_2371823_2372873_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD	NA|468aa|down_8|NC_010320.1_2372883_2374287_-	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB	NA|249aa|down_9|NC_010320.1_2374306_2375053_-	TIGR01978, Probable_ATP-dependent_transporter_SufC, FeS assembly ATPase SufC
