assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020945.1_ASM2094v1	NC_011295	Coprothermobacter proteolyticus DSM 5265, complete sequence	1	41136-41230	1	CRISPRCasFinder	no		csa3,cas3,cas3HD,cas2,cas1	Orphan	TCGTTTTTATCGTTCCTATAAGGA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas3HD,cas2,cas1	NA|127aa|up_8|NC_011295.1_35944_36325_+,NA|199aa|up_4|NC_011295.1_37764_38361_+,NA|116aa|up_1|NC_011295.1_39256_39604_+,NA|117aa|down_0|NC_011295.1_41233_41584_+,NA|46aa|down_2|NC_011295.1_42799_42937_+	NA|48aa|up_9|NC_011295.1_35776_35920_-	pfam10941, DUF2620, Protein of unknown function DUF2620	NA|127aa|up_8|NC_011295.1_35944_36325_+	NA	NA|90aa|up_7|NC_011295.1_36325_36595_+	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|59aa|up_6|NC_011295.1_36536_36713_+	COG2169, Ada, Adenosine deaminase [Nucleotide transport and metabolism]	NA|289aa|up_5|NC_011295.1_36884_37751_+	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|199aa|up_4|NC_011295.1_37764_38361_+	NA	NA|196aa|up_3|NC_011295.1_38361_38949_+	pfam13851, GAS, Growth-arrest specific micro-tubule binding	NA|50aa|up_2|NC_011295.1_39042_39192_+	pfam10087, DUF2325, Uncharacterized protein conserved in bacteria (DUF2325)	NA|116aa|up_1|NC_011295.1_39256_39604_+	NA	NA|449aa|up_0|NC_011295.1_39588_40935_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|117aa|down_0|NC_011295.1_41233_41584_+	NA	NA|366aa|down_1|NC_011295.1_41699_42797_+	COG1322, COG1322, Predicted nuclease of restriction endonuclease-like fold, RmuC family [General function prediction only]	NA|46aa|down_2|NC_011295.1_42799_42937_+	NA	NA|285aa|down_3|NC_011295.1_43236_44091_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|272aa|down_4|NC_011295.1_44217_45033_+	pfam09509, Hypoth_Ymh, Protein of unknown function (Hypoth_ymh)	NA|290aa|down_5|NC_011295.1_45842_46712_+	pfam06267, DUF1028, Family of unknown function (DUF1028)	NA|274aa|down_6|NC_011295.1_46921_47743_+	cd19071, AKR_AKR1-5-like, AKR1/2/3/4/5 family of aldo-keto reductase (AKR) and similar proteins	NA|190aa|down_7|NC_011295.1_47960_48530_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|290aa|down_8|NC_011295.1_48526_49396_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|217aa|down_9|NC_011295.1_49710_50361_+	PRK13189, PRK13189, peroxiredoxin; Provisional
GCF_000020945.1_ASM2094v1	NC_011295	Coprothermobacter proteolyticus DSM 5265, complete sequence	2	42877-42974	2	CRISPRCasFinder	no		csa3,cas3,cas3HD,cas2,cas1	Orphan	GGTTTCAGCTGGATATATAGAGGAGTGGAAC	31	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas3HD,cas2,cas1	NA|199aa|up_6|NC_011295.1_37764_38361_+,NA|116aa|up_3|NC_011295.1_39256_39604_+,NA|117aa|up_1|NC_011295.1_41233_41584_+,NA	NA|90aa|up_9|NC_011295.1_36325_36595_+	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|59aa|up_8|NC_011295.1_36536_36713_+	COG2169, Ada, Adenosine deaminase [Nucleotide transport and metabolism]	NA|289aa|up_7|NC_011295.1_36884_37751_+	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|199aa|up_6|NC_011295.1_37764_38361_+	NA	NA|196aa|up_5|NC_011295.1_38361_38949_+	pfam13851, GAS, Growth-arrest specific micro-tubule binding	NA|50aa|up_4|NC_011295.1_39042_39192_+	pfam10087, DUF2325, Uncharacterized protein conserved in bacteria (DUF2325)	NA|116aa|up_3|NC_011295.1_39256_39604_+	NA	NA|449aa|up_2|NC_011295.1_39588_40935_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|117aa|up_1|NC_011295.1_41233_41584_+	NA	NA|366aa|up_0|NC_011295.1_41699_42797_+	COG1322, COG1322, Predicted nuclease of restriction endonuclease-like fold, RmuC family [General function prediction only]	NA|285aa|down_0|NC_011295.1_43236_44091_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|272aa|down_1|NC_011295.1_44217_45033_+	pfam09509, Hypoth_Ymh, Protein of unknown function (Hypoth_ymh)	NA|290aa|down_2|NC_011295.1_45842_46712_+	pfam06267, DUF1028, Family of unknown function (DUF1028)	NA|274aa|down_3|NC_011295.1_46921_47743_+	cd19071, AKR_AKR1-5-like, AKR1/2/3/4/5 family of aldo-keto reductase (AKR) and similar proteins	NA|190aa|down_4|NC_011295.1_47960_48530_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|290aa|down_5|NC_011295.1_48526_49396_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|217aa|down_6|NC_011295.1_49710_50361_+	PRK13189, PRK13189, peroxiredoxin; Provisional	NA|662aa|down_7|NC_011295.1_50441_52427_+	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|90aa|down_8|NC_011295.1_52619_52889_-	COG1977, MoaD, Molybdopterin converting factor, small subunit [Coenzyme metabolism]	NA|615aa|down_9|NC_011295.1_52896_54741_-	COG2414, COG2414, Aldehyde:ferredoxin oxidoreductase [Energy production and conversion]
GCF_000020945.1_ASM2094v1	NC_011295	Coprothermobacter proteolyticus DSM 5265, complete sequence	3	1263870-1264966	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	csa3,cas2,cas1	csa3,cas3,cas3HD,cas2,cas1	Type I-A	GTTTCAATCCCTTGTAGGTAAGCTAGAAAC,GTTTCAATCCCTTGTAGGTAAGCTAGAAAC,GTTTCAATCCCTTGTAGGTAAGCTAGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	16,16,16	16	Unclear	csa3,cas3,cas3HD,cas2,cas1	NA|69aa|up_0|NC_011295.1_1263543_1263750_-,NA|46aa|down_2|NC_011295.1_1266805_1266943_-,NA|232aa|down_3|NC_011295.1_1267197_1267893_-,NA|67aa|down_7|NC_011295.1_1273216_1273417_-	NA|391aa|up_9|NC_011295.1_1253976_1255149_+	cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine	NA|629aa|up_8|NC_011295.1_1255376_1257263_+	COG2414, COG2414, Aldehyde:ferredoxin oxidoreductase [Energy production and conversion]	NA|149aa|up_7|NC_011295.1_1257317_1257764_-	cd03249, ABC_MTABC3_MDL1_MDL2, ATP-binding cassette domain of a mitochondrial protein MTABC3 and related proteins	NA|343aa|up_6|NC_011295.1_1258202_1259231_+	pfam01757, Acyl_transf_3, Acyltransferase family	NA|118aa|up_5|NC_011295.1_1259507_1259861_-	pfam13192, Thioredoxin_3, Thioredoxin domain	NA|338aa|up_4|NC_011295.1_1259911_1260925_-	COG0701, COG0701, Predicted permeases [General function prediction only]	csa3|85aa|up_3|NC_011295.1_1260945_1261200_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|273aa|up_2|NC_011295.1_1261790_1262609_-	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|141aa|up_1|NC_011295.1_1262722_1263145_-	COG1661, COG1661, Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only]	NA|69aa|up_0|NC_011295.1_1263543_1263750_-	NA	NA|376aa|down_0|NC_011295.1_1265065_1266193_+	cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family domain in Vpr-like proteins	NA|139aa|down_1|NC_011295.1_1266167_1266584_+	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|46aa|down_2|NC_011295.1_1266805_1266943_-	NA	NA|232aa|down_3|NC_011295.1_1267197_1267893_-	NA	NA|719aa|down_4|NC_011295.1_1267892_1270049_-	cd17291, RMtype1_S_MgeORF438P-TRD-CR_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|742aa|down_5|NC_011295.1_1270520_1272746_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|128aa|down_6|NC_011295.1_1272840_1273224_-	COG4969, PilA, Tfp pilus assembly protein, major pilin PilA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|67aa|down_7|NC_011295.1_1273216_1273417_-	NA	NA|328aa|down_8|NC_011295.1_1273721_1274705_-	pfam01555, N6_N4_Mtase, DNA methylase	NA|311aa|down_9|NC_011295.1_1274709_1275642_-	pfam04556, DpnII, DpnII restriction endonuclease
GCF_000020945.1_ASM2094v1	NC_011295	Coprothermobacter proteolyticus DSM 5265, complete sequence	4	1273430-1273524	4	CRISPRCasFinder	no	csa3,cas2,cas1	csa3,cas3,cas3HD,cas2,cas1	Type I-A	AATGTTTCAATCCCTTGTAGGTAAGCTA	28	0	0	NA	NA	NA	1	1	Unclear	csa3,cas3,cas3HD,cas2,cas1	NA|69aa|up_8|NC_011295.1_1263543_1263750_-,NA|46aa|up_5|NC_011295.1_1266805_1266943_-,NA|232aa|up_4|NC_011295.1_1267197_1267893_-,NA|67aa|up_0|NC_011295.1_1273216_1273417_-,NA|668aa|down_6|NC_011295.1_1279175_1281179_-,NA|55aa|down_7|NC_011295.1_1281308_1281473_-	NA|141aa|up_9|NC_011295.1_1262722_1263145_-	COG1661, COG1661, Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only]	NA|69aa|up_8|NC_011295.1_1263543_1263750_-	NA	NA|376aa|up_7|NC_011295.1_1265065_1266193_+	cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family domain in Vpr-like proteins	NA|139aa|up_6|NC_011295.1_1266167_1266584_+	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|46aa|up_5|NC_011295.1_1266805_1266943_-	NA	NA|232aa|up_4|NC_011295.1_1267197_1267893_-	NA	NA|719aa|up_3|NC_011295.1_1267892_1270049_-	cd17291, RMtype1_S_MgeORF438P-TRD-CR_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|742aa|up_2|NC_011295.1_1270520_1272746_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|128aa|up_1|NC_011295.1_1272840_1273224_-	COG4969, PilA, Tfp pilus assembly protein, major pilin PilA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|67aa|up_0|NC_011295.1_1273216_1273417_-	NA	NA|328aa|down_0|NC_011295.1_1273721_1274705_-	pfam01555, N6_N4_Mtase, DNA methylase	NA|311aa|down_1|NC_011295.1_1274709_1275642_-	pfam04556, DpnII, DpnII restriction endonuclease	NA|316aa|down_2|NC_011295.1_1275631_1276579_-	TIGR00571, DNA_adenine_methylase, DNA adenine methylase (dam)	cas2|88aa|down_3|NC_011295.1_1276962_1277226_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|50aa|down_4|NC_011295.1_1277235_1277385_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	NA|500aa|down_5|NC_011295.1_1277621_1279121_+	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|668aa|down_6|NC_011295.1_1279175_1281179_-	NA	NA|55aa|down_7|NC_011295.1_1281308_1281473_-	NA	NA|815aa|down_8|NC_011295.1_1281796_1284241_-	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|129aa|down_9|NC_011295.1_1284243_1284630_-	pfam03965, Penicillinase_R, Penicillinase repressor
