assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000021645.1_ASM2164v1	NC_011661	Dictyoglomus turgidum DSM 6724, complete sequence	1	470870-474424	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	csa3	WYL,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,cas3HD	Type I-A	GTTTCAATCCCTTATAGGTACGCTACAAAC,GTTTCAATCCCTTATAGGTACGCTACAAAC,GTTTCAATCCCTTATAGGTACGCTACAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	52,53,53	53	Orphan	WYL,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,cas3HD	NA|148aa|up_9|NC_011661.1_462216_462660_+,NA|77aa|down_2|NC_011661.1_475288_475519_-	NA|148aa|up_9|NC_011661.1_462216_462660_+	NA	NA|118aa|up_8|NC_011661.1_462715_463069_-	COG1148, HdrA, Heterodisulfide reductase, subunit A and related polyferredoxins [Energy production and conversion]	NA|79aa|up_7|NC_011661.1_463065_463302_-	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|135aa|up_6|NC_011661.1_463330_463735_-	pfam08859, DGC, DGC domain	csa3|132aa|up_5|NC_011661.1_463760_464156_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|218aa|up_4|NC_011661.1_465429_466083_-	cd16423, HAD_BPGM-like, uncharacterized subfamily of beta-phosphoglucomutase-like family, similar to uncharacterized Bacillus subtilis YhcW	NA|419aa|up_3|NC_011661.1_466505_467762_+	COG2723, BglB, Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]	NA|336aa|up_2|NC_011661.1_467754_468762_-	TIGR04070, photo_TT_lyase, spore photoproduct lyase	NA|278aa|up_1|NC_011661.1_468730_469564_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|419aa|up_0|NC_011661.1_469560_470817_-	pfam01566, Nramp, Natural resistance-associated macrophage protein	NA|53aa|down_0|NC_011661.1_474802_474961_-	pfam12841, YvrJ, YvrJ protein family	NA|111aa|down_1|NC_011661.1_474963_475296_-	pfam08291, Peptidase_M15_3, Peptidase M15	NA|77aa|down_2|NC_011661.1_475288_475519_-	NA	NA|77aa|down_3|NC_011661.1_475575_475806_-	pfam11148, DUF2922, Protein of unknown function (DUF2922)	NA|74aa|down_4|NC_011661.1_475841_476063_-	pfam07872, DUF1659, Protein of unknown function (DUF1659)	NA|374aa|down_5|NC_011661.1_476178_477300_+	pfam13598, DUF4139, Domain of unknown function (DUF4139)	NA|258aa|down_6|NC_011661.1_477343_478117_+	COG0348, NapH, Polyferredoxin [Energy production and conversion]	NA|516aa|down_7|NC_011661.1_478133_479681_+	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]	NA|423aa|down_8|NC_011661.1_479677_480946_+	cd03877, M28_like, M28 Zn-peptidase, many containing a protease-associated (PA) domain insert	NA|303aa|down_9|NC_011661.1_480999_481908_+	PRK11275, pstC, phosphate ABC transporter permease PstC
GCF_000021645.1_ASM2164v1	NC_011661	Dictyoglomus turgidum DSM 6724, complete sequence	2	615153-617311	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6	WYL,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,cas3HD	Type I-B,Type III-D,Type III-B,Type III-C,Type III-A	GTTTCAATCCCTTATAGGTACGCTACAAAC,GTTTCAATCCCTTATAGGTACGCTACAAAC,GTTTCAATCCCTTATAGGTACGCTACAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	31,32,32	32	TypeI-B,TypeIII-D,TypeIII-B,TypeIII-C,TypeIII-A	WYL,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,cas3HD	NA|196aa|up_9|NC_011661.1_604831_605419_+,NA	NA|196aa|up_9|NC_011661.1_604831_605419_+	NA	NA|291aa|up_8|NC_011661.1_605399_606272_-	COG1578, COG1578, Uncharacterized conserved protein [Function unknown]	csx1|397aa|up_7|NC_011661.1_606328_607519_-	cd09732, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx1|420aa|up_6|NC_011661.1_607515_608775_-	cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csm5gr7|382aa|up_5|NC_011661.1_608767_609913_-	cd09662, Csm5_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm5	csm4gr5|339aa|up_4|NC_011661.1_609929_610946_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|250aa|up_3|NC_011661.1_610961_611711_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|150aa|up_2|NC_011661.1_611723_612173_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|800aa|up_1|NC_011661.1_612162_614562_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csx3|104aa|up_0|NC_011661.1_614565_614877_-	cd09681, Csx3_III-U, CRISPR/Cas system-associated protein Csx3	NA|126aa|down_0|NC_011661.1_617728_618106_+	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|258aa|down_1|NC_011661.1_618086_618860_+	PRK13674, PRK13674, GTP cyclohydrolase I FolE2	NA|401aa|down_2|NC_011661.1_618856_620059_+	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|156aa|down_3|NC_011661.1_620034_620502_+	COG0801, FolK, 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase [Coenzyme metabolism]	cas2|88aa|down_4|NC_011661.1_620514_620778_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|332aa|down_5|NC_011661.1_620787_621783_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|169aa|down_6|NC_011661.1_621779_622286_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|785aa|down_7|NC_011661.1_622279_624634_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|239aa|down_8|NC_011661.1_624617_625334_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7|304aa|down_9|NC_011661.1_625347_626259_-	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI
GCF_000021645.1_ASM2164v1	NC_011661	Dictyoglomus turgidum DSM 6724, complete sequence	3	1411186-1411308	3	CRISPRCasFinder	no		WYL,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,cas3HD	Orphan	AAGTGGTCGGGGCGACTGGACTTGAACCAGCGACCT	36	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,cas3HD	NA|100aa|up_4|NC_011661.1_1408420_1408720_-,NA|150aa|down_5|NC_011661.1_1416633_1417083_-	NA|172aa|up_9|NC_011661.1_1401671_1402187_-	cd01275, FHIT, FHIT (fragile histidine family): FHIT proteins, related to the HIT family carry a motif HxHxH/Qxx (x, is a hydrophobic amino acid), On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified into three  branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases	NA|320aa|up_8|NC_011661.1_1402358_1403318_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|273aa|up_7|NC_011661.1_1403314_1404133_-	PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated	NA|281aa|up_6|NC_011661.1_1404126_1404969_-	PRK05654, PRK05654, acetyl-CoA carboxylase carboxyltransferase subunit beta	NA|1128aa|up_5|NC_011661.1_1405013_1408397_-	PRK06826, dnaE, DNA polymerase III DnaE; Reviewed	NA|100aa|up_4|NC_011661.1_1408420_1408720_-	NA	NA|211aa|up_3|NC_011661.1_1408753_1409386_-	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|221aa|up_2|NC_011661.1_1409373_1410036_-	PRK00312, pcm, protein-L-isoaspartate(D-aspartate) O-methyltransferase	NA|286aa|up_1|NC_011661.1_1410041_1410899_-	PRK00811, PRK00811, polyamine aminopropyltransferase	NA|91aa|up_0|NC_011661.1_1410905_1411178_-	pfam00708, Acylphosphatase, Acylphosphatase	NA|626aa|down_0|NC_011661.1_1411382_1413260_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|81aa|down_1|NC_011661.1_1413265_1413508_-	pfam03776, MinE, Septum formation topological specificity factor MinE	NA|265aa|down_2|NC_011661.1_1413520_1414315_-	TIGR01968, Septum_site-determining_protein_MinD, septum site-determining protein MinD	NA|203aa|down_3|NC_011661.1_1414307_1414916_-	pfam03775, MinC_C, Septum formation inhibitor MinC, C-terminal domain	NA|575aa|down_4|NC_011661.1_1414912_1416637_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|150aa|down_5|NC_011661.1_1416633_1417083_-	NA	NA|269aa|down_6|NC_011661.1_1417067_1417874_-	PRK13922, PRK13922, rod shape-determining protein MreC; Provisional	NA|349aa|down_7|NC_011661.1_1417854_1418901_-	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|194aa|down_8|NC_011661.1_1418940_1419522_-	pfam02545, Maf, Maf-like protein	NA|279aa|down_9|NC_011661.1_1419518_1420355_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]
