assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000006985.1_ASM698v1	NC_002932	Chlorobaculum tepidum TLS, complete sequence	1	1052108-1055064	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	Type I-C,Type I-U, Type I-U?	GTTTCAATCCACGCGCCCGCGCGGGGCGCGAC,GTTTCAATCCACGCGCCCGCGCGGGGCGCGAC,GTTTCAATCCACGCGCCCGCGCGGGGCGCGAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	44,44,20	44	TypeI-C,TypeI-U?,TypeI-U	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	NA|61aa|up_1|NC_002932.3_1051457_1051640_+,NA|83aa|up_0|NC_002932.3_1051734_1051983_-,NA|122aa|down_9|NC_002932.3_1067699_1068065_+	NA|405aa|up_9|NC_002932.3_1043479_1044694_+	cd02152, OAT, Ornithine acetyltransferase (OAT) family; also referred to as ArgJ	NA|307aa|up_8|NC_002932.3_1044727_1045648_+	PRK00942, PRK00942, acetylglutamate kinase; Provisional	NA|339aa|up_7|NC_002932.3_1045644_1046661_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|150aa|up_6|NC_002932.3_1046671_1047121_+	COG1438, ArgR, Arginine repressor [Transcription]	NA|402aa|up_5|NC_002932.3_1047134_1048340_+	PRK00509, PRK00509, argininosuccinate synthase; Provisional	NA|500aa|up_4|NC_002932.3_1048419_1049919_-	pfam04909, Amidohydro_2, Amidohydrolase	NA|93aa|up_3|NC_002932.3_1050139_1050418_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|216aa|up_2|NC_002932.3_1050609_1051257_+	pfam04832, SOUL, SOUL heme-binding protein	NA|61aa|up_1|NC_002932.3_1051457_1051640_+	NA	NA|83aa|up_0|NC_002932.3_1051734_1051983_-	NA	cas2|97aa|down_0|NC_002932.3_1055242_1055533_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NC_002932.3_1055536_1056568_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|215aa|down_2|NC_002932.3_1056564_1057209_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas7|300aa|down_3|NC_002932.3_1057211_1058111_-	TIGR02589, conserved_hypothetical_protein, CRISPR-associated protein Cas7/Csd2, subtype I-C/DVULG	cas8c|579aa|down_4|NC_002932.3_1058128_1059865_-	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|238aa|down_5|NC_002932.3_1059861_1060575_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|669aa|down_6|NC_002932.3_1060598_1062605_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas3HD|84aa|down_7|NC_002932.3_1062591_1062843_-	cd09641, Cas3''_I, CRISPR/Cas system-associated protein Cas3''	NA|1222aa|down_8|NC_002932.3_1063532_1067198_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|122aa|down_9|NC_002932.3_1067699_1068065_+	NA
GCF_000006985.1_ASM698v1	NC_002932	Chlorobaculum tepidum TLS, complete sequence	2	1252473-1252546	2	CRISPRCasFinder	no		csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	Orphan	CGTTCACGAAGCTCACGAAAGTA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	NA,NA	NA|247aa|up_9|NC_002932.3_1240041_1240782_+	cd02516, CDP-ME_synthetase, CDP-ME synthetase is involved in mevalonate-independent isoprenoid production	NA|150aa|up_8|NC_002932.3_1241085_1241535_+	PRK10254, PRK10254, proofreading thioesterase EntH	NA|469aa|up_7|NC_002932.3_1241818_1243225_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|167aa|up_6|NC_002932.3_1243317_1243818_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|604aa|up_5|NC_002932.3_1244209_1246021_-	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|621aa|up_4|NC_002932.3_1246165_1248028_-	PRK14954, PRK14954, DNA polymerase III subunits gamma and tau; Provisional	NA|423aa|up_3|NC_002932.3_1248024_1249293_-	TIGR00765, tRNA_processing_ribonuclease_BN_RNase_BN	NA|167aa|up_2|NC_002932.3_1250194_1250695_-	COG1592, COG1592, Rubrerythrin [Energy production and conversion]	NA|165aa|up_1|NC_002932.3_1250733_1251228_-	cd01052, DPSL, DPS-like protein, ferritin-like diiron-binding domain	NA|314aa|up_0|NC_002932.3_1251523_1252465_+	PRK03604, moaC, bifunctional molybdenum cofactor biosynthesis protein MoaC/MogA; Provisional	NA|367aa|down_0|NC_002932.3_1253744_1254845_+	PRK14490, PRK14490, putative bifunctional molybdopterin-guanine dinucleotide biosynthesis protein MobB/MobA; Provisional	NA|331aa|down_1|NC_002932.3_1254841_1255834_+	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|153aa|down_2|NC_002932.3_1255863_1256322_-	PRK14499, PRK14499, cyclic pyranopterin monophosphate synthase MoaC/MOSC-domain-containing protein	NA|147aa|down_3|NC_002932.3_1256345_1256786_-	pfam09912, DUF2141, Uncharacterized protein conserved in bacteria (DUF2141)	NA|226aa|down_4|NC_002932.3_1256859_1257537_-	pfam08901, DUF1847, Protein of unknown function (DUF1847)	NA|409aa|down_5|NC_002932.3_1257574_1258801_-	cd17482, MFS_YxiO_like, Bacillus subtilis YxiO, Listeria monocytogenes BtlA, and similar transporters of the Major Facilitator Superfamily	NA|235aa|down_6|NC_002932.3_1258894_1259599_-	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ	NA|85aa|down_7|NC_002932.3_1259614_1259869_-	pfam02700, PurS, Phosphoribosylformylglycinamidine (FGAM) synthase	NA|267aa|down_8|NC_002932.3_1259911_1260712_-	COG1183, PssA, Phosphatidylserine synthase [Lipid metabolism]	NA|278aa|down_9|NC_002932.3_1260855_1261689_-	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed
GCF_000006985.1_ASM698v1	NC_002932	Chlorobaculum tepidum TLS, complete sequence	3	1873025-1873115	3	CRISPRCasFinder	no	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	Type I-E	GGCAGATCGATGCGTGGGCGGAGTGCC	27	0	0	NA	NA	NA	1	1	TypeI-E	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	NA|152aa|up_3|NC_002932.3_1865959_1866415_-,NA	NA|547aa|up_9|NC_002932.3_1855812_1857453_+	TIGR02026, BchE, magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase	NA|468aa|up_8|NC_002932.3_1858073_1859477_-	COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes [Energy production and conversion]	NA|428aa|up_7|NC_002932.3_1859655_1860939_-	PRK00077, eno, enolase; Provisional	NA|389aa|up_6|NC_002932.3_1862798_1863965_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|361aa|up_5|NC_002932.3_1864272_1865355_-	cd01117, YbiR_permease, Putative anion permease YbiR	NA|171aa|up_4|NC_002932.3_1865463_1865976_-	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|152aa|up_3|NC_002932.3_1865959_1866415_-	NA	NA|381aa|up_2|NC_002932.3_1866938_1868081_+	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|143aa|up_1|NC_002932.3_1868174_1868603_-	cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps)	cas3|862aa|up_0|NC_002932.3_1868932_1871518_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cse2gr11|179aa|down_0|NC_002932.3_1873164_1873701_+	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas6e|210aa|down_1|NC_002932.3_1873697_1874327_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas7|349aa|down_2|NC_002932.3_1874339_1875386_+	cd09646, Cas7_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|239aa|down_3|NC_002932.3_1875382_1876099_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas1|297aa|down_4|NC_002932.3_1876088_1876979_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|112aa|down_5|NC_002932.3_1876966_1877302_+	cd09648, Cas2_I-E, CRISPR/Cas system-associated protein Cas2	NA|295aa|down_6|NC_002932.3_1878547_1879432_-	pfam14505, DUF4438, Domain of unknown function (DUF4438)	NA|473aa|down_7|NC_002932.3_1879708_1881127_+	PRK11649, PRK11649, putative peptidase; Provisional	NA|359aa|down_8|NC_002932.3_1881613_1882690_+	COG0739, NlpD, Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane]	NA|140aa|down_9|NC_002932.3_1882731_1883151_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase
GCF_000006985.1_ASM698v1	NC_002932	Chlorobaculum tepidum TLS, complete sequence	4	1877311-1878450	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	Type I-E	GTCTTCCCCACGCCCGTGGGGGTGTTTC,GTCTTCCCCACGCCCGTGGGGGTGTTTC,GTCTTCCCCACGCCCGTGGGGGTGTTTC	28,28,28	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	18,18,17	18	TypeI-E	csa3,Cas9_archaeal,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,cas8e,cse2gr11,cas6e	NA,NA	NA|381aa|up_9|NC_002932.3_1866938_1868081_+	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|143aa|up_8|NC_002932.3_1868174_1868603_-	cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps)	cas3|862aa|up_7|NC_002932.3_1868932_1871518_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|531aa|up_6|NC_002932.3_1871535_1873128_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|179aa|up_5|NC_002932.3_1873164_1873701_+	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas6e|210aa|up_4|NC_002932.3_1873697_1874327_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas7|349aa|up_3|NC_002932.3_1874339_1875386_+	cd09646, Cas7_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|239aa|up_2|NC_002932.3_1875382_1876099_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas1|297aa|up_1|NC_002932.3_1876088_1876979_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|112aa|up_0|NC_002932.3_1876966_1877302_+	cd09648, Cas2_I-E, CRISPR/Cas system-associated protein Cas2	NA|295aa|down_0|NC_002932.3_1878547_1879432_-	pfam14505, DUF4438, Domain of unknown function (DUF4438)	NA|473aa|down_1|NC_002932.3_1879708_1881127_+	PRK11649, PRK11649, putative peptidase; Provisional	NA|359aa|down_2|NC_002932.3_1881613_1882690_+	COG0739, NlpD, Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane]	NA|140aa|down_3|NC_002932.3_1882731_1883151_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase	NA|83aa|down_4|NC_002932.3_1883665_1883914_+	pfam13370, Fer4_13, 4Fe-4S single cluster domain of Ferredoxin I	NA|317aa|down_5|NC_002932.3_1884014_1884965_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|400aa|down_6|NC_002932.3_1885085_1886285_-	TIGR03469, HpnB, hopene-associated glycosyltransferase HpnB	NA|294aa|down_7|NC_002932.3_1886341_1887223_-	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|244aa|down_8|NC_002932.3_1887250_1887982_-	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|238aa|down_9|NC_002932.3_1888000_1888714_-	COG0170, SEC59, Dolichol kinase [Lipid metabolism]
