assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000184705.1_ASM18470v1	NC_014831	Thermaerobacter marianensis DSM 12885, complete sequence	1	724015-724196	1	PILER-CR	no		csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	Orphan	ACGTGGTCCAGCCCGGCGA	19	0	0	NA	NA	NA	2	2	Orphan	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	NA|100aa|up_1|NC_014831.1_722052_722352_-,NA|95aa|down_6|NC_014831.1_732577_732862_-	NA|335aa|up_9|NC_014831.1_706853_707858_-	PRK13141, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|201aa|up_8|NC_014831.1_707902_708505_-	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|936aa|up_7|NC_014831.1_708501_711309_-	pfam00815, Histidinol_dh, Histidinol dehydrogenase	NA|256aa|up_6|NC_014831.1_711296_712064_-	PRK01686, hisG, ATP phosphoribosyltransferase catalytic subunit; Reviewed	NA|378aa|up_5|NC_014831.1_714071_715205_+	PRK11259, solA, N-methyl-L-tryptophan oxidase	NA|300aa|up_4|NC_014831.1_715206_716106_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|330aa|up_3|NC_014831.1_716087_717077_+	COG2423, COG2423, Predicted ornithine cyclodeaminase, mu-crystallin homolog [Amino acid transport and metabolism]	NA|759aa|up_2|NC_014831.1_719384_721661_-	cd07551, P-type_ATPase_HM_ZosA_PfeT-like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis ZosA/PfeT which transports copper, and perhaps zinc under oxidative stress, and perhaps ferrous iron	NA|100aa|up_1|NC_014831.1_722052_722352_-	NA	NA|252aa|up_0|NC_014831.1_722702_723458_+	pfam04474, DUF554, Protein of unknown function (DUF554)	NA|464aa|down_0|NC_014831.1_724554_725946_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|320aa|down_1|NC_014831.1_726075_727035_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|327aa|down_2|NC_014831.1_727089_728070_+	cd19937, REC_OmpR_BsPhoP-like, phosphoacceptor receiver (REC) domain of BsPhoP-like OmpR family response regulators	NA|647aa|down_3|NC_014831.1_728158_730099_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|162aa|down_4|NC_014831.1_730371_730857_+	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|491aa|down_5|NC_014831.1_730985_732458_+	pfam10646, Germane, Sporulation and spore germination	NA|95aa|down_6|NC_014831.1_732577_732862_-	NA	NA|308aa|down_7|NC_014831.1_733366_734290_+	pfam06267, DUF1028, Family of unknown function (DUF1028)	NA|444aa|down_8|NC_014831.1_734313_735645_+	COG3214, COG3214, Uncharacterized protein conserved in bacteria [Function unknown]	NA|406aa|down_9|NC_014831.1_735654_736872_-	cd08234, threonine_DH_like, L-threonine dehydrogenase
GCF_000184705.1_ASM18470v1	NC_014831	Thermaerobacter marianensis DSM 12885, complete sequence	2	1143966-1145239	2,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas8u1,cas3,csb2gr5,csb1gr7	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	Unclear	GCTGCAACGTAGCCACGGTATTAACCGTGGATGCGAC,GCTGCAACGTAGCCACGGTATTAACCGTGGATGCGAC,GCTGCAACGTAGCCACGGTATTAACCGTGGATGCGAC	37,37,37	0	0	NA	NA	NA:NA:NA	13,17,17	17	Unclear	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	NA,cas8u1|342aa|down_0|NC_014831.1_1145433_1146459_-	NA|168aa|up_9|NC_014831.1_1130532_1131036_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|883aa|up_8|NC_014831.1_1131437_1134086_+	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|480aa|up_7|NC_014831.1_1134317_1135757_-	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|307aa|up_6|NC_014831.1_1135995_1136916_-	PRK12362, PRK12362, germination protease; Provisional	NA|494aa|up_5|NC_014831.1_1137539_1139021_+	cd17474, MFS_YfmO_like, Bacillus subtilis multidrug efflux protein YfmO and similar transporters of the Major Facilitator Superfamily	NA|200aa|up_4|NC_014831.1_1139271_1139871_+	pfam03602, Cons_hypoth95, Conserved hypothetical protein 95	NA|164aa|up_3|NC_014831.1_1139867_1140359_+	PRK00168, coaD, phosphopantetheine adenylyltransferase; Provisional	NA|200aa|up_2|NC_014831.1_1140355_1140955_+	cd06503, ATP-synt_Fo_b, F-type ATP synthase, membrane subunit b	NA|405aa|up_1|NC_014831.1_1141054_1142269_-	TIGR02871, conserved_hypothetical_protein, sporulation integral membrane protein YlbJ	NA|270aa|up_0|NC_014831.1_1142532_1143342_+	COG3480, SdrC, Predicted secreted protein containing a PDZ domain [Signal transduction mechanisms]	cas8u1|342aa|down_0|NC_014831.1_1145433_1146459_-	NA	cas3|1022aa|down_1|NC_014831.1_1146455_1149521_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	csb2gr5|466aa|down_2|NC_014831.1_1149517_1150915_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	csb1gr7|402aa|down_3|NC_014831.1_1150935_1152141_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	NA|48aa|down_4|NC_014831.1_1152468_1152612_+	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|196aa|down_5|NC_014831.1_1153223_1153811_+	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|60aa|down_6|NC_014831.1_1153791_1153971_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|352aa|down_7|NC_014831.1_1155199_1156255_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|419aa|down_8|NC_014831.1_1156238_1157495_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|318aa|down_9|NC_014831.1_1157491_1158445_+	COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism]
GCF_000184705.1_ASM18470v1	NC_014831	Thermaerobacter marianensis DSM 12885, complete sequence	3	2079749-2079844	2	CRISPRCasFinder	no	csa3	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	Type I-A	CGTCGCTCCGGCTCGCCTCCGCC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	NA|306aa|up_8|NC_014831.1_2070912_2071830_+,NA|139aa|up_6|NC_014831.1_2073353_2073770_+,NA|256aa|up_3|NC_014831.1_2075926_2076694_-,NA|111aa|up_1|NC_014831.1_2078445_2078778_+,NA|264aa|down_1|NC_014831.1_2080926_2081718_+	NA|351aa|up_9|NC_014831.1_2068050_2069103_-	cd08260, Zn_ADH6, Alcohol dehydrogenases of the MDR family	NA|306aa|up_8|NC_014831.1_2070912_2071830_+	NA	csa3|367aa|up_7|NC_014831.1_2072120_2073221_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|139aa|up_6|NC_014831.1_2073353_2073770_+	NA	NA|134aa|up_5|NC_014831.1_2073816_2074218_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|324aa|up_4|NC_014831.1_2074412_2075384_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|256aa|up_3|NC_014831.1_2075926_2076694_-	NA	NA|467aa|up_2|NC_014831.1_2076859_2078260_-	pfam05694, SBP56, 56kDa selenium binding protein (SBP56)	NA|111aa|up_1|NC_014831.1_2078445_2078778_+	NA	NA|223aa|up_0|NC_014831.1_2078913_2079582_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|112aa|down_0|NC_014831.1_2080229_2080565_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|264aa|down_1|NC_014831.1_2080926_2081718_+	NA	NA|360aa|down_2|NC_014831.1_2085906_2086986_-	PRK01212, PRK01212, homoserine kinase; Provisional	NA|390aa|down_3|NC_014831.1_2087261_2088431_-	PRK07409, PRK07409, threonine synthase; Validated	NA|143aa|down_4|NC_014831.1_2089125_2089554_-	cd08891, SRPBCC_CalC, Ligand-binding SRPBCC domain of Micromonospora echinospora CalC and related proteins	csa3|106aa|down_5|NC_014831.1_2089550_2089868_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|680aa|down_6|NC_014831.1_2090105_2092145_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|125aa|down_7|NC_014831.1_2092182_2092557_-	COG0599, COG0599, Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit [Function unknown]	NA|465aa|down_8|NC_014831.1_2093371_2094766_-	PLN02805, PLN02805, D-lactate dehydrogenase [cytochrome]	NA|523aa|down_9|NC_014831.1_2094831_2096400_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000184705.1_ASM18470v1	NC_014831	Thermaerobacter marianensis DSM 12885, complete sequence	4	2479885-2481108	3,3,2	PILER-CR,CRISPRCasFinder,CRT	no	csa3	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	Type I-A	GTTTGTAGAGTGCCTATGAGGAATCGAAAC,GTTTGTAGAGTGCCTATGAGGAATCGAAAC,GTTTGTAGAGTGCCTATGAGGAATCGAAAC	30,30,30	1	1	2479915-2479951	NC_014831.1_51553-51589	NA:NA:NA	18,18,18	18	Orphan	csa3,cas8u1,cas3,csb2gr5,csb1gr7,DinG,cas5,cas7b,cas8b1,WYL	NA|170aa|up_2|NC_014831.1_2475469_2475979_-,NA|291aa|down_2|NC_014831.1_2484847_2485720_-,NA|147aa|down_3|NC_014831.1_2486072_2486513_+	NA|307aa|up_9|NC_014831.1_2461822_2462743_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|353aa|up_8|NC_014831.1_2462829_2463888_-	cd06218, DHOD_e_trans, FAD/NAD binding domain in the electron transfer subunit of dihydroorotate dehydrogenase	NA|771aa|up_7|NC_014831.1_2463887_2466200_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|524aa|up_6|NC_014831.1_2467620_2469192_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|540aa|up_5|NC_014831.1_2469340_2470960_-	PRK09357, pyrC, dihydroorotase; Validated	NA|734aa|up_4|NC_014831.1_2472601_2474803_-	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	csa3|112aa|up_3|NC_014831.1_2474943_2475279_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|170aa|up_2|NC_014831.1_2475469_2475979_-	NA	NA|650aa|up_1|NC_014831.1_2476234_2478184_-	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|250aa|up_0|NC_014831.1_2478383_2479133_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|313aa|down_0|NC_014831.1_2481313_2482252_-	cd07205, Pat_PNPLA6_PNPLA7_NTE1_like, Patatin-like phospholipase domain containing protein 6, protein 7, and fungal NTE1	NA|669aa|down_1|NC_014831.1_2482492_2484499_-	pfam07454, SpoIIP, Stage II sporulation protein P (SpoIIP)	NA|291aa|down_2|NC_014831.1_2484847_2485720_-	NA	NA|147aa|down_3|NC_014831.1_2486072_2486513_+	NA	NA|287aa|down_4|NC_014831.1_2486535_2487396_+	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|107aa|down_5|NC_014831.1_2487505_2487826_-	pfam00269, SASP, Small, acid-soluble spore proteins, alpha/beta type	NA|304aa|down_6|NC_014831.1_2488012_2488924_+	cd09989, Arginase, Arginase family	NA|82aa|down_7|NC_014831.1_2489132_2489378_+	pfam10262, Rdx, Rdx family	NA|430aa|down_8|NC_014831.1_2489471_2490761_-	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|175aa|down_9|NC_014831.1_2490759_2491284_+	TIGR02227, Inactive_signal_peptidase_IA
