assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002119625.1_ASM211962v1	NZ_CP020030	Geobacillus thermodenitrificans strain T12 chromosome, complete genome	1	345230-345925	1,1,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas6,cas2,cas1,cas9	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	Type II-B,Type II-C,Type II-A	GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAATTT,GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAAT,GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAAT,GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAATTTC	38,36,36,39	0	0	NA	NA	NA:NA:NA:NA	5,10,10,5	10	TypeII-B,TypeII-C,TypeII-A	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|233aa|up_9|NZ_CP020030.1_334427_335126_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|473aa|up_8|NZ_CP020030.1_335112_336531_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|145aa|up_7|NZ_CP020030.1_336624_337059_-	pfam18218, Spa1_C, Lantibiotic immunity protein Spa1 C-terminal domain	NA|263aa|up_6|NZ_CP020030.1_337113_337902_-	TIGR03733, lanti_perm_MutG, lantibiotic protection ABC transporter permease subunit, MutG family	NA|249aa|up_5|NZ_CP020030.1_337903_338650_-	TIGR03732, lanti_perm_MutE, lantibiotic protection ABC transporter permease subunit, MutE/EpiE family	NA|230aa|up_4|NZ_CP020030.1_338668_339358_-	TIGR03740, galliderm_ABC, gallidermin-class lantibiotic protection ABC transporter, ATP-binding subunit	NA|210aa|up_3|NZ_CP020030.1_340010_340640_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|378aa|up_2|NZ_CP020030.1_341029_342163_-	cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of the osmoprotectant transporter	NA|301aa|up_1|NZ_CP020030.1_342231_343134_-	cd13528, PBP2_osmoprotectants, Substrate-binding domain of osmoregulatory ABC-type transporters; the type 2 periplasmic-binding protein fold	cas6|249aa|up_0|NZ_CP020030.1_344126_344873_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas2|109aa|down_0|NZ_CP020030.1_346024_346351_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|293aa|down_1|NZ_CP020030.1_346337_347216_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1083aa|down_2|NZ_CP020030.1_347181_350430_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|354aa|down_3|NZ_CP020030.1_350755_351817_+	cd02253, DmpA, L-Aminopeptidase D-amidase/D-esterase (DmpA) family; DmpA catalyzes the release of N-terminal D and L amino acids from peptide susbtrates	NA|438aa|down_4|NZ_CP020030.1_352716_354030_+	COG2610, GntT, H+/gluconate symporter and related permeases [Carbohydrate transport and metabolism / Amino acid transport and metabolism]	NA|141aa|down_5|NZ_CP020030.1_354119_354542_+	cd03449, R_hydratase, (R)-hydratase [(R)-specific enoyl-CoA hydratase] catalyzes the hydration of trans-2-enoyl CoA to (R)-3-hydroxyacyl-CoA as part of the PHA (polyhydroxyalkanoate) biosynthetic pathway	NA|446aa|down_6|NZ_CP020030.1_354550_355888_+	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|385aa|down_7|NZ_CP020030.1_356071_357226_+	TIGR03207, cyc_hxne_CoA_dh, cyclohexanecarboxyl-CoA dehydrogenase	NA|266aa|down_8|NZ_CP020030.1_357246_358044_+	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|548aa|down_9|NZ_CP020030.1_358068_359712_+	PRK13295, PRK13295, cyclohexanecarboxylate-CoA ligase; Reviewed
GCF_002119625.1_ASM211962v1	NZ_CP020030	Geobacillus thermodenitrificans strain T12 chromosome, complete genome	2	393185-394212	3,2,2	PILER-CR,CRISPRCasFinder,CRT	no		cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	GTTTGTATCGTACCTATGAGGGATTGAAAC,GTTTGTATCGTACCTATGAGGGATTGAAAC,GTTTGTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	14,15,15	15	Orphan	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA|162aa|down_7|NZ_CP020030.1_402774_403260_+	NA|507aa|up_9|NZ_CP020030.1_379539_381060_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|307aa|up_8|NZ_CP020030.1_381204_382125_+	PRK13337, PRK13337, putative lipid kinase; Reviewed	NA|461aa|up_7|NZ_CP020030.1_382199_383582_+	TIGR00479, 23S_rRNA_uracil1939-C5-methyltransferase_RlmD, 23S rRNA (uracil-5-)-methyltransferase RumA	NA|462aa|up_6|NZ_CP020030.1_384250_385636_-	COG1823, COG1823, Predicted Na+/dicarboxylate symporter [General function prediction only]	NA|258aa|up_5|NZ_CP020030.1_385973_386747_+	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|624aa|up_4|NZ_CP020030.1_386721_388593_+	pfam02687, FtsX, FtsX-like permease family	NA|119aa|up_3|NZ_CP020030.1_388617_388974_+	COG5294, COG5294, Uncharacterized protein conserved in bacteria [Function unknown]	NA|390aa|up_2|NZ_CP020030.1_389465_390634_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|99aa|up_1|NZ_CP020030.1_390732_391029_+	pfam09966, DUF2200, Uncharacterized protein conserved in bacteria (DUF2200)	NA|412aa|up_0|NZ_CP020030.1_391264_392500_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|108aa|down_0|NZ_CP020030.1_395467_395791_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|187aa|down_1|NZ_CP020030.1_395783_396344_+	pfam08006, DUF1700, Protein of unknown function (DUF1700)	NA|267aa|down_2|NZ_CP020030.1_396970_397771_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|418aa|down_3|NZ_CP020030.1_397763_399017_-	pfam00665, rve, Integrase core domain	NA|186aa|down_4|NZ_CP020030.1_399157_399715_-	pfam14690, zf-ISL3, zinc-finger of transposase IS204/IS1001/IS1096/IS1165	NA|278aa|down_5|NZ_CP020030.1_400625_401459_+	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|197aa|down_6|NZ_CP020030.1_401578_402169_+	pfam04229, GrpB, GrpB protein	NA|162aa|down_7|NZ_CP020030.1_402774_403260_+	NA	NA|397aa|down_8|NZ_CP020030.1_403916_405107_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|490aa|down_9|NZ_CP020030.1_406170_407640_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family
GCF_002119625.1_ASM211962v1	NZ_CP020030	Geobacillus thermodenitrificans strain T12 chromosome, complete genome	3	1460464-1460561	3	CRISPRCasFinder	no	csa3	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	Type I-A	TGTTGAAAGAGAAAGGCCTCGTTTA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA|103aa|down_1|NZ_CP020030.1_1462295_1462604_-	NA|309aa|up_9|NZ_CP020030.1_1444304_1445231_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|299aa|up_8|NZ_CP020030.1_1445262_1446159_-	cd08420, PBP2_CysL_like, C-terminal substrate binding domain of LysR-type transcriptional regulator CysL, which activates the transcription of the cysJI operon encoding sulfite reductase, contains the type 2 periplasmic binding fold	NA|610aa|up_7|NZ_CP020030.1_1446396_1448226_+	TIGR01931, Sulfite_reductase_flavoprotein_alpha-component, sulfite reductase [NADPH] flavoprotein, alpha-component	NA|574aa|up_6|NZ_CP020030.1_1448250_1449972_+	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|488aa|up_5|NZ_CP020030.1_1450230_1451694_+	cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like	NA|319aa|up_4|NZ_CP020030.1_1452224_1453181_+	COG4606, CeuB, ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism]	NA|317aa|up_3|NZ_CP020030.1_1453173_1454124_+	COG4605, CeuC, ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism]	NA|252aa|up_2|NZ_CP020030.1_1454117_1454873_+	COG4604, CeuD, ABC-type enterochelin transport system, ATPase component [Inorganic ion transport and metabolism]	NA|320aa|up_1|NZ_CP020030.1_1455128_1456088_+	COG4607, CeuA, ABC-type enterochelin transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|1257aa|up_0|NZ_CP020030.1_1456382_1460153_+	COG1112, COG1112, Superfamily I DNA and RNA helicases and helicase subunits [DNA replication, recombination, and repair]	NA|479aa|down_0|NZ_CP020030.1_1460726_1462163_+	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|103aa|down_1|NZ_CP020030.1_1462295_1462604_-	NA	NA|464aa|down_2|NZ_CP020030.1_1462820_1464212_-	TIGR01773, GABA_permease, gamma-aminobutyrate permease	NA|539aa|down_3|NZ_CP020030.1_1464434_1466051_+	pfam07905, PucR, Purine catabolism regulatory protein-like family	NA|451aa|down_4|NZ_CP020030.1_1466274_1467627_+	PRK07678, PRK07678, aminotransferase; Validated	NA|329aa|down_5|NZ_CP020030.1_1467864_1468851_+	cd08289, MDR_yhfp_like, Yhfp putative quinone oxidoreductases	NA|570aa|down_6|NZ_CP020030.1_1469220_1470930_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|698aa|down_7|NZ_CP020030.1_1470955_1473049_+	pfam00933, Glyco_hydro_3, Glycosyl hydrolase family 3 N terminal domain	NA|419aa|down_8|NZ_CP020030.1_1473064_1474321_+	pfam07075, DUF1343, Protein of unknown function (DUF1343)	NA|303aa|down_9|NZ_CP020030.1_1474480_1475389_-	cd08434, PBP2_GltC_like, The substrate binding domain of LysR-type transcriptional regulator GltC, which activates gltA expression of glutamate synthase operon, contains type 2 periplasmic binding fold
GCF_002119625.1_ASM211962v1	NZ_CP020030	Geobacillus thermodenitrificans strain T12 chromosome, complete genome	4	2204196-2204382	4	CRISPRCasFinder	no	c2c9_V-U4	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	Type V-U4	GTGTTTCAATCCTCTAAACGAGGCCTATCTCTTTCAAC	38	0	0	NA	NA	NA	2	2	TypeV-U4	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	NA|123aa|up_9|NZ_CP020030.1_2194625_2194994_+,NA|55aa|up_0|NZ_CP020030.1_2203920_2204085_-,NA	NA|123aa|up_9|NZ_CP020030.1_2194625_2194994_+	NA	NA|502aa|up_8|NZ_CP020030.1_2195051_2196557_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|251aa|up_7|NZ_CP020030.1_2196553_2197306_+	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|223aa|up_6|NZ_CP020030.1_2197606_2198275_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|449aa|up_5|NZ_CP020030.1_2198271_2199618_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|304aa|up_4|NZ_CP020030.1_2199727_2200639_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|240aa|up_3|NZ_CP020030.1_2200635_2201355_+	COG4200, COG4200, Uncharacterized protein conserved in bacteria [Function unknown]	NA|248aa|up_2|NZ_CP020030.1_2201355_2202099_+	COG4200, COG4200, Uncharacterized protein conserved in bacteria [Function unknown]	NA|378aa|up_1|NZ_CP020030.1_2202370_2203504_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|55aa|up_0|NZ_CP020030.1_2203920_2204085_-	NA	NA|283aa|down_0|NZ_CP020030.1_2204623_2205472_-	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|373aa|down_1|NZ_CP020030.1_2206297_2207416_+	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|93aa|down_2|NZ_CP020030.1_2207607_2207886_-	cd14797, DUF302, Uncharacterized domain family DUF302	NA|261aa|down_3|NZ_CP020030.1_2208037_2208820_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|379aa|down_4|NZ_CP020030.1_2208903_2210040_-	cd07724, POD-like_MBL-fold, ETHE1 (PDO type I), persulfide dioxygenase A (PDOA, PDO type II) and related proteins; MBL-fold metallo-hydrolase domain	NA|190aa|down_5|NZ_CP020030.1_2210089_2210659_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|99aa|down_6|NZ_CP020030.1_2210682_2210979_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|122aa|down_7|NZ_CP020030.1_2210991_2211357_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|161aa|down_8|NZ_CP020030.1_2211382_2211865_-	pfam13686, DrsE_2, DsrE/DsrF/DrsH-like family	NA|76aa|down_9|NZ_CP020030.1_2211913_2212141_-	cd00291, SirA_YedF_YeeD, SirA, YedF, and YeeD
GCF_002119625.1_ASM211962v1	NZ_CP020030	Geobacillus thermodenitrificans strain T12 chromosome, complete genome	5	2919684-2919785	5	CRISPRCasFinder	no		cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	TCTCTGCTTCTGCCGCTGTCTGCG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,cas6,cas2,cas1,cas9,csa3,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|572aa|up_9|NZ_CP020030.1_2905424_2907140_-	PRK04319, PRK04319, acetyl-CoA synthetase; Provisional	NA|211aa|up_8|NZ_CP020030.1_2907315_2907948_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|215aa|up_7|NZ_CP020030.1_2907979_2908624_+	cd04584, CBS_pair_AcuB_like, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the ACT domain	NA|390aa|up_6|NZ_CP020030.1_2908620_2909790_+	cd09994, HDAC_AcuC_like, Class I histone deacetylase AcuC (Acetoin utilization protein)-like enzymes	NA|332aa|up_5|NZ_CP020030.1_2911639_2912635_-	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|361aa|up_4|NZ_CP020030.1_2913107_2914190_-	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|111aa|up_3|NZ_CP020030.1_2914433_2914766_-	pfam11009, DUF2847, Protein of unknown function (DUF2847)	NA|128aa|up_2|NZ_CP020030.1_2914771_2915155_-	pfam12732, YtxH, YtxH-like protein	NA|150aa|up_1|NZ_CP020030.1_2915141_2915591_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|435aa|up_0|NZ_CP020030.1_2915852_2917157_-	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|202aa|down_0|NZ_CP020030.1_2920561_2921167_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|266aa|down_1|NZ_CP020030.1_2921188_2921986_-	pfam07285, DUF1444, Protein of unknown function (DUF1444)	NA|177aa|down_2|NZ_CP020030.1_2922032_2922563_-	PRK03114, PRK03114, DUF84 family protein	NA|359aa|down_3|NZ_CP020030.1_2922665_2923742_-	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|99aa|down_4|NZ_CP020030.1_2923854_2924151_+	COG5584, COG5584, Predicted small secreted protein [Function unknown]	NA|284aa|down_5|NZ_CP020030.1_2924145_2924997_-	cd07728, YtnP-like_MBL-fold, Bacillus subtilis YtnP and related proteins; MBL-fold metallo hydrolase domain	NA|217aa|down_6|NZ_CP020030.1_2925254_2925905_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|90aa|down_7|NZ_CP020030.1_2926052_2926322_+	pfam14165, YtzH, YtzH-like protein	NA|261aa|down_8|NZ_CP020030.1_2926418_2927201_-	COG0510, ycfN, Thiamine kinase and related kinases [Coenzyme transport and metabolism]	NA|727aa|down_9|NZ_CP020030.1_2927429_2929610_-	TIGR02104, pulA_typeI, pullulanase, type I
