assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000236605.1_ASM23660v1	NC_016593	Geobacillus thermoleovorans CCB_US3_UF5, complete genome	1	394989-398364	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	50,50,50	50	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA,NA|75aa|down_0|NC_016593.1_398439_398664_-,NA|175aa|down_3|NC_016593.1_402250_402775_-,NA|62aa|down_8|NC_016593.1_409299_409485_+	NA|210aa|up_9|NC_016593.1_379640_380270_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|376aa|up_8|NC_016593.1_380524_381652_-	cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of the osmoprotectant transporter	NA|301aa|up_7|NC_016593.1_381668_382571_-	cd13528, PBP2_osmoprotectants, Substrate-binding domain of osmoregulatory ABC-type transporters; the type 2 periplasmic-binding protein fold	NA|553aa|up_6|NC_016593.1_383151_384810_+	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|216aa|up_5|NC_016593.1_384938_385586_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|497aa|up_4|NC_016593.1_385730_387221_+	TIGR02677, conserved_hypothetical_protein, TIGR02677 family protein	NA|399aa|up_3|NC_016593.1_387223_388420_+	TIGR02678, hypothetical_protein, TIGR02678 family protein	NA|1374aa|up_2|NC_016593.1_388379_392501_+	TIGR02680, conserved_hypothetical_protein, TIGR02680 family protein	NA|407aa|up_1|NC_016593.1_392497_393718_+	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	NA|309aa|up_0|NC_016593.1_393860_394787_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|75aa|down_0|NC_016593.1_398439_398664_-	NA	NA|411aa|down_1|NC_016593.1_400205_401438_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|218aa|down_2|NC_016593.1_401637_402291_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|175aa|down_3|NC_016593.1_402250_402775_-	NA	NA|770aa|down_4|NC_016593.1_403683_405993_-	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|458aa|down_5|NC_016593.1_406645_408019_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|221aa|down_6|NC_016593.1_408008_408671_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|142aa|down_7|NC_016593.1_408850_409276_+	pfam03929, PepSY_TM, PepSY-associated TM region	NA|62aa|down_8|NC_016593.1_409299_409485_+	NA	NA|402aa|down_9|NC_016593.1_409465_410671_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family
GCF_000236605.1_ASM23660v1	NC_016593	Geobacillus thermoleovorans CCB_US3_UF5, complete genome	2	1129828-1129921	2	CRISPRCasFinder	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTAGACACAAAATATAGTGCGGAAGAATCG	30	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA|73aa|up_9|NC_016593.1_1120358_1120577_+,NA	NA|73aa|up_9|NC_016593.1_1120358_1120577_+	NA	NA|262aa|up_8|NC_016593.1_1120680_1121466_+	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|289aa|up_7|NC_016593.1_1121492_1122359_+	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|404aa|up_6|NC_016593.1_1122533_1123745_+	cd01155, ACAD_FadE2, Acyl-CoA dehydrogenases similar to fadE2	NA|261aa|up_5|NC_016593.1_1123763_1124546_+	PRK08213, PRK08213, gluconate 5-dehydrogenase; Provisional	NA|540aa|up_4|NC_016593.1_1124567_1126187_+	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|195aa|up_3|NC_016593.1_1126280_1126865_+	pfam17932, TetR_C_24, Tetracyclin repressor-like, C-terminal domain	NA|353aa|up_2|NC_016593.1_1126962_1128021_+	cd05154, ACAD10_11_N-like, N-terminal domain of Acyl-CoA dehydrogenase (ACAD) 10 and 11, and similar proteins	NA|261aa|up_1|NC_016593.1_1128030_1128813_+	pfam04029, 2-ph_phosp, 2-phosphosulpholactate phosphatase	NA|325aa|up_0|NC_016593.1_1128809_1129784_+	cd08241, QOR1, Quinone oxidoreductase (QOR)	NA|386aa|down_0|NC_016593.1_1130084_1131242_-	PRK07683, PRK07683, aminotransferase A; Validated	NA|76aa|down_1|NC_016593.1_1131397_1131625_-	PRK03636, PRK03636, hypothetical protein; Provisional	NA|333aa|down_2|NC_016593.1_1131814_1132813_+	cd12831, TmCorA-like_u2, Uncharacterized bacterial subfamily of the Thermotoga maritima CorA-like family	NA|217aa|down_3|NC_016593.1_1132889_1133540_+	cd06418, GH25_BacA-like, BacA is a bacterial lysin from Enterococcus faecalis that degrades bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues	NA|543aa|down_4|NC_016593.1_1133713_1135342_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|342aa|down_5|NC_016593.1_1135338_1136364_+	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|469aa|down_6|NC_016593.1_1136363_1137770_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|145aa|down_7|NC_016593.1_1137921_1138356_-	pfam14177, YkyB, YkyB-like protein	NA|290aa|down_8|NC_016593.1_1138424_1139294_-	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|256aa|down_9|NC_016593.1_1139437_1140205_+	PRK07677, PRK07677, short chain dehydrogenase; Provisional
GCF_000236605.1_ASM23660v1	NC_016593	Geobacillus thermoleovorans CCB_US3_UF5, complete genome	3	2064940-2065907	3,2,2	CRISPRCasFinder,CRT,PILER-CR	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	14,14,13	14	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA,NA	NA|450aa|up_9|NC_016593.1_2052412_2053762_-	pfam02447, GntP_permease, GntP family permease	NA|515aa|up_8|NC_016593.1_2053918_2055463_-	TIGR01314, gntK_FGGY, gluconate kinase, FGGY type	NA|349aa|up_7|NC_016593.1_2055449_2056496_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|312aa|up_6|NC_016593.1_2057016_2057952_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|229aa|up_5|NC_016593.1_2058013_2058700_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|533aa|up_4|NC_016593.1_2058772_2060371_-	COG3290, CitA, Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms]	NA|509aa|up_3|NC_016593.1_2060443_2061970_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]	NA|155aa|up_2|NC_016593.1_2061983_2062448_-	pfam07331, TctB, Tripartite tricarboxylate transporter TctB family	NA|347aa|up_1|NC_016593.1_2062502_2063543_-	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|310aa|up_0|NC_016593.1_2063909_2064839_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|387aa|down_0|NC_016593.1_2066087_2067248_-	PRK02318, PRK02318, mannitol-1-phosphate 5-dehydrogenase; Provisional	NA|148aa|down_1|NC_016593.1_2067247_2067691_-	COG4668, MtlA, Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism]	NA|697aa|down_2|NC_016593.1_2067696_2069787_-	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|483aa|down_3|NC_016593.1_2070098_2071547_-	COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC component [Carbohydrate transport and metabolism]	NA|168aa|down_4|NC_016593.1_2071689_2072193_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|97aa|down_5|NC_016593.1_2072185_2072476_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|455aa|down_6|NC_016593.1_2072702_2074067_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|487aa|down_7|NC_016593.1_2074682_2076143_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|396aa|down_8|NC_016593.1_2076344_2077532_-	cd08194, Fe-ADH-like, Iron-containing alcohol dehydrogenases-like	NA|570aa|down_9|NC_016593.1_2077726_2079436_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]
GCF_000236605.1_ASM23660v1	NC_016593	Geobacillus thermoleovorans CCB_US3_UF5, complete genome	4	2999352-2999434	4	CRISPRCasFinder	no	Cas14u_CAS-V	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Unclear	ATGGCCACCAAAGACGAACTCGC	23	0	0	NA	NA	NA	1	1	Unclear	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA|96aa|up_5|NC_016593.1_2992944_2993232_-,NA|71aa|down_3|NC_016593.1_3011886_3012099_-,NA|84aa|down_9|NC_016593.1_3017654_3017906_-	NA|390aa|up_9|NC_016593.1_2987627_2988797_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|681aa|up_8|NC_016593.1_2988678_2990721_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|317aa|up_7|NC_016593.1_2990897_2991848_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|209aa|up_6|NC_016593.1_2991980_2992607_+	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|96aa|up_5|NC_016593.1_2992944_2993232_-	NA	NA|185aa|up_4|NC_016593.1_2993241_2993796_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|501aa|up_3|NC_016593.1_2994025_2995528_-	pfam17936, Big_6, Bacterial Ig domain	NA|327aa|up_2|NC_016593.1_2995580_2996561_-	pfam01032, FecCD, FecCD transport family	NA|333aa|up_1|NC_016593.1_2996553_2997552_-	pfam01032, FecCD, FecCD transport family	NA|314aa|up_0|NC_016593.1_2997582_2998524_-	cd01146, FhuD, Fe3+-siderophore binding domain FhuD	NA|154aa|down_0|NC_016593.1_3000070_3000532_-	pfam11518, DUF3221, Protein of unknown function (DUF3221)	NA|427aa|down_1|NC_016593.1_3000977_3002258_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|455aa|down_2|NC_016593.1_3002639_3004004_+	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|71aa|down_3|NC_016593.1_3011886_3012099_-	NA	NA|273aa|down_4|NC_016593.1_3012143_3012962_+	PRK03187, tgl, transglutaminase; Provisional	NA|303aa|down_5|NC_016593.1_3012932_3013841_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|226aa|down_6|NC_016593.1_3013967_3014645_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|434aa|down_7|NC_016593.1_3015021_3016323_-	COG3935, DnaD, Putative primosome component and related proteins [DNA replication, recombination, and repair]	NA|294aa|down_8|NC_016593.1_3016560_3017442_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|84aa|down_9|NC_016593.1_3017654_3017906_-	NA
GCF_000236605.1_ASM23660v1	NC_016593	Geobacillus thermoleovorans CCB_US3_UF5, complete genome	5	3501395-3501491	5	CRISPRCasFinder	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTTTTCCAAAAACAGACAACGAATCGTTGT	30	1	1	3501425-3501461	NC_016593.1_1853621-1853585	NA	1	1	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA,NA|64aa|down_2|NC_016593.1_3504085_3504277_-	NA|321aa|up_9|NC_016593.1_3487974_3488937_-	PRK09479, glpX, fructose 1,6-bisphosphatase II; Reviewed	NA|429aa|up_8|NC_016593.1_3488971_3490258_-	PRK12830, PRK12830, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Reviewed	NA|214aa|up_7|NC_016593.1_3490390_3491032_-	PRK01362, PRK01362, fructose-6-phosphate aldolase	NA|288aa|up_6|NC_016593.1_3491107_3491971_-	PRK07709, PRK07709, fructose-bisphosphate aldolase; Provisional	NA|121aa|up_5|NC_016593.1_3492179_3492542_-	cd17553, REC_Spo0F-like, phosphoacceptor receiver (REC) domain of Spo0F and similar domains	NA|174aa|up_4|NC_016593.1_3492665_3493187_+	pfam10740, DUF2529, Domain of unknown function (DUF2529)	NA|532aa|up_3|NC_016593.1_3493367_3494963_-	PRK05380, pyrG, CTP synthetase; Validated	NA|1087aa|up_2|NC_016593.1_3496070_3499331_-	cd03678, MM_CoA_mutase_1, Coenzyme B12-dependent-methylmalonyl coenzyme A (CoA) mutase (MCM) family, unknown subfamily 1; composed of uncharacterized bacterial proteins containing a C-terminal MCM domain	NA|211aa|up_1|NC_016593.1_3499348_3499981_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|381aa|up_0|NC_016593.1_3500188_3501331_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|382aa|down_0|NC_016593.1_3501842_3502988_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|284aa|down_1|NC_016593.1_3503221_3504073_-	PRK05808, PRK05808, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|64aa|down_2|NC_016593.1_3504085_3504277_-	NA	NA|393aa|down_3|NC_016593.1_3504395_3505574_-	PRK08235, PRK08235, acetyl-CoA C-acetyltransferase	NA|699aa|down_4|NC_016593.1_3505730_3507827_-	COG0247, GlpC, Fe-S oxidoreductase [Energy production and conversion]	NA|397aa|down_5|NC_016593.1_3508163_3509354_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|56aa|down_6|NC_016593.1_3509483_3509651_-	COG4317, COG4317, Uncharacterized protein conserved in bacteria [Function unknown]	NA|558aa|down_7|NC_016593.1_3509725_3511399_-	PRK01611, argS, arginyl-tRNA synthetase; Reviewed	NA|144aa|down_8|NC_016593.1_3511402_3511834_-	pfam09148, DUF1934, Domain of unknown function (DUF1934)	NA|126aa|down_9|NC_016593.1_3512088_3512466_-	PRK14485, PRK14485, putative bifunctional cbb3-type cytochrome c oxidase subunit I/II; Provisional
