assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003688615.2_ASM368861v2	NZ_CP049703	Parageobacillus toebii NBRC 107807 strain DSM 14590 chromosome, complete genome	1	702019-702580	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no		DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	Orphan	GTTTCAATCCCTCATAGGTAAGATAAAAAC,GTTNTCAATCCCTCATAGGTAAGATAAAAAC,GTTTCAATCCCTCATAGGTAAGATAAAAAC	30,31,30	0	0	NA	NA	NA:NA:NA	8,7,6	8	Orphan	DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	NA|190aa|up_6|NZ_CP049703.1_695317_695887_+,NA|48aa|up_1|NZ_CP049703.1_700651_700795_-,NA	NA|322aa|up_9|NZ_CP049703.1_692123_693089_-	TIGR04383, hypothetical_protein_HMPREF1015_01194, processed acidic surface protein	NA|459aa|up_8|NZ_CP049703.1_693221_694598_-	PRK09034, PRK09034, aspartate kinase; Reviewed	NA|164aa|up_7|NZ_CP049703.1_694678_695170_-	pfam06935, DUF1284, Protein of unknown function (DUF1284)	NA|190aa|up_6|NZ_CP049703.1_695317_695887_+	NA	NA|507aa|up_5|NZ_CP049703.1_696049_697570_+	cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II-like	NA|113aa|up_4|NZ_CP049703.1_697641_697980_+	pfam05610, DUF779, Protein of unknown function (DUF779)	NA|542aa|up_3|NZ_CP049703.1_698093_699719_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|314aa|up_2|NZ_CP049703.1_699697_700639_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|48aa|up_1|NZ_CP049703.1_700651_700795_-	NA	NA|292aa|up_0|NZ_CP049703.1_700998_701874_-	pfam00480, ROK, ROK family	NA|274aa|down_0|NZ_CP049703.1_702864_703686_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|324aa|down_1|NZ_CP049703.1_703685_704657_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|435aa|down_2|NZ_CP049703.1_704720_706025_-	COG1653, UgpB, ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism]	NA|516aa|down_3|NZ_CP049703.1_706169_707717_-	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|486aa|down_4|NZ_CP049703.1_707713_709171_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|338aa|down_5|NZ_CP049703.1_709163_710177_-	cd06314, PBP1_tmGBP, periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs	NA|455aa|down_6|NZ_CP049703.1_710326_711691_-	PLN02805, PLN02805, D-lactate dehydrogenase [cytochrome]	NA|326aa|down_7|NZ_CP049703.1_711712_712690_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|361aa|down_8|NZ_CP049703.1_712691_713774_-	TIGR03181, PDH_E1_alph_x, pyruvate dehydrogenase E1 component, alpha subunit	NA|399aa|down_9|NZ_CP049703.1_713792_714989_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed
GCF_003688615.2_ASM368861v2	NZ_CP049703	Parageobacillus toebii NBRC 107807 strain DSM 14590 chromosome, complete genome	2	1224427-1224594	2	CRISPRCasFinder	no	Cas14u_CAS-V	DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	Unclear	GGTTTTTATCTTACCTATGAGGAATTGAAAC	31	0	0	NA	NA	NA	2	2	Unclear	DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	NA,NA|198aa|down_9|NZ_CP049703.1_1235622_1236216_+	NA|159aa|up_9|NZ_CP049703.1_1204323_1204800_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|97aa|up_8|NZ_CP049703.1_1204992_1205283_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|487aa|up_7|NZ_CP049703.1_1205296_1206757_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|477aa|up_6|NZ_CP049703.1_1206770_1208201_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|338aa|up_5|NZ_CP049703.1_1208558_1209572_+	cd13520, PBP2_TAXI_TRAP, Substrate binding domain of TAXI proteins of the tripartite ATP-independent periplasmic transporters; the type 2 periplasmic binding protein fold	NA|663aa|up_4|NZ_CP049703.1_1209733_1211722_+	TIGR02123, conserved_inner_membrane_protein, TRAP transporter, 4TM/12TM fusion protein	NA|461aa|up_3|NZ_CP049703.1_1211852_1213235_+	COG0001, HemL, Glutamate-1-semialdehyde aminotransferase [Coenzyme metabolism]	NA|283aa|up_2|NZ_CP049703.1_1213278_1214127_+	COG1737, RpiR, Transcriptional regulators [Transcription]	Cas14u_CAS-V|361aa|up_1|NZ_CP049703.1_1214620_1215703_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|389aa|up_0|NZ_CP049703.1_1217380_1218547_+	pfam09661, DUF2398, Protein of unknown function (DUF2398)	NA|360aa|down_0|NZ_CP049703.1_1224921_1226001_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|264aa|down_1|NZ_CP049703.1_1225990_1226782_+	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|269aa|down_2|NZ_CP049703.1_1226792_1227599_+	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|358aa|down_3|NZ_CP049703.1_1227595_1228669_+	cd13663, PBP2_PotD_PotF_like_2, The periplasmic substrate-binding component of an uncharacterized active transport system closely related to spermidine and putrescine transporters; contains the type 2 periplasmic binding fold	NA|369aa|down_4|NZ_CP049703.1_1228689_1229796_+	cd06160, S2P-M50_like_2, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|186aa|down_5|NZ_CP049703.1_1230598_1231156_+	pfam01957, NfeD, NfeD-like C-terminal, partner-binding	NA|508aa|down_6|NZ_CP049703.1_1231155_1232679_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|309aa|down_7|NZ_CP049703.1_1232800_1233727_+	PRK13337, PRK13337, putative lipid kinase; Reviewed	NA|461aa|down_8|NZ_CP049703.1_1233922_1235305_+	TIGR00479, 23S_rRNA_uracil1939-C5-methyltransferase_RlmD, 23S rRNA (uracil-5-)-methyltransferase RumA	NA|198aa|down_9|NZ_CP049703.1_1235622_1236216_+	NA
GCF_003688615.2_ASM368861v2	NZ_CP049703	Parageobacillus toebii NBRC 107807 strain DSM 14590 chromosome, complete genome	3	1701696-1701810	3	CRISPRCasFinder	no		DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	Orphan	CATTCATTAAACAAAAAAGATGAGAAGAATC	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	NA,NA|47aa|down_2|NZ_CP049703.1_1703560_1703701_+,NA|54aa|down_3|NZ_CP049703.1_1703700_1703862_+,NA|49aa|down_4|NZ_CP049703.1_1703858_1704005_+	NA|120aa|up_9|NZ_CP049703.1_1691346_1691706_+	PRK14205, PRK14205, fluoride efflux transporter CrcB	NA|251aa|up_8|NZ_CP049703.1_1691774_1692527_+	pfam06738, ThrE, Putative threonine/serine exporter	NA|144aa|up_7|NZ_CP049703.1_1692545_1692977_+	pfam12821, ThrE_2, Threonine/Serine exporter, ThrE	NA|390aa|up_6|NZ_CP049703.1_1693163_1694333_+	cd08018, M20_Acy1_amhX-like, M20 Peptidase aminoacylase 1 amhX-like subfamily	NA|275aa|up_5|NZ_CP049703.1_1694307_1695132_+	COG3375, COG3375, Uncharacterized conserved protein [Function unknown]	NA|372aa|up_4|NZ_CP049703.1_1695148_1696264_+	cd03317, NAAAR, N-acylamino acid racemase (NAAAR), an octameric enzyme that catalyzes the racemization of N-acylamino acids	NA|514aa|up_3|NZ_CP049703.1_1696446_1697988_-	cd11480, SLC5sbd_u4, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|114aa|up_2|NZ_CP049703.1_1697987_1698329_-	pfam04341, DUF485, Protein of unknown function, DUF485	NA|477aa|up_1|NZ_CP049703.1_1698628_1700059_+	TIGR00909, putative_amino_acid_transporter, amino acid transporter	NA|246aa|up_0|NZ_CP049703.1_1700229_1700967_+	PRK08311, PRK08311, RNA polymerase sigma factor SigI	NA|70aa|down_0|NZ_CP049703.1_1702257_1702467_-	pfam00269, SASP, Small, acid-soluble spore proteins, alpha/beta type	NA|115aa|down_1|NZ_CP049703.1_1703111_1703456_-	pfam13045, DUF3905, Protein of unknown function (DUF3905)	NA|47aa|down_2|NZ_CP049703.1_1703560_1703701_+	NA	NA|54aa|down_3|NZ_CP049703.1_1703700_1703862_+	NA	NA|49aa|down_4|NZ_CP049703.1_1703858_1704005_+	NA	NA|591aa|down_5|NZ_CP049703.1_1704014_1705787_-	pfam13311, DUF4080, Protein of unknown function (DUF4080)	NA|745aa|down_6|NZ_CP049703.1_1705984_1708219_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|166aa|down_7|NZ_CP049703.1_1708193_1708691_+	PRK00901, PRK00901, methylated-DNA--protein-cysteine methyltransferase; Provisional	NA|337aa|down_8|NZ_CP049703.1_1708868_1709879_-	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|516aa|down_9|NZ_CP049703.1_1709823_1711371_-	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]
GCF_003688615.2_ASM368861v2	NZ_CP049703	Parageobacillus toebii NBRC 107807 strain DSM 14590 chromosome, complete genome	4	2937682-2937778	4	CRISPRCasFinder	no		DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	Orphan	AAGCTCTTCACAGGAAACAATCGTT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,Cas14b_CAS-V-F,csa3,cas3,Cas14u_CAS-V,cas14k,c2c10_CAS-V-U3,DinG	NA|205aa|up_7|NZ_CP049703.1_2928792_2929407_-,NA|140aa|up_6|NZ_CP049703.1_2929899_2930319_+,NA|156aa|up_5|NZ_CP049703.1_2930465_2930933_+,NA|111aa|up_4|NZ_CP049703.1_2931055_2931388_+,NA|133aa|up_3|NZ_CP049703.1_2931659_2932058_-,NA|222aa|down_0|NZ_CP049703.1_2938015_2938681_+	NA|349aa|up_9|NZ_CP049703.1_2923989_2925036_-	TIGR02393, RNA_polymerase_sigma_factor_RpoD, RNA polymerase sigma factor RpoD, C-terminal domain	NA|1086aa|up_8|NZ_CP049703.1_2925139_2928397_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|205aa|up_7|NZ_CP049703.1_2928792_2929407_-	NA	NA|140aa|up_6|NZ_CP049703.1_2929899_2930319_+	NA	NA|156aa|up_5|NZ_CP049703.1_2930465_2930933_+	NA	NA|111aa|up_4|NZ_CP049703.1_2931055_2931388_+	NA	NA|133aa|up_3|NZ_CP049703.1_2931659_2932058_-	NA	NA|308aa|up_2|NZ_CP049703.1_2932077_2933001_-	TIGR03814, Glutaminase_1, glutaminase A	NA|391aa|up_1|NZ_CP049703.1_2933290_2934463_-	PRK05958, PRK05958, 8-amino-7-oxononanoate synthase; Reviewed	NA|363aa|up_0|NZ_CP049703.1_2935284_2936373_+	pfam08757, CotH, CotH kinase protein	NA|222aa|down_0|NZ_CP049703.1_2938015_2938681_+	NA	NA|264aa|down_1|NZ_CP049703.1_2938974_2939766_-	pfam10086, DUF2324, Putative membrane peptidase family (DUF2324)	NA|130aa|down_2|NZ_CP049703.1_2939785_2940175_-	cd07244, FosA, fosfomycin resistant protein subfamily FosA	NA|171aa|down_3|NZ_CP049703.1_2940495_2941008_-	COG2259, COG2259, Predicted membrane protein [Function unknown]	NA|114aa|down_4|NZ_CP049703.1_2941265_2941607_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|293aa|down_5|NZ_CP049703.1_2941658_2942537_-	PRK05755, PRK05755, DNA polymerase I; Provisional	NA|388aa|down_6|NZ_CP049703.1_2943550_2944714_-	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|414aa|down_7|NZ_CP049703.1_2945376_2946618_+	pfam02073, Peptidase_M29, Thermophilic metalloprotease (M29)	NA|193aa|down_8|NZ_CP049703.1_2946772_2947351_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|203aa|down_9|NZ_CP049703.1_2947525_2948134_-	pfam01841, Transglut_core, Transglutaminase-like superfamily
