assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001598095.1_ASM159809v1	NZ_CP014847	Bacillus thuringiensis strain HD12, complete sequence	1	668646-668721	1	CRISPRCasFinder	no	csa3	cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh	Type I-A	ATCATCATCATGGAGGACACAATCA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh,Cas14u_CAS-V,cas14j	NA,NA	NA|335aa|up_9|NZ_CP014847.1_657367_658372_+	pfam01032, FecCD, FecCD transport family	NA|353aa|up_8|NZ_CP014847.1_658368_659427_+	pfam01032, FecCD, FecCD transport family	NA|274aa|up_7|NZ_CP014847.1_659439_660261_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|244aa|up_6|NZ_CP014847.1_660292_661024_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|397aa|up_5|NZ_CP014847.1_661237_662428_+	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|322aa|up_4|NZ_CP014847.1_662472_663438_+	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|141aa|up_3|NZ_CP014847.1_663497_663920_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|628aa|up_2|NZ_CP014847.1_663957_665841_-	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|298aa|up_1|NZ_CP014847.1_665844_666738_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|510aa|up_0|NZ_CP014847.1_666865_668395_-	PRK12452, PRK12452, cardiolipin synthase	NA|466aa|down_0|NZ_CP014847.1_671175_672573_-	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|237aa|down_1|NZ_CP014847.1_673025_673736_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|476aa|down_2|NZ_CP014847.1_673878_675306_+	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|554aa|down_3|NZ_CP014847.1_675319_676981_+	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|376aa|down_4|NZ_CP014847.1_677013_678141_-	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|390aa|down_5|NZ_CP014847.1_678056_679226_-	pfam03845, Spore_permease, Spore germination protein	NA|501aa|down_6|NZ_CP014847.1_679206_680709_-	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|324aa|down_7|NZ_CP014847.1_680897_681869_+	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|487aa|down_8|NZ_CP014847.1_682028_683489_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family	NA|244aa|down_9|NZ_CP014847.1_683620_684352_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]
GCF_001598095.1_ASM159809v1	NZ_CP014847	Bacillus thuringiensis strain HD12, complete sequence	2	3891562-3892560	1	CRT	no		cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh	Orphan	TGGACCTCAAGGTGTTCAAGGTAACAC	27	17	27	3891589-3891606|3891634-3891651|3891679-3891696|3891724-3891741|3891823-3891840|3891868-3891885|3891868-3891885|3891913-3891930|3891958-3891975|3891958-3891975|3892003-3892020|3892102-3892119|3892147-3892164|3892147-3892164|3892147-3892164|3892147-3892164|3892192-3892209|3892192-3892209|3892237-3892254|3892237-3892254|3892237-3892254|3892237-3892254|3892282-3892299|3892327-3892344|3892372-3892389|3892417-3892434|3892417-3892434	NZ_CP014847.1_3892570-3892587|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892804-3892821|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892804-3892821|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892804-3892821|NZ_CP014847.1_3891517-3891534|NZ_CP014847.1_3892714-3892731|NZ_CP014847.1_3892759-3892776|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892804-3892821|NZ_CP014847.1_3892804-3892821|NZ_CP014847.1_3891517-3891534|NZ_CP014847.1_3892714-3892731|NZ_CP014847.1_3892759-3892776|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892570-3892587|NZ_CP014847.1_3892561-3892578|NZ_CP014847.1_3892849-3892866|NZ_CP014847.1_3892894-3892911	NA	21	21	Orphan	cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh,Cas14u_CAS-V,cas14j	NA|65aa|up_9|NZ_CP014847.1_3880678_3880873_-,NA|71aa|up_8|NZ_CP014847.1_3881839_3882052_-,NA|82aa|down_3|NZ_CP014847.1_3894837_3895083_+	NA|65aa|up_9|NZ_CP014847.1_3880678_3880873_-	NA	NA|71aa|up_8|NZ_CP014847.1_3881839_3882052_-	NA	NA|220aa|up_7|NZ_CP014847.1_3882255_3882915_+	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|445aa|up_6|NZ_CP014847.1_3883253_3884588_-	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|130aa|up_5|NZ_CP014847.1_3884636_3885026_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|424aa|up_4|NZ_CP014847.1_3885200_3886472_-	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|425aa|up_3|NZ_CP014847.1_3886464_3887739_-	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|208aa|up_2|NZ_CP014847.1_3887831_3888455_+	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|319aa|up_1|NZ_CP014847.1_3888723_3889680_-	TIGR02881, Stage_V_sporulation_protein_K, stage V sporulation protein K	NA|322aa|up_0|NZ_CP014847.1_3889931_3890897_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|259aa|down_0|NZ_CP014847.1_3892600_3893377_+	pfam01391, Collagen, Collagen triple helix repeat (20 copies)	NA|75aa|down_1|NZ_CP014847.1_3893457_3893682_-	PRK00395, hfq, RNA-binding protein Hfq; Provisional	NA|318aa|down_2|NZ_CP014847.1_3893703_3894657_-	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|82aa|down_3|NZ_CP014847.1_3894837_3895083_+	NA	NA|317aa|down_4|NZ_CP014847.1_3895145_3896096_-	COG3584, COG3584, Uncharacterized protein conserved in bacteria [Function unknown]	NA|619aa|down_5|NZ_CP014847.1_3896444_3898301_-	PRK10712, PRK10712, PTS system fructose-specific transporter subunits IIBC; Provisional	NA|304aa|down_6|NZ_CP014847.1_3898314_3899226_-	cd01164, FruK_PfkB_like, 1-phosphofructokinase (FruK), minor 6-phosphofructokinase (pfkB) and related sugar kinases	NA|251aa|down_7|NZ_CP014847.1_3899222_3899975_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|388aa|down_8|NZ_CP014847.1_3900133_3901297_-	cd08187, BDH, Butanol dehydrogenase catalyzes the conversion of butyraldehyde to butanol with the cofactor NAD(P)H being oxidized in the process	NA|181aa|down_9|NZ_CP014847.1_3901412_3901955_-	COG2322, COG2322, Predicted membrane protein [Function unknown]
GCF_001598095.1_ASM159809v1	NZ_CP014847	Bacillus thuringiensis strain HD12, complete sequence	3	5188251-5188367	2	CRISPRCasFinder	no		cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh	Orphan	CTTAAACAAGCGTTTGATTAATTCTCCATTTTTCTT	36	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh,Cas14u_CAS-V,cas14j	NA|115aa|up_4|NZ_CP014847.1_5185380_5185725_-,NA|176aa|down_0|NZ_CP014847.1_5188466_5188994_-,NA|62aa|down_5|NZ_CP014847.1_5192345_5192531_-	NA|262aa|up_9|NZ_CP014847.1_5180545_5181331_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP014847.1_5181569_5182376_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP014847.1_5182447_5183260_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP014847.1_5183283_5183949_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP014847.1_5183941_5184967_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP014847.1_5185380_5185725_-	NA	NA|100aa|up_3|NZ_CP014847.1_5185877_5186177_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP014847.1_5186189_5186534_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP014847.1_5187264_5187648_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP014847.1_5187689_5188055_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|176aa|down_0|NZ_CP014847.1_5188466_5188994_-	NA	NA|216aa|down_1|NZ_CP014847.1_5189138_5189786_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_2|NZ_CP014847.1_5189851_5190865_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|390aa|down_3|NZ_CP014847.1_5190887_5192057_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_4|NZ_CP014847.1_5192083_5192332_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_5|NZ_CP014847.1_5192345_5192531_-	NA	NA|240aa|down_6|NZ_CP014847.1_5192644_5193364_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|601aa|down_7|NZ_CP014847.1_5193479_5195282_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|391aa|down_8|NZ_CP014847.1_5195650_5196823_-	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|794aa|down_9|NZ_CP014847.1_5196844_5199226_-	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]
GCF_001598095.1_ASM159809v1	NZ_CP014847	Bacillus thuringiensis strain HD12, complete sequence	4	5555890-5556023	3	CRISPRCasFinder	no		cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh	Orphan	GTTGATTTCTCTTCTTTTTGAGA	23	0	0	NA	NA	NA	2	2	Orphan	cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh,Cas14u_CAS-V,cas14j	NA|45aa|up_0|NZ_CP014847.1_5555567_5555702_-,NA	NA|295aa|up_9|NZ_CP014847.1_5546802_5547687_-	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|256aa|up_8|NZ_CP014847.1_5547930_5548698_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|234aa|up_7|NZ_CP014847.1_5548808_5549510_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|up_6|NZ_CP014847.1_5549499_5550243_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|449aa|up_5|NZ_CP014847.1_5550539_5551885_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|226aa|up_4|NZ_CP014847.1_5551942_5552620_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|145aa|up_3|NZ_CP014847.1_5552961_5553396_-	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|334aa|up_2|NZ_CP014847.1_5553824_5554826_-	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|91aa|up_1|NZ_CP014847.1_5554986_5555259_-	pfam12116, SpoIIID, Stage III sporulation protein D	NA|45aa|up_0|NZ_CP014847.1_5555567_5555702_-	NA	NA|236aa|down_0|NZ_CP014847.1_5556911_5557619_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|281aa|down_1|NZ_CP014847.1_5557618_5558461_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|336aa|down_2|NZ_CP014847.1_5558642_5559650_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|340aa|down_3|NZ_CP014847.1_5559747_5560767_-	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|435aa|down_4|NZ_CP014847.1_5560974_5562279_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|237aa|down_5|NZ_CP014847.1_5562318_5563029_-	pfam08680, DUF1779, TATA-box binding	NA|79aa|down_6|NZ_CP014847.1_5563074_5563311_-	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|507aa|down_7|NZ_CP014847.1_5563513_5565034_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|501aa|down_8|NZ_CP014847.1_5565035_5566538_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|621aa|down_9|NZ_CP014847.1_5566534_5568397_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed
GCF_001598095.1_ASM159809v1	NZ_CP014851	Bacillus thuringiensis strain HD12 plasmid pHD120112, complete sequence	1	100018-100323	1	CRT	no	csa3	RT,WYL,csa3	Type I-A	AAACCAGAGCCGAAGCCA	18	0	0	NA	NA	NA	6	6	Orphan	cas3,RT,cas14k,csa3,WYL,c2c9_V-U4,DinG,DEDDh,Cas14u_CAS-V,cas14j	NA|52aa|up_8|NZ_CP014851.1_93269_93425_-,NA|61aa|up_7|NZ_CP014851.1_93500_93683_-,NA|66aa|up_6|NZ_CP014851.1_93748_93946_-,NA|399aa|up_4|NZ_CP014851.1_95558_96755_-,NA|913aa|down_0|NZ_CP014851.1_101892_104631_+,NA|83aa|down_2|NZ_CP014851.1_106385_106634_+,NA|79aa|down_3|NZ_CP014851.1_106654_106891_+,NA|188aa|down_4|NZ_CP014851.1_106949_107513_+,NA|321aa|down_5|NZ_CP014851.1_107527_108490_+,NA|210aa|down_6|NZ_CP014851.1_108511_109141_+,NA|134aa|down_8|NZ_CP014851.1_110783_111185_+,NA|119aa|down_9|NZ_CP014851.1_111207_111564_+	NA|259aa|up_9|NZ_CP014851.1_92354_93131_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|52aa|up_8|NZ_CP014851.1_93269_93425_-	NA	NA|61aa|up_7|NZ_CP014851.1_93500_93683_-	NA	NA|66aa|up_6|NZ_CP014851.1_93748_93946_-	NA	NA|209aa|up_5|NZ_CP014851.1_94275_94902_-	pfam13411, MerR_1, MerR HTH family regulatory protein	NA|399aa|up_4|NZ_CP014851.1_95558_96755_-	NA	NA|109aa|up_3|NZ_CP014851.1_97390_97717_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|131aa|up_2|NZ_CP014851.1_98145_98538_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|79aa|up_1|NZ_CP014851.1_98639_98876_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|145aa|up_0|NZ_CP014851.1_99071_99506_+	TIGR01637, Putative_autolysin_regulatory_protein_ArpU, phage transcriptional regulator, ArpU family	NA|913aa|down_0|NZ_CP014851.1_101892_104631_+	NA	NA|399aa|down_1|NZ_CP014851.1_104892_106089_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|83aa|down_2|NZ_CP014851.1_106385_106634_+	NA	NA|79aa|down_3|NZ_CP014851.1_106654_106891_+	NA	NA|188aa|down_4|NZ_CP014851.1_106949_107513_+	NA	NA|321aa|down_5|NZ_CP014851.1_107527_108490_+	NA	NA|210aa|down_6|NZ_CP014851.1_108511_109141_+	NA	NA|536aa|down_7|NZ_CP014851.1_109154_110762_+	pfam13155, Toprim_2, Toprim-like	NA|134aa|down_8|NZ_CP014851.1_110783_111185_+	NA	NA|119aa|down_9|NZ_CP014851.1_111207_111564_+	NA
