assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000292455.1_ASM29245v1	NC_018500	Bacillus thuringiensis HD-771, complete genome	1	2170798-2170962	1	PILER-CR	no	cas14j	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	Unclear	TCCAGTTGGGCCTGTGGG	18	0	0	NA	NA	NA	2	2	TypeV	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	NA|198aa|up_5|NC_018500.1_2163185_2163779_+,NA|262aa|down_0|NC_018500.1_2171888_2172674_-,NA|51aa|down_5|NC_018500.1_2179025_2179178_+,NA|93aa|down_6|NC_018500.1_2179376_2179655_+	NA|369aa|up_9|NC_018500.1_2155837_2156944_+	pfam03845, Spore_permease, Spore germination protein	NA|375aa|up_8|NC_018500.1_2156940_2158065_+	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|720aa|up_7|NC_018500.1_2159535_2161695_-	PRK07726, PRK07726, DNA topoisomerase 3	NA|305aa|up_6|NC_018500.1_2161730_2162645_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|198aa|up_5|NC_018500.1_2163185_2163779_+	NA	NA|110aa|up_4|NC_018500.1_2163771_2164101_+	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|120aa|up_3|NC_018500.1_2164730_2165090_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|172aa|up_2|NC_018500.1_2165710_2166226_-	pfam13042, DUF3902, Protein of unknown function (DUF3902)	NA|185aa|up_1|NC_018500.1_2166709_2167264_+	cd02209, cupin_XRE_C, XRE (Xenobiotic Response Element) family transcriptional regulators, C-terminal cupin domain	NA|326aa|up_0|NC_018500.1_2167348_2168326_-	TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated beta-propeller protein	NA|262aa|down_0|NC_018500.1_2171888_2172674_-	NA	cas14j|100aa|down_1|NC_018500.1_2174511_2174811_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|392aa|down_2|NC_018500.1_2175826_2177002_-	TIGR04092, hypothetical_protein, D-alanyl-lipoteichoic acid biosynthesis protein DltD	NA|80aa|down_3|NC_018500.1_2176998_2177238_-	PRK05087, PRK05087, D-alanine--poly(phosphoribitol) ligase subunit DltC	NA|407aa|down_4|NC_018500.1_2177621_2178842_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|51aa|down_5|NC_018500.1_2179025_2179178_+	NA	NA|93aa|down_6|NC_018500.1_2179376_2179655_+	NA	NA|378aa|down_7|NC_018500.1_2180249_2181383_-	pfam05791, Bacillus_HBL, Bacillus haemolytic enterotoxin (HBL)	NA|410aa|down_8|NC_018500.1_2181414_2182644_-	pfam05791, Bacillus_HBL, Bacillus haemolytic enterotoxin (HBL)	NA|440aa|down_9|NC_018500.1_2182706_2184026_-	pfam05791, Bacillus_HBL, Bacillus haemolytic enterotoxin (HBL)
GCF_000292455.1_ASM29245v1	NC_018500	Bacillus thuringiensis HD-771, complete genome	2	3501685-3501760	1	CRISPRCasFinder	no	csa3	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	Type I-A	TGATTGTGTCCTCCATGGTGATGAT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	NA,NA	NA|324aa|up_9|NC_018500.1_3487864_3488836_-	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|229aa|up_8|NC_018500.1_3488859_3489546_-	pfam02517, Abi, CAAX protease self-immunity	NA|501aa|up_7|NC_018500.1_3489696_3491199_+	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|369aa|up_6|NC_018500.1_3491179_3492286_+	pfam03845, Spore_permease, Spore germination protein	NA|86aa|up_5|NC_018500.1_3492266_3492524_+	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|554aa|up_4|NC_018500.1_3493420_3495082_-	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|476aa|up_3|NC_018500.1_3495095_3496523_-	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|237aa|up_2|NC_018500.1_3496665_3497376_-	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|466aa|up_1|NC_018500.1_3497828_3499226_+	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|568aa|up_0|NC_018500.1_3499257_3500961_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|510aa|down_0|NC_018500.1_3502010_3503540_+	PRK12452, PRK12452, cardiolipin synthase	NA|298aa|down_1|NC_018500.1_3503667_3504561_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|628aa|down_2|NC_018500.1_3504564_3506448_+	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|141aa|down_3|NC_018500.1_3506485_3506908_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|322aa|down_4|NC_018500.1_3506967_3507933_-	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|397aa|down_5|NC_018500.1_3507977_3509168_-	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|244aa|down_6|NC_018500.1_3509381_3510113_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|353aa|down_7|NC_018500.1_3510971_3512030_-	pfam01032, FecCD, FecCD transport family	NA|335aa|down_8|NC_018500.1_3512026_3513031_-	pfam01032, FecCD, FecCD transport family	NA|323aa|down_9|NC_018500.1_3513203_3514172_+	cd01146, FhuD, Fe3+-siderophore binding domain FhuD
GCF_000292455.1_ASM29245v1	NC_018500	Bacillus thuringiensis HD-771, complete genome	3	4358900-4359033	2	CRISPRCasFinder	no		DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	Orphan	GACAAATCTCAAAAAGAAGAGAA	23	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	NA,NA|45aa|down_0|NC_018500.1_4359226_4359361_+	NA|621aa|up_9|NC_018500.1_4346532_4348395_+	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed	NA|501aa|up_8|NC_018500.1_4348391_4349894_+	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|507aa|up_7|NC_018500.1_4349895_4351416_+	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|79aa|up_6|NC_018500.1_4351618_4351855_+	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|237aa|up_5|NC_018500.1_4351900_4352611_+	pfam08680, DUF1779, TATA-box binding	NA|435aa|up_4|NC_018500.1_4352650_4353955_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|340aa|up_3|NC_018500.1_4354164_4355184_+	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|336aa|up_2|NC_018500.1_4355282_4356290_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|281aa|up_1|NC_018500.1_4356471_4357314_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|235aa|up_0|NC_018500.1_4357313_4358018_+	pfam12679, ABC2_membrane_2, ABC-2 family transporter protein	NA|45aa|down_0|NC_018500.1_4359226_4359361_+	NA	NA|91aa|down_1|NC_018500.1_4359669_4359942_+	pfam12116, SpoIIID, Stage III sporulation protein D	NA|334aa|down_2|NC_018500.1_4360102_4361104_+	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|145aa|down_3|NC_018500.1_4361533_4361968_+	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|226aa|down_4|NC_018500.1_4362309_4362987_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|down_5|NC_018500.1_4363250_4363994_+	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|234aa|down_6|NC_018500.1_4363983_4364685_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|256aa|down_7|NC_018500.1_4364796_4365564_+	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|293aa|down_8|NC_018500.1_4365803_4366682_+	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|220aa|down_9|NC_018500.1_4366700_4367360_+	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
GCF_000292455.1_ASM29245v1	NC_018501	Bacillus thuringiensis HD-771 plasmid p02, complete sequence	1	45294-45457	1	CRISPRCasFinder	no	c2c10_CAS-V-U3,csa3	c2c10_CAS-V-U3,csa3	Type V-U3, Type I-A?	GTTTAAACCAAACAATAGATGTATTGAAATTT	32	1	1	45391-45424	NC_018487.1_39201-39234	V-U3	2	2	TypeV-U3,TypeI-A?	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	NA|238aa|up_8|NC_018501.1_36655_37369_+,NA|210aa|up_7|NC_018501.1_37706_38336_+,NA|212aa|up_6|NC_018501.1_38335_38971_+,NA|205aa|up_5|NC_018501.1_39021_39636_-,NA|194aa|up_4|NC_018501.1_39619_40201_-,NA|128aa|up_3|NC_018501.1_40326_40710_+,NA|610aa|up_1|NC_018501.1_41710_43540_-,NA|187aa|down_1|NC_018501.1_48968_49529_+,NA|67aa|down_3|NC_018501.1_50220_50421_+,NA|96aa|down_4|NC_018501.1_50498_50786_+,NA|119aa|down_8|NC_018501.1_54063_54420_+,NA|143aa|down_9|NC_018501.1_54532_54961_+	NA|1097aa|up_9|NC_018501.1_33348_36639_+	cd18793, SF2_C_SNF, C-terminal helicase domain of the SNF family helicases	NA|238aa|up_8|NC_018501.1_36655_37369_+	NA	NA|210aa|up_7|NC_018501.1_37706_38336_+	NA	NA|212aa|up_6|NC_018501.1_38335_38971_+	NA	NA|205aa|up_5|NC_018501.1_39021_39636_-	NA	NA|194aa|up_4|NC_018501.1_39619_40201_-	NA	NA|128aa|up_3|NC_018501.1_40326_40710_+	NA	NA|213aa|up_2|NC_018501.1_40726_41365_+	pfam08378, NERD, Nuclease-related domain	NA|610aa|up_1|NC_018501.1_41710_43540_-	NA	c2c10_CAS-V-U3|454aa|up_0|NC_018501.1_43665_45027_+	pfam07282, OrfB_Zn_ribbon, Putative transposase DNA-binding domain	NA|888aa|down_0|NC_018501.1_46141_48805_+	PRK07726, PRK07726, DNA topoisomerase 3	NA|187aa|down_1|NC_018501.1_48968_49529_+	NA	NA|216aa|down_2|NC_018501.1_49521_50169_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|67aa|down_3|NC_018501.1_50220_50421_+	NA	NA|96aa|down_4|NC_018501.1_50498_50786_+	NA	csa3|98aa|down_5|NC_018501.1_50932_51226_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|251aa|down_6|NC_018501.1_51476_52229_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|432aa|down_7|NC_018501.1_52218_53514_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|119aa|down_8|NC_018501.1_54063_54420_+	NA	NA|143aa|down_9|NC_018501.1_54532_54961_+	NA
GCF_000292455.1_ASM29245v1	NC_018501	Bacillus thuringiensis HD-771 plasmid p02, complete sequence	2	45767-45992	2,1	CRISPRCasFinder,CRT	no	c2c10_CAS-V-U3,csa3	c2c10_CAS-V-U3,csa3	Type V-U3, Type I-A?	GTTTAAACCAAACAATAGATGTATTGAAATTT,AACAATAGATGTATTGAAAT	32,20	0	0	NA	NA	V-U3:NA	1,3	3	TypeV-U3,TypeI-A?	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	NA|238aa|up_8|NC_018501.1_36655_37369_+,NA|210aa|up_7|NC_018501.1_37706_38336_+,NA|212aa|up_6|NC_018501.1_38335_38971_+,NA|205aa|up_5|NC_018501.1_39021_39636_-,NA|194aa|up_4|NC_018501.1_39619_40201_-,NA|128aa|up_3|NC_018501.1_40326_40710_+,NA|610aa|up_1|NC_018501.1_41710_43540_-,NA|187aa|down_1|NC_018501.1_48968_49529_+,NA|67aa|down_3|NC_018501.1_50220_50421_+,NA|96aa|down_4|NC_018501.1_50498_50786_+,NA|119aa|down_8|NC_018501.1_54063_54420_+,NA|143aa|down_9|NC_018501.1_54532_54961_+	NA|1097aa|up_9|NC_018501.1_33348_36639_+	cd18793, SF2_C_SNF, C-terminal helicase domain of the SNF family helicases	NA|238aa|up_8|NC_018501.1_36655_37369_+	NA	NA|210aa|up_7|NC_018501.1_37706_38336_+	NA	NA|212aa|up_6|NC_018501.1_38335_38971_+	NA	NA|205aa|up_5|NC_018501.1_39021_39636_-	NA	NA|194aa|up_4|NC_018501.1_39619_40201_-	NA	NA|128aa|up_3|NC_018501.1_40326_40710_+	NA	NA|213aa|up_2|NC_018501.1_40726_41365_+	pfam08378, NERD, Nuclease-related domain	NA|610aa|up_1|NC_018501.1_41710_43540_-	NA	c2c10_CAS-V-U3|454aa|up_0|NC_018501.1_43665_45027_+	pfam07282, OrfB_Zn_ribbon, Putative transposase DNA-binding domain	NA|888aa|down_0|NC_018501.1_46141_48805_+	PRK07726, PRK07726, DNA topoisomerase 3	NA|187aa|down_1|NC_018501.1_48968_49529_+	NA	NA|216aa|down_2|NC_018501.1_49521_50169_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|67aa|down_3|NC_018501.1_50220_50421_+	NA	NA|96aa|down_4|NC_018501.1_50498_50786_+	NA	csa3|98aa|down_5|NC_018501.1_50932_51226_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|251aa|down_6|NC_018501.1_51476_52229_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|432aa|down_7|NC_018501.1_52218_53514_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|119aa|down_8|NC_018501.1_54063_54420_+	NA	NA|143aa|down_9|NC_018501.1_54532_54961_+	NA
GCF_000292455.1_ASM29245v1	NC_018503	Bacillus thuringiensis HD-771 plasmid p07, complete sequence	1	5477-5674	1	CRT	no			Orphan	NCCAGANACTAAACCAGA	18	4	4	5495-5536|5555-5572|5591-5620|5639-5656	NC_018503.1_5459-5500|NC_018503.1_5459-5476|NC_018503.1_5459-5488|NC_018503.1_5663-5680	NA	4	4	Orphan	DEDDh,cas14k,WYL,csa3,cas3,cas14j,DinG,c2c10_CAS-V-U3,c2c9_V-U4	NA|208aa|up_4|NC_018503.1_99_723_+,NA|385aa|up_3|NC_018503.1_1268_2423_-,NA|188aa|up_2|NC_018503.1_2434_2998_-,NA|82aa|up_1|NC_018503.1_3853_4099_+,NA|215aa|up_0|NC_018503.1_4126_4771_+,NA|788aa|down_0|NC_018503.1_6258_8622_+	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|208aa|up_4|NC_018503.1_99_723_+	NA	NA|385aa|up_3|NC_018503.1_1268_2423_-	NA	NA|188aa|up_2|NC_018503.1_2434_2998_-	NA	NA|82aa|up_1|NC_018503.1_3853_4099_+	NA	NA|215aa|up_0|NC_018503.1_4126_4771_+	NA	NA|788aa|down_0|NC_018503.1_6258_8622_+	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
