assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000940785.1_ASM94078v1	NZ_CP010577	Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 chromosome, complete genome	1	610049-610124	1	CRISPRCasFinder	no	csa3	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh	Type I-A	ATCATCATCATGGAGGACACAATCA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh,cas14j	NA,NA	NA|335aa|up_9|NZ_CP010577.1_598770_599775_+	pfam01032, FecCD, FecCD transport family	NA|353aa|up_8|NZ_CP010577.1_599771_600830_+	pfam01032, FecCD, FecCD transport family	NA|274aa|up_7|NZ_CP010577.1_600842_601664_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|244aa|up_6|NZ_CP010577.1_601695_602427_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|397aa|up_5|NZ_CP010577.1_602640_603831_+	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|322aa|up_4|NZ_CP010577.1_603875_604841_+	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|141aa|up_3|NZ_CP010577.1_604900_605323_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|628aa|up_2|NZ_CP010577.1_605360_607244_-	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|298aa|up_1|NZ_CP010577.1_607247_608141_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|510aa|up_0|NZ_CP010577.1_608268_609798_-	PRK12452, PRK12452, cardiolipin synthase	NA|568aa|down_0|NZ_CP010577.1_610847_612551_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|466aa|down_1|NZ_CP010577.1_612582_613980_-	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|237aa|down_2|NZ_CP010577.1_614432_615143_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|476aa|down_3|NZ_CP010577.1_615285_616713_+	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|554aa|down_4|NZ_CP010577.1_616726_618388_+	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|376aa|down_5|NZ_CP010577.1_618420_619548_-	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|390aa|down_6|NZ_CP010577.1_619463_620633_-	pfam03845, Spore_permease, Spore germination protein	NA|501aa|down_7|NZ_CP010577.1_620613_622116_-	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|324aa|down_8|NZ_CP010577.1_622304_623276_+	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|487aa|down_9|NZ_CP010577.1_623435_624896_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family
GCF_000940785.1_ASM94078v1	NZ_CP010577	Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 chromosome, complete genome	2	1195169-1195389	2	CRISPRCasFinder	no		cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh	Orphan	GAAACATGGAATTCGATTGTTGA	23	0	0	NA	NA	NA	3	3	Orphan	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh,cas14j	NA|86aa|up_7|NZ_CP010577.1_1190301_1190559_+,NA|110aa|up_3|NZ_CP010577.1_1191469_1191799_+,NA|121aa|up_1|NZ_CP010577.1_1192399_1192762_+,NA|86aa|down_2|NZ_CP010577.1_1200045_1200303_+,NA|126aa|down_6|NZ_CP010577.1_1207683_1208061_+	NA|195aa|up_9|NZ_CP010577.1_1188500_1189085_+	pfam04586, Peptidase_S78, Caudovirus prohead serine protease	NA|395aa|up_8|NZ_CP010577.1_1189101_1190286_+	TIGR01554, prophage_Lp3_protein_18, phage major capsid protein, HK97 family	NA|86aa|up_7|NZ_CP010577.1_1190301_1190559_+	NA	NA|91aa|up_6|NZ_CP010577.1_1190555_1190828_+	TIGR01560, phage-related_hypothetical_protein, uncharacterized phage protein (possible DNA packaging)	NA|100aa|up_5|NZ_CP010577.1_1190824_1191124_+	TIGR01563, hypothetical_protein_AGR_C_1752, phage head-tail adaptor, putative, SPP1 family	NA|119aa|up_4|NZ_CP010577.1_1191116_1191473_+	pfam04883, HK97-gp10_like, Bacteriophage HK97-gp10, putative tail-component	NA|110aa|up_3|NZ_CP010577.1_1191469_1191799_+	NA	NA|198aa|up_2|NZ_CP010577.1_1191799_1192393_+	TIGR01603, Uncharacterized_phage_related_protein, phage major tail protein, phi13 family	NA|121aa|up_1|NZ_CP010577.1_1192399_1192762_+	NA	NA|415aa|up_0|NZ_CP010577.1_1192992_1194237_+	COG5280, COG5280, Phage-related minor tail protein [Function unknown]	NA|1006aa|down_0|NZ_CP010577.1_1195880_1198898_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|307aa|down_1|NZ_CP010577.1_1198969_1199890_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|86aa|down_2|NZ_CP010577.1_1200045_1200303_+	NA	NA|88aa|down_3|NZ_CP010577.1_1200271_1200535_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|490aa|down_4|NZ_CP010577.1_1201812_1203282_+	TIGR01633, unnamed_protein_product, putative phage tail component, N-terminal domain	NA|1463aa|down_5|NZ_CP010577.1_1203278_1207667_+	TIGR01665, structural_protein, phage minor structural protein, N-terminal region	NA|126aa|down_6|NZ_CP010577.1_1207683_1208061_+	NA	NA|142aa|down_7|NZ_CP010577.1_1208098_1208524_+	COG4824, COG4824, Phage-related holin (Lysis protein) [General function prediction only]	NA|312aa|down_8|NZ_CP010577.1_1208523_1209459_+	COG5632, COG5632, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|368aa|down_9|NZ_CP010577.1_1210013_1211117_+	PRK09354, recA, recombinase A; Provisional
GCF_000940785.1_ASM94078v1	NZ_CP010577	Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 chromosome, complete genome	3	5100859-5100975	3	CRISPRCasFinder	no		cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh	Orphan	CTTAAACAAGCGTTTGATTAATTCTCCATTTTTCTT	36	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh,cas14j	NA|115aa|up_4|NZ_CP010577.1_5097988_5098333_-,NA|176aa|down_0|NZ_CP010577.1_5101074_5101602_-,NA|62aa|down_5|NZ_CP010577.1_5104953_5105139_-	NA|262aa|up_9|NZ_CP010577.1_5093153_5093939_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP010577.1_5094177_5094984_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP010577.1_5095055_5095868_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP010577.1_5095891_5096557_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP010577.1_5096549_5097575_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP010577.1_5097988_5098333_-	NA	NA|103aa|up_3|NZ_CP010577.1_5098485_5098794_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP010577.1_5098797_5099142_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP010577.1_5099872_5100256_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP010577.1_5100297_5100663_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|176aa|down_0|NZ_CP010577.1_5101074_5101602_-	NA	NA|216aa|down_1|NZ_CP010577.1_5101746_5102394_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_2|NZ_CP010577.1_5102459_5103473_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|390aa|down_3|NZ_CP010577.1_5103495_5104665_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_4|NZ_CP010577.1_5104691_5104940_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_5|NZ_CP010577.1_5104953_5105139_-	NA	NA|240aa|down_6|NZ_CP010577.1_5105252_5105972_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|595aa|down_7|NZ_CP010577.1_5106087_5107872_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|391aa|down_8|NZ_CP010577.1_5108258_5109431_-	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|794aa|down_9|NZ_CP010577.1_5109452_5111834_-	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]
GCF_000940785.1_ASM94078v1	NZ_CP010577	Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 chromosome, complete genome	4	5420442-5420575	4	CRISPRCasFinder	no		cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh	Orphan	GTTGATTTCTCTTCTTTTTGAGA	23	0	0	NA	NA	NA	2	2	Orphan	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh,cas14j	NA|45aa|up_0|NZ_CP010577.1_5420119_5420254_-,NA	NA|217aa|up_9|NZ_CP010577.1_5410454_5411105_-	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase	NA|479aa|up_8|NZ_CP010577.1_5411493_5412930_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|256aa|up_7|NZ_CP010577.1_5413919_5414687_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|234aa|up_6|NZ_CP010577.1_5414797_5415499_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|up_5|NZ_CP010577.1_5415488_5416232_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|226aa|up_4|NZ_CP010577.1_5416494_5417172_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|145aa|up_3|NZ_CP010577.1_5417513_5417948_-	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|334aa|up_2|NZ_CP010577.1_5418376_5419378_-	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|91aa|up_1|NZ_CP010577.1_5419538_5419811_-	pfam12116, SpoIIID, Stage III sporulation protein D	NA|45aa|up_0|NZ_CP010577.1_5420119_5420254_-	NA	NA|236aa|down_0|NZ_CP010577.1_5421463_5422171_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|281aa|down_1|NZ_CP010577.1_5422170_5423013_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|336aa|down_2|NZ_CP010577.1_5423194_5424202_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|340aa|down_3|NZ_CP010577.1_5424299_5425319_-	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|435aa|down_4|NZ_CP010577.1_5425526_5426831_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|237aa|down_5|NZ_CP010577.1_5426870_5427581_-	pfam08680, DUF1779, TATA-box binding	NA|79aa|down_6|NZ_CP010577.1_5427626_5427863_-	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|507aa|down_7|NZ_CP010577.1_5428065_5429586_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|501aa|down_8|NZ_CP010577.1_5429587_5431090_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|621aa|down_9|NZ_CP010577.1_5431086_5432949_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed
GCF_000940785.1_ASM94078v1	NZ_CP010578	Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 plasmid pBMB232, complete sequence	1	122545-122676	1	CRISPRCasFinder	no	RT,cas14j	RT,cas14j,csa3	Unclear	CTGGTGTTCCTGGTGTTCCTGGTATTCCTG	30	1	4	122623-122646|122623-122646|122623-122646|122623-122646	NZ_CP010578.1_122659-122682|NZ_CP010578.1_122668-122691|NZ_CP010578.1_122677-122700|NZ_CP010578.1_122686-122709	NA	2	2	TypeV	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh,cas14j	NA|129aa|up_6|NZ_CP010578.1_108866_109253_+,NA|504aa|up_5|NZ_CP010578.1_109270_110782_+,NA|309aa|up_0|NZ_CP010578.1_118750_119677_+,NA|88aa|down_1|NZ_CP010578.1_124316_124580_+,NA|95aa|down_2|NZ_CP010578.1_125406_125691_+,NA|132aa|down_3|NZ_CP010578.1_125929_126325_+,NA|65aa|down_8|NZ_CP010578.1_131750_131945_-	NA|105aa|up_9|NZ_CP010578.1_105160_105475_+	COG1846, MarR, Transcriptional regulators [Transcription]	NA|485aa|up_8|NZ_CP010578.1_105495_106950_+	cd02202, CetZ_tubulin-like, Cell-structure-related euryarchaeota tubulin/FtsZ homologs	NA|429aa|up_7|NZ_CP010578.1_107389_108676_+	pfam13814, Replic_Relax, Replication-relaxation	NA|129aa|up_6|NZ_CP010578.1_108866_109253_+	NA	NA|504aa|up_5|NZ_CP010578.1_109270_110782_+	NA	RT|606aa|up_4|NZ_CP010578.1_111373_113191_+	cd01651, RT_G2_intron, RT_G2_intron: Reverse transcriptases (RTs) with group II intron origin	NA|675aa|up_3|NZ_CP010578.1_113294_115319_+	smart00843, Ftsk_gamma, This domain directs oriented DNA translocation and forms a winged helix structure	NA|187aa|up_2|NZ_CP010578.1_115640_116201_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|328aa|up_1|NZ_CP010578.1_117757_118741_+	COG0539, RpsA, Ribosomal protein S1 [Translation, ribosomal structure and biogenesis]	NA|309aa|up_0|NZ_CP010578.1_118750_119677_+	NA	NA|142aa|down_0|NZ_CP010578.1_123508_123934_-	pfam16723, DUF5065, Domain of unknown function (DUF5065)	NA|88aa|down_1|NZ_CP010578.1_124316_124580_+	NA	NA|95aa|down_2|NZ_CP010578.1_125406_125691_+	NA	NA|132aa|down_3|NZ_CP010578.1_125929_126325_+	NA	NA|129aa|down_4|NZ_CP010578.1_126545_126932_+	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|138aa|down_5|NZ_CP010578.1_126918_127332_+	pfam05168, HEPN, HEPN domain	NA|274aa|down_6|NZ_CP010578.1_127816_128638_+	pfam08878, DUF1837, Domain of unknown function (DUF1837)	NA|1051aa|down_7|NZ_CP010578.1_128627_131780_+	COG1204, COG1204, Superfamily II helicase [General function prediction only]	NA|65aa|down_8|NZ_CP010578.1_131750_131945_-	NA	NA|638aa|down_9|NZ_CP010578.1_132851_134765_+	pfam07693, KAP_NTPase, KAP family P-loop domain
GCF_000940785.1_ASM94078v1	NZ_CP010580	Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 plasmid pBMB76, complete sequence	1	53083-53304	1	CRT	no	csa3	csa3	Type I-A	TCTCAGGCTTCGTCTCAG	18	0	0	NA	NA	NA	5	5	Orphan	cas3,cas14k,csa3,WYL,c2c9_V-U4,DinG,RT,DEDDh,cas14j	NA|138aa|up_8|NZ_CP010580.1_48157_48571_+,NA|93aa|up_7|NZ_CP010580.1_48864_49143_-,NA|176aa|up_6|NZ_CP010580.1_49192_49720_-,NA|80aa|up_5|NZ_CP010580.1_49799_50039_-,NA|62aa|up_4|NZ_CP010580.1_50249_50435_-,NA|85aa|up_3|NZ_CP010580.1_50489_50744_-,NA|63aa|up_2|NZ_CP010580.1_50800_50989_-,NA|113aa|up_0|NZ_CP010580.1_52249_52588_-,NA|175aa|down_1|NZ_CP010580.1_55896_56421_-,NA|66aa|down_5|NZ_CP010580.1_58855_59053_-,NA|58aa|down_6|NZ_CP010580.1_59128_59302_-,NA|101aa|down_7|NZ_CP010580.1_59791_60094_-,NA|83aa|down_8|NZ_CP010580.1_60669_60918_-,NA|151aa|down_9|NZ_CP010580.1_60979_61432_-	NA|513aa|up_9|NZ_CP010580.1_46489_48028_+	pfam14284, PcfJ, PcfJ-like protein	NA|138aa|up_8|NZ_CP010580.1_48157_48571_+	NA	NA|93aa|up_7|NZ_CP010580.1_48864_49143_-	NA	NA|176aa|up_6|NZ_CP010580.1_49192_49720_-	NA	NA|80aa|up_5|NZ_CP010580.1_49799_50039_-	NA	NA|62aa|up_4|NZ_CP010580.1_50249_50435_-	NA	NA|85aa|up_3|NZ_CP010580.1_50489_50744_-	NA	NA|63aa|up_2|NZ_CP010580.1_50800_50989_-	NA	NA|209aa|up_1|NZ_CP010580.1_51102_51729_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|113aa|up_0|NZ_CP010580.1_52249_52588_-	NA	NA|135aa|down_0|NZ_CP010580.1_54165_54570_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|175aa|down_1|NZ_CP010580.1_55896_56421_-	NA	NA|376aa|down_2|NZ_CP010580.1_56479_57607_-	pfam00395, SLH, S-layer homology domain	NA|322aa|down_3|NZ_CP010580.1_57633_58599_-	pfam13034, DUF3895, Protein of unknown function (DUF3895)	NA|81aa|down_4|NZ_CP010580.1_58629_58872_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|66aa|down_5|NZ_CP010580.1_58855_59053_-	NA	NA|58aa|down_6|NZ_CP010580.1_59128_59302_-	NA	NA|101aa|down_7|NZ_CP010580.1_59791_60094_-	NA	NA|83aa|down_8|NZ_CP010580.1_60669_60918_-	NA	NA|151aa|down_9|NZ_CP010580.1_60979_61432_-	NA
