assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000832825.1_ASM83282v1	NZ_CP009600	Bacillus thuringiensis strain HD571 chromosome, complete genome	1	2216829-2216936	1	CRISPRCasFinder	no		cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	Orphan	TATATCAGCGATTTTTTGAATATATC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	NA|99aa|up_3|NZ_CP009600.1_2214517_2214814_+,NA|145aa|down_4|NZ_CP009600.1_2220453_2220888_-,NA|61aa|down_5|NZ_CP009600.1_2220903_2221086_-	NA|230aa|up_9|NZ_CP009600.1_2209534_2210224_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|485aa|up_8|NZ_CP009600.1_2210225_2211680_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|227aa|up_7|NZ_CP009600.1_2211829_2212510_+	sd00045, ANK, ankyrin repeats	NA|309aa|up_6|NZ_CP009600.1_2212589_2213516_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|133aa|up_5|NZ_CP009600.1_2213644_2214043_+	PRK13955, mscL, large conductance mechanosensitive channel protein MscL	NA|91aa|up_4|NZ_CP009600.1_2214094_2214367_-	pfam13055, DUF3917, Protein of unknown function (DUF3917)	NA|99aa|up_3|NZ_CP009600.1_2214517_2214814_+	NA	NA|79aa|up_2|NZ_CP009600.1_2214895_2215132_+	pfam13133, DUF3949, Protein of unknown function (DUF3949)	NA|121aa|up_1|NZ_CP009600.1_2215133_2215496_+	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|333aa|up_0|NZ_CP009600.1_2215531_2216530_-	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|339aa|down_0|NZ_CP009600.1_2217140_2218157_-	COG4851, CamS, Protein involved in sex pheromone biosynthesis [General function prediction only]	NA|109aa|down_1|NZ_CP009600.1_2218274_2218601_-	pfam11009, DUF2847, Protein of unknown function (DUF2847)	NA|186aa|down_2|NZ_CP009600.1_2218600_2219158_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|372aa|down_3|NZ_CP009600.1_2219300_2220416_+	COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism]	NA|145aa|down_4|NZ_CP009600.1_2220453_2220888_-	NA	NA|61aa|down_5|NZ_CP009600.1_2220903_2221086_-	NA	NA|135aa|down_6|NZ_CP009600.1_2221082_2221487_-	pfam06713, bPH_4, Bacterial PH domain	NA|185aa|down_7|NZ_CP009600.1_2221578_2222133_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|437aa|down_8|NZ_CP009600.1_2222331_2223642_-	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|373aa|down_9|NZ_CP009600.1_2223894_2225013_-	PRK07188, PRK07188, nicotinate phosphoribosyltransferase; Provisional
GCF_000832825.1_ASM83282v1	NZ_CP009600	Bacillus thuringiensis strain HD571 chromosome, complete genome	2	2389065-2389151	2	CRISPRCasFinder	no		cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	Orphan	GATATATCTTAAAAATCGCTGATATA	26	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	NA,NA	NA|482aa|up_9|NZ_CP009600.1_2377917_2379363_-	PRK03640, PRK03640, o-succinylbenzoate--CoA ligase	NA|273aa|up_8|NZ_CP009600.1_2379593_2380412_-	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|271aa|up_7|NZ_CP009600.1_2380481_2381294_-	TIGR03695, menH_SHCHC, 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase	NA|585aa|up_6|NZ_CP009600.1_2381290_2383045_-	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|465aa|up_5|NZ_CP009600.1_2383041_2384436_-	COG1169, MenF, Isochorismate synthase [Coenzyme metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|up_4|NZ_CP009600.1_2384630_2385584_+	PRK06080, PRK06080, 1,4-dihydroxy-2-naphthoate octaprenyltransferase; Validated	NA|244aa|up_3|NZ_CP009600.1_2385693_2386425_+	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|67aa|up_2|NZ_CP009600.1_2386469_2386670_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|261aa|up_1|NZ_CP009600.1_2386996_2387779_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|241aa|up_0|NZ_CP009600.1_2388057_2388780_+	cd00519, Lipase_3, Lipase (class 3)	NA|803aa|down_0|NZ_CP009600.1_2389206_2391615_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|477aa|down_1|NZ_CP009600.1_2391633_2393064_-	PRK00654, glgA, glycogen synthase GlgA	NA|345aa|down_2|NZ_CP009600.1_2393176_2394211_-	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|377aa|down_3|NZ_CP009600.1_2394229_2395360_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|646aa|down_4|NZ_CP009600.1_2395307_2397245_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|202aa|down_5|NZ_CP009600.1_2397713_2398319_+	pfam12389, Peptidase_M73, Camelysin metallo-endopeptidase	NA|1408aa|down_6|NZ_CP009600.1_2398351_2402575_+	cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family domain in Vpr-like proteins	NA|315aa|down_7|NZ_CP009600.1_2402746_2403691_-	PRK00066, ldh, L-lactate dehydrogenase; Reviewed	NA|227aa|down_8|NZ_CP009600.1_2411601_2412282_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|332aa|down_9|NZ_CP009600.1_2412377_2413373_+	pfam07885, Ion_trans_2, Ion channel
GCF_000832825.1_ASM83282v1	NZ_CP009600	Bacillus thuringiensis strain HD571 chromosome, complete genome	3	2499809-2499931	3	CRISPRCasFinder	no		cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	Orphan	TTAAACAAACGTTTGATTAACTCCCTATTTTTCTTTGTTCAC	42	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	NA|115aa|up_4|NZ_CP009600.1_2497165_2497510_-,NA|130aa|down_1|NZ_CP009600.1_2500884_2501274_-,NA|64aa|down_2|NZ_CP009600.1_2501598_2501790_-,NA|77aa|down_3|NZ_CP009600.1_2501805_2502036_-,NA|176aa|down_4|NZ_CP009600.1_2502091_2502619_-,NA|62aa|down_9|NZ_CP009600.1_2505960_2506146_-	NA|262aa|up_9|NZ_CP009600.1_2492209_2492995_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP009600.1_2493234_2494041_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP009600.1_2494113_2494926_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP009600.1_2494949_2495615_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP009600.1_2495607_2496633_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP009600.1_2497165_2497510_-	NA	NA|100aa|up_3|NZ_CP009600.1_2497664_2497964_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP009600.1_2497976_2498321_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP009600.1_2498811_2499195_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP009600.1_2499236_2499602_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|80aa|down_0|NZ_CP009600.1_2500132_2500372_-	pfam13073, DUF3937, Protein of unknown function (DUF3937)	NA|130aa|down_1|NZ_CP009600.1_2500884_2501274_-	NA	NA|64aa|down_2|NZ_CP009600.1_2501598_2501790_-	NA	NA|77aa|down_3|NZ_CP009600.1_2501805_2502036_-	NA	NA|176aa|down_4|NZ_CP009600.1_2502091_2502619_-	NA	NA|216aa|down_5|NZ_CP009600.1_2502762_2503410_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_6|NZ_CP009600.1_2503465_2504479_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|378aa|down_7|NZ_CP009600.1_2504501_2505635_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_8|NZ_CP009600.1_2505698_2505947_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_9|NZ_CP009600.1_2505960_2506146_-	NA
GCF_000832825.1_ASM83282v1	NZ_CP009600	Bacillus thuringiensis strain HD571 chromosome, complete genome	4	4211280-4211388	4	CRISPRCasFinder	no		cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	Orphan	TGTATGATTACCTTCCGCATGAGAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas14j,DEDDh,RT,c2c9_V-U4,cas14k,DinG	NA|124aa|up_3|NZ_CP009600.1_4207817_4208189_+,NA	NA|515aa|up_9|NZ_CP009600.1_4200350_4201895_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|415aa|up_8|NZ_CP009600.1_4201976_4203221_+	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]	NA|609aa|up_7|NZ_CP009600.1_4203271_4205098_+	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|298aa|up_6|NZ_CP009600.1_4205620_4206514_-	pfam13743, Thioredoxin_5, Thioredoxin	NA|133aa|up_5|NZ_CP009600.1_4206513_4206912_-	cd14772, TrHb2_Bs-trHb-like_O, Truncated hemoglobins, group 2 (O); Bacillus subtilis TrHb like	NA|193aa|up_4|NZ_CP009600.1_4207092_4207671_-	cd07762, CYTH-like_Pase_1, Uncharacterized subgroup 1 of the CYTH-like superfamily	NA|124aa|up_3|NZ_CP009600.1_4207817_4208189_+	NA	NA|213aa|up_2|NZ_CP009600.1_4208219_4208858_+	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|266aa|up_1|NZ_CP009600.1_4208876_4209674_+	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|298aa|up_0|NZ_CP009600.1_4209689_4210583_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|247aa|down_0|NZ_CP009600.1_4212161_4212902_-	PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE; Provisional	NA|387aa|down_1|NZ_CP009600.1_4212976_4214137_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|309aa|down_2|NZ_CP009600.1_4214255_4215182_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|251aa|down_3|NZ_CP009600.1_4215195_4215948_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|282aa|down_4|NZ_CP009600.1_4216162_4217008_-	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|302aa|down_5|NZ_CP009600.1_4217130_4218036_-	pfam18573, BclA_C, BclA C-terminal domain	NA|366aa|down_6|NZ_CP009600.1_4218201_4219299_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|230aa|down_7|NZ_CP009600.1_4219510_4220200_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|229aa|down_8|NZ_CP009600.1_4220196_4220883_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|227aa|down_9|NZ_CP009600.1_4220897_4221578_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family
