assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013343075.1_ASM1334307v1	NZ_CP054568	Bacillus thuringiensis strain FDAARGOS_791 chromosome, complete genome	1	1639756-1639863	1	CRISPRCasFinder	no		csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	Orphan	TATATCAGCGATTTTTTGAATATATC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	NA|99aa|up_3|NZ_CP054568.1_1637444_1637741_+,NA|145aa|down_4|NZ_CP054568.1_1643380_1643815_-,NA|61aa|down_5|NZ_CP054568.1_1643830_1644013_-	NA|230aa|up_9|NZ_CP054568.1_1632461_1633151_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|485aa|up_8|NZ_CP054568.1_1633152_1634607_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|227aa|up_7|NZ_CP054568.1_1634756_1635437_+	sd00045, ANK, ankyrin repeats	NA|309aa|up_6|NZ_CP054568.1_1635516_1636443_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|133aa|up_5|NZ_CP054568.1_1636571_1636970_+	PRK13955, mscL, large conductance mechanosensitive channel protein MscL	NA|91aa|up_4|NZ_CP054568.1_1637021_1637294_-	pfam13055, DUF3917, Protein of unknown function (DUF3917)	NA|99aa|up_3|NZ_CP054568.1_1637444_1637741_+	NA	NA|79aa|up_2|NZ_CP054568.1_1637822_1638059_+	pfam13133, DUF3949, Protein of unknown function (DUF3949)	NA|121aa|up_1|NZ_CP054568.1_1638060_1638423_+	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|333aa|up_0|NZ_CP054568.1_1638458_1639457_-	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|339aa|down_0|NZ_CP054568.1_1640067_1641084_-	COG4851, CamS, Protein involved in sex pheromone biosynthesis [General function prediction only]	NA|109aa|down_1|NZ_CP054568.1_1641201_1641528_-	pfam11009, DUF2847, Protein of unknown function (DUF2847)	NA|171aa|down_2|NZ_CP054568.1_1641527_1642040_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|372aa|down_3|NZ_CP054568.1_1642227_1643343_+	COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism]	NA|145aa|down_4|NZ_CP054568.1_1643380_1643815_-	NA	NA|61aa|down_5|NZ_CP054568.1_1643830_1644013_-	NA	NA|135aa|down_6|NZ_CP054568.1_1644009_1644414_-	pfam06713, bPH_4, Bacterial PH domain	NA|185aa|down_7|NZ_CP054568.1_1644505_1645060_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|437aa|down_8|NZ_CP054568.1_1645258_1646569_-	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|373aa|down_9|NZ_CP054568.1_1646821_1647940_-	PRK07188, PRK07188, nicotinate phosphoribosyltransferase; Provisional
GCF_013343075.1_ASM1334307v1	NZ_CP054568	Bacillus thuringiensis strain FDAARGOS_791 chromosome, complete genome	2	1812003-1812089	2	CRISPRCasFinder	no		csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	Orphan	GATATATCTTAAAAATCGCTGATATA	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	NA,NA	NA|482aa|up_9|NZ_CP054568.1_1800855_1802301_-	PRK03640, PRK03640, o-succinylbenzoate--CoA ligase	NA|273aa|up_8|NZ_CP054568.1_1802531_1803350_-	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|271aa|up_7|NZ_CP054568.1_1803419_1804232_-	TIGR03695, menH_SHCHC, 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase	NA|585aa|up_6|NZ_CP054568.1_1804228_1805983_-	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|465aa|up_5|NZ_CP054568.1_1805979_1807374_-	COG1169, MenF, Isochorismate synthase [Coenzyme metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|up_4|NZ_CP054568.1_1807568_1808522_+	PRK06080, PRK06080, 1,4-dihydroxy-2-naphthoate octaprenyltransferase; Validated	NA|244aa|up_3|NZ_CP054568.1_1808631_1809363_+	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|67aa|up_2|NZ_CP054568.1_1809407_1809608_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|261aa|up_1|NZ_CP054568.1_1809934_1810717_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|241aa|up_0|NZ_CP054568.1_1810995_1811718_+	cd00519, Lipase_3, Lipase (class 3)	NA|803aa|down_0|NZ_CP054568.1_1812144_1814553_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|477aa|down_1|NZ_CP054568.1_1814571_1816002_-	PRK00654, glgA, glycogen synthase GlgA	NA|345aa|down_2|NZ_CP054568.1_1816114_1817149_-	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|377aa|down_3|NZ_CP054568.1_1817167_1818298_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|646aa|down_4|NZ_CP054568.1_1818245_1820183_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|202aa|down_5|NZ_CP054568.1_1820651_1821257_+	pfam12389, Peptidase_M73, Camelysin metallo-endopeptidase	NA|1408aa|down_6|NZ_CP054568.1_1821289_1825513_+	cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family domain in Vpr-like proteins	NA|315aa|down_7|NZ_CP054568.1_1825684_1826629_-	PRK00066, ldh, L-lactate dehydrogenase; Reviewed	NA|227aa|down_8|NZ_CP054568.1_1834539_1835220_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|332aa|down_9|NZ_CP054568.1_1835315_1836311_+	pfam07885, Ion_trans_2, Ion channel
GCF_013343075.1_ASM1334307v1	NZ_CP054568	Bacillus thuringiensis strain FDAARGOS_791 chromosome, complete genome	3	1922747-1922869	3	CRISPRCasFinder	no		csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	Orphan	TTAAACAAACGTTTGATTAACTCCCTATTTTTCTTTGTTCAC	42	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	NA|115aa|up_4|NZ_CP054568.1_1920103_1920448_-,NA|130aa|down_1|NZ_CP054568.1_1923822_1924212_-,NA|64aa|down_2|NZ_CP054568.1_1924536_1924728_-,NA|77aa|down_3|NZ_CP054568.1_1924743_1924974_-,NA|176aa|down_4|NZ_CP054568.1_1925029_1925557_-,NA|62aa|down_9|NZ_CP054568.1_1928898_1929084_-	NA|262aa|up_9|NZ_CP054568.1_1915147_1915933_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP054568.1_1916172_1916979_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP054568.1_1917051_1917864_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP054568.1_1917887_1918553_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP054568.1_1918545_1919571_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP054568.1_1920103_1920448_-	NA	NA|103aa|up_3|NZ_CP054568.1_1920602_1920911_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP054568.1_1920914_1921259_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP054568.1_1921749_1922133_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP054568.1_1922174_1922540_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|80aa|down_0|NZ_CP054568.1_1923070_1923310_-	pfam13073, DUF3937, Protein of unknown function (DUF3937)	NA|130aa|down_1|NZ_CP054568.1_1923822_1924212_-	NA	NA|64aa|down_2|NZ_CP054568.1_1924536_1924728_-	NA	NA|77aa|down_3|NZ_CP054568.1_1924743_1924974_-	NA	NA|176aa|down_4|NZ_CP054568.1_1925029_1925557_-	NA	NA|216aa|down_5|NZ_CP054568.1_1925700_1926348_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_6|NZ_CP054568.1_1926403_1927417_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|378aa|down_7|NZ_CP054568.1_1927439_1928573_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_8|NZ_CP054568.1_1928636_1928885_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_9|NZ_CP054568.1_1928898_1929084_-	NA
GCF_013343075.1_ASM1334307v1	NZ_CP054568	Bacillus thuringiensis strain FDAARGOS_791 chromosome, complete genome	4	3634216-3634324	4	CRISPRCasFinder	no		csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	Orphan	TGTATGATTACCTTCCGCATGAGAA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,DEDDh,RT,cas3,c2c9_V-U4,cas14k,DinG	NA|58aa|up_7|NZ_CP054568.1_3628153_3628327_-,NA|124aa|up_3|NZ_CP054568.1_3630753_3631125_+,NA	NA|415aa|up_9|NZ_CP054568.1_3624912_3626157_+	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]	NA|609aa|up_8|NZ_CP054568.1_3626207_3628034_+	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|58aa|up_7|NZ_CP054568.1_3628153_3628327_-	NA	NA|298aa|up_6|NZ_CP054568.1_3628556_3629450_-	pfam13743, Thioredoxin_5, Thioredoxin	NA|133aa|up_5|NZ_CP054568.1_3629449_3629848_-	cd14772, TrHb2_Bs-trHb-like_O, Truncated hemoglobins, group 2 (O); Bacillus subtilis TrHb like	NA|193aa|up_4|NZ_CP054568.1_3630028_3630607_-	cd07762, CYTH-like_Pase_1, Uncharacterized subgroup 1 of the CYTH-like superfamily	NA|124aa|up_3|NZ_CP054568.1_3630753_3631125_+	NA	NA|213aa|up_2|NZ_CP054568.1_3631155_3631794_+	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|266aa|up_1|NZ_CP054568.1_3631812_3632610_+	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|298aa|up_0|NZ_CP054568.1_3632625_3633519_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|247aa|down_0|NZ_CP054568.1_3635097_3635838_-	PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE; Provisional	NA|387aa|down_1|NZ_CP054568.1_3635912_3637073_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|309aa|down_2|NZ_CP054568.1_3637191_3638118_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|251aa|down_3|NZ_CP054568.1_3638131_3638884_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|282aa|down_4|NZ_CP054568.1_3639098_3639944_-	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|302aa|down_5|NZ_CP054568.1_3640066_3640972_-	pfam18573, BclA_C, BclA C-terminal domain	NA|366aa|down_6|NZ_CP054568.1_3641137_3642235_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|230aa|down_7|NZ_CP054568.1_3642446_3643136_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|229aa|down_8|NZ_CP054568.1_3643132_3643819_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|227aa|down_9|NZ_CP054568.1_3643833_3644514_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family
