assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003626955.1_ASM362695v1	CP032609	Bacillus thuringiensis strain QZL38 plasmid p.2, complete sequence	1	234632-234863	1	CRISPRCasFinder	no	RT,cas7,cas2,cas1,cas4,cas8c,cas5,cas3,cas14j	RT,cas14j,cas5,cas7b,cas8b1,cas6,csa3,cas7,cas2,cas1,cas4,cas8c,cas3,cas14k	Type I-C,Type I-U, Type I-U?	ATTTCAATCCACGCACCTATATAAGGTGCGACA	33	0	0	NA	NA	I-C	3	3	TypeI-C,TypeI-U,TypeV,TypeI-U?	csa3,cas3,DinG,c2c10_CAS-V-U3,cas14j,WYL,cas14k,DEDDh,c2c9_V-U4,RT,Cas14u_CAS-V,cas5,cas7b,cas8b1,cas6,cas7,cas2,cas1,cas4,cas8c	NA|210aa|up_9|CP032609.1_222388_223018_-,NA|66aa|up_8|CP032609.1_223094_223292_-,NA|240aa|up_7|CP032609.1_223354_224074_-,NA|241aa|up_6|CP032609.1_224090_224813_-,NA|76aa|up_3|CP032609.1_229954_230182_-,NA|138aa|up_1|CP032609.1_231550_231964_-,NA|345aa|down_7|CP032609.1_246421_247456_-,NA|255aa|down_8|CP032609.1_247688_248453_-,NA|576aa|down_9|CP032609.1_249994_251722_-	NA|210aa|up_9|CP032609.1_222388_223018_-	NA	NA|66aa|up_8|CP032609.1_223094_223292_-	NA	NA|240aa|up_7|CP032609.1_223354_224074_-	NA	NA|241aa|up_6|CP032609.1_224090_224813_-	NA	RT|604aa|up_5|CP032609.1_224914_226726_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|862aa|up_4|CP032609.1_227297_229883_-	smart00487, DEXDc, DEAD-like helicases superfamily	NA|76aa|up_3|CP032609.1_229954_230182_-	NA	NA|364aa|up_2|CP032609.1_230197_231289_-	cd02440, AdoMet_MTases, S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I;  AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy)	NA|138aa|up_1|CP032609.1_231550_231964_-	NA	cas7|101aa|up_0|CP032609.1_234210_234513_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas2|97aa|down_0|CP032609.1_235031_235322_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|CP032609.1_235331_236363_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|220aa|down_2|CP032609.1_236359_237019_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|287aa|down_3|CP032609.1_237008_237869_-	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas8c|640aa|down_4|CP032609.1_237871_239791_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|240aa|down_5|CP032609.1_239791_240511_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|810aa|down_6|CP032609.1_240679_243109_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|345aa|down_7|CP032609.1_246421_247456_-	NA	NA|255aa|down_8|CP032609.1_247688_248453_-	NA	NA|576aa|down_9|CP032609.1_249994_251722_-	NA
GCA_003626955.1_ASM362695v1	CP032609	Bacillus thuringiensis strain QZL38 plasmid p.2, complete sequence	2	243221-243977	1,1,2	PILER-CR,CRT,CRISPRCasFinder	no	RT,cas7,cas2,cas1,cas4,cas8c,cas5,cas3,cas14j	RT,cas14j,cas5,cas7b,cas8b1,cas6,csa3,cas7,cas2,cas1,cas4,cas8c,cas3,cas14k	Type I-C,Type I-U, Type I-U?	TAATTTCAATCCAGGCACCTTTTAAGGTGCGACCTAC,ATTTCAATCCACGCACCTATNTAAGGTGCGAC,AATTTCAATCCACGCACCTATATAAGGTGCGACA	37,32,34	0	0	NA	NA	NA:I-C:I-C	2,11,10	11	TypeI-C,TypeI-U,TypeV,TypeI-U?	csa3,cas3,DinG,c2c10_CAS-V-U3,cas14j,WYL,cas14k,DEDDh,c2c9_V-U4,RT,Cas14u_CAS-V,cas5,cas7b,cas8b1,cas6,cas7,cas2,cas1,cas4,cas8c	NA|138aa|up_8|CP032609.1_231550_231964_-,NA|345aa|down_0|CP032609.1_246421_247456_-,NA|255aa|down_1|CP032609.1_247688_248453_-,NA|576aa|down_2|CP032609.1_249994_251722_-,NA|69aa|down_4|CP032609.1_253808_254015_-,NA|570aa|down_5|CP032609.1_254294_256004_+,NA|158aa|down_7|CP032609.1_258124_258598_-	NA|364aa|up_9|CP032609.1_230197_231289_-	cd02440, AdoMet_MTases, S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I;  AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy)	NA|138aa|up_8|CP032609.1_231550_231964_-	NA	cas7|101aa|up_7|CP032609.1_234210_234513_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas2|97aa|up_6|CP032609.1_235031_235322_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|up_5|CP032609.1_235331_236363_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|220aa|up_4|CP032609.1_236359_237019_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|287aa|up_3|CP032609.1_237008_237869_-	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas8c|640aa|up_2|CP032609.1_237871_239791_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|240aa|up_1|CP032609.1_239791_240511_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|810aa|up_0|CP032609.1_240679_243109_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|345aa|down_0|CP032609.1_246421_247456_-	NA	NA|255aa|down_1|CP032609.1_247688_248453_-	NA	NA|576aa|down_2|CP032609.1_249994_251722_-	NA	cas14j|378aa|down_3|CP032609.1_252049_253183_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|69aa|down_4|CP032609.1_253808_254015_-	NA	NA|570aa|down_5|CP032609.1_254294_256004_+	NA	NA|141aa|down_6|CP032609.1_257440_257863_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|158aa|down_7|CP032609.1_258124_258598_-	NA	NA|711aa|down_8|CP032609.1_258569_260702_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|457aa|down_9|CP032609.1_260689_262060_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain
GCA_003626955.1_ASM362695v1	CP032608	Bacillus thuringiensis strain QZL38 chromosome, complete genome	2	2735664-2735787	2	CRISPRCasFinder	no		csa3,cas3,DinG,c2c10_CAS-V-U3,cas14j,WYL,cas14k,DEDDh,c2c9_V-U4	Orphan	AAACGTTTGTTTAAGCTGTATGTTCGGTAGGAAGAAACCGATG	43	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,c2c10_CAS-V-U3,cas14j,WYL,cas14k,DEDDh,c2c9_V-U4,RT,Cas14u_CAS-V,cas5,cas7b,cas8b1,cas6,cas7,cas2,cas1,cas4,cas8c	NA|62aa|up_5|CP032608.1_2731559_2731745_+,NA|176aa|up_0|CP032608.1_2735096_2735624_+,NA|115aa|down_4|CP032608.1_2737891_2738236_+	NA|794aa|up_9|CP032608.1_2724781_2727163_+	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]	NA|391aa|up_8|CP032608.1_2727184_2728357_+	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|601aa|up_7|CP032608.1_2728808_2730611_+	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|240aa|up_6|CP032608.1_2730726_2731446_+	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|62aa|up_5|CP032608.1_2731559_2731745_+	NA	NA|83aa|up_4|CP032608.1_2731758_2732007_+	pfam07875, Coat_F, Coat F domain	NA|378aa|up_3|CP032608.1_2732070_2733204_+	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|338aa|up_2|CP032608.1_2733226_2734240_+	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|216aa|up_1|CP032608.1_2734305_2734953_-	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|176aa|up_0|CP032608.1_2735096_2735624_+	NA	NA|122aa|down_0|CP032608.1_2736036_2736402_+	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|128aa|down_1|CP032608.1_2736443_2736827_+	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|115aa|down_2|CP032608.1_2737082_2737427_+	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|100aa|down_3|CP032608.1_2737439_2737739_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|down_4|CP032608.1_2737891_2738236_+	NA	NA|342aa|down_5|CP032608.1_2738646_2739672_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|222aa|down_6|CP032608.1_2739664_2740330_+	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|271aa|down_7|CP032608.1_2740353_2741166_+	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|269aa|down_8|CP032608.1_2741236_2742043_+	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|262aa|down_9|CP032608.1_2742281_2743067_+	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]
