assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000022765.1_ASM2276v1	NC_012563	Clostridium botulinum A2 str. Kyoto, complete genome	1	1179524-1179875	1,1	CRISPRCasFinder,CRT	no		csa3,DEDDh,DinG,WYL,cas3	Orphan	TGAACATTAACATGAGATGTATTTAAAT,TGAACANTAACATNAGATGTATTTAAAT	28,28	0	0	NA	NA	III-B:III-B	5,5	5	Orphan	csa3,DEDDh,DinG,WYL,cas3	NA,NA|101aa|down_4|NC_012563.1_1189131_1189434_-	NA|587aa|up_9|NC_012563.1_1169425_1171186_-	PRK14479, PRK14479, dihydroxyacetone kinase; Provisional	NA|377aa|up_8|NC_012563.1_1171196_1172327_-	PRK09423, gldA, glycerol dehydrogenase; Provisional	NA|346aa|up_7|NC_012563.1_1172847_1173885_-	pfam10114, PocR, Sensory domain found in PocR	NA|155aa|up_6|NC_012563.1_1174330_1174795_+	PRK09831, PRK09831, GNAT family N-acetyltransferase	NA|148aa|up_5|NC_012563.1_1175001_1175445_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|198aa|up_4|NC_012563.1_1175597_1176191_+	pfam07081, DUF1349, Protein of unknown function (DUF1349)	NA|68aa|up_3|NC_012563.1_1176254_1176458_+	pfam12663, DUF3788, Protein of unknown function (DUF3788)	NA|282aa|up_2|NC_012563.1_1176575_1177421_+	cd19157, AKR_AKR5G1-3, AKR5G family of aldo-keto reductase (AKR)	NA|108aa|up_1|NC_012563.1_1177522_1177846_+	COG3070, TfoX, Regulator of competence-specific genes [Transcription]	NA|285aa|up_0|NC_012563.1_1177891_1178746_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|323aa|down_0|NC_012563.1_1180488_1181457_+	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|336aa|down_1|NC_012563.1_1182461_1183469_-	PRK09358, PRK09358, adenosine deaminase; Provisional	NA|723aa|down_2|NC_012563.1_1184365_1186534_+	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|720aa|down_3|NC_012563.1_1186559_1188719_+	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|101aa|down_4|NC_012563.1_1189131_1189434_-	NA	NA|252aa|down_5|NC_012563.1_1189639_1190395_+	pfam13803, DUF4184, Domain of unknown function (DUF4184)	NA|710aa|down_6|NC_012563.1_1190638_1192768_+	TIGR01389, recQ, ATP-dependent DNA helicase RecQ	NA|144aa|down_7|NC_012563.1_1192878_1193310_+	cd01046, Rubrerythrin_like, rubrerythrin-like, diiron-binding domain	NA|182aa|down_8|NC_012563.1_1193593_1194139_-	PRK09453, PRK09453, phosphodiesterase; Provisional	NA|128aa|down_9|NC_012563.1_1194774_1195158_+	cd17562, REC_CheY4-like, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY4 and similar CheY family proteins
GCF_000022765.1_ASM2276v1	NC_012563	Clostridium botulinum A2 str. Kyoto, complete genome	2	2455808-2456100	2,2,1	CRISPRCasFinder,CRT,PILER-CR	no		csa3,DEDDh,DinG,WYL,cas3	Orphan	ATTTAAATACATCTCATGTTAATGTTCAAC,ATTTAAATACATCTCATGTTAATGTTCAAC,ATTTAAATACATCTCATGTTAATGTTCAAC	30,30,30	0	0	NA	NA	III-B:III-B:III-B	4,4,3	4	Orphan	csa3,DEDDh,DinG,WYL,cas3	NA|98aa|up_2|NC_012563.1_2454548_2454842_-,NA|47aa|up_1|NC_012563.1_2454855_2454996_-,NA|64aa|up_0|NC_012563.1_2454995_2455187_-,NA|300aa|down_7|NC_012563.1_2466098_2466998_+	NA|118aa|up_9|NC_012563.1_2446844_2447198_+	pfam16189, Creatinase_N_2, Creatinase/Prolidase N-terminal domain	NA|469aa|up_8|NC_012563.1_2447528_2448935_+	cd17346, MFS_DtpA_like, Dipeptide and tripeptide permease A (DtpA)-like subfamily of the Major Facilitator Superfamily of transporters	NA|247aa|up_7|NC_012563.1_2449038_2449779_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|297aa|up_6|NC_012563.1_2449785_2450676_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|174aa|up_5|NC_012563.1_2450976_2451498_+	pfam16107, DUF4825, Domain of unknown function (DUF4825)	NA|666aa|up_4|NC_012563.1_2451608_2453606_-	cd02931, ER_like_FMN, Enoate reductase (ER)-like FMN-binding domain	NA|266aa|up_3|NC_012563.1_2453702_2454500_-	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators	NA|98aa|up_2|NC_012563.1_2454548_2454842_-	NA	NA|47aa|up_1|NC_012563.1_2454855_2454996_-	NA	NA|64aa|up_0|NC_012563.1_2454995_2455187_-	NA	NA|312aa|down_0|NC_012563.1_2456641_2457577_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|498aa|down_1|NC_012563.1_2457901_2459395_+	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|151aa|down_2|NC_012563.1_2460444_2460897_+	pfam12638, Staygreen, Staygreen protein	NA|149aa|down_3|NC_012563.1_2460978_2461425_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|349aa|down_4|NC_012563.1_2461592_2462639_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|451aa|down_5|NC_012563.1_2462889_2464242_-	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|448aa|down_6|NC_012563.1_2464317_2465661_-	TIGR03173, pbuX, xanthine permease	NA|300aa|down_7|NC_012563.1_2466098_2466998_+	NA	NA|441aa|down_8|NC_012563.1_2467337_2468660_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|245aa|down_9|NC_012563.1_2468727_2469462_-	cd00180, PKc, Catalytic domain of Protein Kinases
GCF_000022765.1_ASM2276v1	NC_012563	Clostridium botulinum A2 str. Kyoto, complete genome	3	2459891-2460115	2,3,3	PILER-CR,CRISPRCasFinder,CRT	no		csa3,DEDDh,DinG,WYL,cas3	Orphan	ATTTAAATACATCTCATGTTAATGTTCAAC,ATTTAAATACATCTCATGTTAATGTTCAAC,ATTTAAATACATCTCATGTTA	30,30,21	0	0	NA	NA	III-B:III-B:III-B	3,3,3	3	Orphan	csa3,DEDDh,DinG,WYL,cas3	NA|98aa|up_4|NC_012563.1_2454548_2454842_-,NA|47aa|up_3|NC_012563.1_2454855_2454996_-,NA|64aa|up_2|NC_012563.1_2454995_2455187_-,NA|300aa|down_5|NC_012563.1_2466098_2466998_+	NA|247aa|up_9|NC_012563.1_2449038_2449779_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|297aa|up_8|NC_012563.1_2449785_2450676_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|174aa|up_7|NC_012563.1_2450976_2451498_+	pfam16107, DUF4825, Domain of unknown function (DUF4825)	NA|666aa|up_6|NC_012563.1_2451608_2453606_-	cd02931, ER_like_FMN, Enoate reductase (ER)-like FMN-binding domain	NA|266aa|up_5|NC_012563.1_2453702_2454500_-	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators	NA|98aa|up_4|NC_012563.1_2454548_2454842_-	NA	NA|47aa|up_3|NC_012563.1_2454855_2454996_-	NA	NA|64aa|up_2|NC_012563.1_2454995_2455187_-	NA	NA|312aa|up_1|NC_012563.1_2456641_2457577_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|498aa|up_0|NC_012563.1_2457901_2459395_+	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|151aa|down_0|NC_012563.1_2460444_2460897_+	pfam12638, Staygreen, Staygreen protein	NA|149aa|down_1|NC_012563.1_2460978_2461425_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|349aa|down_2|NC_012563.1_2461592_2462639_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|451aa|down_3|NC_012563.1_2462889_2464242_-	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|448aa|down_4|NC_012563.1_2464317_2465661_-	TIGR03173, pbuX, xanthine permease	NA|300aa|down_5|NC_012563.1_2466098_2466998_+	NA	NA|441aa|down_6|NC_012563.1_2467337_2468660_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|245aa|down_7|NC_012563.1_2468727_2469462_-	cd00180, PKc, Catalytic domain of Protein Kinases	NA|258aa|down_8|NC_012563.1_2469948_2470722_-	COG1924, COG1924, Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) [Lipid metabolism]	NA|567aa|down_9|NC_012563.1_2470798_2472499_-	COG3949, COG3949, Uncharacterized membrane protein [Function unknown]
GCF_000022765.1_ASM2276v1	NC_012563	Clostridium botulinum A2 str. Kyoto, complete genome	4	2563318-2563804	4,4,3	CRISPRCasFinder,CRT,PILER-CR	no		csa3,DEDDh,DinG,WYL,cas3	Orphan	ATTTAAATACATCATATGTTACTGTTCAAC,ATTTAAATACATCNTATGTTANTGTTCAAC,ATTTAAATACATCATATGTTACTGTTCAAC	30,30,30	0	0	NA	NA	III-B:III-B:III-B	7,7,3	7	Orphan	csa3,DEDDh,DinG,WYL,cas3	NA|43aa|up_8|NC_012563.1_2555824_2555953_-,NA|68aa|up_7|NC_012563.1_2556005_2556209_-,NA|79aa|up_4|NC_012563.1_2560545_2560782_-,NA|135aa|up_3|NC_012563.1_2560794_2561199_-,NA|57aa|up_2|NC_012563.1_2561558_2561729_-,NA|183aa|up_0|NC_012563.1_2562526_2563075_+,NA|77aa|down_0|NC_012563.1_2564258_2564489_-,NA|183aa|down_1|NC_012563.1_2564846_2565395_-,NA|330aa|down_2|NC_012563.1_2565406_2566396_-,NA|65aa|down_4|NC_012563.1_2567442_2567637_-,NA|99aa|down_6|NC_012563.1_2568175_2568472_-	NA|88aa|up_9|NC_012563.1_2555428_2555692_-	TIGR01642, Splicing_factor_U2AF_59_kDa_subunit, U2 snRNP auxilliary factor, large subunit, splicing factor	NA|43aa|up_8|NC_012563.1_2555824_2555953_-	NA	NA|68aa|up_7|NC_012563.1_2556005_2556209_-	NA	NA|63aa|up_6|NC_012563.1_2557021_2557210_+	pfam12788, YmaF, YmaF family	NA|117aa|up_5|NC_012563.1_2557851_2558202_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|79aa|up_4|NC_012563.1_2560545_2560782_-	NA	NA|135aa|up_3|NC_012563.1_2560794_2561199_-	NA	NA|57aa|up_2|NC_012563.1_2561558_2561729_-	NA	NA|67aa|up_1|NC_012563.1_2561915_2562116_-	COG1476, COG1476, Predicted transcriptional regulators [Transcription]	NA|183aa|up_0|NC_012563.1_2562526_2563075_+	NA	NA|77aa|down_0|NC_012563.1_2564258_2564489_-	NA	NA|183aa|down_1|NC_012563.1_2564846_2565395_-	NA	NA|330aa|down_2|NC_012563.1_2565406_2566396_-	NA	NA|254aa|down_3|NC_012563.1_2566643_2567405_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|65aa|down_4|NC_012563.1_2567442_2567637_-	NA	NA|85aa|down_5|NC_012563.1_2567653_2567908_-	pfam10779, XhlA, Haemolysin XhlA	NA|99aa|down_6|NC_012563.1_2568175_2568472_-	NA	NA|389aa|down_7|NC_012563.1_2568487_2569654_-	pfam12571, DUF3751, Phage tail-collar fibre protein	NA|212aa|down_8|NC_012563.1_2569656_2570292_-	pfam10076, DUF2313, Uncharacterized protein conserved in bacteria (DUF2313)	NA|377aa|down_9|NC_012563.1_2570288_2571419_-	pfam04865, Baseplate_J, Baseplate J-like protein
GCF_000022765.1_ASM2276v1	NC_012563	Clostridium botulinum A2 str. Kyoto, complete genome	5	2655984-2656343	5,5	CRISPRCasFinder,CRT	no		csa3,DEDDh,DinG,WYL,cas3	Orphan	ATTTAAATACATCTCATGTTAATGTTCAAT,ATTTAAATACATCNNATGTTAATGTTCAA	30,29	1	1	2656081-2656116	NC_012563.1_1708437-1708472	III-B:III-B	5,5	5	Orphan	csa3,DEDDh,DinG,WYL,cas3	NA,NA|65aa|down_1|NC_012563.1_2657361_2657556_-,NA|237aa|down_3|NC_012563.1_2657930_2658641_-,NA|156aa|down_4|NC_012563.1_2658864_2659332_-,NA|130aa|down_5|NC_012563.1_2659996_2660386_-,NA|70aa|down_6|NC_012563.1_2660507_2660717_-,NA|102aa|down_7|NC_012563.1_2660732_2661038_-	NA|229aa|up_9|NC_012563.1_2648539_2649226_+	pfam01957, NfeD, NfeD-like C-terminal, partner-binding	NA|417aa|up_8|NC_012563.1_2649268_2650519_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|227aa|up_7|NC_012563.1_2650511_2651192_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|180aa|up_6|NC_012563.1_2651184_2651724_-	cd14360, UBA_NAC_like_bac, UBA-like domain found in uncharacterized bacteria proteins similar to eukaryotic nascent polypeptide-associated complex proteins (NAC)	NA|181aa|up_5|NC_012563.1_2651874_2652417_-	cd14360, UBA_NAC_like_bac, UBA-like domain found in uncharacterized bacteria proteins similar to eukaryotic nascent polypeptide-associated complex proteins (NAC)	NA|37aa|up_4|NC_012563.1_2652658_2652769_-	pfam12841, YvrJ, YvrJ protein family	NA|270aa|up_3|NC_012563.1_2652775_2653585_-	PRK06921, PRK06921, hypothetical protein; Provisional	NA|83aa|up_2|NC_012563.1_2654320_2654569_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|60aa|up_1|NC_012563.1_2654967_2655147_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|138aa|up_0|NC_012563.1_2655205_2655619_+	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|254aa|down_0|NC_012563.1_2656558_2657320_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|65aa|down_1|NC_012563.1_2657361_2657556_-	NA	NA|85aa|down_2|NC_012563.1_2657572_2657827_-	pfam10779, XhlA, Haemolysin XhlA	NA|237aa|down_3|NC_012563.1_2657930_2658641_-	NA	NA|156aa|down_4|NC_012563.1_2658864_2659332_-	NA	NA|130aa|down_5|NC_012563.1_2659996_2660386_-	NA	NA|70aa|down_6|NC_012563.1_2660507_2660717_-	NA	NA|102aa|down_7|NC_012563.1_2660732_2661038_-	NA	NA|415aa|down_8|NC_012563.1_2661053_2662298_-	pfam12571, DUF3751, Phage tail-collar fibre protein	NA|209aa|down_9|NC_012563.1_2662301_2662928_-	pfam10076, DUF2313, Uncharacterized protein conserved in bacteria (DUF2313)
