assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020165.1_ASM2016v1	NC_010674	Clostridium botulinum B str. Eklund 17B (NRP), complete sequence	1	592165-592270	1	CRISPRCasFinder	no		csx1,cas3,csa3,DEDDh,DinG,RT,WYL	Orphan	AAATTAACTAAATGAAAATTAAAT	24	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,csa3,DEDDh,DinG,RT,WYL	NA|235aa|up_5|NC_010674.1_587694_588399_+,NA|58aa|up_1|NC_010674.1_591380_591554_-,NA|101aa|down_5|NC_010674.1_594252_594555_+,NA|214aa|down_6|NC_010674.1_596219_596861_+,NA|159aa|down_8|NC_010674.1_597873_598350_-,NA|81aa|down_9|NC_010674.1_598614_598857_+	NA|432aa|up_9|NC_010674.1_583125_584421_+	COG1316, LytR, Transcriptional regulator [Transcription]	NA|299aa|up_8|NC_010674.1_584445_585342_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|440aa|up_7|NC_010674.1_585355_586675_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|207aa|up_6|NC_010674.1_586891_587512_+	TIGR01259, ComE_operon_protein_1, comEA protein	NA|235aa|up_5|NC_010674.1_587694_588399_+	NA	NA|126aa|up_4|NC_010674.1_588424_588802_+	pfam06271, RDD, RDD family	NA|451aa|up_3|NC_010674.1_588818_590171_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|386aa|up_2|NC_010674.1_590239_591397_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|58aa|up_1|NC_010674.1_591380_591554_-	NA	NA|125aa|up_0|NC_010674.1_591738_592113_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|63aa|down_0|NC_010674.1_592312_592501_+	COG1476, COG1476, Predicted transcriptional regulators [Transcription]	NA|70aa|down_1|NC_010674.1_592552_592762_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|74aa|down_2|NC_010674.1_592778_593000_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|211aa|down_3|NC_010674.1_593077_593710_+	pfam09681, Phage_rep_org_N, N-terminal phage replisome organizer (Phage_rep_org_N)	NA|96aa|down_4|NC_010674.1_593782_594070_+	TIGR01446, replication_protein, DnaD and phage-associated domain	NA|101aa|down_5|NC_010674.1_594252_594555_+	NA	NA|214aa|down_6|NC_010674.1_596219_596861_+	NA	NA|67aa|down_7|NC_010674.1_597232_597433_-	pfam12841, YvrJ, YvrJ protein family	NA|159aa|down_8|NC_010674.1_597873_598350_-	NA	NA|81aa|down_9|NC_010674.1_598614_598857_+	NA
GCF_000020165.1_ASM2016v1	NC_010674	Clostridium botulinum B str. Eklund 17B (NRP), complete sequence	2	973600-973699	2	CRISPRCasFinder	no		csx1,cas3,csa3,DEDDh,DinG,RT,WYL	Orphan	TTATAATTAATTACAATAAATTATAAT	27	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,csa3,DEDDh,DinG,RT,WYL	NA|111aa|up_8|NC_010674.1_969078_969411_+,NA|52aa|up_7|NC_010674.1_969410_969566_+,NA|184aa|up_6|NC_010674.1_969822_970374_+,NA|216aa|up_5|NC_010674.1_970541_971189_+,NA|52aa|up_4|NC_010674.1_971175_971331_-,NA|59aa|up_2|NC_010674.1_971711_971888_+,NA|134aa|up_0|NC_010674.1_972925_973327_+,NA|346aa|down_3|NC_010674.1_977024_978062_+	NA|484aa|up_9|NC_010674.1_967624_969076_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|111aa|up_8|NC_010674.1_969078_969411_+	NA	NA|52aa|up_7|NC_010674.1_969410_969566_+	NA	NA|184aa|up_6|NC_010674.1_969822_970374_+	NA	NA|216aa|up_5|NC_010674.1_970541_971189_+	NA	NA|52aa|up_4|NC_010674.1_971175_971331_-	NA	NA|71aa|up_3|NC_010674.1_971485_971698_+	pfam10779, XhlA, Haemolysin XhlA	NA|59aa|up_2|NC_010674.1_971711_971888_+	NA	NA|267aa|up_1|NC_010674.1_971933_972734_+	cd06525, GH25_Lyc-like, Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene	NA|134aa|up_0|NC_010674.1_972925_973327_+	NA	NA|67aa|down_0|NC_010674.1_974576_974777_+	pfam10960, Holin_BhlA, BhlA holin family	NA|189aa|down_1|NC_010674.1_975086_975653_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|348aa|down_2|NC_010674.1_975962_977006_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|346aa|down_3|NC_010674.1_977024_978062_+	NA	NA|370aa|down_4|NC_010674.1_978214_979324_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|314aa|down_5|NC_010674.1_979345_980287_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|340aa|down_6|NC_010674.1_980448_981468_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|496aa|down_7|NC_010674.1_981615_983103_+	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional	NA|787aa|down_8|NC_010674.1_983102_985463_+	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|580aa|down_9|NC_010674.1_985979_987719_+	pfam03814, KdpA, Potassium-transporting ATPase A subunit
GCF_000020165.1_ASM2016v1	NC_010674	Clostridium botulinum B str. Eklund 17B (NRP), complete sequence	3	2143426-2144430	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no		csx1,cas3,csa3,DEDDh,DinG,RT,WYL	Orphan	ATTTAAATACATCTCATGTTAAGGTTAATC,ATTTAAATACATCTCATGTTAAGGTTAATC,ATTTAAATACATCTCATGTTAAGGTTAATC	30,30,30	1	1	2143520-2143555	NC_010674.1_1997690-1997725	II-B:II-B:II-B	15,15,14	15	Orphan	csx1,cas3,csa3,DEDDh,DinG,RT,WYL	NA,NA	NA|139aa|up_9|NC_010674.1_2133282_2133699_-	PRK05568, PRK05568, flavodoxin; Provisional	NA|487aa|up_8|NC_010674.1_2133790_2135251_-	cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating glyceraldehyde 3-phosphate dehydrogenase and ALDH family 11	NA|257aa|up_7|NC_010674.1_2135302_2136073_-	cd06184, flavohem_like_fad_nad_binding, FAD_NAD(P)H binding domain of flavohemoglobin	NA|411aa|up_6|NC_010674.1_2136565_2137798_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|111aa|up_5|NC_010674.1_2138070_2138403_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|54aa|up_4|NC_010674.1_2138445_2138607_-	pfam00301, Rubredoxin, Rubredoxin	NA|78aa|up_3|NC_010674.1_2138804_2139038_-	COG4309, COG4309, Uncharacterized conserved protein [Function unknown]	NA|568aa|up_2|NC_010674.1_2139400_2141104_-	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|181aa|up_1|NC_010674.1_2141320_2141863_-	PRK14879, PRK14879, Kae1-associated kinase Bud32	NA|425aa|up_0|NC_010674.1_2142000_2143275_-	cd07563, Peptidase_S41_IRBP, Interphotoreceptor retinoid-binding protein; serine protease family S41	NA|264aa|down_0|NC_010674.1_2146233_2147025_-	COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]	NA|290aa|down_1|NC_010674.1_2147267_2148137_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|243aa|down_2|NC_010674.1_2148333_2149062_+	pfam05857, TraX, TraX protein	NA|404aa|down_3|NC_010674.1_2149302_2150514_-	cd17335, MFS_MFSD6, Major facilitator superfamily domain-containing protein 6	NA|297aa|down_4|NC_010674.1_2150618_2151509_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|312aa|down_5|NC_010674.1_2151519_2152455_-	COG1482, ManA, Phosphomannose isomerase [Carbohydrate transport and metabolism]	NA|534aa|down_6|NC_010674.1_2152632_2154234_-	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|306aa|down_7|NC_010674.1_2154248_2155166_-	pfam08950, DUF1861, Protein of unknown function (DUF1861)	NA|354aa|down_8|NC_010674.1_2155179_2156241_-	cd19974, PBP1_LacI-like, ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors	NA|363aa|down_9|NC_010674.1_2156309_2157398_-	cd18612, GH130_Lin0857-like, Glycoside hydrolase family 130 such as Listeria innocua beta-1,2-mannobiose phosphorylase
GCF_000020165.1_ASM2016v1	NC_010680	Clostridium botulinum B str. Eklund 17B (NRP) plasmid pCLL, complete sequence	1	14087-14213	1	CRISPRCasFinder	no			Orphan	ATGTAGATTTAAGAAATGCAAATTT	25	0	0	NA	NA	NA	2	2	Orphan	csx1,cas3,csa3,DEDDh,DinG,RT,WYL	NA|712aa|up_9|NC_010680.1_3240_5376_+,NA|213aa|up_7|NC_010680.1_5763_6402_+,NA|227aa|up_5|NC_010680.1_8372_9053_+,NA|92aa|up_2|NC_010680.1_11396_11672_+,NA|172aa|down_0|NC_010680.1_14452_14968_+,NA|117aa|down_1|NC_010680.1_14990_15341_+,NA|137aa|down_3|NC_010680.1_15906_16317_+,NA|53aa|down_4|NC_010680.1_16397_16556_+,NA|104aa|down_5|NC_010680.1_16571_16883_+,NA|105aa|down_6|NC_010680.1_17175_17490_+,NA|79aa|down_7|NC_010680.1_18048_18285_+,NA|72aa|down_8|NC_010680.1_18381_18597_+,NA|96aa|down_9|NC_010680.1_18670_18958_+	NA|712aa|up_9|NC_010680.1_3240_5376_+	NA	NA|92aa|up_8|NC_010680.1_5376_5652_+	pfam17332, pXO2-11, Uncharacterized protein pXO2-11	NA|213aa|up_7|NC_010680.1_5763_6402_+	NA	NA|638aa|up_6|NC_010680.1_6457_8371_+	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|227aa|up_5|NC_010680.1_8372_9053_+	NA	NA|390aa|up_4|NC_010680.1_9113_10283_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|271aa|up_3|NC_010680.1_10301_11114_+	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|92aa|up_2|NC_010680.1_11396_11672_+	NA	NA|137aa|up_1|NC_010680.1_11677_12088_+	cd19586, serpin_mimivirus, serpin-like proteins found in mimiviruses	NA|380aa|up_0|NC_010680.1_12101_13241_+	pfam18555, MobL, MobL relaxases	NA|172aa|down_0|NC_010680.1_14452_14968_+	NA	NA|117aa|down_1|NC_010680.1_14990_15341_+	NA	NA|117aa|down_2|NC_010680.1_15354_15705_+	pfam00436, SSB, Single-strand binding protein family	NA|137aa|down_3|NC_010680.1_15906_16317_+	NA	NA|53aa|down_4|NC_010680.1_16397_16556_+	NA	NA|104aa|down_5|NC_010680.1_16571_16883_+	NA	NA|105aa|down_6|NC_010680.1_17175_17490_+	NA	NA|79aa|down_7|NC_010680.1_18048_18285_+	NA	NA|72aa|down_8|NC_010680.1_18381_18597_+	NA	NA|96aa|down_9|NC_010680.1_18670_18958_+	NA
GCF_000020165.1_ASM2016v1	NC_010680	Clostridium botulinum B str. Eklund 17B (NRP) plasmid pCLL, complete sequence	2	47075-47450	2	CRISPRCasFinder	no			Orphan	GGAAAAATGCTTAATGGTTGGATTAATGATAATGGCAACTGGTATT	46	0	0	NA	NA	NA	3	3	Orphan	csx1,cas3,csa3,DEDDh,DinG,RT,WYL	NA|120aa|up_7|NC_010680.1_39430_39790_+,NA|74aa|up_6|NC_010680.1_39833_40055_-,NA|121aa|up_2|NC_010680.1_41746_42109_+,NA|71aa|up_1|NC_010680.1_42119_42332_+,NA|171aa|up_0|NC_010680.1_42428_42941_+,NA	NA|257aa|up_9|NC_010680.1_37527_38298_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|298aa|up_8|NC_010680.1_38544_39438_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|120aa|up_7|NC_010680.1_39430_39790_+	NA	NA|74aa|up_6|NC_010680.1_39833_40055_-	NA	NA|85aa|up_5|NC_010680.1_40260_40515_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|236aa|up_4|NC_010680.1_40569_41277_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|79aa|up_3|NC_010680.1_41437_41674_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|121aa|up_2|NC_010680.1_41746_42109_+	NA	NA|71aa|up_1|NC_010680.1_42119_42332_+	NA	NA|171aa|up_0|NC_010680.1_42428_42941_+	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
