assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001688665.2_ASM168866v2	NZ_CP015399	Lachnoclostridium sp. YL32, complete genome	1	2080264-2081576	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	Orphan	ATTTCAATCCACAAGGCCCTCGCGGGCCTCGAC,ATTTCAATCCACAAGGCCCTCGCGGGCCTCGAC,ATTTCAATCCACAAGGCCCTCGCGGGCCTCGAC	33,33,33	0	0	NA	NA	NA:NA:NA	19,19,19	19	Orphan	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	NA,NA|235aa|down_1|NZ_CP015399.2_2086485_2087190_-,NA|54aa|down_8|NZ_CP015399.2_2095436_2095598_-	NA|430aa|up_9|NZ_CP015399.2_2065535_2066825_-	pfam12897, Aminotran_MocR, Alanine-glyoxylate amino-transferase	NA|483aa|up_8|NZ_CP015399.2_2067026_2068475_-	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|508aa|up_7|NZ_CP015399.2_2068467_2069991_-	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|98aa|up_6|NZ_CP015399.2_2070004_2070298_-	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|444aa|up_5|NZ_CP015399.2_2070331_2071663_-	PRK05159, aspC, aspartyl-tRNA synthetase; Provisional	NA|172aa|up_4|NZ_CP015399.2_2072351_2072867_+	cd07908, Mn_catalase_like, Manganese catalase-like protein, ferritin-like diiron-binding domain	NA|485aa|up_3|NZ_CP015399.2_2072873_2074328_-	cd02808, GltS_FMN, Glutamate synthase (GltS) FMN-binding domain	NA|454aa|up_2|NZ_CP015399.2_2074501_2075863_+	cd13144, MATE_like_4, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|263aa|up_1|NZ_CP015399.2_2076773_2077562_-	COG2859, COG2859, Uncharacterized protein conserved in bacteria [Function unknown]	NA|439aa|up_0|NZ_CP015399.2_2078586_2079903_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|1459aa|down_0|NZ_CP015399.2_2081783_2086160_-	COG1201, Lhr, Lhr-like helicases [General function prediction only]	NA|235aa|down_1|NZ_CP015399.2_2086485_2087190_-	NA	NA|547aa|down_2|NZ_CP015399.2_2087507_2089148_-	pfam00920, ILVD_EDD, Dehydratase family	NA|456aa|down_3|NZ_CP015399.2_2089131_2090499_-	COG3775, GatC, Phosphotransferase system, galactitol-specific IIC component [Carbohydrate transport and metabolism]	NA|93aa|down_4|NZ_CP015399.2_2090565_2090844_-	cd05566, PTS_IIB_galactitol, PTS_IIB_galactitol: subunit IIB of enzyme II (EII) of the galactitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS)	NA|158aa|down_5|NZ_CP015399.2_2090874_2091348_-	pfam00359, PTS_EIIA_2, Phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 2	NA|213aa|down_6|NZ_CP015399.2_2091344_2091983_-	cd00452, KDPG_aldolase, KDPG and KHG aldolase	NA|987aa|down_7|NZ_CP015399.2_2092029_2094990_-	COG1221, PspF, Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain [Transcription / Signal transduction mechanisms]	NA|54aa|down_8|NZ_CP015399.2_2095436_2095598_-	NA	NA|306aa|down_9|NZ_CP015399.2_2095947_2096865_-	cd12826, EcCorA_ZntB-like_u1, uncharacterized bacterial subfamily of the Escherichia coli CorA-Salmonella typhimurium ZntB family
GCF_001688665.2_ASM168866v2	NZ_CP015399	Lachnoclostridium sp. YL32, complete genome	2	3047872-3048834	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	 Type I-U?,Type I-U,Type I-C	ATTTCAATCCACTCCACCGCGAGGGTGGAGAC,ATTTCAATCCACTCCACCGCGAGGGTGGAGAC,ATTTCAATCCACTCCACCGCGAGGGTGGAGAC	32,32,32	0	0	NA	NA	NA:NA:NA	14,14,14	14	TypeI-U?,TypeI-U,TypeI-C	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	NA|132aa|up_4|NZ_CP015399.2_3044041_3044437_+,NA	NA|129aa|up_9|NZ_CP015399.2_3038401_3038788_-	pfam10990, DUF2809, Protein of unknown function (DUF2809)	NA|469aa|up_8|NZ_CP015399.2_3038808_3040215_-	PRK01096, PRK01096, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|203aa|up_7|NZ_CP015399.2_3040934_3041543_+	COG3601, COG3601, Predicted membrane protein [Function unknown]	NA|264aa|up_6|NZ_CP015399.2_3041609_3042401_+	pfam02633, Creatininase, Creatinine amidohydrolase	NA|454aa|up_5|NZ_CP015399.2_3042576_3043938_-	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|132aa|up_4|NZ_CP015399.2_3044041_3044437_+	NA	NA|106aa|up_3|NZ_CP015399.2_3044487_3044805_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|512aa|up_2|NZ_CP015399.2_3044869_3046405_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|261aa|up_1|NZ_CP015399.2_3046458_3047241_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|161aa|up_0|NZ_CP015399.2_3047224_3047707_-	pfam13565, HTH_32, Homeodomain-like domain	cas2|97aa|down_0|NZ_CP015399.2_3049044_3049335_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_CP015399.2_3049372_3050404_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|221aa|down_2|NZ_CP015399.2_3050400_3051063_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|297aa|down_3|NZ_CP015399.2_3051062_3051953_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|592aa|down_4|NZ_CP015399.2_3051954_3053730_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|220aa|down_5|NZ_CP015399.2_3053726_3054386_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|726aa|down_6|NZ_CP015399.2_3054429_3056607_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|365aa|down_7|NZ_CP015399.2_3058671_3059766_+	pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller	NA|475aa|down_8|NZ_CP015399.2_3059796_3061221_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|465aa|down_9|NZ_CP015399.2_3061204_3062599_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated
GCF_001688665.2_ASM168866v2	NZ_CP015399	Lachnoclostridium sp. YL32, complete genome	3	3902713-3902818	3	CRISPRCasFinder	no		cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	Orphan	CTTAGGAGGCGGACCCTCTATCCT	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	NA|173aa|up_6|NZ_CP015399.2_3898148_3898667_+,NA|179aa|up_5|NZ_CP015399.2_3898774_3899311_+,NA|86aa|up_2|NZ_CP015399.2_3900363_3900621_-,NA	NA|486aa|up_9|NZ_CP015399.2_3894577_3896035_-	pfam05598, DUF772, Transposase domain (DUF772)	NA|138aa|up_8|NZ_CP015399.2_3896210_3896624_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|473aa|up_7|NZ_CP015399.2_3896604_3898023_+	cd07341, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|173aa|up_6|NZ_CP015399.2_3898148_3898667_+	NA	NA|179aa|up_5|NZ_CP015399.2_3898774_3899311_+	NA	NA|141aa|up_4|NZ_CP015399.2_3899566_3899989_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|80aa|up_3|NZ_CP015399.2_3899985_3900225_+	pfam12645, HTH_16, Helix-turn-helix domain	NA|86aa|up_2|NZ_CP015399.2_3900363_3900621_-	NA	NA|67aa|up_1|NZ_CP015399.2_3900764_3900965_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|487aa|up_0|NZ_CP015399.2_3900984_3902445_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|128aa|down_0|NZ_CP015399.2_3903162_3903546_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|271aa|down_1|NZ_CP015399.2_3903549_3904362_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|145aa|down_2|NZ_CP015399.2_3904205_3904640_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|291aa|down_3|NZ_CP015399.2_3904827_3905700_-	pfam12997, DUF3881, Domain of unknown function, E	NA|370aa|down_4|NZ_CP015399.2_3906296_3907406_+	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|127aa|down_5|NZ_CP015399.2_3907474_3907855_+	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|456aa|down_6|NZ_CP015399.2_3907908_3909276_+	PRK00451, PRK00451, aminomethyl-transferring glycine dehydrogenase subunit GcvPA	NA|478aa|down_7|NZ_CP015399.2_3909272_3910706_+	PRK04366, PRK04366, aminomethyl-transferring glycine dehydrogenase subunit GcvPB	NA|343aa|down_8|NZ_CP015399.2_3910831_3911860_+	PRK03822, lplA, lipoate-protein ligase A; Provisional	NA|477aa|down_9|NZ_CP015399.2_3911895_3913326_+	COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes [Energy production and conversion]
GCF_001688665.2_ASM168866v2	NZ_CP015399	Lachnoclostridium sp. YL32, complete genome	4	4903245-4903372	4	CRISPRCasFinder	no	WYL	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	Unclear	ACAGATTTGCTCCAGTCTTTATTTCACTTCTTCTTT	36	1	5	4903281-4903336|4903281-4903336|4903281-4903336|4903281-4903336|4903281-4903336	NZ_CP015399.2_5910075-5910130|NZ_CP015399.2_828995-828940|NZ_CP015399.2_3088527-3088472|NZ_CP015399.2_3794819-3794764|NZ_CP015399.2_6888274-6888219	NA	1	1	Orphan	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	NA|256aa|up_9|NZ_CP015399.2_4894922_4895690_+,NA|195aa|up_8|NZ_CP015399.2_4895732_4896317_+,NA|365aa|up_7|NZ_CP015399.2_4896316_4897411_+,NA|162aa|up_6|NZ_CP015399.2_4897413_4897899_+,NA|127aa|up_5|NZ_CP015399.2_4898046_4898427_+,NA|245aa|up_3|NZ_CP015399.2_4899988_4900723_+,NA|191aa|up_2|NZ_CP015399.2_4900859_4901432_+,NA|198aa|up_1|NZ_CP015399.2_4901715_4902309_+,NA|124aa|down_1|NZ_CP015399.2_4905604_4905976_-,NA|108aa|down_3|NZ_CP015399.2_4908424_4908748_+	NA|256aa|up_9|NZ_CP015399.2_4894922_4895690_+	NA	NA|195aa|up_8|NZ_CP015399.2_4895732_4896317_+	NA	NA|365aa|up_7|NZ_CP015399.2_4896316_4897411_+	NA	NA|162aa|up_6|NZ_CP015399.2_4897413_4897899_+	NA	NA|127aa|up_5|NZ_CP015399.2_4898046_4898427_+	NA	NA|493aa|up_4|NZ_CP015399.2_4898419_4899898_+	pfam14107, DUF4280, Domain of unknown function (DUF4280)	NA|245aa|up_3|NZ_CP015399.2_4899988_4900723_+	NA	NA|191aa|up_2|NZ_CP015399.2_4900859_4901432_+	NA	NA|198aa|up_1|NZ_CP015399.2_4901715_4902309_+	NA	NA|205aa|up_0|NZ_CP015399.2_4902473_4903088_+	pfam14107, DUF4280, Domain of unknown function (DUF4280)	NA|121aa|down_0|NZ_CP015399.2_4905249_4905612_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|124aa|down_1|NZ_CP015399.2_4905604_4905976_-	NA	NA|211aa|down_2|NZ_CP015399.2_4907669_4908302_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|108aa|down_3|NZ_CP015399.2_4908424_4908748_+	NA	NA|153aa|down_4|NZ_CP015399.2_4908758_4909217_+	TIGR02227, Inactive_signal_peptidase_IA	NA|111aa|down_5|NZ_CP015399.2_4909237_4909570_+	pfam10571, UPF0547, Uncharacterized protein family UPF0547	NA|151aa|down_6|NZ_CP015399.2_4910344_4910797_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	WYL|309aa|down_7|NZ_CP015399.2_4910933_4911860_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|318aa|down_8|NZ_CP015399.2_4911873_4912827_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|279aa|down_9|NZ_CP015399.2_4912774_4913611_+	TIGR01247, drrB, daunorubicin resistance ABC transporter membrane protein
GCF_001688665.2_ASM168866v2	NZ_CP015399	Lachnoclostridium sp. YL32, complete genome	5	4944245-4944318	5	CRISPRCasFinder	no		cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	Orphan	GGAGGTTGGCAGGCAACGGAAAAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,DEDDh,Cas14u_CAS-V,cas7b,cas8c,cas5,WYL,cas2,cas1,cas4,cas7,csa3,PD-DExK,DinG	NA|124aa|up_8|NZ_CP015399.2_4935564_4935936_-,NA|160aa|up_7|NZ_CP015399.2_4936189_4936669_+,NA|83aa|up_6|NZ_CP015399.2_4936680_4936929_+,NA|379aa|up_5|NZ_CP015399.2_4936951_4938088_+,NA|145aa|up_2|NZ_CP015399.2_4939869_4940304_+,NA|80aa|down_3|NZ_CP015399.2_4949085_4949325_+	NA|118aa|up_9|NZ_CP015399.2_4935218_4935572_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|124aa|up_8|NZ_CP015399.2_4935564_4935936_-	NA	NA|160aa|up_7|NZ_CP015399.2_4936189_4936669_+	NA	NA|83aa|up_6|NZ_CP015399.2_4936680_4936929_+	NA	NA|379aa|up_5|NZ_CP015399.2_4936951_4938088_+	NA	NA|306aa|up_4|NZ_CP015399.2_4938455_4939373_+	COG0539, RpsA, Ribosomal protein S1 [Translation, ribosomal structure and biogenesis]	NA|72aa|up_3|NZ_CP015399.2_4939479_4939695_+	pfam00313, CSD, 'Cold-shock' DNA-binding domain	NA|145aa|up_2|NZ_CP015399.2_4939869_4940304_+	NA	NA|882aa|up_1|NZ_CP015399.2_4940547_4943193_+	COG5492, COG5492, Bacterial surface proteins containing Ig-like domains [Cell motility and secretion]	NA|241aa|up_0|NZ_CP015399.2_4943325_4944048_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|445aa|down_0|NZ_CP015399.2_4945584_4946919_-	cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Bacillus subtilis yoeA	NA|301aa|down_1|NZ_CP015399.2_4947024_4947927_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|307aa|down_2|NZ_CP015399.2_4948055_4948976_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|80aa|down_3|NZ_CP015399.2_4949085_4949325_+	NA	NA|324aa|down_4|NZ_CP015399.2_4949747_4950719_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|700aa|down_5|NZ_CP015399.2_4950796_4952896_+	cd02690, M28, M28 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|211aa|down_6|NZ_CP015399.2_4952903_4953536_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|362aa|down_7|NZ_CP015399.2_4953696_4954782_+	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional	NA|555aa|down_8|NZ_CP015399.2_4954830_4956495_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|561aa|down_9|NZ_CP015399.2_4956585_4958268_+	PRK06048, PRK06048, acetolactate synthase large subunit
