assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007361935.1_ASM736193v1	NZ_CP041667	Lachnospiraceae bacterium KGMB03038 chromosome, complete genome	1	234998-235097	1	CRISPRCasFinder	no		WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	Orphan	ATGCATAGGGCGGACGCCCGCCAGCG	26	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	NA,NA|123aa|down_1|NZ_CP041667.1_236353_236722_+	NA|287aa|up_9|NZ_CP041667.1_225345_226206_+	cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup 1 of unknown function	NA|164aa|up_8|NZ_CP041667.1_226324_226816_+	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|103aa|up_7|NZ_CP041667.1_226958_227267_+	PRK01005, PRK01005, V-type ATP synthase subunit E; Provisional	NA|672aa|up_6|NZ_CP041667.1_227304_229320_+	PRK05771, PRK05771, V-type ATP synthase subunit I; Validated	NA|162aa|up_5|NZ_CP041667.1_229316_229802_+	PRK06558, PRK06558, V-type ATP synthase subunit K; Validated	NA|198aa|up_4|NZ_CP041667.1_229863_230457_+	COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion]	NA|323aa|up_3|NZ_CP041667.1_230488_231457_+	pfam01992, vATP-synt_AC39, ATP synthase (C/AC39) subunit	NA|106aa|up_2|NZ_CP041667.1_231449_231767_+	PRK01395, PRK01395, V-type ATP synthase subunit F; Provisional	NA|590aa|up_1|NZ_CP041667.1_231791_233561_+	PRK04192, PRK04192, V-type ATP synthase subunit A; Provisional	NA|458aa|up_0|NZ_CP041667.1_233560_234934_+	PRK04196, PRK04196, V-type ATP synthase subunit B; Provisional	NA|220aa|down_0|NZ_CP041667.1_235234_235894_+	PRK00373, PRK00373, V-type ATP synthase subunit D; Reviewed	NA|123aa|down_1|NZ_CP041667.1_236353_236722_+	NA	NA|167aa|down_2|NZ_CP041667.1_236854_237355_+	PRK00131, aroK, shikimate kinase; Reviewed	NA|431aa|down_3|NZ_CP041667.1_237487_238780_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|390aa|down_4|NZ_CP041667.1_238866_240036_+	cd17353, MFS_OFA_like, Oxalate:formate antiporter (OFA) and similar proteins of the Major Facilitator Superfamily of transporters	NA|396aa|down_5|NZ_CP041667.1_240369_241557_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|499aa|down_6|NZ_CP041667.1_241635_243132_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|230aa|down_7|NZ_CP041667.1_243124_243814_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|460aa|down_8|NZ_CP041667.1_243865_245245_+	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|226aa|down_9|NZ_CP041667.1_245258_245936_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]
GCF_007361935.1_ASM736193v1	NZ_CP041667	Lachnospiraceae bacterium KGMB03038 chromosome, complete genome	2	1820057-1820140	2	CRISPRCasFinder	no	cas3,RT	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	Unclear	ATTCTGCTGATAAGTATACTGATA	24	0	0	NA	NA	NA	1	1	Unclear	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	NA|51aa|up_1|NZ_CP041667.1_1819254_1819407_-,NA|135aa|up_0|NZ_CP041667.1_1819633_1820038_-,NA	NA|491aa|up_9|NZ_CP041667.1_1809907_1811380_-	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|137aa|up_8|NZ_CP041667.1_1813563_1813974_-	pfam05164, ZapA, Cell division protein ZapA	NA|336aa|up_7|NZ_CP041667.1_1814040_1815048_-	PRK00080, ruvB, Holliday junction branch migration DNA helicase RuvB	NA|205aa|up_6|NZ_CP041667.1_1815065_1815680_-	PRK00116, ruvA, Holliday junction branch migration protein RuvA	NA|165aa|up_5|NZ_CP041667.1_1815688_1816183_-	pfam07456, Hpre_diP_synt_I, Heptaprenyl diphosphate synthase component I	NA|124aa|up_4|NZ_CP041667.1_1816192_1816564_-	pfam07009, NusG_II, NusG domain II	NA|330aa|up_3|NZ_CP041667.1_1816687_1817677_+	COG1477, ApbE, Membrane-associated lipoprotein involved in thiamine biosynthesis [Coenzyme metabolism]	RT|464aa|up_2|NZ_CP041667.1_1817754_1819146_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|51aa|up_1|NZ_CP041667.1_1819254_1819407_-	NA	NA|135aa|up_0|NZ_CP041667.1_1819633_1820038_-	NA	NA|156aa|down_0|NZ_CP041667.1_1820358_1820826_-	pfam10825, DUF2752, Protein of unknown function (DUF2752)	NA|264aa|down_1|NZ_CP041667.1_1820900_1821692_-	PRK07118, PRK07118, Fe-S cluster domain-containing protein	NA|192aa|down_2|NZ_CP041667.1_1821705_1822281_-	COG4657, RnfA, Predicted NADH:ubiquinone oxidoreductase, subunit RnfA [Energy production and conversion]	NA|239aa|down_3|NZ_CP041667.1_1822293_1823010_-	PRK12405, PRK12405, electron transport complex RsxE subunit; Provisional	NA|195aa|down_4|NZ_CP041667.1_1823022_1823607_-	TIGR01947, Electron_transport_complex_subunit_G, electron transport complex, RnfABCDGE type, G subunit	NA|319aa|down_5|NZ_CP041667.1_1823606_1824563_-	pfam03116, NQR2_RnfD_RnfE, NQR2, RnfD, RnfE family	NA|440aa|down_6|NZ_CP041667.1_1824595_1825915_-	TIGR01945, Electron_transport_complex_subunit_C, electron transport complex, RnfABCDGE type, C subunit	NA|366aa|down_7|NZ_CP041667.1_1826095_1827193_-	PRK06676, rpsA, 30S ribosomal protein S1; Reviewed	NA|286aa|down_8|NZ_CP041667.1_1827173_1828031_-	cd13944, lytB_ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase	NA|220aa|down_9|NZ_CP041667.1_1828042_1828702_-	PRK00023, cmk, (d)CMP kinase
GCF_007361935.1_ASM736193v1	NZ_CP041667	Lachnospiraceae bacterium KGMB03038 chromosome, complete genome	3	1896563-1896682	3	CRISPRCasFinder	no		WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	Orphan	ACAGCTGCCTGTGGATCATCCTTCTCCTGTGCTGCTGCGGCGGCT	45	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	NA,NA|165aa|down_0|NZ_CP041667.1_1897007_1897502_+,NA|92aa|down_1|NZ_CP041667.1_1897494_1897770_+,NA|169aa|down_3|NZ_CP041667.1_1898333_1898840_-	NA|200aa|up_9|NZ_CP041667.1_1889936_1890536_-	TIGR02227, Inactive_signal_peptidase_IA	NA|116aa|up_8|NZ_CP041667.1_1890598_1890946_-	pfam01245, Ribosomal_L19, Ribosomal protein L19	NA|245aa|up_7|NZ_CP041667.1_1891044_1891779_-	PRK00026, trmD, tRNA (guanine-N(1)-)-methyltransferase; Reviewed	NA|170aa|up_6|NZ_CP041667.1_1891793_1892303_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|77aa|up_5|NZ_CP041667.1_1892376_1892607_-	PRK00468, PRK00468, KH domain-containing protein	NA|82aa|up_4|NZ_CP041667.1_1892630_1892876_-	PRK00040, rpsP, 30S ribosomal protein S16; Reviewed	NA|454aa|up_3|NZ_CP041667.1_1892945_1894307_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|115aa|up_2|NZ_CP041667.1_1894344_1894689_-	PRK00118, PRK00118, putative DNA-binding protein; Validated	NA|297aa|up_1|NZ_CP041667.1_1894779_1895670_-	cd07487, Peptidases_S8_1, Peptidase S8 family domain, uncharacterized subfamily 1	NA|212aa|up_0|NZ_CP041667.1_1895776_1896412_+	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|165aa|down_0|NZ_CP041667.1_1897007_1897502_+	NA	NA|92aa|down_1|NZ_CP041667.1_1897494_1897770_+	NA	NA|159aa|down_2|NZ_CP041667.1_1897860_1898337_-	COG1607, COG1607, Acyl-CoA hydrolase [Lipid metabolism]	NA|169aa|down_3|NZ_CP041667.1_1898333_1898840_-	NA	NA|246aa|down_4|NZ_CP041667.1_1899031_1899769_+	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E	NA|341aa|down_5|NZ_CP041667.1_1899815_1900838_-	pfam13023, HD_3, HD domain	NA|135aa|down_6|NZ_CP041667.1_1900846_1901251_-	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|286aa|down_7|NZ_CP041667.1_1901266_1902124_-	cd07208, Pat_hypo_Ecoli_yjju_like, Hypothetical patatin similar to yjju protein of Escherichia coli	NA|545aa|down_8|NZ_CP041667.1_1902272_1903907_+	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|196aa|down_9|NZ_CP041667.1_1903909_1904497_-	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI
GCF_007361935.1_ASM736193v1	NZ_CP041667	Lachnospiraceae bacterium KGMB03038 chromosome, complete genome	4	1993071-1993171	4	CRISPRCasFinder	no		WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	Orphan	CCGCCCCTGCTCTGGCCTCTTTCGCC	26	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	NA,NA	NA|409aa|up_9|NZ_CP041667.1_1980882_1982109_-	PRK05469, PRK05469, tripeptide aminopeptidase PepT	NA|465aa|up_8|NZ_CP041667.1_1982190_1983585_-	cd17346, MFS_DtpA_like, Dipeptide and tripeptide permease A (DtpA)-like subfamily of the Major Facilitator Superfamily of transporters	NA|299aa|up_7|NZ_CP041667.1_1983685_1984582_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|177aa|up_6|NZ_CP041667.1_1984554_1985085_-	COG0350, Ada, Methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|696aa|up_5|NZ_CP041667.1_1985203_1987291_-	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|89aa|up_4|NZ_CP041667.1_1987492_1987759_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|301aa|up_3|NZ_CP041667.1_1987880_1988783_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|302aa|up_2|NZ_CP041667.1_1988794_1989700_-	PRK00130, truB, tRNA pseudouridine synthase B; Provisional	NA|318aa|up_1|NZ_CP041667.1_1989696_1990650_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|128aa|up_0|NZ_CP041667.1_1990639_1991023_-	pfam02033, RBFA, Ribosome-binding factor A	NA|108aa|down_0|NZ_CP041667.1_1993899_1994223_-	PRK07714, PRK07714, YlxQ family RNA-binding protein	NA|93aa|down_1|NZ_CP041667.1_1994209_1994488_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|415aa|down_2|NZ_CP041667.1_1994497_1995742_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|156aa|down_3|NZ_CP041667.1_1995760_1996228_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|1354aa|down_4|NZ_CP041667.1_1996395_2000457_-	pfam07581, Glug, The GLUG motif	NA|77aa|down_5|NZ_CP041667.1_2000596_2000827_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|298aa|down_6|NZ_CP041667.1_2000871_2001765_-	PRK00241, nudC, NAD(+) diphosphatase	NA|746aa|down_7|NZ_CP041667.1_2001769_2004007_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|641aa|down_8|NZ_CP041667.1_2004022_2005945_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|376aa|down_9|NZ_CP041667.1_2006021_2007149_-	pfam07454, SpoIIP, Stage II sporulation protein P (SpoIIP)
GCF_007361935.1_ASM736193v1	NZ_CP041667	Lachnospiraceae bacterium KGMB03038 chromosome, complete genome	5	2242633-2243738	5,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	Type I-C, Type I-U?,Type I-U	ATTTCAATCCACACTCCCCATGCGGGGAGTGAC,ATTTCAATCCACACTCCCCATGCGGGGAGTGAC,ATTTCAATCCACACTCCCCATGCGGGGAGTGAC	33,33,33	2	2	2242801-2242832|2243201-2243235	NZ_CP041667.1_2134404-2134435|NZ_CP041667.1_2101240-2101206	NA:NA:NA	16,16,14	16	TypeI-C,TypeI-U,TypeI-U?	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	NA|391aa|up_1|NZ_CP041667.1_2240348_2241521_-,NA|147aa|down_6|NZ_CP041667.1_2249307_2249748_-,NA|687aa|down_8|NZ_CP041667.1_2250338_2252399_-,NA|68aa|down_9|NZ_CP041667.1_2255205_2255409_+	NA|785aa|up_9|NZ_CP041667.1_2230954_2233309_-	pfam02687, FtsX, FtsX-like permease family	NA|226aa|up_8|NZ_CP041667.1_2233355_2234033_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|316aa|up_7|NZ_CP041667.1_2234127_2235075_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|225aa|up_6|NZ_CP041667.1_2235129_2235804_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|458aa|up_5|NZ_CP041667.1_2235892_2237266_-	cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Bacillus subtilis yoeA	NA|406aa|up_4|NZ_CP041667.1_2237318_2238536_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|216aa|up_3|NZ_CP041667.1_2238630_2239278_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|288aa|up_2|NZ_CP041667.1_2239387_2240251_-	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|391aa|up_1|NZ_CP041667.1_2240348_2241521_-	NA	NA|258aa|up_0|NZ_CP041667.1_2241696_2242470_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	cas2|97aa|down_0|NZ_CP041667.1_2243949_2244240_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_CP041667.1_2244279_2245311_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|222aa|down_2|NZ_CP041667.1_2245307_2245973_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|304aa|down_3|NZ_CP041667.1_2245972_2246884_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|579aa|down_4|NZ_CP041667.1_2246884_2248621_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|221aa|down_5|NZ_CP041667.1_2248617_2249280_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	NA|147aa|down_6|NZ_CP041667.1_2249307_2249748_-	NA	NA|156aa|down_7|NZ_CP041667.1_2249731_2250199_-	pfam13151, DUF3990, Protein of unknown function (DUF3990)	NA|687aa|down_8|NZ_CP041667.1_2250338_2252399_-	NA	NA|68aa|down_9|NZ_CP041667.1_2255205_2255409_+	NA
GCF_007361935.1_ASM736193v1	NZ_CP041667	Lachnospiraceae bacterium KGMB03038 chromosome, complete genome	6	2317193-2317496	6,2,2	CRISPRCasFinder,CRT,PILER-CR	no		WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	Orphan	ATATTTCAACCCACATCGCCAATGA,CATCGCCAATGACGGCGATGAC,ATTTCAATCCACATCGCCAATGACGGCGATGAC	25,22,33	0	0	NA	NA	NA:NA:NA	3,4,3	4	Orphan	WYL,csa3,RT,cas3,DinG,cas2,cas1,cas4,cas7,cas8c,cas5,DEDDh	NA|77aa|up_8|NZ_CP041667.1_2306657_2306888_+,NA|73aa|up_0|NZ_CP041667.1_2315258_2315477_-,NA|81aa|down_3|NZ_CP041667.1_2322613_2322856_-	NA|141aa|up_9|NZ_CP041667.1_2306264_2306687_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|77aa|up_8|NZ_CP041667.1_2306657_2306888_+	NA	NA|273aa|up_7|NZ_CP041667.1_2306939_2307758_-	cd18107, SpoU-like_AviRb, SAM-dependent rRNA methylase related to AviRb	NA|289aa|up_6|NZ_CP041667.1_2308337_2309204_-	pfam06445, GyrI-like, GyrI-like small molecule binding domain	NA|640aa|up_5|NZ_CP041667.1_2309394_2311314_-	COG0480, FusA, Translation elongation factors (GTPases) [Translation, ribosomal structure and biogenesis]	NA|47aa|up_4|NZ_CP041667.1_2311689_2311830_-	pfam12750, Maff2, Maff2 family	NA|594aa|up_3|NZ_CP041667.1_2311897_2313679_-	pfam02534, T4SS-DNA_transf, Type IV secretory system Conjugative DNA transfer	NA|158aa|up_2|NZ_CP041667.1_2313675_2314149_-	pfam12687, DUF3801, Protein of unknown function (DUF3801)	NA|315aa|up_1|NZ_CP041667.1_2314185_2315130_-	pfam13730, HTH_36, Helix-turn-helix domain	NA|73aa|up_0|NZ_CP041667.1_2315258_2315477_-	NA	NA|429aa|down_0|NZ_CP041667.1_2317629_2318916_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|472aa|down_1|NZ_CP041667.1_2320026_2321442_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|308aa|down_2|NZ_CP041667.1_2321555_2322479_-	cd11297, PIN_LabA-like_N_1, uncharacterized subfamily of N-terminal LabA-like PIN domains	NA|81aa|down_3|NZ_CP041667.1_2322613_2322856_-	NA	NA|747aa|down_4|NZ_CP041667.1_2322943_2325184_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|131aa|down_5|NZ_CP041667.1_2325186_2325579_-	pfam12646, DUF3783, Domain of unknown function (DUF3783)	NA|635aa|down_6|NZ_CP041667.1_2325652_2327557_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|583aa|down_7|NZ_CP041667.1_2327560_2329309_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|204aa|down_8|NZ_CP041667.1_2329505_2330117_+	pfam08876, DUF1836, Domain of unknown function (DUF1836)	NA|243aa|down_9|NZ_CP041667.1_2330126_2330855_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
