assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902386615.1_UHGG_MGYG-HGUT-02385	NZ_LR698987	Leminorella richardii isolate MGYG-HGUT-02385 chromosome 1	1	636327-638857	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2	cas3,csa3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,DEDDh,WYL,DinG,RT	Type I-E	GTCTTCTCCACGTACGTGGAGGTGTTTC,GTCTTCTCCACGTACGTGGAGGTGTTTC,GTCTTCTCCACGTNACGTGGAGGTGTTTCN	28,28,30	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:NA	39,39,41	41	TypeI-E	cas3,csa3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,DEDDh,WYL,DinG,RT	NA,NA|191aa|down_9|NZ_LR698987.1_648021_648594_+	NA|388aa|up_9|NZ_LR698987.1_625356_626520_-	cd03326, MR_like_1, Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 1	NA|306aa|up_8|NZ_LR698987.1_626714_627632_+	PRK10082, PRK10082, hypochlorite stress DNA-binding transcriptional regulator HypT	cas3|889aa|up_7|NZ_LR698987.1_627993_630660_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|503aa|up_6|NZ_LR698987.1_630662_632171_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|172aa|up_5|NZ_LR698987.1_632167_632683_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas6e|212aa|up_4|NZ_LR698987.1_632679_633315_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas7|349aa|up_3|NZ_LR698987.1_633330_634377_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|234aa|up_2|NZ_LR698987.1_634380_635082_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas1|287aa|up_1|NZ_LR698987.1_635126_635987_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|97aa|up_0|NZ_LR698987.1_635983_636274_+	cd09648, Cas2_I-E, CRISPR/Cas system-associated protein Cas2	NA|336aa|down_0|NZ_LR698987.1_639018_640026_+	PRK07204, PRK07204, beta-ketoacyl-ACP synthase III	NA|339aa|down_1|NZ_LR698987.1_640022_641039_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|269aa|down_2|NZ_LR698987.1_641016_641823_+	cd07730, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|429aa|down_3|NZ_LR698987.1_641819_643106_+	TIGR02304, conserved_hypothetical_protein, putative adenylate-forming enzyme	NA|206aa|down_4|NZ_LR698987.1_643102_643720_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|376aa|down_5|NZ_LR698987.1_643712_644840_+	pfam04116, FA_hydroxylase, Fatty acid hydroxylase superfamily	NA|371aa|down_6|NZ_LR698987.1_644826_645939_+	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|322aa|down_7|NZ_LR698987.1_645935_646901_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|376aa|down_8|NZ_LR698987.1_646897_648025_+	pfam13506, Glyco_transf_21, Glycosyl transferase family 21	NA|191aa|down_9|NZ_LR698987.1_648021_648594_+	NA
GCF_902386615.1_UHGG_MGYG-HGUT-02385	NZ_LR698987	Leminorella richardii isolate MGYG-HGUT-02385 chromosome 1	2	1247657-1247985	2,2,2	CRISPRCasFinder,PILER-CR,CRT	no		cas3,csa3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,DEDDh,WYL,DinG,RT	Orphan	TGTTGACCGCCGCATAGGCGGCTTAGAAA,GTTGACCGCCGCATAGGCGGCTTAGAAA,GTTGACCGCCGCATAGGCGGCTTAGAAA	29,28,28	0	0	NA	NA	I-F:I-F:I-F	5,4,5	5	Orphan	cas3,csa3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,DEDDh,WYL,DinG,RT	NA|94aa|up_9|NZ_LR698987.1_1238622_1238904_-,NA|48aa|up_8|NZ_LR698987.1_1239015_1239159_+,NA|56aa|up_7|NZ_LR698987.1_1239285_1239453_-,NA|250aa|down_8|NZ_LR698987.1_1256088_1256838_+	NA|94aa|up_9|NZ_LR698987.1_1238622_1238904_-	NA	NA|48aa|up_8|NZ_LR698987.1_1239015_1239159_+	NA	NA|56aa|up_7|NZ_LR698987.1_1239285_1239453_-	NA	NA|289aa|up_6|NZ_LR698987.1_1239707_1240574_-	pfam12883, DUF3828, Protein of unknown function (DUF3828)	NA|406aa|up_5|NZ_LR698987.1_1240720_1241938_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|258aa|up_4|NZ_LR698987.1_1242111_1242885_+	sd00010, SLR, Sel1-like repeat	NA|475aa|up_3|NZ_LR698987.1_1243063_1244488_+	PRK10637, cysG, siroheme synthase CysG	NA|303aa|up_2|NZ_LR698987.1_1244507_1245416_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|487aa|up_1|NZ_LR698987.1_1245438_1246899_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|200aa|up_0|NZ_LR698987.1_1246902_1247502_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|258aa|down_0|NZ_LR698987.1_1248091_1248865_+	PRK01683, PRK01683, trans-aconitate 2-methyltransferase; Provisional	NA|330aa|down_1|NZ_LR698987.1_1248906_1249896_-	PRK10752, PRK10752, sulfate ABC transporter substrate-binding protein	NA|135aa|down_2|NZ_LR698987.1_1250095_1250500_+	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|52aa|down_3|NZ_LR698987.1_1250707_1250863_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|137aa|down_4|NZ_LR698987.1_1250875_1251286_+	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|137aa|down_5|NZ_LR698987.1_1251323_1251734_+	PRK10227, PRK10227, HTH-type transcriptional regulator CueR	NA|436aa|down_6|NZ_LR698987.1_1252161_1253469_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|622aa|down_7|NZ_LR698987.1_1253865_1255731_+	COG0471, CitT, Di- and tricarboxylate transporters [Inorganic ion transport and metabolism]	NA|250aa|down_8|NZ_LR698987.1_1256088_1256838_+	NA	NA|587aa|down_9|NZ_LR698987.1_1256956_1258717_+	cd16012, ALP, Alkaline Phosphatase
GCF_902386615.1_UHGG_MGYG-HGUT-02385	NZ_LR698987	Leminorella richardii isolate MGYG-HGUT-02385 chromosome 1	3	2278992-2279139	3,3	CRISPRCasFinder,PILER-CR	no		cas3,csa3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,DEDDh,WYL,DinG,RT	Orphan	TTTCTAAGCCGCCTATGCGGCGGTTAAC,GTTAACCGCCGCATAGGCGGCTTAGAAA	28,28	0	0	NA	NA	I-F:I-F	2,2	2	Orphan	cas3,csa3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,DEDDh,WYL,DinG,RT	NA,NA|316aa|down_1|NZ_LR698987.1_2280169_2281117_-,NA|188aa|down_5|NZ_LR698987.1_2284422_2284986_+	NA|186aa|up_9|NZ_LR698987.1_2266757_2267315_+	COG3539, FimA, P pilus assembly protein, pilin FimA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|235aa|up_8|NZ_LR698987.1_2267397_2268102_+	PRK09926, PRK09926, fimbrial chaperone	NA|849aa|up_7|NZ_LR698987.1_2268221_2270768_+	COG3188, FimD, P pilus assembly protein, porin PapC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|97aa|up_6|NZ_LR698987.1_2270778_2271069_+	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|401aa|up_5|NZ_LR698987.1_2271068_2272271_+	COG3539, FimA, P pilus assembly protein, pilin FimA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|304aa|up_4|NZ_LR698987.1_2272395_2273307_-	COG0329, DapA, Dihydrodipicolinate synthase/N-acetylneuraminate lyase [Amino acid transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|572aa|up_3|NZ_LR698987.1_2273426_2275142_-	pfam00920, ILVD_EDD, Dehydratase family	NA|424aa|up_2|NZ_LR698987.1_2275203_2276475_-	cd17319, MFS_ExuT_GudP_like, Hexuronate transporter, Glucarate transporter, and similar transporters of the Major Facilitator Superfamily	NA|366aa|up_1|NZ_LR698987.1_2276885_2277983_+	PRK09423, gldA, glycerol dehydrogenase; Provisional	NA|274aa|up_0|NZ_LR698987.1_2278167_2278989_+	sd00045, ANK, ankyrin repeats	NA|246aa|down_0|NZ_LR698987.1_2279188_2279926_-	COG4700, COG4700, Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]	NA|316aa|down_1|NZ_LR698987.1_2280169_2281117_-	NA	NA|442aa|down_2|NZ_LR698987.1_2281203_2282529_-	PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional	NA|234aa|down_3|NZ_LR698987.1_2282761_2283463_-	cd10432, BI-1-like_bacterial, Bacterial BAX inhibitor (BI)-1/YccA-like proteins	NA|195aa|down_4|NZ_LR698987.1_2283587_2284172_-	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|188aa|down_5|NZ_LR698987.1_2284422_2284986_+	NA	NA|153aa|down_6|NZ_LR698987.1_2285027_2285486_-	PRK10678, moaE, molybdopterin synthase catalytic subunit MoaE	NA|82aa|down_7|NZ_LR698987.1_2285487_2285733_-	PRK11130, moaD, molybdopterin synthase small subunit; Provisional	NA|161aa|down_8|NZ_LR698987.1_2285747_2286230_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|172aa|down_9|NZ_LR698987.1_2286244_2286760_-	TIGR02667, Molybdenum_cofactor_biosynthesis_protein_B
