assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001050475.1_ASM105047v1	NZ_CP012034	Lactobacillus ginsenosidimutans strain EMML 3041, complete genome	1	607146-607308	1	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,WYL,cas14j	Orphan	CAGCATCTTTAACGCCAGTTCTTAC	25	0	0	NA	NA	NA	2	2	Orphan	csa3,DinG,DEDDh,cas3,WYL,cas14j	NA|260aa|up_9|NZ_CP012034.1_594987_595767_-,NA|172aa|up_6|NZ_CP012034.1_598438_598954_-,NA|173aa|up_4|NZ_CP012034.1_600420_600939_-,NA	NA|260aa|up_9|NZ_CP012034.1_594987_595767_-	NA	NA|368aa|up_8|NZ_CP012034.1_595916_597020_+	cd04737, LOX_like_FMN, L-Lactate oxidase (LOX) FMN-binding domain	NA|446aa|up_7|NZ_CP012034.1_597101_598439_-	PRK11204, PRK11204, N-glycosyltransferase; Provisional	NA|172aa|up_6|NZ_CP012034.1_598438_598954_-	NA	NA|433aa|up_5|NZ_CP012034.1_599125_600424_-	PRK11204, PRK11204, N-glycosyltransferase; Provisional	NA|173aa|up_4|NZ_CP012034.1_600420_600939_-	NA	NA|173aa|up_3|NZ_CP012034.1_601790_602309_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|771aa|up_2|NZ_CP012034.1_602328_604641_-	TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ	NA|221aa|up_1|NZ_CP012034.1_604705_605368_-	cd06165, Sortase_A, Sortase domain found in class A sortases	NA|443aa|up_0|NZ_CP012034.1_605547_606876_-	COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes [Energy production and conversion]	NA|320aa|down_0|NZ_CP012034.1_608699_609659_-	cd08253, zeta_crystallin, Zeta-crystallin with NADP-dependent quinone reductase activity (QOR)	NA|616aa|down_1|NZ_CP012034.1_609729_611577_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|379aa|down_2|NZ_CP012034.1_611644_612781_-	PRK14276, PRK14276, chaperone protein DnaJ; Provisional	NA|628aa|down_3|NZ_CP012034.1_612891_614775_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|200aa|down_4|NZ_CP012034.1_614807_615407_-	PRK14162, PRK14162, heat shock protein GrpE; Provisional	NA|349aa|down_5|NZ_CP012034.1_615425_616472_-	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|314aa|down_6|NZ_CP012034.1_616649_617591_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|303aa|down_7|NZ_CP012034.1_617597_618506_-	PRK01550, truB, tRNA pseudouridine synthase B; Provisional	NA|119aa|down_8|NZ_CP012034.1_618576_618933_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|947aa|down_9|NZ_CP012034.1_618944_621785_-	PRK05306, infB, translation initiation factor IF-2; Validated
GCF_001050475.1_ASM105047v1	NZ_CP012034	Lactobacillus ginsenosidimutans strain EMML 3041, complete genome	2	1679428-1679538	2	CRISPRCasFinder	no	DinG	csa3,DinG,DEDDh,cas3,WYL,cas14j	Type IV-A	CACACAGACATAAGTTGCTCTATGCACC	28	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,WYL,cas14j	NA|80aa|up_0|NZ_CP012034.1_1679168_1679408_+,NA|70aa|down_6|NZ_CP012034.1_1685943_1686153_+,NA|209aa|down_7|NZ_CP012034.1_1686255_1686882_-	NA|194aa|up_9|NZ_CP012034.1_1671871_1672453_-	pfam06736, DUF1211, Protein of unknown function (DUF1211)	NA|351aa|up_8|NZ_CP012034.1_1672639_1673692_+	COG3177, COG3177, Fic family protein [Function unknown]	NA|130aa|up_7|NZ_CP012034.1_1673731_1674121_-	pfam10012, DUF2255, Uncharacterized protein conserved in bacteria (DUF2255)	NA|248aa|up_6|NZ_CP012034.1_1674136_1674880_-	COG4221, COG4221, Short-chain alcohol dehydrogenase of unknown specificity [General function prediction only]	NA|291aa|up_5|NZ_CP012034.1_1674994_1675867_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|249aa|up_4|NZ_CP012034.1_1675905_1676652_-	COG4221, COG4221, Short-chain alcohol dehydrogenase of unknown specificity [General function prediction only]	NA|150aa|up_3|NZ_CP012034.1_1676679_1677129_-	pfam18050, Cyclophil_like2, Cyclophilin-like family	NA|187aa|up_2|NZ_CP012034.1_1677132_1677693_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|401aa|up_1|NZ_CP012034.1_1677919_1679122_+	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|80aa|up_0|NZ_CP012034.1_1679168_1679408_+	NA	NA|171aa|down_0|NZ_CP012034.1_1680039_1680552_+	pfam07274, DUF1440, Protein of unknown function (DUF1440)	NA|442aa|down_1|NZ_CP012034.1_1680555_1681881_+	cd17332, MFS_MelB_like, Salmonella enterica Na+/melibiose symporter MelB and similar transporters of the Major Facilitator Superfamily	NA|70aa|down_2|NZ_CP012034.1_1682003_1682213_+	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|335aa|down_3|NZ_CP012034.1_1682421_1683426_+	cd19079, AKR_EcYajO-like, Escherichia coli YajO and similar proteins	NA|243aa|down_4|NZ_CP012034.1_1684355_1685084_-	pfam00877, NLPC_P60, NlpC/P60 family	NA|184aa|down_5|NZ_CP012034.1_1685301_1685853_-	COG0194, Gmk, Guanylate kinase [Nucleotide transport and metabolism]	NA|70aa|down_6|NZ_CP012034.1_1685943_1686153_+	NA	NA|209aa|down_7|NZ_CP012034.1_1686255_1686882_-	NA	NA|77aa|down_8|NZ_CP012034.1_1687143_1687374_+	TIGR02194, Glutaredoxin-like_protein_NrdH, Glutaredoxin-like protein NrdH	NA|125aa|down_9|NZ_CP012034.1_1687378_1687753_+	PRK03600, nrdI, class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI
