assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009769205.1_ASM976920v1	NZ_CP039266	Lactobacillus crispatus strain DC21.1 chromosome, complete genome	1	1184365-1185248	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas14j	cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	Unclear	GGATCACCTCCACACACGTGGAGAATAC,GGATCACCTCCACACACGTGGAGAATAC,GGATCACCTCCACACACGTGGAGAATAC	28,28,28	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	14,14,13	14	TypeV	cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	NA|31aa|up_0|NZ_CP039266.1_1183802_1183895_-,NA	NA|228aa|up_9|NZ_CP039266.1_1173080_1173764_-	pfam12847, Methyltransf_18, Methyltransferase domain	NA|147aa|up_8|NZ_CP039266.1_1173864_1174305_+	cd03427, MTH1, MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily	NA|225aa|up_7|NZ_CP039266.1_1174304_1174979_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|371aa|up_6|NZ_CP039266.1_1175025_1176138_-	PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated	NA|607aa|up_5|NZ_CP039266.1_1176153_1177974_-	PRK05667, dnaG, DNA primase; Validated	NA|688aa|up_4|NZ_CP039266.1_1178003_1180067_-	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|306aa|up_3|NZ_CP039266.1_1180059_1180977_-	PRK09348, glyQ, glycyl-tRNA synthetase subunit alpha; Validated	NA|252aa|up_2|NZ_CP039266.1_1181236_1181992_-	TIGR00613, DNA_repair_protein_RecO, DNA repair protein RecO	cas14j|434aa|up_1|NZ_CP039266.1_1182227_1183529_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|31aa|up_0|NZ_CP039266.1_1183802_1183895_-	NA	NA|301aa|down_0|NZ_CP039266.1_1185431_1186334_-	PRK00089, era, GTPase Era; Reviewed	NA|175aa|down_1|NZ_CP039266.1_1186333_1186858_-	pfam02130, UPF0054, Uncharacterized protein family UPF0054	NA|320aa|down_2|NZ_CP039266.1_1186860_1187820_-	pfam02562, PhoH, PhoH-like protein	NA|148aa|down_3|NZ_CP039266.1_1187845_1188289_-	pfam09424, YqeY, Yqey-like protein	NA|59aa|down_4|NZ_CP039266.1_1188449_1188626_-	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|279aa|down_5|NZ_CP039266.1_1188853_1189690_+	pfam03618, Kinase-PPPase, Kinase/pyrophosphorylase	NA|271aa|down_6|NZ_CP039266.1_1189790_1190603_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|249aa|down_7|NZ_CP039266.1_1191074_1191821_-	pfam00398, RrnaAD, Ribosomal RNA adenine dimethylase	NA|28aa|down_8|NZ_CP039266.1_1191945_1192029_-	pfam06308, ErmC, 23S rRNA methylase leader peptide (ErmC)	NA|80aa|down_9|NZ_CP039266.1_1192077_1192317_-	pfam07764, Omega_Repress, Omega Transcriptional Repressor
GCF_009769205.1_ASM976920v1	NZ_CP039266	Lactobacillus crispatus strain DC21.1 chromosome, complete genome	2	1207296-1208177	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no		cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	Orphan	GGATCACCTCCACATGTGTGGAGAATAC,GGATCACCTCCACATGTGTGGAGAATAC,GGATCACCTCCACATGTGTGGAGAATAC	28,28,28	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	14,14,13	14	Orphan	cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	NA|182aa|up_3|NZ_CP039266.1_1203034_1203580_-,NA|186aa|up_0|NZ_CP039266.1_1206387_1206945_-,NA|77aa|down_6|NZ_CP039266.1_1215271_1215502_-,NA|131aa|down_7|NZ_CP039266.1_1216028_1216421_-,NA|68aa|down_9|NZ_CP039266.1_1219270_1219474_-	NA|249aa|up_9|NZ_CP039266.1_1191074_1191821_-	pfam00398, RrnaAD, Ribosomal RNA adenine dimethylase	NA|28aa|up_8|NZ_CP039266.1_1191945_1192029_-	pfam06308, ErmC, 23S rRNA methylase leader peptide (ErmC)	NA|80aa|up_7|NZ_CP039266.1_1192077_1192317_-	pfam07764, Omega_Repress, Omega Transcriptional Repressor	NA|424aa|up_6|NZ_CP039266.1_1192565_1193837_+	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|2524aa|up_5|NZ_CP039266.1_1193907_1201479_-	pfam17966, Mub_B2, Mub B2-like domain	NA|212aa|up_4|NZ_CP039266.1_1202379_1203015_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|182aa|up_3|NZ_CP039266.1_1203034_1203580_-	NA	NA|77aa|up_2|NZ_CP039266.1_1203579_1203810_-	cd00371, HMA, Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones	NA|618aa|up_1|NZ_CP039266.1_1203885_1205739_-	cd02079, P-type_ATPase_HM, P-type heavy metal-transporting ATPase	NA|186aa|up_0|NZ_CP039266.1_1206387_1206945_-	NA	NA|208aa|down_0|NZ_CP039266.1_1208609_1209233_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|189aa|down_1|NZ_CP039266.1_1209303_1209870_-	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|293aa|down_2|NZ_CP039266.1_1211690_1212569_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|286aa|down_3|NZ_CP039266.1_1213020_1213878_-	TIGR04320, hypothetical_protein, SEC10/PgrA surface exclusion domain	NA|296aa|down_4|NZ_CP039266.1_1213989_1214877_-	cd06415, GH25_Cpl1-like, Cpl-1 lysin (also known as Cpl-9 lysozyme / muramidase) is a bacterial cell wall endolysin encoded by the pneumococcal bacteriophage Cp-1, which cleaves the glycosidic N-acetylmuramoyl-(beta1,4)-N-acetylglucosamine bonds of the pneumococcal glycan chain, thus acting as an enzymatic antimicrobial agent (an enzybiotic) against streptococcal infections	NA|111aa|down_5|NZ_CP039266.1_1214936_1215269_-	cd15964, 7tmA_TSH-R, thyroid-stimulating hormone receptor (or thyrotropin receptor), member of the class A family of seven-transmembrane G protein-coupled receptors	NA|77aa|down_6|NZ_CP039266.1_1215271_1215502_-	NA	NA|131aa|down_7|NZ_CP039266.1_1216028_1216421_-	NA	NA|946aa|down_8|NZ_CP039266.1_1216432_1219270_-	pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain	NA|68aa|down_9|NZ_CP039266.1_1219270_1219474_-	NA
GCF_009769205.1_ASM976920v1	NZ_CP039266	Lactobacillus crispatus strain DC21.1 chromosome, complete genome	3	1210906-1211472	3,3,3	CRISPRCasFinder,CRT,PILER-CR	no		cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	Orphan	GGATGACCTCCACATACGTGGAGAATAC,ACCTCCACATACGTGGAGAATAC,GGATGACCTCCACATACGTGGAGAATAC	28,23,28	0	0	NA	NA	I-B,III-A,III-B:NA:I-B,III-A,III-B	9,9,6	9	Orphan	cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	NA|182aa|up_5|NZ_CP039266.1_1203034_1203580_-,NA|186aa|up_2|NZ_CP039266.1_1206387_1206945_-,NA|77aa|down_4|NZ_CP039266.1_1215271_1215502_-,NA|131aa|down_5|NZ_CP039266.1_1216028_1216421_-,NA|68aa|down_7|NZ_CP039266.1_1219270_1219474_-	NA|80aa|up_9|NZ_CP039266.1_1192077_1192317_-	pfam07764, Omega_Repress, Omega Transcriptional Repressor	NA|424aa|up_8|NZ_CP039266.1_1192565_1193837_+	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|2524aa|up_7|NZ_CP039266.1_1193907_1201479_-	pfam17966, Mub_B2, Mub B2-like domain	NA|212aa|up_6|NZ_CP039266.1_1202379_1203015_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|182aa|up_5|NZ_CP039266.1_1203034_1203580_-	NA	NA|77aa|up_4|NZ_CP039266.1_1203579_1203810_-	cd00371, HMA, Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones	NA|618aa|up_3|NZ_CP039266.1_1203885_1205739_-	cd02079, P-type_ATPase_HM, P-type heavy metal-transporting ATPase	NA|186aa|up_2|NZ_CP039266.1_1206387_1206945_-	NA	NA|208aa|up_1|NZ_CP039266.1_1208609_1209233_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|189aa|up_0|NZ_CP039266.1_1209303_1209870_-	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|293aa|down_0|NZ_CP039266.1_1211690_1212569_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|286aa|down_1|NZ_CP039266.1_1213020_1213878_-	TIGR04320, hypothetical_protein, SEC10/PgrA surface exclusion domain	NA|296aa|down_2|NZ_CP039266.1_1213989_1214877_-	cd06415, GH25_Cpl1-like, Cpl-1 lysin (also known as Cpl-9 lysozyme / muramidase) is a bacterial cell wall endolysin encoded by the pneumococcal bacteriophage Cp-1, which cleaves the glycosidic N-acetylmuramoyl-(beta1,4)-N-acetylglucosamine bonds of the pneumococcal glycan chain, thus acting as an enzymatic antimicrobial agent (an enzybiotic) against streptococcal infections	NA|111aa|down_3|NZ_CP039266.1_1214936_1215269_-	cd15964, 7tmA_TSH-R, thyroid-stimulating hormone receptor (or thyrotropin receptor), member of the class A family of seven-transmembrane G protein-coupled receptors	NA|77aa|down_4|NZ_CP039266.1_1215271_1215502_-	NA	NA|131aa|down_5|NZ_CP039266.1_1216028_1216421_-	NA	NA|946aa|down_6|NZ_CP039266.1_1216432_1219270_-	pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain	NA|68aa|down_7|NZ_CP039266.1_1219270_1219474_-	NA	NA|753aa|down_8|NZ_CP039266.1_1219485_1221744_-	pfam06605, Prophage_tail, Prophage endopeptidase tail	NA|273aa|down_9|NZ_CP039266.1_1221743_1222562_-	pfam05709, Sipho_tail, Phage tail protein
GCF_009769205.1_ASM976920v1	NZ_CP039266	Lactobacillus crispatus strain DC21.1 chromosome, complete genome	4	1275321-1276814	4,4,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no		cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	Orphan	GGATCACCTCCACATACGTGGAGAATAC,GGATCACCTCCACATACGTGGAGAATAC,GGATCACCTCCACATACGTGGAGAATA,AGGATCACCTCCACATACGTGGAGAATAC	28,28,27,29	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	24,24,23,23	24	Orphan	cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	NA|183aa|up_9|NZ_CP039266.1_1262273_1262822_-,NA	NA|183aa|up_9|NZ_CP039266.1_1262273_1262822_-	NA	NA|206aa|up_8|NZ_CP039266.1_1262911_1263529_-	cd05466, PBP2_LTTR_substrate, The substrate binding domain of LysR-type transcriptional regulators (LTTRs), a member of the type 2 periplasmic binding fold protein superfamily	NA|389aa|up_7|NZ_CP039266.1_1263695_1264862_-	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|179aa|up_6|NZ_CP039266.1_1264977_1265514_-	pfam03802, CitX, Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase	NA|301aa|up_5|NZ_CP039266.1_1265527_1266430_-	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|410aa|up_4|NZ_CP039266.1_1266832_1268062_-	PRK01388, PRK01388, arginine deiminase; Provisional	NA|283aa|up_3|NZ_CP039266.1_1268135_1268984_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|758aa|up_2|NZ_CP039266.1_1269112_1271386_+	cd02076, P-type_ATPase_H, plant and fungal plasma membrane H(+)-ATPases, and related bacterial and archaeal putative H(+)-ATPases	NA|172aa|up_1|NZ_CP039266.1_1271785_1272301_-	cd10432, BI-1-like_bacterial, Bacterial BAX inhibitor (BI)-1/YccA-like proteins	NA|440aa|up_0|NZ_CP039266.1_1273712_1275032_-	PRK07251, PRK07251, FAD-containing oxidoreductase	NA|176aa|down_0|NZ_CP039266.1_1278063_1278591_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|758aa|down_1|NZ_CP039266.1_1278695_1280969_-	TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ	NA|229aa|down_2|NZ_CP039266.1_1281092_1281779_-	cd06165, Sortase_A, Sortase domain found in class A sortases	NA|613aa|down_3|NZ_CP039266.1_1281781_1283620_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|152aa|down_4|NZ_CP039266.1_1283727_1284183_+	cd03428, Ap4A_hydrolase_human_like, Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily	NA|384aa|down_5|NZ_CP039266.1_1284242_1285394_-	PRK14276, PRK14276, chaperone protein DnaJ; Provisional	NA|618aa|down_6|NZ_CP039266.1_1285476_1287330_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|195aa|down_7|NZ_CP039266.1_1287346_1287931_-	PRK14162, PRK14162, heat shock protein GrpE; Provisional	NA|350aa|down_8|NZ_CP039266.1_1287943_1288993_-	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|312aa|down_9|NZ_CP039266.1_1289130_1290066_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase
GCF_009769205.1_ASM976920v1	NZ_CP039266	Lactobacillus crispatus strain DC21.1 chromosome, complete genome	5	1986751-1986815	5	CRISPRCasFinder	no		cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	Orphan	TTTACTTTATAAAGGAAATAATATG	25	0	0	NA	NA	NA	1	1	Orphan	cas14k,c2c9_V-U4,cas2,cas3,DEDDh,DinG,cas14j,csa3,Cas14u_CAS-V	NA|101aa|up_7|NZ_CP039266.1_1980800_1981103_+,NA|111aa|up_2|NZ_CP039266.1_1985019_1985352_+,NA	NA|267aa|up_9|NZ_CP039266.1_1977984_1978785_-	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|436aa|up_8|NZ_CP039266.1_1979013_1980321_+	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|101aa|up_7|NZ_CP039266.1_1980800_1981103_+	NA	NA|396aa|up_6|NZ_CP039266.1_1981105_1982293_+	pfam02517, Abi, CAAX protease self-immunity	NA|242aa|up_5|NZ_CP039266.1_1982377_1983103_-	cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA	NA|437aa|up_4|NZ_CP039266.1_1983286_1984597_+	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|89aa|up_3|NZ_CP039266.1_1984763_1985030_+	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|111aa|up_2|NZ_CP039266.1_1985019_1985352_+	NA	NA|202aa|up_1|NZ_CP039266.1_1985736_1986342_+	pfam12625, Arabinose_bd, Arabinose-binding domain of AraC transcription regulator, N-term	NA|126aa|up_0|NZ_CP039266.1_1986345_1986723_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|258aa|down_0|NZ_CP039266.1_1988103_1988877_-	cd07517, HAD_HPP, phosphatase, similar to Bacteroides thetaiotaomicron VPI-5482 BT4131 hexose phosphate phosphatase; belongs to the haloacid dehalogenase-like superfamily	NA|437aa|down_1|NZ_CP039266.1_1989030_1990341_+	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|143aa|down_2|NZ_CP039266.1_1991702_1992131_-	pfam07931, CPT, Chloramphenicol phosphotransferase-like protein	NA|165aa|down_3|NZ_CP039266.1_1992206_1992701_-	pfam10706, Aminoglyc_resit, Aminoglycoside-2''-adenylyltransferase	NA|346aa|down_4|NZ_CP039266.1_1992724_1993762_-	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|259aa|down_5|NZ_CP039266.1_1994180_1994957_-	PRK13746, PRK13746, aminoglycoside resistance protein; Provisional	NA|257aa|down_6|NZ_CP039266.1_1994981_1995752_-	cd09007, NP-I_spr0068, uncharacterized subfamily of the nucleoside phosphorylase-I family	NA|147aa|down_7|NZ_CP039266.1_1997717_1998158_+	TIGR02698, Uncharacterized_16	NA|126aa|down_8|NZ_CP039266.1_1998173_1998551_+	pfam13473, Cupredoxin_1, Cupredoxin-like domain	NA|96aa|down_9|NZ_CP039266.1_1998563_1998851_+	pfam13473, Cupredoxin_1, Cupredoxin-like domain
