assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000015385.1_ASM1538v1	NC_010080	Lactobacillus helveticus DPC 4571, complete sequence	1	566505-566687	1	PILER-CR	no		Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	Orphan	CACGCGTGGGTTCAAATCCCACAT	24	0	0	NA	NA	NA	2	2	Orphan	Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	NA|130aa|up_1|NC_010080.1_564086_564476_+,NA|264aa|down_6|NC_010080.1_575712_576504_+	NA|393aa|up_9|NC_010080.1_555424_556603_-	PRK08068, PRK08068, transaminase; Reviewed	NA|264aa|up_8|NC_010080.1_556623_557415_-	cd07583, nitrilase_5, Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases)	NA|154aa|up_7|NC_010080.1_559276_559738_+	pfam13349, DUF4097, Putative adhesin	NA|116aa|up_6|NC_010080.1_559811_560159_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|84aa|up_5|NC_010080.1_560204_560456_+	pfam04024, PspC, PspC domain	NA|186aa|up_4|NC_010080.1_560637_561195_+	pfam07563, DUF1541, Protein of unknown function (DUF1541)	NA|319aa|up_3|NC_010080.1_561312_562269_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|67aa|up_2|NC_010080.1_563623_563824_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|130aa|up_1|NC_010080.1_564086_564476_+	NA	NA|166aa|up_0|NC_010080.1_565840_566338_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|443aa|down_0|NC_010080.1_567759_569088_-	PRK10720, PRK10720, uracil transporter; Provisional	NA|177aa|down_1|NC_010080.1_569100_569631_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|490aa|down_2|NC_010080.1_569921_571391_+	cd17502, MFS_Azr1_MDR_like, Saccharomyces cerevisiae Azole resistance protein 1 (Azr1p), and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|402aa|down_3|NC_010080.1_571414_572620_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|361aa|down_4|NC_010080.1_572746_573829_+	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|541aa|down_5|NC_010080.1_573933_575556_+	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|264aa|down_6|NC_010080.1_575712_576504_+	NA	NA|223aa|down_7|NC_010080.1_576516_577185_+	cd01741, GATase1_1, Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain	NA|273aa|down_8|NC_010080.1_577227_578046_-	cd13689, PBP2_BsGlnH, Substrate binding domain of ABC glutamine transporter from Bacillus subtilis; the type 2 periplasmic-bindig protein fold	NA|154aa|down_9|NC_010080.1_578186_578648_+	pfam12802, MarR_2, MarR family
GCF_000015385.1_ASM1538v1	NC_010080	Lactobacillus helveticus DPC 4571, complete sequence	2	1349781-1349901	1	CRISPRCasFinder	no	DEDDh	Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	Unclear	AGAAAAAGAACCAAACAGCCTTTTAATGACTGCTTGATTCT	41	1	13	1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860|1349822-1349860	NC_010080.1_11682-11720|NC_010080.1_22131-22093|NC_010080.1_33601-33563|NC_010080.1_342278-342240|NC_010080.1_376795-376757|NC_010080.1_436274-436236|NC_010080.1_962327-962289|NC_010080.1_1078129-1078091|NC_010080.1_1466610-1466572|NC_010080.1_1614293-1614331|NC_010080.1_1893196-1893234|NC_010080.1_1912867-1912829|NC_010080.1_1762542-1762580	NA	1	1	Orphan	Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	NA,NA	NA|419aa|up_9|NC_010080.1_1337076_1338333_-	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|245aa|up_8|NC_010080.1_1339159_1339894_-	PRK14830, PRK14830, undecaprenyl pyrophosphate synthase; Provisional	NA|186aa|up_7|NC_010080.1_1339896_1340454_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|242aa|up_6|NC_010080.1_1340453_1341179_-	PRK00358, pyrH, uridylate kinase; Provisional	NA|342aa|up_5|NC_010080.1_1341317_1342343_-	PRK09377, tsf, elongation factor Ts; Provisional	NA|258aa|up_4|NC_010080.1_1342376_1343150_-	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|344aa|up_3|NC_010080.1_1343311_1344343_-	COG4123, COG4123, Predicted O-methyltransferase [General function prediction only]	NA|205aa|up_2|NC_010080.1_1344408_1345023_+	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|394aa|up_1|NC_010080.1_1345082_1346264_+	COG2230, Cfa, Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane]	NA|648aa|up_0|NC_010080.1_1346321_1348265_-	cd08662, M13, Peptidase family M13 includes neprilysin and endothelin-converting enzyme I	NA|73aa|down_0|NC_010080.1_1349977_1350196_-	pfam03672, UPF0154, Uncharacterized protein family (UPF0154)	NA|88aa|down_1|NC_010080.1_1350258_1350522_-	pfam05979, DUF896, Bacterial protein of unknown function (DUF896)	NA|209aa|down_2|NC_010080.1_1350672_1351299_+	PRK00215, PRK00215, transcriptional repressor LexA	NA|262aa|down_3|NC_010080.1_1351327_1352113_-	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|220aa|down_4|NC_010080.1_1352105_1352765_-	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|469aa|down_5|NC_010080.1_1352947_1354354_+	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|116aa|down_6|NC_010080.1_1354760_1355108_-	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|240aa|down_7|NC_010080.1_1355220_1355940_-	PRK00026, trmD, tRNA (guanine-N(1)-)-methyltransferase; Reviewed	NA|172aa|down_8|NC_010080.1_1355929_1356445_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|91aa|down_9|NC_010080.1_1356513_1356786_-	PRK00040, rpsP, 30S ribosomal protein S16; Reviewed
GCF_000015385.1_ASM1538v1	NC_010080	Lactobacillus helveticus DPC 4571, complete sequence	3	1448503-1448593	2	CRISPRCasFinder	no		Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	Orphan	TAGGATCACCTCCACATACGTGGAGAATAC	30	0	0	NA	NA	I-B,III-A,III-B	1	1	Orphan	Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	NA|99aa|up_3|NC_010080.1_1443921_1444218_-,NA|136aa|down_0|NC_010080.1_1449933_1450341_+,NA|205aa|down_4|NC_010080.1_1455319_1455934_-,NA|169aa|down_8|NC_010080.1_1464832_1465339_+	NA|308aa|up_9|NC_010080.1_1434914_1435838_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|235aa|up_8|NC_010080.1_1436092_1436797_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|208aa|up_7|NC_010080.1_1436798_1437422_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|93aa|up_6|NC_010080.1_1437646_1437925_-	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|879aa|up_5|NC_010080.1_1439858_1442495_+	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|412aa|up_4|NC_010080.1_1442551_1443787_-	cd17355, MFS_YcxA_like, MFS-type transporter YcxA and similar proteins of the Major Facilitator Superfamily of transporters	NA|99aa|up_3|NC_010080.1_1443921_1444218_-	NA	NA|221aa|up_2|NC_010080.1_1444750_1445413_+	TIGR00719, Probable_L-serine_dehydratase_beta_chain, L-serine dehydratase, iron-sulfur-dependent, beta subunit	NA|295aa|up_1|NC_010080.1_1445427_1446312_+	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|109aa|up_0|NC_010080.1_1446313_1446640_+	COG2151, PaaD, Predicted metal-sulfur cluster biosynthetic enzyme [General function prediction only]	NA|136aa|down_0|NC_010080.1_1449933_1450341_+	NA	NA|207aa|down_1|NC_010080.1_1450280_1450901_+	COG4990, COG4990, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_2|NC_010080.1_1452568_1453753_-	pfam03217, SLAP, SLAP domain	NA|415aa|down_3|NC_010080.1_1453992_1455237_-	pfam13020, DUF3883, Domain of unknown function (DUF3883)	NA|205aa|down_4|NC_010080.1_1455319_1455934_-	NA	NA|485aa|down_5|NC_010080.1_1458381_1459836_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|1071aa|down_6|NC_010080.1_1459856_1463069_-	PRK11448, hsdR, type I restriction enzyme EcoKI subunit R; Provisional	NA|484aa|down_7|NC_010080.1_1463200_1464652_-	pfam13240, zinc_ribbon_2, zinc-ribbon domain	NA|169aa|down_8|NC_010080.1_1464832_1465339_+	NA	NA|426aa|down_9|NC_010080.1_1466614_1467892_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]
GCF_000015385.1_ASM1538v1	NC_010080	Lactobacillus helveticus DPC 4571, complete sequence	4	1587513-1589020	1,2,3	CRT,PILER-CR,CRISPRCasFinder	no	cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,cas3	Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	Type I-C,Type I-U, Type I-U?	ATTTCAATCCACGCACTCACAAGGAGTGCGAC,ATTTCAATCCACGCACTCACAAGGAGTGCGAC,ATTTCAATCCACGCACTCACAAGGAGTGCGAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	22,21,21	22	TypeV,TypeI-C,TypeI-U,TypeI-U?	Cas14u_CAS-V,cas14j,cas3,DEDDh,DinG,cas14k,cas2,cas1,cas4,cas7,cas8c,cas5,csa3	NA|177aa|up_7|NC_010080.1_1576481_1577012_-,NA	cas14k|442aa|up_9|NC_010080.1_1574327_1575653_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|203aa|up_8|NC_010080.1_1575682_1576291_-	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain	NA|177aa|up_7|NC_010080.1_1576481_1577012_-	NA	NA|645aa|up_6|NC_010080.1_1577843_1579778_-	PRK00413, thrS, threonyl-tRNA synthetase; Reviewed	NA|303aa|up_5|NC_010080.1_1580068_1580977_-	PRK08939, PRK08939, primosomal protein DnaI; Reviewed	NA|444aa|up_4|NC_010080.1_1581008_1582340_-	COG3611, DnaB, Replication initiation/membrane attachment protein [DNA replication, recombination, and repair]	NA|156aa|up_3|NC_010080.1_1582342_1582810_-	PRK00464, nrdR, transcriptional repressor NrdR	NA|201aa|up_2|NC_010080.1_1582812_1583415_-	PRK00081, coaE, dephospho-CoA kinase; Reviewed	NA|277aa|up_1|NC_010080.1_1583411_1584242_-	PRK01103, PRK01103, bifunctional DNA-formamidopyrimidine glycosylase/DNA-(apurinic or apyrimidinic site) lyase	NA|888aa|up_0|NC_010080.1_1584250_1586914_-	PRK05755, PRK05755, DNA polymerase I; Provisional	cas2|97aa|down_0|NC_010080.1_1589181_1589472_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NC_010080.1_1589481_1590513_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|219aa|down_2|NC_010080.1_1590509_1591166_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|284aa|down_3|NC_010080.1_1591168_1592020_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8c|658aa|down_4|NC_010080.1_1592022_1593996_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|248aa|down_5|NC_010080.1_1593995_1594739_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|845aa|down_6|NC_010080.1_1594753_1597288_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|294aa|down_7|NC_010080.1_1600762_1601644_-	pfam01145, Band_7, SPFH domain / Band 7 family	NA|105aa|down_8|NC_010080.1_1601661_1601976_-	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|326aa|down_9|NC_010080.1_1602382_1603360_-	pfam17312, Helveticin_J, Bacteriocin helveticin-J
