assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001280285.1_ASM128028v1	NZ_CP012588	Streptococcus thermophilus strain MN-BM-A01, complete genome	1	443838-445850	1,1,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas9,cas1,cas2	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	 or Type II-C?,Type II-A,Type II-B,Type II-C, Type II-B	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACA	36,36,36,34	0	0	NA	NA	NA:NA:NA:NA	29,30,30,29	30	orTypeII-C?,TypeII-A,TypeII-B,TypeII-C,TypeII-B	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	NA|86aa|up_4|NZ_CP012588.1_436653_436911_-,NA	NA|183aa|up_9|NZ_CP012588.1_432474_433023_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|454aa|up_8|NZ_CP012588.1_433238_434600_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|277aa|up_7|NZ_CP012588.1_434809_435640_+	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|74aa|up_6|NZ_CP012588.1_435807_436029_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|153aa|up_5|NZ_CP012588.1_436028_436487_+	COG3392, COG3392, Adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|86aa|up_4|NZ_CP012588.1_436653_436911_-	NA	NA|205aa|up_3|NZ_CP012588.1_437080_437695_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas9|1122aa|up_2|NZ_CP012588.1_437947_441313_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|304aa|up_1|NZ_CP012588.1_441489_442401_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_0|NZ_CP012588.1_442402_442726_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|272aa|down_0|NZ_CP012588.1_445891_446707_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NZ_CP012588.1_446706_447609_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NZ_CP012588.1_447972_450105_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NZ_CP012588.1_450091_450532_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NZ_CP012588.1_450597_450861_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NZ_CP012588.1_450990_451920_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NZ_CP012588.1_451919_452711_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NZ_CP012588.1_452833_453232_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NZ_CP012588.1_453244_453685_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NZ_CP012588.1_453705_454002_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_001280285.1_ASM128028v1	NZ_CP012588	Streptococcus thermophilus strain MN-BM-A01, complete genome	2	719325-719426	2	CRISPRCasFinder	no	csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,cas2,cas1	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	Type III-C,Type III-D,Type III-A,Type III-B	GTCCCCTCTCGAGGTAATTAGGTTTATATC	30	0	0	NA	NA	NA	1	1	TypeIII-C,TypeIII-D,TypeIII-A,TypeIII-B	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	NA,NA|72aa|down_4|NZ_CP012588.1_722736_722952_+,NA|73aa|down_8|NZ_CP012588.1_724693_724912_-	NA|484aa|up_9|NZ_CP012588.1_708086_709538_+	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]	NA|210aa|up_8|NZ_CP012588.1_710234_710864_-	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|232aa|up_7|NZ_CP012588.1_710952_711648_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	csm6|392aa|up_6|NZ_CP012588.1_711847_713023_-	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm5gr7|358aa|up_5|NZ_CP012588.1_713176_714250_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|300aa|up_4|NZ_CP012588.1_714252_715152_-	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm3gr7|221aa|up_3|NZ_CP012588.1_715153_715816_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|127aa|up_2|NZ_CP012588.1_715815_716196_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|759aa|up_1|NZ_CP012588.1_716199_718476_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|244aa|up_0|NZ_CP012588.1_718456_719188_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas2|110aa|down_0|NZ_CP012588.1_719526_719856_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP012588.1_719855_720860_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	NA|316aa|down_2|NZ_CP012588.1_720998_721946_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|268aa|down_3|NZ_CP012588.1_721964_722768_-	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|72aa|down_4|NZ_CP012588.1_722736_722952_+	NA	NA|183aa|down_5|NZ_CP012588.1_723008_723557_-	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|75aa|down_6|NZ_CP012588.1_723598_723823_-	pfam05857, TraX, TraX protein	NA|223aa|down_7|NZ_CP012588.1_723823_724492_-	pfam05857, TraX, TraX protein	NA|73aa|down_8|NZ_CP012588.1_724693_724912_-	NA	NA|64aa|down_9|NZ_CP012588.1_725348_725540_-	COG4405, COG4405, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_001280285.1_ASM128028v1	NZ_CP012588	Streptococcus thermophilus strain MN-BM-A01, complete genome	3	1186347-1188099	3,2,3,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	csn2,cas2,cas1	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	Type II-A	GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAACTT,TGTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	36,36,36,38,37	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B:II-A,II-B:II-A,II-B	26,26,20,20,20	26	TypeII-A	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	NA|66aa|up_7|NZ_CP012588.1_1180057_1180255_-,NA|148aa|up_4|NZ_CP012588.1_1183139_1183583_-,NA|80aa|up_0|NZ_CP012588.1_1186023_1186263_-,NA	NA|737aa|up_9|NZ_CP012588.1_1177034_1179245_+	cd13619, PBP2_GlnP, Glutamine-binding domain of ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|247aa|up_8|NZ_CP012588.1_1179244_1179985_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|66aa|up_7|NZ_CP012588.1_1180057_1180255_-	NA	NA|438aa|up_6|NZ_CP012588.1_1180421_1181735_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_5|NZ_CP012588.1_1181828_1181957_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|148aa|up_4|NZ_CP012588.1_1183139_1183583_-	NA	NA|128aa|up_3|NZ_CP012588.1_1183613_1183997_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|81aa|up_2|NZ_CP012588.1_1184685_1184928_+	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|297aa|up_1|NZ_CP012588.1_1185095_1185986_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP012588.1_1186023_1186263_-	NA	csn2|220aa|down_0|NZ_CP012588.1_1188420_1189080_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP012588.1_1189069_1189414_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP012588.1_1189410_1190280_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	NA|216aa|down_3|NZ_CP012588.1_1194776_1195424_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_4|NZ_CP012588.1_1195535_1197260_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_5|NZ_CP012588.1_1197356_1199309_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|202aa|down_6|NZ_CP012588.1_1199309_1199915_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_7|NZ_CP012588.1_1199955_1200651_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_8|NZ_CP012588.1_1200643_1201018_-	pfam03788, LrgA, LrgA family	NA|118aa|down_9|NZ_CP012588.1_1201297_1201651_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC
GCF_001280285.1_ASM128028v1	NZ_CP012588	Streptococcus thermophilus strain MN-BM-A01, complete genome	4	1403271-1403362	4	CRISPRCasFinder	no	cas3	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	Unclear	CTTTTCTTTACGATATCAATTTTAACATCTTT	32	0	0	NA	NA	NA	1	1	Unclear	cas3,cas9,cas1,cas2,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,csn2,DEDDh	NA|86aa|up_3|NZ_CP012588.1_1398889_1399147_-,NA|143aa|down_7|NZ_CP012588.1_1409264_1409693_-	cas3|673aa|up_9|NZ_CP012588.1_1390339_1392358_-	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|368aa|up_8|NZ_CP012588.1_1392486_1393590_-	PRK00053, alr, alanine racemase; Reviewed	NA|120aa|up_7|NZ_CP012588.1_1393610_1393970_-	PRK00070, acpS, 4'-phosphopantetheinyl transferase; Provisional	NA|344aa|up_6|NZ_CP012588.1_1393973_1395005_-	PRK09261, PRK09261, phospho-2-dehydro-3-deoxyheptonate aldolase; Validated	NA|344aa|up_5|NZ_CP012588.1_1395102_1396134_-	COG0722, AroG, 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]	NA|850aa|up_4|NZ_CP012588.1_1396157_1398707_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|86aa|up_3|NZ_CP012588.1_1398889_1399147_-	NA	NA|316aa|up_2|NZ_CP012588.1_1399233_1400181_-	COG1482, ManA, Phosphomannose isomerase [Carbohydrate transport and metabolism]	NA|298aa|up_1|NZ_CP012588.1_1400246_1401140_-	pfam00480, ROK, ROK family	NA|634aa|up_0|NZ_CP012588.1_1401324_1403226_-	TIGR01996, Includes:_Phosphotransferase_enzyme_IIB_component, PTS system, sucrose-specific IIBC component	NA|481aa|down_0|NZ_CP012588.1_1403474_1404917_+	TIGR01322, Sucrose-6-phosphate_hydrolase, sucrose-6-phosphate hydrolase	NA|322aa|down_1|NZ_CP012588.1_1404925_1405891_+	cd06291, PBP1_Qymf-like, ligand binding domain of the lacI-like transcription regulator from a novel metal-reducing bacterium Alkaliphilus Metalliredigens (strain Qymf) and its close homologs	NA|145aa|down_2|NZ_CP012588.1_1405980_1406415_-	PRK00202, nusB, transcription antitermination factor NusB	NA|130aa|down_3|NZ_CP012588.1_1406407_1406797_-	pfam03780, Asp23, Asp23 family, cell envelope-related function	NA|187aa|down_4|NZ_CP012588.1_1406867_1407428_-	PRK00529, PRK00529, elongation factor P; Validated	NA|152aa|down_5|NZ_CP012588.1_1407540_1407996_-	TIGR02571, putative_dCMP_deaminase, ComE operon protein 2	NA|354aa|down_6|NZ_CP012588.1_1408014_1409076_-	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|143aa|down_7|NZ_CP012588.1_1409264_1409693_-	NA	NA|198aa|down_8|NZ_CP012588.1_1409737_1410331_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|130aa|down_9|NZ_CP012588.1_1410299_1410689_-	cd03268, ABC_BcrA_bacitracin_resist, ATP-binding cassette domain of the bacitracin-resistance transporter
