assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000971665.1_ASM97166v1	NZ_CP011217	Streptococcus thermophilus strain SMQ-301 chromosome, complete genome	1	649756-650847	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	Type II-B,Type II-A,Type II-C	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,36	0	0	NA	NA	NA:NA:NA	16,16,16	16	TypeII-B,TypeII-A,TypeII-C	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	NA,NA	NA|355aa|up_9|NZ_CP011217.1_637288_638353_-	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|183aa|up_8|NZ_CP011217.1_638392_638941_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|454aa|up_7|NZ_CP011217.1_639156_640518_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|74aa|up_6|NZ_CP011217.1_641724_641946_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|153aa|up_5|NZ_CP011217.1_641945_642404_+	COG3392, COG3392, Adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|204aa|up_4|NZ_CP011217.1_642998_643610_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas9|1122aa|up_3|NZ_CP011217.1_643865_647231_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|304aa|up_2|NZ_CP011217.1_647407_648319_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_1|NZ_CP011217.1_648320_648644_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NZ_CP011217.1_648640_649693_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NZ_CP011217.1_650888_651704_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NZ_CP011217.1_651703_652606_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NZ_CP011217.1_652969_655102_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NZ_CP011217.1_655088_655529_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NZ_CP011217.1_655594_655858_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NZ_CP011217.1_655987_656917_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NZ_CP011217.1_656916_657708_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NZ_CP011217.1_657830_658229_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NZ_CP011217.1_658241_658682_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NZ_CP011217.1_658702_658999_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_000971665.1_ASM97166v1	NZ_CP011217	Streptococcus thermophilus strain SMQ-301 chromosome, complete genome	2	781323-781470	2	CRISPRCasFinder	no		cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	Orphan	CTCAGAAAGTCCCGTAGCCGAGGAGTTGGTTGACACTTCTGTGGAGGCTACC	52	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	NA|100aa|up_4|NZ_CP011217.1_772742_773042_+,NA|108aa|up_3|NZ_CP011217.1_773119_773443_+,NA	NA|573aa|up_9|NZ_CP011217.1_766405_768124_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|191aa|up_8|NZ_CP011217.1_768206_768779_-	COG4684, COG4684, Predicted membrane protein [Function unknown]	NA|182aa|up_7|NZ_CP011217.1_768806_769352_-	PRK07313, PRK07313, phosphopantothenoylcysteine decarboxylase; Validated	NA|228aa|up_6|NZ_CP011217.1_769344_770028_-	PRK06732, PRK06732, phosphopantothenate--cysteine ligase; Validated	NA|557aa|up_5|NZ_CP011217.1_770221_771892_+	PRK13505, PRK13505, formate--tetrahydrofolate ligase; Provisional	NA|100aa|up_4|NZ_CP011217.1_772742_773042_+	NA	NA|108aa|up_3|NZ_CP011217.1_773119_773443_+	NA	NA|524aa|up_2|NZ_CP011217.1_774264_775835_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|357aa|up_1|NZ_CP011217.1_777595_778666_+	cd13663, PBP2_PotD_PotF_like_2, The periplasmic substrate-binding component of an uncharacterized active transport system closely related to spermidine and putrescine transporters; contains the type 2 periplasmic binding fold	NA|513aa|up_0|NZ_CP011217.1_778783_780322_+	cd01031, EriC, ClC chloride channel EriC	NA|524aa|down_0|NZ_CP011217.1_786127_787698_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|435aa|down_1|NZ_CP011217.1_787822_789127_+	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|79aa|down_2|NZ_CP011217.1_789226_789463_-	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|307aa|down_3|NZ_CP011217.1_789531_790452_-	PRK05439, PRK05439, pantothenate kinase; Provisional	NA|197aa|down_4|NZ_CP011217.1_790774_791365_+	COG2813, RsmC, 16S RNA G1207 methylase RsmC [Translation, ribosomal structure and biogenesis]	NA|133aa|down_5|NZ_CP011217.1_793316_793715_+	PRK05578, PRK05578, cytidine deaminase; Validated	NA|357aa|down_6|NZ_CP011217.1_793777_794848_+	COG1744, Med, Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein [General function prediction only]	NA|513aa|down_7|NZ_CP011217.1_794961_796500_+	COG3845, COG3845, ABC-type uncharacterized transport systems, ATPase components [General function prediction only]	NA|356aa|down_8|NZ_CP011217.1_796492_797560_+	COG4603, COG4603, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|319aa|down_9|NZ_CP011217.1_797561_798518_+	COG1079, COG1079, Uncharacterized ABC-type transport system, permease component [General function prediction only]
GCF_000971665.1_ASM97166v1	NZ_CP011217	Streptococcus thermophilus strain SMQ-301 chromosome, complete genome	3	897843-898101	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	Type III-A	GATATAAACCTAATTACCTCGAGAGGGGACGGAAACCC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC	38,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	2,3,3	3	TypeIII-A	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	NA|73aa|up_8|NZ_CP011217.1_892356_892575_+,NA|72aa|up_4|NZ_CP011217.1_894316_894532_-,NA|77aa|down_8|NZ_CP011217.1_909906_910137_+	NA|64aa|up_9|NZ_CP011217.1_891728_891920_+	COG4405, COG4405, Uncharacterized protein conserved in bacteria [Function unknown]	NA|73aa|up_8|NZ_CP011217.1_892356_892575_+	NA	NA|223aa|up_7|NZ_CP011217.1_892776_893445_+	pfam05857, TraX, TraX protein	NA|70aa|up_6|NZ_CP011217.1_893460_893670_+	pfam05857, TraX, TraX protein	NA|183aa|up_5|NZ_CP011217.1_893711_894260_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|72aa|up_4|NZ_CP011217.1_894316_894532_-	NA	NA|268aa|up_3|NZ_CP011217.1_894500_895304_+	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|316aa|up_2|NZ_CP011217.1_895322_896270_+	PRK07259, PRK07259, dihydroorotate dehydrogenase	cas1|335aa|up_1|NZ_CP011217.1_896408_897413_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NZ_CP011217.1_897412_897742_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|244aa|down_0|NZ_CP011217.1_898230_898962_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csm2gr11|127aa|down_1|NZ_CP011217.1_901215_901596_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|221aa|down_2|NZ_CP011217.1_901595_902258_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|300aa|down_3|NZ_CP011217.1_902259_903159_+	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm5gr7|358aa|down_4|NZ_CP011217.1_903161_904235_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|232aa|down_5|NZ_CP011217.1_905230_905926_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|210aa|down_6|NZ_CP011217.1_906014_906644_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|484aa|down_7|NZ_CP011217.1_907346_908798_-	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]	NA|77aa|down_8|NZ_CP011217.1_909906_910137_+	NA	NA|218aa|down_9|NZ_CP011217.1_910482_911136_+	pfam02872, 5_nucleotid_C, 5'-nucleotidase, C-terminal domain
GCF_000971665.1_ASM97166v1	NZ_CP011217	Streptococcus thermophilus strain SMQ-301 chromosome, complete genome	4	1383838-1384867	3,4,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	Type II-B,Type II-A,Type II-C	TGTTTTGGAACCATTCGAAACAACACAGCTCTAAAACTT,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	39,36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B:II-A,II-B	13,15,15,13	15	TypeII-B,TypeII-A,TypeII-C	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	NA|66aa|up_7|NZ_CP011217.1_1377550_1377748_-,NA|148aa|up_4|NZ_CP011217.1_1380633_1381077_-,NA|80aa|up_0|NZ_CP011217.1_1383514_1383754_-,NA	NA|737aa|up_9|NZ_CP011217.1_1374527_1376738_+	cd13619, PBP2_GlnP, Glutamine-binding domain of ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|247aa|up_8|NZ_CP011217.1_1376737_1377478_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|66aa|up_7|NZ_CP011217.1_1377550_1377748_-	NA	NA|438aa|up_6|NZ_CP011217.1_1377914_1379228_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_5|NZ_CP011217.1_1379321_1379450_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|148aa|up_4|NZ_CP011217.1_1380633_1381077_-	NA	NA|128aa|up_3|NZ_CP011217.1_1381108_1381492_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|100aa|up_2|NZ_CP011217.1_1381585_1381885_-	pfam02566, OsmC, OsmC-like protein	NA|297aa|up_1|NZ_CP011217.1_1382586_1383477_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP011217.1_1383514_1383754_-	NA	csn2|220aa|down_0|NZ_CP011217.1_1385188_1385848_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP011217.1_1385837_1386182_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP011217.1_1386178_1387048_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1389aa|down_3|NZ_CP011217.1_1387047_1391214_-	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	NA|216aa|down_4|NZ_CP011217.1_1391546_1392194_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_5|NZ_CP011217.1_1392305_1394030_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_6|NZ_CP011217.1_1394126_1396079_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|202aa|down_7|NZ_CP011217.1_1396079_1396685_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_8|NZ_CP011217.1_1396725_1397421_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_9|NZ_CP011217.1_1397413_1397788_-	pfam03788, LrgA, LrgA family
GCF_000971665.1_ASM97166v1	NZ_CP011217	Streptococcus thermophilus strain SMQ-301 chromosome, complete genome	5	1609132-1609223	5	CRISPRCasFinder	no	cas3	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	Unclear	CTTTTCTTTACGATATCAATTTTAACATCTTT	32	0	0	NA	NA	NA	1	1	Unclear	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,csm2gr11,csm3gr7,csm4gr5,csm5gr7	NA|86aa|up_3|NZ_CP011217.1_1604751_1605009_-,NA|143aa|down_7|NZ_CP011217.1_1615126_1615555_-	cas3|673aa|up_9|NZ_CP011217.1_1596201_1598220_-	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|368aa|up_8|NZ_CP011217.1_1598348_1599452_-	PRK00053, alr, alanine racemase; Reviewed	NA|120aa|up_7|NZ_CP011217.1_1599472_1599832_-	PRK00070, acpS, 4'-phosphopantetheinyl transferase; Provisional	NA|344aa|up_6|NZ_CP011217.1_1599835_1600867_-	PRK09261, PRK09261, phospho-2-dehydro-3-deoxyheptonate aldolase; Validated	NA|344aa|up_5|NZ_CP011217.1_1600964_1601996_-	COG0722, AroG, 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]	NA|850aa|up_4|NZ_CP011217.1_1602019_1604569_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|86aa|up_3|NZ_CP011217.1_1604751_1605009_-	NA	NA|316aa|up_2|NZ_CP011217.1_1605095_1606043_-	COG1482, ManA, Phosphomannose isomerase [Carbohydrate transport and metabolism]	NA|298aa|up_1|NZ_CP011217.1_1606108_1607002_-	pfam00480, ROK, ROK family	NA|634aa|up_0|NZ_CP011217.1_1607185_1609087_-	TIGR01996, Includes:_Phosphotransferase_enzyme_IIB_component, PTS system, sucrose-specific IIBC component	NA|481aa|down_0|NZ_CP011217.1_1609336_1610779_+	TIGR01322, Sucrose-6-phosphate_hydrolase, sucrose-6-phosphate hydrolase	NA|322aa|down_1|NZ_CP011217.1_1610787_1611753_+	cd06291, PBP1_Qymf-like, ligand binding domain of the lacI-like transcription regulator from a novel metal-reducing bacterium Alkaliphilus Metalliredigens (strain Qymf) and its close homologs	NA|145aa|down_2|NZ_CP011217.1_1611842_1612277_-	PRK00202, nusB, transcription antitermination factor NusB	NA|130aa|down_3|NZ_CP011217.1_1612269_1612659_-	pfam03780, Asp23, Asp23 family, cell envelope-related function	NA|187aa|down_4|NZ_CP011217.1_1612729_1613290_-	PRK00529, PRK00529, elongation factor P; Validated	NA|152aa|down_5|NZ_CP011217.1_1613402_1613858_-	TIGR02571, putative_dCMP_deaminase, ComE operon protein 2	NA|354aa|down_6|NZ_CP011217.1_1613876_1614938_-	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|143aa|down_7|NZ_CP011217.1_1615126_1615555_-	NA	NA|198aa|down_8|NZ_CP011217.1_1615599_1616193_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|130aa|down_9|NZ_CP011217.1_1616161_1616551_-	cd03268, ABC_BcrA_bacitracin_resist, ATP-binding cassette domain of the bacitracin-resistance transporter
