assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000698885.1_ASM69888v1	NZ_CP006819	Streptococcus thermophilus ASCC 1275 chromosome, complete genome	1	823246-825392	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas9,cas1,cas2,csn2	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	Type II-A,Type II-B,Type II-C	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACA,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,36,34,36	0	0	NA	NA	NA:NA:NA:NA:NA	30,32,32,30,30	32	TypeII-A,TypeII-B,TypeII-C	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	NA|189aa|up_9|NZ_CP006819.1_808391_808958_+,NA|121aa|up_5|NZ_CP006819.1_814918_815281_+,NA	NA|189aa|up_9|NZ_CP006819.1_808391_808958_+	NA	NA|76aa|up_8|NZ_CP006819.1_809801_810029_-	TIGR01716, HTH-type_transcriptional_regulator_rgg, transcriptional activator, Rgg/GadR/MutR family, C-terminal domain	NA|131aa|up_7|NZ_CP006819.1_813772_814165_+	pfam08349, DUF1722, Protein of unknown function (DUF1722)	NA|121aa|up_6|NZ_CP006819.1_814161_814524_+	TIGR02328, TIGR02328, conserved hypothetical protein	NA|121aa|up_5|NZ_CP006819.1_814918_815281_+	NA	NA|204aa|up_4|NZ_CP006819.1_816487_817099_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas9|1122aa|up_3|NZ_CP006819.1_817354_820720_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|304aa|up_2|NZ_CP006819.1_820896_821808_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_1|NZ_CP006819.1_821809_822133_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NZ_CP006819.1_822129_823182_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NZ_CP006819.1_825433_826249_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NZ_CP006819.1_826248_827151_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NZ_CP006819.1_827514_829647_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NZ_CP006819.1_829633_830074_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NZ_CP006819.1_830139_830403_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NZ_CP006819.1_830532_831462_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NZ_CP006819.1_831461_832253_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NZ_CP006819.1_832375_832774_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NZ_CP006819.1_832786_833227_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NZ_CP006819.1_833247_833544_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_000698885.1_ASM69888v1	NZ_CP006819	Streptococcus thermophilus ASCC 1275 chromosome, complete genome	2	1074866-1075124	4,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	Type III-C,Type III-A,Type III-B,Type III-D	GATATAAACCTAATTACCTCGAGAGGGGACGGAAACCC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC	38,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	2,3,3	3	TypeIII-C,TypeIII-A,TypeIII-B,TypeIII-D	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	NA,NA|77aa|down_9|NZ_CP006819.1_1086931_1087162_+	NA|365aa|up_9|NZ_CP006819.1_1059141_1060236_+	COG0276, HemH, Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism]	NA|346aa|up_8|NZ_CP006819.1_1060332_1061370_+	cd08287, FDH_like_ADH3, formaldehyde dehydrogenase (FDH)-like	NA|255aa|up_7|NZ_CP006819.1_1063755_1064520_+	cd09007, NP-I_spr0068, uncharacterized subfamily of the nucleoside phosphorylase-I family	NA|212aa|up_6|NZ_CP006819.1_1064581_1065217_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|553aa|up_5|NZ_CP006819.1_1068143_1069802_-	pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA)	NA|183aa|up_4|NZ_CP006819.1_1070734_1071283_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|268aa|up_3|NZ_CP006819.1_1071523_1072327_+	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|316aa|up_2|NZ_CP006819.1_1072345_1073293_+	PRK07259, PRK07259, dihydroorotate dehydrogenase	cas1|335aa|up_1|NZ_CP006819.1_1073431_1074436_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NZ_CP006819.1_1074435_1074765_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|244aa|down_0|NZ_CP006819.1_1075253_1075985_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas10|759aa|down_1|NZ_CP006819.1_1075965_1078242_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|127aa|down_2|NZ_CP006819.1_1078245_1078626_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|221aa|down_3|NZ_CP006819.1_1078625_1079288_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|300aa|down_4|NZ_CP006819.1_1079289_1080189_+	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm5gr7|358aa|down_5|NZ_CP006819.1_1080191_1081265_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|232aa|down_6|NZ_CP006819.1_1082260_1082956_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|210aa|down_7|NZ_CP006819.1_1083045_1083675_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|484aa|down_8|NZ_CP006819.1_1084372_1085824_-	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]	NA|77aa|down_9|NZ_CP006819.1_1086931_1087162_+	NA
GCF_000698885.1_ASM69888v1	NZ_CP006819	Streptococcus thermophilus ASCC 1275 chromosome, complete genome	3	1144778-1145540	3,3,5	CRISPRCasFinder,CRT,PILER-CR	no	DEDDh,cas1,cas6e,cas5,cas7,cse2gr11,cas3	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	Type I-E	GGATCACCCCCGCGTGTGCGGGAAAAAC,GGATCACCCCCGCGTGTGCGGGAAAAAC,GGATCACCCCCGCGTGTGCGGGAAAAAC	28,28,28	0	0	NA	NA	I-C,I-E,II-B:I-C,I-E,II-B:I-C,I-E,II-B	12,12,12	12	TypeI-E	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	NA|33aa|up_9|NZ_CP006819.1_1131452_1131551_-,NA	NA|33aa|up_9|NZ_CP006819.1_1131452_1131551_-	NA	NA|90aa|up_8|NZ_CP006819.1_1131587_1131857_-	pfam18033, SpuA_C, SpuA C-terminal	NA|171aa|up_7|NZ_CP006819.1_1131962_1132475_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|117aa|up_6|NZ_CP006819.1_1132538_1132889_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|157aa|up_5|NZ_CP006819.1_1132860_1133331_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|270aa|up_4|NZ_CP006819.1_1133287_1134097_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|585aa|up_3|NZ_CP006819.1_1136327_1138082_-	TIGR01350, Dihydrolipoyl_dehydrogenase, dihydrolipoamide dehydrogenase	NA|463aa|up_2|NZ_CP006819.1_1138252_1139641_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|333aa|up_1|NZ_CP006819.1_1139798_1140797_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|324aa|up_0|NZ_CP006819.1_1140821_1141793_-	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]	DEDDh|302aa|down_0|NZ_CP006819.1_1145560_1146466_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	cas1|314aa|down_1|NZ_CP006819.1_1146467_1147409_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|213aa|down_2|NZ_CP006819.1_1147412_1148051_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|242aa|down_3|NZ_CP006819.1_1148055_1148781_-	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|356aa|down_4|NZ_CP006819.1_1148794_1149862_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|198aa|down_5|NZ_CP006819.1_1149851_1150445_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas3|927aa|down_6|NZ_CP006819.1_1152107_1154888_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|648aa|down_7|NZ_CP006819.1_1155184_1157128_-	pfam09972, DUF2207, Predicted membrane protein (DUF2207)	NA|423aa|down_8|NZ_CP006819.1_1157202_1158471_-	PRK09357, pyrC, dihydroorotase; Validated	NA|218aa|down_9|NZ_CP006819.1_1158490_1159144_-	PRK05254, PRK05254, uracil-DNA glycosylase; Provisional
GCF_000698885.1_ASM69888v1	NZ_CP006819	Streptococcus thermophilus ASCC 1275 chromosome, complete genome	4	1558841-1559668	6,4,4	PILER-CR,CRISPRCasFinder,CRT	no	csn2,cas2,cas1,cas9	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	Type II-A,Type II-B,Type II-C	GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	12,12,12	12	TypeII-A,TypeII-B,TypeII-C	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	NA|66aa|up_9|NZ_CP006819.1_1552551_1552749_-,NA|148aa|up_4|NZ_CP006819.1_1555633_1556077_-,NA|80aa|up_0|NZ_CP006819.1_1558517_1558757_-,NA	NA|66aa|up_9|NZ_CP006819.1_1552551_1552749_-	NA	NA|438aa|up_8|NZ_CP006819.1_1552915_1554229_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_7|NZ_CP006819.1_1554322_1554451_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|244aa|up_6|NZ_CP006819.1_1554532_1555264_-	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|78aa|up_5|NZ_CP006819.1_1555264_1555498_-	COG3708, COG3708, Uncharacterized protein conserved in bacteria [Function unknown]	NA|148aa|up_4|NZ_CP006819.1_1555633_1556077_-	NA	NA|128aa|up_3|NZ_CP006819.1_1556107_1556491_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|81aa|up_2|NZ_CP006819.1_1557179_1557422_+	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|297aa|up_1|NZ_CP006819.1_1557589_1558480_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP006819.1_1558517_1558757_-	NA	csn2|220aa|down_0|NZ_CP006819.1_1559989_1560649_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP006819.1_1560638_1560983_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP006819.1_1560979_1561849_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1389aa|down_3|NZ_CP006819.1_1561848_1566015_-	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	NA|216aa|down_4|NZ_CP006819.1_1566346_1566994_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_5|NZ_CP006819.1_1567105_1568830_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_6|NZ_CP006819.1_1568926_1570879_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|187aa|down_7|NZ_CP006819.1_1570879_1571440_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_8|NZ_CP006819.1_1571525_1572221_-	COG1346, LrgB, Putative effector of murein hydrolase [Cell envelope biogenesis, outer membrane]	NA|125aa|down_9|NZ_CP006819.1_1572213_1572588_-	pfam03788, LrgA, LrgA family
GCF_000698885.1_ASM69888v1	NZ_CP006819	Streptococcus thermophilus ASCC 1275 chromosome, complete genome	5	1775091-1775182	5	CRISPRCasFinder	no	cas3	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	Unclear	CTTTTCTTTACGATATCAATTTTAACATCTTT	32	0	0	NA	NA	NA	1	1	Unclear	csa3,cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas6e,cas5,cas7,cse2gr11	NA|87aa|up_3|NZ_CP006819.1_1770706_1770967_-,NA|143aa|down_7|NZ_CP006819.1_1781084_1781513_-	cas3|673aa|up_9|NZ_CP006819.1_1762156_1764175_-	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|368aa|up_8|NZ_CP006819.1_1764303_1765407_-	PRK00053, alr, alanine racemase; Reviewed	NA|120aa|up_7|NZ_CP006819.1_1765427_1765787_-	PRK00070, acpS, 4'-phosphopantetheinyl transferase; Provisional	NA|344aa|up_6|NZ_CP006819.1_1765790_1766822_-	PRK09261, PRK09261, phospho-2-dehydro-3-deoxyheptonate aldolase; Validated	NA|344aa|up_5|NZ_CP006819.1_1766919_1767951_-	COG0722, AroG, 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]	NA|850aa|up_4|NZ_CP006819.1_1767974_1770524_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|87aa|up_3|NZ_CP006819.1_1770706_1770967_-	NA	NA|316aa|up_2|NZ_CP006819.1_1771053_1772001_-	COG1482, ManA, Phosphomannose isomerase [Carbohydrate transport and metabolism]	NA|298aa|up_1|NZ_CP006819.1_1772066_1772960_-	pfam00480, ROK, ROK family	NA|634aa|up_0|NZ_CP006819.1_1773144_1775046_-	TIGR01996, Includes:_Phosphotransferase_enzyme_IIB_component, PTS system, sucrose-specific IIBC component	NA|481aa|down_0|NZ_CP006819.1_1775294_1776737_+	TIGR01322, Sucrose-6-phosphate_hydrolase, sucrose-6-phosphate hydrolase	NA|322aa|down_1|NZ_CP006819.1_1776745_1777711_+	cd06291, PBP1_Qymf-like, ligand binding domain of the lacI-like transcription regulator from a novel metal-reducing bacterium Alkaliphilus Metalliredigens (strain Qymf) and its close homologs	NA|145aa|down_2|NZ_CP006819.1_1777800_1778235_-	PRK00202, nusB, transcription antitermination factor NusB	NA|130aa|down_3|NZ_CP006819.1_1778227_1778617_-	pfam03780, Asp23, Asp23 family, cell envelope-related function	NA|187aa|down_4|NZ_CP006819.1_1778687_1779248_-	PRK00529, PRK00529, elongation factor P; Validated	NA|152aa|down_5|NZ_CP006819.1_1779360_1779816_-	TIGR02571, putative_dCMP_deaminase, ComE operon protein 2	NA|354aa|down_6|NZ_CP006819.1_1779834_1780896_-	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|143aa|down_7|NZ_CP006819.1_1781084_1781513_-	NA	NA|198aa|down_8|NZ_CP006819.1_1781557_1782151_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|130aa|down_9|NZ_CP006819.1_1782119_1782509_-	cd03268, ABC_BcrA_bacitracin_resist, ATP-binding cassette domain of the bacitracin-resistance transporter
