assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001855705.1_ASM185570v1	NZ_CP016394	Streptococcus thermophilus strain ND07, complete genome	1	177242-179388	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type II-C,Type II-A,Type II-B	GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	32,32,28	32	TypeII-C,TypeII-A,TypeII-B	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|121aa|down_5|NZ_CP016394.1_187352_187715_-,NA|189aa|down_9|NZ_CP016394.1_193675_194242_-	NA|99aa|up_9|NZ_CP016394.1_169089_169386_+	pfam11674, DUF3270, Protein of unknown function (DUF3270)	NA|147aa|up_8|NZ_CP016394.1_169406_169847_-	pfam12732, YtxH, YtxH-like protein	NA|133aa|up_7|NZ_CP016394.1_169859_170258_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|264aa|up_6|NZ_CP016394.1_170380_171172_-	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|310aa|up_5|NZ_CP016394.1_171171_172101_-	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|88aa|up_4|NZ_CP016394.1_172230_172494_-	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|147aa|up_3|NZ_CP016394.1_172559_173000_-	PRK04351, PRK04351, SprT family protein	NA|711aa|up_2|NZ_CP016394.1_172986_175119_-	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|301aa|up_1|NZ_CP016394.1_175482_176385_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|272aa|up_0|NZ_CP016394.1_176384_177200_+	COG3689, COG3689, Predicted membrane protein [Function unknown]	csn2|351aa|down_0|NZ_CP016394.1_179451_180504_-	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	cas2|108aa|down_1|NZ_CP016394.1_180500_180824_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|304aa|down_2|NZ_CP016394.1_180825_181737_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1122aa|down_3|NZ_CP016394.1_181913_185279_-	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|204aa|down_4|NZ_CP016394.1_185534_186146_-	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	NA|121aa|down_5|NZ_CP016394.1_187352_187715_-	NA	NA|121aa|down_6|NZ_CP016394.1_188109_188472_-	TIGR02328, TIGR02328, conserved hypothetical protein	NA|131aa|down_7|NZ_CP016394.1_188468_188861_-	pfam08349, DUF1722, Protein of unknown function (DUF1722)	NA|76aa|down_8|NZ_CP016394.1_192604_192832_+	TIGR01716, HTH-type_transcriptional_regulator_rgg, transcriptional activator, Rgg/GadR/MutR family, C-terminal domain	NA|189aa|down_9|NZ_CP016394.1_193675_194242_-	NA
GCF_001855705.1_ASM185570v1	NZ_CP016394	Streptococcus thermophilus strain ND07, complete genome	2	1294523-1295350	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type II-C,Type II-A,Type II-B	GTTTTAGAGCTGTGTTGTTTCGAATGGTTCCAAAAC,GTTTTAGAGCTGTGTTGTTTCGAATGGTTCCAAAAC,GTTTTAGAGCTGTGTTGTTTCGAATGGTTCCAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	12,12,12	12	TypeII-C,TypeII-A,TypeII-B	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|80aa|down_0|NZ_CP016394.1_1295433_1295673_+,NA|148aa|down_4|NZ_CP016394.1_1298113_1298557_+,NA|66aa|down_9|NZ_CP016394.1_1301441_1301639_+	NA|125aa|up_9|NZ_CP016394.1_1281601_1281976_+	pfam03788, LrgA, LrgA family	NA|232aa|up_8|NZ_CP016394.1_1281968_1282664_+	COG1346, LrgB, Putative effector of murein hydrolase [Cell envelope biogenesis, outer membrane]	NA|202aa|up_7|NZ_CP016394.1_1282704_1283310_+	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|651aa|up_6|NZ_CP016394.1_1283310_1285263_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|575aa|up_5|NZ_CP016394.1_1285359_1287084_+	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|216aa|up_4|NZ_CP016394.1_1287195_1287843_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	cas9|1389aa|up_3|NZ_CP016394.1_1288175_1292342_+	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	cas1|290aa|up_2|NZ_CP016394.1_1292341_1293211_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|115aa|up_1|NZ_CP016394.1_1293207_1293552_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|220aa|up_0|NZ_CP016394.1_1293541_1294201_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|80aa|down_0|NZ_CP016394.1_1295433_1295673_+	NA	NA|297aa|down_1|NZ_CP016394.1_1295710_1296601_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|81aa|down_2|NZ_CP016394.1_1296768_1297011_-	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|128aa|down_3|NZ_CP016394.1_1297699_1298083_+	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|148aa|down_4|NZ_CP016394.1_1298113_1298557_+	NA	NA|78aa|down_5|NZ_CP016394.1_1298692_1298926_+	COG3708, COG3708, Uncharacterized protein conserved in bacteria [Function unknown]	NA|244aa|down_6|NZ_CP016394.1_1298926_1299658_+	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|43aa|down_7|NZ_CP016394.1_1299739_1299868_+	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|438aa|down_8|NZ_CP016394.1_1299961_1301275_+	PRK12297, obgE, GTPase CgtA; Reviewed	NA|66aa|down_9|NZ_CP016394.1_1301441_1301639_+	NA
GCF_001855705.1_ASM185570v1	NZ_CP016394	Streptococcus thermophilus strain ND07, complete genome	3	1708655-1709417	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,DEDDh	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type I-E	GTTTTTCCCGCACACGCGGGGGTGATCC,GTTTTTCCCGCACACGCGGGGGTGATCC,GTTTTTCCCGCACACGCGGGGGTGATCC	28,28,28	0	0	NA	NA	I-C,I-E,II-B:I-C,I-E,II-B:I-C,I-E,II-B	12,12,12	12	TypeI-E	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|33aa|down_5|NZ_CP016394.1_1722643_1722742_+	NA|423aa|up_9|NZ_CP016394.1_1695722_1696991_+	PRK09357, pyrC, dihydroorotase; Validated	NA|648aa|up_8|NZ_CP016394.1_1697065_1699009_+	pfam09972, DUF2207, Predicted membrane protein (DUF2207)	cas3|927aa|up_7|NZ_CP016394.1_1699305_1702086_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|556aa|up_6|NZ_CP016394.1_1702072_1703740_+	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cse2gr11|198aa|up_5|NZ_CP016394.1_1703749_1704343_+	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas7|356aa|up_4|NZ_CP016394.1_1704332_1705400_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|242aa|up_3|NZ_CP016394.1_1705413_1706139_+	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|213aa|up_2|NZ_CP016394.1_1706143_1706782_+	pfam08798, CRISPR_assoc, CRISPR associated protein	cas1|314aa|up_1|NZ_CP016394.1_1706785_1707727_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	DEDDh|302aa|up_0|NZ_CP016394.1_1707728_1708634_+	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|324aa|down_0|NZ_CP016394.1_1712401_1713373_+	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]	NA|333aa|down_1|NZ_CP016394.1_1713397_1714396_+	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|463aa|down_2|NZ_CP016394.1_1714553_1715942_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|585aa|down_3|NZ_CP016394.1_1716112_1717867_+	TIGR01350, Dihydrolipoyl_dehydrogenase, dihydrolipoamide dehydrogenase	NA|90aa|down_4|NZ_CP016394.1_1722337_1722607_+	pfam18033, SpuA_C, SpuA C-terminal	NA|33aa|down_5|NZ_CP016394.1_1722643_1722742_+	NA	NA|318aa|down_6|NZ_CP016394.1_1723105_1724059_+	COG4606, CeuB, ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism]	NA|323aa|down_7|NZ_CP016394.1_1724055_1725024_+	COG4605, CeuC, ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism]	NA|252aa|down_8|NZ_CP016394.1_1725020_1725776_+	COG4604, CeuD, ABC-type enterochelin transport system, ATPase component [Inorganic ion transport and metabolism]	NA|349aa|down_9|NZ_CP016394.1_1725787_1726834_+	COG4607, CeuA, ABC-type enterochelin transport system, periplasmic component [Inorganic ion transport and metabolism]
GCF_001855705.1_ASM185570v1	NZ_CP016394	Streptococcus thermophilus strain ND07, complete genome	4	1779072-1779330	4,4,4	CRT,PILER-CR,CRISPRCasFinder	no	csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,cas2,cas1	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-D,Type III-A,Type III-B,Type III-C	GTTTCCGTCCCCTCTCGAGGTAATTAGGTTTATATC,GTTTCCGTCCCCTCTCGAGGTAATTAGGTTTATATC,GTCCCCTCTCGAGGTAATTAGGTTTATATC	36,36,30	0	0	NA	NA	II-B,III-A:II-B,III-A:NA	3,3,3	3	TypeIII-D,TypeIII-A,TypeIII-B,TypeIII-C	DinG,csn2,cas2,cas1,cas9,cas3,DEDDh,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|77aa|up_9|NZ_CP016394.1_1767033_1767264_-,NA|85aa|down_9|NZ_CP016394.1_1791235_1791490_-	NA|77aa|up_9|NZ_CP016394.1_1767033_1767264_-	NA	NA|484aa|up_8|NZ_CP016394.1_1768371_1769823_+	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]	NA|210aa|up_7|NZ_CP016394.1_1770520_1771150_-	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|232aa|up_6|NZ_CP016394.1_1771239_1771935_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	csm5gr7|358aa|up_5|NZ_CP016394.1_1772930_1774004_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|300aa|up_4|NZ_CP016394.1_1774006_1774906_-	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm3gr7|221aa|up_3|NZ_CP016394.1_1774907_1775570_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|127aa|up_2|NZ_CP016394.1_1775569_1775950_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|759aa|up_1|NZ_CP016394.1_1775953_1778230_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|244aa|up_0|NZ_CP016394.1_1778210_1778942_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas2|110aa|down_0|NZ_CP016394.1_1779430_1779760_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP016394.1_1779759_1780764_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	NA|316aa|down_2|NZ_CP016394.1_1780902_1781850_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|268aa|down_3|NZ_CP016394.1_1781868_1782672_-	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|183aa|down_4|NZ_CP016394.1_1782912_1783461_-	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|553aa|down_5|NZ_CP016394.1_1784393_1786052_+	pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA)	NA|89aa|down_6|NZ_CP016394.1_1787743_1788010_+	cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Bacillus subtilis yoeA	NA|212aa|down_7|NZ_CP016394.1_1788978_1789614_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|255aa|down_8|NZ_CP016394.1_1789675_1790440_-	cd09007, NP-I_spr0068, uncharacterized subfamily of the nucleoside phosphorylase-I family	NA|85aa|down_9|NZ_CP016394.1_1791235_1791490_-	NA
