assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000253395.1_ASM25339v1	NC_017581	Streptococcus thermophilus JIM 8232, complete genome	1	712332-715137	1,1,1,2	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas9,cas1,cas2,csn2	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Type II-C,Type II-B,Type II-A	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAA,AGTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,35,37	0	0	NA	NA	NA:NA:NA:NA	42,42,24,24	42	TypeII-C,TypeII-B,TypeII-A	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA|86aa|up_5|NC_017581.1_705147_705405_-,NA	NA|183aa|up_9|NC_017581.1_700969_701518_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|454aa|up_8|NC_017581.1_701733_703095_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|74aa|up_7|NC_017581.1_704301_704523_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|153aa|up_6|NC_017581.1_704522_704981_+	COG3392, COG3392, Adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|86aa|up_5|NC_017581.1_705147_705405_-	NA	NA|205aa|up_4|NC_017581.1_705574_706189_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas9|1122aa|up_3|NC_017581.1_706442_709808_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|304aa|up_2|NC_017581.1_709984_710896_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_1|NC_017581.1_710897_711221_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NC_017581.1_711217_712270_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NC_017581.1_715178_715994_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NC_017581.1_715993_716896_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NC_017581.1_717259_719392_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NC_017581.1_719378_719819_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NC_017581.1_719886_720150_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NC_017581.1_720279_721209_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NC_017581.1_721208_722000_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NC_017581.1_722122_722521_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NC_017581.1_722533_722974_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NC_017581.1_722994_723291_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_000253395.1_ASM25339v1	NC_017581	Streptococcus thermophilus JIM 8232, complete genome	2	969817-971087	3,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Type III-C,Type III-B,Type III-A,Type III-D	GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	16,17,17	17	TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-D	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA|73aa|up_7|NC_017581.1_964329_964548_+,NA	NA|162aa|up_9|NC_017581.1_963212_963698_+	cd04684, Nudix_Hydrolase_25, Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function	NA|64aa|up_8|NC_017581.1_963701_963893_+	COG4405, COG4405, Uncharacterized protein conserved in bacteria [Function unknown]	NA|73aa|up_7|NC_017581.1_964329_964548_+	NA	NA|223aa|up_6|NC_017581.1_964749_965418_+	pfam05857, TraX, TraX protein	NA|70aa|up_5|NC_017581.1_965433_965643_+	pfam05857, TraX, TraX protein	NA|183aa|up_4|NC_017581.1_965684_966233_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|268aa|up_3|NC_017581.1_966474_967278_+	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|316aa|up_2|NC_017581.1_967296_968244_+	PRK07259, PRK07259, dihydroorotate dehydrogenase	cas1|335aa|up_1|NC_017581.1_968382_969387_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NC_017581.1_969386_969716_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|244aa|down_0|NC_017581.1_971217_971949_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas10|759aa|down_1|NC_017581.1_971929_974206_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|131aa|down_2|NC_017581.1_974209_974602_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|221aa|down_3|NC_017581.1_974601_975264_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|300aa|down_4|NC_017581.1_975265_976165_+	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm5gr7|358aa|down_5|NC_017581.1_976167_977241_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm6|387aa|down_6|NC_017581.1_977237_978398_+	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm6|429aa|down_7|NC_017581.1_978412_979699_+	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	NA|232aa|down_8|NC_017581.1_979897_980593_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|210aa|down_9|NC_017581.1_980681_981311_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated
GCF_000253395.1_ASM25339v1	NC_017581	Streptococcus thermophilus JIM 8232, complete genome	3	1452282-1452514	4,3,3	PILER-CR,CRISPRCasFinder,CRT	no		cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Orphan	GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	2,3,3	3	Orphan	cas3,DEDDh,cas9,cas1,cas2,csn2,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA|66aa|up_7|NC_017581.1_1445991_1446189_-,NA|148aa|up_4|NC_017581.1_1449074_1449518_-,NA|80aa|up_0|NC_017581.1_1451958_1452198_-,NA|186aa|down_8|NC_017581.1_1461275_1461833_-	NA|737aa|up_9|NC_017581.1_1442968_1445179_+	cd13619, PBP2_GlnP, Glutamine-binding domain of ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|247aa|up_8|NC_017581.1_1445178_1445919_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|66aa|up_7|NC_017581.1_1445991_1446189_-	NA	NA|438aa|up_6|NC_017581.1_1446355_1447669_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_5|NC_017581.1_1447762_1447891_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|148aa|up_4|NC_017581.1_1449074_1449518_-	NA	NA|128aa|up_3|NC_017581.1_1449548_1449932_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|81aa|up_2|NC_017581.1_1450620_1450863_+	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|297aa|up_1|NC_017581.1_1451030_1451921_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NC_017581.1_1451958_1452198_-	NA	NA|216aa|down_0|NC_017581.1_1452618_1453266_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_1|NC_017581.1_1453377_1455102_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_2|NC_017581.1_1455198_1457151_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|187aa|down_3|NC_017581.1_1457151_1457712_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_4|NC_017581.1_1457797_1458493_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_5|NC_017581.1_1458485_1458860_-	pfam03788, LrgA, LrgA family	NA|118aa|down_6|NC_017581.1_1459139_1459493_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|393aa|down_7|NC_017581.1_1460078_1461257_-	cd12174, PGDH_like_3, Putative D-3-Phosphoglycerate Dehydrogenases, NAD-binding and catalytic domains	NA|186aa|down_8|NC_017581.1_1461275_1461833_-	NA	NA|365aa|down_9|NC_017581.1_1461844_1462939_-	PRK05355, PRK05355, 3-phosphoserine/phosphohydroxythreonine transaminase
