assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002846075.1_ASM284607v1	NZ_CP025399	Streptococcus thermophilus strain GABA chromosome, complete genome	1	652778-655449	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	Type II-A,Type II-B,Type II-C	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,36	0	0	NA	NA	NA:NA:NA	40,40,40	40	TypeII-A,TypeII-B,TypeII-C	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	NA,NA	NA|183aa|up_9|NZ_CP025399.1_641414_641963_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|454aa|up_8|NZ_CP025399.1_642178_643540_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|277aa|up_7|NZ_CP025399.1_643749_644580_+	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|74aa|up_6|NZ_CP025399.1_644747_644969_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|153aa|up_5|NZ_CP025399.1_644968_645427_+	COG3392, COG3392, Adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|204aa|up_4|NZ_CP025399.1_646021_646633_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas9|1122aa|up_3|NZ_CP025399.1_646887_650253_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|304aa|up_2|NZ_CP025399.1_650429_651341_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_1|NZ_CP025399.1_651342_651666_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NZ_CP025399.1_651662_652715_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NZ_CP025399.1_655490_656306_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|300aa|down_1|NZ_CP025399.1_656305_657205_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NZ_CP025399.1_657568_659701_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NZ_CP025399.1_659687_660128_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NZ_CP025399.1_660192_660456_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NZ_CP025399.1_660585_661515_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NZ_CP025399.1_661514_662306_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NZ_CP025399.1_662428_662827_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NZ_CP025399.1_662839_663280_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NZ_CP025399.1_663300_663597_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_002846075.1_ASM284607v1	NZ_CP025399	Streptococcus thermophilus strain GABA chromosome, complete genome	2	766448-766595	2	CRISPRCasFinder	no		cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	Orphan	CTCAGAAAGTCCCGTAGCCGAGGAGTTGGTTGACACTTCTGTGGAGGCTACC	52	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	NA|100aa|up_4|NZ_CP025399.1_757867_758167_+,NA|108aa|up_3|NZ_CP025399.1_758244_758568_+,NA	NA|573aa|up_9|NZ_CP025399.1_751530_753249_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|191aa|up_8|NZ_CP025399.1_753331_753904_-	COG4684, COG4684, Predicted membrane protein [Function unknown]	NA|182aa|up_7|NZ_CP025399.1_753931_754477_-	PRK07313, PRK07313, phosphopantothenoylcysteine decarboxylase; Validated	NA|228aa|up_6|NZ_CP025399.1_754469_755153_-	PRK06732, PRK06732, phosphopantothenate--cysteine ligase; Validated	NA|557aa|up_5|NZ_CP025399.1_755346_757017_+	PRK13505, PRK13505, formate--tetrahydrofolate ligase; Provisional	NA|100aa|up_4|NZ_CP025399.1_757867_758167_+	NA	NA|108aa|up_3|NZ_CP025399.1_758244_758568_+	NA	NA|524aa|up_2|NZ_CP025399.1_759389_760960_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|357aa|up_1|NZ_CP025399.1_762720_763791_+	cd13663, PBP2_PotD_PotF_like_2, The periplasmic substrate-binding component of an uncharacterized active transport system closely related to spermidine and putrescine transporters; contains the type 2 periplasmic binding fold	NA|513aa|up_0|NZ_CP025399.1_763908_765447_+	cd01031, EriC, ClC chloride channel EriC	NA|524aa|down_0|NZ_CP025399.1_771253_772824_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|79aa|down_1|NZ_CP025399.1_774349_774586_-	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|307aa|down_2|NZ_CP025399.1_774654_775575_-	PRK05439, PRK05439, pantothenate kinase; Provisional	NA|197aa|down_3|NZ_CP025399.1_775899_776490_+	COG2813, RsmC, 16S RNA G1207 methylase RsmC [Translation, ribosomal structure and biogenesis]	NA|61aa|down_4|NZ_CP025399.1_776505_776688_+	PRK06078, PRK06078, pyrimidine-nucleoside phosphorylase; Reviewed	NA|79aa|down_5|NZ_CP025399.1_776703_776940_+	PRK06078, PRK06078, pyrimidine-nucleoside phosphorylase; Reviewed	NA|267aa|down_6|NZ_CP025399.1_776920_777721_+	PRK06078, PRK06078, pyrimidine-nucleoside phosphorylase; Reviewed	NA|357aa|down_7|NZ_CP025399.1_778895_779966_+	COG1744, Med, Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein [General function prediction only]	NA|513aa|down_8|NZ_CP025399.1_780079_781618_+	COG3845, COG3845, ABC-type uncharacterized transport systems, ATPase components [General function prediction only]	NA|356aa|down_9|NZ_CP025399.1_781610_782678_+	COG4603, COG4603, ABC-type uncharacterized transport system, permease component [General function prediction only]
GCF_002846075.1_ASM284607v1	NZ_CP025399	Streptococcus thermophilus strain GABA chromosome, complete genome	3	889434-889687	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,csm6	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	Type III-A	GATATAAACCTAATTACCTCGAGAGGGGACGGAAACTG,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAACTGA	38,36,39	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	3,3,3	3	TypeIII-A	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	NA|70aa|up_7|NZ_CP025399.1_881917_882127_+,NA|74aa|down_2|NZ_CP025399.1_893971_894193_+,NA|128aa|down_4|NZ_CP025399.1_894493_894877_+,NA|78aa|down_6|NZ_CP025399.1_895548_895782_+,NA|86aa|down_7|NZ_CP025399.1_895971_896229_+,NA|103aa|down_8|NZ_CP025399.1_896228_896537_+,NA|60aa|down_9|NZ_CP025399.1_896911_897091_+	NA|212aa|up_9|NZ_CP025399.1_878894_879530_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|92aa|up_8|NZ_CP025399.1_879673_879949_+	cd01140, FatB, Siderophore binding protein FatB	NA|70aa|up_7|NZ_CP025399.1_881917_882127_+	NA	NA|553aa|up_6|NZ_CP025399.1_882502_884161_-	pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA)	NA|230aa|up_5|NZ_CP025399.1_884478_885168_+	pfam05857, TraX, TraX protein	NA|183aa|up_4|NZ_CP025399.1_885301_885850_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|268aa|up_3|NZ_CP025399.1_886091_886895_+	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|316aa|up_2|NZ_CP025399.1_886913_887861_+	PRK07259, PRK07259, dihydroorotate dehydrogenase	cas1|335aa|up_1|NZ_CP025399.1_887999_889004_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NZ_CP025399.1_889003_889333_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|465aa|down_0|NZ_CP025399.1_890143_891538_-	TIGR01000, Mesentericin_Y105_secretion_protein_MesE, bacteriocin secretion accessory protein	NA|716aa|down_1|NZ_CP025399.1_891549_893697_-	TIGR01193, transport/processing_ATP-binding_protein, ABC-type bacteriocin transporter	NA|74aa|down_2|NZ_CP025399.1_893971_894193_+	NA	NA|76aa|down_3|NZ_CP025399.1_894244_894472_+	pfam10439, Bacteriocin_IIc, Bacteriocin class II with double-glycine leader peptide	NA|128aa|down_4|NZ_CP025399.1_894493_894877_+	NA	NA|101aa|down_5|NZ_CP025399.1_895088_895391_+	pfam08951, EntA_Immun, Enterocin A Immunity	NA|78aa|down_6|NZ_CP025399.1_895548_895782_+	NA	NA|86aa|down_7|NZ_CP025399.1_895971_896229_+	NA	NA|103aa|down_8|NZ_CP025399.1_896228_896537_+	NA	NA|60aa|down_9|NZ_CP025399.1_896911_897091_+	NA
GCF_002846075.1_ASM284607v1	NZ_CP025399	Streptococcus thermophilus strain GABA chromosome, complete genome	4	1396122-1397147	3,4,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	Type II-A,Type II-B,Type II-C	CGTTTTGGAACCATTCGAAACAACACAGCTCTAAAACACG,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	40,36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B:II-A,II-B	13,15,15,13	15	TypeII-A,TypeII-B,TypeII-C	cas3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG,csm6	NA|66aa|up_7|NZ_CP025399.1_1389830_1390028_-,NA|148aa|up_4|NZ_CP025399.1_1392914_1393358_-,NA|80aa|up_0|NZ_CP025399.1_1395798_1396038_-,NA	NA|737aa|up_9|NZ_CP025399.1_1386807_1389018_+	cd13619, PBP2_GlnP, Glutamine-binding domain of ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|247aa|up_8|NZ_CP025399.1_1389017_1389758_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|66aa|up_7|NZ_CP025399.1_1389830_1390028_-	NA	NA|438aa|up_6|NZ_CP025399.1_1390194_1391508_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_5|NZ_CP025399.1_1391601_1391730_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|148aa|up_4|NZ_CP025399.1_1392914_1393358_-	NA	NA|128aa|up_3|NZ_CP025399.1_1393388_1393772_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|81aa|up_2|NZ_CP025399.1_1394460_1394703_+	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|297aa|up_1|NZ_CP025399.1_1394870_1395761_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP025399.1_1395798_1396038_-	NA	csn2|220aa|down_0|NZ_CP025399.1_1397468_1398128_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP025399.1_1398117_1398462_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP025399.1_1398458_1399328_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1389aa|down_3|NZ_CP025399.1_1399327_1403494_-	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	NA|216aa|down_4|NZ_CP025399.1_1403827_1404475_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_5|NZ_CP025399.1_1404586_1406311_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_6|NZ_CP025399.1_1406407_1408360_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|202aa|down_7|NZ_CP025399.1_1408360_1408966_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_8|NZ_CP025399.1_1409006_1409702_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_9|NZ_CP025399.1_1409694_1410069_-	pfam03788, LrgA, LrgA family
