assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902141215.1_42042_G01	NZ_LR595857	Streptococcus sp. NCTC 11567 strain NCTC11567 chromosome 1	1	217863-218009	1	CRISPRCasFinder	no		cas3,DEDDh,cas9,cas1,csn2,csm6,DinG,cas2,cas4,cas7,cas8c,cas5,WYL,PD-DExK,RT,csa3	Orphan	CTTGCTGAAGTTCAAACTCATATCCGTGAGAAATTAAAAGCAGAGAAAGCT	51	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas9,cas1,csn2,csm6,DinG,cas2,cas4,cas7,cas8c,cas5,WYL,PD-DExK,RT,csa3	NA|184aa|up_3|NZ_LR595857.1_212884_213436_-,NA|97aa|down_8|NZ_LR595857.1_234061_234352_-	NA|253aa|up_9|NZ_LR595857.1_205941_206700_+	COG1385, COG1385, Uncharacterized protein conserved in bacteria [Function unknown]	NA|336aa|up_8|NZ_LR595857.1_206865_207873_+	cd06294, PBP1_MalR-like, ligand-binding domain of maltose transcription regulator MalR which is a member of the LacI-GalR family repressors	NA|729aa|up_7|NZ_LR595857.1_208142_210329_+	TIGR02003, PTS_system_glucose-specific_IIBC_component, PTS system, IIBC component	NA|275aa|up_6|NZ_LR595857.1_210407_211232_+	cd09079, RgfB-like, Streptococcus agalactiae RgfB, part of a putative two component signal transduction system, and related proteins	NA|171aa|up_5|NZ_LR595857.1_211240_211753_-	PRK06762, PRK06762, hypothetical protein; Provisional	NA|346aa|up_4|NZ_LR595857.1_211776_212814_-	cd05657, M42_glucanase_like, M42 Peptidase, endoglucanase-like subfamily	NA|184aa|up_3|NZ_LR595857.1_212884_213436_-	NA	NA|327aa|up_2|NZ_LR595857.1_213439_214420_-	cd02653, nuc_hydro_3, NH_3: A subgroup of nucleoside hydrolases	NA|156aa|up_1|NZ_LR595857.1_214700_215168_-	PRK02551, PRK02551, flavoprotein NrdI; Provisional	NA|514aa|up_0|NZ_LR595857.1_215655_217197_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|793aa|down_0|NZ_LR595857.1_219085_221464_-	PRK11907, PRK11907, bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase	NA|740aa|down_1|NZ_LR595857.1_221719_223939_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|148aa|down_2|NZ_LR595857.1_223953_224397_+	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|441aa|down_3|NZ_LR595857.1_224494_225817_-	pfam02821, Staphylokinase, Staphylokinase/Streptokinase family	NA|283aa|down_4|NZ_LR595857.1_226145_226994_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|378aa|down_5|NZ_LR595857.1_227291_228425_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|538aa|down_6|NZ_LR595857.1_228506_230120_+	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|1208aa|down_7|NZ_LR595857.1_230285_233909_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|97aa|down_8|NZ_LR595857.1_234061_234352_-	NA	NA|117aa|down_9|NZ_LR595857.1_234507_234858_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains
GCF_902141215.1_42042_G01	NZ_LR595857	Streptococcus sp. NCTC 11567 strain NCTC11567 chromosome 1	2	882677-882775	2	CRISPRCasFinder	no	cas9,cas1,csn2	cas3,DEDDh,cas9,cas1,csn2,csm6,DinG,cas2,cas4,cas7,cas8c,cas5,WYL,PD-DExK,RT,csa3	Type II-C,Type II-B,Type II-A	GTTTTAGAGCTATGTTGTTTTGAATGGTCCCAA	33	0	0	NA	NA	II-A,II-B	1	1	TypeII-C,TypeII-B,TypeII-A	cas3,DEDDh,cas9,cas1,csn2,csm6,DinG,cas2,cas4,cas7,cas8c,cas5,WYL,PD-DExK,RT,csa3	NA|214aa|up_7|NZ_LR595857.1_872144_872786_+,NA	NA|319aa|up_9|NZ_LR595857.1_869655_870612_+	COG4856, COG4856, Uncharacterized protein conserved in bacteria [Function unknown]	NA|452aa|up_8|NZ_LR595857.1_870665_872021_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_7|NZ_LR595857.1_872144_872786_+	NA	NA|377aa|up_6|NZ_LR595857.1_872847_873978_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_5|NZ_LR595857.1_873987_874740_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_4|NZ_LR595857.1_874739_875504_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|210aa|up_3|NZ_LR595857.1_875503_876133_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	cas9|1372aa|up_2|NZ_LR595857.1_876609_880725_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|290aa|up_1|NZ_LR595857.1_880724_881594_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	csn2|221aa|up_0|NZ_LR595857.1_881911_882574_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|153aa|down_0|NZ_LR595857.1_882893_883352_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|611aa|down_1|NZ_LR595857.1_883421_885254_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|146aa|down_2|NZ_LR595857.1_885429_885867_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|77aa|down_3|NZ_LR595857.1_886705_886936_+	pfam01721, Bacteriocin_II, Class II bacteriocin	NA|99aa|down_4|NZ_LR595857.1_886935_887232_+	pfam08951, EntA_Immun, Enterocin A Immunity	NA|390aa|down_5|NZ_LR595857.1_887421_888591_+	pfam11187, DUF2974, Protein of unknown function (DUF2974)	NA|340aa|down_6|NZ_LR595857.1_888701_889721_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_7|NZ_LR595857.1_889927_890353_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_8|NZ_LR595857.1_890372_890864_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|270aa|down_9|NZ_LR595857.1_890880_891690_+	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]
GCF_902141215.1_42042_G01	NZ_LR595857	Streptococcus sp. NCTC 11567 strain NCTC11567 chromosome 1	3	1451360-1451588	1,3,1	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas3,DEDDh,cas9,cas1,csn2,csm6,DinG,cas2,cas4,cas7,cas8c,cas5,WYL,PD-DExK,RT,csa3	Type I-U, Type I-U?,Type I-C	ATTTCAATCCACTCACCCGCGAAGGGTGAGAC,CAATCCACTCACCCGCGAAGGGTGAGAC,AATTTCAATCCACTCACCCGCGAAGGGTGAGAC	32,28,33	0	0	NA	NA	I-C:NA:I-C	3,3,2	3	TypeI-U,TypeI-U?,TypeI-C	cas3,DEDDh,cas9,cas1,csn2,csm6,DinG,cas2,cas4,cas7,cas8c,cas5,WYL,PD-DExK,RT,csa3	NA,NA	NA|158aa|up_9|NZ_LR595857.1_1441444_1441918_+	COG1438, ArgR, Arginine repressor [Transcription]	NA|239aa|up_8|NZ_LR595857.1_1442095_1442812_-	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|360aa|up_7|NZ_LR595857.1_1442825_1443905_-	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|578aa|up_6|NZ_LR595857.1_1443977_1445711_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|247aa|up_5|NZ_LR595857.1_1445707_1446448_-	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|369aa|up_4|NZ_LR595857.1_1446535_1447642_-	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|208aa|up_3|NZ_LR595857.1_1447684_1448308_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|237aa|up_2|NZ_LR595857.1_1448320_1449031_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|311aa|up_1|NZ_LR595857.1_1449250_1450183_-	cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of the Escherichia coli CorA-Salmonella typhimurium ZntB family	NA|320aa|up_0|NZ_LR595857.1_1450274_1451234_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	cas2|98aa|down_0|NZ_LR595857.1_1451736_1452030_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|342aa|down_1|NZ_LR595857.1_1452040_1453066_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|225aa|down_2|NZ_LR595857.1_1453062_1453737_-	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas7|283aa|down_3|NZ_LR595857.1_1453738_1454587_-	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas8c|632aa|down_4|NZ_LR595857.1_1454591_1456487_-	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|243aa|down_5|NZ_LR595857.1_1456486_1457215_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|808aa|down_6|NZ_LR595857.1_1457477_1459901_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|883aa|down_7|NZ_LR595857.1_1460167_1462816_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|188aa|down_8|NZ_LR595857.1_1462817_1463381_-	pfam13238, AAA_18, AAA domain	NA|197aa|down_9|NZ_LR595857.1_1463377_1463968_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]
