assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_901543615.1_42731_E02	NZ_LR594045	Streptococcus dysgalactiae subsp. equisimilis strain NCTC9413 chromosome 1	1	123046-123231	1	PILER-CR	no		DEDDh,DinG,csa3,csm6,csn2,cas2,cas1,cas9,cas3	Orphan	TTTAGATGGTCAAGAAGTCCCAGAAGTTCCAAGCGAGAGCTTAGAACC	48	0	0	NA	NA	NA	2	2	Orphan	DEDDh,DinG,csa3,csm6,csn2,cas2,cas1,cas9,cas3	NA,NA|192aa|down_6|NZ_LR594045.1_135314_135890_-	NA|132aa|up_9|NZ_LR594045.1_111437_111833_+	PRK07274, PRK07274, single-stranded DNA-binding protein; Provisional	NA|321aa|up_8|NZ_LR594045.1_112342_113305_+	cd06325, PBP1_ABC_unchar_transporter, type 1 periplasmic ligand-binding domain of uncharacterized ABC-type transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|320aa|up_7|NZ_LR594045.1_113334_114294_+	cd06325, PBP1_ABC_unchar_transporter, type 1 periplasmic ligand-binding domain of uncharacterized ABC-type transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|295aa|up_6|NZ_LR594045.1_114334_115219_+	COG4120, COG4120, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|410aa|up_5|NZ_LR594045.1_115312_116542_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|268aa|up_4|NZ_LR594045.1_116845_117649_+	COG1101, PhnK, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|214aa|up_3|NZ_LR594045.1_117784_118426_-	COG1428, COG1428, Deoxynucleoside kinases [Nucleotide transport and metabolism]	NA|326aa|up_2|NZ_LR594045.1_118445_119423_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|291aa|up_1|NZ_LR594045.1_119409_120282_-	PRK00114, hslO, Hsp33 family molecular chaperone HslO	NA|498aa|up_0|NZ_LR594045.1_120428_121922_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|1373aa|down_0|NZ_LR594045.1_125085_129204_+	COG4932, COG4932, Predicted outer membrane protein [Cell envelope biogenesis, outer membrane]	NA|714aa|down_1|NZ_LR594045.1_129324_131466_+	pfam16570, GramPos_pilinD3, Gram-positive pilin backbone subunit 3, Cna-B-like domain	NA|282aa|down_2|NZ_LR594045.1_131478_132324_+	pfam17802, SpaA, Prealbumin-like fold domain	NA|291aa|down_3|NZ_LR594045.1_132405_133278_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|278aa|down_4|NZ_LR594045.1_133264_134098_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|286aa|down_5|NZ_LR594045.1_134117_134975_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|192aa|down_6|NZ_LR594045.1_135314_135890_-	NA	NA|227aa|down_7|NZ_LR594045.1_136727_137408_-	COG2964, COG2964, Uncharacterized protein conserved in bacteria [Function unknown]	NA|120aa|down_8|NZ_LR594045.1_137586_137946_+	COG0251, TdcF, Putative translation initiation inhibitor, yjgF family [Translation, ribosomal structure and biogenesis]	NA|340aa|down_9|NZ_LR594045.1_137983_139003_+	COG1299, FruA, Phosphotransferase system, fructose-specific IIC component [Carbohydrate transport and metabolism]
GCF_901543615.1_42731_E02	NZ_LR594045	Streptococcus dysgalactiae subsp. equisimilis strain NCTC9413 chromosome 1	2	184740-184886	1	CRISPRCasFinder	no		DEDDh,DinG,csa3,csm6,csn2,cas2,cas1,cas9,cas3	Orphan	CTTGCTGAAGTTCAAACTCATATCCGTGAGAAATTAAAAGCAGAGAAAGCT	51	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,csa3,csm6,csn2,cas2,cas1,cas9,cas3	NA|184aa|up_3|NZ_LR594045.1_179792_180344_-,NA|97aa|down_8|NZ_LR594045.1_199734_200025_-	NA|253aa|up_9|NZ_LR594045.1_172849_173608_+	COG1385, COG1385, Uncharacterized protein conserved in bacteria [Function unknown]	NA|336aa|up_8|NZ_LR594045.1_173773_174781_+	cd06294, PBP1_MalR-like, ligand-binding domain of maltose transcription regulator MalR which is a member of the LacI-GalR family repressors	NA|729aa|up_7|NZ_LR594045.1_175050_177237_+	TIGR02003, PTS_system_glucose-specific_IIBC_component, PTS system, IIBC component	NA|275aa|up_6|NZ_LR594045.1_177315_178140_+	cd09079, RgfB-like, Streptococcus agalactiae RgfB, part of a putative two component signal transduction system, and related proteins	NA|171aa|up_5|NZ_LR594045.1_178148_178661_-	PRK06762, PRK06762, hypothetical protein; Provisional	NA|346aa|up_4|NZ_LR594045.1_178684_179722_-	cd05657, M42_glucanase_like, M42 Peptidase, endoglucanase-like subfamily	NA|184aa|up_3|NZ_LR594045.1_179792_180344_-	NA	NA|327aa|up_2|NZ_LR594045.1_180347_181328_-	cd02653, nuc_hydro_3, NH_3: A subgroup of nucleoside hydrolases	NA|156aa|up_1|NZ_LR594045.1_181608_182076_-	PRK02551, PRK02551, flavoprotein NrdI; Provisional	NA|511aa|up_0|NZ_LR594045.1_182562_184095_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|740aa|down_0|NZ_LR594045.1_186423_188643_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|148aa|down_1|NZ_LR594045.1_188656_189100_+	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|433aa|down_2|NZ_LR594045.1_189197_190496_-	pfam02821, Staphylokinase, Staphylokinase/Streptokinase family	NA|288aa|down_3|NZ_LR594045.1_190524_191388_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|283aa|down_4|NZ_LR594045.1_191817_192666_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|378aa|down_5|NZ_LR594045.1_192964_194098_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|538aa|down_6|NZ_LR594045.1_194179_195793_+	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|1208aa|down_7|NZ_LR594045.1_195958_199582_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|97aa|down_8|NZ_LR594045.1_199734_200025_-	NA	NA|117aa|down_9|NZ_LR594045.1_200180_200531_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains
GCF_901543615.1_42731_E02	NZ_LR594045	Streptococcus dysgalactiae subsp. equisimilis strain NCTC9413 chromosome 1	3	1246837-1247531	2,1,2	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	DEDDh,DinG,csa3,csm6,csn2,cas2,cas1,cas9,cas3	Type II-C,Type II-A,Type II-B	GTTTTGGGACCATTCAAAACAACATAGCTCTAAAAC,GTTTTGGGACCATTCAAAACAACATAGCTCTAAAAC,GTTTTGGGACCATTCAAAACAACATAGCTCTAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	10,10,9	10	TypeII-C,TypeII-A,TypeII-B	DEDDh,DinG,csa3,csm6,csn2,cas2,cas1,cas9,cas3	NA,NA|214aa|down_8|NZ_LR594045.1_1257431_1258073_-	NA|164aa|up_9|NZ_LR594045.1_1238719_1239211_-	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|142aa|up_8|NZ_LR594045.1_1239229_1239655_-	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|340aa|up_7|NZ_LR594045.1_1239861_1240881_-	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|390aa|up_6|NZ_LR594045.1_1240991_1242161_-	pfam11187, DUF2974, Protein of unknown function (DUF2974)	NA|99aa|up_5|NZ_LR594045.1_1242350_1242647_-	pfam08951, EntA_Immun, Enterocin A Immunity	NA|77aa|up_4|NZ_LR594045.1_1242646_1242877_-	pfam01721, Bacteriocin_II, Class II bacteriocin	NA|217aa|up_3|NZ_LR594045.1_1243084_1243735_-	COG1418, COG1418, Predicted HD superfamily hydrolase [General function prediction only]	NA|146aa|up_2|NZ_LR594045.1_1243747_1244185_-	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|611aa|up_1|NZ_LR594045.1_1244360_1246193_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|153aa|up_0|NZ_LR594045.1_1246262_1246721_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	csn2|221aa|down_0|NZ_LR594045.1_1247633_1248296_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|114aa|down_1|NZ_LR594045.1_1248285_1248627_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_LR594045.1_1248623_1249493_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1372aa|down_3|NZ_LR594045.1_1249492_1253608_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|210aa|down_4|NZ_LR594045.1_1254084_1254714_-	COG4478, COG4478, Predicted membrane protein [Function unknown]	NA|255aa|down_5|NZ_LR594045.1_1254713_1255478_-	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|251aa|down_6|NZ_LR594045.1_1255477_1256230_-	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|377aa|down_7|NZ_LR594045.1_1256239_1257370_-	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|214aa|down_8|NZ_LR594045.1_1257431_1258073_-	NA	NA|452aa|down_9|NZ_LR594045.1_1258196_1259552_-	PRK14316, glmM, phosphoglucosamine mutase; Provisional
