assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002355815.1_ASM235581v1	NZ_AP017629	Streptococcus pyogenes strain JMUB1235	1	112080-112176	1	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	GCTAGATGGTGAAGAAGTCCCAGAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA|222aa|down_8|NZ_AP017629.1_125030_125696_-	NA|257aa|up_9|NZ_AP017629.1_102397_103168_-	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|356aa|up_8|NZ_AP017629.1_103215_104283_-	TIGR03107, Glutamyl_aminopeptidase, glutamyl aminopeptidase	NA|98aa|up_7|NZ_AP017629.1_104738_105032_+	pfam15513, DUF4651, Domain of unknown function (DUF4651)	NA|106aa|up_6|NZ_AP017629.1_105028_105346_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|209aa|up_5|NZ_AP017629.1_105363_105990_+	cd02796, tRNA_bind_bactPheRS, tRNA-binding-domain-containing prokaryotic phenylalanly tRNA synthetase (PheRS) beta chain	NA|132aa|up_4|NZ_AP017629.1_106141_106537_+	PRK07274, PRK07274, single-stranded DNA-binding protein; Provisional	NA|214aa|up_3|NZ_AP017629.1_106790_107432_-	COG1428, COG1428, Deoxynucleoside kinases [Nucleotide transport and metabolism]	NA|326aa|up_2|NZ_AP017629.1_107451_108429_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|291aa|up_1|NZ_AP017629.1_108415_109288_-	PRK00114, hslO, Hsp33 family molecular chaperone HslO	NA|498aa|up_0|NZ_AP017629.1_109434_110928_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|283aa|down_0|NZ_AP017629.1_113303_114152_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|754aa|down_1|NZ_AP017629.1_114437_116699_+	NF033396, pilus_ancill_1, pilus ancillary protein 1	NA|174aa|down_2|NZ_AP017629.1_116695_117217_+	TIGR02227, Inactive_signal_peptidase_IA	NA|350aa|down_3|NZ_AP017629.1_117238_118288_+	TIGR03786, strep_pil_rpt, streptococcal pilin isopeptide linkage domain	NA|242aa|down_4|NZ_AP017629.1_118303_119029_+	TIGR03064, sortase_srtB, sortase, SrtB family	NA|196aa|down_5|NZ_AP017629.1_119045_119633_+	TIGR03786, strep_pil_rpt, streptococcal pilin isopeptide linkage domain	NA|402aa|down_6|NZ_AP017629.1_119791_120997_-	TIGR04094, AraC_family_transcriptional_regulator, YSIRK-targeted surface antigen transcriptional regulator	NA|1126aa|down_7|NZ_AP017629.1_121387_124765_+	pfam05738, Cna_B, Cna protein B-type domain	NA|222aa|down_8|NZ_AP017629.1_125030_125696_-	NA	NA|469aa|down_9|NZ_AP017629.1_126048_127455_+	COG2031, AtoE, Short chain fatty acids transporter [Lipid metabolism]
GCF_002355815.1_ASM235581v1	NZ_AP017629	Streptococcus pyogenes strain JMUB1235	2	791394-791694	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Type II-A,Type II-C,Type II-B	GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAACT,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAACT,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC	37,37,36	0	0	NA	NA	II-A:II-A:II-A	3,4,4	4	TypeII-A,TypeII-C,TypeII-B	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA|214aa|up_8|NZ_AP017629.1_780855_781497_+,NA	NA|452aa|up_9|NZ_AP017629.1_779376_780732_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_8|NZ_AP017629.1_780855_781497_+	NA	NA|377aa|up_7|NZ_AP017629.1_781559_782690_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_6|NZ_AP017629.1_782699_783452_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_5|NZ_AP017629.1_783451_784216_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|211aa|up_4|NZ_AP017629.1_784215_784848_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	cas9|1369aa|up_3|NZ_AP017629.1_785325_789432_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|290aa|up_2|NZ_AP017629.1_789431_790301_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|114aa|up_1|NZ_AP017629.1_790297_790639_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|221aa|up_0|NZ_AP017629.1_790628_791291_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|611aa|down_0|NZ_AP017629.1_792336_794169_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|389aa|down_1|NZ_AP017629.1_794344_795511_+	PHA03169, PHA03169, hypothetical protein; Provisional	NA|146aa|down_2|NZ_AP017629.1_795710_796148_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|340aa|down_3|NZ_AP017629.1_796277_797297_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_4|NZ_AP017629.1_797503_797929_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_5|NZ_AP017629.1_797947_798439_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|270aa|down_6|NZ_AP017629.1_798455_799265_+	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]	NA|276aa|down_7|NZ_AP017629.1_799261_800089_+	COG3716, ManZ, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID [Carbohydrate transport and metabolism]	NA|550aa|down_8|NZ_AP017629.1_800224_801874_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|263aa|down_9|NZ_AP017629.1_801877_802666_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]
GCF_002355815.1_ASM235581v1	NZ_AP017629	Streptococcus pyogenes strain JMUB1235	3	987471-987572	3	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	AATAATTGGTATAGTCTAATTATA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA|262aa|down_6|NZ_AP017629.1_995402_996188_+	NA|83aa|up_9|NZ_AP017629.1_975187_975436_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|773aa|up_8|NZ_AP017629.1_975798_978117_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|441aa|up_7|NZ_AP017629.1_978645_979968_+	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|412aa|up_6|NZ_AP017629.1_980087_981323_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|258aa|up_5|NZ_AP017629.1_981692_982466_-	pfam07373, CAMP_factor, CAMP factor (Cfa)	NA|279aa|up_4|NZ_AP017629.1_982835_983672_-	cd00996, PBP2_AatB_like, Polar amino acids-binding domain of ATP-binding cassette transporter-like systems that belong to the type 2 periplasmic binding fold protein superfamily	NA|210aa|up_3|NZ_AP017629.1_983687_984317_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|214aa|up_2|NZ_AP017629.1_984326_984968_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|112aa|up_1|NZ_AP017629.1_985074_985410_-	COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|605aa|up_0|NZ_AP017629.1_985605_987420_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|186aa|down_0|NZ_AP017629.1_987595_988153_-	TIGR02227, Inactive_signal_peptidase_IA	NA|501aa|down_1|NZ_AP017629.1_988370_989873_-	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|338aa|down_2|NZ_AP017629.1_989935_990949_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|1037aa|down_3|NZ_AP017629.1_991028_994139_-	PRK07279, dnaE, DNA polymerase III DnaE; Reviewed	NA|124aa|down_4|NZ_AP017629.1_994323_994695_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|233aa|down_5|NZ_AP017629.1_994694_995393_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|262aa|down_6|NZ_AP017629.1_995402_996188_+	NA	NA|205aa|down_7|NZ_AP017629.1_996318_996933_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|209aa|down_8|NZ_AP017629.1_997666_998293_+	pfam12978, DUF3862, Domain of Unknown Function with PDB structure (DUF3862)	NA|755aa|down_9|NZ_AP017629.1_998549_1000814_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins
GCF_002355815.1_ASM235581v1	NZ_AP017629	Streptococcus pyogenes strain JMUB1235	4	1182157-1182520	2,4,2	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Type I-C,Type I-U, Type I-U?	TATTTCAATCCACTCACCCATGAAGGGTGAGAC,ATTTCAATCCACTCACCCATGAAGGGTGAGAC,ATTTCAATCCACTCACCCATGAAGGGTGAGAC	33,32,32	0	0	NA	NA	I-C:I-C:I-C	5,5,5	5	TypeI-C,TypeI-U,TypeI-U?	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA	NA|412aa|up_9|NZ_AP017629.1_1172092_1173328_-	PRK01388, PRK01388, arginine deiminase; Provisional	NA|227aa|up_8|NZ_AP017629.1_1173601_1174282_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|158aa|up_7|NZ_AP017629.1_1174423_1174897_+	COG1438, ArgR, Arginine repressor [Transcription]	NA|239aa|up_6|NZ_AP017629.1_1175062_1175779_-	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|360aa|up_5|NZ_AP017629.1_1175792_1176872_-	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|578aa|up_4|NZ_AP017629.1_1176944_1178678_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|247aa|up_3|NZ_AP017629.1_1178674_1179415_-	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|369aa|up_2|NZ_AP017629.1_1179502_1180609_-	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|208aa|up_1|NZ_AP017629.1_1180651_1181275_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|237aa|up_0|NZ_AP017629.1_1181287_1181998_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	cas2|98aa|down_0|NZ_AP017629.1_1182668_1182962_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|342aa|down_1|NZ_AP017629.1_1182972_1183998_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|225aa|down_2|NZ_AP017629.1_1183994_1184669_-	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas7|283aa|down_3|NZ_AP017629.1_1184670_1185519_-	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas8c|632aa|down_4|NZ_AP017629.1_1185523_1187419_-	TIGR01863, CRISPR-associated_protein_CT1133_family, CRISPR-associated protein Cas8c/Csd1, subtype I-C/DVULG	cas5|243aa|down_5|NZ_AP017629.1_1187418_1188147_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|803aa|down_6|NZ_AP017629.1_1188279_1190688_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|883aa|down_7|NZ_AP017629.1_1190841_1193490_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|188aa|down_8|NZ_AP017629.1_1193491_1194055_-	pfam13238, AAA_18, AAA domain	NA|132aa|down_9|NZ_AP017629.1_1194655_1195051_-	PRK07758, PRK07758, hypothetical protein; Provisional
