assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004124075.1_ASM412407v1	CP033337	Streptococcus pyogenes strain TSPY453 chromosome, complete genome	1	828844-829062	1	CRT	no		cas3,DinG,csm6,RT,DEDDh,csa3	Orphan	TGTTTTGAATGGTCCCAAAAC	21	0	0	NA	NA	II-A	3	3	Orphan	cas3,DinG,csm6,RT,DEDDh,csa3	NA|214aa|up_4|CP033337.1_824683_825325_+,NA	NA|264aa|up_9|CP033337.1_819105_819897_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|446aa|up_8|CP033337.1_819896_821234_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|284aa|up_7|CP033337.1_821346_822198_+	COG1624, COG1624, Uncharacterized conserved protein [Function unknown]	NA|319aa|up_6|CP033337.1_822194_823151_+	COG4856, COG4856, Uncharacterized protein conserved in bacteria [Function unknown]	NA|452aa|up_5|CP033337.1_823204_824560_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_4|CP033337.1_824683_825325_+	NA	NA|377aa|up_3|CP033337.1_825387_826518_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_2|CP033337.1_826527_827280_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_1|CP033337.1_827279_828044_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|211aa|up_0|CP033337.1_828043_828676_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	NA|60aa|down_0|CP033337.1_829192_829372_+	cd04413, NDPk_I, Nucleoside diphosphate kinase Group I (NDPk_I)-like: NDP kinase domains are present in a large family of structurally and functionally conserved proteins from bacteria to humans that generally catalyze the transfer of gamma-phosphates of a nucleoside triphosphate (NTP) donor onto a nucleoside diphosphate (NDP) acceptor through a phosphohistidine intermediate	NA|49aa|down_1|CP033337.1_829435_829582_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|611aa|down_2|CP033337.1_829705_831538_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|146aa|down_3|CP033337.1_833212_833650_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|340aa|down_4|CP033337.1_833779_834799_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_5|CP033337.1_835005_835431_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_6|CP033337.1_835457_835949_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|270aa|down_7|CP033337.1_835965_836775_+	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]	NA|276aa|down_8|CP033337.1_836771_837599_+	COG3716, ManZ, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID [Carbohydrate transport and metabolism]	NA|550aa|down_9|CP033337.1_837734_839384_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]
GCA_004124075.1_ASM412407v1	CP033337	Streptococcus pyogenes strain TSPY453 chromosome, complete genome	2	1338880-1338973	1	CRISPRCasFinder	no	cas3	cas3,DinG,csm6,RT,DEDDh,csa3	Unclear	AATCCACTCACCCGTGAAGGGTGAGAC	27	0	0	NA	NA	NA	1	1	Unclear	cas3,DinG,csm6,RT,DEDDh,csa3	NA,NA|91aa|down_8|CP033337.1_1346374_1346647_-	NA|227aa|up_9|CP033337.1_1328148_1328829_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|158aa|up_8|CP033337.1_1328970_1329444_+	COG1438, ArgR, Arginine repressor [Transcription]	NA|360aa|up_7|CP033337.1_1330340_1331420_-	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|578aa|up_6|CP033337.1_1331492_1333226_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|247aa|up_5|CP033337.1_1333222_1333963_-	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|369aa|up_4|CP033337.1_1334050_1335157_-	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|208aa|up_3|CP033337.1_1335199_1335823_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|237aa|up_2|CP033337.1_1335835_1336546_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|311aa|up_1|CP033337.1_1336766_1337699_-	cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of the Escherichia coli CorA-Salmonella typhimurium ZntB family	NA|320aa|up_0|CP033337.1_1337790_1338750_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|60aa|down_0|CP033337.1_1339121_1339301_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas3|416aa|down_1|CP033337.1_1339323_1340571_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|883aa|down_2|CP033337.1_1340724_1343373_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|188aa|down_3|CP033337.1_1343374_1343938_-	pfam13238, AAA_18, AAA domain	NA|67aa|down_4|CP033337.1_1343934_1344135_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|132aa|down_5|CP033337.1_1344538_1344934_-	PRK07758, PRK07758, hypothetical protein; Provisional	NA|85aa|down_6|CP033337.1_1344951_1345206_-	pfam08930, DUF1912, Domain of unknown function (DUF1912)	NA|180aa|down_7|CP033337.1_1345518_1346058_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|91aa|down_8|CP033337.1_1346374_1346647_-	NA	NA|109aa|down_9|CP033337.1_1346696_1347023_-	pfam05595, DUF771, Domain of unknown function (DUF771)
