assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900475925.1_46514_A01	NZ_LS483430	Streptococcus pyogenes strain NCTC12044 chromosome 1	1	760113-760545	1,1	CRISPRCasFinder,PILER-CR	no	cas9,cas1,cas2,csn2	cas3,DinG,csm6,cas9,cas1,cas2,csn2,DEDDh,csa3	Type II-B,Type II-C,Type II-A	TGTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC	37,36	0	0	NA	NA	II-A:II-A	6,5	6	TypeII-B,TypeII-C,TypeII-A	cas3,DinG,csm6,cas9,cas1,cas2,csn2,DEDDh,csa3	NA|214aa|up_8|NZ_LS483430.1_749577_750219_+,NA	NA|452aa|up_9|NZ_LS483430.1_748098_749454_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_8|NZ_LS483430.1_749577_750219_+	NA	NA|377aa|up_7|NZ_LS483430.1_750281_751412_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_6|NZ_LS483430.1_751421_752174_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_5|NZ_LS483430.1_752173_752938_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|211aa|up_4|NZ_LS483430.1_752937_753570_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	cas9|1368aa|up_3|NZ_LS483430.1_754048_758152_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|290aa|up_2|NZ_LS483430.1_758151_759021_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|114aa|up_1|NZ_LS483430.1_759017_759359_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|221aa|up_0|NZ_LS483430.1_759348_760011_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|611aa|down_0|NZ_LS483430.1_761189_763022_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|540aa|down_1|NZ_LS483430.1_763170_764790_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|146aa|down_2|NZ_LS483430.1_764975_765413_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|340aa|down_3|NZ_LS483430.1_765541_766561_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_4|NZ_LS483430.1_766767_767193_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_5|NZ_LS483430.1_767211_767703_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|270aa|down_6|NZ_LS483430.1_767719_768529_+	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]	NA|276aa|down_7|NZ_LS483430.1_768525_769353_+	COG3716, ManZ, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID [Carbohydrate transport and metabolism]	NA|550aa|down_8|NZ_LS483430.1_769488_771138_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|263aa|down_9|NZ_LS483430.1_771141_771930_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]
GCF_900475925.1_46514_A01	NZ_LS483430	Streptococcus pyogenes strain NCTC12044 chromosome 1	2	954902-955003	2	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,DEDDh,csa3	Orphan	AATAATTGGTATAGTCTAATTATA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,DEDDh,csa3	NA,NA|262aa|down_6|NZ_LS483430.1_962833_963619_+	NA|83aa|up_9|NZ_LS483430.1_942617_942866_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|773aa|up_8|NZ_LS483430.1_943228_945547_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|441aa|up_7|NZ_LS483430.1_946076_947399_+	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|412aa|up_6|NZ_LS483430.1_947518_948754_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|258aa|up_5|NZ_LS483430.1_949123_949897_-	pfam07373, CAMP_factor, CAMP factor (Cfa)	NA|279aa|up_4|NZ_LS483430.1_950266_951103_-	cd00996, PBP2_AatB_like, Polar amino acids-binding domain of ATP-binding cassette transporter-like systems that belong to the type 2 periplasmic binding fold protein superfamily	NA|210aa|up_3|NZ_LS483430.1_951118_951748_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|214aa|up_2|NZ_LS483430.1_951757_952399_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|112aa|up_1|NZ_LS483430.1_952505_952841_-	COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|605aa|up_0|NZ_LS483430.1_953036_954851_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|186aa|down_0|NZ_LS483430.1_955026_955584_-	TIGR02227, Inactive_signal_peptidase_IA	NA|501aa|down_1|NZ_LS483430.1_955801_957304_-	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|338aa|down_2|NZ_LS483430.1_957366_958380_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|1037aa|down_3|NZ_LS483430.1_958459_961570_-	PRK07279, dnaE, DNA polymerase III DnaE; Reviewed	NA|124aa|down_4|NZ_LS483430.1_961754_962126_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|233aa|down_5|NZ_LS483430.1_962125_962824_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|262aa|down_6|NZ_LS483430.1_962833_963619_+	NA	NA|205aa|down_7|NZ_LS483430.1_963749_964364_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|207aa|down_8|NZ_LS483430.1_965104_965725_+	pfam12978, DUF3862, Domain of Unknown Function with PDB structure (DUF3862)	NA|755aa|down_9|NZ_LS483430.1_965981_968246_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins
GCF_900475925.1_46514_A01	NZ_LS483430	Streptococcus pyogenes strain NCTC12044 chromosome 1	3	1552840-1552925	3	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,DEDDh,csa3	Orphan	CCTGGTTGCTCTGGCATTTCTGG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,DEDDh,csa3	NA,NA	NA|250aa|up_9|NZ_LS483430.1_1538250_1539000_-	PRK14830, PRK14830, undecaprenyl pyrophosphate synthase; Provisional	NA|124aa|up_8|NZ_LS483430.1_1539218_1539590_-	PRK06531, yajC, preprotein translocase subunit YajC; Validated	NA|125aa|up_7|NZ_LS483430.1_1539705_1540080_-	TIGR01295, Pediocin_PA-1_biosynthesis_protein_PedC, bacteriocin transport accessory protein, putative	NA|1166aa|up_6|NZ_LS483430.1_1540194_1543692_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|538aa|up_5|NZ_LS483430.1_1543889_1545503_-	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|378aa|up_4|NZ_LS483430.1_1545631_1546765_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|283aa|up_3|NZ_LS483430.1_1547062_1547911_-	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|441aa|up_2|NZ_LS483430.1_1548250_1549573_+	pfam02821, Staphylokinase, Staphylokinase/Streptokinase family	NA|148aa|up_1|NZ_LS483430.1_1549670_1550114_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|740aa|up_0|NZ_LS483430.1_1550128_1552348_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|161aa|down_0|NZ_LS483430.1_1554240_1554723_+	PRK02551, PRK02551, flavoprotein NrdI; Provisional	NA|273aa|down_1|NZ_LS483430.1_1555108_1555927_-	cd09079, RgfB-like, Streptococcus agalactiae RgfB, part of a putative two component signal transduction system, and related proteins	NA|729aa|down_2|NZ_LS483430.1_1556009_1558196_-	TIGR02003, PTS_system_glucose-specific_IIBC_component, PTS system, IIBC component	NA|250aa|down_3|NZ_LS483430.1_1558552_1559302_-	COG1385, COG1385, Uncharacterized protein conserved in bacteria [Function unknown]	NA|318aa|down_4|NZ_LS483430.1_1559301_1560255_-	pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA)	NA|147aa|down_5|NZ_LS483430.1_1561446_1561887_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|110aa|down_6|NZ_LS483430.1_1561914_1562244_-	cd06555, ASCH_PF0470_like, ASC-1 homology domain, subfamily similar to Pyrococcus furiosus Pf0470	NA|157aa|down_7|NZ_LS483430.1_1562245_1562716_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|658aa|down_8|NZ_LS483430.1_1562888_1564862_+	PRK06529, PRK06529, amidase; Provisional	NA|586aa|down_9|NZ_LS483430.1_1565046_1566804_+	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]
