assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000219765.1_ASM21976v1	NC_017582	Streptococcus equi subsp. zooepidemicus ATCC 35246, complete sequence	1	561748-562962	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Type I-C, Type I-U?,Type I-U	GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT	32,32,32	0	0	NA	NA	I-C:I-C:I-C	15,18,18	18	TypeI-C,TypeI-U?,TypeI-U	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|139aa|up_9|NC_017582.1_550963_551380_+,NA	NA|139aa|up_9|NC_017582.1_550963_551380_+	NA	NA|181aa|up_8|NC_017582.1_551486_552029_+	pfam14021, TNT, Tuberculosis necrotizing toxin	NA|107aa|up_7|NC_017582.1_552028_552349_+	pfam15597, Imm59, Immunity protein 59	cas3|818aa|up_6|NC_017582.1_552939_555393_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|248aa|up_5|NC_017582.1_556125_556869_+	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas8c|628aa|up_4|NC_017582.1_556868_558752_+	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|283aa|up_3|NC_017582.1_558752_559601_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|224aa|up_2|NC_017582.1_559602_560274_+	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas1|342aa|up_1|NC_017582.1_560270_561296_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|NC_017582.1_561306_561600_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|down_0|NC_017582.1_564400_565111_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|down_1|NC_017582.1_565124_565775_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|down_2|NC_017582.1_565792_566905_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|down_3|NC_017582.1_567172_567913_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|down_4|NC_017582.1_567909_569628_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|down_5|NC_017582.1_569667_570756_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|down_6|NC_017582.1_570770_571475_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|down_7|NC_017582.1_571711_572200_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_8|NC_017582.1_572843_573317_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_9|NC_017582.1_573457_574138_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
GCF_000219765.1_ASM21976v1	NC_017582	Streptococcus equi subsp. zooepidemicus ATCC 35246, complete sequence	2	572557-572712	2	CRISPRCasFinder	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Type I-C, Type I-U?,Type I-U	CTCCTTTTGCAGGAGTGTGGATTG	24	0	0	NA	NA	NA	2	2	TypeI-C,TypeI-U?,TypeI-U	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA,NA	cas1|342aa|up_9|NC_017582.1_560270_561296_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_8|NC_017582.1_561306_561600_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|up_7|NC_017582.1_564400_565111_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|up_6|NC_017582.1_565124_565775_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|up_5|NC_017582.1_565792_566905_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|up_4|NC_017582.1_567172_567913_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|up_3|NC_017582.1_567909_569628_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|up_2|NC_017582.1_569667_570756_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|up_1|NC_017582.1_570770_571475_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|up_0|NC_017582.1_571711_572200_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_0|NC_017582.1_572843_573317_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_1|NC_017582.1_573457_574138_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|412aa|down_2|NC_017582.1_574406_575642_+	PRK01388, PRK01388, arginine deiminase; Provisional	NA|144aa|down_3|NC_017582.1_575651_576083_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|338aa|down_4|NC_017582.1_576097_577111_+	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|498aa|down_5|NC_017582.1_577272_578766_+	COG1288, COG1288, Predicted membrane protein [Function unknown]	NA|444aa|down_6|NC_017582.1_578782_580114_+	PRK07205, PRK07205, hypothetical protein; Provisional	NA|317aa|down_7|NC_017582.1_580131_581082_+	PRK12353, PRK12353, putative amino acid kinase; Reviewed	NA|331aa|down_8|NC_017582.1_581300_582293_+	COG2502, AsnA, Asparagine synthetase A [Amino acid transport and metabolism]	NA|180aa|down_9|NC_017582.1_582462_583002_+	pfam03602, Cons_hypoth95, Conserved hypothetical protein 95
GCF_000219765.1_ASM21976v1	NC_017582	Streptococcus equi subsp. zooepidemicus ATCC 35246, complete sequence	3	611563-611640	3	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	AGGCCCAGCAGGCCCTTGTTCGCC	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA,NA	NA|224aa|up_9|NC_017582.1_599848_600520_+	COG0325, COG0325, Predicted enzyme with a TIM-barrel fold [General function prediction only]	NA|221aa|up_8|NC_017582.1_600532_601195_+	COG1799, COG1799, Uncharacterized protein conserved in bacteria [Function unknown]	NA|86aa|up_7|NC_017582.1_601199_601457_+	COG0762, COG0762, Predicted integral membrane protein [Function unknown]	NA|265aa|up_6|NC_017582.1_601453_602248_+	COG2302, COG2302, Uncharacterized conserved protein, contains S4-like domain [Function unknown]	NA|252aa|up_5|NC_017582.1_602257_603013_+	pfam05103, DivIVA, DivIVA protein	NA|933aa|up_4|NC_017582.1_603283_606082_+	PRK13804, ileS, isoleucyl-tRNA synthetase; Provisional	NA|101aa|up_3|NC_017582.1_606550_606853_-	pfam08860, DUF1827, Domain of unknown function (DUF1827)	NA|149aa|up_2|NC_017582.1_606914_607361_-	cd04684, Nudix_Hydrolase_25, Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function	NA|752aa|up_1|NC_017582.1_607508_609764_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|77aa|up_0|NC_017582.1_610045_610276_+	COG4703, COG4703, Uncharacterized protein conserved in bacteria [Function unknown]	NA|228aa|down_0|NC_017582.1_612229_612913_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|246aa|down_1|NC_017582.1_612914_613652_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|565aa|down_2|NC_017582.1_614098_615793_+	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|285aa|down_3|NC_017582.1_615904_616759_+	PRK14179, PRK14179, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase	NA|285aa|down_4|NC_017582.1_616755_617610_+	cd01171, YXKO-related, B	NA|447aa|down_5|NC_017582.1_617893_619234_+	PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed	NA|72aa|down_6|NC_017582.1_619211_619427_+	PRK00977, PRK00977, exodeoxyribonuclease VII small subunit; Provisional	NA|290aa|down_7|NC_017582.1_619426_620296_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|276aa|down_8|NC_017582.1_620288_621116_+	COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|157aa|down_9|NC_017582.1_621102_621573_+	COG1438, ArgR, Arginine repressor [Transcription]
GCF_000219765.1_ASM21976v1	NC_017582	Streptococcus equi subsp. zooepidemicus ATCC 35246, complete sequence	4	902920-903105	2	CRT	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	CCAAAACCNGAACCTAAG	18	2	22	902938-902955|902938-902955|902938-902955|902938-902955|902938-902955|902938-902955|902938-902955|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039|903022-903039	NC_017582.1_517015-517032|NC_017582.1_517039-517056|NC_017582.1_517087-517104|NC_017582.1_516967-516984|NC_017582.1_516991-517008|NC_017582.1_517063-517080|NC_017582.1_1530348-1530365|NC_017582.1_499800-499783|NC_017582.1_517003-517020|NC_017582.1_517027-517044|NC_017582.1_517075-517092|NC_017582.1_499512-499495|NC_017582.1_190487-190504|NC_017582.1_338607-338590|NC_017582.1_499494-499477|NC_017582.1_499542-499525|NC_017582.1_499602-499585|NC_017582.1_499722-499705|NC_017582.1_499782-499765|NC_017582.1_499830-499813|NC_017582.1_1529128-1529145|NC_017582.1_1530336-1530353	NA	4	4	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|53aa|up_4|NC_017582.1_895214_895373_+,NA|183aa|down_6|NC_017582.1_911271_911820_-,NA|96aa|down_7|NC_017582.1_911812_912100_-,NA|152aa|down_8|NC_017582.1_912099_912555_-	NA|139aa|up_9|NC_017582.1_891977_892394_+	PRK00571, atpC, F0F1 ATP synthase subunit epsilon; Validated	NA|79aa|up_8|NC_017582.1_892527_892764_+	TIGR02327, conserved_hypothetical_protein, conserved hypothetical integral membrane protein	NA|424aa|up_7|NC_017582.1_892827_894099_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|63aa|up_6|NC_017582.1_894102_894291_+	pfam11772, EpuA, DNA-directed RNA polymerase subunit beta	NA|293aa|up_5|NC_017582.1_894322_895201_+	smart00892, Endonuclease_NS, DNA/RNA non-specific endonuclease	NA|53aa|up_4|NC_017582.1_895214_895373_+	NA	NA|348aa|up_3|NC_017582.1_896112_897156_+	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|802aa|up_2|NC_017582.1_897345_899751_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|359aa|up_1|NC_017582.1_899985_901062_+	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|235aa|up_0|NC_017582.1_901071_901776_+	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|192aa|down_0|NC_017582.1_903416_903992_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|657aa|down_1|NC_017582.1_904078_906049_+	pfam05738, Cna_B, Cna protein B-type domain	NA|481aa|down_2|NC_017582.1_906063_907506_+	pfam16569, GramPos_pilinBB, Gram-positive pilin backbone subunit 2, Cna-B-like domain	NA|270aa|down_3|NC_017582.1_907586_908396_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|393aa|down_4|NC_017582.1_908710_909889_+	pfam09028, Mac-1, Mac 1	NA|375aa|down_5|NC_017582.1_909968_911093_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|183aa|down_6|NC_017582.1_911271_911820_-	NA	NA|96aa|down_7|NC_017582.1_911812_912100_-	NA	NA|152aa|down_8|NC_017582.1_912099_912555_-	NA	NA|1239aa|down_9|NC_017582.1_913660_917377_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive
GCF_000219765.1_ASM21976v1	NC_017582	Streptococcus equi subsp. zooepidemicus ATCC 35246, complete sequence	5	917516-917625	4	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	CTGACTTGATGACAGCTTATACTAAAA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|183aa|up_3|NC_017582.1_911271_911820_-,NA|96aa|up_2|NC_017582.1_911812_912100_-,NA|152aa|up_1|NC_017582.1_912099_912555_-,NA|714aa|down_5|NC_017582.1_921789_923931_+	NA|192aa|up_9|NC_017582.1_903416_903992_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|657aa|up_8|NC_017582.1_904078_906049_+	pfam05738, Cna_B, Cna protein B-type domain	NA|481aa|up_7|NC_017582.1_906063_907506_+	pfam16569, GramPos_pilinBB, Gram-positive pilin backbone subunit 2, Cna-B-like domain	NA|270aa|up_6|NC_017582.1_907586_908396_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|393aa|up_5|NC_017582.1_908710_909889_+	pfam09028, Mac-1, Mac 1	NA|375aa|up_4|NC_017582.1_909968_911093_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|183aa|up_3|NC_017582.1_911271_911820_-	NA	NA|96aa|up_2|NC_017582.1_911812_912100_-	NA	NA|152aa|up_1|NC_017582.1_912099_912555_-	NA	NA|1239aa|up_0|NC_017582.1_913660_917377_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|180aa|down_0|NC_017582.1_917665_918205_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|212aa|down_1|NC_017582.1_918194_918830_+	pfam13490, zf-HC2, Putative zinc-finger	NA|232aa|down_2|NC_017582.1_918957_919653_+	cd07750, PolyPPase_VTC_like, Polyphosphate(polyP) polymerase domain of yeast vacuolar transport chaperone (VTC) proteins VTC-2, -3 and- 4, and similar proteins	NA|226aa|down_3|NC_017582.1_919666_920344_+	pfam16316, DUF4956, Domain of unknown function (DUF4956)	NA|478aa|down_4|NC_017582.1_920349_921783_+	pfam08757, CotH, CotH kinase protein	NA|714aa|down_5|NC_017582.1_921789_923931_+	NA	NA|467aa|down_6|NC_017582.1_924072_925473_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|1075aa|down_7|NC_017582.1_925683_928908_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1214aa|down_8|NC_017582.1_928894_932536_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|375aa|down_9|NC_017582.1_932798_933923_+	PRK07324, PRK07324, transaminase; Validated
