assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009676645.1_ASM967664v1	NZ_CP046040	Streptococcus equi subsp. zooepidemicus strain OH-71905 chromosome, complete genome	1	500651-501352	1	CRT	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	CTNAGGTTNNGGNTTAGGCTCAGG	24	2	7	500813-500836|500813-500836|500813-500836|500813-500836|500813-500836|501191-501208|501191-501208	NZ_CP046040.1_1545493-1545470|NZ_CP046040.1_925007-924984|NZ_CP046040.1_925031-925008|NZ_CP046040.1_925109-925086|NZ_CP046040.1_925187-925164|NZ_CP046040.1_485948-485965|NZ_CP046040.1_1545481-1545464	NA	11	11	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|229aa|up_3|NZ_CP046040.1_496457_497144_+,NA|362aa|down_2|NZ_CP046040.1_504976_506062_-,NA|560aa|down_9|NZ_CP046040.1_515584_517264_+	NA|86aa|up_9|NZ_CP046040.1_491254_491512_+	PRK02539, PRK02539, DUF896 family protein	NA|55aa|up_8|NZ_CP046040.1_492040_492205_+	PLN02866, PLN02866, phospholipase D	NA|317aa|up_7|NZ_CP046040.1_492432_493383_+	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|355aa|up_6|NZ_CP046040.1_493379_494444_+	TIGR03603, cyclo_dehy_ocin, thiazole/oxazole-forming peptide maturase, SagC family component	NA|453aa|up_5|NZ_CP046040.1_494456_495815_+	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|224aa|up_4|NZ_CP046040.1_495789_496461_+	pfam02517, Abi, CAAX protease self-immunity	NA|229aa|up_3|NZ_CP046040.1_496457_497144_+	NA	NA|308aa|up_2|NZ_CP046040.1_497166_498090_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|376aa|up_1|NZ_CP046040.1_498098_499226_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|373aa|up_0|NZ_CP046040.1_499222_500341_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|97aa|down_0|NZ_CP046040.1_502372_502663_-	NF033186, internalin_K, class 1 internalin InlK	NA|623aa|down_1|NZ_CP046040.1_502693_504562_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|362aa|down_2|NZ_CP046040.1_504976_506062_-	NA	NA|210aa|down_3|NZ_CP046040.1_506535_507165_-	pfam03932, CutC, CutC family	NA|185aa|down_4|NZ_CP046040.1_507375_507930_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|171aa|down_5|NZ_CP046040.1_508386_508899_+	COG0350, Ada, Methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|118aa|down_6|NZ_CP046040.1_508902_509256_+	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|276aa|down_7|NZ_CP046040.1_509331_510159_-	cd09087, Ape1-like_AP-endo, Human Ape1-like subfamily of the ExoIII family apurinic/apyrimidinic (AP) endonucleases	NA|1633aa|down_8|NZ_CP046040.1_510511_515410_+	cd07475, Peptidases_S8_C5a_Peptidase, Peptidase S8 family domain in Streptococcal C5a peptidases	NA|560aa|down_9|NZ_CP046040.1_515584_517264_+	NA
GCF_009676645.1_ASM967664v1	NZ_CP046040	Streptococcus equi subsp. zooepidemicus strain OH-71905 chromosome, complete genome	2	563488-564773	1,1,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	 Type I-U?,Type I-C,Type I-U	GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT	32,32,32	0	0	NA	NA	I-C:I-C:I-C	16,19,19	19	TypeI-U,TypeI-U?,TypeI-C	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|139aa|up_9|NZ_CP046040.1_552703_553120_+,NA	NA|139aa|up_9|NZ_CP046040.1_552703_553120_+	NA	NA|140aa|up_8|NZ_CP046040.1_553349_553769_+	pfam14021, TNT, Tuberculosis necrotizing toxin	NA|107aa|up_7|NZ_CP046040.1_553768_554089_+	pfam15597, Imm59, Immunity protein 59	cas3|818aa|up_6|NZ_CP046040.1_554679_557133_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_5|NZ_CP046040.1_557889_558609_+	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas8c|628aa|up_4|NZ_CP046040.1_558608_560492_+	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|283aa|up_3|NZ_CP046040.1_560492_561341_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|224aa|up_2|NZ_CP046040.1_561342_562014_+	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas1|342aa|up_1|NZ_CP046040.1_562010_563036_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|NZ_CP046040.1_563046_563340_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|down_0|NZ_CP046040.1_566211_566922_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|down_1|NZ_CP046040.1_566935_567586_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|down_2|NZ_CP046040.1_567603_568716_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|down_3|NZ_CP046040.1_568983_569724_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|down_4|NZ_CP046040.1_569720_571439_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|down_5|NZ_CP046040.1_571478_572567_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|down_6|NZ_CP046040.1_572581_573286_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|down_7|NZ_CP046040.1_573522_574011_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_8|NZ_CP046040.1_574654_575128_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_9|NZ_CP046040.1_575268_575949_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
GCF_009676645.1_ASM967664v1	NZ_CP046040	Streptococcus equi subsp. zooepidemicus strain OH-71905 chromosome, complete genome	3	574368-574523	2	CRISPRCasFinder	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	 Type I-U?,Type I-C,Type I-U	CTCCTTTTGCAGGAGTGTGGATTG	24	0	0	NA	NA	NA	2	2	TypeI-U,TypeI-U?,TypeI-C	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA,NA	cas1|342aa|up_9|NZ_CP046040.1_562010_563036_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_8|NZ_CP046040.1_563046_563340_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|up_7|NZ_CP046040.1_566211_566922_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|up_6|NZ_CP046040.1_566935_567586_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|up_5|NZ_CP046040.1_567603_568716_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|up_4|NZ_CP046040.1_568983_569724_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|up_3|NZ_CP046040.1_569720_571439_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|up_2|NZ_CP046040.1_571478_572567_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|up_1|NZ_CP046040.1_572581_573286_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|up_0|NZ_CP046040.1_573522_574011_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_0|NZ_CP046040.1_574654_575128_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_1|NZ_CP046040.1_575268_575949_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|412aa|down_2|NZ_CP046040.1_576217_577453_+	PRK01388, PRK01388, arginine deiminase; Provisional	NA|144aa|down_3|NZ_CP046040.1_577462_577894_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|338aa|down_4|NZ_CP046040.1_577908_578922_+	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|498aa|down_5|NZ_CP046040.1_579083_580577_+	COG1288, COG1288, Predicted membrane protein [Function unknown]	NA|444aa|down_6|NZ_CP046040.1_580593_581925_+	PRK07205, PRK07205, hypothetical protein; Provisional	NA|317aa|down_7|NZ_CP046040.1_581942_582893_+	PRK12353, PRK12353, putative amino acid kinase; Reviewed	NA|331aa|down_8|NZ_CP046040.1_583111_584104_+	COG2502, AsnA, Asparagine synthetase A [Amino acid transport and metabolism]	NA|180aa|down_9|NZ_CP046040.1_584273_584813_+	pfam03602, Cons_hypoth95, Conserved hypothetical protein 95
GCF_009676645.1_ASM967664v1	NZ_CP046040	Streptococcus equi subsp. zooepidemicus strain OH-71905 chromosome, complete genome	4	931808-931917	3	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	CTGACTTGATGACAGCTTATACTAAAA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|183aa|up_3|NZ_CP046040.1_925563_926112_-,NA|96aa|up_2|NZ_CP046040.1_926104_926392_-,NA|152aa|up_1|NZ_CP046040.1_926391_926847_-,NA|714aa|down_5|NZ_CP046040.1_936081_938223_+	NA|192aa|up_9|NZ_CP046040.1_917744_918320_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|657aa|up_8|NZ_CP046040.1_918406_920377_+	pfam05738, Cna_B, Cna protein B-type domain	NA|481aa|up_7|NZ_CP046040.1_920391_921834_+	pfam16569, GramPos_pilinBB, Gram-positive pilin backbone subunit 2, Cna-B-like domain	NA|270aa|up_6|NZ_CP046040.1_921914_922724_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|393aa|up_5|NZ_CP046040.1_923038_924217_+	pfam09028, Mac-1, Mac 1	NA|363aa|up_4|NZ_CP046040.1_924296_925385_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|183aa|up_3|NZ_CP046040.1_925563_926112_-	NA	NA|96aa|up_2|NZ_CP046040.1_926104_926392_-	NA	NA|152aa|up_1|NZ_CP046040.1_926391_926847_-	NA	NA|1239aa|up_0|NZ_CP046040.1_927952_931669_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|180aa|down_0|NZ_CP046040.1_931957_932497_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|212aa|down_1|NZ_CP046040.1_932486_933122_+	pfam13490, zf-HC2, Putative zinc-finger	NA|232aa|down_2|NZ_CP046040.1_933249_933945_+	cd07750, PolyPPase_VTC_like, Polyphosphate(polyP) polymerase domain of yeast vacuolar transport chaperone (VTC) proteins VTC-2, -3 and- 4, and similar proteins	NA|226aa|down_3|NZ_CP046040.1_933958_934636_+	pfam16316, DUF4956, Domain of unknown function (DUF4956)	NA|478aa|down_4|NZ_CP046040.1_934641_936075_+	pfam08757, CotH, CotH kinase protein	NA|714aa|down_5|NZ_CP046040.1_936081_938223_+	NA	NA|465aa|down_6|NZ_CP046040.1_938370_939765_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|1075aa|down_7|NZ_CP046040.1_939975_943200_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1214aa|down_8|NZ_CP046040.1_943186_946828_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|375aa|down_9|NZ_CP046040.1_947090_948215_+	PRK07324, PRK07324, transaminase; Validated
