assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002094975.1_ASM209497v1	NZ_CP015283	Streptococcus salivarius strain ATCC 25975, complete genome	1	750817-750910	1	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	Orphan	TGTTCTAAAATTCGAGTTGGCAATTAA	27	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	NA,NA	NA|540aa|up_9|NZ_CP015283.1_742271_743891_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|282aa|up_8|NZ_CP015283.1_743975_744821_-	pfam14373, Imm_superinfect, Superinfection immunity protein	NA|102aa|up_7|NZ_CP015283.1_744910_745216_-	pfam10825, DUF2752, Protein of unknown function (DUF2752)	NA|111aa|up_6|NZ_CP015283.1_745215_745548_-	pfam05154, TM2, TM2 domain	NA|290aa|up_5|NZ_CP015283.1_745732_746602_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|776aa|up_4|NZ_CP015283.1_746667_748995_+	TIGR02074, Includes:_Penicillin-insensitive_transglycosylase, penicillin-binding protein, 1A family	NA|51aa|up_3|NZ_CP015283.1_749043_749196_+	PRK00504, rpmG, 50S ribosomal protein L33; Validated	NA|59aa|up_2|NZ_CP015283.1_749204_749381_+	PRK07597, secE, preprotein translocase subunit SecE; Reviewed	NA|180aa|up_1|NZ_CP015283.1_749581_750121_+	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|163aa|up_0|NZ_CP015283.1_750250_750739_+	cd01610, PAP2_like, PAP2_like proteins, a super-family of histidine phosphatases and vanadium haloperoxidases, includes type 2 phosphatidic acid phosphatase or lipid phosphate phosphatase (LPP), Glucose-6-phosphatase, Phosphatidylglycerophosphatase B and bacterial acid phosphatase, vanadium chloroperoxidases, vanadium bromoperoxidases, and several other mostly uncharacterized subfamilies	NA|270aa|down_0|NZ_CP015283.1_750937_751747_-	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|256aa|down_1|NZ_CP015283.1_751739_752507_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|834aa|down_2|NZ_CP015283.1_752700_755202_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|152aa|down_3|NZ_CP015283.1_755300_755756_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|138aa|down_4|NZ_CP015283.1_757187_757601_+	pfam11208, DUF2992, Protein of unknown function (DUF2992)	NA|289aa|down_5|NZ_CP015283.1_757949_758816_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|281aa|down_6|NZ_CP015283.1_758815_759658_+	pfam01061, ABC2_membrane, ABC-2 type transporter	NA|152aa|down_7|NZ_CP015283.1_759676_760132_+	pfam04397, LytTR, LytTr DNA-binding domain	NA|142aa|down_8|NZ_CP015283.1_760141_760567_+	pfam11457, DUF3021, Protein of unknown function (DUF3021)	NA|237aa|down_9|NZ_CP015283.1_760658_761369_+	cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA
GCF_002094975.1_ASM209497v1	NZ_CP015283	Streptococcus salivarius strain ATCC 25975, complete genome	2	1800136-1800207	2	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	Orphan	GGACCTAAAAAGGTCCACTGAAC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	NA|250aa|up_5|NZ_CP015283.1_1791534_1792284_-,NA|47aa|down_4|NZ_CP015283.1_1803939_1804080_-,NA|81aa|down_5|NZ_CP015283.1_1804383_1804626_-	NA|424aa|up_9|NZ_CP015283.1_1787070_1788342_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|79aa|up_8|NZ_CP015283.1_1788409_1788646_-	TIGR02327, conserved_hypothetical_protein, conserved hypothetical integral membrane protein	NA|368aa|up_7|NZ_CP015283.1_1788967_1790071_-	pfam01270, Glyco_hydro_8, Glycosyl hydrolases family 8	NA|437aa|up_6|NZ_CP015283.1_1790079_1791390_-	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|250aa|up_5|NZ_CP015283.1_1791534_1792284_-	NA	NA|384aa|up_4|NZ_CP015283.1_1792325_1793477_-	COG0381, WecB, UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane]	NA|329aa|up_3|NZ_CP015283.1_1796358_1797345_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|398aa|up_2|NZ_CP015283.1_1797410_1798604_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|313aa|up_1|NZ_CP015283.1_1798883_1799822_+	PRK11886, PRK11886, bifunctional biotin--[acetyl-CoA-carboxylase] ligase/biotin operon repressor BirA	NA|62aa|up_0|NZ_CP015283.1_1799808_1799994_-	pfam11676, DUF3272, Protein of unknown function (DUF3272)	NA|551aa|down_0|NZ_CP015283.1_1800211_1801864_-	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated	NA|170aa|down_1|NZ_CP015283.1_1801863_1802373_-	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|304aa|down_2|NZ_CP015283.1_1802397_1803309_-	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|59aa|down_3|NZ_CP015283.1_1803384_1803561_+	pfam11240, DUF3042, Protein of unknown function (DUF3042)	NA|47aa|down_4|NZ_CP015283.1_1803939_1804080_-	NA	NA|81aa|down_5|NZ_CP015283.1_1804383_1804626_-	NA	NA|238aa|down_6|NZ_CP015283.1_1804831_1805545_-	pfam04233, Phage_Mu_F, Phage Mu protein F like protein	NA|116aa|down_7|NZ_CP015283.1_1806258_1806606_-	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|95aa|down_8|NZ_CP015283.1_1807955_1808240_-	PRK07248, PRK07248, chorismate mutase	NA|512aa|down_9|NZ_CP015283.1_1808313_1809849_-	cd01031, EriC, ClC chloride channel EriC
GCF_002094975.1_ASM209497v1	NZ_CP015283	Streptococcus salivarius strain ATCC 25975, complete genome	3	2007705-2008925	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	Type II-C,Type II-B,Type II-A	GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	18,18,17	18	TypeII-C,TypeII-B,TypeII-A	WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	NA,NA|121aa|down_8|NZ_CP015283.1_2019019_2019382_-,NA|126aa|down_9|NZ_CP015283.1_2019386_2019764_-	NA|329aa|up_9|NZ_CP015283.1_1998896_1999883_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|147aa|up_8|NZ_CP015283.1_1999924_2000365_-	pfam12732, YtxH, YtxH-like protein	NA|133aa|up_7|NZ_CP015283.1_2000377_2000776_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|264aa|up_6|NZ_CP015283.1_2000900_2001692_-	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|310aa|up_5|NZ_CP015283.1_2001691_2002621_-	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|88aa|up_4|NZ_CP015283.1_2002748_2003012_-	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|147aa|up_3|NZ_CP015283.1_2003077_2003518_-	PRK04351, PRK04351, SprT family protein	NA|711aa|up_2|NZ_CP015283.1_2003504_2005637_-	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|301aa|up_1|NZ_CP015283.1_2005941_2006844_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|272aa|up_0|NZ_CP015283.1_2006843_2007659_+	COG3689, COG3689, Predicted membrane protein [Function unknown]	csn2|351aa|down_0|NZ_CP015283.1_2008989_2010042_-	cd12217, Stu0660_Csn2, Stu0660-like CRISPR/Cas system-associated protein Csn2	cas2|108aa|down_1|NZ_CP015283.1_2010038_2010362_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|304aa|down_2|NZ_CP015283.1_2010363_2011275_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1140aa|down_3|NZ_CP015283.1_2011451_2014871_-	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|554aa|down_4|NZ_CP015283.1_2015017_2016679_-	PRK11660, PRK11660, putative transporter; Provisional	NA|205aa|down_5|NZ_CP015283.1_2016926_2017541_-	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	NA|214aa|down_6|NZ_CP015283.1_2017636_2018278_-	cd01841, NnaC_like, NnaC (CMP-NeuNAc synthetase) _like subfamily of SGNH_hydrolases, a diverse family of lipases and esterases	NA|204aa|down_7|NZ_CP015283.1_2018304_2018916_-	pfam02517, Abi, CAAX protease self-immunity	NA|121aa|down_8|NZ_CP015283.1_2019019_2019382_-	NA	NA|126aa|down_9|NZ_CP015283.1_2019386_2019764_-	NA
GCF_002094975.1_ASM209497v1	NZ_CP015284	Streptococcus salivarius strain ATCC 25975 plasmid, complete sequence	1	10270-10493	1	CRISPRCasFinder	no			Orphan	CCATGTCAAATAGAACAGTAACAAAAC	27	0	0	NA	NA	NA	3	3	Orphan	WYL,DEDDh,cas3,csa3,DinG,csn2,cas2,cas1,cas9	NA|164aa|up_8|NZ_CP015284.1_4095_4587_-,NA|103aa|up_6|NZ_CP015284.1_5463_5772_-,NA|95aa|up_5|NZ_CP015284.1_5832_6117_-,NA|123aa|up_1|NZ_CP015284.1_8915_9284_-,NA|107aa|up_0|NZ_CP015284.1_9296_9617_-,NA|431aa|down_0|NZ_CP015284.1_10653_11946_-,NA|46aa|down_1|NZ_CP015284.1_12292_12430_-,NA|252aa|down_2|NZ_CP015284.1_12448_13204_-,NA|99aa|down_3|NZ_CP015284.1_13621_13918_-,NA|231aa|down_4|NZ_CP015284.1_13958_14651_-,NA|211aa|down_5|NZ_CP015284.1_14644_15277_-,NA|121aa|down_7|NZ_CP015284.1_16214_16577_-	NA|148aa|up_9|NZ_CP015284.1_3599_4043_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|164aa|up_8|NZ_CP015284.1_4095_4587_-	NA	NA|219aa|up_7|NZ_CP015284.1_4601_5258_-	PRK00024, PRK00024, DNA repair protein RadC	NA|103aa|up_6|NZ_CP015284.1_5463_5772_-	NA	NA|95aa|up_5|NZ_CP015284.1_5832_6117_-	NA	NA|337aa|up_4|NZ_CP015284.1_6481_7492_-	pfam04404, ERF, ERF superfamily	NA|283aa|up_3|NZ_CP015284.1_7547_8396_-	pfam12684, DUF3799, PDDEXK-like domain of unknown function (DUF3799)	NA|143aa|up_2|NZ_CP015284.1_8472_8901_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|123aa|up_1|NZ_CP015284.1_8915_9284_-	NA	NA|107aa|up_0|NZ_CP015284.1_9296_9617_-	NA	NA|431aa|down_0|NZ_CP015284.1_10653_11946_-	NA	NA|46aa|down_1|NZ_CP015284.1_12292_12430_-	NA	NA|252aa|down_2|NZ_CP015284.1_12448_13204_-	NA	NA|99aa|down_3|NZ_CP015284.1_13621_13918_-	NA	NA|231aa|down_4|NZ_CP015284.1_13958_14651_-	NA	NA|211aa|down_5|NZ_CP015284.1_14644_15277_-	NA	NA|266aa|down_6|NZ_CP015284.1_15276_16074_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|121aa|down_7|NZ_CP015284.1_16214_16577_-	NA	NA|334aa|down_8|NZ_CP015284.1_16594_17596_-	pfam13814, Replic_Relax, Replication-relaxation	NA|103aa|down_9|NZ_CP015284.1_18402_18711_+	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional
