assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900475045.1_42290_B02	NZ_LS483346	Streptococcus sanguinis strain NCTC11085 chromosome 1	1	289049-289161	1	CRISPRCasFinder	no	DEDDh,WYL	cas3,WYL,DEDDh,csa3,DinG,csm6	Unclear	CTTAATGAGACAAGCCGTGGGCGCAGTCG	29	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,csa3,DinG,csm6	NA|95aa|up_7|NZ_LS483346.1_283957_284242_+,NA|121aa|up_5|NZ_LS483346.1_285392_285755_+,NA|137aa|up_2|NZ_LS483346.1_287652_288063_+,NA|47aa|up_1|NZ_LS483346.1_288195_288336_-,NA|86aa|up_0|NZ_LS483346.1_288721_288979_+,NA|183aa|down_7|NZ_LS483346.1_297660_298209_-	NA|255aa|up_9|NZ_LS483346.1_280335_281100_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|713aa|up_8|NZ_LS483346.1_281557_283696_+	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|95aa|up_7|NZ_LS483346.1_283957_284242_+	NA	NA|91aa|up_6|NZ_LS483346.1_285107_285380_+	TIGR04197, conserved_hypothetical_protein, type VII secretion effector, SACOL2603 family	NA|121aa|up_5|NZ_LS483346.1_285392_285755_+	NA	NA|527aa|up_4|NZ_LS483346.1_285744_287325_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|107aa|up_3|NZ_LS483346.1_287342_287663_+	pfam13780, DUF4176, Domain of unknown function (DUF4176)	NA|137aa|up_2|NZ_LS483346.1_287652_288063_+	NA	NA|47aa|up_1|NZ_LS483346.1_288195_288336_-	NA	NA|86aa|up_0|NZ_LS483346.1_288721_288979_+	NA	NA|659aa|down_0|NZ_LS483346.1_289442_291419_+	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|112aa|down_1|NZ_LS483346.1_291596_291932_+	PRK06531, yajC, preprotein translocase subunit YajC; Validated	NA|250aa|down_2|NZ_LS483346.1_292161_292911_+	PRK14830, PRK14830, undecaprenyl pyrophosphate synthase; Provisional	NA|268aa|down_3|NZ_LS483346.1_292929_293733_+	pfam01148, CTP_transf_1, Cytidylyltransferase family	NA|419aa|down_4|NZ_LS483346.1_293961_295218_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|617aa|down_5|NZ_LS483346.1_295283_297134_+	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|127aa|down_6|NZ_LS483346.1_297223_297604_+	pfam08000, bPH_1, Bacterial PH domain	NA|183aa|down_7|NZ_LS483346.1_297660_298209_-	NA	DEDDh|1463aa|down_8|NZ_LS483346.1_298375_302764_+	PRK00448, polC, DNA polymerase III PolC; Validated	NA|637aa|down_9|NZ_LS483346.1_302838_304749_+	TIGR02881, Stage_V_sporulation_protein_K, stage V sporulation protein K
GCF_900475045.1_42290_B02	NZ_LS483346	Streptococcus sanguinis strain NCTC11085 chromosome 1	2	1293219-1293306	2	CRISPRCasFinder	no	csm6	cas3,WYL,DEDDh,csa3,DinG,csm6	Type III-A	GAACCTTGGATTAAGGAGAACTCGC	25	0	0	NA	NA	NA	1	1	TypeIII-A	cas3,WYL,DEDDh,csa3,DinG,csm6	NA,NA	csm6|254aa|up_9|NZ_LS483346.1_1285262_1286024_-	pfam09659, Cas_Csm6, CRISPR-associated protein (Cas_Csm6)	NA|238aa|up_8|NZ_LS483346.1_1286020_1286734_-	PRK05819, deoD, DeoD-type purine-nucleoside phosphorylase	NA|271aa|up_7|NZ_LS483346.1_1287068_1287881_-	PRK08202, PRK08202, purine nucleoside phosphorylase; Provisional	NA|404aa|up_6|NZ_LS483346.1_1287910_1289122_-	PRK05362, PRK05362, phosphopentomutase; Provisional	NA|226aa|up_5|NZ_LS483346.1_1289135_1289813_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|458aa|up_4|NZ_LS483346.1_1289989_1291363_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|116aa|up_3|NZ_LS483346.1_1291587_1291935_-	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|109aa|up_2|NZ_LS483346.1_1292053_1292380_-	PRK14229, PRK14229, fluoride efflux transporter CrcB	NA|125aa|up_1|NZ_LS483346.1_1292376_1292751_-	PRK14221, PRK14221, fluoride efflux transporter CrcB	NA|88aa|up_0|NZ_LS483346.1_1292750_1293014_-	PRK07248, PRK07248, chorismate mutase	NA|461aa|down_0|NZ_LS483346.1_1293318_1294701_-	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB	NA|147aa|down_1|NZ_LS483346.1_1294825_1295266_-	PRK07308, PRK07308, flavodoxin; Validated	NA|312aa|down_2|NZ_LS483346.1_1295371_1296307_+	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|81aa|down_3|NZ_LS483346.1_1296414_1296657_+	PRK01678, rpmE2, type B 50S ribosomal protein L31	NA|495aa|down_4|NZ_LS483346.1_1296873_1298358_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|97aa|down_5|NZ_LS483346.1_1299505_1299796_-	cd02230, cupin_HP0902-like, Helicobacter pylori HP0902 and related proteins, cupin domain	NA|273aa|down_6|NZ_LS483346.1_1299825_1300644_-	cd14852, LD-carboxypeptidase, L,D-carboxypeptidase DacB and LdcB, and related proteins	NA|577aa|down_7|NZ_LS483346.1_1300624_1302355_-	COG4640, COG4640, Predicted membrane protein [Function unknown]	NA|346aa|down_8|NZ_LS483346.1_1302371_1303409_-	pfam09770, PAT1, Topoisomerase II-associated protein PAT1	NA|105aa|down_9|NZ_LS483346.1_1303507_1303822_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein
GCF_900475045.1_42290_B02	NZ_LS483346	Streptococcus sanguinis strain NCTC11085 chromosome 1	3	2017210-2017308	3	CRISPRCasFinder	no		cas3,WYL,DEDDh,csa3,DinG,csm6	Orphan	TTGCGCTATCTGCTATGTTATTTA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,csa3,DinG,csm6	NA|171aa|up_4|NZ_LS483346.1_2010398_2010911_+,NA|145aa|up_0|NZ_LS483346.1_2016183_2016618_-,NA|158aa|down_2|NZ_LS483346.1_2021973_2022447_+	NA|209aa|up_9|NZ_LS483346.1_2004019_2004646_+	COG3404, COG3404, Methenyl tetrahydrofolate cyclohydrolase [Amino acid transport and metabolism]	NA|558aa|up_8|NZ_LS483346.1_2004712_2006386_+	pfam01268, FTHFS, Formate--tetrahydrofolate ligase	NA|200aa|up_7|NZ_LS483346.1_2006398_2006998_+	COG3758, COG3758, Uncharacterized protein conserved in bacteria [Function unknown]	NA|451aa|up_6|NZ_LS483346.1_2007073_2008426_+	TIGR00909, putative_amino_acid_transporter, amino acid transporter	NA|514aa|up_5|NZ_LS483346.1_2008431_2009973_+	PRK09367, PRK09367, histidine ammonia-lyase; Provisional	NA|171aa|up_4|NZ_LS483346.1_2010398_2010911_+	NA	NA|336aa|up_3|NZ_LS483346.1_2011000_2012008_+	PRK13775, PRK13775, formimidoylglutamase; Provisional	NA|1019aa|up_2|NZ_LS483346.1_2012069_2015126_-	COG3629, DnrI, DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]	NA|274aa|up_1|NZ_LS483346.1_2015286_2016108_+	COG2339, prsW, Membrane proteinase, regulator of anti-sigma factor [Posttranslational modification, protein turnover, chaperones]	NA|145aa|up_0|NZ_LS483346.1_2016183_2016618_-	NA	NA|350aa|down_0|NZ_LS483346.1_2020035_2021085_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|304aa|down_1|NZ_LS483346.1_2021081_2021993_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|158aa|down_2|NZ_LS483346.1_2021973_2022447_+	NA	NA|201aa|down_3|NZ_LS483346.1_2022530_2023133_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|199aa|down_4|NZ_LS483346.1_2023129_2023726_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|281aa|down_5|NZ_LS483346.1_2023737_2024580_-	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|746aa|down_6|NZ_LS483346.1_2024808_2027046_-	COG3345, GalA, Alpha-galactosidase [Carbohydrate transport and metabolism]	NA|315aa|down_7|NZ_LS483346.1_2027154_2028099_-	cd06986, cupin_MmsR-like_N, AraC/XylS family transcriptional regulators similar to MmsR, N-terminal cupin domain	NA|293aa|down_8|NZ_LS483346.1_2028100_2028979_-	TIGR00676, 510-methylenetetrahydrofolate_reductase	NA|751aa|down_9|NZ_LS483346.1_2029116_2031369_-	PRK05222, PRK05222, 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; Provisional
GCF_900475045.1_42290_B02	NZ_LS483346	Streptococcus sanguinis strain NCTC11085 chromosome 1	4	2152663-2152763	4	CRISPRCasFinder	no		cas3,WYL,DEDDh,csa3,DinG,csm6	Orphan	TTTTCTGCTGTGATCCGAGCTACTTC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,csa3,DinG,csm6	NA|104aa|up_9|NZ_LS483346.1_2141843_2142155_-,NA|575aa|up_8|NZ_LS483346.1_2142136_2143861_-,NA|138aa|up_7|NZ_LS483346.1_2143835_2144249_-,NA|138aa|up_6|NZ_LS483346.1_2144353_2144767_-,NA|140aa|up_5|NZ_LS483346.1_2144868_2145288_-,NA	NA|104aa|up_9|NZ_LS483346.1_2141843_2142155_-	NA	NA|575aa|up_8|NZ_LS483346.1_2142136_2143861_-	NA	NA|138aa|up_7|NZ_LS483346.1_2143835_2144249_-	NA	NA|138aa|up_6|NZ_LS483346.1_2144353_2144767_-	NA	NA|140aa|up_5|NZ_LS483346.1_2144868_2145288_-	NA	NA|140aa|up_4|NZ_LS483346.1_2145356_2145776_-	TIGR01575, rimI, ribosomal-protein-alanine acetyltransferase	NA|449aa|up_3|NZ_LS483346.1_2145827_2147174_-	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|122aa|up_2|NZ_LS483346.1_2147205_2147571_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|175aa|up_1|NZ_LS483346.1_2147651_2148176_-	COG4129, COG4129, Predicted membrane protein [Function unknown]	NA|222aa|up_0|NZ_LS483346.1_2148352_2149018_-	COG3942, COG3942, Surface antigen [General function prediction only]	NA|399aa|down_0|NZ_LS483346.1_2153604_2154801_-	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|230aa|down_1|NZ_LS483346.1_2155045_2155735_-	cd02432, Nodulin-21_like_1, Nodulin-21 and CCC1-related protein family	NA|834aa|down_2|NZ_LS483346.1_2155913_2158415_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|188aa|down_3|NZ_LS483346.1_2158579_2159143_+	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|365aa|down_4|NZ_LS483346.1_2159210_2160305_-	PRK09423, gldA, glycerol dehydrogenase; Provisional	NA|223aa|down_5|NZ_LS483346.1_2160350_2161019_-	PRK12656, PRK12656, fructose-6-phosphate aldolase; Reviewed	NA|814aa|down_6|NZ_LS483346.1_2161033_2163475_-	cd01677, PFL2_DhaB_BssA, Pyruvate formate lyase 2 and related enzymes	NA|434aa|down_7|NZ_LS483346.1_2163643_2164945_-	COG1455, CelB, Phosphotransferase system cellobiose-specific component IIC [Carbohydrate transport and metabolism]	NA|104aa|down_8|NZ_LS483346.1_2164962_2165274_-	COG1440, CelA, Phosphotransferase system cellobiose-specific component IIB [Carbohydrate transport and metabolism]	NA|108aa|down_9|NZ_LS483346.1_2165294_2165618_-	COG1447, CelC, Phosphotransferase system cellobiose-specific component IIA [Carbohydrate transport and metabolism]
