assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018965.1_ASM1896v1	NC_012468	Streptococcus pneumoniae 70585, complete sequence	1	137959-138054	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|74aa|up_9|NC_012468.1_128494_128716_+,NA	NA|74aa|up_9|NC_012468.1_128494_128716_+	NA	NA|310aa|up_8|NC_012468.1_129034_129964_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NC_012468.1_129977_130901_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NC_012468.1_131110_132586_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_012468.1_132857_133844_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NC_012468.1_133965_134826_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_012468.1_135003_136068_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NC_012468.1_136130_137042_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_012468.1_137034_137628_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_012468.1_137614_137941_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_012468.1_138079_139246_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|76aa|down_1|NC_012468.1_139303_139531_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|76aa|down_2|NC_012468.1_139600_139828_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|76aa|down_3|NC_012468.1_139897_140125_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|277aa|down_4|NC_012468.1_140291_141122_+	cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis	NA|617aa|down_5|NC_012468.1_141207_143058_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_6|NC_012468.1_143361_143994_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_7|NC_012468.1_144015_144888_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_8|NC_012468.1_144896_145568_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_9|NC_012468.1_145809_146397_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan
GCF_000018965.1_ASM1896v1	NC_012468	Streptococcus pneumoniae 70585, complete sequence	2	1461339-1461475	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	2	2	Orphan	cas3,DEDDh,DinG,RT	NA,NA|532aa|down_3|NC_012468.1_1469997_1471593_-	NA|120aa|up_9|NC_012468.1_1453401_1453761_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NC_012468.1_1453762_1455163_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NC_012468.1_1455457_1455697_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NC_012468.1_1455728_1456199_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NC_012468.1_1456210_1456501_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|449aa|up_4|NC_012468.1_1456653_1458000_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NC_012468.1_1458015_1458381_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NC_012468.1_1458373_1459561_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|163aa|up_1|NC_012468.1_1459557_1460046_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NC_012468.1_1460366_1460915_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|495aa|down_0|NC_012468.1_1467324_1468809_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NC_012468.1_1468859_1469588_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|55aa|down_2|NC_012468.1_1469666_1469831_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NC_012468.1_1469997_1471593_-	NA	NA|137aa|down_4|NC_012468.1_1471604_1472015_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NC_012468.1_1472130_1472922_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NC_012468.1_1472934_1475631_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NC_012468.1_1475934_1477119_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NC_012468.1_1477261_1479133_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|395aa|down_9|NC_012468.1_1479129_1480314_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
GCF_000018965.1_ASM1896v1	NC_012468	Streptococcus pneumoniae 70585, complete sequence	3	1804729-1804815	3	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	CTTCAACCCACTACAGTTGACAAAGAGCC	29	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA,NA|70aa|down_7|NC_012468.1_1814799_1815009_-	NA|243aa|up_9|NC_012468.1_1792324_1793053_-	PRK00104, scpA, segregation and condensation protein A; Reviewed	NA|247aa|up_8|NC_012468.1_1793052_1793793_-	PRK02436, xerD, site-specific tyrosine recombinase XerD	NA|154aa|up_7|NC_012468.1_1793783_1794245_-	cd04643, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|174aa|up_6|NC_012468.1_1794241_1794763_-	COG0622, COG0622, Predicted phosphoesterase [General function prediction only]	NA|324aa|up_5|NC_012468.1_1794738_1795710_-	PRK02491, PRK02491, putative deoxyribonucleotide triphosphate pyrophosphatase/unknown domain fusion protein; Reviewed	NA|265aa|up_4|NC_012468.1_1795706_1796501_-	PRK00865, PRK00865, glutamate racemase; Provisional	NA|83aa|up_3|NC_012468.1_1796707_1796956_-	COG3763, COG3763, Uncharacterized protein conserved in bacteria [Function unknown]	NA|542aa|up_2|NC_012468.1_1797211_1798837_-	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|656aa|up_1|NC_012468.1_1799014_1800982_-	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|237aa|up_0|NC_012468.1_1801166_1801877_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|309aa|down_0|NC_012468.1_1804856_1805783_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|356aa|down_1|NC_012468.1_1805793_1806861_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|down_2|NC_012468.1_1806869_1807796_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|499aa|down_3|NC_012468.1_1807795_1809292_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|660aa|down_4|NC_012468.1_1809358_1811338_-	cd08504, PBP2_OppA, The substrate-binding component of an ABC-type oligopetide import system contains the type 2 periplasmic binding fold	NA|398aa|down_5|NC_012468.1_1812023_1813217_-	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane]	NA|481aa|down_6|NC_012468.1_1813354_1814797_-	TIGR03852, sucrose_gtfA, sucrose phosphorylase	NA|70aa|down_7|NC_012468.1_1814799_1815009_-	NA	NA|278aa|down_8|NC_012468.1_1815015_1815849_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|289aa|down_9|NC_012468.1_1815862_1816729_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]
