assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900476475.1_49768_F01	NZ_LS483450	Streptococcus pneumoniae strain 4041STDY6583227 chromosome 1	1	95639-95734	1	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|117aa|up_9|NZ_LS483450.1_86124_86475_+,NA|56aa|down_3|NZ_LS483450.1_100118_100286_-,NA|107aa|down_8|NZ_LS483450.1_103408_103729_-	NA|117aa|up_9|NZ_LS483450.1_86124_86475_+	NA	NA|310aa|up_8|NZ_LS483450.1_86793_87723_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_LS483450.1_87736_88660_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_LS483450.1_88743_90219_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_LS483450.1_90490_91477_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_LS483450.1_91751_92612_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_LS483450.1_92683_93748_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NZ_LS483450.1_93810_94722_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_LS483450.1_94714_95308_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_LS483450.1_95294_95621_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_LS483450.1_95759_96926_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|387aa|down_1|NZ_LS483450.1_96983_98144_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_LS483450.1_98182_100033_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|56aa|down_3|NZ_LS483450.1_100118_100286_-	NA	NA|211aa|down_4|NZ_LS483450.1_100335_100968_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_5|NZ_LS483450.1_100989_101862_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_6|NZ_LS483450.1_101870_102542_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_7|NZ_LS483450.1_102783_103356_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_8|NZ_LS483450.1_103408_103729_-	NA	NA|96aa|down_9|NZ_LS483450.1_104153_104441_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family
GCF_900476475.1_49768_F01	NZ_LS483450	Streptococcus pneumoniae strain 4041STDY6583227 chromosome 1	2	128776-129357	1	CRT	no		cas3,DEDDh,DinG	Orphan	TTCATGGTANTACCTAAACGCTAANGGNGCTATGGCAACAGG	42	5	5	128818-128835|128938-128955|128998-129015|129118-129135|129238-129255	NZ_LS483450.1_128758-128775|NZ_LS483450.1_128758-128775|NZ_LS483450.1_128758-128775|NZ_LS483450.1_1864605-1864622|NZ_LS483450.1_1298523-1298506	NA	9	9	Orphan	cas3,DEDDh,DinG	NA|118aa|up_9|NZ_LS483450.1_117723_118077_+,NA|243aa|up_8|NZ_LS483450.1_118073_118802_+,NA|71aa|up_7|NZ_LS483450.1_118955_119168_+,NA|353aa|up_6|NZ_LS483450.1_119972_121031_+,NA|326aa|up_5|NZ_LS483450.1_121345_122323_+,NA|125aa|up_4|NZ_LS483450.1_122700_123075_+,NA|301aa|up_3|NZ_LS483450.1_123282_124185_+,NA|118aa|up_2|NZ_LS483450.1_124240_124594_+,NA|243aa|up_1|NZ_LS483450.1_124590_125319_+,NA|74aa|down_5|NZ_LS483450.1_136218_136440_+,NA|51aa|down_6|NZ_LS483450.1_136652_136805_-	NA|118aa|up_9|NZ_LS483450.1_117723_118077_+	NA	NA|243aa|up_8|NZ_LS483450.1_118073_118802_+	NA	NA|71aa|up_7|NZ_LS483450.1_118955_119168_+	NA	NA|353aa|up_6|NZ_LS483450.1_119972_121031_+	NA	NA|326aa|up_5|NZ_LS483450.1_121345_122323_+	NA	NA|125aa|up_4|NZ_LS483450.1_122700_123075_+	NA	NA|301aa|up_3|NZ_LS483450.1_123282_124185_+	NA	NA|118aa|up_2|NZ_LS483450.1_124240_124594_+	NA	NA|243aa|up_1|NZ_LS483450.1_124590_125319_+	NA	NA|80aa|up_0|NZ_LS483450.1_126453_126693_+	pfam18813, PBECR4, phage-Barnase-EndoU-ColicinE5/D-RelE like nuclease4	NA|374aa|down_0|NZ_LS483450.1_129929_131051_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|152aa|down_1|NZ_LS483450.1_131191_131647_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|638aa|down_2|NZ_LS483450.1_131656_133570_+	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|560aa|down_3|NZ_LS483450.1_133922_135602_-	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|78aa|down_4|NZ_LS483450.1_135603_135837_-	PRK13667, PRK13667, hypothetical protein; Provisional	NA|74aa|down_5|NZ_LS483450.1_136218_136440_+	NA	NA|51aa|down_6|NZ_LS483450.1_136652_136805_-	NA	NA|62aa|down_7|NZ_LS483450.1_136806_136992_-	TIGR03949, bact_IIb_cerein, class IIb bacteriocin, lactobin A/cerein 7B family	NA|228aa|down_8|NZ_LS483450.1_137169_137853_+	COG1214, COG1214, Inactive homolog of metal-dependent proteases, putative molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|146aa|down_9|NZ_LS483450.1_137849_138287_+	TIGR01575, rimI, ribosomal-protein-alanine acetyltransferase
GCF_900476475.1_49768_F01	NZ_LS483450	Streptococcus pneumoniae strain 4041STDY6583227 chromosome 1	3	793502-793587	2	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	AGCTAAGCTTGAGAAAGGACAAATTTCG	28	1	1	793530-793559	NZ_LS483450.1_1818084-1818055	NA	1	1	Orphan	cas3,DEDDh,DinG	NA,NA	NA|214aa|up_9|NZ_LS483450.1_781600_782242_-	PRK00220, PRK00220, glycerol-3-phosphate 1-O-acyltransferase PlsY	NA|648aa|up_8|NZ_LS483450.1_782377_784321_+	TIGR01058, DNA_topoisomerase_4_subunit_B, DNA topoisomerase IV, B subunit, Gram-positive	NA|824aa|up_7|NZ_LS483450.1_784740_787212_+	TIGR01061, DNA_topoisomerase_4_subunit_A, DNA topoisomerase IV, A subunit, Gram-positive	NA|341aa|up_6|NZ_LS483450.1_787343_788366_+	PRK13357, PRK13357, branched-chain amino acid aminotransferase; Provisional	NA|225aa|up_5|NZ_LS483450.1_788421_789096_+	cd08504, PBP2_OppA, The substrate-binding component of an ABC-type oligopetide import system contains the type 2 periplasmic binding fold	NA|230aa|up_4|NZ_LS483450.1_789164_789854_+	COG3819, COG3819, Predicted membrane protein [Function unknown]	NA|308aa|up_3|NZ_LS483450.1_789850_790774_+	COG3817, COG3817, Predicted membrane protein [Function unknown]	NA|215aa|up_2|NZ_LS483450.1_790788_791433_+	PRK13197, PRK13197, pyrrolidone-carboxylate peptidase; Provisional	NA|77aa|up_1|NZ_LS483450.1_791600_791831_+	pfam11184, DUF2969, Protein of unknown function (DUF2969)	NA|401aa|up_0|NZ_LS483450.1_792203_793406_+	PRK06676, rpsA, 30S ribosomal protein S1; Reviewed	NA|166aa|down_0|NZ_LS483450.1_795082_795580_+	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|552aa|down_1|NZ_LS483450.1_795579_797235_+	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated	NA|65aa|down_2|NZ_LS483450.1_797262_797457_+	pfam11676, DUF3272, Protein of unknown function (DUF3272)	NA|257aa|down_3|NZ_LS483450.1_797606_798377_+	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|421aa|down_4|NZ_LS483450.1_798404_799667_+	COG0719, SufB, Cysteine desulfurase activator SufB [Posttranslational modification, protein turnover, chaperones]	NA|409aa|down_5|NZ_LS483450.1_799677_800904_+	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|147aa|down_6|NZ_LS483450.1_800890_801331_+	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|471aa|down_7|NZ_LS483450.1_801384_802797_+	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB	NA|414aa|down_8|NZ_LS483450.1_803027_804269_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|370aa|down_9|NZ_LS483450.1_804371_805481_+	pfam01594, AI-2E_transport, AI-2E family transporter
GCF_900476475.1_49768_F01	NZ_LS483450	Streptococcus pneumoniae strain 4041STDY6583227 chromosome 1	4	1110377-1110456	3	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	TTCTGGTGTCTGCCACCGCTTGGCCC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|74aa|up_5|NZ_LS483450.1_1106208_1106430_-,NA|51aa|up_3|NZ_LS483450.1_1107700_1107853_-,NA|285aa|down_3|NZ_LS483450.1_1116099_1116954_-,NA|81aa|down_4|NZ_LS483450.1_1116970_1117213_-,NA|163aa|down_6|NZ_LS483450.1_1119110_1119599_-,NA|100aa|down_7|NZ_LS483450.1_1119588_1119888_-	NA|728aa|up_9|NZ_LS483450.1_1100496_1102680_-	pfam14093, DUF4271, Domain of unknown function (DUF4271)	NA|816aa|up_8|NZ_LS483450.1_1102682_1105130_-	pfam12846, AAA_10, AAA-like domain	NA|169aa|up_7|NZ_LS483450.1_1105113_1105620_-	pfam12648, TcpE, TcpE family	NA|166aa|up_6|NZ_LS483450.1_1105594_1106092_-	pfam07275, ArdA, Antirestriction protein (ArdA)	NA|74aa|up_5|NZ_LS483450.1_1106208_1106430_-	NA	NA|402aa|up_4|NZ_LS483450.1_1106472_1107678_-	pfam02486, Rep_trans, Replication initiation factor	NA|51aa|up_3|NZ_LS483450.1_1107700_1107853_-	NA	NA|462aa|up_2|NZ_LS483450.1_1107855_1109241_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|129aa|up_1|NZ_LS483450.1_1109269_1109656_-	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|105aa|up_0|NZ_LS483450.1_1109671_1109986_-	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|938aa|down_0|NZ_LS483450.1_1110553_1113367_-	pfam18013, Phage_lysozyme2, Phage tail lysozyme	NA|786aa|down_1|NZ_LS483450.1_1113378_1115736_-	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|120aa|down_2|NZ_LS483450.1_1115686_1116046_-	pfam12666, PrgI, PrgI family protein	NA|285aa|down_3|NZ_LS483450.1_1116099_1116954_-	NA	NA|81aa|down_4|NZ_LS483450.1_1116970_1117213_-	NA	NA|626aa|down_5|NZ_LS483450.1_1117233_1119111_-	pfam02534, T4SS-DNA_transf, Type IV secretory system Conjugative DNA transfer	NA|163aa|down_6|NZ_LS483450.1_1119110_1119599_-	NA	NA|100aa|down_7|NZ_LS483450.1_1119588_1119888_-	NA	NA|279aa|down_8|NZ_LS483450.1_1120170_1121007_-	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|197aa|down_9|NZ_LS483450.1_1121006_1121597_-	pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin
GCF_900476475.1_49768_F01	NZ_LS483450	Streptococcus pneumoniae strain 4041STDY6583227 chromosome 1	5	1490781-1490917	4	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	2	2	Orphan	cas3,DEDDh,DinG	NA,NA|532aa|down_3|NZ_LS483450.1_1498967_1500563_-	NA|120aa|up_9|NZ_LS483450.1_1481352_1481712_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NZ_LS483450.1_1481713_1483114_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NZ_LS483450.1_1483408_1483648_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NZ_LS483450.1_1483679_1484150_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NZ_LS483450.1_1484161_1484452_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|449aa|up_4|NZ_LS483450.1_1484604_1485951_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NZ_LS483450.1_1485966_1486332_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NZ_LS483450.1_1486324_1487512_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|163aa|up_1|NZ_LS483450.1_1487508_1487997_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NZ_LS483450.1_1488317_1488866_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|495aa|down_0|NZ_LS483450.1_1496394_1497879_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NZ_LS483450.1_1497929_1498658_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|55aa|down_2|NZ_LS483450.1_1498736_1498901_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NZ_LS483450.1_1498967_1500563_-	NA	NA|137aa|down_4|NZ_LS483450.1_1500574_1500985_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NZ_LS483450.1_1501100_1501892_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NZ_LS483450.1_1501904_1504601_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NZ_LS483450.1_1504904_1506089_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NZ_LS483450.1_1506231_1508103_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|395aa|down_9|NZ_LS483450.1_1508099_1509284_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
