assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000147095.1_ASM14709v1	NC_014498	Streptococcus pneumoniae 670-6B, complete sequence	1	15648-15727	1	CRISPRCasFinder	no	cas3	cas3,DEDDh,DinG	Unclear	TGGAATTTACTTAGAAAATAAAAAA	25	0	0	NA	NA	NA	1	1	Unclear	cas3,DEDDh,DinG	NA|47aa|up_9|NC_014498.1_10445_10586_-,NA|167aa|up_8|NC_014498.1_10597_11098_-,NA|50aa|up_7|NC_014498.1_11491_11641_-,NA|49aa|up_6|NC_014498.1_11627_11774_-,NA|143aa|up_5|NC_014498.1_11777_12206_-,NA|56aa|up_4|NC_014498.1_12361_12529_-,NA	NA|47aa|up_9|NC_014498.1_10445_10586_-	NA	NA|167aa|up_8|NC_014498.1_10597_11098_-	NA	NA|50aa|up_7|NC_014498.1_11491_11641_-	NA	NA|49aa|up_6|NC_014498.1_11627_11774_-	NA	NA|143aa|up_5|NC_014498.1_11777_12206_-	NA	NA|56aa|up_4|NC_014498.1_12361_12529_-	NA	NA|209aa|up_3|NC_014498.1_12805_13432_-	COG3646, COG3646, Uncharacterized phage-encoded protein [Function unknown]	NA|195aa|up_2|NC_014498.1_13422_14007_-	smart01040, Bro-N, BRO family, N-terminal domain	NA|64aa|up_1|NC_014498.1_14306_14498_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|239aa|up_0|NC_014498.1_14701_15418_+	COG2932, COG2932, Predicted transcriptional regulator [Transcription]	NA|285aa|down_0|NC_014498.1_16643_17498_+	PRK11525, dinD, DNA-damage-inducible protein D; Provisional	NA|389aa|down_1|NC_014498.1_17662_18829_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|372aa|down_2|NC_014498.1_18948_20064_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|190aa|down_3|NC_014498.1_20134_20704_+	COG0193, Pth, Peptidyl-tRNA hydrolase [Translation, ribosomal structure and biogenesis]	cas3|1170aa|down_4|NC_014498.1_20704_24214_+	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|89aa|down_5|NC_014498.1_24271_24538_+	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]	NA|123aa|down_6|NC_014498.1_24530_24899_+	COG2919, COG2919, Septum formation initiator [Cell division and chromosome partitioning]	NA|423aa|down_7|NC_014498.1_25018_26287_+	COG2367, PenP, Beta-lactamase class A [Defense mechanisms]	NA|426aa|down_8|NC_014498.1_26283_27561_+	cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]	NA|181aa|down_9|NC_014498.1_27564_28107_+	COG0634, Hpt, Hypoxanthine-guanine phosphoribosyltransferase [Nucleotide transport and metabolism]
GCF_000147095.1_ASM14709v1	NC_014498	Streptococcus pneumoniae 670-6B, complete sequence	2	147974-148069	2	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|117aa|up_9|NC_014498.1_138284_138635_+,NA|107aa|down_7|NC_014498.1_155759_156080_-	NA|117aa|up_9|NC_014498.1_138284_138635_+	NA	NA|310aa|up_8|NC_014498.1_138953_139883_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NC_014498.1_139896_140820_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NC_014498.1_141077_142553_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_014498.1_142824_143811_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|268aa|up_4|NC_014498.1_144143_144947_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_014498.1_145018_146083_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NC_014498.1_146145_147057_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_014498.1_147049_147643_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_014498.1_147629_147956_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_014498.1_148094_149261_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|387aa|down_1|NC_014498.1_149318_150479_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NC_014498.1_150517_152368_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NC_014498.1_152672_153305_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NC_014498.1_153326_154199_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NC_014498.1_154207_154879_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_6|NC_014498.1_155120_155708_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NC_014498.1_155759_156080_-	NA	NA|96aa|down_8|NC_014498.1_156510_156798_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_9|NC_014498.1_156849_158958_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_000147095.1_ASM14709v1	NC_014498	Streptococcus pneumoniae 670-6B, complete sequence	3	1102590-1102672	3	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	TTCTGGTGTCTGCCACCGCTTGGCCCTTA	29	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|99aa|up_3|NC_014498.1_1095053_1095350_-,NA|95aa|up_2|NC_014498.1_1095363_1095648_-,NA|137aa|up_0|NC_014498.1_1101994_1102405_-,NA|55aa|down_0|NC_014498.1_1102867_1103032_+,NA|285aa|down_5|NC_014498.1_1109222_1110077_-,NA|81aa|down_6|NC_014498.1_1110093_1110336_-,NA|163aa|down_8|NC_014498.1_1112233_1112722_-,NA|100aa|down_9|NC_014498.1_1112711_1113011_-	NA|462aa|up_9|NC_014498.1_1088750_1090136_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|129aa|up_8|NC_014498.1_1090164_1090551_-	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|105aa|up_7|NC_014498.1_1090566_1090881_-	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|169aa|up_6|NC_014498.1_1091196_1091703_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|374aa|up_5|NC_014498.1_1091695_1092817_-	pfam00899, ThiF, ThiF family	NA|528aa|up_4|NC_014498.1_1093078_1094662_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|99aa|up_3|NC_014498.1_1095053_1095350_-	NA	NA|95aa|up_2|NC_014498.1_1095363_1095648_-	NA	NA|2075aa|up_1|NC_014498.1_1095722_1101947_-	COG4646, COG4646, DNA methylase [Transcription / DNA replication, recombination, and repair]	NA|137aa|up_0|NC_014498.1_1101994_1102405_-	NA	NA|55aa|down_0|NC_014498.1_1102867_1103032_+	NA	NA|203aa|down_1|NC_014498.1_1103033_1103642_+	PRK08445, PRK08445, dehypoxanthine futalosine cyclase	NA|938aa|down_2|NC_014498.1_1103676_1106490_-	pfam18013, Phage_lysozyme2, Phage tail lysozyme	NA|786aa|down_3|NC_014498.1_1106501_1108859_-	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|120aa|down_4|NC_014498.1_1108809_1109169_-	pfam12666, PrgI, PrgI family protein	NA|285aa|down_5|NC_014498.1_1109222_1110077_-	NA	NA|81aa|down_6|NC_014498.1_1110093_1110336_-	NA	NA|626aa|down_7|NC_014498.1_1110356_1112234_-	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|163aa|down_8|NC_014498.1_1112233_1112722_-	NA	NA|100aa|down_9|NC_014498.1_1112711_1113011_-	NA
