assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000019005.1_ASM1900v1	NC_012467	Streptococcus pneumoniae P1031, complete sequence	1	130757-130852	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|87aa|up_9|NC_012467.1_121206_121467_+,NA|107aa|down_7|NC_012467.1_138533_138854_-	NA|87aa|up_9|NC_012467.1_121206_121467_+	NA	NA|310aa|up_8|NC_012467.1_121959_122889_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NC_012467.1_122902_123826_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NC_012467.1_123909_125385_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_012467.1_125656_126643_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NC_012467.1_126764_127625_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_012467.1_127801_128866_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NC_012467.1_128928_129840_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_012467.1_129832_130426_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_012467.1_130412_130739_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_012467.1_130877_132044_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|380aa|down_1|NC_012467.1_132101_133241_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NC_012467.1_133308_135159_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NC_012467.1_135461_136094_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NC_012467.1_136115_136988_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NC_012467.1_136996_137668_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_6|NC_012467.1_137909_138482_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NC_012467.1_138533_138854_-	NA	NA|95aa|down_8|NC_012467.1_139278_139563_+	pfam09683, Lactococcin_972, Bacteriocin (Lactococcin_972)	NA|703aa|down_9|NC_012467.1_139617_141726_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_000019005.1_ASM1900v1	NC_012467	Streptococcus pneumoniae P1031, complete sequence	2	1059441-1059553	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	ACAACAGGAGTAGATGAAAATGGAAACTTGATTGA	35	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|393aa|up_1|NC_012467.1_1050876_1052055_+,NA|58aa|down_0|NC_012467.1_1064529_1064703_+,NA|150aa|down_3|NC_012467.1_1066919_1067369_+,NA|78aa|down_5|NC_012467.1_1067754_1067988_+	NA|124aa|up_9|NC_012467.1_1038216_1038588_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|75aa|up_8|NC_012467.1_1038532_1038757_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|372aa|up_7|NC_012467.1_1038870_1039986_+	COG1929, COG1929, Glycerate kinase [Carbohydrate transport and metabolism]	NA|149aa|up_6|NC_012467.1_1039982_1040429_-	COG5506, COG5506, Uncharacterized conserved protein [Function unknown]	NA|435aa|up_5|NC_012467.1_1040592_1041897_+	PRK00077, eno, enolase; Provisional	NA|149aa|up_4|NC_012467.1_1042034_1042481_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|1092aa|up_3|NC_012467.1_1043739_1047015_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1217aa|up_2|NC_012467.1_1047011_1050662_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|393aa|up_1|NC_012467.1_1050876_1052055_+	NA	NA|2160aa|up_0|NC_012467.1_1052263_1058743_+	pfam07580, Peptidase_M26_C, M26 IgA1-specific Metallo-endopeptidase C-terminal region	NA|58aa|down_0|NC_012467.1_1064529_1064703_+	NA	NA|260aa|down_1|NC_012467.1_1064699_1065479_+	pfam06970, RepA_N, Replication initiator protein A (RepA) N-terminus	NA|453aa|down_2|NC_012467.1_1065577_1066936_+	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|150aa|down_3|NC_012467.1_1066919_1067369_+	NA	NA|127aa|down_4|NC_012467.1_1067361_1067742_+	COG1393, ArsC, Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism]	NA|78aa|down_5|NC_012467.1_1067754_1067988_+	NA	NA|201aa|down_6|NC_012467.1_1067990_1068593_+	pfam02517, Abi, CAAX protease self-immunity	NA|406aa|down_7|NC_012467.1_1068754_1069972_-	pfam00589, Phage_integrase, Phage integrase family	NA|68aa|down_8|NC_012467.1_1070053_1070257_-	pfam09035, Tn916-Xis, Excisionase from transposon Tn916	NA|77aa|down_9|NC_012467.1_1070717_1070948_-	pfam12645, HTH_16, Helix-turn-helix domain
