assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000211095.1_ASM21109v1	NC_021004	Streptococcus pneumoniae SPN033038, complete genome	1	465267-465403	1	CRISPRCasFinder	no		cas3,RT,DEDDh,DinG	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	2	2	Orphan	cas3,RT,DEDDh,DinG	NA,NA|532aa|down_3|NC_021004.1_474798_476394_-	NA|120aa|up_9|NC_021004.1_457311_457671_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NC_021004.1_457672_459073_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NC_021004.1_459367_459607_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NC_021004.1_459638_460109_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NC_021004.1_460120_460411_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|449aa|up_4|NC_021004.1_460563_461910_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NC_021004.1_461925_462291_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NC_021004.1_462283_463471_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|163aa|up_1|NC_021004.1_463467_463956_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NC_021004.1_464276_464825_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|495aa|down_0|NC_021004.1_472125_473610_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NC_021004.1_473660_474389_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|55aa|down_2|NC_021004.1_474467_474632_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NC_021004.1_474798_476394_-	NA	NA|137aa|down_4|NC_021004.1_476405_476816_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NC_021004.1_476931_477723_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NC_021004.1_477735_480432_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NC_021004.1_480735_481920_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NC_021004.1_482062_483934_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|395aa|down_9|NC_021004.1_483930_485115_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
GCF_000211095.1_ASM21109v1	NC_021004	Streptococcus pneumoniae SPN033038, complete genome	2	1261288-1261383	2	CRISPRCasFinder	no		cas3,RT,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,DEDDh,DinG	NA|117aa|up_9|NC_021004.1_1251906_1252257_+,NA|120aa|down_9|NC_021004.1_1270799_1271159_+	NA|117aa|up_9|NC_021004.1_1251906_1252257_+	NA	NA|310aa|up_8|NC_021004.1_1252575_1253505_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NC_021004.1_1253518_1254442_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NC_021004.1_1254545_1256021_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_021004.1_1256292_1257279_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NC_021004.1_1257400_1258261_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_021004.1_1258332_1259397_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NC_021004.1_1259459_1260371_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_021004.1_1260363_1260957_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_021004.1_1260943_1261270_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_021004.1_1261408_1262575_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|380aa|down_1|NC_021004.1_1262632_1263772_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NC_021004.1_1263839_1265690_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NC_021004.1_1266082_1266715_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NC_021004.1_1266736_1267609_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NC_021004.1_1267617_1268289_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_6|NC_021004.1_1268530_1269103_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|288aa|down_7|NC_021004.1_1269154_1270018_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|113aa|down_8|NC_021004.1_1270483_1270822_+	cd02418, Peptidase_C39B, A sub-family of peptidase family C39	NA|120aa|down_9|NC_021004.1_1270799_1271159_+	NA
