assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900475805.1_46514_B03	NZ_LS483417	Streptococcus pneumoniae strain NCTC11902 chromosome 1	1	117013-117108	1	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG,RT	NA|74aa|up_9|NZ_LS483417.1_107344_107566_+,NA|107aa|down_7|NZ_LS483417.1_124889_125210_-	NA|74aa|up_9|NZ_LS483417.1_107344_107566_+	NA	NA|310aa|up_8|NZ_LS483417.1_107887_108817_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_LS483417.1_108830_109754_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_LS483417.1_110011_111487_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_LS483417.1_111758_112745_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_LS483417.1_113019_113880_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_LS483417.1_114057_115122_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NZ_LS483417.1_115184_116096_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_LS483417.1_116088_116682_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_LS483417.1_116668_116995_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_LS483417.1_117133_118300_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|386aa|down_1|NZ_LS483417.1_118357_119515_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_LS483417.1_119556_121407_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NZ_LS483417.1_121802_122435_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NZ_LS483417.1_122456_123329_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NZ_LS483417.1_123337_124009_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_6|NZ_LS483417.1_124250_124838_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NZ_LS483417.1_124889_125210_-	NA	NA|96aa|down_8|NZ_LS483417.1_125636_125924_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_9|NZ_LS483417.1_125975_128084_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_900475805.1_46514_B03	NZ_LS483417	Streptococcus pneumoniae strain NCTC11902 chromosome 1	2	1424080-1424216	2	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG,RT	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	2	2	Orphan	cas3,DEDDh,PrimPol,DinG,RT	NA,NA|532aa|down_3|NZ_LS483417.1_1432365_1433961_-	NA|120aa|up_9|NZ_LS483417.1_1416124_1416484_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NZ_LS483417.1_1416485_1417886_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NZ_LS483417.1_1418180_1418420_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NZ_LS483417.1_1418451_1418922_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NZ_LS483417.1_1418933_1419224_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|449aa|up_4|NZ_LS483417.1_1419376_1420723_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NZ_LS483417.1_1420738_1421104_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NZ_LS483417.1_1421096_1422284_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|163aa|up_1|NZ_LS483417.1_1422280_1422769_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NZ_LS483417.1_1423089_1423638_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|495aa|down_0|NZ_LS483417.1_1429792_1431277_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NZ_LS483417.1_1431327_1432056_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|55aa|down_2|NZ_LS483417.1_1432134_1432299_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NZ_LS483417.1_1432365_1433961_-	NA	NA|137aa|down_4|NZ_LS483417.1_1433972_1434383_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NZ_LS483417.1_1434498_1435290_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NZ_LS483417.1_1435302_1437999_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NZ_LS483417.1_1438302_1439487_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NZ_LS483417.1_1439629_1441501_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|395aa|down_9|NZ_LS483417.1_1441497_1442682_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
