assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900476505.1_55312_C02	NZ_LS483451	Streptococcus pneumoniae strain 4041STDY6836166 chromosome 1	1	138866-138961	1	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG,RT	NA|47aa|up_7|NZ_LS483451.1_131787_131928_-,NA|107aa|down_7|NZ_LS483451.1_146770_147091_-	NA|310aa|up_9|NZ_LS483451.1_129847_130777_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_8|NZ_LS483451.1_130790_131714_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|47aa|up_7|NZ_LS483451.1_131787_131928_-	NA	NA|492aa|up_6|NZ_LS483451.1_131971_133447_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_LS483451.1_133718_134705_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_LS483451.1_134979_135840_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_LS483451.1_135911_136976_-	pfam16001, DUF4775, Domain of unknown function (DUF4775)	NA|304aa|up_2|NZ_LS483451.1_137037_137949_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_LS483451.1_137941_138535_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_LS483451.1_138521_138848_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_LS483451.1_138986_140153_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|386aa|down_1|NZ_LS483451.1_140210_141368_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_LS483451.1_141409_143260_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NZ_LS483451.1_143697_144330_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NZ_LS483451.1_144351_145224_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NZ_LS483451.1_145232_145904_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_6|NZ_LS483451.1_146145_146718_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NZ_LS483451.1_146770_147091_-	NA	NA|96aa|down_8|NZ_LS483451.1_147521_147809_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_9|NZ_LS483451.1_147860_149969_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_900476505.1_55312_C02	NZ_LS483451	Streptococcus pneumoniae strain 4041STDY6836166 chromosome 1	2	1231120-1231191	2	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG,RT	Orphan	ATTTACAAAATCAACCTCGCTCT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG,RT	NA|106aa|up_4|NZ_LS483451.1_1226568_1226886_+,NA|592aa|down_6|NZ_LS483451.1_1237670_1239446_-	NA|360aa|up_9|NZ_LS483451.1_1222424_1223504_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|308aa|up_8|NZ_LS483451.1_1223553_1224477_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|174aa|up_7|NZ_LS483451.1_1224495_1225017_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|210aa|up_6|NZ_LS483451.1_1225227_1225857_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|181aa|up_5|NZ_LS483451.1_1225856_1226399_-	COG1399, COG1399, Predicted metal-binding, possibly nucleic acid-binding protein [General function prediction only]	NA|106aa|up_4|NZ_LS483451.1_1226568_1226886_+	NA	NA|520aa|up_3|NZ_LS483451.1_1227124_1228684_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|300aa|up_2|NZ_LS483451.1_1228832_1229732_-	PRK04897, PRK04897, heat shock protein HtpX; Provisional	NA|187aa|up_1|NZ_LS483451.1_1229733_1230294_-	COG1704, LemA, Uncharacterized conserved protein [Function unknown]	NA|238aa|up_0|NZ_LS483451.1_1230387_1231101_+	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|428aa|down_0|NZ_LS483451.1_1231403_1232687_+	PRK10720, PRK10720, uracil transporter; Provisional	NA|524aa|down_1|NZ_LS483451.1_1232881_1234453_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|111aa|down_2|NZ_LS483451.1_1234464_1234797_-	PRK00118, PRK00118, putative DNA-binding protein; Validated	NA|126aa|down_3|NZ_LS483451.1_1234887_1235265_-	pfam09148, DUF1934, Domain of unknown function (DUF1934)	NA|435aa|down_4|NZ_LS483451.1_1235336_1236641_+	COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]	NA|269aa|down_5|NZ_LS483451.1_1236653_1237460_+	PRK10513, PRK10513, sugar phosphate phosphatase; Provisional	NA|592aa|down_6|NZ_LS483451.1_1237670_1239446_-	NA	NA|354aa|down_7|NZ_LS483451.1_1239931_1240993_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|407aa|down_8|NZ_LS483451.1_1241005_1242226_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|285aa|down_9|NZ_LS483451.1_1242319_1243174_+	TIGR01716, HTH-type_transcriptional_regulator_rgg, transcriptional activator, Rgg/GadR/MutR family, C-terminal domain
