assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003967155.2_ASM396715v2	NZ_AP019192	Streptococcus pneumoniae strain ASP0581	1	102502-102597	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|74aa|up_9|NZ_AP019192.2_92944_93166_+,NA|131aa|down_3|NZ_AP019192.2_106920_107313_-	NA|74aa|up_9|NZ_AP019192.2_92944_93166_+	NA	NA|310aa|up_8|NZ_AP019192.2_93484_94414_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_AP019192.2_94427_95351_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_AP019192.2_95608_97084_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_AP019192.2_97355_98342_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_AP019192.2_98616_99477_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_AP019192.2_99547_100612_-	pfam16001, DUF4775, Domain of unknown function (DUF4775)	NA|304aa|up_2|NZ_AP019192.2_100673_101585_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_AP019192.2_101577_102171_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_AP019192.2_102157_102484_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_AP019192.2_102622_103789_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|386aa|down_1|NZ_AP019192.2_103846_105004_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_AP019192.2_105045_106896_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|131aa|down_3|NZ_AP019192.2_106920_107313_-	NA	NA|211aa|down_4|NZ_AP019192.2_107470_108103_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_5|NZ_AP019192.2_108124_108997_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_6|NZ_AP019192.2_109005_109677_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_7|NZ_AP019192.2_109918_110506_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|288aa|down_8|NZ_AP019192.2_110557_111421_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|740aa|down_9|NZ_AP019192.2_111886_114106_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]
GCF_003967155.2_ASM396715v2	NZ_AP019192	Streptococcus pneumoniae strain ASP0581	2	1011259-1011362	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	ATAGGGGATTTACCCACTACAAATATTATAGAG	33	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA,NA	NA|214aa|up_9|NZ_AP019192.2_1002257_1002899_-	COG2344, COG2344, AT-rich DNA-binding protein [General function prediction only]	NA|48aa|up_8|NZ_AP019192.2_1003053_1003197_-	pfam15507, DUF4649, Domain of unknown function (DUF4649)	NA|116aa|up_7|NZ_AP019192.2_1003204_1003552_-	pfam08866, DUF1831, Putative amino acid metabolism	NA|372aa|up_6|NZ_AP019192.2_1003556_1004672_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|320aa|up_5|NZ_AP019192.2_1004681_1005641_-	PRK02458, PRK02458, ribose-phosphate pyrophosphokinase; Provisional	NA|190aa|up_4|NZ_AP019192.2_1005753_1006323_-	COG4116, COG4116, Uncharacterized protein conserved in bacteria [Function unknown]	NA|224aa|up_3|NZ_AP019192.2_1006450_1007122_+	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|273aa|up_2|NZ_AP019192.2_1007105_1007924_+	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|299aa|up_1|NZ_AP019192.2_1007920_1008817_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|325aa|up_0|NZ_AP019192.2_1008860_1009835_+	PRK09653, eutD, phosphotransacetylase	NA|100aa|down_0|NZ_AP019192.2_1011421_1011721_-	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|233aa|down_1|NZ_AP019192.2_1012122_1012821_+	COG3619, COG3619, Predicted membrane protein [Function unknown]	NA|105aa|down_2|NZ_AP019192.2_1013008_1013323_+	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|115aa|down_3|NZ_AP019192.2_1013338_1013683_+	COG2868, COG2868, Predicted ribosomal protein [Translation, ribosomal structure and biogenesis]	NA|98aa|down_4|NZ_AP019192.2_1013699_1013993_+	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|306aa|down_5|NZ_AP019192.2_1014839_1015757_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|283aa|down_6|NZ_AP019192.2_1015849_1016698_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|280aa|down_7|NZ_AP019192.2_1016839_1017679_+	TIGR00762, DegV, EDD domain protein, DegV family	NA|92aa|down_8|NZ_AP019192.2_1017785_1018061_+	cd13831, HU, histone-like DNA-binding protein HU	NA|634aa|down_9|NZ_AP019192.2_1018503_1020405_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]
GCF_003967155.2_ASM396715v2	NZ_AP019192	Streptococcus pneumoniae strain ASP0581	3	1071778-1071857	3	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	GGGCCAAGCGGTGGCGGACACCAGAA	26	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|100aa|up_7|NZ_AP019192.2_1062345_1062645_+,NA|163aa|up_6|NZ_AP019192.2_1062634_1063123_+,NA|81aa|up_4|NZ_AP019192.2_1065020_1065263_+,NA|285aa|up_3|NZ_AP019192.2_1065279_1066134_+,NA|51aa|down_2|NZ_AP019192.2_1074378_1074531_+,NA|74aa|down_6|NZ_AP019192.2_1078649_1078871_+	NA|215aa|up_9|NZ_AP019192.2_1060582_1061227_+	pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin	NA|279aa|up_8|NZ_AP019192.2_1061226_1062063_+	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|100aa|up_7|NZ_AP019192.2_1062345_1062645_+	NA	NA|163aa|up_6|NZ_AP019192.2_1062634_1063123_+	NA	NA|626aa|up_5|NZ_AP019192.2_1063122_1065000_+	pfam02534, T4SS-DNA_transf, Type IV secretory system Conjugative DNA transfer	NA|81aa|up_4|NZ_AP019192.2_1065020_1065263_+	NA	NA|285aa|up_3|NZ_AP019192.2_1065279_1066134_+	NA	NA|120aa|up_2|NZ_AP019192.2_1066187_1066547_+	pfam12666, PrgI, PrgI family protein	NA|772aa|up_1|NZ_AP019192.2_1066539_1068855_+	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|938aa|up_0|NZ_AP019192.2_1068866_1071680_+	pfam18013, Phage_lysozyme2, Phage tail lysozyme	NA|105aa|down_0|NZ_AP019192.2_1072246_1072561_+	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|129aa|down_1|NZ_AP019192.2_1072576_1072963_+	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|51aa|down_2|NZ_AP019192.2_1074378_1074531_+	NA	NA|473aa|down_3|NZ_AP019192.2_1074553_1075972_+	pfam02486, Rep_trans, Replication initiation factor	NA|28aa|down_4|NZ_AP019192.2_1076020_1076104_+	pfam06308, ErmC, 23S rRNA methylase leader peptide (ErmC)	NA|246aa|down_5|NZ_AP019192.2_1076228_1076966_+	pfam00398, RrnaAD, Ribosomal RNA adenine dimethylase	NA|74aa|down_6|NZ_AP019192.2_1078649_1078871_+	NA	NA|166aa|down_7|NZ_AP019192.2_1078987_1079485_+	pfam07275, ArdA, Antirestriction protein (ArdA)	NA|169aa|down_8|NZ_AP019192.2_1079459_1079966_+	pfam12648, TcpE, TcpE family	NA|816aa|down_9|NZ_AP019192.2_1079949_1082397_+	pfam12846, AAA_10, AAA-like domain
