assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003351645.1_ASM335164v1	NZ_CP031245	Streptococcus pneumoniae strain M16808 chromosome, complete genome	1	69217-69302	1	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG	Orphan	TTTTTGAAACGTTTCATTTTTGCTT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG	NA|60aa|up_6|NZ_CP031245.1_61743_61923_-,NA|80aa|up_5|NZ_CP031245.1_62087_62327_-,NA	NA|157aa|up_9|NZ_CP031245.1_60177_60648_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|151aa|up_8|NZ_CP031245.1_60938_61391_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|60aa|up_7|NZ_CP031245.1_61427_61607_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|60aa|up_6|NZ_CP031245.1_61743_61923_-	NA	NA|80aa|up_5|NZ_CP031245.1_62087_62327_-	NA	NA|424aa|up_4|NZ_CP031245.1_62596_63868_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|440aa|up_3|NZ_CP031245.1_64412_65732_-	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|539aa|up_2|NZ_CP031245.1_65741_67358_-	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|297aa|up_1|NZ_CP031245.1_67386_68277_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|306aa|up_0|NZ_CP031245.1_68287_69205_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|494aa|down_0|NZ_CP031245.1_70723_72205_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|55aa|down_1|NZ_CP031245.1_72647_72812_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|189aa|down_2|NZ_CP031245.1_72895_73462_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|57aa|down_3|NZ_CP031245.1_73473_73644_+	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|203aa|down_4|NZ_CP031245.1_73682_74291_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_5|NZ_CP031245.1_74321_74525_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|96aa|down_6|NZ_CP031245.1_75110_75398_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|257aa|down_7|NZ_CP031245.1_75675_76446_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|283aa|down_8|NZ_CP031245.1_76460_77309_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|492aa|down_9|NZ_CP031245.1_77305_78781_-	PRK13759, PRK13759, arylsulfatase; Provisional
GCF_003351645.1_ASM335164v1	NZ_CP031245	Streptococcus pneumoniae strain M16808 chromosome, complete genome	2	662683-662778	2	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG	NA|74aa|up_9|NZ_CP031245.1_653218_653440_+,NA|56aa|down_3|NZ_CP031245.1_667055_667223_-,NA|107aa|down_8|NZ_CP031245.1_670407_670728_-	NA|74aa|up_9|NZ_CP031245.1_653218_653440_+	NA	NA|310aa|up_8|NZ_CP031245.1_653758_654688_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_CP031245.1_654701_655625_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_CP031245.1_655834_657310_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_CP031245.1_657581_658568_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_CP031245.1_658689_659550_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_CP031245.1_659727_660792_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NZ_CP031245.1_660854_661766_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_CP031245.1_661758_662352_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_CP031245.1_662338_662665_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_CP031245.1_662803_663970_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|386aa|down_1|NZ_CP031245.1_664027_665185_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_CP031245.1_665226_667077_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|56aa|down_3|NZ_CP031245.1_667055_667223_-	NA	NA|211aa|down_4|NZ_CP031245.1_667335_667968_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_5|NZ_CP031245.1_667989_668862_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_6|NZ_CP031245.1_668870_669542_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_7|NZ_CP031245.1_669783_670356_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_8|NZ_CP031245.1_670407_670728_-	NA	NA|83aa|down_9|NZ_CP031245.1_671179_671428_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family
