assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003966485.1_ASM396648v1	AP018938	Streptococcus pneumoniae ATCC 49619 DNA, complete genome	1	132096-132191	1	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG,RT	NA|74aa|up_9|AP018938.1_122631_122853_+,NA	NA|74aa|up_9|AP018938.1_122631_122853_+	NA	NA|310aa|up_8|AP018938.1_123171_124101_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|AP018938.1_124114_125038_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|AP018938.1_125247_126723_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|AP018938.1_126994_127981_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|AP018938.1_128102_128963_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|AP018938.1_129140_130205_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|AP018938.1_130267_131179_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|AP018938.1_131171_131765_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|AP018938.1_131751_132078_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|AP018938.1_132216_133383_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|355aa|down_1|AP018938.1_133533_134598_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|AP018938.1_134639_136490_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|AP018938.1_136748_137381_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|AP018938.1_137402_138275_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|AP018938.1_138283_138955_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_6|AP018938.1_139196_139769_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|83aa|down_7|AP018938.1_140592_140841_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_8|AP018938.1_140895_143004_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein	NA|144aa|down_9|AP018938.1_143000_143432_+	TIGR03608, L_ocin_972_ABC, putative bacteriocin export ABC transporter, lactococcin 972 group
GCA_003966485.1_ASM396648v1	AP018938	Streptococcus pneumoniae ATCC 49619 DNA, complete genome	2	1635053-1635138	2	CRISPRCasFinder	no		cas3,DEDDh,PrimPol,DinG,RT	Orphan	TTTTTGAAACGTTTCATTTTTGCTT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PrimPol,DinG,RT	NA|80aa|up_5|AP018938.1_1627923_1628163_-,NA	NA|143aa|up_9|AP018938.1_1624497_1624926_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|356aa|up_8|AP018938.1_1624927_1625995_-	pfam02163, Peptidase_M50, Peptidase family M50	NA|157aa|up_7|AP018938.1_1626013_1626484_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|151aa|up_6|AP018938.1_1626774_1627227_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|80aa|up_5|AP018938.1_1627923_1628163_-	NA	NA|424aa|up_4|AP018938.1_1628432_1629704_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|440aa|up_3|AP018938.1_1630248_1631568_-	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|539aa|up_2|AP018938.1_1631577_1633194_-	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|297aa|up_1|AP018938.1_1633222_1634113_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|306aa|up_0|AP018938.1_1634123_1635041_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|330aa|down_0|AP018938.1_1635207_1636197_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|494aa|down_1|AP018938.1_1636559_1638041_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|55aa|down_2|AP018938.1_1638483_1638648_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|189aa|down_3|AP018938.1_1638731_1639298_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|57aa|down_4|AP018938.1_1639309_1639480_+	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|203aa|down_5|AP018938.1_1639518_1640127_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_6|AP018938.1_1640157_1640361_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|257aa|down_7|AP018938.1_1641511_1642282_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|283aa|down_8|AP018938.1_1642296_1643145_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|492aa|down_9|AP018938.1_1643141_1644617_-	PRK13759, PRK13759, arylsulfatase; Provisional
