assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000146975.1_ASM14697v1	NC_014494	Streptococcus pneumoniae AP200, complete sequence	1	135151-135246	1	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|74aa|up_9|NC_014494.1_125898_126120_+,NA|107aa|down_7|NC_014494.1_142891_143212_-	NA|74aa|up_9|NC_014494.1_125898_126120_+	NA	NA|310aa|up_8|NC_014494.1_126438_127368_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NC_014494.1_127381_128305_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NC_014494.1_128408_129884_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_014494.1_130155_131142_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|284aa|up_4|NC_014494.1_131263_132115_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_014494.1_132195_133260_-	pfam05262, Borrelia_P83, Borrelia P83/100 protein	NA|304aa|up_2|NC_014494.1_133322_134234_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_014494.1_134226_134820_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_014494.1_134806_135133_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_014494.1_135271_136438_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|387aa|down_1|NC_014494.1_136495_137656_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NC_014494.1_137694_139545_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NC_014494.1_139804_140437_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NC_014494.1_140458_141331_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NC_014494.1_141339_142011_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_6|NC_014494.1_142252_142840_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NC_014494.1_142891_143212_-	NA	NA|96aa|down_8|NC_014494.1_143642_143930_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_9|NC_014494.1_143981_146090_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_000146975.1_ASM14697v1	NC_014494	Streptococcus pneumoniae AP200, complete sequence	2	1685035-1685119	2	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	TTTTTGAAACGTTTCATTTTTTT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|60aa|up_6|NC_014494.1_1677558_1677738_-,NA|72aa|up_5|NC_014494.1_1677902_1678118_-,NA	NA|157aa|up_9|NC_014494.1_1675992_1676463_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|151aa|up_8|NC_014494.1_1676753_1677206_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General function prediction only]	NA|60aa|up_7|NC_014494.1_1677242_1677422_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|60aa|up_6|NC_014494.1_1677558_1677738_-	NA	NA|72aa|up_5|NC_014494.1_1677902_1678118_-	NA	NA|424aa|up_4|NC_014494.1_1678415_1679687_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|440aa|up_3|NC_014494.1_1680231_1681551_-	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|539aa|up_2|NC_014494.1_1681560_1683177_-	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|297aa|up_1|NC_014494.1_1683205_1684096_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|306aa|up_0|NC_014494.1_1684106_1685024_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|334aa|down_0|NC_014494.1_1685173_1686175_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|494aa|down_1|NC_014494.1_1686535_1688017_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|55aa|down_2|NC_014494.1_1688459_1688624_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|189aa|down_3|NC_014494.1_1688707_1689274_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|57aa|down_4|NC_014494.1_1689285_1689456_+	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|203aa|down_5|NC_014494.1_1689494_1690103_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_6|NC_014494.1_1690133_1690337_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|92aa|down_7|NC_014494.1_1690917_1691193_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|257aa|down_8|NC_014494.1_1691470_1692241_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|283aa|down_9|NC_014494.1_1692255_1693104_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1
