assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018985.1_ASM1898v1	NC_012466	Streptococcus pneumoniae JJA, complete genome	1	108337-108432	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|47aa|up_7|NC_012466.1_101410_101551_-,NA|107aa|down_7|NC_012466.1_116123_116444_-	NA|310aa|up_9|NC_012466.1_99470_100400_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_8|NC_012466.1_100413_101337_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|47aa|up_7|NC_012466.1_101410_101551_-	NA	NA|492aa|up_6|NC_012466.1_101594_103070_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_012466.1_103341_104328_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|338aa|up_4|NC_012466.1_104296_105310_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_012466.1_105381_106446_-	pfam16001, DUF4775, Domain of unknown function (DUF4775)	NA|304aa|up_2|NC_012466.1_106508_107420_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_012466.1_107412_108006_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_012466.1_107992_108319_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_012466.1_108457_109624_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|386aa|down_1|NC_012466.1_109681_110839_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NC_012466.1_110880_112731_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NC_012466.1_113036_113669_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NC_012466.1_113690_114563_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NC_012466.1_114571_115243_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_6|NC_012466.1_115484_116072_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NC_012466.1_116123_116444_-	NA	NA|83aa|down_8|NC_012466.1_116894_117143_+	pfam09683, Lactococcin_972, Bacteriocin (Lactococcin_972)	NA|703aa|down_9|NC_012466.1_117197_119306_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_000018985.1_ASM1898v1	NC_012466	Streptococcus pneumoniae JJA, complete genome	2	1033713-1033825	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	ACAACAGGAGTAGATGAAAATGGAAACTTGATTGA	35	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|393aa|up_1|NC_012466.1_1025148_1026327_+,NA	NA|380aa|up_9|NC_012466.1_1010172_1011312_+	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|478aa|up_8|NC_012466.1_1011308_1012742_+	PRK00654, glgA, glycogen synthase GlgA	NA|272aa|up_7|NC_012466.1_1013442_1014258_+	COG1929, COG1929, Glycerate kinase [Carbohydrate transport and metabolism]	NA|149aa|up_6|NC_012466.1_1014254_1014701_-	COG5506, COG5506, Uncharacterized conserved protein [Function unknown]	NA|435aa|up_5|NC_012466.1_1014864_1016169_+	PRK00077, eno, enolase; Provisional	NA|149aa|up_4|NC_012466.1_1016306_1016753_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|1092aa|up_3|NC_012466.1_1018011_1021287_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1217aa|up_2|NC_012466.1_1021283_1024934_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|393aa|up_1|NC_012466.1_1025148_1026327_+	NA	NA|2160aa|up_0|NC_012466.1_1026535_1033015_+	pfam07580, Peptidase_M26_C, M26 IgA1-specific Metallo-endopeptidase C-terminal region	NA|284aa|down_0|NC_012466.1_1038525_1039377_+	PRK09563, rbgA, GTPase YlqF; Reviewed	NA|260aa|down_1|NC_012466.1_1039363_1040143_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|517aa|down_2|NC_012466.1_1040158_1041709_+	cd01031, EriC, ClC chloride channel EriC	NA|357aa|down_3|NC_012466.1_1042292_1043363_+	PRK05084, xerS, site-specific tyrosine recombinase XerS; Reviewed	NA|330aa|down_4|NC_012466.1_1043435_1044425_-	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|568aa|down_5|NC_012466.1_1044488_1046192_-	TIGR01350, Dihydrolipoyl_dehydrogenase, dihydrolipoamide dehydrogenase	NA|348aa|down_6|NC_012466.1_1046237_1047281_-	PRK14843, PRK14843, dihydrolipoamide acetyltransferase; Provisional	NA|331aa|down_7|NC_012466.1_1047498_1048491_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|323aa|down_8|NC_012466.1_1048506_1049475_-	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]	NA|454aa|down_9|NC_012466.1_1049628_1050990_-	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM
GCF_000018985.1_ASM1898v1	NC_012466	Streptococcus pneumoniae JJA, complete genome	3	1374709-1374830	3	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA,NA|532aa|down_3|NC_012466.1_1383706_1385302_-	NA|120aa|up_9|NC_012466.1_1366737_1367097_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NC_012466.1_1367098_1368499_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NC_012466.1_1368793_1369033_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NC_012466.1_1369064_1369535_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NC_012466.1_1369546_1369837_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|448aa|up_4|NC_012466.1_1369989_1371333_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NC_012466.1_1371351_1371717_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NC_012466.1_1371709_1372897_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|144aa|up_1|NC_012466.1_1372893_1373325_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NC_012466.1_1373702_1374251_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|503aa|down_0|NC_012466.1_1380997_1382506_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NC_012466.1_1382556_1383285_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|60aa|down_2|NC_012466.1_1383363_1383543_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NC_012466.1_1383706_1385302_-	NA	NA|137aa|down_4|NC_012466.1_1385313_1385724_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NC_012466.1_1385839_1386631_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NC_012466.1_1386643_1389340_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NC_012466.1_1389643_1390828_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NC_012466.1_1390970_1392842_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|400aa|down_9|NC_012466.1_1392838_1394038_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
GCF_000018985.1_ASM1898v1	NC_012466	Streptococcus pneumoniae JJA, complete genome	4	1624596-1624678	4	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	CTTTTTTGAAACGTTTCATTTTT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|60aa|up_5|NC_012466.1_1617122_1617302_-,NA	NA|368aa|up_9|NC_012466.1_1614449_1615553_-	pfam02163, Peptidase_M50, Peptidase family M50	NA|157aa|up_8|NC_012466.1_1615556_1616027_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|151aa|up_7|NC_012466.1_1616317_1616770_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|69aa|up_6|NC_012466.1_1616806_1617013_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|60aa|up_5|NC_012466.1_1617122_1617302_-	NA	NA|424aa|up_4|NC_012466.1_1617975_1619247_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|440aa|up_3|NC_012466.1_1619793_1621113_-	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|539aa|up_2|NC_012466.1_1621122_1622739_-	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|297aa|up_1|NC_012466.1_1622767_1623658_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|306aa|up_0|NC_012466.1_1623668_1624586_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|334aa|down_0|NC_012466.1_1624734_1625736_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|494aa|down_1|NC_012466.1_1626097_1627579_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|55aa|down_2|NC_012466.1_1628021_1628186_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|189aa|down_3|NC_012466.1_1628269_1628836_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|57aa|down_4|NC_012466.1_1628847_1629018_+	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|203aa|down_5|NC_012466.1_1629056_1629665_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_6|NC_012466.1_1629695_1629899_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|257aa|down_7|NC_012466.1_1631050_1631821_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|283aa|down_8|NC_012466.1_1631835_1632684_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|492aa|down_9|NC_012466.1_1632680_1634156_-	PRK13759, PRK13759, arylsulfatase; Provisional
