assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001457635.1_NCTC7465	NZ_LN831051	Streptococcus pneumoniae strain NCTC7465 chromosome 1	1	366291-366386	1	CRISPRCasFinder	no		DEDDh,cas3,RT,csa3,DinG	Orphan	TTATATATAAAAATTTTACACATT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,csa3,DinG	NA|107aa|up_7|NZ_LN831051.1_358288_358609_+,NA|117aa|down_9|NZ_LN831051.1_375501_375852_-	NA|703aa|up_9|NZ_LN831051.1_355416_357525_-	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein	NA|95aa|up_8|NZ_LN831051.1_357579_357864_-	pfam09683, Lactococcin_972, Bacteriocin (Lactococcin_972)	NA|107aa|up_7|NZ_LN831051.1_358288_358609_+	NA	NA|191aa|up_6|NZ_LN831051.1_358660_359233_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|224aa|up_5|NZ_LN831051.1_359474_360146_+	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|291aa|up_4|NZ_LN831051.1_360154_361027_+	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|211aa|up_3|NZ_LN831051.1_361048_361681_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|617aa|up_2|NZ_LN831051.1_361983_363834_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|380aa|up_1|NZ_LN831051.1_363901_365041_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|389aa|up_0|NZ_LN831051.1_365098_366265_+	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|109aa|down_0|NZ_LN831051.1_366403_366730_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|198aa|down_1|NZ_LN831051.1_366716_367310_+	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|304aa|down_2|NZ_LN831051.1_367302_368214_+	pfam13349, DUF4097, Putative adhesin	NA|355aa|down_3|NZ_LN831051.1_368276_369341_+	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|287aa|down_4|NZ_LN831051.1_369517_370378_-	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|329aa|down_5|NZ_LN831051.1_370499_371486_-	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|492aa|down_6|NZ_LN831051.1_371757_373233_-	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|308aa|down_7|NZ_LN831051.1_373316_374240_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|310aa|down_8|NZ_LN831051.1_374253_375183_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|117aa|down_9|NZ_LN831051.1_375501_375852_-	NA
GCF_001457635.1_NCTC7465	NZ_LN831051	Streptococcus pneumoniae strain NCTC7465 chromosome 1	2	945487-945570	2	CRISPRCasFinder	no	csa3	DEDDh,cas3,RT,csa3,DinG	Type I-A	AAAAATGAAACGTTTCAAAAAAGA	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,csa3,DinG	NA,NA|80aa|down_5|NZ_LN831051.1_952455_952695_+,NA|83aa|down_6|NZ_LN831051.1_952684_952933_+	NA|283aa|up_9|NZ_LN831051.1_937489_938338_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|257aa|up_8|NZ_LN831051.1_938352_939123_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|96aa|up_7|NZ_LN831051.1_939400_939688_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|68aa|up_6|NZ_LN831051.1_940273_940477_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|203aa|up_5|NZ_LN831051.1_940507_941116_-	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|57aa|up_4|NZ_LN831051.1_941154_941325_-	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|189aa|up_3|NZ_LN831051.1_941336_941903_-	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|55aa|up_2|NZ_LN831051.1_941986_942151_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|491aa|up_1|NZ_LN831051.1_942593_944066_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|334aa|up_0|NZ_LN831051.1_944427_945429_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|306aa|down_0|NZ_LN831051.1_945577_946495_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|297aa|down_1|NZ_LN831051.1_946505_947396_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|539aa|down_2|NZ_LN831051.1_947424_949041_+	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|440aa|down_3|NZ_LN831051.1_949050_950370_+	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|424aa|down_4|NZ_LN831051.1_950914_952186_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|80aa|down_5|NZ_LN831051.1_952455_952695_+	NA	NA|83aa|down_6|NZ_LN831051.1_952684_952933_+	NA	NA|58aa|down_7|NZ_LN831051.1_953075_953249_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|151aa|down_8|NZ_LN831051.1_953285_953738_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General function prediction only]	NA|157aa|down_9|NZ_LN831051.1_954028_954499_+	pfam11217, DUF3013, Protein of unknown function (DUF3013)
GCF_001457635.1_NCTC7465	NZ_LN831051	Streptococcus pneumoniae strain NCTC7465 chromosome 1	3	1395180-1395251	3	CRISPRCasFinder	no		DEDDh,cas3,RT,csa3,DinG	Orphan	AGAGCGAGGCTGATTTTGTAAAT	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,csa3,DinG	NA,NA	NA|125aa|up_9|NZ_LN831051.1_1386427_1386802_+	PRK14221, PRK14221, fluoride efflux transporter CrcB	NA|110aa|up_8|NZ_LN831051.1_1386795_1387125_+	PRK14229, PRK14229, fluoride efflux transporter CrcB	NA|116aa|up_7|NZ_LN831051.1_1387243_1387591_+	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|333aa|up_6|NZ_LN831051.1_1387869_1388868_-	pfam02037, SAP, SAP domain	NA|269aa|up_5|NZ_LN831051.1_1388910_1389717_-	PRK10513, PRK10513, sugar phosphate phosphatase; Provisional	NA|435aa|up_4|NZ_LN831051.1_1389729_1391034_-	COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]	NA|126aa|up_3|NZ_LN831051.1_1391105_1391483_+	pfam09148, DUF1934, Domain of unknown function (DUF1934)	NA|111aa|up_2|NZ_LN831051.1_1391573_1391906_+	PRK00118, PRK00118, putative DNA-binding protein; Validated	NA|524aa|up_1|NZ_LN831051.1_1391917_1393489_+	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|428aa|up_0|NZ_LN831051.1_1393683_1394967_-	PRK10720, PRK10720, uracil transporter; Provisional	NA|238aa|down_0|NZ_LN831051.1_1395269_1395983_-	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|187aa|down_1|NZ_LN831051.1_1396076_1396637_+	COG1704, LemA, Uncharacterized conserved protein [Function unknown]	NA|300aa|down_2|NZ_LN831051.1_1396638_1397538_+	PRK04897, PRK04897, heat shock protein HtpX; Provisional	NA|520aa|down_3|NZ_LN831051.1_1397581_1399141_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|181aa|down_4|NZ_LN831051.1_1399865_1400408_+	COG1399, COG1399, Predicted metal-binding, possibly nucleic acid-binding protein [General function prediction only]	NA|210aa|down_5|NZ_LN831051.1_1400407_1401037_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|174aa|down_6|NZ_LN831051.1_1401247_1401769_+	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|308aa|down_7|NZ_LN831051.1_1401787_1402711_+	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|360aa|down_8|NZ_LN831051.1_1402760_1403840_+	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|1059aa|down_9|NZ_LN831051.1_1404152_1407329_+	PRK05294, carB, carbamoyl-phosphate synthase large subunit
GCF_001457635.1_NCTC7465	NZ_LN831051	Streptococcus pneumoniae strain NCTC7465 chromosome 1	4	1537068-1537180	4	CRISPRCasFinder	no		DEDDh,cas3,RT,csa3,DinG	Orphan	ACAACAGGAGTAGATGAAAATGGAAACTTGATTGA	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,csa3,DinG	NA|393aa|up_1|NZ_LN831051.1_1528299_1529478_+,NA|299aa|down_3|NZ_LN831051.1_1545772_1546669_-,NA|119aa|down_4|NZ_LN831051.1_1546813_1547170_+	NA|124aa|up_9|NZ_LN831051.1_1515684_1516056_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|75aa|up_8|NZ_LN831051.1_1516000_1516225_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|372aa|up_7|NZ_LN831051.1_1516338_1517454_+	COG1929, COG1929, Glycerate kinase [Carbohydrate transport and metabolism]	NA|149aa|up_6|NZ_LN831051.1_1517450_1517897_-	COG5506, COG5506, Uncharacterized conserved protein [Function unknown]	NA|435aa|up_5|NZ_LN831051.1_1518060_1519365_+	PRK00077, eno, enolase; Provisional	NA|149aa|up_4|NZ_LN831051.1_1519502_1519949_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|1092aa|up_3|NZ_LN831051.1_1521207_1524483_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1217aa|up_2|NZ_LN831051.1_1524479_1528130_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|393aa|up_1|NZ_LN831051.1_1528299_1529478_+	NA	NA|2160aa|up_0|NZ_LN831051.1_1529686_1536166_+	pfam07580, Peptidase_M26_C, M26 IgA1-specific Metallo-endopeptidase C-terminal region	NA|284aa|down_0|NZ_LN831051.1_1541880_1542732_+	PRK09563, rbgA, GTPase YlqF; Reviewed	NA|260aa|down_1|NZ_LN831051.1_1542718_1543498_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|517aa|down_2|NZ_LN831051.1_1543513_1545064_+	cd01031, EriC, ClC chloride channel EriC	NA|299aa|down_3|NZ_LN831051.1_1545772_1546669_-	NA	NA|119aa|down_4|NZ_LN831051.1_1546813_1547170_+	NA	NA|439aa|down_5|NZ_LN831051.1_1547244_1548561_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|230aa|down_6|NZ_LN831051.1_1548679_1549369_-	COG2932, COG2932, Predicted transcriptional regulator [Transcription]	NA|357aa|down_7|NZ_LN831051.1_1550791_1551862_+	PRK05084, xerS, site-specific tyrosine recombinase XerS; Reviewed	NA|330aa|down_8|NZ_LN831051.1_1551934_1552924_-	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|562aa|down_9|NZ_LN831051.1_1552987_1554673_-	TIGR01350, Dihydrolipoyl_dehydrogenase, dihydrolipoamide dehydrogenase
