assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900475305.1_43721_E02	NZ_LS483374	Streptococcus pneumoniae strain NCTC7466 chromosome 1	1	97129-97224	1	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|74aa|up_9|NZ_LS483374.1_87722_87944_+,NA	NA|74aa|up_9|NZ_LS483374.1_87722_87944_+	NA	NA|310aa|up_8|NZ_LS483374.1_88262_89192_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_LS483374.1_89205_90129_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_LS483374.1_90386_91862_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_LS483374.1_92133_93120_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_LS483374.1_93241_94102_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_LS483374.1_94173_95238_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NZ_LS483374.1_95300_96212_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_LS483374.1_96204_96798_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_LS483374.1_96784_97111_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_LS483374.1_97249_98416_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|386aa|down_1|NZ_LS483374.1_98473_99631_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_LS483374.1_99672_101523_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NZ_LS483374.1_101918_102551_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NZ_LS483374.1_102572_103445_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NZ_LS483374.1_103453_104125_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|168aa|down_6|NZ_LS483374.1_104366_104870_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|98aa|down_7|NZ_LS483374.1_105652_105946_+	pfam09683, Lactococcin_972, Bacteriocin (Lactococcin_972)	NA|703aa|down_8|NZ_LS483374.1_106000_108109_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein	NA|214aa|down_9|NZ_LS483374.1_108105_108747_+	TIGR03608, L_ocin_972_ABC, putative bacteriocin export ABC transporter, lactococcin 972 group
GCF_900475305.1_43721_E02	NZ_LS483374	Streptococcus pneumoniae strain NCTC7466 chromosome 1	2	1388474-1388595	2	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA,NA|532aa|down_3|NZ_LS483374.1_1398797_1400393_-	NA|120aa|up_9|NZ_LS483374.1_1380502_1380862_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NZ_LS483374.1_1380863_1382264_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NZ_LS483374.1_1382558_1382798_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NZ_LS483374.1_1382829_1383300_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NZ_LS483374.1_1383311_1383602_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|449aa|up_4|NZ_LS483374.1_1383754_1385101_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NZ_LS483374.1_1385116_1385482_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NZ_LS483374.1_1385474_1386662_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|163aa|up_1|NZ_LS483374.1_1386658_1387147_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NZ_LS483374.1_1387467_1388016_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|495aa|down_0|NZ_LS483374.1_1396124_1397609_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NZ_LS483374.1_1397659_1398388_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|55aa|down_2|NZ_LS483374.1_1398466_1398631_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NZ_LS483374.1_1398797_1400393_-	NA	NA|137aa|down_4|NZ_LS483374.1_1400404_1400815_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NZ_LS483374.1_1400930_1401722_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NZ_LS483374.1_1401734_1404431_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NZ_LS483374.1_1404734_1405919_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NZ_LS483374.1_1406061_1407933_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|395aa|down_9|NZ_LS483374.1_1407929_1409114_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
GCF_900475305.1_43721_E02	NZ_LS483374	Streptococcus pneumoniae strain NCTC7466 chromosome 1	3	1698526-1698643	3	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	CAAAAAAGAAAGGACAAAATTTGTCCTTTCTCGAGCTTAGCTTTT	45	1	3	1698571-1698598|1698571-1698598|1698571-1698598	NZ_LS483374.1_40218-40191|NZ_LS483374.1_966395-966422|NZ_LS483374.1_1656868-1656895	NA	1	1	Orphan	cas3,DEDDh,DinG	NA,NA	NA|499aa|up_9|NZ_LS483374.1_1683419_1684916_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|660aa|up_8|NZ_LS483374.1_1684982_1686962_-	cd08504, PBP2_OppA, The substrate-binding component of an ABC-type oligopetide import system contains the type 2 periplasmic binding fold	NA|398aa|up_7|NZ_LS483374.1_1687648_1688842_-	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane]	NA|481aa|up_6|NZ_LS483374.1_1688979_1690422_-	TIGR03852, sucrose_gtfA, sucrose phosphorylase	NA|278aa|up_5|NZ_LS483374.1_1690779_1691613_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|289aa|up_4|NZ_LS483374.1_1691626_1692493_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|420aa|up_3|NZ_LS483374.1_1692506_1693766_-	cd14749, PBP2_XBP1_like, The periplasmic-binding component of ABC transport systems specific for xylo-oligosaccharides; possesses type 2 periplasmic binding fold	NA|721aa|up_2|NZ_LS483374.1_1693857_1696020_-	COG3345, GalA, Alpha-galactosidase [Carbohydrate transport and metabolism]	NA|287aa|up_1|NZ_LS483374.1_1696127_1696988_+	cd06986, cupin_MmsR-like_N, AraC/XylS family transcriptional regulators similar to MmsR, N-terminal cupin domain	NA|312aa|up_0|NZ_LS483374.1_1696984_1697920_+	PRK11886, PRK11886, bifunctional biotin--[acetyl-CoA-carboxylase] ligase/biotin operon repressor BirA	NA|452aa|down_0|NZ_LS483374.1_1705552_1706908_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|259aa|down_1|NZ_LS483374.1_1706945_1707722_+	PRK14135, recX, recombination regulator RecX; Provisional	NA|178aa|down_2|NZ_LS483374.1_1707810_1708344_+	PRK13662, PRK13662, hypothetical protein; Provisional	NA|410aa|down_3|NZ_LS483374.1_1709063_1710293_+	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|541aa|down_4|NZ_LS483374.1_1710391_1712014_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|95aa|down_5|NZ_LS483374.1_1712029_1712314_-	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|132aa|down_6|NZ_LS483374.1_1712469_1712865_-	PRK07274, PRK07274, single-stranded DNA-binding protein; Provisional	NA|254aa|down_7|NZ_LS483374.1_1712942_1713704_-	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|209aa|down_8|NZ_LS483374.1_1713736_1714363_-	cd02796, tRNA_bind_bactPheRS, tRNA-binding-domain-containing prokaryotic phenylalanly tRNA synthetase (PheRS) beta chain	NA|106aa|down_9|NZ_LS483374.1_1714378_1714696_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains
