assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000210975.1_ASM21097v1	NC_017591	Streptococcus pneumoniae INV104, complete genome	1	94852-94947	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|117aa|up_9|NC_017591.1_85470_85821_+,NA|120aa|down_9|NC_017591.1_104363_104723_+	NA|117aa|up_9|NC_017591.1_85470_85821_+	NA	NA|310aa|up_8|NC_017591.1_86139_87069_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NC_017591.1_87082_88006_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NC_017591.1_88109_89585_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NC_017591.1_89856_90843_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NC_017591.1_90964_91825_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NC_017591.1_91896_92961_-	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|304aa|up_2|NC_017591.1_93023_93935_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NC_017591.1_93927_94521_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NC_017591.1_94507_94834_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NC_017591.1_94972_96139_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|380aa|down_1|NC_017591.1_96196_97336_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NC_017591.1_97403_99254_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NC_017591.1_99646_100279_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NC_017591.1_100300_101173_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NC_017591.1_101181_101853_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_6|NC_017591.1_102094_102667_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|288aa|down_7|NC_017591.1_102718_103582_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|113aa|down_8|NC_017591.1_104047_104386_+	cd02418, Peptidase_C39B, A sub-family of peptidase family C39	NA|120aa|down_9|NC_017591.1_104363_104723_+	NA
GCF_000210975.1_ASM21097v1	NC_017591	Streptococcus pneumoniae INV104, complete genome	2	1434416-1434552	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	ACTTCTGGTGTCGGTACATTTGGTGTTGG	29	0	0	NA	NA	NA	2	2	Orphan	cas3,DEDDh,DinG,RT	NA,NA|532aa|down_3|NC_017591.1_1443947_1445543_-	NA|120aa|up_9|NC_017591.1_1426460_1426820_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_8|NC_017591.1_1426821_1428222_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|80aa|up_7|NC_017591.1_1428516_1428756_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|up_6|NC_017591.1_1428787_1429258_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|up_5|NC_017591.1_1429269_1429560_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|449aa|up_4|NC_017591.1_1429712_1431059_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|122aa|up_3|NC_017591.1_1431074_1431440_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|396aa|up_2|NC_017591.1_1431432_1432620_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|163aa|up_1|NC_017591.1_1432616_1433105_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|183aa|up_0|NC_017591.1_1433425_1433974_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|495aa|down_0|NC_017591.1_1441274_1442759_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|243aa|down_1|NC_017591.1_1442809_1443538_-	PRK02101, PRK02101, peroxide stress protein YaaA	NA|55aa|down_2|NC_017591.1_1443616_1443781_-	pfam13129, DUF3953, Protein of unknown function (DUF3953)	NA|532aa|down_3|NC_017591.1_1443947_1445543_-	NA	NA|137aa|down_4|NC_017591.1_1445554_1445965_-	PRK09218, PRK09218, peptide deformylase; Validated	NA|264aa|down_5|NC_017591.1_1446080_1446872_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|899aa|down_6|NC_017591.1_1446884_1449581_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|395aa|down_7|NC_017591.1_1449884_1451069_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|624aa|down_8|NC_017591.1_1451211_1453083_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|395aa|down_9|NC_017591.1_1453079_1454264_-	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional
