assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003351525.1_ASM335152v1	NZ_CP031247	Streptococcus pneumoniae strain M23734 chromosome, complete genome	1	390141-390223	1	CRISPRCasFinder	no		DEDDh,DinG,cas3,PrimPol,RT	Orphan	TTCTGGTGTCTGCCACCGCTTGGCCCTTA	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,PrimPol,RT	NA|128aa|up_9|NZ_CP031247.1_379234_379618_-,NA|78aa|up_8|NZ_CP031247.1_379614_379848_-,NA|214aa|up_6|NZ_CP031247.1_380993_381635_-,NA|66aa|up_5|NZ_CP031247.1_381794_381992_+,NA|99aa|up_4|NZ_CP031247.1_382005_382302_-,NA|95aa|up_3|NZ_CP031247.1_382315_382600_-,NA|137aa|up_1|NZ_CP031247.1_388952_389363_-,NA|285aa|down_3|NZ_CP031247.1_395864_396719_-,NA|81aa|down_4|NZ_CP031247.1_396735_396978_-,NA|163aa|down_6|NZ_CP031247.1_398875_399364_-,NA|100aa|down_7|NZ_CP031247.1_399353_399653_-	NA|128aa|up_9|NZ_CP031247.1_379234_379618_-	NA	NA|78aa|up_8|NZ_CP031247.1_379614_379848_-	NA	NA|362aa|up_7|NZ_CP031247.1_379898_380984_-	PHA02415, PHA02415, DNA primase domain-containing protein	NA|214aa|up_6|NZ_CP031247.1_380993_381635_-	NA	NA|66aa|up_5|NZ_CP031247.1_381794_381992_+	NA	NA|99aa|up_4|NZ_CP031247.1_382005_382302_-	NA	NA|95aa|up_3|NZ_CP031247.1_382315_382600_-	NA	NA|2082aa|up_2|NZ_CP031247.1_382674_388920_-	COG4646, COG4646, DNA methylase [Transcription / DNA replication, recombination, and repair]	NA|137aa|up_1|NZ_CP031247.1_388952_389363_-	NA	NA|189aa|up_0|NZ_CP031247.1_389514_390081_-	pfam18813, PBECR4, phage-Barnase-EndoU-ColicinE5/D-RelE like nuclease4	NA|938aa|down_0|NZ_CP031247.1_390318_393132_-	pfam18013, Phage_lysozyme2, Phage tail lysozyme	NA|772aa|down_1|NZ_CP031247.1_393143_395459_-	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|120aa|down_2|NZ_CP031247.1_395451_395811_-	pfam12666, PrgI, PrgI family protein	NA|285aa|down_3|NZ_CP031247.1_395864_396719_-	NA	NA|81aa|down_4|NZ_CP031247.1_396735_396978_-	NA	NA|626aa|down_5|NZ_CP031247.1_396998_398876_-	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|163aa|down_6|NZ_CP031247.1_398875_399364_-	NA	NA|100aa|down_7|NZ_CP031247.1_399353_399653_-	NA	NA|105aa|down_8|NZ_CP031247.1_400325_400640_+	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)	NA|129aa|down_9|NZ_CP031247.1_400655_401042_+	pfam06125, DUF961, Bacterial protein of unknown function (DUF961)
GCF_003351525.1_ASM335152v1	NZ_CP031247	Streptococcus pneumoniae strain M23734 chromosome, complete genome	2	427053-427165	2	CRISPRCasFinder	no		DEDDh,DinG,cas3,PrimPol,RT	Orphan	TCAATCAAGTTTCCATTTTCATCCACTCCTGTTGT	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,PrimPol,RT	NA|78aa|up_4|NZ_CP031247.1_418617_418851_-,NA|150aa|up_2|NZ_CP031247.1_419236_419686_-,NA|393aa|down_1|NZ_CP031247.1_434550_435729_-	NA|141aa|up_9|NZ_CP031247.1_415238_415661_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|77aa|up_8|NZ_CP031247.1_415657_415888_+	pfam12645, HTH_16, Helix-turn-helix domain	NA|68aa|up_7|NZ_CP031247.1_416348_416552_+	pfam09035, Tn916-Xis, Excisionase from transposon Tn916	NA|406aa|up_6|NZ_CP031247.1_416633_417851_+	pfam00589, Phage_integrase, Phage integrase family	NA|201aa|up_5|NZ_CP031247.1_418012_418615_-	pfam02517, Abi, CAAX protease self-immunity	NA|78aa|up_4|NZ_CP031247.1_418617_418851_-	NA	NA|127aa|up_3|NZ_CP031247.1_418863_419244_-	COG1393, ArsC, Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism]	NA|150aa|up_2|NZ_CP031247.1_419236_419686_-	NA	NA|453aa|up_1|NZ_CP031247.1_419669_421028_-	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|260aa|up_0|NZ_CP031247.1_421126_421906_-	pfam06970, RepA_N, Replication initiator protein A (RepA) N-terminus	NA|2160aa|down_0|NZ_CP031247.1_427862_434342_-	pfam07580, Peptidase_M26_C, M26 IgA1-specific Metallo-endopeptidase C-terminal region	NA|393aa|down_1|NZ_CP031247.1_434550_435729_-	NA	NA|1217aa|down_2|NZ_CP031247.1_435943_439594_-	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|1092aa|down_3|NZ_CP031247.1_439590_442866_-	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|149aa|down_4|NZ_CP031247.1_444124_444571_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|435aa|down_5|NZ_CP031247.1_444708_446013_-	PRK00077, eno, enolase; Provisional	NA|149aa|down_6|NZ_CP031247.1_446176_446623_+	COG5506, COG5506, Uncharacterized conserved protein [Function unknown]	NA|372aa|down_7|NZ_CP031247.1_446619_447735_-	COG1929, COG1929, Glycerate kinase [Carbohydrate transport and metabolism]	NA|124aa|down_8|NZ_CP031247.1_448017_448389_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|478aa|down_9|NZ_CP031247.1_448447_449881_-	PRK00654, glgA, glycogen synthase GlgA
GCF_003351525.1_ASM335152v1	NZ_CP031247	Streptococcus pneumoniae strain M23734 chromosome, complete genome	3	1355892-1355987	3	CRISPRCasFinder	no		DEDDh,DinG,cas3,PrimPol,RT	Orphan	TTATATATAAAAATTTTACACATT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,PrimPol,RT	NA|107aa|up_7|NZ_CP031247.1_1347889_1348210_+,NA|87aa|down_9|NZ_CP031247.1_1365276_1365537_-	NA|703aa|up_9|NZ_CP031247.1_1345017_1347126_-	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein	NA|95aa|up_8|NZ_CP031247.1_1347180_1347465_-	pfam09683, Lactococcin_972, Bacteriocin (Lactococcin_972)	NA|107aa|up_7|NZ_CP031247.1_1347889_1348210_+	NA	NA|191aa|up_6|NZ_CP031247.1_1348261_1348834_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|224aa|up_5|NZ_CP031247.1_1349075_1349747_+	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|291aa|up_4|NZ_CP031247.1_1349755_1350628_+	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|211aa|up_3|NZ_CP031247.1_1350649_1351282_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|617aa|up_2|NZ_CP031247.1_1351584_1353435_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|380aa|up_1|NZ_CP031247.1_1353502_1354642_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|389aa|up_0|NZ_CP031247.1_1354699_1355866_+	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|109aa|down_0|NZ_CP031247.1_1356004_1356331_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|198aa|down_1|NZ_CP031247.1_1356317_1356911_+	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|304aa|down_2|NZ_CP031247.1_1356903_1357815_+	pfam13349, DUF4097, Putative adhesin	NA|355aa|down_3|NZ_CP031247.1_1357877_1358942_+	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|287aa|down_4|NZ_CP031247.1_1359118_1359979_-	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|329aa|down_5|NZ_CP031247.1_1360100_1361087_-	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|492aa|down_6|NZ_CP031247.1_1361358_1362834_-	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|308aa|down_7|NZ_CP031247.1_1362917_1363841_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|310aa|down_8|NZ_CP031247.1_1363854_1364784_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|87aa|down_9|NZ_CP031247.1_1365276_1365537_-	NA
