assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900476455.1_55312_E02	NZ_LS483523	Streptococcus pneumoniae strain 4041STDY6836169 chromosome 1	1	405911-406006	1	CRISPRCasFinder	no		DEDDh,cas3,RT,DinG	Orphan	TTATATATAAAAATTTTACACATT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,DinG	NA|107aa|up_7|NZ_LS483523.1_397781_398102_+,NA|117aa|down_8|NZ_LS483523.1_415450_415801_-	NA|703aa|up_9|NZ_LS483523.1_394917_397026_-	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein	NA|98aa|up_8|NZ_LS483523.1_397080_397374_-	pfam09683, Lactococcin_972, Bacteriocin (Lactococcin_972)	NA|107aa|up_7|NZ_LS483523.1_397781_398102_+	NA	NA|191aa|up_6|NZ_LS483523.1_398153_398726_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|224aa|up_5|NZ_LS483523.1_398967_399639_+	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|291aa|up_4|NZ_LS483523.1_399647_400520_+	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|211aa|up_3|NZ_LS483523.1_400541_401174_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|617aa|up_2|NZ_LS483523.1_401611_403462_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|386aa|up_1|NZ_LS483523.1_403503_404661_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|389aa|up_0|NZ_LS483523.1_404718_405885_+	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|109aa|down_0|NZ_LS483523.1_406023_406350_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|198aa|down_1|NZ_LS483523.1_406336_406930_+	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|304aa|down_2|NZ_LS483523.1_406922_407834_+	pfam13349, DUF4097, Putative adhesin	NA|355aa|down_3|NZ_LS483523.1_407896_408961_+	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|329aa|down_4|NZ_LS483523.1_410274_411261_-	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|492aa|down_5|NZ_LS483523.1_411532_413008_-	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|308aa|down_6|NZ_LS483523.1_413265_414189_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|310aa|down_7|NZ_LS483523.1_414202_415132_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|117aa|down_8|NZ_LS483523.1_415450_415801_-	NA	NA|204aa|down_9|NZ_LS483523.1_416567_417179_-	PRK05327, rpsD, 30S ribosomal protein S4; Validated
GCF_900476455.1_55312_E02	NZ_LS483523	Streptococcus pneumoniae strain 4041STDY6836169 chromosome 1	2	949239-949323	2	CRISPRCasFinder	no		DEDDh,cas3,RT,DinG	Orphan	AACAAAAATGAAACGTTTCAAAAA	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,DinG	NA,NA|80aa|down_5|NZ_LS483523.1_956211_956451_+,NA|60aa|down_6|NZ_LS483523.1_956615_956795_+	NA|283aa|up_9|NZ_LS483523.1_941255_942104_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|257aa|up_8|NZ_LS483523.1_942118_942889_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|96aa|up_7|NZ_LS483523.1_943149_943437_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|68aa|up_6|NZ_LS483523.1_944021_944225_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|203aa|up_5|NZ_LS483523.1_944255_944864_-	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|57aa|up_4|NZ_LS483523.1_944902_945073_-	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|189aa|up_3|NZ_LS483523.1_945084_945651_-	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|55aa|up_2|NZ_LS483523.1_945734_945899_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|494aa|up_1|NZ_LS483523.1_946340_947822_+	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|334aa|up_0|NZ_LS483523.1_948182_949184_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|306aa|down_0|NZ_LS483523.1_949333_950251_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|297aa|down_1|NZ_LS483523.1_950261_951152_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|539aa|down_2|NZ_LS483523.1_951180_952797_+	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|440aa|down_3|NZ_LS483523.1_952806_954126_+	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|424aa|down_4|NZ_LS483523.1_954670_955942_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|80aa|down_5|NZ_LS483523.1_956211_956451_+	NA	NA|60aa|down_6|NZ_LS483523.1_956615_956795_+	NA	NA|60aa|down_7|NZ_LS483523.1_956931_957111_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|151aa|down_8|NZ_LS483523.1_957147_957600_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General function prediction only]	NA|157aa|down_9|NZ_LS483523.1_957890_958361_+	pfam11217, DUF3013, Protein of unknown function (DUF3013)
GCF_900476455.1_55312_E02	NZ_LS483523	Streptococcus pneumoniae strain 4041STDY6836169 chromosome 1	3	972628-986211	2,4,1,3	CRT,CRT,CRT,CRT	no		DEDDh,cas3,RT,DinG	Orphan	AGNACNTCNGCANNNACA,TCNGCATCAACAAGCGCN,AGNACNTCNGCATCAACA,ACNTCNGCAAGNACNTCN	18,18,18,18	0	0	NA	NA	NA:NA:NA:NA	397,397,397,397	397	Orphan	DEDDh,cas3,RT,DinG	NA|60aa|up_5|NZ_LS483523.1_968065_968245_-,NA|34aa|up_1|NZ_LS483523.1_970473_970575_+,NA	NA|257aa|up_9|NZ_LS483523.1_964536_965307_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|255aa|up_8|NZ_LS483523.1_965299_966064_+	TIGR01291, Nodulation_protein_J, ABC-2 type transporter, NodJ family	NA|264aa|up_7|NZ_LS483523.1_966470_967262_+	pfam10978, DUF2785, Protein of unknown function (DUF2785)	NA|223aa|up_6|NZ_LS483523.1_967372_968041_+	cd00333, MIP, Major intrinsic protein (MIP) superfamily	NA|60aa|up_5|NZ_LS483523.1_968065_968245_-	NA	NA|105aa|up_4|NZ_LS483523.1_969132_969447_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|74aa|up_3|NZ_LS483523.1_969462_969684_-	pfam15507, DUF4649, Domain of unknown function (DUF4649)	NA|127aa|up_2|NZ_LS483523.1_969782_970163_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|34aa|up_1|NZ_LS483523.1_970473_970575_+	NA	NA|154aa|up_0|NZ_LS483523.1_970541_971003_+	pfam01710, HTH_Tnp_IS630, Transposase	NA|293aa|down_0|NZ_LS483523.1_987099_987978_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|293aa|down_1|NZ_LS483523.1_988066_988945_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|416aa|down_2|NZ_LS483523.1_988947_990195_+	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|386aa|down_3|NZ_LS483523.1_990163_991321_+	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|281aa|down_4|NZ_LS483523.1_991325_992168_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|1078aa|down_5|NZ_LS483523.1_992364_995598_+	TIGR03728, glyco_access_1, glycosyltransferase, SP_1767 family	NA|336aa|down_6|NZ_LS483523.1_995606_996614_+	PRK09814, PRK09814, sugar transferase	NA|409aa|down_7|NZ_LS483523.1_996721_997948_+	PRK12417, secY, preprotein translocase subunit SecY; Reviewed	NA|525aa|down_8|NZ_LS483523.1_998236_999811_+	pfam16993, Asp1, Accessory Sec system protein Asp1	NA|511aa|down_9|NZ_LS483523.1_999800_1001333_+	TIGR03712, acc_sec_asp2, accessory Sec system protein Asp2
GCF_900476455.1_55312_E02	NZ_LS483523	Streptococcus pneumoniae strain 4041STDY6836169 chromosome 1	4	1547443-1547525	3	CRISPRCasFinder	no		DEDDh,cas3,RT,DinG	Orphan	TTCTGGTGTCTGCCACCGCTTGGCCCTTA	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,RT,DinG	NA|59aa|up_6|NZ_LS483523.1_1538619_1538796_-,NA|61aa|up_5|NZ_LS483523.1_1538882_1539065_-,NA|61aa|up_4|NZ_LS483523.1_1539227_1539410_-,NA|61aa|up_3|NZ_LS483523.1_1539567_1539750_-,NA|107aa|up_2|NZ_LS483523.1_1539904_1540225_-,NA|95aa|up_1|NZ_LS483523.1_1540217_1540502_-,NA|137aa|up_0|NZ_LS483523.1_1546847_1547258_-,NA|55aa|down_0|NZ_LS483523.1_1547720_1547885_+,NA|285aa|down_4|NZ_LS483523.1_1554075_1554930_-,NA|81aa|down_5|NZ_LS483523.1_1554946_1555189_-,NA|78aa|down_9|NZ_LS483523.1_1558077_1558311_-	NA|525aa|up_9|NZ_LS483523.1_1533931_1535506_-	cd16917, HATPase_UhpB-NarQ-NarX-like, Histidine kinase-like ATPase domain of two-component sensor histidine kinases similar to Escherichia coli UhpB, NarQ and NarX, and Bacillus subtilis YdfH, YhcY and YfiJ	NA|672aa|up_8|NZ_LS483523.1_1535586_1537602_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|245aa|up_7|NZ_LS483523.1_1537614_1538349_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|59aa|up_6|NZ_LS483523.1_1538619_1538796_-	NA	NA|61aa|up_5|NZ_LS483523.1_1538882_1539065_-	NA	NA|61aa|up_4|NZ_LS483523.1_1539227_1539410_-	NA	NA|61aa|up_3|NZ_LS483523.1_1539567_1539750_-	NA	NA|107aa|up_2|NZ_LS483523.1_1539904_1540225_-	NA	NA|95aa|up_1|NZ_LS483523.1_1540217_1540502_-	NA	NA|137aa|up_0|NZ_LS483523.1_1546847_1547258_-	NA	NA|55aa|down_0|NZ_LS483523.1_1547720_1547885_+	NA	NA|938aa|down_1|NZ_LS483523.1_1548529_1551343_-	pfam18013, Phage_lysozyme2, Phage tail lysozyme	NA|786aa|down_2|NZ_LS483523.1_1551354_1553712_-	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|120aa|down_3|NZ_LS483523.1_1553662_1554022_-	pfam12666, PrgI, PrgI family protein	NA|285aa|down_4|NZ_LS483523.1_1554075_1554930_-	NA	NA|81aa|down_5|NZ_LS483523.1_1554946_1555189_-	NA	NA|270aa|down_6|NZ_LS483523.1_1555958_1556768_-	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|197aa|down_7|NZ_LS483523.1_1556767_1557358_-	pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin	NA|196aa|down_8|NZ_LS483523.1_1557487_1558075_-	pfam02517, Abi, CAAX protease self-immunity	NA|78aa|down_9|NZ_LS483523.1_1558077_1558311_-	NA
