assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001693275.1_ASM169327v1	NZ_CP016482	Synechococcus sp. PCC 7117 plasmid unnamed5, complete sequence	1	54-380	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		cas4,WYL,csx18,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas6	Orphan	CTTTCCAACCACTAAATCCCGACCACGGGACTGAAAC,CTTTCCAACCACTAAATCCCGACCACGGGACTGAAAC,CTTTCCAACCACTAAATCCCGACCACGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	4,4,4	4	Orphan	csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F,cas4,csx18,cas1,cas2,cas10d,csc2gr7,csc1gr5,cas6	NA,NA|47aa|down_2|NZ_CP016482.1_1564_1705_-	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|114aa|down_0|NZ_CP016482.1_636_978_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|107aa|down_1|NZ_CP016482.1_970_1291_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|47aa|down_2|NZ_CP016482.1_1564_1705_-	NA	NA|540aa|down_3|NZ_CP016482.1_4015_5635_+	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|539aa|down_4|NZ_CP016482.1_5631_7248_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|427aa|down_5|NZ_CP016482.1_7788_9069_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|178aa|down_6|NZ_CP016482.1_9384_9918_-	PRK13474, PRK13474, cytochrome b6-f complex iron-sulfur subunit; Provisional	NA|66aa|down_7|NZ_CP016482.1_9972_10170_-	pfam11127, DUF2892, Protein of unknown function (DUF2892)	NA|341aa|down_8|NZ_CP016482.1_10684_11707_+	pfam01032, FecCD, FecCD transport family	NA|279aa|down_9|NZ_CP016482.1_11703_12540_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]
GCF_001693275.1_ASM169327v1	NZ_CP016482	Synechococcus sp. PCC 7117 plasmid unnamed5, complete sequence	2	236389-237178	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	WYL,csx18,cas1,cas2,cas3	cas4,WYL,csx18,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas6	Unclear	GTTTCTCTTTACTTGAGAAGCTAATAGATTGGAAAC,GTTTCTCTTTACTTGAGAAGCTAATAGATTGGAAAC,GTTTCTCTTTACTTGAGAAGCTAATAGATTGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	9,10,10	10	Unclear	csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F,cas4,csx18,cas1,cas2,cas10d,csc2gr7,csc1gr5,cas6	NA|132aa|up_8|NZ_CP016482.1_228759_229155_+,NA|384aa|up_7|NZ_CP016482.1_229559_230711_+,NA|230aa|up_6|NZ_CP016482.1_230736_231426_+,NA|74aa|up_3|NZ_CP016482.1_234251_234473_+,csx18|99aa|up_2|NZ_CP016482.1_234652_234949_+,NA|79aa|down_3|NZ_CP016482.1_240627_240864_-,NA|347aa|down_5|NZ_CP016482.1_242675_243716_+	NA|197aa|up_9|NZ_CP016482.1_228133_228724_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|132aa|up_8|NZ_CP016482.1_228759_229155_+	NA	NA|384aa|up_7|NZ_CP016482.1_229559_230711_+	NA	NA|230aa|up_6|NZ_CP016482.1_230736_231426_+	NA	NA|394aa|up_5|NZ_CP016482.1_231430_232612_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	WYL|409aa|up_4|NZ_CP016482.1_232867_234094_-	pfam13280, WYL, WYL domain	NA|74aa|up_3|NZ_CP016482.1_234251_234473_+	NA	csx18|99aa|up_2|NZ_CP016482.1_234652_234949_+	NA	cas1|329aa|up_1|NZ_CP016482.1_234945_235932_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|93aa|up_0|NZ_CP016482.1_235931_236210_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|687aa|down_0|NZ_CP016482.1_237398_239459_+	TIGR04096, conserved_hypothetical_protein, DNA phosphorothioation-associated putative methyltransferase	NA|107aa|down_1|NZ_CP016482.1_239627_239948_+	pfam09907, HigB_toxin, HigB_toxin, RelE-like toxic component of a toxin-antitoxin system	NA|140aa|down_2|NZ_CP016482.1_240088_240508_+	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|79aa|down_3|NZ_CP016482.1_240627_240864_-	NA	NA|270aa|down_4|NZ_CP016482.1_241034_241844_+	COG3440, COG3440, Predicted restriction endonuclease [Defense mechanisms]	NA|347aa|down_5|NZ_CP016482.1_242675_243716_+	NA	NA|138aa|down_6|NZ_CP016482.1_243712_244126_-	cd18807, SF1_C_UvrD, C-terminal helicase domain of UvrD family helicases	NA|381aa|down_7|NZ_CP016482.1_244103_245246_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|336aa|down_8|NZ_CP016482.1_245349_246357_+	PRK06850, PRK06850, hypothetical protein; Provisional	NA|174aa|down_9|NZ_CP016482.1_246553_247075_+	pfam10387, DUF2442, Protein of unknown function (DUF2442)
GCF_001693275.1_ASM169327v1	NZ_CP016482	Synechococcus sp. PCC 7117 plasmid unnamed5, complete sequence	3	261902-264744	3,3,3,4,5	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	WYL,cas3,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas1,cas2	cas4,WYL,csx18,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas6	Type I-D	CTTTCCAACCACTAAATCCCGATCACGGGACTGAAAC,CTTTCCAACCACTAAATCCCGATCACGGGACTGAAAC,CTTTCCAACCACTAAATCCCGATCACGGGACTGAAAC,CTTTCCAACCACTAAATCCCGACCACGGGACTGAAAC,CTTTCCAACCACTAAATCCCGATCACGGGACTGAAAC	37,37,37,37,37	1	1	263052-263089	NZ_CP016482.1_264745-264782	NA:NA:NA:NA:NA	34,38,38,34,34	38	TypeI-D	csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F,cas4,csx18,cas1,cas2,cas10d,csc2gr7,csc1gr5,cas6	NA|69aa|up_9|NZ_CP016482.1_249888_250095_-,NA	NA|69aa|up_9|NZ_CP016482.1_249888_250095_-	NA	WYL|288aa|up_8|NZ_CP016482.1_250057_250921_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	cas3|728aa|up_7|NZ_CP016482.1_251017_253201_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas10d|1114aa|up_6|NZ_CP016482.1_253893_257235_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|322aa|up_5|NZ_CP016482.1_257288_258254_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|241aa|up_4|NZ_CP016482.1_258268_258991_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	cas6|279aa|up_3|NZ_CP016482.1_258995_259832_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|193aa|up_2|NZ_CP016482.1_259841_260420_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|326aa|up_1|NZ_CP016482.1_260422_261400_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|98aa|up_0|NZ_CP016482.1_261396_261690_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_001693275.1_ASM169327v1	NZ_CP016477	Synechococcus sp. PCC 7117 chromosome, complete genome	1	441641-441770	1	CRISPRCasFinder	no		csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F	Orphan	CTTCAAGTCTGCGTTGCTCCTCCTCAGCTCTTCG	34	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F,cas4,csx18,cas1,cas2,cas10d,csc2gr7,csc1gr5,cas6	NA|130aa|up_4|NZ_CP016477.1_436568_436958_-,NA|432aa|down_2|NZ_CP016477.1_443755_445051_-	NA|374aa|up_9|NZ_CP016477.1_430460_431582_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|390aa|up_8|NZ_CP016477.1_431591_432761_-	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|393aa|up_7|NZ_CP016477.1_432797_433976_-	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|439aa|up_6|NZ_CP016477.1_434075_435392_-	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|317aa|up_5|NZ_CP016477.1_435611_436562_-	COG1397, DraG, ADP-ribosylglycohydrolase [Posttranslational modification, protein turnover, chaperones]	NA|130aa|up_4|NZ_CP016477.1_436568_436958_-	NA	NA|422aa|up_3|NZ_CP016477.1_436941_438207_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|398aa|up_2|NZ_CP016477.1_438417_439611_+	cd03823, GT4_ExpE7-like, glycosyltransferase ExpE7 and similar proteins	NA|431aa|up_1|NZ_CP016477.1_439613_440906_-	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|132aa|up_0|NZ_CP016477.1_440917_441313_-	pfam02350, Epimerase_2, UDP-N-acetylglucosamine 2-epimerase	NA|141aa|down_0|NZ_CP016477.1_442112_442535_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|363aa|down_1|NZ_CP016477.1_442646_443735_-	cd03786, GTB_UDP-GlcNAc_2-Epimerase, UDP-N-acetylglucosamine 2-epimerase and similar proteins	NA|432aa|down_2|NZ_CP016477.1_443755_445051_-	NA	NA|427aa|down_3|NZ_CP016477.1_445131_446412_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|458aa|down_4|NZ_CP016477.1_446431_447805_-	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)	NA|300aa|down_5|NZ_CP016477.1_447873_448773_-	pfam13489, Methyltransf_23, Methyltransferase domain	NA|288aa|down_6|NZ_CP016477.1_448782_449646_-	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|155aa|down_7|NZ_CP016477.1_449686_450151_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|354aa|down_8|NZ_CP016477.1_450193_451255_-	TIGR03569, ORF_8_similar_to_NeuB_family, N-acetylneuraminate synthase	NA|234aa|down_9|NZ_CP016477.1_451257_451959_-	cd02513, CMP-NeuAc_Synthase, CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety
GCF_001693275.1_ASM169327v1	NZ_CP016477	Synechococcus sp. PCC 7117 chromosome, complete genome	2	1857602-1857716	2	CRISPRCasFinder	no		csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F	Orphan	CCAGCCCTTCATAGCTCCACTTCTCTAACGGGACTCTGAACAC	43	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F,cas4,csx18,cas1,cas2,cas10d,csc2gr7,csc1gr5,cas6	NA|280aa|up_3|NZ_CP016477.1_1854898_1855738_+,NA|128aa|up_2|NZ_CP016477.1_1855734_1856118_+,NA|211aa|up_1|NZ_CP016477.1_1856120_1856753_+,NA|169aa|up_0|NZ_CP016477.1_1856877_1857384_+,NA|259aa|down_2|NZ_CP016477.1_1861178_1861955_+,NA|214aa|down_4|NZ_CP016477.1_1863017_1863659_+	NA|334aa|up_9|NZ_CP016477.1_1847497_1848499_+	pfam02618, YceG, YceG-like family	NA|302aa|up_8|NZ_CP016477.1_1848519_1849425_+	pfam05430, Methyltransf_30, S-adenosyl-L-methionine-dependent methyltransferase	NA|377aa|up_7|NZ_CP016477.1_1849375_1850506_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|369aa|up_6|NZ_CP016477.1_1850516_1851623_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|653aa|up_5|NZ_CP016477.1_1851626_1853585_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|414aa|up_4|NZ_CP016477.1_1853590_1854832_-	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|280aa|up_3|NZ_CP016477.1_1854898_1855738_+	NA	NA|128aa|up_2|NZ_CP016477.1_1855734_1856118_+	NA	NA|211aa|up_1|NZ_CP016477.1_1856120_1856753_+	NA	NA|169aa|up_0|NZ_CP016477.1_1856877_1857384_+	NA	NA|613aa|down_0|NZ_CP016477.1_1858202_1860041_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|365aa|down_1|NZ_CP016477.1_1860079_1861174_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|259aa|down_2|NZ_CP016477.1_1861178_1861955_+	NA	NA|288aa|down_3|NZ_CP016477.1_1862026_1862890_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|214aa|down_4|NZ_CP016477.1_1863017_1863659_+	NA	NA|212aa|down_5|NZ_CP016477.1_1863670_1864306_-	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|288aa|down_6|NZ_CP016477.1_1864462_1865326_-	pfam00950, ABC-3, ABC 3 transport family	NA|267aa|down_7|NZ_CP016477.1_1865322_1866123_-	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|326aa|down_8|NZ_CP016477.1_1866177_1867155_-	cd01137, PsaA, Metal binding protein PsaA	NA|160aa|down_9|NZ_CP016477.1_1867333_1867813_+	sd00006, TPR, Tetratricopeptide repeat
GCF_001693275.1_ASM169327v1	NZ_CP016477	Synechococcus sp. PCC 7117 chromosome, complete genome	3	2988049-2988141	3	CRISPRCasFinder	no		csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F	Orphan	CTCTAGCTCCCCTCATAGGGGAGAA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas14j,DinG,c2c9_V-U4,WYL,RT,Cas14c_CAS-V-F,cas4,csx18,cas1,cas2,cas10d,csc2gr7,csc1gr5,cas6	NA|115aa|up_4|NZ_CP016477.1_2981421_2981766_+,NA|699aa|down_0|NZ_CP016477.1_2988264_2990361_-	NA|263aa|up_9|NZ_CP016477.1_2970461_2971250_+	PRK00235, cobS, cobalamin synthase; Reviewed	NA|146aa|up_8|NZ_CP016477.1_2971229_2971667_-	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|584aa|up_7|NZ_CP016477.1_2977287_2979039_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|380aa|up_6|NZ_CP016477.1_2979330_2980470_+	PRK09585, anmK, anhydro-N-acetylmuramic acid kinase; Reviewed	NA|306aa|up_5|NZ_CP016477.1_2980463_2981381_-	cd09992, HDAC_classII, Histone deacetylases and histone-like deacetylases, classII	NA|115aa|up_4|NZ_CP016477.1_2981421_2981766_+	NA	NA|538aa|up_3|NZ_CP016477.1_2981886_2983500_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|261aa|up_2|NZ_CP016477.1_2983685_2984468_+	TIGR03069, RNA-binding_S4_domain-containing_protein, photosystem II S4 domain protein	NA|478aa|up_1|NZ_CP016477.1_2984464_2985898_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|592aa|up_0|NZ_CP016477.1_2986011_2987787_-	PRK07418, PRK07418, acetolactate synthase large subunit	NA|699aa|down_0|NZ_CP016477.1_2988264_2990361_-	NA	NA|212aa|down_1|NZ_CP016477.1_2992063_2992699_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|194aa|down_2|NZ_CP016477.1_2992887_2993469_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|134aa|down_3|NZ_CP016477.1_2993818_2994220_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|400aa|down_4|NZ_CP016477.1_2994401_2995601_+	PRK00509, PRK00509, argininosuccinate synthase; Provisional	NA|318aa|down_5|NZ_CP016477.1_2995943_2996897_+	TIGR02651, Ribonuclease_Z, ribonuclease Z	NA|112aa|down_6|NZ_CP016477.1_2996888_2997224_-	pfam18032, FRP, Photoprotection regulator fluorescence recovery protein	NA|238aa|down_7|NZ_CP016477.1_2997425_2998139_-	cd03513, CrtW_beta-carotene-ketolase, Beta-carotene ketolase/oxygenase (CrtW, also known as CrtO), the carotenoid astaxanthin biosynthetic enzyme, initially catalyzes the addition of two keto groups to carbons C4 and C4' of beta-carotene	NA|321aa|down_8|NZ_CP016477.1_2998250_2999213_-	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|462aa|down_9|NZ_CP016477.1_2999448_3000834_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]
