assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000147335.1_ASM14733v1	NC_014533	Gloeothece verrucosa PCC 7822 plasmid Cy782201, complete sequence	1	21043-23249	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cmr3gr5,cmr4gr7,cmr5gr11	cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csx18,cas8b3,c2c9_V-U4,cmr6gr7,cas10	Type III-C	CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAAC,CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAAC,CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	25,29,29	29	TypeIII-C	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|92aa|up_6|NC_014533.1_2005_2281_+,NA|109aa|down_1|NC_014533.1_27320_27647_+,NA|107aa|down_2|NC_014533.1_27705_28026_-,cmr5gr11|130aa|down_4|NC_014533.1_29148_29538_+	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|92aa|up_6|NC_014533.1_2005_2281_+	NA	NA|413aa|up_5|NC_014533.1_2356_3595_-	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|833aa|up_4|NC_014533.1_3620_6119_-	PRK15347, PRK15347, two component system sensor kinase	NA|1036aa|up_3|NC_014533.1_9166_12274_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|345aa|up_2|NC_014533.1_12328_13363_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|1306aa|up_1|NC_014533.1_13873_17791_+	cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|802aa|up_0|NC_014533.1_18000_20406_+	pfam05203, Hom_end_hint, Hom_end-associated Hint	cmr3gr5|357aa|down_0|NC_014533.1_25698_26769_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	NA|109aa|down_1|NC_014533.1_27320_27647_+	NA	NA|107aa|down_2|NC_014533.1_27705_28026_-	NA	cmr4gr7|262aa|down_3|NC_014533.1_28345_29131_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|130aa|down_4|NC_014533.1_29148_29538_+	NA	NA|547aa|down_5|NC_014533.1_29550_31191_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	NA|1185aa|down_6|NC_014533.1_31664_35219_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|209aa|down_7|NC_014533.1_42620_43247_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|186aa|down_8|NC_014533.1_43995_44553_+	COG3448, COG3448, CBS-domain-containing membrane protein [Signal transduction mechanisms]	cas2|98aa|down_9|NC_014533.1_46703_46997_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2
GCF_000147335.1_ASM14733v1	NC_014533	Gloeothece verrucosa PCC 7822 plasmid Cy782201, complete sequence	2	44680-46461	2,2,2	CRT,PILER-CR,CRISPRCasFinder	no	cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csx18,cas8b3,c2c9_V-U4,cmr6gr7,cas10	Type I-D	GTTTCAATCCCGTTTCCAGGATTCATTTAATAGAAAG,GTTTCAATCCCGTTTCCAGGATTCATTTAATAGAAAG,GTTTCAATCCCGTTTCCAGGATTCATTTAATAGAAAG	37,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	24,23,23	24	TypeI-D	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|109aa|up_7|NC_014533.1_27320_27647_+,NA|107aa|up_6|NC_014533.1_27705_28026_-,cmr5gr11|130aa|up_4|NC_014533.1_29148_29538_+,NA	NA|802aa|up_9|NC_014533.1_18000_20406_+	pfam05203, Hom_end_hint, Hom_end-associated Hint	cmr3gr5|357aa|up_8|NC_014533.1_25698_26769_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	NA|109aa|up_7|NC_014533.1_27320_27647_+	NA	NA|107aa|up_6|NC_014533.1_27705_28026_-	NA	cmr4gr7|262aa|up_5|NC_014533.1_28345_29131_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|130aa|up_4|NC_014533.1_29148_29538_+	NA	NA|547aa|up_3|NC_014533.1_29550_31191_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	NA|1185aa|up_2|NC_014533.1_31664_35219_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|209aa|up_1|NC_014533.1_42620_43247_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|186aa|up_0|NC_014533.1_43995_44553_+	COG3448, COG3448, CBS-domain-containing membrane protein [Signal transduction mechanisms]	cas2|98aa|down_0|NC_014533.1_46703_46997_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NC_014533.1_46993_47971_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|201aa|down_2|NC_014533.1_48013_48616_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|276aa|down_3|NC_014533.1_48618_49446_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	csc1gr5|260aa|down_4|NC_014533.1_49417_50197_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|331aa|down_5|NC_014533.1_50264_51257_-	pfam18320, Csc2, Csc2 Crispr	cas10d|972aa|down_6|NC_014533.1_51316_54232_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	cas3|713aa|down_7|NC_014533.1_54274_56413_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	WYL|316aa|down_8|NC_014533.1_56962_57910_+	pfam13280, WYL, WYL domain	NA|366aa|down_9|NC_014533.1_58271_59369_-	cd01066, APP_MetAP, A family including aminopeptidase P, aminopeptidase M, and prolidase
GCF_000147335.1_ASM14733v1	NC_014533	Gloeothece verrucosa PCC 7822 plasmid Cy782201, complete sequence	3	208812-209007	3	PILER-CR	no		cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csx18,cas8b3,c2c9_V-U4,cmr6gr7,cas10	Orphan	GATTACATCGAAACCTACAACTATGATGCTAACAGCAATCAAATCT	46	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA,NA|138aa|down_2|NC_014533.1_213932_214346_+,NA|123aa|down_3|NC_014533.1_214431_214800_+,NA|142aa|down_4|NC_014533.1_214803_215229_+,NA|127aa|down_5|NC_014533.1_215243_215624_+	NA|388aa|up_9|NC_014533.1_192182_193346_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|401aa|up_8|NC_014533.1_193379_194582_+	cd03821, GT4_Bme6-like, Brucella melitensis Bme6 and similar proteins	NA|174aa|up_7|NC_014533.1_194664_195186_+	cd06259, YdcF-like, YdcF-like	NA|1140aa|up_6|NC_014533.1_195205_198625_-	NF012211, tand_rpt_95, tandem-95 repeat protein	NA|215aa|up_5|NC_014533.1_199332_199977_+	COG4339, COG4339, Uncharacterized protein conserved in bacteria [Function unknown]	NA|321aa|up_4|NC_014533.1_200252_201215_+	pfam14200, RicinB_lectin_2, Ricin-type beta-trefoil lectin domain-like	NA|316aa|up_3|NC_014533.1_201570_202518_-	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|1159aa|up_2|NC_014533.1_202580_206057_+	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|338aa|up_1|NC_014533.1_206343_207357_-	PRK10130, PRK10130, HTH-type transcriptional regulator EutR	NA|113aa|up_0|NC_014533.1_207591_207930_+	pfam09798, LCD1, DNA damage checkpoint protein	NA|633aa|down_0|NC_014533.1_210626_212525_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|387aa|down_1|NC_014533.1_212548_213709_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|138aa|down_2|NC_014533.1_213932_214346_+	NA	NA|123aa|down_3|NC_014533.1_214431_214800_+	NA	NA|142aa|down_4|NC_014533.1_214803_215229_+	NA	NA|127aa|down_5|NC_014533.1_215243_215624_+	NA	NA|262aa|down_6|NC_014533.1_215980_216766_-	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|241aa|down_7|NC_014533.1_216776_217499_-	pfam13489, Methyltransf_23, Methyltransferase domain	NA|403aa|down_8|NC_014533.1_217510_218719_-	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|202aa|down_9|NC_014533.1_218793_219399_+	pfam03358, FMN_red, NADPH-dependent FMN reductase
GCF_000147335.1_ASM14733v1	NC_014533	Gloeothece verrucosa PCC 7822 plasmid Cy782201, complete sequence	4	487679-488993	4,3,3,5	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,csx18	cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csx18,cas8b3,c2c9_V-U4,cmr6gr7,cas10	Unclear	CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAAC,GTTTCCAACTATTCCTATTTAACCCAATAGGTAGGG,GTTTCCAACTATTCCTATTTAACCCAATAGGTAGGG,CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAACA	36,36,36,37	0	0	NA	NA	NA:NA:NA:NA	16,17,17,16	17	Unclear	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|124aa|up_7|NC_014533.1_479676_480048_+,NA|163aa|up_4|NC_014533.1_481551_482040_+,NA|162aa|up_3|NC_014533.1_482065_482551_+,csx18|96aa|down_2|NC_014533.1_490824_491112_-,NA|128aa|down_6|NC_014533.1_495098_495482_-	NA|256aa|up_9|NC_014533.1_477951_478719_-	pfam09865, DUF2092, Predicted periplasmic protein (DUF2092)	NA|183aa|up_8|NC_014533.1_478741_479290_-	pfam10989, DUF2808, Protein of unknown function (DUF2808)	NA|124aa|up_7|NC_014533.1_479676_480048_+	NA	NA|161aa|up_6|NC_014533.1_480108_480591_+	pfam13441, Gly-zipper_YMGG, YMGG-like Gly-zipper	NA|254aa|up_5|NC_014533.1_480716_481478_+	COG5031, COQ4, Uncharacterized protein involved in ubiquinone biosynthesis [Coenzyme metabolism]	NA|163aa|up_4|NC_014533.1_481551_482040_+	NA	NA|162aa|up_3|NC_014533.1_482065_482551_+	NA	NA|748aa|up_2|NC_014533.1_483166_485410_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|113aa|up_1|NC_014533.1_485516_485855_+	pfam13630, SdpI, SdpI/YfhL protein family	NA|403aa|up_0|NC_014533.1_486349_487558_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	cas2|93aa|down_0|NC_014533.1_489190_489469_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|330aa|down_1|NC_014533.1_489594_490584_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx18|96aa|down_2|NC_014533.1_490824_491112_-	NA	NA|400aa|down_3|NC_014533.1_491793_492993_+	COG3284, AcoR, Transcriptional activator of acetoin/glycerol metabolism [Secondary metabolites biosynthesis, transport, and catabolism / Transcription]	NA|358aa|down_4|NC_014533.1_493217_494291_-	TIGR04070, photo_TT_lyase, spore photoproduct lyase	NA|192aa|down_5|NC_014533.1_494453_495029_-	pfam06037, DUF922, Bacterial protein of unknown function (DUF922)	NA|128aa|down_6|NC_014533.1_495098_495482_-	NA	NA|339aa|down_7|NC_014533.1_495904_496921_+	cd01194, INT_C_like_4, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|468aa|down_8|NC_014533.1_497356_498760_+	cd17359, MFS_XylE_like, D-xylose-proton symporter and similar transporters of the Major Facilitator Superfamily	NA|580aa|down_9|NC_014533.1_498995_500735_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family
GCF_000147335.1_ASM14733v1	NC_014533	Gloeothece verrucosa PCC 7822 plasmid Cy782201, complete sequence	5	841042-841965	6,4,4	PILER-CR,CRISPRCasFinder,CRT	no	cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1	cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csx18,cas8b3,c2c9_V-U4,cmr6gr7,cas10	Type III-C,Type III-A,Type III-D,Type III-B	GTTTCCATTCTATTAGCTTCCTCCTAAGAGGAAAAG,GTTTCCATTCTATTAGCTTCCTCCTAAGAGGAAAAG,GTTTCCATTCTATTAGCTTCCTCCTAAGAGGAAAAG	36,36,36	0	0	NA	NA	NA:NA:NA	12,12,12	12	TypeIII-C,TypeIII-A,TypeIII-D,TypeIII-B	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|361aa|up_8|NC_014533.1_828320_829403_+,cmr5gr11|160aa|up_3|NC_014533.1_835035_835515_-,NA|178aa|down_0|NC_014533.1_842241_842775_+,NA|99aa|down_1|NC_014533.1_842960_843257_-,NA|179aa|down_2|NC_014533.1_843249_843786_-,NA|103aa|down_3|NC_014533.1_843885_844194_-,NA|65aa|down_8|NC_014533.1_852548_852743_-	NA|22aa|up_9|NC_014533.1_828162_828228_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|361aa|up_8|NC_014533.1_828320_829403_+	NA	NA|73aa|up_7|NC_014533.1_829745_829964_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|128aa|up_6|NC_014533.1_829960_830344_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|482aa|up_5|NC_014533.1_830527_831973_-	pfam13546, DDE_5, DDE superfamily endonuclease	cmr6gr7|696aa|up_4|NC_014533.1_832945_835033_-	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cmr5gr11|160aa|up_3|NC_014533.1_835035_835515_-	NA	cmr4gr7|343aa|up_2|NC_014533.1_835514_836543_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|388aa|up_1|NC_014533.1_836554_837718_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|967aa|up_0|NC_014533.1_837710_840611_-	pfam12469, DUF3692, CRISPR-associated protein	NA|178aa|down_0|NC_014533.1_842241_842775_+	NA	NA|99aa|down_1|NC_014533.1_842960_843257_-	NA	NA|179aa|down_2|NC_014533.1_843249_843786_-	NA	NA|103aa|down_3|NC_014533.1_843885_844194_-	NA	cas2|97aa|down_4|NC_014533.1_844221_844512_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|672aa|down_5|NC_014533.1_844537_846553_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|503aa|down_6|NC_014533.1_849302_850811_-	sd00006, TPR, Tetratricopeptide repeat	NA|496aa|down_7|NC_014533.1_850899_852387_-	pfam13191, AAA_16, AAA ATPase domain	NA|65aa|down_8|NC_014533.1_852548_852743_-	NA	NA|345aa|down_9|NC_014533.1_852889_853924_-	pfam01609, DDE_Tnp_1, Transposase DDE domain
GCF_000147335.1_ASM14733v1	NC_014533	Gloeothece verrucosa PCC 7822 plasmid Cy782201, complete sequence	6	846786-849039	5,5,7	CRISPRCasFinder,CRT,PILER-CR	no	cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1	cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csx18,cas8b3,c2c9_V-U4,cmr6gr7,cas10	Type III-C,Type III-A,Type III-D,Type III-B	GTTTCCATTCTGTTAGTTTTCTCCTAAGAGAAAAG,GTTTCCATTCTGTTAGTTTTCTCCTAAGAGAAAAG,GTTTCCATTCTGTTAGTTTTCTCCTAAGA-GAAAAG	35,35,36	0	0	NA	NA	NA:NA:NA	31,31,30	31	TypeIII-C,TypeIII-A,TypeIII-D,TypeIII-B	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	cmr5gr11|160aa|up_9|NC_014533.1_835035_835515_-,NA|178aa|up_5|NC_014533.1_842241_842775_+,NA|99aa|up_4|NC_014533.1_842960_843257_-,NA|179aa|up_3|NC_014533.1_843249_843786_-,NA|103aa|up_2|NC_014533.1_843885_844194_-,NA|65aa|down_2|NC_014533.1_852548_852743_-,NA|1264aa|down_4|NC_014533.1_853975_857767_-,NA|92aa|down_5|NC_014533.1_857796_858072_+,NA|130aa|down_6|NC_014533.1_858447_858837_-,NA|120aa|down_7|NC_014533.1_858836_859196_-,NA|100aa|down_8|NC_014533.1_859249_859549_-,NA|81aa|down_9|NC_014533.1_859545_859788_-	cmr5gr11|160aa|up_9|NC_014533.1_835035_835515_-	NA	cmr4gr7|343aa|up_8|NC_014533.1_835514_836543_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|388aa|up_7|NC_014533.1_836554_837718_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|967aa|up_6|NC_014533.1_837710_840611_-	pfam12469, DUF3692, CRISPR-associated protein	NA|178aa|up_5|NC_014533.1_842241_842775_+	NA	NA|99aa|up_4|NC_014533.1_842960_843257_-	NA	NA|179aa|up_3|NC_014533.1_843249_843786_-	NA	NA|103aa|up_2|NC_014533.1_843885_844194_-	NA	cas2|97aa|up_1|NC_014533.1_844221_844512_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|672aa|up_0|NC_014533.1_844537_846553_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|503aa|down_0|NC_014533.1_849302_850811_-	sd00006, TPR, Tetratricopeptide repeat	NA|496aa|down_1|NC_014533.1_850899_852387_-	pfam13191, AAA_16, AAA ATPase domain	NA|65aa|down_2|NC_014533.1_852548_852743_-	NA	NA|345aa|down_3|NC_014533.1_852889_853924_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|1264aa|down_4|NC_014533.1_853975_857767_-	NA	NA|92aa|down_5|NC_014533.1_857796_858072_+	NA	NA|130aa|down_6|NC_014533.1_858447_858837_-	NA	NA|120aa|down_7|NC_014533.1_858836_859196_-	NA	NA|100aa|down_8|NC_014533.1_859249_859549_-	NA	NA|81aa|down_9|NC_014533.1_859545_859788_-	NA
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	1	125521-125703	1	CRISPRCasFinder	no	RT,Cas9_archaeal	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	Unclear	AAGTGATCAACGCTTTACAGCATCAAAGGTTAAAGCAC	38	0	0	NA	NA	I-A,I-B,II-B	2	2	Unclear	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|302aa|up_9|NC_014534.1_111896_112802_+,NA|242aa|up_8|NC_014534.1_112798_113524_+,NA|114aa|up_6|NC_014534.1_115019_115361_+,NA|154aa|up_5|NC_014534.1_115357_115819_+,NA|274aa|up_4|NC_014534.1_115815_116637_+,NA|190aa|up_3|NC_014534.1_116728_117298_+,NA|1988aa|up_1|NC_014534.1_119036_125000_-,NA|117aa|up_0|NC_014534.1_125039_125390_+,NA|154aa|down_3|NC_014534.1_128803_129265_-,NA|164aa|down_5|NC_014534.1_129786_130278_-,NA|106aa|down_9|NC_014534.1_132737_133055_-	NA|302aa|up_9|NC_014534.1_111896_112802_+	NA	NA|242aa|up_8|NC_014534.1_112798_113524_+	NA	NA|498aa|up_7|NC_014534.1_113548_115042_+	CHL00195, ycf46, Ycf46; Provisional	NA|114aa|up_6|NC_014534.1_115019_115361_+	NA	NA|154aa|up_5|NC_014534.1_115357_115819_+	NA	NA|274aa|up_4|NC_014534.1_115815_116637_+	NA	NA|190aa|up_3|NC_014534.1_116728_117298_+	NA	NA|508aa|up_2|NC_014534.1_117287_118811_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|1988aa|up_1|NC_014534.1_119036_125000_-	NA	NA|117aa|up_0|NC_014534.1_125039_125390_+	NA	NA|72aa|down_0|NC_014534.1_125861_126077_-	pfam13551, HTH_29, Winged helix-turn helix	NA|68aa|down_1|NC_014534.1_126409_126613_+	cd05117, STKc_CAMK, The catalytic domain of CAMK family Serine/Threonine Kinases	NA|213aa|down_2|NC_014534.1_127974_128613_-	pfam05685, Uma2, Putative restriction endonuclease	NA|154aa|down_3|NC_014534.1_128803_129265_-	NA	NA|126aa|down_4|NC_014534.1_129338_129716_-	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|164aa|down_5|NC_014534.1_129786_130278_-	NA	NA|170aa|down_6|NC_014534.1_130686_131196_-	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	RT|280aa|down_7|NC_014534.1_131470_132310_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|122aa|down_8|NC_014534.1_132375_132741_-	COG5550, COG5550, Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]	NA|106aa|down_9|NC_014534.1_132737_133055_-	NA
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	2	161319-161463	2	CRISPRCasFinder	no	Cas9_archaeal,cas14j	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	 or Type II-C?, Type II-B,Type II-A	TACTGGGAATTAGAACAGTACGACCAATCACTGTCGGATTTG	42	0	0	NA	NA	NA	1	1	orTypeII-C?,TypeV,TypeII-B,TypeII-A	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|65aa|up_9|NC_014534.1_149549_149744_-,NA|233aa|up_8|NC_014534.1_149795_150494_-,NA|226aa|up_2|NC_014534.1_158810_159488_-,NA|97aa|up_0|NC_014534.1_159917_160208_-,NA|53aa|down_0|NC_014534.1_162815_162974_-,NA|395aa|down_1|NC_014534.1_163134_164319_-,NA|67aa|down_2|NC_014534.1_164601_164802_-,NA|139aa|down_4|NC_014534.1_165572_165989_-,NA|119aa|down_6|NC_014534.1_169005_169362_-,NA|221aa|down_7|NC_014534.1_170610_171273_-,NA|49aa|down_8|NC_014534.1_171315_171462_-	NA|65aa|up_9|NC_014534.1_149549_149744_-	NA	NA|233aa|up_8|NC_014534.1_149795_150494_-	NA	NA|302aa|up_7|NC_014534.1_153853_154759_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|359aa|up_6|NC_014534.1_155339_156416_-	pfam14239, RRXRR, RRXRR protein	NA|293aa|up_5|NC_014534.1_156638_157517_-	TIGR03736, PRTRC_ThiF, PRTRC system ThiF family protein	NA|220aa|up_4|NC_014534.1_157449_158109_-	TIGR03735, PRTRC_A, PRTRC system protein A	NA|232aa|up_3|NC_014534.1_158112_158808_-	pfam14460, Prok-E2_D, Prokaryotic E2 family D	NA|226aa|up_2|NC_014534.1_158810_159488_-	NA	NA|137aa|up_1|NC_014534.1_159495_159906_-	TIGR03738, PRTRC_C, PRTRC system protein C	NA|97aa|up_0|NC_014534.1_159917_160208_-	NA	NA|53aa|down_0|NC_014534.1_162815_162974_-	NA	NA|395aa|down_1|NC_014534.1_163134_164319_-	NA	NA|67aa|down_2|NC_014534.1_164601_164802_-	NA	NA|221aa|down_3|NC_014534.1_164839_165502_-	cd06257, DnaJ, DnaJ domain or J-domain	NA|139aa|down_4|NC_014534.1_165572_165989_-	NA	NA|394aa|down_5|NC_014534.1_167392_168574_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|119aa|down_6|NC_014534.1_169005_169362_-	NA	NA|221aa|down_7|NC_014534.1_170610_171273_-	NA	NA|49aa|down_8|NC_014534.1_171315_171462_-	NA	NA|1381aa|down_9|NC_014534.1_171505_175648_-	cd11304, Cadherin_repeat, Cadherin tandem repeat domain
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	3	161625-161870	3	CRISPRCasFinder	no	Cas9_archaeal,cas14j	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	 or Type II-C?, Type II-B,Type II-A	TACTGGGAATTAGAACAGTACGACCAATCACTGTCGGATTTG	42	0	0	NA	NA	NA	2	2	orTypeII-C?,TypeV,TypeII-B,TypeII-A	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|65aa|up_9|NC_014534.1_149549_149744_-,NA|233aa|up_8|NC_014534.1_149795_150494_-,NA|226aa|up_2|NC_014534.1_158810_159488_-,NA|97aa|up_0|NC_014534.1_159917_160208_-,NA|53aa|down_0|NC_014534.1_162815_162974_-,NA|395aa|down_1|NC_014534.1_163134_164319_-,NA|67aa|down_2|NC_014534.1_164601_164802_-,NA|139aa|down_4|NC_014534.1_165572_165989_-,NA|119aa|down_6|NC_014534.1_169005_169362_-,NA|221aa|down_7|NC_014534.1_170610_171273_-,NA|49aa|down_8|NC_014534.1_171315_171462_-	NA|65aa|up_9|NC_014534.1_149549_149744_-	NA	NA|233aa|up_8|NC_014534.1_149795_150494_-	NA	NA|302aa|up_7|NC_014534.1_153853_154759_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|359aa|up_6|NC_014534.1_155339_156416_-	pfam14239, RRXRR, RRXRR protein	NA|293aa|up_5|NC_014534.1_156638_157517_-	TIGR03736, PRTRC_ThiF, PRTRC system ThiF family protein	NA|220aa|up_4|NC_014534.1_157449_158109_-	TIGR03735, PRTRC_A, PRTRC system protein A	NA|232aa|up_3|NC_014534.1_158112_158808_-	pfam14460, Prok-E2_D, Prokaryotic E2 family D	NA|226aa|up_2|NC_014534.1_158810_159488_-	NA	NA|137aa|up_1|NC_014534.1_159495_159906_-	TIGR03738, PRTRC_C, PRTRC system protein C	NA|97aa|up_0|NC_014534.1_159917_160208_-	NA	NA|53aa|down_0|NC_014534.1_162815_162974_-	NA	NA|395aa|down_1|NC_014534.1_163134_164319_-	NA	NA|67aa|down_2|NC_014534.1_164601_164802_-	NA	NA|221aa|down_3|NC_014534.1_164839_165502_-	cd06257, DnaJ, DnaJ domain or J-domain	NA|139aa|down_4|NC_014534.1_165572_165989_-	NA	NA|394aa|down_5|NC_014534.1_167392_168574_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|119aa|down_6|NC_014534.1_169005_169362_-	NA	NA|221aa|down_7|NC_014534.1_170610_171273_-	NA	NA|49aa|down_8|NC_014534.1_171315_171462_-	NA	NA|1381aa|down_9|NC_014534.1_171505_175648_-	cd11304, Cadherin_repeat, Cadherin tandem repeat domain
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	4	236763-237089	1,4,1	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	Type III-C,Type III-A,Type III-D,Type III-B	GTTTCCATTCTGTTAGTTTTCTCCTAAGA-GAAAAG,CTTTTCTCTTAGGAGAAAACTAACAGAATGGAAAC,CTTTTCTCTTAGGAGAAAACTAACAGAATGGAAAC	36,35,35	0	0	NA	NA	NA:NA:NA	3,4,4	4	TypeIII-C,TypeIII-A,TypeIII-D,TypeIII-B	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|201aa|up_8|NC_014534.1_222157_222760_+,NA|81aa|up_4|NC_014534.1_226733_226976_+,NA|108aa|up_3|NC_014534.1_226972_227296_+,NA|92aa|up_2|NC_014534.1_228107_228383_-,NA|103aa|down_2|NC_014534.1_239672_239981_+,NA|179aa|down_3|NC_014534.1_240080_240617_+,NA|99aa|down_4|NC_014534.1_240609_240906_+,NA|170aa|down_5|NC_014534.1_241082_241592_-	NA|345aa|up_9|NC_014534.1_220891_221926_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|201aa|up_8|NC_014534.1_222157_222760_+	NA	NA|681aa|up_7|NC_014534.1_222728_224771_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|257aa|up_6|NC_014534.1_225096_225867_-	COG3561, COG3561, Phage anti-repressor protein [Transcription]	NA|78aa|up_5|NC_014534.1_226303_226537_-	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|81aa|up_4|NC_014534.1_226733_226976_+	NA	NA|108aa|up_3|NC_014534.1_226972_227296_+	NA	NA|92aa|up_2|NC_014534.1_228107_228383_-	NA	NA|496aa|up_1|NC_014534.1_233414_234902_+	pfam13191, AAA_16, AAA ATPase domain	NA|526aa|up_0|NC_014534.1_234921_236499_+	pfam17874, TPR_MalT, MalT-like TPR region	cas1|672aa|down_0|NC_014534.1_237313_239329_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|97aa|down_1|NC_014534.1_239354_239645_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|103aa|down_2|NC_014534.1_239672_239981_+	NA	NA|179aa|down_3|NC_014534.1_240080_240617_+	NA	NA|99aa|down_4|NC_014534.1_240609_240906_+	NA	NA|170aa|down_5|NC_014534.1_241082_241592_-	NA	NA|281aa|down_6|NC_014534.1_242583_243426_+	pfam13676, TIR_2, TIR domain	cas10|967aa|down_7|NC_014534.1_243524_246425_+	pfam12469, DUF3692, CRISPR-associated protein	cmr3gr5|388aa|down_8|NC_014534.1_246417_247581_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|342aa|down_9|NC_014534.1_247592_248618_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	5	241869-242349	2,5,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	Type III-C,Type III-A,Type III-D,Type III-B	GTTTCCATTCTATTAGCTTCCTCCTAAGAGGAAAAG,CTTTTCCTCTTAGGAGGAAGCTAATAGAATGGAAAC,CTTTTCCTCTTAGGAGGAAGCTAATAGAATGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	6,6,6	6	TypeIII-C,TypeIII-A,TypeIII-D,TypeIII-B	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|108aa|up_9|NC_014534.1_226972_227296_+,NA|92aa|up_8|NC_014534.1_228107_228383_-,NA|103aa|up_3|NC_014534.1_239672_239981_+,NA|179aa|up_2|NC_014534.1_240080_240617_+,NA|99aa|up_1|NC_014534.1_240609_240906_+,NA|170aa|up_0|NC_014534.1_241082_241592_-,cmr5gr11|160aa|down_4|NC_014534.1_248624_249104_+,NA|224aa|down_6|NC_014534.1_251309_251981_-	NA|108aa|up_9|NC_014534.1_226972_227296_+	NA	NA|92aa|up_8|NC_014534.1_228107_228383_-	NA	NA|496aa|up_7|NC_014534.1_233414_234902_+	pfam13191, AAA_16, AAA ATPase domain	NA|526aa|up_6|NC_014534.1_234921_236499_+	pfam17874, TPR_MalT, MalT-like TPR region	cas1|672aa|up_5|NC_014534.1_237313_239329_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|97aa|up_4|NC_014534.1_239354_239645_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|103aa|up_3|NC_014534.1_239672_239981_+	NA	NA|179aa|up_2|NC_014534.1_240080_240617_+	NA	NA|99aa|up_1|NC_014534.1_240609_240906_+	NA	NA|170aa|up_0|NC_014534.1_241082_241592_-	NA	NA|281aa|down_0|NC_014534.1_242583_243426_+	pfam13676, TIR_2, TIR domain	cas10|967aa|down_1|NC_014534.1_243524_246425_+	pfam12469, DUF3692, CRISPR-associated protein	cmr3gr5|388aa|down_2|NC_014534.1_246417_247581_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|342aa|down_3|NC_014534.1_247592_248618_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|160aa|down_4|NC_014534.1_248624_249104_+	NA	cmr6gr7|696aa|down_5|NC_014534.1_249106_251194_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	NA|224aa|down_6|NC_014534.1_251309_251981_-	NA	NA|208aa|down_7|NC_014534.1_252088_252712_+	cd17544, REC_2_GGDEF, second phosphoacceptor receiver (REC) domain of uncharacterized GGDEF domain proteins	NA|143aa|down_8|NC_014534.1_253375_253804_+	pfam13565, HTH_32, Homeodomain-like domain	NA|92aa|down_9|NC_014534.1_254090_254366_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	6	280376-280489	6	CRISPRCasFinder	no	RT,cas14j,cas5,cas7,cas8b3,cas6	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	Unclear	ATTATGGCGACGTACAGAAGCAGCAGG	27	0	0	NA	NA	NA	1	1	TypeV	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|137aa|up_8|NC_014534.1_271370_271781_+,NA|273aa|up_7|NC_014534.1_271795_272614_+,NA|63aa|up_6|NC_014534.1_272832_273021_+,NA|47aa|up_5|NC_014534.1_273033_273174_-,NA|139aa|up_1|NC_014534.1_279153_279570_-,NA|82aa|up_0|NC_014534.1_279799_280045_-,NA|71aa|down_0|NC_014534.1_280817_281030_-,NA|133aa|down_2|NC_014534.1_282019_282418_-,NA|63aa|down_3|NC_014534.1_282511_282700_-,NA|161aa|down_5|NC_014534.1_283416_283899_-,NA|161aa|down_7|NC_014534.1_285875_286358_-,NA|73aa|down_8|NC_014534.1_286448_286667_-	NA|375aa|up_9|NC_014534.1_270235_271360_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|137aa|up_8|NC_014534.1_271370_271781_+	NA	NA|273aa|up_7|NC_014534.1_271795_272614_+	NA	NA|63aa|up_6|NC_014534.1_272832_273021_+	NA	NA|47aa|up_5|NC_014534.1_273033_273174_-	NA	RT|543aa|up_4|NC_014534.1_273242_274871_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas14j|409aa|up_3|NC_014534.1_275433_276660_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|409aa|up_2|NC_014534.1_276898_278125_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|139aa|up_1|NC_014534.1_279153_279570_-	NA	NA|82aa|up_0|NC_014534.1_279799_280045_-	NA	NA|71aa|down_0|NC_014534.1_280817_281030_-	NA	NA|155aa|down_1|NC_014534.1_281095_281560_-	pfam13619, KTSC, KTSC domain	NA|133aa|down_2|NC_014534.1_282019_282418_-	NA	NA|63aa|down_3|NC_014534.1_282511_282700_-	NA	NA|165aa|down_4|NC_014534.1_282816_283311_-	pfam12747, DdrB, DdrB-like protein	NA|161aa|down_5|NC_014534.1_283416_283899_-	NA	NA|616aa|down_6|NC_014534.1_283974_285822_-	cd16404, pNOB8_ParB_N_like, pNOB8 ParB-like N-terminal domain, plasmid partitioning system protein domain	NA|161aa|down_7|NC_014534.1_285875_286358_-	NA	NA|73aa|down_8|NC_014534.1_286448_286667_-	NA	NA|279aa|down_9|NC_014534.1_286733_287570_-	pfam06114, Peptidase_M78, IrrE N-terminal-like domain
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	7	290411-290684	7	CRISPRCasFinder	no	RT,cas14j,cas5,cas7,cas8b3,cas6	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	Unclear	TGAGCAACGCCAATAGGCAGTAAACCATGATGTAC	35	0	0	NA	NA	NA	3	3	TypeV	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|133aa|up_9|NC_014534.1_282019_282418_-,NA|63aa|up_8|NC_014534.1_282511_282700_-,NA|161aa|up_6|NC_014534.1_283416_283899_-,NA|161aa|up_4|NC_014534.1_285875_286358_-,NA|73aa|up_3|NC_014534.1_286448_286667_-,NA|137aa|up_1|NC_014534.1_287972_288383_-,NA|148aa|down_6|NC_014534.1_297774_298218_+	NA|133aa|up_9|NC_014534.1_282019_282418_-	NA	NA|63aa|up_8|NC_014534.1_282511_282700_-	NA	NA|165aa|up_7|NC_014534.1_282816_283311_-	pfam12747, DdrB, DdrB-like protein	NA|161aa|up_6|NC_014534.1_283416_283899_-	NA	NA|616aa|up_5|NC_014534.1_283974_285822_-	cd16404, pNOB8_ParB_N_like, pNOB8 ParB-like N-terminal domain, plasmid partitioning system protein domain	NA|161aa|up_4|NC_014534.1_285875_286358_-	NA	NA|73aa|up_3|NC_014534.1_286448_286667_-	NA	NA|279aa|up_2|NC_014534.1_286733_287570_-	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|137aa|up_1|NC_014534.1_287972_288383_-	NA	NA|344aa|up_0|NC_014534.1_289193_290225_+	pfam06527, TniQ, TniQ	cas5|212aa|down_0|NC_014534.1_290778_291414_-	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|302aa|down_1|NC_014534.1_291418_292324_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b3|419aa|down_2|NC_014534.1_292325_293582_-	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas6|195aa|down_3|NC_014534.1_293683_294268_-	pfam09559, Cas6, Cas6 Crispr	NA|776aa|down_4|NC_014534.1_294258_296586_+	pfam00665, rve, Integrase core domain	NA|400aa|down_5|NC_014534.1_296585_297785_+	pfam05621, TniB, Bacterial TniB protein	NA|148aa|down_6|NC_014534.1_297774_298218_+	NA	NA|256aa|down_7|NC_014534.1_298776_299544_-	pfam01035, DNA_binding_1, 6-O-methylguanine DNA methyltransferase, DNA binding domain	NA|389aa|down_8|NC_014534.1_299594_300761_-	cd16393, SPO0J_N, Thermus thermophilus stage 0 sporulation protein J-like N-terminal domain, ParB family member	NA|274aa|down_9|NC_014534.1_300745_301567_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]
GCF_000147335.1_ASM14733v1	NC_014534	Gloeothece verrucosa PCC 7822 plasmid Cy782202, complete sequence	8	389583-389685	8	CRISPRCasFinder	no	c2c5_V-U5	RT,Cas9_archaeal,cas14j,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas5,cas7,cas8b3,cas6,PD-DExK,c2c5_V-U5	Type V-U5	AGGGTTTTTGCTCTCTAAATGATTGAAAG	29	0	0	NA	NA	NA	1	1	TypeV-U5	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|85aa|up_7|NC_014534.1_380061_380316_-,NA|325aa|up_6|NC_014534.1_380610_381585_+,NA|424aa|up_5|NC_014534.1_381908_383180_+,c2c5_V-U5|224aa|up_1|NC_014534.1_386928_387600_+,NA|200aa|down_3|NC_014534.1_404392_404992_-	NA|72aa|up_9|NC_014534.1_377755_377971_+	TIGR04220, hypothetical_protein_L8106_29040, cyanobactin biosynthesis protein, PatB/AcyB/McaB family	NA|67aa|up_8|NC_014534.1_378136_378337_+	TIGR04447, hypothetical_protein, cyanobactin cluster PatC/TenC/TruC protein	NA|85aa|up_7|NC_014534.1_380061_380316_-	NA	NA|325aa|up_6|NC_014534.1_380610_381585_+	NA	NA|424aa|up_5|NC_014534.1_381908_383180_+	NA	NA|663aa|up_4|NC_014534.1_383400_385389_+	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|306aa|up_3|NC_014534.1_385395_386313_+	pfam13401, AAA_22, AAA domain	NA|164aa|up_2|NC_014534.1_386319_386811_+	pfam06527, TniQ, TniQ	c2c5_V-U5|224aa|up_1|NC_014534.1_386928_387600_+	NA	c2c5_V-U5|496aa|up_0|NC_014534.1_387487_388975_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|1007aa|down_0|NC_014534.1_390511_393532_-	COG3378, COG3378, Phage associated DNA primase [General function prediction only]	NA|1674aa|down_1|NC_014534.1_395901_400923_+	smart00457, MACPF, membrane-attack complex / perforin	NA|383aa|down_2|NC_014534.1_401717_402866_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|200aa|down_3|NC_014534.1_404392_404992_-	NA	NA|353aa|down_4|NC_014534.1_405138_406197_-	cd02577, PSTD1, Pseudouridine synthase, a subgroup of the TruD family	NA|186aa|down_5|NC_014534.1_406731_407289_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|229aa|down_6|NC_014534.1_407297_407984_-	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|468aa|down_7|NC_014534.1_408103_409507_-	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|267aa|down_8|NC_014534.1_409836_410637_-	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|264aa|down_9|NC_014534.1_410642_411434_-	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	1	51929-52059	1	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	ACCATCAGCCCAACCTCTACCCC	23	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|370aa|up_2|NC_014501.1_48886_49996_-,NA|251aa|down_6|NC_014501.1_59682_60435_+	NA|266aa|up_9|NC_014501.1_39816_40614_-	pfam02517, Abi, CAAX protease self-immunity	NA|423aa|up_8|NC_014501.1_40677_41946_-	pfam05673, DUF815, Protein of unknown function (DUF815)	NA|429aa|up_7|NC_014501.1_42011_43298_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|368aa|up_6|NC_014501.1_43786_44890_+	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]	NA|267aa|up_5|NC_014501.1_44895_45696_+	COG3166, PilN, Tfp pilus assembly protein PilN [Cell motility and secretion / Intracellular trafficking and secretion]	NA|253aa|up_4|NC_014501.1_45692_46451_+	pfam04350, PilO, Pilus assembly protein, PilO	NA|622aa|up_3|NC_014501.1_46883_48749_+	COG4796, HofQ, Type II secretory pathway, component HofQ [Intracellular trafficking and secretion]	NA|370aa|up_2|NC_014501.1_48886_49996_-	NA	NA|396aa|up_1|NC_014501.1_50144_51332_-	TIGR04261, putative_arylsulfatase_regulatory_protein, radical SAM/SPASM domain protein, GRRM system	NA|134aa|up_0|NC_014501.1_51356_51758_-	TIGR04260, hypothetical_protein, rSAM-associated Gly-rich repeat protein	NA|296aa|down_0|NC_014501.1_52446_53334_+	TIGR04262, possible_ABC_transporter_solute_binding_protein, extracellular substrate-binding orphan protein, GRRM family	NA|396aa|down_1|NC_014501.1_53382_54570_+	pfam02374, ArsA_ATPase, Anion-transporting ATPase	NA|517aa|down_2|NC_014501.1_54697_56248_-	COG5526, COG5526, Lysozyme family protein [General function prediction only]	NA|278aa|down_3|NC_014501.1_56587_57421_-	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|330aa|down_4|NC_014501.1_57486_58476_+	TIGR03466, HpnA, hopanoid-associated sugar epimerase	NA|291aa|down_5|NC_014501.1_58792_59665_+	cd19088, AKR_AKR13B1, AKR13B family of aldo-keto reductase (AKR)	NA|251aa|down_6|NC_014501.1_59682_60435_+	NA	NA|170aa|down_7|NC_014501.1_60445_60955_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|207aa|down_8|NC_014501.1_61656_62277_+	COG1713, COG1713, Predicted HD superfamily hydrolase involved in NAD metabolism [Coenzyme metabolism]	NA|145aa|down_9|NC_014501.1_62312_62747_+	pfam02410, RsfS, Ribosomal silencing factor during starvation
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	2	499730-499830	2	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	GGCGGGTTCGGTTGACCCAAATA	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|108aa|up_9|NC_014501.1_491183_491507_-,NA|76aa|up_8|NC_014501.1_491506_491734_-,NA|1191aa|up_6|NC_014501.1_492522_496095_-,NA|109aa|up_5|NC_014501.1_496158_496485_+,NA|189aa|up_4|NC_014501.1_496497_497064_-,NA|92aa|up_3|NC_014501.1_497085_497361_-,NA|63aa|up_2|NC_014501.1_497357_497546_-,NA|139aa|up_1|NC_014501.1_497977_498394_-,NA|61aa|down_0|NC_014501.1_501421_501604_+,NA|146aa|down_1|NC_014501.1_501557_501995_+,NA|49aa|down_4|NC_014501.1_503885_504032_-,NA|73aa|down_5|NC_014501.1_504195_504414_-,NA|330aa|down_7|NC_014501.1_508905_509895_-	NA|108aa|up_9|NC_014501.1_491183_491507_-	NA	NA|76aa|up_8|NC_014501.1_491506_491734_-	NA	NA|79aa|up_7|NC_014501.1_492189_492426_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|1191aa|up_6|NC_014501.1_492522_496095_-	NA	NA|109aa|up_5|NC_014501.1_496158_496485_+	NA	NA|189aa|up_4|NC_014501.1_496497_497064_-	NA	NA|92aa|up_3|NC_014501.1_497085_497361_-	NA	NA|63aa|up_2|NC_014501.1_497357_497546_-	NA	NA|139aa|up_1|NC_014501.1_497977_498394_-	NA	NA|393aa|up_0|NC_014501.1_498371_499550_-	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|61aa|down_0|NC_014501.1_501421_501604_+	NA	NA|146aa|down_1|NC_014501.1_501557_501995_+	NA	NA|138aa|down_2|NC_014501.1_502584_502998_+	pfam06271, RDD, RDD family	NA|162aa|down_3|NC_014501.1_503369_503855_+	pfam04134, DUF393, Protein of unknown function, DUF393	NA|49aa|down_4|NC_014501.1_503885_504032_-	NA	NA|73aa|down_5|NC_014501.1_504195_504414_-	NA	NA|721aa|down_6|NC_014501.1_505819_507982_+	COG0550, TopA, Topoisomerase IA [DNA replication, recombination, and repair]	NA|330aa|down_7|NC_014501.1_508905_509895_-	NA	NA|122aa|down_8|NC_014501.1_509936_510302_-	cd08351, ChaP_like, ChaP, an enzyme involved in the biosynthesis of the antitumor agent chartreusin (cha), and similar proteins	NA|110aa|down_9|NC_014501.1_510842_511172_+	cd17282, RMtype1_S_Eco16444ORF1681_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Escherichia coli G4/9 S subunit (S
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	3	563929-564025	3	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	ACAACAAAACGCTATAGAAATAAAT	25	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|34aa|up_6|NC_014501.1_551658_551760_-,NA|185aa|up_2|NC_014501.1_557446_558001_+,NA|125aa|up_1|NC_014501.1_558778_559153_+,NA|106aa|up_0|NC_014501.1_559565_559883_+,NA|89aa|down_0|NC_014501.1_564426_564693_-,NA|477aa|down_1|NC_014501.1_565317_566748_-,NA|122aa|down_2|NC_014501.1_566757_567123_-,NA|124aa|down_6|NC_014501.1_575082_575454_+,NA|303aa|down_7|NC_014501.1_575869_576778_-,NA|285aa|down_8|NC_014501.1_577324_578179_+,NA|214aa|down_9|NC_014501.1_578412_579054_+	NA|580aa|up_9|NC_014501.1_544915_546655_+	pfam13676, TIR_2, TIR domain	NA|981aa|up_8|NC_014501.1_546657_549600_+	sd00006, TPR, Tetratricopeptide repeat	NA|574aa|up_7|NC_014501.1_549897_551619_-	sd00006, TPR, Tetratricopeptide repeat	NA|34aa|up_6|NC_014501.1_551658_551760_-	NA	NA|300aa|up_5|NC_014501.1_551777_552677_-	sd00006, TPR, Tetratricopeptide repeat	NA|694aa|up_4|NC_014501.1_552712_554794_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|851aa|up_3|NC_014501.1_554786_557339_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|185aa|up_2|NC_014501.1_557446_558001_+	NA	NA|125aa|up_1|NC_014501.1_558778_559153_+	NA	NA|106aa|up_0|NC_014501.1_559565_559883_+	NA	NA|89aa|down_0|NC_014501.1_564426_564693_-	NA	NA|477aa|down_1|NC_014501.1_565317_566748_-	NA	NA|122aa|down_2|NC_014501.1_566757_567123_-	NA	NA|1221aa|down_3|NC_014501.1_567234_570897_-	pfam12770, CHAT, CHAT domain	NA|420aa|down_4|NC_014501.1_570977_572237_-	pfam07693, KAP_NTPase, KAP family P-loop domain	NA|691aa|down_5|NC_014501.1_572238_574311_-	cd15832, SNAP, Soluble N-ethylmaleimide-sensitive factor (NSF) Attachment Protein family	NA|124aa|down_6|NC_014501.1_575082_575454_+	NA	NA|303aa|down_7|NC_014501.1_575869_576778_-	NA	NA|285aa|down_8|NC_014501.1_577324_578179_+	NA	NA|214aa|down_9|NC_014501.1_578412_579054_+	NA
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	4	853231-853461	4	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	CTTCCCCCAGAAATTGGGCAACT	23	0	0	NA	NA	NA	3	3	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|152aa|up_9|NC_014501.1_838506_838962_+,NA|65aa|up_3|NC_014501.1_849304_849499_+,NA|385aa|up_1|NC_014501.1_851010_852165_-,NA|83aa|down_0|NC_014501.1_856613_856862_+,NA|67aa|down_9|NC_014501.1_866626_866827_-	NA|152aa|up_9|NC_014501.1_838506_838962_+	NA	NA|373aa|up_8|NC_014501.1_839042_840161_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|366aa|up_7|NC_014501.1_840358_841456_+	pfam14239, RRXRR, RRXRR protein	NA|62aa|up_6|NC_014501.1_841726_841912_-	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|907aa|up_5|NC_014501.1_842295_845016_-	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|1187aa|up_4|NC_014501.1_845311_848872_+	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|65aa|up_3|NC_014501.1_849304_849499_+	NA	NA|479aa|up_2|NC_014501.1_849571_851008_-	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|385aa|up_1|NC_014501.1_851010_852165_-	NA	NA|201aa|up_0|NC_014501.1_852389_852992_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|83aa|down_0|NC_014501.1_856613_856862_+	NA	NA|424aa|down_1|NC_014501.1_857253_858525_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|498aa|down_2|NC_014501.1_858696_860190_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|89aa|down_3|NC_014501.1_860326_860593_-	pfam05768, DUF836, Glutaredoxin-like domain (DUF836)	NA|193aa|down_4|NC_014501.1_860627_861206_+	pfam05685, Uma2, Putative restriction endonuclease	NA|321aa|down_5|NC_014501.1_861267_862230_-	cd09279, RNase_HI_like, RNAse HI family that includes archaeal, some bacterial as well as plant RNase HI	NA|489aa|down_6|NC_014501.1_862626_864093_+	pfam13413, HTH_25, Helix-turn-helix domain	NA|483aa|down_7|NC_014501.1_864221_865670_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|296aa|down_8|NC_014501.1_865742_866630_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|67aa|down_9|NC_014501.1_866626_866827_-	NA
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	5	853645-853943	5,1	CRISPRCasFinder,PILER-CR	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	CTTCCCCCAGAAATTGGGCAACT,CTCCCCCCAGAAATCGGACAACT	23,23	0	0	NA	NA	NA:NA	4,3	4	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|152aa|up_9|NC_014501.1_838506_838962_+,NA|65aa|up_3|NC_014501.1_849304_849499_+,NA|385aa|up_1|NC_014501.1_851010_852165_-,NA|83aa|down_0|NC_014501.1_856613_856862_+,NA|67aa|down_9|NC_014501.1_866626_866827_-	NA|152aa|up_9|NC_014501.1_838506_838962_+	NA	NA|373aa|up_8|NC_014501.1_839042_840161_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|366aa|up_7|NC_014501.1_840358_841456_+	pfam14239, RRXRR, RRXRR protein	NA|62aa|up_6|NC_014501.1_841726_841912_-	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|907aa|up_5|NC_014501.1_842295_845016_-	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|1187aa|up_4|NC_014501.1_845311_848872_+	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|65aa|up_3|NC_014501.1_849304_849499_+	NA	NA|479aa|up_2|NC_014501.1_849571_851008_-	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|385aa|up_1|NC_014501.1_851010_852165_-	NA	NA|201aa|up_0|NC_014501.1_852389_852992_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|83aa|down_0|NC_014501.1_856613_856862_+	NA	NA|424aa|down_1|NC_014501.1_857253_858525_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|498aa|down_2|NC_014501.1_858696_860190_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|89aa|down_3|NC_014501.1_860326_860593_-	pfam05768, DUF836, Glutaredoxin-like domain (DUF836)	NA|193aa|down_4|NC_014501.1_860627_861206_+	pfam05685, Uma2, Putative restriction endonuclease	NA|321aa|down_5|NC_014501.1_861267_862230_-	cd09279, RNase_HI_like, RNAse HI family that includes archaeal, some bacterial as well as plant RNase HI	NA|489aa|down_6|NC_014501.1_862626_864093_+	pfam13413, HTH_25, Helix-turn-helix domain	NA|483aa|down_7|NC_014501.1_864221_865670_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|296aa|down_8|NC_014501.1_865742_866630_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|67aa|down_9|NC_014501.1_866626_866827_-	NA
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	6	927016-927213	1	CRT	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	AAGTCTCTGAGGAANTCT	18	3	3	927034-927063|927082-927111|927166-927195	NC_014501.1_927190-927219|NC_014501.1_927190-927219|NC_014501.1_927010-927039	NA	4	4	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|67aa|up_8|NC_014501.1_918205_918406_-,NA|83aa|up_7|NC_014501.1_918392_918641_-,NA|74aa|up_5|NC_014501.1_921240_921462_-,NA|254aa|up_4|NC_014501.1_921467_922229_-,NA|65aa|up_3|NC_014501.1_922225_922420_-,NA|104aa|up_2|NC_014501.1_922410_922722_-,NA|173aa|up_0|NC_014501.1_923502_924021_-,NA|409aa|down_2|NC_014501.1_929937_931164_-,NA|184aa|down_3|NC_014501.1_931300_931852_-,NA|74aa|down_5|NC_014501.1_933162_933384_+,NA|371aa|down_7|NC_014501.1_933761_934874_-,NA|58aa|down_8|NC_014501.1_937990_938164_-	NA|205aa|up_9|NC_014501.1_917495_918110_+	cd00736, lambda_lys-like, Bacteriophage lambda lysozyme and similar proteins	NA|67aa|up_8|NC_014501.1_918205_918406_-	NA	NA|83aa|up_7|NC_014501.1_918392_918641_-	NA	NA|873aa|up_6|NC_014501.1_918640_921259_-	pfam12965, DUF3854, Domain of unknown function (DUF3854)	NA|74aa|up_5|NC_014501.1_921240_921462_-	NA	NA|254aa|up_4|NC_014501.1_921467_922229_-	NA	NA|65aa|up_3|NC_014501.1_922225_922420_-	NA	NA|104aa|up_2|NC_014501.1_922410_922722_-	NA	NA|85aa|up_1|NC_014501.1_923185_923440_+	pfam01381, HTH_3, Helix-turn-helix	NA|173aa|up_0|NC_014501.1_923502_924021_-	NA	NA|357aa|down_0|NC_014501.1_927737_928808_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|87aa|down_1|NC_014501.1_929277_929538_+	pfam06806, DUF1233, Putative excisionase (DUF1233)	NA|409aa|down_2|NC_014501.1_929937_931164_-	NA	NA|184aa|down_3|NC_014501.1_931300_931852_-	NA	NA|231aa|down_4|NC_014501.1_931994_932687_-	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|74aa|down_5|NC_014501.1_933162_933384_+	NA	NA|95aa|down_6|NC_014501.1_933383_933668_+	COG3041, COG3041, Uncharacterized protein conserved in bacteria [Function unknown]	NA|371aa|down_7|NC_014501.1_933761_934874_-	NA	NA|58aa|down_8|NC_014501.1_937990_938164_-	NA	NA|192aa|down_9|NC_014501.1_938156_938732_-	COG4185, COG4185, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	7	1320517-1320611	6	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	TAGTCATTAGTCGTTAGTTGTTAGTC	26	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|251aa|up_7|NC_014501.1_1311395_1312148_-,NA|67aa|up_6|NC_014501.1_1312190_1312391_-,NA|69aa|up_5|NC_014501.1_1312507_1312714_-,NA|226aa|down_6|NC_014501.1_1328778_1329456_+,NA|196aa|down_7|NC_014501.1_1329557_1330145_-	NA|484aa|up_9|NC_014501.1_1308766_1310218_+	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|263aa|up_8|NC_014501.1_1310621_1311410_+	PLN02939, PLN02939, transferase, transferring glycosyl groups	NA|251aa|up_7|NC_014501.1_1311395_1312148_-	NA	NA|67aa|up_6|NC_014501.1_1312190_1312391_-	NA	NA|69aa|up_5|NC_014501.1_1312507_1312714_-	NA	NA|673aa|up_4|NC_014501.1_1312821_1314840_-	TIGR02442, Uncharacterized_protein_Rv2850c/MT2916, cobaltochelatase subunit	NA|48aa|up_3|NC_014501.1_1315029_1315173_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|283aa|up_2|NC_014501.1_1315360_1316209_-	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|305aa|up_1|NC_014501.1_1316735_1317650_+	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|402aa|up_0|NC_014501.1_1317723_1318929_+	cd03220, ABC_KpsT_Wzt, ATP-binding cassette component of polysaccharide transport system	NA|685aa|down_0|NC_014501.1_1320807_1322862_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|175aa|down_1|NC_014501.1_1323134_1323659_+	pfam14221, DUF4330, Domain of unknown function (DUF4330)	NA|571aa|down_2|NC_014501.1_1323965_1325678_-	COG4249, COG4249, Uncharacterized protein containing caspase domain [General function prediction only]	NA|573aa|down_3|NC_014501.1_1325762_1327481_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|60aa|down_4|NC_014501.1_1327696_1327876_+	CHL00152, rpl32, ribosomal protein L32; Validated	NA|199aa|down_5|NC_014501.1_1328036_1328633_+	cd02109, arch_bact_SO_family_Moco, bacterial and archael members of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|226aa|down_6|NC_014501.1_1328778_1329456_+	NA	NA|196aa|down_7|NC_014501.1_1329557_1330145_-	NA	NA|354aa|down_8|NC_014501.1_1330321_1331383_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|80aa|down_9|NC_014501.1_1331461_1331701_+	pfam08972, DUF1902, Domain of unknown function (DUF1902)
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	8	1333852-1334337	2	CRT	no	cas3	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Unclear	GGCGTGGCGGNNGGCGTG	18	0	0	NA	NA	NA	11	11	Unclear	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|226aa|up_4|NC_014501.1_1328778_1329456_+,NA|196aa|up_3|NC_014501.1_1329557_1330145_-,NA|113aa|down_3|NC_014501.1_1339444_1339783_-,NA|301aa|down_6|NC_014501.1_1341888_1342791_+,NA|70aa|down_7|NC_014501.1_1342895_1343105_-	NA|175aa|up_9|NC_014501.1_1323134_1323659_+	pfam14221, DUF4330, Domain of unknown function (DUF4330)	NA|571aa|up_8|NC_014501.1_1323965_1325678_-	COG4249, COG4249, Uncharacterized protein containing caspase domain [General function prediction only]	NA|573aa|up_7|NC_014501.1_1325762_1327481_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|60aa|up_6|NC_014501.1_1327696_1327876_+	CHL00152, rpl32, ribosomal protein L32; Validated	NA|199aa|up_5|NC_014501.1_1328036_1328633_+	cd02109, arch_bact_SO_family_Moco, bacterial and archael members of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|226aa|up_4|NC_014501.1_1328778_1329456_+	NA	NA|196aa|up_3|NC_014501.1_1329557_1330145_-	NA	NA|354aa|up_2|NC_014501.1_1330321_1331383_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|80aa|up_1|NC_014501.1_1331461_1331701_+	pfam08972, DUF1902, Domain of unknown function (DUF1902)	NA|358aa|up_0|NC_014501.1_1331762_1332836_+	smart00382, AAA, ATPases associated with a variety of cellular activities	NA|311aa|down_0|NC_014501.1_1336267_1337200_-	PLN02632, PLN02632, phytoene synthase	NA|474aa|down_1|NC_014501.1_1337359_1338781_-	TIGR02731, Phytoene_dehydrogenase_chloroplastic/chromoplastic, phytoene desaturase	NA|132aa|down_2|NC_014501.1_1338970_1339366_+	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|113aa|down_3|NC_014501.1_1339444_1339783_-	NA	NA|271aa|down_4|NC_014501.1_1340003_1340816_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|287aa|down_5|NC_014501.1_1340868_1341729_+	PRK12332, tsf, elongation factor Ts; Reviewed	NA|301aa|down_6|NC_014501.1_1341888_1342791_+	NA	NA|70aa|down_7|NC_014501.1_1342895_1343105_-	NA	cas3|819aa|down_8|NC_014501.1_1343403_1345860_+	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|913aa|down_9|NC_014501.1_1346033_1348772_+	COG1042, COG1042, Acyl-CoA synthetase (NDP forming) [Energy production and conversion]
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	9	1837026-1837232	3	CRT	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	NAGATTGAGATTCTGTATCNTTATAATC	28	0	0	NA	NA	NA	4	4	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|95aa|up_7|NC_014501.1_1831479_1831764_+,NA|74aa|up_6|NC_014501.1_1831826_1832048_+,NA|67aa|down_6|NC_014501.1_1845533_1845734_+	NA|159aa|up_9|NC_014501.1_1826033_1826510_+	cd06063, H2MP_Cyano-H2up, This group of endopeptidases include HupW enzymes that are specific to the cyanobacterial hydrogenase and are involved in the C-terminal cleavage of the hydrogenase large subunit precursor protein	NA|399aa|up_8|NC_014501.1_1826665_1827862_+	cd08152, y4iL_like, Catalase-like heme-binding proteins similar to the uncharacterized y4iL	NA|95aa|up_7|NC_014501.1_1831479_1831764_+	NA	NA|74aa|up_6|NC_014501.1_1831826_1832048_+	NA	NA|155aa|up_5|NC_014501.1_1832202_1832667_+	PHA01886, PHA01886, TM2 domain-containing protein	NA|351aa|up_4|NC_014501.1_1832686_1833739_+	PRK04204, PRK04204, RNA 3'-terminal phosphate cyclase	NA|431aa|up_3|NC_014501.1_1833818_1835111_+	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional	NA|210aa|up_2|NC_014501.1_1835180_1835810_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|170aa|up_1|NC_014501.1_1835877_1836387_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|175aa|up_0|NC_014501.1_1836437_1836962_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|41aa|down_0|NC_014501.1_1838189_1838312_-	pfam06596, PsbX, Photosystem II reaction centre X protein (PsbX)	NA|522aa|down_1|NC_014501.1_1838602_1840168_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|202aa|down_2|NC_014501.1_1840287_1840893_+	pfam11237, DUF3038, Protein of unknown function (DUF3038)	NA|546aa|down_3|NC_014501.1_1840946_1842584_+	pfam14233, DUF4335, Domain of unknown function (DUF4335)	NA|242aa|down_4|NC_014501.1_1842723_1843449_+	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|459aa|down_5|NC_014501.1_1843636_1845013_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|67aa|down_6|NC_014501.1_1845533_1845734_+	NA	NA|173aa|down_7|NC_014501.1_1845829_1846348_-	cd16339, CpcS, S-type phycobiliprotein (PBP) lyase	NA|224aa|down_8|NC_014501.1_1846436_1847108_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|177aa|down_9|NC_014501.1_1847262_1847793_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	10	2470646-2470761	7	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	GTACAACGGATGGAAGGGCCCAATGAAGAGGAAGAACTGAA	41	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|112aa|up_3|NC_014501.1_2465596_2465932_+,NA|210aa|down_2|NC_014501.1_2475269_2475899_-,NA|195aa|down_5|NC_014501.1_2479925_2480510_+,NA|240aa|down_7|NC_014501.1_2482483_2483203_-,NA|228aa|down_8|NC_014501.1_2483218_2483902_-	NA|454aa|up_9|NC_014501.1_2460229_2461591_+	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|232aa|up_8|NC_014501.1_2461668_2462364_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|145aa|up_7|NC_014501.1_2462390_2462825_-	cd08357, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) familyprotein, glyoxalase I, and type I ring-cleaving dioxygenases	NA|196aa|up_6|NC_014501.1_2462888_2463476_+	PRK00889, PRK00889, adenylylsulfate kinase; Provisional	NA|329aa|up_5|NC_014501.1_2463544_2464531_-	PRK05949, PRK05949, RNA polymerase sigma factor; Validated	NA|192aa|up_4|NC_014501.1_2464945_2465521_-	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|112aa|up_3|NC_014501.1_2465596_2465932_+	NA	NA|101aa|up_2|NC_014501.1_2466059_2466362_-	CHL00074, rps14, ribosomal protein S14	NA|562aa|up_1|NC_014501.1_2466553_2468239_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|419aa|up_0|NC_014501.1_2468805_2470062_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|571aa|down_0|NC_014501.1_2471204_2472917_-	sd00006, TPR, Tetratricopeptide repeat	NA|589aa|down_1|NC_014501.1_2473290_2475057_-	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|210aa|down_2|NC_014501.1_2475269_2475899_-	NA	NA|893aa|down_3|NC_014501.1_2476044_2478723_-	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|171aa|down_4|NC_014501.1_2479135_2479648_+	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|195aa|down_5|NC_014501.1_2479925_2480510_+	NA	NA|546aa|down_6|NC_014501.1_2480725_2482363_-	PRK06184, PRK06184, hypothetical protein; Provisional	NA|240aa|down_7|NC_014501.1_2482483_2483203_-	NA	NA|228aa|down_8|NC_014501.1_2483218_2483902_-	NA	NA|326aa|down_9|NC_014501.1_2484280_2485258_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	11	2548707-2548805	8	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	CGTATAGCCCCACCCTTCAGGGTG	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|409aa|up_8|NC_014501.1_2538048_2539275_+,NA|135aa|down_2|NC_014501.1_2551751_2552156_+,NA|258aa|down_3|NC_014501.1_2552182_2552956_+,NA|189aa|down_8|NC_014501.1_2558889_2559456_+	NA|450aa|up_9|NC_014501.1_2536648_2537998_+	TIGR00911, High-affinity_methionine_permease, L-type amino acid transporter	NA|409aa|up_8|NC_014501.1_2538048_2539275_+	NA	NA|317aa|up_7|NC_014501.1_2539628_2540579_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|262aa|up_6|NC_014501.1_2540571_2541357_+	cd01000, PBP2_Cys_DEBP_like, Substrate-binding domain of cysteine- and aspartate/glutamate-binding proteins; the type 2 periplasmic-binding protein fold	NA|177aa|up_5|NC_014501.1_2541538_2542069_+	sd00006, TPR, Tetratricopeptide repeat	NA|404aa|up_4|NC_014501.1_2542134_2543346_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|819aa|up_3|NC_014501.1_2543674_2546131_-	COG3972, COG3972, Superfamily I DNA and RNA helicases [General function prediction only]	NA|163aa|up_2|NC_014501.1_2546246_2546735_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|135aa|up_1|NC_014501.1_2546946_2547351_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|309aa|up_0|NC_014501.1_2547576_2548503_-	cd00180, PKc, Catalytic domain of Protein Kinases	NA|733aa|down_0|NC_014501.1_2548877_2551076_-	COG3957, COG3957, Phosphoketolase [Carbohydrate transport and metabolism]	NA|156aa|down_1|NC_014501.1_2551273_2551741_+	COG3600, GepA, Uncharacterized phage-associated protein [Function unknown]	NA|135aa|down_2|NC_014501.1_2551751_2552156_+	NA	NA|258aa|down_3|NC_014501.1_2552182_2552956_+	NA	NA|601aa|down_4|NC_014501.1_2553090_2554893_+	PRK07431, PRK07431, aspartate kinase; Provisional	NA|493aa|down_5|NC_014501.1_2555104_2556583_+	COG0737, UshA, 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases [Nucleotide transport and metabolism]	NA|375aa|down_6|NC_014501.1_2556632_2557757_+	cd02647, nuc_hydro_TvIAG, nuc_hydro_ TvIAG:  Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax	NA|325aa|down_7|NC_014501.1_2557895_2558870_+	cd19093, AKR_AtPLR-like, Arabidopsis thaliana pyridoxal reductase (PLR) and similar proteins	NA|189aa|down_8|NC_014501.1_2558889_2559456_+	NA	NA|334aa|down_9|NC_014501.1_2559458_2560460_+	PRK05731, PRK05731, thiamine monophosphate kinase; Provisional
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	12	2644547-2644666	9	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	GGAATGGGAGACTGTTCAACCACTGGGGGT	30	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|111aa|up_9|NC_014501.1_2630242_2630575_-,NA|200aa|up_8|NC_014501.1_2632573_2633173_+,NA|57aa|up_7|NC_014501.1_2633327_2633498_-,NA|166aa|up_6|NC_014501.1_2633763_2634261_-,NA|161aa|up_5|NC_014501.1_2634789_2635272_+,NA|77aa|up_4|NC_014501.1_2635498_2635729_-,NA	NA|111aa|up_9|NC_014501.1_2630242_2630575_-	NA	NA|200aa|up_8|NC_014501.1_2632573_2633173_+	NA	NA|57aa|up_7|NC_014501.1_2633327_2633498_-	NA	NA|166aa|up_6|NC_014501.1_2633763_2634261_-	NA	NA|161aa|up_5|NC_014501.1_2634789_2635272_+	NA	NA|77aa|up_4|NC_014501.1_2635498_2635729_-	NA	NA|398aa|up_3|NC_014501.1_2639633_2640827_+	COG3284, AcoR, Transcriptional activator of acetoin/glycerol metabolism [Secondary metabolites biosynthesis, transport, and catabolism / Transcription]	NA|129aa|up_2|NC_014501.1_2640829_2641216_+	pfam00072, Response_reg, Response regulator receiver domain	NA|294aa|up_1|NC_014501.1_2641394_2642276_-	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|327aa|up_0|NC_014501.1_2642331_2643312_-	cd01051, Mn_catalase, Manganese catalase, ferritin-like diiron-binding domain	NA|394aa|down_0|NC_014501.1_2646068_2647250_-	PRK05643, PRK05643, DNA polymerase III subunit beta; Validated	NA|485aa|down_1|NC_014501.1_2647553_2649008_-	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|464aa|down_2|NC_014501.1_2649032_2650424_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|337aa|down_3|NC_014501.1_2650562_2651573_-	CHL00180, rbcR, LysR transcriptional regulator; Provisional	NA|233aa|down_4|NC_014501.1_2651827_2652526_+	COG4094, COG4094, Predicted membrane protein [Function unknown]	NA|114aa|down_5|NC_014501.1_2652677_2653019_+	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|218aa|down_6|NC_014501.1_2653026_2653680_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|143aa|down_7|NC_014501.1_2653703_2654132_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|113aa|down_8|NC_014501.1_2654236_2654575_+	pfam00543, P-II, Nitrogen regulatory protein P-II	NA|190aa|down_9|NC_014501.1_2654641_2655211_-	cd03134, GATase1_PfpI_like, A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	13	3470803-3470892	10	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	AGCTTGCTCAATTTCTTGTAACTG	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|232aa|up_4|NC_014501.1_3462844_3463540_-,NA|69aa|up_1|NC_014501.1_3465766_3465973_-,NA|115aa|down_5|NC_014501.1_3482347_3482692_-	NA|474aa|up_9|NC_014501.1_3455491_3456913_+	COG1316, LytR, Transcriptional regulator [Transcription]	NA|337aa|up_8|NC_014501.1_3456985_3457996_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|655aa|up_7|NC_014501.1_3458087_3460052_+	TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ	NA|683aa|up_6|NC_014501.1_3460081_3462130_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|185aa|up_5|NC_014501.1_3462203_3462758_-	pfam05685, Uma2, Putative restriction endonuclease	NA|232aa|up_4|NC_014501.1_3462844_3463540_-	NA	NA|152aa|up_3|NC_014501.1_3463648_3464104_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|463aa|up_2|NC_014501.1_3464335_3465724_-	PRK00855, PRK00855, argininosuccinate lyase; Provisional	NA|69aa|up_1|NC_014501.1_3465766_3465973_-	NA	NA|479aa|up_0|NC_014501.1_3466028_3467465_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|313aa|down_0|NC_014501.1_3471671_3472610_-	cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase	NA|946aa|down_1|NC_014501.1_3473076_3475914_-	cd01951, lectin_L-type, legume lectins	NA|268aa|down_2|NC_014501.1_3476454_3477258_-	pfam13369, Transglut_core2, Transglutaminase-like superfamily	NA|515aa|down_3|NC_014501.1_3477334_3478879_-	TIGR01790, Uncharacterized_carotenoid_cyclase_DR_0801, lycopene cyclase family protein	NA|617aa|down_4|NC_014501.1_3479155_3481006_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|115aa|down_5|NC_014501.1_3482347_3482692_-	NA	NA|278aa|down_6|NC_014501.1_3483187_3484021_-	sd00006, TPR, Tetratricopeptide repeat	NA|182aa|down_7|NC_014501.1_3484119_3484665_-	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|392aa|down_8|NC_014501.1_3484952_3486128_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|231aa|down_9|NC_014501.1_3486141_3486834_-	cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase (HisPPase) AMP bound
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	14	3866790-3866878	11	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	GAAGGTGGTGAAGGCGGCGAAGGCGG	26	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|67aa|up_9|NC_014501.1_3851002_3851203_-,NA|74aa|up_7|NC_014501.1_3858976_3859198_-,NA|61aa|up_5|NC_014501.1_3860749_3860932_+,NA|507aa|up_2|NC_014501.1_3863073_3864594_+,NA|322aa|down_8|NC_014501.1_3879279_3880245_+,NA|120aa|down_9|NC_014501.1_3880749_3881109_+	NA|67aa|up_9|NC_014501.1_3851002_3851203_-	NA	NA|678aa|up_8|NC_014501.1_3851344_3853378_-	cd07338, M48B_HtpX_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|74aa|up_7|NC_014501.1_3858976_3859198_-	NA	NA|194aa|up_6|NC_014501.1_3859783_3860365_+	COG3124, COG3124, Uncharacterized protein conserved in bacteria [Function unknown]	NA|61aa|up_5|NC_014501.1_3860749_3860932_+	NA	NA|304aa|up_4|NC_014501.1_3861263_3862175_-	PLN02679, PLN02679, hydrolase, alpha/beta fold family protein	NA|177aa|up_3|NC_014501.1_3862417_3862948_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|507aa|up_2|NC_014501.1_3863073_3864594_+	NA	NA|217aa|up_1|NC_014501.1_3864747_3865398_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|335aa|up_0|NC_014501.1_3865548_3866553_+	cd11024, CuRO_1_2DMCO_NIR_like, The cupredoxin domain 1 of a two-domain laccase related to nitrite reductase	NA|221aa|down_0|NC_014501.1_3867559_3868222_+	PRK05467, PRK05467, Fe(II)-dependent oxygenase superfamily protein; Provisional	NA|805aa|down_1|NC_014501.1_3868501_3870916_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|347aa|down_2|NC_014501.1_3870962_3872003_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|360aa|down_3|NC_014501.1_3872027_3873107_-	pfam18087, RuBisCo_chap_C, Rubisco Assembly chaperone C-terminal domain	NA|529aa|down_4|NC_014501.1_3874303_3875890_+	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|377aa|down_5|NC_014501.1_3875901_3877032_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|209aa|down_6|NC_014501.1_3877521_3878148_+	pfam12306, PixA, Inclusion body protein	NA|327aa|down_7|NC_014501.1_3878156_3879137_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|322aa|down_8|NC_014501.1_3879279_3880245_+	NA	NA|120aa|down_9|NC_014501.1_3880749_3881109_+	NA
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	15	3993552-3993652	12	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	TTTAGGTGCGTTACGCTCCGCTA	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|46aa|up_7|NC_014501.1_3983524_3983662_-,NA|132aa|up_6|NC_014501.1_3983661_3984057_-,NA|122aa|down_2|NC_014501.1_3999909_4000275_+	NA|457aa|up_9|NC_014501.1_3980764_3982135_-	pfam07589, VPEP, PEP-CTERM motif	NA|216aa|up_8|NC_014501.1_3982726_3983374_-	PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family protein	NA|46aa|up_7|NC_014501.1_3983524_3983662_-	NA	NA|132aa|up_6|NC_014501.1_3983661_3984057_-	NA	NA|156aa|up_5|NC_014501.1_3984238_3984706_+	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|645aa|up_4|NC_014501.1_3988315_3990250_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|212aa|up_3|NC_014501.1_3990404_3991040_+	cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea	NA|299aa|up_2|NC_014501.1_3991057_3991954_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|54aa|up_1|NC_014501.1_3992067_3992229_+	pfam00301, Rubredoxin, Rubredoxin	NA|317aa|up_0|NC_014501.1_3992246_3993197_-	cd08267, MDR1, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|1259aa|down_0|NC_014501.1_3994197_3997974_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|302aa|down_1|NC_014501.1_3998983_3999889_+	cd16350, VOC_like, uncharacterized subfamily of the vicinal oxygen chelate (VOC) family	NA|122aa|down_2|NC_014501.1_3999909_4000275_+	NA	NA|135aa|down_3|NC_014501.1_4000279_4000684_+	cd00303, retropepsin_like, Retropepsins; pepsin-like aspartate proteases	NA|116aa|down_4|NC_014501.1_4000680_4001028_-	pfam13032, DUF3893, Domain of unknown function (DUF3893)	NA|314aa|down_5|NC_014501.1_4001072_4002014_-	pfam13111, DUF3962, Protein of unknown function (DUF3962)	NA|1115aa|down_6|NC_014501.1_4002010_4005355_-	pfam18155, pPIWI_RE_Z, pPIWI RE three-gene island domain Z	NA|350aa|down_7|NC_014501.1_4005344_4006394_-	pfam18154, pPIWI_RE_REase, REase associating with pPIWI_RE	NA|68aa|down_8|NC_014501.1_4006485_4006689_-	COG1110, COG1110, Reverse gyrase [DNA replication, recombination, and repair]	NA|149aa|down_9|NC_014501.1_4006750_4007197_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	16	4529883-4530104	4	CRT	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	AGATGANTCTCTAGATGA	18	0	0	NA	NA	NA	5	5	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|216aa|up_9|NC_014501.1_4523463_4524111_-,NA|163aa|up_8|NC_014501.1_4524107_4524596_-,NA|51aa|up_7|NC_014501.1_4524696_4524849_-,NA|77aa|up_3|NC_014501.1_4527474_4527705_+,NA|92aa|up_2|NC_014501.1_4527757_4528033_+,NA|100aa|up_1|NC_014501.1_4528366_4528666_+,NA|114aa|up_0|NC_014501.1_4528964_4529306_+,NA|73aa|down_0|NC_014501.1_4531651_4531870_+,NA|84aa|down_1|NC_014501.1_4531893_4532145_+,NA|122aa|down_7|NC_014501.1_4537419_4537785_-	NA|216aa|up_9|NC_014501.1_4523463_4524111_-	NA	NA|163aa|up_8|NC_014501.1_4524107_4524596_-	NA	NA|51aa|up_7|NC_014501.1_4524696_4524849_-	NA	NA|233aa|up_6|NC_014501.1_4525190_4525889_+	pfam14518, Haem_oxygenas_2, Iron-containing redox enzyme	NA|191aa|up_5|NC_014501.1_4525904_4526477_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|95aa|up_4|NC_014501.1_4527176_4527461_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|77aa|up_3|NC_014501.1_4527474_4527705_+	NA	NA|92aa|up_2|NC_014501.1_4527757_4528033_+	NA	NA|100aa|up_1|NC_014501.1_4528366_4528666_+	NA	NA|114aa|up_0|NC_014501.1_4528964_4529306_+	NA	NA|73aa|down_0|NC_014501.1_4531651_4531870_+	NA	NA|84aa|down_1|NC_014501.1_4531893_4532145_+	NA	NA|426aa|down_2|NC_014501.1_4532197_4533475_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|137aa|down_3|NC_014501.1_4533620_4534031_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|412aa|down_4|NC_014501.1_4534462_4535698_-	cd00887, MoeA, MoeA family	NA|113aa|down_5|NC_014501.1_4535953_4536292_+	PRK13697, PRK13697, cytochrome c6; Provisional	NA|308aa|down_6|NC_014501.1_4536426_4537350_+	sd00006, TPR, Tetratricopeptide repeat	NA|122aa|down_7|NC_014501.1_4537419_4537785_-	NA	NA|205aa|down_8|NC_014501.1_4538172_4538787_-	pfam00805, Pentapeptide, Pentapeptide repeats (8 copies)	NA|361aa|down_9|NC_014501.1_4538770_4539853_-	pfam00805, Pentapeptide, Pentapeptide repeats (8 copies)
GCF_000147335.1_ASM14733v1	NC_014501	Gloeothece verrucosa PCC 7822, complete sequence	17	4713519-4713650	13	CRISPRCasFinder	no		RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1	Orphan	GGGGGGATCTCGATAATGTTAGATTGACCTCTCCCCCAA	39	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|145aa|up_5|NC_014501.1_4706534_4706969_+,NA|47aa|up_3|NC_014501.1_4708602_4708743_+,NA	NA|354aa|up_9|NC_014501.1_4703120_4704182_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|173aa|up_8|NC_014501.1_4704315_4704834_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|203aa|up_7|NC_014501.1_4704987_4705596_+	pfam11237, DUF3038, Protein of unknown function (DUF3038)	NA|228aa|up_6|NC_014501.1_4705700_4706384_+	pfam14233, DUF4335, Domain of unknown function (DUF4335)	NA|145aa|up_5|NC_014501.1_4706534_4706969_+	NA	NA|491aa|up_4|NC_014501.1_4707075_4708548_+	cd02440, AdoMet_MTases, S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I;  AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy)	NA|47aa|up_3|NC_014501.1_4708602_4708743_+	NA	NA|655aa|up_2|NC_014501.1_4708795_4710760_+	PRK05940, PRK05940, anthranilate synthase component I	NA|186aa|up_1|NC_014501.1_4710765_4711323_-	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|432aa|up_0|NC_014501.1_4711518_4712814_+	COG0334, GdhA, Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]	NA|167aa|down_0|NC_014501.1_4716895_4717396_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|193aa|down_1|NC_014501.1_4717618_4718197_+	pfam13646, HEAT_2, HEAT repeats	NA|193aa|down_2|NC_014501.1_4718418_4718997_+	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|165aa|down_3|NC_014501.1_4719091_4719586_+	pfam10116, Host_attach, Protein required for attachment to host cells	NA|241aa|down_4|NC_014501.1_4719865_4720588_-	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|648aa|down_5|NC_014501.1_4720934_4722878_+	cd11320, AmyAc_AmyMalt_CGTase_like, Alpha amylase catalytic domain found in maltogenic amylases, cyclodextrin glycosyltransferase, and related proteins	NA|608aa|down_6|NC_014501.1_4722975_4724799_+	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|930aa|down_7|NC_014501.1_4724907_4727697_+	PRK14511, PRK14511, malto-oligosyltrehalose synthase	NA|231aa|down_8|NC_014501.1_4727822_4728515_-	COG3546, COG3546, Mn-containing catalase [Inorganic ion transport and metabolism]	NA|323aa|down_9|NC_014501.1_4728722_4729691_-	PRK00861, PRK00861, putative lipid kinase; Reviewed
GCF_000147335.1_ASM14733v1	NC_014502	Gloeothece verrucosa PCC 7822 plasmid Cy782203, complete sequence	1	148518-148670	1	PILER-CR	no		DEDDh,cas3	Orphan	AAATAATAAAGTTGTATAATTT	22	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|144aa|up_8|NC_014502.1_135958_136390_+,NA|75aa|up_6|NC_014502.1_137003_137228_-,NA|55aa|up_4|NC_014502.1_139391_139556_-,NA|1289aa|up_1|NC_014502.1_142440_146307_-,NA|108aa|up_0|NC_014502.1_147980_148304_-,NA|622aa|down_2|NC_014502.1_151088_152954_-,NA|185aa|down_4|NC_014502.1_155484_156039_-,NA|280aa|down_5|NC_014502.1_156129_156969_-,NA|125aa|down_6|NC_014502.1_157314_157689_-	NA|76aa|up_9|NC_014502.1_134189_134417_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|144aa|up_8|NC_014502.1_135958_136390_+	NA	NA|186aa|up_7|NC_014502.1_136400_136958_-	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|75aa|up_6|NC_014502.1_137003_137228_-	NA	NA|646aa|up_5|NC_014502.1_137348_139286_+	cd00315, Cyt_C5_DNA_methylase, Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology	NA|55aa|up_4|NC_014502.1_139391_139556_-	NA	NA|266aa|up_3|NC_014502.1_140312_141110_+	cd17932, DEXQc_UvrD, DEXQD-box helicase domain of UvrD	NA|299aa|up_2|NC_014502.1_141320_142217_+	pfam13361, UvrD_C, UvrD-like helicase C-terminal domain	NA|1289aa|up_1|NC_014502.1_142440_146307_-	NA	NA|108aa|up_0|NC_014502.1_147980_148304_-	NA	NA|392aa|down_0|NC_014502.1_148882_150058_-	cd16962, RuvC, Crossover junction endodeoxyribonuclease RuvC	NA|322aa|down_1|NC_014502.1_150057_151023_-	cd01195, INT_C_like_5, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|622aa|down_2|NC_014502.1_151088_152954_-	NA	NA|280aa|down_3|NC_014502.1_153060_153900_-	cd17554, REC_TrrA-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator TrrA and similar domains	NA|185aa|down_4|NC_014502.1_155484_156039_-	NA	NA|280aa|down_5|NC_014502.1_156129_156969_-	NA	NA|125aa|down_6|NC_014502.1_157314_157689_-	NA	NA|308aa|down_7|NC_014502.1_157769_158693_-	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|118aa|down_8|NC_014502.1_158673_159027_-	pfam05713, MobC, Bacterial mobilisation protein (MobC)	NA|126aa|down_9|NC_014502.1_159341_159719_-	cd16383, GUN4, porphyrin-binding protein domain GUN4
GCF_000147335.1_ASM14733v1	NC_014503	Gloeothece verrucosa PCC 7822 plasmid Cy782204, complete sequence	1	6886-6971	1	CRISPRCasFinder	no	cas14j	cas14j	Unclear	GTTGCAGGAGCAGCTAATGATTG	23	0	0	NA	NA	NA	1	1	TypeV	RT,DEDDh,DinG,PD-DExK,c2c9_V-U4,cas14k,cas3,cas14j,Cas14c_CAS-V-F,csa3,Cas9_archaeal,Cas14b_CAS-V-F,csx3,csx1,cmr3gr5,cmr4gr7,cmr5gr11,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,WYL,2OG_CAS,csx18,cas8b3,cmr6gr7,cas10,cas5,cas7,c2c5_V-U5	NA|250aa|up_3|NC_014503.1_7_757_+,NA|831aa|up_2|NC_014503.1_746_3239_+,NA|450aa|up_1|NC_014503.1_3216_4566_+,NA|61aa|down_6|NC_014503.1_15845_16028_+,NA|126aa|down_9|NC_014503.1_21411_21789_+	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|250aa|up_3|NC_014503.1_7_757_+	NA	NA|831aa|up_2|NC_014503.1_746_3239_+	NA	NA|450aa|up_1|NC_014503.1_3216_4566_+	NA	NA|514aa|up_0|NC_014503.1_4642_6184_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|116aa|down_0|NC_014503.1_7994_8342_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1003aa|down_1|NC_014503.1_8396_11405_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|396aa|down_2|NC_014503.1_11407_12595_-	pfam13676, TIR_2, TIR domain	NA|147aa|down_3|NC_014503.1_12661_13102_-	pfam13676, TIR_2, TIR domain	NA|287aa|down_4|NC_014503.1_13382_14243_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	cas14j|406aa|down_5|NC_014503.1_14525_15743_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|61aa|down_6|NC_014503.1_15845_16028_+	NA	NA|72aa|down_7|NC_014503.1_20885_21101_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|54aa|down_8|NC_014503.1_21075_21237_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|126aa|down_9|NC_014503.1_21411_21789_+	NA
