assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	1	141540-141671	1	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	GGAAGTGATAACCCTTTTCCTAAAGGT	27	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|66aa|up_7|NZ_CP045226.1_132123_132321_-,NA|68aa|up_3|NZ_CP045226.1_136818_137022_+,NA|97aa|up_2|NZ_CP045226.1_137088_137379_+,NA|124aa|up_1|NZ_CP045226.1_137401_137773_-,NA	NA|79aa|up_9|NZ_CP045226.1_131287_131524_-	CHL00191, ycf61, DNA-directed RNA polymerase subunit omega; Provisional	NA|140aa|up_8|NZ_CP045226.1_131681_132101_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|66aa|up_7|NZ_CP045226.1_132123_132321_-	NA	NA|491aa|up_6|NZ_CP045226.1_132350_133823_-	cd11338, AmyAc_CMD, Alpha amylase catalytic domain found in cyclomaltodextrinases and related proteins	NA|184aa|up_5|NZ_CP045226.1_134146_134698_-	pfam07538, ChW, Clostridial hydrophobic W	NA|576aa|up_4|NZ_CP045226.1_135045_136773_+	cd07478, Peptidases_S8_CspA-like, Peptidase S8 family domain in CspA-like proteins	NA|68aa|up_3|NZ_CP045226.1_136818_137022_+	NA	NA|97aa|up_2|NZ_CP045226.1_137088_137379_+	NA	NA|124aa|up_1|NZ_CP045226.1_137401_137773_-	NA	NA|1149aa|up_0|NZ_CP045226.1_137920_141367_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|430aa|down_0|NZ_CP045226.1_142171_143461_+	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|190aa|down_1|NZ_CP045226.1_143520_144090_+	COG4639, COG4639, Predicted kinase [General function prediction only]	NA|348aa|down_2|NZ_CP045226.1_144212_145256_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|179aa|down_3|NZ_CP045226.1_145645_146182_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|204aa|down_4|NZ_CP045226.1_146708_147320_-	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|704aa|down_5|NZ_CP045226.1_147451_149563_+	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|291aa|down_6|NZ_CP045226.1_149907_150780_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|464aa|down_7|NZ_CP045226.1_151215_152607_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|317aa|down_8|NZ_CP045226.1_152753_153704_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|713aa|down_9|NZ_CP045226.1_154227_156366_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	2	149686-149765	2	CRISPRCasFinder	no	cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	TATCATTGTGGGCAGCGATCGCCA	24	1	1	149710-149741	NZ_CP045226.1_6329184-6329153	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|97aa|up_9|NZ_CP045226.1_137088_137379_+,NA|124aa|up_8|NZ_CP045226.1_137401_137773_-,NA|141aa|up_6|NZ_CP045226.1_141458_141881_+,NA|175aa|down_8|NZ_CP045226.1_160583_161108_-	NA|97aa|up_9|NZ_CP045226.1_137088_137379_+	NA	NA|124aa|up_8|NZ_CP045226.1_137401_137773_-	NA	NA|1149aa|up_7|NZ_CP045226.1_137920_141367_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|141aa|up_6|NZ_CP045226.1_141458_141881_+	NA	NA|430aa|up_5|NZ_CP045226.1_142171_143461_+	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|190aa|up_4|NZ_CP045226.1_143520_144090_+	COG4639, COG4639, Predicted kinase [General function prediction only]	NA|348aa|up_3|NZ_CP045226.1_144212_145256_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|179aa|up_2|NZ_CP045226.1_145645_146182_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|204aa|up_1|NZ_CP045226.1_146708_147320_-	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|704aa|up_0|NZ_CP045226.1_147451_149563_+	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|291aa|down_0|NZ_CP045226.1_149907_150780_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|464aa|down_1|NZ_CP045226.1_151215_152607_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|317aa|down_2|NZ_CP045226.1_152753_153704_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|713aa|down_3|NZ_CP045226.1_154227_156366_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|219aa|down_4|NZ_CP045226.1_156510_157167_-	pfam09378, HAS-barrel, HAS barrel domain	NA|66aa|down_5|NZ_CP045226.1_157300_157498_-	pfam11623, NdhS, NAD(P)H dehydrogenase subunit S	NA|438aa|down_6|NZ_CP045226.1_157600_158914_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|357aa|down_7|NZ_CP045226.1_159109_160180_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|175aa|down_8|NZ_CP045226.1_160583_161108_-	NA	NA|345aa|down_9|NZ_CP045226.1_161560_162595_+	PRK05330, PRK05330, oxygen-dependent coproporphyrinogen oxidase
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	3	299414-299800	3	CRISPRCasFinder	no	cas14k	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	TTTTTGTTCTTGAGCATCTACCTGTAATTCAGATAATT	38	0	0	NA	NA	NA	4	4	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|72aa|up_1|NZ_CP045226.1_297121_297337_-,NA|255aa|up_0|NZ_CP045226.1_297698_298463_+,NA|87aa|down_0|NZ_CP045226.1_301123_301384_+	NA|240aa|up_9|NZ_CP045226.1_288383_289103_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|85aa|up_8|NZ_CP045226.1_290162_290417_-	pfam10929, DUF2811, Protein of unknown function (DUF2811)	NA|424aa|up_7|NZ_CP045226.1_290894_292166_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|217aa|up_6|NZ_CP045226.1_292196_292847_-	PRK02759, PRK02759, bifunctional phosphoribosyl-AMP cyclohydrolase/phosphoribosyl-ATP diphosphatase HisIE	NA|63aa|up_5|NZ_CP045226.1_292948_293137_-	COG4572, ChaB, Putative cation transport regulator [General function prediction only]	NA|433aa|up_4|NZ_CP045226.1_293354_294653_+	PLN02482, PLN02482, glutamate-1-semialdehyde 2,1-aminomutase	NA|273aa|up_3|NZ_CP045226.1_294942_295761_-	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|177aa|up_2|NZ_CP045226.1_296397_296928_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|72aa|up_1|NZ_CP045226.1_297121_297337_-	NA	NA|255aa|up_0|NZ_CP045226.1_297698_298463_+	NA	NA|87aa|down_0|NZ_CP045226.1_301123_301384_+	NA	NA|327aa|down_1|NZ_CP045226.1_303381_304362_-	COG2515, Acd, 1-aminocyclopropane-1-carboxylate deaminase [Amino acid transport and metabolism]	NA|577aa|down_2|NZ_CP045226.1_304582_306313_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|171aa|down_3|NZ_CP045226.1_306386_306899_+	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|578aa|down_4|NZ_CP045226.1_307109_308843_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|291aa|down_5|NZ_CP045226.1_309130_310003_-	pfam14344, DUF4397, Domain of unknown function (DUF4397)	NA|336aa|down_6|NZ_CP045226.1_310738_311746_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|271aa|down_7|NZ_CP045226.1_311746_312559_+	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|95aa|down_8|NZ_CP045226.1_312691_312976_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|556aa|down_9|NZ_CP045226.1_313092_314760_+	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	4	299936-300060	4	CRISPRCasFinder	no	cas14k	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	TTTTTGTTCTTGAGCATCTACCTGTAATTCAGATAATT	38	0	0	NA	NA	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|72aa|up_1|NZ_CP045226.1_297121_297337_-,NA|255aa|up_0|NZ_CP045226.1_297698_298463_+,NA|87aa|down_0|NZ_CP045226.1_301123_301384_+	NA|240aa|up_9|NZ_CP045226.1_288383_289103_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|85aa|up_8|NZ_CP045226.1_290162_290417_-	pfam10929, DUF2811, Protein of unknown function (DUF2811)	NA|424aa|up_7|NZ_CP045226.1_290894_292166_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|217aa|up_6|NZ_CP045226.1_292196_292847_-	PRK02759, PRK02759, bifunctional phosphoribosyl-AMP cyclohydrolase/phosphoribosyl-ATP diphosphatase HisIE	NA|63aa|up_5|NZ_CP045226.1_292948_293137_-	COG4572, ChaB, Putative cation transport regulator [General function prediction only]	NA|433aa|up_4|NZ_CP045226.1_293354_294653_+	PLN02482, PLN02482, glutamate-1-semialdehyde 2,1-aminomutase	NA|273aa|up_3|NZ_CP045226.1_294942_295761_-	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|177aa|up_2|NZ_CP045226.1_296397_296928_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|72aa|up_1|NZ_CP045226.1_297121_297337_-	NA	NA|255aa|up_0|NZ_CP045226.1_297698_298463_+	NA	NA|87aa|down_0|NZ_CP045226.1_301123_301384_+	NA	NA|327aa|down_1|NZ_CP045226.1_303381_304362_-	COG2515, Acd, 1-aminocyclopropane-1-carboxylate deaminase [Amino acid transport and metabolism]	NA|577aa|down_2|NZ_CP045226.1_304582_306313_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|171aa|down_3|NZ_CP045226.1_306386_306899_+	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|578aa|down_4|NZ_CP045226.1_307109_308843_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|291aa|down_5|NZ_CP045226.1_309130_310003_-	pfam14344, DUF4397, Domain of unknown function (DUF4397)	NA|336aa|down_6|NZ_CP045226.1_310738_311746_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|271aa|down_7|NZ_CP045226.1_311746_312559_+	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|95aa|down_8|NZ_CP045226.1_312691_312976_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|556aa|down_9|NZ_CP045226.1_313092_314760_+	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	5	549044-549141	5	CRISPRCasFinder	no	cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	ATTTTGAGAAAGCAATATGCCTTTTTGCTTGCTGA	35	0	0	NA	NA	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|126aa|up_9|NZ_CP045226.1_535069_535447_-,NA|65aa|up_8|NZ_CP045226.1_535571_535766_+,NA|71aa|up_6|NZ_CP045226.1_536868_537081_-,NA|435aa|down_0|NZ_CP045226.1_549428_550733_-	NA|126aa|up_9|NZ_CP045226.1_535069_535447_-	NA	NA|65aa|up_8|NZ_CP045226.1_535571_535766_+	NA	NA|222aa|up_7|NZ_CP045226.1_536206_536872_+	cd02042, ParAB_family, partition proteins ParAB family	NA|71aa|up_6|NZ_CP045226.1_536868_537081_-	NA	NA|1909aa|up_5|NZ_CP045226.1_537368_543095_+	pfam04357, TamB, TamB, inner membrane protein subunit of TAM complex	NA|77aa|up_4|NZ_CP045226.1_543151_543382_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|310aa|up_3|NZ_CP045226.1_543687_544617_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|221aa|up_2|NZ_CP045226.1_544741_545404_+	cd10450, GIY-YIG_AtGrxS16_like, GIY-YIG domain found in CAXIP1-like proteins, iron-sulfur cluster assembly proteins, and similar proteins	NA|634aa|up_1|NZ_CP045226.1_545899_547801_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|261aa|up_0|NZ_CP045226.1_547849_548632_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|435aa|down_0|NZ_CP045226.1_549428_550733_-	NA	NA|74aa|down_1|NZ_CP045226.1_550779_551001_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|409aa|down_2|NZ_CP045226.1_551937_553164_+	COG0045, SucC, Succinyl-CoA synthetase, beta subunit [Energy production and conversion]	NA|294aa|down_3|NZ_CP045226.1_553286_554168_+	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	cas14j|405aa|down_4|NZ_CP045226.1_554206_555421_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|281aa|down_5|NZ_CP045226.1_555978_556821_-	PLN02244, PLN02244, tocopherol O-methyltransferase	NA|445aa|down_6|NZ_CP045226.1_557115_558450_+	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|232aa|down_7|NZ_CP045226.1_558686_559382_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|377aa|down_8|NZ_CP045226.1_559731_560862_-	sd00006, TPR, Tetratricopeptide repeat	NA|208aa|down_9|NZ_CP045226.1_561001_561625_-	cd03354, LbH_SAT, Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	6	824627-824844	1,6	PILER-CR,CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	AGTTCGCGTATTCATTTTAGTGTTTTGAGCAAGCAAAATG,ACTTATTCAAACCCCAGTTTGCTTA	40,25	0	0	NA	NA	NA:NA	2,1	2	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|80aa|up_9|NZ_CP045226.1_815675_815915_+,NA|323aa|up_3|NZ_CP045226.1_821770_822739_+,NA|89aa|up_2|NZ_CP045226.1_823059_823326_+,NA|75aa|up_1|NZ_CP045226.1_823334_823559_+,NA|125aa|down_7|NZ_CP045226.1_834159_834534_-,NA|112aa|down_9|NZ_CP045226.1_835667_836003_-	NA|80aa|up_9|NZ_CP045226.1_815675_815915_+	NA	NA|142aa|up_8|NZ_CP045226.1_815901_816327_+	COG2402, COG2402, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|77aa|up_7|NZ_CP045226.1_816426_816657_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|195aa|up_6|NZ_CP045226.1_817088_817673_-	COG0374, HyaB, Ni,Fe-hydrogenase I large subunit [Energy production and conversion]	NA|321aa|up_5|NZ_CP045226.1_817950_818913_-	COG1740, HyaA, Ni,Fe-hydrogenase I small subunit [Energy production and conversion]	NA|276aa|up_4|NZ_CP045226.1_819694_820522_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|323aa|up_3|NZ_CP045226.1_821770_822739_+	NA	NA|89aa|up_2|NZ_CP045226.1_823059_823326_+	NA	NA|75aa|up_1|NZ_CP045226.1_823334_823559_+	NA	NA|282aa|up_0|NZ_CP045226.1_823705_824551_+	pfam01106, NifU, NifU-like domain	NA|403aa|down_0|NZ_CP045226.1_825331_826540_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|783aa|down_1|NZ_CP045226.1_826647_828996_+	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|92aa|down_2|NZ_CP045226.1_829072_829348_+	pfam01455, HupF_HypC, HupF/HypC family	NA|393aa|down_3|NZ_CP045226.1_829591_830770_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|363aa|down_4|NZ_CP045226.1_830829_831918_+	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|114aa|down_5|NZ_CP045226.1_831937_832279_+	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|272aa|down_6|NZ_CP045226.1_832269_833085_+	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|125aa|down_7|NZ_CP045226.1_834159_834534_-	NA	NA|336aa|down_8|NZ_CP045226.1_834523_835531_-	pfam00296, Bac_luciferase, Luciferase-like monooxygenase	NA|112aa|down_9|NZ_CP045226.1_835667_836003_-	NA
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	7	1221852-1221954	7	CRISPRCasFinder	no	PD-DExK	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	CCTCCTGAGTTTTGTCGCCAAGCTATCCAAAAAGCACTA	39	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|126aa|up_0|NZ_CP045226.1_1221244_1221622_-,NA	NA|638aa|up_9|NZ_CP045226.1_1208250_1210164_+	PRK07418, PRK07418, acetolactate synthase large subunit	NA|857aa|up_8|NZ_CP045226.1_1211598_1214169_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|142aa|up_7|NZ_CP045226.1_1214177_1214603_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|760aa|up_6|NZ_CP045226.1_1214608_1216888_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|409aa|up_5|NZ_CP045226.1_1217049_1218276_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|364aa|up_4|NZ_CP045226.1_1218493_1219585_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|66aa|up_3|NZ_CP045226.1_1219630_1219828_-	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|79aa|up_2|NZ_CP045226.1_1220053_1220290_-	pfam01809, Haemolytic, Haemolytic domain	NA|241aa|up_1|NZ_CP045226.1_1220350_1221073_+	COG0170, SEC59, Dolichol kinase [Lipid metabolism]	NA|126aa|up_0|NZ_CP045226.1_1221244_1221622_-	NA	NA|308aa|down_0|NZ_CP045226.1_1222364_1223288_-	PLN02824, PLN02824, hydrolase, alpha/beta fold family protein	NA|185aa|down_1|NZ_CP045226.1_1223726_1224281_+	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|243aa|down_2|NZ_CP045226.1_1224352_1225081_+	cd04254, AAK_UMPK-PyrH-Ec, UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway)	NA|183aa|down_3|NZ_CP045226.1_1225067_1225616_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|374aa|down_4|NZ_CP045226.1_1225750_1226872_+	TIGR02032, Uncharacterized_protein_MJ1520, geranylgeranyl reductase family	PD-DExK|207aa|down_5|NZ_CP045226.1_1227496_1228117_-	COG3980, spsG, Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase [Cell envelope biogenesis, outer membrane]	NA|259aa|down_6|NZ_CP045226.1_1228236_1229013_-	pfam02517, Abi, CAAX protease self-immunity	NA|146aa|down_7|NZ_CP045226.1_1229509_1229947_-	pfam14250, AbrB-like, AbrB-like transcriptional regulator	NA|351aa|down_8|NZ_CP045226.1_1231353_1232406_+	PRK12577, PRK12577, succinate dehydrogenase/fumarate reductase iron-sulfur subunit	NA|175aa|down_9|NZ_CP045226.1_1232498_1233023_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	8	1438532-1438593	8	CRISPRCasFinder	no	cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	ACGCGATTAATCGCGTCTGTACA	23	1	14	1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570|1438555-1438570	NZ_CP045226.1_517183-517198|NZ_CP045226.1_559398-559413|NZ_CP045226.1_559437-559452|NZ_CP045226.1_559476-559491|NZ_CP045226.1_609089-609074|NZ_CP045226.1_720468-720483|NZ_CP045226.1_975889-975874|NZ_CP045226.1_1079557-1079572|NZ_CP045226.1_1079603-1079618|NZ_CP045226.1_3606622-3606637|NZ_CP045226.1_3868467-3868452|NZ_CP045226.1_4264361-4264376|NZ_CP045226.1_5596903-5596918|NZ_CP045226.1_6091858-6091873	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|133aa|up_8|NZ_CP045226.1_1429208_1429607_+,NA|99aa|down_2|NZ_CP045226.1_1441676_1441973_+,NA|139aa|down_3|NZ_CP045226.1_1442507_1442924_+,NA|214aa|down_4|NZ_CP045226.1_1443024_1443666_-,NA|82aa|down_7|NZ_CP045226.1_1450085_1450331_+,NA|84aa|down_8|NZ_CP045226.1_1450442_1450694_+,NA|78aa|down_9|NZ_CP045226.1_1450698_1450932_-	NA|390aa|up_9|NZ_CP045226.1_1427976_1429146_+	cd08283, FDH_like_1, Glutathione-dependent formaldehyde dehydrogenase related proteins, child 1	NA|133aa|up_8|NZ_CP045226.1_1429208_1429607_+	NA	NA|390aa|up_7|NZ_CP045226.1_1429867_1431037_+	cd08283, FDH_like_1, Glutathione-dependent formaldehyde dehydrogenase related proteins, child 1	NA|114aa|up_6|NZ_CP045226.1_1431179_1431521_+	cd02238, cupin_KdgF, pectin degradation protein KdgF and related proteins, cupin domain	cas14j|302aa|up_5|NZ_CP045226.1_1431934_1432840_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|135aa|up_4|NZ_CP045226.1_1433007_1433412_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|305aa|up_3|NZ_CP045226.1_1433545_1434460_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|398aa|up_2|NZ_CP045226.1_1434629_1435823_+	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|412aa|up_1|NZ_CP045226.1_1436375_1437611_-	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|274aa|up_0|NZ_CP045226.1_1437701_1438523_+	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|375aa|down_0|NZ_CP045226.1_1438756_1439881_+	PLN02598, PLN02598, omega-6 fatty acid desaturase	NA|360aa|down_1|NZ_CP045226.1_1440099_1441179_+	PLN02498, PLN02498, omega-3 fatty acid desaturase	NA|99aa|down_2|NZ_CP045226.1_1441676_1441973_+	NA	NA|139aa|down_3|NZ_CP045226.1_1442507_1442924_+	NA	NA|214aa|down_4|NZ_CP045226.1_1443024_1443666_-	NA	NA|837aa|down_5|NZ_CP045226.1_1443949_1446460_-	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|393aa|down_6|NZ_CP045226.1_1448628_1449807_+	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|82aa|down_7|NZ_CP045226.1_1450085_1450331_+	NA	NA|84aa|down_8|NZ_CP045226.1_1450442_1450694_+	NA	NA|78aa|down_9|NZ_CP045226.1_1450698_1450932_-	NA
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	9	1574938-1575210	2	PILER-CR	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	GGCATCGACTCTATCTCTGGTGGTGAAGGTAGCGATCGCATCTTTGGCCGCAATGAT	57	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|73aa|up_6|NZ_CP045226.1_1565776_1565995_+,NA|140aa|up_5|NZ_CP045226.1_1565987_1566407_+,NA|245aa|up_0|NZ_CP045226.1_1572308_1573043_-,NA|62aa|down_0|NZ_CP045226.1_1575690_1575876_-,NA|115aa|down_4|NZ_CP045226.1_1579418_1579763_-,NA|82aa|down_7|NZ_CP045226.1_1584615_1584861_-	NA|77aa|up_9|NZ_CP045226.1_1564491_1564722_+	pfam04851, ResIII, Type III restriction enzyme, res subunit	NA|122aa|up_8|NZ_CP045226.1_1564841_1565207_+	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|75aa|up_7|NZ_CP045226.1_1565299_1565524_+	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|73aa|up_6|NZ_CP045226.1_1565776_1565995_+	NA	NA|140aa|up_5|NZ_CP045226.1_1565987_1566407_+	NA	NA|246aa|up_4|NZ_CP045226.1_1566460_1567198_-	pfam07444, Ycf66_N, Ycf66 protein N-terminus	NA|477aa|up_3|NZ_CP045226.1_1567415_1568846_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|712aa|up_2|NZ_CP045226.1_1569246_1571382_-	cd13566, PBP2_phosphate, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|302aa|up_1|NZ_CP045226.1_1571403_1572309_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|245aa|up_0|NZ_CP045226.1_1572308_1573043_-	NA	NA|62aa|down_0|NZ_CP045226.1_1575690_1575876_-	NA	NA|238aa|down_1|NZ_CP045226.1_1576237_1576951_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|294aa|down_2|NZ_CP045226.1_1576957_1577839_+	cd13653, PBP2_phosphate_like_1, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|380aa|down_3|NZ_CP045226.1_1578097_1579237_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|115aa|down_4|NZ_CP045226.1_1579418_1579763_-	NA	NA|364aa|down_5|NZ_CP045226.1_1580011_1581103_-	TIGR00378, cax, calcium/proton exchanger (cax)	NA|927aa|down_6|NZ_CP045226.1_1581690_1584471_+	cd10797, GH57N_APU_like_1, N-terminal putative catalytic domain of mainly uncharacterized prokaryotic proteins similar to archaeal thermoactive amylopullulanases; glycoside hydrolase family 57 (GH57)	NA|82aa|down_7|NZ_CP045226.1_1584615_1584861_-	NA	NA|82aa|down_8|NZ_CP045226.1_1584897_1585143_-	pfam14279, HNH_5, HNH endonuclease	NA|372aa|down_9|NZ_CP045226.1_1585150_1586266_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	10	1730103-1730177	9	CRISPRCasFinder	no	cas14j,Cas14c_CAS-V-F	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	CTGAATTCTGACTCCTGAATTCT	23	0	0	NA	NA	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|71aa|up_8|NZ_CP045226.1_1722267_1722480_+,NA|66aa|up_6|NZ_CP045226.1_1723777_1723975_+,NA	NA|158aa|up_9|NZ_CP045226.1_1721457_1721931_-	COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms]	NA|71aa|up_8|NZ_CP045226.1_1722267_1722480_+	NA	cas14j|401aa|up_7|NZ_CP045226.1_1722533_1723736_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|66aa|up_6|NZ_CP045226.1_1723777_1723975_+	NA	NA|211aa|up_5|NZ_CP045226.1_1724215_1724848_-	COG4430, COG4430, Uncharacterized protein conserved in bacteria [Function unknown]	NA|147aa|up_4|NZ_CP045226.1_1724893_1725334_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|226aa|up_3|NZ_CP045226.1_1725333_1726011_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|325aa|up_2|NZ_CP045226.1_1726406_1727381_-	PRK00089, era, GTPase Era; Reviewed	NA|361aa|up_1|NZ_CP045226.1_1727660_1728743_+	PRK14071, PRK14071, ATP-dependent 6-phosphofructokinase	NA|237aa|up_0|NZ_CP045226.1_1729220_1729931_-	COG4929, COG4929, Uncharacterized membrane-anchored protein [Function unknown]	NA|472aa|down_0|NZ_CP045226.1_1730190_1731606_-	COG4872, COG4872, Predicted membrane protein [Function unknown]	NA|329aa|down_1|NZ_CP045226.1_1731780_1732767_+	COG0144, Sun, tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis]	NA|150aa|down_2|NZ_CP045226.1_1733469_1733919_+	cd04666, Nudix_Hydrolase_9, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|316aa|down_3|NZ_CP045226.1_1734667_1735615_-	TIGR01249, Putative_proline_iminopeptidase, proline iminopeptidase, Neisseria-type subfamily	NA|598aa|down_4|NZ_CP045226.1_1736134_1737928_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|214aa|down_5|NZ_CP045226.1_1738722_1739364_-	PRK14003, PRK14003, K(+)-transporting ATPase subunit C	NA|90aa|down_6|NZ_CP045226.1_1739363_1739633_-	pfam09604, Potass_KdpF, F subunit of K+-transporting ATPase (Potass_KdpF)	NA|714aa|down_7|NZ_CP045226.1_1739639_1741781_-	PRK01122, PRK01122, potassium-transporting ATPase subunit KdpB	NA|582aa|down_8|NZ_CP045226.1_1742135_1743881_-	pfam03814, KdpA, Potassium-transporting ATPase A subunit	Cas14c_CAS-V-F|407aa|down_9|NZ_CP045226.1_1744971_1746192_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	11	2108867-2108947	10	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	GGCGATGTCTACGACGGGCTATTC	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA,NA|69aa|down_0|NZ_CP045226.1_2109008_2109215_+	NA|366aa|up_9|NZ_CP045226.1_2100199_2101297_+	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|367aa|up_8|NZ_CP045226.1_2101429_2102530_+	PRK13396, PRK13396, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|177aa|up_7|NZ_CP045226.1_2102820_2103351_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|67aa|up_6|NZ_CP045226.1_2103958_2104159_-	COG2608, CopZ, Copper chaperone [Inorganic ion transport and metabolism]	NA|271aa|up_5|NZ_CP045226.1_2104981_2105794_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|326aa|up_4|NZ_CP045226.1_2106006_2106984_+	PRK07048, PRK07048, threo-3-hydroxy-L-aspartate ammonia-lyase	NA|202aa|up_3|NZ_CP045226.1_2107049_2107655_-	pfam05685, Uma2, Putative restriction endonuclease	NA|73aa|up_2|NZ_CP045226.1_2107789_2108008_-	pfam12441, CopG_antitoxin, CopG antitoxin of type II toxin-antitoxin system	NA|102aa|up_1|NZ_CP045226.1_2108108_2108414_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|120aa|up_0|NZ_CP045226.1_2108410_2108770_-	COG4683, COG4683, Uncharacterized protein conserved in bacteria [Function unknown]	NA|69aa|down_0|NZ_CP045226.1_2109008_2109215_+	NA	NA|439aa|down_1|NZ_CP045226.1_2109487_2110804_+	pfam03222, Trp_Tyr_perm, Tryptophan/tyrosine permease family	NA|274aa|down_2|NZ_CP045226.1_2111634_2112456_-	cd02146, NfsA-like, nitroreductase similar to Escherichia coli NfsA	NA|373aa|down_3|NZ_CP045226.1_2112732_2113851_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|253aa|down_4|NZ_CP045226.1_2114346_2115105_-	PRK06208, PRK06208, class II aldolase/adducin family protein	NA|224aa|down_5|NZ_CP045226.1_2115091_2115763_-	cd10548, cupin_CDO, cysteine dioxygenase, cupin domain	NA|70aa|down_6|NZ_CP045226.1_2115832_2116042_-	pfam14239, RRXRR, RRXRR protein	NA|587aa|down_7|NZ_CP045226.1_2117255_2119016_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|280aa|down_8|NZ_CP045226.1_2119550_2120390_-	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|244aa|down_9|NZ_CP045226.1_2120482_2121214_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	12	2138627-2140944	3,11,1	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	ATTGCAATTACCTAAAATCCCTATTAGGG----------ATTGAAAC,ATTGCAATTACCTAAAATCCCTATTAGGGATTGAAAC,ATTGCAATTACCTAAAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	32,32,32	32	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|103aa|up_4|NZ_CP045226.1_2132287_2132596_-,NA|108aa|up_0|NZ_CP045226.1_2137999_2138323_+,NA|88aa|down_0|NZ_CP045226.1_2141248_2141512_-,NA|172aa|down_5|NZ_CP045226.1_2147025_2147541_-,NA|345aa|down_8|NZ_CP045226.1_2150061_2151096_-,NA|82aa|down_9|NZ_CP045226.1_2151106_2151352_-	NA|314aa|up_9|NZ_CP045226.1_2123348_2124290_-	pfam13737, DDE_Tnp_1_5, Transposase DDE domain	NA|382aa|up_8|NZ_CP045226.1_2124938_2126084_+	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|352aa|up_7|NZ_CP045226.1_2126166_2127222_-	PRK13654, PRK13654, magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional	NA|355aa|up_6|NZ_CP045226.1_2127735_2128800_+	PRK05330, PRK05330, oxygen-dependent coproporphyrinogen oxidase	NA|741aa|up_5|NZ_CP045226.1_2128995_2131218_-	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site	NA|103aa|up_4|NZ_CP045226.1_2132287_2132596_-	NA	NA|427aa|up_3|NZ_CP045226.1_2133395_2134676_+	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|530aa|up_2|NZ_CP045226.1_2135335_2136925_+	COG3540, PhoD, Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]	NA|60aa|up_1|NZ_CP045226.1_2137649_2137829_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|108aa|up_0|NZ_CP045226.1_2137999_2138323_+	NA	NA|88aa|down_0|NZ_CP045226.1_2141248_2141512_-	NA	NA|213aa|down_1|NZ_CP045226.1_2141813_2142452_+	cd03189, GST_C_GTT1_like, C-terminal, alpha helical domain of GTT1-like Glutathione S-transferases	NA|344aa|down_2|NZ_CP045226.1_2142860_2143892_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|200aa|down_3|NZ_CP045226.1_2144486_2145086_-	cd02566, PseudoU_synth_RluE, Pseudouridine synthase, Escherichia coli RluE	NA|327aa|down_4|NZ_CP045226.1_2145625_2146606_-	cd19093, AKR_AtPLR-like, Arabidopsis thaliana pyridoxal reductase (PLR) and similar proteins	NA|172aa|down_5|NZ_CP045226.1_2147025_2147541_-	NA	NA|100aa|down_6|NZ_CP045226.1_2147719_2148019_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|504aa|down_7|NZ_CP045226.1_2148417_2149929_+	PRK09224, PRK09224, threonine ammonia-lyase IlvA	NA|345aa|down_8|NZ_CP045226.1_2150061_2151096_-	NA	NA|82aa|down_9|NZ_CP045226.1_2151106_2151352_-	NA
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	13	2153958-2154053	12	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	TGGTAGTCTTTAGGCGATCGCTA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|172aa|up_5|NZ_CP045226.1_2147025_2147541_-,NA|345aa|up_2|NZ_CP045226.1_2150061_2151096_-,NA|82aa|up_1|NZ_CP045226.1_2151106_2151352_-,NA|64aa|down_2|NZ_CP045226.1_2157438_2157630_+,NA|804aa|down_3|NZ_CP045226.1_2157957_2160369_-,NA|856aa|down_8|NZ_CP045226.1_2168134_2170702_+	NA|213aa|up_9|NZ_CP045226.1_2141813_2142452_+	cd03189, GST_C_GTT1_like, C-terminal, alpha helical domain of GTT1-like Glutathione S-transferases	NA|344aa|up_8|NZ_CP045226.1_2142860_2143892_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|200aa|up_7|NZ_CP045226.1_2144486_2145086_-	cd02566, PseudoU_synth_RluE, Pseudouridine synthase, Escherichia coli RluE	NA|327aa|up_6|NZ_CP045226.1_2145625_2146606_-	cd19093, AKR_AtPLR-like, Arabidopsis thaliana pyridoxal reductase (PLR) and similar proteins	NA|172aa|up_5|NZ_CP045226.1_2147025_2147541_-	NA	NA|100aa|up_4|NZ_CP045226.1_2147719_2148019_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|504aa|up_3|NZ_CP045226.1_2148417_2149929_+	PRK09224, PRK09224, threonine ammonia-lyase IlvA	NA|345aa|up_2|NZ_CP045226.1_2150061_2151096_-	NA	NA|82aa|up_1|NZ_CP045226.1_2151106_2151352_-	NA	NA|413aa|up_0|NZ_CP045226.1_2152295_2153534_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|197aa|down_0|NZ_CP045226.1_2154174_2154765_-	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|364aa|down_1|NZ_CP045226.1_2155023_2156115_-	PRK07409, PRK07409, threonine synthase; Validated	NA|64aa|down_2|NZ_CP045226.1_2157438_2157630_+	NA	NA|804aa|down_3|NZ_CP045226.1_2157957_2160369_-	NA	NA|550aa|down_4|NZ_CP045226.1_2160686_2162336_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|699aa|down_5|NZ_CP045226.1_2162950_2165047_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|92aa|down_6|NZ_CP045226.1_2165591_2165867_-	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|437aa|down_7|NZ_CP045226.1_2165978_2167289_-	PRK07591, PRK07591, threonine synthase; Validated	NA|856aa|down_8|NZ_CP045226.1_2168134_2170702_+	NA	NA|188aa|down_9|NZ_CP045226.1_2170788_2171352_+	COG4719, COG4719, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	14	2286027-2286217	13	CRISPRCasFinder	no	Cas14u_CAS-V	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	CTTGCTGAATCTACCGAAATGAGAAAGCAAACTGGGGTTTGAGAAAG	47	0	0	NA	NA	NA	1	1	Unclear	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|125aa|up_3|NZ_CP045226.1_2282168_2282543_-,NA|204aa|down_3|NZ_CP045226.1_2291110_2291722_-	NA|240aa|up_9|NZ_CP045226.1_2276223_2276943_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|558aa|up_8|NZ_CP045226.1_2277092_2278766_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|142aa|up_7|NZ_CP045226.1_2278769_2279195_+	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|139aa|up_6|NZ_CP045226.1_2279278_2279695_-	pfam02136, NTF2, Nuclear transport factor 2 (NTF2) domain	NA|325aa|up_5|NZ_CP045226.1_2280438_2281413_+	COG1600, COG1600, Uncharacterized Fe-S protein [Energy production and conversion]	NA|213aa|up_4|NZ_CP045226.1_2281387_2282026_+	cd04303, HAD_PGPase, phosphoglycolate phosphatase, similar to Synechococcus elongates phosphoglycolate phosphatase PGP/CbbZ	NA|125aa|up_3|NZ_CP045226.1_2282168_2282543_-	NA	NA|177aa|up_2|NZ_CP045226.1_2282764_2283295_+	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|255aa|up_1|NZ_CP045226.1_2283733_2284498_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|201aa|up_0|NZ_CP045226.1_2284939_2285542_+	pfam05685, Uma2, Putative restriction endonuclease	NA|490aa|down_0|NZ_CP045226.1_2286928_2288398_+	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|406aa|down_1|NZ_CP045226.1_2288446_2289664_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|182aa|down_2|NZ_CP045226.1_2289592_2290138_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|204aa|down_3|NZ_CP045226.1_2291110_2291722_-	NA	NA|342aa|down_4|NZ_CP045226.1_2291767_2292793_-	PRK00292, glk, glucokinase; Provisional	NA|216aa|down_5|NZ_CP045226.1_2292877_2293525_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|188aa|down_6|NZ_CP045226.1_2293823_2294387_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|131aa|down_7|NZ_CP045226.1_2294421_2294814_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|80aa|down_8|NZ_CP045226.1_2294810_2295050_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|244aa|down_9|NZ_CP045226.1_2295085_2295817_-	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	15	2296815-2296911	14	CRISPRCasFinder	no	cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	AAGTTTGTCTGCTTAAATTTTACAGGTGCGATCGC	35	0	0	NA	NA	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|204aa|up_6|NZ_CP045226.1_2291110_2291722_-,NA|339aa|down_7|NZ_CP045226.1_2303707_2304724_+,NA|109aa|down_9|NZ_CP045226.1_2306537_2306864_-	NA|490aa|up_9|NZ_CP045226.1_2286928_2288398_+	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|406aa|up_8|NZ_CP045226.1_2288446_2289664_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|182aa|up_7|NZ_CP045226.1_2289592_2290138_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|204aa|up_6|NZ_CP045226.1_2291110_2291722_-	NA	NA|342aa|up_5|NZ_CP045226.1_2291767_2292793_-	PRK00292, glk, glucokinase; Provisional	NA|216aa|up_4|NZ_CP045226.1_2292877_2293525_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|188aa|up_3|NZ_CP045226.1_2293823_2294387_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|131aa|up_2|NZ_CP045226.1_2294421_2294814_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|80aa|up_1|NZ_CP045226.1_2294810_2295050_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|244aa|up_0|NZ_CP045226.1_2295085_2295817_-	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|98aa|down_0|NZ_CP045226.1_2296974_2297268_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|79aa|down_1|NZ_CP045226.1_2297260_2297497_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|264aa|down_2|NZ_CP045226.1_2297512_2298304_-	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|319aa|down_3|NZ_CP045226.1_2298358_2299315_-	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|201aa|down_4|NZ_CP045226.1_2299489_2300092_-	pfam05685, Uma2, Putative restriction endonuclease	NA|317aa|down_5|NZ_CP045226.1_2300431_2301382_-	COG0559, LivH, Branched-chain amino acid ABC-type transport system, permease components [Amino acid transport and metabolism]	NA|416aa|down_6|NZ_CP045226.1_2301390_2302638_-	cd06348, PBP1_ABC_HAAT-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids or peptides	NA|339aa|down_7|NZ_CP045226.1_2303707_2304724_+	NA	cas14j|409aa|down_8|NZ_CP045226.1_2305155_2306382_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|109aa|down_9|NZ_CP045226.1_2306537_2306864_-	NA
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	16	2320550-2320705	15	CRISPRCasFinder	no	cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	CATTAATTTTGCTGGGTTGTAAAGACGCGAAATTTCGCGTCTTTACAGG	49	1	5	2320599-2320656|2320599-2320656|2320599-2320656|2320599-2320656|2320599-2320656	NZ_CP045226.1_1072281-1072338|NZ_CP045226.1_1892239-1892296|NZ_CP045226.1_1892132-1892189|NZ_CP045226.1_2589064-2589007|NZ_CP045226.1_6230863-6230920	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|80aa|up_9|NZ_CP045226.1_2308941_2309181_+,NA|184aa|up_5|NZ_CP045226.1_2315245_2315797_-,NA|95aa|up_3|NZ_CP045226.1_2316640_2316925_-,NA|124aa|down_3|NZ_CP045226.1_2325642_2326014_+	NA|80aa|up_9|NZ_CP045226.1_2308941_2309181_+	NA	NA|352aa|up_8|NZ_CP045226.1_2309466_2310522_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|85aa|up_7|NZ_CP045226.1_2310995_2311250_+	pfam01381, HTH_3, Helix-turn-helix	NA|1273aa|up_6|NZ_CP045226.1_2311251_2315070_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|184aa|up_5|NZ_CP045226.1_2315245_2315797_-	NA	NA|132aa|up_4|NZ_CP045226.1_2316248_2316644_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|95aa|up_3|NZ_CP045226.1_2316640_2316925_-	NA	NA|411aa|up_2|NZ_CP045226.1_2317010_2318243_-	PRK05942, PRK05942, aspartate aminotransferase; Provisional	NA|398aa|up_1|NZ_CP045226.1_2318239_2319433_-	cd08550, GlyDH-like, Glycerol_dehydrogenase-like	NA|174aa|up_0|NZ_CP045226.1_2319935_2320457_-	pfam10726, DUF2518, Protein of function (DUF2518)	NA|457aa|down_0|NZ_CP045226.1_2321597_2322968_-	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane]	NA|281aa|down_1|NZ_CP045226.1_2322949_2323792_-	COG4735, COG4735, Uncharacterized protein conserved in bacteria [Function unknown]	NA|355aa|down_2|NZ_CP045226.1_2323936_2325001_+	COG3380, COG3380, Predicted NAD/FAD-dependent oxidoreductase [General function prediction only]	NA|124aa|down_3|NZ_CP045226.1_2325642_2326014_+	NA	NA|313aa|down_4|NZ_CP045226.1_2326196_2327135_+	pfam04402, SIMPL, Protein of unknown function (DUF541)	NA|121aa|down_5|NZ_CP045226.1_2327434_2327797_+	pfam04970, LRAT, Lecithin retinol acyltransferase	NA|307aa|down_6|NZ_CP045226.1_2328174_2329095_-	COG4360, APA2, ATP adenylyltransferase (5',5'''-P-1,P-4-tetraphosphate phosphorylase II) [Nucleotide transport and metabolism]	NA|421aa|down_7|NZ_CP045226.1_2329279_2330542_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|353aa|down_8|NZ_CP045226.1_2330911_2331970_-	PRK13396, PRK13396, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|157aa|down_9|NZ_CP045226.1_2332230_2332701_-	pfam11947, DUF3464, Protein of unknown function (DUF3464)
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	17	2391222-2391315	16	CRISPRCasFinder	no	PD-DExK,c2c9_V-U4,cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	ACTTATTCAAACCCCAGTTTGCTTTCTGATTT	32	1	2	2391254-2391283|2391254-2391283	NZ_CP045226.1_5483319-5483290|NZ_CP045226.1_5483402-5483373	NA	1	1	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|81aa|up_4|NZ_CP045226.1_2384718_2384961_-,NA|77aa|up_0|NZ_CP045226.1_2386796_2387027_-,NA|75aa|down_3|NZ_CP045226.1_2397643_2397868_-,NA|69aa|down_5|NZ_CP045226.1_2399027_2399234_+	NA|302aa|up_9|NZ_CP045226.1_2377258_2378164_+	PRK00114, hslO, Hsp33 family molecular chaperone HslO	NA|688aa|up_8|NZ_CP045226.1_2378324_2380388_+	cd01000, PBP2_Cys_DEBP_like, Substrate-binding domain of cysteine- and aspartate/glutamate-binding proteins; the type 2 periplasmic-binding protein fold	NA|289aa|up_7|NZ_CP045226.1_2381043_2381910_-	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|396aa|up_6|NZ_CP045226.1_2382260_2383448_-	COG1565, COG1565, Uncharacterized conserved protein [Function unknown]	c2c9_V-U4|362aa|up_5|NZ_CP045226.1_2383627_2384713_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|81aa|up_4|NZ_CP045226.1_2384718_2384961_-	NA	NA|130aa|up_3|NZ_CP045226.1_2385075_2385465_-	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|179aa|up_2|NZ_CP045226.1_2385660_2386197_-	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|149aa|up_1|NZ_CP045226.1_2386347_2386794_-	pfam01844, HNH, HNH endonuclease	NA|77aa|up_0|NZ_CP045226.1_2386796_2387027_-	NA	NA|1152aa|down_0|NZ_CP045226.1_2391467_2394923_-	cd09178, PLDc_N_Snf2_like, N-terminal putative catalytic domain of uncharacterized HKD family nucleases fused to putative helicases from the Snf2-like family	NA|395aa|down_1|NZ_CP045226.1_2395150_2396335_-	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|299aa|down_2|NZ_CP045226.1_2396743_2397640_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|75aa|down_3|NZ_CP045226.1_2397643_2397868_-	NA	NA|221aa|down_4|NZ_CP045226.1_2398214_2398877_+	COG0811, TolQ, Biopolymer transport proteins [Intracellular trafficking and secretion]	NA|69aa|down_5|NZ_CP045226.1_2399027_2399234_+	NA	NA|189aa|down_6|NZ_CP045226.1_2399612_2400179_+	pfam10229, MMADHC, Methylmalonic aciduria and homocystinuria type D protein	NA|320aa|down_7|NZ_CP045226.1_2400350_2401310_+	pfam11927, DUF3445, Protein of unknown function (DUF3445)	NA|339aa|down_8|NZ_CP045226.1_2401721_2402738_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|407aa|down_9|NZ_CP045226.1_2403181_2404402_-	PLN00093, PLN00093, geranylgeranyl diphosphate reductase; Provisional
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	18	2448946-2449021	17	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	CAGAGAAGGGATGCCCACTTGCCC	24	1	6	2448970-2448997|2448970-2448997|2448970-2448997|2448970-2448997|2448970-2448997|2448970-2448997	NZ_CP045226.1_1206737-1206764|NZ_CP045226.1_1206789-1206816|NZ_CP045226.1_1879667-1879694|NZ_CP045226.1_6281819-6281846|NZ_CP045226.1_529813-529786|NZ_CP045226.1_3515611-3515584	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|79aa|up_9|NZ_CP045226.1_2437029_2437266_-,NA|184aa|up_8|NZ_CP045226.1_2437302_2437854_-,NA|184aa|up_7|NZ_CP045226.1_2437850_2438402_-,NA|105aa|up_5|NZ_CP045226.1_2439995_2440310_-,NA|294aa|up_0|NZ_CP045226.1_2447958_2448840_+,NA|409aa|down_2|NZ_CP045226.1_2451342_2452569_+,NA|71aa|down_5|NZ_CP045226.1_2456111_2456324_+,NA|80aa|down_6|NZ_CP045226.1_2456531_2456771_+,NA|79aa|down_7|NZ_CP045226.1_2457107_2457344_-,NA|145aa|down_8|NZ_CP045226.1_2457659_2458094_-	NA|79aa|up_9|NZ_CP045226.1_2437029_2437266_-	NA	NA|184aa|up_8|NZ_CP045226.1_2437302_2437854_-	NA	NA|184aa|up_7|NZ_CP045226.1_2437850_2438402_-	NA	NA|444aa|up_6|NZ_CP045226.1_2438517_2439849_-	cd10434, GIY-YIG_UvrC_Cho, Catalytic GIY-YIG domain of nucleotide excision repair endonucleases UvrC, Cho, and similar proteins	NA|105aa|up_5|NZ_CP045226.1_2439995_2440310_-	NA	NA|738aa|up_4|NZ_CP045226.1_2440574_2442788_-	COG3957, COG3957, Phosphoketolase [Carbohydrate transport and metabolism]	NA|461aa|up_3|NZ_CP045226.1_2443133_2444516_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|252aa|up_2|NZ_CP045226.1_2444966_2445722_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|565aa|up_1|NZ_CP045226.1_2445826_2447521_+	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|294aa|up_0|NZ_CP045226.1_2447958_2448840_+	NA	NA|303aa|down_0|NZ_CP045226.1_2449034_2449943_+	COG1099, COG1099, Predicted metal-dependent hydrolases with the TIM-barrel fold [General function prediction only]	NA|294aa|down_1|NZ_CP045226.1_2450111_2450993_+	cd13964, PT_UbiA_1, UbiA family of prenyltransferases (PTases), Unknown subgroup	NA|409aa|down_2|NZ_CP045226.1_2451342_2452569_+	NA	NA|462aa|down_3|NZ_CP045226.1_2452583_2453969_+	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase	NA|498aa|down_4|NZ_CP045226.1_2454315_2455809_+	cd08156, catalase_clade_3, Clade 3 of the heme-binding enzyme catalase	NA|71aa|down_5|NZ_CP045226.1_2456111_2456324_+	NA	NA|80aa|down_6|NZ_CP045226.1_2456531_2456771_+	NA	NA|79aa|down_7|NZ_CP045226.1_2457107_2457344_-	NA	NA|145aa|down_8|NZ_CP045226.1_2457659_2458094_-	NA	NA|152aa|down_9|NZ_CP045226.1_2458454_2458910_+	TIGR03042, hypothetical_protein, photosystem II protein PsbQ
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	19	2487557-2487693	18	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	ACTCCTCACTCCTAACTTTATTA	23	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|78aa|up_9|NZ_CP045226.1_2476605_2476839_-,NA|249aa|up_2|NZ_CP045226.1_2484010_2484757_-,NA|230aa|up_1|NZ_CP045226.1_2485458_2486148_+,NA	NA|78aa|up_9|NZ_CP045226.1_2476605_2476839_-	NA	NA|305aa|up_8|NZ_CP045226.1_2476887_2477802_+	COG1619, LdcA, Uncharacterized proteins, homologs of microcin C7 resistance protein MccF [Defense mechanisms]	NA|237aa|up_7|NZ_CP045226.1_2477848_2478559_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|356aa|up_6|NZ_CP045226.1_2478612_2479680_-	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|323aa|up_5|NZ_CP045226.1_2479849_2480818_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|488aa|up_4|NZ_CP045226.1_2480890_2482354_+	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|440aa|up_3|NZ_CP045226.1_2482603_2483923_+	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|249aa|up_2|NZ_CP045226.1_2484010_2484757_-	NA	NA|230aa|up_1|NZ_CP045226.1_2485458_2486148_+	NA	NA|354aa|up_0|NZ_CP045226.1_2486416_2487478_+	PLN02433, PLN02433, uroporphyrinogen decarboxylase	NA|319aa|down_0|NZ_CP045226.1_2487709_2488666_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|533aa|down_1|NZ_CP045226.1_2488991_2490590_+	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|423aa|down_2|NZ_CP045226.1_2490798_2492067_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|223aa|down_3|NZ_CP045226.1_2492684_2493353_-	COG4340, COG4340, Uncharacterized protein conserved in bacteria [Function unknown]	NA|242aa|down_4|NZ_CP045226.1_2493595_2494321_+	PRK00347, PRK00347, DNA/RNA nuclease SfsA	NA|134aa|down_5|NZ_CP045226.1_2494631_2495033_-	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|137aa|down_6|NZ_CP045226.1_2495661_2496072_+	pfam13239, 2TM, 2TM domain	NA|108aa|down_7|NZ_CP045226.1_2497039_2497363_+	COG3937, COG3937, Uncharacterized conserved protein [Function unknown]	NA|164aa|down_8|NZ_CP045226.1_2497471_2497963_+	COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]	NA|459aa|down_9|NZ_CP045226.1_2498169_2499546_-	TIGR00665, DnaB, replicative DNA helicase
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	20	2796030-2796134	19	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	AGATATAAGGCTGTAGCAGCAATAATGCGAT	31	1	1	2796061-2796103	NZ_CP045226.1_2795875-2795917	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|67aa|up_5|NZ_CP045226.1_2792675_2792876_+,NA|113aa|up_4|NZ_CP045226.1_2793304_2793643_-,NA|125aa|up_1|NZ_CP045226.1_2795209_2795584_+,NA|80aa|up_0|NZ_CP045226.1_2795562_2795802_+,NA|78aa|down_0|NZ_CP045226.1_2796440_2796674_-,NA|66aa|down_1|NZ_CP045226.1_2796797_2796995_-,NA|189aa|down_7|NZ_CP045226.1_2802822_2803389_+	NA|219aa|up_9|NZ_CP045226.1_2785343_2786000_-	PRK13925, rnhB, ribonuclease HII; Provisional	NA|719aa|up_8|NZ_CP045226.1_2786008_2788165_-	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|906aa|up_7|NZ_CP045226.1_2789020_2791738_-	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|110aa|up_6|NZ_CP045226.1_2791886_2792216_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|67aa|up_5|NZ_CP045226.1_2792675_2792876_+	NA	NA|113aa|up_4|NZ_CP045226.1_2793304_2793643_-	NA	NA|94aa|up_3|NZ_CP045226.1_2793949_2794231_+	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|278aa|up_2|NZ_CP045226.1_2794341_2795175_+	pfam02517, Abi, CAAX protease self-immunity	NA|125aa|up_1|NZ_CP045226.1_2795209_2795584_+	NA	NA|80aa|up_0|NZ_CP045226.1_2795562_2795802_+	NA	NA|78aa|down_0|NZ_CP045226.1_2796440_2796674_-	NA	NA|66aa|down_1|NZ_CP045226.1_2796797_2796995_-	NA	NA|213aa|down_2|NZ_CP045226.1_2797176_2797815_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|462aa|down_3|NZ_CP045226.1_2797811_2799197_-	COG4250, COG4250, Predicted sensor protein/domain [Signal transduction mechanisms]	NA|531aa|down_4|NZ_CP045226.1_2799419_2801012_-	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|127aa|down_5|NZ_CP045226.1_2801585_2801966_+	pfam08865, DUF1830, Domain of unknown function (DUF1830)	NA|154aa|down_6|NZ_CP045226.1_2802141_2802603_+	pfam13301, DUF4079, Protein of unknown function (DUF4079)	NA|189aa|down_7|NZ_CP045226.1_2802822_2803389_+	NA	NA|258aa|down_8|NZ_CP045226.1_2803457_2804231_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|256aa|down_9|NZ_CP045226.1_2804385_2805153_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	21	2835309-2835523	20	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	CCAGCTAGCGAAGACCCTTACGGCGACCCCGCAGA	35	0	0	NA	NA	NA	3	3	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|86aa|up_9|NZ_CP045226.1_2824744_2825002_+,NA|106aa|up_6|NZ_CP045226.1_2828621_2828939_+,NA|657aa|up_2|NZ_CP045226.1_2831606_2833577_-,NA|225aa|up_1|NZ_CP045226.1_2833625_2834300_-,NA|160aa|up_0|NZ_CP045226.1_2834638_2835118_+,NA	NA|86aa|up_9|NZ_CP045226.1_2824744_2825002_+	NA	NA|529aa|up_8|NZ_CP045226.1_2825113_2826700_+	PRK14096, pgi, glucose-6-phosphate isomerase; Provisional	NA|221aa|up_7|NZ_CP045226.1_2827477_2828140_-	pfam07862, Nif11, Nif11 domain	NA|106aa|up_6|NZ_CP045226.1_2828621_2828939_+	NA	NA|121aa|up_5|NZ_CP045226.1_2828935_2829298_+	COG5550, COG5550, Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]	NA|86aa|up_4|NZ_CP045226.1_2829804_2830062_+	TIGR02606, Antitoxin_ParD, putative addiction module antidote protein, CC2985 family	NA|519aa|up_3|NZ_CP045226.1_2830047_2831604_-	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|657aa|up_2|NZ_CP045226.1_2831606_2833577_-	NA	NA|225aa|up_1|NZ_CP045226.1_2833625_2834300_-	NA	NA|160aa|up_0|NZ_CP045226.1_2834638_2835118_+	NA	NA|326aa|down_0|NZ_CP045226.1_2835749_2836727_-	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|72aa|down_1|NZ_CP045226.1_2836831_2837047_-	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|298aa|down_2|NZ_CP045226.1_2837246_2838140_-	PRK13945, PRK13945, formamidopyrimidine-DNA glycosylase; Provisional	NA|72aa|down_3|NZ_CP045226.1_2838391_2838607_-	pfam02427, PSI_PsaE, Photosystem I reaction centre subunit IV / PsaE	NA|195aa|down_4|NZ_CP045226.1_2838921_2839506_-	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|79aa|down_5|NZ_CP045226.1_2839748_2839985_+	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|368aa|down_6|NZ_CP045226.1_2840201_2841305_+	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|584aa|down_7|NZ_CP045226.1_2841447_2843199_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|168aa|down_8|NZ_CP045226.1_2843472_2843976_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|114aa|down_9|NZ_CP045226.1_2844053_2844395_-	PRK13612, PRK13612, photosystem II reaction center protein Psb28; Provisional
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	22	3653832-3654018	4	PILER-CR	no	cas14j	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Unclear	ACAAAAAGACAAGCGTTCTGAACGAAACAACTTGTGTTCTGAACGAA	47	0	0	NA	NA	NA	2	2	TypeV	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|60aa|up_9|NZ_CP045226.1_3639003_3639183_+,NA|365aa|up_0|NZ_CP045226.1_3652545_3653640_-,NA	NA|60aa|up_9|NZ_CP045226.1_3639003_3639183_+	NA	NA|456aa|up_8|NZ_CP045226.1_3639274_3640642_-	pfam14516, AAA_35, AAA-like domain	NA|529aa|up_7|NZ_CP045226.1_3641212_3642799_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|524aa|up_6|NZ_CP045226.1_3643354_3644926_+	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|253aa|up_5|NZ_CP045226.1_3645139_3645898_+	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|419aa|up_4|NZ_CP045226.1_3646277_3647534_+	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|207aa|up_3|NZ_CP045226.1_3647544_3648165_-	pfam05685, Uma2, Putative restriction endonuclease	NA|732aa|up_2|NZ_CP045226.1_3648660_3650856_+	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|95aa|up_1|NZ_CP045226.1_3652008_3652293_+	COG2343, COG2343, Uncharacterized protein conserved in bacteria [Function unknown]	NA|365aa|up_0|NZ_CP045226.1_3652545_3653640_-	NA	NA|1009aa|down_0|NZ_CP045226.1_3654433_3657460_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|943aa|down_1|NZ_CP045226.1_3657823_3660652_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|166aa|down_2|NZ_CP045226.1_3660712_3661210_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|126aa|down_3|NZ_CP045226.1_3661214_3661592_-	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|356aa|down_4|NZ_CP045226.1_3661658_3662726_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|490aa|down_5|NZ_CP045226.1_3663659_3665129_-	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	cas14j|472aa|down_6|NZ_CP045226.1_3665535_3666951_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|135aa|down_7|NZ_CP045226.1_3667030_3667435_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|29aa|down_8|NZ_CP045226.1_3667720_3667807_+	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|396aa|down_9|NZ_CP045226.1_3668399_3669587_-	PRK00053, alr, alanine racemase; Reviewed
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	23	3915299-3915521	21	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	CTTCTAAGTTGGCATCACAAAGGATTGC	28	0	0	NA	NA	NA	3	3	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA,NA	NA|147aa|up_9|NZ_CP045226.1_3904375_3904816_+	COG2105, COG2105, Uncharacterized conserved protein [Function unknown]	NA|205aa|up_8|NZ_CP045226.1_3904839_3905454_-	COG5662, COG5662, Predicted transmembrane transcriptional regulator (anti-sigma factor) [Transcription]	NA|219aa|up_7|NZ_CP045226.1_3905608_3906265_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|241aa|up_6|NZ_CP045226.1_3907339_3908062_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|180aa|up_5|NZ_CP045226.1_3908370_3908910_-	pfam10719, ComFB, Late competence development protein ComFB	NA|305aa|up_4|NZ_CP045226.1_3909032_3909947_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|33aa|up_3|NZ_CP045226.1_3910041_3910140_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|233aa|up_2|NZ_CP045226.1_3910942_3911641_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|389aa|up_1|NZ_CP045226.1_3911746_3912913_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|399aa|up_0|NZ_CP045226.1_3913074_3914271_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|161aa|down_0|NZ_CP045226.1_3917692_3918175_+	COG3678, CpxP, P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein [Intracellular trafficking and secretion / Cell motility and secretio / Signal transduction mechanisms / Inorganic ion transport and metabolism]	NA|492aa|down_1|NZ_CP045226.1_3918352_3919828_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|248aa|down_2|NZ_CP045226.1_3920145_3920889_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|404aa|down_3|NZ_CP045226.1_3921115_3922327_+	TIGR03169, selenide_water_dikinase_putative, pyridine nucleotide-disulfide oxidoreductase family protein	NA|607aa|down_4|NZ_CP045226.1_3922622_3924443_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|300aa|down_5|NZ_CP045226.1_3925144_3926044_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|85aa|down_6|NZ_CP045226.1_3926222_3926477_+	COG4095, COG4095, Uncharacterized conserved protein [Function unknown]	NA|552aa|down_7|NZ_CP045226.1_3926649_3928305_+	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|63aa|down_8|NZ_CP045226.1_3928411_3928600_+	pfam14255, Cys_rich_CPXG, Cysteine-rich CPXCG	NA|150aa|down_9|NZ_CP045226.1_3928619_3929069_+	COG4276, COG4276, Uncharacterized conserved protein [Function unknown]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	24	3920279-3920362	22	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	TTAGCTGCACTGAGAATTGCTCCT	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA,NA	NA|241aa|up_9|NZ_CP045226.1_3907339_3908062_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|180aa|up_8|NZ_CP045226.1_3908370_3908910_-	pfam10719, ComFB, Late competence development protein ComFB	NA|305aa|up_7|NZ_CP045226.1_3909032_3909947_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|33aa|up_6|NZ_CP045226.1_3910041_3910140_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|233aa|up_5|NZ_CP045226.1_3910942_3911641_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|389aa|up_4|NZ_CP045226.1_3911746_3912913_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|399aa|up_3|NZ_CP045226.1_3913074_3914271_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|409aa|up_2|NZ_CP045226.1_3915170_3916397_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|161aa|up_1|NZ_CP045226.1_3917692_3918175_+	COG3678, CpxP, P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein [Intracellular trafficking and secretion / Cell motility and secretio / Signal transduction mechanisms / Inorganic ion transport and metabolism]	NA|492aa|up_0|NZ_CP045226.1_3918352_3919828_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|404aa|down_0|NZ_CP045226.1_3921115_3922327_+	TIGR03169, selenide_water_dikinase_putative, pyridine nucleotide-disulfide oxidoreductase family protein	NA|607aa|down_1|NZ_CP045226.1_3922622_3924443_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|300aa|down_2|NZ_CP045226.1_3925144_3926044_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|85aa|down_3|NZ_CP045226.1_3926222_3926477_+	COG4095, COG4095, Uncharacterized conserved protein [Function unknown]	NA|552aa|down_4|NZ_CP045226.1_3926649_3928305_+	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|63aa|down_5|NZ_CP045226.1_3928411_3928600_+	pfam14255, Cys_rich_CPXG, Cysteine-rich CPXCG	NA|150aa|down_6|NZ_CP045226.1_3928619_3929069_+	COG4276, COG4276, Uncharacterized conserved protein [Function unknown]	NA|254aa|down_7|NZ_CP045226.1_3930442_3931204_+	cd03513, CrtW_beta-carotene-ketolase, Beta-carotene ketolase/oxygenase (CrtW, also known as CrtO), the carotenoid astaxanthin biosynthetic enzyme, initially catalyzes the addition of two keto groups to carbons C4 and C4' of beta-carotene	NA|327aa|down_8|NZ_CP045226.1_3931514_3932495_+	PRK06245, cofG, FO synthase subunit 1; Reviewed	NA|442aa|down_9|NZ_CP045226.1_3932938_3934264_+	pfam13191, AAA_16, AAA ATPase domain
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	25	4035757-4035876	23	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	TGTAGGCGATGGCAGTATGGAGAAAGTAGAAGAG	34	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA,NA|102aa|down_3|NZ_CP045226.1_4042481_4042787_+	NA|408aa|up_9|NZ_CP045226.1_4020759_4021983_-	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|700aa|up_8|NZ_CP045226.1_4021989_4024089_-	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|373aa|up_7|NZ_CP045226.1_4024571_4025690_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|430aa|up_6|NZ_CP045226.1_4026331_4027621_-	COG0334, GdhA, Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]	NA|382aa|up_5|NZ_CP045226.1_4028060_4029206_-	PRK05952, PRK05952, beta-ketoacyl-ACP synthase	NA|266aa|up_4|NZ_CP045226.1_4029294_4030092_-	cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) similar ot the Spinach thylakoid lumen protein TLP40	NA|190aa|up_3|NZ_CP045226.1_4030233_4030803_-	PRK02542, PRK02542, photosystem I assembly protein Ycf4; Provisional	NA|352aa|up_2|NZ_CP045226.1_4031965_4033021_+	CHL00004, psbD, photosystem II protein D2	NA|464aa|up_1|NZ_CP045226.1_4033004_4034396_+	CHL00035, psbC, photosystem II 44 kDa protein	NA|178aa|up_0|NZ_CP045226.1_4034757_4035291_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|279aa|down_0|NZ_CP045226.1_4036138_4036975_-	COG4279, COG4279, Uncharacterized conserved protein [Function unknown]	NA|1562aa|down_1|NZ_CP045226.1_4037074_4041760_-	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|188aa|down_2|NZ_CP045226.1_4041907_4042471_+	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|102aa|down_3|NZ_CP045226.1_4042481_4042787_+	NA	NA|719aa|down_4|NZ_CP045226.1_4042840_4044997_+	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|450aa|down_5|NZ_CP045226.1_4045428_4046778_+	TIGR04344, generic_methyltransferase, 5-histidylcysteine sulfoxide synthase	NA|528aa|down_6|NZ_CP045226.1_4047323_4048907_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|332aa|down_7|NZ_CP045226.1_4049543_4050539_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|298aa|down_8|NZ_CP045226.1_4050934_4051828_+	pfam14257, DUF4349, Domain of unknown function (DUF4349)	NA|446aa|down_9|NZ_CP045226.1_4052083_4053421_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	26	4464957-4465098	24	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	CAGGGGGGCTGTTTGAGAACTGAGTAGTTGATTAAAAGTTGGTGCT	46	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA,NA|268aa|down_0|NZ_CP045226.1_4465864_4466668_+,NA|242aa|down_1|NZ_CP045226.1_4467018_4467744_-,NA|122aa|down_7|NZ_CP045226.1_4475490_4475856_+,NA|72aa|down_8|NZ_CP045226.1_4476617_4476833_-	NA|208aa|up_9|NZ_CP045226.1_4448011_4448635_+	TIGR04211, hypothetical_protein, SH3 domain protein	NA|295aa|up_8|NZ_CP045226.1_4449134_4450019_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|104aa|up_7|NZ_CP045226.1_4450391_4450703_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|116aa|up_6|NZ_CP045226.1_4450788_4451136_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|396aa|up_5|NZ_CP045226.1_4452054_4453242_+	PRK08295, PRK08295, RNA polymerase sporulation sigma factor SigH	NA|228aa|up_4|NZ_CP045226.1_4453773_4454457_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|871aa|up_3|NZ_CP045226.1_4454945_4457558_-	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming)	NA|780aa|up_2|NZ_CP045226.1_4457609_4459949_-	pfam03170, BcsB, Bacterial cellulose synthase subunit	NA|419aa|up_1|NZ_CP045226.1_4460130_4461387_-	PRK11097, PRK11097, cellulase	NA|786aa|up_0|NZ_CP045226.1_4461426_4463784_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|268aa|down_0|NZ_CP045226.1_4465864_4466668_+	NA	NA|242aa|down_1|NZ_CP045226.1_4467018_4467744_-	NA	NA|531aa|down_2|NZ_CP045226.1_4468417_4470010_+	COG2730, BglC, Endoglucanase [Carbohydrate transport and metabolism]	NA|594aa|down_3|NZ_CP045226.1_4470093_4471875_+	COG3975, COG3975, Predicted protease with the C-terminal PDZ domain [General function prediction only]	NA|590aa|down_4|NZ_CP045226.1_4472008_4473778_-	PRK06354, PRK06354, pyruvate kinase; Provisional	NA|297aa|down_5|NZ_CP045226.1_4474158_4475049_+	cd03514, CrtR_beta-carotene-hydroxylase, Beta-carotene hydroxylase (CrtR), the carotenoid zeaxanthin biosynthetic enzyme catalyzes the addition of hydroxyl groups to the beta-ionone rings of beta-carotene to form zeaxanthin and is found in bacteria and red algae	NA|83aa|down_6|NZ_CP045226.1_4475111_4475360_+	pfam15643, Tox-PL-2, Papain fold toxin 2	NA|122aa|down_7|NZ_CP045226.1_4475490_4475856_+	NA	NA|72aa|down_8|NZ_CP045226.1_4476617_4476833_-	NA	NA|192aa|down_9|NZ_CP045226.1_4477152_4477728_-	pfam05685, Uma2, Putative restriction endonuclease
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	27	5036399-5036791	5,25,2	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Type V-U5	GTTACAATGACCCTTCCCGTGTTGAGCGGGTTGAAAG,GTTACAATGACCCTTCCCGTGTTGAGCGGGTTGAAAG,GTTACAATGACCCTTCCCGTGTTGAGCGGGTTGAAAG	37,37,37	0	0	NA	NA	V-U5:V-U5:V-U5	4,5,5	5	TypeV-U5	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|86aa|up_3|NZ_CP045226.1_5032752_5033010_-,NA|102aa|up_2|NZ_CP045226.1_5033206_5033512_+,NA|64aa|down_2|NZ_CP045226.1_5041524_5041716_-	NA|174aa|up_9|NZ_CP045226.1_5020625_5021147_+	pfam06527, TniQ, TniQ	NA|1173aa|up_8|NZ_CP045226.1_5021386_5024905_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|116aa|up_7|NZ_CP045226.1_5025034_5025382_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|137aa|up_6|NZ_CP045226.1_5025378_5025789_-	pfam05973, Gp49, Phage derived protein Gp49-like (DUF891)	NA|974aa|up_5|NZ_CP045226.1_5026099_5029021_+	COG1743, COG1743, Adenine-specific DNA methylase containing a Zn-ribbon [DNA replication, recombination, and repair]	NA|1109aa|up_4|NZ_CP045226.1_5029148_5032475_+	COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|86aa|up_3|NZ_CP045226.1_5032752_5033010_-	NA	NA|102aa|up_2|NZ_CP045226.1_5033206_5033512_+	NA	NA|89aa|up_1|NZ_CP045226.1_5033638_5033905_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	c2c5_V-U5|640aa|up_0|NZ_CP045226.1_5034026_5035946_+	PRK05771, PRK05771, V-type ATP synthase subunit I; Validated	NA|773aa|down_0|NZ_CP045226.1_5037525_5039844_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|154aa|down_1|NZ_CP045226.1_5040636_5041098_-	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|64aa|down_2|NZ_CP045226.1_5041524_5041716_-	NA	NA|380aa|down_3|NZ_CP045226.1_5041651_5042791_-	TIGR02048, gshA_cyano, glutamate--cysteine ligase, cyanobacterial, putative	NA|614aa|down_4|NZ_CP045226.1_5043009_5044851_-	cd09133, PLDc_unchar5, Putative catalytic domain of uncharacterized hypothetical proteins with one or two copies of the HKD motif	NA|508aa|down_5|NZ_CP045226.1_5044828_5046352_-	pfam13087, AAA_12, AAA domain	NA|199aa|down_6|NZ_CP045226.1_5049749_5050346_-	pfam05685, Uma2, Putative restriction endonuclease	NA|518aa|down_7|NZ_CP045226.1_5050707_5052261_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|188aa|down_8|NZ_CP045226.1_5052644_5053208_-	pfam11322, DUF3124, Protein of unknown function (DUF3124)	NA|226aa|down_9|NZ_CP045226.1_5053282_5053960_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	28	5417800-5417905	26	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	CTATTTCCGACTTGTGTGTACACCATAGC	29	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|62aa|up_7|NZ_CP045226.1_5407265_5407451_+,NA|102aa|down_5|NZ_CP045226.1_5424118_5424424_+,NA|65aa|down_7|NZ_CP045226.1_5427592_5427787_-	NA|356aa|up_9|NZ_CP045226.1_5404529_5405597_-	pfam12275, DUF3616, Protein of unknown function (DUF3616)	NA|328aa|up_8|NZ_CP045226.1_5406174_5407158_+	COG5607, COG5607, Uncharacterized conserved protein [Function unknown]	NA|62aa|up_7|NZ_CP045226.1_5407265_5407451_+	NA	NA|626aa|up_6|NZ_CP045226.1_5407796_5409674_+	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|239aa|up_5|NZ_CP045226.1_5409725_5410442_+	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|499aa|up_4|NZ_CP045226.1_5410583_5412080_+	PRK07363, PRK07363, NADH-quinone oxidoreductase subunit M	NA|436aa|up_3|NZ_CP045226.1_5412556_5413864_+	pfam10216, ChpXY, CO2 hydration protein (ChpXY)	NA|286aa|up_2|NZ_CP045226.1_5413931_5414789_-	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|108aa|up_1|NZ_CP045226.1_5415462_5415786_+	COG2076, EmrE, Membrane transporters of cations and cationic drugs [Inorganic ion transport and metabolism]	NA|339aa|up_0|NZ_CP045226.1_5416409_5417426_-	pfam01032, FecCD, FecCD transport family	NA|208aa|down_0|NZ_CP045226.1_5418108_5418732_-	pfam13353, Fer4_12, 4Fe-4S single cluster domain	NA|398aa|down_1|NZ_CP045226.1_5418931_5420125_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|216aa|down_2|NZ_CP045226.1_5420228_5420876_+	TIGR02252, Rhythmically_expressed_gene_2_protein, REG-2-like, HAD superfamily (subfamily IA) hydrolase	NA|287aa|down_3|NZ_CP045226.1_5421433_5422294_+	pfam06485, DUF1092, Protein of unknown function (DUF1092)	NA|438aa|down_4|NZ_CP045226.1_5422776_5424090_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|102aa|down_5|NZ_CP045226.1_5424118_5424424_+	NA	NA|867aa|down_6|NZ_CP045226.1_5424817_5427418_+	cd01031, EriC, ClC chloride channel EriC	NA|65aa|down_7|NZ_CP045226.1_5427592_5427787_-	NA	NA|387aa|down_8|NZ_CP045226.1_5427820_5428981_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|420aa|down_9|NZ_CP045226.1_5428973_5430233_-	pfam04932, Wzy_C, O-Antigen ligase
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	29	5507926-5508825	6,27,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,PD-DExK,cas3,WYL,DEDDh	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Type I-D	GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	12,12,12	12	TypeI-D	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|335aa|up_3|NZ_CP045226.1_5503208_5504213_+,NA|62aa|up_1|NZ_CP045226.1_5505485_5505671_-,NA|137aa|down_5|NZ_CP045226.1_5512889_5513300_+	NA|295aa|up_9|NZ_CP045226.1_5493317_5494202_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|391aa|up_8|NZ_CP045226.1_5494619_5495792_+	PRK00770, PRK00770, deoxyhypusine synthase	NA|635aa|up_7|NZ_CP045226.1_5496072_5497977_-	pfam00211, Guanylate_cyc, Adenylate and Guanylate cyclase catalytic domain	NA|120aa|up_6|NZ_CP045226.1_5498678_5499038_+	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|455aa|up_5|NZ_CP045226.1_5499249_5500614_+	COG1961, PinR, Site-specific recombinases, DNA invertase Pin homologs [DNA replication, recombination, and repair]	NA|540aa|up_4|NZ_CP045226.1_5501295_5502915_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|335aa|up_3|NZ_CP045226.1_5503208_5504213_+	NA	NA|341aa|up_2|NZ_CP045226.1_5504282_5505305_-	cd08235, iditol_2_DH_like, L-iditol 2-dehydrogenase	NA|62aa|up_1|NZ_CP045226.1_5505485_5505671_-	NA	NA|261aa|up_0|NZ_CP045226.1_5506979_5507762_+	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	cas2|96aa|down_0|NZ_CP045226.1_5509093_5509381_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP045226.1_5509561_5510566_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_2|NZ_CP045226.1_5510715_5511309_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|279aa|down_3|NZ_CP045226.1_5511340_5512177_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	2OG_CAS|207aa|down_4|NZ_CP045226.1_5512166_5512787_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|137aa|down_5|NZ_CP045226.1_5512889_5513300_+	NA	csc1gr5|236aa|down_6|NZ_CP045226.1_5513300_5514008_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|339aa|down_7|NZ_CP045226.1_5514007_5515024_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1095aa|down_8|NZ_CP045226.1_5515089_5518374_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	PD-DExK|340aa|down_9|NZ_CP045226.1_5518391_5519411_-	pfam06250, DUF1016, Protein of unknown function (DUF1016)
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	30	5588704-5588803	28	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	TGCCTTTTTGCTTGCTGAATATACCGAAATGAGAAAG	37	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|162aa|up_9|NZ_CP045226.1_5574315_5574801_+,NA|75aa|up_5|NZ_CP045226.1_5578319_5578544_+,NA|133aa|down_6|NZ_CP045226.1_5596485_5596884_+	NA|162aa|up_9|NZ_CP045226.1_5574315_5574801_+	NA	NA|134aa|up_8|NZ_CP045226.1_5574880_5575282_+	COG2363, COG2363, Uncharacterized small membrane protein [Function unknown]	NA|263aa|up_7|NZ_CP045226.1_5575817_5576606_+	TIGR03442, TIGR03442, ergothioneine biosynthesis protein EgtC	NA|345aa|up_6|NZ_CP045226.1_5576896_5577931_+	COG4301, COG4301, Uncharacterized conserved protein [Function unknown]	NA|75aa|up_5|NZ_CP045226.1_5578319_5578544_+	NA	NA|390aa|up_4|NZ_CP045226.1_5580528_5581698_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|230aa|up_3|NZ_CP045226.1_5581922_5582612_-	pfam02517, Abi, CAAX protease self-immunity	NA|505aa|up_2|NZ_CP045226.1_5583132_5584647_+	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|389aa|up_1|NZ_CP045226.1_5585177_5586344_-	TIGR03803, hypothetical_protein_CfE428DRAFT_0741, Gloeo_Verruco repeat	NA|324aa|up_0|NZ_CP045226.1_5587590_5588562_+	PRK11863, PRK11863, N-acetyl-gamma-glutamyl-phosphate reductase; Provisional	NA|214aa|down_0|NZ_CP045226.1_5589990_5590632_+	cd07051, BMC_like_1_repeat1, Bacterial Micro-Compartment (BMC)-like domain 1 repeat 1	NA|269aa|down_1|NZ_CP045226.1_5590642_5591449_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|448aa|down_2|NZ_CP045226.1_5592387_5593731_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|486aa|down_3|NZ_CP045226.1_5593968_5595426_+	TIGR03606, Quinoprotein_glucose_dehydrogenase_B, dehydrogenase, PQQ-dependent, s-GDH family	NA|89aa|down_4|NZ_CP045226.1_5595515_5595782_+	pfam03795, YCII, YCII-related domain	NA|129aa|down_5|NZ_CP045226.1_5595942_5596329_+	COG3937, COG3937, Uncharacterized conserved protein [Function unknown]	NA|133aa|down_6|NZ_CP045226.1_5596485_5596884_+	NA	NA|140aa|down_7|NZ_CP045226.1_5597048_5597468_+	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	NA|466aa|down_8|NZ_CP045226.1_5598226_5599624_-	CHL00177, ccs1, c-type cytochrome biogenensis protein; Validated	NA|247aa|down_9|NZ_CP045226.1_5599654_5600395_-	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	31	5652990-5653078	29	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	GCGATCGCTCTTATCTAAACCCCAA	25	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA,NA|93aa|down_0|NZ_CP045226.1_5653099_5653378_+,NA|147aa|down_3|NZ_CP045226.1_5657376_5657817_-,NA|223aa|down_5|NZ_CP045226.1_5660881_5661550_+,NA|135aa|down_8|NZ_CP045226.1_5664091_5664496_+	NA|318aa|up_9|NZ_CP045226.1_5641511_5642465_-	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|332aa|up_8|NZ_CP045226.1_5642658_5643654_-	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|361aa|up_7|NZ_CP045226.1_5644248_5645331_+	cd13919, CuRO_HCO_II_like_5, Uncharacterized subfamily with similarity to Heme-copper oxidase subunit II cupredoxin domain	NA|579aa|up_6|NZ_CP045226.1_5645432_5647169_+	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|209aa|up_5|NZ_CP045226.1_5647277_5647904_+	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|433aa|up_4|NZ_CP045226.1_5648388_5649687_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|358aa|up_3|NZ_CP045226.1_5649818_5650892_+	cd13653, PBP2_phosphate_like_1, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|191aa|up_2|NZ_CP045226.1_5650956_5651529_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|208aa|up_1|NZ_CP045226.1_5651636_5652260_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|131aa|up_0|NZ_CP045226.1_5652273_5652666_-	pfam07693, KAP_NTPase, KAP family P-loop domain	NA|93aa|down_0|NZ_CP045226.1_5653099_5653378_+	NA	NA|588aa|down_1|NZ_CP045226.1_5653541_5655305_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|431aa|down_2|NZ_CP045226.1_5655294_5656587_-	pfam13191, AAA_16, AAA ATPase domain	NA|147aa|down_3|NZ_CP045226.1_5657376_5657817_-	NA	NA|561aa|down_4|NZ_CP045226.1_5658426_5660109_+	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|223aa|down_5|NZ_CP045226.1_5660881_5661550_+	NA	NA|174aa|down_6|NZ_CP045226.1_5662291_5662813_-	pfam13384, HTH_23, Homeodomain-like domain	NA|130aa|down_7|NZ_CP045226.1_5663265_5663655_+	pfam00030, Crystall, Beta/Gamma crystallin	NA|135aa|down_8|NZ_CP045226.1_5664091_5664496_+	NA	NA|739aa|down_9|NZ_CP045226.1_5664719_5666936_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]
GCF_009372195.1_ASM937219v1	NZ_CP045226	Nostoc sphaeroides CCNUC1 chromosome Gxm1, complete sequence	32	5791614-5791699	30	CRISPRCasFinder	no		PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh	Orphan	GTTGCAGTTCAAAGCACGAAGTT	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|61aa|up_9|NZ_CP045226.1_5780870_5781053_-,NA|73aa|up_2|NZ_CP045226.1_5786071_5786290_-,NA|112aa|up_0|NZ_CP045226.1_5791016_5791352_+,NA|158aa|down_0|NZ_CP045226.1_5791837_5792311_-,NA|138aa|down_2|NZ_CP045226.1_5796802_5797216_+,NA|129aa|down_7|NZ_CP045226.1_5804051_5804438_+	NA|61aa|up_9|NZ_CP045226.1_5780870_5781053_-	NA	NA|1003aa|up_8|NZ_CP045226.1_5781198_5784207_-	PRK02509, PRK02509, hypothetical protein; Provisional	NA|66aa|up_7|NZ_CP045226.1_5784343_5784541_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|82aa|up_6|NZ_CP045226.1_5784541_5784787_-	pfam08972, DUF1902, Domain of unknown function (DUF1902)	NA|81aa|up_5|NZ_CP045226.1_5784960_5785203_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|88aa|up_4|NZ_CP045226.1_5785244_5785508_+	TIGR02116, Hypothetical_protein_Rv3358/MT3466/Mb3393	NA|143aa|up_3|NZ_CP045226.1_5785646_5786075_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|73aa|up_2|NZ_CP045226.1_5786071_5786290_-	NA	NA|1413aa|up_1|NZ_CP045226.1_5786708_5790947_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|112aa|up_0|NZ_CP045226.1_5791016_5791352_+	NA	NA|158aa|down_0|NZ_CP045226.1_5791837_5792311_-	NA	NA|808aa|down_1|NZ_CP045226.1_5792479_5794903_+	COG4354, COG4354, Predicted bile acid beta-glucosidase [Carbohydrate transport and metabolism]	NA|138aa|down_2|NZ_CP045226.1_5796802_5797216_+	NA	NA|236aa|down_3|NZ_CP045226.1_5798128_5798836_-	PRK05581, PRK05581, ribulose-phosphate 3-epimerase; Validated	NA|532aa|down_4|NZ_CP045226.1_5799036_5800632_+	cd07488, Peptidases_S8_2, Peptidase S8 family domain, uncharacterized subfamily 2	NA|443aa|down_5|NZ_CP045226.1_5801186_5802515_+	cd19588, serpin_miropin-like, serpin miropin and similar proteins	NA|392aa|down_6|NZ_CP045226.1_5802646_5803822_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|129aa|down_7|NZ_CP045226.1_5804051_5804438_+	NA	NA|477aa|down_8|NZ_CP045226.1_5804724_5806155_+	COG1316, LytR, Transcriptional regulator [Transcription]	NA|354aa|down_9|NZ_CP045226.1_5806126_5807188_-	PRK07394, PRK07394, hypothetical protein; Provisional
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	1	55096-55179	1	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	TGTCAACCATTGGTTTACAGAATG	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|320aa|up_8|NZ_CP045227.1_48949_49909_+,NA|152aa|up_6|NZ_CP045227.1_50598_51054_-,NA|99aa|up_5|NZ_CP045227.1_51064_51361_-,NA|143aa|up_4|NZ_CP045227.1_51363_51792_-,NA|63aa|up_3|NZ_CP045227.1_51942_52131_-,NA|311aa|up_2|NZ_CP045227.1_52137_53070_-,NA|268aa|up_1|NZ_CP045227.1_53188_53992_-,NA|277aa|down_2|NZ_CP045227.1_60108_60939_+,NA|297aa|down_3|NZ_CP045227.1_61474_62365_+,NA|93aa|down_9|NZ_CP045227.1_81305_81584_-	NA|464aa|up_9|NZ_CP045227.1_46122_47514_-	TIGR00665, DnaB, replicative DNA helicase	NA|320aa|up_8|NZ_CP045227.1_48949_49909_+	NA	NA|202aa|up_7|NZ_CP045227.1_49974_50580_-	COG4725, IME4, Transcriptional activator, adenine-specific DNA methyltransferase [Signal transduction mechanisms / Transcription]	NA|152aa|up_6|NZ_CP045227.1_50598_51054_-	NA	NA|99aa|up_5|NZ_CP045227.1_51064_51361_-	NA	NA|143aa|up_4|NZ_CP045227.1_51363_51792_-	NA	NA|63aa|up_3|NZ_CP045227.1_51942_52131_-	NA	NA|311aa|up_2|NZ_CP045227.1_52137_53070_-	NA	NA|268aa|up_1|NZ_CP045227.1_53188_53992_-	NA	NA|119aa|up_0|NZ_CP045227.1_54206_54563_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|999aa|down_0|NZ_CP045227.1_56149_59146_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|267aa|down_1|NZ_CP045227.1_59308_60109_+	COG3328, COG3328, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|277aa|down_2|NZ_CP045227.1_60108_60939_+	NA	NA|297aa|down_3|NZ_CP045227.1_61474_62365_+	NA	NA|1274aa|down_4|NZ_CP045227.1_62388_66210_+	pfam00656, Peptidase_C14, Caspase domain	NA|553aa|down_5|NZ_CP045227.1_66341_68000_+	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|1537aa|down_6|NZ_CP045227.1_68062_72673_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1472aa|down_7|NZ_CP045227.1_72702_77118_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|200aa|down_8|NZ_CP045227.1_80637_81237_-	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|93aa|down_9|NZ_CP045227.1_81305_81584_-	NA
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	2	96040-97219	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas3	cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Unclear	GTTTCAATCCCTAATAGGGAGTAACTGAAATTGTAAC,GTTTCAATCCCTAATAGGGAGTAACTGAAATTGTAAC,GTTTCAATCCCTAATAGGGAGTAACTGAAATTGTAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	16,15,14	16	Unclear	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|280aa|up_9|NZ_CP045227.1_83576_84416_-,NA|313aa|up_7|NZ_CP045227.1_86839_87778_+,NA|62aa|up_6|NZ_CP045227.1_87892_88078_+,NA|281aa|up_2|NZ_CP045227.1_92927_93770_-,NA|195aa|down_0|NZ_CP045227.1_97688_98273_-,NA|112aa|down_2|NZ_CP045227.1_100640_100976_+,NA|201aa|down_4|NZ_CP045227.1_104754_105357_-,NA|77aa|down_6|NZ_CP045227.1_108042_108273_-,NA|185aa|down_7|NZ_CP045227.1_108497_109052_+,NA|175aa|down_8|NZ_CP045227.1_109060_109585_+,NA|71aa|down_9|NZ_CP045227.1_109833_110046_+	NA|280aa|up_9|NZ_CP045227.1_83576_84416_-	NA	NA|77aa|up_8|NZ_CP045227.1_86367_86598_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|313aa|up_7|NZ_CP045227.1_86839_87778_+	NA	NA|62aa|up_6|NZ_CP045227.1_87892_88078_+	NA	cas3|699aa|up_5|NZ_CP045227.1_88369_90466_-	COG1201, Lhr, Lhr-like helicases [General function prediction only]	NA|399aa|up_4|NZ_CP045227.1_90465_91662_-	pfam10923, DUF2791, P-loop Domain of unknown function (DUF2791)	NA|420aa|up_3|NZ_CP045227.1_91665_92925_-	pfam10923, DUF2791, P-loop Domain of unknown function (DUF2791)	NA|281aa|up_2|NZ_CP045227.1_92927_93770_-	NA	NA|411aa|up_1|NZ_CP045227.1_94184_95416_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|118aa|up_0|NZ_CP045227.1_95620_95974_-	cd07503, HAD_HisB-N, histidinol phosphate phosphatase and related phosphatases	NA|195aa|down_0|NZ_CP045227.1_97688_98273_-	NA	NA|53aa|down_1|NZ_CP045227.1_100331_100490_+	pfam09274, ParG, ParG	NA|112aa|down_2|NZ_CP045227.1_100640_100976_+	NA	NA|263aa|down_3|NZ_CP045227.1_103907_104696_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|201aa|down_4|NZ_CP045227.1_104754_105357_-	NA	NA|552aa|down_5|NZ_CP045227.1_106069_107725_-	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|77aa|down_6|NZ_CP045227.1_108042_108273_-	NA	NA|185aa|down_7|NZ_CP045227.1_108497_109052_+	NA	NA|175aa|down_8|NZ_CP045227.1_109060_109585_+	NA	NA|71aa|down_9|NZ_CP045227.1_109833_110046_+	NA
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	3	153190-153356	2	PILER-CR	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	AGTTACACCGCGTCTTTATGACGTGGCTCCATTGAAGCGT	40	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|670aa|up_7|NZ_CP045227.1_141962_143972_-,NA|137aa|up_4|NZ_CP045227.1_147591_148002_+,NA|85aa|down_0|NZ_CP045227.1_153688_153943_-,NA|84aa|down_1|NZ_CP045227.1_155989_156241_+,NA|76aa|down_3|NZ_CP045227.1_157471_157699_+,NA|139aa|down_4|NZ_CP045227.1_157686_158103_+	NA|264aa|up_9|NZ_CP045227.1_139293_140085_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|491aa|up_8|NZ_CP045227.1_140328_141801_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|670aa|up_7|NZ_CP045227.1_141962_143972_-	NA	NA|465aa|up_6|NZ_CP045227.1_144651_146046_+	cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN binding domain	NA|489aa|up_5|NZ_CP045227.1_146112_147579_+	pfam00034, Cytochrom_C, Cytochrome c	NA|137aa|up_4|NZ_CP045227.1_147591_148002_+	NA	NA|478aa|up_3|NZ_CP045227.1_148090_149524_+	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|547aa|up_2|NZ_CP045227.1_149617_151258_+	PLN02337, PLN02337, lipoxygenase	NA|102aa|up_1|NZ_CP045227.1_151287_151593_+	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|351aa|up_0|NZ_CP045227.1_151612_152665_+	COG3268, COG3268, Uncharacterized conserved protein [Function unknown]	NA|85aa|down_0|NZ_CP045227.1_153688_153943_-	NA	NA|84aa|down_1|NZ_CP045227.1_155989_156241_+	NA	NA|307aa|down_2|NZ_CP045227.1_156332_157253_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|76aa|down_3|NZ_CP045227.1_157471_157699_+	NA	NA|139aa|down_4|NZ_CP045227.1_157686_158103_+	NA	NA|202aa|down_5|NZ_CP045227.1_158186_158792_-	pfam03929, PepSY_TM, PepSY-associated TM region	NA|226aa|down_6|NZ_CP045227.1_159097_159775_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|468aa|down_7|NZ_CP045227.1_159719_161123_+	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|231aa|down_8|NZ_CP045227.1_161313_162006_+	cd01041, Rubrerythrin, Rubrerythrin, ferritin-like diiron-binding domain	NA|143aa|down_9|NZ_CP045227.1_162197_162626_+	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion]
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	4	164117-164233	3	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	GTGTCAGTCAACAATAATAAAGAGTTCCCAATACAA	36	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|85aa|up_9|NZ_CP045227.1_153688_153943_-,NA|84aa|up_8|NZ_CP045227.1_155989_156241_+,NA|76aa|up_6|NZ_CP045227.1_157471_157699_+,NA|139aa|up_5|NZ_CP045227.1_157686_158103_+,NA|67aa|down_2|NZ_CP045227.1_167757_167958_+,NA|266aa|down_4|NZ_CP045227.1_171078_171876_+,NA|178aa|down_8|NZ_CP045227.1_180378_180912_-	NA|85aa|up_9|NZ_CP045227.1_153688_153943_-	NA	NA|84aa|up_8|NZ_CP045227.1_155989_156241_+	NA	NA|307aa|up_7|NZ_CP045227.1_156332_157253_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|76aa|up_6|NZ_CP045227.1_157471_157699_+	NA	NA|139aa|up_5|NZ_CP045227.1_157686_158103_+	NA	NA|202aa|up_4|NZ_CP045227.1_158186_158792_-	pfam03929, PepSY_TM, PepSY-associated TM region	NA|226aa|up_3|NZ_CP045227.1_159097_159775_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|468aa|up_2|NZ_CP045227.1_159719_161123_+	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|231aa|up_1|NZ_CP045227.1_161313_162006_+	cd01041, Rubrerythrin, Rubrerythrin, ferritin-like diiron-binding domain	NA|143aa|up_0|NZ_CP045227.1_162197_162626_+	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion]	NA|381aa|down_0|NZ_CP045227.1_164945_166088_+	TIGR01263, 4-hydroxyphenylpyruvate_dioxygenase, 4-hydroxyphenylpyruvate dioxygenase	NA|209aa|down_1|NZ_CP045227.1_166341_166968_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|67aa|down_2|NZ_CP045227.1_167757_167958_+	NA	NA|664aa|down_3|NZ_CP045227.1_168832_170824_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|266aa|down_4|NZ_CP045227.1_171078_171876_+	NA	NA|757aa|down_5|NZ_CP045227.1_172177_174448_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|1250aa|down_6|NZ_CP045227.1_174726_178476_-	cd01948, EAL, EAL domain	NA|216aa|down_7|NZ_CP045227.1_179040_179688_-	pfam03372, Exo_endo_phos, Endonuclease/Exonuclease/phosphatase family	NA|178aa|down_8|NZ_CP045227.1_180378_180912_-	NA	NA|615aa|down_9|NZ_CP045227.1_181551_183396_-	pfam00305, Lipoxygenase, Lipoxygenase
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	5	528566-528721	4	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	TCTGTCCCATTAAATATGATGGGTTGTAGAGACGCGAAATTTCGCGTCT	49	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|117aa|up_8|NZ_CP045227.1_516062_516413_-,NA|117aa|up_6|NZ_CP045227.1_517547_517898_-,NA|112aa|up_5|NZ_CP045227.1_518235_518571_-,NA|158aa|up_1|NZ_CP045227.1_525287_525761_+,NA|190aa|down_2|NZ_CP045227.1_531825_532395_-,NA|47aa|down_5|NZ_CP045227.1_549041_549182_+	NA|739aa|up_9|NZ_CP045227.1_513609_515826_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|117aa|up_8|NZ_CP045227.1_516062_516413_-	NA	NA|112aa|up_7|NZ_CP045227.1_516862_517198_-	pfam00030, Crystall, Beta/Gamma crystallin	NA|117aa|up_6|NZ_CP045227.1_517547_517898_-	NA	NA|112aa|up_5|NZ_CP045227.1_518235_518571_-	NA	NA|1007aa|up_4|NZ_CP045227.1_518754_521775_-	pfam05860, Haemagg_act, haemagglutination activity domain	NA|223aa|up_3|NZ_CP045227.1_522199_522868_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|656aa|up_2|NZ_CP045227.1_522867_524835_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|158aa|up_1|NZ_CP045227.1_525287_525761_+	NA	NA|904aa|up_0|NZ_CP045227.1_525815_528527_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|257aa|down_0|NZ_CP045227.1_528750_529521_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|262aa|down_1|NZ_CP045227.1_530182_530968_-	pfam12204, DUF3598, Domain of unknown function (DUF3598)	NA|190aa|down_2|NZ_CP045227.1_531825_532395_-	NA	NA|185aa|down_3|NZ_CP045227.1_532538_533093_-	cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands	NA|4985aa|down_4|NZ_CP045227.1_533658_548613_-	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|47aa|down_5|NZ_CP045227.1_549041_549182_+	NA	NA|1463aa|down_6|NZ_CP045227.1_549903_554292_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|876aa|down_7|NZ_CP045227.1_554306_556934_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|297aa|down_8|NZ_CP045227.1_556875_557766_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|784aa|down_9|NZ_CP045227.1_557768_560120_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	6	847665-847761	5	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	TTGTCATTTGTCATTGGTCATTTG	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|121aa|up_9|NZ_CP045227.1_836116_836479_-,NA|80aa|down_4|NZ_CP045227.1_851563_851803_+	NA|121aa|up_9|NZ_CP045227.1_836116_836479_-	NA	NA|208aa|up_8|NZ_CP045227.1_836456_837080_-	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|177aa|up_7|NZ_CP045227.1_837332_837863_-	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	NA|250aa|up_6|NZ_CP045227.1_838181_838931_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|287aa|up_5|NZ_CP045227.1_838973_839834_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|718aa|up_4|NZ_CP045227.1_840591_842745_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|125aa|up_3|NZ_CP045227.1_842716_843091_+	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|648aa|up_2|NZ_CP045227.1_843099_845043_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|194aa|up_1|NZ_CP045227.1_845460_846042_-	cd12130, Apl, Allophycocyanin-like globins	NA|220aa|up_0|NZ_CP045227.1_846519_847179_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|270aa|down_0|NZ_CP045227.1_847812_848622_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|196aa|down_1|NZ_CP045227.1_848692_849280_+	pfam09367, CpeS, CpeS-like protein	NA|244aa|down_2|NZ_CP045227.1_849473_850205_+	PRK13247, PRK13247, 15,16-dihydrobiliverdin:ferredoxin oxidoreductase	NA|250aa|down_3|NZ_CP045227.1_850273_851023_+	PRK13250, PRK13250, phycoerythrobilin:ferredoxin oxidoreductase; Provisional	NA|80aa|down_4|NZ_CP045227.1_851563_851803_+	NA	NA|304aa|down_5|NZ_CP045227.1_851837_852749_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|205aa|down_6|NZ_CP045227.1_853050_853665_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|440aa|down_7|NZ_CP045227.1_854196_855516_-	pfam13646, HEAT_2, HEAT repeats	NA|165aa|down_8|NZ_CP045227.1_855662_856157_-	cd14769, PE_alpha, Phycoerythrin alpha subunit, a phycobilisome rod component	NA|185aa|down_9|NZ_CP045227.1_856237_856792_-	cd14767, PE_beta-like, Phycoerythrin beta subunit, a component of the phycobilisome rod; and related proteins
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	7	940928-941016	6	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	GTCCTTCTCTAACGAGACGCTGCGCGTTGGC	31	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|104aa|up_7|NZ_CP045227.1_933264_933576_+,NA|68aa|up_6|NZ_CP045227.1_934453_934657_+,NA|121aa|up_5|NZ_CP045227.1_935197_935560_+,NA|66aa|up_3|NZ_CP045227.1_937451_937649_-,NA|67aa|down_1|NZ_CP045227.1_943534_943735_+,NA|109aa|down_7|NZ_CP045227.1_951579_951906_+	NA|220aa|up_9|NZ_CP045227.1_930964_931624_-	pfam07077, DUF1345, Protein of unknown function (DUF1345)	NA|257aa|up_8|NZ_CP045227.1_931658_932429_-	pfam07077, DUF1345, Protein of unknown function (DUF1345)	NA|104aa|up_7|NZ_CP045227.1_933264_933576_+	NA	NA|68aa|up_6|NZ_CP045227.1_934453_934657_+	NA	NA|121aa|up_5|NZ_CP045227.1_935197_935560_+	NA	NA|404aa|up_4|NZ_CP045227.1_935908_937120_-	pfam05433, Rick_17kDa_Anti, Glycine zipper 2TM domain	NA|66aa|up_3|NZ_CP045227.1_937451_937649_-	NA	NA|49aa|up_2|NZ_CP045227.1_937774_937921_+	pfam04304, DUF454, Protein of unknown function (DUF454)	NA|368aa|up_1|NZ_CP045227.1_938097_939201_+	cd02035, ArsA, Arsenical pump-driving ATPase ArsA	NA|363aa|up_0|NZ_CP045227.1_939349_940438_-	TIGR00378, cax, calcium/proton exchanger (cax)	NA|664aa|down_0|NZ_CP045227.1_941393_943385_-	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|67aa|down_1|NZ_CP045227.1_943534_943735_+	NA	NA|106aa|down_2|NZ_CP045227.1_945272_945590_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|112aa|down_3|NZ_CP045227.1_945743_946079_-	pfam08869, XisI, XisI protein	NA|139aa|down_4|NZ_CP045227.1_946066_946483_-	pfam08814, XisH, XisH protein	NA|130aa|down_5|NZ_CP045227.1_946694_947084_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|168aa|down_6|NZ_CP045227.1_950816_951320_+	COG1430, COG1430, Uncharacterized conserved protein [Function unknown]	NA|109aa|down_7|NZ_CP045227.1_951579_951906_+	NA	NA|329aa|down_8|NZ_CP045227.1_951944_952931_+	cd05386, TraL, transfer origin protein TraL	NA|229aa|down_9|NZ_CP045227.1_952935_953622_+	cd05386, TraL, transfer origin protein TraL
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	8	948000-948099	7	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	TACAAAATGTCTGCTATTTTGCA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|66aa|up_9|NZ_CP045227.1_937451_937649_-,NA|67aa|up_4|NZ_CP045227.1_943534_943735_+,NA|109aa|down_1|NZ_CP045227.1_951579_951906_+	NA|66aa|up_9|NZ_CP045227.1_937451_937649_-	NA	NA|49aa|up_8|NZ_CP045227.1_937774_937921_+	pfam04304, DUF454, Protein of unknown function (DUF454)	NA|368aa|up_7|NZ_CP045227.1_938097_939201_+	cd02035, ArsA, Arsenical pump-driving ATPase ArsA	NA|363aa|up_6|NZ_CP045227.1_939349_940438_-	TIGR00378, cax, calcium/proton exchanger (cax)	NA|664aa|up_5|NZ_CP045227.1_941393_943385_-	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|67aa|up_4|NZ_CP045227.1_943534_943735_+	NA	NA|106aa|up_3|NZ_CP045227.1_945272_945590_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|112aa|up_2|NZ_CP045227.1_945743_946079_-	pfam08869, XisI, XisI protein	NA|139aa|up_1|NZ_CP045227.1_946066_946483_-	pfam08814, XisH, XisH protein	NA|130aa|up_0|NZ_CP045227.1_946694_947084_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|168aa|down_0|NZ_CP045227.1_950816_951320_+	COG1430, COG1430, Uncharacterized conserved protein [Function unknown]	NA|109aa|down_1|NZ_CP045227.1_951579_951906_+	NA	NA|329aa|down_2|NZ_CP045227.1_951944_952931_+	cd05386, TraL, transfer origin protein TraL	NA|229aa|down_3|NZ_CP045227.1_952935_953622_+	cd05386, TraL, transfer origin protein TraL	NA|228aa|down_4|NZ_CP045227.1_953726_954410_+	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|515aa|down_5|NZ_CP045227.1_957919_959464_-	pfam03743, TrbI, Bacterial conjugation TrbI-like protein	NA|186aa|down_6|NZ_CP045227.1_961698_962256_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|849aa|down_7|NZ_CP045227.1_965618_968165_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|351aa|down_8|NZ_CP045227.1_968354_969407_+	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|975aa|down_9|NZ_CP045227.1_969557_972482_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	9	1113750-1113836	8	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	ACGACTCACAGGAATAGCGATCGC	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|362aa|up_8|NZ_CP045227.1_1096593_1097678_+,NA|98aa|up_7|NZ_CP045227.1_1097862_1098156_+,NA|200aa|down_4|NZ_CP045227.1_1122975_1123575_+	NA|74aa|up_9|NZ_CP045227.1_1096100_1096322_-	pfam14217, DUF4327, Domain of unknown function (DUF4327)	NA|362aa|up_8|NZ_CP045227.1_1096593_1097678_+	NA	NA|98aa|up_7|NZ_CP045227.1_1097862_1098156_+	NA	NA|65aa|up_6|NZ_CP045227.1_1098393_1098588_+	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|483aa|up_5|NZ_CP045227.1_1104752_1106201_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|126aa|up_4|NZ_CP045227.1_1106197_1106575_-	cd17614, REC_OmpR_YycF-like, phosphoacceptor receiver (REC) domain of YrcF-like OmpR family response regulators	NA|234aa|up_3|NZ_CP045227.1_1109442_1110144_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|251aa|up_2|NZ_CP045227.1_1110818_1111571_+	pfam03235, DUF262, Protein of unknown function DUF262	NA|348aa|up_1|NZ_CP045227.1_1111577_1112621_+	pfam04326, AlbA_2, Putative DNA-binding domain	NA|214aa|up_0|NZ_CP045227.1_1112996_1113638_-	pfam13588, HSDR_N_2, Type I restriction enzyme R protein N-terminus (HSDR_N)	NA|443aa|down_0|NZ_CP045227.1_1114830_1116159_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|428aa|down_1|NZ_CP045227.1_1118407_1119691_+	COG1808, COG1808, Predicted membrane protein [Function unknown]	NA|272aa|down_2|NZ_CP045227.1_1119775_1120591_+	pfam00520, Ion_trans, Ion transport protein	NA|336aa|down_3|NZ_CP045227.1_1120623_1121631_+	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|200aa|down_4|NZ_CP045227.1_1122975_1123575_+	NA	NA|344aa|down_5|NZ_CP045227.1_1123775_1124807_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|355aa|down_6|NZ_CP045227.1_1124878_1125943_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|606aa|down_7|NZ_CP045227.1_1125986_1127804_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|555aa|down_8|NZ_CP045227.1_1128283_1129948_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|124aa|down_9|NZ_CP045227.1_1130126_1130498_+	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	10	1438457-1438563	9	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	CATAAATAAATTTAGGGGCTTGAAA	25	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|132aa|up_6|NZ_CP045227.1_1431084_1431480_-,NA|171aa|down_3|NZ_CP045227.1_1442052_1442565_-,NA|315aa|down_6|NZ_CP045227.1_1449744_1450689_-,NA|294aa|down_8|NZ_CP045227.1_1451274_1452156_+,NA|109aa|down_9|NZ_CP045227.1_1452243_1452570_+	NA|102aa|up_9|NZ_CP045227.1_1428748_1429054_-	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|214aa|up_8|NZ_CP045227.1_1429135_1429777_-	COG2258, COG2258, Uncharacterized protein conserved in bacteria [Function unknown]	NA|160aa|up_7|NZ_CP045227.1_1430278_1430758_-	pfam07080, DUF1348, Protein of unknown function (DUF1348)	NA|132aa|up_6|NZ_CP045227.1_1431084_1431480_-	NA	NA|332aa|up_5|NZ_CP045227.1_1431775_1432771_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|290aa|up_4|NZ_CP045227.1_1433049_1433919_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|250aa|up_3|NZ_CP045227.1_1434024_1434774_-	PRK06500, PRK06500, SDR family oxidoreductase	NA|144aa|up_2|NZ_CP045227.1_1434859_1435291_-	cd02215, cupin_QDO_N_C, quercetinase, N- and C-terminal cupin domains	NA|304aa|up_1|NZ_CP045227.1_1435582_1436494_-	PRK06196, PRK06196, oxidoreductase; Provisional	NA|526aa|up_0|NZ_CP045227.1_1436726_1438304_-	PRK02106, PRK02106, choline dehydrogenase; Validated	NA|518aa|down_0|NZ_CP045227.1_1438721_1440275_-	PRK02106, PRK02106, choline dehydrogenase; Validated	NA|271aa|down_1|NZ_CP045227.1_1440390_1441203_-	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|186aa|down_2|NZ_CP045227.1_1441302_1441860_-	COG1651, DsbG, Protein-disulfide isomerase [Posttranslational modification, protein turnover, chaperones]	NA|171aa|down_3|NZ_CP045227.1_1442052_1442565_-	NA	NA|210aa|down_4|NZ_CP045227.1_1442836_1443466_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|2024aa|down_5|NZ_CP045227.1_1443550_1449622_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|315aa|down_6|NZ_CP045227.1_1449744_1450689_-	NA	NA|97aa|down_7|NZ_CP045227.1_1450840_1451131_-	pfam13560, HTH_31, Helix-turn-helix domain	NA|294aa|down_8|NZ_CP045227.1_1451274_1452156_+	NA	NA|109aa|down_9|NZ_CP045227.1_1452243_1452570_+	NA
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	11	1701884-1701997	10	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	TTACCTACATTTCCCTTTCCAGAATTTGAG	30	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|79aa|up_8|NZ_CP045227.1_1694182_1694419_+,NA|67aa|up_7|NZ_CP045227.1_1695662_1695863_-,NA|146aa|up_4|NZ_CP045227.1_1698137_1698575_-,NA|86aa|up_3|NZ_CP045227.1_1698828_1699086_-,NA|127aa|up_1|NZ_CP045227.1_1699746_1700127_-,NA|127aa|up_0|NZ_CP045227.1_1700296_1700677_-,NA|74aa|down_1|NZ_CP045227.1_1705972_1706194_-,NA|124aa|down_2|NZ_CP045227.1_1706317_1706689_+,NA|90aa|down_3|NZ_CP045227.1_1706710_1706980_-,NA|127aa|down_9|NZ_CP045227.1_1714466_1714847_-	NA|355aa|up_9|NZ_CP045227.1_1692889_1693954_+	pfam13006, Nterm_IS4, Insertion element 4 transposase N-terminal	NA|79aa|up_8|NZ_CP045227.1_1694182_1694419_+	NA	NA|67aa|up_7|NZ_CP045227.1_1695662_1695863_-	NA	NA|422aa|up_6|NZ_CP045227.1_1695869_1697135_-	TIGR04103, heme_biosynthesis_protein, nif11-class peptide radical SAM maturase 3	NA|140aa|up_5|NZ_CP045227.1_1697548_1697968_-	TIGR03798, ocin_TIGR03798, nif11-like leader peptide domain	NA|146aa|up_4|NZ_CP045227.1_1698137_1698575_-	NA	NA|86aa|up_3|NZ_CP045227.1_1698828_1699086_-	NA	NA|109aa|up_2|NZ_CP045227.1_1699196_1699523_-	pfam06594, HCBP_related, Haemolysin-type calcium binding protein related domain	NA|127aa|up_1|NZ_CP045227.1_1699746_1700127_-	NA	NA|127aa|up_0|NZ_CP045227.1_1700296_1700677_-	NA	NA|224aa|down_0|NZ_CP045227.1_1703984_1704656_-	PRK11198, PRK11198, LysM domain/BON superfamily protein; Provisional	NA|74aa|down_1|NZ_CP045227.1_1705972_1706194_-	NA	NA|124aa|down_2|NZ_CP045227.1_1706317_1706689_+	NA	NA|90aa|down_3|NZ_CP045227.1_1706710_1706980_-	NA	NA|248aa|down_4|NZ_CP045227.1_1707057_1707801_-	pfam12120, Arr-ms, Rifampin ADP-ribosyl transferase	NA|697aa|down_5|NZ_CP045227.1_1708504_1710595_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|596aa|down_6|NZ_CP045227.1_1710634_1712422_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|174aa|down_7|NZ_CP045227.1_1712426_1712948_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|465aa|down_8|NZ_CP045227.1_1713012_1714407_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|127aa|down_9|NZ_CP045227.1_1714466_1714847_-	NA
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	12	1805087-1805203	11	CRISPRCasFinder	no		cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Orphan	TTGTATTGGGAACTCTTTATTATTGTTGACTGACAC	36	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|85aa|up_8|NZ_CP045227.1_1795461_1795716_-,NA|96aa|up_7|NZ_CP045227.1_1795692_1795980_+,NA|81aa|up_6|NZ_CP045227.1_1796122_1796365_-,NA|127aa|up_2|NZ_CP045227.1_1802297_1802678_-,NA|119aa|up_1|NZ_CP045227.1_1802777_1803134_-,NA|165aa|up_0|NZ_CP045227.1_1804088_1804583_+,NA|136aa|down_5|NZ_CP045227.1_1811216_1811624_-,NA|76aa|down_6|NZ_CP045227.1_1811620_1811848_-,NA|97aa|down_8|NZ_CP045227.1_1813079_1813370_-,NA|74aa|down_9|NZ_CP045227.1_1813882_1814104_+	NA|751aa|up_9|NZ_CP045227.1_1789709_1791962_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|85aa|up_8|NZ_CP045227.1_1795461_1795716_-	NA	NA|96aa|up_7|NZ_CP045227.1_1795692_1795980_+	NA	NA|81aa|up_6|NZ_CP045227.1_1796122_1796365_-	NA	NA|497aa|up_5|NZ_CP045227.1_1796990_1798481_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|526aa|up_4|NZ_CP045227.1_1798404_1799982_-	TIGR02980, SigBFG, RNA polymerase sigma-70 factor, sigma-B/F/G subfamily	NA|120aa|up_3|NZ_CP045227.1_1800486_1800846_-	cd05077, PTK_Jak1_rpt1, Pseudokinase (repeat 1) domain of the Protein Tyrosine Kinase, Janus kinase 1	NA|127aa|up_2|NZ_CP045227.1_1802297_1802678_-	NA	NA|119aa|up_1|NZ_CP045227.1_1802777_1803134_-	NA	NA|165aa|up_0|NZ_CP045227.1_1804088_1804583_+	NA	NA|143aa|down_0|NZ_CP045227.1_1806693_1807122_-	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion]	NA|231aa|down_1|NZ_CP045227.1_1807313_1808006_-	cd01041, Rubrerythrin, Rubrerythrin, ferritin-like diiron-binding domain	NA|468aa|down_2|NZ_CP045227.1_1808196_1809600_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|226aa|down_3|NZ_CP045227.1_1809544_1810222_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|202aa|down_4|NZ_CP045227.1_1810527_1811133_+	pfam03929, PepSY_TM, PepSY-associated TM region	NA|136aa|down_5|NZ_CP045227.1_1811216_1811624_-	NA	NA|76aa|down_6|NZ_CP045227.1_1811620_1811848_-	NA	NA|307aa|down_7|NZ_CP045227.1_1812067_1812988_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|97aa|down_8|NZ_CP045227.1_1813079_1813370_-	NA	NA|74aa|down_9|NZ_CP045227.1_1813882_1814104_+	NA
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	13	2106865-2107338	3,12,2	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,csm3gr7,csx19,csx10gr5	cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Type I-D	GTTTCAGTCCCCTTGCGGGGTAATAGGCTTTGGAAAC,GTTTCAGTCCCCTTGCGGGGTAATAGGCTTTGGAAAC,GTTTCAGTCCCCTTGCGGGGTAATAGGCTTTGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	6,6,6	6	TypeI-D	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|91aa|up_7|NZ_CP045227.1_2097805_2098078_-,NA|88aa|up_5|NZ_CP045227.1_2101567_2101831_-,NA|129aa|up_3|NZ_CP045227.1_2104194_2104581_+,NA|92aa|up_2|NZ_CP045227.1_2104577_2104853_+,NA|61aa|up_1|NZ_CP045227.1_2104925_2105108_-,NA|272aa|down_7|NZ_CP045227.1_2115120_2115936_-	NA|515aa|up_9|NZ_CP045227.1_2095287_2096832_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|255aa|up_8|NZ_CP045227.1_2096967_2097732_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|91aa|up_7|NZ_CP045227.1_2097805_2098078_-	NA	NA|526aa|up_6|NZ_CP045227.1_2099415_2100993_-	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|88aa|up_5|NZ_CP045227.1_2101567_2101831_-	NA	NA|531aa|up_4|NZ_CP045227.1_2102192_2103785_-	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|129aa|up_3|NZ_CP045227.1_2104194_2104581_+	NA	NA|92aa|up_2|NZ_CP045227.1_2104577_2104853_+	NA	NA|61aa|up_1|NZ_CP045227.1_2104925_2105108_-	NA	NA|275aa|up_0|NZ_CP045227.1_2105922_2106747_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	cas2|98aa|down_0|NZ_CP045227.1_2107558_2107852_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NZ_CP045227.1_2107877_2108855_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|200aa|down_2|NZ_CP045227.1_2108900_2109500_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|270aa|down_3|NZ_CP045227.1_2109514_2110324_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csc1gr5|242aa|down_4|NZ_CP045227.1_2110292_2111018_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|331aa|down_5|NZ_CP045227.1_2111151_2112144_-	pfam18320, Csc2, Csc2 Crispr	cas10d|991aa|down_6|NZ_CP045227.1_2112144_2115117_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	NA|272aa|down_7|NZ_CP045227.1_2115120_2115936_-	NA	NA|298aa|down_8|NZ_CP045227.1_2116129_2117023_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	cas3|437aa|down_9|NZ_CP045227.1_2117098_2118409_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain
GCF_009372195.1_ASM937219v1	NZ_CP045227	Nostoc sphaeroides CCNUC1 chromosome Gxm2, complete sequence	14	2133469-2134390	4,13,3	PILER-CR,CRISPRCasFinder,CRT	no	cas3,WYL,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	cas3,csa3,Cas9_archaeal,cas6,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	Type III-D,Type III-C,Type III-B,Type III-A	GTTTCAGTCCCCGTGAGGGGATTTGGTTAGTGGAAAC,GTTTCAGTCCCCGTGAGGGGATTTGGTTAGTGGAAAC,GTTTCAGTCCCCGTGAGGGGATTTGGTTA	37,37,29	0	0	NA	NA	NA:NA:NA	12,12,12	12	TypeIII-D,TypeIII-C,TypeIII-B,TypeIII-A	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	csx3|308aa|up_3|NZ_CP045227.1_2130049_2130973_-,NA|308aa|down_0|NZ_CP045227.1_2134643_2135567_-,NA|297aa|down_2|NZ_CP045227.1_2136110_2137001_+,NA|155aa|down_4|NZ_CP045227.1_2139278_2139743_-,NA|122aa|down_7|NZ_CP045227.1_2145356_2145722_-	csx19|188aa|up_9|NZ_CP045227.1_2122594_2123158_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|503aa|up_8|NZ_CP045227.1_2123154_2124663_-	TIGR02581, putative_CRISPR-associated_protein, CRISPR-associated RAMP protein, SSO1426 family	csx10gr5|548aa|up_7|NZ_CP045227.1_2124665_2126309_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|233aa|up_6|NZ_CP045227.1_2126305_2127004_-	pfam03787, RAMPs, RAMP superfamily	cas10|561aa|up_5|NZ_CP045227.1_2126993_2128676_-	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	csx1|431aa|up_4|NZ_CP045227.1_2128672_2129965_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	csx3|308aa|up_3|NZ_CP045227.1_2130049_2130973_-	NA	csx3|102aa|up_2|NZ_CP045227.1_2131016_2131322_-	cd09740, Csx3_III-U, CRISPR/Cas system-associated protein Csx3	WYL|457aa|up_1|NZ_CP045227.1_2131424_2132795_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|193aa|up_0|NZ_CP045227.1_2132802_2133381_+	pfam13328, HD_4, HD domain	NA|308aa|down_0|NZ_CP045227.1_2134643_2135567_-	NA	NA|73aa|down_1|NZ_CP045227.1_2135711_2135930_-	pfam13560, HTH_31, Helix-turn-helix domain	NA|297aa|down_2|NZ_CP045227.1_2136110_2137001_+	NA	NA|384aa|down_3|NZ_CP045227.1_2137876_2139028_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|155aa|down_4|NZ_CP045227.1_2139278_2139743_-	NA	NA|301aa|down_5|NZ_CP045227.1_2140142_2141045_-	cd07324, M48C_Oma1-like, Oma1 peptidase-like, integral membrane metallopeptidase	NA|151aa|down_6|NZ_CP045227.1_2144382_2144835_-	pfam06271, RDD, RDD family	NA|122aa|down_7|NZ_CP045227.1_2145356_2145722_-	NA	NA|1531aa|down_8|NZ_CP045227.1_2146294_2150887_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|316aa|down_9|NZ_CP045227.1_2151001_2151949_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)
GCF_009372195.1_ASM937219v1	NZ_CP045228	Nostoc sphaeroides CCNUC1 chromosome pGXM01, complete sequence	1	135795-135939	1	PILER-CR	no			Orphan	AATAATAATAAAGTTGGAAACTG	23	2	2	135818-135865|135889-135920	NZ_CP045227.1_1840364-1840411|NZ_CP045227.1_1840431-1840462	NA	2	2	Orphan	PD-DExK,cas14k,cas14j,Cas9_archaeal,csa3,Cas14u_CAS-V,RT,Cas14c_CAS-V-F,c2c9_V-U4,cas3,c2c10_CAS-V-U3,DinG,c2c5_V-U5,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,DEDDh,csm3gr7,csx19,csx10gr5,cas10,csx1,csx3	NA|131aa|up_8|NZ_CP045228.1_125727_126120_+,NA|229aa|up_7|NZ_CP045228.1_126176_126863_+,NA|165aa|up_3|NZ_CP045228.1_132338_132833_+,NA|95aa|down_0|NZ_CP045228.1_135959_136244_-,NA|312aa|down_1|NZ_CP045228.1_136612_137548_-,NA|178aa|down_5|NZ_CP045228.1_142178_142712_-,NA|69aa|down_7|NZ_CP045228.1_144096_144303_+,NA|104aa|down_8|NZ_CP045228.1_144662_144974_+	NA|83aa|up_9|NZ_CP045228.1_121952_122201_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|131aa|up_8|NZ_CP045228.1_125727_126120_+	NA	NA|229aa|up_7|NZ_CP045228.1_126176_126863_+	NA	NA|260aa|up_6|NZ_CP045228.1_130179_130959_+	TIGR02191, Ribonuclease_3, ribonuclease III, bacterial	NA|141aa|up_5|NZ_CP045228.1_131003_131426_+	pfam01878, EVE, EVE domain	NA|256aa|up_4|NZ_CP045228.1_131439_132207_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|165aa|up_3|NZ_CP045228.1_132338_132833_+	NA	NA|219aa|up_2|NZ_CP045228.1_133215_133872_+	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|151aa|up_1|NZ_CP045228.1_133983_134436_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|314aa|up_0|NZ_CP045228.1_134584_135526_+	pfam13737, DDE_Tnp_1_5, Transposase DDE domain	NA|95aa|down_0|NZ_CP045228.1_135959_136244_-	NA	NA|312aa|down_1|NZ_CP045228.1_136612_137548_-	NA	NA|524aa|down_2|NZ_CP045228.1_137952_139524_-	cd01187, INT_tnpB_C_Tn554, Putative Transposase B from transposon Tn554, C-terminal catalytic domain	NA|495aa|down_3|NZ_CP045228.1_139510_140995_-	cd01187, INT_tnpB_C_Tn554, Putative Transposase B from transposon Tn554, C-terminal catalytic domain	NA|360aa|down_4|NZ_CP045228.1_140991_142071_-	cd01186, INT_tnpA_C_Tn554, Putative Transposase A from transposon Tn554, C-terminal catalytic domain	NA|178aa|down_5|NZ_CP045228.1_142178_142712_-	NA	NA|311aa|down_6|NZ_CP045228.1_143019_143952_+	smart00974, T5orf172, This entry represents the putative helicase A859L	NA|69aa|down_7|NZ_CP045228.1_144096_144303_+	NA	NA|104aa|down_8|NZ_CP045228.1_144662_144974_+	NA	NA|110aa|down_9|NZ_CP045228.1_145419_145749_-	cd07503, HAD_HisB-N, histidinol phosphate phosphatase and related phosphatases
