assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	1	130009-130117	1	CRISPRCasFinder	no	cas14k	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	GATATGTGCTACTAGAAAAACTATCACAAGCATT	34	0	0	NA	NA	NA	1	1	TypeV	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|165aa|up_4|NZ_CP024785.1_125179_125674_+,NA|131aa|up_3|NZ_CP024785.1_126216_126609_-,NA|136aa|up_2|NZ_CP024785.1_126846_127254_+,NA|258aa|down_0|NZ_CP024785.1_130179_130953_+,NA|280aa|down_4|NZ_CP024785.1_133936_134776_-,NA|448aa|down_6|NZ_CP024785.1_135974_137318_+	NA|415aa|up_9|NZ_CP024785.1_117915_119160_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|177aa|up_8|NZ_CP024785.1_119248_119779_+	COG0242, Def, N-formylmethionyl-tRNA deformylase [Translation, ribosomal structure and biogenesis]	NA|564aa|up_7|NZ_CP024785.1_119874_121566_-	COG1226, Kch, Kef-type K+ transport systems, predicted NAD-binding component [Inorganic ion transport and metabolism]	NA|479aa|up_6|NZ_CP024785.1_121718_123155_-	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|368aa|up_5|NZ_CP024785.1_123716_124820_+	PRK01966, ddl, D-alanine--D-alanine ligase	NA|165aa|up_4|NZ_CP024785.1_125179_125674_+	NA	NA|131aa|up_3|NZ_CP024785.1_126216_126609_-	NA	NA|136aa|up_2|NZ_CP024785.1_126846_127254_+	NA	NA|493aa|up_1|NZ_CP024785.1_127471_128950_-	TIGR00465, Ketol-acid_reductoisomerase, ketol-acid reductoisomerase	NA|296aa|up_0|NZ_CP024785.1_129064_129952_+	cd08414, PBP2_LTTR_aromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of aromatic compounds and that of other related regulators, contains type 2 periplasmic binding fold	NA|258aa|down_0|NZ_CP024785.1_130179_130953_+	NA	NA|258aa|down_1|NZ_CP024785.1_130960_131734_-	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|136aa|down_2|NZ_CP024785.1_131899_132307_-	pfam01797, Y1_Tnp, Transposase IS200 like	cas14k|446aa|down_3|NZ_CP024785.1_132409_133747_+	pfam01385, OrfB_IS605, Probable transposase	NA|280aa|down_4|NZ_CP024785.1_133936_134776_-	NA	NA|358aa|down_5|NZ_CP024785.1_134861_135935_+	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|448aa|down_6|NZ_CP024785.1_135974_137318_+	NA	NA|122aa|down_7|NZ_CP024785.1_137341_137707_-	cd08351, ChaP_like, ChaP, an enzyme involved in the biosynthesis of the antitumor agent chartreusin (cha), and similar proteins	NA|361aa|down_8|NZ_CP024785.1_138020_139103_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|227aa|down_9|NZ_CP024785.1_139198_139879_-	COG1985, RibD, Pyrimidine reductase, riboflavin biosynthesis [Coenzyme metabolism]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	2	144684-144788	2	CRISPRCasFinder	no	cas14k	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	TATGGGGCATTGGGTCTTGAGCATTAATTATTCCT	35	0	0	NA	NA	NA	1	1	TypeV	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|280aa|up_9|NZ_CP024785.1_133936_134776_-,NA|448aa|up_7|NZ_CP024785.1_135974_137318_+,NA|150aa|down_0|NZ_CP024785.1_145196_145646_+	NA|280aa|up_9|NZ_CP024785.1_133936_134776_-	NA	NA|358aa|up_8|NZ_CP024785.1_134861_135935_+	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|448aa|up_7|NZ_CP024785.1_135974_137318_+	NA	NA|122aa|up_6|NZ_CP024785.1_137341_137707_-	cd08351, ChaP_like, ChaP, an enzyme involved in the biosynthesis of the antitumor agent chartreusin (cha), and similar proteins	NA|361aa|up_5|NZ_CP024785.1_138020_139103_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|227aa|up_4|NZ_CP024785.1_139198_139879_-	COG1985, RibD, Pyrimidine reductase, riboflavin biosynthesis [Coenzyme metabolism]	NA|160aa|up_3|NZ_CP024785.1_140240_140720_-	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|166aa|up_2|NZ_CP024785.1_141358_141856_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|214aa|up_1|NZ_CP024785.1_141977_142619_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|250aa|up_0|NZ_CP024785.1_143902_144652_-	cd08934, CAD_SDR_c, clavulanic acid dehydrogenase (CAD), classical (c) SDR	NA|150aa|down_0|NZ_CP024785.1_145196_145646_+	NA	NA|316aa|down_1|NZ_CP024785.1_145888_146836_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|252aa|down_2|NZ_CP024785.1_146915_147671_-	smart00550, Zalpha, Z-DNA-binding domain in adenosine deaminases	NA|247aa|down_3|NZ_CP024785.1_147704_148445_-	pfam12836, HHH_3, Helix-hairpin-helix motif	NA|455aa|down_4|NZ_CP024785.1_148444_149809_-	pfam13614, AAA_31, AAA domain	NA|124aa|down_5|NZ_CP024785.1_150068_150440_+	pfam12773, DZR, Double zinc ribbon	NA|649aa|down_6|NZ_CP024785.1_150580_152527_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|426aa|down_7|NZ_CP024785.1_152801_154079_-	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|316aa|down_8|NZ_CP024785.1_154564_155512_+	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|206aa|down_9|NZ_CP024785.1_157283_157901_+	COG4447, COG4447, Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	3	198575-198699	3	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	ATGCAATGTTAATGCGTCGCCCTGCAATGCCATTGCATCCC	41	1	2	198616-198658|198616-198658	NZ_CP024785.1_198658-198700|NZ_CP024785.1_198700-198742	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|236aa|up_7|NZ_CP024785.1_189904_190612_+,NA|90aa|up_5|NZ_CP024785.1_191599_191869_-,NA|144aa|down_4|NZ_CP024785.1_204315_204747_-	NA|516aa|up_9|NZ_CP024785.1_187586_189134_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|235aa|up_8|NZ_CP024785.1_189157_189862_+	pfam09843, DUF2070, Predicted membrane protein (DUF2070)	NA|236aa|up_7|NZ_CP024785.1_189904_190612_+	NA	NA|183aa|up_6|NZ_CP024785.1_190789_191338_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|90aa|up_5|NZ_CP024785.1_191599_191869_-	NA	NA|129aa|up_4|NZ_CP024785.1_191883_192270_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|135aa|up_3|NZ_CP024785.1_193244_193649_+	smart00930, NIL, This domain is found at the C-terminus of ABC transporter proteins involved in D-methionine transport as well as a number of ferredoxin-like proteins	NA|356aa|up_2|NZ_CP024785.1_193686_194754_-	pfam12275, DUF3616, Protein of unknown function (DUF3616)	NA|328aa|up_1|NZ_CP024785.1_195031_196015_+	COG5607, COG5607, Uncharacterized conserved protein [Function unknown]	NA|529aa|up_0|NZ_CP024785.1_196966_198553_+	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|348aa|down_0|NZ_CP024785.1_198894_199938_+	pfam11199, DUF2891, Protein of unknown function (DUF2891)	NA|348aa|down_1|NZ_CP024785.1_200387_201431_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|477aa|down_2|NZ_CP024785.1_201659_203090_+	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]	NA|335aa|down_3|NZ_CP024785.1_203238_204243_+	PRK12309, PRK12309, transaldolase	NA|144aa|down_4|NZ_CP024785.1_204315_204747_-	NA	NA|498aa|down_5|NZ_CP024785.1_205004_206498_-	cd08154, catalase_clade_1, Clade 1 of the heme-binding enzyme catalase	NA|154aa|down_6|NZ_CP024785.1_207201_207663_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|252aa|down_7|NZ_CP024785.1_208139_208895_+	cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function	NA|234aa|down_8|NZ_CP024785.1_208879_209581_-	pfam04784, DUF547, Protein of unknown function, DUF547	NA|160aa|down_9|NZ_CP024785.1_210224_210704_+	pfam14325, DUF4383, Domain of unknown function (DUF4383)
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	4	350480-350554	4	CRISPRCasFinder	no	cas14j	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	GCGCTCGTTAACCAATGTCAACCG	24	1	1	350504-350530	NZ_CP024785.1_408223-408197	NA	1	1	TypeV	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|113aa|up_8|NZ_CP024785.1_337528_337867_-,NA|82aa|up_0|NZ_CP024785.1_349196_349442_+,NA|263aa|down_0|NZ_CP024785.1_351939_352728_+,NA|147aa|down_7|NZ_CP024785.1_361438_361879_+,NA|52aa|down_9|NZ_CP024785.1_363832_363988_-	NA|394aa|up_9|NZ_CP024785.1_336043_337225_-	COG3180, AbrB, Putative ammonia monooxygenase [General function prediction only]	NA|113aa|up_8|NZ_CP024785.1_337528_337867_-	NA	NA|313aa|up_7|NZ_CP024785.1_338308_339247_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|130aa|up_6|NZ_CP024785.1_339540_339930_-	TIGR02058, lin0512_fam, conserved hypothetical protein	NA|123aa|up_5|NZ_CP024785.1_340022_340391_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|961aa|up_4|NZ_CP024785.1_340749_343632_+	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|482aa|up_3|NZ_CP024785.1_344231_345677_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|411aa|up_2|NZ_CP024785.1_346438_347672_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	cas14j|407aa|up_1|NZ_CP024785.1_347838_349059_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|82aa|up_0|NZ_CP024785.1_349196_349442_+	NA	NA|263aa|down_0|NZ_CP024785.1_351939_352728_+	NA	NA|149aa|down_1|NZ_CP024785.1_353346_353793_+	COG3585, MopI, Molybdopterin-binding protein [Coenzyme metabolism]	NA|62aa|down_2|NZ_CP024785.1_355123_355309_+	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|622aa|down_3|NZ_CP024785.1_355400_357266_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|391aa|down_4|NZ_CP024785.1_357353_358526_-	PRK07411, PRK07411, molybdopterin-synthase adenylyltransferase MoeB	NA|153aa|down_5|NZ_CP024785.1_358604_359063_-	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	NA|286aa|down_6|NZ_CP024785.1_360283_361141_+	COG1801, COG1801, Uncharacterized conserved protein [Function unknown]	NA|147aa|down_7|NZ_CP024785.1_361438_361879_+	NA	NA|416aa|down_8|NZ_CP024785.1_362370_363618_+	PRK07424, PRK07424, bifunctional sterol desaturase/short chain dehydrogenase; Validated	NA|52aa|down_9|NZ_CP024785.1_363832_363988_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	5	381296-381474	1,5	PILER-CR,CRISPRCasFinder	no	2OG_CAS,cas6	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	GTTTCAATCCCTAATAGGGATTTTAGTTGATTGCAATGCT,GTTTCAATCCCTAATAGGGATTTTAGTTGATTGCAAT	40,37	0	0	NA	NA	I-D,II-B:I-D,II-B	2,2	2	Unclear	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|147aa|up_9|NZ_CP024785.1_361438_361879_+,NA|52aa|up_7|NZ_CP024785.1_363832_363988_-,NA|64aa|up_3|NZ_CP024785.1_373067_373259_+,NA|146aa|up_2|NZ_CP024785.1_373903_374341_+,NA|125aa|down_2|NZ_CP024785.1_384837_385212_+,NA|137aa|down_9|NZ_CP024785.1_391411_391822_-	NA|147aa|up_9|NZ_CP024785.1_361438_361879_+	NA	NA|416aa|up_8|NZ_CP024785.1_362370_363618_+	PRK07424, PRK07424, bifunctional sterol desaturase/short chain dehydrogenase; Validated	NA|52aa|up_7|NZ_CP024785.1_363832_363988_-	NA	NA|811aa|up_6|NZ_CP024785.1_366480_368913_+	cd11304, Cadherin_repeat, Cadherin tandem repeat domain	NA|426aa|up_5|NZ_CP024785.1_369653_370931_-	PRK05476, PRK05476, S-adenosyl-L-homocysteine hydrolase; Provisional	NA|552aa|up_4|NZ_CP024785.1_371321_372977_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|64aa|up_3|NZ_CP024785.1_373067_373259_+	NA	NA|146aa|up_2|NZ_CP024785.1_373903_374341_+	NA	NA|542aa|up_1|NZ_CP024785.1_374503_376129_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|305aa|up_0|NZ_CP024785.1_376507_377422_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|256aa|down_0|NZ_CP024785.1_381840_382608_-	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|425aa|down_1|NZ_CP024785.1_382828_384103_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|125aa|down_2|NZ_CP024785.1_384837_385212_+	NA	NA|377aa|down_3|NZ_CP024785.1_385486_386617_+	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|361aa|down_4|NZ_CP024785.1_387001_388084_-	pfam17914, HopA1, HopA1 effector protein family	NA|420aa|down_5|NZ_CP024785.1_388165_389425_-	pfam01636, APH, Phosphotransferase enzyme family	NA|98aa|down_6|NZ_CP024785.1_390176_390470_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|87aa|down_7|NZ_CP024785.1_390473_390734_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|101aa|down_8|NZ_CP024785.1_390738_391041_-	pfam13280, WYL, WYL domain	NA|137aa|down_9|NZ_CP024785.1_391411_391822_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	6	385245-385380	6	CRISPRCasFinder	no	2OG_CAS,cas6	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	TGCCCGGTTAAGCCGGGTTTTTTCATGCTGTTTGTGATTT	40	0	0	NA	NA	NA	1	1	Unclear	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|64aa|up_6|NZ_CP024785.1_373067_373259_+,NA|146aa|up_5|NZ_CP024785.1_373903_374341_+,NA|125aa|up_0|NZ_CP024785.1_384837_385212_+,NA|137aa|down_6|NZ_CP024785.1_391411_391822_-	NA|811aa|up_9|NZ_CP024785.1_366480_368913_+	cd11304, Cadherin_repeat, Cadherin tandem repeat domain	NA|426aa|up_8|NZ_CP024785.1_369653_370931_-	PRK05476, PRK05476, S-adenosyl-L-homocysteine hydrolase; Provisional	NA|552aa|up_7|NZ_CP024785.1_371321_372977_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|64aa|up_6|NZ_CP024785.1_373067_373259_+	NA	NA|146aa|up_5|NZ_CP024785.1_373903_374341_+	NA	NA|542aa|up_4|NZ_CP024785.1_374503_376129_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|305aa|up_3|NZ_CP024785.1_376507_377422_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|256aa|up_2|NZ_CP024785.1_381840_382608_-	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|425aa|up_1|NZ_CP024785.1_382828_384103_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|125aa|up_0|NZ_CP024785.1_384837_385212_+	NA	NA|377aa|down_0|NZ_CP024785.1_385486_386617_+	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|361aa|down_1|NZ_CP024785.1_387001_388084_-	pfam17914, HopA1, HopA1 effector protein family	NA|420aa|down_2|NZ_CP024785.1_388165_389425_-	pfam01636, APH, Phosphotransferase enzyme family	NA|98aa|down_3|NZ_CP024785.1_390176_390470_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|87aa|down_4|NZ_CP024785.1_390473_390734_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|101aa|down_5|NZ_CP024785.1_390738_391041_-	pfam13280, WYL, WYL domain	NA|137aa|down_6|NZ_CP024785.1_391411_391822_-	NA	2OG_CAS|207aa|down_7|NZ_CP024785.1_391931_392552_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas6|286aa|down_8|NZ_CP024785.1_392541_393399_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|261aa|down_9|NZ_CP024785.1_394571_395354_-	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	7	394300-394544	2,1	PILER-CR,CRT	no	2OG_CAS,cas6	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	TTGCAATTTATCAAAATCCCTATTAGGGATT,TTGCAATTTATCAAAATCCCTATTAGGGATT	31,31	0	0	NA	NA	I-D,II-B:I-D,II-B	2,3	3	Unclear	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|125aa|up_9|NZ_CP024785.1_384837_385212_+,NA|137aa|up_2|NZ_CP024785.1_391411_391822_-,NA|62aa|down_2|NZ_CP024785.1_396939_397125_+,NA|394aa|down_6|NZ_CP024785.1_401370_402552_+	NA|125aa|up_9|NZ_CP024785.1_384837_385212_+	NA	NA|377aa|up_8|NZ_CP024785.1_385486_386617_+	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|361aa|up_7|NZ_CP024785.1_387001_388084_-	pfam17914, HopA1, HopA1 effector protein family	NA|420aa|up_6|NZ_CP024785.1_388165_389425_-	pfam01636, APH, Phosphotransferase enzyme family	NA|98aa|up_5|NZ_CP024785.1_390176_390470_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|87aa|up_4|NZ_CP024785.1_390473_390734_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|101aa|up_3|NZ_CP024785.1_390738_391041_-	pfam13280, WYL, WYL domain	NA|137aa|up_2|NZ_CP024785.1_391411_391822_-	NA	2OG_CAS|207aa|up_1|NZ_CP024785.1_391931_392552_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas6|286aa|up_0|NZ_CP024785.1_392541_393399_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|261aa|down_0|NZ_CP024785.1_394571_395354_-	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|357aa|down_1|NZ_CP024785.1_395813_396884_+	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|62aa|down_2|NZ_CP024785.1_396939_397125_+	NA	NA|341aa|down_3|NZ_CP024785.1_397361_398384_+	cd08235, iditol_2_DH_like, L-iditol 2-dehydrogenase	NA|93aa|down_4|NZ_CP024785.1_398418_398697_+	pfam11746, DUF3303, Protein of unknown function (DUF3303)	NA|540aa|down_5|NZ_CP024785.1_399167_400787_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|394aa|down_6|NZ_CP024785.1_401370_402552_+	NA	NA|447aa|down_7|NZ_CP024785.1_402560_403901_-	COG1961, PinR, Site-specific recombinases, DNA invertase Pin homologs [DNA replication, recombination, and repair]	NA|120aa|down_8|NZ_CP024785.1_404022_404382_-	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|202aa|down_9|NZ_CP024785.1_405518_406124_+	COG3448, COG3448, CBS-domain-containing membrane protein [Signal transduction mechanisms]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	8	423822-423914	7	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CCTGCTCAACCAAGCTTGTACCATTGC	27	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|57aa|up_9|NZ_CP024785.1_406737_406908_-,NA|326aa|up_2|NZ_CP024785.1_418397_419375_-,NA|265aa|up_0|NZ_CP024785.1_421613_422408_-,NA|78aa|down_0|NZ_CP024785.1_424645_424879_+,NA|94aa|down_1|NZ_CP024785.1_424871_425153_+,NA|88aa|down_5|NZ_CP024785.1_428406_428670_+,NA|98aa|down_7|NZ_CP024785.1_429668_429962_+	NA|57aa|up_9|NZ_CP024785.1_406737_406908_-	NA	NA|208aa|up_8|NZ_CP024785.1_407477_408101_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|635aa|up_7|NZ_CP024785.1_409311_411216_+	pfam00211, Guanylate_cyc, Adenylate and Guanylate cyclase catalytic domain	NA|391aa|up_6|NZ_CP024785.1_411359_412532_-	PRK00770, PRK00770, deoxyhypusine synthase	NA|295aa|up_5|NZ_CP024785.1_412783_413668_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|1129aa|up_4|NZ_CP024785.1_413891_417278_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|33aa|up_3|NZ_CP024785.1_418107_418206_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|326aa|up_2|NZ_CP024785.1_418397_419375_-	NA	NA|407aa|up_1|NZ_CP024785.1_419553_420774_-	cd17330, MFS_SLC46_TetA_like, Eukaryotic Solute carrier 46 (SLC46) family, Bacterial Tetracycline resistance proteins, and similar proteins of the Major Facilitator Superfamily of transporters	NA|265aa|up_0|NZ_CP024785.1_421613_422408_-	NA	NA|78aa|down_0|NZ_CP024785.1_424645_424879_+	NA	NA|94aa|down_1|NZ_CP024785.1_424871_425153_+	NA	NA|406aa|down_2|NZ_CP024785.1_425376_426594_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|211aa|down_3|NZ_CP024785.1_426863_427496_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|279aa|down_4|NZ_CP024785.1_427492_428329_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|88aa|down_5|NZ_CP024785.1_428406_428670_+	NA	NA|304aa|down_6|NZ_CP024785.1_428700_429612_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|98aa|down_7|NZ_CP024785.1_429668_429962_+	NA	NA|285aa|down_8|NZ_CP024785.1_429984_430839_+	COG1587, HemD, Uroporphyrinogen-III synthase [Coenzyme metabolism]	NA|100aa|down_9|NZ_CP024785.1_430942_431242_+	TIGR02008, Ferredoxin_root_R-B1, ferredoxin [2Fe-2S]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	9	702959-703143	3	PILER-CR	no	cas8b3,cas5	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	GAACTGGAAGGATGTGATTCGTGGCTTTATGCCGTTAGGCGTTGCTCAAAT	51	0	0	NA	NA	NA	2	2	Unclear	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|130aa|up_9|NZ_CP024785.1_688905_689295_-,NA|65aa|down_0|NZ_CP024785.1_703223_703418_-,NA|101aa|down_1|NZ_CP024785.1_703494_703797_-,NA|613aa|down_3|NZ_CP024785.1_706110_707949_+,NA|636aa|down_4|NZ_CP024785.1_707963_709871_+,NA|226aa|down_5|NZ_CP024785.1_709867_710545_+,NA|51aa|down_9|NZ_CP024785.1_718443_718596_+	NA|130aa|up_9|NZ_CP024785.1_688905_689295_-	NA	NA|359aa|up_8|NZ_CP024785.1_691058_692135_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|217aa|up_7|NZ_CP024785.1_692327_692978_+	pfam03808, Glyco_tran_WecB, Glycosyl transferase WecB/TagA/CpsF family	NA|381aa|up_6|NZ_CP024785.1_694079_695222_+	cd19963, PBP1_BMP-like, periplasmic binding component of a basic membrane lipoprotein (BMP) from Brucella abortus and its close homologs in other bacteria	NA|248aa|up_5|NZ_CP024785.1_695312_696056_-	PRK08057, PRK08057, cobalt-precorrin-6x reductase; Reviewed	NA|139aa|up_4|NZ_CP024785.1_696058_696475_-	COG5469, COG5469, Predicted metal-binding protein [Function unknown]	NA|329aa|up_3|NZ_CP024785.1_696997_697984_-	pfam06527, TniQ, TniQ	cas8b3|526aa|up_2|NZ_CP024785.1_698404_699982_+	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	NA|306aa|up_1|NZ_CP024785.1_700902_701820_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	cas5|213aa|up_0|NZ_CP024785.1_702072_702711_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	NA|65aa|down_0|NZ_CP024785.1_703223_703418_-	NA	NA|101aa|down_1|NZ_CP024785.1_703494_703797_-	NA	NA|98aa|down_2|NZ_CP024785.1_703928_704222_-	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|613aa|down_3|NZ_CP024785.1_706110_707949_+	NA	NA|636aa|down_4|NZ_CP024785.1_707963_709871_+	NA	NA|226aa|down_5|NZ_CP024785.1_709867_710545_+	NA	NA|463aa|down_6|NZ_CP024785.1_710547_711936_+	COG1205, COG1205, Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster [General function prediction only]	NA|1400aa|down_7|NZ_CP024785.1_712056_716256_+	cd18796, SF2_C_LHR, C-terminal helicase domain of LHR family helicases	NA|692aa|down_8|NZ_CP024785.1_716255_718331_+	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|51aa|down_9|NZ_CP024785.1_718443_718596_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	10	741079-741161	8	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTACGAAGACCAGGAAATTTCTAATTATT	29	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|72aa|up_1|NZ_CP024785.1_738494_738710_-,NA|56aa|up_0|NZ_CP024785.1_739599_739767_+,NA|92aa|down_5|NZ_CP024785.1_748634_748910_+	NA|487aa|up_9|NZ_CP024785.1_729072_730533_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|153aa|up_8|NZ_CP024785.1_730649_731108_-	pfam13413, HTH_25, Helix-turn-helix domain	NA|291aa|up_7|NZ_CP024785.1_731336_732209_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|209aa|up_6|NZ_CP024785.1_732376_733003_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|229aa|up_5|NZ_CP024785.1_733059_733746_-	PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family protein	NA|258aa|up_4|NZ_CP024785.1_733942_734716_+	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|688aa|up_3|NZ_CP024785.1_734788_736852_+	pfam02254, TrkA_N, TrkA-N domain	NA|308aa|up_2|NZ_CP024785.1_737121_738045_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|72aa|up_1|NZ_CP024785.1_738494_738710_-	NA	NA|56aa|up_0|NZ_CP024785.1_739599_739767_+	NA	NA|505aa|down_0|NZ_CP024785.1_742558_744073_+	cd17534, REC_DC-like, phosphoacceptor receiver (REC) domain of modulated diguanylate cyclase and similar domains	NA|388aa|down_1|NZ_CP024785.1_744221_745385_+	cd02152, OAT, Ornithine acetyltransferase (OAT) family; also referred to as ArgJ	NA|367aa|down_2|NZ_CP024785.1_745682_746783_-	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|295aa|down_3|NZ_CP024785.1_746791_747676_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|304aa|down_4|NZ_CP024785.1_747679_748591_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|92aa|down_5|NZ_CP024785.1_748634_748910_+	NA	NA|448aa|down_6|NZ_CP024785.1_748899_750243_-	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|401aa|down_7|NZ_CP024785.1_751275_752478_+	COG0465, HflB, ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones]	NA|320aa|down_8|NZ_CP024785.1_752482_753442_-	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|660aa|down_9|NZ_CP024785.1_753582_755562_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	11	796840-796914	9	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GCGCTCGTTAACCAATGTCAACCG	24	1	1	796864-796890	NZ_CP024785.1_408223-408197	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|68aa|up_9|NZ_CP024785.1_785245_785449_+,NA|67aa|up_8|NZ_CP024785.1_787222_787423_+,NA|119aa|up_6|NZ_CP024785.1_788027_788384_-,NA|73aa|up_5|NZ_CP024785.1_789431_789650_+,NA|46aa|up_1|NZ_CP024785.1_794726_794864_-,NA|355aa|down_0|NZ_CP024785.1_797317_798382_+,NA|198aa|down_1|NZ_CP024785.1_798579_799173_-,NA|47aa|down_3|NZ_CP024785.1_801586_801727_+,NA|57aa|down_4|NZ_CP024785.1_801868_802039_+,NA|354aa|down_5|NZ_CP024785.1_804295_805357_-,NA|57aa|down_7|NZ_CP024785.1_806895_807066_-	NA|68aa|up_9|NZ_CP024785.1_785245_785449_+	NA	NA|67aa|up_8|NZ_CP024785.1_787222_787423_+	NA	NA|136aa|up_7|NZ_CP024785.1_787451_787859_-	pfam01844, HNH, HNH endonuclease	NA|119aa|up_6|NZ_CP024785.1_788027_788384_-	NA	NA|73aa|up_5|NZ_CP024785.1_789431_789650_+	NA	NA|146aa|up_4|NZ_CP024785.1_790334_790772_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|170aa|up_3|NZ_CP024785.1_790756_791266_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|117aa|up_2|NZ_CP024785.1_791228_791579_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|46aa|up_1|NZ_CP024785.1_794726_794864_-	NA	NA|439aa|up_0|NZ_CP024785.1_795037_796354_+	pfam03222, Trp_Tyr_perm, Tryptophan/tyrosine permease family	NA|355aa|down_0|NZ_CP024785.1_797317_798382_+	NA	NA|198aa|down_1|NZ_CP024785.1_798579_799173_-	NA	NA|345aa|down_2|NZ_CP024785.1_800319_801354_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|47aa|down_3|NZ_CP024785.1_801586_801727_+	NA	NA|57aa|down_4|NZ_CP024785.1_801868_802039_+	NA	NA|354aa|down_5|NZ_CP024785.1_804295_805357_-	NA	NA|78aa|down_6|NZ_CP024785.1_805658_805892_-	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|57aa|down_7|NZ_CP024785.1_806895_807066_-	NA	NA|307aa|down_8|NZ_CP024785.1_809875_810796_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|308aa|down_9|NZ_CP024785.1_810837_811761_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	12	1000265-1000378	10	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTTGTCAACCATTATTACTTGACCGAC	27	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|48aa|up_9|NZ_CP024785.1_979045_979189_-,NA|69aa|up_7|NZ_CP024785.1_979477_979684_-,NA|138aa|up_0|NZ_CP024785.1_999612_1000026_-,NA|97aa|down_6|NZ_CP024785.1_1008549_1008840_-,NA|105aa|down_7|NZ_CP024785.1_1010422_1010737_+,NA|59aa|down_8|NZ_CP024785.1_1011175_1011352_-	NA|48aa|up_9|NZ_CP024785.1_979045_979189_-	NA	NA|55aa|up_8|NZ_CP024785.1_979269_979434_-	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	NA|69aa|up_7|NZ_CP024785.1_979477_979684_-	NA	NA|207aa|up_6|NZ_CP024785.1_982105_982726_-	TIGR01901, Heme/hemopexin-binding_protein, filamentous hemagglutinin family N-terminal domain	NA|910aa|up_5|NZ_CP024785.1_986163_988893_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|254aa|up_4|NZ_CP024785.1_988870_989632_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|985aa|up_3|NZ_CP024785.1_989721_992676_-	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1192aa|up_2|NZ_CP024785.1_992733_996309_-	TIGR01901, Heme/hemopexin-binding_protein, filamentous hemagglutinin family N-terminal domain	NA|639aa|up_1|NZ_CP024785.1_996360_998277_-	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]	NA|138aa|up_0|NZ_CP024785.1_999612_1000026_-	NA	NA|64aa|down_0|NZ_CP024785.1_1001573_1001765_+	pfam08042, PqqA, PqqA family	NA|302aa|down_1|NZ_CP024785.1_1001819_1002725_+	PRK05184, PRK05184, pyrroloquinoline quinone biosynthesis protein PqqB; Provisional	NA|243aa|down_2|NZ_CP024785.1_1002745_1003474_+	PRK05157, PRK05157, pyrroloquinoline quinone biosynthesis protein PqqC; Provisional	NA|114aa|down_3|NZ_CP024785.1_1003550_1003892_+	TIGR03859, PQQ_PqqD, coenzyme PQQ biosynthesis protein PqqD	NA|366aa|down_4|NZ_CP024785.1_1003907_1005005_+	PRK05301, PRK05301, pyrroloquinoline quinone biosynthesis protein PqqE; Provisional	NA|46aa|down_5|NZ_CP024785.1_1008299_1008437_+	TIGR02997, RNA_polymerase_sigma_subunit_sigma70/sigma32, RNA polymerase sigma factor, cyanobacterial RpoD-like family	NA|97aa|down_6|NZ_CP024785.1_1008549_1008840_-	NA	NA|105aa|down_7|NZ_CP024785.1_1010422_1010737_+	NA	NA|59aa|down_8|NZ_CP024785.1_1011175_1011352_-	NA	NA|129aa|down_9|NZ_CP024785.1_1011416_1011803_-	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	13	1407242-1407351	11	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GTACGGGGAATGATACTCTCATTGGTGAT	29	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|105aa|up_9|NZ_CP024785.1_1390219_1390534_+,NA|80aa|up_8|NZ_CP024785.1_1391114_1391354_+,NA|112aa|up_6|NZ_CP024785.1_1397257_1397593_-,NA|144aa|up_5|NZ_CP024785.1_1399624_1400056_-,NA|132aa|up_0|NZ_CP024785.1_1406050_1406446_-,NA	NA|105aa|up_9|NZ_CP024785.1_1390219_1390534_+	NA	NA|80aa|up_8|NZ_CP024785.1_1391114_1391354_+	NA	NA|534aa|up_7|NZ_CP024785.1_1391347_1392949_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|112aa|up_6|NZ_CP024785.1_1397257_1397593_-	NA	NA|144aa|up_5|NZ_CP024785.1_1399624_1400056_-	NA	NA|94aa|up_4|NZ_CP024785.1_1400591_1400873_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|451aa|up_3|NZ_CP024785.1_1400920_1402273_+	cd16434, CheB-CheR_fusion, Chemotaxis response regulator protein-glutamate methylesterase, CheB, fused with CheR domain	NA|488aa|up_2|NZ_CP024785.1_1402368_1403832_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|639aa|up_1|NZ_CP024785.1_1404167_1406084_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|132aa|up_0|NZ_CP024785.1_1406050_1406446_-	NA	NA|148aa|down_0|NZ_CP024785.1_1408984_1409428_-	COG2153, ElaA, Predicted acyltransferase [General function prediction only]	NA|1004aa|down_1|NZ_CP024785.1_1409483_1412495_-	cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate dehydrogenase, RocA	NA|217aa|down_2|NZ_CP024785.1_1412797_1413448_+	COG4339, COG4339, Uncharacterized protein conserved in bacteria [Function unknown]	NA|243aa|down_3|NZ_CP024785.1_1413557_1414286_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|282aa|down_4|NZ_CP024785.1_1414456_1415302_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|88aa|down_5|NZ_CP024785.1_1415276_1415540_-	pfam01383, CpcD, CpcD/allophycocyanin linker domain	NA|290aa|down_6|NZ_CP024785.1_1415622_1416492_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|271aa|down_7|NZ_CP024785.1_1416607_1417420_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|163aa|down_8|NZ_CP024785.1_1417828_1418317_-	cd14770, PC-PEC_alpha, Alpha subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|174aa|down_9|NZ_CP024785.1_1418406_1418928_-	cd14768, PC_PEC_beta, Beta subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	14	1628387-1628473	12	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GACGGGTTACGCCTACGCTGTTTT	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|119aa|up_9|NZ_CP024785.1_1617273_1617630_+,NA|59aa|up_6|NZ_CP024785.1_1620569_1620746_+,NA|64aa|up_5|NZ_CP024785.1_1620742_1620934_+,NA|293aa|up_3|NZ_CP024785.1_1622671_1623550_-,NA|168aa|up_2|NZ_CP024785.1_1624077_1624581_-,NA|73aa|down_1|NZ_CP024785.1_1628994_1629213_-,NA|95aa|down_2|NZ_CP024785.1_1629320_1629605_-,NA|165aa|down_3|NZ_CP024785.1_1631003_1631498_+,NA|130aa|down_5|NZ_CP024785.1_1634616_1635006_+,NA|209aa|down_9|NZ_CP024785.1_1637416_1638043_-	NA|119aa|up_9|NZ_CP024785.1_1617273_1617630_+	NA	NA|359aa|up_8|NZ_CP024785.1_1617679_1618756_+	TIGR04070, photo_TT_lyase, spore photoproduct lyase	NA|494aa|up_7|NZ_CP024785.1_1619082_1620564_+	cd06160, S2P-M50_like_2, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|59aa|up_6|NZ_CP024785.1_1620569_1620746_+	NA	NA|64aa|up_5|NZ_CP024785.1_1620742_1620934_+	NA	NA|420aa|up_4|NZ_CP024785.1_1621173_1622433_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|293aa|up_3|NZ_CP024785.1_1622671_1623550_-	NA	NA|168aa|up_2|NZ_CP024785.1_1624077_1624581_-	NA	NA|222aa|up_1|NZ_CP024785.1_1625387_1626053_+	COG1926, COG1926, Predicted phosphoribosyltransferases [General function prediction only]	NA|347aa|up_0|NZ_CP024785.1_1626692_1627733_-	cd11593, Agmatinase-like_2, Agmatinase and related proteins	NA|139aa|down_0|NZ_CP024785.1_1628585_1629002_-	cd00303, retropepsin_like, Retropepsins; pepsin-like aspartate proteases	NA|73aa|down_1|NZ_CP024785.1_1628994_1629213_-	NA	NA|95aa|down_2|NZ_CP024785.1_1629320_1629605_-	NA	NA|165aa|down_3|NZ_CP024785.1_1631003_1631498_+	NA	NA|297aa|down_4|NZ_CP024785.1_1633119_1634010_-	cd01846, fatty_acyltransferase_like, Fatty acyltransferase-like subfamily of the SGNH hydrolases, a diverse family of lipases and esterases	NA|130aa|down_5|NZ_CP024785.1_1634616_1635006_+	NA	NA|316aa|down_6|NZ_CP024785.1_1635256_1636204_+	sd00006, TPR, Tetratricopeptide repeat	NA|194aa|down_7|NZ_CP024785.1_1636323_1636905_-	COG3019, COG3019, Predicted metal-binding protein [General function prediction only]	NA|121aa|down_8|NZ_CP024785.1_1637067_1637430_-	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|209aa|down_9|NZ_CP024785.1_1637416_1638043_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	15	1713596-1713680	13	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CGACACGCTACGCGTAGCTTGCTTC	25	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|88aa|up_7|NZ_CP024785.1_1709949_1710213_+,NA|118aa|up_6|NZ_CP024785.1_1710632_1710986_-,NA|94aa|up_5|NZ_CP024785.1_1711286_1711568_-,NA|100aa|up_3|NZ_CP024785.1_1711967_1712267_-,NA|64aa|down_7|NZ_CP024785.1_1723853_1724045_-,NA|47aa|down_8|NZ_CP024785.1_1724037_1724178_-,NA|203aa|down_9|NZ_CP024785.1_1724840_1725449_-	NA|441aa|up_9|NZ_CP024785.1_1706666_1707989_-	PRK07369, PRK07369, dihydroorotase; Provisional	NA|451aa|up_8|NZ_CP024785.1_1708163_1709516_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|88aa|up_7|NZ_CP024785.1_1709949_1710213_+	NA	NA|118aa|up_6|NZ_CP024785.1_1710632_1710986_-	NA	NA|94aa|up_5|NZ_CP024785.1_1711286_1711568_-	NA	NA|145aa|up_4|NZ_CP024785.1_1711533_1711968_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|100aa|up_3|NZ_CP024785.1_1711967_1712267_-	NA	NA|93aa|up_2|NZ_CP024785.1_1712310_1712589_-	COG3609, COG3609, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain [Transcription]	NA|83aa|up_1|NZ_CP024785.1_1712858_1713107_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|129aa|up_0|NZ_CP024785.1_1713109_1713496_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|658aa|down_0|NZ_CP024785.1_1713746_1715720_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|264aa|down_1|NZ_CP024785.1_1715723_1716515_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|99aa|down_2|NZ_CP024785.1_1716590_1716887_-	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	NA|288aa|down_3|NZ_CP024785.1_1716998_1717862_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|856aa|down_4|NZ_CP024785.1_1718016_1720584_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|380aa|down_5|NZ_CP024785.1_1720913_1722053_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|600aa|down_6|NZ_CP024785.1_1722057_1723857_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|64aa|down_7|NZ_CP024785.1_1723853_1724045_-	NA	NA|47aa|down_8|NZ_CP024785.1_1724037_1724178_-	NA	NA|203aa|down_9|NZ_CP024785.1_1724840_1725449_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	16	1765182-1765276	14	CRISPRCasFinder	no	c2c9_V-U4	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Type V-U4	TAGCGATCGCCTAAAGACTACCA	23	0	0	NA	NA	NA	1	1	TypeV-U4	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|155aa|up_8|NZ_CP024785.1_1748908_1749373_-,NA|57aa|up_5|NZ_CP024785.1_1755972_1756143_-,NA|93aa|up_4|NZ_CP024785.1_1756197_1756476_+,NA|798aa|up_3|NZ_CP024785.1_1759643_1762037_+,NA|64aa|up_2|NZ_CP024785.1_1762137_1762329_-,NA|125aa|down_1|NZ_CP024785.1_1766853_1767228_-,NA|77aa|down_2|NZ_CP024785.1_1767300_1767531_-,NA|170aa|down_4|NZ_CP024785.1_1768150_1768660_+,NA|128aa|down_5|NZ_CP024785.1_1768695_1769079_-,NA|76aa|down_6|NZ_CP024785.1_1769075_1769303_-,NA|147aa|down_7|NZ_CP024785.1_1769364_1769805_-,NA|57aa|down_8|NZ_CP024785.1_1769794_1769965_-,NA|580aa|down_9|NZ_CP024785.1_1769964_1771704_-	NA|577aa|up_9|NZ_CP024785.1_1746688_1748419_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|155aa|up_8|NZ_CP024785.1_1748908_1749373_-	NA	NA|437aa|up_7|NZ_CP024785.1_1753326_1754637_+	PRK07591, PRK07591, threonine synthase; Validated	NA|92aa|up_6|NZ_CP024785.1_1754732_1755008_+	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|57aa|up_5|NZ_CP024785.1_1755972_1756143_-	NA	NA|93aa|up_4|NZ_CP024785.1_1756197_1756476_+	NA	NA|798aa|up_3|NZ_CP024785.1_1759643_1762037_+	NA	NA|64aa|up_2|NZ_CP024785.1_1762137_1762329_-	NA	NA|364aa|up_1|NZ_CP024785.1_1763179_1764271_+	PRK07409, PRK07409, threonine synthase; Validated	NA|197aa|up_0|NZ_CP024785.1_1764511_1765102_+	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|381aa|down_0|NZ_CP024785.1_1765647_1766790_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|125aa|down_1|NZ_CP024785.1_1766853_1767228_-	NA	NA|77aa|down_2|NZ_CP024785.1_1767300_1767531_-	NA	NA|162aa|down_3|NZ_CP024785.1_1767598_1768084_-	COG4474, COG4474, Uncharacterized protein conserved in bacteria [Function unknown]	NA|170aa|down_4|NZ_CP024785.1_1768150_1768660_+	NA	NA|128aa|down_5|NZ_CP024785.1_1768695_1769079_-	NA	NA|76aa|down_6|NZ_CP024785.1_1769075_1769303_-	NA	NA|147aa|down_7|NZ_CP024785.1_1769364_1769805_-	NA	NA|57aa|down_8|NZ_CP024785.1_1769794_1769965_-	NA	NA|580aa|down_9|NZ_CP024785.1_1769964_1771704_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	17	1906225-1906346	15	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTCAACGGCTGGGGCTACCTGGACATTGCCATCATCCACAG	41	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|253aa|up_9|NZ_CP024785.1_1896908_1897667_+,NA|155aa|up_8|NZ_CP024785.1_1897726_1898191_+,NA|770aa|up_7|NZ_CP024785.1_1898187_1900497_+,NA|235aa|up_6|NZ_CP024785.1_1900517_1901222_+,NA|538aa|up_5|NZ_CP024785.1_1901224_1902838_+,NA|96aa|up_4|NZ_CP024785.1_1902824_1903112_+,NA|52aa|up_3|NZ_CP024785.1_1903231_1903387_+,NA|94aa|up_2|NZ_CP024785.1_1903429_1903711_+,NA|229aa|up_1|NZ_CP024785.1_1903850_1904537_+,NA|85aa|up_0|NZ_CP024785.1_1904510_1904765_-,NA|119aa|down_1|NZ_CP024785.1_1909139_1909496_-,NA|159aa|down_2|NZ_CP024785.1_1909595_1910072_+,NA|82aa|down_4|NZ_CP024785.1_1912688_1912934_+,NA|346aa|down_5|NZ_CP024785.1_1912944_1913982_+,NA|172aa|down_8|NZ_CP024785.1_1916688_1917204_+	NA|253aa|up_9|NZ_CP024785.1_1896908_1897667_+	NA	NA|155aa|up_8|NZ_CP024785.1_1897726_1898191_+	NA	NA|770aa|up_7|NZ_CP024785.1_1898187_1900497_+	NA	NA|235aa|up_6|NZ_CP024785.1_1900517_1901222_+	NA	NA|538aa|up_5|NZ_CP024785.1_1901224_1902838_+	NA	NA|96aa|up_4|NZ_CP024785.1_1902824_1903112_+	NA	NA|52aa|up_3|NZ_CP024785.1_1903231_1903387_+	NA	NA|94aa|up_2|NZ_CP024785.1_1903429_1903711_+	NA	NA|229aa|up_1|NZ_CP024785.1_1903850_1904537_+	NA	NA|85aa|up_0|NZ_CP024785.1_1904510_1904765_-	NA	NA|359aa|down_0|NZ_CP024785.1_1907554_1908631_-	smart00419, HTH_CRP, helix_turn_helix, cAMP Regulatory protein	NA|119aa|down_1|NZ_CP024785.1_1909139_1909496_-	NA	NA|159aa|down_2|NZ_CP024785.1_1909595_1910072_+	NA	NA|413aa|down_3|NZ_CP024785.1_1910442_1911681_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|82aa|down_4|NZ_CP024785.1_1912688_1912934_+	NA	NA|346aa|down_5|NZ_CP024785.1_1912944_1913982_+	NA	NA|504aa|down_6|NZ_CP024785.1_1914082_1915594_-	PRK09224, PRK09224, threonine ammonia-lyase IlvA	NA|100aa|down_7|NZ_CP024785.1_1916009_1916309_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|172aa|down_8|NZ_CP024785.1_1916688_1917204_+	NA	NA|233aa|down_9|NZ_CP024785.1_1918009_1918708_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	18	1951044-1951118	16	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CGGTTGATATTGGTTAACGAGCGC	24	1	1	1951068-1951094	NZ_CP024785.1_408197-408223	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|116aa|up_7|NZ_CP024785.1_1941426_1941774_+,NA|192aa|up_1|NZ_CP024785.1_1949137_1949713_-,NA|253aa|down_0|NZ_CP024785.1_1951394_1952153_-	NA|170aa|up_9|NZ_CP024785.1_1939117_1939627_-	TIGR04110, hypothetical_protein_VSWAT3_12502, heme utilization protein HutZ	NA|495aa|up_8|NZ_CP024785.1_1939878_1941363_+	pfam08547, CIA30, Complex I intermediate-associated protein 30 (CIA30)	NA|116aa|up_7|NZ_CP024785.1_1941426_1941774_+	NA	NA|272aa|up_6|NZ_CP024785.1_1942959_1943775_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|131aa|up_5|NZ_CP024785.1_1943817_1944210_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|187aa|up_4|NZ_CP024785.1_1944283_1944844_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|258aa|up_3|NZ_CP024785.1_1945441_1946215_-	cd17767, UP_EcUdp-like, uridine phosphorylases similar to Escherichia coli Udp and related phosphorylases	NA|200aa|up_2|NZ_CP024785.1_1946274_1946874_-	cd02900, Macro_Appr_pase, macrodomain, Appr-1"-pase family	NA|192aa|up_1|NZ_CP024785.1_1949137_1949713_-	NA	NA|214aa|up_0|NZ_CP024785.1_1950330_1950972_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|253aa|down_0|NZ_CP024785.1_1951394_1952153_-	NA	NA|291aa|down_1|NZ_CP024785.1_1952457_1953330_+	COG3393, COG3393, Predicted acetyltransferase [General function prediction only]	NA|161aa|down_2|NZ_CP024785.1_1953510_1953993_-	pfam17935, TetR_C_27, Tetracyclin repressor-like, C-terminal domain	NA|435aa|down_3|NZ_CP024785.1_1954155_1955460_+	pfam11717, Tudor-knot, RNA binding activity-knot of a chromodomain	NA|373aa|down_4|NZ_CP024785.1_1955770_1956889_+	pfam01636, APH, Phosphotransferase enzyme family	NA|181aa|down_5|NZ_CP024785.1_1956885_1957428_+	cd09627, DOMON_murB_like, Domon-like domain of UDP-N-acetylenolpyruvoylglucosamine reductase	NA|563aa|down_6|NZ_CP024785.1_1957864_1959553_+	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|291aa|down_7|NZ_CP024785.1_1959567_1960440_+	cd04512, Ntn_Asparaginase_2_like, L-Asparaginase type 2-like enzymes of the NTN-hydrolase superfamily	NA|423aa|down_8|NZ_CP024785.1_1960599_1961868_+	PRK12767, PRK12767, carbamoyl phosphate synthase-like protein; Provisional	NA|243aa|down_9|NZ_CP024785.1_1961925_1962654_+	pfam01904, DUF72, Protein of unknown function DUF72
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	19	2011846-2011956	17	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	AGTTCCAGTTACCGTTACCGTTGGTTG	27	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|165aa|down_0|NZ_CP024785.1_2014001_2014496_+,NA|84aa|down_3|NZ_CP024785.1_2017906_2018158_-,NA|132aa|down_8|NZ_CP024785.1_2022697_2023093_+,NA|243aa|down_9|NZ_CP024785.1_2023607_2024336_+	NA|284aa|up_9|NZ_CP024785.1_1997629_1998481_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|336aa|up_8|NZ_CP024785.1_1998530_1999538_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|423aa|up_7|NZ_CP024785.1_1999767_2001036_-	pfam08484, Methyltransf_14, C-methyltransferase C-terminal domain	NA|343aa|up_6|NZ_CP024785.1_2001145_2002174_-	cd08946, SDR_e, extended (e) SDRs	NA|405aa|up_5|NZ_CP024785.1_2002632_2003847_-	PRK11728, PRK11728, L-2-hydroxyglutarate oxidase	NA|258aa|up_4|NZ_CP024785.1_2003998_2004772_-	cd02524, G1P_cytidylyltransferase, G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose	NA|723aa|up_3|NZ_CP024785.1_2005279_2007448_-	COG4248, COG4248, Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains [General function prediction only]	NA|261aa|up_2|NZ_CP024785.1_2007504_2008287_-	pfam13672, PP2C_2, Protein phosphatase 2C	NA|225aa|up_1|NZ_CP024785.1_2008338_2009013_-	COG4245, TerY, Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]	NA|491aa|up_0|NZ_CP024785.1_2009767_2011240_+	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|165aa|down_0|NZ_CP024785.1_2014001_2014496_+	NA	NA|817aa|down_1|NZ_CP024785.1_2014933_2017384_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|90aa|down_2|NZ_CP024785.1_2017635_2017905_+	PRK12864, PRK12864, YciI-like protein; Reviewed	NA|84aa|down_3|NZ_CP024785.1_2017906_2018158_-	NA	NA|266aa|down_4|NZ_CP024785.1_2018249_2019047_+	pfam06485, DUF1092, Protein of unknown function (DUF1092)	NA|50aa|down_5|NZ_CP024785.1_2019949_2020099_+	pfam01701, PSI_PsaJ, Photosystem I reaction centre subunit IX / PsaJ	NA|361aa|down_6|NZ_CP024785.1_2020221_2021304_-	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|151aa|down_7|NZ_CP024785.1_2021621_2022074_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|132aa|down_8|NZ_CP024785.1_2022697_2023093_+	NA	NA|243aa|down_9|NZ_CP024785.1_2023607_2024336_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	20	2421499-2421600	18	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	AAGCCGAGGCGAATCTGCAAAAGG	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|99aa|down_6|NZ_CP024785.1_2429323_2429620_-,NA|96aa|down_9|NZ_CP024785.1_2430709_2430997_+	NA|188aa|up_9|NZ_CP024785.1_2406834_2407398_-	pfam10706, Aminoglyc_resit, Aminoglycoside-2''-adenylyltransferase	NA|236aa|up_8|NZ_CP024785.1_2407526_2408234_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|302aa|up_7|NZ_CP024785.1_2408289_2409195_-	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|646aa|up_6|NZ_CP024785.1_2409400_2411338_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|203aa|up_5|NZ_CP024785.1_2411592_2412201_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|390aa|up_4|NZ_CP024785.1_2413693_2414863_+	PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated	NA|62aa|up_3|NZ_CP024785.1_2415003_2415189_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|833aa|up_2|NZ_CP024785.1_2415820_2418319_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|135aa|up_1|NZ_CP024785.1_2418643_2419048_+	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|303aa|up_0|NZ_CP024785.1_2419426_2420335_-	sd00006, TPR, Tetratricopeptide repeat	NA|424aa|down_0|NZ_CP024785.1_2423078_2424350_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|224aa|down_1|NZ_CP024785.1_2424464_2425136_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|479aa|down_2|NZ_CP024785.1_2425252_2426689_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|273aa|down_3|NZ_CP024785.1_2426863_2427682_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|269aa|down_4|NZ_CP024785.1_2427688_2428495_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|238aa|down_5|NZ_CP024785.1_2428571_2429285_+	pfam05685, Uma2, Putative restriction endonuclease	NA|99aa|down_6|NZ_CP024785.1_2429323_2429620_-	NA	NA|156aa|down_7|NZ_CP024785.1_2429648_2430116_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|158aa|down_8|NZ_CP024785.1_2430193_2430667_+	pfam14328, DUF4385, Domain of unknown function (DUF4385)	NA|96aa|down_9|NZ_CP024785.1_2430709_2430997_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	21	2487024-2487121	19	CRISPRCasFinder	no	DinG	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Type IV-A	TTCTTCTTCTTTCTTTTCTTGTA	23	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|142aa|up_6|NZ_CP024785.1_2475494_2475920_-,NA|155aa|up_1|NZ_CP024785.1_2483109_2483574_-,NA	NA|155aa|up_9|NZ_CP024785.1_2472443_2472908_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|562aa|up_8|NZ_CP024785.1_2473004_2474690_-	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|186aa|up_7|NZ_CP024785.1_2474792_2475350_+	pfam05685, Uma2, Putative restriction endonuclease	NA|142aa|up_6|NZ_CP024785.1_2475494_2475920_-	NA	NA|321aa|up_5|NZ_CP024785.1_2476777_2477740_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	DinG|520aa|up_4|NZ_CP024785.1_2477872_2479432_+	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|72aa|up_3|NZ_CP024785.1_2479623_2479839_+	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|690aa|up_2|NZ_CP024785.1_2480043_2482113_+	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|155aa|up_1|NZ_CP024785.1_2483109_2483574_-	NA	NA|925aa|up_0|NZ_CP024785.1_2484067_2486842_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|249aa|down_0|NZ_CP024785.1_2487650_2488397_-	pfam18171, LSDAT_prok, SLOG in TRPM, prokaryote	NA|428aa|down_1|NZ_CP024785.1_2488757_2490041_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|417aa|down_2|NZ_CP024785.1_2490169_2491420_+	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|291aa|down_3|NZ_CP024785.1_2492351_2493224_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|464aa|down_4|NZ_CP024785.1_2493714_2495106_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|317aa|down_5|NZ_CP024785.1_2495352_2496303_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|219aa|down_6|NZ_CP024785.1_2499314_2499971_-	pfam09378, HAS-barrel, HAS barrel domain	NA|66aa|down_7|NZ_CP024785.1_2500095_2500293_-	pfam11623, NdhS, NAD(P)H dehydrogenase subunit S	NA|438aa|down_8|NZ_CP024785.1_2500402_2501716_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|357aa|down_9|NZ_CP024785.1_2501885_2502956_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	22	2644474-2645113	20,2	CRISPRCasFinder,CRT	no	c2c5_V-U5	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Type V-U5	GTTTCTAAAGCCCTCCTGCTTGGTGGTGGGTTGAAAG,GTTTCTAAAGCCCTCCTGCTTGGTGGTGGGTTGAAAG	37,37	0	0	NA	NA	NA:NA	8,8	8	TypeV-U5	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|75aa|up_9|NZ_CP024785.1_2631994_2632219_-,NA|332aa|up_8|NZ_CP024785.1_2632231_2633227_-,NA|60aa|up_7|NZ_CP024785.1_2633230_2633410_+,NA|95aa|up_6|NZ_CP024785.1_2633410_2633695_-,NA|104aa|up_5|NZ_CP024785.1_2633687_2633999_-,c2c5_V-U5|622aa|up_0|NZ_CP024785.1_2640439_2642305_+,NA|159aa|down_7|NZ_CP024785.1_2653684_2654161_+	NA|75aa|up_9|NZ_CP024785.1_2631994_2632219_-	NA	NA|332aa|up_8|NZ_CP024785.1_2632231_2633227_-	NA	NA|60aa|up_7|NZ_CP024785.1_2633230_2633410_+	NA	NA|95aa|up_6|NZ_CP024785.1_2633410_2633695_-	NA	NA|104aa|up_5|NZ_CP024785.1_2633687_2633999_-	NA	NA|170aa|up_4|NZ_CP024785.1_2634477_2634987_+	pfam06527, TniQ, TniQ	NA|394aa|up_3|NZ_CP024785.1_2635318_2636500_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|1137aa|up_2|NZ_CP024785.1_2636453_2639864_+	TIGR00348, R_protein, type I site-specific deoxyribonuclease, HsdR family	NA|57aa|up_1|NZ_CP024785.1_2639917_2640088_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	c2c5_V-U5|622aa|up_0|NZ_CP024785.1_2640439_2642305_+	NA	NA|243aa|down_0|NZ_CP024785.1_2645677_2646406_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|295aa|down_1|NZ_CP024785.1_2646402_2647287_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|394aa|down_2|NZ_CP024785.1_2647697_2648879_+	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|300aa|down_3|NZ_CP024785.1_2649075_2649975_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|271aa|down_4|NZ_CP024785.1_2650081_2650894_+	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|301aa|down_5|NZ_CP024785.1_2650902_2651805_-	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|434aa|down_6|NZ_CP024785.1_2651966_2653268_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|159aa|down_7|NZ_CP024785.1_2653684_2654161_+	NA	NA|150aa|down_8|NZ_CP024785.1_2654242_2654692_-	pfam11068, YlqD, YlqD protein	NA|659aa|down_9|NZ_CP024785.1_2654734_2656711_-	cd17640, LC_FACS_like, Long-chain fatty acid CoA synthetase
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	23	2710359-2710458	21	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	AGGAGTTAGAAATAAAAATCACAACTC	27	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|355aa|up_8|NZ_CP024785.1_2697787_2698852_+,NA|233aa|down_1|NZ_CP024785.1_2716192_2716891_+,NA|66aa|down_4|NZ_CP024785.1_2720016_2720214_-,NA|127aa|down_5|NZ_CP024785.1_2720589_2720970_+	NA|589aa|up_9|NZ_CP024785.1_2694692_2696459_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|355aa|up_8|NZ_CP024785.1_2697787_2698852_+	NA	NA|353aa|up_7|NZ_CP024785.1_2699388_2700447_+	CHL00045, ccsA, cytochrome c biogenesis protein	NA|103aa|up_6|NZ_CP024785.1_2701126_2701435_+	pfam08872, KGK, KGK domain	NA|333aa|up_5|NZ_CP024785.1_2701771_2702770_+	TIGR02432, tRNAIle-lysidine_synthase, tRNA(Ile)-lysidine synthetase, N-terminal domain	NA|587aa|up_4|NZ_CP024785.1_2702839_2704600_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|393aa|up_3|NZ_CP024785.1_2704979_2706158_+	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|122aa|up_2|NZ_CP024785.1_2706479_2706845_+	cd19937, REC_OmpR_BsPhoP-like, phosphoacceptor receiver (REC) domain of BsPhoP-like OmpR family response regulators	NA|177aa|up_1|NZ_CP024785.1_2706858_2707389_+	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|978aa|up_0|NZ_CP024785.1_2707403_2710337_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|1815aa|down_0|NZ_CP024785.1_2710519_2715964_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|233aa|down_1|NZ_CP024785.1_2716192_2716891_+	NA	NA|115aa|down_2|NZ_CP024785.1_2717851_2718196_+	pfam08844, DUF1815, Domain of unknown function (DUF1815)	NA|167aa|down_3|NZ_CP024785.1_2719213_2719714_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|66aa|down_4|NZ_CP024785.1_2720016_2720214_-	NA	NA|127aa|down_5|NZ_CP024785.1_2720589_2720970_+	NA	NA|399aa|down_6|NZ_CP024785.1_2721123_2722320_-	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional	NA|436aa|down_7|NZ_CP024785.1_2722772_2724080_+	COG5542, COG5542, Predicted integral membrane protein [Function unknown]	NA|322aa|down_8|NZ_CP024785.1_2725592_2726558_+	pfam00892, EamA, EamA-like transporter family	NA|396aa|down_9|NZ_CP024785.1_2726845_2728033_+	COG5542, COG5542, Predicted integral membrane protein [Function unknown]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	24	2777395-2777506	22	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	AAACAACTTAATTGATAAGCTTTGCTTAAG	30	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|59aa|up_6|NZ_CP024785.1_2769023_2769200_+,NA|72aa|up_4|NZ_CP024785.1_2770124_2770340_+,NA	NA|203aa|up_9|NZ_CP024785.1_2765479_2766088_-	PRK01641, leuD, 3-isopropylmalate dehydratase small subunit	NA|468aa|up_8|NZ_CP024785.1_2766155_2767559_-	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit	NA|232aa|up_7|NZ_CP024785.1_2768331_2769027_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|59aa|up_6|NZ_CP024785.1_2769023_2769200_+	NA	NA|248aa|up_5|NZ_CP024785.1_2769261_2770005_+	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|72aa|up_4|NZ_CP024785.1_2770124_2770340_+	NA	NA|196aa|up_3|NZ_CP024785.1_2771004_2771592_+	pfam09988, DUF2227, Uncharacterized metal-binding protein (DUF2227)	NA|275aa|up_2|NZ_CP024785.1_2771642_2772467_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|490aa|up_1|NZ_CP024785.1_2772872_2774342_-	cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2, including 10-formyltetrahydrofolate dehydrogenase, NAD+-dependent retinal dehydrogenase 1 and related proteins	NA|763aa|up_0|NZ_CP024785.1_2774634_2776923_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|1000aa|down_0|NZ_CP024785.1_2778554_2781554_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|942aa|down_1|NZ_CP024785.1_2781780_2784606_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|166aa|down_2|NZ_CP024785.1_2784666_2785164_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|126aa|down_3|NZ_CP024785.1_2785168_2785546_-	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|356aa|down_4|NZ_CP024785.1_2785608_2786676_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|489aa|down_5|NZ_CP024785.1_2787823_2789290_-	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|641aa|down_6|NZ_CP024785.1_2789494_2791417_+	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|396aa|down_7|NZ_CP024785.1_2791928_2793116_-	PRK00053, alr, alanine racemase; Reviewed	NA|166aa|down_8|NZ_CP024785.1_2793419_2793917_+	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|428aa|down_9|NZ_CP024785.1_2794811_2796095_+	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	25	3051122-3051219	23	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GCTTCTTGATAAATCCTTGTTTGCTCTAA	29	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|67aa|up_8|NZ_CP024785.1_3038736_3038937_-,NA|82aa|up_7|NZ_CP024785.1_3039550_3039796_+,NA|220aa|up_2|NZ_CP024785.1_3045488_3046148_-,NA|85aa|down_3|NZ_CP024785.1_3058386_3058641_-,NA|86aa|down_6|NZ_CP024785.1_3060600_3060858_-	NA|407aa|up_9|NZ_CP024785.1_3036964_3038185_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|67aa|up_8|NZ_CP024785.1_3038736_3038937_-	NA	NA|82aa|up_7|NZ_CP024785.1_3039550_3039796_+	NA	NA|152aa|up_6|NZ_CP024785.1_3040503_3040959_+	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|375aa|up_5|NZ_CP024785.1_3040997_3042122_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|445aa|up_4|NZ_CP024785.1_3042164_3043499_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|407aa|up_3|NZ_CP024785.1_3043686_3044907_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|220aa|up_2|NZ_CP024785.1_3045488_3046148_-	NA	NA|242aa|up_1|NZ_CP024785.1_3046460_3047186_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|695aa|up_0|NZ_CP024785.1_3048813_3050898_+	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|539aa|down_0|NZ_CP024785.1_3052180_3053797_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|409aa|down_1|NZ_CP024785.1_3054025_3055252_-	smart00854, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|335aa|down_2|NZ_CP024785.1_3057332_3058337_+	cd12183, LDH_like_2, D-Lactate and related Dehydrogenases, NAD-binding and catalytic domains	NA|85aa|down_3|NZ_CP024785.1_3058386_3058641_-	NA	NA|248aa|down_4|NZ_CP024785.1_3058637_3059381_-	pfam13398, Peptidase_M50B, Peptidase M50B-like	NA|293aa|down_5|NZ_CP024785.1_3059651_3060530_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|86aa|down_6|NZ_CP024785.1_3060600_3060858_-	NA	NA|241aa|down_7|NZ_CP024785.1_3060826_3061549_-	TIGR01198, 6-phosphogluconolactonase_6PGL	NA|248aa|down_8|NZ_CP024785.1_3061822_3062566_-	PRK00173, rph, ribonuclease PH; Reviewed	NA|208aa|down_9|NZ_CP024785.1_3062788_3063412_-	cd01428, ADK, Adenylate kinase (ADK) catalyzes the reversible phosphoryl transfer from adenosine triphosphates (ATP) to adenosine monophosphates (AMP) and to yield adenosine diphosphates (ADP)
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	26	3206611-3206724	24	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTACTGCCTCAACACTTGGGCAAGGTAATG	30	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|141aa|up_6|NZ_CP024785.1_3196419_3196842_-,NA|214aa|up_5|NZ_CP024785.1_3196844_3197486_-,NA|82aa|up_1|NZ_CP024785.1_3201553_3201799_-,NA|79aa|up_0|NZ_CP024785.1_3202224_3202461_+,NA|150aa|down_0|NZ_CP024785.1_3207823_3208273_+,NA|142aa|down_2|NZ_CP024785.1_3210344_3210770_+,NA|117aa|down_3|NZ_CP024785.1_3210837_3211188_+,NA|84aa|down_5|NZ_CP024785.1_3211890_3212142_+	NA|169aa|up_9|NZ_CP024785.1_3191568_3192075_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|1015aa|up_8|NZ_CP024785.1_3192049_3195094_-	smart00306, HintN, Hint (Hedgehog/Intein) domain N-terminal region	NA|410aa|up_7|NZ_CP024785.1_3195090_3196320_-	COG0367, AsnB, Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism]	NA|141aa|up_6|NZ_CP024785.1_3196419_3196842_-	NA	NA|214aa|up_5|NZ_CP024785.1_3196844_3197486_-	NA	NA|418aa|up_4|NZ_CP024785.1_3197527_3198781_-	COG0367, AsnB, Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism]	NA|525aa|up_3|NZ_CP024785.1_3198777_3200352_-	cd16403, ParB_N_like_MT, ParB N-terminal-like domain, some attached to C-terminal S-adenosylmethionine-dependent methyltransferase	NA|401aa|up_2|NZ_CP024785.1_3200344_3201547_-	COG0367, AsnB, Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism]	NA|82aa|up_1|NZ_CP024785.1_3201553_3201799_-	NA	NA|79aa|up_0|NZ_CP024785.1_3202224_3202461_+	NA	NA|150aa|down_0|NZ_CP024785.1_3207823_3208273_+	NA	NA|51aa|down_1|NZ_CP024785.1_3208396_3208549_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|142aa|down_2|NZ_CP024785.1_3210344_3210770_+	NA	NA|117aa|down_3|NZ_CP024785.1_3210837_3211188_+	NA	NA|195aa|down_4|NZ_CP024785.1_3211188_3211773_+	TIGR04096, conserved_hypothetical_protein, DNA phosphorothioation-associated putative methyltransferase	NA|84aa|down_5|NZ_CP024785.1_3211890_3212142_+	NA	NA|89aa|down_6|NZ_CP024785.1_3212457_3212724_+	pfam13384, HTH_23, Homeodomain-like domain	NA|311aa|down_7|NZ_CP024785.1_3212636_3213569_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|212aa|down_8|NZ_CP024785.1_3213661_3214297_+	PRK13413, mpi, master DNA invertase Mpi family serine-type recombinase	NA|530aa|down_9|NZ_CP024785.1_3214298_3215888_+	pfam00665, rve, Integrase core domain
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	27	3395147-3395270	25	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	ACTTACTCAAACCCCAGTTTGCTTTCTGATTTTGGTGTTTT	41	1	1	3395188-3395229	NZ_CP024785.1_2293379-2293338	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|66aa|down_1|NZ_CP024785.1_3396408_3396606_-,NA|184aa|down_4|NZ_CP024785.1_3398380_3398932_-,NA|469aa|down_6|NZ_CP024785.1_3401280_3402687_+	NA|46aa|up_9|NZ_CP024785.1_3386453_3386591_-	PRK02561, psbF, cytochrome b559 subunit beta; Provisional	NA|83aa|up_8|NZ_CP024785.1_3386600_3386849_-	PRK02557, psbE, cytochrome b559 subunit alpha; Provisional	NA|340aa|up_7|NZ_CP024785.1_3387067_3388087_-	PRK13684, PRK13684, photosynthesis system II assembly factor Ycf48	NA|112aa|up_6|NZ_CP024785.1_3388548_3388884_-	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|121aa|up_5|NZ_CP024785.1_3389231_3389594_+	CHL00022, ndhC, NADH dehydrogenase subunit 3	NA|246aa|up_4|NZ_CP024785.1_3389584_3390322_+	CHL00023, ndhK, NADH dehydrogenase subunit K	NA|177aa|up_3|NZ_CP024785.1_3390314_3390845_+	PRK12494, PRK12494, NAD(P)H-quinone oxidoreductase subunit J	NA|268aa|up_2|NZ_CP024785.1_3391286_3392090_-	pfam02557, VanY, D-alanyl-D-alanine carboxypeptidase	NA|306aa|up_1|NZ_CP024785.1_3392192_3393110_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|473aa|up_0|NZ_CP024785.1_3393163_3394582_-	pfam11850, DUF3370, Protein of unknown function (DUF3370)	NA|220aa|down_0|NZ_CP024785.1_3395654_3396314_-	TIGR03725, T6A_YeaZ, tRNA threonylcarbamoyl adenosine modification protein YeaZ	NA|66aa|down_1|NZ_CP024785.1_3396408_3396606_-	NA	NA|83aa|down_2|NZ_CP024785.1_3396692_3396941_-	pfam10718, Ycf34, Hypothetical chloroplast protein Ycf34	NA|441aa|down_3|NZ_CP024785.1_3397027_3398350_+	COG0617, PcnB, tRNA nucleotidyltransferase/poly(A) polymerase [Translation, ribosomal structure and biogenesis]	NA|184aa|down_4|NZ_CP024785.1_3398380_3398932_-	NA	NA|427aa|down_5|NZ_CP024785.1_3398952_3400233_-	cd00710, LbH_gamma_CA, Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|469aa|down_6|NZ_CP024785.1_3401280_3402687_+	NA	NA|204aa|down_7|NZ_CP024785.1_3402795_3403407_-	cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|85aa|down_8|NZ_CP024785.1_3404169_3404424_+	COG4327, COG4327, Predicted membrane protein [Function unknown]	NA|552aa|down_9|NZ_CP024785.1_3404433_3406089_+	TIGR03648, Na_symport_lg, probable sodium:solute symporter, VC_2705 subfamily
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	28	3547017-3547102	26	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTTTTACAGATATCCACAAAAAT	23	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|240aa|down_0|NZ_CP024785.1_3547831_3548551_+	NA|122aa|up_9|NZ_CP024785.1_3535008_3535374_-	CHL00165, ftrB, ferredoxin thioreductase subunit beta; Validated	NA|387aa|up_8|NZ_CP024785.1_3535641_3536802_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|49aa|up_7|NZ_CP024785.1_3536905_3537052_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|66aa|up_6|NZ_CP024785.1_3538349_3538547_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|316aa|up_5|NZ_CP024785.1_3539201_3540149_+	pfam10592, AIPR, AIPR protein	NA|597aa|up_4|NZ_CP024785.1_3540538_3542329_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|351aa|up_3|NZ_CP024785.1_3543395_3544448_+	COG5592, COG5592, Uncharacterized conserved protein [Function unknown]	NA|395aa|up_2|NZ_CP024785.1_3544529_3545714_-	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|175aa|up_1|NZ_CP024785.1_3545892_3546417_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|188aa|up_0|NZ_CP024785.1_3546450_3547014_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|240aa|down_0|NZ_CP024785.1_3547831_3548551_+	NA	NA|256aa|down_1|NZ_CP024785.1_3548829_3549597_+	pfam07082, DUF1350, Protein of unknown function (DUF1350)	NA|379aa|down_2|NZ_CP024785.1_3549753_3550890_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|109aa|down_3|NZ_CP024785.1_3550890_3551217_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|218aa|down_4|NZ_CP024785.1_3551862_3552516_-	COG0576, GrpE, Molecular chaperone GrpE (heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|164aa|down_5|NZ_CP024785.1_3552826_3553318_-	COG0783, Dps, DNA-binding ferritin-like protein (oxidative damage protectant) [Inorganic ion transport and metabolism]	NA|88aa|down_6|NZ_CP024785.1_3553523_3553787_-	cd16395, Srx, Sulfiredoxin reactivates peroxiredoxins after oxidative inactivation	NA|548aa|down_7|NZ_CP024785.1_3554064_3555708_+	COG2385, SpoIID, Sporulation protein and related proteins [Cell division and chromosome partitioning]	NA|100aa|down_8|NZ_CP024785.1_3555905_3556205_-	CHL00134, petF, ferredoxin; Validated	NA|337aa|down_9|NZ_CP024785.1_3556885_3557896_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	29	3941638-3941717	27	CRISPRCasFinder	no	cas14j	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Unclear	CCAAGACCTACTACCCCAAAAAAAGC	26	0	0	NA	NA	NA	1	1	TypeV	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|107aa|up_8|NZ_CP024785.1_3931255_3931576_+,NA|107aa|up_4|NZ_CP024785.1_3935022_3935343_-,NA|116aa|down_4|NZ_CP024785.1_3946122_3946470_-	cas14j|409aa|up_9|NZ_CP024785.1_3929404_3930631_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|107aa|up_8|NZ_CP024785.1_3931255_3931576_+	NA	NA|107aa|up_7|NZ_CP024785.1_3931610_3931931_-	pfam09876, DUF2103, Predicted metal-binding protein (DUF2103)	NA|109aa|up_6|NZ_CP024785.1_3931936_3932263_-	PRK13019, clpS, ATP-dependent Clp protease adapter ClpS	NA|693aa|up_5|NZ_CP024785.1_3932391_3934470_-	pfam12831, FAD_oxidored, FAD dependent oxidoreductase	NA|107aa|up_4|NZ_CP024785.1_3935022_3935343_-	NA	NA|164aa|up_3|NZ_CP024785.1_3936110_3936602_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|385aa|up_2|NZ_CP024785.1_3936598_3937753_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|422aa|up_1|NZ_CP024785.1_3938336_3939602_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|424aa|up_0|NZ_CP024785.1_3939682_3940954_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|118aa|down_0|NZ_CP024785.1_3942453_3942807_+	COG1950, COG1950, Predicted membrane protein [Function unknown]	NA|162aa|down_1|NZ_CP024785.1_3942812_3943298_-	pfam01890, CbiG_C, Cobalamin synthesis G C-terminus	NA|188aa|down_2|NZ_CP024785.1_3943314_3943878_-	pfam05685, Uma2, Putative restriction endonuclease	NA|588aa|down_3|NZ_CP024785.1_3944047_3945811_-	PLN02286, PLN02286, arginine-tRNA ligase	NA|116aa|down_4|NZ_CP024785.1_3946122_3946470_-	NA	NA|51aa|down_5|NZ_CP024785.1_3946543_3946696_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|74aa|down_6|NZ_CP024785.1_3946707_3946929_+	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|504aa|down_7|NZ_CP024785.1_3946974_3948486_-	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|717aa|down_8|NZ_CP024785.1_3948570_3950721_-	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|430aa|down_9|NZ_CP024785.1_3951885_3953175_+	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	30	4055494-4055598	28	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GGGGGAAACAAGAATGCAGTTCCCTCCCTTTTA	33	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|136aa|up_8|NZ_CP024785.1_4041013_4041421_-,NA|78aa|up_5|NZ_CP024785.1_4045101_4045335_-,NA|500aa|up_2|NZ_CP024785.1_4051173_4052673_-,NA|98aa|down_1|NZ_CP024785.1_4057000_4057294_-,NA|140aa|down_8|NZ_CP024785.1_4063597_4064017_-	NA|688aa|up_9|NZ_CP024785.1_4038829_4040893_-	PRK00208, thiG, thiazole synthase; Reviewed	NA|136aa|up_8|NZ_CP024785.1_4041013_4041421_-	NA	NA|113aa|up_7|NZ_CP024785.1_4041437_4041776_-	COG0633, Fdx, Ferredoxin [Energy production and conversion]	NA|563aa|up_6|NZ_CP024785.1_4041851_4043540_+	PRK12344, PRK12344, putative alpha-isopropylmalate/homocitrate synthase family transferase; Provisional	NA|78aa|up_5|NZ_CP024785.1_4045101_4045335_-	NA	NA|721aa|up_4|NZ_CP024785.1_4045495_4047658_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|517aa|up_3|NZ_CP024785.1_4047707_4049258_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|500aa|up_2|NZ_CP024785.1_4051173_4052673_-	NA	NA|258aa|up_1|NZ_CP024785.1_4052732_4053506_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|351aa|up_0|NZ_CP024785.1_4053667_4054720_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|336aa|down_0|NZ_CP024785.1_4055857_4056865_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|98aa|down_1|NZ_CP024785.1_4057000_4057294_-	NA	NA|57aa|down_2|NZ_CP024785.1_4057721_4057892_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|364aa|down_3|NZ_CP024785.1_4058308_4059400_+	cd05305, L-AlaDH, Alanine dehydrogenase NAD-binding and catalytic domains	NA|371aa|down_4|NZ_CP024785.1_4059696_4060809_+	cd01846, fatty_acyltransferase_like, Fatty acyltransferase-like subfamily of the SGNH hydrolases, a diverse family of lipases and esterases	NA|185aa|down_5|NZ_CP024785.1_4060879_4061434_-	pfam05685, Uma2, Putative restriction endonuclease	NA|352aa|down_6|NZ_CP024785.1_4061538_4062594_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|270aa|down_7|NZ_CP024785.1_4062670_4063480_-	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|140aa|down_8|NZ_CP024785.1_4063597_4064017_-	NA	NA|286aa|down_9|NZ_CP024785.1_4064109_4064967_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	31	4356957-4357050	29	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TCGCAGAAAATACCTGATGATATTCTGC	28	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|109aa|up_9|NZ_CP024785.1_4346129_4346456_+,NA|130aa|up_8|NZ_CP024785.1_4346624_4347014_-,NA|84aa|up_6|NZ_CP024785.1_4347651_4347903_-,NA|112aa|up_5|NZ_CP024785.1_4347974_4348310_+,NA|59aa|down_0|NZ_CP024785.1_4358162_4358339_-,NA|73aa|down_4|NZ_CP024785.1_4360940_4361159_-,NA|87aa|down_8|NZ_CP024785.1_4366215_4366476_-	NA|109aa|up_9|NZ_CP024785.1_4346129_4346456_+	NA	NA|130aa|up_8|NZ_CP024785.1_4346624_4347014_-	NA	NA|169aa|up_7|NZ_CP024785.1_4347030_4347537_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|84aa|up_6|NZ_CP024785.1_4347651_4347903_-	NA	NA|112aa|up_5|NZ_CP024785.1_4347974_4348310_+	NA	NA|674aa|up_4|NZ_CP024785.1_4348937_4350959_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|202aa|up_3|NZ_CP024785.1_4350967_4351573_+	cd08866, SRPBCC_11, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|640aa|up_2|NZ_CP024785.1_4352058_4353978_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|220aa|up_1|NZ_CP024785.1_4354039_4354699_-	COG1290, QcrB, Cytochrome b subunit of the bc complex [Energy production and conversion]	NA|232aa|up_0|NZ_CP024785.1_4355088_4355784_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|59aa|down_0|NZ_CP024785.1_4358162_4358339_-	NA	NA|115aa|down_1|NZ_CP024785.1_4358908_4359253_-	PRK13697, PRK13697, cytochrome c6; Provisional	NA|140aa|down_2|NZ_CP024785.1_4359507_4359927_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|164aa|down_3|NZ_CP024785.1_4360319_4360811_-	PRK13618, psbV, cytochrome c-550; Provisional	NA|73aa|down_4|NZ_CP024785.1_4360940_4361159_-	NA	NA|485aa|down_5|NZ_CP024785.1_4361753_4363208_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|317aa|down_6|NZ_CP024785.1_4363304_4364255_-	PRK05654, PRK05654, acetyl-CoA carboxylase carboxyltransferase subunit beta	NA|283aa|down_7|NZ_CP024785.1_4365229_4366078_-	COG1989, PulO, Type II secretory pathway, prepilin signal peptidase PulO and related peptidases [Cell motility and secretion / Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]	NA|87aa|down_8|NZ_CP024785.1_4366215_4366476_-	NA	NA|363aa|down_9|NZ_CP024785.1_4366566_4367655_-	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	32	4679869-4679970	30	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CCGTTCACAAATTAATGTCGTTTT	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|100aa|down_7|NZ_CP024785.1_4694940_4695240_-,NA|56aa|down_8|NZ_CP024785.1_4695454_4695622_+,NA|94aa|down_9|NZ_CP024785.1_4695777_4696059_-	NA|298aa|up_9|NZ_CP024785.1_4661085_4661979_-	PLN02679, PLN02679, hydrolase, alpha/beta fold family protein	NA|564aa|up_8|NZ_CP024785.1_4662325_4664017_+	pfam00395, SLH, S-layer homology domain	NA|146aa|up_7|NZ_CP024785.1_4664028_4664466_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|260aa|up_6|NZ_CP024785.1_4664577_4665357_+	pfam13483, Lactamase_B_3, Beta-lactamase superfamily domain	NA|306aa|up_5|NZ_CP024785.1_4665510_4666428_+	cd09993, HDAC_classIV, Histone deacetylase class IV also known as histone deacetylase 11	NA|1493aa|up_4|NZ_CP024785.1_4666669_4671148_+	pfam13087, AAA_12, AAA domain	NA|615aa|up_3|NZ_CP024785.1_4672284_4674129_+	cd09133, PLDc_unchar5, Putative catalytic domain of uncharacterized hypothetical proteins with one or two copies of the HKD motif	NA|380aa|up_2|NZ_CP024785.1_4674360_4675500_+	TIGR02048, gshA_cyano, glutamate--cysteine ligase, cyanobacterial, putative	NA|154aa|up_1|NZ_CP024785.1_4676005_4676467_+	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|767aa|up_0|NZ_CP024785.1_4677259_4679560_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|89aa|down_0|NZ_CP024785.1_4681658_4681925_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|717aa|down_1|NZ_CP024785.1_4687233_4689384_-	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|204aa|down_2|NZ_CP024785.1_4689515_4690127_+	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|181aa|down_3|NZ_CP024785.1_4690359_4690902_+	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|347aa|down_4|NZ_CP024785.1_4691653_4692694_-	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|190aa|down_5|NZ_CP024785.1_4692816_4693386_-	COG4639, COG4639, Predicted kinase [General function prediction only]	NA|430aa|down_6|NZ_CP024785.1_4693420_4694710_-	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|100aa|down_7|NZ_CP024785.1_4694940_4695240_-	NA	NA|56aa|down_8|NZ_CP024785.1_4695454_4695622_+	NA	NA|94aa|down_9|NZ_CP024785.1_4695777_4696059_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	33	4807871-4807965	31	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTTAATTTAGTGTTTATAAACACTTTTT	28	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|90aa|up_6|NZ_CP024785.1_4797431_4797701_+,NA|302aa|up_1|NZ_CP024785.1_4803303_4804209_+,NA|448aa|up_0|NZ_CP024785.1_4806199_4807543_-,NA	NA|505aa|up_9|NZ_CP024785.1_4793262_4794777_+	COG3320, COG3320, Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|268aa|up_8|NZ_CP024785.1_4795522_4796326_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|240aa|up_7|NZ_CP024785.1_4796392_4797112_-	COG2091, Sfp, Phosphopantetheinyl transferase [Coenzyme metabolism]	NA|90aa|up_6|NZ_CP024785.1_4797431_4797701_+	NA	NA|306aa|up_5|NZ_CP024785.1_4797854_4798772_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|355aa|up_4|NZ_CP024785.1_4799151_4800216_-	COG0429, COG0429, Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]	NA|248aa|up_3|NZ_CP024785.1_4800372_4801116_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|264aa|up_2|NZ_CP024785.1_4802235_4803027_-	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase	NA|302aa|up_1|NZ_CP024785.1_4803303_4804209_+	NA	NA|448aa|up_0|NZ_CP024785.1_4806199_4807543_-	NA	NA|152aa|down_0|NZ_CP024785.1_4808666_4809122_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|275aa|down_1|NZ_CP024785.1_4809360_4810185_+	pfam06167, Peptidase_M90, Glucose-regulated metallo-peptidase M90	NA|93aa|down_2|NZ_CP024785.1_4810328_4810607_+	pfam15670, Spem1, Spermatid maturation protein 1	NA|152aa|down_3|NZ_CP024785.1_4810722_4811178_-	COG2947, COG2947, Uncharacterized conserved protein [Function unknown]	NA|215aa|down_4|NZ_CP024785.1_4811292_4811937_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|553aa|down_5|NZ_CP024785.1_4812075_4813734_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|156aa|down_6|NZ_CP024785.1_4813807_4814275_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|484aa|down_7|NZ_CP024785.1_4814437_4815889_+	COG1566, EmrA, Multidrug resistance efflux pump [Defense mechanisms]	NA|517aa|down_8|NZ_CP024785.1_4816617_4818168_-	PRK06370, PRK06370, FAD-containing oxidoreductase	NA|370aa|down_9|NZ_CP024785.1_4818521_4819631_-	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	34	5190407-5190492	32	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TTTTACTTATTCAAACCCCAGTTT	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|89aa|up_8|NZ_CP024785.1_5183085_5183352_-,NA|70aa|up_6|NZ_CP024785.1_5183837_5184047_-,NA|325aa|up_3|NZ_CP024785.1_5187728_5188703_+,NA|89aa|up_2|NZ_CP024785.1_5188781_5189048_+,NA|75aa|up_1|NZ_CP024785.1_5189056_5189281_+,NA|87aa|down_7|NZ_CP024785.1_5200296_5200557_+,NA|141aa|down_9|NZ_CP024785.1_5202695_5203118_+	NA|169aa|up_9|NZ_CP024785.1_5182486_5182993_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|89aa|up_8|NZ_CP024785.1_5183085_5183352_-	NA	NA|138aa|up_7|NZ_CP024785.1_5183423_5183837_-	pfam01844, HNH, HNH endonuclease	NA|70aa|up_6|NZ_CP024785.1_5183837_5184047_-	NA	NA|532aa|up_5|NZ_CP024785.1_5184133_5185729_-	COG0374, HyaB, Ni,Fe-hydrogenase I large subunit [Energy production and conversion]	NA|321aa|up_4|NZ_CP024785.1_5185938_5186901_-	COG1740, HyaA, Ni,Fe-hydrogenase I small subunit [Energy production and conversion]	NA|325aa|up_3|NZ_CP024785.1_5187728_5188703_+	NA	NA|89aa|up_2|NZ_CP024785.1_5188781_5189048_+	NA	NA|75aa|up_1|NZ_CP024785.1_5189056_5189281_+	NA	NA|282aa|up_0|NZ_CP024785.1_5189337_5190183_+	pfam01106, NifU, NifU-like domain	NA|405aa|down_0|NZ_CP024785.1_5190813_5192028_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|783aa|down_1|NZ_CP024785.1_5192048_5194397_+	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|87aa|down_2|NZ_CP024785.1_5194479_5194740_+	pfam01455, HupF_HypC, HupF/HypC family	NA|393aa|down_3|NZ_CP024785.1_5194959_5196138_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|368aa|down_4|NZ_CP024785.1_5196197_5197301_+	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|114aa|down_5|NZ_CP024785.1_5197321_5197663_+	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|272aa|down_6|NZ_CP024785.1_5197653_5198469_+	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|87aa|down_7|NZ_CP024785.1_5200296_5200557_+	NA	NA|320aa|down_8|NZ_CP024785.1_5200617_5201577_-	TIGR01136, Cysteine_synthase, cysteine synthase	NA|141aa|down_9|NZ_CP024785.1_5202695_5203118_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	35	5376819-5376937	33	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	ATTCTTCTTAATTATTTTCGACTTAGTTTGAG	32	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|50aa|down_0|NZ_CP024785.1_5377523_5377673_+,NA|66aa|down_4|NZ_CP024785.1_5380044_5380242_-,NA|126aa|down_5|NZ_CP024785.1_5380370_5380748_+	NA|275aa|up_9|NZ_CP024785.1_5362985_5363810_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|548aa|up_8|NZ_CP024785.1_5363904_5365548_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|159aa|up_7|NZ_CP024785.1_5365544_5366021_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|655aa|up_6|NZ_CP024785.1_5366020_5367985_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|408aa|up_5|NZ_CP024785.1_5368308_5369532_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|424aa|up_4|NZ_CP024785.1_5370222_5371494_+	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|389aa|up_3|NZ_CP024785.1_5371771_5372938_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|231aa|up_2|NZ_CP024785.1_5373076_5373769_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|312aa|up_1|NZ_CP024785.1_5374527_5375463_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|179aa|up_0|NZ_CP024785.1_5375585_5376122_+	pfam10719, ComFB, Late competence development protein ComFB	NA|50aa|down_0|NZ_CP024785.1_5377523_5377673_+	NA	NA|219aa|down_1|NZ_CP024785.1_5378040_5378697_+	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|205aa|down_2|NZ_CP024785.1_5378851_5379466_+	COG5662, COG5662, Predicted transmembrane transcriptional regulator (anti-sigma factor) [Transcription]	NA|152aa|down_3|NZ_CP024785.1_5379479_5379935_-	COG2105, COG2105, Uncharacterized conserved protein [Function unknown]	NA|66aa|down_4|NZ_CP024785.1_5380044_5380242_-	NA	NA|126aa|down_5|NZ_CP024785.1_5380370_5380748_+	NA	NA|883aa|down_6|NZ_CP024785.1_5381001_5383650_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|109aa|down_7|NZ_CP024785.1_5383823_5384150_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|385aa|down_8|NZ_CP024785.1_5385861_5387016_+	COG0075, COG0075, Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase [Amino acid transport and metabolism]	NA|76aa|down_9|NZ_CP024785.1_5387369_5387597_-	pfam11165, DUF2949, Protein of unknown function (DUF2949)
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	36	5422137-5422282	34	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GGGGTTGCGCTGTACGGTCAAGGGAAGTTGGAACAAGCGATCGC	44	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|51aa|up_5|NZ_CP024785.1_5415951_5416104_-,NA|143aa|down_6|NZ_CP024785.1_5428454_5428883_+,NA|70aa|down_7|NZ_CP024785.1_5429409_5429619_-	NA|134aa|up_9|NZ_CP024785.1_5407976_5408378_+	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|616aa|up_8|NZ_CP024785.1_5408690_5410538_+	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|933aa|up_7|NZ_CP024785.1_5410850_5413649_+	COG3280, TreY, Maltooligosyl trehalose synthase [Carbohydrate transport and metabolism]	NA|505aa|up_6|NZ_CP024785.1_5413746_5415261_+	COG1626, TreA, Neutral trehalase [Carbohydrate transport and metabolism]	NA|51aa|up_5|NZ_CP024785.1_5415951_5416104_-	NA	NA|479aa|up_4|NZ_CP024785.1_5416551_5417988_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|200aa|up_3|NZ_CP024785.1_5417992_5418592_+	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|249aa|up_2|NZ_CP024785.1_5418552_5419299_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|573aa|up_1|NZ_CP024785.1_5419331_5421050_+	PRK13981, PRK13981, NAD synthetase; Provisional	NA|248aa|up_0|NZ_CP024785.1_5421275_5422019_+	pfam13676, TIR_2, TIR domain	NA|203aa|down_0|NZ_CP024785.1_5422576_5423185_-	COG5526, COG5526, Lysozyme family protein [General function prediction only]	NA|298aa|down_1|NZ_CP024785.1_5423561_5424455_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|194aa|down_2|NZ_CP024785.1_5424629_5425211_-	COG1981, COG1981, Predicted membrane protein [Function unknown]	NA|284aa|down_3|NZ_CP024785.1_5425399_5426251_-	PLN03084, PLN03084, alpha/beta hydrolase fold protein; Provisional	NA|372aa|down_4|NZ_CP024785.1_5426360_5427476_-	pfam01266, DAO, FAD dependent oxidoreductase	NA|152aa|down_5|NZ_CP024785.1_5427647_5428103_-	TIGR03042, hypothetical_protein, photosystem II protein PsbQ	NA|143aa|down_6|NZ_CP024785.1_5428454_5428883_+	NA	NA|70aa|down_7|NZ_CP024785.1_5429409_5429619_-	NA	NA|498aa|down_8|NZ_CP024785.1_5429709_5431203_-	cd08156, catalase_clade_3, Clade 3 of the heme-binding enzyme catalase	NA|459aa|down_9|NZ_CP024785.1_5431541_5432918_-	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	37	5777362-5777523	35	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TCATCATCGCCGCGATGTCTGTACTCCCGGTCATCGTCATCATCGCCGCGA	51	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|180aa|up_5|NZ_CP024785.1_5772414_5772954_+,NA|76aa|up_1|NZ_CP024785.1_5776350_5776578_+,NA	NA|323aa|up_9|NZ_CP024785.1_5768361_5769330_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|369aa|up_8|NZ_CP024785.1_5769391_5770498_-	COG3093, VapI, Plasmid maintenance system antidote protein [General function prediction only]	NA|110aa|up_7|NZ_CP024785.1_5770490_5770820_-	COG3549, HigB, Plasmid maintenance system killer protein [General function prediction only]	NA|379aa|up_6|NZ_CP024785.1_5770909_5772046_-	PRK07413, PRK07413, cob(I)yrinic acid a,c-diamide adenosyltransferase	NA|180aa|up_5|NZ_CP024785.1_5772414_5772954_+	NA	NA|349aa|up_4|NZ_CP024785.1_5773014_5774061_+	pfam17310, DUF5357, Family of unknown function (DUF5357)	NA|260aa|up_3|NZ_CP024785.1_5774078_5774858_+	COG1277, NosY, ABC-type transport system involved in multi-copper enzyme maturation, permease component [General function prediction only]	NA|219aa|up_2|NZ_CP024785.1_5774934_5775591_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|76aa|up_1|NZ_CP024785.1_5776350_5776578_+	NA	NA|101aa|up_0|NZ_CP024785.1_5776968_5777271_-	PRK14423, PRK14423, acylphosphatase; Provisional	NA|126aa|down_0|NZ_CP024785.1_5777838_5778216_+	pfam08853, DUF1823, Domain of unknown function (DUF1823)	NA|555aa|down_1|NZ_CP024785.1_5778527_5780192_+	cd11350, AmyAc_4, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|352aa|down_2|NZ_CP024785.1_5780372_5781428_+	PRK12755, PRK12755, phospho-2-dehydro-3-deoxyheptonate aldolase; Provisional	NA|58aa|down_3|NZ_CP024785.1_5781853_5782027_-	COG2929, COG2929, Uncharacterized protein conserved in bacteria [Function unknown]	NA|168aa|down_4|NZ_CP024785.1_5782422_5782926_-	pfam13239, 2TM, 2TM domain	NA|239aa|down_5|NZ_CP024785.1_5783196_5783913_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|185aa|down_6|NZ_CP024785.1_5784773_5785328_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|219aa|down_7|NZ_CP024785.1_5785527_5786184_-	pfam14103, DUF4276, Domain of unknown function (DUF4276)	NA|365aa|down_8|NZ_CP024785.1_5786180_5787275_-	COG4637, COG4637, Predicted ATPase [General function prediction only]	NA|807aa|down_9|NZ_CP024785.1_5787839_5790260_-	TIGR02470, Sucrose_synthase_1, sucrose synthase
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	38	6043346-6043461	36	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	TGCATACTAGGGTCATAGTGCCGCTTAT	28	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|58aa|up_7|NZ_CP024785.1_6038468_6038642_-,NA|113aa|up_5|NZ_CP024785.1_6040402_6040741_-,NA|137aa|up_4|NZ_CP024785.1_6040755_6041166_-,NA|68aa|up_3|NZ_CP024785.1_6041190_6041394_-,NA|69aa|up_1|NZ_CP024785.1_6042719_6042926_-,NA|123aa|up_0|NZ_CP024785.1_6042970_6043339_-,NA|115aa|down_0|NZ_CP024785.1_6043782_6044127_+,NA|52aa|down_2|NZ_CP024785.1_6047070_6047226_-,NA|117aa|down_3|NZ_CP024785.1_6047555_6047906_+,NA|181aa|down_4|NZ_CP024785.1_6047905_6048448_+,NA|185aa|down_5|NZ_CP024785.1_6048490_6049045_+,NA|261aa|down_6|NZ_CP024785.1_6049041_6049824_+,NA|436aa|down_7|NZ_CP024785.1_6049998_6051306_+,NA|78aa|down_9|NZ_CP024785.1_6051741_6051975_+	NA|632aa|up_9|NZ_CP024785.1_6034177_6036073_-	pfam11850, DUF3370, Protein of unknown function (DUF3370)	NA|691aa|up_8|NZ_CP024785.1_6036399_6038472_-	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|58aa|up_7|NZ_CP024785.1_6038468_6038642_-	NA	NA|398aa|up_6|NZ_CP024785.1_6039219_6040413_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|113aa|up_5|NZ_CP024785.1_6040402_6040741_-	NA	NA|137aa|up_4|NZ_CP024785.1_6040755_6041166_-	NA	NA|68aa|up_3|NZ_CP024785.1_6041190_6041394_-	NA	NA|407aa|up_2|NZ_CP024785.1_6041390_6042611_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|69aa|up_1|NZ_CP024785.1_6042719_6042926_-	NA	NA|123aa|up_0|NZ_CP024785.1_6042970_6043339_-	NA	NA|115aa|down_0|NZ_CP024785.1_6043782_6044127_+	NA	NA|820aa|down_1|NZ_CP024785.1_6044625_6047085_+	PRK07773, PRK07773, replicative DNA helicase; Validated	NA|52aa|down_2|NZ_CP024785.1_6047070_6047226_-	NA	NA|117aa|down_3|NZ_CP024785.1_6047555_6047906_+	NA	NA|181aa|down_4|NZ_CP024785.1_6047905_6048448_+	NA	NA|185aa|down_5|NZ_CP024785.1_6048490_6049045_+	NA	NA|261aa|down_6|NZ_CP024785.1_6049041_6049824_+	NA	NA|436aa|down_7|NZ_CP024785.1_6049998_6051306_+	NA	NA|79aa|down_8|NZ_CP024785.1_6051402_6051639_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|78aa|down_9|NZ_CP024785.1_6051741_6051975_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	39	6291289-6291392	37	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CAGTTATGGGAGTATGATGAACCAGATGT	29	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|125aa|up_3|NZ_CP024785.1_6283612_6283987_-,NA|228aa|up_0|NZ_CP024785.1_6289933_6290617_+,NA|135aa|down_5|NZ_CP024785.1_6302036_6302441_-	NA|237aa|up_9|NZ_CP024785.1_6275312_6276023_+	TIGR04283, glycosyl_transferase_family_2, transferase 2, rSAM/selenodomain-associated	NA|253aa|up_8|NZ_CP024785.1_6276436_6277195_+	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|264aa|up_7|NZ_CP024785.1_6277163_6277955_+	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|560aa|up_6|NZ_CP024785.1_6278555_6280235_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|640aa|up_5|NZ_CP024785.1_6280499_6282419_-	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|271aa|up_4|NZ_CP024785.1_6282627_6283440_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|125aa|up_3|NZ_CP024785.1_6283612_6283987_-	NA	NA|469aa|up_2|NZ_CP024785.1_6284608_6286015_+	COG1215, COG1215, Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]	NA|1118aa|up_1|NZ_CP024785.1_6286181_6289535_-	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|228aa|up_0|NZ_CP024785.1_6289933_6290617_+	NA	NA|843aa|down_0|NZ_CP024785.1_6295084_6297613_+	cd05805, MPG1_transferase, GTP-mannose-1-phosphate guanyltransferase (MPG1 transferase), also known as GDP-mannose pyrophosphorylase, is a bifunctional enzyme with both phosphomannose isomerase (PMI) activity and GDP-mannose phosphorylase (GMP) activity	NA|447aa|down_1|NZ_CP024785.1_6298202_6299543_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|128aa|down_2|NZ_CP024785.1_6299925_6300309_-	cd00687, Terpene_cyclase_nonplant_C1, Non-plant Terpene Cyclases, Class 1	NA|171aa|down_3|NZ_CP024785.1_6300422_6300935_+	PRK09267, PRK09267, flavodoxin FldA; Validated	NA|251aa|down_4|NZ_CP024785.1_6301002_6301755_+	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|135aa|down_5|NZ_CP024785.1_6302036_6302441_-	NA	NA|314aa|down_6|NZ_CP024785.1_6302921_6303863_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|1236aa|down_7|NZ_CP024785.1_6304510_6308218_+	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|74aa|down_8|NZ_CP024785.1_6308270_6308492_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|62aa|down_9|NZ_CP024785.1_6308491_6308677_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	40	6653569-6653642	38	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CGAGTCGGATTTGCACCAGAGAT	23	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|326aa|up_9|NZ_CP024785.1_6642099_6643077_+,NA|62aa|up_8|NZ_CP024785.1_6643308_6643494_+,NA|234aa|up_4|NZ_CP024785.1_6648795_6649497_-,NA|61aa|up_3|NZ_CP024785.1_6649519_6649702_+,NA|364aa|down_8|NZ_CP024785.1_6662779_6663871_-	NA|326aa|up_9|NZ_CP024785.1_6642099_6643077_+	NA	NA|62aa|up_8|NZ_CP024785.1_6643308_6643494_+	NA	NA|800aa|up_7|NZ_CP024785.1_6643623_6646023_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|152aa|up_6|NZ_CP024785.1_6646075_6646531_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|578aa|up_5|NZ_CP024785.1_6646810_6648544_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|234aa|up_4|NZ_CP024785.1_6648795_6649497_-	NA	NA|61aa|up_3|NZ_CP024785.1_6649519_6649702_+	NA	NA|295aa|up_2|NZ_CP024785.1_6650243_6651128_+	PRK02755, truB, tRNA pseudouridine synthase B; Provisional	NA|282aa|up_1|NZ_CP024785.1_6651204_6652050_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|101aa|up_0|NZ_CP024785.1_6652218_6652521_-	pfam08846, DUF1816, Domain of unknown function (DUF1816)	NA|173aa|down_0|NZ_CP024785.1_6653860_6654379_-	COG1939, COG1939, Ribonuclease III family protein [Replication, recombination, and    repair]	NA|122aa|down_1|NZ_CP024785.1_6654915_6655281_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|389aa|down_2|NZ_CP024785.1_6655533_6656700_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|352aa|down_3|NZ_CP024785.1_6656940_6657996_+	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|356aa|down_4|NZ_CP024785.1_6657925_6658993_-	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|294aa|down_5|NZ_CP024785.1_6659073_6659955_-	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|243aa|down_6|NZ_CP024785.1_6661593_6662322_+	pfam01292, Ni_hydr_CYTB, Prokaryotic cytochrome b561	NA|60aa|down_7|NZ_CP024785.1_6662497_6662677_+	PHA02337, PHA02337, putative high light inducible protein	NA|364aa|down_8|NZ_CP024785.1_6662779_6663871_-	NA	NA|77aa|down_9|NZ_CP024785.1_6667258_6667489_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	41	6810287-6810394	39	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GTTTCAATCCCTAATAGGGATTTTAATGAATTGCAAT	37	0	0	NA	NA	I-D,II-B	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA	NA|114aa|up_9|NZ_CP024785.1_6795459_6795801_+	cd02238, cupin_KdgF, pectin degradation protein KdgF and related proteins, cupin domain	NA|308aa|up_8|NZ_CP024785.1_6796330_6797254_+	COG3546, COG3546, Mn-containing catalase [Inorganic ion transport and metabolism]	NA|305aa|up_7|NZ_CP024785.1_6797413_6798328_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|398aa|up_6|NZ_CP024785.1_6798511_6799705_+	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|274aa|up_5|NZ_CP024785.1_6801353_6802175_+	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|351aa|up_4|NZ_CP024785.1_6802439_6803492_+	PLN02598, PLN02598, omega-6 fatty acid desaturase	NA|360aa|up_3|NZ_CP024785.1_6803711_6804791_+	PLN02498, PLN02498, omega-3 fatty acid desaturase	NA|232aa|up_2|NZ_CP024785.1_6805273_6805969_+	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|237aa|up_1|NZ_CP024785.1_6806297_6807008_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|363aa|up_0|NZ_CP024785.1_6807013_6808102_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|491aa|down_0|NZ_CP024785.1_6810703_6812176_-	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|366aa|down_1|NZ_CP024785.1_6812613_6813711_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|82aa|down_2|NZ_CP024785.1_6813718_6813964_+	pfam14279, HNH_5, HNH endonuclease	NA|912aa|down_3|NZ_CP024785.1_6814314_6817050_-	cd10797, GH57N_APU_like_1, N-terminal putative catalytic domain of mainly uncharacterized prokaryotic proteins similar to archaeal thermoactive amylopullulanases; glycoside hydrolase family 57 (GH57)	NA|362aa|down_4|NZ_CP024785.1_6817574_6818660_+	TIGR00378, cax, calcium/proton exchanger (cax)	NA|380aa|down_5|NZ_CP024785.1_6818902_6820042_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|294aa|down_6|NZ_CP024785.1_6820123_6821005_-	cd13653, PBP2_phosphate_like_1, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|238aa|down_7|NZ_CP024785.1_6821011_6821725_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|230aa|down_8|NZ_CP024785.1_6822117_6822807_+	cd00569, HTH_Hin_like, Helix-turn-helix domain of Hin and related proteins	NA|302aa|down_9|NZ_CP024785.1_6822806_6823712_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	42	6835858-6836018	4	PILER-CR	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CCGCCCCCGTAGAGAGAATCATTGCCATCATTGCCAAAAAGCAG	44	0	0	NA	NA	NA	2	2	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|99aa|up_0|NZ_CP024785.1_6832459_6832756_+,NA|155aa|down_0|NZ_CP024785.1_6838594_6839059_-,NA|121aa|down_1|NZ_CP024785.1_6839064_6839427_-,NA|122aa|down_2|NZ_CP024785.1_6839428_6839794_-,NA|78aa|down_3|NZ_CP024785.1_6839852_6840086_-,NA|526aa|down_5|NZ_CP024785.1_6840424_6842002_-,NA|261aa|down_6|NZ_CP024785.1_6842414_6843197_-,NA|185aa|down_7|NZ_CP024785.1_6843193_6843748_-,NA|181aa|down_8|NZ_CP024785.1_6843790_6844333_-,NA|117aa|down_9|NZ_CP024785.1_6844332_6844683_-	NA|238aa|up_9|NZ_CP024785.1_6821011_6821725_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|230aa|up_8|NZ_CP024785.1_6822117_6822807_+	cd00569, HTH_Hin_like, Helix-turn-helix domain of Hin and related proteins	NA|302aa|up_7|NZ_CP024785.1_6822806_6823712_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|712aa|up_6|NZ_CP024785.1_6823733_6825869_+	cd13566, PBP2_phosphate, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|105aa|up_5|NZ_CP024785.1_6825865_6826180_+	COG4026, COG4026, Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]	NA|477aa|up_4|NZ_CP024785.1_6826597_6828028_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|243aa|up_3|NZ_CP024785.1_6828243_6828972_+	pfam07444, Ycf66_N, Ycf66 protein N-terminus	NA|264aa|up_2|NZ_CP024785.1_6829116_6829908_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|375aa|up_1|NZ_CP024785.1_6831172_6832297_+	CHL00081, chlI, Mg-protoporyphyrin IX chelatase	NA|99aa|up_0|NZ_CP024785.1_6832459_6832756_+	NA	NA|155aa|down_0|NZ_CP024785.1_6838594_6839059_-	NA	NA|121aa|down_1|NZ_CP024785.1_6839064_6839427_-	NA	NA|122aa|down_2|NZ_CP024785.1_6839428_6839794_-	NA	NA|78aa|down_3|NZ_CP024785.1_6839852_6840086_-	NA	NA|79aa|down_4|NZ_CP024785.1_6840188_6840425_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|526aa|down_5|NZ_CP024785.1_6840424_6842002_-	NA	NA|261aa|down_6|NZ_CP024785.1_6842414_6843197_-	NA	NA|185aa|down_7|NZ_CP024785.1_6843193_6843748_-	NA	NA|181aa|down_8|NZ_CP024785.1_6843790_6844333_-	NA	NA|117aa|down_9|NZ_CP024785.1_6844332_6844683_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	43	6845621-6845746	40	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	ATGATCCACATAGGTTTTTGTATGTGGATCATG	33	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|121aa|up_9|NZ_CP024785.1_6839064_6839427_-,NA|122aa|up_8|NZ_CP024785.1_6839428_6839794_-,NA|78aa|up_7|NZ_CP024785.1_6839852_6840086_-,NA|526aa|up_5|NZ_CP024785.1_6840424_6842002_-,NA|261aa|up_4|NZ_CP024785.1_6842414_6843197_-,NA|185aa|up_3|NZ_CP024785.1_6843193_6843748_-,NA|181aa|up_2|NZ_CP024785.1_6843790_6844333_-,NA|117aa|up_1|NZ_CP024785.1_6844332_6844683_-,NA|53aa|up_0|NZ_CP024785.1_6844948_6845107_+,NA|113aa|down_0|NZ_CP024785.1_6847846_6848185_-,NA|68aa|down_1|NZ_CP024785.1_6848195_6848399_-,NA|75aa|down_2|NZ_CP024785.1_6848426_6848651_-,NA|49aa|down_4|NZ_CP024785.1_6849201_6849348_-,NA|123aa|down_6|NZ_CP024785.1_6850104_6850473_+,NA|90aa|down_7|NZ_CP024785.1_6850509_6850779_+,NA|121aa|down_8|NZ_CP024785.1_6850887_6851250_+	NA|121aa|up_9|NZ_CP024785.1_6839064_6839427_-	NA	NA|122aa|up_8|NZ_CP024785.1_6839428_6839794_-	NA	NA|78aa|up_7|NZ_CP024785.1_6839852_6840086_-	NA	NA|79aa|up_6|NZ_CP024785.1_6840188_6840425_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|526aa|up_5|NZ_CP024785.1_6840424_6842002_-	NA	NA|261aa|up_4|NZ_CP024785.1_6842414_6843197_-	NA	NA|185aa|up_3|NZ_CP024785.1_6843193_6843748_-	NA	NA|181aa|up_2|NZ_CP024785.1_6843790_6844333_-	NA	NA|117aa|up_1|NZ_CP024785.1_6844332_6844683_-	NA	NA|53aa|up_0|NZ_CP024785.1_6844948_6845107_+	NA	NA|113aa|down_0|NZ_CP024785.1_6847846_6848185_-	NA	NA|68aa|down_1|NZ_CP024785.1_6848195_6848399_-	NA	NA|75aa|down_2|NZ_CP024785.1_6848426_6848651_-	NA	NA|123aa|down_3|NZ_CP024785.1_6848747_6849116_+	pfam08872, KGK, KGK domain	NA|49aa|down_4|NZ_CP024785.1_6849201_6849348_-	NA	NA|64aa|down_5|NZ_CP024785.1_6849344_6849536_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|123aa|down_6|NZ_CP024785.1_6850104_6850473_+	NA	NA|90aa|down_7|NZ_CP024785.1_6850509_6850779_+	NA	NA|121aa|down_8|NZ_CP024785.1_6850887_6851250_+	NA	NA|237aa|down_9|NZ_CP024785.1_6851246_6851957_+	pfam01935, DUF87, Domain of unknown function DUF87
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	44	6849745-6849853	41	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	ACCCGTAAGTCGTAGCGTCGCGGGGTTAGC	30	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|181aa|up_9|NZ_CP024785.1_6843790_6844333_-,NA|117aa|up_8|NZ_CP024785.1_6844332_6844683_-,NA|53aa|up_7|NZ_CP024785.1_6844948_6845107_+,NA|113aa|up_5|NZ_CP024785.1_6847846_6848185_-,NA|68aa|up_4|NZ_CP024785.1_6848195_6848399_-,NA|75aa|up_3|NZ_CP024785.1_6848426_6848651_-,NA|49aa|up_1|NZ_CP024785.1_6849201_6849348_-,NA|123aa|down_0|NZ_CP024785.1_6850104_6850473_+,NA|90aa|down_1|NZ_CP024785.1_6850509_6850779_+,NA|121aa|down_2|NZ_CP024785.1_6850887_6851250_+,NA|200aa|down_4|NZ_CP024785.1_6851895_6852495_+,NA|127aa|down_5|NZ_CP024785.1_6853239_6853620_+	NA|181aa|up_9|NZ_CP024785.1_6843790_6844333_-	NA	NA|117aa|up_8|NZ_CP024785.1_6844332_6844683_-	NA	NA|53aa|up_7|NZ_CP024785.1_6844948_6845107_+	NA	NA|878aa|up_6|NZ_CP024785.1_6845083_6847717_-	COG5545, COG5545, Predicted P-loop ATPase and inactivated derivatives [General function prediction only]	NA|113aa|up_5|NZ_CP024785.1_6847846_6848185_-	NA	NA|68aa|up_4|NZ_CP024785.1_6848195_6848399_-	NA	NA|75aa|up_3|NZ_CP024785.1_6848426_6848651_-	NA	NA|123aa|up_2|NZ_CP024785.1_6848747_6849116_+	pfam08872, KGK, KGK domain	NA|49aa|up_1|NZ_CP024785.1_6849201_6849348_-	NA	NA|64aa|up_0|NZ_CP024785.1_6849344_6849536_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|123aa|down_0|NZ_CP024785.1_6850104_6850473_+	NA	NA|90aa|down_1|NZ_CP024785.1_6850509_6850779_+	NA	NA|121aa|down_2|NZ_CP024785.1_6850887_6851250_+	NA	NA|237aa|down_3|NZ_CP024785.1_6851246_6851957_+	pfam01935, DUF87, Domain of unknown function DUF87	NA|200aa|down_4|NZ_CP024785.1_6851895_6852495_+	NA	NA|127aa|down_5|NZ_CP024785.1_6853239_6853620_+	NA	NA|286aa|down_6|NZ_CP024785.1_6853579_6854437_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|78aa|down_7|NZ_CP024785.1_6855958_6856192_-	pfam07862, Nif11, Nif11 domain	NA|136aa|down_8|NZ_CP024785.1_6856645_6857053_+	CHL00075, rpl21, ribosomal protein L21	NA|99aa|down_9|NZ_CP024785.1_6857077_6857374_+	PRK05435, rpmA, 50S ribosomal protein L27; Validated
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	45	7141062-7141710	42,3	CRISPRCasFinder,CRT	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	AATTATCTAAATTACAGGTAGATGCTCAA,TNTCTGANTTACANNNNGATGCTCAANAACNAAANNA	29,37	0	0	NA	NA	NA:NA	7,5	7	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|87aa|up_0|NZ_CP024785.1_7139741_7140002_-,NA|254aa|down_0|NZ_CP024785.1_7142633_7143395_-,NA|69aa|down_1|NZ_CP024785.1_7145455_7145662_+,NA|297aa|down_3|NZ_CP024785.1_7146744_7147635_+,NA|56aa|down_7|NZ_CP024785.1_7150877_7151045_-,NA|133aa|down_8|NZ_CP024785.1_7151955_7152354_-,NA|53aa|down_9|NZ_CP024785.1_7152418_7152577_+	NA|556aa|up_9|NZ_CP024785.1_7128806_7130474_-	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]	NA|142aa|up_8|NZ_CP024785.1_7130583_7131009_+	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|271aa|up_7|NZ_CP024785.1_7131016_7131829_-	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|336aa|up_6|NZ_CP024785.1_7131829_7132837_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|579aa|up_5|NZ_CP024785.1_7133489_7135226_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|171aa|up_4|NZ_CP024785.1_7135390_7135903_-	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|577aa|up_3|NZ_CP024785.1_7136002_7137733_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|320aa|up_2|NZ_CP024785.1_7137952_7138912_+	COG2515, Acd, 1-aminocyclopropane-1-carboxylate deaminase [Amino acid transport and metabolism]	NA|170aa|up_1|NZ_CP024785.1_7139130_7139640_+	COG3472, COG3472, Uncharacterized conserved protein [Function unknown]	NA|87aa|up_0|NZ_CP024785.1_7139741_7140002_-	NA	NA|254aa|down_0|NZ_CP024785.1_7142633_7143395_-	NA	NA|69aa|down_1|NZ_CP024785.1_7145455_7145662_+	NA	NA|177aa|down_2|NZ_CP024785.1_7145933_7146464_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|297aa|down_3|NZ_CP024785.1_7146744_7147635_+	NA	NA|465aa|down_4|NZ_CP024785.1_7147762_7149157_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|110aa|down_5|NZ_CP024785.1_7149469_7149799_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|64aa|down_6|NZ_CP024785.1_7149919_7150111_+	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|56aa|down_7|NZ_CP024785.1_7150877_7151045_-	NA	NA|133aa|down_8|NZ_CP024785.1_7151955_7152354_-	NA	NA|53aa|down_9|NZ_CP024785.1_7152418_7152577_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	46	7222561-7222662	43	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GGGTGTAGGGGAAGAGTATTGATTTGTAAATA	32	1	1	7222593-7222630	NZ_CP024785.1_7222500-7222537	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|119aa|up_3|NZ_CP024785.1_7219107_7219464_-,NA|340aa|up_1|NZ_CP024785.1_7219990_7221010_+,NA|109aa|down_0|NZ_CP024785.1_7223089_7223416_-,NA|67aa|down_1|NZ_CP024785.1_7223623_7223824_-,NA|139aa|down_6|NZ_CP024785.1_7228550_7228967_+,NA|115aa|down_8|NZ_CP024785.1_7231169_7231514_+,NA|66aa|down_9|NZ_CP024785.1_7231698_7231896_-	NA|199aa|up_9|NZ_CP024785.1_7215504_7216101_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|311aa|up_8|NZ_CP024785.1_7216063_7216996_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|89aa|up_7|NZ_CP024785.1_7216908_7217175_-	pfam13384, HTH_23, Homeodomain-like domain	NA|154aa|up_6|NZ_CP024785.1_7217640_7218102_-	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|199aa|up_5|NZ_CP024785.1_7218255_7218852_-	pfam07154, DUF1392, Protein of unknown function (DUF1392)	NA|91aa|up_4|NZ_CP024785.1_7218848_7219121_-	cd04762, HTH_MerR-trunc, Helix-Turn-Helix DNA binding domain of truncated MerR-like proteins	NA|119aa|up_3|NZ_CP024785.1_7219107_7219464_-	NA	NA|60aa|up_2|NZ_CP024785.1_7219630_7219810_+	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|340aa|up_1|NZ_CP024785.1_7219990_7221010_+	NA	NA|155aa|up_0|NZ_CP024785.1_7221012_7221477_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|109aa|down_0|NZ_CP024785.1_7223089_7223416_-	NA	NA|67aa|down_1|NZ_CP024785.1_7223623_7223824_-	NA	NA|155aa|down_2|NZ_CP024785.1_7224428_7224893_-	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|170aa|down_3|NZ_CP024785.1_7225380_7225890_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|117aa|down_4|NZ_CP024785.1_7225852_7226203_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|131aa|down_5|NZ_CP024785.1_7226958_7227351_+	TIGR02052, periplasmic_mercuric_ion_binding_protein, mercuric transport protein periplasmic component	NA|139aa|down_6|NZ_CP024785.1_7228550_7228967_+	NA	NA|275aa|down_7|NZ_CP024785.1_7229574_7230399_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|115aa|down_8|NZ_CP024785.1_7231169_7231514_+	NA	NA|66aa|down_9|NZ_CP024785.1_7231698_7231896_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	47	7589514-7589593	44	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	GAACCAATGGTTCCTAGAATTCAG	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|215aa|up_8|NZ_CP024785.1_7576861_7577506_-,NA|47aa|up_6|NZ_CP024785.1_7582553_7582694_+,NA|48aa|up_4|NZ_CP024785.1_7586950_7587094_-,NA|103aa|up_2|NZ_CP024785.1_7587968_7588277_-,NA|155aa|up_0|NZ_CP024785.1_7588937_7589402_-,NA|79aa|down_3|NZ_CP024785.1_7595494_7595731_+,NA|137aa|down_4|NZ_CP024785.1_7595989_7596400_-,NA|56aa|down_7|NZ_CP024785.1_7600151_7600319_+	NA|281aa|up_9|NZ_CP024785.1_7575897_7576740_+	smart01040, Bro-N, BRO family, N-terminal domain	NA|215aa|up_8|NZ_CP024785.1_7576861_7577506_-	NA	NA|287aa|up_7|NZ_CP024785.1_7581686_7582547_+	sd00006, TPR, Tetratricopeptide repeat	NA|47aa|up_6|NZ_CP024785.1_7582553_7582694_+	NA	NA|177aa|up_5|NZ_CP024785.1_7583651_7584182_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|48aa|up_4|NZ_CP024785.1_7586950_7587094_-	NA	NA|248aa|up_3|NZ_CP024785.1_7587198_7587942_-	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|103aa|up_2|NZ_CP024785.1_7587968_7588277_-	NA	NA|98aa|up_1|NZ_CP024785.1_7588675_7588969_-	PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional	NA|155aa|up_0|NZ_CP024785.1_7588937_7589402_-	NA	NA|769aa|down_0|NZ_CP024785.1_7590250_7592557_+	TIGR01448, recD_rel, helicase, putative, RecD/TraA family	NA|259aa|down_1|NZ_CP024785.1_7593287_7594064_+	CHL00148, orf27, Ycf27; Reviewed	NA|437aa|down_2|NZ_CP024785.1_7594187_7595498_+	cd03784, GT1_Gtf-like, UDP-glycosyltransferases and similar proteins	NA|79aa|down_3|NZ_CP024785.1_7595494_7595731_+	NA	NA|137aa|down_4|NZ_CP024785.1_7595989_7596400_-	NA	NA|214aa|down_5|NZ_CP024785.1_7596681_7597323_+	cd01012, YcaC_related, YcaC related amidohydrolases; E	NA|194aa|down_6|NZ_CP024785.1_7597613_7598195_+	COG3224, COG3224, Uncharacterized protein conserved in bacteria [Function unknown]	NA|56aa|down_7|NZ_CP024785.1_7600151_7600319_+	NA	NA|476aa|down_8|NZ_CP024785.1_7600438_7601866_-	COG0270, Dcm, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|131aa|down_9|NZ_CP024785.1_7601862_7602255_-	cd06554, ASCH_ASC-1_like, ASC-1 homology domain, ASC-1-like subfamily
GCF_002813575.1_ASM281357v1	NZ_CP024785	Nostoc flagelliforme CCNUN1 chromosome, complete genome	48	7650504-7650609	45	CRISPRCasFinder	no		cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	Orphan	CTCACTACTCTTGGTAAAGATTTAG	25	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|223aa|up_9|NZ_CP024785.1_7640808_7641477_+,NA|72aa|up_8|NZ_CP024785.1_7641476_7641692_+,NA|210aa|up_5|NZ_CP024785.1_7644157_7644787_+,NA|148aa|up_4|NZ_CP024785.1_7644805_7645249_+,NA|58aa|up_3|NZ_CP024785.1_7645959_7646133_+,NA|46aa|up_1|NZ_CP024785.1_7648996_7649134_+,NA|386aa|down_0|NZ_CP024785.1_7653228_7654386_+,NA|64aa|down_1|NZ_CP024785.1_7654553_7654745_+,NA|59aa|down_3|NZ_CP024785.1_7655427_7655604_+,NA|140aa|down_4|NZ_CP024785.1_7655593_7656013_+	NA|223aa|up_9|NZ_CP024785.1_7640808_7641477_+	NA	NA|72aa|up_8|NZ_CP024785.1_7641476_7641692_+	NA	NA|524aa|up_7|NZ_CP024785.1_7641673_7643245_+	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|281aa|up_6|NZ_CP024785.1_7643267_7644110_+	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|210aa|up_5|NZ_CP024785.1_7644157_7644787_+	NA	NA|148aa|up_4|NZ_CP024785.1_7644805_7645249_+	NA	NA|58aa|up_3|NZ_CP024785.1_7645959_7646133_+	NA	NA|411aa|up_2|NZ_CP024785.1_7646923_7648156_-	pfam10592, AIPR, AIPR protein	NA|46aa|up_1|NZ_CP024785.1_7648996_7649134_+	NA	NA|375aa|up_0|NZ_CP024785.1_7649182_7650307_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|386aa|down_0|NZ_CP024785.1_7653228_7654386_+	NA	NA|64aa|down_1|NZ_CP024785.1_7654553_7654745_+	NA	NA|220aa|down_2|NZ_CP024785.1_7654746_7655406_+	TIGR02241, hypothetical_protein_SCD8A	NA|59aa|down_3|NZ_CP024785.1_7655427_7655604_+	NA	NA|140aa|down_4|NZ_CP024785.1_7655593_7656013_+	NA	NA|700aa|down_5|NZ_CP024785.1_7656707_7658807_+	COG3501, VgrG, Uncharacterized protein conserved in bacteria [Function unknown]	NA|400aa|down_6|NZ_CP024785.1_7659391_7660591_-	pfam13709, DUF4159, Domain of unknown function (DUF4159)	NA|376aa|down_7|NZ_CP024785.1_7660654_7661782_-	TIGR02242, putative_secreted_protein, phage tail protein domain	NA|737aa|down_8|NZ_CP024785.1_7661785_7663996_-	TIGR02243, hypothetical_protein_SCD8A	NA|139aa|down_9|NZ_CP024785.1_7663998_7664415_-	pfam04965, GPW_gp25, Gene 25-like lysozyme
GCF_002813575.1_ASM281357v1	NZ_CP024791	Nostoc flagelliforme CCNUN1 plasmid pNFSY06, complete sequence	1	1823-2051	1	CRISPRCasFinder	no			Orphan	GGTAATATTACTATCAATTCTGGTTCTTTCTCATTACAAAATGGCGCTCAACTTG	55	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|66aa|down_0|NZ_CP024791.1_3836_4034_+	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	NA|71aa|down_1|NZ_CP024791.1_4552_4765_-	smart01096, CPSase_L_D3, Carbamoyl-phosphate synthetase large chain, oligomerisation domain	NA|443aa|down_2|NZ_CP024791.1_6225_7554_+	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|436aa|down_3|NZ_CP024791.1_7915_9223_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|153aa|down_4|NZ_CP024791.1_9247_9706_-	PRK13413, mpi, master DNA invertase Mpi family serine-type recombinase	NA|223aa|down_5|NZ_CP024791.1_9798_10467_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|330aa|down_6|NZ_CP024791.1_10733_11723_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|545aa|down_7|NZ_CP024791.1_11712_13347_-	pfam00665, rve, Integrase core domain	NA|212aa|down_8|NZ_CP024791.1_13363_13999_-	PRK13413, mpi, master DNA invertase Mpi family serine-type recombinase	NA|311aa|down_9|NZ_CP024791.1_14091_15024_-	pfam13358, DDE_3, DDE superfamily endonuclease
GCF_002813575.1_ASM281357v1	NZ_CP024791	Nostoc flagelliforme CCNUN1 plasmid pNFSY06, complete sequence	2	214672-214739	2	CRISPRCasFinder	no			Orphan	ACCTACCCATAACGTTATGGGCG	23	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA,NA|254aa|down_1|NZ_CP024791.1_218509_219271_+,NA|122aa|down_3|NZ_CP024791.1_220439_220805_+,NA|138aa|down_5|NZ_CP024791.1_221783_222197_-,NA|170aa|down_7|NZ_CP024791.1_225474_225984_+,NA|143aa|down_8|NZ_CP024791.1_226173_226602_+,NA|153aa|down_9|NZ_CP024791.1_226739_227198_+	NA|545aa|up_9|NZ_CP024791.1_204638_206273_-	pfam00665, rve, Integrase core domain	NA|212aa|up_8|NZ_CP024791.1_206289_206925_-	PRK13413, mpi, master DNA invertase Mpi family serine-type recombinase	NA|311aa|up_7|NZ_CP024791.1_207017_207950_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|89aa|up_6|NZ_CP024791.1_207862_208129_-	pfam13384, HTH_23, Homeodomain-like domain	NA|117aa|up_5|NZ_CP024791.1_208444_208795_-	pfam13592, HTH_33, Winged helix-turn helix	NA|89aa|up_4|NZ_CP024791.1_208707_208974_-	pfam13384, HTH_23, Homeodomain-like domain	NA|572aa|up_3|NZ_CP024791.1_209745_211461_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|264aa|up_2|NZ_CP024791.1_211482_212274_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|288aa|up_1|NZ_CP024791.1_212250_213114_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|252aa|up_0|NZ_CP024791.1_213422_214178_+	pfam13614, AAA_31, AAA domain	NA|958aa|down_0|NZ_CP024791.1_215378_218252_+	cd18808, SF1_C_Upf1, C-terminal helicase domain of Upf1-like family helicases	NA|254aa|down_1|NZ_CP024791.1_218509_219271_+	NA	NA|264aa|down_2|NZ_CP024791.1_219328_220120_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|122aa|down_3|NZ_CP024791.1_220439_220805_+	NA	NA|311aa|down_4|NZ_CP024791.1_220867_221800_-	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|138aa|down_5|NZ_CP024791.1_221783_222197_-	NA	NA|135aa|down_6|NZ_CP024791.1_223955_224360_-	pfam05713, MobC, Bacterial mobilisation protein (MobC)	NA|170aa|down_7|NZ_CP024791.1_225474_225984_+	NA	NA|143aa|down_8|NZ_CP024791.1_226173_226602_+	NA	NA|153aa|down_9|NZ_CP024791.1_226739_227198_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024792	Nostoc flagelliforme CCNUN1 plasmid pNFSY07, complete sequence	1	115587-115661	1	CRISPRCasFinder	no			Orphan	GCGCTCGTTAACCAATGTCAACCG	24	1	1	115611-115637	NZ_CP024785.1_408223-408197	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|140aa|up_1|NZ_CP024792.1_111043_111463_+,NA|57aa|down_1|NZ_CP024792.1_116925_117096_+,NA|65aa|down_4|NZ_CP024792.1_121050_121245_-,NA|46aa|down_7|NZ_CP024792.1_124686_124824_-,NA|266aa|down_8|NZ_CP024792.1_124836_125634_+	NA|768aa|up_9|NZ_CP024792.1_98988_101292_+	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|422aa|up_8|NZ_CP024792.1_101619_102885_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|68aa|up_7|NZ_CP024792.1_102915_103119_+	pfam07862, Nif11, Nif11 domain	NA|806aa|up_6|NZ_CP024792.1_104009_106427_+	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|176aa|up_5|NZ_CP024792.1_106491_107019_+	cd00293, USP_Like, Usp: Universal stress protein family	NA|371aa|up_4|NZ_CP024792.1_108350_109463_+	TIGR00378, cax, calcium/proton exchanger (cax)	NA|303aa|up_3|NZ_CP024792.1_109564_110473_+	TIGR00378, cax, calcium/proton exchanger (cax)	NA|98aa|up_2|NZ_CP024792.1_110641_110935_+	COG0387, ChaA, Ca2+/H+ antiporter [Inorganic ion transport and metabolism]	NA|140aa|up_1|NZ_CP024792.1_111043_111463_+	NA	NA|946aa|up_0|NZ_CP024792.1_111756_114594_+	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|208aa|down_0|NZ_CP024792.1_115732_116356_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|57aa|down_1|NZ_CP024792.1_116925_117096_+	NA	NA|342aa|down_2|NZ_CP024792.1_119250_120276_-	pfam05860, Haemagg_act, haemagglutination activity domain	NA|213aa|down_3|NZ_CP024792.1_120321_120960_-	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|65aa|down_4|NZ_CP024792.1_121050_121245_-	NA	NA|191aa|down_5|NZ_CP024792.1_122147_122720_-	PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional	NA|107aa|down_6|NZ_CP024792.1_124115_124436_+	cd03784, GT1_Gtf-like, UDP-glycosyltransferases and similar proteins	NA|46aa|down_7|NZ_CP024792.1_124686_124824_-	NA	NA|266aa|down_8|NZ_CP024792.1_124836_125634_+	NA	NA|206aa|down_9|NZ_CP024792.1_125798_126416_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	1	48548-48719	1	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	CAAGAATTCCCACTCGCTGGGGATA	25	0	0	NA	NA	NA	2	2	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|71aa|up_2|NZ_CP024793.1_43086_43299_-,NA|131aa|up_0|NZ_CP024793.1_46831_47224_+,NA|47aa|down_0|NZ_CP024793.1_49804_49945_+,NA|63aa|down_4|NZ_CP024793.1_55984_56173_+,NA|227aa|down_7|NZ_CP024793.1_60664_61345_+	NA|132aa|up_9|NZ_CP024793.1_36426_36822_-	cd00315, Cyt_C5_DNA_methylase, Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology	NA|131aa|up_8|NZ_CP024793.1_38235_38628_-	cd06554, ASCH_ASC-1_like, ASC-1 homology domain, ASC-1-like subfamily	NA|273aa|up_7|NZ_CP024793.1_38970_39789_-	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|133aa|up_6|NZ_CP024793.1_39806_40205_-	pfam12616, DUF3775, Protein of unknown function (DUF3775)	NA|418aa|up_5|NZ_CP024793.1_40333_41587_-	TIGR00665, DnaB, replicative DNA helicase	NA|201aa|up_4|NZ_CP024793.1_41602_42205_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|132aa|up_3|NZ_CP024793.1_42162_42558_-	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|71aa|up_2|NZ_CP024793.1_43086_43299_-	NA	NA|707aa|up_1|NZ_CP024793.1_44464_46585_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|131aa|up_0|NZ_CP024793.1_46831_47224_+	NA	NA|47aa|down_0|NZ_CP024793.1_49804_49945_+	NA	NA|376aa|down_1|NZ_CP024793.1_50199_51327_-	pfam01202, SKI, Shikimate kinase	NA|401aa|down_2|NZ_CP024793.1_51964_53167_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|415aa|down_3|NZ_CP024793.1_54125_55370_+	TIGR04247, nitrous_oxide_maturation_protein_NosD, nitrous oxide reductase family maturation protein NosD	NA|63aa|down_4|NZ_CP024793.1_55984_56173_+	NA	NA|364aa|down_5|NZ_CP024793.1_56583_57675_+	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|252aa|down_6|NZ_CP024793.1_58779_59535_+	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	NA|227aa|down_7|NZ_CP024793.1_60664_61345_+	NA	NA|41aa|down_8|NZ_CP024793.1_64282_64405_+	cd01197, INT_FimBE_like, FimB and FimE and related proteins, integrase/recombinases	NA|71aa|down_9|NZ_CP024793.1_64769_64982_-	pfam11211, DUF2997, Protein of unknown function (DUF2997)
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	2	51789-51902	2	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	GTCGGTCAAGTAATAATGATTGACAAA	27	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|71aa|up_4|NZ_CP024793.1_43086_43299_-,NA|131aa|up_2|NZ_CP024793.1_46831_47224_+,NA|47aa|up_1|NZ_CP024793.1_49804_49945_+,NA|63aa|down_2|NZ_CP024793.1_55984_56173_+,NA|227aa|down_5|NZ_CP024793.1_60664_61345_+,NA|358aa|down_9|NZ_CP024793.1_65771_66845_-	NA|273aa|up_9|NZ_CP024793.1_38970_39789_-	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|133aa|up_8|NZ_CP024793.1_39806_40205_-	pfam12616, DUF3775, Protein of unknown function (DUF3775)	NA|418aa|up_7|NZ_CP024793.1_40333_41587_-	TIGR00665, DnaB, replicative DNA helicase	NA|201aa|up_6|NZ_CP024793.1_41602_42205_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|132aa|up_5|NZ_CP024793.1_42162_42558_-	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|71aa|up_4|NZ_CP024793.1_43086_43299_-	NA	NA|707aa|up_3|NZ_CP024793.1_44464_46585_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|131aa|up_2|NZ_CP024793.1_46831_47224_+	NA	NA|47aa|up_1|NZ_CP024793.1_49804_49945_+	NA	NA|376aa|up_0|NZ_CP024793.1_50199_51327_-	pfam01202, SKI, Shikimate kinase	NA|401aa|down_0|NZ_CP024793.1_51964_53167_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|415aa|down_1|NZ_CP024793.1_54125_55370_+	TIGR04247, nitrous_oxide_maturation_protein_NosD, nitrous oxide reductase family maturation protein NosD	NA|63aa|down_2|NZ_CP024793.1_55984_56173_+	NA	NA|364aa|down_3|NZ_CP024793.1_56583_57675_+	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|252aa|down_4|NZ_CP024793.1_58779_59535_+	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	NA|227aa|down_5|NZ_CP024793.1_60664_61345_+	NA	NA|41aa|down_6|NZ_CP024793.1_64282_64405_+	cd01197, INT_FimBE_like, FimB and FimE and related proteins, integrase/recombinases	NA|71aa|down_7|NZ_CP024793.1_64769_64982_-	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|150aa|down_8|NZ_CP024793.1_65208_65658_-	pfam06868, DUF1257, Protein of unknown function (DUF1257)	NA|358aa|down_9|NZ_CP024793.1_65771_66845_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	3	70016-70077	3	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	AATTCGTAATTCGTAATTATTAC	23	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|227aa|up_8|NZ_CP024793.1_60664_61345_+,NA|358aa|up_4|NZ_CP024793.1_65771_66845_-,NA|90aa|up_3|NZ_CP024793.1_67012_67282_-,NA|139aa|up_2|NZ_CP024793.1_67394_67811_-,NA|91aa|up_1|NZ_CP024793.1_67854_68127_-,NA|186aa|up_0|NZ_CP024793.1_68380_68938_-,NA|202aa|down_0|NZ_CP024793.1_70400_71006_-,NA|301aa|down_2|NZ_CP024793.1_71912_72815_+,NA|235aa|down_4|NZ_CP024793.1_73659_74364_+,NA|481aa|down_6|NZ_CP024793.1_75415_76858_+,NA|64aa|down_8|NZ_CP024793.1_79281_79473_-	NA|252aa|up_9|NZ_CP024793.1_58779_59535_+	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	NA|227aa|up_8|NZ_CP024793.1_60664_61345_+	NA	NA|41aa|up_7|NZ_CP024793.1_64282_64405_+	cd01197, INT_FimBE_like, FimB and FimE and related proteins, integrase/recombinases	NA|71aa|up_6|NZ_CP024793.1_64769_64982_-	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|150aa|up_5|NZ_CP024793.1_65208_65658_-	pfam06868, DUF1257, Protein of unknown function (DUF1257)	NA|358aa|up_4|NZ_CP024793.1_65771_66845_-	NA	NA|90aa|up_3|NZ_CP024793.1_67012_67282_-	NA	NA|139aa|up_2|NZ_CP024793.1_67394_67811_-	NA	NA|91aa|up_1|NZ_CP024793.1_67854_68127_-	NA	NA|186aa|up_0|NZ_CP024793.1_68380_68938_-	NA	NA|202aa|down_0|NZ_CP024793.1_70400_71006_-	NA	NA|116aa|down_1|NZ_CP024793.1_71428_71776_-	PRK08154, PRK08154, anaerobic benzoate catabolism transcriptional regulator; Reviewed	NA|301aa|down_2|NZ_CP024793.1_71912_72815_+	NA	NA|104aa|down_3|NZ_CP024793.1_72908_73220_+	pfam14277, DUF4364, Domain of unknown function (DUF4364)	NA|235aa|down_4|NZ_CP024793.1_73659_74364_+	NA	NA|309aa|down_5|NZ_CP024793.1_74378_75305_+	pfam07505, DUF5131, Protein of unknown function (DUF5131)	NA|481aa|down_6|NZ_CP024793.1_75415_76858_+	NA	NA|458aa|down_7|NZ_CP024793.1_76860_78234_+	PRK07773, PRK07773, replicative DNA helicase; Validated	NA|64aa|down_8|NZ_CP024793.1_79281_79473_-	NA	NA|272aa|down_9|NZ_CP024793.1_80564_81380_+	pfam00520, Ion_trans, Ion transport protein
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	4	305266-305375	4	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	GTGCGTTCCTCAAACTGTCACAATA	25	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|75aa|up_9|NZ_CP024793.1_294955_295180_+,NA|46aa|up_8|NZ_CP024793.1_296082_296220_-,NA|93aa|up_7|NZ_CP024793.1_296327_296606_+,NA|273aa|up_2|NZ_CP024793.1_301537_302356_+,NA|83aa|up_1|NZ_CP024793.1_303007_303256_+,NA|140aa|down_1|NZ_CP024793.1_312150_312570_-,NA|84aa|down_6|NZ_CP024793.1_315245_315497_-,NA|70aa|down_7|NZ_CP024793.1_315514_315724_-,NA|276aa|down_8|NZ_CP024793.1_315790_316618_-,NA|67aa|down_9|NZ_CP024793.1_317135_317336_-	NA|75aa|up_9|NZ_CP024793.1_294955_295180_+	NA	NA|46aa|up_8|NZ_CP024793.1_296082_296220_-	NA	NA|93aa|up_7|NZ_CP024793.1_296327_296606_+	NA	NA|132aa|up_6|NZ_CP024793.1_297160_297556_+	TIGR02997, RNA_polymerase_sigma_subunit_sigma70/sigma32, RNA polymerase sigma factor, cyanobacterial RpoD-like family	NA|322aa|up_5|NZ_CP024793.1_298060_299026_-	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|256aa|up_4|NZ_CP024793.1_299028_299796_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|50aa|up_3|NZ_CP024793.1_300091_300241_+	TIGR02997, RNA_polymerase_sigma_subunit_sigma70/sigma32, RNA polymerase sigma factor, cyanobacterial RpoD-like family	NA|273aa|up_2|NZ_CP024793.1_301537_302356_+	NA	NA|83aa|up_1|NZ_CP024793.1_303007_303256_+	NA	NA|428aa|up_0|NZ_CP024793.1_303684_304968_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1202aa|down_0|NZ_CP024793.1_308452_312058_-	pfam12965, DUF3854, Domain of unknown function (DUF3854)	NA|140aa|down_1|NZ_CP024793.1_312150_312570_-	NA	NA|200aa|down_2|NZ_CP024793.1_312635_313235_-	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|150aa|down_3|NZ_CP024793.1_313279_313729_-	PRK11525, dinD, DNA-damage-inducible protein D; Provisional	NA|153aa|down_4|NZ_CP024793.1_314119_314578_-	pfam07154, DUF1392, Protein of unknown function (DUF1392)	NA|122aa|down_5|NZ_CP024793.1_314782_315148_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|84aa|down_6|NZ_CP024793.1_315245_315497_-	NA	NA|70aa|down_7|NZ_CP024793.1_315514_315724_-	NA	NA|276aa|down_8|NZ_CP024793.1_315790_316618_-	NA	NA|67aa|down_9|NZ_CP024793.1_317135_317336_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	5	370243-370326	5	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	TGTCAACCATTGGTTTACAGAATG	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|253aa|up_8|NZ_CP024793.1_363485_364244_-,NA|70aa|up_4|NZ_CP024793.1_366122_366332_-,NA|276aa|up_3|NZ_CP024793.1_366407_367235_-,NA|68aa|up_2|NZ_CP024793.1_367775_367979_-,NA|109aa|up_1|NZ_CP024793.1_367971_368298_-,NA|292aa|up_0|NZ_CP024793.1_368457_369333_-,NA|63aa|down_3|NZ_CP024793.1_380046_380235_+,NA|112aa|down_6|NZ_CP024793.1_383037_383373_-,NA|62aa|down_9|NZ_CP024793.1_387519_387705_+	NA|200aa|up_9|NZ_CP024793.1_362841_363441_-	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|253aa|up_8|NZ_CP024793.1_363485_364244_-	NA	NA|154aa|up_7|NZ_CP024793.1_364271_364733_-	pfam07154, DUF1392, Protein of unknown function (DUF1392)	NA|327aa|up_6|NZ_CP024793.1_364729_365710_-	COG0270, Dcm, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|133aa|up_5|NZ_CP024793.1_365706_366105_-	cd04762, HTH_MerR-trunc, Helix-Turn-Helix DNA binding domain of truncated MerR-like proteins	NA|70aa|up_4|NZ_CP024793.1_366122_366332_-	NA	NA|276aa|up_3|NZ_CP024793.1_366407_367235_-	NA	NA|68aa|up_2|NZ_CP024793.1_367775_367979_-	NA	NA|109aa|up_1|NZ_CP024793.1_367971_368298_-	NA	NA|292aa|up_0|NZ_CP024793.1_368457_369333_-	NA	NA|488aa|down_0|NZ_CP024793.1_371149_372613_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|285aa|down_1|NZ_CP024793.1_372708_373563_-	smart00191, Int_alpha, Integrin alpha (beta-propellor repeats)	NA|264aa|down_2|NZ_CP024793.1_376555_377347_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|63aa|down_3|NZ_CP024793.1_380046_380235_+	NA	NA|257aa|down_4|NZ_CP024793.1_380293_381064_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|637aa|down_5|NZ_CP024793.1_381130_383041_-	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|112aa|down_6|NZ_CP024793.1_383037_383373_-	NA	NA|53aa|down_7|NZ_CP024793.1_383523_383682_-	pfam09274, ParG, ParG	NA|856aa|down_8|NZ_CP024793.1_384783_387351_+	smart00191, Int_alpha, Integrin alpha (beta-propellor repeats)	NA|62aa|down_9|NZ_CP024793.1_387519_387705_+	NA
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	6	646177-646299	6	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	CCTAGACTATGTTTGGACTAAGCCTAGACTAAGCC	35	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|181aa|up_8|NZ_CP024793.1_628991_629534_-,NA|235aa|up_5|NZ_CP024793.1_634611_635316_-,NA|307aa|up_3|NZ_CP024793.1_638419_639340_-,NA|94aa|up_0|NZ_CP024793.1_645796_646078_+,NA|59aa|down_0|NZ_CP024793.1_646640_646817_-,NA|60aa|down_1|NZ_CP024793.1_648450_648630_+	NA|661aa|up_9|NZ_CP024793.1_626878_628861_-	pfam14280, DUF4365, Domain of unknown function (DUF4365)	NA|181aa|up_8|NZ_CP024793.1_628991_629534_-	NA	NA|165aa|up_7|NZ_CP024793.1_629530_630025_-	pfam14280, DUF4365, Domain of unknown function (DUF4365)	NA|1508aa|up_6|NZ_CP024793.1_630064_634588_-	COG1205, COG1205, Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster [General function prediction only]	NA|235aa|up_5|NZ_CP024793.1_634611_635316_-	NA	NA|1039aa|up_4|NZ_CP024793.1_635306_638423_-	pfam13401, AAA_22, AAA domain	NA|307aa|up_3|NZ_CP024793.1_638419_639340_-	NA	NA|844aa|up_2|NZ_CP024793.1_639336_641868_-	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|1103aa|up_1|NZ_CP024793.1_641864_645173_-	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|94aa|up_0|NZ_CP024793.1_645796_646078_+	NA	NA|59aa|down_0|NZ_CP024793.1_646640_646817_-	NA	NA|60aa|down_1|NZ_CP024793.1_648450_648630_+	NA	NA|488aa|down_2|NZ_CP024793.1_650359_651823_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|71aa|down_3|NZ_CP024793.1_652158_652371_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|112aa|down_4|NZ_CP024793.1_652367_652703_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|30aa|down_5|NZ_CP024793.1_652728_652818_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|46aa|down_6|NZ_CP024793.1_652823_652961_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|435aa|down_7|NZ_CP024793.1_653108_654413_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|151aa|down_8|NZ_CP024793.1_654415_654868_-	pfam13700, DUF4158, Domain of unknown function (DUF4158)	NA|987aa|down_9|NZ_CP024793.1_656908_659869_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	7	685720-685851	7	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	CGACATCCTCAATGGTGGCAATGG	24	0	0	NA	NA	NA	2	2	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|79aa|up_9|NZ_CP024793.1_670475_670712_+,NA|97aa|up_3|NZ_CP024793.1_679608_679899_-,NA|59aa|up_0|NZ_CP024793.1_682378_682555_-,NA|134aa|down_5|NZ_CP024793.1_693529_693931_-,NA|102aa|down_8|NZ_CP024793.1_698226_698532_-,NA|71aa|down_9|NZ_CP024793.1_698830_699043_-	NA|79aa|up_9|NZ_CP024793.1_670475_670712_+	NA	NA|351aa|up_8|NZ_CP024793.1_671180_672233_+	PLN02433, PLN02433, uroporphyrinogen decarboxylase	NA|329aa|up_7|NZ_CP024793.1_672238_673225_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|280aa|up_6|NZ_CP024793.1_675785_676625_+	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|391aa|up_5|NZ_CP024793.1_676900_678073_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|317aa|up_4|NZ_CP024793.1_678336_679287_+	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|97aa|up_3|NZ_CP024793.1_679608_679899_-	NA	NA|265aa|up_2|NZ_CP024793.1_680445_681240_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|381aa|up_1|NZ_CP024793.1_681239_682382_+	cd16393, SPO0J_N, Thermus thermophilus stage 0 sporulation protein J-like N-terminal domain, ParB family member	NA|59aa|up_0|NZ_CP024793.1_682378_682555_-	NA	NA|299aa|down_0|NZ_CP024793.1_687480_688377_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|298aa|down_1|NZ_CP024793.1_688672_689566_+	COG4798, COG4798, Predicted methyltransferase [General function prediction only]	NA|267aa|down_2|NZ_CP024793.1_690217_691018_+	pfam12138, Spherulin4, Spherulation-specific family 4	NA|264aa|down_3|NZ_CP024793.1_691236_692028_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|220aa|down_4|NZ_CP024793.1_692402_693062_-	cd02970, PRX_like2, Peroxiredoxin (PRX)-like 2 family; hypothetical proteins that show sequence similarity to PRXs	NA|134aa|down_5|NZ_CP024793.1_693529_693931_-	NA	NA|121aa|down_6|NZ_CP024793.1_694211_694574_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|669aa|down_7|NZ_CP024793.1_694895_696902_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|102aa|down_8|NZ_CP024793.1_698226_698532_-	NA	NA|71aa|down_9|NZ_CP024793.1_698830_699043_-	NA
GCF_002813575.1_ASM281357v1	NZ_CP024793	Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence	8	778011-778111	8	CRISPRCasFinder	no		cas3,Cas9_archaeal	Orphan	TATATATGATGTGTACTACCGCGT	24	0	0	NA	NA	NA	1	1	Orphan	cas14k,csa3,cas14j,2OG_CAS,cas6,cas8b3,cas5,RT,c2c9_V-U4,PD-DExK,DinG,c2c5_V-U5,Cas14c_CAS-V-F,cas3,Cas9_archaeal,Cas14u_CAS-V	NA|87aa|up_9|NZ_CP024793.1_767061_767322_-,NA|76aa|up_8|NZ_CP024793.1_767349_767577_-,NA|72aa|up_7|NZ_CP024793.1_767573_767789_-,NA|144aa|up_6|NZ_CP024793.1_767885_768317_-,NA|184aa|up_5|NZ_CP024793.1_768391_768943_-,NA|90aa|up_4|NZ_CP024793.1_769183_769453_+,NA|221aa|up_1|NZ_CP024793.1_772502_773165_+,NA|116aa|down_3|NZ_CP024793.1_782575_782923_-	NA|87aa|up_9|NZ_CP024793.1_767061_767322_-	NA	NA|76aa|up_8|NZ_CP024793.1_767349_767577_-	NA	NA|72aa|up_7|NZ_CP024793.1_767573_767789_-	NA	NA|144aa|up_6|NZ_CP024793.1_767885_768317_-	NA	NA|184aa|up_5|NZ_CP024793.1_768391_768943_-	NA	NA|90aa|up_4|NZ_CP024793.1_769183_769453_+	NA	NA|100aa|up_3|NZ_CP024793.1_769746_770046_+	PRK06113, PRK06113, 7-alpha-hydroxysteroid dehydrogenase; Validated	NA|533aa|up_2|NZ_CP024793.1_770246_771845_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|221aa|up_1|NZ_CP024793.1_772502_773165_+	NA	NA|1345aa|up_0|NZ_CP024793.1_773330_777365_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|741aa|down_0|NZ_CP024793.1_778634_780857_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|419aa|down_1|NZ_CP024793.1_780864_782121_+	pfam00529, HlyD, HlyD membrane-fusion protein of T1SS	NA|187aa|down_2|NZ_CP024793.1_782001_782562_-	COG3831, COG3831, Uncharacterized conserved protein [Function unknown]	NA|116aa|down_3|NZ_CP024793.1_782575_782923_-	NA	NA|541aa|down_4|NZ_CP024793.1_783018_784641_-	CHL00195, ycf46, Ycf46; Provisional	NA|70aa|down_5|NZ_CP024793.1_784732_784942_-	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|150aa|down_6|NZ_CP024793.1_784945_785395_-	pfam06868, DUF1257, Protein of unknown function (DUF1257)	NA|264aa|down_7|NZ_CP024793.1_786257_787049_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|167aa|down_8|NZ_CP024793.1_787036_787537_+	pfam07154, DUF1392, Protein of unknown function (DUF1392)	NA|48aa|down_9|NZ_CP024793.1_788131_788275_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1
