assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	2	106630-106735	2	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	TTACCCCACCCTAACCCTCCCCTTGGAAAGG	31	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|134aa|up_6|CP026692.1_91280_91682_+,NA|147aa|down_0|CP026692.1_106930_107371_-,NA|201aa|down_2|CP026692.1_111635_112238_-	NA|112aa|up_9|CP026692.1_89834_90170_+	pfam08872, KGK, KGK domain	NA|89aa|up_8|CP026692.1_90311_90578_+	pfam03795, YCII, YCII-related domain	NA|128aa|up_7|CP026692.1_90737_91121_+	COG3937, COG3937, Uncharacterized conserved protein [Function unknown]	NA|134aa|up_6|CP026692.1_91280_91682_+	NA	NA|138aa|up_5|CP026692.1_91762_92176_+	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	NA|360aa|up_4|CP026692.1_92213_93293_-	TIGR00975, precursor_PBP-3_PstS-3_Antigen_Ag88	NA|466aa|up_3|CP026692.1_93447_94845_-	CHL00177, ccs1, c-type cytochrome biogenensis protein; Validated	NA|247aa|up_2|CP026692.1_94874_95615_-	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|2206aa|up_1|CP026692.1_96234_102852_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|887aa|up_0|CP026692.1_103714_106375_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|147aa|down_0|CP026692.1_106930_107371_-	NA	NA|1230aa|down_1|CP026692.1_107858_111548_-	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|201aa|down_2|CP026692.1_111635_112238_-	NA	NA|385aa|down_3|CP026692.1_112252_113407_-	pfam10017, Methyltransf_33, Histidine-specific methyltransferase, SAM-dependent	NA|319aa|down_4|CP026692.1_113976_114933_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|741aa|down_5|CP026692.1_115535_117758_-	cd01948, EAL, EAL domain	NA|1422aa|down_6|CP026692.1_118370_122636_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|349aa|down_7|CP026692.1_123991_125038_-	PRK04204, PRK04204, RNA 3'-terminal phosphate cyclase	NA|456aa|down_8|CP026692.1_125182_126550_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|221aa|down_9|CP026692.1_126731_127394_-	cd10450, GIY-YIG_AtGrxS16_like, GIY-YIG domain found in CAXIP1-like proteins, iron-sulfur cluster assembly proteins, and similar proteins
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	3	892240-893645	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas10d,csc2gr7,csc1gr5,cas3,2OG_CAS,cas6,cas4,cas1,cas2	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Type I-D	ATTGCAATTCATCAAAATCCCTATTAGGG----------ATTGAAAC,ATTGCAATTCATCAAAATCCCTATTAGGGATTGAAAC,ATTGCAATTCATCAAAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	19,19,19	19	TypeI-D	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	cas10d|898aa|up_8|CP026692.1_881514_884208_+,NA|220aa|down_1|CP026692.1_894754_895414_-	NA|84aa|up_9|CP026692.1_880784_881036_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	cas10d|898aa|up_8|CP026692.1_881514_884208_+	NA	csc2gr7|343aa|up_7|CP026692.1_884225_885254_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|253aa|up_6|CP026692.1_885257_886016_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	cas3|761aa|up_5|CP026692.1_886008_888291_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	2OG_CAS|207aa|up_4|CP026692.1_888367_888988_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas6|279aa|up_3|CP026692.1_888977_889814_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_2|CP026692.1_889845_890439_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|335aa|up_1|CP026692.1_890656_891661_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|96aa|up_0|CP026692.1_891683_891971_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|195aa|down_0|CP026692.1_893685_894270_-	pfam03358, FMN_red, NADPH-dependent FMN reductase	NA|220aa|down_1|CP026692.1_894754_895414_-	NA	NA|245aa|down_2|CP026692.1_895894_896629_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|497aa|down_3|CP026692.1_896621_898112_-	cd13628, PBP2_Ala, Periplasmic substrate binding domain of ABC-type transporter specific to alanine; the type 2 periplasmic binding protein	NA|398aa|down_4|CP026692.1_898208_899402_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|327aa|down_5|CP026692.1_899880_900861_-	PRK06522, PRK06522, 2-dehydropantoate 2-reductase; Reviewed	NA|290aa|down_6|CP026692.1_900911_901781_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains	NA|247aa|down_7|CP026692.1_902992_903733_-	PRK08057, PRK08057, cobalt-precorrin-6x reductase; Reviewed	NA|156aa|down_8|CP026692.1_903729_904197_-	COG5469, COG5469, Predicted metal-binding protein [Function unknown]	NA|260aa|down_9|CP026692.1_904499_905279_-	PRK06136, PRK06136, uroporphyrinogen-III C-methyltransferase
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	5	971108-972373	5,2,2	CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	17,17,16	17	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA,NA|106aa|down_4|CP026692.1_977491_977809_-,NA|258aa|down_5|CP026692.1_977970_978744_-,NA|230aa|down_6|CP026692.1_979168_979858_+,NA|136aa|down_7|CP026692.1_979868_980276_-	NA|321aa|up_9|CP026692.1_954818_955781_+	TIGR01136, Cysteine_synthase, cysteine synthase	NA|229aa|up_8|CP026692.1_956176_956863_-	COG4300, CadD, Predicted permease, cadmium resistance protein [Inorganic ion transport and metabolism]	NA|221aa|up_7|CP026692.1_957278_957941_+	COG4300, CadD, Predicted permease, cadmium resistance protein [Inorganic ion transport and metabolism]	NA|266aa|up_6|CP026692.1_957998_958796_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|228aa|up_5|CP026692.1_959694_960378_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|148aa|up_4|CP026692.1_960419_960863_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|277aa|up_3|CP026692.1_961954_962785_+	COG1562, ERG9, Phytoene/squalene synthetase [Lipid metabolism]	NA|533aa|up_2|CP026692.1_963256_964855_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|244aa|up_1|CP026692.1_965500_966232_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|849aa|up_0|CP026692.1_968367_970914_+	pfam12770, CHAT, CHAT domain	NA|279aa|down_0|CP026692.1_973276_974113_-	PRK07417, PRK07417, prephenate/arogenate dehydrogenase	NA|536aa|down_1|CP026692.1_974138_975746_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|197aa|down_2|CP026692.1_975809_976400_+	pfam07466, DUF1517, Protein of unknown function (DUF1517)	NA|297aa|down_3|CP026692.1_976470_977361_+	pfam06682, SARAF, SOCE-associated regulatory factor of calcium homoeostasis	NA|106aa|down_4|CP026692.1_977491_977809_-	NA	NA|258aa|down_5|CP026692.1_977970_978744_-	NA	NA|230aa|down_6|CP026692.1_979168_979858_+	NA	NA|136aa|down_7|CP026692.1_979868_980276_-	NA	NA|222aa|down_8|CP026692.1_980343_981009_+	pfam01596, Methyltransf_3, O-methyltransferase	NA|583aa|down_9|CP026692.1_981067_982816_-	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	7	1387145-1387269	7	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	CAATGCCAATGCGAGGGCTTGCATTGTTAATGCATCGGCTT	41	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|282aa|up_6|CP026692.1_1375195_1376041_+,NA|263aa|up_4|CP026692.1_1377010_1377799_-,NA|113aa|down_0|CP026692.1_1387529_1387868_-,NA|120aa|down_1|CP026692.1_1387867_1388227_-,NA|159aa|down_3|CP026692.1_1389437_1389914_-,NA|99aa|down_5|CP026692.1_1392660_1392957_-,NA|337aa|down_8|CP026692.1_1398405_1399416_+	NA|260aa|up_9|CP026692.1_1370853_1371633_-	PRK14258, PRK14258, phosphate ABC transporter ATP-binding protein; Provisional	NA|224aa|up_8|CP026692.1_1372292_1372964_-	PRK14260, PRK14260, phosphate ABC transporter ATP-binding protein; Provisional	NA|325aa|up_7|CP026692.1_1374140_1375115_-	COG1300, SpoIIM, Uncharacterized membrane protein [Function unknown]	NA|282aa|up_6|CP026692.1_1375195_1376041_+	NA	NA|262aa|up_5|CP026692.1_1376086_1376872_+	pfam06271, RDD, RDD family	NA|263aa|up_4|CP026692.1_1377010_1377799_-	NA	NA|330aa|up_3|CP026692.1_1378142_1379132_+	PRK07452, PRK07452, DNA polymerase III subunit delta; Validated	NA|161aa|up_2|CP026692.1_1379241_1379724_+	pfam13767, DUF4168, Domain of unknown function (DUF4168)	NA|1463aa|up_1|CP026692.1_1379960_1384349_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|800aa|up_0|CP026692.1_1384536_1386936_-	COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|113aa|down_0|CP026692.1_1387529_1387868_-	NA	NA|120aa|down_1|CP026692.1_1387867_1388227_-	NA	NA|405aa|down_2|CP026692.1_1388230_1389445_-	pfam01566, Nramp, Natural resistance-associated macrophage protein	NA|159aa|down_3|CP026692.1_1389437_1389914_-	NA	NA|297aa|down_4|CP026692.1_1391483_1392374_+	PRK13236, PRK13236, nitrogenase reductase; Reviewed	NA|99aa|down_5|CP026692.1_1392660_1392957_-	NA	NA|703aa|down_6|CP026692.1_1393179_1395288_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|709aa|down_7|CP026692.1_1395821_1397948_+	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|337aa|down_8|CP026692.1_1398405_1399416_+	NA	NA|150aa|down_9|CP026692.1_1399420_1399870_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	10	1927161-1929063	3,9,4	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	ATTGCAATCAACTAAAATCCCTATTAGGG----------ATTGAAAC,ATTGCAATCAACTAAAATCCCTATTAGGGATTGAAAC,ATTGCAATCAACTAAAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	26,26,26	26	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|138aa|up_7|CP026692.1_1917979_1918393_+,NA|108aa|up_0|CP026692.1_1926488_1926812_+,NA|84aa|down_3|CP026692.1_1931779_1932031_-	NA|286aa|up_9|CP026692.1_1916051_1916909_-	cd01945, ribokinase_group_B, Ribokinase-like subgroup B	NA|200aa|up_8|CP026692.1_1916921_1917521_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|138aa|up_7|CP026692.1_1917979_1918393_+	NA	NA|325aa|up_6|CP026692.1_1918624_1919599_+	PRK07399, PRK07399, DNA polymerase III subunit delta'; Validated	NA|398aa|up_5|CP026692.1_1919808_1921002_+	COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit [Energy production and conversion]	NA|120aa|up_4|CP026692.1_1921135_1921495_-	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|387aa|up_3|CP026692.1_1922652_1923813_+	pfam12617, LdpA_C, Iron-Sulfur binding protein C terminal	NA|578aa|up_2|CP026692.1_1923887_1925621_+	COG3854, SpoIIIAA, ncharacterized protein conserved in bacteria [Function unknown]	NA|62aa|up_1|CP026692.1_1926123_1926309_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|108aa|up_0|CP026692.1_1926488_1926812_+	NA	NA|90aa|down_0|CP026692.1_1929331_1929601_+	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|157aa|down_1|CP026692.1_1929612_1930083_+	pfam11947, DUF3464, Protein of unknown function (DUF3464)	NA|353aa|down_2|CP026692.1_1930369_1931428_+	PRK13396, PRK13396, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|84aa|down_3|CP026692.1_1931779_1932031_-	NA	NA|419aa|down_4|CP026692.1_1932386_1933643_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|299aa|down_5|CP026692.1_1933821_1934718_+	COG4360, APA2, ATP adenylyltransferase (5',5'''-P-1,P-4-tetraphosphate phosphorylase II) [Nucleotide transport and metabolism]	NA|232aa|down_6|CP026692.1_1934941_1935637_-	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|583aa|down_7|CP026692.1_1935636_1937385_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|339aa|down_8|CP026692.1_1938380_1939397_+	PRK02812, PRK02812, ribose-phosphate pyrophosphokinase; Provisional	NA|232aa|down_9|CP026692.1_1939624_1940320_+	pfam05685, Uma2, Putative restriction endonuclease
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	11	2382335-2382455	10	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	CTGCGATCGCTGACAACGATACTCATGTCTAT	32	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA,NA|124aa|down_5|CP026692.1_2387501_2387873_-	NA|509aa|up_9|CP026692.1_2365310_2366837_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|273aa|up_8|CP026692.1_2368410_2369229_+	cd05243, SDR_a5, atypical (a) SDRs, subgroup 5	NA|641aa|up_7|CP026692.1_2369662_2371585_+	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|213aa|up_6|CP026692.1_2371720_2372359_+	pfam06314, ADC, Acetoacetate decarboxylase (ADC)	NA|627aa|up_5|CP026692.1_2373747_2375628_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|341aa|up_4|CP026692.1_2376292_2377315_+	sd00006, TPR, Tetratricopeptide repeat	NA|338aa|up_3|CP026692.1_2377492_2378506_+	cd08276, MDR7, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|224aa|up_2|CP026692.1_2378583_2379255_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|376aa|up_1|CP026692.1_2379494_2380622_-	pfam05935, Arylsulfotrans, Arylsulfotransferase (ASST)	NA|346aa|up_0|CP026692.1_2380746_2381784_-	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|249aa|down_0|CP026692.1_2382879_2383626_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|237aa|down_1|CP026692.1_2383706_2384417_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|137aa|down_2|CP026692.1_2384612_2385023_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|197aa|down_3|CP026692.1_2385293_2385884_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|377aa|down_4|CP026692.1_2386109_2387240_-	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|124aa|down_5|CP026692.1_2387501_2387873_-	NA	NA|428aa|down_6|CP026692.1_2388603_2389887_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|256aa|down_7|CP026692.1_2390056_2390824_+	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|293aa|down_8|CP026692.1_2393906_2394785_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|311aa|down_9|CP026692.1_2394809_2395742_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	12	2387391-2387509	11	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	TGTTTGTCTAATTAACCTGTAATGACTATTTAA	33	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA,NA	NA|341aa|up_9|CP026692.1_2376292_2377315_+	sd00006, TPR, Tetratricopeptide repeat	NA|338aa|up_8|CP026692.1_2377492_2378506_+	cd08276, MDR7, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|224aa|up_7|CP026692.1_2378583_2379255_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|376aa|up_6|CP026692.1_2379494_2380622_-	pfam05935, Arylsulfotrans, Arylsulfotransferase (ASST)	NA|346aa|up_5|CP026692.1_2380746_2381784_-	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|249aa|up_4|CP026692.1_2382879_2383626_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|237aa|up_3|CP026692.1_2383706_2384417_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|137aa|up_2|CP026692.1_2384612_2385023_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|197aa|up_1|CP026692.1_2385293_2385884_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|377aa|up_0|CP026692.1_2386109_2387240_-	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|428aa|down_0|CP026692.1_2388603_2389887_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|256aa|down_1|CP026692.1_2390056_2390824_+	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|293aa|down_2|CP026692.1_2393906_2394785_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|311aa|down_3|CP026692.1_2394809_2395742_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|down_4|CP026692.1_2396551_2397802_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|down_5|CP026692.1_2398222_2398678_+	pfam01475, FUR, Ferric uptake regulator family	NA|380aa|down_6|CP026692.1_2398810_2399950_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1393aa|down_7|CP026692.1_2399959_2404138_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|277aa|down_8|CP026692.1_2406367_2407198_-	COG0189, RimK, Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis]	NA|292aa|down_9|CP026692.1_2407281_2408157_-	PHA03247, PHA03247, large tegument protein UL36; Provisional
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	13	2391077-2393564	4,12,5	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	ATCGCAATCAAATAAAATCCCTATTAGGG----------ATTGAAAC,ATCGCAATCAAATAAAATCCCTATTAGGGATTGAAAC,ATCGCAATCAAATAAAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	34,34,34	34	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|124aa|up_2|CP026692.1_2387501_2387873_-,NA	NA|376aa|up_9|CP026692.1_2379494_2380622_-	pfam05935, Arylsulfotrans, Arylsulfotransferase (ASST)	NA|346aa|up_8|CP026692.1_2380746_2381784_-	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|249aa|up_7|CP026692.1_2382879_2383626_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|237aa|up_6|CP026692.1_2383706_2384417_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|137aa|up_5|CP026692.1_2384612_2385023_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|197aa|up_4|CP026692.1_2385293_2385884_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|377aa|up_3|CP026692.1_2386109_2387240_-	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|124aa|up_2|CP026692.1_2387501_2387873_-	NA	NA|428aa|up_1|CP026692.1_2388603_2389887_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|256aa|up_0|CP026692.1_2390056_2390824_+	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|293aa|down_0|CP026692.1_2393906_2394785_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|311aa|down_1|CP026692.1_2394809_2395742_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|down_2|CP026692.1_2396551_2397802_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|down_3|CP026692.1_2398222_2398678_+	pfam01475, FUR, Ferric uptake regulator family	NA|380aa|down_4|CP026692.1_2398810_2399950_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1393aa|down_5|CP026692.1_2399959_2404138_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|277aa|down_6|CP026692.1_2406367_2407198_-	COG0189, RimK, Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis]	NA|292aa|down_7|CP026692.1_2407281_2408157_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|306aa|down_8|CP026692.1_2408292_2409210_+	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|450aa|down_9|CP026692.1_2409215_2410565_+	COG3950, COG3950, Predicted ATP-binding protein involved in virulence [General function prediction only]
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	14	2693486-2693591	13	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	CAGAGCAAGTAAATTTATTTATGGA	25	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|137aa|up_7|CP026692.1_2680029_2680440_-,NA|213aa|up_6|CP026692.1_2684532_2685171_+,NA|196aa|up_1|CP026692.1_2691241_2691829_-,NA|308aa|up_0|CP026692.1_2692157_2693081_-,NA|125aa|down_3|CP026692.1_2698945_2699320_+,NA|166aa|down_6|CP026692.1_2702548_2703046_+,NA|123aa|down_8|CP026692.1_2704177_2704546_+	NA|442aa|up_9|CP026692.1_2677370_2678696_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|170aa|up_8|CP026692.1_2678752_2679262_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|137aa|up_7|CP026692.1_2680029_2680440_-	NA	NA|213aa|up_6|CP026692.1_2684532_2685171_+	NA	NA|454aa|up_5|CP026692.1_2685270_2686632_+	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|669aa|up_4|CP026692.1_2686631_2688638_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|369aa|up_3|CP026692.1_2688653_2689760_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|380aa|up_2|CP026692.1_2689776_2690916_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|196aa|up_1|CP026692.1_2691241_2691829_-	NA	NA|308aa|up_0|CP026692.1_2692157_2693081_-	NA	NA|462aa|down_0|CP026692.1_2693804_2695190_-	PRK09201, PRK09201, AtzE family amidohydrolase	NA|63aa|down_1|CP026692.1_2695186_2695375_-	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|710aa|down_2|CP026692.1_2695952_2698082_-	pfam15902, Sortilin-Vps10, Sortilin, neurotensin receptor 3,	NA|125aa|down_3|CP026692.1_2698945_2699320_+	NA	NA|397aa|down_4|CP026692.1_2699523_2700714_-	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|450aa|down_5|CP026692.1_2700777_2702127_-	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|166aa|down_6|CP026692.1_2702548_2703046_+	NA	NA|152aa|down_7|CP026692.1_2703518_2703974_-	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|123aa|down_8|CP026692.1_2704177_2704546_+	NA	NA|864aa|down_9|CP026692.1_2704831_2707423_+	cd01031, EriC, ClC chloride channel EriC
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	16	2703297-2703402	15	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	TACAGAGCAAGTAAATTTATTTATGG	26	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|196aa|up_8|CP026692.1_2691241_2691829_-,NA|308aa|up_7|CP026692.1_2692157_2693081_-,NA|125aa|up_3|CP026692.1_2698945_2699320_+,NA|166aa|up_0|CP026692.1_2702548_2703046_+,NA|123aa|down_1|CP026692.1_2704177_2704546_+,NA|206aa|down_7|CP026692.1_2714906_2715524_+	NA|380aa|up_9|CP026692.1_2689776_2690916_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|196aa|up_8|CP026692.1_2691241_2691829_-	NA	NA|308aa|up_7|CP026692.1_2692157_2693081_-	NA	NA|462aa|up_6|CP026692.1_2693804_2695190_-	PRK09201, PRK09201, AtzE family amidohydrolase	NA|63aa|up_5|CP026692.1_2695186_2695375_-	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|710aa|up_4|CP026692.1_2695952_2698082_-	pfam15902, Sortilin-Vps10, Sortilin, neurotensin receptor 3,	NA|125aa|up_3|CP026692.1_2698945_2699320_+	NA	NA|397aa|up_2|CP026692.1_2699523_2700714_-	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|450aa|up_1|CP026692.1_2700777_2702127_-	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|166aa|up_0|CP026692.1_2702548_2703046_+	NA	NA|152aa|down_0|CP026692.1_2703518_2703974_-	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|123aa|down_1|CP026692.1_2704177_2704546_+	NA	NA|864aa|down_2|CP026692.1_2704831_2707423_+	cd01031, EriC, ClC chloride channel EriC	NA|1234aa|down_3|CP026692.1_2708303_2712005_+	cd09914, RocCOR, Ras of complex proteins (Roc) C-terminal of Roc (COR) domain family	NA|138aa|down_4|CP026692.1_2712267_2712681_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|326aa|down_5|CP026692.1_2712823_2713801_+	COG4638, HcaE, Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit [Inorganic ion transport and metabolism / General function prediction only]	NA|252aa|down_6|CP026692.1_2713994_2714750_-	COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms]	NA|206aa|down_7|CP026692.1_2714906_2715524_+	NA	NA|364aa|down_8|CP026692.1_2716113_2717205_+	pfam14258, DUF4350, Domain of unknown function (DUF4350)	NA|317aa|down_9|CP026692.1_2717265_2718216_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	17	2892131-2892741	16,6,5	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Unclear	GTTTCCATTAATTCAACTTCCGAAGAAGTTTAAA,GTTTCCATTAATTCAACTTCCGAAGAAGTTTAAAG,GTTTCCATTAATTCAACTTCCGAAGAAGTTTAAAG	34,35,35	0	0	NA	NA	NA:NA:NA	8,8,7	8	Unclear	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|123aa|up_9|CP026692.1_2882793_2883162_+,NA|153aa|up_8|CP026692.1_2883525_2883984_-,NA|97aa|up_5|CP026692.1_2886291_2886582_+,NA|61aa|up_4|CP026692.1_2889284_2889467_-,NA|100aa|up_3|CP026692.1_2889604_2889904_-,NA|90aa|up_2|CP026692.1_2890148_2890418_-,NA|227aa|down_3|CP026692.1_2899124_2899805_-,NA|156aa|down_4|CP026692.1_2899807_2900275_-,NA|98aa|down_8|CP026692.1_2902927_2903221_-	NA|123aa|up_9|CP026692.1_2882793_2883162_+	NA	NA|153aa|up_8|CP026692.1_2883525_2883984_-	NA	NA|262aa|up_7|CP026692.1_2883976_2884762_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|321aa|up_6|CP026692.1_2885326_2886289_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|97aa|up_5|CP026692.1_2886291_2886582_+	NA	NA|61aa|up_4|CP026692.1_2889284_2889467_-	NA	NA|100aa|up_3|CP026692.1_2889604_2889904_-	NA	NA|90aa|up_2|CP026692.1_2890148_2890418_-	NA	cas2|93aa|up_1|CP026692.1_2890542_2890821_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|348aa|up_0|CP026692.1_2890823_2891867_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas1|669aa|down_0|CP026692.1_2892952_2894959_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|253aa|down_1|CP026692.1_2895188_2895947_+	pfam14326, DUF4384, Domain of unknown function (DUF4384)	NA|372aa|down_2|CP026692.1_2897834_2898950_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|227aa|down_3|CP026692.1_2899124_2899805_-	NA	NA|156aa|down_4|CP026692.1_2899807_2900275_-	NA	NA|345aa|down_5|CP026692.1_2900278_2901313_-	TIGR02225, Tyrosine_recombinase_XerD, tyrosine recombinase XerD	NA|112aa|down_6|CP026692.1_2901299_2901635_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|139aa|down_7|CP026692.1_2901622_2902039_-	pfam08814, XisH, XisH protein	NA|98aa|down_8|CP026692.1_2902927_2903221_-	NA	NA|372aa|down_9|CP026692.1_2904818_2905934_-	pfam01609, DDE_Tnp_1, Transposase DDE domain
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	18	3725986-3726107	17	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	CGGCTTGTCGTTAGACATCGCCTGTTGTAGAAGAG	35	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|213aa|up_7|CP026692.1_3717277_3717916_+,NA|87aa|up_6|CP026692.1_3718255_3718516_-,NA|303aa|down_5|CP026692.1_3735942_3736851_+	NA|227aa|up_9|CP026692.1_3715283_3715964_-	sd00006, TPR, Tetratricopeptide repeat	NA|109aa|up_8|CP026692.1_3716613_3716940_+	COG0393, COG0393, Uncharacterized conserved protein [Function unknown]	NA|213aa|up_7|CP026692.1_3717277_3717916_+	NA	NA|87aa|up_6|CP026692.1_3718255_3718516_-	NA	NA|228aa|up_5|CP026692.1_3718574_3719258_-	TIGR02702, transcriptional_regulator, iron-sulfur cluster biosynthesis transcriptional regulator SufR	NA|480aa|up_4|CP026692.1_3719502_3720942_+	PRK11814, PRK11814, cysteine desulfurase activator complex subunit SufB; Provisional	NA|264aa|up_3|CP026692.1_3721069_3721861_+	CHL00131, ycf16, sulfate ABC transporter protein; Validated	NA|469aa|up_2|CP026692.1_3721860_3723267_+	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD	NA|421aa|up_1|CP026692.1_3723505_3724768_+	PLN02855, PLN02855, Bifunctional selenocysteine lyase/cysteine desulfurase	NA|75aa|up_0|CP026692.1_3725607_3725832_-	TIGR02574, hypothetical_protein, putative addiction module component, TIGR02574 family	NA|280aa|down_0|CP026692.1_3726251_3727091_-	pfam09520, RE_TdeIII, Type II restriction endonuclease, TdeIII	NA|479aa|down_1|CP026692.1_3727108_3728545_-	cd00315, Cyt_C5_DNA_methylase, Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology	NA|1329aa|down_2|CP026692.1_3728564_3732551_-	PRK12493, PRK12493, magnesium chelatase subunit H; Provisional	NA|381aa|down_3|CP026692.1_3733243_3734386_-	COG0859, RfaF, ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane]	NA|321aa|down_4|CP026692.1_3734573_3735536_-	COG0859, RfaF, ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane]	NA|303aa|down_5|CP026692.1_3735942_3736851_+	NA	NA|419aa|down_6|CP026692.1_3736983_3738240_-	cd03825, GT4_WcaC-like, putative colanic acid biosynthesis glycosyl transferase WcaC and similar proteins	NA|246aa|down_7|CP026692.1_3738338_3739076_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|1650aa|down_8|CP026692.1_3739353_3744303_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|643aa|down_9|CP026692.1_3744299_3746228_-	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	19	3876227-3877119	6,18,7	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	ATTGCAATTAACTAAAATCCCTATTAGGG----------ATTGAAAC,ATTGCAATTAACTAAAATCCCTATTAGGGATTGAAAC,ATTGCAATTAACTAAAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	12,12,12	12	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|98aa|up_2|CP026692.1_3873522_3873816_-,NA|246aa|down_1|CP026692.1_3879357_3880095_-,NA|124aa|down_4|CP026692.1_3883547_3883919_-	NA|485aa|up_9|CP026692.1_3864491_3865946_+	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|429aa|up_8|CP026692.1_3866046_3867333_+	cd02110, SO_family_Moco_dimer, Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain	NA|280aa|up_7|CP026692.1_3867369_3868209_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|500aa|up_6|CP026692.1_3868231_3869731_-	PRK05137, tolB, Tol-Pal system protein TolB	NA|258aa|up_5|CP026692.1_3869783_3870557_-	TIGR03943, TIGR03943, TIGR03943 family protein	NA|351aa|up_4|CP026692.1_3870635_3871688_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|356aa|up_3|CP026692.1_3872376_3873444_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|98aa|up_2|CP026692.1_3873522_3873816_-	NA	NA|57aa|up_1|CP026692.1_3874182_3874353_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|364aa|up_0|CP026692.1_3874833_3875925_+	cd05305, L-AlaDH, Alanine dehydrogenase NAD-binding and catalytic domains	NA|352aa|down_0|CP026692.1_3877919_3878975_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|246aa|down_1|CP026692.1_3879357_3880095_-	NA	NA|301aa|down_2|CP026692.1_3881145_3882048_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|410aa|down_3|CP026692.1_3882222_3883452_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|124aa|down_4|CP026692.1_3883547_3883919_-	NA	NA|559aa|down_5|CP026692.1_3884939_3886616_-	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|117aa|down_6|CP026692.1_3887350_3887701_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|115aa|down_7|CP026692.1_3887704_3888049_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|637aa|down_8|CP026692.1_3888205_3890116_-	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|372aa|down_9|CP026692.1_3890626_3891742_+	PRK02615, PRK02615, thiamine phosphate synthase
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	20	4099479-4099587	19	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	ATTCTCACTGCAGGATCGGGTAATGATA	28	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|136aa|up_4|CP026692.1_4089814_4090222_-,NA|342aa|down_3|CP026692.1_4104676_4105702_+,NA|64aa|down_6|CP026692.1_4108833_4109025_-,NA|331aa|down_7|CP026692.1_4109309_4110302_-,NA|144aa|down_8|CP026692.1_4111326_4111758_-	NA|424aa|up_9|CP026692.1_4081895_4083167_+	pfam13646, HEAT_2, HEAT repeats	NA|481aa|up_8|CP026692.1_4083551_4084994_-	PRK00654, glgA, glycogen synthase GlgA	NA|331aa|up_7|CP026692.1_4085128_4086121_-	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|324aa|up_6|CP026692.1_4086631_4087603_+	PRK00861, PRK00861, putative lipid kinase; Reviewed	NA|517aa|up_5|CP026692.1_4087731_4089282_+	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|136aa|up_4|CP026692.1_4089814_4090222_-	NA	NA|145aa|up_3|CP026692.1_4090564_4090999_-	pfam14159, CAAD, CAAD domains of cyanobacterial aminoacyl-tRNA synthetase	NA|154aa|up_2|CP026692.1_4091180_4091642_-	pfam04972, BON, BON domain	NA|427aa|up_1|CP026692.1_4091781_4093062_-	pfam06897, DUF1269, Protein of unknown function (DUF1269)	NA|1041aa|up_0|CP026692.1_4093696_4096819_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|562aa|down_0|CP026692.1_4100615_4102301_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|207aa|down_1|CP026692.1_4102639_4103260_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|298aa|down_2|CP026692.1_4103404_4104298_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|342aa|down_3|CP026692.1_4104676_4105702_+	NA	NA|390aa|down_4|CP026692.1_4105823_4106993_-	pfam01837, HcyBio, Homocysteine biosynthesis enzyme, sulfur-incorporation	NA|384aa|down_5|CP026692.1_4107591_4108743_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|64aa|down_6|CP026692.1_4108833_4109025_-	NA	NA|331aa|down_7|CP026692.1_4109309_4110302_-	NA	NA|144aa|down_8|CP026692.1_4111326_4111758_-	NA	NA|588aa|down_9|CP026692.1_4112744_4114508_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	22	4562063-4562169	21	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	ATAAAACCTTCCCTGGCAGGTTCGT	25	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|210aa|up_6|CP026692.1_4553006_4553636_-,NA|118aa|up_1|CP026692.1_4559011_4559365_-,NA|100aa|down_3|CP026692.1_4566007_4566307_+,NA|159aa|down_4|CP026692.1_4566426_4566903_-,NA|270aa|down_8|CP026692.1_4570369_4571179_-	NA|300aa|up_9|CP026692.1_4549228_4550128_-	COG0581, PstA, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism]	NA|386aa|up_8|CP026692.1_4550838_4551996_-	COG1613, Sbp, ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|72aa|up_7|CP026692.1_4552688_4552904_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|210aa|up_6|CP026692.1_4553006_4553636_-	NA	NA|418aa|up_5|CP026692.1_4553678_4554932_-	COG0420, SbcD, DNA repair exonuclease [DNA replication, recombination, and repair]	NA|296aa|up_4|CP026692.1_4555007_4555895_-	TIGR02057, Phosphoadenosine_phosphosulfate_reductase, phosphoadenosine phosphosulfate reductase, thioredoxin dependent	NA|530aa|up_3|CP026692.1_4556446_4558036_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|214aa|up_2|CP026692.1_4558267_4558909_+	pfam16859, TetR_C_11, Bacterial transcriptional repressor C-terminal	NA|118aa|up_1|CP026692.1_4559011_4559365_-	NA	NA|757aa|up_0|CP026692.1_4559691_4561962_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|333aa|down_0|CP026692.1_4562210_4563209_+	cd11548, NodZ_like, Alpha 1,6-fucosyltransferase similar to Bradyrhizobium NodZ	NA|239aa|down_1|CP026692.1_4563457_4564174_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|360aa|down_2|CP026692.1_4564589_4565669_+	COG3227, LasB, Zinc metalloprotease (elastase) [Amino acid transport and metabolism]	NA|100aa|down_3|CP026692.1_4566007_4566307_+	NA	NA|159aa|down_4|CP026692.1_4566426_4566903_-	NA	NA|194aa|down_5|CP026692.1_4567627_4568209_-	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|319aa|down_6|CP026692.1_4568211_4569168_-	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|311aa|down_7|CP026692.1_4569192_4570125_-	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|270aa|down_8|CP026692.1_4570369_4571179_-	NA	NA|192aa|down_9|CP026692.1_4571568_4572144_+	PRK05800, cobU, adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	23	4702744-4702841	22	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	ACTCTACTGCTTTCTAAAAAATTAGCC	27	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|152aa|up_5|CP026692.1_4695223_4695679_+,NA|175aa|down_9|CP026692.1_4715696_4716221_+	NA|258aa|up_9|CP026692.1_4690241_4691015_-	pfam13672, PP2C_2, Protein phosphatase 2C	NA|394aa|up_8|CP026692.1_4691232_4692414_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|336aa|up_7|CP026692.1_4692739_4693747_-	PLN02389, PLN02389, biotin synthase	NA|192aa|up_6|CP026692.1_4694135_4694711_+	PRK10809, PRK10809, 30S ribosomal protein S5 alanine N-acetyltransferase	NA|152aa|up_5|CP026692.1_4695223_4695679_+	NA	NA|486aa|up_4|CP026692.1_4696004_4697462_+	pfam02696, UPF0061, Uncharacterized ACR, YdiU/UPF0061 family	NA|599aa|up_3|CP026692.1_4697567_4699364_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|656aa|up_2|CP026692.1_4699541_4701509_+	cd01948, EAL, EAL domain	NA|112aa|up_1|CP026692.1_4701837_4702173_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|122aa|up_0|CP026692.1_4702172_4702538_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|332aa|down_0|CP026692.1_4702930_4703926_-	pfam05239, PRC, PRC-barrel domain	NA|1211aa|down_1|CP026692.1_4704125_4707758_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|274aa|down_2|CP026692.1_4708418_4709240_+	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|153aa|down_3|CP026692.1_4709385_4709844_-	pfam04972, BON, BON domain	NA|307aa|down_4|CP026692.1_4710310_4711231_+	COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and biogenesis]	NA|281aa|down_5|CP026692.1_4711547_4712390_+	sd00006, TPR, Tetratricopeptide repeat	NA|523aa|down_6|CP026692.1_4712525_4714094_-	PRK09395, actP, cation/acetate symporter ActP	NA|92aa|down_7|CP026692.1_4714246_4714522_-	pfam04341, DUF485, Protein of unknown function, DUF485	NA|203aa|down_8|CP026692.1_4715050_4715659_+	cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|175aa|down_9|CP026692.1_4715696_4716221_+	NA
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	24	4963884-4963977	23	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	TAGCGCAGCGTTAGCGAGTCATCGAGCGTCTGGGG	35	1	1	4963919-4963942	CP026692.1_2652331-2652308	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|79aa|up_6|CP026692.1_4950982_4951219_-,NA|220aa|up_5|CP026692.1_4951876_4952536_-,NA|1326aa|up_0|CP026692.1_4959783_4963761_+,NA|213aa|down_0|CP026692.1_4964097_4964736_-,NA|68aa|down_1|CP026692.1_4965038_4965242_-	NA|146aa|up_9|CP026692.1_4946092_4946530_+	PRK11907, PRK11907, bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase	NA|627aa|up_8|CP026692.1_4947872_4949753_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|82aa|up_7|CP026692.1_4950241_4950487_-	CHL00065, psaC, photosystem I subunit VII	NA|79aa|up_6|CP026692.1_4950982_4951219_-	NA	NA|220aa|up_5|CP026692.1_4951876_4952536_-	NA	NA|406aa|up_4|CP026692.1_4952741_4953959_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|490aa|up_3|CP026692.1_4953978_4955448_-	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|323aa|up_2|CP026692.1_4956250_4957219_-	PRK09553, tauD, taurine dioxygenase; Reviewed	NA|531aa|up_1|CP026692.1_4957870_4959463_+	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|1326aa|up_0|CP026692.1_4959783_4963761_+	NA	NA|213aa|down_0|CP026692.1_4964097_4964736_-	NA	NA|68aa|down_1|CP026692.1_4965038_4965242_-	NA	NA|320aa|down_2|CP026692.1_4965766_4966726_-	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|433aa|down_3|CP026692.1_4967360_4968659_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|139aa|down_4|CP026692.1_4968704_4969121_+	pfam08814, XisH, XisH protein	NA|115aa|down_5|CP026692.1_4969108_4969453_+	pfam08869, XisI, XisI protein	NA|649aa|down_6|CP026692.1_4969734_4971681_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|124aa|down_7|CP026692.1_4971834_4972206_-	pfam13248, zf-ribbon_3, zinc-ribbon domain	NA|459aa|down_8|CP026692.1_4972464_4973841_+	pfam13614, AAA_31, AAA domain	NA|246aa|down_9|CP026692.1_4973840_4974578_+	cd16393, SPO0J_N, Thermus thermophilus stage 0 sporulation protein J-like N-terminal domain, ParB family member
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	29	6260679-6261008	7,27,9	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	GTGGCAACAACCCTCCAGGTACTGGGTGGGTTGAAAG,GTGGCAACAACCCTCCAGGTACTGGGTGGGTTGAAAG,GTGGCAACAACCCTCCAGGTACTGGGTGGGTTGAAAG	37,37,37	0	0	NA	NA	V-U5:V-U5:V-U5	4,4,4	4	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA|283aa|up_7|CP026692.1_6247566_6248415_-,NA|170aa|up_6|CP026692.1_6248672_6249182_+,NA|207aa|down_0|CP026692.1_6261563_6262184_-	NA|108aa|up_9|CP026692.1_6246278_6246602_-	pfam08844, DUF1815, Domain of unknown function (DUF1815)	NA|170aa|up_8|CP026692.1_6246740_6247250_-	COG3265, GntK, Gluconate kinase [Carbohydrate transport and metabolism]	NA|283aa|up_7|CP026692.1_6247566_6248415_-	NA	NA|170aa|up_6|CP026692.1_6248672_6249182_+	NA	NA|398aa|up_5|CP026692.1_6249605_6250799_-	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|166aa|up_4|CP026692.1_6251448_6251946_+	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|317aa|up_3|CP026692.1_6252075_6253026_+	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|388aa|up_2|CP026692.1_6254127_6255291_-	cd02152, OAT, Ornithine acetyltransferase (OAT) family; also referred to as ArgJ	NA|505aa|up_1|CP026692.1_6255543_6257058_-	cd17534, REC_DC-like, phosphoacceptor receiver (REC) domain of modulated diguanylate cyclase and similar domains	NA|658aa|up_0|CP026692.1_6257076_6259050_-	PRK13560, PRK13560, hypothetical protein; Provisional	NA|207aa|down_0|CP026692.1_6261563_6262184_-	NA	NA|387aa|down_1|CP026692.1_6263171_6264332_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|230aa|down_2|CP026692.1_6264362_6265052_-	pfam02517, Abi, CAAX protease self-immunity	NA|517aa|down_3|CP026692.1_6265748_6267299_+	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|365aa|down_4|CP026692.1_6267920_6269015_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|240aa|down_5|CP026692.1_6269736_6270456_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|563aa|down_6|CP026692.1_6270643_6272332_+	PRK12561, PRK12561, NAD(P)H-quinone oxidoreductase subunit 4; Provisional	NA|327aa|down_7|CP026692.1_6272550_6273531_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|305aa|down_8|CP026692.1_6273629_6274544_+	PRK01212, PRK01212, homoserine kinase; Provisional	NA|170aa|down_9|CP026692.1_6274573_6275083_+	PRK11448, hsdR, type I restriction enzyme EcoKI subunit R; Provisional
GCA_002949795.1_ASM294979v1	CP026692	Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 chromosome, complete genome	31	6811429-6811504	29	CRISPRCasFinder	no		PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh	Orphan	TTTGTCCTTTGTGAAAAACAATGAC	25	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas3,WYL,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2,DinG,csa3,RT,DEDDh,Cas9_archaeal	NA,NA|129aa|down_1|CP026692.1_6813910_6814297_+,NA|390aa|down_4|CP026692.1_6822464_6823634_+	NA|189aa|up_9|CP026692.1_6800994_6801561_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|294aa|up_8|CP026692.1_6801656_6802538_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|113aa|up_7|CP026692.1_6802691_6803030_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|82aa|up_6|CP026692.1_6803016_6803262_-	COG0864, NikR, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain and a metal-binding domain [Transcription]	NA|221aa|up_5|CP026692.1_6804612_6805275_+	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|861aa|up_4|CP026692.1_6805321_6807904_-	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|179aa|up_3|CP026692.1_6808399_6808936_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|324aa|up_2|CP026692.1_6809037_6810009_-	PRK01209, cobD, cobalamin biosynthesis protein	NA|147aa|up_1|CP026692.1_6810066_6810507_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|138aa|up_0|CP026692.1_6810976_6811390_+	pfam14250, AbrB-like, AbrB-like transcriptional regulator	NA|630aa|down_0|CP026692.1_6811545_6813435_-	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|129aa|down_1|CP026692.1_6813910_6814297_+	NA	NA|1478aa|down_2|CP026692.1_6815032_6819466_+	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain	NA|540aa|down_3|CP026692.1_6820236_6821856_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|390aa|down_4|CP026692.1_6822464_6823634_+	NA	NA|436aa|down_5|CP026692.1_6823647_6824955_-	COG1961, PinR, Site-specific recombinases, DNA invertase Pin homologs [DNA replication, recombination, and repair]	NA|416aa|down_6|CP026692.1_6825857_6827105_+	smart00812, Alpha_L_fucos, Alpha-L-fucosidase	NA|962aa|down_7|CP026692.1_6828325_6831211_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|344aa|down_8|CP026692.1_6838656_6839688_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|239aa|down_9|CP026692.1_6840449_6841166_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
