assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	1	426065-426177	1	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	TCTGTCATATCTGTCGTTTCTGT	23	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|1367aa|up_8|NC_019676.1_410783_414884_-,NA|111aa|up_3|NC_019676.1_420393_420726_-,NA|228aa|down_5|NC_019676.1_433989_434673_+,NA|78aa|down_8|NC_019676.1_436843_437077_+	NA|693aa|up_9|NC_019676.1_408617_410696_+	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|1367aa|up_8|NC_019676.1_410783_414884_-	NA	NA|234aa|up_7|NC_019676.1_415932_416634_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|255aa|up_6|NC_019676.1_416646_417411_+	COG4241, COG4241, Predicted membrane protein [Function unknown]	NA|373aa|up_5|NC_019676.1_417476_418595_+	PRK04447, PRK04447, hypothetical protein; Provisional	NA|480aa|up_4|NC_019676.1_418774_420214_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|111aa|up_3|NC_019676.1_420393_420726_-	NA	NA|901aa|up_2|NC_019676.1_421435_424138_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|155aa|up_1|NC_019676.1_424382_424847_+	cd07891, CYTH-like_CthTTM-like_1, CYTH-like Clostridium thermocellum TTM-like subgroup 1	NA|244aa|up_0|NC_019676.1_424878_425610_-	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|456aa|down_0|NC_019676.1_426909_428277_+	TIGR01976, am_tr_V_VC1184, cysteine desulfurase family protein, VC1184 subfamily	NA|410aa|down_1|NC_019676.1_428322_429552_+	PRK09237, PRK09237, amidohydrolase/deacetylase family metallohydrolase	NA|775aa|down_2|NC_019676.1_430107_432432_+	pfam12770, CHAT, CHAT domain	NA|180aa|down_3|NC_019676.1_432498_433038_+	pfam05685, Uma2, Putative restriction endonuclease	NA|279aa|down_4|NC_019676.1_433126_433963_+	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|228aa|down_5|NC_019676.1_433989_434673_+	NA	NA|210aa|down_6|NC_019676.1_434746_435376_-	PRK05953, PRK05953, Precorrin-8X methylmutase	NA|270aa|down_7|NC_019676.1_435558_436368_-	pfam04536, TPM_phosphatase, TPM domain	NA|78aa|down_8|NC_019676.1_436843_437077_+	NA	NA|133aa|down_9|NC_019676.1_437083_437482_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	2	532634-532730	2	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	ATGATTTGGGATAATTATCTGCGT	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|129aa|up_1|NC_019676.1_531051_531438_-,NA	NA|272aa|up_9|NC_019676.1_520300_521116_+	COG3176, COG3176, Putative hemolysin [General function prediction only]	NA|537aa|up_8|NC_019676.1_521180_522791_+	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|575aa|up_7|NC_019676.1_523200_524925_-	PRK09319, PRK09319, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase RibB/GTP cyclohydrolase II RibA	NA|353aa|up_6|NC_019676.1_525386_526445_+	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|430aa|up_5|NC_019676.1_526505_527795_-	PRK00077, eno, enolase; Provisional	NA|229aa|up_4|NC_019676.1_528121_528808_+	PRK07580, PRK07580, Mg-protoporphyrin IX methyl transferase; Validated	NA|323aa|up_3|NC_019676.1_528915_529884_+	PRK07399, PRK07399, DNA polymerase III subunit delta'; Validated	NA|271aa|up_2|NC_019676.1_530061_530874_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|129aa|up_1|NC_019676.1_531051_531438_-	NA	NA|288aa|up_0|NC_019676.1_531612_532476_+	pfam06485, DUF1092, Protein of unknown function (DUF1092)	NA|438aa|down_0|NC_019676.1_532774_534088_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|882aa|down_1|NC_019676.1_534243_536889_+	cd01031, EriC, ClC chloride channel EriC	NA|159aa|down_2|NC_019676.1_536925_537402_+	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	NA|391aa|down_3|NC_019676.1_537464_538637_+	PRK07411, PRK07411, molybdopterin-synthase adenylyltransferase MoeB	NA|397aa|down_4|NC_019676.1_538744_539935_+	cd17313, MFS_SLC45_SUC, Solute carrier family 45 and similar sugar transporters of the Major Facilitator Superfamily of transporters	NA|549aa|down_5|NC_019676.1_539931_541578_-	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|248aa|down_6|NC_019676.1_541694_542438_-	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|293aa|down_7|NC_019676.1_548406_549285_+	PRK09348, glyQ, glycyl-tRNA synthetase subunit alpha; Validated	NA|388aa|down_8|NC_019676.1_549543_550707_-	TIGR03169, selenide_water_dikinase_putative, pyridine nucleotide-disulfide oxidoreductase family protein	NA|169aa|down_9|NC_019676.1_550850_551357_+	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	3	2044433-2044519	3	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	TTTCATTTGCAGTTTGTCTTCATC	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|70aa|up_8|NC_019676.1_2031952_2032162_+,NA|56aa|up_0|NC_019676.1_2043177_2043345_-,NA|113aa|down_0|NC_019676.1_2044544_2044883_-	NA|212aa|up_9|NC_019676.1_2031034_2031670_+	PRK12757, PRK12757, cell division protein FtsN; Provisional	NA|70aa|up_8|NC_019676.1_2031952_2032162_+	NA	NA|145aa|up_7|NC_019676.1_2032435_2032870_+	TIGR00068, Lactoylglutathione_lyase, lactoylglutathione lyase	NA|881aa|up_6|NC_019676.1_2033078_2035721_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|357aa|up_5|NC_019676.1_2035968_2037039_+	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]	NA|132aa|up_4|NC_019676.1_2037143_2037539_+	pfam12680, SnoaL_2, SnoaL-like domain	NA|609aa|up_3|NC_019676.1_2037635_2039462_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|440aa|up_2|NC_019676.1_2039888_2041208_+	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|574aa|up_1|NC_019676.1_2041331_2043053_-	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|56aa|up_0|NC_019676.1_2043177_2043345_-	NA	NA|113aa|down_0|NC_019676.1_2044544_2044883_-	NA	NA|656aa|down_1|NC_019676.1_2044998_2046966_-	COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones]	NA|422aa|down_2|NC_019676.1_2047041_2048307_-	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|228aa|down_3|NC_019676.1_2048564_2049248_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|389aa|down_4|NC_019676.1_2049372_2050539_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|181aa|down_5|NC_019676.1_2050991_2051534_+	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|557aa|down_6|NC_019676.1_2051605_2053276_+	COG3497, COG3497, Phage tail sheath protein FI [General function prediction only]	NA|162aa|down_7|NC_019676.1_2053531_2054017_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|156aa|down_8|NC_019676.1_2054073_2054541_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|164aa|down_9|NC_019676.1_2054670_2055162_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	4	2133534-2133831	1	CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	TCTGTCAACTTTCCAACTTGAC	22	0	0	NA	NA	NA	4	4	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|104aa|up_4|NC_019676.1_2127094_2127406_-,NA|115aa|up_2|NC_019676.1_2131588_2131933_+,NA|142aa|down_5|NC_019676.1_2142122_2142548_-	NA|362aa|up_9|NC_019676.1_2121630_2122716_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|386aa|up_8|NC_019676.1_2122723_2123881_-	COG3287, COG3287, Uncharacterized conserved protein [Function unknown]	NA|375aa|up_7|NC_019676.1_2124197_2125322_+	COG3292, COG3292, Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]	NA|261aa|up_6|NC_019676.1_2125404_2126187_+	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed	NA|191aa|up_5|NC_019676.1_2126215_2126788_-	TIGR02227, Inactive_signal_peptidase_IA	NA|104aa|up_4|NC_019676.1_2127094_2127406_-	NA	NA|1188aa|up_3|NC_019676.1_2127723_2131287_-	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|115aa|up_2|NC_019676.1_2131588_2131933_+	NA	NA|70aa|up_1|NC_019676.1_2132040_2132250_+	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|141aa|up_0|NC_019676.1_2132246_2132669_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|111aa|down_0|NC_019676.1_2133871_2134204_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|1042aa|down_1|NC_019676.1_2134404_2137530_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|517aa|down_2|NC_019676.1_2137843_2139394_-	cd00839, MPP_PAPs, purple acid phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|328aa|down_3|NC_019676.1_2139409_2140393_-	PRK00861, PRK00861, putative lipid kinase; Reviewed	NA|222aa|down_4|NC_019676.1_2140877_2141543_+	pfam05023, Phytochelatin, Phytochelatin synthase	NA|142aa|down_5|NC_019676.1_2142122_2142548_-	NA	NA|90aa|down_6|NC_019676.1_2142779_2143049_-	PRK12864, PRK12864, YciI-like protein; Reviewed	NA|812aa|down_7|NC_019676.1_2143129_2145565_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|668aa|down_8|NC_019676.1_2145864_2147868_+	PRK00007, PRK00007, elongation factor G; Reviewed	NA|299aa|down_9|NC_019676.1_2148117_2149014_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	5	2168386-2168884	2	CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	ATAACACCCTCAANGGTGG	19	2	2	2168525-2168565|2168585-2168625	NC_019676.1_2169125-2169165|NC_019676.1_2169185-2169225	NA	8	8	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|108aa|up_5|NC_019676.1_2160528_2160852_+,NA|225aa|down_7|NC_019676.1_2179860_2180535_-	NA|123aa|up_9|NC_019676.1_2157251_2157620_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|159aa|up_8|NC_019676.1_2157979_2158456_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|272aa|up_7|NC_019676.1_2158469_2159285_-	pfam08894, DUF1838, Protein of unknown function (DUF1838)	NA|282aa|up_6|NC_019676.1_2159380_2160226_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|108aa|up_5|NC_019676.1_2160528_2160852_+	NA	NA|268aa|up_4|NC_019676.1_2161190_2161994_-	pfam02557, VanY, D-alanyl-D-alanine carboxypeptidase	NA|306aa|up_3|NC_019676.1_2162094_2163012_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|173aa|up_2|NC_019676.1_2163070_2163589_-	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|480aa|up_1|NC_019676.1_2163661_2165101_+	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|256aa|up_0|NC_019676.1_2165269_2166037_+	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|262aa|down_0|NC_019676.1_2170928_2171714_-	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|392aa|down_1|NC_019676.1_2171751_2172927_-	smart00933, NurA, NurA nuclease	NA|262aa|down_2|NC_019676.1_2173000_2173786_-	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|183aa|down_3|NC_019676.1_2174132_2174681_+	cd02970, PRX_like2, Peroxiredoxin (PRX)-like 2 family; hypothetical proteins that show sequence similarity to PRXs	NA|392aa|down_4|NC_019676.1_2174917_2176093_+	pfam13191, AAA_16, AAA ATPase domain	NA|294aa|down_5|NC_019676.1_2176100_2176982_-	pfam09972, DUF2207, Predicted membrane protein (DUF2207)	NA|859aa|down_6|NC_019676.1_2177200_2179777_-	cd04059, Peptidases_S8_Protein_convertases_Kexins_Furin-like, Peptidase S8 family domain in Protein convertases	NA|225aa|down_7|NC_019676.1_2179860_2180535_-	NA	NA|783aa|down_8|NC_019676.1_2181112_2183461_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|148aa|down_9|NC_019676.1_2183987_2184431_+	pfam12966, AtpR, N-ATPase, AtpR subunit
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	6	2728758-2730094	1,4,3	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTT----------CAATCCCTAATAGGGATTAGGTGAAGTTTAAAC,GTTTCAATCCCTAATAGGGATTAGGTGAAGTTTAAAC,GTTTCAATCCCTAATAGGGATTAGGTGAAGTTTAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	12,18,18	18	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|82aa|up_4|NC_019676.1_2726190_2726436_-,NA|86aa|up_3|NC_019676.1_2726549_2726807_-,NA|86aa|up_2|NC_019676.1_2727006_2727264_-,NA|82aa|up_1|NC_019676.1_2727399_2727645_-,NA|88aa|up_0|NC_019676.1_2728274_2728538_-,NA	NA|139aa|up_9|NC_019676.1_2721596_2722013_-	pfam00875, DNA_photolyase, DNA photolyase	NA|281aa|up_8|NC_019676.1_2722009_2722852_-	COG0415, PhrB, Deoxyribodipyrimidine photolyase [DNA replication, recombination, and repair]	NA|160aa|up_7|NC_019676.1_2722975_2723455_+	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|367aa|up_6|NC_019676.1_2723553_2724654_-	pfam17914, HopA1, HopA1 effector protein family	NA|393aa|up_5|NC_019676.1_2724944_2726123_-	pfam01636, APH, Phosphotransferase enzyme family	NA|82aa|up_4|NC_019676.1_2726190_2726436_-	NA	NA|86aa|up_3|NC_019676.1_2726549_2726807_-	NA	NA|86aa|up_2|NC_019676.1_2727006_2727264_-	NA	NA|82aa|up_1|NC_019676.1_2727399_2727645_-	NA	NA|88aa|up_0|NC_019676.1_2728274_2728538_-	NA	NA|495aa|down_0|NC_019676.1_2730394_2731879_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|519aa|down_1|NC_019676.1_2731970_2733527_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|205aa|down_2|NC_019676.1_2733828_2734443_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|151aa|down_3|NC_019676.1_2734470_2734923_-	pfam14229, DUF4332, Domain of unknown function (DUF4332)	NA|194aa|down_4|NC_019676.1_2735103_2735685_-	TIGR04376, conserved_hypothetical_protein, TIGR04376 family protein	NA|815aa|down_5|NC_019676.1_2736026_2738471_+	PRK11091, PRK11091, aerobic respiration control sensor protein ArcB; Provisional	NA|430aa|down_6|NC_019676.1_2738467_2739757_+	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|296aa|down_7|NC_019676.1_2740061_2740949_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|166aa|down_8|NC_019676.1_2741038_2741536_-	TIGR04110, hypothetical_protein_VSWAT3_12502, heme utilization protein HutZ	NA|495aa|down_9|NC_019676.1_2741686_2743171_+	pfam08547, CIA30, Complex I intermediate-associated protein 30 (CIA30)
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	7	3230314-3230937	2,5,4	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTT----------CAATCCCTAATAGGGATTAAGTGAAATTTCAAC,GTTTCAATCCCTAATAGGGATTAAGTGAAATTTCAAC,GTTTCAATCCCTAATAGGGATTAAGTGAAATTTCAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	8,8,8	8	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|84aa|up_5|NC_019676.1_3224867_3225119_-,NA|206aa|up_4|NC_019676.1_3225518_3226136_-,NA|93aa|up_2|NC_019676.1_3227128_3227407_-,NA	NA|858aa|up_9|NC_019676.1_3217822_3220396_+	smart00065, GAF, Domain present in phytochromes and cGMP-specific phosphodiesterases	NA|714aa|up_8|NC_019676.1_3220502_3222644_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|219aa|up_7|NC_019676.1_3222648_3223305_-	pfam07077, DUF1345, Protein of unknown function (DUF1345)	NA|415aa|up_6|NC_019676.1_3223402_3224647_-	PRK07424, PRK07424, bifunctional sterol desaturase/short chain dehydrogenase; Validated	NA|84aa|up_5|NC_019676.1_3224867_3225119_-	NA	NA|206aa|up_4|NC_019676.1_3225518_3226136_-	NA	NA|256aa|up_3|NC_019676.1_3226253_3227021_-	cd03457, intradiol_dioxygenase_like, Intradiol dioxygenase supgroup	NA|93aa|up_2|NC_019676.1_3227128_3227407_-	NA	NA|228aa|up_1|NC_019676.1_3227672_3228356_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|482aa|up_0|NC_019676.1_3228361_3229807_+	TIGR01386, Probable_sensor_protein_PcoS, heavy metal sensor kinase	NA|540aa|down_0|NC_019676.1_3231359_3232979_-	sd00006, TPR, Tetratricopeptide repeat	NA|607aa|down_1|NC_019676.1_3233272_3235093_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|530aa|down_2|NC_019676.1_3235557_3237147_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|275aa|down_3|NC_019676.1_3237218_3238043_-	pfam17882, SBD, OAA-family lectin sugar binding domain	NA|224aa|down_4|NC_019676.1_3238865_3239537_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|277aa|down_5|NC_019676.1_3239636_3240467_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|710aa|down_6|NC_019676.1_3240544_3242674_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|558aa|down_7|NC_019676.1_3242742_3244416_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|513aa|down_8|NC_019676.1_3244579_3246118_+	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|213aa|down_9|NC_019676.1_3246260_3246899_+	COG1309, AcrR, Transcriptional regulator [Transcription]
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	8	3300503-3301032	6,5,3,7	CRISPRCasFinder,CRT,PILER-CR,CRISPRCasFinder	no	cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,cas6,cas2,cas1	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Type III-B,Type III-D,Type III-A,Type III-C	GTTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC,GTTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC,TTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC,GTTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC	35,35,34,35	0	0	NA	NA	NA:NA:NA:NA	5,7,7,5	7	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	csm2gr11|170aa|up_7|NC_019676.1_3294514_3295024_+,csx19|152aa|up_4|NC_019676.1_3296925_3297381_+,csx21|218aa|up_2|NC_019676.1_3298505_3299159_+,NA|271aa|up_1|NC_019676.1_3299174_3299987_-,NA|77aa|down_1|NC_019676.1_3302598_3302829_-,NA|102aa|down_2|NC_019676.1_3302950_3303256_-,NA|90aa|down_3|NC_019676.1_3303348_3303618_-,NA|92aa|down_8|NC_019676.1_3310462_3310738_-,NA|46aa|down_9|NC_019676.1_3310759_3310897_+	csm3gr7|238aa|up_9|NC_019676.1_3292530_3293244_+	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csx10gr5|426aa|up_8|NC_019676.1_3293240_3294518_+	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm2gr11|170aa|up_7|NC_019676.1_3294514_3295024_+	NA	csm3gr7|304aa|up_6|NC_019676.1_3295029_3295941_+	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm3gr7|326aa|up_5|NC_019676.1_3295951_3296929_+	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csx19|152aa|up_4|NC_019676.1_3296925_3297381_+	NA	csm3gr7|363aa|up_3|NC_019676.1_3297392_3298481_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx21|218aa|up_2|NC_019676.1_3298505_3299159_+	NA	NA|271aa|up_1|NC_019676.1_3299174_3299987_-	NA	NA|113aa|up_0|NC_019676.1_3300101_3300440_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	cas6|372aa|down_0|NC_019676.1_3301327_3302443_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|77aa|down_1|NC_019676.1_3302598_3302829_-	NA	NA|102aa|down_2|NC_019676.1_3302950_3303256_-	NA	NA|90aa|down_3|NC_019676.1_3303348_3303618_-	NA	cas2|93aa|down_4|NC_019676.1_3303746_3304025_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|347aa|down_5|NC_019676.1_3304024_3305065_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|79aa|down_6|NC_019676.1_3305425_3305662_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	cas1|673aa|down_7|NC_019676.1_3308169_3310188_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|92aa|down_8|NC_019676.1_3310462_3310738_-	NA	NA|46aa|down_9|NC_019676.1_3310759_3310897_+	NA
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	9	3306039-3307844	6,4,8	CRT,PILER-CR,CRISPRCasFinder	no	cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,cas6,cas2,cas1	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Type III-B,Type III-D,Type III-A,Type III-C	GTTTCCATCCCCGTGAGGGGTAAGTGATTTAAAAC,GTTTCCATCCCCGTGAGGGGTAAGTGATTTAAAAC,GTTTCCATCCCCGTGAGGGGTAAGTGATTTAAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	24,23,23	24	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	csx21|218aa|up_9|NC_019676.1_3298505_3299159_+,NA|271aa|up_8|NC_019676.1_3299174_3299987_-,NA|77aa|up_5|NC_019676.1_3302598_3302829_-,NA|102aa|up_4|NC_019676.1_3302950_3303256_-,NA|90aa|up_3|NC_019676.1_3303348_3303618_-,NA|92aa|down_1|NC_019676.1_3310462_3310738_-,NA|46aa|down_2|NC_019676.1_3310759_3310897_+,NA|47aa|down_5|NC_019676.1_3317904_3318045_-	csx21|218aa|up_9|NC_019676.1_3298505_3299159_+	NA	NA|271aa|up_8|NC_019676.1_3299174_3299987_-	NA	NA|113aa|up_7|NC_019676.1_3300101_3300440_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	cas6|372aa|up_6|NC_019676.1_3301327_3302443_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|77aa|up_5|NC_019676.1_3302598_3302829_-	NA	NA|102aa|up_4|NC_019676.1_3302950_3303256_-	NA	NA|90aa|up_3|NC_019676.1_3303348_3303618_-	NA	cas2|93aa|up_2|NC_019676.1_3303746_3304025_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|347aa|up_1|NC_019676.1_3304024_3305065_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|79aa|up_0|NC_019676.1_3305425_3305662_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	cas1|673aa|down_0|NC_019676.1_3308169_3310188_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|92aa|down_1|NC_019676.1_3310462_3310738_-	NA	NA|46aa|down_2|NC_019676.1_3310759_3310897_+	NA	NA|248aa|down_3|NC_019676.1_3311025_3311769_+	pfam14326, DUF4384, Domain of unknown function (DUF4384)	NA|1259aa|down_4|NC_019676.1_3314099_3317876_+	pfam12770, CHAT, CHAT domain	NA|47aa|down_5|NC_019676.1_3317904_3318045_-	NA	NA|162aa|down_6|NC_019676.1_3318013_3318499_-	cd00756, MoaE, MoaE family	NA|102aa|down_7|NC_019676.1_3318678_3318984_+	COG2202, AtoS, FOG: PAS/PAC domain [Signal transduction mechanisms]	NA|828aa|down_8|NC_019676.1_3319131_3321615_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|166aa|down_9|NC_019676.1_3321834_3322332_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	10	3312106-3314038	5,9,7,6	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	csm3gr7,csx10gr5,csm2gr11,csx19,csx21,cas6,cas2,cas1	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Type III-A	GTTTCCATCCCCGTGAGGGGTAATGAATTGAAAAC,GTTTTCAATTCATTACCCCTCACGGGGATGGAAAC,GTTTTCAATTCATTACCCCTCACGGGGATGGAAAC,GTTTCCATCCCCGTGAGGGGTAATGAATTGAAAAC	35,35,35,35	0	0	NA	NA	NA:NA:NA:NA	23,26,26,23	26	TypeIII-A	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|77aa|up_9|NC_019676.1_3302598_3302829_-,NA|102aa|up_8|NC_019676.1_3302950_3303256_-,NA|90aa|up_7|NC_019676.1_3303348_3303618_-,NA|92aa|up_2|NC_019676.1_3310462_3310738_-,NA|46aa|up_1|NC_019676.1_3310759_3310897_+,NA|47aa|down_1|NC_019676.1_3317904_3318045_-	NA|77aa|up_9|NC_019676.1_3302598_3302829_-	NA	NA|102aa|up_8|NC_019676.1_3302950_3303256_-	NA	NA|90aa|up_7|NC_019676.1_3303348_3303618_-	NA	cas2|93aa|up_6|NC_019676.1_3303746_3304025_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|347aa|up_5|NC_019676.1_3304024_3305065_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|79aa|up_4|NC_019676.1_3305425_3305662_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	cas1|673aa|up_3|NC_019676.1_3308169_3310188_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|92aa|up_2|NC_019676.1_3310462_3310738_-	NA	NA|46aa|up_1|NC_019676.1_3310759_3310897_+	NA	NA|248aa|up_0|NC_019676.1_3311025_3311769_+	pfam14326, DUF4384, Domain of unknown function (DUF4384)	NA|1259aa|down_0|NC_019676.1_3314099_3317876_+	pfam12770, CHAT, CHAT domain	NA|47aa|down_1|NC_019676.1_3317904_3318045_-	NA	NA|162aa|down_2|NC_019676.1_3318013_3318499_-	cd00756, MoaE, MoaE family	NA|102aa|down_3|NC_019676.1_3318678_3318984_+	COG2202, AtoS, FOG: PAS/PAC domain [Signal transduction mechanisms]	NA|828aa|down_4|NC_019676.1_3319131_3321615_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|166aa|down_5|NC_019676.1_3321834_3322332_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|631aa|down_6|NC_019676.1_3322297_3324190_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|179aa|down_7|NC_019676.1_3324215_3324752_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|320aa|down_8|NC_019676.1_3324833_3325793_-	PRK01209, cobD, cobalamin biosynthesis protein	NA|148aa|down_9|NC_019676.1_3325860_3326304_-	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	11	3371366-3371462	10	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	ACTCAGCACGGGCTAAACGCCCCGCTACCGCT	32	1	16	3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430|3371398-3371430	NC_019676.1_33131-33099|NC_019676.1_246490-246522|NC_019676.1_660849-660881|NC_019676.1_795126-795158|NC_019676.1_2107946-2107914|NC_019676.1_2128608-2128576|NC_019676.1_2677477-2677445|NC_019676.1_3280543-3280511|NC_019676.1_4254863-4254895|NC_019676.1_4746213-4746181|NC_019676.1_1062204-1062236|NC_019676.1_5266719-5266687|NC_019676.1_5285933-5285901|NC_019676.1_997625-997657|NC_019676.1_2993510-2993478|NC_019676.1_4920327-4920295	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|159aa|up_3|NC_019676.1_3365972_3366449_-,NA|91aa|down_9|NC_019676.1_3383622_3383895_-	NA|272aa|up_9|NC_019676.1_3360058_3360874_+	PRK03592, PRK03592, haloalkane dehalogenase; Provisional	NA|306aa|up_8|NC_019676.1_3360934_3361852_+	pfam14499, DUF4437, Domain of unknown function (DUF4437)	NA|490aa|up_7|NC_019676.1_3361887_3363357_-	cd01302, Cyclic_amidohydrolases, Cyclic amidohydrolases, including hydantoinase, dihydropyrimidinase, allantoinase, and dihydroorotase, are involved in the metabolism of pyrimidines and purines, sharing the property of hydrolyzing the cyclic amide bond of each substrate to the corresponding N-carbamyl amino acids	NA|265aa|up_6|NC_019676.1_3363493_3364288_+	COG3555, COG3555, Aspartyl/asparaginyl beta-hydroxylase and related dioxygenases [Posttranslational modification, protein turnover, chaperones]	NA|279aa|up_5|NC_019676.1_3364322_3365159_-	cd05359, ChcA_like_SDR_c, 1-cyclohexenylcarbonyl_coenzyme A_reductase (ChcA)_like, classical (c) SDRs	NA|133aa|up_4|NC_019676.1_3365560_3365959_+	pfam12680, SnoaL_2, SnoaL-like domain	NA|159aa|up_3|NC_019676.1_3365972_3366449_-	NA	NA|341aa|up_2|NC_019676.1_3366662_3367685_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|233aa|up_1|NC_019676.1_3367735_3368434_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|215aa|up_0|NC_019676.1_3368510_3369155_+	PRK01686, hisG, ATP phosphoribosyltransferase catalytic subunit; Reviewed	NA|142aa|down_0|NC_019676.1_3371741_3372167_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|756aa|down_1|NC_019676.1_3372172_3374440_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|104aa|down_2|NC_019676.1_3374443_3374755_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|133aa|down_3|NC_019676.1_3374741_3375140_-	pfam05973, Gp49, Phage derived protein Gp49-like (DUF891)	NA|335aa|down_4|NC_019676.1_3375227_3376232_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|188aa|down_5|NC_019676.1_3376517_3377081_+	COG1434, COG1434, Uncharacterized conserved protein [Function unknown]	NA|803aa|down_6|NC_019676.1_3377186_3379595_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|451aa|down_7|NC_019676.1_3381381_3382734_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|119aa|down_8|NC_019676.1_3383266_3383623_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|91aa|down_9|NC_019676.1_3383622_3383895_-	NA
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	12	3446755-3448198	7,11,8,8	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	AATTG----------CAATTCAAACTAATCCCTATTAGGGATTGAAAC,AATTGCAATTCAAACTAATCCCTATTAGGGATTGAAAC,AATTGCAATTCAAACTAATCCCTATTAGGGATTGAAAC,AATTG----------CAATTCAAACTAATCCCTATTAGGGATTGAAAC	48,38,38,48	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	16,19,19,16	19	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|72aa|up_6|NC_019676.1_3440318_3440534_-,NA|123aa|up_5|NC_019676.1_3440744_3441113_+,NA|248aa|up_3|NC_019676.1_3442556_3443300_-,NA|193aa|down_4|NC_019676.1_3458537_3459116_+,NA|102aa|down_6|NC_019676.1_3460456_3460762_-	NA|162aa|up_9|NC_019676.1_3437399_3437885_-	cd12125, APC_alpha, Allophycocyanin alpha subunit of the phycobilisome core	NA|453aa|up_8|NC_019676.1_3438016_3439375_+	TIGR00479, 23S_rRNA_uracil1939-C5-methyltransferase_RlmD, 23S rRNA (uracil-5-)-methyltransferase RumA	NA|150aa|up_7|NC_019676.1_3439807_3440257_+	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|72aa|up_6|NC_019676.1_3440318_3440534_-	NA	NA|123aa|up_5|NC_019676.1_3440744_3441113_+	NA	NA|464aa|up_4|NC_019676.1_3441115_3442507_+	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|248aa|up_3|NC_019676.1_3442556_3443300_-	NA	NA|417aa|up_2|NC_019676.1_3443690_3444941_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|124aa|up_1|NC_019676.1_3445033_3445405_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|322aa|up_0|NC_019676.1_3445430_3446396_-	COG2421, COG2421, Predicted acetamidase/formamidase [Energy production and conversion]	NA|2215aa|down_0|NC_019676.1_3449583_3456228_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|199aa|down_1|NC_019676.1_3456676_3457273_-	cd05540, UreG, urease accessory protein UreG	NA|230aa|down_2|NC_019676.1_3457347_3458037_-	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]	NA|147aa|down_3|NC_019676.1_3458014_3458455_-	PRK13261, ureE, urease accessory protein UreE; Provisional	NA|193aa|down_4|NC_019676.1_3458537_3459116_+	NA	NA|435aa|down_5|NC_019676.1_3459155_3460460_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|102aa|down_6|NC_019676.1_3460456_3460762_-	NA	NA|308aa|down_7|NC_019676.1_3460781_3461705_-	cd02647, nuc_hydro_TvIAG, nuc_hydro_ TvIAG:  Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax	NA|144aa|down_8|NC_019676.1_3461722_3462154_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|307aa|down_9|NC_019676.1_3462198_3463119_-	cd01174, ribokinase, Ribokinase catalyses the phosphorylation of ribose to ribose-5-phosphate using ATP
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	13	3449199-3449464	9,12,9	CRT,CRISPRCasFinder,PILER-CR	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	AAAATTGCAATTCAAACTAATCCCTATTAGGGATTGAAACA,AATTGCAATTCAAACTAATCCCTATTAGGGATTGAAAC,AAAATTG----------CAATTCAAACTAATCCCTATTAGGGATTGAAACA	41,38,51	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	3,3,2	3	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|72aa|up_6|NC_019676.1_3440318_3440534_-,NA|123aa|up_5|NC_019676.1_3440744_3441113_+,NA|248aa|up_3|NC_019676.1_3442556_3443300_-,NA|193aa|down_4|NC_019676.1_3458537_3459116_+,NA|102aa|down_6|NC_019676.1_3460456_3460762_-	NA|162aa|up_9|NC_019676.1_3437399_3437885_-	cd12125, APC_alpha, Allophycocyanin alpha subunit of the phycobilisome core	NA|453aa|up_8|NC_019676.1_3438016_3439375_+	TIGR00479, 23S_rRNA_uracil1939-C5-methyltransferase_RlmD, 23S rRNA (uracil-5-)-methyltransferase RumA	NA|150aa|up_7|NC_019676.1_3439807_3440257_+	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|72aa|up_6|NC_019676.1_3440318_3440534_-	NA	NA|123aa|up_5|NC_019676.1_3440744_3441113_+	NA	NA|464aa|up_4|NC_019676.1_3441115_3442507_+	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|248aa|up_3|NC_019676.1_3442556_3443300_-	NA	NA|417aa|up_2|NC_019676.1_3443690_3444941_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|124aa|up_1|NC_019676.1_3445033_3445405_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|322aa|up_0|NC_019676.1_3445430_3446396_-	COG2421, COG2421, Predicted acetamidase/formamidase [Energy production and conversion]	NA|2215aa|down_0|NC_019676.1_3449583_3456228_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|199aa|down_1|NC_019676.1_3456676_3457273_-	cd05540, UreG, urease accessory protein UreG	NA|230aa|down_2|NC_019676.1_3457347_3458037_-	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]	NA|147aa|down_3|NC_019676.1_3458014_3458455_-	PRK13261, ureE, urease accessory protein UreE; Provisional	NA|193aa|down_4|NC_019676.1_3458537_3459116_+	NA	NA|435aa|down_5|NC_019676.1_3459155_3460460_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|102aa|down_6|NC_019676.1_3460456_3460762_-	NA	NA|308aa|down_7|NC_019676.1_3460781_3461705_-	cd02647, nuc_hydro_TvIAG, nuc_hydro_ TvIAG:  Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax	NA|144aa|down_8|NC_019676.1_3461722_3462154_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|307aa|down_9|NC_019676.1_3462198_3463119_-	cd01174, ribokinase, Ribokinase catalyses the phosphorylation of ribose to ribose-5-phosphate using ATP
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	14	3559800-3559888	13	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	AATGGCGGAAATTTTTCTGATCC	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|148aa|up_9|NC_019676.1_3547253_3547697_+,NA|137aa|up_8|NC_019676.1_3547994_3548405_+,NA|76aa|up_0|NC_019676.1_3558951_3559179_-,NA|81aa|down_0|NC_019676.1_3561694_3561937_-,NA|93aa|down_1|NC_019676.1_3561933_3562212_-,NA|118aa|down_2|NC_019676.1_3562208_3562562_-,NA|78aa|down_3|NC_019676.1_3562581_3562815_-,NA|59aa|down_6|NC_019676.1_3564002_3564179_+,NA|64aa|down_7|NC_019676.1_3564175_3564367_+,NA|84aa|down_8|NC_019676.1_3564714_3564966_-,NA|84aa|down_9|NC_019676.1_3565248_3565500_+	NA|148aa|up_9|NC_019676.1_3547253_3547697_+	NA	NA|137aa|up_8|NC_019676.1_3547994_3548405_+	NA	NA|238aa|up_7|NC_019676.1_3548682_3549396_-	sd00006, TPR, Tetratricopeptide repeat	NA|313aa|up_6|NC_019676.1_3550111_3551050_-	PRK07429, PRK07429, phosphoribulokinase; Provisional	NA|534aa|up_5|NC_019676.1_3551468_3553070_+	pfam18105, PGM1_C, PGM1 C-terminal domain	NA|580aa|up_4|NC_019676.1_3553206_3554946_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|196aa|up_3|NC_019676.1_3555583_3556171_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|392aa|up_2|NC_019676.1_3556404_3557580_+	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|395aa|up_1|NC_019676.1_3557743_3558928_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|76aa|up_0|NC_019676.1_3558951_3559179_-	NA	NA|81aa|down_0|NC_019676.1_3561694_3561937_-	NA	NA|93aa|down_1|NC_019676.1_3561933_3562212_-	NA	NA|118aa|down_2|NC_019676.1_3562208_3562562_-	NA	NA|78aa|down_3|NC_019676.1_3562581_3562815_-	NA	NA|114aa|down_4|NC_019676.1_3562924_3563266_+	pfam08872, KGK, KGK domain	NA|81aa|down_5|NC_019676.1_3563519_3563762_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|59aa|down_6|NC_019676.1_3564002_3564179_+	NA	NA|64aa|down_7|NC_019676.1_3564175_3564367_+	NA	NA|84aa|down_8|NC_019676.1_3564714_3564966_-	NA	NA|84aa|down_9|NC_019676.1_3565248_3565500_+	NA
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	15	3760767-3761456	14,10,10	CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTTCCATCCCCTTGCGGGGGAAGTGGTTTGGATAC,GTTTCCATCCCCTTGCGGGGGAAGTGGTTTGGATAC,GTTTCCATCCCCTTGCGGGGGAAGTGGTTTGGATAC	36,36,36	0	0	NA	NA	NA:NA:NA	9,9,8	9	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|81aa|up_5|NC_019676.1_3754187_3754430_-,NA|108aa|up_1|NC_019676.1_3759052_3759376_+,NA|53aa|down_0|NC_019676.1_3761812_3761971_+	NA|301aa|up_9|NC_019676.1_3748903_3749806_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|361aa|up_8|NC_019676.1_3749870_3750953_-	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|376aa|up_7|NC_019676.1_3750965_3752093_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|614aa|up_6|NC_019676.1_3752305_3754147_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|81aa|up_5|NC_019676.1_3754187_3754430_-	NA	NA|547aa|up_4|NC_019676.1_3754548_3756189_-	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|380aa|up_3|NC_019676.1_3756702_3757842_-	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|190aa|up_2|NC_019676.1_3757838_3758408_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|108aa|up_1|NC_019676.1_3759052_3759376_+	NA	NA|412aa|up_0|NC_019676.1_3759422_3760658_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|53aa|down_0|NC_019676.1_3761812_3761971_+	NA	NA|197aa|down_1|NC_019676.1_3762073_3762664_+	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|266aa|down_2|NC_019676.1_3762727_3763525_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|150aa|down_3|NC_019676.1_3764008_3764458_+	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|95aa|down_4|NC_019676.1_3764593_3764878_+	pfam17195, DUF5132, Protein of unknown function (DUF5132)	NA|229aa|down_5|NC_019676.1_3764894_3765581_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|306aa|down_6|NC_019676.1_3765765_3766683_-	PRK02649, ppnK, NAD(+) kinase	NA|330aa|down_7|NC_019676.1_3766824_3767814_-	CHL00194, ycf39, Ycf39; Provisional	NA|35aa|down_8|NC_019676.1_3767978_3768083_-	pfam08041, PetM, PetM family of cytochrome b6f complex subunit 7	NA|362aa|down_9|NC_019676.1_3768298_3769384_+	PRK02746, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase PdxA
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	16	3791035-3791125	15	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	TTCAGCAATTCCCATCACCAGAACAAGTTGT	31	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|92aa|up_6|NC_019676.1_3779844_3780120_+,NA|104aa|up_3|NC_019676.1_3784036_3784348_-,NA|554aa|down_0|NC_019676.1_3793439_3795101_+,NA|242aa|down_4|NC_019676.1_3798722_3799448_-	NA|162aa|up_9|NC_019676.1_3772623_3773109_+	pfam14317, YcxB, YcxB-like protein	NA|1705aa|up_8|NC_019676.1_3773164_3778279_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|208aa|up_7|NC_019676.1_3778963_3779587_+	pfam13353, Fer4_12, 4Fe-4S single cluster domain	NA|92aa|up_6|NC_019676.1_3779844_3780120_+	NA	NA|255aa|up_5|NC_019676.1_3780382_3781147_+	COG4328, COG4328, Predicted nuclease (RNAse H fold) [General function prediction only]	NA|855aa|up_4|NC_019676.1_3781194_3783759_-	TIGR02094, Glycogen_phosphorylase, alpha-glucan phosphorylases	NA|104aa|up_3|NC_019676.1_3784036_3784348_-	NA	NA|236aa|up_2|NC_019676.1_3785870_3786578_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|843aa|up_1|NC_019676.1_3787007_3789536_+	pfam00343, Phosphorylase, Carbohydrate phosphorylase	NA|39aa|up_0|NC_019676.1_3789966_3790083_+	PRK02655, psbI, photosystem II reaction center protein I	NA|554aa|down_0|NC_019676.1_3793439_3795101_+	NA	NA|341aa|down_1|NC_019676.1_3795188_3796211_+	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|391aa|down_2|NC_019676.1_3796321_3797494_+	cd03821, GT4_Bme6-like, Brucella melitensis Bme6 and similar proteins	NA|380aa|down_3|NC_019676.1_3797574_3798714_-	pfam13531, SBP_bac_11, Bacterial extracellular solute-binding protein	NA|242aa|down_4|NC_019676.1_3798722_3799448_-	NA	NA|571aa|down_5|NC_019676.1_3799493_3801206_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|213aa|down_6|NC_019676.1_3801498_3802137_+	cd03016, PRX_1cys, Peroxiredoxin (PRX) family, 1-cys PRX subfamily; composed of PRXs containing only one conserved cysteine, which serves as the peroxidatic cysteine	NA|195aa|down_7|NC_019676.1_3802242_3802827_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|493aa|down_8|NC_019676.1_3802835_3804314_-	pfam00931, NB-ARC, NB-ARC domain	NA|82aa|down_9|NC_019676.1_3804411_3804657_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	17	3817115-3817204	16	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	TGGCTGTAGGAGTAGGAGTTGCTTGAGGTT	30	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|413aa|up_6|NC_019676.1_3805202_3806441_-,NA|471aa|down_3|NC_019676.1_3823185_3824598_+,NA|58aa|down_5|NC_019676.1_3825581_3825755_-	NA|493aa|up_9|NC_019676.1_3802835_3804314_-	pfam00931, NB-ARC, NB-ARC domain	NA|82aa|up_8|NC_019676.1_3804411_3804657_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|163aa|up_7|NC_019676.1_3804653_3805142_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|413aa|up_6|NC_019676.1_3805202_3806441_-	NA	NA|822aa|up_5|NC_019676.1_3806534_3809000_+	TIGR03788, marine_srt_targ, marine proteobacterial sortase target protein	NA|737aa|up_4|NC_019676.1_3809048_3811259_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|111aa|up_3|NC_019676.1_3813593_3813926_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|266aa|up_2|NC_019676.1_3813980_3814778_+	pfam17265, DUF5331, Family of unknown function (DUF5331)	NA|285aa|up_1|NC_019676.1_3815094_3815949_-	PRK06027, purU, formyltetrahydrofolate deformylase; Reviewed	NA|205aa|up_0|NC_019676.1_3816105_3816720_-	pfam14238, DUF4340, Domain of unknown function (DUF4340)	NA|270aa|down_0|NC_019676.1_3818586_3819396_-	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|334aa|down_1|NC_019676.1_3819396_3820398_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|498aa|down_2|NC_019676.1_3821296_3822790_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|471aa|down_3|NC_019676.1_3823185_3824598_+	NA	NA|160aa|down_4|NC_019676.1_3824810_3825290_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|58aa|down_5|NC_019676.1_3825581_3825755_-	NA	NA|180aa|down_6|NC_019676.1_3825970_3826510_-	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|117aa|down_7|NC_019676.1_3826616_3826967_+	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|68aa|down_8|NC_019676.1_3827004_3827208_-	pfam06988, NifT, NifT/FixU protein	NA|96aa|down_9|NC_019676.1_3827182_3827470_-	pfam04319, NifZ, NifZ domain
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	18	4119123-4119240	17	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GCTTGCGCTTGGGCAATATCTTGAGGGCGATTACCTGCTT	40	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|249aa|up_7|NC_019676.1_4110641_4111388_+,NA|169aa|up_6|NC_019676.1_4111476_4111983_+,NA|143aa|up_3|NC_019676.1_4114039_4114468_+,NA|155aa|down_0|NC_019676.1_4120134_4120599_+,NA|188aa|down_2|NC_019676.1_4122283_4122847_+,NA|52aa|down_8|NC_019676.1_4126738_4126894_+	NA|221aa|up_9|NC_019676.1_4108843_4109506_+	pfam10988, DUF2807, Putative auto-transporter adhesin, head GIN domain	NA|195aa|up_8|NC_019676.1_4109553_4110138_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|249aa|up_7|NC_019676.1_4110641_4111388_+	NA	NA|169aa|up_6|NC_019676.1_4111476_4111983_+	NA	NA|96aa|up_5|NC_019676.1_4112367_4112655_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|320aa|up_4|NC_019676.1_4112792_4113752_+	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|143aa|up_3|NC_019676.1_4114039_4114468_+	NA	NA|463aa|up_2|NC_019676.1_4114512_4115901_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|221aa|up_1|NC_019676.1_4116011_4116674_-	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|430aa|up_0|NC_019676.1_4116765_4118055_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|155aa|down_0|NC_019676.1_4120134_4120599_+	NA	NA|437aa|down_1|NC_019676.1_4120802_4122113_-	pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller	NA|188aa|down_2|NC_019676.1_4122283_4122847_+	NA	NA|128aa|down_3|NC_019676.1_4122906_4123290_+	pfam10184, DUF2358, Uncharacterized conserved protein (DUF2358)	NA|325aa|down_4|NC_019676.1_4123363_4124338_-	PRK09375, PRK09375, quinolinate synthase NadA	NA|217aa|down_5|NC_019676.1_4124476_4125127_-	cd06259, YdcF-like, YdcF-like	NA|219aa|down_6|NC_019676.1_4125126_4125783_-	pfam10063, DUF2301, Uncharacterized integral membrane protein (DUF2301)	NA|268aa|down_7|NC_019676.1_4125824_4126628_-	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase	NA|52aa|down_8|NC_019676.1_4126738_4126894_+	NA	NA|596aa|down_9|NC_019676.1_4127241_4129029_-	PRK00476, aspS, aspartyl-tRNA synthetase; Validated
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	19	4291641-4292700	11,18,11	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC,GTTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC,GTTTCCATCCCCGTGAGGGGTAATTAATTGAAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	14,14,14	14	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|76aa|up_9|NC_019676.1_4280674_4280902_+,NA|220aa|down_4|NC_019676.1_4298760_4299420_-,NA|164aa|down_9|NC_019676.1_4304099_4304591_+	NA|76aa|up_9|NC_019676.1_4280674_4280902_+	NA	NA|280aa|up_8|NC_019676.1_4281246_4282086_+	cd02537, GT8_Glycogenin, Glycogenin belongs the GT 8 family and initiates the biosynthesis of glycogen	NA|389aa|up_7|NC_019676.1_4282112_4283279_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|747aa|up_6|NC_019676.1_4283433_4285674_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|404aa|up_5|NC_019676.1_4285679_4286891_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|322aa|up_4|NC_019676.1_4286931_4287897_+	pfam00852, Glyco_transf_10, Glycosyltransferase family 10 (fucosyltransferase) C-term	NA|337aa|up_3|NC_019676.1_4287903_4288914_+	pfam11051, Mannosyl_trans3, Mannosyltransferase putative	NA|309aa|up_2|NC_019676.1_4288915_4289842_+	pfam11051, Mannosyl_trans3, Mannosyltransferase putative	NA|153aa|up_1|NC_019676.1_4290061_4290520_+	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|181aa|up_0|NC_019676.1_4290816_4291359_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|1376aa|down_0|NC_019676.1_4293171_4297299_+	PRK07773, PRK07773, replicative DNA helicase; Validated	NA|174aa|down_1|NC_019676.1_4297352_4297874_-	COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]	NA|108aa|down_2|NC_019676.1_4297953_4298277_-	COG3937, COG3937, Uncharacterized conserved protein [Function unknown]	NA|103aa|down_3|NC_019676.1_4298412_4298721_+	TIGR03792, conserved_hypothetical_protein, uncharacterized cyanobacterial protein, TIGR03792 family	NA|220aa|down_4|NC_019676.1_4298760_4299420_-	NA	NA|227aa|down_5|NC_019676.1_4299441_4300122_-	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|366aa|down_6|NC_019676.1_4300316_4301414_-	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|120aa|down_7|NC_019676.1_4301773_4302133_-	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|544aa|down_8|NC_019676.1_4302348_4303980_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|164aa|down_9|NC_019676.1_4304099_4304591_+	NA
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	20	4563956-4567063	12,19,12	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	ATTG----------CAATTCAAACTAATCCCTATTAGGGATTGAAAC,ATTGCAATTCAAACTAATCCCTATTAGGGATTGAAAC,ATTGCAATTCAAACTAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	40,42,42	42	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|80aa|up_9|NC_019676.1_4554436_4554676_+,NA|67aa|up_6|NC_019676.1_4556974_4557175_+,NA|95aa|down_2|NC_019676.1_4570142_4570427_+,NA|125aa|down_3|NC_019676.1_4570423_4570798_+,NA|91aa|down_6|NC_019676.1_4573439_4573712_-,NA|79aa|down_7|NC_019676.1_4573749_4573986_-,NA|91aa|down_8|NC_019676.1_4575084_4575357_+	NA|80aa|up_9|NC_019676.1_4554436_4554676_+	NA	NA|245aa|up_8|NC_019676.1_4555014_4555749_+	pfam05685, Uma2, Putative restriction endonuclease	NA|300aa|up_7|NC_019676.1_4555784_4556684_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|67aa|up_6|NC_019676.1_4556974_4557175_+	NA	NA|472aa|up_5|NC_019676.1_4557256_4558672_-	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|170aa|up_4|NC_019676.1_4559030_4559540_+	cd12126, APC_beta, Allophycocyanin beta subunit of the phycobilisome core	NA|58aa|up_3|NC_019676.1_4559764_4559938_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|412aa|up_2|NC_019676.1_4560062_4561298_+	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|240aa|up_1|NC_019676.1_4561419_4562139_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|464aa|up_0|NC_019676.1_4562315_4563707_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|418aa|down_0|NC_019676.1_4567359_4568613_-	PRK06185, PRK06185, FAD-dependent oxidoreductase	NA|418aa|down_1|NC_019676.1_4568798_4570052_+	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|95aa|down_2|NC_019676.1_4570142_4570427_+	NA	NA|125aa|down_3|NC_019676.1_4570423_4570798_+	NA	NA|550aa|down_4|NC_019676.1_4570906_4572556_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|186aa|down_5|NC_019676.1_4572821_4573379_+	pfam05685, Uma2, Putative restriction endonuclease	NA|91aa|down_6|NC_019676.1_4573439_4573712_-	NA	NA|79aa|down_7|NC_019676.1_4573749_4573986_-	NA	NA|91aa|down_8|NC_019676.1_4575084_4575357_+	NA	NA|376aa|down_9|NC_019676.1_4576028_4577156_+	COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	21	5102610-5102685	20	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTCAATCAATTTTGGATTTTAGA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|648aa|up_9|NC_019676.1_5087177_5089121_+,NA	NA|648aa|up_9|NC_019676.1_5087177_5089121_+	NA	NA|204aa|up_8|NC_019676.1_5089645_5090257_-	pfam09988, DUF2227, Uncharacterized metal-binding protein (DUF2227)	NA|200aa|up_7|NC_019676.1_5090845_5091445_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|391aa|up_6|NC_019676.1_5091729_5092902_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|381aa|up_5|NC_019676.1_5092996_5094139_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|156aa|up_4|NC_019676.1_5094213_5094681_+	COG3296, COG3296, Uncharacterized protein conserved in bacteria [Function unknown]	NA|536aa|up_3|NC_019676.1_5094743_5096351_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|370aa|up_2|NC_019676.1_5097149_5098259_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|764aa|up_1|NC_019676.1_5099360_5101652_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|190aa|up_0|NC_019676.1_5101801_5102371_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|261aa|down_0|NC_019676.1_5102761_5103544_-	cd17767, UP_EcUdp-like, uridine phosphorylases similar to Escherichia coli Udp and related phosphorylases	NA|97aa|down_1|NC_019676.1_5103588_5103879_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|114aa|down_2|NC_019676.1_5103875_5104217_+	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|1153aa|down_3|NC_019676.1_5104521_5107980_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|307aa|down_4|NC_019676.1_5108188_5109109_-	PRK02645, ppnK, NAD(+) kinase	NA|102aa|down_5|NC_019676.1_5109115_5109421_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|200aa|down_6|NC_019676.1_5109582_5110182_-	CHL00016, ndhG, NADH dehydrogenase subunit 6	NA|192aa|down_7|NC_019676.1_5110413_5110989_-	TIGR00403, NADPH-quinone_oxidoreductase_subunit_I, NADH-plastoquinone oxidoreductase subunit I protein	NA|373aa|down_8|NC_019676.1_5111078_5112197_-	CHL00032, ndhA, NADH dehydrogenase subunit 1	NA|379aa|down_9|NC_019676.1_5112658_5113795_-	PRK14036, PRK14036, citrate synthase; Provisional
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	22	5241496-5241606	21	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	CGTTTCGAGTTCTGAAGACAAAGTTCC	27	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|111aa|up_9|NC_019676.1_5234780_5235113_-,NA|135aa|up_8|NC_019676.1_5235298_5235703_-,NA|111aa|up_7|NC_019676.1_5235743_5236076_-,NA|117aa|up_6|NC_019676.1_5236508_5236859_-,NA|52aa|up_4|NC_019676.1_5237832_5237988_+,NA|270aa|up_2|NC_019676.1_5238892_5239702_-,NA|158aa|up_0|NC_019676.1_5240967_5241441_+,NA|65aa|down_4|NC_019676.1_5250128_5250323_-,NA|67aa|down_5|NC_019676.1_5250490_5250691_-,NA|60aa|down_6|NC_019676.1_5250972_5251152_-	NA|111aa|up_9|NC_019676.1_5234780_5235113_-	NA	NA|135aa|up_8|NC_019676.1_5235298_5235703_-	NA	NA|111aa|up_7|NC_019676.1_5235743_5236076_-	NA	NA|117aa|up_6|NC_019676.1_5236508_5236859_-	NA	NA|197aa|up_5|NC_019676.1_5237243_5237834_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|52aa|up_4|NC_019676.1_5237832_5237988_+	NA	NA|258aa|up_3|NC_019676.1_5238028_5238802_-	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|270aa|up_2|NC_019676.1_5238892_5239702_-	NA	NA|357aa|up_1|NC_019676.1_5239786_5240857_+	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|158aa|up_0|NC_019676.1_5240967_5241441_+	NA	NA|803aa|down_0|NC_019676.1_5241702_5244111_-	COG4354, COG4354, Predicted bile acid beta-glucosidase [Carbohydrate transport and metabolism]	NA|604aa|down_1|NC_019676.1_5245232_5247044_+	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|606aa|down_2|NC_019676.1_5247085_5248903_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|296aa|down_3|NC_019676.1_5248961_5249849_-	TIGR04352, hypothetical_protein_imdm_1353, HprK-related kinase A	NA|65aa|down_4|NC_019676.1_5250128_5250323_-	NA	NA|67aa|down_5|NC_019676.1_5250490_5250691_-	NA	NA|60aa|down_6|NC_019676.1_5250972_5251152_-	NA	NA|678aa|down_7|NC_019676.1_5251259_5253293_-	cd00712, AsnB, Glutamine amidotransferases class-II (GATase) asparagine synthase_B type	NA|144aa|down_8|NC_019676.1_5253296_5253728_-	pfam13471, Transglut_core3, Transglutaminase-like superfamily	NA|102aa|down_9|NC_019676.1_5253711_5254017_-	pfam05402, PqqD, Coenzyme PQQ synthesis protein D (PqqD)
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	23	5474143-5474404	22	CRISPRCasFinder	no	c2c5_V-U5	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Type V-U5	CCTTTCAACCCACCTCTAGCCGGGATGGTTGTTGAAACT	39	0	0	NA	NA	V-U5	3	3	TypeV-U5	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|159aa|up_9|NC_019676.1_5461089_5461566_-,NA|382aa|up_8|NC_019676.1_5461685_5462831_+,NA|172aa|up_7|NC_019676.1_5462836_5463352_+,NA|58aa|up_3|NC_019676.1_5469559_5469733_-,NA	NA|159aa|up_9|NC_019676.1_5461089_5461566_-	NA	NA|382aa|up_8|NC_019676.1_5461685_5462831_+	NA	NA|172aa|up_7|NC_019676.1_5462836_5463352_+	NA	NA|457aa|up_6|NC_019676.1_5463643_5465014_-	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|717aa|up_5|NC_019676.1_5465097_5467248_-	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|691aa|up_4|NC_019676.1_5467490_5469563_-	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|58aa|up_3|NC_019676.1_5469559_5469733_-	NA	NA|492aa|up_2|NC_019676.1_5470048_5471524_-	TIGR04095, type_III_restriction_protein_res_subunit, DNA phosphorothioation system restriction enzyme	NA|202aa|up_1|NC_019676.1_5471629_5472235_-	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|307aa|up_0|NC_019676.1_5472466_5473387_-	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	c2c5_V-U5|636aa|down_0|NC_019676.1_5474900_5476808_-	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|54aa|down_1|NC_019676.1_5476970_5477132_+	PHA01623, PHA01623, hypothetical protein	NA|1094aa|down_2|NC_019676.1_5477701_5480983_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|1322aa|down_3|NC_019676.1_5480996_5484962_+	NF033451, BREX_2_MTaseX, BREX-2 system adenine-specific DNA-methyltransferase PglX	NA|94aa|down_4|NC_019676.1_5484991_5485273_+	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|150aa|down_5|NC_019676.1_5485276_5485726_+	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|1183aa|down_6|NC_019676.1_5486664_5490213_+	smart00490, HELICc, helicase superfamily c-terminal domain	NA|621aa|down_7|NC_019676.1_5490209_5492072_+	pfam09369, DUF1998, Domain of unknown function (DUF1998)	NA|266aa|down_8|NC_019676.1_5492077_5492875_+	cd09132, PLDc_unchar4, Putative catalytic domain of uncharacterized phospholipase D-like proteins	NA|87aa|down_9|NC_019676.1_5492913_5493174_-	COG3655, COG3655, Predicted transcriptional regulator [Transcription]
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	24	5566325-5568673	13,23,13,14,15	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTTAAACATCAACTAATCCCTATTAGGGA----------TTGAAAC,GTTTAAACATCAACTAATCCCTATTAGGGATTGAAAC,GTTTAAACATCAACTAATCCCTATTAGGGATTGAAAC,GTTTAAACATCAACTAATCCCTATTAGGGA----------TTGAAAC,GTTTAAACATCAACTAATCCCTATTAGGGA----------TTGAAAC	47,37,37,47,47	0	0	NA	NA	N:A	28,31,31,28,28	31	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|162aa|up_5|NC_019676.1_5560625_5561111_-,NA|285aa|up_1|NC_019676.1_5564424_5565279_+,NA|311aa|down_8|NC_019676.1_5587646_5588579_-	NA|317aa|up_9|NC_019676.1_5556080_5557031_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|91aa|up_8|NC_019676.1_5557097_5557370_+	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|440aa|up_7|NC_019676.1_5557861_5559181_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|413aa|up_6|NC_019676.1_5559332_5560571_+	sd00006, TPR, Tetratricopeptide repeat	NA|162aa|up_5|NC_019676.1_5560625_5561111_-	NA	NA|277aa|up_4|NC_019676.1_5561531_5562362_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|149aa|up_3|NC_019676.1_5562817_5563264_+	COG4803, COG4803, Predicted membrane protein [Function unknown]	NA|318aa|up_2|NC_019676.1_5563299_5564253_-	COG1300, SpoIIM, Uncharacterized membrane protein [Function unknown]	NA|285aa|up_1|NC_019676.1_5564424_5565279_+	NA	NA|261aa|up_0|NC_019676.1_5565299_5566082_+	COG1714, COG1714, Predicted membrane protein/domain [Function unknown]	NA|885aa|down_0|NC_019676.1_5569476_5572131_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|417aa|down_1|NC_019676.1_5572329_5573580_-	COG0420, SbcD, DNA repair exonuclease [DNA replication, recombination, and repair]	NA|576aa|down_2|NC_019676.1_5574166_5575894_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|1002aa|down_3|NC_019676.1_5576092_5579098_+	PRK02509, PRK02509, hypothetical protein; Provisional	NA|327aa|down_4|NC_019676.1_5579383_5580364_+	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	NA|923aa|down_5|NC_019676.1_5580620_5583389_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|804aa|down_6|NC_019676.1_5583980_5586392_-	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|125aa|down_7|NC_019676.1_5586741_5587116_+	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|311aa|down_8|NC_019676.1_5587646_5588579_-	NA	NA|416aa|down_9|NC_019676.1_5588650_5589898_-	COG3330, COG3330, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	25	5764801-5764910	24	CRISPRCasFinder	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTTCAATCCCTAATAGGGATTATTTGAAATTGCAATT	38	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|102aa|up_9|NC_019676.1_5741428_5741734_-,NA|190aa|up_1|NC_019676.1_5762386_5762956_-,NA|382aa|down_0|NC_019676.1_5765022_5766168_+,NA|155aa|down_5|NC_019676.1_5773366_5773831_-	NA|102aa|up_9|NC_019676.1_5741428_5741734_-	NA	NA|157aa|up_8|NC_019676.1_5742092_5742563_-	cd15457, NADAR, Escherichia coli swarming motility protein YbiA and related proteins	NA|619aa|up_7|NC_019676.1_5742684_5744541_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|294aa|up_6|NC_019676.1_5744564_5745446_-	cd01071, PBP2_PhnD_like, Substrate binding domain of phosphonate uptake system-like, a member of the type 2 periplasmic-binding fold superfamily	NA|1837aa|up_5|NC_019676.1_5746312_5751823_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|328aa|up_4|NC_019676.1_5751786_5752770_-	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|1985aa|up_3|NC_019676.1_5752889_5758844_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|750aa|up_2|NC_019676.1_5759736_5761986_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|190aa|up_1|NC_019676.1_5762386_5762956_-	NA	NA|420aa|up_0|NC_019676.1_5763289_5764549_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|382aa|down_0|NC_019676.1_5765022_5766168_+	NA	NA|491aa|down_1|NC_019676.1_5768345_5769818_-	TIGR00387, Glycolate_oxidase_subunit_glcD	NA|339aa|down_2|NC_019676.1_5770054_5771071_-	pfam02254, TrkA_N, TrkA-N domain	NA|428aa|down_3|NC_019676.1_5771364_5772648_+	PRK02627, PRK02627, acetylornithine aminotransferase; Provisional	NA|186aa|down_4|NC_019676.1_5772796_5773354_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|155aa|down_5|NC_019676.1_5773366_5773831_-	NA	NA|372aa|down_6|NC_019676.1_5774142_5775258_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|243aa|down_7|NC_019676.1_5775376_5776105_-	cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport system	NA|166aa|down_8|NC_019676.1_5776198_5776696_-	COG1934, COG1934, Uncharacterized protein conserved in bacteria [Function unknown]	NA|394aa|down_9|NC_019676.1_5777087_5778269_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	26	5766301-5768107	16,25,14	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTT----------CAATCCCTAATAGGGATTATTTGAAATTGCAATT,GTTTCAATCCCTAATAGGGATTATTTGAAATTGCAATT,GTTTCAATCCCTAATAGGGATTATTTGAAATTGCAATT	48,38,38	0	0	NA	NA	N:A	24,24,24	24	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|190aa|up_2|NC_019676.1_5762386_5762956_-,NA|382aa|up_0|NC_019676.1_5765022_5766168_+,NA|155aa|down_4|NC_019676.1_5773366_5773831_-,NA|52aa|down_9|NC_019676.1_5778556_5778712_+	NA|157aa|up_9|NC_019676.1_5742092_5742563_-	cd15457, NADAR, Escherichia coli swarming motility protein YbiA and related proteins	NA|619aa|up_8|NC_019676.1_5742684_5744541_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|294aa|up_7|NC_019676.1_5744564_5745446_-	cd01071, PBP2_PhnD_like, Substrate binding domain of phosphonate uptake system-like, a member of the type 2 periplasmic-binding fold superfamily	NA|1837aa|up_6|NC_019676.1_5746312_5751823_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|328aa|up_5|NC_019676.1_5751786_5752770_-	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|1985aa|up_4|NC_019676.1_5752889_5758844_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|750aa|up_3|NC_019676.1_5759736_5761986_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|190aa|up_2|NC_019676.1_5762386_5762956_-	NA	NA|420aa|up_1|NC_019676.1_5763289_5764549_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|382aa|up_0|NC_019676.1_5765022_5766168_+	NA	NA|491aa|down_0|NC_019676.1_5768345_5769818_-	TIGR00387, Glycolate_oxidase_subunit_glcD	NA|339aa|down_1|NC_019676.1_5770054_5771071_-	pfam02254, TrkA_N, TrkA-N domain	NA|428aa|down_2|NC_019676.1_5771364_5772648_+	PRK02627, PRK02627, acetylornithine aminotransferase; Provisional	NA|186aa|down_3|NC_019676.1_5772796_5773354_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|155aa|down_4|NC_019676.1_5773366_5773831_-	NA	NA|372aa|down_5|NC_019676.1_5774142_5775258_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|243aa|down_6|NC_019676.1_5775376_5776105_-	cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport system	NA|166aa|down_7|NC_019676.1_5776198_5776696_-	COG1934, COG1934, Uncharacterized protein conserved in bacteria [Function unknown]	NA|394aa|down_8|NC_019676.1_5777087_5778269_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|52aa|down_9|NC_019676.1_5778556_5778712_+	NA
GCF_000316625.1_ASM31662v1	NC_019676	Nostoc sp. PCC 7107, complete sequence	27	5817107-5818597	17,26,15	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	Orphan	GTTGCAATTTCTATTAATCCCTATCAGGGA----------TTGAAAC,GTTGCAATTTCTATTAATCCCTATCAGGGATTGAAAC,GTTGCAATTTCTATTAATCCCTATCAGGGATTGAAAC	47,37,37	0	0	NA	NA	N:A	20,20,20	20	Orphan	PD-DExK,cas3,csa3,cas2,cas1,cas4,cas6,2OG_CAS,csc1gr5,csc2gr7,cas10d,WYL,c2c9_V-U4,cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,DEDDh,csx3,csm4gr5,csm5gr7,csm6,RT,c2c5_V-U5	NA|69aa|up_8|NC_019676.1_5805647_5805854_-,NA|252aa|up_3|NC_019676.1_5809927_5810683_+,NA|361aa|up_0|NC_019676.1_5815733_5816816_+,NA|317aa|down_5|NC_019676.1_5825044_5825995_+	NA|589aa|up_9|NC_019676.1_5803739_5805506_-	PLN02286, PLN02286, arginine-tRNA ligase	NA|69aa|up_8|NC_019676.1_5805647_5805854_-	NA	NA|218aa|up_7|NC_019676.1_5806214_5806868_+	COG4339, COG4339, Uncharacterized protein conserved in bacteria [Function unknown]	NA|460aa|up_6|NC_019676.1_5806919_5808299_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|211aa|up_5|NC_019676.1_5808507_5809140_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|184aa|up_4|NC_019676.1_5809139_5809691_+	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|252aa|up_3|NC_019676.1_5809927_5810683_+	NA	NA|1025aa|up_2|NC_019676.1_5810834_5813909_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|575aa|up_1|NC_019676.1_5813944_5815669_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|361aa|up_0|NC_019676.1_5815733_5816816_+	NA	NA|129aa|down_0|NC_019676.1_5818665_5819052_+	cd08352, VOC_Bs_YwkD_like, vicinal oxygen chelate (VOC) family protein  Bacillus subtilis YwkD and similar proteins	NA|256aa|down_1|NC_019676.1_5819817_5820585_+	cd03513, CrtW_beta-carotene-ketolase, Beta-carotene ketolase/oxygenase (CrtW, also known as CrtO), the carotenoid astaxanthin biosynthetic enzyme, initially catalyzes the addition of two keto groups to carbons C4 and C4' of beta-carotene	NA|184aa|down_2|NC_019676.1_5821206_5821758_+	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|91aa|down_3|NC_019676.1_5821799_5822072_+	pfam06937, EURL, EURL protein	NA|399aa|down_4|NC_019676.1_5822163_5823360_-	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|317aa|down_5|NC_019676.1_5825044_5825995_+	NA	NA|377aa|down_6|NC_019676.1_5826338_5827469_-	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|166aa|down_7|NC_019676.1_5827693_5828191_-	COG1259, COG1259, Uncharacterized conserved protein [Function unknown]	NA|227aa|down_8|NC_019676.1_5828528_5829209_+	PRK09289, PRK09289, riboflavin synthase	NA|533aa|down_9|NC_019676.1_5829270_5830869_-	pfam01551, Peptidase_M23, Peptidase family M23
