assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	1	207396-207544	1	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	CGCCAGTTCCCTACGGCGGGAAACCCGCCTACAGGACTGGACTCAC	46	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|169aa|up_8|NZ_AP017295.1_194030_194537_-,NA|126aa|up_7|NZ_AP017295.1_194688_195066_-,NA|204aa|down_0|NZ_AP017295.1_207550_208162_-	NA|260aa|up_9|NZ_AP017295.1_193174_193954_-	TIGR03069, RNA-binding_S4_domain-containing_protein, photosystem II S4 domain protein	NA|169aa|up_8|NZ_AP017295.1_194030_194537_-	NA	NA|126aa|up_7|NZ_AP017295.1_194688_195066_-	NA	NA|703aa|up_6|NZ_AP017295.1_195263_197372_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|520aa|up_5|NZ_AP017295.1_197476_199036_-	TIGR02655, Circadian_clock_protein_kinase_KaiC, circadian clock protein KaiC	NA|109aa|up_4|NZ_AP017295.1_199112_199439_-	PRK09301, PRK09301, circadian clock protein KaiB; Provisional	NA|103aa|up_3|NZ_AP017295.1_199544_199853_-	pfam07688, KaiA, KaiA C-terminal domain	NA|1072aa|up_2|NZ_AP017295.1_200782_203998_+	PRK11091, PRK11091, aerobic respiration control sensor protein ArcB; Provisional	NA|294aa|up_1|NZ_AP017295.1_204166_205048_-	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|636aa|up_0|NZ_AP017295.1_205255_207163_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|204aa|down_0|NZ_AP017295.1_207550_208162_-	NA	NA|111aa|down_1|NZ_AP017295.1_208681_209014_+	pfam04483, DUF565, Protein of unknown function (DUF565)	NA|330aa|down_2|NZ_AP017295.1_209046_210036_+	COG4240, COG4240, Predicted kinase [General function prediction only]	NA|332aa|down_3|NZ_AP017295.1_210129_211125_-	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|1690aa|down_4|NZ_AP017295.1_211121_216191_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|486aa|down_5|NZ_AP017295.1_216858_218316_-	pfam02696, UPF0061, Uncharacterized ACR, YdiU/UPF0061 family	NA|460aa|down_6|NZ_AP017295.1_218992_220372_+	pfam13379, NMT1_2, NMT1-like family	NA|280aa|down_7|NZ_AP017295.1_220485_221325_+	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein	NA|668aa|down_8|NZ_AP017295.1_221460_223464_+	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D	NA|285aa|down_9|NZ_AP017295.1_224028_224883_+	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	2	754311-755527	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GTTTCCATACCTCAAATCCCCTCACGGGGACTGAAAC,GTTTCCATACCTCAAATCCCCTCACGGGGACTGAAAC,GTTTCCATACCTCAAATCCCCTCACGGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	15,16,16	16	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA,NA|127aa|down_3|NZ_AP017295.1_761297_761678_-,NA|167aa|down_7|NZ_AP017295.1_764721_765222_-,NA|111aa|down_9|NZ_AP017295.1_766186_766519_-	NA|589aa|up_9|NZ_AP017295.1_738380_740147_+	PLN02286, PLN02286, arginine-tRNA ligase	NA|879aa|up_8|NZ_AP017295.1_740235_742872_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|588aa|up_7|NZ_AP017295.1_743404_745168_-	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]	NA|336aa|up_6|NZ_AP017295.1_745372_746380_-	COG2130, COG2130, Putative NADP-dependent oxidoreductases [General function prediction only]	NA|247aa|up_5|NZ_AP017295.1_746496_747237_-	COG3772, COG3772, Phage-related lysozyme (muraminidase) [General function prediction only]	NA|433aa|up_4|NZ_AP017295.1_747720_749019_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|127aa|up_3|NZ_AP017295.1_749031_749412_-	pfam11535, Calci_bind_CcbP, Calcium binding	NA|261aa|up_2|NZ_AP017295.1_749939_750722_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|743aa|up_1|NZ_AP017295.1_751011_753240_+	PRK09567, nirA, NirA family protein	NA|235aa|up_0|NZ_AP017295.1_753239_753944_+	PRK05990, PRK05990, precorrin-2 C(20)-methyltransferase; Reviewed	NA|516aa|down_0|NZ_AP017295.1_755896_757444_+	PRK06370, PRK06370, FAD-containing oxidoreductase	NA|873aa|down_1|NZ_AP017295.1_757494_760113_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|230aa|down_2|NZ_AP017295.1_760452_761142_+	cd05373, SDR_c10, classical (c) SDR, subgroup  10	NA|127aa|down_3|NZ_AP017295.1_761297_761678_-	NA	NA|233aa|down_4|NZ_AP017295.1_761797_762496_+	PRK13972, PRK13972, GSH-dependent disulfide bond oxidoreductase; Provisional	NA|388aa|down_5|NZ_AP017295.1_762651_763815_+	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|235aa|down_6|NZ_AP017295.1_763992_764697_-	cd05386, TraL, transfer origin protein TraL	NA|167aa|down_7|NZ_AP017295.1_764721_765222_-	NA	NA|164aa|down_8|NZ_AP017295.1_765500_765992_+	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|111aa|down_9|NZ_AP017295.1_766186_766519_-	NA
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	3	1232241-1232366	3	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	TACACCAATTTTACAAGAATACACAACTGTACA	33	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|54aa|up_9|NZ_AP017295.1_1220851_1221013_+,NA|117aa|up_1|NZ_AP017295.1_1230524_1230875_+,NA|128aa|up_0|NZ_AP017295.1_1230953_1231337_+,NA|759aa|down_0|NZ_AP017295.1_1233635_1235912_-	NA|54aa|up_9|NZ_AP017295.1_1220851_1221013_+	NA	NA|663aa|up_8|NZ_AP017295.1_1221029_1223018_-	COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones]	NA|424aa|up_7|NZ_AP017295.1_1223014_1224286_-	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|592aa|up_6|NZ_AP017295.1_1224788_1226564_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|282aa|up_5|NZ_AP017295.1_1226845_1227691_+	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|554aa|up_4|NZ_AP017295.1_1227705_1229367_+	COG3497, COG3497, Phage tail sheath protein FI [General function prediction only]	NA|152aa|up_3|NZ_AP017295.1_1229423_1229879_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|164aa|up_2|NZ_AP017295.1_1229901_1230393_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|117aa|up_1|NZ_AP017295.1_1230524_1230875_+	NA	NA|128aa|up_0|NZ_AP017295.1_1230953_1231337_+	NA	NA|759aa|down_0|NZ_AP017295.1_1233635_1235912_-	NA	NA|742aa|down_1|NZ_AP017295.1_1235925_1238151_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|216aa|down_2|NZ_AP017295.1_1238711_1239359_-	pfam02183, HALZ, Homeobox associated leucine zipper	NA|267aa|down_3|NZ_AP017295.1_1239601_1240402_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|104aa|down_4|NZ_AP017295.1_1240561_1240873_-	pfam11460, DUF3007, Protein of unknown function (DUF3007)	NA|71aa|down_5|NZ_AP017295.1_1240897_1241110_-	pfam10716, NdhL, NADH dehydrogenase transmembrane subunit	NA|611aa|down_6|NZ_AP017295.1_1242042_1243875_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|523aa|down_7|NZ_AP017295.1_1244055_1245624_+	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|250aa|down_8|NZ_AP017295.1_1245642_1246392_+	pfam13847, Methyltransf_31, Methyltransferase domain	NA|360aa|down_9|NZ_AP017295.1_1246503_1247583_-	PRK12704, PRK12704, phosphodiesterase; Provisional
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	4	2038452-2038820	2	CRT	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	CCANTTACCATTACCAANAGT	21	0	0	NA	NA	NA	8	8	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA,NA|211aa|down_0|NZ_AP017295.1_2039320_2039953_-,NA|414aa|down_8|NZ_AP017295.1_2050629_2051871_-	NA|184aa|up_9|NZ_AP017295.1_2022123_2022675_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|737aa|up_8|NZ_AP017295.1_2022671_2024882_-	cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family	NA|140aa|up_7|NZ_AP017295.1_2025032_2025452_-	COG4270, COG4270, Predicted membrane protein [Function unknown]	NA|1212aa|up_6|NZ_AP017295.1_2026306_2029942_+	pfam03160, Calx-beta, Calx-beta domain	NA|198aa|up_5|NZ_AP017295.1_2030609_2031203_+	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|807aa|up_4|NZ_AP017295.1_2031300_2033721_-	TIGR02470, Sucrose_synthase_1, sucrose synthase	NA|317aa|up_3|NZ_AP017295.1_2034025_2034976_-	cd08419, PBP2_CbbR_RubisCO_like, The C-terminal substrate binding of LysR-type transcriptional regulator (CbbR) of RubisCO operon, which is involved in the carbon dioxide fixation, contains the type 2 periplasmic binding fold	NA|226aa|up_2|NZ_AP017295.1_2035221_2035899_-	COG3544, COG3544, Uncharacterized protein conserved in bacteria [Function unknown]	NA|241aa|up_1|NZ_AP017295.1_2036021_2036744_-	cd02108, bact_SO_family_Moco, bacterial subgroup of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|200aa|up_0|NZ_AP017295.1_2037089_2037689_-	COG4117, COG4117, Thiosulfate reductase cytochrome B subunit (membrane anchoring protein) [Energy production and conversion]	NA|211aa|down_0|NZ_AP017295.1_2039320_2039953_-	NA	NA|355aa|down_1|NZ_AP017295.1_2040417_2041482_-	PRK13396, PRK13396, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|373aa|down_2|NZ_AP017295.1_2041571_2042690_-	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|409aa|down_3|NZ_AP017295.1_2042799_2044026_-	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|275aa|down_4|NZ_AP017295.1_2044476_2045301_-	PLN02591, PLN02591, tryptophan synthase	NA|312aa|down_5|NZ_AP017295.1_2045456_2046392_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|732aa|down_6|NZ_AP017295.1_2046348_2048544_-	PRK13566, PRK13566, anthranilate synthase component I	NA|461aa|down_7|NZ_AP017295.1_2049059_2050442_-	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase	NA|414aa|down_8|NZ_AP017295.1_2050629_2051871_-	NA	NA|400aa|down_9|NZ_AP017295.1_2051875_2053075_-	PRK06203, aroB, 3-dehydroquinate synthase; Reviewed
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	5	2153878-2153978	4	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GCGATTTCTTGGGGACGATTCCC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|361aa|up_9|NZ_AP017295.1_2142028_2143111_+,NA|270aa|down_5|NZ_AP017295.1_2161213_2162023_+,NA|203aa|down_9|NZ_AP017295.1_2167107_2167716_-	NA|361aa|up_9|NZ_AP017295.1_2142028_2143111_+	NA	NA|377aa|up_8|NZ_AP017295.1_2143235_2144366_-	PRK11858, aksA, trans-homoaconitate synthase; Reviewed	NA|168aa|up_7|NZ_AP017295.1_2144707_2145211_-	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|324aa|up_6|NZ_AP017295.1_2145698_2146670_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|354aa|up_5|NZ_AP017295.1_2146819_2147881_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|109aa|up_4|NZ_AP017295.1_2148020_2148347_-	pfam18032, FRP, Photoprotection regulator fluorescence recovery protein	NA|320aa|up_3|NZ_AP017295.1_2148500_2149460_-	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|330aa|up_2|NZ_AP017295.1_2150097_2151087_+	cd01137, PsaA, Metal binding protein PsaA	NA|173aa|up_1|NZ_AP017295.1_2151147_2151666_-	pfam04229, GrpB, GrpB protein	NA|406aa|up_0|NZ_AP017295.1_2151699_2152917_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|486aa|down_0|NZ_AP017295.1_2154949_2156407_-	TIGR04096, conserved_hypothetical_protein, DNA phosphorothioation-associated putative methyltransferase	NA|134aa|down_1|NZ_AP017295.1_2156437_2156839_-	pfam08870, DndE, DNA sulphur modification protein DndE	NA|119aa|down_2|NZ_AP017295.1_2156937_2157294_+	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|663aa|down_3|NZ_AP017295.1_2157329_2159318_-	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|338aa|down_4|NZ_AP017295.1_2159798_2160812_+	COG4301, COG4301, Uncharacterized conserved protein [Function unknown]	NA|270aa|down_5|NZ_AP017295.1_2161213_2162023_+	NA	NA|550aa|down_6|NZ_AP017295.1_2163583_2165233_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|425aa|down_7|NZ_AP017295.1_2165260_2166535_+	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|73aa|down_8|NZ_AP017295.1_2166627_2166846_-	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|203aa|down_9|NZ_AP017295.1_2167107_2167716_-	NA
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	6	2430442-2431304	5	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GTATCGGGGTCATCTTTAAAACCCCTGGCTAATTCTTGTACCGCCGCACG	50	0	0	NA	NA	NA	8	8	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|124aa|up_0|NZ_AP017295.1_2429847_2430219_+,NA|77aa|down_6|NZ_AP017295.1_2440256_2440487_+	NA|1178aa|up_9|NZ_AP017295.1_2413011_2416545_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|327aa|up_8|NZ_AP017295.1_2416959_2417940_+	PRK12324, PRK12324, decaprenyl-phosphate phosphoribosyltransferase	NA|195aa|up_7|NZ_AP017295.1_2418042_2418627_+	TIGR01668, Uncharacterized_protein_YqeG, HAD superfamily (subfamily IIIA) phosphatase, TIGR01668	NA|113aa|up_6|NZ_AP017295.1_2418722_2419061_+	COG2076, EmrE, Membrane transporters of cations and cationic drugs [Inorganic ion transport and metabolism]	NA|1080aa|up_5|NZ_AP017295.1_2419119_2422359_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|317aa|up_4|NZ_AP017295.1_2422438_2423389_-	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|426aa|up_3|NZ_AP017295.1_2423765_2425043_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|651aa|up_2|NZ_AP017295.1_2425152_2427105_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|584aa|up_1|NZ_AP017295.1_2427335_2429087_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|124aa|up_0|NZ_AP017295.1_2429847_2430219_+	NA	NA|103aa|down_0|NZ_AP017295.1_2434088_2434397_-	TIGR03792, conserved_hypothetical_protein, uncharacterized cyanobacterial protein, TIGR03792 family	NA|108aa|down_1|NZ_AP017295.1_2434531_2434855_+	COG3937, COG3937, Uncharacterized conserved protein [Function unknown]	NA|174aa|down_2|NZ_AP017295.1_2434931_2435453_+	COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]	NA|880aa|down_3|NZ_AP017295.1_2435602_2438242_-	PRK07773, PRK07773, replicative DNA helicase; Validated	NA|153aa|down_4|NZ_AP017295.1_2438389_2438848_-	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|258aa|down_5|NZ_AP017295.1_2439055_2439829_-	TIGR03413, GSH_gloB, hydroxyacylglutathione hydrolase	NA|77aa|down_6|NZ_AP017295.1_2440256_2440487_+	NA	NA|399aa|down_7|NZ_AP017295.1_2440680_2441877_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|756aa|down_8|NZ_AP017295.1_2441987_2444255_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|401aa|down_9|NZ_AP017295.1_2444251_2445454_+	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	7	2760951-2761139	2,6	PILER-CR,CRISPRCasFinder	no	cas6,cas8b3,cas7,cas5	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Unclear	TTTGAGGTGATTCGTGGCTTGATGCTGTTAGGCGTTGCTCAAAG,GTGATTCGTGGCTTGATGCTGTTAGGCGTTGCTCAA	44,36	0	0	NA	NA	NA:NA	2,2	2	Unclear	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|66aa|up_4|NZ_AP017295.1_2756314_2756512_+,NA|59aa|down_6|NZ_AP017295.1_2773722_2773899_+	NA|492aa|up_9|NZ_AP017295.1_2749248_2750724_-	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|327aa|up_8|NZ_AP017295.1_2750967_2751948_-	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|332aa|up_7|NZ_AP017295.1_2751965_2752961_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|231aa|up_6|NZ_AP017295.1_2753327_2754020_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|564aa|up_5|NZ_AP017295.1_2754318_2756010_+	COG1226, Kch, Kef-type K+ transport systems, predicted NAD-binding component [Inorganic ion transport and metabolism]	NA|66aa|up_4|NZ_AP017295.1_2756314_2756512_+	NA	cas6|221aa|up_3|NZ_AP017295.1_2756926_2757589_+	pfam09559, Cas6, Cas6 Crispr	cas8b3|512aa|up_2|NZ_AP017295.1_2757591_2759127_+	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|323aa|up_1|NZ_AP017295.1_2759104_2760073_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|213aa|up_0|NZ_AP017295.1_2760069_2760708_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	NA|611aa|down_0|NZ_AP017295.1_2761479_2763312_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|1453aa|down_1|NZ_AP017295.1_2763351_2767710_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|126aa|down_2|NZ_AP017295.1_2767744_2768122_+	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|122aa|down_3|NZ_AP017295.1_2768229_2768595_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|141aa|down_4|NZ_AP017295.1_2768649_2769072_+	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|1244aa|down_5|NZ_AP017295.1_2769619_2773351_+	pfam00990, GGDEF, Diguanylate cyclase, GGDEF domain	NA|59aa|down_6|NZ_AP017295.1_2773722_2773899_+	NA	NA|227aa|down_7|NZ_AP017295.1_2774127_2774808_+	cd17534, REC_DC-like, phosphoacceptor receiver (REC) domain of modulated diguanylate cyclase and similar domains	NA|128aa|down_8|NZ_AP017295.1_2774817_2775201_+	pfam00072, Response_reg, Response regulator receiver domain	NA|1024aa|down_9|NZ_AP017295.1_2775271_2778343_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	8	2818271-2818374	7	CRISPRCasFinder	no	csa3	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Type I-A	GGTGTTGGGGTTGGTGTTGGGGT	23	1	3	2818333-2818351|2818333-2818351|2818333-2818351	NZ_AP017295.1_714657-714675|NZ_AP017295.1_1851013-1850995|NZ_AP017295.1_5812177-5812159	NA	2	2	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|119aa|up_8|NZ_AP017295.1_2806337_2806694_+,NA|120aa|up_5|NZ_AP017295.1_2811555_2811915_-,NA|54aa|up_3|NZ_AP017295.1_2814770_2814932_+,NA|158aa|down_6|NZ_AP017295.1_2827697_2828171_-	NA|278aa|up_9|NZ_AP017295.1_2804940_2805774_-	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|119aa|up_8|NZ_AP017295.1_2806337_2806694_+	NA	NA|122aa|up_7|NZ_AP017295.1_2809110_2809476_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|333aa|up_6|NZ_AP017295.1_2809661_2810660_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|120aa|up_5|NZ_AP017295.1_2811555_2811915_-	NA	NA|281aa|up_4|NZ_AP017295.1_2813136_2813979_+	COG1589, FtsQ, Cell division septal protein [Cell envelope biogenesis, outer membrane]	NA|54aa|up_3|NZ_AP017295.1_2814770_2814932_+	NA	NA|428aa|up_2|NZ_AP017295.1_2815051_2816335_+	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|322aa|up_1|NZ_AP017295.1_2816656_2817622_-	PRK05246, PRK05246, glutathione synthetase; Provisional	NA|88aa|up_0|NZ_AP017295.1_2817725_2817989_-	TIGR02181, GRX_bact, Glutaredoxin, GrxC family	NA|297aa|down_0|NZ_AP017295.1_2819730_2820621_-	COG3751, EGL-9, Predicted proline hydroxylase [Posttranslational modification, protein turnover, chaperones]	NA|580aa|down_1|NZ_AP017295.1_2820984_2822724_-	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|325aa|down_2|NZ_AP017295.1_2823127_2824102_-	PRK06245, cofG, FO synthase subunit 1; Reviewed	NA|361aa|down_3|NZ_AP017295.1_2824350_2825433_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|440aa|down_4|NZ_AP017295.1_2825640_2826960_-	PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional	NA|138aa|down_5|NZ_AP017295.1_2827219_2827633_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase	NA|158aa|down_6|NZ_AP017295.1_2827697_2828171_-	NA	NA|162aa|down_7|NZ_AP017295.1_2828838_2829324_+	CHL00086, apcA, allophycocyanin alpha subunit	NA|242aa|down_8|NZ_AP017295.1_2829736_2830462_-	TIGR02057, Phosphoadenosine_phosphosulfate_reductase, phosphoadenosine phosphosulfate reductase, thioredoxin dependent	csa3|132aa|down_9|NZ_AP017295.1_2830712_2831108_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	9	2840584-2841559	8,3,3	CRISPRCasFinder,CRT,PILER-CR	no	csa3	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Type I-A	GTTTCAGTCCCCTTGCGGGGTAATGATTTGTGGAAAC,GTTTCAGTCCCCTTGCGGGGTAATGATTTGTGGAAAC,GTTTCCACAAATCATTACCCCGCAAGGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	13,13,12	13	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|52aa|up_0|NZ_AP017295.1_2840155_2840311_+,NA|120aa|down_1|NZ_AP017295.1_2842993_2843353_-,NA|80aa|down_2|NZ_AP017295.1_2843691_2843931_-,NA|52aa|down_3|NZ_AP017295.1_2844156_2844312_+,NA|92aa|down_5|NZ_AP017295.1_2845597_2845873_-	csa3|132aa|up_9|NZ_AP017295.1_2830712_2831108_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|837aa|up_8|NZ_AP017295.1_2831238_2833749_+	COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|301aa|up_7|NZ_AP017295.1_2834360_2835263_+	COG4360, APA2, ATP adenylyltransferase (5',5'''-P-1,P-4-tetraphosphate phosphorylase II) [Nucleotide transport and metabolism]	NA|195aa|up_6|NZ_AP017295.1_2835318_2835903_-	pfam01949, DUF99, Protein of unknown function DUF99	NA|160aa|up_5|NZ_AP017295.1_2836042_2836522_+	cd17036, T3SC_YbjN-like_1, T110839 is structurally similar to type III secretion system chaperones and YbjN family proteins	NA|249aa|up_4|NZ_AP017295.1_2836539_2837286_+	COG0095, LplA, Lipoate-protein ligase A [Coenzyme metabolism]	NA|373aa|up_3|NZ_AP017295.1_2837304_2838423_-	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|217aa|up_2|NZ_AP017295.1_2838524_2839175_-	pfam10063, DUF2301, Uncharacterized integral membrane protein (DUF2301)	NA|267aa|up_1|NZ_AP017295.1_2839243_2840044_-	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase	NA|52aa|up_0|NZ_AP017295.1_2840155_2840311_+	NA	NA|240aa|down_0|NZ_AP017295.1_2842109_2842829_-	PRK10602, PRK10602, murein tripeptide amidase MpaA	NA|120aa|down_1|NZ_AP017295.1_2842993_2843353_-	NA	NA|80aa|down_2|NZ_AP017295.1_2843691_2843931_-	NA	NA|52aa|down_3|NZ_AP017295.1_2844156_2844312_+	NA	NA|336aa|down_4|NZ_AP017295.1_2844428_2845436_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|92aa|down_5|NZ_AP017295.1_2845597_2845873_-	NA	NA|185aa|down_6|NZ_AP017295.1_2847009_2847564_+	COG3161, UbiC, 4-hydroxybenzoate synthetase (chorismate lyase) [Coenzyme metabolism]	NA|514aa|down_7|NZ_AP017295.1_2847592_2849134_+	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|85aa|down_8|NZ_AP017295.1_2849127_2849382_+	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|1125aa|down_9|NZ_AP017295.1_2849436_2852811_+	PRK12467, PRK12467, peptide synthase; Provisional
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	10	2928882-2929583	4,9,4	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GTTTCCACAAACTTTTACCCCGCAAGGGGACTGAAAC,GTTTCCACAAACTTTTACCCCGCAAGGGGACTGAAAC,GTTTCCACAAACTTTTACCCCGCAAGGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	9,9,9	9	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|202aa|up_7|NZ_AP017295.1_2918093_2918699_+,NA|126aa|up_5|NZ_AP017295.1_2919327_2919705_+,NA|89aa|down_2|NZ_AP017295.1_2932267_2932534_+	NA|74aa|up_9|NZ_AP017295.1_2914750_2914972_+	cd08283, FDH_like_1, Glutathione-dependent formaldehyde dehydrogenase related proteins, child 1	NA|886aa|up_8|NZ_AP017295.1_2915088_2917746_-	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|202aa|up_7|NZ_AP017295.1_2918093_2918699_+	NA	NA|51aa|up_6|NZ_AP017295.1_2918710_2918863_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|126aa|up_5|NZ_AP017295.1_2919327_2919705_+	NA	NA|647aa|up_4|NZ_AP017295.1_2919826_2921767_-	pfam00656, Peptidase_C14, Caspase domain	NA|819aa|up_3|NZ_AP017295.1_2922154_2924611_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|292aa|up_2|NZ_AP017295.1_2925063_2925939_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|495aa|up_1|NZ_AP017295.1_2926125_2927610_+	cd10798, GH57N_like_1, Uncharacterized subfamily of  glycoside hydrolase family 57 (GH57)	NA|252aa|up_0|NZ_AP017295.1_2927755_2928511_+	COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms]	NA|303aa|down_0|NZ_AP017295.1_2929743_2930652_+	COG1805, NqrB, Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrB [Energy production and conversion]	NA|457aa|down_1|NZ_AP017295.1_2930753_2932124_+	COG4402, COG4402, Uncharacterized protein conserved in bacteria [Function unknown]	NA|89aa|down_2|NZ_AP017295.1_2932267_2932534_+	NA	NA|323aa|down_3|NZ_AP017295.1_2932816_2933785_+	pfam05982, Sbt_1, Na+-dependent bicarbonate transporter superfamily	NA|104aa|down_4|NZ_AP017295.1_2933789_2934101_+	COG0347, GlnK, Nitrogen regulatory protein PII [Amino acid transport and metabolism]	NA|382aa|down_5|NZ_AP017295.1_2934102_2935248_-	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|174aa|down_6|NZ_AP017295.1_2935390_2935912_+	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|532aa|down_7|NZ_AP017295.1_2936712_2938308_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|472aa|down_8|NZ_AP017295.1_2938568_2939984_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|276aa|down_9|NZ_AP017295.1_2940094_2940922_+	cd01828, sialate_O-acetylesterase_like2, sialate_O-acetylesterase_like subfamily of the SGNH-hydrolases, a diverse family of lipases and esterases
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	11	4075667-4075848	10	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	TATTGGTGGTGAGGGAAATGATACT	25	0	0	NA	NA	NA	3	3	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA,NA|390aa|down_4|NZ_AP017295.1_4086769_4087939_-,NA|83aa|down_8|NZ_AP017295.1_4103371_4103620_-	NA|170aa|up_9|NZ_AP017295.1_4060318_4060828_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|154aa|up_8|NZ_AP017295.1_4060832_4061294_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|236aa|up_7|NZ_AP017295.1_4061295_4062003_-	COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|570aa|up_6|NZ_AP017295.1_4062286_4063996_-	PRK08275, PRK08275, putative oxidoreductase; Provisional	NA|335aa|up_5|NZ_AP017295.1_4064494_4065499_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|786aa|up_4|NZ_AP017295.1_4065488_4067846_+	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	NA|291aa|up_3|NZ_AP017295.1_4068072_4068945_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|274aa|up_2|NZ_AP017295.1_4068956_4069778_-	pfam13911, AhpC-TSA_2, AhpC/TSA antioxidant enzyme	NA|798aa|up_1|NZ_AP017295.1_4070056_4072450_+	TIGR01901, Heme/hemopexin-binding_protein, filamentous hemagglutinin family N-terminal domain	NA|844aa|up_0|NZ_AP017295.1_4072548_4075080_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1087aa|down_0|NZ_AP017295.1_4079271_4082532_+	cd01948, EAL, EAL domain	NA|120aa|down_1|NZ_AP017295.1_4082606_4082966_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|367aa|down_2|NZ_AP017295.1_4083464_4084565_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|674aa|down_3|NZ_AP017295.1_4084608_4086630_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|390aa|down_4|NZ_AP017295.1_4086769_4087939_-	NA	NA|4198aa|down_5|NZ_AP017295.1_4088076_4100670_-	PRK12316, PRK12316, peptide synthase; Provisional	NA|321aa|down_6|NZ_AP017295.1_4101293_4102256_-	COG2339, prsW, Membrane proteinase, regulator of anti-sigma factor [Posttranslational modification, protein turnover, chaperones]	NA|307aa|down_7|NZ_AP017295.1_4102248_4103169_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|83aa|down_8|NZ_AP017295.1_4103371_4103620_-	NA	NA|82aa|down_9|NZ_AP017295.1_4103660_4103906_-	pfam14279, HNH_5, HNH endonuclease
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	12	4076192-4076270	11	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	TATTGGTGGTGAGGGAAATGATACT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA,NA|390aa|down_4|NZ_AP017295.1_4086769_4087939_-,NA|83aa|down_8|NZ_AP017295.1_4103371_4103620_-	NA|170aa|up_9|NZ_AP017295.1_4060318_4060828_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|154aa|up_8|NZ_AP017295.1_4060832_4061294_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|236aa|up_7|NZ_AP017295.1_4061295_4062003_-	COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|570aa|up_6|NZ_AP017295.1_4062286_4063996_-	PRK08275, PRK08275, putative oxidoreductase; Provisional	NA|335aa|up_5|NZ_AP017295.1_4064494_4065499_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|786aa|up_4|NZ_AP017295.1_4065488_4067846_+	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	NA|291aa|up_3|NZ_AP017295.1_4068072_4068945_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|274aa|up_2|NZ_AP017295.1_4068956_4069778_-	pfam13911, AhpC-TSA_2, AhpC/TSA antioxidant enzyme	NA|798aa|up_1|NZ_AP017295.1_4070056_4072450_+	TIGR01901, Heme/hemopexin-binding_protein, filamentous hemagglutinin family N-terminal domain	NA|844aa|up_0|NZ_AP017295.1_4072548_4075080_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1087aa|down_0|NZ_AP017295.1_4079271_4082532_+	cd01948, EAL, EAL domain	NA|120aa|down_1|NZ_AP017295.1_4082606_4082966_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|367aa|down_2|NZ_AP017295.1_4083464_4084565_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|674aa|down_3|NZ_AP017295.1_4084608_4086630_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|390aa|down_4|NZ_AP017295.1_4086769_4087939_-	NA	NA|4198aa|down_5|NZ_AP017295.1_4088076_4100670_-	PRK12316, PRK12316, peptide synthase; Provisional	NA|321aa|down_6|NZ_AP017295.1_4101293_4102256_-	COG2339, prsW, Membrane proteinase, regulator of anti-sigma factor [Posttranslational modification, protein turnover, chaperones]	NA|307aa|down_7|NZ_AP017295.1_4102248_4103169_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|83aa|down_8|NZ_AP017295.1_4103371_4103620_-	NA	NA|82aa|down_9|NZ_AP017295.1_4103660_4103906_-	pfam14279, HNH_5, HNH endonuclease
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	13	4162919-4163018	12	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	TAAGACTTGTGTGTACACCGTAGCCTT	27	1	4	4162946-4162991|4162946-4162991|4162946-4162991|4162946-4162991	NZ_AP017295.1_5429531-5429486|NZ_AP017295.1_6296105-6296060|NZ_AP017295.1_6232810-6232855|NZ_AP017295.1_1721882-1721927	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|80aa|up_9|NZ_AP017295.1_4154310_4154550_-,NA|101aa|up_3|NZ_AP017295.1_4158674_4158977_-,NA|47aa|down_4|NZ_AP017295.1_4167321_4167462_-,NA|55aa|down_7|NZ_AP017295.1_4171899_4172064_-	NA|80aa|up_9|NZ_AP017295.1_4154310_4154550_-	NA	NA|112aa|up_8|NZ_AP017295.1_4154723_4155059_-	COG4244, COG4244, Predicted membrane protein [Function unknown]	NA|203aa|up_7|NZ_AP017295.1_4156151_4156760_+	pfam09383, NIL, NIL domain	NA|113aa|up_6|NZ_AP017295.1_4156813_4157152_+	pfam09383, NIL, NIL domain	NA|197aa|up_5|NZ_AP017295.1_4157157_4157748_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|244aa|up_4|NZ_AP017295.1_4157922_4158654_-	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|101aa|up_3|NZ_AP017295.1_4158674_4158977_-	NA	NA|167aa|up_2|NZ_AP017295.1_4159011_4159512_+	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|279aa|up_1|NZ_AP017295.1_4160043_4160880_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|189aa|up_0|NZ_AP017295.1_4161695_4162262_+	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|502aa|down_0|NZ_AP017295.1_4163070_4164576_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|142aa|down_1|NZ_AP017295.1_4165379_4165805_+	pfam04972, BON, BON domain	NA|282aa|down_2|NZ_AP017295.1_4166020_4166866_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|140aa|down_3|NZ_AP017295.1_4166865_4167285_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|47aa|down_4|NZ_AP017295.1_4167321_4167462_-	NA	NA|246aa|down_5|NZ_AP017295.1_4167603_4168341_+	PRK00886, PRK00886, 2-phosphosulfolactate phosphatase family protein	NA|794aa|down_6|NZ_AP017295.1_4169082_4171464_+	PRK05261, PRK05261, phosphoketolase	NA|55aa|down_7|NZ_AP017295.1_4171899_4172064_-	NA	NA|125aa|down_8|NZ_AP017295.1_4172232_4172607_+	cd00454, TrHb1_N, truncated hemoglobins (TrHbs, 2/2Hb, 2/2 globins); group 1 (N)	NA|393aa|down_9|NZ_AP017295.1_4172832_4174011_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	14	4254432-4255639	5,13,5,6	CRT,CRISPRCasFinder,PILER-CR,PILER-CR	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas3,WYL	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Type I-D	GTTTCAGTCCCCGTGAGGGGAATTAGTTAGTTGAAAG,GTTTCAGTCCCCGTGAGGGGAATTAGTTAGTTGAAAG,CTTTCAACTAACTAATTCCCCTCACGGGGACTGAAAC,CTTTCAACTAACTAATTCCCCTCACGGGGACTGAAAC	37,37,37,37	0	0	NA	NA	NA:NA:NA:NA	16,15,12,12	16	TypeI-D	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|105aa|up_6|NZ_AP017295.1_4249506_4249821_-,NA|100aa|up_5|NZ_AP017295.1_4250205_4250505_+,NA|49aa|up_1|NZ_AP017295.1_4253471_4253618_-,NA	NA|494aa|up_9|NZ_AP017295.1_4243736_4245218_-	PRK06938, PRK06938, diaminobutyrate--2-oxoglutarate aminotransferase; Provisional	NA|861aa|up_8|NZ_AP017295.1_4245736_4248319_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|317aa|up_7|NZ_AP017295.1_4248405_4249356_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|105aa|up_6|NZ_AP017295.1_4249506_4249821_-	NA	NA|100aa|up_5|NZ_AP017295.1_4250205_4250505_+	NA	NA|215aa|up_4|NZ_AP017295.1_4250487_4251132_-	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|394aa|up_3|NZ_AP017295.1_4251245_4252427_-	PRK03080, PRK03080, phosphoserine transaminase	NA|299aa|up_2|NZ_AP017295.1_4252523_4253420_-	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|49aa|up_1|NZ_AP017295.1_4253471_4253618_-	NA	NA|249aa|up_0|NZ_AP017295.1_4253674_4254421_+	PRK14832, PRK14832, undecaprenyl pyrophosphate synthase; Provisional	cas2|98aa|down_0|NZ_AP017295.1_4255848_4256142_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NZ_AP017295.1_4256321_4257299_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|201aa|down_2|NZ_AP017295.1_4257390_4257993_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|269aa|down_3|NZ_AP017295.1_4258051_4258858_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csc1gr5|241aa|down_4|NZ_AP017295.1_4258829_4259552_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|331aa|down_5|NZ_AP017295.1_4259722_4260715_-	pfam18320, Csc2, Csc2 Crispr	cas3|710aa|down_6|NZ_AP017295.1_4263681_4265811_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	WYL|315aa|down_7|NZ_AP017295.1_4265862_4266807_+	pfam13280, WYL, WYL domain	NA|332aa|down_8|NZ_AP017295.1_4266877_4267873_+	pfam00487, FA_desaturase, Fatty acid desaturase	NA|293aa|down_9|NZ_AP017295.1_4267940_4268819_-	PRK02755, truB, tRNA pseudouridine synthase B; Provisional
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	15	4324247-4324573	14,6,7	CRISPRCasFinder,CRT,PILER-CR	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	TTCAACCCGCCTCTAGCTGGGAGGGGTGTTGAAAC,CCGCCTCTAGCTGGGAGGGGTGTTGAAAC,CTTTCAACCCGCCTCTAGCTGGGAGGGGTGTTGAAAC	35,29,37	0	0	NA	NA	V-U5:NA:V-U5	4,4,3	4	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA,NA|79aa|down_2|NZ_AP017295.1_4328815_4329052_+,NA|78aa|down_7|NZ_AP017295.1_4333196_4333430_-	NA|212aa|up_9|NZ_AP017295.1_4307806_4308442_-	PRK00698, tmk, thymidylate kinase; Validated	NA|445aa|up_8|NZ_AP017295.1_4308704_4310039_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|284aa|up_7|NZ_AP017295.1_4310563_4311415_+	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|674aa|up_6|NZ_AP017295.1_4311621_4313643_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|1315aa|up_5|NZ_AP017295.1_4313727_4317672_+	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|407aa|up_4|NZ_AP017295.1_4317690_4318911_+	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|152aa|up_3|NZ_AP017295.1_4319043_4319499_-	pfam01475, FUR, Ferric uptake regulator family	NA|417aa|up_2|NZ_AP017295.1_4320008_4321259_-	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|309aa|up_1|NZ_AP017295.1_4321831_4322758_-	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|297aa|up_0|NZ_AP017295.1_4322822_4323713_-	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|224aa|down_0|NZ_AP017295.1_4325212_4325884_-	pfam13401, AAA_22, AAA domain	NA|619aa|down_1|NZ_AP017295.1_4325887_4327744_-	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|79aa|down_2|NZ_AP017295.1_4328815_4329052_+	NA	NA|345aa|down_3|NZ_AP017295.1_4329164_4330199_-	COG1647, COG1647, Esterase/lipase [General function prediction only]	NA|83aa|down_4|NZ_AP017295.1_4330363_4330612_-	pfam10718, Ycf34, Hypothetical chloroplast protein Ycf34	NA|423aa|down_5|NZ_AP017295.1_4330698_4331967_+	COG0617, PcnB, tRNA nucleotidyltransferase/poly(A) polymerase [Translation, ribosomal structure and biogenesis]	NA|307aa|down_6|NZ_AP017295.1_4332229_4333150_+	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|78aa|down_7|NZ_AP017295.1_4333196_4333430_-	NA	NA|68aa|down_8|NZ_AP017295.1_4333476_4333680_-	pfam06150, ChaB, ChaB	NA|270aa|down_9|NZ_AP017295.1_4334030_4334840_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	16	4427741-4429032	8,15,7	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GTTTCCATTAACTAATTCCCCTCACGGGGACTGAAAC,GTTTCAGTCCCCGTGAGGGGAATTAGTTAATGGAAAC,GTTTCAGTCCCCGTGAGGGGAATTAGTTAATGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	17,17,17	17	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|70aa|up_5|NZ_AP017295.1_4422912_4423122_-,NA|113aa|up_3|NZ_AP017295.1_4424827_4425166_-,NA|263aa|down_1|NZ_AP017295.1_4430295_4431084_+	NA|492aa|up_9|NZ_AP017295.1_4417977_4419453_-	TIGR04095, type_III_restriction_protein_res_subunit, DNA phosphorothioation system restriction enzyme	NA|202aa|up_8|NZ_AP017295.1_4419587_4420193_-	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|307aa|up_7|NZ_AP017295.1_4420339_4421260_-	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|272aa|up_6|NZ_AP017295.1_4422075_4422891_-	pfam08894, DUF1838, Protein of unknown function (DUF1838)	NA|70aa|up_5|NZ_AP017295.1_4422912_4423122_-	NA	NA|404aa|up_4|NZ_AP017295.1_4423252_4424464_-	COG3180, AbrB, Putative ammonia monooxygenase [General function prediction only]	NA|113aa|up_3|NZ_AP017295.1_4424827_4425166_-	NA	NA|128aa|up_2|NZ_AP017295.1_4425845_4426229_-	cd00781, ketosteroid_isomerase, ketosteroid isomerase: Many biological reactions proceed by enzymatic cleavage of a C-H bond adjacent to carbonyl or a carboxyl group, leading to an enol or a enolate intermediate that is subsequently re-protonated at the same or an adjacent carbon	NA|123aa|up_1|NZ_AP017295.1_4426250_4426619_-	TIGR02058, lin0512_fam, conserved hypothetical protein	NA|297aa|up_0|NZ_AP017295.1_4426724_4427615_-	cd05155, APH_ChoK_like_1, Uncharacterized bacterial proteins with similarity to Aminoglycoside 3'-phosphotransferase and Choline kinase	NA|238aa|down_0|NZ_AP017295.1_4429433_4430147_-	TIGR04283, glycosyl_transferase_family_2, transferase 2, rSAM/selenodomain-associated	NA|263aa|down_1|NZ_AP017295.1_4430295_4431084_+	NA	NA|199aa|down_2|NZ_AP017295.1_4431095_4431692_-	pfam05685, Uma2, Putative restriction endonuclease	NA|412aa|down_3|NZ_AP017295.1_4431862_4433098_+	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|374aa|down_4|NZ_AP017295.1_4433473_4434595_-	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|213aa|down_5|NZ_AP017295.1_4434874_4435513_+	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|480aa|down_6|NZ_AP017295.1_4435564_4437004_+	TIGR03648, Na_symport_lg, probable sodium:solute symporter, VC_2705 subfamily	NA|282aa|down_7|NZ_AP017295.1_4437058_4437904_-	sd00006, TPR, Tetratricopeptide repeat	NA|304aa|down_8|NZ_AP017295.1_4438348_4439260_-	COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and biogenesis]	NA|635aa|down_9|NZ_AP017295.1_4439482_4441387_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	17	4456515-4456602	16	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GAAATGAGAAAACATTTTAGCTAAAT	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|90aa|up_9|NZ_AP017295.1_4441700_4441970_-,NA|232aa|up_8|NZ_AP017295.1_4442402_4443098_-,NA|211aa|up_7|NZ_AP017295.1_4443429_4444062_-,NA|66aa|down_1|NZ_AP017295.1_4458525_4458723_+,NA|111aa|down_5|NZ_AP017295.1_4464087_4464420_-	NA|90aa|up_9|NZ_AP017295.1_4441700_4441970_-	NA	NA|232aa|up_8|NZ_AP017295.1_4442402_4443098_-	NA	NA|211aa|up_7|NZ_AP017295.1_4443429_4444062_-	NA	NA|249aa|up_6|NZ_AP017295.1_4444379_4445126_-	sd00006, TPR, Tetratricopeptide repeat	NA|864aa|up_5|NZ_AP017295.1_4445731_4448323_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|135aa|up_4|NZ_AP017295.1_4448984_4449389_+	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|334aa|up_3|NZ_AP017295.1_4449643_4450645_+	cd06309, PBP1_galactofuranose_YtfQ-like, periplasmic binding domain of ABC-type galactofuranose YtfQ-like transport systems	NA|506aa|up_2|NZ_AP017295.1_4450724_4452242_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|614aa|up_1|NZ_AP017295.1_4452344_4454186_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|705aa|up_0|NZ_AP017295.1_4454285_4456400_+	COG0145, HyuA, N-methylhydantoinase A/acetone carboxylase, beta subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|524aa|down_0|NZ_AP017295.1_4456849_4458421_+	COG0146, HyuB, N-methylhydantoinase B/acetone carboxylase, alpha subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|66aa|down_1|NZ_AP017295.1_4458525_4458723_+	NA	NA|677aa|down_2|NZ_AP017295.1_4458960_4460991_-	pfam13646, HEAT_2, HEAT repeats	NA|316aa|down_3|NZ_AP017295.1_4462143_4463091_+	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|327aa|down_4|NZ_AP017295.1_4463097_4464078_+	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|111aa|down_5|NZ_AP017295.1_4464087_4464420_-	NA	NA|234aa|down_6|NZ_AP017295.1_4464590_4465292_-	cd00180, PKc, Catalytic domain of Protein Kinases	NA|289aa|down_7|NZ_AP017295.1_4465462_4466329_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|191aa|down_8|NZ_AP017295.1_4466332_4466905_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|583aa|down_9|NZ_AP017295.1_4466947_4468696_-	cd08513, PBP2_thermophilic_Hb8_like, The substrate-binding component of ABC-type thermophilic oligopeptide-binding protein Hb8-like import systems, contains the type 2 periplasmic binding fold
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	18	4541662-4542287	17,8,9	CRISPRCasFinder,CRT,PILER-CR	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	GTTTCAGTCCCCGTGAGGGGAAGTTGTGAGTTGAAAC,GTTTCAGTCCCCGTGAGGGGAAGTTGTGAGTTGAAAC,GTTTCAACTCACAACTTCCCCTCACGGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	8,8,8	8	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|184aa|up_7|NZ_AP017295.1_4526819_4527371_-,NA|67aa|down_5|NZ_AP017295.1_4548007_4548208_-,NA|312aa|down_6|NZ_AP017295.1_4548280_4549216_-	NA|298aa|up_9|NZ_AP017295.1_4525345_4526239_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|152aa|up_8|NZ_AP017295.1_4526231_4526687_-	pfam09980, DUF2214, Predicted membrane protein (DUF2214)	NA|184aa|up_7|NZ_AP017295.1_4526819_4527371_-	NA	NA|380aa|up_6|NZ_AP017295.1_4527598_4528738_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|308aa|up_5|NZ_AP017295.1_4528803_4529727_-	cd05152, MPH2', Macrolide 2'-Phosphotransferase	NA|359aa|up_4|NZ_AP017295.1_4530124_4531201_-	TIGR04070, photo_TT_lyase, spore photoproduct lyase	NA|95aa|up_3|NZ_AP017295.1_4531422_4531707_-	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|139aa|up_2|NZ_AP017295.1_4531734_4532151_-	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|386aa|up_1|NZ_AP017295.1_4532369_4533527_+	cd00997, PBP2_GluR0, Bacterial GluR0 ligand-binding domain; the type 2 periplasmic binding protein fold	NA|2604aa|up_0|NZ_AP017295.1_4533618_4541430_-	cd08602, GDPD_ScGlpQ1_like, Glycerophosphodiester phosphodiesterase domain of Streptomycin coelicolor (GlpQ1) and similar proteins	NA|298aa|down_0|NZ_AP017295.1_4542620_4543514_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|411aa|down_1|NZ_AP017295.1_4543522_4544755_+	pfam01937, DUF89, Protein of unknown function DUF89	NA|231aa|down_2|NZ_AP017295.1_4544855_4545548_-	COG5031, COQ4, Uncharacterized protein involved in ubiquinone biosynthesis [Coenzyme metabolism]	NA|176aa|down_3|NZ_AP017295.1_4545757_4546285_-	pfam10706, Aminoglyc_resit, Aminoglycoside-2''-adenylyltransferase	NA|396aa|down_4|NZ_AP017295.1_4546431_4547619_-	cd17333, MFS_FucP_MFSD4_like, Bacterial fucose permease, eukaryotic Major facilitator superfamily domain-containing protein 4, and similar proteins	NA|67aa|down_5|NZ_AP017295.1_4548007_4548208_-	NA	NA|312aa|down_6|NZ_AP017295.1_4548280_4549216_-	NA	NA|239aa|down_7|NZ_AP017295.1_4549297_4550014_+	pfam03473, MOSC, MOSC domain	NA|234aa|down_8|NZ_AP017295.1_4550104_4550806_+	cd05324, carb_red_PTCR-like_SDR_c, Porcine testicular carbonyl reductase (PTCR)-like, classical (c) SDRs	NA|36aa|down_9|NZ_AP017295.1_4550802_4550910_-	smart00320, WD40, WD40 repeats
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	19	5091632-5091960	10,18,9	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Type V-U5	CTTTCAACCCACCCCTAGCCGGGATGGTCGTTGAAAC,CTTTCAACCCACCCCTAGCCGGGATGGTCGTTGAAAC,CTTTCAACCCACCCCTAGCCGGGATG	37,37,26	0	0	NA	NA	V-U5:V-U5:V-U5	4,4,4	4	TypeV-U5	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|257aa|up_7|NZ_AP017295.1_5081811_5082582_+,NA|243aa|up_4|NZ_AP017295.1_5085673_5086402_+,NA|54aa|down_1|NZ_AP017295.1_5094525_5094687_+,NA|105aa|down_3|NZ_AP017295.1_5096751_5097066_-,NA|106aa|down_4|NZ_AP017295.1_5097223_5097541_-,NA|349aa|down_6|NZ_AP017295.1_5100186_5101233_+,NA|364aa|down_9|NZ_AP017295.1_5104637_5105729_+	NA|382aa|up_9|NZ_AP017295.1_5078875_5080021_-	COG2385, SpoIID, Sporulation protein and related proteins [Cell division and chromosome partitioning]	NA|393aa|up_8|NZ_AP017295.1_5080313_5081492_+	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|257aa|up_7|NZ_AP017295.1_5081811_5082582_+	NA	NA|126aa|up_6|NZ_AP017295.1_5082907_5083285_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|497aa|up_5|NZ_AP017295.1_5083401_5084892_+	PLN02518, PLN02518, pheophorbide a oxygenase	NA|243aa|up_4|NZ_AP017295.1_5085673_5086402_+	NA	NA|531aa|up_3|NZ_AP017295.1_5086464_5088057_+	pfam13738, Pyr_redox_3, Pyridine nucleotide-disulphide oxidoreductase	NA|394aa|up_2|NZ_AP017295.1_5088110_5089292_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|310aa|up_1|NZ_AP017295.1_5089567_5090497_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|243aa|up_0|NZ_AP017295.1_5090493_5091222_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	c2c5_V-U5|636aa|down_0|NZ_AP017295.1_5092455_5094363_-	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|54aa|down_1|NZ_AP017295.1_5094525_5094687_+	NA	NA|622aa|down_2|NZ_AP017295.1_5094758_5096624_-	COG3472, COG3472, Uncharacterized conserved protein [Function unknown]	NA|105aa|down_3|NZ_AP017295.1_5096751_5097066_-	NA	NA|106aa|down_4|NZ_AP017295.1_5097223_5097541_-	NA	NA|797aa|down_5|NZ_AP017295.1_5097748_5100139_+	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|349aa|down_6|NZ_AP017295.1_5100186_5101233_+	NA	NA|599aa|down_7|NZ_AP017295.1_5101292_5103089_+	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|441aa|down_8|NZ_AP017295.1_5103117_5104440_+	cd17521, RMtype1_S_Sau13435ORF2165P_TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Staphylococcus aureus NCTC 13435 S subunit (S	NA|364aa|down_9|NZ_AP017295.1_5104637_5105729_+	NA
GCF_001548375.1_ASM154837v1	NZ_AP017295	Nostoc sp. NIES-3756	20	5456474-5456606	19	CRISPRCasFinder	no		csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh	Orphan	ATACTTGCTGTCGTGGTGGTTCTACTTTCTTCTCAGT	37	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|81aa|up_6|NZ_AP017295.1_5445675_5445918_+,NA|106aa|down_1|NZ_AP017295.1_5460631_5460949_-,NA|114aa|down_5|NZ_AP017295.1_5463924_5464266_+	NA|839aa|up_9|NZ_AP017295.1_5438831_5441348_+	COG4796, HofQ, Type II secretory pathway, component HofQ [Intracellular trafficking and secretion]	NA|148aa|up_8|NZ_AP017295.1_5441500_5441944_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|1015aa|up_7|NZ_AP017295.1_5442437_5445482_+	PRK02509, PRK02509, hypothetical protein; Provisional	NA|81aa|up_6|NZ_AP017295.1_5445675_5445918_+	NA	NA|929aa|up_5|NZ_AP017295.1_5446124_5448911_+	pfam12770, CHAT, CHAT domain	NA|100aa|up_4|NZ_AP017295.1_5448993_5449293_-	pfam11691, DUF3288, Protein of unknown function (DUF3288)	NA|816aa|up_3|NZ_AP017295.1_5449527_5451975_+	CHL00095, clpC, Clp protease ATP binding subunit	NA|208aa|up_2|NZ_AP017295.1_5452153_5452777_-	pfam05685, Uma2, Putative restriction endonuclease	NA|575aa|up_1|NZ_AP017295.1_5453035_5454760_+	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|266aa|up_0|NZ_AP017295.1_5455124_5455922_+	TIGR03442, TIGR03442, ergothioneine biosynthesis protein EgtC	NA|694aa|down_0|NZ_AP017295.1_5458533_5460615_-	pfam00350, Dynamin_N, Dynamin family	NA|106aa|down_1|NZ_AP017295.1_5460631_5460949_-	NA	NA|325aa|down_2|NZ_AP017295.1_5461480_5462455_+	PRK10717, PRK10717, cysteine synthase A; Provisional	NA|110aa|down_3|NZ_AP017295.1_5462524_5462854_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|275aa|down_4|NZ_AP017295.1_5462944_5463769_+	pfam17265, DUF5331, Family of unknown function (DUF5331)	NA|114aa|down_5|NZ_AP017295.1_5463924_5464266_+	NA	NA|117aa|down_6|NZ_AP017295.1_5464387_5464738_+	COG4980, GvpP, Gas vesicle protein [General function prediction only]	NA|177aa|down_7|NZ_AP017295.1_5464843_5465374_+	pfam06103, DUF948, Bacterial protein of unknown function (DUF948)	NA|367aa|down_8|NZ_AP017295.1_5465501_5466602_-	pfam01070, FMN_dh, FMN-dependent dehydrogenase	NA|285aa|down_9|NZ_AP017295.1_5466991_5467846_+	cd03265, ABC_DrrA, Daunorubicin/doxorubicin resistance ATP-binding protein
GCF_001548375.1_ASM154837v1	NZ_AP017296	Nostoc sp. NIES-3756 plasmid pNOS3756_1, complete sequence	1	54441-54700	1	CRISPRCasFinder	no	c2c9_V-U4	c2c9_V-U4,cas14k	Type V-U4	CAATTGGGCGTTGACCATCTATGTCAATTGTT	32	0	0	NA	NA	NA	4	4	TypeV-U4	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|184aa|up_8|NZ_AP017296.1_44012_44564_-,NA|76aa|up_7|NZ_AP017296.1_45323_45551_+,NA|146aa|up_6|NZ_AP017296.1_45697_46135_-,NA|203aa|up_4|NZ_AP017296.1_46974_47583_+,NA|79aa|up_1|NZ_AP017296.1_53443_53680_-,NA|79aa|down_1|NZ_AP017296.1_55945_56182_+,NA|71aa|down_2|NZ_AP017296.1_56234_56447_-,NA|52aa|down_5|NZ_AP017296.1_57823_57979_-,NA|143aa|down_8|NZ_AP017296.1_61204_61633_+,NA|76aa|down_9|NZ_AP017296.1_61656_61884_+	c2c9_V-U4|390aa|up_9|NZ_AP017296.1_41291_42461_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|184aa|up_8|NZ_AP017296.1_44012_44564_-	NA	NA|76aa|up_7|NZ_AP017296.1_45323_45551_+	NA	NA|146aa|up_6|NZ_AP017296.1_45697_46135_-	NA	NA|169aa|up_5|NZ_AP017296.1_46329_46836_+	pfam09351, DUF1993, Domain of unknown function (DUF1993)	NA|203aa|up_4|NZ_AP017296.1_46974_47583_+	NA	NA|652aa|up_3|NZ_AP017296.1_47569_49525_-	cd11324, AmyAc_Amylosucrase, Alpha amylase catalytic domain found in Amylosucrase	NA|1100aa|up_2|NZ_AP017296.1_50175_53475_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|79aa|up_1|NZ_AP017296.1_53443_53680_-	NA	NA|102aa|up_0|NZ_AP017296.1_53810_54116_-	cd02230, cupin_HP0902-like, Helicobacter pylori HP0902 and related proteins, cupin domain	NA|232aa|down_0|NZ_AP017296.1_55057_55753_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|79aa|down_1|NZ_AP017296.1_55945_56182_+	NA	NA|71aa|down_2|NZ_AP017296.1_56234_56447_-	NA	NA|221aa|down_3|NZ_AP017296.1_56469_57132_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|201aa|down_4|NZ_AP017296.1_57136_57739_-	pfam01066, CDP-OH_P_transf, CDP-alcohol phosphatidyltransferase	NA|52aa|down_5|NZ_AP017296.1_57823_57979_-	NA	NA|154aa|down_6|NZ_AP017296.1_58021_58483_+	COG3837, COG3837, Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]	NA|489aa|down_7|NZ_AP017296.1_59370_60837_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|143aa|down_8|NZ_AP017296.1_61204_61633_+	NA	NA|76aa|down_9|NZ_AP017296.1_61656_61884_+	NA
GCF_001548375.1_ASM154837v1	NZ_AP017296	Nostoc sp. NIES-3756 plasmid pNOS3756_1, complete sequence	2	103855-104418	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no		c2c9_V-U4,cas14k	Orphan	GTTTCCATAACCCACATCCCCTAACGGGGACTGAAAC,GTTTCAGTCCCCGTTAGGGGATGTGGGTTATGGAAAC,GTTTCAGTCCCCGTTAGGGGATGTGGGTTATGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	7,7,7	7	Orphan	csa3,cas3,DinG,2OG_CAS,cas6,Cas9_archaeal,Cas14c_CAS-V-F,cas8b3,cas7,cas5,cas2,cas1,cas4,csc1gr5,csc2gr7,WYL,c2c5_V-U5,DEDDh,c2c9_V-U4,cas14k	NA|169aa|up_8|NZ_AP017296.1_99127_99634_-,NA|183aa|up_7|NZ_AP017296.1_99717_100266_-,NA|74aa|up_6|NZ_AP017296.1_100389_100611_-,NA|85aa|up_3|NZ_AP017296.1_101937_102192_-,NA|60aa|up_2|NZ_AP017296.1_102357_102537_+,NA|53aa|down_0|NZ_AP017296.1_104907_105066_+,NA|68aa|down_1|NZ_AP017296.1_105079_105283_-,NA|66aa|down_5|NZ_AP017296.1_109243_109441_+,NA|312aa|down_6|NZ_AP017296.1_110049_110985_+,NA|104aa|down_7|NZ_AP017296.1_111059_111371_+,NA|120aa|down_8|NZ_AP017296.1_111400_111760_+,NA|65aa|down_9|NZ_AP017296.1_111785_111980_+	NA|187aa|up_9|NZ_AP017296.1_98509_99070_-	pfam04147, Nop14, Nop14-like family	NA|169aa|up_8|NZ_AP017296.1_99127_99634_-	NA	NA|183aa|up_7|NZ_AP017296.1_99717_100266_-	NA	NA|74aa|up_6|NZ_AP017296.1_100389_100611_-	NA	NA|196aa|up_5|NZ_AP017296.1_100793_101381_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|136aa|up_4|NZ_AP017296.1_101533_101941_-	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|85aa|up_3|NZ_AP017296.1_101937_102192_-	NA	NA|60aa|up_2|NZ_AP017296.1_102357_102537_+	NA	NA|109aa|up_1|NZ_AP017296.1_102641_102968_+	pfam04355, SmpA_OmlA, SmpA / OmlA family	NA|176aa|up_0|NZ_AP017296.1_103104_103632_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|53aa|down_0|NZ_AP017296.1_104907_105066_+	NA	NA|68aa|down_1|NZ_AP017296.1_105079_105283_-	NA	NA|224aa|down_2|NZ_AP017296.1_105300_105972_-	COG5031, COQ4, Uncharacterized protein involved in ubiquinone biosynthesis [Coenzyme metabolism]	NA|211aa|down_3|NZ_AP017296.1_106070_106703_+	pfam17918, TetR_C_15, Tetracyclin repressor-like, C-terminal domain	NA|60aa|down_4|NZ_AP017296.1_107696_107876_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|66aa|down_5|NZ_AP017296.1_109243_109441_+	NA	NA|312aa|down_6|NZ_AP017296.1_110049_110985_+	NA	NA|104aa|down_7|NZ_AP017296.1_111059_111371_+	NA	NA|120aa|down_8|NZ_AP017296.1_111400_111760_+	NA	NA|65aa|down_9|NZ_AP017296.1_111785_111980_+	NA
