assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	1	169080-169184	1	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	ATTAGATTAGTAGTCAACTTCTGTGCTAAAGCACTACTAC	40	1	14	169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144|169120-169144	NZ_CP019636.1_768978-768954|NZ_CP019636.1_2760827-2760803|NZ_CP019636.1_4220279-4220303|NZ_CP019636.1_5618983-5619007|NZ_CP019636.1_6351180-6351204|NZ_CP019636.1_7778813-7778837|NZ_CP019636.1_4056427-4056451|NZ_CP019636.1_4375714-4375690|NZ_CP019636.1_6063216-6063192|NZ_CP019636.1_6257256-6257232|NZ_CP019636.1_6924152-6924128|NZ_CP019636.1_930955-930979|NZ_CP019636.1_1405237-1405261|NZ_CP019636.1_4220274-4220250	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|151aa|up_7|NZ_CP019636.1_161556_162009_+,NA|162aa|up_6|NZ_CP019636.1_162005_162491_+,NA|62aa|up_5|NZ_CP019636.1_162625_162811_-,NA|118aa|down_0|NZ_CP019636.1_169372_169726_-,NA|247aa|down_6|NZ_CP019636.1_174713_175454_-,NA|197aa|down_8|NZ_CP019636.1_177704_178295_-	NA|129aa|up_9|NZ_CP019636.1_160223_160610_+	cd08352, VOC_Bs_YwkD_like, vicinal oxygen chelate (VOC) family protein  Bacillus subtilis YwkD and similar proteins	NA|214aa|up_8|NZ_CP019636.1_160888_161530_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|151aa|up_7|NZ_CP019636.1_161556_162009_+	NA	NA|162aa|up_6|NZ_CP019636.1_162005_162491_+	NA	NA|62aa|up_5|NZ_CP019636.1_162625_162811_-	NA	NA|359aa|up_4|NZ_CP019636.1_162816_163893_-	COG4956, COG4956, Integral membrane protein (PIN domain superfamily) [General function prediction only]	NA|395aa|up_3|NZ_CP019636.1_164146_165331_+	PRK07379, PRK07379, coproporphyrinogen III oxidase; Provisional	NA|355aa|up_2|NZ_CP019636.1_165463_166528_-	pfam03016, Exostosin, Exostosin family	NA|396aa|up_1|NZ_CP019636.1_166651_167839_-	PRK00770, PRK00770, deoxyhypusine synthase	NA|287aa|up_0|NZ_CP019636.1_167980_168841_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|118aa|down_0|NZ_CP019636.1_169372_169726_-	NA	NA|221aa|down_1|NZ_CP019636.1_169747_170410_+	PRK05986, PRK05986, cob(I)yrinic acid a,c-diamide adenosyltransferase	NA|130aa|down_2|NZ_CP019636.1_170918_171308_-	pfam07843, DUF1634, Protein of unknown function (DUF1634)	NA|278aa|down_3|NZ_CP019636.1_171309_172143_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|310aa|down_4|NZ_CP019636.1_172456_173386_-	cd08420, PBP2_CysL_like, C-terminal substrate binding domain of LysR-type transcriptional regulator CysL, which activates the transcription of the cysJI operon encoding sulfite reductase, contains the type 2 periplasmic binding fold	NA|334aa|down_5|NZ_CP019636.1_173525_174527_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|247aa|down_6|NZ_CP019636.1_174713_175454_-	NA	NA|328aa|down_7|NZ_CP019636.1_176095_177079_-	PRK05949, PRK05949, RNA polymerase sigma factor; Validated	NA|197aa|down_8|NZ_CP019636.1_177704_178295_-	NA	NA|577aa|down_9|NZ_CP019636.1_178406_180137_-	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	2	1030714-1030828	2	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	AAAATACGTAGGATGTGTTAGCGCTAGCGTAACGCATCATTC	42	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA|140aa|down_0|NZ_CP019636.1_1030906_1031326_-	NA|367aa|up_9|NZ_CP019636.1_1014089_1015190_-	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|344aa|up_8|NZ_CP019636.1_1015270_1016302_-	PRK07565, PRK07565, dihydroorotate dehydrogenase-like protein	NA|1192aa|up_7|NZ_CP019636.1_1016319_1019895_-	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|807aa|up_6|NZ_CP019636.1_1020088_1022509_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|1102aa|up_5|NZ_CP019636.1_1023251_1026557_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|86aa|up_4|NZ_CP019636.1_1027414_1027672_-	pfam01455, HupF_HypC, HupF/HypC family	NA|114aa|up_3|NZ_CP019636.1_1027712_1028054_-	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|164aa|up_2|NZ_CP019636.1_1028035_1028527_-	cd06066, H2MP_NAD-link-bidir, Endopeptidases that belong to the bidirectional NAD-linked hydrogenase group	NA|153aa|up_1|NZ_CP019636.1_1028706_1029165_-	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|483aa|up_0|NZ_CP019636.1_1029232_1030681_-	COG3259, FrhA, Coenzyme F420-reducing hydrogenase, alpha subunit [Energy production and conversion]	NA|140aa|down_0|NZ_CP019636.1_1030906_1031326_-	NA	NA|182aa|down_1|NZ_CP019636.1_1031397_1031943_-	COG1941, FrhG, Coenzyme F420-reducing hydrogenase, gamma subunit [Energy production and conversion]	NA|175aa|down_2|NZ_CP019636.1_1032005_1032530_-	pfam11320, DUF3122, Protein of unknown function (DUF3122)	NA|239aa|down_3|NZ_CP019636.1_1032566_1033283_-	PRK07569, PRK07569, bidirectional hydrogenase complex protein HoxU; Validated	NA|535aa|down_4|NZ_CP019636.1_1033300_1034905_-	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|180aa|down_5|NZ_CP019636.1_1034988_1035528_-	PRK07571, PRK07571, bidirectional hydrogenase complex protein HoxE; Reviewed	NA|839aa|down_6|NZ_CP019636.1_1035844_1038361_-	cd02077, P-type_ATPase_Mg, magnesium transporting ATPase (MgtA), similar to Escherichia coli MgtA and Salmonella typhimurium MgtA	NA|871aa|down_7|NZ_CP019636.1_1038412_1041025_-	cd07538, P-type_ATPase, uncharacterized subfamily of P-type ATPase transporters	NA|931aa|down_8|NZ_CP019636.1_1041131_1043924_-	COG1042, COG1042, Acyl-CoA synthetase (NDP forming) [Energy production and conversion]	NA|816aa|down_9|NZ_CP019636.1_1044068_1046516_-	PRK06464, PRK06464, phosphoenolpyruvate synthase; Validated
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	3	1455279-1455367	3	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CTGTTACTTTCCACGGTTAGGCGCTGG	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|68aa|up_8|NZ_CP019636.1_1445364_1445568_+,NA|65aa|up_0|NZ_CP019636.1_1455042_1455237_-,NA|68aa|down_1|NZ_CP019636.1_1456617_1456821_+,NA|145aa|down_3|NZ_CP019636.1_1458080_1458515_-	NA|679aa|up_9|NZ_CP019636.1_1443092_1445129_+	PRK14559, PRK14559, serine/threonine phosphatase	NA|68aa|up_8|NZ_CP019636.1_1445364_1445568_+	NA	NA|405aa|up_7|NZ_CP019636.1_1445907_1447122_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|74aa|up_6|NZ_CP019636.1_1447219_1447441_-	pfam14217, DUF4327, Domain of unknown function (DUF4327)	NA|133aa|up_5|NZ_CP019636.1_1448454_1448853_+	pfam14559, TPR_19, Tetratricopeptide repeat	NA|352aa|up_4|NZ_CP019636.1_1449009_1450065_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|740aa|up_3|NZ_CP019636.1_1450080_1452300_-	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|184aa|up_2|NZ_CP019636.1_1452512_1453064_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|458aa|up_1|NZ_CP019636.1_1453619_1454993_+	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|65aa|up_0|NZ_CP019636.1_1455042_1455237_-	NA	NA|297aa|down_0|NZ_CP019636.1_1455566_1456457_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|68aa|down_1|NZ_CP019636.1_1456617_1456821_+	NA	NA|297aa|down_2|NZ_CP019636.1_1456972_1457863_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|145aa|down_3|NZ_CP019636.1_1458080_1458515_-	NA	NA|546aa|down_4|NZ_CP019636.1_1458891_1460529_-	PRK05380, pyrG, CTP synthetase; Validated	NA|597aa|down_5|NZ_CP019636.1_1460707_1462498_+	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|609aa|down_6|NZ_CP019636.1_1462909_1464736_+	cd03603, CLECT_VCBS, A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins	NA|270aa|down_7|NZ_CP019636.1_1465542_1466352_+	pfam03808, Glyco_tran_WecB, Glycosyl transferase WecB/TagA/CpsF family	NA|176aa|down_8|NZ_CP019636.1_1466415_1466943_-	pfam09858, DUF2085, Predicted membrane protein (DUF2085)	NA|704aa|down_9|NZ_CP019636.1_1467055_1469167_-	COG1915, COG1915, Uncharacterized conserved protein [Function unknown]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	4	2050782-2050896	4	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CCTAGCAGCGAGTTGCTACTCAGGGGAGTCAGTTAT	36	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|147aa|up_3|NZ_CP019636.1_2046346_2046787_-,NA|78aa|up_1|NZ_CP019636.1_2048866_2049100_+,NA|109aa|up_0|NZ_CP019636.1_2049086_2049413_-,NA	NA|164aa|up_9|NZ_CP019636.1_2040096_2040588_-	pfam04972, BON, BON domain	NA|224aa|up_8|NZ_CP019636.1_2040902_2041574_-	pfam11181, YflT, Heat induced stress protein YflT	NA|514aa|up_7|NZ_CP019636.1_2041950_2043492_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|113aa|up_6|NZ_CP019636.1_2043822_2044161_-	pfam00543, P-II, Nitrogen regulatory protein P-II	NA|217aa|up_5|NZ_CP019636.1_2044450_2045101_-	TIGR04211, hypothetical_protein, SH3 domain protein	NA|274aa|up_4|NZ_CP019636.1_2045378_2046200_-	PRK06427, PRK06427, bifunctional hydroxy-methylpyrimidine kinase/ hydroxy-phosphomethylpyrimidine kinase; Reviewed	NA|147aa|up_3|NZ_CP019636.1_2046346_2046787_-	NA	NA|619aa|up_2|NZ_CP019636.1_2046907_2048764_+	cd07484, Peptidases_S8_Thermitase_like, Peptidase S8 family domain in Thermitase-like proteins	NA|78aa|up_1|NZ_CP019636.1_2048866_2049100_+	NA	NA|109aa|up_0|NZ_CP019636.1_2049086_2049413_-	NA	NA|866aa|down_0|NZ_CP019636.1_2052095_2054693_+	pfam12770, CHAT, CHAT domain	NA|396aa|down_1|NZ_CP019636.1_2054801_2055989_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|504aa|down_2|NZ_CP019636.1_2056357_2057869_+	cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2, including 10-formyltetrahydrofolate dehydrogenase, NAD+-dependent retinal dehydrogenase 1 and related proteins	NA|476aa|down_3|NZ_CP019636.1_2057990_2059418_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|244aa|down_4|NZ_CP019636.1_2059789_2060521_+	pfam07444, Ycf66_N, Ycf66 protein N-terminus	NA|93aa|down_5|NZ_CP019636.1_2060590_2060869_-	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|89aa|down_6|NZ_CP019636.1_2060865_2061132_-	COG3905, COG3905, Predicted transcriptional regulator [Transcription]	NA|958aa|down_7|NZ_CP019636.1_2061465_2064339_+	PLN02843, PLN02843, isoleucyl-tRNA synthetase	NA|313aa|down_8|NZ_CP019636.1_2064397_2065336_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|116aa|down_9|NZ_CP019636.1_2065719_2066067_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	5	2573883-2574015	5	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	AATGCAATAGTCTGCATTGCTATTGCATCAAAATGTGAAAATGTAAGAT	49	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|108aa|up_6|NZ_CP019636.1_2565719_2566043_+,NA|120aa|up_5|NZ_CP019636.1_2566094_2566454_-,NA|94aa|up_0|NZ_CP019636.1_2572101_2572383_-,NA|90aa|down_3|NZ_CP019636.1_2578555_2578825_+	NA|103aa|up_9|NZ_CP019636.1_2563368_2563677_-	PRK09301, PRK09301, circadian clock protein KaiB; Provisional	NA|181aa|up_8|NZ_CP019636.1_2564092_2564635_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|300aa|up_7|NZ_CP019636.1_2564749_2565649_+	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|108aa|up_6|NZ_CP019636.1_2565719_2566043_+	NA	NA|120aa|up_5|NZ_CP019636.1_2566094_2566454_-	NA	NA|330aa|up_4|NZ_CP019636.1_2567076_2568066_-	COG4638, HcaE, Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit [Inorganic ion transport and metabolism / General function prediction only]	NA|305aa|up_3|NZ_CP019636.1_2568338_2569253_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|449aa|up_2|NZ_CP019636.1_2569560_2570907_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|201aa|up_1|NZ_CP019636.1_2571478_2572081_+	TIGR00229, Sensor_protein_FixL, PAS domain S-box	NA|94aa|up_0|NZ_CP019636.1_2572101_2572383_-	NA	NA|197aa|down_0|NZ_CP019636.1_2574110_2574701_-	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|574aa|down_1|NZ_CP019636.1_2574745_2576467_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|575aa|down_2|NZ_CP019636.1_2576492_2578217_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|90aa|down_3|NZ_CP019636.1_2578555_2578825_+	NA	NA|307aa|down_4|NZ_CP019636.1_2579930_2580851_+	pfam11300, DUF3102, Protein of unknown function (DUF3102)	NA|400aa|down_5|NZ_CP019636.1_2581114_2582314_+	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|390aa|down_6|NZ_CP019636.1_2582335_2583505_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|213aa|down_7|NZ_CP019636.1_2584873_2585512_+	cd02215, cupin_QDO_N_C, quercetinase, N- and C-terminal cupin domains	NA|225aa|down_8|NZ_CP019636.1_2585737_2586412_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|365aa|down_9|NZ_CP019636.1_2586792_2587887_+	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	6	2745823-2745921	6	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	AACACTATGTTCAATTGATGAAGG	24	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|135aa|up_8|NZ_CP019636.1_2723561_2723966_-,NA|88aa|up_2|NZ_CP019636.1_2733141_2733405_+,NA|48aa|down_1|NZ_CP019636.1_2750124_2750268_-,NA|48aa|down_2|NZ_CP019636.1_2750678_2750822_+,NA|48aa|down_5|NZ_CP019636.1_2755340_2755484_-,NA|192aa|down_9|NZ_CP019636.1_2757982_2758558_-	NA|393aa|up_9|NZ_CP019636.1_2722318_2723497_+	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|135aa|up_8|NZ_CP019636.1_2723561_2723966_-	NA	NA|183aa|up_7|NZ_CP019636.1_2725368_2725917_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|572aa|up_6|NZ_CP019636.1_2726133_2727849_+	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|242aa|up_5|NZ_CP019636.1_2729360_2730086_+	pfam13924, Lipocalin_5, Lipocalin-like domain	NA|382aa|up_4|NZ_CP019636.1_2730349_2731495_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|539aa|up_3|NZ_CP019636.1_2731507_2733124_+	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)	NA|88aa|up_2|NZ_CP019636.1_2733141_2733405_+	NA	NA|2146aa|up_1|NZ_CP019636.1_2733607_2740045_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1872aa|up_0|NZ_CP019636.1_2740046_2745662_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|251aa|down_0|NZ_CP019636.1_2749261_2750014_+	COG3208, GrsT, Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|48aa|down_1|NZ_CP019636.1_2750124_2750268_-	NA	NA|48aa|down_2|NZ_CP019636.1_2750678_2750822_+	NA	NA|453aa|down_3|NZ_CP019636.1_2751156_2752515_+	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|132aa|down_4|NZ_CP019636.1_2752651_2753047_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|48aa|down_5|NZ_CP019636.1_2755340_2755484_-	NA	NA|100aa|down_6|NZ_CP019636.1_2755491_2755791_-	pfam13591, MerR_2, MerR HTH family regulatory protein	NA|332aa|down_7|NZ_CP019636.1_2755787_2756783_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|279aa|down_8|NZ_CP019636.1_2756858_2757695_-	COG3861, COG3861, Uncharacterized protein conserved in bacteria [Function unknown]	NA|192aa|down_9|NZ_CP019636.1_2757982_2758558_-	NA
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	7	3051164-3051256	7	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	AGATTGCTTTCTCAATTAGTCTGCTTGCTT	30	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|51aa|up_7|NZ_CP019636.1_3043792_3043945_+,NA|227aa|down_3|NZ_CP019636.1_3058481_3059162_+,NA|212aa|down_8|NZ_CP019636.1_3064390_3065026_+,NA|169aa|down_9|NZ_CP019636.1_3065661_3066168_+	NA|201aa|up_9|NZ_CP019636.1_3041258_3041861_+	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|525aa|up_8|NZ_CP019636.1_3041973_3043548_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|51aa|up_7|NZ_CP019636.1_3043792_3043945_+	NA	NA|223aa|up_6|NZ_CP019636.1_3043933_3044602_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|449aa|up_5|NZ_CP019636.1_3044549_3045896_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|481aa|up_4|NZ_CP019636.1_3046525_3047968_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|420aa|up_3|NZ_CP019636.1_3047982_3049242_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|226aa|up_2|NZ_CP019636.1_3049253_3049931_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|178aa|up_1|NZ_CP019636.1_3049969_3050503_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|97aa|up_0|NZ_CP019636.1_3050493_3050784_-	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|147aa|down_0|NZ_CP019636.1_3051403_3051844_+	cd02929, TMADH_HD_FMN, Trimethylamine dehydrogenase (TMADH) and histamine dehydrogenase (HD) FMN-binding domain	NA|184aa|down_1|NZ_CP019636.1_3053902_3054454_+	pfam05685, Uma2, Putative restriction endonuclease	NA|1009aa|down_2|NZ_CP019636.1_3054509_3057536_-	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|227aa|down_3|NZ_CP019636.1_3058481_3059162_+	NA	NA|417aa|down_4|NZ_CP019636.1_3059799_3061050_+	pfam00395, SLH, S-layer homology domain	NA|284aa|down_5|NZ_CP019636.1_3061652_3062504_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|228aa|down_6|NZ_CP019636.1_3062506_3063190_-	sd00006, TPR, Tetratricopeptide repeat	NA|109aa|down_7|NZ_CP019636.1_3063448_3063775_+	pfam01906, YbjQ_1, Putative heavy-metal-binding	NA|212aa|down_8|NZ_CP019636.1_3064390_3065026_+	NA	NA|169aa|down_9|NZ_CP019636.1_3065661_3066168_+	NA
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	8	3283752-3283832	8	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	ATGCGTTTGAGGTTAGCTTTTGCTTCT	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA|92aa|down_3|NZ_CP019636.1_3287446_3287722_-,NA|84aa|down_7|NZ_CP019636.1_3292231_3292483_+	NA|156aa|up_9|NZ_CP019636.1_3272774_3273242_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|898aa|up_8|NZ_CP019636.1_3273248_3275942_-	COG3300, COG3300, MHYT domain (predicted integral membrane sensor domain) [Signal transduction mechanisms]	NA|220aa|up_7|NZ_CP019636.1_3276346_3277006_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|261aa|up_6|NZ_CP019636.1_3277139_3277922_-	pfam00520, Ion_trans, Ion transport protein	NA|142aa|up_5|NZ_CP019636.1_3278069_3278495_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|221aa|up_4|NZ_CP019636.1_3278505_3279168_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|410aa|up_3|NZ_CP019636.1_3279169_3280399_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|154aa|up_2|NZ_CP019636.1_3280748_3281210_+	COG3678, CpxP, P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein [Intracellular trafficking and secretion / Cell motility and secretio / Signal transduction mechanisms / Inorganic ion transport and metabolism]	NA|232aa|up_1|NZ_CP019636.1_3281314_3282010_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|394aa|up_0|NZ_CP019636.1_3282038_3283220_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|207aa|down_0|NZ_CP019636.1_3285026_3285647_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|193aa|down_1|NZ_CP019636.1_3285807_3286386_-	pfam05685, Uma2, Putative restriction endonuclease	NA|138aa|down_2|NZ_CP019636.1_3287043_3287457_-	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|92aa|down_3|NZ_CP019636.1_3287446_3287722_-	NA	NA|517aa|down_4|NZ_CP019636.1_3288223_3289774_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|354aa|down_5|NZ_CP019636.1_3289924_3290986_-	COG0429, COG0429, Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]	NA|248aa|down_6|NZ_CP019636.1_3291143_3291887_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|84aa|down_7|NZ_CP019636.1_3292231_3292483_+	NA	NA|351aa|down_8|NZ_CP019636.1_3292545_3293598_-	PRK00892, lpxD, UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional	NA|451aa|down_9|NZ_CP019636.1_3293797_3295150_-	COG2821, MltA, Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	9	4115463-4118049	9,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas14k,cas6,2OG_CAS,csc1gr5	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Type I-D	GTTTCAATCCCTGATAGGGATTCGTAGTAATTGTAAC,GTTTCAATCCCTGATAGGGATTCGTAGTAATTGTAAC,GTTTCAATCCCTGATAGGGATTCGTAGTAATTGTAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	35,35,34	35	TypeV,TypeI-D	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|230aa|up_8|NZ_CP019636.1_4105011_4105701_-,NA|196aa|up_6|NZ_CP019636.1_4107096_4107684_-,NA|47aa|up_3|NZ_CP019636.1_4110546_4110687_-,NA|57aa|down_7|NZ_CP019636.1_4128176_4128347_-	cas14k|463aa|up_9|NZ_CP019636.1_4103500_4104889_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|230aa|up_8|NZ_CP019636.1_4105011_4105701_-	NA	NA|333aa|up_7|NZ_CP019636.1_4105954_4106953_+	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|196aa|up_6|NZ_CP019636.1_4107096_4107684_-	NA	NA|408aa|up_5|NZ_CP019636.1_4107688_4108912_-	COG2836, COG2836, Uncharacterized conserved protein [Function unknown]	NA|384aa|up_4|NZ_CP019636.1_4109278_4110430_+	cd05640, M28_like, M28 Zn-peptidase; uncharacterized subfamily	NA|47aa|up_3|NZ_CP019636.1_4110546_4110687_-	NA	NA|429aa|up_2|NZ_CP019636.1_4110745_4112032_-	cd07302, CHD, cyclase homology domain	NA|181aa|up_1|NZ_CP019636.1_4112049_4112592_-	TIGR01926, peroxid_rel, uncharacterized peroxidase-related enzyme	NA|437aa|up_0|NZ_CP019636.1_4112891_4114202_+	PRK10879, PRK10879, proline aminopeptidase P II; Provisional	cas6|282aa|down_0|NZ_CP019636.1_4118372_4119218_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	2OG_CAS|211aa|down_1|NZ_CP019636.1_4119195_4119828_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	csc1gr5|194aa|down_2|NZ_CP019636.1_4119988_4120570_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	NA|145aa|down_3|NZ_CP019636.1_4121434_4121869_-	pfam01934, DUF86, Protein of unknown function DUF86	NA|150aa|down_4|NZ_CP019636.1_4121870_4122320_-	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|71aa|down_5|NZ_CP019636.1_4122370_4122583_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|1336aa|down_6|NZ_CP019636.1_4124013_4128021_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|57aa|down_7|NZ_CP019636.1_4128176_4128347_-	NA	NA|466aa|down_8|NZ_CP019636.1_4128315_4129713_-	PRK06849, PRK06849, hypothetical protein; Provisional	NA|279aa|down_9|NZ_CP019636.1_4129923_4130760_-	pfam01596, Methyltransf_3, O-methyltransferase
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	10	4256269-4261825	10,2,2	CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas10d,csc2gr7,csc1gr5,cas3,cas6,cas4,cas1,cas2,c2c9_V-U4	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Type I-D	GTTGAAATTTCTCTTACTCCCTATTAGGGATTGAAAC,GTTGAAATTTCTCTTACTCCCTATTAGGGATTGAAAC,GTTTCAATCCCTAATAGGGAGTAAGAGAAATTTCAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	76,76,73	76	TypeI-D	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA	NA|373aa|up_9|NZ_CP019636.1_4243832_4244951_+	pfam11017, DUF2855, Protein of unknown function (DUF2855)	WYL|288aa|up_8|NZ_CP019636.1_4245310_4246174_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	cas10d|897aa|up_7|NZ_CP019636.1_4246455_4249146_+	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	csc2gr7|339aa|up_6|NZ_CP019636.1_4249159_4250176_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|254aa|up_5|NZ_CP019636.1_4250179_4250941_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	cas3|762aa|up_4|NZ_CP019636.1_4250933_4253219_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas6|290aa|up_3|NZ_CP019636.1_4253232_4254102_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_2|NZ_CP019636.1_4254094_4254688_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|335aa|up_1|NZ_CP019636.1_4254741_4255746_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|96aa|up_0|NZ_CP019636.1_4255768_4256056_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|402aa|down_0|NZ_CP019636.1_4262490_4263696_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|151aa|down_1|NZ_CP019636.1_4263978_4264431_+	cd09916, CpxP_like, CpxP component of the bacterial Cpx-two-component system and related proteins	c2c9_V-U4|438aa|down_2|NZ_CP019636.1_4264606_4265920_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|570aa|down_3|NZ_CP019636.1_4265973_4267683_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|718aa|down_4|NZ_CP019636.1_4267908_4270062_-	pfam03253, UT, Urea transporter	NA|461aa|down_5|NZ_CP019636.1_4270091_4271474_-	cd06841, PLPDE_III_MccE_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme MccE	NA|347aa|down_6|NZ_CP019636.1_4271477_4272518_-	PRK12767, PRK12767, carbamoyl phosphate synthase-like protein; Provisional	NA|165aa|down_7|NZ_CP019636.1_4272517_4273012_-	pfam05402, PqqD, Coenzyme PQQ synthesis protein D (PqqD)	NA|994aa|down_8|NZ_CP019636.1_4274011_4276993_-	TIGR03797, ABC_transporter_related, NHLM bacteriocin system ABC transporter, ATP-binding protein	NA|733aa|down_9|NZ_CP019636.1_4277042_4279241_-	TIGR03796, ABC_transporter_related, NHLM bacteriocin system ABC transporter, peptidase/ATP-binding protein
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	11	4554512-4557231	11,3,3	CRISPRCasFinder,CRT,PILER-CR	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	GTTTCAATCCCTAATAGGGATTCGTATGAATTGTAAC,GTTTCAATCCCTAATAGGGATTCGTATGAATTGTAAC,GTTTCAATCCCTAATAGGGATTCGTATGAATTGTAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	37,37,36	37	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|133aa|up_8|NZ_CP019636.1_4547379_4547778_-,NA|127aa|up_5|NZ_CP019636.1_4550126_4550507_+,NA|136aa|up_4|NZ_CP019636.1_4550538_4550946_+,NA|164aa|up_3|NZ_CP019636.1_4550958_4551450_+,NA	NA|198aa|up_9|NZ_CP019636.1_4546507_4547101_-	COG2095, MarC, Multiple antibiotic transporter [Intracellular trafficking and secretion]	NA|133aa|up_8|NZ_CP019636.1_4547379_4547778_-	NA	NA|226aa|up_7|NZ_CP019636.1_4547851_4548529_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|427aa|up_6|NZ_CP019636.1_4548601_4549882_+	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|127aa|up_5|NZ_CP019636.1_4550126_4550507_+	NA	NA|136aa|up_4|NZ_CP019636.1_4550538_4550946_+	NA	NA|164aa|up_3|NZ_CP019636.1_4550958_4551450_+	NA	NA|454aa|up_2|NZ_CP019636.1_4551518_4552880_+	cd03877, M28_like, M28 Zn-peptidase, many containing a protease-associated (PA) domain insert	NA|264aa|up_1|NZ_CP019636.1_4552955_4553747_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|210aa|up_0|NZ_CP019636.1_4553832_4554462_+	pfam05685, Uma2, Putative restriction endonuclease	NA|273aa|down_0|NZ_CP019636.1_4557956_4558775_+	cd07572, nit, Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases)	NA|648aa|down_1|NZ_CP019636.1_4559270_4561214_+	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|1067aa|down_2|NZ_CP019636.1_4561241_4564442_-	PRK11091, PRK11091, aerobic respiration control sensor protein ArcB; Provisional	NA|360aa|down_3|NZ_CP019636.1_4564869_4565949_-	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|279aa|down_4|NZ_CP019636.1_4566363_4567200_-	cd09025, Aldose_epim_Slr1438, Aldose 1-epimerase, similar to Synechocystis Slr1438	NA|346aa|down_5|NZ_CP019636.1_4567353_4568391_-	CHL00149, odpA, pyruvate dehydrogenase E1 component alpha subunit; Reviewed	NA|783aa|down_6|NZ_CP019636.1_4569122_4571471_+	pfam13355, DUF4101, Protein of unknown function (DUF4101)	NA|330aa|down_7|NZ_CP019636.1_4571697_4572687_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|414aa|down_8|NZ_CP019636.1_4573170_4574412_+	COG3330, COG3330, Uncharacterized protein conserved in bacteria [Function unknown]	NA|314aa|down_9|NZ_CP019636.1_4574733_4575675_+	pfam09992, NAGPA, Phosphodiester glycosidase
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	12	4772008-4772151	12	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CAACTCGTCAACGAGTATCGAAGACTCAAGCGCGAGTAT	39	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|130aa|up_8|NZ_CP019636.1_4760295_4760685_-,NA|191aa|up_3|NZ_CP019636.1_4766767_4767340_+,NA|213aa|up_0|NZ_CP019636.1_4771055_4771694_-,NA|466aa|down_1|NZ_CP019636.1_4775581_4776979_-	NA|410aa|up_9|NZ_CP019636.1_4758755_4759985_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|130aa|up_8|NZ_CP019636.1_4760295_4760685_-	NA	NA|233aa|up_7|NZ_CP019636.1_4761331_4762030_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|626aa|up_6|NZ_CP019636.1_4762102_4763980_-	CHL00176, ftsH, cell division protein; Validated	NA|182aa|up_5|NZ_CP019636.1_4764450_4764996_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|375aa|up_4|NZ_CP019636.1_4765059_4766184_-	COG0276, HemH, Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism]	NA|191aa|up_3|NZ_CP019636.1_4766767_4767340_+	NA	NA|423aa|up_2|NZ_CP019636.1_4767685_4768954_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|626aa|up_1|NZ_CP019636.1_4769097_4770975_+	PRK07418, PRK07418, acetolactate synthase large subunit	NA|213aa|up_0|NZ_CP019636.1_4771055_4771694_-	NA	NA|216aa|down_0|NZ_CP019636.1_4772191_4772839_-	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|466aa|down_1|NZ_CP019636.1_4775581_4776979_-	NA	NA|799aa|down_2|NZ_CP019636.1_4777569_4779966_-	PTZ00184, PTZ00184, calmodulin; Provisional	NA|529aa|down_3|NZ_CP019636.1_4780048_4781635_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|293aa|down_4|NZ_CP019636.1_4781922_4782801_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|229aa|down_5|NZ_CP019636.1_4783062_4783749_-	PRK11907, PRK11907, bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase	NA|203aa|down_6|NZ_CP019636.1_4784095_4784704_-	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|157aa|down_7|NZ_CP019636.1_4785131_4785602_+	PRK09831, PRK09831, GNAT family N-acetyltransferase	NA|370aa|down_8|NZ_CP019636.1_4786249_4787359_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|156aa|down_9|NZ_CP019636.1_4787483_4787951_-	COG3296, COG3296, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	13	4907538-4909027	4,13,4	PILER-CR,CRISPRCasFinder,CRT	no	cas6,csx1,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Type III-B,Type III-A,Type III-D,Type III-C	GTGACAAACTACCTCTTCCCCGCAAGGGGATTGAAAC,GTGACAAACTACCTCTTCCCCGCAAGGGGATTGAAAC,GTGACAAACTACCTCTTCCCCGCAAGGGGATTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	17,20,19	20	TypeIII-B,TypeIII-A,TypeIII-D,TypeIII-C	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|54aa|up_4|NZ_CP019636.1_4905232_4905394_+,NA|60aa|up_1|NZ_CP019636.1_4906203_4906383_-,NA|88aa|up_0|NZ_CP019636.1_4906410_4906674_-,csx21|215aa|down_2|NZ_CP019636.1_4910899_4911544_-,csx19|187aa|down_4|NZ_CP019636.1_4912583_4913144_-,PD-DExK|207aa|down_6|NZ_CP019636.1_4914184_4914805_-	NA|512aa|up_9|NZ_CP019636.1_4899066_4900602_-	TIGR01286, Nitrogenase_molybdenum-iron_protein_beta_chain, nitrogenase molybdenum-iron protein beta chain	NA|32aa|up_8|NZ_CP019636.1_4900786_4900882_-	TIGR01282, Nitrogenase_molybdenum-iron_protein_alpha_chain, nitrogenase molybdenum-iron protein alpha chain	NA|434aa|up_7|NZ_CP019636.1_4901172_4902474_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|367aa|up_6|NZ_CP019636.1_4902965_4904066_+	COG3491, PcbC, Isopenicillin N synthase and related dioxygenases [General function prediction only]	NA|180aa|up_5|NZ_CP019636.1_4904570_4905110_-	COG5502, COG5502, Uncharacterized conserved protein [Function unknown]	NA|54aa|up_4|NZ_CP019636.1_4905232_4905394_+	NA	NA|161aa|up_3|NZ_CP019636.1_4905447_4905930_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|85aa|up_2|NZ_CP019636.1_4905913_4906168_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|60aa|up_1|NZ_CP019636.1_4906203_4906383_-	NA	NA|88aa|up_0|NZ_CP019636.1_4906410_4906674_-	NA	cas6|372aa|down_0|NZ_CP019636.1_4909070_4910186_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csx1|169aa|down_1|NZ_CP019636.1_4910214_4910721_-	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx21|215aa|down_2|NZ_CP019636.1_4910899_4911544_-	NA	csm3gr7|347aa|down_3|NZ_CP019636.1_4911546_4912587_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|187aa|down_4|NZ_CP019636.1_4912583_4913144_-	NA	csm3gr7|338aa|down_5|NZ_CP019636.1_4913140_4914154_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	PD-DExK|207aa|down_6|NZ_CP019636.1_4914184_4914805_-	NA	csm3gr7|269aa|down_7|NZ_CP019636.1_4914806_4915613_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	NA|460aa|down_8|NZ_CP019636.1_4915587_4916967_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|357aa|down_9|NZ_CP019636.1_4917605_4918676_+	pfam01609, DDE_Tnp_1, Transposase DDE domain
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	14	4930025-4930515	5,5,14	PILER-CR,CRT,CRISPRCasFinder	no	csx1,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,csx3,WYL	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	 Type III-D?,Type III-D,Type III-A,Type III-C,Type III-B	GTCGTAGACTACCTTTTCCCCGCAAGGGGACGGAAAC,ACTACCTTTTCCCCGCAAGGGGACGGAAAC,TTTCCCCGCAAGGGGACGGAAAC	37,30,23	0	0	NA	NA	NA:NA:NA	6,6,6	6	TypeIII-D?,TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	csm2gr11|137aa|up_7|NZ_CP019636.1_4921707_4922118_-,NA|115aa|up_3|NZ_CP019636.1_4926814_4927159_+,NA|141aa|up_2|NZ_CP019636.1_4927151_4927574_+,NA|139aa|down_1|NZ_CP019636.1_4933653_4934070_-,NA|747aa|down_2|NZ_CP019636.1_4934078_4936319_-,NA|81aa|down_3|NZ_CP019636.1_4936585_4936828_+,NA|61aa|down_4|NZ_CP019636.1_4936947_4937130_+	NA|460aa|up_9|NZ_CP019636.1_4918963_4920343_+	pfam01548, DEDD_Tnp_IS110, Transposase	csx1|388aa|up_8|NZ_CP019636.1_4920480_4921644_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	csm2gr11|137aa|up_7|NZ_CP019636.1_4921707_4922118_-	NA	csx10gr5|419aa|up_6|NZ_CP019636.1_4922114_4923371_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|236aa|up_5|NZ_CP019636.1_4923367_4924075_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|781aa|up_4|NZ_CP019636.1_4924071_4926414_-	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	NA|115aa|up_3|NZ_CP019636.1_4926814_4927159_+	NA	NA|141aa|up_2|NZ_CP019636.1_4927151_4927574_+	NA	NA|285aa|up_1|NZ_CP019636.1_4927596_4928451_-	cd09702, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx3|309aa|up_0|NZ_CP019636.1_4928518_4929445_-	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	WYL|456aa|down_0|NZ_CP019636.1_4930661_4932029_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|139aa|down_1|NZ_CP019636.1_4933653_4934070_-	NA	NA|747aa|down_2|NZ_CP019636.1_4934078_4936319_-	NA	NA|81aa|down_3|NZ_CP019636.1_4936585_4936828_+	NA	NA|61aa|down_4|NZ_CP019636.1_4936947_4937130_+	NA	NA|103aa|down_5|NZ_CP019636.1_4937615_4937924_-	cd10158, CsoR-like_DUF156_1, Uncharacterized family 1; belongs to a superfamily containing the transcriptional regulators CsoR (copper-sensitive operon repressor), RcnR, and FrmR, and related domains; this family was previously known as part of DUF156	NA|356aa|down_6|NZ_CP019636.1_4938263_4939331_+	pfam01297, ZnuA, Zinc-uptake complex component A periplasmic	NA|324aa|down_7|NZ_CP019636.1_4940102_4941074_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|188aa|down_8|NZ_CP019636.1_4942718_4943282_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|287aa|down_9|NZ_CP019636.1_4943936_4944797_+	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	15	4932041-4932518	6,6	PILER-CR,CRT	no	csx19,csm3gr7,PD-DExK,csx1,csm2gr11,csx10gr5,cas10,csx3,WYL	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	 Type III-D?,Type III-D,Type III-A,Type III-C,Type III-B	GTTGCTTAACCACTAATCCCCGCAAGGGGACTGAAAC,GTTTCAGTCCCCTTGCGGGGATTAGTGGTTAAGCAAC	37,37	0	0	NA	NA	NA:NA	6,6	6	TypeIII-D?,TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	csm2gr11|137aa|up_9|NZ_CP019636.1_4921707_4922118_-,NA|115aa|up_5|NZ_CP019636.1_4926814_4927159_+,NA|141aa|up_4|NZ_CP019636.1_4927151_4927574_+,NA|64aa|up_1|NZ_CP019636.1_4930221_4930413_+,NA|139aa|down_0|NZ_CP019636.1_4933653_4934070_-,NA|747aa|down_1|NZ_CP019636.1_4934078_4936319_-,NA|81aa|down_2|NZ_CP019636.1_4936585_4936828_+,NA|61aa|down_3|NZ_CP019636.1_4936947_4937130_+,NA|932aa|down_9|NZ_CP019636.1_4946941_4949737_+	csm2gr11|137aa|up_9|NZ_CP019636.1_4921707_4922118_-	NA	csx10gr5|419aa|up_8|NZ_CP019636.1_4922114_4923371_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|236aa|up_7|NZ_CP019636.1_4923367_4924075_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|781aa|up_6|NZ_CP019636.1_4924071_4926414_-	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	NA|115aa|up_5|NZ_CP019636.1_4926814_4927159_+	NA	NA|141aa|up_4|NZ_CP019636.1_4927151_4927574_+	NA	NA|285aa|up_3|NZ_CP019636.1_4927596_4928451_-	cd09702, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx3|309aa|up_2|NZ_CP019636.1_4928518_4929445_-	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	NA|64aa|up_1|NZ_CP019636.1_4930221_4930413_+	NA	WYL|456aa|up_0|NZ_CP019636.1_4930661_4932029_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|139aa|down_0|NZ_CP019636.1_4933653_4934070_-	NA	NA|747aa|down_1|NZ_CP019636.1_4934078_4936319_-	NA	NA|81aa|down_2|NZ_CP019636.1_4936585_4936828_+	NA	NA|61aa|down_3|NZ_CP019636.1_4936947_4937130_+	NA	NA|103aa|down_4|NZ_CP019636.1_4937615_4937924_-	cd10158, CsoR-like_DUF156_1, Uncharacterized family 1; belongs to a superfamily containing the transcriptional regulators CsoR (copper-sensitive operon repressor), RcnR, and FrmR, and related domains; this family was previously known as part of DUF156	NA|356aa|down_5|NZ_CP019636.1_4938263_4939331_+	pfam01297, ZnuA, Zinc-uptake complex component A periplasmic	NA|324aa|down_6|NZ_CP019636.1_4940102_4941074_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|188aa|down_7|NZ_CP019636.1_4942718_4943282_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|287aa|down_8|NZ_CP019636.1_4943936_4944797_+	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|932aa|down_9|NZ_CP019636.1_4946941_4949737_+	NA
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	16	4932737-4933632	7,15,7	PILER-CR,CRISPRCasFinder,CRT	no	csm3gr7,PD-DExK,csx1,csm2gr11,csx10gr5,cas10,csx3,WYL	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	 Type III-D?,Type III-D,Type III-A,Type III-C,Type III-B	GTGACAAACTACCTCTTCCCCGCAAGGGGATTGAAAC,GTGACAAACTACCTCTTCCCCGCAAGGGGATTGAAAC,GTGACAAACTACCTCTTCCCCGCAAGGGGATTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	11,11,12	12	TypeIII-D?,TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	csm2gr11|137aa|up_9|NZ_CP019636.1_4921707_4922118_-,NA|115aa|up_5|NZ_CP019636.1_4926814_4927159_+,NA|141aa|up_4|NZ_CP019636.1_4927151_4927574_+,NA|64aa|up_1|NZ_CP019636.1_4930221_4930413_+,NA|139aa|down_0|NZ_CP019636.1_4933653_4934070_-,NA|747aa|down_1|NZ_CP019636.1_4934078_4936319_-,NA|81aa|down_2|NZ_CP019636.1_4936585_4936828_+,NA|61aa|down_3|NZ_CP019636.1_4936947_4937130_+,NA|932aa|down_9|NZ_CP019636.1_4946941_4949737_+	csm2gr11|137aa|up_9|NZ_CP019636.1_4921707_4922118_-	NA	csx10gr5|419aa|up_8|NZ_CP019636.1_4922114_4923371_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|236aa|up_7|NZ_CP019636.1_4923367_4924075_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|781aa|up_6|NZ_CP019636.1_4924071_4926414_-	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	NA|115aa|up_5|NZ_CP019636.1_4926814_4927159_+	NA	NA|141aa|up_4|NZ_CP019636.1_4927151_4927574_+	NA	NA|285aa|up_3|NZ_CP019636.1_4927596_4928451_-	cd09702, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx3|309aa|up_2|NZ_CP019636.1_4928518_4929445_-	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	NA|64aa|up_1|NZ_CP019636.1_4930221_4930413_+	NA	WYL|456aa|up_0|NZ_CP019636.1_4930661_4932029_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|139aa|down_0|NZ_CP019636.1_4933653_4934070_-	NA	NA|747aa|down_1|NZ_CP019636.1_4934078_4936319_-	NA	NA|81aa|down_2|NZ_CP019636.1_4936585_4936828_+	NA	NA|61aa|down_3|NZ_CP019636.1_4936947_4937130_+	NA	NA|103aa|down_4|NZ_CP019636.1_4937615_4937924_-	cd10158, CsoR-like_DUF156_1, Uncharacterized family 1; belongs to a superfamily containing the transcriptional regulators CsoR (copper-sensitive operon repressor), RcnR, and FrmR, and related domains; this family was previously known as part of DUF156	NA|356aa|down_5|NZ_CP019636.1_4938263_4939331_+	pfam01297, ZnuA, Zinc-uptake complex component A periplasmic	NA|324aa|down_6|NZ_CP019636.1_4940102_4941074_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|188aa|down_7|NZ_CP019636.1_4942718_4943282_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|287aa|down_8|NZ_CP019636.1_4943936_4944797_+	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|932aa|down_9|NZ_CP019636.1_4946941_4949737_+	NA
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	17	5170178-5170301	16	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	TGTAGGATGGGTTAGCGAAGCGTAACCCATGCTG	34	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|295aa|up_5|NZ_CP019636.1_5162191_5163076_-,NA|123aa|up_2|NZ_CP019636.1_5166309_5166678_-,NA|308aa|up_0|NZ_CP019636.1_5167196_5168120_+,NA|103aa|down_1|NZ_CP019636.1_5171660_5171969_-,NA|62aa|down_3|NZ_CP019636.1_5176681_5176867_+,NA|68aa|down_7|NZ_CP019636.1_5181267_5181471_+,NA|111aa|down_8|NZ_CP019636.1_5181678_5182011_+,NA|80aa|down_9|NZ_CP019636.1_5184112_5184352_-	NA|199aa|up_9|NZ_CP019636.1_5158583_5159180_-	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|236aa|up_8|NZ_CP019636.1_5159439_5160147_+	pfam11264, ThylakoidFormat, Thylakoid formation protein	NA|121aa|up_7|NZ_CP019636.1_5160217_5160580_-	COG1950, COG1950, Predicted membrane protein [Function unknown]	NA|440aa|up_6|NZ_CP019636.1_5160805_5162125_-	PLN02856, PLN02856, fumarylacetoacetase	NA|295aa|up_5|NZ_CP019636.1_5162191_5163076_-	NA	NA|364aa|up_4|NZ_CP019636.1_5163853_5164945_+	TIGR01263, 4-hydroxyphenylpyruvate_dioxygenase, 4-hydroxyphenylpyruvate dioxygenase	NA|386aa|up_3|NZ_CP019636.1_5165002_5166160_+	COG3508, HmgA, Homogentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|123aa|up_2|NZ_CP019636.1_5166309_5166678_-	NA	NA|144aa|up_1|NZ_CP019636.1_5166659_5167091_-	cd13633, PBP2_Sa-PDT_like, Catalytic domain of prephenate dehydratase from Staphylococcus aureus and similar proteins, subgroup 4; the type 2 periplasmic binding protein fold	NA|308aa|up_0|NZ_CP019636.1_5167196_5168120_+	NA	NA|400aa|down_0|NZ_CP019636.1_5170479_5171679_+	pfam01231, IDO, Indoleamine 2,3-dioxygenase	NA|103aa|down_1|NZ_CP019636.1_5171660_5171969_-	NA	NA|202aa|down_2|NZ_CP019636.1_5176035_5176641_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|62aa|down_3|NZ_CP019636.1_5176681_5176867_+	NA	NA|154aa|down_4|NZ_CP019636.1_5177615_5178077_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|357aa|down_5|NZ_CP019636.1_5178395_5179466_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|460aa|down_6|NZ_CP019636.1_5179753_5181133_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|68aa|down_7|NZ_CP019636.1_5181267_5181471_+	NA	NA|111aa|down_8|NZ_CP019636.1_5181678_5182011_+	NA	NA|80aa|down_9|NZ_CP019636.1_5184112_5184352_-	NA
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	18	6035546-6035626	17	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	ACTAACGCACCCTTGAAACCTTCT	24	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|65aa|up_9|NZ_CP019636.1_6018883_6019078_-,NA|106aa|up_8|NZ_CP019636.1_6019868_6020186_-,NA	NA|65aa|up_9|NZ_CP019636.1_6018883_6019078_-	NA	NA|106aa|up_8|NZ_CP019636.1_6019868_6020186_-	NA	NA|182aa|up_7|NZ_CP019636.1_6020573_6021119_-	cd10450, GIY-YIG_AtGrxS16_like, GIY-YIG domain found in CAXIP1-like proteins, iron-sulfur cluster assembly proteins, and similar proteins	NA|353aa|up_6|NZ_CP019636.1_6021506_6022565_+	COG2008, GLY1, Threonine aldolase [Amino acid transport and metabolism]	NA|211aa|up_5|NZ_CP019636.1_6022585_6023218_-	cd03349, LbH_XAT, Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others	NA|866aa|up_4|NZ_CP019636.1_6023311_6025909_-	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming)	NA|793aa|up_3|NZ_CP019636.1_6026178_6028557_-	pfam03170, BcsB, Bacterial cellulose synthase subunit	NA|422aa|up_2|NZ_CP019636.1_6028725_6029991_-	PRK11097, PRK11097, cellulase	NA|810aa|up_1|NZ_CP019636.1_6030068_6032498_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|682aa|up_0|NZ_CP019636.1_6032569_6034615_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|101aa|down_0|NZ_CP019636.1_6036320_6036623_+	CHL00102, rps20, ribosomal protein S20	NA|262aa|down_1|NZ_CP019636.1_6036801_6037587_+	COG0084, TatD, Mg-dependent DNase [DNA replication, recombination, and repair]	NA|1100aa|down_2|NZ_CP019636.1_6038369_6041669_+	PRK00405, rpoB, DNA-directed RNA polymerase subunit beta; Reviewed	NA|626aa|down_3|NZ_CP019636.1_6042129_6044007_+	PRK02625, rpoC1, DNA-directed RNA polymerase subunit gamma; Provisional	NA|1359aa|down_4|NZ_CP019636.1_6044187_6048264_+	PRK02597, rpoC2, DNA-directed RNA polymerase subunit beta'; Provisional	NA|546aa|down_5|NZ_CP019636.1_6048349_6049987_-	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|434aa|down_6|NZ_CP019636.1_6051497_6052799_+	pfam00815, Histidinol_dh, Histidinol dehydrogenase	NA|144aa|down_7|NZ_CP019636.1_6052975_6053407_+	pfam00582, Usp, Universal stress protein family	NA|44aa|down_8|NZ_CP019636.1_6053472_6053604_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|276aa|down_9|NZ_CP019636.1_6053701_6054529_-	PRK12896, PRK12896, methionine aminopeptidase; Reviewed
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	19	6281907-6282066	18	CRISPRCasFinder	no	c2c9_V-U4	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Type V-U4	CAAACGAATAAAGAAGAACAAACATCAC	28	0	0	NA	NA	NA	2	2	TypeV-U4	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|92aa|up_8|NZ_CP019636.1_6270803_6271079_+,NA|139aa|up_7|NZ_CP019636.1_6271086_6271503_-,NA|83aa|up_5|NZ_CP019636.1_6272120_6272369_-,NA|112aa|up_4|NZ_CP019636.1_6272440_6272776_+,NA	NA|466aa|up_9|NZ_CP019636.1_6269394_6270792_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|92aa|up_8|NZ_CP019636.1_6270803_6271079_+	NA	NA|139aa|up_7|NZ_CP019636.1_6271086_6271503_-	NA	NA|173aa|up_6|NZ_CP019636.1_6271519_6272038_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|83aa|up_5|NZ_CP019636.1_6272120_6272369_-	NA	NA|112aa|up_4|NZ_CP019636.1_6272440_6272776_+	NA	NA|196aa|up_3|NZ_CP019636.1_6275588_6276176_+	cd08866, SRPBCC_11, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|641aa|up_2|NZ_CP019636.1_6276752_6278675_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|219aa|up_1|NZ_CP019636.1_6278681_6279338_-	COG1290, QcrB, Cytochrome b subunit of the bc complex [Energy production and conversion]	NA|232aa|up_0|NZ_CP019636.1_6279452_6280148_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|118aa|down_0|NZ_CP019636.1_6282384_6282738_-	PRK13697, PRK13697, cytochrome c6; Provisional	NA|146aa|down_1|NZ_CP019636.1_6282961_6283399_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|326aa|down_2|NZ_CP019636.1_6283511_6284489_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	c2c9_V-U4|385aa|down_3|NZ_CP019636.1_6284681_6285836_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|140aa|down_4|NZ_CP019636.1_6286364_6286784_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|164aa|down_5|NZ_CP019636.1_6287366_6287858_-	PRK13618, psbV, cytochrome c-550; Provisional	NA|72aa|down_6|NZ_CP019636.1_6287980_6288196_-	pfam14177, YkyB, YkyB-like protein	NA|279aa|down_7|NZ_CP019636.1_6288986_6289823_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|300aa|down_8|NZ_CP019636.1_6289819_6290719_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|245aa|down_9|NZ_CP019636.1_6291247_6291982_-	pfam13649, Methyltransf_25, Methyltransferase domain
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	20	6520695-6520845	19	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CAAATGTCTTAGAGCATCTCCCCGACGAAACCAAGCCCAGTAATCATCC	49	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA|765aa|down_3|NZ_CP019636.1_6525852_6528147_-	NA|404aa|up_9|NZ_CP019636.1_6506444_6507656_+	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|428aa|up_8|NZ_CP019636.1_6507652_6508936_+	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|341aa|up_7|NZ_CP019636.1_6508997_6510020_+	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|273aa|up_6|NZ_CP019636.1_6510127_6510946_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|355aa|up_5|NZ_CP019636.1_6511022_6512087_+	cd05258, CDP_TE_SDR_e, CDP-tyvelose 2-epimerase, extended (e) SDRs	NA|139aa|up_4|NZ_CP019636.1_6512200_6512617_+	pfam04138, GtrA, GtrA-like protein	NA|551aa|up_3|NZ_CP019636.1_6512654_6514307_+	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|507aa|up_2|NZ_CP019636.1_6514327_6515848_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|438aa|up_1|NZ_CP019636.1_6517305_6518619_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|505aa|up_0|NZ_CP019636.1_6518683_6520198_+	TIGR02733, similar_to_to_phytoene_dehydrogenase, C-3',4' desaturase CrtD	NA|909aa|down_0|NZ_CP019636.1_6521513_6524240_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|284aa|down_1|NZ_CP019636.1_6524346_6525198_+	cd05243, SDR_a5, atypical (a) SDRs, subgroup 5	NA|133aa|down_2|NZ_CP019636.1_6525209_6525608_-	pfam07845, DUF1636, Protein of unknown function (DUF1636)	NA|765aa|down_3|NZ_CP019636.1_6525852_6528147_-	NA	NA|159aa|down_4|NZ_CP019636.1_6528380_6528857_+	pfam08670, MEKHLA, MEKHLA domain	NA|720aa|down_5|NZ_CP019636.1_6528872_6531032_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|464aa|down_6|NZ_CP019636.1_6531217_6532609_-	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|237aa|down_7|NZ_CP019636.1_6532838_6533549_-	TIGR04533, cyanosortB_assc, cyanoexosortase B-associated protein	NA|298aa|down_8|NZ_CP019636.1_6533793_6534687_-	TIGR04156, eight_transmembrane_protein_EpsH, cyanoexosortase B	NA|386aa|down_9|NZ_CP019636.1_6534844_6536002_-	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	21	6558638-6558729	20	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CAAAGACTCAAGCGTGAGTATCAGAACCT	29	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|60aa|up_9|NZ_CP019636.1_6548395_6548575_+,NA	NA|60aa|up_9|NZ_CP019636.1_6548395_6548575_+	NA	NA|364aa|up_8|NZ_CP019636.1_6548674_6549766_-	PRK14457, PRK14457, 23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN	NA|110aa|up_7|NZ_CP019636.1_6549856_6550186_-	pfam01035, DNA_binding_1, 6-O-methylguanine DNA methyltransferase, DNA binding domain	NA|118aa|up_6|NZ_CP019636.1_6550207_6550561_-	pfam01187, MIF, Macrophage migration inhibitory factor (MIF)	NA|263aa|up_5|NZ_CP019636.1_6550697_6551486_-	COG2875, CobM, Precorrin-4 methylase [Coenzyme metabolism]	NA|284aa|up_4|NZ_CP019636.1_6551670_6552522_-	pfam01790, LGT, Prolipoprotein diacylglyceryl transferase	NA|189aa|up_3|NZ_CP019636.1_6552929_6553496_+	PRK00168, coaD, phosphopantetheine adenylyltransferase; Provisional	NA|182aa|up_2|NZ_CP019636.1_6553537_6554083_+	COG3599, DivIVA, Cell division initiation protein [Cell division and chromosome partitioning]	NA|554aa|up_1|NZ_CP019636.1_6554439_6556101_+	cd07333, M48C_bepA_like, Peptidase M48C Ste24p bepA-like, integral membrane protein	NA|270aa|up_0|NZ_CP019636.1_6556172_6556982_+	pfam14218, COP23, Circadian oscillating protein COP23	NA|1199aa|down_0|NZ_CP019636.1_6559031_6562628_+	COG4247, Phy, 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]	NA|347aa|down_1|NZ_CP019636.1_6562817_6563858_-	COG3367, COG3367, Uncharacterized conserved protein [Function unknown]	NA|351aa|down_2|NZ_CP019636.1_6563841_6564894_-	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|183aa|down_3|NZ_CP019636.1_6565156_6565705_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|308aa|down_4|NZ_CP019636.1_6565819_6566743_-	cd02647, nuc_hydro_TvIAG, nuc_hydro_ TvIAG:  Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax	NA|144aa|down_5|NZ_CP019636.1_6566756_6567188_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|309aa|down_6|NZ_CP019636.1_6567261_6568188_-	cd01174, ribokinase, Ribokinase catalyses the phosphorylation of ribose to ribose-5-phosphate using ATP	NA|186aa|down_7|NZ_CP019636.1_6568424_6568982_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|436aa|down_8|NZ_CP019636.1_6569038_6570346_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|123aa|down_9|NZ_CP019636.1_6571029_6571398_-	pfam02152, FolB, Dihydroneopterin aldolase
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	22	6760309-6760409	21	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	GCGAGTGCGTCTGGTGATAGGAGAAGGG	28	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA|57aa|down_2|NZ_CP019636.1_6763222_6763393_+,NA|29aa|down_5|NZ_CP019636.1_6765640_6765727_+	NA|493aa|up_9|NZ_CP019636.1_6741774_6743253_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|159aa|up_8|NZ_CP019636.1_6745858_6746335_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|1321aa|up_7|NZ_CP019636.1_6746345_6750308_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|409aa|up_6|NZ_CP019636.1_6750557_6751784_-	COG3597, COG3597, Uncharacterized protein/domain associated with GTPases [Function unknown]	NA|271aa|up_5|NZ_CP019636.1_6751898_6752711_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|244aa|up_4|NZ_CP019636.1_6752801_6753533_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|534aa|up_3|NZ_CP019636.1_6753905_6755507_+	PRK11893, PRK11893, methionyl-tRNA synthetase; Reviewed	NA|211aa|up_2|NZ_CP019636.1_6755655_6756288_+	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|419aa|up_1|NZ_CP019636.1_6756674_6757931_+	TIGR04409, LptC_YrbK, LPS export ABC transporter periplasmic protein LptC	NA|441aa|up_0|NZ_CP019636.1_6758182_6759505_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|342aa|down_0|NZ_CP019636.1_6761017_6762043_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|331aa|down_1|NZ_CP019636.1_6762155_6763148_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|57aa|down_2|NZ_CP019636.1_6763222_6763393_+	NA	NA|293aa|down_3|NZ_CP019636.1_6763430_6764309_+	COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism]	NA|213aa|down_4|NZ_CP019636.1_6764774_6765413_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|29aa|down_5|NZ_CP019636.1_6765640_6765727_+	NA	NA|346aa|down_6|NZ_CP019636.1_6765804_6766842_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|101aa|down_7|NZ_CP019636.1_6766984_6767287_+	COG5626, COG5626, Uncharacterized small conserved protein [Function unknown]	NA|177aa|down_8|NZ_CP019636.1_6767515_6768046_-	cd06259, YdcF-like, YdcF-like	NA|367aa|down_9|NZ_CP019636.1_6768814_6769915_-	pfam14249, Tocopherol_cycl, Tocopherol cyclase
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	23	6764555-6764664	22	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	TCACCAGACGCACTCGCGTTCGGGT	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|57aa|up_1|NZ_CP019636.1_6763222_6763393_+,NA|29aa|down_1|NZ_CP019636.1_6765640_6765727_+	NA|271aa|up_9|NZ_CP019636.1_6751898_6752711_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|244aa|up_8|NZ_CP019636.1_6752801_6753533_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|534aa|up_7|NZ_CP019636.1_6753905_6755507_+	PRK11893, PRK11893, methionyl-tRNA synthetase; Reviewed	NA|211aa|up_6|NZ_CP019636.1_6755655_6756288_+	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|419aa|up_5|NZ_CP019636.1_6756674_6757931_+	TIGR04409, LptC_YrbK, LPS export ABC transporter periplasmic protein LptC	NA|441aa|up_4|NZ_CP019636.1_6758182_6759505_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|342aa|up_3|NZ_CP019636.1_6761017_6762043_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|331aa|up_2|NZ_CP019636.1_6762155_6763148_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|57aa|up_1|NZ_CP019636.1_6763222_6763393_+	NA	NA|293aa|up_0|NZ_CP019636.1_6763430_6764309_+	COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism]	NA|213aa|down_0|NZ_CP019636.1_6764774_6765413_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|29aa|down_1|NZ_CP019636.1_6765640_6765727_+	NA	NA|346aa|down_2|NZ_CP019636.1_6765804_6766842_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|101aa|down_3|NZ_CP019636.1_6766984_6767287_+	COG5626, COG5626, Uncharacterized small conserved protein [Function unknown]	NA|177aa|down_4|NZ_CP019636.1_6767515_6768046_-	cd06259, YdcF-like, YdcF-like	NA|367aa|down_5|NZ_CP019636.1_6768814_6769915_-	pfam14249, Tocopherol_cycl, Tocopherol cyclase	NA|205aa|down_6|NZ_CP019636.1_6769944_6770559_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|686aa|down_7|NZ_CP019636.1_6770731_6772789_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|278aa|down_8|NZ_CP019636.1_6773370_6774204_-	COG1408, COG1408, Predicted phosphohydrolases [General function prediction only]	NA|381aa|down_9|NZ_CP019636.1_6774387_6775530_+	cd02042, ParAB_family, partition proteins ParAB family
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	24	7250382-7250507	23	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CGCTAACGCCAGTTGCCGTGAGAGCGGGAAACCC	34	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|327aa|up_5|NZ_CP019636.1_7243938_7244919_-,NA|397aa|up_4|NZ_CP019636.1_7244911_7246102_-,NA|158aa|up_3|NZ_CP019636.1_7246306_7246780_+,NA|66aa|up_1|NZ_CP019636.1_7248986_7249184_+,NA|82aa|down_1|NZ_CP019636.1_7252830_7253076_+,NA|63aa|down_3|NZ_CP019636.1_7255759_7255948_+,NA|110aa|down_4|NZ_CP019636.1_7256361_7256691_-	NA|201aa|up_9|NZ_CP019636.1_7238576_7239179_-	PRK00300, gmk, guanylate kinase; Provisional	NA|90aa|up_8|NZ_CP019636.1_7239344_7239614_-	PRK04323, PRK04323, hypothetical protein; Provisional	NA|304aa|up_7|NZ_CP019636.1_7240326_7241238_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|776aa|up_6|NZ_CP019636.1_7241381_7243709_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|327aa|up_5|NZ_CP019636.1_7243938_7244919_-	NA	NA|397aa|up_4|NZ_CP019636.1_7244911_7246102_-	NA	NA|158aa|up_3|NZ_CP019636.1_7246306_7246780_+	NA	NA|324aa|up_2|NZ_CP019636.1_7246993_7247965_-	TIGR02749, Prenyl_transferase, solanesyl diphosphate synthase	NA|66aa|up_1|NZ_CP019636.1_7248986_7249184_+	NA	NA|287aa|up_0|NZ_CP019636.1_7249251_7250112_-	PRK00865, PRK00865, glutamate racemase; Provisional	NA|665aa|down_0|NZ_CP019636.1_7250707_7252702_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|82aa|down_1|NZ_CP019636.1_7252830_7253076_+	NA	NA|647aa|down_2|NZ_CP019636.1_7253558_7255499_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|63aa|down_3|NZ_CP019636.1_7255759_7255948_+	NA	NA|110aa|down_4|NZ_CP019636.1_7256361_7256691_-	NA	NA|476aa|down_5|NZ_CP019636.1_7256918_7258346_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|138aa|down_6|NZ_CP019636.1_7258431_7258845_-	pfam01878, EVE, EVE domain	NA|251aa|down_7|NZ_CP019636.1_7259292_7260045_+	COG2968, COG2968, Uncharacterized conserved protein [Function unknown]	NA|121aa|down_8|NZ_CP019636.1_7260109_7260472_-	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|336aa|down_9|NZ_CP019636.1_7260844_7261852_+	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	25	7420369-7420458	24	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	TCACCCGAACGCGAGTGCGTCTGGTG	26	1	1	7420395-7420432	NZ_CP019636.1_786934-786971	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|66aa|up_2|NZ_CP019636.1_7418191_7418389_+,NA|248aa|down_2|NZ_CP019636.1_7422096_7422840_+	NA|1131aa|up_9|NZ_CP019636.1_7405209_7408602_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|624aa|up_8|NZ_CP019636.1_7408594_7410466_-	COG4449, COG4449, Predicted protease of the Abi (CAAX) family [General function prediction only]	NA|657aa|up_7|NZ_CP019636.1_7410973_7412944_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|417aa|up_6|NZ_CP019636.1_7413045_7414296_-	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|119aa|up_5|NZ_CP019636.1_7414538_7414895_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|349aa|up_4|NZ_CP019636.1_7414989_7416036_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|428aa|up_3|NZ_CP019636.1_7416406_7417690_-	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|66aa|up_2|NZ_CP019636.1_7418191_7418389_+	NA	NA|169aa|up_1|NZ_CP019636.1_7418563_7419070_+	pfam16166, TIC20, Chloroplast import apparatus Tic20-like	NA|268aa|up_0|NZ_CP019636.1_7419305_7420109_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|109aa|down_0|NZ_CP019636.1_7421053_7421380_+	cd15487, bS6_chloro_cyano, 30S ribosomal protein S6 of chloroplasts and cyanobacteria	NA|186aa|down_1|NZ_CP019636.1_7421546_7422104_+	TIGR04211, hypothetical_protein, SH3 domain protein	NA|248aa|down_2|NZ_CP019636.1_7422096_7422840_+	NA	NA|118aa|down_3|NZ_CP019636.1_7422844_7423198_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|211aa|down_4|NZ_CP019636.1_7423652_7424285_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|150aa|down_5|NZ_CP019636.1_7424400_7424850_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|461aa|down_6|NZ_CP019636.1_7424836_7426219_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|326aa|down_7|NZ_CP019636.1_7426524_7427502_+	PRK10717, PRK10717, cysteine synthase A; Provisional	NA|111aa|down_8|NZ_CP019636.1_7427822_7428155_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|331aa|down_9|NZ_CP019636.1_7429330_7430323_-	COG0354, COG0354, Predicted aminomethyltransferase related to GcvT [General function prediction only]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	26	7713014-7713094	25	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	TCTTTTCTTAAAGTCAGAAAGTAGGTGA	28	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA|88aa|down_9|NZ_CP019636.1_7724095_7724359_+	NA|316aa|up_9|NZ_CP019636.1_7700396_7701344_-	cd05300, 2-Hacid_dh_1, Putative D-isomer specific 2-hydroxyacid dehydrogenase	NA|146aa|up_8|NZ_CP019636.1_7701598_7702036_-	cd01284, Riboflavin_deaminase-reductase, Riboflavin-specific deaminase	NA|380aa|up_7|NZ_CP019636.1_7702038_7703178_-	cd13555, PBP2_sulfate_ester_like, Sulfate ester binding protein-like, the type 2 periplasmic binding protein fold	NA|110aa|up_6|NZ_CP019636.1_7703184_7703514_-	PRK09626, oorD, 2-oxoglutarate-acceptor oxidoreductase subunit OorD; Reviewed	NA|549aa|up_5|NZ_CP019636.1_7703510_7705157_-	COG1053, SdhA, Succinate dehydrogenase/fumarate reductase, flavoprotein subunit [Energy production and conversion]	NA|347aa|up_4|NZ_CP019636.1_7705644_7706685_-	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|664aa|up_3|NZ_CP019636.1_7707147_7709139_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|372aa|up_2|NZ_CP019636.1_7709974_7711090_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|274aa|up_1|NZ_CP019636.1_7711216_7712038_+	cd02146, NfsA-like, nitroreductase similar to Escherichia coli NfsA	NA|263aa|up_0|NZ_CP019636.1_7712125_7712914_+	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|275aa|down_0|NZ_CP019636.1_7713875_7714700_+	PRK11247, ssuB, aliphatic sulfonates transport ATP-binding subunit; Provisional	NA|399aa|down_1|NZ_CP019636.1_7714764_7715961_+	cd01163, DszC, Dibenzothiophene (DBT) desulfurization enzyme C	NA|356aa|down_2|NZ_CP019636.1_7716546_7717614_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|288aa|down_3|NZ_CP019636.1_7717696_7718560_+	COG2175, TauD, Probable taurine catabolism dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|332aa|down_4|NZ_CP019636.1_7718572_7719568_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|271aa|down_5|NZ_CP019636.1_7719633_7720446_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|351aa|down_6|NZ_CP019636.1_7720730_7721783_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|383aa|down_7|NZ_CP019636.1_7721840_7722989_+	PRK00719, PRK00719, alkanesulfonate monooxygenase; Provisional	NA|268aa|down_8|NZ_CP019636.1_7722998_7723802_+	PRK11365, ssuC, aliphatic sulfonate ABC transporter permease SsuC	NA|88aa|down_9|NZ_CP019636.1_7724095_7724359_+	NA
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	27	7724018-7724128	26	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	AGTACCCATGCTGTCCTTTGCAATTGATGCGTTACGGCTA	40	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA,NA	NA|263aa|up_9|NZ_CP019636.1_7712125_7712914_+	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|275aa|up_8|NZ_CP019636.1_7713875_7714700_+	PRK11247, ssuB, aliphatic sulfonates transport ATP-binding subunit; Provisional	NA|399aa|up_7|NZ_CP019636.1_7714764_7715961_+	cd01163, DszC, Dibenzothiophene (DBT) desulfurization enzyme C	NA|356aa|up_6|NZ_CP019636.1_7716546_7717614_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|288aa|up_5|NZ_CP019636.1_7717696_7718560_+	COG2175, TauD, Probable taurine catabolism dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|332aa|up_4|NZ_CP019636.1_7718572_7719568_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|271aa|up_3|NZ_CP019636.1_7719633_7720446_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|351aa|up_2|NZ_CP019636.1_7720730_7721783_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|383aa|up_1|NZ_CP019636.1_7721840_7722989_+	PRK00719, PRK00719, alkanesulfonate monooxygenase; Provisional	NA|268aa|up_0|NZ_CP019636.1_7722998_7723802_+	PRK11365, ssuC, aliphatic sulfonate ABC transporter permease SsuC	NA|358aa|down_0|NZ_CP019636.1_7727150_7728224_-	COG1672, COG1672, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|70aa|down_1|NZ_CP019636.1_7729369_7729579_-	COG3585, MopI, Molybdopterin-binding protein [Coenzyme metabolism]	NA|91aa|down_2|NZ_CP019636.1_7730197_7730470_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|619aa|down_3|NZ_CP019636.1_7730647_7732504_-	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|275aa|down_4|NZ_CP019636.1_7732748_7733573_-	cd13537, PBP2_YvgL_like, Substrate binding domain of putative molybdate-binding protein YvgL and similar proteins;the type 2 periplasmic binding protein fold	NA|231aa|down_5|NZ_CP019636.1_7733951_7734644_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|231aa|down_6|NZ_CP019636.1_7734866_7735559_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|277aa|down_7|NZ_CP019636.1_7736837_7737668_-	cd17538, REC_D1_PleD-like, first (D1) phosphoacceptor receiver (REC) domain of response regulator PleD and similar domains	NA|658aa|down_8|NZ_CP019636.1_7737733_7739707_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|634aa|down_9|NZ_CP019636.1_7740957_7742859_+	COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	28	7726495-7726704	8	CRT	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	CACGCCNANCGCCACGCC	18	2	3	7726597-7726614|7726669-7726686|7726669-7726686	NZ_CP019636.1_7726472-7726489|NZ_CP019636.1_7726693-7726710|NZ_CP019636.1_7726705-7726722	NA	5	5	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|88aa|up_0|NZ_CP019636.1_7724095_7724359_+,NA	NA|275aa|up_9|NZ_CP019636.1_7713875_7714700_+	PRK11247, ssuB, aliphatic sulfonates transport ATP-binding subunit; Provisional	NA|399aa|up_8|NZ_CP019636.1_7714764_7715961_+	cd01163, DszC, Dibenzothiophene (DBT) desulfurization enzyme C	NA|356aa|up_7|NZ_CP019636.1_7716546_7717614_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|288aa|up_6|NZ_CP019636.1_7717696_7718560_+	COG2175, TauD, Probable taurine catabolism dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|332aa|up_5|NZ_CP019636.1_7718572_7719568_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|271aa|up_4|NZ_CP019636.1_7719633_7720446_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|351aa|up_3|NZ_CP019636.1_7720730_7721783_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|383aa|up_2|NZ_CP019636.1_7721840_7722989_+	PRK00719, PRK00719, alkanesulfonate monooxygenase; Provisional	NA|268aa|up_1|NZ_CP019636.1_7722998_7723802_+	PRK11365, ssuC, aliphatic sulfonate ABC transporter permease SsuC	NA|88aa|up_0|NZ_CP019636.1_7724095_7724359_+	NA	NA|358aa|down_0|NZ_CP019636.1_7727150_7728224_-	COG1672, COG1672, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|70aa|down_1|NZ_CP019636.1_7729369_7729579_-	COG3585, MopI, Molybdopterin-binding protein [Coenzyme metabolism]	NA|91aa|down_2|NZ_CP019636.1_7730197_7730470_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|619aa|down_3|NZ_CP019636.1_7730647_7732504_-	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|275aa|down_4|NZ_CP019636.1_7732748_7733573_-	cd13537, PBP2_YvgL_like, Substrate binding domain of putative molybdate-binding protein YvgL and similar proteins;the type 2 periplasmic binding protein fold	NA|231aa|down_5|NZ_CP019636.1_7733951_7734644_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|231aa|down_6|NZ_CP019636.1_7734866_7735559_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|277aa|down_7|NZ_CP019636.1_7736837_7737668_-	cd17538, REC_D1_PleD-like, first (D1) phosphoacceptor receiver (REC) domain of response regulator PleD and similar domains	NA|658aa|down_8|NZ_CP019636.1_7737733_7739707_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|634aa|down_9|NZ_CP019636.1_7740957_7742859_+	COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism]
GCF_002163975.1_ASM216397v1	NZ_CP019636	Nostocales cyanobacterium HT-58-2, complete genome	29	7786140-7786251	27	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	Orphan	AGGGGATGCAGAGGGAGCAGATGAAAA	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,csa3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,Cas14u_CAS-V,cas14k,cas6,2OG_CAS,csc1gr5,cas10d,csc2gr7,cas3,cas4,cas1,cas2,csx21,PD-DExK,csm2gr11,csx3,DinG,c2c10_CAS-V-U3	NA|579aa|up_9|NZ_CP019636.1_7771569_7773306_+,NA|391aa|up_6|NZ_CP019636.1_7777529_7778702_+,NA|54aa|up_4|NZ_CP019636.1_7779423_7779585_+,NA|140aa|up_1|NZ_CP019636.1_7783420_7783840_+,NA|141aa|down_3|NZ_CP019636.1_7791947_7792370_-,NA|101aa|down_4|NZ_CP019636.1_7792568_7792871_-,NA|133aa|down_5|NZ_CP019636.1_7793143_7793542_-,NA|195aa|down_6|NZ_CP019636.1_7793613_7794198_-,NA|277aa|down_7|NZ_CP019636.1_7794272_7795103_-	NA|579aa|up_9|NZ_CP019636.1_7771569_7773306_+	NA	NA|859aa|up_8|NZ_CP019636.1_7773791_7776368_+	cd04299, GT35_Glycogen_Phosphorylase-like, proteins similar to glycogen phosphorylase	NA|249aa|up_7|NZ_CP019636.1_7776418_7777165_-	COG4328, COG4328, Predicted nuclease (RNAse H fold) [General function prediction only]	NA|391aa|up_6|NZ_CP019636.1_7777529_7778702_+	NA	NA|136aa|up_5|NZ_CP019636.1_7779013_7779421_+	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|54aa|up_4|NZ_CP019636.1_7779423_7779585_+	NA	NA|527aa|up_3|NZ_CP019636.1_7779570_7781151_-	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|480aa|up_2|NZ_CP019636.1_7781519_7782959_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|140aa|up_1|NZ_CP019636.1_7783420_7783840_+	NA	NA|487aa|up_0|NZ_CP019636.1_7783954_7785415_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|324aa|down_0|NZ_CP019636.1_7788453_7789425_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|548aa|down_1|NZ_CP019636.1_7789414_7791058_-	pfam00665, rve, Integrase core domain	NA|188aa|down_2|NZ_CP019636.1_7791069_7791633_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|141aa|down_3|NZ_CP019636.1_7791947_7792370_-	NA	NA|101aa|down_4|NZ_CP019636.1_7792568_7792871_-	NA	NA|133aa|down_5|NZ_CP019636.1_7793143_7793542_-	NA	NA|195aa|down_6|NZ_CP019636.1_7793613_7794198_-	NA	NA|277aa|down_7|NZ_CP019636.1_7794272_7795103_-	NA	NA|439aa|down_8|NZ_CP019636.1_7796067_7797384_+	pfam01391, Collagen, Collagen triple helix repeat (20 copies)	NA|1246aa|down_9|NZ_CP019636.1_7797426_7801164_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
