assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	1	527053-527151	1	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GTTGAGCCTCAAAAGGTCAACATCGAGCCTCAGAA	35	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|103aa|up_8|NZ_CP040094.1_517942_518251_-,NA|54aa|up_7|NZ_CP040094.1_518261_518423_+,NA|158aa|up_2|NZ_CP040094.1_522779_523253_+,NA|145aa|up_1|NZ_CP040094.1_523289_523724_-,NA|78aa|down_1|NZ_CP040094.1_529503_529737_-,NA|141aa|down_2|NZ_CP040094.1_529839_530262_-,NA|88aa|down_4|NZ_CP040094.1_531437_531701_-	NA|144aa|up_9|NZ_CP040094.1_517400_517832_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|103aa|up_8|NZ_CP040094.1_517942_518251_-	NA	NA|54aa|up_7|NZ_CP040094.1_518261_518423_+	NA	NA|435aa|up_6|NZ_CP040094.1_518456_519761_+	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|325aa|up_5|NZ_CP040094.1_519913_520888_+	COG3221, PhnD, ABC-type phosphate/phosphonate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|245aa|up_4|NZ_CP040094.1_521017_521752_+	COG3638, COG3638, ABC-type phosphate/phosphonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|265aa|up_3|NZ_CP040094.1_521900_522695_+	TIGR01097, PhnE, phosphonate ABC transporter, permease protein PhnE	NA|158aa|up_2|NZ_CP040094.1_522779_523253_+	NA	NA|145aa|up_1|NZ_CP040094.1_523289_523724_-	NA	NA|423aa|up_0|NZ_CP040094.1_525412_526681_+	sd00006, TPR, Tetratricopeptide repeat	NA|658aa|down_0|NZ_CP040094.1_527302_529276_+	pfam13191, AAA_16, AAA ATPase domain	NA|78aa|down_1|NZ_CP040094.1_529503_529737_-	NA	NA|141aa|down_2|NZ_CP040094.1_529839_530262_-	NA	NA|219aa|down_3|NZ_CP040094.1_530722_531379_-	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|88aa|down_4|NZ_CP040094.1_531437_531701_-	NA	NA|185aa|down_5|NZ_CP040094.1_532176_532731_+	COG0783, Dps, DNA-binding ferritin-like protein (oxidative damage protectant) [Inorganic ion transport and metabolism]	NA|1088aa|down_6|NZ_CP040094.1_533260_536524_+	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|319aa|down_7|NZ_CP040094.1_537103_538060_+	PRK07405, PRK07405, RNA polymerase sigma factor SigD; Validated	NA|115aa|down_8|NZ_CP040094.1_538319_538664_+	pfam05542, DUF760, Protein of unknown function (DUF760)	NA|151aa|down_9|NZ_CP040094.1_538912_539365_+	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	2	749091-749341	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	CCCTACCGATTGGGTTAAATCGGAATTAATGGAAA,CCCTACCGATTGGGTTAAATCGGAATTAATGGAAAC,CCCTACCGATTGGGTTAAATCGGAATTAATGGAAAC	35,36,36	0	0	NA	NA	NA:NA:NA	3,3,3	3	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|388aa|up_9|NZ_CP040094.1_736776_737940_+,NA|48aa|up_7|NZ_CP040094.1_739203_739347_-,NA|124aa|up_6|NZ_CP040094.1_739499_739871_+,NA|149aa|up_5|NZ_CP040094.1_740136_740583_+,NA|89aa|down_2|NZ_CP040094.1_752781_753048_+	NA|388aa|up_9|NZ_CP040094.1_736776_737940_+	NA	NA|321aa|up_8|NZ_CP040094.1_738224_739187_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|48aa|up_7|NZ_CP040094.1_739203_739347_-	NA	NA|124aa|up_6|NZ_CP040094.1_739499_739871_+	NA	NA|149aa|up_5|NZ_CP040094.1_740136_740583_+	NA	NA|636aa|up_4|NZ_CP040094.1_741141_743049_+	PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase; Provisional	NA|394aa|up_3|NZ_CP040094.1_743288_744470_-	cd08152, y4iL_like, Catalase-like heme-binding proteins similar to the uncharacterized y4iL	NA|543aa|up_2|NZ_CP040094.1_744559_746188_-	cd09816, prostaglandin_endoperoxide_synthase, Animal prostaglandin endoperoxide synthase and related bacterial proteins	NA|336aa|up_1|NZ_CP040094.1_746271_747279_-	pfam09994, DUF2235, Uncharacterized alpha/beta hydrolase domain (DUF2235)	NA|354aa|up_0|NZ_CP040094.1_747812_748874_+	PRK05720, mtnA, methylthioribose-1-phosphate isomerase; Reviewed	NA|474aa|down_0|NZ_CP040094.1_750052_751474_-	PRK07362, PRK07362, NADP-dependent isocitrate dehydrogenase	NA|243aa|down_1|NZ_CP040094.1_751862_752591_+	pfam05419, GUN4, GUN4-like	NA|89aa|down_2|NZ_CP040094.1_752781_753048_+	NA	NA|381aa|down_3|NZ_CP040094.1_753099_754242_+	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|205aa|down_4|NZ_CP040094.1_754517_755132_+	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|245aa|down_5|NZ_CP040094.1_755226_755961_-	COG0357, GidB, Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division [Cell envelope biogenesis, outer membrane]	NA|225aa|down_6|NZ_CP040094.1_756131_756806_-	COG1122, CbiO, ABC-type cobalt transport system, ATPase component [Inorganic ion transport and metabolism]	NA|320aa|down_7|NZ_CP040094.1_757146_758106_+	sd00006, TPR, Tetratricopeptide repeat	NA|151aa|down_8|NZ_CP040094.1_758244_758697_+	pfam12049, DUF3531, Protein of unknown function (DUF3531)	NA|144aa|down_9|NZ_CP040094.1_758705_759137_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	3	962056-962170	3	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	AAAGCGGCCAAGCAACTGCTCAAATTCT	28	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|58aa|up_8|NZ_CP040094.1_946637_946811_+,NA|124aa|up_7|NZ_CP040094.1_946921_947293_+,NA|80aa|up_5|NZ_CP040094.1_953136_953376_+,NA|57aa|up_4|NZ_CP040094.1_953561_953732_+,NA|356aa|up_3|NZ_CP040094.1_953704_954771_-,NA|84aa|up_2|NZ_CP040094.1_954907_955159_+,NA|166aa|up_1|NZ_CP040094.1_955211_955709_+,NA|64aa|down_1|NZ_CP040094.1_964628_964820_-,NA|132aa|down_5|NZ_CP040094.1_969379_969775_-,NA|146aa|down_8|NZ_CP040094.1_970664_971102_+	NA|95aa|up_9|NZ_CP040094.1_945716_946001_-	pfam04248, NTP_transf_9, Domain of unknown function (DUF427)	NA|58aa|up_8|NZ_CP040094.1_946637_946811_+	NA	NA|124aa|up_7|NZ_CP040094.1_946921_947293_+	NA	NA|732aa|up_6|NZ_CP040094.1_947447_949643_-	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|80aa|up_5|NZ_CP040094.1_953136_953376_+	NA	NA|57aa|up_4|NZ_CP040094.1_953561_953732_+	NA	NA|356aa|up_3|NZ_CP040094.1_953704_954771_-	NA	NA|84aa|up_2|NZ_CP040094.1_954907_955159_+	NA	NA|166aa|up_1|NZ_CP040094.1_955211_955709_+	NA	NA|259aa|up_0|NZ_CP040094.1_955852_956629_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|420aa|down_0|NZ_CP040094.1_963182_964442_-	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|64aa|down_1|NZ_CP040094.1_964628_964820_-	NA	NA|253aa|down_2|NZ_CP040094.1_965037_965796_-	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|432aa|down_3|NZ_CP040094.1_965989_967285_+	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|524aa|down_4|NZ_CP040094.1_967595_969167_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|132aa|down_5|NZ_CP040094.1_969379_969775_-	NA	NA|89aa|down_6|NZ_CP040094.1_969909_970176_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|87aa|down_7|NZ_CP040094.1_970370_970631_+	TIGR00911, High-affinity_methionine_permease, L-type amino acid transporter	NA|146aa|down_8|NZ_CP040094.1_970664_971102_+	NA	NA|106aa|down_9|NZ_CP040094.1_971446_971764_+	pfam07862, Nif11, Nif11 domain
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	4	1113127-1114039	2,4,2	PILER-CR,CRISPRCasFinder,CRT	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	ATTGCAATTCATCAAAATCCCTATTAGGG----------ATTGAAAC,ATTGCAATTCATCAAAATCCCTATTAGGGATTGAAAC,ATTGCAATTCATCAAAATCCCTATTAGGGATTGAAAC	47,37,37	0	0	NA	NA	N:A	12,12,12	12	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|82aa|up_6|NZ_CP040094.1_1107407_1107653_-,NA|124aa|up_1|NZ_CP040094.1_1111418_1111790_+,NA|141aa|down_5|NZ_CP040094.1_1117592_1118015_-	NA|380aa|up_9|NZ_CP040094.1_1101665_1102805_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|364aa|up_8|NZ_CP040094.1_1103001_1104093_-	TIGR00378, cax, calcium/proton exchanger (cax)	NA|906aa|up_7|NZ_CP040094.1_1104619_1107337_+	cd10797, GH57N_APU_like_1, N-terminal putative catalytic domain of mainly uncharacterized prokaryotic proteins similar to archaeal thermoactive amylopullulanases; glycoside hydrolase family 57 (GH57)	NA|82aa|up_6|NZ_CP040094.1_1107407_1107653_-	NA	NA|82aa|up_5|NZ_CP040094.1_1107688_1107934_-	pfam14279, HNH_5, HNH endonuclease	NA|366aa|up_4|NZ_CP040094.1_1107942_1109040_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|156aa|up_3|NZ_CP040094.1_1109074_1109542_-	COG3296, COG3296, Uncharacterized protein conserved in bacteria [Function unknown]	NA|497aa|up_2|NZ_CP040094.1_1109833_1111324_+	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|124aa|up_1|NZ_CP040094.1_1111418_1111790_+	NA	NA|341aa|up_0|NZ_CP040094.1_1111792_1112815_+	COG3491, PcbC, Isopenicillin N synthase and related dioxygenases [General function prediction only]	NA|78aa|down_0|NZ_CP040094.1_1114138_1114372_-	COG5126, FRQ1, Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]	NA|182aa|down_1|NZ_CP040094.1_1114801_1115347_+	PRK00131, aroK, shikimate kinase; Reviewed	NA|201aa|down_2|NZ_CP040094.1_1115358_1115961_-	pfam05685, Uma2, Putative restriction endonuclease	NA|121aa|down_3|NZ_CP040094.1_1116129_1116492_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|299aa|down_4|NZ_CP040094.1_1116592_1117489_+	cd04250, AAK_NAGK-C, AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis	NA|141aa|down_5|NZ_CP040094.1_1117592_1118015_-	NA	NA|181aa|down_6|NZ_CP040094.1_1118184_1118727_+	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO	NA|295aa|down_7|NZ_CP040094.1_1118753_1119638_+	pfam11353, DUF3153, Protein of unknown function (DUF3153)	NA|109aa|down_8|NZ_CP040094.1_1119772_1120099_-	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|206aa|down_9|NZ_CP040094.1_1120247_1120865_+	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	5	1386988-1387192	3	PILER-CR	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	TAATCTTAGCCTAATAGCAAGTACTGTCTCTGGCAATACGGCAAACTA	48	0	0	NA	NA	N:A	2	2	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|49aa|up_6|NZ_CP040094.1_1376250_1376397_+,NA|220aa|up_1|NZ_CP040094.1_1383423_1384083_+,NA	NA|616aa|up_9|NZ_CP040094.1_1369373_1371221_+	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|933aa|up_8|NZ_CP040094.1_1371604_1374403_+	COG3280, TreY, Maltooligosyl trehalose synthase [Carbohydrate transport and metabolism]	NA|506aa|up_7|NZ_CP040094.1_1374673_1376191_+	COG1626, TreA, Neutral trehalase [Carbohydrate transport and metabolism]	NA|49aa|up_6|NZ_CP040094.1_1376250_1376397_+	NA	NA|110aa|up_5|NZ_CP040094.1_1376470_1376800_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|181aa|up_4|NZ_CP040094.1_1377060_1377603_-	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	NA|519aa|up_3|NZ_CP040094.1_1378186_1379743_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1091aa|up_2|NZ_CP040094.1_1379796_1383069_+	pfam00873, ACR_tran, AcrB/AcrD/AcrF family	NA|220aa|up_1|NZ_CP040094.1_1383423_1384083_+	NA	NA|349aa|up_0|NZ_CP040094.1_1384247_1385294_-	cd03811, GT4_GT28_WabH-like, family 4 and family 28 glycosyltransferases similar to Klebsiella WabH	NA|315aa|down_0|NZ_CP040094.1_1390867_1391812_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|121aa|down_1|NZ_CP040094.1_1391971_1392334_+	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|74aa|down_2|NZ_CP040094.1_1392823_1393045_+	PRK07597, secE, preprotein translocase subunit SecE; Reviewed	NA|214aa|down_3|NZ_CP040094.1_1393044_1393686_+	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|142aa|down_4|NZ_CP040094.1_1393692_1394118_+	PRK00140, rplK, 50S ribosomal protein L11; Validated	NA|238aa|down_5|NZ_CP040094.1_1394250_1394964_+	CHL00129, rpl1, ribosomal protein L1; Reviewed	NA|205aa|down_6|NZ_CP040094.1_1395298_1395913_+	PRK00099, rplJ, 50S ribosomal protein L10; Reviewed	NA|131aa|down_7|NZ_CP040094.1_1396005_1396398_+	CHL00083, rpl12, ribosomal protein L12	NA|110aa|down_8|NZ_CP040094.1_1396636_1396966_-	pfam11189, DUF2973, Protein of unknown function (DUF2973)	NA|106aa|down_9|NZ_CP040094.1_1397266_1397584_-	pfam10792, DUF2605, Protein of unknown function (DUF2605)
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	6	1738066-1738158	5	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	ATTGCCTGAAGAAGAAAATGAATTGCAAATGAA	33	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|77aa|up_6|NZ_CP040094.1_1729298_1729529_+,NA|88aa|up_5|NZ_CP040094.1_1729525_1729789_+,NA|185aa|down_6|NZ_CP040094.1_1750136_1750691_-	NA|186aa|up_9|NZ_CP040094.1_1725370_1725928_+	pfam11322, DUF3124, Protein of unknown function (DUF3124)	NA|518aa|up_8|NZ_CP040094.1_1726182_1727736_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|244aa|up_7|NZ_CP040094.1_1727980_1728712_-	PRK00024, PRK00024, DNA repair protein RadC	NA|77aa|up_6|NZ_CP040094.1_1729298_1729529_+	NA	NA|88aa|up_5|NZ_CP040094.1_1729525_1729789_+	NA	NA|313aa|up_4|NZ_CP040094.1_1729831_1730770_-	PRK07429, PRK07429, phosphoribulokinase; Provisional	NA|664aa|up_3|NZ_CP040094.1_1731208_1733200_-	cd11350, AmyAc_4, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|221aa|up_2|NZ_CP040094.1_1733332_1733995_+	pfam01596, Methyltransf_3, O-methyltransferase	NA|462aa|up_1|NZ_CP040094.1_1734216_1735602_+	pfam14516, AAA_35, AAA-like domain	NA|521aa|up_0|NZ_CP040094.1_1735735_1737298_+	pfam14516, AAA_35, AAA-like domain	NA|720aa|down_0|NZ_CP040094.1_1739811_1741971_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|754aa|down_1|NZ_CP040094.1_1742189_1744451_+	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|471aa|down_2|NZ_CP040094.1_1744758_1746171_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|221aa|down_3|NZ_CP040094.1_1746287_1746950_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|424aa|down_4|NZ_CP040094.1_1747017_1748289_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|526aa|down_5|NZ_CP040094.1_1748458_1750036_-	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|185aa|down_6|NZ_CP040094.1_1750136_1750691_-	NA	NA|213aa|down_7|NZ_CP040094.1_1750959_1751598_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|296aa|down_8|NZ_CP040094.1_1751743_1752631_+	sd00006, TPR, Tetratricopeptide repeat	NA|135aa|down_9|NZ_CP040094.1_1752863_1753268_-	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	7	1933822-1933929	6	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	AACAGCTTGAGCATTAGCGATCGC	24	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|57aa|up_6|NZ_CP040094.1_1922575_1922746_-,NA|127aa|up_1|NZ_CP040094.1_1929385_1929766_+,NA|303aa|down_1|NZ_CP040094.1_1935975_1936884_-,NA|124aa|down_3|NZ_CP040094.1_1938324_1938696_-,NA|51aa|down_7|NZ_CP040094.1_1942208_1942361_+,NA|227aa|down_8|NZ_CP040094.1_1942715_1943396_-,NA|255aa|down_9|NZ_CP040094.1_1943573_1944338_-	NA|261aa|up_9|NZ_CP040094.1_1918911_1919694_-	pfam14218, COP23, Circadian oscillating protein COP23	NA|544aa|up_8|NZ_CP040094.1_1919774_1921406_-	cd07333, M48C_bepA_like, Peptidase M48C Ste24p bepA-like, integral membrane protein	NA|242aa|up_7|NZ_CP040094.1_1921672_1922398_-	pfam05685, Uma2, Putative restriction endonuclease	NA|57aa|up_6|NZ_CP040094.1_1922575_1922746_-	NA	NA|349aa|up_5|NZ_CP040094.1_1923259_1924306_-	COG3367, COG3367, Uncharacterized conserved protein [Function unknown]	NA|351aa|up_4|NZ_CP040094.1_1924289_1925342_-	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|183aa|up_3|NZ_CP040094.1_1925464_1926013_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|1029aa|up_2|NZ_CP040094.1_1926293_1929380_-	PRK10614, PRK10614, multidrug efflux system subunit MdtC; Provisional	NA|127aa|up_1|NZ_CP040094.1_1929385_1929766_+	NA	NA|1041aa|up_0|NZ_CP040094.1_1929955_1933078_-	PRK10503, PRK10503, MdtB/MuxB family multidrug efflux RND transporter permease subunit	NA|324aa|down_0|NZ_CP040094.1_1934875_1935847_-	TIGR02997, RNA_polymerase_sigma_subunit_sigma70/sigma32, RNA polymerase sigma factor, cyanobacterial RpoD-like family	NA|303aa|down_1|NZ_CP040094.1_1935975_1936884_-	NA	NA|427aa|down_2|NZ_CP040094.1_1936929_1938210_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|124aa|down_3|NZ_CP040094.1_1938324_1938696_-	NA	NA|420aa|down_4|NZ_CP040094.1_1938970_1940230_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|216aa|down_5|NZ_CP040094.1_1940233_1940881_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|125aa|down_6|NZ_CP040094.1_1941454_1941829_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|51aa|down_7|NZ_CP040094.1_1942208_1942361_+	NA	NA|227aa|down_8|NZ_CP040094.1_1942715_1943396_-	NA	NA|255aa|down_9|NZ_CP040094.1_1943573_1944338_-	NA
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	8	1978343-1978741	7,4,3	CRISPRCasFinder,PILER-CR,CRT	no	cas4,cas6,cas3	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Unclear	TGTTTCAGTCCCCGTGAGGGGATTTGGTTAATGGAAAC,TGTTTCAGTCCCCGTGAGGGGATTTGGTTAATGGAAAC,GTTTCAGTCCCCNTGNGGGGNNNTNG	38,38,26	0	0	NA	NA	N:A	4,2,5	5	Unclear	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|104aa|up_8|NZ_CP040094.1_1964523_1964835_+,NA|109aa|up_6|NZ_CP040094.1_1968334_1968661_-,NA|94aa|up_5|NZ_CP040094.1_1968709_1968991_-,NA|63aa|up_3|NZ_CP040094.1_1970296_1970485_-,NA|153aa|up_1|NZ_CP040094.1_1975085_1975544_-,NA|88aa|up_0|NZ_CP040094.1_1978045_1978309_-,NA	NA|92aa|up_9|NZ_CP040094.1_1963569_1963845_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|104aa|up_8|NZ_CP040094.1_1964523_1964835_+	NA	NA|341aa|up_7|NZ_CP040094.1_1966388_1967411_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|109aa|up_6|NZ_CP040094.1_1968334_1968661_-	NA	NA|94aa|up_5|NZ_CP040094.1_1968709_1968991_-	NA	NA|137aa|up_4|NZ_CP040094.1_1969448_1969859_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|63aa|up_3|NZ_CP040094.1_1970296_1970485_-	NA	NA|443aa|up_2|NZ_CP040094.1_1973533_1974862_+	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|153aa|up_1|NZ_CP040094.1_1975085_1975544_-	NA	NA|88aa|up_0|NZ_CP040094.1_1978045_1978309_-	NA	cas4|200aa|down_0|NZ_CP040094.1_1979260_1979860_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|343aa|down_1|NZ_CP040094.1_1979967_1980996_-	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	cas3|155aa|down_2|NZ_CP040094.1_1984793_1985258_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|57aa|down_3|NZ_CP040094.1_1985254_1985425_-	COG1201, Lhr, Lhr-like helicases [General function prediction only]	NA|371aa|down_4|NZ_CP040094.1_1985688_1986801_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|2317aa|down_5|NZ_CP040094.1_1988621_1995572_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|210aa|down_6|NZ_CP040094.1_1995555_1996185_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|198aa|down_7|NZ_CP040094.1_1996430_1997024_+	COG1651, DsbG, Protein-disulfide isomerase [Posttranslational modification, protein turnover, chaperones]	NA|605aa|down_8|NZ_CP040094.1_1998367_2000182_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|210aa|down_9|NZ_CP040094.1_2000190_2000820_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	9	2096244-2098018	5,8,4	PILER-CR,CRISPRCasFinder,CRT	no	RT,cas1,cas2,csx3,WYL,csm6	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Type III-A	CCCTACCGATTGGGTTAAATCGGATTAGTTGTAAAC,CCCTACCGATTGGGTTAAATCGGATTAGTTGTAAAC,CCCTACCGATTGGGTTAAATCGGATTAGTTGTAAAC	36,36,36	0	0	NA	NA	N:A	22,24,24	24	TypeIII-A	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|66aa|up_2|NZ_CP040094.1_2094398_2094596_+,NA|126aa|down_2|NZ_CP040094.1_2099919_2100297_+,NA|243aa|down_9|NZ_CP040094.1_2106525_2107254_+	NA|120aa|up_9|NZ_CP040094.1_2087961_2088321_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|184aa|up_8|NZ_CP040094.1_2088468_2089020_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|533aa|up_7|NZ_CP040094.1_2089162_2090761_+	COG1928, PMT1, Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones]	NA|212aa|up_6|NZ_CP040094.1_2090896_2091532_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|83aa|up_5|NZ_CP040094.1_2091856_2092105_-	TIGR02436, S23_ribosomal_protein, four helix bundle protein	RT|257aa|up_4|NZ_CP040094.1_2092713_2093484_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|272aa|up_3|NZ_CP040094.1_2093492_2094307_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|66aa|up_2|NZ_CP040094.1_2094398_2094596_+	NA	cas1|335aa|up_1|NZ_CP040094.1_2094745_2095750_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|92aa|up_0|NZ_CP040094.1_2095750_2096026_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|323aa|down_0|NZ_CP040094.1_2098078_2099047_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|191aa|down_1|NZ_CP040094.1_2099303_2099876_+	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|126aa|down_2|NZ_CP040094.1_2099919_2100297_+	NA	NA|74aa|down_3|NZ_CP040094.1_2100341_2100563_+	pfam14453, ThiS-like, ThiS-like ubiquitin	NA|162aa|down_4|NZ_CP040094.1_2100604_2101090_+	pfam14462, Prok-E2_E, Prokaryotic E2 family E	NA|445aa|down_5|NZ_CP040094.1_2101325_2102660_+	cd01483, E1_enzyme_family, Superfamily of activating enzymes (E1) of the ubiquitin-like proteins	csx3|410aa|down_6|NZ_CP040094.1_2102830_2104060_-	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	WYL|521aa|down_7|NZ_CP040094.1_2104187_2105750_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|194aa|down_8|NZ_CP040094.1_2105779_2106361_+	pfam13328, HD_4, HD domain	NA|243aa|down_9|NZ_CP040094.1_2106525_2107254_+	NA
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	10	2110528-2112181	6,9,5,7	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Type III-D,Type III-C,Type III-A,Type III-B	CCCTACCGATTGGGTTAAATCGGATTAGTTGGAAAC,CCCTACCGATTGGGTTAAATCGGATTAGTTGGAAAC,CCCTACCGATTGGGTTAAATCGGATTAGTTGGAAAC,CCCTACCGATTGGGTTAAATCGGATTAGTTGGAAAC	36,36,36,36	0	0	NA	NA	N:A	21,22,22,21	22	TypeIII-D,TypeIII-C,TypeIII-A,TypeIII-B	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|243aa|up_5|NZ_CP040094.1_2106525_2107254_+,NA|145aa|up_4|NZ_CP040094.1_2107304_2107739_+,NA|111aa|up_3|NZ_CP040094.1_2107748_2108081_+,NA|193aa|up_2|NZ_CP040094.1_2108055_2108634_+,NA|80aa|down_1|NZ_CP040094.1_2113523_2113763_-,NA|55aa|down_4|NZ_CP040094.1_2115631_2115796_-,cmr5gr11|143aa|down_7|NZ_CP040094.1_2118115_2118544_-	NA|445aa|up_9|NZ_CP040094.1_2101325_2102660_+	cd01483, E1_enzyme_family, Superfamily of activating enzymes (E1) of the ubiquitin-like proteins	csx3|410aa|up_8|NZ_CP040094.1_2102830_2104060_-	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	WYL|521aa|up_7|NZ_CP040094.1_2104187_2105750_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|194aa|up_6|NZ_CP040094.1_2105779_2106361_+	pfam13328, HD_4, HD domain	NA|243aa|up_5|NZ_CP040094.1_2106525_2107254_+	NA	NA|145aa|up_4|NZ_CP040094.1_2107304_2107739_+	NA	NA|111aa|up_3|NZ_CP040094.1_2107748_2108081_+	NA	NA|193aa|up_2|NZ_CP040094.1_2108055_2108634_+	NA	NA|83aa|up_1|NZ_CP040094.1_2109133_2109382_+	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|236aa|up_0|NZ_CP040094.1_2109513_2110221_+	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csm6|378aa|down_0|NZ_CP040094.1_2112278_2113412_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	NA|80aa|down_1|NZ_CP040094.1_2113523_2113763_-	NA	NA|341aa|down_2|NZ_CP040094.1_2113855_2114878_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|83aa|down_3|NZ_CP040094.1_2115209_2115458_+	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|55aa|down_4|NZ_CP040094.1_2115631_2115796_-	NA	NA|158aa|down_5|NZ_CP040094.1_2115834_2116308_-	pfam03703, bPH_2, Bacterial PH domain	NA|559aa|down_6|NZ_CP040094.1_2116432_2118109_-	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cmr5gr11|143aa|down_7|NZ_CP040094.1_2118115_2118544_-	NA	cmr4gr7|282aa|down_8|NZ_CP040094.1_2118543_2119389_-	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr3gr5|378aa|down_9|NZ_CP040094.1_2119577_2120711_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	11	2219953-2220055	10	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GGGACTGGGGACTGGGGACTGGG	23	2	149	2219976-2219992|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032|2220016-2220032	NZ_CP040094.1_4722326-4722310|NZ_CP040094.1_366217-366201|NZ_CP040094.1_366224-366208|NZ_CP040094.1_366231-366215|NZ_CP040094.1_379964-379948|NZ_CP040094.1_488983-488967|NZ_CP040094.1_559811-559795|NZ_CP040094.1_751668-751652|NZ_CP040094.1_751675-751659|NZ_CP040094.1_812793-812809|NZ_CP040094.1_1024667-1024651|NZ_CP040094.1_1024674-1024658|NZ_CP040094.1_1024681-1024665|NZ_CP040094.1_1024688-1024672|NZ_CP040094.1_1024695-1024679|NZ_CP040094.1_1024702-1024686|NZ_CP040094.1_1024709-1024693|NZ_CP040094.1_1041525-1041509|NZ_CP040094.1_1360838-1360822|NZ_CP040094.1_1394970-1394986|NZ_CP040094.1_1394977-1394993|NZ_CP040094.1_1394984-1395000|NZ_CP040094.1_2119555-2119539|NZ_CP040094.1_2220043-2220059|NZ_CP040094.1_2298097-2298081|NZ_CP040094.1_2298104-2298088|NZ_CP040094.1_2548940-2548924|NZ_CP040094.1_2647994-2648010|NZ_CP040094.1_2648001-2648017|NZ_CP040094.1_2651362-2651378|NZ_CP040094.1_2651369-2651385|NZ_CP040094.1_2651376-2651392|NZ_CP040094.1_3500148-3500132|NZ_CP040094.1_3849572-3849556|NZ_CP040094.1_3849579-3849563|NZ_CP040094.1_3849586-3849570|NZ_CP040094.1_3849593-3849577|NZ_CP040094.1_4058755-4058739|NZ_CP040094.1_4343403-4343387|NZ_CP040094.1_4343410-4343394|NZ_CP040094.1_4630510-4630494|NZ_CP040094.1_4630517-4630501|NZ_CP040094.1_4722313-4722297|NZ_CP040094.1_5310373-5310357|NZ_CP040094.1_5331020-5331036|NZ_CP040094.1_5331027-5331043|NZ_CP040094.1_5387568-5387584|NZ_CP040094.1_5399949-5399933|NZ_CP040094.1_5595757-5595773|NZ_CP040094.1_5595764-5595780|NZ_CP040094.1_5764269-5764285|NZ_CP040094.1_6068030-6068046|NZ_CP040094.1_6300826-6300810|NZ_CP040094.1_6363111-6363127|NZ_CP040094.1_6587161-6587145|NZ_CP040094.1_6661391-6661407|NZ_CP040094.1_6661398-6661414|NZ_CP040094.1_6935016-6935000|NZ_CP040094.1_6935023-6935007|NZ_CP040094.1_7299164-7299180|NZ_CP040094.1_7299171-7299187|NZ_CP040094.1_7299178-7299194|NZ_CP040094.1_7299185-7299201|NZ_CP040094.1_7299192-7299208|NZ_CP040094.1_7299199-7299215|NZ_CP040094.1_7299206-7299222|NZ_CP040094.1_7309063-7309047|NZ_CP040094.1_7323360-7323376|NZ_CP040094.1_7323367-7323383|NZ_CP040094.1_7413557-7413573|NZ_CP040094.1_7413564-7413580|NZ_CP040094.1_7689276-7689292|NZ_CP040094.1_7689283-7689299|NZ_CP040094.1_7689290-7689306|NZ_CP040094.1_7709925-7709941|NZ_CP040094.1_27424-27440|NZ_CP040094.1_90858-90842|NZ_CP040094.1_134374-134390|NZ_CP040094.1_134388-134404|NZ_CP040094.1_188605-188589|NZ_CP040094.1_200774-200758|NZ_CP040094.1_366238-366222|NZ_CP040094.1_451948-451932|NZ_CP040094.1_488969-488953|NZ_CP040094.1_488976-488960|NZ_CP040094.1_812800-812816|NZ_CP040094.1_1024716-1024700|NZ_CP040094.1_1024723-1024707|NZ_CP040094.1_1041532-1041516|NZ_CP040094.1_1128347-1128363|NZ_CP040094.1_1150882-1150866|NZ_CP040094.1_1150889-1150873|NZ_CP040094.1_1360831-1360815|NZ_CP040094.1_1384105-1384121|NZ_CP040094.1_1394991-1395007|NZ_CP040094.1_2119527-2119511|NZ_CP040094.1_2119534-2119518|NZ_CP040094.1_2119548-2119532|NZ_CP040094.1_2298111-2298095|NZ_CP040094.1_2298118-2298102|NZ_CP040094.1_2405978-2405994|NZ_CP040094.1_2548933-2548917|NZ_CP040094.1_2548947-2548931|NZ_CP040094.1_2651404-2651420|NZ_CP040094.1_2919328-2919344|NZ_CP040094.1_3119037-3119021|NZ_CP040094.1_3214602-3214586|NZ_CP040094.1_3214630-3214614|NZ_CP040094.1_3310791-3310807|NZ_CP040094.1_3519373-3519389|NZ_CP040094.1_3799641-3799657|NZ_CP040094.1_3799648-3799664|NZ_CP040094.1_4335417-4335401|NZ_CP040094.1_4335424-4335408|NZ_CP040094.1_4362958-4362942|NZ_CP040094.1_4722346-4722330|NZ_CP040094.1_5100812-5100828|NZ_CP040094.1_5100826-5100842|NZ_CP040094.1_5100833-5100849|NZ_CP040094.1_5100840-5100856|NZ_CP040094.1_5331013-5331029|NZ_CP040094.1_5387554-5387570|NZ_CP040094.1_5387561-5387577|NZ_CP040094.1_5387575-5387591|NZ_CP040094.1_5387582-5387598|NZ_CP040094.1_5389346-5389362|NZ_CP040094.1_5396705-5396689|NZ_CP040094.1_5396712-5396696|NZ_CP040094.1_5396719-5396703|NZ_CP040094.1_5396726-5396710|NZ_CP040094.1_5397994-5397978|NZ_CP040094.1_6068037-6068053|NZ_CP040094.1_6107531-6107515|NZ_CP040094.1_6366294-6366278|NZ_CP040094.1_6587154-6587138|NZ_CP040094.1_6628854-6628838|NZ_CP040094.1_6653845-6653861|NZ_CP040094.1_6653852-6653868|NZ_CP040094.1_6839600-6839584|NZ_CP040094.1_6911615-6911599|NZ_CP040094.1_6913917-6913933|NZ_CP040094.1_6935009-6934993|NZ_CP040094.1_7076104-7076088|NZ_CP040094.1_7261881-7261865|NZ_CP040094.1_7323374-7323390|NZ_CP040094.1_7413550-7413566|NZ_CP040094.1_7632472-7632488|NZ_CP040094.1_7808698-7808682|NZ_CP040094.1_7815851-7815867	N:A	2	2	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA,NA|70aa|down_2|NZ_CP040094.1_2221618_2221828_+,NA|168aa|down_7|NZ_CP040094.1_2225496_2226000_+,NA|381aa|down_8|NZ_CP040094.1_2226670_2227813_+	NA|162aa|up_9|NZ_CP040094.1_2201165_2201651_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|343aa|up_8|NZ_CP040094.1_2201634_2202663_+	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)	NA|236aa|up_7|NZ_CP040094.1_2202659_2203367_+	pfam08889, WbqC, WbqC-like protein family	NA|230aa|up_6|NZ_CP040094.1_2203543_2204233_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|231aa|up_5|NZ_CP040094.1_2204238_2204931_+	COG2120, COG2120, Uncharacterized proteins, LmbE homologs [Function unknown]	NA|567aa|up_4|NZ_CP040094.1_2210388_2212089_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|737aa|up_3|NZ_CP040094.1_2212202_2214413_-	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|185aa|up_2|NZ_CP040094.1_2214531_2215086_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|181aa|up_1|NZ_CP040094.1_2215732_2216275_-	COG4627, COG4627, Uncharacterized protein conserved in bacteria [Function unknown]	NA|444aa|up_0|NZ_CP040094.1_2216697_2218029_-	pfam13847, Methyltransf_31, Methyltransferase domain	NA|135aa|down_0|NZ_CP040094.1_2220111_2220516_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|299aa|down_1|NZ_CP040094.1_2220646_2221543_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|70aa|down_2|NZ_CP040094.1_2221618_2221828_+	NA	NA|299aa|down_3|NZ_CP040094.1_2221944_2222841_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|248aa|down_4|NZ_CP040094.1_2223105_2223849_+	pfam13676, TIR_2, TIR domain	NA|108aa|down_5|NZ_CP040094.1_2223878_2224202_+	pfam13676, TIR_2, TIR domain	NA|214aa|down_6|NZ_CP040094.1_2224232_2224874_+	sd00006, TPR, Tetratricopeptide repeat	NA|168aa|down_7|NZ_CP040094.1_2225496_2226000_+	NA	NA|381aa|down_8|NZ_CP040094.1_2226670_2227813_+	NA	NA|419aa|down_9|NZ_CP040094.1_2227922_2229179_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	12	2397888-2398023	11	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	CTTTGCTGCAACGGCTTATACCTTTGCAAGAAATGCTTAAGCGTTTCTTATA	52	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|49aa|up_1|NZ_CP040094.1_2396827_2396974_+,NA|142aa|up_0|NZ_CP040094.1_2397209_2397635_-,NA|126aa|down_7|NZ_CP040094.1_2407372_2407750_-,NA|356aa|down_8|NZ_CP040094.1_2408251_2409318_-	NA|1798aa|up_9|NZ_CP040094.1_2376888_2382282_+	TIGR02813, omega-3_polyunsaturated_fatty_acid_synthase_PfaA, polyketide-type polyunsaturated fatty acid synthase PfaA	NA|576aa|up_8|NZ_CP040094.1_2382519_2384247_+	TIGR02813, omega-3_polyunsaturated_fatty_acid_synthase_PfaA, polyketide-type polyunsaturated fatty acid synthase PfaA	NA|1584aa|up_7|NZ_CP040094.1_2384491_2389243_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|559aa|up_6|NZ_CP040094.1_2389449_2391126_+	cd04742, NPD_FabD, 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase  FabD	NA|505aa|up_5|NZ_CP040094.1_2391128_2392643_+	COG3320, COG3320, Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|268aa|up_4|NZ_CP040094.1_2393212_2394016_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|240aa|up_3|NZ_CP040094.1_2394068_2394788_-	COG2091, Sfp, Phosphopantetheinyl transferase [Coenzyme metabolism]	NA|395aa|up_2|NZ_CP040094.1_2395241_2396426_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|49aa|up_1|NZ_CP040094.1_2396827_2396974_+	NA	NA|142aa|up_0|NZ_CP040094.1_2397209_2397635_-	NA	NA|388aa|down_0|NZ_CP040094.1_2398157_2399321_+	PRK05643, PRK05643, DNA polymerase III subunit beta; Validated	NA|529aa|down_1|NZ_CP040094.1_2400057_2401644_+	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|347aa|down_2|NZ_CP040094.1_2401878_2402919_+	pfam11199, DUF2891, Protein of unknown function (DUF2891)	NA|48aa|down_3|NZ_CP040094.1_2402908_2403052_-	PRK13529, PRK13529, oxaloacetate-decarboxylating malate dehydrogenase	NA|348aa|down_4|NZ_CP040094.1_2403169_2404213_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|477aa|down_5|NZ_CP040094.1_2404454_2405885_+	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]	NA|335aa|down_6|NZ_CP040094.1_2406175_2407180_+	PRK12309, PRK12309, transaldolase	NA|126aa|down_7|NZ_CP040094.1_2407372_2407750_-	NA	NA|356aa|down_8|NZ_CP040094.1_2408251_2409318_-	NA	NA|527aa|down_9|NZ_CP040094.1_2409726_2411307_-	PRK00302, lnt, apolipoprotein N-acyltransferase; Reviewed
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	13	2676258-2676659	6,12,8	CRT,CRISPRCasFinder,PILER-CR	no	c2c5_V-U5,RT	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Unclear	CTTTCAACCCGNCTCTAGCTGNNAGGGGTNTTGAAAC,ACTTTCAACCCGCCTCTAGCTGGGAGGGGTGTTGAAAC,ACTTTCAACCCGCCTCTAGCTGGGAGGGGTGTTGAAAC	37,38,38	0	0	NA	NA	N:A	5,4,3	5	Unclear	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|72aa|up_9|NZ_CP040094.1_2661912_2662128_-,NA|106aa|up_8|NZ_CP040094.1_2662450_2662768_-,NA|96aa|up_5|NZ_CP040094.1_2667613_2667901_-,NA|172aa|up_3|NZ_CP040094.1_2670372_2670888_+,NA|188aa|up_1|NZ_CP040094.1_2672547_2673111_+,c2c5_V-U5|646aa|down_0|NZ_CP040094.1_2677119_2679057_-	NA|72aa|up_9|NZ_CP040094.1_2661912_2662128_-	NA	NA|106aa|up_8|NZ_CP040094.1_2662450_2662768_-	NA	NA|723aa|up_7|NZ_CP040094.1_2662798_2664967_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|503aa|up_6|NZ_CP040094.1_2665046_2666555_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|96aa|up_5|NZ_CP040094.1_2667613_2667901_-	NA	NA|491aa|up_4|NZ_CP040094.1_2668432_2669905_+	TIGR00387, Glycolate_oxidase_subunit_glcD	NA|172aa|up_3|NZ_CP040094.1_2670372_2670888_+	NA	NA|416aa|up_2|NZ_CP040094.1_2671300_2672548_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|188aa|up_1|NZ_CP040094.1_2672547_2673111_+	NA	NA|749aa|up_0|NZ_CP040094.1_2673465_2675712_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	c2c5_V-U5|646aa|down_0|NZ_CP040094.1_2677119_2679057_-	NA	NA|142aa|down_1|NZ_CP040094.1_2679134_2679560_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|512aa|down_2|NZ_CP040094.1_2679650_2681186_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|355aa|down_3|NZ_CP040094.1_2681185_2682250_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|459aa|down_4|NZ_CP040094.1_2682242_2683619_+	cd17261, RMtype1_S_EcoKI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli str	NA|208aa|down_5|NZ_CP040094.1_2683664_2684288_+	pfam14487, DUF4433, Domain of unknown function (DUF4433)	NA|355aa|down_6|NZ_CP040094.1_2684342_2685407_+	cd02901, Macro_Poa1p-like, macrodomain, Poa1p-like family	RT|557aa|down_7|NZ_CP040094.1_2685384_2687055_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|1061aa|down_8|NZ_CP040094.1_2687316_2690499_+	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|233aa|down_9|NZ_CP040094.1_2690495_2691194_+	pfam01863, DUF45, Protein of unknown function DUF45
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	14	2848718-2849046	9,13,7	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Type V-U5	CTTTCAACCCGCCTCTAGCTGGGAGGGGTGTTGAAAC,GTTTCAACACCCCTCCCAGCTAGAGGCGGGTTGAAAG,GTTTCAACACCCCTCCCAGCTAGAGGCGGGTTGAAAG	37,37,37	0	0	NA	NA	N:A	4,4,4	4	TypeV-U5	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	c2c5_V-U5|637aa|up_0|NZ_CP040094.1_2846346_2848257_+,NA|53aa|down_4|NZ_CP040094.1_2855794_2855953_+	NA|761aa|up_9|NZ_CP040094.1_2828645_2830928_+	COG3408, GDB1, Glycogen debranching enzyme [Carbohydrate transport and metabolism]	NA|890aa|up_8|NZ_CP040094.1_2830990_2833660_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|848aa|up_7|NZ_CP040094.1_2833983_2836527_+	cd02077, P-type_ATPase_Mg, magnesium transporting ATPase (MgtA), similar to Escherichia coli MgtA and Salmonella typhimurium MgtA	NA|279aa|up_6|NZ_CP040094.1_2838609_2839446_+	COG2842, COG2842, Uncharacterized ATPase, putative transposase [General function prediction only]	NA|495aa|up_5|NZ_CP040094.1_2840006_2841491_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|279aa|up_4|NZ_CP040094.1_2841502_2842339_-	TIGR02646, Hypothetical_protein_SMc04429, TIGR02646 family protein	NA|558aa|up_3|NZ_CP040094.1_2842347_2844021_-	TIGR04435, ABC_transporter, restriction system-associated AAA family ATPase	NA|590aa|up_2|NZ_CP040094.1_2844023_2845793_-	cd17517, RMtype1_S_EcoKI_StySPI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR),similar to Escherichia coli str	NA|142aa|up_1|NZ_CP040094.1_2845852_2846278_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	c2c5_V-U5|637aa|up_0|NZ_CP040094.1_2846346_2848257_+	NA	NA|402aa|down_0|NZ_CP040094.1_2850711_2851917_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|540aa|down_1|NZ_CP040094.1_2852746_2854366_+	COG0063, COG0063, Predicted sugar kinase [Carbohydrate transport and metabolism]	NA|138aa|down_2|NZ_CP040094.1_2854403_2854817_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|294aa|down_3|NZ_CP040094.1_2854877_2855759_+	PRK09563, rbgA, GTPase YlqF; Reviewed	NA|53aa|down_4|NZ_CP040094.1_2855794_2855953_+	NA	NA|210aa|down_5|NZ_CP040094.1_2856280_2856910_+	cd19368, TenA_C_AtTH2-like, TenA_C family similar to the N-terminal TenA_C domain of Arabidopsis thaliana thiamine requiring 2	NA|301aa|down_6|NZ_CP040094.1_2857078_2857981_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|279aa|down_7|NZ_CP040094.1_2858235_2859072_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|211aa|down_8|NZ_CP040094.1_2859068_2859701_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|545aa|down_9|NZ_CP040094.1_2859938_2861573_+	cd03085, PGM1, Phosphoglucomutase 1 (PGM1) catalyzes the bidirectional interconversion of glucose-1-phosphate (G-1-P) and glucose-6-phosphate (G-6-P) via a glucose 1,6-diphosphate intermediate, an important metabolic step in prokaryotes and eukaryotes
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	15	3153553-3154958	14,8,10	CRISPRCasFinder,CRT,PILER-CR	no	PD-DExK	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Unclear	GTTTCAATCCCTAATAGGGATTTTGATTAATTGCAAT,GTTTCAATCCCTAATANGGATTTTGATTAATTGCAAT,GTTTCAATCCCTAATATGGATTTTGATTAATTGCAAT	37,37,37	0	0	NA	NA	N:A	19,19,19	19	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|119aa|up_6|NZ_CP040094.1_3144730_3145087_-,NA|102aa|up_5|NZ_CP040094.1_3145386_3145692_-,NA|184aa|up_1|NZ_CP040094.1_3151048_3151600_+,NA|81aa|down_2|NZ_CP040094.1_3157348_3157591_-,NA|71aa|down_4|NZ_CP040094.1_3161971_3162184_+,NA|66aa|down_5|NZ_CP040094.1_3162230_3162428_+	NA|972aa|up_9|NZ_CP040094.1_3140769_3143685_+	PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated	NA|116aa|up_8|NZ_CP040094.1_3143681_3144029_+	pfam07883, Cupin_2, Cupin domain	NA|197aa|up_7|NZ_CP040094.1_3144032_3144623_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|119aa|up_6|NZ_CP040094.1_3144730_3145087_-	NA	NA|102aa|up_5|NZ_CP040094.1_3145386_3145692_-	NA	NA|742aa|up_4|NZ_CP040094.1_3145963_3148189_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|282aa|up_3|NZ_CP040094.1_3148194_3149040_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|427aa|up_2|NZ_CP040094.1_3149343_3150624_+	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|184aa|up_1|NZ_CP040094.1_3151048_3151600_+	NA	NA|529aa|up_0|NZ_CP040094.1_3151826_3153413_+	COG3540, PhoD, Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]	NA|277aa|down_0|NZ_CP040094.1_3155446_3156277_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|314aa|down_1|NZ_CP040094.1_3156287_3157229_-	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|81aa|down_2|NZ_CP040094.1_3157348_3157591_-	NA	NA|1039aa|down_3|NZ_CP040094.1_3158226_3161343_-	COG3641, PfoR, Predicted membrane protein, putative toxin regulator [General function prediction only]	NA|71aa|down_4|NZ_CP040094.1_3161971_3162184_+	NA	NA|66aa|down_5|NZ_CP040094.1_3162230_3162428_+	NA	NA|206aa|down_6|NZ_CP040094.1_3162941_3163559_-	pfam00582, Usp, Universal stress protein family	NA|279aa|down_7|NZ_CP040094.1_3163912_3164749_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|481aa|down_8|NZ_CP040094.1_3164849_3166292_-	cd13553, PBP2_NrtA_CpmA_like, Substrate binding domain of ABC-type nitrate/bicarbonate transporters, a member of the type 2 periplasmic binding fold superfamily	NA|284aa|down_9|NZ_CP040094.1_3166379_3167231_-	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	16	3216081-3216209	15	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GAAGCAGAAGTCACCTTCGTACAGCCACGTGATATTTTC	39	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|116aa|up_1|NZ_CP040094.1_3214131_3214479_+,NA|128aa|up_0|NZ_CP040094.1_3214639_3215023_+,NA|73aa|down_1|NZ_CP040094.1_3220118_3220337_+,NA|243aa|down_2|NZ_CP040094.1_3220697_3221426_+,NA|129aa|down_5|NZ_CP040094.1_3224065_3224452_-,NA|418aa|down_9|NZ_CP040094.1_3227742_3228996_-	NA|215aa|up_9|NZ_CP040094.1_3204954_3205599_-	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|713aa|up_8|NZ_CP040094.1_3205570_3207709_-	TIGR02136, ptsS_2, phosphate binding protein	NA|227aa|up_7|NZ_CP040094.1_3207820_3208501_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|374aa|up_6|NZ_CP040094.1_3208592_3209714_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|280aa|up_5|NZ_CP040094.1_3210384_3211224_+	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|554aa|up_4|NZ_CP040094.1_3211254_3212916_+	COG3497, COG3497, Phage tail sheath protein FI [General function prediction only]	NA|152aa|up_3|NZ_CP040094.1_3213063_3213519_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|171aa|up_2|NZ_CP040094.1_3213607_3214120_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|116aa|up_1|NZ_CP040094.1_3214131_3214479_+	NA	NA|128aa|up_0|NZ_CP040094.1_3214639_3215023_+	NA	NA|727aa|down_0|NZ_CP040094.1_3217315_3219496_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|73aa|down_1|NZ_CP040094.1_3220118_3220337_+	NA	NA|243aa|down_2|NZ_CP040094.1_3220697_3221426_+	NA	NA|259aa|down_3|NZ_CP040094.1_3221498_3222275_-	cd06445, ATase, The DNA repair protein O6-alkylguanine-DNA alkyltransferase (ATase; also known as AGT, AGAT and MGMT) reverses O6-alkylation DNA damage by transferring O6-alkyl adducts to an active site cysteine irreversibly, without inducing DNA strand breaks	NA|237aa|down_4|NZ_CP040094.1_3223054_3223765_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|129aa|down_5|NZ_CP040094.1_3224065_3224452_-	NA	NA|194aa|down_6|NZ_CP040094.1_3224706_3225288_+	cd17256, RMtype1_S_EcoJA65PI-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|489aa|down_7|NZ_CP040094.1_3225284_3226751_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|225aa|down_8|NZ_CP040094.1_3227033_3227708_+	pfam00395, SLH, S-layer homology domain	NA|418aa|down_9|NZ_CP040094.1_3227742_3228996_-	NA
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	17	3306110-3306209	16	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	ATGTCTACGACGGGCTACGCCTACGC	26	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|226aa|up_7|NZ_CP040094.1_3293847_3294525_-,NA|245aa|down_1|NZ_CP040094.1_3308865_3309600_+	NA|817aa|up_9|NZ_CP040094.1_3290000_3292451_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|306aa|up_8|NZ_CP040094.1_3292576_3293494_-	PRK15370, PRK15370, type III secretion system effector E3 ubiquitin transferase SlrP	NA|226aa|up_7|NZ_CP040094.1_3293847_3294525_-	NA	NA|160aa|up_6|NZ_CP040094.1_3294706_3295186_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|535aa|up_5|NZ_CP040094.1_3295287_3296892_-	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|174aa|up_4|NZ_CP040094.1_3296987_3297509_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|787aa|up_3|NZ_CP040094.1_3298249_3300610_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|454aa|up_2|NZ_CP040094.1_3300960_3302322_-	PRK06116, PRK06116, glutathione reductase; Validated	NA|775aa|up_1|NZ_CP040094.1_3302658_3304983_-	pfam00931, NB-ARC, NB-ARC domain	NA|284aa|up_0|NZ_CP040094.1_3305228_3306080_+	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|534aa|down_0|NZ_CP040094.1_3306415_3308017_-	pfam05731, TROVE, TROVE domain	NA|245aa|down_1|NZ_CP040094.1_3308865_3309600_+	NA	NA|377aa|down_2|NZ_CP040094.1_3309648_3310779_-	PRK05286, PRK05286, quinone-dependent dihydroorotate dehydrogenase	NA|974aa|down_3|NZ_CP040094.1_3311016_3313938_+	pfam01590, GAF, GAF domain	NA|65aa|down_4|NZ_CP040094.1_3314145_3314340_+	pfam10013, DUF2256, Uncharacterized protein conserved in bacteria (DUF2256)	NA|318aa|down_5|NZ_CP040094.1_3314339_3315293_+	cd14949, Asparaginase_2_like_3, Uncharacterized bacterial subfamily of the L-Asparaginase type 2-like enzymes, an Ntn-hydrolase family	NA|148aa|down_6|NZ_CP040094.1_3315283_3315727_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|460aa|down_7|NZ_CP040094.1_3315838_3317218_-	PRK14360, glmU, bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU	NA|410aa|down_8|NZ_CP040094.1_3317371_3318601_-	COG3437, COG3437, Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms]	NA|308aa|down_9|NZ_CP040094.1_3319305_3320229_-	COG4121, COG4121, Uncharacterized conserved protein [Function unknown]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	18	3813113-3813308	17	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	CTCCACCATCAAGAATGTCATTACCTGCCCCACC	34	0	0	NA	NA	N:A	2	2	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA,NA|70aa|down_0|NZ_CP040094.1_3819263_3819473_-	NA|478aa|up_9|NZ_CP040094.1_3800397_3801831_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|100aa|up_8|NZ_CP040094.1_3802287_3802587_+	pfam14243, DUF4343, Domain of unknown function (DUF4343)	NA|661aa|up_7|NZ_CP040094.1_3802617_3804600_-	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|166aa|up_6|NZ_CP040094.1_3804876_3805374_-	cd02215, cupin_QDO_N_C, quercetinase, N- and C-terminal cupin domains	NA|423aa|up_5|NZ_CP040094.1_3805752_3807021_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|188aa|up_4|NZ_CP040094.1_3807117_3807681_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|146aa|up_3|NZ_CP040094.1_3807766_3808204_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|348aa|up_2|NZ_CP040094.1_3808449_3809493_-	COG2008, GLY1, Threonine aldolase [Amino acid transport and metabolism]	NA|261aa|up_1|NZ_CP040094.1_3810117_3810900_+	COG1127, Ttg2A, ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|480aa|up_0|NZ_CP040094.1_3811021_3812461_+	PLN03094, PLN03094, Substrate binding subunit of ER-derived-lipid transporter; Provisional	NA|70aa|down_0|NZ_CP040094.1_3819263_3819473_-	NA	NA|437aa|down_1|NZ_CP040094.1_3819946_3821257_+	pfam11717, Tudor-knot, RNA binding activity-knot of a chromodomain	NA|373aa|down_2|NZ_CP040094.1_3821491_3822610_+	pfam01636, APH, Phosphotransferase enzyme family	NA|180aa|down_3|NZ_CP040094.1_3822606_3823146_+	cd09627, DOMON_murB_like, Domon-like domain of UDP-N-acetylenolpyruvoylglucosamine reductase	NA|292aa|down_4|NZ_CP040094.1_3823387_3824263_-	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|779aa|down_5|NZ_CP040094.1_3824459_3826796_-	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	NA|333aa|down_6|NZ_CP040094.1_3826884_3827883_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|357aa|down_7|NZ_CP040094.1_3828424_3829495_+	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|570aa|down_8|NZ_CP040094.1_3829627_3831337_+	PRK08275, PRK08275, putative oxidoreductase; Provisional	NA|212aa|down_9|NZ_CP040094.1_3831570_3832206_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	19	3853367-3853462	18	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	TAATATTATTTTTGTTAAATTACT	24	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|298aa|up_7|NZ_CP040094.1_3839148_3840042_+,NA|289aa|down_4|NZ_CP040094.1_3861354_3862221_+	NA|287aa|up_9|NZ_CP040094.1_3837709_3838570_+	COG0313, COG0313, Predicted methyltransferases [General function prediction only]	NA|93aa|up_8|NZ_CP040094.1_3838811_3839090_+	PRK10457, PRK10457, hypothetical protein; Provisional	NA|298aa|up_7|NZ_CP040094.1_3839148_3840042_+	NA	NA|1212aa|up_6|NZ_CP040094.1_3841033_3844669_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|339aa|up_5|NZ_CP040094.1_3844828_3845845_-	PRK02812, PRK02812, ribose-phosphate pyrophosphokinase; Provisional	NA|579aa|up_4|NZ_CP040094.1_3846704_3848441_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|227aa|up_3|NZ_CP040094.1_3848440_3849121_+	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|301aa|up_2|NZ_CP040094.1_3849699_3850602_+	pfam02673, BacA, Bacitracin resistance protein BacA	NA|406aa|up_1|NZ_CP040094.1_3850700_3851918_+	cd08021, M20_Acy1_YhaA-like, M20 Peptidase aminoacylase 1 subfamily, includes Bacillus subtilis YhaA and Staphylococcus aureus amidohydrolase, SACOL0085	NA|91aa|up_0|NZ_CP040094.1_3852406_3852679_-	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|350aa|down_0|NZ_CP040094.1_3854672_3855722_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|440aa|down_1|NZ_CP040094.1_3856026_3857346_+	PRK05769, PRK05769, acetyl ornithine aminotransferase family protein	NA|305aa|down_2|NZ_CP040094.1_3857548_3858463_+	cd16350, VOC_like, uncharacterized subfamily of the vicinal oxygen chelate (VOC) family	NA|665aa|down_3|NZ_CP040094.1_3858713_3860708_-	cd00707, Pancreat_lipase_like, Pancreatic lipase-like enzymes	NA|289aa|down_4|NZ_CP040094.1_3861354_3862221_+	NA	NA|370aa|down_5|NZ_CP040094.1_3862339_3863449_-	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases	NA|428aa|down_6|NZ_CP040094.1_3863806_3865090_+	PRK09440, avtA, valine--pyruvate transaminase; Provisional	NA|236aa|down_7|NZ_CP040094.1_3865211_3865919_+	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|794aa|down_8|NZ_CP040094.1_3866398_3868780_+	PRK05261, PRK05261, phosphoketolase	NA|554aa|down_9|NZ_CP040094.1_3869117_3870779_+	pfam07602, DUF1565, Protein of unknown function (DUF1565)
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	20	4150917-4151018	19	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	AGTTTGGCGGTAGGAAAGAAGGCTTTTTGAGCTG	34	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA,NA|151aa|down_5|NZ_CP040094.1_4155907_4156360_+	NA|294aa|up_9|NZ_CP040094.1_4135050_4135932_+	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|503aa|up_8|NZ_CP040094.1_4136189_4137698_-	COG0043, UbiD, 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases [Coenzyme metabolism]	NA|540aa|up_7|NZ_CP040094.1_4138353_4139973_-	sd00006, TPR, Tetratricopeptide repeat	NA|305aa|up_6|NZ_CP040094.1_4140588_4141503_+	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family	NA|632aa|up_5|NZ_CP040094.1_4142365_4144261_+	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|370aa|up_4|NZ_CP040094.1_4144424_4145534_-	pfam13191, AAA_16, AAA ATPase domain	NA|816aa|up_3|NZ_CP040094.1_4145537_4147985_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|522aa|up_2|NZ_CP040094.1_4147986_4149552_-	pfam05729, NACHT, NACHT domain	NA|199aa|up_1|NZ_CP040094.1_4149863_4150460_+	COG4445, MiaE, Hydroxylase for synthesis of 2-methylthio-cis-ribozeatin in tRNA [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|135aa|up_0|NZ_CP040094.1_4150478_4150883_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|451aa|down_0|NZ_CP040094.1_4151309_4152662_+	pfam05787, DUF839, Bacterial protein of unknown function (DUF839)	NA|164aa|down_1|NZ_CP040094.1_4152888_4153380_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|226aa|down_2|NZ_CP040094.1_4153460_4154138_+	COG1040, ComFC, Predicted amidophosphoribosyltransferases [General function prediction only]	NA|152aa|down_3|NZ_CP040094.1_4154225_4154681_+	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|258aa|down_4|NZ_CP040094.1_4154755_4155529_-	PRK00235, cobS, cobalamin synthase; Reviewed	NA|151aa|down_5|NZ_CP040094.1_4155907_4156360_+	NA	NA|382aa|down_6|NZ_CP040094.1_4156724_4157870_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|46aa|down_7|NZ_CP040094.1_4157968_4158106_+	CHL00047, psbK, photosystem II protein K	NA|99aa|down_8|NZ_CP040094.1_4158292_4158589_+	COG0633, Fdx, Ferredoxin [Energy production and conversion]	NA|39aa|down_9|NZ_CP040094.1_4158706_4158823_+	PRK04989, psbM, photosystem II reaction center protein M; Provisional
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	21	4692097-4692189	20	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	AAATGCGATCGCGTTAATCTTAA	23	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|81aa|up_6|NZ_CP040094.1_4680152_4680395_+,NA|131aa|up_1|NZ_CP040094.1_4686629_4687022_-,NA|85aa|down_2|NZ_CP040094.1_4692734_4692989_+,NA|72aa|down_8|NZ_CP040094.1_4698614_4698830_+	NA|575aa|up_9|NZ_CP040094.1_4675429_4677154_-	PRK15347, PRK15347, two component system sensor kinase	NA|145aa|up_8|NZ_CP040094.1_4678098_4678533_+	pfam14159, CAAD, CAAD domains of cyanobacterial aminoacyl-tRNA synthetase	NA|378aa|up_7|NZ_CP040094.1_4678768_4679902_-	pfam00395, SLH, S-layer homology domain	NA|81aa|up_6|NZ_CP040094.1_4680152_4680395_+	NA	NA|233aa|up_5|NZ_CP040094.1_4680494_4681193_-	COG3544, COG3544, Uncharacterized protein conserved in bacteria [Function unknown]	NA|739aa|up_4|NZ_CP040094.1_4681304_4683521_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|694aa|up_3|NZ_CP040094.1_4684019_4686101_-	pfam00350, Dynamin_N, Dynamin family	NA|107aa|up_2|NZ_CP040094.1_4686117_4686438_-	pfam11239, DUF3040, Protein of unknown function (DUF3040)	NA|131aa|up_1|NZ_CP040094.1_4686629_4687022_-	NA	NA|1511aa|up_0|NZ_CP040094.1_4687559_4692092_+	pfam12770, CHAT, CHAT domain	NA|70aa|down_0|NZ_CP040094.1_4692250_4692460_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|65aa|down_1|NZ_CP040094.1_4692427_4692622_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|85aa|down_2|NZ_CP040094.1_4692734_4692989_+	NA	NA|325aa|down_3|NZ_CP040094.1_4693232_4694207_+	PRK10717, PRK10717, cysteine synthase A; Provisional	NA|111aa|down_4|NZ_CP040094.1_4694266_4694599_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|307aa|down_5|NZ_CP040094.1_4694748_4695669_+	pfam17265, DUF5331, Family of unknown function (DUF5331)	NA|332aa|down_6|NZ_CP040094.1_4695674_4696670_-	COG0354, COG0354, Predicted aminomethyltransferase related to GcvT [General function prediction only]	NA|528aa|down_7|NZ_CP040094.1_4696877_4698461_-	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|72aa|down_8|NZ_CP040094.1_4698614_4698830_+	NA	NA|519aa|down_9|NZ_CP040094.1_4699007_4700564_-	COG0146, HyuB, N-methylhydantoinase B/acetone carboxylase, alpha subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	22	4756385-4756495	21	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	ATTTCTTGCAAATGTATAATCAGTTGC	27	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|195aa|up_7|NZ_CP040094.1_4742535_4743120_+,NA|170aa|up_3|NZ_CP040094.1_4751814_4752324_-,NA|85aa|down_2|NZ_CP040094.1_4762065_4762320_+	NA|631aa|up_9|NZ_CP040094.1_4739120_4741013_+	pfam00305, Lipoxygenase, Lipoxygenase	NA|450aa|up_8|NZ_CP040094.1_4741060_4742410_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|195aa|up_7|NZ_CP040094.1_4742535_4743120_+	NA	NA|559aa|up_6|NZ_CP040094.1_4743147_4744824_+	COG4782, COG4782, Uncharacterized protein conserved in bacteria [Function unknown]	NA|815aa|up_5|NZ_CP040094.1_4745500_4747945_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|821aa|up_4|NZ_CP040094.1_4749269_4751732_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|170aa|up_3|NZ_CP040094.1_4751814_4752324_-	NA	NA|305aa|up_2|NZ_CP040094.1_4752367_4753282_-	PRK01212, PRK01212, homoserine kinase; Provisional	NA|247aa|up_1|NZ_CP040094.1_4753455_4754196_+	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|522aa|up_0|NZ_CP040094.1_4754567_4756133_+	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|1109aa|down_0|NZ_CP040094.1_4756956_4760283_+	PRK11448, hsdR, type I restriction enzyme EcoKI subunit R; Provisional	NA|490aa|down_1|NZ_CP040094.1_4760455_4761925_+	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|85aa|down_2|NZ_CP040094.1_4762065_4762320_+	NA	NA|104aa|down_3|NZ_CP040094.1_4762316_4762628_+	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|73aa|down_4|NZ_CP040094.1_4762634_4762853_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|78aa|down_5|NZ_CP040094.1_4762849_4763083_+	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|467aa|down_6|NZ_CP040094.1_4763088_4764489_+	cd17282, RMtype1_S_Eco16444ORF1681_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Escherichia coli G4/9 S subunit (S	NA|372aa|down_7|NZ_CP040094.1_4764515_4765631_-	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|379aa|down_8|NZ_CP040094.1_4765832_4766969_+	pfam07176, DUF1400, Alpha/beta hydrolase of unknown function (DUF1400)	NA|230aa|down_9|NZ_CP040094.1_4767076_4767766_-	pfam02698, DUF218, DUF218 domain
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	23	5027934-5028047	22	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GACATCATTGATGGTGGAGAAGGCAACGATAC	32	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|66aa|up_6|NZ_CP040094.1_5013125_5013323_+,NA|269aa|down_0|NZ_CP040094.1_5031165_5031972_+,NA|326aa|down_3|NZ_CP040094.1_5035387_5036365_+,NA|51aa|down_5|NZ_CP040094.1_5038167_5038320_+,NA|463aa|down_6|NZ_CP040094.1_5043188_5044577_+	NA|333aa|up_9|NZ_CP040094.1_5007951_5008950_-	pfam07082, DUF1350, Protein of unknown function (DUF1350)	NA|613aa|up_8|NZ_CP040094.1_5009291_5011130_+	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase	NA|424aa|up_7|NZ_CP040094.1_5011749_5013021_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|66aa|up_6|NZ_CP040094.1_5013125_5013323_+	NA	NA|451aa|up_5|NZ_CP040094.1_5013878_5015231_+	pfam03709, OKR_DC_1_N, Orn/Lys/Arg decarboxylase, N-terminal domain	NA|1779aa|up_4|NZ_CP040094.1_5015306_5020643_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|454aa|up_3|NZ_CP040094.1_5020864_5022226_+	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|330aa|up_2|NZ_CP040094.1_5022501_5023491_-	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|385aa|up_1|NZ_CP040094.1_5023496_5024651_-	TIGR03820, lys_2_3_AblA, lysine-2,3-aminomutase	NA|555aa|up_0|NZ_CP040094.1_5025284_5026949_+	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|269aa|down_0|NZ_CP040094.1_5031165_5031972_+	NA	NA|393aa|down_1|NZ_CP040094.1_5032886_5034065_+	cd17330, MFS_SLC46_TetA_like, Eukaryotic Solute carrier 46 (SLC46) family, Bacterial Tetracycline resistance proteins, and similar proteins of the Major Facilitator Superfamily of transporters	NA|341aa|down_2|NZ_CP040094.1_5034158_5035181_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|326aa|down_3|NZ_CP040094.1_5035387_5036365_+	NA	NA|482aa|down_4|NZ_CP040094.1_5036529_5037975_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|51aa|down_5|NZ_CP040094.1_5038167_5038320_+	NA	NA|463aa|down_6|NZ_CP040094.1_5043188_5044577_+	NA	NA|301aa|down_7|NZ_CP040094.1_5044622_5045525_+	COG1587, HemD, Uroporphyrinogen-III synthase [Coenzyme metabolism]	NA|100aa|down_8|NZ_CP040094.1_5045628_5045928_+	TIGR02008, Ferredoxin_root_R-B1, ferredoxin [2Fe-2S]	NA|341aa|down_9|NZ_CP040094.1_5046170_5047193_+	pfam01609, DDE_Tnp_1, Transposase DDE domain
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	24	5028381-5028493	23	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GACATCATTGATGGTGGAGAAGGCAACGATAC	32	1	1	5028413-5028461	NZ_CP040094.1_5029754-5029802	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|66aa|up_6|NZ_CP040094.1_5013125_5013323_+,NA|269aa|down_0|NZ_CP040094.1_5031165_5031972_+,NA|326aa|down_3|NZ_CP040094.1_5035387_5036365_+,NA|51aa|down_5|NZ_CP040094.1_5038167_5038320_+,NA|463aa|down_6|NZ_CP040094.1_5043188_5044577_+	NA|333aa|up_9|NZ_CP040094.1_5007951_5008950_-	pfam07082, DUF1350, Protein of unknown function (DUF1350)	NA|613aa|up_8|NZ_CP040094.1_5009291_5011130_+	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase	NA|424aa|up_7|NZ_CP040094.1_5011749_5013021_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|66aa|up_6|NZ_CP040094.1_5013125_5013323_+	NA	NA|451aa|up_5|NZ_CP040094.1_5013878_5015231_+	pfam03709, OKR_DC_1_N, Orn/Lys/Arg decarboxylase, N-terminal domain	NA|1779aa|up_4|NZ_CP040094.1_5015306_5020643_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|454aa|up_3|NZ_CP040094.1_5020864_5022226_+	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|330aa|up_2|NZ_CP040094.1_5022501_5023491_-	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|385aa|up_1|NZ_CP040094.1_5023496_5024651_-	TIGR03820, lys_2_3_AblA, lysine-2,3-aminomutase	NA|555aa|up_0|NZ_CP040094.1_5025284_5026949_+	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|269aa|down_0|NZ_CP040094.1_5031165_5031972_+	NA	NA|393aa|down_1|NZ_CP040094.1_5032886_5034065_+	cd17330, MFS_SLC46_TetA_like, Eukaryotic Solute carrier 46 (SLC46) family, Bacterial Tetracycline resistance proteins, and similar proteins of the Major Facilitator Superfamily of transporters	NA|341aa|down_2|NZ_CP040094.1_5034158_5035181_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|326aa|down_3|NZ_CP040094.1_5035387_5036365_+	NA	NA|482aa|down_4|NZ_CP040094.1_5036529_5037975_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|51aa|down_5|NZ_CP040094.1_5038167_5038320_+	NA	NA|463aa|down_6|NZ_CP040094.1_5043188_5044577_+	NA	NA|301aa|down_7|NZ_CP040094.1_5044622_5045525_+	COG1587, HemD, Uroporphyrinogen-III synthase [Coenzyme metabolism]	NA|100aa|down_8|NZ_CP040094.1_5045628_5045928_+	TIGR02008, Ferredoxin_root_R-B1, ferredoxin [2Fe-2S]	NA|341aa|down_9|NZ_CP040094.1_5046170_5047193_+	pfam01609, DDE_Tnp_1, Transposase DDE domain
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	25	5112598-5113730	11,24,9	PILER-CR,CRISPRCasFinder,CRT	no	csx3,c2c8_V-U2	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Type V-U2	GTTTCAATCCCTAATAGGGATTTTGATAAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATAAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATAAATTGCAAT	37,37,37	0	0	NA	NA	N:A	15,15,15	15	TypeV-U2	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|165aa|up_5|NZ_CP040094.1_5106172_5106667_+,NA|147aa|up_1|NZ_CP040094.1_5110565_5111006_+,NA|108aa|down_0|NZ_CP040094.1_5114075_5114399_-	NA|208aa|up_9|NZ_CP040094.1_5097938_5098562_-	pfam05685, Uma2, Putative restriction endonuclease	NA|1117aa|up_8|NZ_CP040094.1_5099183_5102534_-	PRK10503, PRK10503, MdtB/MuxB family multidrug efflux RND transporter permease subunit	NA|482aa|up_7|NZ_CP040094.1_5102657_5104103_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|291aa|up_6|NZ_CP040094.1_5104421_5105294_-	PRK13398, PRK13398, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|165aa|up_5|NZ_CP040094.1_5106172_5106667_+	NA	NA|222aa|up_4|NZ_CP040094.1_5106749_5107415_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|202aa|up_3|NZ_CP040094.1_5108234_5108840_+	pfam14273, DUF4360, Domain of unknown function (DUF4360)	NA|286aa|up_2|NZ_CP040094.1_5109422_5110280_+	COG1801, COG1801, Uncharacterized conserved protein [Function unknown]	NA|147aa|up_1|NZ_CP040094.1_5110565_5111006_+	NA	NA|354aa|up_0|NZ_CP040094.1_5111321_5112383_+	PRK01889, PRK01889, GTPase RsgA; Reviewed	NA|108aa|down_0|NZ_CP040094.1_5114075_5114399_-	NA	NA|60aa|down_1|NZ_CP040094.1_5114691_5114871_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|101aa|down_2|NZ_CP040094.1_5118068_5118371_-	PRK14423, PRK14423, acylphosphatase; Provisional	NA|126aa|down_3|NZ_CP040094.1_5118720_5119098_+	pfam08853, DUF1823, Domain of unknown function (DUF1823)	NA|553aa|down_4|NZ_CP040094.1_5119418_5121077_+	cd11350, AmyAc_4, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|352aa|down_5|NZ_CP040094.1_5121277_5122333_+	PRK12755, PRK12755, phospho-2-dehydro-3-deoxyheptonate aldolase; Provisional	NA|353aa|down_6|NZ_CP040094.1_5123166_5124225_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|242aa|down_7|NZ_CP040094.1_5124230_5124956_-	smart00271, DnaJ, DnaJ molecular chaperone homology domain	csx3|65aa|down_8|NZ_CP040094.1_5125014_5125209_-	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	NA|181aa|down_9|NZ_CP040094.1_5125396_5125939_-	cd07503, HAD_HisB-N, histidinol phosphate phosphatase and related phosphatases
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	26	5297813-5297920	25	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	ATAGAAGGTTTAAAAGCCGGGAAGAAACTTAAAGAA	36	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|326aa|up_9|NZ_CP040094.1_5276836_5277814_+,NA|159aa|up_7|NZ_CP040094.1_5280833_5281310_-,NA|356aa|up_0|NZ_CP040094.1_5295534_5296601_+,NA|80aa|down_3|NZ_CP040094.1_5305070_5305310_+,NA|80aa|down_4|NZ_CP040094.1_5305390_5305630_+,NA|80aa|down_5|NZ_CP040094.1_5305710_5305950_+,NA|80aa|down_6|NZ_CP040094.1_5306030_5306270_+,NA|80aa|down_7|NZ_CP040094.1_5306350_5306590_+,NA|80aa|down_8|NZ_CP040094.1_5306670_5306910_+,NA|80aa|down_9|NZ_CP040094.1_5306990_5307230_+	NA|326aa|up_9|NZ_CP040094.1_5276836_5277814_+	NA	NA|768aa|up_8|NZ_CP040094.1_5278373_5280677_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|159aa|up_7|NZ_CP040094.1_5280833_5281310_-	NA	NA|425aa|up_6|NZ_CP040094.1_5281395_5282670_-	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|30aa|up_5|NZ_CP040094.1_5284769_5284859_+	pfam14218, COP23, Circadian oscillating protein COP23	NA|1299aa|up_4|NZ_CP040094.1_5285415_5289312_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|446aa|up_3|NZ_CP040094.1_5290388_5291726_+	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|525aa|up_2|NZ_CP040094.1_5291759_5293334_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|241aa|up_1|NZ_CP040094.1_5294579_5295302_-	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|356aa|up_0|NZ_CP040094.1_5295534_5296601_+	NA	NA|913aa|down_0|NZ_CP040094.1_5299979_5302718_+	cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase)	NA|265aa|down_1|NZ_CP040094.1_5302735_5303530_-	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|354aa|down_2|NZ_CP040094.1_5303573_5304635_-	pfam11845, DUF3365, Protein of unknown function (DUF3365)	NA|80aa|down_3|NZ_CP040094.1_5305070_5305310_+	NA	NA|80aa|down_4|NZ_CP040094.1_5305390_5305630_+	NA	NA|80aa|down_5|NZ_CP040094.1_5305710_5305950_+	NA	NA|80aa|down_6|NZ_CP040094.1_5306030_5306270_+	NA	NA|80aa|down_7|NZ_CP040094.1_5306350_5306590_+	NA	NA|80aa|down_8|NZ_CP040094.1_5306670_5306910_+	NA	NA|80aa|down_9|NZ_CP040094.1_5306990_5307230_+	NA
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	27	5660078-5662236	26,10,12	CRISPRCasFinder,CRT,PILER-CR	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT	37,37,37	0	0	NA	NA	N:A	29,29,28	29	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|248aa|up_9|NZ_CP040094.1_5634165_5634909_+,NA|261aa|down_4|NZ_CP040094.1_5666969_5667752_+,NA|71aa|down_5|NZ_CP040094.1_5667854_5668067_-,NA|378aa|down_6|NZ_CP040094.1_5668073_5669207_-,NA|81aa|down_7|NZ_CP040094.1_5669465_5669708_-,NA|107aa|down_8|NZ_CP040094.1_5670266_5670587_+,NA|356aa|down_9|NZ_CP040094.1_5670971_5672038_+	NA|248aa|up_9|NZ_CP040094.1_5634165_5634909_+	NA	NA|350aa|up_8|NZ_CP040094.1_5634929_5635979_-	cd02110, SO_family_Moco_dimer, Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain	NA|1860aa|up_7|NZ_CP040094.1_5636380_5641960_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|129aa|up_6|NZ_CP040094.1_5642042_5642429_+	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	NA|779aa|up_5|NZ_CP040094.1_5642509_5644846_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|326aa|up_4|NZ_CP040094.1_5644998_5645976_-	cd09763, DHRS1-like_SDR_c, human dehydrogenase/reductase (SDR family) member 1 (DHRS1) -like, classical (c) SDRs	NA|314aa|up_3|NZ_CP040094.1_5646341_5647283_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|1253aa|up_2|NZ_CP040094.1_5648145_5651904_+	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|74aa|up_1|NZ_CP040094.1_5651956_5652178_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|2199aa|up_0|NZ_CP040094.1_5653429_5660026_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|160aa|down_0|NZ_CP040094.1_5662560_5663040_-	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|166aa|down_1|NZ_CP040094.1_5663679_5664177_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|210aa|down_2|NZ_CP040094.1_5664298_5664928_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|219aa|down_3|NZ_CP040094.1_5666150_5666807_-	COG3703, ChaC, Uncharacterized protein involved in cation transport [Inorganic ion transport and metabolism]	NA|261aa|down_4|NZ_CP040094.1_5666969_5667752_+	NA	NA|71aa|down_5|NZ_CP040094.1_5667854_5668067_-	NA	NA|378aa|down_6|NZ_CP040094.1_5668073_5669207_-	NA	NA|81aa|down_7|NZ_CP040094.1_5669465_5669708_-	NA	NA|107aa|down_8|NZ_CP040094.1_5670266_5670587_+	NA	NA|356aa|down_9|NZ_CP040094.1_5670971_5672038_+	NA
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	28	6117125-6117229	27	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	TCCTATTTCAGGAGACAGTTCTATCAGTTGATTGTC	36	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|101aa|up_6|NZ_CP040094.1_6104243_6104546_+,NA|174aa|down_0|NZ_CP040094.1_6117912_6118434_-,NA|158aa|down_6|NZ_CP040094.1_6125162_6125636_+,NA|123aa|down_8|NZ_CP040094.1_6127252_6127621_+	NA|663aa|up_9|NZ_CP040094.1_6097783_6099772_-	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|442aa|up_8|NZ_CP040094.1_6100040_6101366_+	pfam13191, AAA_16, AAA ATPase domain	NA|932aa|up_7|NZ_CP040094.1_6101362_6104158_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|101aa|up_6|NZ_CP040094.1_6104243_6104546_+	NA	NA|881aa|up_5|NZ_CP040094.1_6104705_6107348_-	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|145aa|up_4|NZ_CP040094.1_6107936_6108371_-	TIGR00068, Lactoylglutathione_lyase, lactoylglutathione lyase	NA|430aa|up_3|NZ_CP040094.1_6108627_6109917_+	PRK00077, eno, enolase; Provisional	NA|353aa|up_2|NZ_CP040094.1_6110062_6111121_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|558aa|up_1|NZ_CP040094.1_6111445_6113119_+	PRK09319, PRK09319, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase RibB/GTP cyclohydrolase II RibA	NA|293aa|up_0|NZ_CP040094.1_6113288_6114167_-	cd06583, PGRP, Peptidoglycan recognition proteins (PGRPs) are pattern recognition receptors that bind, and in certain cases, hydrolyze peptidoglycans (PGNs) of bacterial cell walls	NA|174aa|down_0|NZ_CP040094.1_6117912_6118434_-	NA	NA|95aa|down_1|NZ_CP040094.1_6118423_6118708_-	pfam08681, DUF1778, Protein of unknown function (DUF1778)	NA|877aa|down_2|NZ_CP040094.1_6118910_6121541_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|444aa|down_3|NZ_CP040094.1_6121767_6123099_+	pfam04185, Phosphoesterase, Phosphoesterase family	NA|296aa|down_4|NZ_CP040094.1_6123234_6124122_-	cd08414, PBP2_LTTR_aromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of aromatic compounds and that of other related regulators, contains type 2 periplasmic binding fold	NA|259aa|down_5|NZ_CP040094.1_6124256_6125033_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|158aa|down_6|NZ_CP040094.1_6125162_6125636_+	NA	NA|393aa|down_7|NZ_CP040094.1_6125717_6126896_-	cd13661, PBP2_PotD_PotF_like_1, The periplasmic substrate-binding component of an uncharacterized active transport system closely related to spermidine and putrescine transporters; contains the type 2 periplasmic binding fold	NA|123aa|down_8|NZ_CP040094.1_6127252_6127621_+	NA	NA|398aa|down_9|NZ_CP040094.1_6130513_6131707_-	PRK04447, PRK04447, hypothetical protein; Provisional
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	29	6359181-6359259	28	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GCTTTGGCTTCGTTAATCTGCTGTT	25	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|178aa|up_6|NZ_CP040094.1_6353062_6353596_+,NA|134aa|up_5|NZ_CP040094.1_6353646_6354048_+,NA|69aa|up_3|NZ_CP040094.1_6354659_6354866_+,NA|502aa|up_2|NZ_CP040094.1_6355064_6356570_+,NA	NA|206aa|up_9|NZ_CP040094.1_6346830_6347448_-	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|193aa|up_8|NZ_CP040094.1_6347529_6348108_-	pfam08346, AntA, AntA/AntB antirepressor	NA|815aa|up_7|NZ_CP040094.1_6350405_6352850_-	TIGR03788, marine_srt_targ, marine proteobacterial sortase target protein	NA|178aa|up_6|NZ_CP040094.1_6353062_6353596_+	NA	NA|134aa|up_5|NZ_CP040094.1_6353646_6354048_+	NA	NA|150aa|up_4|NZ_CP040094.1_6354059_6354509_+	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|69aa|up_3|NZ_CP040094.1_6354659_6354866_+	NA	NA|502aa|up_2|NZ_CP040094.1_6355064_6356570_+	NA	NA|235aa|up_1|NZ_CP040094.1_6356729_6357434_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|392aa|up_0|NZ_CP040094.1_6357521_6358697_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|205aa|down_0|NZ_CP040094.1_6360254_6360869_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|163aa|down_1|NZ_CP040094.1_6361049_6361538_-	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|444aa|down_2|NZ_CP040094.1_6361776_6363108_+	PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional	NA|405aa|down_3|NZ_CP040094.1_6363386_6364601_-	pfam05626, DUF790, Protein of unknown function (DUF790)	NA|534aa|down_4|NZ_CP040094.1_6364530_6366132_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|434aa|down_5|NZ_CP040094.1_6366337_6367639_+	pfam00211, Guanylate_cyc, Adenylate and Guanylate cyclase catalytic domain	NA|1047aa|down_6|NZ_CP040094.1_6367933_6371074_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|615aa|down_7|NZ_CP040094.1_6371356_6373201_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|1371aa|down_8|NZ_CP040094.1_6373332_6377445_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|728aa|down_9|NZ_CP040094.1_6377787_6379971_-	pfam06616, BsuBI_PstI_RE, BsuBI/PstI restriction endonuclease C-terminus
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	30	6500361-6500616	13,29,11	PILER-CR,CRISPRCasFinder,CRT	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	GTTTCAATCCCTAATAGGGATTTTATTTGATTGCAATTT,GTTTTAATCCCTAATAGGGATTTTATTTGATTGCAATT,AATCCCTAATAGGGATTTTATTTGATTGCAATTT	39,38,34	0	0	NA	NA	N:A	3,3,3	3	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|67aa|up_9|NZ_CP040094.1_6489571_6489772_+,NA|109aa|up_8|NZ_CP040094.1_6489891_6490218_+,NA|538aa|up_7|NZ_CP040094.1_6490219_6491833_+,NA|356aa|up_0|NZ_CP040094.1_6499137_6500204_-,NA|125aa|down_2|NZ_CP040094.1_6503738_6504113_+	NA|67aa|up_9|NZ_CP040094.1_6489571_6489772_+	NA	NA|109aa|up_8|NZ_CP040094.1_6489891_6490218_+	NA	NA|538aa|up_7|NZ_CP040094.1_6490219_6491833_+	NA	NA|336aa|up_6|NZ_CP040094.1_6492914_6493922_+	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|184aa|up_5|NZ_CP040094.1_6494087_6494639_+	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|432aa|up_4|NZ_CP040094.1_6494679_6495975_+	PRK07380, PRK07380, adenylosuccinate lyase; Provisional	NA|185aa|up_3|NZ_CP040094.1_6496335_6496890_+	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|389aa|up_2|NZ_CP040094.1_6497014_6498181_-	PLN02449, PLN02449, ferrochelatase	NA|208aa|up_1|NZ_CP040094.1_6498275_6498899_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|356aa|up_0|NZ_CP040094.1_6499137_6500204_-	NA	NA|256aa|down_0|NZ_CP040094.1_6500769_6501537_-	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|428aa|down_1|NZ_CP040094.1_6501726_6503010_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|125aa|down_2|NZ_CP040094.1_6503738_6504113_+	NA	NA|288aa|down_3|NZ_CP040094.1_6504717_6505581_-	COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane]	NA|248aa|down_4|NZ_CP040094.1_6505659_6506403_-	sd00006, TPR, Tetratricopeptide repeat	NA|565aa|down_5|NZ_CP040094.1_6507220_6508915_-	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|253aa|down_6|NZ_CP040094.1_6509030_6509789_-	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|224aa|down_7|NZ_CP040094.1_6510272_6510944_+	TIGR02869, Spore_cortex-lytic_enzyme, spore cortex-lytic enzyme	NA|377aa|down_8|NZ_CP040094.1_6511200_6512331_-	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|348aa|down_9|NZ_CP040094.1_6513332_6514376_+	cd13565, PBP2_PstS, Substrate binding domain of ABC-type phosphate transporter, a member of the type 2 periplasmic-binding fold superfamily
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	31	6743293-6743397	30	CRISPRCasFinder	no	csa3	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Type I-A	TACAAGCCCCTGAATTTATTTATGG	25	0	0	NA	NA	N:A	1	1	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA,NA|110aa|down_1|NZ_CP040094.1_6744792_6745122_-,NA|161aa|down_2|NZ_CP040094.1_6745218_6745701_-,NA|219aa|down_3|NZ_CP040094.1_6746040_6746697_+,NA|624aa|down_4|NZ_CP040094.1_6746735_6748607_+,NA|86aa|down_8|NZ_CP040094.1_6752992_6753250_-,NA|66aa|down_9|NZ_CP040094.1_6753356_6753554_-	NA|151aa|up_9|NZ_CP040094.1_6735229_6735682_+	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|114aa|up_8|NZ_CP040094.1_6735786_6736128_+	PRK13612, PRK13612, photosystem II reaction center protein Psb28; Provisional	NA|170aa|up_7|NZ_CP040094.1_6736231_6736741_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|586aa|up_6|NZ_CP040094.1_6736869_6738627_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|364aa|up_5|NZ_CP040094.1_6738922_6740014_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|79aa|up_4|NZ_CP040094.1_6740093_6740330_-	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|195aa|up_3|NZ_CP040094.1_6740572_6741157_+	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|72aa|up_2|NZ_CP040094.1_6741445_6741661_+	pfam02427, PSI_PsaE, Photosystem I reaction centre subunit IV / PsaE	NA|300aa|up_1|NZ_CP040094.1_6741912_6742812_+	PRK13945, PRK13945, formamidopyrimidine-DNA glycosylase; Provisional	NA|72aa|up_0|NZ_CP040094.1_6742913_6743129_+	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|323aa|down_0|NZ_CP040094.1_6743486_6744455_+	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|110aa|down_1|NZ_CP040094.1_6744792_6745122_-	NA	NA|161aa|down_2|NZ_CP040094.1_6745218_6745701_-	NA	NA|219aa|down_3|NZ_CP040094.1_6746040_6746697_+	NA	NA|624aa|down_4|NZ_CP040094.1_6746735_6748607_+	NA	NA|458aa|down_5|NZ_CP040094.1_6748617_6749991_+	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|221aa|down_6|NZ_CP040094.1_6750372_6751035_+	pfam07862, Nif11, Nif11 domain	NA|529aa|down_7|NZ_CP040094.1_6751293_6752880_-	PRK14096, pgi, glucose-6-phosphate isomerase; Provisional	NA|86aa|down_8|NZ_CP040094.1_6752992_6753250_-	NA	NA|66aa|down_9|NZ_CP040094.1_6753356_6753554_-	NA
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	32	6744812-6745016	31	CRISPRCasFinder	no	csa3	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Type I-A	TCTGCGGGGTCGCCGTAGGGGTCTT	25	0	0	NA	NA	N:A	3	3	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA,NA|161aa|down_0|NZ_CP040094.1_6745218_6745701_-,NA|219aa|down_1|NZ_CP040094.1_6746040_6746697_+,NA|624aa|down_2|NZ_CP040094.1_6746735_6748607_+,NA|86aa|down_6|NZ_CP040094.1_6752992_6753250_-,NA|66aa|down_7|NZ_CP040094.1_6753356_6753554_-	NA|114aa|up_9|NZ_CP040094.1_6735786_6736128_+	PRK13612, PRK13612, photosystem II reaction center protein Psb28; Provisional	NA|170aa|up_8|NZ_CP040094.1_6736231_6736741_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|586aa|up_7|NZ_CP040094.1_6736869_6738627_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|364aa|up_6|NZ_CP040094.1_6738922_6740014_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|79aa|up_5|NZ_CP040094.1_6740093_6740330_-	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|195aa|up_4|NZ_CP040094.1_6740572_6741157_+	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|72aa|up_3|NZ_CP040094.1_6741445_6741661_+	pfam02427, PSI_PsaE, Photosystem I reaction centre subunit IV / PsaE	NA|300aa|up_2|NZ_CP040094.1_6741912_6742812_+	PRK13945, PRK13945, formamidopyrimidine-DNA glycosylase; Provisional	NA|72aa|up_1|NZ_CP040094.1_6742913_6743129_+	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|323aa|up_0|NZ_CP040094.1_6743486_6744455_+	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|161aa|down_0|NZ_CP040094.1_6745218_6745701_-	NA	NA|219aa|down_1|NZ_CP040094.1_6746040_6746697_+	NA	NA|624aa|down_2|NZ_CP040094.1_6746735_6748607_+	NA	NA|458aa|down_3|NZ_CP040094.1_6748617_6749991_+	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|221aa|down_4|NZ_CP040094.1_6750372_6751035_+	pfam07862, Nif11, Nif11 domain	NA|529aa|down_5|NZ_CP040094.1_6751293_6752880_-	PRK14096, pgi, glucose-6-phosphate isomerase; Provisional	NA|86aa|down_6|NZ_CP040094.1_6752992_6753250_-	NA	NA|66aa|down_7|NZ_CP040094.1_6753356_6753554_-	NA	NA|378aa|down_8|NZ_CP040094.1_6753772_6754906_+	COG4177, LivM, ABC-type branched-chain amino acid transport system, permease component [Amino acid transport and metabolism]	NA|261aa|down_9|NZ_CP040094.1_6754895_6755678_+	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]
GCF_013343235.1_ASM1334323v1	NZ_CP040094	Nostoc sp. TCL240-02 chromosome, complete genome	33	7037226-7037401	32	CRISPRCasFinder	no		DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	Orphan	TGCTCCTTCCAAGTTGGCATCACAAA	26	0	0	NA	NA	N:A	2	2	Orphan	DinG,cas3,csa3,cas4,cas6,RT,cas1,cas2,csx3,WYL,csm6,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j,c2c5_V-U5,PD-DExK,Cas9_archaeal,DEDDh,c2c8_V-U2,Cas14c_CAS-V-F,csm5gr7,csm4gr5,csm3gr7,csm2gr11,csc2gr7,csc1gr5,2OG_CAS	NA|251aa|up_1|NZ_CP040094.1_7034567_7035320_-,NA|106aa|down_3|NZ_CP040094.1_7041718_7042036_-,NA|110aa|down_6|NZ_CP040094.1_7046658_7046988_-	NA|371aa|up_9|NZ_CP040094.1_7027244_7028357_-	COG3292, COG3292, Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]	NA|477aa|up_8|NZ_CP040094.1_7029208_7030639_+	CHL00040, rbcL, ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit	NA|136aa|up_7|NZ_CP040094.1_7030739_7031147_+	pfam02341, RcbX, RbcX protein	NA|113aa|up_6|NZ_CP040094.1_7031194_7031533_+	pfam00101, RuBisCO_small, Ribulose bisphosphate carboxylase, small chain	NA|214aa|up_5|NZ_CP040094.1_7031755_7032397_+	cd01835, SGNH_hydrolase_like_3, SGNH_hydrolase subfamily	NA|76aa|up_4|NZ_CP040094.1_7032439_7032667_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|129aa|up_3|NZ_CP040094.1_7032663_7033050_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|426aa|up_2|NZ_CP040094.1_7033141_7034419_+	PLN00020, PLN00020, ribulose bisphosphate carboxylase/oxygenase activase -RuBisCO activase (RCA); Provisional	NA|251aa|up_1|NZ_CP040094.1_7034567_7035320_-	NA	NA|124aa|up_0|NZ_CP040094.1_7036429_7036801_-	pfam10184, DUF2358, Uncharacterized conserved protein (DUF2358)	NA|236aa|down_0|NZ_CP040094.1_7038638_7039346_-	TIGR04286, hypothetical_protein_HMPREF9455_00034, MSEP-CTERM protein	NA|236aa|down_1|NZ_CP040094.1_7039384_7040092_-	pfam09843, DUF2070, Predicted membrane protein (DUF2070)	NA|515aa|down_2|NZ_CP040094.1_7040115_7041660_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|106aa|down_3|NZ_CP040094.1_7041718_7042036_-	NA	NA|750aa|down_4|NZ_CP040094.1_7042183_7044433_-	COG1244, COG1244, Predicted Fe-S oxidoreductase [General function prediction only]	NA|307aa|down_5|NZ_CP040094.1_7045684_7046605_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|110aa|down_6|NZ_CP040094.1_7046658_7046988_-	NA	NA|257aa|down_7|NZ_CP040094.1_7047038_7047809_-	PRK09009, PRK09009, SDR family oxidoreductase	NA|168aa|down_8|NZ_CP040094.1_7047815_7048319_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|465aa|down_9|NZ_CP040094.1_7048503_7049898_-	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE
