assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	1	76631-76743	1	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	ATCATGGGGAGGTAGGAGACAGGAGGGGA	29	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|331aa|down_1|NZ_AP018254.1_80372_81365_-,NA|152aa|down_6|NZ_AP018254.1_87419_87875_+,NA|71aa|down_9|NZ_AP018254.1_90910_91123_-	NA|243aa|up_9|NZ_AP018254.1_62679_63408_+	cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport system	NA|372aa|up_8|NZ_AP018254.1_63871_64987_+	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|832aa|up_7|NZ_AP018254.1_65267_67763_-	COG4449, COG4449, Predicted protease of the Abi (CAAX) family [General function prediction only]	NA|190aa|up_6|NZ_AP018254.1_68188_68758_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|576aa|up_5|NZ_AP018254.1_69084_70812_+	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|93aa|up_4|NZ_AP018254.1_71138_71417_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|316aa|up_3|NZ_AP018254.1_71656_72604_+	pfam06897, DUF1269, Protein of unknown function (DUF1269)	NA|156aa|up_2|NZ_AP018254.1_72820_73288_+	pfam04972, BON, BON domain	NA|491aa|up_1|NZ_AP018254.1_73438_74911_+	TIGR01386, Probable_sensor_protein_PcoS, heavy metal sensor kinase	NA|223aa|up_0|NZ_AP018254.1_74915_75584_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|1002aa|down_0|NZ_AP018254.1_76916_79922_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|331aa|down_1|NZ_AP018254.1_80372_81365_-	NA	NA|337aa|down_2|NZ_AP018254.1_81546_82557_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|403aa|down_3|NZ_AP018254.1_83035_84244_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|451aa|down_4|NZ_AP018254.1_84602_85955_+	cd13127, MATE_tuaB_like, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|276aa|down_5|NZ_AP018254.1_86422_87250_+	pfam00805, Pentapeptide, Pentapeptide repeats (8 copies)	NA|152aa|down_6|NZ_AP018254.1_87419_87875_+	NA	NA|271aa|down_7|NZ_AP018254.1_87900_88713_+	COG0115, IlvE, Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [Amino acid transport and metabolism / Coenzyme metabolism]	NA|615aa|down_8|NZ_AP018254.1_88830_90675_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|71aa|down_9|NZ_AP018254.1_90910_91123_-	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	2	264811-264919	2	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CCGTCGGAAAATACTCACTACTACTTTGGTTTCATCG	37	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|88aa|up_8|NZ_AP018254.1_256049_256313_-,NA|89aa|up_7|NZ_AP018254.1_256783_257050_-,NA|111aa|up_2|NZ_AP018254.1_262794_263127_+,NA|193aa|up_0|NZ_AP018254.1_264231_264810_-,NA|52aa|down_1|NZ_AP018254.1_267128_267284_+,NA|179aa|down_2|NZ_AP018254.1_267298_267835_-,NA|74aa|down_7|NZ_AP018254.1_273349_273571_+	NA|391aa|up_9|NZ_AP018254.1_254722_255895_-	cd05120, APH_ChoK_like, Aminoglycoside 3'-phosphotransferase and Choline Kinase family	NA|88aa|up_8|NZ_AP018254.1_256049_256313_-	NA	NA|89aa|up_7|NZ_AP018254.1_256783_257050_-	NA	NA|220aa|up_6|NZ_AP018254.1_258130_258790_+	cd02149, NfsB-like, nitroreductase similar to Escherichia coli NfsB	NA|133aa|up_5|NZ_AP018254.1_259531_259930_+	TIGR03044, possible_photosystem_II_Psb27_protein, photosystem II protein Psb27	NA|490aa|up_4|NZ_AP018254.1_260145_261615_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|248aa|up_3|NZ_AP018254.1_261759_262503_+	TIGR04526, predic_Ig_block, putative immunoglobulin-blocking virulence protein	NA|111aa|up_2|NZ_AP018254.1_262794_263127_+	NA	NA|144aa|up_1|NZ_AP018254.1_263462_263894_+	COG0394, Wzb, Protein-tyrosine-phosphatase [Signal transduction mechanisms]	NA|193aa|up_0|NZ_AP018254.1_264231_264810_-	NA	NA|648aa|down_0|NZ_AP018254.1_265053_266997_-	TIGR02042, Sulfite_reductase, ferredoxin-sulfite reductase	NA|52aa|down_1|NZ_AP018254.1_267128_267284_+	NA	NA|179aa|down_2|NZ_AP018254.1_267298_267835_-	NA	NA|224aa|down_3|NZ_AP018254.1_269357_270029_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|198aa|down_4|NZ_AP018254.1_270509_271103_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|409aa|down_5|NZ_AP018254.1_271076_272303_-	TIGR00275, TIGR00275, flavoprotein, HI0933 family	NA|197aa|down_6|NZ_AP018254.1_272605_273196_-	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|74aa|down_7|NZ_AP018254.1_273349_273571_+	NA	NA|338aa|down_8|NZ_AP018254.1_274237_275251_+	PRK03427, PRK03427, cell division protein ZipA; Provisional	NA|86aa|down_9|NZ_AP018254.1_275393_275651_+	TIGR02181, GRX_bact, Glutaredoxin, GrxC family
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	3	272500-272601	3	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CTACTAAAAAACTTTTACAGCAGACCCTTGTCA	33	1	1	272533-272568	NZ_AP018254.1_272463-272498	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|111aa|up_8|NZ_AP018254.1_262794_263127_+,NA|193aa|up_6|NZ_AP018254.1_264231_264810_-,NA|52aa|up_4|NZ_AP018254.1_267128_267284_+,NA|179aa|up_3|NZ_AP018254.1_267298_267835_-,NA|74aa|down_1|NZ_AP018254.1_273349_273571_+,NA|104aa|down_5|NZ_AP018254.1_276955_277267_+	NA|248aa|up_9|NZ_AP018254.1_261759_262503_+	TIGR04526, predic_Ig_block, putative immunoglobulin-blocking virulence protein	NA|111aa|up_8|NZ_AP018254.1_262794_263127_+	NA	NA|144aa|up_7|NZ_AP018254.1_263462_263894_+	COG0394, Wzb, Protein-tyrosine-phosphatase [Signal transduction mechanisms]	NA|193aa|up_6|NZ_AP018254.1_264231_264810_-	NA	NA|648aa|up_5|NZ_AP018254.1_265053_266997_-	TIGR02042, Sulfite_reductase, ferredoxin-sulfite reductase	NA|52aa|up_4|NZ_AP018254.1_267128_267284_+	NA	NA|179aa|up_3|NZ_AP018254.1_267298_267835_-	NA	NA|224aa|up_2|NZ_AP018254.1_269357_270029_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|198aa|up_1|NZ_AP018254.1_270509_271103_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|409aa|up_0|NZ_AP018254.1_271076_272303_-	TIGR00275, TIGR00275, flavoprotein, HI0933 family	NA|197aa|down_0|NZ_AP018254.1_272605_273196_-	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|74aa|down_1|NZ_AP018254.1_273349_273571_+	NA	NA|338aa|down_2|NZ_AP018254.1_274237_275251_+	PRK03427, PRK03427, cell division protein ZipA; Provisional	NA|86aa|down_3|NZ_AP018254.1_275393_275651_+	TIGR02181, GRX_bact, Glutaredoxin, GrxC family	NA|322aa|down_4|NZ_AP018254.1_275918_276884_+	PRK05246, PRK05246, glutathione synthetase; Provisional	NA|104aa|down_5|NZ_AP018254.1_276955_277267_+	NA	NA|482aa|down_6|NZ_AP018254.1_277343_278789_+	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|390aa|down_7|NZ_AP018254.1_278914_280084_+	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|202aa|down_8|NZ_AP018254.1_280130_280736_+	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|739aa|down_9|NZ_AP018254.1_280908_283125_-	cd05387, BY-kinase, bacterial tyrosine-kinase
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	4	459474-459554	4	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GGATTACTCTTTGTCAGGAGTGTACAC	27	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|59aa|up_8|NZ_AP018254.1_449219_449396_+,NA|123aa|down_3|NZ_AP018254.1_463913_464282_+,NA|92aa|down_4|NZ_AP018254.1_464285_464561_-,NA|73aa|down_7|NZ_AP018254.1_468360_468579_+	NA|228aa|up_9|NZ_AP018254.1_448494_449178_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|59aa|up_8|NZ_AP018254.1_449219_449396_+	NA	NA|282aa|up_7|NZ_AP018254.1_449910_450756_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|315aa|up_6|NZ_AP018254.1_451075_452020_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|360aa|up_5|NZ_AP018254.1_452145_453225_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|340aa|up_4|NZ_AP018254.1_453628_454648_+	pfam04371, PAD_porph, Porphyromonas-type peptidyl-arginine deiminase	NA|280aa|up_3|NZ_AP018254.1_454807_455647_+	TIGR03381, putative_carbon-nitrogen_hydrolase, N-carbamoylputrescine amidase	NA|306aa|up_2|NZ_AP018254.1_455993_456911_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|403aa|up_1|NZ_AP018254.1_456950_458159_-	cd03804, GT4_WbaZ-like, mannosyltransferase WbaZ and similar proteins	NA|323aa|up_0|NZ_AP018254.1_458472_459441_-	pfam03186, CobD_Cbib, CobD/Cbib protein	NA|148aa|down_0|NZ_AP018254.1_459694_460138_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|138aa|down_1|NZ_AP018254.1_460658_461072_+	pfam14250, AbrB-like, AbrB-like transcriptional regulator	NA|676aa|down_2|NZ_AP018254.1_461287_463315_-	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|123aa|down_3|NZ_AP018254.1_463913_464282_+	NA	NA|92aa|down_4|NZ_AP018254.1_464285_464561_-	NA	NA|270aa|down_5|NZ_AP018254.1_464717_465527_+	pfam12705, PDDEXK_1, PD-(D/E)XK nuclease superfamily	NA|521aa|down_6|NZ_AP018254.1_466134_467697_+	pfam00563, EAL, EAL domain	NA|73aa|down_7|NZ_AP018254.1_468360_468579_+	NA	NA|553aa|down_8|NZ_AP018254.1_468832_470491_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|288aa|down_9|NZ_AP018254.1_471284_472148_-	cd00293, USP_Like, Usp: Universal stress protein family
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	5	518056-518160	5	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GCGAAGCGTTGCCACAGGCTATTAC	25	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|95aa|down_0|NZ_AP018254.1_519098_519383_+,NA|104aa|down_1|NZ_AP018254.1_519385_519697_+,NA|123aa|down_3|NZ_AP018254.1_520888_521257_+,NA|189aa|down_5|NZ_AP018254.1_522158_522725_+,NA|251aa|down_6|NZ_AP018254.1_523102_523855_+,NA|145aa|down_9|NZ_AP018254.1_527814_528249_+	NA|366aa|up_9|NZ_AP018254.1_500617_501715_-	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|525aa|up_8|NZ_AP018254.1_502453_504028_+	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|400aa|up_7|NZ_AP018254.1_504250_505450_-	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|634aa|up_6|NZ_AP018254.1_506468_508370_+	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|715aa|up_5|NZ_AP018254.1_508617_510762_+	cd07496, Peptidases_S8_13, Peptidase S8 family domain, uncharacterized subfamily 13	NA|824aa|up_4|NZ_AP018254.1_510945_513417_+	pfam12770, CHAT, CHAT domain	NA|419aa|up_3|NZ_AP018254.1_513773_515030_+	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|130aa|up_2|NZ_AP018254.1_515346_515736_+	pfam14213, DUF4325, STAS-like domain of unknown function (DUF4325)	NA|188aa|up_1|NZ_AP018254.1_515732_516296_+	pfam01850, PIN, PIN domain	NA|312aa|up_0|NZ_AP018254.1_516309_517245_-	TIGR01139, Cysteine_synthase, cysteine synthase A	NA|95aa|down_0|NZ_AP018254.1_519098_519383_+	NA	NA|104aa|down_1|NZ_AP018254.1_519385_519697_+	NA	NA|112aa|down_2|NZ_AP018254.1_520198_520534_+	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|123aa|down_3|NZ_AP018254.1_520888_521257_+	NA	NA|180aa|down_4|NZ_AP018254.1_521428_521968_-	pfam03358, FMN_red, NADPH-dependent FMN reductase	NA|189aa|down_5|NZ_AP018254.1_522158_522725_+	NA	NA|251aa|down_6|NZ_AP018254.1_523102_523855_+	NA	NA|316aa|down_7|NZ_AP018254.1_525249_526197_-	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|242aa|down_8|NZ_AP018254.1_526928_527654_+	cd01835, SGNH_hydrolase_like_3, SGNH_hydrolase subfamily	NA|145aa|down_9|NZ_AP018254.1_527814_528249_+	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	6	570165-570235	6	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GGTTGACGTTGAAGGGTTTGCAC	23	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|95aa|up_8|NZ_AP018254.1_551097_551382_+,NA|125aa|up_6|NZ_AP018254.1_553602_553977_-,NA|90aa|up_3|NZ_AP018254.1_560929_561199_-,NA|214aa|up_0|NZ_AP018254.1_566446_567088_+,NA|740aa|down_3|NZ_AP018254.1_575853_578073_-,NA|1564aa|down_4|NZ_AP018254.1_578126_582818_-,NA|153aa|down_6|NZ_AP018254.1_586914_587373_-	NA|424aa|up_9|NZ_AP018254.1_549043_550315_+	COG0763, LpxB, Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane]	NA|95aa|up_8|NZ_AP018254.1_551097_551382_+	NA	NA|407aa|up_7|NZ_AP018254.1_551804_553025_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|125aa|up_6|NZ_AP018254.1_553602_553977_-	NA	NA|576aa|up_5|NZ_AP018254.1_554984_556712_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|861aa|up_4|NZ_AP018254.1_557651_560234_-	COG0308, PepN, Aminopeptidase N [Amino acid transport and metabolism]	NA|90aa|up_3|NZ_AP018254.1_560929_561199_-	NA	NA|827aa|up_2|NZ_AP018254.1_562676_565157_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|356aa|up_1|NZ_AP018254.1_565173_566241_-	cd17033, DR1245-like, possible type III secretion system (T3SS) chaperone protein DR1245 found in Deinococcus radiodurans	NA|214aa|up_0|NZ_AP018254.1_566446_567088_+	NA	NA|82aa|down_0|NZ_AP018254.1_572092_572338_+	pfam05708, Peptidase_C92, Permuted papain-like amidase enzyme, YaeF/YiiX, C92 family	NA|355aa|down_1|NZ_AP018254.1_572490_573555_+	sd00033, LRR_RI, leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily	NA|586aa|down_2|NZ_AP018254.1_573675_575433_-	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|740aa|down_3|NZ_AP018254.1_575853_578073_-	NA	NA|1564aa|down_4|NZ_AP018254.1_578126_582818_-	NA	NA|1306aa|down_5|NZ_AP018254.1_582822_586740_-	TIGR02243, hypothetical_protein_SCD8A	NA|153aa|down_6|NZ_AP018254.1_586914_587373_-	NA	NA|1232aa|down_7|NZ_AP018254.1_587497_591193_-	TIGR02243, hypothetical_protein_SCD8A	NA|163aa|down_8|NZ_AP018254.1_591214_591703_-	pfam04965, GPW_gp25, Gene 25-like lysozyme	NA|98aa|down_9|NZ_AP018254.1_591812_592106_-	cd14738, PAAR_2, proline-alanine-alanine-arginine (PAAR) domain
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	7	767781-767887	7	CRISPRCasFinder	no	cas14j	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Unclear	ATTATTACCCATTCCCGACTCCCGACTCCC	30	1	1	767811-767857	NZ_AP018254.1_767895-767941	NA	1	1	TypeV	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|166aa|up_9|NZ_AP018254.1_754623_755121_-,NA|158aa|up_2|NZ_AP018254.1_764928_765402_+,NA|205aa|up_1|NZ_AP018254.1_765843_766458_+,NA	NA|166aa|up_9|NZ_AP018254.1_754623_755121_-	NA	cas14j|375aa|up_8|NZ_AP018254.1_755758_756883_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|88aa|up_7|NZ_AP018254.1_757094_757358_+	pfam14384, BrnA_antitoxin, BrnA antitoxin of type II toxin-antitoxin system	NA|406aa|up_6|NZ_AP018254.1_757369_758587_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|387aa|up_5|NZ_AP018254.1_759622_760783_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|762aa|up_4|NZ_AP018254.1_761006_763292_-	pfam00196, GerE, Bacterial regulatory proteins, luxR family	NA|121aa|up_3|NZ_AP018254.1_764462_764825_+	cd07177, terB_like, tellurium resistance terB-like protein	NA|158aa|up_2|NZ_AP018254.1_764928_765402_+	NA	NA|205aa|up_1|NZ_AP018254.1_765843_766458_+	NA	NA|315aa|up_0|NZ_AP018254.1_766631_767576_+	pfam01891, CbiM, Cobalt uptake substrate-specific transmembrane region	NA|417aa|down_0|NZ_AP018254.1_768757_770008_-	COG1819, COG1819, Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms]	NA|215aa|down_1|NZ_AP018254.1_770851_771496_+	PRK00001, rplC, 50S ribosomal protein L3; Validated	NA|211aa|down_2|NZ_AP018254.1_771570_772203_+	PRK05319, rplD, 50S ribosomal protein L4; Provisional	NA|109aa|down_3|NZ_AP018254.1_772195_772522_+	pfam00276, Ribosomal_L23, Ribosomal protein L23	NA|288aa|down_4|NZ_AP018254.1_772782_773646_+	PRK09374, rplB, 50S ribosomal protein L2; Validated	NA|93aa|down_5|NZ_AP018254.1_773752_774031_+	PRK00357, rpsS, 30S ribosomal protein S19; Reviewed	NA|120aa|down_6|NZ_AP018254.1_774033_774393_+	CHL00034, rpl22, ribosomal protein L22	NA|252aa|down_7|NZ_AP018254.1_774468_775224_+	PRK00310, rpsC, 30S ribosomal protein S3; Reviewed	NA|144aa|down_8|NZ_AP018254.1_775340_775772_+	PRK09203, rplP, 50S ribosomal protein L16; Reviewed	NA|78aa|down_9|NZ_AP018254.1_775776_776010_+	CHL00154, rpl29, ribosomal protein L29; Validated
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	8	1115350-1115841	1,8,1	PILER-CR,CRISPRCasFinder,CRT	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	6,6,6	6	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|86aa|up_1|NZ_AP018254.1_1112858_1113116_-,NA|50aa|down_1|NZ_AP018254.1_1116834_1116984_-,NA|88aa|down_4|NZ_AP018254.1_1119436_1119700_-,NA|57aa|down_5|NZ_AP018254.1_1120394_1120565_+	NA|392aa|up_9|NZ_AP018254.1_1100841_1102017_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|183aa|up_8|NZ_AP018254.1_1102369_1102918_-	COG3688, COG3688, Predicted RNA-binding protein containing a PIN domain [General function prediction only]	NA|442aa|up_7|NZ_AP018254.1_1103297_1104623_-	TIGR01964, chpXY, CO2 hydration protein	NA|520aa|up_6|NZ_AP018254.1_1105042_1106602_-	PRK07363, PRK07363, NADH-quinone oxidoreductase subunit M	NA|631aa|up_5|NZ_AP018254.1_1106743_1108636_-	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|267aa|up_4|NZ_AP018254.1_1109142_1109943_-	COG1691, COG1691, NCAIR mutase (PurE)-related proteins [General function prediction only]	NA|240aa|up_3|NZ_AP018254.1_1110149_1110869_-	pfam00226, DnaJ, DnaJ domain	NA|426aa|up_2|NZ_AP018254.1_1111547_1112825_-	COG2821, MltA, Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]	NA|86aa|up_1|NZ_AP018254.1_1112858_1113116_-	NA	NA|399aa|up_0|NZ_AP018254.1_1113549_1114746_+	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|242aa|down_0|NZ_AP018254.1_1115859_1116585_-	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|50aa|down_1|NZ_AP018254.1_1116834_1116984_-	NA	NA|144aa|down_2|NZ_AP018254.1_1117105_1117537_+	cd16358, GlxI_Ni, Glyoxalase I that uses Ni(++) as cofactor	NA|569aa|down_3|NZ_AP018254.1_1117575_1119282_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|88aa|down_4|NZ_AP018254.1_1119436_1119700_-	NA	NA|57aa|down_5|NZ_AP018254.1_1120394_1120565_+	NA	NA|163aa|down_6|NZ_AP018254.1_1121031_1121520_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|286aa|down_7|NZ_AP018254.1_1121662_1122520_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|363aa|down_8|NZ_AP018254.1_1122942_1124031_-	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|375aa|down_9|NZ_AP018254.1_1124205_1125330_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	9	1223637-1224131	2,9,2	PILER-CR,CRISPRCasFinder,CRT	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTTTCCAATCAATTATTTCCCCTCATAAGAGGAGAC,GTTTCCAATCAATTATTTCCCCTCATAAGAGGAGAC	36,36,36	0	0	NA	NA	NA:NA:NA	4,6,6	6	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|71aa|up_8|NZ_AP018254.1_1211114_1211327_-,NA|62aa|up_6|NZ_AP018254.1_1212382_1212568_+,NA|509aa|up_3|NZ_AP018254.1_1217376_1218903_+,NA|60aa|up_1|NZ_AP018254.1_1220701_1220881_-,NA|46aa|down_0|NZ_AP018254.1_1225291_1225429_-,NA|223aa|down_7|NZ_AP018254.1_1245617_1246286_-	NA|512aa|up_9|NZ_AP018254.1_1208384_1209920_+	pfam02517, Abi, CAAX protease self-immunity	NA|71aa|up_8|NZ_AP018254.1_1211114_1211327_-	NA	NA|92aa|up_7|NZ_AP018254.1_1211667_1211943_+	COG2314, XynA, Predicted membrane protein [Function unknown]	NA|62aa|up_6|NZ_AP018254.1_1212382_1212568_+	NA	NA|796aa|up_5|NZ_AP018254.1_1212667_1215055_+	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|450aa|up_4|NZ_AP018254.1_1215529_1216879_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|509aa|up_3|NZ_AP018254.1_1217376_1218903_+	NA	NA|500aa|up_2|NZ_AP018254.1_1218971_1220471_+	COG3046, COG3046, Uncharacterized protein related to deoxyribodipyrimidine photolyase [General function prediction only]	NA|60aa|up_1|NZ_AP018254.1_1220701_1220881_-	NA	NA|602aa|up_0|NZ_AP018254.1_1220985_1222791_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|46aa|down_0|NZ_AP018254.1_1225291_1225429_-	NA	NA|188aa|down_1|NZ_AP018254.1_1225680_1226244_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|306aa|down_2|NZ_AP018254.1_1226235_1227153_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|808aa|down_3|NZ_AP018254.1_1228404_1230828_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|928aa|down_4|NZ_AP018254.1_1232497_1235281_-	TIGR01451, unnamed_protein_product, conserved repeat domain	NA|1509aa|down_5|NZ_AP018254.1_1235686_1240213_-	COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal transduction mechanisms]	NA|605aa|down_6|NZ_AP018254.1_1243796_1245611_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|223aa|down_7|NZ_AP018254.1_1245617_1246286_-	NA	NA|148aa|down_8|NZ_AP018254.1_1247742_1248186_+	cd07176, terB, tellurite resistance protein terB	NA|514aa|down_9|NZ_AP018254.1_1248292_1249834_+	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	10	1224276-1225137	10,3,3	CRISPRCasFinder,CRT,PILER-CR	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTTTCCAATCAATTATTTCCCCTCATAAGAGGAGAC,GTTTCCAATCAATTATTTCCCCTCATAAGAGGAGAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	11,11,10	11	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|71aa|up_8|NZ_AP018254.1_1211114_1211327_-,NA|62aa|up_6|NZ_AP018254.1_1212382_1212568_+,NA|509aa|up_3|NZ_AP018254.1_1217376_1218903_+,NA|60aa|up_1|NZ_AP018254.1_1220701_1220881_-,NA|46aa|down_0|NZ_AP018254.1_1225291_1225429_-,NA|223aa|down_7|NZ_AP018254.1_1245617_1246286_-	NA|512aa|up_9|NZ_AP018254.1_1208384_1209920_+	pfam02517, Abi, CAAX protease self-immunity	NA|71aa|up_8|NZ_AP018254.1_1211114_1211327_-	NA	NA|92aa|up_7|NZ_AP018254.1_1211667_1211943_+	COG2314, XynA, Predicted membrane protein [Function unknown]	NA|62aa|up_6|NZ_AP018254.1_1212382_1212568_+	NA	NA|796aa|up_5|NZ_AP018254.1_1212667_1215055_+	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|450aa|up_4|NZ_AP018254.1_1215529_1216879_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|509aa|up_3|NZ_AP018254.1_1217376_1218903_+	NA	NA|500aa|up_2|NZ_AP018254.1_1218971_1220471_+	COG3046, COG3046, Uncharacterized protein related to deoxyribodipyrimidine photolyase [General function prediction only]	NA|60aa|up_1|NZ_AP018254.1_1220701_1220881_-	NA	NA|602aa|up_0|NZ_AP018254.1_1220985_1222791_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|46aa|down_0|NZ_AP018254.1_1225291_1225429_-	NA	NA|188aa|down_1|NZ_AP018254.1_1225680_1226244_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|306aa|down_2|NZ_AP018254.1_1226235_1227153_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|808aa|down_3|NZ_AP018254.1_1228404_1230828_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|928aa|down_4|NZ_AP018254.1_1232497_1235281_-	TIGR01451, unnamed_protein_product, conserved repeat domain	NA|1509aa|down_5|NZ_AP018254.1_1235686_1240213_-	COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal transduction mechanisms]	NA|605aa|down_6|NZ_AP018254.1_1243796_1245611_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|223aa|down_7|NZ_AP018254.1_1245617_1246286_-	NA	NA|148aa|down_8|NZ_AP018254.1_1247742_1248186_+	cd07176, terB, tellurite resistance protein terB	NA|514aa|down_9|NZ_AP018254.1_1248292_1249834_+	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	11	1288518-1290511	4,11,4,5	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type III-D,Type III-C,Type III-A,Type III-B	GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAACA	36,36,36,37	0	0	NA	NA	NA:NA:NA:NA	25,26,26,25	26	TypeIII-B,TypeIII-C,TypeIII-D,TypeIII-A	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|1596aa|up_7|NZ_AP018254.1_1275861_1280649_-,csx18|85aa|up_2|NZ_AP018254.1_1286566_1286821_+,NA|297aa|down_0|NZ_AP018254.1_1290768_1291659_+,NA|101aa|down_1|NZ_AP018254.1_1291844_1292147_-,cmr5gr11|133aa|down_5|NZ_AP018254.1_1297453_1297852_+,NA|522aa|down_7|NZ_AP018254.1_1299959_1301525_+,NA|52aa|down_9|NZ_AP018254.1_1302529_1302685_+	NA|358aa|up_9|NZ_AP018254.1_1273746_1274820_-	COG4795, PulJ, Type II secretory pathway, component PulJ [Intracellular trafficking and secretion]	NA|221aa|up_8|NZ_AP018254.1_1274946_1275609_-	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|1596aa|up_7|NZ_AP018254.1_1275861_1280649_-	NA	NA|79aa|up_6|NZ_AP018254.1_1281274_1281511_-	PRK00359, rpmB, 50S ribosomal protein L28; Reviewed	NA|653aa|up_5|NZ_AP018254.1_1281658_1283617_-	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|97aa|up_4|NZ_AP018254.1_1284755_1285046_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	WYL|383aa|up_3|NZ_AP018254.1_1285107_1286256_-	pfam13280, WYL, WYL domain	csx18|85aa|up_2|NZ_AP018254.1_1286566_1286821_+	NA	cas1|331aa|up_1|NZ_AP018254.1_1287059_1288052_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|94aa|up_0|NZ_AP018254.1_1288051_1288333_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|297aa|down_0|NZ_AP018254.1_1290768_1291659_+	NA	NA|101aa|down_1|NZ_AP018254.1_1291844_1292147_-	NA	cas10|1026aa|down_2|NZ_AP018254.1_1292240_1295318_+	pfam12469, DUF3692, CRISPR-associated protein	cmr3gr5|366aa|down_3|NZ_AP018254.1_1295304_1296402_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|292aa|down_4|NZ_AP018254.1_1296535_1297411_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|133aa|down_5|NZ_AP018254.1_1297453_1297852_+	NA	csm3gr7|651aa|down_6|NZ_AP018254.1_1297848_1299801_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|522aa|down_7|NZ_AP018254.1_1299959_1301525_+	NA	NA|190aa|down_8|NZ_AP018254.1_1301638_1302208_-	pfam05685, Uma2, Putative restriction endonuclease	NA|52aa|down_9|NZ_AP018254.1_1302529_1302685_+	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	12	1404995-1405093	12	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GACGTTGCGTAGAGACGTAGCATTGCTA	28	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|59aa|up_5|NZ_AP018254.1_1401065_1401242_-,NA|80aa|up_2|NZ_AP018254.1_1403407_1403647_-,NA|75aa|up_1|NZ_AP018254.1_1404225_1404450_+,NA|59aa|down_2|NZ_AP018254.1_1407781_1407958_-	NA|263aa|up_9|NZ_AP018254.1_1394852_1395641_+	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase	NA|416aa|up_8|NZ_AP018254.1_1395987_1397235_-	PRK00854, rocD, ornithine--oxo-acid transaminase; Reviewed	NA|161aa|up_7|NZ_AP018254.1_1398150_1398633_-	pfam04134, DUF393, Protein of unknown function, DUF393	NA|557aa|up_6|NZ_AP018254.1_1399010_1400681_+	PRK12561, PRK12561, NAD(P)H-quinone oxidoreductase subunit 4; Provisional	NA|59aa|up_5|NZ_AP018254.1_1401065_1401242_-	NA	NA|472aa|up_4|NZ_AP018254.1_1401391_1402807_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|154aa|up_3|NZ_AP018254.1_1402949_1403411_-	pfam01844, HNH, HNH endonuclease	NA|80aa|up_2|NZ_AP018254.1_1403407_1403647_-	NA	NA|75aa|up_1|NZ_AP018254.1_1404225_1404450_+	NA	NA|149aa|up_0|NZ_AP018254.1_1404479_1404926_-	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|362aa|down_0|NZ_AP018254.1_1405098_1406184_-	pfam17914, HopA1, HopA1 effector protein family	NA|412aa|down_1|NZ_AP018254.1_1406542_1407778_-	pfam01636, APH, Phosphotransferase enzyme family	NA|59aa|down_2|NZ_AP018254.1_1407781_1407958_-	NA	NA|452aa|down_3|NZ_AP018254.1_1408647_1410003_+	pfam05673, DUF815, Protein of unknown function (DUF815)	NA|397aa|down_4|NZ_AP018254.1_1409986_1411177_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|115aa|down_5|NZ_AP018254.1_1411327_1411672_+	PRK07451, PRK07451, translation initiation factor	NA|200aa|down_6|NZ_AP018254.1_1412048_1412648_-	pfam08239, SH3_3, Bacterial SH3 domain	NA|65aa|down_7|NZ_AP018254.1_1412711_1412906_-	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|1064aa|down_8|NZ_AP018254.1_1413194_1416386_+	pfam00873, ACR_tran, AcrB/AcrD/AcrF family	NA|218aa|down_9|NZ_AP018254.1_1416532_1417186_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	13	1483597-1483711	13	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	AATTTTCCTTCTACTCCCTATTCACTACTCCCCACTCCC	39	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|68aa|up_9|NZ_AP018254.1_1468085_1468289_+,NA|47aa|up_4|NZ_AP018254.1_1477069_1477210_-,NA|56aa|up_2|NZ_AP018254.1_1481691_1481859_-,NA|47aa|down_9|NZ_AP018254.1_1493488_1493629_+	NA|68aa|up_9|NZ_AP018254.1_1468085_1468289_+	NA	NA|579aa|up_8|NZ_AP018254.1_1469679_1471416_+	cd07478, Peptidases_S8_CspA-like, Peptidase S8 family domain in CspA-like proteins	NA|734aa|up_7|NZ_AP018254.1_1471466_1473668_-	TIGR02472, putative_sucrose-phosphate_synthase, sucrose-phosphate synthase, putative, glycosyltransferase domain	NA|380aa|up_6|NZ_AP018254.1_1474159_1475299_-	PRK05952, PRK05952, beta-ketoacyl-ACP synthase	NA|275aa|up_5|NZ_AP018254.1_1476074_1476899_-	cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) similar ot the Spinach thylakoid lumen protein TLP40	NA|47aa|up_4|NZ_AP018254.1_1477069_1477210_-	NA	NA|1247aa|up_3|NZ_AP018254.1_1477279_1481020_-	PLN02666, PLN02666, 5-oxoprolinase	NA|56aa|up_2|NZ_AP018254.1_1481691_1481859_-	NA	NA|155aa|up_1|NZ_AP018254.1_1482115_1482580_+	pfam08670, MEKHLA, MEKHLA domain	NA|121aa|up_0|NZ_AP018254.1_1483072_1483435_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|353aa|down_0|NZ_AP018254.1_1483989_1485048_-	PRK13396, PRK13396, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|160aa|down_1|NZ_AP018254.1_1485866_1486346_-	pfam11947, DUF3464, Protein of unknown function (DUF3464)	NA|90aa|down_2|NZ_AP018254.1_1486367_1486637_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|403aa|down_3|NZ_AP018254.1_1486813_1488022_+	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|76aa|down_4|NZ_AP018254.1_1488752_1488980_+	pfam02594, DUF167, Uncharacterized ACR, YggU family COG1872	NA|204aa|down_5|NZ_AP018254.1_1489074_1489686_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|397aa|down_6|NZ_AP018254.1_1490427_1491618_-	COG0003, ArsA, Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|130aa|down_7|NZ_AP018254.1_1491683_1492073_-	pfam10184, DUF2358, Uncharacterized conserved protein (DUF2358)	NA|202aa|down_8|NZ_AP018254.1_1492182_1492788_-	pfam01947, DUF98, Protein of unknown function (DUF98)	NA|47aa|down_9|NZ_AP018254.1_1493488_1493629_+	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	14	1498437-1498512	14	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	ATTTTGGATTACTTTGGTCTCTT	23	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|47aa|up_5|NZ_AP018254.1_1493488_1493629_+,NA|259aa|down_4|NZ_AP018254.1_1504329_1505106_-,NA|189aa|down_5|NZ_AP018254.1_1505210_1505777_-	NA|204aa|up_9|NZ_AP018254.1_1489074_1489686_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|397aa|up_8|NZ_AP018254.1_1490427_1491618_-	COG0003, ArsA, Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|130aa|up_7|NZ_AP018254.1_1491683_1492073_-	pfam10184, DUF2358, Uncharacterized conserved protein (DUF2358)	NA|202aa|up_6|NZ_AP018254.1_1492182_1492788_-	pfam01947, DUF98, Protein of unknown function (DUF98)	NA|47aa|up_5|NZ_AP018254.1_1493488_1493629_+	NA	NA|454aa|up_4|NZ_AP018254.1_1493618_1494980_+	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|97aa|up_3|NZ_AP018254.1_1495013_1495304_-	pfam02325, YGGT, YGGT family	NA|40aa|up_2|NZ_AP018254.1_1495849_1495969_+	pfam06596, PsbX, Photosystem II reaction centre X protein (PsbX)	NA|326aa|up_1|NZ_AP018254.1_1496342_1497320_+	pfam07444, Ycf66_N, Ycf66 protein N-terminus	NA|186aa|up_0|NZ_AP018254.1_1497850_1498408_+	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|173aa|down_0|NZ_AP018254.1_1498943_1499462_+	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|336aa|down_1|NZ_AP018254.1_1499468_1500476_-	PRK12577, PRK12577, succinate dehydrogenase/fumarate reductase iron-sulfur subunit	NA|139aa|down_2|NZ_AP018254.1_1502888_1503305_+	pfam14250, AbrB-like, AbrB-like transcriptional regulator	NA|177aa|down_3|NZ_AP018254.1_1503703_1504234_-	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|259aa|down_4|NZ_AP018254.1_1504329_1505106_-	NA	NA|189aa|down_5|NZ_AP018254.1_1505210_1505777_-	NA	NA|468aa|down_6|NZ_AP018254.1_1506794_1508198_-	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|260aa|down_7|NZ_AP018254.1_1508347_1509127_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|148aa|down_8|NZ_AP018254.1_1509628_1510072_+	cd06987, cupin_MAE_RS03005, Microcystis aeruginosa MAE_RS03005 and related proteins, cupin domain	NA|442aa|down_9|NZ_AP018254.1_1510489_1511815_-	PRK07575, PRK07575, dihydroorotase; Provisional
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	15	1576022-1576126	15	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	AATTACGTAGCTTGCTTCCCGCGTAGCGGGTATTA	35	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|329aa|up_2|NZ_AP018254.1_1572693_1573680_-,NA|397aa|up_1|NZ_AP018254.1_1573672_1574863_-,NA|160aa|up_0|NZ_AP018254.1_1575096_1575576_+,NA|66aa|down_1|NZ_AP018254.1_1578243_1578441_+,NA|68aa|down_4|NZ_AP018254.1_1581930_1582134_+	NA|349aa|up_9|NZ_AP018254.1_1564071_1565118_-	PRK09604, PRK09604, tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD	NA|165aa|up_8|NZ_AP018254.1_1565771_1566266_+	CHL00132, psaF, photosystem I subunit III; Validated	NA|44aa|up_7|NZ_AP018254.1_1566374_1566506_+	PRK02733, PRK02733, photosystem I reaction center subunit IX; Provisional	NA|173aa|up_6|NZ_AP018254.1_1566934_1567453_+	pfam02605, PsaL, Photosystem I reaction centre subunit XI	NA|236aa|up_5|NZ_AP018254.1_1567909_1568617_-	PRK00300, gmk, guanylate kinase; Provisional	NA|89aa|up_4|NZ_AP018254.1_1568843_1569110_-	PRK04323, PRK04323, hypothetical protein; Provisional	NA|781aa|up_3|NZ_AP018254.1_1569897_1572240_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|329aa|up_2|NZ_AP018254.1_1572693_1573680_-	NA	NA|397aa|up_1|NZ_AP018254.1_1573672_1574863_-	NA	NA|160aa|up_0|NZ_AP018254.1_1575096_1575576_+	NA	NA|324aa|down_0|NZ_AP018254.1_1576162_1577134_-	TIGR02749, Prenyl_transferase, solanesyl diphosphate synthase	NA|66aa|down_1|NZ_AP018254.1_1578243_1578441_+	NA	NA|309aa|down_2|NZ_AP018254.1_1578470_1579397_-	PRK00865, PRK00865, glutamate racemase; Provisional	NA|632aa|down_3|NZ_AP018254.1_1579629_1581525_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|68aa|down_4|NZ_AP018254.1_1581930_1582134_+	NA	NA|454aa|down_5|NZ_AP018254.1_1582471_1583833_-	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|244aa|down_6|NZ_AP018254.1_1584082_1584814_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|615aa|down_7|NZ_AP018254.1_1585058_1586903_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|287aa|down_8|NZ_AP018254.1_1588014_1588875_+	sd00006, TPR, Tetratricopeptide repeat	NA|132aa|down_9|NZ_AP018254.1_1588985_1589381_-	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	16	1620356-1620466	16	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTTTCCAACTAATCCAATTTAACCCAATCGGTAGGG	36	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|76aa|up_1|NZ_AP018254.1_1617198_1617426_+,NA|724aa|up_0|NZ_AP018254.1_1617663_1619835_+,NA	NA|229aa|up_9|NZ_AP018254.1_1610090_1610777_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|198aa|up_8|NZ_AP018254.1_1610879_1611473_-	pfam05724, TPMT, Thiopurine S-methyltransferase (TPMT)	NA|483aa|up_7|NZ_AP018254.1_1611570_1613019_-	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|188aa|up_6|NZ_AP018254.1_1613086_1613650_+	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|178aa|up_5|NZ_AP018254.1_1614139_1614673_+	cd14767, PE_beta-like, Phycoerythrin beta subunit, a component of the phycobilisome rod; and related proteins	NA|165aa|up_4|NZ_AP018254.1_1614789_1615284_+	cd14769, PE_alpha, Phycoerythrin alpha subunit, a phycobilisome rod component	NA|70aa|up_3|NZ_AP018254.1_1615480_1615690_+	pfam14430, Imm1, Immunity protein Imm1	NA|449aa|up_2|NZ_AP018254.1_1615710_1617057_-	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|76aa|up_1|NZ_AP018254.1_1617198_1617426_+	NA	NA|724aa|up_0|NZ_AP018254.1_1617663_1619835_+	NA	NA|289aa|down_0|NZ_AP018254.1_1620726_1621593_-	PRK02755, truB, tRNA pseudouridine synthase B; Provisional	NA|377aa|down_1|NZ_AP018254.1_1621755_1622886_-	pfam10216, ChpXY, CO2 hydration protein (ChpXY)	NA|493aa|down_2|NZ_AP018254.1_1623022_1624501_-	PRK06473, PRK06473, NADH-quinone oxidoreductase subunit M	NA|619aa|down_3|NZ_AP018254.1_1624639_1626496_-	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|103aa|down_4|NZ_AP018254.1_1627323_1627632_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|115aa|down_5|NZ_AP018254.1_1627846_1628191_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|101aa|down_6|NZ_AP018254.1_1628207_1628510_+	COG4576, CcmL, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|572aa|down_7|NZ_AP018254.1_1628984_1630700_+	cd00710, LbH_gamma_CA, Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|251aa|down_8|NZ_AP018254.1_1630996_1631749_+	cd03360, LbH_AT_putative, Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain	NA|261aa|down_9|NZ_AP018254.1_1631790_1632573_+	COG4577, CcmK, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	17	1622931-1623020	17	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GCTTCCCGCATAGCGGGTATTACG	24	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|76aa|up_3|NZ_AP018254.1_1617198_1617426_+,NA|724aa|up_2|NZ_AP018254.1_1617663_1619835_+,NA	NA|483aa|up_9|NZ_AP018254.1_1611570_1613019_-	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|188aa|up_8|NZ_AP018254.1_1613086_1613650_+	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|178aa|up_7|NZ_AP018254.1_1614139_1614673_+	cd14767, PE_beta-like, Phycoerythrin beta subunit, a component of the phycobilisome rod; and related proteins	NA|165aa|up_6|NZ_AP018254.1_1614789_1615284_+	cd14769, PE_alpha, Phycoerythrin alpha subunit, a phycobilisome rod component	NA|70aa|up_5|NZ_AP018254.1_1615480_1615690_+	pfam14430, Imm1, Immunity protein Imm1	NA|449aa|up_4|NZ_AP018254.1_1615710_1617057_-	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|76aa|up_3|NZ_AP018254.1_1617198_1617426_+	NA	NA|724aa|up_2|NZ_AP018254.1_1617663_1619835_+	NA	NA|289aa|up_1|NZ_AP018254.1_1620726_1621593_-	PRK02755, truB, tRNA pseudouridine synthase B; Provisional	NA|377aa|up_0|NZ_AP018254.1_1621755_1622886_-	pfam10216, ChpXY, CO2 hydration protein (ChpXY)	NA|493aa|down_0|NZ_AP018254.1_1623022_1624501_-	PRK06473, PRK06473, NADH-quinone oxidoreductase subunit M	NA|619aa|down_1|NZ_AP018254.1_1624639_1626496_-	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|103aa|down_2|NZ_AP018254.1_1627323_1627632_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|115aa|down_3|NZ_AP018254.1_1627846_1628191_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|101aa|down_4|NZ_AP018254.1_1628207_1628510_+	COG4576, CcmL, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|572aa|down_5|NZ_AP018254.1_1628984_1630700_+	cd00710, LbH_gamma_CA, Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|251aa|down_6|NZ_AP018254.1_1630996_1631749_+	cd03360, LbH_AT_putative, Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain	NA|261aa|down_7|NZ_AP018254.1_1631790_1632573_+	COG4577, CcmK, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|305aa|down_8|NZ_AP018254.1_1633812_1634727_+	cd08419, PBP2_CbbR_RubisCO_like, The C-terminal substrate binding of LysR-type transcriptional regulator (CbbR) of RubisCO operon, which is involved in the carbon dioxide fixation, contains the type 2 periplasmic binding fold	NA|540aa|down_9|NZ_AP018254.1_1634824_1636444_+	TIGR02231, Hypothetical_protein_ZK1055
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	18	1675625-1675729	18	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CTACATACAGGACTTATCTTGACATGTCAATAACAC	36	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|64aa|down_0|NZ_AP018254.1_1676293_1676485_+,NA|136aa|down_1|NZ_AP018254.1_1677382_1677790_+,NA|465aa|down_2|NZ_AP018254.1_1677908_1679303_+,NA|953aa|down_5|NZ_AP018254.1_1682589_1685448_+,NA|143aa|down_7|NZ_AP018254.1_1687059_1687488_-	NA|403aa|up_9|NZ_AP018254.1_1656193_1657402_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|551aa|up_8|NZ_AP018254.1_1658256_1659909_+	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|214aa|up_7|NZ_AP018254.1_1660316_1660958_+	pfam12787, EcsC, EcsC protein family	NA|872aa|up_6|NZ_AP018254.1_1661253_1663869_-	PRK09238, PRK09238, bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase; Validated	NA|106aa|up_5|NZ_AP018254.1_1664165_1664483_+	TIGR02008, Ferredoxin_root_R-B1, ferredoxin [2Fe-2S]	NA|259aa|up_4|NZ_AP018254.1_1664718_1665495_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|575aa|up_3|NZ_AP018254.1_1666091_1667816_+	COG4986, COG4986, ABC-type anion transport system, duplicated permease component [Inorganic ion transport and metabolism]	NA|459aa|up_2|NZ_AP018254.1_1667832_1669209_+	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|786aa|up_1|NZ_AP018254.1_1671151_1673509_-	pfam13191, AAA_16, AAA ATPase domain	NA|599aa|up_0|NZ_AP018254.1_1673609_1675406_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|64aa|down_0|NZ_AP018254.1_1676293_1676485_+	NA	NA|136aa|down_1|NZ_AP018254.1_1677382_1677790_+	NA	NA|465aa|down_2|NZ_AP018254.1_1677908_1679303_+	NA	NA|498aa|down_3|NZ_AP018254.1_1679388_1680882_+	pfam05762, VWA_CoxE, VWA domain containing CoxE-like protein	NA|392aa|down_4|NZ_AP018254.1_1681052_1682228_+	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|953aa|down_5|NZ_AP018254.1_1682589_1685448_+	NA	NA|427aa|down_6|NZ_AP018254.1_1685558_1686839_-	PHA03100, PHA03100, ankyrin repeat protein; Provisional	NA|143aa|down_7|NZ_AP018254.1_1687059_1687488_-	NA	NA|491aa|down_8|NZ_AP018254.1_1688165_1689638_+	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|231aa|down_9|NZ_AP018254.1_1689838_1690531_+	cd06260, DUF820, Domain of unknown function (DUF820)
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	19	1728466-1728571	19	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	TTTCAATCCCTAATAGGGATTATTTGGAATAGTA	34	0	0	NA	NA	I-D,II-B	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|116aa|up_8|NZ_AP018254.1_1718027_1718375_-,NA|132aa|up_5|NZ_AP018254.1_1721677_1722073_-,NA|145aa|up_4|NZ_AP018254.1_1722350_1722785_+,NA|172aa|up_3|NZ_AP018254.1_1723218_1723734_-,NA|72aa|up_1|NZ_AP018254.1_1725520_1725736_-,NA|99aa|down_7|NZ_AP018254.1_1745154_1745451_+	NA|280aa|up_9|NZ_AP018254.1_1717187_1718027_+	pfam11353, DUF3153, Protein of unknown function (DUF3153)	NA|116aa|up_8|NZ_AP018254.1_1718027_1718375_-	NA	NA|417aa|up_7|NZ_AP018254.1_1718955_1720206_+	PLN00020, PLN00020, ribulose bisphosphate carboxylase/oxygenase activase -RuBisCO activase (RCA); Provisional	NA|150aa|up_6|NZ_AP018254.1_1720930_1721380_-	cd17618, REC_OmpR_PhoB, phosphoacceptor receiver (REC) domain of PhoB response regulator from the OmpR family	NA|132aa|up_5|NZ_AP018254.1_1721677_1722073_-	NA	NA|145aa|up_4|NZ_AP018254.1_1722350_1722785_+	NA	NA|172aa|up_3|NZ_AP018254.1_1723218_1723734_-	NA	NA|274aa|up_2|NZ_AP018254.1_1724006_1724828_-	PRK00281, PRK00281, undecaprenyl-diphosphate phosphatase	NA|72aa|up_1|NZ_AP018254.1_1725520_1725736_-	NA	NA|548aa|up_0|NZ_AP018254.1_1725818_1727462_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|361aa|down_0|NZ_AP018254.1_1729406_1730489_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|410aa|down_1|NZ_AP018254.1_1730699_1731929_-	COG2821, MltA, Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]	NA|568aa|down_2|NZ_AP018254.1_1732094_1733798_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|289aa|down_3|NZ_AP018254.1_1739116_1739983_-	smart00318, SNc, Staphylococcal nuclease homologues	NA|274aa|down_4|NZ_AP018254.1_1741997_1742819_+	TIGR00706, Putative_signal_peptide_peptidase_SppA, signal peptide peptidase SppA, 36K type	NA|134aa|down_5|NZ_AP018254.1_1743344_1743746_+	pfam07736, CM_1, Chorismate mutase type I	NA|391aa|down_6|NZ_AP018254.1_1743734_1744907_-	COG1565, COG1565, Uncharacterized conserved protein [Function unknown]	NA|99aa|down_7|NZ_AP018254.1_1745154_1745451_+	NA	NA|283aa|down_8|NZ_AP018254.1_1745456_1746305_+	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|433aa|down_9|NZ_AP018254.1_1746727_1748026_-	pfam04932, Wzy_C, O-Antigen ligase
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	20	1768784-1768895	20	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTTTCAATCCCTAATAGGGATTATTTGAAATTGTAAC	37	0	0	NA	NA	I-D,II-B	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|47aa|up_7|NZ_AP018254.1_1754544_1754685_+,NA|470aa|up_0|NZ_AP018254.1_1767259_1768669_+,NA|68aa|down_0|NZ_AP018254.1_1769189_1769393_-,NA|198aa|down_5|NZ_AP018254.1_1776196_1776790_-,NA|116aa|down_8|NZ_AP018254.1_1779223_1779571_-	NA|476aa|up_9|NZ_AP018254.1_1751785_1753213_-	PRK07362, PRK07362, NADP-dependent isocitrate dehydrogenase	NA|239aa|up_8|NZ_AP018254.1_1753776_1754493_+	pfam05419, GUN4, GUN4-like	NA|47aa|up_7|NZ_AP018254.1_1754544_1754685_+	NA	NA|456aa|up_6|NZ_AP018254.1_1754704_1756072_+	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|644aa|up_5|NZ_AP018254.1_1756753_1758685_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|712aa|up_4|NZ_AP018254.1_1758856_1760992_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|280aa|up_3|NZ_AP018254.1_1762131_1762971_+	COG0566, SpoU, rRNA methylases [Translation, ribosomal structure and biogenesis]	NA|679aa|up_2|NZ_AP018254.1_1763362_1765399_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|466aa|up_1|NZ_AP018254.1_1765843_1767241_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|470aa|up_0|NZ_AP018254.1_1767259_1768669_+	NA	NA|68aa|down_0|NZ_AP018254.1_1769189_1769393_-	NA	NA|478aa|down_1|NZ_AP018254.1_1769687_1771121_-	COG0786, GltS, Na+/glutamate symporter [Amino acid transport and metabolism]	NA|398aa|down_2|NZ_AP018254.1_1771236_1772430_-	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|175aa|down_3|NZ_AP018254.1_1772589_1773114_+	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|533aa|down_4|NZ_AP018254.1_1774208_1775807_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|198aa|down_5|NZ_AP018254.1_1776196_1776790_-	NA	NA|153aa|down_6|NZ_AP018254.1_1777182_1777641_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|203aa|down_7|NZ_AP018254.1_1778182_1778791_-	COG4445, MiaE, Hydroxylase for synthesis of 2-methylthio-cis-ribozeatin in tRNA [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|116aa|down_8|NZ_AP018254.1_1779223_1779571_-	NA	NA|1059aa|down_9|NZ_AP018254.1_1779660_1782837_-	COG0383, AMS1, Alpha-mannosidase [Carbohydrate transport and metabolism]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	21	1949484-1949751	6,5,21	PILER-CR,CRT,CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAACT,AGTTTCCAATCAATTATTTCCCCTCATAAGAGGAGAC,GTTTCCAATCAATTATTTCCCCTCATAAGAGGAGAC	37,37,36	0	0	NA	NA	NA:NA:NA	3,3,3	3	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|62aa|up_5|NZ_AP018254.1_1940348_1940534_+,NA|471aa|up_4|NZ_AP018254.1_1940785_1942198_-,NA	NA|102aa|up_9|NZ_AP018254.1_1936303_1936609_+	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|308aa|up_8|NZ_AP018254.1_1936616_1937540_+	PRK02645, ppnK, NAD(+) kinase	NA|218aa|up_7|NZ_AP018254.1_1937769_1938423_-	pfam09778, Guanylate_cyc_2, Guanylylate cyclase	NA|364aa|up_6|NZ_AP018254.1_1938805_1939897_-	PRK07409, PRK07409, threonine synthase; Validated	NA|62aa|up_5|NZ_AP018254.1_1940348_1940534_+	NA	NA|471aa|up_4|NZ_AP018254.1_1940785_1942198_-	NA	NA|554aa|up_3|NZ_AP018254.1_1942626_1944288_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|258aa|up_2|NZ_AP018254.1_1944940_1945714_+	pfam10186, Atg14, Vacuolar sorting 38 and autophagy-related subunit 14	NA|452aa|up_1|NZ_AP018254.1_1945956_1947312_-	pfam01520, Amidase_3, N-acetylmuramoyl-L-alanine amidase	NA|583aa|up_0|NZ_AP018254.1_1947729_1949478_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|453aa|down_0|NZ_AP018254.1_1949891_1951250_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|291aa|down_1|NZ_AP018254.1_1951594_1952467_+	PRK14186, PRK14186, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|309aa|down_2|NZ_AP018254.1_1952680_1953607_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|154aa|down_3|NZ_AP018254.1_1953757_1954219_+	COG1963, COG1963, Uncharacterized protein conserved in bacteria [Function unknown]	NA|94aa|down_4|NZ_AP018254.1_1954350_1954632_+	PRK14857, tatA, TatA/E family twin arginine-targeting protein translocase	NA|211aa|down_5|NZ_AP018254.1_1954847_1955480_+	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional	NA|260aa|down_6|NZ_AP018254.1_1955641_1956421_+	cd01828, sialate_O-acetylesterase_like2, sialate_O-acetylesterase_like subfamily of the SGNH-hydrolases, a diverse family of lipases and esterases	NA|264aa|down_7|NZ_AP018254.1_1956684_1957476_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|509aa|down_8|NZ_AP018254.1_1957739_1959266_+	pfam00743, FMO-like, Flavin-binding monooxygenase-like	NA|203aa|down_9|NZ_AP018254.1_1959538_1960147_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	22	2040037-2040148	22	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTTTCAATCCCTAATAGGGATTAATTTGAATTGCAAT	37	0	0	NA	NA	I-D,II-B	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|110aa|up_4|NZ_AP018254.1_2034853_2035183_-,NA	NA|640aa|up_9|NZ_AP018254.1_2026103_2028023_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|152aa|up_8|NZ_AP018254.1_2028041_2028497_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|807aa|up_7|NZ_AP018254.1_2029011_2031432_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|511aa|up_6|NZ_AP018254.1_2031870_2033403_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|119aa|up_5|NZ_AP018254.1_2034117_2034474_-	COG4980, GvpP, Gas vesicle protein [General function prediction only]	NA|110aa|up_4|NZ_AP018254.1_2034853_2035183_-	NA	NA|258aa|up_3|NZ_AP018254.1_2035666_2036440_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|533aa|up_2|NZ_AP018254.1_2036586_2038185_+	PRK05434, PRK05434, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|78aa|up_1|NZ_AP018254.1_2038476_2038710_+	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|190aa|up_0|NZ_AP018254.1_2039420_2039990_+	COG5549, COG5549, Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]	NA|335aa|down_0|NZ_AP018254.1_2040429_2041434_-	PRK07453, PRK07453, protochlorophyllide reductase	NA|380aa|down_1|NZ_AP018254.1_2041984_2043124_+	TIGR03169, selenide_water_dikinase_putative, pyridine nucleotide-disulfide oxidoreductase family protein	NA|459aa|down_2|NZ_AP018254.1_2043912_2045289_-	PRK14333, PRK14333, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|295aa|down_3|NZ_AP018254.1_2045495_2046380_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|96aa|down_4|NZ_AP018254.1_2047361_2047649_-	PRK02898, PRK02898, energy-coupling factor ABC transporter substrate-binding protein	NA|263aa|down_5|NZ_AP018254.1_2047620_2048409_-	PRK08319, PRK08319, energy-coupling factor ABC transporter permease	NA|453aa|down_6|NZ_AP018254.1_2050198_2051557_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|177aa|down_7|NZ_AP018254.1_2051770_2052301_-	PRK07414, PRK07414, P-loop NTPase family protein	NA|194aa|down_8|NZ_AP018254.1_2052790_2053372_+	cd01428, ADK, Adenylate kinase (ADK) catalyzes the reversible phosphoryl transfer from adenosine triphosphates (ATP) to adenosine monophosphates (AMP) and to yield adenosine diphosphates (ADP)	NA|270aa|down_9|NZ_AP018254.1_2054296_2055106_-	PRK14243, PRK14243, phosphate transporter ATP-binding protein; Provisional
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	23	2129115-2129227	23	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	TCGGAGAACGGAAAGCCTCTGCACGTCTAGGGATTTTGGA	40	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|51aa|up_8|NZ_AP018254.1_2119419_2119572_-,NA|127aa|up_4|NZ_AP018254.1_2123283_2123664_-,NA|203aa|up_0|NZ_AP018254.1_2128430_2129039_+,NA|61aa|down_2|NZ_AP018254.1_2131624_2131807_+,NA|91aa|down_7|NZ_AP018254.1_2137388_2137661_-	NA|356aa|up_9|NZ_AP018254.1_2118227_2119295_+	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|51aa|up_8|NZ_AP018254.1_2119419_2119572_-	NA	NA|387aa|up_7|NZ_AP018254.1_2119734_2120895_+	cd17370, MFS_MJ1317_like, MJ1317 and similar transporters of the Major Facilitator Superfamily	NA|304aa|up_6|NZ_AP018254.1_2121752_2122664_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|162aa|up_5|NZ_AP018254.1_2122759_2123245_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|127aa|up_4|NZ_AP018254.1_2123283_2123664_-	NA	NA|318aa|up_3|NZ_AP018254.1_2123960_2124914_-	PRK13022, secF, protein translocase subunit SecF	NA|477aa|up_2|NZ_AP018254.1_2124910_2126341_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|378aa|up_1|NZ_AP018254.1_2126547_2127681_-	smart00854, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|203aa|up_0|NZ_AP018254.1_2128430_2129039_+	NA	NA|215aa|down_0|NZ_AP018254.1_2129288_2129933_+	pfam10063, DUF2301, Uncharacterized integral membrane protein (DUF2301)	NA|242aa|down_1|NZ_AP018254.1_2130260_2130986_+	TIGR01198, 6-phosphogluconolactonase_6PGL	NA|61aa|down_2|NZ_AP018254.1_2131624_2131807_+	NA	NA|297aa|down_3|NZ_AP018254.1_2131998_2132889_+	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|118aa|down_4|NZ_AP018254.1_2132919_2133273_+	pfam00498, FHA, FHA domain	NA|247aa|down_5|NZ_AP018254.1_2133414_2134155_+	pfam13398, Peptidase_M50B, Peptidase M50B-like	NA|865aa|down_6|NZ_AP018254.1_2134264_2136859_+	cd07302, CHD, cyclase homology domain	NA|91aa|down_7|NZ_AP018254.1_2137388_2137661_-	NA	NA|247aa|down_8|NZ_AP018254.1_2138242_2138983_+	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|845aa|down_9|NZ_AP018254.1_2139526_2142061_+	TIGR00992, chloroplast_import-associated_channel_homolog, chloroplast envelope protein translocase, IAP75 family
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	24	2223942-2224036	24	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTACAGACGACCCGTCGGGTCGTCT	25	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|449aa|up_6|NZ_AP018254.1_2217198_2218545_-,NA|309aa|up_2|NZ_AP018254.1_2221902_2222829_-,NA|298aa|down_0|NZ_AP018254.1_2225449_2226343_-,NA|290aa|down_1|NZ_AP018254.1_2226505_2227375_-,NA|67aa|down_2|NZ_AP018254.1_2227454_2227655_+	NA|342aa|up_9|NZ_AP018254.1_2214471_2215497_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|242aa|up_8|NZ_AP018254.1_2215528_2216254_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|285aa|up_7|NZ_AP018254.1_2216315_2217170_-	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|449aa|up_6|NZ_AP018254.1_2217198_2218545_-	NA	NA|62aa|up_5|NZ_AP018254.1_2218583_2218769_-	pfam05050, Methyltransf_21, Methyltransferase FkbM domain	NA|295aa|up_4|NZ_AP018254.1_2219087_2219972_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|495aa|up_3|NZ_AP018254.1_2219965_2221450_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|309aa|up_2|NZ_AP018254.1_2221902_2222829_-	NA	NA|129aa|up_1|NZ_AP018254.1_2223185_2223572_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|81aa|up_0|NZ_AP018254.1_2223568_2223811_-	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|298aa|down_0|NZ_AP018254.1_2225449_2226343_-	NA	NA|290aa|down_1|NZ_AP018254.1_2226505_2227375_-	NA	NA|67aa|down_2|NZ_AP018254.1_2227454_2227655_+	NA	NA|66aa|down_3|NZ_AP018254.1_2227907_2228105_-	pfam10929, DUF2811, Protein of unknown function (DUF2811)	NA|424aa|down_4|NZ_AP018254.1_2228613_2229885_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|143aa|down_5|NZ_AP018254.1_2230041_2230470_-	pfam04343, DUF488, Protein of unknown function, DUF488	NA|271aa|down_6|NZ_AP018254.1_2232850_2233663_+	cd07572, nit, Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases)	NA|112aa|down_7|NZ_AP018254.1_2233774_2234110_-	PRK13697, PRK13697, cytochrome c6; Provisional	NA|370aa|down_8|NZ_AP018254.1_2234340_2235450_-	COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]	NA|191aa|down_9|NZ_AP018254.1_2235454_2236027_-	PRK14003, PRK14003, K(+)-transporting ATPase subunit C
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	25	2332504-2332613	25	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CAACCAATTATTGAAGATAGTCAATAGGCTCAAACGGT	38	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|51aa|up_3|NZ_AP018254.1_2327199_2327352_-,NA|144aa|up_2|NZ_AP018254.1_2327587_2328019_-,NA|56aa|up_0|NZ_AP018254.1_2332164_2332332_-,NA|48aa|down_3|NZ_AP018254.1_2335778_2335922_-,NA|227aa|down_4|NZ_AP018254.1_2335938_2336619_-,NA|381aa|down_7|NZ_AP018254.1_2341786_2342929_+	NA|247aa|up_9|NZ_AP018254.1_2320127_2320868_-	COG5514, COG5514, Uncharacterized conserved protein [Function unknown]	NA|121aa|up_8|NZ_AP018254.1_2321193_2321556_+	pfam06967, Mo-nitro_C, Mo-dependent nitrogenase C-terminus	NA|91aa|up_7|NZ_AP018254.1_2321740_2322013_+	cd02227, cupin_TM1112-like, Thermotoga maritima TM1112 and related proteins, cupin domain	NA|249aa|up_6|NZ_AP018254.1_2322275_2323022_+	COG1402, COG1402, Uncharacterized protein, putative amidase [General function prediction only]	NA|448aa|up_5|NZ_AP018254.1_2324215_2325559_+	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|102aa|up_4|NZ_AP018254.1_2325830_2326136_+	PRK05943, PRK05943, 50S ribosomal protein L25; Reviewed	NA|51aa|up_3|NZ_AP018254.1_2327199_2327352_-	NA	NA|144aa|up_2|NZ_AP018254.1_2327587_2328019_-	NA	NA|1020aa|up_1|NZ_AP018254.1_2328670_2331730_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|56aa|up_0|NZ_AP018254.1_2332164_2332332_-	NA	NA|170aa|down_0|NZ_AP018254.1_2333166_2333676_-	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase	NA|289aa|down_1|NZ_AP018254.1_2333765_2334632_-	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|285aa|down_2|NZ_AP018254.1_2334661_2335516_-	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|48aa|down_3|NZ_AP018254.1_2335778_2335922_-	NA	NA|227aa|down_4|NZ_AP018254.1_2335938_2336619_-	NA	NA|527aa|down_5|NZ_AP018254.1_2336837_2338418_-	pfam12770, CHAT, CHAT domain	NA|781aa|down_6|NZ_AP018254.1_2338639_2340982_-	PRK14559, PRK14559, serine/threonine phosphatase	NA|381aa|down_7|NZ_AP018254.1_2341786_2342929_+	NA	NA|257aa|down_8|NZ_AP018254.1_2343124_2343895_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|76aa|down_9|NZ_AP018254.1_2344668_2344896_-	pfam11347, DUF3148, Protein of unknown function (DUF3148)
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	26	2397325-2397424	26	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	TCCCTACTCCCTACTCCCTGTAT	23	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|46aa|up_4|NZ_AP018254.1_2393202_2393340_+,NA|114aa|up_2|NZ_AP018254.1_2395566_2395908_-,NA|143aa|up_1|NZ_AP018254.1_2396176_2396605_+,NA	NA|280aa|up_9|NZ_AP018254.1_2386995_2387835_+	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein	NA|672aa|up_8|NZ_AP018254.1_2387936_2389952_+	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D	NA|281aa|up_7|NZ_AP018254.1_2390064_2390907_+	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D	NA|349aa|up_6|NZ_AP018254.1_2391001_2392048_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|173aa|up_5|NZ_AP018254.1_2392406_2392925_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|46aa|up_4|NZ_AP018254.1_2393202_2393340_+	NA	NA|715aa|up_3|NZ_AP018254.1_2393364_2395509_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|114aa|up_2|NZ_AP018254.1_2395566_2395908_-	NA	NA|143aa|up_1|NZ_AP018254.1_2396176_2396605_+	NA	NA|128aa|up_0|NZ_AP018254.1_2396820_2397204_-	pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid)	NA|435aa|down_0|NZ_AP018254.1_2399213_2400518_+	PRK05388, argJ, bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ	NA|453aa|down_1|NZ_AP018254.1_2400909_2402268_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|222aa|down_2|NZ_AP018254.1_2402512_2403178_-	pfam05685, Uma2, Putative restriction endonuclease	NA|453aa|down_3|NZ_AP018254.1_2403756_2405115_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|138aa|down_4|NZ_AP018254.1_2405502_2405916_-	COG3686, COG3686, Predicted membrane protein [Function unknown]	NA|169aa|down_5|NZ_AP018254.1_2406475_2406982_-	pfam02481, DNA_processg_A, DNA recombination-mediator protein A	NA|308aa|down_6|NZ_AP018254.1_2407208_2408132_+	TIGR04168, Ser/Thr_protein_phosphatase_family_protein, TIGR04168 family protein	NA|237aa|down_7|NZ_AP018254.1_2408332_2409043_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|307aa|down_8|NZ_AP018254.1_2409854_2410775_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|506aa|down_9|NZ_AP018254.1_2410840_2412358_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	27	2496286-2496370	27	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CGTAATAGCCTGCGGCAACGCTTCGC	26	1	5	2496312-2496344|2496312-2496344|2496312-2496344|2496312-2496344|2496312-2496344	NZ_AP018254.1_547857-547889|NZ_AP018254.1_1761162-1761130|NZ_AP018254.1_2534183-2534215|NZ_AP018254.1_3476199-3476231|NZ_AP018254.1_5236157-5236125	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|616aa|down_4|NZ_AP018254.1_2500943_2502791_+	NA|71aa|up_9|NZ_AP018254.1_2485122_2485335_+	pfam10742, DUF2555, Protein of unknown function (DUF2555)	NA|410aa|up_8|NZ_AP018254.1_2485820_2487050_+	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|381aa|up_7|NZ_AP018254.1_2487906_2489049_+	PRK14293, PRK14293, molecular chaperone DnaJ	NA|85aa|up_6|NZ_AP018254.1_2489045_2489300_+	cd00291, SirA_YedF_YeeD, SirA, YedF, and YeeD	NA|353aa|up_5|NZ_AP018254.1_2489300_2490359_+	PRK12289, PRK12289, small ribosomal subunit biogenesis GTPase RsgA	NA|147aa|up_4|NZ_AP018254.1_2490520_2490961_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|358aa|up_3|NZ_AP018254.1_2490988_2492062_-	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|338aa|up_2|NZ_AP018254.1_2492603_2493617_+	PRK06245, cofG, FO synthase subunit 1; Reviewed	NA|254aa|up_1|NZ_AP018254.1_2493730_2494492_+	sd00006, TPR, Tetratricopeptide repeat	NA|458aa|up_0|NZ_AP018254.1_2494744_2496118_+	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like	NA|142aa|down_0|NZ_AP018254.1_2496913_2497339_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|154aa|down_1|NZ_AP018254.1_2497533_2497995_+	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|224aa|down_2|NZ_AP018254.1_2498048_2498720_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|215aa|down_3|NZ_AP018254.1_2499396_2500041_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|616aa|down_4|NZ_AP018254.1_2500943_2502791_+	NA	NA|536aa|down_5|NZ_AP018254.1_2503146_2504754_-	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|288aa|down_6|NZ_AP018254.1_2505040_2505904_+	pfam06485, DUF1092, Protein of unknown function (DUF1092)	NA|545aa|down_7|NZ_AP018254.1_2506029_2507664_-	cd03085, PGM1, Phosphoglucomutase 1 (PGM1) catalyzes the bidirectional interconversion of glucose-1-phosphate (G-1-P) and glucose-6-phosphate (G-6-P) via a glucose 1,6-diphosphate intermediate, an important metabolic step in prokaryotes and eukaryotes	NA|236aa|down_8|NZ_AP018254.1_2507802_2508510_-	pfam05685, Uma2, Putative restriction endonuclease	NA|414aa|down_9|NZ_AP018254.1_2508935_2510177_+	pfam01384, PHO4, Phosphate transporter family
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	28	2592626-2592807	7	PILER-CR	no	csa3	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type I-A	CTTGAAATTAACTATTTCCCCGCAAGGAGACTGAAACT	38	0	0	NA	NA	NA	2	2	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|60aa|up_0|NZ_AP018254.1_2592189_2592369_+,NA|227aa|down_5|NZ_AP018254.1_2600446_2601127_-,NA|222aa|down_9|NZ_AP018254.1_2606454_2607120_+	NA|112aa|up_9|NZ_AP018254.1_2581131_2581467_-	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|245aa|up_8|NZ_AP018254.1_2582074_2582809_+	PLN02309, PLN02309, 5'-adenylylsulfate reductase	csa3|71aa|up_7|NZ_AP018254.1_2585939_2586152_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|136aa|up_6|NZ_AP018254.1_2586324_2586732_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|417aa|up_5|NZ_AP018254.1_2586795_2588046_-	COG0446, HcaD, Uncharacterized NAD(FAD)-dependent dehydrogenases [General function prediction only]	NA|232aa|up_4|NZ_AP018254.1_2588567_2589263_+	cd07724, POD-like_MBL-fold, ETHE1 (PDO type I), persulfide dioxygenase A (PDOA, PDO type II) and related proteins; MBL-fold metallo-hydrolase domain	NA|181aa|up_3|NZ_AP018254.1_2589416_2589959_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|261aa|up_2|NZ_AP018254.1_2590559_2591342_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|128aa|up_1|NZ_AP018254.1_2591498_2591882_-	pfam13682, CZB, Chemoreceptor zinc-binding domain	NA|60aa|up_0|NZ_AP018254.1_2592189_2592369_+	NA	NA|176aa|down_0|NZ_AP018254.1_2593337_2593865_-	pfam10726, DUF2518, Protein of function (DUF2518)	NA|409aa|down_1|NZ_AP018254.1_2594092_2595319_-	pfam05626, DUF790, Protein of unknown function (DUF790)	NA|600aa|down_2|NZ_AP018254.1_2595800_2597600_-	PHA03095, PHA03095, ankyrin-like protein; Provisional	NA|332aa|down_3|NZ_AP018254.1_2598238_2599234_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|164aa|down_4|NZ_AP018254.1_2599863_2600355_-	cd11740, YajQ_like, Proteins similar to Escherichia coli YajQ	NA|227aa|down_5|NZ_AP018254.1_2600446_2601127_-	NA	NA|647aa|down_6|NZ_AP018254.1_2601482_2603423_+	CHL00162, thiG, thiamin biosynthesis protein G; Validated	NA|329aa|down_7|NZ_AP018254.1_2603712_2604699_-	cd01167, bac_FRK, Fructokinases (FRKs) mainly from bacteria and plants are enzymes with high specificity for fructose, as are all FRKs, but they catalyzes the conversion of fructose to fructose-6-phosphate, which is an entry point into glycolysis via conversion into glucose-6-phosphate	NA|88aa|down_8|NZ_AP018254.1_2605229_2605493_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|222aa|down_9|NZ_AP018254.1_2606454_2607120_+	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	29	2605742-2605847	28	CRISPRCasFinder	no	csa3	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type I-A	TAATTTGTAATAGCCTACGGCAACGCTTCGCGTACGTAAT	40	1	62	2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807|2605782-2605807	NZ_AP018254.1_82578-82603|NZ_AP018254.1_547866-547891|NZ_AP018254.1_702787-702812|NZ_AP018254.1_1342221-1342246|NZ_AP018254.1_1497356-1497381|NZ_AP018254.1_1519014-1519039|NZ_AP018254.1_1566603-1566628|NZ_AP018254.1_2534192-2534217|NZ_AP018254.1_2856290-2856315|NZ_AP018254.1_3102481-3102506|NZ_AP018254.1_3476208-3476233|NZ_AP018254.1_4117261-4117286|NZ_AP018254.1_4587859-4587884|NZ_AP018254.1_5308056-5308081|NZ_AP018254.1_5381576-5381601|NZ_AP018254.1_5605425-5605450|NZ_AP018254.1_5612730-5612755|NZ_AP018254.1_971310-971285|NZ_AP018254.1_1116892-1116867|NZ_AP018254.1_1339172-1339147|NZ_AP018254.1_1510481-1510456|NZ_AP018254.1_1761153-1761128|NZ_AP018254.1_1761212-1761187|NZ_AP018254.1_1811374-1811349|NZ_AP018254.1_1955615-1955590|NZ_AP018254.1_2279166-2279141|NZ_AP018254.1_2343109-2343084|NZ_AP018254.1_2574446-2574421|NZ_AP018254.1_2781104-2781079|NZ_AP018254.1_3650436-3650411|NZ_AP018254.1_3662442-3662417|NZ_AP018254.1_3812156-3812131|NZ_AP018254.1_4775035-4775010|NZ_AP018254.1_5236148-5236123|NZ_AP018254.1_5819441-5819416|NZ_AP018254.1_1038142-1038167|NZ_AP018254.1_1111261-1111286|NZ_AP018254.1_1937592-1937617|NZ_AP018254.1_3988568-3988593|NZ_AP018254.1_4414279-4414304|NZ_AP018254.1_4431752-4431777|NZ_AP018254.1_157608-157583|NZ_AP018254.1_442005-441980|NZ_AP018254.1_1116991-1116966|NZ_AP018254.1_1148379-1148354|NZ_AP018254.1_2263735-2263710|NZ_AP018254.1_2595585-2595560|NZ_AP018254.1_3812003-3811978|NZ_AP018254.1_3883456-3883431|NZ_AP018254.1_3941700-3941675|NZ_AP018254.1_5042773-5042748|NZ_AP018254.1_5447753-5447728|NZ_AP018254.1_5447976-5447951|NZ_AP018254.1_5976684-5976659|NZ_AP018254.1_1639570-1639595|NZ_AP018254.1_3322914-3322939|NZ_AP018254.1_3382166-3382191|NZ_AP018254.1_5585174-5585199|NZ_AP018254.1_1835256-1835231|NZ_AP018254.1_3986327-3986302|NZ_AP018254.1_5669007-5668982|NZ_AP018254.1_5694103-5694078	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|60aa|up_9|NZ_AP018254.1_2592189_2592369_+,NA|227aa|up_3|NZ_AP018254.1_2600446_2601127_-,NA|222aa|down_0|NZ_AP018254.1_2606454_2607120_+,NA|78aa|down_8|NZ_AP018254.1_2615332_2615566_-	NA|60aa|up_9|NZ_AP018254.1_2592189_2592369_+	NA	NA|176aa|up_8|NZ_AP018254.1_2593337_2593865_-	pfam10726, DUF2518, Protein of function (DUF2518)	NA|409aa|up_7|NZ_AP018254.1_2594092_2595319_-	pfam05626, DUF790, Protein of unknown function (DUF790)	NA|600aa|up_6|NZ_AP018254.1_2595800_2597600_-	PHA03095, PHA03095, ankyrin-like protein; Provisional	NA|332aa|up_5|NZ_AP018254.1_2598238_2599234_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|164aa|up_4|NZ_AP018254.1_2599863_2600355_-	cd11740, YajQ_like, Proteins similar to Escherichia coli YajQ	NA|227aa|up_3|NZ_AP018254.1_2600446_2601127_-	NA	NA|647aa|up_2|NZ_AP018254.1_2601482_2603423_+	CHL00162, thiG, thiamin biosynthesis protein G; Validated	NA|329aa|up_1|NZ_AP018254.1_2603712_2604699_-	cd01167, bac_FRK, Fructokinases (FRKs) mainly from bacteria and plants are enzymes with high specificity for fructose, as are all FRKs, but they catalyzes the conversion of fructose to fructose-6-phosphate, which is an entry point into glycolysis via conversion into glucose-6-phosphate	NA|88aa|up_0|NZ_AP018254.1_2605229_2605493_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|222aa|down_0|NZ_AP018254.1_2606454_2607120_+	NA	NA|335aa|down_1|NZ_AP018254.1_2607365_2608370_+	COG0438, RfaG, Glycosyltransferase [Cell envelope biogenesis, outer membrane]	NA|429aa|down_2|NZ_AP018254.1_2609034_2610321_-	cd17486, MFS_AmpG_like, AmpG and similar transporters of the Major Facilitator Superfamily	NA|323aa|down_3|NZ_AP018254.1_2610615_2611584_+	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|289aa|down_4|NZ_AP018254.1_2611580_2612447_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|75aa|down_5|NZ_AP018254.1_2612433_2612658_+	pfam02517, Abi, CAAX protease self-immunity	NA|436aa|down_6|NZ_AP018254.1_2613063_2614371_+	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|219aa|down_7|NZ_AP018254.1_2614578_2615235_-	pfam05685, Uma2, Putative restriction endonuclease	NA|78aa|down_8|NZ_AP018254.1_2615332_2615566_-	NA	NA|695aa|down_9|NZ_AP018254.1_2615726_2617811_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	30	2673869-2674008	29	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	ATGTAATGCTGATATCTTTGCTGATAAAAGTGATTTTCCGATTCACT	47	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA	NA|468aa|up_9|NZ_AP018254.1_2657977_2659381_+	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|707aa|up_8|NZ_AP018254.1_2659769_2661890_+	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|299aa|up_7|NZ_AP018254.1_2662133_2663030_-	PLN02679, PLN02679, hydrolase, alpha/beta fold family protein	NA|390aa|up_6|NZ_AP018254.1_2663242_2664412_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|199aa|up_5|NZ_AP018254.1_2665340_2665937_-	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|606aa|up_4|NZ_AP018254.1_2666083_2667901_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|477aa|up_3|NZ_AP018254.1_2668568_2669999_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|530aa|up_2|NZ_AP018254.1_2670426_2672016_-	COG1543, COG1543, Uncharacterized conserved protein [Function unknown]	NA|112aa|up_1|NZ_AP018254.1_2672633_2672969_+	PRK13612, PRK13612, photosystem II reaction center protein Psb28; Provisional	NA|166aa|up_0|NZ_AP018254.1_2673310_2673808_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|368aa|down_0|NZ_AP018254.1_2674144_2675248_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|79aa|down_1|NZ_AP018254.1_2675335_2675572_-	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|195aa|down_2|NZ_AP018254.1_2675815_2676400_+	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|252aa|down_3|NZ_AP018254.1_2676992_2677748_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|290aa|down_4|NZ_AP018254.1_2677805_2678675_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|423aa|down_5|NZ_AP018254.1_2679290_2680559_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|441aa|down_6|NZ_AP018254.1_2681252_2682575_-	COG3395, COG3395, Uncharacterized protein conserved in bacteria [Function unknown]	NA|189aa|down_7|NZ_AP018254.1_2682580_2683147_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|341aa|down_8|NZ_AP018254.1_2683836_2684859_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|218aa|down_9|NZ_AP018254.1_2685318_2685972_+	PRK09347, folE, GTP cyclohydrolase I; Provisional
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	31	2696878-2697035	8	PILER-CR	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CTTGCCGCAGGCTATTACGTAG	22	1	1	2696970-2697017	NZ_AP018254.1_2534223-2534176	NA	2	2	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|61aa|up_2|NZ_AP018254.1_2692869_2693052_-,NA|111aa|down_4|NZ_AP018254.1_2705012_2705345_+	NA|189aa|up_9|NZ_AP018254.1_2682580_2683147_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|341aa|up_8|NZ_AP018254.1_2683836_2684859_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|218aa|up_7|NZ_AP018254.1_2685318_2685972_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|358aa|up_6|NZ_AP018254.1_2686072_2687146_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|807aa|up_5|NZ_AP018254.1_2687319_2689740_+	TIGR02470, Sucrose_synthase_1, sucrose synthase	NA|186aa|up_4|NZ_AP018254.1_2690293_2690851_+	pfam05685, Uma2, Putative restriction endonuclease	NA|304aa|up_3|NZ_AP018254.1_2691382_2692294_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|61aa|up_2|NZ_AP018254.1_2692869_2693052_-	NA	NA|546aa|up_1|NZ_AP018254.1_2693247_2694885_-	PRK05380, pyrG, CTP synthetase; Validated	NA|588aa|up_0|NZ_AP018254.1_2694975_2696739_+	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|228aa|down_0|NZ_AP018254.1_2697079_2697763_-	COG2340, COG2340, Uncharacterized protein with SCP/PR1 domains [Function unknown]	NA|470aa|down_1|NZ_AP018254.1_2698197_2699607_-	PRK10263, PRK10263, DNA translocase FtsK; Provisional	NA|466aa|down_2|NZ_AP018254.1_2700714_2702112_+	PRK09201, PRK09201, AtzE family amidohydrolase	NA|440aa|down_3|NZ_AP018254.1_2702205_2703525_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|111aa|down_4|NZ_AP018254.1_2705012_2705345_+	NA	NA|402aa|down_5|NZ_AP018254.1_2706055_2707261_+	COG0465, HflB, ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones]	NA|754aa|down_6|NZ_AP018254.1_2707902_2710164_-	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|339aa|down_7|NZ_AP018254.1_2712473_2713490_-	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|486aa|down_8|NZ_AP018254.1_2714194_2715652_+	PRK09567, nirA, NirA family protein	NA|210aa|down_9|NZ_AP018254.1_2715882_2716512_+	PRK08285, cobH, precorrin-8X methylmutase; Reviewed
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	32	3177397-3177499	30	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CCAATCCCAGTCAACCCAGAACTTC	25	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|55aa|up_3|NZ_AP018254.1_3173713_3173878_-,NA|178aa|up_1|NZ_AP018254.1_3174059_3174593_+,NA|47aa|down_4|NZ_AP018254.1_3185687_3185828_-,NA|193aa|down_5|NZ_AP018254.1_3186075_3186654_+	NA|153aa|up_9|NZ_AP018254.1_3163972_3164431_+	pfam01475, FUR, Ferric uptake regulator family	NA|388aa|up_8|NZ_AP018254.1_3164669_3165833_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1277aa|up_7|NZ_AP018254.1_3165854_3169685_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|648aa|up_6|NZ_AP018254.1_3169907_3171851_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|207aa|up_5|NZ_AP018254.1_3171949_3172570_-	pfam08974, DUF1877, Domain of unknown function (DUF1877)	NA|262aa|up_4|NZ_AP018254.1_3172598_3173384_-	PRK13331, PRK13331, pantothenate kinase; Reviewed	NA|55aa|up_3|NZ_AP018254.1_3173713_3173878_-	NA	NA|29aa|up_2|NZ_AP018254.1_3173887_3173974_-	PRK02529, petN, cytochrome b6-f complex subunit PetN; Provisional	NA|178aa|up_1|NZ_AP018254.1_3174059_3174593_+	NA	NA|442aa|up_0|NZ_AP018254.1_3174720_3176046_+	TIGR03279, cyano_FeS_chp, putative radical SAM enzyme, TIGR03279 family	NA|280aa|down_0|NZ_AP018254.1_3179404_3180244_-	COG0415, PhrB, Deoxyribodipyrimidine photolyase [DNA replication, recombination, and repair]	NA|607aa|down_1|NZ_AP018254.1_3180803_3182624_-	PRK07431, PRK07431, aspartate kinase; Provisional	NA|344aa|down_2|NZ_AP018254.1_3183211_3184243_+	cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain in Subtilisin-like proteins	NA|320aa|down_3|NZ_AP018254.1_3184726_3185686_+	TIGR00367, Uncharacterized_membrane_protein_MJ0091, K+-dependent Na+/Ca+ exchanger related-protein	NA|47aa|down_4|NZ_AP018254.1_3185687_3185828_-	NA	NA|193aa|down_5|NZ_AP018254.1_3186075_3186654_+	NA	NA|57aa|down_6|NZ_AP018254.1_3186703_3186874_-	pfam10013, DUF2256, Uncharacterized protein conserved in bacteria (DUF2256)	NA|628aa|down_7|NZ_AP018254.1_3187031_3188915_+	cd07333, M48C_bepA_like, Peptidase M48C Ste24p bepA-like, integral membrane protein	NA|90aa|down_8|NZ_AP018254.1_3188890_3189160_+	cd17921, DEXHc_Ski2, DEXH-box helicase domain of DEAD-like helicase Ski2 family proteins	NA|254aa|down_9|NZ_AP018254.1_3189479_3190241_+	pfam14218, COP23, Circadian oscillating protein COP23
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	33	3228154-3228239	31	CRISPRCasFinder	no	csa3	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type I-A	TATCTTATCTAGAGACCATGCTCTGCGT	28	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|285aa|up_4|NZ_AP018254.1_3220641_3221496_+,NA|221aa|up_3|NZ_AP018254.1_3221542_3222205_+,NA|49aa|down_1|NZ_AP018254.1_3229298_3229445_-,NA|280aa|down_4|NZ_AP018254.1_3234007_3234847_+,NA|194aa|down_5|NZ_AP018254.1_3235150_3235732_-,NA|87aa|down_8|NZ_AP018254.1_3239578_3239839_+	NA|100aa|up_9|NZ_AP018254.1_3211328_3211628_+	PRK02724, PRK02724, 30S ribosomal protein PSRP-3	NA|576aa|up_8|NZ_AP018254.1_3211731_3213459_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|573aa|up_7|NZ_AP018254.1_3213829_3215548_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|420aa|up_6|NZ_AP018254.1_3216225_3217485_+	sd00006, TPR, Tetratricopeptide repeat	NA|544aa|up_5|NZ_AP018254.1_3217879_3219511_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|285aa|up_4|NZ_AP018254.1_3220641_3221496_+	NA	NA|221aa|up_3|NZ_AP018254.1_3221542_3222205_+	NA	NA|534aa|up_2|NZ_AP018254.1_3222473_3224075_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|205aa|up_1|NZ_AP018254.1_3224227_3224842_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|746aa|up_0|NZ_AP018254.1_3225499_3227737_-	COG3211, PhoX, Predicted phosphatase [General function prediction only]	NA|223aa|down_0|NZ_AP018254.1_3228524_3229193_-	COG0704, PhoU, Phosphate uptake regulator [Inorganic ion transport and metabolism]	NA|49aa|down_1|NZ_AP018254.1_3229298_3229445_-	NA	NA|449aa|down_2|NZ_AP018254.1_3231415_3232762_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|251aa|down_3|NZ_AP018254.1_3232809_3233562_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|280aa|down_4|NZ_AP018254.1_3234007_3234847_+	NA	NA|194aa|down_5|NZ_AP018254.1_3235150_3235732_-	NA	csa3|199aa|down_6|NZ_AP018254.1_3236423_3237020_-	pfam09860, DUF2087, Uncharacterized protein conserved in bacteria (DUF2087)	NA|363aa|down_7|NZ_AP018254.1_3238055_3239144_+	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional	NA|87aa|down_8|NZ_AP018254.1_3239578_3239839_+	NA	NA|273aa|down_9|NZ_AP018254.1_3239952_3240771_+	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	34	3488923-3489051	32	CRISPRCasFinder	no	PD-DExK	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Unclear	ATAAGGGTAATCCAAAATCCATAATCCAAAATCCAAAATC	40	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|52aa|up_5|NZ_AP018254.1_3482205_3482361_+,NA|296aa|down_0|NZ_AP018254.1_3489066_3489954_-,NA|170aa|down_1|NZ_AP018254.1_3490067_3490577_-,PD-DExK|207aa|down_3|NZ_AP018254.1_3491565_3492186_+	NA|392aa|up_9|NZ_AP018254.1_3476896_3478072_+	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|282aa|up_8|NZ_AP018254.1_3478320_3479166_+	TIGR02821, S-formylglutathione_hydrolase, S-formylglutathione hydrolase	NA|418aa|up_7|NZ_AP018254.1_3479428_3480682_-	cd06348, PBP1_ABC_HAAT-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids or peptides	NA|346aa|up_6|NZ_AP018254.1_3481119_3482157_-	cd01011, nicotinamidase, Nicotinamidase/pyrazinamidase (PZase)	NA|52aa|up_5|NZ_AP018254.1_3482205_3482361_+	NA	NA|397aa|up_4|NZ_AP018254.1_3482337_3483528_-	PRK03080, PRK03080, phosphoserine transaminase	NA|123aa|up_3|NZ_AP018254.1_3484019_3484388_-	PTZ00038, PTZ00038, ferredoxin; Provisional	NA|335aa|up_2|NZ_AP018254.1_3484899_3485904_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|280aa|up_1|NZ_AP018254.1_3485884_3486724_+	cd07730, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|474aa|up_0|NZ_AP018254.1_3487351_3488773_+	cd09173, PLDc_Nuc_like_unchar1_2, Putative catalytic domain, repeat 2, of uncharacterized hypothetical proteins similar to Nuc, an endonuclease from Salmonella typhimurium	NA|296aa|down_0|NZ_AP018254.1_3489066_3489954_-	NA	NA|170aa|down_1|NZ_AP018254.1_3490067_3490577_-	NA	NA|193aa|down_2|NZ_AP018254.1_3490790_3491369_+	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	PD-DExK|207aa|down_3|NZ_AP018254.1_3491565_3492186_+	NA	NA|588aa|down_4|NZ_AP018254.1_3492417_3494181_-	cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine Synthase B	NA|155aa|down_5|NZ_AP018254.1_3494446_3494911_-	pfam14229, DUF4332, Domain of unknown function (DUF4332)	NA|169aa|down_6|NZ_AP018254.1_3495071_3495578_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|71aa|down_7|NZ_AP018254.1_3495743_3495956_-	cd01716, Hfq, bacterial Hfq-like	NA|280aa|down_8|NZ_AP018254.1_3496090_3496930_+	PRK00450, dapF, diaminopimelate epimerase; Provisional	NA|327aa|down_9|NZ_AP018254.1_3497155_3498136_+	COG2971, COG2971, Predicted N-acetylglucosamine kinase [Carbohydrate transport and metabolism]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	35	3552245-3552991	33,9,6	CRISPRCasFinder,PILER-CR,CRT	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC,GTCTCCTCTTATGAGGGGAAATAATTGATTGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	9,8,8	9	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|57aa|up_9|NZ_AP018254.1_3544296_3544467_+,NA|344aa|up_4|NZ_AP018254.1_3547856_3548888_+,NA|167aa|up_3|NZ_AP018254.1_3549574_3550075_+,NA|138aa|up_2|NZ_AP018254.1_3550100_3550514_-,NA	NA|57aa|up_9|NZ_AP018254.1_3544296_3544467_+	NA	NA|170aa|up_8|NZ_AP018254.1_3544663_3545173_+	cd12126, APC_beta, Allophycocyanin beta subunit of the phycobilisome core	NA|268aa|up_7|NZ_AP018254.1_3545385_3546189_+	COG1496, yfiH, Multicopper polyphenol oxidase (laccase) [Secondary metabolites biosynthesis, transport and catabolism]	NA|195aa|up_6|NZ_AP018254.1_3546284_3546869_+	pfam05685, Uma2, Putative restriction endonuclease	NA|195aa|up_5|NZ_AP018254.1_3547076_3547661_+	pfam05685, Uma2, Putative restriction endonuclease	NA|344aa|up_4|NZ_AP018254.1_3547856_3548888_+	NA	NA|167aa|up_3|NZ_AP018254.1_3549574_3550075_+	NA	NA|138aa|up_2|NZ_AP018254.1_3550100_3550514_-	NA	NA|294aa|up_1|NZ_AP018254.1_3550766_3551648_+	PRK12928, PRK12928, lipoyl synthase; Provisional	NA|45aa|up_0|NZ_AP018254.1_3551740_3551875_-	pfam08078, PsaX, PsaX family	NA|398aa|down_0|NZ_AP018254.1_3553403_3554597_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|306aa|down_1|NZ_AP018254.1_3554876_3555794_+	cd09993, HDAC_classIV, Histone deacetylase class IV also known as histone deacetylase 11	NA|805aa|down_2|NZ_AP018254.1_3555895_3558310_+	cd07389, MPP_PhoD, Bacillus subtilis PhoD and related proteins, metallophosphatase domain	NA|299aa|down_3|NZ_AP018254.1_3558490_3559387_-	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|177aa|down_4|NZ_AP018254.1_3559540_3560071_-	sd00006, TPR, Tetratricopeptide repeat	NA|275aa|down_5|NZ_AP018254.1_3560241_3561066_-	cd01000, PBP2_Cys_DEBP_like, Substrate-binding domain of cysteine- and aspartate/glutamate-binding proteins; the type 2 periplasmic-binding protein fold	NA|80aa|down_6|NZ_AP018254.1_3561251_3561491_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|119aa|down_7|NZ_AP018254.1_3561557_3561914_-	CHL00068, rpl20, ribosomal protein L20	NA|66aa|down_8|NZ_AP018254.1_3562079_3562277_-	PRK00172, rpmI, 50S ribosomal protein L35; Reviewed	NA|272aa|down_9|NZ_AP018254.1_3562319_3563135_-	PLN03100, PLN03100, Permease subunit of ER-derived-lipid transporter; Provisional
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	36	3716679-3718910	10,34,7	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas3,csm3gr7,csx19,csx10gr5,cas10,csx1,WYL	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type III-D,Type III-B,Type III-C,Type III-A	GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	30,30,30	30	TypeIII-B,TypeIII-C,TypeIII-D,TypeIII-A	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|266aa|up_0|NZ_AP018254.1_3715618_3716416_-,NA	NA|198aa|up_9|NZ_AP018254.1_3703710_3704304_+	pfam05685, Uma2, Putative restriction endonuclease	NA|353aa|up_8|NZ_AP018254.1_3704341_3705400_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|812aa|up_7|NZ_AP018254.1_3705568_3708004_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|141aa|up_6|NZ_AP018254.1_3708333_3708756_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|583aa|up_5|NZ_AP018254.1_3708941_3710690_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|721aa|up_4|NZ_AP018254.1_3711242_3713405_+	cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite	NA|153aa|up_3|NZ_AP018254.1_3713434_3713893_+	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|122aa|up_2|NZ_AP018254.1_3714058_3714424_+	pfam16156, DUF4864, Domain of unknown function (DUF4864)	NA|205aa|up_1|NZ_AP018254.1_3714834_3715449_+	pfam05685, Uma2, Putative restriction endonuclease	NA|266aa|up_0|NZ_AP018254.1_3715618_3716416_-	NA	cas2|89aa|down_0|NZ_AP018254.1_3719182_3719449_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas3|83aa|down_1|NZ_AP018254.1_3719561_3719810_-	pfam00270, DEAD, DEAD/DEAH box helicase	csm3gr7|739aa|down_2|NZ_AP018254.1_3719810_3722027_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|195aa|down_3|NZ_AP018254.1_3722016_3722601_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|495aa|down_4|NZ_AP018254.1_3722609_3724094_-	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csx10gr5|527aa|down_5|NZ_AP018254.1_3724093_3725674_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|228aa|down_6|NZ_AP018254.1_3725670_3726354_-	pfam03787, RAMPs, RAMP superfamily	cas10|538aa|down_7|NZ_AP018254.1_3726356_3727970_-	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	csx1|419aa|down_8|NZ_AP018254.1_3727966_3729223_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	WYL|475aa|down_9|NZ_AP018254.1_3729336_3730761_+	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	37	3768301-3768408	35	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	AATTACGTACGCGAAGCGTTGCCGCAGGCTATTAC	35	1	11	3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373|3768336-3768373	NZ_AP018254.1_4587883-4587846|NZ_AP018254.1_5605449-5605412|NZ_AP018254.1_971286-971323|NZ_AP018254.1_1148355-1148392|NZ_AP018254.1_1955591-1955628|NZ_AP018254.1_2343085-2343122|NZ_AP018254.1_3883432-3883469|NZ_AP018254.1_3941676-3941713|NZ_AP018254.1_1497380-1497343|NZ_AP018254.1_1622990-1623027|NZ_AP018254.1_2043213-2043176	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|93aa|up_1|NZ_AP018254.1_3765759_3766038_-,NA|64aa|down_0|NZ_AP018254.1_3768420_3768612_+,NA|112aa|down_7|NZ_AP018254.1_3775702_3776038_-	NA|476aa|up_9|NZ_AP018254.1_3750268_3751696_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|202aa|up_8|NZ_AP018254.1_3752049_3752655_+	pfam01292, Ni_hydr_CYTB, Prokaryotic cytochrome b561	NA|955aa|up_7|NZ_AP018254.1_3753164_3756029_-	pfam02518, HATPase_c, Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase	NA|377aa|up_6|NZ_AP018254.1_3756711_3757842_+	pfam12617, LdpA_C, Iron-Sulfur binding protein C terminal	NA|583aa|up_5|NZ_AP018254.1_3757865_3759614_+	COG3854, SpoIIIAA, ncharacterized protein conserved in bacteria [Function unknown]	NA|195aa|up_4|NZ_AP018254.1_3759848_3760433_-	TIGR04026, hypothetical_protein, PPOX class probable FMN-dependent enzyme, alr4036 family	NA|766aa|up_3|NZ_AP018254.1_3760797_3763095_+	TIGR02505, RTPR, ribonucleoside-triphosphate reductase, adenosylcobalamin-dependent	NA|563aa|up_2|NZ_AP018254.1_3764078_3765767_+	pfam03814, KdpA, Potassium-transporting ATPase A subunit	NA|93aa|up_1|NZ_AP018254.1_3765759_3766038_-	NA	NA|699aa|up_0|NZ_AP018254.1_3766013_3768110_+	PRK01122, PRK01122, potassium-transporting ATPase subunit KdpB	NA|64aa|down_0|NZ_AP018254.1_3768420_3768612_+	NA	NA|196aa|down_1|NZ_AP018254.1_3768687_3769275_+	PRK14003, PRK14003, K(+)-transporting ATPase subunit C	NA|259aa|down_2|NZ_AP018254.1_3769668_3770445_-	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|231aa|down_3|NZ_AP018254.1_3770926_3771619_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|199aa|down_4|NZ_AP018254.1_3771755_3772352_-	cd02109, arch_bact_SO_family_Moco, bacterial and archael members of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|484aa|down_5|NZ_AP018254.1_3772448_3773900_-	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|326aa|down_6|NZ_AP018254.1_3774434_3775412_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|112aa|down_7|NZ_AP018254.1_3775702_3776038_-	NA	NA|265aa|down_8|NZ_AP018254.1_3776030_3776825_+	pfam03808, Glyco_tran_WecB, Glycosyl transferase WecB/TagA/CpsF family	NA|213aa|down_9|NZ_AP018254.1_3777199_3777838_-	pfam18475, PIN7, PIN domain
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	38	3826895-3827008	36	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	TGGGGAGTCGGGAATAGGGAGTGGTGGGAAAGTTAACC	38	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|118aa|down_0|NZ_AP018254.1_3827115_3827469_+,NA|200aa|down_1|NZ_AP018254.1_3827548_3828148_-,NA|232aa|down_7|NZ_AP018254.1_3835987_3836683_+	NA|95aa|up_9|NZ_AP018254.1_3813908_3814193_+	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|504aa|up_8|NZ_AP018254.1_3814872_3816384_+	COG3320, COG3320, Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|465aa|up_7|NZ_AP018254.1_3817243_3818638_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|93aa|up_6|NZ_AP018254.1_3818875_3819154_+	pfam08872, KGK, KGK domain	NA|305aa|up_5|NZ_AP018254.1_3819726_3820641_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|286aa|up_4|NZ_AP018254.1_3820691_3821549_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|125aa|up_3|NZ_AP018254.1_3822029_3822404_-	cd00454, TrHb1_N, truncated hemoglobins (TrHbs, 2/2Hb, 2/2 globins); group 1 (N)	NA|191aa|up_2|NZ_AP018254.1_3822493_3823066_+	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|719aa|up_1|NZ_AP018254.1_3823182_3825339_+	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|296aa|up_0|NZ_AP018254.1_3825716_3826604_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|118aa|down_0|NZ_AP018254.1_3827115_3827469_+	NA	NA|200aa|down_1|NZ_AP018254.1_3827548_3828148_-	NA	NA|790aa|down_2|NZ_AP018254.1_3828628_3830998_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|440aa|down_3|NZ_AP018254.1_3831469_3832789_+	COG0420, SbcD, DNA repair exonuclease [DNA replication, recombination, and repair]	NA|36aa|down_4|NZ_AP018254.1_3833059_3833167_-	CHL00186, psaI, photosystem I subunit VIII; Validated	NA|232aa|down_5|NZ_AP018254.1_3833529_3834225_+	cd02108, bact_SO_family_Moco, bacterial subgroup of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|80aa|down_6|NZ_AP018254.1_3834751_3834991_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|232aa|down_7|NZ_AP018254.1_3835987_3836683_+	NA	NA|819aa|down_8|NZ_AP018254.1_3837075_3839532_-	CHL00095, clpC, Clp protease ATP binding subunit	NA|94aa|down_9|NZ_AP018254.1_3840101_3840383_+	pfam11691, DUF3288, Protein of unknown function (DUF3288)
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	39	3946065-3946181	37	CRISPRCasFinder	no	Cas14c_CAS-V-F	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Unclear	CAAGTAAAAGCTCTTTTTATTGCCGAACTCATTCCTACTC	40	0	0	NA	NA	NA	1	1	TypeV	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|70aa|up_8|NZ_AP018254.1_3938322_3938532_-,NA|72aa|down_2|NZ_AP018254.1_3949050_3949266_+,NA|387aa|down_6|NZ_AP018254.1_3952733_3953894_-	NA|124aa|up_9|NZ_AP018254.1_3937843_3938215_-	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein	NA|70aa|up_8|NZ_AP018254.1_3938322_3938532_-	NA	NA|264aa|up_7|NZ_AP018254.1_3938555_3939347_-	cd00757, ThiF_MoeB_HesA_family, ThiF_MoeB_HesA	NA|112aa|up_6|NZ_AP018254.1_3939339_3939675_-	PRK14102, nifW, nitrogenase-stabilizing/protective protein NifW	NA|70aa|up_5|NZ_AP018254.1_3939674_3939884_-	pfam05082, Rop-like, Rop-like	NA|159aa|up_4|NZ_AP018254.1_3940009_3940486_-	pfam03270, DUF269, Protein of unknown function, DUF269	NA|133aa|up_3|NZ_AP018254.1_3940772_3941171_-	TIGR02663, Protein_NifX, nitrogen fixation protein NifX	NA|440aa|up_2|NZ_AP018254.1_3941716_3943036_-	PRK14476, PRK14476, nitrogenase molybdenum-cofactor biosynthesis protein NifN; Provisional	NA|225aa|up_1|NZ_AP018254.1_3944145_3944820_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|227aa|up_0|NZ_AP018254.1_3945173_3945854_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|105aa|down_0|NZ_AP018254.1_3946205_3946520_-	cd03418, GRX_GRXb_1_3_like, Glutaredoxin (GRX) family, GRX bacterial class 1 and 3 (b_1_3)-like subfamily; composed of bacterial GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain	NA|421aa|down_1|NZ_AP018254.1_3946808_3948071_-	COG2821, MltA, Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]	NA|72aa|down_2|NZ_AP018254.1_3949050_3949266_+	NA	NA|235aa|down_3|NZ_AP018254.1_3949275_3949980_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|537aa|down_4|NZ_AP018254.1_3950007_3951618_-	pfam18105, PGM1_C, PGM1 C-terminal domain	NA|268aa|down_5|NZ_AP018254.1_3951833_3952637_-	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|387aa|down_6|NZ_AP018254.1_3952733_3953894_-	NA	Cas14c_CAS-V-F|393aa|down_7|NZ_AP018254.1_3956311_3957490_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|127aa|down_8|NZ_AP018254.1_3958290_3958671_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|281aa|down_9|NZ_AP018254.1_3958723_3959566_-	PRK09687, PRK09687, putative lyase; Provisional
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	40	4303661-4303768	38	CRISPRCasFinder	no	PD-DExK	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Unclear	GTGAAACAGCTACAGATGTTAGCGCTTGATTGTTAA	36	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,PD-DExK|222aa|down_2|NZ_AP018254.1_4307568_4308234_-	NA|274aa|up_9|NZ_AP018254.1_4291873_4292695_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|203aa|up_8|NZ_AP018254.1_4292900_4293509_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|308aa|up_7|NZ_AP018254.1_4293618_4294542_-	cd06583, PGRP, Peptidoglycan recognition proteins (PGRPs) are pattern recognition receptors that bind, and in certain cases, hydrolyze peptidoglycans (PGNs) of bacterial cell walls	NA|388aa|up_6|NZ_AP018254.1_4294780_4295944_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|528aa|up_5|NZ_AP018254.1_4296365_4297949_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|299aa|up_4|NZ_AP018254.1_4297945_4298842_-	PRK00085, recO, DNA repair protein RecO; Reviewed	NA|227aa|up_3|NZ_AP018254.1_4298921_4299602_-	PRK00507, PRK00507, deoxyribose-phosphate aldolase; Provisional	NA|484aa|up_2|NZ_AP018254.1_4300378_4301830_-	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|104aa|up_1|NZ_AP018254.1_4302627_4302939_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|114aa|up_0|NZ_AP018254.1_4303146_4303488_+	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|331aa|down_0|NZ_AP018254.1_4303968_4304961_+	cd08272, MDR6, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|734aa|down_1|NZ_AP018254.1_4305266_4307468_+	pfam13576, Pentapeptide_3, Pentapeptide repeats (9 copies)	PD-DExK|222aa|down_2|NZ_AP018254.1_4307568_4308234_-	NA	NA|732aa|down_3|NZ_AP018254.1_4308395_4310591_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|381aa|down_4|NZ_AP018254.1_4311234_4312377_-	TIGR04181, DegT/DnrJ/EryC1/StrS_aminotransferase, aminotransferase, LLPSF_NHT_00031 family	NA|351aa|down_5|NZ_AP018254.1_4312611_4313664_-	pfam02388, FemAB, FemAB family	NA|260aa|down_6|NZ_AP018254.1_4314015_4314795_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|303aa|down_7|NZ_AP018254.1_4315086_4315995_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|216aa|down_8|NZ_AP018254.1_4316037_4316685_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|388aa|down_9|NZ_AP018254.1_4316975_4318139_-	cd03808, GT4_CapM-like, capsular polysaccharide biosynthesis glycosyltransferase CapM and similar proteins
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	41	4433694-4433807	39	CRISPRCasFinder	no	Cas14c_CAS-V-F	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Unclear	GTAATTACTGAAGCATTAAGGAAACTTTTAATTTCCAC	38	0	0	NA	NA	NA	1	1	TypeV	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|105aa|up_6|NZ_AP018254.1_4425634_4425949_-,NA|63aa|up_4|NZ_AP018254.1_4428011_4428200_+,NA|136aa|up_3|NZ_AP018254.1_4428297_4428705_+,NA|146aa|down_0|NZ_AP018254.1_4434186_4434624_+	NA|598aa|up_9|NZ_AP018254.1_4419520_4421314_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|263aa|up_8|NZ_AP018254.1_4422358_4423147_-	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	Cas14c_CAS-V-F|393aa|up_7|NZ_AP018254.1_4423728_4424907_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|105aa|up_6|NZ_AP018254.1_4425634_4425949_-	NA	NA|378aa|up_5|NZ_AP018254.1_4426598_4427732_+	COG3329, COG3329, Predicted permease [General function prediction only]	NA|63aa|up_4|NZ_AP018254.1_4428011_4428200_+	NA	NA|136aa|up_3|NZ_AP018254.1_4428297_4428705_+	NA	NA|954aa|up_2|NZ_AP018254.1_4428881_4431743_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|271aa|up_1|NZ_AP018254.1_4432004_4432817_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|114aa|up_0|NZ_AP018254.1_4433153_4433495_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|146aa|down_0|NZ_AP018254.1_4434186_4434624_+	NA	NA|362aa|down_1|NZ_AP018254.1_4434807_4435893_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|140aa|down_2|NZ_AP018254.1_4436141_4436561_+	pfam07730, HisKA_3, Histidine kinase	NA|220aa|down_3|NZ_AP018254.1_4436821_4437481_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|600aa|down_4|NZ_AP018254.1_4437477_4439277_-	cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine Synthase B	NA|208aa|down_5|NZ_AP018254.1_4439283_4439907_-	TIGR00763, Lon_protease, endopeptidase La	NA|228aa|down_6|NZ_AP018254.1_4440259_4440943_-	PRK14419, PRK14419, membrane protein; Provisional	NA|381aa|down_7|NZ_AP018254.1_4441062_4442205_-	pfam11285, DUF3086, Protein of unknown function (DUF3086)	NA|131aa|down_8|NZ_AP018254.1_4442360_4442753_-	pfam11317, DUF3119, Protein of unknown function (DUF3119)	NA|891aa|down_9|NZ_AP018254.1_4443445_4446118_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	42	4595809-4595922	40	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTTTCCAATCAATTATTTCCCCTCATAAGAGGAGACCC	38	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|55aa|up_9|NZ_AP018254.1_4583749_4583914_-,NA|93aa|up_8|NZ_AP018254.1_4583952_4584231_-,NA|108aa|up_1|NZ_AP018254.1_4593501_4593825_-,NA|76aa|down_3|NZ_AP018254.1_4601124_4601352_+,NA|238aa|down_8|NZ_AP018254.1_4608490_4609204_-	NA|55aa|up_9|NZ_AP018254.1_4583749_4583914_-	NA	NA|93aa|up_8|NZ_AP018254.1_4583952_4584231_-	NA	NA|776aa|up_7|NZ_AP018254.1_4584294_4586622_-	cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain in Subtilisin-like proteins	NA|216aa|up_6|NZ_AP018254.1_4587186_4587834_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|453aa|up_5|NZ_AP018254.1_4588768_4590127_+	TIGR00665, DnaB, replicative DNA helicase	NA|419aa|up_4|NZ_AP018254.1_4590268_4591525_-	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|388aa|up_3|NZ_AP018254.1_4591656_4592820_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|146aa|up_2|NZ_AP018254.1_4593016_4593454_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|108aa|up_1|NZ_AP018254.1_4593501_4593825_-	NA	NA|453aa|up_0|NZ_AP018254.1_4594326_4595685_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|422aa|down_0|NZ_AP018254.1_4596505_4597771_-	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|189aa|down_1|NZ_AP018254.1_4598065_4598632_+	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]	NA|451aa|down_2|NZ_AP018254.1_4598903_4600256_+	COG0770, MurF, UDP-N-acetylmuramyl pentapeptide synthase [Cell envelope biogenesis, outer membrane]	NA|76aa|down_3|NZ_AP018254.1_4601124_4601352_+	NA	NA|370aa|down_4|NZ_AP018254.1_4601344_4602454_+	COG3093, VapI, Plasmid maintenance system antidote protein [General function prediction only]	NA|1069aa|down_5|NZ_AP018254.1_4602798_4606005_+	pfam00723, Glyco_hydro_15, Glycosyl hydrolases family 15	NA|326aa|down_6|NZ_AP018254.1_4606360_4607338_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|351aa|down_7|NZ_AP018254.1_4607418_4608471_+	PRK07394, PRK07394, hypothetical protein; Provisional	NA|238aa|down_8|NZ_AP018254.1_4608490_4609204_-	NA	NA|226aa|down_9|NZ_AP018254.1_4609705_4610383_-	TIGR04153, hypothetical_protein_L8106_04486, cyanosortase A-associated protein
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	43	4891267-4891341	41	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	TATTTCAGCCAACCTGTACTAGA	23	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|76aa|down_2|NZ_AP018254.1_4894683_4894911_-,NA|446aa|down_6|NZ_AP018254.1_4900901_4902239_-	NA|338aa|up_9|NZ_AP018254.1_4869077_4870091_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|396aa|up_8|NZ_AP018254.1_4870441_4871629_+	PRK02627, PRK02627, acetylornithine aminotransferase; Provisional	NA|582aa|up_7|NZ_AP018254.1_4871705_4873451_-	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|401aa|up_6|NZ_AP018254.1_4874154_4875357_+	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|121aa|up_5|NZ_AP018254.1_4875530_4875893_+	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|158aa|up_4|NZ_AP018254.1_4876171_4876645_+	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|1381aa|up_3|NZ_AP018254.1_4877632_4881775_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|1237aa|up_2|NZ_AP018254.1_4882756_4886467_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|467aa|up_1|NZ_AP018254.1_4886614_4888015_-	PLN02518, PLN02518, pheophorbide a oxygenase	NA|908aa|up_0|NZ_AP018254.1_4888189_4890913_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|279aa|down_0|NZ_AP018254.1_4892507_4893344_+	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|224aa|down_1|NZ_AP018254.1_4893489_4894161_-	pfam00395, SLH, S-layer homology domain	NA|76aa|down_2|NZ_AP018254.1_4894683_4894911_-	NA	NA|798aa|down_3|NZ_AP018254.1_4895229_4897623_+	sd00006, TPR, Tetratricopeptide repeat	NA|782aa|down_4|NZ_AP018254.1_4897893_4900239_+	cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|196aa|down_5|NZ_AP018254.1_4900307_4900895_+	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|446aa|down_6|NZ_AP018254.1_4900901_4902239_-	NA	NA|369aa|down_7|NZ_AP018254.1_4902873_4903980_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|145aa|down_8|NZ_AP018254.1_4905357_4905792_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|602aa|down_9|NZ_AP018254.1_4905989_4907795_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	44	4956138-4956292	42	CRISPRCasFinder	no	PD-DExK,Cas14c_CAS-V-F	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Unclear	GGGGTGAATAAATTACCTCTGGAAGCTCCTCC	32	0	0	NA	NA	NA	2	2	TypeV	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|59aa|down_1|NZ_AP018254.1_4959353_4959530_+,PD-DExK|200aa|down_2|NZ_AP018254.1_4959682_4960282_-,NA|84aa|down_6|NZ_AP018254.1_4963349_4963601_-,NA|50aa|down_8|NZ_AP018254.1_4965166_4965316_-	NA|105aa|up_9|NZ_AP018254.1_4944169_4944484_+	pfam12559, Inhibitor_I10, Serine endopeptidase inhibitors	NA|115aa|up_8|NZ_AP018254.1_4944651_4944996_-	COG5439, COG5439, Uncharacterized conserved protein [Function unknown]	NA|192aa|up_7|NZ_AP018254.1_4945024_4945600_-	COG5381, COG5381, Uncharacterized protein conserved in bacteria [Function unknown]	NA|201aa|up_6|NZ_AP018254.1_4945766_4946369_-	COG5381, COG5381, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|up_5|NZ_AP018254.1_4946882_4947086_+	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|204aa|up_4|NZ_AP018254.1_4947308_4947920_-	pfam10229, MMADHC, Methylmalonic aciduria and homocystinuria type D protein	NA|212aa|up_3|NZ_AP018254.1_4948431_4949067_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|544aa|up_2|NZ_AP018254.1_4949085_4950717_+	COG4615, PvdE, ABC-type siderophore export system, fused ATPase and permease components [Secondary metabolites biosynthesis, transport, and catabolism / Inorganic ion transport and metabolism]	NA|434aa|up_1|NZ_AP018254.1_4950724_4952026_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|927aa|up_0|NZ_AP018254.1_4952025_4954806_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|861aa|down_0|NZ_AP018254.1_4956749_4959332_-	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|59aa|down_1|NZ_AP018254.1_4959353_4959530_+	NA	PD-DExK|200aa|down_2|NZ_AP018254.1_4959682_4960282_-	NA	NA|306aa|down_3|NZ_AP018254.1_4960329_4961247_-	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	Cas14c_CAS-V-F|393aa|down_4|NZ_AP018254.1_4961477_4962656_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|132aa|down_5|NZ_AP018254.1_4962744_4963140_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|84aa|down_6|NZ_AP018254.1_4963349_4963601_-	NA	NA|446aa|down_7|NZ_AP018254.1_4963685_4965023_-	pfam00395, SLH, S-layer homology domain	NA|50aa|down_8|NZ_AP018254.1_4965166_4965316_-	NA	NA|61aa|down_9|NZ_AP018254.1_4965518_4965701_-	pfam02416, MttA_Hcf106, mttA/Hcf106 family
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	45	5348984-5349056	43	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	GTGGGTCATTTTCGGTGGGTCGTC	24	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|47aa|up_9|NZ_AP018254.1_5334883_5335024_-,NA|77aa|up_4|NZ_AP018254.1_5342221_5342452_+,NA|198aa|down_2|NZ_AP018254.1_5354504_5355098_-,NA|118aa|down_5|NZ_AP018254.1_5359135_5359489_-,NA|104aa|down_9|NZ_AP018254.1_5363070_5363382_-	NA|47aa|up_9|NZ_AP018254.1_5334883_5335024_-	NA	NA|500aa|up_8|NZ_AP018254.1_5335176_5336676_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|396aa|up_7|NZ_AP018254.1_5337051_5338239_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|499aa|up_6|NZ_AP018254.1_5338881_5340378_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|310aa|up_5|NZ_AP018254.1_5340547_5341477_+	pfam00892, EamA, EamA-like transporter family	NA|77aa|up_4|NZ_AP018254.1_5342221_5342452_+	NA	NA|469aa|up_3|NZ_AP018254.1_5342719_5344126_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|565aa|up_2|NZ_AP018254.1_5344289_5345984_+	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|207aa|up_1|NZ_AP018254.1_5346188_5346809_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|448aa|up_0|NZ_AP018254.1_5346974_5348318_-	TIGR00911, High-affinity_methionine_permease, L-type amino acid transporter	NA|1235aa|down_0|NZ_AP018254.1_5349173_5352878_+	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|419aa|down_1|NZ_AP018254.1_5353125_5354382_-	cd00887, MoeA, MoeA family	NA|198aa|down_2|NZ_AP018254.1_5354504_5355098_-	NA	NA|874aa|down_3|NZ_AP018254.1_5355704_5358326_+	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|153aa|down_4|NZ_AP018254.1_5358412_5358871_-	cd17036, T3SC_YbjN-like_1, T110839 is structurally similar to type III secretion system chaperones and YbjN family proteins	NA|118aa|down_5|NZ_AP018254.1_5359135_5359489_-	NA	NA|299aa|down_6|NZ_AP018254.1_5359597_5360494_-	TIGR01247, drrB, daunorubicin resistance ABC transporter membrane protein	NA|340aa|down_7|NZ_AP018254.1_5360808_5361828_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|307aa|down_8|NZ_AP018254.1_5362149_5363070_+	COG1090, COG1090, Predicted nucleoside-diphosphate sugar epimerase [General function prediction only]	NA|104aa|down_9|NZ_AP018254.1_5363070_5363382_-	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	46	5518585-5518697	44	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	CATCAATCGCTTGCAAAATCAGTTTTTTCTTTGTGACTT	39	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|596aa|up_9|NZ_AP018254.1_5507340_5509128_-,NA|148aa|down_2|NZ_AP018254.1_5520503_5520947_+,NA|433aa|down_8|NZ_AP018254.1_5528403_5529702_+,NA|199aa|down_9|NZ_AP018254.1_5530126_5530723_+	NA|596aa|up_9|NZ_AP018254.1_5507340_5509128_-	NA	NA|284aa|up_8|NZ_AP018254.1_5509876_5510728_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|385aa|up_7|NZ_AP018254.1_5511866_5513021_+	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|111aa|up_6|NZ_AP018254.1_5513086_5513419_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|209aa|up_5|NZ_AP018254.1_5513464_5514091_+	pfam12770, CHAT, CHAT domain	NA|173aa|up_4|NZ_AP018254.1_5514795_5515314_+	cd14768, PC_PEC_beta, Beta subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|163aa|up_3|NZ_AP018254.1_5515436_5515925_+	cd14770, PC-PEC_alpha, Alpha subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|272aa|up_2|NZ_AP018254.1_5516193_5517009_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|290aa|up_1|NZ_AP018254.1_5517175_5518045_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|75aa|up_0|NZ_AP018254.1_5518173_5518398_+	pfam01383, CpcD, CpcD/allophycocyanin linker domain	NA|275aa|down_0|NZ_AP018254.1_5518813_5519638_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|251aa|down_1|NZ_AP018254.1_5519741_5520494_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|148aa|down_2|NZ_AP018254.1_5520503_5520947_+	NA	NA|279aa|down_3|NZ_AP018254.1_5521008_5521845_+	TIGR03695, menH_SHCHC, 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase	NA|428aa|down_4|NZ_AP018254.1_5522159_5523443_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|730aa|down_5|NZ_AP018254.1_5523728_5525918_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|302aa|down_6|NZ_AP018254.1_5525914_5526820_-	COG3221, PhnD, ABC-type phosphate/phosphonate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|142aa|down_7|NZ_AP018254.1_5527131_5527557_-	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|433aa|down_8|NZ_AP018254.1_5528403_5529702_+	NA	NA|199aa|down_9|NZ_AP018254.1_5530126_5530723_+	NA
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	47	5628611-5629006	11,45,8	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type I-D	GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	4,5,5	5	TypeI-D	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|52aa|up_4|NZ_AP018254.1_5624284_5624440_-,NA	NA|140aa|up_9|NZ_AP018254.1_5618820_5619240_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|164aa|up_8|NZ_AP018254.1_5619603_5620095_-	PRK13618, psbV, cytochrome c-550; Provisional	NA|225aa|up_7|NZ_AP018254.1_5620357_5621032_+	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|552aa|up_6|NZ_AP018254.1_5621095_5622751_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|379aa|up_5|NZ_AP018254.1_5623126_5624263_+	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|52aa|up_4|NZ_AP018254.1_5624284_5624440_-	NA	NA|281aa|up_3|NZ_AP018254.1_5624583_5625426_-	PLN02244, PLN02244, tocopherol O-methyltransferase	NA|445aa|up_2|NZ_AP018254.1_5625584_5626919_+	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|232aa|up_1|NZ_AP018254.1_5626982_5627678_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|183aa|up_0|NZ_AP018254.1_5627860_5628409_-	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	cas2|91aa|down_0|NZ_AP018254.1_5631323_5631596_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_AP018254.1_5632021_5633026_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|214aa|down_2|NZ_AP018254.1_5633220_5633862_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|283aa|down_3|NZ_AP018254.1_5634034_5634883_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csc1gr5|238aa|down_4|NZ_AP018254.1_5635730_5636444_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|342aa|down_5|NZ_AP018254.1_5636449_5637475_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1122aa|down_6|NZ_AP018254.1_5637552_5640918_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|125aa|down_7|NZ_AP018254.1_5640923_5641298_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|92aa|down_8|NZ_AP018254.1_5641278_5641554_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|57aa|down_9|NZ_AP018254.1_5641546_5641717_-	pfam18755, RAMA, Restriction Enzyme Adenine Methylase Associated
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	48	5629116-5631051	12,46,9,13	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type I-D	GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC,GTTTCAATCCCTAATAGGGATTATTTAGTTTTGTAAC	37,37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	24,26,26,24	26	TypeI-D	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|52aa|up_4|NZ_AP018254.1_5624284_5624440_-,NA	NA|140aa|up_9|NZ_AP018254.1_5618820_5619240_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|164aa|up_8|NZ_AP018254.1_5619603_5620095_-	PRK13618, psbV, cytochrome c-550; Provisional	NA|225aa|up_7|NZ_AP018254.1_5620357_5621032_+	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|552aa|up_6|NZ_AP018254.1_5621095_5622751_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|379aa|up_5|NZ_AP018254.1_5623126_5624263_+	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|52aa|up_4|NZ_AP018254.1_5624284_5624440_-	NA	NA|281aa|up_3|NZ_AP018254.1_5624583_5625426_-	PLN02244, PLN02244, tocopherol O-methyltransferase	NA|445aa|up_2|NZ_AP018254.1_5625584_5626919_+	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|232aa|up_1|NZ_AP018254.1_5626982_5627678_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|183aa|up_0|NZ_AP018254.1_5627860_5628409_-	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	cas2|91aa|down_0|NZ_AP018254.1_5631323_5631596_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_AP018254.1_5632021_5633026_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|214aa|down_2|NZ_AP018254.1_5633220_5633862_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|283aa|down_3|NZ_AP018254.1_5634034_5634883_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csc1gr5|238aa|down_4|NZ_AP018254.1_5635730_5636444_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|342aa|down_5|NZ_AP018254.1_5636449_5637475_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1122aa|down_6|NZ_AP018254.1_5637552_5640918_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|125aa|down_7|NZ_AP018254.1_5640923_5641298_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|92aa|down_8|NZ_AP018254.1_5641278_5641554_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|57aa|down_9|NZ_AP018254.1_5641546_5641717_-	pfam18755, RAMA, Restriction Enzyme Adenine Methylase Associated
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	49	5646958-5647068	47	CRISPRCasFinder	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Type I-D	GGTGTAGGGATGGGAAAACAGGGGGTCGGAACAAACTAC	39	0	0	NA	NA	NA	1	1	TypeI-D	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA,NA|294aa|down_5|NZ_AP018254.1_5655231_5656113_-,NA|220aa|down_6|NZ_AP018254.1_5656266_5656926_-,NA|135aa|down_7|NZ_AP018254.1_5657512_5657917_+	csc1gr5|238aa|up_9|NZ_AP018254.1_5635730_5636444_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|342aa|up_8|NZ_AP018254.1_5636449_5637475_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1122aa|up_7|NZ_AP018254.1_5637552_5640918_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|125aa|up_6|NZ_AP018254.1_5640923_5641298_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|92aa|up_5|NZ_AP018254.1_5641278_5641554_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|57aa|up_4|NZ_AP018254.1_5641546_5641717_-	pfam18755, RAMA, Restriction Enzyme Adenine Methylase Associated	cas3|700aa|up_3|NZ_AP018254.1_5641697_5643797_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	WYL|285aa|up_2|NZ_AP018254.1_5643898_5644753_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|288aa|up_1|NZ_AP018254.1_5645072_5645936_+	PRK03982, PRK03982, heat shock protein HtpX; Provisional	NA|243aa|up_0|NZ_AP018254.1_5646087_5646816_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|868aa|down_0|NZ_AP018254.1_5647330_5649934_-	TIGR04075, Ser/Thr_phosphatase_family_protein, polynucleotide kinase-phosphatase	NA|465aa|down_1|NZ_AP018254.1_5650913_5652308_-	TIGR04074, Methyltransferase_type_12, 3' terminal RNA ribose 2'-O-methyltransferase Hen1	NA|411aa|down_2|NZ_AP018254.1_5652536_5653769_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|108aa|down_3|NZ_AP018254.1_5653815_5654139_-	pfam03091, CutA1, CutA1 divalent ion tolerance protein	NA|338aa|down_4|NZ_AP018254.1_5654195_5655209_-	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|294aa|down_5|NZ_AP018254.1_5655231_5656113_-	NA	NA|220aa|down_6|NZ_AP018254.1_5656266_5656926_-	NA	NA|135aa|down_7|NZ_AP018254.1_5657512_5657917_+	NA	NA|119aa|down_8|NZ_AP018254.1_5658088_5658445_+	pfam02152, FolB, Dihydroneopterin aldolase	NA|512aa|down_9|NZ_AP018254.1_5658639_5660175_+	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]
GCF_002368395.1_ASM236839v1	NZ_AP018254	Calothrix sp. NIES-3974	50	5665988-5666072	48	CRISPRCasFinder	no		DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	Orphan	TGGAATGACGGATTAATTTTCCTT	24	0	0	NA	NA	NA	1	1	Orphan	DinG,PD-DExK,cas14j,csa3,Cas14c_CAS-V-F,WYL,csx18,cas1,cas2,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7,cas3,csx1,cas6,csm5gr7,csm4gr5,csm2gr11,csx19,csx10gr5,c2c9_V-U4,RT,cas4,csc1gr5,csc2gr7,cas10d,csx3,csm6	NA|294aa|up_9|NZ_AP018254.1_5655231_5656113_-,NA|220aa|up_8|NZ_AP018254.1_5656266_5656926_-,NA|135aa|up_7|NZ_AP018254.1_5657512_5657917_+,NA|129aa|down_1|NZ_AP018254.1_5668213_5668600_+	NA|294aa|up_9|NZ_AP018254.1_5655231_5656113_-	NA	NA|220aa|up_8|NZ_AP018254.1_5656266_5656926_-	NA	NA|135aa|up_7|NZ_AP018254.1_5657512_5657917_+	NA	NA|119aa|up_6|NZ_AP018254.1_5658088_5658445_+	pfam02152, FolB, Dihydroneopterin aldolase	NA|512aa|up_5|NZ_AP018254.1_5658639_5660175_+	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|453aa|up_4|NZ_AP018254.1_5660246_5661605_+	PRK14901, PRK14901, 16S rRNA methyltransferase B; Provisional	NA|138aa|up_3|NZ_AP018254.1_5661663_5662077_+	cd07177, terB_like, tellurium resistance terB-like protein	NA|89aa|up_2|NZ_AP018254.1_5662230_5662497_+	pfam08869, XisI, XisI protein	NA|71aa|up_1|NZ_AP018254.1_5662621_5662834_+	COG2886, COG2886, Uncharacterized small protein [Function unknown]	NA|847aa|up_0|NZ_AP018254.1_5663381_5665922_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|492aa|down_0|NZ_AP018254.1_5666205_5667681_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|129aa|down_1|NZ_AP018254.1_5668213_5668600_+	NA	NA|720aa|down_2|NZ_AP018254.1_5669030_5671190_-	COG3972, COG3972, Superfamily I DNA and RNA helicases [General function prediction only]	NA|388aa|down_3|NZ_AP018254.1_5672415_5673579_-	PRK09510, tolA, cell envelope integrity inner membrane protein TolA; Provisional	NA|497aa|down_4|NZ_AP018254.1_5674142_5675633_+	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional	NA|427aa|down_5|NZ_AP018254.1_5676187_5677468_+	PRK00877, hisD, bifunctional histidinal dehydrogenase/ histidinol dehydrogenase; Reviewed	NA|479aa|down_6|NZ_AP018254.1_5677493_5678930_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|888aa|down_7|NZ_AP018254.1_5679066_5681730_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|578aa|down_8|NZ_AP018254.1_5682294_5684028_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|362aa|down_9|NZ_AP018254.1_5684137_5685223_-	cd01828, sialate_O-acetylesterase_like2, sialate_O-acetylesterase_like subfamily of the SGNH-hydrolases, a diverse family of lipases and esterases
