assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	3	483724-483795	3	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	CCCCCCTTAATAAGGGGGGTGCC	23	1	14	483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772|483747-483772	AP019314.1_524026-524001|AP019314.1_1084435-1084410|AP019314.1_1457163-1457138|AP019314.1_2855978-2855953|AP019314.1_3199585-3199560|AP019314.1_3597991-3598016|AP019314.1_4032108-4032083|AP019314.1_4880960-4880985|AP019314.1_5663648-5663673|AP019314.1_730904-730879|AP019314.1_2281582-2281557|AP019314.1_3796067-3796092|AP019314.1_4125535-4125560|AP019314.1_5224286-5224261	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|67aa|up_4|AP019314.1_479609_479810_-,NA|182aa|up_3|AP019314.1_479967_480513_+,NA|353aa|up_1|AP019314.1_481777_482836_+,NA|92aa|down_4|AP019314.1_488457_488733_-	NA|305aa|up_9|AP019314.1_473286_474201_-	PLN02679, PLN02679, hydrolase, alpha/beta fold family protein	NA|190aa|up_8|AP019314.1_474802_475372_+	TIGR04376, conserved_hypothetical_protein, TIGR04376 family protein	NA|574aa|up_7|AP019314.1_475506_477228_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|549aa|up_6|AP019314.1_477418_479065_-	pfam12452, DUF3685, Protein of unknown function (DUF3685)	NA|147aa|up_5|AP019314.1_479123_479564_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|67aa|up_4|AP019314.1_479609_479810_-	NA	NA|182aa|up_3|AP019314.1_479967_480513_+	NA	NA|157aa|up_2|AP019314.1_481128_481599_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|353aa|up_1|AP019314.1_481777_482836_+	NA	NA|267aa|up_0|AP019314.1_482845_483646_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|576aa|down_0|AP019314.1_483854_485582_+	COG0497, RecN, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|214aa|down_1|AP019314.1_485585_486227_-	pfam05685, Uma2, Putative restriction endonuclease	NA|214aa|down_2|AP019314.1_486251_486893_-	pfam05685, Uma2, Putative restriction endonuclease	NA|386aa|down_3|AP019314.1_487014_488172_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|92aa|down_4|AP019314.1_488457_488733_-	NA	NA|481aa|down_5|AP019314.1_489788_491231_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|510aa|down_6|AP019314.1_491213_492743_-	cd13613, PBP2_Opu_like_2, Substrate-binding domain of putative ABC-type osmoprotectant uptake system; the type 2 periplasmic-binding protein fold	NA|335aa|down_7|AP019314.1_492861_493866_+	COG5607, COG5607, Uncharacterized conserved protein [Function unknown]	NA|478aa|down_8|AP019314.1_493878_495312_+	PRK05500, PRK05500, bifunctional orotidine-5'-phosphate decarboxylase/orotate phosphoribosyltransferase	NA|95aa|down_9|AP019314.1_495665_495950_-	pfam08846, DUF1816, Domain of unknown function (DUF1816)
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	4	527534-527637	4	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	CCCGCACAGTAAAATCGGACAAAATCAATTTCT	33	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|318aa|up_8|AP019314.1_511600_512554_+,NA|74aa|down_0|AP019314.1_527996_528218_+,NA|256aa|down_4|AP019314.1_531797_532565_-,NA|142aa|down_7|AP019314.1_533548_533974_-	NA|319aa|up_9|AP019314.1_510514_511471_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|318aa|up_8|AP019314.1_511600_512554_+	NA	NA|1350aa|up_7|AP019314.1_512694_516744_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|562aa|up_6|AP019314.1_517044_518730_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|289aa|up_5|AP019314.1_519544_520411_+	sd00006, TPR, Tetratricopeptide repeat	NA|145aa|up_4|AP019314.1_520655_521090_+	COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion transport and metabolism]	NA|307aa|up_3|AP019314.1_521124_522045_+	TIGR00145, Uncharacterized_protein_slr0964, FTR1 family protein	NA|538aa|up_2|AP019314.1_522254_523868_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|374aa|up_1|AP019314.1_524292_525414_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|386aa|up_0|AP019314.1_525730_526888_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|74aa|down_0|AP019314.1_527996_528218_+	NA	NA|63aa|down_1|AP019314.1_528214_528403_+	cd18737, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|499aa|down_2|AP019314.1_528944_530441_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|413aa|down_3|AP019314.1_530510_531749_-	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|256aa|down_4|AP019314.1_531797_532565_-	NA	NA|81aa|down_5|AP019314.1_532684_532927_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|216aa|down_6|AP019314.1_532901_533549_-	pfam06951, PLA2G12, Group XII secretory phospholipase A2 precursor (PLA2G12)	NA|142aa|down_7|AP019314.1_533548_533974_-	NA	NA|195aa|down_8|AP019314.1_534035_534620_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|329aa|down_9|AP019314.1_535111_536098_-	cd12916, VKOR_1, Vitamin K epoxide reductase family in bacteria and plants
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	5	560540-560636	5	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	TCAGTAAACAGTAAACAGTAATCAG	25	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|161aa|up_6|AP019314.1_552484_552967_-,NA|67aa|up_4|AP019314.1_553614_553815_+,NA|139aa|down_0|AP019314.1_561121_561538_-,NA|101aa|down_4|AP019314.1_565146_565449_+,NA|69aa|down_5|AP019314.1_565582_565789_+	NA|1155aa|up_9|AP019314.1_544958_548423_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|801aa|up_8|AP019314.1_548667_551070_+	cd04268, ZnMc_MMP_like, Zinc-dependent metalloprotease, MMP_like subfamily	NA|149aa|up_7|AP019314.1_551764_552211_+	cd03467, Rieske, Rieske domain; a [2Fe-2S] cluster binding domain commonly found in Rieske non-heme iron oxygenase (RO) systems such as naphthalene and biphenyl dioxygenases, as well as in plant/cyanobacterial chloroplast b6f and mitochondrial cytochrome bc(1) complexes	NA|161aa|up_6|AP019314.1_552484_552967_-	NA	NA|76aa|up_5|AP019314.1_553376_553604_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|67aa|up_4|AP019314.1_553614_553815_+	NA	NA|124aa|up_3|AP019314.1_553772_554144_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|111aa|up_2|AP019314.1_554323_554656_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|1535aa|up_1|AP019314.1_554724_559329_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|296aa|up_0|AP019314.1_559629_560517_-	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	NA|139aa|down_0|AP019314.1_561121_561538_-	NA	NA|288aa|down_1|AP019314.1_561627_562491_-	TIGR02069, cyanophycinase, cyanophycinase	NA|159aa|down_2|AP019314.1_563874_564351_-	pfam05532, CsbD, CsbD-like	NA|61aa|down_3|AP019314.1_564643_564826_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|101aa|down_4|AP019314.1_565146_565449_+	NA	NA|69aa|down_5|AP019314.1_565582_565789_+	NA	cas14j|348aa|down_6|AP019314.1_565799_566843_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|385aa|down_7|AP019314.1_566901_568056_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|152aa|down_8|AP019314.1_569060_569516_-	pfam04972, BON, BON domain	NA|201aa|down_9|AP019314.1_569544_570147_-	pfam11181, YflT, Heat induced stress protein YflT
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	7	717767-721204	1,7,1	PILER-CR,CRISPRCasFinder,CRT	no	cas10d,csc2gr7,csc1gr5,WYL,cas8b5,cas7,cas5,cas3,cas6,cas4,cas1,cas2	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Type I-D	GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	47,47,47	47	TypeI-D	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	cas7|298aa|up_9|AP019314.1_708431_709325_+,cas5|275aa|up_8|AP019314.1_709328_710153_+,NA|89aa|down_9|AP019314.1_728997_729264_-	cas7|298aa|up_9|AP019314.1_708431_709325_+	NA	cas5|275aa|up_8|AP019314.1_709328_710153_+	NA	cas3|908aa|up_7|AP019314.1_710145_712869_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|189aa|up_6|AP019314.1_712916_713483_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|50aa|up_5|AP019314.1_713842_713992_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	NA|189aa|up_4|AP019314.1_714001_714568_+	cd06260, DUF820, Domain of unknown function (DUF820)	cas6|278aa|up_3|AP019314.1_714808_715642_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_2|AP019314.1_715644_716238_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|335aa|up_1|AP019314.1_716243_717248_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|91aa|up_0|AP019314.1_717260_717533_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|91aa|down_0|AP019314.1_721502_721775_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	NA|122aa|down_1|AP019314.1_721940_722306_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|310aa|down_2|AP019314.1_722430_723360_-	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|144aa|down_3|AP019314.1_723637_724069_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|406aa|down_4|AP019314.1_724129_725347_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|474aa|down_5|AP019314.1_725544_726966_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|294aa|down_6|AP019314.1_727145_728027_+	sd00006, TPR, Tetratricopeptide repeat	NA|80aa|down_7|AP019314.1_728315_728555_+	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]	NA|143aa|down_8|AP019314.1_728564_728993_+	cd18748, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|89aa|down_9|AP019314.1_728997_729264_-	NA
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	8	1060955-1061049	8	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	CTTATCAAGGGGGGCAGGGGGGATC	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|88aa|up_7|AP019314.1_1053129_1053393_-,NA|77aa|up_4|AP019314.1_1056154_1056385_-,NA|119aa|up_2|AP019314.1_1057790_1058147_-,NA|62aa|down_1|AP019314.1_1062942_1063128_+,NA|103aa|down_6|AP019314.1_1065294_1065603_+,NA|60aa|down_9|AP019314.1_1066640_1066820_-	NA|385aa|up_9|AP019314.1_1051116_1052271_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|156aa|up_8|AP019314.1_1052661_1053129_-	cd09874, PIN_MT3492-like, VapC-like PIN domain of the hypothetical protein MT3492 of Mycobacterium tuberculosis CDC1551 and other uncharacterized, annotated PilT protein domain proteins	NA|88aa|up_7|AP019314.1_1053129_1053393_-	NA	NA|276aa|up_6|AP019314.1_1053785_1054613_-	PLN02498, PLN02498, omega-3 fatty acid desaturase	NA|347aa|up_5|AP019314.1_1055121_1056162_+	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|77aa|up_4|AP019314.1_1056154_1056385_-	NA	NA|348aa|up_3|AP019314.1_1056750_1057794_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|119aa|up_2|AP019314.1_1057790_1058147_-	NA	NA|89aa|up_1|AP019314.1_1058692_1058959_-	pfam13529, Peptidase_C39_2, Peptidase_C39 like family	NA|457aa|up_0|AP019314.1_1059382_1060753_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|445aa|down_0|AP019314.1_1061361_1062696_-	pfam00931, NB-ARC, NB-ARC domain	NA|62aa|down_1|AP019314.1_1062942_1063128_+	NA	NA|212aa|down_2|AP019314.1_1063194_1063830_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|264aa|down_3|AP019314.1_1063816_1064608_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|136aa|down_4|AP019314.1_1064625_1065033_+	pfam06282, DUF1036, Protein of unknown function (DUF1036)	NA|70aa|down_5|AP019314.1_1065078_1065288_+	PRK09706, PRK09706, transcriptional repressor DicA; Reviewed	NA|103aa|down_6|AP019314.1_1065294_1065603_+	NA	NA|80aa|down_7|AP019314.1_1065842_1066082_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|165aa|down_8|AP019314.1_1066075_1066570_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|60aa|down_9|AP019314.1_1066640_1066820_-	NA
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	10	1979352-1979454	10	CRISPRCasFinder	no	cas14k	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	AACTTGCTTCCAATTCGTGAAGCGTCTGA	29	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|71aa|up_9|AP019314.1_1968204_1968417_-,NA|115aa|up_7|AP019314.1_1969887_1970232_+,NA|75aa|up_1|AP019314.1_1977220_1977445_-,NA|100aa|down_2|AP019314.1_1981397_1981697_+,NA|106aa|down_7|AP019314.1_1986405_1986723_-	NA|71aa|up_9|AP019314.1_1968204_1968417_-	NA	NA|278aa|up_8|AP019314.1_1968546_1969380_-	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|115aa|up_7|AP019314.1_1969887_1970232_+	NA	NA|478aa|up_6|AP019314.1_1970475_1971909_+	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|452aa|up_5|AP019314.1_1971961_1973317_-	COG0247, GlpC, Fe-S oxidoreductase [Energy production and conversion]	NA|429aa|up_4|AP019314.1_1973326_1974613_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|396aa|up_3|AP019314.1_1974869_1976057_+	PRK05942, PRK05942, aspartate aminotransferase; Provisional	NA|244aa|up_2|AP019314.1_1976154_1976886_+	PRK00024, PRK00024, DNA repair protein RadC	NA|75aa|up_1|AP019314.1_1977220_1977445_-	NA	NA|351aa|up_0|AP019314.1_1977921_1978974_+	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional	NA|125aa|down_0|AP019314.1_1979509_1979884_+	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|361aa|down_1|AP019314.1_1980209_1981292_-	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|100aa|down_2|AP019314.1_1981397_1981697_+	NA	NA|379aa|down_3|AP019314.1_1982713_1983850_-	PRK07360, PRK07360, FO synthase subunit 2; Reviewed	NA|234aa|down_4|AP019314.1_1983987_1984689_-	PRK13266, PRK13266, Thf1-like protein; Reviewed	NA|155aa|down_5|AP019314.1_1985041_1985506_+	pfam01668, SmpB, SmpB protein	NA|183aa|down_6|AP019314.1_1985624_1986173_+	pfam10989, DUF2808, Protein of unknown function (DUF2808)	NA|106aa|down_7|AP019314.1_1986405_1986723_-	NA	NA|769aa|down_8|AP019314.1_1987263_1989570_-	pfam13355, DUF4101, Protein of unknown function (DUF4101)	NA|345aa|down_9|AP019314.1_1991138_1992173_+	CHL00149, odpA, pyruvate dehydrogenase E1 component alpha subunit; Reviewed
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	11	2113284-2113425	11	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	ATAGGTAATCATCTGTAGGGTGGGTTAGGCAAAAATAAGTTATACTCTGACT	52	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA,NA|93aa|down_9|AP019314.1_2124500_2124779_+	NA|278aa|up_9|AP019314.1_2098359_2099193_+	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|101aa|up_8|AP019314.1_2099548_2099851_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|206aa|up_7|AP019314.1_2099908_2100526_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|180aa|up_6|AP019314.1_2100595_2101135_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|539aa|up_5|AP019314.1_2101576_2103193_+	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|256aa|up_4|AP019314.1_2103625_2104393_+	cd09083, EEP-1, Exonuclease-Endonuclease-Phosphatase domain; uncharacterized family 1	NA|178aa|up_3|AP019314.1_2104718_2105252_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|466aa|up_2|AP019314.1_2105396_2106794_+	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|625aa|up_1|AP019314.1_2106925_2108800_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|417aa|up_0|AP019314.1_2109389_2110640_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|144aa|down_0|AP019314.1_2113752_2114184_+	pfam14159, CAAD, CAAD domains of cyanobacterial aminoacyl-tRNA synthetase	NA|388aa|down_1|AP019314.1_2114596_2115760_-	cd17774, CBS_two-component_sensor_histidine_kinase_repeat2, 2 tandem repeats of the CBS domain in the two-component sensor histidine kinase and related-proteins, repeat 2	NA|152aa|down_2|AP019314.1_2115992_2116448_-	pfam07508, Recombinase, Recombinase	NA|108aa|down_3|AP019314.1_2116787_2117111_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|206aa|down_4|AP019314.1_2117568_2118186_+	cd04630, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|104aa|down_5|AP019314.1_2119110_2119422_+	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|542aa|down_6|AP019314.1_2119468_2121094_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|638aa|down_7|AP019314.1_2121442_2123356_+	sd00006, TPR, Tetratricopeptide repeat	NA|309aa|down_8|AP019314.1_2123352_2124279_+	pfam13191, AAA_16, AAA ATPase domain	NA|93aa|down_9|AP019314.1_2124500_2124779_+	NA
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	12	2157244-2157336	12	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	ATGAATTAACCCTACGATGAATTAACCCTAC	31	1	1	2157275-2157305	AP019314.1_2157229-2157259	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|263aa|up_0|AP019314.1_2156384_2157173_-,NA|275aa|down_5|AP019314.1_2160435_2161260_-	NA|474aa|up_9|AP019314.1_2147822_2149244_-	COG1928, PMT1, Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones]	NA|356aa|up_8|AP019314.1_2149422_2150490_-	cd05258, CDP_TE_SDR_e, CDP-tyvelose 2-epimerase, extended (e) SDRs	NA|260aa|up_7|AP019314.1_2150560_2151340_-	PRK13331, PRK13331, pantothenate kinase; Reviewed	NA|278aa|up_6|AP019314.1_2151443_2152277_-	COG1408, COG1408, Predicted phosphohydrolases [General function prediction only]	NA|412aa|up_5|AP019314.1_2152388_2153624_-	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|71aa|up_4|AP019314.1_2153777_2153990_+	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|314aa|up_3|AP019314.1_2154199_2155141_-	PLN00016, PLN00016, RNA-binding protein; Provisional	NA|149aa|up_2|AP019314.1_2155370_2155817_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|152aa|up_1|AP019314.1_2155920_2156376_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|263aa|up_0|AP019314.1_2156384_2157173_-	NA	NA|169aa|down_0|AP019314.1_2157449_2157956_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|182aa|down_1|AP019314.1_2158229_2158775_-	COG4968, PilE, Tfp pilus assembly protein PilE [Cell motility and secretion / Intracellular trafficking and secretion]	NA|107aa|down_2|AP019314.1_2158901_2159222_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|129aa|down_3|AP019314.1_2159306_2159693_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|144aa|down_4|AP019314.1_2159778_2160210_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|275aa|down_5|AP019314.1_2160435_2161260_-	NA	NA|353aa|down_6|AP019314.1_2161330_2162389_-	PRK13654, PRK13654, magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional	NA|655aa|down_7|AP019314.1_2162596_2164561_-	cd07338, M48B_HtpX_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|61aa|down_8|AP019314.1_2164936_2165119_+	CHL00152, rpl32, ribosomal protein L32; Validated	NA|200aa|down_9|AP019314.1_2165300_2165900_+	cd02109, arch_bact_SO_family_Moco, bacterial and archael members of the sulfite oxidase (SO) family of molybdopterin binding domains
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	13	2288947-2289054	13	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	GGGTGTTAGGGTGTTGGGGTGTTAG	25	2	33	2288972-2288986|2288972-2288986|2288972-2288986|2288972-2288986|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029|2289012-2289029	AP019314.1_409830-409844|AP019314.1_737951-737937|AP019314.1_1984955-1984941|AP019314.1_3052633-3052619|AP019314.1_1129309-1129326|AP019314.1_1650212-1650229|AP019314.1_1893156-1893173|AP019314.1_1925097-1925114|AP019314.1_2001275-2001292|AP019314.1_3052652-3052635|AP019314.1_3059091-3059108|AP019314.1_3550896-3550913|AP019314.1_4013946-4013929|AP019314.1_4149323-4149340|AP019314.1_4471358-4471341|AP019314.1_44124-44107|AP019314.1_285315-285298|AP019314.1_298194-298177|AP019314.1_298221-298204|AP019314.1_454452-454435|AP019314.1_1150610-1150627|AP019314.1_1245117-1245100|AP019314.1_1599487-1599504|AP019314.1_2371640-2371657|AP019314.1_2672824-2672807|AP019314.1_2692296-2692279|AP019314.1_2819102-2819119|AP019314.1_2902576-2902593|AP019314.1_4467841-4467858|AP019314.1_5115584-5115567|AP019314.1_5334621-5334604|AP019314.1_5459854-5459871|AP019314.1_5460917-5460934	NA	2	2	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA,NA	NA|348aa|up_9|AP019314.1_2276438_2277482_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|244aa|up_8|AP019314.1_2277548_2278280_-	TIGR02595, conserved_hypothetical_protein, PEP-CTERM protein-sorting domain	NA|272aa|up_7|AP019314.1_2279713_2280529_-	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|65aa|up_6|AP019314.1_2280798_2280993_+	CHL00104, rpl33, ribosomal protein L33	NA|72aa|up_5|AP019314.1_2281033_2281249_+	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|743aa|up_4|AP019314.1_2281892_2284121_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|248aa|up_3|AP019314.1_2284578_2285322_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|206aa|up_2|AP019314.1_2285305_2285923_+	PRK05920, PRK05920, aromatic acid decarboxylase; Validated	NA|205aa|up_1|AP019314.1_2286330_2286945_+	PRK14965, PRK14965, DNA polymerase III subunits gamma and tau; Provisional	NA|175aa|up_0|AP019314.1_2288403_2288928_+	CHL00100, ilvH, acetohydroxyacid synthase small subunit	NA|751aa|down_0|AP019314.1_2289131_2291384_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|327aa|down_1|AP019314.1_2291593_2292574_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|178aa|down_2|AP019314.1_2292710_2293244_-	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase	NA|365aa|down_3|AP019314.1_2293575_2294670_-	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|347aa|down_4|AP019314.1_2295023_2296064_+	COG1191, FliA, DNA-directed RNA polymerase specialized sigma subunit [Transcription]	NA|455aa|down_5|AP019314.1_2296080_2297445_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|887aa|down_6|AP019314.1_2297477_2300138_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|240aa|down_7|AP019314.1_2300245_2300965_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|450aa|down_8|AP019314.1_2300976_2302326_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|237aa|down_9|AP019314.1_2302480_2303191_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	15	2323430-2323549	15	CRISPRCasFinder	no	cas14k	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	GAGTGCGATGCCAAAGGCACTGCGTAGCAG	30	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|58aa|up_8|AP019314.1_2318923_2319097_-,NA|180aa|up_6|AP019314.1_2319776_2320316_+,NA|180aa|up_5|AP019314.1_2320449_2320989_+,NA|72aa|up_4|AP019314.1_2321083_2321299_+,NA|127aa|up_1|AP019314.1_2322171_2322552_+,NA|64aa|down_1|AP019314.1_2324680_2324872_+,NA|76aa|down_5|AP019314.1_2328113_2328341_+,NA|103aa|down_7|AP019314.1_2329868_2330177_+,NA|73aa|down_9|AP019314.1_2330671_2330890_-	NA|285aa|up_9|AP019314.1_2318065_2318920_+	pfam00582, Usp, Universal stress protein family	NA|58aa|up_8|AP019314.1_2318923_2319097_-	NA	NA|147aa|up_7|AP019314.1_2319077_2319518_-	TIGR02249, Integrase/recombinase_E2_protein	NA|180aa|up_6|AP019314.1_2319776_2320316_+	NA	NA|180aa|up_5|AP019314.1_2320449_2320989_+	NA	NA|72aa|up_4|AP019314.1_2321083_2321299_+	NA	NA|143aa|up_3|AP019314.1_2321291_2321720_+	pfam13650, Asp_protease_2, Aspartyl protease	NA|111aa|up_2|AP019314.1_2321767_2322100_+	pfam03992, ABM, Antibiotic biosynthesis monooxygenase	NA|127aa|up_1|AP019314.1_2322171_2322552_+	NA	NA|67aa|up_0|AP019314.1_2323015_2323216_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|153aa|down_0|AP019314.1_2324035_2324494_+	pfam13384, HTH_23, Homeodomain-like domain	NA|64aa|down_1|AP019314.1_2324680_2324872_+	NA	NA|123aa|down_2|AP019314.1_2325008_2325377_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|192aa|down_3|AP019314.1_2325400_2325976_+	TIGR03897, Lantibiotic_mersacidin_modifying_enzyme, type 2 lantibiotic biosynthesis protein LanM	NA|158aa|down_4|AP019314.1_2327637_2328111_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|76aa|down_5|AP019314.1_2328113_2328341_+	NA	NA|348aa|down_6|AP019314.1_2328759_2329803_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|103aa|down_7|AP019314.1_2329868_2330177_+	NA	NA|131aa|down_8|AP019314.1_2330286_2330679_-	cd00303, retropepsin_like, Retropepsins; pepsin-like aspartate proteases	NA|73aa|down_9|AP019314.1_2330671_2330890_-	NA
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	16	2334649-2334768	16	CRISPRCasFinder	no	cas14k,RT	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	GAGTGCGATGCCAAAGGCACTGCGTAGCAG	30	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|76aa|up_8|AP019314.1_2328113_2328341_+,NA|103aa|up_6|AP019314.1_2329868_2330177_+,NA|73aa|up_4|AP019314.1_2330671_2330890_-,NA|77aa|down_3|AP019314.1_2338070_2338301_+,NA|191aa|down_8|AP019314.1_2343212_2343785_+	NA|158aa|up_9|AP019314.1_2327637_2328111_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|76aa|up_8|AP019314.1_2328113_2328341_+	NA	NA|348aa|up_7|AP019314.1_2328759_2329803_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|103aa|up_6|AP019314.1_2329868_2330177_+	NA	NA|131aa|up_5|AP019314.1_2330286_2330679_-	cd00303, retropepsin_like, Retropepsins; pepsin-like aspartate proteases	NA|73aa|up_4|AP019314.1_2330671_2330890_-	NA	NA|75aa|up_3|AP019314.1_2331081_2331306_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|168aa|up_2|AP019314.1_2331507_2332011_+	pfam13565, HTH_32, Homeodomain-like domain	NA|249aa|up_1|AP019314.1_2331938_2332685_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|66aa|up_0|AP019314.1_2334237_2334435_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|153aa|down_0|AP019314.1_2335254_2335713_+	pfam13384, HTH_23, Homeodomain-like domain	NA|169aa|down_1|AP019314.1_2335899_2336406_+	pfam13358, DDE_3, DDE superfamily endonuclease	cas14k|361aa|down_2|AP019314.1_2336784_2337867_+	pfam01385, OrfB_IS605, Probable transposase	NA|77aa|down_3|AP019314.1_2338070_2338301_+	NA	NA|458aa|down_4|AP019314.1_2338509_2339883_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|106aa|down_5|AP019314.1_2339967_2340285_-	pfam07889, DUF1664, Protein of unknown function (DUF1664)	NA|103aa|down_6|AP019314.1_2340397_2340706_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|92aa|down_7|AP019314.1_2340818_2341094_-	pfam10779, XhlA, Haemolysin XhlA	NA|191aa|down_8|AP019314.1_2343212_2343785_+	NA	NA|174aa|down_9|AP019314.1_2343811_2344333_+	TIGR02646, Hypothetical_protein_SMc04429, TIGR02646 family protein
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	17	2345562-2345633	17	CRISPRCasFinder	no	cas14k,RT	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	GACACCCCCCTTATCAAGGGGGG	23	1	11	2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610|2345585-2345610	AP019314.1_524001-524026|AP019314.1_1084410-1084435|AP019314.1_1457138-1457163|AP019314.1_2855953-2855978|AP019314.1_3199560-3199585|AP019314.1_3598016-3597991|AP019314.1_4032083-4032108|AP019314.1_4880985-4880960|AP019314.1_5663673-5663648|AP019314.1_2281557-2281582|AP019314.1_3796092-3796067	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|77aa|up_7|AP019314.1_2338070_2338301_+,NA|191aa|up_2|AP019314.1_2343212_2343785_+,NA|108aa|down_5|AP019314.1_2355986_2356310_-,NA|134aa|down_8|AP019314.1_2358958_2359360_-	NA|169aa|up_9|AP019314.1_2335899_2336406_+	pfam13358, DDE_3, DDE superfamily endonuclease	cas14k|361aa|up_8|AP019314.1_2336784_2337867_+	pfam01385, OrfB_IS605, Probable transposase	NA|77aa|up_7|AP019314.1_2338070_2338301_+	NA	NA|458aa|up_6|AP019314.1_2338509_2339883_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|106aa|up_5|AP019314.1_2339967_2340285_-	pfam07889, DUF1664, Protein of unknown function (DUF1664)	NA|103aa|up_4|AP019314.1_2340397_2340706_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|92aa|up_3|AP019314.1_2340818_2341094_-	pfam10779, XhlA, Haemolysin XhlA	NA|191aa|up_2|AP019314.1_2343212_2343785_+	NA	NA|174aa|up_1|AP019314.1_2343811_2344333_+	TIGR02646, Hypothetical_protein_SMc04429, TIGR02646 family protein	NA|363aa|up_0|AP019314.1_2344407_2345496_+	cd17507, GT28_Beta-DGS-like, beta-diglucosyldiacylglycerol synthase and similar proteins	NA|664aa|down_0|AP019314.1_2345952_2347944_+	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|435aa|down_1|AP019314.1_2348072_2349377_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|399aa|down_2|AP019314.1_2349590_2350787_-	PRK00053, alr, alanine racemase; Reviewed	RT|615aa|down_3|AP019314.1_2351897_2353742_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	RT|492aa|down_4|AP019314.1_2354431_2355907_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|108aa|down_5|AP019314.1_2355986_2356310_-	NA	NA|456aa|down_6|AP019314.1_2356306_2357674_-	COG3950, COG3950, Predicted ATP-binding protein involved in virulence [General function prediction only]	NA|358aa|down_7|AP019314.1_2357801_2358875_-	CHL00081, chlI, Mg-protoporyphyrin IX chelatase	NA|134aa|down_8|AP019314.1_2358958_2359360_-	NA	NA|711aa|down_9|AP019314.1_2359653_2361786_-	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	18	2679457-2679558	18	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	TCATGACTGAGATATGTCTGAGTTTTTTGCGATAAAA	37	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|294aa|up_1|AP019314.1_2678222_2679104_-,NA|99aa|up_0|AP019314.1_2679090_2679387_-,NA|167aa|down_5|AP019314.1_2684045_2684546_+	NA|298aa|up_9|AP019314.1_2664844_2665738_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|306aa|up_8|AP019314.1_2665999_2666917_+	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|385aa|up_7|AP019314.1_2667805_2668960_-	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|990aa|up_6|AP019314.1_2669558_2672528_+	cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate dehydrogenase, RocA	NA|545aa|up_5|AP019314.1_2672972_2674607_+	cd03085, PGM1, Phosphoglucomutase 1 (PGM1) catalyzes the bidirectional interconversion of glucose-1-phosphate (G-1-P) and glucose-6-phosphate (G-6-P) via a glucose 1,6-diphosphate intermediate, an important metabolic step in prokaryotes and eukaryotes	NA|137aa|up_4|AP019314.1_2674886_2675297_-	cd07177, terB_like, tellurium resistance terB-like protein	NA|451aa|up_3|AP019314.1_2675286_2676639_-	PRK14901, PRK14901, 16S rRNA methyltransferase B; Provisional	NA|184aa|up_2|AP019314.1_2676635_2677187_-	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|294aa|up_1|AP019314.1_2678222_2679104_-	NA	NA|99aa|up_0|AP019314.1_2679090_2679387_-	NA	NA|419aa|down_0|AP019314.1_2679586_2680843_+	pfam14516, AAA_35, AAA-like domain	NA|374aa|down_1|AP019314.1_2681019_2682141_+	pfam14516, AAA_35, AAA-like domain	NA|158aa|down_2|AP019314.1_2682071_2682545_-	TIGR00836, Ammonium_transporter, ammonium transporter	NA|277aa|down_3|AP019314.1_2682597_2683428_-	pfam02517, Abi, CAAX protease self-immunity	NA|96aa|down_4|AP019314.1_2683420_2683708_-	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|167aa|down_5|AP019314.1_2684045_2684546_+	NA	NA|701aa|down_6|AP019314.1_2684569_2686672_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|78aa|down_7|AP019314.1_2687143_2687377_-	CHL00136, rpl31, ribosomal protein L31; Validated	NA|138aa|down_8|AP019314.1_2687396_2687810_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|152aa|down_9|AP019314.1_2687814_2688270_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	20	3044939-3045038	20	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	CAAGTTTGACAGCCTGTCTTTTGACAA	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|70aa|up_8|AP019314.1_3036319_3036529_-,NA|78aa|up_6|AP019314.1_3036779_3037013_-,NA|111aa|up_2|AP019314.1_3038799_3039132_-,NA|83aa|down_2|AP019314.1_3047541_3047790_+,NA|98aa|down_3|AP019314.1_3047757_3048051_-,NA|124aa|down_4|AP019314.1_3048651_3049023_-,NA|307aa|down_6|AP019314.1_3050699_3051620_-	NA|76aa|up_9|AP019314.1_3036047_3036275_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|70aa|up_8|AP019314.1_3036319_3036529_-	NA	NA|86aa|up_7|AP019314.1_3036515_3036773_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|78aa|up_6|AP019314.1_3036779_3037013_-	NA	NA|150aa|up_5|AP019314.1_3037204_3037654_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|112aa|up_4|AP019314.1_3038053_3038389_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|82aa|up_3|AP019314.1_3038546_3038792_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|111aa|up_2|AP019314.1_3038799_3039132_-	NA	NA|576aa|up_1|AP019314.1_3042453_3044181_-	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|126aa|up_0|AP019314.1_3044529_3044907_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|107aa|down_0|AP019314.1_3045099_3045420_+	PRK13697, PRK13697, cytochrome c6; Provisional	NA|560aa|down_1|AP019314.1_3045536_3047216_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|83aa|down_2|AP019314.1_3047541_3047790_+	NA	NA|98aa|down_3|AP019314.1_3047757_3048051_-	NA	NA|124aa|down_4|AP019314.1_3048651_3049023_-	NA	NA|194aa|down_5|AP019314.1_3050029_3050611_+	pfam12973, Cupin_7, ChrR Cupin-like domain	NA|307aa|down_6|AP019314.1_3050699_3051620_-	NA	NA|268aa|down_7|AP019314.1_3051696_3052500_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|446aa|down_8|AP019314.1_3052697_3054035_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|249aa|down_9|AP019314.1_3054314_3055061_-	pfam13358, DDE_3, DDE superfamily endonuclease
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	23	3356880-3357000	23	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	TAGGTGTTAAAAACTGTCAGACACCCCCCTTATCAAGG	38	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|44aa|up_4|AP019314.1_3352825_3352957_+,NA|300aa|up_3|AP019314.1_3352987_3353887_+,NA|228aa|down_5|AP019314.1_3366182_3366866_-	NA|178aa|up_9|AP019314.1_3346011_3346545_-	COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and chromosome partitioning]	NA|657aa|up_8|AP019314.1_3346914_3348885_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|82aa|up_7|AP019314.1_3349136_3349382_+	CHL00065, psaC, photosystem I subunit VII	NA|364aa|up_6|AP019314.1_3349909_3351001_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|550aa|up_5|AP019314.1_3351179_3352829_+	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|44aa|up_4|AP019314.1_3352825_3352957_+	NA	NA|300aa|up_3|AP019314.1_3352987_3353887_+	NA	NA|244aa|up_2|AP019314.1_3354439_3355171_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|243aa|up_1|AP019314.1_3355246_3355975_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|268aa|up_0|AP019314.1_3355977_3356781_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|171aa|down_0|AP019314.1_3357653_3358166_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|287aa|down_1|AP019314.1_3361023_3361884_-	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|347aa|down_2|AP019314.1_3362596_3363637_+	PRK07409, PRK07409, threonine synthase; Validated	NA|210aa|down_3|AP019314.1_3363666_3364296_-	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|505aa|down_4|AP019314.1_3364502_3366017_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|228aa|down_5|AP019314.1_3366182_3366866_-	NA	NA|71aa|down_6|AP019314.1_3367367_3367580_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|282aa|down_7|AP019314.1_3367638_3368484_-	PLN02536, PLN02536, diaminopimelate epimerase	NA|92aa|down_8|AP019314.1_3368482_3368758_+	cd01716, Hfq, bacterial Hfq-like	NA|391aa|down_9|AP019314.1_3369095_3370268_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	26	3586548-3586653	26	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	CCCACTTCCCTACACCCCACACCCCAC	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|329aa|up_9|AP019314.1_3577014_3578001_-,NA|87aa|up_6|AP019314.1_3580224_3580485_-,NA|337aa|up_5|AP019314.1_3580872_3581883_-,NA|277aa|down_3|AP019314.1_3590492_3591323_+,NA|59aa|down_4|AP019314.1_3591395_3591572_+,NA|322aa|down_7|AP019314.1_3592593_3593559_+	NA|329aa|up_9|AP019314.1_3577014_3578001_-	NA	NA|120aa|up_8|AP019314.1_3579052_3579412_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|136aa|up_7|AP019314.1_3579811_3580219_-	COG2402, COG2402, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|87aa|up_6|AP019314.1_3580224_3580485_-	NA	NA|337aa|up_5|AP019314.1_3580872_3581883_-	NA	NA|177aa|up_4|AP019314.1_3582133_3582664_-	pfam16261, DUF4915, Domain of unknown function (DUF4915)	NA|195aa|up_3|AP019314.1_3582753_3583338_-	pfam16261, DUF4915, Domain of unknown function (DUF4915)	NA|386aa|up_2|AP019314.1_3583646_3584804_+	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|217aa|up_1|AP019314.1_3584839_3585490_-	PRK01686, hisG, ATP phosphoribosyltransferase catalytic subunit; Reviewed	NA|250aa|up_0|AP019314.1_3585690_3586440_-	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|226aa|down_0|AP019314.1_3586705_3587383_-	PRK00507, PRK00507, deoxyribose-phosphate aldolase; Provisional	NA|870aa|down_1|AP019314.1_3587542_3590152_+	PRK09238, PRK09238, bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase; Validated	NA|111aa|down_2|AP019314.1_3590153_3590486_+	pfam10763, DUF2584, Protein of unknown function (DUF2584)	NA|277aa|down_3|AP019314.1_3590492_3591323_+	NA	NA|59aa|down_4|AP019314.1_3591395_3591572_+	NA	NA|98aa|down_5|AP019314.1_3591954_3592248_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|89aa|down_6|AP019314.1_3592195_3592462_+	pfam14384, BrnA_antitoxin, BrnA antitoxin of type II toxin-antitoxin system	NA|322aa|down_7|AP019314.1_3592593_3593559_+	NA	NA|74aa|down_8|AP019314.1_3593723_3593945_+	pfam08869, XisI, XisI protein	NA|299aa|down_9|AP019314.1_3594243_3595140_+	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	27	3754066-3754331	2,2,27	CRT,PILER-CR,CRISPRCasFinder	no	Cas14c_CAS-V-F,cas14k,c2c9_V-U4	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAAC,CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAA,CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAA	36,35,35	0	0	NA	NA	NA:NA:NA	3,2,2	3	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|239aa|up_5|AP019314.1_3749633_3750350_+,NA|82aa|up_2|AP019314.1_3752180_3752426_-,NA|108aa|up_1|AP019314.1_3753133_3753457_-,NA|71aa|up_0|AP019314.1_3753829_3754042_+,NA|63aa|down_3|AP019314.1_3757465_3757654_-	NA|69aa|up_9|AP019314.1_3746649_3746856_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|76aa|up_8|AP019314.1_3746858_3747086_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|178aa|up_7|AP019314.1_3747344_3747878_+	PRK09448, PRK09448, DNA starvation/stationary phase protection protein Dps; Provisional	NA|428aa|up_6|AP019314.1_3747910_3749194_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|239aa|up_5|AP019314.1_3749633_3750350_+	NA	NA|281aa|up_4|AP019314.1_3750577_3751420_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|244aa|up_3|AP019314.1_3751460_3752192_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|82aa|up_2|AP019314.1_3752180_3752426_-	NA	NA|108aa|up_1|AP019314.1_3753133_3753457_-	NA	NA|71aa|up_0|AP019314.1_3753829_3754042_+	NA	NA|303aa|down_0|AP019314.1_3754539_3755448_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|479aa|down_1|AP019314.1_3755602_3757039_+	cd00880, Era_like, E	NA|99aa|down_2|AP019314.1_3757184_3757481_-	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|63aa|down_3|AP019314.1_3757465_3757654_-	NA	NA|231aa|down_4|AP019314.1_3758127_3758820_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|190aa|down_5|AP019314.1_3758910_3759480_+	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain	cas14k|355aa|down_6|AP019314.1_3759482_3760547_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|60aa|down_7|AP019314.1_3760794_3760974_-	PRK08629, PRK08629, coproporphyrinogen III oxidase family protein	NA|359aa|down_8|AP019314.1_3761029_3762106_-	PRK10983, PRK10983, AI-2E family transporter YdiK	NA|256aa|down_9|AP019314.1_3762377_3763145_+	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	28	3829006-3829126	28	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	GGGTGTGGGGTGTAGGGTTTTACCGATTTTGAGG	34	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|67aa|up_2|AP019314.1_3826843_3827044_+,NA|164aa|up_0|AP019314.1_3828448_3828940_-,NA|81aa|down_0|AP019314.1_3829374_3829617_-,NA|115aa|down_4|AP019314.1_3832949_3833294_+,NA|337aa|down_5|AP019314.1_3833398_3834409_+,NA|50aa|down_7|AP019314.1_3838383_3838533_-	NA|320aa|up_9|AP019314.1_3820778_3821738_+	TIGR04447, hypothetical_protein, cyanobactin cluster PatC/TenC/TruC protein	NA|52aa|up_8|AP019314.1_3821882_3822038_+	TIGR04446, anacyclamide_precursor, prenylated cyclic peptide, anacyclamide/piricyclamide family	NA|264aa|up_7|AP019314.1_3822109_3822901_-	pfam05685, Uma2, Putative restriction endonuclease	NA|170aa|up_6|AP019314.1_3822968_3823478_+	cd07476, Peptidases_S8_thiazoline_oxidase_subtilisin-like_protease, Peptidase S8 family domain in Thiazoline oxidase/subtilisin-like proteases	NA|540aa|up_5|AP019314.1_3823444_3825064_+	TIGR03895, hypothetical_protein, cyanobactin maturation protease, PatA/PatG family	NA|342aa|up_4|AP019314.1_3825354_3826380_+	cd19080, AKR_AKR9A_9B, AKR9A and AKR9B families of aldo-keto reductase (AKR)	NA|136aa|up_3|AP019314.1_3826376_3826784_+	pfam06094, GGACT, Gamma-glutamyl cyclotransferase, AIG2-like	NA|67aa|up_2|AP019314.1_3826843_3827044_+	NA	NA|382aa|up_1|AP019314.1_3827328_3828474_+	COG0270, Dcm, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|164aa|up_0|AP019314.1_3828448_3828940_-	NA	NA|81aa|down_0|AP019314.1_3829374_3829617_-	NA	NA|84aa|down_1|AP019314.1_3829594_3829846_-	pfam01381, HTH_3, Helix-turn-helix	NA|453aa|down_2|AP019314.1_3829998_3831357_+	PRK00093, PRK00093, GTP-binding protein Der; Reviewed	NA|292aa|down_3|AP019314.1_3831360_3832236_+	pfam02361, CbiQ, Cobalt transport protein	NA|115aa|down_4|AP019314.1_3832949_3833294_+	NA	NA|337aa|down_5|AP019314.1_3833398_3834409_+	NA	NA|234aa|down_6|AP019314.1_3836777_3837479_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|50aa|down_7|AP019314.1_3838383_3838533_-	NA	NA|204aa|down_8|AP019314.1_3838908_3839520_+	pfam11780, DUF3318, Protein of unknown function (DUF3318)	NA|704aa|down_9|AP019314.1_3839635_3841747_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	29	4642866-4642956	29	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	GATTTGCGTTGAAGCACTTTTTTGC	25	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|117aa|up_7|AP019314.1_4632208_4632559_+,NA|84aa|down_2|AP019314.1_4644756_4645008_+,NA|263aa|down_7|AP019314.1_4648727_4649516_+	cas14j|396aa|up_9|AP019314.1_4629293_4630481_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|240aa|up_8|AP019314.1_4631044_4631764_-	TIGR03716, R_switched_YkoY, integral membrane protein, YkoY family	NA|117aa|up_7|AP019314.1_4632208_4632559_+	NA	NA|376aa|up_6|AP019314.1_4633130_4634258_+	PRK11783, rlmL, bifunctional 23S rRNA (guanine(2069)-N(7))-methyltransferase RlmK/23S rRNA (guanine(2445)-N(2))-methyltransferase RlmL	NA|326aa|up_5|AP019314.1_4634258_4635236_-	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|72aa|up_4|AP019314.1_4635262_4635478_-	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|884aa|up_3|AP019314.1_4635708_4638360_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|450aa|up_2|AP019314.1_4638538_4639888_-	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|371aa|up_1|AP019314.1_4640276_4641389_+	pfam12565, DUF3747, Protein of unknown function (DUF3747)	NA|331aa|up_0|AP019314.1_4641793_4642786_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|192aa|down_0|AP019314.1_4642980_4643556_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|263aa|down_1|AP019314.1_4643828_4644617_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|84aa|down_2|AP019314.1_4644756_4645008_+	NA	NA|388aa|down_3|AP019314.1_4645100_4646264_-	PLN02449, PLN02449, ferrochelatase	NA|208aa|down_4|AP019314.1_4646383_4647007_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|185aa|down_5|AP019314.1_4647106_4647661_+	COG2179, COG2179, Predicted hydrolase of the HAD superfamily [General function prediction only]	NA|252aa|down_6|AP019314.1_4647913_4648669_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|263aa|down_7|AP019314.1_4648727_4649516_+	NA	NA|331aa|down_8|AP019314.1_4649640_4650633_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|372aa|down_9|AP019314.1_4651144_4652260_+	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	30	4786733-4786829	30	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	AATAAAATTTTCCCAAGCTCTCAGA	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|145aa|up_7|AP019314.1_4774790_4775225_-,NA	NA|385aa|up_9|AP019314.1_4772183_4773338_-	pfam03235, DUF262, Protein of unknown function DUF262	NA|404aa|up_8|AP019314.1_4773568_4774780_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|145aa|up_7|AP019314.1_4774790_4775225_-	NA	NA|953aa|up_6|AP019314.1_4775328_4778187_-	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|191aa|up_5|AP019314.1_4778396_4778969_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|244aa|up_4|AP019314.1_4779469_4780201_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|287aa|up_3|AP019314.1_4780806_4781667_-	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|499aa|up_2|AP019314.1_4782157_4783654_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|264aa|up_1|AP019314.1_4784174_4784966_-	TIGR04211, hypothetical_protein, SH3 domain protein	NA|366aa|up_0|AP019314.1_4785025_4786123_-	pfam14239, RRXRR, RRXRR protein	NA|241aa|down_0|AP019314.1_4786887_4787610_+	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|209aa|down_1|AP019314.1_4787855_4788482_+	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|416aa|down_2|AP019314.1_4788655_4789903_+	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|255aa|down_3|AP019314.1_4789821_4790586_-	cd00144, MPP_PPP_family, phosphoprotein phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|231aa|down_4|AP019314.1_4790996_4791689_+	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|264aa|down_5|AP019314.1_4791675_4792467_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|130aa|down_6|AP019314.1_4792612_4793002_+	cd15487, bS6_chloro_cyano, 30S ribosomal protein S6 of chloroplasts and cyanobacteria	NA|219aa|down_7|AP019314.1_4793008_4793665_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|203aa|down_8|AP019314.1_4793871_4794480_-	pfam12644, DUF3782, Protein of unknown function (DUF3782)	NA|258aa|down_9|AP019314.1_4794914_4795688_+	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	32	4997424-4997504	32	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	AATCCGTCGTAACGCACCACCTATC	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA,NA|357aa|down_5|AP019314.1_5004675_5005746_-,NA|376aa|down_6|AP019314.1_5006413_5007541_-,NA|116aa|down_7|AP019314.1_5009124_5009472_+	NA|170aa|up_9|AP019314.1_4987878_4988388_+	COG1399, COG1399, Predicted metal-binding, possibly nucleic acid-binding protein [General function prediction only]	NA|321aa|up_8|AP019314.1_4988449_4989412_-	COG0859, RfaF, ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane]	NA|201aa|up_7|AP019314.1_4989538_4990141_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|428aa|up_6|AP019314.1_4990464_4991748_+	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional	NA|387aa|up_5|AP019314.1_4992024_4993185_-	cd08209, RLP_DK-MTP-1-P-enolase, 2,3-diketo-5-methylthiopentyl-1-phosphate enolase	NA|373aa|up_4|AP019314.1_4993506_4994625_+	cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) similar ot the Spinach thylakoid lumen protein TLP40	NA|249aa|up_3|AP019314.1_4994826_4995573_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|168aa|up_2|AP019314.1_4995500_4996004_-	pfam13565, HTH_32, Homeodomain-like domain	NA|249aa|up_1|AP019314.1_4996142_4996889_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|168aa|up_0|AP019314.1_4996816_4997320_-	pfam13565, HTH_32, Homeodomain-like domain	NA|506aa|down_0|AP019314.1_4997544_4999062_-	pfam10119, MethyTransf_Reg, Predicted methyltransferase regulatory domain	NA|124aa|down_1|AP019314.1_4999347_4999719_+	PRK13386, fliH, flagellar assembly protein H; Provisional	NA|292aa|down_2|AP019314.1_4999740_5000616_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|904aa|down_3|AP019314.1_5000852_5003564_-	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|250aa|down_4|AP019314.1_5003871_5004621_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|357aa|down_5|AP019314.1_5004675_5005746_-	NA	NA|376aa|down_6|AP019314.1_5006413_5007541_-	NA	NA|116aa|down_7|AP019314.1_5009124_5009472_+	NA	NA|476aa|down_8|AP019314.1_5010043_5011471_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|1021aa|down_9|AP019314.1_5011495_5014558_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	33	5479162-5479301	33	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Orphan	ATCGGCACCCCCCTTATCAAGGG	23	1	3	5479264-5479278|5479264-5479278|5479264-5479278	AP019314.1_2849205-2849191|AP019314.1_2849242-2849228|AP019314.1_2849279-2849265	NA	3	3	Orphan	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA,NA|127aa|down_2|AP019314.1_5480368_5480749_-	NA|266aa|up_9|AP019314.1_5467141_5467939_-	PRK14243, PRK14243, phosphate transporter ATP-binding protein; Provisional	NA|268aa|up_8|AP019314.1_5468096_5468900_-	PRK14243, PRK14243, phosphate transporter ATP-binding protein; Provisional	NA|298aa|up_7|AP019314.1_5469119_5470013_-	TIGR00974, 3a0107s02c, phosphate ABC transporter, permease protein PstA	NA|320aa|up_6|AP019314.1_5470411_5471371_-	COG0573, PstC, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism]	NA|374aa|up_5|AP019314.1_5471465_5472587_-	TIGR00975, precursor_PBP-3_PstS-3_Antigen_Ag88	NA|345aa|up_4|AP019314.1_5473224_5474259_+	cd13654, PBP2_phosphate_like_2, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|514aa|up_3|AP019314.1_5474512_5476054_-	PRK09566, nirA, ferredoxin-nitrite reductase; Reviewed	NA|213aa|up_2|AP019314.1_5476440_5477079_+	COG2802, COG2802, Uncharacterized protein, similar to the N-terminal domain of Lon protease [General function prediction only]	NA|201aa|up_1|AP019314.1_5477309_5477912_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|308aa|up_0|AP019314.1_5477908_5478832_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|164aa|down_0|AP019314.1_5479360_5479852_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|156aa|down_1|AP019314.1_5479862_5480330_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|127aa|down_2|AP019314.1_5480368_5480749_-	NA	NA|351aa|down_3|AP019314.1_5481091_5482144_-	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|348aa|down_4|AP019314.1_5482467_5483511_-	cd07025, Peptidase_S66, LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein	NA|97aa|down_5|AP019314.1_5483565_5483856_-	pfam10779, XhlA, Haemolysin XhlA	NA|156aa|down_6|AP019314.1_5483898_5484366_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|189aa|down_7|AP019314.1_5484527_5485094_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|283aa|down_8|AP019314.1_5485165_5486014_+	PRK07428, PRK07428, carboxylating nicotinate-nucleotide diphosphorylase	NA|266aa|down_9|AP019314.1_5486217_5487015_-	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase
GCA_003945305.1_ASM394530v1	AP019314	Microcystis viridis NIES-102 DNA, complete genome	34	5587805-5587939	34	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	Unclear	AATGATTGAATGAATACTATGATTGTGTTGTCTTCAGCA	39	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14j,c2c9_V-U4,cas14k,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,2OG_CAS,RT,DinG,csa3	NA|56aa|up_3|AP019314.1_5584061_5584229_-,NA|115aa|down_1|AP019314.1_5589488_5589833_+,NA|86aa|down_9|AP019314.1_5601551_5601809_-	NA|354aa|up_9|AP019314.1_5576960_5578022_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|88aa|up_8|AP019314.1_5578276_5578540_-	pfam11344, DUF3146, Protein of unknown function (DUF3146)	NA|307aa|up_7|AP019314.1_5578626_5579547_-	COG1808, COG1808, Predicted membrane protein [Function unknown]	cas14j|293aa|up_6|AP019314.1_5580327_5581206_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|186aa|up_5|AP019314.1_5582005_5582563_-	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|448aa|up_4|AP019314.1_5582590_5583934_-	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|56aa|up_3|AP019314.1_5584061_5584229_-	NA	NA|388aa|up_2|AP019314.1_5584289_5585453_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|348aa|up_1|AP019314.1_5585660_5586704_+	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|348aa|up_0|AP019314.1_5586706_5587750_+	COG3367, COG3367, Uncharacterized conserved protein [Function unknown]	NA|385aa|down_0|AP019314.1_5588110_5589265_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|115aa|down_1|AP019314.1_5589488_5589833_+	NA	NA|336aa|down_2|AP019314.1_5589962_5590970_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|334aa|down_3|AP019314.1_5593409_5594411_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|504aa|down_4|AP019314.1_5594960_5596472_-	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|746aa|down_5|AP019314.1_5596495_5598733_-	pfam05231, MASE1, MASE1	NA|211aa|down_6|AP019314.1_5599201_5599834_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|226aa|down_7|AP019314.1_5599853_5600531_-	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|244aa|down_8|AP019314.1_5600722_5601454_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|86aa|down_9|AP019314.1_5601551_5601809_-	NA
