assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	1	142678-142818	1	CRISPRCasFinder	no	cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TTATTTATTCTCCCCTCTCCCTTC	24	1	13	142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794	NZ_CP011304.1_1550360-1550377|NZ_CP011304.1_15993-15976|NZ_CP011304.1_287309-287292|NZ_CP011304.1_287323-287306|NZ_CP011304.1_675545-675528|NZ_CP011304.1_742640-742657|NZ_CP011304.1_1079158-1079141|NZ_CP011304.1_1282067-1282084|NZ_CP011304.1_1307434-1307451|NZ_CP011304.1_2065989-2066006|NZ_CP011304.1_2714208-2714191|NZ_CP011304.1_2731412-2731429|NZ_CP011304.1_3173703-3173720	NA	2	2	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|89aa|up_5|NZ_CP011304.1_137679_137946_+,NA|130aa|up_2|NZ_CP011304.1_140152_140542_-,NA|101aa|down_5|NZ_CP011304.1_146257_146560_+,NA|130aa|down_6|NZ_CP011304.1_146560_146950_+,NA|69aa|down_9|NZ_CP011304.1_149838_150045_+	NA|228aa|up_9|NZ_CP011304.1_130984_131668_-	COG4300, CadD, Predicted permease, cadmium resistance protein [Inorganic ion transport and metabolism]	NA|307aa|up_8|NZ_CP011304.1_131815_132736_+	cd08420, PBP2_CysL_like, C-terminal substrate binding domain of LysR-type transcriptional regulator CysL, which activates the transcription of the cysJI operon encoding sulfite reductase, contains the type 2 periplasmic binding fold	NA|114aa|up_7|NZ_CP011304.1_132873_133215_-	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|1331aa|up_6|NZ_CP011304.1_133468_137461_-	pfam12950, TaqI_C, TaqI-like C-terminal specificity domain	NA|89aa|up_5|NZ_CP011304.1_137679_137946_+	NA	NA|119aa|up_4|NZ_CP011304.1_138146_138503_-	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein	cas14j|405aa|up_3|NZ_CP011304.1_138827_140042_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|130aa|up_2|NZ_CP011304.1_140152_140542_-	NA	NA|114aa|up_1|NZ_CP011304.1_140753_141095_+	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|361aa|up_0|NZ_CP011304.1_141545_142628_+	smart00960, Robl_LC7, Roadblock/LC7 domain	NA|309aa|down_0|NZ_CP011304.1_142853_143780_+	TIGR00005, Ribosomal_large_subunit_pseudouridine_synthase_D, pseudouridine synthase, RluA family	NA|334aa|down_1|NZ_CP011304.1_143776_144778_-	PRK06270, PRK06270, homoserine dehydrogenase; Provisional	NA|72aa|down_2|NZ_CP011304.1_145088_145304_+	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|129aa|down_3|NZ_CP011304.1_145348_145735_+	CHL00193, ycf35, Ycf35; Provisional	NA|136aa|down_4|NZ_CP011304.1_145734_146142_+	pfam13370, Fer4_13, 4Fe-4S single cluster domain of Ferredoxin I	NA|101aa|down_5|NZ_CP011304.1_146257_146560_+	NA	NA|130aa|down_6|NZ_CP011304.1_146560_146950_+	NA	NA|239aa|down_7|NZ_CP011304.1_147217_147934_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|543aa|down_8|NZ_CP011304.1_147890_149519_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|69aa|down_9|NZ_CP011304.1_149838_150045_+	NA
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	2	455953-456066	2	CRISPRCasFinder	no	cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	AATCTCTAATAGGGGTTAAGATTAATGGGAACGCTGTAGGGTT	43	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|78aa|up_8|NZ_CP011304.1_447767_448001_+,NA|75aa|up_6|NZ_CP011304.1_448700_448925_+,NA	NA|322aa|up_9|NZ_CP011304.1_446567_447533_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|78aa|up_8|NZ_CP011304.1_447767_448001_+	NA	NA|91aa|up_7|NZ_CP011304.1_448447_448720_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|75aa|up_6|NZ_CP011304.1_448700_448925_+	NA	NA|771aa|up_5|NZ_CP011304.1_448928_451241_+	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|137aa|up_4|NZ_CP011304.1_451322_451733_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|395aa|up_3|NZ_CP011304.1_451699_452884_-	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]	NA|265aa|up_2|NZ_CP011304.1_453070_453865_+	PLN03100, PLN03100, Permease subunit of ER-derived-lipid transporter; Provisional	NA|458aa|up_1|NZ_CP011304.1_454062_455436_-	TIGR03279, cyano_FeS_chp, putative radical SAM enzyme, TIGR03279 family	NA|111aa|up_0|NZ_CP011304.1_455541_455874_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|626aa|down_0|NZ_CP011304.1_456184_458062_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|395aa|down_1|NZ_CP011304.1_458139_459324_-	sd00006, TPR, Tetratricopeptide repeat	cas14j|405aa|down_2|NZ_CP011304.1_459459_460674_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|492aa|down_3|NZ_CP011304.1_461113_462589_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|376aa|down_4|NZ_CP011304.1_463036_464164_+	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|360aa|down_5|NZ_CP011304.1_464296_465376_+	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|413aa|down_6|NZ_CP011304.1_465567_466806_+	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)	NA|435aa|down_7|NZ_CP011304.1_467238_468543_+	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|396aa|down_8|NZ_CP011304.1_469116_470304_-	pfam01098, FTSW_RODA_SPOVE, Cell cycle protein	NA|315aa|down_9|NZ_CP011304.1_470375_471320_-	cd14949, Asparaginase_2_like_3, Uncharacterized bacterial subfamily of the L-Asparaginase type 2-like enzymes, an Ntn-hydrolase family
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	3	507958-508044	3	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CAGTAAACAGTAATCGGTGCAAAGACAG	28	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA,NA	NA|141aa|up_9|NZ_CP011304.1_496896_497319_-	pfam01850, PIN, PIN domain	NA|70aa|up_8|NZ_CP011304.1_497303_497513_-	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|337aa|up_7|NZ_CP011304.1_497770_498781_-	pfam13651, EcoRI_methylase, Adenine-specific methyltransferase EcoRI	NA|267aa|up_6|NZ_CP011304.1_500100_500901_-	COG4577, CcmK, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|360aa|up_5|NZ_CP011304.1_501114_502194_+	pfam18578, Raf1_N, Rubisco accumulation factor 1 alpha helical domain	NA|346aa|up_4|NZ_CP011304.1_502467_503505_+	TIGR03609, S_layer_CsaB, polysaccharide pyruvyl transferase CsaB	NA|208aa|up_3|NZ_CP011304.1_503510_504134_+	pfam04313, HSDR_N, Type I restriction enzyme R protein N-terminus (HSDR_N)	NA|97aa|up_2|NZ_CP011304.1_504194_504485_-	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|289aa|up_1|NZ_CP011304.1_504481_505348_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|734aa|up_0|NZ_CP011304.1_505643_507845_-	COG3211, PhoX, Predicted phosphatase [General function prediction only]	NA|108aa|down_0|NZ_CP011304.1_508150_508474_+	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|229aa|down_1|NZ_CP011304.1_508511_509198_-	PRK01130, PRK01130, putative N-acetylmannosamine-6-phosphate 2-epimerase	NA|170aa|down_2|NZ_CP011304.1_509314_509824_-	pfam00719, Pyrophosphatase, Inorganic pyrophosphatase	NA|393aa|down_3|NZ_CP011304.1_511053_512232_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|672aa|down_4|NZ_CP011304.1_512403_514419_-	COG0557, VacB, Exoribonuclease R [Transcription]	NA|680aa|down_5|NZ_CP011304.1_514496_516536_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|178aa|down_6|NZ_CP011304.1_516676_517210_+	pfam14221, DUF4330, Domain of unknown function (DUF4330)	NA|197aa|down_7|NZ_CP011304.1_517352_517943_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|456aa|down_8|NZ_CP011304.1_518186_519554_-	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)	NA|140aa|down_9|NZ_CP011304.1_519654_520074_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	4	671329-671419	4	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	GCTAAAAAGTGCTTCAACGCAAATC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|84aa|up_1|NZ_CP011304.1_669459_669711_-,NA	NA|748aa|up_9|NZ_CP011304.1_660055_662299_-	cd04659, Piwi_piwi-like_ProArk, Piwi_piwi-like_ProArk: PIWI domain, Piwi-like subfamily found in Archaea and Bacteria	NA|264aa|up_8|NZ_CP011304.1_662302_663094_-	pfam12705, PDDEXK_1, PD-(D/E)XK nuclease superfamily	NA|119aa|up_7|NZ_CP011304.1_663290_663647_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|222aa|up_6|NZ_CP011304.1_663652_664318_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|332aa|up_5|NZ_CP011304.1_664358_665354_-	CHL00180, rbcR, LysR transcriptional regulator; Provisional	NA|185aa|up_4|NZ_CP011304.1_666791_667346_-	COG2179, COG2179, Predicted hydrolase of the HAD superfamily [General function prediction only]	NA|208aa|up_3|NZ_CP011304.1_667462_668086_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|388aa|up_2|NZ_CP011304.1_668205_669369_+	PLN02449, PLN02449, ferrochelatase	NA|84aa|up_1|NZ_CP011304.1_669459_669711_-	NA	NA|473aa|up_0|NZ_CP011304.1_669861_671280_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|369aa|down_0|NZ_CP011304.1_671498_672606_-	PRK00578, prfB, peptide chain release factor 2; Validated	NA|379aa|down_1|NZ_CP011304.1_672871_674008_-	pfam12565, DUF3747, Protein of unknown function (DUF3747)	NA|450aa|down_2|NZ_CP011304.1_674168_675518_+	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|876aa|down_3|NZ_CP011304.1_675682_678310_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|61aa|down_4|NZ_CP011304.1_678478_678661_-	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|502aa|down_5|NZ_CP011304.1_678833_680339_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|421aa|down_6|NZ_CP011304.1_680373_681636_-	PLN02855, PLN02855, Bifunctional selenocysteine lyase/cysteine desulfurase	NA|442aa|down_7|NZ_CP011304.1_681691_683017_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD	NA|257aa|down_8|NZ_CP011304.1_683260_684031_-	CHL00131, ycf16, sulfate ABC transporter protein; Validated	NA|481aa|down_9|NZ_CP011304.1_684228_685671_-	PRK11814, PRK11814, cysteine desulfurase activator complex subunit SufB; Provisional
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	5	751880-751970	5	CRISPRCasFinder	no	cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTCAGCTTTAAGACGCTCTTGTTCTAA	28	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA,NA|75aa|down_3|NZ_CP011304.1_757632_757857_+,NA|127aa|down_4|NZ_CP011304.1_757874_758255_-,NA|112aa|down_6|NZ_CP011304.1_759353_759689_+,NA|337aa|down_9|NZ_CP011304.1_770188_771199_-	NA|431aa|up_9|NZ_CP011304.1_739895_741188_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|352aa|up_8|NZ_CP011304.1_741337_742393_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|334aa|up_7|NZ_CP011304.1_742713_743715_+	PLN02326, PLN02326, 3-oxoacyl-[acyl-carrier-protein] synthase III	NA|309aa|up_6|NZ_CP011304.1_743937_744864_+	TIGR00128, Malonyl_CoA-acyl_carrier_protein_transacylase, malonyl CoA-acyl carrier protein transacylase	NA|407aa|up_5|NZ_CP011304.1_745094_746315_-	cd00887, MoeA, MoeA family	NA|296aa|up_4|NZ_CP011304.1_746326_747214_+	PRK00971, PRK00971, glutaminase; Provisional	NA|574aa|up_3|NZ_CP011304.1_747217_748939_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|478aa|up_2|NZ_CP011304.1_748935_750369_-	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|211aa|up_1|NZ_CP011304.1_750482_751115_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|224aa|up_0|NZ_CP011304.1_751136_751808_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|157aa|down_0|NZ_CP011304.1_752820_753291_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|405aa|down_1|NZ_CP011304.1_754127_755342_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|479aa|down_2|NZ_CP011304.1_755635_757072_+	TIGR03491, TIGR03491, RecB family nuclease, putative, TM0106 family	NA|75aa|down_3|NZ_CP011304.1_757632_757857_+	NA	NA|127aa|down_4|NZ_CP011304.1_757874_758255_-	NA	NA|206aa|down_5|NZ_CP011304.1_758483_759101_+	TIGR03328, dehydratase-enolase-phosphatase_complex_1, methylthioribulose-1-phosphate dehydratase	NA|112aa|down_6|NZ_CP011304.1_759353_759689_+	NA	NA|2587aa|down_7|NZ_CP011304.1_759881_767642_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|409aa|down_8|NZ_CP011304.1_768541_769768_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|337aa|down_9|NZ_CP011304.1_770188_771199_-	NA
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	6	1045396-1045495	6	CRISPRCasFinder	no	c2c9_V-U4	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type V-U4	TTGTCAAAAGACAAGCTGTCAAACTTG	27	0	0	NA	NA	NA	1	1	TypeV-U4	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|63aa|up_8|NZ_CP011304.1_1038275_1038464_-,NA|297aa|up_7|NZ_CP011304.1_1038621_1039512_-,NA|98aa|up_2|NZ_CP011304.1_1042451_1042745_-,NA|111aa|down_6|NZ_CP011304.1_1053941_1054274_+	NA|239aa|up_9|NZ_CP011304.1_1037107_1037824_-	cd06179, MFS_TRI12_like, Fungal trichothecene efflux pump (TRI12) of the Major Facilitator Superfamily of transporters	NA|63aa|up_8|NZ_CP011304.1_1038275_1038464_-	NA	NA|297aa|up_7|NZ_CP011304.1_1038621_1039512_-	NA	NA|361aa|up_6|NZ_CP011304.1_1039689_1040772_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|171aa|up_5|NZ_CP011304.1_1041040_1041553_+	pfam13551, HTH_29, Winged helix-turn helix	NA|163aa|up_4|NZ_CP011304.1_1041621_1042110_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|88aa|up_3|NZ_CP011304.1_1042198_1042462_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|98aa|up_2|NZ_CP011304.1_1042451_1042745_-	NA	NA|560aa|up_1|NZ_CP011304.1_1043217_1044897_-	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|107aa|up_0|NZ_CP011304.1_1045013_1045334_-	PRK13697, PRK13697, cytochrome c6; Provisional	NA|126aa|down_0|NZ_CP011304.1_1045526_1045904_+	PRK02710, PRK02710, plastocyanin; Provisional	NA|576aa|down_1|NZ_CP011304.1_1046346_1048074_+	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|243aa|down_2|NZ_CP011304.1_1048267_1048996_-	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|1251aa|down_3|NZ_CP011304.1_1049162_1052915_+	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|68aa|down_4|NZ_CP011304.1_1053279_1053483_+	pfam18506, RelB_N, RelB Antitoxin alpha helical domain	NA|90aa|down_5|NZ_CP011304.1_1053479_1053749_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|111aa|down_6|NZ_CP011304.1_1053941_1054274_+	NA	NA|82aa|down_7|NZ_CP011304.1_1054281_1054527_+	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|112aa|down_8|NZ_CP011304.1_1054684_1055020_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|73aa|down_9|NZ_CP011304.1_1055117_1055336_+	pfam10049, DUF2283, Protein of unknown function (DUF2283)
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	7	1133024-1133137	7	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	GCTTTTTCCCCAATTTCTGTCACTGAT	27	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|291aa|up_5|NZ_CP011304.1_1125265_1126138_+,NA|79aa|up_2|NZ_CP011304.1_1128470_1128707_-,NA|119aa|up_1|NZ_CP011304.1_1128824_1129181_-,NA	NA|154aa|up_9|NZ_CP011304.1_1121932_1122394_+	pfam11210, DUF2996, Protein of unknown function (DUF2996)	NA|126aa|up_8|NZ_CP011304.1_1122473_1122851_+	pfam11360, DUF3110, Protein of unknown function (DUF3110)	NA|140aa|up_7|NZ_CP011304.1_1123321_1123741_+	pfam02021, UPF0102, Uncharacterized protein family UPF0102	NA|267aa|up_6|NZ_CP011304.1_1124190_1124991_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|291aa|up_5|NZ_CP011304.1_1125265_1126138_+	NA	NA|232aa|up_4|NZ_CP011304.1_1126451_1127146_-	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|308aa|up_3|NZ_CP011304.1_1127482_1128406_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|79aa|up_2|NZ_CP011304.1_1128470_1128707_-	NA	NA|119aa|up_1|NZ_CP011304.1_1128824_1129181_-	NA	NA|1108aa|up_0|NZ_CP011304.1_1129295_1132619_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|63aa|down_0|NZ_CP011304.1_1133745_1133934_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|328aa|down_1|NZ_CP011304.1_1134098_1135082_+	PRK07452, PRK07452, DNA polymerase III subunit delta; Validated	NA|386aa|down_2|NZ_CP011304.1_1136977_1138135_-	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	NA|343aa|down_3|NZ_CP011304.1_1138326_1139355_+	PRK02746, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase PdxA	NA|455aa|down_4|NZ_CP011304.1_1139379_1140744_+	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like	NA|1110aa|down_5|NZ_CP011304.1_1140852_1144182_+	cd09178, PLDc_N_Snf2_like, N-terminal putative catalytic domain of uncharacterized HKD family nucleases fused to putative helicases from the Snf2-like family	NA|1139aa|down_6|NZ_CP011304.1_1144907_1148324_+	TIGR02987, m6_adenine_and_m5_cytosine_DNA_methyltransferase, type II restriction m6 adenine DNA methyltransferase, Alw26I/Eco31I/Esp3I family	NA|271aa|down_7|NZ_CP011304.1_1148549_1149362_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|194aa|down_8|NZ_CP011304.1_1149409_1149991_+	pfam12527, DUF3727, Protein of unknown function (DUF3727)	NA|277aa|down_9|NZ_CP011304.1_1150237_1151068_-	pfam00950, ABC-3, ABC 3 transport family
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	8	1232112-1232527	1,8,1	PILER-CR,CRISPRCasFinder,CRT	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	GTTTCCAACTAATCCTATTTGACCTAATAGGTAAGG,GTTTCCAACTAATCCTATTTGACCTAATAGGTAAGG,GTTTCCAACTAATCCTATTTGACCTAATAGGTAAGG	36,36,36	0	0	NA	NA	NA:NA:NA	5,5,5	5	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|204aa|up_3|NZ_CP011304.1_1228762_1229374_-,NA|63aa|down_3|NZ_CP011304.1_1236121_1236310_-,NA|307aa|down_6|NZ_CP011304.1_1238574_1239495_-	NA|277aa|up_9|NZ_CP011304.1_1223152_1223983_+	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|313aa|up_8|NZ_CP011304.1_1224026_1224965_+	smart00271, DnaJ, DnaJ molecular chaperone homology domain	NA|406aa|up_7|NZ_CP011304.1_1225003_1226221_+	PRK12292, hisZ, ATP phosphoribosyltransferase regulatory subunit; Provisional	NA|171aa|up_6|NZ_CP011304.1_1226306_1226819_+	pfam09626, DHC, Dihaem cytochrome c	NA|288aa|up_5|NZ_CP011304.1_1227161_1228025_-	COG1426, COG1426, Predicted transcriptional regulator contains Xre-like HTH domain [Function unknown]	NA|168aa|up_4|NZ_CP011304.1_1228257_1228761_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|204aa|up_3|NZ_CP011304.1_1228762_1229374_-	NA	NA|222aa|up_2|NZ_CP011304.1_1229508_1230174_+	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]	NA|317aa|up_1|NZ_CP011304.1_1230401_1231352_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|197aa|up_0|NZ_CP011304.1_1231357_1231948_+	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|75aa|down_0|NZ_CP011304.1_1233273_1233498_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|488aa|down_1|NZ_CP011304.1_1234129_1235593_+	cd00880, Era_like, E	NA|99aa|down_2|NZ_CP011304.1_1235840_1236137_-	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|63aa|down_3|NZ_CP011304.1_1236121_1236310_-	NA	NA|231aa|down_4|NZ_CP011304.1_1236664_1237357_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|248aa|down_5|NZ_CP011304.1_1237744_1238488_+	pfam12973, Cupin_7, ChrR Cupin-like domain	NA|307aa|down_6|NZ_CP011304.1_1238574_1239495_-	NA	NA|268aa|down_7|NZ_CP011304.1_1239571_1240375_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|446aa|down_8|NZ_CP011304.1_1240538_1241876_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|397aa|down_9|NZ_CP011304.1_1242039_1243230_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	9	1504223-1504342	9	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CTGATTCGGAGCATCTTTCAATTTGACAACGCC	33	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|297aa|up_3|NZ_CP011304.1_1498459_1499350_-,NA|168aa|up_2|NZ_CP011304.1_1499360_1499864_-,NA|561aa|up_0|NZ_CP011304.1_1502308_1503991_+,NA|263aa|down_4|NZ_CP011304.1_1508721_1509510_+	NA|278aa|up_9|NZ_CP011304.1_1493788_1494622_-	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|58aa|up_8|NZ_CP011304.1_1494924_1495098_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|317aa|up_7|NZ_CP011304.1_1495202_1496153_-	TIGR03965, glycosyltransferase_Rv0696_family, mycofactocin system glycosyltransferase	NA|206aa|up_6|NZ_CP011304.1_1496257_1496875_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|227aa|up_5|NZ_CP011304.1_1496869_1497550_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|133aa|up_4|NZ_CP011304.1_1497696_1498095_-	pfam08854, DUF1824, Domain of unknown function (DUF1824)	NA|297aa|up_3|NZ_CP011304.1_1498459_1499350_-	NA	NA|168aa|up_2|NZ_CP011304.1_1499360_1499864_-	NA	NA|666aa|up_1|NZ_CP011304.1_1500120_1502118_-	TIGR02442, Uncharacterized_protein_Rv2850c/MT2916, cobaltochelatase subunit	NA|561aa|up_0|NZ_CP011304.1_1502308_1503991_+	NA	NA|428aa|down_0|NZ_CP011304.1_1504348_1505632_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|178aa|down_1|NZ_CP011304.1_1505903_1506437_+	TIGR01710, Type_II_secretion_system_protein_G, type II secretion system protein G	NA|178aa|down_2|NZ_CP011304.1_1507212_1507746_+	TIGR01710, Type_II_secretion_system_protein_G, type II secretion system protein G	NA|182aa|down_3|NZ_CP011304.1_1508048_1508594_+	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|263aa|down_4|NZ_CP011304.1_1508721_1509510_+	NA	NA|312aa|down_5|NZ_CP011304.1_1510567_1511503_+	PLN00016, PLN00016, RNA-binding protein; Provisional	NA|71aa|down_6|NZ_CP011304.1_1511798_1512011_-	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|412aa|down_7|NZ_CP011304.1_1512164_1513400_+	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|277aa|down_8|NZ_CP011304.1_1513491_1514322_+	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|610aa|down_9|NZ_CP011304.1_1514572_1516402_+	pfam13424, TPR_12, Tetratricopeptide repeat
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	10	1800332-1800431	10	CRISPRCasFinder	no	csa3	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type I-A	TTATGAGAGAATTGTTAATCAAGGG	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|70aa|up_5|NZ_CP011304.1_1792850_1793060_+,NA|65aa|up_1|NZ_CP011304.1_1796110_1796305_-,NA|62aa|down_1|NZ_CP011304.1_1802011_1802197_-,NA|121aa|down_2|NZ_CP011304.1_1802641_1803004_+,NA|97aa|down_3|NZ_CP011304.1_1802963_1803254_+	NA|346aa|up_9|NZ_CP011304.1_1787731_1788769_-	PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase; Provisional	NA|466aa|up_8|NZ_CP011304.1_1789093_1790491_+	PRK01490, tig, trigger factor; Provisional	NA|234aa|up_7|NZ_CP011304.1_1790782_1791484_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|445aa|up_6|NZ_CP011304.1_1791494_1792829_+	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|70aa|up_5|NZ_CP011304.1_1792850_1793060_+	NA	NA|286aa|up_4|NZ_CP011304.1_1793037_1793895_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|75aa|up_3|NZ_CP011304.1_1793910_1794135_+	COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase [Posttranslational modification, protein turnover, chaperones]	NA|259aa|up_2|NZ_CP011304.1_1795074_1795851_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|65aa|up_1|NZ_CP011304.1_1796110_1796305_-	NA	NA|905aa|up_0|NZ_CP011304.1_1796648_1799363_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|246aa|down_0|NZ_CP011304.1_1801170_1801908_+	TIGR04500, PpiC_rel_mature, putative peptide maturation system protein	NA|62aa|down_1|NZ_CP011304.1_1802011_1802197_-	NA	NA|121aa|down_2|NZ_CP011304.1_1802641_1803004_+	NA	NA|97aa|down_3|NZ_CP011304.1_1802963_1803254_+	NA	NA|368aa|down_4|NZ_CP011304.1_1803407_1804511_-	pfam17914, HopA1, HopA1 effector protein family	NA|390aa|down_5|NZ_CP011304.1_1804535_1805705_-	pfam01636, APH, Phosphotransferase enzyme family	NA|317aa|down_6|NZ_CP011304.1_1806376_1807327_-	PRK00281, PRK00281, undecaprenyl-diphosphate phosphatase	NA|244aa|down_7|NZ_CP011304.1_1807599_1808331_+	COG0678, AHP1, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|178aa|down_8|NZ_CP011304.1_1808544_1809078_+	PRK09448, PRK09448, DNA starvation/stationary phase protection protein Dps; Provisional	NA|428aa|down_9|NZ_CP011304.1_1809121_1810405_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	11	1867756-1867854	11	CRISPRCasFinder	no	cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TCTTTACTAGGGGATAAGTTTGTACTTGTTAAG	33	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|88aa|up_8|NZ_CP011304.1_1862660_1862924_-,NA|71aa|up_5|NZ_CP011304.1_1864092_1864305_+,NA|128aa|up_3|NZ_CP011304.1_1865153_1865537_+,NA|108aa|down_3|NZ_CP011304.1_1876290_1876614_-,NA|126aa|down_9|NZ_CP011304.1_1887257_1887635_-	NA|155aa|up_9|NZ_CP011304.1_1861881_1862346_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|88aa|up_8|NZ_CP011304.1_1862660_1862924_-	NA	NA|170aa|up_7|NZ_CP011304.1_1862922_1863432_+	pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain	NA|149aa|up_6|NZ_CP011304.1_1863659_1864106_+	cd09874, PIN_MT3492-like, VapC-like PIN domain of the hypothetical protein MT3492 of Mycobacterium tuberculosis CDC1551 and other uncharacterized, annotated PilT protein domain proteins	NA|71aa|up_5|NZ_CP011304.1_1864092_1864305_+	NA	NA|81aa|up_4|NZ_CP011304.1_1864827_1865070_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|128aa|up_3|NZ_CP011304.1_1865153_1865537_+	NA	NA|128aa|up_2|NZ_CP011304.1_1865515_1865899_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|232aa|up_1|NZ_CP011304.1_1865914_1866609_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|103aa|up_0|NZ_CP011304.1_1867138_1867447_-	cd06554, ASCH_ASC-1_like, ASC-1 homology domain, ASC-1-like subfamily	NA|321aa|down_0|NZ_CP011304.1_1868787_1869750_+	cd01195, INT_C_like_5, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|1007aa|down_1|NZ_CP011304.1_1870804_1873825_-	COG4928, COG4928, Predicted P-loop ATPase [General function prediction only]	NA|745aa|down_2|NZ_CP011304.1_1873984_1876219_-	TIGR03296, hypothetical_protein, M6 family metalloprotease domain	NA|108aa|down_3|NZ_CP011304.1_1876290_1876614_-	NA	NA|330aa|down_4|NZ_CP011304.1_1878611_1879601_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|81aa|down_5|NZ_CP011304.1_1880872_1881115_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|146aa|down_6|NZ_CP011304.1_1881119_1881557_+	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|124aa|down_7|NZ_CP011304.1_1881725_1882097_+	cd05468, pVHL, von Hippel-Landau (pVHL) tumor suppressor protein	cas14j|371aa|down_8|NZ_CP011304.1_1882492_1883605_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|126aa|down_9|NZ_CP011304.1_1887257_1887635_-	NA
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	12	1882171-1882337	12	CRISPRCasFinder	no	cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	CTCTCTACTCGCCCTTAGAAATC	23	0	0	NA	NA	NA	2	2	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|108aa|up_4|NZ_CP011304.1_1876290_1876614_-,NA|126aa|down_1|NZ_CP011304.1_1887257_1887635_-,NA|75aa|down_2|NZ_CP011304.1_1887649_1887874_-,NA|78aa|down_3|NZ_CP011304.1_1888160_1888394_-,NA|185aa|down_4|NZ_CP011304.1_1888981_1889536_-,NA|90aa|down_5|NZ_CP011304.1_1889684_1889954_-	NA|232aa|up_9|NZ_CP011304.1_1865914_1866609_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|103aa|up_8|NZ_CP011304.1_1867138_1867447_-	cd06554, ASCH_ASC-1_like, ASC-1 homology domain, ASC-1-like subfamily	NA|321aa|up_7|NZ_CP011304.1_1868787_1869750_+	cd01195, INT_C_like_5, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|1007aa|up_6|NZ_CP011304.1_1870804_1873825_-	COG4928, COG4928, Predicted P-loop ATPase [General function prediction only]	NA|745aa|up_5|NZ_CP011304.1_1873984_1876219_-	TIGR03296, hypothetical_protein, M6 family metalloprotease domain	NA|108aa|up_4|NZ_CP011304.1_1876290_1876614_-	NA	NA|330aa|up_3|NZ_CP011304.1_1878611_1879601_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|81aa|up_2|NZ_CP011304.1_1880872_1881115_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|146aa|up_1|NZ_CP011304.1_1881119_1881557_+	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|124aa|up_0|NZ_CP011304.1_1881725_1882097_+	cd05468, pVHL, von Hippel-Landau (pVHL) tumor suppressor protein	cas14j|371aa|down_0|NZ_CP011304.1_1882492_1883605_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|126aa|down_1|NZ_CP011304.1_1887257_1887635_-	NA	NA|75aa|down_2|NZ_CP011304.1_1887649_1887874_-	NA	NA|78aa|down_3|NZ_CP011304.1_1888160_1888394_-	NA	NA|185aa|down_4|NZ_CP011304.1_1888981_1889536_-	NA	NA|90aa|down_5|NZ_CP011304.1_1889684_1889954_-	NA	NA|201aa|down_6|NZ_CP011304.1_1889950_1890553_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|232aa|down_7|NZ_CP011304.1_1891617_1892312_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|298aa|down_8|NZ_CP011304.1_1892460_1893353_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|333aa|down_9|NZ_CP011304.1_1893516_1894515_-	cd01168, adenosine_kinase, Adenosine kinase (AK) catalyzes the phosphorylation of ribofuranosyl-containing nucleoside analogues at the 5'-hydroxyl using ATP or GTP as the phosphate donor
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	13	2096384-2096505	13	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	AGTATTGGCAGTGTTGGCAGTAT	23	0	0	NA	NA	NA	2	2	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA,NA|69aa|down_1|NZ_CP011304.1_2097546_2097753_+,NA|70aa|down_2|NZ_CP011304.1_2097834_2098044_+,NA|123aa|down_7|NZ_CP011304.1_2102500_2102869_-	NA|158aa|up_9|NZ_CP011304.1_2084433_2084907_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|148aa|up_8|NZ_CP011304.1_2084854_2085298_-	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein	NA|130aa|up_7|NZ_CP011304.1_2085598_2085988_+	pfam14250, AbrB-like, AbrB-like transcriptional regulator	NA|250aa|up_6|NZ_CP011304.1_2086660_2087410_-	COG2859, COG2859, Uncharacterized protein conserved in bacteria [Function unknown]	NA|433aa|up_5|NZ_CP011304.1_2088009_2089308_-	PRK00077, eno, enolase; Provisional	NA|254aa|up_4|NZ_CP011304.1_2089354_2090116_-	pfam13026, DUF3887, Protein of unknown function (DUF3887)	NA|693aa|up_3|NZ_CP011304.1_2090231_2092310_-	COG1523, PulA, Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]	NA|290aa|up_2|NZ_CP011304.1_2092588_2093458_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|313aa|up_1|NZ_CP011304.1_2093540_2094479_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|569aa|up_0|NZ_CP011304.1_2094629_2096336_+	COG1226, Kch, Kef-type K+ transport systems, predicted NAD-binding component [Inorganic ion transport and metabolism]	NA|204aa|down_0|NZ_CP011304.1_2096704_2097316_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|69aa|down_1|NZ_CP011304.1_2097546_2097753_+	NA	NA|70aa|down_2|NZ_CP011304.1_2097834_2098044_+	NA	NA|189aa|down_3|NZ_CP011304.1_2098151_2098718_+	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|243aa|down_4|NZ_CP011304.1_2098714_2099443_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|553aa|down_5|NZ_CP011304.1_2099455_2101114_+	PRK13981, PRK13981, NAD synthetase; Provisional	NA|338aa|down_6|NZ_CP011304.1_2101490_2102504_-	COG0354, COG0354, Predicted aminomethyltransferase related to GcvT [General function prediction only]	NA|123aa|down_7|NZ_CP011304.1_2102500_2102869_-	NA	NA|345aa|down_8|NZ_CP011304.1_2103150_2104185_-	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|918aa|down_9|NZ_CP011304.1_2104909_2107663_-	COG1002, COG1002, Type II restriction enzyme, methylase subunits [Defense mechanisms]
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	14	2271790-2271886	14	CRISPRCasFinder	no	cas14k	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TCAGTAAACAGTAAACAGTAATCAG	25	1	1	2271815-2271861	NZ_CP011304.1_313615-313569	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|260aa|up_9|NZ_CP011304.1_2250858_2251638_+,NA|56aa|up_7|NZ_CP011304.1_2254118_2254286_+,NA|139aa|down_0|NZ_CP011304.1_2272491_2272908_-,NA|50aa|down_2|NZ_CP011304.1_2274114_2274264_-	NA|260aa|up_9|NZ_CP011304.1_2250858_2251638_+	NA	NA|641aa|up_8|NZ_CP011304.1_2251987_2253910_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|56aa|up_7|NZ_CP011304.1_2254118_2254286_+	NA	NA|404aa|up_6|NZ_CP011304.1_2254434_2255646_-	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|281aa|up_5|NZ_CP011304.1_2255854_2256697_+	PLN02244, PLN02244, tocopherol O-methyltransferase	NA|609aa|up_4|NZ_CP011304.1_2257549_2259376_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|1155aa|up_3|NZ_CP011304.1_2260012_2263477_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|729aa|up_2|NZ_CP011304.1_2263718_2265905_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|1535aa|up_1|NZ_CP011304.1_2265974_2270579_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|296aa|up_0|NZ_CP011304.1_2270879_2271767_-	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	NA|139aa|down_0|NZ_CP011304.1_2272491_2272908_-	NA	NA|288aa|down_1|NZ_CP011304.1_2273091_2273955_-	TIGR02069, cyanophycinase, cyanophycinase	NA|50aa|down_2|NZ_CP011304.1_2274114_2274264_-	NA	NA|159aa|down_3|NZ_CP011304.1_2274314_2274791_-	pfam05532, CsbD, CsbD-like	NA|61aa|down_4|NZ_CP011304.1_2275083_2275266_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|259aa|down_5|NZ_CP011304.1_2275437_2276214_-	COG3000, ERG3, Sterol desaturase [Lipid metabolism]	NA|770aa|down_6|NZ_CP011304.1_2276215_2278525_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|523aa|down_7|NZ_CP011304.1_2278554_2280123_-	pfam13282, DUF4070, Domain of unknown function (DUF4070)	cas14k|413aa|down_8|NZ_CP011304.1_2281909_2283148_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|198aa|down_9|NZ_CP011304.1_2283119_2283713_-	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	15	2769449-2769687	2	PILER-CR	no	cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type III-C,Type III-A,Type III-D,Type III-B	AGAAATTAATTGACTGGAAACA	22	0	0	NA	NA	NA	3	3	TypeIII-C,TypeIII-D,TypeV,TypeIII-A,TypeIII-B	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|103aa|up_8|NZ_CP011304.1_2757880_2758189_-,NA|151aa|up_6|NZ_CP011304.1_2759750_2760203_+,cmr5gr11|130aa|up_3|NZ_CP011304.1_2762772_2763162_-,NA|118aa|down_3|NZ_CP011304.1_2773083_2773437_+,NA|115aa|down_7|NZ_CP011304.1_2776769_2777114_-	NA|154aa|up_9|NZ_CP011304.1_2757256_2757718_-	cd04210, Cupredoxin_like_1, Uncharacterized Cupredoxin-like subfamily	NA|103aa|up_8|NZ_CP011304.1_2757880_2758189_-	NA	NA|281aa|up_7|NZ_CP011304.1_2758674_2759517_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|151aa|up_6|NZ_CP011304.1_2759750_2760203_+	NA	NA|91aa|up_5|NZ_CP011304.1_2760395_2760668_+	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	cmr6gr7|648aa|up_4|NZ_CP011304.1_2760765_2762709_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|130aa|up_3|NZ_CP011304.1_2762772_2763162_-	NA	cmr4gr7|260aa|up_2|NZ_CP011304.1_2763494_2764274_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|376aa|up_1|NZ_CP011304.1_2764708_2765836_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|1005aa|up_0|NZ_CP011304.1_2765808_2768823_-	pfam12469, DUF3692, CRISPR-associated protein	NA|206aa|down_0|NZ_CP011304.1_2769721_2770339_-	TIGR02595, conserved_hypothetical_protein, PEP-CTERM protein-sorting domain	NA|144aa|down_1|NZ_CP011304.1_2771005_2771437_-	pfam01797, Y1_Tnp, Transposase IS200 like	cas14j|420aa|down_2|NZ_CP011304.1_2771487_2772747_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|118aa|down_3|NZ_CP011304.1_2773083_2773437_+	NA	NA|142aa|down_4|NZ_CP011304.1_2773609_2774035_-	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|340aa|down_5|NZ_CP011304.1_2774205_2775225_+	pfam00891, Methyltransf_2, O-methyltransferase	NA|478aa|down_6|NZ_CP011304.1_2775267_2776701_-	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|115aa|down_7|NZ_CP011304.1_2776769_2777114_-	NA	NA|278aa|down_8|NZ_CP011304.1_2777487_2778321_+	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|367aa|down_9|NZ_CP011304.1_2778988_2780089_-	COG4748, COG4748, Uncharacterized conserved protein [Function unknown]
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	16	2929111-2929207	15	CRISPRCasFinder	no	RT	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TATCTATAGAACTAGAAAAGTTTACCAA	28	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|55aa|up_7|NZ_CP011304.1_2922168_2922333_-,NA|105aa|up_5|NZ_CP011304.1_2923475_2923790_-,NA|168aa|up_4|NZ_CP011304.1_2924080_2924584_+,NA|405aa|up_3|NZ_CP011304.1_2925070_2926285_+,NA|200aa|up_2|NZ_CP011304.1_2926333_2926933_+,NA	NA|142aa|up_9|NZ_CP011304.1_2920546_2920972_+	pfam04138, GtrA, GtrA-like protein	NA|311aa|up_8|NZ_CP011304.1_2920968_2921901_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|55aa|up_7|NZ_CP011304.1_2922168_2922333_-	NA	NA|132aa|up_6|NZ_CP011304.1_2922797_2923193_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|105aa|up_5|NZ_CP011304.1_2923475_2923790_-	NA	NA|168aa|up_4|NZ_CP011304.1_2924080_2924584_+	NA	NA|405aa|up_3|NZ_CP011304.1_2925070_2926285_+	NA	NA|200aa|up_2|NZ_CP011304.1_2926333_2926933_+	NA	NA|402aa|up_1|NZ_CP011304.1_2926954_2928160_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|302aa|up_0|NZ_CP011304.1_2928188_2929094_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|363aa|down_0|NZ_CP011304.1_2929984_2931073_-	pfam00180, Iso_dh, Isocitrate/isopropylmalate dehydrogenase	NA|79aa|down_1|NZ_CP011304.1_2931513_2931750_-	pfam01106, NifU, NifU-like domain	NA|216aa|down_2|NZ_CP011304.1_2931823_2932471_-	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|79aa|down_3|NZ_CP011304.1_2932811_2933048_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|622aa|down_4|NZ_CP011304.1_2933316_2935182_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|118aa|down_5|NZ_CP011304.1_2935292_2935646_+	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	RT|353aa|down_6|NZ_CP011304.1_2935642_2936701_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|164aa|down_7|NZ_CP011304.1_2937694_2938186_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|757aa|down_8|NZ_CP011304.1_2938334_2940605_+	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|488aa|down_9|NZ_CP011304.1_2940742_2942206_+	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	17	2934358-2934461	16	CRISPRCasFinder	no	RT	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	CAACCAAAGACACCCCTAGACAATAAAACTCC	32	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|168aa|up_9|NZ_CP011304.1_2924080_2924584_+,NA|405aa|up_8|NZ_CP011304.1_2925070_2926285_+,NA|200aa|up_7|NZ_CP011304.1_2926333_2926933_+,NA|73aa|down_7|NZ_CP011304.1_2945788_2946007_+	NA|168aa|up_9|NZ_CP011304.1_2924080_2924584_+	NA	NA|405aa|up_8|NZ_CP011304.1_2925070_2926285_+	NA	NA|200aa|up_7|NZ_CP011304.1_2926333_2926933_+	NA	NA|402aa|up_6|NZ_CP011304.1_2926954_2928160_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|302aa|up_5|NZ_CP011304.1_2928188_2929094_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|208aa|up_4|NZ_CP011304.1_2929200_2929824_-	pfam14218, COP23, Circadian oscillating protein COP23	NA|363aa|up_3|NZ_CP011304.1_2929984_2931073_-	pfam00180, Iso_dh, Isocitrate/isopropylmalate dehydrogenase	NA|79aa|up_2|NZ_CP011304.1_2931513_2931750_-	pfam01106, NifU, NifU-like domain	NA|216aa|up_1|NZ_CP011304.1_2931823_2932471_-	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|79aa|up_0|NZ_CP011304.1_2932811_2933048_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|118aa|down_0|NZ_CP011304.1_2935292_2935646_+	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	RT|353aa|down_1|NZ_CP011304.1_2935642_2936701_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|164aa|down_2|NZ_CP011304.1_2937694_2938186_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|757aa|down_3|NZ_CP011304.1_2938334_2940605_+	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|488aa|down_4|NZ_CP011304.1_2940742_2942206_+	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional	NA|572aa|down_5|NZ_CP011304.1_2942420_2944136_-	TIGR04520, ECF_ATPase_1, energy-coupling factor transporter ATPase	NA|447aa|down_6|NZ_CP011304.1_2944331_2945672_-	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|73aa|down_7|NZ_CP011304.1_2945788_2946007_+	NA	NA|116aa|down_8|NZ_CP011304.1_2946010_2946358_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|181aa|down_9|NZ_CP011304.1_2946399_2946942_+	cd03017, PRX_BCP, Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of  thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	18	3306602-3306697	17	CRISPRCasFinder	no	cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	ATCAAGGGGGGATCAAGGGGGGATC	25	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|256aa|up_1|NZ_CP011304.1_3303994_3304762_+,NA|155aa|down_6|NZ_CP011304.1_3316308_3316773_-	NA|388aa|up_9|NZ_CP011304.1_3294106_3295270_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|921aa|up_8|NZ_CP011304.1_3295558_3298321_-	COG1002, COG1002, Type II restriction enzyme, methylase subunits [Defense mechanisms]	NA|156aa|up_7|NZ_CP011304.1_3298531_3298999_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|92aa|up_6|NZ_CP011304.1_3299169_3299445_-	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|435aa|up_5|NZ_CP011304.1_3299551_3300856_-	PRK07591, PRK07591, threonine synthase; Validated	NA|182aa|up_4|NZ_CP011304.1_3301632_3302178_-	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]	NA|226aa|up_3|NZ_CP011304.1_3302181_3302859_-	COG1573, COG1573, Uracil-DNA glycosylase [DNA replication, recombination, and repair]	NA|319aa|up_2|NZ_CP011304.1_3303002_3303959_-	PRK00089, era, GTPase Era; Reviewed	NA|256aa|up_1|NZ_CP011304.1_3303994_3304762_+	NA	NA|427aa|up_0|NZ_CP011304.1_3305132_3306413_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|560aa|down_0|NZ_CP011304.1_3306756_3308436_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|671aa|down_1|NZ_CP011304.1_3309221_3311234_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|504aa|down_2|NZ_CP011304.1_3311530_3313042_+	PRK09224, PRK09224, threonine ammonia-lyase IlvA	NA|291aa|down_3|NZ_CP011304.1_3313072_3313945_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|389aa|down_4|NZ_CP011304.1_3314085_3315252_-	PRK05957, PRK05957, pyridoxal phosphate-dependent aminotransferase	NA|123aa|down_5|NZ_CP011304.1_3315251_3315620_-	pfam00498, FHA, FHA domain	NA|155aa|down_6|NZ_CP011304.1_3316308_3316773_-	NA	NA|490aa|down_7|NZ_CP011304.1_3316956_3318426_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|201aa|down_8|NZ_CP011304.1_3318428_3319031_+	COG2839, COG2839, Uncharacterized protein conserved in bacteria [Function unknown]	NA|448aa|down_9|NZ_CP011304.1_3319124_3320468_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	19	3438700-3438803	18	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CCCCTTCGACGTAGCTCAGGGCAAGC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA,NA	NA|104aa|up_9|NZ_CP011304.1_3427745_3428057_-	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|95aa|up_8|NZ_CP011304.1_3428822_3429107_-	pfam07862, Nif11, Nif11 domain	NA|447aa|up_7|NZ_CP011304.1_3429492_3430833_+	PRK14333, PRK14333, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|129aa|up_6|NZ_CP011304.1_3430897_3431284_+	pfam10184, DUF2358, Uncharacterized conserved protein (DUF2358)	NA|471aa|up_5|NZ_CP011304.1_3431293_3432706_-	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|65aa|up_4|NZ_CP011304.1_3432882_3433077_+	CHL00104, rpl33, ribosomal protein L33	NA|72aa|up_3|NZ_CP011304.1_3433117_3433333_+	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|743aa|up_2|NZ_CP011304.1_3433831_3436060_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|248aa|up_1|NZ_CP011304.1_3436640_3437384_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|206aa|up_0|NZ_CP011304.1_3437367_3437985_+	PRK05920, PRK05920, aromatic acid decarboxylase; Validated	NA|476aa|down_0|NZ_CP011304.1_3439294_3440722_+	PRK00654, glgA, glycogen synthase GlgA	NA|207aa|down_1|NZ_CP011304.1_3440771_3441392_-	cd01457, vWA_ORF176_type, VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|111aa|down_2|NZ_CP011304.1_3441586_3441919_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|92aa|down_3|NZ_CP011304.1_3441998_3442274_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|439aa|down_4|NZ_CP011304.1_3442696_3444013_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|387aa|down_5|NZ_CP011304.1_3444107_3445268_+	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|385aa|down_6|NZ_CP011304.1_3445271_3446426_+	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|249aa|down_7|NZ_CP011304.1_3446589_3447336_+	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|233aa|down_8|NZ_CP011304.1_3447374_3448073_+	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|408aa|down_9|NZ_CP011304.1_3448094_3449318_+	TIGR00275, TIGR00275, flavoprotein, HI0933 family
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	20	3542009-3542117	19	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CCCTGAGAGAAGCCGATACTTCCCCCATTGTCCC	34	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|115aa|up_8|NZ_CP011304.1_3529622_3529967_-,NA|338aa|up_3|NZ_CP011304.1_3537112_3538126_-,NA|550aa|down_7|NZ_CP011304.1_3560935_3562585_+,NA|181aa|down_8|NZ_CP011304.1_3562803_3563346_+	NA|183aa|up_9|NZ_CP011304.1_3526706_3527255_-	pfam13646, HEAT_2, HEAT repeats	NA|115aa|up_8|NZ_CP011304.1_3529622_3529967_-	NA	NA|245aa|up_7|NZ_CP011304.1_3530158_3530893_-	pfam02557, VanY, D-alanyl-D-alanine carboxypeptidase	NA|464aa|up_6|NZ_CP011304.1_3530924_3532316_-	COG0281, SfcA, Malic enzyme [Energy production and conversion]	NA|317aa|up_5|NZ_CP011304.1_3532514_3533465_+	TIGR01249, Putative_proline_iminopeptidase, proline iminopeptidase, Neisseria-type subfamily	NA|317aa|up_4|NZ_CP011304.1_3535204_3536155_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|338aa|up_3|NZ_CP011304.1_3537112_3538126_-	NA	NA|311aa|up_2|NZ_CP011304.1_3538375_3539308_+	pfam13401, AAA_22, AAA domain	NA|351aa|up_1|NZ_CP011304.1_3539536_3540589_+	pfam04307, YdjM, LexA-binding, inner membrane-associated putative hydrolase	NA|365aa|up_0|NZ_CP011304.1_3540581_3541676_+	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|107aa|down_0|NZ_CP011304.1_3543568_3543889_+	pfam02321, OEP, Outer membrane efflux protein	NA|232aa|down_1|NZ_CP011304.1_3544011_3544706_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|108aa|down_2|NZ_CP011304.1_3544769_3545093_+	PRK09974, PRK09974, type II toxin-antitoxin system PrlF family antitoxin	NA|499aa|down_3|NZ_CP011304.1_3545507_3547004_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|963aa|down_4|NZ_CP011304.1_3547357_3550246_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|1315aa|down_5|NZ_CP011304.1_3550334_3554279_+	NF033451, BREX_2_MTaseX, BREX-2 system adenine-specific DNA-methyltransferase PglX	NA|1782aa|down_6|NZ_CP011304.1_3554687_3560033_+	cd17923, DEXHc_Hrq1-like, DEAH-box helicase domain of Hrq1 and similar proteins	NA|550aa|down_7|NZ_CP011304.1_3560935_3562585_+	NA	NA|181aa|down_8|NZ_CP011304.1_3562803_3563346_+	NA	NA|202aa|down_9|NZ_CP011304.1_3563445_3564051_-	pfam02517, Abi, CAAX protease self-immunity
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	21	3599679-3599861	20	CRISPRCasFinder	no	cas2,cas1,cas4,cas6,cas3,cas5	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37	0	0	NA	NA	I-D,II-B	2	2	Unclear	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA,NA|383aa|down_1|NZ_CP011304.1_3607003_3608152_-,cas5|275aa|down_9|NZ_CP011304.1_3618873_3619698_-	NA|568aa|up_9|NZ_CP011304.1_3585656_3587360_-	TIGR00815, Sulfate_transporter, high affinity sulphate transporter 1	NA|655aa|up_8|NZ_CP011304.1_3587659_3589624_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|81aa|up_7|NZ_CP011304.1_3589769_3590012_+	COG1327, COG1327, Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]	NA|272aa|up_6|NZ_CP011304.1_3590019_3590835_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|227aa|up_5|NZ_CP011304.1_3591107_3591788_+	COG5401, COG5401, Spore germination protein [General function prediction only]	NA|354aa|up_4|NZ_CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_3|NZ_CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_2|NZ_CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_1|NZ_CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|298aa|up_0|NZ_CP011304.1_3598665_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|down_0|NZ_CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|383aa|down_1|NZ_CP011304.1_3607003_3608152_-	NA	NA|499aa|down_2|NZ_CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	cas2|91aa|down_3|NZ_CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_4|NZ_CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_5|NZ_CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_6|NZ_CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_7|NZ_CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_8|NZ_CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|275aa|down_9|NZ_CP011304.1_3618873_3619698_-	NA
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	22	3601305-3606945	21,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37	1	1	3603206-3603240	NZ_CP026286.1_5265-5231	I-D,II-B:I-D,II-B:I-D,II-B	78,78,26	78	Unclear	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA,NA|383aa|down_0|NZ_CP011304.1_3607003_3608152_-,cas5|275aa|down_8|NZ_CP011304.1_3618873_3619698_-,cas7|298aa|down_9|NZ_CP011304.1_3619701_3620595_-	NA|655aa|up_9|NZ_CP011304.1_3587659_3589624_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|81aa|up_8|NZ_CP011304.1_3589769_3590012_+	COG1327, COG1327, Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]	NA|272aa|up_7|NZ_CP011304.1_3590019_3590835_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|227aa|up_6|NZ_CP011304.1_3591107_3591788_+	COG5401, COG5401, Spore germination protein [General function prediction only]	NA|354aa|up_5|NZ_CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_4|NZ_CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_3|NZ_CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_2|NZ_CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|298aa|up_1|NZ_CP011304.1_3598665_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|up_0|NZ_CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|383aa|down_0|NZ_CP011304.1_3607003_3608152_-	NA	NA|499aa|down_1|NZ_CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	cas2|91aa|down_2|NZ_CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_3|NZ_CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_4|NZ_CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_5|NZ_CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_6|NZ_CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_7|NZ_CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|275aa|down_8|NZ_CP011304.1_3618873_3619698_-	NA	cas7|298aa|down_9|NZ_CP011304.1_3619701_3620595_-	NA
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	23	3608265-3608953	4,22,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	9,9,9	9	Unclear	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|383aa|up_0|NZ_CP011304.1_3607003_3608152_-,cas5|275aa|down_7|NZ_CP011304.1_3618873_3619698_-,cas7|298aa|down_8|NZ_CP011304.1_3619701_3620595_-	NA|81aa|up_9|NZ_CP011304.1_3589769_3590012_+	COG1327, COG1327, Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]	NA|272aa|up_8|NZ_CP011304.1_3590019_3590835_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|227aa|up_7|NZ_CP011304.1_3591107_3591788_+	COG5401, COG5401, Spore germination protein [General function prediction only]	NA|354aa|up_6|NZ_CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_5|NZ_CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_4|NZ_CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_3|NZ_CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|298aa|up_2|NZ_CP011304.1_3598665_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|up_1|NZ_CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|383aa|up_0|NZ_CP011304.1_3607003_3608152_-	NA	NA|499aa|down_0|NZ_CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	cas2|91aa|down_1|NZ_CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_2|NZ_CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_3|NZ_CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_4|NZ_CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_5|NZ_CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_6|NZ_CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|275aa|down_7|NZ_CP011304.1_3618873_3619698_-	NA	cas7|298aa|down_8|NZ_CP011304.1_3619701_3620595_-	NA	cas8b5|844aa|down_9|NZ_CP011304.1_3620597_3623129_-	PRK12704, PRK12704, phosphodiesterase; Provisional
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	24	3610677-3612581	5,23,4	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	26,26,26	26	Unclear	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|383aa|up_1|NZ_CP011304.1_3607003_3608152_-,cas5|275aa|down_6|NZ_CP011304.1_3618873_3619698_-,cas7|298aa|down_7|NZ_CP011304.1_3619701_3620595_-,NA|48aa|down_9|NZ_CP011304.1_3623216_3623360_-	NA|272aa|up_9|NZ_CP011304.1_3590019_3590835_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|227aa|up_8|NZ_CP011304.1_3591107_3591788_+	COG5401, COG5401, Spore germination protein [General function prediction only]	NA|354aa|up_7|NZ_CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_6|NZ_CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_5|NZ_CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_4|NZ_CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|298aa|up_3|NZ_CP011304.1_3598665_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|up_2|NZ_CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|383aa|up_1|NZ_CP011304.1_3607003_3608152_-	NA	NA|499aa|up_0|NZ_CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	cas2|91aa|down_0|NZ_CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_2|NZ_CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_3|NZ_CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_4|NZ_CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_5|NZ_CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|275aa|down_6|NZ_CP011304.1_3618873_3619698_-	NA	cas7|298aa|down_7|NZ_CP011304.1_3619701_3620595_-	NA	cas8b5|844aa|down_8|NZ_CP011304.1_3620597_3623129_-	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|48aa|down_9|NZ_CP011304.1_3623216_3623360_-	NA
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	25	3885605-3885698	24	CRISPRCasFinder	no		cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CACTGATTACTGTTTACTGTTTACTG	26	1	11	3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672	NZ_CP011304.1_1117050-1117009|NZ_CP011304.1_1709798-1709839|NZ_CP011304.1_1826675-1826634|NZ_CP011304.1_2450844-2450803|NZ_CP011304.1_2830001-2830042|NZ_CP011304.1_3428767-3428726|NZ_CP011304.1_3485185-3485226|NZ_CP011304.1_1457253-1457212|NZ_CP011304.1_2879312-2879271|NZ_CP011304.1_3960143-3960102|NZ_CP011304.1_4119955-4119996	NA	1	1	Orphan	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|136aa|up_8|NZ_CP011304.1_3870139_3870547_+,NA|73aa|down_7|NZ_CP011304.1_3895696_3895915_-	NA|414aa|up_9|NZ_CP011304.1_3868717_3869959_-	PRK05388, argJ, bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ	NA|136aa|up_8|NZ_CP011304.1_3870139_3870547_+	NA	NA|550aa|up_7|NZ_CP011304.1_3870559_3872209_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|354aa|up_6|NZ_CP011304.1_3873859_3874921_-	COG0836, {ManC}, Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane]	NA|329aa|up_5|NZ_CP011304.1_3875053_3876040_-	PRK02693, PRK02693, apocytochrome f; Reviewed	NA|180aa|up_4|NZ_CP011304.1_3876083_3876623_-	PRK13474, PRK13474, cytochrome b6-f complex iron-sulfur subunit; Provisional	NA|289aa|up_3|NZ_CP011304.1_3877168_3878035_+	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|270aa|up_2|NZ_CP011304.1_3878096_3878906_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|475aa|up_1|NZ_CP011304.1_3882292_3883717_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|479aa|up_0|NZ_CP011304.1_3883892_3885329_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|366aa|down_0|NZ_CP011304.1_3885744_3886842_+	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|658aa|down_1|NZ_CP011304.1_3887101_3889075_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|501aa|down_2|NZ_CP011304.1_3889384_3890887_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|117aa|down_3|NZ_CP011304.1_3890947_3891298_+	PRK00823, phhB, pterin-4-alpha-carbinolamine dehydratase; Validated	NA|293aa|down_4|NZ_CP011304.1_3891294_3892173_-	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|631aa|down_5|NZ_CP011304.1_3892331_3894224_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|75aa|down_6|NZ_CP011304.1_3894730_3894955_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|73aa|down_7|NZ_CP011304.1_3895696_3895915_-	NA	NA|684aa|down_8|NZ_CP011304.1_3896159_3898211_-	cd06456, M3A_DCP, Peptidase family M3, dipeptidyl carboxypeptidase (DCP)	NA|345aa|down_9|NZ_CP011304.1_3898899_3899934_-	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	26	4227228-4227340	25	CRISPRCasFinder	no	c2c9_V-U4	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type V-U4	AATTTGCGTTATTTCAGCTTCTATTTTC	28	0	0	NA	NA	NA	1	1	TypeV-U4	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|145aa|up_5|NZ_CP011304.1_4218923_4219358_+,NA|334aa|down_1|NZ_CP011304.1_4228281_4229283_+	NA|428aa|up_9|NZ_CP011304.1_4213020_4214304_+	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|391aa|up_8|NZ_CP011304.1_4214311_4215484_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|240aa|up_7|NZ_CP011304.1_4215502_4216222_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|641aa|up_6|NZ_CP011304.1_4216578_4218501_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|145aa|up_5|NZ_CP011304.1_4218923_4219358_+	NA	NA|138aa|up_4|NZ_CP011304.1_4219354_4219768_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|370aa|up_3|NZ_CP011304.1_4219823_4220933_-	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases	NA|211aa|up_2|NZ_CP011304.1_4221400_4222033_+	pfam04313, HSDR_N, Type I restriction enzyme R protein N-terminus (HSDR_N)	NA|377aa|up_1|NZ_CP011304.1_4222276_4223407_+	TIGR02669, stage_II_sporulation_protein_D, SpoIID/LytB domain	NA|1188aa|up_0|NZ_CP011304.1_4223545_4227109_+	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|200aa|down_0|NZ_CP011304.1_4227679_4228279_+	cd06257, DnaJ, DnaJ domain or J-domain	NA|334aa|down_1|NZ_CP011304.1_4228281_4229283_+	NA	NA|353aa|down_2|NZ_CP011304.1_4229515_4230574_+	PRK01966, ddl, D-alanine--D-alanine ligase	NA|248aa|down_3|NZ_CP011304.1_4230631_4231375_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|559aa|down_4|NZ_CP011304.1_4231441_4233118_-	PRK09319, PRK09319, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase RibB/GTP cyclohydrolase II RibA	NA|293aa|down_5|NZ_CP011304.1_4233255_4234134_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|163aa|down_6|NZ_CP011304.1_4234402_4234891_-	cd14770, PC-PEC_alpha, Alpha subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|173aa|down_7|NZ_CP011304.1_4234957_4235476_-	cd14768, PC_PEC_beta, Beta subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|408aa|down_8|NZ_CP011304.1_4236089_4237313_+	cd08021, M20_Acy1_YhaA-like, M20 Peptidase aminoacylase 1 subfamily, includes Bacillus subtilis YhaA and Staphylococcus aureus amidohydrolase, SACOL0085	NA|296aa|down_9|NZ_CP011304.1_4237391_4238279_+	cd04250, AAK_NAGK-C, AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	27	4254834-4254976	26	CRISPRCasFinder	no	c2c9_V-U4,cas14j	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GGTGGGTTACGGCGAATACTCAATTTTTAGTGAGAGTATAACTTATTTTCGCC	53	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|223aa|up_3|NZ_CP011304.1_4250286_4250955_-,NA	NA|356aa|up_9|NZ_CP011304.1_4245341_4246409_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|88aa|up_8|NZ_CP011304.1_4246454_4246718_-	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|141aa|up_7|NZ_CP011304.1_4246714_4247137_-	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)	NA|108aa|up_6|NZ_CP011304.1_4247240_4247564_+	PRK02724, PRK02724, 30S ribosomal protein PSRP-3	NA|139aa|up_5|NZ_CP011304.1_4247784_4248201_-	CHL00063, atpE, ATP synthase CF1 epsilon subunit	NA|483aa|up_4|NZ_CP011304.1_4248275_4249724_-	CHL00060, atpB, ATP synthase CF1 beta subunit	NA|223aa|up_3|NZ_CP011304.1_4250286_4250955_-	NA	NA|542aa|up_2|NZ_CP011304.1_4251267_4252893_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|104aa|up_1|NZ_CP011304.1_4252939_4253251_-	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|143aa|up_0|NZ_CP011304.1_4254107_4254536_-	pfam14159, CAAD, CAAD domains of cyanobacterial aminoacyl-tRNA synthetase	NA|774aa|down_0|NZ_CP011304.1_4255106_4257428_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|417aa|down_1|NZ_CP011304.1_4257753_4259004_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|185aa|down_2|NZ_CP011304.1_4259549_4260104_-	pfam09367, CpeS, CpeS-like protein	NA|328aa|down_3|NZ_CP011304.1_4260131_4261115_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|406aa|down_4|NZ_CP011304.1_4261145_4262363_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|320aa|down_5|NZ_CP011304.1_4262365_4263325_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|307aa|down_6|NZ_CP011304.1_4263339_4264260_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|328aa|down_7|NZ_CP011304.1_4264525_4265509_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|119aa|down_8|NZ_CP011304.1_4265641_4265998_+	pfam02152, FolB, Dihydroneopterin aldolase	NA|181aa|down_9|NZ_CP011304.1_4265984_4266527_-	pfam03358, FMN_red, NADPH-dependent FMN reductase
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	28	4280123-4280204	27	CRISPRCasFinder	no	cas14j,RT	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TTCTCAAAGAACAATCTTTAAAAACCGA	28	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|216aa|up_3|NZ_CP011304.1_4273502_4274150_-,NA|212aa|down_0|NZ_CP011304.1_4280712_4281348_+,NA|184aa|down_8|NZ_CP011304.1_4292314_4292866_-	cas14j|420aa|up_9|NZ_CP011304.1_4267635_4268895_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|219aa|up_8|NZ_CP011304.1_4269494_4270151_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|364aa|up_7|NZ_CP011304.1_4270150_4271242_-	pfam02163, Peptidase_M50, Peptidase family M50	NA|188aa|up_6|NZ_CP011304.1_4271391_4271955_+	cd12130, Apl, Allophycocyanin-like globins	NA|197aa|up_5|NZ_CP011304.1_4272134_4272725_+	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|84aa|up_4|NZ_CP011304.1_4273179_4273431_-	COG0271, BolA, Stress-induced morphogen (activity unknown) [Signal transduction mechanisms]	NA|216aa|up_3|NZ_CP011304.1_4273502_4274150_-	NA	NA|194aa|up_2|NZ_CP011304.1_4274380_4274962_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|145aa|up_1|NZ_CP011304.1_4277989_4278424_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|423aa|up_0|NZ_CP011304.1_4278675_4279944_-	COG2242, CobL, Precorrin-6B methylase 2 [Coenzyme metabolism]	NA|212aa|down_0|NZ_CP011304.1_4280712_4281348_+	NA	NA|472aa|down_1|NZ_CP011304.1_4281532_4282948_+	PRK09567, nirA, NirA family protein	NA|209aa|down_2|NZ_CP011304.1_4282957_4283584_+	PRK08285, cobH, precorrin-8X methylmutase; Reviewed	NA|235aa|down_3|NZ_CP011304.1_4283580_4284285_+	PRK05990, PRK05990, precorrin-2 C(20)-methyltransferase; Reviewed	NA|383aa|down_4|NZ_CP011304.1_4284595_4285744_+	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|119aa|down_5|NZ_CP011304.1_4285810_4286167_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|330aa|down_6|NZ_CP011304.1_4286786_4287776_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	RT|587aa|down_7|NZ_CP011304.1_4288507_4290268_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|184aa|down_8|NZ_CP011304.1_4292314_4292866_-	NA	NA|219aa|down_9|NZ_CP011304.1_4292979_4293636_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional
GCF_000981785.2_ASM98178v2	NZ_CP011304	Microcystis aeruginosa NIES-2549 chromosome, complete genome	29	4285765-4285900	28	CRISPRCasFinder	no	cas14j,RT	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TATGGTATTAGCTAAAGTGGTTTTCGGTGCAGCCCCGACCAC	42	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,c2c9_V-U4,Cas14c_CAS-V-F,RT,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|216aa|up_9|NZ_CP011304.1_4273502_4274150_-,NA|216aa|up_5|NZ_CP011304.1_4280007_4280655_+,NA|212aa|up_4|NZ_CP011304.1_4280712_4281348_+,NA|184aa|down_2|NZ_CP011304.1_4292314_4292866_-	NA|216aa|up_9|NZ_CP011304.1_4273502_4274150_-	NA	NA|194aa|up_8|NZ_CP011304.1_4274380_4274962_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|145aa|up_7|NZ_CP011304.1_4277989_4278424_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|423aa|up_6|NZ_CP011304.1_4278675_4279944_-	COG2242, CobL, Precorrin-6B methylase 2 [Coenzyme metabolism]	NA|216aa|up_5|NZ_CP011304.1_4280007_4280655_+	NA	NA|212aa|up_4|NZ_CP011304.1_4280712_4281348_+	NA	NA|472aa|up_3|NZ_CP011304.1_4281532_4282948_+	PRK09567, nirA, NirA family protein	NA|209aa|up_2|NZ_CP011304.1_4282957_4283584_+	PRK08285, cobH, precorrin-8X methylmutase; Reviewed	NA|235aa|up_1|NZ_CP011304.1_4283580_4284285_+	PRK05990, PRK05990, precorrin-2 C(20)-methyltransferase; Reviewed	NA|383aa|up_0|NZ_CP011304.1_4284595_4285744_+	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|330aa|down_0|NZ_CP011304.1_4286786_4287776_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	RT|587aa|down_1|NZ_CP011304.1_4288507_4290268_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|184aa|down_2|NZ_CP011304.1_4292314_4292866_-	NA	NA|219aa|down_3|NZ_CP011304.1_4292979_4293636_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
