assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	1	142678-142818	1	CRISPRCasFinder	no	cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TTATTTATTCTCCCCTCTCCCTTC	24	1	13	142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794|142777-142794	CP011304.1_1550360-1550377|CP011304.1_15993-15976|CP011304.1_287309-287292|CP011304.1_287323-287306|CP011304.1_675545-675528|CP011304.1_742640-742657|CP011304.1_1079158-1079141|CP011304.1_1282067-1282084|CP011304.1_1307434-1307451|CP011304.1_2065989-2066006|CP011304.1_2714208-2714191|CP011304.1_2731412-2731429|CP011304.1_3173703-3173720	NA	2	2	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|53aa|up_7|CP011304.1_137549_137708_-,NA|89aa|up_6|CP011304.1_137679_137946_+,NA|130aa|up_3|CP011304.1_140152_140542_-,NA|38aa|up_1|CP011304.1_141156_141270_-,NA|101aa|down_5|CP011304.1_146257_146560_+,NA|130aa|down_6|CP011304.1_146560_146950_+,NA|45aa|down_9|CP011304.1_149910_150045_+	NA|114aa|up_9|CP011304.1_132873_133215_-	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|1331aa|up_8|CP011304.1_133468_137461_-	pfam12950, TaqI_C, TaqI-like C-terminal specificity domain	NA|53aa|up_7|CP011304.1_137549_137708_-	NA	NA|89aa|up_6|CP011304.1_137679_137946_+	NA	NA|119aa|up_5|CP011304.1_138146_138503_-	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein	cas14j|405aa|up_4|CP011304.1_138827_140042_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|130aa|up_3|CP011304.1_140152_140542_-	NA	NA|114aa|up_2|CP011304.1_140753_141095_+	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|38aa|up_1|CP011304.1_141156_141270_-	NA	NA|361aa|up_0|CP011304.1_141545_142628_+	smart00960, Robl_LC7, Roadblock/LC7 domain	NA|309aa|down_0|CP011304.1_142853_143780_+	TIGR00005, Ribosomal_large_subunit_pseudouridine_synthase_D, pseudouridine synthase, RluA family	NA|334aa|down_1|CP011304.1_143776_144778_-	PRK06270, PRK06270, homoserine dehydrogenase; Provisional	NA|70aa|down_2|CP011304.1_145094_145304_+	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|129aa|down_3|CP011304.1_145348_145735_+	CHL00193, ycf35, Ycf35; Provisional	NA|136aa|down_4|CP011304.1_145734_146142_+	pfam13370, Fer4_13, 4Fe-4S single cluster domain of Ferredoxin I	NA|101aa|down_5|CP011304.1_146257_146560_+	NA	NA|130aa|down_6|CP011304.1_146560_146950_+	NA	NA|239aa|down_7|CP011304.1_147217_147934_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|543aa|down_8|CP011304.1_147890_149519_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|45aa|down_9|CP011304.1_149910_150045_+	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	2	455953-456066	2	CRISPRCasFinder	no	cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	AATCTCTAATAGGGGTTAAGATTAATGGGAACGCTGTAGGGTT	43	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|78aa|up_9|CP011304.1_447767_448001_+,NA|75aa|up_7|CP011304.1_448700_448925_+,NA|38aa|up_2|CP011304.1_453992_454106_+,NA	NA|78aa|up_9|CP011304.1_447767_448001_+	NA	NA|91aa|up_8|CP011304.1_448447_448720_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|75aa|up_7|CP011304.1_448700_448925_+	NA	NA|771aa|up_6|CP011304.1_448928_451241_+	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|137aa|up_5|CP011304.1_451322_451733_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|395aa|up_4|CP011304.1_451699_452884_-	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]	NA|265aa|up_3|CP011304.1_453070_453865_+	PLN03100, PLN03100, Permease subunit of ER-derived-lipid transporter; Provisional	NA|38aa|up_2|CP011304.1_453992_454106_+	NA	NA|458aa|up_1|CP011304.1_454062_455436_-	TIGR03279, cyano_FeS_chp, putative radical SAM enzyme, TIGR03279 family	NA|111aa|up_0|CP011304.1_455541_455874_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|626aa|down_0|CP011304.1_456184_458062_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|410aa|down_1|CP011304.1_458139_459369_-	sd00006, TPR, Tetratricopeptide repeat	cas14j|405aa|down_2|CP011304.1_459459_460674_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|48aa|down_3|CP011304.1_460815_460959_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|492aa|down_4|CP011304.1_461113_462589_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|376aa|down_5|CP011304.1_463036_464164_+	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|360aa|down_6|CP011304.1_464296_465376_+	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|408aa|down_7|CP011304.1_465582_466806_+	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)	NA|435aa|down_8|CP011304.1_467238_468543_+	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|406aa|down_9|CP011304.1_469116_470334_-	pfam01098, FTSW_RODA_SPOVE, Cell cycle protein
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	3	507958-508044	3	CRISPRCasFinder	no		cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CAGTAAACAGTAATCGGTGCAAAGACAG	28	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|52aa|up_8|CP011304.1_499935_500091_+,NA|43aa|up_5|CP011304.1_502393_502522_-,NA	NA|337aa|up_9|CP011304.1_497770_498781_-	pfam13651, EcoRI_methylase, Adenine-specific methyltransferase EcoRI	NA|52aa|up_8|CP011304.1_499935_500091_+	NA	NA|289aa|up_7|CP011304.1_500100_500967_-	COG4577, CcmK, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|360aa|up_6|CP011304.1_501114_502194_+	pfam18578, Raf1_N, Rubisco accumulation factor 1 alpha helical domain	NA|43aa|up_5|CP011304.1_502393_502522_-	NA	NA|321aa|up_4|CP011304.1_502542_503505_+	TIGR03609, S_layer_CsaB, polysaccharide pyruvyl transferase CsaB	NA|208aa|up_3|CP011304.1_503510_504134_+	pfam04313, HSDR_N, Type I restriction enzyme R protein N-terminus (HSDR_N)	NA|97aa|up_2|CP011304.1_504194_504485_-	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|289aa|up_1|CP011304.1_504481_505348_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|734aa|up_0|CP011304.1_505643_507845_-	COG3211, PhoX, Predicted phosphatase [General function prediction only]	NA|108aa|down_0|CP011304.1_508150_508474_+	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|229aa|down_1|CP011304.1_508511_509198_-	PRK01130, PRK01130, putative N-acetylmannosamine-6-phosphate 2-epimerase	NA|170aa|down_2|CP011304.1_509314_509824_-	pfam00719, Pyrophosphatase, Inorganic pyrophosphatase	NA|393aa|down_3|CP011304.1_511053_512232_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|662aa|down_4|CP011304.1_512403_514389_-	COG0557, VacB, Exoribonuclease R [Transcription]	NA|680aa|down_5|CP011304.1_514496_516536_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|178aa|down_6|CP011304.1_516676_517210_+	pfam14221, DUF4330, Domain of unknown function (DUF4330)	NA|189aa|down_7|CP011304.1_517376_517943_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|456aa|down_8|CP011304.1_518186_519554_-	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)	NA|140aa|down_9|CP011304.1_519654_520074_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	4	671329-671419	4	CRISPRCasFinder	no		cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	GCTAAAAAGTGCTTCAACGCAAATC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|38aa|up_6|CP011304.1_666230_666344_-,NA|57aa|up_5|CP011304.1_666415_666586_-,NA|84aa|up_1|CP011304.1_669459_669711_-,NA|39aa|down_1|CP011304.1_672474_672591_+,NA|58aa|down_2|CP011304.1_672604_672778_+	NA|119aa|up_9|CP011304.1_663290_663647_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|222aa|up_8|CP011304.1_663652_664318_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|332aa|up_7|CP011304.1_664358_665354_-	CHL00180, rbcR, LysR transcriptional regulator; Provisional	NA|38aa|up_6|CP011304.1_666230_666344_-	NA	NA|57aa|up_5|CP011304.1_666415_666586_-	NA	NA|202aa|up_4|CP011304.1_666791_667397_-	COG2179, COG2179, Predicted hydrolase of the HAD superfamily [General function prediction only]	NA|210aa|up_3|CP011304.1_667462_668092_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|388aa|up_2|CP011304.1_668205_669369_+	PLN02449, PLN02449, ferrochelatase	NA|84aa|up_1|CP011304.1_669459_669711_-	NA	NA|473aa|up_0|CP011304.1_669861_671280_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|331aa|down_0|CP011304.1_671498_672491_-	PRK00578, prfB, peptide chain release factor 2; Validated	NA|39aa|down_1|CP011304.1_672474_672591_+	NA	NA|58aa|down_2|CP011304.1_672604_672778_+	NA	NA|379aa|down_3|CP011304.1_672871_674008_-	pfam12565, DUF3747, Protein of unknown function (DUF3747)	NA|450aa|down_4|CP011304.1_674168_675518_+	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|876aa|down_5|CP011304.1_675682_678310_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|61aa|down_6|CP011304.1_678478_678661_-	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|502aa|down_7|CP011304.1_678833_680339_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|421aa|down_8|CP011304.1_680373_681636_-	PLN02855, PLN02855, Bifunctional selenocysteine lyase/cysteine desulfurase	NA|501aa|down_9|CP011304.1_681691_683194_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	6	1045396-1045495	6	CRISPRCasFinder	no	c2c9_V-U4	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type V-U4	TTGTCAAAAGACAAGCTGTCAAACTTG	27	0	0	NA	NA	NA	1	1	TypeV-U4	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|40aa|up_9|CP011304.1_1037881_1038001_-,NA|63aa|up_8|CP011304.1_1038275_1038464_-,NA|297aa|up_7|CP011304.1_1038621_1039512_-,NA|98aa|up_2|CP011304.1_1042451_1042745_-,NA|111aa|down_7|CP011304.1_1053941_1054274_+	NA|40aa|up_9|CP011304.1_1037881_1038001_-	NA	NA|63aa|up_8|CP011304.1_1038275_1038464_-	NA	NA|297aa|up_7|CP011304.1_1038621_1039512_-	NA	NA|361aa|up_6|CP011304.1_1039689_1040772_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|171aa|up_5|CP011304.1_1041040_1041553_+	pfam13551, HTH_29, Winged helix-turn helix	NA|142aa|up_4|CP011304.1_1041684_1042110_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|88aa|up_3|CP011304.1_1042198_1042462_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|98aa|up_2|CP011304.1_1042451_1042745_-	NA	NA|560aa|up_1|CP011304.1_1043217_1044897_-	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|89aa|up_0|CP011304.1_1045013_1045280_-	PRK13697, PRK13697, cytochrome c6; Provisional	NA|126aa|down_0|CP011304.1_1045526_1045904_+	PRK02710, PRK02710, plastocyanin; Provisional	NA|576aa|down_1|CP011304.1_1046346_1048074_+	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|243aa|down_2|CP011304.1_1048267_1048996_-	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|1251aa|down_3|CP011304.1_1049162_1052915_+	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|68aa|down_4|CP011304.1_1053279_1053483_+	pfam18506, RelB_N, RelB Antitoxin alpha helical domain	NA|90aa|down_5|CP011304.1_1053479_1053749_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|46aa|down_6|CP011304.1_1053802_1053940_+	pfam17874, TPR_MalT, MalT-like TPR region	NA|111aa|down_7|CP011304.1_1053941_1054274_+	NA	NA|81aa|down_8|CP011304.1_1054284_1054527_+	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|112aa|down_9|CP011304.1_1054684_1055020_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	8	1232112-1232527	1,8,1	PILER-CR,CRISPRCasFinder,CRT	no		cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	GTTTCCAACTAATCCTATTTGACCTAATAGGTAAGG,GTTTCCAACTAATCCTATTTGACCTAATAGGTAAGG,GTTTCCAACTAATCCTATTTGACCTAATAGGTAAGG	36,36,36	0	0	NA	NA	NA:NA:NA	5,5,5	5	Orphan	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|204aa|up_3|CP011304.1_1228762_1229374_-,NA|72aa|down_0|CP011304.1_1233712_1233928_+,NA|63aa|down_3|CP011304.1_1236121_1236310_-,NA|307aa|down_6|CP011304.1_1238574_1239495_-	NA|277aa|up_9|CP011304.1_1223152_1223983_+	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|313aa|up_8|CP011304.1_1224026_1224965_+	smart00271, DnaJ, DnaJ molecular chaperone homology domain	NA|406aa|up_7|CP011304.1_1225003_1226221_+	PRK12292, hisZ, ATP phosphoribosyltransferase regulatory subunit; Provisional	NA|171aa|up_6|CP011304.1_1226306_1226819_+	pfam09626, DHC, Dihaem cytochrome c	NA|284aa|up_5|CP011304.1_1227161_1228013_-	COG1426, COG1426, Predicted transcriptional regulator contains Xre-like HTH domain [Function unknown]	NA|168aa|up_4|CP011304.1_1228257_1228761_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|204aa|up_3|CP011304.1_1228762_1229374_-	NA	NA|222aa|up_2|CP011304.1_1229508_1230174_+	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]	NA|317aa|up_1|CP011304.1_1230401_1231352_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|197aa|up_0|CP011304.1_1231357_1231948_+	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|72aa|down_0|CP011304.1_1233712_1233928_+	NA	NA|488aa|down_1|CP011304.1_1234129_1235593_+	cd00880, Era_like, E	NA|99aa|down_2|CP011304.1_1235840_1236137_-	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|63aa|down_3|CP011304.1_1236121_1236310_-	NA	NA|266aa|down_4|CP011304.1_1236559_1237357_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|248aa|down_5|CP011304.1_1237744_1238488_+	pfam12973, Cupin_7, ChrR Cupin-like domain	NA|307aa|down_6|CP011304.1_1238574_1239495_-	NA	NA|268aa|down_7|CP011304.1_1239571_1240375_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|446aa|down_8|CP011304.1_1240538_1241876_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|397aa|down_9|CP011304.1_1242039_1243230_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	9	1504223-1504342	9	CRISPRCasFinder	no		cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CTGATTCGGAGCATCTTTCAATTTGACAACGCC	33	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|44aa|up_5|CP011304.1_1498255_1498387_+,NA|297aa|up_4|CP011304.1_1498459_1499350_-,NA|159aa|up_3|CP011304.1_1499360_1499837_-,NA|62aa|up_2|CP011304.1_1499927_1500113_+,NA|561aa|up_0|CP011304.1_1502308_1503991_+,NA|51aa|down_2|CP011304.1_1506550_1506703_-,NA|237aa|down_5|CP011304.1_1508799_1509510_+	NA|317aa|up_9|CP011304.1_1495202_1496153_-	TIGR03965, glycosyltransferase_Rv0696_family, mycofactocin system glycosyltransferase	NA|206aa|up_8|CP011304.1_1496257_1496875_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|214aa|up_7|CP011304.1_1496869_1497511_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|131aa|up_6|CP011304.1_1497696_1498089_-	pfam08854, DUF1824, Domain of unknown function (DUF1824)	NA|44aa|up_5|CP011304.1_1498255_1498387_+	NA	NA|297aa|up_4|CP011304.1_1498459_1499350_-	NA	NA|159aa|up_3|CP011304.1_1499360_1499837_-	NA	NA|62aa|up_2|CP011304.1_1499927_1500113_+	NA	NA|663aa|up_1|CP011304.1_1500120_1502109_-	TIGR02442, Uncharacterized_protein_Rv2850c/MT2916, cobaltochelatase subunit	NA|561aa|up_0|CP011304.1_1502308_1503991_+	NA	NA|428aa|down_0|CP011304.1_1504348_1505632_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|178aa|down_1|CP011304.1_1505903_1506437_+	TIGR01710, Type_II_secretion_system_protein_G, type II secretion system protein G	NA|51aa|down_2|CP011304.1_1506550_1506703_-	NA	NA|178aa|down_3|CP011304.1_1507212_1507746_+	TIGR01710, Type_II_secretion_system_protein_G, type II secretion system protein G	NA|218aa|down_4|CP011304.1_1507940_1508594_+	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|237aa|down_5|CP011304.1_1508799_1509510_+	NA	NA|104aa|down_6|CP011304.1_1509658_1509970_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|312aa|down_7|CP011304.1_1510567_1511503_+	PLN00016, PLN00016, RNA-binding protein; Provisional	NA|71aa|down_8|CP011304.1_1511798_1512011_-	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|412aa|down_9|CP011304.1_1512164_1513400_+	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	11	1867756-1867854	11	CRISPRCasFinder	no	cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TCTTTACTAGGGGATAAGTTTGTACTTGTTAAG	33	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|53aa|up_5|CP011304.1_1864911_1865070_+,NA|128aa|up_4|CP011304.1_1865153_1865537_+,NA|41aa|up_0|CP011304.1_1867469_1867592_-,NA|107aa|down_1|CP011304.1_1868425_1868746_-,NA|177aa|down_3|CP011304.1_1870058_1870589_+,NA|108aa|down_6|CP011304.1_1876290_1876614_-	NA|328aa|up_9|CP011304.1_1860379_1861363_+	PRK05363, PRK05363, protein-methionine-sulfoxide reductase catalytic subunit MsrP	NA|153aa|up_8|CP011304.1_1861406_1861865_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|155aa|up_7|CP011304.1_1861881_1862346_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|239aa|up_6|CP011304.1_1862715_1863432_+	pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain	NA|53aa|up_5|CP011304.1_1864911_1865070_+	NA	NA|128aa|up_4|CP011304.1_1865153_1865537_+	NA	NA|98aa|up_3|CP011304.1_1865605_1865899_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|134aa|up_2|CP011304.1_1865899_1866301_+	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|71aa|up_1|CP011304.1_1867138_1867351_-	PRK05958, PRK05958, 8-amino-7-oxononanoate synthase; Reviewed	NA|41aa|up_0|CP011304.1_1867469_1867592_-	NA	NA|186aa|down_0|CP011304.1_1867914_1868472_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|107aa|down_1|CP011304.1_1868425_1868746_-	NA	NA|321aa|down_2|CP011304.1_1868787_1869750_+	cd01195, INT_C_like_5, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|177aa|down_3|CP011304.1_1870058_1870589_+	NA	NA|1007aa|down_4|CP011304.1_1870804_1873825_-	COG4928, COG4928, Predicted P-loop ATPase [General function prediction only]	NA|745aa|down_5|CP011304.1_1873984_1876219_-	TIGR03296, hypothetical_protein, M6 family metalloprotease domain	NA|108aa|down_6|CP011304.1_1876290_1876614_-	NA	NA|510aa|down_7|CP011304.1_1876994_1878524_-	cd11352, AmyAc_5, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|330aa|down_8|CP011304.1_1878611_1879601_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|106aa|down_9|CP011304.1_1879670_1879988_-	cd11352, AmyAc_5, Alpha amylase catalytic domain found in an uncharacterized protein family
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	12	1882171-1882337	12	CRISPRCasFinder	no	cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	CTCTCTACTCGCCCTTAGAAATC	23	0	0	NA	NA	NA	2	2	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|177aa|up_9|CP011304.1_1870058_1870589_+,NA|108aa|up_6|CP011304.1_1876290_1876614_-,NA|49aa|up_2|CP011304.1_1880280_1880427_+,NA|126aa|down_6|CP011304.1_1887257_1887635_-,NA|75aa|down_7|CP011304.1_1887649_1887874_-,NA|46aa|down_8|CP011304.1_1888066_1888204_+,NA|47aa|down_9|CP011304.1_1888160_1888301_-	NA|177aa|up_9|CP011304.1_1870058_1870589_+	NA	NA|1007aa|up_8|CP011304.1_1870804_1873825_-	COG4928, COG4928, Predicted P-loop ATPase [General function prediction only]	NA|745aa|up_7|CP011304.1_1873984_1876219_-	TIGR03296, hypothetical_protein, M6 family metalloprotease domain	NA|108aa|up_6|CP011304.1_1876290_1876614_-	NA	NA|510aa|up_5|CP011304.1_1876994_1878524_-	cd11352, AmyAc_5, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|330aa|up_4|CP011304.1_1878611_1879601_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|106aa|up_3|CP011304.1_1879670_1879988_-	cd11352, AmyAc_5, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|49aa|up_2|CP011304.1_1880280_1880427_+	NA	NA|81aa|up_1|CP011304.1_1880872_1881115_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|146aa|up_0|CP011304.1_1881119_1881557_+	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	cas14j|374aa|down_0|CP011304.1_1882483_1883605_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|181aa|down_1|CP011304.1_1883636_1884179_+	pfam04832, SOUL, SOUL heme-binding protein	NA|134aa|down_2|CP011304.1_1884179_1884581_+	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|40aa|down_3|CP011304.1_1885117_1885237_-	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	NA|81aa|down_4|CP011304.1_1885332_1885575_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|579aa|down_5|CP011304.1_1885528_1887265_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|126aa|down_6|CP011304.1_1887257_1887635_-	NA	NA|75aa|down_7|CP011304.1_1887649_1887874_-	NA	NA|46aa|down_8|CP011304.1_1888066_1888204_+	NA	NA|47aa|down_9|CP011304.1_1888160_1888301_-	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	14	2271790-2271886	14	CRISPRCasFinder	no	cas14k	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TCAGTAAACAGTAAACAGTAATCAG	25	1	1	2271815-2271861	CP011304.1_313615-313569	NA	1	1	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|44aa|up_8|CP011304.1_2254154_2254286_+,NA|38aa|up_4|CP011304.1_2259853_2259967_+,NA	NA|641aa|up_9|CP011304.1_2251987_2253910_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|44aa|up_8|CP011304.1_2254154_2254286_+	NA	NA|404aa|up_7|CP011304.1_2254434_2255646_-	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|281aa|up_6|CP011304.1_2255854_2256697_+	PLN02244, PLN02244, tocopherol O-methyltransferase	NA|618aa|up_5|CP011304.1_2257549_2259403_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|38aa|up_4|CP011304.1_2259853_2259967_+	NA	NA|1155aa|up_3|CP011304.1_2260012_2263477_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|729aa|up_2|CP011304.1_2263718_2265905_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|1535aa|up_1|CP011304.1_2265974_2270579_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|286aa|up_0|CP011304.1_2270879_2271737_-	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	NA|288aa|down_0|CP011304.1_2273091_2273955_-	TIGR02069, cyanophycinase, cyanophycinase	NA|160aa|down_1|CP011304.1_2274314_2274794_-	pfam05532, CsbD, CsbD-like	NA|61aa|down_2|CP011304.1_2275083_2275266_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|224aa|down_3|CP011304.1_2275437_2276109_-	COG3000, ERG3, Sterol desaturase [Lipid metabolism]	NA|770aa|down_4|CP011304.1_2276215_2278525_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|523aa|down_5|CP011304.1_2278554_2280123_-	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|43aa|down_6|CP011304.1_2281486_2281615_-	pfam14239, RRXRR, RRXRR protein	cas14k|413aa|down_7|CP011304.1_2281909_2283148_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|206aa|down_8|CP011304.1_2283119_2283737_-	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain	NA|283aa|down_9|CP011304.1_2283813_2284662_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	15	2769449-2769687	2	PILER-CR	no	cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type III-C,Type III-A,Type III-D,Type III-B	AGAAATTAATTGACTGGAAACA	22	0	0	NA	NA	NA	3	3	TypeV,TypeIII-B,TypeIII-C,TypeIII-A,TypeIII-D	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|54aa|up_9|CP011304.1_2757694_2757856_+,NA|103aa|up_8|CP011304.1_2757880_2758189_-,NA|151aa|up_6|CP011304.1_2759750_2760203_+,cmr5gr11|119aa|up_3|CP011304.1_2762772_2763129_-,NA|111aa|down_3|CP011304.1_2773104_2773437_+,NA|115aa|down_7|CP011304.1_2776769_2777114_-	NA|54aa|up_9|CP011304.1_2757694_2757856_+	NA	NA|103aa|up_8|CP011304.1_2757880_2758189_-	NA	NA|281aa|up_7|CP011304.1_2758674_2759517_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|151aa|up_6|CP011304.1_2759750_2760203_+	NA	NA|91aa|up_5|CP011304.1_2760395_2760668_+	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	cmr6gr7|648aa|up_4|CP011304.1_2760765_2762709_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|119aa|up_3|CP011304.1_2762772_2763129_-	NA	cmr4gr7|260aa|up_2|CP011304.1_2763494_2764274_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|376aa|up_1|CP011304.1_2764708_2765836_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|1005aa|up_0|CP011304.1_2765808_2768823_-	pfam12469, DUF3692, CRISPR-associated protein	NA|206aa|down_0|CP011304.1_2769721_2770339_-	TIGR02595, conserved_hypothetical_protein, PEP-CTERM protein-sorting domain	NA|102aa|down_1|CP011304.1_2771005_2771311_-	pfam01797, Y1_Tnp, Transposase IS200 like	cas14j|420aa|down_2|CP011304.1_2771487_2772747_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|111aa|down_3|CP011304.1_2773104_2773437_+	NA	NA|142aa|down_4|CP011304.1_2773609_2774035_-	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|326aa|down_5|CP011304.1_2774247_2775225_+	pfam00891, Methyltransf_2, O-methyltransferase	NA|478aa|down_6|CP011304.1_2775267_2776701_-	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|115aa|down_7|CP011304.1_2776769_2777114_-	NA	NA|278aa|down_8|CP011304.1_2777487_2778321_+	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|367aa|down_9|CP011304.1_2778988_2780089_-	COG4748, COG4748, Uncharacterized conserved protein [Function unknown]
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	16	2929111-2929207	15	CRISPRCasFinder	no	RT	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TATCTATAGAACTAGAAAAGTTTACCAA	28	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|54aa|up_9|CP011304.1_2922168_2922330_-,NA|47aa|up_8|CP011304.1_2922613_2922754_-,NA|38aa|up_6|CP011304.1_2923211_2923325_+,NA|105aa|up_5|CP011304.1_2923475_2923790_-,NA|168aa|up_4|CP011304.1_2924080_2924584_+,NA|394aa|up_3|CP011304.1_2925103_2926285_+,NA|200aa|up_2|CP011304.1_2926333_2926933_+,NA|47aa|down_1|CP011304.1_2931069_2931210_-	NA|54aa|up_9|CP011304.1_2922168_2922330_-	NA	NA|47aa|up_8|CP011304.1_2922613_2922754_-	NA	NA|132aa|up_7|CP011304.1_2922797_2923193_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|38aa|up_6|CP011304.1_2923211_2923325_+	NA	NA|105aa|up_5|CP011304.1_2923475_2923790_-	NA	NA|168aa|up_4|CP011304.1_2924080_2924584_+	NA	NA|394aa|up_3|CP011304.1_2925103_2926285_+	NA	NA|200aa|up_2|CP011304.1_2926333_2926933_+	NA	NA|397aa|up_1|CP011304.1_2926954_2928145_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|294aa|up_0|CP011304.1_2928188_2929070_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|363aa|down_0|CP011304.1_2929984_2931073_-	pfam00180, Iso_dh, Isocitrate/isopropylmalate dehydrogenase	NA|47aa|down_1|CP011304.1_2931069_2931210_-	NA	NA|79aa|down_2|CP011304.1_2931513_2931750_-	pfam01106, NifU, NifU-like domain	NA|216aa|down_3|CP011304.1_2931823_2932471_-	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|79aa|down_4|CP011304.1_2932811_2933048_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|622aa|down_5|CP011304.1_2933316_2935182_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|79aa|down_6|CP011304.1_2935409_2935646_+	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	RT|353aa|down_7|CP011304.1_2935642_2936701_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|92aa|down_8|CP011304.1_2937303_2937579_-	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|171aa|down_9|CP011304.1_2937694_2938207_-	pfam01724, DUF29, Domain of unknown function DUF29
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	18	3306602-3306697	17	CRISPRCasFinder	no	cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	ATCAAGGGGGGATCAAGGGGGGATC	25	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|44aa|up_6|CP011304.1_3300818_3300950_+,NA|47aa|up_5|CP011304.1_3301377_3301518_-,NA|249aa|up_1|CP011304.1_3304015_3304762_+,NA|89aa|down_1|CP011304.1_3308389_3308656_+,NA|38aa|down_3|CP011304.1_3311264_3311378_+,NA|38aa|down_8|CP011304.1_3316086_3316200_+,NA|155aa|down_9|CP011304.1_3316308_3316773_-	NA|156aa|up_9|CP011304.1_3298531_3298999_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|92aa|up_8|CP011304.1_3299169_3299445_-	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|435aa|up_7|CP011304.1_3299551_3300856_-	PRK07591, PRK07591, threonine synthase; Validated	NA|44aa|up_6|CP011304.1_3300818_3300950_+	NA	NA|47aa|up_5|CP011304.1_3301377_3301518_-	NA	NA|182aa|up_4|CP011304.1_3301632_3302178_-	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]	NA|226aa|up_3|CP011304.1_3302181_3302859_-	COG1573, COG1573, Uracil-DNA glycosylase [DNA replication, recombination, and repair]	NA|319aa|up_2|CP011304.1_3303002_3303959_-	PRK00089, era, GTPase Era; Reviewed	NA|249aa|up_1|CP011304.1_3304015_3304762_+	NA	NA|427aa|up_0|CP011304.1_3305132_3306413_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|560aa|down_0|CP011304.1_3306756_3308436_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|89aa|down_1|CP011304.1_3308389_3308656_+	NA	NA|671aa|down_2|CP011304.1_3309221_3311234_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|38aa|down_3|CP011304.1_3311264_3311378_+	NA	NA|504aa|down_4|CP011304.1_3311530_3313042_+	PRK09224, PRK09224, threonine ammonia-lyase IlvA	NA|291aa|down_5|CP011304.1_3313072_3313945_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|389aa|down_6|CP011304.1_3314085_3315252_-	PRK05957, PRK05957, pyridoxal phosphate-dependent aminotransferase	NA|144aa|down_7|CP011304.1_3315251_3315683_-	pfam00498, FHA, FHA domain	NA|38aa|down_8|CP011304.1_3316086_3316200_+	NA	NA|155aa|down_9|CP011304.1_3316308_3316773_-	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	21	3599679-3599861	20	CRISPRCasFinder	no	cas2,cas1,cas4,cas6,cas3,cas5	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37	0	0	NA	NA	I-D,II-B	2	2	Unclear	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|178aa|up_2|CP011304.1_3598093_3598627_+,NA|53aa|down_1|CP011304.1_3603598_3603757_+,NA|383aa|down_2|CP011304.1_3607003_3608152_-,NA|45aa|down_4|CP011304.1_3611751_3611886_+	NA|81aa|up_9|CP011304.1_3589769_3590012_+	COG1327, COG1327, Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]	NA|272aa|up_8|CP011304.1_3590019_3590835_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|227aa|up_7|CP011304.1_3591107_3591788_+	COG5401, COG5401, Spore germination protein [General function prediction only]	NA|354aa|up_6|CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_5|CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_4|CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_3|CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|178aa|up_2|CP011304.1_3598093_3598627_+	NA	NA|148aa|up_1|CP011304.1_3598665_3599109_+	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|165aa|up_0|CP011304.1_3599063_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|down_0|CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|53aa|down_1|CP011304.1_3603598_3603757_+	NA	NA|383aa|down_2|CP011304.1_3607003_3608152_-	NA	NA|499aa|down_3|CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|45aa|down_4|CP011304.1_3611751_3611886_+	NA	cas2|91aa|down_5|CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_6|CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_7|CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_8|CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_9|CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	22	3601305-3606945	21,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37	1	1	3603206-3603240	CP026286.1_5265-5231	I-D,II-B:I-D,II-B:I-D,II-B	78,78,26	78	Unclear	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|178aa|up_3|CP011304.1_3598093_3598627_+,NA|383aa|down_0|CP011304.1_3607003_3608152_-,NA|45aa|down_2|CP011304.1_3611751_3611886_+,cas5|230aa|down_9|CP011304.1_3618873_3619563_-	NA|272aa|up_9|CP011304.1_3590019_3590835_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|227aa|up_8|CP011304.1_3591107_3591788_+	COG5401, COG5401, Spore germination protein [General function prediction only]	NA|354aa|up_7|CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_6|CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_5|CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_4|CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|178aa|up_3|CP011304.1_3598093_3598627_+	NA	NA|148aa|up_2|CP011304.1_3598665_3599109_+	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|165aa|up_1|CP011304.1_3599063_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|up_0|CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|383aa|down_0|CP011304.1_3607003_3608152_-	NA	NA|499aa|down_1|CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|45aa|down_2|CP011304.1_3611751_3611886_+	NA	cas2|91aa|down_3|CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_4|CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_5|CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_6|CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_7|CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_8|CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|230aa|down_9|CP011304.1_3618873_3619563_-	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	23	3608265-3608953	4,22,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	9,9,9	9	Unclear	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|178aa|up_5|CP011304.1_3598093_3598627_+,NA|53aa|up_1|CP011304.1_3603598_3603757_+,NA|383aa|up_0|CP011304.1_3607003_3608152_-,NA|45aa|down_1|CP011304.1_3611751_3611886_+,cas5|230aa|down_8|CP011304.1_3618873_3619563_-,cas7|298aa|down_9|CP011304.1_3619701_3620595_-	NA|354aa|up_9|CP011304.1_3591832_3592894_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|161aa|up_8|CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_7|CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_6|CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|178aa|up_5|CP011304.1_3598093_3598627_+	NA	NA|148aa|up_4|CP011304.1_3598665_3599109_+	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|165aa|up_3|CP011304.1_3599063_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|up_2|CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|53aa|up_1|CP011304.1_3603598_3603757_+	NA	NA|383aa|up_0|CP011304.1_3607003_3608152_-	NA	NA|499aa|down_0|CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|45aa|down_1|CP011304.1_3611751_3611886_+	NA	cas2|91aa|down_2|CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_3|CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_4|CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_5|CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_6|CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_7|CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|230aa|down_8|CP011304.1_3618873_3619563_-	NA	cas7|298aa|down_9|CP011304.1_3619701_3620595_-	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	24	3610677-3612581	5,23,4	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL,c2c9_V-U4,cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	26,26,26	26	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|178aa|up_6|CP011304.1_3598093_3598627_+,NA|53aa|up_2|CP011304.1_3603598_3603757_+,NA|383aa|up_1|CP011304.1_3607003_3608152_-,cas5|230aa|down_6|CP011304.1_3618873_3619563_-,cas7|298aa|down_7|CP011304.1_3619701_3620595_-,NA|48aa|down_9|CP011304.1_3623216_3623360_-	NA|161aa|up_9|CP011304.1_3592922_3593405_+	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|718aa|up_8|CP011304.1_3593744_3595898_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|722aa|up_7|CP011304.1_3595915_3598081_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|178aa|up_6|CP011304.1_3598093_3598627_+	NA	NA|148aa|up_5|CP011304.1_3598665_3599109_+	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|165aa|up_4|CP011304.1_3599063_3599558_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|409aa|up_3|CP011304.1_3599881_3601108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|53aa|up_2|CP011304.1_3603598_3603757_+	NA	NA|383aa|up_1|CP011304.1_3607003_3608152_-	NA	NA|499aa|up_0|CP011304.1_3609111_3610608_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	cas2|91aa|down_0|CP011304.1_3612814_3613087_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|CP011304.1_3613099_3614104_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_2|CP011304.1_3614109_3614703_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|278aa|down_3|CP011304.1_3614705_3615539_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_4|CP011304.1_3615538_3616105_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas3|912aa|down_5|CP011304.1_3616145_3618881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|230aa|down_6|CP011304.1_3618873_3619563_-	NA	cas7|298aa|down_7|CP011304.1_3619701_3620595_-	NA	cas8b5|844aa|down_8|CP011304.1_3620597_3623129_-	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|48aa|down_9|CP011304.1_3623216_3623360_-	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	25	3885605-3885698	24	CRISPRCasFinder	no		cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Orphan	CACTGATTACTGTTTACTGTTTACTG	26	1	11	3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672|3885631-3885672	CP011304.1_1117050-1117009|CP011304.1_1709798-1709839|CP011304.1_1826675-1826634|CP011304.1_2450844-2450803|CP011304.1_2830001-2830042|CP011304.1_3428767-3428726|CP011304.1_3485185-3485226|CP011304.1_1457253-1457212|CP011304.1_2879312-2879271|CP011304.1_3960143-3960102|CP011304.1_4119955-4119996	NA	1	1	Orphan	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|81aa|up_6|CP011304.1_3876754_3876997_+,NA|73aa|down_7|CP011304.1_3895696_3895915_-,NA|70aa|down_9|CP011304.1_3898327_3898537_-	NA|354aa|up_9|CP011304.1_3873859_3874921_-	COG0836, {ManC}, Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane]	NA|312aa|up_8|CP011304.1_3875053_3875989_-	PRK02693, PRK02693, apocytochrome f; Reviewed	NA|180aa|up_7|CP011304.1_3876083_3876623_-	PRK13474, PRK13474, cytochrome b6-f complex iron-sulfur subunit; Provisional	NA|81aa|up_6|CP011304.1_3876754_3876997_+	NA	NA|308aa|up_5|CP011304.1_3877111_3878035_+	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|270aa|up_4|CP011304.1_3878096_3878906_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|59aa|up_3|CP011304.1_3879571_3879748_-	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|70aa|up_2|CP011304.1_3881601_3881811_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|465aa|up_1|CP011304.1_3882292_3883687_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|479aa|up_0|CP011304.1_3883892_3885329_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|366aa|down_0|CP011304.1_3885744_3886842_+	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|658aa|down_1|CP011304.1_3887101_3889075_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|501aa|down_2|CP011304.1_3889384_3890887_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|117aa|down_3|CP011304.1_3890947_3891298_+	PRK00823, phhB, pterin-4-alpha-carbinolamine dehydratase; Validated	NA|293aa|down_4|CP011304.1_3891294_3892173_-	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|631aa|down_5|CP011304.1_3892331_3894224_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|75aa|down_6|CP011304.1_3894730_3894955_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|73aa|down_7|CP011304.1_3895696_3895915_-	NA	NA|684aa|down_8|CP011304.1_3896159_3898211_-	cd06456, M3A_DCP, Peptidase family M3, dipeptidyl carboxypeptidase (DCP)	NA|70aa|down_9|CP011304.1_3898327_3898537_-	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	26	4227228-4227340	25	CRISPRCasFinder	no	c2c9_V-U4	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Type V-U4	AATTTGCGTTATTTCAGCTTCTATTTTC	28	0	0	NA	NA	NA	1	1	TypeV-U4	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|38aa|up_7|CP011304.1_4216468_4216582_+,NA|145aa|up_5|CP011304.1_4218923_4219358_+,NA|334aa|down_1|CP011304.1_4228281_4229283_+,NA|43aa|down_2|CP011304.1_4229319_4229448_-,NA|53aa|down_9|CP011304.1_4235804_4235963_+	NA|391aa|up_9|CP011304.1_4214311_4215484_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|240aa|up_8|CP011304.1_4215502_4216222_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|38aa|up_7|CP011304.1_4216468_4216582_+	NA	NA|641aa|up_6|CP011304.1_4216578_4218501_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|145aa|up_5|CP011304.1_4218923_4219358_+	NA	NA|138aa|up_4|CP011304.1_4219354_4219768_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|370aa|up_3|CP011304.1_4219823_4220933_-	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases	NA|218aa|up_2|CP011304.1_4221379_4222033_+	pfam04313, HSDR_N, Type I restriction enzyme R protein N-terminus (HSDR_N)	NA|377aa|up_1|CP011304.1_4222276_4223407_+	TIGR02669, stage_II_sporulation_protein_D, SpoIID/LytB domain	NA|1188aa|up_0|CP011304.1_4223545_4227109_+	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|200aa|down_0|CP011304.1_4227679_4228279_+	cd06257, DnaJ, DnaJ domain or J-domain	NA|334aa|down_1|CP011304.1_4228281_4229283_+	NA	NA|43aa|down_2|CP011304.1_4229319_4229448_-	NA	NA|345aa|down_3|CP011304.1_4229539_4230574_+	PRK01966, ddl, D-alanine--D-alanine ligase	NA|248aa|down_4|CP011304.1_4230631_4231375_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|559aa|down_5|CP011304.1_4231441_4233118_-	PRK09319, PRK09319, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase RibB/GTP cyclohydrolase II RibA	NA|293aa|down_6|CP011304.1_4233255_4234134_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|163aa|down_7|CP011304.1_4234402_4234891_-	cd14770, PC-PEC_alpha, Alpha subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|173aa|down_8|CP011304.1_4234957_4235476_-	cd14768, PC_PEC_beta, Beta subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components	NA|53aa|down_9|CP011304.1_4235804_4235963_+	NA
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	27	4254834-4254976	26	CRISPRCasFinder	no	c2c9_V-U4,cas14j	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	GGTGGGTTACGGCGAATACTCAATTTTTAGTGAGAGTATAACTTATTTTCGCC	53	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|218aa|up_5|CP011304.1_4250286_4250940_-,NA|46aa|up_2|CP011304.1_4253387_4253525_-,NA|39aa|up_0|CP011304.1_4254625_4254742_+,NA	NA|141aa|up_9|CP011304.1_4246714_4247137_-	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)	NA|108aa|up_8|CP011304.1_4247240_4247564_+	PRK02724, PRK02724, 30S ribosomal protein PSRP-3	NA|139aa|up_7|CP011304.1_4247784_4248201_-	CHL00063, atpE, ATP synthase CF1 epsilon subunit	NA|483aa|up_6|CP011304.1_4248275_4249724_-	CHL00060, atpB, ATP synthase CF1 beta subunit	NA|218aa|up_5|CP011304.1_4250286_4250940_-	NA	NA|542aa|up_4|CP011304.1_4251267_4252893_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|104aa|up_3|CP011304.1_4252939_4253251_-	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|46aa|up_2|CP011304.1_4253387_4253525_-	NA	NA|143aa|up_1|CP011304.1_4254107_4254536_-	pfam14159, CAAD, CAAD domains of cyanobacterial aminoacyl-tRNA synthetase	NA|39aa|up_0|CP011304.1_4254625_4254742_+	NA	NA|774aa|down_0|CP011304.1_4255106_4257428_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|417aa|down_1|CP011304.1_4257753_4259004_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|185aa|down_2|CP011304.1_4259549_4260104_-	pfam09367, CpeS, CpeS-like protein	NA|321aa|down_3|CP011304.1_4260131_4261094_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|406aa|down_4|CP011304.1_4261145_4262363_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|320aa|down_5|CP011304.1_4262365_4263325_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|307aa|down_6|CP011304.1_4263339_4264260_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|328aa|down_7|CP011304.1_4264525_4265509_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|119aa|down_8|CP011304.1_4265641_4265998_+	pfam02152, FolB, Dihydroneopterin aldolase	NA|181aa|down_9|CP011304.1_4265984_4266527_-	pfam03358, FMN_red, NADPH-dependent FMN reductase
GCA_000981785.2_ASM98178v2	CP011304	Microcystis aeruginosa NIES-2549, complete genome	29	4285765-4285900	28	CRISPRCasFinder	no	cas14j,RT	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	Unclear	TATGGTATTAGCTAAAGTGGTTTTCGGTGCAGCCCCGACCAC	42	0	0	NA	NA	NA	1	1	TypeV	cas3,cas14j,Cas14c_CAS-V-F,RT,c2c9_V-U4,csa3,cas14k,2OG_CAS,DinG,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,cas4,cas6,cas5,cas7,cas8b5,WYL	NA|216aa|up_5|CP011304.1_4280007_4280655_+,NA|212aa|up_4|CP011304.1_4280712_4281348_+,NA|40aa|down_0|CP011304.1_4286355_4286475_-,NA|184aa|down_5|CP011304.1_4292314_4292866_-,NA|46aa|down_7|CP011304.1_4293666_4293804_-	NA|39aa|up_9|CP011304.1_4274964_4275081_-	PRK06187, PRK06187, long-chain-fatty-acid--CoA ligase; Validated	NA|785aa|up_8|CP011304.1_4275341_4277696_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|145aa|up_7|CP011304.1_4277989_4278424_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|423aa|up_6|CP011304.1_4278675_4279944_-	COG2242, CobL, Precorrin-6B methylase 2 [Coenzyme metabolism]	NA|216aa|up_5|CP011304.1_4280007_4280655_+	NA	NA|212aa|up_4|CP011304.1_4280712_4281348_+	NA	NA|472aa|up_3|CP011304.1_4281532_4282948_+	PRK09567, nirA, NirA family protein	NA|209aa|up_2|CP011304.1_4282957_4283584_+	PRK08285, cobH, precorrin-8X methylmutase; Reviewed	NA|235aa|up_1|CP011304.1_4283580_4284285_+	PRK05990, PRK05990, precorrin-2 C(20)-methyltransferase; Reviewed	NA|383aa|up_0|CP011304.1_4284595_4285744_+	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|40aa|down_0|CP011304.1_4286355_4286475_-	NA	NA|330aa|down_1|CP011304.1_4286786_4287776_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	RT|587aa|down_2|CP011304.1_4288507_4290268_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas14j|291aa|down_3|CP011304.1_4290755_4291628_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|205aa|down_4|CP011304.1_4291695_4292310_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|184aa|down_5|CP011304.1_4292314_4292866_-	NA	NA|219aa|down_6|CP011304.1_4292979_4293636_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|46aa|down_7|CP011304.1_4293666_4293804_-	NA	NA|NA	NA	NA|NA	NA
