assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	1	7479-7566	1	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AAATCTTGCTCAAACTGACGTAA	23	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|454aa|up_5|NC_011729.1_89_1451_+	PRK00149, dnaA, chromosomal replication initiator protein DnaA	NA|300aa|up_4|NC_011729.1_1605_2505_+	PRK12887, ubiA, tocopherol phytyltransferase; Reviewed	NA|369aa|up_3|NC_011729.1_2616_3723_+	pfam14249, Tocopherol_cycl, Tocopherol cyclase	NA|172aa|up_2|NC_011729.1_3713_4229_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|246aa|up_1|NC_011729.1_4802_5540_-	TIGR03763, conserved_hypothetical_protein, cyanoexosortase A	NA|503aa|up_0|NC_011729.1_5733_7242_+	COG0043, UbiD, 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases [Coenzyme metabolism]	NA|232aa|down_0|NC_011729.1_8188_8884_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|384aa|down_1|NC_011729.1_8910_10062_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|440aa|down_2|NC_011729.1_10064_11384_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|793aa|down_3|NC_011729.1_11864_14243_-	CHL00095, clpC, Clp protease ATP binding subunit	NA|285aa|down_4|NC_011729.1_14378_15233_-	PRK06027, purU, formyltetrahydrofolate deformylase; Reviewed	NA|136aa|down_5|NC_011729.1_15436_15844_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|136aa|down_6|NC_011729.1_15966_16374_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|412aa|down_7|NC_011729.1_16509_17745_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|362aa|down_8|NC_011729.1_17965_19051_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|378aa|down_9|NC_011729.1_19064_20198_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	2	84037-84437	1	PILER-CR	no	cas14k	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	GATCGCACTCCATTCACCCTTTTGAGCTTGTTGTTGGATAATGGGAAAAGTTT	53	1	1	84090-84123	NC_011729.1_84438-84471	NA	4	4	TypeV	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|64aa|up_2|NC_011729.1_82655_82847_-,NA|55aa|up_1|NC_011729.1_82843_83008_-,NA|75aa|up_0|NC_011729.1_83027_83252_+,NA|46aa|down_0|NC_011729.1_87197_87335_+,NA|69aa|down_5|NC_011729.1_91337_91544_-,NA|74aa|down_7|NC_011729.1_93546_93768_-	NA|302aa|up_9|NC_011729.1_72136_73042_-	PLN02824, PLN02824, hydrolase, alpha/beta fold family protein	NA|293aa|up_8|NC_011729.1_73114_73993_-	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|501aa|up_7|NC_011729.1_74162_75665_+	pfam13205, Big_5, Bacterial Ig-like domain	NA|931aa|up_6|NC_011729.1_75661_78454_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|451aa|up_5|NC_011729.1_78750_80103_+	pfam01832, Glucosaminidase, Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase	NA|568aa|up_4|NC_011729.1_80235_81939_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|231aa|up_3|NC_011729.1_81942_82635_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|64aa|up_2|NC_011729.1_82655_82847_-	NA	NA|55aa|up_1|NC_011729.1_82843_83008_-	NA	NA|75aa|up_0|NC_011729.1_83027_83252_+	NA	NA|46aa|down_0|NC_011729.1_87197_87335_+	NA	NA|476aa|down_1|NC_011729.1_87567_88995_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|185aa|down_2|NC_011729.1_89026_89581_-	pfam05685, Uma2, Putative restriction endonuclease	NA|185aa|down_3|NC_011729.1_89626_90181_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|160aa|down_4|NC_011729.1_90192_90672_-	cd00483, HPPK, 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase (HPPK)	NA|69aa|down_5|NC_011729.1_91337_91544_-	NA	NA|500aa|down_6|NC_011729.1_91652_93152_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|74aa|down_7|NC_011729.1_93546_93768_-	NA	NA|222aa|down_8|NC_011729.1_97304_97970_-	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|297aa|down_9|NC_011729.1_98062_98953_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	3	465444-465541	2	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	ATGCGTTCCCAACGCATCAAAAC	23	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|74aa|up_4|NC_011729.1_458115_458337_-,NA|134aa|down_8|NC_011729.1_473865_474267_+	NA|206aa|up_9|NC_011729.1_451728_452346_+	pfam13565, HTH_32, Homeodomain-like domain	NA|909aa|up_8|NC_011729.1_452351_455078_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|505aa|up_7|NC_011729.1_455150_456665_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|242aa|up_6|NC_011729.1_456734_457460_+	TIGR04500, PpiC_rel_mature, putative peptide maturation system protein	NA|136aa|up_5|NC_011729.1_457711_458119_-	cd18744, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|74aa|up_4|NC_011729.1_458115_458337_-	NA	NA|482aa|up_3|NC_011729.1_458588_460034_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|434aa|up_2|NC_011729.1_460030_461332_-	smart00382, AAA, ATPases associated with a variety of cellular activities	NA|362aa|up_1|NC_011729.1_463166_464252_-	pfam17914, HopA1, HopA1 effector protein family	NA|384aa|up_0|NC_011729.1_464263_465415_-	cd05120, APH_ChoK_like, Aminoglycoside 3'-phosphotransferase and Choline Kinase family	NA|421aa|down_0|NC_011729.1_465745_467008_-	TIGR04103, heme_biosynthesis_protein, nif11-class peptide radical SAM maturase 3	NA|99aa|down_1|NC_011729.1_467140_467437_-	pfam07862, Nif11, Nif11 domain	NA|236aa|down_2|NC_011729.1_467892_468600_+	sd00006, TPR, Tetratricopeptide repeat	NA|208aa|down_3|NC_011729.1_468851_469475_-	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|275aa|down_4|NC_011729.1_469760_470585_+	cd01641, Bacterial_IMPase_like_1, Predominantly bacterial family of Mg++ dependend phosphatases, related to inositol monophosphatases	NA|66aa|down_5|NC_011729.1_470717_470915_+	PRK15489, nfrB, glycosyl transferase family protein	NA|188aa|down_6|NC_011729.1_471344_471908_+	cd16433, CheB, Chemotaxis response regulator protein-glutamate methylesterase, CheB	NA|619aa|down_7|NC_011729.1_471970_473827_+	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|134aa|down_8|NC_011729.1_473865_474267_+	NA	NA|704aa|down_9|NC_011729.1_474550_476662_+	PRK13557, PRK13557, histidine kinase; Provisional
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	4	535108-535190	3	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AGCGTAACGCACCTCTACCCATGAA	25	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|72aa|down_0|NC_011729.1_535461_535677_+	NA|468aa|up_9|NC_011729.1_522617_524021_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|430aa|up_8|NC_011729.1_524354_525644_+	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|368aa|up_7|NC_011729.1_525852_526956_+	TIGR00367, Uncharacterized_membrane_protein_MJ0091, K+-dependent Na+/Ca+ exchanger related-protein	NA|344aa|up_6|NC_011729.1_526965_527997_-	cd10001, HDAC_classII_APAH, Histone deacetylase class IIa	NA|362aa|up_5|NC_011729.1_527980_529066_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|163aa|up_4|NC_011729.1_529238_529727_+	pfam04307, YdjM, LexA-binding, inner membrane-associated putative hydrolase	NA|213aa|up_3|NC_011729.1_529743_530382_-	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|576aa|up_2|NC_011729.1_530821_532549_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|191aa|up_1|NC_011729.1_532679_533252_+	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|578aa|up_0|NC_011729.1_533351_535085_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|72aa|down_0|NC_011729.1_535461_535677_+	NA	NA|155aa|down_1|NC_011729.1_535681_536146_+	TIGR04062, hypothetical_protein_CY0110_29519, dnd system-associated protein 4	NA|278aa|down_2|NC_011729.1_536220_537054_+	COG0189, RimK, Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis]	NA|213aa|down_3|NC_011729.1_537139_537778_+	pfam07099, DUF1361, Protein of unknown function (DUF1361)	NA|189aa|down_4|NC_011729.1_537821_538388_-	pfam08239, SH3_3, Bacterial SH3 domain	NA|158aa|down_5|NC_011729.1_538758_539232_+	COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms]	NA|262aa|down_6|NC_011729.1_539256_540042_-	cd05233, SDR_c, classical (c) SDRs	NA|384aa|down_7|NC_011729.1_540181_541333_+	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	NA|441aa|down_8|NC_011729.1_542032_543355_-	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|254aa|down_9|NC_011729.1_543439_544201_-	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	5	684808-684891	4	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AAACTTTAACACCACAATCAATGGCTCA	28	1	25	684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863|684836-684863	NC_011729.1_684983-685010|NC_011729.1_838313-838340|NC_011729.1_1815177-1815150|NC_011729.1_1815245-1815218|NC_011729.1_1815344-1815317|NC_011729.1_3455265-3455292|NC_011729.1_4794463-4794436|NC_011729.1_272193-272220|NC_011729.1_293491-293464|NC_011729.1_749756-749783|NC_011729.1_882983-883010|NC_011729.1_1602306-1602333|NC_011729.1_1602398-1602425|NC_011729.1_1706047-1706020|NC_011729.1_2557278-2557305|NC_011729.1_3339531-3339504|NC_011729.1_3339574-3339547|NC_011729.1_3339617-3339590|NC_011729.1_4441623-4441596|NC_011729.1_4995902-4995929|NC_011729.1_5002316-5002289|NC_011729.1_5671597-5671570|NC_011729.1_5718867-5718840|NC_011729.1_5844653-5844680|NC_011729.1_5872560-5872533	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|64aa|up_6|NC_011729.1_679638_679830_+,NA|81aa|up_2|NC_011729.1_682191_682434_-,NA|116aa|down_6|NC_011729.1_694418_694766_-	NA|322aa|up_9|NC_011729.1_675797_676763_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|623aa|up_8|NC_011729.1_677076_678945_-	PRK14016, PRK14016, cyanophycin synthetase; Provisional	NA|147aa|up_7|NC_011729.1_679161_679602_+	COG4067, COG4067, Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]	NA|64aa|up_6|NC_011729.1_679638_679830_+	NA	NA|360aa|up_5|NC_011729.1_679829_680909_+	pfam11805, DUF3326, Protein of unknown function (DUF3326)	NA|197aa|up_4|NC_011729.1_680895_681486_+	pfam02517, Abi, CAAX protease self-immunity	NA|150aa|up_3|NC_011729.1_681506_681956_+	pfam12049, DUF3531, Protein of unknown function (DUF3531)	NA|81aa|up_2|NC_011729.1_682191_682434_-	NA	NA|401aa|up_1|NC_011729.1_682700_683903_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|225aa|up_0|NC_011729.1_684053_684728_+	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|444aa|down_0|NC_011729.1_685206_686538_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|873aa|down_1|NC_011729.1_686600_689219_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|324aa|down_2|NC_011729.1_689933_690905_+	COG1957, URH1, Inosine-uridine nucleoside N-ribohydrolase [Nucleotide transport and metabolism]	NA|242aa|down_3|NC_011729.1_691130_691856_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|586aa|down_4|NC_011729.1_691895_693653_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|105aa|down_5|NC_011729.1_693736_694051_-	pfam09421, FRQ, Frequency clock protein	NA|116aa|down_6|NC_011729.1_694418_694766_-	NA	NA|103aa|down_7|NC_011729.1_694776_695085_-	TIGR03792, conserved_hypothetical_protein, uncharacterized cyanobacterial protein, TIGR03792 family	NA|544aa|down_8|NC_011729.1_695310_696942_-	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|226aa|down_9|NC_011729.1_697259_697937_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	6	873485-874248	5,1,2	CRISPRCasFinder,CRT,PILER-CR	no	cas14j,cas10,cmr3gr5,cmr4gr7,cmr5gr11	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type III-C,Type III-D,Type III-A,Type III-B	GTTTCCAACAATTCCTATTCTACCCAATAGGTAGGG,GTTTCCAACAATTCCTATTCTACCCAATAGGTAGGG,GTTTCCAACAATTCCTATTCTACCCAATAGGTAGGG	36,36,36	0	0	NA	NA	NA:NA:NA	10,10,9	10	TypeIII-C,TypeIII-A,TypeV,TypeIII-D,TypeIII-B	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|220aa|up_8|NC_011729.1_861223_861883_-,NA|376aa|up_7|NC_011729.1_861892_863020_-,NA|137aa|up_2|NC_011729.1_870554_870965_-,cmr5gr11|142aa|down_3|NC_011729.1_880720_881146_+,NA|90aa|down_5|NC_011729.1_883262_883532_+	NA|342aa|up_9|NC_011729.1_860186_861212_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|220aa|up_8|NC_011729.1_861223_861883_-	NA	NA|376aa|up_7|NC_011729.1_861892_863020_-	NA	NA|637aa|up_6|NC_011729.1_863244_865155_-	sd00006, TPR, Tetratricopeptide repeat	NA|208aa|up_5|NC_011729.1_865756_866380_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|480aa|up_4|NC_011729.1_868523_869963_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|154aa|up_3|NC_011729.1_870093_870555_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|137aa|up_2|NC_011729.1_870554_870965_-	NA	cas14j|501aa|up_1|NC_011729.1_871332_872835_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|135aa|up_0|NC_011729.1_872902_873307_+	pfam01797, Y1_Tnp, Transposase IS200 like	cas10|681aa|down_0|NC_011729.1_875979_878022_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|375aa|down_1|NC_011729.1_878009_879134_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|286aa|down_2|NC_011729.1_879863_880721_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|142aa|down_3|NC_011729.1_880720_881146_+	NA	NA|562aa|down_4|NC_011729.1_881255_882941_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	NA|90aa|down_5|NC_011729.1_883262_883532_+	NA	NA|146aa|down_6|NC_011729.1_883528_883966_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|102aa|down_7|NC_011729.1_885723_886029_+	pfam10693, DUF2499, Protein of unknown function (DUF2499)	NA|99aa|down_8|NC_011729.1_886048_886345_+	pfam12159, DUF3593, Protein of unknown function (DUF3593)	NA|349aa|down_9|NC_011729.1_886905_887952_+	cd13542, PBP2_FutA1_ilke, Substrate binding domain of ferric iron-binding protein, a member of the type 2 periplasmic binding fold superfamily
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	7	884072-885274	6,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas14j,cas10,cmr3gr5,cmr4gr7,cmr5gr11	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type III-C,Type III-D,Type III-A,Type III-B	GTTTCCAACAAGTCCTATTCAACCCAATAGGTAGGG,GTTTCCAACAAGTCCTATTCAACCCAATAGGTAGGG,GTTTCCAACAAGTCCTATTCAACCCAATAGGTAGGG	36,36,36	0	0	NA	NA	NA:NA:NA	16,16,15	16	TypeIII-C,TypeIII-A,TypeV,TypeIII-D,TypeIII-B	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|137aa|up_9|NC_011729.1_870554_870965_-,cmr5gr11|142aa|up_3|NC_011729.1_880720_881146_+,NA|90aa|up_1|NC_011729.1_883262_883532_+,NA|177aa|down_5|NC_011729.1_890468_890999_+,NA|100aa|down_6|NC_011729.1_891012_891312_+,NA|88aa|down_8|NC_011729.1_893106_893370_-	NA|137aa|up_9|NC_011729.1_870554_870965_-	NA	cas14j|501aa|up_8|NC_011729.1_871332_872835_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|135aa|up_7|NC_011729.1_872902_873307_+	pfam01797, Y1_Tnp, Transposase IS200 like	cas10|681aa|up_6|NC_011729.1_875979_878022_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|375aa|up_5|NC_011729.1_878009_879134_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|286aa|up_4|NC_011729.1_879863_880721_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|142aa|up_3|NC_011729.1_880720_881146_+	NA	NA|562aa|up_2|NC_011729.1_881255_882941_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	NA|90aa|up_1|NC_011729.1_883262_883532_+	NA	NA|146aa|up_0|NC_011729.1_883528_883966_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|102aa|down_0|NC_011729.1_885723_886029_+	pfam10693, DUF2499, Protein of unknown function (DUF2499)	NA|99aa|down_1|NC_011729.1_886048_886345_+	pfam12159, DUF3593, Protein of unknown function (DUF3593)	NA|349aa|down_2|NC_011729.1_886905_887952_+	cd13542, PBP2_FutA1_ilke, Substrate binding domain of ferric iron-binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|493aa|down_3|NC_011729.1_888501_889980_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|112aa|down_4|NC_011729.1_889998_890334_+	pfam04023, FeoA, FeoA domain	NA|177aa|down_5|NC_011729.1_890468_890999_+	NA	NA|100aa|down_6|NC_011729.1_891012_891312_+	NA	NA|454aa|down_7|NC_011729.1_891736_893098_+	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|88aa|down_8|NC_011729.1_893106_893370_-	NA	NA|166aa|down_9|NC_011729.1_893558_894056_-	cd14741, PAAR_5, proline-alanine-alanine-arginine (PAAR) domain
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	8	1278038-1278136	7	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	TCAAGCACGAATGGAGGAAAAATT	24	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|99aa|down_4|NC_011729.1_1287167_1287464_+,NA|173aa|down_6|NC_011729.1_1288651_1289170_-	NA|523aa|up_9|NC_011729.1_1262593_1264162_-	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|132aa|up_8|NC_011729.1_1264336_1264732_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|472aa|up_7|NC_011729.1_1264923_1266339_+	TIGR02731, Phytoene_dehydrogenase_chloroplastic/chromoplastic, phytoene desaturase	NA|311aa|up_6|NC_011729.1_1266510_1267443_+	PLN02632, PLN02632, phytoene synthase	NA|295aa|up_5|NC_011729.1_1267497_1268382_-	PRK12870, ubiA, 4-hydroxybenzoate octaprenyltransferase	NA|551aa|up_4|NC_011729.1_1268479_1270132_+	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|116aa|up_3|NC_011729.1_1270267_1270615_+	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|361aa|up_2|NC_011729.1_1270787_1271870_+	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|192aa|up_1|NC_011729.1_1272076_1272652_-	pfam03551, PadR, Transcriptional regulator PadR-like family	NA|1044aa|up_0|NC_011729.1_1274413_1277545_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|1164aa|down_0|NC_011729.1_1278558_1282050_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|521aa|down_1|NC_011729.1_1282119_1283682_-	pfam14516, AAA_35, AAA-like domain	NA|557aa|down_2|NC_011729.1_1283986_1285657_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|257aa|down_3|NC_011729.1_1286369_1287140_+	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|99aa|down_4|NC_011729.1_1287167_1287464_+	NA	NA|314aa|down_5|NC_011729.1_1287576_1288518_+	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|173aa|down_6|NC_011729.1_1288651_1289170_-	NA	NA|366aa|down_7|NC_011729.1_1289225_1290323_-	pfam14559, TPR_19, Tetratricopeptide repeat	NA|242aa|down_8|NC_011729.1_1290545_1291271_-	pfam04452, Methyltrans_RNA, RNA methyltransferase	NA|200aa|down_9|NC_011729.1_1291267_1291867_-	PRK10502, PRK10502, putative acyl transferase; Provisional
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	9	1283764-1283884	8	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GTAGGGTGTGTTAGCGCAGAGTAACGCACCAAAACCC	37	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|137aa|up_2|NC_011729.1_1277927_1278338_+,NA|99aa|down_2|NC_011729.1_1287167_1287464_+,NA|173aa|down_4|NC_011729.1_1288651_1289170_-,NA|140aa|down_9|NC_011729.1_1292851_1293271_+	NA|311aa|up_9|NC_011729.1_1266510_1267443_+	PLN02632, PLN02632, phytoene synthase	NA|295aa|up_8|NC_011729.1_1267497_1268382_-	PRK12870, ubiA, 4-hydroxybenzoate octaprenyltransferase	NA|551aa|up_7|NC_011729.1_1268479_1270132_+	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|116aa|up_6|NC_011729.1_1270267_1270615_+	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|361aa|up_5|NC_011729.1_1270787_1271870_+	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|192aa|up_4|NC_011729.1_1272076_1272652_-	pfam03551, PadR, Transcriptional regulator PadR-like family	NA|1044aa|up_3|NC_011729.1_1274413_1277545_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|137aa|up_2|NC_011729.1_1277927_1278338_+	NA	NA|1164aa|up_1|NC_011729.1_1278558_1282050_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|521aa|up_0|NC_011729.1_1282119_1283682_-	pfam14516, AAA_35, AAA-like domain	NA|557aa|down_0|NC_011729.1_1283986_1285657_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|257aa|down_1|NC_011729.1_1286369_1287140_+	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|99aa|down_2|NC_011729.1_1287167_1287464_+	NA	NA|314aa|down_3|NC_011729.1_1287576_1288518_+	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|173aa|down_4|NC_011729.1_1288651_1289170_-	NA	NA|366aa|down_5|NC_011729.1_1289225_1290323_-	pfam14559, TPR_19, Tetratricopeptide repeat	NA|242aa|down_6|NC_011729.1_1290545_1291271_-	pfam04452, Methyltrans_RNA, RNA methyltransferase	NA|200aa|down_7|NC_011729.1_1291267_1291867_-	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|186aa|down_8|NC_011729.1_1292081_1292639_+	PRK05800, cobU, adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated	NA|140aa|down_9|NC_011729.1_1292851_1293271_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	10	1288514-1288606	9	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	ATTAAATAGTAGGGTGTGTTAGC	23	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|137aa|up_5|NC_011729.1_1277927_1278338_+,NA|99aa|up_0|NC_011729.1_1287167_1287464_+,NA|173aa|down_0|NC_011729.1_1288651_1289170_-,NA|140aa|down_5|NC_011729.1_1292851_1293271_+,NA|61aa|down_7|NC_011729.1_1294307_1294490_+,NA|583aa|down_8|NC_011729.1_1294470_1296219_-	NA|116aa|up_9|NC_011729.1_1270267_1270615_+	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|361aa|up_8|NC_011729.1_1270787_1271870_+	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|192aa|up_7|NC_011729.1_1272076_1272652_-	pfam03551, PadR, Transcriptional regulator PadR-like family	NA|1044aa|up_6|NC_011729.1_1274413_1277545_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|137aa|up_5|NC_011729.1_1277927_1278338_+	NA	NA|1164aa|up_4|NC_011729.1_1278558_1282050_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|521aa|up_3|NC_011729.1_1282119_1283682_-	pfam14516, AAA_35, AAA-like domain	NA|557aa|up_2|NC_011729.1_1283986_1285657_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|257aa|up_1|NC_011729.1_1286369_1287140_+	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|99aa|up_0|NC_011729.1_1287167_1287464_+	NA	NA|173aa|down_0|NC_011729.1_1288651_1289170_-	NA	NA|366aa|down_1|NC_011729.1_1289225_1290323_-	pfam14559, TPR_19, Tetratricopeptide repeat	NA|242aa|down_2|NC_011729.1_1290545_1291271_-	pfam04452, Methyltrans_RNA, RNA methyltransferase	NA|200aa|down_3|NC_011729.1_1291267_1291867_-	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|186aa|down_4|NC_011729.1_1292081_1292639_+	PRK05800, cobU, adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated	NA|140aa|down_5|NC_011729.1_1292851_1293271_+	NA	NA|319aa|down_6|NC_011729.1_1293273_1294230_-	TIGR02651, Ribonuclease_Z, ribonuclease Z	NA|61aa|down_7|NC_011729.1_1294307_1294490_+	NA	NA|583aa|down_8|NC_011729.1_1294470_1296219_-	NA	NA|347aa|down_9|NC_011729.1_1296339_1297380_-	PRK09604, PRK09604, tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	11	1894683-1904400	4,10,3	PILER-CR,CRISPRCasFinder,CRT	no	csa3,WYL,cas3,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas1,cas2	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type I-D	CTTTCTATTT--AATGAATCCTGGTAACGGGATTGAAAC,CTTTCTATTTAATGAATCCTGGTAACGGGATTGAAAC,CTTTCTATTTAATGAATCCTGGTAACGGGATTGAAAC	39,37,37	1	1	1900255-1900287	NC_011729.1_150129-150161	V-U2:V-U2:V-U2	133,133,133	133	TypeI-D	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|82aa|up_8|NC_011729.1_1883624_1883870_+,NA|51aa|down_7|NC_011729.1_1912780_1912933_-	NA|152aa|up_9|NC_011729.1_1882840_1883296_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|82aa|up_8|NC_011729.1_1883624_1883870_+	NA	cas3|727aa|up_7|NC_011729.1_1884117_1886298_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas10d|995aa|up_6|NC_011729.1_1886386_1889371_+	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	csc2gr7|339aa|up_5|NC_011729.1_1889405_1890422_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|254aa|up_4|NC_011729.1_1890878_1891640_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	cas6|272aa|up_3|NC_011729.1_1891611_1892427_+	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	cas4|201aa|up_2|NC_011729.1_1892429_1893032_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|326aa|up_1|NC_011729.1_1893172_1894150_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|98aa|up_0|NC_011729.1_1894146_1894440_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|257aa|down_0|NC_011729.1_1904518_1905289_-	pfam13911, AhpC-TSA_2, AhpC/TSA antioxidant enzyme	NA|227aa|down_1|NC_011729.1_1905467_1906148_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|124aa|down_2|NC_011729.1_1906253_1906625_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|598aa|down_3|NC_011729.1_1906807_1908601_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|261aa|down_4|NC_011729.1_1908779_1909562_+	PRK05420, PRK05420, aquaporin Z; Provisional	NA|786aa|down_5|NC_011729.1_1909852_1912210_-	TIGR02243, hypothetical_protein_SCD8A	NA|134aa|down_6|NC_011729.1_1912295_1912697_-	pfam04965, GPW_gp25, Gene 25-like lysozyme	NA|51aa|down_7|NC_011729.1_1912780_1912933_-	NA	NA|560aa|down_8|NC_011729.1_1913168_1914848_-	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|168aa|down_9|NC_011729.1_1915353_1915857_-	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	12	2344682-2344863	11	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GTTTCAACAGCCCTCCCGATGTGGGATGGGTTGAAAG	37	0	0	NA	NA	V-U5	2	2	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|296aa|up_7|NC_011729.1_2331454_2332342_-,NA|169aa|up_6|NC_011729.1_2332361_2332868_-,NA|411aa|up_4|NC_011729.1_2335047_2336280_-,NA|248aa|down_6|NC_011729.1_2353305_2354049_-,NA|203aa|down_8|NC_011729.1_2355185_2355794_+,NA|98aa|down_9|NC_011729.1_2355938_2356232_-	NA|333aa|up_9|NC_011729.1_2329839_2330838_+	pfam02397, Bac_transf, Bacterial sugar transferase	NA|140aa|up_8|NC_011729.1_2330863_2331283_+	pfam13581, HATPase_c_2, Histidine kinase-like ATPase domain	NA|296aa|up_7|NC_011729.1_2331454_2332342_-	NA	NA|169aa|up_6|NC_011729.1_2332361_2332868_-	NA	NA|556aa|up_5|NC_011729.1_2333269_2334937_+	pfam00665, rve, Integrase core domain	NA|411aa|up_4|NC_011729.1_2335047_2336280_-	NA	NA|507aa|up_3|NC_011729.1_2336282_2337803_-	TIGR04435, ABC_transporter, restriction system-associated AAA family ATPase	NA|712aa|up_2|NC_011729.1_2337820_2339956_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|418aa|up_1|NC_011729.1_2339959_2341213_-	cd17515, RMtype1_S_MjaORF132P_Sau1132ORF3780P-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to MjaXIP/S	NA|510aa|up_0|NC_011729.1_2341187_2342717_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|752aa|down_0|NC_011729.1_2345429_2347685_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|154aa|down_1|NC_011729.1_2348021_2348483_-	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|396aa|down_2|NC_011729.1_2348616_2349804_+	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|389aa|down_3|NC_011729.1_2349974_2351141_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|429aa|down_4|NC_011729.1_2351163_2352450_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|187aa|down_5|NC_011729.1_2352502_2353063_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|248aa|down_6|NC_011729.1_2353305_2354049_-	NA	NA|291aa|down_7|NC_011729.1_2354086_2354959_-	cd19088, AKR_AKR13B1, AKR13B family of aldo-keto reductase (AKR)	NA|203aa|down_8|NC_011729.1_2355185_2355794_+	NA	NA|98aa|down_9|NC_011729.1_2355938_2356232_-	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	13	2385683-2385781	12	CRISPRCasFinder	no	c2c9_V-U4	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type V-U4	CTACTATAAAAATGTCCCACTGTT	24	0	0	NA	NA	NA	1	1	TypeV-U4	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA	NA|402aa|up_9|NC_011729.1_2374433_2375639_-	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|129aa|up_8|NC_011729.1_2375716_2376103_-	PRK07118, PRK07118, Fe-S cluster domain-containing protein	NA|484aa|up_7|NC_011729.1_2376156_2377608_-	TIGR01290, FeMo_cofactor_biosynthesis_protein_NifB, nitrogenase cofactor biosynthesis protein NifB	NA|74aa|up_6|NC_011729.1_2379726_2379948_+	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|383aa|up_5|NC_011729.1_2380067_2381216_+	PRK11858, aksA, trans-homoaconitate synthase; Reviewed	NA|89aa|up_4|NC_011729.1_2381188_2381455_+	pfam04319, NifZ, NifZ domain	NA|67aa|up_3|NC_011729.1_2381485_2381686_+	pfam06988, NifT, NifT/FixU protein	NA|209aa|up_2|NC_011729.1_2381738_2382365_+	cd04630, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|534aa|up_1|NC_011729.1_2383228_2384830_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|122aa|up_0|NC_011729.1_2385126_2385492_+	cd00562, NifX_NifB, This CD represents a family of iron-molybdenum cluster-binding proteins that includes NifB, NifX, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|285aa|down_0|NC_011729.1_2385846_2386701_-	TIGR03340, phn_DUF6, phosphonate utilization associated putative membrane protein	NA|285aa|down_1|NC_011729.1_2387124_2387979_-	COG2510, COG2510, Predicted membrane protein [Function unknown]	NA|281aa|down_2|NC_011729.1_2388099_2388942_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|88aa|down_3|NC_011729.1_2389480_2389744_-	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|117aa|down_4|NC_011729.1_2389829_2390180_-	pfam08883, DOPA_dioxygen, Dopa 4,5-dioxygenase family	NA|301aa|down_5|NC_011729.1_2390296_2391199_-	cd09020, D-hex-6-P-epi_like, D-hexose-6-phosphate epimerase-like	c2c9_V-U4|364aa|down_6|NC_011729.1_2393854_2394946_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|389aa|down_7|NC_011729.1_2395419_2396586_-	PRK07366, PRK07366, LL-diaminopimelate aminotransferase	NA|109aa|down_8|NC_011729.1_2396676_2397003_-	pfam00085, Thioredoxin, Thioredoxin	NA|260aa|down_9|NC_011729.1_2397124_2397904_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	14	2393451-2393698	4,13,5	CRT,CRISPRCasFinder,PILER-CR	no	c2c9_V-U4	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type V-U4	TGAAGTTTCCAACAATTCCTATTCAACCCAATAGGTAGGG,GTTTCCAACAATTCCTATTCAACCCAATAGGTAGGG,GAAGTTTCCAACAATTCCTATTCAACCCAATAGGTAGGG	40,36,39	0	0	NA	NA	NA:NA:NA	3,3,2	3	TypeV-U4	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|71aa|down_4|NC_011729.1_2398715_2398928_+,NA|84aa|down_7|NC_011729.1_2401199_2401451_+	NA|67aa|up_9|NC_011729.1_2381485_2381686_+	pfam06988, NifT, NifT/FixU protein	NA|209aa|up_8|NC_011729.1_2381738_2382365_+	cd04630, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|534aa|up_7|NC_011729.1_2383228_2384830_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|122aa|up_6|NC_011729.1_2385126_2385492_+	cd00562, NifX_NifB, This CD represents a family of iron-molybdenum cluster-binding proteins that includes NifB, NifX, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|285aa|up_5|NC_011729.1_2385846_2386701_-	TIGR03340, phn_DUF6, phosphonate utilization associated putative membrane protein	NA|285aa|up_4|NC_011729.1_2387124_2387979_-	COG2510, COG2510, Predicted membrane protein [Function unknown]	NA|281aa|up_3|NC_011729.1_2388099_2388942_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|88aa|up_2|NC_011729.1_2389480_2389744_-	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|117aa|up_1|NC_011729.1_2389829_2390180_-	pfam08883, DOPA_dioxygen, Dopa 4,5-dioxygenase family	NA|301aa|up_0|NC_011729.1_2390296_2391199_-	cd09020, D-hex-6-P-epi_like, D-hexose-6-phosphate epimerase-like	c2c9_V-U4|364aa|down_0|NC_011729.1_2393854_2394946_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|389aa|down_1|NC_011729.1_2395419_2396586_-	PRK07366, PRK07366, LL-diaminopimelate aminotransferase	NA|109aa|down_2|NC_011729.1_2396676_2397003_-	pfam00085, Thioredoxin, Thioredoxin	NA|260aa|down_3|NC_011729.1_2397124_2397904_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|71aa|down_4|NC_011729.1_2398715_2398928_+	NA	NA|393aa|down_5|NC_011729.1_2399025_2400204_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|253aa|down_6|NC_011729.1_2400223_2400982_+	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|84aa|down_7|NC_011729.1_2401199_2401451_+	NA	NA|261aa|down_8|NC_011729.1_2401447_2402230_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|256aa|down_9|NC_011729.1_2402441_2403209_-	COG0095, LplA, Lipoate-protein ligase A [Coenzyme metabolism]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	15	2456856-2456925	14	CRISPRCasFinder	no	cas14j	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	ACTGTTAAGTGTTCACTGTTAAC	23	0	0	NA	NA	NA	1	1	TypeV	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|175aa|up_4|NC_011729.1_2451539_2452064_+,NA|256aa|down_1|NC_011729.1_2458439_2459207_-,NA|442aa|down_4|NC_011729.1_2463407_2464733_+,NA|187aa|down_6|NC_011729.1_2465989_2466550_+	NA|474aa|up_9|NC_011729.1_2447480_2448902_+	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|122aa|up_8|NC_011729.1_2448969_2449335_-	pfam01391, Collagen, Collagen triple helix repeat (20 copies)	NA|353aa|up_7|NC_011729.1_2449367_2450426_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|121aa|up_6|NC_011729.1_2450483_2450846_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|87aa|up_5|NC_011729.1_2450842_2451103_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|175aa|up_4|NC_011729.1_2451539_2452064_+	NA	cas14j|414aa|up_3|NC_011729.1_2452127_2453369_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|533aa|up_2|NC_011729.1_2454125_2455724_+	PRK05434, PRK05434, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|80aa|up_1|NC_011729.1_2455764_2456004_+	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|264aa|up_0|NC_011729.1_2456007_2456799_+	COG5549, COG5549, Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]	NA|500aa|down_0|NC_011729.1_2456943_2458443_+	cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia aldehyde dehydrogenase AldH (CAJ73105)-like	NA|256aa|down_1|NC_011729.1_2458439_2459207_-	NA	NA|607aa|down_2|NC_011729.1_2459306_2461127_-	pfam07705, CARDB, CARDB	NA|389aa|down_3|NC_011729.1_2461414_2462581_-	pfam18849, baeRF_family7, Bacterial archaeo-eukaryotic release factor family 7	NA|442aa|down_4|NC_011729.1_2463407_2464733_+	NA	NA|350aa|down_5|NC_011729.1_2464791_2465841_-	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|187aa|down_6|NC_011729.1_2465989_2466550_+	NA	NA|67aa|down_7|NC_011729.1_2466500_2466701_+	pfam01391, Collagen, Collagen triple helix repeat (20 copies)	NA|338aa|down_8|NC_011729.1_2466863_2467877_-	PRK07403, PRK07403, type I glyceraldehyde-3-phosphate dehydrogenase	NA|496aa|down_9|NC_011729.1_2468273_2469761_+	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	16	2576236-2576390	6	PILER-CR	no	cas5,cas7,cas8b3,cas6	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	AAATAATAAAGTTGTATAATTTC	23	0	0	NA	NA	NA	2	2	Unclear	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|98aa|up_9|NC_011729.1_2566906_2567200_-,NA|78aa|up_1|NC_011729.1_2575508_2575742_-,NA|142aa|down_0|NC_011729.1_2581171_2581597_-,NA|105aa|down_6|NC_011729.1_2586852_2587167_+,NA|122aa|down_8|NC_011729.1_2587626_2587992_-,NA|69aa|down_9|NC_011729.1_2587991_2588198_-	NA|98aa|up_9|NC_011729.1_2566906_2567200_-	NA	NA|294aa|up_8|NC_011729.1_2568103_2568985_+	pfam09622, DUF2391, Putative integral membrane protein (DUF2391)	NA|141aa|up_7|NC_011729.1_2568981_2569404_+	TIGR02588, TIGR02588, TIGR02588 family protein	NA|229aa|up_6|NC_011729.1_2569597_2570284_+	PRK12552, PRK12552, ATP-dependent Clp protease proteolytic subunit	NA|200aa|up_5|NC_011729.1_2570324_2570924_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|295aa|up_4|NC_011729.1_2571011_2571896_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|288aa|up_3|NC_011729.1_2572459_2573323_-	COG1218, CysQ, 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase [Inorganic ion transport and metabolism]	NA|395aa|up_2|NC_011729.1_2573809_2574994_-	COG1293, COG1293, Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]	NA|78aa|up_1|NC_011729.1_2575508_2575742_-	NA	NA|144aa|up_0|NC_011729.1_2575738_2576170_-	pfam13470, PIN_3, PIN domain	NA|142aa|down_0|NC_011729.1_2581171_2581597_-	NA	NA|158aa|down_1|NC_011729.1_2581677_2582151_-	pfam15978, TnsD, Tn7-like transposition protein D	cas5|210aa|down_2|NC_011729.1_2582869_2583499_-	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|287aa|down_3|NC_011729.1_2583510_2584371_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas8b3|574aa|down_4|NC_011729.1_2584398_2586120_-	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas6|215aa|down_5|NC_011729.1_2586116_2586761_-	pfam09559, Cas6, Cas6 Crispr	NA|105aa|down_6|NC_011729.1_2586852_2587167_+	NA	NA|77aa|down_7|NC_011729.1_2587179_2587410_+	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|122aa|down_8|NC_011729.1_2587626_2587992_-	NA	NA|69aa|down_9|NC_011729.1_2587991_2588198_-	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	17	2582378-2582627	7,15	PILER-CR,CRISPRCasFinder	no	cas5,cas7,cas8b3,cas6	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	GTGAGTAACGCCTTACGACATCAAGCTATAAATCAC,AGTAACGCCTTGCGACATCAAGCTATAAATCAC	36,33	0	0	NA	NA	NA:NA	3,3	3	Unclear	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|78aa|up_3|NC_011729.1_2575508_2575742_-,NA|142aa|up_1|NC_011729.1_2581171_2581597_-,NA|105aa|down_4|NC_011729.1_2586852_2587167_+,NA|122aa|down_6|NC_011729.1_2587626_2587992_-,NA|69aa|down_7|NC_011729.1_2587991_2588198_-,NA|75aa|down_8|NC_011729.1_2588210_2588435_-	NA|141aa|up_9|NC_011729.1_2568981_2569404_+	TIGR02588, TIGR02588, TIGR02588 family protein	NA|229aa|up_8|NC_011729.1_2569597_2570284_+	PRK12552, PRK12552, ATP-dependent Clp protease proteolytic subunit	NA|200aa|up_7|NC_011729.1_2570324_2570924_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|295aa|up_6|NC_011729.1_2571011_2571896_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|288aa|up_5|NC_011729.1_2572459_2573323_-	COG1218, CysQ, 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase [Inorganic ion transport and metabolism]	NA|395aa|up_4|NC_011729.1_2573809_2574994_-	COG1293, COG1293, Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]	NA|78aa|up_3|NC_011729.1_2575508_2575742_-	NA	NA|144aa|up_2|NC_011729.1_2575738_2576170_-	pfam13470, PIN_3, PIN domain	NA|142aa|up_1|NC_011729.1_2581171_2581597_-	NA	NA|158aa|up_0|NC_011729.1_2581677_2582151_-	pfam15978, TnsD, Tn7-like transposition protein D	cas5|210aa|down_0|NC_011729.1_2582869_2583499_-	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|287aa|down_1|NC_011729.1_2583510_2584371_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas8b3|574aa|down_2|NC_011729.1_2584398_2586120_-	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas6|215aa|down_3|NC_011729.1_2586116_2586761_-	pfam09559, Cas6, Cas6 Crispr	NA|105aa|down_4|NC_011729.1_2586852_2587167_+	NA	NA|77aa|down_5|NC_011729.1_2587179_2587410_+	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|122aa|down_6|NC_011729.1_2587626_2587992_-	NA	NA|69aa|down_7|NC_011729.1_2587991_2588198_-	NA	NA|75aa|down_8|NC_011729.1_2588210_2588435_-	NA	NA|306aa|down_9|NC_011729.1_2588406_2589324_-	pfam00004, AAA, ATPase family associated with various cellular activities (AAA)
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	18	2614729-2614830	16	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AGTCTTTTTAGTTAACAGTTAACAGT	26	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|133aa|up_7|NC_011729.1_2607562_2607961_+,NA|112aa|up_6|NC_011729.1_2608060_2608396_-,NA|113aa|up_1|NC_011729.1_2611036_2611375_+,NA|152aa|down_2|NC_011729.1_2617458_2617914_-	NA|558aa|up_9|NC_011729.1_2602481_2604155_-	sd00006, TPR, Tetratricopeptide repeat	NA|844aa|up_8|NC_011729.1_2604920_2607452_+	COG4449, COG4449, Predicted protease of the Abi (CAAX) family [General function prediction only]	NA|133aa|up_7|NC_011729.1_2607562_2607961_+	NA	NA|112aa|up_6|NC_011729.1_2608060_2608396_-	NA	NA|148aa|up_5|NC_011729.1_2608890_2609334_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|169aa|up_4|NC_011729.1_2609367_2609874_-	pfam09654, DUF2396, Protein of unknown function (DUF2396)	NA|183aa|up_3|NC_011729.1_2609989_2610538_+	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|42aa|up_2|NC_011729.1_2610604_2610730_+	pfam06298, PsbY, Photosystem II protein Y (PsbY)	NA|113aa|up_1|NC_011729.1_2611036_2611375_+	NA	NA|571aa|up_0|NC_011729.1_2611392_2613105_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|519aa|down_0|NC_011729.1_2614935_2616492_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|238aa|down_1|NC_011729.1_2616677_2617391_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|152aa|down_2|NC_011729.1_2617458_2617914_-	NA	NA|124aa|down_3|NC_011729.1_2617961_2618333_+	pfam02537, CRCB, CrcB-like protein, Camphor Resistance (CrcB)	NA|451aa|down_4|NC_011729.1_2618343_2619696_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|158aa|down_5|NC_011729.1_2619692_2620166_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|761aa|down_6|NC_011729.1_2620175_2622458_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|234aa|down_7|NC_011729.1_2622851_2623553_-	PRK01112, PRK01112, 2,3-bisphosphoglycerate-dependent phosphoglycerate mutase	NA|477aa|down_8|NC_011729.1_2623691_2625122_-	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]	NA|363aa|down_9|NC_011729.1_2625437_2626526_+	PLN02754, PLN02754, chorismate synthase
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	19	2726175-2729811	8,17,5	PILER-CR,CRISPRCasFinder,CRT	no	DinG,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type I-E	GTATTCCCCATGCTTGTGGGGGTGAACCG,GTATTCCCCATGCTTGTGGGGGTGAACCG,GTATTCCCCATGCTTGTGGGGGTGAACCG	29,29,29	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	59,59,59	59	TypeI-E	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|54aa|down_2|NC_011729.1_2730993_2731155_+,NA|126aa|down_3|NC_011729.1_2731254_2731632_+,NA|71aa|down_5|NC_011729.1_2732708_2732921_+,NA|78aa|down_8|NC_011729.1_2734132_2734366_+	DinG|835aa|up_9|NC_011729.1_2713321_2715826_+	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	WYL|324aa|up_8|NC_011729.1_2716274_2717246_+	pfam13280, WYL, WYL domain	cas3|909aa|up_7|NC_011729.1_2717260_2719987_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|526aa|up_6|NC_011729.1_2720016_2721594_+	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cse2gr11|172aa|up_5|NC_011729.1_2721590_2722106_+	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas7|481aa|up_4|NC_011729.1_2722126_2723569_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|215aa|up_3|NC_011729.1_2723568_2724213_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|220aa|up_2|NC_011729.1_2724199_2724859_+	pfam08798, CRISPR_assoc, CRISPR associated protein	cas1|304aa|up_1|NC_011729.1_2724965_2725877_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|93aa|up_0|NC_011729.1_2725882_2726161_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|205aa|down_0|NC_011729.1_2729835_2730450_-	cd05782, DNA_polB_like1_exo, Uncharacterized bacterial subgroup of the DEDDy 3'-5' exonuclease domain of family-B DNA polymerases	NA|59aa|down_1|NC_011729.1_2730680_2730857_-	pfam08814, XisH, XisH protein	NA|54aa|down_2|NC_011729.1_2730993_2731155_+	NA	NA|126aa|down_3|NC_011729.1_2731254_2731632_+	NA	NA|144aa|down_4|NC_011729.1_2731628_2732060_+	pfam01844, HNH, HNH endonuclease	NA|71aa|down_5|NC_011729.1_2732708_2732921_+	NA	NA|110aa|down_6|NC_011729.1_2732984_2733314_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|90aa|down_7|NC_011729.1_2733300_2733570_+	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|78aa|down_8|NC_011729.1_2734132_2734366_+	NA	NA|151aa|down_9|NC_011729.1_2734368_2734821_+	cd18694, PIN_VapC-like, uncharacterized subfamily of the VapC-like nuclease family of the PIN domain superfamily
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	20	2841764-2844496	9,18,6	PILER-CR,CRISPRCasFinder,CRT	no	RT	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	CTTTCTATTT--AATGAATCCTGGCAACGGGATTGAAAC,GTTTCAATCCCGTTGCCAGGATTCATTAAATAGAAAG,GTTTCAATCCCGTTGCCAGGATTCATTAAATAGAAAG	39,37,37	0	0	NA	NA	N:A	37,37,37	37	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|532aa|up_9|NC_011729.1_2831024_2832620_-,NA|154aa|up_4|NC_011729.1_2837404_2837866_-,NA|106aa|up_1|NC_011729.1_2840599_2840917_-,NA|82aa|down_3|NC_011729.1_2846243_2846489_-,NA|126aa|down_4|NC_011729.1_2846491_2846869_-	NA|532aa|up_9|NC_011729.1_2831024_2832620_-	NA	NA|322aa|up_8|NC_011729.1_2832773_2833739_-	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|507aa|up_7|NC_011729.1_2834159_2835680_-	cd11642, SUMT, Uroporphyrin-III C-methyltransferase (also known as S-Adenosyl-L-methionine:uroporphyrinogen III methyltransferase, SUMT)	NA|169aa|up_6|NC_011729.1_2835683_2836190_-	COG3788, COG3788, Uncharacterized relative of glutathione S-transferase, MAPEG superfamily [General function prediction only]	NA|214aa|up_5|NC_011729.1_2836653_2837295_+	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|154aa|up_4|NC_011729.1_2837404_2837866_-	NA	NA|365aa|up_3|NC_011729.1_2838425_2839520_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|221aa|up_2|NC_011729.1_2839607_2840270_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|106aa|up_1|NC_011729.1_2840599_2840917_-	NA	NA|152aa|up_0|NC_011729.1_2841188_2841644_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|221aa|down_0|NC_011729.1_2844726_2845389_-	PRK12567, PRK12567, putative monovalent cation/H+ antiporter subunit B; Reviewed	NA|181aa|down_1|NC_011729.1_2845388_2845931_-	PRK07377, PRK07377, hypothetical protein; Provisional	NA|97aa|down_2|NC_011729.1_2845923_2846214_-	pfam03334, PhaG_MnhG_YufB, Na+/H+ antiporter subunit	NA|82aa|down_3|NC_011729.1_2846243_2846489_-	NA	NA|126aa|down_4|NC_011729.1_2846491_2846869_-	NA	NA|484aa|down_5|NC_011729.1_2846865_2848317_-	PRK07234, PRK07234, putative monovalent cation/H+ antiporter subunit D; Reviewed	NA|112aa|down_6|NC_011729.1_2848313_2848649_-	PRK08389, PRK08389, putative monovalent cation/H+ antiporter subunit C; Reviewed	NA|133aa|down_7|NC_011729.1_2849157_2849556_+	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|301aa|down_8|NC_011729.1_2850201_2851104_-	pfam11927, DUF3445, Protein of unknown function (DUF3445)	NA|130aa|down_9|NC_011729.1_2851781_2852171_+	pfam01610, DDE_Tnp_ISL3, Transposase
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	21	2860565-2860657	19	CRISPRCasFinder	no	RT	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	CCCTTAAAAAGGGGGGCTTTAAA	23	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|78aa|down_1|NC_011729.1_2862226_2862460_-,NA|103aa|down_4|NC_011729.1_2864801_2865110_+,NA|156aa|down_7|NC_011729.1_2871007_2871475_+	NA|112aa|up_9|NC_011729.1_2848313_2848649_-	PRK08389, PRK08389, putative monovalent cation/H+ antiporter subunit C; Reviewed	NA|133aa|up_8|NC_011729.1_2849157_2849556_+	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|301aa|up_7|NC_011729.1_2850201_2851104_-	pfam11927, DUF3445, Protein of unknown function (DUF3445)	NA|130aa|up_6|NC_011729.1_2851781_2852171_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|682aa|up_5|NC_011729.1_2852294_2854340_-	cd01833, XynB_like, SGNH_hydrolase subfamily, similar to Ruminococcus flavefaciens XynB	NA|280aa|up_4|NC_011729.1_2854742_2855582_-	pfam03724, META, META domain	NA|607aa|up_3|NC_011729.1_2855948_2857769_+	TIGR03108, eps_aminotran_1, exosortase A system-associated amidotransferase 1	NA|291aa|up_2|NC_011729.1_2858056_2858929_-	PRK12896, PRK12896, methionine aminopeptidase; Reviewed	NA|148aa|up_1|NC_011729.1_2859113_2859557_-	cd04210, Cupredoxin_like_1, Uncharacterized Cupredoxin-like subfamily	NA|276aa|up_0|NC_011729.1_2859573_2860401_-	PRK07432, PRK07432, S-methyl-5'-thioadenosine phosphorylase	RT|403aa|down_0|NC_011729.1_2860791_2862000_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|78aa|down_1|NC_011729.1_2862226_2862460_-	NA	NA|246aa|down_2|NC_011729.1_2863031_2863769_-	PRK09065, PRK09065, glutamine amidotransferase; Provisional	NA|260aa|down_3|NC_011729.1_2863914_2864694_-	COG1127, Ttg2A, ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|103aa|down_4|NC_011729.1_2864801_2865110_+	NA	NA|361aa|down_5|NC_011729.1_2865576_2866659_+	pfam11300, DUF3102, Protein of unknown function (DUF3102)	NA|1189aa|down_6|NC_011729.1_2867328_2870895_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|156aa|down_7|NC_011729.1_2871007_2871475_+	NA	NA|435aa|down_8|NC_011729.1_2872163_2873468_+	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|133aa|down_9|NC_011729.1_2873505_2873904_-	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	22	2963414-2963520	20	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	ATGTCTTCAGGAGGAGATACTATTTCTGGTTCGATTAAATC	41	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|144aa|up_9|NC_011729.1_2951608_2952040_-,NA|85aa|up_3|NC_011729.1_2955817_2956072_-,NA|96aa|down_4|NC_011729.1_2968725_2969013_+,NA|111aa|down_7|NC_011729.1_2971322_2971655_+	NA|144aa|up_9|NC_011729.1_2951608_2952040_-	NA	NA|297aa|up_8|NC_011729.1_2952196_2953087_+	COG1619, LdcA, Uncharacterized proteins, homologs of microcin C7 resistance protein MccF [Defense mechanisms]	NA|75aa|up_7|NC_011729.1_2953101_2953326_-	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|70aa|up_6|NC_011729.1_2953329_2953539_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|354aa|up_5|NC_011729.1_2953684_2954746_+	PLN02433, PLN02433, uroporphyrinogen decarboxylase	NA|316aa|up_4|NC_011729.1_2954758_2955706_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|85aa|up_3|NC_011729.1_2955817_2956072_-	NA	NA|368aa|up_2|NC_011729.1_2956620_2957724_+	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|742aa|up_1|NC_011729.1_2957839_2960065_+	COG3408, GDB1, Glycogen debranching enzyme [Carbohydrate transport and metabolism]	NA|447aa|up_0|NC_011729.1_2960138_2961479_-	cd07217, Pat17_PNPLA8_PNPLA9_like4, Patatin-like phospholipase	NA|236aa|down_0|NC_011729.1_2964442_2965150_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|270aa|down_1|NC_011729.1_2965207_2966017_+	COG4241, COG4241, Predicted membrane protein [Function unknown]	NA|444aa|down_2|NC_011729.1_2966019_2967351_-	PLN02302, PLN02302, ent-kaurenoic acid oxidase	NA|371aa|down_3|NC_011729.1_2967411_2968524_+	PRK04447, PRK04447, hypothetical protein; Provisional	NA|96aa|down_4|NC_011729.1_2968725_2969013_+	NA	NA|215aa|down_5|NC_011729.1_2969027_2969672_+	pfam05685, Uma2, Putative restriction endonuclease	NA|464aa|down_6|NC_011729.1_2969748_2971140_+	cd01087, Prolidase, Prolidase	NA|111aa|down_7|NC_011729.1_2971322_2971655_+	NA	NA|136aa|down_8|NC_011729.1_2971881_2972289_+	pfam07784, DUF1622, Protein of unknown function (DUF1622)	NA|196aa|down_9|NC_011729.1_2972341_2972929_-	pfam01694, Rhomboid, Rhomboid family
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	23	2968615-2968723	21	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	TGCCCACCTTAACCCCAGTAACTCCTATTAACATCACAAAA	41	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|85aa|up_8|NC_011729.1_2955817_2956072_-,NA|96aa|down_0|NC_011729.1_2968725_2969013_+,NA|111aa|down_3|NC_011729.1_2971322_2971655_+,NA|65aa|down_6|NC_011729.1_2972947_2973142_-	NA|316aa|up_9|NC_011729.1_2954758_2955706_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|85aa|up_8|NC_011729.1_2955817_2956072_-	NA	NA|368aa|up_7|NC_011729.1_2956620_2957724_+	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|742aa|up_6|NC_011729.1_2957839_2960065_+	COG3408, GDB1, Glycogen debranching enzyme [Carbohydrate transport and metabolism]	NA|447aa|up_5|NC_011729.1_2960138_2961479_-	cd07217, Pat17_PNPLA8_PNPLA9_like4, Patatin-like phospholipase	NA|711aa|up_4|NC_011729.1_2962188_2964321_-	PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed	NA|236aa|up_3|NC_011729.1_2964442_2965150_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|270aa|up_2|NC_011729.1_2965207_2966017_+	COG4241, COG4241, Predicted membrane protein [Function unknown]	NA|444aa|up_1|NC_011729.1_2966019_2967351_-	PLN02302, PLN02302, ent-kaurenoic acid oxidase	NA|371aa|up_0|NC_011729.1_2967411_2968524_+	PRK04447, PRK04447, hypothetical protein; Provisional	NA|96aa|down_0|NC_011729.1_2968725_2969013_+	NA	NA|215aa|down_1|NC_011729.1_2969027_2969672_+	pfam05685, Uma2, Putative restriction endonuclease	NA|464aa|down_2|NC_011729.1_2969748_2971140_+	cd01087, Prolidase, Prolidase	NA|111aa|down_3|NC_011729.1_2971322_2971655_+	NA	NA|136aa|down_4|NC_011729.1_2971881_2972289_+	pfam07784, DUF1622, Protein of unknown function (DUF1622)	NA|196aa|down_5|NC_011729.1_2972341_2972929_-	pfam01694, Rhomboid, Rhomboid family	NA|65aa|down_6|NC_011729.1_2972947_2973142_-	NA	NA|434aa|down_7|NC_011729.1_2973213_2974515_+	PRK00877, hisD, bifunctional histidinal dehydrogenase/ histidinol dehydrogenase; Reviewed	NA|436aa|down_8|NC_011729.1_2974804_2976112_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|443aa|down_9|NC_011729.1_2976579_2977908_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	24	3083166-3083445	22	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	CCAAAAACATCATCTAAGGAAGGAG	25	0	0	NA	NA	N:A	4	4	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|550aa|down_3|NC_011729.1_3093292_3094942_+,NA|260aa|down_4|NC_011729.1_3095514_3096294_-,NA|105aa|down_5|NC_011729.1_3096650_3096965_-,NA|173aa|down_9|NC_011729.1_3101134_3101653_-	NA|394aa|up_9|NC_011729.1_3067236_3068418_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|93aa|up_8|NC_011729.1_3068507_3068786_+	cd05332, 11beta-HSD1_like_SDR_c, 11beta-hydroxysteroid dehydrogenase type 1 (11beta-HSD1)-like, classical (c) SDRs	NA|354aa|up_7|NC_011729.1_3068826_3069888_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|320aa|up_6|NC_011729.1_3069894_3070854_-	pfam12275, DUF3616, Protein of unknown function (DUF3616)	NA|458aa|up_5|NC_011729.1_3070872_3072246_-	pfam12899, Glyco_hydro_100, Alkaline and neutral invertase	NA|256aa|up_4|NC_011729.1_3072337_3073105_-	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|573aa|up_3|NC_011729.1_3073887_3075606_-	PRK09344, PRK09344, phosphoenolpyruvate carboxykinase	NA|967aa|up_2|NC_011729.1_3076294_3079195_+	PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated	NA|247aa|up_1|NC_011729.1_3079319_3080060_-	pfam13030, DUF3891, Protein of unknown function (DUF3891)	NA|161aa|up_0|NC_011729.1_3080401_3080884_-	pfam01584, CheW, CheW-like domain	NA|935aa|down_0|NC_011729.1_3086130_3088935_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|903aa|down_1|NC_011729.1_3089012_3091721_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|187aa|down_2|NC_011729.1_3092143_3092704_-	pfam07963, N_methyl, Prokaryotic N-terminal methylation motif	NA|550aa|down_3|NC_011729.1_3093292_3094942_+	NA	NA|260aa|down_4|NC_011729.1_3095514_3096294_-	NA	NA|105aa|down_5|NC_011729.1_3096650_3096965_-	NA	NA|183aa|down_6|NC_011729.1_3098266_3098815_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|122aa|down_7|NC_011729.1_3098831_3099197_-	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|398aa|down_8|NC_011729.1_3099274_3100468_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|173aa|down_9|NC_011729.1_3101134_3101653_-	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	25	3178831-3178966	23	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GCTTCCTCGATGGATACTTTTGTCAGATGAGCAACCAGTT	40	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|182aa|up_3|NC_011729.1_3176098_3176644_-,NA|76aa|up_0|NC_011729.1_3177819_3178047_-,NA|166aa|down_7|NC_011729.1_3190471_3190969_-	NA|557aa|up_9|NC_011729.1_3166996_3168667_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|98aa|up_8|NC_011729.1_3169314_3169608_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|670aa|up_7|NC_011729.1_3170071_3172081_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|79aa|up_6|NC_011729.1_3172183_3172420_+	PRK00359, rpmB, 50S ribosomal protein L28; Reviewed	NA|725aa|up_5|NC_011729.1_3172991_3175166_+	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|264aa|up_4|NC_011729.1_3175338_3176130_+	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase	NA|182aa|up_3|NC_011729.1_3176098_3176644_-	NA	NA|85aa|up_2|NC_011729.1_3177106_3177361_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|137aa|up_1|NC_011729.1_3177416_3177827_-	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|76aa|up_0|NC_011729.1_3177819_3178047_-	NA	NA|337aa|down_0|NC_011729.1_3182013_3183024_-	pfam01972, SDH_sah, Serine dehydrogenase proteinase	NA|444aa|down_1|NC_011729.1_3183332_3184664_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|389aa|down_2|NC_011729.1_3184794_3185961_+	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|386aa|down_3|NC_011729.1_3185985_3187143_+	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|249aa|down_4|NC_011729.1_3187241_3187988_+	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|234aa|down_5|NC_011729.1_3188017_3188719_+	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|470aa|down_6|NC_011729.1_3188913_3190323_-	COG2837, COG2837, Predicted iron-dependent peroxidase [Inorganic ion transport and metabolism]	NA|166aa|down_7|NC_011729.1_3190471_3190969_-	NA	NA|239aa|down_8|NC_011729.1_3191548_3192265_-	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|150aa|down_9|NC_011729.1_3192432_3192882_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	26	3575124-3575742	10,24,7	PILER-CR,CRISPRCasFinder,CRT	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GTTACAATTAAAATGAATCCCTATTA-GGGATTGAAAC,GTTACAATTAAAATGAATCCCTATTAGGGATTGAAAC,GTTACAATTAAAATGAATCCCTATTAGGGATTGAAAC	38,37,37	0	0	NA	NA	N:A	8,8,8	8	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|136aa|up_7|NC_011729.1_3562559_3562967_+,NA|124aa|up_6|NC_011729.1_3563029_3563401_+,NA|84aa|down_1|NC_011729.1_3580127_3580379_+,NA|344aa|down_8|NC_011729.1_3586336_3587368_-,NA|382aa|down_9|NC_011729.1_3587519_3588665_-	NA|183aa|up_9|NC_011729.1_3560063_3560612_+	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass	NA|525aa|up_8|NC_011729.1_3560906_3562481_+	PRK14971, PRK14971, DNA polymerase III subunit gamma/tau	NA|136aa|up_7|NC_011729.1_3562559_3562967_+	NA	NA|124aa|up_6|NC_011729.1_3563029_3563401_+	NA	NA|915aa|up_5|NC_011729.1_3563354_3566099_+	COG3451, VirB4, Type IV secretory pathway, VirB4 components [Intracellular trafficking and secretion]	NA|769aa|up_4|NC_011729.1_3566425_3568732_+	cd04277, ZnMc_serralysin_like, Zinc-dependent metalloprotease, serralysin_like subfamily	NA|495aa|up_3|NC_011729.1_3568936_3570421_+	TIGR00387, Glycolate_oxidase_subunit_glcD	NA|433aa|up_2|NC_011729.1_3570426_3571725_+	COG3597, COG3597, Uncharacterized protein/domain associated with GTPases [Function unknown]	NA|324aa|up_1|NC_011729.1_3572817_3573789_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|359aa|up_0|NC_011729.1_3573776_3574853_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1029aa|down_0|NC_011729.1_3576397_3579484_-	COG0474, MgtA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|84aa|down_1|NC_011729.1_3580127_3580379_+	NA	NA|117aa|down_2|NC_011729.1_3580437_3580788_-	COG1939, COG1939, Ribonuclease III family protein [Replication, recombination, and    repair]	NA|133aa|down_3|NC_011729.1_3581596_3581995_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|380aa|down_4|NC_011729.1_3582108_3583248_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|336aa|down_5|NC_011729.1_3583441_3584449_+	cd00569, HTH_Hin_like, Helix-turn-helix domain of Hin and related proteins	NA|295aa|down_6|NC_011729.1_3584873_3585758_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|109aa|down_7|NC_011729.1_3585758_3586085_+	pfam03091, CutA1, CutA1 divalent ion tolerance protein	NA|344aa|down_8|NC_011729.1_3586336_3587368_-	NA	NA|382aa|down_9|NC_011729.1_3587519_3588665_-	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	27	3638122-3638219	25	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AAGGCGGGCTAATTTTGGGATAA	23	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|281aa|up_8|NC_011729.1_3624866_3625709_+,NA|258aa|up_4|NC_011729.1_3630863_3631637_-,NA|78aa|up_2|NC_011729.1_3634836_3635070_-,NA|71aa|up_1|NC_011729.1_3635201_3635414_-,NA|485aa|down_0|NC_011729.1_3638570_3640025_-	NA|586aa|up_9|NC_011729.1_3623071_3624829_+	PLN02286, PLN02286, arginine-tRNA ligase	NA|281aa|up_8|NC_011729.1_3624866_3625709_+	NA	NA|568aa|up_7|NC_011729.1_3625894_3627598_-	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|302aa|up_6|NC_011729.1_3627868_3628774_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|602aa|up_5|NC_011729.1_3628976_3630782_-	PRK07431, PRK07431, aspartate kinase; Provisional	NA|258aa|up_4|NC_011729.1_3630863_3631637_-	NA	NA|948aa|up_3|NC_011729.1_3631827_3634671_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|78aa|up_2|NC_011729.1_3634836_3635070_-	NA	NA|71aa|up_1|NC_011729.1_3635201_3635414_-	NA	NA|733aa|up_0|NC_011729.1_3635825_3638024_+	COG3957, COG3957, Phosphoketolase [Carbohydrate transport and metabolism]	NA|485aa|down_0|NC_011729.1_3638570_3640025_-	NA	NA|135aa|down_1|NC_011729.1_3640331_3640736_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|320aa|down_2|NC_011729.1_3640752_3641712_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|410aa|down_3|NC_011729.1_3641853_3643083_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|868aa|down_4|NC_011729.1_3643457_3646061_+	pfam16095, COR, C-terminal of Roc, COR, domain	NA|301aa|down_5|NC_011729.1_3646093_3646996_-	PRK13057, PRK13057, lipid kinase	NA|570aa|down_6|NC_011729.1_3647047_3648757_-	pfam00498, FHA, FHA domain	NA|439aa|down_7|NC_011729.1_3648851_3650168_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|427aa|down_8|NC_011729.1_3650623_3651904_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|512aa|down_9|NC_011729.1_3652087_3653623_-	PRK05940, PRK05940, anthranilate synthase component I
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	28	3643533-3643694	26	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AATCAACTGACAACTTTACCCCC	23	0	0	NA	NA	N:A	2	2	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|258aa|up_8|NC_011729.1_3630863_3631637_-,NA|78aa|up_6|NC_011729.1_3634836_3635070_-,NA|71aa|up_5|NC_011729.1_3635201_3635414_-,NA|485aa|up_3|NC_011729.1_3638570_3640025_-,NA|112aa|down_5|NC_011729.1_3654108_3654444_+,NA|64aa|down_6|NC_011729.1_3655342_3655534_+,NA|136aa|down_9|NC_011729.1_3656819_3657227_+	NA|602aa|up_9|NC_011729.1_3628976_3630782_-	PRK07431, PRK07431, aspartate kinase; Provisional	NA|258aa|up_8|NC_011729.1_3630863_3631637_-	NA	NA|948aa|up_7|NC_011729.1_3631827_3634671_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|78aa|up_6|NC_011729.1_3634836_3635070_-	NA	NA|71aa|up_5|NC_011729.1_3635201_3635414_-	NA	NA|733aa|up_4|NC_011729.1_3635825_3638024_+	COG3957, COG3957, Phosphoketolase [Carbohydrate transport and metabolism]	NA|485aa|up_3|NC_011729.1_3638570_3640025_-	NA	NA|135aa|up_2|NC_011729.1_3640331_3640736_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|320aa|up_1|NC_011729.1_3640752_3641712_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|410aa|up_0|NC_011729.1_3641853_3643083_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|301aa|down_0|NC_011729.1_3646093_3646996_-	PRK13057, PRK13057, lipid kinase	NA|570aa|down_1|NC_011729.1_3647047_3648757_-	pfam00498, FHA, FHA domain	NA|439aa|down_2|NC_011729.1_3648851_3650168_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|427aa|down_3|NC_011729.1_3650623_3651904_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|512aa|down_4|NC_011729.1_3652087_3653623_-	PRK05940, PRK05940, anthranilate synthase component I	NA|112aa|down_5|NC_011729.1_3654108_3654444_+	NA	NA|64aa|down_6|NC_011729.1_3655342_3655534_+	NA	NA|63aa|down_7|NC_011729.1_3655625_3655814_+	PRK02576, psbZ, photosystem II reaction center protein PsbZ	NA|201aa|down_8|NC_011729.1_3655910_3656513_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|136aa|down_9|NC_011729.1_3656819_3657227_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	29	3643878-3644383	27	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AATCAACTGACAACTTTACCCCC	23	0	0	NA	NA	N:A	7	7	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|258aa|up_8|NC_011729.1_3630863_3631637_-,NA|78aa|up_6|NC_011729.1_3634836_3635070_-,NA|71aa|up_5|NC_011729.1_3635201_3635414_-,NA|485aa|up_3|NC_011729.1_3638570_3640025_-,NA|112aa|down_5|NC_011729.1_3654108_3654444_+,NA|64aa|down_6|NC_011729.1_3655342_3655534_+,NA|136aa|down_9|NC_011729.1_3656819_3657227_+	NA|602aa|up_9|NC_011729.1_3628976_3630782_-	PRK07431, PRK07431, aspartate kinase; Provisional	NA|258aa|up_8|NC_011729.1_3630863_3631637_-	NA	NA|948aa|up_7|NC_011729.1_3631827_3634671_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|78aa|up_6|NC_011729.1_3634836_3635070_-	NA	NA|71aa|up_5|NC_011729.1_3635201_3635414_-	NA	NA|733aa|up_4|NC_011729.1_3635825_3638024_+	COG3957, COG3957, Phosphoketolase [Carbohydrate transport and metabolism]	NA|485aa|up_3|NC_011729.1_3638570_3640025_-	NA	NA|135aa|up_2|NC_011729.1_3640331_3640736_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|320aa|up_1|NC_011729.1_3640752_3641712_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|410aa|up_0|NC_011729.1_3641853_3643083_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|301aa|down_0|NC_011729.1_3646093_3646996_-	PRK13057, PRK13057, lipid kinase	NA|570aa|down_1|NC_011729.1_3647047_3648757_-	pfam00498, FHA, FHA domain	NA|439aa|down_2|NC_011729.1_3648851_3650168_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|427aa|down_3|NC_011729.1_3650623_3651904_+	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|512aa|down_4|NC_011729.1_3652087_3653623_-	PRK05940, PRK05940, anthranilate synthase component I	NA|112aa|down_5|NC_011729.1_3654108_3654444_+	NA	NA|64aa|down_6|NC_011729.1_3655342_3655534_+	NA	NA|63aa|down_7|NC_011729.1_3655625_3655814_+	PRK02576, psbZ, photosystem II reaction center protein PsbZ	NA|201aa|down_8|NC_011729.1_3655910_3656513_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|136aa|down_9|NC_011729.1_3656819_3657227_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	30	3678371-3678478	28	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GTTTATTTATGCCCACCTACTTAGT	25	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|301aa|up_8|NC_011729.1_3669552_3670455_-,NA|353aa|down_0|NC_011729.1_3678736_3679795_+,NA|257aa|down_2|NC_011729.1_3682167_3682938_-	NA|641aa|up_9|NC_011729.1_3667626_3669549_-	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|301aa|up_8|NC_011729.1_3669552_3670455_-	NA	NA|329aa|up_7|NC_011729.1_3670600_3671587_-	TIGR03466, HpnA, hopanoid-associated sugar epimerase	NA|278aa|up_6|NC_011729.1_3671650_3672484_+	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|198aa|up_5|NC_011729.1_3672984_3673578_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|395aa|up_4|NC_011729.1_3673641_3674826_-	pfam02374, ArsA_ATPase, Anion-transporting ATPase	NA|297aa|up_3|NC_011729.1_3674885_3675776_-	TIGR04262, possible_ABC_transporter_solute_binding_protein, extracellular substrate-binding orphan protein, GRRM family	NA|134aa|up_2|NC_011729.1_3675912_3676314_+	TIGR04260, hypothetical_protein, rSAM-associated Gly-rich repeat protein	NA|127aa|up_1|NC_011729.1_3676568_3676949_+	TIGR04260, hypothetical_protein, rSAM-associated Gly-rich repeat protein	NA|381aa|up_0|NC_011729.1_3677062_3678205_+	TIGR04261, putative_arylsulfatase_regulatory_protein, radical SAM/SPASM domain protein, GRRM system	NA|353aa|down_0|NC_011729.1_3678736_3679795_+	NA	NA|723aa|down_1|NC_011729.1_3679853_3682022_-	pfam00263, Secretin, Bacterial type II and III secretion system protein	NA|257aa|down_2|NC_011729.1_3682167_3682938_-	NA	NA|262aa|down_3|NC_011729.1_3682934_3683720_-	COG3166, PilN, Tfp pilus assembly protein PilN [Cell motility and secretion / Intracellular trafficking and secretion]	NA|369aa|down_4|NC_011729.1_3683722_3684829_-	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]	NA|429aa|down_5|NC_011729.1_3685349_3686636_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|424aa|down_6|NC_011729.1_3686904_3688176_+	pfam05673, DUF815, Protein of unknown function (DUF815)	NA|297aa|down_7|NC_011729.1_3688183_3689074_-	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|327aa|down_8|NC_011729.1_3689421_3690402_-	cd01051, Mn_catalase, Manganese catalase, ferritin-like diiron-binding domain	NA|176aa|down_9|NC_011729.1_3691090_3691618_-	cd09916, CpxP_like, CpxP component of the bacterial Cpx-two-component system and related proteins
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	31	3750957-3751074	29	CRISPRCasFinder	no	csa3,RT	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type I-A	TAAAAGTAGAACGTAGGTTGGGTTGAGGAACGAAACCCAACA	42	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|129aa|up_3|NC_011729.1_3748350_3748737_-,NA|143aa|up_2|NC_011729.1_3748838_3749267_-,NA|115aa|up_0|NC_011729.1_3750421_3750766_+,NA|204aa|down_0|NC_011729.1_3751325_3751937_-,NA|405aa|down_4|NC_011729.1_3755398_3756613_-,NA|319aa|down_6|NC_011729.1_3759530_3760487_+	NA|411aa|up_9|NC_011729.1_3742264_3743497_-	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|273aa|up_8|NC_011729.1_3743741_3744560_+	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|248aa|up_7|NC_011729.1_3744831_3745575_-	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|243aa|up_6|NC_011729.1_3745634_3746363_-	PRK07454, PRK07454, SDR family oxidoreductase	NA|170aa|up_5|NC_011729.1_3746709_3747219_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|359aa|up_4|NC_011729.1_3747244_3748321_-	COG0857, Pta, BioD-like N-terminal domain of phosphotransacetylase [General function prediction only]	NA|129aa|up_3|NC_011729.1_3748350_3748737_-	NA	NA|143aa|up_2|NC_011729.1_3748838_3749267_-	NA	NA|319aa|up_1|NC_011729.1_3749355_3750312_+	PRK11815, PRK11815, tRNA dihydrouridine(20/20a) synthase DusA	NA|115aa|up_0|NC_011729.1_3750421_3750766_+	NA	NA|204aa|down_0|NC_011729.1_3751325_3751937_-	NA	NA|101aa|down_1|NC_011729.1_3752189_3752492_-	pfam17195, DUF5132, Protein of unknown function (DUF5132)	NA|102aa|down_2|NC_011729.1_3752747_3753053_-	pfam17195, DUF5132, Protein of unknown function (DUF5132)	NA|756aa|down_3|NC_011729.1_3753070_3755338_-	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|405aa|down_4|NC_011729.1_3755398_3756613_-	NA	NA|827aa|down_5|NC_011729.1_3756855_3759336_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|319aa|down_6|NC_011729.1_3759530_3760487_+	NA	NA|161aa|down_7|NC_011729.1_3760478_3760961_-	pfam09557, DUF2382, Domain of unknown function (DUF2382)	NA|293aa|down_8|NC_011729.1_3761075_3761954_-	COG3861, COG3861, Uncharacterized protein conserved in bacteria [Function unknown]	NA|746aa|down_9|NC_011729.1_3763446_3765684_-	TIGR01701, Hypothetical_protein_Rv2900c/MT2968/Mb2924c
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	32	3796369-3797432	11,30,8	PILER-CR,CRISPRCasFinder,CRT	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GTTTCCAACAAGTCCTATTCAACCCAATAGGTAGGG,GTTTCCAACAAGTCCTATTCAACCCAATAGGTAGGG,GTTTCCAACAAGTCCTATTCAACCCAATAGGTAGGG	36,36,36	0	0	NA	NA	N:A	14,14,14	14	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|115aa|down_0|NC_011729.1_3797880_3798225_+,NA|93aa|down_4|NC_011729.1_3801013_3801292_-	NA|936aa|up_9|NC_011729.1_3783583_3786391_-	COG0474, MgtA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|180aa|up_8|NC_011729.1_3787285_3787825_-	COG2891, MreD, Cell shape-determining protein [Cell envelope biogenesis, outer membrane]	NA|258aa|up_7|NC_011729.1_3787817_3788591_-	PRK13922, PRK13922, rod shape-determining protein MreC; Provisional	NA|353aa|up_6|NC_011729.1_3788631_3789690_-	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|124aa|up_5|NC_011729.1_3789930_3790302_+	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|379aa|up_4|NC_011729.1_3790305_3791442_+	cd03804, GT4_WbaZ-like, mannosyltransferase WbaZ and similar proteins	NA|244aa|up_3|NC_011729.1_3791537_3792269_+	pfam02397, Bac_transf, Bacterial sugar transferase	NA|361aa|up_2|NC_011729.1_3792672_3793755_+	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|313aa|up_1|NC_011729.1_3793790_3794729_+	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|280aa|up_0|NC_011729.1_3794973_3795813_-	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|115aa|down_0|NC_011729.1_3797880_3798225_+	NA	NA|81aa|down_1|NC_011729.1_3798384_3798627_-	pfam14217, DUF4327, Domain of unknown function (DUF4327)	NA|81aa|down_2|NC_011729.1_3798952_3799195_-	pfam14217, DUF4327, Domain of unknown function (DUF4327)	NA|346aa|down_3|NC_011729.1_3799792_3800830_+	cd00997, PBP2_GluR0, Bacterial GluR0 ligand-binding domain; the type 2 periplasmic binding protein fold	NA|93aa|down_4|NC_011729.1_3801013_3801292_-	NA	NA|257aa|down_5|NC_011729.1_3801374_3802145_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|877aa|down_6|NC_011729.1_3802221_3804852_-	PRK09532, PRK09532, DNA polymerase III subunit alpha; Reviewed	NA|277aa|down_7|NC_011729.1_3805014_3805845_-	cd13688, PBP2_GltI_DEBP, Substrate-binding domain of ABC aspartate-glutamate transporter; the type 2 periplasmic binding protein fold	NA|223aa|down_8|NC_011729.1_3806237_3806906_+	pfam13643, DUF4145, Domain of unknown function (DUF4145)	NA|214aa|down_9|NC_011729.1_3807528_3808170_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	33	3838452-3838538	31	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	CATTAGTCAGTAGTCATTAGTCGT	24	1	3	3838476-3838514|3838476-3838514|3838476-3838514	NC_011729.1_42641-42603|NC_011729.1_1233952-1233914|NC_011729.1_1512391-1512429	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|169aa|up_4|NC_011729.1_3832530_3833037_+,NA|231aa|up_3|NC_011729.1_3833045_3833738_-,NA|63aa|up_2|NC_011729.1_3833991_3834180_-,NA|69aa|down_2|NC_011729.1_3842292_3842499_-,NA|227aa|down_6|NC_011729.1_3845629_3846310_+	NA|491aa|up_9|NC_011729.1_3826187_3827660_-	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|242aa|up_8|NC_011729.1_3827738_3828464_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|305aa|up_7|NC_011729.1_3828620_3829535_+	PLN02953, PLN02953, phosphatidate cytidylyltransferase	NA|513aa|up_6|NC_011729.1_3829828_3831367_-	PRK00302, lnt, apolipoprotein N-acyltransferase; Reviewed	NA|288aa|up_5|NC_011729.1_3831472_3832336_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|169aa|up_4|NC_011729.1_3832530_3833037_+	NA	NA|231aa|up_3|NC_011729.1_3833045_3833738_-	NA	NA|63aa|up_2|NC_011729.1_3833991_3834180_-	NA	NA|873aa|up_1|NC_011729.1_3834277_3836896_-	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|372aa|up_0|NC_011729.1_3837272_3838388_-	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|271aa|down_0|NC_011729.1_3838653_3839466_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|602aa|down_1|NC_011729.1_3840105_3841911_-	PRK06354, PRK06354, pyruvate kinase; Provisional	NA|69aa|down_2|NC_011729.1_3842292_3842499_-	NA	NA|398aa|down_3|NC_011729.1_3842622_3843816_-	pfam01594, AI-2E_transport, AI-2E family transporter	NA|117aa|down_4|NC_011729.1_3843850_3844201_-	COG2146, {NirD}, Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only]	NA|340aa|down_5|NC_011729.1_3844388_3845408_-	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|227aa|down_6|NC_011729.1_3845629_3846310_+	NA	NA|207aa|down_7|NC_011729.1_3846674_3847295_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|338aa|down_8|NC_011729.1_3847865_3848879_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|69aa|down_9|NC_011729.1_3849075_3849282_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	34	3879424-3879536	32	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	TCTGCGGAGCAGGGACGAAGTCTAACCCAACACCCGATTTAG	42	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|458aa|up_7|NC_011729.1_3871548_3872922_+,NA|58aa|up_5|NC_011729.1_3875332_3875506_-,NA|84aa|up_4|NC_011729.1_3875619_3875871_-,NA|296aa|up_0|NC_011729.1_3878474_3879362_-,NA|184aa|down_0|NC_011729.1_3879642_3880194_+,NA|83aa|down_4|NC_011729.1_3882926_3883175_+,NA|67aa|down_5|NC_011729.1_3883197_3883398_+	NA|361aa|up_9|NC_011729.1_3869026_3870109_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|229aa|up_8|NC_011729.1_3870454_3871141_-	TIGR04283, glycosyl_transferase_family_2, transferase 2, rSAM/selenodomain-associated	NA|458aa|up_7|NC_011729.1_3871548_3872922_+	NA	NA|782aa|up_6|NC_011729.1_3872954_3875300_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|58aa|up_5|NC_011729.1_3875332_3875506_-	NA	NA|84aa|up_4|NC_011729.1_3875619_3875871_-	NA	NA|120aa|up_3|NC_011729.1_3875898_3876258_-	pfam10664, NdhM, Cyanobacterial and plastid NDH-1 subunit M	NA|419aa|up_2|NC_011729.1_3876365_3877622_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|250aa|up_1|NC_011729.1_3877690_3878440_-	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|296aa|up_0|NC_011729.1_3878474_3879362_-	NA	NA|184aa|down_0|NC_011729.1_3879642_3880194_+	NA	NA|491aa|down_1|NC_011729.1_3880259_3881732_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|113aa|down_2|NC_011729.1_3881929_3882268_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|113aa|down_3|NC_011729.1_3882254_3882593_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|83aa|down_4|NC_011729.1_3882926_3883175_+	NA	NA|67aa|down_5|NC_011729.1_3883197_3883398_+	NA	NA|82aa|down_6|NC_011729.1_3883440_3883686_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|329aa|down_7|NC_011729.1_3884517_3885504_-	PRK07400, PRK07400, 30S ribosomal protein S1; Reviewed	NA|181aa|down_8|NC_011729.1_3885826_3886369_-	PRK00464, nrdR, transcriptional repressor NrdR	NA|32aa|down_9|NC_011729.1_3886524_3886620_-	PRK11875, psbT, photosystem II reaction center protein T; Reviewed
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	35	4019765-4021254	12,33,9	PILER-CR,CRISPRCasFinder,CRT	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GTTTCCAACAATTCCTATTCAACCCAATAGGTAGGG,CCCTACCTATTGGGTTGAATAGGAATTGTTGGAAAC,CCCTACCTATTGGGTTGAATAGGAATTGTTGGAAAC	36,36,36	0	0	NA	NA	N:A	19,20,20	20	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA	NA|349aa|up_9|NC_011729.1_4001066_4002113_+	COG3367, COG3367, Uncharacterized conserved protein [Function unknown]	NA|662aa|up_8|NC_011729.1_4002176_4004162_+	cd11324, AmyAc_Amylosucrase, Alpha amylase catalytic domain found in Amylosucrase	NA|876aa|up_7|NC_011729.1_4004294_4006922_+	cd01031, EriC, ClC chloride channel EriC	NA|304aa|up_6|NC_011729.1_4007096_4008008_-	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|809aa|up_5|NC_011729.1_4008379_4010806_-	cd08162, MPP_PhoA_N, Synechococcus sp	NA|1165aa|up_4|NC_011729.1_4011343_4014838_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|554aa|up_3|NC_011729.1_4014938_4016600_+	cd06905, M14-like, Peptidase M14-like domain; uncharacterized subfamily	NA|183aa|up_2|NC_011729.1_4016766_4017315_-	pfam03358, FMN_red, NADPH-dependent FMN reductase	NA|448aa|up_1|NC_011729.1_4017746_4019090_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|137aa|up_0|NC_011729.1_4019256_4019667_-	pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen	NA|251aa|down_0|NC_011729.1_4021555_4022308_-	pfam02633, Creatininase, Creatinine amidohydrolase	NA|251aa|down_1|NC_011729.1_4022365_4023118_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|456aa|down_2|NC_011729.1_4023208_4024576_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|down_3|NC_011729.1_4024644_4025313_+	COG0704, PhoU, Phosphate uptake regulator [Inorganic ion transport and metabolism]	NA|594aa|down_4|NC_011729.1_4025753_4027535_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|592aa|down_5|NC_011729.1_4028123_4029899_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|296aa|down_6|NC_011729.1_4029996_4030884_+	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|289aa|down_7|NC_011729.1_4030873_4031740_+	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|842aa|down_8|NC_011729.1_4031901_4034427_+	COG1221, PspF, Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain [Transcription / Signal transduction mechanisms]	NA|124aa|down_9|NC_011729.1_4034791_4035163_+	pfam02427, PSI_PsaE, Photosystem I reaction centre subunit IV / PsaE
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	36	4108705-4108815	34	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	AGTATTAGACTAATTGTAGGGTGGGCATTGCCCACCTTAT	40	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|730aa|up_4|NC_011729.1_4097348_4099538_+,NA|323aa|up_3|NC_011729.1_4099812_4100781_+,NA|244aa|down_0|NC_011729.1_4108927_4109659_-,NA|121aa|down_9|NC_011729.1_4118981_4119344_+	NA|281aa|up_9|NC_011729.1_4092413_4093256_-	pfam12974, Phosphonate-bd, ABC transporter, phosphonate, periplasmic substrate-binding protein	NA|302aa|up_8|NC_011729.1_4093344_4094250_+	pfam11845, DUF3365, Protein of unknown function (DUF3365)	NA|161aa|up_7|NC_011729.1_4094431_4094914_+	TIGR00281, Segregation_and_condensation_protein_B, segregation and condensation protein B	NA|110aa|up_6|NC_011729.1_4095005_4095335_+	pfam05542, DUF760, Protein of unknown function (DUF760)	NA|458aa|up_5|NC_011729.1_4095861_4097235_-	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|730aa|up_4|NC_011729.1_4097348_4099538_+	NA	NA|323aa|up_3|NC_011729.1_4099812_4100781_+	NA	NA|91aa|up_2|NC_011729.1_4100994_4101267_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|1263aa|up_1|NC_011729.1_4101273_4105062_-	cd01948, EAL, EAL domain	NA|888aa|up_0|NC_011729.1_4105974_4108638_+	PRK13805, PRK13805, bifunctional acetaldehyde-CoA/alcohol dehydrogenase; Provisional	NA|244aa|down_0|NC_011729.1_4108927_4109659_-	NA	NA|364aa|down_1|NC_011729.1_4110057_4111149_+	pfam00180, Iso_dh, Isocitrate/isopropylmalate dehydrogenase	NA|246aa|down_2|NC_011729.1_4111211_4111949_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|196aa|down_3|NC_011729.1_4111988_4112576_-	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|671aa|down_4|NC_011729.1_4112811_4114824_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|80aa|down_5|NC_011729.1_4115014_4115254_-	pfam01455, HupF_HypC, HupF/HypC family	NA|781aa|down_6|NC_011729.1_4115257_4117600_-	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|87aa|down_7|NC_011729.1_4117980_4118241_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|87aa|down_8|NC_011729.1_4118412_4118673_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|121aa|down_9|NC_011729.1_4118981_4119344_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	37	4545862-4545951	35	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	TTAAAACCCAGACCAAGCAGCGATCGCC	28	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|125aa|up_9|NC_011729.1_4533650_4534025_-,NA|234aa|up_3|NC_011729.1_4541141_4541843_-,NA|154aa|up_2|NC_011729.1_4541839_4542301_-,NA|61aa|down_0|NC_011729.1_4546528_4546711_-,NA|141aa|down_2|NC_011729.1_4548601_4549024_+,NA|239aa|down_7|NC_011729.1_4555098_4555815_-	NA|125aa|up_9|NC_011729.1_4533650_4534025_-	NA	NA|204aa|up_8|NC_011729.1_4534084_4534696_-	TIGR04211, hypothetical_protein, SH3 domain protein	NA|929aa|up_7|NC_011729.1_4534776_4537563_-	smart00065, GAF, Domain present in phytochromes and cGMP-specific phosphodiesterases	NA|169aa|up_6|NC_011729.1_4537727_4538234_+	COG2018, COG2018, Uncharacterized distant relative of homeotic protein bithoraxoid [General function prediction only]	NA|607aa|up_5|NC_011729.1_4538314_4540135_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|222aa|up_4|NC_011729.1_4540253_4540919_-	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|234aa|up_3|NC_011729.1_4541141_4541843_-	NA	NA|154aa|up_2|NC_011729.1_4541839_4542301_-	NA	NA|628aa|up_1|NC_011729.1_4542306_4544190_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|478aa|up_0|NC_011729.1_4544241_4545675_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|61aa|down_0|NC_011729.1_4546528_4546711_-	NA	NA|313aa|down_1|NC_011729.1_4547082_4548021_-	pfam00805, Pentapeptide, Pentapeptide repeats (8 copies)	NA|141aa|down_2|NC_011729.1_4548601_4549024_+	NA	NA|149aa|down_3|NC_011729.1_4549242_4549689_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|158aa|down_4|NC_011729.1_4549695_4550169_-	TIGR04526, predic_Ig_block, putative immunoglobulin-blocking virulence protein	NA|207aa|down_5|NC_011729.1_4550260_4550881_-	pfam05685, Uma2, Putative restriction endonuclease	NA|1237aa|down_6|NC_011729.1_4551089_4554800_+	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|239aa|down_7|NC_011729.1_4555098_4555815_-	NA	NA|169aa|down_8|NC_011729.1_4556315_4556822_+	pfam14516, AAA_35, AAA-like domain	NA|244aa|down_9|NC_011729.1_4556830_4557562_+	PRK00347, PRK00347, DNA/RNA nuclease SfsA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	38	4756609-4756730	36	CRISPRCasFinder	no	csx15	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	CTGTTAACTGTTCACTGTTCACTAAAGAAGGTGCGTTACGCTACGC	46	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|82aa|up_9|NC_011729.1_4743350_4743596_-,NA|73aa|down_4|NC_011729.1_4761132_4761351_-,NA|92aa|down_5|NC_011729.1_4761382_4761658_-,NA|56aa|down_6|NC_011729.1_4761719_4761887_-,NA|67aa|down_7|NC_011729.1_4761955_4762156_-,NA|98aa|down_9|NC_011729.1_4763190_4763484_-	NA|82aa|up_9|NC_011729.1_4743350_4743596_-	NA	NA|668aa|up_8|NC_011729.1_4743849_4745853_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|406aa|up_7|NC_011729.1_4746022_4747240_-	PLN00093, PLN00093, geranylgeranyl diphosphate reductase; Provisional	NA|861aa|up_6|NC_011729.1_4747506_4750089_+	COG0308, PepN, Aminopeptidase N [Amino acid transport and metabolism]	NA|235aa|up_5|NC_011729.1_4750493_4751198_-	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|529aa|up_4|NC_011729.1_4751545_4753132_+	pfam07693, KAP_NTPase, KAP family P-loop domain	NA|158aa|up_3|NC_011729.1_4753366_4753840_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|288aa|up_2|NC_011729.1_4753843_4754707_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|141aa|up_1|NC_011729.1_4754706_4755129_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|363aa|up_0|NC_011729.1_4755157_4756246_-	cd05305, L-AlaDH, Alanine dehydrogenase NAD-binding and catalytic domains	NA|228aa|down_0|NC_011729.1_4756853_4757537_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|463aa|down_1|NC_011729.1_4757536_4758925_+	TIGR01386, Probable_sensor_protein_PcoS, heavy metal sensor kinase	NA|139aa|down_2|NC_011729.1_4760181_4760598_-	pfam13676, TIR_2, TIR domain	NA|138aa|down_3|NC_011729.1_4760620_4761034_-	pfam13676, TIR_2, TIR domain	NA|73aa|down_4|NC_011729.1_4761132_4761351_-	NA	NA|92aa|down_5|NC_011729.1_4761382_4761658_-	NA	NA|56aa|down_6|NC_011729.1_4761719_4761887_-	NA	NA|67aa|down_7|NC_011729.1_4761955_4762156_-	NA	csx15|301aa|down_8|NC_011729.1_4762194_4763097_-	cd09766, Csx15_I-U, CRISPR/Cas system-associated protein Csx15	NA|98aa|down_9|NC_011729.1_4763190_4763484_-	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	39	4870936-4872533	10	CRT	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	TCTTTNTCTTGNGTTAATTGTTG	23	0	0	NA	NA	N:A	25	25	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|83aa|down_4|NC_011729.1_4875634_4875883_-,NA|128aa|down_9|NC_011729.1_4879785_4880169_+	NA|206aa|up_9|NC_011729.1_4858232_4858850_+	COG3932, COG3932, Uncharacterized ABC-type transport system, permease components [General function prediction only]	NA|326aa|up_8|NC_011729.1_4858965_4859943_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|606aa|up_7|NC_011729.1_4860685_4862503_+	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|123aa|up_6|NC_011729.1_4862677_4863046_+	TIGR02058, lin0512_fam, conserved hypothetical protein	NA|668aa|up_5|NC_011729.1_4863367_4865371_+	cd07338, M48B_HtpX_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|74aa|up_4|NC_011729.1_4865388_4865610_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|195aa|up_3|NC_011729.1_4865668_4866253_-	pfam14238, DUF4340, Domain of unknown function (DUF4340)	NA|704aa|up_2|NC_011729.1_4866425_4868537_-	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]	NA|261aa|up_1|NC_011729.1_4868542_4869325_-	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|320aa|up_0|NC_011729.1_4869326_4870286_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|156aa|down_0|NC_011729.1_4873220_4873688_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|187aa|down_1|NC_011729.1_4873712_4874273_-	COG1555, ComEA, DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]	NA|133aa|down_2|NC_011729.1_4874449_4874848_-	COG2314, XynA, Predicted membrane protein [Function unknown]	NA|115aa|down_3|NC_011729.1_4875238_4875583_+	TIGR00365, TIGR00365, monothiol glutaredoxin, Grx4 family	NA|83aa|down_4|NC_011729.1_4875634_4875883_-	NA	NA|264aa|down_5|NC_011729.1_4876188_4876980_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|249aa|down_6|NC_011729.1_4876974_4877721_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|348aa|down_7|NC_011729.1_4878032_4879076_+	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|117aa|down_8|NC_011729.1_4879206_4879557_-	PRK14298, PRK14298, chaperone protein DnaJ; Provisional	NA|128aa|down_9|NC_011729.1_4879785_4880169_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	40	4916091-4916174	37	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	CTGCGATCGCTTAATAATTGATT	23	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|386aa|up_9|NC_011729.1_4903507_4904665_+,NA|68aa|up_8|NC_011729.1_4904750_4904954_-,NA|75aa|up_7|NC_011729.1_4905145_4905370_+,NA|84aa|up_5|NC_011729.1_4909217_4909469_+,NA	NA|386aa|up_9|NC_011729.1_4903507_4904665_+	NA	NA|68aa|up_8|NC_011729.1_4904750_4904954_-	NA	NA|75aa|up_7|NC_011729.1_4905145_4905370_+	NA	NA|1191aa|up_6|NC_011729.1_4905387_4908960_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|84aa|up_5|NC_011729.1_4909217_4909469_+	NA	NA|176aa|up_4|NC_011729.1_4910224_4910752_+	PRK09267, PRK09267, flavodoxin FldA; Validated	NA|602aa|up_3|NC_011729.1_4911039_4912845_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|351aa|up_2|NC_011729.1_4912996_4914049_+	TIGR03609, S_layer_CsaB, polysaccharide pyruvyl transferase CsaB	NA|218aa|up_1|NC_011729.1_4914265_4914919_-	PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family protein	NA|139aa|up_0|NC_011729.1_4915080_4915497_-	pfam14812, PBP1_TM, Transmembrane domain of transglycosylase PBP1 at N-terminal	NA|726aa|down_0|NC_011729.1_4916408_4918586_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|230aa|down_1|NC_011729.1_4918768_4919458_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|731aa|down_2|NC_011729.1_4919723_4921916_+	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|221aa|down_3|NC_011729.1_4921904_4922567_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|320aa|down_4|NC_011729.1_4922742_4923702_+	COG4638, HcaE, Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit [Inorganic ion transport and metabolism / General function prediction only]	NA|1268aa|down_5|NC_011729.1_4924034_4927838_-	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|315aa|down_6|NC_011729.1_4928546_4929491_+	cd08267, MDR1, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|212aa|down_7|NC_011729.1_4929565_4930201_-	cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea	NA|648aa|down_8|NC_011729.1_4930345_4932289_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|216aa|down_9|NC_011729.1_4932352_4933000_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	41	4916758-4916979	11	CRT	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	GCGTAGNNGTTGGCGTAG	18	0	0	NA	NA	N:A	5	5	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|68aa|up_9|NC_011729.1_4904750_4904954_-,NA|75aa|up_8|NC_011729.1_4905145_4905370_+,NA|84aa|up_6|NC_011729.1_4909217_4909469_+,NA	NA|68aa|up_9|NC_011729.1_4904750_4904954_-	NA	NA|75aa|up_8|NC_011729.1_4905145_4905370_+	NA	NA|1191aa|up_7|NC_011729.1_4905387_4908960_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|84aa|up_6|NC_011729.1_4909217_4909469_+	NA	NA|176aa|up_5|NC_011729.1_4910224_4910752_+	PRK09267, PRK09267, flavodoxin FldA; Validated	NA|602aa|up_4|NC_011729.1_4911039_4912845_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|351aa|up_3|NC_011729.1_4912996_4914049_+	TIGR03609, S_layer_CsaB, polysaccharide pyruvyl transferase CsaB	NA|218aa|up_2|NC_011729.1_4914265_4914919_-	PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family protein	NA|139aa|up_1|NC_011729.1_4915080_4915497_-	pfam14812, PBP1_TM, Transmembrane domain of transglycosylase PBP1 at N-terminal	NA|151aa|up_0|NC_011729.1_4915651_4916104_+	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|230aa|down_0|NC_011729.1_4918768_4919458_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|731aa|down_1|NC_011729.1_4919723_4921916_+	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|221aa|down_2|NC_011729.1_4921904_4922567_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|320aa|down_3|NC_011729.1_4922742_4923702_+	COG4638, HcaE, Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit [Inorganic ion transport and metabolism / General function prediction only]	NA|1268aa|down_4|NC_011729.1_4924034_4927838_-	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|315aa|down_5|NC_011729.1_4928546_4929491_+	cd08267, MDR1, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|212aa|down_6|NC_011729.1_4929565_4930201_-	cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea	NA|648aa|down_7|NC_011729.1_4930345_4932289_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|216aa|down_8|NC_011729.1_4932352_4933000_+	pfam05685, Uma2, Putative restriction endonuclease	NA|366aa|down_9|NC_011729.1_4933195_4934293_+	COG1752, RssA, Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	42	5095710-5096251	38,12,13	CRISPRCasFinder,CRT,PILER-CR	no	cas14j	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Unclear	GTTTCAATCCCGTTGCCAGGATTCATTAATTAAAAAG,GTTTCAATCCCGTTGCCAGGATTCATTAATTAAAAAG,CTTTTTAATT--AATGAATCCTGGCAACGGGATTGAAAC	37,37,39	2	2	5095817-5095852|5095819-5095854	NC_011729.1_153399-153434|NC_011729.1_153399-153434	N:A	7,7,7	7	TypeV	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|529aa|up_9|NC_011729.1_5081575_5083162_+,NA|67aa|down_9|NC_011729.1_5111117_5111318_+	NA|529aa|up_9|NC_011729.1_5081575_5083162_+	NA	NA|551aa|up_8|NC_011729.1_5083826_5085479_+	COG3379, COG3379, Uncharacterized conserved protein [Function unknown]	NA|554aa|up_7|NC_011729.1_5085640_5087302_+	COG3379, COG3379, Uncharacterized conserved protein [Function unknown]	NA|468aa|up_6|NC_011729.1_5087346_5088750_+	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]	NA|379aa|up_5|NC_011729.1_5088948_5090085_+	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|306aa|up_4|NC_011729.1_5090122_5091040_+	pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2	NA|440aa|up_3|NC_011729.1_5091233_5092553_-	pfam04932, Wzy_C, O-Antigen ligase	NA|363aa|up_2|NC_011729.1_5092713_5093802_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|65aa|up_1|NC_011729.1_5093907_5094102_-	NF012221, MARTX_Nterm, MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin RtxA	NA|474aa|up_0|NC_011729.1_5094127_5095549_-	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|975aa|down_0|NC_011729.1_5096906_5099831_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|821aa|down_1|NC_011729.1_5099848_5102311_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|183aa|down_2|NC_011729.1_5102432_5102981_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|121aa|down_3|NC_011729.1_5102989_5103352_-	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|409aa|down_4|NC_011729.1_5103481_5104708_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|350aa|down_5|NC_011729.1_5105314_5106364_+	PRK05437, PRK05437, isopentenyl pyrophosphate isomerase; Provisional	NA|272aa|down_6|NC_011729.1_5106449_5107265_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas14j|397aa|down_7|NC_011729.1_5107795_5108986_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|636aa|down_8|NC_011729.1_5109124_5111032_+	smart00955, RNB, This domain is the catalytic domain of ribonuclease II	NA|67aa|down_9|NC_011729.1_5111117_5111318_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	43	5341587-5341727	39	CRISPRCasFinder	no		cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Orphan	TATCTAATCATAGTGATCATTTAATCATTTTAATCACAGTTAAGACTTTGTG	52	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|191aa|up_7|NC_011729.1_5334393_5334966_+,NA|66aa|up_5|NC_011729.1_5337157_5337355_+,NA|97aa|down_2|NC_011729.1_5345293_5345584_-,NA|209aa|down_7|NC_011729.1_5352524_5353151_-	NA|591aa|up_9|NC_011729.1_5331397_5333170_+	COG3975, COG3975, Predicted protease with the C-terminal PDZ domain [General function prediction only]	NA|305aa|up_8|NC_011729.1_5333198_5334113_+	PRK06606, PRK06606, branched-chain amino acid transaminase	NA|191aa|up_7|NC_011729.1_5334393_5334966_+	NA	NA|418aa|up_6|NC_011729.1_5335572_5336826_-	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|66aa|up_5|NC_011729.1_5337157_5337355_+	NA	NA|160aa|up_4|NC_011729.1_5337465_5337945_+	pfam16166, TIC20, Chloroplast import apparatus Tic20-like	NA|260aa|up_3|NC_011729.1_5338024_5338804_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|177aa|up_2|NC_011729.1_5339105_5339636_+	cd15487, bS6_chloro_cyano, 30S ribosomal protein S6 of chloroplasts and cyanobacteria	NA|429aa|up_1|NC_011729.1_5339662_5340949_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|133aa|up_0|NC_011729.1_5341094_5341493_-	TIGR04256, conserved_hypothetical_protein, GxxExxY protein	NA|117aa|down_0|NC_011729.1_5343914_5344265_-	PRK07451, PRK07451, translation initiation factor	NA|307aa|down_1|NC_011729.1_5344368_5345289_-	cd08919, PBP-like, Phycobiliproteins (PBPs) and related proteins	NA|97aa|down_2|NC_011729.1_5345293_5345584_-	NA	NA|521aa|down_3|NC_011729.1_5345606_5347169_-	cd07207, Pat_ExoU_VipD_like, ExoU and VipD-like proteins; homologus to patatin, cPLA2, and iPLA2	NA|626aa|down_4|NC_011729.1_5347320_5349198_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|218aa|down_5|NC_011729.1_5349432_5350086_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|718aa|down_6|NC_011729.1_5350330_5352484_+	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|209aa|down_7|NC_011729.1_5352524_5353151_-	NA	NA|207aa|down_8|NC_011729.1_5353458_5354079_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|147aa|down_9|NC_011729.1_5354249_5354690_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	44	5376812-5376964	40	CRISPRCasFinder	no	csa3	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type I-A	AGAGGATATGATTTTAGTAGGACTTGTGTTTTGAGCAATTTCCTTTAATTTTAA	54	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|223aa|down_4|NC_011729.1_5382004_5382673_+,NA|148aa|down_6|NC_011729.1_5383682_5384126_-,NA|71aa|down_7|NC_011729.1_5384216_5384429_-,NA|79aa|down_8|NC_011729.1_5384460_5384697_-	NA|295aa|up_9|NC_011729.1_5366213_5367098_-	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|305aa|up_8|NC_011729.1_5367347_5368262_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|121aa|up_7|NC_011729.1_5368532_5368895_+	pfam08853, DUF1823, Domain of unknown function (DUF1823)	NA|381aa|up_6|NC_011729.1_5368944_5370087_+	PRK05958, PRK05958, 8-amino-7-oxononanoate synthase; Reviewed	NA|67aa|up_5|NC_011729.1_5370287_5370488_-	pfam05685, Uma2, Putative restriction endonuclease	NA|528aa|up_4|NC_011729.1_5370576_5372160_-	cd16414, dndB_like, DNA-sulfur modification-associated domain	NA|250aa|up_3|NC_011729.1_5372523_5373273_+	pfam14072, DndB, DNA-sulfur modification-associated	NA|119aa|up_2|NC_011729.1_5373447_5373804_+	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|392aa|up_1|NC_011729.1_5373853_5375029_-	pfam14072, DndB, DNA-sulfur modification-associated	NA|542aa|up_0|NC_011729.1_5375119_5376745_+	PRK06850, PRK06850, hypothetical protein; Provisional	NA|465aa|down_0|NC_011729.1_5376997_5378392_-	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|669aa|down_1|NC_011729.1_5378758_5380765_+	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|192aa|down_2|NC_011729.1_5380771_5381347_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|133aa|down_3|NC_011729.1_5381507_5381906_+	pfam08870, DndE, DNA sulphur modification protein DndE	NA|223aa|down_4|NC_011729.1_5382004_5382673_+	NA	NA|273aa|down_5|NC_011729.1_5382736_5383555_+	TIGR00726, Laccase_domain_protein, YfiH family protein	NA|148aa|down_6|NC_011729.1_5383682_5384126_-	NA	NA|71aa|down_7|NC_011729.1_5384216_5384429_-	NA	NA|79aa|down_8|NC_011729.1_5384460_5384697_-	NA	NA|282aa|down_9|NC_011729.1_5384995_5385841_+	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	45	5399751-5399876	41	CRISPRCasFinder	no	csa3	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type I-A	CCTTCATCAGCGATAAAAGTTTCCATTTGATAA	33	0	0	NA	NA	N:A	1	1	Orphan	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|170aa|up_6|NC_011729.1_5391222_5391732_+,NA|156aa|up_5|NC_011729.1_5391822_5392290_-,NA|70aa|down_0|NC_011729.1_5401246_5401456_-,NA|70aa|down_3|NC_011729.1_5405801_5406011_+,NA|114aa|down_7|NC_011729.1_5410611_5410953_+,NA|86aa|down_9|NC_011729.1_5412338_5412596_+	NA|275aa|up_9|NC_011729.1_5388118_5388943_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|448aa|up_8|NC_011729.1_5388929_5390273_+	pfam00743, FMO-like, Flavin-binding monooxygenase-like	NA|71aa|up_7|NC_011729.1_5390431_5390644_+	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|170aa|up_6|NC_011729.1_5391222_5391732_+	NA	NA|156aa|up_5|NC_011729.1_5391822_5392290_-	NA	NA|219aa|up_4|NC_011729.1_5392555_5393212_-	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|891aa|up_3|NC_011729.1_5393441_5396114_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|92aa|up_2|NC_011729.1_5396858_5397134_-	pfam11998, DUF3493, Protein of unknown function (DUF3493)	NA|313aa|up_1|NC_011729.1_5397521_5398460_+	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass	NA|324aa|up_0|NC_011729.1_5398601_5399573_+	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass	NA|70aa|down_0|NC_011729.1_5401246_5401456_-	NA	NA|646aa|down_1|NC_011729.1_5401893_5403831_+	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|570aa|down_2|NC_011729.1_5403852_5405562_+	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|70aa|down_3|NC_011729.1_5405801_5406011_+	NA	NA|685aa|down_4|NC_011729.1_5406086_5408141_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|340aa|down_5|NC_011729.1_5408485_5409505_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|299aa|down_6|NC_011729.1_5409535_5410432_+	TIGR01247, drrB, daunorubicin resistance ABC transporter membrane protein	NA|114aa|down_7|NC_011729.1_5410611_5410953_+	NA	NA|258aa|down_8|NC_011729.1_5411155_5411929_+	TIGR03413, GSH_gloB, hydroxyacylglutathione hydrolase	NA|86aa|down_9|NC_011729.1_5412338_5412596_+	NA
GCF_000021825.1_ASM2182v1	NC_011729	Gloeothece citriformis PCC 7424, complete sequence	46	5467998-5468128	42	CRISPRCasFinder	no	c2c9_V-U4	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS	Type V-U4	GAACAAAGACCGCCTACGCGGTCTAAATTTCAACCT	36	0	0	NA	NA	N:A	1	1	TypeV-U4	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA,NA|106aa|down_4|NC_011729.1_5472012_5472330_+	NA|385aa|up_9|NC_011729.1_5453595_5454750_+	PLN02572, PLN02572, UDP-sulfoquinovose synthase	NA|378aa|up_8|NC_011729.1_5455076_5456210_+	PLN02871, PLN02871, UDP-sulfoquinovose:DAG sulfoquinovosyltransferase	NA|192aa|up_7|NC_011729.1_5457797_5458373_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	c2c9_V-U4|394aa|up_6|NC_011729.1_5459013_5460195_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|811aa|up_5|NC_011729.1_5460290_5462723_+	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|175aa|up_4|NC_011729.1_5463036_5463561_-	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass	NA|138aa|up_3|NC_011729.1_5463719_5464133_-	COG2510, COG2510, Predicted membrane protein [Function unknown]	NA|389aa|up_2|NC_011729.1_5464148_5465315_-	cd05483, retropepsin_like_bacteria, Bacterial aspartate proteases, retropepsin-like protease family	NA|434aa|up_1|NC_011729.1_5465318_5466620_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|403aa|up_0|NC_011729.1_5466747_5467956_+	PRK13371, PRK13371, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional	NA|126aa|down_0|NC_011729.1_5468270_5468648_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|154aa|down_1|NC_011729.1_5469034_5469496_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|259aa|down_2|NC_011729.1_5469609_5470386_-	PRK07408, PRK07408, RNA polymerase sigma factor SigF; Reviewed	NA|276aa|down_3|NC_011729.1_5470997_5471825_-	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|106aa|down_4|NC_011729.1_5472012_5472330_+	NA	NA|478aa|down_5|NC_011729.1_5472534_5473968_+	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|300aa|down_6|NC_011729.1_5474136_5475036_-	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|152aa|down_7|NC_011729.1_5475101_5475557_-	pfam04972, BON, BON domain	NA|213aa|down_8|NC_011729.1_5475597_5476236_-	pfam06897, DUF1269, Protein of unknown function (DUF1269)	NA|1053aa|down_9|NC_011729.1_5476692_5479851_-	sd00006, TPR, Tetratricopeptide repeat
GCF_000021825.1_ASM2182v1	NC_011738	Gloeothece citriformis PCC 7424 plasmid pP742401, complete sequence	1	262869-263274	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	WYL,csm6,RT,cas1,cas2	cmr4gr7,cmr5gr11,WYL,csm6,RT,cas1,cas2	Type III-A	GTTTCCAACTATTCCTATTTAACCCAATAGGTAGGG,CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAACGA,CCCTACCTATTGGGTTAAATAGGAATAGTTGGAAAC	36,38,36	0	0	NA	NA	N:A	4,4,5	5	TypeIII-A	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|75aa|up_7|NC_011738.1_254893_255118_+,NA|119aa|up_6|NC_011738.1_255130_255487_+,NA|225aa|up_4|NC_011738.1_257316_257991_+,NA|46aa|down_5|NC_011738.1_270936_271074_-,NA|539aa|down_8|NC_011738.1_272235_273852_-,NA|251aa|down_9|NC_011738.1_274515_275268_-	NA|80aa|up_9|NC_011738.1_252601_252841_-	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|210aa|up_8|NC_011738.1_254243_254873_+	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|75aa|up_7|NC_011738.1_254893_255118_+	NA	NA|119aa|up_6|NC_011738.1_255130_255487_+	NA	NA|378aa|up_5|NC_011738.1_255982_257116_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|225aa|up_4|NC_011738.1_257316_257991_+	NA	NA|458aa|up_3|NC_011738.1_258046_259420_+	cd17933, DEXSc_RecD-like, DEXS-box helicase domain of RecD and similar proteins	NA|195aa|up_2|NC_011738.1_259422_260007_-	pfam13328, HD_4, HD domain	WYL|459aa|up_1|NC_011738.1_260006_261383_-	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	csm6|383aa|up_0|NC_011738.1_261506_262655_+	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	NA|111aa|down_0|NC_011738.1_263336_263669_+	PRK09974, PRK09974, type II toxin-antitoxin system PrlF family antitoxin	NA|173aa|down_1|NC_011738.1_263671_264190_+	pfam11663, Toxin_YhaV, Toxin with endonuclease activity, of toxin-antitoxin system	RT|310aa|down_2|NC_011738.1_264544_265474_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas1|335aa|down_3|NC_011738.1_265617_266622_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|92aa|down_4|NC_011738.1_266622_266898_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|46aa|down_5|NC_011738.1_270936_271074_-	NA	NA|165aa|down_6|NC_011738.1_271409_271904_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|74aa|down_7|NC_011738.1_271900_272122_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|539aa|down_8|NC_011738.1_272235_273852_-	NA	NA|251aa|down_9|NC_011738.1_274515_275268_-	NA
GCF_000021825.1_ASM2182v1	NC_011737	Gloeothece citriformis PCC 7424 plasmid pP742402, complete sequence	1	168133-169237	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas2,cas1,cas5,cas7,cas8b3,cas6	cas3,cas2,cas1,cas5,cas7,cas8b3,cas6	Unclear	GTGATCAACGCCTTTCGGCATCAAAGGTTAAAGTAG,GTGATCAACGCCTTTCGGCATCAAAGGTTAAAGTAG,GTGATCAACGCCTTTCGGCATCAAAGGTTAAAGTAG	36,36,36	0	0	NA	NA	N:A	15,15,15	15	Unclear	cas14j,cas14k,DEDDh,c2c9_V-U4,cas6,cas5,cas7,cas8a4,cas3,Cas14c_CAS-V-F,cas10,cmr3gr5,cmr4gr7,cmr5gr11,Cas14u_CAS-V,RT,csa3,WYL,cas10d,csc2gr7,csc1gr5,cas4,cas1,cas2,PD-DExK,cas8b3,DinG,cas8e,cse2gr11,cas6e,Cas9_archaeal,csx15,2OG_CAS,csm6	NA|201aa|up_7|NC_011737.1_161742_162345_+,NA|71aa|up_4|NC_011737.1_165012_165225_+,NA|100aa|up_2|NC_011737.1_166193_166493_+,NA|110aa|up_1|NC_011737.1_166523_166853_+,NA|174aa|down_9|NC_011737.1_182030_182552_-	NA|480aa|up_9|NC_011737.1_158803_160243_+	cd00140, beta_clamp, Beta clamp domain	NA|453aa|up_8|NC_011737.1_160314_161673_+	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|201aa|up_7|NC_011737.1_161742_162345_+	NA	NA|604aa|up_6|NC_011737.1_162671_164483_+	COG0417, PolB, DNA polymerase elongation subunit (family B) [DNA replication, recombination, and repair]	NA|174aa|up_5|NC_011737.1_164489_165011_+	cd02440, AdoMet_MTases, S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I;  AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy)	NA|71aa|up_4|NC_011737.1_165012_165225_+	NA	NA|319aa|up_3|NC_011737.1_165240_166197_+	PRK07452, PRK07452, DNA polymerase III subunit delta; Validated	NA|100aa|up_2|NC_011737.1_166193_166493_+	NA	NA|110aa|up_1|NC_011737.1_166523_166853_+	NA	NA|307aa|up_0|NC_011737.1_166892_167813_+	PRK07399, PRK07399, DNA polymerase III subunit delta'; Validated	NA|388aa|down_0|NC_011737.1_169968_171132_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	cas2|98aa|down_1|NC_011737.1_172654_172948_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|559aa|down_2|NC_011737.1_172955_174632_-	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	NA|214aa|down_3|NC_011737.1_174745_175387_-	pfam05685, Uma2, Putative restriction endonuclease	cas5|221aa|down_4|NC_011737.1_175559_176222_-	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|342aa|down_5|NC_011737.1_176218_177244_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas8b3|523aa|down_6|NC_011737.1_177243_178812_-	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas3|828aa|down_7|NC_011737.1_178802_181286_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas6|215aa|down_8|NC_011737.1_181282_181927_-	pfam09559, Cas6, Cas6 Crispr	NA|174aa|down_9|NC_011737.1_182030_182552_-	NA
