assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	1	512560-513623	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type I-D	GTTTCAATCCCATTACTAGGATTCATTAAAAA--GAAAC,GTTTCAATCCCATTACTAGGATTCATTAAAAAGAAAC,GTTTCAATCCCATTACTAGGATTCATTAAAAAGAAAC	39,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	14,14,14	14	TypeI-D	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|74aa|up_3|NC_013161.1_505890_506112_-,NA|83aa|down_8|NC_013161.1_523911_524160_-	NA|262aa|up_9|NC_013161.1_498966_499752_+	PRK14143, PRK14143, heat shock protein GrpE; Provisional	NA|694aa|up_8|NC_013161.1_499902_501984_+	PRK13411, PRK13411, molecular chaperone DnaK; Provisional	NA|132aa|up_7|NC_013161.1_502014_502410_+	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|412aa|up_6|NC_013161.1_502476_503712_+	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|283aa|up_5|NC_013161.1_504310_505159_+	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|184aa|up_4|NC_013161.1_505323_505875_+	pfam11371, DUF3172, Protein of unknown function (DUF3172)	NA|74aa|up_3|NC_013161.1_505890_506112_-	NA	NA|1107aa|up_2|NC_013161.1_506362_509683_-	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|219aa|up_1|NC_013161.1_509894_510551_+	pfam07885, Ion_trans_2, Ion channel	NA|271aa|up_0|NC_013161.1_511603_512416_+	COG1562, ERG9, Phytoene/squalene synthetase [Lipid metabolism]	cas2|98aa|down_0|NC_013161.1_513816_514110_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NC_013161.1_514106_515084_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|197aa|down_2|NC_013161.1_515116_515707_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|270aa|down_3|NC_013161.1_515832_516642_-	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	csc1gr5|259aa|down_4|NC_013161.1_516610_517387_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|357aa|down_5|NC_013161.1_517529_518600_-	pfam18320, Csc2, Csc2 Crispr	cas10d|975aa|down_6|NC_013161.1_518672_521597_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	cas3|702aa|down_7|NC_013161.1_521654_523760_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|83aa|down_8|NC_013161.1_523911_524160_-	NA	WYL|316aa|down_9|NC_013161.1_524356_525304_+	pfam13280, WYL, WYL domain
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	2	603702-603889	2	PILER-CR	no		PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Orphan	TAGTTTCAATCCC--TCATAGGGATTATTGCTTATTTTAACT	42	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|364aa|up_3|NC_013161.1_600139_601231_+,NA|75aa|down_3|NC_013161.1_607048_607273_+,NA|161aa|down_9|NC_013161.1_611551_612034_+	NA|157aa|up_9|NC_013161.1_590149_590620_+	pfam08846, DUF1816, Domain of unknown function (DUF1816)	NA|1265aa|up_8|NC_013161.1_590644_594439_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|553aa|up_7|NC_013161.1_594487_596146_-	pfam13676, TIR_2, TIR domain	NA|321aa|up_6|NC_013161.1_596498_597461_+	COG1622, CyoA, Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]	NA|556aa|up_5|NC_013161.1_597489_599157_+	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|206aa|up_4|NC_013161.1_599255_599873_+	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|364aa|up_3|NC_013161.1_600139_601231_+	NA	NA|50aa|up_2|NC_013161.1_601227_601377_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|493aa|up_1|NC_013161.1_601459_602938_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|216aa|up_0|NC_013161.1_603005_603653_+	COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|391aa|down_0|NC_013161.1_604198_605371_-	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|169aa|down_1|NC_013161.1_605375_605882_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|192aa|down_2|NC_013161.1_606146_606722_+	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|75aa|down_3|NC_013161.1_607048_607273_+	NA	NA|532aa|down_4|NC_013161.1_607362_608958_+	cd07488, Peptidases_S8_2, Peptidase S8 family domain, uncharacterized subfamily 2	NA|171aa|down_5|NC_013161.1_609096_609609_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|288aa|down_6|NC_013161.1_609654_610518_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains	NA|56aa|down_7|NC_013161.1_610581_610749_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|264aa|down_8|NC_013161.1_610732_611524_+	sd00006, TPR, Tetratricopeptide repeat	NA|161aa|down_9|NC_013161.1_611551_612034_+	NA
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	3	827748-827845	2	CRISPRCasFinder	no		PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Orphan	GTTGAGGAAGAAGAAGACATTGA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|168aa|up_9|NC_013161.1_818155_818659_+,NA|237aa|up_5|NC_013161.1_822323_823034_+,NA|66aa|up_0|NC_013161.1_827018_827216_+,NA|163aa|down_5|NC_013161.1_834426_834915_+,NA|159aa|down_8|NC_013161.1_836981_837458_+	NA|168aa|up_9|NC_013161.1_818155_818659_+	NA	NA|271aa|up_8|NC_013161.1_818720_819533_-	COG4577, CcmK, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|355aa|up_7|NC_013161.1_819754_820819_+	pfam18087, RuBisCo_chap_C, Rubisco Assembly chaperone C-terminal domain	NA|326aa|up_6|NC_013161.1_821128_822106_+	PRK09375, PRK09375, quinolinate synthase NadA	NA|237aa|up_5|NC_013161.1_822323_823034_+	NA	NA|467aa|up_4|NC_013161.1_823308_824709_-	CHL00073, chlN, photochlorophyllide reductase subunit N	NA|125aa|up_3|NC_013161.1_824713_825088_-	pfam17265, DUF5331, Family of unknown function (DUF5331)	NA|290aa|up_2|NC_013161.1_825249_826119_-	CHL00072, chlL, photochlorophyllide reductase subunit L	NA|55aa|up_1|NC_013161.1_826810_826975_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|66aa|up_0|NC_013161.1_827018_827216_+	NA	NA|818aa|down_0|NC_013161.1_828424_830878_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|198aa|down_1|NC_013161.1_830961_831555_+	COG5381, COG5381, Uncharacterized protein conserved in bacteria [Function unknown]	NA|118aa|down_2|NC_013161.1_831614_831968_+	COG5439, COG5439, Uncharacterized conserved protein [Function unknown]	NA|282aa|down_3|NC_013161.1_832040_832886_-	COG0739, NlpD, Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane]	NA|433aa|down_4|NC_013161.1_832891_834190_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|163aa|down_5|NC_013161.1_834426_834915_+	NA	NA|306aa|down_6|NC_013161.1_835004_835922_-	PLN02578, PLN02578, hydrolase	NA|177aa|down_7|NC_013161.1_836114_836645_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|159aa|down_8|NC_013161.1_836981_837458_+	NA	NA|315aa|down_9|NC_013161.1_837454_838399_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	4	910346-910959	3,3,2	PILER-CR,CRISPRCasFinder,CRT	no	csa3	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type I-A	GTTTCAATCCCATTACTAGGATTCATTAATAA--GAAAC,GTTTCTTATTAATGAATCCTAGTAATGGGATTGAAAC,GTTTCTTATTAATGAATCCTAGTAATGGGATTGAAAC	39,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	7,8,8	8	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|234aa|up_1|NC_013161.1_907202_907904_-,NA|392aa|down_3|NC_013161.1_914243_915419_-,NA|229aa|down_4|NC_013161.1_915764_916451_+,NA|147aa|down_8|NC_013161.1_919987_920428_-,NA|92aa|down_9|NC_013161.1_920631_920907_+	csa3|130aa|up_9|NC_013161.1_900689_901079_+	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|58aa|up_8|NC_013161.1_901075_901249_-	pfam02069, Metallothio_Pro, Prokaryotic metallothionein	NA|324aa|up_7|NC_013161.1_901416_902388_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|359aa|up_6|NC_013161.1_902375_903452_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|215aa|up_5|NC_013161.1_903549_904194_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|347aa|up_4|NC_013161.1_904263_905304_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|370aa|up_3|NC_013161.1_905434_906544_-	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|147aa|up_2|NC_013161.1_906732_907173_+	pfam02657, SufE, Fe-S metabolism associated domain	NA|234aa|up_1|NC_013161.1_907202_907904_-	NA	NA|616aa|up_0|NC_013161.1_908251_910099_+	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|170aa|down_0|NC_013161.1_911170_911680_-	pfam01475, FUR, Ferric uptake regulator family	NA|294aa|down_1|NC_013161.1_911876_912758_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|341aa|down_2|NC_013161.1_912933_913956_-	PRK14982, PRK14982, acyl-ACP reductase; Provisional	NA|392aa|down_3|NC_013161.1_914243_915419_-	NA	NA|229aa|down_4|NC_013161.1_915764_916451_+	NA	NA|361aa|down_5|NC_013161.1_916480_917563_-	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|120aa|down_6|NC_013161.1_917727_918087_-	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|570aa|down_7|NC_013161.1_918263_919973_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|147aa|down_8|NC_013161.1_919987_920428_-	NA	NA|92aa|down_9|NC_013161.1_920631_920907_+	NA
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	5	1453381-1453839	3	CRT	no	cas14j	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Unclear	AAAGAAGATGTCCGCGAATTGAAA	24	0	0	NA	NA	NA	9	9	TypeV	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA,NA|125aa|down_4|NC_013161.1_1461591_1461966_+	NA|177aa|up_9|NC_013161.1_1445183_1445714_+	cd16339, CpcS, S-type phycobiliprotein (PBP) lyase	NA|197aa|up_8|NC_013161.1_1445775_1446366_+	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|147aa|up_7|NC_013161.1_1446387_1446828_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|189aa|up_6|NC_013161.1_1446869_1447436_+	pfam05685, Uma2, Putative restriction endonuclease	NA|267aa|up_5|NC_013161.1_1447467_1448268_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|202aa|up_4|NC_013161.1_1448316_1448922_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|689aa|up_3|NC_013161.1_1448923_1450990_-	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|187aa|up_2|NC_013161.1_1451093_1451654_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|250aa|up_1|NC_013161.1_1451667_1452417_-	pfam04016, DUF364, Putative heavy-metal chelation	NA|234aa|up_0|NC_013161.1_1452432_1453134_+	COG2173, DdpX, D-alanyl-D-alanine dipeptidase [Cell envelope biogenesis, outer membrane]	NA|212aa|down_0|NC_013161.1_1453957_1454593_-	COG0739, NlpD, Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane]	NA|606aa|down_1|NC_013161.1_1455051_1456869_+	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|943aa|down_2|NC_013161.1_1456973_1459802_+	PRK14511, PRK14511, malto-oligosyltrehalose synthase	NA|515aa|down_3|NC_013161.1_1459939_1461484_+	COG1626, TreA, Neutral trehalase [Carbohydrate transport and metabolism]	NA|125aa|down_4|NC_013161.1_1461591_1461966_+	NA	NA|152aa|down_5|NC_013161.1_1462249_1462705_+	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|563aa|down_6|NC_013161.1_1462992_1464681_-	PHA00370, III, attachment protein	NA|88aa|down_7|NC_013161.1_1465221_1465485_-	pfam11344, DUF3146, Protein of unknown function (DUF3146)	NA|480aa|down_8|NC_013161.1_1465634_1467074_+	cd00880, Era_like, E	NA|465aa|down_9|NC_013161.1_1467144_1468539_+	pfam10009, DUF2252, Uncharacterized protein conserved in bacteria (DUF2252)
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	6	2228753-2229223	4,4,4	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type V-U5	GTTTCAACGACCATTCCCAACAGGGATGGGTTGAAAG,GTTTCAACGACCATTCCCAACAGGGATGGGTTGAAAG,GTTTCAACGACCATTCCCAACAGGGATGGGTTGAAAG	37,37,37	0	0	NA	NA	V-U5:V-U5:V-U5	5,6,6	6	TypeV-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	c2c5_V-U5|668aa|up_0|NC_013161.1_2226228_2228232_+,NA|193aa|down_5|NC_013161.1_2234031_2234610_+	NA|836aa|up_9|NC_013161.1_2212404_2214912_-	TIGR02687, conserved_hypothetical_protein, TIGR02687 family protein	NA|114aa|up_8|NC_013161.1_2215174_2215516_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|140aa|up_7|NC_013161.1_2215503_2215923_-	pfam08814, XisH, XisH protein	NA|333aa|up_6|NC_013161.1_2216028_2217027_-	NF033452, BREX_1_MTaseX, BREX-1 system adenine-specific DNA-methyltransferase PglX	NA|120aa|up_5|NC_013161.1_2219430_2219790_-	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|207aa|up_4|NC_013161.1_2220695_2221316_+	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|1169aa|up_3|NC_013161.1_2221353_2224860_-	NF033441, BREX_BrxC, BREX system P-loop protein BrxC	NA|203aa|up_2|NC_013161.1_2224873_2225482_-	pfam08747, DUF1788, Domain of unknown function (DUF1788)	NA|159aa|up_1|NC_013161.1_2225646_2226123_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	c2c5_V-U5|668aa|up_0|NC_013161.1_2226228_2228232_+	NA	NA|99aa|down_0|NC_013161.1_2229524_2229821_-	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|174aa|down_1|NC_013161.1_2229841_2230363_-	PRK02603, PRK02603, photosystem I assembly protein Ycf3; Provisional	NA|197aa|down_2|NC_013161.1_2230450_2231041_-	PRK07402, PRK07402, precorrin-6Y C5,15-methyltransferase subunit CbiT	NA|288aa|down_3|NC_013161.1_2231082_2231946_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|553aa|down_4|NC_013161.1_2232064_2233723_+	COG3961, COG3961, Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes [Carbohydrate transport and metabolism / Coenzyme metabolism / General function prediction only]	NA|193aa|down_5|NC_013161.1_2234031_2234610_+	NA	NA|263aa|down_6|NC_013161.1_2234820_2235609_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|733aa|down_7|NC_013161.1_2235624_2237823_-	pfam00990, GGDEF, Diguanylate cyclase, GGDEF domain	NA|331aa|down_8|NC_013161.1_2238178_2239171_+	PRK06270, PRK06270, homoserine dehydrogenase; Provisional	NA|846aa|down_9|NC_013161.1_2239844_2242382_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	7	2251718-2252131	5	CRT	no	Cas14c_CAS-V-F	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Unclear	AAGACGNCGGCAAAGANG	18	1	1	2251736-2251765	NC_013161.1_2251700-2251729	NA	8	8	TypeV	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA,NA|194aa|down_2|NC_013161.1_2255439_2256021_-	NA|846aa|up_9|NC_013161.1_2239844_2242382_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|208aa|up_8|NC_013161.1_2242375_2242999_+	TIGR04282, hypothetical_protein, transferase 1, rSAM/selenodomain-associated	NA|387aa|up_7|NC_013161.1_2243032_2244193_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|143aa|up_6|NC_013161.1_2244311_2244740_-	COG3585, MopI, Molybdopterin-binding protein [Coenzyme metabolism]	NA|263aa|up_5|NC_013161.1_2244857_2245646_+	cd13537, PBP2_YvgL_like, Substrate binding domain of putative molybdate-binding protein YvgL and similar proteins;the type 2 periplasmic binding protein fold	NA|605aa|up_4|NC_013161.1_2245794_2247609_+	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|156aa|up_3|NC_013161.1_2247894_2248362_+	PRK05422, smpB, SsrA-binding protein SmpB	NA|232aa|up_2|NC_013161.1_2248368_2249064_-	pfam13759, 2OG-FeII_Oxy_5, Putative 2OG-Fe(II) oxygenase	NA|315aa|up_1|NC_013161.1_2249181_2250126_+	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|343aa|up_0|NC_013161.1_2250109_2251138_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|429aa|down_0|NC_013161.1_2252301_2253588_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|606aa|down_1|NC_013161.1_2253605_2255423_+	smart00752, HTTM, Horizontally Transferred TransMembrane Domain	NA|194aa|down_2|NC_013161.1_2255439_2256021_-	NA	NA|160aa|down_3|NC_013161.1_2256086_2256566_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|428aa|down_4|NC_013161.1_2256915_2258199_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|576aa|down_5|NC_013161.1_2258286_2260014_+	pfam05199, GMC_oxred_C, GMC oxidoreductase	NA|293aa|down_6|NC_013161.1_2260938_2261817_+	cd00293, USP_Like, Usp: Universal stress protein family	NA|194aa|down_7|NC_013161.1_2261847_2262429_+	pfam12263, DUF3611, Protein of unknown function (DUF3611)	NA|480aa|down_8|NC_013161.1_2262573_2264013_+	PRK10811, rne, ribonuclease E; Reviewed	Cas14c_CAS-V-F|402aa|down_9|NC_013161.1_2264283_2265489_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	8	2288506-2289058	5,5,6	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type V-U5	CTTTCAACCTACCCCTTATCGGGATGGCGGTTGAAAC,CTTTCAACCTACCCCTTATCGGGATGGCGGTTGAAAC,CTTTCAACCTACCCCTTATCGGGATGGCGGTTGAAAC	37,37,37	0	0	NA	NA	V-U5:V-U5:V-U5	7,7,7	7	TypeV-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|156aa|up_5|NC_013161.1_2282259_2282727_-,NA|64aa|up_1|NC_013161.1_2285414_2285606_-,c2c5_V-U5|613aa|down_0|NC_013161.1_2289596_2291435_-,NA|73aa|down_4|NC_013161.1_2294064_2294283_+,NA|87aa|down_7|NC_013161.1_2295893_2296154_+,NA|84aa|down_8|NC_013161.1_2296190_2296442_+	NA|329aa|up_9|NC_013161.1_2276483_2277470_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|217aa|up_8|NC_013161.1_2277560_2278211_-	pfam05685, Uma2, Putative restriction endonuclease	NA|806aa|up_7|NC_013161.1_2278336_2280754_-	COG4354, COG4354, Predicted bile acid beta-glucosidase [Carbohydrate transport and metabolism]	NA|429aa|up_6|NC_013161.1_2280917_2282204_-	PRK07369, PRK07369, dihydroorotase; Provisional	NA|156aa|up_5|NC_013161.1_2282259_2282727_-	NA	NA|196aa|up_4|NC_013161.1_2282911_2283499_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|381aa|up_3|NC_013161.1_2283615_2284758_+	TIGR02048, gshA_cyano, glutamate--cysteine ligase, cyanobacterial, putative	NA|153aa|up_2|NC_013161.1_2284952_2285411_+	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|64aa|up_1|NC_013161.1_2285414_2285606_-	NA	NA|728aa|up_0|NC_013161.1_2285871_2288055_+	pfam01551, Peptidase_M23, Peptidase family M23	c2c5_V-U5|613aa|down_0|NC_013161.1_2289596_2291435_-	NA	NA|144aa|down_1|NC_013161.1_2291503_2291935_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|523aa|down_2|NC_013161.1_2291949_2293518_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|123aa|down_3|NC_013161.1_2293620_2293989_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|73aa|down_4|NC_013161.1_2294064_2294283_+	NA	NA|80aa|down_5|NC_013161.1_2294279_2294519_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|434aa|down_6|NC_013161.1_2294585_2295887_+	cd17253, RMtype1_S_Eco933I-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli O157:H7 EDL933 S subunit (S	NA|87aa|down_7|NC_013161.1_2295893_2296154_+	NA	NA|84aa|down_8|NC_013161.1_2296190_2296442_+	NA	NA|142aa|down_9|NC_013161.1_2296428_2296854_+	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	9	3192357-3192843	6,7,6	CRISPRCasFinder,CRT,PILER-CR	no	c2c5_V-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type V-U5	CTTTCAACCCATCCCTGTTGGGAATGGTCGTTGAAAC,CTTTCAACCCATCCCTGTTGGGAATGGTCGTTGAAAC,GTTTCAACGACCATTCCCAACAGGGATGGGTTGAAAG	37,37,37	0	0	NA	NA	V-U5:V-U5:V-U5	6,6,5	6	TypeV-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA,c2c5_V-U5|671aa|down_0|NC_013161.1_3193363_3195376_-,NA|120aa|down_3|NC_013161.1_3196396_3196756_-,NA|73aa|down_7|NC_013161.1_3204088_3204307_+	NA|310aa|up_9|NC_013161.1_3179815_3180745_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|111aa|up_8|NC_013161.1_3180764_3181097_+	cd11532, NTP-PPase_COG4997, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|365aa|up_7|NC_013161.1_3181105_3182200_+	cd01156, IVD, Isovaleryl-CoA dehydrogenase	NA|411aa|up_6|NC_013161.1_3182430_3183663_-	PRK12459, PRK12459, S-adenosylmethionine synthetase; Provisional	NA|997aa|up_5|NC_013161.1_3183806_3186797_-	COG0474, MgtA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|100aa|up_4|NC_013161.1_3187048_3187348_-	pfam17195, DUF5132, Protein of unknown function (DUF5132)	NA|445aa|up_3|NC_013161.1_3187517_3188852_-	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|198aa|up_2|NC_013161.1_3188963_3189557_-	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|380aa|up_1|NC_013161.1_3189632_3190772_+	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|355aa|up_0|NC_013161.1_3190865_3191930_-	PLN02433, PLN02433, uroporphyrinogen decarboxylase	c2c5_V-U5|671aa|down_0|NC_013161.1_3193363_3195376_-	NA	NA|153aa|down_1|NC_013161.1_3195482_3195941_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|142aa|down_2|NC_013161.1_3195957_3196383_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|120aa|down_3|NC_013161.1_3196396_3196756_-	NA	NA|1283aa|down_4|NC_013161.1_3196833_3200682_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|277aa|down_5|NC_013161.1_3201212_3202043_-	COG2842, COG2842, Uncharacterized ATPase, putative transposase [General function prediction only]	NA|559aa|down_6|NC_013161.1_3202046_3203723_-	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|73aa|down_7|NC_013161.1_3204088_3204307_+	NA	NA|125aa|down_8|NC_013161.1_3204297_3204672_+	cd18738, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|425aa|down_9|NC_013161.1_3204736_3206011_+	cd17287, RMtype1_S_EcoN10ORF171P_TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli N10-0505 S subunit (S
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	10	3440939-3441036	7	CRISPRCasFinder	no		PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Orphan	TCCTAATTTAACCCAATAGGTAGGGA	26	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|124aa|up_8|NC_013161.1_3430122_3430494_+,NA	NA|350aa|up_9|NC_013161.1_3428773_3429823_-	PRK00292, glk, glucokinase; Provisional	NA|124aa|up_8|NC_013161.1_3430122_3430494_+	NA	NA|246aa|up_7|NC_013161.1_3430513_3431251_-	pfam09353, DUF1995, Domain of unknown function (DUF1995)	NA|248aa|up_6|NC_013161.1_3431801_3432545_+	TIGR03716, R_switched_YkoY, integral membrane protein, YkoY family	NA|889aa|up_5|NC_013161.1_3432583_3435250_+	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|257aa|up_4|NC_013161.1_3435338_3436109_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|426aa|up_3|NC_013161.1_3436131_3437409_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|260aa|up_2|NC_013161.1_3437560_3438340_-	pfam13483, Lactamase_B_3, Beta-lactamase superfamily domain	NA|263aa|up_1|NC_013161.1_3438454_3439243_+	pfam13483, Lactamase_B_3, Beta-lactamase superfamily domain	NA|529aa|up_0|NC_013161.1_3439294_3440881_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|454aa|down_0|NC_013161.1_3441279_3442641_+	cd11315, AmyAc_bac1_AmyA, Alpha amylase catalytic domain found in bacterial Alpha-amylases (also called 1,4-alpha-D-glucan-4-glucanohydrolase)	NA|436aa|down_1|NC_013161.1_3442765_3444073_-	pfam05787, DUF839, Bacterial protein of unknown function (DUF839)	NA|611aa|down_2|NC_013161.1_3444308_3446141_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|627aa|down_3|NC_013161.1_3446909_3448790_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|321aa|down_4|NC_013161.1_3448965_3449928_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|131aa|down_5|NC_013161.1_3450165_3450558_-	TIGR02689, arsenate_reductase, arsenate reductase, glutathione/glutaredoxin type	NA|315aa|down_6|NC_013161.1_3450748_3451693_-	PRK05621, PRK05621, F0F1 ATP synthase subunit gamma; Validated	NA|504aa|down_7|NC_013161.1_3451779_3453291_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|186aa|down_8|NC_013161.1_3453341_3453899_-	PRK05758, PRK05758, F0F1 ATP synthase subunit delta; Validated	NA|179aa|down_9|NC_013161.1_3453901_3454438_-	PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	11	4298967-4299800	7,8,8	PILER-CR,CRISPRCasFinder,CRT	no	c2c8_V-U2	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type V-U2	GTTTCAATCCCATTGCTAGGATTCATTAATAA--GAAAC,GTTTCAATCCCATTGCTAGGATTCATTAATAAGAAAC,GTTTCAATCCCATTGCTAGGATTCATTAATAAGAAAC	39,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	11,11,11	11	TypeV-U2	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|253aa|up_2|NC_013161.1_4295608_4296367_-,NA	NA|352aa|up_9|NC_013161.1_4287856_4288912_+	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|534aa|up_8|NC_013161.1_4288947_4290549_-	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|445aa|up_7|NC_013161.1_4290578_4291913_-	TIGR00665, DnaB, replicative DNA helicase	NA|73aa|up_6|NC_013161.1_4291983_4292202_-	PRK11409, PRK11409, YoeB-YefM toxin-antitoxin system antitoxin YefM	NA|70aa|up_5|NC_013161.1_4292471_4292681_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|448aa|up_4|NC_013161.1_4292766_4294110_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|480aa|up_3|NC_013161.1_4294106_4295546_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|253aa|up_2|NC_013161.1_4295608_4296367_-	NA	NA|457aa|up_1|NC_013161.1_4296395_4297766_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|268aa|up_0|NC_013161.1_4298039_4298843_+	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	c2c8_V-U2|495aa|down_0|NC_013161.1_4300241_4301726_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|182aa|down_1|NC_013161.1_4301911_4302457_-	TIGR03426, shape_MreD, rod shape-determining protein MreD	NA|250aa|down_2|NC_013161.1_4302453_4303203_-	PRK13922, PRK13922, rod shape-determining protein MreC; Provisional	NA|350aa|down_3|NC_013161.1_4303229_4304279_-	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|123aa|down_4|NC_013161.1_4304552_4304921_+	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|418aa|down_5|NC_013161.1_4304925_4306179_-	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|300aa|down_6|NC_013161.1_4306736_4307636_+	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|133aa|down_7|NC_013161.1_4307632_4308031_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|94aa|down_8|NC_013161.1_4308140_4308422_+	PRK05974, PRK05974, phosphoribosylformylglycinamidine synthase subunit PurS; Reviewed	NA|228aa|down_9|NC_013161.1_4308425_4309109_+	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	12	4370008-4370088	9	CRISPRCasFinder	no	c2c9_V-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type V-U4	TTGCTGTTTATTGGCTTCTTGAAAGGC	27	0	0	NA	NA	NA	1	1	TypeV-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|70aa|up_3|NC_013161.1_4366415_4366625_+,NA	NA|367aa|up_9|NC_013161.1_4359330_4360431_+	PRK00002, aroB, 3-dehydroquinate synthase; Reviewed	NA|582aa|up_8|NC_013161.1_4360527_4362273_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|311aa|up_7|NC_013161.1_4362616_4363549_-	pfam07313, DUF1460, Protein of unknown function (DUF1460)	NA|183aa|up_6|NC_013161.1_4363677_4364226_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|89aa|up_5|NC_013161.1_4364328_4364595_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|509aa|up_4|NC_013161.1_4364652_4366179_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|70aa|up_3|NC_013161.1_4366415_4366625_+	NA	NA|276aa|up_2|NC_013161.1_4366681_4367509_-	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|314aa|up_1|NC_013161.1_4367663_4368605_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|418aa|up_0|NC_013161.1_4368633_4369887_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|425aa|down_0|NC_013161.1_4370664_4371939_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|275aa|down_1|NC_013161.1_4371935_4372760_-	pfam12644, DUF3782, Protein of unknown function (DUF3782)	NA|422aa|down_2|NC_013161.1_4373438_4374704_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|62aa|down_3|NC_013161.1_4374654_4374840_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|437aa|down_4|NC_013161.1_4374993_4376304_-	COG4247, Phy, 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]	c2c9_V-U4|404aa|down_5|NC_013161.1_4376696_4377908_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|598aa|down_6|NC_013161.1_4378018_4379812_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|650aa|down_7|NC_013161.1_4380233_4382183_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|142aa|down_8|NC_013161.1_4382183_4382609_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|810aa|down_9|NC_013161.1_4382708_4385138_+	COG0699, COG0699, Predicted GTPases (dynamin-related) [General function prediction only]
GCF_000024045.1_ASM2404v1	NC_013161	Rippkaea orientalis PCC 8802, complete sequence	13	4372439-4372550	10	CRISPRCasFinder	no	c2c9_V-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2	Type V-U4	TATTTTCATCCCATTTACGCGCTTGTTCTTCCCTATC	37	0	0	NA	NA	NA	1	1	TypeV-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|70aa|up_5|NC_013161.1_4366415_4366625_+,NA|117aa|up_1|NC_013161.1_4369929_4370280_-,NA|88aa|down_9|NC_013161.1_4385876_4386140_+	NA|311aa|up_9|NC_013161.1_4362616_4363549_-	pfam07313, DUF1460, Protein of unknown function (DUF1460)	NA|183aa|up_8|NC_013161.1_4363677_4364226_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|89aa|up_7|NC_013161.1_4364328_4364595_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|509aa|up_6|NC_013161.1_4364652_4366179_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|70aa|up_5|NC_013161.1_4366415_4366625_+	NA	NA|276aa|up_4|NC_013161.1_4366681_4367509_-	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|314aa|up_3|NC_013161.1_4367663_4368605_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|418aa|up_2|NC_013161.1_4368633_4369887_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|117aa|up_1|NC_013161.1_4369929_4370280_-	NA	NA|425aa|up_0|NC_013161.1_4370664_4371939_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|422aa|down_0|NC_013161.1_4373438_4374704_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|62aa|down_1|NC_013161.1_4374654_4374840_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|437aa|down_2|NC_013161.1_4374993_4376304_-	COG4247, Phy, 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]	c2c9_V-U4|404aa|down_3|NC_013161.1_4376696_4377908_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|598aa|down_4|NC_013161.1_4378018_4379812_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|650aa|down_5|NC_013161.1_4380233_4382183_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|142aa|down_6|NC_013161.1_4382183_4382609_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|810aa|down_7|NC_013161.1_4382708_4385138_+	COG0699, COG0699, Predicted GTPases (dynamin-related) [General function prediction only]	NA|113aa|down_8|NC_013161.1_4385376_4385715_+	pfam08872, KGK, KGK domain	NA|88aa|down_9|NC_013161.1_4385876_4386140_+	NA
GCF_000024045.1_ASM2404v1	NC_013160	Rippkaea orientalis PCC 8802 plasmid pP880201, complete sequence	1	45215-45763	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	cas2,cas1,cas4,cas6,cas3,cas5,cas7,cas8b5,WYL	Unclear	GTTTCAATCCCTCATAGGGATTAACAATAATTGGAAC,GTTTCAATCCCTCATAGGGATTAACAATAATTGGAAC,GTTTCAATCCC--TCATAGGGATTAACAATAATTGGAAC	37,37,39	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	7,7,6	7	Unclear	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|359aa|up_9|NC_013160.1_35544_36621_-,NA|146aa|up_8|NC_013160.1_36644_37082_-,NA|124aa|up_7|NC_013160.1_37388_37760_-,NA|120aa|up_6|NC_013160.1_37779_38139_-,NA|98aa|up_5|NC_013160.1_38290_38584_-,NA|98aa|up_4|NC_013160.1_38660_38954_-,NA|182aa|up_3|NC_013160.1_39457_40003_-,NA|125aa|down_2|NC_013160.1_48533_48908_-,cas5|274aa|down_7|NC_013160.1_54052_54874_-,cas7|303aa|down_8|NC_013160.1_54877_55786_-	NA|359aa|up_9|NC_013160.1_35544_36621_-	NA	NA|146aa|up_8|NC_013160.1_36644_37082_-	NA	NA|124aa|up_7|NC_013160.1_37388_37760_-	NA	NA|120aa|up_6|NC_013160.1_37779_38139_-	NA	NA|98aa|up_5|NC_013160.1_38290_38584_-	NA	NA|98aa|up_4|NC_013160.1_38660_38954_-	NA	NA|182aa|up_3|NC_013160.1_39457_40003_-	NA	NA|303aa|up_2|NC_013160.1_40428_41337_-	pfam11845, DUF3365, Protein of unknown function (DUF3365)	NA|287aa|up_1|NC_013160.1_41552_42413_+	pfam12974, Phosphonate-bd, ABC transporter, phosphonate, periplasmic substrate-binding protein	NA|214aa|up_0|NC_013160.1_42432_43074_+	pfam05685, Uma2, Putative restriction endonuclease	cas2|91aa|down_0|NC_013160.1_45997_46270_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|552aa|down_1|NC_013160.1_46846_48502_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|125aa|down_2|NC_013160.1_48533_48908_-	NA	cas1|230aa|down_3|NC_013160.1_48992_49682_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas4|195aa|down_4|NC_013160.1_49772_50357_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|276aa|down_5|NC_013160.1_50435_51263_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas3|915aa|down_6|NC_013160.1_51315_54060_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|274aa|down_7|NC_013160.1_54052_54874_-	NA	cas7|303aa|down_8|NC_013160.1_54877_55786_-	NA	cas8b5|839aa|down_9|NC_013160.1_55788_58305_-	TIGR01069, Endonuclease_MutS2, MutS2 family protein
GCF_000024045.1_ASM2404v1	NC_013163	Rippkaea orientalis PCC 8802 plasmid pP880202, complete sequence	1	16017-16123	1	CRISPRCasFinder	no			Orphan	GGGGACAGGGCTCCTTTTTTGTATACACTCTT	32	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,DinG,c2c5_V-U5,Cas14c_CAS-V-F,Cas14u_CAS-V,cas14k,RT,c2c8_V-U2,cas5,cas7,cas8b5	NA|119aa|up_8|NC_013163.1_7585_7942_-,NA|312aa|up_7|NC_013163.1_8059_8995_-,NA|195aa|up_4|NC_013163.1_9790_10375_+,NA|147aa|up_3|NC_013163.1_10561_11002_-,NA|47aa|down_0|NC_013163.1_19228_19369_-,NA|103aa|down_1|NC_013163.1_19503_19812_-,NA|363aa|down_2|NC_013163.1_20015_21104_+	NA|317aa|up_9|NC_013163.1_6532_7483_-	COG4974, XerD, Site-specific recombinase XerD [DNA replication, recombination, and repair]	NA|119aa|up_8|NC_013163.1_7585_7942_-	NA	NA|312aa|up_7|NC_013163.1_8059_8995_-	NA	NA|129aa|up_6|NC_013163.1_9038_9425_-	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|74aa|up_5|NC_013163.1_9421_9643_-	TIGR02609, hypothetical_protein_XAC1195, putative addiction module antidote	NA|195aa|up_4|NC_013163.1_9790_10375_+	NA	NA|147aa|up_3|NC_013163.1_10561_11002_-	NA	NA|463aa|up_2|NC_013163.1_11001_12390_-	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|282aa|up_1|NC_013163.1_13293_14139_+	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|518aa|up_0|NC_013163.1_14108_15662_+	pfam13155, Toprim_2, Toprim-like	NA|47aa|down_0|NC_013163.1_19228_19369_-	NA	NA|103aa|down_1|NC_013163.1_19503_19812_-	NA	NA|363aa|down_2|NC_013163.1_20015_21104_+	NA	NA|321aa|down_3|NC_013163.1_21197_22160_+	sd00006, TPR, Tetratricopeptide repeat	NA|529aa|down_4|NC_013163.1_22283_23870_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
