assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	1	500712-500819	1	CRISPRCasFinder	no	csa3	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Type I-A	AATTCGTGACAATTCGTGGACAGGAATC	28	0	0	NA	NA	NA	1	1	Orphan	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA|149aa|up_9|NC_011059.1_490010_490457_+,NA|205aa|down_8|NC_011059.1_511369_511984_+,NA|211aa|down_9|NC_011059.1_512016_512649_+	NA|149aa|up_9|NC_011059.1_490010_490457_+	NA	NA|212aa|up_8|NC_011059.1_490547_491183_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|231aa|up_7|NC_011059.1_491193_491886_-	cd00144, MPP_PPP_family, phosphoprotein phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|169aa|up_6|NC_011059.1_491909_492416_-	PRK00202, nusB, transcription antitermination factor NusB	NA|231aa|up_5|NC_011059.1_492435_493128_-	cd07506, HAD_like, uncharacterized family of the haloacid dehalogenase-like (HAD) hydrolase superfamily	NA|1187aa|up_4|NC_011059.1_493117_496678_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|292aa|up_3|NC_011059.1_496785_497661_-	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|343aa|up_2|NC_011059.1_497681_498710_-	cd06160, S2P-M50_like_2, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|152aa|up_1|NC_011059.1_498797_499253_-	pfam14376, Haem_bd, Haem-binding domain	NA|301aa|up_0|NC_011059.1_499378_500281_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|543aa|down_0|NC_011059.1_500970_502599_+	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|206aa|down_1|NC_011059.1_502793_503411_+	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|321aa|down_2|NC_011059.1_503490_504453_+	pfam13449, Phytase-like, Esterase-like activity of phytase	csa3|121aa|down_3|NC_011059.1_504503_504866_+	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|358aa|down_4|NC_011059.1_504933_506007_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|355aa|down_5|NC_011059.1_506173_507238_+	COG4782, COG4782, Uncharacterized protein conserved in bacteria [Function unknown]	NA|212aa|down_6|NC_011059.1_507359_507995_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|947aa|down_7|NC_011059.1_508017_510858_-	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|205aa|down_8|NC_011059.1_511369_511984_+	NA	NA|211aa|down_9|NC_011059.1_512016_512649_+	NA
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	2	1536024-1538308	2,1,1,2	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,cas3,DEDDh,WYL	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Type I-E	GAAACACCCCCACGCCTGTGGGGAAGAC,GAAACACCCCCACGCCTGTGGGGAAGAC,AACACCCCCACGCCTGTGGGGAAGAC,GAAACACCCCCACGCCTGTGGGGAAGAC	28,28,26,28	1	1	1536052-1536084	NC_011059.1_494741-494773	I-E,II-B:I-E,II-B:I-E,II-B:I-E,II-B	37,37,35,35	37	TypeI-E	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA|95aa|up_4|NC_011059.1_1530346_1530631_-,NA	NA|281aa|up_9|NC_011059.1_1523233_1524076_-	TIGR02297, AraC-type_DNA-binding_domain-containing_protein, 4-hydroxyphenylacetate catabolism regulatory protein HpaA	NA|1031aa|up_8|NC_011059.1_1524078_1527171_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|507aa|up_7|NC_011059.1_1527431_1528952_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|204aa|up_6|NC_011059.1_1529019_1529631_-	pfam02517, Abi, CAAX protease self-immunity	NA|194aa|up_5|NC_011059.1_1529727_1530309_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|95aa|up_4|NC_011059.1_1530346_1530631_-	NA	NA|399aa|up_3|NC_011059.1_1530717_1531914_-	TIGR02032, Uncharacterized_protein_MJ1520, geranylgeranyl reductase family	NA|328aa|up_2|NC_011059.1_1532114_1533098_+	pfam00762, Ferrochelatase, Ferrochelatase	NA|428aa|up_1|NC_011059.1_1533310_1534594_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|293aa|up_0|NC_011059.1_1534920_1535799_-	PRK11873, arsM, arsenite methyltransferase	cas2|102aa|down_0|NC_011059.1_1538339_1538645_-	cd09648, Cas2_I-E, CRISPR/Cas system-associated protein Cas2	cas1|292aa|down_1|NC_011059.1_1538647_1539523_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas5|244aa|down_2|NC_011059.1_1539512_1540244_-	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas7|346aa|down_3|NC_011059.1_1540240_1541278_-	cd09646, Cas7_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas6e|209aa|down_4|NC_011059.1_1541296_1541923_-	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cse2gr11|178aa|down_5|NC_011059.1_1541919_1542453_-	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas8e|520aa|down_6|NC_011059.1_1542456_1544016_-	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cas3|861aa|down_7|NC_011059.1_1544036_1546619_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	DEDDh|181aa|down_8|NC_011059.1_1546639_1547182_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	WYL|325aa|down_9|NC_011059.1_1547171_1548146_-	pfam13280, WYL, WYL domain
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	3	1896314-1896402	3	CRISPRCasFinder	no		RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Orphan	GTCTCTTGTCACTCGTCACGCGTCAC	26	0	0	NA	NA	NA	1	1	Orphan	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA|161aa|up_6|NC_011059.1_1888322_1888805_+,NA|328aa|up_1|NC_011059.1_1894140_1895124_+,NA|255aa|up_0|NC_011059.1_1895117_1895882_+,NA|78aa|down_1|NC_011059.1_1898804_1899038_-	NA|182aa|up_9|NC_011059.1_1883927_1884473_-	pfam00908, dTDP_sugar_isom, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|292aa|up_8|NC_011059.1_1884976_1885852_-	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|474aa|up_7|NC_011059.1_1886033_1887455_-	TIGR01479, Mannose-1-phosphate_guanylyltransferase, mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase	NA|161aa|up_6|NC_011059.1_1888322_1888805_+	NA	NA|172aa|up_5|NC_011059.1_1889538_1890054_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|94aa|up_4|NC_011059.1_1890050_1890332_-	pfam08681, DUF1778, Protein of unknown function (DUF1778)	NA|251aa|up_3|NC_011059.1_1891005_1891758_-	cd06259, YdcF-like, YdcF-like	NA|470aa|up_2|NC_011059.1_1892272_1893682_-	cd03088, ManB, ManB is a bacterial phosphomannomutase (PMM) that catalyzes the conversion of mannose 6-phosphate to mannose-1-phosphate in the second of three steps in the GDP-mannose pathway, in which GDP-D-mannose is synthesized from fructose-6-phosphate	NA|328aa|up_1|NC_011059.1_1894140_1895124_+	NA	NA|255aa|up_0|NC_011059.1_1895117_1895882_+	NA	NA|503aa|down_0|NC_011059.1_1896592_1898101_+	pfam02661, Fic, Fic/DOC family	NA|78aa|down_1|NC_011059.1_1898804_1899038_-	NA	NA|131aa|down_2|NC_011059.1_1899034_1899427_-	cd09873, PIN_Pae0151-like, VapC-like PIN domain of the Pyrobaculum aerophilum Pae0151 and Pae2754 proteins and homologs	NA|629aa|down_3|NC_011059.1_1900312_1902199_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|145aa|down_4|NC_011059.1_1902206_1902641_-	pfam00582, Usp, Universal stress protein family	NA|344aa|down_5|NC_011059.1_1903144_1904176_-	COG2605, COG2605, Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only]	NA|188aa|down_6|NC_011059.1_1904202_1904766_-	cd07503, HAD_HisB-N, histidinol phosphate phosphatase and related phosphatases	NA|238aa|down_7|NC_011059.1_1904749_1905463_-	cd06915, NTP_transferase_WcbM_like, WcbM_like is a subfamily of nucleotidyl transferases	NA|200aa|down_8|NC_011059.1_1905459_1906059_-	PRK13937, PRK13937, phosphoheptose isomerase; Provisional	NA|197aa|down_9|NC_011059.1_1906391_1906982_-	COG0529, CysC, Adenylylsulfate kinase and related kinases [Inorganic ion transport and metabolism]
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	4	2174895-2175529	4,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Type III-D,Type III-B,Type III-A,Type III-C	GTCTCAATCCCCCTTACTCAATCGGGTCTTCCTACAC,GTCTCAATCCCCCTTACTCAATCGGGTCTTCCTACAC,GTCTCAATCCCCCTTACTCAATCGGGTCTTCCTACAC	37,37,37	0	0	NA	NA	NA:NA:NA	8,8,7	8	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA|131aa|up_8|NC_011059.1_2162473_2162866_+,NA|360aa|down_2|NC_011059.1_2178120_2179200_-,NA|445aa|down_3|NC_011059.1_2179330_2180665_-,cmr5gr11|138aa|down_8|NC_011059.1_2186998_2187412_+,cmr3gr5|288aa|down_9|NC_011059.1_2187408_2188272_+	NA|1198aa|up_9|NC_011059.1_2158554_2162148_-	TIGR01414, Contains:_AIDA-I_translocator, outer membrane autotransporter barrel domain	NA|131aa|up_8|NC_011059.1_2162473_2162866_+	NA	NA|1278aa|up_7|NC_011059.1_2162973_2166807_+	PRK12493, PRK12493, magnesium chelatase subunit H; Provisional	NA|234aa|up_6|NC_011059.1_2166856_2167558_+	PRK07580, PRK07580, Mg-protoporphyrin IX methyl transferase; Validated	NA|547aa|up_5|NC_011059.1_2167705_2169346_+	TIGR02026, BchE, magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase	NA|140aa|up_4|NC_011059.1_2169422_2169842_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|297aa|up_3|NC_011059.1_2170027_2170918_-	pfam14505, DUF4438, Domain of unknown function (DUF4438)	NA|411aa|up_2|NC_011059.1_2171093_2172326_-	pfam13194, DUF4010, Domain of unknown function (DUF4010)	NA|461aa|up_1|NC_011059.1_2172683_2174066_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|137aa|up_0|NC_011059.1_2174118_2174529_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase	cas1|75aa|down_0|NC_011059.1_2175841_2176066_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas6|333aa|down_1|NC_011059.1_2176062_2177061_-	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|360aa|down_2|NC_011059.1_2178120_2179200_-	NA	NA|445aa|down_3|NC_011059.1_2179330_2180665_-	NA	cmr1gr7|387aa|down_4|NC_011059.1_2180823_2181984_+	TIGR01894, hypothetical_protein, CRISPR type III-B/RAMP module RAMP protein Cmr1	cmr6gr7|360aa|down_5|NC_011059.1_2181988_2183068_+	pfam03787, RAMPs, RAMP superfamily	cas10|981aa|down_6|NC_011059.1_2183064_2186007_+	cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr4gr7|333aa|down_7|NC_011059.1_2186003_2187002_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|138aa|down_8|NC_011059.1_2186998_2187412_+	NA	cmr3gr5|288aa|down_9|NC_011059.1_2187408_2188272_+	NA
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	5	2177296-2177690	5,3,4	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Type III-D,Type III-B,Type III-A,Type III-C	GTCGCAATCCCCTGTAGCTAATCGGGTCTTCAAAC,GTCGCAATCCCCTGTAGCTAATCGGGTCTTCAAAC,GTCGCAATCCCCTGTAGCTAATCGGGTCTTCAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	5,5,4	5	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA,NA|360aa|down_0|NC_011059.1_2178120_2179200_-,NA|445aa|down_1|NC_011059.1_2179330_2180665_-,cmr5gr11|138aa|down_6|NC_011059.1_2186998_2187412_+,cmr3gr5|288aa|down_7|NC_011059.1_2187408_2188272_+	NA|1278aa|up_9|NC_011059.1_2162973_2166807_+	PRK12493, PRK12493, magnesium chelatase subunit H; Provisional	NA|234aa|up_8|NC_011059.1_2166856_2167558_+	PRK07580, PRK07580, Mg-protoporphyrin IX methyl transferase; Validated	NA|547aa|up_7|NC_011059.1_2167705_2169346_+	TIGR02026, BchE, magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase	NA|140aa|up_6|NC_011059.1_2169422_2169842_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|297aa|up_5|NC_011059.1_2170027_2170918_-	pfam14505, DUF4438, Domain of unknown function (DUF4438)	NA|411aa|up_4|NC_011059.1_2171093_2172326_-	pfam13194, DUF4010, Domain of unknown function (DUF4010)	NA|461aa|up_3|NC_011059.1_2172683_2174066_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|137aa|up_2|NC_011059.1_2174118_2174529_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase	cas1|75aa|up_1|NC_011059.1_2175841_2176066_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas6|333aa|up_0|NC_011059.1_2176062_2177061_-	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|360aa|down_0|NC_011059.1_2178120_2179200_-	NA	NA|445aa|down_1|NC_011059.1_2179330_2180665_-	NA	cmr1gr7|387aa|down_2|NC_011059.1_2180823_2181984_+	TIGR01894, hypothetical_protein, CRISPR type III-B/RAMP module RAMP protein Cmr1	cmr6gr7|360aa|down_3|NC_011059.1_2181988_2183068_+	pfam03787, RAMPs, RAMP superfamily	cas10|981aa|down_4|NC_011059.1_2183064_2186007_+	cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr4gr7|333aa|down_5|NC_011059.1_2186003_2187002_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|138aa|down_6|NC_011059.1_2186998_2187412_+	NA	cmr3gr5|288aa|down_7|NC_011059.1_2187408_2188272_+	NA	NA|147aa|down_8|NC_011059.1_2190459_2190900_+	COG3753, COG3753, Uncharacterized protein conserved in bacteria [Function unknown]	NA|317aa|down_9|NC_011059.1_2191009_2191960_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	6	2188672-2188786	6	CRISPRCasFinder	no	cas1,cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Type III-D,Type III-B,Type III-A,Type III-C	GTCTCAATCCCCTTTATCCAAGCGGGTCTTCATCTAC	37	0	0	NA	NA	NA	1	1	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA|360aa|up_7|NC_011059.1_2178120_2179200_-,NA|445aa|up_6|NC_011059.1_2179330_2180665_-,cmr5gr11|138aa|up_1|NC_011059.1_2186998_2187412_+,cmr3gr5|288aa|up_0|NC_011059.1_2187408_2188272_+,NA	cas1|75aa|up_9|NC_011059.1_2175841_2176066_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas6|333aa|up_8|NC_011059.1_2176062_2177061_-	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|360aa|up_7|NC_011059.1_2178120_2179200_-	NA	NA|445aa|up_6|NC_011059.1_2179330_2180665_-	NA	cmr1gr7|387aa|up_5|NC_011059.1_2180823_2181984_+	TIGR01894, hypothetical_protein, CRISPR type III-B/RAMP module RAMP protein Cmr1	cmr6gr7|360aa|up_4|NC_011059.1_2181988_2183068_+	pfam03787, RAMPs, RAMP superfamily	cas10|981aa|up_3|NC_011059.1_2183064_2186007_+	cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr4gr7|333aa|up_2|NC_011059.1_2186003_2187002_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|138aa|up_1|NC_011059.1_2186998_2187412_+	NA	cmr3gr5|288aa|up_0|NC_011059.1_2187408_2188272_+	NA	NA|147aa|down_0|NC_011059.1_2190459_2190900_+	COG3753, COG3753, Uncharacterized protein conserved in bacteria [Function unknown]	NA|317aa|down_1|NC_011059.1_2191009_2191960_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|384aa|down_2|NC_011059.1_2192089_2193241_-	TIGR03469, HpnB, hopene-associated glycosyltransferase HpnB	NA|295aa|down_3|NC_011059.1_2193342_2194227_-	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|246aa|down_4|NC_011059.1_2194245_2194983_-	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|333aa|down_5|NC_011059.1_2195798_2196797_-	PRK12392, PRK12392, bacteriochlorophyll c synthase; Provisional	NA|442aa|down_6|NC_011059.1_2196928_2198254_-	PRK14340, PRK14340, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|204aa|down_7|NC_011059.1_2198661_2199273_-	TIGR02000, Nitrogen_fixation_protein_NifU, Fe-S cluster assembly protein NifU	NA|402aa|down_8|NC_011059.1_2199256_2200462_-	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|310aa|down_9|NC_011059.1_2200514_2201444_-	TIGR01136, Cysteine_synthase, cysteine synthase
GCF_000020625.1_ASM2062v1	NC_011059	Prosthecochloris aestuarii DSM 271, complete sequence	7	2189181-2189643	7,4,5	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	Type III-D,Type III-B,Type III-A,Type III-C	GTCTCAATCCCCTTTATCCAAGCGGGTCTTCATCTAC,GTCTCAATCCCCTTTATCCAAGCGGGTCTTCATCTAC,GTCTCAATCCCCTTTATCCAAGCGGGTCTTCATCTAC	37,37,37	0	0	NA	NA	NA:NA:NA	6,6,5	6	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	RT,Cas9_archaeal,PD-DExK,csa3,cas6,cas3,DEDDh,cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,WYL,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas4	NA|360aa|up_7|NC_011059.1_2178120_2179200_-,NA|445aa|up_6|NC_011059.1_2179330_2180665_-,cmr5gr11|138aa|up_1|NC_011059.1_2186998_2187412_+,cmr3gr5|288aa|up_0|NC_011059.1_2187408_2188272_+,NA	cas1|75aa|up_9|NC_011059.1_2175841_2176066_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas6|333aa|up_8|NC_011059.1_2176062_2177061_-	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|360aa|up_7|NC_011059.1_2178120_2179200_-	NA	NA|445aa|up_6|NC_011059.1_2179330_2180665_-	NA	cmr1gr7|387aa|up_5|NC_011059.1_2180823_2181984_+	TIGR01894, hypothetical_protein, CRISPR type III-B/RAMP module RAMP protein Cmr1	cmr6gr7|360aa|up_4|NC_011059.1_2181988_2183068_+	pfam03787, RAMPs, RAMP superfamily	cas10|981aa|up_3|NC_011059.1_2183064_2186007_+	cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr4gr7|333aa|up_2|NC_011059.1_2186003_2187002_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|138aa|up_1|NC_011059.1_2186998_2187412_+	NA	cmr3gr5|288aa|up_0|NC_011059.1_2187408_2188272_+	NA	NA|147aa|down_0|NC_011059.1_2190459_2190900_+	COG3753, COG3753, Uncharacterized protein conserved in bacteria [Function unknown]	NA|317aa|down_1|NC_011059.1_2191009_2191960_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|384aa|down_2|NC_011059.1_2192089_2193241_-	TIGR03469, HpnB, hopene-associated glycosyltransferase HpnB	NA|295aa|down_3|NC_011059.1_2193342_2194227_-	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|246aa|down_4|NC_011059.1_2194245_2194983_-	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|333aa|down_5|NC_011059.1_2195798_2196797_-	PRK12392, PRK12392, bacteriochlorophyll c synthase; Provisional	NA|442aa|down_6|NC_011059.1_2196928_2198254_-	PRK14340, PRK14340, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|204aa|down_7|NC_011059.1_2198661_2199273_-	TIGR02000, Nitrogen_fixation_protein_NifU, Fe-S cluster assembly protein NifU	NA|402aa|down_8|NC_011059.1_2199256_2200462_-	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|310aa|down_9|NC_011059.1_2200514_2201444_-	TIGR01136, Cysteine_synthase, cysteine synthase
