assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	1	33988-34243	1	PILER-CR	no	DEDDh	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Unclear	CTCGCACAAGTGAAAGAAGATGGCAAATGGGGCTACATTGATTCCACAGGAA	52	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA,NA|98aa|down_2|NC_011026.1_36619_36913_+,NA|57aa|down_5|NC_011026.1_41054_41225_+,NA|315aa|down_6|NC_011026.1_41347_42292_+	NA|157aa|up_9|NC_011026.1_17055_17526_+	pfam00359, PTS_EIIA_2, Phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 2	NA|428aa|up_8|NC_011026.1_17544_18828_+	PRK00037, hisS, histidyl-tRNA synthetase; Reviewed	NA|260aa|up_7|NC_011026.1_18895_19675_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|774aa|up_6|NC_011026.1_20292_22614_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|425aa|up_5|NC_011026.1_22701_23976_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|475aa|up_4|NC_011026.1_23978_25403_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|788aa|up_3|NC_011026.1_25461_27825_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|137aa|up_2|NC_011026.1_28068_28479_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|1333aa|up_1|NC_011026.1_28502_32501_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|141aa|up_0|NC_011026.1_32744_33167_+	TIGR04320, hypothetical_protein, SEC10/PgrA surface exclusion domain	NA|255aa|down_0|NC_011026.1_34539_35304_+	cd03394, PAP2_like_5, PAP2_like_5 proteins	NA|433aa|down_1|NC_011026.1_35322_36621_-	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|98aa|down_2|NC_011026.1_36619_36913_+	NA	NA|573aa|down_3|NC_011026.1_37079_38798_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|238aa|down_4|NC_011026.1_39152_39866_+	PRK14410, PRK14410, glycerol-3-phosphate acyltransferase	NA|57aa|down_5|NC_011026.1_41054_41225_+	NA	NA|315aa|down_6|NC_011026.1_41347_42292_+	NA	DEDDh|579aa|down_7|NC_011026.1_42574_44311_+	PRK07883, PRK07883, DEDD exonuclease domain-containing protein	NA|223aa|down_8|NC_011026.1_44500_45169_+	cd11648, RsmI, Ribosomal RNA small subunit methyltransferase I (RsmI), also known as rRNA (cytidine-2'-O-)-methyltransferase	NA|285aa|down_9|NC_011026.1_45155_46010_+	PRK00380, panC, pantoate--beta-alanine ligase; Reviewed
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	2	442017-442111	1	CRISPRCasFinder	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	CTCAAATACCTTCAGCAATCGCTTCAAAT	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|156aa|up_9|NC_011026.1_425691_426159_+,NA|148aa|up_1|NC_011026.1_438465_438909_+,NA	NA|156aa|up_9|NC_011026.1_425691_426159_+	NA	NA|354aa|up_8|NC_011026.1_426163_427225_-	TIGR00990, Mitochondrial_import_receptor_subunit_TOM70, mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70)	NA|1063aa|up_7|NC_011026.1_427370_430559_-	COG4946, COG4946, Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]	NA|195aa|up_6|NC_011026.1_430602_431187_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|516aa|up_5|NC_011026.1_431176_432724_-	PRK10859, PRK10859, membrane-bound lytic murein transglycosylase MltF	NA|691aa|up_4|NC_011026.1_432970_435043_-	TIGR04456, hypothetical_protein_ACD_77C00477G0043, LruC domain	NA|391aa|up_3|NC_011026.1_435211_436384_-	cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose synthase (CESA) superfamily	NA|321aa|up_2|NC_011026.1_436398_437361_-	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|148aa|up_1|NC_011026.1_438465_438909_+	NA	NA|83aa|up_0|NC_011026.1_438928_439177_+	smart00834, CxxC_CXXC_SSSS, Putative regulatory protein	NA|206aa|down_0|NC_011026.1_443136_443754_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|424aa|down_1|NC_011026.1_443750_445022_-	cd04950, GT4_TuaH-like, teichuronic acid biosynthesis glycosyltransferase TuaH and similar proteins	NA|304aa|down_2|NC_011026.1_445018_445930_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|439aa|down_3|NC_011026.1_445926_447243_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|352aa|down_4|NC_011026.1_447569_448625_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|365aa|down_5|NC_011026.1_448629_449724_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|503aa|down_6|NC_011026.1_449720_451229_-	pfam04932, Wzy_C, O-Antigen ligase	NA|421aa|down_7|NC_011026.1_451209_452472_-	pfam13632, Glyco_trans_2_3, Glycosyl transferase family group 2	NA|727aa|down_8|NC_011026.1_452468_454649_-	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]	NA|239aa|down_9|NC_011026.1_454734_455451_-	pfam02321, OEP, Outer membrane efflux protein
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	3	783080-783201	2	CRISPRCasFinder	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	GGCATGGGTGGCCCTGGTGGCGGCATGGG	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|126aa|up_3|NC_011026.1_778335_778713_+,NA|207aa|down_0|NC_011026.1_783404_784025_-,NA|235aa|down_5|NC_011026.1_788062_788767_-	NA|417aa|up_9|NC_011026.1_770713_771964_-	PRK08363, PRK08363, alanine aminotransferase; Validated	NA|325aa|up_8|NC_011026.1_772007_772982_-	pfam05103, DivIVA, DivIVA protein	NA|440aa|up_7|NC_011026.1_773298_774618_-	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|240aa|up_6|NC_011026.1_774994_775714_+	cd03395, PAP2_like_4, PAP2_like_4 proteins	NA|526aa|up_5|NC_011026.1_775728_777306_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|235aa|up_4|NC_011026.1_777605_778310_+	cd19359, TenA_C_Bt3146-like, uncharacterized TenA_C proteins similar to Bacteroides thetaiotaomicron Bt3146	NA|126aa|up_3|NC_011026.1_778335_778713_+	NA	NA|554aa|up_2|NC_011026.1_778812_780474_-	cd03877, M28_like, M28 Zn-peptidase, many containing a protease-associated (PA) domain insert	NA|408aa|up_1|NC_011026.1_780497_781721_-	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|127aa|up_0|NC_011026.1_782011_782392_+	pfam14358, DUF4405, Domain of unknown function (DUF4405)	NA|207aa|down_0|NC_011026.1_783404_784025_-	NA	NA|250aa|down_1|NC_011026.1_784308_785058_-	TIGR01830, 3-oxoacyl-_reductase_FabG, 3-oxoacyl-(acyl-carrier-protein) reductase	NA|308aa|down_2|NC_011026.1_785075_785999_-	COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism]	NA|329aa|down_3|NC_011026.1_786139_787126_-	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|184aa|down_4|NC_011026.1_787509_788061_-	COG2229, COG2229, Predicted GTPase [General function prediction only]	NA|235aa|down_5|NC_011026.1_788062_788767_-	NA	NA|109aa|down_6|NC_011026.1_788839_789166_-	pfam13682, CZB, Chemoreceptor zinc-binding domain	NA|124aa|down_7|NC_011026.1_789190_789562_-	COG2018, COG2018, Uncharacterized distant relative of homeotic protein bithoraxoid [General function prediction only]	NA|153aa|down_8|NC_011026.1_790583_791042_-	pfam09980, DUF2214, Predicted membrane protein (DUF2214)	NA|89aa|down_9|NC_011026.1_791060_791327_-	TIGR01003, Phosphocarrier_protein_HPr, Phosphotransferase System HPr (HPr) Family
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	4	1070142-1070254	3	CRISPRCasFinder	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	AAAAAGCGAACCCAAGAGCTTCAGGCGTCGCAGGACGA	38	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|316aa|up_9|NC_011026.1_1055828_1056776_+,NA|127aa|up_5|NC_011026.1_1063072_1063453_-,NA|460aa|down_1|NC_011026.1_1072392_1073772_+,NA|419aa|down_4|NC_011026.1_1077512_1078769_+,NA|164aa|down_7|NC_011026.1_1080384_1080876_+	NA|316aa|up_9|NC_011026.1_1055828_1056776_+	NA	NA|440aa|up_8|NC_011026.1_1056830_1058150_-	TIGR00275, TIGR00275, flavoprotein, HI0933 family	NA|469aa|up_7|NC_011026.1_1058779_1060186_+	PRK09249, PRK09249, coproporphyrinogen dehydrogenase	NA|634aa|up_6|NC_011026.1_1060899_1062801_+	PRK05183, hscA, chaperone protein HscA; Provisional	NA|127aa|up_5|NC_011026.1_1063072_1063453_-	NA	NA|250aa|up_4|NC_011026.1_1063433_1064183_-	pfam14020, DUF4236, Protein of unknown function (DUF4236)	NA|281aa|up_3|NC_011026.1_1064297_1065140_-	cd01193, INT_IntI_C, Integron integrase and similar protiens, C-terminal catalytic domain	NA|401aa|up_2|NC_011026.1_1065541_1066744_-	PRK13028, PRK13028, tryptophan synthase subunit beta; Provisional	NA|131aa|up_1|NC_011026.1_1067432_1067825_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|252aa|up_0|NC_011026.1_1068380_1069136_+	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|313aa|down_0|NC_011026.1_1071236_1072175_+	cd17569, REC_HupR-like, phosphoacceptor receiver (REC) domain of hydrogen uptake protein regulator (HupR) and similar domains	NA|460aa|down_1|NC_011026.1_1072392_1073772_+	NA	NA|338aa|down_2|NC_011026.1_1074007_1075021_-	pfam11845, DUF3365, Protein of unknown function (DUF3365)	NA|528aa|down_3|NC_011026.1_1075382_1076966_-	cd17502, MFS_Azr1_MDR_like, Saccharomyces cerevisiae Azole resistance protein 1 (Azr1p), and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|419aa|down_4|NC_011026.1_1077512_1078769_+	NA	NA|135aa|down_5|NC_011026.1_1078770_1079175_+	pfam08937, DUF1863, MTH538 TIR-like domain (DUF1863)	NA|330aa|down_6|NC_011026.1_1079395_1080385_+	TIGR00571, DNA_adenine_methylase, DNA adenine methylase (dam)	NA|164aa|down_7|NC_011026.1_1080384_1080876_+	NA	NA|81aa|down_8|NC_011026.1_1081480_1081723_+	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|722aa|down_9|NC_011026.1_1081979_1084145_-	cd08255, 2-desacetyl-2-hydroxyethyl_bacteriochlorophyllide_like, 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide and other MDR family members
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	5	1569992-1570109	4	CRISPRCasFinder	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	TGTCATGCTGAACAAAGTGAAGCATCTACCCGTCAT	36	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|424aa|up_6|NC_011026.1_1562283_1563555_-,NA|504aa|up_5|NC_011026.1_1563604_1565116_-,NA|407aa|down_9|NC_011026.1_1581003_1582224_-	NA|268aa|up_9|NC_011026.1_1558870_1559674_+	PRK14282, PRK14282, chaperone protein DnaJ; Provisional	NA|367aa|up_8|NC_011026.1_1559839_1560941_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|228aa|up_7|NC_011026.1_1561219_1561903_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|424aa|up_6|NC_011026.1_1562283_1563555_-	NA	NA|504aa|up_5|NC_011026.1_1563604_1565116_-	NA	NA|281aa|up_4|NC_011026.1_1565319_1566162_-	pfam09992, NAGPA, Phosphodiester glycosidase	NA|232aa|up_3|NC_011026.1_1566268_1566964_-	pfam02586, SRAP, SOS response associated peptidase (SRAP)	NA|331aa|up_2|NC_011026.1_1566975_1567968_+	cd07396, MPP_Nbla03831, Homo sapiens Nbla03831 and related proteins, metallophosphatase domain	NA|309aa|up_1|NC_011026.1_1568083_1569010_+	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|263aa|up_0|NC_011026.1_1569049_1569838_+	COG1624, COG1624, Uncharacterized conserved protein [Function unknown]	NA|648aa|down_0|NC_011026.1_1570268_1572212_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|159aa|down_1|NC_011026.1_1572297_1572774_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|92aa|down_2|NC_011026.1_1572770_1573046_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|84aa|down_3|NC_011026.1_1573108_1573360_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|586aa|down_4|NC_011026.1_1573992_1575750_+	pfam02666, PS_Dcarbxylase, Phosphatidylserine decarboxylase	NA|466aa|down_5|NC_011026.1_1575835_1577233_-	PRK09084, PRK09084, aspartate kinase III; Validated	NA|518aa|down_6|NC_011026.1_1577789_1579343_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|138aa|down_7|NC_011026.1_1579355_1579769_+	cd07007, cupin_CapF-like_C, Staphylococcus aureus CapF and related proteins, C-terminal cupin domain	NA|329aa|down_8|NC_011026.1_1579918_1580905_+	pfam14903, WG_beta_rep, WG containing repeat	NA|407aa|down_9|NC_011026.1_1581003_1582224_-	NA
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	6	1624000-1624147	5	CRISPRCasFinder	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	CCTCAGAATGACAAAGTTCTTTTTCGTCATGGGCAAGCGGAGACGAAGCCTGAT	54	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|379aa|up_5|NC_011026.1_1614975_1616112_+,NA|198aa|down_1|NC_011026.1_1626268_1626862_-,NA|446aa|down_2|NC_011026.1_1627226_1628564_+,NA|218aa|down_5|NC_011026.1_1631198_1631852_+	NA|176aa|up_9|NC_011026.1_1612281_1612809_+	PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional	NA|184aa|up_8|NC_011026.1_1612829_1613381_+	pfam00213, OSCP, ATP synthase delta (OSCP) subunit	NA|130aa|up_7|NC_011026.1_1613661_1614051_-	PRK12497, PRK12497, YraN family protein	NA|205aa|up_6|NC_011026.1_1614104_1614719_-	cd07182, RNase_HII_bacteria_HII_like, Bacterial Ribonuclease HII-like	NA|379aa|up_5|NC_011026.1_1614975_1616112_+	NA	NA|522aa|up_4|NC_011026.1_1616408_1617974_+	pfam02538, Hydantoinase_B, Hydantoinase B/oxoprolinase	NA|319aa|up_3|NC_011026.1_1618123_1619080_+	cd08255, 2-desacetyl-2-hydroxyethyl_bacteriochlorophyllide_like, 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide and other MDR family members	NA|516aa|up_2|NC_011026.1_1619127_1620675_+	COG3845, COG3845, ABC-type uncharacterized transport systems, ATPase components [General function prediction only]	NA|377aa|up_1|NC_011026.1_1620830_1621961_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|499aa|up_0|NC_011026.1_1622047_1623544_-	pfam00365, PFK, Phosphofructokinase	NA|635aa|down_0|NC_011026.1_1624263_1626168_-	PRK04210, PRK04210, phosphoenolpyruvate carboxykinase (GTP)	NA|198aa|down_1|NC_011026.1_1626268_1626862_-	NA	NA|446aa|down_2|NC_011026.1_1627226_1628564_+	NA	NA|639aa|down_3|NC_011026.1_1628556_1630473_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|189aa|down_4|NC_011026.1_1630629_1631196_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|218aa|down_5|NC_011026.1_1631198_1631852_+	NA	NA|147aa|down_6|NC_011026.1_1631999_1632440_+	pfam13801, Metal_resist, Heavy-metal resistance	NA|171aa|down_7|NC_011026.1_1632503_1633016_-	cd01052, DPSL, DPS-like protein, ferritin-like diiron-binding domain	NA|288aa|down_8|NC_011026.1_1633300_1634164_-	PRK06222, PRK06222, sulfide/dihydroorotate dehydrogenase-like FAD/NAD-binding protein	NA|711aa|down_9|NC_011026.1_1634464_1636597_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	7	1653832-1653959	6	CRISPRCasFinder	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	AGCGTAGACGAAGAGCGGCTTCTTTAAAGGCAAACGGTCTTCGTCT	46	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|72aa|up_6|NC_011026.1_1640259_1640475_-,NA|58aa|up_2|NC_011026.1_1644002_1644176_-,NA|62aa|down_6|NC_011026.1_1662830_1663016_-	NA|202aa|up_9|NC_011026.1_1636613_1637219_+	PRK13181, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|263aa|up_8|NC_011026.1_1637253_1638042_+	PRK00748, PRK00748, 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated	NA|623aa|up_7|NC_011026.1_1638281_1640150_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|72aa|up_6|NC_011026.1_1640259_1640475_-	NA	NA|249aa|up_5|NC_011026.1_1640814_1641561_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|458aa|up_4|NC_011026.1_1641585_1642959_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|194aa|up_3|NC_011026.1_1643114_1643696_-	pfam13505, OMP_b-brl, Outer membrane protein beta-barrel domain	NA|58aa|up_2|NC_011026.1_1644002_1644176_-	NA	NA|2488aa|up_1|NC_011026.1_1644174_1651638_+	TIGR04189, gliding_motility-related_protein, cell surface protein SprA	NA|641aa|up_0|NC_011026.1_1651683_1653606_-	PRK14671, uvrC, excinuclease ABC subunit C; Provisional	NA|267aa|down_0|NC_011026.1_1654213_1655014_+	cd07577, Ph0642_like, Pyrococcus horikoshii Ph0642 and related proteins, members of the nitrilase superfamily (putative class 13 nitrilases)	NA|696aa|down_1|NC_011026.1_1655197_1657285_+	PRK12740, PRK12740, elongation factor G-like protein EF-G2	NA|265aa|down_2|NC_011026.1_1657372_1658167_-	cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1	NA|112aa|down_3|NC_011026.1_1658274_1658610_-	TIGR02494, PFLE_PFLC, glycyl-radical enzyme activating protein	NA|680aa|down_4|NC_011026.1_1658961_1661001_-	pfam00223, PsaA_PsaB, Photosystem I psaA/psaB protein	NA|348aa|down_5|NC_011026.1_1661451_1662495_-	pfam02607, B12-binding_2, B12 binding domain	NA|62aa|down_6|NC_011026.1_1662830_1663016_-	NA	NA|226aa|down_7|NC_011026.1_1663040_1663718_-	PRK01362, PRK01362, fructose-6-phosphate aldolase	NA|364aa|down_8|NC_011026.1_1663752_1664844_-	PRK02492, PRK02492, deoxyhypusine synthase	NA|400aa|down_9|NC_011026.1_1665274_1666474_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	8	1815689-1815885	2	PILER-CR	no	cas3	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Unclear	CCGACACCGGAAACTTCGCCCGTTGCATAGCAGTCGCTAAT	41	0	0	NA	NA	NA	2	2	Unclear	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA,NA|111aa|down_6|NC_011026.1_1827271_1827604_-	NA|220aa|up_9|NC_011026.1_1794911_1795571_+	cd00405, PRAI, Phosphoribosylanthranilate isomerase (PRAI) catalyzes the fourth step of the tryptophan biosynthesis, the conversion of N-(5'- phosphoribosyl)-anthranilate (PRA) to 1-(o-carboxyphenylamino)- 1-deoxyribulose 5-phosphate (CdRP)	NA|702aa|up_8|NC_011026.1_1795723_1797829_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|589aa|up_7|NC_011026.1_1797872_1799639_+	pfam04773, FecR, FecR protein	NA|1587aa|up_6|NC_011026.1_1799635_1804396_+	smart00089, PKD, Repeats in polycystic kidney disease 1 (PKD1) and other proteins	NA|449aa|up_5|NC_011026.1_1804584_1805931_+	COG0144, Sun, tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis]	NA|487aa|up_4|NC_011026.1_1806023_1807484_-	pfam03382, DUF285, Mycoplasma protein of unknown function, DUF285	NA|457aa|up_3|NC_011026.1_1807757_1809128_-	PRK09564, PRK09564, coenzyme A disulfide reductase; Reviewed	NA|618aa|up_2|NC_011026.1_1809436_1811290_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|98aa|up_1|NC_011026.1_1811296_1811590_-	cd01614, EutN_CcmL, Ethanolamine utilisation protein and carboxysome structural protein domain family	NA|687aa|up_0|NC_011026.1_1811970_1814031_-	TIGR04183, hypothetical_protein, Por secretion system C-terminal sorting domain	NA|335aa|down_0|NC_011026.1_1816854_1817859_-	cd01854, YjeQ_EngC, Ribosomal interacting GTPase YjeQ/EngC, a circularly permuted subfamily of the Ras GTPases	cas3|712aa|down_1|NC_011026.1_1817871_1820007_-	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|87aa|down_2|NC_011026.1_1820093_1820354_+	PRK01678, rpmE2, type B 50S ribosomal protein L31	NA|188aa|down_3|NC_011026.1_1820759_1821323_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|1327aa|down_4|NC_011026.1_1821693_1825674_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|435aa|down_5|NC_011026.1_1825870_1827175_+	cd01360, Adenylsuccinate_lyase_1, Adenylsuccinate lyase (ASL)_subgroup 1	NA|111aa|down_6|NC_011026.1_1827271_1827604_-	NA	NA|526aa|down_7|NC_011026.1_1827816_1829394_-	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional	NA|210aa|down_8|NC_011026.1_1829458_1830088_+	cd08645, FMT_core_GART, Phosphoribosylglycinamide formyltransferase (GAR transformylase, GART)	NA|224aa|down_9|NC_011026.1_1830270_1830942_+	TIGR04349, Radical_SAM_domain_protein, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE, gammaproteobacterial type
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	9	2084564-2084936	7	CRISPRCasFinder	no	WYL,DEDDh	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Unclear	TTCATTGGAATGAATAGTAATTCAACTATTA	31	0	0	NA	NA	NA	4	4	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|120aa|up_8|NC_011026.1_2075120_2075480_-,NA|169aa|up_3|NC_011026.1_2079263_2079770_+,NA	NA|173aa|up_9|NC_011026.1_2074575_2075094_-	cd06121, cupin_YML079wp, Saccharomyces cerevisiae YML079wp and related proteins, cupin domain	NA|120aa|up_8|NC_011026.1_2075120_2075480_-	NA	NA|177aa|up_7|NC_011026.1_2075610_2076141_-	pfam00731, AIRC, AIR carboxylase	NA|363aa|up_6|NC_011026.1_2076151_2077240_-	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|177aa|up_5|NC_011026.1_2077624_2078155_+	PRK06638, PRK06638, NADH-quinone oxidoreductase subunit J	NA|330aa|up_4|NC_011026.1_2078206_2079196_+	cd06354, PBP1_PrnA-like, periplasmic binding domain of basic membrane lipoprotein, PnrA, in Treponema pallidum and its homologs from other bacteria and Archaea	NA|169aa|up_3|NC_011026.1_2079263_2079770_+	NA	NA|329aa|up_2|NC_011026.1_2079837_2080824_+	cd05228, AR_FR_like_1_SDR_e, uncharacterized subgroup of aldehyde reductase and flavonoid reductase related proteins, extended (e) SDRs	NA|650aa|up_1|NC_011026.1_2081327_2083277_+	COG2905, COG2905, Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms]	DEDDh|230aa|up_0|NC_011026.1_2083273_2083963_+	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|84aa|down_0|NC_011026.1_2085532_2085784_+	TIGR04183, hypothetical_protein, Por secretion system C-terminal sorting domain	NA|442aa|down_1|NC_011026.1_2086329_2087655_+	TIGR01945, Electron_transport_complex_subunit_C, electron transport complex, RnfABCDGE type, C subunit	NA|324aa|down_2|NC_011026.1_2087676_2088648_+	pfam03116, NQR2_RnfD_RnfE, NQR2, RnfD, RnfE family	NA|182aa|down_3|NC_011026.1_2088649_2089195_+	TIGR01947, Electron_transport_complex_subunit_G, electron transport complex, RnfABCDGE type, G subunit	NA|200aa|down_4|NC_011026.1_2089191_2089791_+	PRK12405, PRK12405, electron transport complex RsxE subunit; Provisional	NA|193aa|down_5|NC_011026.1_2089794_2090373_+	TIGR01943, Electron_transport_complex_protein_rnfA	NA|340aa|down_6|NC_011026.1_2090380_2091400_+	COG1477, ApbE, Membrane-associated lipoprotein involved in thiamine biosynthesis [Coenzyme metabolism]	NA|133aa|down_7|NC_011026.1_2091465_2091864_+	pfam04246, RseC_MucC, Positive regulator of sigma(E), RseC/MucC	NA|297aa|down_8|NC_011026.1_2091866_2092757_+	PRK07118, PRK07118, Fe-S cluster domain-containing protein	NA|440aa|down_9|NC_011026.1_2092777_2094097_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	10	2683900-2694522	8,1,3,4	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no		DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Orphan	GTTTCAATTCCACATTGGTGCAATTAGATG,GTTTCAATTCCACATTGGTGCAATTAGATG,GTTTCAATTCCACATTGGTGCAATTAGATG,GTTTCAATTCCACATTGGTGCAATTAGATG	30,30,30,30	0	0	NA	NA	I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A	160,160,158,158	160	Orphan	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA|57aa|up_6|NC_011026.1_2673541_2673712_-,NA	NA|381aa|up_9|NC_011026.1_2666741_2667884_-	pfam01955, CbiZ, Adenosylcobinamide amidohydrolase	NA|418aa|up_8|NC_011026.1_2667976_2669230_-	cd01141, TroA_d, Periplasmic binding protein TroA_d	NA|1216aa|up_7|NC_011026.1_2669723_2673371_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|57aa|up_6|NC_011026.1_2673541_2673712_-	NA	NA|643aa|up_5|NC_011026.1_2673710_2675639_+	PRK05667, dnaG, DNA primase; Validated	NA|431aa|up_4|NC_011026.1_2676063_2677356_+	PRK11649, PRK11649, putative peptidase; Provisional	NA|635aa|up_3|NC_011026.1_2677577_2679482_-	COG3975, COG3975, Predicted protease with the C-terminal PDZ domain [General function prediction only]	NA|303aa|up_2|NC_011026.1_2679514_2680423_-	pfam01555, N6_N4_Mtase, DNA methylase	NA|240aa|up_1|NC_011026.1_2681522_2682242_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|501aa|up_0|NC_011026.1_2682332_2683835_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|522aa|down_0|NC_011026.1_2695007_2696573_+	TIGR04315, cytochrome_c_family_protein, octaheme c-type cytochrome, tetrathionate reductase family	NA|209aa|down_1|NC_011026.1_2696621_2697248_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|172aa|down_2|NC_011026.1_2697281_2697797_+	pfam04143, Sulf_transp, Sulphur transport	NA|152aa|down_3|NC_011026.1_2697812_2698268_+	pfam04143, Sulf_transp, Sulphur transport	NA|242aa|down_4|NC_011026.1_2698279_2699005_+	pfam00581, Rhodanese, Rhodanese-like domain	NA|321aa|down_5|NC_011026.1_2699046_2700009_-	pfam12784, PDDEXK_2, PD-(D/E)XK nuclease family transposase	NA|451aa|down_6|NC_011026.1_2700116_2701469_-	cd07389, MPP_PhoD, Bacillus subtilis PhoD and related proteins, metallophosphatase domain	NA|626aa|down_7|NC_011026.1_2702034_2703912_+	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|154aa|down_8|NC_011026.1_2704037_2704499_-	PRK10293, PRK10293, 1,4-dihydroxy-2-naphthoyl-CoA hydrolase	NA|601aa|down_9|NC_011026.1_2704512_2706315_-	COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]
GCF_000020525.1_ASM2052v1	NC_011026	Chloroherpeton thalassium ATCC 35110, complete sequence	11	2813147-2819595	5,9,2,6	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas4,cas2,cas1,cas6	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	Unclear	GTTTCAATTCCACATTGGTGCAATTAGATG,GTTTCAATTCCACATTGGTGCAATTAGATG,GTTTCAATTCCACATTGGTGCAATTAGATG,GTTTCAATTCCACATTGGTGCAATTAGATG	30,30,30,30	0	0	NA	NA	I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A	95,97,97,95	97	Unclear	DEDDh,cas3,cmr1gr7,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cas10,csa3,Cas9_archaeal,cas7,cas8b6,cas5,cas6,WYL,cas4,cas2,cas1	NA,NA|251aa|down_8|NC_011026.1_2828804_2829557_+	NA|342aa|up_9|NC_011026.1_2795631_2796657_-	pfam00762, Ferrochelatase, Ferrochelatase	NA|203aa|up_8|NC_011026.1_2796933_2797542_-	pfam14827, dCache_3, Double sensory domain of two-component sensor kinase	NA|637aa|up_7|NC_011026.1_2798150_2800061_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|234aa|up_6|NC_011026.1_2801902_2802604_-	cd12843, Bvu_2165_C_like, The C-terminal domain of uncharacterized bacterial proteins	NA|380aa|up_5|NC_011026.1_2802936_2804076_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|604aa|up_4|NC_011026.1_2805282_2807094_-	pfam00350, Dynamin_N, Dynamin family	NA|141aa|up_3|NC_011026.1_2807244_2807667_-	pfam10722, YbjN, Putative bacterial sensory transduction regulator	NA|438aa|up_2|NC_011026.1_2807742_2809056_-	cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family	NA|688aa|up_1|NC_011026.1_2809218_2811282_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|359aa|up_0|NC_011026.1_2811376_2812453_-	cd02549, Peptidase_C39A, A sub-family of peptidase family C39	cas4|171aa|down_0|NC_011026.1_2819812_2820325_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas2|87aa|down_1|NC_011026.1_2820343_2820604_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|385aa|down_2|NC_011026.1_2820637_2821792_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas6|268aa|down_3|NC_011026.1_2821834_2822638_-	cd09759, Cas6_I-A, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|556aa|down_4|NC_011026.1_2822723_2824391_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|172aa|down_5|NC_011026.1_2824480_2824996_-	pfam03259, Robl_LC7, Roadblock/LC7 domain	NA|122aa|down_6|NC_011026.1_2825013_2825379_-	smart00960, Robl_LC7, Roadblock/LC7 domain	NA|894aa|down_7|NC_011026.1_2825870_2828552_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|251aa|down_8|NC_011026.1_2828804_2829557_+	NA	NA|2448aa|down_9|NC_011026.1_2829546_2836890_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]
