assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	5	1944101-1944177	4	CRISPRCasFinder	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	AACCTGTCCGACAGTCGGACAGGTTT	26	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA,NA|240aa|down_0|CP034669.1_1944239_1944959_-,NA|62aa|down_1|CP034669.1_1944999_1945185_+,NA|119aa|down_3|CP034669.1_1945429_1945786_-,NA|112aa|down_4|CP034669.1_1945789_1946125_-,NA|312aa|down_8|CP034669.1_1951627_1952563_+	NA|285aa|up_9|CP034669.1_1928697_1929552_-	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|249aa|up_8|CP034669.1_1929578_1930325_-	cd03146, GAT1_Peptidase_E, Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E	NA|439aa|up_7|CP034669.1_1930335_1931652_-	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|862aa|up_6|CP034669.1_1931741_1934327_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|754aa|up_5|CP034669.1_1934426_1936688_+	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|412aa|up_4|CP034669.1_1936756_1937992_-	PRK08242, PRK08242, acetyl-CoA C-acetyltransferase	NA|450aa|up_3|CP034669.1_1938245_1939595_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|494aa|up_2|CP034669.1_1939600_1941082_-	pfam00498, FHA, FHA domain	NA|768aa|up_1|CP034669.1_1941084_1943388_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|120aa|up_0|CP034669.1_1943528_1943888_+	TIGR02144, lysine_biosynthesis_enzyme, Lysine biosynthesis enzyme LysX	NA|240aa|down_0|CP034669.1_1944239_1944959_-	NA	NA|62aa|down_1|CP034669.1_1944999_1945185_+	NA	NA|96aa|down_2|CP034669.1_1945138_1945426_+	cd00146, PKD, polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here	NA|119aa|down_3|CP034669.1_1945429_1945786_-	NA	NA|112aa|down_4|CP034669.1_1945789_1946125_-	NA	NA|259aa|down_5|CP034669.1_1946228_1947005_+	pfam09414, RNA_ligase, RNA ligase	NA|938aa|down_6|CP034669.1_1947358_1950172_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|448aa|down_7|CP034669.1_1950226_1951570_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|312aa|down_8|CP034669.1_1951627_1952563_+	NA	NA|425aa|down_9|CP034669.1_1952611_1953886_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	6	1965195-1965336	5	CRISPRCasFinder	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	GCACGGGGGGGCGCCGCGACCTCCAAACCTGTCCGACAGTCGGACAGG	48	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|312aa|up_9|CP034669.1_1951627_1952563_+,NA|1016aa|up_6|CP034669.1_1955426_1958474_+,NA|291aa|up_3|CP034669.1_1961839_1962712_+,NA|372aa|down_7|CP034669.1_1980397_1981513_-	NA|312aa|up_9|CP034669.1_1951627_1952563_+	NA	NA|425aa|up_8|CP034669.1_1952611_1953886_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|336aa|up_7|CP034669.1_1953911_1954919_-	pfam00314, Thaumatin, Thaumatin family	NA|1016aa|up_6|CP034669.1_1955426_1958474_+	NA	NA|890aa|up_5|CP034669.1_1958582_1961252_+	pfam00200, Disintegrin, Disintegrin	NA|90aa|up_4|CP034669.1_1961335_1961605_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|291aa|up_3|CP034669.1_1961839_1962712_+	NA	NA|122aa|up_2|CP034669.1_1962910_1963276_+	cd00454, TrHb1_N, truncated hemoglobins (TrHbs, 2/2Hb, 2/2 globins); group 1 (N)	NA|332aa|up_1|CP034669.1_1963298_1964294_+	cd06194, FNR_N-term_Iron_sulfur_binding, Iron-sulfur binding ferredoxin reductase (FNR) proteins combine the FAD and NAD(P) binding regions of FNR with an N-terminal Iron-Sulfur binding cluster domain	NA|169aa|up_0|CP034669.1_1964241_1964748_+	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|1452aa|down_0|CP034669.1_1965584_1969940_-	cd11663, GH119_BcIgtZ-like, putative catalytic domain of glycoside hydrolase family 119 (GH119)	NA|1024aa|down_1|CP034669.1_1970119_1973191_+	PLN02672, PLN02672, methionine S-methyltransferase	NA|433aa|down_2|CP034669.1_1973229_1974528_+	COG2421, COG2421, Predicted acetamidase/formamidase [Energy production and conversion]	NA|196aa|down_3|CP034669.1_1974666_1975254_+	cd07818, SRPBCC_1, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|392aa|down_4|CP034669.1_1975265_1976441_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|448aa|down_5|CP034669.1_1976812_1978156_+	pfam05139, Erythro_esteras, Erythromycin esterase	NA|698aa|down_6|CP034669.1_1978329_1980423_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|372aa|down_7|CP034669.1_1980397_1981513_-	NA	NA|651aa|down_8|CP034669.1_1982180_1984133_+	TIGR03108, eps_aminotran_1, exosortase A system-associated amidotransferase 1	NA|375aa|down_9|CP034669.1_1984129_1985254_+	PRK06847, PRK06847, hypothetical protein; Provisional
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	7	2023446-2024036	2	CRT	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	GGNAACNTGTCNGACAGTCGGACAGGTT	28	0	0	NA	NA	NA	8	8	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|174aa|up_7|CP034669.1_2015662_2016184_-,NA|56aa|down_3|CP034669.1_2026489_2026657_+	NA|234aa|up_9|CP034669.1_2013845_2014547_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|371aa|up_8|CP034669.1_2014543_2015656_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|174aa|up_7|CP034669.1_2015662_2016184_-	NA	NA|310aa|up_6|CP034669.1_2016268_2017198_-	cd16896, LT_Slt70-like, uncharacterized lytic transglycosylase subfamily with similarity to Slt70	NA|528aa|up_5|CP034669.1_2017378_2018962_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|246aa|up_4|CP034669.1_2019154_2019892_-	COG4221, COG4221, Short-chain alcohol dehydrogenase of unknown specificity [General function prediction only]	NA|281aa|up_3|CP034669.1_2020044_2020887_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|137aa|up_2|CP034669.1_2020895_2021306_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|179aa|up_1|CP034669.1_2021311_2021848_-	cd06456, M3A_DCP, Peptidase family M3, dipeptidyl carboxypeptidase (DCP)	NA|435aa|up_0|CP034669.1_2022094_2023399_+	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|176aa|down_0|CP034669.1_2024152_2024680_+	TIGR02957, putative_sigma_factor, RNA polymerase sigma-70 factor, TIGR02957 family	NA|404aa|down_1|CP034669.1_2024657_2025869_-	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|147aa|down_2|CP034669.1_2025967_2026408_+	TIGR02957, putative_sigma_factor, RNA polymerase sigma-70 factor, TIGR02957 family	NA|56aa|down_3|CP034669.1_2026489_2026657_+	NA	NA|758aa|down_4|CP034669.1_2026763_2029037_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|397aa|down_5|CP034669.1_2029033_2030224_+	TIGR03472, HpnI, hopanoid biosynthesis associated glycosyl transferase protein HpnI	NA|470aa|down_6|CP034669.1_2030163_2031573_-	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|349aa|down_7|CP034669.1_2031659_2032706_+	pfam00494, SQS_PSY, Squalene/phytoene synthase	NA|654aa|down_8|CP034669.1_2032705_2034667_+	TIGR03463, osq_cycl, 2,3-oxidosqualene cyclase	NA|169aa|down_9|CP034669.1_2034614_2035121_-	pfam09351, DUF1993, Domain of unknown function (DUF1993)
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	8	2221621-2222429	3	CRT	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	GATGGCNGCGGNCTGGGGNGC	21	2	2	2221845-2221862|2221929-2221949	CP034669.1_165247-165264|CP034669.1_2221603-2221623	NA	14	14	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|656aa|up_8|CP034669.1_2204292_2206260_+,NA|150aa|down_6|CP034669.1_2229550_2230000_-,NA|133aa|down_8|CP034669.1_2233720_2234119_+	NA|522aa|up_9|CP034669.1_2202677_2204243_+	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|656aa|up_8|CP034669.1_2204292_2206260_+	NA	NA|626aa|up_7|CP034669.1_2206263_2208141_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|198aa|up_6|CP034669.1_2208150_2208744_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|106aa|up_5|CP034669.1_2208750_2209068_-	pfam02575, YbaB_DNA_bd, YbaB/EbfC DNA-binding family	NA|345aa|up_4|CP034669.1_2209166_2210201_-	PRK14965, PRK14965, DNA polymerase III subunits gamma and tau; Provisional	NA|403aa|up_3|CP034669.1_2210445_2211654_-	PRK14965, PRK14965, DNA polymerase III subunits gamma and tau; Provisional	NA|513aa|up_2|CP034669.1_2211731_2213270_-	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|489aa|up_1|CP034669.1_2213379_2214846_+	pfam03901, Glyco_transf_22, Alg9-like mannosyltransferase family	NA|427aa|up_0|CP034669.1_2216188_2217469_-	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional	NA|438aa|down_0|CP034669.1_2222793_2224107_+	COG1232, HemY, Protoporphyrinogen oxidase [Coenzyme metabolism]	NA|237aa|down_1|CP034669.1_2224185_2224896_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|316aa|down_2|CP034669.1_2224912_2225860_+	cd10940, CE4_PuuE_HpPgdA_like_1, Putative catalytic domain of uncharacterized bacterial polysaccharide deacetylases similar to bacterial PuuE allantoinases and Helicobacter pylori peptidoglycan deacetylase (HpPgdA)	NA|478aa|down_3|CP034669.1_2225861_2227295_-	pfam10009, DUF2252, Uncharacterized protein conserved in bacteria (DUF2252)	NA|285aa|down_4|CP034669.1_2227386_2228241_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|322aa|down_5|CP034669.1_2228237_2229203_+	cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup 3, extended (e) SDRs	NA|150aa|down_6|CP034669.1_2229550_2230000_-	NA	NA|1209aa|down_7|CP034669.1_2230092_2233719_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|133aa|down_8|CP034669.1_2233720_2234119_+	NA	NA|683aa|down_9|CP034669.1_2234124_2236173_-	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	14	3644458-3644540	9	CRISPRCasFinder	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	ACGCACCGGGCAGGCGCCACGGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|271aa|up_8|CP034669.1_3636456_3637269_-,NA|95aa|up_4|CP034669.1_3640723_3641008_-,NA|310aa|up_3|CP034669.1_3641206_3642136_+,NA|84aa|up_2|CP034669.1_3642171_3642423_-,NA|216aa|up_1|CP034669.1_3642738_3643386_-,NA|255aa|up_0|CP034669.1_3643449_3644214_-,NA|111aa|down_0|CP034669.1_3645086_3645419_-,NA|305aa|down_1|CP034669.1_3645846_3646761_-,NA|69aa|down_4|CP034669.1_3651250_3651457_-,NA|219aa|down_5|CP034669.1_3651456_3652113_-,NA|92aa|down_6|CP034669.1_3652223_3652499_-	NA|994aa|up_9|CP034669.1_3633360_3636342_-	pfam17957, Big_7, Bacterial Ig domain	NA|271aa|up_8|CP034669.1_3636456_3637269_-	NA	NA|206aa|up_7|CP034669.1_3637441_3638059_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|456aa|up_6|CP034669.1_3638268_3639636_+	pfam12388, Peptidase_M57, Dual-action HEIGH metallo-peptidase	NA|275aa|up_5|CP034669.1_3639884_3640709_-	pfam11583, AurF, P-aminobenzoate N-oxygenase AurF	NA|95aa|up_4|CP034669.1_3640723_3641008_-	NA	NA|310aa|up_3|CP034669.1_3641206_3642136_+	NA	NA|84aa|up_2|CP034669.1_3642171_3642423_-	NA	NA|216aa|up_1|CP034669.1_3642738_3643386_-	NA	NA|255aa|up_0|CP034669.1_3643449_3644214_-	NA	NA|111aa|down_0|CP034669.1_3645086_3645419_-	NA	NA|305aa|down_1|CP034669.1_3645846_3646761_-	NA	NA|582aa|down_2|CP034669.1_3647409_3649155_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|654aa|down_3|CP034669.1_3649279_3651241_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|69aa|down_4|CP034669.1_3651250_3651457_-	NA	NA|219aa|down_5|CP034669.1_3651456_3652113_-	NA	NA|92aa|down_6|CP034669.1_3652223_3652499_-	NA	NA|94aa|down_7|CP034669.1_3652578_3652860_-	TIGR01713, Type_II_secretion_system_protein_C, type II secretion system protein C	NA|646aa|down_8|CP034669.1_3653013_3654951_+	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|554aa|down_9|CP034669.1_3654947_3656609_+	cd17932, DEXQc_UvrD, DEXQD-box helicase domain of UvrD
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	16	5118352-5118455	11	CRISPRCasFinder	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	ACCGCGAACCTGTCCGACTGTCGGACAGGTTTT	33	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|195aa|up_4|CP034669.1_5111370_5111955_-,NA|184aa|down_7|CP034669.1_5132466_5133018_+,NA|351aa|down_9|CP034669.1_5133975_5135028_-	NA|157aa|up_9|CP034669.1_5041220_5041691_+	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|265aa|up_8|CP034669.1_5041997_5042792_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|11161aa|up_7|CP034669.1_5042881_5076364_-	PRK12316, PRK12316, peptide synthase; Provisional	NA|748aa|up_6|CP034669.1_5107412_5109656_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|486aa|up_5|CP034669.1_5109820_5111278_+	PRK05567, PRK05567, inosine 5'-monophosphate dehydrogenase; Reviewed	NA|195aa|up_4|CP034669.1_5111370_5111955_-	NA	NA|518aa|up_3|CP034669.1_5112079_5113633_+	PRK00074, guaA, GMP synthase; Reviewed	NA|255aa|up_2|CP034669.1_5113645_5114410_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|698aa|up_1|CP034669.1_5114550_5116644_-	pfam10459, Peptidase_S46, Peptidase S46	NA|415aa|up_0|CP034669.1_5117034_5118279_+	cd01299, Met_dep_hydrolase_A, Metallo-dependent hydrolases, subgroup A is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site	NA|582aa|down_0|CP034669.1_5118558_5120304_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|248aa|down_1|CP034669.1_5120300_5121044_-	COG3208, GrsT, Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|983aa|down_2|CP034669.1_5121040_5123989_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1691aa|down_3|CP034669.1_5123985_5129058_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|425aa|down_4|CP034669.1_5129281_5130556_+	COG2312, COG2312, Erythromycin esterase homolog [General function prediction only]	NA|388aa|down_5|CP034669.1_5130576_5131740_-	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|127aa|down_6|CP034669.1_5131971_5132352_-	pfam09543, DUF2379, Protein of unknown function (DUF2379)	NA|184aa|down_7|CP034669.1_5132466_5133018_+	NA	NA|299aa|down_8|CP034669.1_5133021_5133918_-	pfam11219, DUF3014, Protein of unknown function (DUF3014)	NA|351aa|down_9|CP034669.1_5133975_5135028_-	NA
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	20	6184062-6184173	14	CRISPRCasFinder	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	CCCGCGAGGTCGGACCGCGGAAACTTGTCCGACTGTCCGACA	42	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|41aa|up_8|CP034669.1_6172614_6172737_+,NA|175aa|down_6|CP034669.1_6193094_6193619_-,NA|112aa|down_9|CP034669.1_6198694_6199030_-	NA|433aa|up_9|CP034669.1_6171267_6172566_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|41aa|up_8|CP034669.1_6172614_6172737_+	NA	NA|371aa|up_7|CP034669.1_6172748_6173861_-	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|303aa|up_6|CP034669.1_6174055_6174964_+	cd08480, PBP2_CrgA_like_10, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|331aa|up_5|CP034669.1_6175168_6176161_+	pfam04885, Stig1, Stigma-specific protein, Stig1	NA|289aa|up_4|CP034669.1_6176200_6177067_-	pfam13527, Acetyltransf_9, Acetyltransferase (GNAT) domain	NA|422aa|up_3|CP034669.1_6177093_6178359_-	pfam04932, Wzy_C, O-Antigen ligase	NA|490aa|up_2|CP034669.1_6178538_6180008_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|787aa|up_1|CP034669.1_6180322_6182683_+	PRK05261, PRK05261, phosphoketolase	NA|392aa|up_0|CP034669.1_6182679_6183855_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|373aa|down_0|CP034669.1_6184422_6185541_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|371aa|down_1|CP034669.1_6185537_6186650_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|420aa|down_2|CP034669.1_6186646_6187906_-	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|994aa|down_3|CP034669.1_6187907_6190889_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|515aa|down_4|CP034669.1_6190893_6192438_-	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional	NA|164aa|down_5|CP034669.1_6192545_6193037_-	COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]	NA|175aa|down_6|CP034669.1_6193094_6193619_-	NA	NA|695aa|down_7|CP034669.1_6193778_6195863_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|907aa|down_8|CP034669.1_6195871_6198592_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|112aa|down_9|CP034669.1_6198694_6199030_-	NA
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	22	6368424-6368674	3	PILER-CR	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	CCGGCCGTGGGCGCGGTGACTCCGCCCCAGGCTCCCGC	38	0	0	NA	NA	NA	3	3	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|59aa|up_0|CP034669.1_6368032_6368209_-,NA|176aa|down_4|CP034669.1_6373966_6374494_-	NA|652aa|up_9|CP034669.1_6357981_6359937_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|310aa|up_8|CP034669.1_6360014_6360944_-	pfam09674, DUF2400, Protein of unknown function (DUF2400)	NA|219aa|up_7|CP034669.1_6360957_6361614_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|105aa|up_6|CP034669.1_6361640_6361955_-	cd03528, Rieske_RO_ferredoxin, Rieske non-heme iron oxygenase (RO) family, Rieske ferredoxin component; composed of the Rieske ferredoxin component of some three-component RO systems including biphenyl dioxygenase (BPDO) and carbazole 1,9a-dioxygenase (CARDO)	NA|587aa|up_5|CP034669.1_6362021_6363782_+	PRK08609, PRK08609, DNA polymerase/3'-5' exonuclease PolX	NA|204aa|up_4|CP034669.1_6363801_6364413_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|242aa|up_3|CP034669.1_6364409_6365135_+	pfam02517, Abi, CAAX protease self-immunity	NA|278aa|up_2|CP034669.1_6365143_6365977_-	cd01908, YafJ, Glutamine amidotransferases class-II (Gn-AT)_YafJ-type	NA|536aa|up_1|CP034669.1_6366191_6367799_+	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|59aa|up_0|CP034669.1_6368032_6368209_-	NA	NA|352aa|down_0|CP034669.1_6368792_6369848_+	pfam00226, DnaJ, DnaJ domain	NA|258aa|down_1|CP034669.1_6369869_6370643_+	pfam02674, Colicin_V, Colicin V production protein	NA|359aa|down_2|CP034669.1_6370632_6371709_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|669aa|down_3|CP034669.1_6371946_6373953_+	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|176aa|down_4|CP034669.1_6373966_6374494_-	NA	NA|143aa|down_5|CP034669.1_6374586_6375015_-	pfam02566, OsmC, OsmC-like protein	NA|408aa|down_6|CP034669.1_6375060_6376284_-	PRK00770, PRK00770, deoxyhypusine synthase	NA|277aa|down_7|CP034669.1_6376478_6377309_+	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|516aa|down_8|CP034669.1_6377310_6378858_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|123aa|down_9|CP034669.1_6379117_6379486_+	pfam00507, Oxidored_q4, NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	23	6603053-6603222	4	PILER-CR	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	TCCAAACCTGTCGGACAGTCGGACAGGTTTC	31	0	0	NA	NA	NA	2	2	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|400aa|up_9|CP034669.1_6592944_6594144_+,NA|109aa|up_8|CP034669.1_6594182_6594509_+,NA|204aa|up_3|CP034669.1_6600233_6600845_+,NA	NA|400aa|up_9|CP034669.1_6592944_6594144_+	NA	NA|109aa|up_8|CP034669.1_6594182_6594509_+	NA	NA|342aa|up_7|CP034669.1_6594613_6595639_+	cd04273, ZnMc_ADAMTS_like, Zinc-dependent metalloprotease, ADAMTS_like subgroup	NA|596aa|up_6|CP034669.1_6595713_6597501_-	pfam05960, DUF885, Bacterial protein of unknown function (DUF885)	NA|574aa|up_5|CP034669.1_6597718_6599440_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|186aa|up_4|CP034669.1_6599443_6600001_-	pfam09346, SMI1_KNR4, SMI1 / KNR4 family (SUKH-1)	NA|204aa|up_3|CP034669.1_6600233_6600845_+	NA	NA|303aa|up_2|CP034669.1_6600857_6601766_-	cd08474, PBP2_CrgA_like_5, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|293aa|up_1|CP034669.1_6601860_6602739_+	PRK10376, PRK10376, putative oxidoreductase; Provisional	NA|92aa|up_0|CP034669.1_6602742_6603018_-	cd05466, PBP2_LTTR_substrate, The substrate binding domain of LysR-type transcriptional regulators (LTTRs), a member of the type 2 periplasmic binding fold protein superfamily	NA|124aa|down_0|CP034669.1_6603332_6603704_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|301aa|down_1|CP034669.1_6603805_6604708_+	pfam02517, Abi, CAAX protease self-immunity	NA|299aa|down_2|CP034669.1_6605035_6605932_-	smart00138, MeTrc, Methyltransferase, chemotaxis proteins	NA|187aa|down_3|CP034669.1_6605936_6606497_-	cd00732, CheW, CheW, a small regulator protein, unique to the chemotaxis signalling in prokaryotes and archea	NA|563aa|down_4|CP034669.1_6606510_6608199_-	PRK15041, PRK15041, methyl-accepting chemotaxis protein	NA|573aa|down_5|CP034669.1_6608226_6609945_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|596aa|down_6|CP034669.1_6610395_6612183_+	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|133aa|down_7|CP034669.1_6612194_6612593_-	cd06990, cupin_DUF861, domain of unknown function DUF 861, cupin domain	NA|350aa|down_8|CP034669.1_6612759_6613809_+	cd01062, RNase_T2_prok, Ribonuclease T2 (RNase T2) is a widespread family of secreted RNases found in every organism examined thus far	NA|338aa|down_9|CP034669.1_6613810_6614824_-	PRK11815, PRK11815, tRNA dihydrouridine(20/20a) synthase DusA
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	24	6663023-6663240	5	PILER-CR	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	CCTGTCGGACAGTCGGACAG	20	0	0	NA	NA	NA	3	3	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|169aa|up_6|CP034669.1_6655325_6655832_+,NA|67aa|down_0|CP034669.1_6663377_6663578_-,NA|176aa|down_4|CP034669.1_6667222_6667750_+,NA|164aa|down_8|CP034669.1_6671029_6671521_-	NA|678aa|up_9|CP034669.1_6651307_6653341_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|294aa|up_8|CP034669.1_6653347_6654229_-	TIGR04565, hypothetical_protein_N47_G32130, outer membrane beta-barrel protein	NA|295aa|up_7|CP034669.1_6654225_6655110_-	TIGR04565, hypothetical_protein_N47_G32130, outer membrane beta-barrel protein	NA|169aa|up_6|CP034669.1_6655325_6655832_+	NA	NA|478aa|up_5|CP034669.1_6655946_6657380_+	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|168aa|up_4|CP034669.1_6657546_6658050_+	pfam00731, AIRC, AIR carboxylase	NA|381aa|up_3|CP034669.1_6658046_6659189_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|159aa|up_2|CP034669.1_6659194_6659671_+	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|411aa|up_1|CP034669.1_6659740_6660973_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|593aa|up_0|CP034669.1_6661112_6662891_+	COG2200, Rtn, c-di-GMP phosphodiesterase class I (EAL domain) [Signal    transduction mechanisms]	NA|67aa|down_0|CP034669.1_6663377_6663578_-	NA	NA|186aa|down_1|CP034669.1_6663820_6664378_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|144aa|down_2|CP034669.1_6664374_6664806_-	TIGR02266, gmx_TIGR02266, Myxococcus xanthus paralogous domain TIGR02266	NA|768aa|down_3|CP034669.1_6664864_6667168_+	sd00008, TPR_YbbN, C-terminal Tetratricopeptide repeat (TPR) region of YbbN and similar motifs	NA|176aa|down_4|CP034669.1_6667222_6667750_+	NA	NA|158aa|down_5|CP034669.1_6667823_6668297_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|323aa|down_6|CP034669.1_6668312_6669281_-	cd05153, HomoserineK_II, Type II Homoserine Kinase	NA|536aa|down_7|CP034669.1_6669425_6671033_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|164aa|down_8|CP034669.1_6671029_6671521_-	NA	NA|446aa|down_9|CP034669.1_6671522_6672860_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor
GCA_004104415.1_ASM410441v1	CP034669	Corallococcus coralloides strain B035 chromosome, complete genome	27	8794780-8795237	6	CRT	no		csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	Orphan	CCGAGCCGNGGAAACCTGTCCGACTGTCGGACAGCTTTG	39	0	0	NA	NA	NA	6	6	Orphan	csa3,DEDDh,cas3,WYL,Cas9_archaeal,RT,2OG_CAS,DinG,PD-DExK	NA|104aa|up_3|CP034669.1_8788148_8788460_-,NA|1012aa|up_0|CP034669.1_8791538_8794574_-,NA|70aa|down_0|CP034669.1_8795352_8795562_+,NA|52aa|down_1|CP034669.1_8795611_8795767_+,NA|373aa|down_8|CP034669.1_8802679_8803798_-	NA|467aa|up_9|CP034669.1_8780052_8781453_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|466aa|up_8|CP034669.1_8781486_8782884_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|753aa|up_7|CP034669.1_8782939_8785198_-	cd00687, Terpene_cyclase_nonplant_C1, Non-plant Terpene Cyclases, Class 1	NA|363aa|up_6|CP034669.1_8785407_8786496_-	PRK13007, PRK13007, succinyl-diaminopimelate desuccinylase; Reviewed	NA|211aa|up_5|CP034669.1_8786541_8787174_-	cd20301, cupin_ChrR, anti-ECFsigma factor, ChrR , cupin domain	NA|276aa|up_4|CP034669.1_8787256_8788084_-	PRK11830, dapD, 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional	NA|104aa|up_3|CP034669.1_8788148_8788460_-	NA	NA|489aa|up_2|CP034669.1_8788645_8790112_+	cd06450, DOPA_deC_like, DOPA decarboxylase family	NA|240aa|up_1|CP034669.1_8790168_8790888_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|1012aa|up_0|CP034669.1_8791538_8794574_-	NA	NA|70aa|down_0|CP034669.1_8795352_8795562_+	NA	NA|52aa|down_1|CP034669.1_8795611_8795767_+	NA	NA|273aa|down_2|CP034669.1_8795806_8796625_-	TIGR04484, sulfur_oxidation_protein_SoxA, sulfur oxidation c-type cytochrome SoxA	NA|373aa|down_3|CP034669.1_8796648_8797767_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|437aa|down_4|CP034669.1_8797779_8799090_-	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|459aa|down_5|CP034669.1_8799311_8800688_-	pfam09924, DUF2156, Uncharacterized conserved protein (DUF2156)	NA|297aa|down_6|CP034669.1_8800936_8801827_-	pfam00912, Transgly, Transglycosylase	NA|234aa|down_7|CP034669.1_8801840_8802542_-	pfam05257, CHAP, CHAP domain	NA|373aa|down_8|CP034669.1_8802679_8803798_-	NA	NA|852aa|down_9|CP034669.1_8804218_8806774_+	cd09601, M1_APN-Q_like, Peptidase M1 aminopeptidase N catalytic domain family which includes aminopeptidase N (APN), aminopeptidase Q (APQ), tricorn interacting factor F3, and endoplasmic reticulum aminopeptidase 1 (ERAP1)
