assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	1	607252-607550	1	CRISPRCasFinder	no		c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Orphan	CAGCCAGGCGCGGGTCAGGTTCGCTCCGC	29	0	0	NA	NA	NA	4	4	Orphan	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA|73aa|up_8|NC_009921.1_598035_598254_-,NA|93aa|down_9|NC_009921.1_624518_624797_+	NA|195aa|up_9|NC_009921.1_597382_597967_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|73aa|up_8|NC_009921.1_598035_598254_-	NA	NA|280aa|up_7|NC_009921.1_598588_599428_+	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|295aa|up_6|NC_009921.1_599465_600350_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|250aa|up_5|NC_009921.1_600406_601156_+	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|330aa|up_4|NC_009921.1_601142_602132_+	TIGR03621, F420_MSMEG_2516, probable F420-dependent oxidoreductase, MSMEG_2516 family	NA|305aa|up_3|NC_009921.1_602209_603124_+	TIGR03564, F420_MSMEG_4879, F420-dependent oxidoreductase, MSMEG_4879 family	NA|186aa|up_2|NC_009921.1_603226_603784_+	pfam13577, SnoaL_4, SnoaL-like domain	NA|419aa|up_1|NC_009921.1_604271_605528_-	cd06341, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems predicted to be involved in transport of amino acids, peptides, or inorganic ions	NA|273aa|up_0|NC_009921.1_605916_606735_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|515aa|down_0|NC_009921.1_609227_610772_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|114aa|down_1|NC_009921.1_612580_612922_-	pfam11774, Lsr2, Lsr2	NA|202aa|down_2|NC_009921.1_614950_615556_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|842aa|down_3|NC_009921.1_615598_618124_-	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|278aa|down_4|NC_009921.1_618120_618954_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|229aa|down_5|NC_009921.1_618950_619637_-	pfam10067, DUF2306, Predicted membrane protein (DUF2306)	NA|422aa|down_6|NC_009921.1_619938_621204_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|down_7|NC_009921.1_621200_621869_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|287aa|down_8|NC_009921.1_623061_623922_+	pfam04672, Methyltransf_19, S-adenosyl methyltransferase	NA|93aa|down_9|NC_009921.1_624518_624797_+	NA
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	2	1655562-1657804	2,1,1,2,3,3,4,2	CRISPRCasFinder,CRT,PILER-CR,CRT,CRISPRCasFinder,CRT,CRISPRCasFinder,PILER-CR	no	csx19,csm3gr7	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Type III-A, Type III-D?	GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGACC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC	30,30,30,30,30,31,30,30	0	0	NA	NA	?:?:?:?:?:?:?:?	30,32,32,32,30,32,30,32	32	TypeIII-A,TypeIII-D?	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA|256aa|up_8|NC_009921.1_1641120_1641888_+,NA|288aa|up_6|NC_009921.1_1643248_1644112_+,NA|83aa|up_4|NC_009921.1_1645108_1645357_+,NA|150aa|up_3|NC_009921.1_1645379_1645829_+,NA|147aa|up_2|NC_009921.1_1645825_1646266_+,NA|135aa|up_1|NC_009921.1_1646397_1646802_+,NA|533aa|down_4|NC_009921.1_1668305_1669904_-,NA|56aa|down_5|NC_009921.1_1670183_1670351_+	NA|193aa|up_9|NC_009921.1_1640545_1641124_+	pfam13144, ChapFlgA, Chaperone for flagella basal body P-ring formation	NA|256aa|up_8|NC_009921.1_1641120_1641888_+	NA	NA|456aa|up_7|NC_009921.1_1641884_1643252_+	COG4962, CpaF, Flp pilus assembly protein, ATPase CpaF [Intracellular trafficking and secretion]	NA|288aa|up_6|NC_009921.1_1643248_1644112_+	NA	NA|295aa|up_5|NC_009921.1_1644111_1644996_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|83aa|up_4|NC_009921.1_1645108_1645357_+	NA	NA|150aa|up_3|NC_009921.1_1645379_1645829_+	NA	NA|147aa|up_2|NC_009921.1_1645825_1646266_+	NA	NA|135aa|up_1|NC_009921.1_1646397_1646802_+	NA	NA|916aa|up_0|NC_009921.1_1646970_1649718_+	smart01043, BTAD, Bacterial transcriptional activator domain	NA|321aa|down_0|NC_009921.1_1658211_1659174_-	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx19|238aa|down_1|NC_009921.1_1661332_1662046_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|577aa|down_2|NC_009921.1_1662042_1663773_-	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csm3gr7|880aa|down_3|NC_009921.1_1663769_1666409_-	pfam03787, RAMPs, RAMP superfamily	NA|533aa|down_4|NC_009921.1_1668305_1669904_-	NA	NA|56aa|down_5|NC_009921.1_1670183_1670351_+	NA	NA|657aa|down_6|NC_009921.1_1671182_1673153_+	pfam12696, TraG-D_C, TraM recognition site of TraD and TraG	NA|385aa|down_7|NC_009921.1_1673149_1674304_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|401aa|down_8|NC_009921.1_1676490_1677692_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas6|238aa|down_9|NC_009921.1_1678161_1678875_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	3	1674453-1676262	5,4,3,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas2	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Unclear	GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC,TGTTGCGATCCCTCCAGGGATGATCAGCGAC,GTTGCGATCCCTCCAGGGATGATCAGCGAC	30,30,30,31,30	0	0	NA	NA	?:?:?:?:?	27,27,22,22,22	27	Unclear	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA|135aa|up_9|NC_009921.1_1646397_1646802_+,NA|533aa|up_3|NC_009921.1_1668305_1669904_-,NA|56aa|up_2|NC_009921.1_1670183_1670351_+,cas8b2|482aa|down_2|NC_009921.1_1678874_1680320_+	NA|135aa|up_9|NC_009921.1_1646397_1646802_+	NA	NA|916aa|up_8|NC_009921.1_1646970_1649718_+	smart01043, BTAD, Bacterial transcriptional activator domain	NA|321aa|up_7|NC_009921.1_1658211_1659174_-	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx19|238aa|up_6|NC_009921.1_1661332_1662046_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|577aa|up_5|NC_009921.1_1662042_1663773_-	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csm3gr7|880aa|up_4|NC_009921.1_1663769_1666409_-	pfam03787, RAMPs, RAMP superfamily	NA|533aa|up_3|NC_009921.1_1668305_1669904_-	NA	NA|56aa|up_2|NC_009921.1_1670183_1670351_+	NA	NA|657aa|up_1|NC_009921.1_1671182_1673153_+	pfam12696, TraG-D_C, TraM recognition site of TraD and TraG	NA|385aa|up_0|NC_009921.1_1673149_1674304_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|401aa|down_0|NC_009921.1_1676490_1677692_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas6|238aa|down_1|NC_009921.1_1678161_1678875_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b2|482aa|down_2|NC_009921.1_1678874_1680320_+	NA	cas7|353aa|down_3|NC_009921.1_1680319_1681378_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|264aa|down_4|NC_009921.1_1681374_1682166_+	cd09693, Cas5_I, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|824aa|down_5|NC_009921.1_1682162_1684634_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|172aa|down_6|NC_009921.1_1684630_1685146_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas2|88aa|down_7|NC_009921.1_1686127_1686391_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|81aa|down_8|NC_009921.1_1689846_1690089_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|435aa|down_9|NC_009921.1_1690152_1691457_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	4	1686503-1689446	6,6,5	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas2	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Unclear	GTTGCGATCCCTCCAGGGATGATCAGCGAC,GTCGCTGATCATCCCTGGAGGGATCGCAAC,GTCGCTGATCATCCCTGGAGGGATCGCAAC	30,30,30	0	0	NA	NA	?:?:?	43,44,44	44	Unclear	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	cas8b2|482aa|up_5|NC_009921.1_1678874_1680320_+,NA|444aa|down_4|NC_009921.1_1693341_1694673_-,NA|256aa|down_5|NC_009921.1_1694948_1695716_+,NA|157aa|down_9|NC_009921.1_1701900_1702371_-	NA|657aa|up_9|NC_009921.1_1671182_1673153_+	pfam12696, TraG-D_C, TraM recognition site of TraD and TraG	NA|385aa|up_8|NC_009921.1_1673149_1674304_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|401aa|up_7|NC_009921.1_1676490_1677692_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas6|238aa|up_6|NC_009921.1_1678161_1678875_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b2|482aa|up_5|NC_009921.1_1678874_1680320_+	NA	cas7|353aa|up_4|NC_009921.1_1680319_1681378_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|264aa|up_3|NC_009921.1_1681374_1682166_+	cd09693, Cas5_I, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|824aa|up_2|NC_009921.1_1682162_1684634_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|172aa|up_1|NC_009921.1_1684630_1685146_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas2|88aa|up_0|NC_009921.1_1686127_1686391_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|81aa|down_0|NC_009921.1_1689846_1690089_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|435aa|down_1|NC_009921.1_1690152_1691457_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|244aa|down_2|NC_009921.1_1691726_1692458_+	cd14021, ChoK-like_euk, Euykaryotic Choline Kinase and similar proteins	NA|245aa|down_3|NC_009921.1_1692610_1693345_-	pfam13563, 2_5_RNA_ligase2, 2'-5' RNA ligase superfamily	NA|444aa|down_4|NC_009921.1_1693341_1694673_-	NA	NA|256aa|down_5|NC_009921.1_1694948_1695716_+	NA	NA|306aa|down_6|NC_009921.1_1695801_1696719_+	cd05120, APH_ChoK_like, Aminoglycoside 3'-phosphotransferase and Choline Kinase family	NA|219aa|down_7|NC_009921.1_1696694_1697351_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|357aa|down_8|NC_009921.1_1700586_1701657_-	pfam13683, rve_3, Integrase core domain	NA|157aa|down_9|NC_009921.1_1701900_1702371_-	NA
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	5	1825564-1825646	7	CRISPRCasFinder	no		c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Orphan	ACCGGGCAACGATCAACAGGGCGC	24	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA|54aa|up_3|NC_009921.1_1824374_1824536_-,NA|71aa|up_2|NC_009921.1_1824532_1824745_-,NA|103aa|up_1|NC_009921.1_1824801_1825110_-,NA|67aa|up_0|NC_009921.1_1825174_1825375_-,NA|176aa|down_0|NC_009921.1_1825875_1826403_+,NA|139aa|down_1|NC_009921.1_1826828_1827245_+,NA|100aa|down_5|NC_009921.1_1831998_1832298_-	NA|286aa|up_9|NC_009921.1_1816045_1816903_+	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|188aa|up_8|NC_009921.1_1816931_1817495_-	pfam01872, RibD_C, RibD C-terminal domain	NA|334aa|up_7|NC_009921.1_1817764_1818766_-	cd05228, AR_FR_like_1_SDR_e, uncharacterized subgroup of aldehyde reductase and flavonoid reductase related proteins, extended (e) SDRs	NA|230aa|up_6|NC_009921.1_1818849_1819539_-	COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|486aa|up_5|NC_009921.1_1820196_1821654_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|680aa|up_4|NC_009921.1_1822153_1824193_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|54aa|up_3|NC_009921.1_1824374_1824536_-	NA	NA|71aa|up_2|NC_009921.1_1824532_1824745_-	NA	NA|103aa|up_1|NC_009921.1_1824801_1825110_-	NA	NA|67aa|up_0|NC_009921.1_1825174_1825375_-	NA	NA|176aa|down_0|NC_009921.1_1825875_1826403_+	NA	NA|139aa|down_1|NC_009921.1_1826828_1827245_+	NA	NA|339aa|down_2|NC_009921.1_1827330_1828347_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|282aa|down_3|NC_009921.1_1828205_1829051_-	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|434aa|down_4|NC_009921.1_1829377_1830679_+	PRK05159, aspC, aspartyl-tRNA synthetase; Provisional	NA|100aa|down_5|NC_009921.1_1831998_1832298_-	NA	NA|404aa|down_6|NC_009921.1_1832503_1833715_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|116aa|down_7|NC_009921.1_1833797_1834145_-	pfam04672, Methyltransf_19, S-adenosyl methyltransferase	NA|160aa|down_8|NC_009921.1_1834527_1835007_+	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|744aa|down_9|NC_009921.1_1835123_1837355_+	PRK15061, PRK15061, catalase/peroxidase
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	6	3618753-3618836	8	CRISPRCasFinder	no		c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Orphan	CCCGCGCACGCGGGGATCTTCCC	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA,NA|247aa|down_5|NC_009921.1_3631691_3632432_-,NA|160aa|down_7|NC_009921.1_3633679_3634159_-,NA|101aa|down_8|NC_009921.1_3634444_3634747_-	NA|141aa|up_9|NC_009921.1_3592018_3592441_+	pfam05598, DUF772, Transposase domain (DUF772)	NA|409aa|up_8|NC_009921.1_3593124_3594351_-	PRK08132, PRK08132, FAD-dependent oxidoreductase; Provisional	NA|483aa|up_7|NC_009921.1_3594354_3595803_-	PRK08274, PRK08274, FAD-dependent tricarballylate dehydrogenase TcuA	NA|542aa|up_6|NC_009921.1_3595868_3597494_-	COG1021, EntE, Peptide arylation enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|249aa|up_5|NC_009921.1_3597626_3598373_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|84aa|up_4|NC_009921.1_3598987_3599239_-	pfam03621, MbtH, MbtH-like protein	NA|440aa|up_3|NC_009921.1_3599272_3600592_-	pfam13434, K_oxygenase, L-lysine 6-monooxygenase (NADPH-requiring)	NA|281aa|up_2|NC_009921.1_3600597_3601440_-	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|85aa|up_1|NC_009921.1_3601512_3601767_-	COG3433, COG3433, Aryl carrier domain [Secondary metabolites biosynthesis, transport, and catabolism]	NA|5650aa|up_0|NC_009921.1_3601750_3618700_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|401aa|down_0|NC_009921.1_3623464_3624666_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|180aa|down_1|NC_009921.1_3624876_3625416_+	pfam13592, HTH_33, Winged helix-turn helix	NA|207aa|down_2|NC_009921.1_3625340_3625961_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|118aa|down_3|NC_009921.1_3629137_3629491_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|259aa|down_4|NC_009921.1_3629893_3630670_+	PRK14828, PRK14828, undecaprenyl pyrophosphate synthase; Provisional	NA|247aa|down_5|NC_009921.1_3631691_3632432_-	NA	NA|177aa|down_6|NC_009921.1_3632609_3633140_-	cd16837, BldD_C_like, C-terminal domain of BldD and similar transcription factors	NA|160aa|down_7|NC_009921.1_3633679_3634159_-	NA	NA|101aa|down_8|NC_009921.1_3634444_3634747_-	NA	NA|154aa|down_9|NC_009921.1_3634926_3635388_-	pfam04672, Methyltransf_19, S-adenosyl methyltransferase
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	7	7953287-7953504	9	CRISPRCasFinder	no		c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Orphan	CGCCACCGGGGGCGCGGGCGGCGGC	25	1	2	7953460-7953479|7953460-7953479	NC_009921.1_6203717-6203736|NC_009921.1_6156728-6156747	NA	4	4	Orphan	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA,NA|195aa|down_1|NC_009921.1_7955660_7956245_+	NA|223aa|up_9|NC_009921.1_7940547_7941216_-	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|251aa|up_8|NC_009921.1_7941315_7942068_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|418aa|up_7|NC_009921.1_7942064_7943318_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|162aa|up_6|NC_009921.1_7943647_7944133_+	TIGR03941, conserved_hypothetical_protein, putative tRNA adenosine deaminase-associated protein	NA|157aa|up_5|NC_009921.1_7944212_7944683_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|215aa|up_4|NC_009921.1_7945171_7945816_+	COG2910, COG2910, Putative NADH-flavin reductase [General function prediction only]	NA|455aa|up_3|NC_009921.1_7945965_7947330_-	cd06450, DOPA_deC_like, DOPA decarboxylase family	NA|353aa|up_2|NC_009921.1_7948086_7949145_+	TIGR03617, F420_MSMEG_2256, probable F420-dependent oxidoreductase, MSMEG_2256 family	NA|396aa|up_1|NC_009921.1_7949207_7950395_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|786aa|up_0|NC_009921.1_7950605_7952963_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|313aa|down_0|NC_009921.1_7954268_7955207_-	PRK07478, PRK07478, short chain dehydrogenase; Provisional	NA|195aa|down_1|NC_009921.1_7955660_7956245_+	NA	NA|501aa|down_2|NC_009921.1_7956831_7958334_+	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase	NA|330aa|down_3|NC_009921.1_7958398_7959388_-	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|222aa|down_4|NC_009921.1_7959363_7960029_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|294aa|down_5|NC_009921.1_7960028_7960910_-	cd06438, EpsO_like, EpsO protein participates in the methanolan synthesis	NA|491aa|down_6|NC_009921.1_7961385_7962858_+	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|369aa|down_7|NC_009921.1_7962967_7964074_+	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|406aa|down_8|NC_009921.1_7964127_7965345_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|401aa|down_9|NC_009921.1_7965341_7966544_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase
GCF_000018005.1_ASM1800v1	NC_009921	Frankia sp. EAN1pec, complete genome	8	8145156-8145268	10	CRISPRCasFinder	no		c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	Orphan	GCCACACCCGCGTCAGGCTGGTGGCAGGC	29	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,WYL,cas3,csa3,cas4,RT,csx19,csm3gr7,cas6,cas8b2,cas7,cas5,cas2,casR,DEDDh	NA|69aa|up_9|NC_009921.1_8135665_8135872_-,NA|168aa|up_5|NC_009921.1_8139168_8139672_+,NA|224aa|up_4|NC_009921.1_8139668_8140340_+,NA|281aa|up_2|NC_009921.1_8141079_8141922_+,NA|265aa|up_1|NC_009921.1_8141932_8142727_+,NA	NA|69aa|up_9|NC_009921.1_8135665_8135872_-	NA	NA|627aa|up_8|NC_009921.1_8136122_8138003_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|247aa|up_7|NC_009921.1_8138032_8138773_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|128aa|up_6|NC_009921.1_8138664_8139048_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|168aa|up_5|NC_009921.1_8139168_8139672_+	NA	NA|224aa|up_4|NC_009921.1_8139668_8140340_+	NA	NA|209aa|up_3|NC_009921.1_8140336_8140963_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|281aa|up_2|NC_009921.1_8141079_8141922_+	NA	NA|265aa|up_1|NC_009921.1_8141932_8142727_+	NA	NA|489aa|up_0|NC_009921.1_8142723_8144190_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|346aa|down_0|NC_009921.1_8147946_8148984_+	PRK09354, recA, recombinase A; Provisional	NA|326aa|down_1|NC_009921.1_8149941_8150919_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|397aa|down_2|NC_009921.1_8151354_8152545_+	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|322aa|down_3|NC_009921.1_8152573_8153539_+	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|238aa|down_4|NC_009921.1_8153555_8154269_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|356aa|down_5|NC_009921.1_8154280_8155348_+	pfam13683, rve_3, Integrase core domain	NA|248aa|down_6|NC_009921.1_8155322_8156066_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|304aa|down_7|NC_009921.1_8156311_8157223_+	PRK09685, PRK09685, DNA-binding transcriptional activator FeaR; Provisional	NA|61aa|down_8|NC_009921.1_8161157_8161340_+	COG2452, COG2452, Predicted site-specific integrase-resolvase [DNA replication, recombination, and repair]	NA|657aa|down_9|NC_009921.1_8161442_8163413_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons
