assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	1	748345-748440	1	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CACCGATTTTAACGCCTCGACGG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|343aa|up_9|NZ_CP042425.1_735919_736948_+,NA|131aa|up_8|NZ_CP042425.1_736949_737342_+,NA|142aa|up_4|NZ_CP042425.1_743801_744227_-,NA|513aa|up_2|NZ_CP042425.1_745156_746695_-,NA|410aa|down_0|NZ_CP042425.1_748582_749812_+	NA|343aa|up_9|NZ_CP042425.1_735919_736948_+	NA	NA|131aa|up_8|NZ_CP042425.1_736949_737342_+	NA	NA|1057aa|up_7|NZ_CP042425.1_737338_740509_+	COG1391, GlnE, Glutamine synthetase adenylyltransferase [Posttranslational modification, protein turnover, chaperones / Signal transduction mechanisms]	NA|424aa|up_6|NZ_CP042425.1_740505_741777_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|616aa|up_5|NZ_CP042425.1_741860_743708_-	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|142aa|up_4|NZ_CP042425.1_743801_744227_-	NA	NA|255aa|up_3|NZ_CP042425.1_744395_745160_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|513aa|up_2|NZ_CP042425.1_745156_746695_-	NA	NA|310aa|up_1|NZ_CP042425.1_746691_747621_-	PRK00059, prsA, peptidylprolyl isomerase; Provisional	NA|135aa|up_0|NZ_CP042425.1_747760_748165_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|410aa|down_0|NZ_CP042425.1_748582_749812_+	NA	NA|636aa|down_1|NZ_CP042425.1_749808_751716_+	smart00752, HTTM, Horizontally Transferred TransMembrane Domain	NA|736aa|down_2|NZ_CP042425.1_751858_754066_-	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|360aa|down_3|NZ_CP042425.1_754145_755225_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|344aa|down_4|NZ_CP042425.1_755310_756342_-	PRK03910, PRK03910, D-cysteine desulfhydrase; Validated	NA|468aa|down_5|NZ_CP042425.1_756418_757822_+	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|346aa|down_6|NZ_CP042425.1_757871_758909_-	pfam13485, Peptidase_MA_2, Peptidase MA superfamily	NA|143aa|down_7|NZ_CP042425.1_759179_759608_+	pfam06941, NT5C, 5' nucleotidase, deoxy (Pyrimidine), cytosolic type C protein (NT5C)	NA|168aa|down_8|NZ_CP042425.1_759663_760167_+	pfam15891, Nuc_deoxyri_tr2, Nucleoside 2-deoxyribosyltransferase like	NA|306aa|down_9|NZ_CP042425.1_760227_761145_-	COG3781, COG3781, Predicted membrane protein [Function unknown]
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	2	1891571-1891674	2	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	ACGTCGTCGGAGATCGACCTGCCGCCGAT	29	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|425aa|up_6|NZ_CP042425.1_1880265_1881540_+,NA|234aa|up_4|NZ_CP042425.1_1882605_1883307_-,NA|76aa|up_0|NZ_CP042425.1_1889356_1889584_-,NA|130aa|down_1|NZ_CP042425.1_1895291_1895681_-,NA|223aa|down_3|NZ_CP042425.1_1896590_1897259_+,NA|145aa|down_4|NZ_CP042425.1_1897289_1897724_+,NA|201aa|down_5|NZ_CP042425.1_1897801_1898404_+,NA|111aa|down_6|NZ_CP042425.1_1898462_1898795_+,NA|248aa|down_9|NZ_CP042425.1_1901276_1902020_-	NA|243aa|up_9|NZ_CP042425.1_1877785_1878514_-	COG4221, COG4221, Short-chain alcohol dehydrogenase of unknown specificity [General function prediction only]	NA|379aa|up_8|NZ_CP042425.1_1878606_1879744_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|160aa|up_7|NZ_CP042425.1_1879740_1880220_+	pfam02367, TsaE, Threonylcarbamoyl adenosine biosynthesis protein TsaE	NA|425aa|up_6|NZ_CP042425.1_1880265_1881540_+	NA	NA|374aa|up_5|NZ_CP042425.1_1881536_1882658_+	cd08231, MDR_TM0436_like, Hypothetical enzyme TM0436 resembles the zinc-dependent alcohol dehydrogenases (ADH)	NA|234aa|up_4|NZ_CP042425.1_1882605_1883307_-	NA	NA|600aa|up_3|NZ_CP042425.1_1883412_1885212_-	PRK14083, PRK14083, HSP90 family protein; Provisional	NA|829aa|up_2|NZ_CP042425.1_1885315_1887802_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|465aa|up_1|NZ_CP042425.1_1887888_1889283_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|76aa|up_0|NZ_CP042425.1_1889356_1889584_-	NA	NA|298aa|down_0|NZ_CP042425.1_1894329_1895223_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|130aa|down_1|NZ_CP042425.1_1895291_1895681_-	NA	NA|219aa|down_2|NZ_CP042425.1_1895775_1896432_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|223aa|down_3|NZ_CP042425.1_1896590_1897259_+	NA	NA|145aa|down_4|NZ_CP042425.1_1897289_1897724_+	NA	NA|201aa|down_5|NZ_CP042425.1_1897801_1898404_+	NA	NA|111aa|down_6|NZ_CP042425.1_1898462_1898795_+	NA	NA|402aa|down_7|NZ_CP042425.1_1899011_1900217_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|337aa|down_8|NZ_CP042425.1_1900303_1901314_+	pfam02517, Abi, CAAX protease self-immunity	NA|248aa|down_9|NZ_CP042425.1_1901276_1902020_-	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	3	2449108-2449209	3	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	TCCAACGAAAAGTGACTGAGAATTAATGC	29	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|323aa|up_2|NZ_CP042425.1_2446619_2447588_-,NA|197aa|up_0|NZ_CP042425.1_2448450_2449041_-,NA	NA|76aa|up_9|NZ_CP042425.1_2436844_2437072_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|400aa|up_8|NZ_CP042425.1_2437032_2438232_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|441aa|up_7|NZ_CP042425.1_2438525_2439848_-	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|578aa|up_6|NZ_CP042425.1_2440033_2441767_+	PRK09274, PRK09274, peptide synthase; Provisional	NA|332aa|up_5|NZ_CP042425.1_2441763_2442759_+	cd09813, 3b-HSD-NSDHL-like_SDR_e, human NSDHL (NAD(P)H steroid dehydrogenase-like protein)-like, extended (e) SDRs	NA|432aa|up_4|NZ_CP042425.1_2442760_2444056_-	cd03784, GT1_Gtf-like, UDP-glycosyltransferases and similar proteins	NA|715aa|up_3|NZ_CP042425.1_2444141_2446286_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|323aa|up_2|NZ_CP042425.1_2446619_2447588_-	NA	NA|207aa|up_1|NZ_CP042425.1_2447584_2448205_-	pfam02517, Abi, CAAX protease self-immunity	NA|197aa|up_0|NZ_CP042425.1_2448450_2449041_-	NA	NA|450aa|down_0|NZ_CP042425.1_2449248_2450598_-	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|149aa|down_1|NZ_CP042425.1_2450684_2451131_-	PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier protein	NA|374aa|down_2|NZ_CP042425.1_2451297_2452419_-	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|204aa|down_3|NZ_CP042425.1_2452775_2453387_+	COG1648, CysG, Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) [Coenzyme metabolism]	NA|276aa|down_4|NZ_CP042425.1_2453415_2454243_+	TIGR03144, cytochrome_c_biogenesis_protein_chloroplast, cytochrome c-type biogenesis protein CcsB	NA|425aa|down_5|NZ_CP042425.1_2454239_2455514_+	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	NA|243aa|down_6|NZ_CP042425.1_2455518_2456247_+	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|297aa|down_7|NZ_CP042425.1_2456243_2457134_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|164aa|down_8|NZ_CP042425.1_2457168_2457660_-	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|171aa|down_9|NZ_CP042425.1_2457731_2458244_+	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	4	3078725-3078823	4	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	TTCGGCCCTGAAGGGGTCGTTCA	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|622aa|up_9|NZ_CP042425.1_3067377_3069243_+,NA|66aa|up_8|NZ_CP042425.1_3069289_3069487_+,NA|148aa|up_7|NZ_CP042425.1_3069566_3070010_-,NA|273aa|up_5|NZ_CP042425.1_3072629_3073448_+,NA|90aa|down_0|NZ_CP042425.1_3078937_3079207_-,NA|130aa|down_9|NZ_CP042425.1_3088958_3089348_-	NA|622aa|up_9|NZ_CP042425.1_3067377_3069243_+	NA	NA|66aa|up_8|NZ_CP042425.1_3069289_3069487_+	NA	NA|148aa|up_7|NZ_CP042425.1_3069566_3070010_-	NA	NA|766aa|up_6|NZ_CP042425.1_3070160_3072458_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|273aa|up_5|NZ_CP042425.1_3072629_3073448_+	NA	NA|216aa|up_4|NZ_CP042425.1_3073580_3074228_+	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|186aa|up_3|NZ_CP042425.1_3074244_3074802_+	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|260aa|up_2|NZ_CP042425.1_3074798_3075578_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|520aa|up_1|NZ_CP042425.1_3075732_3077292_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|433aa|up_0|NZ_CP042425.1_3077401_3078700_+	pfam13360, PQQ_2, PQQ-like domain	NA|90aa|down_0|NZ_CP042425.1_3078937_3079207_-	NA	NA|577aa|down_1|NZ_CP042425.1_3079696_3081427_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|344aa|down_2|NZ_CP042425.1_3081498_3082530_+	cd19091, AKR_PsAKR, Polaromonas Sp	NA|182aa|down_3|NZ_CP042425.1_3082750_3083296_+	pfam11026, DUF2721, Protein of unknown function (DUF2721)	NA|292aa|down_4|NZ_CP042425.1_3083365_3084241_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|410aa|down_5|NZ_CP042425.1_3084393_3085623_+	COG0334, GdhA, Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]	NA|262aa|down_6|NZ_CP042425.1_3086047_3086833_+	TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form	NA|270aa|down_7|NZ_CP042425.1_3086845_3087655_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|384aa|down_8|NZ_CP042425.1_3087709_3088861_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|130aa|down_9|NZ_CP042425.1_3088958_3089348_-	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	5	3380749-3380860	5	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CGGCCGATTTCGTTTCACGTTTGGTGGAATTGCAAGG	37	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|105aa|up_6|NZ_CP042425.1_3372608_3372923_+,NA|256aa|up_2|NZ_CP042425.1_3377107_3377875_-,NA|155aa|down_0|NZ_CP042425.1_3381358_3381823_-,NA|504aa|down_1|NZ_CP042425.1_3381833_3383345_-,NA|201aa|down_3|NZ_CP042425.1_3384988_3385591_+	NA|306aa|up_9|NZ_CP042425.1_3370345_3371263_-	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|268aa|up_8|NZ_CP042425.1_3371272_3372076_-	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed	NA|136aa|up_7|NZ_CP042425.1_3372176_3372584_+	pfam04134, DUF393, Protein of unknown function, DUF393	NA|105aa|up_6|NZ_CP042425.1_3372608_3372923_+	NA	NA|594aa|up_5|NZ_CP042425.1_3373069_3374851_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|385aa|up_4|NZ_CP042425.1_3374961_3376116_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|176aa|up_3|NZ_CP042425.1_3376123_3376651_-	pfam11026, DUF2721, Protein of unknown function (DUF2721)	NA|256aa|up_2|NZ_CP042425.1_3377107_3377875_-	NA	NA|115aa|up_1|NZ_CP042425.1_3377871_3378216_-	TIGR03433, padR_acidobact, transcriptional regulator, Acidobacterial, PadR-family	NA|532aa|up_0|NZ_CP042425.1_3379126_3380722_+	pfam05731, TROVE, TROVE domain	NA|155aa|down_0|NZ_CP042425.1_3381358_3381823_-	NA	NA|504aa|down_1|NZ_CP042425.1_3381833_3383345_-	NA	NA|439aa|down_2|NZ_CP042425.1_3383451_3384768_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|201aa|down_3|NZ_CP042425.1_3384988_3385591_+	NA	NA|444aa|down_4|NZ_CP042425.1_3385826_3387158_+	pfam07642, BBP2, Putative beta-barrel porin-2, OmpL-like	NA|219aa|down_5|NZ_CP042425.1_3387208_3387865_-	COG4359, COG4359, Uncharacterized conserved protein [Function unknown]	NA|296aa|down_6|NZ_CP042425.1_3387866_3388754_-	pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase	NA|269aa|down_7|NZ_CP042425.1_3388833_3389640_-	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|425aa|down_8|NZ_CP042425.1_3389746_3391021_-	pfam13360, PQQ_2, PQQ-like domain	NA|843aa|down_9|NZ_CP042425.1_3391183_3393712_-	cd07341, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	6	3880317-3880424	6	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CTGTAGACGCGGTCGGCCGCCGCCGCGGGGGTGCGG	36	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|60aa|up_0|NZ_CP042425.1_3878899_3879079_+,NA|82aa|down_0|NZ_CP042425.1_3881088_3881334_-,NA|70aa|down_4|NZ_CP042425.1_3885828_3886038_+,NA|164aa|down_6|NZ_CP042425.1_3886311_3886803_-,NA|144aa|down_8|NZ_CP042425.1_3887891_3888323_-,NA|152aa|down_9|NZ_CP042425.1_3888882_3889338_-	NA|130aa|up_9|NZ_CP042425.1_3868293_3868683_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|286aa|up_8|NZ_CP042425.1_3868722_3869580_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|299aa|up_7|NZ_CP042425.1_3870177_3871074_+	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|279aa|up_6|NZ_CP042425.1_3871104_3871941_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|525aa|up_5|NZ_CP042425.1_3872197_3873772_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|259aa|up_4|NZ_CP042425.1_3873854_3874631_-	COG0702, COG0702, Predicted nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|143aa|up_3|NZ_CP042425.1_3874688_3875117_-	cd02234, cupin_BLR7677-like, Bradyrhizobium japonicum BLR7677 and related proteins, cupin domain	NA|206aa|up_2|NZ_CP042425.1_3875295_3875913_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|948aa|up_1|NZ_CP042425.1_3875929_3878773_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|60aa|up_0|NZ_CP042425.1_3878899_3879079_+	NA	NA|82aa|down_0|NZ_CP042425.1_3881088_3881334_-	NA	NA|375aa|down_1|NZ_CP042425.1_3881899_3883024_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|645aa|down_2|NZ_CP042425.1_3882962_3884897_-	pfam00207, A2M, Alpha-2-macroglobulin family	NA|260aa|down_3|NZ_CP042425.1_3884895_3885675_+	PRK08181, PRK08181, transposase; Validated	NA|70aa|down_4|NZ_CP042425.1_3885828_3886038_+	NA	NA|47aa|down_5|NZ_CP042425.1_3886031_3886172_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|164aa|down_6|NZ_CP042425.1_3886311_3886803_-	NA	NA|324aa|down_7|NZ_CP042425.1_3886892_3887864_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|144aa|down_8|NZ_CP042425.1_3887891_3888323_-	NA	NA|152aa|down_9|NZ_CP042425.1_3888882_3889338_-	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	7	3933785-3933952	7	CRISPRCasFinder	no	RT	RT,DEDDh,csa3,DinG,cas4,cas3	Unclear	GACGGTGTCGTCGCCGTCGCCACCGT	26	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|135aa|up_9|NZ_CP042425.1_3918278_3918683_-,NA|63aa|up_8|NZ_CP042425.1_3918743_3918932_-,NA|207aa|down_7|NZ_CP042425.1_3944721_3945342_+	NA|135aa|up_9|NZ_CP042425.1_3918278_3918683_-	NA	NA|63aa|up_8|NZ_CP042425.1_3918743_3918932_-	NA	NA|775aa|up_7|NZ_CP042425.1_3919021_3921346_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|266aa|up_6|NZ_CP042425.1_3921616_3922414_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|224aa|up_5|NZ_CP042425.1_3922606_3923278_+	pfam16227, DUF4886, Domain of unknown function (DUF4886)	NA|71aa|up_4|NZ_CP042425.1_3923626_3923839_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|351aa|up_3|NZ_CP042425.1_3923864_3924917_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|1603aa|up_2|NZ_CP042425.1_3925085_3929894_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|516aa|up_1|NZ_CP042425.1_3930406_3931954_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|75aa|up_0|NZ_CP042425.1_3932273_3932498_+	COG3328, COG3328, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|203aa|down_0|NZ_CP042425.1_3935584_3936193_+	pfam07638, Sigma70_ECF, ECF sigma factor	NA|826aa|down_1|NZ_CP042425.1_3936189_3938667_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|243aa|down_2|NZ_CP042425.1_3938831_3939560_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	RT|507aa|down_3|NZ_CP042425.1_3939808_3941329_-	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|661aa|down_4|NZ_CP042425.1_3941411_3943394_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|72aa|down_5|NZ_CP042425.1_3943784_3944000_+	COG3844, COG3844, Kynureninase [Amino acid transport and metabolism]	NA|213aa|down_6|NZ_CP042425.1_3943983_3944622_+	TIGR03035, trp_arylform, arylformamidase	NA|207aa|down_7|NZ_CP042425.1_3944721_3945342_+	NA	NA|228aa|down_8|NZ_CP042425.1_3946037_3946721_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|147aa|down_9|NZ_CP042425.1_3946841_3947282_+	pfam09424, YqeY, Yqey-like protein
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	8	3934442-3934575	8	CRISPRCasFinder	no	RT	RT,DEDDh,csa3,DinG,cas4,cas3	Unclear	GACGGTGTCGTCGCCGTCGCCACCGT	26	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|135aa|up_9|NZ_CP042425.1_3918278_3918683_-,NA|63aa|up_8|NZ_CP042425.1_3918743_3918932_-,NA|207aa|down_7|NZ_CP042425.1_3944721_3945342_+	NA|135aa|up_9|NZ_CP042425.1_3918278_3918683_-	NA	NA|63aa|up_8|NZ_CP042425.1_3918743_3918932_-	NA	NA|775aa|up_7|NZ_CP042425.1_3919021_3921346_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|266aa|up_6|NZ_CP042425.1_3921616_3922414_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|224aa|up_5|NZ_CP042425.1_3922606_3923278_+	pfam16227, DUF4886, Domain of unknown function (DUF4886)	NA|71aa|up_4|NZ_CP042425.1_3923626_3923839_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|351aa|up_3|NZ_CP042425.1_3923864_3924917_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|1603aa|up_2|NZ_CP042425.1_3925085_3929894_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|516aa|up_1|NZ_CP042425.1_3930406_3931954_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|75aa|up_0|NZ_CP042425.1_3932273_3932498_+	COG3328, COG3328, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|203aa|down_0|NZ_CP042425.1_3935584_3936193_+	pfam07638, Sigma70_ECF, ECF sigma factor	NA|826aa|down_1|NZ_CP042425.1_3936189_3938667_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|243aa|down_2|NZ_CP042425.1_3938831_3939560_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	RT|507aa|down_3|NZ_CP042425.1_3939808_3941329_-	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|661aa|down_4|NZ_CP042425.1_3941411_3943394_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|72aa|down_5|NZ_CP042425.1_3943784_3944000_+	COG3844, COG3844, Kynureninase [Amino acid transport and metabolism]	NA|213aa|down_6|NZ_CP042425.1_3943983_3944622_+	TIGR03035, trp_arylform, arylformamidase	NA|207aa|down_7|NZ_CP042425.1_3944721_3945342_+	NA	NA|228aa|down_8|NZ_CP042425.1_3946037_3946721_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|147aa|down_9|NZ_CP042425.1_3946841_3947282_+	pfam09424, YqeY, Yqey-like protein
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	9	4274574-4274758	1	PILER-CR	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	GACAAAGGGATGGCTCACTTCAAGGGCTGCAAGAACCT	38	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|111aa|up_7|NZ_CP042425.1_4265725_4266058_+,NA|81aa|up_6|NZ_CP042425.1_4266202_4266445_+,NA|96aa|up_5|NZ_CP042425.1_4266431_4266719_+,NA|266aa|up_4|NZ_CP042425.1_4266715_4267513_+,NA|597aa|down_0|NZ_CP042425.1_4275295_4277086_+,NA|80aa|down_3|NZ_CP042425.1_4281643_4281883_-	NA|711aa|up_9|NZ_CP042425.1_4261783_4263916_+	pfam13481, AAA_25, AAA domain	NA|383aa|up_8|NZ_CP042425.1_4263943_4265092_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|111aa|up_7|NZ_CP042425.1_4265725_4266058_+	NA	NA|81aa|up_6|NZ_CP042425.1_4266202_4266445_+	NA	NA|96aa|up_5|NZ_CP042425.1_4266431_4266719_+	NA	NA|266aa|up_4|NZ_CP042425.1_4266715_4267513_+	NA	NA|297aa|up_3|NZ_CP042425.1_4268949_4269840_+	cd08894, SRPBCC_CalC_Aha1-like_1, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins	NA|184aa|up_2|NZ_CP042425.1_4269896_4270448_+	COG1853, COG1853, Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family [General function prediction only]	NA|250aa|up_1|NZ_CP042425.1_4270493_4271243_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|194aa|up_0|NZ_CP042425.1_4271530_4272112_+	TIGR02999, Sig-70_X6, RNA polymerase sigma factor, TIGR02999 family	NA|597aa|down_0|NZ_CP042425.1_4275295_4277086_+	NA	NA|1042aa|down_1|NZ_CP042425.1_4277740_4280866_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|203aa|down_2|NZ_CP042425.1_4280884_4281493_-	pfam14417, MEDS, MEDS: MEthanogen/methylotroph, DcmR Sensory domain	NA|80aa|down_3|NZ_CP042425.1_4281643_4281883_-	NA	NA|200aa|down_4|NZ_CP042425.1_4281915_4282515_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|170aa|down_5|NZ_CP042425.1_4282641_4283151_+	pfam04224, DUF417, Protein of unknown function, DUF417	NA|222aa|down_6|NZ_CP042425.1_4283244_4283910_-	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|146aa|down_7|NZ_CP042425.1_4283926_4284364_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|483aa|down_8|NZ_CP042425.1_4285138_4286587_-	PRK06184, PRK06184, hypothetical protein; Provisional	NA|515aa|down_9|NZ_CP042425.1_4286728_4288273_-	PRK06834, PRK06834, hypothetical protein; Provisional
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	10	4379925-4379998	9	CRISPRCasFinder	no	RT	RT,DEDDh,csa3,DinG,cas4,cas3	Unclear	TTTCTTCCCGTTCGGGAAGATTTG	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|223aa|up_9|NZ_CP042425.1_4373012_4373681_-,NA|72aa|up_8|NZ_CP042425.1_4373670_4373886_-,NA|111aa|up_7|NZ_CP042425.1_4374012_4374345_-,NA|203aa|up_5|NZ_CP042425.1_4375848_4376457_+,NA|73aa|up_3|NZ_CP042425.1_4378050_4378269_-,NA|160aa|up_2|NZ_CP042425.1_4378479_4378959_-,NA|97aa|up_1|NZ_CP042425.1_4379005_4379296_-,NA|142aa|up_0|NZ_CP042425.1_4379481_4379907_+,NA|177aa|down_0|NZ_CP042425.1_4380120_4380651_+,NA|223aa|down_1|NZ_CP042425.1_4380943_4381612_-,NA|113aa|down_2|NZ_CP042425.1_4382140_4382479_-,NA|81aa|down_3|NZ_CP042425.1_4382972_4383215_-,NA|87aa|down_8|NZ_CP042425.1_4386530_4386791_-,NA|116aa|down_9|NZ_CP042425.1_4387063_4387411_+	NA|223aa|up_9|NZ_CP042425.1_4373012_4373681_-	NA	NA|72aa|up_8|NZ_CP042425.1_4373670_4373886_-	NA	NA|111aa|up_7|NZ_CP042425.1_4374012_4374345_-	NA	NA|459aa|up_6|NZ_CP042425.1_4374343_4375720_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|203aa|up_5|NZ_CP042425.1_4375848_4376457_+	NA	NA|411aa|up_4|NZ_CP042425.1_4376548_4377781_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|73aa|up_3|NZ_CP042425.1_4378050_4378269_-	NA	NA|160aa|up_2|NZ_CP042425.1_4378479_4378959_-	NA	NA|97aa|up_1|NZ_CP042425.1_4379005_4379296_-	NA	NA|142aa|up_0|NZ_CP042425.1_4379481_4379907_+	NA	NA|177aa|down_0|NZ_CP042425.1_4380120_4380651_+	NA	NA|223aa|down_1|NZ_CP042425.1_4380943_4381612_-	NA	NA|113aa|down_2|NZ_CP042425.1_4382140_4382479_-	NA	NA|81aa|down_3|NZ_CP042425.1_4382972_4383215_-	NA	NA|459aa|down_4|NZ_CP042425.1_4383788_4385165_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|127aa|down_5|NZ_CP042425.1_4385217_4385598_-	pfam07120, DUF1376, Protein of unknown function (DUF1376)	NA|154aa|down_6|NZ_CP042425.1_4385732_4386194_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|117aa|down_7|NZ_CP042425.1_4386190_4386541_-	pfam06356, DUF1064, Protein of unknown function (DUF1064)	NA|87aa|down_8|NZ_CP042425.1_4386530_4386791_-	NA	NA|116aa|down_9|NZ_CP042425.1_4387063_4387411_+	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	11	6741514-6741605	10	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CACCGCAGACGTTTGCGGTGGGG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|130aa|up_6|NZ_CP042425.1_6734757_6735147_-,NA|188aa|up_5|NZ_CP042425.1_6735149_6735713_-,NA|52aa|up_4|NZ_CP042425.1_6735811_6735967_-,NA|137aa|up_0|NZ_CP042425.1_6740676_6741087_-,NA|68aa|down_0|NZ_CP042425.1_6741715_6741919_-,NA|289aa|down_1|NZ_CP042425.1_6741915_6742782_-,NA|257aa|down_2|NZ_CP042425.1_6742903_6743674_-,NA|109aa|down_4|NZ_CP042425.1_6744159_6744486_-,NA|221aa|down_6|NZ_CP042425.1_6745098_6745761_+,NA|108aa|down_8|NZ_CP042425.1_6747183_6747507_+,NA|51aa|down_9|NZ_CP042425.1_6747555_6747708_+	NA|475aa|up_9|NZ_CP042425.1_6730646_6732071_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|84aa|up_8|NZ_CP042425.1_6732145_6732397_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|782aa|up_7|NZ_CP042425.1_6732412_6734758_-	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|130aa|up_6|NZ_CP042425.1_6734757_6735147_-	NA	NA|188aa|up_5|NZ_CP042425.1_6735149_6735713_-	NA	NA|52aa|up_4|NZ_CP042425.1_6735811_6735967_-	NA	NA|422aa|up_3|NZ_CP042425.1_6736737_6738003_+	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family	NA|376aa|up_2|NZ_CP042425.1_6738290_6739419_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|184aa|up_1|NZ_CP042425.1_6740065_6740617_+	pfam04542, Sigma70_r2, Sigma-70 region 2	NA|137aa|up_0|NZ_CP042425.1_6740676_6741087_-	NA	NA|68aa|down_0|NZ_CP042425.1_6741715_6741919_-	NA	NA|289aa|down_1|NZ_CP042425.1_6741915_6742782_-	NA	NA|257aa|down_2|NZ_CP042425.1_6742903_6743674_-	NA	NA|134aa|down_3|NZ_CP042425.1_6743712_6744114_-	pfam02082, Rrf2, Transcriptional regulator	NA|109aa|down_4|NZ_CP042425.1_6744159_6744486_-	NA	NA|106aa|down_5|NZ_CP042425.1_6744637_6744955_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|221aa|down_6|NZ_CP042425.1_6745098_6745761_+	NA	NA|446aa|down_7|NZ_CP042425.1_6745757_6747095_+	smart00974, T5orf172, This entry represents the putative helicase A859L	NA|108aa|down_8|NZ_CP042425.1_6747183_6747507_+	NA	NA|51aa|down_9|NZ_CP042425.1_6747555_6747708_+	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	12	6924982-6925105	11	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CGTGTAACAGTGCCCGGCCGAGAACTTTTCTCC	33	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|339aa|up_9|NZ_CP042425.1_6916482_6917499_+,NA|71aa|up_7|NZ_CP042425.1_6917917_6918130_-,NA|139aa|up_5|NZ_CP042425.1_6920349_6920766_+,NA|46aa|up_4|NZ_CP042425.1_6920878_6921016_-,NA|131aa|up_3|NZ_CP042425.1_6921282_6921675_+,NA|93aa|up_2|NZ_CP042425.1_6921671_6921950_+,NA|126aa|up_1|NZ_CP042425.1_6922214_6922592_-,NA|120aa|up_0|NZ_CP042425.1_6922631_6922991_-,NA|138aa|down_1|NZ_CP042425.1_6925723_6926137_+,NA|117aa|down_4|NZ_CP042425.1_6927824_6928175_-,NA|102aa|down_7|NZ_CP042425.1_6929590_6929896_-	NA|339aa|up_9|NZ_CP042425.1_6916482_6917499_+	NA	NA|73aa|up_8|NZ_CP042425.1_6917670_6917889_-	cd03512, Alkane-hydroxylase, Alkane hydroxylase is a bacterial, integral-membrane di-iron enzyme that shares a requirement for iron and oxygen for activity similar to that of the non-heme integral-membrane acyl coenzyme A (CoA) desaturases and acyl lipid desaturases	NA|71aa|up_7|NZ_CP042425.1_6917917_6918130_-	NA	NA|422aa|up_6|NZ_CP042425.1_6918665_6919931_-	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family	NA|139aa|up_5|NZ_CP042425.1_6920349_6920766_+	NA	NA|46aa|up_4|NZ_CP042425.1_6920878_6921016_-	NA	NA|131aa|up_3|NZ_CP042425.1_6921282_6921675_+	NA	NA|93aa|up_2|NZ_CP042425.1_6921671_6921950_+	NA	NA|126aa|up_1|NZ_CP042425.1_6922214_6922592_-	NA	NA|120aa|up_0|NZ_CP042425.1_6922631_6922991_-	NA	NA|182aa|down_0|NZ_CP042425.1_6925162_6925708_+	pfam00754, F5_F8_type_C, F5/8 type C domain	NA|138aa|down_1|NZ_CP042425.1_6925723_6926137_+	NA	NA|130aa|down_2|NZ_CP042425.1_6926118_6926508_-	cd05018, CoxG, Carbon monoxide dehydrogenase subunit G (CoxG)	NA|317aa|down_3|NZ_CP042425.1_6926762_6927713_+	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|117aa|down_4|NZ_CP042425.1_6927824_6928175_-	NA	NA|242aa|down_5|NZ_CP042425.1_6928202_6928928_-	pfam04945, YHS, YHS domain	NA|146aa|down_6|NZ_CP042425.1_6928986_6929424_-	PHA01748, PHA01748, hypothetical protein	NA|102aa|down_7|NZ_CP042425.1_6929590_6929896_-	NA	NA|925aa|down_8|NZ_CP042425.1_6930124_6932899_+	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|418aa|down_9|NZ_CP042425.1_6933099_6934353_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	13	7077338-7077446	12	CRISPRCasFinder	no	csa3	RT,DEDDh,csa3,DinG,cas4,cas3	Type I-A	CCACGCAAGCCACTGTTCCTCCGTCATCGCCCACCCCC	38	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|116aa|up_8|NZ_CP042425.1_7067620_7067968_+,NA|841aa|up_6|NZ_CP042425.1_7068979_7071502_+,NA|176aa|up_5|NZ_CP042425.1_7071585_7072113_+,NA|129aa|up_4|NZ_CP042425.1_7072147_7072534_+,NA|203aa|up_1|NZ_CP042425.1_7075937_7076546_+,NA|82aa|up_0|NZ_CP042425.1_7076701_7076947_-,NA|72aa|down_0|NZ_CP042425.1_7078151_7078367_-,NA|94aa|down_1|NZ_CP042425.1_7078389_7078671_-,NA|114aa|down_2|NZ_CP042425.1_7078768_7079110_-,NA|60aa|down_5|NZ_CP042425.1_7080348_7080528_-,NA|279aa|down_9|NZ_CP042425.1_7083954_7084791_+	NA|103aa|up_9|NZ_CP042425.1_7067163_7067472_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|116aa|up_8|NZ_CP042425.1_7067620_7067968_+	NA	NA|267aa|up_7|NZ_CP042425.1_7068044_7068845_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|841aa|up_6|NZ_CP042425.1_7068979_7071502_+	NA	NA|176aa|up_5|NZ_CP042425.1_7071585_7072113_+	NA	NA|129aa|up_4|NZ_CP042425.1_7072147_7072534_+	NA	NA|311aa|up_3|NZ_CP042425.1_7073102_7074035_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|459aa|up_2|NZ_CP042425.1_7074432_7075809_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|203aa|up_1|NZ_CP042425.1_7075937_7076546_+	NA	NA|82aa|up_0|NZ_CP042425.1_7076701_7076947_-	NA	NA|72aa|down_0|NZ_CP042425.1_7078151_7078367_-	NA	NA|94aa|down_1|NZ_CP042425.1_7078389_7078671_-	NA	NA|114aa|down_2|NZ_CP042425.1_7078768_7079110_-	NA	NA|167aa|down_3|NZ_CP042425.1_7079221_7079722_-	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|129aa|down_4|NZ_CP042425.1_7079868_7080255_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|60aa|down_5|NZ_CP042425.1_7080348_7080528_-	NA	NA|82aa|down_6|NZ_CP042425.1_7080704_7080950_-	pfam09855, zinc_ribbon_13, Nucleic-acid-binding protein containing Zn-ribbon domain (DUF2082)	NA|435aa|down_7|NZ_CP042425.1_7081547_7082852_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|116aa|down_8|NZ_CP042425.1_7083124_7083472_+	pfam06250, DUF1016, Protein of unknown function (DUF1016)	NA|279aa|down_9|NZ_CP042425.1_7083954_7084791_+	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	14	7567577-7567675	13	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	GGCAACGGATCGTGCCGAGTGCGAC	25	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|79aa|up_0|NZ_CP042425.1_7567119_7567356_-,NA|81aa|down_3|NZ_CP042425.1_7581942_7582185_+,NA|123aa|down_6|NZ_CP042425.1_7586128_7586497_-,NA|468aa|down_7|NZ_CP042425.1_7586884_7588288_+,NA|70aa|down_8|NZ_CP042425.1_7589003_7589213_+	NA|325aa|up_9|NZ_CP042425.1_7552342_7553317_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|469aa|up_8|NZ_CP042425.1_7553989_7555396_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1118aa|up_7|NZ_CP042425.1_7555522_7558876_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|498aa|up_6|NZ_CP042425.1_7558962_7560456_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|518aa|up_5|NZ_CP042425.1_7560500_7562054_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|479aa|up_4|NZ_CP042425.1_7562598_7564035_-	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional	NA|57aa|up_3|NZ_CP042425.1_7565814_7565985_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|115aa|up_2|NZ_CP042425.1_7566060_7566405_-	TIGR02999, Sig-70_X6, RNA polymerase sigma factor, TIGR02999 family	NA|106aa|up_1|NZ_CP042425.1_7566554_7566872_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|79aa|up_0|NZ_CP042425.1_7567119_7567356_-	NA	NA|3394aa|down_0|NZ_CP042425.1_7568029_7578211_-	cd14955, NHL_like_4, Uncharacterized NHL-repeat domain in bacterial and archaeal proteins	NA|761aa|down_1|NZ_CP042425.1_7578453_7580736_-	cd03143, A4_beta-galactosidase_middle_domain, A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain	NA|317aa|down_2|NZ_CP042425.1_7580851_7581802_-	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|81aa|down_3|NZ_CP042425.1_7581942_7582185_+	NA	NA|746aa|down_4|NZ_CP042425.1_7582416_7584654_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|385aa|down_5|NZ_CP042425.1_7584708_7585863_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|123aa|down_6|NZ_CP042425.1_7586128_7586497_-	NA	NA|468aa|down_7|NZ_CP042425.1_7586884_7588288_+	NA	NA|70aa|down_8|NZ_CP042425.1_7589003_7589213_+	NA	NA|95aa|down_9|NZ_CP042425.1_7589292_7589577_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	15	8569816-8569907	14	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CACCGCAGACGTTTGCGGTGGGG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|170aa|up_4|NZ_CP042425.1_8564826_8565336_-,NA|160aa|up_3|NZ_CP042425.1_8565332_8565812_-,NA|189aa|up_2|NZ_CP042425.1_8565885_8566452_-,NA|137aa|up_0|NZ_CP042425.1_8568977_8569388_-,NA|68aa|down_0|NZ_CP042425.1_8570017_8570221_-,NA|289aa|down_1|NZ_CP042425.1_8570217_8571084_-,NA|257aa|down_2|NZ_CP042425.1_8571205_8571976_-,NA|111aa|down_4|NZ_CP042425.1_8572454_8572787_-,NA|167aa|down_6|NZ_CP042425.1_8573328_8573829_+	NA|143aa|up_9|NZ_CP042425.1_8557143_8557572_+	pfam13581, HATPase_c_2, Histidine kinase-like ATPase domain	NA|832aa|up_8|NZ_CP042425.1_8557771_8560267_+	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|290aa|up_7|NZ_CP042425.1_8560366_8561236_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|106aa|up_6|NZ_CP042425.1_8563070_8563388_-	COG3311, AlpA, Predicted transcriptional regulator [Transcription]	NA|381aa|up_5|NZ_CP042425.1_8563687_8564830_-	pfam13481, AAA_25, AAA domain	NA|170aa|up_4|NZ_CP042425.1_8564826_8565336_-	NA	NA|160aa|up_3|NZ_CP042425.1_8565332_8565812_-	NA	NA|189aa|up_2|NZ_CP042425.1_8565885_8566452_-	NA	NA|184aa|up_1|NZ_CP042425.1_8568366_8568918_+	pfam04542, Sigma70_r2, Sigma-70 region 2	NA|137aa|up_0|NZ_CP042425.1_8568977_8569388_-	NA	NA|68aa|down_0|NZ_CP042425.1_8570017_8570221_-	NA	NA|289aa|down_1|NZ_CP042425.1_8570217_8571084_-	NA	NA|257aa|down_2|NZ_CP042425.1_8571205_8571976_-	NA	NA|133aa|down_3|NZ_CP042425.1_8572010_8572409_-	pfam02082, Rrf2, Transcriptional regulator	NA|111aa|down_4|NZ_CP042425.1_8572454_8572787_-	NA	NA|97aa|down_5|NZ_CP042425.1_8572963_8573254_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|167aa|down_6|NZ_CP042425.1_8573328_8573829_+	NA	NA|125aa|down_7|NZ_CP042425.1_8573905_8574280_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|147aa|down_8|NZ_CP042425.1_8574320_8574761_-	pfam04940, BLUF, Sensors of blue-light using FAD	NA|176aa|down_9|NZ_CP042425.1_8575111_8575639_+	pfam11336, DUF3138, Protein of unknown function (DUF3138)
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	16	8628253-8628341	15	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	GAAAACCCGATCATGTCCGGGTT	23	1	1	8628276-8628318	NZ_CP042425.1_8703863-8703905	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|121aa|up_8|NZ_CP042425.1_8620099_8620462_-,NA|69aa|up_6|NZ_CP042425.1_8621865_8622072_+,NA|141aa|up_3|NZ_CP042425.1_8624200_8624623_+,NA|109aa|up_2|NZ_CP042425.1_8624799_8625126_+,NA|90aa|down_1|NZ_CP042425.1_8629591_8629861_-,NA|133aa|down_5|NZ_CP042425.1_8634804_8635203_+,NA|118aa|down_6|NZ_CP042425.1_8635202_8635556_+,NA|78aa|down_7|NZ_CP042425.1_8635552_8635786_+,NA|130aa|down_8|NZ_CP042425.1_8635796_8636186_+,NA|78aa|down_9|NZ_CP042425.1_8636182_8636416_+	NA|338aa|up_9|NZ_CP042425.1_8618878_8619892_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|121aa|up_8|NZ_CP042425.1_8620099_8620462_-	NA	NA|297aa|up_7|NZ_CP042425.1_8620809_8621700_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|69aa|up_6|NZ_CP042425.1_8621865_8622072_+	NA	NA|170aa|up_5|NZ_CP042425.1_8622114_8622624_+	pfam11149, DUF2924, Protein of unknown function (DUF2924)	NA|528aa|up_4|NZ_CP042425.1_8622620_8624204_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|141aa|up_3|NZ_CP042425.1_8624200_8624623_+	NA	NA|109aa|up_2|NZ_CP042425.1_8624799_8625126_+	NA	NA|496aa|up_1|NZ_CP042425.1_8625470_8626958_+	cd16410, ParB_N_like, ParB N-terminal, parA-binding, -like domain of bacterial and plasmid parABS partitioning systems	NA|414aa|up_0|NZ_CP042425.1_8626954_8628196_+	pfam09250, Prim-Pol, Bifunctional DNA primase/polymerase, N-terminal	NA|306aa|down_0|NZ_CP042425.1_8628677_8629595_-	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|90aa|down_1|NZ_CP042425.1_8629591_8629861_-	NA	NA|308aa|down_2|NZ_CP042425.1_8629860_8630784_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|481aa|down_3|NZ_CP042425.1_8630865_8632308_-	TIGR01386, Probable_sensor_protein_PcoS, heavy metal sensor kinase	NA|403aa|down_4|NZ_CP042425.1_8633525_8634734_+	cd16402, ParB_N_like_MT, ParB N-terminal-like domain, some attached to C-terminal S-adenosylmethionine-dependent methyltransferase domain	NA|133aa|down_5|NZ_CP042425.1_8634804_8635203_+	NA	NA|118aa|down_6|NZ_CP042425.1_8635202_8635556_+	NA	NA|78aa|down_7|NZ_CP042425.1_8635552_8635786_+	NA	NA|130aa|down_8|NZ_CP042425.1_8635796_8636186_+	NA	NA|78aa|down_9|NZ_CP042425.1_8636182_8636416_+	NA
GCF_008254045.1_ASM825404v1	NZ_CP042425	Gemmataceae bacterium PX52 chromosome, complete genome	17	8640100-8640211	16	CRISPRCasFinder	no		RT,DEDDh,csa3,DinG,cas4,cas3	Orphan	CTGCTCCAACTTGTCGTGAGGATGACCCATGCGCGT	36	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,DinG,cas4,cas3	NA|118aa|up_9|NZ_CP042425.1_8635202_8635556_+,NA|78aa|up_8|NZ_CP042425.1_8635552_8635786_+,NA|130aa|up_7|NZ_CP042425.1_8635796_8636186_+,NA|78aa|up_6|NZ_CP042425.1_8636182_8636416_+,NA|57aa|up_4|NZ_CP042425.1_8637334_8637505_+,NA|102aa|up_3|NZ_CP042425.1_8637699_8638005_+,NA|96aa|up_2|NZ_CP042425.1_8638314_8638602_+,NA|119aa|up_1|NZ_CP042425.1_8638640_8638997_+,NA|143aa|up_0|NZ_CP042425.1_8639513_8639942_+,NA|63aa|down_6|NZ_CP042425.1_8650865_8651054_+,NA|156aa|down_7|NZ_CP042425.1_8651135_8651603_-	NA|118aa|up_9|NZ_CP042425.1_8635202_8635556_+	NA	NA|78aa|up_8|NZ_CP042425.1_8635552_8635786_+	NA	NA|130aa|up_7|NZ_CP042425.1_8635796_8636186_+	NA	NA|78aa|up_6|NZ_CP042425.1_8636182_8636416_+	NA	NA|151aa|up_5|NZ_CP042425.1_8636772_8637225_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|57aa|up_4|NZ_CP042425.1_8637334_8637505_+	NA	NA|102aa|up_3|NZ_CP042425.1_8637699_8638005_+	NA	NA|96aa|up_2|NZ_CP042425.1_8638314_8638602_+	NA	NA|119aa|up_1|NZ_CP042425.1_8638640_8638997_+	NA	NA|143aa|up_0|NZ_CP042425.1_8639513_8639942_+	NA	NA|78aa|down_0|NZ_CP042425.1_8640421_8640655_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|338aa|down_1|NZ_CP042425.1_8640741_8641755_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|222aa|down_2|NZ_CP042425.1_8641959_8642625_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|651aa|down_3|NZ_CP042425.1_8642602_8644555_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|832aa|down_4|NZ_CP042425.1_8646194_8648690_+	cd02077, P-type_ATPase_Mg, magnesium transporting ATPase (MgtA), similar to Escherichia coli MgtA and Salmonella typhimurium MgtA	NA|412aa|down_5|NZ_CP042425.1_8649378_8650614_-	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|63aa|down_6|NZ_CP042425.1_8650865_8651054_+	NA	NA|156aa|down_7|NZ_CP042425.1_8651135_8651603_-	NA	NA|169aa|down_8|NZ_CP042425.1_8651761_8652268_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|86aa|down_9|NZ_CP042425.1_8652264_8652522_+	cd00590, RRM_SF, RNA recognition motif (RRM) superfamily
