assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	1	9-13233	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Unclear	GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC,GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC,GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC,GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC,GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC	36,36,36,36,36	0	0	NA	NA	NA:NA:NA:NA:NA	177,181,181,177,177	181	Unclear	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA,NA|87aa|down_0|NZ_CP025958.1_13700_13961_+,cas8u1|360aa|down_1|NZ_CP025958.1_14056_15136_-	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|87aa|down_0|NZ_CP025958.1_13700_13961_+	NA	cas8u1|360aa|down_1|NZ_CP025958.1_14056_15136_-	NA	cas5u|500aa|down_2|NZ_CP025958.1_15176_16676_-	cd09667, Csb2_I-U, CRISPR/Cas system-associated protein Csb2	cas7|376aa|down_3|NZ_CP025958.1_16675_17803_-	cd09678, Csb1_I-U, CRISPR/Cas system-associated protein Csb1	cas8u2|280aa|down_4|NZ_CP025958.1_17799_18639_-	cd09765, Csx14_I-U, CRISPR/Cas system-associated protein Csx14	cas3|860aa|down_5|NZ_CP025958.1_18635_21215_-	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	cas2|95aa|down_6|NZ_CP025958.1_21227_21512_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|572aa|down_7|NZ_CP025958.1_21511_23227_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|383aa|down_8|NZ_CP025958.1_23774_24923_-	cd06112, citrate_synt_like_1_1, Citrate synthase (CS) catalyzes the condensation of acetyl coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the oxidative citric acid cycle (TCA or Krebs cycle)	NA|288aa|down_9|NZ_CP025958.1_25486_26350_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	2	164478-164577	2	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	CCGGTCGGCGCTGTAACCCCCGTTACACC	29	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|84aa|up_5|NZ_CP025958.1_155676_155928_+,NA|43aa|up_3|NZ_CP025958.1_156574_156703_-,NA|284aa|down_4|NZ_CP025958.1_167657_168509_+,NA|201aa|down_6|NZ_CP025958.1_169661_170264_+	NA|812aa|up_9|NZ_CP025958.1_148688_151124_+	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|449aa|up_8|NZ_CP025958.1_151257_152604_+	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|378aa|up_7|NZ_CP025958.1_152802_153936_-	cd00688, ISOPREN_C2_like, This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement	NA|316aa|up_6|NZ_CP025958.1_154674_155622_+	TIGR01172, Serine_acetyltransferase, serine O-acetyltransferase	NA|84aa|up_5|NZ_CP025958.1_155676_155928_+	NA	NA|162aa|up_4|NZ_CP025958.1_155947_156433_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|43aa|up_3|NZ_CP025958.1_156574_156703_-	NA	NA|228aa|up_2|NZ_CP025958.1_157419_158103_-	cd06559, Endonuclease_V, Endonuclease_V, a DNA repair enzyme that initiates repair of nitrosative deaminated purine bases	NA|1316aa|up_1|NZ_CP025958.1_158399_162347_+	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)	NA|552aa|up_0|NZ_CP025958.1_162432_164088_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|215aa|down_0|NZ_CP025958.1_164611_165256_-	PRK01103, PRK01103, bifunctional DNA-formamidopyrimidine glycosylase/DNA-(apurinic or apyrimidinic site) lyase	NA|119aa|down_1|NZ_CP025958.1_165306_165663_-	pfam01149, Fapy_DNA_glyco, Formamidopyrimidine-DNA glycosylase N-terminal domain	NA|406aa|down_2|NZ_CP025958.1_165793_167011_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|123aa|down_3|NZ_CP025958.1_167285_167654_+	cd14265, UDPK_IM_like, Integral membrane undecaprenol kinase and similar enzymes	NA|284aa|down_4|NZ_CP025958.1_167657_168509_+	NA	NA|185aa|down_5|NZ_CP025958.1_168977_169532_+	sd00045, ANK, ankyrin repeats	NA|201aa|down_6|NZ_CP025958.1_169661_170264_+	NA	NA|412aa|down_7|NZ_CP025958.1_170334_171570_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|167aa|down_8|NZ_CP025958.1_171614_172115_+	TIGR03067, Planc_TIGR03067, Planctomycetes uncharacterized domain TIGR03067	NA|334aa|down_9|NZ_CP025958.1_172238_173240_+	sd00006, TPR, Tetratricopeptide repeat
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	3	353476-353559	3	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GAAGAGGATTGGCCGCAAAGAAGCACAAA	29	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|130aa|up_7|NZ_CP025958.1_339222_339612_-,NA|286aa|down_1|NZ_CP025958.1_355430_356288_-,NA|195aa|down_4|NZ_CP025958.1_358640_359225_-,NA|59aa|down_8|NZ_CP025958.1_363890_364067_-	NA|904aa|up_9|NZ_CP025958.1_335125_337837_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|362aa|up_8|NZ_CP025958.1_337939_339025_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|130aa|up_7|NZ_CP025958.1_339222_339612_-	NA	NA|489aa|up_6|NZ_CP025958.1_339843_341310_-	PRK12810, gltD, glutamate synthase subunit beta; Reviewed	NA|1528aa|up_5|NZ_CP025958.1_341495_346079_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|921aa|up_4|NZ_CP025958.1_346392_349155_-	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|440aa|up_3|NZ_CP025958.1_349253_350573_-	PRK06830, PRK06830, ATP-dependent 6-phosphofructokinase	NA|232aa|up_2|NZ_CP025958.1_350941_351637_-	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|376aa|up_1|NZ_CP025958.1_351769_352897_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|168aa|up_0|NZ_CP025958.1_352928_353432_-	cd06121, cupin_YML079wp, Saccharomyces cerevisiae YML079wp and related proteins, cupin domain	NA|502aa|down_0|NZ_CP025958.1_353720_355226_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|286aa|down_1|NZ_CP025958.1_355430_356288_-	NA	NA|309aa|down_2|NZ_CP025958.1_356305_357232_-	COG1957, URH1, Inosine-uridine nucleoside N-ribohydrolase [Nucleotide transport and metabolism]	NA|292aa|down_3|NZ_CP025958.1_357515_358391_+	pfam01904, DUF72, Protein of unknown function DUF72	NA|195aa|down_4|NZ_CP025958.1_358640_359225_-	NA	NA|342aa|down_5|NZ_CP025958.1_359461_360487_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|490aa|down_6|NZ_CP025958.1_360619_362089_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|549aa|down_7|NZ_CP025958.1_362177_363824_-	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|59aa|down_8|NZ_CP025958.1_363890_364067_-	NA	NA|89aa|down_9|NZ_CP025958.1_364290_364557_-	cd04240, AAK_UC, AAK_UC: Uncharacterized (UC) amino acid kinase-like proteins found mainly in archaea and a few bacteria
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	4	677792-677898	4	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GGTGCCATCAGCCTGATAGGCGAGTTGTTTCCGCC	35	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|107aa|up_8|NZ_CP025958.1_670847_671168_-,NA|90aa|up_7|NZ_CP025958.1_671284_671554_-,NA|186aa|up_6|NZ_CP025958.1_672204_672762_-,NA|204aa|up_5|NZ_CP025958.1_673555_674167_-,NA|93aa|up_4|NZ_CP025958.1_674361_674640_+,NA|131aa|up_3|NZ_CP025958.1_674750_675143_-,NA|54aa|up_2|NZ_CP025958.1_675804_675966_-,NA|109aa|up_1|NZ_CP025958.1_676134_676461_+,NA|63aa|up_0|NZ_CP025958.1_677289_677478_+,NA|361aa|down_0|NZ_CP025958.1_678502_679585_+,NA|167aa|down_4|NZ_CP025958.1_683910_684411_-,NA|256aa|down_6|NZ_CP025958.1_685800_686568_+	NA|168aa|up_9|NZ_CP025958.1_670256_670760_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|107aa|up_8|NZ_CP025958.1_670847_671168_-	NA	NA|90aa|up_7|NZ_CP025958.1_671284_671554_-	NA	NA|186aa|up_6|NZ_CP025958.1_672204_672762_-	NA	NA|204aa|up_5|NZ_CP025958.1_673555_674167_-	NA	NA|93aa|up_4|NZ_CP025958.1_674361_674640_+	NA	NA|131aa|up_3|NZ_CP025958.1_674750_675143_-	NA	NA|54aa|up_2|NZ_CP025958.1_675804_675966_-	NA	NA|109aa|up_1|NZ_CP025958.1_676134_676461_+	NA	NA|63aa|up_0|NZ_CP025958.1_677289_677478_+	NA	NA|361aa|down_0|NZ_CP025958.1_678502_679585_+	NA	NA|541aa|down_1|NZ_CP025958.1_679969_681592_+	COG0034, PurF, Glutamine phosphoribosylpyrophosphate amidotransferase [Nucleotide transport and metabolism]	NA|402aa|down_2|NZ_CP025958.1_681644_682850_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|362aa|down_3|NZ_CP025958.1_682846_683932_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|167aa|down_4|NZ_CP025958.1_683910_684411_-	NA	NA|434aa|down_5|NZ_CP025958.1_684445_685747_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|256aa|down_6|NZ_CP025958.1_685800_686568_+	NA	NA|526aa|down_7|NZ_CP025958.1_686866_688444_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|420aa|down_8|NZ_CP025958.1_688480_689740_+	pfam01555, N6_N4_Mtase, DNA methylase	NA|137aa|down_9|NZ_CP025958.1_689812_690223_+	pfam08906, DUF1851, Domain of unknown function (DUF1851)
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	5	699017-699372	2	CRT	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GTCTCGNNNACCGGCTTGCA	20	0	0	NA	NA	NA	7	7	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|256aa|up_8|NZ_CP025958.1_685800_686568_+,NA|260aa|up_0|NZ_CP025958.1_696814_697594_-,NA|103aa|down_0|NZ_CP025958.1_700584_700893_+,NA|94aa|down_8|NZ_CP025958.1_712397_712679_-	NA|434aa|up_9|NZ_CP025958.1_684445_685747_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|256aa|up_8|NZ_CP025958.1_685800_686568_+	NA	NA|526aa|up_7|NZ_CP025958.1_686866_688444_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|420aa|up_6|NZ_CP025958.1_688480_689740_+	pfam01555, N6_N4_Mtase, DNA methylase	NA|137aa|up_5|NZ_CP025958.1_689812_690223_+	pfam08906, DUF1851, Domain of unknown function (DUF1851)	NA|264aa|up_4|NZ_CP025958.1_690259_691051_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|334aa|up_3|NZ_CP025958.1_691335_692337_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|237aa|up_2|NZ_CP025958.1_692666_693377_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|1031aa|up_1|NZ_CP025958.1_693470_696563_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|260aa|up_0|NZ_CP025958.1_696814_697594_-	NA	NA|103aa|down_0|NZ_CP025958.1_700584_700893_+	NA	NA|481aa|down_1|NZ_CP025958.1_701055_702498_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|245aa|down_2|NZ_CP025958.1_702631_703366_+	cd06915, NTP_transferase_WcbM_like, WcbM_like is a subfamily of nucleotidyl transferases	NA|184aa|down_3|NZ_CP025958.1_703355_703907_+	cd07503, HAD_HisB-N, histidinol phosphate phosphatase and related phosphatases	NA|1085aa|down_4|NZ_CP025958.1_704232_707487_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|205aa|down_5|NZ_CP025958.1_707699_708314_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|506aa|down_6|NZ_CP025958.1_709745_711263_+	pfam13646, HEAT_2, HEAT repeats	NA|244aa|down_7|NZ_CP025958.1_711519_712251_-	pfam00756, Esterase, Putative esterase	NA|94aa|down_8|NZ_CP025958.1_712397_712679_-	NA	NA|422aa|down_9|NZ_CP025958.1_712703_713969_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	6	1159481-1159674	3	CRT	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GTCGTTGCCGGCCCCGCCGA	20	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA,NA	NA|1031aa|up_9|NZ_CP025958.1_1143717_1146810_-	cd01920, cyclophilin_EcCYP_like, cyclophilin_EcCYP_like: cyclophilin-type A-like peptidylprolyl cis- trans isomerase (PPIase) domain similar to the cytosolic E	NA|796aa|up_8|NZ_CP025958.1_1147430_1149818_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|288aa|up_7|NZ_CP025958.1_1149873_1150737_-	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|367aa|up_6|NZ_CP025958.1_1150749_1151850_-	PRK00742, PRK00742, chemotaxis-specific protein-glutamate methyltransferase CheB	NA|620aa|up_5|NZ_CP025958.1_1151907_1153767_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|651aa|up_4|NZ_CP025958.1_1153791_1155744_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|102aa|up_3|NZ_CP025958.1_1156150_1156456_+	pfam01740, STAS, STAS domain	NA|327aa|up_2|NZ_CP025958.1_1156485_1157466_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|122aa|up_1|NZ_CP025958.1_1157477_1157843_+	cd17562, REC_CheY4-like, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY4 and similar CheY family proteins	NA|278aa|up_0|NZ_CP025958.1_1157854_1158688_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|197aa|down_0|NZ_CP025958.1_1169134_1169725_+	TIGR02999, Sig-70_X6, RNA polymerase sigma factor, TIGR02999 family	NA|1080aa|down_1|NZ_CP025958.1_1169803_1173043_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|377aa|down_2|NZ_CP025958.1_1173219_1174350_-	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|484aa|down_3|NZ_CP025958.1_1177000_1178452_+	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|885aa|down_4|NZ_CP025958.1_1179177_1181832_+	PRK10669, PRK10669, putative cation:proton antiport protein; Provisional	NA|328aa|down_5|NZ_CP025958.1_1181954_1182938_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|1135aa|down_6|NZ_CP025958.1_1183008_1186413_-	pfam13646, HEAT_2, HEAT repeats	NA|260aa|down_7|NZ_CP025958.1_1186893_1187673_-	pfam06167, Peptidase_M90, Glucose-regulated metallo-peptidase M90	NA|611aa|down_8|NZ_CP025958.1_1187859_1189692_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|386aa|down_9|NZ_CP025958.1_1189943_1191101_-	pfam01070, FMN_dh, FMN-dependent dehydrogenase
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	7	1693521-1693628	5	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GGATGAAGACAGATCAGTTTTAAATCCTGTTTATCC	36	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|155aa|up_8|NZ_CP025958.1_1682399_1682864_+,NA|1303aa|up_1|NZ_CP025958.1_1688476_1692385_-,NA|232aa|down_6|NZ_CP025958.1_1701766_1702462_+,NA|320aa|down_9|NZ_CP025958.1_1704769_1705729_-	NA|433aa|up_9|NZ_CP025958.1_1680421_1681720_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|155aa|up_8|NZ_CP025958.1_1682399_1682864_+	NA	NA|159aa|up_7|NZ_CP025958.1_1683145_1683622_+	pfam13648, Lipocalin_4, Lipocalin-like domain	NA|68aa|up_6|NZ_CP025958.1_1684390_1684594_+	pfam17210, SdrD_B, SdrD B-like domain	NA|278aa|up_5|NZ_CP025958.1_1685563_1686397_-	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|266aa|up_4|NZ_CP025958.1_1686486_1687284_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|251aa|up_3|NZ_CP025958.1_1687283_1688036_-	COG0619, CbiQ, ABC-type cobalt transport system, permease component CbiQ and related transporters [Inorganic ion transport and metabolism]	NA|56aa|up_2|NZ_CP025958.1_1688032_1688200_-	pfam11154, DUF2934, Protein of unknown function (DUF2934)	NA|1303aa|up_1|NZ_CP025958.1_1688476_1692385_-	NA	NA|316aa|up_0|NZ_CP025958.1_1692474_1693422_+	PRK05710, PRK05710, tRNA glutamyl-Q(34) synthetase GluQRS	NA|368aa|down_0|NZ_CP025958.1_1693792_1694896_-	cd00831, CHS_like, Chalcone and stilbene synthases; plant-specific polyketide synthases (PKS) and related enzymes, also called type III PKSs	NA|368aa|down_1|NZ_CP025958.1_1694892_1695996_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|218aa|down_2|NZ_CP025958.1_1695980_1696634_-	PRK06202, PRK06202, hypothetical protein; Provisional	NA|232aa|down_3|NZ_CP025958.1_1696792_1697488_+	pfam04219, DUF413, Protein of unknown function, DUF	NA|398aa|down_4|NZ_CP025958.1_1698073_1699267_-	pfam01204, Trehalase, Trehalase	NA|720aa|down_5|NZ_CP025958.1_1699426_1701586_-	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|232aa|down_6|NZ_CP025958.1_1701766_1702462_+	NA	NA|506aa|down_7|NZ_CP025958.1_1702580_1704098_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|174aa|down_8|NZ_CP025958.1_1704151_1704673_+	pfam05818, TraT, Enterobacterial TraT complement resistance protein	NA|320aa|down_9|NZ_CP025958.1_1704769_1705729_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	8	2153723-2161082	6,4,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas6,cas3,cas8b3,cas7,cas1,cas2	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Unclear	GTGCAGACTTCAGTGACGCCGTGAGGCGTTGAGCAC,GTGCAGACTTCAGTGACGCCGTGAGGCGTTGAGCAC,GTGCAGACTTCAGTGACGCCGTGAGGCGTTGAGCAC,GTGCAGACTTCAGTGACGCCGTGAGGCGTTGAGCAC	36,36,36,36	0	0	NA	NA	NA:NA:NA:NA	102,102,98,98	102	Unclear	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|64aa|up_7|NZ_CP025958.1_2140438_2140630_-,NA|64aa|down_1|NZ_CP025958.1_2162455_2162647_-	NA|598aa|up_9|NZ_CP025958.1_2137953_2139747_-	pfam03796, DnaB_C, DnaB-like helicase C terminal domain	NA|124aa|up_8|NZ_CP025958.1_2140070_2140442_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|64aa|up_7|NZ_CP025958.1_2140438_2140630_-	NA	cas6|195aa|up_6|NZ_CP025958.1_2145161_2145746_+	cd09703, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas3|770aa|up_5|NZ_CP025958.1_2145742_2148052_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas8b3|571aa|up_4|NZ_CP025958.1_2148064_2149777_+	TIGR04413, hypothetical_protein_LEP1GSC082_4029, CRISPR type MYXAN-associated protein Cmx8	cas7|305aa|up_3|NZ_CP025958.1_2149798_2150713_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	NA|254aa|up_2|NZ_CP025958.1_2150712_2151474_+	TIGR02586, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5/DevS, subtype MYXAN	cas1|554aa|up_1|NZ_CP025958.1_2151489_2153151_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|100aa|up_0|NZ_CP025958.1_2153160_2153460_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|174aa|down_0|NZ_CP025958.1_2161213_2161735_-	pfam11251, DUF3050, Protein of unknown function (DUF3050)	NA|64aa|down_1|NZ_CP025958.1_2162455_2162647_-	NA	NA|115aa|down_2|NZ_CP025958.1_2162648_2162993_+	TIGR01764, Probable_excisionase, DNA binding domain, excisionase family	NA|145aa|down_3|NZ_CP025958.1_2163000_2163435_+	pfam13470, PIN_3, PIN domain	NA|888aa|down_4|NZ_CP025958.1_2164275_2166939_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|2599aa|down_5|NZ_CP025958.1_2167110_2174907_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|221aa|down_6|NZ_CP025958.1_2176303_2176966_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|208aa|down_7|NZ_CP025958.1_2176978_2177602_-	pfam13005, zf-IS66, zinc-finger binding domain of transposase IS66	NA|285aa|down_8|NZ_CP025958.1_2177648_2178503_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|366aa|down_9|NZ_CP025958.1_2178779_2179877_-	pfam04991, LicD, LicD family
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	9	3811007-3811404	6,7,5	PILER-CR,CRISPRCasFinder,CRT	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	CGTGTTCCCCACGCTCGTGGGGGTGAACCG,CGTGTTCCCCACGCTCGTGGGGGTGAACCG,GTGTTCCCCACGCTCGTGGGGGTGAACCG	30,30,29	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	4,6,6	6	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|261aa|up_8|NZ_CP025958.1_3804353_3805136_+,NA|304aa|up_7|NZ_CP025958.1_3805142_3806054_+,NA|86aa|up_6|NZ_CP025958.1_3806390_3806648_+,NA|64aa|up_5|NZ_CP025958.1_3806644_3806836_+,NA|87aa|up_3|NZ_CP025958.1_3808565_3808826_+,NA|178aa|up_1|NZ_CP025958.1_3810033_3810567_-,NA|80aa|up_0|NZ_CP025958.1_3810746_3810986_-,NA|243aa|down_1|NZ_CP025958.1_3811953_3812682_+	NA|192aa|up_9|NZ_CP025958.1_3803790_3804366_+	pfam10145, PhageMin_Tail, Phage-related minor tail protein	NA|261aa|up_8|NZ_CP025958.1_3804353_3805136_+	NA	NA|304aa|up_7|NZ_CP025958.1_3805142_3806054_+	NA	NA|86aa|up_6|NZ_CP025958.1_3806390_3806648_+	NA	NA|64aa|up_5|NZ_CP025958.1_3806644_3806836_+	NA	NA|406aa|up_4|NZ_CP025958.1_3806860_3808078_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|87aa|up_3|NZ_CP025958.1_3808565_3808826_+	NA	NA|189aa|up_2|NZ_CP025958.1_3809359_3809926_+	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|178aa|up_1|NZ_CP025958.1_3810033_3810567_-	NA	NA|80aa|up_0|NZ_CP025958.1_3810746_3810986_-	NA	NA|125aa|down_0|NZ_CP025958.1_3811565_3811940_+	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|243aa|down_1|NZ_CP025958.1_3811953_3812682_+	NA	NA|715aa|down_2|NZ_CP025958.1_3812953_3815098_+	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|532aa|down_3|NZ_CP025958.1_3815292_3816888_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|231aa|down_4|NZ_CP025958.1_3817091_3817784_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|502aa|down_5|NZ_CP025958.1_3818018_3819524_+	PRK00654, glgA, glycogen synthase GlgA	NA|540aa|down_6|NZ_CP025958.1_3819660_3821280_-	PRK13581, PRK13581, D-3-phosphoglycerate dehydrogenase; Provisional	NA|611aa|down_7|NZ_CP025958.1_3821669_3823502_-	pfam13360, PQQ_2, PQQ-like domain	NA|274aa|down_8|NZ_CP025958.1_3824073_3824895_+	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|397aa|down_9|NZ_CP025958.1_3825157_3826348_-	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	10	4031125-4031224	8	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GCACAAAGGGCACAAAAAGAAGCCAAAGACGAAGA	35	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|213aa|up_3|NZ_CP025958.1_4027335_4027974_-,NA|190aa|up_1|NZ_CP025958.1_4028881_4029451_+,NA|281aa|down_0|NZ_CP025958.1_4031376_4032219_-,NA|772aa|down_3|NZ_CP025958.1_4036283_4038599_+	NA|478aa|up_9|NZ_CP025958.1_4021304_4022738_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|86aa|up_8|NZ_CP025958.1_4022787_4023045_+	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|195aa|up_7|NZ_CP025958.1_4022932_4023517_+	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|275aa|up_6|NZ_CP025958.1_4023535_4024360_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|461aa|up_5|NZ_CP025958.1_4024411_4025794_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|480aa|up_4|NZ_CP025958.1_4025939_4027379_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|213aa|up_3|NZ_CP025958.1_4027335_4027974_-	NA	NA|208aa|up_2|NZ_CP025958.1_4027986_4028610_-	pfam12869, tRNA_anti-like, tRNA_anti-like	NA|190aa|up_1|NZ_CP025958.1_4028881_4029451_+	NA	NA|512aa|up_0|NZ_CP025958.1_4029556_4031092_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|281aa|down_0|NZ_CP025958.1_4031376_4032219_-	NA	NA|309aa|down_1|NZ_CP025958.1_4032589_4033516_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|813aa|down_2|NZ_CP025958.1_4033690_4036129_+	COG1277, NosY, ABC-type transport system involved in multi-copper enzyme maturation, permease component [General function prediction only]	NA|772aa|down_3|NZ_CP025958.1_4036283_4038599_+	NA	NA|251aa|down_4|NZ_CP025958.1_4038711_4039464_+	pfam02585, PIG-L, GlcNAc-PI de-N-acetylase	NA|1098aa|down_5|NZ_CP025958.1_4039814_4043108_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|694aa|down_6|NZ_CP025958.1_4043448_4045530_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|1178aa|down_7|NZ_CP025958.1_4045678_4049212_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|414aa|down_8|NZ_CP025958.1_4049580_4050822_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1188aa|down_9|NZ_CP025958.1_4050978_4054542_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	11	4183111-4183241	9	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TCCATCGGCATCCCGCCCATCGG	23	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|487aa|up_6|NZ_CP025958.1_4170166_4171627_+,NA|370aa|up_0|NZ_CP025958.1_4180711_4181821_-,NA|159aa|down_9|NZ_CP025958.1_4206952_4207429_+	NA|473aa|up_9|NZ_CP025958.1_4164646_4166065_+	cd17319, MFS_ExuT_GudP_like, Hexuronate transporter, Glucarate transporter, and similar transporters of the Major Facilitator Superfamily	NA|252aa|up_8|NZ_CP025958.1_4166066_4166822_-	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|734aa|up_7|NZ_CP025958.1_4167709_4169911_+	PRK11331, PRK11331, 5-methylcytosine-specific restriction enzyme subunit McrB; Provisional	NA|487aa|up_6|NZ_CP025958.1_4170166_4171627_+	NA	NA|386aa|up_5|NZ_CP025958.1_4171623_4172781_+	pfam10117, McrBC, McrBC 5-methylcytosine restriction system component	NA|212aa|up_4|NZ_CP025958.1_4173015_4173651_+	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|742aa|up_3|NZ_CP025958.1_4173765_4175991_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|396aa|up_2|NZ_CP025958.1_4176038_4177226_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|420aa|up_1|NZ_CP025958.1_4179284_4180544_-	pfam08305, NPCBM, NPCBM/NEW2 domain	NA|370aa|up_0|NZ_CP025958.1_4180711_4181821_-	NA	NA|1217aa|down_0|NZ_CP025958.1_4184825_4188476_-	COG0591, PutP, Na+/proline symporter [Amino acid transport and metabolism / General function prediction only]	NA|1313aa|down_1|NZ_CP025958.1_4188579_4192518_-	pfam10060, DUF2298, Uncharacterized membrane protein (DUF2298)	NA|963aa|down_2|NZ_CP025958.1_4192649_4195538_-	pfam07584, BatA, Aerotolerance regulator N-terminal	NA|317aa|down_3|NZ_CP025958.1_4195631_4196582_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|353aa|down_4|NZ_CP025958.1_4196682_4197741_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|409aa|down_5|NZ_CP025958.1_4197900_4199127_-	cd00688, ISOPREN_C2_like, This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement	NA|1665aa|down_6|NZ_CP025958.1_4199542_4204537_-	COG1520, COG1520, FOG: WD40-like repeat [Function unknown]	NA|366aa|down_7|NZ_CP025958.1_4204778_4205876_+	PRK05301, PRK05301, pyrroloquinoline quinone biosynthesis protein PqqE; Provisional	NA|294aa|down_8|NZ_CP025958.1_4205936_4206818_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|159aa|down_9|NZ_CP025958.1_4206952_4207429_+	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	12	4365297-4365817	7	PILER-CR	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	CTTCAATTCGGCCACGG	17	0	0	NA	NA	NA	7	7	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|107aa|up_9|NZ_CP025958.1_4361832_4362153_+,NA|64aa|up_7|NZ_CP025958.1_4362368_4362560_-,NA|129aa|up_6|NZ_CP025958.1_4362611_4362998_-,NA|143aa|up_5|NZ_CP025958.1_4362982_4363411_-,NA|101aa|up_3|NZ_CP025958.1_4364029_4364332_+,NA|91aa|up_2|NZ_CP025958.1_4364373_4364646_+,NA|93aa|up_1|NZ_CP025958.1_4364731_4365010_+,NA|61aa|up_0|NZ_CP025958.1_4365051_4365234_+,NA|629aa|down_0|NZ_CP025958.1_4365827_4367714_-,NA|169aa|down_1|NZ_CP025958.1_4367791_4368298_+,NA|159aa|down_2|NZ_CP025958.1_4368310_4368787_+,NA|361aa|down_3|NZ_CP025958.1_4368887_4369970_+,NA|161aa|down_4|NZ_CP025958.1_4369962_4370445_-,NA|159aa|down_6|NZ_CP025958.1_4371281_4371758_-,NA|150aa|down_7|NZ_CP025958.1_4371777_4372227_-,NA|198aa|down_9|NZ_CP025958.1_4373333_4373927_-	NA|107aa|up_9|NZ_CP025958.1_4361832_4362153_+	NA	NA|77aa|up_8|NZ_CP025958.1_4362133_4362364_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|64aa|up_7|NZ_CP025958.1_4362368_4362560_-	NA	NA|129aa|up_6|NZ_CP025958.1_4362611_4362998_-	NA	NA|143aa|up_5|NZ_CP025958.1_4362982_4363411_-	NA	NA|90aa|up_4|NZ_CP025958.1_4363602_4363872_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|101aa|up_3|NZ_CP025958.1_4364029_4364332_+	NA	NA|91aa|up_2|NZ_CP025958.1_4364373_4364646_+	NA	NA|93aa|up_1|NZ_CP025958.1_4364731_4365010_+	NA	NA|61aa|up_0|NZ_CP025958.1_4365051_4365234_+	NA	NA|629aa|down_0|NZ_CP025958.1_4365827_4367714_-	NA	NA|169aa|down_1|NZ_CP025958.1_4367791_4368298_+	NA	NA|159aa|down_2|NZ_CP025958.1_4368310_4368787_+	NA	NA|361aa|down_3|NZ_CP025958.1_4368887_4369970_+	NA	NA|161aa|down_4|NZ_CP025958.1_4369962_4370445_-	NA	NA|181aa|down_5|NZ_CP025958.1_4370462_4371005_-	pfam14373, Imm_superinfect, Superinfection immunity protein	NA|159aa|down_6|NZ_CP025958.1_4371281_4371758_-	NA	NA|150aa|down_7|NZ_CP025958.1_4371777_4372227_-	NA	NA|190aa|down_8|NZ_CP025958.1_4372661_4373231_-	pfam07030, DUF1320, Protein of unknown function (DUF1320)	NA|198aa|down_9|NZ_CP025958.1_4373333_4373927_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	13	4944163-4944479	6	CRT	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	NNNGGCGGGCTGACCGGC	18	2	2	4944181-4944198|4944217-4944234	NZ_CP025958.1_4944157-4944174|NZ_CP025958.1_4944157-4944174	NA	7	7	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|63aa|up_3|NZ_CP025958.1_4922692_4922881_-,NA|115aa|up_2|NZ_CP025958.1_4923328_4923673_-,NA|135aa|up_1|NZ_CP025958.1_4924224_4924629_-,NA|335aa|down_3|NZ_CP025958.1_4954630_4955635_-,NA|191aa|down_4|NZ_CP025958.1_4955665_4956238_-,NA|119aa|down_5|NZ_CP025958.1_4956375_4956732_-,NA|231aa|down_8|NZ_CP025958.1_4959727_4960420_-	NA|619aa|up_9|NZ_CP025958.1_4911532_4913389_+	PRK10150, PRK10150, beta-D-glucuronidase; Provisional	NA|902aa|up_8|NZ_CP025958.1_4913575_4916281_+	cd04950, GT4_TuaH-like, teichuronic acid biosynthesis glycosyltransferase TuaH and similar proteins	NA|133aa|up_7|NZ_CP025958.1_4916481_4916880_+	PRK14559, PRK14559, serine/threonine phosphatase	NA|725aa|up_6|NZ_CP025958.1_4916928_4919103_+	COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane]	NA|186aa|up_5|NZ_CP025958.1_4919226_4919784_-	cd01926, cyclophilin_ABH_like, cyclophilin_ABH_like: Cyclophilin  A, B and H-like cyclophilin-type peptidylprolyl cis- trans isomerase (PPIase) domain	NA|549aa|up_4|NZ_CP025958.1_4920947_4922594_-	pfam10119, MethyTransf_Reg, Predicted methyltransferase regulatory domain	NA|63aa|up_3|NZ_CP025958.1_4922692_4922881_-	NA	NA|115aa|up_2|NZ_CP025958.1_4923328_4923673_-	NA	NA|135aa|up_1|NZ_CP025958.1_4924224_4924629_-	NA	NA|6500aa|up_0|NZ_CP025958.1_4924625_4944125_-	pfam17210, SdrD_B, SdrD B-like domain	NA|1646aa|down_0|NZ_CP025958.1_4945790_4950728_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|368aa|down_1|NZ_CP025958.1_4950766_4951870_-	pfam16261, DUF4915, Domain of unknown function (DUF4915)	NA|694aa|down_2|NZ_CP025958.1_4951940_4954022_+	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|335aa|down_3|NZ_CP025958.1_4954630_4955635_-	NA	NA|191aa|down_4|NZ_CP025958.1_4955665_4956238_-	NA	NA|119aa|down_5|NZ_CP025958.1_4956375_4956732_-	NA	NA|827aa|down_6|NZ_CP025958.1_4956728_4959209_-	pfam12965, DUF3854, Domain of unknown function (DUF3854)	NA|111aa|down_7|NZ_CP025958.1_4959329_4959662_-	pfam07618, DUF1580, Protein of unknown function (DUF1580)	NA|231aa|down_8|NZ_CP025958.1_4959727_4960420_-	NA	NA|885aa|down_9|NZ_CP025958.1_4962632_4965287_-	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	14	5109252-5109366	10	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	CGGGACCACCCCTCCCGGCGCCACCCCCTCCCTTTAGGGA	40	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|285aa|up_9|NZ_CP025958.1_5099008_5099863_+,NA|287aa|up_8|NZ_CP025958.1_5099894_5100755_+,NA|181aa|down_4|NZ_CP025958.1_5113834_5114377_-,NA|48aa|down_6|NZ_CP025958.1_5115951_5116095_-	NA|285aa|up_9|NZ_CP025958.1_5099008_5099863_+	NA	NA|287aa|up_8|NZ_CP025958.1_5099894_5100755_+	NA	NA|383aa|up_7|NZ_CP025958.1_5100905_5102054_+	cd03469, Rieske_RO_Alpha_N, Rieske non-heme iron oxygenase (RO) family, N-terminal Rieske domain of the oxygenase alpha subunit; The RO family comprise a large class of aromatic ring-hydroxylating dioxygenases found predominantly in microorganisms	NA|150aa|up_6|NZ_CP025958.1_5102081_5102531_-	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein	NA|244aa|up_5|NZ_CP025958.1_5102621_5103353_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|206aa|up_4|NZ_CP025958.1_5103601_5104219_-	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|108aa|up_3|NZ_CP025958.1_5104215_5104539_-	pfam06127, DUF962, Protein of unknown function (DUF962)	NA|219aa|up_2|NZ_CP025958.1_5104649_5105306_+	cd00429, RPE, Ribulose-5-phosphate 3-epimerase (RPE)	NA|437aa|up_1|NZ_CP025958.1_5105644_5106955_-	pfam07585, BBP7, Putative beta barrel porin-7 (BBP7)	NA|323aa|up_0|NZ_CP025958.1_5107550_5108519_+	pfam07608, DUF1571, Protein of unknown function (DUF1571)	NA|537aa|down_0|NZ_CP025958.1_5110366_5111977_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|192aa|down_1|NZ_CP025958.1_5111973_5112549_-	cd16433, CheB, Chemotaxis response regulator protein-glutamate methylesterase, CheB	NA|276aa|down_2|NZ_CP025958.1_5112545_5113373_-	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|124aa|down_3|NZ_CP025958.1_5113466_5113838_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|181aa|down_4|NZ_CP025958.1_5113834_5114377_-	NA	NA|441aa|down_5|NZ_CP025958.1_5114603_5115926_+	PRK09757, PRK09757, PTS N-acetylgalactosamine transporter subunit IIC	NA|48aa|down_6|NZ_CP025958.1_5115951_5116095_-	NA	NA|313aa|down_7|NZ_CP025958.1_5116202_5117141_-	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|254aa|down_8|NZ_CP025958.1_5117304_5118066_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|250aa|down_9|NZ_CP025958.1_5118114_5118864_-	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	15	5288200-5288373	8	PILER-CR	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GGCCAGCTCCTTCACCCCCGCGTCCGTCAA	30	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|137aa|up_6|NZ_CP025958.1_5282336_5282747_+,NA|108aa|up_5|NZ_CP025958.1_5282743_5283067_+,NA|722aa|up_3|NZ_CP025958.1_5283786_5285952_+,NA|169aa|up_2|NZ_CP025958.1_5286019_5286526_+,NA|82aa|up_1|NZ_CP025958.1_5287060_5287306_+,NA|182aa|up_0|NZ_CP025958.1_5287361_5287907_+,NA|386aa|down_1|NZ_CP025958.1_5290364_5291522_-,NA|127aa|down_2|NZ_CP025958.1_5291610_5291991_-,NA|388aa|down_5|NZ_CP025958.1_5295911_5297075_+,NA|188aa|down_6|NZ_CP025958.1_5297205_5297769_-,NA|442aa|down_7|NZ_CP025958.1_5297841_5299167_-	NA|318aa|up_9|NZ_CP025958.1_5279602_5280556_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|312aa|up_8|NZ_CP025958.1_5280633_5281569_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|100aa|up_7|NZ_CP025958.1_5281812_5282112_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|137aa|up_6|NZ_CP025958.1_5282336_5282747_+	NA	NA|108aa|up_5|NZ_CP025958.1_5282743_5283067_+	NA	NA|67aa|up_4|NZ_CP025958.1_5283252_5283453_+	pfam07618, DUF1580, Protein of unknown function (DUF1580)	NA|722aa|up_3|NZ_CP025958.1_5283786_5285952_+	NA	NA|169aa|up_2|NZ_CP025958.1_5286019_5286526_+	NA	NA|82aa|up_1|NZ_CP025958.1_5287060_5287306_+	NA	NA|182aa|up_0|NZ_CP025958.1_5287361_5287907_+	NA	NA|304aa|down_0|NZ_CP025958.1_5289502_5290414_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|386aa|down_1|NZ_CP025958.1_5290364_5291522_-	NA	NA|127aa|down_2|NZ_CP025958.1_5291610_5291991_-	NA	NA|476aa|down_3|NZ_CP025958.1_5292645_5294073_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|497aa|down_4|NZ_CP025958.1_5294088_5295579_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|388aa|down_5|NZ_CP025958.1_5295911_5297075_+	NA	NA|188aa|down_6|NZ_CP025958.1_5297205_5297769_-	NA	NA|442aa|down_7|NZ_CP025958.1_5297841_5299167_-	NA	NA|298aa|down_8|NZ_CP025958.1_5299382_5300276_+	pfam13360, PQQ_2, PQQ-like domain	NA|470aa|down_9|NZ_CP025958.1_5300311_5301721_-	cd13148, MATE_like_3, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	16	5288546-5288952	9	PILER-CR	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	AGGGCCTTGAGCCCGGCCAGCTCCTTCACCCCCGCGTCCGTCAC	44	0	0	NA	NA	NA	5	5	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|137aa|up_6|NZ_CP025958.1_5282336_5282747_+,NA|108aa|up_5|NZ_CP025958.1_5282743_5283067_+,NA|722aa|up_3|NZ_CP025958.1_5283786_5285952_+,NA|169aa|up_2|NZ_CP025958.1_5286019_5286526_+,NA|82aa|up_1|NZ_CP025958.1_5287060_5287306_+,NA|182aa|up_0|NZ_CP025958.1_5287361_5287907_+,NA|386aa|down_1|NZ_CP025958.1_5290364_5291522_-,NA|127aa|down_2|NZ_CP025958.1_5291610_5291991_-,NA|388aa|down_5|NZ_CP025958.1_5295911_5297075_+,NA|188aa|down_6|NZ_CP025958.1_5297205_5297769_-,NA|442aa|down_7|NZ_CP025958.1_5297841_5299167_-	NA|318aa|up_9|NZ_CP025958.1_5279602_5280556_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|312aa|up_8|NZ_CP025958.1_5280633_5281569_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|100aa|up_7|NZ_CP025958.1_5281812_5282112_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|137aa|up_6|NZ_CP025958.1_5282336_5282747_+	NA	NA|108aa|up_5|NZ_CP025958.1_5282743_5283067_+	NA	NA|67aa|up_4|NZ_CP025958.1_5283252_5283453_+	pfam07618, DUF1580, Protein of unknown function (DUF1580)	NA|722aa|up_3|NZ_CP025958.1_5283786_5285952_+	NA	NA|169aa|up_2|NZ_CP025958.1_5286019_5286526_+	NA	NA|82aa|up_1|NZ_CP025958.1_5287060_5287306_+	NA	NA|182aa|up_0|NZ_CP025958.1_5287361_5287907_+	NA	NA|304aa|down_0|NZ_CP025958.1_5289502_5290414_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|386aa|down_1|NZ_CP025958.1_5290364_5291522_-	NA	NA|127aa|down_2|NZ_CP025958.1_5291610_5291991_-	NA	NA|476aa|down_3|NZ_CP025958.1_5292645_5294073_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|497aa|down_4|NZ_CP025958.1_5294088_5295579_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|388aa|down_5|NZ_CP025958.1_5295911_5297075_+	NA	NA|188aa|down_6|NZ_CP025958.1_5297205_5297769_-	NA	NA|442aa|down_7|NZ_CP025958.1_5297841_5299167_-	NA	NA|298aa|down_8|NZ_CP025958.1_5299382_5300276_+	pfam13360, PQQ_2, PQQ-like domain	NA|470aa|down_9|NZ_CP025958.1_5300311_5301721_-	cd13148, MATE_like_3, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	17	5289538-5290357	7	CRT	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	CCAGNTCCTTCAGCCCNGCGTCCGTCAC	28	0	0	NA	NA	NA	11	11	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|137aa|up_7|NZ_CP025958.1_5282336_5282747_+,NA|108aa|up_6|NZ_CP025958.1_5282743_5283067_+,NA|722aa|up_4|NZ_CP025958.1_5283786_5285952_+,NA|169aa|up_3|NZ_CP025958.1_5286019_5286526_+,NA|82aa|up_2|NZ_CP025958.1_5287060_5287306_+,NA|182aa|up_1|NZ_CP025958.1_5287361_5287907_+,NA|386aa|down_0|NZ_CP025958.1_5290364_5291522_-,NA|127aa|down_1|NZ_CP025958.1_5291610_5291991_-,NA|388aa|down_4|NZ_CP025958.1_5295911_5297075_+,NA|188aa|down_5|NZ_CP025958.1_5297205_5297769_-,NA|442aa|down_6|NZ_CP025958.1_5297841_5299167_-,NA|386aa|down_9|NZ_CP025958.1_5302037_5303195_-	NA|312aa|up_9|NZ_CP025958.1_5280633_5281569_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|100aa|up_8|NZ_CP025958.1_5281812_5282112_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|137aa|up_7|NZ_CP025958.1_5282336_5282747_+	NA	NA|108aa|up_6|NZ_CP025958.1_5282743_5283067_+	NA	NA|67aa|up_5|NZ_CP025958.1_5283252_5283453_+	pfam07618, DUF1580, Protein of unknown function (DUF1580)	NA|722aa|up_4|NZ_CP025958.1_5283786_5285952_+	NA	NA|169aa|up_3|NZ_CP025958.1_5286019_5286526_+	NA	NA|82aa|up_2|NZ_CP025958.1_5287060_5287306_+	NA	NA|182aa|up_1|NZ_CP025958.1_5287361_5287907_+	NA	NA|418aa|up_0|NZ_CP025958.1_5288094_5289348_-	sd00033, LRR_RI, leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily	NA|386aa|down_0|NZ_CP025958.1_5290364_5291522_-	NA	NA|127aa|down_1|NZ_CP025958.1_5291610_5291991_-	NA	NA|476aa|down_2|NZ_CP025958.1_5292645_5294073_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|497aa|down_3|NZ_CP025958.1_5294088_5295579_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|388aa|down_4|NZ_CP025958.1_5295911_5297075_+	NA	NA|188aa|down_5|NZ_CP025958.1_5297205_5297769_-	NA	NA|442aa|down_6|NZ_CP025958.1_5297841_5299167_-	NA	NA|298aa|down_7|NZ_CP025958.1_5299382_5300276_+	pfam13360, PQQ_2, PQQ-like domain	NA|470aa|down_8|NZ_CP025958.1_5300311_5301721_-	cd13148, MATE_like_3, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|386aa|down_9|NZ_CP025958.1_5302037_5303195_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	18	5413267-5413382	11	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	ACTGGTCAGCCCGCGGGGCGGGTCTGATGCGC	32	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|76aa|up_9|NZ_CP025958.1_5400328_5400556_+,NA|263aa|down_3|NZ_CP025958.1_5424467_5425256_-,NA|115aa|down_4|NZ_CP025958.1_5425563_5425908_+,NA|128aa|down_6|NZ_CP025958.1_5427732_5428116_-,NA|143aa|down_9|NZ_CP025958.1_5430852_5431281_-	NA|76aa|up_9|NZ_CP025958.1_5400328_5400556_+	NA	NA|168aa|up_8|NZ_CP025958.1_5400651_5401155_-	pfam02469, Fasciclin, Fasciclin domain	NA|405aa|up_7|NZ_CP025958.1_5401261_5402476_+	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|178aa|up_6|NZ_CP025958.1_5402620_5403154_+	PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional	NA|692aa|up_5|NZ_CP025958.1_5403804_5405880_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|357aa|up_4|NZ_CP025958.1_5405916_5406987_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|474aa|up_3|NZ_CP025958.1_5407149_5408571_-	pfam05221, AdoHcyase, S-adenosyl-L-homocysteine hydrolase	NA|525aa|up_2|NZ_CP025958.1_5408863_5410438_-	TIGR00591, Deoxyribodipyrimidine_photo-lyase, photolyase PhrII	NA|324aa|up_1|NZ_CP025958.1_5410550_5411522_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|540aa|up_0|NZ_CP025958.1_5411525_5413145_+	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|265aa|down_0|NZ_CP025958.1_5413423_5414218_-	COG3971, COG3971, 2-keto-4-pentenoate hydratase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|208aa|down_1|NZ_CP025958.1_5414395_5415019_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|2975aa|down_2|NZ_CP025958.1_5415015_5423940_+	TIGR03788, marine_srt_targ, marine proteobacterial sortase target protein	NA|263aa|down_3|NZ_CP025958.1_5424467_5425256_-	NA	NA|115aa|down_4|NZ_CP025958.1_5425563_5425908_+	NA	NA|512aa|down_5|NZ_CP025958.1_5426124_5427660_+	COG1070, XylB, Sugar (pentulose and hexulose) kinases [Carbohydrate transport and metabolism]	NA|128aa|down_6|NZ_CP025958.1_5427732_5428116_-	NA	NA|162aa|down_7|NZ_CP025958.1_5428246_5428732_-	cd03330, Macro_Ttha0132-like, Macrodomain, uncharacterized family similar to Thermus thermophilus hypothetical protein Ttha0132	NA|589aa|down_8|NZ_CP025958.1_5428796_5430563_-	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|143aa|down_9|NZ_CP025958.1_5430852_5431281_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	19	5475761-5475869	12	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TTCCTCCGGTGCCACCTCCTTCGCCTTCCAG	31	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|194aa|up_4|NZ_CP025958.1_5463561_5464143_-,NA|51aa|up_1|NZ_CP025958.1_5470480_5470633_-,NA|212aa|up_0|NZ_CP025958.1_5470899_5471535_-,NA|324aa|down_5|NZ_CP025958.1_5482399_5483371_-,NA|197aa|down_9|NZ_CP025958.1_5490677_5491268_-	NA|117aa|up_9|NZ_CP025958.1_5457377_5457728_+	pfam12680, SnoaL_2, SnoaL-like domain	NA|291aa|up_8|NZ_CP025958.1_5458856_5459729_-	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|506aa|up_7|NZ_CP025958.1_5459850_5461368_-	pfam07585, BBP7, Putative beta barrel porin-7 (BBP7)	NA|255aa|up_6|NZ_CP025958.1_5461740_5462505_+	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|305aa|up_5|NZ_CP025958.1_5462509_5463424_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|194aa|up_4|NZ_CP025958.1_5463561_5464143_-	NA	NA|481aa|up_3|NZ_CP025958.1_5464194_5465637_+	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]	NA|1140aa|up_2|NZ_CP025958.1_5466041_5469461_+	pfam17957, Big_7, Bacterial Ig domain	NA|51aa|up_1|NZ_CP025958.1_5470480_5470633_-	NA	NA|212aa|up_0|NZ_CP025958.1_5470899_5471535_-	NA	NA|389aa|down_0|NZ_CP025958.1_5476937_5478104_-	COG0635, HemN, Coproporphyrinogen III oxidase and related Fe-S oxidoreductases [Coenzyme metabolism]	NA|410aa|down_1|NZ_CP025958.1_5478213_5479443_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|494aa|down_2|NZ_CP025958.1_5479372_5480854_-	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|243aa|down_3|NZ_CP025958.1_5480999_5481728_-	pfam10099, RskA, Anti-sigma-K factor rskA	NA|180aa|down_4|NZ_CP025958.1_5481724_5482264_-	PRK12519, PRK12519, RNA polymerase sigma factor; Provisional	NA|324aa|down_5|NZ_CP025958.1_5482399_5483371_-	NA	NA|541aa|down_6|NZ_CP025958.1_5483795_5485418_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|578aa|down_7|NZ_CP025958.1_5485624_5487358_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|976aa|down_8|NZ_CP025958.1_5487432_5490360_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|197aa|down_9|NZ_CP025958.1_5490677_5491268_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	20	5492986-5493058	13	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TTAGCCACAAAAAAGCACAAAGG	23	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|324aa|up_5|NZ_CP025958.1_5482399_5483371_-,NA|197aa|up_1|NZ_CP025958.1_5490677_5491268_-,NA|213aa|down_3|NZ_CP025958.1_5499368_5500007_+,NA|445aa|down_5|NZ_CP025958.1_5502010_5503345_+,NA|109aa|down_8|NZ_CP025958.1_5507058_5507385_+,NA|140aa|down_9|NZ_CP025958.1_5507491_5507911_+	NA|410aa|up_9|NZ_CP025958.1_5478213_5479443_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|494aa|up_8|NZ_CP025958.1_5479372_5480854_-	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|243aa|up_7|NZ_CP025958.1_5480999_5481728_-	pfam10099, RskA, Anti-sigma-K factor rskA	NA|180aa|up_6|NZ_CP025958.1_5481724_5482264_-	PRK12519, PRK12519, RNA polymerase sigma factor; Provisional	NA|324aa|up_5|NZ_CP025958.1_5482399_5483371_-	NA	NA|541aa|up_4|NZ_CP025958.1_5483795_5485418_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|578aa|up_3|NZ_CP025958.1_5485624_5487358_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|976aa|up_2|NZ_CP025958.1_5487432_5490360_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|197aa|up_1|NZ_CP025958.1_5490677_5491268_-	NA	NA|469aa|up_0|NZ_CP025958.1_5491568_5492975_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|286aa|down_0|NZ_CP025958.1_5493162_5494020_-	TIGR03000, plancto_dom_1, Planctomycetes uncharacterized domain TIGR03000	NA|1412aa|down_1|NZ_CP025958.1_5494099_5498335_-	pfam13490, zf-HC2, Putative zinc-finger	NA|210aa|down_2|NZ_CP025958.1_5498441_5499071_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|213aa|down_3|NZ_CP025958.1_5499368_5500007_+	NA	NA|446aa|down_4|NZ_CP025958.1_5500098_5501436_+	COG3014, COG3014, Uncharacterized protein conserved in bacteria [Function unknown]	NA|445aa|down_5|NZ_CP025958.1_5502010_5503345_+	NA	NA|305aa|down_6|NZ_CP025958.1_5503921_5504836_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|355aa|down_7|NZ_CP025958.1_5505954_5507019_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|109aa|down_8|NZ_CP025958.1_5507058_5507385_+	NA	NA|140aa|down_9|NZ_CP025958.1_5507491_5507911_+	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	21	5503360-5503468	14	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TGGGGTAGCTCAGTGGTAGAGCGGCCGGCTCT	32	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|197aa|up_7|NZ_CP025958.1_5490677_5491268_-,NA|213aa|up_2|NZ_CP025958.1_5499368_5500007_+,NA|445aa|up_0|NZ_CP025958.1_5502010_5503345_+,NA|109aa|down_2|NZ_CP025958.1_5507058_5507385_+,NA|140aa|down_3|NZ_CP025958.1_5507491_5507911_+,NA|162aa|down_6|NZ_CP025958.1_5510419_5510905_+,NA|153aa|down_7|NZ_CP025958.1_5511132_5511591_+	NA|578aa|up_9|NZ_CP025958.1_5485624_5487358_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|976aa|up_8|NZ_CP025958.1_5487432_5490360_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|197aa|up_7|NZ_CP025958.1_5490677_5491268_-	NA	NA|469aa|up_6|NZ_CP025958.1_5491568_5492975_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|286aa|up_5|NZ_CP025958.1_5493162_5494020_-	TIGR03000, plancto_dom_1, Planctomycetes uncharacterized domain TIGR03000	NA|1412aa|up_4|NZ_CP025958.1_5494099_5498335_-	pfam13490, zf-HC2, Putative zinc-finger	NA|210aa|up_3|NZ_CP025958.1_5498441_5499071_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|213aa|up_2|NZ_CP025958.1_5499368_5500007_+	NA	NA|446aa|up_1|NZ_CP025958.1_5500098_5501436_+	COG3014, COG3014, Uncharacterized protein conserved in bacteria [Function unknown]	NA|445aa|up_0|NZ_CP025958.1_5502010_5503345_+	NA	NA|305aa|down_0|NZ_CP025958.1_5503921_5504836_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|355aa|down_1|NZ_CP025958.1_5505954_5507019_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|109aa|down_2|NZ_CP025958.1_5507058_5507385_+	NA	NA|140aa|down_3|NZ_CP025958.1_5507491_5507911_+	NA	NA|368aa|down_4|NZ_CP025958.1_5508219_5509323_-	PRK00059, prsA, peptidylprolyl isomerase; Provisional	NA|74aa|down_5|NZ_CP025958.1_5509535_5509757_+	pfam01165, Ribosomal_S21, Ribosomal protein S21	NA|162aa|down_6|NZ_CP025958.1_5510419_5510905_+	NA	NA|153aa|down_7|NZ_CP025958.1_5511132_5511591_+	NA	NA|148aa|down_8|NZ_CP025958.1_5511640_5512084_+	pfam00582, Usp, Universal stress protein family	NA|329aa|down_9|NZ_CP025958.1_5512567_5513554_-	pfam02457, DisA_N, DisA bacterial checkpoint controller nucleotide-binding
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	22	5982112-5982359	15	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GCGCTCGCCGATTACAACGAAGCGCTCCGCCTCGACCCCAAGCA	44	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|124aa|up_9|NZ_CP025958.1_5967359_5967731_-,NA|160aa|down_0|NZ_CP025958.1_5983510_5983990_-,NA|85aa|down_2|NZ_CP025958.1_5984666_5984921_+,NA|225aa|down_5|NZ_CP025958.1_5988848_5989523_-	NA|124aa|up_9|NZ_CP025958.1_5967359_5967731_-	NA	NA|285aa|up_8|NZ_CP025958.1_5967740_5968595_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|1002aa|up_7|NZ_CP025958.1_5968690_5971696_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|570aa|up_6|NZ_CP025958.1_5971837_5973547_+	COG0405, Ggt, Gamma-glutamyltransferase [Amino acid transport and metabolism]	NA|499aa|up_5|NZ_CP025958.1_5973569_5975066_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|404aa|up_4|NZ_CP025958.1_5975097_5976309_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|234aa|up_3|NZ_CP025958.1_5976538_5977240_-	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|342aa|up_2|NZ_CP025958.1_5977289_5978315_-	TIGR00175, Isocitrate_dehydrogenase_subunit_1_mitochondrial, isocitrate dehydrogenase, NAD-dependent, mitochondrial type	NA|261aa|up_1|NZ_CP025958.1_5978565_5979348_-	TIGR03036, trp_2_3_diox, tryptophan 2,3-dioxygenase	NA|701aa|up_0|NZ_CP025958.1_5979403_5981506_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|160aa|down_0|NZ_CP025958.1_5983510_5983990_-	NA	NA|95aa|down_1|NZ_CP025958.1_5984385_5984670_+	COG3609, COG3609, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain [Transcription]	NA|85aa|down_2|NZ_CP025958.1_5984666_5984921_+	NA	NA|339aa|down_3|NZ_CP025958.1_5985054_5986071_-	COG0473, LeuB, Isocitrate/isopropylmalate dehydrogenase [Amino acid transport and metabolism]	NA|742aa|down_4|NZ_CP025958.1_5986381_5988607_+	pfam13717, zinc_ribbon_4, zinc-ribbon domain	NA|225aa|down_5|NZ_CP025958.1_5988848_5989523_-	NA	NA|266aa|down_6|NZ_CP025958.1_5989548_5990346_-	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|338aa|down_7|NZ_CP025958.1_5990345_5991359_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|149aa|down_8|NZ_CP025958.1_5991522_5991969_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|462aa|down_9|NZ_CP025958.1_5992169_5993555_+	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	23	6425277-6425372	16	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GCTGTAACGGGGGTTACAGCACC	23	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|255aa|up_3|NZ_CP025958.1_6418111_6418876_+,NA|65aa|up_1|NZ_CP025958.1_6424244_6424439_-,NA|110aa|down_8|NZ_CP025958.1_6446210_6446540_+,NA|210aa|down_9|NZ_CP025958.1_6446572_6447202_-	NA|191aa|up_9|NZ_CP025958.1_6410292_6410865_+	TIGR02052, periplasmic_mercuric_ion_binding_protein, mercuric transport protein periplasmic component	NA|80aa|up_8|NZ_CP025958.1_6411155_6411395_-	pfam09413, DUF2007, Putative prokaryotic signal transducing protein	NA|452aa|up_7|NZ_CP025958.1_6411818_6413174_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|464aa|up_6|NZ_CP025958.1_6413170_6414562_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|606aa|up_5|NZ_CP025958.1_6414785_6416603_+	PRK08292, PRK08292, AMP nucleosidase; Provisional	NA|301aa|up_4|NZ_CP025958.1_6416682_6417585_-	cd10941, CE4_PuuE_HpPgdA_like_2, Putative catalytic domain of uncharacterized prokaryotic polysaccharide deacetylases similar to bacterial PuuE allantoinases and Helicobacter pylori peptidoglycan deacetylase (HpPgdA)	NA|255aa|up_3|NZ_CP025958.1_6418111_6418876_+	NA	NA|1471aa|up_2|NZ_CP025958.1_6419028_6423441_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|65aa|up_1|NZ_CP025958.1_6424244_6424439_-	NA	NA|123aa|up_0|NZ_CP025958.1_6424549_6424918_-	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|536aa|down_0|NZ_CP025958.1_6425831_6427439_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|485aa|down_1|NZ_CP025958.1_6427649_6429104_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|484aa|down_2|NZ_CP025958.1_6429122_6430574_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|1031aa|down_3|NZ_CP025958.1_6430924_6434017_+	pfam10070, DUF2309, Uncharacterized protein conserved in bacteria (DUF2309)	NA|494aa|down_4|NZ_CP025958.1_6434009_6435491_+	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]	NA|1289aa|down_5|NZ_CP025958.1_6438316_6442183_+	PRK02224, PRK02224, DNA double-strand break repair Rad50 ATPase	NA|848aa|down_6|NZ_CP025958.1_6442404_6444948_-	COG0178, UvrA, Excinuclease ATPase subunit [DNA replication, recombination, and repair]	NA|123aa|down_7|NZ_CP025958.1_6445474_6445843_+	TIGR03066, Gem_osc_para_1, Gemmata obscuriglobus paralogous family TIGR03066	NA|110aa|down_8|NZ_CP025958.1_6446210_6446540_+	NA	NA|210aa|down_9|NZ_CP025958.1_6446572_6447202_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	24	6579740-6579859	17	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TGGGGGTGGGTTGTCTTCGGTTTTGCTCCCC	31	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|126aa|up_9|NZ_CP025958.1_6573259_6573637_-,NA|168aa|up_8|NZ_CP025958.1_6573644_6574148_-,NA|127aa|up_7|NZ_CP025958.1_6574150_6574531_-,NA|154aa|up_6|NZ_CP025958.1_6574532_6574994_-,NA|116aa|up_5|NZ_CP025958.1_6574998_6575346_-,NA|76aa|up_4|NZ_CP025958.1_6575348_6575576_-,NA|97aa|up_3|NZ_CP025958.1_6575681_6575972_-,NA|402aa|up_2|NZ_CP025958.1_6576012_6577218_-,NA|238aa|up_1|NZ_CP025958.1_6577455_6578169_-,NA|116aa|down_1|NZ_CP025958.1_6582291_6582639_+,NA|141aa|down_2|NZ_CP025958.1_6582702_6583125_+,NA|137aa|down_3|NZ_CP025958.1_6583147_6583558_+,NA|129aa|down_4|NZ_CP025958.1_6583554_6583941_+,NA|184aa|down_7|NZ_CP025958.1_6586206_6586758_-,NA|296aa|down_8|NZ_CP025958.1_6586699_6587587_-,NA|221aa|down_9|NZ_CP025958.1_6587583_6588246_-	NA|126aa|up_9|NZ_CP025958.1_6573259_6573637_-	NA	NA|168aa|up_8|NZ_CP025958.1_6573644_6574148_-	NA	NA|127aa|up_7|NZ_CP025958.1_6574150_6574531_-	NA	NA|154aa|up_6|NZ_CP025958.1_6574532_6574994_-	NA	NA|116aa|up_5|NZ_CP025958.1_6574998_6575346_-	NA	NA|76aa|up_4|NZ_CP025958.1_6575348_6575576_-	NA	NA|97aa|up_3|NZ_CP025958.1_6575681_6575972_-	NA	NA|402aa|up_2|NZ_CP025958.1_6576012_6577218_-	NA	NA|238aa|up_1|NZ_CP025958.1_6577455_6578169_-	NA	NA|383aa|up_0|NZ_CP025958.1_6578231_6579380_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|724aa|down_0|NZ_CP025958.1_6580016_6582188_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|116aa|down_1|NZ_CP025958.1_6582291_6582639_+	NA	NA|141aa|down_2|NZ_CP025958.1_6582702_6583125_+	NA	NA|137aa|down_3|NZ_CP025958.1_6583147_6583558_+	NA	NA|129aa|down_4|NZ_CP025958.1_6583554_6583941_+	NA	NA|152aa|down_5|NZ_CP025958.1_6583954_6584410_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|198aa|down_6|NZ_CP025958.1_6585465_6586059_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|184aa|down_7|NZ_CP025958.1_6586206_6586758_-	NA	NA|296aa|down_8|NZ_CP025958.1_6586699_6587587_-	NA	NA|221aa|down_9|NZ_CP025958.1_6587583_6588246_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	25	6761998-6762101	18	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GCGTCCGTGCGCTCCGGTCGAACCGCCG	28	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|75aa|up_8|NZ_CP025958.1_6749217_6749442_-,NA|438aa|up_6|NZ_CP025958.1_6751397_6752711_-,NA|92aa|down_0|NZ_CP025958.1_6762350_6762626_-,NA|196aa|down_1|NZ_CP025958.1_6762988_6763576_+,NA|121aa|down_2|NZ_CP025958.1_6763735_6764098_-,NA|66aa|down_4|NZ_CP025958.1_6765191_6765389_-	NA|632aa|up_9|NZ_CP025958.1_6747195_6749091_+	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|75aa|up_8|NZ_CP025958.1_6749217_6749442_-	NA	NA|294aa|up_7|NZ_CP025958.1_6750299_6751181_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|438aa|up_6|NZ_CP025958.1_6751397_6752711_-	NA	NA|332aa|up_5|NZ_CP025958.1_6754320_6755316_-	cd05282, ETR_like, 2-enoyl thioester reductase-like	NA|479aa|up_4|NZ_CP025958.1_6755397_6756834_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|295aa|up_3|NZ_CP025958.1_6757114_6757999_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|614aa|up_2|NZ_CP025958.1_6758168_6760010_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|258aa|up_1|NZ_CP025958.1_6760282_6761056_+	COG1691, COG1691, NCAIR mutase (PurE)-related proteins [General function prediction only]	NA|225aa|up_0|NZ_CP025958.1_6761098_6761773_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|92aa|down_0|NZ_CP025958.1_6762350_6762626_-	NA	NA|196aa|down_1|NZ_CP025958.1_6762988_6763576_+	NA	NA|121aa|down_2|NZ_CP025958.1_6763735_6764098_-	NA	NA|341aa|down_3|NZ_CP025958.1_6764157_6765180_-	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|66aa|down_4|NZ_CP025958.1_6765191_6765389_-	NA	NA|212aa|down_5|NZ_CP025958.1_6765509_6766145_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|414aa|down_6|NZ_CP025958.1_6766470_6767712_-	pfam05150, Legionella_OMP, Legionella pneumophila major outer membrane protein precursor	NA|77aa|down_7|NZ_CP025958.1_6768204_6768435_+	COG2336, MazE, Growth regulator [Signal transduction mechanisms]	NA|124aa|down_8|NZ_CP025958.1_6768431_6768803_+	PRK09907, PRK09907, endoribonuclease MazF	NA|637aa|down_9|NZ_CP025958.1_6769094_6771005_-	cd07302, CHD, cyclase homology domain
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	26	6785152-6785261	19	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TCTTCCTCCCCCTCCCTTCAGGGAGGGGGCCGGGGG	36	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA,NA|66aa|down_5|NZ_CP025958.1_6794076_6794274_+	NA|414aa|up_9|NZ_CP025958.1_6766470_6767712_-	pfam05150, Legionella_OMP, Legionella pneumophila major outer membrane protein precursor	NA|77aa|up_8|NZ_CP025958.1_6768204_6768435_+	COG2336, MazE, Growth regulator [Signal transduction mechanisms]	NA|124aa|up_7|NZ_CP025958.1_6768431_6768803_+	PRK09907, PRK09907, endoribonuclease MazF	NA|637aa|up_6|NZ_CP025958.1_6769094_6771005_-	cd07302, CHD, cyclase homology domain	NA|784aa|up_5|NZ_CP025958.1_6771918_6774270_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|466aa|up_4|NZ_CP025958.1_6774282_6775680_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|440aa|up_3|NZ_CP025958.1_6776160_6777480_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|826aa|up_2|NZ_CP025958.1_6777727_6780205_+	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|879aa|up_1|NZ_CP025958.1_6780494_6783131_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|484aa|up_0|NZ_CP025958.1_6783474_6784926_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|751aa|down_0|NZ_CP025958.1_6785338_6787591_-	pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal region	NA|299aa|down_1|NZ_CP025958.1_6787742_6788639_+	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|313aa|down_2|NZ_CP025958.1_6789343_6790282_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|631aa|down_3|NZ_CP025958.1_6790901_6792794_+	COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism]	NA|178aa|down_4|NZ_CP025958.1_6793016_6793550_-	pfam07238, PilZ, PilZ domain	NA|66aa|down_5|NZ_CP025958.1_6794076_6794274_+	NA	NA|471aa|down_6|NZ_CP025958.1_6794368_6795781_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|406aa|down_7|NZ_CP025958.1_6795770_6796988_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|406aa|down_8|NZ_CP025958.1_6797272_6798490_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|393aa|down_9|NZ_CP025958.1_6798815_6799994_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	27	6964317-6964535	8	CRT	no	csa3	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Type I-A	CGNNATCCCGTACNNCAC	18	0	0	NA	NA	NA	4	4	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|285aa|up_0|NZ_CP025958.1_6962468_6963323_-,NA|377aa|down_1|NZ_CP025958.1_6967431_6968562_-,NA|211aa|down_9|NZ_CP025958.1_6978298_6978931_-	NA|237aa|up_9|NZ_CP025958.1_6951149_6951860_-	pfam12975, DUF3859, Domain of unknown function (DUF3859)	NA|554aa|up_8|NZ_CP025958.1_6952971_6954633_-	TIGR03385, Coenzyme_A_disulfide_reductase, CoA-disulfide reductase	NA|134aa|up_7|NZ_CP025958.1_6954853_6955255_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|194aa|up_6|NZ_CP025958.1_6955345_6955927_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	csa3|104aa|up_5|NZ_CP025958.1_6955983_6956295_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|445aa|up_4|NZ_CP025958.1_6957040_6958375_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|495aa|up_3|NZ_CP025958.1_6958482_6959967_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|162aa|up_2|NZ_CP025958.1_6960190_6960676_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|355aa|up_1|NZ_CP025958.1_6960885_6961950_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|285aa|up_0|NZ_CP025958.1_6962468_6963323_-	NA	NA|297aa|down_0|NZ_CP025958.1_6966526_6967417_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|377aa|down_1|NZ_CP025958.1_6967431_6968562_-	NA	NA|401aa|down_2|NZ_CP025958.1_6968558_6969761_-	pfam07611, DUF1574, Protein of unknown function (DUF1574)	NA|377aa|down_3|NZ_CP025958.1_6969750_6970881_-	pfam07611, DUF1574, Protein of unknown function (DUF1574)	NA|377aa|down_4|NZ_CP025958.1_6970877_6972008_-	pfam07611, DUF1574, Protein of unknown function (DUF1574)	NA|512aa|down_5|NZ_CP025958.1_6972024_6973560_-	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|358aa|down_6|NZ_CP025958.1_6973947_6975021_+	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional	NA|300aa|down_7|NZ_CP025958.1_6975418_6976318_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|370aa|down_8|NZ_CP025958.1_6977084_6978194_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|211aa|down_9|NZ_CP025958.1_6978298_6978931_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	28	7545971-7546081	20	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	CCCTGTTACTGACTTGGGACTTGCGACCTGACGACCTGGGAC	42	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|62aa|up_8|NZ_CP025958.1_7536477_7536663_-,NA|90aa|up_7|NZ_CP025958.1_7536766_7537036_-,NA|82aa|up_4|NZ_CP025958.1_7539138_7539384_-,NA|334aa|up_3|NZ_CP025958.1_7539730_7540732_-,NA	NA|276aa|up_9|NZ_CP025958.1_7535641_7536469_+	pfam10108, DNA_pol_B_exo2, Predicted 3'-5' exonuclease related to the exonuclease domain of PolB	NA|62aa|up_8|NZ_CP025958.1_7536477_7536663_-	NA	NA|90aa|up_7|NZ_CP025958.1_7536766_7537036_-	NA	NA|284aa|up_6|NZ_CP025958.1_7537138_7537990_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|336aa|up_5|NZ_CP025958.1_7537991_7538999_-	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|82aa|up_4|NZ_CP025958.1_7539138_7539384_-	NA	NA|334aa|up_3|NZ_CP025958.1_7539730_7540732_-	NA	NA|117aa|up_2|NZ_CP025958.1_7540728_7541079_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|248aa|up_1|NZ_CP025958.1_7541301_7542045_-	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|123aa|up_0|NZ_CP025958.1_7545593_7545962_-	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|213aa|down_0|NZ_CP025958.1_7547531_7548170_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|326aa|down_1|NZ_CP025958.1_7548405_7549383_+	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|591aa|down_2|NZ_CP025958.1_7549860_7551633_+	pfam07690, MFS_1, Major Facilitator Superfamily	NA|799aa|down_3|NZ_CP025958.1_7551792_7554189_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|210aa|down_4|NZ_CP025958.1_7554225_7554855_-	TIGR02984, Sig-70_plancto1, RNA polymerase sigma-70 factor, Planctomycetaceae-specific subfamily 1	NA|268aa|down_5|NZ_CP025958.1_7555049_7555853_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|783aa|down_6|NZ_CP025958.1_7556002_7558351_-	cd14840, D-Ala-D-Ala_dipeptidase_Aad, D-Ala-D-Ala dipeptidase (includes Lactobacillus plantarum Aad peptidase)	NA|838aa|down_7|NZ_CP025958.1_7558455_7560969_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|134aa|down_8|NZ_CP025958.1_7560980_7561382_+	COG1459, PulF, Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion]	NA|476aa|down_9|NZ_CP025958.1_7561404_7562832_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	29	8272817-8272929	21	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GCGGCGGACGGGACGTCCGCGGTCCCAGGGAACAGCCCACCG	42	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|73aa|up_2|NZ_CP025958.1_8269836_8270055_+,NA|81aa|up_1|NZ_CP025958.1_8270245_8270488_-,NA	NA|230aa|up_9|NZ_CP025958.1_8265411_8266101_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|194aa|up_8|NZ_CP025958.1_8266437_8267019_-	cd07909, YciF, YciF bacterial stress response protein, ferritin-like iron-binding domain	NA|201aa|up_7|NZ_CP025958.1_8267098_8267701_-	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|169aa|up_6|NZ_CP025958.1_8267715_8268222_-	COG4244, COG4244, Predicted membrane protein [Function unknown]	NA|166aa|up_5|NZ_CP025958.1_8268325_8268823_-	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|132aa|up_4|NZ_CP025958.1_8268961_8269357_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|65aa|up_3|NZ_CP025958.1_8269687_8269882_+	TIGR03885, putative_dehydrogenase_protein, probable non-F420 flavinoid oxidoreductase	NA|73aa|up_2|NZ_CP025958.1_8269836_8270055_+	NA	NA|81aa|up_1|NZ_CP025958.1_8270245_8270488_-	NA	NA|573aa|up_0|NZ_CP025958.1_8270998_8272717_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|285aa|down_0|NZ_CP025958.1_8273148_8274003_+	pfam04972, BON, BON domain	NA|388aa|down_1|NZ_CP025958.1_8274509_8275673_-	COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown]	NA|146aa|down_2|NZ_CP025958.1_8275679_8276117_-	pfam16371, MetallophosN, N terminal of Calcineurin-like phosphoesterase	NA|324aa|down_3|NZ_CP025958.1_8276184_8277156_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|168aa|down_4|NZ_CP025958.1_8277536_8278040_-	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]	NA|60aa|down_5|NZ_CP025958.1_8278310_8278490_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|306aa|down_6|NZ_CP025958.1_8278526_8279444_-	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|90aa|down_7|NZ_CP025958.1_8280242_8280512_+	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|750aa|down_8|NZ_CP025958.1_8280635_8282885_+	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|112aa|down_9|NZ_CP025958.1_8283179_8283515_+	cd00552, RaiA, RaiA ("ribosome-associated inhibitor A", also known as Protein Y (PY), YfiA, and SpotY,  is a stress-response protein that binds the ribosomal subunit interface and arrests translation by interfering with aminoacyl-tRNA binding to the ribosomal A site
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	30	8422568-8422674	22	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TTGGGCGCTTCGGCGACGACCGCCGG	26	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA,NA|493aa|down_1|NZ_CP025958.1_8425399_8426878_+,NA|278aa|down_5|NZ_CP025958.1_8431588_8432422_-	NA|415aa|up_9|NZ_CP025958.1_8408460_8409705_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|798aa|up_8|NZ_CP025958.1_8409791_8412185_-	sd00006, TPR, Tetratricopeptide repeat	NA|466aa|up_7|NZ_CP025958.1_8412270_8413668_-	PRK14902, PRK14902, 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB	NA|332aa|up_6|NZ_CP025958.1_8413664_8414660_-	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed	NA|185aa|up_5|NZ_CP025958.1_8414770_8415325_-	cd00487, Pep_deformylase, Polypeptide or peptide deformylase; a family of metalloenzymes that catalyzes the removal of the N-terminal formyl group in a growing polypeptide chain following translation initiation during protein synthesis in prokaryotes	NA|109aa|up_4|NZ_CP025958.1_8415514_8415841_-	COG2076, EmrE, Membrane transporters of cations and cationic drugs [Inorganic ion transport and metabolism]	NA|418aa|up_3|NZ_CP025958.1_8415910_8417164_-	COG0156, BioF, 7-keto-8-aminopelargonate synthetase and related enzymes [Coenzyme metabolism]	NA|269aa|up_2|NZ_CP025958.1_8417376_8418183_-	TIGR01084, A/G-specific_adenine_glycosylase, A/G-specific adenine glycosylase	NA|164aa|up_1|NZ_CP025958.1_8418255_8418747_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|98aa|up_0|NZ_CP025958.1_8419086_8419380_-	pfam04456, DUF503, Protein of unknown function (DUF503)	NA|498aa|down_0|NZ_CP025958.1_8423307_8424801_-	PRK09202, nusA, transcription elongation factor NusA; Validated	NA|493aa|down_1|NZ_CP025958.1_8425399_8426878_+	NA	NA|173aa|down_2|NZ_CP025958.1_8427379_8427898_+	pfam11181, YflT, Heat induced stress protein YflT	NA|399aa|down_3|NZ_CP025958.1_8428230_8429427_-	cd05683, M20_peptT_like, M20 Peptidase T like enzymes specifically cleave tripeptides	NA|565aa|down_4|NZ_CP025958.1_8429820_8431515_-	pfam03321, GH3, GH3 auxin-responsive promoter	NA|278aa|down_5|NZ_CP025958.1_8431588_8432422_-	NA	NA|296aa|down_6|NZ_CP025958.1_8432485_8433373_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|430aa|down_7|NZ_CP025958.1_8433632_8434922_-	TIGR03150, fabF, beta-ketoacyl-acyl-carrier-protein synthase II	NA|425aa|down_8|NZ_CP025958.1_8435028_8436303_-	cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP) synthase (KAS), type I and II	NA|164aa|down_9|NZ_CP025958.1_8436529_8437021_-	cd01288, FabZ, FabZ is a 17kD beta-hydroxyacyl-acyl carrier protein (ACP) dehydratase that primarily catalyzes the dehydration of beta-hydroxyacyl-ACP to trans-2-acyl-ACP, the third step in the elongation phase of the bacterial/ plastid, type II, fatty-acid biosynthesis pathway
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	31	8721213-8721351	23	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	TTGGCCACAAAAAAGCACAAAGGGCACAAAAGG	33	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|92aa|up_2|NZ_CP025958.1_8719808_8720084_+,NA|102aa|down_3|NZ_CP025958.1_8726312_8726618_+,NA|433aa|down_6|NZ_CP025958.1_8730976_8732275_-,NA|73aa|down_7|NZ_CP025958.1_8733109_8733328_-,NA|92aa|down_9|NZ_CP025958.1_8735608_8735884_-	NA|637aa|up_9|NZ_CP025958.1_8709522_8711433_-	PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase; Provisional	NA|338aa|up_8|NZ_CP025958.1_8711569_8712583_-	PRK10581, PRK10581, (2E,6E)-farnesyl diphosphate synthase	NA|106aa|up_7|NZ_CP025958.1_8712579_8712897_-	PRK00977, PRK00977, exodeoxyribonuclease VII small subunit; Provisional	NA|433aa|up_6|NZ_CP025958.1_8712973_8714272_-	TIGR00237, exodeoxyribonuclease_VII_large_subunit, exodeoxyribonuclease VII, large subunit	NA|491aa|up_5|NZ_CP025958.1_8714268_8715741_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|579aa|up_4|NZ_CP025958.1_8716091_8717828_-	PRK00302, lnt, apolipoprotein N-acyltransferase; Reviewed	NA|402aa|up_3|NZ_CP025958.1_8718051_8719257_-	cd00887, MoeA, MoeA family	NA|92aa|up_2|NZ_CP025958.1_8719808_8720084_+	NA	NA|174aa|up_1|NZ_CP025958.1_8720132_8720654_-	COG3493, CitS, Na+/citrate symporter [Energy production and conversion]	NA|152aa|up_0|NZ_CP025958.1_8720750_8721206_+	COG1671, COG1671, Uncharacterized protein conserved in bacteria [Function unknown]	NA|736aa|down_0|NZ_CP025958.1_8721635_8723843_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|432aa|down_1|NZ_CP025958.1_8724077_8725373_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|189aa|down_2|NZ_CP025958.1_8725380_8725947_-	cd10555, EBDH_beta, beta subunit of ethylbenzene-dehydrogenase (EBDH)	NA|102aa|down_3|NZ_CP025958.1_8726312_8726618_+	NA	NA|441aa|down_4|NZ_CP025958.1_8727519_8728842_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|568aa|down_5|NZ_CP025958.1_8729036_8730740_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|433aa|down_6|NZ_CP025958.1_8730976_8732275_-	NA	NA|73aa|down_7|NZ_CP025958.1_8733109_8733328_-	NA	NA|478aa|down_8|NZ_CP025958.1_8733990_8735424_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|92aa|down_9|NZ_CP025958.1_8735608_8735884_-	NA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	32	8859306-8860201	9	CRT	no	csa3	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Type I-A	ACGGACGCGGGNNTGAAGGAGNTGGCCGNGCTCAA	35	0	0	NA	NA	NA	12	12	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|82aa|up_8|NZ_CP025958.1_8849550_8849796_-,NA|131aa|up_1|NZ_CP025958.1_8856453_8856846_+,NA|65aa|down_2|NZ_CP025958.1_8864044_8864239_-	NA|165aa|up_9|NZ_CP025958.1_8849041_8849536_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|82aa|up_8|NZ_CP025958.1_8849550_8849796_-	NA	NA|504aa|up_7|NZ_CP025958.1_8850597_8852109_+	TIGR00115, tig, trigger factor	NA|209aa|up_6|NZ_CP025958.1_8852224_8852851_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|206aa|up_5|NZ_CP025958.1_8853018_8853636_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|300aa|up_4|NZ_CP025958.1_8853666_8854566_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|160aa|up_3|NZ_CP025958.1_8854666_8855146_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|345aa|up_2|NZ_CP025958.1_8855240_8856275_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|131aa|up_1|NZ_CP025958.1_8856453_8856846_+	NA	NA|544aa|up_0|NZ_CP025958.1_8857111_8858743_-	PRK00179, pgi, glucose-6-phosphate isomerase; Reviewed	NA|223aa|down_0|NZ_CP025958.1_8861582_8862251_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|61aa|down_1|NZ_CP025958.1_8863491_8863674_-	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|65aa|down_2|NZ_CP025958.1_8864044_8864239_-	NA	NA|736aa|down_3|NZ_CP025958.1_8864401_8866609_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|119aa|down_4|NZ_CP025958.1_8866639_8866996_-	cd08161, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily	NA|473aa|down_5|NZ_CP025958.1_8867176_8868595_+	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]	csa3|130aa|down_6|NZ_CP025958.1_8868703_8869093_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|317aa|down_7|NZ_CP025958.1_8869112_8870063_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|392aa|down_8|NZ_CP025958.1_8870219_8871395_+	sd00045, ANK, ankyrin repeats	NA|400aa|down_9|NZ_CP025958.1_8871602_8872802_-	pfam14100, PmoA, Methane oxygenase PmoA
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	33	8986790-8986874	24	CRISPRCasFinder	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GTCTTCTGCCTTGGTTTCCTTTTGTG	26	1	1	8986816-8986848	NZ_CP025958.1_8986734-8986702	NA	1	1	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|169aa|up_3|NZ_CP025958.1_8982066_8982573_-,NA	NA|721aa|up_9|NZ_CP025958.1_8974639_8976802_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|168aa|up_8|NZ_CP025958.1_8977009_8977513_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|583aa|up_7|NZ_CP025958.1_8977659_8979408_+	pfam13360, PQQ_2, PQQ-like domain	NA|281aa|up_6|NZ_CP025958.1_8979538_8980381_+	PRK15435, PRK15435, bifunctional DNA-binding transcriptional regulator/O6-methylguanine-DNA methyltransferase Ada	NA|313aa|up_5|NZ_CP025958.1_8980460_8981399_+	pfam10127, Nuc-transf, Predicted nucleotidyltransferase	NA|182aa|up_4|NZ_CP025958.1_8981395_8981941_-	pfam10936, DUF2617, Protein of unknown function DUF2617	NA|169aa|up_3|NZ_CP025958.1_8982066_8982573_-	NA	NA|438aa|up_2|NZ_CP025958.1_8982885_8984199_+	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|639aa|up_1|NZ_CP025958.1_8984312_8986229_+	smart00752, HTTM, Horizontally Transferred TransMembrane Domain	NA|108aa|up_0|NZ_CP025958.1_8986340_8986664_+	pfam11950, DUF3467, Protein of unknown function (DUF3467)	NA|542aa|down_0|NZ_CP025958.1_8986890_8988516_-	cd16016, AP-SPAP, SPAP is a subclass of alkaline phosphatase (AP)	NA|680aa|down_1|NZ_CP025958.1_8988837_8990877_-	COG4099, COG4099, Predicted peptidase [General function prediction only]	NA|192aa|down_2|NZ_CP025958.1_8991081_8991657_+	PRK00529, PRK00529, elongation factor P; Validated	NA|168aa|down_3|NZ_CP025958.1_8992100_8992604_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|488aa|down_4|NZ_CP025958.1_8992622_8994086_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|154aa|down_5|NZ_CP025958.1_8994185_8994647_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|113aa|down_6|NZ_CP025958.1_8994600_8994939_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|172aa|down_7|NZ_CP025958.1_8995044_8995560_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|163aa|down_8|NZ_CP025958.1_8995508_8995997_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|382aa|down_9|NZ_CP025958.1_8995973_8997119_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated
GCF_003149495.1_ASM314949v1	NZ_CP025958	Gemmata obscuriglobus strain DSM 5831 chromosome, complete genome	34	8998477-8998872	25,10	CRISPRCasFinder,CRT	no		cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	Orphan	GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC,GCTTCAATTCGGCCGCGGCTATGAGCCGCGGAGAAC	36,36	0	0	NA	NA	NA:NA	5,5	5	Orphan	cas8u1,cas5u,cas7,cas8u2,cas3,cas2,cas1,csa3,DEDDh,RT,cas6,cas8b3,DinG,cas4,PD-DExK	NA|135aa|up_1|NZ_CP025958.1_8997522_8997927_-,NA	NA|168aa|up_9|NZ_CP025958.1_8992100_8992604_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|488aa|up_8|NZ_CP025958.1_8992622_8994086_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|154aa|up_7|NZ_CP025958.1_8994185_8994647_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|113aa|up_6|NZ_CP025958.1_8994600_8994939_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|172aa|up_5|NZ_CP025958.1_8995044_8995560_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|163aa|up_4|NZ_CP025958.1_8995508_8995997_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|382aa|up_3|NZ_CP025958.1_8995973_8997119_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|102aa|up_2|NZ_CP025958.1_8997269_8997575_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|135aa|up_1|NZ_CP025958.1_8997522_8997927_-	NA	NA|148aa|up_0|NZ_CP025958.1_8997923_8998367_-	pfam13565, HTH_32, Homeodomain-like domain	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
