assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	1	767880-767977	1	CRISPRCasFinder	no	csa3	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Type I-A	GCCGGGGGCCTCCCGTCGTTGCGGA	25	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|155aa|up_8|NZ_CP012673.1_753905_754370_+,NA|329aa|up_4|NZ_CP012673.1_760140_761127_-,NA|84aa|down_1|NZ_CP012673.1_770989_771241_+,NA|160aa|down_2|NZ_CP012673.1_771189_771669_-,NA|357aa|down_7|NZ_CP012673.1_776325_777396_+	NA|135aa|up_9|NZ_CP012673.1_753504_753909_+	pfam00034, Cytochrom_C, Cytochrome c	NA|155aa|up_8|NZ_CP012673.1_753905_754370_+	NA	NA|166aa|up_7|NZ_CP012673.1_754508_755006_-	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	csa3|358aa|up_6|NZ_CP012673.1_755098_756172_-	cd10451, GIY-YIG_LuxR_like, GIY-YIG domain of LuxR and ArsR family transcriptional regulators, and uncharacterized hypothetical proteins found in bacteria	NA|1210aa|up_5|NZ_CP012673.1_756455_760085_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|329aa|up_4|NZ_CP012673.1_760140_761127_-	NA	NA|1204aa|up_3|NZ_CP012673.1_761360_764972_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|371aa|up_2|NZ_CP012673.1_765023_766136_-	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|289aa|up_1|NZ_CP012673.1_766608_767475_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|55aa|up_0|NZ_CP012673.1_767661_767826_+	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|745aa|down_0|NZ_CP012673.1_768584_770819_-	smart00752, HTTM, Horizontally Transferred TransMembrane Domain	NA|84aa|down_1|NZ_CP012673.1_770989_771241_+	NA	NA|160aa|down_2|NZ_CP012673.1_771189_771669_-	NA	NA|276aa|down_3|NZ_CP012673.1_771727_772555_-	cd04238, AAK_NAGK-like, AAK_NAGK-like: N-Acetyl-L-glutamate kinase (NAGK)-like 	NA|428aa|down_4|NZ_CP012673.1_772585_773869_+	cd03885, M20_CPDG2, M20 Peptidase Glutamate carboxypeptidase, a periplasmic enzyme	NA|283aa|down_5|NZ_CP012673.1_774143_774992_+	pfam05685, Uma2, Putative restriction endonuclease	NA|415aa|down_6|NZ_CP012673.1_775050_776295_-	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]	NA|357aa|down_7|NZ_CP012673.1_776325_777396_+	NA	NA|398aa|down_8|NZ_CP012673.1_778027_779221_+	COG3287, COG3287, Uncharacterized conserved protein [Function unknown]	NA|623aa|down_9|NZ_CP012673.1_779247_781116_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	2	1618479-1618583	2	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGGCGGCCTCGAAGCCCGGCAGCGG	26	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA|201aa|down_2|NZ_CP012673.1_1620606_1621209_+,NA|450aa|down_4|NZ_CP012673.1_1627549_1628899_-,NA|384aa|down_9|NZ_CP012673.1_1633124_1634276_-	NA|196aa|up_9|NZ_CP012673.1_1602617_1603205_-	TIGR04292, hypothetical_protein_PF0600, heavy-Cys/CGP-CTERM domain protein	NA|231aa|up_8|NZ_CP012673.1_1603517_1604210_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|598aa|up_7|NZ_CP012673.1_1604863_1606657_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|1306aa|up_6|NZ_CP012673.1_1606807_1610725_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|569aa|up_5|NZ_CP012673.1_1610675_1612382_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|599aa|up_4|NZ_CP012673.1_1612549_1614346_+	COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]	NA|241aa|up_3|NZ_CP012673.1_1614446_1615169_-	COG0678, AHP1, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|219aa|up_2|NZ_CP012673.1_1615395_1616052_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|386aa|up_1|NZ_CP012673.1_1616682_1617840_+	cd07729, AHL_lactonase_MBL-fold, quorum-quenching N-acyl-homoserine lactonase, MBL-fold metallo-hydrolase domain	NA|146aa|up_0|NZ_CP012673.1_1617925_1618363_-	cd08355, TioX_like, Micromonospora sp	NA|145aa|down_0|NZ_CP012673.1_1619386_1619821_-	PRK00118, PRK00118, putative DNA-binding protein; Validated	NA|166aa|down_1|NZ_CP012673.1_1619792_1620290_-	PRK10885, cca, multifunctional CCA addition/repair protein	NA|201aa|down_2|NZ_CP012673.1_1620606_1621209_+	NA	NA|1960aa|down_3|NZ_CP012673.1_1621358_1627238_+	pfam13646, HEAT_2, HEAT repeats	NA|450aa|down_4|NZ_CP012673.1_1627549_1628899_-	NA	NA|511aa|down_5|NZ_CP012673.1_1629072_1630605_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|133aa|down_6|NZ_CP012673.1_1630855_1631254_+	PRK10811, rne, ribonuclease E; Reviewed	NA|252aa|down_7|NZ_CP012673.1_1631211_1631967_+	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|368aa|down_8|NZ_CP012673.1_1632018_1633122_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|384aa|down_9|NZ_CP012673.1_1633124_1634276_-	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	3	2035597-2035691	3	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGGCGCCGGCGAGGGCGAGGACG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|382aa|up_8|NZ_CP012673.1_2023676_2024822_+,NA|364aa|up_4|NZ_CP012673.1_2030630_2031722_+,NA|83aa|up_3|NZ_CP012673.1_2031826_2032075_+,NA|66aa|down_1|NZ_CP012673.1_2038818_2039016_-,NA|161aa|down_8|NZ_CP012673.1_2046639_2047122_+	NA|619aa|up_9|NZ_CP012673.1_2021736_2023593_+	cd16373, DMSOR_beta_like, uncharacterized subfamily of DMSO Reductase beta subunit family	NA|382aa|up_8|NZ_CP012673.1_2023676_2024822_+	NA	NA|543aa|up_7|NZ_CP012673.1_2024873_2026502_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|636aa|up_6|NZ_CP012673.1_2026679_2028587_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|461aa|up_5|NZ_CP012673.1_2028667_2030050_+	PRK13299, PRK13299, tRNA CCA-pyrophosphorylase; Provisional	NA|364aa|up_4|NZ_CP012673.1_2030630_2031722_+	NA	NA|83aa|up_3|NZ_CP012673.1_2031826_2032075_+	NA	NA|425aa|up_2|NZ_CP012673.1_2032180_2033455_-	PRK09228, PRK09228, guanine deaminase; Provisional	NA|221aa|up_1|NZ_CP012673.1_2033451_2034114_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|364aa|up_0|NZ_CP012673.1_2034222_2035314_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|196aa|down_0|NZ_CP012673.1_2038021_2038609_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|66aa|down_1|NZ_CP012673.1_2038818_2039016_-	NA	NA|323aa|down_2|NZ_CP012673.1_2039012_2039981_-	pfam01590, GAF, GAF domain	NA|395aa|down_3|NZ_CP012673.1_2040308_2041493_+	pfam15887, Peptidase_Mx, Putative zinc-binding metallo-peptidase	NA|349aa|down_4|NZ_CP012673.1_2042034_2043081_+	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|490aa|down_5|NZ_CP012673.1_2043073_2044543_+	PRK09427, PRK09427, bifunctional indole-3-glycerol-phosphate synthase TrpC/phosphoribosylanthranilate isomerase TrpF	NA|398aa|down_6|NZ_CP012673.1_2044595_2045789_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|275aa|down_7|NZ_CP012673.1_2045785_2046610_+	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|161aa|down_8|NZ_CP012673.1_2046639_2047122_+	NA	NA|1159aa|down_9|NZ_CP012673.1_2047134_2050611_-	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	4	2050376-2050491	4	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGGGCGTCTGGGCTGCAGGCGCCTG	26	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|66aa|up_7|NZ_CP012673.1_2038818_2039016_-,NA|161aa|up_0|NZ_CP012673.1_2046639_2047122_+,NA|104aa|down_6|NZ_CP012673.1_2063344_2063656_-,NA|166aa|down_7|NZ_CP012673.1_2063652_2064150_-,NA|414aa|down_8|NZ_CP012673.1_2064146_2065388_-,NA|282aa|down_9|NZ_CP012673.1_2066447_2067293_+	NA|801aa|up_9|NZ_CP012673.1_2035327_2037730_-	cd16148, sulfatase_like, uncharacterized sulfatase subfamily	NA|196aa|up_8|NZ_CP012673.1_2038021_2038609_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|66aa|up_7|NZ_CP012673.1_2038818_2039016_-	NA	NA|323aa|up_6|NZ_CP012673.1_2039012_2039981_-	pfam01590, GAF, GAF domain	NA|395aa|up_5|NZ_CP012673.1_2040308_2041493_+	pfam15887, Peptidase_Mx, Putative zinc-binding metallo-peptidase	NA|349aa|up_4|NZ_CP012673.1_2042034_2043081_+	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|490aa|up_3|NZ_CP012673.1_2043073_2044543_+	PRK09427, PRK09427, bifunctional indole-3-glycerol-phosphate synthase TrpC/phosphoribosylanthranilate isomerase TrpF	NA|398aa|up_2|NZ_CP012673.1_2044595_2045789_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|275aa|up_1|NZ_CP012673.1_2045785_2046610_+	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|161aa|up_0|NZ_CP012673.1_2046639_2047122_+	NA	NA|271aa|down_0|NZ_CP012673.1_2050892_2051705_+	pfam12974, Phosphonate-bd, ABC transporter, phosphonate, periplasmic substrate-binding protein	NA|1494aa|down_1|NZ_CP012673.1_2051724_2056206_-	pfam04357, TamB, TamB, inner membrane protein subunit of TAM complex	NA|360aa|down_2|NZ_CP012673.1_2056240_2057320_-	cd10283, MnuA_DNase1-like, Mycoplasma pulmonis MnuA nuclease-like	NA|558aa|down_3|NZ_CP012673.1_2057499_2059173_-	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|549aa|down_4|NZ_CP012673.1_2059417_2061064_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|564aa|down_5|NZ_CP012673.1_2061713_2063405_-	pfam12532, DUF3732, Protein of unknown function (DUF3732)	NA|104aa|down_6|NZ_CP012673.1_2063344_2063656_-	NA	NA|166aa|down_7|NZ_CP012673.1_2063652_2064150_-	NA	NA|414aa|down_8|NZ_CP012673.1_2064146_2065388_-	NA	NA|282aa|down_9|NZ_CP012673.1_2066447_2067293_+	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	5	3534085-3534185	5	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CCGAAGGGCAATCCGCGTCCGCG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|116aa|up_6|NZ_CP012673.1_3523590_3523938_+,NA|159aa|down_2|NZ_CP012673.1_3535930_3536407_-,NA|127aa|down_3|NZ_CP012673.1_3536624_3537005_+	NA|147aa|up_9|NZ_CP012673.1_3521242_3521683_+	cd04622, CBS_pair_HRP1_like, CBS pair domain found in Hypoxic Response Protein 1 (HRP1) -like proteinds	NA|218aa|up_8|NZ_CP012673.1_3521748_3522402_-	PRK00058, PRK00058, peptide-methionine (S)-S-oxide reductase MsrA	NA|175aa|up_7|NZ_CP012673.1_3522694_3523219_+	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|116aa|up_6|NZ_CP012673.1_3523590_3523938_+	NA	NA|221aa|up_5|NZ_CP012673.1_3524007_3524670_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|395aa|up_4|NZ_CP012673.1_3524728_3525913_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|866aa|up_3|NZ_CP012673.1_3526161_3528759_-	TIGR02412, Aminopeptidase_N, aminopeptidase N, Streptomyces lividans type	NA|361aa|up_2|NZ_CP012673.1_3529012_3530095_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|560aa|up_1|NZ_CP012673.1_3530321_3532001_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|356aa|up_0|NZ_CP012673.1_3532226_3533294_-	cd13555, PBP2_sulfate_ester_like, Sulfate ester binding protein-like, the type 2 periplasmic binding protein fold	NA|260aa|down_0|NZ_CP012673.1_3534430_3535210_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|224aa|down_1|NZ_CP012673.1_3535259_3535931_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|159aa|down_2|NZ_CP012673.1_3535930_3536407_-	NA	NA|127aa|down_3|NZ_CP012673.1_3536624_3537005_+	NA	NA|314aa|down_4|NZ_CP012673.1_3537099_3538041_-	COG2962, RarD, Predicted permeases [General function prediction only]	NA|400aa|down_5|NZ_CP012673.1_3538516_3539716_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|354aa|down_6|NZ_CP012673.1_3539712_3540774_+	pfam13614, AAA_31, AAA domain	NA|139aa|down_7|NZ_CP012673.1_3541864_3542281_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|179aa|down_8|NZ_CP012673.1_3542336_3542873_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|266aa|down_9|NZ_CP012673.1_3543010_3543808_+	COG3931, COG3931, Predicted N-formylglutamate amidohydrolase [Amino acid transport and metabolism]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	6	3595373-3595500	6	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGAGTCCGCCGCCTGCGCGGCGGCGGCCTGCGACTGACC	39	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|134aa|up_8|NZ_CP012673.1_3578517_3578919_+,NA|534aa|down_0|NZ_CP012673.1_3595830_3597432_+,NA|619aa|down_2|NZ_CP012673.1_3598567_3600424_+,NA|355aa|down_7|NZ_CP012673.1_3609012_3610077_+	NA|2229aa|up_9|NZ_CP012673.1_3571834_3578521_+	cd14953, NHL_like_1, Uncharacterized NHL-repeat domain in bacterial proteins	NA|134aa|up_8|NZ_CP012673.1_3578517_3578919_+	NA	NA|230aa|up_7|NZ_CP012673.1_3578984_3579674_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|326aa|up_6|NZ_CP012673.1_3579786_3580764_-	PRK00861, PRK00861, putative lipid kinase; Reviewed	NA|520aa|up_5|NZ_CP012673.1_3580854_3582414_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|363aa|up_4|NZ_CP012673.1_3582549_3583638_+	cd09084, EEP-2, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; uncharacterized family 2	NA|804aa|up_3|NZ_CP012673.1_3583747_3586159_-	COG3661, AguA, Alpha-glucuronidase [Carbohydrate transport and metabolism]	NA|597aa|up_2|NZ_CP012673.1_3586780_3588571_+	pfam00553, CBM_2, Cellulose binding domain	NA|1087aa|up_1|NZ_CP012673.1_3588943_3592204_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|415aa|up_0|NZ_CP012673.1_3592738_3593983_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|534aa|down_0|NZ_CP012673.1_3595830_3597432_+	NA	NA|273aa|down_1|NZ_CP012673.1_3597568_3598387_+	cd06259, YdcF-like, YdcF-like	NA|619aa|down_2|NZ_CP012673.1_3598567_3600424_+	NA	NA|266aa|down_3|NZ_CP012673.1_3600628_3601426_-	pfam13585, CHU_C, C-terminal domain of CHU protein family	NA|505aa|down_4|NZ_CP012673.1_3601671_3603186_-	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|476aa|down_5|NZ_CP012673.1_3603202_3604630_-	cd05682, M20_dipept_dapE, uncharacterized M20 dipeptidase	NA|1358aa|down_6|NZ_CP012673.1_3604925_3608999_+	pfam13665, DUF4150, Domain of unknown function (DUF4150)	NA|355aa|down_7|NZ_CP012673.1_3609012_3610077_+	NA	NA|378aa|down_8|NZ_CP012673.1_3610090_3611224_-	TIGR04021, LLM_DMSO2_sfnG, dimethyl sulfone monooxygenase SfnG	NA|394aa|down_9|NZ_CP012673.1_3611275_3612457_-	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	7	3968170-3968307	7	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGCTCTTCGTGGCGGCGCCGCTCCTCTTG	30	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|111aa|up_4|NZ_CP012673.1_3960385_3960718_-,NA|239aa|down_0|NZ_CP012673.1_3969234_3969951_+,NA|410aa|down_1|NZ_CP012673.1_3970156_3971386_+,NA|91aa|down_5|NZ_CP012673.1_3974673_3974946_+,NA|592aa|down_6|NZ_CP012673.1_3975280_3977056_-,NA|171aa|down_9|NZ_CP012673.1_3981970_3982483_-	NA|548aa|up_9|NZ_CP012673.1_3946203_3947847_-	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|1665aa|up_8|NZ_CP012673.1_3948204_3953199_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|912aa|up_7|NZ_CP012673.1_3953293_3956029_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|126aa|up_6|NZ_CP012673.1_3957416_3957794_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|826aa|up_5|NZ_CP012673.1_3957809_3960287_+	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|111aa|up_4|NZ_CP012673.1_3960385_3960718_-	NA	NA|534aa|up_3|NZ_CP012673.1_3962398_3964000_+	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|342aa|up_2|NZ_CP012673.1_3964162_3965188_+	PRK09354, recA, recombinase A; Provisional	NA|160aa|up_1|NZ_CP012673.1_3965335_3965815_+	pfam00436, SSB, Single-strand binding protein family	NA|316aa|up_0|NZ_CP012673.1_3966000_3966948_-	pfam13229, Beta_helix, Right handed beta helix region	NA|239aa|down_0|NZ_CP012673.1_3969234_3969951_+	NA	NA|410aa|down_1|NZ_CP012673.1_3970156_3971386_+	NA	NA|401aa|down_2|NZ_CP012673.1_3971378_3972581_+	pfam04261, Dyp_perox, Dyp-type peroxidase family	NA|308aa|down_3|NZ_CP012673.1_3972660_3973584_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|148aa|down_4|NZ_CP012673.1_3973845_3974289_-	pfam09537, DUF2383, Domain of unknown function (DUF2383)	NA|91aa|down_5|NZ_CP012673.1_3974673_3974946_+	NA	NA|592aa|down_6|NZ_CP012673.1_3975280_3977056_-	NA	NA|1271aa|down_7|NZ_CP012673.1_3977135_3980948_-	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|319aa|down_8|NZ_CP012673.1_3981017_3981974_-	smart00752, HTTM, Horizontally Transferred TransMembrane Domain	NA|171aa|down_9|NZ_CP012673.1_3981970_3982483_-	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	8	4178944-4179326	1	CRT	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GGCGGCGGAGCGGGCGCTGGCGGCGG	26	0	0	NA	NA	NA	6	6	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|52aa|up_7|NZ_CP012673.1_4167841_4167997_-,NA|75aa|up_0|NZ_CP012673.1_4176188_4176413_+,NA|126aa|down_0|NZ_CP012673.1_4179771_4180149_-,NA|223aa|down_1|NZ_CP012673.1_4180496_4181165_-,NA|231aa|down_2|NZ_CP012673.1_4181790_4182483_+,NA|277aa|down_4|NZ_CP012673.1_4189292_4190123_-,NA|206aa|down_5|NZ_CP012673.1_4190621_4191239_-,NA|229aa|down_6|NZ_CP012673.1_4191793_4192480_+	NA|330aa|up_9|NZ_CP012673.1_4166264_4167254_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|155aa|up_8|NZ_CP012673.1_4167341_4167806_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|52aa|up_7|NZ_CP012673.1_4167841_4167997_-	NA	NA|445aa|up_6|NZ_CP012673.1_4167962_4169297_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|149aa|up_5|NZ_CP012673.1_4169452_4169899_-	PRK00872, PRK00872, hypothetical protein; Provisional	NA|409aa|up_4|NZ_CP012673.1_4169895_4171122_-	PRK07538, PRK07538, hypothetical protein; Provisional	NA|400aa|up_3|NZ_CP012673.1_4171181_4172381_-	PRK11551, PRK11551, putative 3-hydroxyphenylpropionic transporter MhpT; Provisional	NA|343aa|up_2|NZ_CP012673.1_4172468_4173497_-	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|776aa|up_1|NZ_CP012673.1_4173936_4176264_+	TIGR02232, myxo_disulf_rpt, Myxococcus cysteine-rich repeat	NA|75aa|up_0|NZ_CP012673.1_4176188_4176413_+	NA	NA|126aa|down_0|NZ_CP012673.1_4179771_4180149_-	NA	NA|223aa|down_1|NZ_CP012673.1_4180496_4181165_-	NA	NA|231aa|down_2|NZ_CP012673.1_4181790_4182483_+	NA	NA|257aa|down_3|NZ_CP012673.1_4182916_4183687_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|277aa|down_4|NZ_CP012673.1_4189292_4190123_-	NA	NA|206aa|down_5|NZ_CP012673.1_4190621_4191239_-	NA	NA|229aa|down_6|NZ_CP012673.1_4191793_4192480_+	NA	NA|842aa|down_7|NZ_CP012673.1_4192496_4195022_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|449aa|down_8|NZ_CP012673.1_4195079_4196426_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|299aa|down_9|NZ_CP012673.1_4196627_4197524_-	pfam12852, Cupin_6, Cupin
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	9	4181560-4181650	8	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GGACGCCTCACACAGGCGGGATATA	25	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|75aa|up_3|NZ_CP012673.1_4176188_4176413_+,NA|126aa|up_1|NZ_CP012673.1_4179771_4180149_-,NA|223aa|up_0|NZ_CP012673.1_4180496_4181165_-,NA|231aa|down_0|NZ_CP012673.1_4181790_4182483_+,NA|277aa|down_2|NZ_CP012673.1_4189292_4190123_-,NA|206aa|down_3|NZ_CP012673.1_4190621_4191239_-,NA|229aa|down_4|NZ_CP012673.1_4191793_4192480_+,NA|143aa|down_9|NZ_CP012673.1_4198416_4198845_-	NA|445aa|up_9|NZ_CP012673.1_4167962_4169297_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|149aa|up_8|NZ_CP012673.1_4169452_4169899_-	PRK00872, PRK00872, hypothetical protein; Provisional	NA|409aa|up_7|NZ_CP012673.1_4169895_4171122_-	PRK07538, PRK07538, hypothetical protein; Provisional	NA|400aa|up_6|NZ_CP012673.1_4171181_4172381_-	PRK11551, PRK11551, putative 3-hydroxyphenylpropionic transporter MhpT; Provisional	NA|343aa|up_5|NZ_CP012673.1_4172468_4173497_-	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|776aa|up_4|NZ_CP012673.1_4173936_4176264_+	TIGR02232, myxo_disulf_rpt, Myxococcus cysteine-rich repeat	NA|75aa|up_3|NZ_CP012673.1_4176188_4176413_+	NA	NA|920aa|up_2|NZ_CP012673.1_4176846_4179606_+	sd00038, Kelch, Kelch repeat	NA|126aa|up_1|NZ_CP012673.1_4179771_4180149_-	NA	NA|223aa|up_0|NZ_CP012673.1_4180496_4181165_-	NA	NA|231aa|down_0|NZ_CP012673.1_4181790_4182483_+	NA	NA|257aa|down_1|NZ_CP012673.1_4182916_4183687_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|277aa|down_2|NZ_CP012673.1_4189292_4190123_-	NA	NA|206aa|down_3|NZ_CP012673.1_4190621_4191239_-	NA	NA|229aa|down_4|NZ_CP012673.1_4191793_4192480_+	NA	NA|842aa|down_5|NZ_CP012673.1_4192496_4195022_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|449aa|down_6|NZ_CP012673.1_4195079_4196426_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|299aa|down_7|NZ_CP012673.1_4196627_4197524_-	pfam12852, Cupin_6, Cupin	NA|248aa|down_8|NZ_CP012673.1_4197654_4198398_+	cd05374, 17beta-HSD-like_SDR_c, 17beta hydroxysteroid dehydrogenase-like, classical (c) SDRs	NA|143aa|down_9|NZ_CP012673.1_4198416_4198845_-	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	10	4277255-4277396	9	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGAGGAGCCGCCGAGCAGCCGGCG	25	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|103aa|up_3|NZ_CP012673.1_4269716_4270025_+,NA|367aa|up_2|NZ_CP012673.1_4270265_4271366_+,NA	NA|219aa|up_9|NZ_CP012673.1_4261760_4262417_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|256aa|up_8|NZ_CP012673.1_4262421_4263189_+	cd07735, class_II_PDE_MBL-fold, class II cyclic nucleotide phosphodiesterases Saccharomyces cerevisiae PDE1, Dictyostelium discoideum PDE1 and PDE7, and related proteins; MBL-fold metallo-hydrolase domain	NA|273aa|up_7|NZ_CP012673.1_4263415_4264234_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|547aa|up_6|NZ_CP012673.1_4264453_4266094_-	pfam13450, NAD_binding_8, NAD(P)-binding Rossmann-like domain	NA|507aa|up_5|NZ_CP012673.1_4266098_4267619_-	PRK03612, PRK03612, polyamine aminopropyltransferase	NA|491aa|up_4|NZ_CP012673.1_4267940_4269413_+	pfam13785, DUF4178, Domain of unknown function (DUF4178)	NA|103aa|up_3|NZ_CP012673.1_4269716_4270025_+	NA	NA|367aa|up_2|NZ_CP012673.1_4270265_4271366_+	NA	NA|662aa|up_1|NZ_CP012673.1_4271362_4273348_+	cd00306, Peptidases_S8_S53, Peptidase domain in the S8 and S53 families	NA|783aa|up_0|NZ_CP012673.1_4273534_4275883_+	PRK10364, PRK10364, two-component system sensor histidine kinase ZraS	NA|434aa|down_0|NZ_CP012673.1_4277509_4278811_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|562aa|down_1|NZ_CP012673.1_4279002_4280688_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|253aa|down_2|NZ_CP012673.1_4280976_4281735_-	pfam01066, CDP-OH_P_transf, CDP-alcohol phosphatidyltransferase	NA|200aa|down_3|NZ_CP012673.1_4281824_4282424_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1296aa|down_4|NZ_CP012673.1_4282779_4286667_+	pfam02128, Peptidase_M36, Fungalysin metallopeptidase (M36)	NA|427aa|down_5|NZ_CP012673.1_4286706_4287987_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|173aa|down_6|NZ_CP012673.1_4289592_4290111_+	COG3981, COG3981, Predicted acetyltransferase [General function prediction only]	NA|342aa|down_7|NZ_CP012673.1_4290460_4291486_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1048aa|down_8|NZ_CP012673.1_4292932_4296076_+	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	cas3|1471aa|down_9|NZ_CP012673.1_4297547_4301960_+	COG1201, Lhr, Lhr-like helicases [General function prediction only]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	11	4459088-4459237	2	CRT	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGAACCCGTTCGCGGCG	18	1	128	4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123|4459106-4459123	NZ_CP012673.1_997385-997402|NZ_CP012673.1_1156663-1156680|NZ_CP012673.1_2268138-2268121|NZ_CP012673.1_2283506-2283489|NZ_CP012673.1_4688661-4688678|NZ_CP012673.1_6522040-6522057|NZ_CP012673.1_11224241-11224258|NZ_CP012673.1_27244-27261|NZ_CP012673.1_62834-62817|NZ_CP012673.1_94437-94420|NZ_CP012673.1_116050-116033|NZ_CP012673.1_321359-321376|NZ_CP012673.1_736738-736755|NZ_CP012673.1_757678-757661|NZ_CP012673.1_764513-764530|NZ_CP012673.1_776501-776518|NZ_CP012673.1_1019883-1019866|NZ_CP012673.1_1172653-1172670|NZ_CP012673.1_1179030-1179047|NZ_CP012673.1_1458107-1458124|NZ_CP012673.1_1661968-1661985|NZ_CP012673.1_1709041-1709024|NZ_CP012673.1_1872850-1872867|NZ_CP012673.1_2020857-2020840|NZ_CP012673.1_2200768-2200751|NZ_CP012673.1_2200753-2200770|NZ_CP012673.1_2296515-2296498|NZ_CP012673.1_2296614-2296597|NZ_CP012673.1_2296812-2296795|NZ_CP012673.1_2420340-2420323|NZ_CP012673.1_2420325-2420342|NZ_CP012673.1_2420346-2420329|NZ_CP012673.1_2588558-2588575|NZ_CP012673.1_2630892-2630875|NZ_CP012673.1_2634378-2634361|NZ_CP012673.1_2779744-2779761|NZ_CP012673.1_3170446-3170463|NZ_CP012673.1_3324823-3324840|NZ_CP012673.1_3385778-3385761|NZ_CP012673.1_3413789-3413806|NZ_CP012673.1_3514385-3514402|NZ_CP012673.1_3517943-3517926|NZ_CP012673.1_3605326-3605343|NZ_CP012673.1_3628191-3628174|NZ_CP012673.1_3681742-3681725|NZ_CP012673.1_3733292-3733275|NZ_CP012673.1_3736816-3736799|NZ_CP012673.1_3843013-3843030|NZ_CP012673.1_3849846-3849863|NZ_CP012673.1_3929325-3929342|NZ_CP012673.1_3973331-3973348|NZ_CP012673.1_4062827-4062844|NZ_CP012673.1_4365378-4365395|NZ_CP012673.1_4456734-4456751|NZ_CP012673.1_4591820-4591837|NZ_CP012673.1_4647784-4647767|NZ_CP012673.1_4647769-4647786|NZ_CP012673.1_4765213-4765230|NZ_CP012673.1_5497291-5497274|NZ_CP012673.1_5542360-5542343|NZ_CP012673.1_5558517-5558534|NZ_CP012673.1_5623090-5623073|NZ_CP012673.1_5630075-5630092|NZ_CP012673.1_5709704-5709687|NZ_CP012673.1_5768017-5768034|NZ_CP012673.1_5804199-5804216|NZ_CP012673.1_5834380-5834363|NZ_CP012673.1_5834365-5834382|NZ_CP012673.1_6350629-6350646|NZ_CP012673.1_6398716-6398699|NZ_CP012673.1_6903867-6903850|NZ_CP012673.1_6904185-6904202|NZ_CP012673.1_7133528-7133511|NZ_CP012673.1_7133513-7133530|NZ_CP012673.1_7320542-7320559|NZ_CP012673.1_7387234-7387217|NZ_CP012673.1_7428876-7428893|NZ_CP012673.1_7674885-7674902|NZ_CP012673.1_8116765-8116782|NZ_CP012673.1_8166462-8166479|NZ_CP012673.1_8180853-8180836|NZ_CP012673.1_8252631-8252648|NZ_CP012673.1_8412670-8412687|NZ_CP012673.1_8611979-8611996|NZ_CP012673.1_8679577-8679594|NZ_CP012673.1_8684176-8684193|NZ_CP012673.1_8830466-8830483|NZ_CP012673.1_9404758-9404775|NZ_CP012673.1_9450957-9450940|NZ_CP012673.1_9567204-9567221|NZ_CP012673.1_9807105-9807122|NZ_CP012673.1_9896810-9896793|NZ_CP012673.1_10204934-10204917|NZ_CP012673.1_10245998-10245981|NZ_CP012673.1_10339313-10339330|NZ_CP012673.1_10344278-10344295|NZ_CP012673.1_10413820-10413803|NZ_CP012673.1_10630044-10630027|NZ_CP012673.1_10773913-10773930|NZ_CP012673.1_10785279-10785296|NZ_CP012673.1_10871019-10871002|NZ_CP012673.1_10898184-10898167|NZ_CP012673.1_10928329-10928312|NZ_CP012673.1_11149750-11149767|NZ_CP012673.1_11198576-11198593|NZ_CP012673.1_11229921-11229938|NZ_CP012673.1_11425510-11425493|NZ_CP012673.1_11518973-11518956|NZ_CP012673.1_11557826-11557843|NZ_CP012673.1_11572023-11572006|NZ_CP012673.1_11703508-11703491|NZ_CP012673.1_12087972-12087955|NZ_CP012673.1_12234753-12234770|NZ_CP012673.1_12718205-12718222|NZ_CP012673.1_12844816-12844833|NZ_CP012673.1_12893562-12893579|NZ_CP012673.1_13008815-13008832|NZ_CP012673.1_13232743-13232760|NZ_CP012673.1_13575778-13575795|NZ_CP012673.1_13690916-13690933|NZ_CP012673.1_13715487-13715470|NZ_CP012673.1_13757439-13757422|NZ_CP012673.1_14014349-14014366|NZ_CP012673.1_14043240-14043257|NZ_CP012673.1_14137567-14137550|NZ_CP012673.1_14137552-14137569|NZ_CP012673.1_14339543-14339560|NZ_CP012673.1_14474790-14474773	NA	3	3	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|606aa|up_7|NZ_CP012673.1_4445033_4446851_-,NA|287aa|up_4|NZ_CP012673.1_4449563_4450424_-,NA|571aa|up_3|NZ_CP012673.1_4451011_4452724_+,NA|90aa|up_0|NZ_CP012673.1_4458371_4458641_+,NA|509aa|down_4|NZ_CP012673.1_4480058_4481585_+,NA|304aa|down_5|NZ_CP012673.1_4481581_4482493_+,NA|207aa|down_8|NZ_CP012673.1_4487333_4487954_-	NA|316aa|up_9|NZ_CP012673.1_4442131_4443079_-	TIGR04565, hypothetical_protein_N47_G32130, outer membrane beta-barrel protein	NA|456aa|up_8|NZ_CP012673.1_4443492_4444860_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|606aa|up_7|NZ_CP012673.1_4445033_4446851_-	NA	NA|305aa|up_6|NZ_CP012673.1_4447097_4448012_+	pfam13646, HEAT_2, HEAT repeats	NA|164aa|up_5|NZ_CP012673.1_4449064_4449556_-	PRK12497, PRK12497, YraN family protein	NA|287aa|up_4|NZ_CP012673.1_4449563_4450424_-	NA	NA|571aa|up_3|NZ_CP012673.1_4451011_4452724_+	NA	NA|1284aa|up_2|NZ_CP012673.1_4452809_4456661_+	sd00006, TPR, Tetratricopeptide repeat	NA|494aa|up_1|NZ_CP012673.1_4456660_4458142_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|90aa|up_0|NZ_CP012673.1_4458371_4458641_+	NA	NA|298aa|down_0|NZ_CP012673.1_4461389_4462283_+	sd00006, TPR, Tetratricopeptide repeat	NA|3860aa|down_1|NZ_CP012673.1_4462354_4473934_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|544aa|down_2|NZ_CP012673.1_4474121_4475753_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|1297aa|down_3|NZ_CP012673.1_4475781_4479672_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|509aa|down_4|NZ_CP012673.1_4480058_4481585_+	NA	NA|304aa|down_5|NZ_CP012673.1_4481581_4482493_+	NA	NA|563aa|down_6|NZ_CP012673.1_4482643_4484332_+	PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein; Provisional	NA|817aa|down_7|NZ_CP012673.1_4484765_4487216_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|207aa|down_8|NZ_CP012673.1_4487333_4487954_-	NA	NA|574aa|down_9|NZ_CP012673.1_4489669_4491391_+	pfam07631, PSD4, Protein of unknown function (DUF1592)
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	12	4789579-4789676	10	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGCGCGCGCGGCCCCGGCGCTGC	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA|300aa|down_2|NZ_CP012673.1_4793216_4794116_-,NA|76aa|down_3|NZ_CP012673.1_4794255_4794483_+,NA|232aa|down_6|NZ_CP012673.1_4799321_4800017_-	NA|1055aa|up_9|NZ_CP012673.1_4776628_4779793_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|363aa|up_8|NZ_CP012673.1_4779800_4780889_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|225aa|up_7|NZ_CP012673.1_4781057_4781732_+	pfam16859, TetR_C_11, Bacterial transcriptional repressor C-terminal	NA|401aa|up_6|NZ_CP012673.1_4781735_4782938_-	COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]	NA|281aa|up_5|NZ_CP012673.1_4783045_4783888_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|411aa|up_4|NZ_CP012673.1_4784091_4785324_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|351aa|up_3|NZ_CP012673.1_4785546_4786599_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|294aa|up_2|NZ_CP012673.1_4786595_4787477_+	pfam14491, DUF4435, Protein of unknown function (DUF4435)	NA|173aa|up_1|NZ_CP012673.1_4787534_4788053_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|202aa|up_0|NZ_CP012673.1_4788059_4788665_-	TIGR03027, pepcterm_export, putative polysaccharide export protein, PEP-CTERM sytem-associated	NA|412aa|down_0|NZ_CP012673.1_4790845_4792081_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|329aa|down_1|NZ_CP012673.1_4792153_4793140_-	PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier protein	NA|300aa|down_2|NZ_CP012673.1_4793216_4794116_-	NA	NA|76aa|down_3|NZ_CP012673.1_4794255_4794483_+	NA	NA|877aa|down_4|NZ_CP012673.1_4794587_4797218_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|496aa|down_5|NZ_CP012673.1_4797640_4799128_+	PRK06958, PRK06958, single-stranded DNA-binding protein; Provisional	NA|232aa|down_6|NZ_CP012673.1_4799321_4800017_-	NA	NA|462aa|down_7|NZ_CP012673.1_4800336_4801722_-	cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of Synechococcus sp	NA|264aa|down_8|NZ_CP012673.1_4801780_4802572_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|359aa|down_9|NZ_CP012673.1_4802632_4803709_-	pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	13	5622297-5622428	11	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCCGCCGGTCGTGGCGCTCGTCGTGCTGCT	30	2	4	5622327-5622347|5622327-5622347|5622327-5622347|5622378-5622398	NZ_CP012673.1_7300405-7300425|NZ_CP012673.1_8039825-8039845|NZ_CP012673.1_10386199-10386179|NZ_CP012673.1_6702962-6702942	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|178aa|up_4|NZ_CP012673.1_5616913_5617447_-,NA|273aa|up_2|NZ_CP012673.1_5619266_5620085_+,NA|378aa|down_2|NZ_CP012673.1_5624969_5626103_-,NA|121aa|down_4|NZ_CP012673.1_5628568_5628931_-	NA|108aa|up_9|NZ_CP012673.1_5610372_5610696_+	cd02238, cupin_KdgF, pectin degradation protein KdgF and related proteins, cupin domain	NA|556aa|up_8|NZ_CP012673.1_5610823_5612491_+	pfam09492, Pec_lyase, Pectic acid lyase	NA|500aa|up_7|NZ_CP012673.1_5612556_5614056_+	pfam13229, Beta_helix, Right handed beta helix region	NA|402aa|up_6|NZ_CP012673.1_5614154_5615360_+	COG3866, PelB, Pectate lyase [Carbohydrate transport and metabolism]	NA|388aa|up_5|NZ_CP012673.1_5615493_5616657_+	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|178aa|up_4|NZ_CP012673.1_5616913_5617447_-	NA	NA|548aa|up_3|NZ_CP012673.1_5617594_5619238_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|273aa|up_2|NZ_CP012673.1_5619266_5620085_+	NA	NA|443aa|up_1|NZ_CP012673.1_5620125_5621454_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|248aa|up_0|NZ_CP012673.1_5621510_5622254_-	pfam03211, Pectate_lyase, Pectate lyase	NA|220aa|down_0|NZ_CP012673.1_5622974_5623634_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|363aa|down_1|NZ_CP012673.1_5623784_5624873_-	smart00327, VWA, von Willebrand factor (vWF) type A domain	NA|378aa|down_2|NZ_CP012673.1_5624969_5626103_-	NA	NA|409aa|down_3|NZ_CP012673.1_5626588_5627815_-	COG3866, PelB, Pectate lyase [Carbohydrate transport and metabolism]	NA|121aa|down_4|NZ_CP012673.1_5628568_5628931_-	NA	NA|271aa|down_5|NZ_CP012673.1_5628920_5629733_+	COG3866, PelB, Pectate lyase [Carbohydrate transport and metabolism]	NA|406aa|down_6|NZ_CP012673.1_5629830_5631048_+	COG3866, PelB, Pectate lyase [Carbohydrate transport and metabolism]	NA|330aa|down_7|NZ_CP012673.1_5631176_5632166_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|200aa|down_8|NZ_CP012673.1_5632312_5632912_+	COG4339, COG4339, Uncharacterized protein conserved in bacteria [Function unknown]	NA|440aa|down_9|NZ_CP012673.1_5632992_5634312_+	pfam08757, CotH, CotH kinase protein
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	14	5698361-5698449	12	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CTTGCAGCCGCCCTTGCCCTTGCACTCGTTCT	32	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA|332aa|down_0|NZ_CP012673.1_5699018_5700014_+	NA|282aa|up_9|NZ_CP012673.1_5688684_5689530_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|266aa|up_8|NZ_CP012673.1_5689534_5690332_-	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein	NA|440aa|up_7|NZ_CP012673.1_5690390_5691710_-	pfam13379, NMT1_2, NMT1-like family	NA|152aa|up_6|NZ_CP012673.1_5692086_5692542_-	PRK00601, dut, dUTP diphosphatase	NA|462aa|up_5|NZ_CP012673.1_5692560_5693946_-	cd06161, S2P-M50_SpoIVFB, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|286aa|up_4|NZ_CP012673.1_5694109_5694967_+	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|252aa|up_3|NZ_CP012673.1_5695110_5695866_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|144aa|up_2|NZ_CP012673.1_5696001_5696433_-	COG0432, COG0432, Uncharacterized conserved protein [Function unknown]	NA|284aa|up_1|NZ_CP012673.1_5696429_5697281_-	pfam09836, DUF2063, Putative DNA-binding domain	NA|280aa|up_0|NZ_CP012673.1_5697366_5698206_-	pfam05114, DUF692, Protein of unknown function (DUF692)	NA|332aa|down_0|NZ_CP012673.1_5699018_5700014_+	NA	NA|329aa|down_1|NZ_CP012673.1_5700090_5701077_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|159aa|down_2|NZ_CP012673.1_5701171_5701648_+	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|857aa|down_3|NZ_CP012673.1_5701678_5704249_-	TIGR00921, Predicted_exporter_of_the_RND_superfamily, The (Largely Archaeal Putative) Hydrophobe/Amphiphile Efflux-3 (HAE3) Family	NA|221aa|down_4|NZ_CP012673.1_5704408_5705071_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|302aa|down_5|NZ_CP012673.1_5705511_5706417_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|254aa|down_6|NZ_CP012673.1_5706635_5707397_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|142aa|down_7|NZ_CP012673.1_5707430_5707856_-	pfam01878, EVE, EVE domain	NA|367aa|down_8|NZ_CP012673.1_5708332_5709433_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|343aa|down_9|NZ_CP012673.1_5709604_5710633_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	15	5710424-5710578	13	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGAAGCCCGCCGCCGCGGCCACGAAGCCCGCGGA	35	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|107aa|up_9|NZ_CP012673.1_5698324_5698645_-,NA|332aa|up_8|NZ_CP012673.1_5699018_5700014_+,NA|1476aa|down_0|NZ_CP012673.1_5710619_5715047_-,NA|110aa|down_2|NZ_CP012673.1_5719095_5719425_-	NA|107aa|up_9|NZ_CP012673.1_5698324_5698645_-	NA	NA|332aa|up_8|NZ_CP012673.1_5699018_5700014_+	NA	NA|329aa|up_7|NZ_CP012673.1_5700090_5701077_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|159aa|up_6|NZ_CP012673.1_5701171_5701648_+	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|857aa|up_5|NZ_CP012673.1_5701678_5704249_-	TIGR00921, Predicted_exporter_of_the_RND_superfamily, The (Largely Archaeal Putative) Hydrophobe/Amphiphile Efflux-3 (HAE3) Family	NA|221aa|up_4|NZ_CP012673.1_5704408_5705071_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|302aa|up_3|NZ_CP012673.1_5705511_5706417_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|254aa|up_2|NZ_CP012673.1_5706635_5707397_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|142aa|up_1|NZ_CP012673.1_5707430_5707856_-	pfam01878, EVE, EVE domain	NA|367aa|up_0|NZ_CP012673.1_5708332_5709433_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|1476aa|down_0|NZ_CP012673.1_5710619_5715047_-	NA	NA|339aa|down_1|NZ_CP012673.1_5718027_5719044_+	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|110aa|down_2|NZ_CP012673.1_5719095_5719425_-	NA	NA|801aa|down_3|NZ_CP012673.1_5719703_5722106_+	cd08504, PBP2_OppA, The substrate-binding component of an ABC-type oligopetide import system contains the type 2 periplasmic binding fold	NA|370aa|down_4|NZ_CP012673.1_5722146_5723256_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|798aa|down_5|NZ_CP012673.1_5723522_5725916_+	pfam01055, Glyco_hydro_31, Glycosyl hydrolases family 31	NA|421aa|down_6|NZ_CP012673.1_5726171_5727434_+	TIGR03805, beta_helix_1, parallel beta-helix repeat-containing protein	NA|396aa|down_7|NZ_CP012673.1_5727408_5728596_+	TIGR03806, chp_HNE_0200, conserved hypothetical protein, HNE_0200 family	NA|390aa|down_8|NZ_CP012673.1_5728603_5729773_-	cd08283, FDH_like_1, Glutathione-dependent formaldehyde dehydrogenase related proteins, child 1	NA|266aa|down_9|NZ_CP012673.1_5729780_5730578_-	cd07817, SRPBCC_8, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	16	5883920-5884039	14	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CCCGAGCAGGTGCTGTCGGCCTACGCCCCCGAGCAGCGG	39	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|74aa|up_1|NZ_CP012673.1_5874769_5874991_+,NA|283aa|down_0|NZ_CP012673.1_5884373_5885222_-,NA|198aa|down_7|NZ_CP012673.1_5892432_5893026_-,NA|135aa|down_8|NZ_CP012673.1_5893046_5893451_-	NA|543aa|up_9|NZ_CP012673.1_5865972_5867601_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|164aa|up_8|NZ_CP012673.1_5867685_5868177_+	pfam14539, DUF4442, Domain of unknown function (DUF4442)	NA|295aa|up_7|NZ_CP012673.1_5868189_5869074_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|250aa|up_6|NZ_CP012673.1_5869267_5870017_-	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|105aa|up_5|NZ_CP012673.1_5870161_5870476_+	PRK09906, PRK09906, DNA-binding transcriptional regulator HcaR; Provisional	NA|615aa|up_4|NZ_CP012673.1_5870505_5872350_+	pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase	NA|332aa|up_3|NZ_CP012673.1_5872358_5873354_-	PRK09599, PRK09599, NADP-dependent phosphogluconate dehydrogenase	NA|343aa|up_2|NZ_CP012673.1_5873480_5874509_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|74aa|up_1|NZ_CP012673.1_5874769_5874991_+	NA	NA|1704aa|up_0|NZ_CP012673.1_5875303_5880415_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|283aa|down_0|NZ_CP012673.1_5884373_5885222_-	NA	NA|453aa|down_1|NZ_CP012673.1_5885602_5886961_+	sd00002, TSP3, Calcium-binding Thrombospondin type 3 (TSP3) repeat	NA|345aa|down_2|NZ_CP012673.1_5886945_5887980_-	COG1079, COG1079, Uncharacterized ABC-type transport system, permease component [General function prediction only]	NA|348aa|down_3|NZ_CP012673.1_5887979_5889023_-	COG4603, COG4603, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|508aa|down_4|NZ_CP012673.1_5889019_5890543_-	COG3845, COG3845, ABC-type uncharacterized transport systems, ATPase components [General function prediction only]	NA|433aa|down_5|NZ_CP012673.1_5890563_5891862_-	cd19963, PBP1_BMP-like, periplasmic binding component of a basic membrane lipoprotein (BMP) from Brucella abortus and its close homologs in other bacteria	NA|80aa|down_6|NZ_CP012673.1_5892046_5892286_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|198aa|down_7|NZ_CP012673.1_5892432_5893026_-	NA	NA|135aa|down_8|NZ_CP012673.1_5893046_5893451_-	NA	NA|530aa|down_9|NZ_CP012673.1_5893864_5895454_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	17	5905014-5905113	15	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GATCGCGCCCGGCGCGAGCCTGCG	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|198aa|up_9|NZ_CP012673.1_5892432_5893026_-,NA|135aa|up_8|NZ_CP012673.1_5893046_5893451_-,NA|294aa|up_4|NZ_CP012673.1_5898313_5899195_+,NA|158aa|up_3|NZ_CP012673.1_5899358_5899832_-,NA|89aa|up_1|NZ_CP012673.1_5900739_5901006_+,NA|262aa|down_7|NZ_CP012673.1_5919533_5920319_+,NA|161aa|down_8|NZ_CP012673.1_5920363_5920846_+	NA|198aa|up_9|NZ_CP012673.1_5892432_5893026_-	NA	NA|135aa|up_8|NZ_CP012673.1_5893046_5893451_-	NA	NA|530aa|up_7|NZ_CP012673.1_5893864_5895454_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|421aa|up_6|NZ_CP012673.1_5895581_5896844_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|322aa|up_5|NZ_CP012673.1_5896878_5897844_-	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|294aa|up_4|NZ_CP012673.1_5898313_5899195_+	NA	NA|158aa|up_3|NZ_CP012673.1_5899358_5899832_-	NA	NA|199aa|up_2|NZ_CP012673.1_5900005_5900602_-	pfam08308, PEGA, PEGA domain	NA|89aa|up_1|NZ_CP012673.1_5900739_5901006_+	NA	NA|599aa|up_0|NZ_CP012673.1_5901366_5903163_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|1182aa|down_0|NZ_CP012673.1_5905198_5908744_-	sd00038, Kelch, Kelch repeat	NA|1106aa|down_1|NZ_CP012673.1_5908987_5912305_+	sd00002, TSP3, Calcium-binding Thrombospondin type 3 (TSP3) repeat	NA|599aa|down_2|NZ_CP012673.1_5912426_5914223_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|512aa|down_3|NZ_CP012673.1_5914454_5915990_+	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|258aa|down_4|NZ_CP012673.1_5916152_5916926_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|355aa|down_5|NZ_CP012673.1_5916957_5918022_-	cd14656, Imelysin-like_EfeO, EfeO is a component of the EfeUOB operon	NA|414aa|down_6|NZ_CP012673.1_5918051_5919293_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|262aa|down_7|NZ_CP012673.1_5919533_5920319_+	NA	NA|161aa|down_8|NZ_CP012673.1_5920363_5920846_+	NA	NA|620aa|down_9|NZ_CP012673.1_5920887_5922747_-	cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to fadE5
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	18	5909451-5910164	16,1	CRISPRCasFinder,PILER-CR	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	TCGGCGACGCCTGCGACAACTGTCCGG,CGGCGACGCCTGCGACAACTGC	27,22	0	0	NA	NA	NA:NA	10,4	10	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|294aa|up_6|NZ_CP012673.1_5898313_5899195_+,NA|158aa|up_5|NZ_CP012673.1_5899358_5899832_-,NA|89aa|up_3|NZ_CP012673.1_5900739_5901006_+,NA|262aa|down_5|NZ_CP012673.1_5919533_5920319_+,NA|161aa|down_6|NZ_CP012673.1_5920363_5920846_+	NA|530aa|up_9|NZ_CP012673.1_5893864_5895454_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|421aa|up_8|NZ_CP012673.1_5895581_5896844_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|322aa|up_7|NZ_CP012673.1_5896878_5897844_-	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|294aa|up_6|NZ_CP012673.1_5898313_5899195_+	NA	NA|158aa|up_5|NZ_CP012673.1_5899358_5899832_-	NA	NA|199aa|up_4|NZ_CP012673.1_5900005_5900602_-	pfam08308, PEGA, PEGA domain	NA|89aa|up_3|NZ_CP012673.1_5900739_5901006_+	NA	NA|599aa|up_2|NZ_CP012673.1_5901366_5903163_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|619aa|up_1|NZ_CP012673.1_5903193_5905050_+	pfam08308, PEGA, PEGA domain	NA|1182aa|up_0|NZ_CP012673.1_5905198_5908744_-	sd00038, Kelch, Kelch repeat	NA|599aa|down_0|NZ_CP012673.1_5912426_5914223_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|512aa|down_1|NZ_CP012673.1_5914454_5915990_+	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|258aa|down_2|NZ_CP012673.1_5916152_5916926_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|355aa|down_3|NZ_CP012673.1_5916957_5918022_-	cd14656, Imelysin-like_EfeO, EfeO is a component of the EfeUOB operon	NA|414aa|down_4|NZ_CP012673.1_5918051_5919293_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|262aa|down_5|NZ_CP012673.1_5919533_5920319_+	NA	NA|161aa|down_6|NZ_CP012673.1_5920363_5920846_+	NA	NA|620aa|down_7|NZ_CP012673.1_5920887_5922747_-	cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to fadE5	NA|158aa|down_8|NZ_CP012673.1_5923064_5923538_+	PRK01885, greB, transcription elongation factor GreB; Reviewed	NA|132aa|down_9|NZ_CP012673.1_5923579_5923975_+	TIGR00068, Lactoylglutathione_lyase, lactoylglutathione lyase
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	19	6003244-6003477	17	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGGATCTGACGGGGGCGAAGCTC	24	0	0	NA	NA	NA	4	4	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|181aa|up_5|NZ_CP012673.1_5989021_5989564_-,NA|345aa|up_3|NZ_CP012673.1_5991306_5992341_-,NA|191aa|down_2|NZ_CP012673.1_6008939_6009512_-,NA|277aa|down_7|NZ_CP012673.1_6016237_6017068_+,NA|336aa|down_9|NZ_CP012673.1_6019138_6020146_-	NA|140aa|up_9|NZ_CP012673.1_5983298_5983718_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|450aa|up_8|NZ_CP012673.1_5983717_5985067_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|230aa|up_7|NZ_CP012673.1_5985542_5986232_-	sd00006, TPR, Tetratricopeptide repeat	NA|839aa|up_6|NZ_CP012673.1_5986474_5988991_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|181aa|up_5|NZ_CP012673.1_5989021_5989564_-	NA	NA|548aa|up_4|NZ_CP012673.1_5989644_5991288_+	pfam13868, TPH, Trichohyalin-plectin-homology domain	NA|345aa|up_3|NZ_CP012673.1_5991306_5992341_-	NA	NA|541aa|up_2|NZ_CP012673.1_5994224_5995847_+	cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C dehydratase	NA|515aa|up_1|NZ_CP012673.1_5995881_5997426_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|693aa|up_0|NZ_CP012673.1_5997483_5999562_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|196aa|down_0|NZ_CP012673.1_6006044_6006632_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|330aa|down_1|NZ_CP012673.1_6006676_6007666_-	PRK15442, PRK15442, beta-lactamase TEM; Provisional	NA|191aa|down_2|NZ_CP012673.1_6008939_6009512_-	NA	NA|179aa|down_3|NZ_CP012673.1_6009835_6010372_+	pfam06821, Ser_hydrolase, Serine hydrolase	NA|265aa|down_4|NZ_CP012673.1_6010415_6011210_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|154aa|down_5|NZ_CP012673.1_6011234_6011696_+	COG4270, COG4270, Predicted membrane protein [Function unknown]	NA|1227aa|down_6|NZ_CP012673.1_6011858_6015539_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|277aa|down_7|NZ_CP012673.1_6016237_6017068_+	NA	NA|584aa|down_8|NZ_CP012673.1_6017288_6019040_-	pfam08737, Rgp1, Rgp1	NA|336aa|down_9|NZ_CP012673.1_6019138_6020146_-	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	20	6007924-6007998	18	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGCCGCCGGTCGATGAGGCCGACG	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|181aa|up_8|NZ_CP012673.1_5989021_5989564_-,NA|345aa|up_6|NZ_CP012673.1_5991306_5992341_-,NA|191aa|down_0|NZ_CP012673.1_6008939_6009512_-,NA|277aa|down_5|NZ_CP012673.1_6016237_6017068_+,NA|336aa|down_7|NZ_CP012673.1_6019138_6020146_-	NA|839aa|up_9|NZ_CP012673.1_5986474_5988991_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|181aa|up_8|NZ_CP012673.1_5989021_5989564_-	NA	NA|548aa|up_7|NZ_CP012673.1_5989644_5991288_+	pfam13868, TPH, Trichohyalin-plectin-homology domain	NA|345aa|up_6|NZ_CP012673.1_5991306_5992341_-	NA	NA|541aa|up_5|NZ_CP012673.1_5994224_5995847_+	cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C dehydratase	NA|515aa|up_4|NZ_CP012673.1_5995881_5997426_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|693aa|up_3|NZ_CP012673.1_5997483_5999562_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|1885aa|up_2|NZ_CP012673.1_6000363_6006018_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|196aa|up_1|NZ_CP012673.1_6006044_6006632_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|330aa|up_0|NZ_CP012673.1_6006676_6007666_-	PRK15442, PRK15442, beta-lactamase TEM; Provisional	NA|191aa|down_0|NZ_CP012673.1_6008939_6009512_-	NA	NA|179aa|down_1|NZ_CP012673.1_6009835_6010372_+	pfam06821, Ser_hydrolase, Serine hydrolase	NA|265aa|down_2|NZ_CP012673.1_6010415_6011210_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|154aa|down_3|NZ_CP012673.1_6011234_6011696_+	COG4270, COG4270, Predicted membrane protein [Function unknown]	NA|1227aa|down_4|NZ_CP012673.1_6011858_6015539_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|277aa|down_5|NZ_CP012673.1_6016237_6017068_+	NA	NA|584aa|down_6|NZ_CP012673.1_6017288_6019040_-	pfam08737, Rgp1, Rgp1	NA|336aa|down_7|NZ_CP012673.1_6019138_6020146_-	NA	NA|1299aa|down_8|NZ_CP012673.1_6020267_6024164_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|526aa|down_9|NZ_CP012673.1_6024368_6025946_-	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	21	6123947-6124029	19	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CAGCGGCGGTGATGCCGGCGGCG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|277aa|up_8|NZ_CP012673.1_6111502_6112333_-,NA|811aa|down_3|NZ_CP012673.1_6127117_6129550_-,NA|388aa|down_4|NZ_CP012673.1_6130235_6131399_+,NA|237aa|down_9|NZ_CP012673.1_6138096_6138807_-	NA|708aa|up_9|NZ_CP012673.1_6109382_6111506_-	pfam14258, DUF4350, Domain of unknown function (DUF4350)	NA|277aa|up_8|NZ_CP012673.1_6111502_6112333_-	NA	NA|329aa|up_7|NZ_CP012673.1_6112343_6113330_-	pfam01944, SpoIIM, Stage II sporulation protein M	NA|260aa|up_6|NZ_CP012673.1_6113329_6114109_-	pfam06271, RDD, RDD family	NA|241aa|up_5|NZ_CP012673.1_6114409_6115132_+	pfam06161, DUF975, Protein of unknown function (DUF975)	NA|370aa|up_4|NZ_CP012673.1_6116668_6117778_+	pfam01904, DUF72, Protein of unknown function DUF72	NA|522aa|up_3|NZ_CP012673.1_6118056_6119622_+	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|155aa|up_2|NZ_CP012673.1_6119775_6120240_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|297aa|up_1|NZ_CP012673.1_6120476_6121367_-	COG1208, GCD1, Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis]	NA|543aa|up_0|NZ_CP012673.1_6121431_6123060_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|579aa|down_0|NZ_CP012673.1_6124354_6126091_-	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|229aa|down_1|NZ_CP012673.1_6126087_6126774_-	pfam01025, GrpE, GrpE	NA|116aa|down_2|NZ_CP012673.1_6126773_6127121_-	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|811aa|down_3|NZ_CP012673.1_6127117_6129550_-	NA	NA|388aa|down_4|NZ_CP012673.1_6130235_6131399_+	NA	NA|879aa|down_5|NZ_CP012673.1_6131664_6134301_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|223aa|down_6|NZ_CP012673.1_6134394_6135063_-	pfam04752, ChaC, ChaC-like protein	NA|399aa|down_7|NZ_CP012673.1_6135259_6136456_-	COG3268, COG3268, Uncharacterized conserved protein [Function unknown]	NA|487aa|down_8|NZ_CP012673.1_6136491_6137952_-	pfam01425, Amidase, Amidase	NA|237aa|down_9|NZ_CP012673.1_6138096_6138807_-	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	22	6377689-6377767	20	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	ATCGAGGCGACCGTCGACCAGGGCGCCG	28	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|259aa|up_4|NZ_CP012673.1_6369591_6370368_+,NA|99aa|down_2|NZ_CP012673.1_6384917_6385214_+,NA|78aa|down_4|NZ_CP012673.1_6386407_6386641_-	NA|798aa|up_9|NZ_CP012673.1_6361768_6364162_+	TIGR03817, DECH_helic, helicase/secretion neighborhood putative DEAH-box helicase	NA|273aa|up_8|NZ_CP012673.1_6364238_6365057_-	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|556aa|up_7|NZ_CP012673.1_6365387_6367055_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|228aa|up_6|NZ_CP012673.1_6367291_6367975_+	COG4798, COG4798, Predicted methyltransferase [General function prediction only]	NA|382aa|up_5|NZ_CP012673.1_6368042_6369188_-	pfam07603, DUF1566, Protein of unknown function (DUF1566)	NA|259aa|up_4|NZ_CP012673.1_6369591_6370368_+	NA	NA|513aa|up_3|NZ_CP012673.1_6370443_6371982_-	cd04279, ZnMc_MMP_like_1, Zinc-dependent metalloprotease; MMP_like sub-family 1	NA|563aa|up_2|NZ_CP012673.1_6372474_6374163_-	cd03680, MM_CoA_mutase_ICM_like, Coenzyme B12-dependent-methylmalonyl coenzyme A (CoA) mutase (MCM) family, isobutyryl-CoA mutase (ICM)-like subfamily; contains archaeal and bacterial proteins similar to the large subunit of Streptomyces cinnamonensis coenzyme B12-dependent ICM	NA|523aa|up_1|NZ_CP012673.1_6374332_6375901_-	pfam05157, T2SSE_N, Type II secretion system (T2SS), protein E, N-terminal domain	NA|100aa|up_0|NZ_CP012673.1_6375947_6376247_-	pfam04977, DivIC, Septum formation initiator	NA|651aa|down_0|NZ_CP012673.1_6382066_6384019_+	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|264aa|down_1|NZ_CP012673.1_6383987_6384779_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|99aa|down_2|NZ_CP012673.1_6384917_6385214_+	NA	NA|354aa|down_3|NZ_CP012673.1_6385284_6386346_-	COG1703, ArgK, Putative periplasmic protein kinase ArgK and related GTPases of G3E family [Amino acid transport and metabolism]	NA|78aa|down_4|NZ_CP012673.1_6386407_6386641_-	NA	NA|266aa|down_5|NZ_CP012673.1_6386690_6387488_-	PRK01305, PRK01305, arginyl-tRNA-protein transferase; Provisional	NA|102aa|down_6|NZ_CP012673.1_6387845_6388151_+	pfam01985, CRS1_YhbY, CRS1 / YhbY (CRM) domain	NA|1266aa|down_7|NZ_CP012673.1_6388217_6392015_-	PRK06567, PRK06567, putative bifunctional glutamate synthase subunit beta/2-polyprenylphenol hydroxylase; Validated	NA|449aa|down_8|NZ_CP012673.1_6392295_6393642_-	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|167aa|down_9|NZ_CP012673.1_6393664_6394165_-	PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier protein
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	23	6653344-6653415	21	CRISPRCasFinder	no	csa3	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Type I-A	CGGCAGCGCCGGCGCGGCGCCGG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|198aa|up_9|NZ_CP012673.1_6641047_6641641_-,NA|560aa|down_1|NZ_CP012673.1_6657007_6658687_-,NA|124aa|down_3|NZ_CP012673.1_6661703_6662075_-,NA|355aa|down_4|NZ_CP012673.1_6662064_6663129_-	NA|198aa|up_9|NZ_CP012673.1_6641047_6641641_-	NA	NA|712aa|up_8|NZ_CP012673.1_6641653_6643789_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|155aa|up_7|NZ_CP012673.1_6643785_6644250_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|557aa|up_6|NZ_CP012673.1_6644503_6646174_+	cd04182, GT_2_like_f, GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function	NA|155aa|up_5|NZ_CP012673.1_6646464_6646929_+	COG3837, COG3837, Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]	NA|289aa|up_4|NZ_CP012673.1_6647008_6647875_-	cd05231, NmrA_TMR_like_1_SDR_a, NmrA (a transcriptional regulator) and triphenylmethane reductase (TMR) like proteins, subgroup 1, atypical (a) SDRs	NA|313aa|up_3|NZ_CP012673.1_6648012_6648951_+	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|194aa|up_2|NZ_CP012673.1_6649116_6649698_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|412aa|up_1|NZ_CP012673.1_6649713_6650949_-	TIGR02270, hypothetical_protein_GSU3180, conserved hypothetical protein	NA|727aa|up_0|NZ_CP012673.1_6650980_6653161_-	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|708aa|down_0|NZ_CP012673.1_6654887_6657011_-	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|560aa|down_1|NZ_CP012673.1_6657007_6658687_-	NA	NA|1004aa|down_2|NZ_CP012673.1_6658676_6661688_-	PTZ00234, PTZ00234, variable surface protein Vir12; Provisional	NA|124aa|down_3|NZ_CP012673.1_6661703_6662075_-	NA	NA|355aa|down_4|NZ_CP012673.1_6662064_6663129_-	NA	NA|781aa|down_5|NZ_CP012673.1_6664198_6666541_+	cd07302, CHD, cyclase homology domain	NA|704aa|down_6|NZ_CP012673.1_6666540_6668652_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|142aa|down_7|NZ_CP012673.1_6668696_6669122_-	cd16345, LMWP_ArsC, Arsenate reductase of the LMWP family	NA|239aa|down_8|NZ_CP012673.1_6669118_6669835_-	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|160aa|down_9|NZ_CP012673.1_6669821_6670301_-	cd07254, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	24	6653530-6653666	22	CRISPRCasFinder	no	csa3	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Type I-A	CGGCAGCGCCGGCGCGGCGCCGG	23	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|198aa|up_9|NZ_CP012673.1_6641047_6641641_-,NA|560aa|down_1|NZ_CP012673.1_6657007_6658687_-,NA|124aa|down_3|NZ_CP012673.1_6661703_6662075_-,NA|355aa|down_4|NZ_CP012673.1_6662064_6663129_-	NA|198aa|up_9|NZ_CP012673.1_6641047_6641641_-	NA	NA|712aa|up_8|NZ_CP012673.1_6641653_6643789_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|155aa|up_7|NZ_CP012673.1_6643785_6644250_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|557aa|up_6|NZ_CP012673.1_6644503_6646174_+	cd04182, GT_2_like_f, GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function	NA|155aa|up_5|NZ_CP012673.1_6646464_6646929_+	COG3837, COG3837, Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]	NA|289aa|up_4|NZ_CP012673.1_6647008_6647875_-	cd05231, NmrA_TMR_like_1_SDR_a, NmrA (a transcriptional regulator) and triphenylmethane reductase (TMR) like proteins, subgroup 1, atypical (a) SDRs	NA|313aa|up_3|NZ_CP012673.1_6648012_6648951_+	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|194aa|up_2|NZ_CP012673.1_6649116_6649698_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|412aa|up_1|NZ_CP012673.1_6649713_6650949_-	TIGR02270, hypothetical_protein_GSU3180, conserved hypothetical protein	NA|727aa|up_0|NZ_CP012673.1_6650980_6653161_-	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|708aa|down_0|NZ_CP012673.1_6654887_6657011_-	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|560aa|down_1|NZ_CP012673.1_6657007_6658687_-	NA	NA|1004aa|down_2|NZ_CP012673.1_6658676_6661688_-	PTZ00234, PTZ00234, variable surface protein Vir12; Provisional	NA|124aa|down_3|NZ_CP012673.1_6661703_6662075_-	NA	NA|355aa|down_4|NZ_CP012673.1_6662064_6663129_-	NA	NA|781aa|down_5|NZ_CP012673.1_6664198_6666541_+	cd07302, CHD, cyclase homology domain	NA|704aa|down_6|NZ_CP012673.1_6666540_6668652_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|142aa|down_7|NZ_CP012673.1_6668696_6669122_-	cd16345, LMWP_ArsC, Arsenate reductase of the LMWP family	NA|239aa|down_8|NZ_CP012673.1_6669118_6669835_-	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|160aa|down_9|NZ_CP012673.1_6669821_6670301_-	cd07254, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	25	6925157-6925259	23	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGGCGCCGGCCGCGCGGGCGGCTCT	25	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|113aa|up_9|NZ_CP012673.1_6853674_6854013_-,NA|86aa|up_8|NZ_CP012673.1_6854271_6854529_+,NA|121aa|up_6|NZ_CP012673.1_6857994_6858357_+,NA|119aa|up_5|NZ_CP012673.1_6858650_6859007_+,NA	NA|113aa|up_9|NZ_CP012673.1_6853674_6854013_-	NA	NA|86aa|up_8|NZ_CP012673.1_6854271_6854529_+	NA	NA|970aa|up_7|NZ_CP012673.1_6854882_6857792_+	TIGR00128, Malonyl_CoA-acyl_carrier_protein_transacylase, malonyl CoA-acyl carrier protein transacylase	NA|121aa|up_6|NZ_CP012673.1_6857994_6858357_+	NA	NA|119aa|up_5|NZ_CP012673.1_6858650_6859007_+	NA	NA|479aa|up_4|NZ_CP012673.1_6859055_6860492_-	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|1001aa|up_3|NZ_CP012673.1_6860492_6863495_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|4514aa|up_2|NZ_CP012673.1_6863494_6877036_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|6851aa|up_1|NZ_CP012673.1_6877051_6897604_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|6132aa|up_0|NZ_CP012673.1_6897615_6916011_-	cd00833, PKS, polyketide synthases (PKSs) polymerize simple fatty acids into a large variety of different products, called polyketides, by successive decarboxylating Claisen condensations	NA|402aa|down_0|NZ_CP012673.1_6931606_6932812_+	PRK09578, PRK09578, MexX/AxyX family multidrug efflux RND transporter periplasmic adaptor subunit	NA|1053aa|down_1|NZ_CP012673.1_6932838_6935997_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|66aa|down_2|NZ_CP012673.1_6936055_6936253_-	COG1141, Fer, Ferredoxin [Energy production and conversion]	NA|401aa|down_3|NZ_CP012673.1_6936324_6937527_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|417aa|down_4|NZ_CP012673.1_6937727_6938978_-	COG0119, LeuA, Isopropylmalate/homocitrate/citramalate synthases [Amino acid transport and metabolism]	NA|361aa|down_5|NZ_CP012673.1_6939585_6940668_-	TIGR01686, modular_polyketide_synthase, FkbH-like domain	NA|393aa|down_6|NZ_CP012673.1_6940664_6941843_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|84aa|down_7|NZ_CP012673.1_6941876_6942128_-	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|294aa|down_8|NZ_CP012673.1_6942145_6943027_-	PRK05808, PRK05808, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|868aa|down_9|NZ_CP012673.1_6943026_6945630_-	smart00827, PKS_AT, Acyl transferase domain in polyketide synthase (PKS) enzymes
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	26	7244087-7244200	24	CRISPRCasFinder	no	DEDDh	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Unclear	CCGCGCACGCCGCACCGCCCGCGCACGCC	29	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|106aa|up_6|NZ_CP012673.1_7236755_7237073_-,NA|225aa|up_2|NZ_CP012673.1_7241260_7241935_+,NA	NA|1284aa|up_9|NZ_CP012673.1_7231555_7235407_-	PRK06039, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|197aa|up_8|NZ_CP012673.1_7235491_7236082_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|206aa|up_7|NZ_CP012673.1_7236124_7236742_-	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	NA|106aa|up_6|NZ_CP012673.1_7236755_7237073_-	NA	NA|420aa|up_5|NZ_CP012673.1_7237190_7238450_-	cd03798, GT4_WlbH-like, Bordetella parapertussis WlbH and similar proteins	NA|423aa|up_4|NZ_CP012673.1_7238641_7239910_-	cd03807, GT4_WbnK-like, Shigella dysenteriae WbnK and similar proteins	NA|261aa|up_3|NZ_CP012673.1_7240275_7241058_-	PRK00208, thiG, thiazole synthase; Reviewed	NA|225aa|up_2|NZ_CP012673.1_7241260_7241935_+	NA	NA|292aa|up_1|NZ_CP012673.1_7241999_7242875_+	cd04278, ZnMc_MMP, Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family	NA|404aa|up_0|NZ_CP012673.1_7242871_7244083_+	cd06142, RNaseD_exo, DEDDy 3'-5' exonuclease domain of Ribonuclease D and similar proteins	NA|565aa|down_0|NZ_CP012673.1_7244315_7246010_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|346aa|down_1|NZ_CP012673.1_7246057_7247095_-	PLN02389, PLN02389, biotin synthase	NA|351aa|down_2|NZ_CP012673.1_7247177_7248230_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|462aa|down_3|NZ_CP012673.1_7249045_7250431_-	pfam01964, ThiC_Rad_SAM, Radical SAM ThiC family	NA|151aa|down_4|NZ_CP012673.1_7250571_7251024_-	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|139aa|down_5|NZ_CP012673.1_7251029_7251446_-	TIGR02801, Protein_TolR, TolR protein	NA|239aa|down_6|NZ_CP012673.1_7251536_7252253_-	TIGR02796, Protein_TolQ, TolQ protein	NA|257aa|down_7|NZ_CP012673.1_7252424_7253195_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|1097aa|down_8|NZ_CP012673.1_7253407_7256698_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|328aa|down_9|NZ_CP012673.1_7256729_7257713_+	PRK06958, PRK06958, single-stranded DNA-binding protein; Provisional
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	27	7277931-7278185	25	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	ACCTGTCGGGCGCGGACCTCCGCGGCG	27	0	0	NA	NA	NA	3	3	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|358aa|up_8|NZ_CP012673.1_7270376_7271450_-,NA|102aa|up_6|NZ_CP012673.1_7272663_7272969_-,NA|436aa|up_4|NZ_CP012673.1_7274108_7275416_-,NA|268aa|up_3|NZ_CP012673.1_7275632_7276436_+,NA|82aa|up_2|NZ_CP012673.1_7276435_7276681_+,NA|111aa|up_1|NZ_CP012673.1_7276677_7277010_+,NA|132aa|down_1|NZ_CP012673.1_7278929_7279325_+,NA|141aa|down_3|NZ_CP012673.1_7279926_7280349_+,NA|140aa|down_5|NZ_CP012673.1_7282684_7283104_+,NA|174aa|down_6|NZ_CP012673.1_7283100_7283622_+,NA|278aa|down_7|NZ_CP012673.1_7283640_7284474_+,NA|190aa|down_9|NZ_CP012673.1_7285812_7286382_+	NA|499aa|up_9|NZ_CP012673.1_7268619_7270116_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|358aa|up_8|NZ_CP012673.1_7270376_7271450_-	NA	NA|360aa|up_7|NZ_CP012673.1_7271449_7272529_-	sd00006, TPR, Tetratricopeptide repeat	NA|102aa|up_6|NZ_CP012673.1_7272663_7272969_-	NA	NA|338aa|up_5|NZ_CP012673.1_7273029_7274043_-	pfam13828, DUF4190, Domain of unknown function (DUF4190)	NA|436aa|up_4|NZ_CP012673.1_7274108_7275416_-	NA	NA|268aa|up_3|NZ_CP012673.1_7275632_7276436_+	NA	NA|82aa|up_2|NZ_CP012673.1_7276435_7276681_+	NA	NA|111aa|up_1|NZ_CP012673.1_7276677_7277010_+	NA	NA|240aa|up_0|NZ_CP012673.1_7277006_7277726_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|203aa|down_0|NZ_CP012673.1_7278324_7278933_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|132aa|down_1|NZ_CP012673.1_7278929_7279325_+	NA	NA|203aa|down_2|NZ_CP012673.1_7279321_7279930_+	cd06571, Bac_DnaA_C, C-terminal domain of bacterial DnaA proteins	NA|141aa|down_3|NZ_CP012673.1_7279926_7280349_+	NA	NA|597aa|down_4|NZ_CP012673.1_7280470_7282261_+	cd00799, INT_Cre_C, C-terminal catalytic domain of Cre recombinase (also called integrase)	NA|140aa|down_5|NZ_CP012673.1_7282684_7283104_+	NA	NA|174aa|down_6|NZ_CP012673.1_7283100_7283622_+	NA	NA|278aa|down_7|NZ_CP012673.1_7283640_7284474_+	NA	NA|424aa|down_8|NZ_CP012673.1_7284536_7285808_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|190aa|down_9|NZ_CP012673.1_7285812_7286382_+	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	28	7419237-7419484	26	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GGGCACCTGCGGCGGCAAGTGCGACG	26	0	0	NA	NA	NA	4	4	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA|100aa|down_2|NZ_CP012673.1_7425100_7425400_+	NA|219aa|up_9|NZ_CP012673.1_7405670_7406327_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|473aa|up_8|NZ_CP012673.1_7406342_7407761_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|441aa|up_7|NZ_CP012673.1_7407802_7409125_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|731aa|up_6|NZ_CP012673.1_7409121_7411314_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|355aa|up_5|NZ_CP012673.1_7411334_7412399_+	pfam02685, Glucokinase, Glucokinase	NA|388aa|up_4|NZ_CP012673.1_7412414_7413578_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|226aa|up_3|NZ_CP012673.1_7413574_7414252_-	COG5012, COG5012, Predicted cobalamin binding protein [General function prediction only]	NA|243aa|up_2|NZ_CP012673.1_7414406_7415135_-	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|382aa|up_1|NZ_CP012673.1_7415131_7416277_-	pfam10128, OpcA_G6PD_assem, Glucose-6-phosphate dehydrogenase subunit	NA|115aa|up_0|NZ_CP012673.1_7418068_7418413_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|1295aa|down_0|NZ_CP012673.1_7420448_7424333_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|233aa|down_1|NZ_CP012673.1_7424383_7425082_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|100aa|down_2|NZ_CP012673.1_7425100_7425400_+	NA	NA|1127aa|down_3|NZ_CP012673.1_7425757_7429138_+	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|123aa|down_4|NZ_CP012673.1_7429212_7429581_+	pfam00072, Response_reg, Response regulator receiver domain	NA|330aa|down_5|NZ_CP012673.1_7429656_7430646_+	cd16433, CheB, Chemotaxis response regulator protein-glutamate methylesterase, CheB	NA|252aa|down_6|NZ_CP012673.1_7430682_7431438_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|296aa|down_7|NZ_CP012673.1_7431722_7432610_+	cd06561, AlkD_like, A new structural DNA glycosylase	NA|183aa|down_8|NZ_CP012673.1_7432681_7433230_-	COG1633, COG1633, Uncharacterized conserved protein [Function unknown]	NA|772aa|down_9|NZ_CP012673.1_7433288_7435604_-	pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal region
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	29	7989286-7989375	27	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CCGTCGCCGCTGCCGCCGCCGCC	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA|505aa|down_4|NZ_CP012673.1_7998646_8000161_+	NA|290aa|up_9|NZ_CP012673.1_7973429_7974299_+	cd05269, TMR_SDR_a, triphenylmethane reductase (TMR)-like proteins, NMRa-like, atypical (a) SDRs	NA|632aa|up_8|NZ_CP012673.1_7974716_7976612_+	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	NA|765aa|up_7|NZ_CP012673.1_7976692_7978987_+	sd00006, TPR, Tetratricopeptide repeat	NA|258aa|up_6|NZ_CP012673.1_7979024_7979798_-	cd03212, GST_C_Metaxin1_3, C-terminal, alpha helical domain of Metaxin 1, Metaxin 3, and similar proteins	NA|144aa|up_5|NZ_CP012673.1_7980034_7980466_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|560aa|up_4|NZ_CP012673.1_7980798_7982478_-	cd09003, GH43_XynD-like, Glycosyl hydrolase family 43 protein such as Bacillus subtilis arabinoxylan arabinofuranohydrolase  (XynD;BsAXH-m23;BSU18160)	NA|812aa|up_3|NZ_CP012673.1_7982751_7985187_-	cd09001, GH43_FsAxh1-like, Glycosyl hydrolase family 43 such as Fibrobacter succinogenes subsp	NA|130aa|up_2|NZ_CP012673.1_7985483_7985873_-	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|148aa|up_1|NZ_CP012673.1_7985927_7986371_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|478aa|up_0|NZ_CP012673.1_7986506_7987940_-	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|711aa|down_0|NZ_CP012673.1_7990387_7992520_+	pfam09937, DUF2169, Uncharacterized protein conserved in bacteria (DUF2169)	NA|830aa|down_1|NZ_CP012673.1_7992645_7995135_+	pfam09937, DUF2169, Uncharacterized protein conserved in bacteria (DUF2169)	NA|641aa|down_2|NZ_CP012673.1_7995131_7997054_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|515aa|down_3|NZ_CP012673.1_7997083_7998628_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|505aa|down_4|NZ_CP012673.1_7998646_8000161_+	NA	NA|289aa|down_5|NZ_CP012673.1_8000216_8001083_+	pfam12059, DUF3540, Protein of unknown function (DUF3540)	NA|147aa|down_6|NZ_CP012673.1_8001141_8001582_+	pfam13665, DUF4150, Domain of unknown function (DUF4150)	NA|881aa|down_7|NZ_CP012673.1_8001688_8004331_-	cd14792, GH27, glycosyl hydrolase family 27 (GH27)	NA|217aa|down_8|NZ_CP012673.1_8006178_8006829_-	pfam13409, GST_N_2, Glutathione S-transferase, N-terminal domain	NA|366aa|down_9|NZ_CP012673.1_8006876_8007974_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	30	8142696-8144271	2,28,3	PILER-CR,CRISPRCasFinder,CRT	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCTTCAATGGGGCCGCCGCCTTTCAGCGGCGGAAGG,GCTTCAATGGGGCCGCCGCCTTTCAGCGGCGGAAGG,GCTTCAATGGGGCCGCCGCCTTTCAGCGGCGGAAGG	36,36,36	0	0	NA	NA	NA:NA:NA	21,21,21	21	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|143aa|up_8|NZ_CP012673.1_8127731_8128160_+,NA|609aa|up_7|NZ_CP012673.1_8128156_8129983_+,NA|348aa|up_3|NZ_CP012673.1_8137136_8138180_-,NA|54aa|up_2|NZ_CP012673.1_8138389_8138551_-,NA|75aa|up_0|NZ_CP012673.1_8141856_8142081_+,NA|132aa|down_3|NZ_CP012673.1_8148496_8148892_+,NA|158aa|down_6|NZ_CP012673.1_8151862_8152336_-,NA|141aa|down_7|NZ_CP012673.1_8152447_8152870_-	NA|391aa|up_9|NZ_CP012673.1_8126562_8127735_+	PRK13531, PRK13531, regulatory ATPase RavA; Provisional	NA|143aa|up_8|NZ_CP012673.1_8127731_8128160_+	NA	NA|609aa|up_7|NZ_CP012673.1_8128156_8129983_+	NA	NA|1189aa|up_6|NZ_CP012673.1_8129979_8133546_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|635aa|up_5|NZ_CP012673.1_8133542_8135447_+	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|526aa|up_4|NZ_CP012673.1_8135478_8137056_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|348aa|up_3|NZ_CP012673.1_8137136_8138180_-	NA	NA|54aa|up_2|NZ_CP012673.1_8138389_8138551_-	NA	NA|624aa|up_1|NZ_CP012673.1_8138565_8140437_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|75aa|up_0|NZ_CP012673.1_8141856_8142081_+	NA	NA|331aa|down_0|NZ_CP012673.1_8145265_8146258_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|282aa|down_1|NZ_CP012673.1_8146532_8147378_+	cd19138, AKR_YeaE, Escherichia coli YeaE and similar proteins	NA|266aa|down_2|NZ_CP012673.1_8147462_8148260_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|132aa|down_3|NZ_CP012673.1_8148496_8148892_+	NA	NA|287aa|down_4|NZ_CP012673.1_8149254_8150115_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|492aa|down_5|NZ_CP012673.1_8150273_8151749_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|158aa|down_6|NZ_CP012673.1_8151862_8152336_-	NA	NA|141aa|down_7|NZ_CP012673.1_8152447_8152870_-	NA	NA|575aa|down_8|NZ_CP012673.1_8153241_8154966_+	pfam00656, Peptidase_C14, Caspase domain	NA|848aa|down_9|NZ_CP012673.1_8154952_8157496_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	31	8801685-8801793	29	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CCACCTCCAACGCGAGCCAGCTCGACGCGGACAGCGACGG	40	1	1	8801725-8801753	NZ_CP012673.1_5886337-5886365	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|105aa|up_9|NZ_CP012673.1_8791270_8791585_-,NA|105aa|up_7|NZ_CP012673.1_8792626_8792941_-,NA|121aa|up_6|NZ_CP012673.1_8793440_8793803_-,NA|211aa|up_4|NZ_CP012673.1_8796640_8797273_-,NA|82aa|up_1|NZ_CP012673.1_8800090_8800336_+,NA|48aa|up_0|NZ_CP012673.1_8800562_8800706_-,NA	NA|105aa|up_9|NZ_CP012673.1_8791270_8791585_-	NA	NA|146aa|up_8|NZ_CP012673.1_8791919_8792357_-	pfam13366, PDDEXK_3, PD-(D/E)XK nuclease superfamily	NA|105aa|up_7|NZ_CP012673.1_8792626_8792941_-	NA	NA|121aa|up_6|NZ_CP012673.1_8793440_8793803_-	NA	NA|817aa|up_5|NZ_CP012673.1_8793966_8796417_-	PRK06958, PRK06958, single-stranded DNA-binding protein; Provisional	NA|211aa|up_4|NZ_CP012673.1_8796640_8797273_-	NA	NA|387aa|up_3|NZ_CP012673.1_8797438_8798599_-	sd00038, Kelch, Kelch repeat	NA|404aa|up_2|NZ_CP012673.1_8798800_8800012_+	cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins	NA|82aa|up_1|NZ_CP012673.1_8800090_8800336_+	NA	NA|48aa|up_0|NZ_CP012673.1_8800562_8800706_-	NA	NA|553aa|down_0|NZ_CP012673.1_8802259_8803918_-	COG4566, TtrR, Response regulator [Signal transduction mechanisms]	NA|433aa|down_1|NZ_CP012673.1_8804772_8806071_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|333aa|down_2|NZ_CP012673.1_8806338_8807337_+	cd03267, ABC_NatA_like, ATP-binding cassette domain of an uncharacterized transporter similar in sequence to NatA	NA|271aa|down_3|NZ_CP012673.1_8807333_8808146_+	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|269aa|down_4|NZ_CP012673.1_8808145_8808952_+	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|563aa|down_5|NZ_CP012673.1_8808967_8810656_-	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|517aa|down_6|NZ_CP012673.1_8810912_8812463_+	cd10277, PQQ_ADH_I, Ethanol dehydrogenase, a bacterial quinoprotein (PQQ-dependent type I alcohol dehydrogenase)	NA|144aa|down_7|NZ_CP012673.1_8812571_8813003_+	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|354aa|down_8|NZ_CP012673.1_8813156_8814218_+	cd03311, CIMS_C_terminal_like, CIMS - Cobalamine-independent methonine synthase, or MetE, C-terminal domain_like	NA|763aa|down_9|NZ_CP012673.1_8814217_8816506_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	32	8850290-8850420	30	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CAGTCCGGACAGGGCTACGGCGGACAGGG	29	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|138aa|up_9|NZ_CP012673.1_8836701_8837115_+,NA|73aa|up_6|NZ_CP012673.1_8840467_8840686_-,NA|134aa|up_2|NZ_CP012673.1_8845181_8845583_-,NA|127aa|up_0|NZ_CP012673.1_8849155_8849536_-,NA|288aa|down_0|NZ_CP012673.1_8851208_8852072_+,NA|318aa|down_8|NZ_CP012673.1_8932412_8933366_+	NA|138aa|up_9|NZ_CP012673.1_8836701_8837115_+	NA	NA|115aa|up_8|NZ_CP012673.1_8837111_8837456_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|682aa|up_7|NZ_CP012673.1_8837713_8839759_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|73aa|up_6|NZ_CP012673.1_8840467_8840686_-	NA	NA|136aa|up_5|NZ_CP012673.1_8840925_8841333_-	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|142aa|up_4|NZ_CP012673.1_8841329_8841755_-	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|793aa|up_3|NZ_CP012673.1_8841915_8844294_-	COG0178, UvrA, Excinuclease ATPase subunit [DNA replication, recombination, and repair]	NA|134aa|up_2|NZ_CP012673.1_8845181_8845583_-	NA	NA|408aa|up_1|NZ_CP012673.1_8847103_8848327_+	TIGR04362, hypothetical_protein, choice-of-anchor C domain	NA|127aa|up_0|NZ_CP012673.1_8849155_8849536_-	NA	NA|288aa|down_0|NZ_CP012673.1_8851208_8852072_+	NA	NA|740aa|down_1|NZ_CP012673.1_8852296_8854516_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|464aa|down_2|NZ_CP012673.1_8854555_8855947_-	pfam01082, Cu2_monooxygen, Copper type II ascorbate-dependent monooxygenase, N-terminal domain	NA|1198aa|down_3|NZ_CP012673.1_8856252_8859846_+	pfam05268, GP38, Phage tail fibre adhesin Gp38	NA|421aa|down_4|NZ_CP012673.1_8861450_8862713_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|7404aa|down_5|NZ_CP012673.1_8862934_8885146_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|10111aa|down_6|NZ_CP012673.1_8885164_8915497_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|5573aa|down_7|NZ_CP012673.1_8915608_8932327_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|down_8|NZ_CP012673.1_8932412_8933366_+	NA	NA|182aa|down_9|NZ_CP012673.1_8933585_8934131_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	33	9117205-9117438	31	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGCCCGGTGGCGCCTTGACGTAAG	24	0	0	NA	NA	NA	3	3	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|174aa|up_8|NZ_CP012673.1_9101429_9101951_-,NA|687aa|up_4|NZ_CP012673.1_9105794_9107855_+,NA|272aa|up_1|NZ_CP012673.1_9114574_9115390_-,NA|163aa|down_1|NZ_CP012673.1_9119050_9119539_+,NA|127aa|down_2|NZ_CP012673.1_9119504_9119885_-,NA|514aa|down_8|NZ_CP012673.1_9126812_9128354_+	NA|354aa|up_9|NZ_CP012673.1_9100321_9101383_+	cd03514, CrtR_beta-carotene-hydroxylase, Beta-carotene hydroxylase (CrtR), the carotenoid zeaxanthin biosynthetic enzyme catalyzes the addition of hydroxyl groups to the beta-ionone rings of beta-carotene to form zeaxanthin and is found in bacteria and red algae	NA|174aa|up_8|NZ_CP012673.1_9101429_9101951_-	NA	NA|331aa|up_7|NZ_CP012673.1_9102128_9103121_-	pfam12127, YdfA_immunity, SigmaW regulon antibacterial	NA|157aa|up_6|NZ_CP012673.1_9103169_9103640_-	COG1030, NfeD, Membrane-bound serine protease (ClpP class) [Posttranslational modification, protein turnover, chaperones]	NA|503aa|up_5|NZ_CP012673.1_9103636_9105145_-	cd07021, Clp_protease_NfeD_like, Nodulation formation efficiency D (NfeD) is a membrane-bound ClpP-class protease	NA|687aa|up_4|NZ_CP012673.1_9105794_9107855_+	NA	NA|822aa|up_3|NZ_CP012673.1_9108090_9110556_+	sd00002, TSP3, Calcium-binding Thrombospondin type 3 (TSP3) repeat	NA|1338aa|up_2|NZ_CP012673.1_9110552_9114566_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|272aa|up_1|NZ_CP012673.1_9114574_9115390_-	NA	NA|470aa|up_0|NZ_CP012673.1_9115651_9117061_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|110aa|down_0|NZ_CP012673.1_9118214_9118544_-	pfam05593, RHS_repeat, RHS Repeat	NA|163aa|down_1|NZ_CP012673.1_9119050_9119539_+	NA	NA|127aa|down_2|NZ_CP012673.1_9119504_9119885_-	NA	NA|329aa|down_3|NZ_CP012673.1_9120224_9121211_-	PRK11689, PRK11689, aromatic amino acid efflux DMT transporter YddG	NA|261aa|down_4|NZ_CP012673.1_9121301_9122084_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|487aa|down_5|NZ_CP012673.1_9122253_9123714_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|354aa|down_6|NZ_CP012673.1_9123779_9124841_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|445aa|down_7|NZ_CP012673.1_9124842_9126177_+	pfam16472, DUF5050, Domain of unknown function (DUF5050)	NA|514aa|down_8|NZ_CP012673.1_9126812_9128354_+	NA	NA|276aa|down_9|NZ_CP012673.1_9128529_9129357_+	TIGR02452, conserved_hypothetical_protein, TIGR02452 family protein
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	34	9611671-9612043	4,32,3	CRT,CRISPRCasFinder,PILER-CR	no	cas6	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Unclear	GNNTCAATGCCCTTTGAGCAGGCATGTCTTGTTCGGG,TCAATGCCCTTTGAGTAGGCATGTCTTGTTCG,GTTTCAATGCCCTTTGAGCAGGCATGTCTTGTTCGGG	37,32,37	0	0	NA	NA	NA:NA:NA	4,4,2	4	Unclear	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|378aa|up_6|NZ_CP012673.1_9597970_9599104_+,NA|274aa|up_1|NZ_CP012673.1_9609296_9610118_+,NA|88aa|up_0|NZ_CP012673.1_9610580_9610844_-,NA|124aa|down_5|NZ_CP012673.1_9619346_9619718_+,NA|177aa|down_7|NZ_CP012673.1_9621105_9621636_+	NA|1217aa|up_9|NZ_CP012673.1_9591819_9595470_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|118aa|up_8|NZ_CP012673.1_9595561_9595915_+	pfam02675, AdoMet_dc, S-adenosylmethionine decarboxylase	NA|475aa|up_7|NZ_CP012673.1_9595968_9597393_-	cd17346, MFS_DtpA_like, Dipeptide and tripeptide permease A (DtpA)-like subfamily of the Major Facilitator Superfamily of transporters	NA|378aa|up_6|NZ_CP012673.1_9597970_9599104_+	NA	NA|323aa|up_5|NZ_CP012673.1_9600979_9601948_-	pfam09983, DUF2220, Uncharacterized protein conserved in bacteria C-term(DUF2220)	NA|1474aa|up_4|NZ_CP012673.1_9601947_9606369_-	PRK04863, mukB, chromosome partition protein MukB	NA|230aa|up_3|NZ_CP012673.1_9606365_9607055_-	PRK05256, PRK05256, chromosome partition protein MukE	NA|442aa|up_2|NZ_CP012673.1_9607051_9608377_-	PRK05260, PRK05260, chromosome partition protein MukF	NA|274aa|up_1|NZ_CP012673.1_9609296_9610118_+	NA	NA|88aa|up_0|NZ_CP012673.1_9610580_9610844_-	NA	cas6|308aa|down_0|NZ_CP012673.1_9612478_9613402_+	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	NA|181aa|down_1|NZ_CP012673.1_9613546_9614089_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|435aa|down_2|NZ_CP012673.1_9614275_9615580_+	PHA00370, III, attachment protein	NA|484aa|down_3|NZ_CP012673.1_9615930_9617382_+	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|429aa|down_4|NZ_CP012673.1_9617630_9618917_+	cd01831, Endoglucanase_E_like, Endoglucanase E-like members of the SGNH hydrolase family; Endoglucanase E catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans	NA|124aa|down_5|NZ_CP012673.1_9619346_9619718_+	NA	NA|308aa|down_6|NZ_CP012673.1_9619913_9620837_+	cd07986, LPLAT_ACT14924-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: Unknown ACT14924	NA|177aa|down_7|NZ_CP012673.1_9621105_9621636_+	NA	NA|356aa|down_8|NZ_CP012673.1_9621844_9622912_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|524aa|down_9|NZ_CP012673.1_9623083_9624655_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	35	9815019-9815094	33	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GGCGGCGCAGCGGGCGTGCTGAG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|214aa|up_9|NZ_CP012673.1_9794655_9795297_-,NA|393aa|up_7|NZ_CP012673.1_9797397_9798576_+,NA|508aa|up_4|NZ_CP012673.1_9803223_9804747_+,NA|112aa|up_0|NZ_CP012673.1_9814435_9814771_+,NA|230aa|down_2|NZ_CP012673.1_9816659_9817349_-,NA|280aa|down_6|NZ_CP012673.1_9822969_9823809_+	NA|214aa|up_9|NZ_CP012673.1_9794655_9795297_-	NA	NA|562aa|up_8|NZ_CP012673.1_9795312_9796998_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|393aa|up_7|NZ_CP012673.1_9797397_9798576_+	NA	NA|565aa|up_6|NZ_CP012673.1_9798816_9800511_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|886aa|up_5|NZ_CP012673.1_9800557_9803215_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|508aa|up_4|NZ_CP012673.1_9803223_9804747_+	NA	NA|1224aa|up_3|NZ_CP012673.1_9804827_9808499_+	cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain in Subtilisin-like proteins	NA|799aa|up_2|NZ_CP012673.1_9808517_9810914_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|896aa|up_1|NZ_CP012673.1_9811686_9814374_+	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|112aa|up_0|NZ_CP012673.1_9814435_9814771_+	NA	NA|245aa|down_0|NZ_CP012673.1_9815237_9815972_+	pfam03824, NicO, High-affinity nickel-transport protein	NA|200aa|down_1|NZ_CP012673.1_9815991_9816591_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|230aa|down_2|NZ_CP012673.1_9816659_9817349_-	NA	NA|413aa|down_3|NZ_CP012673.1_9817345_9818584_-	TIGR04039, MXAN_0977_Heme2, di-heme enzyme, MXAN_0977 family	NA|318aa|down_4|NZ_CP012673.1_9818604_9819558_-	TIGR04052, hypothetical_protein_MettrDRAFT_3899, AZL_007920/MXAN_0976 family protein	NA|726aa|down_5|NZ_CP012673.1_9819847_9822025_+	pfam00593, TonB_dep_Rec, TonB dependent receptor	NA|280aa|down_6|NZ_CP012673.1_9822969_9823809_+	NA	NA|583aa|down_7|NZ_CP012673.1_9823840_9825589_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|410aa|down_8|NZ_CP012673.1_9825677_9826907_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|255aa|down_9|NZ_CP012673.1_9826938_9827703_-	pfam08241, Methyltransf_11, Methyltransferase domain
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	36	10400700-10400828	34	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GGCGGCGCGCCCGCGGCGCTGGGC	24	1	1	10400724-10400750	NZ_CP012673.1_10400694-10400720	NA	2	2	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|232aa|up_3|NZ_CP012673.1_10396159_10396855_-,NA|174aa|down_0|NZ_CP012673.1_10402573_10403095_+,NA|598aa|down_9|NZ_CP012673.1_10415262_10417056_-	NA|571aa|up_9|NZ_CP012673.1_10386066_10387779_+	COG5271, MDN1, AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only]	NA|442aa|up_8|NZ_CP012673.1_10387796_10389122_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|850aa|up_7|NZ_CP012673.1_10389247_10391797_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|198aa|up_6|NZ_CP012673.1_10391910_10392504_-	pfam05494, MlaC, MlaC protein	NA|470aa|up_5|NZ_CP012673.1_10392726_10394136_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|406aa|up_4|NZ_CP012673.1_10394132_10395350_-	COG1459, PulF, Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion]	NA|232aa|up_3|NZ_CP012673.1_10396159_10396855_-	NA	NA|385aa|up_2|NZ_CP012673.1_10396917_10398072_-	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|290aa|up_1|NZ_CP012673.1_10398306_10399176_-	PRK13942, PRK13942, protein-L-isoaspartate O-methyltransferase; Provisional	NA|390aa|up_0|NZ_CP012673.1_10399309_10400479_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|174aa|down_0|NZ_CP012673.1_10402573_10403095_+	NA	NA|874aa|down_1|NZ_CP012673.1_10403184_10405806_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|232aa|down_2|NZ_CP012673.1_10405965_10406661_+	TIGR00266, TIGR00266, TIGR00266 family protein	NA|223aa|down_3|NZ_CP012673.1_10406692_10407361_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|228aa|down_4|NZ_CP012673.1_10407377_10408061_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|502aa|down_5|NZ_CP012673.1_10408485_10409991_+	pfam00498, FHA, FHA domain	NA|520aa|down_6|NZ_CP012673.1_10410036_10411596_+	cd01825, SGNH_hydrolase_peri1, SGNH_peri1; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases	NA|547aa|down_7|NZ_CP012673.1_10411598_10413239_-	COG3202, COG3202, ATP/ADP translocase [Energy production and conversion]	NA|525aa|down_8|NZ_CP012673.1_10413537_10415112_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|598aa|down_9|NZ_CP012673.1_10415262_10417056_-	NA
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	37	10676397-10676623	35	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	TTCGTGAGGCCGGCGAGCGGGGAGAGGTC	29	0	0	NA	NA	NA	3	3	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|203aa|up_0|NZ_CP012673.1_10673736_10674345_+,NA|76aa|down_1|NZ_CP012673.1_10678293_10678521_-	NA|749aa|up_9|NZ_CP012673.1_10659863_10662110_+	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|743aa|up_8|NZ_CP012673.1_10662510_10664739_-	pfam11369, DUF3160, Protein of unknown function (DUF3160)	NA|287aa|up_7|NZ_CP012673.1_10666147_10667008_+	pfam05114, DUF692, Protein of unknown function (DUF692)	NA|329aa|up_6|NZ_CP012673.1_10667004_10667991_+	pfam09836, DUF2063, Putative DNA-binding domain	NA|209aa|up_5|NZ_CP012673.1_10668222_10668849_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|699aa|up_4|NZ_CP012673.1_10668937_10671034_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|197aa|up_3|NZ_CP012673.1_10671204_10671795_+	TIGR02196, Gene_56_protein, Glutaredoxin-like protein, YruB-family	NA|398aa|up_2|NZ_CP012673.1_10672009_10673203_+	COG4637, COG4637, Predicted ATPase [General function prediction only]	NA|100aa|up_1|NZ_CP012673.1_10673261_10673561_+	pfam14103, DUF4276, Domain of unknown function (DUF4276)	NA|203aa|up_0|NZ_CP012673.1_10673736_10674345_+	NA	NA|157aa|down_0|NZ_CP012673.1_10677283_10677754_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|76aa|down_1|NZ_CP012673.1_10678293_10678521_-	NA	NA|269aa|down_2|NZ_CP012673.1_10678683_10679490_-	pfam00797, Acetyltransf_2, N-acetyltransferase	NA|308aa|down_3|NZ_CP012673.1_10679541_10680465_-	COG0462, PrsA, Phosphoribosylpyrophosphate synthetase [Nucleotide transport and metabolism / Amino acid transport and metabolism]	NA|610aa|down_4|NZ_CP012673.1_10680461_10682291_-	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]	NA|345aa|down_5|NZ_CP012673.1_10682290_10683325_-	cd16325, LolA, LolA, a periplasmic chaperone	NA|425aa|down_6|NZ_CP012673.1_10684039_10685314_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|791aa|down_7|NZ_CP012673.1_10685352_10687725_+	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|143aa|down_8|NZ_CP012673.1_10687862_10688291_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|173aa|down_9|NZ_CP012673.1_10688306_10688825_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	38	10821659-10821765	36	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GAGCCCTGCGAGGCGCTGCTCGGGCG	26	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|86aa|up_9|NZ_CP012673.1_10812822_10813080_-,NA|248aa|up_6|NZ_CP012673.1_10815704_10816448_-,NA|85aa|up_5|NZ_CP012673.1_10816520_10816775_-,NA|96aa|up_1|NZ_CP012673.1_10820007_10820295_-,NA|110aa|down_0|NZ_CP012673.1_10823403_10823733_+,NA|371aa|down_2|NZ_CP012673.1_10825261_10826374_-,NA|281aa|down_4|NZ_CP012673.1_10827244_10828087_-,NA|248aa|down_6|NZ_CP012673.1_10829131_10829875_-,NA|85aa|down_7|NZ_CP012673.1_10829947_10830202_-	NA|86aa|up_9|NZ_CP012673.1_10812822_10813080_-	NA	NA|443aa|up_8|NZ_CP012673.1_10813435_10814764_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|266aa|up_7|NZ_CP012673.1_10814871_10815669_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|248aa|up_6|NZ_CP012673.1_10815704_10816448_-	NA	NA|85aa|up_5|NZ_CP012673.1_10816520_10816775_-	NA	NA|143aa|up_4|NZ_CP012673.1_10816842_10817271_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|197aa|up_3|NZ_CP012673.1_10817682_10818273_-	smart00886, Dabb, Stress responsive A/B Barrel Domain	NA|539aa|up_2|NZ_CP012673.1_10818327_10819944_-	pfam00264, Tyrosinase, Common central domain of tyrosinase	NA|96aa|up_1|NZ_CP012673.1_10820007_10820295_-	NA	NA|143aa|up_0|NZ_CP012673.1_10820632_10821061_-	PRK06108, PRK06108, pyridoxal phosphate-dependent aminotransferase	NA|110aa|down_0|NZ_CP012673.1_10823403_10823733_+	NA	NA|392aa|down_1|NZ_CP012673.1_10823929_10825105_-	COG1119, ModF, ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism]	NA|371aa|down_2|NZ_CP012673.1_10825261_10826374_-	NA	NA|207aa|down_3|NZ_CP012673.1_10826556_10827177_-	pfam01434, Peptidase_M41, Peptidase family M41	NA|281aa|down_4|NZ_CP012673.1_10827244_10828087_-	NA	NA|266aa|down_5|NZ_CP012673.1_10828298_10829096_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|248aa|down_6|NZ_CP012673.1_10829131_10829875_-	NA	NA|85aa|down_7|NZ_CP012673.1_10829947_10830202_-	NA	NA|143aa|down_8|NZ_CP012673.1_10830269_10830698_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|317aa|down_9|NZ_CP012673.1_10831304_10832255_+	TIGR02249, Integrase/recombinase_E2_protein
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	39	11336903-11336977	37	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGCGTGGTTCCCGCCGCCGGCGC	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|541aa|up_5|NZ_CP012673.1_11328369_11329992_+,NA|504aa|up_4|NZ_CP012673.1_11330427_11331939_+,NA|170aa|up_3|NZ_CP012673.1_11332133_11332643_+,NA|59aa|up_1|NZ_CP012673.1_11335962_11336139_-,NA|158aa|down_5|NZ_CP012673.1_11347080_11347554_-,NA|84aa|down_8|NZ_CP012673.1_11353484_11353736_+	NA|393aa|up_9|NZ_CP012673.1_11321882_11323061_-	TIGR04117, Syntroph_Cxxx, Syntrophus aciditrophicus Cys-Xaa-Xaa-Xaa repeat radical SAM target protein	NA|195aa|up_8|NZ_CP012673.1_11323487_11324072_+	cd05381, CAP_PR-1, CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain of pathogenesis-related protein 1 (PR-1) family proteins	NA|383aa|up_7|NZ_CP012673.1_11324223_11325372_+	cd00567, ACAD, Acyl-CoA dehydrogenase	NA|682aa|up_6|NZ_CP012673.1_11325596_11327642_+	cd06456, M3A_DCP, Peptidase family M3, dipeptidyl carboxypeptidase (DCP)	NA|541aa|up_5|NZ_CP012673.1_11328369_11329992_+	NA	NA|504aa|up_4|NZ_CP012673.1_11330427_11331939_+	NA	NA|170aa|up_3|NZ_CP012673.1_11332133_11332643_+	NA	NA|632aa|up_2|NZ_CP012673.1_11332779_11334675_-	TIGR04247, nitrous_oxide_maturation_protein_NosD, nitrous oxide reductase family maturation protein NosD	NA|59aa|up_1|NZ_CP012673.1_11335962_11336139_-	NA	NA|110aa|up_0|NZ_CP012673.1_11336311_11336641_+	pfam07238, PilZ, PilZ domain	NA|202aa|down_0|NZ_CP012673.1_11339575_11340181_-	TIGR02727, Uncharacterized_protein_YqgN, 5,10-methenyltetrahydrofolate synthetase	NA|151aa|down_1|NZ_CP012673.1_11340546_11340999_+	PRK00567, mscL, large-conductance mechanosensitive channel protein MscL	NA|281aa|down_2|NZ_CP012673.1_11341269_11342112_+	PRK11892, PRK11892, pyruvate dehydrogenase subunit beta; Provisional	NA|907aa|down_3|NZ_CP012673.1_11342689_11345410_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|475aa|down_4|NZ_CP012673.1_11345583_11347008_-	pfam08308, PEGA, PEGA domain	NA|158aa|down_5|NZ_CP012673.1_11347080_11347554_-	NA	NA|1233aa|down_6|NZ_CP012673.1_11347550_11351249_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|664aa|down_7|NZ_CP012673.1_11351245_11353237_-	pfam12770, CHAT, CHAT domain	NA|84aa|down_8|NZ_CP012673.1_11353484_11353736_+	NA	NA|37aa|down_9|NZ_CP012673.1_11353878_11353989_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	40	11923044-11923130	38	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGAGAGCGCCCTCGCCCCCTCGG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA	NA|484aa|up_9|NZ_CP012673.1_11905635_11907087_-	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|176aa|up_8|NZ_CP012673.1_11907225_11907753_+	pfam09413, DUF2007, Putative prokaryotic signal transducing protein	NA|340aa|up_7|NZ_CP012673.1_11907799_11908819_+	PRK12556, PRK12556, tryptophanyl-tRNA synthetase; Provisional	NA|583aa|up_6|NZ_CP012673.1_11908884_11910633_-	pfam04294, VanW, VanW like protein	NA|451aa|up_5|NZ_CP012673.1_11910763_11912116_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|753aa|up_4|NZ_CP012673.1_11912411_11914670_-	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|151aa|up_3|NZ_CP012673.1_11916866_11917319_+	TIGR02324, phosphonate_uptake_transporter_ATP-binding_protein, phosphonate C-P lyase system protein PhnL	NA|418aa|up_2|NZ_CP012673.1_11918603_11919857_-	pfam09835, DUF2062, Uncharacterized protein conserved in bacteria (DUF2062)	NA|291aa|up_1|NZ_CP012673.1_11919860_11920733_-	sd00006, TPR, Tetratricopeptide repeat	NA|732aa|up_0|NZ_CP012673.1_11920815_11923011_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|131aa|down_0|NZ_CP012673.1_11923240_11923633_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|150aa|down_1|NZ_CP012673.1_11923641_11924091_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|585aa|down_2|NZ_CP012673.1_11924800_11926555_-	COG5000, NtrY, Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation [Signal transduction mechanisms]	NA|611aa|down_3|NZ_CP012673.1_11926665_11928498_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|428aa|down_4|NZ_CP012673.1_11928566_11929850_-	cd17335, MFS_MFSD6, Major facilitator superfamily domain-containing protein 6	NA|147aa|down_5|NZ_CP012673.1_11930106_11930547_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|454aa|down_6|NZ_CP012673.1_11930878_11932240_-	cd08490, PBP2_NikA_DppA_OppA_like_3, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|463aa|down_7|NZ_CP012673.1_11932236_11933625_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|814aa|down_8|NZ_CP012673.1_11933635_11936077_-	COG1033, COG1033, Predicted exporters of the RND superfamily [General function prediction only]	NA|204aa|down_9|NZ_CP012673.1_11936228_11936840_+	pfam01327, Pep_deformylase, Polypeptide deformylase
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	41	12513345-12513429	39	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	GCGGCGCCGTGATCGCCGACCTGCC	25	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA,NA|272aa|down_6|NZ_CP012673.1_12523983_12524799_+	NA|877aa|up_9|NZ_CP012673.1_12496926_12499557_-	pfam04738, Lant_dehydr_N, Lantibiotic dehydratase, C-terminus	NA|268aa|up_8|NZ_CP012673.1_12499553_12500357_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|611aa|up_7|NZ_CP012673.1_12500353_12502186_-	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|241aa|up_6|NZ_CP012673.1_12502166_12502889_-	TIGR03882, hypothetical_protein, bacteriocin biosynthesis cyclodehydratase domain	NA|439aa|up_5|NZ_CP012673.1_12502866_12504183_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|416aa|up_4|NZ_CP012673.1_12504468_12505716_+	pfam02624, YcaO, YcaO cyclodehydratase, ATP-ad Mg2+-binding	NA|470aa|up_3|NZ_CP012673.1_12505712_12507122_+	NF033432, ThioGly_TfuA_rel, TfuA-related McrA-glycine thioamidation protein	NA|318aa|up_2|NZ_CP012673.1_12507108_12508062_+	COG4152, COG4152, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|438aa|up_1|NZ_CP012673.1_12508058_12509372_+	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|765aa|up_0|NZ_CP012673.1_12509756_12512051_-	cd18805, SF2_C_suv3, C-terminal helicase domain of ATP-dependent RNA helicase	NA|303aa|down_0|NZ_CP012673.1_12514267_12515176_-	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|454aa|down_1|NZ_CP012673.1_12515525_12516887_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|560aa|down_2|NZ_CP012673.1_12516922_12518602_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|565aa|down_3|NZ_CP012673.1_12519361_12521056_-	cd00144, MPP_PPP_family, phosphoprotein phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|424aa|down_4|NZ_CP012673.1_12521129_12522401_-	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|463aa|down_5|NZ_CP012673.1_12522397_12523786_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|272aa|down_6|NZ_CP012673.1_12523983_12524799_+	NA	NA|459aa|down_7|NZ_CP012673.1_12524851_12526228_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|563aa|down_8|NZ_CP012673.1_12526224_12527913_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|422aa|down_9|NZ_CP012673.1_12528577_12529843_+	cd08987, GH62, Glycosyl hydrolase family 62, characterized arabinofuranosidases
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	42	13377693-13377787	40	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGAGCGAGCCCGCGGCCGCCGCC	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|459aa|up_0|NZ_CP012673.1_13374649_13376026_+,NA|129aa|down_3|NZ_CP012673.1_13383349_13383736_+	NA|430aa|up_9|NZ_CP012673.1_13358315_13359605_+	cd10231, YegD_like, Escherichia coli YegD, a putative chaperone protein, and related proteins	NA|428aa|up_8|NZ_CP012673.1_13360678_13361962_-	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|467aa|up_7|NZ_CP012673.1_13362720_13364121_+	pfam01341, Glyco_hydro_6, Glycosyl hydrolases family 6	NA|308aa|up_6|NZ_CP012673.1_13364307_13365231_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|360aa|up_5|NZ_CP012673.1_13365220_13366300_-	cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold	NA|250aa|up_4|NZ_CP012673.1_13366365_13367115_-	pfam13462, Thioredoxin_4, Thioredoxin	NA|975aa|up_3|NZ_CP012673.1_13367447_13370372_+	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|165aa|up_2|NZ_CP012673.1_13370432_13370927_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|753aa|up_1|NZ_CP012673.1_13371575_13373834_-	pfam17210, SdrD_B, SdrD B-like domain	NA|459aa|up_0|NZ_CP012673.1_13374649_13376026_+	NA	NA|577aa|down_0|NZ_CP012673.1_13378615_13380346_-	TIGR01451, unnamed_protein_product, conserved repeat domain	NA|578aa|down_1|NZ_CP012673.1_13380342_13382076_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|183aa|down_2|NZ_CP012673.1_13382581_13383130_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|129aa|down_3|NZ_CP012673.1_13383349_13383736_+	NA	NA|401aa|down_4|NZ_CP012673.1_13383752_13384955_-	pfam07075, DUF1343, Protein of unknown function (DUF1343)	NA|516aa|down_5|NZ_CP012673.1_13385035_13386583_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|297aa|down_6|NZ_CP012673.1_13386722_13387613_-	PRK00021, truA, tRNA pseudouridine(38-40) synthase TruA	NA|206aa|down_7|NZ_CP012673.1_13388072_13388690_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|182aa|down_8|NZ_CP012673.1_13388910_13389456_-	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]	NA|373aa|down_9|NZ_CP012673.1_13389493_13390612_-	pfam08308, PEGA, PEGA domain
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	43	13382269-13382346	41	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CGCAGCTCGCTCCGGCCGCGCCG	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|459aa|up_3|NZ_CP012673.1_13374649_13376026_+,NA|129aa|down_1|NZ_CP012673.1_13383349_13383736_+	NA|308aa|up_9|NZ_CP012673.1_13364307_13365231_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|360aa|up_8|NZ_CP012673.1_13365220_13366300_-	cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold	NA|250aa|up_7|NZ_CP012673.1_13366365_13367115_-	pfam13462, Thioredoxin_4, Thioredoxin	NA|975aa|up_6|NZ_CP012673.1_13367447_13370372_+	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|165aa|up_5|NZ_CP012673.1_13370432_13370927_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|753aa|up_4|NZ_CP012673.1_13371575_13373834_-	pfam17210, SdrD_B, SdrD B-like domain	NA|459aa|up_3|NZ_CP012673.1_13374649_13376026_+	NA	NA|626aa|up_2|NZ_CP012673.1_13376323_13378201_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|577aa|up_1|NZ_CP012673.1_13378615_13380346_-	TIGR01451, unnamed_protein_product, conserved repeat domain	NA|578aa|up_0|NZ_CP012673.1_13380342_13382076_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|183aa|down_0|NZ_CP012673.1_13382581_13383130_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|129aa|down_1|NZ_CP012673.1_13383349_13383736_+	NA	NA|401aa|down_2|NZ_CP012673.1_13383752_13384955_-	pfam07075, DUF1343, Protein of unknown function (DUF1343)	NA|516aa|down_3|NZ_CP012673.1_13385035_13386583_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|297aa|down_4|NZ_CP012673.1_13386722_13387613_-	PRK00021, truA, tRNA pseudouridine(38-40) synthase TruA	NA|206aa|down_5|NZ_CP012673.1_13388072_13388690_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|182aa|down_6|NZ_CP012673.1_13388910_13389456_-	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]	NA|373aa|down_7|NZ_CP012673.1_13389493_13390612_-	pfam08308, PEGA, PEGA domain	NA|478aa|down_8|NZ_CP012673.1_13390821_13392255_+	cd03089, PMM_PGM, The phosphomannomutase/phosphoglucomutase (PMM/PGM) bifunctional enzyme catalyzes the reversible conversion of 1-phospho to 6-phospho-sugars (e	NA|295aa|down_9|NZ_CP012673.1_13392251_13393136_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional
GCF_002950945.1_ASM295094v1	NZ_CP012673	Sorangium cellulosum strain So ce26 chromosome, complete genome	44	14105950-14106042	42	CRISPRCasFinder	no		RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	Orphan	CCGGGGCCGCCGGGATCCACGGCGTCC	27	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,WYL,cas3,DinG,PD-DExK,cas6	NA|163aa|up_4|NZ_CP012673.1_14099445_14099934_+,NA|468aa|up_0|NZ_CP012673.1_14103034_14104438_-,NA|293aa|down_2|NZ_CP012673.1_14112613_14113492_-,NA|191aa|down_5|NZ_CP012673.1_14116379_14116952_-,NA|260aa|down_7|NZ_CP012673.1_14118686_14119466_-,NA|294aa|down_8|NZ_CP012673.1_14119468_14120350_-	NA|530aa|up_9|NZ_CP012673.1_14090864_14092454_+	cd06548, GH18_chitinase, The GH18 (glycosyl hydrolases, family 18) type II chitinases hydrolyze chitin, an abundant polymer of N-acetylglucosamine and have been identified in bacteria, fungi, insects, plants, viruses, and protozoan parasites	NA|521aa|up_8|NZ_CP012673.1_14092556_14094119_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|204aa|up_7|NZ_CP012673.1_14095514_14096126_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|731aa|up_6|NZ_CP012673.1_14096398_14098591_+	PRK09420, cpdB, bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase	NA|247aa|up_5|NZ_CP012673.1_14098587_14099328_+	sd00010, SLR, Sel1-like repeat	NA|163aa|up_4|NZ_CP012673.1_14099445_14099934_+	NA	NA|202aa|up_3|NZ_CP012673.1_14100029_14100635_-	cd02970, PRX_like2, Peroxiredoxin (PRX)-like 2 family; hypothetical proteins that show sequence similarity to PRXs	NA|216aa|up_2|NZ_CP012673.1_14100703_14101351_+	pfam02909, TetR_C, Tetracyclin repressor, C-terminal all-alpha domain	NA|521aa|up_1|NZ_CP012673.1_14101425_14102988_-	PRK06184, PRK06184, hypothetical protein; Provisional	NA|468aa|up_0|NZ_CP012673.1_14103034_14104438_-	NA	NA|771aa|down_0|NZ_CP012673.1_14106726_14109039_-	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|1097aa|down_1|NZ_CP012673.1_14109255_14112546_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|293aa|down_2|NZ_CP012673.1_14112613_14113492_-	NA	NA|72aa|down_3|NZ_CP012673.1_14113798_14114014_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|197aa|down_4|NZ_CP012673.1_14115770_14116361_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|191aa|down_5|NZ_CP012673.1_14116379_14116952_-	NA	NA|307aa|down_6|NZ_CP012673.1_14117337_14118258_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|260aa|down_7|NZ_CP012673.1_14118686_14119466_-	NA	NA|294aa|down_8|NZ_CP012673.1_14119468_14120350_-	NA	NA|270aa|down_9|NZ_CP012673.1_14120515_14121325_-	PTZ00146, PTZ00146, fibrillarin; Provisional
