assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	1	98282-98430	1	PILER-CR	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GTCGCCGTCGCTGAGCCCGTCGCCGTCGGTGTCGACG	37	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|261aa|up_7|NC_010162.1_87732_88515_+,NA|104aa|down_5|NC_010162.1_106567_106879_+,NA|176aa|down_6|NC_010162.1_107086_107614_+,NA|108aa|down_7|NC_010162.1_107746_108070_+	NA|452aa|up_9|NC_010162.1_85403_86759_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|225aa|up_8|NC_010162.1_86882_87557_-	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|261aa|up_7|NC_010162.1_87732_88515_+	NA	NA|397aa|up_6|NC_010162.1_88684_89875_-	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|217aa|up_5|NC_010162.1_90284_90935_-	pfam00942, CBM_3, Cellulose binding domain	NA|155aa|up_4|NC_010162.1_91200_91665_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|61aa|up_3|NC_010162.1_91888_92071_-	TIGR04526, predic_Ig_block, putative immunoglobulin-blocking virulence protein	NA|548aa|up_2|NC_010162.1_92279_93923_+	PRK06184, PRK06184, hypothetical protein; Provisional	NA|212aa|up_1|NC_010162.1_93992_94628_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|289aa|up_0|NC_010162.1_94678_95545_-	pfam01430, HSP33, Hsp33 protein	NA|609aa|down_0|NC_010162.1_100008_101835_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|204aa|down_1|NC_010162.1_102181_102793_+	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|264aa|down_2|NC_010162.1_102899_103691_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|445aa|down_3|NC_010162.1_103736_105071_-	PRK04531, PRK04531, acetylglutamate kinase; Provisional	NA|334aa|down_4|NC_010162.1_105076_106078_-	PRK04523, PRK04523, N-acetylornithine carbamoyltransferase; Reviewed	NA|104aa|down_5|NC_010162.1_106567_106879_+	NA	NA|176aa|down_6|NC_010162.1_107086_107614_+	NA	NA|108aa|down_7|NC_010162.1_107746_108070_+	NA	NA|1255aa|down_8|NC_010162.1_108006_111771_-	COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) [DNA replication, recombination, and repair]	NA|320aa|down_9|NC_010162.1_112022_112982_+	cd13962, PT_UbiA_UBIAD1, 1,4-Dihydroxy-2-naphthoate octaprenyltransferase
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	2	228285-228596	1	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	ANCNCAACCCCGGTCGGG	18	1	1	228303-228350	NC_010162.1_228125-228172	NA	6	6	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|143aa|up_8|NC_010162.1_219351_219780_+,NA|437aa|down_2|NC_010162.1_230744_232055_+,NA|183aa|down_3|NC_010162.1_232144_232693_-,NA|517aa|down_9|NC_010162.1_237706_239257_+	NA|235aa|up_9|NC_010162.1_218641_219346_+	cd00421, intradiol_dioxygenase, Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates	NA|143aa|up_8|NC_010162.1_219351_219780_+	NA	NA|279aa|up_7|NC_010162.1_219834_220671_-	PRK09685, PRK09685, DNA-binding transcriptional activator FeaR; Provisional	NA|220aa|up_6|NC_010162.1_220893_221553_-	pfam04264, YceI, YceI-like domain	NA|315aa|up_5|NC_010162.1_221574_222519_-	cd07363, 45_DOPA_Dioxygenase, The Class III extradiol dioxygenase, 4,5-DOPA Dioxygenase, catalyzes the incorporation of both atoms of molecular oxygen into 4,5-dihydroxy-phenylalanine	NA|323aa|up_4|NC_010162.1_222567_223536_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|360aa|up_3|NC_010162.1_223927_225007_+	cd13558, PBP2_SsuA_like_2, Putative substrate binding domain of sulfonate binding protein, the type 2 periplasmic binding protein fold	NA|303aa|up_2|NC_010162.1_225003_225912_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|263aa|up_1|NC_010162.1_225887_226676_+	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|437aa|up_0|NC_010162.1_226714_228025_+	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|441aa|down_0|NC_010162.1_228627_229950_+	cd07561, Peptidase_S41_CPP_like, C-terminal processing peptidase-like; serine protease family S41	NA|213aa|down_1|NC_010162.1_230106_230745_+	cd03016, PRX_1cys, Peroxiredoxin (PRX) family, 1-cys PRX subfamily; composed of PRXs containing only one conserved cysteine, which serves as the peroxidatic cysteine	NA|437aa|down_2|NC_010162.1_230744_232055_+	NA	NA|183aa|down_3|NC_010162.1_232144_232693_-	NA	NA|136aa|down_4|NC_010162.1_232870_233278_-	cd02220, cupin_ABP1, auxin-binding protein 1, cupin domain	NA|231aa|down_5|NC_010162.1_233467_234160_-	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|238aa|down_6|NC_010162.1_234408_235122_+	COG1414, IclR, Transcriptional regulator [Transcription]	NA|300aa|down_7|NC_010162.1_235118_236018_+	PRK08320, PRK08320, branched-chain amino acid aminotransferase; Reviewed	NA|307aa|down_8|NC_010162.1_236096_237017_-	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|517aa|down_9|NC_010162.1_237706_239257_+	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	3	388616-390422	2,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGCNNN,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,39,36	1	1	389238-389280	NC_010162.1_4036544-4036502	NA:NA:NA:NA	21,24,24,21	24	Unclear	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	cas8u1|313aa|up_7|NC_010162.1_375508_376447_-,NA|276aa|up_0|NC_010162.1_387053_387881_-,NA|111aa|down_0|NC_010162.1_390523_390856_+,NA|699aa|down_2|NC_010162.1_393947_396044_+,NA|194aa|down_4|NC_010162.1_397587_398169_+,NA|422aa|down_7|NC_010162.1_400491_401757_+,NA|359aa|down_8|NC_010162.1_402293_403370_-	NA|359aa|up_9|NC_010162.1_373630_374707_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|249aa|up_8|NC_010162.1_374703_375450_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	cas8u1|313aa|up_7|NC_010162.1_375508_376447_-	NA	cas3|1002aa|up_6|NC_010162.1_376443_379449_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	csb2gr5|564aa|up_5|NC_010162.1_379441_381133_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	csb1gr7|426aa|up_4|NC_010162.1_381132_382410_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas1|311aa|up_3|NC_010162.1_382719_383652_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|97aa|up_2|NC_010162.1_383742_384033_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|309aa|up_1|NC_010162.1_384196_385123_-	PRK05687, fliH, flagellar assembly protein FliH	NA|276aa|up_0|NC_010162.1_387053_387881_-	NA	NA|111aa|down_0|NC_010162.1_390523_390856_+	NA	NA|886aa|down_1|NC_010162.1_391228_393886_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|699aa|down_2|NC_010162.1_393947_396044_+	NA	NA|469aa|down_3|NC_010162.1_396184_397591_+	sd00010, SLR, Sel1-like repeat	NA|194aa|down_4|NC_010162.1_397587_398169_+	NA	NA|197aa|down_5|NC_010162.1_398259_398850_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|193aa|down_6|NC_010162.1_398930_399509_-	PRK12366, PRK12366, replication factor A; Reviewed	NA|422aa|down_7|NC_010162.1_400491_401757_+	NA	NA|359aa|down_8|NC_010162.1_402293_403370_-	NA	NA|627aa|down_9|NC_010162.1_403380_405261_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	4	1548335-1548435	2	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CCGGCGGGCACACGCCGCAGTCG	23	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA	NA|187aa|up_9|NC_010162.1_1534559_1535120_-	pfam08239, SH3_3, Bacterial SH3 domain	NA|364aa|up_8|NC_010162.1_1535259_1536351_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|814aa|up_7|NC_010162.1_1536369_1538811_-	cd16148, sulfatase_like, uncharacterized sulfatase subfamily	NA|198aa|up_6|NC_010162.1_1539109_1539703_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|676aa|up_5|NC_010162.1_1539883_1541911_+	sd00042, LVIVD, LVIVD repeat	NA|301aa|up_4|NC_010162.1_1541907_1542810_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|348aa|up_3|NC_010162.1_1543312_1544356_+	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|494aa|up_2|NC_010162.1_1544348_1545830_+	PRK09427, PRK09427, bifunctional indole-3-glycerol-phosphate synthase TrpC/phosphoribosylanthranilate isomerase TrpF	NA|398aa|up_1|NC_010162.1_1545887_1547081_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|275aa|up_0|NC_010162.1_1547077_1547902_+	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|1135aa|down_0|NC_010162.1_1549217_1552622_-	TIGR03303, OM_YaeT, outer membrane protein assembly complex, YaeT protein	NA|268aa|down_1|NC_010162.1_1552849_1553653_+	pfam12974, Phosphonate-bd, ABC transporter, phosphonate, periplasmic substrate-binding protein	NA|1473aa|down_2|NC_010162.1_1553710_1558129_-	pfam04357, TamB, TamB, inner membrane protein subunit of TAM complex	NA|558aa|down_3|NC_010162.1_1558345_1560019_-	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|545aa|down_4|NC_010162.1_1560205_1561840_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|403aa|down_5|NC_010162.1_1562153_1563362_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|410aa|down_6|NC_010162.1_1563363_1564593_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|229aa|down_7|NC_010162.1_1564585_1565272_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|231aa|down_8|NC_010162.1_1565846_1566539_-	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|319aa|down_9|NC_010162.1_1566750_1567707_+	cd08414, PBP2_LTTR_aromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of aromatic compounds and that of other related regulators, contains type 2 periplasmic binding fold
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	5	1813884-1813954	3	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GAGCGACGCGGCGAGGAGCGCGG	23	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|84aa|up_9|NC_010162.1_1799940_1800192_+,NA|181aa|down_1|NC_010162.1_1815223_1815766_+,NA|343aa|down_3|NC_010162.1_1817088_1818117_+,NA|83aa|down_5|NC_010162.1_1819575_1819824_+	NA|84aa|up_9|NC_010162.1_1799940_1800192_+	NA	NA|710aa|up_8|NC_010162.1_1800272_1802402_+	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|440aa|up_7|NC_010162.1_1802412_1803732_-	PRK01346, PRK01346, enhanced intracellular survival protein Eis	NA|276aa|up_6|NC_010162.1_1803737_1804565_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|597aa|up_5|NC_010162.1_1804721_1806512_-	COG2146, {NirD}, Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only]	NA|389aa|up_4|NC_010162.1_1806648_1807815_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|636aa|up_3|NC_010162.1_1808059_1809967_+	TIGR03897, Lantibiotic_mersacidin_modifying_enzyme, type 2 lantibiotic biosynthesis protein LanM	NA|130aa|up_2|NC_010162.1_1809999_1810389_+	pfam14552, Tautomerase_2, Tautomerase enzyme	NA|279aa|up_1|NC_010162.1_1810454_1811291_-	COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism]	NA|443aa|up_0|NC_010162.1_1811404_1812733_-	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|168aa|down_0|NC_010162.1_1814267_1814771_+	cd03379, beta_CA_cladeD, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|181aa|down_1|NC_010162.1_1815223_1815766_+	NA	NA|320aa|down_2|NC_010162.1_1815762_1816722_+	cd05151, ChoK-like, Choline Kinase and similar proteins	NA|343aa|down_3|NC_010162.1_1817088_1818117_+	NA	NA|299aa|down_4|NC_010162.1_1818252_1819149_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|83aa|down_5|NC_010162.1_1819575_1819824_+	NA	NA|510aa|down_6|NC_010162.1_1819956_1821486_+	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|361aa|down_7|NC_010162.1_1823303_1824386_+	cd13555, PBP2_sulfate_ester_like, Sulfate ester binding protein-like, the type 2 periplasmic binding protein fold	NA|287aa|down_8|NC_010162.1_1824450_1825311_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|261aa|down_9|NC_010162.1_1825358_1826141_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	6	1846709-1846881	4	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GCGGCGCTCGGCTCGGCGGGCGC	23	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA|77aa|down_1|NC_010162.1_1847913_1848144_-,NA|31aa|down_9|NC_010162.1_1864133_1864226_+	NA|268aa|up_9|NC_010162.1_1830767_1831571_-	pfam01569, PAP2, PAP2 superfamily	NA|546aa|up_8|NC_010162.1_1831567_1833205_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|587aa|up_7|NC_010162.1_1833215_1834976_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|800aa|up_6|NC_010162.1_1835077_1837477_-	pfam01186, Lysyl_oxidase, Lysyl oxidase	NA|262aa|up_5|NC_010162.1_1837716_1838502_+	cd05332, 11beta-HSD1_like_SDR_c, 11beta-hydroxysteroid dehydrogenase type 1 (11beta-HSD1)-like, classical (c) SDRs	NA|771aa|up_4|NC_010162.1_1838614_1840927_-	COG3488, COG3488, Predicted thiol oxidoreductase [Energy production and conversion]	NA|516aa|up_3|NC_010162.1_1841006_1842554_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|541aa|up_2|NC_010162.1_1842654_1844277_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|318aa|up_1|NC_010162.1_1844403_1845357_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|340aa|up_0|NC_010162.1_1845360_1846380_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|209aa|down_0|NC_010162.1_1847290_1847917_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|77aa|down_1|NC_010162.1_1847913_1848144_-	NA	NA|180aa|down_2|NC_010162.1_1848379_1848919_-	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|346aa|down_3|NC_010162.1_1851451_1852489_+	cd06325, PBP1_ABC_unchar_transporter, type 1 periplasmic ligand-binding domain of uncharacterized ABC-type transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|154aa|down_4|NC_010162.1_1852656_1853118_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|950aa|down_5|NC_010162.1_1853123_1855973_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|730aa|down_6|NC_010162.1_1856033_1858223_+	TIGR01785, Heme/hemopexin_utilization_protein_C, TonB-dependent heme/hemoglobin receptor family protein	NA|457aa|down_7|NC_010162.1_1858484_1859855_-	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|567aa|down_8|NC_010162.1_1860226_1861927_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|31aa|down_9|NC_010162.1_1864133_1864226_+	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	7	1969286-1969380	5	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CCTCGCGCGCCTCGCCGCTCGCG	23	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|357aa|up_6|NC_010162.1_1957563_1958634_-,NA|356aa|up_5|NC_010162.1_1958930_1959998_-,NA|197aa|up_4|NC_010162.1_1960313_1960904_-,NA|234aa|down_0|NC_010162.1_1971790_1972492_+,NA|184aa|down_1|NC_010162.1_1972587_1973139_+	NA|436aa|up_9|NC_010162.1_1949478_1950786_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1056aa|up_8|NC_010162.1_1950782_1953950_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|722aa|up_7|NC_010162.1_1955085_1957251_-	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|357aa|up_6|NC_010162.1_1957563_1958634_-	NA	NA|356aa|up_5|NC_010162.1_1958930_1959998_-	NA	NA|197aa|up_4|NC_010162.1_1960313_1960904_-	NA	NA|493aa|up_3|NC_010162.1_1961054_1962533_+	sd00006, TPR, Tetratricopeptide repeat	NA|565aa|up_2|NC_010162.1_1962591_1964286_+	cd14957, NHL_like_2, Uncharacterized NHL-repeat domain in bacterial and archaeal proteins	NA|436aa|up_1|NC_010162.1_1964568_1965876_+	COG3950, COG3950, Predicted ATP-binding protein involved in virulence [General function prediction only]	NA|265aa|up_0|NC_010162.1_1965872_1966667_+	TIGR02646, Hypothetical_protein_SMc04429, TIGR02646 family protein	NA|234aa|down_0|NC_010162.1_1971790_1972492_+	NA	NA|184aa|down_1|NC_010162.1_1972587_1973139_+	NA	NA|717aa|down_2|NC_010162.1_1973388_1975539_+	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|442aa|down_3|NC_010162.1_1975568_1976894_-	pfam04389, Peptidase_M28, Peptidase family M28	NA|451aa|down_4|NC_010162.1_1977201_1978554_-	cd00548, NrfA-like, cytochrome c nitrite reductase and similar proteins	NA|145aa|down_5|NC_010162.1_1978659_1979094_-	TIGR03153, cytochr_NrfH, cytochrome c nitrite reductase, small subunit	NA|415aa|down_6|NC_010162.1_1979641_1980886_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|772aa|down_7|NC_010162.1_1981414_1983730_+	cd00687, Terpene_cyclase_nonplant_C1, Non-plant Terpene Cyclases, Class 1	NA|469aa|down_8|NC_010162.1_1983751_1985158_+	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|467aa|down_9|NC_010162.1_1985254_1986655_+	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	8	2015485-2015599	6	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GAGGTCGTCGAGGCGCTCGAGGTCGTCGCGCTCGCGGAGGTCG	43	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA|199aa|down_2|NC_010162.1_2019599_2020196_-,NA|126aa|down_6|NC_010162.1_2024095_2024473_-,NA|145aa|down_7|NC_010162.1_2024448_2024883_-,NA|125aa|down_8|NC_010162.1_2025380_2025755_-,NA|193aa|down_9|NC_010162.1_2025807_2026386_-	NA|164aa|up_9|NC_010162.1_2001872_2002364_-	pfam03168, LEA_2, Late embryogenesis abundant protein	NA|309aa|up_8|NC_010162.1_2002516_2003443_+	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|330aa|up_7|NC_010162.1_2005644_2006634_+	PRK05269, PRK05269, transaldolase B; Provisional	NA|279aa|up_6|NC_010162.1_2006866_2007703_+	pfam13489, Methyltransf_23, Methyltransferase domain	NA|283aa|up_5|NC_010162.1_2007714_2008563_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|344aa|up_4|NC_010162.1_2008588_2009620_+	cd10933, CE4_u9, Putative catalytic domain of uncharacterized bacterial proteins from the carbohydrate esterase 4 superfamily	NA|335aa|up_3|NC_010162.1_2009773_2010778_+	pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus	NA|109aa|up_2|NC_010162.1_2011040_2011367_+	PRK00982, acpP, acyl carrier protein; Provisional	NA|315aa|up_1|NC_010162.1_2011363_2012308_+	COG2230, Cfa, Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane]	NA|558aa|up_0|NC_010162.1_2012304_2013978_+	cd05931, FAAL, Fatty acyl-AMP ligase (FAAL)	NA|492aa|down_0|NC_010162.1_2016259_2017735_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|472aa|down_1|NC_010162.1_2018095_2019511_+	cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|199aa|down_2|NC_010162.1_2019599_2020196_-	NA	NA|476aa|down_3|NC_010162.1_2020369_2021797_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|134aa|down_4|NC_010162.1_2021843_2022245_-	pfam01713, Smr, Smr domain	NA|418aa|down_5|NC_010162.1_2022340_2023594_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|126aa|down_6|NC_010162.1_2024095_2024473_-	NA	NA|145aa|down_7|NC_010162.1_2024448_2024883_-	NA	NA|125aa|down_8|NC_010162.1_2025380_2025755_-	NA	NA|193aa|down_9|NC_010162.1_2025807_2026386_-	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	9	2150276-2150385	7	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GCGGCGCGAGCGGCTCGACGAGC	23	1	12	2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323|2150299-2150323	NC_010162.1_9985161-9985185|NC_010162.1_304192-304168|NC_010162.1_1862307-1862283|NC_010162.1_1862316-1862292|NC_010162.1_1862325-1862301|NC_010162.1_1862397-1862373|NC_010162.1_1891686-1891710|NC_010162.1_1927591-1927615|NC_010162.1_5780065-5780041|NC_010162.1_6108560-6108584|NC_010162.1_8639966-8639990|NC_010162.1_8829529-8829553	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|196aa|up_9|NC_010162.1_2138639_2139227_-,NA|111aa|up_1|NC_010162.1_2147828_2148161_-,NA|329aa|down_1|NC_010162.1_2152000_2152987_-,NA|58aa|down_2|NC_010162.1_2152987_2153161_-,NA|291aa|down_4|NC_010162.1_2154998_2155871_-,NA|109aa|down_7|NC_010162.1_2162030_2162357_+,NA|306aa|down_8|NC_010162.1_2162322_2163240_-,NA|261aa|down_9|NC_010162.1_2163815_2164598_+	NA|196aa|up_9|NC_010162.1_2138639_2139227_-	NA	NA|481aa|up_8|NC_010162.1_2139268_2140711_-	pfam15523, Ntox16, Novel toxin 16	NA|479aa|up_7|NC_010162.1_2140904_2142341_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|325aa|up_6|NC_010162.1_2142344_2143319_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|397aa|up_5|NC_010162.1_2143472_2144663_-	TIGR03181, PDH_E1_alph_x, pyruvate dehydrogenase E1 component, alpha subunit	NA|190aa|up_4|NC_010162.1_2144866_2145436_-	cd07176, terB, tellurite resistance protein terB	NA|373aa|up_3|NC_010162.1_2145514_2146633_-	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|344aa|up_2|NC_010162.1_2146682_2147714_-	COG0182, COG0182, Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis]	NA|111aa|up_1|NC_010162.1_2147828_2148161_-	NA	NA|280aa|up_0|NC_010162.1_2148171_2149011_-	PRK08278, PRK08278, SDR family oxidoreductase	NA|277aa|down_0|NC_010162.1_2151118_2151949_+	COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|329aa|down_1|NC_010162.1_2152000_2152987_-	NA	NA|58aa|down_2|NC_010162.1_2152987_2153161_-	NA	NA|399aa|down_3|NC_010162.1_2153824_2155021_+	COG0820, COG0820, Predicted Fe-S-cluster redox enzyme [General function prediction only]	NA|291aa|down_4|NC_010162.1_2154998_2155871_-	NA	NA|1158aa|down_5|NC_010162.1_2156469_2159943_+	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain	NA|92aa|down_6|NC_010162.1_2161422_2161698_-	pfam08881, CVNH, CVNH domain	NA|109aa|down_7|NC_010162.1_2162030_2162357_+	NA	NA|306aa|down_8|NC_010162.1_2162322_2163240_-	NA	NA|261aa|down_9|NC_010162.1_2163815_2164598_+	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	10	2251718-2251799	8	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGCGCGTCAGCCGAGCTGAGCGCGC	25	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|176aa|up_7|NC_010162.1_2241010_2241538_+,NA|158aa|up_6|NC_010162.1_2241798_2242272_+,NA|81aa|down_3|NC_010162.1_2256033_2256276_+,NA|140aa|down_5|NC_010162.1_2256983_2257403_-	NA|385aa|up_9|NC_010162.1_2237939_2239094_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|571aa|up_8|NC_010162.1_2239209_2240922_+	pfam05960, DUF885, Bacterial protein of unknown function (DUF885)	NA|176aa|up_7|NC_010162.1_2241010_2241538_+	NA	NA|158aa|up_6|NC_010162.1_2241798_2242272_+	NA	NA|381aa|up_5|NC_010162.1_2242524_2243668_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|558aa|up_4|NC_010162.1_2243712_2245386_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|721aa|up_3|NC_010162.1_2245382_2247545_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|228aa|up_2|NC_010162.1_2247603_2248287_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|694aa|up_1|NC_010162.1_2248275_2250357_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|339aa|up_0|NC_010162.1_2250549_2251566_-	pfam04958, AstA, Arginine N-succinyltransferase beta subunit	NA|245aa|down_0|NC_010162.1_2252673_2253408_+	TIGR02227, Inactive_signal_peptidase_IA	NA|608aa|down_1|NC_010162.1_2253417_2255241_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|175aa|down_2|NC_010162.1_2255443_2255968_+	PRK14965, PRK14965, DNA polymerase III subunits gamma and tau; Provisional	NA|81aa|down_3|NC_010162.1_2256033_2256276_+	NA	NA|223aa|down_4|NC_010162.1_2256278_2256947_+	TIGR02479, RNA_polymerase_sigma_factor_WhiG, RNA polymerase sigma factor, FliA/WhiG family	NA|140aa|down_5|NC_010162.1_2256983_2257403_-	NA	NA|255aa|down_6|NC_010162.1_2257686_2258451_+	PRK08217, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional	NA|71aa|down_7|NC_010162.1_2258790_2259003_+	pfam04324, Fer2_BFD, BFD-like [2Fe-2S] binding domain	NA|157aa|down_8|NC_010162.1_2259201_2259672_+	cd00907, Bacterioferritin, Bacterioferritin, ferritin-like diiron-binding domain	NA|452aa|down_9|NC_010162.1_2259883_2261239_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	11	2960782-2960874	9	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GTCCGGCCCTCCTGGGTTTCGCAGGCTCGCCGG	33	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|367aa|up_5|NC_010162.1_2950252_2951353_+,NA|170aa|down_2|NC_010162.1_2964195_2964705_+,NA|182aa|down_3|NC_010162.1_2964862_2965408_+,NA|126aa|down_5|NC_010162.1_2966345_2966723_+,NA|185aa|down_6|NC_010162.1_2967003_2967558_+	NA|227aa|up_9|NC_010162.1_2946156_2946837_+	PRK06202, PRK06202, hypothetical protein; Provisional	NA|388aa|up_8|NC_010162.1_2946833_2947997_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|233aa|up_7|NC_010162.1_2948116_2948815_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|414aa|up_6|NC_010162.1_2948811_2950053_+	pfam04773, FecR, FecR protein	NA|367aa|up_5|NC_010162.1_2950252_2951353_+	NA	NA|450aa|up_4|NC_010162.1_2951699_2953049_-	pfam00331, Glyco_hydro_10, Glycosyl hydrolase family 10	NA|369aa|up_3|NC_010162.1_2953427_2954534_-	cd01951, lectin_L-type, legume lectins	NA|233aa|up_2|NC_010162.1_2954758_2955457_+	COG3571, COG3571, Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]	NA|463aa|up_1|NC_010162.1_2955755_2957144_+	pfam00067, p450, Cytochrome P450	NA|378aa|up_0|NC_010162.1_2957226_2958360_-	pfam07090, GATase1_like, Putative glutamine amidotransferase	NA|207aa|down_0|NC_010162.1_2962427_2963048_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|206aa|down_1|NC_010162.1_2963545_2964163_+	sd00045, ANK, ankyrin repeats	NA|170aa|down_2|NC_010162.1_2964195_2964705_+	NA	NA|182aa|down_3|NC_010162.1_2964862_2965408_+	NA	NA|120aa|down_4|NC_010162.1_2965594_2965954_+	pfam15586, Imm8, Immunity protein 8	NA|126aa|down_5|NC_010162.1_2966345_2966723_+	NA	NA|185aa|down_6|NC_010162.1_2967003_2967558_+	NA	NA|185aa|down_7|NC_010162.1_2967574_2968129_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|589aa|down_8|NC_010162.1_2968125_2969892_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|573aa|down_9|NC_010162.1_2969888_2971607_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	12	3857397-3857511	10	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGTCCGGCGCTCCGGCAGGTCGCGATCTGGG	31	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|303aa|up_0|NC_010162.1_3856358_3857267_+,NA|83aa|down_1|NC_010162.1_3858619_3858868_+,NA|151aa|down_2|NC_010162.1_3859277_3859730_+,NA|126aa|down_4|NC_010162.1_3861225_3861603_-,NA|283aa|down_5|NC_010162.1_3862559_3863408_+,NA|653aa|down_9|NC_010162.1_3868563_3870522_+	NA|181aa|up_9|NC_010162.1_3847578_3848121_+	pfam01725, Ham1p_like, Ham1 family	NA|346aa|up_8|NC_010162.1_3848408_3849446_+	cd19091, AKR_PsAKR, Polaromonas Sp	NA|229aa|up_7|NC_010162.1_3849602_3850289_+	COG4359, COG4359, Uncharacterized conserved protein [Function unknown]	NA|159aa|up_6|NC_010162.1_3850365_3850842_-	pfam16242, Pyrid_ox_like, Pyridoxamine 5'-phosphate oxidase like	NA|318aa|up_5|NC_010162.1_3851223_3852177_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|226aa|up_4|NC_010162.1_3852196_3852874_-	PRK13141, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|196aa|up_3|NC_010162.1_3852877_3853465_-	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|530aa|up_2|NC_010162.1_3853660_3855250_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|372aa|up_1|NC_010162.1_3855246_3856362_+	pfam08308, PEGA, PEGA domain	NA|303aa|up_0|NC_010162.1_3856358_3857267_+	NA	NA|187aa|down_0|NC_010162.1_3858062_3858623_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|83aa|down_1|NC_010162.1_3858619_3858868_+	NA	NA|151aa|down_2|NC_010162.1_3859277_3859730_+	NA	NA|383aa|down_3|NC_010162.1_3859808_3860957_-	PRK14954, PRK14954, DNA polymerase III subunits gamma and tau; Provisional	NA|126aa|down_4|NC_010162.1_3861225_3861603_-	NA	NA|283aa|down_5|NC_010162.1_3862559_3863408_+	NA	NA|699aa|down_6|NC_010162.1_3863426_3865523_-	sd00001, TSP3, Calcium-binding Thrombospondin type 3 (TSP3) repeat	NA|587aa|down_7|NC_010162.1_3865611_3867372_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|337aa|down_8|NC_010162.1_3867547_3868558_+	pfam08308, PEGA, PEGA domain	NA|653aa|down_9|NC_010162.1_3868563_3870522_+	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	13	4497267-4497390	11	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GGCGCAGGGCGCACCGGCGGCGGCG	25	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|484aa|up_0|NC_010162.1_4492895_4494347_+,NA|440aa|down_1|NC_010162.1_4499253_4500573_+	NA|676aa|up_9|NC_010162.1_4475352_4477380_+	cd00306, Peptidases_S8_S53, Peptidase domain in the S8 and S53 families	NA|784aa|up_8|NC_010162.1_4477390_4479742_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|496aa|up_7|NC_010162.1_4479738_4481226_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|427aa|up_6|NC_010162.1_4481243_4482524_+	pfam00656, Peptidase_C14, Caspase domain	NA|1481aa|up_5|NC_010162.1_4482610_4487053_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|349aa|up_4|NC_010162.1_4487532_4488579_-	TIGR04039, MXAN_0977_Heme2, di-heme enzyme, MXAN_0977 family	NA|331aa|up_3|NC_010162.1_4490063_4491056_-	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|203aa|up_2|NC_010162.1_4491086_4491695_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|258aa|up_1|NC_010162.1_4491829_4492603_-	cd07712, MBLAC2-like_MBL-fold, uncharacterized human metallo-beta-lactamase domain-containing protein 2 and related proteins; MBL-fold metallo hydrolase domain	NA|484aa|up_0|NC_010162.1_4492895_4494347_+	NA	NA|99aa|down_0|NC_010162.1_4498892_4499189_+	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|440aa|down_1|NC_010162.1_4499253_4500573_+	NA	NA|498aa|down_2|NC_010162.1_4500869_4502363_-	COG3405, CelA, Endoglucanase Y [Carbohydrate transport and metabolism]	NA|1394aa|down_3|NC_010162.1_4503405_4507587_+	TIGR03903, TOMM_kin_cyc, TOMM system kinase/cyclase fusion protein	NA|480aa|down_4|NC_010162.1_4508138_4509578_-	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|196aa|down_5|NC_010162.1_4509581_4510169_-	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|283aa|down_6|NC_010162.1_4510392_4511241_-	PRK06180, PRK06180, short chain dehydrogenase; Provisional	NA|320aa|down_7|NC_010162.1_4511365_4512325_+	cd08474, PBP2_CrgA_like_5, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|636aa|down_8|NC_010162.1_4512272_4514180_-	TIGR03296, hypothetical_protein, M6 family metalloprotease domain	NA|522aa|down_9|NC_010162.1_4514372_4515938_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	14	4582802-4582913	12	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GGGCGCTCTGCGTTGCGGTCCGGAGGG	27	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA	NA|527aa|up_9|NC_010162.1_4568847_4570428_-	pfam08757, CotH, CotH kinase protein	NA|428aa|up_8|NC_010162.1_4570677_4571961_+	pfam01082, Cu2_monooxygen, Copper type II ascorbate-dependent monooxygenase, N-terminal domain	NA|458aa|up_7|NC_010162.1_4572064_4573438_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|493aa|up_6|NC_010162.1_4573434_4574913_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|315aa|up_5|NC_010162.1_4575218_4576163_+	COG3031, PulC, Type II secretory pathway, component PulC [Intracellular trafficking and secretion]	NA|153aa|up_4|NC_010162.1_4576171_4576630_-	PRK08213, PRK08213, gluconate 5-dehydrogenase; Provisional	NA|747aa|up_3|NC_010162.1_4576970_4579211_+	PRK14501, PRK14501, putative bifunctional trehalose-6-phosphate synthase/HAD hydrolase subfamily IIB; Provisional	NA|470aa|up_2|NC_010162.1_4579274_4580684_-	pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1	NA|337aa|up_1|NC_010162.1_4580936_4581947_+	pfam01636, APH, Phosphotransferase enzyme family	NA|216aa|up_0|NC_010162.1_4581915_4582563_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|311aa|down_0|NC_010162.1_4583237_4584170_-	pfam06719, AraC_N, AraC-type transcriptional regulator N-terminus	NA|284aa|down_1|NC_010162.1_4584322_4585174_+	cd05362, THN_reductase-like_SDR_c, tetrahydroxynaphthalene/trihydroxynaphthalene reductase-like, classical (c) SDRs	NA|443aa|down_2|NC_010162.1_4585708_4587037_+	PRK07772, PRK07772, single-stranded DNA-binding protein; Provisional	NA|362aa|down_3|NC_010162.1_4587060_4588146_+	pfam11790, Glyco_hydro_cc, Glycosyl hydrolase catalytic core	NA|357aa|down_4|NC_010162.1_4588703_4589774_+	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|457aa|down_5|NC_010162.1_4590002_4591373_+	TIGR01840, poly3-hydroxybutyrate_depolymerase_A_precursor, esterase, PHB depolymerase family	NA|440aa|down_6|NC_010162.1_4591394_4592714_-	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]	NA|137aa|down_7|NC_010162.1_4592909_4593320_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|520aa|down_8|NC_010162.1_4593429_4594989_-	PRK05022, PRK05022, nitric oxide reductase transcriptional regulator NorR	NA|773aa|down_9|NC_010162.1_4595195_4597514_+	COG3256, NorB, Nitric oxide reductase large subunit [Inorganic ion transport and metabolism]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	15	4620752-4621080	13	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CTGCGACGAGCCCGAGGGCACCGGGCGCTGCGAGGAGGGCACGGACGTCG	50	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|201aa|up_9|NC_010162.1_4605982_4606585_-,NA|222aa|up_0|NC_010162.1_4617914_4618580_-,NA|273aa|down_2|NC_010162.1_4624082_4624901_+	NA|201aa|up_9|NC_010162.1_4605982_4606585_-	NA	NA|163aa|up_8|NC_010162.1_4606787_4607276_+	pfam01814, Hemerythrin, Hemerythrin HHE cation binding domain	NA|973aa|up_7|NC_010162.1_4607318_4610237_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|578aa|up_6|NC_010162.1_4610239_4611973_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|449aa|up_5|NC_010162.1_4612169_4613516_-	COG0427, ACH1, Acetyl-CoA hydrolase [Energy production and conversion]	NA|126aa|up_4|NC_010162.1_4613616_4613994_-	cd04584, CBS_pair_AcuB_like, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the ACT domain	NA|264aa|up_3|NC_010162.1_4614218_4615010_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|225aa|up_2|NC_010162.1_4615426_4616101_+	pfam00582, Usp, Universal stress protein family	NA|454aa|up_1|NC_010162.1_4616332_4617694_+	pfam03629, SASA, Carbohydrate esterase, sialic acid-specific acetylesterase	NA|222aa|up_0|NC_010162.1_4617914_4618580_-	NA	NA|381aa|down_0|NC_010162.1_4621454_4622597_+	cd08884, RHO_alpha_C_GbcA-like, C-terminal catalytic domain of GbcA (glycine betaine catabolism A) from Pseudomonas aeruginosa PAO1 and related aromatic ring hydroxylating dioxygenases	NA|283aa|down_1|NC_010162.1_4622641_4623490_-	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|273aa|down_2|NC_010162.1_4624082_4624901_+	NA	NA|387aa|down_3|NC_010162.1_4624937_4626098_-	pfam15887, Peptidase_Mx, Putative zinc-binding metallo-peptidase	NA|790aa|down_4|NC_010162.1_4626586_4628956_-	cd09001, GH43_FsAxh1-like, Glycosyl hydrolase family 43 such as Fibrobacter succinogenes subsp	NA|350aa|down_5|NC_010162.1_4629133_4630183_-	PRK11618, PRK11618, inner membrane ABC transporter permease protein YjfF; Provisional	NA|332aa|down_6|NC_010162.1_4630179_4631175_-	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|508aa|down_7|NC_010162.1_4631234_4632758_-	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|324aa|down_8|NC_010162.1_4632804_4633776_-	cd06309, PBP1_galactofuranose_YtfQ-like, periplasmic binding domain of ABC-type galactofuranose YtfQ-like transport systems	NA|233aa|down_9|NC_010162.1_4633859_4634558_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	16	4986753-4987836	3,14,4,15	CRT,CRISPRCasFinder,PILER-CR,CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	TCGGCGACGCNTGCGACAACTGCC,GGTCGGCGACGCGTGCGACAACTGCGCTGG,GGCAGTTGTCGCACGCGTCGCCGATCCCGTCG,GGTCGGCGACGCGTGCGACAACTGCGCTGG	24,30,32,30	0	0	NA	NA	NA:NA:NA:NA	14,12,2,12	14	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA|263aa|down_6|NC_010162.1_4997988_4998777_+,NA|162aa|down_8|NC_010162.1_5000651_5001137_+	NA|434aa|up_9|NC_010162.1_4967593_4968895_-	cd19963, PBP1_BMP-like, periplasmic binding component of a basic membrane lipoprotein (BMP) from Brucella abortus and its close homologs in other bacteria	NA|407aa|up_8|NC_010162.1_4968982_4970203_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|498aa|up_7|NC_010162.1_4970675_4972169_+	TIGR00785, Uncharacterized_transporter_HI_0020, anion transporter	NA|403aa|up_6|NC_010162.1_4972165_4973374_+	pfam02595, Gly_kinase, Glycerate kinase family	NA|513aa|up_5|NC_010162.1_4973394_4974933_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|427aa|up_4|NC_010162.1_4975096_4976377_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|282aa|up_3|NC_010162.1_4977042_4977888_+	pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen	NA|603aa|up_2|NC_010162.1_4978447_4980256_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|632aa|up_1|NC_010162.1_4980292_4982188_+	pfam08308, PEGA, PEGA domain	NA|1234aa|up_0|NC_010162.1_4982264_4985966_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|228aa|down_0|NC_010162.1_4989984_4990668_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|598aa|down_1|NC_010162.1_4990933_4992727_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|468aa|down_2|NC_010162.1_4993078_4994482_+	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|258aa|down_3|NC_010162.1_4994602_4995376_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|354aa|down_4|NC_010162.1_4995428_4996490_-	cd14656, Imelysin-like_EfeO, EfeO is a component of the EfeUOB operon	NA|411aa|down_5|NC_010162.1_4996571_4997804_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|263aa|down_6|NC_010162.1_4997988_4998777_+	NA	NA|514aa|down_7|NC_010162.1_4998813_5000355_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|162aa|down_8|NC_010162.1_5000651_5001137_+	NA	NA|620aa|down_9|NC_010162.1_5001185_5003045_-	cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to fadE5
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	17	5068437-5068547	16	CRISPRCasFinder	no	cas3	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Unclear	CGCCTCAGGCGCTGGCCGGCGTGACGGGCGCCTCGAAGGCG	41	0	0	NA	NA	NA	1	1	Unclear	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|180aa|up_5|NC_010162.1_5060195_5060735_-,NA|345aa|up_3|NC_010162.1_5062378_5063413_-,NA|56aa|down_7|NC_010162.1_5086464_5086632_-	NA|132aa|up_9|NC_010162.1_5054431_5054827_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|450aa|up_8|NC_010162.1_5054850_5056200_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|230aa|up_7|NC_010162.1_5056803_5057493_-	sd00006, TPR, Tetratricopeptide repeat	cas3|801aa|up_6|NC_010162.1_5057726_5060129_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|180aa|up_5|NC_010162.1_5060195_5060735_-	NA	NA|496aa|up_4|NC_010162.1_5060875_5062363_+	pfam05672, MAP7, MAP7 (E-MAP-115) family	NA|345aa|up_3|NC_010162.1_5062378_5063413_-	NA	NA|499aa|up_2|NC_010162.1_5063607_5065104_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|530aa|up_1|NC_010162.1_5065229_5066819_+	PRK11903, PRK11903, 3,4-dehydroadipyl-CoA semialdehyde dehydrogenase	NA|519aa|up_0|NC_010162.1_5066872_5068429_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|1723aa|down_0|NC_010162.1_5071498_5076667_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|95aa|down_1|NC_010162.1_5076698_5076983_-	cd06839, PLPDE_III_Btrk_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Btrk Decarboxylase	NA|108aa|down_2|NC_010162.1_5077000_5077324_-	smart00857, Resolvase, Resolvase, N terminal domain	NA|263aa|down_3|NC_010162.1_5077839_5078628_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|154aa|down_4|NC_010162.1_5078652_5079114_+	COG4270, COG4270, Predicted membrane protein [Function unknown]	NA|154aa|down_5|NC_010162.1_5084147_5084609_+	pfam13676, TIR_2, TIR domain	NA|585aa|down_6|NC_010162.1_5084637_5086392_-	pfam00339, Arrestin_N, Arrestin (or S-antigen), N-terminal domain	NA|56aa|down_7|NC_010162.1_5086464_5086632_-	NA	NA|1302aa|down_8|NC_010162.1_5086643_5090549_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|438aa|down_9|NC_010162.1_5090904_5092218_-	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	18	5641050-5641214	17	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	AGGTGCGCAGGAGGGTCGCGGGGGCTGCGCAGGAG	35	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|81aa|up_8|NC_010162.1_5633771_5634014_+,NA|132aa|up_7|NC_010162.1_5634029_5634425_-,NA|129aa|up_6|NC_010162.1_5634588_5634975_+,NA|73aa|up_5|NC_010162.1_5635097_5635316_-,NA|224aa|up_2|NC_010162.1_5637814_5638486_+,NA|667aa|up_0|NC_010162.1_5638985_5640986_+,NA|212aa|down_1|NC_010162.1_5643413_5644049_-,NA|57aa|down_3|NC_010162.1_5645387_5645558_+,NA|293aa|down_6|NC_010162.1_5648244_5649123_+,NA|106aa|down_8|NC_010162.1_5650559_5650877_-	NA|480aa|up_9|NC_010162.1_5632333_5633773_-	PHA03169, PHA03169, hypothetical protein; Provisional	NA|81aa|up_8|NC_010162.1_5633771_5634014_+	NA	NA|132aa|up_7|NC_010162.1_5634029_5634425_-	NA	NA|129aa|up_6|NC_010162.1_5634588_5634975_+	NA	NA|73aa|up_5|NC_010162.1_5635097_5635316_-	NA	NA|395aa|up_4|NC_010162.1_5635459_5636644_-	cd00519, Lipase_3, Lipase (class 3)	NA|161aa|up_3|NC_010162.1_5637325_5637808_+	pfam13529, Peptidase_C39_2, Peptidase_C39 like family	NA|224aa|up_2|NC_010162.1_5637814_5638486_+	NA	NA|164aa|up_1|NC_010162.1_5638547_5639039_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|667aa|up_0|NC_010162.1_5638985_5640986_+	NA	NA|707aa|down_0|NC_010162.1_5641296_5643417_-	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain	NA|212aa|down_1|NC_010162.1_5643413_5644049_-	NA	NA|388aa|down_2|NC_010162.1_5644088_5645252_-	pfam13683, rve_3, Integrase core domain	NA|57aa|down_3|NC_010162.1_5645387_5645558_+	NA	NA|132aa|down_4|NC_010162.1_5645955_5646351_+	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|608aa|down_5|NC_010162.1_5646416_5648240_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|293aa|down_6|NC_010162.1_5648244_5649123_+	NA	NA|287aa|down_7|NC_010162.1_5649122_5649983_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|106aa|down_8|NC_010162.1_5650559_5650877_-	NA	NA|129aa|down_9|NC_010162.1_5651166_5651553_+	pfam13360, PQQ_2, PQQ-like domain
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	19	6823169-6823266	18	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGGTGCCGGCACGGGCGTCGGTGGCGGCGGCGGCA	35	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|154aa|up_6|NC_010162.1_6816750_6817212_-,NA|388aa|down_0|NC_010162.1_6824640_6825804_-,NA|385aa|down_3|NC_010162.1_6827890_6829045_+	NA|193aa|up_9|NC_010162.1_6813604_6814183_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|401aa|up_8|NC_010162.1_6814521_6815724_+	cd01159, NcnH, Naphthocyclinone hydroxylase	NA|330aa|up_7|NC_010162.1_6815752_6816742_-	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|154aa|up_6|NC_010162.1_6816750_6817212_-	NA	NA|226aa|up_5|NC_010162.1_6817297_6817975_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|155aa|up_4|NC_010162.1_6818184_6818649_-	pfam07080, DUF1348, Protein of unknown function (DUF1348)	NA|131aa|up_3|NC_010162.1_6818935_6819328_+	COG3686, COG3686, Predicted membrane protein [Function unknown]	NA|359aa|up_2|NC_010162.1_6819356_6820433_-	COG2220, COG2220, Predicted Zn-dependent hydrolases of the beta-lactamase fold [General function prediction only]	NA|192aa|up_1|NC_010162.1_6820429_6821005_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|515aa|up_0|NC_010162.1_6821111_6822656_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|388aa|down_0|NC_010162.1_6824640_6825804_-	NA	NA|407aa|down_1|NC_010162.1_6825793_6827014_-	pfam04773, FecR, FecR protein	NA|225aa|down_2|NC_010162.1_6827001_6827676_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|385aa|down_3|NC_010162.1_6827890_6829045_+	NA	NA|525aa|down_4|NC_010162.1_6829079_6830654_+	cd06421, CESA_CelA_like, CESA_CelA_like are involved in the elongation of the glucan chain of cellulose	NA|389aa|down_5|NC_010162.1_6830656_6831823_+	COG4124, ManB, Beta-mannanase [Carbohydrate transport and metabolism]	NA|256aa|down_6|NC_010162.1_6831819_6832587_+	cd02524, G1P_cytidylyltransferase, G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose	NA|356aa|down_7|NC_010162.1_6832589_6833657_+	TIGR02622, CDP-glucose_46-dehydratase, CDP-glucose 4,6-dehydratase	NA|313aa|down_8|NC_010162.1_6833653_6834592_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|876aa|down_9|NC_010162.1_6835363_6837991_+	cd02851, E_set_GO_C, C-terminal Early set domain associated with the catalytic domain of galactose oxidase
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	20	7284402-7284491	19	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGACCCAACCAGGTTGACCTGACC	24	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|483aa|up_3|NC_010162.1_7277159_7278608_-,NA|307aa|down_0|NC_010162.1_7284614_7285535_+,NA|317aa|down_3|NC_010162.1_7288209_7289160_-,NA|202aa|down_4|NC_010162.1_7289656_7290262_+	NA|291aa|up_9|NC_010162.1_7263660_7264533_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|1410aa|up_8|NC_010162.1_7265093_7269323_+	TIGR03903, TOMM_kin_cyc, TOMM system kinase/cyclase fusion protein	NA|150aa|up_7|NC_010162.1_7269474_7269924_+	TIGR03795, RNP_Burkhold, ribosomal natural product, two-chain TOMM family	NA|391aa|up_6|NC_010162.1_7269925_7271098_-	pfam04909, Amidohydro_2, Amidohydrolase	NA|612aa|up_5|NC_010162.1_7273746_7275582_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|498aa|up_4|NC_010162.1_7275600_7277094_-	pfam01593, Amino_oxidase, Flavin containing amine oxidoreductase	NA|483aa|up_3|NC_010162.1_7277159_7278608_-	NA	NA|266aa|up_2|NC_010162.1_7279101_7279899_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|392aa|up_1|NC_010162.1_7280910_7282086_+	pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller	NA|407aa|up_0|NC_010162.1_7282149_7283370_-	PRK07877, PRK07877, Rv1355c family protein	NA|307aa|down_0|NC_010162.1_7284614_7285535_+	NA	NA|596aa|down_1|NC_010162.1_7285599_7287387_-	TIGR02680, conserved_hypothetical_protein, TIGR02680 family protein	NA|280aa|down_2|NC_010162.1_7287373_7288213_-	cd07492, Peptidases_S8_8, Peptidase S8 family domain, uncharacterized subfamily 8	NA|317aa|down_3|NC_010162.1_7288209_7289160_-	NA	NA|202aa|down_4|NC_010162.1_7289656_7290262_+	NA	NA|450aa|down_5|NC_010162.1_7291076_7292426_+	pfam14518, Haem_oxygenas_2, Iron-containing redox enzyme	NA|103aa|down_6|NC_010162.1_7292422_7292731_+	cd03467, Rieske, Rieske domain; a [2Fe-2S] cluster binding domain commonly found in Rieske non-heme iron oxygenase (RO) systems such as naphthalene and biphenyl dioxygenases, as well as in plant/cyanobacterial chloroplast b6f and mitochondrial cytochrome bc(1) complexes	NA|422aa|down_7|NC_010162.1_7292727_7293993_+	PRK12767, PRK12767, carbamoyl phosphate synthase-like protein; Provisional	NA|418aa|down_8|NC_010162.1_7293989_7295243_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|414aa|down_9|NC_010162.1_7295239_7296481_+	PRK02186, PRK02186, argininosuccinate lyase; Provisional
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	21	7289307-7289438	20	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CCCAACCCGGGTCAATGCCGGCG	23	1	2	7289388-7289415|7289388-7289415	NC_010162.1_7289231-7289258|NC_010162.1_7289279-7289306	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|483aa|up_8|NC_010162.1_7277159_7278608_-,NA|307aa|up_3|NC_010162.1_7284614_7285535_+,NA|317aa|up_0|NC_010162.1_7288209_7289160_-,NA|202aa|down_0|NC_010162.1_7289656_7290262_+	NA|498aa|up_9|NC_010162.1_7275600_7277094_-	pfam01593, Amino_oxidase, Flavin containing amine oxidoreductase	NA|483aa|up_8|NC_010162.1_7277159_7278608_-	NA	NA|266aa|up_7|NC_010162.1_7279101_7279899_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|392aa|up_6|NC_010162.1_7280910_7282086_+	pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller	NA|407aa|up_5|NC_010162.1_7282149_7283370_-	PRK07877, PRK07877, Rv1355c family protein	NA|311aa|up_4|NC_010162.1_7283485_7284418_+	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|307aa|up_3|NC_010162.1_7284614_7285535_+	NA	NA|596aa|up_2|NC_010162.1_7285599_7287387_-	TIGR02680, conserved_hypothetical_protein, TIGR02680 family protein	NA|280aa|up_1|NC_010162.1_7287373_7288213_-	cd07492, Peptidases_S8_8, Peptidase S8 family domain, uncharacterized subfamily 8	NA|317aa|up_0|NC_010162.1_7288209_7289160_-	NA	NA|202aa|down_0|NC_010162.1_7289656_7290262_+	NA	NA|450aa|down_1|NC_010162.1_7291076_7292426_+	pfam14518, Haem_oxygenas_2, Iron-containing redox enzyme	NA|103aa|down_2|NC_010162.1_7292422_7292731_+	cd03467, Rieske, Rieske domain; a [2Fe-2S] cluster binding domain commonly found in Rieske non-heme iron oxygenase (RO) systems such as naphthalene and biphenyl dioxygenases, as well as in plant/cyanobacterial chloroplast b6f and mitochondrial cytochrome bc(1) complexes	NA|422aa|down_3|NC_010162.1_7292727_7293993_+	PRK12767, PRK12767, carbamoyl phosphate synthase-like protein; Provisional	NA|418aa|down_4|NC_010162.1_7293989_7295243_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|414aa|down_5|NC_010162.1_7295239_7296481_+	PRK02186, PRK02186, argininosuccinate lyase; Provisional	NA|104aa|down_6|NC_010162.1_7296564_7296876_-	cd02226, cupin_YdbB-like, Bacillus subtilis YdbB and related proteins, cupin domain	NA|661aa|down_7|NC_010162.1_7297132_7299115_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|225aa|down_8|NC_010162.1_7299184_7299859_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|501aa|down_9|NC_010162.1_7300162_7301665_-	PRK08187, PRK08187, pyruvate kinase; Validated
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	22	7299818-7300136	4	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CNCCCAACCCGGGTCGNCCA	20	1	1	7300095-7300116	NC_010162.1_7300116-7300137	NA	5	5	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|317aa|up_8|NC_010162.1_7288209_7289160_-,NA|202aa|up_7|NC_010162.1_7289656_7290262_+,NA|175aa|down_3|NC_010162.1_7304745_7305270_-,NA|126aa|down_5|NC_010162.1_7307950_7308328_-,NA|164aa|down_6|NC_010162.1_7308308_7308800_-,NA|126aa|down_8|NC_010162.1_7311445_7311823_-,NA|175aa|down_9|NC_010162.1_7311803_7312328_-	NA|280aa|up_9|NC_010162.1_7287373_7288213_-	cd07492, Peptidases_S8_8, Peptidase S8 family domain, uncharacterized subfamily 8	NA|317aa|up_8|NC_010162.1_7288209_7289160_-	NA	NA|202aa|up_7|NC_010162.1_7289656_7290262_+	NA	NA|450aa|up_6|NC_010162.1_7291076_7292426_+	pfam14518, Haem_oxygenas_2, Iron-containing redox enzyme	NA|103aa|up_5|NC_010162.1_7292422_7292731_+	cd03467, Rieske, Rieske domain; a [2Fe-2S] cluster binding domain commonly found in Rieske non-heme iron oxygenase (RO) systems such as naphthalene and biphenyl dioxygenases, as well as in plant/cyanobacterial chloroplast b6f and mitochondrial cytochrome bc(1) complexes	NA|422aa|up_4|NC_010162.1_7292727_7293993_+	PRK12767, PRK12767, carbamoyl phosphate synthase-like protein; Provisional	NA|418aa|up_3|NC_010162.1_7293989_7295243_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|414aa|up_2|NC_010162.1_7295239_7296481_+	PRK02186, PRK02186, argininosuccinate lyase; Provisional	NA|104aa|up_1|NC_010162.1_7296564_7296876_-	cd02226, cupin_YdbB-like, Bacillus subtilis YdbB and related proteins, cupin domain	NA|661aa|up_0|NC_010162.1_7297132_7299115_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|501aa|down_0|NC_010162.1_7300162_7301665_-	PRK08187, PRK08187, pyruvate kinase; Validated	NA|823aa|down_1|NC_010162.1_7301898_7304367_-	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|126aa|down_2|NC_010162.1_7304387_7304765_-	COG5439, COG5439, Uncharacterized conserved protein [Function unknown]	NA|175aa|down_3|NC_010162.1_7304745_7305270_-	NA	NA|830aa|down_4|NC_010162.1_7305443_7307933_-	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|126aa|down_5|NC_010162.1_7307950_7308328_-	NA	NA|164aa|down_6|NC_010162.1_7308308_7308800_-	NA	NA|828aa|down_7|NC_010162.1_7308941_7311425_-	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|126aa|down_8|NC_010162.1_7311445_7311823_-	NA	NA|175aa|down_9|NC_010162.1_7311803_7312328_-	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	23	7332113-7334595	5,21,5,6	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	WYL	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36,36	0	0	NA	NA	NA:NA:NA:NA	31,33,33,31	33	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|126aa|up_9|NC_010162.1_7311445_7311823_-,NA|175aa|up_8|NC_010162.1_7311803_7312328_-,NA|107aa|up_2|NC_010162.1_7327057_7327378_+,NA|136aa|down_1|NC_010162.1_7337862_7338270_+,NA|373aa|down_4|NC_010162.1_7341723_7342842_+,NA|212aa|down_8|NC_010162.1_7346231_7346867_-,NA|105aa|down_9|NC_010162.1_7351031_7351346_+	NA|126aa|up_9|NC_010162.1_7311445_7311823_-	NA	NA|175aa|up_8|NC_010162.1_7311803_7312328_-	NA	NA|1528aa|up_7|NC_010162.1_7312580_7317164_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|194aa|up_6|NC_010162.1_7317290_7317872_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1712aa|up_5|NC_010162.1_7318205_7323341_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|519aa|up_4|NC_010162.1_7323441_7324998_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|514aa|up_3|NC_010162.1_7325051_7326593_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|107aa|up_2|NC_010162.1_7327057_7327378_+	NA	NA|350aa|up_1|NC_010162.1_7327461_7328511_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|818aa|up_0|NC_010162.1_7328622_7331076_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|268aa|down_0|NC_010162.1_7334623_7335426_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|136aa|down_1|NC_010162.1_7337862_7338270_+	NA	NA|490aa|down_2|NC_010162.1_7338941_7340411_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|350aa|down_3|NC_010162.1_7340494_7341544_+	PRK14965, PRK14965, DNA polymerase III subunits gamma and tau; Provisional	NA|373aa|down_4|NC_010162.1_7341723_7342842_+	NA	NA|454aa|down_5|NC_010162.1_7343078_7344440_+	cd01830, XynE_like, SGNH_hydrolase subfamily, similar to the putative arylesterase/acylhydrolase from the rumen anaerobe Prevotella bryantii XynE	NA|152aa|down_6|NC_010162.1_7344653_7345109_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	WYL|325aa|down_7|NC_010162.1_7345243_7346218_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|212aa|down_8|NC_010162.1_7346231_7346867_-	NA	NA|105aa|down_9|NC_010162.1_7351031_7351346_+	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	24	7335521-7337235	7,22,6	PILER-CR,CRISPRCasFinder,CRT	no	WYL	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	23,23,23	23	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|175aa|up_9|NC_010162.1_7311803_7312328_-,NA|107aa|up_3|NC_010162.1_7327057_7327378_+,NA|136aa|down_0|NC_010162.1_7337862_7338270_+,NA|373aa|down_3|NC_010162.1_7341723_7342842_+,NA|212aa|down_7|NC_010162.1_7346231_7346867_-,NA|105aa|down_8|NC_010162.1_7351031_7351346_+	NA|175aa|up_9|NC_010162.1_7311803_7312328_-	NA	NA|1528aa|up_8|NC_010162.1_7312580_7317164_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|194aa|up_7|NC_010162.1_7317290_7317872_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1712aa|up_6|NC_010162.1_7318205_7323341_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|519aa|up_5|NC_010162.1_7323441_7324998_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|514aa|up_4|NC_010162.1_7325051_7326593_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|107aa|up_3|NC_010162.1_7327057_7327378_+	NA	NA|350aa|up_2|NC_010162.1_7327461_7328511_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|818aa|up_1|NC_010162.1_7328622_7331076_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|268aa|up_0|NC_010162.1_7334623_7335426_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|136aa|down_0|NC_010162.1_7337862_7338270_+	NA	NA|490aa|down_1|NC_010162.1_7338941_7340411_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|350aa|down_2|NC_010162.1_7340494_7341544_+	PRK14965, PRK14965, DNA polymerase III subunits gamma and tau; Provisional	NA|373aa|down_3|NC_010162.1_7341723_7342842_+	NA	NA|454aa|down_4|NC_010162.1_7343078_7344440_+	cd01830, XynE_like, SGNH_hydrolase subfamily, similar to the putative arylesterase/acylhydrolase from the rumen anaerobe Prevotella bryantii XynE	NA|152aa|down_5|NC_010162.1_7344653_7345109_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	WYL|325aa|down_6|NC_010162.1_7345243_7346218_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|212aa|down_7|NC_010162.1_7346231_7346867_-	NA	NA|105aa|down_8|NC_010162.1_7351031_7351346_+	NA	NA|752aa|down_9|NC_010162.1_7352757_7355013_+	PHA03378, PHA03378, EBNA-3B; Provisional
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	25	7374617-7374970	8	PILER-CR	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GCACCCGCCGTAGCGGCCGTCGTTCACGCCGTCGTCGCACTCCTCGCC	48	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|187aa|up_7|NC_010162.1_7363261_7363822_-,NA|298aa|up_2|NC_010162.1_7369496_7370390_+,NA|304aa|down_9|NC_010162.1_7393921_7394833_+	NA|297aa|up_9|NC_010162.1_7361257_7362148_-	PRK12472, PRK12472, hypothetical protein; Provisional	NA|301aa|up_8|NC_010162.1_7362112_7363015_+	pfam04972, BON, BON domain	NA|187aa|up_7|NC_010162.1_7363261_7363822_-	NA	NA|424aa|up_6|NC_010162.1_7364109_7365381_+	pfam04055, Radical_SAM, Radical SAM superfamily	NA|322aa|up_5|NC_010162.1_7365377_7366343_+	cd00609, AAT_like, Aspartate aminotransferase family	NA|349aa|up_4|NC_010162.1_7366366_7367413_-	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO	NA|491aa|up_3|NC_010162.1_7367454_7368927_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|298aa|up_2|NC_010162.1_7369496_7370390_+	NA	NA|738aa|up_1|NC_010162.1_7370573_7372787_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|464aa|up_0|NC_010162.1_7372881_7374273_-	pfam01082, Cu2_monooxygen, Copper type II ascorbate-dependent monooxygenase, N-terminal domain	NA|401aa|down_0|NC_010162.1_7383755_7384958_+	smart00897, FIST, FIST N domain	NA|478aa|down_1|NC_010162.1_7385521_7386955_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|321aa|down_2|NC_010162.1_7387029_7387992_-	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|397aa|down_3|NC_010162.1_7388115_7389306_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|149aa|down_4|NC_010162.1_7389389_7389836_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|141aa|down_5|NC_010162.1_7389953_7390376_-	cd02208, cupin_RmlC-like, RmlC-like cupin superfamily	NA|346aa|down_6|NC_010162.1_7390606_7391644_+	TIGR04470, hypothetical_protein_ALIPUT_00462, radical SAM mobile pair protein B	NA|125aa|down_7|NC_010162.1_7391989_7392364_-	PRK10767, PRK10767, chaperone protein DnaJ; Provisional	NA|162aa|down_8|NC_010162.1_7392950_7393436_+	pfam08818, DUF1801, Domain of unknown function (DU1801)	NA|304aa|down_9|NC_010162.1_7393921_7394833_+	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	26	7780227-7780388	23	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGCTCGCGGCGCTGCCGCCGCCGCCTCCGGAGCCGCCAGCGCCGGCGCTGCCGC	54	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|141aa|up_5|NC_010162.1_7773110_7773533_-,NA|208aa|up_3|NC_010162.1_7774450_7775074_-,NA|81aa|down_1|NC_010162.1_7781383_7781626_-,NA|595aa|down_2|NC_010162.1_7781595_7783380_-,NA|211aa|down_6|NC_010162.1_7788259_7788892_-,NA|271aa|down_8|NC_010162.1_7790463_7791276_-	NA|445aa|up_9|NC_010162.1_7766057_7767392_+	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|378aa|up_8|NC_010162.1_7767929_7769063_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|797aa|up_7|NC_010162.1_7769097_7771488_-	pfam07090, GATase1_like, Putative glutamine amidotransferase	NA|447aa|up_6|NC_010162.1_7771735_7773076_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|141aa|up_5|NC_010162.1_7773110_7773533_-	NA	NA|268aa|up_4|NC_010162.1_7773539_7774343_-	pfam13709, DUF4159, Domain of unknown function (DUF4159)	NA|208aa|up_3|NC_010162.1_7774450_7775074_-	NA	NA|487aa|up_2|NC_010162.1_7775289_7776750_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|441aa|up_1|NC_010162.1_7776746_7778069_+	cd17262, RMtype1_S_Aco12261I-TRD2-CR2, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Aminobacterium colombiense DSM 12261 S subunit (S	NA|477aa|up_0|NC_010162.1_7778299_7779730_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|178aa|down_0|NC_010162.1_7780822_7781356_+	PHA00370, III, attachment protein	NA|81aa|down_1|NC_010162.1_7781383_7781626_-	NA	NA|595aa|down_2|NC_010162.1_7781595_7783380_-	NA	NA|486aa|down_3|NC_010162.1_7783481_7784939_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|460aa|down_4|NC_010162.1_7785006_7786386_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|529aa|down_5|NC_010162.1_7786532_7788119_-	cd17919, DEXHc_Snf, DEXH/Q-box helicase domain of DEAD-like helicase Snf family proteins	NA|211aa|down_6|NC_010162.1_7788259_7788892_-	NA	NA|457aa|down_7|NC_010162.1_7789011_7790382_+	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|271aa|down_8|NC_010162.1_7790463_7791276_-	NA	NA|1331aa|down_9|NC_010162.1_7791536_7795529_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	27	7922463-7922537	24	CRISPRCasFinder	no	csa3	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Type I-A	CTGCAAGTCAGGCCCACGGACTGC	24	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA|809aa|down_2|NC_010162.1_7926602_7929029_-	NA|374aa|up_9|NC_010162.1_7911181_7912303_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|375aa|up_8|NC_010162.1_7912354_7913479_+	pfam01384, PHO4, Phosphate transporter family	NA|467aa|up_7|NC_010162.1_7913545_7914946_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|390aa|up_6|NC_010162.1_7914985_7916155_+	PRK11873, arsM, arsenite methyltransferase	NA|216aa|up_5|NC_010162.1_7916269_7916917_+	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|385aa|up_4|NC_010162.1_7917069_7918224_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|268aa|up_3|NC_010162.1_7918438_7919242_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|177aa|up_2|NC_010162.1_7919532_7920063_+	COG1522, Lrp, Transcriptional regulators [Transcription]	NA|264aa|up_1|NC_010162.1_7920269_7921061_+	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]	NA|92aa|up_0|NC_010162.1_7921277_7921553_-	cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN binding domain	NA|137aa|down_0|NC_010162.1_7923998_7924409_+	PLN00413, PLN00413, triacylglycerol lipase	NA|632aa|down_1|NC_010162.1_7924628_7926524_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|809aa|down_2|NC_010162.1_7926602_7929029_-	NA	NA|540aa|down_3|NC_010162.1_7929718_7931338_-	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|216aa|down_4|NC_010162.1_7931419_7932067_-	pfam02663, FmdE, FmdE, Molybdenum formylmethanofuran dehydrogenase operon	NA|535aa|down_5|NC_010162.1_7932375_7933980_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|328aa|down_6|NC_010162.1_7933990_7934974_+	sd00006, TPR, Tetratricopeptide repeat	NA|527aa|down_7|NC_010162.1_7935045_7936626_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|561aa|down_8|NC_010162.1_7936874_7938557_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|258aa|down_9|NC_010162.1_7938697_7939471_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	28	8022139-8022790	9,25,7	PILER-CR,CRISPRCasFinder,CRT	no	cas6	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Unclear	GTTTCAATGCCCTTTGAGCAGGCATGTCCCGTTCGGG,GTTTCAATGCCCTTTGAGCAGGCATGTCCCGTTCGGG,GTTTCAATGCCCTTTGAGCAGGCATGTCCCGTTCGGG	37,37,37	0	0	NA	NA	NA:NA:NA	8,8,8	8	Unclear	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|207aa|up_8|NC_010162.1_8009259_8009880_-,NA|71aa|up_3|NC_010162.1_8017551_8017764_-,NA|281aa|up_2|NC_010162.1_8018301_8019144_+,NA|363aa|up_1|NC_010162.1_8019351_8020440_-,NA|149aa|down_0|NC_010162.1_8023281_8023728_+,NA|464aa|down_2|NC_010162.1_8025213_8026605_+,NA|109aa|down_7|NC_010162.1_8031854_8032181_-	NA|866aa|up_9|NC_010162.1_8006651_8009249_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|207aa|up_8|NC_010162.1_8009259_8009880_-	NA	NA|323aa|up_7|NC_010162.1_8010123_8011092_-	pfam09983, DUF2220, Uncharacterized protein conserved in bacteria C-term(DUF2220)	NA|1474aa|up_6|NC_010162.1_8011091_8015513_-	PRK04863, mukB, chromosome partition protein MukB	NA|230aa|up_5|NC_010162.1_8015509_8016199_-	PRK05256, PRK05256, chromosome partition protein MukE	NA|442aa|up_4|NC_010162.1_8016195_8017521_-	PRK05260, PRK05260, chromosome partition protein MukF	NA|71aa|up_3|NC_010162.1_8017551_8017764_-	NA	NA|281aa|up_2|NC_010162.1_8018301_8019144_+	NA	NA|363aa|up_1|NC_010162.1_8019351_8020440_-	NA	NA|329aa|up_0|NC_010162.1_8021048_8022035_+	pfam04986, Y2_Tnp, Putative transposase	NA|149aa|down_0|NC_010162.1_8023281_8023728_+	NA	cas6|305aa|down_1|NC_010162.1_8023937_8024852_+	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	NA|464aa|down_2|NC_010162.1_8025213_8026605_+	NA	NA|591aa|down_3|NC_010162.1_8026651_8028424_-	pfam08757, CotH, CotH kinase protein	NA|282aa|down_4|NC_010162.1_8028873_8029719_-	pfam15428, Imm26, Immunity protein 26	NA|365aa|down_5|NC_010162.1_8029856_8030951_-	pfam14281, PDDEXK_4, PD-(D/E)XK nuclease superfamily	NA|250aa|down_6|NC_010162.1_8031026_8031776_-	COG3183, COG3183, Predicted restriction endonuclease [Defense mechanisms]	NA|109aa|down_7|NC_010162.1_8031854_8032181_-	NA	NA|499aa|down_8|NC_010162.1_8032435_8033932_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|381aa|down_9|NC_010162.1_8033940_8035083_-	pfam13435, Cytochrome_C554, Cytochrome c554 and c-prime
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	29	8832398-8832508	26	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CCGAGGCGCGGGCCGCGGCCACGGCGC	27	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|160aa|up_3|NC_010162.1_8825919_8826399_+,NA|217aa|up_0|NC_010162.1_8830101_8830752_-,NA|226aa|down_1|NC_010162.1_8833622_8834300_+,NA|124aa|down_4|NC_010162.1_8837089_8837461_+,NA|265aa|down_5|NC_010162.1_8838398_8839193_+	NA|391aa|up_9|NC_010162.1_8816428_8817601_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|549aa|up_8|NC_010162.1_8817752_8819399_+	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|340aa|up_7|NC_010162.1_8819785_8820805_-	pfam11199, DUF2891, Protein of unknown function (DUF2891)	NA|319aa|up_6|NC_010162.1_8820817_8821774_-	pfam06166, DUF979, Protein of unknown function (DUF979)	NA|231aa|up_5|NC_010162.1_8821770_8822463_-	pfam06149, DUF969, Protein of unknown function (DUF969)	NA|146aa|up_4|NC_010162.1_8822668_8823106_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|160aa|up_3|NC_010162.1_8825919_8826399_+	NA	NA|757aa|up_2|NC_010162.1_8827373_8829644_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|153aa|up_1|NC_010162.1_8829646_8830105_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|217aa|up_0|NC_010162.1_8830101_8830752_-	NA	NA|304aa|down_0|NC_010162.1_8832714_8833626_+	pfam08308, PEGA, PEGA domain	NA|226aa|down_1|NC_010162.1_8833622_8834300_+	NA	NA|315aa|down_2|NC_010162.1_8834424_8835369_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|200aa|down_3|NC_010162.1_8836161_8836761_+	pfam04986, Y2_Tnp, Putative transposase	NA|124aa|down_4|NC_010162.1_8837089_8837461_+	NA	NA|265aa|down_5|NC_010162.1_8838398_8839193_+	NA	NA|144aa|down_6|NC_010162.1_8839460_8839892_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|363aa|down_7|NC_010162.1_8840107_8841196_+	pfam08308, PEGA, PEGA domain	NA|471aa|down_8|NC_010162.1_8841393_8842806_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|520aa|down_9|NC_010162.1_8842844_8844404_+	PTZ00146, PTZ00146, fibrillarin; Provisional
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	30	9269863-9269963	27	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GTTGCCGCAGCACGAGCCGCAGTCGGTCG	29	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|613aa|up_1|NC_010162.1_9266358_9268197_+,NA|62aa|down_0|NC_010162.1_9271140_9271326_+,NA|663aa|down_1|NC_010162.1_9271332_9273321_-,NA|275aa|down_5|NC_010162.1_9278859_9279684_-,NA|199aa|down_8|NC_010162.1_9282623_9283220_+	NA|379aa|up_9|NC_010162.1_9255209_9256346_+	pfam13205, Big_5, Bacterial Ig-like domain	NA|638aa|up_8|NC_010162.1_9256356_9258270_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|452aa|up_7|NC_010162.1_9258250_9259606_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|251aa|up_6|NC_010162.1_9259901_9260654_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|319aa|up_5|NC_010162.1_9260885_9261842_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|180aa|up_4|NC_010162.1_9261838_9262378_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|356aa|up_3|NC_010162.1_9262500_9263568_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|643aa|up_2|NC_010162.1_9264328_9266257_+	sd00002, TSP3, Calcium-binding Thrombospondin type 3 (TSP3) repeat	NA|613aa|up_1|NC_010162.1_9266358_9268197_+	NA	NA|281aa|up_0|NC_010162.1_9268358_9269201_-	TIGR01250, Proline_iminopeptidase, proline-specific peptidase, Bacillus coagulans-type subfamily	NA|62aa|down_0|NC_010162.1_9271140_9271326_+	NA	NA|663aa|down_1|NC_010162.1_9271332_9273321_-	NA	NA|945aa|down_2|NC_010162.1_9273670_9276505_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|221aa|down_3|NC_010162.1_9277706_9278369_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|141aa|down_4|NC_010162.1_9278431_9278854_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|275aa|down_5|NC_010162.1_9278859_9279684_-	NA	NA|684aa|down_6|NC_010162.1_9279674_9281726_-	cd11375, Peptidase_M54, Peptidase family M54, also called archaemetzincins or archaelysins	NA|218aa|down_7|NC_010162.1_9281739_9282393_-	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|199aa|down_8|NC_010162.1_9282623_9283220_+	NA	NA|245aa|down_9|NC_010162.1_9283280_9284015_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	31	9339124-9339213	28	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CCGGCCTGTCCGGCGCGGCGCTCCGC	26	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|59aa|up_8|NC_010162.1_9326413_9326590_+,NA|136aa|up_7|NC_010162.1_9326586_9326994_+,NA|113aa|up_3|NC_010162.1_9334961_9335300_+,NA|143aa|down_3|NC_010162.1_9345313_9345742_+	NA|1259aa|up_9|NC_010162.1_9322621_9326398_+	pfam12770, CHAT, CHAT domain	NA|59aa|up_8|NC_010162.1_9326413_9326590_+	NA	NA|136aa|up_7|NC_010162.1_9326586_9326994_+	NA	NA|1138aa|up_6|NC_010162.1_9327015_9330429_+	pfam12770, CHAT, CHAT domain	NA|1180aa|up_5|NC_010162.1_9330537_9334077_+	pfam12770, CHAT, CHAT domain	NA|157aa|up_4|NC_010162.1_9334437_9334908_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|113aa|up_3|NC_010162.1_9334961_9335300_+	NA	NA|141aa|up_2|NC_010162.1_9335293_9335716_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|555aa|up_1|NC_010162.1_9335712_9337377_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|299aa|up_0|NC_010162.1_9337860_9338757_-	PRK14971, PRK14971, DNA polymerase III subunit gamma/tau	NA|371aa|down_0|NC_010162.1_9340189_9341302_+	pfam08308, PEGA, PEGA domain	NA|549aa|down_1|NC_010162.1_9341317_9342964_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|513aa|down_2|NC_010162.1_9343001_9344540_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|143aa|down_3|NC_010162.1_9345313_9345742_+	NA	NA|476aa|down_4|NC_010162.1_9346136_9347564_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|296aa|down_5|NC_010162.1_9347695_9348583_-	pfam13200, DUF4015, Putative glycosyl hydrolase domain	NA|389aa|down_6|NC_010162.1_9348798_9349965_-	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|389aa|down_7|NC_010162.1_9350015_9351182_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|233aa|down_8|NC_010162.1_9351178_9351877_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|410aa|down_9|NC_010162.1_9351873_9353103_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	32	9949524-9949691	8	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CAACCTCAACCTGGGTCGG	19	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|100aa|up_1|NC_010162.1_9948249_9948549_-,NA|100aa|down_6|NC_010162.1_9956522_9956822_-,NA|52aa|down_7|NC_010162.1_9956964_9957120_-,NA|515aa|down_8|NC_010162.1_9957208_9958753_+	NA|1158aa|up_9|NC_010162.1_9935039_9938513_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|611aa|up_8|NC_010162.1_9938956_9940789_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|94aa|up_7|NC_010162.1_9941040_9941322_-	pfam13031, DUF3892, Protein of unknown function (DUF3892)	NA|292aa|up_6|NC_010162.1_9941459_9942335_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|271aa|up_5|NC_010162.1_9942354_9943167_-	TIGR02984, Sig-70_plancto1, RNA polymerase sigma-70 factor, Planctomycetaceae-specific subfamily 1	NA|291aa|up_4|NC_010162.1_9943647_9944520_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|917aa|up_3|NC_010162.1_9944610_9947361_-	pfam02254, TrkA_N, TrkA-N domain	NA|247aa|up_2|NC_010162.1_9947476_9948217_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|100aa|up_1|NC_010162.1_9948249_9948549_-	NA	NA|215aa|up_0|NC_010162.1_9948787_9949432_-	pfam10025, DUF2267, Uncharacterized conserved protein (DUF2267)	NA|378aa|down_0|NC_010162.1_9949696_9950830_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|304aa|down_1|NC_010162.1_9951016_9951928_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|298aa|down_2|NC_010162.1_9952027_9952921_+	cd05269, TMR_SDR_a, triphenylmethane reductase (TMR)-like proteins, NMRa-like, atypical (a) SDRs	NA|276aa|down_3|NC_010162.1_9952921_9953749_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|464aa|down_4|NC_010162.1_9953876_9955268_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|329aa|down_5|NC_010162.1_9955264_9956251_-	TIGR03882, hypothetical_protein, bacteriocin biosynthesis cyclodehydratase domain	NA|100aa|down_6|NC_010162.1_9956522_9956822_-	NA	NA|52aa|down_7|NC_010162.1_9956964_9957120_-	NA	NA|515aa|down_8|NC_010162.1_9957208_9958753_+	NA	NA|307aa|down_9|NC_010162.1_9959212_9960133_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	33	10031840-10031907	29	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GCGGCGGAGCCCGCGAAGGCGCC	23	1	1	10031863-10031884	NC_010162.1_10031896-10031917	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA|181aa|down_1|NC_010162.1_10033381_10033924_-,NA|352aa|down_2|NC_010162.1_10034092_10035148_-,NA|104aa|down_6|NC_010162.1_10037326_10037638_-,NA|80aa|down_9|NC_010162.1_10042829_10043069_-	NA|207aa|up_9|NC_010162.1_10013534_10014155_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|691aa|up_8|NC_010162.1_10014186_10016259_-	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|649aa|up_7|NC_010162.1_10016248_10018195_-	cd11325, AmyAc_GTHase, Alpha amylase catalytic domain found in Glycosyltrehalose trehalohydrolase (also called Maltooligosyl trehalose Trehalohydrolase)	NA|762aa|up_6|NC_010162.1_10018540_10020826_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|1011aa|up_5|NC_010162.1_10020822_10023855_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|390aa|up_4|NC_010162.1_10023851_10025021_-	pfam00529, HlyD, HlyD membrane-fusion protein of T1SS	NA|74aa|up_3|NC_010162.1_10025189_10025411_+	pfam12559, Inhibitor_I10, Serine endopeptidase inhibitors	NA|79aa|up_2|NC_010162.1_10025799_10026036_+	pfam12559, Inhibitor_I10, Serine endopeptidase inhibitors	NA|333aa|up_1|NC_010162.1_10026081_10027080_+	TIGR04185, RimK-like_ATP-grasp_domain_protein, ATP-grasp ribosomal peptide maturase, MvdC family	NA|342aa|up_0|NC_010162.1_10027076_10028102_+	TIGR04184, hypothetical_protein_HMPREF0204_12500, ATP-grasp ribosomal peptide maturase, MvdD family	NA|363aa|down_0|NC_010162.1_10032259_10033348_+	COG4637, COG4637, Predicted ATPase [General function prediction only]	NA|181aa|down_1|NC_010162.1_10033381_10033924_-	NA	NA|352aa|down_2|NC_010162.1_10034092_10035148_-	NA	NA|130aa|down_3|NC_010162.1_10035560_10035950_-	COG2149, COG2149, Predicted membrane protein [Function unknown]	NA|240aa|down_4|NC_010162.1_10035985_10036705_-	cd02109, arch_bact_SO_family_Moco, bacterial and archael members of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|127aa|down_5|NC_010162.1_10036748_10037129_-	pfam02152, FolB, Dihydroneopterin aldolase	NA|104aa|down_6|NC_010162.1_10037326_10037638_-	NA	NA|247aa|down_7|NC_010162.1_10037741_10038482_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|1358aa|down_8|NC_010162.1_10038576_10042650_-	sd00008, TPR_YbbN, C-terminal Tetratricopeptide repeat (TPR) region of YbbN and similar motifs	NA|80aa|down_9|NC_010162.1_10042829_10043069_-	NA
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	34	10358743-10358806	30	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	ACGCAAGAGCACGCGGGGATACGC	24	1	1	10358767-10358782	NC_010162.1_10358797-10358812	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|278aa|up_9|NC_010162.1_10343062_10343896_+,NA|103aa|down_1|NC_010162.1_10360474_10360783_-,NA|186aa|down_2|NC_010162.1_10360902_10361460_-	NA|278aa|up_9|NC_010162.1_10343062_10343896_+	NA	NA|320aa|up_8|NC_010162.1_10343917_10344877_-	PRK09375, PRK09375, quinolinate synthase NadA	NA|113aa|up_7|NC_010162.1_10345115_10345454_+	pfam12321, DUF3634, Protein of unknown function (DUF3634)	NA|464aa|up_6|NC_010162.1_10345494_10346886_+	cd13970, ABC1_ADCK3, Activator of bc1 complex (ABC1) kinases, also called aarF domain containing kinase 3	NA|722aa|up_5|NC_010162.1_10346994_10349160_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|611aa|up_4|NC_010162.1_10349227_10351060_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|649aa|up_3|NC_010162.1_10351079_10353026_-	TIGR02063, Ribonuclease_R, ribonuclease R	NA|164aa|up_2|NC_010162.1_10353364_10353856_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|619aa|up_1|NC_010162.1_10353875_10355732_-	sd00006, TPR, Tetratricopeptide repeat	NA|935aa|up_0|NC_010162.1_10355737_10358542_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|466aa|down_0|NC_010162.1_10359022_10360420_+	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|103aa|down_1|NC_010162.1_10360474_10360783_-	NA	NA|186aa|down_2|NC_010162.1_10360902_10361460_-	NA	NA|444aa|down_3|NC_010162.1_10361622_10362954_-	pfam11583, AurF, P-aminobenzoate N-oxygenase AurF	NA|154aa|down_4|NC_010162.1_10363249_10363711_+	cd09873, PIN_Pae0151-like, VapC-like PIN domain of the Pyrobaculum aerophilum Pae0151 and Pae2754 proteins and homologs	NA|427aa|down_5|NC_010162.1_10363956_10365237_+	pfam11617, Cu-binding_MopE, Putative metal-binding motif	NA|318aa|down_6|NC_010162.1_10366443_10367397_-	cd01144, BtuF, Cobalamin binding protein BtuF	NA|256aa|down_7|NC_010162.1_10367469_10368237_+	PRK05557, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated	NA|121aa|down_8|NC_010162.1_10368387_10368750_+	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|253aa|down_9|NC_010162.1_10368928_10369687_+	PRK00481, PRK00481, NAD-dependent deacetylase; Provisional
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	35	10625849-10625965	31	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGCGCCGGCCGCGCAGGCCGCGCC	24	1	4	10625921-10625941|10625921-10625941|10625921-10625941|10625921-10625941	NC_010162.1_10625948-10625968|NC_010162.1_5531357-5531337|NC_010162.1_7932293-7932313|NC_010162.1_8517584-8517564	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|210aa|up_5|NC_010162.1_10617491_10618121_-,NA	NA|313aa|up_9|NC_010162.1_10610729_10611668_-	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|650aa|up_8|NC_010162.1_10612177_10614127_+	COG1480, COG1480, Predicted membrane-associated HD superfamily hydrolase [General function prediction only]	NA|576aa|up_7|NC_010162.1_10614192_10615920_+	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|404aa|up_6|NC_010162.1_10616210_10617422_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|210aa|up_5|NC_010162.1_10617491_10618121_-	NA	NA|367aa|up_4|NC_010162.1_10618310_10619411_-	pfam04015, DUF362, Domain of unknown function (DUF362)	NA|71aa|up_3|NC_010162.1_10620165_10620378_+	TIGR01259, ComE_operon_protein_1, comEA protein	NA|496aa|up_2|NC_010162.1_10620644_10622132_+	COG0773, MurC, UDP-N-acetylmuramate-alanine ligase [Cell envelope biogenesis, outer membrane]	NA|291aa|up_1|NC_010162.1_10622128_10623001_+	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|281aa|up_0|NC_010162.1_10623548_10624391_+	cd04221, MauL, Methylamine utilization protein MauL	NA|509aa|down_0|NC_010162.1_10626300_10627827_+	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|320aa|down_1|NC_010162.1_10627997_10628957_+	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|216aa|down_2|NC_010162.1_10629423_10630071_+	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|234aa|down_3|NC_010162.1_10630072_10630774_+	pfam09858, DUF2085, Predicted membrane protein (DUF2085)	NA|260aa|down_4|NC_010162.1_10632459_10633239_-	pfam02472, ExbD, Biopolymer transport protein ExbD/TolR	NA|543aa|down_5|NC_010162.1_10633301_10634930_+	cd05941, MCS, Malonyl-CoA synthetase (MCS)	NA|202aa|down_6|NC_010162.1_10635003_10635609_+	COG4894, COG4894, Uncharacterized conserved protein [Function unknown]	NA|666aa|down_7|NC_010162.1_10635746_10637744_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|306aa|down_8|NC_010162.1_10637911_10638829_-	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|549aa|down_9|NC_010162.1_10638934_10640581_-	pfam13435, Cytochrome_C554, Cytochrome c554 and c-prime
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	36	10760793-10760906	32	CRISPRCasFinder	no	csa3	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Type I-A	CATGTCGGACATGGGCCCGCGCCTGGGC	28	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|94aa|up_6|NC_010162.1_10754663_10754945_+,NA|51aa|up_5|NC_010162.1_10755624_10755777_-,NA|342aa|up_4|NC_010162.1_10756541_10757567_+,NA|313aa|up_3|NC_010162.1_10757573_10758512_-,NA|275aa|up_2|NC_010162.1_10758508_10759333_-,NA|255aa|down_6|NC_010162.1_10773366_10774131_+,NA|72aa|down_7|NC_010162.1_10775195_10775411_-	NA|779aa|up_9|NC_010162.1_10747883_10750220_-	cd18805, SF2_C_suv3, C-terminal helicase domain of ATP-dependent RNA helicase	NA|580aa|up_8|NC_010162.1_10752156_10753896_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|232aa|up_7|NC_010162.1_10753892_10754588_-	cd02136, PnbA_NfnB-like, nitroreductase similar to Mycobacterium smegmatis NfnB	NA|94aa|up_6|NC_010162.1_10754663_10754945_+	NA	NA|51aa|up_5|NC_010162.1_10755624_10755777_-	NA	NA|342aa|up_4|NC_010162.1_10756541_10757567_+	NA	NA|313aa|up_3|NC_010162.1_10757573_10758512_-	NA	NA|275aa|up_2|NC_010162.1_10758508_10759333_-	NA	NA|187aa|up_1|NC_010162.1_10759724_10760285_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|112aa|up_0|NC_010162.1_10760281_10760617_-	TIGR02607, Virulence-associated_protein_I, addiction module antidote protein, HigA family	NA|303aa|down_0|NC_010162.1_10761154_10762063_-	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|779aa|down_1|NC_010162.1_10762370_10764707_-	sd00006, TPR, Tetratricopeptide repeat	NA|714aa|down_2|NC_010162.1_10764787_10766929_-	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	csa3|318aa|down_3|NC_010162.1_10767494_10768448_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|1282aa|down_4|NC_010162.1_10768450_10772296_+	PRK09490, metH, B12-dependent methionine synthase; Provisional	NA|211aa|down_5|NC_010162.1_10772313_10772946_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|255aa|down_6|NC_010162.1_10773366_10774131_+	NA	NA|72aa|down_7|NC_010162.1_10775195_10775411_-	NA	NA|262aa|down_8|NC_010162.1_10775514_10776300_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|900aa|down_9|NC_010162.1_10776513_10779213_-	cd05819, NHL, NHL repeat unit of beta-propeller proteins
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	37	11265300-11265406	33	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GCCTTCGGCGGGCTCGGCCCCCTG	24	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|67aa|up_9|NC_010162.1_11254791_11254992_-,NA|242aa|up_8|NC_010162.1_11255354_11256080_+,NA|94aa|up_7|NC_010162.1_11256189_11256471_+,NA|67aa|up_6|NC_010162.1_11256626_11256827_+,NA|150aa|up_4|NC_010162.1_11257954_11258404_-,NA|193aa|down_6|NC_010162.1_11272482_11273061_+	NA|67aa|up_9|NC_010162.1_11254791_11254992_-	NA	NA|242aa|up_8|NC_010162.1_11255354_11256080_+	NA	NA|94aa|up_7|NC_010162.1_11256189_11256471_+	NA	NA|67aa|up_6|NC_010162.1_11256626_11256827_+	NA	NA|308aa|up_5|NC_010162.1_11256874_11257798_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|150aa|up_4|NC_010162.1_11257954_11258404_-	NA	NA|265aa|up_3|NC_010162.1_11258703_11259498_+	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|146aa|up_2|NC_010162.1_11259618_11260056_+	pfam00403, HMA, Heavy-metal-associated domain	NA|132aa|up_1|NC_010162.1_11260359_11260755_+	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|1156aa|up_0|NC_010162.1_11260812_11264280_-	COG3696, COG3696, Putative silver efflux pump [Inorganic ion transport and metabolism]	NA|417aa|down_0|NC_010162.1_11265558_11266809_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|212aa|down_1|NC_010162.1_11267067_11267703_-	COG3544, COG3544, Uncharacterized protein conserved in bacteria [Function unknown]	NA|35aa|down_2|NC_010162.1_11268222_11268327_-	pfam00126, HTH_1, Bacterial regulatory helix-turn-helix protein, lysR family	NA|468aa|down_3|NC_010162.1_11268509_11269913_-	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|159aa|down_4|NC_010162.1_11269909_11270386_-	PRK06341, PRK06341, single-stranded DNA-binding protein; Provisional	NA|503aa|down_5|NC_010162.1_11270807_11272316_+	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|193aa|down_6|NC_010162.1_11272482_11273061_+	NA	NA|214aa|down_7|NC_010162.1_11273339_11273981_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|301aa|down_8|NC_010162.1_11274011_11274914_+	COG3000, ERG3, Sterol desaturase [Lipid metabolism]	NA|826aa|down_9|NC_010162.1_11274939_11277417_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	38	11879880-11880601	9	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	GCGNCNTCGNAGCCGCCGTC	20	1	1	11880056-11880113	NC_010162.1_11879861-11879918	NA	12	12	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|113aa|up_9|NC_010162.1_11866302_11866641_+,NA|455aa|up_7|NC_010162.1_11867740_11869105_-,NA|218aa|up_4|NC_010162.1_11872098_11872752_-,NA|171aa|up_3|NC_010162.1_11872946_11873459_-,NA|270aa|up_0|NC_010162.1_11876942_11877752_+,NA|80aa|down_1|NC_010162.1_11883240_11883480_+,NA|134aa|down_2|NC_010162.1_11883682_11884084_+,NA|405aa|down_3|NC_010162.1_11884902_11886117_+,NA|211aa|down_5|NC_010162.1_11887121_11887754_-	NA|113aa|up_9|NC_010162.1_11866302_11866641_+	NA	NA|341aa|up_8|NC_010162.1_11866665_11867688_+	COG3178, COG3178, Predicted phosphotransferase related to Ser/Thr protein kinases [General function prediction only]	NA|455aa|up_7|NC_010162.1_11867740_11869105_-	NA	NA|502aa|up_6|NC_010162.1_11869395_11870901_-	COG5297, CelA, Cellobiohydrolase A (1,4-beta-cellobiosidase A) [Carbohydrate transport and metabolism]	NA|265aa|up_5|NC_010162.1_11871228_11872023_+	COG1119, ModF, ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism]	NA|218aa|up_4|NC_010162.1_11872098_11872752_-	NA	NA|171aa|up_3|NC_010162.1_11872946_11873459_-	NA	NA|542aa|up_2|NC_010162.1_11874029_11875655_-	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|241aa|up_1|NC_010162.1_11875651_11876374_-	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|270aa|up_0|NC_010162.1_11876942_11877752_+	NA	NA|349aa|down_0|NC_010162.1_11881692_11882739_-	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional	NA|80aa|down_1|NC_010162.1_11883240_11883480_+	NA	NA|134aa|down_2|NC_010162.1_11883682_11884084_+	NA	NA|405aa|down_3|NC_010162.1_11884902_11886117_+	NA	NA|208aa|down_4|NC_010162.1_11886349_11886973_-	pfam13468, Glyoxalase_3, Glyoxalase-like domain	NA|211aa|down_5|NC_010162.1_11887121_11887754_-	NA	NA|143aa|down_6|NC_010162.1_11888339_11888768_+	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)	NA|363aa|down_7|NC_010162.1_11888872_11889961_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|248aa|down_8|NC_010162.1_11889997_11890741_-	COG5662, COG5662, Predicted transmembrane transcriptional regulator (anti-sigma factor) [Transcription]	NA|195aa|down_9|NC_010162.1_11890680_11891265_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	39	11931503-11931841	10	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	CGCCGCCNGNGCCGCCNN	18	6	18	11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931521-11931538|11931602-11931619|11931692-11931709|11931728-11931745|11931764-11931787|11931806-11931823	NC_010162.1_1749219-1749202|NC_010162.1_2949706-2949689|NC_010162.1_3512055-3512072|NC_010162.1_4489382-4489399|NC_010162.1_5874954-5874971|NC_010162.1_6442582-6442565|NC_010162.1_6442630-6442613|NC_010162.1_6442678-6442661|NC_010162.1_7377199-7377216|NC_010162.1_11040857-11040840|NC_010162.1_11383146-11383129|NC_010162.1_11383200-11383183|NC_010162.1_11383236-11383219|NC_010162.1_7377154-7377171|NC_010162.1_7377154-7377171|NC_010162.1_7377154-7377171|NC_010162.1_9236407-9236384|NC_010162.1_12947966-12947983	NA	8	8	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA,NA|130aa|down_1|NC_010162.1_11935514_11935904_+,NA|169aa|down_2|NC_010162.1_11936610_11937117_-,NA|65aa|down_5|NC_010162.1_11953645_11953840_+,NA|213aa|down_7|NC_010162.1_11954953_11955592_-	NA|509aa|up_9|NC_010162.1_11911056_11912583_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|255aa|up_8|NC_010162.1_11912902_11913667_-	PRK06172, PRK06172, SDR family oxidoreductase	NA|223aa|up_7|NC_010162.1_11913689_11914358_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|413aa|up_6|NC_010162.1_11915724_11916963_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|330aa|up_5|NC_010162.1_11917025_11918015_+	cd00687, Terpene_cyclase_nonplant_C1, Non-plant Terpene Cyclases, Class 1	NA|1655aa|up_4|NC_010162.1_11918095_11923060_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|610aa|up_3|NC_010162.1_11923720_11925550_-	COG0405, Ggt, Gamma-glutamyltransferase [Amino acid transport and metabolism]	NA|847aa|up_2|NC_010162.1_11925830_11928371_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|118aa|up_1|NC_010162.1_11928630_11928984_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|642aa|up_0|NC_010162.1_11929202_11931128_+	pfam06874, FBPase_2, Firmicute fructose-1,6-bisphosphatase	NA|730aa|down_0|NC_010162.1_11932906_11935096_+	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|130aa|down_1|NC_010162.1_11935514_11935904_+	NA	NA|169aa|down_2|NC_010162.1_11936610_11937117_-	NA	NA|2408aa|down_3|NC_010162.1_11937125_11944349_-	pfam03534, SpvB, Salmonella virulence plasmid 65kDa B protein	NA|2995aa|down_4|NC_010162.1_11944351_11953336_-	pfam18276, TcA_TcB_BD, Tc toxin complex TcA C-terminal TcB-binding domain	NA|65aa|down_5|NC_010162.1_11953645_11953840_+	NA	NA|326aa|down_6|NC_010162.1_11953885_11954863_-	cd08241, QOR1, Quinone oxidoreductase (QOR)	NA|213aa|down_7|NC_010162.1_11954953_11955592_-	NA	NA|363aa|down_8|NC_010162.1_11956822_11957911_-	PRK11308, dppF, dipeptide transporter ATP-binding subunit; Provisional	NA|458aa|down_9|NC_010162.1_11957907_11959281_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	40	12526816-12526939	34	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Orphan	ACAAGAGTACGCTTCCGCGCCGAAAACGCCGA	32	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|167aa|up_8|NC_010162.1_12516677_12517178_-,NA|172aa|up_7|NC_010162.1_12517182_12517698_-,NA|241aa|up_6|NC_010162.1_12517717_12518440_-,NA|48aa|up_2|NC_010162.1_12521220_12521364_-,NA	NA|143aa|up_9|NC_010162.1_12516233_12516662_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|167aa|up_8|NC_010162.1_12516677_12517178_-	NA	NA|172aa|up_7|NC_010162.1_12517182_12517698_-	NA	NA|241aa|up_6|NC_010162.1_12517717_12518440_-	NA	NA|308aa|up_5|NC_010162.1_12518717_12519641_-	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|227aa|up_4|NC_010162.1_12519666_12520347_-	pfam14124, DUF4291, Domain of unknown function (DUF4291)	NA|139aa|up_3|NC_010162.1_12520481_12520898_+	pfam09826, Beta_propel, Beta propeller domain	NA|48aa|up_2|NC_010162.1_12521220_12521364_-	NA	NA|310aa|up_1|NC_010162.1_12521498_12522428_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|1296aa|up_0|NC_010162.1_12522872_12526760_+	TIGR02148, ORFveg106_random, fibro-slime domain	NA|652aa|down_0|NC_010162.1_12527043_12528999_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|663aa|down_1|NC_010162.1_12529263_12531252_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|333aa|down_2|NC_010162.1_12531286_12532285_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|150aa|down_3|NC_010162.1_12532356_12532806_+	cd02232, cupin_ARD, acireductone dioxygenase (ARD), cupin domain	NA|123aa|down_4|NC_010162.1_12532821_12533190_+	cd02214, cupin_MJ1618, Methanocaldococcus jannaschii MJ1618 and related proteins, cupin domain	NA|388aa|down_5|NC_010162.1_12533325_12534489_+	smart00903, Flavin_Reduct, Flavin reductase like domain	NA|304aa|down_6|NC_010162.1_12534569_12535481_-	PRK08645, PRK08645, bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein; Reviewed	NA|96aa|down_7|NC_010162.1_12535496_12535784_-	TIGR04042, conserved_hypothetical_protein, MSMEG_0570 family protein	NA|401aa|down_8|NC_010162.1_12535800_12537003_-	TIGR04047, MSMEG_0565_glyc, glycosyltransferase, MSMEG_0565 family	NA|336aa|down_9|NC_010162.1_12537026_12538034_-	TIGR04049, AIR_rel_sll0787, AIR synthase-related protein, sll0787 family
GCF_000067165.1_ASM6716v1	NC_010162	Sorangium cellulosum So ce56, complete genome	41	12787348-12793800	10,35,11	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas3,cas8b3,cas7,cas5,cas1,cas2	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	Unclear	GCGATCCCCGCCGTGATGCCGGAAGGCGTTGAGCAC,GCGATCCCCGCCGTGATGCCGGAAGGCGTTGAGCAC,GCGATCCCCGCCGTGATGCCGGAAGGCGTTGAGCAC	36,36,36	1	1	12788597-12788633	NC_010162.1_2592484-2592448	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	26,90,90	90	Unclear	cas8u1,cas3,csb2gr5,csb1gr7,cas1,cas2,cas6e,csa3,RT,DEDDh,WYL,DinG,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b3,cas7,cas5	NA|161aa|up_8|NC_010162.1_12776181_12776664_+,NA|261aa|up_7|NC_010162.1_12776903_12777686_-,NA|132aa|down_0|NC_010162.1_12794559_12794955_+,NA|282aa|down_1|NC_010162.1_12795143_12795989_+,NA|331aa|down_5|NC_010162.1_12801813_12802806_+	NA|495aa|up_9|NC_010162.1_12774549_12776034_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|161aa|up_8|NC_010162.1_12776181_12776664_+	NA	NA|261aa|up_7|NC_010162.1_12776903_12777686_-	NA	cas6|210aa|up_6|NC_010162.1_12778580_12779210_+	pfam09559, Cas6, Cas6 Crispr	cas3|824aa|up_5|NC_010162.1_12779209_12781681_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas8b3|518aa|up_4|NC_010162.1_12781665_12783219_+	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas7|340aa|up_3|NC_010162.1_12783223_12784243_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|225aa|up_2|NC_010162.1_12784239_12784914_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas1|576aa|up_1|NC_010162.1_12785066_12786794_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|99aa|up_0|NC_010162.1_12786802_12787099_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|132aa|down_0|NC_010162.1_12794559_12794955_+	NA	NA|282aa|down_1|NC_010162.1_12795143_12795989_+	NA	NA|262aa|down_2|NC_010162.1_12797987_12798773_+	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|481aa|down_3|NC_010162.1_12798871_12800314_-	cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic semialdehyde dehydrogenase-like	NA|474aa|down_4|NC_010162.1_12800395_12801817_+	TIGR00699, 4-aminobutyrate_aminotransferase, 4-aminobutyrate aminotransferase, eukaryotic type	NA|331aa|down_5|NC_010162.1_12801813_12802806_+	NA	NA|298aa|down_6|NC_010162.1_12802968_12803862_+	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|775aa|down_7|NC_010162.1_12803858_12806183_+	TIGR01701, Hypothetical_protein_Rv2900c/MT2968/Mb2924c	NA|469aa|down_8|NC_010162.1_12806217_12807624_+	TIGR00699, 4-aminobutyrate_aminotransferase, 4-aminobutyrate aminotransferase, eukaryotic type	NA|328aa|down_9|NC_010162.1_12807620_12808604_+	cd09993, HDAC_classIV, Histone deacetylase class IV also known as histone deacetylase 11
